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Foreword to the 
First English Edition 








The Mathematical Society of Japan takes 
pleasure in presenting this second edition of 
our Encyclopedic Dictionary of Mathematics 
to every researcher and user of mathematics. 
It is intended to be a compact, up-to-date 
source of information comprising, as com- 
pletely as possible, all significant results in all 
fields of our Science, pure and applied, from 
the elementary to the advanced level. The 
success of the first edition owed much to the 
kind assistance given by the American Mathe- 
matical Society. As described in the preface, 
the members of our Society have taken re- 
sponsibility for compiling this new edition. 
We hope that it will be as useful to the mathe- 
maticians of today as the first edition was 

to the mathematicians of yesterday. We also 
hope that this edition will be followed in 
years to come by subsequent ones incorporat- 
ing the future development of our Science. 


Hikosaburo Komatsu 
President 1985-1987 
Mathematical Society of Japan 


The American Mathematical Society welcomes 
the publication of the Encyclopedic Dictionary 
of Mathematics. For many years we have been 
fascinated by the publication in Japanese, 
Iwanami Sugaku Ziten, because we saw that 
this was an encyclopedia that contained effec- 
tive and penetrating information about all 

the fields of advanced mathematical research. 
We were also frustrated because we could not 
read Japanese and so we could not really 
reach out to this expert and effective source 

of information. We now welcome the fact that 
the second Japanese edition has been trans- 
lated into English and we look forward to the 
fascination which we can now have in getting 
at this rich mine of information. 


Saunders MacLane 
President 1973-1974 
American Mathematical Society 


Preface to the 
Second English Edition 


This second edition of the Encyclopedic Dic- 
tionary of Mathematics is in substance an 
English version of the third edition of Iwanami 
Sugaku Ziten (in Japanese). We shall explain 
how these two versions are related to each 
other and how the present edition differs from 
the first English edition. For the sake of sim- 
plicity we abbreviate Encyclopedic Dictionary 
of Mathematics and Iwanami Sugaku Ziten to 
EDM and ISZ, respectively, and indicate the 
numbers of editions beyond the first by Arabic 
numerals. 

The prefaces of the previous editions will 
clarify how ISZ, its augumented edition, ISZ2, 
ISZ3, and EDM came into existence in 1954, 
1960, 1968, 1985, and 1977, respectively. EDM, 
published nine years later than ISZ2, consisted 
of its English translation and some new mate- 
rials added to update its contents. In retro- 
spect, it was felt that this procedure was not 
adequate to cope with the recent rapid pro- 
gress of mathematics, and a suggestion was 
raised in the Mathematical Society of Japan 
that ISZ3 and EDM2 be produced simulta- 
neously. The favorable reaction of the mathe- 
matical public to EDM encouraged us greatly. 

In 1978 an agreement was made among the 
Society, Iwanami Shoten, Publishers, and The 
MIT Press for the publication of ISZ3 and 
EDM2. An editorial committee was estab- 
lished in the Society with the members named 
in the preface to ISZ3. Manuscripts were pre- 
pared simultaneously in Japanese and English 
by each contributor, with Yuji Ito acting as 
linguistic consultant. 

I should mention that we benefited greatly 
from the kind comments on EDM by the 
following mathematicians: J. F. Adams, M. 
Atiyah, A. Borel, H. Cartan, K. Chandrase- 
kharan, S. S. Chern, J. Dieudonné, E. Hewitt, 
F. Hirzebruch, O. Lehto, J. L. Lions, L. Markus, 
and J. P. Serre. In particular, we are deeply 
grateful to J. Dieudonné for his many detailed 
comments. 

Compared with ISZ3 and EDM, EDM2 has 
the following characteristics: 

(1) EDM2 contains many more new mathe- 
matical results than EDM. For the details we 
refer the reader to the main points of revision 
mentioned in the preface to ISZ3, because 
EDM? has the same mathematical content 
as ISZ3 and the additions made to ISZ2 and 
subsequently in EDM are of relatively minor 
importance. 

(2) The Japanese textbooks listed in the 
article references of ISZ3 are replaced in 
EDM2 by references to standard textbooks 


written in English. Such replacement was not 
done in compiling EDM. 

(3) Years of birth and death are included in 
the Name Index of EDM2 as far as possible. 
This information was given in ISZ2 but was 
removed in compiling EDM. 

(4) The Subject Index of EDM is so de- 
signed that every concept consisting of two or 
more words can be traced from each of these 
component words. (This principle was adopted 
in ISZ2 and EDM but was not rigidly fol- 
lowed in ISZ3 due to the lack of space.) 

(5) While all editions of ISZ were in one 
volume and EDM was in two volumes, EDM2 
is in four volumes: I. Forewords, prefaces, 
introduction, text A-E; II. Text F—N; III. 
Text O-Z; IV. Appendices, indexes and other 
backmatter. The Systematic List of Articles 
appears in volume IV, page 1857. 

While EDM2 is more voluminous than 
ISZ3, we hope that, being written in English 
and printed with generous margins, it can 
easily provide readers with information on 
every significant result of today's mathematics 
and so will be useful to a large number of 
mathematicians. I am responsible for any 
shortcomings that may exist despite all our 
efforts, and I would appreciate any remarks 
from the readers. I hope to have occasion to 
remedy any such shortcomings in the future. 

Finally I would like to repeat my thanks to 
all the collaborators named in my preface to 
ISZ3. Also I express my gratitude to Yuji Ito 
for his tremendous effort to polish the English 
of all manuscripts, to Shigeru litaka for his 
laborious assistance in the final stage of the 
work, to Mrs. M. Nawata for her excellent 
secretarial work, and to the staff of the De- 
partment of Mathematics at Gakushuin Uni- 
versity for providing me with an office for 
compilation for eight years. 


Kiyosi Itó 
December 1986 


Preface to the 
Third Japanese Edition 


The first edition of Iwanami Sügaku Ziten was 
published in 1954, and a revised and aug- 
mented edition appeared in 1960. Extensive 
revision of the work was carried out sub- 
sequently, and the result was published in 1968 
as the second edition, which has retained its 
valuable and useful life for the past 17 years. 
The English translation of this second edition 
published by The MIT Press has achieved 
international recognition. In the meantime, 
remarkable progress has been made in mathe- 
matical sciences: recognition of the interrela- 
tionships amongst various branches within 
mathematics has increased significantly, and 
the formulation of mathematics as a synthetic 
entity is in the making. Furthermore, advanced 
mathematical theories have been utilized fre- 
quently in the physical, biological, and social 
sciences, and expectations for mathematics 

to be the basis of all the sciences have been 
increasing. In order to cope with these devel- 
opments, it was decided that the second edition 
of Sügaku Ziten should be further revised and 
an updated version be published as the third 
edition. The English translation of this third 
edition will also be published by The MIT 
Press. 

The main points of the revision are as 
follows: 

(1) On the Size and Scope of the Encyclo- 
pedia: The total number of articles has in- 
creased only slightly from that of the second 
edition, from 436 to 450. However, in view 
of the rapid and extensive development in 
mathematics in recent years, many old articles 
have been reorganized and unified, and much 
new material has been added. Consequently, 
the mathematical content of the encyclopedia 
has increased a great deal over that of the 
second edition, and this, together with the 
authors' efforts to render the articles more 
readable than before, necessitated a 50% in- 
crease in the number of pages devoted to the 
main text. The account of computer science 
has been taken out of the subject area of 
Numerical Analysis, and a new subject area 
has been created for it; thus mathematics is 
now classified into 21 subject areas altogether 
instead of 20. 

(2) On the Arrangement of the Articles: In 
the second edition, the title of each article 
was spelled out in Romanized Japanese, and 
articles were arranged alphabetically. In the 
third edition, however, the titles of the articles 
are arranged in the order of the Japanese syl- 
labary (katakana) and each article is numbered 


accordingly. To each article a parenthetical 
number ts also attached, giving the subject 
area to which the article belongs and its place 
in that list. 

(3) On the Text: Sections within each arti- 
cle are headed by letters A, B, C,...; the ini- 
tial section of each article gives an outline 
and is devoted to an introductory and general 
account of the topic with which the article 
is concerned. In the actual exposition of 
the articles, every effort has been made to 
pay close attention to the interrelationships 
amongst various fields of mathematics. Even 
when there was no need to revise the original 
article, changes were made in the bibliography 
to list items that would be more accessible to 
the readers. 

(4) On the Appendices: The appendices have 
been revised and augmented along with the 
text. Numerical tables were deleted when- 


_ever it was felt that scientific calculators or 


microcomputers could easily reproduce their 
contents. 

(5) On the Indexes: The listing of mathe- 
matical terms in Japanese has been changed, 
as in the case of the arrangement of articles, 
from the alphabetical order to the order of 
the Japanese syllabary. However, terms in 
Japanese that start with a European word 
have been separated out, and, in order to 
facilitate finding, have been listed indepen- 
dently in an Index for Mathematical Terms in 
Japanese with European Headings. The loca- 
tion of a term listed in the indexes is indicated 
by the article number and the letter heading 
of the section in the article where the term 
appears. 


We here outline briefly how the process of 
compilation of this edition was organized. In 
the summer of 1978 the Mathematical Society 
of Japan decided, following a proposal by 
Professors Shókichi Iyanaga and Yukiyosi 
Kawada, the chief editors of the previous 
editions of Sugaku Ziten, to undertake the 
comptlation of the third edition. Following 
a resolution by the members of the Soci- 
ety, Kunthiko Kodaira, Sigeru Mizohata, 
Itiro Tamura, Nagayoshi Iwahori, Tosihusa 
Kimura, and myself have joined Iyanaga and 
Kawada to form an editorial committee and 
have formulated basic guidelines for the com- 
pilation of the third edition. The work of 
editing and compiling articles in each subject 
area was delegated to the editors designated 
for the areas, and I was asked to undertake the 
responsibility of putting the entire volume in 
order. 

The subject areas and their editors are as 
follows: 


xi 


I Logic and Foundations: Shóji Maehara 


II Sets, General Topology, and Cate- 
gories: Shóji Maehara, Hikosaburo 
Komatsu, Masayoshi Nagata 


In Algebra: Masayoshi Nagata 
IV Group Theory: Nagayoshi Iwahori 
V Number Theory: Takayoshi Mitsui 


VI Eucltdean and Projective Geometry: 
Itiro Tamura 


VII Differential Geometry: Morio Obata, 
Shigeru litaka, Itiro Tamura 


VIII Algebraic Geometry: Shigeru litaka 
IX Topology: Itiro Tamura 
X Analysis: Sigeru Mizohata, Kiyosi Itó 


XI Complex Analysis: Yukio Kusunoki, 
Shigeru litaka 


XII Functional Analysis: Hikosaburo 
Komatsu 


XIIE Differential, Integral, and Functional 
Equations: Tosihusa Kimura, Sigeru 
Mizohata 

XIV Special Functions: Sin Hitotumatu 

XV ` Numerical Analysis: Masaya Yamaguti 


XVI Computer Science and Combinatorics: 
Sin Hitotumatu 


Probability Theory: Kiyosi Itó 
XVIII Statistics: Kei Takeuchi 


XIX Mathematical Programming and 
Operations Research: Shigeru Furuya 


XX Mechanics and Theoretical Physics: 
Huzihiro Araki 


XXI History of Mathematics: Shókichi 
lyanaga 
Appendices: Sin Hitotumatu 

Before the actual process of compilation 
started, the editorial committee had met sev- 
eral times, and two meetings of all the subject- 
area editors had been called. Furthermore, 
editors for each area consulted with other 
experts in their fields as they selected the 
titles of articles to be included. In areas where 
remarkable development had taken place, such 
as differential geometry, functional analysis, 
probability theory, and applied mathemat- 
ics, extensive revision and insertion of many 
new articles were proposed by the respective 
editors. For this reason, the original articles 
were thoroughly reorganized and systematized. 
We resolved to keep the whole work in one 
volume, even though a number of new articles 
have had to be added. 

The selection of articles and of their re- 
spective authors was finally completed in the 
spring of 1980, and 197 colleagues were asked 
to write the articles. The names of these con- 
tributors and those of the previous editions are 
listed elsewhere in this volume. To all of the 
authors we express our sincere gratitude. 

Editing of the entire manuscript has been 
carried out since the summer of 1982, and we 
are deeply indebted to the following colleagues 
for their painstaking efforts in checking cross 
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references and tn proofreading: Nobuyosi Moto- 
hashi (Foundations, Set Theory), Takeo Yoko- 
numa (Algebra, Group Theory), Takayoshi 
Mitsui (Number Theory), Tetsuro Kawasaki 
(Geometry, Topology), Shigeru litaka, Isao 
Wakabayashi (Algebraic Geometry), Morio 
Obata, Koichi Ogiue, Osamu Kobayashi (Dif- 
ferential Geometry), Seizó Ito, Hisashi Oka- 
moto (Analysis), Hiroaki Aikawa, Makoto 
Ohtsuka (Computer Analysis), Hikosaburo 
Komatsu, Akihiko Miyachi (Functional 
Analysis), Kazuo Okamoto, Datsuke Fujiwara 
(Differential Equations), Sin Hitotumatu 
(Special Functions), Teruo Ushijima (Nu- 
merical Analysis), Hideo Wada (Computer 
Science), Yasunori Okabe (Probability Theory), 
Mituaki Huzii, Yoshihiro Yajima (Statistics), 
Shigeru Furuya (Mathematical Programming), 
Koichi Nakamura (Theoretical Physics), 
Shüichi Okamoto (History of Mathematics), 
Kosaku Okutsu (History of Mathematics, 
Number Theory). 

On those occasions when it became neces- 
sary to rapidly revise manuscripts in order to 
unify the presentation, we were forced to go 
through the revision without consulting the 
authors of the manuscripts. I am responsible 
for all such revisions and hereby express my 
apologies to the authors concerned. 

As for the indexes, we received assistance 
from Takeo Yokonuma, Koichi Yano, Hiroaki 
Aikawa, and Hisasht Okamoto in all aspects 
of the work, which lasted for two years, 
through the final proofreading. Preliminary 
work on the Name Index was carried out by 
staff members of the University of Tokyo, and 
Seizó Ito supervised its final compilation. We 
also asked many other colleagues, and in par- 
ticular Nobuyuki Ikeda and Huzihiro Araki, 
to participate in the preparation of the Name 
Index, using resources available at different 
universities. To all of these people goes our 
sincere gratitude for their assistance. 

Last August, when the completion of the 
work was drawing near, it became necessary 
for me to leave Japan for some time, and this 
made it imperative that I entrust Shigeru 
litaka with responsibility for supervising the 
work at the final stage of editing. I am most 
grateful to Iitaka for having agreed to take 
over this responsibility and for having brought 
the work to completion under the pressure of 
time. 

From the beginning of this project, we have 
received an unlimited amount of assistance 
from the members of the Dictionary Depart- 
ment of Iwanami Shoten, Publishers. Messrs. 
Ikutaro Sasaki, Hirotomo Ushida, Kazuhiko 
Uetake, and Nagao Sato in particular, have 
made supreme efforts and have come up fre- 
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quently with innovative ideas to make this 
dictionary polished and perfect. To them and 
also to those at Dai-Nippon Printing Co., who 
typeset with the use of computers the entire 
text, including the complicated mathematical 
formulas, and who have cooperated with us 
fully in the process of editing the indexes, goes 
our deep gratitude. 


Kiyost Ié 
October 1985 
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Preface to the 
First English Edition 


The first and second editions of Iwanami 
Sugaku Ziten (in Japanese) were published, 
respectively, in April 1954 and June 1968 by 
Iwanami Shoten, Publishers, Tokyo. Begin- 
ning in the late 1950s, a number of unsuccess- 
ful attempts were made to arrange for translat- 
ing the Sugaku Ziten into European languages. 
Finally an agreement for an English transla- 
tion was made between The MIT Press and 
the Mathematical Society of Japan in July 
1968. The discussions were carried on first 

by Professor Kósaku Yosida, then presi- 

dent of the Mathematical Society of Japan, 
and later by Yukiyosi Kawada, who suc- 
ceeded him in April 1968. Throughout these 
initial negotiations, which lasted from 1966 to 
1968, we received the kindest assistance 

from Dr. Gordon Walker, Executive Director 
of the American Mathematical Society, 

and from Professors W. T. Martin and Shizuo 
Kakutani. 

The agreement for the project was shortly 
followed by the establishment of a commit- 
tee for the English edition of Sugaku Ziten 
within the Mathematical Society of Japan, 
with the following membership: Professors 
Yasuo Akizuki, Shigeru Furuya, Sin Hito- 
tumatu, Masuo Hukuhara, Isao Imai, Shó- 
kichi lyanaga, Yukiyosi Kawada, Kunihiko 
Kodaira, Atuo Komatu, Hirokichi Kudo, 
Shóji Maehara, Yukio Mimura, Kiyoshi 
Noshiro, Shigeo Sasaki, Shoji Ura, Nobuo 
Yoneda, and Kósaku Yosida. This committee 
requested the original authors of the articles 
and other members of the Society to translate 
the work. A list of translators will be found at 
the end of this work. 

In November 1968, an advisory committee 
for the project was formed with the following 
membership: Professor Edwin Hewitt (chair- 
man), Dr. Sydney H. Gould, Professor Shizuo 
Kakutant, Professor Kenneth O. May, and 
Professor Isaac Namioka. 

As the translating began, we were immedi- 
ately faced with problems concerning unifica- 
tion of terminology and style, some of which 
were inherent in the differences between the 
structures of our two languages—for example, 
the fact that the Japanese language makes no 
distinction between singular and plural forms 
of nouns. 

In August 1969, Professor Hewitt kindly 
arranged a meeting at the University of Wash- 
ington, Seattle, that included the members 
of the Japanese and American committees 
and a representative of The MIT Press. It was 
agreed during this meeting that the transla- 
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tion should be faithful, with only a minimum 
number of changes, such as the correction of 
mathematical errors; whereas the references 
to each article might be augmented consider- 
ably for the convenience of Western readers. 
Professor Kenneth O. May volunteered to 
review the entire translated manuscript, and 
Professors Isaac Namioka and Shizuo Kaku- 
tani, who are proficient in both Japanese and 
English, proposed to read through some of the 
manuscript of the translated articles. It was 
also agreed that the Systematic List of Arti- 
cles should appear in French, German, and 
Russian, as well as in English. 

We owe very much to the American com- 
mittee: Professor Hewitt organized the whole 
work, and Professor May revised the entire 
manuscript and gave us important advice con- 
cerning the appendices, according to which we 
deleted some of the numerical tables which 
may be easily found in readily accessible West- 
ern books. Professor Namioka reviewed a 
great part of the manuscript, transmitting his 
views to Professor May, who forwarded them 
to us with his comments. All of this assistance 
helped us greatly in making our final decisions. 
Professor Kakutani gave us very detailed and 
important advice on the choice of reference 
works. 

We were also assisted concerning English 
terminology and reference books by the fol- 
lowing Japanese mathematicians working in 
American universities: Tadatoshi Akiba, Pro- 
fessors Kiyosi Io Tatsuji Kambayashi, Tosio 
Kato, Teruhisa Matsusaka, Katsumi Nomizu, 
Ichiro Satake, Michio Suzuki, and Gaisi 
Takeutt. 

The Mathematical Society of Japan estab- 
lished the following double reviewing system: 
group A with its twenty subgroups, each 
headed by the members listed below, reviewed 
their respective subjects; while group B re- 
viewed the whole manuscript, mainly from the 
linguistic standpoint. 

Group A 

1. Foundations of mathematics: Shóji 
Maehara 

2. Set theory: Atuo Komatu and Shóji 
Maehara 

3. Algebra: Akira Hattori, Masayoshi Nagata, 
and Hideyuki Matsumura 

4. Group theory: Shingo Murakami, Mitsuo 
Sugiura, and Reiji Takahashi 

5. Number theory: Yukiyosi Kawada and 
Tomio Kubota 

6. Geometry, 7. Differential geometry: Shó- 
kichi lyanaga, Shigeo Sasaki, and Kentaro 
Yano 

8. Algebraic geometry: Yasuo Akizuki and 
Kunthiko Kodaira 

9. Topology: Atuo Komatu 


10. Real analysis: Sin Hitotumatu, Shunji 
Kametant, and Shigeki Yano 
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11. Complex analysis: Kiyoshi Noshiro 


12. Functional analysis: Yukio Mimura and 
Kósaku Yosida 


13. Differential equations: Masuo Hukuhara 
and Sigeru Mizohata 


14. Special functions: Sin Hitotumatu 
15. Numerical analysis, 16. Probability 
theory: Kiyosi Itó 

17. Statistics: Hirokichi Kudo 


18. Information theory: Tosio Kitagawa and 
Hirofumi Uzawa 


19. Theoretical physics: Isao Imai and 
Kazuhiko Nishijima 


20. History of mathematics: Tamotsu Murata 
Group B 
Kenichi Iyanaga and Mitsuyo Iyanaga 

Professor Sin Hitotumatu also assisted us in 
translating the titles of Japanese books given 
in the references and the explanations attached 
to the lists of formulas and numerical tables in 
the appendices. We are also grateful for the 
generous cooperation offered to us by our 
colleagues in the Department of Mathemat- 
ics, Faculty of Science, University of Tokyo: 
Hiroshi Fujita, Shigeru Furuya, Akio Hat- 
tori, Seizó Itó, Nagayoshi Iwahori, Tosihusa 
Kimura, Kunihiko Kodaira, Hikosaburo 
Komatsu, Akihiro Nozaki, and Itiro Tamura. 
We are indebted as well to Professors Walter 
L. Bailey of the University of Chicago and 
Yuji Ito of Brown University for many valu- 
able consultations concerning both mathe- 
matical and linguistic questions. 

In translating the Systematic List of Articles 
into French, German, and Russian, we were 
assisted by Professor Hideya Matsumoto in 
Paris, Professor Emanuel Sperner in Hamburg, 
Professor Katsuhiro Chiba in Tokyo, and 
Professor Arkadii Maltsev in Moscow. 

We began to send the manuscript to The 
MIT Press in March 1970 and finished send- 
ing it in July 1972. The manuscript was edited 
there, then sent to Professors May, Namioka, 
and Kakutani, and finally was sent back to us 
with their comments and questions. All of the 
references were carefully checked by Laura 
Platt. 

Iwanami Shoten, Publishers, have always 
been cooperative with us. In the office of 
the Mathematical Society of Japan, Yoko 
Endo, Reiko Nagase, and Chieko Sagawa 
helped us with their efficient secretarial 
work. 

The fruition of this project was made pos- 
sible only by the gracious assistance offered to 
us by many people, including those already 
mentioned. We should like to express our 
most sincere gratitude to all those who have 
helped us so kindly. 


Shókichi Iyanaga, Yukiyosi Kawada 
August 1973 
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Addition made in January 1976 


After the procedures described above, the 
whole manuscript of this Encyclopedia was 
sent to The MIT Press in August 1973. It was, 
however, toward the end of November 1975 
that the final decision was made by The MIT 
Press to send the manuscript to composition 
in early 1976 in order that the work be pub- 
lished in 1977. 

At the same time, we were asked to review 
and update the manuscript up to the end of 
February 1976. We are now making our best 
effort to this effect with the kind help, especially 
from the linguistic viewpoint, of Dr. E. J. 
Brody. 

In so doing, we have noticed that perhaps 
too much emphasis has been given to results 
obtained by Japanese mathematicians and that 
there are still many things in this book which 
should be improved. 

We hope that ongoing revisions will be 
carried out in subsequent editions. 


xiv 


Preface to the 
Second Japanese Edition 


Seven and a half years have passed since the 


revised and augmented edition of Iwanami 


Sugaku Ziten was published. The nature and 
purpose of this book remain the same as de- 
scribed in the preface to the first edition: It is 
an encyclopedic dictionary with articles of 
medium length aimed at presenting the whole 
of mathematics in a lucid system, giving exact 
definitions of important terms in both pure 
and applied mathematics, and describing the 
present state of research in each field, together 
with historical background and some per- 
spectives for the future. However, mathemat- 
ical science is in rapid motion, and the “pre- 
sent state of research" changes constantly. 
The present updated second edition has been 
published to remedy this situation as far as 
possible. 

The main points of revision are as follows: 

(1) On the Articles and the Size of the Ency- 
clopedia: From the articles of the last edition, 
we have removed those whose importance has 
diminished recently (e.g., Geometry of Tri- 
angles), while we have added new articles in 
domains of growing importance (e.g., Cate- 
gories and Functors; K-Theory). Many articles 
concerning applied mathematics in the first 
edition were short; in this edition, they have 
been combined into articles of medium length 
to save space and to systematize the presenta- 
tion. The number of articles, 593 in the first 
edition, has thus diminished to 436. We have 
made every effort to keep the size of the ency- 
clopedia as it was, but the substantial aug- 
mentation of content has necessarily brought 
about an enlargement of about 30%. 

(2) On the Text: When the title of an article 
has remained the same as in the first edition, 
we have reviewed the whole text and revised 
whenever necessary. Especially for the fun- 
damental ideas, we have endeavored to give 
thorough explanations. In the first edition we 
gave English, French, and German transla- 
tions of article titles; in the present edition, we 
have also added Russian. The bibliographies 
at the ends of articles have been updated. 

(3) On Terminology: In previous editions we 
endeavored to unify the terminology of the 
whole encyclopedia so that the reader would 
have no difficulties with cross-references. Here 
we have done this once again with the hope of 
attaining results more perfect than before. 

(4) On the Appendices: The appendices were 
designed to supplement the text efficiently. 
Some overlapping of the appendices with the 
text found in previous editions has been re- 
moved. Also deleted in this edition are elemen- 
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tary formulas in analytic geometry and tables 
of statistical distributions, which can be easily 
found in other books. However, we have 
added some formulas in topology, the theory 
of probability, and statistics, as well as tables 
of characters of finite groups, etc. 

(5) On the Indexes: Important terms are 
listed multiply in the indexes to facilitate find- 
ing, e.g., the term transcendental singularit y 
appears under both transcendental and singu- 
larity. Both names in the text and those in the 
references are included in the Name Index of 
this edition. The numbers of items in the Index 
of Mathematical Terms in Japanese and in 
European Languages and in the Name Index in 
this edition are 17740, 10124, and 2438, com- 
pared with 8254, 8070, and 1279, respectively, in 
the previous edition. 


The compilation of this edition was organized 
as follows. In the spring of 1964 we began to 
select the titles of articles with the aid of the 
following colleagues: in set theory and foun- 
dations of mathematics, Shóji Maehara; in 
algebra and number theory, Yasuo Akizuki, 
Yukiyosi Kawada; in differential geometry, 
theory of Lie groups, and topology, Yozo 
Matsushima, Atuo Komatu; in analysis, 
Masuo Hukuhara, Kósaku Yosida, Shunji 
Kametani, Sin Hitotumatu; in probability 
theory, statistics, and mathematics for pro- 
gramming, Kiyosi It6, Hirokicht Kudo, and 
Shigeru Furuya; in theoretical physics, Isao 
Imai; and for the appendices, Sin Hitotumatu. 
I have participated in compiling the articles 
on geometry and the history of mathematics. 
Kawada and Hitotumatu undertook the re- 
sponsibility of putting the volume in order. 

The work of selecting titles was completed 
in the summer of 1964. We then asked 173 
colleagues to contribute articles. The names of 
these contributors and those of the previous 
editions are listed elsewhere. To all of them 
goes our most sincere gratitude. 

In editing the manuscript, we were assisted 
by the following colleagues in addition to 
those mentioned already: m set theory and 
foundations of mathematics, Setsuya Seki and 
Tsurane Iwamura; in algebra and number 
theory, Masayosi Nagata, Akira Hattori, 
Hideyuki Matsumura, Ichiro Satake, Tikao 
Tatuzawa; in geometry, theory of Lie groups, 
and topology, Singo Murakami, Hideki Ozeki, 
Noboru Tanaka, Kiiti Morita, Hirosi Toda, 
Minoru Nakaoka, Masahiro Sugawara, Shóró 
Araki; in analysis, Kiyoshi Noshiro, Yáüsaku 
Komatu, Seizo Ito, Hiroshi Fujita, Shige- 
Toshi Kuroda, Sigeru Mizohata, Masaya 
Yamaguti, Tosiya Saito, Tosihusa Kimura, 
Masahiro Iwano; in probability theory, sta- 
tistics, and mathematics of programming, 
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Nobuyuki Ikeda, Tadashi Ueno, Masashi 
Okamoto, Haruki Morimoto, Kei Takeuchi, 
Goro Ishii, Tokitake Kusama, Hukukane 
Nikaido, Toshio Kitagawa; and in theoretical 
physics, Ryogo Kubo, Hironari Miyazawa, 
Yoshihide Kozai. 

After the summer of 1965, we entered into 
the period of finer technical editing, in which 
we were assisted by the following colleagues: 
in algebra, Keijiro Yamazaki, Shin-ichiro 
Ihara, Takeshi Kondo; in geometry, Tadashi 
Nagano, Mitsuo Sugiura, Ichiro Tamura, 
Kiyoshi Katase; in analysis, Nobuyuki Suita, 
Kotaro Oikawa, Kenkiti Kasahara, Tosinobu 
Muramatsu, Hikosaburo Komatsu, Setuzó 
Yosida, Hiroshi Tanaka; and tn history, 
Tamotsu Murata. 

We were also assisted by Katsuhiro Chiba 
in the Russian-language translation of the 
article titles, and by Osamu Kóta and Kiyoshi 
Katase in the indexing. 

Proof sheets began appearing in the spring 
of 1966. In proofreading, Kaoru Sekino, 
Osamu Kóta, Kiyoshi Katase, and Teruo 
Ushijima helped us, as well as Mrs. Rieko 
Fujisaki. Miss Yoko Endo worked with all of 
us throughout the entire revision at the office 
of the Mathematical Society of Japan. She 
helped us especially in looking for and check- 
ing references and preparing the Name Index. 

Yukiyosi Kawada supervised the whole 
work, succeeding me tn the role I had played 
in the compilation of the first and augmented 
editions. Sin Hitotumatu collaborated with 
him throughout, especially on the appendices. 
The second and the third proof sheets of the 
text were read by Kawada; the fourth proof by 
Hitotumatu; the proof sheets of appendices by 
both Kawada and Hitotumatu. 

The Editorial Committee of the Mathe- 
matical Society of Japan asked me to write 
this preface. Having edited the previous edi- 
tions and realizing full well the difficulty of the 
task, I would like to express my particular 
gratitude to Kawada. In the Dictionary De- 
partment of Iwanami Shoten, Publishers, 
Messrs. Hiroshi Horie, Tetsuo Misaka, Shi- 
geki Kobayashi, and Toshio Kouda were very 
cooperative in their collaboration with us. 

To them and also to those who typeset and 
printed this book at Dai-Nippon Printing Co. 
and Shaken Co. goes our gratitude. 


S. lyanaga 
March 1968 


Preface to the Revised and 
Augmented Japanese Edition 


Six years have passed since the publication of 
the first edition of this encyclopedia. This re- 
vised and augmented edition incorporates the 
achievement of these years. It contains, to- 
gether with the correction of errors found in 
the first edition, some new articles such as 
Abelian Varieties, Automata, Sheaves, Homo- 
logical Algebra, Information Theory, and also 
supplements to articles in the first edition such 
as Complex Multiplication, Computers, and 
Manifolds. These additional items render the 
new edition 93 pages longer than the previous 
one. Each article has been thoroughly re- 
vised, and the indexes have been completely 
rewritten. 

We were assisted by the following colleagues 
in selecting articles, writing, and proofreading: 
In set theory and foundations of mathematics, 
Sigekatu Kuroda, Setsuya Seki; in algebra 
and number theory, Tadao Tannaka, Tsuneo 
Tamagawa: in real analysis, Shunji Kametani, 
Kósaku Yosida; in function theory, Kiyoshi 
Noshiro, Sin Hitotumatu; in theory of dif- 
ferential and functional equations, Masuo 
Hukuhara, Masahiro Iwano, Ken Yamanaka; 
in functional analysis, KÓósaku Yosida, Seizó 
Itó; in geometry, Shigeo Sasaki, Nagayoshi 
Iwahori; in topology, Atuo Komatu, Itiro 
Tamura, Nobuo Yoneda; in theory of proba- 
bility, Kiyosi Itó, Seizó H6; in statistics, Toshio 
Kitagawa, Sigeiti Moriguti, Tatsuo Kawata; 
in applied mathematics, Sigeiti Moriguti; and 
in mechanics and theoretical physics, Taka- 
hiko Yamanouchi, Isao Imai. The revision 
and augmentation of the articles concerning 
the history of mathematics was done by 
myself. Miles. Yóko Tao, Eiko Miyagawa, and 
Mutsuko Nogami worked in the office of the 
Society. 

The names of authors who contributed to 
the completion of this edition have been added 
to the original list of contributors. 

The project of editing this edition started 
in the summer of 1958. We acknowledge our 
deep gratitude to all those who have col- 
laborated with us since that time. 


S. Iyanaga 
August 1960 
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This encyclopedia, Iwanami Sugaku Ziten, was 
compiled by the Mathematical Society of 
Japan at the request of the Iwanami Shoten, 
Publishers, who have hitherto published a 
series of scientific dictionaries such as Iwanami 
Rikagaku-Ziten (Iwanami Encyclopedia of 
Physics and Chemistry) and Iwanami Tetugaku- 
Ziten (Iwanami Encyclopedia of Philosophy). 
As mentioned in the prefaces to these volumes, 
the importance of such encyclopedias in clari- 
fying the present state of each science is obvious 
if we observe the rapid pace of contemporary 
research. Mathematics is also in rapid motion. 
As a fundamental part of exact science, it 
serves as a basis of all science and technology. 
It also retains its close contact with philo- 
sophy. Therefore, the significance of having 

an encyclopedia of mathematics cannot be 
overemphasized. 

Mathematics have made remarkable pro- 
gress in the 20th century. As for the situation 
toward the end of the 19th century, we quote 
the following passage from the article Mathe- 
matics in the 19th Century of this encyclo- 
pedia: "Toward the end of [the 19th] century, 
the subjects of mathematical research became 
highly differentiated. Branches split into more 
specialized areas of study, while unexpected 
relations were found between previously 
unconnected fields. The situation became 
so complicated that it was difficult to view 
mathematics as a whole. It was in these cir- 
cumstances that in 1898, at the suggestion of 
Franz Meyer and under the sponsorship of the 
Academies of Gottingen, Berlin, and Vienna, a 
project was initiated to compile an encyclo- 
pedia of the mathematical sciences. Entitled 
the Enzyklopädie der mathematischen Wissen- 
schaften, it was completed in 20 years...." 

One of the characteristics of 20th-century 
mathematics is the conscious utilization of the 
axiomatic method and of general concepts 
such as sets and mappings, which serve as 
foundations of different theories. Indeed, 
mathematics is being reorganized on the basis 
of topology and algebra. One such example of 
reorganization is found in Bourbaki’s Eléments 
de mathématique; some fifteen volumes of this 
series have been published since 1939, and 
more are coming. This encyclopedia, with 
its limited size, cannot contain proofs for 
theorems. However, we intend to present a 
lucid view of the totality of mathematics, 
including its historical background and future 
possibilities. 

Each article of this encyclopedia is of me- 
dium length— sufficiently short to permit 
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the reader to find exact definitions of notions, 
and sufficiently long to contain explanations 
clarifying how important concepts in the same 
field are related to each other. The problem of 
choosing adequate titles required some de- 
liberation. The Systematic List of Articles, 
classified according to specific fields, shows 
those we have chosen. The Index of Terms 
contains detailed references for each notion. 
The appendices, including formulas and tables, 
supplement the text, and will be particularly 
useful for applied mathematicians. 

The project of compiling this encyclopedia 
was proposed in the spring of 1947 by the 
Steering Committee of the Mathematical 
Society of Japan. It was promptly adopted, 
and the selection of articles in specific fields 
was started by the sectional committees of the 
Society. After seven years, our encyclopedia 
is finally appearing. We shall not give a de- 
tailed description of how our work proceeded 
through all these years. We list simply the 
names of those who assisted us greatly and 
to whom we should like to express our deep 
gratitude. 

The president of the Society at the start of 
this project was the late Professor Tadahiko 
Kubota; but our work has been supported also 
by Professors Teiji Takagi, Zyoiti Suetuna, and 
Masatsugu Tsuji as well as by other leading 
members of the Society. 

At the stage of selecting articles, we were 
assisted by the following colleagues: in history 
and the foundations of mathematics, Sige- 
katu Kuroda, Motokiti Kondo; in algebra 
and number theory, Kenjiro Shoda, Tadasi 
Nakayama, Masao Sugawara, Yukiyosi 
Kawada, Kenkichi Iwasawa; in geometry, 
Kentaro Yano, Asajiro Ichida; in function 
theory, Kiyoshi Noshiro, Yûsaku Komatu; in 
the theory of differential and functional equa- 
tions, Masuo Hukuhara, Shigeru Furuya; in 
topology, Atuo Komatu, Ryoji Shizuma; in 
functional analysis, Yukio Mimura, Shizuo 
Kakutani, Kôsaku Yosida; in the theory of 
probability and statistics, Tatsuo Kawata, 
Toshio Kitagawa, Junjiro Ogawa; and in 
applied mathematics, Ayao Amemiya, Isao 
Imai, Kunihiko Kodaira, Shigeiti Moriguti. 

We asked 190 colleagues to contribute arti- 
cles, which were collected in 1949. Since then 
we have spent an unexpectedly long time edit- 
ing them. Terminology had to be unified 
throughout the encyclopedia so that the reader 
would have no trouble with cross-references. 
Repetitions had to be eliminated and gaps 
filled. Part of the manuscript thus had to be 
rewritten a number of times. We have made 
out utmost effort in this editing work, but we 
are not completely without apprehension that 
our result has still left something to be desired. 
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For any shortcomings in the work, I take 
complete responsibility, as I have acted as the 
editor-in-chief. Also, since we have rewritten 
the manuscript, as already mentioned, we have 
refrained from printing the name of the original 
author of each article; for this, I must request 
the understanding of the contributors. 

In the stage of editing and proofreading, 
we were assisted by the following colleagues: 
Yukio Mimura, Yukiyosi Kawada, Kazuo 
Matsuzaka, Sin Hitotumatu, Setsuo Fuku- 
tomi, Setsuya Seki, Shoji Irie, Shigeo Sasaki, 
Tatsuo Kawata, Sigekatu Kuroda, Yûsaku Ko- 
matu, Ayao Amemiya, Isao Iami, Tosio Kato, 
Tsurane Iwamura, Morikuni Goto, Kôsaku 
Yosida, Jirô Tamura, Yasuo Akizuki, Kiyoshi 
Noshiro, Motosaburo Masuyama, Sigeiti 
Moriguti, Osamu Kôta, Nobuo Yoneda, 
Tsuneo Tamagawa, Jun-ichi Hano; and more 
particularly, in the foundations of mathematics, 
Sigekatu Kuroda, Tsurane Iwamura; in alge- 
bra and number theory, Kazuo Matsuzaka, 
Yukiyosi Kawada; in algebraic geometry, 
Yakuo Akizuki; in real analysis, Tatsuo Ka- 
wata; in complex analysis, Yûsaku Komatu, 
Sin Hitotumatu, Jirô Tamura; in functional 
analysis, Kôsaku Yosida; in topology, Setsuo 
Fukutomi, Nobuo Yoneda; in the theory of 
probability and statistics, Motosaburo Masu- 
yama, Sigeiti Moriguti; and in applied mathe- 
matics, Ayao Amemiya, Isao Imai, Tosio 
Kato, Sigeiti Moriguti. 

The portraits of Abel and Riemann were 
kindly loaned to us by Torataro Shimomura. 

The formulas in the appendices were com- 
piled by Isao Imai, Sin Hitotumatu, and Sigeiti 
Moriguti; the Subject Index (in Japanese and 
European languages) by Osamu Kóta and 
Mrs. Hiroko Ide; the Name Index and Com- 
ments on Journals and Serials by Setsuo 
Fukutomi. Setsuo Fukutomi has taken an 
active part in our work ever since 1948 and 
given much effort to collecting and rewriting 
the manuscript and to unifying terminology. 
The editorial staff of Iwanami Shoten, Pub- 
lishers, has always been cooperative. Without 
their generous support, this encyclopedia 
could never have been published. 

I should like to express my sincere gratitude 
to all those who have collaborated with us 
directly or indirectly. 


S. Iyanaga 
March 1954 


Introduction 


The text of this Encyclopedic Dictionary con- 
sists of 450 articles arranged alphabetically, 
beginning with 1 Abel and ending with 450 
Zeta Functions. Most of these articles are 
divided into sections, indicated by A, B, C, ..., 
AA, BB, .... Cross-references to articles, e.g., 
to the second article, are of the form: (— 2 
Abelian Groups) or (— 2 Abelian Groups A), 
according as the whole article or a specific 
section is being referred to. Citations in the 
indexes are also given in terms of article and 
section numbers. 

Key terms accompanied by their definitions 
in the text are printed in boldface. AII of these 
terms are found in the Subject Index at the 
end of the volume. 

The sign * means that the term preceded by 
it can be found in the index. A list of special 
notations used throughout the work (with 
explanations of their meanings) appears after 
the appendices. 

A Systematic List of Articles, showing the 
general structure of the work, wil! be found on 
p. 1857 (vol. IV). The number in parentheses 
after each article title refers to this systematic 
classification into subject areas; e.g., "Abelian 
Varieties (VIIL.5)" means that the article on 
Abelian varieties is the fifth article in Section 
VIII of the systematic list. 

Books and articles in journals are cited in the 
text by numbers in brackets: [1], [2], .... At 
the end of each article there is a section of 
references in which, for books, the name of the 
author or authors, title, name of the publisher, 
year of publication, and the number of the 
edition are given; for journals, the name of the 
author, title of the article, name of the journal, 
and the volume numbers and inclusive page 
numbers are given in this order. (The names of 
journals and publishers are abbreviated as 
indicated in the lists at the end of the work.) 











The Cyrillic alphabet is transliterated as 


follows: 
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Abel, Niels Henrik 


1 (XX1.12) 
Abel, Niels Henrik 


Niels Henrik Abel (August 5, 1802- April 6, 
1829) was born a son of a poor pastor in the 
hamlet of Findó in Norway. In 1822, he en- 
tered the University of Christiania; however, 
he studied mathematics almost entirely on his 
own. He was recognized as a promising stu- 
dent by his senior, Holmboe, and after gradu- 
ation he studied abroad in Berlin and Paris. 
In Berlin he met and was aided by A. Crelle, 
the founder of the Journal für die Reine und 
Angewandte Mathematik, and participated in 
the founding of this journal. Although he did 
brilliant work in Paris, he did not gain the 
fame he deserved. He returned to Norway in 
May 1827, but, unable to find a job, he was 
obliged to fight poverty while continuing his 
research. He died at twenty-six of tuberculosis. 

His best-known works are the result that 
algebraic equations of order five or above 
cannot generally be solved algebraically, the 
result that tAbelian equations can be solved 
algebraically, the theory of *binomial series 
and of series in general, the theory of tellip- 
tic functions and more generally of talge- 
braic functions, the introduction of tAbeltan 
integrals, and the establishment of *Abel's 
theorem. His work in both algebra and analy- 
sis, written in a style conducive to easy com- 
prehension, reached the highest level of attain- 
ment of his time. 
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2 (IV.2) 
Abelian Groups 


A. General Remarks 


A *group G is called an Abelian group (or 
commutative group) if G satisfies the commuta- 
tive law ab — ba for all a, be G. In this article, G 
always denotes an Abelian group. Every tsub- 
group of G is a *normal subgroup, and all the 
elements of finite order in G form a subgroup 


T, for which the *factor group G/T has no 
elements of finite order except the identity e. 
T is called the (maximal) torsion subgroup of G. 
If G — T, then the Abelian group G is called a 
torsion group (or periodic group). On the other 
hand, if T= {e}, then G is called torsion-free; if 
T#G, Tz (ej, then G is called mixed. If the 
order of every element of a torsion group G is 
a power of a fixed prime number p, then G is 
called an Abelian p-group (or primary Abelian 
group). An Abelian torsion group is the fdirect 
sum of primary Abelian groups. Thus the 
study of torsion groups is reduced to that of 
primary Abelian groups. 


B. Finite Abelian Groups 


The following fundamental theorem on finite 
Abelian groups was established by L. Kro- 
necker, G. Frobenius, and L. Stickelberger in 
the 1870s. An Abelian group G of order p", 
where p is a prime number, is a direct product 
of *cyclic subgroups Zi,...,Z,:G— Z, x... x 
Z,. If Z; is of order při, then n=n,+...+n,, 
and we can assume that n; 2 n;,,. A direct 
product decomposition of G, as above, is not 
unique, but n,,...,, are determined uniquely 
by G. The system {p",...,p} or {m,,...,n,} is 
called the system of invariants (or type) of G, 
and a system of tgenerators {z,,...,z,} of 
Zi,- Z is called a basis of G. An Abelian 
group of type (p. p, ..., p) is called an elemen- 
tary Abelian group. The decomposition of a 
finite Abelian group into a *direct sum of 


subsets (not necessarily of subgroups) was 


considered by G. Hajós (1942) and applied 


successfully to a problem of number theory 
(— 151 Finite Groups). 


C. Finitely Generated Abelian Groups 


The theory of *finitely generated Abelian 
groups, i.e., Abelian groups generated by a 
finite number of elements, is as old as that of 
finite Abelian groups. The direct product of 
"infinite cyclic groups is called a free Abelian 
group. À finitely generated Abelian group G is 
the direct product of a finite Abelian group 
and a free Abelian group. The finite factor is 
the torsion subgroup of G. The free factor of the 
group G is not necessarily unique; however, 
the number of infinite cyclic factors of the free 
factor is uniquely determined and is called the 
rank of G. Two finitely generated Abelian 
groups are isomorphic if they have isomorphic 
maximal torsion subgroups and the same 
rank. This theory can be extended to the 
theory of tmodules over a ‘principal ideal 
domain (— 67 Commutative Rings K). 


D. Torsion Groups 


The structure of Abelian p-groups is relatively 
well known, compared with other infinitely 
generated Abelian torsion groups. In the 
1920s, H. Prüfer made the first important 
contribution to the study of Abelian p-groups, 
and H. Ulm and L. Zippin completed the 
theory for countable groups in the 1930s. The 
uncountable case was first treated by L. Kuli- 
kov in the 1940s, but the study of this case is 
still in progress. 

An Abelian p-group G # {e} is called divis- 
ible (or complete) if for any aeG there is an 
element xeG satisfying x? =a. A divisible 
group is a tdirect sum of Abelian groups of 
type p” (Prüfer). Here a group of type p” is 
isomorphic to the tmultiplicative group of all 
the p”th roots of unity (n— 1,2, ...) in the 
complex number field. Let G be any Abelian 
p-group. The maximal divisible subgroup 
V of G is a direct product factor of G:G= 
V x R, where R has no divisible subgroups. 
An Abelian p-group without a divisible sub- 
group is called a reduced Abelian group. 

An element x of an Abelian p-group G is 
said to have infinite height if for any n there is 
an element y,€G satisfying x = y7". The ele- 
ments of infinite height form a subgroup G! of 
G. If G! = {e} and G is countable, then G is 
decomposed uniquely into the direct sum of 
cyclic groups. This assertion fails if the hy- 
pothesis of countability is dropped. By *trans- 
finite induction we can define G^ as follows. If 
B is an fisolated ordinal number, then G’ = 
(G^ IV: if f is a tlimit ordinal number, then 
G^ = (1,5 G*. For the least ordinal number t 
such that G'— G**!, G" is the maximal divis- 
ible subgroup of G. If G is reduced, then G'— 
{e}. We call x the type of an Abelian p-group 
G. Fora « t, G* = G"/G**! is called the Ulm 
factor of G, and the sequence G°, ..., G*, ... 

(x « 1) is called the sequence of Ulm factors 
of G. Each Ulm factor G* has no element of 
infinite height, and if « « x — 1, then G* has an 
element of arbitrarily large order. Let t be a 
countable ordinal number; assume that for 
any ordinal number a <t there is given a 
countable Abelian p-group A, such that A, 
has no element of infinite height, and that 
for «x t— 1 A, has an element of arbitrarily 
large order. Then there is a reduced coun- 
table Abelian p-group which is of type x with 
a sequence of Ulm factors isomorphic to Ag, 
Ah, Al, (a. t) (Zippin). Two reduced 
countable Abelian p-groups A and B are iso- 
morphic if they have the same type qt; and for 
any «<t the Ulm factors A* and B* are iso- 
morphic. The assertion fails if the hypothesis 
of countability is dropped. 
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E. Torsion-Free Groups 


In Abelian groups, the group operation is 
often written a+b, using the additive notation; 
an additively written group, called an additive 
group, is generally assumed to be Abelian. In 
the rest of this article we consider exclusively 
additive Abelian groups, of which the additive 
group Z of rational integers is the most primi- 
tive example. In such a group the identity 
element is called the zero element and is de- 
noted by 0; the inverse of a is denoted by — a, 
and we write a+(—b)=a—b. The direct sum 
of additive groups A, (4e A) is called a free 
additive group if each A; is isomorphic to Z. 
An additive group G is regarded as a tmodule 
over the tring Z, to which the notion of linear 
independence is applicable (— 277 Modules). 
Elements a,,...,a, of G are linearly dependent 
if there are integers n,, ..., n, not all of which 
are zero such that n,a,+...+n,a,=0. Those 
that are not linearly dependent are termed 
linearly independent. An infinite set of elements 
of G is called linearly independent if the ele- 
ments of any finite subset are linearly inde- 
pendent. If there are N elements of G that are 
linearly independent, but if any N + 1 elements 
of G are linearly dependent, then N is called 
the rank of G. Such a system of N linearly 
independent elements is called a maximal 
independent system. A torsion-free additive 
group G is not necessarily free if G is not 
finitely generated. 

The first important work on torsion-free 
additive groups was done by F. W. Levi (1917). 
A. G. Kurosh (1937) completed the theory in 
the case of finite rank. In the general case little 
is known, and I. Kaplansky, J. Rotman, and 
others are continuing the investigation. 

The additive group Q of rational numbers is 
of rank 1, and conversely any additive group 
of rank 1 is isomorphic to some subgroup of 
Q. An additive group G is called divisible (or 
complete) if for any ae G and for any integer n 
there is an element x,€ G such that nx,—a. A 
divisible torsion-free additive group is isomor- 
phic to a direct sum of some copies of Q. For 
any torsion-free additive group G there is a 
divisible torsion-free additive group containing 
G. A minimal additive group F among these 
groups is uniquely determined up to isomor- 
phism and has the rational number field Q as 
an toperator domain. Let Q = {a/b|(a, b) - 

1, p{b} be the ring of p-integers in Q, and let 
G, be the smallest Q?-subgroup of F contain- 
ing G. Let Q, be the *p-adic number field and 
Z, the ring of tp-adic integers. Extending the 
operator domain Q to Q, we obtain naturally 
a Q,-module F, from F. Let G, be the natural 
closure of G, in F,. Then G, has Z, as an 
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operator domain and thus becomes a Z,- 
module. A Z,-module of rank N is isomorphic 
to the direct sum of x, copies of Q, and N —«, 
copies of Z,:G, 2 3,Q,v, t X, Zw, (n= 
ld ness ., N — kK,). Here x, is called 
the p-rank of G. As the invariants of G, Kurosh 
gives the rank, the p-ranks for all primes p, 
and a certain equivalence class of the sequence 
of the matrices ON. Here p ranges over all 
primes, and W, is the matrix of coefficients 
when the elements of a maximal independent 
system of F are written as linear combinations 
of (v 


n? Wal 


F. General Abelian Groups 


An Abelian group is, in general, an textension 
of a torsion group by a torsion-free group. A 
torsion group T is called bounded if there is an 
integer n such that 1" — 1 for all te T. Suppose 
there is a torsion group T. Then T' is a direct 
summand of an Abelian group G which con- 
tains T as its maximal torsion subgroup if and 
only if T is the direct product of a divisible 
group and a bounded group (R. Baer and S. 
Fomin). 


G. Characters 


A character y of an Abelian group G is a func- 
tion which assigns to each ae G a complex 
number y(a) of absolute value 1 and satisfies 
X(ab) = y(a)y(b) for all a, be G. The product y= 
X1X2 Of two characters y, and y; is defined 

by x(a) 7 x(a) y;(a), and y is also a character 
of G. Thus all the characters of G form an 
Abelian group C(G), which is called the char- 
acter group of G. The identity element of the 
character group is the identity character (or 
principal character) y such that y(a)= 1 for all 
ac G. If G is finite, then G = C(G). This implies 
the duality G 2 C[C(G)]. This fact was ex- 
tended by L. S. Pontryagin to tlocally compact 
"topological Abelian groups (— 422 Topo- 
logical Abelian Groups B- D). For additive 
Abelian groups with operator domains — 277 
Modules. 
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A. History 


Except for C. F. Gauss, whose work on this 
subject saw the light many years after his 
death, N. H. Abel was the first to consider 
talgebraic functions as functions of complex 
variables and to discover double periods of 
telliptic functions such as x = x(u), which is the 
inverse function of an elliptic integral 





| * dx 

u= ; 

V fax) 

where f4(x) is a polynomial in x of degree 4. C. 
G. Jacobi expressed elliptic functions explicitly 
as ratios of theta series. As a natural generali- 
zation of elliptic functions, which are tAbelian 
functions of genus 1, Jacobi and his successors 
studied the inverse function of the thyperellip- 
tic integral, or more generally, the inverse 
function of the Abelian integral. By investigat- 
ing hyperelliptic integrals of the first kind of 
genus 2, 





i dx, | 2 adr: 

V felxi) fo(X2) 
where f,(x) is a polynomial of degree 6 in x, 
Jacobi obtained multiple-valued functions 
with quadruple periods. He then discovered 


the remarkable fact that if we consider sums of 
two integrals 





* dx, d 2 dx; 
V fo(%1) J fol St 
“x ESI *2 adr: 
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the elementary symmetric functions s, =x, 
+x, and s; — x, x; of x, and x, are single- 
valued functions of u; and u, with quadruple 
periods. He also conjectured that these func- 
tions s, and s; might be expressed explicitly in 
terms of theta series of u, and u;; this conjec- 
ture was later confirmed by J. G. Rosenhain 
and A. Gópel. 

In the latter half of the 19th century, the 
general theory of Abelian functions was es- 
tablished. The central subject was Jacobi's 
inverse problem (— Section L), which is a 
natural generalization of the above results. B. 
Riemann solved this problem by expressing 
elementary Abelian functions (— Section M) 
as rational functions of theta functions. 

The theory of functions with multiple 
periods was developed further by H. Poincaré, 
G. Frobenius, and E. Picard. In the 20th cen- 
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tury, the importance of the theories of Abelian 
functions and Abelian varieties has become 
more obvious with the development of the 
theory of functions of several complex vari- 
ables and algebraic geometry. In particular, 
problems intimately related to number theory 
have given rise to the development of a purely 
algebraic theory of Abelian varieties. Valuable 
contributions have been made by S. Lefschetz, 
C. L. Siegel (tSiegel modular forms), A. Weil 
(algebraic theory of Abelian varieties, syste- 
matization of the theory of theta functions), 
and by D. Mumford (tmoduli theory, algebraic 
theory of theta functions). 


B. Algebraic Theory 


When a tgroup variety is tcomplete as a 
variety, the group law is commutative; such a 
group variety is called an Abelian variety (— 
12 Algebraic Varieties H.). Let B be a sub- 
variety of an Abelian variety A, and assume 
that B is a subgroup of A as an abstract group. 
Then B has the structure of an Abelian variety 
whose law of composition is induced by that 
of A, and B is called an Abelian subvariety of 
A. More generally, when an algebraic subset 8 
is a subgroup of A, then the component B of B 
containing the identity element is an Abelian 
subvariety, and 38 is a union of B and a finite 
number of cosets of B. When A is defined over 
a field k, then any Abelian subvariety of A is 
defined over a finite tseparable extension of k 
(W. L. Chow’s theorem). An Abelian variety 

A is called simple if A has no Abelian sub- 
varieties other than A itself and 0. 

Every frational mapping of an algebraic 
variety V into an Abelian variety is defined at 
each simple point of V. This implies that an 
Abelian variety is tabsolutely minimal. 


C. Homomorphisms 


A rational mapping of an Abelian variety A 
into an Abelian variety B is called a rational 
homomorphism (or simply homomorphism) if f 
is a group homomorphism. Let F be a rational 
mapping of A into B; then F can be uniquely 
expressed as follows: F(x) = Fo(x)+ F(0) (x e A), 
where F, is a homomorphism and F(0) is the 
image of the unit element 0 of A. Hence the 
structure of an Abelian variety (as a group 
variety) is uniquely determined by the underly- 
ing algebraic-variety structure. 

When a rational homomorphism f is bira- 
tional, f is called a birational isomorphism (or 
simply isomorphism). It is clear that a rational 
isomorphism is an abstract isomorphism, but 
the converse is not necessarily true. Let A, B 
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be two Abelian varieties. We say that A is 
isogenous to B if the dimension of A is equal to 
that of B and there exists a surjective homo- 
morphism of A onto B, or equivalently, if there 
exists a surjective homomorphism of A onto B 
whose kernel is finite. The relation of isogeny 
is an equivalence relation. For an Abelian 
variety A and an Abelian subvariety X of A 
there is an Abelian subvariety Y of A such 
that the natural homomorphism X x Y^ 

A: (x, y) ox 4*- y is an isogeny (Poincaré's com- 
plete reducibility theorem). In particular, every 
Abelian variety is isogenous to a product of 
simple Abelian varieties that are determined 
uniquely up to isogeny and order. 

Let A, B be two Abelian varieties; we denote 
by Hom(A, B) the additive group of rational 
homomorphisms of A into B. When a rational 
homomorphism 4 is surjective, then the degree 
v(4) of 4 is defined by A(A)= v(4) B as talgebraic 
cycles. If 4 is an isogeny, then v(4) 40, and the 
order of the kernel [t|te A, 4t —0j is at most 
v(4) and equal to v(A) if and only if å is fsepa- 
rable. The additive group Hom(A, B) is shown 
to be free of rank x 4dim A- dim B by the l-adic 
representation (— Section E). If A — B, then 
Hom(A, A) has a ring structure; it is called the 
ring of endomorphisms (or endomorphism ring) 
of A and is denoted by 9I(A). The tensor prod- 
uct 9[,(A4) = 9[(A) © Q, where Q is the field 
of rational numbers, is an 'associative alge- 
bra over Q. If A is simple, then 9[5(A) is a 
‘division algebra. More generally, 9I,(A) is iso- 
morphic to a direct product of some ttotal 
matrix algebras over division algebras; thus 
9I5(A) is fsemisimple. In particular, if A is 1- 
dimensional (in other words, if A is an telliptic 
curve), the types of 2{,(A) are well known; 
when the characteristic p —0, then Y,(A) is 
either the field of rational numbers or an 
timaginary quadratic field. When p> 0, aside 
from these two fields, we have a tquaternion 
algebra over Q as a possible type of (A). 

Let k be a finite field with q elements. An 
algebraic integer is called a Weil number for g 
if every conjugate of it has absolute value a. 
If A is an Abelian variety defined and simple 
over k, the qth power endomorphism of A: 
xx^ determines a conjugacy class of Weil 
numbers for q, as Weil showed (— 450 Zeta 
Functions). Moreover, we have the following 
classification theorem (J. Tate, T. Honda): 
There is a one-to-one correspondence between 
the set of all k-isogeny classes of k-simple 
Abelian varieties over k and the set of all con- 
jugacy classes of Weil numbers for q. Tate also 
determined the structure of the division alge- 
bra Wg (4) over Q, which is described in terms 
of the decomposition of the qth power endo- 
morphism into prime ideals. 
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D. Divisors 


Let (5 be the additive group of tdivisors on an 
Abelian variety A and (5, be the subgroup of 
divisors that are talgebraically equivalent to 0. 
Then the factor group (5/6, has no ‘torsion 
part; this implies that for an Abelian variety 
tnumerical equivalence coincides with *alge- 
braic equivalence. We denote this relation 
by z. Given an element a of A, the translation 
TI: AS3xx-F-acA gives a tbirational trans- 
formation, which is everywhere fbiregular, on 
the underlying variety of A; we denote by X, 
the image of a divisor X on A. Then X =0 if 
and only if X, is linearly equivalent to X for 
each point a of A. The *Albanese variety of an 
Abelian variety A is A itself, and the ! Picard 
variety A of A is isogenous to A. In particular, 
for the ‘Jacobian variety J of an algebraic 
curve, J is isomorphic to J itself. The Picard 
variety A of Ais isomorphic to A (duality 
theorem). Let X be a divisor on A; the map- 
ping athe ‘linear equivalence class of the 
divisor X,— X, ac A, is a rational homomor- 
phism of A into A, and we denote it by gy. If 
Q x —0, then X =0, and vice versa; hence we 
have an additive map of 6/6, into Hom(A, A). 
If o, is surjective, we say that X is nondegener- 
ate. A tpositive divisor X is nondegenerate if 
and only if X is *ample, and then nX is tvery 
ample for n z 3. There always exist positive 
nondegenerate divisors on an Abelian variety; 
therefore an Abelian variety is a *projective 
variety. For a given divisor X on A, we can 
find n suitable points u,,...,u,, where n is the 
dimension of A, so that the intersection prod- 
uct X, -...: X, is defined. We denote by (X'^) 
the tdegree of the zero cycle X, -...- X, . If X 
is positive nondegenerate, then the dimension 
(X) of the tdefining module of the tcomplete 
linear system determined by X is equal to 
(X y/n! (Poincaré's theorem). Furthermore, the 
degree v(py) of py, where X is any divisor on 
A, is given by the formula v(py)=((X)/n!)? 
(Frobenius's theorem). 

For a nondegenerate divisor X on A there is 
a unique integer i=i(X), 0€ i dim A, called 
the index of X, such that H?(A, €(X)) 20 for 
p#iand H'(A,0(X)) 40. We have i(—X)= 
dim A —i(X) and i(X)=0 if and only if X is 
ample. 


E. l-adic Representations 


Let A be an Abelian variety of dimension n. 
For a given prime number l, let G,(A) denote 
the group of points on A whose order is a 
power of l. If / is different from the character- 
istic of the base field of A, then the group 
(5,(A) is isomorphic to the direct product of 2n 


factor groups Q,/Z;,, where Q, is the field of l- 
adic numbers and Z, is the group of l-adic 
integers (— 439 Valuations). We call such an 
isomorphism the /-adic coordinate system of 
©,(A). Now let 4 be a rational homomorphism 
of A into an Abelian variety B of dimension m. 
Then we can see that 4 induces a homomor- 
phism of 65,(A) into (5,(B). This shows that by 
placing /-adic coordinate systems on ©,(4) and 
(5,(B) respectively, we get a matrix representa- 
tion M,(A) of 4 with 2m rows and 2n columns. 
The representation A4 Mj(A) is faithful, and 
M,(A) is called the /-adic representation of 2. 
When A — B, then 4— Mj(A) is a faithful repre- 
sentation of the ring of endomorphisms 9I(A). 
This representation can be naturally extended 
to the representation of the algebra dat Ay; 

the characteristic polynomial of the /-adic 
representation M,(A) (where J is an element 

of Xo (4)) is a polynomial with coefficients in 
Q. Moreover, the polynomial does not depend 
on the choice of the prime number /. When 

AE UA), then v(A) is equal to det MOL The 
trace of M,(A) ts usually written as o(A). 

Let 4 be a rational homomorphism of A 
into B and Y be a divisor on B. Then by the 
correspondence cl(Y)—cl(4 !(Y)), we obtain a 
rational homomorphism of B into A, called the 
transpose of 4 and denoted by ‘4, where cl 
means the linear equivalence class (— 16 Alge- 
braic Varieties M). If X is a nondegenerate 
divisor on A, then the composition map IER 
A of 'py:A—A and the canonical isomorphism 
AAA satisfy the equality Bo oy =v(px)d 
(0 — the identity mapping of A). We denote by 
9x! the element (1/v(o,)) f in Hom(A, A) & Q. 
The correspondence x^, o zx o'wo@y 
(xe 9I5(A)) is an tinvolution of 91,(4) and 
is of order 1 or 2. If «40, then a(a’oa)>0 
(Castelnuovo's lemma). A. Weil was the first to 
recognize the importance of this theorem in 
connection with tRiemann’s hypothesis on 
*congruence zeta functions. 


F. Differential Forms 


A ‘differential form œ on an Abelian variety A 
of dimension n is called invariant if T*% =a 
for every point ae A, where T, is the trans- 
lation by a (— Section D). The differential form 
*of the first kind is invariant, and conversely, 
every invariant differential form is of the first 
kind. Let K be the *universal domain and 
K(A) be the tfunction field of A. The set of the 
linear differential forms of the first kind on A 
is a linear space over K of dimension n, and its 
basis becomes a basis over K(A) of the linear 
space consisting of all linear differential forms 
on A. An invariant derivation on A is a deriva- 
tion D in K(A)satisfying (Df)o T, = D(f oT, 


for any element f of K(A) and every point a of 
A. For a linear differential form w=» f, du; 
and a derivation D, we put <œ, D> =È f;Du;. 
Then 4o, D» is a bilinear form in œ and D. A 
derivation D is invariant if and only if <œ, D> 
is a constant function for every invariant linear 
differential form œ. Similarly, a linear differen- 
tial form c is invariant if and only if (c, D» is 
a constant function for every invariant deriva- 
tion D. The linear space of invariant linear 
differential forms and that of invariant deriva- 
tions are dual to each other with respect to 
the bilinear form (c, D». 

Now consider the case when the character- 
istic p of the universal domain is positive. The 
automorphism aa^, ac K, of the universal 
domain K induces a group isomorphism of A; 
we denote by A? the image of A and by x? the 
image of a point x of A. The image A? is an 
Abelian variety, and the group isomorphism 
n:xxP, xe A, is an isogeny of A onto A”. Let 
B be another Abelian variety and let 4 be an 
isogeny of A onto B. If there is an isogeny x: 
B- A? such that 7 = uo À, then we say that A is 
of height 1. The function field K(B) of B can be 
considered as a subfield of the function field 
K(A) of A by the mapping 4. If 4 is of height 1 
and v(A)— p, there exists an invariant deriva- 
tion D of K(A) with the constant field K(B), 
uniquely determined up to constants. More- 
over, we can choose D so that D? =D or D? «Q0. 
In the first case 4 is said to be of type (i,); in 
the second case it is said to be of type (i). An 
isogeny whose degree is a prime different from 
the characteristic p is said to be of type (s,), 
and a tseparable isogeny whose degree is p 
is said to be of type (s;). Any isogeny can be 
written as a product of isogenies of these four 
types. 


G. Polarized Abelian Varieties 


Let X be a divisor on an Abelian variety A; we 
denote by X the class of divisors X' such that 
mX =m X' for suitably chosen positive inte- 
gers m, m'. When the class X contains positive 
nondegenerate divisors, we say that X deter- 
mines a polarization on A, and the couple 

(A, X) is called a polarized Abelian variety. In 
particular, if A is a Jacobian variety whose 
polarization X is determined by a theta divi- 
sor, we call (4, X) the canonically polarized 
Jacobian variety. If an endomorphism « of A 
keeps the polarization invariant, i.e., if the 
class determined by «^! (X) coincides with the 
class X, then a is called an endomorphism of 
the polarized Abelian variety (A, X). In partic- 
ular, if « is an automorphism of A, then we 
say that « is an automorphism of (A, X). The 
group of all automorphisms of a polarized 
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Abelian variety is finite. In particular, the 
group of automorphisms of a canonically 
polarized Jacobian variety is finite. Hence 
follows the famous theorem concerning the 
finiteness of the group of automorphisms of an 
talgebraic curve of genus not less than 2. 

On the other hand, the algebraic equiva- 
lence class of a nondegenerate divisor is called 
an inhomogeneous polarization. (The above 
polarization is then sometimes called a homo- 
geneous polarization.) An inhomogeneous 
polarization X determines an isogeny Qy: 
A Â uniquely. If gy is an isomorphism, 
the polarization is called principal. An endo- 
morphism of an inhomogeneously polarized 
Abelian variety can be defined similarly. 


H. Analytic Theory 


For the rest of this article we take the complex 
number field C as the universal domain, and in 
this case we can utilize analytic and topolog- 
ical methods. 

Let C" be an n-dimensional vector space 
over C. In a natural way, the space C" be- 
comes a 2n-dimensional vector space R?" 
over the real field R, and the mapping J:z-> 
ue z, zeC', is an R-linear automorphism 
of R?” such that J? = —1. Conversely, if for an 
even-dimensional R-vector space R?" such 
a mapping J is given, then by putting (a+ 
J-i b)x=ax+bJx (xeR?";a,beR), we can 
introduce an n-dimensional complex linear 
structure into R?". We then say that J deter- 
mines a complex structure on R?"; we denote 
by C" z (R?", J) the space having the complex 
structure determined by J. Let c, ..., 0, be 
2n R linearly independent points on C" = 
(R?", J). Then the tlattice D generated by 
these points is discrete, and the factor group 
T" = C"/D is a complex torus of dimension n. 
We fix a basis of C" and introduce a complex 
coordinate system on CH. Utilizing the basis 
€, ..., O5, OË R?", we also introduce a real 
coordinate system on R2". We then obtain an 
n x 2n matrix Q — (oj), where the (@,;, ...,,;) 
are the complex coordinates of œ;; the ma- 
trix Q is called the period matrix of T". Let 
(Zi ...,z,) be the complex coordinates of 
a point ze €" and (x,, ..., x;,) be the real 
coordinates. Then we have '(z,, ...,z,)— 
Q'(x, ..., x5,). If we let the same symbol J 
stand for the representation matrix of the 
linear transformation J with respect to the 
basis %4, ..., €5,, then we have J-1 Q=QJ. 
The underlying real Lie group of T" is a 2n- 
dimensional (real) ttorus group (R/Z)?". Hence 
the *Poincaré polynomial of T" is (14- x)?". 
Any tHermitian metric on C" (as a vector 
space) induces a ‘flat "Kahler metric; hence the 
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Hodge numbers (— Kahler Manifolds C) are 
hra = (0. 


I. Theta Functions 


A holomorphic function f(z) on C" 2 (R", J) is 
called a theta function if for every de D we 
have f(z 4- d) 2 f(z)exp(l,(z) 4- c4), where l(z) is 
a linear form on C" which, as for c4, depends 
on d. The set of zeros of a theta function f, 
which we write as ( f), determines an analytic 
divisor on T". Conversely, for every effective 
analytic divisor X on T", there exists a theta 
function f such that ( f) 2 X. With respect to 
the real coordinate system x, ..., ae deter- 
mined by the basis @,,...,@2,, we can find 2n 
x 2n matrices A, Ag and a 1 x 2n matrix b, 
with elements in C, such that the transforma- 
tion formula f(x + a) = f(x) exp / —1(aAx 
-i'aAga 4-'ba)) (where A =A, (mod Z), Ag 

= Ao) holds for every 1 x 2n matrix a whose 
elements are rational integers. Moreover, if 
we put E= A—'A, then E is an talternating 
matrix whose elements are rational integers, 
and $= EJ is a tpositive semidefinite sym- 
metric matrix. Conversely, if there exists such 
an alternating matrix E we can find a theta 
function. (There does exist, however, a com- 
plex torus on which no theta function exists 
other than trivial ones, i.e., ones of the form 
exp(o(z)), where (z) is a polynomial of degree 
at most 2.) 

A theta function f is called nondegenerate if 
it cannot be a function of n — 1 complex vari- 
ables, and f is nondegenerate if and only if the 
matrix $= EJ is positive definite. A complex 
torus has the structure of an Abelian variety if 
and only if there exists a nondegenerate theta 
function, i.e., if and only if there exists an 
alternating matrix E whose elements are ra- 
tional integers such that EJ is a positive de- 
finite symmetric matrix. The latter condition 
is satisfied if and only if there exists an alter- 
nating matrix E whose elements are rational 
integers such that Q'E^!'Q—0, ,/—1 Q'E"'© 
7 0 (positive definite Hermitian matrix). In 
particular, a period matrix Q satisfying these 
conditions is called a Riemann matrix, and the 
rational matrix ‘E~! is called the principal 
matrix belonging to Q. 

Determining a polarization on an Abelian 
variety can be reduced to designating a class of 
principal matrices obtained from a principal 
matrix by multiplying by positive integers. Let 
X be a positive divisor on an Abelian variety 
T" — C"/D, and let f be a theta function such 
that ( /) 2 X. Then the divisor X is nondegen- 
erate if and only if the theta function f is non- 
degenerate, and the latter statement holds if 
and only if the alternating matrix E obtained 


from f is tinvertible. For a given principal 
matrix 'E ^!, we can choose suitable coordinate 
systems of C" and D so that O—(1,, F) and E = 


—A 0 
a diagonal matrix whose elements are telemen- 
tary divisors of E. In these situations, AF is 
symmetric and its imaginary part positive 
definite, i.e., AF is a point of the 'Siegel upper 
half-space S,,. Thus to the polarized Abelian 
variety (where the polarization is determined 
by 'E ^!) there corresponds a point of G,. This 
gives a one-to-one correspondence between 
the isomorphism classes of Abelian varieties 
inhomogeneously polarized by a principal 
matrix with given elementary divisors and the 
points of the factor space G,,/T,(A), where 
L,(A) is a subgroup that is tcommensurable 
to the Siegel tmodular group of degree n and 
operates on GC, discontinuously. The case 
of principal polarization corresponds to the 
case when A= £,, and then I,(A) is the Siegel 
modular group itself. S,,/T,(A) is the coarse 
*moduli space of Abelian varieties, polarized 
as above; the projective embedding of G,/T (A) 
is given by means of tSiegel modular forms. 


0 A 
( H where J, is a unit matrix and A is 


J. Abelian Functions 


Meromorphic periodic functions on C" with 
periods c, ..., 0, (i.€., meromorphic func- 
tions on T2 CD, D = Zo, OD... 9 Za,,) are 
called Abelian functions with periods D (or on 
T). The quotient of two theta functions having 
the same periods and the same transforma- 
tion formulas is an Abelian function, and con- 
versely any Abelian function can be written as 
such. All Abelian functions on T form a field 
C(T) called the Abelian function field. If T has 
the structure of an Abelian variety, then C(T) 
coincides with the field of rational functions on 
T. In general, for any complex torus T there 
are an Abelian variety T' (possibly of dim 0) 
and a surjective homomorphism 4: T T' 
which induces an isomorphism 4*:C(T^)z; 
C(T). Such a pair CT, 4: T2 T^) is unique up 

to isomorphism, and is called the *algebraic 
reduction of T. 


K. Homomorphisms 


Let C" — (R?", J) (i=1, 2) be complex linear 
spaces; an R-linear mapping f: R?": o R?": is 
C-linear if and only if the relation f oJ, = 

J o f holds. Let D; be tlattice groups of C" for 
i— 1, 2. If a C-linear mapping A: C: €» 
satisfies A(D,) c D,, then A induces a com- 
plex analytic homomorphism of T, = C":/D, 
to T; =C"/D,. Conversely, any complex 
analytic homomorphism of T, to T, is ob- 


tained in this way. Let T, and T, be Abelian 
varieties, and let Q, =(w”, ... , oS) and Q, 

— (o1? ..., 2) be their Riemann matrices. 
Then for a homomorphism 4: T, >T,, we 

can find a representation matrix W(A) with 
complex coefficients; and with respect to the 
real coordinate systems (c, ... af) and 

(wi, ..., 012), we can find a representation 
matrix M(A) with coefficients in Z such that 
W(A)Q, =Q, M(A). Conversely, if for a complex 
matrix W there is a matrix M, with coefficients 
in Z, satisfying the relation given above, then 
W gives a homomorphism of T, to T,. The 
above equation is called Hurwitz's relation. 
The notion of l-adic coordinate system, which 
is valid for a general characteristic, corre- 
sponds to that of the lattice group, and the l- 
adic representation M,(A) of 4 is the abstrac- 
tion of the integral representation M (2). 


L. Abelian Integrals 


Let R be a compact *Riemann surface of genus 
gzl(— 11 Algebraic Functions) and let w be 
a sum of tAbelian differentials of the first kind 
or of the second kind. Then the *period of œ 
along a cycle 7 depends only on the thomology 
class of y. The set of all differentials of the first 
kind forms a complex linear space of dimen- 
sion g; we denote it by Dy. Let P be a point 
and P, a fixed point of R; then we denote by 
u(P) the vector integral (fp c, , .... o 
where (@,,...,@,) is a basis of Dy and the path 
from P, to P is common to every integral. The 
correspondence P —u(P) is not a single-valued 
mapping; the totality of differences of values 
of u(P) coincides with the group D consisting 
of periods (foi. sls o. where y varies over 
all cycles. Let a set of cycles To, el bea 
basis of the homology group, with coefficients 
in Z; then 2g column vectors of the g x 2g 
matrix Q — (o), @; f, c; are linearly inde- 
pendent over R. Since the group D coincides 
with the set of linear combinations, with co- 
efficients in Z, of column vectors of Q, this D 
is a flattice of rank 2g, and the matrix Q is a 
period matrix of the complex torus T? = C?/D 
of dimension g. 

For a basis of the homology group with 
coefficients in Z, we take tnormal sections 
31, Dan Bai, X2g Of R, and let the same 
symbol Q stand for the period matrix (cj), 
w= Loo then we have 


J/ —1 QE'Q»0 


(positive definite Hermitian matrix), where 


QE'O=0, 


0 L , ; : 
E= I 0 (with J, the unit matrix). 


This implies that E is a principal matrix be- 
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longing to Q; we call the equality and inequal- 
ity just given Riemann’s period relation and 
Riemann’s period inequality, respectively. Fur- 
thermore, we can choose a suitable basis of Do 
so that the period matrix Q is of the form 

(4, F) with Fe S, (— Section I). We consider 
the function 9(u), u — (u,, ..., uj), defined by an 
infinite series 


9(u) - 3 exp(2x,/ —1 (m'u tr 3mF'm)), 


where the sum is taken over all row vectors 

m — (m,, ...,m,) with coefficients in Z. If u is in 
a bounded region, then the series for 9(u) is 
uniformly and absolutely convergent. Hence 
3(u) is a holomorphic function of u. This is 

a theta function corresponding to the princi- 


0 I 
pal matrix E -( i al and is called the 
"fg 


Riemann theta function. As 3(u) is nondegen- 
erate, the complex torus T? = C?/D has the 
structure of an Abelian variety. If we regard 
the Riemann surface 9R as an algebraic curve, 
this Abelian variety T? is precisely the Jaco- 
bian variety of the curve R. The collection of 
zeros of 9(u), a divisor on T*, defines the ca- 
nonical polarization on the Jacobian variety 
(— Section G). 

The correspondence P —u(P), pe induces 
a well-defined mapping o of R into T^ = 
C*/D. Moreover, if we set p(A)= 25, 9(P)— 
275-1 9(Qj for any divisor A =P, ...P/Q, ...Q, 
of degree 0, then (A) is a point on T? = CID. 
which is represented by the vector (27. fo, W, 
ELM Lach, of C*, and the mapping A> 
Q(A) is a homomorphism of the group (5, of 
divisors of degree 0 onto T? = C*/D. The kernel 
of this homomorphism coincides with the 
group 6, of *principal divisors (Abel's theo- 
rem). Hence a divisor A =P, ... P./Q, ... Q, of 
degree 0 is a divisor of some function if and 
only if we have Zi Lë z 0 (mod D) (i= 
1,...,g), or equivalently, the left-hand side is 
0 for a suitable path. 

Given g fixed points P,,..., P, on R and 
given (u,, ..., uj) as any vector of C*, the prob- 
lem of finding g points Q,, ..., Q, satisfying 
relations È$- (2/2 u; (mod D), i— 1, ...,g, is 
called Jacobi's inverse problem. To solve the 
problem, we take a divisor A of degree 0 such 
that the class (A) (mod D) is represented by 
(ui, .... ug); then, by virtue of the *Riemann- 
Roch theorem, there exists a divisor Q, ... Q, 
satisfying 


AzQ, ...Q,/P, ... P, (mod G)). 


Abel’s theorem implies that the set of points 
{Q;,...,Q,} is a solution of Jacobi's problem. 
Moreover, for general (u,,...,u,), the solution 
is unique; i.e., there exists a subvariety X of 
dimension g —2 on C such that the solution is 
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unique up to order if and only if (u,, ... ,u,) 
does not lie on X. In particular, if every point 
P, coincides with the fixed point P, that ap- 
peared in the definition of u(P), the subvariety 
on T? — C*/D determined by X is obtained in 
the following way: Let W,+...+W,,-, bea 
canonical divisor on *R, and put c=@(W,)+... 
t 9(W,,..5). Then the locus of points c — @(R,) 
~...—@(R,_2) (where g — 2 points R,,..., R, 
are taken independently over all points of 9i) is 
the desired subvariety. 

In terms of the theory of complex manifolds 
the above result can be rewritten as follows: Let 
SYR) be the symmetric product of R and X the 
subvariety of T? — C*/D induced from X. Then 
the holomorphic map $*(q): S*(9R) 5 T? induced 
from o:91— T? is bimeromorphic and isomor- 
phic outside that X whose elements corre- 
spond to effective divisor classes of degree g 
contained in canonical divisors on H. 


M. Elementary Abelian Functions 


Let z be a nonconstant meromorphic func- 
tion on 9i. Then for any u-(u,, ...,uj)e C? 
that does not lie on X, there exist g points 
Q,,...,Q, (uniquely determined up to order) as 
the solution of Jacobi's problem. Therefore 

the elementary symmetric functions 


g 
-> Salu; z)= | | 2(Q;) 
i<j j=l 
are well defined if u lies outside the variety ¥ 
of dimension g — 2. Each function s;(u; z), re- 
garded as a function of u, can be extended 
uniquely to an Abelian function in the whole 
space C*; the function is denoted by the same 
symbol s,(u; z). The Abelian functions s, (u; z), 

s,(u; z) are called the elementary Abelian 
functions obtained from z. 

Now let K be the field of Abelian functions 
on C"/D and k the field of meromorphic func- 
tions on 9t; then the dimension of K over C is 
g, and [K : C(s; (u; z), ...,s,(u; z))] ^r*, where r 
is the degree of the function z that is given by 
[k: C(z)]. Moreover, if we take any function w 
such that k= C(z, w), then we have 


K — C(s,(u;z), ..., (u; w), ..., 


S,(u; w)), 


where s; (u; w), ..., s, (u; w) are the elementary 
Abelian functions obtained from w. 

We can write any elementary Abelian func- 
tion as a rational function of Riemann theta 
functions; therefore any Abelian function can 
be written as a rational function of Riemann 
theta functions. Furthermore, if u and v are 
variable vectors, then s,(u+ v; z) can be repre- 


SJ z); Sy 
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sented as an algebraic function of s,(u; Z), ..., 
S,(U; z), sı (0; Z), Salt: z); i.e., we can choose 
a suitable polynomial H,(Z; X,,..., X; Y;, 

., Y,) with coefficients in C so that H,(s,(u+ 
v; Z); S1 (U; Z), ...5,(U; Z); S4 (0; Z), -..,5,(v;Z)) =9. 
This algebraic addition formula with respect 
to the elementary Abelian functions s,(u; z), i= 
1,...,g, is a function-theoretic interpretation 
of the fact that the addition map X x X > 
X :(x, y) ox + y is a tmorphism of algebraic 
varieties. 

As the study of *Abelian integrals of the first 
kind led us to the theory of Jacobian varieties, 
*Abelian integrals of the second and the third 
kind give rise to the theory of tgeneralized 
Jacobian varieties (— 9 Algebraic Curves). 

The theory of Abelian varieties has signifi- 
cant applications to number theory, as shown 
by the following examples: the theory of tun- 
ramified Abelian extensions with respect to a 
function field of several variables defined over 
a finite field (S. Lang), the theory of heights of 
points on an Abelian variety (Weil, A. Néron, 
J. Tate), and the theory of complex multipli- 
cation (— 73 Complex Multiplication) in the 
case of higher dimensions (Y. Taniyama, G. 
Shimura). 


N. Some Recent Results 


(1) Level Structure, Moduli of Abelian 
Varieties. Let A be an Abelian variety over k 
of dimension g and n a positive integer which 
the characteristic of k does not divide. A level 
n structure on A is defined to be a set of 2g 
points 96,,...,05, on A which form a basis for 
the group of points of order n on A. 

Let A(g, d, n; k) be the set of triples: (1) an 
Abelian variety A over k of dimension g, 
(ii) an inhomogeneous polarization X on A 
with v(o) — d?, and (iii) a level n structure 
Ci, ...,05, Of A, all up to isomorphisms. Simi- 
larly we can define A(g, d, n; S) for Abelian 
schemes over a scheme S. The correspondence 
S2 A(g, d, n; S) defines a functor ig, d, n). 
D. Mumford has shown that there exists 
the tcoarse moduli scheme A(g, d, n) quasi- 
projective over Spec(Z[1/n]), and that it is even 
fine if nz 3[6]. He used the technique of "Hu. 
bert schemes and 'stable points (— 16 Alge- 
braic Varieties). One of the key steps of his 
proof is to show that for an embedding d: A> 
P" of degree r De, the degree of (A) in P") 
over an algebraically closed field k and a posi- 
tive integer n such that char(k) does not divide 
n and n7 J/ (m * 1)r, the point éis we 
in (P")"* is stable with respect to the action 
of PGL(m), where the x; are the points of order 
n on A (with an arbitrary order). Mumford 
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later showed another method of constructing 
the moduli of polarized Abelian varieties by 
using algebraic theta constants [7]. 


(2) Néron Minimal Models, Good and Stable 
Reduction. Let R be a discrete valuation ring 
with residue field k and quotient field K. For 
an Abelian variety A over K, there exists a 
smooth group scheme .o of finite type over S 
= Spec(R), called the Néron minimal model of 
A, such that for every scheme A" smooth over S 
there is a canonical isomorphism 


Hom;(S', adi Homg(S;, A), 


where S, is the pullback of S’ by Spec(K)-^ 
Spec(R) (A. Néron, M. Raynaud). In particu- 
lar, we have £g = A. Denote by A, the fiber of 
W over the closed point of S. 

If . is proper over S, we say that A has a 
good reduction at R. If the connected compo- 
nent Aj of A, containing 0 has no unipotent 
radical (or equivalently, A8 is an extension of 
an Abelian variety by an "algebraic torus over 
a finite algebraic extension of K), we say that A 
has a stable reduction. If there is a finite sepa- 
rable extension K' of K with a prolongation R' 
of R to K' such that A x g K' has good (stable) 
reduction, we say that A has a potential good 
(stable) reduction at R. Let K, be a separable 
closure of K and R a prolongation of R to 
k For a prime number /#char(k), we have 
a canonical homomorphism p:Gal(K,/K) 
Aut((5,(A)), called a monodromy. Then A 
has potential good reduction if and only if the 
image of the finertia group I(R) by p is a finite 
group (J.-P. Serre and Tate). Every Abelian 
variety A over K has potential stable reduc- 
tion at R (stable reduction theorem, A. Gro- 
thendieck [16]). 


(3) Graded Ring of Theta Functions. If f is a 
theta function with a period system D, f" is 
also a theta function with the same period 
system for every positive n. We denote by S, 
the vector space of the theta functions with the 
period system D subject to the same trans- 
formation law as f". If we denote by X the 
effective divisor on T=C*/D defined by f, then 
S, can be naturally considered as the fdefining 
module of the *complete linear system of nX, 
and the dimension of S, ( 2 [(nX)) is equal to 
the product of the nonzero diagonal elements 
of nA (— Section I) (Frobenius). If ge S, and 
heS,, then gheS,,.,; hence S= @,505, isa 
*graded ring, which is normal and finitely 
generated. For m>2 and n È 3, the product 
map $, x $,— Ban is surjective (D. Mumford, 
S. Koizumi). If the elementary divisors of E 
can be divided by an integer 74, the kernel of 
the natural graded mapping S(S,)—S (where 
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S(S,) denotes the tsymmetric algebra over $,) 
is generated by the quadratic relations De. the 
part of degree 2) for sufficiently large degrees 
(Mumford). Geometrically this means that if 
X =(f), f €5,, is nondegenerate, then, with the 
projective embedding i: T5 P" defined by the 
complete linear system of X, i(T) is an inter- 
section of quadrics in P" containing i(T). 

Mumford has developed a theory of alge- 
braic theta functions that also works for the 
positive characteristic case [7] and has proved 
the above results in general. 
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A. General Remarks 


Let N be the set of natural numbers, and let A, 
B, ... be subsets of N. The sum C=A+ B is 
defined as the set (c|ce A, ce B, or c=a+b, 
ae A, be Bj. A finite sum of subsets of N is 
defined similarly. If N is the sum A+...+A 

(r times), then we say that A is a basis of order 
r in N. Let A(x) denote the number of integers 
in A that do not exceed x. The density of A 

is defined as inf A(x)/x. If A(n) z an, B(n)> 

pn (0 « o, B <1) for all ne N, then we have 
(A+ B)(n) 2 (min(1, «+ p))n. This result was 


stated by E. Landau (1937) without proof; A. J. 


Khinchin had given a proof (1932) for the case 
a=, and H. B. Mann (1942) and E. Artin and 
P. Scherk (1943) succeeded in proving the 
statement for the general case. Suppose that 
the densities of A and B are, respectively, x 
and O. If B is a basis of finite order in N, then 
the density of A + B is greater than that of A 
(Khinchin and P. Erdós, 1936). Let P be the 
set of all prime numbers. Though the density 
of P is 0, the density of P+ P is positive (L. G. 
Shnirel'man, 1930). Hence P 1s a basis of finite 
order in N; in other words, there exists a posi- 
tive integer r such that every natural number 
can be expressed as a sum of at most r primes. 
Though the density of the set Q of the kth 
powers of natural numbers is 0, there exists a 
positive integer s(k) such that the sum Q +... 
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4- Q (s(k) times) is a set of positive density. L. 
K. Hua (1956) gave a simple proof of this fact, 
based on Yu. V. Linnik's idea. It follows, 
therefore, that any natural number n can be 
expressed as a sum n=ai+...+a*, where 
a; € N, t 2 s(k). This result had already been 
shown by D. Hilbert (1909). 

An ancient method of finding prime num- 
bers is *Eratosthenes' sieve. V. Brun (1920) 
devised a new sieve method to express an 
arbitrary integer n as the sum of two integers 
n—à t b, where the number of prime factors 
of a and b is as small as possible. This method 
was improved by H. A. Rademacher (1924), 
Landau (1931), A. Bustab (1937), and others. 
Among these the method found by A. Selberg 
(1952) is notable (— 123 Distribution of Prime 
Numbers E). 


B. Farey Sequences 


Let 1 be a positive integer. We arrange in 
increasing order the set of all positive irreduc- 
ible fractions lying between 0 and 1 whose 
denominators do not exceed t. This sequence 
is called the Farey sequence of order x. For 
example, the Farey sequence of order 5 con- 
sists of 


Í 1 1 2 ł 
"45 b 3 5-0 
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A necessary and sufficient condition that a 
fraction a/b be directly followed by a fraction 
c/d in the Farey sequence of order n is b+d> 
n4- 1, bc — ad — 1. In this case the fraction 

(a 4- c)/(b +d) is called the mediant of a/b and 
c/d. Interpolating the Farey sequence of order 
n with such mediants (a + c)/(b + d) satisfying 
b+d=n-+1, we obtain the Farey sequence 

of order n+ 1. 

Let a/q be a fraction in the Farey sequence 
of order t, and a'/q', a"/q" be adjacent mem- 
bers of a/q in the sequence such that a'/q' < 
a/q « a" [q" .' The interval [(a' + a)/(q' +q), 

(a 4- a")/(q 4- q")] is known as the Farey arc 
surrounding a/q. In particular, if a/q —0/1, 
then we set [ — 1/n, 1/n] to be the Farey arc 
surrounding 0/1, where n=[t]4+1([ ] is the 
tGauss symbol). We can thus decompose the 
interval [—1/n, 1 — 1/n] into a disjoint union 
of Farey arcs. If « is contained in the Farey 
arc surrounding a/q, then |a — a/q| <1/qt. 
Therefore, for a given tÈ 1 and a real a, we 
can prove the existence of a/q such that 


(lal, q)— L, |a — a/q| « Y/qt. 


Let f(E)= X6 a,0" be a power series which 
is convergent in the disk |C| 1. Then, by 
Cauchy's integral formula, 


0cqxt, 
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To estimate such an integral, we sometimes 
utilize the decomposition of the interval [0, 1] 
into a disjoint union of Farey arcs as men- 
tioned above. This method is called the circle 
method, and the subdivision of the interval is 
known as the Farey dissection. 

Given a positive number c, a Farey arc 
around a/q is usually called a major are (or 
basic interval) if q does not exceed the given 
bound c; otherwise, it is called a minor arc (or 
supplementary interval). Usually, the principal 
part of the previously mentioned integral is 
derived from the integral over the major arcs, 
and the residual part is provided by the in- 
tegral over the minor arcs. 


C. Goldbach's Problem 


Goldbach's problem is found in letters (1742) 
he exchanged with L. Euler. In them he stated 
that every positive integer can be expressed as 
the sum of primes. More precisely, he conjec- 
tured that any even integer not smaller than 6 
can be expressed as the sum of two odd primes 
and that any odd integer not smaller than 9 
can be expressed as the sum of three odd 
primes. 

I. M. Vinogradov (1937) proved that every 
sufficiently large odd integer can be expressed 
as the sum of three primes. Let N be a suffi- 
ciently large odd integer. If we write 





A(q, N)= Ha KS exp( ix) 


3 
9 (q) \<a<q 
(a,q)=1 


then the series $(N) = 72, A(q, N) is ab- 
solutely convergent and is equal to 


1 1 

IL n= mae Se 
It is known that S(N)- 6/z? for all N. If we 
denote by r(N) the number of solutions of 
N —p, t p? * ps, then r(N) - (N?/2(log Ny): 
S(N). To prove this, Vinogradov used the cir- 
cle method. He employed the *prime number 
theorem for arithmetic progressions to esti- 
mate the integrals over the major arcs and 
devised an ingenious method to estimate the 
series X, , exp(2ziap) in the computation of 
the integrals on the minor arcs. A finite or 
infinite sum of exponential functions such as 
this is called a trigonometric sum. More gener- 
ally, we consider trigonometric sums of several 
variables. Vinogradov provided detailed re- 
marks and calculations [6]. 

In the case of even integers, the problem is 
still unsolved, although J. G. van der Corput, 
T. Estermann, and N. G. Chudanov proved 
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simultaneously (1938) that almost all even 
integers (i.e., except a set of density 0) can be 
expressed as the sum of two primes. For these 
problems, Linnik (1946) and Chudanov (1947) 
introduced function-theoretic methods. They 
obtained the density theorem concerning the 
zeros of L-series, and A. Zulauf (1952) con- 
tinued along the same lines. These methods 
had been suggested by G. H. Hardy and J. E. 
Littlewood (c. 1919), although they had as- 
sumed that the *extended Riemann hypothesis 
held. 

We denote by P; a number having at most j 
prime factors counted with their multiplicity. 
Using his sieve method, Brun (1919) proved 
that every sufficiently large even integer 2n can 
be expressed in the form 2n = P4 + P. Radema- 
cher (1924) improved this result and obtained 
2n = P; + P}. Applying his new sieve method to 
this problem, Selberg (1950) proved that 2n— 
P, + P}. On the other hand, using the "large 
sieve, A. Rényi proved that 2n — P, + P, for 
some k. Afterwards, combining Richert's sieve 
with.a large sieve, Chen Jing-Run (1973) 
proved that 2n = P, + P; for large 2n [11]. 


D. Polygonal Numbers 


Let m be an integer greater than 3, and let 
a,=1, a,.,-—a,=(m—2)n+1 (n=1,2,...). 
The sequence {a,} forms the system of polyg- 
onal numbers of order m. The general term 

of {a,} is given by n -3(m—2) (n? — n) (n= 
1,2, ...). Such a, are said to be triangular 
numbers if m = 3, square numbers if m — 4, and 
pentagonal numbers if m= 5. 

P. Fermat (1636) stated that every natural 
number can be expressed as the sum of m 
polygonal numbers of order m. This conjecture 
was proved by A. M. Legendre (1798) for m— 
3, by J. L. Lagrange (1772) for m—4, and by 
A. L. Cauchy (1813) for the general case. With 
regard to Lagrange's result, Legendre noticed 
that in order that a positive integer n be ex- 
pressed as the sum of three squares, it is neces- 
sary and sufficient that n not be of the form 
4^ (8m + 7). 

Given a positive integer n, the number of 
integral solutions of the equation xj +x3+... 
+x?2=n is denoted by r,(n). For example, r,(5) 
= 8. The tgenerating function Lz r;(n)n ? 
can be expressed as 3, ?- .. „(m° -- n?) ?, where 
the term corresponding to m=n=0 is omitted; 
the function is equal to AC, (s), where C,(s) is 
the *Dedekind zeta function of the Gaussian 
number field K — Q(/-1). The equation 
4Ck(s) — AC(s)L(s, x) (where x(n) 2(— 4/n)) leads 
to 
r(n-4»'(—0" 9, 


m|n 
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where >’ means the sum over all odd factors 
m of n. This result was obtained by C. G. J. 
Jacobi (1829). He also obtained the following 
formula: 


ra(n)=8 Y m, 


min. Am 


where >’ means the sum over all divisors m of 
n not divisible by 4 (Hardy and E. M. Wright, 
C. L. Siegel, 1964). Let q =exp(2zit) (Im x > 0), 
and 


fta) 14- Y ring" 
n=1 


=(1 +2q +244 +24? 4- ... y. 


Hardy (1920) considered the variation of f(q) 
for q=aexp(2zih/k) (with 0 —a < 1) as a1; he 
obtained 


S D 1 —s/2 
f(q) - e? KE log ( 
a 


where S, , 2 25-, exp(2zi(h/k)j*). Furthermore, 
he constructed the singular series 


p nisi?) -l oe 


Mnj-——— — V AL. E d 
; ro £^" ^ 
S A 
Ausck" Y (S reo ( Zei ‘ 
l&hxk i k 
(h,k}=1 


and showed that r,(n) = p,(n) for s= 5, 6, 7, and 
8. P. T. Bateman (1951) proved the same equa- 
tion for s=3 and 4. If 529 then r,(n)— p,(n) + 
O(n**) (Hardy, Littlewood, and S. Rama- 
nujan, c. 1919). The detailed exposition of this 
result is in the notes of A. Z. Walfisz (1952) 
and Rademacher [4]. H. D. Kloostermann 
(1926) and Estermann (1962) studied the equa- 
tion ax? + bx +cx3 c dx$ — n, which led to a 
new field of study concerning the *Klooster- 
mann sum. For instance, estimates such as 


p-i xtd 
2. exp (2x S " «2 /p 


are obtained by using the theory of zeta func- 
tions in algebraic function fields in one vari- 
able (A. Weil, 1948) (— 450 Zeta Functions). 











E. Waring's Problem 


The first formulation of Waring's problem is 
found in E. Waring, Meditationes algebraicae 
(1770), in which he discusses the problem of 
expressing an arbitrary positive integer as the 
sum of at most nine cubes or as the sum of at 
most nineteen biquadratics. Hilbert proved (— 
Section A) that there exists a positive integer 
s(k) such that, for any integer N, the equation 


xF E xš +... +x =N 
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has a nonnegative integral solution if s > s(K). 
We denote by g(k) the least value of s(k), and 
by G(k) the least value of s(k) for which the 
equation is solvable with at most finitely many 
exceptions of N. Research concerning g(k) and 
G(k) received its initial impetus from the circle 
method considered by Hardy and Littlewood, 
and it underwent considerable development in 
the works of H. Weyl and Vinogradov. 

Let r,(N) be the number of solutions of the 
above equation. We then have 


1 s 
um- | ( 2: oppe 
o \xsnilk 


x exp( — 2niNa) da. 


If we make the Farey dissection, translating 
the interval [0, 1] slightly, then the main term 
of r.(N) is provided by the integrals over major 
arcs, and the residual term is derived from the 
integrals over minor arcs. Let p be a prime, 
and let M(N, p!) denote the number of solu- 
tions of the congruence equation 


xE x5... xFz N (mod p’). 


Then lim, o M(N, p/p'^ ! — y (N) is not zero, 
and the infinite product [T, x, (N) 2 S(N) con- 
verges for sz: 4k, where S(N) is larger than a 
positive constant which is determined inde- 
pendently of the choice of N. On the other 
hand, let 


Eis a 
Slag) = Y. exp (2i ; 
d 


x=0 


A(qN)-q^ Y. S(a, ver za) 


Ixaxq 


(a,q)=1 


Then ZZ: A(q, N) is absolutely convergent, 
and the sum is equal to S(N). According to 
Hua (1959) we have 
AN) SIN) C sen 

I (s/k) 
provided that s>2k?(2logk -- loglog k +c). 
Next, if we denote by V(N, ô) the volume of the 
closed region satisfying N « x5 4- x5 o... 
xF <N +ô in the s-dimensional Euclidean 
space, then lim, — V(N, 6)/6 = y , (N) exists and 
is equal to (UL + t/K/F(s/K)) N**^!. Hence we 
can show that the principal part of r,(N) is 
equal to the infinite product IT. x,(N), where p 
runs over all finite and infinite prime spots in 
Q. This is a generalization of the singular 
series studied by Hardy. 

With regard to g(k), there are studies by L. 
E. Dickson (1936) and others. It is easy to see 
that g(k) > 2* + (3/2)* — 2 and that G(k) 2 k - 1. 
It has been shown that g(3)=9, G(3) «7 (G. L. 
Watson, Linnik, 1947) and that g(4) « 37, 
G(4)= 16 (H. Davenport, 1939). More gener- 
ally, Vinogradov (1959) proved that 
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G(k) € 2klogk--4kloglogk 


t2klogloglogk + ck. 


To prove this, Vinogradov (1934) introduced 
the following integral, which is closely related 
to the *prime number theorem: 


1 1 
ie [^| 
0 0 


Y, expOni(a, x - a, x? +... 


x«P 





2s 


+ 0%,x*)) 





Hua (1949) improved Vinogradov’s result and 
proved that if s>4k(k+1)+ Ik, then 


I(P) « (SsP*'(log Py?! P25~ KR + 12 +8, 


where 6 =4k(k + 1)(1—1/k)'"'. Concerning 
I(P), another notable approach was made by 
A. A. Karacuba and N. M. Korobov (1963). 
Further investigation proved that I(P)= 
Cres p?s kk +1)/2 + o(P?25 Mk*1y2) if s>ck? logk 
(Vinogradov, Hua, 1959). The result is called 
the Vinogradov mean value theorem. 

There are many variations and generaliza- 
tions of this theorem. Vinogradov and Hua 
(1944) studied the problem of representing an 
arbitrary N as N=p*+p3+...+p* (with p; 
prime). Hua (1937) and others also considered 
the problem of representing N as N = f(x,)+ 
f (x5) - ... - f(x), where f(x) is a given poly- 
nomial. Also, let 


n n n 
C(x1, X2, ...,X,)— 2, X. Y Cj X; XjXx 


j=1 k=1 


ll 
= 
Ki 
li 


be a homogeneous polynomial of degree 3 
with integral coefficients. Davenport (1963) 
proved that if nz 16, then the equation 

C(x,, X5, ..., X,) - 0 has at least one nontrivial 
integral solution. There are further develop- 
ments of the theory of representations of in- 
tegers by forms in many variables by V. A. 
Tartakovskii, H. Davenport, B. J. Birch, and 
D. J. Lewis. 


F. Additive Problems in an Algebraic Number 
Field 


Siegel (1922) considered the generalization of 
Hardy's square sum problem to the case of an 
algebraic number field. He later (1945) studied 
the generalized Waring's problem in an alge- 
braic number field K of finite degree: Let I 

be the principal order of K, and let J, be the 
subring of I generated by the kth powers of 
integers in K. It is easily seen that the index 

(I: J,) is finite. Hence our concern regarding 
s(k) must be restricted to integers contained in 
J,. Another question is how to extend the 
Farey dissection to an algebraic number field. 
Siegel succeeded in solving these difficulties. 


da, ... day. 
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His ingenuity is seen in his way of dealing with 
the minor arcs, which provided a stimulus to 
the research of T. Mitsui (1960). 

A generalization of Goldbach’s problem to 
the case of an algebraic number field was 
obtained by Mitsui (1960) and O. Korner 
(1961). 

As another extension of the Vinogradov 
three primes theorem, Mitsui (1971) proved 
the following theorem: Let K be an algebraic 
number field of degree n. Let C be the prin- 
cipal ideal class generated by a totally positive 
number in K, and P be the set of prime ideals 
of degree 1 contained in C. Let N be a positive 
integer and J,(N) be the number of representa- 
tions of N as the sum of the norms of s prime 
ideals belonging to P, 

L(N)- Kë 1, peP (Ixixs) 
N=Np,+...+Nb, 

If N is sufficiently large and s>3, we have the 

asymptotic formula 


Ns} _, loglog N 
+0( NS 1— —— |, 

(log Ny ( (log N ei 
where A, is a positive constant depending on s 
and K independent of N, and S(N) denotes the 
singular series. If sz N (mod 2D), where D is 
the discriminant of K, then S(N) 2 c» 0, where 
c is a constant. Later this problem was ex- 
tended by Mitsui and T. Tatuzawa (1981). 


I(N) — A,S(N) 
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A. General Remarks 


The study of sums of tindependent random 
variables has been one of the main topics in 
modern probability theory (— 250 Limit 
Theorems in Probability Theory). Combin- 
ing the ideas of this study with the considera- 
tion of tstochastic processes with continuous 
time parameter, we get the notion of additive 
processes. 


B. Definitions and Fundamental Properties 


A real-valued tstochastic process {X;}o <:<u> 
denoted by X (t) (0 € t « oc) for the rest of this 
article, where for simplicity we assume that 
X (0) 20, is called an additive process (or pro- 
cess with independent increments) if for any t, < 
ty<...<t,, Xt) - X(t) (= 1,2, ..., n) are 
*independent. An additive process is essentially 
the same as a 'spatially homogeneous tMar- 
kov process De, a Markov process on R! that 
is invariant under translations). When, for any 
h»0 and t»5s, X(t-- h) - X(s-- h) and X(t)— 
X (s) have the same law, i.e., the distribution 
law of X (t) — X(s) depends only on t — s, we 
call the additive process X (t) temporally homo- 
geneous. This is essentially the same notion as 
a temporally and spatially homogeneous Mar- 
kov process. 

Let X (t) be a given additive process. If f(t) is 
a function of t only, then clearly Y(t) 2 X(t) — 
f (t) is also an additive process, and we can 
choose f(t) in such a way that for every t0 
and for every sequence s,1t (s, |t), Y(s,) con- 
verges with probability one. Here, lim Vis, 
is independent of a particular choice of s,,, 
and we denote it by Y(t —) (Y(t+)). We call 
such Y(t) a central process and also say that 
Y(t) is obtained from X(t) by centering. This 
f(t) is given, for example, by the condition 
E(arctan(X (t) — f(t))) 2 0. 


16 


Let Y(t) be a centered additive process. 
Then Y(t —) — Y (t) — Y(t+) for all t>0, except 
on an at most countable t-set S, and ren is 
called a fixed point of discontinuity of Y (t). 
Then Y; (t) «lim, ,, U,(t) exists with proba- 
bility 1, where 
Uz 3 (YGi*)- Y(;-) 


O<i<n 
S;<t 


+ ¥(t)—¥(t—)—G, 


now 


C? being a constant determined by E(arctan 
U,(0)) and S={s,} (j=1,2,...). Y(O= 
Y(t)— Y, (t) is a centered additive process 
without any fixed point of discontinuity. Fur- 
thermore, Y, (t) and Y;(t) are independent. 
Thus we have a decomposition of Y(t): Y(t) 
Y, (t) - Y (t), where Y, (t) and Y;(t) are mutu- 
ally independent additive processes. The struc- 
ture of Y, (t) is simple, and it is not worthwhile 
to study its behavior in more detail. On the 
other hand, since Y; (t) is a centered additive 
process without any fixed point of discontinu- 
ity, it is an additive process that is *continuous 
in probability. Let Y, (t) be a *separable modi- 
fication of Y; (t). Then the discontinuities of 
almost all sample functions of Y,(t) are of the 
first kind. If we set Y*(t)= RULL Y*(t) isa 
*modification of Y,(t), and almost all sample 
functions of Y*(t) are right continuous and 
have a left-hand limit at every t. In the study 
of the process Y,(t), it is always convenient to 
take such a modification. Thus we give the 
following general definition: An additive pro- 
cess is called a Lévy process if it is continuous 
in probability and almost all sample functions 
are right continuous and have a left-hand limit 
at every te[0, oo) [3,9]. 

The notions of additive processes and Lévy 
processes can also be considered for R'-valued 
processes. 


C. Additive Processes and Infinitely Divisible 
Distributions 


Let X (t) be a Lévy process and ®,, (s « t) be 
the fdistribution of X (t) — X (s). Then ®,, is an 
‘infinitely divisible distribution {(— 341 Proba- 
bility Measures G). Conversely, for a given 
infinitely divisible distribution we can con- 
struct an essentially unique temporally homo- 
geneous Lévy process X (t) such that ® coin- 
cides with the distribution of X(1). If X(t) is 
temporally homogeneous, the ‘characteristic 
function @,,(z)= E(e'* X9 X9») of the distri- 
bution ®,, is given in the form o, — exp((t — 
s)y (z)); hence the law of the process X (t) is 
completely determined by the function w(z). 
By the *Lévy-K hinchin canonical form, y(z) is 
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written in the form 
W(z)= imz— 7 di (e —1- 2 nito 
(1) 


where m, ve R, v>0, and n(du) is a nonnegative 





"OG 


20 u? 


+u? 





measure on R — {0} such that | n(du) 


« oo. These m, v, and n(du) are uniquely deter- 
mined by w(z). 


D. Basic Additive Processes 


Wiener Process. When almost all sample func- 
tions of a Lévy process X(t) are continuous, 
the distribution of X (t) — X (s) is a *normal 
distribution. If, further, X (t) is temporally 
homogeneous, (z) has the form y(z) 2 imz — 
lvz?. In particular, if m=0 and v=1, then 

X (t) is called a Wiener process or Brownian 
motion. This stochastic process was introduced 
by N. Wiener (1923) as a mathematical model 
for the random movement of colloidal par- 
ticles first observed by a British botanist, R. 
Brown. This is one of the most fundamental 
and important stochastic processes in modern 
probability theory (— 45 Brownian Motion). 


Poisson Processes. When almost all sample 
functions of a Lévy process are increasing step 
functions with only jumps of size 1, the distri- 
bution of X (t) — X(s) is a Poisson distribution. 
If, further, X (t) is temporally homogeneous, 

V (z) in (1) has the form y(z) 2 A(e** — 1) (4 0), 
and X(t) is called a Poisson process. Let X (t) 
be a Poisson process, and let Tj, T+ T, To + 
T, + T, ... be successive jumping times of a 
sample function X (t). Then 7, Ti, T5, ... is 

a sequence of mutually independent ran- 
dom variables with the common exponential 
distribution P(Tedt)- 4e ?' dt. Conversely, 
given such a sequence {7,}, if we define X (t) = 
inf(n| Ty 4- T, t... T,» t], then X(t) isa 
Poisson process. Thus, for example, the num- 
ber of telephone calls at a switchboard is a 
Poisson process when the intervals between 
successive calls can be regarded as inde- 
pendent and having a common exponential 
distribution. 


E. The Structure of the General Lévy Process 
[8,9] 


In this section we restrict ourselves for sim- 
plicity to temporally homogeneous Lévy pro- 
cesses. As we noted, the probability law of the 
process X (t) is determined by the function 

y (z). The Lévy-Khinchin formula (1) in a 
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certain sense shows that y is a combination of 
a Wiener process and Poisson processes. This 
fact can be seen more clearly from the Lévy-Itó 
theorem, which states that the sample function 
of X (t) itself can be expressed as a composite 
of those of a Wiener process and Poisson 
processes. The Lévy-Itó theorem actually 
implies formula (1) and, moreover, clarifies its 
probabilistic meaning. 

The Lévy-Itó theorem can be summarized 
as follows: Let U be a Borel subset of R which 
has a positive distance from the origin, and let 
N (t, U) be the number of s such that X(s)— 
X(s—)eU,sxt. Then N(t, U) is a Poisson 
process. The expectation E(N (t, U)) can be 
written in the form tn(U), where n(U) defines 
a nonnegative Borel measure on R — {0}; fur- 
thermore, it satisfies 


o0 u? 
| Ear se 


Next, we set 


| uN(t, du) 
lu e 
» (X()-X(s—-). 


sst 
|X(s)—X(s—)i ze 


Jil 


I 


Generally, S,(t) diverges as ¢]0. However, with 
probability one, a centered process 


x u 

S, (t) = S,.(t)— INE 

converges uniformly in t on every finite inter- 
val as £|O. Furthermore, X (t) — lim, e S,(t) is 
continuous with probability one. However, a 
Lévy process X (t) of which almost all sample 
functions are continuous has the form mt + ./v 
B(t), where m, v>0 are constants and B(t) is a 
Wiener process. Hence we have 


X(t)=mt+./v B(t) 


elo 


sim (uve nu Zem) (2) 
lu|>e 


Furthermore, we can show that if 
TW CNET Ce 

are disjoint, then 

B(t), N (t, Uj), N(t, Uz), ..., N(t, U,) 


are mutually independent Lévy processes. In 
particular, in (2) the terms are mutually inde- 
pendent. The m, v, and n(du) in (2) correspond, 
of course, to those in (1). Conversely, given m, 
v, and n(du), we can construct B(t) and N (t, U) 
with the properties above, and then (2) defines 
a Lévy process which corresponds to v(z) 
given by (1). The measure n(du) is called the 
Lévy measure of X (t). 
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F. Examples of Lévy Processes 


Compound Poisson Processes. A temporally 
homogeneous Lévy process is called a com- 
pound Poisson process 1f almost all sample 
functions are step functions, namely, if v(z) in 
(1) is given by 


y= | ^ (e — Dn(du) 


—00 


A zl n(du) « oc. 
If we set D(du)=(1/A)n(du), then (du) is a 
probability distribution on R. 6 is the distri- 
bution of the size of jumps when they occur. 
A compound Poisson process is constructed 
in the following way: Let To, Tj, T,,...; U,, 
U,, ... be mutually independent random vari- 
ables such that 


P(T,edt) Ae "dt, t>0, 


P(U,€du)= o (du), 

and let 

N (t) 2 U, +U, +... + Unos 
where 

n —inf(n| To - T, +... - T; t]. 


Then X (t) is a compound Poisson process. 
Thus the number of jumps of X (t) follows a 
Poisson process, and the size of each jump 
obeys the distribution ®. 


Stable Processes. A temporally homogeneous 
Lévy process X (t) is called a stable process if 
for every a 0 we can find b — 0 and c real 
such that the processes X, (t) 2 X (at) and X, (t) 
=hX(t)+ ct are equivalent in law. It is called a 
strictly stable process if in the above c can be 
chosen to be 0. X (t) is a stable process (resp. a 
strictly stable process) if and only if the corre- 
sponding infinitely divisible distribution is a 
*quasistable distribution (resp. tstable distri- 
bution). The *exponent a (0a x 2) of the 
quasistable distribution is called the exponent 
(or index) of the stable process. y/(z) in (1) cor- 
responding to a stable process is given as 
follows: 





e ` d 
ver inzec, | (e — 1) 
ü u 
o 
| d 
«c. (e s TEE 
ES vi 


UP ec izu \du 
=imz+C iu. Wego | 
y(z) — imz Sr ( us 


LC Q T izu N du 
Sas. 1+u? Jul?’ 
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a=2. 
Here m, C}, C , and v are real constants such 
that C, 20, C. 20 with C, +C_>Oand v» 0. 
V (z) corresponds to a strictly stable process if 
and only if m=0 when «41 and C, =C_ 
when «= 1. The above v (z) is also expressed as 
follows: 

(i) ifO0<a<2, 741, then 


V(z) 2 imz — cg|zl* 1 —ifitan s — 
2 |z| 


where 


and 
TX 
€9— —(C, uic 


PS od 


Pe C 


if «z2; 


(ti) ifa 2 1, then 


n |z] 


pel) +126 ogiz); 


where 


y=m+a(C,+C_) 


When «= 2, it is thus essentially a Wiener 
process: X(t)=mt+/v B(t), where B(t) is a 
Wiener process. When a=1, it is called a 
Cauchy process, a symmetric Cauchy process if 
m-U and C,=C_ or equivalently y= f —0, or 
an asymmetric Cauchy process if f 40. Gener- 
ally it is called a symmetric stable process if 
m —0 and C, —C. or a=2 and m=0. In par- 
ticular, for the symmetric Cauchy process cor- 
responding to v(z)— — |z|, we have 


pe 4 
Potes | mad 
Next, when 0 «a «1 and m=C_ —0, almost 
all sample functions of X (t) are purely dis- 
continuous increasing functions (e. sums of 
positive jumps). In this case, X(t) is called a 
one-sided stable process of the exponent « (or 
subordinator of the exponent x). Now, if X(t) is 
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a symmetric stable process of the exponent 

p (0« B x 2) and Y(t) is a subordinator of the 
exponent « which is independent of X (t), then 
Z(t) — X(Y(t)) is a symmetric stable process 
of the exponent af. This operation is called 

a subordination and is closely related to the 
theory of tfractional powers of tinfinitesimal 
generators of semigroups (— 261 Markov 
Processes) [2]. 

A stable process X (t) is defined in a simi- 
lar way when X(t) takes values in an N- 
dimensional space RI In particular, if X (t) is 
a symmetric stable process of the exponent « 
(0 « «x 2) given by 


E(ei* Xe») = g AT ze R^, 


then for every bounded measurable function 
f(x) with compact support, we have 


dl fos X(t) 


V((N —0)/2) 


= — yN 
2* n"? [ (a/2) ef? yl fy) dy. 


The right-hand side is the *Riesz potential of 
order a (— 338 Potential Theory). This fact 
is a generalization of a well-known rela- 

tion between Brownian motion and Newton- 
ian potential (— 45 Brownian Motion), and 
through this relation we can study several 
properties of sample functions and also com- 
pute various quantities related to stable pro- 
cesses [1]. 


G. Sample Path Properties of Lévy Processes 


Let X(t) be a temporally homogeneous Lévy 
process. For a Borel set B in R, the hitting 
time og is defined by 


8g —inf(t» 0| X(t)e B]. 


Recurrence. X (t) is called recurrent if op < oo 
a.s. for every nonempty open set B. Otherwise 
it is called transient. X(t) is recurrent if and 

1 
only OR EE 
Ornstein). If, in particular, E(X(1)) exists, it 1s 
recurrent if and only if E(X(1)) 20. When X (t) 
is a stable process, it is recurrent if and only if 
a>1,ora=1 and fj —0. 





dz — oo (Port and Stone, 


Hitting Probabilities for Single Points. If B= 
{a}, ge is denoted by og, OER is said to be 
regular for X(t) ifa, =O a.s. Set C={xeR| 
P(c, « o0) » 0). The following result is due to 
Kesten [10]: 

(i) If v 40, then C — R and 0 is regular; 

(ii) If v=0 and f+, |u|n(du) = oo, then either C 
=R and O is regular, or C= Ø and 0 is not 
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1 
regular according to whether | Re———— dz 
: Deag 


« oo for all 2» 0 or = oo for some 4» 0; 
(iii) If v 20 and f? , |u|n(du) < oo, then the 
following four cases are possible, where 
a=m-— (gu(1-4-u?) ! n(du) and S(n) is the sup- 
port of the Lévy measure n: 
(a) C 2 Ø when a=0, 
(b) C 2(0, oo) when a>0 and S(n) c (0, oo), 
(c) C 2(—00,0) when a «O0 and S(n) c ( —o0,0), 
and 
(d) C=R in the remaining case. 

For further properties of sample functions 


^ [4]. 


H. Generalization of Additive Processes 


A temporally homogeneous Lévy process is, as 
we have seen, essentially a temporally homo- 
geneous Markov process on R which is homo- 
geneous in space (i.e., invariant under transla- 
tions of the space). Thus, on a homogeneous 
space when homogeneity in space makes sense, 
we can generalize the notion of additive pro- 
cesses. Let M be a thomogeneous space with 
transformation group G. A temporally homo- 
geneous Markov process X (t) is called an 
invariant Markov process, (or homogeneous 
Markov process) if its system of ttransition 
probabilities ( P(t, x, E)} satisfies P(t, x, E) = 
P(t, gx, gE) for all ge G. Thus an additive pro- 
cess is exactly an invariant Markov process 
on R“ when G is the group of translations. G. 
A. Hunt determined all invariant Markov 
processes when M is a ?Lie group or a factor 
space of a Lie group [6]. 

Let G be a Lie group and A — A(G) be the 
(left-invariant) "Lie algebra of G. Let G, = 
GU {a} be a tone-point compactification of 
G, and C be the set of all continuous functions 
on G,. We can define Yf(YeEA, f eC) as usual by 


n(t) - exptY, R,f(x) — f(xo), 


when the limit exists uniformly. Let C; — 

(f € C| Y(Xf) exists for every X, Ye A]. Let 
Xis X5, ..., X, be a basis of A(G), and let x,, 
X5, ..., X, be functions in C, such that x,(e)= 
0 and X;(xj(e) 2; (5j — 1,2, ...,d; e is the 
unit element of G). Take a neighborhood of e, 
and define a function q(g) — È$; x2(g) for g 
contained in the neighborhood, and extend 
this function to G, in such a way that o eC, 
and oz k» 0 (k- constant) outside of the 
neighborhood of e. Then ge G defines a trans- 
formation of G, by 1,0 — go, t,@ =a, and in 
this way G, is supplied with the structure of a 
thomogeneous space with the transformation 
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group G. Now let X(t) be an invariant Markov 
process on G, which is *continuous in proba- 
bility. Then the tsemigroup 7; (which is a 
*strongly continuous semigroup on C) of the 
process X(t) is characterized as follows: The 
domain of the infinitesimal generator A of T, 
contains C,, and for f € C; 


ARD XN+ Y, ayX OG) 


J= 


+ | d (to) — f(t) 
Giel 
Së xo) do 


where aj, a; are real numbers (i, j= 1,2, ...,d) 
such that (a; is a symmetric nonnegative 
definite matrix, and n(do) is a nonnegative 
measure on G, — (ej such that fe,- te; o(o). 
n(do) « oo. Conversely, given such aj, a; and 
n(do), there exists one and only one invariant 
Markov process on G, whose infinitesimal 
generator is given as above. 

A similar result is obtained when M isa 
factor space of a Lie group by its compact 
subgroup. Furthermore, for more concrete 
homogeneous spaces such as spheres or Loba- 
chevskii spaces (more generally, tsymmetric 
Riemannian spaces) the canonical form of the 
invariant Markov processes and infinitely 
divisible laws is obtained by making use of 
harmonic analysis [5]. 
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A. Introduction 


The concept of idele was first introduced by C. 
Chevalley (J. Math. Pures Appl., (9) 15 (1936); 
Ann. Math., (2) 41 (1940)), for talgebraic num- 
ber fields. Later on, this concept and the allied 
concept of adele were defined for fsimple alge- 
bras and also for falgebraic groups over 
algebraic number fields, and the two concepts 
became important in the arithmetical theory of 
these objects. We shall first explain the general 
concept of restricted direct product, by means 
of which adeles and ideles will be defined. 


B. Restricted Direct Product 


Let I be an index set. Suppose we are given, 
for each pel, a locally compact group G,, and 
for each p except for a given finite set, say 

Pis Dä, Pr à compact open subgroup U, of 
G,. Let G be the subgroup of the direct prod- 
uct Į I,e; G, consisting of elements (g,) whose 
G,-components g, lie in U,, except for a finite 
number of p. Put U =I Li- G, x I, ,,, Up- 
Then U is a locally compact group with re- 
spect to the fproduct topology. The group G 
can be supplied naturally with a topology with 
respect to which G is a locally compact group 
and the quotient space G/U is discrete. The 
group G together with this topology is called 
the restricted direct product of {G,} with re- 
spect to {U,}. 


C. Adeles and Ideles 


Let k be an talgebraic number field of finite 
degree and / be the totality of finite and in- 
finite tprime divisors of k. For each pel we 
denote by k, and k,* the fcompletion of k with 
respect to p and the multiplicative group of 
nonzero elements of k,, respectively. Further- 
more, for each finite prime divisor p, we de- 
note by o, and u, the ring of *p-adic integers 
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of k and the multiplicative group of *units of 
D,, respectively. 

(1) Since o, is a compact open subgroup of 
k, as an additive group, we can construct the 
restricted direct product A, of {k,} with re- 
spect to {o,}. Then A, is a locally compact ring 
with respect to the componentwise ring oper- 
ations. We call A, the adele ring (or ring of 
valuation vectors) of k, and an element of A, an 
adele (or valuation vector) of k. The element of 
the direct product TL, k, whose p-component 
is a fixed element of k for all p is an adele. We 
call such an adele a principal adele. Since u, is a 
compact open subgroup of k," for each finite 
prime p, we can construct the restricted direct 
product J, of {k,*} with respect to {u,}. We 
call J, the idele group of k and an element of J, 
an idele of k. The element of the direct product 
II k,” whose p-component is a fixed element 
of k for all p is an idele. We call such an idele a 
principal idele. Each element b of J, induces an 
automorphism f, of the additive group A, 
defined by f;(a) 2 b: a (ac A,). Thus J, can be 
regarded as a subgroup of the automorphism 
group Aut(A,) of the additive group A,. The 
topology of J, coincides with the relative 
topology of J, as a subgroup of Aut(A,). We 
note, however, that the topology of J, is dif- 
ferent from the relative topology of J, asa 
subspace of A,, and the former is stronger 
than the latter. Finally, for a tfunction field 
in one variable over a *finite field, the adele 
ring and the idele group can be defined 
similarly. 

(2) Let R be a tnormal simple algebra over k 
and O be a *maximal order of 9i. For each 
pel put 91, — 91 Q ,k,, and for each finite 
prime divisor p put O,—0,: D. Then ©, is a 
compact open additive subgroup of ®,. By the 
adele ring A4 of R we mean the restricted 
direct product of {,} with respect to (O,j. 
Let R,” and U, be the multiplicative group of 
nonzero divisors of R, and the multiplicative 
group of the units of O,, respectively. (W, can 
be defined only if p is a finite prime divisor.) 
By the idele group Jy of R we mean the re- 
stricted direct product of (3R," 1 with respect to 
(15). The notion of principal adele (or idele) of 
R can be defined similarly, as in (1). The struc- 
tures, as topological groups, of Ay and Jg do 
not depend on the choice of a maximal order 
©. The adele ring A, and the idele group J, 
described in (1) are special cases of Ay and Jy, 
respectively. 

(3) Let G be a linear talgebraic group de- 
fined over k, and let G, be the set of k,- 
‘rational points of the group for each pel. For 
each finite prime divisor p, let U, be the set of 
elements « of G, such that the coordinates of 
both a and a are p-adic integers. We can 
then construct the restricted direct product of 
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{G,} with respect to {U,}, which is called the 
idele group (or adele group) of G. 

In the following section we focus on describ- 
ing the fundamental properties of adeles and 
ideles of an algebraic number field k. We shall 
start, however, by observing more generally 
those adeles and ideles of a normal simple 
algebra R over k. (For the properties of the 
adele group of algebraic groups — [7]; 13 
Algebraic Groups.) 


D. The Structures of the Adele Ring and Idele 
Group 


Let R be a normal simple algebra over an 
algebraic number field of finite degree k. We 
identify the totality of principal adeles of R 
(principal ideles of R) with R (R*), and denote 
it by the same letter R (91 "). Then R (R*) isa 
discrete subgroup of Ax (J4). The quotient 
group AAR is compact. Denoting by |2,|, 

(x, €k,) and N,(«,) (x, € R,) the tnormalized 
valuation of k, and the treduced norm from R, 
to k, respectively, we define, for aE Jy, a posi- 
tive number: V(a) 2 II,.;| N,(,)],, where a= 
(x). We call V(a) the volume of a. If a is a prin- 
cipal idele, we have V(a) 2 1 by the fproduct 
formula on valuations. Denote by A (he set 
of ideles a with V(a)=1 and put CH 2 J9/9*. 
Then C$ has finite volume with respect to 

the *Haar measure of Jy. Furthermore, C$ is 
compact if and only if 9i is a tdivision algebra. 
In particular, C? is compact. Let Q be the field 
of rational numbers. For each rational prime 
p, we define a character 4, of the completion 
Q, of Q with respect to the p-adic topology 
(by a character of Q,, we mean a continuous 
homomorphism from Q, to the 1-dimensional 
torus R/Z): If p=p,, is the infinite prime of Q, 
then we put Ap, X) —xmodZ (xeQ,). If p is 
finite, then we let 4, be the composite of the 
following three canonical homomorphisms, 
namely, the one from Q, to Q,/Z,, the one 
from Q,/Z, to Q/Z, and the one from Q/Z to 
R/Z. We define a character 4, of R, as follows: 
À, — AO tr(91,/Q,), where p is the rational 
prime divisible by p and tr(91,/Q,) denotes the 
treduced trace from R, to Q,. For x, ye ,, 
put (x, y), 2 expQ iA, (xy)). Then the additive 
group R, is self-dual relative to (x, y),. Fur- 
thermore, if we put (a, b» = TI, (a, bp), for 

a —(a,) and b=(b,)e Ag then A, is self-dual 
relative to (a, b». The tannihilator of the 
group of principal adeles with respect to (a, b> 
is R. Hence it follows from *Pontryagin’s 
duality theorem that A,,/ is compact. Hence- 
forth let R=k. We call the quotient group 

C, — J,/k* (an element of C,) the idele class 
group of k (an idele class). If a character x of J, 
satisfies the condition y(a)- 1 for all xek (i.e., 
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if y is a character of C,), we call such a charac- 
ter a Gróssencharakter (or Hecke character). 
Gróssencharakters were introduced by E. 
Hecke as characters of a certain type of the 
"deal group of k (Math. Z., 1 (1918), 5 (1920)), 
but they are essentially the same as the ones 
defined above [1]. Let D, be the connected 
component of the identity element of C,. Then 
C,/D, is totally disconnected and compact. 
Hence a Gróssencharakter y is of finite order if 
and only if 7(D,)= 1. We can prove by !class 
field theory that C,/D, is canonically isomor- 
phic to the Galois group over k of the maxi- 
mal Abelian extension of k (— 59 Class Field 
Theory). For the structure of D,, the following 
fact is known: Let r, and r, be the number of 
treal infinite prime divisors and timaginary 
infinite prime divisors of k, respectively. Then 
the dual group of D, is isomorphic to R x 

Q" tr"! xZ^, where R is the additive group of 
real numbers with the usual topology and 

Q (Z) is the additive group of rational numbers 
(rational integers) with the discrete topology. 
Let F be a function field in one variable over a 
finite field Fo. The properties of the adele ring 
and idele group of F are similar to the prop- 
erties of A, and J,, while the group Cp has 

a simpler structure than C,. To explain the 
structure of Cp, let F be the maximal Abelian 
extension of F, G be the Galois group of F/F, 
and Gp be the subgroup of G consisting of the 
elements c such that a(a) 2a" for all zech, 
(=the talgebraic closure of Fo), where q is the 
number of elements of the finite field Fy and n 
is a given rational integer. Also, let G? be the 
subgroup of G; consisting of the elements 
inducing the identity map on F,. G? is a com- 
pact group with respect to the tKrull topology. 
G, can be naturally supplied with a topology 
such that the group Gr is a locally compact 
group and the quotient group G,/G$ is dis- 
crete. Then class field theory implies that Gp is 
isomorphic to C, as a topological group. 

The following characterization of the adele 
ring of a number field or function field in one 
variable over a finite field is the work of K. 
Iwasawa (Ann. Math., (2) 57 (1953)). Let A bea 
tsemisimple commutative and locally compact 
topological ring with unity 1. Assume that A is 
neither discrete nor compact, and moreover 
that A contains a discrete subfield kal and 
A/k is compact. Then k is an algebraic number 
field or a function field in one variable over a 
finite field, and A is isomorphic to the adele 
ring of k as a topological ring. 


E. Ideles and Cohomology 


Let K be a Galois extension of finite degree of 
an algebraic number field k, and © be the 
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Galois group of the extension K/k. © operates 
naturally on the idele group J, and the idele 
class group C; of K. The structures of the 
*cohomology groups of © with the coefficient 
groups J, and C, were investigated by G. 
Hochschild, T. Nakayama, E. Artin, J. Tate, 
and others. In particular, we have H'(G, Cx)= 
{0} and H?(G, Co) = Z/nZ (cyclic group of 
order n), where n2 [K:Kk]. These facts play an 
important role in one of the proofs of class 
field theory (— [3]; 59 Class Field Theory). 
Furthermore, A. Weil introduced the so-called 
Weil group, which is a *group extension of a 
certain type of Ce by ©. He defined the most 
general L-functions, which include both 
tArtin L-functions and "Hecke L-functions 
with Gróssencharakters (— [2]; 450 Zeta 
Functions). 


F. Fourier Analysis on the Adele Group 


*Dedekind zeta functions and Hecke L- 
functions are *meromorphic on the whole 
complex plane and satisfy functional equa- 
tions of certain types. This can be proved by 
methods of Fourier analysis on the adele 
group A, (Artin, Iwasawa, Tate [1,8,9]). Fora 
continuous complex-valued function g(a) on 
A, Satisfying suitable conditions, we define the 
Fourier transform of g(a) as follows: 


=| p(a)<a, b> db, 
Ak 


where db denotes the Haar measure on A,. 
By normalizing db suitably and applying 
*Poisson’s summation formula, we get, for 
each idele a of k, 

3. o(ax) - V(a) ! *  ó(a Tal 

ack ack 

This is called the O-formula. Consider the 
following integral on J,: 


«=| V(aFx(a)o(a)d*a, 
Jk 


where d*a denotes the Haar measure on J}, 
sis a complex number, and y is a Grössen- 
charakter of k, namely, a character of C,. This 
integral converges if s> 1, and by using the 
O-formula one can show that £(s) is meromor- 
phic on the whole complex plane and satis- 
fies a functional equation of a certain type. 
When the function q is of special type, then 
the above integral can be explicitly expressed 
as the product of L-functions, I -functions, and 
exponential functions. This method of express- 
ing L-functions by integrals on J, and apply- 
ing the O-formula can be applied to investi- 
gate tHey zeta functions and L-functions of 
various types defined for a simple algebra (— 
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450 Zeta Functions) (G. Fujisaki [6]; T. Tama- 
gawa, Ann. Math., 77 (1963)). 
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A. Construction of Affine Spaces 


An affine space A is constructed as follows: Let 
V be a tvector space over a ffield K, and let A 
be a nonempty set. For any vector ae V and 
any element p of A, suppose that an addition p 
-- ac A is defined so as to satisfy the following 
three conditions: (1) p 4- 0 — p (0 being a zero 
vector); (ii) (p +a) +b=p+(a+b) (a; be V); and 
(iii) for any qe A there exists a unique vector 
ac V such that q— p +a. (Condition (i) follows 
from (ii) and (iii).) Then we call A an affine 
space, V the standard vector space of A, and K 
the coefficient field of 4. Each element of A is 
called a point. 

If we fix an arbitrary point oe A, there is a 
one-to-one correspondence between A and V 
given by the mapping sending pe A to aeV 
such that p=o +a. Such an element a of V is 
called a position vector of p with the initial 
point o and is denoted by op. We say that r - 1 
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points p, (0 € c & r) of A are independent if r 
vectors a; = pop; (1 &i &r) are linearly inde- 
pendent in V; otherwise, they are said to be 
dependent. This definition of dependence of 
points p, is independent of the choice of the 
initial point among them. If V is of dimension 
n, we say that A is of dimension n, dim A =n; in 
this case, we sometimes write A" instead of A 
and V" instead of V. The affine space A is of 
dimension n if and only if the maximum num- 
ber of independent points in A is n 4 1. 

Next, for any vector subspace V* of V" and 
an arbitrary point pe A", we put 45 = (qe A"| 
q=p+x,xeV*} and call it a subspace of A”. It 
is an affine space of dimension k: Conversely, 
every subset of A" that is an affine space under 
the affine space structure of A can be expressed 
in this form. A!, A?, and A"^! in A” are called 
a line, plane, and hyperplane, respectively. A set 
that consists of only one point is also consid- 
ered as a subspace A. For subspaces A’ and 
A’ of A", we denote by A"f1 A5 the intersection 
(e, the set-theoretic intersection) of A’ and A’, 
and by A"U AS the join of A" and A’ (i.e., the 
intersection of all subspaces that contain both 
A’ and A‘). Then A" 4^ is the affine space of 
highest dimension contained in A’ and A^, and 
A’ UA‘ is the affine space of lowest dimension 
that contains A’ and A*. If r+ 1 points are 
given in A”, there always exists a subspace A" 
that contains all these points. In particular, if 
the points are independent, then such an A’ is 
unique. Moreover, if A'N A: z Ø (Ø is the 
empty set), then we have r - s - dim(A" U A4?) 4- 
dim(A'" 1 A5). This is called the dimension 
theorem (or intersection theorem) of affine 
geometry. 

Next suppose that r+ 1 points p, (Q<a<r) 
in A" are independent, and put p,,, — pg. Let 
q, be an arbitrary point on p,Up,,, that differs 
from p, and p,.,. If å% are elements of K such 
that 4** Pada dÉ, then qo, .... q, are de- 
pendent if and only if 492! ... 4^ 2(— 1)*!. And 
if rz 2 and e, 2 q,Up,.3U -U Patr (Pis — Pia 
if iz 1), then o9,...,0, have a point in common 
if and only if 494! ... 4 — 1. The former is 
called Menelaus's theorem, and the latter 
Ceva's theorem. 

The set L(A) of all subspaces (including Cj 
considered as an affine space of dimension —1) 
constitutes a flattice with respect to the inclu- 
sion relation. 


B. Parallelism in Affine Spaces 


Let A’ and A? be subspaces of A". We say 
that A’ and A5 are parallel in the wider sense 
if either of the following conditions holds: 
(i) A> 4? or ASDA’; or (it) A'N A= 0 

and dim(A’U A?) & r + s. Next, let A’ and 
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B' be subspaces of A" of the same dimension. 
If A" and B' coincide, or A'(1B' 2 Ø and 
dim(A’U B") 2 r + 1, then they are said to be 
parallel in the narrower sense (or simply paral- 
lel), and we denote the relation by A’ // B”. If 
r=s= 1, the definitions of parallelism in the 
narrower sense and wider sense are equivalent. 
If r 1, parallelism in the narrower sense im- 
plies parallelism in the wider sense. For two 
sets (a,) and (b,) (0 & x <r) of r -- 1 independent 
points, let V" and W” be vector spaces with 
bases dod; and bob; (1 <i<r), respectively. 
Then A' 2 agU ...Ua, and B' — bqU ... Ub, are 
parallel if and only if V’ = W”; and for an 
arbitrary point p, there exists a unique r- 
dimensional subspace that is parallel to A” and 
passes through point p. If A’ and B? are paral- 
lel in the wider sense, then there exist sub- 
spaces A‘ and B' (t x 1) of A’ and B5 that are 
parallel to each other. Moreover, if neither A’ 
nor B* is contained in the other and if t is the 
largest integer with the property just given, 
then we have t=r+s+1—dim(A’U B’). 
Parallelism between subspaces of A" is an 
equivalence relation. Specifically, the equiva- 
lence class of a 1-dimensional subspace A! is 
called a point at infinity and is denoted by AN. 
Given a subspace A’ of A", the set of points at 
infinity 49. represented by lines 4! con- 
tained in A’ is denoted by A^; '; we have 
A’ // B! if and only if A77! = B'.!. A’! is 
called a space at infinity. In particular, the set 
A^! is called the hyperplane at infinity. The 
set-theoretic sum A” U A"! = A" is supplied 
with the structure of a tprojective space; the 
“points” in A" are elements of A”, and the 
“lines” in A" are A! U A® and A}. 


C. Coordinates of Affine Spaces 


If we fix a point o in A" and a basis fe,, ...,e,! 
of the standard vector space V", then any 
point p in A" is uniquely expressed as 


n 


p-ocYx'*e, 


i=1 


xek. (1) 


The system %} — (o; e,, ..., e,) is called an affine 
frame (simply the frame) of A"; the point o 

is called its origin, and e; is called the ith 

unit vector. The mapping sending p to (x!, 

..., X") gives a *bijection of A" to KI: we call 
(x, ..., x") affine coordinates of p with respect 
to %, and x’ the ith affine coordinate. In partic- 
ular, if K is a topological field (e.g., the real 
number field R or the complex number field 
C), this bijection A” K” induces a topology of 
A", which can be shown to be independent of 
the choice of %. For the remainder of this 
article, by “coordinates” we mean affine co- 
ordinates unless otherwise stated. Putting 
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aj; 0e; (1 <i<n), we sometimes call (0; a,, 
....0,) an affine frame. Further, putting /; 2 oU 
aj, tr; oUa,U...Uaj; ,Ua;,4U...Ua,, we call 
a;, l;, and 7; the ith unit point, the ith coordi- 
nate axis, and the ith coordinate hyperplane, 
respectively. 

Assume that subspaces A’ and A’ (r,s >0,r + 
s=n) are not parallel in the wider sense. For 
a point p of A", denote by A'(p) the subspace 
that passes through p and is parallel to A’, and 
put q= A’(p)N A7. A mapping o: A" 47 de- 
fined by o(p) 2 q is called a parallel projection 
on A? with respect to A’. In particular, if A" = 
zn; and A4^—l; (r2n—1,s- 1), we write g(p)  p,. 
Then the ith coordinate x! of p is an element 
of K such that op; = x'ód;. Hence such coordi- 
nates are also called parallel coordinates (or 
Cartesian coordinates). 

Suppose that we are given r+ 1 points 
bo, ..., b, of A" and r + 1 elements 49, ..., 4" of 
K such that 27. 4* — 1. We fix a point o of A". 
If a point p in A" satisfies op = Xho A*ob,, then 
bop =X; bob; hence p is contained in the 
subspace bo U... Ub,. Conversely, if a point p is 
contained in the latter subspace, then there 
exists a system (A9, ..., 4") such that 
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The system (49, ..., 4") has a geometric mean- 
ing since we also have o'p— Za 4*o0'b, if we 
replace the point o by any other point o' of A". 
The elements 2°, ..., 4" are called barycentric 
coordinates of p with respect to (5o, ..., b,j. In 
particular, if (bo, ..., b,) are independent, 

then the barycentric coordinates (4°, ..., 4") 
are uniquely determined by the point p on 

bo U... Ub,. Furthermore, let (yl, ..., y") be 
affine coordinates of p with respect to an affine 
frame 5. and let (x1, ..., x?) be affine coordi- 
nates of b, (x — 0, ...,r). Then p belongs to the 
subspace A' — bgU ... Ub, if and only if y'= 
Dio A“ xi (i9 1, ..., n). In this case we say 

that the system of the linear equations y'= 

Y xi (È A* = 1) gives a parametric represen- 
tation of the subspace A’ (by parameters 4°). 
Specifically, if r 2 n — 1, the solvability of the 
system of equations y'— 272-1 A* xi (i— 1, ..., n), 
1 =) 4* implies the equation E}; y'u;- uo for 
some nontrivial constants fo, ..., D, Hence the 
latter equation represents the hyperplane z= 
A". If a point p has barycentric coordi- 
nates 49 — Al... ss Ai «(r- 1) ! with respect 
to (bo, ..., b,}, it is called the barycenter of bo, 
b,, ...,b,. The barycenter is uniquely deter- 
mined by the set (5,, ..., b,] and is denoted 

by g(bo, ..., b). Specifically, the barycenter 

of two points b, and b, is called the midpoint 
(or middle point) of b, and b,. If we divide 

B — (bs, ..., bi into two sets of points and if 
g, and g, are barycenters of these two sets 

of points, respectively, then g, Ug; passes 
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through the barycenter of B. More generally, a 
point with barycentric coordinates (4°, ... , 4") 
with respect to (bo, ..., b,] is called a barycen- 
ter of bo, ..., b, with weights 1°, ..., 4". 


D. Affine Spaces over Ordered Fields 


Suppose the coefficient field K is an *ordered 
field (e.g., the real number field R). Given a 
hyperplane z of A", we take an affine frame for 
which z is the nth coordinate hyperplane. If we 
denote coordinates of points with respect to 
this frame by (x, ..., x"), then the equation of 
z is given by x"=0. Let A". and A” be sets of 
points whose nth coordinates are positive and 
negative, respectively. They are called half- 
spaces of A" divided by z. The union of z and 
a half-space is called a closed half-space. A 
half-space of a subspace A’ of A" (divided by 
some A’! on A") is called a half-space of di- 
mension r. For a point p of A" that does not lie 
on z, the half-space containing p is called the 
side of p with respect to z. In particular, when 
n — 1, let p and q be two points on a line /. The 
closed side of q with respect to p is called the 
(closed) half-line (or ray) from p to q. The inter- 
section of the closed half-lines emanating from 
p to q and from q to p is called the segment 
joining p and q and is denoted by pq. Clearly 
Pd — qp. A subset C of A" is called a convex set 
if the segment joining two arbitrary points of 
C is contained in C. Each half-space of each 
dimension is convex. For any family C, of 
convex sets, ( ), C, is also convex. Therefore, 
for any subset D in A" there exists a minimal 
convex set that contains D. It is called the 
convex closure (or convex hull) of D. The con- 
vex closure C(P) of a finite set of points P= 
(Do. .... py) in A” is called a convex cell, and 
dim(pgU ... U py) is called the dimension of the 
convex cell. In particular, when pg, ..., p, are 
independent, C(P) is called a k-dimensional 
simplex with vertices po, ..., p,. The 1-dimen- 
sional simplex having two distinct points p 
and q as vertices is the segment pq, and the 
vertices p and q are called ends of the segment. 
A point is regarded as a 0-dimensional sim- 
plex. Each 2-dimensional or 3-dimensional 
simplex is called a triangle or tetrahedron, 
respectively. A k-dimensional simplex S with 
vertices po, ..., p, Is a set of points whose 
barycentric coordinates 4* (0 € a <k, 3 4^ — 1) 
with respect to the vertices satisfy 4*z 0. On 
the other hand, if we put A* =p U ... U p, and 
7, =Po U ... Up, ,UPp444 U Up,, and denote 
by A* the side of p, in A* and by A* the closed 
side of p, in AR with respect to z,, then the 
simplex S is given by ( 5.9 4%, and [f-o A} is 
called an open simplex. A" has the structure of 
a ttopological space in which the set of open n- 
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dimensional simplexes forms a "base of topen 
sets. In particular, if K is R, the topology of A" 
thus defined is compatible with the one that is 
naturally induced by the topology of R. With 
respect to this topology, A" is a *Hausdorff 
space. The terms open and closed used before 
for n-dimensional simplexes agree with the 
corresponding notions with respect to this 
topology. 

A subset of A" is said to be bounded if it is 
contained in some simplex. A bounded set 
obtained through a finite process of construct- 
ing intersections and unions from a finite 
number of closed half-spaces is called a poly- 
hedron. The points of a convex polyhedron are 
characterized by several linear inequalities 
satisfied by their coordinates. A set of points 
whose coordinates (x!, ..., x") satisfy hi<x'< 
k! for ki h' e K is called a parallelotope; it is a 
polyhedron whose *interior is called an open 
parallelotope. A simplex is a polyhedron, and 
polyhedra admit tsimplicial decompositions. 
A polyhedron can also be defined as the set- 
theoretic union of a finite number of simplexes. 

Let P be a finite set of points, and let its 
convex closure C(P) be a convex cell of dimen- 
sion m. Then C(P) is contained in an m- 
dimensional subspace A”. The tboundary of 
C(P) in the topological space A” is called the 
boundary of a convex cell C(P). We can take a 
subset Q of P so that dim C(Q) 2m — 1 and 
C(Q) is the intersection of the boundary of 
C(P) and an (m — 1)-dimensional subspace. 
Such a C(Q) is called a face of C(P), and 
we denote this relation by C(P)>C(Q). If 
C(P)>C(P,)>...>C(P,), then C(P,) is called 
an (m — s)-dimensional face of C(P). A 0- 
dimensional face is called a vertex, and a 1- 
dimensional face is called an edge. Suppose 
that C(P)>C(Q) for P={po,...,p,} and Q 
= {Pigs s piu). Then F=p,,U...Up,,_, isa 
hyperplane of E = po U ...U p,, and C(P) is 
contained in a closed side of E divided by F. 
Therefore, if C(P) has d (m — 1)-dimensional 
faces, then C(P) is expressed as the intersection 
of d m-dimensional closed half-spaces. This 
shows that any convex cell is a polyhedron. 


E. Affine Transformations 


A mapping o: A" A" is an affine mapping if 
there is a linear mapping 9: V" V" of the 
standard vector spaces of A" and A" such that 
o(p-- x) o(p) 4- 9(x) holds for any pe A" and 
any xe V". An affine mapping of A" into itself 
is called an affine transformation (or affinity) 
of A". Specifically, a bijective affine transforma- 
tion is called a regular (or proper) affine trans- 
formation. An affine transformation q of A" is 
characterized by each one of the following 
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properties: (1) Let oe A" be a fixed point. Then 
gy is a mapping of A" onto itself that can be 
expressed as 


Q(o * X) 2 0 a + f(x), (2) 


where a is a fixed vector of V" and f is a 
linear transformation of V", (ii) The mapping 
p: A" 2 A" is a mapping such that o(a)o(b)— 
^: e(p)o(q) if ab — 4- pj (4€ K). Moreover, if 
the *characteristic of K is not equal to 2, an 
affine transformation is also characterized as 
follows: (iii) ọ is a mapping that sends [ines 
into lines and preserves the ratio of each pair 
of parallel segments. 

The set Wi Am of all regular affine transfor- 
mations of A" constitutes a group that we call 
the group of affine transformations. If the 
linear mapping f associated with a regular 
affine transformation o is the identity map- 
ping, then ¢ is called a translation. The set 
%(A”) of all translations is a normal subgroup 
of 9[(A") and is called the group of translations. 
The group of translations is isomorphic to V 
regarded as an additive group. The vector 
group B(A”) (i.e., an additive group of a linear 
space) acts ‘simply transitively on A". We see 
that Y(A")/B(A")=GL(n, K), where GL(n, K) 
denotes the *general linear group. The set of 
all regular affine transformations that leave a 
point o of A" invariant constitutes a subgroup 
(5(A4") of 9L ( 4"); it is called an 'isotropy group 
at o and is isomorphic to GL(n, K). Let § = 
(0; e, , , e.) be an affine frame of A” with 
origin o, and let ọ be a regular affine trans- 
formation of A” given by (2); put x 2 X x'ej, 
olo - x) » o +} X'e;,a = Da'e,, and f(ej) = 
Y. aje;. Then ois expressed with respect to (Y 
by the following equation: 


X'=a'+ ¥ ajx*, det(ai))#0, 1 <i<n. o 
k=] 

Conversely, a transformation that is given by 

(3) is a regular afftne transformation. Elements 

of B(A") and (5(A") are expressed with respect 

to dé by 


| &isn, (4) 


Y—-Yax* det(a)z0, leisen (5) 
1 


k= 
respectively. Hence 9I(A4") is represented as a 
'semidirect product group of %(A") and (F>(A”). 
In particular, a regular affine transformation 
that is represented by x' = ax! (1 «ix n) for 
some ae K (a z 0)is called a similarity with the 
origin 0 as center. 

According to F. Klein, the objects we deal 
with in affine geometry are the properties 
(parallelism, barycenters, etc.) that are 1nvar- 
iant under regular affine transformations. 


26 


Subsets S, and S, of A” are called affinely 
congruent if there exists a regular affine trans- 
formation o sending A, onto S,. For a fixed 
k, two k-dimensional simplexes are affinely 
congruent. Now we fix an affine frame (y in A” 
and denote by x; the coordinates of n 4-1 
points p, (0 &a«n)in A”. Then the quantity 
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is called the volume with respect to d of the 1- 
dimensional simplex with vertices po, , p,. If 
9 is a regular affine transformation given by 
(3), we have V — det(aj) V. Hence the ratio of 
volumes of two n-dimensional simplexes is 
independent of the choice of coordinate sys- 
tems, and is invariant under regular affine 
transformations. 

A regular affine transformation given by (3) 
satisfying det(aj) - ] is called an equivalent 
affnity. The set of all equivalent affinities 
constitutes a subgroup of 9[(A4"). The geometry 
belonging to this group is called affine geome- 
try in the narrower sense. For instance, the 
concept of volume is an invariant in affine 
geometry in the narrower sense. 


F. Relation to Projective Geometry 


Let P" be a projective space over a coefficient 
field K (= 343 Projective Geometry). If we fix 
a hyperplane 7z, in P", then the set of projec- 
tive transformations that leave z, invariant 
constitutes a subgroup of the group of tprojec- 
tive transformations of P"; this subgroup is 
isomorphic to a group of regular affine trans- 
formations. Actually, if we use a projective 
frame Joe, a,, ,a, U] such that a, , a, are 
points on z,,, then each projective transfor- 
mation leaving 7, invariant is expressed by 
equations of the same form as (3) with respect 
to the tinhomogeneous projective coordinates. 
The point set A” complementary to 7,, in P" 

is an affine space, and z, coincides with the 
hyperplane at infinity. Moreover, two distinct 
lines in P" are parallel in A" if they meet on the 
hyperplane at infinity. Hence, denoting by 

(0, I5 .., |^) the -homogeneous projective CO- 
ordinates of the intersection of a line] in A" 
and z,,, we call (I^, , |") the direction ratio of 
1. A projective transformation leaving each 
point of z, invariant induces a translation. 
The ‘principle of duality that holds in projec- 
tive geometry does not hold in affine geome- 
try. The *pole of the hyperplane at infinity, 
with respect to a quadric hypersurface, is 
called the center of the quadric hypersurface. A 
regular quadric hypersurface is called central 
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or noncentral according as its center belongs to 
A" or is a point at inlinity. Quadric hypersur- 
faces in an affine space are classilied in several 
ways, by taking account of their relations with 
the hyperplane at infinity (2 78 Conic Sec- 
tions, 350 Quadric Surfaces). 
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8 (111.1) 
Algebra 


Though the word “algebra” usually refers to 

a field of mathematics as will be explained 
below, the word may also denote specific 
mathematical structures such as *associative 
algebras, ‘Jordan algebras, *Clifford algebras, 
etc. The first concepts concerning “unknowns” 
in algebra originated in India, whence came 
also our decimal positional system of numer- 
ation. These ideas were introduced to Europe 
through Arabia in the Renaissance period. F. 
*Viéte systematized them into a symbolic 
method, called algebra, representing numbers 
by letters. The first problem of algebra was 
solving equations. Before Viéte, G. Cardano 
and L. Ferrari had solved algebraic equations 
of degrees 3 and 4; the solution of equations of 
lower degree had been known from antiquity. 
The effort to solve equations of higher degree 
remained unresolved until the middle of the 
19th Century, when N. H. "Abel and E. *Galois 
proved the nonexistence of algebraic solutions 
of such equations. They considered not only 
individual roots of these equations but also 
any rational transforms of their roots at the 
same time, and thus were led to the concept of 
'fields. They also noticed that the problem of 
algebraic solution could be characterized by 
properties of permutation groups of the roots. 
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After the discovery of the Galois group, group 
theory and group-theoretical considerations 
maintained the central position in algebra for 
some time (— 172 Galois Theory). These de- 
veloped into the “abstract algebra" of this 
Century in the general atmosphere of arithme- 
tzation and of axiomatization of mathematics. 
At the turn of the century the monumental 
textbook in three volumes by H. Weber [1] 
was considered the standard work on algebra. 
Then there appeared in 1910 an epoch-making 
paper [2] by E. Steinitz on the abstract theory 
of lields. 

The main objects of algebra today are falge- 
braic systems of various kinds, such as *groups, 
*rings, ‘fields, and *modules. Another fun- 
damental concept of algebra is that of fiso- 
morphism or of homomorphism. The col- 
lection of algebraic systems of a given kind, 
together with the homomorphisms among 
them, gives rise to the notion of ‘category; a 
functor is a sort of homomorphism between 
categories (— 52 Categories and Functors). 
These notions were first used in thomological 
algebra, created in the 1940s by methods 
transferred from topology to algebra; now 
they are of basic significance to the whole of 
mathematics. 

An important branch of algebra with wide 
applications is the theory of fvector spaces, or 
more generally that of tmodules over a ring. 
This branch is called linear algebra. Homo- 
morphisms between finitely generated modules 
can be represented by fmatrices. Another 
branch of algebra, called frepresentation 
theory, is concerned with representations of 
groups or rings by matrices. The methods of 
modern algebra provide useful and powerful 
tools for the whole of mathematics, in partic- 
ular for the theory of numbers and algebraic 
geometry. 

The present development of algebra owes 
much to the activity of the German school in 
the late 1920s represented by E. Noether, E. 
Artin, W. Krull, and B. L. van der Waerden. 
The book by van der Waerden [3] has had a 
great impact on mathematics. N. Bourbaki [4] 
has been influenced by van der Waerden but 
gives accounts of more recent developments, 
particularly in linear algebra. In Japan, M. 
Sono, who worked at about the same period 
as E. Noether, was a forerunner in this field; 
after him, algebraists of the Kyoto School, Y. 
Akizuki, M. Nagata, and their followers, did 
notable research, especially in algebraic geom- 
etry. On the other hand, K. Shoda studied 
with E. Noether toward 1930 in Germany; his 
school includes such algebraists as T. Naka- 
yama, K. Asano, and G. Azumaya. Finally 
K. Morita and his disciples have made signif- 
icant contributions to homological algebra. 
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9 (VIII.2) 
Algebraic Curves 


A. General Remarks 


An ‘algebraic variety of dimension 1 is called 
an algebraic curve (for analytic theory 2 11 
Algebraic Functions). The theory of algebraic 
curves has two aspects, the geometry of 1- 
dimensional complex manifolds in projective 
spaces and the theory of function fields of 
transcendence degree 1 (— 3 Abelian Varieties, 
16 Algebraic Varieties). The number-theoretic 
study of algebraic function fields concerns the 
latter theory (- 73 Complex Multiplication, 
450 Zeta Functions). In this article, the geo- 
metric aspect of the theory is emphasized. 

We denote the *universal domain by K. 


B. Classical Results on Plane Algebraic Curves 


Let f(X, Y) be a polynomial of degree m in 
two variables X and Y. A point set in an affine 
two-space defined by f(X, Y) = 0 is called the 
plane algebraic curve of degree m. If we set 
F(Xo, X, X5) = Xo f(X1/Xo, X2/Xo), the 
homogeneous polynomial F defines an alge- 
braic curve of degree min a projective plane 
Di The curve is called irreducible if the poly- 
nomial f(X, Y) is irreducible. A curve of degree 
1 is said to be a line. Some results of this sec- 
tion are valid only in the case where the char- 
acteristic of K is zero. 

Let C be a plane curve defined by the equa- 
tion f(X, Y) 2 0. A point P = (a, b) on C is 
called an r-ple point if f(X +a, Y +b) has no 
term of degree « r in X and Y. At an r-ple 
point there are r tangent straight lines (count- 
ing multiplicity). An r-ple point with r 1 is 
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called a multiple point (or singular point). If 
these tangents are distinct, P is called ordi- 
nary. An ordinary double point is called a 
node; e.g., the origin for X? + Y? -3X Y = 0. 

An algebraic curve can be transformed into a 
plane curve that has only ordinary multiple 
points by a finite number of plane {Cremona 
transformations (*quadratic transformations of 
the projective plane into itself). 

Let C be anvirreducible plane curve of de- 
gree m> | in a projective plane. The set of all 
tangent lines at nonsingular points of C deter- 
mines a set of points in the dual projective 
plane, and its closure is an algebraic curve C, 
that is called the dual curve of C. The dual 
curve of C becomes the original C. The degree 
m of C is said to be the class of C, which is 
equal to the number of tangent lines to C 
drawn from a general point. A nonsingular 
point P is called a point of inflection (or a flex) 
if the tangent line at P has contact of order 
> 2. If C is defined by an irreducible homoge- 
neous polynomial F(X,, X,, X.), the curve 
defined by det(0^ F/0X;0 X;) - D is said to be the 
Hessian of C. A nonsingular point P is a point 
of inflection if and only if P is contained in the 
Hessian. A singular point P is said to be a cusp 
if C is detined by an equation Y? = X? + higher 
terms, in terms of suitable affine coordinates X 
and Y where P - (0,0). Whenever the singular 
points of C are only v nodes and y cusps, the 
effective genus g is given by the formula g = 
(m= 1) (m- 2)/2 =v -y. In addition, suppose 
that the dual curve has only v' nodes and y' 
cusps as singularities. Then C has y' points 
of inflection, and m and y’ are given by m' = 
mm 1)—-2v—3y and y 23m(m—2) — 6v — 8y. 
Moreover, m=m(m- 1)—2v — 3y and y = 
3m'(m —2) 6v  8y'hold. These formulas are 
called Plücker formulas. 

For example, a nonsingular plane curve of 
degree 3 is an elliptic curve, Le g = 1, of class 
6, which has 9 points of inflection. A non- 
singular plane curve of degree 4 has, in gen- 
eral, 24 points of inflection and 28 bitangents. 
This results from the Plücker formulas. 


C. Fundamental Notions 


In what follows, by a curve we mean an alge- 
braic variety of dimension 1. Let I be a non- 
singular complete irreducible curve. An ele- 
ment of the free Abelian group generated by 
points of T is called a divisor. A divisor is 
written in the form a = È n; P, with n;e Z. The 
integer n= > n; is called the degree of a and is 
denoted by deg a. The expression for a divisor a 
is said to be reduced if P, # P, for | #j. A divisor 
whose reduced expression has only positive 
coefficients is called a positive divisor (or effec- 
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tive divisor, or integral divisor) and this is 
denoted by a7(). The group of divisors on T is 
denoted by G(T), and the subgroup consisting 
of divisors of degree 0 is denoted by G(T). Let 
P be a point of I. The subset of the function 
field K(I) of I consisting of functions regular 
at P forms a valuation ring Rp for a fdiscrete 
valuation of K(T). A prime element t of Rp is 
called a local parameter at P. Let vp be the 
‘normalized valuation of KO) defined by Rp; 
the integer vp( f) is called the order off at P. 
The point P is a zero off if vp( f) » 0; it is a 
pole off if vp( f) « 0. There are only a finite 
number of poles and zeros of a given function 
f. The divisor È vp( f) P is called the divisor of 
the function f and is denoted by (f). The set of 
divisors of functions forms a subgroup G, of 
G,. Any divisor a in G is called a principal 
divisor (we also say that a is tlinearly equiva- 
lent to zero and write a ~ 0). 

Let a be an arbitrary divisor. The set of all 
positive divisors that are linearly equivalent to 
a forms a complete linear system |a| deter- 
mined by a. We set L(a) ={ fe K(r) (f) + 
a0) U (0j. Then L(a) is a finite-dimensional 
vector space over K, and 1-dimensional sub- 
spaces of L(a) correspond bijectively to the 
elements of aj. We set J(a) = dim, L(a) and 
dim |a|=/(a)— 1. Then dim|aj is called the 
dimension of aJ. For any divisor a, the integer 
deg a = dim|a is nonnegative and bounded. 
The supremum g of such integers is called the 
genus of I’. The nonnegative integer i(a) = 
g ~dega +dim a is called the speciality index 
of a. 

Let œ be a tdifferential form on I', P be a 
point of J’, and f be a local parameter at P. 
Then o can be written in the form w = fdt. We 
now set vp(c) = vp( f) and (w) = X vp(o)P. Then 
(w) is a well-defined divisor, and the class of 
(w) in G/G, is independent of the choice of o, 
This divisor class is called the canonical class; 
any divisor in this class is called a canonical 
divisor (or differential divisor) and is denoted 
by t. We have J(f) = y, deg [= 2g ~ 2. Given a 
divisor a, the index i(a) is equal to the number 
of linearly independent differentials c) such 
that (@)>a, i(a)=l(f—a). The equality J(a)= 
deg a —g + 1 + i(a) is called the Riemann-Roch 
theorem. 

For any irreducible algebraic curve I’, there 
exists a birationally equivalent nonsingular 
curve f that is unique up to isomorphism. The 
genus of [ is called the effective genus of Į, A 
curve whose effective genus is zero is called a 
rational (or unicursal) curve. An elliptic curve is 
a curve whose effective genus is 1. An irreduc- 
ible curve with genus > 2 is called a hyper- 
elliptic curve if K(T) is a quadratic extension of 
a field K(t) for some t. 

A positive divisor a on a nonsingular com- 
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plete irreducible curve F is called a special 
divisor if i(a) > 0. In this case, 2 (J(a) — 1) < deg a, 
where equality holds if and only if a = 0 or a w 
f, or Į is a hyperelliptic curve (Clifford's 
theorem). Let [' be a nonsingular complete 
irreducible curve and k be a sublield of K such 
that I is defined over k. Denoting by k the 
algebraic closure of k, we call a divisor p = 

2 n;P, on T a prime rational divisor over kif 

p satisfies the following three conditions: (1) p 
is invariant under any automorphism g of k/k; 
(ii) for any j, there exists an automorphism 

o; of k/k such that P, = Po; Gii) n, = . =n, = 
[k(P,):k],. An element in the subgroup of 
G(T) generated by prime rational k-divisors is 
called a k-rational divisor. Let k(I)be the 
subset of K(I) consisting of functions f defined 
over k. Then K(I) is a subfield of K(I), and the 
quotient field of K(I) 69 ,K = K(T). k(I)is 
called the function field of T over k. Let p be a 
prime rational k-divisor, and let P be a point 
of p. Then Rp N k(I) is a valuation ring of k(T) 
uniquely determined by p and independent of 
the choice of the point P in p. We call this 
valuation ring the valuation ring determined 


by r. 


D. Algehraic Function Fields 


Let k be a field, and let K be a finite separable 
extension of a purely transcendental extension 
k(x) of k such that k is maximally algebraic in 
K. Then K is called an algehraic function field 
over k of dimension 1 (or of transcendence 
degree 1). The equivalence class of fexponen- 
tial valuations of K that are trivial over k is 
called a prime divisor of K /k. An element of the 
free Abelian group generated by prime divisors 
is called a divisor of K/k. The group opera- 
tion in the divisor group of K/k is usually 
denoted multiplicatively. Let Rp be the 
valuation ring of the prime divisor P, and 

let Mp be the maximal ideal of R,. The de- 
gree deg P of the prime divisor P is defined by 
[(Rp/M p): K. If we replace the terms: curve T 
by function field K/k; K(I) by K; K by k and 
points on T by prime divisors of K/k, we can 
develop the theory of the function field K/k, 
which is similar to the theory of nonsingular 
curves T (- Sections B, C). Thus we define the 
genus of the function field K/k. 

Suppose we are given an algebraic function 
field K/k of dimension 1. An algebraic curve I 
defined over kis called a model of Kik if k(I) 
and K are k-isomorphic. For any function field 
of dimension 1, there always exist two ele- 
ments x and y in K such that K= k(x, y). Let 
f(X, Y) be an irreducible polynomial such that 
f(x, y) = 0. Then the plane curve defined by the 
equation f(X, Y) = 0 is a model of K/k. Among 
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the models of K/k there exists a *normal model 
T, over k that is unique up to isomorphism 
(and the uniqueness of the normal model of 
the function field within the birational equiva- 
lence class of varieties holds only for curves). In 
particular, if k is the complex number tield, the 
normal model F, is the *Riemann surface of 
the function lield K/k. If T has no singular 
point, the theory of the curve I and the 
theory of the function tield K/k are essentially 
identical. (This occurs, for example, when & is 
tperfect.) In that case the genus of Tọ is equal 
to that of K /k, In general, the genus of the 
function field is not less than the genus of the 
normal model Ta, and it is greater than the 
latter Te has a singular point. If the genus of 
K/k is zero, we can take a plane quadratic 
curve as a model of K/k. Moreover, K/k has a 
prime divisor of degree 1 if and only if K is a 
purely transcendental extension of k. A func- 
tion lield K/k of genus 1 is called an elliptic 
function field. If K has a prime divisor of de- 
gree 1, an elliptic function field K has a model 
of a plane cubic curve. Moreover, if the char- 
acteristic of the lield & 1s different from 2, we 
can take as the model T, the curve defined by 
anequationoftheform Y?24X?—g,X —g,. 
This is called Weierstrass’s canonical form. 
The number j-(g;? Hiel ! g;? #O)is a 
birational invariant of J}. 


E. Jacobian Varieties 


Let [ be a nonsingular curve. A tgroup variety 
J is called the Jacobian variety of T if it has the 
following four properties (we fix an algebrai- 
cally closed ‘field k of definition for Į and J): 
(i) There exists an isomorphism d (of-abstract 
groups) of Go(I)/G,(I) into J. (ii) is contin- 
uous in the following sense: Let ii, b be elements 
of Go(D)/G(T) represented by a, b. If b isa 
specialization of a over a field K( > k), then 
(b) is also a specialization of (a) over K. (iii) 
If there exists a K-rational divisor in the class 
à, then the point ®(a) is also K-rational. (iv) 
For any £eJ, there exists a k(C)-rational divi- 
sor a in G such that O(a mod G,) = ë. A group 
variety J satisfying these conditions is neces- 
sarily a complete variety, hence an fAbelian 
variety, and is determined uniquely up to 
isomorphism. The construction of Jacobian 
varieties over a lield of arbitrary characteristic 
is due to A. Weil [27] (for analytic construc- 
tion > 11 Algebraic Functions C). 

Let P be a *generic point of F over k, and let 
P, be a fixed k-rational point. Then g(P)= 
Q(P — P) defines a rational mapping of F into 
J, and @, which is an isomorphism of Į and its 
image Q(T), is determined uniquely by ® up to 
translation on J. This mapping q is called the 
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canonical function on TI The dimension of J is 
equal to the genus g of Į. If P,,..., P, are 
independent generic points of T over k, then 
k(P,, ..., Pj), is the function field of J over k, 
where k(P, , . , P), is the subfield invariant 
under the group of g! automorphisms (P,,.., 
EAR... Il The Jacobian variety of T is 
also the +Picard variety of T, and it is equal to 
the tAlbanese variety of F (- 16 Algebraic 
Varieties P). Hence for any function f on I 
with values in an Abelian variety A, there 
exists a unique homomorphism 4 of J into A 
such that f=A0 ọ+ const. This 4 is called the 
linear extension of f. 

Let © be the set of points on J that can be 
written as p(P,) ++ ob JL Then Q is an 
irreducible subvariety of codimension 1. The 
divisor @ is called the canonical divisor of J. 
The Jacobian variety that is polarized by the 
divisor © is called the canonically polarized 
Jacobian variety (— 3 Abelian Varieties G). If 
two curves Į and J” are birationally equiva- 
lent, the canonically polarized Jacobians of I 
and I” are isomorphic. Conversely, if the 
canonically polarized Jacobian varieties J of T 
and J’ of ['" are isomorphic, then T and J” are 
birationally equivalent (Torelli’s theorem). Let 
r be any integer such that 1 <r < g, and let W, 
be the set of points that are written in the 
form 


O(P,)+...+0(P,) 


(W, = 9(D), W,., = 9, W, = J). Then we have 
©” =r! W,., (fnumerically equivalent) and 
(Q) = y!, where ©" is the class of intersec- 
tions of r copies of €), The existence of a divi- 
sor @ is characteristic for Jacobian varieties. 
Actually, if A is an Abelian variety of dimen- 
sion n that has: an irreducible subvariety X"^! 
of codimension 1 and a positive l-cycle C such 
that (X)=n! and X" P? —(n IC then C 
is a nonsingular irreducible curve, A is the 
Jacobian variety of C, and X is the canonical 
divisor. The canonical divisor €) is defined 
by a 'theta function in the classical case. 
For a divisor a = È n;P,, we define (a) to be 
Yno(B)eJ. For lixed numbers | &r&g, 0« d, 
welet W? denote (9(a)eJ |l(a) 2 d- 1, dega= 
Y}. Then W? € W, and is a tsubscheme of J. 
Wecall the number p=g—({d+ 1)(g—r+d) 
the Brill-Noether number. Then dim W‘ > p. 
Moreover, if I is a general curve, then dim we 
= p. In particular, if p < 0, then W?= LG. This 
result has been verified recently by S. Kleiman, 
D. Laksov, P. Griffiths, and J. Harris [9, 14]. 
Let TU be a nonsingular curve, and let o be a 
differential form on T. If the divisor (w) is 7-0, 
the œ is called a regular 1-form or differential 
form of tbe first kind. Let Q be the tsheaf of 
germs of regular differential forms. A differen- 
tial form of the first kind is an element of 


H (T , Q), and vice versa. Let f be a canonical 
divisor. Then we have a natural isomorphism 
H*(T, Q) = L(f), and the number of linearly 
independent differential forms of the first kind 
is equal to the genus g of I. The tresidue of a 
differential can be defined as in the classical 
case. A differential that has nonzero residues is 
called a differential of the third kind. The re- 
sidue theorem > Res, c = 0 holds for any dif- 
ferential o. The form o is called a differential 
form of the second kind iffor any Pe [ there 
exists a rational function fp such that c — dfp is 
regular at P. The set of differential forms of the 
second kind forms a linear space G, over the 
universal domain and contains the subspace 
G, consisting of the differential forms of the 
first kind. The quotient space G;/dk(T) has 
dimension 29 or y according as the character- 
istic of the universal domain is 0 or not. 

When the characteristic p of the universal 
domain is positive, we have what is called the 
Cartier operator. Let [ be a curve defined over 
a perfect tield k, let L = k(I), and let t be an 
element of L that is transcendental over k and 
such that L/k(t) is separable. Then any dif- 
ferential œ of L/k is written uniquely as () 

—(fd fft... f? t” !)dt, where Gel, 
Then the Cartier operator C given by Co 

= f,-1 dt is well defined and independent of 
the choice of t and leaves G, invariant. Hence 
given a basis o. ,@, of G,, we obtain a 
matrix (a,) with coefficient in L by Co, = 

bs G;,(0;( 1 Xi € y). This gx g matrix A is called 
the Hasse-Witt matrix of Į. The class of 

A modulo the transformations of the form 

S PAS is a birational invariant of [' and plays 
an important role in the theory of unramified 
cyclic p-extensions of the algebraic function 


field. 


F. Generalized Jacobian Varieties 


The notion of linear equivalence of divisors on 
a nonsingular curve can be extended to a more 
general situation. Such attempts have been 
made by M. Noether, F. Severi, and M. Rosen- 
licht, who succeeded in obtaining sucha gen- 
eralization [22]. 

Let I' be an algebraic curve, and let 
P, , , P, be singular points of T, Let Dp. be 
the flocal ring of P. We set D = (\j_, Dp, and 
['2»rI- {P,, i P3. An element of the free 
Abelian group G(T) generated by points of Į” 
is called a ['-divisor. Let a be a ['-divisor and 
set L(a) - 1 f eO |( f) c a7- 0; U {0}. Then L(a) 
is a finite-dimensional linear space (over the 
universal domain). The dimension of L(a) is 
denoted by Ia), and we set dim|a|— (a) — 1. 
The upper bound z of deg(a) = dim| a|is a 
nonnegative integer and is called the O-genus 
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of I, We call i(a) 2 x — dega - dim|a| the O- 
speciality index of the divisor a. Let C bea 
nonsingular curve birationally equivalent 

to [', and let Q,,. . . , Q, be points of C that 
correspond to singular points of Į. An O- 
differential ois a differential form on C (of 
K(I) = K(C)) such that 27.., Reso, fc = 0 for 
any f c£). Then i(a) is equal to the number of 
linearly independent $5-differentials o such 
that (@)>a in J’. The equality l(a) = dega— 

T + 1 + i(a) is called the generalized Riemann- 
Roch theorem. An ©-differential o is called an 
-differential of the first kind if o is regular 
everywhere on Į”, The number of linearly 
independent ©-differentials of the first kind is 
equal to the O-genus 7. Let g be the effective 
genus of I, i.e., the genus of C. Then we have 
the equality 7 —g = dim, (10/9) = ô, where Os 
the integral closure of 8 in K(T). The set of O- 
differentials forms an D-module that is in 
general not of rank 1. Hence in this case, we 
do not have the “canonical divisor." Let c be 
the conductor of O/D. Then c determines a 
divisor in a natural way. If we denote the de- 
gree of this divisor by d, we have the tnequal- 
ity ó- 1 «d € 20. We have d=20 if and only 
if the set of O-differentials forms an D-module 
of rank 1. This case occurs, for example, if Į is 
a curve on a nonsingular surface or a complete 
intersection. Two [-divisors a and b are said 
to be O-linearly equivalent if there exists a unit 
f of O such that a-b=(f). The set of T- 
divisors that are O-linearly equivalent to zero 
forms a subgroup GA) of G(T). There exists a 
group variety Jo, unique up to isomorphism, 
that satisfies the four conditions required for 
Jacobian varieties (- Section E) with respect 
to the class group Golf VGU, The variety Jy 
is called the generalized Jacobian variety. The 
generalized Jacobian variety is not complete, 
in general. If J is the Jacobian variety of C, 
then J, is an extension of J by a connected 
‘linear algebraic group I.. Any Abelian exten- 
sion of the function tield of I' can be obtained 
by the tisogentes of the generalized Jacobian 
variety of I’ [22]. This fact plays an important 
role in class field theory over algebraic func- 
tion fields (2 59 Class Field Theory G). The 
theory for nonsingular curves is considered as 
the special case in which © = K(I). 

Suppose that I’ is situated in a projective 
space of dimension n. Let p be the prime ideal 
in k[Xo, Xi, > X,] deiining [' and y(p, m) be 
the number of linearly independent homoge- 
neous polynomials of degree m modulo p. Then 
y(p, m) is a polynomial in m for large m. This 
polynomial is called the Hilbert polynomial of 
p (or T`). Let c be the constant term of the 
Hilbert polynomial. The number p,(I) = 1 —c 
is called the arithmetic genus of Į and is equal 
to the O-genus of T. Let be a nonsingular 
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irreducible curve in P? of degree d. If I is 
contained in a plane, then p,(I)- (d 1): 
(d-2)/2. Otherwise, p,(I) d?/A—d -- 1 when 
dis even, and p,(I) (d? — 1)/4-d+ 1 when d 
is odd [11]. 


G. Sheaf Theory 


Let [' be an irreducible curve and 0, be the 
local ring of p. Then (j. = U 0, is an talgebraic 
coherent sheaf, which is called the structure 
sheaf of I’, and dim, HI, (/;) is equal to the 
arithmetic genus 7 of I, Let a be a I-divisor, 
and let @,-(a) be the tsheaf of germs of rational 
functions f such that ( f) - a»- 0 and f € Og for 
every singular point Q of T (- 383 Sheaves 
D). Then dim, HI, O,(a)) is equal to the 
speciality index i(a), and dim, H9(L, (6 .(a)) 

is equal to I(a). When I has no singular 
point, the Riemann-Roch theorem is deduced 
naturally from tSerre’s duality theorem: 
HIT. 6 (a) s H*(T, €, —a)). 


H. Algebraic Correspondence 


Let I be a nonsingular curve. A divisor of 

the product variety Į x T is called an alge- 
braic correspondence of ['[26, 27]. Let Dy 

be the subgroup consisting of divisors that 

are linearly equivalent to degenerate divisors 
ax l'-I x b Then the class group ¢(I) = 

G(I x D)/D, is called the group of classes of 
algebraic correspondences. We write X =0 if 

X is an element of D}. Let X be an algebraic 
correspondence, k a fjeld of delinition for T 
over which X is rational, and P a generic 
point of I over k. Then X(P)— pr;[X(P x T)] 
is rational over KP). The composite X ,0 X, 
of two correspondences X, and X,is defined 
by (X, 0 X;)(P) X,(X;(P)) whenever they 
have meaning. The composite X, o X, deter- 
mines an element of (I) that depends only on 
the classes of X, and X,. This multiplication 
supplies the group @(I) with the structure of 
an associative ring. This ring is called the 
correspondeuce ring of F, The correspondence 
ring €(I) and the ring .» of endomorphisms of 
the Jacobian variety J are isomorphic, and the 
isomorphism is given by the following rule: Let 
Z be an element of G(T), and let X be a divisor 
in č, Let P be a generic point of I with re- 
ference to kover which X is rational. Let Py be 
a k-rational point of I’, Then the class of 

X(P) — X(P,) modulo GI) is independent of 
the choice of a divisor X in the given class. We 
set P(P)- G(X(P)— X(P))) andlet 4 be the 
linear extension of Y. The correspondence 
&4 is an anti-isomorphism of €(I) and og. 
Now we set = af GO Then .¥, contains 
an automorphism 1 of order 2 called an in- 
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volution. Let 1 be a rational prime different 
from the characteristic p. Then A has a faithful 
representation by 2g x 2g matrices with co- 
efficients in J-adic integers. The ftrace g of 

this representation has the property that 

c( HU f) >0 if P 40 (Castelnuovo's lemma). £g 
is an algebra of finite tank over Q, and of is a 
finitely generated Abelian group. Based on 
these results A. Weil proved the Riemann 
hypothesis for congruent C-functions on a non- 
singular curve (- 450 Zeta Functions P), 


I. Coverings 


Let I and C be nonsingular curves such that 
there exists a regular mapping x: l'2 C. Then 
there is an injection of the function field K(C) 
into K(T). If K(f) is separably algebraic over 
K(C, then [ is called a covering (curve) of C. 
The integer [K(I): K(C)] = n is called the de- 
gree of covering. Let P be a point of T and let 
Q=n(P). Let t, s be local parameters at P on T 
and at Q on C, respectively. The nonnegative 
integer vp(ds/dt) is called the differential index 
at P and is denoted by mp. The index mp is 
zero except for a finite number of points. The 
divisor È mp P is called the branch divisor. The 
covering Į is called an unramified covering if 
the branch divisor is zero. If we denote the 
branch divisor by a, we have the formula 
2g(D) - 2- n2g(C) —2) - dega, where ol! 
and g(C) are genera of [` and C, respectively. 
This is called the Riemann-Hurwitz formula. 
This formula yields at once that a rational 
curve has no nontrivial unramified covering 
and that I' can be an unramified covering of 
itself if and only if Į is an elliptic curve. 


J. Theory of Moduli 


Let „M (k) be the set of isomorphy classes of 
complete nonsingular irreducible curves (here 
simply called "curves") of genus g defined over 
a field k. We can endow .ZJ(k) with a structure 
of an algebraic variety over k with the prop- 
erty that for any smooth family (over K 2:6 
S of curves of genus y the map T:S.Z,(k) 
sending s to the isomorphy class of the fiber 
x '(s) is a *morphism. This algebraic variety is 
called the (coarse) moduli space of curves of 
genus g. Furthermore, the moduli space over 
Spec(Z) exists (D. Mumford [16]). .Z(k) is 
tnormal, irreducible and tquasiprojective (but 
not complete for g > 0) of dimension 0 (g = 0), 1 
(g=1), 39 —3 (g >2) (23g —3-- dim Aut(C)) 
with only quotient singularity [5]. 

Since when g = 1 any elliptic curve C over 
an algebraically closed lield k is isomorphic to 
a plane curve with the Weierstrass canonical 
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form y? 24x? — gax — g, (if char(k) Z2; the case 
of char(k) = 2 needs a slight modification), the 
correspondence Cj(g5, g4) (— Section D) 
defines an isomorphism A, ~ A; (affine line). 
In the case of g > 2 and k= C we have another 
function-theoretic construction of .@, due to 
Teichmüller (- 11 Algebraic Functions F). 

Let ./, be the coarse moduli space (over Z) 
of principally polarized Abelian varieties of 
dimension g (— 3 Abelian Varieties 1). For a 
curve C we denote the Jacobian variety of C 
by J(C) (- Section E). The correspondence 
C—J(C) defines a morphism i:.4,—., which 
is injective (Theorem of Torelli). It is even an 
timmersion (F. Oort, J. Steenbrinck). If g= 1, 
2, 3, then i is an open immersion whose image 
we can describe. 

For k=C we can express the above map 
i by using periods of curves. Namely, let 
Xis, Ge By,, D, be a (canonical) basis of 
H, (C, Z) detined by "normal sections of C 
(considered as a real surface). Let w,,.., @ be 
a basis of 'differential forms of the first kind 
on C with n co; = 0; Then the matrix Q = 
(oi is symmetric and has a positive definite 
imaginary part, i.e., is an element of the tSie- 
gel upper half-space S, of degree y. With the 
identification . 7, = S,/Sp(g, Z) (- 3 Abelian 
Varieties 1) the map (A, is nothing but 
the one sending the isomorphy class of C to 
Rmod Sp(g, Z). For g =4 the closure of (A. 
is a principal divisor detined explicitly with 
theta constants (Schottky, J.-I. Igusa). 


K. Stable Curves 


A reduced connected complete curve C over k 
is called a stable curve of genus g ( > (0) if(1) C 
has only ordinary double points as possible 
singularities; (ii) when I' is a smooth rational 
irreducible component of C, then F intersects 
the other components in more than 2 points; 
(iii) dim, H' (C, Co) =g. 

There exists a *coarse moduli space 5, of 
stable curves of genus g which contains JL, 
as a Zariski open subset. The space 9, detined 
over Spec(Z) is tcomplete and even tpro- 
jective; hence Y, gives a compactification of 
M, The completeness of 5, follows from the 
stable reduction theorem as follows: let R be a 
discrete valuation ring with quotient field K 
and C be a smooth connected curve over K. 
Then there exists a finite separable algebraic 
extension L of K such that the curve C x x L 
extends to a flat family of stable curves over 
Spec(R,), where R, denotes the integral clo- 
sure of R in L. In this case we say that the 
curve C x x L has a stable reduction in HR. 
With the above notation a curve C over K 
has a stable reduction in R if and only if its 
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Jacobian variety J(C) has a ‘stable reduction 
in R (- 3 Abelian Varieties N). 

Over C, -¥, has the tSatake compactification 
,, which is a disjoint union of .%,,,0 <g' eg, 
as a set. The injection (Aa, (- Section J) 
extends to a morphism j: 4, «£, that sends 
the isomorphy class of a stable curve C = 
C, UU C, (C, irreducible) to that of J (C) 

x x J(Gea,, g'— X;genus(C;, where Č, is 
the tnormalization of C;. In particular the 
closure of the image of i in . is the set of 
products of Jacobian varieties. 
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A. General Remarks 


Let F (XiX) > AKG, Xm) be r 
tpolynomials in m variables X,,., X, Over a 
tfield k. Then the equations 


F, =0,. ..) F,=0 


are called algebraic equations in m unknowns. 
When we consider these equations simulta- 
neously, where r > 2, we call them a system of r 
equations or simultaneous equations. (For r = 1, 
a system of one equation means the single 
equation F, = 0.) Coefficients of F,,, F, are 
called coefficients of the system, and the great- 
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est of the degrees of F}, . . , F, is called the 
degree of the system. 

To solve a system of equations (henceforth 
in this article we shall omit the word “alge- 
braic”) means to find the common tzero points 
Gn an talgebraically closed field containing k) 
of elements F;,. . , of the tpolynomial ring 
k[X,, » X.]. If there exist no common zero 
points, the system is said to be inconsistent; if 
there exists a finite number of such points, it 
is said to be regular; and if there are an infinite 
number of such points, it is called indetermin- 
ate. The telimination method allows us to 
reduce the problem of solving a system of r 
equations to the case r = 1. In particular, any 
regular system of equations can be reduced to 
the case m=r= 1. 


B. Equations in One Unknown 


For the above reason, it is important to con- 
sider an equation of the form f(X ) = 0, where 


Ay £0. (1) 


This gives the general form of an algebraic 
equation in one unknown. 

According as f (X ) is reducible or not in the 
tpolynomial ring &(X ], the equationf(X)-O 
is called reducible or irreducible (- 337 Poly- 
nomials). In some talgebraic extension field K 
of k, f(X) can be factored: 


f(X)- ag(X —o)(X —25)...(X — an). (2) 


f(X)5agX" +a, X" ! +... +a, 


Xis- 5%, are called the roots of the equation 


f (X) =O. Hence, any algebraic equation of 


degree n has exactly n roots (Kronecker's 
theorem). Now, ( — 1)'a;/ag is equal to the tele- 
mentary symmetric function of degree į of 
%15+++5a,. Some of the roots o, 9, may be 
identical. If x appears p times in q,, . , a, We 
say that & is a p-tuple root, and p is called the 
multiplicity of the root g. When p = 1, x is 
called a simple root, and when p > 2, « is called 
a multiple root. Let B,,..., f,be all the distinct 
roots among a,,. ,%,, and let p; be the multi- 
plicity of B{i= 1, , v). Then 


F(X) =ao(X — yy... (X — f^, (2) 
p*t.tp,z-2n. 

If p,,. , p, are not divisible by the *character- 
istic of k, the greatest common divisor g off 
and 

f'-nayX" ^"! c(n—1)a, X"? DEE THE 


is (X — fp) ! ...(X — f,» +. Thus we can 
reduce the multiplicity of every root to 1 by 
dividing f by g. Any irreducible equation over 
a field of characteristic 0 has no multiple roots. 
Equation (1) has multiple roots if and only if 
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its *discriminant D is equal to 0 (- 149 Fields; 
172 Galois Theory). 


C. Equations of Special Type 


In Sections C and D we assume that the char- 
acteristic of k is zero. 


Binomial Equations. An equation of the type 
X" =a =Q is called a binomial equation. It is 
solved by root extraction. Let n/a (mtb root of 
u) be one of the roots (if a is a positive real 
number, %/a usually denotes a positive real 
root). Then n/a multiplied by 1, EC, En 
are the roots of X" —a —0, where ( is a tprim- 
itive mth root of unity. 


Reciprocal Equations. An equation a, X” + 
a,X" '+...+a,=0 is called a reciprocal 
equation if dy) =a, à; —4à, ,,d5— d, 5,....À 
reciprocal equation of an odd degree n = 

2m + thas a root X ze l, and dividing the left 
side by X + 1 we get a reciprocal equation of 
degree 2m, A reciprocal equation of degree 

n = 2m is reduced to an equation of degree m 
in Y= X + X ^! and the quadratic equation 
X?—-XY+1=0. 


D. Equations of Lower Degree 
(- Appendix A, Table 1) 


(1) A linear equation ao X + a,=0 hasa single 
root —d,/dy.(2) the roots of a quadratic equa- 
tion de X? - a, X +a, =0 are given by (—a, 
-/ai—4aga,)/2ag. (3) To solve a cubic 
equation ao X? - a, X^ a; X +a,=0, we set 
Au Day da — 2a? —27a2a,, A, a2 —3aga;, 
and solve the quadratic equation T? A, T 

+ A}=0. Let t, and t, be the roots of this 
quadratic equation, and let ou be any cube root 
of |, Then (-a, 4- w/t, FO 3/1, 3a, is a 
root of the original cubic equation. (Cardano's 
formula). If we apply this method to a cubic 
equation aX? + bX? +cX +d=0 with real 
coefficients, we need to use complex cube 
roots even if the roots of the equation are real. 
In fact, it has been proved that it is not pos- 
sible to solve this equation within the real 
numbers in this case; Lë, if the cubic equation 
is irreducible over the extension Qía, b, c, d) of 
the rational number field Q, and if all of its 
roots are real, it is impossible to find the roots 
only by rational operations and with real 
radicals. This is called the Casus irreducibilis. 
(4) A quartic equation de X^ +a, X 2+ ax ? 
+a, X +a, = 0 can be solved by means of 
reduction to a cubic equation (L. Ferrari) (— 
Appendix A, Table 1). Generally, the proce- 
dure of solving an algebraic equation, Le. 
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finding the roots of a given equation from its 
coefficients by means of a finite number of 
rational operations and extractions of radicals, 
is called a solution by radicals (or algebraic 
solution). The *general algebraic equation 
whose degree is > 5 cannot be solved by 
radicals ON H. Abel) (— 172 Galois Theory). 


E. Analytic Tbeory 


In this section, k denotes the field R of real 
numbers or the field C of complex numbers. 
These cases have been studied for a long time, 
for practical reasons. 

Concerning the case k = C, the field C is 
'algebraically closed; i.e., every equation with 
coefficients from C has a root in C (Gauss's 
tbeorem, called the fundamental tbeorem of 
algebra). Accordingly, in the field C, we always 
have equations (2) and (2^). 

Let «,,, x, be the roots of equation (1). 
Then each 2; is a continuous function of coeffi- 
cients a,, a, , a, Concerning the location of 
roots of f(X)-0 and f(X )=0 on the complex 
plane, we have the following theorems: 

(1) Any convex polygon on the complex 
plane containing the roots of f(X)=0 also 
contains the roots off’ (X) = 0 (Gauss). 

(2) Let C be a rectifiable ‘Jordan curve not 
passing through a root off(X) = 0. Then 
the number (C, f) of the roots of TUN )=0 
lying in the region enclosed by C is equal to 
( 1/2ni) fe o fed, where the multiplicity 
of the roots is taken into account. 

(3) Let C be a Jordan curve on the complex 
plane. If | f(z)|>|g(z)| at every point z on C, 
then the equations f= 0 and f+ g = 0 have the 
same number of roots (counting multiplicity) 
within the region enclosed by € (Roucbé's 
theorem). 

(4) The absolute value of a root of equation 
(1) is less than 


M=max(|a,/do|, ....|a,/ag|) ^ 1. 


(5) Let D be the *discriminant of f, and as- 
sume that |x;| <M (i— I, ..., n). Then |a; — xjl? 
z D/2My' À? =E, Since the value of 
|D|is known fromf'and one value of M is 
given by theorem (4), we have one value of E. 
If we draw a circle on the complex plane with 
center at the origin and with radius M, and if 
we cover it with a net whose meshes have 
diameters less than JE /2, then the interior of 
each mesh contains at most one root of f=0. 

When k=R,1¢., fERLX ], let B,,... . f, 
denote the distinct roots off= 0, and recall 
equation (2°). Suppose that f,,, pe R and 
the others ¢R. Then v—4 is an even integer 2x, 
and we can renumber f),,,,,f, so that f,,, 
cn Pare fl (ff denotes the con- 
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jugate of p) and Piu = Disks tix Pat = Py- 
In this case, f,,., f; are the real roots of 


equation (2), and the other fe are called the 
imaginary roots. 

(6) IFfeR[X]and aja,» .. 72,70, then 
the absolute value of any root of equation (1) 
is less than ] (Kakeya-Enestrém theorem). 

Concerning the real roots of an equationf 
= 0, where f eR[ XL we have the following 
theorems: Let N(a, b)(a, be R) denote the 
number of real roots in the interval (a, b). 
Furthermore, let V(c,, c5, . €p) denote the 
number of changes of sign in the sequence 
C1,€5, , Cp of real numbers, which is defined 
as follows: Suppose that we have the sequence 
Cy rely, after deleting the terms c; = 0 from 
the sequence ¢,, c5, , c. Then 


1c 
C )-3à (1 —sgnc, c,,,.). 

(7) NCO, œ)= V(ag, a, , a,) (mod 2) and 
N « V(Descartes's theorem). 

(8 Let Ver: V(f(c), f'(c), . . P“). Then 
N(a, b) =V(a) V(b) (mod 2) and N € V(a) - 
V(b) (Fourier's theorem). 

(9) We may assume that f =0 has no multi- 
ple roots. Construct a finite series f, = f, f, = 
f’, „fı of polynomials over R such that 
f-i*fdi-fui forj=1,2,..../—land fjeR, 
by successive application of the tdivision al. 
gorithm. Let V()- V(folo),file. Mell 
Then N(a, b) = V(a) = V(b) (Sturm's theorem). 
By means of this theorem we can determine the 
location of real roots as precisely as we wish. 

(10) In order that every root o; of an equa- 
tion f = 0 with a, > O lies on the left side of 
the imaginary axis, i.e., Re «;< 0, it is necessary 
and sufficient that in the following matrix the 
principal minors composed of the first r rows 
and first r columns be positive for al] r = 
1,2, , n (Hurwitz's theorem): 


Vey Gaye 


e 
e 
mo: 
i 

N 
Di 


Also, for f e C[ X ], various results have been 
obtained about under what conditions all the 
roots off= 0 lie on one side of a given straight 
line or inside a given circle (e.g. the unit circle) 
(- 301 Numerical Solution of Algebraic 
Equations). 
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A. Definition 


An algebraic function is a multiple-valued 
‘analytic function w = w(z) defined by an firre- 
ducible algebraic equation P(z, w) =0 with 
complex coefficients. 


B. History and Methods 


The theory of algebraic functions evolved from 
the works of C. F. Gauss, N. H. Abel, and C. 
G. J. Jacobi on felliptic functions in the early 
19th Century. Stimulated by their works, B. 
Riemann and K. Weierstrass established the 
foundations of the theory of complex functions 
and developed the important theory of alge- 
braic functions. 

The equation P(z, w) = 0 defines a curve in 
the 2-dimensional complex tprojective space 
with inhomogeneous coordinates z, w. Inves- 
tigations from this point of view were initiated 
by Riemann, A. Clebsch, and P. Gordan. 

This approach was followed by A. Brill, M. 
Noether, and the Italian school (F. Severi, C. 
Segre, etc.) and has developed into contempo- 
rary algebraic geometry (— 9 Algebraic 
Curves, 12 Algebraic Geometry). 

The set of tfunction elements w(z) satisfying 
P(z, w) = 0 is a tcomplex manifold 91, a closed 
(= compact) *Riemann surface, on which z and 
w are tmeromorphic functions. The field Ky, 
consisting of the meromorphic functions on R 
is an talgebraic function field C(z, w). Con- 
versely, for any closed Riemann surface 98, the 
field Ky is an talgebraic function field in one 
variable over C, and any pair of functions z 
and w with Ky = C(z, w) has the property that 
R is tconformally equivalent to the Riemann 
surface determined in the above fashion by the 
irreducible algebraic equation P(z, w) = O satis- 
fied by z and w. Two Riemann surfaces R,, R, 
determined by the equations P, = 0, P, = 0 are 
conformally equivalent if and only if the fields 
Ky, and Ka are C-isomorphic. This condition 
is equivalent to the existence of a tbirational 
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transformation between the algebraic curves 
P, = 0, P, = 0. The “analytic method" (the 
method of studying algebraic functions as 
functions on Riemann surfaces) is the creation 
of Riemann. It was extended by F. Klein and 
D. Hilbert, and later by H. Weyl, who es- 
tablished in his monograph [6] a rigorous 
foundation of the analytic method for the 
theory of algebraic functions. 

Given an arbitrary algebraic function field 
K in one variable over C, the set R of its 
*prime divisors with a suitable topology and 
analytic structure is a closed Riemann surface 
whose function field Ky coincides with K. The 
"algebraic method" (the method of studying 
algebraic functions as elements of an algebraic 
function tield) was founded by J. W. Dedekind 
and H. Weber at the end of the 19th Century. 
In the 20th Century, the algebraic method has 
made remarkable progress, owing to the devel- 
opment of abstract algebra. It covers the case 
of an arbitrary ground lield as well as that 
of more than one variable. The theory of 
algebraic functions has had considerable in- 
fluence on the development of number theory 
because of a basic analogy between the two 
subjects. 

The universal covering spaces (surface) $ of 
a closed Riemann surface R can be regarded, 
by conformal mapping, as the Riemann 
sphere, the plane, or the unit disk (or, equiva- 
lently, to the upper half-plane) if the genus g 
of Ris 0, 1, or 22, respectively. Then the 
tcovering transformation group G, consisting 
of tlinear fractional transformations without 
fixed points in 9t, is *properly discontinuous 
and has a compact ‘fundamental domain. 
Conversely, if D is one of the three domains 
just mentioned and if G is the group just de- 
scribed, then = D/G is a closed Riemann 
surface such that D and G are its universal 
covering space and covering transformation 
group. A meromorphic function on §R is repre- 
sented as an tautomorphic function on 9t with 
respect to G. If y = 0, then G= {1 }, R= R, and 
Ka is the field of rational functions. If g — 1, 
then Ky is the field of ‘elliptic functions. The 
study of algebraic functions as automorphic 
functions was initiated by H. Poincaré and 
Klein. Recently, C. L. Siegel made a remark- 
able contribution to the investigation of the 
case of several variables. The theory of auto- 
morphic functions is also related to number 
theory. Works of E. Hecke, M. Eichler, and 
G. Shimura on this domain are noteworthy 
(- 32 Automorphic Functions, 73 Complex 
Multiplication). 

Another important topic concerning alge- 
braic functions (closed Riemann surfaces) is 
the problem of moduli. Riemann stated, with- 
out rigorous proof, that the set of conformal 
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equivalence classes of closed Riemann surfaces 
of genus g (> 2) depends on 3g = 3 complex 
parameters, called moduli. This has led to the 
modern theory of Teichmiiller spaces, which is 
developing into an extensive new field (- 234 
Kleinian Groups, 416 Teichmiiller Spaces). 

In the rest of this article, we deal mainly 
with the analytic method (for the case of two 
variables  [1]). 


C. Abelian Differentials 


An Abelian differential on a closed Riemann 
surface R is, by definition, a complex tdif- 
ferential form = a(z)dz, where a(z) is a mero- 
morphic function of a local parameter z. Such 
a differential is said to be of the first kind if 
a(z) is holomorphic, of the second kind if the 
residue vanishes everywhere, and of the tbird 
kind ` otherwise. 

The indelinite integral W(p) = Lë w of an 
Abelian differential œw, where py is assumed not 
to be a pole of @, is called an Abelian integral. 
It is said to be of the first, second, or tbird kind 
if the same holds for o If y is a 1-tcycle on R, 
the quantity Lo is referred to as the period of 
œ along y. An elliptic integral is defined to be 
an Abelian integral on a closed Riemann sur- 
face of genus 1. For example, this is the case if 
the equation P(z, w) = 0 defining the surface is 
of degree 2 with respect to w and of degree 3 
or 4 with respect to z. More generally, a closed 
Riemann surface is called byperelliptic if P(z, w) 
is of degree 2 with respect to w, or, equiva- 
lently, if R carries a meromorphic function 
with exactly two poles. An Abelian integral on 
such a surface is called a hyperelliptic integral. 

On a closed surface 98, let V, be the linear 
space over C of the Abelian differentials of the 
first kind. Given a l-cycle x of R, there exists a 
unique c, V, such that Re f ,€, 18 equal to the 
tintersection number (X, y) for every 1 -cycle y. 
This differential is also characterized by the 
property ((c, @,)g =) In WA*O,= — oe 
ho for every we V,. If 104, . ..a.,} forma 
basis of the 1-dimensional thomology group 
with integral coefficients, then Re œ, (i= 1, ., 
2g) form a basis of the linear space V, over R 
of the "harmonic differentials on R as well as 
that of the space ( Re w œw €V,}. Accordingly, 


dim, r, =g, dim, V, = 2g. 


These identities show a close relationship 
between the topological structure of R and the 
space of the Abelian differentials on R (— 194 
Harmonic Integrals). 

One can choose a 1-dimensional homology 
basis {a;,0,;;}7-, SO that (2,05) =(Xg+i 04) = 
0, (2;, 6, ,;) — 1, and (2j, 0,,,) 0 (+j) (j= 
1,2, , g). Such a basis is called a canonical 
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homology basis. If œ and g are Abelian dif- 
ferentials of the first kind, then we have 


eio 71$ (| o a 


Zo 
-— | o| d ZU. 
g*j aj 
uy 


Together, these are called the (Riemann) period 
relation. The second formula implies that c) 
vanishes identically if the periods of œw along 
alla; (j 2 1,.., y) are zero. 

Let 0445 X, be a 1-dimensional homology 
basis, and let w,, . ,@, form a basis of V, over 
C. The g x 2g matrix Q with Lu as its (i, j)- 
component is called a period matrix. Corre- 
sponding to the change of bases (a) and (w), it 
is subject to transformation into the form 
AQM, where A is a g x g invertible complex 
matrix and M is a 2g x 2g integral square 
matrix with determinant + 1. Conversely, two 
Riemann surfaces are conformally equivalent if 
they possess period matrices transformable to 
each other in this manner (Torelli’s theorem). 
We can choose @,,..., 0, so that the cor- 
responding period matrix with respect to a 
canonical homology basis becomes (/,, T) with 
the g x g unit matrix 7,. Then from the period 
relation, T is symmetric and Im T is positive 
definite. 

On the complex linear space C?, consider 
the subgroup generated by the 2g column 
vectors of a period matrix Q (the subgroup 
is also denoted by Q). Since it is of rank 2g 
and properly discontinuous, a group mani- 
fold C?/Q) is obtained. It is determined by R 
uniquely up to analytic isomorphism and is 
called the Jacobian variety of R. The general- 
ized Jacobian variety is introduced in a simi- 
lar fashion by means of Abelian integrals of 
the second and third kinds (- 9 Algebraic 
Curves). 


D. The Riemann-Roch Theorem 


In the present context, a 0-tchain with inte- 
gral coefficients on a Riemann surface H is 
referred to as a divisor. A divisor d =} nip; 

(n; € Z, p; e R) is an integral divisor (or positive 
divisor) if n; > 0 in the reduced expression; d is 
a prime divisor if it consists of a single point p, 
and n, = 1. A divisor of a meromorphic func- 
tion for an Abelian differential o is defined 
by taking the p; as the zeros (poles) off or œ 
and nj( = nj) as the multiplicity of the zero 


38 


(pole) at p;. The divisors on R constitute an 
Abelian group D in which principal divisors, 
i.e., divisors of meromorphic functions, consti- 
tute a subgroup $. The factor group D/$ is 
called the divisor class group; an element of it is 
called a divisor class. The divisors of Abelian 
differential constitute a single divisor class, 
which is referred to as the canonical divisor 
class (or differential divisor class). The degree 
and the dimension of a divisor class D are 
detined as follows, independent of the choice of 
the representative d =}, n;p;e D:degD = Xn; 
dim D = dimc{ f|f is meromorphic, (divisor of 
f)+ dis a positive divisor}. For example, the 
degree of the principal divisor class is zero. 

In terms of these concepts, the Riemann- 
Roch theorem is stated as follows: For a divi- 
sor class D on a closed Riemann surface R of 
genus g and for an integer n, we have 


dim(D + nW)—dim(— D —(1—n)W) 
=deg D+(2n—1)(g—-1), 


where W is the canonical divisor class (- 9 
Algebraic Curves). 

This theorem implies the following prop- 
erties of R: (i) deg W=2g 2. (ii) The holo- 
morphic invariant forms dz? (i.e., analytic 
tensors of order 2), referred to as quadratic 
differentials, constitute a linear space over C of 
dimension 0 (if g= 0), 1 (if g =1), or of dimen- 
sion 3g = 3 (if g > 2). The quadratic differentials 
have close connection with textremal quasi- 
conformal mappings and play an important 
role in the theory of Teichmiiller spaces (^ 
352 Quasiconformal Mappings, 416 Teich- 
müller Spaces). (iii) For a point pe , a posi- 
tive integer m is called a gap value if R carries 
no meromorphic function having a pole only 
at p with multiplicity m. Then if g = 0, no point 
has gap values; and if g > 1, every point p has 
exactly g gap values; in this case, p also has 
a nongap value m € g + 1. A point p is called 
an ordinary point if the gap values at p are 
1,2, , g; otherwise p is called a W eierstrass 
point. If y 22, then the total number N of 
Weierstrass points is not less than 2g + 2 and 
not greater than (g = 1)g(g+ 1) (A. Hurwitz). 
Moreover, the case N = 2g + 2 occurs if and 
only if ‘R is hyperelliptic, and then the gap 
values at Weierstrass points are 1, 3,..,2g 1. 
This implies that every closed Riemann surface 
of genus 2 is hyperelliptic. (1v) Suppose the 
genus g of Ris 2 2. A conformal mapping f of 
R onto itself with the property that every 1- 
cycle y is always homologous to f(y) is neces- 
sarily the identity transformation. Also, § is 
known to admit only a finite number of con- 
formal mappings onto itself (H. Schwarz); 
the total number does not exceed 84(g — 1) 
(Hurwitz). 
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E. Abel's Tbeorem 


Abel’s tbeorem is stated as follows: A divisor d 
of degree zero is a principal divisor if and only 
if it is expressed as d= ĝy by means of a 1- 
chain y that has the property that f, œ = 0 for 
every oe E. 

Given a divisor class D of degree zero, con- 
sider a 1-chain y with Odre D. For every l-cycle 
g there corresponds the quantity 


sald) =e0p( 2m / —1 Re | o), 
y 


independent of the choice of y. Thus D deter- 
mines a character on the 1-dimensional ho- 
mology group, called the integral character. 
Conversely, every character on the homology 
group is shown to be the integral character 
of some D. In terms of this notion, Abel's 
theorem can be stated as follows: D is the 
principal divisor class if and only if y (D) 

1 for every o. This result shows that the 1- 
dimensional homology group with integral 
coefficients and the group of the divisor classes 
of degree zero (with compact topology) are, 
with respect to integral characters, mutually 
dual (in the sense of Pontryagin) topological 
Abelian groups (— 422 Topological Abelian 
Groups). For the relationship between Abelian 
integrals and Jacobian varieties, in particular 
the *Jacobi inverse problem, *Abelian func- 
tions, and *Riemann theta functions, — 3 
Abelian Varieties L. Also — references to 234 
Kleinian Groups, 352 Quasiconformal Map- 
pings, 367 Riemann Surfaces, 416 Teichmüller 
Spaces. 
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A. Introduction 


Algebraic geometry is the branch of mathe- 
matics that deals with talgebraic varieties, that 
is, point sets detined by several algebraic equa- 
tions in a space of any dimension or those 
derived from these sets by means of certain 
constructions (- 16 Algebraic Varieties). It 
may also be considered to be a theory of the 
tfield of algebraic functions in several variables 
in geometric language, and it is closely related 
to the theories of complex analytic manifolds, 
commutative algebra, and homological alge- 
bra. It also has an important connection 

with number theory through the theories of 
automorphic functions, Diophantine equa- 
tions, and zeta functions. 

To investigate local properties of algebraic 
varieties we consider varieties embedded in an 
taffine space; to study global properties we 
usually consider varieties contained in fprojec- 
tive spaces. A quantity (or property) that is 
invariant under ‘projective transformations, 
isomorphisms, i.e., tbiregular and tbirational 
transformations, or birational transformations 
is called a tprojective invariant, a relative 
invariant, or an absolute invariant (birational 
invariant), respectively. The study of projective 
invariants is a part of projective geometry, 
whose methods are important in algebraic 
geometry. The notions of relative invariant 
and absolute invariant are used, for example, 
in the classification of algebraic varieties. 
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We usually assume that the coordinates of 
each point of the variety belong to a certain 
fixed "feld K. In the classical case, namely, 
when the field K is the field C of complex 
numbers, the algebraic varieties are considered 
as complex spaces and are studied by apply- 
ing the theories of partial differential equa- 
tions, *differential geometry, etc. Topological 
methods may also be applied. Algebraic geom- 
etry originated from such studies, but, for the 
study of properties such as trational mappings 
or talgebraic systems, it became necessary 
to consider as well the case where the ground 
field K is not talgebraically closed. Further- 
more, to apply this to number theory, it is 
necessary to establish the theory over the field 
of any tcharacteristic p. For this purpose it is 
necessary to establish a theory for varieties 
having ground domains as general as possible. 


B. History 


Analytic geometry began with the study of 
lines and quadratic curves (surfaces) and later 
came to include the study of cubic and quartic 
curves (surfaces), and so on. These subjects 
originally belonged exclusively to analytic (or 
projective) geometry. At that time, the study 
could not have been described by so specific a 
title as algebraic geometry. 

The study of such theories as the construc- 
tion of an algebraic plane curve by families 
of curves of lower degree or the talgebraic 
m-n correspondence on a straight line prob- 
ably began with research such as that by M. 
Chasles. The most outstanding event in the 
history of algebraic geometry was the intro- 
duction and development of the theory of 
algebraic functions C 11 Algebraic Func- 
tions) by B. Riemann (1857). Before that time 
the degree of an algebraic curve (surface) was 
the only quantity known to be a projective 
invariant of the curve (surface). 

With the theory of algebraic functions, 
Riemann gathered into one family all the 
Curves that can be transformed onto each 
other by birational transformations. As the 
basis for his study, Riemann examined bira- 
tional transformations in place of projective 
transformations. This idea led to the notion of 
the so-called *Riemann surface. The ‘genus of 
the surface was obtained as the characteristic 
number of the family of curves. The concept 
of genus was the first absolute invariant to 
appear in the history of algebraic geometry. 
Riemann based his theory on fAbelian inte- 
grals using *Dirichlet's principle, under the 
assumption that any algebraic curve reduces 
to one without tsingularities. 

After Riemann many mathematicians tried 
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to reconstruct the theory more precisely with- 
out using transcendental methods. M. Noether 
attempted this reconstruction by using geo- 
metric methods. Using the tCremona trans- 
formation, he confirmed Riemann’s assump- 
tion for curves: that any algebraic curve on a 
plane can be transformed by a birational 
transformation to a plane curve without sin- 
gularities except for simple tnodes. He also 
contributed in making more precise the basis 
conditions for the tRiemann-Roch theorem, 
which is considered to be one of the most 
important theorems in the field. His results 
on space curves and surfaces are also note- 
worthy. J. Plücker detined the concept of 
genus in geometric terms and introduced the 
tPliicker coordinates. A. Cayley and A. Brill 
worked along similar lines. Cayley’s idea was 
developed later by B. L. van der Waerden and 
W. L. Chow, who introduced the tassociated 
form of an algebraic variety and its tChow 
coordinates. 

Around 1890 the Italian school of algebraic 
geometry appeared. Following the tradition 
established by Noether, they employed alge- 
brogeometric methods and uncovered many 
new facts concerning algebraic surfaces. 
Among those who belonged to this school 
were G. Castelnuovo, F. Enriques, and F. 
Severi. 

In France, H. Poincaré and E. Picard ini- 
tiated their study of algebraic functions of two 
complex variables. After them S. Lefschetz 
investigated the theory of complex algebraic 
surfaces [11, 123. The results attained by the 
Italian and French schools were very sugges- 
tive but lacked rigorous foundations. 

On the other hand, rigorous number- 
theoretic theories of algebraic curves appeared 
in Germany. R. Dedekind and H. Weber de- 
veloped the theory of algebraic function fields 
parallel to that of falgebraic number fields. K. 
Hensel introduced the concept of tp-adic num- 
bers in analogy to fpower series expansions of 
analytic functions. E. Noether constructed an 
abstract theory of tpolynomial ideals from a 
formal theory by E. Lasker and F. S. Macau- 
ley. Under her influence there appeared the 
arithmetic algebraic geometry (of curves) 
over an abstract tield as developed by F. K. 
Schmidt and others. 

In the higer-dimensional case, van der 
Waerden attempted to create a more rigorous 
foundation for algebraic geometry under the 
influence of Noether's abstract ideal theory 
(c. 1930) [14]. He introduced the concept of 
*generic points and tspecialization, and specifi- 
cally defined the *multiplicity of intersections 
of two varieties in a projective space. He 
succeeded in getting a rigorous proof of Be- 
zout's theorem: In n-dimensional projective 
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space, the number of intersections of an r- 
dimensional algebraic subvariety of degree | 
with an (n — r)-dimensional subvariety of 
degree m is always [m if they intersect in only 
a finite number of points. 

The problem of intersections was taken up 
by C. Chevalley and A. Weil in the 1940s. 
Chevalley developed the ideal theory of ‘local 
rings (studied initially by W. Krull); he intro- 
duced topological concepts and applied them 
to the problem of intersections. The theory in 
this direction was later extended further by P. 
Samuel, M. Nagata, and J.-P. Serre. 

Weil gave foundations of algebraic geometry 
over an abstract field and reconstructed the 
theory by introducing geometric language to 
designate objects of abstract algebra [15]. He 
thus gave quite a new aspect to the theory and 
extended H. Hasse's arithmetization of the 
theory of algebraic functions in one variable to 
the case of several variables. Reconstructing 
Severi's theory of algebraic correspondence 
over abstract fields, he succeeded in proving 
an analogy of the *Riemann hypothesis on 
‘congruent zeta functions (- 450 Zeta Func- 
tions N). He also constructed, purely algebrai- 
cally, the entire theory of *Abelian varieties 
independent of characteristic. 

Around 1930, 0. Zariski gave another 
foundation to algebraic geometry by applying 
the generalized -- Valuation theory that had 
been introduced by Krull. Zariski clarified 
especially the properties of birational trans- 
formations by using valuation theory. Zariski's 
main theorem states that if a birational map- 
ping is not 'regular at a tnormal point P (- 

16 Algebraic Varieties 1), each component of 
the image of P by the mapping is of dimension 
21. 

Zariski also solved the problem of treso- 
lution of singularities in the affirmative in the 
case of characteristic 0 for varieties of dimen- 
sion « 3. The affirmative resolution of this 
problem (which Riemann assumed) says that 
any algebraic variety in a projective space can 
be transformed birationally to a projective 
algebraic variety without singularities. In 
1964, H. Hironaka gave an affirmative 
answer for any dimension in the case of char- 
acteristic 0. 

Along with the achievements in algebraic 
methods, great development took place in 
analytic methods. Unification of the concepts 
of Riemann surfaces and *Riemannian mani- 
folds led to the concept of tcomplex analytic 
manifolds. Furthermore, G. de Rham's theo- 
rem on the duality of topologically defined 
homology and cohomology based on differen- 
tial forms was proved; also, W. V. D. Hodge's 
theory of tharmonic integrals was developed. 
In the case of the complex dimension 1, any 
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compact Riemann surface is derived from a 
certain projective algebraic curve. However, 
the situation is not so simple in the case of 
higher dimensions. Weil's concept of tabstract 
complete algebraic varieties can be consid- 
ered as an analog of compact complex mani- 
folds. If a compact complex analytic variety is 
projective, then it must be an algebraic variety 
(fChow’s theorem). K. Kodaira proved that a 
necessary and sufficient condition for a com- 
pact complex analytic manifold to be biholo- 
morphically equivalent to a projective com- 
plex analytic manifold is that the manifold is 

a tHodge variety. 

Using results on harmonic integrals, J. Igusa 
and Weil established the theory of +Picard and 
tAlbanese varieties associated with algebraic 
manifolds of arbitrary dimensions as a gen- 
eralization of the theory of tJacobian varieties 
associated with algebraic curves (- 9 Alge- 
braic Curves E). Thus many ambiguities in 
the theory as developed by the Italian school 
were clarified. Later the theory was gen- 
eralized to the case of characteristic p by T. 
Matsusaka, Chow, and S. Lang. The duality 
theorem in this case was ]ater proved by M. 
Nishi and P. Cartier (- 3 Abelian Varieties 
D). 

The concept of *sheaves (- 383 Sheaves) 
had already been used in Kodaira's theory. 
Serre defined an abstract algebraic variety as a 
tringed space by using an analogy to the con- 
cept of complex analytic spaces ; he considered 
it as a topological space with respect to the 
*Zariski topology. By introducing tcoherent 
algebraic sheaves, Serre claritied the idea that 
classical invariants (such as tarithmetic genus) 
may be considered cohomological quantities 
(— 16 Algebraic Varieties E). 

A. Grothendieck invented the concept of a 
'scheme, which is far more general than that of 
an algebraic variety, by admitting the existence 
of *nilpotent elements in structure sheaves and 
taking as a coordinate ring a general commu- 
tative ring with unity element. By the device 
of taking into account nilpotent elements, an 
analog of the method of successive approxima- 
tion in analysis is now applicable. By master- 
ful use of cohomological techniques, Grothen- 
dieck derived many results, including Zariski’s 
important theorems. 

In the classical case, F. Hirzebruch gen- 
eralized the Riemann-Roch theorem to higher- 
dimensional manifolds. He made use of the 
language of sheaves and some topological 
results of A. Borel and R. Thom [S]. Later 
Grothendieck generalized the theorem for the 
abstract case as well. His idea in this work is 
recognized as the origin of ! K-theory. 

Every nonsingular complete curve of genus 
0 is isomorphic to the projective line, and any 


12 Ref. 
Algebraic Geometry 


nonsingular complete curve of genus 1 is iṣọ- 
morphic to a projective curve delined by the 
equation X2 X, 2 X,(X, -X,)(X, —AX,) for 
some 4 z 0, 1. On the other hand, the set of all 
isomorphism classes of nonsingular complete 
curves of genus g > 1 is parametrized by a 
normal quasiprojective variety of dimension 
3g = 3. Such facts were first discussed by Rie- 
mann as the problem of moduli. 

Concerning the moduli of manifolds of 
higher dimensions in the classical case, 
Kodaira and D. C. Spencer developed their 
theory of tdeformations of complex structures 
(^ 72 Complex Manifolds G). The meaning of 
number of tmoduli is clarified by deformation 
theory. Deformation theory has been extended 
in various ways, and deformation is considered 
as one of the fundamental concepts in alge- 
braic geometry. 

To investigate the global structure of the 
moduli varieties, D. Mumford introduced 
geometric invariant theory (- 16 Algebraic 
Varieties W). 

Etale and crystalline cohomologies initiated 
by Grothendieck and others are useful for the 
study of algebraic varieties of positive charac- 
teristic. In particular, the conjecture made by 
Weil concerning *congruent Zeta functions has 
been solved affirmatively by P. Deligne with 
the help of étale cohomology [2]. 

Many important questions have been 
answered by means of the geometric theory. 
For example, (1) every tvector bundle on Aj is 
trivial (- 16 Algebraic Varieties Z); (2) there 
exist tunirational but nonrational fields over C 
(- 16 Algebraic Varieties J); (3) the funda- 
mental groups of the complements of node- 
curves on Pé are commutative (- 16 Alge- 
braic Varieties I); (4) the cancellation theorem 
holds for A? (— 15 Algebraic Surfaces H); (5) 
whenever the tangent vector bundles are am- 
ple, the varieties are P; (^ 16 Algebraic Var- 
ieties R). 
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A. Definitions and General Remarks [1,2,8] 
Let k be a field and Q a tuniversal domain 
containing it. An affine algebraic group G 
defined over Kis, by definition, a group G 
which has the structure of a (not necessarily 
irreducible) falgebraic variety defined over kin 
an affine space ON such that the group oper- 
ation (x, y) x ! y on G is an everywhere 
*regular trational mapping defined over k For 
such a group G, the set G, of all k-trational 
points on G is an abstract group. The tirreduc- 
ible component G, of G (viewed as an alge- 
braic set) containing the identity element eg is 
unique and is a normal subgroup defined over 
k with finite index in G; the decomposition of 
G into (absolute) irreducible components 
coincides with that into the cosets of G by G.. 
When G = Gp, the group G is called connected. 
It should be noted that for an algebraic group 
G defined over K, a coset gG, is not necessarily 
delined over K and if it is, a representative g 
cannot necessarily be taken to be k-rational. 
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However, if kis an infinite field and G is con- 
nected, and if moreover k is tperfect or G is 
‘reductive, then G, is tZariski dense in G ([9], 
A. Borel and T. A. Springer [30] ); hence G is 
uniquely determined by G,. (In this case, G, is 
sometimes referred to as a k-group.) When 

kis a topological geld, G, is a topological 
group with respect to the natural topology de- 
fined by that of k, which is generally stronger 
than the Zariski topology on G,. For instance, 
when k= R, (Gj)g is a "Lie group with finitely 
many connected components. 

A subgroup H of an affine algebraic group 
G, which is Zariski closed, 1s an affine alge- 
braic group with respect to its natural induced 
structure and is called an algebraic subgroup 
of G. If H is defined over k then H is k-closed; 
the converse is also true when Kis perfect, 
(An affine algebraic set A is called k-closed 
if A is acommon zero of a set of polynomial 
equations with coefficients in K The set A is 
k-closed if and only if all irreducible com- 
ponents of A are detined over the algebraic 
closure k of k, and for every *Galois auto- 
morphism g of k/k, A* — A.) The notions of 
homomorphism and isomorphism for alge- 
braic groups can be detined in a natural 
manner. For instance, for affine algebraic 
groups G and G' defined over K, a rational 
bomomorpbism : G + G’ defined over K or a 
k-morpbism for short, is a homomorphism of 
G into G' that is at the same time an (every- 
where regular) rational mapping defined over 
k. For a k-morphism o of G into G’, the (set- 
theoretic) image @(G) is a closed subgroup 
of G' defined over K, the kernel o ~! (e^) is a k- 
closed subgroup of G, and dim 9(G) = dim G = 
dim `! (e°). In particular, when dim G = 
dim o(G)— dim G’ (or equivalently, when 
~(Gp) = Gy and o! (e') is finite), o is called an 
isogeny. (Two groups G and G’ are called 
isogenous if there exist a third group G" and 
isogenies G" 5G, G" 5 G'.) When a k-morphism 
o is bijective and o lis also an (everywhere 
regular) rational mapping defined over k, @ is 
called a birational isomorphism defined over K, 
or a k-isomorphism for short. It should be 
noted that a rational homomorphism which 
is an isomorphism of abstract groups is 
not necessarily an isomorphism of algebraic 
groups (e.g. a Frobenius homomorphism); a 
similar statement holds for an injective con- 
tinuous homomorphism of topological groups. 

Given a connected affine algebraic group G 
and a closed subgroup H, both defined over k, 
the quotient space G/H has the uniquely deter- 
mined structure of an algebraic variety defined 
over k such that the canonical mapping G-> 
G/H is separable. The 'function field of G/H 
is then identified with the subfield of the func- 
tion field of G formed by all H-invariant ele- 
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ments. In particular, if H is a closed normal 
subgroup, then G/H has a natural structure of 
an affine algebraic group (defined over the 
same ground field k) [8, 10, 11]. 

As an example, we have the group GL{n) of 
all n x n nonsingular matrices (xj). GL(n) may 
be viewed as an algebraic set in Q"' *!, defined 
by a single equation det(x;) y = 1, and as such 
is a connected algebraic group defined over 
the ‘prime field. In general, an algebraic group 
reahzed as a closed subgroup of GL{(n) is called 
a linear algebraic group. Since an affine alge- 
braic group is always isomorphic to a linear 
algebraic group, these two terminologies are 
essentially synonymous [ 1, 2]. 


B. Generalization of the Definition 


Replacing the term affine algebraic set (or 
affine cariety) in the definition of an affine 
algebraic group by a more general term ‘alge- 
braic variety, we obtain the notion of an 
algebraic group (algebraic group variety, or 
simply group variety). On this subject, the 
following facts are fundamental. A *complete 
connected algebraic group is an Abelian 
variety (— 3 Abelian Varieties). More gener- 
ally, given a connected algebraic group G 
defined over k, there always exists a (k-closed) 
largest linear connected closed normal sub- 
group L, and the factor group G/L is an 
Abelian variety. Furthermore, for a closed 
normal subgroup H of G, the factor group 
G/H is complete if and only if H > L (Che- 
valley’s theorem [8 ]). In particular, if a con- 
nected algebraic group G is complete and 
linear at the same time, then G reduces to the 
identity group. In view of these theorems, the 
study of algebraic groups can be reduced, in a 
sense, to the study of Abelian varieties and 
linear algebraic groups. For this reason, we 
henceforth restrict ourselves to linear algebraic 
groups, which are simply called algebraic 
groups. (The notion of tgeneralized Jacobian 
variety, introduced by M. Rosenlicht [12], is 
an example of an algebraic group in a general 
sense; — 9 Algebraic Curves.) The notion of 
algebraic groups has been generahzed further 
to that of ‘group schemes by A. Grothendieck 


[3,4]. 


C. Lie Algebras 


Since an algebraic group G defined over k has 
no singularities, the *tangent space g to G at 
the identity element e is detined and has the 
same dimension as G : dim g = dim G. The space 
g can be identified in a natural manner with 
the space of all left-invariant *derivations of 
the function field of G, and thus has the struc- 
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ture of a Lie algebra defined over K(— 248 Lie 
Algebras). We call g (the Lie algebra g, over k 
of all k-rational points in g) the Lie algebra of 
G (of the k-group G,). If G is a linear algebraic 
group contained in GL (n), then g, is a Lie 
subalgebra of al(n, k) with the Lie product 
defined by [x, y] = xy = yx; a linear Lie algebra 
corresponding to a linear algebraic group is 
called an algebraic Lie algebra. When the 
characteristic of Kis zero, conditions for a 
linear Lie algebra to be algebraic can be given 
in terms of the replica [5]. Also, in the case of 
characteristic zero, for x € gl(n, K), zen, if and 
only if exp(tx)e G, where ( is a variable over k 
and exp(tx) is understood as a fformal power 
series in f (contained in LL From this, we can 
prove, exactly as in the theory of Lie groups 
(— 249 Lie Croups), a one-to-one correspon- 
dence between k-closed subgroups H of G and 
algebraic Lie subalgebras b, of g,, establish- 
ing a complete parallelism of the theories of 
algebraic groups and Lie algebras [5]. This 
parallelism breaks down when k has positive 
characteristic [1, 2]. On the other hand, over a 
field of characteristic p > 0, we have formal 
groups, an analog of local Lie groups intro- 
duced by J. Dieudonné [ 13], and also *hyper- 
algebras, which play the role of Lie algebras in 
characteristic 0. 


D. Tori [1,2] 


The group G,, = GL(1), the multiplicative 
group of nonzero elements in Q, is a 1- 
dimensional connected algebraic group defined 
over the prime field. In general, an algebraic 
group G that is isomorphic to the direct prod- 
uct (G,,)" is called an (algebraic) torus. When a 
torus G defined over K is isomorphic to (G,,) 
over an extension K of k, G is called K-trivial 
(or K-split), and the field K is called a splitting 
field for G. A torus G defined over Kalways has 
a splitting field K which is a finite separable 
extension of k 

In general, a rational homomorphism y of 
an algebraic group G into G, is called a char- 
acter of G. If we define the sum of two charac- 
ters y, and x» of G by Gri  y3)(g) lgl: ys(g) 
(g € G), the totality of characters of G is an 
additive group, called the character module of 
G and denoted by X(G). Let G be a torus 
defined over k and X =X(G) its character 
module, and let K be a splitting field for G 
that is a finite tGalois extension of k. If a K- 
isomorphism G z (G,,)" is given by the corre- 
spondence G3g(y,(g), , y,(g)) then the 7; 
are characters of G, and X is a 'free module of 
rank n generated by y,,,.,,y,. Furthermore, if 
I’ denotes the Galois group of K/k, then for 
c c I and ye X, the conjugate y* is also a char- 
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acter of G; under this action of I’, X becomes a 
right f-module. We have complete duahty 
between a torus G and its character module X 
in the following sense. There exists a one-to- 
one correspondence between the closed sub- 
groups G, (defined over K) of G and a (T- 
invariant) submodule X, of X for which X/X, 
has no p-*torsion (where p is the character- 
istic of k). This correspondence is determined 
by the relation of the annihilators X,= Gi, 

G, — X1, and under the correspondence the 
character modules of G, and of G /G, are 
canonically identiiied with X/X, and X, 
respectively. Furthermore, let G’ be another 
torus (defined over Kand split over K) with the 
character module X', and suppose that we 
have a (k-) homomorphism (: G— G'. Then we 
can define a (I’-) homomorphism 'g: X’> X, 
called the dual homomorphism of q, by the 
relation ‘p(y’) = yo 9 for Y € X^; conversely, 
any (I-) homomorphism of X’ into X is ob- 
tained uniquely in this manner. In particular, 
Q is a (k-) isomorphism if and only if its dual 
mis a (I-) isomorphism. Since for any free (I-) 
module X of finite rank there always exists 

a torus G (defined over Kand split over K) 
such that X(G) = X (as a special case of the 
existence theorem of k-forms), the categories 
of all tori (defined over Kand split over K) and 
that of all free ([--) modules of finite rank are 
mutually dual. 


E. Semisimple Elements and Unipotent 
Elements 


A matrix q is called *semisimple if it is diago- 
nalizable, i.e., if the *minimal polynomial of q 
has only simple roots. A matrix q is called 
*unipotentifa 1 is nilpotent, i.e., if all charac- 
teristic roots of q are equal to 1, (When the 
characteristic of the ground field is zero, the 
unipotent elements y in GL(n, K) and the nilpo- 
tent elements x in ql(n, k) are in one-to-one 
correspondence by the relation jj = expx.) Any 
nonsingular matrix a can be written uniquely 
as a product of a nonsingular semisimple ma- 
trix a' and a unipotent matrix a" of the same 
size which are mutually commutative: a — 
ua oo (multiplicative Jordan decompo- 
sition); d (a^) is called the semisimple (unipo- 
tent) part of a and is denoted by a, (a,); a, can 
be expressed as a polynomial of the matrix a 
with scalar coefficients. For an element a of a 
(linear) algebraic group G, the semisimplicity 
(unipotency) of a does not depend on the 
matrix representation of G. Moreover, these 
properties are preserved by homomorphisms 
of algebraic groups. Also, if qe G, then a,, aE 
G. 

For an algebraic group G, we denote the 
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totality of semisimple (unipotent) elements 
contained in G by G, (G,) and call it the semi- 
simple (unipotent) part of G. (Note that G, and 
G, are not necessarily subgroups.) A torus G is 
then characterized by the property that G — 
Go = G,. On the other hand, an algebraic group 
G such that G = G, is called unipotent. For 
instance, the additive group of the universal 


domain, 
1 x 
G,= xe», 
0 1 
is a 1 -dimensional connected unipotent alge- 
braic group. 





F. Solvable Groups and Nilpotent Groups 
[1,2, 7,9] 


For two closed normal subgroups H,, H, 
(deiined over k) of an algebraic group G, the 
*commutator group [H,, H,] (in the sense of 
abstract group theory) is also a closed normal 
subgroup (defined over k) of G. In view of this 
fact, an algebraic group G is called solvable 
(nilpotent), when it is ‘solvable (nilpotent) as 
an abstract group. For example, the totality 
T(n) of n x n nonsingular upper unipatent ma- 
trices, 1.e., matrices of the form 


Zu A 





- | , is a connected solvable algebraic 
0 * 
group. A unipotent algebraic group is always 
nilpotent. 

For any connected solvable algebraic group 
G c GL(n), there always exists an element a in 
GL(n) such that a^! Ga c T(n) (Lie-Kolebin 
tbeorem [ 1,2]). A connected solvable algebraic 
group G has a ‘composition series G = Gy DG, 
2:«26,- {e} such that each G; is a connected 
closed normal subgroup of G and G, ,/G; 
is isomorphic to either G,, or G,. If G is de- 
fined over k the subgroup G, is a connected k- 
closed normal subgroup of G, and for any 
maximal torus T in G, we have a decompo- 
sition into a tsemidirect product G = IG, (in 
the sense of algebraic groups, i.e., the natural 
map T x G,G is birational). [t is known 
that for any algebraic group G defined over k 
there exists a maximal torus defined over k 
([4], A. Borel and T. A. Springer [30]). G is 
nilpotent if and only if G has a unique maxi- 
mal torus T; when that is so, T=G, and T is 
contained in the *center of G. For a connected 
solvable algebraic group G defined over k we 
can take a € GL(n, K such that a ! Ga c Tin) 
(see the Lie-Kolchin theorem) if and only if all 
characters y € X(G) are defined over k; when 
this condition is satisfied, G is called k-solvable. 
G, is then defined over k, and G/G, is a k- 
trivial torus. 
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When the characteristic of k is zero, any 
commutative unipotent algebraic group (de- 
fined over K is (k) isomorphic to the direct 
product (G,)". When kis an algebraically 
closed field of characteristic p > 0, any con- 
nected commutative unipotent algebraic group 
defined over k is k-isogenous to a direct prod- 
uct of a certain number of the groups W,, of 
*Witt vectors (of length m) (Chevalley-Chow 
theorem [ 12]). A 1 -dimensional connected 
unipotent algebraic group detined over a per- 
fect field kis k-isomorphic to G, [ 1, 2]. 


G. Borel's Theory 


Let G be an algebraic group and V an alge- 
braic variety (both defined over k). We say 
that Visa transformation space of G (defined 
over k), or simply G acts on V, if there is given 
an everywhere regular rational mapping G x 
V3(g.v)gve V (defined over K such that 
g1(950) — (9195), ev —v (g,,g; € G, ve V). When 
the action of G on V is *transitive, V is called a 
*homogeneous space of G. For a closed syb- 
group H of a connected algebraic group G 
(both defined over k, the quotient space G/H 
has the natural structure of a homogeneous 
space of G (deiined over k). A. Borel [1,2] 
proved the following theorems: 

(1) If G is a connected solvable algebraic 
group and Va complete transformation space 
of G, then G has at least one fixed point in V. 
More precisely, in order that a connected 
algebraic group G defined over k be k-solvable, 
it is necessary and sufficient that for any com- 
plete transformation space V of G defined over 
k for which V, # (Zj, G have at least one k- 
rational fixed point in V [9]. 

(2) Let G be a connected algebraic group. A 
maximal connected solvable closed subgroup 
of G is called a Borel subgroup of G. Then (1) all 
pairs (T, B) formed by a maximal torus T in G 
and a Borel subgroup B containing it are con- 
jugate to each other with respect to inner auto- 
morphisms of G. (ii) For a closed subgroup 
H of G, the quotient space G/H is complete 
if and only if H contains a Borel subgroup 
of G; and, when that is so, G/H is actually a 
*projective algebraic variety. (For instance, 
if G = GL(n), B = T(n), then G/B is a so-called 
‘flag manifold.) (iii) The conjugates of B(T) 
cover the whole group G (G,). A closed sub- 
group of G is called parabolic if it contains 
a Borel subgroup of G. A parabolic subgroup 
H coincides with its own *normalizer N(H); in 
particular, H is always connected. Parabolic 
subgroups are significant in the theory of 
automorphic functions. (For the parabolic 
subgroups associated with BN-pairs — Sec- 
tions Q, R.) 
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When G is a connected algebraic group 
detined over a perfect field k, proposition (i) 
can be sharpened: The pairs (A, H) formed by 
a maximal k-trivial torus A in G and a maxi- 
mal connected k-solvable subgroup H con- 
taining it are conjugate to each other with 
respect to the inner automorphisms detined by 
elements in G,. The normalizer N(H) of such a 
k-solvable subgroup H is a minimal k-closed 
parabolic subgroup of G. When the maximal 
connected k-solvable subgroups of G are re- 
duced to the identity group, G is called k- 
compact or  k-anisotropic. (Otherwise, G is 
called k-isotropic.) For instance, the tortho- 
gonal group G = SO(n, f) of a tquadratic form 
f of n variables is k-compact if and only if the 
form f is anisotropic, i.e., the homogeneous 
equation f — 0 has no solution other than 
zero in k Similar facts hold for other classt- 
cal groups. When Kis a *local field, G is k- 
compact if and only if G, is compact as a 
topological group. In general, a k-compact 
group is treductive. 


H. The Weyl Group 


Let G be a connected algebraic group and Q 
an arbitrary torus in G. The ‘centralizer Z(Q) 
of Q is then connected and coincides with the 
connected component of the normalizer N(Q). 
Hence the factor group W= N(Q)/Z(Q) is finite 
and can be identified with a subgroup of the 
automorphism group of Q (or of its character 
module X(Q)) in a natural manner. The group 
W is called the Weyl group of G relative to Q. 
In particular, when Q = T (a maximal torus), 
the order of W is equal to the number of Borel 
subgroups containing T. In this case, the çen- 
tralizer C = Z(T) is called a Cartan subgroup of 
G; it is characterized by the property that C is 

a (maximal) connected nilpotent closed sub- 
group of G which coincides with the connected 
component of its own normalizer N(C). The 
notions of Borel subgroups, Cartan subgroups, 
and maximal tori are preserved under rational 
homomorphisms of algebraic groups. 


1. Semisimple Groups and Reductive Groups 


In an algebraic group G defined over k, there 
exists a largest connected solvable closed 
normal subgroup R, called the radical of G. 
The unipotent part R, of R is called the unipo- 
tent radical of G. When R = {e}, G is called 
semisimple. When R is a torus, namely, Rz 
{e}, G is called reductive. Semisimplicity and 
reductiveness are preserved under forming a 
direct product and taking the image (or in- 
verse image) of an isogeny. For a reductive 
group G, the +commutator subgroup D(G) is 
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e 
semisimple, and G = D(G): R, D(G) NR = finite; 
in other words, G is isogenous to the direct 
product of a connected semisimple algebraic 
group and a torus. In general, if R is the rad- 
ical of a connected algebraic group G and R, is 
the unipotent radical of R, then the factor 
groups G/R, G/R,, are semisimple and reduc- 
tive, respectively. Furthermore, if the charac- 
teristic of the field kis zero, there exists a 
reductive closed subgroup H of G such that G 
decomposes into a semidirect product G = 
H. R, (Chevalley decomposition [5]). (In this 
case, R and R, are k-closed, and H can be 
taken to be k-rational; such an H is unique up 
to inner automorphisms defined by elements in 
G,.) Also in the case of characteristic zero, 
reductive algebraic groups are characterized 
by the property that all rational representa- 
tions are completely reducible. But when k 
has the characteristic p >0, this property char- 
acterizes tori (M. Nagata). 


J. Root Systems [1,2,14] 

Let G be a connected semisimple algebraic 
group, T a maximal torus, and X =X(T) its 
character module. A character xe X is called a 
root of G relative to T if there exists an jso- 
morphism x, of G, onto its image in G such 
that 


tx(Dt-x,(a()£) f or all £eG,, teT. 


For a root 4, such an isomorphism x, is 
uniquely determined up to a scalar multipli- 
cation in G,; hence we put P, 7 x,(G,). 

If we denote by t the totality of roots (rela- 
tive to T), rsatisfies the following axioms, 
where E = X @ Q and E* is the *dual space of 
E with respect to the inner product ( >: (i) For 
each a € r, there corresponds &* € ES such that 
(x*, a> =2 and <a*, f) € Z for all fer, (ii) If we 
define a reflection w, of E by 


w,x-cx—(a*,x»u for xcE, 


then w,fer for all Ber. (In particular, w,«— 
—ger.) (iii) If a, fer are linearly dependent, 
then fj = +«. (iv) If dim E= r, t containsr 
linearly independent elements. 

In general, a finite subset r in a finite- 
dimensional vector space E over Q satisfying 
the axioms (1)-(1v) is called a root system in E 
(This root system is sometimes said to be 
reduced, to distinguish it from the root system 
defined in Section Q, which does not satisfy 
axiom (iii).) For a root system r, the elements 
ap of E* corresponding to er are uniquely 
determined by these conditions, and the set 
r* = {a*} is a root system in E* (a* er* is 
called a coroot). Also, the group W of linear 
transformations of E (E*) generated by w, 
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(w,«) with ger is finite and is called the Weyl 
group of the root system r, If we identify E* 
with E by means of any W-invariant (positive 
definite) metric on E. then a* =(2/<a, a>)a. 
When r is a root system of a semisimple alge- 
braic group, 


(a*.y»eZ forall wer, yex, (1) 


so that X is W-invariant, and the Weyl group 
of the root system r can be identified with the 
group N(T)/Z(T) of Section H. (In general, a 
maximal torus in a connected reductive alge- 
braic group coincides with its own centralizer, 
so that the Weyl group W can be identified 
with N( T)/T.) 

When a tlinear ordering (compatible with 
the addition) is given in E, we denote by r, the 
set of all positive roots in r. An element «er, 
is called a simple root if it cannot be written as 
a=% +o" witha’, aer, IfA-(a,,...,a,] is 
the totality of (distinct) simple roots in r, , the 
elements «,,, a, are linearly independent, 
and any root yer can be written uniquely in 
the form a = TZ mjo;, with m;e Z, m; > 0. In 
general, a subset A of r having this property is 
called a fundamental system; a fundamental 
system is always obtained in the manner ex- 
plained from a linear ordering on E. For a 
fundamental system A, the cone A, in E*, 
defined as the set of x in E* satisfying the 
inequalities Co, x) > 0 (1 € i € r), is called a 
Weyl chamber. If we denote by L, the hyper- 
plane defined by the linear equation (a, x) = 0 
for a root 2, then E* —| Jer La = (Ja Aa, and W 
acts *simply transitively on the set of all Weyl 
chambers {A,'. The Weyl group W is gen- 
erated by r reflections Wy, (1 &i&r). 

In a semisimple algebraic group G, Borel 
subgroups B containing a (fixed) maximal 
torus Tare in one-to-one correspondence with 
the fundamental systems A (or r+) relative to T 
by the relation B, = IL, P,, where P, = x,(G,). 
(More precisely, every element in B, can be 
written uniquely as a product of the elements 
in P,, where the ordering of the P, is taken 
arbitrarily.) 


K. Bruhat Decomposition 


If we take a representative s, of we W in N(T), 
there is a decomposition G=|)wew Bä, P 
(disjoint union). Furthermore, if for we W we 
put N, = IL, nwe, P,, and in particular N = 
N, = B,, and denote by wa the unique element 
in W such that Wa — -A, then the element 
in Bs, B can be written uniquely as a product 
of elements in Nww,» s, T, N. Hence we have 
G= U Nite Ns 

weW 


which is called a Bruhat decomposition of G. In 
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particular, if we put N? = sọ, (Nä, (which is 
the unipotent part of the Borel subgroup cor- 
responding to -A), then N'TN is a Zariski 
open set in G, and the natural map N’ x T x 
N =G is birational. This implies that the 
function field of G is rational (e, ‘purely 
transcendental over Q). 


L. Structure of Semisimple Groups 


A subset r, of a root system r is called a closed 
subsystem if v, , r= r,, where t, , denotes the 
submodule of X generated by r, A closed 
subsystem satisfies conditions (i), (ii), and (iii) 
of a root system. For a closed subsystem r, of 
a root system r of a semisimple algebraic 
group G, the subgroup G(r,) of G generated by 
the P, (x er ,) is a semisimple closed subgroup 
with a maximal torus T, =(G(r,)M T), of 
which the root system relative to T, coincides 
with the restriction of r, on T, and the coroot 
system can be identified with rf = (2* |aer,]). 
The subgroup G(r,) is normal if and only if 

r —r, is also a closed subsystem; when this is 
so, G- G(t): G(r—r,). G(r, ) G(r—r,) = finite. 
All connected closed normal subgroups of G 
are obtained in this manner. In order that G 
be simple (sometimes called ahsolutely simple 
or almost simple) as an algebraic group (i.e., 
without proper connected normal subgroups), 
it is necessary and sufftcient that t be irreduc- 
ible (i.e., r cannot be decomposed into a dis- 
joint union of two proper closed subsystems). 
In general, a root system r can be decomposed 
uniquely into the disjoint union r = r, UU r, 
of irreducible closed subsystems r; such that 

r; UU r (IX i € S) are also closed subsystems; 
correspondingly, G is isogenous to the direct 
product G, x x G, of (absolutely) simple 
algebraic groups G; = Girl (G is actually a 
direct product if it is simply connected or an 
adjoint group.) The subgroups G; are deter- 
mined uniquely and only by G. 


M. k-Forms [15] 


Let K be an extension of k and G, an alge- 
braic group defined over K. An algebraic 
group G defmed over k is called a k-form 

of G, if there is a K-isomorphism f of G onto 
G,. Suppose further that Kik is finite sepa- 
rable, and for every Galois automorphism 

c of k/k, put o, — f^of '. Then q, is an iso- 
morphism of G, onto Gf, and the q, satisfy 
the relation 9; o 9, = Ps Conversely, given 

a collection of isomorphisms { ¢,} satisfying 
these conditions, there always exists a k-form 
G (with a K-isomorphism f onto G, such 
that 9, — f^o f TL which is unique up to k- 
isomorphism (Weil). In particular, if K/k is a 
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finite Galois extension with Galois group I, 
then { g,} is a (continuous) *1-cocycle of I' in 
Aut,(G,) (the group of all K-automorphisms 
of G,), and by the above correspondence the k- 
isomorphism classes of k-forms G are in one- 
to-one correspondence with the (continuous) 
1-tcohomology classes of the cocycle EN (in 
the cohomology set H'(F, Aut,(G,))) (- 172 
Galois Theory J) 

To a given finite separable extension K /k of 
degree d and an algebraic group G, defined 
over K of dimension n, we can associate a 
certain algebraic group NR, AJ: defined over k 
of dimension dn, which is obtained from G, by 
restricting the ground field [ 19]. A more pre- 
cise definition is as follows. Let Io, G3, . , 4} 
(c, = 1) be a set of automorphisms of k/k such 
that o; K( 1 €i d) are all distinct. Then one 
can find a k-form G of Gs IH, Gti with an 
isomorphism f: GG, such that 9, 2 f^ of! 
is given by à, ((x;)) = (xj), where i” is defined 
by the relation (6,0) K = ojo K. If we denote 
by p, the canonical projection of C, onto its 
first component G, and put p= p, o f, then the 
pair (G, p) is uniquely characterized (up to k- 
isomorphism) by the following universality 
property: If G’ is any algebraic group defined 
over Kand o is a K-morphism of CG into G}, 
then there exists a (uniquely determined) k- 
morphism ¢ of G’ into G such that 9 = po @. 
The group G (together with p) is denoted by 
Rx(G,). For the group of rational points, 

G, = G, g. When the algebraic group G, has 
some additional structure (such as that of 
‘vector space, falgebra, etc.), then Ro: 
automatically has the same kind of additional 
structure. 


N. Chevalley's Fundamental Theorems 


Let G, G' be connected semisimple algebraic 
groups, and let T( T’) be a maximal torus in 
G(G'), X(X°) its character module, r (r) a root 
system of G (G’) relative to T (T), etc. If we 
have an *isogeny ọ of G onto G’ such that 
<p(T) = T’, then there is a bijection xa' of r 
onto r’ such that y(x’) = qa%, where y is the 
dual homomorphism of o T and q, is a posi- 
tive integer, which equals 1 if the characteristic 
is zero and is a power of p if the characteristic 
is p» 0. Conversely, any injective homomor- 
phism y : X'2 X satisfying this condition (with 
respect to a certain bijection rar and q,) 
cornes from an isogeny 9: GG' in the manne! 
already stated. In particular, (y is an isomor- 
phism if and only if y is an isomorphism such 
that y(r)—r (Le., g, =! for all xer) [7]. The 
isomorphism class of G is thus completely 
determined by the pair (X, r), so that we some- 
times write G = G(X,r). A connected semi- 
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simple algebraic group G defined over K is 
called of Chevalley type over k(or k-split) if 
there exists a k-trivial maximal torus T in G. If, 
in the above theorem, G and G’ are of Cheval- 
ley type over k and T and 7” are k-trivial, then 
the theorem remains true if we replace isogeny 
by k-isogeny. In particular, the k-isomorphism 
class of a connected semisimple algebraic 
group of Chevalley type over kis completely 
determined by (X, r). Chevalley also showed 
that, for any pair (X, r) satisfying condition (1) 
above, there exists a connected semisimple 
algebraic group G(X, r) of Chevalley type 
defined over the prime field. Therefore, since 
the classification of semisimple algebraic 
groups of Chevalley type is reduced essentially 
to that of root systems (X, r), it turns out that, 
over any ground field K, there exist as many 
connected simple algebraic groups of Cheval- 
ley type as connected simple complex Lie 
groups (— 249 Lie Groups; Appendix A, Table 
5.1). 

For a given semisimple algebraic group 
G = G(X, r) delined over k, put Xo = r, (= 
the submodule of X generated by r), X? = 
{xeF|<a*,x>eZ for all xer}. Then we have 
natural isogenies GX.  G(X,r) —^ G(Xs, x) 
(with q, = 1), all of which can be taken to be 
defined over k. The group G(X°, t) (G(Xo, D) is 
called the simply connected group (the adjoint 
group) isogenous to G. When the characteristic 
of Kis zero, these isogenies (which are already 
known in the classical theory of complex Lie 
groups) are essentially the only possible 1so- 
genies among the semisimple algebraic groups. 
But when the characteristic is p > 0, there are, 
in addition to these, the Frobenius homo- 
morphism (with q, = p) and the following 
"singular" isogenies (for which g, = 1 or p de- 
pending on «):B,2C,, Fy > F4 (p=2), G;2G; 
(p = 3). In particular, when kis a finite field, 
taking the set of fixed points of the singular k- 
isogenies, we obtain the simple finite groups of 
M. Suzuki and R. Ree (- 151 Finite Groups). 


0. Classification Theory 


A connected semisimple algebraic group G 
defined over kis called k- (almost) simple if 
there is no proper connected closed normal 
subgroup of G defined over k. (When G is k- 
simple and k-split, the factor group D(G,)/ 
center is an abstract simple group except for 
a few special cases [ 17]. For more general 
results — Section Q.) For a k-simple algebraic 
group G, let G, be any one of its absolutely 
simple components, and let k, be the smallest 
field of definition for G, containing K. Then 
k/k is a finite separable extension, and G is k- 
isogenous to R, ,(G,). Hence the problem of 
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classifying all k-simple groups (up to isogeny) 
is equivalent to that of tinding aj] k,-forms of 
simple groups of Chevalley type. This latter 
problem can be reduced, in principle, to the 
classification of compact k,-forms and that of 
certain diagrams (De, *Dynkin diagrams along 
with an action of the Galois group) [ 16], J. 
Tits [30] (- Appendix A, Table 5.1). For 
instance, when kis a linite field (or, more gen- 
erally, a field of dimension < 1 [ 15]), there is 
no compact k-simple group; hence, using a 
simple classification theory of the diagrams, 
we can show that the only absolutely simple 
algebraic groups G deiined over k are either of 
Chevalley type or of the types introduced by 
R. Steinberg (denoted by 74,, ?D,, ?D,, 9D,, 
^E.). Connected semisimple algebraic groups 
composed of the groups of these types are 
characterized by the property that they have a 
Borel subgroup defined over k. Such groups 
are said to be of Steinberg type over k(or k- 
quasi-split). Absolutely simple algebraic groups 
over a *p-adic lield have been classified by M. 
Kneser and J. Tits [30,31]. When the charac- 
teristic of kis not equal to 2, the classification 
of simple groups of classical type (except for the 
type D,) is known to be equivalent to that of 
semisimple tassociative algebras with *invo- 
lution [ 18]. A similar relation also holds be- 
tween some of the exceptional simple groups 
and *Cayley algebras or !Jordan algebras (H. 
Hijikata, T. A. Springer, J. Tits). 
The following is a het of absolutely simple 
algebraic groups of classical type. 
l. kforms of SL(n) (n > 2). 
L1 Gy, =SL(m, 8) - (ge M,(S)] N(g)- 1j, 
where § is a tcentral division algebra 
over k with (R:k)=r?, n=mr, and N 
denotes the treduced norm in M,(&). 
1.2. Gi 2 SU(m 8, f) - (ge SL(m, vi 
f(gx.gy) 9 f(x. y) for x. ye 8", where 
& is a central division algebra over a 
quadratic extension K of k with an in- 
volution ; of the second kind (which 
means that {tek |é' =ë} =k), (K:k')= 
r?, n= mr, and f is a (nondegenerate) 
tHermitian form of m variables over 
& with respect to the involution ;, 
II. k-forms of SO(n) (n2 3, n Z4), Sp(n) (n 
even, n 2 2) 
6,,— SU (n, 8, f), where § is a central 
division algebra over k with an involu- 
tion ; of the first kind (i.e., such that 
(éek|Z'— Zi =k), (8:k)=r?, no mr, 
and f is a nondegenerate c-Hermitian 
form of m variables over & with re- 
spect to the involution z. In this case, 
dml Zek |č'=č} =r(r + £g)/2 with e, = 
t 1, and G} is a k-form of SO or Sp 
according as ce = 1 or = 1. (SO(8) may 
have other k-forms coming from the so- 
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called triality.) The case where the 
characteristic of kis 2 can also be dis- 
cussed by a method given by J. Tits 
(Inventiones Math. 5, 1968). 

When k is a local field or an algebraic num- 
ber field, the only central division algebra with 
an involution of the first kind is a *quaternion 
algebra (and, if ; is the "canonical involution," 
then e= 1). 


P. Algebraic Groups over an Algebraic 
Number Field 


Let G be a connected algebraic group detined 
over an algebraic number field k of finite de- 
gree. Let {v} be the totality of *prime divisors 
(i.e., equivalence classes of valuations) of k 
Taking the *restricted direct product of a 
family of locally compact topological groups 
{ G,,}, we obtain a locally compact topological 
group G,, called the adele group of G[19] (- 6 
Adeles and Ideles). In particular, when G = 
Gm, the adele group J =(G,,), is exactly the 
tidele group introduced by Chevalley in class 
field theory. If we identify xe G, with an 

adele whose components are all equal to x, 

G, becomes a discrete subgroup of G,. 

Concerning the tiniteness property of G,/G,, 
the following results have been obtained 
[21,22]: A character ze X,(G)( = the module 
of all k-rational characters of G) gives rise 
to a (continuous) homomorphism y,: G, 

I 2(G,),. Put G$-(geG,||y,(g)| 1. forall 
XE X,(G)}, where || is the standard norm in I. 
Then G% is tunimodular, and the quotient 
space G4/G, is of finite volume with respect to 
the (unique) invariant measure on it. G2/G, 
(G,/G,) is compact if and only if the semi- 
simple part G/R (the reductive part G/R,) of G 
is k-compact. From the arithmetic point of 
view, it is important to determine explicitly the 
volume of G9 /G, with respect to the invariant 
measure normalized in a certain manner; such 
a volume is called the Tamagawa number of G 
and is usually denoted by 7(G) [19,23]. For 
instance, Siegel's formulas on the volume of 
the fundamental domain of the unit group of a 
quadratic form f over k are essentially equiva- 
lent to a theorem on the Tamagawa number 
stating that «(SO( f )) = 2. 

Let 9 be the tring of integers in kand L an 
o-lattice in the vector space on which G is 
acting. We can define in a natural manner an 
action of G, on the set of all o-lattices; then the 
orbit G4 L (G, L) of L with respect to G, (G) is 
called the genus (class) of L. The ‘stability 
subgroup G, , of Lin G, is open, and the 
double coset space G, ,  G,/G, is finite (fi- 
niteness of the class number). Moreover, let 
{v,,, v,} be the totality of "infinite prime 
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divisors of K, and put G,, = []}_, Gy. Then G, 
is a Lie group, and the canonical projection 
G, on G, of G, , C G, is a discrete subgroup of 
finite type. (In general, (discrete) subgroups of 
G, which are *commensurable with G, are 
called arithmetic subgroups.) As in the adele 
case, ye X,(G) gives rise to a (continuous) 
homomorphism 3. : G, RI. and if G9, = 
19€G. |z,,(g)|=1}, then the quotient space 
G9. /G, is of finite volume. Moreover, G9 /G, 
(G, /G,) is compact if and only if G9/G, (G,/G,) 
is compact. 

In addition to these, the fapproximation 
theorem and the *Hasse principle are also 
extended to (classical, or general) algebraic 
groups (M. Eichler, M. Kneser, G, Shimura, 
Hijikata, Springer; — [30]). 


Q. Structure of Reductive Groups [24] 


Let G be a connected ‘reductive group defined 
over a field k. Then G has a tmaximal torus 
defined over k and G is ‘split over a finite 
‘separable extension K of K. The structure of 
the group Gy can be discussed as in Sections I, 
J, K, and L. Here, we discuss the structure of 
G,, i.e., a *k-form of G.. 

Let A be a maximal k-split torus of G, i.e., a 
k-subtorus of G that is k-split and maximal 
with respect to these properties. Any two such 
tori are conjugate over k i.e., by an element of 
G,. Their dimension is called the k-rank of G. 
To say that G is of k-rank zero is equivalent to 
saying that G is *k-anisotropic. The centralizer 
Z = Z,(S) of S in G is a reductive k-group, 
and its derived group is k-anisotropic. Let 
N = N,(S) be the normalizer of § in G. The 
*Weyl group ,W 2 N/Z relative to § is called 
the k-Weyl group of G, A k-root of G with 
respect to S is a nontrivial character of S that 
appears when one diagonahzes the represen- 
tation of S in the +Lie algebra g of G, 5 operat- 
ing via adjoint representation. Denote by ,r 
the set of all k-roots of G with respect to S. 
There is a decomposition of the Lie algebra q 
of G:g=Go + Leer Ga Where g,={X eg|Ad(s)X 
— (s) X for all seS). Then go is the Lie alge- 
bra of Z, and there is a unique unipotent k- 
subgroup P, of G normalized by S such that its 
Lie algebra is gą. The set ,t is a froot system in 
a suitable Euclidean space whose Weyl group 
is isomorphic to ,W; if G is k-split, ¿r is the 
ordinary root system, and the P, are as in 
Section J. In general, ,r need not be reduced 
De, axiom (iii) in Section J need not be satis- 
fied), nor should dim 9, = dim P, always be 1. 

A closed subgroup of G defined over k 
which is minimal among the parabolic sub- 
groups of G is called a minimal paraholic k- 
subgroup of G. (If G is k-split, a minimal para- 
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bolic k-subgroup is a *Borel subgroup of G.) 
Any two minimal parabohc k-subgroups are 
conjugate to each other over k. If P is one such 
subgroup, then there exists a maximal k-split 
torus S such that P is the semidirect product 
of the reductive k-subgroup Z = Z,(S) and the 
tunipotent radical U = R,(P) of P. The ex- 
pression P = ZU is called a Levi-decomposition 
of P, and Z is called a Levi-suhgroup of P. Any 
two Levi-subgroups of P are conjugate by an 
element in U,. There is an ordering of ,r such 
that P is generated by Z and P, with & > 0. The 
minimal parabolic k-subgroups containing a 
given maximal k-split torus $ correspond to 
the *Weyl chambers of ,r. They are permuted 
simply transitively by the Weyl group ,W. 

Fix an ordering of ,t, and let ,A be the ffun- 
damental system of ,r with respect to the given 
order. For any subset () of ,A, denote by P, the 
subgroup generated by Z = Z,(S) and E 
where 4 is a linear combination of the roots of 
,¢ in which all roots not in () occur with a 
coefficient > 0. Then P, contains P and, in 
particular, P, = P. Pj is called a standard para- 
bolic k-subgroup of G containing P. Any para- 
bolic k-subgroup of G is conjugate over k to a 
unique P,. If S, is the identity component of 
(\zeolker a), then $ is a k-split torus of G and 
P, = Z(S) R, (P5). This shows that any parabohc 
k-subgroup of G has a Levi-decomposition 
and its Levi-subgroups are conjugate to each 
other over k. Let P be a minimal parabolic k- 
subgroup of G containing a maximal k-split 
torus S. Put U = R,(P), Z = Z,(S), and N = 
N,(S). Then, N = N,Z, so G, is the disjoint 
union over ,W of double cosets Dun, P, (we 
KW), where n, is a representative in N, of 
we, W. More precisely, if we, W, there exist 
two k-subgroups U;, and U”, such that U = 
U‘, x U% (product of k-varieties), the map 
U‘, x PO Un, P sending (x, y) onto xn, y is an 
isomorphism, and 
(G/P, GB V) x((U,)). 

we 
where 7 is the projection G— G/P. This is 
called a relative Bruhat decomposition. If G is 
k-split, this gives an ordinary Bruhat decom- 
position (— Section K). If is a subset of Â, 
let W, be the subgroup of ,W generated by 
reflections defined by the o's in 0. If 0, 0' are 
subsets of ,A, then there is a bijection of dou- 
ble cosets 


(Py) NG,/CPs = Wo\k W/ Wy. 


(Note that all these properties follow from the 
fact that (P,, N,) is a BN-pair in G,; — Section 
R.) 

If G is k-isotropic, let G* be the subgroup of 
G, generated by all U,, where U runs over the 
unipotent radicals of the minimal parabolic k- 
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subgroups of G. Kneser and Tits conjectured 
that when G is semisimple and simply con- 
nected and k arbitrary, G, = G*. Platonov 
showed that this conjecture fails for some k- 
forms of SL, (n > 2), but it is true in many 
cases, e.g., when G is tk-split or tk-quasi-split 
(J. Tits, Sém. Bourbaki 29e, no. 505, 1976677). 
In this case, if G is simple as an algebraic 
group and the cardinality of k is 2 4, then 
G,/Z(G,) is simple as an abstract group. The 
connected semisimple algebraic k-groups over 
a finite field k are k-split or k-quasi-split (— 
Section 0). J. Tits has given a reduction pro- 
cess for classifying the groups over an arbi- 
trary field k. He defined the index of the k- 
group (to a certain extent a generalization of 
Witt's theorem characterizing a quadratic 
form by means of its index and anisotropic 
kernel) and gave all possible indices of the k- 
groups and also a complete list in the case of 
local geld (J. Tits [30,31 ]). 


R. Buildings and BN-pairs [25] 


The origin of the notations of buildings and 
BN-pairs lies in an attempt to give a systema- 
tic procedure for the geometric interpretation 
of the semisimple Lie groups and, in partic- 
ular, the exceptional groups. The theory has 
various applications to the groups of Lie types. 
To describe this precisely, we must introduce a 
number of detinitions. 

A complex A is a set with an order relation 
c, read “is a face of" or “is contained in” such 
that for a given element A, the ordered subset 
S(A) of al] faces of A is isomorphic to the set of 
all subsets of a set. The subset S(A) of A is 
called a simplex in A. A complex has a smallest 
element, which we denote by 0. For an element 
A, the number rk A of minimal nonzero faces 
of A is called the rank of A. Define rk A = 
sup(rk A| AEA). A morpbism y: A—A' (where 
A and A' are complexes) is a mapping of the 
underlying sets such that for every AEA, 

S(A) = S(a(A)). A subcomplex of A is a com- 
plex whose underlying set is a subset of A such 
that the inclusion is a morphism. If Aeh, the 
star St(A) of A is the set of all elements of A 
containing A. With the order relation induced 
from A, St(A) is a complex. If BeSt(A) the 
rank of B in St(A) is called the codimension of 
A in B and is denoted by codimg A. 

A complex A is called a cbamber complex if 
every element is contained in a maximal ele- 
ment, which is called a cbamber, and if given 
two chambers C, C’, there exists a sequence of 
chambers C = Cp, C, ,, C, = C such that 
c odin. , (Cj, C) =codime, (C_, NC) <1 
for all i= 1, 2, , m. A chamber complex is 
called tbick (resp. tbin) if every element of 
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codimension lis contained in at least three 
(resp. exactly two) chambers. An endomor- 
phism o of a thin chamber complex A is called 
a folding if 9? = ọ and if any chamber on ¢(A) 
is the image of exactly two chambers in A by 
ọ. For any folding ¢, (A) is called a root. A 
thin chamber complex is called a Coxeter 
complex if for any pair of adjacent chambers 
C, C’, i.e., chambers such that codim,(C N C’) = 
|, there is a root containing C and not C’. Let 
X be a Coxeter complex. For any root oi) 

of X, there is only one root ~'(2), called the 
opposite of o(X), such that o(X)f1 e(2) does 
not contain any chamber and g(2) U y’(2) = 2; 
there is also an involutive automorphism 
called the reflection associated with ¢ that 
transforms @(Z) onto UL The group W(C) 
generated by all reflections of Z is called the 
Weyl group of X, which turns out to be a 
Coxeter group, i.e., there exist sets / and (mj;j, 
(i,j) € 1 x I, where mj; are integers or % and 

mj; = 1, such that the group is presented by 

the generators {r;};.; and the fundamental 
relations (r;r)"" = 1, i, jel, m; 7 oo. A Coxeter 
complex is called irreducible if it is not the 
join of two nonempty Coxeter subcomplexes. 

A building is a thick chamber complex A 
with a system d of Coxeter subcomplexes, 
called the apartments of A such that (i) every 
two simplexes of A belong to an apartment. (11) 
If Y. 2” eA, there exists an isomorphism of 2 
onto LU that fixes X N X (elementwise). Since 
the apartments of a building are isomorphic to 
each other, we can define the Weyl group, 
rank, and irreducibility of the building to be 
those of its apartment. A building with finitely 
many chambers is said to be of spberical type. 
If a building can be realized as a simplicial 
decomposition of a Euclidean space, then one 
says it is of Euclidean type. 

Examples of buildings are provided by BN- 
pairs. A BN-pair or Tits system in a group G is 
a system (B, N) consisting of two subgroups of 
G such that 
(BNO) B and N generate G; 

(BN) BüON-H-aN;and 

(BN2) The group W= N/H has a generating 
set R such that for any r e R and any we W, 
(BN2’) rBw c BwBfBrwB, 

(BN2") rBr £B. 

The group W is called the Weyl group of the 
BN-pair. For any subset $ of R, let W, denote 
the subgroup of W generated by S. Set P = 
B W,B. Then P, is a subgroup of G and the 
mapping SP, is a bijection of the lattice of 
all subsets of R onto the lattice of all sub- 
groups of G containing B. A subgroup of G is 
called parabolic if it is conjugate to some Ps. 
Let A be the set of all left cosets of all sub- 
groups P, S c R, ordered by the opposite of 
the inclusion relation. Let G operate on A on 


13 Ref. 


Algebraic ` Groups 


the left. Let X be the subset {nP, ne N, $c Rj 
of A, and dl be the set of G-translates of X. 
Then (A, Y) is a building whose Weyl group is 
W. 

Let G be a reductive algebraic group defined 
over a field K and P be a minimal parabolic k- 
subgroup of G containing a maximal k-split 
torus $ and set N = N,(S). Then (P,, Nj) isa 
BN-pair in G,. Therefore there is a building of 
spherical type associated with the group G. 
Conversely, the buildings of rank > 3 and 
irreducible spherical type (roughly speaking) 
all turn out to be associated with simple alge- 
braic or classical groups (J. Tits [25]). This 
result gives a complete and unified description 
of structures that were discovered previously 
in certain cases. For example, the building of 
type A, gives a tprojective space (E. Abe, T. 
Tsuzuku), that of type C, gives a polar space 
(Veldkamp), and that of a k-form of type E, 
gives a Cayley space (J. Tits). As an applica- 
tion, one can show that a finite building of 
rank 2 3 and irreducible spherical type is 
isomorphic to the building of an ‘absolutely 
simple algebraic group over a finite field. 

When kis local De, endowed with a com- 
plete discrete valuation whose residue field is 
Perfect), the reductive group defined over k 
has another BN-pair such that the associated 
building is Euclidean. This theory was ini- 
tiated by N. Iwahori and H. Matsumoto [26], 
who considered split semisimple groups. Later, 
quasi-split and classical groups were studied 
by H. Hijikata, and the theory for the general 
case was given by F. Bruhat and J. Tits 
[27,28]. To distinguish from the usual BN- 
pair structure, the subgroups conjugate to B in 
this case are called Iwabori subgroups, and 
parabolic subgroups are called paraholic sub- 
groups. The Euclidean buildings are the “ultra- 
metric analogs" of the *Riemannian symmetric 
spaces. In other words, in the study of p-adic 
simple groups, they play a role similar to that 
of the symmetric spaces in the theory of simple 
groups. 
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A. Introduction 


A complex number that satisfies an algebraic 
equation with rational integral coefficients is 
said to be an algebraic number. If the coeffi- 
cient of the term of highest degree of the equa- 
tion is 1, this algebraic number is said to be 

an algebraic integer. The set A of All algebraic 
numbers is a field which is the *algebraic clo- 
sure of the rational number tield Q in the 
complex number tield C. The set 1 of all alge- 
braic integers is an tintegral domain which 
contains the integral domain Z of all the ra- 
tional integers. The ‘field of quotients of J is A. 


B. Principal Order 


An extension field k of Q of finite degree 
(which we shall always suppose to be con- 
tained in C) is said to be an algebraic number 
field of finite degree, and kis a subfield of A. 
The intersection o = kN I is an integral domain 
whose field of quotients is k; o is called the 
principal order of k (More generally, a subring 
R of o containing 1 is said to be an order of k if 
the field of quotients of R is k The set f of all 
elements y of o such that yo C R is an ideal of o; 
in addition f is called the conductor of R.) Let n 
be the degree of kover Q. Then the additive 
group of the principal order o of kis a tfree 
Abelian group of trank n. A tbasis (o,. @,) 
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of o as a free Abelian group (or Z-module) is 
said to be a minimal basis of o (or of k). Let of? 
(i = 1, , n) be tconjugate elements of w; over 
Q, and let A = DW be the determinant whose 
(i,j) entry is ol Then D, = A? is a rational 
integer that is independent of the choice of a 
minimal basis of o. D, is called the discrimi- 
nant of k If k z Q, then |D,| 1 (Minkowski's 
theorem, 1891). For any given rational integer 
m there are only a finite number of algebraic 
number fields whose discriminants are equal 
to m (C. Hermite and H. Minkowski, 1896). 
The proof of these theorems depends on the 
methods of geometry of numbers (- 182 
Geometry of Numbers). 


C. Ideals of tbe Principal Order 


An "ideal a of the principal order o is said to 
be an integral ideal of k. In particular, a prime 
ideal ( 0) of o is called simply a prime ideal of 
k. The domain o is not necessarily a tprincipal 
ideal ring but is always a *Dedekind domain. 
That is, every ideal a of o is uniquely ex- 
pressed (up to the order of the factors) as a 
finite product of powers of prime ideals of o. 
This theorem is called the fundamental theo- 
rem of tbe principal order o. 

The quotient ring o/a of o by an ideal a (#0) 
of o is a finite ring. The number of elements 
of o/a is called the absolute norm of a and is 
denoted by N(a). We have N(ab) = N(a)N(b). 
Every prime ideal p (4 0) of o is a fmaximal 
ideal of o, and o/p is a Imite field. Let the 
‘characteristic of o/p be p, where p is a prime 
number. Then o/p is a Imite extension of the 
*prime field Z/pZ. Let the degree of o/p over 
Z/pZ be f. Then N(p) = p^, and f is said to be 
the degree of the prime ideal p. 

Let $ be a complex variable. The (complex- 
valued) function 


Ob) =) air Ua -(N()) 7) 


of s € C is called the Dedekind zeta function of 
k (R. Dedekind, 1871). Here the summation 
extends over all ideals a of o, and the product 
extends over all prime ideals p of o. This series 
converges absolutely for Re s > 1, and the 
function C, (s) has a single-valued fanalytic 
continuation to a +meromorphic function 

on the whole complex plane (- 450 Zeta 
Functions). 


D. Units 


An algebraic integer ¢ of kis said to be a unit 
of kif z^! is also an algebraic integer. Hence € 
is a unit of kif and only if the +Principal ideal 
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(E) is o. The set E, of all units of k forms an 
Abelian group under multiplication, which is 
called the unit group of k. The set of allele- 
ments of E, of finite order coincides with the 
set of all the roots of unity contained in k and 
forms a cyclic group of a finite order w. Let n 
be the degree of k over Q. Then for each ele- 
ment x€ k there are n conjugate elements ai" 
over Q. Let a (i = 1, , r,) be real for any 
xe k, and let a:*? and g14"2*) (j = 1, . , r,) 
be pairs of complex conjugates for any ae k. 
Then we have n = r, + 2r,. The unit group E, 
of k is the direct product of a cyclic group of 
order w and the free Abelian multiplicative 
group of trank r = r; + r; — 1. This theorem 
is called Dirichlets unit theorem (1846). A basis 
(€i, . &,) of this free group is called a system 
of fundamental units of k. 

Let 1! —log|a*?| (i21, ..., rj), (las 
2log|a9?| (jar, +1, . .., ri 4 r;) for uek. Forr 
elements 4,,, 4, of E,, 


On Fg, On, | 
dE Hn. ms pw 


On, ly, jy, 


is called the regulator of (1, , 4,) (Dedekind). 
In order for y,,...,4, to be multiplicatively 
independent, it is necessary and sufficient 
that R [y,,. 5,] #9. The absolute value of 
R [n -> y,] takes the minimum positive 
value R for fundamental units (¢,,.., €). R = 
R [¢,,,E,] is independent of the choice of 
fundamental units (¢,,., &j) of k. R is called 
the regulator of k. In general, Kin. 5, ]|/R 
is equal to the index [£,: H] of the group 
H generated by the roots of unity in k and 
ne, on, H. W. Leopoldt conjectured that 
units in k, which are multiplicatively indepen- 
dent over Z, remain multiplicatively indepen- 
dent over Z, (the ring of p-adic integers) when 
they are considered as elements of the tensor 
product k ® Q, over Q. This conjecture was 
affirmatively proved in some special cases by 
J. Ax (Illinois J. Math., 9 (1965)) and others. 

If k/Q is a Galois extension, there exists a 
unit ¢ of k such that the conjugates of € over Q 
contain r multiplicatively independent units 
(Minkowski's theorem). 


E. Ideal Classes 


An o-module contained in k (Le., oa c a) such 
that xa C o holds for some element a ( #0) of k 
is said to be a fractional ideal of k. For two 
fractional ideals a, b of k the "product" ab 
defined by (X af. (finite sum)|a;e a, fj, € b] is 
also a fractional ideal. Thus the set of the 
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fractional ideals of k forms a multiplicative 
commutative semigroup. For a fractional ideal 
atheseta 1—-(x&k xa c o) is also a fractional 
ideal of k, and we have aa "co. Thus the set 
of all nonzero fractional ideals of k forms an 
Abelian group 3, under multiplication with 

o as identity. Each fractional ideal a (#0) 

is uniquely expressed as a finite product of 
powers of prime ideals, if we admit negative 
powers. Namely, %, is a free Abelian multi- 
plicative group with the set of all prime ideals 
as basis. Given fractional ideals a and b, we 
say that a is divisible by b if a c b; in this case, 
we call b a divisor of a and a a multiple of b. 
Also, a C b if and only if there exists an integral 
ideal c such that a = bc. Given fractional ideals 
a=T] pyi and b — TTafi (e; 0, f; #0), we say 
that a and b are relatively prime if {p;} and (qj 
are disjoint. Usually a fractional ideal of k is 
simply called an ideal of k. 

For an element x ( # 0) of k, (x) = ao is a 
(fractional) ideal of k, and (a) is said to be a 
principal ideal of k. The set P, of all principal 
ideals (x) (x € k, 2:0) is a subgroup of Jy. 
Since (x) = o is equivalent to «e E,, we have 
P, = k*/E,, where k* is the multiplicative group 
of all nonzero elements of k. 

Each coset of A modulo P, is called an 
ideal class of k, and the group &, = 5,/P, is 
called the ideal class group of k. Each ideal 
class contains an integral ideal a with N(a) « 
,/|D,| (more precisely, with N (a) « (4/n)'z(n!/n") 

|D,|). From this it follows that €, is a finite 
Abelian group. The order h of €, is called the 
class number of k. For the calculation of the 
class number the fresidue at the pole s = 1 of 
the Dedekind zeta function is used. Namely, 
lim (s- L5 (s) - gh, 


slt 


g= 27 r8 Ru w, |D,], 


where R, is the regulator of k and w, is the 
number of roots of unity in k (Dedekind, 
1877). This formula is used, in particular, for 
the computation of the class numbers of 
*quadratic fields and tcyclotomic fields (- 347 
Quadratic Fields). The class numbers of cubic 
and quartic (real) tcyclic fields over Q were 
computed by H. Hasse in the case where 

the *conductor of k/Q is less than 100 (Abh. 
Deutsch. Akad. Wiss. Berlin, 2 (1948)). Hasse 
has also given a detailed computation of the 
class number of cyclotomic fields [ 15). In 
general, let the degree H = [k: Q] be fixed and 
let [D,| oc. Then 


lim (log(A, RjJ/log,/|D,|) — 1. 


(This formula was proved for n = 2 by C. L. 
Siegel, 1935, and for general n by R. Brauer, 
Amer. J. Math., 69 (1947). 
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F. Valuations 


All the tArchimedean and ‘non-Archimedean 
valuations of an algebraic number field k can 
be obtained as follows (- 439 Valuations): 


Archimedean Valuations. Let n= [k: Q], and 
let n conjugates of we k be a), , a™ such that 
a? (i= 1, , rj) is real, and al and air 
(j21., r,) are pairs of complex conjugates. 
W e write |a]; ]a'?] (j 4, +r); these 

are Archimedean valuations of k that are 

not mutually equivalent. The equivalence 
classes of these valuations are denoted by 

pi’, ..., p%*"), respectively, and are called the 
tinfinite prime divisors of k. The first r; infinite 
prime divisors are called trea] and the remain- 
ing 7; are called "imaginary (or complex). The 
valuations of k defined by 


Ixlon mia]; FL... pr 
jo 3 
=|al;, jen-ctl..., rir, 


are called tnormal valuations of k. Here — |,» 
(jor, 1, ..., ri +r) are valuations in the 
wider sense. If r, =n we call k a totally real 
field, and if r, 0 wecall k a totally imaginary 
field. 


Non-Archimedean Valuations. Let p be a prime 
ideal of k and a an element of k. Let (x)= p^b, 
where p and b are relatively prime. Put v,(a) = 
a. Then for any constant p (0 < p <1), 


lal, e p^ 


is a non-Archimedean valuation of k. This 
valuation of k is called the tp-adic valuation of 
k; p-adic valuations for different prime ideals 
are mutually inequivalent. The valuation Ia, 
with p —(N(p)) ! is called a *normal valuation 
of k. The equivalence class of valuations con- 
taining — |, is denoted by the same letter p 
and is called a ‘finite prime divisor of k. 

A formal finite product of powers of finite or 
infinite prime divisors m* = I] p? is called a 
divisor of k. If all e; 2 0, then m* is called an 
integral divisor of k. Given divisors m* = 
II pfi and n*- II p/i, we write m* n*if es 
f Go 2) 

Any valuation of k is equivalent to one of 
the valuations defined previously (A. Ost- 
rowski, 1918; E. Artin, 1932). For any element 
a (#0) of k the *product formula [T,|«|, = 1 
holds, where p runs over all finite and infinite 
prime divisors of k and — |, are the normal 
valuations of k. Conversely, let k be a field, 
and letV={ — |j be a set of inequivalent 
valuations of k such that (i) for any a ek (a £0) 
|x|, z | holds only for a finite number of p in 
V; (ii) the product formula I], lol, = 1 (aek, 
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x #0) holds; and (iii) there is at least one 
Archimedean valuation in V. Then k is an 
algebraic number field and Vis the set of all 
the prime divisors of k (Artin and G. Whaples, 
Bull. Amer. Math. Soc., 5 1 (1945)). 


G. Ideal Classes in the Narrow Sense 


For a, Bek, the expression a = ß (mod p?) 
means a? f? > 0 for a real infinite prime divi- 
sor pË and a f? % 0 for an imaginary infi- 
nite prime divisor p. We call an element 

x € k totally positive if al] real conjugates o? 
(i= 1,. , r,) are positive. In the notation just 
given, this means a = 1 (mod p9) (i = 1, , ri). 
The set of all principal ideals (a) generated by 
totally positive elements a € k is a multipli- 
cative subgroup P, of P,. Each coset of A 
modulo Pj is called an ideal class of k in the 
narrow sense. Let E; be the group of all totally 
positive units of k. Then we have (3,: P4.) - 


h2^ (E, Eg). 


H. Multiplicative Congruence 


Let m be an integral ideal of k, and let k*(m) 
be the multiplicative group of all elements a in 
k such that (a) is relatively prime to m. Any 
element XE k*(m) Can be expressed in the form 
Y = B/ysuch that f, y € o and (fj), (y) are rela- 
tively prime to m. 

Consider an integral divisor m* = m [J] p? 
which is a formal product of m and intinite 
prime divisors p? of k. We call m the finite 
part of m*. Given an element ae k*(m) and 
elements fj, y € k*(m)N o such that a = f/y, 
we set a = 1 (mod*nt*) if P= y (mod nr), and 
a = 1 (mod p®). The set of al| a in k*(m) such 
that a = 1 (mod" m*) forms a multiplicative 
group. We write a= f (mod" m*) for a, Bek if 
a/B € k*(m) and ai = 1 (mod " m*). This con- 
gruence is called the multiplicative congruence. 
In the following discussion we shall write 
mod m* for mod” m*. 

We denote by y, (m) the group of all ideals 
of k that are relatively prime to an integral 
ideal m, and by Sim?) the group of all prin- 
cipal ideals (a) such that a € k*(m), a = 1 
(mod m*); S(m*) is known as the ray modulo 
m*. Any subgroup H of %,(m) which contains 
S(m*) is called an ideal group modulo m*, and 
the factor group 3,(m)/H is called a group of 
congruence classes of ideals modulo m*. 

If nt m* for integral divisors m* and n* of 
k, then J (n) C S, (n) and S(m*) c S(n*). If H 
is an ideal group modulo n*, then Q(H) = 
H N Jím) is an ideal group modulo m*, and 
we have %,(1)/H = s,(m)/@(H). For any given 
ideal group H, modulo m* there is a smallest 


141 
Algebraic Number Fields 


integral divisor {* such that {* m*, and there 
exists an ideal group H modulo TE with ®(H) 
= H, Oe, if there is an ideal group H’ modulo 
n* with @(H’) = Ho, then T n*). We call j* the 
conductor of the ideal group H. The notion of 
multiplicative congruence is used in felass field 
theory and in the theory of tnorm-residue 
symbols. 


I. Ideal Tbeory for Relative Extensions 


If an algebraic number lield K has a subfield k, 
we say that Kik is a relative algebraic number 
field. Let be the principal order of K. For a 
(fractional) ideal a of k, Oa is an ideal of K. 
We write Da = E(a) and call E(a) the exten- 
sion of a to K. For ideals a, b of k, wehave 
E(ab) — E(a) E(b) and E(a)f1k — a. 

Let V, : K 2 C be k-isomorphisms (i = 
l, ...,n), where n 2 [K:k]. We write K® = 
V (K) and A? =Y-(A)for Ae K. For an 
ideal A of K, UW — | 4? A e UA} is an ideal 
of KC?) and Q{ is called the conjugate ideal 
of Yin K®, Let L be the composite field of 
KI , Kit Then the ideal generated by 
AH... UA” in L is the extension of an ideal a 
of k We write a = Nell and call a the rela- 
tive norm of d over k. We have Ny, (9138) = 
Neal fN, (DI and Ny, (E(a)) = a" (for an 
ideal a of K). In particular, for K =Q, Nal) 
-(N(30)). 

Let p be a prime ideal of k. Then E(p) = 
Pr PY Bam O, where D, P, are 
prime ideals of K. Let f; be the degree of the 
finite field O/P, over o/p. Then ND pi 
f; is called the relative degree of d. over K and 
e; is called the (relative) ramification index of 
P; over K We have the relation n = Y? , e; f. 
between these numbers. If e, = =e,= 1, the 
prime ideal p is said to be unramified for KI 
Otherwise, p is said to be ramified for K/k. If 
every prime ideal of kis unramified for K jk, we 
call K/k an unramified extension. (For an in- 
finite prime divisor p , of k we write p, = 

14 BO" if the Archimedean valuation 

|,, of k can be extended to g Archimedean 
valuations Lui (i21,...,g)of K, where e; = 
2 if PË is imaginary and p,, is real, e; = 1 
otherwise.) 


J. Relative Differents and Relative 
Discriminants 


Let K/k be a relative algebraic number field 
and o, 8 be the principal orders of k, K, re- 
spectively. Put M= {A ek Trei AD) c oj, 
where Trk is the ttrace (- 149 Fields J). Then 
Wt is a (fractional) ideal of K and t! — Dy is 
an integral ideal of K; Dx, is called the rela- 
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tive different of K over k. When k =Q, D be is 
simply called the different of K. For, Kk, 
we have the chain theorem: Dip = Dik Den, 

Let the conjugates of Ae K over kbe 
AU), , A'"', and assume that A} — A. Put 
du (A) DIS. 4(A -A”) for Ae K. If Ae, 
thenàg4(A)e Dri Dex, is generated by 
ës ALA Ae OD]. The integral ideal (€? gen- 
erated by (A — A? A eS jin the field L = 
KK?) K® was called an element by D. 
Hilbert. We also have Du = EVES, EM. 
The integral ideal by, = Ng, (2) of Kis 
called the relative discriminant of K/K. If k= Q 
dko —(Dy). 

For the relative different Dy ik to be divisible 
by a prime ideal ‘ of K, it is necessary and 
sufficient that E(p) = PP? Pie with e » 1, 
where p = d N k (Dedekind’s discriminant 
theorem, 1882). Hence a prime ideal p of kis 
ramilied for K/k if and only if p divides the 
relative discriminant dx; there are thus only a 


+ 


linite number of prime ideals of K which ramify 
for K/k. In particular, K/k is unramilied if and 
only if Den = 0. 


K. Arithmetic of Galois Extensions 


Let K/k be a relative algebraic number field 
such that K is a *Galois extension of K of 
degree n, and let G be the *Galois group of 
K/k. Let o, D be the principal order of k, K, 
respectively. The conjugate ideals of an ideal 
d of K are given by 9I* 2 14"| AEA! (ce G). 
If Nq , (9I) = a, then E(a) = IT, Wl’. For a 
prime ideal p of K, E(p) 2 GB, B; ... P, 
where Nga(P) =p (i91... .. g), n= efg, and 
JD... P, are mutually conjugate prime ideals 
of K over k 

Hilbert (1894) developed the decomposition 
theory of a prime ideal p of kK for a Galois 
extension K/k in terms of the Galois group G 
as follows: Let $ be a prime ideal of ©. Then 


Z-iceG|W'- yj 


is a subgroup of the Galois group G of K/k. Z 
is called the decomposition group of $ over k. 
Let G =| j; Zt, be the left coset decomposition 
of G. Then 35, = $": (i = 1, . , g) are al] the 
conjugate ideals of $$ over k 

The subgroup 


T={cEZ|A°=A(mod), AED} 

of the decomposition group Z is normal, and 
T is called the inertia group of $ over k The 
quotient group Z/T is a cyclic group of order f 


(the relative degree of BL There exists an 
element g of Z such that 


AED, 


and g is uniquely determined mod T; cT gen- 


A* = AN (mod 35), 
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erates the cyclic group Z/T. This g is called the 
Frobenius substitution (or Frobenius 
phism) of $ over k. For m= 1,2, , 


y"—íacZ| A" A(mod P"*'), AED} 


automor- 


are normal subgroups of Z; the group V! is 
called the mth ramification group of $ over k 
Let 


ye — NE you Veith 


=VOID 2 yat, 


zs pua yet) — | 


Let V,— Vo"! (9 =0,1,...,), where vo = —1. 
In particular, Yo =V =T. The integers v|, 
v, .. are called the ramification numbers of 5. 
The group T/V; is isomorphic to a subgroup of 
the multiplicative group of the finite field O/B. 
Hence T/V, is a cyclic group whose order e, 
is a divisor of N(38) — 1. The group V/V, ,, 
(m > 1) is isomorphic to a subgroup of the 
additive group of the finite field Cl Hence 
Vin/Vm+1 is an Abelian group of type (p, p, , p) 
whose order divides ND) From ez T|= 
(T: HU, it follows that e —eo p^. (ey, p)= 1. 
Here G|denotes the order of a finite group G. 
Hence the decomposition group of * is a 
*solvable group. The relation between the 
ramification numbers for K jk and those for an 
intermediate Galois extension F/k was com- 
pletely determined by J. Herbrand (J. Math. 
Pures Appl., 10 (1931)) [II]. 

Let P be the P-component of the relative 
different Dx, of a Galois extension K/k. Then 


i- S pas oov n X (V° 


In particular, 42 O0 if T21,andd-e 1 if 
pep 

Let kz, kr, and kpm) be the intermediate 
fields which correspond to the subgroups Z, T, 
and V™, respectively, in the sense of *Galois 
theory; the fields k}, ky, and k,«m) are called 
the decomposition field, the inertia field, and 
the mth ramification field of R, respectively. 
Let * be a prime ideal of K containing p, and 
let p; and p, be prime ideals in k, and k, such 
that pz = PN k, and p, = PNky. Then we 
have Elp)=pzp P? for kl; E(pj)- p; 
and NM. (py) p pf for k,/k,; and E(p,) iV 
for K/ky. 

If a prime ideal p of kis unramified for a 
Galois extension K/k then we have E(p) - 
P Pa Pa B= (m UE By) = 
pl. and n = fg. The Frobenius automor- 
phism o;: ds A"? (mod $8) (A e D) for the 
prime ideal 3l, is uniquely determined, and its 
order is f. Since 3B, — Bi, we have c; = vj oz, 
Hence 0,,, d belong to the same ‘con- 
jugate class of G. In particular, if G is an 
Abelian group, then e; — ,,= o, and 
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A" = AU" (mod p), AED. 


We then write 


E G ) 


and call this symbol the Artin symbol for p 

for the Abelian extension K fk. For an ideal 
a= [I p° of k that is relatively prime to the 

relative discriminant of K/k, we define 


(lu e 


Evidently, we have 


VoU 


The arithmetic of quadratic fields (— 347 
Quadratic Fields) and the arithmetic of cyclo- 
tomic fields (— Section L) have been developed 
since the 19th Century. 





L. Arithmetic of Cyclotomic Fields 


A complex number č whose mth power is | but 
whose m'th power is not 1 for m’ < m is called 
an mth primitive root of unity. There are (m) 
primitive roots of unity: exp(2xir/m) ((r, m) = 1), 
where ¢ is *Euler's function. These qm) primi- 
tive roots of unity are the zeros of an irreduc- 
ible polynomial over Q of degree «p(m) 
FX) [TG 19, 

An 
where uis the *MObtus function. The coeffi- 
cient of the highest term of F,(X) is 1, and the 
other coefficients are all rational integers. 
F,(X) is called a cyclotomic polynomial. An 
example is 





F,(X) -(X'* 
-2X*- 


DOC — DX? — DONT 


X431, 


The algebraic number field K „= Q(C,) 
(m = exp(2zi/m)) obtained by adjoining an 
mth primitive root of unity to Q is a Galois 
extension over Q of degree (m) whose Galois 
group G is isomorphic to the multiplicative 
Abelian group of ‘reduced residue classes of Z 
modulo m:G — (o, [Cz — C, (r,m)= 1]. K,, is 
called the mth cyclotomic field. Cyclotomic 
fields are *Abelian extensions of Q. Conversely, 
every Abelian extension of Q is a subfield of a 
cyclotomic field (Kronecker’s theorem, 18.53, 
1877). 

We can choose (1, Za Cm -> Cat!) as a 
minimal basis of K,. Let m= If I? [ be 
the decompositions of m in powers of prime 
numbers /,, ,]. Put KÜ— Ky. Then K,, 
is the composite field K„= KK... Ki 
The different of K, is given by Dx ig 
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Dy tgDk Oo... Dee, and the discriminant 
of K, is given by Dy, = Dgo Dgo @, = 
o(m)/ọ(l":)). If m= l^ the discriminant of K 
is Dy, =el* (ac I (hl — h— IL where ¢= 
— 1 if /"=4 or 123 (mod4), and e=1 other- 
wise. Hence the discriminant of K m is D, = 
Ge mill. p^ Tim! Suppose that S 
m2 (mod 4). Then a prime number p ramifies 
for K,,/Q if and only if p divides m. In par- 
ticular, if m= [^ (m> 2), then only 1 ramifies for 
K „/Q and (I) 2199", NCT) = | ([ is explicitly 
given by [= (l-i,,,)). For /# 2 the ramification 
numbers for | are v, - | 1 G= 1,2,), and 

the ramitication fields are K, Kp, For [= 2 
the ramification numbers are 1,3,7, and the 


m 


ramification fields are Q, K4, Kg, 

In K,,/Q a prime number p ( ptm) is decom- 
posed as (p)= 1), N(p)2p/(i2 1, ....y) 
and fg = qn(m). Here the degree f of p, is de- 
termined as the minimal positive integer f 
such that p^ = 1 (mod m) Hence the decom- 
position law of a prime number p in K,,/Q is 
derermined by its residue class modulo m. This 
is a prototype form of class tield theory (- 59 
Class Field Theory). 

The class number of the cyclotomic tield K,, 
can be calculated by Dedekind's formula (- 
Section E; see also Hilbert [4]). Here we shall 
give the result for m=! (a prime number). 

Let r be a 4 Primitive root modulo l. For ( = 
exp(2ni/l), we put 


7 Lë? de 1/2 
um (is) i 


Then ¢ is a unit in K,. Define an element g of 
the Galois group of K,/Q by {°= ¢", and put 
GE (120,1, ...). Then £o, £1, ..., €, 1 (p= 








(l— 3)/2) are multiplicatively independent units. 


That is, the regulator R Io En £4] = 
E40. The units &,¢,,.... €—, are called 
circular units. The class number h of K, is the 
product of two factors, h = h, h,. Here h, is the 
class number of the real subfield K; = Q(¢ + 
(~'), E. E. Kummer called E, the first factor 
and h, the second factor of the class number h. 
Let X1» X2», fj-, be the multiplicative char- 
acters of the treduced residue classes of Z 
modulo 1, and let y{i= 1, , p+ 1) be the 
characters among them such that y,( = 1) = =1, 
Then 


EX p*lp-ci/1-1 E 
het ; EY auo). jo zt 





(21)? i-1 Na-1 E 


(Kummer, 1850). Here Rọ is the regulator of 
K;. Since circular units belong to K;, the class 
number h, of K; is equal to the index of the 
subgroup generated by +1, zo, e, , in the 
group of units of K;, The class number h of 
K, is equal to 1 for {< 19 and it has been con. 
jectured that there exist no more fields K, 
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with hz 1. H. L. Montgomery and K. Uchida 
solved this conjecture by proving that the first 
factor bz 1for|» 19 (Tóhoku M ath. J, 23 
(1971)). J. M. Masley and Montgomery proved 
that there are precisely 29 distinct K,, with 
class number | (J, Reine Angew. Math., 
286/287 (1976). 

According to Kummer, | divides h if and 
only if l divides h,. Since h, can be computed 
explicitly, we can readily determine whether 
l| h or not (- 145 Fermat's Problem; Ap- 
pendix B, Table 4.111). A prime is called regu- 
lar if /{h, otherwise it is called irregular. Thus 
an odd prime l> 5 is irregular if and only if 
there exists an even integer j with 2 <j <]—3 
such that | divides the numerator of the jth 
Bernoulli number B,. This criterion of Kum- 
mer can be strengthened as follows. Let § 
denote the /-Sylow subgroup of the ideal 
class group & D of the prime cyclotomic Geld 
K, = Q(¢,). Because the Galois group G of 
OU HO operates on S, the Abelian group § 
becomes naturally a Z,[G]-module. Choosing 
the canonical character 0: G— Z£ defined 
by 67 = C? for any oe G, we have a direct 
decomposition § = Ta S9, where S? = 
fseS| s? 2 sf el, c e G}. Then it is necessary and 
sufficient for || (the numerator of Bj) that S"? 
#0 (K. A. Ribet, Inventiones Math., 34 (1976)). 
Moreover, because G is naturally isomorphic 
to the multiplicative group of the reduced 
residue classes of Z/IZ, each 0! becomes a 
*Dirichlet character. Let 


Jib. 
Bois: A 0 '(a)a; 
aco; ÄM 


then this number, regarded as an element in 
Q,, is equal to a factor of the product (up to 
+ 1 and 2) appearing in the class number 
formula for h,. Let m; be the l-exponent of 

B, a-i for each odd i (2 < i & [— 3). Then it is 
conjectured that the order of S® is precisely 
equal to /"i for each odd ; with 2<i</—3, In 
particular, if S 2 [T; even S” is trivial, then 
this is known to be true. In general, it is also 
conjectured that lth, holds for any prime | 
(Vandiver's conjecture). If the group § is 
cychc, then § is of order l"i (A. Wiles, Inven- 
tiones Math., 58 (1980)). 

When a Galois extension K over a finite 
algebraic number field k has the Galois group 
isomorphic to the additive group of the [-adic 
integer ring Z, the extension K/k is called a 
Z,-extension or r-extension. Then for each 
integer n > 0 there exists a unique subfield k, of 
K with degree |" over k. Let k = K, 7 Q(C) (L9 2), 
and let K be the union of all /"*! th cyclotomic 
fields K &«1(n > 0). Then K/k is a typical ex- 
ample of a Z,-extension with subfields k, = 
Km=+ 1, Let K/k be a Z,-extension with sub- 
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fields k,, and let e denote the [-component 
of the class number of k,. Then there exist 
integers 42 0, u> 0, v independently of n 
such that e, = 4n + yl" + v for all sufftciently 
large numbers n (K. Iwasawa, Bull. Amer. 
Math. Soc., 65 (1959)). These numbers A, 4, 
v are called the Iwasawa invariants for the 
Z,-extension Kik There exists a unique 
Z,-extension K? over the rationals Q. An ex- 
tension K°k over k obtained by taking the 
composite of K? and kis called the basic Zy- 
extension of Kor the cyclotomic Z,-extension 
of k. We denote its Iwasawa invariants by 
Ay(K), uu (k), VK). It is known that there are Z,- 
extensions with arbitrarily large o. But it is 
conjectured that p(k) = 0 holds for the basic 
Z,-extension of any k In particular, in the case 
k= K, = Q(¢,) it has been computed that p(k) = 
0 for |< 125,000 (for |< 4001 by K. Iwasawa 
and C. C. Sims, J. Math. Soc. Japan, 18 (1966); 
for |< 30,000 by W. Johnson, Math. Comp, 29 
(1975); for [< 125,000 by S. Wagstaff, Math. 
Comp, 32 (1978)). By applying the theory of 
uniform distributions to Lade situations, B. 
Ferrero and L. C. Washington (Ann. Math., 
109 (1978)) proved that p(k) 2 0 when k/Q is 
Abelian. In particular, when K= K, and k, = 
K,"*!, e, > 0 if and only if the class number 
of K, is divisible by / (P. Furtwangler, 1911). 
Since any quadratic field is a subfield of a 
cyclotomic field (by a tGaussian sum formula 
we have Q(./m) C Q(C,), where dis the dis- 
criminant of Q(/m)), the computation of the 
class number of quadratic fields and the proof 
of the law of reciprocity for the {Legendre 
symbol follow from the arithmetic of cyclo- 
tomic fields. 


M. Aritbmetic of Kummer Extensions 


Assume that an algebraic number field k con- 
tains an nth primitive root of unity. Then a 
"Kummer extension K =k(3/u) (uek) isa 
teyclic extension of k Assume that [K: k] =n. 
In order that a prime ideal p of k ramify for 
K/k, it is necessary that p be a divisor appear- 
ing in (n) or (uj). If p4(n) and v,(“) #0 (modn), 
then p ramifies for K/k. A prime ideal p which 
is relatively prime to (ul has the decomposition 
E(p) = 3B, P, with distinct primes $, in K if 
and only if the equation u= ¢" (mod p") is 
satisfied by some £ € o for any positive integer 
m In particular, if p-(n) and v,( Ii) = 0, we have 
E(p) = 3B, P, and only if jj = č” (mod p) is 
solvable in o. 

If for an element o of 0 


y=" (mod p) 


is solvable by some €€ o, u is said to be a 
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residue of tbe ntb power modulo p. Assume 
that p)(n) and v,(u) = 0. Let f be the minimal 
positive integer such that p/ is a residue of the 
nth power modulo p. Then p is decomposed 
in K as E(p) — B, P, and Nga (88) — p^ 
(i21, ...,9). 


N. Power-Residue Symbol 


Let C, = exp(2zi/n) e K and let p be relatively 
prime to (n) and (a) (x€ k). Then for some r we 
have 


g(N0- 1 = FF (mod p), 


and we write 


This symbol is called the nth power-residue 
symbol (Kummer). Generalizing this definition, 
we can define the nth power-residue symbol 
(a/b), for an ideal b of k which is relatively 
prime to the relative discriminant of k(2/a)/k 
by using the *Artin symbol (( K/k)/b): 


cir) e(t) 


This symbol satisfies 


LEI) 
S 


if all the symbols are well delined. In partic- 
ular, à is a residue of the nth power modulo p 
if and only if (x/p), = 1. This symbol coincides 
with the *quadratic residue symbol for n —2, 
k=Q, and pz 2. 








0. Law of Reciprocity for tbe Power-Residue 
Symbol 


Several formulas concerning the power-residue 
symbol are known which are similar to that 
for the quadratic residue symbol (F. G. M. 
Eisenstein, Kummer, Furtwangler, Takagi, 
Artin, Hasse). These can be proved by means 
of Artin's *general law of reciprocity in class 
field theory (— 59 Class Field Theory). 

There are many formulas concerning the 
reciprocity of the power-residue symbol. One 
of them is as follows: Letn = Į be a prime 
number. Let a, fe kand assume that (i) g is 
totally positive; (ii) v,(a) = 0 (mod I) if v,(B) #0 
(mod /), and v,(f)=0 (mod I) if v,(a) #0 (mod J), 
for any prime ideal p; and (iii) a= 1 (mod l) and 
p = 1 (mod( 1 —{,)), Then 
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Qr 


(general law of reciprocity, Hasse, 1924). This 
result is a generalization of the formula of 
Eisenstein (1850). If a(ck) is totally positive 
and y= 1 (mod IL then 





(complementary 
1924). 


law of reciprocity, Hasse, 


P. Norm Residue 


Let m be an integral divisor of k such that 
m = IT; při IT; p2 (e; > 0) with finite prime divi- 
sors {p;} and infinite prime divisors (pj, 
and let f be an element of k that is relatively 
prime to m. For a relative algebraic num- 
ber iield K/k and an element B of K, we set 
B = Nyj4( B) (mod m) if the following two con- 
ditions are satisfied: (i) f =Nx,(B) (mod pfi) for 
every finite p; and (ii) f? >0 for every infinite 
prime p such that dl is real and its extension 
to K is imaginary. fe k is then said to be a 
norm residue modulo m for K/k if there exists a 
number B of K such that B= Nx,,(B) (modm). 
Let p be a finite prime divisor of k. If f isa 
norm residue modulo p* for a sufficiently large 
c, then B is a norm residue modulo p* for any 
e >c. Let c be the smallest such integer (c > 0). 
Then the ideal [, = p^ is said to be the p- 
conductor of norm residue for K/k. If p is un- 
ramified for K/k, then c = 0; i.e., every Bek 
which is relatively prime to p is a norm residue 
modulo p* for any e > 0. For a ramified p put 
f, = p‘. For a Galois extension K/k, c is not 
greater than 


y Dé Up Vu 
D (tpa — Up) = : 
p=0 | Vol E n i70 V| 





In particular, for an Abelian extension Kik this 
value is an integer and is equal to c (Hasse, J. 
Fac. Sci. Univ. Tokyo, 1934). For example, the 
l-conductor of the cyclotomic field Kai is 1". 
We define the p -conductor for K/k for an 
infinite prime divisor p, of k by f. =p, if Do 
is real and its extension to K is imaginary, and 
EN = | otherwise (- 257 Local Fields F). 
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Q. Norm Residue Symbol 


For an Abelian extension K/k, the positive 
divisor 


Wl 


(where p runs over all finite and infinite prime 
divisors of k) is called the conductor of K/k 
(— 59 Class Field Theory). For a e k (a #0) 
take % such that a/a, = 1 (mod f,) and %& = 1 
(mod ff; +), and put (x) = p*b with b relatively 
prime to p. Then b is relatively prime to the 
relative discriminant Dx;,. We define a new 
symbol by 


a, K/k " K/k 
( p LU" ide 


where ((K/k)/b) is the Artin symbol. This 

value is independent of the choice of the 
auxiliary element xy. The new symbol is called 
the norm-residue symbol (Hasse, J. Reine 
Angew. Math., 162 (1930)). In particular, for 
an infinite real prime divisor p) of k whose 
extension PY’ for K is imaginary, we have 


a, Kik 
"T =] or =0 


according to whether the conjugate a is 
positive or negative, where g is the automor- 
phism of K/k induced from the complex con- 
jugation of the completion C of K with respect 
to PY. 

The norm-residue symbol has the following 
properties: 


a) (= 2 D ei (= 2) 
p J \ p p JI 


(2) if p is unramified for Ki, then 


D et É (RE) 
p/NXP 


(3) in order that o be a norm residue modulo f, 
for K/k, it is necessary and sufficient that 


(5 2 ps 
P ^ 


(4) the product formula for the norm-residue 
symbol (Hasse) is 


a, Kik 
=1 
r( p 


where p runs over all finite and infinite prime 
divisors of k; and (5) if the domain of the 
variable g is the whole k (#0), or the set of 
all x such that (o) is relatively prime to p, or 
the set of alla such that «= 1 (mod p”) (v, + 
I<mv,,,), then the range of ((a, K/k)/p) is 
the decomposition group Z of P. the inertia 
group T of p, or the ramification group V, 
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of p, respectively (Hasse, S. Iyanaga, 1933) 
(- 257 Local Fields F). 


R. Hilbert Norm-Residue Symbol 


The symbol first introduced by Hilbert for 
quadratic fields can be defined in a general 
algebraic number field k containing an nth 
primitive root C, of unity. Let a, Be k(x 0, 

p #0), and let p be a prime divisor. Then the 
following nth root of unity ((, Dn, is defined 
by using the norm-residue symbol: 


Claren 
e E 


p 


This symbol ((a, f)/p), is called the Hilbert 
norm-residue symbol. It is also called the 
Hilbert-Hasse norm-residue symbol. For o, ai, 
Bek, wehave 


ax PA foB (m.p. 
ul p )-( p JI p ) 


(2) the law of symmetry: 


Gear 
Don Xa 


and (3) the product formula for the Hilbert 
norm-residue symbol: 


"CHL 


(Hilbert, Furtwängler, Takagi, Artin, Hasse). 
For detailed properties concerning the norm- 
residue symbol, power-residue symbol, and 
Hilbert norm-residue symbol and for refer- 
ences for them see Hasse [6]. 

In general, the problem of obtaining various 
laws of reciprocity for the power-residue sym- 











bol is reduced to the one of computing the 
Hilbert norm-residue symbol explicitly. De- 
tailed formulas for these symbols are called 
explicit reciprocity laws; they are treated as 
a topic in the number theory of local fields 
(- 257 Local Fields H). 


S. Density Tbeorem 


Let M be a set of prime ideals of k If 


1 1 

j —_——/ log —— 
MI | "ed 
exists, its value is said to be the density of M. 
The density of the set of all prime ideals of k is 
l. Let H be an ideal group modulo an integral 
divisor m. Then the density of the set of all 
prime ideals contained in each coset of (m) 
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modulo H is 1 /(S(n): H). In particular, let H 
be the ray S(m). Then this result implies that 
each coset of (n) modulo S(m) contains 
intinitely many prime ideals (a generalization 
to algebraic number fields of the tprime num- 
ber theorem for arithmetic progression). 

Let K/k be a Galois extension, C be a conju- 
gate class of the Galois group G of K/k, and 
M(C) be the set of all prime ideals p of Ksuch 
that the Frobenius automorphism of each 
prime factor 3B; of p in K belongs to C. Then 
the density of MO is |C|/|G| (Cbebotarev's 
density tbeorem, Math. Ann, 95 (1926). 

Each element g of the Galois group G of K/k 
can be expressed by the permutation z of the 
conjugate fields K"),, K of K over K. Let z 
be expressed as the product of v tcycles of 
length fi, ‚f Hence n =f, + +f, Let 
Chis LI be the set of all such g in G, and 
let M(f,, ,f,) be the set of all prime ideals 
p of Ksuch that p is decomposed in K/k as 
the product of r prime ideals of K with rela- 
tive degree f,,. , f,. Then the density of 
M(f,, fis Cf... .. ll (Artin, Math. 
Ann, 89 (1923). 


T. Relation to the Arithmetic of Local Fields 


It is quite useful to investigate the relation 
between the arithmetic of algebraic number 
fields and that of local fields. For example, let 
a prime ideal p of an algebraic number field k 
be decomposed as E(p)= Wi. o, NB) 
— pi (i 1,...,g) in an extension K of k. Let 
Ka and k, be the completion of K and k with 
respect to -adic and p-adic valuations, re- 
spectively. Then we have [Ky k] = e; f, and 
K Gy kp = Ky, ++ Kg (direct sum). The 
relative different Dx, is expressed as (the 
p-component of Dx) = TTL, Dx, j, Fora 
Galois extension K/k the p-conductor i = DÉ 
for the norm-residue and the conductor p* of 
local extension K,,/k, have the same exponent 
c. For a local field K wh, each norm-residue 
modulo f, is a norm of an element of K,. 
Hence precise results concerning the norm- 
residue in local fields can be applied imme- 
diately to a global field K/k (- 257 Local 
Fields). 

We can also apply the method of the idele 
group of an algebraic number field K, and 
therefrom we can prove results concerning the 
ideal class group, unit group, and zeta function 
of k(-^ 6 Adeles and and Ideles). 


U. History of the Arithmetic of Algebraic 
Number Fields 


C. F. Gauss (1832) was the first to generalize 
the notion of integers to algebraic number 
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fields in considering the elements of Z[./ —1 ], 
now called Gaussian integers (Z [f ] is the 


principal order of Q/-1 )). After investiga- 
tions by G. L. Dirichlet and Kummer, the 
notion of ideals was introduced by Dedekind 
(1871) [2]. L. Kronecker gave another founda- 
tion for the arithmetic of algebraic number fields 
(1882) [3]. Dirichlet proved the unit theorem 
and, introducing the analytic method to num- 
ber theory, gave the class number formula of 
quadratic fields (— 347 Quadratic Fields). H. 
Minkowski first applied the theory of lattice 
points to number theory (- 182 Geometry of 
Numbers), and K. Hensel introduced the p- 
adic method (- 257 Local Fields). Hilbert 
(1897) [4] and Hasse (1926, 1927, 1930) [6] 
summarized the main results on the arithmetic 
of algebraic number fields known at that time. 
In particular, Hilbert's report centered around 
the arithmetic of Galois extensions, and 
Hasse's around the class field theory obtained 
by T. Takagi, E. Artin, and H. Hasse (- 59 
Class Field Theory). Since c. 1950, when the 
notions of ideles and adeles were introduced, 
cohomology-theoretic methods have been 
successfully applied to number theory (— 6 
Adeles and Ideles). Recently various local 
methods, for example, the Iwasawa theory of 
Z,-extensions, fformal groups, and tp-adic L- 
functions (T. Kubota and H. W. Leopoldt, J. 
Reine Angew. Math., 214/215 (1964)) have been 
frequently applied to research in algebraic 
number fields. 
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15 (VIII.3) 
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A. Definition 


An algebraic variety of dimension 2 is called 
an algebraic surface. In this article, by a sur- 
face we mean a complete irreducible algebraic 
surface defined over an algebraically closed 
field K. 


B. History 


The history of algebraic surfaces originated 
with the study of algebraic functions of two 
variables. In the case of algebraic functions of 
one variable, the introduction of tRiemann 
surfaces attached to such functions played an 
essential role in the development of the theory. 
The study of algebraic functions of two vari- 
ables led naturally to the consideration of the 
surfaces defined by a suitable polynomial 
equation. H. Poincaré and E. Picard are 
among those who studied the homological 
structure of the surface defined by the equation 
P(x,y, Z) = 0. The theory of *Abelian integrals 
(Picard integrals) is one of the consequences of 
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such topological investigations. S. Lefschetz 
obtained further results in this direction. 

M. Noether and geometers of the Italian 
school, such as F. Enriques, G. Castelnuovo, 
and F. Severi, studied algebrogeometric prop- 
erties of algebraic surfaces. In particular, the 
Italian school geometers recognized the im- 
portance of irregularity and thoroughly inves- 
tigated its geometric meaning. In the early 
20th Century they succeeded in constructing 
the great edifice of the theory of algebraic 
surfaces. Though some of their results lack 
rigorous proof, efforts to build a foundation 
for those results have led to the recent devel- 
opments in algebraic geometry. A signifi- 
cant contribution to the modernization of 
the theory was made by 0. Zariski and K. 
Kodaira. 

The resolution of singularities of an alge- 
bratc surface is one of the most fundamental 
problems in the field. When the tuniversal 
domain is the complex number field, function- 
theoretic methods were used by Italian-school 
geometers and R. J. Walker (Ann. Math., 36 
(1935)). Zariski introduced the tyaluation- 
theoretic method to deal with the problem 
when the characteristic of the universal do- 
main is zero. S. Abhyankar (1966) succeeded 
in resolving the case of positive characteristics. 


C. Divisors and Linear Systems 


In what follows, let $ denote a nonsingular 
surface. $ can be embedded into some projec- 
tive space. Let X be a linear system of divisors 
on $ and fy, ,f, be a basis of the detining 
module for X over K. Associated with X we 
have a rational mapping @, : $5 P" defined by 
Q,(P) x (fo(P), . . . , f, (P)) for general points P 
on S. Pullbacks of hyperplanes by ®,. are 
called variable components of 2. Any member 
of X is a sum of a variable component and a 
fixed component of X. Let §’ denote the clo- 
sure of the image of S by ®,.. If dim S’ = 2, a 
general variable component is irreducible. If 
dim S’ = 1, then a general variable component 
is composed of an algebraic system of dimen- 
sion 1, which is called an algebraic pencil. 
These result from Bertini's theorems. For any 
divisors D and D' on S, the intersection number 
(or the fKronecker index) IO, D’) is defmed; 
this number is a symmetric bilinear form such 
that I(D: D)  I(D,: D) for any divisors D, and 
D, linearly equivalent to D and D. respec- 
tively. If C is a nonsingular curve on S, I(C: D) 
coincides with the degree of the restriction 
CD ofD to C. I(D: D) - (D?) issaidtobethe 
self-intersection number of D. If D is an ample 
divisor on S, then (D?) » 0andI(D: C) > 0 for 
any irreducible curve C on S. These properties 
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characterize an ample divisor (Y. Nakai; = 16 
Algebraic Varieties E). 

Let 2 be an irreducible linear system of 
dimension r( > 1), and let C be a generic com- 
ponent of X. Let C' be a member of X different 
from C. Then the set of C-divisors C- C’ forms 
a linear system of dimension v- 1 on C. This 
is called the trace of X on C and is denoted 
by Tr. Z. The trace is, in general, not com- 
plete. The integer dim |Tr.Z| - dim Tr. E = 
ALL is called the deficiency of Z. The defi- 
ciency of the complete linear system |D| is 
denoted by S(D). 

Let x be a point of S, and let €, be the local 
ring of x. Then (5 = | J.., @, is an talgebraic 
coherent sheaf, called the ‘structure sheaf of S. 
Let D be a divisor on S. The sheaf of germs of 
rational functions f such that(f)+D>0 is 
denoted by G', (D). Then H°(S,@s(D)) isa defin- 
ing module for the complete linear system D |. 
DO if and only if (,(— D)is a sheaf of C,- 
ideals. The quotient sheaf (0;/6.(— D) will be 
denoted by 0,. If D is a tprime divisor, then @, 
is the structure sheaf of the algebraic curve D 
(— 9 Algebraic Curves). Let Z be a sheaf on S. 
We set 


(S, F)=E?-o(— 1)! dim HS, F). 


215, Gel will be denoted simply by (S). We 
call p(S)= y(S)  lthe aritbmetic genus of 
the algebraic surface S. Sometimes y(S) is re- 
ferred to as the arithmetic genus of S. We set 
Xs(D) = x(S) x(S, Os(—D)). The integer p, (D) 
-] —y;(D) is, by definition, the arithmetic 
genus of the divisor D. If D is a prime diyi- 
sor, then p, (D) coincides with the arithmetic 
genus of the algebraic curve, i.e., p, (D) = 
dim H' (D, Op). 


D. Riemann-Roch Theorem 


Let $ be a nonsingular surface and let K de- 
note a canonical divisor on S, i.e, K = (œ) for 
some nonzero rational 2-form () on S. If C is a 
nonsingular irreducible curve on $, (K - C): C 
becomes a canonical divisor on C; hence, 
deg((K + C): C) 22p,(C) - 2 byacorollaryto 
the Riemann-Roch theorem on C (+9 Alge- 
braic Curves C). Since IK + C): C) = deg((K + 
C): C), it follows that p,(C) 2 I((K + C): C)/2 + 


1. Moreover, the formula 
p,(D)=I((K + D: DJ241 


holds for an arbitrary curve D on S; this is 
called the adjunction formula. For any divisor, 
we definep, (D) tobeI(D (D +K))/2+1.Then 


15, C(D))= x(8)— 4s( — D) - x(S) + p, 7 D) 1 
-I(D-(D K))/2+ x(S). 
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This formula is called the Riemann-Rocb 
theorem on S (— 366 Riemann-Roch Theo- 
rems C). Applying Serre's duality theorem to 
D (— 16 Algebraic Varieties E) we have 


dim H'(S, €(D)) — dim H? "(S, ((K — D)) 
for 120, 1, 2. 


In particular, dim H?(S, O(D))=I(K — D), 

which is called the index of speciality of D; 
dim H'(S, ©(D))=dim H' (S, O(K — D)) is called 
the superabundance of D, denoted by h'(D). 
The inequality 


I(D) - (K — D) 
> x(S, (D)) =I(D: (D = K))2 (S) 


is called the Riemann-Roch inequality, where 
equality holds if and only if ^'(D) = 0. FD isa 
curve with s connected components, then 


(D+ K)=h'(—D)+ p,(D)—s- x(S). 


In addition, if bi = 0, then h!( D) = 0 and 
thus (D+ K) 2 p,(D) —s-- y(S). This is called 
the Riemann-Roch theorem for the adjoint 
system. Note that D + K| is called the adjoint 
system of D. The Noether formula, (K?) + 
c,(S)= 12y(S), is a special case of Hirzebruch's 
theorem of Riemann-Roch type (- 366 
Riemann-Roch Theorems B). Here, c,(S) 
denotes the second Chern number of S, which 
coincides with the Euler number of § if K is 
the field of complex numbers. 

Let Div(S) denote the group of all divi- 
sors on S; by linearity we can define the bi- 
linear form I(D: D') on Div(S)) = Div(S) & ,Q. 
J = {DE Div(S)) I(D- D') =0 for all D'} is 
a subgroup and X = Div(S),/J is a finite- 
dimensional vector space over Q, on which the 
nondegenerate bilinear form / is induced. 7 has 
a unique positive eigenvalue; thus the other 
eigenvalues are all negative. This is called the 
index theorem of Hodge; it is derived from the 
Riemann-Roch theorem on S. From this, we 
inferthatif D? - I(D:D)-0, then I(D-D'? z 
D? -D forany D’, X is said to be the Neron- 
Severi group of $ and dim X is said to be the 
Picard number of $ (— 16 Algebraic Vari- 
eties P). 

To study [(mD) asa functionof m >> 0,0. 
Zariski writes an arbitrary effective divisor D 
as a sum of D and DO’ eDiv(S), with non- 
negative rational coefficients such that (1) D? 
is arithmetically effective (or, numerically 
semipositive), ie, I(D*.C)20 for any curve C 
on $; (2) D^? 20 or the intersection matrix of 
the support of DC? is negative detinite; and 
(3) I(D?- D) 20. Such a decomposition is 
unique and is called the Zariski decomposition 
of D [7]. If mD ? is a divisor for some m > 0, 
then /(mD) — l(mD(^?). 
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E. Invariants of Algebraic Surfaces 


There are many invariants besides the arith- 
metic genus discussed earlier. We set 4? = 
dim, H?(S, Q9). Then h”? is equal to the 
number of linearly independent holomorphic 
2-forms; it is called the geometric genus and is 
usually denoted by p,. Since h® ! gives the 
maximum among the deficiencies ó(X) of linear 
systems on S, it is called the maximal defi- 
ciency of S. For a divisor C such that h! (C) = 
0, we have ó(C) — h?:!. The number h°! was 
formerly called the irregularity of S, because 
h°- ! was considered to be a correction term in 
the equality p,(S) = p, — NIT. The study of 
higher-dimensional varieties showed, however, 
that it was unnatural to regard h®:! as a cor- 
rection term. At present, by irregularity we 
mean the dimension of the Picard variety of § 
(— 16 Algebraic Varieties P), and we denote 
this number by q. 

When $ is defined over the complex number 
field, we have hf = *?. In particular, q = 
Mi, ! 2 bID This number is equal to the num- 
ber of linearly independent *Abelian simple 
integrals of the first kind; it is also equal to 
one-half the first Betti number of S. In cases 
with positive characteristic, these equalities 
do not hold in general. J.-P. Serre gave an 
example of an algebraic surface § such that 
h?: ! x «9, and J. Igusa gave an example such 
that q « 49:1 2 4^9. Let K be a canonical divi- 
sor of S. The number P, =/(iK) is called the i- 
genus, and P; (i= 2,3, . ) are generally called 
plurigenera. If P, - 0 and d  n,then P, is also 
zero. The numbers p,(S) = h^? — n? +, p,(S) = 
hè’, ab’, ht, P (i=2,3,...)(P,=p,) ae 
tabsolute invariants of A: i.e., they take the 
same values for any nonsingular surface S’ 
that is birationally equivalent to S. However, 
ht! is not an absolute invariant. For a pro- 
jective plane, all plurigenera P, vanish and 
the irregularity q = 0. Thus if S$ is a rational sur- 
face (- 16 Algebraic Varieties J), i.e., a sur- 
face which is birationally equivalent to P?, 
all P, =q = 0. Conversely, any surface with 
q =P, =0 is a rational surface. This is called 
Castelnuovo’s criterion. A ruled surface is de- 
fined to be a surface that is birationally equiv- 
alent to a product of the projective line and a 
curve. AIL P, of a ruled surface equal 0 and 
any surface with P, =P, = 0 is a ruled sur- 
face. This is called the criterion of ruled sur- 
faces (Enriques). 


F. Characteristic Linear 
Families 


Systems of Algebraic 


One of the central problems considered by the 
Italian school was to prove that the irregular- 
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ity q is equal to the maximal deficiency h® £, 
For that purpose, Severi introduced the notion 
of characteristic linear systems of algebraic 
families. Let 2 be an irreducible algebraic 
family of positive divisors on S such that a 
generic member C of X is an irreducible non- 
singular curve, and let r be the dimension of X. 
Let 2 be a 1-dimensional subfamily of 2 
containing C as a simple member, and let C 
be a generic member of X T Then the speciali- 
zation of C'* C over the specialization C’-+C is 
a well-defined C-divisor of degree y= ] (C: C). 
The set of C-divisors thus obtained is called 
the characteristic set. The characteristic set 
forms an (v — 1)-dimensional linear system and 
contains Tr, C|as a subfamily. This linear 
system is called the characteristic linear system 
of JL For any algebraic family of dimension r, 
we have € dim Cl + q. In particular, there 
exists an algebraic family X that contains oo? 
linear systems and such that for a generic 
curve C we have A! (C) = 0. For such an alge- 
braic family, we have the equality r = dim C| 

+ q; hence the inequality q « Ni: ! follows. 
Moreover, if the characteristic linear system is 
complete, we have q = h™!, The proof of the 
completeness of characteristic linear systems 
given by Severi is valid only in some special 
cases (€.g., the case p, = 0). For a complex 
algebraic manifold, a rigorous proof was given 
later. When the characteristic is positive, the 
completeness does not hold in general (Igusa); 
however, for the surface with p, = 0, the com- 
pleteness holds (Y. Nakai). The completeness 
holds if and only if the Picard scheme of $ is 
reduced [ 14]. 


G. Birational Transformations 
Surfaces 


of Algebraic 


Let S and S’ denote nonsingular surfaces. If 
there exists a birational morphism T: SS’, we 
say that S dominates A, and we write S > S’, In 
addition, if T is not an isomorphism, we write 
S >S’. In case there does not exist an S" with 

S > S’, S is said to be relatively minimal. On the 
other hand, if we have §’ > S for any A which 
is birationally equivalent to S, § is said to be 
minimal. Any minimal surface is, by definition, 
relatively minimal. If a minimal (resp. rela- 
tively minimal) surface S is birationally equiva- 
lent to S’, we say that $ is a minimal (resp. 
relatively minimal) model of S’ or of the field 
K(S). A necessary and sufficient condition for 
S to have a minimal model is that S not be a 
ruled surface (Castelnuovo and Enriques). 

Let S be a nonsingular surface and P be a 
point on S. Replacing P by a projective line 
P!, we have a nonsingular surface XI and a 
birational morphism "TAS AN such that T-‘(P) 
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=E=P' and S’—E=S—{P} by T. T:§’3§ 
(or T! in some references) is said to be a 
locally quadratic transformation. Any bira- 
tional morphism between nonsingular sur- 
faces is a composition of locally quadratic 
transformations and an isomorphism. Given 
a birational mapping T: $5 $' and P on S’, 
TT #P }is called an exceptional curve when- 
ever it is not a point. Moreover, it is called an 
exceptional curve of the first kind if T is regu- 
lar along T! (P). Otherwise, it is of the second 
kind. Exceptional curves consist of irreducible 
rational curves. An irreducible curve E is an 
exceptional curve of the first kind if and only 
if (E?) = 1 and E x P! S is relatively minimal 
if and only if S has no exceptional curves of 
the first kind; S is minimal if and only if S has 
no exceptional curves at all. A relatively mini- 
mal surface that is not minimal is a ruled 
surface. Such a surface is either P? or a P!- 
bundle over a curve. In particular, a relatively 
minimal rational surface is either P? or a P!- 
bundle over P". Any surface of the latter type 
is occasionally called a Hirzebruch surface and 
its (K?) is 8, where K denotes a canonical 
divisor. However, (K?) of P? is 9. Define the 
linear genus of a rational surface to be 10. If S 
is not a rational surface, taking a relatively 
minimal surface S’, we deline the linear genus 
p! of S to be (K?) -- 1 of S. 


H. Examples of Algehraic Surfaces 


Let $, denote a nonsingular surface in P? 
defined by a homogeneous polynomial of 
degree m. Let H denote a divisor on $, in- 
duced from a (hyper)plane on P?, Then the 
canonical divisor K is linearly equivalent to 
(m—4)H, ie. K-(m-4)H. Hence, p, —(m— 1) 
(m 2) (m — 3)/6. Moreover, q = 0j if K is the 
field of complex numbers, S, is simply con- 
nected. $, is isomorphic to the product of two 
copies of Pt, i.e. S, x P! x Pi: hence, it is a ra- 
tional surface. §, is also rational. There exist 
27 lines on $,. Contracting 6 mutually disjoint 
lines among these 27 lines, we obtain a pro- 
jective plane. Conversely, given 6 points on 
P? in general position, by performing locally 
quadratic transformations with these points 
as centers we get a cubic surface $4 and a 
birational morphism T: S$, —P"7. The inverse 
images by T of the 6 given points, the proper 
transforms of 15 lines connecting every pair of 
points chosen from the 6, and 6 conics passing 
every 5 of the 6 points by T !, exhaust 27 lines 
on S,. If m = 4, then K ^ 0. In general, a non- 
singular surface with K ~ 0 and q = 0 is said to 
be a K3 surface (- 72 Complex Manifolds J). 
K3 surfaces have certain properties similar 

to Abelian surfaces that are defined to be 2- 
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dimensional Abelian varieties (— 3 Abelian 
Varieties). For K = C, the period mapping 
defined by integrating regular 2-forms has 
been studied extensively in connection with 
moduli theory. 

In general, surfaces with the bigenus P, = 
p, = 1 and g =0 are birationally equivalent to 
K3 surfaces; surfaces with P, = p, = 1 and q- 2 
are birationally equivalent to Abelian surfaces. 
Every Abelian surface has the involution i 
defined by i(x) = -a and the quotient surface 
by iis a singular surface with 16 ordinary 
double points. By performing 16 locally qua- 
dratic transformations successively with these 
singular points as centers we obtain a K3 
surface. Such a K3 surface is called a K ummer 
surface. The original singular surface is also 
called a Kummer surface. A quartic surface in 
P? with 16 double points is an example of a 
Kummer surface. 

The theory of birational classification of 
surfaces was developed by Castelnuovo, En- 
riques, and others of the Ttalian school. This 
theory has been extensively enriched and gen- 
eralized in various ways. Kodaira's theory of 
analytic surfaces includes classification of 
algebraic surfaces (— 72 Complex Manifolds I, 
J), and classification of surfaces in the positive- 
characteristic case has been recently studied in 
detail [4, 5]. When the field K(S) is a subfield 
of a purely transcendental extension K(X, Y), § 
is said to be a unirational surface. If the exten- 
sion K(X, Y)/K(S) is separable, S is a rational 
surface. However, if it is inseparable, § may 
become nonrational, a K3 surface, an elliptic 
surface, or a surface of general type [5, 12]. 
Even for noncomplete surfaces, we have a 
classification theory [ 10] similar to the previ- 
ous ones. The following result is one of the 
applications: whenever S x A x A), § is iso- 
morphic to A?, Here A" means the affine n- 
space (M. Miyanishi, T. Sugie, T. Fujita; 

[7,9, 13]). 

Let I be the ring of integers of a real qua- 
dratic field of discriminant d. The Hilbert 
modular group G = SL(2, II +1} acts on the 
product #7, J^ being the complex upper half- 
plane. The normal complex space #7/G can 
be compactified by adding a finite number of 
points and thus a compact norfhal surface is 
obtained. Resolving these singularities in the 
canonical minimal way, we have a nonsingular 
surface Y (d) over C, which is called the Hilbert 
modular surface with discriminant d. Y(d) is 
simply connected; hence, q( Y(d) 20. f d 2 5, 8, 
12, 13, 17, 21, 24, 28, 33, 60, then Y (d) is a 
rational surface. If d — 29, 37, 40, 41, 44, 56, 57, 
69, 105, then Y (d) is birationally equivalent to 
a K3 surface. If d 2 53, 61, 65, 73, 76, 77, 85, 88, 
92, 93, 120, 140, 165, then Y (d) is an elliptic 
surface with x( Y (d)) = 1, Otherwise, it is a 
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surface of general type (— [ 19,201 in 72 Com- 
plex Manifolds). 

For O-cycles on a surface — 16 Algebraic 
Varieties J. 
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16 (VIII.4) 
Algebraic Varieties 


A. Affine Algebraic Varieties and Projective 
Algebraic Varieties 


Fix a field k. A subset of the n-dimensional 
taffine space k” over k is called an affine alge- 
braic variety (or simply affine variety) if it 

can be expressed as the set of the common 
zeros of a (finite or intinite) set of polynomials 
F(X,, . X.) with coefficients in E Similarly, a 
subset of the n-dimensional *projective space 
P"(K) over & that can be expressed as the set of 
the common zeros of a set of homogeneous 
polynomials G;( Y,, . , Y.) is called a projective 
algebraic variety (or simply projective variety). 
In this section, variety means either an affine 
or projective variety. A variety which is a 
subset of another variety is called a subvariety. 
These varieties are the forerunners of the 
modern, more general versions of algebraic 
varieties, which we will discuss later. 

When V is an affine variety in k”, the set of 
the polynomials in k[X] = kLX,, , X,] that 
vanish at every point of V form an ideal I(V) 
of k[ X]. The residue class ring A, = kLX]/I(V) 
is called the coordinate ring (or affine ring) of 
V. We can regard A, as the ring of k-valued 
functions on V that Can be expressed as poly- 
nomials of the coordinates of k”. When Vis a 
projective variety, the *homogeneous ideal 
generated by the homogeneous polynomials in 
RE Y... Y] that vanish at every point 
of V is denoted by I(V), and the ring A, = 
k [ Y ]/I( V) is called the bomogeneous coordi- 
nate ring of V. 

A variety Vis said to be reducible or irreduc- 
ible according as it is the union of two proper 
subvarieties or not. A maximal irreducible 
subvariety of V is called an irreducible compo- 
nent of V. Any variety can be written uniquely 
as the union of a finite number of irreducible 
components. A variety V is irreducible if and 
only if I( V) is a tprime ideal. When that is the 
case, the field of quotients of A, (when Vis 
affine) or the subfield of the field of quotients 
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of A, consisting of the homogeneous elements 
of degree 0 (when V is projective) is called the 
function field of V and is denoted by k(V). 
Elements of k( V) are called rational functions 
(or simply functions) on the variety V. The field 
k(V) is tfinitely generated over k. The tran- 
scendence degree of k(V) over k is called the 
dimension of V. When V is reducible, the max- 
imum of the dimensions of its irreducible 
components is called its dimension. If W is a 
subvariety of an irreducible variety V, then 
dim V = dim W is called the codimension of W 
on V. A subvariety of pure codimension 1 of 
an affine or projective space can be defined by 
a single equation and is called a hypersurface. 
If the ideal I(V) of a variety V of dimension r 
in a projective space P"(k) is generated by n-r 
homogeneous polynomials, then Vis called a 
complete intersection. Compared with general 
varieties, complete intersections have some 
simpler properties. On the other hand, many 
important varieties are not complete intersec- 
tions, e.g., Abelian varieties of dimension > 2. 

The intersections and finite unions of sub- 
varieties on a variety V are also subvarieties. 
Thus the subvarieties can be taken as the 
*system of closed sets of a topology on V (- 
426 Topology), which is called the Zariski 
topology of the variety V. When k is the field of 
complex numbers, V can be viewed as an 
tanalytic space, and the topology of Vas such 
(the "usual" topology) is much stronger than 
the Zariski topology. For the rest of this arti- 
cle, varieties will be considered as having Zar- 
iski topologies unless stated otherwise. Terms 
such as Zariski open, Zariski closed, and Zar- 
iski dense are used to mean open, closed, or 
dense in a Zariski topology. Suppose a con- 
dition (P) concerning the points of an irreduc- 
ible variety V (concerning the elements of a 
set M parametrized by the points of V) is 
satisfied in a nonempty Zariski open set of V. 
Then we say that the condition (P) holds at 
almost all points of the variety V or at general 
points of V (almost all elements of the set M). 

Let U and V be affine varieties in k" and km 
respectively. Then the product set U x Vis an 
affine variety in K"*" and is called the product 
algebraic variety (or simply the product) of U 
and V. Note that the Zariski topology on 
U x Vis stronger than the product of the 
topologies of U and V. When k is falgebrai- 
cally closed, U x Vis irreducible if U and 
V are irreducible. 

Suppose that k is algebraically closed. Let d 
be a ‘prime ideal of k[ X] = k[X,, , X,], and 
let V be the affine variety in k" detined as the 
zero points of $. Then I(V) = $ (Hilbert zero 
point theorem) (- 369 Rings of Polynomials 
D). Therefore there exists a one-to-one corre- 
spondence between the set of prime ideals of 
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k[ X] and the set of irreducible varieties in k”. 
In particular, the tmaximal ideals correspond 
to the points of k”. Similarly, there exists a 
one-to-one correspondence between the set of 
homogeneous prime ideals of k[ Y] other than 

i-o KEI Y] and the set of irreducible sub- 
varieties in P’’(k). 

When we deal with nonlinear algebraic 
equations, we cannot expect asimple, clearcut 
theory without assuming that k is algebraically 
closed. Hence we take an algebraically closed 
field K containing k and regard a variety V in 
K" as a subset of the variety V, in K" defined 
by the same equations. From now on, we 
suppose that k is algebraically closed. If the 
ideal I(V) of k[X] or k[ Y] determined by a 
variety Vis generated by polynomials with 
coefficients in a subfield K of k, we say that V 
is defined over K or that k’ is a field of defi- 
nition for V. Any variety has the smallest field 
of definition, which is finitely generated over 
the prime field. In the theory of A. Weil [92], 
we fix an algebraically closed field K that has 
an infinite transcendence degree over the 
prime field, This K is called the universal 
domain. A point of Vis called a k’-rational 
point of V if all of its coordinates belong to a 
subfield k’ of K. Let K,, K, be two extension 
fields of a field L, and let (x)e K}, (y)e K5. 

We say (y) is a specialization of the point (x) 
over L (notation: (x) p( y)) if all polynomials 
f(X)ELLX,,....X,] satisfying f(x) = 0 also 
satisfy f(y) = 0; in other words, if there exists a 
homomorphism of L-algebras L [x,, ,x,] > 
L[y,,..y.] mapping x; to y;. Let K be the 
universal domain, Van irreducible variety in 
K", and k' (c K) a field of definition for V 
having a finite transcendence degree over the 
prime field. Then there exists a point (x) of V 
such that all points of V are specializations of 
(x) over k’. Such a point (x) (in general not 
uniquely determined) is called a generic point 
of V over k’. The ring K'[x] is isomorphic to 
kLX]/I( V) AK'[X] over k’. (Some authors use 
the term generic point to mean almost all points 
as defined earlier.) 


B. Local Rings 


Let V be an affine variety and let W be an 
irreducible subvariety of V, Let By be the 
subset of A, consisting of the elements that 
vanish identically on W. Then y is a prime 
ideal of A,. The ring of fquotients of A, with 
respect to Py is denoted by Dy, y or by Dy 
and is called the local ring of W on V (or of 

V at W). Suppose for simplicity that Vis irre- 
ducible. Then Dy is the subring { f/g |f, g € Ay, 
g By} of k(V), and the tresidue field of Dy 
modulo the maximal ideal Bu Dy can be 
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identilied with k( W). When a function o on 
V (o € k( V)) belongs to Dy, it is said to be 
regular at W. For a given function ge D V), 
the set of the points of V where ọ is regular 
is Zariski open. In the case of a projective 
variety, the local ring Dy , is defined as the 
subring of the ring of quotients of A, with 
respect to $y, consisting of the homogeneous 
elements of degree 0. 

A mapping from an open set U of a variety 
V to k that is regular at every point of U is 
called a regular function on U. The ring of 
regular functions on U is denoted by A,. By 
assigning A, to each open set U, we can define 
a ‘sheaf of rings @ on V, of which the ‘stalk 
Oy, at a point XE V is the local ring Sy, ,. The 
sheaf C, is called the sheaf of germs of regular 
functions on V (or the structure sheaf of V) 
(- 383 Sheaves H). 


C. General Definition 


Consider a pair (V, 0) of a topological space 
V and a sheaf (? of germs of mappings from 

V to k. If V has a finite open covering (U;) 
such that each (U,, 0| U) is isomorphic to an af- 
fine variety V, (in the sense that there exists a 
homeomorphism from U;to V; that transforms 
0| U, to the structure sheaf of V), the pair (V, 0) 
is called a prealgebraic variety over k, and 0 

is called its structure sheaf. Usually (V, (0) is 
denoted simply by V. 

A regular mapping between prealgebraic 
varieties is detined as a continuous mapping 
g: V 5 V' satisfying poge@, , for any xe V 
and oe Oy: gx Furthermore, if g is a homeo- 
morphism and g^ is also regular, then g 
is called an isomorphism or a hiregular map- 
ping. The Cartesian product X x Y of pre- 
algebraic varieties X and Y is locally a prod- 
uct of affine varieties. Therefore X x Y has 
the structure of a prealgebraic variety. A pre- 
algebraic variety X is called an algebraic va- 
riety if the image of the diagonal mapping X > 
X x X is closed in the Zariski topology of 
the product variety X x X (“separation con- 
dition"). (This definition is due to J.-P. Serre 
[81]. The separation condition corresponds 
to tHausdorff’s separation axiom. If Wis a 
locally closed subset (i.e., the intersection of 
an open set and a closed set) of an algebraic 
variety V, then it becomes an algebraic variety 
in a natural manner (the germs of regular 
functions at Pe W are taken to be the germs of 
functions induced on W by the functions in 
Oy, p). Locally closed subvarieties of k” or P"(K) 
are called quasi-affine or quasiprojective alge- 
braic varieties, respectively. Definitions of 
irreducibility and local rings for general alge- 
braic varieties are given in the same manner 
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as before. In this article, algebraic varieties will 
often be referred to simply as varieties. 

The notion of an irreducible algebraic 
variety was developed from that of abstract 
algebraic variety (or simply abstract variety) 
defined by Weil. 


D. Scbemes 


The set of prime ideals ( 4 (1)) of a commuta- 
tive ring A with unity element 1 is denoted by 
Spec(A) and is called the spectrum of A. For 
any subset a of A, we denote by V(a) the set of 
the prime ideals containing a. We define a 
topology on Spec(A) in which the closed sets 
are V(a). This, again, is called the Zariski 
topology of Spec(A). For an element f of A, the 
open set D( f)=Spec(A)— V(f) is called an 
elementary open set. The elementary open sets 
form a base of open sets in the Zariski topol- 
ogy of Spec(A). The set of closed points is 
nothing but the set of maximal ideals of A. 
Assigning to each point $ of Spec(A) the tring 
of quotients A,, we obtain a sheaf of rings 4 
on Spec(A). We have the equality l'(D(f), A)= 
A,, where A, is the tring of quotients by the 
multiplicative system { f” n z 0j. In particular, 
we have l'(Spec(A), A) = A. Regarded as a 
‘local-ringed space with 4 as the structure 
sheaf, Spec(A) is called an affine scheme. 

A local-ringed space X which is locally 
isomorphic to an affine scheme is called a 
scbeme. A morphism of schemes is, by defi- 
nition, a morphism between them as local- 
ringed spaces. Thus we obtain a ‘category 
whose objects are schemes. We denote it by 
(Sch). Giving a morphism f: X 5 Spec(A) is 
equivalent to giving a ring homomorphism 
I(f): AST(X, 0,). Hence the category of 
affine schemes (which is a "full subcategory of 
(Sch)) is contravariantly equivalent to the 
category of commutative rings. If there is given 
a morphism of schemes f: X S, X is said to 
be an S-scheme or a scheme over §, and f is 
called the structure morphism and $ the base 
scheme. For two S-schemes f: X 98$, g: YS, 
a morphism of S-schemes is defined to be a 
morphism of schemes h; X > Y with f 2goh. 
Thus we obtain the category of S-schemes 
denoted by (Sch/S). Spec(Z) is the unique ‘final 
object in (Sch), hence (Sch) is nothing but 
(Sch/Spec(Z)). 

The tfiber product always exists in (Sch). In 
fact in the case of affine S-schemes X = Spec(B) 
and Y= Spec(C) with § = Spec(A) we have 
Xx,Y= Spec(B Gi. C), and in the general case 
we construct X x , Y by patching together fiber 
products of affine schemes. 

A morphism f: X 5$ is called separated if 
the image of the diagonal morphism Ay; : 
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X ^ X xs X is closed. We also say that X is 
separated Over $ or X is a separated S-scheme. 
A scheme X is said to be separated if it is 
separated over Spec(Z). All affine schemes are 
separated. 

When K is a field, Spec(K) is a space having 
only one point and equipped with K as the 
stalk of the structure sheaf. For a point x of a 
scheme X, denote by k(x) the residue field of 
Oy... For f eC, . we call the residue class off 
in k(x) the value off at x, denoted by f(x). We 
have a natural morphism i,: Spec(k(x))— X 
whose image is {x}. More generally, we call a 
morphism i of a spectrum Spec(K) of a field K 
to X a point of X with values in K. Such a 
point is determined by a point x in X and an 
embedding of k(x) in K. A point of X with 
values in an algebraically closed field is called 
a geometric point. For a morphism f: X > 
S and a point $ in S, the fiber product X x , 
Spec(k(s)) is called the fiber off over s and 
denoted by f'^!(s) For a geometric point 
Spec(K)S, X x sSpec(K) is called a geo- 
metric liber. 

A scheme X is called reduced if the local ring 
at each point of X has no tnilpotent elements. 
A scheme is said to be irreducible if its underly- 
ing topological space is not a union of two 
proper closed subsets. A scheme is called in- 
tegral if it is reduced and irreducible. Every 
local ring of an integral scheme is an tintegral 
domain. If a scheme X has an affine open 
covering { U; = Spec(A,)} such that every A; 
is a tNoetherian ring, X is said to be locally 
Noetherian. A locally Noetherian scheme is 
called Noetherian if its underlying topological 
space is  +(quasi-)Compact. 

A morphism f: X 9 Y = Spec(A) is said to be 
locally of finite type (of finite type) if X has an 
open affine covering (a finite open affme cover- 
ing) {U,= Spec(A,)} such that each A; is a 
finitely generated A-algebra. A general mor- 
phism f:X — Y is said to be locally of finite 
type (of finite type) if there is an open affine 
covering {V;} of Y such that every restriction 
of f:f (VV; is locally of finite type (of 
finite type). If f: X — Y is (locally) of finite 
type we say that X is (locally) of finite type 
over Y. 

A scheme of finite type over a field K (i.e., 
over Spec( K)) is called an algebraic scheme 
over K. There is a natural equivalence of 
categories between the category of reduced 
separated algebraic K-schemes (as a full sub- 
category of (Sch/K)) and the category of alge- 
braic varieties over K (defined in Section C) 
equipped with regular mappings as mor- 
phisms. Hence we identify these categories 
from now on. Occasionally, algebraic vari- 
ety means irreducible variety. Nonalgebraic 
schemes are also important as tools for the 
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study of algebraic varieties. For example, for a 
point x in a scheme X there is a canonical 
monomorphism j,: Spec(@y Ja X by which 
the unique closed point of Spec(Cy Ji map- 
ped to x. If for two algebraic K-schemes X, Y 
and for two points xe X, ye Y there is a K- 
isomorphism Oy „= fe, then suitable neigh- 
borhoods of x and y are isomorphic over K. 

Many concepts concerning varieties, e.g., 
dimension, generic points, specialization, can 
be naturally extended to the case of schemes 
by virtue of commutative ring theory. 

A morphism of schemes f': X 5 Y is called 
proper if it satisfies the following two con- 
ditions: (1) f is separated and of finite type, (2) 
for every scheme T and for every morphism T 
— Y, the morphism X x , T T obtained from 
f by the "change of base" is a closed mapping. 
We also say that X is a proper Y-scheme or X 
is proper over Y. A proper algebraic K-scheme 
is called complete. A projective variety is com- 
plete, while an affine variety over K is com- 
plete only when it is of dimension zero. Every 
algebraic variety can be embedded in a com- 
plete variety (M. Nagata [63]). 

A morphism of schemes f: X — Y is called 
affine if every inverse image by f of an open 
affine subset of Y is again an affine scheme. 

A morphism of schemes f: X 5 Y is called 
finite if it is of finite type and there is an affine 
open covering { U;- Spec(A,)} of Y such that 
f (U) = Spec(B,), where B, is tintegral over 
A;. For a locally Noetherian scheme Y and a 
morphism of schemes f: X > Y the following 
three conditions are equivalent: (i) f is finite; 
(ii) f is affine and proper; (iii) f is proper and 
every fiber off is a finite set. For a finite sur- 
jective morphism of Noetherian schemes f: 

X — Y, X is an affine scheme if and only if Y 
is an affine scheme. 

A morphism of schemesf: X Y is said to 
be flat if for each point x € X, Oy , is a tflat 
Oy, module. If, moreover, fis surjective, 
then f is called faithfully flat. Assume that 
g: Y’+ Y is a faithfully flat morphism of finite 
type of locally Noetherian schemes and f: X > 
Y is a morphism of schemes. Then for many 
important properties of morphisms, f has these 
properties if and only if the pull-back fy.: 

Xx y Y * Y! has the same properties (theory of 
descent [29, 30]). 


E. Cohomology Theory 


Let (X, 0) be a ringed space. An C-Module 
De, a sheaf of O-modules) F is said to be 
quasicoherent if for each point x of X there 
exist a neighborhood U of x and an fexact 
sequence MN F0, where M and N are 
free Oy-Modules. An (/-Module F is said to 
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be of finite type if F is locally generated by a 
finite number of sections over (; F is of finite 
presentation if, locally, there exists an exact se- 
quence (0? — (0* ^ F 0 where p and q are posi- 
tive integers (they need not be globally con- 
stant); F is coherent if(i) F is of finite type and 
(ii) the kernel of any homomorphism (C > Fy 
(where n is an arbitrary positive integer, and U 
is an open set) is of finite type. Obviously, if F 
is coherent, then F is of finite presentation, 
which implies that F is quasicoherenet and of 
finite type. In the category of U-Modules, the 
full subcategory of coherent sheaves is closed 
under almost all operations of sheaves. If ( 
itself is coherent as an O-Module, (/ is said to 
be a coherent sheaf of rings. In this case every 
(-Module of finite presentation is coherent. 

The structure sheaf of a locally Noetherian 
scheme is a coherent sheaf of rings. On a lo- 
cally Noetherian scheme X, every quasicoher- 
ent sub-@,-Module or quotient @,-Module of 
a coherent 6,- Module is coherent. A coherent 
€,-Module on an algebraic variety V is called 
a coherent algebraic sheaf. 

Let X = Spec(A) be an affine scheme. Then 
every quasicoherent (/,-Module F on X is 
generated by its global sections. The corre- 
spondence F 2 I(X, F) defines an equivalence 
beeween the category of quasicoherent sheaves 
on X and the category of A-modules; if A is 
Noetherian, then the coherent sheaves and the 
finite A-modules correspond to each other 
under this equivalence. 

Let X be a separated scheme, and W= {U,} 
an affine open covering of X. For each quasi- 
coherent @,-Module F, the cohomology group 
H4(X, F) is canonically isomorphic to the 
tCech cohomology H4(U, F) (- 383 Sheaves 
F). If X is of dimension d, then H*(X, F) 20 
for every sheaf F of Abelian groups on X and 
q>d. 

For a scheme X we define the cohomological 
dimension cd(X) to be the largest integer d 
such that H4(X, F) #0 for a quasicoherent Oy- 
Module F on X [35]. The cohomological 
dimension cd(X) does not exceed the dimen- 
sion of X. If X is an affine scheme, then cd(X) 
= 0. The converse is true under the assump- 
tion that X is Noetherian (Serre’s criterion 
[29, III, For an algebraic scheme X of dimen- 
sion n, cd(X) = nif and only if X is complete 
(S. L. Kleiman). 

Let f: X — Y be a proper morphism of 
Noetherian schemes. Then for every coherent 
0,-Module F and for every qz 0, Rf (F) (> 
200 Homological Algebra) is also coherent. In 
the special case of Y = Spec(k) with a field k 
this means that for an algebraic coherent sheaf 
F on a complete variety X the cohomology 
group H*(X, F) is a finite-dimensional vector 
space over k. 


Let X be a scheme over k and let F be a 
locally free &,-Module of rank y (i.e., an Oy- 
Module which is locally isomorphic to (5). If 
we take an open covering {U,} of X and iso- 
morphisms @;: Fiy, fär, then p;o ø; ! defines 
a morphism gj: U;f1 U;> GL(n, k), which is 
called the coordinate transformation of F. If we 
construct a tvector bundle B on X by the same 
coordinate transformations fij» then F can be 
regarded as the sheaf B(B) of germs of sections 
of B. By means of the canonical homomor- 
phism GL(n, k)> PGL(n— 1, k) we can con- 
struct a projective bundle P(F) on X (which is 
said to be associated with F). (Note that in 
[29], P(F) is defined to be a projective bundle 
with coordinate transformations wl, Le, as- 
sociated with the dual of F in our sense.) This 
procedure of associating P(F) with a locally 
free (/,-Module F can be generalized for any 
quasicoherent Os-Module of finite type on 
an arbitrary scheme S. 

A closed (locally closed) S-subscheme f: X 
S of p: P(F)5S is called a projective scheme 
(quasiprojective scheme) over S, or f is said to 
be a projective morphism (quasiprojective mor- 
phism). A projective morphism is proper. A 
reduced projective scheme over a field k is 
nothing but a projective variety over k. We 
can develop the theory of projective schemes 
by means of tgraded rings in a way similar to 
affine schemes. 

A locally free Ó,-Module of rank 1 is called 
an invertihle sheaf. Invertible sheaves corre- 
spond to line bundles up to isomorphisms. Let 
P = P*(k) be a projective space, (y,, y. . . , Yy) 
a system of homogeneous coordinates of P, 
and U; the open subset of P defined by y; z 0. 
Denote by (/(n) the invertible sheaf on P de- 
fined by the coordinate transformation de 
(y;/ y)". More generally, let p: P = P(F)2 be 
the projective bundle associated with a locally 
free Us-Module F of rank N + 1 on a scheme 
S. Then there is an invertible sheaf O(n) = 
CU on P with the properties: (i) for each 
SES its restriction to the fiber p! (s) = P^(K(s)) 
is O(n) detined above; (ii) p, (C(n)) = S'(F) for 
n » 0 where S"(F) denotes the nth symmetric 
product of F. The invertible sheaf €(1) is called 
the tautological line hundle on P. (Note that 
(( 1) in the sense of [29] is O( -1) in our sense, 
but since the definition of P(E) is also differ- 
ent, the above property (ii) holds without 
modification.) 

For a quasiprojective S-scheme f: X 9 P(E) 
9$, the restriction of (f( 1) to X is denoted by 
Oy( 1) (or simply OU 1). An invertible sheaf L on 
X is called very ample over § if there exist a 
locally free €,-Module of finite type E on § 
and an S-immersion i; X — P(F) such that 
Oy( 1) = L; L is called relatively ample over $ or 
S-ample if L?" is very ample over $ for a cer- 
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tain n > 0 (- Section N). When S is an affine 
scheme, an ample (very ample) sheaf over § is 
simply called ample (very ample). There is the 
following cohomological criterion of ample- 
ness (generalized Serre’s theorem). 

Let Y be a Noetherian scheme, f: Xo Y a 
proper morphism, and L an invertible (,- 
Module. Then the following four conditions 
are equivalent: (i) L is f-ample; (ii) for each 
coherent @,-Module F there is an integer 
N such that R*f,(F & L9") = 0 for alln > N 
and q > 0; (iii) for each coherent sheaf of 
ideals Y of (?, there is an integer N such that 
Rf, (4 & L®")=0 for alln>N. They imply 
the condition (iv): for each coherent Oy- 
Module F there is an integer N such that the 
canonical homomorphisms f*f,(F © L9") 
F G L?" are surjective for all n> N. 

Let X be a scheme proper over a field k 
(such an X is called a k-complete scheme), 
and let F be a coherent Q-module. Then all 
H*(X, F) are finite-dimensional k-vector 
Spaces (finiteness theorem), and y(F) = 
Y ( -1) dim H*(X, P) is a finite number, 
called the —Euler-Poincaré characteristic of F 
over X. For every invertible sheaf L, y(F & 
L9") is a polynomial in m, which is said to 
be the Snapper polynomial [48]. Supp F, de- 
fined to be the set (x EX F, £0}, is a closed 
subset of X and is called the support of F. The 
degree of the polynomial y(F & L9") is at 
most Y = dim supp F, and we have y(F & L9") 
= e'm’/r! + lower terms in m for some e € Z. e 
is said to be the intersection number of L” with 
F [48]. When W is a closed subscheme of X 
defined by an (/,-1deal I, (L' - W) is defined to 
be the intersection number of L" with (0, /. f, 
where r = dim W. If L is ample, then (L’: W) > 
0 for any W. In particular, when W= X, it 
follows that (L") > 0, where n = dim X. The 
converse of this fact is the Nakai-Moishezon 
criterion, saying that if (L' W) > 0 for any 
closed subvariety W of X where r = dim W, 
then L is ample [56,64]. 

When L is ample, H*(X, F 9) L8") = 0 for 
any d > 0 and for sufficiently large m; hence 
y(F & L9")- dim H*(X, F & L9") and the 
Snapper polynomial y(F & L9" is also called 
the Hilbert polynomial or the Hilbert charac- 
teristic function of F.If L is the invertible sheaf 
detined by a hyperplane section of X in PN, i.e., 
L = (@y(1), then y(L9") = dim H*(X, L8" = 
dim Rm for sufficiently large m where @ R, 
denotes the homogeneous coordinate ring of 
X in Př. 

In general, for a complete irreducible variety 
V of dimension v, we put 


1) (6) Y. (Ae 


(h°-4 = dim H*(V, ©,)) and call it the arithmetic 
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genus of V. Classically, the number p,(V) de- 
fined by p(V)  ( CD'(x(V)— Uschi — ho! 
+...+h°:} was called the arithmetic genus of 
V, instead of y( V). When V is a nonsingular 
irreducible curve, p,(V) is the usual tgenus. If 
V is a projective variety, the constant term of 
the Hilbert polynomial is y(V) and the coeffi- 
cient of its highest term is (deg V)/r!. 

Let V be a tnormal variety, and D a tdivisor 
on V. If, for each point x EV, we denote by 
((D), the set of the functions fe K( V) that 
satisfy (f) - D 2 0 on some neighborhood of x, 
we obtain a coherent algebraic sheaf €(D), and 
we have dim H?( V, O(D)) = I(D). If moreover, V 
is complete, we put y,(D) —2y(V)— y(0(— DI 
and call it the virtual arithmetic genus of D. 
Classically, the number p(D) A —1) !(y,(D) 
= 1) was called by that name. When D is effec- 
tive and has no multiple components, y,(D) 
coincides with the arithmetic genus of D re- 
garded as a variety. In general, y,(D) stays in- 
variant if D is replaced by a divisor that is 
algebraically equivalent to D (-» Section N). 

If D is a Cartier divisor, O(D) is an inver- 
tible sheaf. For two Cartier divisors D,, D}, 
O(D, * D;) - €(D,) ® O(D,), and D; and D, 
are linearly equivalent if and only if O(D,)= 
((D, ). 

Let V be an irreducible nonsingular variety 
and let Q? denote the sheaf of germs of regular 
differential forms of degree p(Q? — (),). If V is 
complete, then we denote dim H*(V, O^) by 
Die 

Serre's duality theorem: Let V be a nonsin- 
gular complete irreducible variety of dimen- 
sion r, B an algebraic vector bundle over V, 
and B* the dual vector bundle of B. Denote by 
B and #* the sheaves of germs of sections of 
B and B*, respectively. Then (i) H'(V, Q”) is 
canonically isomorphic to k, and (ii) H4 ( V, B) 
and H" *( V, 4* & (Y) are dual to each other as 
linear spaces by means of the "cup product of 
the above spaces with H'(V, (Y) x K. In partic- 
ular, H*( V, QP) is dual to HI *(V, O" "y; hence 
we have h^? =h Pra, 

This theorem was extensively generalized 
by Grothendieck in the category of schemes 
[1, 331. 

When the field Kis of characteristic 0, we 
furthermore have h”! — t^? by complex con- 
jugation (— 232 Kahler Manifolds C); but in 
characteristic p, there are examples for which 
this symmetry does not hold [61]. In general, 
h®-4 is a trelative invariant but not an tab- 
solute invariant; however, as h? is the dimen- 
sion of the hnear space of the differential forms 
of the first kind with degree p, h^? is an ab- 
solute invariant. Hence 9? is also an absolute 
invariant in the case of characteristic 0. When 
the characteristic is positive, the absolute 
invariance of °:? has not yet been proved. 
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F. Simple Points and Singular Points 


Let V be a variety over an algebraically closed 
field k We say that a point P of Vissimple 

or that Visnonsingular or smooth at P if the 
local ring Op is a fregular local ring. Since the 
problem is local, we may assume that Vis an 
affine variety in K”. Then the simplicity of P on 
Vis equivalent to the following condition: P is 
contained in only one irreducible component 
of V, and if that component has dimension r 
there exist n-r polynomials F((X) in IV) such 
that rank (CF/OX J-P = n—r.A point of V 
that IS not simple is called a singular point or 
a multiple point. The set of singular points on 
V (called the singular locus of V) is a proper 
closed subset of V. A variety with no multiple 
points is called smooth or nonsingular. 

This notion can be made relative. A mor- 
phism f: X 5 Y of a locally Noetherian scheme 
is called smooth if f is flat and locally of finite 
type and all the geometric fibers off are non- 
singular. In the case of an affine morphism 
f: X =Spec(R[X,,..., Xs fis fS) Y= 
Spec(R) of relative dimension r (by which we 
mean the dimension of the general fiber) with 
a Noetherian ring R, the smoothness off 
amounts to a condition that rank ((0f;/0Xj)(x)) 
= $ at each point x of X. 

When for a point P of a variety V the local 
ring Opis *normal, P is called a normal point. 
A simple point is normal. The set of normal 
points is a nonempty open subset of V. An 
irreducible variety whose points are all normal 
is called a normal algebraic variety (or simply 
normal variety). The singular locus of a normal 
variety has codimension z 2. For an irreduc- 
ible variety V, there exists a pair (V', f) of a 
normal variety V' and a birational finite mor- 
phism f:V'—5 V; V’ is unique up to isomor- 
phisms and is called the derived normal model 
or normalization of V. 

Simplicity and normality for V at a sub- 
variety W are detined in the same way as at 
a point by using the local ring Oy w. 

For a morphism f: X 2 Y of locally Noe- 
therian schemes, locally of finite type, the fol- 
lowing three conditions are equivalent: (i) f is 
smooth and every fiber off is a discrete set; (ii) 
fis flat and every geometric fiber over Spec(K) 
off is a union of spectra of fields isomorphic 
to K; (iii) f is flat and every fiber off over 
y € Y is a union of spectra of fields that are 
finite tseparable extensions of k(y). These 
conditions are local with respect to X. If a 
morphism f satisfies these equivalent con- 
ditions, we say that f isétale or X is étale over 
Y. A morphism 


f: X= Spec(RLX,, , XM . | hi) 
—> Y= Spec(R) 
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is étale if and only if det((0f;/0 X5) (x)) 0 for all 
xe X. Hence étale morphisms correspond to 
local isomorphisms in the analytic category. 
Fora surjective étale morphism f: X 2 Y many 
important geometric properties (reduced, 
integral, normal, nonsingular, etc.) hold on 

X if and only if they hold on Y (theory of 
descent). 


G. Dimension Theorems 


Let V be an irreducible variety, and let U and 
W be irreducible subvarieties of V. Then any 
irreducible component of Uf) W that is simple 
on V has dimension > dim U + dim W = dim V. 
When the equality holds, the component is 
called a proper component of the intersection 
UN W. If every component of U N W that is 
simple on Vis proper, we say that U and W 
properly intersect on V. Any two subvarieties 
U and W of P"(k) with dim U + dim W2n 
intersect each other. When Vis an irreducible 
r-dimensional variety in P"(k), the number of 
points of intersection VN L of V with an (n= 
r)-dimensional linear variety L is independent 
of the choice of L as long as L is in a “gen- 
eral position.” This number is called the degree 
of Vand is denoted by deg( V). Letting (( 1) be 
the fundamental sheaf of P’(k), we have deg V 
—(6(1)- V) (- Section E). 


H. Group Varieties 


An algebraic variety G is called an talgebraic 
group if it has a group structure and if the 
mapping G x GG sending (x, y) to xy ‘is 

a morphism. Every algebraic group is quasi- 
projective (Chow). If G is irreducible, then 

it is also called a group variety; a complete 
group variety is called an *Abelian variety (- 
3 Abelian Varieties B, 13 Algebraic Groups B). 
A scheme G over another scheme § equipped 
with morphisms over $:G x stat GG, and 
SG, called multiplication, inverse, and unit 
section, respectively, which satisfy the relations 
corresponding to the usual axioms of group, is 
called a group scheme (over S). As a point set, 
G is not a group, while, for any scheme T over 
$ the set G(T) = Hom,(T, G) of the morphisms 
from T to G is a group (- 52 Categories and 
Functors M). Consider an algebraic group 
scheme G over § = Spec(k). If the characteristic 
of k is zero, then G is necessarily reduced, so 
an algebraic group scheme over k is essen- 
tially the same as an algebraic group; if k has 
characteristic p, there exist algebraic group 
schemes over k that are not reduced. 
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Į. Rational Mappings 


Let f: V-t V? be a morphism of varieties. If V 
is not complete, the image f(V) is not always 
closed; the closure off(V) (in V?) is called the 
closed image of V. The image f(V) contains an 
open dense subset of the closed image. 

Let V and W be irreducible varieties. A 
closed subset T of Vx W is called an algebraic 
correspondence of V and W. We say that points 
P e V and Q e W correspond to each other by T 
if (P, Q)e T. If T is irreducible and the closed 
image of the projection T V coincides with 
V, then the function field k(V) can be identified 
with a subfield of k(T); if we have k(V) = k(T) 
with this identification, then T is called a 
rational mapping from Vto W. Moreover, if 
the same conditions are satisfied for W, then T 
is called a birational mapping (of birational 
correspondence or birational transformation), 
and in this case we have k(V) = k( W). A mor- 
phism can be considered a special kind of 
rational mapping by taking the graph. If T is a 
rational mapping from V to W and W, is the 
closed image of T in W, then k( W,) can be 
regarded as a subfield of k(T) = k( V). If k(V) is 
'separably generated (tpurely inseparable) over 
k( W,), then T is said to be separable (purely 
inseparable). 

Let T be a rational mapping from V to W, 
and let V’ and W’ be irreducible subvarieties 
of V and W, respectively. If there exists an 
irreducible subvariety T’ of T whose projec- 
tions have the closed images V’ and W’, then 
we say V’ and W’ correspond to each other by 
T. The union of irreducible subvarieties of W 
that correspond to V' by T is a closed subset of 
W; it is called the proper transform of V’ by T 
and is denoted by T[V"]. Note that V' 2 V" 
does not imply T[V'] 2 T[ V"]. The set of 
points of W that correspond to the points of 
V’ is called the total transform of V’ by T and 
is denoted by Tf V’}. Identifying k(T) with 
k(V), we have OD, p» Dr in general; if the 
equality holds, we say that T is regular (or 
defined) along V’. In that case, W' is the unique 
irreducible subvariety of W corresponding to 
V? by T. If V’ is simple and of codimension | 
in V, then T is always regular along V*. The set 
U of points of V at which T is regular is a 
nonempty open subset, and the restriction of T 
to U defines a morphism from U to W. A 
rational mapping can be defined as the closure 
of the graph of a morphism defined on an 
open subset of V. The study of a rational 
mapping can be reduced to that of the bira- 
tional morphism T V. Zariski's main theo- 
rem: Let §: X 2 Y be a birational mapping, 
and assume that the inverse S! : Y-X is 
regular, and that X is normal along an irre- 
ducible subvariety X". If there exists an irre- 
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ducible component Y’ of S[ X] with dim X'2 
dim Y', then $ is regular along X’. [tfollows 
from the above main theorem that, if T:V— 
W is a rational mapping and if P is a normal 
point of V such that T[P] contains an iso- 
lated point, then T is regular at P. 

For a birational mapping T: V W between 
complete irreducible varieties, a subvariety V 
of V is said to be fundamental when dim T[V"] 
> dim V’. When V’ is a point (curve) we say V’ 
is a fundamental point (fundamental curve) with 
respect to T. The most classical example of a 
birational correspondence with fundamental 
points is the quadratic transformation T' of a 
projective plane onto itself given by (xg: x, : x5) 
—(x4X3:X5Xo:xgox,) with a suitable coordinate 
system. In this case, T? = identity and the 
points P, defined by x; = 0 for j # i correspond 
to the lines x;=0 by T. Let Cr, denote the 
group of birational transformations of P"(k) 
into itself. These transformations are called 
Cremona transformations when n > 2. Cr is 
generated by linear transformations and qua- 
dratic transformations (M. Noether). Recently, 
Cr, has been studied in detail by M. Dema- 
zure [21] and H. Umemura [91]. Let V be a 
complete nonsingular irreducible variety over 
k [t is called relatively minimal if every bira- 
tional morphism from V to a complete non- 
singular variety V’ is an isomorphism. It is 
called minimal if every birational mapping 
from a nonsingular variety V' to Vis a mor- 
phism. Replacing birational mapping by ra- 
tional mapping, we can define strong (abso- 
lute) minimality. Abelian varieties and com- 
plete nonsingular curves with positive genera 
are strongly minimal. In general, minimality 
implies relative minimality. 

Zariski's main theorem is closely related to 
the general connectedness theorem due to W. 
Fulton and J. Hansen stated as follows: Let P 
be the product of r copies of P," and A be the 
diagonal subvariety of P. If X is an irreducible 
projective variety and f: X —P is a morphism 
with dim f(X)>(r—1)m, then f ' (A) is con- 
nected [22]. The following results are derived 
from this theorem: (1) if X is singular with only 
normal crossings (- Section L), then X can- 
not be imbedded in P?" !. where n = dim X. (2) 
Let X be a nonsingular subvariety of P^ which 
is not contained in any hyperplane. Suppose 
that by the projection z,: PV P"! with cen- 
ter x, x being a general point of P, X is iso- 
morphic to n. (X). Then 3: dim X € 2(N = 2) 
(Zak's theorem). (3) Let X be a nonsingular 
subvariety of P and H be an arbitrary hyper- 
plane section of X. Then dim(Sing H) « 
codim X (J. Roberts). 

By a similar connectedness theorem, it was 
proved that if C is a (reducible) curve with 
normal crossings on P? over C, then the fun- 
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damental group of P? -C is commutative, 
which was conjectured by 0. Zariski [20]. 


J. Rational Varieties 

An irreducible algebraic variety V over k 
whose function field is purely transcendental 
over k is called a rational variety. A complete 
smooth surface § over an algebraically closed 
field is rational if and only if P,(S) = q(S) = 0 
(the Castelnuovo-Zariski criterion, 2 15 Alge- 
braic Surfaces E). 

If the function field of V has a finite alge- 
braic extension which is purely transcendental 
over k, then Vis called unirational. A unira- 
tional curve is in fact rational. More generally, 
if the function field k(C) of a curve C over 
k is contained in a field finitely generated 
and purely transcendental over k, then C is 
rational (Liiroth’s theorem). A unirational 
surface over an algebraically closed field of 
characteristic zero is rational by virtue of the 
above criterion, but in the case of positive 
characteristic there are unirational surfaces 
which are not rational (Zariski). There are 
nonrational unirational threefolds even of 
characteristic zero; for example, all smooth 
cubic hypersurfaces in P* (C. H. Clemens and 
P. A. Griffiths [ 16]), and some smooth quartic 
hypersurfaces in P* (V. A. Iskovskii and Yu. 1. 
Manin [45]). See [6]. 


K. Monoidal Transformations 


Let V be an irreducible variety and ¥ be a 
sheaf of ideals of (/,. For any affine open set U 
of V, .£|U is determined by an ideal a of the 
coordinate ring A of U, Let a,, a, , a, be a 
system of generators of a, and let U' be the 
graph of a rational mapping from U to P”(k) 
such that the points PeU and (ag(P):a,(P): 

: a, (P))e P"(k) correspond to each other by 
U’. Then U’ is uniquely determined (up to 
isomorphisms) by U and a only. Suppose 

V has a covering by affine open sets U;. We 
obtain U; over U; as before. By patching them 
together, we get a birational morphism T: 
V'5 V, which is unique up to isomorphisms. 
This T with V' is called the monoidal trans- 
formation or blowing up of V by the ideal 
sheaf d Note that in some references, the 
inverse transformation T is said to be the 
monoidal transformation. The inverse image 
ideal T ^!(.4) is an invertible sheaf, that is 
relatively ample over V. If W is the support of 
Cy/ J and W# V, then T-'(W) has codimen- 
sion | and T gives rise to the isomorphism 
from V’ = T  (W) onto V- W. Thus one can 
say that V' is obtained from V by replacing W 
with T-'(W), which is locally defined by prin- 
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cipal ideals. If every point of W is a nonsin- 
gular point of V and W is itself nonsingular, 

T '!(W)is the projective bundle over W whose 
fiber is the projective r-space where r = dim V 
= dim W — 1. In general, if W is a subvariety 
defined by a sheaf of ideals .7, then T is said to 
be the monoidal transformation with center W. 
In particular, if W is a point, T is called the 
(locally) ^ quadratic transformation. 


L. Resolution of Singularities 


Given an arbitrary irreducible variety V, we 
have the problem of finding Out a nonsingular 
projective variety V' birationally equivalent 
to V. This is called the problem of resolution 
of singularities. In the case of characteristic 
zero, this problem was solved by Zariski 
(1944) for dimension < 3 and by H. Hironaka 
(1964) [37] for any dimension. In the case of 
characteristic p, S. Abhyankar solved the 2- 
dimensional case (1956) and the 3-dimensional 
case (1966). Hironaka’s theorem of resolution 
of singularities is stated as follows; Let V be a 
variety over a field of characteristic zero. Then 
there exists a finite sequence of morphisms of 
varieties: V.+ V, >. V; > = V such that (1) 
V, has no singular points, (2) each V,,, — V; is a 
monoidal transformation of V; with center D;, 
(3) each D; is a nonsingular subvariety, (4) 
each V, is normally flat along D. Here, Vis 
said to be normally flat along a subscheme D 
of V defined by the sheaf of ideals f, if the 
quotient modules $?/Y?"" are flat Mp ,- 
modules for all p and all points x of D. 

Let V be a nonsingular variety and Dan 
effective divisor on V. D is said to be a divisor 
with (only) normal crossings at x € V, if D is 
defined by f,. f, such that (f,, His a part 
of a tsystem of local coordinates around x. D 
is said to be a divisor with only normal cross- 
ings, if it is SO everywhere. For any subvariety 
W on V, there exist h: HAN that is a composi- 
tion of monoidal transformations with non- 
singular centers, such that h^ (W) has only 
normal crossings. This results from Hironaka's 
main theorem II [37]. The normal crossing 
divisor is also defined for complex manifolds, 
and similar results hold. 


M. Cycles and Divisors 


Let V be an irreducible variety. We denote by 
$8, the set of r-dimensional irreducible sub- 
varieties of V that are simple on V Oe, are not 
contained in the singular locus of V), and by 
3,(V) the free Abelian group with basis 23,. 
Elements of 3,( V) are called cycles of dimen- 
sion Y (or r-cycles) on V. Let A and B be r- 
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cycles; A =} n; 4, B =£ m; A;(A;EB,, A; # A; 
if i +j). If n; > m; for alli, then we write A > B. 
If A 20, then A is said to be a positive cycle. 
For a O-cycle A — n;P, the integer deg(A) = 
X n; is called the degree of A. 

A cycle of codimension 1 is called a divisor. 
A divisor > 0 is usually called effective in- 
stead of positive. If Vis of dimension d and if 
WO  ».,,the local ring Oy wisa !discrete 
valuation ring. The *normalized valuation 
defined by it is denoted by p, ( L For a func- 
tion f € «( V), we say that Wis a zero of order n 
if o (f) 2 n» 0, and that Wis a pole of order 
—nifvy(f)=n<0. Any function fe K V), 
other than the constant 0, has at most a finite 
number of zeros and poles. We denote by 
(f) the sum X vy(f) W extended over all the 
zeros W off, and put (f^); (f), and (f)g— 
(f),=(. We call (Jo; (f), and (f) the 
zero divisor, the pole divisor, and the divisor 
off; respectively. The divisor (f) is equal to 
X vy (f) W, where the summation is taken over 
Wem, ,, and we have (fg) (f) --(g). When V 
is complete and the singular locus of V has 
codimension » 1, then fis constant if and only 
if(f), 20 (or (f)o-0). Let D, and D, be divi- 
sors; if there exists a function f(  0)e KV) 
such that D, — D, =(f), then D, and D, are 
called linearly equivalent to each other and we 
write D, ^ D,. The linear equivalence class 
containing a divisor D is denoted by cl(D). A 
divisor which is linearly equivalent to 0 on a 
neighborhood of each point of V is called a 
Cartier divisor (some authors call a Cartier 
divisor simply a divisor). If V is smooth, then 
any divisor is a Cartier divisor. If a divisor D 
can be written as D=f) on an open set U, 
then the function fis called a local equation of 
D on U. Let T: V'2 V be a rational mapping 
from a normal variety V' to a complete variety 
V, let D be a Cartier divisor on V, and assume 
that the closed image of T is not contained in 
D. Since T is regular on some open set U such 
that codim( V U)> 1, we have a morphism 
9 = T y and the pullback @*(D) defined by 
composing the local equations of D with o 
Taking the closure of this divisor in V', we 
obtain a divisor on V', which is denoted by 
TAD). 


N. Divisors and Linear Systems 


Let V be a complete irreducible variety, fo, 
IJ. f, be elements of the function field KV) 
of V, and D be a divisor on V satisfying (fi) + 
D > 0 for each i. Then the set X of the divisors 
of the form (È a;f) + D, where the a; are 
elements of k and not all zero, is called a linear 
system. The linear space kf, + kf, +. + kf, is 
called a defming module of 2. The divisors in 
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Z are positive and are linearly equivalent to 
each other; if every positive divisor that is 
linearly equivalent to a member of X belongs 
to 2, then Z is said to be a complete linear 
system. For any linear system Z, there exists a 
unique complete linear system containing it, 
which is denoted by ||. The maximal positive 
divisor D, that is contained in all divisors of X 
is called the fixed component of X, and for 
each DEC we call D = D, the variable compo- 
nent of D (or of X). A point P of V is called a 
base point of a linear system JL if P is on each 
variable component of 2. A linear system X 

is called irreducible if its generic member is 
irreducible; otherwise it is called reducible. The 
dimension of a defining module of a linear 
system Z is denoted by 1(C); we call I(C)- 1 the 
dimension of X and denote it by dim X. A 
linear system of dimension 1 is called a linear 
pencil. 

A defining module of a linear system X is 
determined uniquely up to k-isomorphisms. 
Let L be a defming module, and let fo, fis- , f, 
be a linearly independent basis of L over 
k If we associate to each point P of V the 
point Q —(fg(P): f (P): ...: ,(P)) of the n- 
dimensional projective space, then we obtain 
a rational mapping ®, from V to another 
variety V’. Outside the base points of X, the 
rational mapping OD; is regular; and the base 
points are the fundamental points of d. We 
say that D, is the rational mapping defined 
by the linear system 2, When Chas no tixed 
components and ®; is a closed immersion, 

AJ is said to be very ample (or ample). For a 
divisor D, the set of positive divisors that are 
linearly equivalent to D is a linear system, 
which is called the complete linear system and 
is denoted by |D|. We usually write /(D) in- 
stead of [(|D]). If |D] is very ample, we say that 
D is very ample. We say that D is ample (or 
nondegenerate) if mD is very ample for some 

m» 0. 


0. Differential Forms 


Let V be an n-dimensional irreducible variety, 
and let A = k( V) be its function field. We de- 
note by D* the set of derivations of $ over 

k, i.e, the k-linear mappings D: &o f satisfy- 
ing Di fg) - D(f)g + fD(g). Then D* is an n- 
dimensional linear space over &. Let D denote 
the fdual space of D* over R. For each feg, 
let df be an element of D defined by (df, D» = 
D(f) (De D*), Let x,,x5,. .. , x, be a separat- 
ing transcendence basis of KĘ over k, in the 
sense that x,, ,x, are algebraically inde- 
pendent over k and § is a tseparable algebraic 
extension over K(X p Xj. (Such a basis exists 
under the weaker hypothesis that Kis tperfect.) 
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Then dx,,.... dx, form a basis of D over R. 
The homogeneous elements of degree r of the 
tGrassmann algebra of D over & are called 
differential forms of degree r on V (or rational 
r-forms). The set of the differential forms of 
degree H is a 1-dimensional linear space over & 
spanned by dx, A dX, A. A ON. 

A set of n functions f,, , f, in & is called a 
system of local coordinates on an open set U of 
V iff; — fi (P), -fa — f, (P) is a tregular system 
of parameters of the local ring , for each 
Pe U. In that case, fi, ,f,is also a separat- 
ing transcendence basis of K, If P is a simple 
point of V, then there exists a system of local 
coordinates on a suitable neighborhood of P. 
Let (9 be a differential form of degree r on V, 
and write w= X, < m (nl, A ^ dfi, 
where (fi, ., fa) is a system of local coordi- 
nates around P. If the coefficients (o; are 
regular at P, then o is said to be regular at P. 

When Vis a complete variety without sin- 
gular points, a differential form that is every- 
where regular on V is called a differential form 
of tbe first kind (ora regular form); the dif- 
ferential forms of the first kind are determined 
by the function field A and are independent of 
the choice of the nonsingular model V. The 
number of linearly independent differential 
forms of the first kind, of degree n, is denoted 
by p, and is called the geometric genus of V. 

Let V be a complete variety, W an irreduc- 
ible subvariety of V of codimension 1, and P 
a point of W that is simple on V. Choose a 
system of local coordinates (fi). Given a dif- 
ferential form œ on V, we write it as a “poly- 
nomial” in the di and denote by v, (o) the 
minimum of the values of the coefficients for 
the fyaluation Gel ). The number vy (c) is 
determined by w and W, and it is independent 
of the choice of P and of the local coordinates. 
Then c defines a divisor (w) = Dy vy (c) W on 
V, which is called the divisor of a differential 
form (y. The divisor of a differential form of 
degree n (= dim V) is called a canonical divisor 
and is usually denoted by K. The canonical 
divisors form a linear equivalence class of 
divisors. 


P. Albanese Variety, Picard Variety, Néron- 
Severi Group 


Let V be an irreducible variety. Then we can 
construct a couple (A, f) consisting of an 
*Abelian variety called the Albanese variety of 
V and a rational mapping f: V-4A (called an 
Albanese mapping) such that: (i) the image of 
f generates A, L.e., the sum off with itself n 
times, F: V"— A, is generically surjective for 
sufticiently large n; (ii) for every rational map- 
ping g: V— B of V into an Abelian variety B, 
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there exist a homomorphism h; A 2 B and a 
point be B such that g=h: f +b. The Albanese 
variety is uniquely determined up to isomor- 
phisms and f is determined up to translations. 

In the case of k = C, if Vis a complete non- 
singular variety and if q is the dimension of the 
linear space of differential 1-forms of the first 
kind on V, then the first tBetti number i?, is 
equal to 2q. Let œw, , @ be a basis of the 
linear space and let ?,,, y5, be a basis of 
the first homology group modulo torsion. Put 
x; — f, cx and a, = (2, . %4). Then the period 
vectors x; (1 <j < 29) are linearly independent 
over R in C*. If T denotes the discrete sub- 
group of C! generated by the a, then the quo- 
tient group OI is the Albanese variety of 
V. The Albanese mapping is given by the map- 
ping Pe, ol (modT), where P 
is a variable point on V and Q is a fixed point 
on V (> 232 Kahler Manifolds C). 

Replacing the term “rational mapping" by 
“morphism” in the definition, we can define the 
strict Albanese variety of V and prove its 
existence. [t is a quotient Abelian variety of 
the Albanese variety of V. If V is nonsingular, 
both coincide by virtue of the *strong mini- 
mality of an Abelian variety (- Section J). 

Let V be a complete normal variety, U the 
set of the simple points of V, and D a divisor 
on V. Then D is said to be algebraically equiva- 
lent to 0 if there exist a nonsingular curve C, 

a divisor I’ on U x C, and two points P and 

Q on C such that D can be written as D — 
ož(T) = 95(E), where 9p and 9g are the mor- 
phisms o, UA U x PSU x C and pọ:U > 

U x Q-U x C. We denote by G(V), 6,(V), 
and (5,(V) the set of all divisors on V, the set 
of divisors that are algebraically equivalent to 0, 
and the set of divisors that are linearly equiva- 
lent to 0, respectively. We can introduce a 
canonical structure of an Abelian variety into 
(5 (V)/G,(V), which is called the Picard variety 
of V. The dimension g of the Picard variety is 
called the number of irregularity of V; if q = 0 
we say that Vis regular. 

The Albanese variety and the Picard variety 
of V are tisogeneous to each other, and each 
one is the Picard variety of the other. If Vis 
a curve, they are isomorphic and called the 
Jacobian variety (— 9 Algebraic Curves E). 

Using Cartier divisors instead of divisors, we 
get an analogous theory to construct another 
kind of Picard variety which turns out to be 
isomorphic to the Picard variety of the strict 
Albanese variety of V. The group of the linear 
equivalence classes of Cartier divisors can be 
identified with H!( V, 0%), where (fj is the sheaf 
of multiplicative groups of the invertible ele- 
ments in 0,. From this point of view, we can 
generalize the theory of Picard variety to the 
case of schemes also. The theory thus obtained 
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is called the theory of Picard schemes [60]. 
The quotient group NS(V) = 6(V)/G,( V) is 
finitely generated [66] and is called the Neron- 
Severi group of V. We cal] the rank of NS(V) 
the Picard number of V and denote it by p(V). 
In the case of a nonsingular projective vari- 
ety over k = C we have an inequality p(V) < 
h^ (V) (= dim, H' (V, Q})) and the Lefschetz 
number B,( V) — p( V) is a birational invariant 
(where B,(V) is the second Betti number of 
V) [39]. For the positive characteristic case, 
however, the above inequality does not hold 
in general [61]. 

The torsion part of NS(V) is 6,(V)/6,( V), 
where &,( V) denotes the group of divisors 
numerically equivalent to zero (- Section Q) 
(T. Matsusaka). The last fact cannot be gen- 
eralized for higher codimensional cycles [25]. 


Q. General Intersection Theory 


Let V be an irreducible variety of dimension n, 
and let A and B be irreducible subvarieties of 
V of dimension r and s, respectively. If C is a 
proper component of A N B, we can define the 
intersection multiplicity i(A B, C; V) of A and 
B along C on V, which has properties consis- 
tent with our geometric intuitions. In partic- 
ular, it is invariant under biregular mappings. 
If A and B intersect properly on V and if 
C,, , Cy are the proper components of A N B, 
we define an (r+s—n)-cycle A: B by A: B= 
Y, i(A. B. C,; V)C, and call it the intersection 
product of A and B. If each component X, of 
an r-cycle X =}, n, X, and each component Y, 
of an s-cycle Y = Lig mg Y, intersect properly, 
we define 
X-Y-X,X nmi X, Yp. 
Then we have the associative law (X Y): Z = 
X*(Y- Z) for cycles X, Y, and Z whenever 
both sides are defined. Two r-cycles X, and X, 
are said to be numerically equivalent if for 
every (n = r)-cycle Y that intersects them pro- 
perly we have deg(X, Y) = deg(X, Y). 

The theory of intersection is one of the most 
basis theories in algebraic geometry, for the 
other theories can be constructed from it [92]. 


R. Chow Rings 


Let U and T be nonsingular irreducible vari- 
eties. If Z is a cycle on U x T such that Z- 

(U x t) is defined for every point of T, we 

put Z-(U x t) 2 X(t) x t, and we obtain a 
family {X(t)} of cycles on U parametrized by 
the points of T. Such a family is called an 
algebraic family of cycles. Two cycles X, and 
X, whose difference is equal to the difference 
of two cycles in an algebraic family are said to 
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be algebraically equivalent. In particular, if X, 
— X, can be expressed as the difference of two 
cycles of an algebraic family parametrized by 
the points of the affine line, then X, and X, 

are said to be rationally equivalent. The set of 
the cycles that are algebraically (rationally) 
equivalent to 0 is a subgroup 3,( U)(3,,,( U)) of 
J(U). For divisors, rational equivalence coin- 
cides with linear equivalence. 

Let f: V—U be a morphism between non- 
singular irreducible varieties. For an irre- 
ducible subvariety W of V, let W' denote the 
closure of f( W), and put f;(W) = 0 if dim W> 
dim W’ and (ON) m W’ if dim W = dim W’, 
where m= [k(W): k( W’)] is the degree of the 
morphism W— W’. Extending f. ; by linearity, 
we obtain a module homomorphism f}: 3( V) 
— 3(U). If f is proper, then fa induces a 
module homomorphism f, from 3(V)/3,,(V) 
=A(V) to A(U). 

Let U and V be nonsingular irreducible 
projective varieties, and f: V--U be a mor- 
phism with graph T. If Ye 3(U) is such that 
I: (V x Y) is defined, we denote by f#(Y) the 
image of I’: (Vx Y) under the induced isomor- 
phism I — V. Each class of the rational equiva- 
lence class group A(U) = 3(U)/3,.:( U) contains 
a cycle for which (7 is defined. Hence we can 
define f *: A(U)5 A(V). Let A:U >U x U be 
the diagonal morphism, and define x: y = 
A*(x x y) for x, ye A(U). Then A(U) is a ring 
with respect to this product; moreover, it is a 
*graded ring with the grading by codimension. 
This graded ring A(U) is called the Chow ring 
of U, and the mappingf*: A(U)+ Ar V) is a 
ring homomorphism. If f is proper, we have 
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Zero Cycles. Let V be a nonsingular irreduc- 
ible projective variety over an uncountable 
algebraically closed field of characteristic 
zero (say C). Denote by A,,(V) the group of 
classes of O-cycles of degree 0 on V modulo 
rational equivalence. We say A,(V) is finite- 
dimensional if the mapping V" x V”+A,(V) 
sending (a,, ...,a,, Di, b,) to 3a; — XD; is 
surjective for a certain n. 

If A,(V) is finite-dimensional, then the 
canonical mapping A,( V)-»Alb( V) induced 
from V 5 Alb( V) is bijective (A. A. Rojtman). 
In general, the torsion part of A,(V) is isomor- 
phic to that of Alb( V) (Rojtman [75]). If 
kri V) > 0 for some p> 1, then A,(V) is not 
finite-dimensional (Mumford, Rojtman [74]). 
If Vis a surface which is not of general type 
and with h?-°( V) = 0, then A,(V) is isomorphic 
to Alb( V) (S. Bloch, A. S. Kas, D. Lieberman 
[13]). There exist surfaces of general type with 
h>? 20 such that A,(V) =O. For instance, 
Godeaux surfaces are such surfaces (H. [nose 


[43]). 
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One Cycles. Let V be a nonsingular projective 
irreducible variety and N(V) denote (numer- 
ical equivalence classes of l-cycles on V} G9 R. 
Via the intersection pairing, N(V) is dual to 
the Neron-Severi group tensored by R (- 
Section P); and let NE(V) denote the smallest 
cone in N(V) containing all effective l-cycles. 
S. Mori studied the structure of NE(V) in 
detail. One of his results is stated as follows: If 
the anticanonical divisor is ample, then there 
exist rational irreducible curves l ,, l, such 
that (1) —(Ll;: Ky) «n4- 1 and (2) NE(V)= 
R,[!] * ...- R, [1,], where n=dim V, K, is 
the canonical divisor on V, and [/;] denotes 
the class represented by 1,. Moreover, Mori 
has proved that (1) if K, is not numerically 
effective, e.g., if — Ky is ample, then V con- 
tains a rational curve, and (2) if the tangent 
bundle of V is tample, then Vis isomorphic 
to the projective space [59]. (This is called 

the Hartshorne conjecture.) 


S. Chow Coordinates, Hilbert Schemes 


Consider an irreducible algebraic correspon- 
dence T between irreducible varieties V and 
W. Let V' and W' be the closed images of the 
projections of T to V and W, and let a and c 
be their dimensions, respectively. Take generic 
points P, Q of V’, W', and consider the total 
transforms T{P} of P to W and T/QJ of Q 
to V. Denoting the dimensions of 7/ P} and 
T{ Q} by b and d, respectively, we have a + 

b - c +d, where both sides are equal to the 
dimension of T. This property is called the 
principle of counting constants. 

This simple principle has wide application. 
For instance, let V be an r-dimensional variety 
in P"(K), and let £,;u,X;—0 (0€ i€ r) be the 
equations of r 4- 1 hyperplanes H;. The con- 
dition VN (Hy N N H,) # Ø defines an irreduc- 
ible algebraic correspondence T between V 
and the multiprojective space W = F'"(k) x x 
P"(k) with uj as coordinates: T = {(x,u)|xeV, 
> UjX; — 0}. In this case, we have a =r, b = 
(n D +1), and d =0 in the notation intro- 
duced previously, so that ¢=n(r 4-1)— 1. This 
implies that W' is of codimension 1 in W; 
hence W’ is defined by a single equation F(uj; 
= 0. This form F is the associated form of V of 
B. L. van der Waerden and W. L. Chow. It is 
a homogeneous form of degree d (d = degt VJ) 
in each (ug, . Mal and is symmetric in the 
indices į, More generally, for a positive cycle 
X= n, V, of dimension r and of degree d 
(=È n, degt V,)) in P'(k), the product JI F7* of 
the associated forms F, of V, is called the as- 
sociated form of X. The coefficients of F, 
arranged in a fixed order and regarded as the 
homogeneous coordinates of a point of a pro- 
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jective space, are called the Chow coordinates 
of X. This is a natural generalization of the 
Plucker coordinates. Given r, d and a projec- 
tive variety U( c P"(K)), the set of Chow coor- 
dinates of the positive cycles that are con- 
tained in U, whose dimension is r and whose 
degree is d, is a projective variety called a 
Chow variety. 

In scheme theory, the Hilbert scheme is 
introduced in the following way [60]. Let P7 
be the n-dimensional projective space over 
Spec Z. For a locally Noetherian scheme S, 
and for a closed subscheme Z of P7 x S, we 
have the composition of the immersion of Z 
and the projection P7 x $ to S; this is denoted 
by f: Z 9 S. For any point s of S, fz !(s) is 
a closed subscheme of Pj, and the restric- 
tion of O(1) to fz !(s) is denoted by L,. Put 
MS) = {Z c P} x S} | fz is flat}. Then M be- 
cornes a contravariant functor to the cate- 
gory of sets, which is representable, Le. there 
exists a locally Noetherian scheme M such 
that M is naturally isomorphic to the functor 
Hom( —, M). M is written as Hilb(P7) and 
is called the Hilhert scheme. There exists a 
closed subscheme W of P7 x Hilb(P7) with flat 
fw such that for any Ze M(S), there exists a 
unique 9: $— M = Hilb(P7) in such a way that 
Z = W x 4$. In particular, if X is a closed sub- 
scheme of P over a field k, there exist SE 
Hilb(P7) and a field extension k/k(s) such that 
X = fy (s) & vk, Thus Hilb(P7) parametrizes 
all closed subschemes of DI Fix a polynomial 
P and define .Z/" (S) to be (Ze M(S)| x(L8") 
= P(m) for all s). Æ? is also a contravariant 
functor, represented by a scheme Hilb"(P7) 
which is projective over Spec Z. The direct 
sum of all HURT (P7) is just Hilb(P¥). 


T. Algehraic Geometry and Complex Analytic 
Geometry 


When k = C, an algebraic variety is called a 
complex algehraic variety, and it has the struc- 
ture of a fcomplex analytic manifold or (if it 
has singular points) of an analytic space. If we 
denote by Ob „ the ring of holomorphic func- 
tions at a point x of V, then Oy . C O,,, and 
their tcompletions coincide. If x is a simple 
point of V, then Oy is the tring of convergent 
power series, and its completion is the tring of 
formal power series, The prime ideals of Oy . 
remain prime under completion (M. Nagata). 
Because of this, the analytic behavior of Vin a 
neighborhood of x can be investigated alge- 
braically through the completion of Op x. 

If Vis complete, then the analytic coher- 

ent sheaves on V and the algebraic coherent 
sheaves on V correspond to each other bijec- 
tively; consequently, for propositions that can 
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be stated in terms of coherent sheaves, the 
results in the analytic sense remain valid in the 
algebraic sense also, and vice versa (J.-P. Serre 
[82]) (- 72 Complex Manifolds E). 


U. Topology of Algebraic Varieties 


Every algebraic variety defined over R (or C) 
can be triangulated by real analytic cells [46]. 
Let V be a nonsingular connected algebraic 
variety defined over C. For an algebraic auto- 
morphism g of C we can define V^ by letting 
g operate on the coefficients of the defming 
equations of open affine coverings of V. V and 
V* are not necessarily homeomorphic or even 
of the same homotopy type [83]. Grothen- 
deck has shown that there is a !spectral se- 
quence Ef? = H*( V, OQ) H***( V, C), called a 
Hodge spectral sequence. This spectral ge- 
quence can be defined in a purely algebraic 
way and H?**( V, C) can be considered as the 
thypercohomology of the de Rham complex 

{ T(V, Q5), d] of V. If Vis projective, V carries a 
Kahler metric and by the theory of tharmonic 
integrals the Hodge spectral sequence degener- 
ates and Ei? is C-isomorphic to the complex 
conjugate of ET? (- 232 Kahler Manifolds B). 
This is also the case if V is complete. 

The topology of a nonsingular projective 
surface was studied by Lefschetz using the 
method of Lefschetz pencils. For a projective 
nonsingular irreducible variety V of dimension 
ndelined over C, a Lefschetz pencil | W,},<p: of 
V is, by definition, a linear pencil consisting of 
hyperplane sections W, of V such that: (1) for 
all t eU = Pc — (t4, tz, ta}, W, is nonsingular; 
(ii) each W, has only one singular point that is 
an ordinary double point; and (iii) W N W,, 
is nonsingular, where we assume 0, 00 EU. 
Embed V into P" by a high multiple of a 
hyperplane of V and take a generic linear 
pencil {H,} of hyperplanes in P", Then { W, = 
H, N V] is a Lefschetz pencil of V. By blowing 
up V along W, N W, we obtain a smooth 
variety V and a surjective morphism m: D Pi 
Let W=7 !(0), x —nx|n ! (U). R?n,.Q is a 
local system attached to the monodromy 
representation 9,:1,(U,0) GL(H"(W,Q)). o, 
is trivial if p # n = 1. For each point t; there 
corresponds a cocycle 6, of H"! ( W, Q) called a 
vanishing cocycle such that if y; is a loop based 
at 0 going once (counterclockwise) around t; 
we have, for each xe H" !(W, Q), 9,(y;)(x)= 
X t (x, 0;>0;, where ( ) is the intersection 
pairing of H"^' (W, Q). o, (y) is called a Picard- 
Lefschetz transformation. The main results due 
to Lefschetz are restated as follows. (1) (Weak 
Lefschetz theorem). The natural homomor- 
phism H'( V, Q)—> H'( W, Q) is an isomor- 
phism for 0 €i n — 2 and is an injection for 
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izn-— 1, or equivalently H(V,W, Q) = O0 for 

0 <i <n = 1. (2) (Strong Lefschetz theorem). Let 
č be the cohomology class of H?( V, Q) corre- 
sponding to the hyperplane section W, and let 
L:H*(V,Q)5 H**?(V, Q) be the homomor- 
phism detined by the cup product with ¢. Then 
foreachi«n,L" ! : H(V,Q)5 H?"( V, Q) isan 
isomorphism. The weak Lefschetz theorem is 
true for a cohomology with integral coeffi- 
cients. In fact, V-W has the homotopy type 
of a real n-dimensional finite CW complex, 

and ët V, W) =0fo y < n. The strong Lefschetz 
theorem is equivalent to the statement that 
H"^( W, Q) is the direct sum of the vector 
space spanned by the vanishing cocycles 6, 
and the vector space spanned by the invar- 
iant cocyles Oe. 9, 4,(y)x2x,i- 1, ,d). 
Lefschetz's original proof of this statement is 
incomplete, and no direct topological proof is 
known. The transcendental proof of (2) is 
given by the theory of harmonic integrals. A 
version of Lefschetz pencils is a proper mor- 
phism f: X 2D (z||z| «c] of a complex mani- 
fold X onto a disk D such that f* =f |f !(D*), 
D* =D = {0}, is of maximal rank at every point 
of f ^! (D*). Fix a pointse D*. 2,(D*,s) oper- 
ates on HI W, D, W zf ^ (s), and we have a 
representation Q;: zt, (D*, s) GL(H(W, Z)). 
For a loop y based at $ and going once around 
0 the Picard-Lefschetz transformation (y) is 
quasiunipotent (i.e., for a certain integer m, 
9,(y)" is unipotent). 

For a nonsingular projective variety detined 
over afield Æ with characteristic p> 0, the 
above Lefschetz theorems hold for an l-adic 
cohomology (Iz p) [19, 30 (SGA 7)] € Sec- 
tion AA). Using the theory of finite étale cover- 
ings of an algebraic variety detined over a field 
k, we can define the algebraic fundamental 
group and the algebraic homotopy groups, 
which are profinite completions of the topo- 
logical fundamental group and the topological 
homotopy groups, respectively, where k = C 
[5, 30 (SGA 1)]. Let (X,x) be a germ of a 
complex space with isolated singular point x. 
(X,x) is always algebraizable, Le. the com- 
pletion Oy y of the analytic local ring € , is 
isomorphic to the completion of the local ring 
of a closed point of an algebraic variety de- 
fined over C [3]. For topological type of al- 
gebraic surfaces — Moishezon [57]. 


V. Hodge Theory 


Let Hp be a finite-dimensional real vector 
space containing a lattice H}, and let H = 
Hg Get be its complexification. A Hodge 
structure of weight m on H (or Hg) is, by de- 
finition, a direct sum decomposion H = 
ptam H^, HIS Hp, where Hr? isa 
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complex vector subspace and the overbar de- 
notes complex conjugation. If H and H' carry 
Hodges structures of weight m and m’, respec- 
tively, then H (9 HI, Hom,(H, H'), APH, and 
H* carry Hodge structures of weight m + mw, 
m' —m, pm, and —m, respectively. For a Hodge 
structure H ofweight m, F'H— (p,,,H*" *, 
p=0,1,., m, induces a decreasing filtration. 
Let H be a Hodge structure of weight m, and 
let Q be a bilinear form on H. If the follow- 
ing three conditions are satisfied, the Hodge 
structure H is said to be polarized by Q. (1) Q 
is detined over Q and is symmetric (skew- 
symmetric) if m is even (odd). (ii) Q (HP, H? ^") 
= 0 unless p = p',q = d. (ii) (/ — 1) *Q(v, 5) > 
0 for nonzero wœ HP? Let V be a compact 
Kahler manifold. Then H = H"( KO carries 
the Hodge structure induced by the type (p, q)- 
decomposition (— 232 Kahler manifolds B). 
This is also the case if V is a compact complex 
manifold which is the image of a holomorphic 
mapping from a compact Kahler manifold of 
the same dimension. Moreover, if Vis projec- 
tive, the Hodge-Riemann bilinear relations 
define a natural polarization on the subspace 
P of H consisting of all primitive cohomology 
classes. 

Each algebraic cycle W of codimension 
s determines a cohomology class [W] e 
H?*( V, Q), which belongs to H**( V). (Such 
a class is called an algebraic cycle.) The con- 
verse of this fact is called the Hodge conjecture, 
which says that Hin V, Q) H*'*(V) is spanned 
by algebraic cycles. The case s = 1 has been 
verified by Hodge., Lefschetz, and Kodaira. 

Let V(W) bea smooth irreducible algebraic 
variety detined Over C (complex manifold), and 
let o Ha W bea projective smooth morphism 
with connected tibers. Then Hz Rio, C is a 
flat vector bundle on W with the flat connec- 
tion V. V is often called the Gauss-Manin 
connection, and it can be detined algebraically 
if W is also algebraic. 

For each fiber V, ^ qo ' (s), se W, the filtra- 
tion F'H"(V. C= Q,,, H*"*( V) induces a 
complex subbundle F? and the connection V 
has the property V(C(F^)) c O(F’ ! & T*), 
where T is the tangent bundle of W [26]. 
Moreover, if W is algebraic, V is a differential 
equation with regular singular points on W 
where W 1s a smooth compactification of W, 
such that W — W isa divisor with normal 
crossings (— Section L). If we consider the 
subbundle P of H consisting of all primitive 
cohomology classes, the polarization on each 
fiber induces a Hermitian pseudometric on 
P. Curvatures of bundles P N F” have been 
studied by Griffiths [26]. There exists a classi- 
fying space D for polarized Hodge structures 
and there exists a holomorphic mapping of the 
universal covering W of W into D, usually 


called a period mapping. D may not be a 
bounded symmetric domain but has several 
interesting properties [24, 27]. In some cases 
D/T with a suitable discrete subgroup T is the 
moduli space of polarized algebraic varieties 
(e.g., curves, Abelian varieties). 

P. Deligne [ 1 8] has generalized Hodge 
theory to arbitrary algebraic varieties (more 
generally schemes of finite type over C). The 
simplest case is the Hodge theory of a smooth 
noncomplete irreducible variety X. By Na- 
gata's embedding theorem [63] there exists a 
complete algebraic variety X such that Y = 
X-X is a subvariety. By virtue of Hironaka's 
resolution theorem we can assume that X 
is nonsingular and that Y is a divisor with 
normal crossings. Let Q}(log Y) be a sheaf 
of germs of meromorphic 1-forms with loga- 
rithmic pole along Y, i.e. locally written as 
Èi- a(x)(dxi/x)) + lu a(x)dx;, where 
(x,,. x,) is a system of local coordinates with 
center pe Y in X such that x x, = 0 is a local 
equation of Y and aj(x), a,x) are holomor- 
phic at p. Using the complex (Q&(log Y)= 
A?Qi(log Y), d}, with a suitable filtration, 
Deligne has shown that H = H"(X, C) carries 
a mixed Hodge structure and this structure is 
independent of the choice of X. The mixed 
Hodge structure on H consists of two finite 
filtrations, Le, Oc.. CW, , CW, C. CH, the 
weight filtration which is detined over Q, and 0 
c. cF?c F"'!c. cH, the Hodge filtration 
such that F” induces on W,/W,., a Hodge 
structure of weight n. As a corollary he has 
shown that a meromorphic p-form on X with 
logarithmic pole along Y (i.e., a section of 
OQ? (log Y) is d-closed on X, and o = 0 if and 
only if œ, is zero in H^(X, C). An important 
application of the theory of this mixed Hodge 
structure on H"(X, C) is the following. Let 
V and W be smooth irreducible varieties, 
and let 9: V W be a smooth projective mor- 
phism. If V is a smooth compactification of 
V, the canonical homomorphism H"(V, Q) 
HI W, R™o,,Q) is surjective. Fix a point sE 
W. 7, (W, s) operates on H"( V,,Q). Then this 
action is semisimple [ 18]. 


W. Deformations, Moduli, Algebraic Spaces 


In this section for simplicity the field k is as- 
sumed to be algebraically closed. Let X be an 
algebraic scheme over k A (flat) deformation of 
X over a connected scheme S oyer kwith base 
point sp consists of the following data: (1) A 
morphism p:X—S that is flat and of finite 
type. If X is complete, pis also proper. (2) A 
closed point sy € S such that the fiber X x 5 k(s;) 
is isomorphic to X. For any closed point es, 
the liber X,=X x, K9) is called a flat deforma- 
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tion of X. If X is smooth and complete, we 
assume further that p is smooth. Similarly, we 
can define a deformation of a polarized alge- 
braic manifold, an embedding deformation 
of X in an algebraic scheme Y over k a de- 
formation of an affine scheme with isolated 
singular points, and a deformation of vector 
bundles on a fixed algebraic scheme over k 
etc. The theory has two aspects: local theory 
and global theory. 

Let (R, m) be a complete Noetherian local 
ring such that R/m = k Set R, = R/m". A for- 
mal deformation X, of X is a sequence {X,} 
such that (i) X, is a deformation of X over 
Spec(R,) and (ii) there is a compatible se- 
quence of isomorphisms X, ® gr, R,., 5 X,., 
for any n. Let (FLA/k) be the category of 
finite-dimensional commutative local k- 
algebras. The local theory of deformation is 
the study of the covariant functor F of (FLA /k) 
to (Set), where, for Ae(FLA/K), F(A) is the set 
of isomorphism classes of deformations of X 
over Spec(A). The functor is in general neither 
representable nor prorepresentable (i.e., there 
exists a formal deformation X, of X such that 
F(A) = Hom, el R, A)). But, under reasonably 
mild conditions on F, F has the hull R [78]. 
That is, there is a formal deformation Xs of X 
and a natural transformation j: G-F, where 
G(A) = Hom, 4,(R, A), such that j is for- 
mally smooth (Le. for any surjection A’+A in 
(FLA/k), G(A') G(A) x sa F(A’) is surjective) 
and G(k[c]) F (k[c]) is bijective for the ring 
of dual numbers k[¢]. The formal deformation 
Xp is called a versal deformation of X. The 
hull R is unique up to noncanonical isomor- 
phism. The deformation functor F has the hull 
R if X is (1) a complete algebraic scheme over 
k, (11) an affine scheme with isolated singular 
points, (iii) a polarized algebraic variety over 
k or (iv) a vector bundle on a complete alge- 
braic scheme over k etc. If there exists a de- 
formation o: X 2 S of X over a scheme S 
with base point sọ over Ksuch that R = Ó, v 
X, x3 @R,, the formal deformation (7, : 

X, Spec(R,)j is called algebraizable. Alge- 
braizability of the versa] deformation has been 
studied by M. Artin. Since the assumption that 
S is a scheme is rather restrictive, Artin has 
introduced the notion of algebraic spaces and 
has considered algebraizability in the category 
of algebraic spaces [2-4]. For a complete 
algebraic variety, the vereal deformation is not 
necessarily algebraizable and we need to con- 
sider deformations of polarized algebraic 
varieties. The versal deformation of an affine 
variety with an isolated singularity is alge- 
braizable in the category of algebraic spaces. 
For the global theory of deformations, we 
need the projectivity assumption, and the 
theory is essentially reduced to the theory of 
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Hilbert schemes (or Chow varieties) (— Sec- 
tion S). The problem of moduli is considered 
as the study of the set A4 of all isomorphism 
classes of deformations of X. Usually we con- 
sider the moduli of polarized varieties. 

Let (Sch) be the category of Noetherian 
schemes and .# a contravariant functor from 
(Sch) to (Set) delined by M(X) = (isomor- 
phism classes of families of polarized varieties 
parametrized by X eOb(Sch), eg, families 
of polarized Abelian varieties with(out) addi- 
tional conditions]. The functor .@ is called 
a moduli functor. If jt is represented by a 
scheme M, M is called a fine moduli scheme 
[60]. In this case there is the universal family 
m:¥ —+M.In many cases the moduli functor 
is not representable. A coarse moduli scheme 
for a given moduli problem is a scheme A4 
together with a natural transformation d: 

M 2 Hom( —, M)such that (1) ¢(Spec(k)): 

M (Spec(k)) 2 Hom(Spec(k), M) is bijective for 
any algebrically closed field k, and (2) for 

any scheme N and any natural transformation 
y : 5 Hom( —, N), there is a unique natural 
transformation 4: Hom( —, M)5 Hom( —, N) 
with w= 4: 9. A coarse moduli scheme is called 
a moduli space or a moduli scheme. 

In many cases, the moduli space can be 
obtained as the quotient space of a certain 
(locally closed) subset H of a Hilbert scheme 
by the following equivalence relation: s-s’ EH 
if and only if X,^ X, as polarized varieties, 
where z: X2 H, x !(s) 2 X,(T. Matsusaka 
[53]). This equivalence relation is often in- 
duced by an action of a 'reductive algebraic 
group G. Suppose that a reductive algebraic 
group G operates on an algebraic k-scheme Z. 
A G-invariant morphism f: Z Y (i.e., for the 
trivial action of G on Y, f is a G-equivariant 
morphism) is called a geometric quotient if (1) 
f is a surjective affine morphism and LIA 
= 0y, (2) if X is a G-stable closed subset of Z, 
then f(X) is closed in Y, and (3) for x ,, x5 € Z, 
f(x,) 2 f(x;) if and only if the G-orbits of x, 
and x,are the same. Let G be a reductive 
algebraic group, y: G— Aut( V) a rational rep- 
resentation on a finite-dimensional vector 
space V over k, and vg Z a G-invariant point. 
Then there exists a G-invariant homogeneous 
polynomial F of degree 7 | on V such that 
F(v 9) 0 (W. J. Haboush [32]). This implies 
that if a reductive group G operates on an 
algebraic k-affine scheme Spec(A), then the 
invariant ring 4$ is a finitely generated k- 
algebra and, moreover, if any G-orbit in 
Spec(A) is closed, the natural morphism 
Spec(A)—Spec(A*) is a geometric quotient. 
For a quasiprojective scheme Z over k with 
an action of a reductive group G we need a 
notion of stable points [60,62]. The subset 
of stable points of Z consists of all geometric 
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points of a G-stable open subscheme Z? of Z, 
and there exists a geometric quotient f; Z' Y 
where Y is quasiprojective (Mumford [60], C. 
S. Seshadri [79], Haboush [32]). In this way 
Mumford has shown the existence of coarse 
moduli schemes of nonsingular complete 
irreducible algebraic curves and polarized 
Abelian varieties. But, in general, analysis of 
stable points is very difficult and it is desirable 
to extend the category of schemes so that it 
becomes easier to obtain a quotient. Matsu- 
saka has introduced the notion of a Q-variety 
[52]. M. Artin has introduced the notion of an 
algebraic space (D. Knutson [49]). 

An algebraic space X of finite type consists 
of an affine scheme U and a closed subscheme 
R CU xU such that (1) R is an tequivalence 
relation, and (2) the projections p;: R+U (i = 
1,2) are étale. (These are often written as R3 
U 5 X.) A morphism g: V X of an affine 
scheme V to an algebraic space X consists of 
a closed subscheme Wc U x V such that (1) 
the projection W> V is étale and surjective, 
and (2) the two closed subschemes R x y W, 
Wx, W of Ux UxV are equal. Let SAVY 
be an algebraic space. Then Hom( Y, X) is de- 
fined as the kernel of Hom(V, X)3Hom(S, X). 
If Y is an affine scheme, this definition of 
Hom( Y, X) is equivalent to the previous defi- 
nition by virtue of the étaie descent. Thus 
algebraic spaces form a category which con- 
tains the category of schemes. We can define 
the structure sheaf of an algebraic space and 
construct a cohomology theory. Many impor- 
tant notions and theorems for schemes can 
be generalized to those for algebraic spaces. 
Every algebraic space has a dense open subset 
that is an affine scheme. A group algebraic 
space is a group scheme (J. P. Murre). Suppose 
k = C. If an algebraic group G operates on an 
algebraic k-scheme properly with a finite stabi- 
lizer group, the quotient space exists as an 
algebraic space. In this way H. Popp has 
shown the existence of moduli spaces of alge- 
braic surfaces of general type as algebraic 
spaces [70,71] (- 15 Algebraic Surfaces; also 
[54]). Moreover, every separated algebraic 
space X of finite type over C carries a natural 
structure X" of an analytic space. If there 
exists a proper modification morphism f: X^" 
— Y of a separated algebraic space X onto an 
analytic space Y, then Y carries a structure of 
an algebraic space and f becomes a morphism 
of algebraic spaces (Artin [2]). For any alge- 
braically closed field k, Artin has introduced 
the notion of formal algebraic space and 
formal contraction and has obtained results 
similar to those for algebraic spaces [49]. 

An irreducible compact complex space X 
whose algebraic dimension (— 72 Complex 
Manifolds F) is equal to dim X is called a 
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Moishezon space. As a corollary of the above 
theorem, any Moishezon space X carries a 
structure M of a compact algebraic space such 
that X MI, 


X. Formal Schemes 


Let A be a ring which we assume to be Noe- 
therian, for simplicity, and Tan ideal of A. 
Taking TDL, a as a fundamental system of 
neighborhoods of 0, we can introduce a struc- 
ture of a topological ring into A called J-adic 
topology. The fcompletion of A with I-adic to- 
pology is isomorphic to the projective limit 4 
— Hm, o A/I" (here A/I" are regarded as dis- 
crete topological rings) and called the comple- 
tion of A along /. If A is Noetherian, then A is 
again Noetherian. There is a canonical con- 
tinuous homomorphism i: AA whose kernel 
comprises the zero divisors a with a = 1cI 
(intersection theorem of Krull; — 284 Noeth- 
erian Rings B). If j is an isomorphism, we Say 
A is complete with respect to I. The topology 
of A is the f-adic topology where [= i(I) À and 
A is complete with respect to f, Take a Noeth- 
erian ring A complete with respect to I which 
we consider as an [-adic topological ring by 
identifying its completion along / with A. 

On X « V(I)c Spec(A) we can define a sheaf 
of topological rings (4 by T'(Z(f), 0,) = 
lim,s9 A,/I"A, for Z(f)- DUT TE with 

[€ A. We call (X, Col the format spectrum of A 
and write Spf(A). I is called a defining ideal of 
Spf(A). A (locally Noetherian) formal scheme is 
by definition a topological local ringed space 
which is locally isomorphic to a formal spec- 
trum (of a Noetherian ring). If we define mor- 
phisms between two formal schemes by those 
in the category of topological local ringed 
spaces, the formal schemes form a category. 

For two formal spectra Spf(A) and Spf(B) 
with deiining ideals 7 and J, respectively, the 
direct product Spf(A) x Spf(B) in the category 
of formal schemes is the formal spectrum of 
the completion of A & B along the image of 
1®B+A® J. Similarly, we can construct a 
fiber product of formal schemes. A formal 
scheme X is called separated if the image of the 
diagonal morphism Ay: X2 X x X is closed (^ 
Section D). 

For a Noetherian ring A with an ideal J, the 
formal spectrum Spf(A) (with a defining ideal 
Í) is called the completion of Spec(A) along 
P'(D. Similarly for a Noetherian scheme X and 
a closed subscheme X" we can define the com- 
pletion X D of X along X’. Every completion of 
a separated scheme is separated. For a coher- 
ent sheaf F on X one can define its completion 
Fix along X’, which is again coherent (under 
the assumption that X is locally Noetherian). 
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Thus we can develop a theory of formal 
schemes in a way similar to that of schemes, 
which we call “formal geometry" (for the more 
general definitions and further discussions see 
[2; 29, I, III; 303). Roughly speaking, a function 
on Xy. is a formal Taylor series with respect 
to the direction normal to X" whose coeffi- 
cients are regular functions on X". The method 
of formal completion enables us to introduce 
"analytic" or "intinitesimal" methods in alge- 
braic geometry. Among many important 
theorems, we state here the following two 
theorems. (1) The fundamental theorem of 
proper mapping: Let f: X— Y be a proper mor- 
phism of locally Noetherian schemes, Y' a 
closed subscheme of Y, and X’ = X x y Y’ the 
inverse image of Y'. Denote the respective 
completions of X and Y along X' and Y’ by 
Á and Y, respectively. We have the induced 
proper morphism of formal schemes f » e 
Y. Then we have canonical isomorphisms, 
(R'f CF) y. = Rf y.) n > 0, for every coher- 
ent, &,-Module F on X. This theorem can 
be applied to prove  Zariski's connectedness 
theorem: for a proper morphism f: X > Y of 
locally Noetherian schemes with f,(€,) = Oy, 
every fiber f^! (y) off is connected and non- 
empty for ye Y. (2) We use the same nota- 
tion as in (1) and assume, moreover, that Y — 
Spec(A) for a Noetherian ring A, complete 
with respect to an ideal J, and Y’ = V(I). Then 
the correspondence FF gives an equiva- 
lence between the category of coherent y- 
Modules with proper support over Y and the 
category of coherent (-Modules with proper 
support over f. This theorem plays an impor- 
tant role in the theory of *deformations of 
algebraic varieties. 


Y. Algebraic Vector Bundles 


In this section a vector bundle is a locally free 
sheaf of finite constant rank (- Section E). A 
quotient sheaf of a vector bundle is called a 
quotient bundle if it is a vector bundle. A sub- 
sheaf F of a vector bundle E is a subbundle 
when both F and E/F are vector bundles. A 
vector bundle is said to be indecomposable 
unless it is a direct sum of proper subbundles. 
Every vector bundle E on P! is a direct sum of 
line bundles, that is, E ~ @ @(a;) (Grothen- 
dieck). This property characterizes P! in the 
category of nonsingular projective varieties. In 
fact, if X is a nonsingular projective variety 
with dim X > 0 and X x P! and if r is an in- 
teger with r > dim X, there are stable (see 
below), a fortiori, indecomposable vector 
bundles on X of rank r (J. Simonis and M. 
Maruyama). Vector bundles are closely related 
to subschemes of the base variety. Let E be a 
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vector bundle of rank 2 on a nonsingular 
quasiprojective variety X and (( 1) an tample 
invertible sheaf on X. For n »() and general 
se H*(X, E(n)), E(n)/sO,= L & I with L a line 
bundle and [ an ideal in OC, which defines a 
smooth subscheme Y- (s), of codimension 2. 
When X is, for example, P" (n > 3) or affine, 
the converse holds. Let Y be a subscheme of X 
purely of codimension 2 and locally of com- 
plete intersection. If wy A4 &® Oy for an inver- 
tible sheaf M on X, there is a vector bundle 

E of rank 2 and se H*(X, E) with (s)y = Y. 
Moreover, E is decomposable if and only if Y 
is globally a complete intersection. An Abelian 
surface can be embedded in P^(C), and hence 
we have an indecomposable vector bundle of 
rank 2 on P^(C) [40]. For n > 5, we have no 
examples of indecomposable vector bundles of 
rank 2 on P”(C) [36]. Every vector bundle on 
A” =k" is trivial (D. Quillen [72], A. Suslin). 
From this and the fact stated above several 
results can be deduced; every nonsingular 
curve in A? is set-theoretically a complete 
intersection (L. Szpiro). 

For a vector bundle E on a complete 
scheme, the following are equivalent: (1) for 
every coherent €,-module F, F © S'(E) is 
generated by its global sections for n>0, (ii) 
for every coherent sheaf F on X, H'(X, F @ 
S"(B)) = 0 for alli» 0 and n > 0, (iii) the ttauto- 
logical line bundle (/( 1) on P(E) is ample. A 
vector bundle having these properties is said 
to be ample. This is a generalization of the 
notion of ampleness of invertible sheaves 
(- Section E). The set of ample vector bundles 
is closed under several operations [34]. No 
good criterion such as Nakai-Moishezon's 
for the ampleness of invertible sheaves (— 
Section E) is yet known. The tangent bundle 
of a nonsingular complete variety X is ample 
if and only if X =P” (— Section R; S. Mori 
[59]). 

Let Y be a nonsingular projective variety 
over an algebraically closed field and Oy( 1) an 
ample invertible sheaf on Y. A coherent sheaf 
E on Y is said to be stable (or, semistable) with 
respect to (),( 1) if E is torsion-free and if for 
every coherent subsheaf F of E with 0 4 F # E, 
X(F(m))/r(F)< (or <)x(E(m))/r(E) for all m0, 
where r(*) denotes the rank. If E is stable (or 
semistable) and locally free, it is called a stable 
(resp. semistable) vector bundle. Let f: X S be 
a 'smooth, projective, geometrically integral 
morphism and (/,( 1) an f-ample invertible 
sheaf on X. For a numerical polynomial H 
and an S-scheme T, set L¥is(T)={E E isa 
coherent sheaf on X, with the properties (a) 
and (b)}/ ~; (a) E is T-flat, (b) for every geo- 
metric fiber X, E, = E x, is stable with re- 
spect to Gel 1) x and y(E,(m)) = H (m), and if 
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E, = E, ® f7#(L) with L an invertible sheaf on 
T, E, ~E}. XX, is a tcontravariant functor of 
the category of flocally Noetherian S-schemes 
to the category of sets. If § is of finite type 
over a 'universally Japanese ring A, 3X, has 
a coarse moduli scheme My)s(H) (— Section 
W) and it is tlocally of finite type over S (D. 
Mumford, Seshadri, D. Gieseker, Maruyama 
[23, 51 ]). The set of the classes of semistable 
sheaves under a suitable equivalence relation 
(S-equivalence) on geometric libers of X over § 
also has a coarse moduli scheme My AH) and 
M ,,(H) is its open subscheme. Moreover, it is 
known that M,,,(H) is projective over $ in 
some cases, for example, when A is a field of 
characteristic zero. When dim X/S= 1, Xx P? 
or P^, the structures of M,,,(H) have been 
extensively studied [ 11, 69]. Theories of vector 
bundles are used in theoretical physics [9]. 


Z. Torus Embeddings 


Over an algebraically closed field K, a torus 
embedding, or a toric variety, is a normal 
scheme X locally of finite type over k on which 
an algebraic torus T acts with a dense open 
orbit isomorphic to T. Such X's, as well as 
many of their algebrogeometric properties, can 
be described very simply in terms of cones in 
real affine spaces, as Demazure [21] first saw 
in the nonsingular case in connection with 
algebraic subgroups of the Cremona trans- 
formation group, and then as D. Mumford et 
al. [47] as well as K. Miyake and T. Oda [67] 
saw in the general case immediately after H. 
Sumihiro [SS] proved a basic theorem on 
linear algebraic group actions. 

The group N, written additively, of one- 
parameter subgroups of an r-dimensional 
algebraic torus T is a free Abelian group of 
rank f. A convex rational polyhedral cone g in 
Nr = R(9; N with 0 as the vertex is the set of 
nonnegative linear combinations of a finite 
number of elements of N such that gf (—o) = 
{O}. A subset 1 of g is called a face, and is 
denoted by 1 < o, if there exists a linear func- 
tional m on Ng having nonnegative values on 
c and t={yeo|m(y)=0}. A fan A in Np isa 
collection of such o's satisfying the conditions 
(i) A30, 0 » 15 A31and (ii) A35, 0° >0> 
alo <a’. 

Each o &€A gives rise to an affine torus em- 
bedding U, as follows: Let M be the Z-module 
dual to N; hence A4 is the group of tcharacters 
of T. For c eA, the set MNa={meM|m(y)>0 
for all yeo} is seen to be a linitely generated 
additive subsemigroup of M containing 0 
and generating M as a group. U, is then the 
spectrum of the semigroup algebra k[M N 6]. 
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Thanks to the condition (ii) above, the Us 
can be naturally pasted together to produce 
a torus embedding X = S aU 

Every torus embedding is obtained in this 
way. Equivariant dominant morphisms be- 
tween torus embeddings can be described in 
terms of Z-linear maps between N's which 
send a fan to another. 

Many algebrogeometric properties of X can 
be described in terms of A, e.g., X is nonsin- 
gular if and only if every g € A can be spanned 
by a part of a Z-basis of N. X is complete 
(proper over X) if and only if the union of g'sin 
A coincides with N,. X has at worst cyclic 
quotient singularities if and only if every oe A 
is simplicial, i.e., if it is spanned by R-linearly 
independent elements of N. An equivariant 
resolution of singularities of X exists and is 
obtained by a suitable subdivision of A (Mum- 
ford et al. [47]). X has only rational singular- 
ities, hence is Cohen-Macaulay (M. Hochster 
[38] and Mumford et al. [47]). The set of T- 
orbits in X is in one-to-one correspondence 
with ech A reduced T-invariant subscheme Y 
of X is the union of T-orbits, and hence corre- 
sponds to a subset X of A. M.-N. Ishida [44] 
determined when Y is Cohen-Macaulay or 
Gorenstein in terms of the combinatorics of 2. 

A T-invariant Cartier divisor D on X cor- 
responds to a support function h, which is a 
continuous R-valued function on the union 
(Le which is (i) positively homogeneous, 
ie., h(Ay)=Ah(y) for 4» 0 and ye | J,., o, (ii) 
Z-valued on N f ( y sea 0) and (iii) linear 
when restricted to each ce A. Various prop- 
erties of the invertible sheaf (,(D) (— Section 
E) can be described in terms of h, For in- 
stance, when X is complete, (,(D) is tample if 
and only if h is upper convex, i.e., h(y) +h(y’) € 
h(y +y), and moreover, A is the coarest fan 
with the property (iii) above. The cohomology 
of O,(D) can be calculated by means of h 
(Demazure [21] and V. Danilov [ 17]). 

A support function h, on the other hand, 
gives rise to a convex polyhedron in Mp with 
vertices in M. In this way, certain aspects of 
the geometry of convex sets can be thought of 
as a part of the theory of projective varieties 
(R. Stanley [87], B. Teissier [89]). 

Mumford et al. [47] introduced a more gen- 
eral concept of toroidal embedding: A normal 
algebraic variety Y and a nonsingular Zariski 
open subset U such that Y >U is formally 
isomorphic at each point to a torus embedding 
X 2 T. This concept has been used very effec- 
tively to prove important theorems: (1) Sys- 
tematic nice compactitications of arithmetic 
quotients of bounded symmetric domains (D. 
Mumford et al. [7], 1. Satake [77]). Y. Nami- 
kawa [65] worked out the details in the case 
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of the Siegel upper planes, improving their 
earlier Satake compactifications studied by I. 
Satake, W. Baily, J.-I. Igusa, and A. Borel [12]. 
(2) Semistable reduction theorem (Mumford et 
al. [47]). Let f: V —€C be a flat morphism from 
a nonsingular variety V to a nonsingular 
curve C in characteristic 0. After a finite base 
extension C’— C and a modification V'— 
Vx-C', we cangetf ': V'— C' whose singular 
fibers are reduced with only nonsingular com- 
ponents crossing normally with each other 

(- Section L). Without the reducedness 
requirement, the existence is derived from a 
result of Hironaka [37]. 


AA. Etale Topology 


Let ¥ be a category. We say that a Gro- 
thendieck topology on Y is given if, for each 
SE Ob(S), families of morphisms (covering 
families of S) are given and satisfy the follow- 
ing conditions: (1) If o: T>S is an isomor- 
phism, io To S} is a covering family, (2) if 
Lo, R; > S], is a covering family, for any 
morphism o: S'S, the fiber product R; = 
Rix, S exists and the induced family { Q; : 
R;—>S'} is a covering family of S’, (3) if (6: 
R;>S};er is a covering family and if for each 
ie I, Ia, Si, a> Ri}aca; is a covering family, 
then (9?s,a:S; 7S} icraca, 1$ a covering 
family. In general, it is more convenient to 
use a notion of sieves to define a Grothen- 
dieck topology [30 (SGA 4)].) A category with 
Grothendieck topology is called a site. Let X 
be a scheme and Et/X the category whose 
objects consists of schemes étale over X. If we 
choose a family of morphisms (9;: Y,» Y hier 
with (Jier éi Y) = Y, as a covering family of 

Y € Ob(Et/X), this defines the étale topology 
on X and the étale site X,,. Similarly, one can 
define the Zariski site X,,, (resp. the flat site 
Xj), using open immersions (resp. flat mor- 
phisms locally of finite type) A presheaf on a 
site Y is a contravariant functor from Y to 
(Set). A presheaf F is a sheaf if, for any cover- 
ing family (fj: R;>R}ien F(R) Iie F(R)3 
IL; jer (Ri x pR;) is exact. If Y has a final ob- 
ject X, the functor F  F(X) is left exact on 
the category of Abelian sheaves on 4 and 
the cohomology groups H: (X, F) are de- 
fined as the right derived functors. Using 
covering families, one can define the Cech 
cohomology H +(X, F) as usual. In the follow- 
ing, a sheaf means an Abelian sheaf. For a 
*geometric point j: x X of a scheme, an étale 
neighborhood U of x consists of an étale mor- 
phism f;U-»X and a morphism j: x U such 
that f oj = i. For a sheaf F on X, the stalk 
F, at a geometric point x is defined by F, = 
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lim I'( U, F), U being the étale neighborhoods 
of x. On X a the sheaf G,, is defined by G,(S)= 
IS, Gef for each SeOb(Et/X). The kernel of 
the nth power homomorphism G,, 5 G,, is 
denoted by ,. The sequence 124,2 G,, 5 

G, — | is exact in X „if pis prime to resid- 

ual characteristics of X, but not necessarily 
exact in X,,,. There are canonical isomor- 
phisms Pic(X)= H! (Xz; OF) %H'(Xq, Ga) 3 
H'(Xq, G,). For X = Spec(k), k a field, the 
étale cohomology theory of sheaves on X, is 
equivalent to the tGalois cohomology theory 
over k; hence H'(X4, FI HG, F;), where 

G = Gal(k/k), k the tseparable closure of k and 
F; is the stalk of F at the geometric point Gi. 
of X. For a morphism f: X 5 Y of schemes 

and a sheaf F on X,, the direct image sheaf 
Ía F of F is defined by f, F(S) = F(X x ,S), 
SeOb(Et/Y), and higher direct-image sheaves 
R'f, E are detined by the right-derived functor. 
Let X be a separated scheme of linite type 
over a field k. By a theorem of Nagata [63], 
there exists a scheme X proper over k and 

an open immersion j: X 5 X. For a torsion 
sheaf F on X, let j,F be a sheaf on X, ex- 
tended by 0 outside X. The cohomology 
with compact support H4(X,,, F) is defined 


et? 
by H(X a, F)= H*(Xa jF). This is inde- 
pendent of compactifications. Similarly, for a 
separated morphism f: X Ah of linite type of 
schemes and a torsion sheaf F on X, one can 
define higher direct image sheaf with compact 
support R4f,F. For the étale topology, torsion 
sheaves are important. All torsion sheaves on 
X,, on a Noetherian scheme X are inductive 
limits of constructible sheaves. A sheaf F on 
Xa is called locally constructible (constant) if F 
is represented by an étale covering of X. A 
sheaf F on X ,, is called constructible if there 


exists a linite surjective family of subscheme X; 


of X such that the restriction of F to X; is 
locally constructible (constant). This is equiva- 
lent to saying that every irreducible closed 
subscheme Z of X contains a nonempty open 
subscheme U such that the restriction F to U 
is locally constant and has finite stalks, i.e., 
there is a covering family (¢;: U;>U} such that 
F U,is constant and the stalk F. is a finite 
Abelian group for all geometric points x of U. 
The cohomology of a torsion sheaf or a con- 
structible sheaf has properties similar to those 
of the classical cohomology. Let f: X 5$ be a 
proper morphism and F a torsion sheaf on 
Xa Then the stalk R?f, F at a geometric 
point s of A is isomorphic to H*(X, 4, F) 
where X,=X x ; Spec k(s). If f is a separated 
morphism of linite type of Noetherian schemes, 
a similar fact holds for the cohomology with 
compact support. Moreover, if F is a con- 
structible sheaf, R?f,F is also constructible 
(finiteness theorem). For an affine scheme X of 
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finite type over a separably closed field and F 
a torsion sheaf on X,,, one has H*(X,, F) =0 
for q > dim X. Let f: X 2$ be a separated 
morphism of schemes of linite type on C, Fa 
torsion sheaf on X,,. One has a canonical 
isomorphism (R?f Hm ^; R?f?" F^". In partic- 
ular, if X is proper over C, one has H*(X ,, 
Z/(n)) s H*(X"", Z/(n)). On the other hand, 

for a nonsingular complete curve C over an 
algebraically closed field, one always has 
H'(C.,, Z) = 0. For a geometric point s of § 
the strict localization of S at s is the ring 

Os s= lim T(U, Oy), U being étale neighbor- 
hoods of s. A geometric point f of § is called a 
generalization of ; if t is detined by an alge- 
braic closure of the residue field of a point of 
Spec (, ,. In this case, $ is called a special- 
ization of t. If f: X S is a proper smooth 
morphism the sheaf R4f, Z/(n) is locally con- 
structible (constant), and ifs is a specializa- 
tion of t one can define a cospecialization of 
H'(X, à, Z/(n)) to H(X, 4, Z/(n)) which is 
bijective. Let X be a scheme of finite type over 
a field k. If | #ch(k) is prime, one can define the 
l-adic cohomology H*(X, Qj) = lim, H*(X, Z/ 
(I) &7, Q,. The Lade cohomology has as 
many good properties as the classical co- 
homology (— Section U). 
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A. General Remarks 


If an analytic function f satisfies an tirreduc- 
ible algebraic equation 


Ao(2)f* - A ()f '+...+A,(z)=0 (1) 


with single-valued meromorphic functions 
Aj(z) in a domain G in the complex z-plane, 
then f is called a k-valued algebroidal function 
in G. With no loss of generality, we Can as- 
sume that there is no common zero among the 
Aj(z) and that all the A,(z) are tholomorphic in 
G. When k = 1, the solution of (1) is a single- 
valued meromorphic function in G. If all the 
Aj(z) are polynomials, then f is an talgebraic 
function. Thus algebroidal functions can be 
regarded as extensions of single-valued to 
multiple-valued functions and also as exten- 
sions of algebraic to *transcendental functions. 
Since (1) is irreducible, its discriminant D(z) 
does not vanish identically. For all the points 
a satisfying D(a) Z0, Ag(a) 40, ae G, equation 
(1) determines k holomorphic function ele- 
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ments f,(z) f,(z) in a suitable neighborhood 
of a that determine the analytic function f. 
They can be *prolonged analytically in G in 
the wider sense. At any point satisfying A,(z) 
= 0, at least one element has a pole; and at any 
point satisfying D(z) = 0, there may appear 
ramified elements. Therefore an algebroidal 
function can be defined as a finitely multiple- 
valued analytic function in G with the excep- 
tion of poles and talgebraic branch points. 
Every algebroidal function f(z) determines a 
tRiemann surface, which may be considered a 
tcovering surface 3 of G. This surface 3 is a k- 
sheeted *covering surface over G with no sin- 
gular point except for algebraic branch points. 
Also, f reduces to a single-valued meromor- 
phic function on 33, and all the function ele- 
ments over a point z are different. A k-valued 
algebroidal function can also be characterized 
by this property. 

These two (equivalent) delinitions of alge- 
broidal functions give rise to two distinct 
methods of studying these functions. In adopt- 
ing the first definition, we can make use of 
results in the single-valued case, as did G. 
Rémoundos and G. Valiron. When the lat- 
ter definition is adopted, we can use several 
methods that are also applicable in the single- 
valued case, as did H. Selberg and E. Ulrich. 

Research on algebroidal functions has been 
carried out mainly for the case where G is the 
finite plane |z|< oo or the unit disk |z|< 1. 
Almost all the known results for algebroidal 
functions are extensions of those on single- 
valued functions, but several results particu- 
larly relevant to algebroidal functions have 
been discovered. The existence of branch 
points makes it difficult to investigate alge- 
broidal functions in some cases. 


B. Absolute Value 


Among several results on the absolute values 
of algebroidal functions, the tmaximum prin- 
ciple, one of the basic principles, holds on the 
Riemann surface 3. The following relation 
holds among the A,(z) and | f,(z)|. Assume that 
(1) has the form 


SE +A (2) fe 1+... 4+4,(z)=0. (1) 


Then log( 1 + A(z))/log( 1 + Pai is bounded, 
where A(z)=max|A,(z)| and F(z) — max | f,(z)]. 
An algebroidal function that has no pole 
in z| «oo is called an entire algebroidal func- 

tion. The successive derivatives of an entire 
algebroidal function may have poles at every 
branch point, which is a departure from the 
case of single-valued integral functions. An 
algebroidal function defined by (1) or (1’) with 
entire functions A;(z) and zero-free ,4,,(z) is 
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entire. Dividing both sides of (1) by A,(z), we 
obtain equation (1’), and all its coefficients are 
entire. In equation (1^), the forder, type, and 
class of an entire algebroidal function coincide 
with those of the largest A(z) (- 429 Tran- 
scendental Entire Functions). 

For an entire algebroidal function of order 
less than 1/2, F(z) tends to infinity uniformly 
along a sequence of concentric circles |z| =r, 
(r, — oo). However, it can be shown that not 
all the branches of f(z) necessarily tend to in- 
finity on the Riemann surface jj; in this sense, 
*Wiman’s theorem does not remain true. But, 
using min, max, | f,(z), one can obtain some 
extensions of the generalized Wiman theorem. 


C. Picard's Tbeorem and Its Extension 


Rémoundos first extended tPicard's theorem 
and *Borel’s theorem to an algebroidal func- 
tion. Every k-valued transcendental alge- 
broidal function in the finite plane takes on 
every value inlinitely often with at most 2k 
exceptional values. There are examples where 
2k values are actually omitted. Hence the 
theorem is the best one possible in this sense. 
T. Varopoulos (Bull. Soc. Math. France 53 
(1925), 23-34) introduced the degeneracy index 


Asdmi(esci;.. 
+¢,A,=0} 


(0x 4<k= 1) into equation (1) and showed 
that the number of Picard's exceptional values 
off is at most k+ A+ 1. There is no single- 
valued meromorphic function with A > 0, so 
this result is relevant only for algebroidal 
functions. Extending this idea further, J. Du- 
fresnoy and others obtained more precise 
results. The situation is the same for the Borel 


CEC [eo Ao tc, A t+... 


exceptional values. That is, the +Convergence 
exponent of f(z) — w = 0 coincides with the 
torder off except for at mcst 2k values, for 
which the convergence exponents off are 
less than the order off (E. Borel). There are 
at most 2k polynomials P(z) for which f(z) — 
P(z) = 0 has at most a finite number of roots 
(Borel), and furthermore, using the degeneracy 
index, we Can give more precise results. 

Selberg was the first to extend the tNevan- 
linna theory of meromorphic functions to 
algebroidal functions (— 272 Meromorphic 
Functions). Almost simultaneously, Valiron 
obtained the same results starting from the 
coefficients of (1); then Ullrich improved the 
results by considering the effect of branch 
points of 3. 

Let 3, be the part of 3 over [z| € r, let n(r, w 
be the number of roots of f(z) = w = 0 in 3,, 
and let n(r, 3) be the number of branch points 
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of 3,. With these notations we write 
|." t n(n(0, w) 
Wl n (nt) — n0) Oa 
kyo t k 
m(r, w) 
| | ——— do, wx oo 
T EL Oo - , ‘ 
Ikr Ja. E qe) S 


T(r, w) 2 m(r, w) +N (r, w). 


Let T(r, f) be the flogarithmic integral of the 
spherical area of the image of 3, under w = 


f, 


dI« f (te^) 
T(r, f) B ; lla Hfaeepy d do, 


and let N(r, 3) be the logarithmic integral of 
n(r, 3). Then we have T(r, w) =T(r, f) + O(1) 
and the ramification theorem: 


N(r, 3) <(2k —2) T(r, f) + O(1). 

Also, T(r, f) - O(logr) holds if and only if f is 
algebraic. Let .4(z) be the maximum of |A,{z)|, 
and let 


Jr 


1 
= —— Wi 
u(r) eil log A(re'?)dq. 


Then T(r, f) = p(r) + 0( 1). As the second funda- 
mental theorem we have 


È Wi w> -DTE f) N(. 3) 
o Ni (r, w) +O(logrT) 
v=] 


>(q=2k) T(r, f) 


+ y N,(r,w,)+ O(logrT), 
v-1 

where N, (r, w) is the logarithmic integral of 
ni(r, w) which is the sum of the multiplicity 
minus one of all the roots of f(x) =w 20 in A 
Furthermore, the deficiency, ramification index 
of f(z), and ramification index of the surface 3 
are defined by 


6(w)= 1 —limsup N(r, w)/T(r, f) 
=liminfm(r, w)/T(r, f), 
D(w) = liminf N,(r, w)/ T(r, f), 
€=hminf N(r, 3)/T(r, f). 
With these notations, we have 
3 d(w) +) Dow) < 24+ E< 2k. 


These results contain the Picard theorem 
and the Borel theorem. Furthermore, by con- 
sidering the effect of branch points, the Ahlfors 
theory of covering surfaces can be extended to 
algebroidal functions (Y. Tumura). By using 
this result, *Bloch's theorem can be obtained 
very simply. 


But these results do not contain the Varo- 
poulos result. Taking into consideration the 
degeneracy index 4, H. Cartan proved the 
following inequality when 4 = 0: 

q 
(q- k - )T «AN (r, w,) + S(r), 
and for general 4 he conjectured that the 
number qk = 1 may be changed to the num- 
ber q—k — 4 — 1. This conjecture is still un- 
solved except for some special cases where for 
example, q = k+ 4 + 2 (N. Toda, Nagoya Math. 
J., 91 (1983), 37-47). 

Interesting results follow upon a study of 
this degeneracy index. For example, when f is 
an entire algebroidal function, if $2*1! ô(w,) 
2k 2(w, #00), there are at least km 1 
*Picard’s exceptional values in {w,} (Niino, 
Ozawa, and Toda). 

With respect to the relation between the 
number of exceptional values and the order of 
fin |z| < oo, there are some results similar to 
those for single-valued functions. For example, 
if f has k+ 1 Picards exceptional values, the 
order off must be a positive integer or oo. 


D. Asymptotic Values and Other Results 


In the single-valued case, Valiron, L. Ahlfors, 
W. H. J. Fuchs, A. Edrei, W. K. Hayman, and 
others studied the tBorel direction, the number 
of tasymptotic values, the relation between the 
deficiency values and the asymptotic values, 
etc. However, almost none of the correspond- 
ing results holds for the algebroidal case as 
shown by several counterexamples. 

There is no relationship between the order 
of an entire algebroidal function and the num- 
ber of its finite tasymptotic values, which is 
quite different from the single-valued case. 
Furthermore, it is possible to have an infinite 
number of asymptotic values even if the order 
is equal to zero. If an algebroidal function f 
satisfies lim inf T(r, f)/(log r)? « +00, then it 
has at most k asymptotic values (Valiron- 
Tumura). The Ahlfors theorem, which is con- 
cerned with the number of tdirect transcen- 
dental singular points of the inverse function 
and the order of a meromorphic function in 
the single-valued case, was extended to the 
algebroidal function by Lü Yinian (Scientia 
Sinica, 23 (1980)). 

The tJulia direction or the Borel direction 
for an algebroidal function is defined not on 3 
but on [z| < cc because of the appearance of 
branch points. With respect to the Julia direc- 
tion, there are results similar to those for the 
single-valued case, but it is unknown in gen- 
eral whether the Borel direction exists. A. 
Rauch proved that if Ip T(r, dr ar mq. 
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then there is a GE with an angle of at least 
n/p in which L(o)— [^log* F(re'^/r^*! dr 
diverges. Apart rom. the theory of distribution 
of values, Selberg obtained some conditions 
under which the inverse functions of fAbelian 
integrals of a special kind reduce to alge- 
broidal functions. 
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A. History 


The theory of almost periodic functions was 
originated by H. Bohr in 1924 as a result of his 
study of !Dirichlet series. The theory provides 
a method of studying a wide class of trig- 
onometric series (^ 159 Fourier Series) of 
general type. Further generalizations were 
made by N. Wiener, V. V. Stepanov, A. S. 
Besikovich, S. Bochner, and others. H. Weyl, 
J. von Neumann, and others clarified the rela- 
tions between this theory and *representation 
of groups, specifically, the relations between 
almost periodic functions in a *topological 
group and representation theory of a "Compact 
group. 


B. Almost Periodic Functions in the Sense of 
Bohr 


Let f(x) be a complex-valued continuous 
function defined for all real values of x. A 
number 1 is called a translation numher of f(x) 
belonging to ¢ > 0 if 

sup |fix+1)—fOl<e. 


TOXO 


If for any e> 0 there exists a number fiel > 0 
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such that any interval of length /(c) contains a 
translation number off belonging to x, then 
f(x) is called almost periodic in the sense of 
Bohr. We denote by B the set of all almost 
periodic functions in the sense of Bohr. 

If f(x) is *periodic with a period p, then 
f € B, because each number /> p plays the role 
of el for any c» 0. Anyf € B is bounded and 
uniformly continuous. A necessary and suffi- 
cient condition for a bounded continuous 
function on ( —00, 00) to belong to B is that for 
any given sequence {h,} of real numbers, there 
exist a subsequence {h, } such that the se- 
quence of functions { f(x + hy )} is uniformly 
convergent in (—oo, 0); i.e., the set { f(x+ 
h) he( —oo, oo)] is *totally bounded with re- 
spect to the uniform norm | f| Zsup f(x)|in 
the space of bounded continuous functions in 
(—00, 00). 

If f(x) € B, then f( —x), f(x), af (x) (where 
a is a complex number), and f(x + h) (where 
h is a real number)c B. If f(x), g(x)e B, then 
f(x) g(x) and f(x)g(x)e B. If f,(x)e B and 
{ f,(x)} converges uniformly to f(x), then f(x) 
c B. For any real number A, expisx (where 
iis the fimaginary unit) is continuous and 
periodic. Hence the polynomial function 
M. 2, expiA,x € B. Moreover, if the latter 
function converges uniformly to X a, exp iA,x 
as m tends to oo, then the limit function 
is also an element of B. The polynomial 
15-10, EXP iA,x and the series 3,7, Xn expiA,x 
are called a generalized trigonometric poly- 
nomial and a generalized trigonometric series, 
respectively. 

For any f eB, its mean exists: 


1 [OT 
M[f]= lim | f(x) dx. 
a 


The convergence of the right-hand formula 
is uniform in ae( —00, oo), and the limit is 
independent of the choice of a. Thus M[ f ] 

is a "linear functional defined on B. Since 

M [expiAx]- 1 for 420 and =0 for 140, the 
family {exp idx — « 4 < ooj is an fortho- 
normal system with respect to the tinner prod- 
uct (f, g) MI f(x)g(x)] defined on B. Let 
a(A)= M[f (x)exp( iAx)]forany f € B; then 
there exist countably many values of 4 for 
which a(4) differs from zero. Denote these 
values of 4 by 4,, 4, . and write a(4,) = 

X, We call the numbers a,, Ga, a,, the 
Fourier coefficients of f(x). The formal series 
Prei Xn exp Lat is called the Fourier series 

of f(X). Moreover, the Parseval equality 
MU|f()/?]=X2, |e, |? is valid for any f eB. 
For every periodic function, these definitions 
coincide with the ordinary Fourier coefficients 
and the Fourier series (— 159 Fourier Series). 
Any almost periodic function in the sense of 
Bohr is uniquely determined by its Fourier 


02 


coefficients; i.e., if two almost periodic func- 
tions have the same Fourier series, then they 
are identical. For any f € B, its Fourier series 
does not always converge uniformly, but 

f(x) € B can be approximated uniformly by a 
sequence of trigonometric polynomials. Hence 
the almost periodic functions in the sense of 
Bohr are also called uniformly almost periodic 
functions. 


C. Generalizations of Almost Periodic 
Functions 


Let C( —00, oo) be the space (- 168 Func- 
tion Spaces) of all bounded continuous func- 
tions on (—00, oo) with distance p( f, g) = 
SUD. eeu HL f(x) - g(x)]. Then a uniformly 
almost periodic function is the limit of a se- 
quence of trigonometric polynomials with 
respect to this distance. Generally, let p bea 
‘distance function introduced in a function 
space (whose elements are not necessarily 
continuous in (—oo, 0)). Then the limit of a 
sequence of generalized trigonometric poly- 
nomials with respect to the distance p is called 
an almost periodic function with respect to p. 
For example, for p > 1, we set 


atl l/p 
fal 19 - sup l | EES 
Deal f. SI = 


Lf Up 
lim sup t | |f i) acuta} 
lo -a<a<am LU Ja 

These are distance functions. The properties of 
the corresponding almost periodic functions 
and their relations to other classes of almost 
periodic functions have been studied by Besi- 
kovich [ 1]. 


D. Analytic Almost Periodic Functions 


Let D be a strip domain, a < Re z < b, defined 
in a complex plane. For any tholomorphic 
function f(z) in D and & > 0, a real number 1 is 
called a translation number off belonging to 
£» 0 if sup,.p|f(z *- it) — f(z)| <e. If for any e» 
O there exists a number (e) > O such that any 
interval of length (el contains a translation 
number off belonging to €, then f(x) is called 
an analytic almost periodic function in D. We 
denote the set of all analytic almost periodic 
functions in D = {a < Rez < b}by A(a, D). If we 
fix an x ina «x < b, then g(y) = f (x + ly) for 
any f(z)e A(a, b) belongs to B. 

For any f € Á(a, b) there corresponds a 
*Dirichlet series 27^, o, exp 4,z such that two 
analytic almost periodic functions are identi- 
cal if the corresponding Dirichlet series are 
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identical. Here the coefficients 


ty = My[ f(x * iy) exp( = ii, y)] 


are determined independently of x (a <x <b), 
and Parseval's identity 


M, [lf + iyl^]12 » |x, |? exp24,x 


holds (Bohr [3]). If the series 
) a, exp A,x exp id, y 


at x = a and x = D represent the Fourier series 
of Lil and f,(y)e B, respectively, then there 
exists f € A(a, b) such that f(z) is continuous on 
D and 


f(aviy-f). — f(bxiy- f). 


The behavior of f¢A(a, b) at the boundary or 
exterior points of the domain D = (a < Re z < 
b) has also been investigated by Besikovich 


[1]. 


E. Almost Periodic Functions on Groups 


Von Neumann defined almost periodic func- 
tions on any group, generalizing the charac- 
terization of uniformly almost periodic func- 
tions on (— 00, o0). Let B(G) be the set of all 
complex-valued bounded functions on a group 
G. Then B(G) is a metric ue with the tdis- 
tance p(f,g)=supyeg| f(x) — 969]. If for any 
f € B(G) the set A,= { f, ,(x x)= ` echt )|a, beG} 
is totally bounded in the metric space B(G), we 
call f an almost periodic function on the group 
G. This condition is equivalent to the total 
boundedness of B, = { f,(x) = f(xa) ae G} or 
C, = {af (x)= f(ax)|aeG}. We denote the set 
of almost periodic functions on G by ¥(G). 
For f(x), g(x)e .«s/(G), the linear combina- 
tions a: f(x) -- b: g(x) (a, beC) and the prod- 
uct f(x)g(x) are both contained in .¢(G). If 
f, € «f (G) and | fa} converges to f uniformly on 
G, then fe.A(G). If feat then fay, faraf € 
d(G) also. Hence «/(G) is a closed subalgebra 
invariant under two-sided translation in the 
*Banach algebra i?(G). For any f€./(G) there 
exists only one number M[ f] in the closure 
(with respect to the distance p in B(G)) of 


äis Er 





es p cif (a;xbj) | c; 0, c; — 1, aj, b; ec} 
(= the least closed tconvex set including A,). 
We call M[ f] the mean off on G. The map- 
ping f—>M[f]is a linear functional on (G), 
and we have M[f]z0 1f f z 0. 


F. Relation to Bounded Representation 


Suppose that we are given a finite-dimensional 
matrix representation D(x) = (d,(x)) of a group 
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G. Then the following three conditions are 
equivalent: (i) All the d,(x) are bounded on G. 
(ii) Al] the d,(x) are almost periodic in G. (iii) 
The representation J) is equivalent toa repre- 
sentation by unitary matrices. The inner prod- 
uct (f 8) = MLf(x)g(x)] provides the algebra 
A(G) with the structure of a tpre-Hilbert 
space. Let H(G) be the Hilbert space that is the 
completion of s% (G). If we select D”) = (d#(x)) 
from each L, where L is an equivalence class of 
bounded irreducible representations ol G, and 
if n; is the order of Di. then {( (1/./n,)d2( x)1« 

i,j <n}, A€ L} is a tcomplete ER sys- 
tem in the Hilbert space H(G). Any f(x) € 
(G) can be approximated uniformly in G 
by a finite linear combination of the dix). 


G. Almost Periodic Functions on Topological 
Groups 


When G is a tseparated topological group, we 
denote the set of all continuous functions on G 
contained in ./(G) by £ (G). The statements 
of the theorems in the previous section con- 
cerning it) and the representation D remain 
valid if we replace 4 (G) by d,(G) and replace 
D by a continuous representation of G. In 
particular, if G is the additive group of real 
numbers R, then d,(R) is exactly B. 


H. Relation to Compact Groups 


Every continuous function on a compact 
group G is almost periodic; i.e., 2 ,(G) = C(G). 
The mean value M[ f ] of f'e.«/,(G) is identical 
to fe f(x) dx, where the tHaar measure dx is 
normalized so that fe dx= 1. In this case, the 
theory of bounded representations discussed 
above is the Peter-Weyl theory (- 69 Com- 
pact Groups). 

In general, let G be a separated topological 
group. There exists a continuous homomor- 
phism o of G onto a compact group K = K(G) 
with the following two properties: (i) For any 
compact group K' and a continuous homo- 
morphism q': G— K’, there exists a continuous 
homomorphism y : K > K'such that o' = yo 
g. (ii) Such a pair K =(K, o) is unique up to 
isomorphism. K is called the Bohr compactifi- 
cation of G, and ¢ is called the canonical map- 
ping. In particular, suppose that G is a locally 
compact Abelian group and G* is its *charac- 
ter group. We denote by G' the group G* with 
discrete topology. Let K be the character 
group of G°, and let ~* be the identity map- 
ping G' 2 G* and @ be its conjugate mapping 
G—K, which is a continuous homomorphism. 
Then K is the Bohr compactification of G 
with the canonical mapping p. A necessary 
and sufficient condition for f on G to be con- 
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tinuous almost periodic is that a continuous 
function f'on K such thatf = f'o o must 
exist. If this condition is satisfied, then the 
mean M [f] is identical to fk f'(x) dx. For 
any finite-dimensional continuous unitary 
representation DN of K, D = D’o is a finite- 
dimensional continuous unitary represen- 
tation of G, and vice versa. Hence there exists 
a canonical isomorphism (determined by D = 
D’o ọ) between the equivalence classes of 
finite-dimensional unitary representations 

of a separated topological group G and the 
equivalence classes of finite-dimensional uni- 
tary representations of its Bohr compactifica- 
tion K. The 'kernel of the canonical mapping 
9: GK is identical to the intersection of all 
the kernels of finite-dimensional continuous 
unitary representations of G. 


1. Maximally Almost Periodic Croups 


Let G be a topological group. If for each pair 
a, b of distinct elements of G there exists a 
continuous almost periodic function f on G 
such that f(a) #f(b), then G is called a maxi- 
mally almost periodic group. This is the case 
if and only if G has sufliciently many finite- 
dimensional unitary representations. For a 
connected locally compact group G, the fol- 
lowing six conditions are equivalent: (1) G is a 
maximally almost periodic group. (2) There is 
a one-to-one continuous homomorphism from 
G into a compact group. (3) G is the direct 
product of a compact group and a vector 
group R”. (4) G is the *projective limit of +Lie 
groups that are locally isomorphic to compact 
groups. (5) The quotient group G/Z is com- 
pact, where Z is the center of G. (6) The system 
of all neighborhoods that are invariant under 
the tinner automorphisms constitutes a basis 
for the neighborhood system of the unit [7]. 

Moreover, any discrete free group is maxi- 
mally almost periodic. If there is no continu- 
OUS almost periodic function except constant 
functions, the topological group is called 
minimally almost periodic. Any noncompact 
connected Temple Lie group is minimally 
almost periodic. 
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A. History 


The term analog computation isa generic term 
describing various techniques of computa- 
tion employing diagrams, or physical systems 
whose equations are similar to the mathemat- 
ical problems in question. The history of ana- 
log computation is probably as old as that of 
digital computation; for example, the ancient 
Greeks tried to solve cubic equations using 
diagrams, and the astrolabe widely used by 
astronomers through the medieval period is 
also a kind of analog computer. Soon after 
the discovery of logarithms, the slide rule was 
invented. In the 18th Century, the planimeter, 
used to measure plane areas, was introduced, 
and in the 19th Century, the nomogram ap- 
peared (- Section D). In the first half of 20th 
Century, a large electronic analog computer 
was developed, thus predating the lirst prac- 
tical digital computer. 

However, analog computation has an essen- 
tial defect, namely, the limitation of accuracy. 
Today it is useful for simple calculation, but is 
rapidly becoming less important as the devel- 
opment of digital computers, including pocket 
calculators, advances. 


B. Graphical Calculation 

Graphical calculation is a method of compu- 
tation by means of geometric constructions 
using common drawing tools. Some typical 
examples of practical graphical calculation are 
the following: evaluation of linear functions of 
several variables (J. Massau, 1887), of systems 
of linear equations (F. J. van den Berg, 1888), 
of polynomials (J. A. Segner, 1761), of alge- 
braic equations (Lill, 1867), graphical integra- 
tion, graphical differentiation, and the solu- 
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tion of ordinary differential equations of the 
first order (J. Massau, 1878), of linear differ- 
ential equations (Czuber), and of ordinary 
differential equations of the second order 
(Lord Kelvin, 1892). 

Using bisquare-root graph paper, sold 
under the names of binomial probability 
paper or stochastic paper; we can handle tF- 
distributions or other probability distributions 
reducible to F-distributions, such as binomial 
or normal distributions, to a fairly good de- 
gree of approximation. This grapbical method 
of statistical inference is, even now, a powerful 
method of statistical quality control. 


C. Graphical Mechanics 


Graphical mechanics is the graphical treatment 
of mechanical problems, especially problems of 
equilibrium. In this method, the fundamental 
constructions are the composition and reso- 
lution of forces by means of force polygons 
(Fig. 1). This method is also applicable to 
problems in dynamics when they can be re- 
duced to problems of equilibrium by means of 
d'Alembert's principle. 





Fig. 1 

An example of the graphical method of constructing 
the composite and the line of action of three given 
forces F,, F;, and F,. In the force polygon in (b), the 
vector 03 represents the composite R. On the link 
polygon in (a), the point d is a point on the line of 
action of the force. If the fourth force F, has the 
same magnitude as R with opposite direction, both 
polygons are closed, and the four forces F,, F,, F,, 
and F, are in equilibrium. 


Graphical mechanics is most convenient 
when applied to problems reducible to 2- 
dimensional structural mechanics. But we can 
also use 1t for 3-dimensional problems if we 
work on projections, e.g., on the plan and the 
elevation, of the original body. In recent years, 
however, the amount of work involved in 
geometric construction has been deemed non- 
negligible, and the technique of computer 
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graphics is now considered to be much more 
convenient. Here the computation is done by a 
digital computer, and the result is displayed in 
a convenient graphical form. 


D. Nomograms 


Nomograms are charts in which we can easily 
read off the corresponding value u, from given 
values Uis> Upi when there is a relation 
F(u,,...,u,) = 0 among m real variables u; , 

.... Ha, The construction of nomograms has 
been thoroughly investigated by M. d'Ocagne 
[5]. 

A function of two variables F(u, ,u2)= 0 can 
be represented by a two-sided scale or by func- 
tional paper. The most useful nomograms are 
those for functions of three variables, F(u}, 
Uy, u4) = 0. Changing this into f(x, y, uj) - O 
(i — 1,2, 3) and drawing the curves f(x, y. u;) = 0 
for fixed values of u;, we have an intersection 
chart, where three curves for the correspond- 
ing values Hu, Us, Ha meet at a point. In prac- 
tice, however, it is much more convenient to 
use the alignment chart, which is a dual of the 
intersection chart. It is especially useful when 
the relation F(u,, u;,u4) - 0 canbe decom- 
posed into an equation of the form 


fin) gilu) Ay (uy) 
falu) 922) h;(uj)| =0. (1) 
Linz) gail h3(us) 


Putting x; = f;/h;, y; 7 g;/h; (i = 1,2, 3) and 
drawing the curves (x;, y;) with the parameter 
u; scaled on it (u;-scaled), the corresponding 
values u,, u5, and y, satisfying relation (1) lie 
on a straight line (Fig. 2). Using this property 
we can easily read off the corresponding values 
(by laying down a ruler). In the strict sense, the 
term “nomogram” usually refers specifically to 
an alignment chart. 


uj uo Hi 





Fig. 2 


For four or more variables, we can apply 
similar techniques if the function is separable 
into functions of three variables. If it is not 
separable, we can sometimes apply such tech- 
niques such as two-functional scales with 
functional networks, cocircular charts, co- 
planar charts, or moving charts. Using func- 
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tional approximations, an approximate nom- 
ogram has recently been constructed with 
errors considered to be admissible in practical 
use. 

Historically, the origin of Hilbert’s 13th 
problem, which asks if it is possible to rep- 
resent a function of many variables as the 
composite of functions of two variables, 
cornes from a study of nomograms with many 
variables. 


E. Analog Computers 


The analog computer is a special purpose 
machine designed for a specified analog com- 
putation. In a wider sense it includes special 
devices that perform tanalog simulation. The 
following mechanical devices are well known: 
the pantograph for copying plans, the harmonic 
analyzer for obtaining the Fourier expansion 
of a periodic function, and V. Bush’s differen- 
tial analyzer. Since about 1940 large electronic 
analog computers for solving differential equa- 
tions have been extensively developed. 

Analog computers have up to recently had 
the advantages of ease of construction, sim- 
plicity, and inexpensive operation, and they 
were also considered to be fast enough for use 
in real-time computation. To compensate for 
the limitations of analog computers in com- 
parison to digital ones, several hyhrid com- 
puters have also been used. However, with 
the rapid progress of digital computers and 
digital-analog converters, the analog com- 
puters are now considered less important than 
in the past. 


F. Curve Fitting 


Curve fitting is a method of finding a sim- 

ple curve y = f(x) supplying the best possible 
approximation to the values y, y,, for 
discrete values X;, x,, of the independent 
variable. A polynomial passing through all the 
given points is constructed by means of tinter- 
polation. For experimental data we usually 
construct a curve by using methods, such as 
the "method of least squares, that take the 
errors due to observation into account. 

A function constructed to best fit the ob- 
served values of the function y = f(x) express- 
ing some physical law is called an empirical 
formula, in contrast to the theoretical formula. 
Usually, we first assume that the function 
contains several empirical constants, which are 
then determined to fit the experimental data. 

The most common case is a linear approx- 
imation (using a linear function) with a suit- 
able change of variables. Semilogarithmic 
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paper, logarithmic paper, or probability paper 
are used to facilitate transformations in terms 
of logarithms or normal distributions. 

We also frequently use polynomials or trig- 
onometric polynomials of lower degree. To 
determine the empirical constants the follow- 
ing methods are used: the graphing method, 
the selection of certain points, the mean value 
method, or the method of least squares. 


G. Orthogonal Polynomials 


Let there be given the values yg, y,,. y, , for 
discrete values x = 0, 1, . , n— 1 with equal 
differences. A polynomial f(x) of degree k (<n) 
that minimizes the sum of the squares G,(n) = 


X(y,— f(m)) is given by 


v-0 
n-] 
d 2. Ve Al, m)/S, (n), 
n-1 
S,(n)- 2, (qu mir, 


and we have G,(n) 2 37,5 y2 Ändä 
Here the function q,(n, x), called a Chebyshev 
q-function, is defined by 


amor SIDE) 


In practical applications, it is better to replace 
q,(n, x) by the function 
din, x) - q,(n, x)/2~’v!M,(n), 


where M,(n) is the greatest common divisor of 


v+tm\/fn—-l—m 
( V ) 21. n. 
m V—n 


The values q? (n, m) (m—0, 1, ...,n — 1) are mu- 
tually coprime integers. The function q*(n, m) 
is called the simplest Chebyshev g-function or 
simplest orthogonal polynomial, and is some- 
times denoted by X,,,(x), Cu) or Pn, (X). 
When two functions satisfy the condition 


(fg $ Song =0, 


they are called orthogonal for a finite sum. If 


from 1,x,x?,..., x” we construct a system 


orthogonal with respect to this definition, then 
we have the polynomial 


reas rr UI 


which has the following connection with q,: 


q,(n, x) 
-((-1)'(n— 1)/2(n— v— 1) P, 1 09. 
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The polynomial P, ,(x) is called a Chebyshev 
orthogonal polynomial or sometimes simply an 
orthogonal polynomial. The polynomials P, ,(x) 
are orthogonal for the finite sum. If n- oo and 
0 €x« lI, the function P, ,(x) tends to P,(1— 
2x), where P, is the ‘Legendre polynomial. 
These functions may be conveniently used in 
least-squares curve fitting. 
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The origin of analysis can be traced back to 
the time when Eudoxus (4th century B.c.) and 
Archimedes (3rd century s.c.) devised the so- 
called method of exhaustion for calculating 
the area of a plane figure and the volume of a 
solid. Their objects of investigation were re- 
stricted, however, to particular types of figures 
or solids. In the 16th and 17th centuries, F. 
Viéte, J. Kepler, and B. Cavalieri again took 
up this problem. In the 17th Century, the prob- 
lem of drawing a tangent to a given curve 

was studied by R. Descartes, P. de Fermat, 

B. Pascal, and J. Wallis. Fermat, in particular, 
applied the result to find the maxima and 
minima of certain functions. It is worth noting 
that a certain type of mathematics powerful 
enough to produce similar results was inde- 
pendently developed in Japan around that 
time. In 1684, G. Leibniz in Germany intro- 
duced the symbols dx and dy in treating the 
same problem. He proved that dy/dx repre- 
sents the slope of the tangent to the curve at 

a given point and discovered a new operation 
to calculate it. In 1686, he established the “‘in- 
verse tangent method," which is what we now 
call integral calculus. He also introduced the 
notation f. On the other hand, I. Newton in 
England developed his “method of fluxions,” 
corresponding to our differential and integral 
calculus, from the viewpoint of mechanics. But 
neither Leibniz nor Newton formulated the 
fundamental concepts rigorously, and there- 
fore they were criticized severely by many 
contemporary scholars. The new calculus 
gained ground in Great Britain rather slowly. 
B. Taylor in England and C. MacLaurin in 
Scotland demonstrated its usefulness in 1715 
and in 1745, respectively. On the continent, 
however, Leibniz's symbolic calculus was 
taken up by mathematicians of the Bernoulli 
family, G. F. A. de l'Hópital, G. Fagnano, and 
many others; with it, they solved many scien- 
tific problems which until then had remained 
intractable. This motivated subsequent re- 
searchers to pose new problems in the form of 
differential equations. 

One of these problems, treated by J. le 
Rond d'Alembert in relation to the vibration 
of a chord, concerns the tpartial differential 
equation 


0^ y/dt? =a*d? y/dx* (1) 


for y = y(t, x) with the boundary conditions 
y = 0 for x = 0 and x = Į, He obtained the solu- 
tion yz f(at - x) — f(at — x), where f is an 
arbitrary function of period 21. In 1753, D. 
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Bernoulli showed that solutions of equation 
(1) are given by functions of the form 


a E knx . krx 
yeu X (aeo + sin!) 
E 


These two kinds of solutions gave rise to the 
question of whether an arbitrary function can 
be expressed by a *trigonometric series. This 
problem was studied by A. C. Clairaut, J. L. 
Lagrange, and L. Euler. In 1807, J. Fourier in 
France, in treating a problem on the conduc- 
tion of heat, claimed that an arbitrary function 
of period 2r can be expressed as 


a4 <= : 
y + 3 (a, cos kx +b, sin kx), (2) 
Deal 
where the coefficients a, and b, are given by 


1 T 
a= f f(x)cos kx dx, 
8 J-1 


Ke | f (x)sin kx dx. (3) 
n =R 

This series is now called the !Fourier series, 

but Fourier never verilied the fact that the 

series (2) with coefficients (3) converges and 

represents f(x). It was only as late as 1820 that 

A. L. Cauchy in France first noted that to 

treat a series properly, one must examine its 

convergence. 

In the 19th Century, the concept of ffunc- 
tions, which had been taken in the sense of 
"analytic expressions," came to be detined by 
the correspondence relation. Cauchy clarified 
the ideas of ‘limit and continuity, tdifferentia- 
bility and ‘integrability. He showed that a 
function that is continuous in a bounded 
closed interval is integrable in that interval. 
But his proof was not rigorous, as he lacked 
the notion of tuniform continuity. In his 1854 
paper on the trigonometric series, B. Riemann 
in Germany considered the integrability of 
functions that might be discontinuous and 
introduced the concept of what we now call 
the *Riemann integral. 

The theory of tsets, initiated by G. Cantor in 
Germany in his paper of 1874, revolutionized 
analysis. R. Baire, E. Borel, and H. Lebesgue 
in France contributed to the establishment of 
analysis on the basis of set theory. Baire made 
a classification of discontinuous functions. 
Generalizing his results, Lebesgue gave a 
definition of analytic expressions, thus clarify- 
ing the term that had been used vaguely since 
the time of Euler. Lebesgue also tried to define 
the concepts of the integral of a function, the 
length of a curve, and the area of a surface 
from the most general viewpoint. In generaliz- 
ing the notion of *tmeasure introduced by 
Borel, he established in 1902 the theory of 
*Lebesgue measure with which he laid the 
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foundations of the theory of Lebesgue inte- 
grals. The introduction of this theory gave 

to the theory of Fourier series a new turn in 
the direction of functional analysis. Measure 
theory was also employed by A. N. Kolmo- 
gorov of the Soviet Union to lay a solid foun- 
dation for tprobability theory in 1933. 

The study of functions of a complex variable 
was originated by Cauchy in the first half of 
the 19th Century. He began his research by 
introducing the notion of *monogenic func- 
tions"; a function which is monogenic at every 
point of a domain is what we now call a tholo- 
morphic function. He established *Cauchy's 
integral theorem and integral formulas for 
these functions, and deduced from these theo- 
rems the *residue theorem for functions with 
poles. Making use of the integral formula, 
Cauchy proved that a function that is 
holomorphic at a point a can be expanded in 
a power series of the form Y? S a,(z =a)” in a 
neighborhood of this point. 

Riemann considered a complex variable w 
as a function of another complex variable z 
when dw/dz is independent of the value of 
the differential dz. This amounts to the same 
thing as a “monogenic function” of Cauchy. 
Riemann’s mapping theorem became a model 
for subsequent developments. Riemann intro- 
duced the concept of tRiemann surfaces in 
order to tuniformize multivalued functions. 
This important idea was basic to the progress 
of analysis and ttopology in the 20th Century. 

K. Weierstrass, who was a contemporary of 
Riemann, developed the theory of functions 
of a complex variable from a purely analytic 
viewpoint. He delined an element of a function 
to be a power series Zog: of of z -a, 
representing a holomorphic function in the 
interior of its tcircle of convergence, and de- 
fined an analytic function to be an aggregate 
of such elements that are derived from one of 
them by means of tanalytic continuations, 
along all curves having the point a as the 
initial point. 

Riemann and Weierstrass constructed their 
theory to complete the theory of felliptic func- 
tions and tAbelian functions that was initiated 
by N. H. Abel and C. G. J. Jacobi. Their re- 
sults were a high point of 19th-century class- 
ical mathematics. H. Poincaré in France built 
upon their work. Another high point in the 
theory of functions of a complex variable was 
reached when J. Hadamard and C. de La Vallée 
Poussin in France made use of it to prove the 
*prime number theorem in 1896. 

Weierstrass also initiated the study of func- 
tions of several complex variables and was 
succeeded by Poincaré, P. Cousin, and E. 
Picard in France. They tried to extend the 
theory of functions of one complex variable to 
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that of many variables. F. Hartogs, however, 
discovered a phenomenon quite different from 
the case of one variable. Cousin posed in 1895 
the problem of constructing a function of 
several complex variables that has assigned 
poles. This problem was pursued by A. Weil in 
France and solved by K. Oka in Japan in 
1936. Also, the problem of characterizing 
tdomains of holomorphy was investigated by 
E. E. Levi in Italy, H. Cartan in France, and 
P. Thullen and K. Stein in Germany and was 
finally solved by Oka in 1953. H. Cartan and 
J.-P. Serre reformulated these results in terms 
of tcohomology with coefficients in sheaves. 
This considerably influenced the formulations 
of mathematics thereafter. These results were 
further generalized to tanalytic space by H. 
Grauert, R. Remmert, and Stein in Germany. 
The theory of *complex manifolds, which are 
generalizations of Riemann surfaces to several 
variables, was initiated by W. V. D. Hodge in 
England and K. Kodaira in Japan and con- 
tinued by F. Hirzebruch in Germany, M. F. 
Atiyah in England, and I. M. Singer in the 
United States. 

Differential calculus gives a general method 
of finding extreme values of a given function. 
Likewise, in order to find a function that pro- 
duces an extremal of the given ffunctional, the 
calculus of variations was created in the 18th 
Century. For example, Euler considered the 
problem of finding a particular function y(x) 
that renders the functional [7 F(x, y. y') dx 
an extreme value among all those functions 
y(X) for which the plane curve y 2 y(x) passes 
through two given points (a, A) and (b, B) of the 
plane; he showed that this y(x) must satisfy 
the differential equation dP, /dx = F, = 0 (1744). 
Lagrange, W. R. Hamilton, and others devel- 
oped this result into a general tvariational 
principle that governs not only classical tme- 
chanics but also *quantum mechanics. 

From research on the continuity or differen- 
tiability of the functional with respect to y 
emerged the idea of considering a function as a 
“point” in a !function space. This gave rise to 
‘functional analysis, a branch of analysis that 
treats functions as elements of certain spaces 
and utilizes the methods of algebra and topol- 
ogy. The first result in this regard was the 
theory of fintegral equations of V. Volterra in 
Italy and E. I. Fredholm in Sweden in the 
beginning of the 20th Century. Fredholm's 
work was motivated by his desire to solve the 
tDirichlet problem, the solution of which had 
been used by Riemann in the proof of his 
mapping theorem, etc. However, Riemann's 
own proof of existence of the solution, called 
the *Dirichlet principle, was not rigorous, and 
attempts to save the proof provided one of the 
central problems of analysis for some time. 
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Fredholm's solutions were different from Rie- 
mann% but D. Hilbert in Germany was able 
to justify Riemann's original proof. Hilbert's 
proof was later simplitied and generalized by 
R. Courant, H. Weyl, and others. Hilbert also 
introduced the ‘function spaces /, and L, to 
study the 'eigenvalue problem of integral 
equations with tsymmetric kernels. Later J. 
von Neumann in Germany established tspec- 
tral theory in abstract tHilbert spaces and 
applied it to the mathematical foundations 
of quantum mechanics (1929). S. Banach in 
Poland created the theory of "mear operators 
in Banach spaces (1932). This theory was 
further generalized to that of linear topolog- 
ical spaces and was applied to the theory of 
distributions. 

In their study of partial differential equa- 
tions and Fourier analysis, Hadamard, J. 
Leray, S. L. Sobolev, T. Carleman, and many 
others had to extend the notion of functions; 
they also enlarged the notion of derivatives. L. 
Schwartz in France introduced tdistributions 
and defined derivatives in the sense of distribu- 
tions to unify these generalizations (1945). M. 
Sato in Japan defined more general generalized 
functions, called thyperfunctions (1958). It has 
become evident that both distributions and 
hyperfunctions have provided the most power- 
ful tools in recent research in the general 
theory of tpartial differential equations, to 
which L. Hormander in Sweden has made 
outstanding contributions. 
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A. Holomorphic Functions 


As in the case of tholomorphic functions of 
one complex variable, the definition of holo- 
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morphic functions can be given in two ways: 
the first definition utihzes differentiability, 
following the approach of B. Riemann; and 
the second method utilizes the notion of power 
series expansion as developed by K. Weier- 
strass. In this article, N = (0, 1,2,. j. 


B. Power Series 


Let z be an n-tuple of complex variables 
2153552, and C — (6, ess Belt à point of C^, 

An infinite sum P of monomials oz c} = 
d, as 7 C1) (7 e) (E (s, ss, Kn 
N”), where a, € C, is called a power series 

with center c and coefficients a,. If, for a bi- 
jection o of N onto N”, the simple series 

È pen lapp hZ = c)"?|s convergent at z = 2°, 
we say that P is absolutely convergent at z. 
Its sum at z° denoted by X a,(z° — c, is de- 
fined as the sum Xa, (z? = c)*?), which is 
independent of the choice of o If P is uni- 
formly bounded at z?, then P is absolutely 
convergent at every point of the open polydisk 
S — (z|Iz;— ej «Iz? — cl j 5 L ..., n). Further- 
more, in this case P converges absolutely and 
uniformly on every compact set in S (N. H. 
Abel). 

The convergence domain of a power series P 
is the set D of points z? such that P is ab- 
solutely convergent at every point in a neigh- 
borhood of z°, The interior of the set B of 
points at which the infinite sum P is uniformly 
bounded is equal to D. A (complete) Reinhardt 
domain with center c is a domain D in C" such 
that whenever D contains Zo. the domain D 
also contains the torus {z |z- c] - |z? ci 
1, ,n} (the closed polydisk {z| |zi—cj| «|z? — 
cj, j= 1l... nj). If the convergence domain D 
of the power series P is not empty, it is a com- 
plete Reinhardt domain and is also logarithmi- 
cally convex; that is, the set D— Ujtzlz- ej) is 
mapped onto a convex domain in R" by the 
mapping z;log|z;— cj| j= 1, . . ., n). The set D 
of points at which P is absolutely convergent 
is, in general, greater than D, and it is possible 
that f) contains exterior points of D. A thorn of 
D is the set of exterior points of D contained in 
D that are located on the planes {z DE Cj g= 
1,. , n). An n-tuple r e R", is called a set of 
associated convergence radii if P is absolutely 
convergent at every point of {z| |z;— cj < 
r,j=1,...,n}but not of {z| Iz, ^ cil» rj 
1,, nj. An n-tuple of associated conver- 
gence radii may not be uniquely determined, 
but it satisties 


lim sup (art = 1, 
|k|> t oo 


(E. Lemaire). 
Let f be a complex-valued function defined 
in a neighborhood of z° € C". If there exists a 


convergent power series P with center z? such 
that at every point of a neighborhood of z° the 
value off’ and the sum of P coincide, then f is 
called analytic at z? in the sense of Weierstrass, 
and P is the Taylor expansion of f at z?. 


C. Differentiability 


Let f be a complex-valued function detined in 
a neighborhood of z? e C". [f in a neighbor- 
hood of z? we have 


f(z) — f(z9) e gute: —z2)--... o (z,— z2) +8, 


with «,,...,4,€C and 
lim ce, — zil +... +|Zn— Za |)=0, (1) 
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then we say that f is (totally) differentiable at 
z?. The function f is then continuous at z°, 
and the partial derivatives éf/éz; (j = 1, , n) 
exist. Furthermore, the Cauchy-Riemann 
differential equations 0f/0z;—0 (j= 1, , n) 
hold, where ¢f/0z;=(1/2)(0f/0x;—iéf/éy,) and 
ff02,— (1/2)(0f]0x,-- idf/ay,) with z,= x, + iy, 
We say that f is holomorphic at z° in the sense 
of Riemann if f is differentiable at every point 
in a neighborhood of z?. Analyticity in the 
sense of Weierstrass is equivalent to holomor- 
phy in the sense of Riemann. Furthermore, if 
the partial derivatives Of/éz, (| — 1, ...,m) exist 
at every point in a neighborhood of z?, then f 
is, without assuming continuity, proved to be 
holomorphic. Thus the holomorphy off in 
each variable z; implies the holomorphy off in 
zz(z,,. -)z) (Hartogs’s theorem of holomor- 
phy, 1906). 

A complex-valued function in a domain G c 
C" is called holomorphic or analytic in G if it 
is holomorphic at every point of G. Let H(G) 
be the ring of holomorphic functions in G. For 
f =u +ive N(G), uand satisfy in G the dif- 
ferential equations e^u(z, 2)/02;0%, = 0; that is, 





^u ĝu 
SE =0, 2 
Ox;0X, | OyjOy, (2) 
Ou Aru 





=0, j, k=1,...,n. 
Oxjy, Ox,0y; ` dece 
A tdistribution TED'(G) is called plurihar- 
monic in G if it satisfies C) in G. Then T is 
harmonic and hence a real analytic function 
by *Weyl's lemma. 

Let G; be a domain in the z-plane with 
piecewise smooth boundary C;. If f e H(G) (G 
= [Ti-, G;) is continuous on G, then 


j= 
EX i 10) 
Qni Ja, Le (6 72)... 7 2) 


f(z.zeG, 


pa bel 0.246 (3) 
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(Cauchy’s integral representation). Thus if n > 2, 
then f is determined by its values only on the 
proper subset C = Ci x x C, of @G, which is 
called the skeleton (or determining set) of G. 
For (pluriharmonic) functions of several com- 
plex variables, the boundary value problem, 
not necessarily solvable in its classical form, is 
not so effective as the Dirichlet problem in one 
complex variable. 

As in the case of one variable, the * Laurent 
expansion is valid for every holomorphic 
function in a domain of the form G = It, 
where the G; are circular annuli c C. Suppose 
that we are given f, € H(G,) and f, € H(G;), 
where G,, G, are domains in C" such that G N 
G, is nonempty and connected. If f, = f; on 
{z|}z,-z?|<r,y=y,j=1,...,n}, where z°= 
x? + iy? € Gi N G,, then there exists a unique 
f € H(G, U G;) such that f|G; — f, and f|G,= f; 
(theorem of identity. Thus tanalytic continu- 
ation proceeds as in the case of one variable. 
Similarly, some fundamental theorems in one 
variable, such as *Liouville’s theorem on ten- 
tire functions and the ‘maximum principle, 
hold also for several variables. However, there 
are some properties that reveal the differences 
between the cases of one and several variables. 
For instance, the set of zeros of a holomorphic 
function (— 23 Analytic Spaces B) has no 
isolated point for n z 2. The investigation of 
these remarkable differences is one of the 
purposes of the theory of analytic functions of 
several complex variables. 


D. Shilov Boundaries 


While the maximum principle holds for a 
holomorphic function in a domain G, the set of 
points where the maximum is attained may be 
a proper subset § of 6G. For instance, if G is 
the product of annuli as before, then the skele- 
ton of G can be taken as S. In connection with 
the theory of tnormed rings, G, E. Shilov 
proved that there exists a unique smallest 
member $, (called the Shilov boundary of G) 

in the family of closed subsets S such that 

sup (If(z)l|ze$] =sup{|f(2)||z€G} for every 
f € H(G) continuous on G. The structure of 

Sp is investigated in detail together with 

the pseudoconvexity of G connected with it. 
Applying tPerron’s method for the Dirichlet 
problem to *plurisubharmonic functions and 
the Shilov boundary, H. J. Bremermann solved 
one type of boundary problem. 


E. Local Theory 


Let f and g be two functions defined in neigh- 
borhoods of a set Sc C". If f= g in a neigh- 
borhood of S, then f and g are called equiva- 
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lent with respect to S. The germ off on S, 
denoted by fs, is the equivalence class off: A 
germ of a bolomorpbic function on S is the 
germ on S of a holomorphic function defined 
in a neighborhood of S, and H(S) denotes the 
ring of germs of holomorphic functions on S. 
Given a point 0 in C", H(0) = H( {0}) is iso- 
morphic to the ring H, of convergent power 
series at O, Le, the power series that are ab- 
solutely convergent in some neighborhoods of 
0. For every nonzero function fe H(O), there 
exists a system of coordinates (z,, , Z,,) cen- 
tered at O such that f(0, , 0, z,) z 0 for every 
Z, # 0 in a neighborhood of z, = 0. In a neigh- 
borhood of 0, f is then equal to the product of 
an invertible element of H, and a distinguished 
pseudopolynomial 


P(z,) =z t a,(z;, feet os z,-.)z] Tt 


H 2,-1)€ H,., [z,]. 


with a,(0,...,0)=...=a,(0, ...,0) 20, and P(z,) 
is uniquely determined by f and the coordi- 
nates z,,, z, (Weierstrass's preparation 
theorem). It follows from this that H, is an n- 
dimensional tregular local ring. Considering 
H(0) as the tinductive limit of flocally convex 
rings H(U), where U ranges over a base for a 
neighborhood system of 0, H. Cartan proved 
the preparation theorem in a more precise 
form in which the association f 4d; is con- 
tinuous with respect to the supremum norm. 
Based on a deep consideration of this situ- 
ation, K. Oka proved a theorem of funda- 
mental importance: The fsheaf Co, defined by 
(en, = H(Z) (ze C") is coherent. 


Tas sos 


F. Domains of Holomorphy 


Given a domain G C G” for nz 2 it may be 
that there exists a domain G' strictly greater 
than G such that all the functions that are 
holomorphic in G extend to holomorphic 
functions in G’. For instance, let S = S’ x o, 
where S’ and g are open polydisks in (z,,. , 
Z, ,)-space and z,-space, respectively, and 

let T c C" be an open set. If there exists an 
open set U (# (f) c S' such that (U x e)U(S' x 
0c) C T and if S N T is connected, then all the 
functions that are holomorphic in T extend 
uniquely to holomorphic functions in SU T 
(Hartogs’s continuation theorem). In particular, 
if A is an tanalytic set in a domain G c C" with 
dim A n 2, then all the functions that are 
holomorphic in G-A extend uniquely to 
holomorphic functions in G. Furthermore, if A 
is an analytic set in G with A 4G, then every 
f € H(G — A) that is locally bounded at the 
points of A extends uniquely to a holomorphic 
function in G (Riemann% continuation theorem 
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for n> 2). The domain G, of holomorphy for f 
is defined to be the maximal domain to which 
f may be continued analytically. A domain G 
is called a domain of holomorphy if G = G, for 
some Te H(G). However, G, is, in general, not 
a subdomain of C", 6, is, generally, a mani- 
fold spread over C”; i.e., G, is a connected n- 
dimensional 'complex analytic manifold with a 
holomorphic mapping q: 6,2 C of maximum 
Jacobian rank (o is then an open mapping). 
The same is true for the common existence 
domain of functions in a subfamily of H(G). 
The common existence domain G of all the 
functions in H(G) is called the envelope of 
holomorphy. A  holomorphicall; complete 
domain is a domain G such that G = C. These 
notions carry over to the case where G is a 
manifold spread over C". The (general) Levi 
problem of determining the conditions for a 
given domain to be holomorphically complete 
is fundamental to the theory of analytic func- 
tions of several complex variables (— Section 
I). In connection with this problem, various 
notions of pseudoconvexity of holomorphi- 
cally complete domains are defined. 


G. Pseudoconvexity 


An upper semicontinuous real-valued function 
u(—o <u< --oo) in a domain GCC" is said 
to be plurisubharmonic if for every z? e G and 
every a € Ç” the function u(z? + ta) of t is tsub- 
harmonic (including the constant —oo) in all 
the connected components of {t z"-- tge G}. A 
domain G is said to be pseudoconvex (or d- 
pseudoconvex) if u = = logd, is plurisubhar- 
monic in G, where d,(Z) is the distance from 

Zz eG to 0G with respect to any norm in C". 
Every connected component of the interior of 
the intersection of a family of pseudoconvex 
domains is pseudoconvex, and the union of an 
increasing sequence of pseudoconvex domains 
is pseudoconvex. Suppose that we are given a 
domain G and a function y of class C? in a 
neighborhood of G such that G = (z|u(z) < 0} 
and, for some ¢>0, 3, (0^u/0202,)ajd, > elal? 
for every ae C". Then the domain G is said to 
be strongly pseudoconvex. Strong pseudo- 
convexity implies pseudoconvexity. Every 
pseudoconvex domain is exhausted by an 
increasing sequence of strongly pseudoconvex 
domains. An open set Pc C" is called an ana- 
lytic polyhedron if P= {z| |y,(2)) < ,a— 

1,, N}, x,€ H(P) (a= 1, ... N). Then every 
connected component of P is pseudoconvex. A 
Weil domain is a connected and bounded 
analytic polyhedron P defined by y,(a = 

1, , N) with N Z n, such that for every k 

(1 € K& n) the intersection of the hypersurfaces 
Ix] 1 (1 <i<k) is of dimension «2n — K. 
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H. Holomorphic Convexity 


A domain G c C" is called holomorphically 
convex if for every compact set K CG, K = 
Tue)? IE) <supyex |f(o)]} (the holomor- 
phic hull of K) is a compact set contained in 
G. (For a domain G contained in an tanalytic 
set we can similarly define holomorphic con- 
vexity of G.) Every connected component of 
the intersection of a family of holomorphically 
convex domains is holomorphically convex, 
and a holomorphically convex domain is 
exhausted by an increasing sequence of Weil 
domains. Holomorphic completeness implies 
holomorphic convexity. The converse is true 
for domains in C”, If G is holomorphically 
convex, then for every point ( of Q( there 
exists an f € H(G) such that f is not locally 


bounded at ¢ (H. Cartan and P. Thullen, 1932). 


Hence a holomorphically convex domain is a 
domain of holomorphy. Thus a domain in C 
is holomorphically convex if and only if it is a 
domain of holomorphy. (The same is true for 
unramified covering domains over C" (Oka, 
1953).) The union of an increasing sequence of 
domains of holomorphy is a domain of holo- 
morphy (H. Behnke and K. Stein [2], 1939). 
Suppose that we are given a domain G and 
domains S,, T, (a = 1,2,) such that $, U 7” c 
G and supr, |f] - super |f] for every fe H(G). 
Suppose also that $} = lim §, is bounded. We 
Say that the continuity principle holds in G if 
Thc G (Tp = lim T,) implies SoG G. The con- 
tinuity principle holds in a domain of holo- 
morphy (Hartogs’s theorem of continuity). 
This implies that if G is a bounded domain 

C €" (n > 2) with connected boundary 0G, 
then every function holomorphic in a neigh- 
borhood of QG extends to a holomorphic 
function in G (Hartogs-Osgood theorem). In 
particular, for n > 2, the set of singular points 
of a holomorphic function has no isolated 





point. A domain is pseudoconvex if the con- 
tinuity principle holds there. Hence a domain 
of holomorphy is pseudoconvex. 


I. The Levi Problem 


Let G be a domain in C" and z?&0G. If there 
exists an open neighborhood U of z? such that 
every connected component of G N Ų isa 
domain of holomorphy, then G is called Car- 
tan pseudoconvex at z?. On the other hand, if 
every 1-dimensional analytic set that has z? as 
an ordinary point contains points not belong- 
ing to G U {z°} in the neighborhoods of z, 
then G is called Levi pseudoconvex at 7° Fyr- 
thermore, G is called locally Cartan (Levi) 
pseudoconvex if G is Cartan (Levi) pseudo- 
convex at every point of 0G. Every domain of 
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holomorphy is locally Cartan pseudoconvex. If 
G is pseudoconvex and there exists a neighbor- 
hood U of z? such that GN U — (z|e(z) «0], 
where $9 € C! (U), then G is Levi pseudoconvex 
ate”, 

The (proper) Levi problem of whether every 
pseudoconvex domain is a domain of holo- 
morphy was proposed by E. E. Levi (1911). 
After unsuccessful efforts by various mathe- 
maticians to solve the problem, 1t was affirma- 
tively solved by Oka (1942 for n = 2 and 1953 
for manifolds spread over C" for n > 2), H. 
Bremermann, and F. Norguet. The problem 
was solved also by H. Grauert [ 14] (1958) in a 
more general form (- Section L) using results 
on linear topological spaces and by L. Hór- 
mander [ 1 8] (1965) using methods of the 
theory of partial differential equations. A 
fundamental step in Oka's solution is his 
gluing theorem: Let G be a bounded domain 
c C". If every connected component of G, = 
{z|x,>a}NG and G; 2 (z|x, <b}NG (a «b) 
is a domain of holomorphy, then G is a do- 
main of holomorphy. Indeed, by virtue of 
the Behnke-Stein theorem and the fact that 
every pseudoconvex domain is the union of an 
increasing sequence of bounded locally Cartan 
pseudoconvex domains, it suffices to solve the 
Levi problem in the case of a bounded locally 
Cartan pseudoconvex domain. The Levi prob- 
lem in this case is solved by the gluing theo- 
rem. Various integral representations of holo- 
morphic functions are known besides the 
Cauchy representation. The Bergmann-Weil 
integral representation in a Weil domain was 
used as an important means of solving the 
Levi problem. 


J. Holomorphic Mappings 


Holomorphic functions with values in a tquasi- 
complete tlocally convex complex vector space 
E have also been investigated. The classical 
theory described above has been generalized, 
to some extent, to this case. In this way, many 
applications of the theory have been dis- 
covered. An E-valued function in a domain G 
C C" is holomorphic if and only if the map- 
ping uof: GC is holomorphic for every çon- 
tinuous linear form yon E. By this theorem, 
most problems concerning E-valued holo- 
morphic functions can be reduced to those of 
ordinary holomorphic functions. Note that the 
vector space H(G) of ordinary holomorphic 
functions in G is a *Fréchet space. The spaces 
C? and complex *Banach spaces belong to the 
above category of E. A. C?-valued holomor- 
phic function in a domain G c C" is called a 
holomorphic (or analytic) mapping of G into 
C?. An isomorphism in the category of do- 
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mains G c C" and holomorphic mappings is 
called an analytic isomorphism (or biholomor- 
phic mapping). An automorphism in the cate- 
gory is called an analytic (or holomorphic) 
automorphism. With every domain G c C 

is associated the sheaf (/; of germs of holo- 
morphic functions over G. Thus we have the 
notion of a tringed space (G, 0,). A complex 
analytic manifold can be defined as a (Haus- 
dorff) ringed space that is locally isomorphic 
to some (G, 0,). 

A meromorphic function in G is a function 
that is locally the quotient of two holomor- 
phic functions with denominator #0. It may 
be defined more rigorously as a meromor- 
phic mapping of G into P,(C) (- 23 Analytic 
Spaces D). 


K. The Cousin Problems 


The Cousin problems are those of constructing 
meromorphic functions with given Zeros or 
poles. In terms of sheaves the problems are 
stated as follows: Let Jf be the sheaf of germs 
of meromorphic functions over a domain G c 
C". The first Cousin problem asks whether 

the mapping T(G, X) V (G, A) induced 

by the exact sequence 030, 4,9 2.0 
(Po = Hg/Og)is surjective, where I'(G, F) is 
the module of tsections of ¥ over G (- 383 
Sheaves C). Let Zei be the sheaf of multiplica- 
tive groups of germs of meromorphic functions 
not identically 0 and (,* be the subsheaf of 
A* formed by germs of nonzero holomorphic 
functions. The second Cousin problem asks 
whether the mapping T(G, %#*)—> P(G, Dg) 
(Dg = HG*/Og*)is surjective. P. Cousin (1895) 
solved the first problem for G = C" or [T;., G; 
and the second problem for G — C", Oka (1935) 
proved that the first problem is solvable in 
every domain of holomorphy. In solving the 
second problem in a domain of holomorphy, 
Oka established the notion of ‘fiber bundles 
and proved that the problem for any domain 
is reduced to holomorphic triviality of a 
holomorphic principal fiber bundle over the 
domain and that holomorphic triviality is 
equivalent to topological triviality when the 
domain is of holomorphy (Oka's principle). 
Using the solutions of the Cousin problems, 
Oka proved his gluing theorem, described in 
Section 1. 


L. Stein Manifolds 


Abstracting certain important properties of a 
domain of holomorphy, Stein [33] introduced 
the following category of complex analytic 
manifolds (X, Oy): (1) X is paracompact (i.e., 
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each connected component of X has a count- 
able open base). (2) Functions in F(X, Oy) 
separate the points of X. (3) For every point 

X € X there exists a system of local coordi- 
nates around x that is formed by functions in 
I'(X, Cy). (4) X is holomorphically convex. 
(X, 0,) is then called a Stein manifold. It was 
later discovered by Grauert [ 12] that con- 
ditions (2) and (4) imply (1) and (3). 

Applying the theory of teohomology with 
coefficients in sheaves, H. Cartan and J.-P. 
Serre obtained for an 'analytic coherent sheaf 
F on a Stein manifold X, the following funda- 
mental theorems of Stein manifolds. Theorem 
A: H*(X, F) generates the stalk Fy (as an Oy- 
module) at every point x of X. Theorem B: 
HX, F) =0 for all q> 0 [7, 30]. Conversely, 
for a complex analytic manifold X, if for every 
analytic coherent sheaf $ of ideals defined by 
a O-dimensional analytic set in X De, a dis- 
crete subset of X) H! (X, .4) = 0, then X is a 
Stein manifold. Furthermore, if TUN. 0y) = 
I ( Y, 0y) for a Stein manifold Y (as in the case 
where X c CL then the fundamental theorem 
A for every coherent sheaf of ideals implies 
that X is a Stein manifold (I. Wakabayashi). 

Due to the fundamental theorems, most 
results on domains of holomorphy hold un- 
changed for Stein manifolds. For instance, the 
first Cousin problem is always solvable. The 
second Cousin problem is solvable if and only 
if H?(X, Z)=O. An n-dimensional Stein mani- 
fold can be realized as a (ramitied) covering 
domain of holomorphy over C". Furthermore, 
some theorems on differentiable manifolds 
have analogs on Stein manifolds. For instance, 
the cohomology groups of the complex of 
holomorphic differential forms over a Stein 
manifold X are isomorphic to the cohomol- 
ogy groups H*(X, C) (analog of tde Rham's 
theorem). Every n-dimensional Stein manifold 
X is realized as a closed complex analytic 
submanifold in C?"*!: that is, there exists an 
injective *proper holomorphic mapping f: 

X 2 C?"*! with df #0. Consider all the holo- 
morphic tprinciple tiber bundles over a Stein 
manifold X whose fibers are isomorphic to a 
complex Lie group G. The analytic isomor- 
phism classes of the bundles and the elements 
in H!(X, CH (where G^ is the sheaf of germs 
of holomorphic mappings of X into G) are in 
one-to-one correspondence. The same is true 
for the topological isomorphism classes of the 
bundles and the elements in H'(X, G^) (where 
G“ is the sheaf of germs of continuous map- 
pings of X into G). The mapping H! (X, G*)— 
H'(X, G') induced by the canonical injection 
G^ > G is bijective (Grauert [ 13]). Every rela- 
tively compact domain in a complex analytic 
manifold is holomorphically convex if it is 
strongly pseudoconvex (Grauert [14]). Hence 
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such a domain is a Stein manifold. It follows 
from this that every real analytic manifold 
with countable base for open sets is realized as 
a closed real analytic submanifold of some R”. 

The notion of Stein manifolds is generalized 
to that of weakly 1-complete manifolds: a 
complex manifold X is called a weakly 1- 
complete manifold if there exists a plurisubhar- 
monic function y of class C® on X, such that 
for any ceR, X,= {xe X |u(x)<c} is relatively 
compact. The family of such manifolds in- 
cludes compact complex manifolds too. For a 
weakly 1-complete manifold also, vanishing 
theorems of cohomology have been estab- 
lished by H. Hironaka, S. Nakano [26], and H. 
Kazama [2] (- 232 Kahler Manifolds D). 

The theory of entire functions of two vari- 
ables has been developed from a new view- 
point established by T. Nishino. An entire 
function f of two variables defines a family of 
Riemann surfaces | f 2 c c eC} on C?, and 
investigations of the structure of such a family 
play an important role. For instance the fol- 
lowing is proved in this way: If every irreduc- 
ible component off- c (CE C) is biholomor- 
phic to C!, then there exists an entire function 
g such that ( f, y): C? 2C? is a biholomorphic 
mapping (Nishino [28]). In this theorem, the 
analyticity of g is obtained from the fact that 
C? is a Stein manifold. 


M. Continuation of Analytic Sets 


The application of the theory of cohomology 
with coefficients in sheaves is not restricted 
to problems concerning Stein manifolds. 
Given Gp = {z||z;|<1,1<j<n} (nz 3), Gi = 
{z||z1|<e|z|<1,2<j<n}, and G™=G,U 
(Go —{2|2.=...=Z,_,=0}) (3<m<n), we have 
H°(G™, Ogm) = 0 (1 <p «m — 2) (Scheja's 
theorem [32]). Let F be a coherent analytic 
sheaf over a domain G c C". If, for every point 
z of an analytic set ASG, ¥,= {0} or p< 
n—dim, A-2 hd, F (where hd, ¥ is the 
thomological dimension of the € ,-module 
F), then it follows from Scheja’s theorem that 
the mapping H?(G, 4) H"(G = A, F) induced 
by the canonical injection G — A G is bijec- 
tive. This generalizes Hartogs's continuation 
theorem for holomorphic functions, which 
corresponds to the case p= 0. 

Besides the continuation of holomorphic 
functions, we Can consider the continuation of 
analytic sets. Let A be an analytic set in a 
domain G c C" and S an analytic setin A. 
A point ze G is said to be regular (essentially 
singular) with respect to § if the closure § of S 
in G is (is not) analytic at z. If dim A « dim, S 
for every point z € S, then S is analytic in G. If 
S is purely d-dimensional and S is analytic at 
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a point of a d-dimensional irreducible com- 
ponent A' of A, then S is analytic at every 
point of A’ that is not located in any other ir- 
reducible component of A. Furthermore, if 
dim A < dim S and S is purely d-dimensional, 
then the following hold: (1) The set E of essen- 
tial singularities of S is, if not empty, a purely 
d-dimensional analytic set in G, formed by 
irreducible components of A. (2) If every 
irreducible component of A contains points 
of E not located in any other irreducible com- 
ponent of A, then A CE, and A is, if not 
empty, purely d-dimensional. (3) If every d- 
dimensional irreducible component of A con- 
tains points that are regular with respect to S, 
then S is a purely d-dimensional analytic set 
in G (Thullen, Remmert, and Stein). By these 
results it is possible to give a proof for 'Chow's 
theorem that every analytic set in P,(C) is 
algebraic. 

The continuation of holomorphic functions 
is related to the continuation of their graphs. 
W. Rothstein investigated the continuation of 
analytic sets to obtain the following analog 
to Hartogs's theorem of continuity: If G = 
G, UG, G, - (z|]z| «1/2 E lef < 1}, G= 
{2| Izi|« 1, 12«5-. [z^ < 1}, and G= 
{z||z,|<1, Ei lzjl? <1} (the envelope of 
holomorphy of G) with n> 3, then every purely 
(n — D)-dimensional analytic set A in G extends 
to an analytic set in G; that is, there exists a 
purely (n = D-dimensional analytic set Ain 
G such that A = ÄN G. K. Kasahara and H. 
Fujimoto generalized this theorem to the case 
of analytic spaces. 


N. Nevanlinna Theory for Several Complex 
Variables 


The Nevanlinna theory investigates holomor- 
phic mappings between complex manifolds. 
In function theory of one variable, for a holo- 
morphic mapping f :C>P!(C) from C into 
the 1-dimensional tcomplex projective space, 
the famous +Picard theorem states that if f 
omits three values, then f must be constant. 
R. Nevanlinna developed the quantitative 
theory of value distributions off: L. Ahlfors 
[1], introduced the geometric approach, and 
enunciated the principle that the negative 
curvature of the image manifold restricts a 
holomorphic mapping. In higher-dimensional 
situations, this principle has been realized in 
many cases. The Nevanlinna theory for a 
holomorphic mapping f: C5 P"(C) with re- 
spect to thyperplanes was established by Ahl- 
fors and H. Weyl and J. Weyl [34]. Holomor- 
phic mappings into projective spaces have 
been studied in detail by W. Stoll, H. Fujimoto, 
and M. Green. Following the work of S. S. 
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Chern [9] of around 1970, the equidimen- 
sional case was investigated (P. A. Grifliths, K. 
Kodaira, S. Kobayashi and T. Ochiai). J. A. 
Carlson and Grifliths [6] succeeded in obtain- 
ing a tdefect relation for equidimensional 
holomorphic mappings and thypersurfaces of 
the image manifold. This yields the following 
Picard-type theorem (after the formulation by 
F. Sakai, [nventiones Math., 16 (1974)): Let X 
be a tprojective algebraic manifold of dimen- 
sion n and f: ("5 X a holomorphic mapping. 
Let D be a hypersurface of X. We take a 
tdesingularization t: X*+X so that the in- 
verse image D* = q^! (D) has at most normal 
crossings as singularities. We denote by K* a 
tcanonical divisor of X*. We assume that 

dim H°(X*, O(m(K* +D*))) 0 


n 


um T 
If f omits D, then the Jacobian off vanishes 
everywhere. As a corollary, it follows that the 
tuniversal covering manifold of X -D cannot 
be C", When X = P"(C) and P is a nonsingular 
hypersurface of degree d, the above assump- 
tion is satisfied if dz n + 2. In the nonequidi- 
mensional case, Ochiai [29] (with supple- 
mentary works by M. Green, Y. Kawamata, 
and P. Wong) verified the following asser- 
tion, which was first stated by A. Bloch with 
rough arguments: Let X be a projective alge- 
braic manifold of dimension n, and suppose 
dim H'(X,@y) >n; then the image of a holo- 
morphic mapping f: CX is contained in a 
proper subvariety of X (— 124 Distributions 
of Values of Functions of a Complex Variable; 
272 Meromorphic Functions). 


0. Hyperbolic Manifolds 


Every complex analytic space X has two jn- 
variant tpseudodistances: the Carathéodory 
pseudodistance c, and the Kobayashi pseudo- 
distance d,, both of which generalize the Poin- 
caré distance p of the unit disk D = {|z| <1}. 
These pseudodistances can be defined by the 
property that dy is the largest pseudodistance 
among all pseudodistances à, on X for which 
all holomorphic mappings (X, ó,)—(D, p) are 
distance-decreasing, while cy is the smallest 
among all pseudodistances ó, for which all 
holomorphic mappings (D, p)2(X, 6) are 
distance-decreasing. Then cy < d,. If Y is an- 
other complex analytic space, every holomor- 
phic mapping f:X > Y is distance-decreasing 
with respect to either its Carathéodory or 
Kobayashi pseudodistance. This may be con- 
sidered to be a generalization of the Schwarz- 
Pick lemma. (The ‘Schwarz lemma is often 
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referred to by this name, since G. Pick also 
investigated the distance-decreasing property.) 
The Kobayashi pseudodistance d, can also 
be obtained by integrating an infinitesimal 
differential pseudometric called the Koba- 
yashi pseudometric in the same manner as the 
Riemannian distance is obtained from the 
Riemannian metric (H. L. Royden). 

If dy is a (complete) distance on X, then X is 
said to be (complete) hyperbolic. A Riemann 
surface is hyperbolic in this sense if and only 
if its universal covering is biholomorphic to 
the unit disk. If X is open in Y and if, for 
every pair of sequences of points {p,} and 
la. in X converging to distinct points of X, 
lim d ,(p,, q,) is positive, then X is said to be 
hyperbolically embedded in Y. The Riemann 
sphere CU 1 oo] minus three points is not 
only complete hyperbolic but also hyperboli- 
cally embedded in the Riemann sphere. Every 
holomorphic mapping of C into a hyperbolic 
complex analytic space is constant, while every 
holomorphic mapping of the punctured disk 
D* = {0<|z|< 1j into a hyperbolically embed- 
ded space X c Y extends to a holomorphic 
mapping of D into Y. Thus the classical little 
Picard theorem reduces to the statement that 
C = (0, 1} is hyperbolic, while the great Picard 
theorem reduces to showing that C — (0, 1} is 
hyperbolically embedded in the Riemann 
sphere. These classical theorems can be gen- 
eralized in two ways. If M is a tsymmetric 
bounded domain and Į is a discrete arithmetic 
group acting freely on M, then M/T is not 
only complete hyperbolic but also hyperboli- 
cally embedded in its Satake compactification 
(Kobayashi and Ochiai). If X is the comple- 
ment of 2n 4- 1 hyperplanes in general position 
in the complex projective space P"(C), then X 
is complete hyperbolic and hyperbolically 
embedded in P”(C) (a restatement of a re- 
sult going back to E. Borel, A. Bloch, and H. 
Cartan). 

Although there are some noncompact non- 
hyperbolic complex manifolds X for which 
every holomorphic mapping f: X is con- 
stant, a compact complex manifold X is hyper- 
bolic if and only if every holomorphic map- 
ping f: C X is constant (R. Brody). 

For the ?*Teichmüller space X = T, of com- 
pact Riemann surfaces of genus g, the Koba- 
yashi distance d, agrees with the Teichmüller 
distance which had been introduced before 
the complex structure of T, was defined (H. L. 
Royden). 

In the study of pseudoconvex domains, the 
*Bergman metric, the Carathéodory distance, 
and the Kobayashi distance serve as useful 
tools. Their behavior at the boundary has been 
studied extensively. 


107 


Given an n-dimensional complex mani- 
fold X, an invariant measure V, can be de- 
fined as the largest measure such that every 
holomorphic mapping f from the polydisk 
D" with invariant measure into X is measure- 
decreasing. For an algebraic manifold of gen- 
eral type, this measure is everywhere positive. 


P. Bounded Domains in C" 


For any bounded domain in C” there is natu- 
rally assigned a tKahler metric called the 
Bergman metric. Using the invariance of this 
metric, E. Cartan proved that all the Her- 
mitian symmetric spaces of noncompact type 
are realized as bounded domains. In view of 
the fact that some important period matrix 
domains (e.g., the 19-dimensional bounded 
symmetric domain of type IV, the Siegel upper 
space, etc.) are of this type, it is obviously of 
great significance in algebraic geometry to 
study discontinuous subgroups of the auto- 
morphism groups of such domains (Pyatetskii- 
Shapiro). 

On the other hand, it is almost impossible 
for general bounded domains to determine the 
explicit form of their Bergman metrics. But 
Cartan also investigated strongly pseudo- 
convex real hypersurfaces in C?, and he solved 
completely the (local) equivalence problem for 
them, introducing a definite type of *Cartan 
connections over them in a functorial way. 
Thereafter N. Tanaka (and for hypersurfaces 
also Chern and J. Moser) generalized this 
result to all pseudoconvex real submanifolds 
in higher-dimensional complex manifolds. 
One can further apply this result to the equi- 
valence problem of bounded domains with 
smooth strongly pseudoconvex boundaries, 
for C. Fefferman [ 10] proved by analyzing the 
boundary behavior of the Bergman metric that 
every biholomorphic mapping between two 
such domains is extended smoothly up to their 
boundaries (I. Naruki [27] gave an alternative 


proof). 


Q. History 


In connection with *Abelian functions, ana- 
lytic functions of several complex variables 
have been studied sporadically since the time 
of Riemann and Weierstrass (H. Poincaré, 
Cousin). A series of investigations by Hartogs 
([ 17] (1906), etc.) that revealed the distinctive 
properties of several complex variables ini- 
tiated a new epoch in complex analysis. Levi 
(1910— 1911) generalized Hartogs's results to 
the case of meromorphic functions, introduced 
the notion of pseudoconvexity, and proposed 
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the so-called Levi problem. After a lapse of 
time, many contributions to this new area of 
complex analysis have been made since 1920. 
The study by K. Reinhardt (1921) of analytic 
automorphisms was further developed by C. 
Carathéodory and Behnke. The !kernel func- 
tion introduced by S. Bochner and S. Berg- 
mann (1922) produced many remarkable 
results. In contrast with tPicard’s theorem 

in one variable, P. Fatou found a holomor- 
phic mapping f: C? 5 C? with nonvanishing 
Jacobian such that the image f(C?) has an 
exterior point. 

The theory of analytic functions of several 
complex variables has flourished since 1926. 
Behnke and Thullen in Münster, together with 
G. Julia and H. Cartan in Paris, were the most 
active investigators. The results on *normal 
families of analytic functions of several com- 
plex variables (Julia [20], 1926), the uniqueness 
theorem of holomorphic mappings (H. Cartan, 
1930), and a characterization of a domain of 
holomorphy by holomorphic convexity (Car- 
tan and Thullen, 1932) are their most remark- 
able achievements. Behnke and Thullen [3] 
systematized the results obtained since the 
discovery of the theory by providing a com- 
plete bibliography of articles up to 1934. 

The three major unsolved problems at that 
time-those of Cousin, Levi, and the approx- 
imation of holomorphic functions-were 
intensively studied by Oka from 1936, who has 
given complete solutions [30]. The investiga- 
tion of ideals of holomorphic functions by H. 
Cartan (1944), together with that of ideals with 
undetermined domains by Oka, has developed 
into the theory of coherent analytic sheaves. 
The notion of analytic spaces, first introduced 
by Behnke and Stein (1951), extended the field 
of investigation in the theory of analytic func- 
tions of several complex variables. The theory 
of cohomology with coefficients in sheaves has 
been effectively applied by H. Cartan and 
Serre (1951—1952). The introduction of the 
notion of Stein manifolds (195 1) came at the 
same time. Grauert's deep investigations since 
1955, together with those of Stein and Rem- 
mert, have contributed greatly to the develop- 
ment of the theory of analytic spaces. In the 
1960s, active investigations took place also in 
the United States [16]. The theory of auto- 
morphic functions of several complex variables 
has been developed by C. L. Siegel, [. Satake, 
and others in connection with the theory of 
numbers. Entire functions of two variables 
have been investigated from a new point of 
view by Nishino and others since 1968 (— last 
paragraph of Section N). The notion of a 
hyperbolic manifold introduced by Kobayashi 
(1967) enables us to obtain many results on 
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complex manifolds by means of the methods of 
differential geometry. The theory of automor- 
phisms of bounded pseudoconvex domains 
was developed extensively in the 1970s by 
Fefferman, Naruki, and others. The theory of 
an envelope of holomorphy has also been 
successfully applied to the theory of elemen- 
tary particles in physics. 


References 


[1] L. Ahlfors, An extension of Schwarz's 
lemma, Trans. Amer. Math. Soc., 43 (1938), 
359-364. 

[2] H. Benhke and K. Stein, Konvergenten- 
folgen von Regularitätsbereichen und der 
Meromorphiekonvexität, Math. Ann., 116 
(1939), 204-216. 

[3] H. Behnke and P. Thullen, Theorie der 
Funktionen mehrerer komplexer Veränderli- 
chen, Springer, 1934; second edition, 1970. 
[4] L. Bers, Introduction to several complex 
variables, Courant Institute, 1964. 

[5] S. Bochner and W. T. Martin, Several 
complex variables, Princeton Univ. Press, 
1948. 

[6] J. A. Carlson and P. A. Griffiths, A defect 
relation for equidimensional holomorphic 
mappings between algebraic varieties, Ann. 
Math., (2) 95 (1972), 557—584. 

[7] Sém. H. Cartan, Ecole Norm. Sup., 1951- 
1952. 

[8] H. Cartan, Oeuvres, Springer, 1979. 

[9] S. S. Chern, On holomorphic mappings of 
Hermitian manifolds of the same dimension, 
Amer. Math. Soc. Proc. Symposia in Pure 
Math., 11 (1968), 157-170. 

[ 10] C. Fefferman, The Bergman kernel and 
biholomorphic mappings of pseudoconvex 
domains, Inventiones Math., 26 (1974), 1—65. 
[ 11]B. A. Fuks, Special chapters in the theory 
of analytic functions of several complex vari- 
ables, Amer. Math. Soc. Transl. of Math. 
Monographs, 1965. 

[ 123 H. Grauert, Charakterisierung der 
holomorph-vollstándiger komplexen Raume, 
Math. Ann., 129 (1955), 233-259. 

[ 13] H. Grauert, Holomorphe Funktionen mit 
Werten in komplexen Lieschen Gruppen, 
Math. Ann., 133(1957), 450—472. 

[ 14] H. Grauert, On Levi's problem and the 
imbedding of real-analytic manifolds, Ann. 
Math., (2) 68 (1958), 460—472. 

[ 1 5] H. Grauert and R. Remmert, Theory of 
Stein spaces, Springer, 1979. 

[ 16] R. C. Gunning and H. Rossi, Analytic 
functions of several complex variables, 
Prentice-Hall, 1965. 

[ 17] F. Hartogs, Zur Theorie der analytischen 


108 


Funktionen mehrerer unabhängiger Ver- 
änderlichen, insbesondere über die Darstellung 
derselben durch Reihen, welche nach Potenzen 
einer Veränderlichen fortschreiten, Math. 
Ann., 62 (1906), 1-88. 

[18] L. Hórmander, L? estimates and exis- 
tence theorems for the à operator, Acta Math., 
113 (1965), 89-152. 

[ 193 L. Hormander, An introduction to com- 
plex analysis in several complex variables, Van 
Nostrand, 1966. 

[20] G. Julia, Sur les familles de fonctions 
analytiques de plusieurs variables, Acta Math., 
47 (1926), 53-115. 

[21] H. Kazama, Approximation theorem and 
application to Nakano's vanishing theorem for 
weakly 1-complete manifolds, Mem. Fac. Sci. 
Kyushu Univ., 27 (1973), 221—240. 

[22] S. Kobayashi, Hyperbolic manifolds and 
holomorphic mappings, Dekker, 1970. 

[23] S. Kobayashi, Intrinsic distances, mea- 
sures and geometric function theory, Bull. 
Amer. Math. Soc., 82 (1976), 357—416. 

[24] P. Lelong, Fonctions plurisoushar- 
moniques et formes différentielles positives, 
Cordon & Breach, 1968. 

[25] L. Nachbin, Holomorphic functions, 
domains of holomorphy and local properties, 
North-Holland, 1970. 

[26] S. Nakano, Vanishing theorems for 
weakly 1-complete manifolds, IL, Publ. Res. 
Inst. Math. Sci., 10 (1974), 101-1 10. 

[27] 1. Naruki, On extendibility of isomor- 
phisms of Cartan connections and biholomor- 
phic mappings of bounded domains, Tóhoku 
Math. J., 28 (1976), 117-122. 

[28] T. Nishino, Nouvelles recherches sur les 
fonctions entiéres de plusieurs variables com- 
plexes (II), Fonctions entières qui se réduisent 
à celles d'une variables, J. Math. Kyoto Univ., 
9 (1969), 221-274. 

[29] T. Ochiai, On holomorphic curves in 
algebraic varieties with ample irregularity, 
Inventiones Math., 43 (1977), 83-96. 

[30] K. Oka, Sur les fonctions analytiques de 
plusieurs variables (collected papers), Iwanami, 
1961. 

[31] W. F. Osgood, Lehrbuch der Funk- 
tionentheorie II, Teubner, 1924; revised edi- 
tion, 1929 (Chelsea, 1965). 

[32] G. Scheja, Riemannsche Hebbarkeitssátze 
für Cohomologieklassen, Math. Ann., 144 
(1961), 345—360. 

[33] K. Stein, Analytische Functionen meh- 
rerer komplexer Veranderlichen zu vorgege- 
benen Periodizitátsmoduln und das zweite 
Cousinsche Problem, Math. Ann., 123 (1951), 
201-222. 

[34] H. Weyl and J. Weyl, Meromorphic 
functions and analytic curves, Ann. Math. 


109 


Studies 12, Princeton Univ. Press, 1943. 
Also = references to 23 Analytic Spaces. 


22 (1.12) 
Analytic Sets 


A. General Remarks 


The notion of analytic sets was first defined by 
N. N. Luzin and M. Ya. Suslin in 1916, and 

it was extended to that of projective sets by 
operations such as complementation and pro- 
jection (Luzin, 1924). Most mathematicians, 
including Luzin and W. Sierpinski, who 
worked in this field, were in agreement with 
"French empiricism (or tsemi-intuitionism), 
which defended the standpoint of R. Baire, 

E. Borel, H. Lebesgue, and others. An object is 
said to be effectively given if it can be uniquely, 
individually, and unambiguously determined 
in finite terms so that anyone can reach the 
same object by following the defining pro- 
cedure. Semi-intuitionists claim that only 
effectively given objects have mathematical 
existence, and they do not recognize as a 
mathematical object something that needs the 
axiom of choice for its definition. From this 
point of view, *Borel sets were “well-defined” 
sets to which classical analysis had to be re- 
stricted. Thus the question was raised whether 
it is possible to extend the class of Borel sets to 
a wider class of sets with the same certainty. 
Lebesgue (J. M ath. Pures Appl., 1 (1905)) de- 
fined a function not belonging to any class 

of Baire functions by using the totality of 
‘ordinals of the second class. (Later, this 
method was systematically developed as the 
*theory of sieves by Luzin.) However, it did 
not satisfy Borel as being effective. Can we, 
then, extend the Borel sets without any use of 
ordinals of the second class? The discovery of 
analytic sets gave an affirmative answer. 

In this article (except in Section 1), we treat a 
space (denoted by X, Y, . ) that is thomeo- 
morphic to a ‘complete tseparable *metric 
space and its subspace. Denote by 9i the space 
of irrational numbers (a metric space consts- 
ting of the irrational numbers e R with the 
metric |x — y| of x and y). The following prop- 
erties of a subset S of a space X are equivalent: 
(i) § is a continuous image of N; (ii) $ is a con- 
tinuous image of a Borel set in X; (iii) S is the 
projection of a closed set in a product space 
X xN; (iv) § is the projection of a Borel set in 
X x Y. We calla set satisfying one of these 
properties an analytic set, an A set, or a J} set 
(in X). The complement of an analytic set is 
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caled a complementary analytic set (or simply 
coanalytic set), a CA set, or a HI set. 


B. The Operation A and Sieves 


When to each (n, , , ng) of finite sequences of 
natural numbers there corresponds a unique 
element E(n,, , n,) of a family F of sets, this 
correspondence {E(n,, . . , n,)} is called a 
schema of Suslin (or system of Suslin) consist- 
ing of sets in F. Denoting an infinite sequence 
of natural numbers by {n;}, the set given by 
Uus [k Ego, ml is called the kernel of a 
system of Suslin, and the operation of taking 
the kernel is called the operation A (analytic 
operation). 

Let Q be the set of al] rational numbers 
between 0 and 1 and F be a family of sets. 
Take a family 1 C,j,«q of sets belonging to F 
with the index set Q (or more geometrically, a 
subset C= |] «gC, x (r) of X x Q when F is a 
family of subsets of a space X), and call it a 
sieve consisting of sets in F. Denoting by inj a 
(strictly) monotone decreasing sequence of 
elements of Q, we call the set LTC, 
(namely, the set of all x such that C? = 
{r (x, ) € C} is not well-ordered by the 
order < of rational numbers) the set obtained 
by a sieve C or the sieved set obtained by C. If 
the family F is closed with respect to countable 
intersection, then the family of al] sets ob- 
tained by sieves consisting of sets in F is ident- 
ical to the family of all sets obtained by apply- 
ing the operation A to F. When F consists of 
all the closed sets in a given space, this is the 
family of all analytic sets. In particular, it is 
sufficient to take the family of closed intervals 
as F when X is the space of real numbers. 
Note that we can define sieves and sieved sets 
more generally by using the space of real num- 
bers R instead of the set Q of rationals. 


C. Properties of Analytic Sets 


It is evident from the definition that every 
‘Borel set is analytic. If a Borel set is uncount- 
able, then it is the union of a countable set 
and a one-to-one continuous image of 9t. The 
analyticity of sets is invariant under countable 
unions, intersections, and Cartesian products 
and the operation A and *Borel-measurable 
transformations. An uncountable analytic set 
contains a tperfect subset (Suslin). Therefore 
the possible cardinality of an analytic set is at 
most countable or is that of the continuum. 
Every analytic set enjoys the *Baire property, 
and in Euclidean space every analytic set is 
'Lebesgue measurable (Luzin, Sierpinski). If a 
set E in the Euclidean plane is analytic (coana- 
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lytic), then I(E) is also analytic (coanalytic), 
where I(E) is the set of all x such that the 
section E™ of E that is parallel to the y-axis 
has a positive measure (M. Kondó and T. 
Tugué). Concerning the Baire property, the 
similar result for a set E of X x Y is obtained 
by replacing *to have a positive measure" by 
"to be the *second category (nonmeager) set" 
(A. S. Kechris). The Lebesgue measure of an 
analytic set is effectively calculable (in the 
sense that the measure of a X} (or X1(a)) set is 
a X! (resp. X1(a)) real number (- Section G 
and 356 Recursive Functions) (H. Tanaka). 
Every analytic (coanalytic) set E can be de- 
composed into N, Borel sets. This decompo- 
sition is called a decomposition of E into 
constituents. An analytic (coanalytic) set is a 
Borel set if and only if it is decomposable into 
a countable number of constituents (Luzin, 
Sierpinski). In a space with the cardinality of 
the continuum, there exist analytic sets that 
are not Borelian. For example, in the space 

C( [0, 1]) of continuous functions on the inter- 
val [0, ]] (2. 168 Function Spaces) the set of 
all differentiable functions is coanalytic but not 
Borelian (S. Mazurkiewicz). 

The following theorems are especially im- 
portant in analytic set theory. Luzin's first 
principle (the first separation theorem): For 
every pair of disjoint analytic sets A,, A,, there 
exists a Borel set B such that A, c B and 
BILA, = Ø. An immediate corollary of 
Luzin’s first principle is Suslin's theorem: If 
both A and X = A are analytic,’ then A is a 
Borel set. Luzin's second principle (the second 
separation theorem): For every pair of analytic 
sets A and B, there exist complementary analy- 
tic sets C and D such that 4—Bc C, B-Ac 
D, and CND = (Qj. A one-to-one continu- 
ous image of a Borel set is Borelian (Suslin). 
More generally, for a given B-measurable 
function f defined on a Borel set B, the set A 
(cf(B)) of all points y whose inverse images 
f |) are singletons is a complementary 
analytic set (Luzin’s unicity theorem). In this 
theorem, we can replace “a singleton” by “a 
*g-compact set" (V. Ya. Arsenin, K. Kunugui). 
Therefore, if a set is the image of a Borel set by 
a continuous function such that the inverse 
image of each point is a o-compact set, then it 
is a Borel set. 


D. Generalization to Projective Sets 


A projective set of class n is inductively defined 
as follows: (1) the Borel sets are the projective 
sets of class 0; (ii) the projective sets of class 
2n + | are the continuous images of the sets of 
class 2n; (iii) the projective sets of class 2n are 
the complements of the sets of class 2n = 1. 
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The projective sets of class 1 are exactly the 
analytic sets, and those of class 2 are the com- 
plementary analytic sets. The following are 
fundamental properties of projective sets. 
Denote by L, the family of the projective sets 
of class n. Then (1) L;,C Lan and L;,,, C 
Lan+24+ (K= 1,2, . ..). 2) the property of 
being a set of class n is invariant under count- 
able unions, intersections, and Cartesian 
products and homeomorphisms; (3) a contin- 
uous image of a projective set of class 2n 4- 1 
is of the same class; (4) the projection on X of 
a set of class 2n + | in X x Y is a set of class 
2n + 1 in X; (5) the family of the projective sets 
of class 2n + 1 in a space X is the family of the 
projections of all sets of class 2n in X x X (or 
X x 90); (6) the kernel of a system of Suslin 
consisting of sets of class nis a projective set 
of the same class, where n 40, 2. 

We frequently call a projective set of class 
2n lapP setora E eet, and that of class 2n 
a C, set or a II, set. A B, set or a Aj set is a set 
that is both P, and C,. The respective families 
of these sets are also denoted by P, (22), 

C, (111), and B, Ak respectively. In general, for 
a family § of sets in a space X, we denote by 
Cé the family of the complements X-E of all 
sets E in Sy. We write Sep,(%) and Sepy(%) for 
the propositions obtained by substituting “set 
in S," "set in & and in CY,” and “set of CR” 
for “analytic set," “Borel set,” and “coanalytic 
set,” respectively, in Luzin’s first principle and 
Luzin’s second principle, respectively. Fur- 
thermore, we say that the reduction principle 
holds for iy, and denote it by Red(%), when for 
any sets A, BE & there exist A,, B, € & such 
that A, c A, B, c B, A, UB, = AUB, and 

A, N B, = @. C. Kuratowski introduced the 
latter principle Red(%) which implies Sep,(C%) 
and Sepy(C%), and proved Red(171). Classi- 
cally, reduction or separation principles for the 
projective sets of higher classes were not 
settled except for Sep,(/7}), Sep, (773) (P. S. 
Novikov), and Red(Z3) (Kuratowski). Now- 
adays, it is known that these principles for 
EL, H1),,, are undecidable from the axiom- 
atic set theory ZFC. If we assume tthe axiom 
of constructibility V = L, then Red(£}} holds 
for n> 3 (J. W. Addison). On the other hand, 
under the assumption of *projective deter- 
minacy PD (— Section H), it follows that 
Red(/7}) (Red(21)) holds when p is odd (even) 
(A. Martin, Addison and Y. N. Moschovakis). 


E. Universal Sets 


A set U in Jt x X is called the universal set 
for the projective sets of classn in X if for any 
projective set P of class n in X, there exists 
Zz €t such that P 2 (x|(zo, x) eU]. Concern- 
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ing universal sets, we have the following result: 
for every n> 0, there exists a universal set for 
the projective sets of class n in X that is of the 
same class in 9t x X. Hence in a space with 

the cardinality of the continuum, there exists a 
projective set of class n + 1 which is not of 
class n. In general, if a class % of sets is closed 
under taking a universal set and continuous 
preimages, then % and C% cannot both have 
the reduction property. Therefore Red(42]) or 
Red(22) fails for each n> 1. 

Any two universal sets of analytic sets are 
tBorel isomorphic. On the other hand, an 
analytic set E is not Borel isomorphic to any 
universal analytic set if the complement of E is 
an uncountable set without a perfect subset (A. 
Maitra and C. Ryll-Nardzewski). Hence, there 
are at least two equivalence classes of uncount- 
able analytic sets with respect to the Borel 
isomorphism zx under the assumption of V= 
L, since the latter assumption implies the 
existence of such a set E (- Section H). 


F. Uniformization Principle 


The uniformization problem arose during 
investigations of implicit functions. For a set E 
in a space X x Y, uniformization of E is the 
finding of a subset V of E such that 


Vx(3y((x, y)e E) «»3ty((x. ye V)), 


where 3!y is the *quantifier which means 
"there exists exactly one y." A Borel set can be 
uniformized by choosing a suitable coanalytic 
set (Luzin). Kondó's uniformization theorem 
(Japan. J. Math., 15( 1938)) is the most impor- 
tant result in descriptive set theory: every 
coanalytic set is uniformizable by a coanalytic 
set. As a corollary to this, any X1 set is uni- 
formizable by a Z} set, and every ©} set is 
obtained as a one-to-one continuous image 
of a coanalytic set. 

Kondó's original proof was very difficult to 
understand. Since teffective descriptive set 
theory was introduced by Kleene and Ad- 
dison, the proof has been simplitied by Ad- 
dison, and a more elegant one has been given 
by J. R. Shoenfield [7, p. 188]. Nowadays, the 
theorem is also called the Novikov-Kondó- 
Addison theorem, and is one of the most 
powerful and fundamental theorems not only 
in descriptive set theory, but also in the tfoun- 
dations of mathematics. 

The uniformization of an analytic set is, in 
general, not to be found among analytic or 
coanalytic sets. There was a conjecture that 
any analytic set is uniformizable by specifying 
an A, set (difference of two analytic sets). 
Recently, this conjecture was negatively settled 
(J. Steel, Martin). Assuming that V= L, the 
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uniformization of a £} (n > 3) set is determined 
by specifying a X set, and that of a H; (n > 2) 
set by specifying a In set. On the other hand, 
if an axiom system of set theory (e.g., ZF; — 33 
Axiomatic Set Theory) is consistent, then it is 
still consistent even if we add to it the follow- 
ing proposition: There exists a H: set whose 
uniformization is impossible by any choice of a 
definable set in the system (P. J. Cohen, A. 
Lévy). However, if we assume the existence of 
a tmeasurable cardinal (MC), it is known that 
every IT} set is uniformizable by a M4 set 
(Martin and R. M. Solovay). On the other 
hand, PD implies “every M! (X1) set is uni- 
formizable by a M! (21) set for each odd 

(resp. even) n” (Moschovakis). 

There are sufficient conditions on sections 
E'? of a Borel set E in the Cartesian product 
X x Y for E to be uniformizable by the Borel 
set as follows: all sections E(? are (i) countable 
(Luzin, Novikov), (ii) a-compact (Arsenin, 
Kunugui); or, as "large section property," (iii) 
in the second category (S. K. Thomason, P. G. 
Hinman), or (iv) of positive measure (Tanaka, 
G. E. Sacks). For applications of descriptive 
set theory to analysis, an important uniformi- 
zation result is von Neumann's selection 
theorem: for any A set E C X x Y, there exists 
a tBaire measurable and *absolutely measur- 
able function f: X ^ Y (- 270 Measure 
Theory L (vi)) such that 


Vx(3y(Gx, yje E) <> (x, f(x))€ E). 


Concerning implicit functions, any Borel set 
E c X x Y such that all sections FE) are at 
most countable is expressed by a union of 
intersections of E with graphs of some Baire 
functions f,: X > Y, neN (Luzin). Recently, it 
was shown known as a generalization of this 
that any Borel set Ec X x Y such that all 
sections are a-compact is a union of countably 
many Borel sets E, for each of which all sec- 
tions Ef? are compact (J. Saint-Raymond). 


G. Kleene’s Hierarchy and Effectiveness 


First, projective set theory in any space is 
reducible to the theory in the space of irra- 
tional numbers. Second, if we introduce a 
tweak topology in the set NX = en of *number- 
theoretic functions x with one argument, the 
resulting topological space N^ is homeomor- 
phic to the *Baire zero-dimensional space 

of irrational numbers. Third, any subset B of 
N” is open and closed in this topology if and 
only if there exist a function £(eN") and a 
predicate A*(x) that is *general recursive in č 
such that ae B<> A*(x). Fourth, logical opera- 
tions such as 1, v , A, 3x (where x is a varti- 
able ranging over the natural numbers), and Ja 
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exactly correspond to the operations (on sets) 
complementation, union, intersection, count- 
able union, and projection, respectively. On 
the basis of these facts, projective set theory is 
regarded as the theory of the N" tanalytic hier- 
archy of Kleene. Here, the following example 
is remarkable: We can construct a X1 set which 
is universal for the analytic sets (namely, a 

X! [N"] set) in the space of irrational num- 
bers (— 356 Recursive Functions H). 

The connection between projective set 
theory and logic has been discussed by C. 
Kuratowski and A. Tarski. From their point of 
view, semi-intuitionists such as Borel regard 
the set of natural numbers to be precisely clear 
in itself and also the continuum to be immedi- 
ately recognizable by our geometric intuition. 
In their argument rational numbers do not 
play such an important role. They take, a 
priori, the set of irrational numbers as the 
fundamental domain, and intervals with ra- 
tional extremities as the simplest sets of points 
among the subsets of the fundamental domain 
as the starting point of their argument. Here, 
the fundamental domain or each interval is 
not conceived as a totality of its elements, but 
recognized as a "uniform extent." In contrast 
to this, singletons and individual irrational 
numbers are not so simple. For this reason 
Borel introduced the notion of calculable 
numbers to study definable real numbers. 
Following Luzin, we say that a calculable 
number is a constructible real number in the 
sense that we can give it by an arithmetical 
approximation as precisely as we want. Now, 
this notion is nearly identical to the notion of 
an effectively calculable real number given by 
A. Church or A. M. Turing. 

In the mathematics of the semi-intuitionists, 
the word "effective" has played an especially 
important role. Although these mathema- 
ticians have always agreed not to accept the 
taxiom of choice, the exact meaning of “effec- 
tive" has differed slightly among different 
members of their group or in different stages 
of the development of the theory. Such dif- 
ferences mainly arose in connection with the 
question: How can we tell whether given en- 
titles are finitary or individual? One way to 
guess the original intention held by Borel and 
others when they used the term “effective” is to 
replace the term by "recursive." Nowadays, the 
concept of "effectiveness" is used in this sense 
(— 356 Recursive Functions C), and all clas- 
sical results in descriptive set theory essentially 
have effective versions (or refinements via 
relativization). For example, the Novikov- 
Kondó-Addison theorem is described as fol- 
lows: Any M} (IT} (x) for an aec) set is uni- 
formizable by a HI (resp. M! (x)) set. Similarly, 
Suslin's theorem is that every analytic and 
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coanalytic (i.e., 41(x)) set A is a Borel set of 
class v(x) which is effective relative to the 
"definability x” of given A, where v(a) is the 
+Constructive ordinal relative to g (Kleene, 
Tugué and Tanaka, A. Louveau, etc A 


H. Further Results in Axiomatic Set Theory 
and Strong Axioms 


The recent development of axiomatic set 
theory has yielded the following propositions. 
Under the assumption that V= L (K. Gödel, 
Novikov, Addison): (1) there exists an uncount- 
able 77} set that does not contain any perfect 
subset; (2) there exists a nonmeasurable 4} 
set; (3) there exists a 4} set that does not have 
the Baire property. On the other hand, if the 
axioms of set theory ZFC plus "there exists an 
"inaccessible cardinal number" are consistent, 
then so is "every projective set is Lebesgue 
measurable, has the Baire property, and has a 
perfect subset when it is uncountable" with 
them (Solovay). Concerning these properties, 
'Martin's axiom MA and "27»» N," (MA + 

< CH is consistent with ZFC; 2 33 Axiom- 
atic Set Theory E) implies that every Xj is 
measurable, and has the Baire property 
(Martin and Solovay). The possible cardinality 
of a X; set is at most N, or that of the continu- 
um, which is implied from “every Z} set is a 
union of N, Borel sets" (Sierpinski). It is 
known that one cannot prove or refute from 
ZFC the converse of the latter statement. If 
one assumes MC (the existence of a measur- 
able cardinal), then every Xj set is a union of 
N, Borel sets. Therefore MC implies that the 
possible cardinality of a £} set is at most &, 
or that of the continuum. 

With each set A C "c we associate the 
following infinite game of perfect information 
G(A), played by two players I and II. First 
player I chooses no € , then player II chooses 
n, € w, then I chooses n; € œ, and so on. The 
game ends after œ steps. Let r(k) = ną; if XE A, 
then [ wins G(A); otherwise II wins. A strategy 
(for 1 or II) is a rule that tells the player what 
move to make, depending on the previous 
moves of both players. A winning strategy is 
one such that the player who follows it always 
wins. The game G(A) is determined if one of 
the players has a winning strategy. The axiom 
of projective determinacy PD is the assertion 
that for every projective set A( Col the game 
G(A) is determined. PD is widely used to solve 
problems on projective sets that are not decid- 
able from ZFC, and is a plausible hypothesis 
that does not seem to contradict the axiom of 
choice at present; and it has pleasing conse- 
quences in descriptive set theory. For n 3 3, 
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reductions and uniformization results of 2} 
(17) sets mentioned earlier are such examples. 
Furthermore, PD implies that every projective 
set is Lebesgue measurable, has the Baire 
property, and contains a perfect subset if it is 
uncountable. 

Now, write Determinacy ( X!) (Determinacy 
(A])), or simply, Det(}) (Det(A})), when G(A) 
is determined for every Z% (Aj) set A. Using 
the fact that the open game 1s determined, 
namely Det(2°) (D. Gale and F. M. Stewart), 
D. Blackwell has given a short proof of 
Sep;(Z1). The extremely difficult problem was 
to show Det(A}) in ZF (indeed, one cannot 
prove Det(A1) only from the Zermelo axioms 
Z; = 33 Axiomatic Set Theory B). In 1975, 
Martin solved this problem. Concerning 2), 
Det 1) cannot be proved from only the ZF 
axioms, but MC implies De E) (Martin). 
However, it is known that Det(£}) cannot be 
proved even if we assume MC. 

By a pre-well-ordering, we mean a relation 
< having all the properties of a well-ordering 
except for tantisymmetry. Denote by à] the 
least ordinal that cannot be the length of a 
preewell-ordering on "o belonging to Aj. It 
is a classical result that 6} =w, (= K,). The 
following results are mainly due to Martin: 

ô} «N,, MC 2 ô! < N,, and also, PD > 

9; « N,. Let x be an infinite cardinal. A set 

A c X is called x-Suslin when there exists a 
closed set Cc X x "k (?k—1 flf :ox]) such 
that 


xeA«» 3f [f e?k ^ (x, f)eC]. 


A is X,-Suslin iff Ae Z1 If A is a X2 set, then 
A is N,-Suslin (Shoenfield). Under the assump- 
tions of MC, if A is a Ei set, then it is X,- 
Suslin (Martin). Furthermore, Det(A},,) implies 
the following facts (Kechris, Moschovakis): 
every 25,,, set is x-Suslin, where x is the 
cardinal of 85,,,; every Din set in x-Suslin 
for some «k < 61,,, 

The axiom of determinacy AD is the asser- 
tion that the game G(A) is determined for 
every set A. AD implies that every set is 
measurable (J. Mycielski and S. Swierczkow- 
ski), has the Baire property, and has a perfect 
subset if it is uncountable (Morton Davis). 
Though AD contradicts the axiom of choice, 
some consequences of AD in the area of pro- 
jective sets are more desirable than the conse- 
quences of the axiom of choice. The following 
are examples of consequences of AD. (1) Aisa 
Z! set iff A is a union of N, Borel sets (Mos- 
chovakis). (2) For each n, à! is a cardinal 
(Moschovakis). (3) Generalization of Suslin's 
theorem holds as follows: 4} = Bg: (= the 
smallest Boolean algebra containing the open 
sets and closed under complementation and 
unions of length <6}) (Martin, Moschovakis). 
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[. Polish Spaces, Luzin Spaces, and Suslin 
Spaces 


A topological space homeomorphic to a com- 
plete separable metric space is called a Polish 
space. A subspace E of a Polish space X is 
Polish if and only if it is a G;-subset of X, Le. a 
countable intersection of open subsets of X 
(Aleksandrov and Uryson). A Hausdorff top- 
ological space X is called a Luzin space (resp. 
Suslin space) if we can find a Polish space S 
and a continuous bijective (resp. surjective) 
mapping f: S— X. Every Polish space is a 
Luzin space and every Luzin space is a Suslin 
space. 

Let X be a Hausdorff topological space. A 
subset E of X is called a standard set (resp. 
analytic set) if the set E with the relative top- 
ology is a Luzin (resp. Suslin) space. The 
analytic subsets are closed under analytic 
operations. Every analytic set is obtained from 
closed sets by applying the analytic operation. 
Every analytic set is universally measurable. 
Every standard set is a Borel set. Every count- 
able union or intersection of standard sets is 
standard. 

A subset E of a Suslin space is a Borel set if 
and only if both E and E" are analytic. A 
subset E of a Luzin space is a Borel set if and 
only if E is standard. 

Let f be a Borel measurable mapping from 
a Suslin space X into another Suslin space Y. 
Then the image f(X) is an analytic subset of Y. 
Furthermore, if f is injective, then f gives a 
*Borel isomorphism between X and fl X). If 
both X and Y are Luzin spaces and if f is 
injective, then f(X) is a Borel subset of Y. 
Every Suslin space is *Borel isomorphic to an 
analytic subset of R, and every Luzin space is 
Borel isomorphic to one of the following 
spaces: (1) N,2 (1,2, ...,n], (2) N2 (1,2, ...]. 
B R. 

The selection theorem due to von Neumann 
(- Section F) holds when X and Y are Suslin 
spaces. This fact and its ramifications [ 15,161 
are useful in nonlinear functional analysis and 
control theory. 

Practically all useful spaces appearing in 
functional analysis and probability theory are 
Polish or Luzin spaces. Examples: (1) Every 
locally compact Hausdorff space with a count- 
able open base is Polish. (ii) Every separable 
Banach space is Polish. (iii) The set C of all 
continuous functions on [0, 1] with the *top- 
ology of uniform convergence is a Polish 
space. (iv) The set D of all right continuous 
functions on [0, 1] with left limits is a Polish 
space when it is endowed with the Skorokhod 
topology (- 250 Limit Theorems in Proba- 
bility Theory E). (v) The space % of distri- 
butions and the space 5" of tempered distri- 
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butions (- 125 Distributions and Hyperfunc- 
tions) are Luzin spaces. 
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A. General Remarks 


An tanalytic function of a complex variable 
has as its natural domain of detinition a +Rie- 
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mann surface, Le, a 1 -dimensional complex 
analytic manifold. In the case of several com- 
plex variables, the set of zeros of an analytic 
function, the quotient space of a domain by 

a tproperly discontinuous group of analytic 
automorphisms, the existence domain of an 
talgebroidal function, etc., are, strictly speak- 
ing, not necessarily complex analytic mani- 
folds. It is necessary to consider a more gen- 
eral category of complex analytic manifolds 
with singularities, and the notion of analytic 
spaces is drawn from these examples. Many of 
the properties of complex analytic manifolds 
are extended to analytic spaces; on the other 
hand, theories specific to analytic spaces have 
also been developed. 


B. Analytic Sets 


We say that a subset A of a complex analytic 
manifold G is an analytic set in G if itis a 
closed subset and each point of A has a neigh- 
borhood U such that Uf A is the set of com- 
mon Zeros of a finite number of holomorphic 
functions in U. Specifically, if A is locally the 
set of zeros of a single holomorphic function 
that does not vanish identically, then A is 
called principal. Two subsets A, and S, of G 
are called equivalent at z? e G if there exists 
a neighborhood U of z? such that S, NU = 
S, N U. By this equivalence relation, every 
subsets $ of G defines its germ $,, at 2° A 
germ of an analytic set at z? is the germ at z? 
of an analytic set in a neighborhood of 7°, 
Each germ A, of an analytic set at z? = Qe G is 
associated with an ideal [(A9) = {f| f e H(0), 
f| A. =0} in the ring H(O) of tgerms of holo- 
morphic functions at 0. We call A, reducible 
if A, is the union of two germs of analytic sets 
A,’ and A," with Aj z A, Ag" # A; other- 
wise, A, is called irreducible. An analytic set A 
is called irreducible at 0 if the germ at 0 of A 
is irreducible. Properties of A, and TA.) cor- 
respond to each other. Thus A, is irreducible 
if and only if [( A,) is prime. 

As the ring H(0) is *Noetherian, in a neigh- 
borhood of every point z? an analytic set A 
is represented as a union of a finite number 
of analytic sets A, that are irreducible at z?. 
These A, are essentially unique. If an analytic 
set A is irreducible at z°, then there exists a 
system of local coordinates (z,, . , z.) centered 
at z? and a pair of natural numbers d <n and k 
such that, in a neighborhood of z9, A is a k- 
sheeted ramified covering space with covering 
mapping @:(Z,,.... Za} (z,, Za); 1.€., for an 
analytic set R in a neighborhood of 0 € C4, o : 
A—qo !(R)2C'— R is, in a neighborhood of 
z°, a k-sheeted covering mapping, where A 
o ^! (R) is a connected d-dimensional com- 
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plex analytic manifold in the neighborhood. 
The coordinates of the k points of A —9 "UR 
over each point (z,,, z,) are holomorphic in 
these variables. The number d is the (local) 
dimension of A at z° and is denoted by dim,» A. 
From this local representation, we obtain 
Riickert’s zero-point theorem: Every prime 
ideal 58 in H(0) is equal to [(A,), with A, an 
irreducible germ of an analytic set at 0. In this 
case the dimension at 0 of an analytic set A 
that defines A, is equal to the tKrull dimen- 
sion of the "local ring H(0)/§8. The theory of 
local rings is very important in the study of 
germs of analytic sets. The dimension of a 
general analytic set A at z? is detined by 
dim,» A =sup;dim,oA;, where A —| J; A; in a 
neighborhood of z?, with the A, irreducible 
at 2° If dim,» A; is equal to d for all į, then A 
is called purely d-dimensional at z?. The 
(global) dimension of A is defined by dim A = 
Sup,., dim, A. A purely d-dimensional ana- 
lytic set is defined to be an analytic set that is 
purely d-dimensional at every one of its points. 

A point z? of an analytic set A is called 
ordinary (regular or simple) if A has the struc- 
ture of a complex analytic submanifold in a 
neighborhood of z°, The set A’ of ordinary 
points of A is dense and open in A. The set A* 
= A AT of singular (not ordinary) points is an 
analytic set in G. If A is purely d-dimensional, 
then A' is a d-dimensional complex analytic 
manifold and A* is an analytic set of dimen- 
sion « d — 1. 

Let A be an analytic set of dimension d in G, 
and B a purely d'-dimensional analytic set in G 
-A with d' > d. Then the closure B of B in G is 
a purely d'-dimensional analytic set (Remmert- 
Stein continuation theorem [17]). 

For every analytic set A in G, the *analytic 
sheaf .7(A) of germs of holomorphic functions 
over G that vanish on A is fcoherent (H. Car- 
tan). We call ./(A) the sheaf of ideals defined 
by an analytic set A. Let Oç be the sheaf of 
germs of holomorphic functions over G. Then 
Gi —(06/.4(A)|A is a coherent sheaf of rings 
over A. (, is called the sheaf of germs of holo- 
morphic functions on an analytic set A. 


C. Analytic Spaces 


A 'ringed space (X, 0,) with Hausdorff base 
space X is called an analytic space if for every 
point x& X, there exists an open neighborhood 
U of x such that the ringed space (U, (0, | U) is 
isomorphic to a ringed space (A, C.L where A 
is an analytic set in an open set G of some C”. 
The structure sheaf Qy is then called a sheaf of 
germs of holomorphic functions. The notion of 
tholomorphic mapping from one open set in 
C" into another is generalized to the case of 
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mappings from one analytic set into another. 
An analytic set Y in an analytic space X and 
the sheaf (0, of germs of holomorphic functions 
on Y are detined as in the case where X is a 
complex manifold. The ringed space (Y, 0,) is 
an analytic space and is called an analytic 
suhspace of X. For an analytic space X, the 
notions of dim, X, dim X, irreducibility, and 
pure dimensionality are detined as for an 
analytic set A c G c C". Every analytic space X 
is the union of a locally finite family of irreduc- 
ible analytic subspaces X, called the irreduc- 
ihle components of X. 

Let ọ : X -» Y be a holomorphic mapping 
of an analytic space X into another, Y. Its 
rank at xe X is detined by r,(x) = dim, X = 
dim, @ '(o(x)). The number r, — sup, xr, (x) 
is called the rank of «p, The set of degeneracy 
E, of ọ is the set of points xc X such that 
Foix (X) € ben: for an irreducible component 
X’ of X through x. The mapping ¢ is non- 
degenerate if E, = @. For any k €N, [xe X 
FX) S k] 1s an analytic set (R. Remmert[ 16]). 
In particular, E, is analytic. For a holomor- 
phic mapping o :X > Y, the inverse image of 
an analytic set in Y is an analytic set in X. 
However, the image of an analytic set is not 
necessarily analytic. If (p is *proper, then the 
image Q(X') of an analytic set X" in X is an 
analytic set in Y of dimension ry. and is 
irreducible if X" is irreducible (Remmert’s 
theorem [ 16]). 


D. Modifications and Resolution of 
Singularities 


Let M be a subset of an analytic space X. If, 
for every point x € X, there exists an open 
neighborhood U of x and and an analytic set 
M* in U, containing UM M such that U-M* 
is dense in U, then M is called analytically 
thin. Let p: X 2 Y be a holomorphic map- 
ping. Suppose that there exist two analyti- 
cally thin sets A4 c X, N c Y such that o 
induces an isomorphism between X — M and 
Y =N. Then X is called a holomorphic modifi- 
cation of Y. If furthermore ¢ is proper, then X 
is called a proper modification of Y. A mon- 
oidal transformation of an analytic space X 
with respect to a coherent sheaf of ideals .¥ is 
detined as in the case of a complex manifold or 
an algebraic variety (— 16 Algebraic Varieties 
L; 72 Complex Manifolds H). Tt is a proper 
modification f: X* — X such that the inverse 
image ideal sheaf f "8 "Gu is inyertible, and 
is universal among all proper holomorphic 
mappings h: Z—X with the property that 
h~' -Oz is invertible, where f "Të is the 
"inverse image of ZU J is the sheaf of ideals 
detined by an analytic set Y in X, the mon- 
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oidal transformation of X with respect to Zi 
often called the blowing-up of X with center Y. 

H. Hironaka [ 14] proved that, if X is an 
analytic space which is countable at infinity 
(i.e., a countable union of compact sets), then 
there is a proper modification z: X' X with 
X’ smooth De, free from singular points). Such 
a modification is called a desingularization (or 
resolution of singularities) of X. Moreover, 
over any relatively compact open set U of X, 7 
is the product of a finite sequence of blowing- 
ups 1: X; Xi- (X; =X), with smooth centers 
Y/ , along which X; , is tnormally flat. This 
deep result enables one to derive properties of 
analytic spaces from those of complex 
manifolds. 

Let X and Y be two analytic spaces and G 
an analytic set in X x Y. If the canonical pro- 
jection z: G9 X is a holomorphic (or proper) 
modification, then we say that a meromorphic 
mapping (a proper meromorphic mapping) jt 
of X into Y is defined. The set G is then called 
the graph of u. A holomorphic mapping o: X 
> Y can be viewed as a proper meromorphic 
mapping. Let ji: X > Y be a proper meromor- 
phic mapping. Then u(x) (the projection of 
1 !(x) into Y) is a nonempty analytic set in Y 
for every point x € X. Moreover, there exists 
an analytic set N, with X-N dense in X, such 
that u maps SN into Y holomorphically. 
The smallest set N with this property is called 
the set of points of indeterminacy or the sin- 
gularity set of x. A meromorphic mapping f:X 
P" (C) is called a meromorphic function on X 
if none of the irreducible components of X is 
mapped to {co} by f. The set f ^! (0) = x((X x 
(03) 1G) is called the set of zero points of X, 
and the set f~'(00) is called the set of poles. 
These are analytic sets in X. Let fiex fy be 
meromorphic functions on X. Then, by a 
suitable proper modification of X, one can 
eliminate the points of indeterminacy of the 
meromorphic mapping f: X-(P!(C)) defined 
by xo(fi(x), . , f, (x)), ie., one can modify f to 
be holomorphic. The ring of meromorphic 
functions on X is invariant under proper 
modifications of X. If X is irreducible and 
compact, then the field of meromorphic func- 
tions on X is a simple algebraic extension of 
the ‘field of rational functions of k ( € dim X) 
variables. 

Let (X, 0,) be an analytic space. A point 
x € X is called normal for X if €, „is a 'normal 
local ring. The set of nonnormal points for X 
is an analytically thin analytic set in X (K. 
Oka). Every ordinary point of X is normal. 
We call X normal if every one of its points is 
normal. Every nondegenerate holomorphic 
mapping of an irreducible X into an irreduc- 
ible and normal Y is an open mapping if its 
rank is equal to the dimension of Y (Remmert). 
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For every analytic space X, there exists a 
proper modification v: X 2 X with X normal 
such that v is nondegenerate and v X — v^! (S) 
is an isomorphism, where § is the set of non- 
normal points. Such a proper modification of 
X is unique up to isomorphisms. We call X 

a normalization of X with normalizing map- 
ping v. 

Let o : X 2 Y be a holomorphic modifi- 
cation. Suppose that Y is normal at (x?) 
(x?& X). If the set o ! (o(x?)) contains an iso- 
lated point, then o ^! (p(x9?)) 2 x9, and o is an 
isomorphism in a neighborhood of x? (an 
analog of !Zariskr's main theorem). In partic- 
ular, if o : X — Y is a holomorphic modifica- 
tion and Y is normal, then o maps X —E, 
isomorphically onto the dense open set ok 
E,). Furthermore, if a holomorphic mapping 
(: X > Y is injective and X and Y are irreduc- 
ible, normal, and n-dimensional, then @(X) 
is an open set in Y and o 1: o(X)9 X is 
holomorphic. 


E. Analytic Spaces in the Sense of Behnke and 
Stein 


Let o : GG be a proper continuous mapping 
of a connected ‘locally compact space G onto 
a domain G c C". The triple 65 = (G,¢, G) is an 
analytic covering space over G if the following 
conditions are satisfied: (i) o ! (z9) is a finite set 
for every point z? c G. (ii) There exists an ana- 
lytic set A c G of dimension € n = 1 such that 
o G=@'(A)is a local homeomorphism and 
every point of g !(A) has a fundamental sys- 
tem of neighborhoods U such that both U 
and U — o ^! (A) z Qj are connected. As © is 
unramitied over G-A, the number of points in 
Q ~! (z?) is constant for z? € G-A and is called 
the number of sheets of (5, A point Ze G is 
called a ramification point of (5 if the restric- 
tion of g to any neighborhood of Z is not a 
homeomorphism. Denote by B the set of 
ramification points of (5. Then g(B) C A is an 
analytic set of dimension n= 1. Let f be a 
continuous complex-valued function in an 
open set D in G. We call f holomorphic in D if 
for every point ZU ED B and for every open 
neighborhood V of z? = (29) over which ¢ is 
a homeomorphism, fo 9 ! is holomorphic in 
V. Denote by Og the sheaf of germs of holo- 
morphic functions over C. Then (G, Og) is a 
ringed space. An analytic space in the sense of 
Behnke and Stein is a Hausdorff ringed space 
(X, 0,) that is locally isomorphic to a ringed 
space of the form (G, 6) [3]. tRiemann's 
theorem on removable singularities holds for 
such spaces. Every normal analytic space is an 
analytic space in the sense of Behnke and 
Stein. An analytic covering space © =(G, o, G) 
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is a C-covering space (covering space in the 
sense of Cartan) if for every point z? € G there 
exist an open neighborhood V of Z" and a 
holomorphic function gin U = @ ! (V) which 
can be defined by an irreducible polynomial 
of degree k such that its coefficients are holo- 
morphic functions on V and the coefficient of 
its highest term is 1, where k is the number of 
sheets of (5, A C-analytic space is an analytic 
space in the sense of Behnke and Stein that is 
locally isomorphic to a C-covering space. The 
category of C-analytic spaces coincides with 
that of normal analytic spaces. According to 
H. Grauert and Remmert [ 10], every analytic 
covering space is a C-analytic covering space. 
Therefore every analytic space in the sense of 
Behnke and Stein is a normal analytic space. 
Let R be an equivalence relation in an ana- 
lytic space X. Given a subset A of X, denote 
by R[A]the set of points of X which are R- 
equivalent to points of A. We call R proper if 
R[K] is compact for every compact set K in 
X. Let 9 be a proper holomorphic mapping of 
an analytic space X into another Y. For x, 
X'E X, let x =x’(R) be defined by (x) = ol 
The equivalence relation R is then proper. We 
consider the quotient space X/R and the 
canonical projection p: X 2 X/R. With each 
open set U in the quotient space X /R we can 
associate the ring of holomorphic functions in 
p |(U) that are constant on p (X)for every x 
€ U. This leads to a ringed space (X/R, C, /R), 
which is proved to be an analytic space by 
Grauert's theorem [9]: All the ‘direct images of 
a coherent analytic sheaf over X by a proper 
holomorphic mapping 9: X — Y are coherent. 
For every proper equivalence relation R in X, 
the ringed space (X/R, Cy/R) is an analytic 
space if and only if for every point xc X/R 
there exists an open neighborhood V of x such 
that functions in I'( V, @,/R) separate the 
points of V (H. Cartan). 


F. Stein Spaces 


For an analytic space (X, My) let us consider 
the following conditions: (i) Functions in 

I(X, Oy) separate the points of X. (ii) X is K- 
complete; i.e., for every point xe X there exist a 
finite number of f€ (X, Oy) (i 2 1, , k) such 
that the holomorphic mapping f-—(f):X AC 
is nondegenerate at x. (iii) Every compact 
analytic set in X is a finite set. Condition (1) 
implies (ii), and (ii) implies (iii). If an irreducible 
analytic space X is K-complete, then X is a 
countable union of compact sets (Grauert 

[S]). In fact, if n= dim X, there exist functions 
fie V(X,€4) (i2 1, ...,n)such that the holo- 
morphic mapping f —(f): X 5C" is nondegen- 
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erate. The notion of holomorphic convexity 
(— 21 Analytic Functions of Several Com- 
plex Variables H) is carried over to analytic 
spaces. For a holomorphically convex analytic 
space, conditions (1), (ii), and (iii) are equivalent 
(Grauer [8]). A Stein space (or holomorphi- 
cally complete space) is a holomorphically 
convex analytic space that satisfies one of the 
conditions (1), (ii), or (iii). In a holomorphically 
convex analytic space (X, (,), let R be the 
equivalence relation defined by I(X, 6,); i.e., 
for x, x’ eX, x z x (R) if and only if f(x) = f(x’) 
for every f e l'(X,6,). Then R is proper. The 
analytic space (X/R, €, /R)is a Stein space. A 
Stein space is a generalization of the notion of 
a Stein manifold. Fundamental theorems A 
and B on Stein manifolds hold Verbatim for 
Stein spaces (— 21 Analytic Functions of 
Several Complex Variables L). Therefore the 
main properties of Stein manifolds are in- 
herited by Stein spaces. Let o: X 2 Y bea 
holomorphic mapping of an analytic space X 
into another, Y. If for every xe X all the con- 
nected components of the fibers o! (o(x)) are 
compact, then the equivalence relation R’ 
defined by those components (Le., for x and x’ 
in X, x = DA if and only if x and x' belong to 
the same component of @ !(o(x))) is proper, 
and the ringed space (X/R', &,/R') is an ana- 
lytic space. In particular, if X is a holomor- 
phically convex irreducible analytic space 

and R is the equivalence relation defined by 
F(X, Cy), then all the fibers of the canonical 
projection p: X 2 X/R are connected. 


G. Further Topics 


The notion of analytic space cap be gen- 
eralized as follows (Grauert [9]). A ringed 
space (X, Col is a general analytic space if it is 
locally isomorphic to a ringed space (A, 3), 
where A is an analytic set in a domain G € C", 
and A, = (ei Zil, for some coherent analytic 
subsheaf f of .¥(A) such that Supp(Cg/ 7) 
(={z€G (65/4), #0})= A An analytic sub- 
space of (X, (/,) is a ringed space (Y, Oy) where 
Y = Supp((/,/.7) and Oy = Dal? for some co- 
herent sheaf of ideals $ of ©y. (Y, (,) is also a 
general analytic space. A. Douady [6] showed 
that for any general analytic space (X, (y) 
there exists a natural structure of a general 
analytic space on the totality of all the com- 
pact analytic subspaces of (X, €). The result- 
ing analytic space is called the Douady space 
of (X, Gel, For the proof, the notion of a 
Banach analytic space is used, which is ob- 
tained by first defining analytic subspaces in 
an open subset of a complex ‘Banach space 
and then patching them. A Douady space can 
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be used, for example, to show that the identity 
component of the analytic automorphism 
group of a compact Kahler manifold is a 
‘complex Lie group which is naturally an 
extension of a *complex torus by a "mear 
algebraic group (A. Fujiki, D. Lieberman; = 
232 Kahler Manifolds C). 
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24 (Xx1.1) 
Ancient Mathematics 


A. General Remarks 


To determine the beginning of the history of 
mathematics, one must detine the term “math- 
ematics.” Only speculation based on the ob- 
servation of primitive peoples today can be 
made regarding the development of the number 
concept among prehistoric peoples. The pre- 
historic period ended in Egypt and Meso- 
potamia c. 3000 B.c., and a little later in the 
valleys of the large rivers in India and China. 

Since the basis of the civilizations in the 
river valleys of the ancient world was agricul- 
ture, the administrators first had to control 
watering systems through irrigation, drainage, 
pumping, and canalization; second, they had 
to measure land and harvests for tax collec- 
tion; and third, they had to establish a calen- 
dar by observation of the heavenly bodies. 

All these tasks demanded some knowledge 
of mathematics. 

Additional knowledge of mathematics was 
certainly needed for construction of the pal- 
aces and tombs. We know something of the 
development of mathematical knowledge 
during these ages from some recovered arti- 
facts, but there remains the possibility of new 
finds that will bring about a basic change in 
our knowledge of the history of mathematics 
during this period. 


B. Mathematics in Egypt 


The main sources for our understanding of the 
history of mathematics in Egypt are the Mos- 
cow papyrus and the more important Rhind 
papyrus, both discovered in the 19th Century. 
The Greeks place the origin of their mathe- 
matics in Egypt, but it seems that Egyptian 
mathematics was limited to practical mathe- 
matics. The Egyptians had a decimal numera- 
tion system, but the place value was not clear; 
they used fractions, which they always decom- 
posed into the sums of unit fractions (i.e., frac- 
tions with 1 as numerator); they solved the 
problems of everyday arithmetic that were 
reducible to linear equations; they computed 
approximate areas and volumes of some fig- 
ures for the purpose of measurement of farm- 
land or granaries and for construction work; 
they had exact formulas for the computation 
of areas of triangles and of trapezoids; and 
they used (16/9)? 23.1605 as the value of 
T; but no trace has been found to prove the 
existence of demonstrative mathematics in 
ancient Egypt. 
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C. Mathematics in Mesopotamia 


Sources abound for the study of Mesopota- 
mian mathematics, and these sources may 
very well increase in the future. The Meso- 
potamians kept exact records of astronomical 
observations for long periods of time. Their 
more advanced mathematics was not limited 
to practical use, as was that of the Egyptians. 
They used a sexagesimal system of numeration 
with place value, and also used sexagesimal 
fractions; however, they lacked a cipher to 
denote zero until the 4th Century B.C., and they 
did not have a symbol corresponding to our 
decimal point, so that exact place value had to 
be determined from the context of each ex- 
pression. They had a multiplication table and 
tables of inverses, squares, and cubes of num- 
bers, and they used these tables to solve equa- 
tions, even some simple equations of the third 
degree, as well as simultaneous equations of 
the second degree for two unknowns. They 
had accurate solutions for quadratic equations 
(expressed in words); they discarded negative 
roots, but they admitted both positive roots 
when two existed. They studied integral solu- 
tions of oi +b? = c? (the largest of their solu- 
tions were 12,709, 13,500, and 18,541) and 
approximate computation of quadratic roots. 
which suggests some relation to Greek mathe- 
matics. We have evidence that some of the 
geometric algebra in Euclid’s Elements can be 
traced to Mesopotamian algebra. Some his- 
torians also affirm that the concept of dem- 
onstration in Greek mathematics originated 
with the Mesopotamians, but this theory lacks 
suffcient proof. 

By the 7th Century, the Mayas in Central 
America also possessed a numeration system, 
with the base 20. As far as we know, the Meso- 
potamians and the Mayas were the earliest 
people to possess numeration systems with 
place value (- 57 Chinese Mathematics, 187 
Greek Mathematics, 209 Indian Mathematics). 
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Approximation Methods in 
Physics 


A. Introduction 


It is often possible to solve a differential equa- 
tion of mathematical physics analytically by 
expanding the solution in a series with respect 
to a small (or large) parameter involved in the 
equation. In general, such an approach is 
called a perturbation method (- Section B). 
For an equation which is difficult to treat 
analytically, it is possible to get a numerical 
solution either by replacing the derivatives by 
difference quotients (*difference method), or by 
expanding the solution in terms of suitable 
functions and determining the expansion coef- 
ficients numerically from the equation re- 
written in a weak form ('variational method, 
*method of weighted residuals, *Galerkin's 
method, ‘finite-element method, etc.). For such 
numerical methods — 46 Calculus of Vari- 
ations, 301 Numerical Solution of Algebraic 
Equations, 302 Numerical Solution of Linear 
Equations, 303 Numerical Solution of Ordi- 
nary Differential Equations, 304 Numerical 
Solution of Partial Differential Equations, 441 
Variational Principles. 

In Section C we describe a method for deriv- 
ing an asymptotic expression for a function 
written as an integral in a complex plane. 


B. Perturbation Method 


(1) Regular Perturbation. (a) Initial-value prob- 
lem. Consider an initial-value problem of an 
ordinary differential equation involving a 
small parameter €; 


du 
4p | 559. (1) 


u=b(e) at t=0. (2) 


We assume that f is sufficiently smooth as a 
function of t and is regular as a function of u 
and £, and that b is a regular function of c. 

Substituting the power series in « for Lu 
and b, 


f(t, we) = f Ot, u) +f t, u) +, 
ult; e) - ut) + eu Xt) + . ..) 
bel = BO + cpu, 
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into equations (1) and (2) and equating terms 
of equal power, we get 


(0) 
du^ ftu), — 0) Mi 
dt s A > 
du"' 0 0 1 1 0 
1 =f! Xt, uí ul ff Xt, u! d ui = pO), 
(2) 
du = f Xt uu?) gon uu? 
dt SET Seck 


+ fOr, uu + Ve, uv), 
u?X0) = po, 


the solutions of which determine the perturba- 

tion series for y, The equation for wll (the 

unperturbed equation) is nonlinear in general, 

but those for ul). vil are inhomogeneous 

linear equations with identical principal parts. 
The system of differential equations 


du; : 
de ete ty 6) (i 1,2, ...,9), 
u; = b;(e) at t=0, 


or a differential equation of higher order, 


d"u d" y 
Hun eee) 


d 
uc bc) , ..., TERMI at r=0, 





can be treated in a similar way. 

(b) Boundary-value problem. Given a linear 
differential operator H = H) +eV involving 
a small parameter ¢ linearly (Hp is a linear 
differential operator and Vis a function of 


(x4, X5, X.)), consider the boundary-value 
problem 

(Hy Lei f(xy,...,X,)  (x,,...,x)eQ, (3) 
Bu=0  (x,...,x,Je6Q, (4) 


where B is a linear and homogeneous 
operator. 
Substitution of the Taylor expansion 


U=Upteu, + 

into equations (3) and (4) gives 
Hougy—f, Bu, =9, 
Hyu,t+Vug=0, Bu, =0, 


of which the solutions are 
uo—-Kf, u,=—KVKf, 
u;— KVKVKf,.... 


Here K = Hy lis an integral operator such that, 
for any function p= v(X,, X5, , x), we have 
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G(x pt XR Gista 
ad [Čist Sn) 


X v(i s ede, de, 


where G(x,, ..., Kal, Ča) is the Green's 
function for the unperturbed problem. The 
perturbation series 


u=Kf—eKVKf+e*KVKVKf—... 

converges if VK is bounded and |e| « 1/| VK ||. 
(c) Eigenvalue problem. The solution of an 

eigenvalue problem 

(Hg t £V)u Au (5) 


with the boundary condition Bu = O for the 
operators considered in (b) can be obtained in 
power series in e; 


Je Ag tea p e? AC) 4 ..., 

u — ug t e + e? u? s, 

where uy and 4, are the eigenfunction and 
nondegenerate eigenvalue, respectively, of the 
unperturbed operator Hy, which is assumed to 


be self-adjoint. Substitution of the series into 
(5) gives 

(H,— 4g = — Vu, +2 up, (6) 
(Hy —ÀguO? = Vu) AP ug FAD YD 

(Hy — iou we Vu + AP ug + 40940) 


$Y), 


If we normalize u, and H by the conditions 
(ug, ug) = 1 and (u, ug) = 1, we get from equation 
(6) 


AO = (ue, ug) =o. 


Let § be the inverse of Hy — Aal in the sub- 
space perpendicular to ug (we put Su, = 0), 
then each term of the perturbation series for u 
and 4 is obtained in the following way from 
the system of the equations given above: 


u= —SVu,, 

A?) = = (VS vu,, uj), 

i? = S(V—a)SVuo, 

A =(VS(V —a)SVug, Uo), 

u?) = — §(V—a)S(V—a)SVug 
LIKES ue, ug)S? Vus, 


(2) Singular Perturbation. Formal application 
of the procedure described in (1) often fails 
when the term including the highest-order 
derivative is multiplied by a small parameter 
(or a lower-order term is multiplied by a large 
parameter), i.e., when we deal with singular 


perturbation problems. Typical methods for 
treating such cases are described below. 

(a) WKB method. The method for getting an 
asymptotic solution of the second-order dif- 
ferential equation 
dzu 
deo k? P(x)u=0 (7) 
for large values of k was developed for prob- 
lems arising in classical wave motion (Jeffreys, 
1924 [1]) and for problems of quantum 
mechanics (Wentzel, Kramers, Brillouin, 

1926), and is called the Jeffreys method, WKB 
method, or WKBJ method. 

If the function P in (7) does not vary rapidly, 
the solution is expected to have the form 


uocexp (iko(x)], 


with @(x) not very different from a linear func- 
tion of x, since the exact solution for P = P, = 
const is exp( + ik / Py x). 

Substitution of this expression for 4 in equa- 
tion (7) gives 


iko" — k? o? +k?P=0. 
Neglecting the first term, we get 


X 
o? = P, 1.€., p(x) = | P 1/2 dx. 


This result suggests a transformation of the 
form 


X 
w= P^, | P1? dx, 


which gives 


d?w 
a HK —Q)w=0, (8) 
where 
d? 2 
Q- P^ li EC air 


Equation (8) can be transformed further into a 
Volterra's integral equation of the second 


kind: 
w(z; k) Ae + Be Te 


+f sin{k(z—{)} Q(C) )dt. 


Therefore, if we expand w as 


] 
w(z; le wot) Fr wi) ess (9) 
we have 
Wo(z) = Ae™ + Be”, 


us | ' ein(kiz- QV OwO 


Zo 
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If P is positive for the interval zọ &z & z, in 
question and has continuous derivatives up to 
second order, the foregoing series converges to 
the solution of (8). 

On the other hand, when k takes complex 
values, the series (9) does not converge in 
general. The series expansion appropriate for 
this case is 


w(z k)eexp(iy(z;k), p= tiers 
and we get the asymptotic expansion for w: 


w-exp tikztT x | 04 gs Q(z) 


7 2 
tgo 20- | Q i| 


(Q'2—20(2Yj) + | 





1 
Tiet 
Now, in many cases, P takes both positive 
and negative values within the interval in 
question, e.g., in the problem of transonic flow 
of a gas, in the problem of determining the 
reflectivity and transmissivity of a wave propa- 
gating in a nonuniform medium, or in the 
quantum-mechanical problem of a material 
wave passing through a potential wall. The 
point at which P vanishes is called a turning 
point. In this case there arises the problem of 
analytic continuation of the solution obtained 
in one side into the other through the turning 
point. For the special case when Pa x" (the 
turning point is chosen as x = 0), it can be 
shown that the exact solution of equation (7) is 
explicitly given by 


wo /z Z,(kz), 


where v = 1/(n + 2), and Z, is a cylindrical 

function of order v. Based on this fact, the 
continuation formula for the case when P 
is approximated by x" near x = 0 has been 
obtained. 

(b) Lighthill’s method. With nonlinear equa- 
tions, formal application of a perturbation 
procedure sometimes leads to the difficulty 
that the solution of the unperturbed equation 
has a singularity not exhibited by the exact 
solution of the original equation and whose 
order increases in higher approximations, SO 
that the perturbation series ceases to have any 
meaning near this point. Such a difficulty can 
often be overcome by so-called coordinate 
straining. This method has been applied to a 
number of fluid-dynamical problems, such 
as those arising in theories of aircraft wings, 
boundary layers, and shock waves. 

Before describing the method, it is conve- 
nient to sketch Poincaré% method of getting 
the perturbation serles for the period of a 
nonlinear oscillation. In order to solve the 
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nonlinear equation 


du 
—= f(u; To, £), 


dt 

where T, is the period of oscillation when the 
small parameter ¢ is taken to be zero, Poincaré 
wrote series expansions for both y and the 
period T: 


u-ug(t)--eu,(t) -..., | T- Toc eT, 9 ..., 


and determined T,, T}, .. . in such a manner as 
to avoid resonance at each step in the deter- 
mination of u,(t), u,(t), . . 

Based on this idea, Lighthill(1949) [2] 
developed coordinate straining in order to 
overcome the difficulty stated above. We illus- 
trate the method here by means of the follow- 
ing simple example. 

Consider a boundary-value problem of a 
nonlinear differential equation 


(teu) +u=0 (O<x< 1): (10) 


u(l)= 1. (11) 


Application of the usual perturbation method 
would give the series solution 


NN. 1 1 an 1 1 " 
meet eL e 
xc Ve aJ. 2x x? 


As is Seen, however, the order of an apparent 
singularity occurring at x = 0 increases as we 
proceed to higher approximations so that it is 
not possible to see the true behavior of the 
solution near x = 0. 

In Lighthill’s method, we assume that series 
expansion is valid not only for t but also for 
the independent variable x, i.e., 


u —ug( Sr eus (E) t... 


Substitution of these series into equations (10) 
and (11) gives 


x=€+ex,(G)4+.... 


$ug-uo—0, ug(1)=1, 


Zu, Tu, = — (uo Xi T UX, T uguo), 


u(1)-0,  x,(1)—0, 


From the first equation we have ug = 1/£. 
Before solving the second equation for u, , we 
determine x, (Č) so as to avoid the increase 
of the order of singularity in ug(£) at č =O. 
This requirement is satislied by taking x, — 
(1/2) (€ 1/2), and this gives u, = 0; thus, at 
this stage of approximation, we have 


1 
u—uo() Fei) 5, 


senec) 
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The guiding principle is the same for higher 
approximations. (The foregoing expression 
happens to be the exact solution, but this a 
fortuitous result of our choice of this particular 
example.) 

The convergence of the series obtained by 
Lighthill’s method has been proved by Wasow 
(1955) [3] for the more general problem 


(x+ uc T q(x)u 2 r(x), 


where e, a, and b are positive constants and 
q(x) and r(x) are functions regular in |x| <a. 

Lighthill’s method can also be applied to 
partial differential equations. Sometimes it is 
called the PLK method (Poincaré-Lighthill-Kuo 
method), after its successful application to the 
problem of the boundary layer of a thin flat 
plate by Kuo. 

(c) Method of matched asymptotic expan- 
sions. In some cases it is possible to get a uni- 
formly valid series solution if we divide the 
domain into two or more subdomains, solve 
the equation by use of a suitable independent 
variable for each subdomain, and then deter- 
mine the coefficients in each solution by the 
process of matching neighboring solutions 
on their common boundary. Such a perturba- 
tion technique was developed in treating the 
boundary-layer equation in fluid dynamics 
and has been systematized into the method 
of matched asymptotic expansions. 

The idea of the method can be shown by 
the following simple example. Consider the 
boundary-value problem 


(0« x « 1), 


wi — 0, u(1)=1, (12) 


where £ is a small positive parameter and a is a 
constant such that 0 «a « 1. In order to solve 
this problem, we first take x itself to be an 
independent variable (the outer variable) in the 
domain x >> g and expand the solution as 


u= Us) +eU,(x)+ 


Because the highest derivative is multiplied by 
a small parameter g in equation (12), the per- 
turbation equations to be solved are all of first 
order, of which the first one is 


dU, 


Sg Wil) 


Solving this, we get an approximation for the 
outer solution: 
Us(x) 2 1 — a(1— x). 


On the other hand, we introduce a new 
independent variable Z (the inner variable) by 
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putting x = ec for x ze This transforms equa- 
tion (12) into 
d'r dr 
— + EA, 
d? dé 
If we assume the expansion 


v= Vy) eV)... 


we have the equation for Vp: 
d^V, dV, 
nmt 
di^ dë 
This gives an approximation for the inner 
solution: 


Lille" 

Finally, in order to match the inner and the 
outer solutions, we equate their values at some 
point within the interval ¿<< x << 1 (£ oo for 
the inner solution, x 0 for the outer solution): 


Vo(o0) - Us(0) — 1 -a, 


V,(0) — 0. 


from which the coefficient in V; is determined 
to bec = 1 -a. Therefore we have an approxi- 
mate solution valid in the whole interval 
O<x<l: 


ux Datz Vo($) - -a) 
—ax-(1—a)(1—e ^). 


C. Method of Steepest Descent 


If an analytic function f(z) of a complex vari- 
able z is expressed in terms of an analytic 
function g(t) by an integral 


ra= exp{zg(t)} dt, (13) 


where C is a curve on the complex t-plane, 
then an asymptotic expression for f(z) for 
large values of |z| can be derived by the 

method of steepest descent (the saddle-point 
method). The idea of the method may be 
traced back to Riemann, and various asymp- 
totic expressions for cylindrical functions were 
obtained by Debye (1909) [5]. 

The point at which the first derivative of g(t) 
in the integral (13) vanishes is called a saddle 
point. The function g(t) is expanded in a power 
series near this point as 


1 
g(t) — q(to) + 7.9 (to) (t — tg? +.. 


We have an inequality 
zg'(to)(t — to) «0 


along the line 


T 
L: arg(t — t9) 2 — — zargízg" (to) ]. (14) 
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Therefore the absolute value of the integrand 
exp{zg(t)} reaches a maximum at tọ on the 
line L and decreases along it more rapidly 
than along any other direction. Hence if we 
deform the integration path in such a way 
that it passes through the saddle point tọ in 
the direction of steepest descent, the value of 
the integrand is practically zero on the new 
path except very near to tọ when |z] is large, 
whereas the value of the integral remains the 
same. Therefore we can get an asymptotic 
expression of f(z) for large |z| by truncating 
the Taylor series for g(t) up to the second term 
and taking the line L as the integration path: 


1 
rosene! exp rar tc?) d 


2n 
= re expizg(to)j- 


As an example Stirling’s formula 
n!z./2nn n'e" 


for a large positive integer is derived if we start 
with the formula 


n!= | pU | e" "(ns)" d(ns) 


0 
0 
=p"! | exp{n(logs—s)} ds 
D 


and make use of the method of steepest descent. 
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26 (Xx1.4) 
Arab Mathematics 


The role of the Arabs in cultural history has 
been partly that of cultural transmitter. Between 
the 7th and 13th centuries, they established a 
religious empire that extended from India to 
Spain; later it was divided into the eastern and 
western empires. The caliphs of these empires 
encouraged research in the sciences, so the 
capitals Baghdad and Cordova became centers 
of culture where scholars from different coun- 
tries gathered. 

Arabic scholarship is sometimes called the 
stepfather of European culture. During the 
13th Century, Alphonso X (1252- 1284) invited 
Islamic and Hebrew scholars to the Spanish 
court to translate their writings on algebra, 
medicine, and astronomy into Spanish. This 
accomplishment earned him the title of AL 
phonso the Wise. 

The first contact between Greek and Indian 
mathematics took place in Baghdad under 
Caliph Al-Mansur (754-775); Euclid's Ele- 
ments was introduced by way of the Byzantine 
Empire, while Brahmagupta's Brahmasphuta- 
siddhanta came directly from India. Many 
mathematical texts found in the Eastern Ro- 
man Empire and Syria, including some Greek 
works, were translated into Arabic. Though it 
is difficult to discern essential scientific ad- 
vances in Arabian works, the diffusion of 
these translations was instrumental in the 
development of European mathematics. 

The Arabs did not use written numerals 
until Mohammed's time (570—632). Signs 
representing numbers had been introduced 
into Arabia when its influence encompassed 
Egypt and Greece. Indian numerals were 
imported with Brahmagupta's book and 
became our present Árabic numerals after a 
series of modifications. 

Among all the branches of Arab mathe- 
matics, algebra was the most advanced. [t 
started with Alkwarizmi's (820) Al Gebr W'al 
Muquabala, the origin of the word "algebra." 
It was the first mathematical book written in 
Arabic. Its content was essentially a variety of 
methods of solving algebraic equations. AI 
gebr means "transposition of negative terms 
on one side of the equation to the other side 
and changing their signs." and al muqua- 
bala means "simplification of the equation by 
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gathering similar terms." For example, qua- 
dratic equations can be brought by these 
methods into one of the following three types: 
x? = px+q; x? +q=px; x? + px =q, where p, d 
are positive numbers. The Arabs expressed the 
rule of solving these equations verbally. They 
apparently knew that quadratic equations 
have two roots, but they adopted only posi- 
tive roots; when the equation had two posi- 
tive roots they adopted the smaller root. The 
proof was given geometrically. It is possible 
that they learned geometric proofs from the 
Greeks. 

For the Arabs, geometry was secondary to 
algebra. They did not appreciate proof as seen 
in Euclid’s Elements. The book on conic sec- 
tions by Apollonius was also translated into 
Arabic, but no essential progress was made in 
this area, The only remarkable contribution 
was that of Omar Khayyam, author of The 
Rubáiyát, who applied conic sections to the 
solution of the cubic equation x? 4- bx = a. 

In trigonometry, Al Battani (c. 858-929) left 
a notable contribution. He studied The Almag- 
est, the Arabic translation of Ptolemy's astro- 
nomical work. He added nothing outstand- 
ing to plane trigonometry, but obtained such 
formulas as cos a= cos b cosc + sin b sinc œs a 
for spherical triangles, which were not men- 
tioned in The Almagest. 
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A. General Theory 


Let g be a tDedekind domain (i.e., an integral 
domain in which every ideal is uniquely de- 
composed into a product of prime ideals), let F 
be the field of quotients of g, and let A be a 
‘separable algebra of finite degree over F. A g- 
lattice a of A is a g-submodule of A that is 
linitely generated over g and satisfies A = Fa. If 
a subring o of A is a g-lattice containing g, 

then o is called an order. A maximal order is 

an order that is not contained in any other 
order. A maximal order always exists although 
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it may not be unique; in particular, if A is 
commutative, it has only one maximal order. 
Ifweput o,2(xeA|xacaj (o, 2 (xeA|axc 
a}) for a g-lattice a of A, then o, (0,) is an 

order of A and is called the left (right) order of 
a. If the left order of a is maximal, then so 1s 

the right order; the converse is also true. If 0, 
and p, are maximal, we call a a normai g- 
lattice. To describe the same situation we say 
that a is a left p;-ideal or a right o,-ideal. If 

0; = 9, = 0, then we say that a is a two-sided p- 
ideal. A lattice a with a Co, (or equivalently 
with a c p) is called an integral g-lattice or 
integral left (right) ideal. The product ab of 
two normal g-lattices a, b is called a proper 
product if the right order of a coincides with 
the left order of b. The proper product defines 
the structure of a tgroupoid on the set of all 
normal g-lattices of A. In particular, the in- 
verse a 1 of a normal g-lattice a satisfying 
aa !—0,a 'a=0, is given bya ! 2 (xeA|xa 
Colis (x cA axcor}. For a fixed maximal 
order o, a maximal integral two-sided o-ideal p 
different from o is called a prime ideal of o. If p 
is prime, then o/p is the matrix algebra of 
degree « over a division algebra, and xk is 
called the capacity of the prime ideal p. The set 
of all two-sided o-ideals forms a multiplicative 
group, of which prime ideals are independent 
generators. 


B. Maximal Orders of a Simple Ring 


In the rest of this article, A is a tsimple ring, F 
is the *center of A, and o is a maximal order of 
A. The prime ideals q of o and the prime ideals 
p of g are in one-to-one correspondence by 
the relation qN g= P; o/q is a simple algebra 
over g/p, and q°= po for some natural num- 
ber e. The different d of o is defined by b ! = 
(xe Al Tr(xo)c gj. (Tr is the treduced trace 
from A to F;D is an integral two-sided o-ideal, 
and is divisible by q*^! if qf = po.) For q to 
divide D, it is necessary and sufficient that 
either e > 1 or o/q not be separable over q/p. 
In particular, if A is a "total matrix algebra 
over F,then d= o, q = po. The ideal of g gener- 
ated by the treduced norms (to F) of the ele- 
ments in a normal g-lattice a of A is denoted by 
N,,,(a). If ab is a proper product, then we have 
Ngjp(ab) = Na4jp(a)N4p(b), where N,4,,(D) does 
not depend on the choice of o; this is called 

the discriminant of A. If [A : F] = n? and q°= 
po, then Narla) = p^, ef =n. 


C. Simple Rings over a Local Field 


Let F be a field that is complete with respect 
to a ‘discrete valuation whose field of residue 
classes is finite, Let g be the *valuation ring of 
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F, p be the maximal ideal of g, and A be a 
simple algebra with F as its center. If A is a 
‘division algebra and LA : F] = n’, then A has 
only one maximal order o, and o has only one 
prime ideal q such that q"= po; o/q is an exten- 
sion of degree n of g/p. If A is not necessarily a 
division algebra, the relation £o£ ! = o° with 
an element ë of A holds between two maximal 
orders o and o' of A. Furthermore, for any left 
(right) o-ideal a, there exists an element & such 
that a = og (a = a0). By using the notation of 
teyclic algebra, we Can express A in the form 
(K,, c, n”). Here K, is the unramified extension 
of degree n over F, g is the tFrobenius substi- 
tution of K,/F, 1 is a prime element of F, and 
0 € v <n. The element of Q/Z determined by 
r/n (mod Z) is denoted by {A} and is called the 
tHasse invariant of the algebra class contain- 
ing A. The mapping A -{A} gives an isomor- 
phism of the *Brauer group of F onto the 
additive group Q/Z. If M is an extension of F 
of finite degree, then (A"] = [M: F] {A} holds 
for the algebra A™ obtained from A by scalar 
extension (- 29 Associative Algebras). 


D. Simple Rings over an Algebraic Number 
Field 


Let F be an algebraic number field of finite 
degree, and let A be a simple algebra with 
center F. Then A is a cyclic algebra and is 
isomorphic to a total matrix algebra over a 
division algebra D. The order n of the algebra 
class of A over F is determined by n? = [D : F] 
(H. Hasse, R. Brauer, and E. Noether). 

Denote by F, the completion of F with 
respect to a tprime divisor p of F, and let A, be 
the algebra obtained from A by the scalar 
extension F, over F. For a finite prime divisor 
p, the meaning of IA is as before; for an 
infinite prime divisor p, put {A,} = 0 or1/2 
(mod Z) according as A, is a total matrix alge- 
bra over F, or not. Furthermore, define the 
subgroup J, of Q/Z by 


J, =Q/Z, p a finite prime 


divisor, 


p areal infinite 
prime divisor, 


= (0, 1/2 (mod Z)} 


= {0}, 


p a complex infinite 
prime divisor. 


Now let J be the subgroup of the direct prod- 
uct II, J, consisting of all elements of the 

form (a,) (x, € J,) such that æ, = 0 except for a 
finite number of prime divisors and Ya, = 0. 
Then A+/( f A,}) gives rise to an isomorphism 
of the Brauer group over F onto J (Hasse). 
Each {A,/ is called the tp-invariant of A. In 
particular, A is a total matrix algebra over F if 


27E 


Arithmetic of Associative Algebras 


and only if A, is a total matrix algebra over 

F, for all p (- 29 Associative Algebras). These 
theorems are closely related to fclass field 
theory. 

If o is a maximal order of A and a, b are left 
o-ideals, then ac = b with an element Z of A 
defines an equivalence relation between a and 
b. The number of equivalence classes of left 
ideals with respect to this equivalence is called 
the class number of A; it is independent of the 
choice of o and is equal to the class number 
defined by using right ideals. The product of 
all real infinite prime divisors p with {A,} = 1/2 
is denoted by P. If P, is the product of all 
inlinite prime divisors and [A : F] = 4, then A is 
called a totally definite quaternion algebra. 

If o is a maximal order of A and A is not a 
totally detinite quaternion algebra, then a^ 
N,,,(a) gives a one-to-one correspondence 
between the classes of left o-ideals and the 
congruence classes of ideals of F modulo P. 
(- 14 Algebraic Number Fields H) (Eichler's 
theorem). 

In particular, if A is a total matrix algebra 
over F, then the class number of A is equal to 
the class number of F. The class number of a 
totally detinite quaternion algebra was deter- 
mined by M. Eichler by using the zeta function 
(- Section F) of A [4]. 

Let o be a maximal order of A, a be an 
integral two-sided o-ideal, b be an integer of F, 
and t be an element of o. Furthermore, assume 
b = 1 (mod P,), Na;r(&)= b (mod a N F) (tmulti- 
plicative congruence). Then there exists an 
element f of o such that Nyjp(f) = b, B 6 
(mod a) (Nr is the reduced norm), provided 
that A is not a totally definite quaternion 
algebra [5]. This theorem, which is called 
Eichler's approximation theorem, is widely 
applicable; e.g., it yields the previous theorem 
on the class number and can be generalized to 
the case of semisimple talgebraic groups (— 13 
Algebraic Groups). 


E. Algebras over a Function Field 


The Hasse-Brauer-Noether and Hasse theo- 
rems also hold for *normal simple algebras 
over a ‘field of algebraic functions of one vari- 
able over a finite field. On the other hand, a 
normal simple algebra over a tield K of alge- 
braic functions of one variable over an alge- 
braically closed field is a total matrix algebra 
over K (Tsen's theorem). 


F. Other Notions 


Adeles and ideles for a simple algebra A over 
an algebraic number field of finite degree can 
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be introduced as in the case of number fields 
(— 6 Adeles and Ideles). If N(a) stands for the 
number of elements in o/a, where o is a maxi- 
mal order of A and a is an integral left o-ideal, 
then the zeta function of the simple algebra A, 
called the Hey zeta function, is detined by C ,(s) 
=}, N(a) ? (the sum over all integral left o- 
ideals). This function has properties similar to 
those of Dedekind zeta functions (- 450 Zeta 
Functions L). Let p be an infinite prime divisor 
of the center F of A, and ]et G, be the group of 
elements in A, with the reduced norm 1. Put 
G = J], G, (the product over all infinite prime 
divisors of F). Then the group lU of units with 
the reduced norm 1 in o is naturally regarded 
as a subgroup of G. To be more precise, [is a 
discrete subgroup of G, the volume of G/T is 
finite with respect to an invariant measure, 
and G/T is compact if and only if A is a divi- 
sion algebra (— 122 Discontinuous Groups). 
This result can be viewed as a special case of 
more general facts about semisimple algebraic 
groups (- 13 Algebraic Groups). If K is a 
*maximal compact subgroup of G, then T gives 
rise to a ‘discontinuous group operating on 
the homogeneous space G/K, and we obtain 
tautomorphic forms with respect to I, The 
case where A is a tquaternion algebra has 
been studied extensively (- 32 Automorphic 
Functions). 
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28 (XXl1 3) 
Artin, Emil 


Emil Artin (March 3, 1898- December 20, 
1962) was born in Vienna. After he studied at 
the Universities of Vienna, Leipzig, and Gót- 
tingen, he taught at the University of Ham- 
burg from 1923 to 1937. In 1937 he left Ger- 
many under the Nazi regime for America, 
where he taught at Notre Dame, Indiana, and 
Princeton universities. He returned to Ger- 
many in 1948 and taught again at the Univer- 
sity of Hamburg until his fatal heart attack. 
In his thesis (1923), he proved affirmatively 
the *Riemann hypothesis in function fields 
over finite constant fields in the hyperelliptic 
case, and conjectured that this should be valid 
in general. This was eventually verified by A. 
Weil in 1941. Also in 1923, he introduced the 
*L-function for Galois extension and was led 
to the tgeneral law of reciprocity, which he 
stated as a conjecture and proved four years 
later, thus bringing tclass field theory to com- 
pletion. Around the same period, he estab- 
lished in collaboration with 0. Schreier the 
theory of tformally real fields and solved the 
17th *problem of Hilbert. He also initiated the 
theory of *braids, which he later developed 
with F. Bohnenblust in the 1940s. In view of 
these and other ingenious works as well as 
his inspiring teaching in all areas of mathe- 
matics, Artin is considered to have been one of 
the most influential mathematicians of this 
Century. 
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Associative Algebras 


A. Fundamental Concepts 


Let K be a commutative ring with unity ele- 
ment 1 (- 368 Rings A), and let A be a ring 
which is a *unitary K-module (- 277 Mod- 
ules). Such a ring A is called an associative 
algebra over K (or simply algebra over K) if 

it satisfies the condition 4(ab) = (4a)b = a(Ab) 
(2e K; a, be A), An (associative) algebra A over 
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K is often written A/K, and K is called the 
coefficient ring (or ground ring) of the algebra 
A= AJK. In particular, if K is a tfield, then it is 
called the coefficient field (or ground field) of A. 
Algebras over fields have been studied in de- 
tail. Notions such as zero algebra, unitary 
algebra, commutative algebra, (semi) simple 
algebra, and division algebra are replicas of the 
respective ones for rings (- 368 Rings). Con- 
sidering both structures as rings and as K- 
modules, homomorphisms and isomorphisms 
are defined in a natural manner, and are called 
algebra homomorphism and algebra jsomor- 
phism, respectively. In this connection, sub- 
algebra, quotient algebra (or residue class 
algebra), and direct product of algebras are 
also defined as in the case of rings. 

An tideal of an algebra A is defined as an 
ideal of the ring A, which is at the same time a 
submodule of the module A over the coeffi- 
cient ring. The fradical of an algebra A con- 
sidered as a ring is then an ideal of A in this 
sense. In fact, the existence of a unity in an 
algebra A implies that any ideal of A consid- 
ered as a ring is necessarily an ideal of the 
algebra A. 

In the rest of this article, we assume that all 
rings have a unity element and that all homo- 
morphisms are unitary. Hence, when we con- 
sider subalgebras of an algebra A, we require 
that they share the unity element with A. If e is 
the unity element of an algebra A over K, then 
the mapping 4— Je = 4' (4€ K) is a homomor- 
phism K - A whose image Ke is contained in 
the *center of A, and the scalar multiplication 
Aa is equal to the ring multiplication 4'a (4e K, 
AE A). Conversely, given a homomorphism of 
K into a ring A whose image is contained in 
the center of A, we can regard A as an algebra 
over K in an obvious way. Hence we are given 
an algebra A over K if and only if there exists 
a pair (A, p) of a ring A and a homomorphism 
p: K 4A whose image is contained in the 
center of A. There exists a uniquely determined 
(unitary) homomorphism of the ring Z of 
rational integers into any ring; hence any ring 
can be regarded as an algebra Over Z. If the 
coefficient ring K of a nonzero algebra A is a 
field, then K can be regarded as a subfield 
contained in the center of A, and the unity 
element | of K coincides with the unity ele- 
ment of A. 

Let A and B be algebras over K. Then the 
'tensor product A ®, B of K-modules is an 
algebra over K under the multiplication 
(a &b)(a' & b’) zaa' @ bb’ (a, a' €A, b, b'e B) 
This algebra is called the tensor product of 
algebras A and B. Moreover, the mapping 
a4a Q 1 (resp. b21 @ b) (ae A, be B) gives an 
algebra homomorphism A-/ A @ x B (resp. B^ 
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A Gy B, which is called the canonical homo- 
morphism. In particular, if A is a commutative 
algebra, then A (9, B can be regarded as an 
algebra over A by this canonical homomor- 
phism; and in this case A (y B is called the 
algebra obtained by extension of the coeffi- 
cient ring of B to A (or a scalar extension of B 
by AJ, and is often denoted by B^. The algebra 
K Gy B is canonically isomorphic to B. Fur- 
thermore, (A Gy B) & C and A (B Qg C) 
are canonically isomorphic and are written 

A & B &) C. Let A and B be commutative 
algebras over K. Then, for any commutative 
algebra C and homomorphisms a : A+ C, fj: B 
>C, there exists one and only one homomor- 
phism y: A Gy BC such that a(a) — y(a Q 1), 
B(b) = Y( 1 & b) (ae A, b e B). This property char- 
acterizes the tensor product A & B of com- 
mutative algebras A and B. In this sense, 

A Gy B is sometimes called the coproduct of A 
and B (- 52 Categories and Functors E). 


B. Examples of Associative Algebras 


As we mentioned, any ring can be regarded as 
an algebra over the ring Z of rational integers. 
But it is often useful to deal with algebras over 
“large? or more “efficient” coefficient rings. 
For instance, we have many rings which are 
algebras Over a commutative ring K, such as 
the fring of polynomials, the tring of formal 
power series in n variables with coefficients in 
K, the tendomorphism ring of a K-module, 
and the tfull matrix ring of degree n over K 

(- 368 Rings C). There are other important 
classes of algebras, such as (semi) group alge- 
bras, Hecke algebras, and crossed-product 
algebras, which wil] be explained later., These 
algebras are defined by a canonical basis con- 
nected directly with a (semi) group structure. 
On the other hand, the ttensor algebra and 
the exterior algebra of linear spaces and the 
*Clifford algebra associated with a given 
quadratic form are also important (- 61 
Clifford Algebras, 256 Linear Spaces). 

The most frequently used example of a 
division algebra is the quaternion field H (often 
called Hamilton’s quaternion algebra, W. R. 
Hamilton, 1858). This is a 4-dimensional 
linear space over the real number field R with 
basis ( 1, i,j,k}, with the following laws of 
multiplication: 1 is a unity element, i? 2j? = k? 
= —1,ij=—ji=k, jk= —kj=i, and ki- —ik =j. 
An element of H is called a quaternion. The 
only finite-dimensional division algebras over 
the real number field R are the real number 
field R, the complex number field C, and the 
quaternion field H. 
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C. Group Algebras and Hecke Algebras 


Let K be a commutative ring and KI" be the 
direct sum >',.g K, of modules K,, where each 
K, is isomorphic to K with a group G as the 
index set (- 277 Modules F). The elements of 
K'9 are the families (A,)seg of elements of K 
whose components are all zero except for a 
finite number of them. Let {u,}seg be the 
canonical basis of KI) namely, u, is an ele- 
ment of K‘@) of which the sth component is 1 
and the others are 0. The module K ©) has the 
structure of an algebra over K, where the law 
of multiplication is determined by u,u, = 

us(s, t€ C). This algebra is called the group 
algebra of G over K. The product 4 x y of 
elements 4 = (4) and u= (u) of K is then 
given by 


(sp) Y, Ag. set (1) 


s=rl 


Each basis element u, is often identified with 
the group element s, and in this manner, the 
group G is regarded as a basis of K‘, which is 
usually written KG or K[G]. 

In this definition, the group G can be re- 
placed by a semigroup G, and then the algebra 
K is called a semigroup algebra. As an 
example, let N be the additive semigroup of all 
nonnegative rational integers and N” be the 
direct product of n copies of N. If we denote 
the elements (i,, , i) of N” by Xj Xj» and 
use multiplication instead of addition, then the 
semigroup algebra of N" over K is exactly the 
tring of polynomials K [X,,-, X,]. On the 
other hand, if G is a semigroup, then even 
when it is infinite, it may occur that for any 
SE G, there exists only a finite number of pairs 
(v, l) of elements of G such that s = rl. In this 
case, formula (1) also defines the law of multi- 
plication on the *Cartesian product Kĉ. This 
algebra is called a large semigroup algebra 
and contains KI) as a subalgebra. In par- 
ticular, the large semigroup algebra of N” 
is exactly the tring of formal power series 
KIEX (tss Xe li], 

Let H be a subgroup of a group G, and 
assume that the index of H (1sHs ' in H is 
fmite for any s € G. This assumption is equiva- 
lent to the condition that any double coset of 
G by H is a union of a finite number of left 
as well as right cosets. Let H\G, G/H, and 
H\G/H be the set of all right cosets, ‘left 
cosets, and *double cosets, respectively. Then 
each element of the direct sum K^) KOM, 
or KIT DN can be regarded as a function 
defined on G taking a constant value on each 
right, left, or double coset, respectively. Con- 
versely, any function defined on G can be 


regarded as an element of. KIT K(6/I or 
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KIT 6/H) i£ it takes a constant value on each 
left, right, or double coset, respectively, and if 
it vanishes everywhere except on a finite num- 
ber of cosets of respective type. Under such 
identification, let 4., 4,, A, and 4, denote the 
values of a function 4:G— K on seG,le H\G, 
r cG/H,andt e H \G/H, respectively. For any 
A, pe K 9/7 we define a function J » u by 
su 2 Ath, SEG, (2) 
where the right-hand side is the sum taken 
over all pairs (r, 1) such that serl, re G/H, 

le H\ G. It can be shown that this sum is a 
tinite sum and that 4 x ue KI DH) Hence the 
module K^ 6/ has the structure of an alge- 
bra over K, which is called the Hecke alge- 
bra of (G, H) over K and often denoted by 
H;,(G, H). I£ H = fe) the j£, (G, H) is exactly 
the group algebra of G. In general, #,(G, H) 
can be regarded as an algebra obtained by 
extending the coefficient ring of HAG, H) to 
K. Furthermore, K ^ (K (G/ Hh can be re- 
garded as a right (left) module over the group 
algebra KI and the endomorphism ring 
Egio( K ^9) (resp. duc! Kl" Hlp is canonically 
isomorphic (anti-isomorphic) to the Hecke 
algebra KI GH 


D. General Crossed Product 


Let G be a group that operates on a commuta- 
tive ring L, and denote the operation by (s, 4)> 
s(a) (se G, Ae L); thus for any se G the map- 
ping 4-»s(A) (4e L) is an automorphism of £, 
satisfying s(t(4)) = st(4) (s, te G). For any 4, 

pe L'9), we define the product 4 a ue UD) by 


zu =), AME) L(t), se G, (3) 


where { f(r,1)j, «c is a given family of elements 
of L. If this family satisfies the equations 


f(s, r) f (sr, l) = s( f(r, D) f(s, rl), 


then L'® forms a ring. In terms of the canon- 
ical basis (4,1, e, this ring structure is defined 
by the formulas u,u,= f(r, l)u,, u,4 = s(4)u, 
(2e L). If K is the subring of L consisting of all 
4€ L such that s(2)— A(se G), then the ring Un 
is an algebra over K, called the crossed product 
of L and G with respect to the given operation 
and the given factor set f of G. In a narrower 
sense of the term, we consider only the case 
when L is a field, G is a finite group, f(r, |) #9, 
and G operates on L faithfully. In this case, G 
can be identified with the *Galois group of a 
tinite ‘Galois extension L/K, and so the crossed 
product is written (L/K, f). This is a fcentral 
simple algebra over K (- Sections E, F). 

If the operation of G in the above- 
mentioned general crossed product is trivial, 


s,r,leG, 
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namely, L = K, the crossed product of K and G 
is called an algebra extension of G over K with 
respect to f. Usually, we assume that K is a 
field and f(r, 1) #0. If f(r, D= 1, the algebra 
extension is the group algebra. If G is a finite 
group whose order is prime to the *character- 
istic of K, then any algebra extension of G over 
K (in particular, the group algebra) is always 
'semisimple and *separable. If G is a finite 
Abelian group and fis a factor set of G, then a 
bihomomorphism o: G x G—K* (a mapping 
which is a homomorphism in each variable) 

is detined by ¢(s,t)= f(s, t): (t, s) '. An alge- 
bra extension of G over K with respect to f 

is a central simple algebra if and only if @ is 
nondegenerate. In particular, if G is a direct 
product of n copies of a group of order 2, 

then by choosing n elements 5;,52,.,5, Of G 
suitably, any element of G can be uniquely ex- 
pressed as 5S, s,.5,, where (a, b, ., z} is a 
subset of { 1,. , n} in the natural order. Hence, 
if we take He 2) = Ug, U, as a basis of an 
algebra extension, then any factor set f is 
determined by Te, 5j) = Zo, where Àj 1 (i «j), 
A= + 1 (i »j), and 4j are arbitrary. When 4, 
= -1 (i>j), the corresponding algebra exten- 
sion is a !Clifford algebra. Furthermore, if Ài 
= 0, then it is a tGrassmann algebra. If 4;,40 
and nis even, then it is a central simple alge- 
bra whenever the characteristic of K is not 2 
(- 61 Clifford Algebras). 

Now let K be the real number field R and 
A, be the Clifford algebra with 4;,= 1. Then 
A, is the quaternion field. Elements of A, 
are called sedenions, and are important in 
‘spinor theory and *Dirac's equation. In gen- 
eral, let K be an arbitrary field whose charac- 
teristic is not 2 and putn = 2, uns l, 45,7 
—1, 41 =4#0, and 45, 2 u #0, in the pre- 
vious notation. Then the corresponding cen- 
tral simple algebra Q is called a (general- 
ized) quaternion algebra. Thus Q has a basis 
11, u,v, w} satisfying the following laws: 1 is 
the unity element, w = up = —vu, u? = 4, and 
v-u(AÀ,ucK). Any central simple algebra of 
dimension 4 is isomorphic to a certain quater- 
nion algebra. (In particular, if K = R and 4 = yi 
= -], then Q coincides with the quaternion 
field H.) For any element x = g + fur yv + dw 
of Q, the element X 2a — fu = yv = Ów is said to 
be conjugate to x, and N(x) = xxe K is called 
the norm of x. An element x of Q is invertible 
in Q if and only if N(x) z 0. 


E. Finite-Dimensional Associative Algebras 
over a Field 


For the rest of this article, we assume that the 
algebras considered are unitary and finite 
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dimensional over a field K. Then by general 
properties of left (right) tArtinian rings, any 
associative algebra A has the following struc- 
ture: the tradical N of A is the greatest 'nilpo- 
tent ideal, and the quotient algebra A/N = A is 
'semisimple and decomposed into a direct sum 


of ideals which are simple algebras: 
A=A,+...+Ay,. 


Each simple component A i is a full matrix ring 
of degree r; over a certain division algebra D. 
and A; is decomposed into a direct sum of r; 
minimal left ideals which are mutually A . 
isomorphic: 


Bers. 4 zi 4 air; 8 
Aus Ae(U +... +A, 1<i<n, 


where SL ... ef? are orthogonal *idempotent 
elements of A; whose sum is equal to the unity 
element of 4 i On the other hand, 


A,—e(U A +... +8094 


gives a decomposition of A, into the direct 
sum of minimal right ideals that are mutually 
A-isomorphic. Moreover, we can choose an 
idempotent element ef? of A from each residue 
class ef? such that (ef?) forms a system of 
orthogonal idempotent elements, whose sum is 
equal to the unity element 1 of A, and 


Fi n ri 
Ae A, Y Ae? Y V MA 
i=] s=1 


i=1 s=1 


p= 


gives a decomposition of A into a direct sum 
of tdirect indecomposable left (right) ideals. 
Here, Aej? and Aef (e? A and e? A) are A- 
isomorphic if and only if i =j. Conversely, 
every decomposition of A into indecom- 
posable ideals is obtained as above. The A- 
submodule Ne of Ae is the unique maxi- 
mal proper submodule, and Ae? /Nef? and 
Ae (Ne! are A-isomorphic if and only if i = j. 
Any simple A-module is A-isomorphic to a 
certain Aef"/Nef? (- Sections H, I). 

Any simple algebra A over K is isomorphic 
to a full matrix ring M,(D) over a certain divi- 
sion algebra D. This is called Wedderburn's 
tbeorem. Here n is determined uniquely by A, 
and D is also determined uniquely by A up to 
isomorphism. Moreover, the center of A is 
isomorphic to the center of D. If the center of 
A coincides with K, then A is called a central 
simple algebra (or normal simple algebra) over 
K. In this case, any isomorphism of two simple 
subalgebras of A can be extended to an tinner 
automorphism of A. Let V(B) denote the !com- 
mutor of a simple subalgebra B of A. Then 
V(B) is also a simple subalgebra and V(I’(B)) 
= B,dim A = dim B: dim V(B). In particular, if 
B is central over K, then there is a canonical 
isomorphism Az B ® , V(B). If D is a central 
division algebra, then any maximal commuta- 
tive subalgebra L of D is a field and satisties 
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(dim LI = dim D. Moreover, there are tsepa- 
rable extensions over K among such L. In 
general, the dimension of a central simple 
algebra A is a square number r?, where r is 
called the degree of A. 

Two central simple algebras are said to be 
similar if they are isomorphic to full matrix 
rings over the same division algebra. This is an 
equivalence relation, and each equivalence 
class is called an algebra class. On the other 
hand, if A is a central simple algebra and B is a 
simple algebra, then A ®x B is a simple alge- 
bra. Moreover, if B is central, then A Gi, B is 
also a central simple algebra. If A and B are 
similar to A’ and B’, respectively, then A @, B 
is similar to A’ (9, B’. Hence, the tensor prod- 
uct ® defines multiplication in the set (K) 
of the algebra classes over K. If A is a central 
simple algebra and A? is an algebra anti- 
isomorphic to A, then A? is also central simple 
and A (9, A? is isomorphic to a full matrix 
ring over K. This shows that (K) forms a 
group, which is called the Brauer group, after 
R. Brauer, who introduced this concept, or the 
algebra class group over K. For a central 
simple algebra A, the degree of a division 
algebra similar to it is called the Schur index of 
A (or of the algebra class of A), and the order 
of the algebra class of A in the Brauer group is 
called the exponent of A. The Schur index is 
divisible by the exponent; and conversely, the 
exponent is divisible by every prime divisor of 
the Schur index. 


F. Extensions of Coefficient Fields 


Let L be an ‘extension tield over a field K. 
Then for any algebra A over K, L (9y A can be 
regarded as an algebra over L. This algebra is 
denoted by AF and is called an algebra gb- 
tained by extending the coefficient field to L 
Let us denote the radical of a ring A by R(A). 
An algebra A over K is called a separable 
algebra if it satisfies (A^) = {0} for any exten- 
sion field L over K. In the special case when A 
is an talgebraic extension field over K, A is a 
separable algebra if and only if every element 
of A (or every element of a subset which gener- 
ates A) is tseparable over K (- 149 Fields). A 
(finite-dimensional) algebra A over K is sepa- 
rable if and only if A is semisimple and the 
center of every simple component of A isa 
separable extension over K. If the quotient 
algebra A/9R(A) of an algebra A is separable, 
then there exists a subalgebra S such that 
A=S+R(A) and SN R(A)= {0}, and S is 
uniquely determined up to inner automor- 
phisms (Wedderburn-Mal'tsev tbeorem). 

In order that an algebra A over K be central 
simple, it is necessary and sufficient that A^ be 


simple for any extension tield L over K. This 
latter statement holds if and only if A* is iso- 
morphic to a full matrix algebra over L for a 
certain extension tield L over K. Such an exten- 
sion field is a tsplitting field of A (- 362 Re- 
presentations F). For a central simple algebra 
A, a (finite) extension field L of degree r over K 
is a splitting field of A if and only if there exists 
a central simple algebra B of degree r which is 
similar to A having a subfield which is K- 
isomorphic to L. In this case, r is divisible by 
the Schur index of A. Furthermore, A has a 
separable splitting field whose extension de- 
gree is equal to the Schur index of A. This 
shows that A has a splitting tield which is a 
finite Galois extension tield over K. 

Let L be a finite *Galois extension over a 
tield K and G be its tGalois group. If f isa 
factor set with respect to the operation of G on 
L*, then the crossed product (L/K, f) is a cen- 
tral simple algebra over K (- Section D). The 
product fg of two factor sets f and gis also a 
factor set, and the set of all factor sets forms an 
tAbelian group. Thus (L/K, fg) is similar to 
(L/K, f ) G,(L/K, g). On the other hand, factor 
sets f and g are said to be associated with each 
other if there exists a family {A,},.¢ of elements 
of L such that 


fr.) g(r, Dr(A)AS A, r, leG. 


Hence, f and g are associated with each other 
if and only if (L/K, f) and (L/K, g) are similar. 
Therefore, the mapping f —(L/K, f) gives a 
monomorphism of the group H?(G, L*) of all 
associated classes of factor sets (which can be 
identilied with the 2-dimensional tcohomology 
group of G with coefficients in L*) into the 
Brauer group 4(K) over K. Its image coin- 
cides with the subgroup of all algebra classes 
which have L as a splitting field. In particular, 
any algebra class is similar to the crossed 
product of a certain finite Galois extension L 
and its Galois group G (R. Brauer, E. Noether, 
A. Albert, K. Shoda, and others). 


G. Cyclic Algebras 


Let Z be a ‘cyclic extension field of degree n 
over a field K. Then the crossed product of 
Z and its Galois group G is called a cyclic 
algebra over K. For a fixed generator 5 of 

G and any element a 4 0 of K, a factor set 
físi, s (0 <i, j <n) can be defined by f(s‘, s?) 
=] (itj<n)and f(s', s?) 2a (i- j 2 n). Let 

(Z, s, &) denote the corresponding crossed 
product. Then (Z, s, a) and (Z, s, fj) are similar if 
and only if «/f is a norm of a certain element 
of Z into K. On the other hand, any crossed 
product of Z and G is similar to a certain 

(Z, S, cc), and the correspondence x —(Z, s, a) 
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gives an isomorphism of K*/N; ,(Z*) (the 
norm class group of Z/K) to the group con- 
sisting of the algebra classes over K which 
have Z as a splitting field. If K is a tp-adic field 
or a finite *algebraic number geld, then any 
central simple algebra over K is isomorphic to 
a certain cyclic algebra. In this section, we 
describe this situation in detail. 

Let K be a p-adic field and q the number of 
residue classes modulo p. If A is a central 
simple algebra over K, then its Schur index 
coincides with the exponent, which is called 
simply the index. A finite extension tield L over 
K is a splitting field for A if and only if the 
degree of L is a multiple of the index of A. If n 
is the degree of A, then the field W= Kiwi ob- 
tained by adjoining to K a tprimitive (q^ (st 
root of unity o is a cyclic (and tunramified) 
extension of degree n over K, and its Galois 
group is generated by the automorphism 
g of Wdetermined by o? — of e K = iden- 
tity. Moreover, we have A x (W, o, q) fora 
certain yc K*. Let v be the exponential p- 
adic valuation v(a) of x. Then v/n (mod Z) is 
uniquely determined by the algebra class of A, 
which is called the Hasse invariant of A (or of 
the algebra class of A). By assigning to each of 
the algebra classes its Hasse invariant, we get 
an isomorphism of the Brauer group B(K) 
over K to the group Q/Z, the additive group 
of the rational numbers mod Z (H. Hasse, 
1931). 

Let K be a finite algebraic number tield and 
A be a central simple algebra over K. Let p be 
a (finite or intinite) prime divisor of K and K, 
the tp-adic extension field over K. The algebra 
A, which is obtained from A by extending the 
coefficient tield to K p is a central simple alge- 
bra over K,. Except for a finite number of p, 
A, is isomorphic to a full matrix ring over K,, 
and A itself is isomorphic to a ful] matrix ring 
over K if and only if A, is isomorphic to a full 
matrix ring over K, for all p. The index m, of 
A, is called the p-index of A, and the Hasse 
invariant of A, is called the p-invariant of A, 
which is denoted by (A/p). If p is an "infinite 
prime divisor, then m, is equal to 1 or 2, and in 
each case, we define the p-invariant by setting 
(A/p) = 0 or 1/2 (mod Z) correspondingly. The 
Schur index of A is the L.C.M. of the p-indices 
m, for all p and coincides with the exponent of 
A. This is called simply the index of A. On the 
other hand, the p-invariants satisfy (A/p) = 0 
(mod Z) except for a finite number of p, and 


5 (4/p)=0 (mod Z). 


p 


Conversely, given a rational number p, for 
each p such that (1) f, = 0 (mod Z) except for a 
finite number of p; (ii)p, z 0 (mod Z) if p is 
infinite and imaginary, p, = 0 or 1/2 (mod Z) if 
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p is infinite and real; (iii) 3,0, =0 (mod Z), 
then there is a uniquely determined algebra 
class of central simple algebras A over K such 
that (A/p)= De (mod Z) for each p. In this way 
the structure of the Brauer group over a finite 
algebraic number Iield is completely deter- 
mined (Hasse, 1933). 


H. Frobenius Algebras 


Let A be an algebra over a field K. A is called 
a Frobenius algebra if its *regular representa- 
tion and fcoregular representation (— 362 
Representations E) are similar. Thus A is a 
Frobenius algebra if the left A-module A and 
the dual module A* of the right A-module A 
are isomorphic as left A-modules. Let 


ri 


A-Y Y Ad - Y Y eA 
là 


be direct decompositions of A into indecom- 
posable left (right) ideals (— Section E). We 
denote ef! by ej. Then A is a Frobenius alge- 
bra if and only if there exists a permutation 7 
on 1,. , nsuch that (i) Ae, X (e44 A)*; (ii) r; = 
lxi): When there exists a permutation satisfy- 
ing only condition (1), A is called a quasi- 
Frobenius ^ algebra. 

For a subset § of A, the ideals I(S) = 
(ae A|aS 20] and r(S)= (ae A|Sa=0} are 
called the left annihilator and right annihilator 
of S, respectively. Then A is a quasi-Frobenius 
algebra if and only if (ii) Jr =1 and r(l(r))=r 
for any left ideal | and any right ideal r, In 
general, if we are given a left (right) A-module 
M, we denote the right (left) A-module 
Hom (M, A) by fi. Then if A is a quasi- 
Frobenius algebra, there is a canonical 1so- 
morphism MEM , and the annihilator relation 
gives a One-to-one and dual correspondence 
between the set of the submodules of M and 
the set of submodules of M (M. Hall). If a 
quasi-Frobenius algebra A satisfies (iv) dim r + 
dim I(r) = dim] + dim r(I) = dim A for any left 
ideal | and any right ideal r, then A is a Fro- 
benius algebra; the converse is also true. 

A criterion for an algebra A to be a Frobe- 
nius algebra is that there is a linear form x» 
A(x) on A such that if A(xa)=0 for all XE A, 
then a = 0. Moreover, if 7 satishes A(xy)= 
A( yx) (x, y€ A), then A is called a symmetric 
algebra. For example, semisimple algebras 
and group algebras are symmetric algebras. 
If A is a symmetric algebra, then for any left 
(right) A-module M the right (left) A-modules 
M* = Hom,(M, K) and M = Hom, (M, A) are 
canonically A-isomorphic. 

If A is a Frobenius algebra, the radical N of 
A satisfies (IN) = r(N), and the annihilator of 
N is a principal left and principal right ideal; 
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the converse is also true. For a two-sided ideal 
z of a Frobenius algebra A, the quotient alge- 
bra A/z is a Frobenius algebra if and only if 
l(z) and r(z) are a principal left and a principal 
right ideal, respectively. If A is a symmetric 
algebra, then any two-sided ideal z satisties 
l(z) = r(z), and A/z is also a symmetric algebra 
if and only if 1(z) = r(z) is a principal ideal 
generated by an element in the center. 

Furthermore, for any extension L of the 
coefficient Tield K, A4. is a quasi-Frobenius 
algebra (resp. Frobenius algebra, symmetric 
algebra) if and only if A is (T. Nakayama, 
1939, 1941). The concept of Frobenius algebras 
has been extended to algebras B over a ring A 
(F. Kasch, 1954). 


I. Uniserial Algebras 


In the notation of the preceding section, if each 
indecomposable left ideal Ae, (right ideal e A) 
of an algebra A over K has a unique compo- 
sition series, then A is called a generalized 
uniserial algebra. If an algebra A is decom- 
posed into a direct sum of ideals which are 
primary rings, then it is called a uniserial alge- 
hra. Any left module over a generalized uni- 
serial algebra A is decomposed into a direct 
sum of submodules which are A-homomorphic 
images of Ae;: An algebra whose radical N is 

a principal left and principal right ideal is a 
generalized uniserial algebra. For an algebra A 
to be uniserial, it is necessary and sufficient 
that every two-sided ideal of A be a principal 
left and principal right ideal. Hence A is uni- 
serial if and only if every quotient algebra of A 
is a Frobenius algebra. If A^ is uniserial for an 
extension field L of K, then A itself is uniserial. 
The converse, however, is not always true. If 
AL is uniserial for any extension feld L of K, 
then A is called an absolutely uniserial algehra. 
For A to be absolutely uniserial it is necessary 
and sufficient that its radical N be a principal 
ideal generated by an element in the center Z 
and Z be decomposed into a direct sum of 
simple extensions of K (i.e., ideals of the form 
K [a]) (K. Asano, G. Kóthe, Nakayama, G. 
Azumaya). 


J. Algebraic Algebras 


Here, we consider general (not necessarily 
finite-dimensional) algebras A over a field K. 
We say that A is an algehraic algehra if every 
element of A is algebraic over K, i.e., every 
element of A is a root of a certain polynomial 
with coefficients in K. We say that A satisties a 
polynomial identity p(X,, . . , X.) = 0 or that A 
is a PI-algebra with an identity p(X,,, X) = 
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0 if there exists a nonzero (noncommutative) 
polynomial p(X,, ,X,) in X,,, X, with 
coefficients in K such that pí(a,, , a) = 0 for 
all a, € A. A Pl-algebra satisfies an identity 
which is homogeneous linear in each variable, 
and also an identity of the form e Te E 
O(where[ Jis the sum È+ x,....x; taken 
over all permutations of 1,2,, n and + is the 
sign of the permutation). An algebra is said to 
be locally finite if any finite number of e]e- 
ments of A generate a finite-dimensional 
subalgebra. For Pl-algebras, an affirmative 
answer was found for Kurosh's problem, 
which asks whether an algebraic algebra is 
locally finite if the degree of any element q of 
A De dim K [a]) is bounded. 


K. Brauer Group of a Ring 


Let R be a commutative ring. An R-algebra is 
called a separable algebra if A is projective as 
a two-sided A-module (- 200 Homological 
Algebra F). When the base ring is a field, this 
agrees with the classical notion of separability; 
and A is separable over R if and only if 4/mA 
is separable over the residue field R/m for 
every maximal ideal m of R. A central separa- 
ble algebra is also called an Azumaya algebra. 
If P is a finitely generated faithful projective 
R-module (briefly: R-progenerator), the endo- 
morphism ring End,(P) is an Azumaya R- 
algebra. Azumaya algebras A, and A, are 
said to be in the same class (similar) if there 
exist R-progenerators P, and P, such that 

A, €) End, (P,) = A, © End,(P,). The set of 
similarity classes forms an Abelian group 
with respect to ®. This is called the Brauer 
group B(R) of R (Auslander and Goldman 
[10]). Every element of B(R) is of finite order 
[12,13]. 

B(R) is a covariant functor from commuta- 
tive rings to Abelian groups. If R is a field, 
B(R) coincides with the classically defined one 
(- Section E). If R is a fHenselian local ring 
with the residue field k, the mapping B(R)-+ 
B(k) is an isomorphism [9]. If Risa 'regular 
ring with the quotient field K, B(R) B(K) is 
iue If further dim R < 2, we have B(R) = 
(), B(R,), P running over all primes of height 
1 of E Where R, is the localization of R at p 
and B(R) and B(R,) are considered as em- 
bedded in B(K) [10]. As an example, put- 
ting these facts together with the structure of 
B(K) of the algebraic number field K (— Sec- 
tion G), we have the structure of B(R) of the 
ring of integers of K: B(R) = 0 if K is totally 
imaginary, and € (Z/2Z) + if K has r( > 0) real 
infinite places. 

A commutative R-algebra § is called a split- 
ting ring of A if the S-algebra § @ A is isomor- 
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phic to End,(P) for some S-progenerator P. 
We denote by B(S/R) the subgroup of B(R) 
consisting of all algebra classes split by S. 
Since an Azumaya algebra over a ring need 
not have a Galois extension as splitting ring, 
the description of the Brauer groups by means 
of Galois cohomology ceases to have full 
generality. Instead we have the following exact 
sequence of tAmitsur cohomology, assuming 
A is an R-progenerator (Chase and Rosenberg 
[11]: 0H (S/R, U)2 Pic(R) H S/R, Pic) 
5 H*(S/R, U)JO B(S/R)o H (S/R, Pic)^ 
H *(S/R, U)5 , where we denote by U( 7) 
and Pic(T) of a commutative ring T the unit 
group and the Picard group of rank 1 projec- 
tive modules of T, respectively. The full Brauer 
group B(R) is mapped monomorphically into 
H?(R, U)-lim H* (S/R, U), the limit over the 
faithfully flat R-algebras S. 

Grothendieck and others studied the Brauer 
groups in a more general geometrical context 


[12]. 
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A. Asymptotic Series and Asymptotic 
Expansions 


Let ¢,(x), n=0, 1, 2, be functions delined in 
R,. The sequence of functions {@,(x)}?2o is 
called an asymptotic sequence for x 2 oo if 


Qni (x) =0(¢,(x)) as X> 00 (1) 


for each n. When {ọ,(x)} Zo is an asymptotic 
sequence, the formal series 3. a, o, (x) is called 
an asymptotic series. In most cases, the func- 
tions @,(x) have the form v(x)q(x)". When 
Q,(x) -- "an asymptotic series is called an 
SCC power series. A function f(x) de- 
fined on R, is said to have the asymptotic 
expansion (or be asymptotically developable in 
the form) 


F(X) ~ ageo(x) +a G(X) +... +4,9,(x) +... (2) 
as X oo if f(x) satisfies 

f(x) = ag9o(x) — a4 P(X) — ...—a,q,(x) 

= O(0,41(%)) (3) 


for any integer nz 0 as x2 oo. The coefficients 
a, (n = 0, 1, ) appearing in (2) are uniquely 
determined. This fact immediately follows 
from the formulas 


a, = lim f(x)/go(x).... 


a,— lim (f(x) —aoqo(x)— 


x>w 


—ûn-1 Pn-1 (x))/@,(x). 


For example, the *Hankel function of the lirst 
kind HU It) has an asymptotic expansion 


Hy (x) 








| 4v -1)(47 — 3°). . . [42 —(2m—1)°7] 

S eg "H 

If v +4 is not a positive integer, KEE Ti) 

(— 2ix) " is not convergent, and (4) means that 
for each n, 


(v, m) 


=o ls D IS. 
Rm 24) LC Sen 


—-O0(x " 2?) 


HEG) — 








for X200. 

Assume that f(x) and g(x) admit asymptotic 
expansions in power series for x-t oo, Then 
all functions delined by f(x)-4- g(x), f(x)— 


g(x), f(x)g(x), f(x)/g(x) (lm... g(x) 40) admit 
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asymptotic expansions in power series for x» 
oo. Furthermore, if f'(x) has an asymptotic 
expansion for x— oo, its expansion is obtained 
by termwise differentiation of the f(x)'s. 

For any asymptotic power series bed o nX 
there always exists a smooth function f(x) 
defined in R, such that 


—n 


f(x)-)ag-ca,x ' x-ajx ? +... 


for x oo, and all the derivatives of f(x) are 
asymptotically developable in power series. 
Indeed, a function delined by 


d Lait 0(x/t,)x 


satisfies the required properties if (t) is a 
smooth function satisfying 


0, rel, 
so m 


and the sequence It 9 satislies 1 € to St; S 


.. and 


[n(n - 1)...(2n— 1)2na,1, ! | x2" 


for all n. 

Until now only asymptotic series for x2 
oo have been considered, but the notion of 
asymptotic series has been adapted in yari- 
ous forms to the spaces of functions under 
consideration. 

(a) Asymptotic series in a complex domain. 
Let a be a boundary point of an open con- 
nected domain D in the complex z-plane. When 
the asymptotic behavior for za of holomor- 
phic functions in D is considered, we adopt as 
an asymptotic sequence (9,(z)]5:9 of holo- 
morphic functions in D satisfying for each N 


Pn+i(Z)= 0(@, (2) 


as z tends to a through D. Let D be an angular 
domain with vertex at x, For any asymptotic 
power series Èo a,(z = x)" there always exists 
a function f(z) that is holomorphic in D and 
that admits an asymptotic expansion 


f(z)~ag+a,(z—a)+...+a,(2-—a)" +... (5) 


for za. Concerning the uniform convergence 
of asymptotic expansions, Carleman’s theorem 
is well known [2]: If the asymptotic expansion 
(5) is valid as z tends to x through an arbi- 
trary angular domain in the Riemann sur- 
face of log(z =a) having g as its vertex, then 

the asymptotic expansion (5) is uniformly 
convergent. 

(b) Asymptotic expansions by a large para- 
meter. In the theory of differential equations, 
functions with a large parameter are often 
considered. Let O be a domain in R" and ka 
parameter > 1, and let g(x, K), n20, 1, 2,. , 
be functions defined for (x, K) € Q x [ 1, 00). If 
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they satisfy for each xeQ and n 


IEN k)=0(@,(x, k)) (6) 


for k oo, the sequence oz, k)} io is called 
an asymptotic sequence, and the formal series 
20 a,l x)o,t x, K) is called an asymptotic series. 
A function f(x, K) is said to be asymptotically 
developable in the form 


F(X, k) ~ ag(x) @o(x, k) + a, (x) (x, k) +... 
T Ay(X) qo, (X, k) Se 


for k oo if for every xeQ and every n 


f(x, k) = Ag(X)Po(x, k) RICCA a, 4109, (x, k) 

= O(q,(x, k)) (1) 
holds as k— œ. Especially, if the order rela- 
tions (6) and (7) hold for each n uniformly in 
x€Q, we say that f(x, K is uniformly asymp- 


totically developable. In most cases, the 
o, (x, K are of the form w(x, k)k ". 


B. Methods of Asymptotic Expansion 


There are several general methods of obtaining 
asymptotic expansions of functions represented 
by integrals with a large parameter. These 
methods are useful in applications because 
most *special functions are represented by 
integrals (- 39 Bessel Functions, 389 Special 
Functions). 

(a) Laplace's method. Let h(s) be a smooth 
real-valued function in [a — ô, a + 6] (0 > 0) 
satisfying 


h(a)« h(s) for all se[a—ó,a--ó] — (aj 
and 
h’(a) > 0, 
and let g(s)e C*([a—6,a+6]). Then 
até 
I(x) -| e **S) g(s)ds 
O-S 
has an asymptotic expansion 
Jn 
I(x) "y e "ha F^ 
x 





1 
LE tex b... 
h(a) 


fortes] 


for x 00, where the c, are determined by 
derivatives of h(s) ats = aof order < 2n+ 2 
and those of g(s) at § = a of order € 2n. 

As an application of this method, con- 
sider the T -function T (x) - f e 't* ! dt. By a 
change of variable t = xs, we have I(x) = 
by e "Sr ege 71 ds, Since for any ô > 0 


1-ó 0D 
eg ia legs. 71 di * e SIS logs) 71 ds 
0 146 


Ste air — (x0) 
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the asymptotic expansion by the Laplace 
method 


1+ô 2m 
g "6 l9. lgs pe * —(1+ce,x7'+...) 
1-3 x 


implies *Stirling’s formula 


(x)= /2rx* ^ e-*(1--O(x !)). 


(b) Stationary phase method. Let h) be a 
real-valued smooth function detined in [a 


6, a + ô] such that 
h'(u) = 0, h’(u) 4 0 and h'(s) £0 for s #a, 


and let g(s) be a function in Cg (a — ô, a + 6). 
Then 


atà ] 
I(x) = | ex^) a(s) ds 


0-S 
has an asymptotic expansion 
, , 1 a 
EAR Se 


Jx IR" (a) 
tex "+ d 


where g= A" (a)/| h"(u)) and the c, are deter- 
mined by derivatives of h(s) at s = a of order 
< 2n + 2 and those of g(s) at s = a of order 

« 2n. 

Further, "integration by parts and the 
tmethod of steepest decent are often used to 
derive asymptotic expansions of integrals with 
a large parameter. 


C. Application to Differential Equations 


For a system of linear ordinary differential 
equations with an "irregular singular point at 
z =Q, even when there exists a formal power 
series solution, the power series is generally 
divergent. Taking the hint given by Stirling’s 
formula for the [-function, Poincaré intro- 
duced the notion of asymptotic expansions 
and succeeded in giving an analytic meaning 
to formal solutions of divergent type. Actual 
solutions for *difference equations, tdifference- 
differential equations, and ‘ordinary differen- 
tial equations of canonical form (including 
second-order linear ‘differential equations 

of confluent type) always have asymptotic 
expansions. 

In the study of certain ordinary differential 
equations containing a large parameter k, an 
asymptotic solution is often obtained in the 
form 


u(x, 9 we" 9ac(x)-ca(x)k ! + 
+a,(x)k-"+...) 
(*WKB method). Also, in the study of linear 
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partial differential equations, an asymptotic 
solution of the above form plays an important 
role. For example, consider the Schrodinger 
equation 

1 ôu 1 


Lilu] =- Au — DD ss D. 
Jul JOE Gay (9 
An asymptotic series 


; 1 
u(x,t; A mw ell. Lui WEST (x, Ua ien 
l 


t a,(x, t) (5) + ) (8) 


satisfies L,[u] -0 for4—ooif 


v(x)p- O0 (eikonal equation), 





-—2Vo-:Va,—4^qa,- Aa, ,, n=0, 1, 2, 


(transport equation), 


where we set a _, = 0. For each asymptotic 
solution of the form (8) there exists an actual 
solution of L;[f(x, t; 4)] = 0 which is asymp- 
totically developable in the form 


fot 2) ¥ e" 999a (x, OAI? 
n=0 


Then asymptotic solutions of the form (8) can 
describe the fundamental properties of the 
phenomenon governed by the equation L,[u] 
eg, 

*Geometric optics can also be illustrated by 
asymptotic solutions of the form (8) (— 325 
Partial Differential Equations of Hyperbolic 
Type L). The ideas of asymptotic series, asymp- 
totic expansions, and asymptotic solutions 
are essential to the theory of tpseudodifferen- 
tial operators and *Fourier integral operators 
(— 345 Pseudodifferential Operators). 
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A. General Remarks 


Automaton is a generic term for the math- 
ematical models for automatic machines hav- 
ing memory devices and capable of perform- 
ing definite acts within each unit of time. 

There are several types of automata, namely, 
Turing machines for the theory of computa- 
bility [6], linear bounded automata, push- 
down automata, and finite automata for math- 
ematical language theory [10-1 3]. These 
models are also used in the theory of 'complex- 
ity of computation of algorithms [14]. 


B. Turing Machines 


Since the tarithmetization of metamathematics 
was used in the proof of tG6del’s incomplete- 
ness theorem, mathematical formulations of 
the notion of teffectively computable functions 
have been attempted by many mathematicians, 
including S. C. Kleene, A. Church, and others, 
as well as by Gödel himself (— 356 Recursive 
Functions). With this end in mind, A. M. Tur- 
ing and E. L. Post independently introduced a 
sort of ideal computer [ 1,2], called a Turing 
machine. 

Such a machine is capable of being in one 
of a finite numberof internal states (possible 
contents of a memory device) dos, x ta 05 
fixed for the machine; and the machine is 
supplied with a tape divided into squares. The 
tape is infinite in both directions. Each square 
can be blank or can have printed on it any one 
of the previously specified symbols So, si, . , 
Sm» Where Za stands for the blank. In any situa- 
tion the number of nonblank squares is finite. 
Let Q= lge, di, d], M = {S05 S15 ..., Sm}, and 
consider the mapping t: Qx M-Q x M x 
IR L, 0j. A Turing machine T is specified by 
(Q, M, t, qo, F), where F is a subset of Q called 
a set of final states and du is the specified ini- 
tial state. If t(q, s) = (q', s’, x), then the 5-tuple 
(q, 8, q’, s’, X) is called an instruction for T. The 
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machine acts as follows: (1) At the start, the 
machine is in the state doa. Some sequence of 
symbols in M are written on the tape as an 
input, and the machine is on the square next 
to the leftmost nonblank symbol. (2) With 
respect to the present state q and the scanned 
symbol, the machine selects an instruction 
(q, 5. q', S, X); then the next state becomes d. s 
is replaced by s', and the machine moves one 
square right or left or stands still according 
as x is R, L, or 0. (3) The machine stops when 
the state becomes an element of F, and then 
the sequence of symbols on the tape is the 
result of the computation and considered to 
be the output. A Turing machine is called 
deterministic if there is always at most one 
instruction associated with a given pair (q, SJ; 
otherwise it is called nondeterministic. 

We represent a natural number y on the 
tape by y + | squares, each of which has the 
symbol s, printed on it. A pair (y, , Y2) is repre- 
sented on the tape by the following sequence 
of symbols of length y, + y; + 5: one sọ, the 
representation of y,; one sọ, the representation 
of y,; and one sọ. Similarly, in general we can 
represent a k-tuple (y,, . , yy) of natural num- 
bers on the tape. Let ® be a partial function 
of k variables, k> 0. We say that a Turing ma- 
chine T computes 6 if T starts with the tape 
in which the representation of [X05 op) is 
printed and stops with the tape having the 
representation of (X,,, x,,x) when ®(x,, 

. > X) is defined and equal to x. A partial 
function is computable (in the sense of Turing) 
if there exists a Turing machine that com- 
putes the function. It is known that a partial 
function is computable if and only if it is par- 
tial recursive. The equivalence of comput- 
ability (in the sense of Turing) and recursive- 
ness for number-theoretic functions is strong 
evidence for the validity of Church's thesis 

(- 356 Recursive Functions). 

We can apply a Turing machine not only to 
the computation of number-theoretic functions 
but also to the transformation of a finite se- 
quence of letters in any language with a finite 
list of letters. A finite nonempty list of distinct 
symbols is called an alphabet, and a finite se- 
quence of symbols in the alphabet is called 
a word in that alphabet. All the words in an 
alphabet constitute a ffree monoid if we add 
the empty sequence and think of the concate- 
nation of two words as its multiplication. We 
call a finite nonempty list (A ,, Bj), . (A.. B,) 
of pairs of words a set of production rules. By 
an application of a rule (A;, B;) to a word W, 
we mean that an occurrence of A, exists in W, 
and by the application, A; is replaced by B,. 
We say that two words R and $ are equivalent 
if they are transformable into each other by a 
finite number of applications of the production 
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rules. The following problem is called Tbue's 
(general) problem: [s there an algorithm for 
deciding, for any given alphabet and a set of 
production rules, whether or not any given 
two words are equivalent? E. L. Post and A. A. 
Markov proved by applying Turing machines 
that this decision problem is unsolvable [4, 3] 
(a negative solution for the so-called word 
problem for semigroups). 


C. Universal Turing Machine and 
Generalizations 


Turing showed that it is possible to construct 
a machine U with the following property [ 1]: 
If we supply U with a tape on which a suitably 
coded list of instructions of any given machine 
T is printed then U successively prints on it 
the behavior of T which results from the exe- 
cution of the instructions of T. In other words, 
U simulates the operations performed by any 
machine. Such a machine U is called a univer- 
sal Turing machine. We can constructa univer- 
sal Turing machine with only two internal 
states and also one having only two symbols, 
including the blank s, (C. E. Shannon and J. 
McCarthy [7 ]). 

Let dn... i, be given functions that are all 
defined everywhere on the domain considered. 
Then we can relativize (— 356 Recursive Func- 
tions) the notion of computability of functions 
(in the sense of Turing) to y pacer Wes Indeed, 
we Can consider a machine acting as follows: 
When the machine is in a state di, (jo b. k 
and we assume that these states are specified 
correspondingly to V, . , i, and are not in 
F), then the machine prints the representation 
of w(y,,, Y.) to the right of (yj, , y.) 
and scans the resulting (n;+ 1)-tuple (3555 
Yn; GE , y,,)). In this process, the symbol 
a that is written to the right of (y,, . . , y,,) is 
moved one step to the right so that a is pre- 
served on the right of the resulting (n + 1)- 
tuple. Extending the preceding, Kleene defined 
a machine that computes a functional of finite 
types [S]. A functional of variables of any 
finite types is computable by such a general- 
ized Turing machine if and only if it is partial 
recursive. 


D. Language and Áutomata 


Let M and N be mutually disjoint alphabets 
and V 2 MU N. Then G =(N, M, P, S) is called 
a Chomsky grammar if P is a set of production 
rules consisting of finite number of pairs of 
elements of V* and § is specified element of N, 
where V* means the free monoid generated by 
V. For a word ye V*, if the result of a finite 


31E 
Automata 


number of applications of rules in P is v, then 
we write uv, and pis said to be derived from 
u by G. The language generated by G is L(G) 
= {u S>uand ue M*j. N. Chomsky classi- 
fied the grammars into three families with 
respect to the form of production rules; (1) 
Context-sensitive grammar: For each (u geb, 
the length of u is less than or equal to the 
length of p, (2) Context-free grammar: For 
each (u, v)e P, u is an element of N and v is 

not a empty word. (3) Regular grammar: For 
each (u, v) e P, ue N and v is either an element 
of M or of the form ax, where qc M and xe N. 
Let F,(M), F,(M), and F,(M) be the families 
of languages generated by the grammars in 
the corresponding families classified as above. 
Then Fi(M)2 F,(M)2 F3(M) holds and this 

is called a Chomsky hierarchy [ 10]. Formal 
language theory studies various structures of 
these families of languages. 

Corresponding to the three families of lan- 
guages, we have the following families of auto- 
mata (1) A deterministic (nondeterministic) 
linear bounded automaton is a Turing machine 
with input tape, namely, T= (Q, MUN, t, qo, F), 
where Q is the set of states, M an input alpha- 
bet, N a tape alphabet, { is a mapping from 
Qx(MUlej) x(NU {e}) into Q xN x {L,R} 
(the power set of Q x N x {L, RJ), du € Q is the 
initial state, and F is a subset of Q. A word 
we M* is placed on the input tape, and T 
starts its computation from the initial state 
with blank tape and by reading one symbol 
from the input tape, and with respect to the 
mapping f, we decide the configuration of the 
next step. If at a certain stage, T reaches a 
state in F when T reads up w and if the length 
of the tape used is a constant multiple of the 
length of the input tape, then w is said to be 
accepted by T. (The special symbol e makes 
it possible to change the configuration without 
moving the input tape.) (2) A push-down auto- 
maton is a Turing machine with an input 
tape. The working tape is of infinite length 
and moves in only one direction, say to the 
right. The computation with input word we 
M* starts with the position at the left end of 
the work tape. It uses the tape according to a 
first-in—first-out principle; namely, the ma- 
chine may write a symbol to the blank square, 
but if it is on the rightmost nonblank square, 
it may not go left without erasing the non- 
blank symbol. If it reaches a state in F, then 
w is said to be accepted by T. (3) A finite auto- 
maton is a machine without its infinite tape; 

a word w is accepted if T reaches a state in F 
(without using the symbol e). 

Let L(T) be the set of words over M ac- 
cepted by T; then L(T) is called the language 
accepted by T. Corresponding to the three 
families of automata, DF(M) and NF.(M), i= 
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1, 2, 3, denote the family of languages ac- 
cepted by deterministic or nondeterministic 
automata. It is known that DF(M) = NF,(M) = 
F(M) for i22 and 3, and NF,(M)=F,(M); 
however, whether DF, (M) = NF,(M) holds or 
not is still open, and this problem is called the 
LBA problem. These automata are used to 
study the closure properties under various 
operations within the families F,(M). In par- 
ticular, the LBA problem is equivalent to the 
problem of whether F,(M) is closed under 
complementation. 


E. Applications of Automata Theory 


Finite automata are formulated as models for 
digital circuits with memory. Therefore there 
exist structural studies of finite automata; for 
example, methods to decompose a given finite 
automaton to smaller automata correspond- 
ing to a circuit and its subcircuits are studied 
using tsemigroup theory [14,17]. 

Context-free grammars are useful in describ- 
ing the syntax of programming languages. For 
a context-free grammar G, a derivation tree of 
x € L(G) is a tree whose terminal nodes are 
labeled by each symbol of x, its root is labeled 
by S, and other nodes are labeled by some 
elements of N in such a way that if for a node 
with Ac N the nodes connected by an out- 
going edge from the former have labels x,, 
vie, An, then (A, x,,, x) is a production rule 
of G. Parsing is an algorithm to construct 
a derivation tree of each XE L(G). Various 
parsing algorithms have been studied within 
subfamilies of context-free grammars [ 16] and 
they are used in the construction of compilers. 
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A. General Remarks [1] 


Let X be a (complex) 'analytic manifold and F 
a *discontinuous group of (complex) analytic 
automorphisms of X. A set of tholomorphic 
functions (without zero) on X, {j,(z)) (ye I), is 
called a factor of automorphy if it satisfies the 
condition j,,.(z) =j,(y(z))j,,(2) for all y. viel, 
z€ X. A tmeromorphic function f on X is 
called a (multiplicative) automorphic func- 
tion with respect to the factor of automorphy 
{j,(z)} if it satisfies the condition f(y(z))= 
fj (z) for all yel, ze X. When all j,(z) 

are identically equal to 1, i.e., when f(z) is T- 
invariant, f is simply called an automorphic 
function with respect to I, When all j, are con- 
stant (and hence y-j. is a "quasicharacter" 
of I), f is called a multiplicative function. If 
we denote by J,(z) the functional determinant 
of the transformation y, then for an integer m, 
an automorphic function f with respect to 
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a factor of automorphy of the form j,,(z) = 
J,(z))” is called an automorphic form of weight 
m. (In this case, it is customary to assume 

that fis holomorphic. Also, in this and most 
other cases, we add a suitable condition on 

the behavior off at the “points at infinity,” 
when IX is not compact (- 234 Kleinian 
Groups).) 


B. The Case of One Variable [2-6] 
Except for the cases where I’ is a tinite group 
or its extension by a free Abelian group of 
rank <2(— 134 Elliptic Functions E), it is 
essentially enough to consider the case where 
X is the upper half-plane § = [ze C Im z» 0j. 
In this case, [ may be obtained as a !discrete 
subgroup of the *special linear group G = 
SL(2, R). In the following, we restrict our- 
selves to the case where T is a tFuchsian group 
of the first kind Ge, the case where the tHaar 
measure u(U $5 of the quotient space I'\ is 
finite; 2. 122 Discontinuous Groups). An 
automorphic function (or Fuchsian function) f 
with respect to [ is, by definition, a meromor- 
phic function f on § satisfying the following 
conditions: (i) f is T-invariant, i.e., f(yz) 2 f(z) 
for all ye T; Gi) f is also meromorphic around 
any tcusp x, of T, Le., if @ is a real linear 
fractional transformation mapping x, to 00 
(e.g. Paz) = —1/(z — xo)) and if the trans- 
formation zz + h (h » 0) is a generator of 
ul dën ! (where I, is the 'stabilizer of xo), 
then f(qg !(z)) can be expanded into a *Lau- 
rent series of q, = exp((2ni/h)z) (which has only 
finitely many terms with negative exponent) in 
a neighborhood Im z > y, of oo ER is called the 
local parameter around the cusp x) If we 
denote by R, the compact *Riemann surface 
obtained from the quotient space I'\ by 
adjoining a certain (tinite) number of “points 
at infinity" corresponding to the equivalence 
classes of cusps of TI. then conditions (i) and 
(ii) amount to saying that f gives rise in a 
natural manner to a meromorphic function on 
Rr. Thus the field p of all automorphic func- 
tions with respect to I' can be identified with 
the *algebraic function field belonging to the 
Riemann surface Rp (— 11 Algebraic Func- 
tions A). (While any (nonconstant) automor- 
phic function with respect to a Fuchsian group 
of the first kind I has the real axis as its tnat- 
ural boundary, an automorphic function with 
respect to a Fuchsian group of the second 
kind can always be analytically extended to 
the lower half-plane through a neighborhood 
of any “ordinary point" on the real axis (= 
234 Kleinian Groups).) 

Let kbe an even integer. A (holomorphic) 
automorphic form (or Fuchsian form) f of 
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weight k or of dimension -k with respect to [ 
is, by definition, a holomorphic function f on 
§ satisfying the following conditions: (i) For 


N 
ene Ki Tozls f(z) (cz - d. (In 


other words, J,(z) "? = (da(z)/dz) "? = (cz + d} 
is the factor of automorphy.) (ii) f is also 
holomorphic around any cusp Xy, i.e., in the 
above notation, we have an integral *Fou- 
rier expansion” f(qg ! (z)) (dog '(z)/dz)*? = 

Y 94,q,. Note that this definition of weight 
is different from that given in Section A (- 
Sections F, G). If, moreover, the constant term 
a, in this Fourier expansion vanishes at all 
cusps of [, f is called a cusp form. In partic- 
ular, fis a cusp form of weight 2 if and only 

if f(z)dz gives rise in a natural manner to a 
‘differential form of the first kind on the 
Riemann surface Rp. (By a slight modifica- 
tion of condition (ii), we can also define an 
automorphic form of odd weight k. When 


-1 0 . 
( 0 Der , condition (1) (for k odd) 1m- 
plies that f is identically equal to zero. So as- 


—1 0 
sume that ( 0 kr Then cusps X, are 


classified into two categories, according as 
G Ee € Pol. Po ! or not, and in the first 
case we should replace the power series in q, in 
condition (ii) by q” x (power series in q,).) 
(Note that the above definition of weight is 
slightly different from the general one given in 
Section A. See also the Siegel and Hilbert 
modular cases (- Sections F, G).) 

We denote by 9Ji, (T) (resp. &, (I) the linear 
space of all automorphic forms (cusp forms) of 
weight k with respect to I’, Since clearly the 
relations Mi, Mi, My, Wh, Sy C Sy 4, hold, 
the direct sum M(T) = 3,901, (D) is a (commuta- 
tive) *graded algebra, of which S(T) = Y, G,(T) 
is an ideal. It is known that 99i, (T) is of finite 
dimension, and d, = dim 9Ji, (D), d) = dim S(r) 
can be determined (from the *Riemann-Roch 
theorem) as follows: 


d,=0 for  k«0; 
| for t$ 0 
d =], d? = i 1 
y ? for a 
g for t = 0, 
d,= d? =g; 
2 een fore “HOO. e 


NIS 


d?=d,—t for k even, 24, 


where $ is the number of the equivalence 
classes of telliptic fixed points of ', e; (1 « i « s) 
is the order of the stabilizers of these elliptic 
points, t is the number of the equivalence 
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classes of cusps of T, and g is the tgenus of the 
—1 0 
Riemann surface Rp. (When ( 0 M 


we can obtain a similar formula for k odd 
except for d, , d$, which, in general, cannot be 
determined from the Riemann-Roch theorem.) 

One method of constructing automorphic 
forms is provided by Poincaré series as fol- 
lows: Let ga be a ‘linear fractional transfor- 
mation that maps the unit disk to the upper 
half-plane (e.g., the *Cayley transformation), 
and put I” = oj ! 'oy; let o be a function 
holomorphic on the unit disk including its 
boundary (e.g., a polynomial). Then for k 74, 
the series 


P,z)- H e (s'(z))(do'(z)/dz)" 
ae 


is tuniformly convergent in the wide sense (Le., 
uniformly convergent on every compact set) in 
the unit disk and expresses an automorphic 
cusp form of weight k with respect to I’, .e., 
we have P,(cs! (z)) (des! (z)/dz)? e S(T). 
Conversely, every element in G,(I) (k 24) can 
be expressed in this form. A series of this type 
is called a theta-Fuchsian series of Poincaré (or 
simply Poincaré series). 

For k 23, we can define a (positive definite 
Hermitian) inner product on Glas follows: 


(f, 9) -Í f(z)g(z) y* ? dx dy, 
F 


where z = x + iy and F is a 'fundamental region 
of F mn This is called the P etersson metric. 
Since this integral converges also for f, ge 
99i, (T) if one of them belongs to GIL we 
can define the orthogonal complement €,(I) 
of S, (1) in MT). As we shall see, €,(1) is 
generated by *Eisenstein series. 

Any automorphic function f with respect to 
T can be expressed as a quotient of two auto- 
morphic forms f, , f; € 9t, (D) for a sufficiently 
large k. If we put w — f(z), the inverse function 
z =f (w)can be expressed as the quotient of 
two linearly independent solutions of a "near 
differential equation of the form d?z/dw? = 
q(w)z, where o(w) is an *algebraic function 
belonging to the Riemann surface Rp. 


C. Modular Functions and Modular Forms 

(4, 7,8] 

The (elliptic) *Àmodular group 

IzI(l) 
=SL(2, Z) 
IC b 
"De d 


is a Fuchsian group of the first kind acting on 





abe deZ,ad-be=1} 


the upper half-plane $. The quotient space 
I'\ can be compactified to get a compact 
Riemann surface Ry, = '\ U {00} of genus 
zero by joining one point at infinity corre- 
sponding to the cusp oo, around which we 
take q = e?*? as a local parameter. The dimen- 
sion d, = dim 9Jt, (I) (k even, > 2) is given as 
follows: 


4 _ S[k/12] for k=2 (mod 12), 
F [k/A2]4 1 for k£2 (mod 12), 


where [ ] is the *Gauss symbol. 

More generally, an automorphic function 
(automorphic form) with respect to the prin- 
cipal congruence subgroup 


b 0 
ren" Jl ME 


is called a modular function (modular form) of 
level N. (For the number of cusps of T( N), 
denoted by t(N), and the genus of Rue > 122 
Discontinuous Groups.) At any cusp of T(N), 
the local parameter q, in (ii) is given by qy = 
exp(2ziz/N). For N, kè 3, the dimensions d,, 
d$ are given by the general formula of the 
preceding paragraph (including odd k). For 

k 2 3, we defme the (extended) Eisenstein series 


by 
| 


y 
— pd 
m; s (moa N) (M; + MZ) 


G,(z;c,, c4, N)= 


where €, Co are integers such that (c,, c2, N) = 
1 and the symbol Y denotes the summa- 

tion excepting the pair (m,, m;) = (0,0). Then 
G,(z; c4, co, N) depends only on c1, c; (mod N), 
and if we take a set {(c,,c2)} suchthat í(c,,c;), 
(—c,, —€3)) forms a complete set of repre- 
sentatives of the (primitive) residue classes 
(mod N), then the corresponding set of Eisen- 
stein series forms a basis of the space &,(I'(N)) 
(k 2:3), whence we obtain dim €, = d, = d? = 
t(N) (kè 3) (for details, including the case k= 
1, 2, se E. Hecke, Ath Math San Univ. Ham- 
burg 5 (1927)). 

The Fourier coefficient a, of the Eisenstein 
series can be calculated easily, and we get an 
estimate a, = O(n* 1 **) for k> 2, where c is an 
arbitrary positive number. On the other hand, 
for cusp forms, Hecke (Abh. Math. Sem. Univ. 
Hamburg, 5 (1927)) gave a, = O(n"?). In an 
attempt to improve this estimate, H. D. Kloos- 
terman was led to consider the sum 


u,v,q)— xp{ —-| ux+— 
E NE 


(u,v.4 € Z), 


called the Kloosterman sum, which is also 
related to the arithmetic of quadratic forms. 
Using A. Weil’s estimate |K (u, v, p)| € 2 /p 
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(where p is an odd prime, (u, p) = (v, p) 71), 
based on the analog of the Riemann hypoth- 
esis, we obtain a, = O(n*? ^ !^**). The estimate 
a,= O(nk/?~¥2**) is a weak form of what is 
known as the Ramanujan-Petersson conjec- 
ture (— Section D). In the case N = 1, we ob- 
tain the classical Eisenstein series G,(z) = 

G,(z; 0,0, 1) (Keven, > 4), from which we 

define the modular forms g,(z)= 60G,(z), 

g3(z) = 140 G,(z), and A(z) = gł = 27g3 of weight 
4, 6, and 12, respectively. If we denote by (2 = 
go{u; 1, z) the Weierstrass *(o-function with 

the fundamental period (1, z) we have the 
relation f’? 2 49 — g, (9 — g4, and A(z) is the 
‘discriminant of the cubic polynomial appear- 
ing in this relation. It is known that every 
modular form can be expressed uniquely as a 
polynomial in g5 and g3, or in other words, we 
have gn(L(1) zx C95, g3]. The polynomial in 
J, 93 expressing a modular form is, moreover, 
isobaric, Le. consists of terms of the form 
C9," g4 , cet, where 24+ 3v is a constant 
called the weight of the isobaric polynomial. 
Also, A(Z) is a cusp form of the smallest weight, 
and the ideal of all cusp forms, S(T ( 1)), is a 
principal ideal in R(I'( 1)) generated by A(z). 
The Fourier expansion of the Eisenstein series 
G,(z) is given as follows: 





I pups. RHET 
— On —[ B. ( 1X2 k-1 
G,(z) = (2n) al kt CH) ms i) 


where q —e?"* and B, is the *Bernoulli num- 
ber. The discriminant A(z) is expressed as an 
infinite product as follows: 


DEET [ (am 


If we put J(z) 2 g3/A, the function J(z) is a 
modular function and gives an 'analytic iso- 
morphism of the Riemann surface Ry) = 
T( (pu {co} onto the Riemann sphere 
CU {oo} (which maps C, 4, «0 to 0, 1, oo, 
respectively). Hence the field of modular func- 
tions Kra) is a rational function field C(J) 
generated by the function J. The analytic 
isomorphism class of tcomplex tori of dimen- 
sion 1 (= felliptic curves) Ei, vir C/(Za,+ 
Zo») with the fundamental period (@,, œ) 
is uniquely determined by the ['( 1)-equivalence 
class of the modulus 7 = c, Jo, (e $), and hence 
by the value J (1) of the function J. This is the 
historical origin of the name of tmodular 
function [4]. 

As an example of a modular function of 
level 2, we have the 4-function: 

e(1--2/2)— p(2/2) 


II eu- ec ^ 


which gives an analytic isomorphism of the 
Riemann surface 981,5, = I (2 $ U (three 
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points) to the Riemann sphere CU { œ} (map- 
ping 0, 1, oo to 1, o0, 0, respectively; this prop- 
erty is used in a proof of the fPicard theo- 
rem). Hence we again have Ry.) = C(4). The 
following relation holds between J and A 


4(1—4- 4p 


J(225 eg 


In general, the field Me of modular func- 
tions of level N is generated by (g>/g3): (o((a, 
t a5z)/N; 1,2) (a,,a; € Z). 


D. The Hecke Ring and Its Representation 
[S-1 13 


In general, let [ be a group, ĮI a subgroup of 
[, and suppose that, for every gef, oTo ! 
and [' are *tcommensurable, which amounts to 
saying that every double coset Tel is a union 
of a finite number of left or right cosets of 

[. Then in the free module generated by all 
double cosets ToT (oe f), we can define a 
bilinear associative product as follows: 


Poll — "ne f; p ol ol, 


where, writing l'a - | J; Fo; rtr =); Te, we 
denote by m(a, 7; p) the number of pairs (i,j) 
such that T'o;z; Un, The associative ring thus 
obtained is called the Hecke ring (or Hecke 
algehra) and is denoted by X (T, D). dif f is a 
'topological group and T is an open compact 
subgroup, the Hecke ring #(I,T) can also be 
interpreted as the ring of all Z-valued continu- 
ous functions on Î with compact support 
provided with a *convolution product; — 29 
Associative Algebras C.) 

As an example, setting [ = GL*(2, Q), T = 
SL(2, Z), we obtain a Hecke ring #(f,T). In 
virtue of the theory of telementary divisors, a 
complete set of representatives of III is 
given by 


da 


a 
We write T(a,,a;) - V " 





d4,,05,€Q,a,,a;70,a, jas}. 


I. From the 
a2 
relation T(a,@,,4,4,)T(a;',a;')=T(a,,a,) 
follows the commutativity of # (f, I). Fur- 
thermore, if we put, for a positive integer n, 
L-T(n) Y T(a.4) 


a, a EZ 
aja, =n 


then the following multiplication formula 
holds: 


T(n): Tim) =F dT(d, d) . 


din, m 


T (nm/d?). 


We obtain a representation of J£ rr )in 
ft (D) as follows. For T=TeT = UT;e 
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X (T, P) and f eSt,(D) we put 
(f| D) 9, flal) (ciz +d) * det(s;), 


where 6; = 6 ) and lis a fixed integer. This 
i i 
representation leaves ©,{I) and GU) invar- 
iant. In particular, the representation of T, 
(with [=k 1), called the Hecke operator 
(Hecke, Math. Ann., 114 (1937)), is THermitian 
with respect to the Petersson metric. 
Following Hecke, we associate with every 
modular form f(z) = $ o4 Ag’ (qQ=er™) a 
*Dirichlet series p(s) = LD, a,n *. Since a, = 
O(n* '**) for feM, (k > 2), o(s) is absolutely 
convergent in the half-plane Res > k The 
conditions for f to be a modular form of 
weight k are equivalent to the following con- 
ditions for q: (s-k) 9 (s) can be extended to an 
Tentire function of fmite *genus (actually, of 
genus 1) and, if we put R(s) = (2z) *T'(s)e(s), 
R(s) satisfies a functional equation of the 
form R(K- s) = (—1)*?R(s). A correspondence 
between f and o that satislies these con- 
ditions is one-to-one; in fact we have a, = 
( 1? Res,.., R(s), and the function g(x) = f(ix) 
— a(x > 0) and R(S) are related by the tMellin 
transform as follows: 


0 
Risi=| gG)x* * dx, 
0 
1 
g(x) 2 —— R(s)x-" ds. 
2ni Res=09 


This correspondence between Dirichlet series 
and automorphic forms can be generalized to 
the case of a certain discrete subgroup of 
SL(2, R), which is not necessarily of the first 
kind (Hecke, Math. Ann, 112 (1936); [ 10]). It is 
also known that if the functional equations for 

2» 1x(n)a,n ? hold for sufficiently many char- 
acters y, then f=)", a,q"is a cusp form with 
a character for some congruence subgroup 
(Weil, Math. Ann, 168 (1967); [12]). 

Now, suppose that a linear subspace W of 
9X, (D) is invariant under all T, (n —1,2,.. . ), 
and let (f,,. . ., f.) (x 2 dim Mè) be a basis of Mi. 
If we denote by A(n) the K x K matrix represent- 
ing T, in this basis, it can be shown that there 
exist x x K (complex) matrices B® (1 <i <x) 
and A(0) such that we have 


K 


2) =) a" n) =>), fil2)B®. 


Similarly we associate with F(z) a matrix- 
valued Dirichlet series 
SH IT 
n=] 
Then from the multiplicative property of the 
A(n), we obtain the following tEuler product 
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expression of ®(s): 


O(s)=[P]U.-A(p)p ^p" 71) 
P 


where [. denotes the unit matrix of degree x. 
In particular, when dim 9M = 1, Le, when W 
= Cf, where f is an teigenfunction of all Hecke 
operators 7), the corresponding Dirichlet 
series (p(s) has an Euler product of the follow- 
ing form: 
p(s)=[](1-A,p +p. 

p 
For instance, the Dirichlet series correspond- 
ing to the Eisenstein series G, is (2(27i)* (k— 
1))C(s)C(s— K+ D) (where ( is the Riemann 
zeta function), which has a well-known Euler 
product. Since S,, = CA, the Dirichlet series 
corresponding to A(z) also has an Euler prod- 
uct. (This is part of the Ramanujan conjecture. 
Ramanujan also,conjectured that the quadra- 
tic polynomial 1 —4, X + p! ! X? appearing in 
this Euler product has imaginary roots, Le., 
|A,|< 2p! 7.) Generalizing the Ramanujan 
conjecture, Petersson conjectured that all 
eigenvalues of the Hecke operator T, in Š, 
have absolute value < 2p* !'? (Ramanujan- 
Petersson conjecture. P. Deligne proved this 
conjecture for K> 2, and his method extends to 
the case when T is a congruence subgroup 
(Publ. Math. Inst. HES, 53 (1974)). For k= 1, 
see Deligne and J.-P. Serre (Ann. Sci, Eade 
Nom Sup., 7 (1974)). It may be noted that this 
conjecture was proved earlier for K= 2 (and for 
almost all p) by M. Eichler (1954) and by G. 
Shimura (1958) (— 450 Zeta Functions M). 


E. The Case of Many Variables [1, 15.161 
Let X be a bounded domain in CH and F a 
tdiscontinuous group of analytic automor- 
phisms of X. The Poincaré series of weight (= 
Section A) mz 2, defined similarly to the case 
of the unit disk, converges normally in X and 
expresses an automorphic form of weight m If 
TAX is compact, let (f,, , fẹ) be a basis of the 
space of Poincaré series of weight m Then for 
a sufficiently large m (which is a multiple of 
the order of I, for allx € X), the map X 32 
(f,(z), ,f.(z)) defines in a natural manner a 
projective embedding of the quotient space 
IX into P*^! (C), which actually gives an 
analytic isomorphism of [^X onto a tnormal 
projective algebraic variety. It follows that the 
field of automorphic functions with respect to 
T is an talgebraic function field of dimension 
N, and that every automorphic function can 
be written as the quotient of two Poincaré 
series of the same weight. In the case where 
T\X is not compact, we first discuss some 
examples. 
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F. Siegel Modular Functions [15,17,18] 

The Siegel upper half-space $, of degree H (or 
Siegel space of degree n) is, by definition, the 
space of all nx n complex symmetric matrices 
Z-X-iY with the imaginary part Y > 0 (posi- 
tive definite). (, is analytically equivalent to a 
'symmetric bounded domain.) The group of all 
(complex) analytic automorphisms of $, is 
given by the real (projective) tsymplectic group 
Sp(n, R)/{ + Lan}, which acts transitively on 

H, by Z>0(Z)=(AZ + B/(CZ + D) foro = 


A B 
L ef Sp(n, R). The subgroup I, = Sp(n, Z) 


consisting of integral matrices, or the corre- 
sponding group of linear fractional transfor- 
mations, is called the Siegel modular group 

of degree n T, is a *discontinuous group of the 
first kind acting on §,. 

A Siegel modular form f of weight k or of 
dimension -k is, by definition, a holomorphic 
function on §, satisfying the following con- 

B 

NL I, f(o(Z)) 7 
f(Z)det(CZ + D: (ii) f has an integral Fourier 
expansion of the form f(Z) = Ero are" (02, 
where T runs over all half-integral *positive 
semidefinite symmetric matrices of degree n. 
(Note that the Jacobian determinant J,(Z) of 


A B 
g- ( ei is given by J,(Z) = det(CZ + 


D) * !) For nZ 2, condition (ii) is superfluous 
(M. Koecher, Math. Z., 59 (1954)). We denote 
by MP =M,(T,) the space of all Siegel modu- 
lar forms of degree n and weight k When 


. Z ; 
we viteg,3z= (^ 3) with Z, € $,.,,3€ 
C" ,ze$,, and 


ditions: (i) f ^ 
1üuons: (1) lor g = 
OW 


Ms f(Z)= 3 a,(Z,, Aler, 
n=0 


it turns out that ag(Z,, 3) depends only on Z; 
and, writing it as fi (Z,), we have f, eM". 
The mapping d: f f, thus defined is a linear 
mapping from Wt” into SW? P, which is suf- 
jective if Kis even and > 2n (H. Maass, Math. 
Ann., 123 (1951)). We denote the kernel of ® 
by S(” and call feS” a cusp form (viewing 
H,- as a cusp of §,). For fe IM, the fol- 
lowing three conditions are equivalent: (a) 
fe SP; (b) fZ) 2 X rare rITA: (o) 

| f(Z)det( Y)*” is bounded. We have Ij” = 
(0) if K<Oor if both k and n are odd, and 
MW = C. In general, dim WMP is finite and 

= OKT?) (k-t oo). The graded algebra 
MT) = EL My” is finitely generated, and 
its Krull dimension is n(n + 1)/2 + 1. In par- 
ticular, for n= 2 the structure of the graded 
algebra I(T) is determined explicitly. (The 
even part of this algebra is isomorphic to the 
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polynomial ring of four variables.) Also, 

dim JR? and the singularities of the quotient 
variety ID, $5, are known explicitly (J. 1. Igusa, 
Ame. J. Mdh, 84 (1962), 86 (1964); U. 
Christian). 

As an example of Siegel modular forms, we 
have the  Eisenstein-Poincaré series, delined as 
follows: 

Es (Z) = e2nitr(Sa(z)) det(CZ + D) E 
cer, SAT, 

where S is an Ax H rational symmetric matrix 

> 0, and we put 


U mu" 
r- s es Jer 'USU =S, 





e? tr(ST) = det( un 


This series is convergent for k> n + rank $ + 1 
if rank S <n and for k » 2n if S — 0, and the 
totality of Es , (those series with S > 0) spans 
M (SH) (Maass, Math. Ann., 123 (1951)). 
*Theta series delined by integral quadratic 
forms are examples of Siegel modular forms 
(with a certain level) and are of great signifi- 
cance in the arithmetic of quadratic forms 
(C. L. Siegel, Ann. Mdh, (2) 36 (1935). 

The quotient space I; ^ $, can be compacti- 
fied as follows (I. Satake and W. Baily [15]): 
There exists a positive integer kọ such that, 
for any multiple k of ko, any basis of Wty” 
defines in a natural manner a one-to-one 
biholomorphic projective embedding of I5, 
of which the image is a Zariski open set of a 
normal projective algebraic variety. Since the 
structure of this projective variety is indepen- 
dent of k, we denote it simply by I$, . In 
other words, (A8, is the projective variety 
Proj 90t(I,) associated with the graded algebra 
MT). Then we have I, A$, = Ur- TAS,- 
When nz 2 for r < n, [A $,, is of codimension 
> 2 inI,\§,,, So that (by virtue of Hartogs's 
continuation theorem) the conditions at the 
points at infinity in the delinitions of modular 
functions and forms become superfluous. In 
fact, for n > 2, if we define a Siegel modular 
function of degree n simply as a Į -invariant 
meromorphic function on §,, then it can auto- 
matically be extended to a meromorphic func- 
tion on the compactilication I, $,, and hence 
be expressed as the quotient of two modular 
forms of the same weight. It also follows that 
the field of all Siegel modular functions of 
degree n is an algebraic function lield of di- 
mension n(n 4- 1)/2. (These results can also be 
obtained from the tpseudoconcavity of I, S, 





without using compactilication (A. Andreotti 
and H. Grauert, Nachr. Akud. Wiss. Góttin- 
gen).) For n = 2, the field of Siegel modular 
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functions is a rational function lield. On the 
other hand, for ri = 1 mod 8, n> 9, the field 
of Siegel modular functions is not rational 
(E. Freitag, Abh. Math. Sem Univ. Hamburg 
(1978). 

Let I' be a subgroup of I, of finite index. 
Then [A $, can be compactified in the same 
way as above [23]. However the singularities 
of I$, are extraordinarily complicated (Igusa 
[24]). When T is the principal congruence 
subgroup T,(N) of level N > 2, the blowing-up 
of (NJ $, along the cusps is nonsingular for 
n = 2, 3 (Igusa, Mah Ann, 168 (1967). Fol- 
lowing Igusa's idea, it is possible to construct 
an explicit nonsingular compactification of 
ID, in which D is a bounded symmetric 
domain and T is a suitable arithmetic sub- 
group of Aut(D) ([19]; see also Satake, Bull. 
Amer. Math. Soc, 79 (1973)). For D= $x $ 
and T the Hilbert modular group (— Section 
G), F. Hirzebruch has given explicit nonsin- 
gular compactilications of F\D (Ensagnenet 
Math, 19 (1973). 

The Hecke theory (— Section D) can also be 
extended to the Siegel modular case. For any 
integer M> O, let $,, = St? be the set of 2n x 2n 
integral matrices M such that 'MJM - mJ, 

0 =i " i i 
where EI J and J, is the identity 
matrix of degree n. We define the Hecke 
operator T(m) on MP by 


T(m)f — m" ""*02 X 
(4 8)-Mevim) 


det(CZ 


+ D) *f(MZ), 


where fe qno, A, = U MeV(m) IM, and V(m) is 
a complete system of representatives of I,AS,,. 
Then for any prime number p, the formal 
operator series D,(X)= DY", T(p") X" is form- 
ally equal to a rational function in X: 
PAX) 

Q(X) 
in which P,(X) and Q,(X) are polynomials in 
X of degree 2" = 2 and 2", respectively, with 
coefficients in the ring of Hecke operators 
(Satake, Publ. Math. Inst. HES, 18 (1964)). For 


n =2 we have (Shimura, Proc. Nat. Acud. Sd. 
US, 49 (1963)) 


D,(X)2 (1 p? * *x?)(1— T(p)X +(T(p) 
T(p?) =p tx aps T(pX? 
ET we d t. 


Let f(Z)eM be a Siegel modular form of 
degree 2 such that T(m)f — A(m)f for all mè 1. 
We define L ,(s) by 


D(X) 





e / 
Lade CQs—2k--4) X E 
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in which ((s) is the Riemann zeta function. It is 
known that L s(5) can be meromorphically 
continued to the entire plane and satisfies the 
functional equation 


V,Qk = 2—5) =¥,(5), 


where Y,(s) = (2) ?T'(sI'(s K+2)L,(s) and 
the function (s = k)(s — k+ 2)'¥,(s) is entire (A. 
Andrianov, Trudy Math. Inst. Stédoo,, 112 
(1971) 


G. Hilbert Modular Functions 


Following a suggestion of Hilbert, 0. Blu- 
menthal studied a generalization of modular 
functions of the following type: Let K be a 
totally real talgebraic number field of Imite 
degree, and let K'?, , K™ be the conjugates 
of K, where n 2 [K : Q]. (“Totally real” means 
that all the K are real.) For à € K, we denote 
by ge K® the ith conjugate of a. Let D be the 
+Principal order of K, and consider the group 


Tp =SL(2,D) 


Jr? 


We define the action of Fp on the n-fold prod- 
uct of the upper half-plane "= {z = (z,. . . z,) 
z;€H} by 


(1) (1) a (n) 
TE ntp z,- ) 





dE 


yUz, A 00 yz ës 


x e 
for g= (x ; ) then T°, becomes a discontinu- 


ous group of the first kind (Tẹ can also be 
considered as an "irreducible discrete subgroup 
of SL(2, R)“). The group I} is called the Hil- 
bert modular group of K (in the strict sense). 
(The defmition is sometimes modilied by re- 
placing the condition a — fy — 1 by, say, xô 
By «(totally positive tunit).) If the tclass num- 
ber of K is h, the quotient space I, $" can 

be compactified by adjoining h points at in- 
finity. Therefore, if n 2 2, a Hilbert modular 
function can be defined as a meromorphic 
function on mi invariant under ['4; and simi- 
larly a Hilbert modular form f of weigbt k 

or of dimension -k as a holomorphic function 
f on $” such that (f| 90) = flo2)) TT, 69; 
000) * 5 f(z) for all cT. (In the latter case, 
fis holomorphic at all cusps, i.e., for every 

c € SLQ, K), f|o has an integral Fourier ex- 
pansion.) For Hilbert modular functions and 
forms, results quite similar to those for the 
case n = 1 or the case of Siegel modular groups 
have been obtained (Kloosterman, Maass, K. 
B. Gundlach, H. Klingen). 
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H. Furtber Generalizations 


As further generalizations of the notion of 
modular function, we have Hilbert-Siegel 
modular functions (1.1. Pyatetskii-Shapiro, 
Baily, Christian [20], where we can tind some 
150 references), Hermitian modular functions 
(H. Braun, Klingen), etc. For the most general 
case, Le. for an arithmetically defined discon- 
tinuous group acting on a *symmetric bounded 
domain, a unified theory of automorphic func- 
tions has been established in the works of 
Pyatetskii-Shapiro [21,22] and Baily and A. 
Borel [23,24]. 

On the other hand, exactly as in the classical 
theory, where the elliptic modular function 
gave an invariant of l-dimensional complex 
tori, generalized modular functions can be 
viewed as giving an analytic invariant of a 
certain family of (polarized) tAbelian varieties. 
From this point of view, a deep number- 
theoretic (and algebrogeometric) study of 
automorphic functions (initiated by Hecke and 
Eichler) has been substantially carried forward 
by the work of Shimura (see the serjes of his 
papers in Ann. M &h. starting from vol. 70 
(1959); see also [9, 24, 25]). 

From the analytic point of view, the theory 
of automorphic functions is closely connected 
with the unitary representation of G in the 
space L,(T\G) (or its adelic analog) [26, 27] 
(for adelic analogs = [ 12,131). In this respect, 
the *trace formula of A. Selberg [28], gen- 
eralizing the --Poisson summation formula, 
is of fundamental importance; and actually 
it can be used effectively for calculations of 
the dimension of the space of automorphic 
forms and of the trace of Hecke operators 
(R. P. Langlands, Amer. J. M &h., 85 (1963); 

H. Shimizu, Ann. M &h., (2) 77 (1963), J. Fac. 
Sd. Univ. Tokyo, 10 (1963); also [24]). When 
X = G/K is a symmetric domain, we can define, 
for any representation p of K, a (matrix- 
valued) canonical automorphy factor, by 
which we defme (vector-valued) automorphic 
forms with respect to a discrete subgroup [^ of 
G, and under a further condition (say, I free, 
IX compact, and the thighest weight of p 
sufftciently large) we obtain a formula for the 
dimension of the space of such automorphic 
forms in terms of the tarithmetic genus of IX 
and certain numbers related to the “dual” X, 
= G,/K and the representation p [ 15,29, 30]. 
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33 (1.5) 
Axiomatic Set Theory 


A. General Remarks 


Axiomatic set theory pursues the goal of rees- 
tablishing the essentials of G. Cantor's rather 
intuitive fset theory by axiomatic construc- 
tions consistent with modern theories of the 
foundations of mathematics. 

A system of axioms for set theory was first 
given by E. Zermelo [47], and was completed 
by A. Fraenkel [8]. J. von Neumann [30] 
expressed it in *symbolic logic, gave a formal 
generalization, and eliminated ambiguous 
concepts. P. Bernays and K. Godel [2,11] 
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further refined and simplified von Neumann's 
formulation. The theories based on the sys- 
tems before and after the formal generaliza- 
tion are called Zermelo-Fraenkel set theory 
(ZF) and Bernays-Gédel set theory (BG), 
respectively. 


B. Zermelo-Fraenkel Set Theory 


ZF is a formal system expressed in ffirst-order 
predicate logic with the predicate symbol — 
(equality) and based on the following axioms 
1-9 (- 411 Symbolic Logic). These axioms 
do not contain any predicate symbol other 
than e, where x€ y is read “x is an element 
of y." Any formula containing only Easa 
predicate symbol is called a set-theoretic for- 
mula. Following the usual convention, we omit 
the outermost universal quantifier, and use 
restricted quantifiers such as 3x € a, Vx € a 
Axiom | (axiom of extensionality): 


Vx(xeazxeb)oa- b. 


This asserts that sets formed by the same 
elements are equal. The formula x € a(x € b) is 
denoted by a c b. This means “a is a subset of 
b." Then Axiom 1 can be expressed by 


acb^bca-azb. 

Axiom 2 (axiom of the unordered pair): 
dxVy(yex=y=avy=b). 
This asserts the existence, for any sets a and b, 
of a set x having a and bp as its only elements. 
This x is called the unordered pair of a and b 


and is denoted by (a, b). The notion of ordered 
pair is characterized by 


(a,b) - (c, d) & a c Ab —d. 


An example of such a set is (a, b) = 1 (a, a}, 


{a,b} }. 


Axiom 3 (axiom of the sum set): 
dx Vy(yex = 4zea(yez)). 


This asserts the existence for any set a of the 
sum (or union) x of all the sets that are ele- 
ments of a. This x is denoted by Ua or S(a). 
We writeau bforU (a, b}and a’ for 
aU {a,a}. 

Axiom 4 (axiom of the power set): 


Yxdy(yex=Vze y(zea)). 


This asserts the existence for any set a of the 
power set x consisting of all the subsets of a. 
This x is denoted by P(a). We have S(P(a)) = 
a, SO $ is a left inverse of P, and the products 
SP and PS are idempotent. 

Axiom 5 (axiom of the empty set): 


dx Vy( 1lye x). 
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This asserts the existence of the empty set. The 
empty set is denoted by (7j or 0. 
Axiom 6 (axiom of infinity): 


Jx(0ex v Vyex( y'ex)). 


This asserts the existence of a set containing 
all the natural numbers, where 0, | = 0’ = {0}, 
22 1'={0, 1}, 3=2'={0,1,2}, This defini- 
tion of natural number is due to von 
Neumann. 

Axiom 7 (axiom of separation): 


jx Vy(yex &yea^ A(y)). 


This asserts that the existence for any set a and 
a formula A( y) of a set x consisting of all 
elements of a satisfying A( y). This x is denoted 
by (yea A( y)). This is rather a schema for an 
intinite number of axioms, for there are an 
infinite number of A( y). This axiom, also 
called the axiom of comprehension or axiom 
of subsets, was introduced by Zermelo. 

For example, the set of all natural numbers 
is introduced by 


[xea|Vy(0ey^Vzey(z' ey) > zer), 


where a is a set satisfying Axiom 6. The set of 
all natural numbers is denoted by œ or N. 
Axiom 8 (axiom of replacement): 


ixVyea(3zA( y, z) o dz e xA( y, z)). 


This asserts the existence for any set a of a set 
x such that for any y of a, if there exists a z 
satisfying A( y, z) then such z exists in x. If the 
relation A( y, z) determines a function, then the 
image of a set by the relation is included in a 
set, SO by Axiom 7, the image is a set. This 
axiom was introduced by Fraenkel. 

Axiom 9 (axiom of regularity): 


ixA(x)odx(A(x) ^ Vyex( 1A( y))). 


This asserts that if there exists a set satisfying 

A(x), then there is a set x satisfying A(x) but 

every element y of x does not satisfy the prop- 

erty ,4(y). This axiom is also called the axiom 

of foundation. Its dual expression, called €- 

induction, is of course equivalent to Axiom 9. 
Axiom 9’ (axiom of E-induction): 


Vx(VyexA(y) A(x))-5 VxA(x). 


Using this we can show that xex, 

TEY A y € x), etc. If we assume the axiom of 
choice stated below, then this is equivalent to 
the nonexistence of an infinite descending 
sequence 


X,€ ... € X4€ X4. 


If a model of set theory satisfles the axiom of 
regularity and has an infinite descending se- 
quence that is not in the model, then such a 
model is called a nonstandard model. 
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Axiom 10 (axiom of choice): 
VxeadyA(x, y) 3yVxeaA(x, y(x)). 


This asserts that if for any element x of a there 
is a set y such that A(x, y), then there is a 
choice function y for the formula, i.e., 

A(x, y(x)) for all x in a. Usually a function is 
represented by its graph. A set fis called a 
function detined on a if 


VxVy((x, y)ef—5xea), | Vxeady((x, y)e f). 


VxeaVyVz((x, y)ef ^(x, z)e f 5y-z). 


This formula is denoted by Fnc( f); then the 
formula A(x, f (x)) is an abbreviation of 


Fne(f) ^ Jy((x, y)e f ^ A(x, y)). 


The axiom of choice is equivalent to many 
properties, such as the well-ordering theorem, 
Zorn's lemma, and Tikhonov's theorem on the 
product of compact Spaces, and it is used 
widely and essentially in mathematics. 

The system of axioms 1 to 9 is called 
Zermelo-Fraenkel set theory and is denoted by 
ZF; the system ZF minus the axiom of replace- 
ment is called Zermelo set theory, denoted by 
Z; and the system ZF plus the axiom of choice 
is denoted by ZFC. 

The system Z is weaker than ZF. Indeed, the 
existence of the set œw of all the natural num- 
bers and of P(ro), P(P(w)), can be proved 
in Z, but the existence of the set{w, p(w), 
P(P(c)), .} cannot be proved in Z. However, 
we can prove its existence in ZF. 

The theory ZF minus the axiom of intinity is 
called general set theory. Its consistency can be 
reduced to the consistency of the theory of 
natural numbers as follows. Let n be any 
natural number and n = 2": + 2^ +1 + 
2"(n, < n, < < ną) be its binary expan- 
sion. We can define a model of general set 
theory by identifying a set with the natural 
number n and defining m € n by “m appears as 
one of the n;.” In general set theory we can 
show the existence of 0, P(O), P(P(0)), but 
the existence of the set (0, P(O), P(P(0)), .} 
cannot be proved. However, we can prove its 
existence in Z. The set consisting of all heredi- 
tary finite sets is the smallest model of general 
set theory. 


C. Bernays-Gódel Set Theory 


The existence of (u| A(u)} for an arbitrary set- 
theoretic formula .4(u) cannot be deduced 
from the axioms of ZF. We call this object a 
class to distinguish it from sets. We introduce 
a generalized logical system of first-order 
predicate logic by adding to the logical system 
used in ZF class variables, class constants, and 
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inference rules with respect to quantifiers for 
classes. For any set-theoretic formula A(u)in 
which no *bound class variable occurs, we 
adopt 


iX Vu(ue X S A(u)) 


as an axiom, where capital letters X are class 
variables. The set theory thus obtained is 
equivalent to Gódel set theory [11]. Von 
Neumann axiomatized set theory by making 
use of the notion of functions instead of that of 
classes [30]. In refining this theory and intro- 
ducing the notion of classes, Bernays and 
Godel [2,11] initiated Bernays-Gódel set 
theory, BG. 

ZF and BG are related as follows: Any 
formula provable in ZF is provable in BG, and 
any set-theoretic formula provable in BG and 
having neither class variable nor class constant 
is provable in ZF. In this sense, the systems 
can be regarded as essentially equivalent, but 
as BG has class variables and class constants, 
it is more convenient for expressing set- 
theoretic notions. 

Von Neumann detined the following func- 
tion R by *transfinite induction: 


RO) = Ø, RUR y B(R()), 
where y and D are ordinal numbers and 


Up; B(R(B)) denotes the set sum of $(R(0)), 
DRUI... BCR(B)), ... (B <a). The function R 
can be detined by a set-theoretic formula, so it 
exists as a class. Now consider the model 

M = M(a) for a fixed ordinal number x, We 
define sets of the model M as elements of R(«), 
and classes of the model as subsets of R(x). We 
denote the e relation of the model by eu. For 
classes X and Yof the model, we write XE, 
Ye X c Y Then a necessary and sufficient 
condition for R(x) to be a model of BG is that 
xis an tinaccessible ordinal number (— 3 12 
Ordinal Numbers). The existence of an in- 
accessible ordinal number cannot be deduced 
from the axioms of ZF. There is a series of 
studies of axiomatization of set theory in 
which any number of inaccessible ordinal 
numbers is assumed to exist [23]. When R(a) 
is a model of BG (ZF), it is called a natural 
model of BG (ZF). Furthermore, if H is U a R(x) 
then H satisfies all the axioms of BG (ZF). As 
we do not need the axiom of regularity for 
defining the class H, we see that BG (ZF) is 
consistent as long as BG (ZF) without the 
axiom of regularity is consistent. 


D. Independence of the Continuum Hypothesis 
and the Axiom of Choice 


These axiomatizations of set theory motivated 
a series of studies from the standpoint of the 
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tfoundations of mathematics on problems that 
remained unsolved after the appearance of 
Cantor's primitive set theory. Among these, 
the problem of the relation between the con- 
tinuum hypothesis and the axiom of choice 
was central. 


Consistency of the Axiom of Choice and the 
Continuum Hypothesis. Gödel [11] proved 
that if ZF without the axiom of choice is con- 
sistent, then the system obtained by adding to 
ZF the axiom of choice and the continuum 
hypothesis is also consistent. To show this, he 
constructed a model of ZF satisfying the 
axiom of choice and the generalized con- 
tinuum hypothesis as follows. Assume first that 
M is an arbitrary domain of objects among 
which the e relation is detined. By a formula 
on M, we understand a formula having con- 
stants of M as its only constants, having € as 
its sole predicate symbol, and further having 
exclusively variables whose ranges are re- 
stricted to M. Let us denote by M' the totality 
of subsets of M detined by a formula A(x) on 
M. Now we put Mass (5, M, = Ma Mp = 
(La Ma if B is a "limit ordinal number. We 
call x constructible if x € M, for some ordinal 
number «, assumed to be less than the first 
inaccessible ordinal number, if any. We denote 
the totality of constructible sets by L, and the 
totality of sets of ZF by V. We call the asser- 
tion V= L, that is, every set is constructible, 
the axiom of constructibility. If we add this 
axiom to ZF, the axiom of choice and the 
generalized continuum hypothesis become 
provable. On the other hand, if we regard 
elements of L as sets of the model and the 
original e relation as the e relation of the 
model, we have a model of ZF in which the 
axiom of constructibility holds. 


Independence of the Axiom of Choice and tbe 
Continuum Hypothesis. Since the result of 
Gödel, attempts have been made to prove that 
the axiom of choice is independent of the other 
axioms. Fraenkel constructed a model of set 
theory without satisfying the axiom of choice, 
starting from a countable number of objects 
that are not sets. A. Mostowski constructed in 
ZF a model of set theory having objects that 
are not sets, and he proved that the model 
satisfies the axiom: Every set can be linearly 
ordered, but does not satisfy the axiom of 
choice [27]. E. Mendelson constructed a 
model of set theory that does not satisfy the 
axiom of choice by making use of an infinite 
descending chain a, 3 4) 3 a3 3. [25]. These 
models, however, do not satisfy al] the axioms 
of ZF or of Zermelo set theory minus the 
axiom of choice, even though they satisfy most 
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of the axioms. Consequently, they are not 
sufficient for proving independence. 

P. J. Cohen [3,4] proved the following 
results in connection with the independence of 
the axiom of choice, the continuum hypo- 
thesis, and V2 L: If ZF is consistent, each of 
the following conditions has a model. (1) The 
axiom of choice as well as the generalized 
continuum hypothesis holds, but there exists 
an a satisfying a ¢ L and a C w, (2) $c) is not 
well-ordered. (3) The axiom of choice holds, 
but the continuum hypothesis does not hold. 
(4) SB(33(c))) cannot be linearly ordered. 

By (1), we see that V= L is independent of 
the axiom of choice and the generalized con- 
tinuum hypothesis: (2) shows the independence 
of the axiom of choice; and (3) shows that the 
continuum hypothesis is independent of the 
axiom of choice. In (4), R (3B(c)) corresponds 
to the set F of all real-valued functions detined 
on the interval [0, 1], and (4) shows that the 
proposition “F can be linearly ordered" is not 
deducible in ZF without the axiom of choice. 


E. Some Recent Results 


Boolean-Valued Set Theory. D. Scott and 
R. M. Solovay defined models of set theory in 
which the set-theoretic formula has values in a 
complete Boolean algebra. This viewpoint is 
motivated by P. J. Cohen's original notions of 
the forcing relation and the generic filter. 

According to the relation between sets and 
their representing functions, a function f: A+ 
B corresponds to a subset of A when B is a 
complete Boolean algebra. Sb by transtinite 
induction we put 
yp- LJ P999, 

[ica 

where P/?(A) is the Boolean-valued power set 
of A defined by 


ue P'P(4)zu: AO B. 


Let V? be the union of al] V"s; then the 
truth values of the formulas Tue v] and [u= v] 
are detined by transtinite induction as 


[uev] = ss v(x): [x =u], 
[v=o] 


= [I| «= [xer}- [I 


xedom (u) xedom(v) 


v(x) => [xeu]. 


where a — b is an abbreviation for the element 
—a- b in B. 

In V'? the following properties are satistied: 
(i) The truth value of any formula provable 
from ZFC is 1, the largest element of B. 

(ii) Any complete homomorphism h: B, > B, 
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can be extended to 
h: V8) V», 
so that the commutative relation 


h([A(uy, ....,) ) S [AQQ) ... hlun))] 


is satisfied for every restricted formula; further, 
if his an epimorphism then it is satisfied for 
every formula. 

(iii) (maximum principle) For any formula A(x) 
there is an element y of V? such that 


(3x409] = [AW]. 


Moreover, if [JxR(x)] = 1 then there an ele- 
ment u of V such that [R(u)] = 1 and 


[3x(R(x) A A(x) | = [AQ]. 


Property (1) means that if /A/ « 1, then A is 
not provable from ZFC. Many consistency 
results in set theory are obtained in this way 
by constructing special partial order struc- 
tures, topological spaces, or complete Boolean 
algebras. 


Axiom of Strong Infinity. The axiom of infin- 
ity, Axiom 6, of ZF asserts that there exists a 
set including the set œ of all natural numbers. 
The set (9 is the smallest infinite cardinal num- 
ber. In general, the axiom of strong intinity is 
stated in a form such that certain special prop- 
erties of q are satistied for an uncountable 
cardinal number. 

Though such an axiom asserts the existence 
of large cardinal numbers, their properties 
sometimes reflect, for example, those of the 
real numbers, or of the set of real numbers, etc. 

We state here some typical examples: 

(i) Weakly inaccessible cardinal numher. This 
is a regular limit cardinal number, that is, 


Cflm,)=@,, —Vfi«a(og., < Oa) 


(ii) Strongly inaccessible cardinal number. 


cf(o,)2 a VB < a(2?8 < 0). 


(ui) Weakly compact cardinal number. @, is 
uncountable and the space X = 2?« with < œw,- 
topology is w,-compact. 

(iv) Measurable cardinal number. c», is un- 
countable and there is a nonprincipal <œ,- 
additive 2-valued measure y: P(w,)->{0, 1). 

(v) Strongly compact cardinal number. œ, is 
uncountable and any product of 2 = (0, 1 j with 
< w,-topology is w,-compact. 

The strength of these properties increases in 
the order presented here. Properties (1) to (iii) 
are compatible with the axiom of constructi- 
bility V= L. And (iv) implies V z L while it is 
compatible with V= L(y). But (v) implies 
VÆ L(a) for any set a. 

There are many other stronger axioms of 


150 


infinity, and the consistency proofs of these 
“axioms” within ZF or ZFC would involve 
essential difficulties. Inconsistency proofs of 
these properties, if they exist, would be very 
interesting. 


F. Examples of Results 


(1) Cardinality and Cofinality (— 3 12 Ordinal 
Numbers). (i) (W. B. Easton [6]) Let W be a 
model of ZFC (the Zermelo-Frankel axioms 
plus the axiom of choice) in which the GCH 
(generahzed continuum hypothesis) is valid, 
Le, Va(2?« = 0,41), and let g be a function 
from ordinals to ordinals in Wt such that Va, 
Dia < B => 0, € Da gl and Vala, < cf(c, ()). 
Then there is a Boolean model St of ZFC, 9t 5 
Wt, having the same cofinality and satisfying 
2?« =(g(a) for every regular cardinal. (This 
means that Kónig's condition (cf(2?«) > @,) is 
the only restriction on the cardinality of 
powers of regular oi. / 

(ii) (J. H. Silver [39]) Suppose 


o «cf(a)- cf(o,) < o. 

Then for any 4 « cf(a) 

Vv «a(2?v€ 0,,,)22"* X 0,, ;. 
However, the validity of the implication 
Vn «o(2?»— 0,41) 222 = 0,4 


still remains an open question. And it is 
known that if there is a model for 


Vn ln — 0,44), 29° 9) elo 


then there is a model in which there are many 
measurable cardinal numbers. 

(iii) (K. Prikry [31]) Let œ, be a measurable 
cardinal number in a model M of ZFC. Then 
there is a Boolean extension N of M in which 
the notion of cardinality is not changed but 
the notion of cofinality is changed, that is 
cf(w,) = o, in M but cf(o) = @ in N. 

(iv) (Solovay) Let o, be a strongly compact 
cardinal number. Then for any strong-limit 
singular cardinal c, > c, the continuum hypo- 
thesis holds at cog; that is, the implication 


cf(oj «os,  Vy«pQ?r«oa,)»2?85— Oe 


is provable. 


(2) Lebesgue Measurability and the Baire Prop- 
erty. As is well known, every Aj (Borel) set 
(and consequently every 2j (analytic) set) of 
real numbers is Lebesgue measurable and has 
the Baire property. 

(i) (Gódel) V= L implies the existence of a 
A} set of real numbers that is neither Lebesgue 
measurable nor has the Baire property. 


Let DC denote the principle of depending 
choice: 


VxeadyeaP(x, y) 
23f:oaVneoP( f(n), f(nt D). 


DC is adequate for the development of the 
classical notions of measure theory, such as 
Jordan decomposition, the Radon-Nikodym 
derivate, etc. Let I denote the hypothesis 
da(cf(w,) = 2,4 VB < a(2?8 < w,)) (strongly 
inaccessible). 

Gi) (Solovay [40]) The consistency of ZFC 
and 1 implies that of ZF and either of the 
following two axioms: 

(a) DC plus the hypothesis that every set of 
real numbers is Lebesgue measurable and has 
the Baire property. 

(b) The axiom of choice plus the hypothesis 
that every set of real numbers definable by an 
() sequence of ordinals is Lebesgue measurable 
and has the Baire property. 

Axiom (a) means that DC is not strong 
enough to construct a (Lebesgue) nonmeasur- 
able set, while Axiom (b) implies that every 
projective set is Lebesgue measurable and has 
the Baire property, and hence implies that the 
set of all constructible real numbers, as a X1 
set, has Lebesgue measure 0, and is of the first 
category. 


(3) Martin’s Axiom. Let B be a Boolean 
algebra. We say that B satisfies c, C.C. (chain 
condition) if the cardinality of every disjoint 
family of positive elements of B is at most @,. 
Let B* denote the topological space consisting 
of all homomorphisms h: B2 = (0, 1} with 
the open base U(a)= (h|h(a) 2 1} (ae B); then 
B* is a *Baire space. Then Martin's axiom 
(MA) is: Let B be an œ c.c. Boolean algebra 
and a < 2". Then the intersection of a dense 
open sets is dense in B*. Since {h X h(a,)= 
h(b)} is dense and open in B* if X, a, = b, 

MA means the existence of an h e B* preserv- 
ing any given set of x ( < 2") equations in B, If 
2" = w,, MA merely reduces to the Baire prop- 
erty of B*. However, if 2" > œ, then the œ 

C.C. hypothesis is essential, for there exists a 

B satisfying o C.C. such that BS contains €, 
dense open sets with empty intersection. 

(i) (Solovay and S. Tennenbaum [43]) The 
consistency of ZF implies that of ZFC, MA, 
and 2? > %4. 

(ii) (D. A. Martin and Solovay [24]) ZFC, 
MA, and 2” > o, imply the propositions: 

(a) Vo, < 2?(2?1 = 2^), hence 2” = 2%; 

(b) the totality of the first category sets of 
Lebesgue measure zero sets is a-additive for 
anya<2”; 

(c) every X i set of real numbers is Lebesgue 
measurable and has the Baire property. 
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(4) Suslin's Hypothesis (SH) is: Every dense, 
linear, order complete set without end points, 
having at most q disjoint intervals, is order 
isomorphic to the continuum of real numbers. 

(1) (T. J. Jech, Tennenbaum) The consistency 
of ZF implies that of (a) ZFC, 7 SH, and 
GCH, as well as (b) ZFC, '1SH, and 2” > w, 

(i1) (R. Jensen [15]) The consistency of ZF 
implies that of ZFC, SH, and 2” = @,. 

(iii) (Solovay and Tennenbaum [43]) ZFC, 
MA, and 2” > œ imply SH. 

(iv) Jensen [15]) V=L implies 7 SH. 


(5) Measurahle and Real-Valued Measurahle 
Cardinals. A cardinal K > % is said to be 
measurahle if there is a measure 4: Bx) 

{0, 1} with (a) u(x)— 1, (b) Vv < x(u({v})=0) 
and (c) (2, <a A) = Zen H(A) and K is said 
to be real-valued measurahle if there exists a 
JU: B(x)  [0, 1] that satifies (a), (b) and (c), and 
is not measurable. 

Let MC (RMC) denote the existence of a 
measurable cardinal (real-valued measurable 
cardinal). 

(i) (S. Ulam [45]) The existence of an W- 
additive measure vu: 8(A)— [0, 1], with p(A)= 
1 and Vxe A(u([x]) 0), implies RMC or MC; 
RMC implies the existence of an extension 
of Lebesgue measure defined on $( [0, 1]); 
every real-valued measurable cardinal is € 2" 
and weakly inaccessible. 

(ii) Every measurable cardinal is strongly 
(hyper) inaccessible, and (a) J«F(x)3a < 
c, F(a) for any X i formula F(a) on the ordinal 
numbers, (b) JaF(a)3« <p, F(a) for any H. 
formula F(a) and the smallest measurable p. 
(Many results have been obtained concerning 
the ordinal magnitude of œ, and the measur- 
able cardinals.) 

(iii) (Solovay [41]) The consistency of ZFC 
and MC is equivalent to that of ZFC and 
RMC. 

(iv) (Martin and Solovay [24]) ZFC, RMC, 
and MA are not consistent. 

(v) (Lévy and Solovay [21 ]) The consistency 
of ZFC and MC implies that of ZFC, MC, 
MA, and 2? > œ. 

(vi) (Solovay) ZFC and MC imply that 
every Xj set of real numbers is Lebesgue 
measurable. 

(vii) (Martin, Solovay) ZFC, MC, MA, and 
2” > o, imply that every Z} set of real num- 
bers is Lebesgue measurable and has the Baire 
property. 

(viii) (Silver [38]) The consistency of ZFC 
and MC implies that of ZFC and MC as well 
as the existence of (Lebesgue) non-measurable 
A} sets of real numbers. 


(6) Axiom of Determinateness. We consider the 
following inlinite game. Let A be a set consist- 
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ing of functions œw to œ. The players I and II 
choose alternatively natural numbers n; and 
m;, and, for the resulting inlinite sequence, if 


(ni, Mi, n5, ..., ny, m, ...)eA, 


then 1 wins, otherwise II wins. 

A function c: | Jeo c" c is called a 
strategy. If II plays according to the sequence 
m —(m,,m»,....m,, . ) and Į plays according 
to the strategy oc, then the resulting sequence is 
o*m 


—(o(f), m,,o(m,), ..., m, o(m;, ..., my), L 


The strategy g is called a winning strategy for 
I of the game defined above, if for any play m 
of IL the resulting play o*m is in the set A, and 
similarly for the winning strategy of II. 

The axiom of determinateness AD is the 
assertion that for any A c wm”, the player I or 
II has a winning strategy. 

(i) (J. Mycielski [28]) Assume ZF and AD; 
then we have: 

(a) axiom of choice for countable sequence of 
sets of real numbers; 

(b) every set of real numbers is Lebesgue 
measurable and has the Baire property, so the 
axiom of choice does not hold; 

(c) every uncountable set of real numbers 
contains a perfect set. 

(ii) (Solovay) Assume ZF and AD; then we 
have: 

(a) cardinalities of c», and 2" are not 
comparable; 

(b) œ and œ, are measurable cardinals, but o, 
is a singular cardinal such that cf(w,) = (Dy. 

(iii) (Martin) In ZFC, every Borel, namely 
Ai, game is determined. 

(iv) (Martin, Solovay) In ZFC: 

(a) if there exists a measurable cardinal, then 
every analytic, namely Hi, game is determined; 
(b) if every A}-game is determined, then there 
is a model of ZFC with many measurable 
cardinal numbers. 

Since the axiom of determinateness AD is 
very strong, there is some suspicion that it 
might contradict ZF. But at present there is no 
such evidence. 
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Axiom of Choice and 
Equivalents 


A. The Axiom of Choice 


In set theory, the following axiom is known as 
the axiom of choice: For any nonempty family 
9I of nonempty subsets of a set X, there exists 
a single-valued function f, called the choice 
function, whose domain is 9f, such that for 
every element A of d the value f(A) is a mém- 
ber of A. This axiom is equivalent to each of 
the following three propositions: (1) If (A.]... 
is a family of sets not containing the empty set 
and with an index set A, then the Cartesian 
product II i A, is not the empty set. (2) If a set 
A is a disjoint union U , A, of a family of 
subsets {A,},,, which does not contain the 
empty set, there exists a subset B, called the 
choice set, of A such that every intersection of 
B and A (A€ A) contains one and only one 
element. (3) For every mapping f from a set A 
onto a set B, there is a mapping from B to A 
such that fo g = 1, (identity mapping). Also 
equivalent to the axiom of choice are the well- 
ordering theorem and Zorn's lemma, which 
are discussed in the following sections. 


B. The Well-Ordering Theorem 


In 1904, E. Zermelo [2] first stated the axiom 
of choice and used it for his proof of the well- 
ordering theorem, which says that every set Can 
be twell-ordered by an appropriate tordering. 
Conversely, the well-ordering theorem implies 
the axiom of choice. Many important results 
in set theory can be obtained by using the 
axiom of choice, for example, that *cardinal 
numbers are comparable, or that various 
definitions of the tiniteness or infiniteness 

of sets are equivalent. Various important 
theorems outside of set theory, e.g., the exis- 
tence of bases in a ‘linear space, tcompactness 
of the direct product of compact *topological 
spaces (fTikhonov’s theorem), the existence 
of a subset which is not tLebesgue measurable 
in Euclidean space, etc., are proved using the 
axiom of choice. But for those proofs the well- 
ordering theorem or Zorn’s lemma (stated 
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below) are used more often than the axiom of 
choice. 

Using the well-ordering theorem, it can be 
proved in the following manner that for every 
field k, any linear space X over k has a basis. 
Let {x,},-, be an enumeration of X. By ttrans- 
finite induction, we can define the function f 
from X to B= (0, 1j such that (1) if X} = 0, 
f(xo) = 0; if x #0, f(xg) = 1; (ii) for v >0,if x, 
is expressed as a linear combination of ẹlẹ- 
ments of {x |u « v, f(x,) — 1], f(x,) « 6; other- 
wise f(x,) 2 1. Then U 2 (xeX|f(x)- 1] isa 
basis of X. 


C. Zorn's Lemma 


An ordered set X is called an inductively 
ordered set if every *totally ordered subset of X 
has an tupper bound. A condition C for sets is 
called a condition of finite character if a set X 
satisfies C if and only if every finite subset of 
X satisfies C. A condition C for functions is 
called a condition of finite character if C is a 
condition of finite character for the graph of 
the function. Zorn's lemma [4] can be stated 
in any one of the following ways, which are all 
equivalent to the axiom of choice. It is often 
more convenient to use than the axiom of 
choice or the well-ordering theorem. 

(1) Every inductively ordered set has at least 
one maximal element. 

(2) If every well-ordered subset of an ordered 
set M has an upper bound, then there is at 
least one maximal element in M, 

(3) Every ordered set A4 has a well-ordered 
subset W such that every upper bound of M 
belongs to W. 

(4) For a condition C of finite character for 
sets, every set X has a maximal (for the rela- 
tion of the inclusion) subset of X that satis- 
Des C. 

(5) Let C be a condition of finite character for 
functions from X to Y, Ther, in the set of 
functions that satisfy C, there is a function 
whose domain is maximal (for the relation of 
the inclusion). 

Using Zorn's lemma (1), we can prove again 
in the following way any linear space X over a 
field k has a basis. Let 9[ be the set of all non- 
empty subsets A of X such that arbitrary finite 
subsets of A are linearly independent over k. 9f 
is not empty. If we order d by the relation of 
inclusion, then 9[ is an inductively ordered set. 
By Zorn's lemma ( 1), there is a maximal ele- 
ment U of 9f. Since U is maximal, U is a basis 
of X. 

The same theorem is proved as follows 
using Zorn’s lemma (4). Condition C for the 
subset A of X, that arbitrary finite subsets of A 
are linearly independent over k, is a condition 
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of finite character. Hence there is a maximal 
subset U that satisties C and is a basis of X. 

Concerning recent developments regarding 
the axiomatic basis for the axiom of choice 
-> 33 Axiomatic Set Theory D. 
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A. History 


A mathematical theory is based on a specific 
system of axioms, 1.€., a system of hypotheses 
from which the whole theory is deduced with- 
out reliance on other assumptions. 

One of the first deductive methods of math- 
ematical reasoning was utilized by Thales, who 
returned to Greece from Egypt with the knowl- 
edge of surveying methods, and who deduced 
additional results from that empirical knowl- 
edge. His method gave impetus to the devel- 
opment of Greek geometry, which flowered 
with the Pythagorean school and research by 
members of Plato's Academy. In the course of 
this development, the deductive method led to 
the idea of constructing the whole theory upon 
a system of "absolutely obvious" statements 
from which the whole theory could be de- 
duced. Euclid systematized Greek geometry in 
his Elements utilizing this idea. His work 
became the basis of geometry after the Re- 
naissance, and Greek geometry came to be 
called *Euclidean geometry. In the Elements 
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Euclid called the basic obvious statements 
common notions when they were of general 
nature, and postulates when they were specifi- 
cally geometric. Both were later called axioms 
(or ` postulates). 

Among the axioms stated by Euclid, the 
"lifth postulate” concerning parallels was 
longer and more complicated than the other 
axioms. Many efforts were made to deduce 
this particular axiom from the other axioms. 
The failure of these attempts suggested the 
possibility of establishing a tnon-Euclidean 
geometry, which was actually done by N. 1. 
Lobachevskii and J. Bolyai, who replaced the 
fifth postulate by its negation and showed that 
the new system of “axioms” was as valid as the 
classical one. This development naturally led 
to a new evaluation of the idea of axioms, and 
eventually the traditional concept of recogniz- 
ing the axioms as obvious truths was replaced 
by the understanding that they are hypotheses 
for a theory. D. Hilbert [ 1] established the 
latter idea as axiomatization and claimed that 
the whole science of mathematics should be 
built upon a system of axioms. His idea be- 
came the foundation of present-day mathe- 
matics. Hilbert reorganized classical geometry 
based upon his idea and published his result in 
Foundations of Geometry (1899). 


B. Systems of Axioms 


The system of axioms of a theory, i.e., the 
system of basic hypotheses from which we 
hope to deduce the whole theory, is written in 
undefined terms (or in terms of undefined con- 
cepts) by means of which all other terms are 
defined. On the other hand, a given theory is 
axiomatized by specifying such a system of 
axioms upon which the theory may be re- 
organized. It should also be noted that a sys- 
tem of axioms determines a ‘structure (— 409 
Structures). 

A system of axioms is considered to be 
mathematically valid if and only if it is consist- 
ent. It is also desirable that the axioms in such 
a system be mutually independent De. the 
negation of any one of the axioms is still con- 
sistent with the others). When such a system is 
not independent, it can be simplified by delet- 
ing redundant axioms from it. 

When any two models of a system of axioms 
are isomorphic to each other, we call the sys- 
tem complete or categorical (— 409 Struc- 
tures). For example, the system of axioms (I)- 
(V) postulated by Hilbert as the foundation 
for Euclidean geometry is complete (— 155 
Foundations of Geometry), whereas the sys- 
tems of axioms for the theories of groups, 
‘rings, or tfields are not complete since there 
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are nonisomorphic groups, etc. Although it is 
desirable that the systems of axioms postu- 
lated for a given theory (e.g., the ‘theory of real 
numbers, or Euclidean geometry) be complete, 
the study of partial systems that may not be 
complete is also important (— 156 Founda- 
tions of Mathematics). 
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36 (XIl.1 7) 
Banach Algebras 


A. Definition [1—4] 


A real or complex *Banach space R is called a 
Banach algebra (or normed ring) if a multiplica- 
tion law between elements of R is introduced 
that makes R an 'associative algebra and if 
vull & |x|] || yl is satisfied. The complex case 
is generally easier to handle, and a real Banach 
algebra can always be embedded in a complex 
one isomorphically and isometrically. When 
a Banach algebra contains a unity element e 
with respect to the multiplication, i.e., it is 
unital, we can suppose |e||- 1. If it is not yni- 
tal, we can adjoin a unity element to it. 


B. Examples 


1. Let Call) be the set of complex- (real-) 
valued continuous functions x{é) on a locally 
compact space Jt that vanish at intinity. 
Endowed with the pointwise operations 

and the *supremum norm (- 168 Function 
Spaces), it forms a complex (real) Banach 
algebra. In the complex case, it has the invo- 
lution x*(£)— x(£). 

2. Let X be a Banach space. The set Z(X) 
of tbounded linear operators on X forms a 
Banach algebra if we define addition, multi- 
plication by scalars, multiplication between 
elements, and the norm in the usual fashion. 

3. Let G be a locally compact Hausdorff 
group (- 423 Topological Groups) and ji be 
its tleft-invariant Haar measure. The Banach 
space L,(G) (with respect to u) can be made 
into a Banach algebra by defining xy(g) = 
[x(f)y(h ! g)du(h) (- Section L). 

4. Every subalgebra of a Banach algebra 
R, which is closed in the norm topology, is a 
Banach algebra with respect to the original 
algebraic operations and norm and is called a 
closed subalgebra of R. 

Throughout this article, all Banach algebras 
are supposed to be complex Banach algebras 
unless otherwise specified. 


C. Spectrum of an Element 


We define a new operation x: y in R by setting 
X'y=X+y—xy. Ifx-y=y-x=0, y is called a 
quasi-inverse of x. When a unity element e 
exists, y is a quasi-inverse of x if and only if 
e-y is an inverse of e x. For a complex 
number A such that |Z| > |x||, we see that 
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A 'x possesses a quasi-inverse y given by the 
strongly convergent series y= — 0, 4 "x". 
The set of all complex numbers 4 such that 

À !x does not have a quasi-inverse is called 
the spectrum of x and is denoted by Sp(x). 
(When x itself does not have an inverse, in 
particular, when R does not contain a unity 
element, we include 0 in this set.) Sp(x) is a 
bounded closed set in the complex plane, and 
sup{ |å] 4eSp(x)) =lim,, ,,, ("li In Exam- 
ple 2, this spectrum is the spectrum of the 
operator x, and the problem of determining 
the behavior of this spectrum constitutes one 
of the central problems in the theories of 
Banach and Hilbert spaces. 


D. Representations of a General Banach 
Algebra [ 1,2] 


We understand by a representation of a 
Banach algebra R on a Banach space X any 
algebraic homomorphism z of R into the alge- 
bra B(X) (see Example 2) satisfying Total < 
||x|| for all xe R. We call X the representation 
space. A Banach algebra always possesses 

an isomorphic and isometric representation. 
Especially important are the irreducible rep- 
resentations. A vector subspace (closed or not) 
Y of X is an invariant subspace if z(x) Y c Y 
for any XE R. A representation is called alge- 
braically irreducible if the invariant subspaces 
are trivial, Le, they are only {0} or X. A repre- 
sentation is said to be topologically irreduc- 
ible if closed invariant subspaces are trivial. 
The *kernel of an algebraically irreducible 
representation is called a primitive ideal, which 
can alternatively be detined in the following 
way: A left ideal J ( # {0}, R) is regular, by 
definition, if R contains an element u, a unity 
element modulo J, such that x — xu € J for any 
XE R. Such an ideal is always contained in a 
maximal left ideal, which in turn is necessarily 
regular and closed. A two-sided ideal I is 
primitive if it is the set of elements a in R for 
which aR c J, where J is some fixed regular 
maximal left ideal. If R is commutative, a 
primitive ideal is a regular maximal ideal, and 
conversely. The intersection of all primitive 
ideals is called the radical of R, and when it 

is (0), R is called semisimple. 

The set of primitive ideals 3 is known as the 
structure space of R, in which the hull-kernel 
topology (or Jacobson topology) is introduced. 
The closure of a set 9[ in J is, under this 
topology, the set of primitive ideals containing 
the intersection of the ideals in 9L, This topol- 
ogy is rather intractable; even in commutative 
cases, it does not coincide with the Gel’fand 
topology (— Section E), in general. 
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E. The Gel'fand Representation of a 
Commutative Banach Algebra 


A complex Banach algebra is a field if and 
only if it coincides with the field C of complex 
numbers (Gel’fand-Mazur theorem). This is the 
most fundamental fact in the study of com- 
mutative Banach algebras. Now let R be a 
commutative Banach algebra. Then every 
regular maximal ideal J of R is closed and the 
quotient algebra R/J is isomorphic to C, so 
that the quotient mapping R-R/J is viewed 
as a complex-valued homomorphism of R. 
Conversely, if 9 is a nonzero complex-valued 
homomorphism of R, it has norm <1 as a 
linear functional on R and its kernel, say Rg, is 
a regular maximal ideal of R such that @ is 
exactly the quotient mapping R > R/R,. The 
set %II(R) of nonzero complex-valued homo- 
morphisms (or regular maximal ideals) of R 
endowed with the tweak* topology of the 
Banach space tdual R' of R is called the maxi- 
mal ideal space of R. WU HI" a locally com- 
pact Hausdorff space, and its topology is 
called the Gel’fand topology. For each xE R we 
define a function X on 9(R) by setting Viol = 
(x). Then the mapping xx is a homo- 
morphism of R into the algebra Co(SR(R)) of 
all continuous complex-valued functions on 
M(R) vanishing at infinity. This is the Gel’fand 
representation of R whose image, the Gel’fand 
transform of R, is denoted by R. Concerning 
this, we have (1) 9R(R) is compact if R has a 
unity element; (2) Sp(x) equals the closure of 
the range ZOU RI of X; (3) the representation 
x is norm-decreasing if Co(St(R)) is en- 
dowed with the tsupremum norm; (4) ||la 
(=sup{|x(¢)|| peWUR)}) equals lim, ... |x"] *. 
The tkernel of the Gel’fand representation of 
R is the radical of R, which consists of gen- 
eralized nilpotent elements of R. 


F. Banach Star Algebras 


An involution in a Banach algebra R is an 
operation x-»x* that satisfies (1) (x + y)* =x* 
mp (2) (Ax)* = Ax*; (3) (xy)*= y*x*; (4) (x*)* 
=x, A Banach algebra with an involution is 
called a Banach »-algebra. A x-homomorphism 
(p between two Banach -algebras is an alge- 
braic homomorphism which preserves invo- 
lutions, i.e., D(x*) = 6(x)*. To represent a 
Banach +-algebra we prefer a *-representation, 
i.e., a representation X- T, on a Hilbert space 
such that T,, is equal to the adjoint T;* of T, 
for any xe R. 


G. C*-Algebras [5-10] 


A Banach »-algebra A satisfying |x*x|| = la? 
for al] x € A is called a C*-algebra. Every C*- 
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ilgebra is «-isomorphic and -isometric to a 
Janach algebra of operators on a Hilbert 
pace (see Example 2) that contains, along 
with an operator, its adjoint (the Gel’fand- 
Vaimark theorem) A C*-algebra is semi- 
imple, and a commutative C*-algebra A is 
«isomorphic to Cell A under the Gel fand 
'epresentation. A topologically irreducible +- 
'epresentation of a C*-algebra is also algebrai- 
:ally irreducible (R. V. Kadison), and the set of 
‘unitary equivalence classes of these irreduc- 
ble «-representations is called the dual space. 
(t becomes a topological space if we introduce 
he hull-kernel topology inherited from the 
structure space, but other topologies are also 
introduced. Moreover, for the study of separ- 
able C*-algebras, Borel structure in the sense 
of G. W. Mackey is a very powerful tool. C*- 
algebras also have intimate connections with 
the theory of *unitary representations of a 
topological group (see below) and with quan- 
tum physics. Many works have been published 
on *-representations, dual spaces, etc. 

A linear functional @ on a C*-algebra A is 
said to be positive if «@(x*x) > 0 for any x € A. 
For any nonzero positive linear functional () 
of A, there exist a *-representation (z,,, H) of 
A on a Hilbert space If, and a vector č, E€ He 
such that ca) — (n, (a), Čo) for all ae A and 
that the subspace {7,,(a)é,,|a€ A} is dense in 
H We call (za, H) a cyclic representation of 
A induced by o. A *-representation (z, H) of A 
is called universal if for any *-representation 
(p, K) of A there exists a !g-weakly continuous 
*-homomorphism j of n(A)” onto p(A)" such 
that p(a) - (f ox)(a) for all ae A, where x(A)" 
is the double tcommutant of n(A) and so is 
a tyon Neumann algebra (— 308 Operator 
Algebras). Von Neumann algebras m(A)' for 
universal representations 1 of A are mutually 
*-isomorphic, so that they determine the envel- 
oping von Neumann algebra of A. Especially, 
the direct sum z of all cyclic representations 
x, is a universal representation and there is a 
unique isometric isomorphism of n(A)" onto 
the second dual A** of A that is bicontinu- 
ous in the o-weak topology of n(A)" and the 
c(A**, A*)-topology of A**. So A** is identi- 
fied with the enveloping von Neumann algebra 
of A. 


H. Some Classes of C*-Algebras 


Let A be a C*-algebra. If is a CCR (liminal 
C*-) algebra if it is mapped to the algebra of 
compact operators under any irreducible x- 
representation. It is a GCR (postliminal C*-) 
algehra if every nonzero quotient C*-algebra 
of A has a nonzero CCR closed two-sided 
ideal. These classes of C*-algebras have inter- 
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esting properties. For example, the dual space 
of a separable C*-algebra is a * T, space (tT, 
space) if and only if the algebra is GCR (CCR) 
(J. Glimm). 

A is called an AF (approximately finite) 
algebra if it is the uniform closure of an in- 
creasing sequence of finite-dimensional C*- 
algebras (0. Bratteli). AF algebras are classi- 
fied by so-called dimension groups. TO see this, 
we call an ordered Abelian group G (written 
additively) a Riesz group if (a) for any integer 
n>Q and any geG, ng 20 implies g>0 and 
(b) for any g;, h; (1 <i<m,1<j<n) in G with 
gi <h; for alli and j, there exists k € G with 
gi Sk <h; for all i, j. Then the isomorphism 
classes of AF algebras correspond bijectively 
to the isomorphism classes, as local semi- 
groups, of generating upward-directed heredi- 
tary subsets of the positive cones of countable 
Riesz groups, and a dimension group, defined 
otherwise, is exactly a countable Riesz group 
(G. Elliot, E. G. Effros, D. E. Handelman and 
C. L. Shen) (— M). 

Now consider two C*-algebras A and B. 
The algebraic ttensor product A ® B over C 
becomes a *-algebra in the natural way. A 
norm Bon A (9 B is a C*-crossnorm if |xy||; X 
(ele lylls and ||x*x\l p= Lais for x, YEA P 
and [a & bl; « la| |b|| for ae A, be B. The 
completion of A Q) B under such a norm f is 
a C*-algebra, which is denoted by A (9, B. 
There are two special C*-crossnorms on A Gi 
i?: ||! ll and |l tnax The former is called the 
spatial (minimal, injective) C*-crossnorm and 
is defined by |/x\|min=SUPp, y I(E G WO, 
where o and y run over all «-representations 
of A and B, respectively. The algebra A ® min B 
is called the spatial tensor product of A and B. 
The latter is called the greatest C*-crossnorm 
and is defined by LN nn = supr || T(x)|], where 
T runs over all »-representations of A (9) B. 
And the algebra A Gi. B is called the projec- 
tive C*-tensor product of A and B. Any C*- 
crossnorm f on A ® B lies in between these 
two norms, ie, Je ais Ilg < [x |a, for 
x€A © B. A C*-algebra A is called nuclear if 
A ® B has a unique C*-crossnorm for any C*- 
algebra B. It is known that any GCR algebra 
is nuclear and that an inductive limit of nu- 
clear C*-algebras (e.g., any AF algebra) is 
nuclear. Given a linear mapping 9 between 
C*-algebras A and B, we define for any inte- 
ger n 2 1 a linear mapping 9,:A ® M, B (9 
M, by setting ~,(x © ey) = g(x) Q ej, where 
{e;} are the matrix units for the C*-algebra 
M, of nx n complex matrices. @ is said to be 
completely positive if «, is positive for any n > 
1. A C*-algebra A is then called injective if, 
for any C*-algebras B, C with BS C and any 
completely positive contraction o: B— A, there 
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exists a completely positive contraction @: 
C+A that extends o. A C*-algebra A is 
nuclear if and only if its enveloping von Neu- 
mann algebra A** is injective. Every injective 
C*-algebra B is not necessarily a von Neu- 
mann algebra but every AW*-algebra, i.e., 
every maximal commutative -subalgebra M 
of B, is monotone complete (i.e., every increas- 
ing net of self-adjoint elements has a least 
upper bound in M) and the lattice of projec- 
tions in B is conditionally complete. 


I. Crossed Product 


Let A be a C*-algebra, G a locally compact 
Hausdorff group with left-invariant mea- 
sure u, and x a continuous action of G on A 
as x-automorphisms «,, det. Let K(G, A) 

be the linear space of continuous functions 
from G to A with compact support, which is 

a x-algebra under the multiplication y * z 

and the involution y* given by (y x z)(g) = 

Le y(h)a,(z(h*g)) du(h) and y*(g)  A(g Tt 
ag(y(g !)*) for y, ze K(G, A), where A is 

the tmodular function of G. By completing 
K(G, A) under the norm || y], 2f; | rll du(g), 
we get a Banach +-algebra L,(G, A). L,(G, A) is 
then made into a C*-algebra by furnishing the 
norm ||x|!=sup, ||7(x)|], where z runs over all 
*-representations of L,(G, A). This C*-algebra, 
denoted by A ®, G, is the crossed product of 

A by G relative to the action a. Crossed prod- 
ucts are useful in the structure theory of C*- 
algebras. 


J. Extensions by C*-Algebras (BDF Tbeory) 
[12] 


Let H be a separable infinite-dimensional 
Hilbert space and € (H) the ideal of Z(H) 
consisting of all compact operators. The quo- 
tient C*-algebra BA) LEIH) is called the 
Calkin algebra. We denote it by 2(H) and 

the quotient mapping by 1: 3B(H)> A H). By 
an extension of Z(H) by a separable unital 
C*-algebra A we mean a unital (preserving 
unity elements) «-isomorphism t of A into 
AH), We call two extensions t4, t, :A — 2(H) 
equivalent if, for some unitary operator t€ 
BH), n(u)t, (a)jn(u)* =t, (a) for any ae A, and 
denote by Ext(A) the set of all equivalence 
classes [t] of extensions t by A. Ext(A) forms 
a commutative semigroup with respect to the 
addition [7] = Te, ] - [15], where (a) 2 x, (a) ® 
v,(a)e AH) © 2(H)c AH © H). Call an ex- 
tension T; A 2(H) trivial if there is a unital 
*-isomorphism 6 of A into Z(H) with t = zo. 
Then (1) all trivial extensions are equivalent, 


and we define the unity element of the semi- 
group Ext(A) (D. Voiculescu); (2) Ext(A) is a 
group if A is commutative (L. G. Brown, R. G. 
Douglas, and P. A. Fillmore) or, more gener- 
ally, if A is nuclear (M. D. Choi and E. G. 
Effros); (3) Ext(A) is not always a group (J. 
Anderson). For further information — 390 
Spectral Analysis of Operators J. 


K. Derivations in C*-Algebras 


A linear operator 6 in a C*-algebra A is a 
derivation in A if its domain D(6) is a dense 
subalgebra of A and ó(xy) = ó(x)y + xó(y) for x, 
ye D(6). dis a *-derivation if, moreover, x € D(6) 
implies x* € D(S) and ó(x*) = ó(x)*. 6 is called 
bounded (unbounded, closed) if it is bounded 
(unbounded, closed) as a linear operator. 

Every bounded derivation A is expressed 
as (x) — dx — xd with some d in the envelop- 
ing von Neumann algebra A** of A(R. V. 
Kadison and S. Sakai). The element d can be 
taken from the multiplier algebra M(A) = 
{be A**|bA + Abe Aj of A if A is simple 
(Sakai) or if A has continuous trace (C. A. 
Akemann, Elliott, G. K. Pedersen, and J. 
Tomiyama). If A is separable, we see that all 
bounded derivations in A are given by ele- 
ments in M(A) if and only if A is the C*-direct 
sum of a family of simple C*-algebras and a 
C*-algebra with continuous trace (Elliott, 
Akemann, and Pedersen) [13] and that every 
bounded derivation in the quotient C*-algebra 
A/I, I being any closed two-sided ideal, can be 
lifted to a derivation in A (Pedersen). 

Next we consider unbounded derivation. By 
a *-automorphism group on A we mean a tone- 
parameter group p, t € R, of «-automorphisms 
of A such that, for each xE A, p,x is contin- 
uous in f € R. A C*-dynamical system is a 
pair consisting of a C*-algebra A and a x- 
automorphism group p, on A. The fact that the 
time evolution of a physical system is often 
represented by such a dynamical system has 
made the study of unbounded derivations 
quite active. We have to see if a given deriva- 
tion ó is tclosable and if, in case of a closable 
6, the closure ô generates a «-automorphism 
group. Sample results: (1) If a *-derivation 6 
is well-behaved in the sense that for every self- 
adjoint x in D(6) there exists a !state ọ with 
Ie(x)| = Il and @(d(x))=0, then 6 is closable 
and its closure is well-behaved (A. Kishimoto 
and Sakai); (2) if a *-derivation ó is closable, 
its closure § generates a «-automorphism 
group if and only if à is well-behaved and 
(1 +6)D(6) is dense in A (R. T. Powers and 
S. Sakai; 0. Bratteli and D. W. Robinson). 
For further results — [ 14]. 


36L 
Banach Algebras 


L. Applications to the Theory of Topological 
Groups [2,3,6,15] 


Banach algebras have many applications in 
different branches of mathematics and quan- 
tum physics. Here we mention some that con- 
cern topological groups. Let G be a locally 
compact Hausdorff group and p be its left- 
invariant measure. We make the complex 
algebra L,(G) of Example 3 into a Banach 
*-algebra by detining x*(g)=x(g !)A(g !), 
where A is the *tmodular function of G. This is 
called the L,-algebra (or group algebra). It is 
not C* but is semisimple. Considering a uni- 
tary representation of G is equivalent to con- 
sidering a *-representation of the L,-algebra. 
Replace the norm of xe L,(G) by sup|z(x)l|, 
where the supremum is taken over all the +- 
representations. The new norm satisties the C* 
condition, and the completion of L,(G) with 
respect to this norm is a C*-algebra, which we 
call the C*-group algebra of G. The dual of the 
C*-group algebra thus detined is called the 
dual of the group G, and this notion plays an 
important role in the study of topological 
groups. Unitary representations of a group G, 
*-representations of the L,-algebra of G, and 
*-representations of the C*-group algebra of G 
are all characterized by positive definite func- 
tions on G. A function p(g) is positive definite, 
by definition, if it is measurable on G and 





[po h)x(g)x(h) du(g)du(h) 20 


for any continuous function x(g) with compact 
support. 

The Abelian case. When G is an Abelian 
group (- 422 Topological Abelian Groups), a 
regular maximal ideal M of the L,-algebra R 
of G and a character y of G are in a one-to-one 
correspondence by the relation 


(the left-hand side is the value of x at M under 
the Gel’fand representation). Moreover, the set 
of regular maximal ideals of R provided with 
the Gel" Gand topology and the set G of charac- 
ters of G provided with the Pontryagin topol- 
ogy (the *character group of G) are homeo- 
morphic by this correspondence. Therefore the 
Gel'fand transform of an element x of the L,- 
algebra R is seen to be a function X(y) on G 
defined by the integral on the right-hand side 
in the above expression, which is properly 
called the Fourier transform of x. Of course, 
the Fourier transform can be detined for other 
classes of functions (e.g., for the L,-space over 
G. the Fourier transform in the sense of Plan- 
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cherel) and classical theories of Fourier 
series, Fourier integrals, and harmonic analysis 
(- 192 Harmonic Analysis) are studied from a 
more extensive point of view. Thus the state- 
ment that the Fourier transform of an element 
x of the L,-algebra of G is a continuous func- 
tion vanishing at inlinity (— Section E), for 
example, is a version of the classical fRiemann- 
Lebesgue theorem. Bochner's tbeorem in 
classical *Fourier analysis is restated thus: A 
continuous fpositive definite function on an 
Abelian group can be put in the form 


p(g)7 | Y(g) dp(v), 


where p is a uniquely determined bounded 
positive *Radon measure on the character 
group G. Developing these theories further, we 
obtain an alternative proof of the *Pontryagin 
duality theorem (H. Cartan and R. Godement). 
A closed ideal I in the L, -algebra R deter- 
mines a set Z(I) in G as the set of common 
zeros of the Fourier transforms of elements of 
I. We ask whether, conversely, J is character- 
ized by Z(1). This question is the problem of 
spectral syntbesis, and many important results 
have been obtained. The statement that J must 
coincide with R when Z(1) is empty is a for- 
mulation of the generalized Tauberian tbeorem 
of N. Wiener. A considerable simplification 

of the proof was accomplished by using the 
theory of Banach algebras, which was the first 
application of the theory (T. M. Gel fand). 


M. Holomorpbic Functional Calculus 
[16,171 


Let R be a unital commutative Banach algebra 
with maximal ideal space M = 9R(R). We de- 
fine the joint spectrum Sp(X) of a finite n-tuple 
X= xis Xy in R by (0639), Zell 

Q9 €90l], a nonempty compact subset of C". 
Then the bolomorpbic functional calculus says 
the following: For each X = (x,,.. xX.J c R 
there exists a unique algebra homomorphism 
®, of the algebra H(Sp(X)) of tgerms of holo- 
morphic functions on Sp(X) into R with the 
following properties: (1) o, (1) = 1, the unity 
element of R, and y(z) = x, 1 Si <n; (2) if 
n«m, X'={x,,....x} and F(z,,.... z,)— 
F(z,,.... z,) with Fe H(Sp(X)) then 6, (F)— 
®,(F); (3) if { F, k = 1,2, } are holomorphic 
in a fixed neighborhood U of Sp(X) and F,— 
F uniformly on U, then 0,(F,)>®,(F) (L. 
Waelbroeck). From this follows the implicit 
function tbeorem: Let Xis X ER, fEeC(M, 
and let F(w,z,,.. , Z„) be holomorphic in a 
neighborhood of the set Sp( f, X1,- - , Xp) = 
(Go): (@),, %,(@))| EM. Suppose that 
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F(f. X4, ..., X,) 20 but 0F/3w #0 on Sp( f, x,, 
.++)X,). Then there exists a unique ye R such 
that =f and F(y,x,,.,x,)=0(R. Arens and 
A. Calderon). As an application we obtain 
Shilov’s idempotent tbeorem: If K is a closed- 
open subset of W, there exists x e R with X= 

1 on K and £20 on 9t — K. Set Q(R) 2 (xe 
R|e?"* 2 1). Then Q(R) is an additive sub- 
group of R and Shilov’s theorem says that 

the Gel’fand representation gives an jsomor- 
phism of Q(R) onto Q(C(W)) (= H? (WR, Z)). 
Another theorem of this sort is the Arens- 
Royden tbeorem: R ~'/exp(R) z C(9X) '/ 
exp(C(9)) X H! (M, Z), where R ^! is the set 
of invertible elements of R and exp(R)= {e*| 
XE R}. Further extensions related to K-theory 
have been given by J. L. Taylor and others. 
(For functional calculus for one variable > 
25 1 Linear Operators) 
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A. General Remarks 


The notion of Banach space was introduced in 
1922 by S. Banach and N. Wiener, working 
independently. The idea was to apply topo- 
logical and algebraic methods to fundamental 
problems of analysis, such as mapping prob- 
lems in infinite-dimensional function spaces 

(- 168 Function Spaces, 197 Hilbert Spaces, 
25 1 Linear Operators). 


B. Definition of Banach Spaces 


We associate to each element x of a t]inear 
space X over the real (or complex) number 
field a real number ||x|| satisfying the following 
conditions: (i) |x|| 20 for all x, and |x|| =0 is 
equivalent to x =Q; (ii) ||ax|| ^ |a] : ||x|| for any 
real (complex) number 4; (iii) x + y|| < |x|| + 

|| yl]. Then |[x|| is called the norm of the vector 
x, and X is called a normed linear space. The 
norm is thus an extension of the notion of 

the length of a vector in a *Euclidean space. A 
normed linear space X is a tmetric space under 
the distance p(x, y) UX — HI. We write s- 

Jm, x, 2 x or simply x,x when lim, ||X, 
—x||= 0 and say that x, converges strongly to 
x. If this metric space X is !'complete, then X 
is called a Banach space. In normed linear 
spaces the addition and the multiplication 

by scalars are continuous. A closed linear 
subspace M of a Banach space X is again a 
Banach space, and the tquotient space KIM 
becomes a Banach space if the norm of a tcoset 
is defined by ||x + M | z inf, ||x +mlj. The 
subset (x |x|| € 1} of a normed linear space 

X 1s called the (closed) unit ball (or unit sphere) 
of X. 

Examples. fFunction spaces C, L, (l<p< 
oo) M, W, Ho, tsequence spaces c, |, m and 
BV are all Banach spaces (- 168 Function 
Spaces). 


noo 
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C. Linear Operators and Linear Functionals 


Suppose that a linear subspace D(T) of a 
linear space X is the (definition) domain of 

a mapping T with values in a linear space 

X, T is called a linear operator if T(xx + 

By) = a Tx + BTy for any scalars o, f and x, y c 
D(T). R(T) 2 (Txe X, |xeD(T)]) is called the 
range of T. In the special case where X, is 

the real or complex number lield, T is called 
a linear functional. If X and X, are both 
normed linear spaces, then T is continuous if 
and only if s-lim,, Tx, = Tx whenever S- 
Dm, x,-x. This is equivalent to the condi- 
tion SUPxep(7), [xi el | Tx|| « oo. In particular, if 
D(T) =X, the linear operator T is continuous 
if and only if the set {Tx |x| <1} is bounded. 
In this case, T is called a bounded linear oper- 
ator, and (TI = sup, «| Tx ||is called the 
norm ofthe operator T. In particular, a linear 
operator T satisfying | T|| < l is called a con- 
traction. (Sometimes T is called a contraction 
only when IT || < 1. In that case, an operator 
with | T |& 1 is called a nonexpansive opera- 
tor.) The scalar multiple, sum, and product of 
linear operators are defined by (xT)x = a( Tx), 
(T +S)x= Tx +Sx, and(ST)x = S( Tx), respec- 
tively. The identity operator J in X is defined 
by I: x =x for all x EX. If the inverse mapping 
T ! of xo Tx exists, then it is called the in- 
verse operator of T. A normed linear space X 
is said to be isomorphic to a normed linear 
space Y if there exists a bounded linear opera- 
tor T from X onto Y with bounded inverse. 

If T can be chosen to be isometric fe | Tx| = 
||x|| for all xe X), then X is said to be iso- 
metrically isomorphic to Y. 


D. The Dual Space and the Dual Operator 


The totality of continuous linear functionals 
f defined on a normed linear space X is a 
Banach space X’ under the previously de- 
fined linear operations and the norm | f|| = 
SUP)x) <1 | f(x). This X" is called the dual (or 
conjugate) space of X. In view of useful prop- 
erties of the *inner product in "Hilbert spaces, 
it is sometimes convenient to write (x, f > for 
f(x). Let X and Y be normed linear spaces, 
and let T be a linear operator with the dense 
domain D(T) inXandtherange R(T) in Y. 
If ( f, g) is a pair with fe Y' and ge X'satisfy- 
ing the equation (Tx fF) = (x, g) for any X 
D(T), then g is determined uniquely by f The 
operator T' defined by T'f— gis linear and is 
called the dual operator (or conjugate or adjoint 
operator) of T. This is an extension of the 
notion of the *transpose of a matrix in matrix 
theory. If T is a bounded linear operator then 
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T' is also a bounded linear operator such that 


ITI = ITI. 


E. The Weak Topology and the Strong 
Topology 


Let X be a normed linear space and X" its 
dual space. Take a finite number of elements 
Xi X2, X, from X’, and consider the subset 
of X:(xeX |supi is IG xi) <e}, & 0. If 
we take the totality of such subsets of X as a 
tfundamental system of neighborhoods of 0 
of X, then X is a ‘locally convex topological 
linear space, denoted sometimes by X,,. This 
topology is called the weak topology of X. If 
a sequence [x,) C X converges to x € X with 
respect to the weak topology of X, then it is 
said to converge weakly. This is equivalent to 
the convergence (x,, f 52 (x,f) for any f e 
X’. The original topology of X determined by 
the norm is then called the strong topology 
of X, and to stress the strong topology we 
sometimes write X, in place of the original 
X. Take a finite number of elements x,, X3, 
«X, from X, and consider the subset of 
X':(x eX'|sup, il (x x »| &e), e» 0. If 
we take the totality of such subsets of X" asa 
fundamental system of neighborhoods of O of 
X’, then X’ is a "locally convex topological 
linear space. We write this space as X, and 
call the topology the weak* topology of X’. 
The topology of X’ detined by the norm | f || is 
called the strong topology of X’, and to stress 
the strong topology we write X;. The terms 
“weak” and “weakly” are used in reference to 
the weak topology, for instance, weak closed- 
ness and weakly closed. Similar conventions 
are used for the weak* and strong topologies. 
The unit ball of the dual space of a normed 
linear space X is weak*-compact (Banach- 
Alaoglu theorem). Then by the tKrein-Milman 
theorem (— 424 Topological Linear Spaces) 
the unit ball of X’ is the weak* closure of the 
fconvex hull of its textreme points. If X is a 
Banach space, a convex subset K of X" is 
weak*-closed if and only if the intersection of 
K and each weak*-compact subset is weak*- 
closed ` (Krein-Shmul'yan ` theorem). 


F. The Hahn-Banach Extension Theorem 


Let M be a linear subspace of a real linear 
space X and p(x) a real-valued functional 
detined on X such that p(x+ y) € p(x) - p(y} 
and p(Ax)=Ap(x) for all x. ye X and 4 z 0. If 
a linear functional f, defined on M satisfies 
fi (X) <p(x) for all x € M, then there is a linear 
functional f on X which extends f, and satis- 
fies f(x) x p(x) for all xe X (the Hahn-Banach 
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(extension) theorem). The Hahn-Banach exten- 
sion theorem has many applications. (1) Let M 
be a linear subspace of a normed linear space 
X. Then for any f, € M', we can construct an 
f € X' such that f(x)— f, (x) for all xe M and 
lf || 9 Wf, ll. Q) For any xo #0 of a normed 
linear space X, we can construct an hE X 

such that fo(xo) — xol and Lis 1. (3) For 
any closed linear subspace M of a normed 
linear space X and a point x, é M we can 
construct fo¢X’ such that Il =, fo(xo) = 
inf,.4 Ire —m|| and fo(x)=0 for all xe M. A 
proposition more general than (3) is Mazur's 
theorem, which is useful in applications: (4) Let 
a closed subset M of a normed linear space X 
be convex. Then for any zo € M wecan con- 
struct an Joe X' such that f(x) > fo(x) for all 
XE M. By (4) we can prove, e.g., that a convex 
set of a normed linear space 1§ weakly closed 
if it is strongly closed. This proposition has 
the following corollaries: (5) A convex set A 
containing 0 is closed if and only if (49)? = 

A, where A? is the tpolar of A (the bipolar 
theorem). (6) For two closed convex sets A and 
B, containing 0, the polar of A N B coincides 
with the weak* closure of the convex hull of 
A? U B^. For complex linear spaces, most of 
the propositions in this section are valid with 
Re f(x) instead of f(x). 

The Hahn-Banach theorem can also be em- 
ployed to prove the existence of the general- 
ized limit (or Banach limit) Lim, ,,, C, defined 
for all bounded real sequences (C,) such that 
lim inf, o C, € Lim, ,, C, « lim sup, ,,, C, and 


noo 


Lim, ,, (a£, + By,,) = a Lim, ,,, 6, + f Lim, ,, Ha 


G. Duality in Normed Linear Spaces 


An element Xo of a normed linear space X 
gives rise to an element x, of (X) determined 
by (xg, x > = (x', xo» for all x' e X;. If we write 
Xo =J Xos then J is a linear operator satisfying 
|Jxgl| = |xg|| by (2), and so the space X is 
isometrically isomorphic to a linear subspace 
of (X); If X, coincides with (X;), under this 
isomorphism, we call X a reflexive (or regular) 
Banach space. A necessary and sufficient con- 
dition for the normed linear space X to be 
reflexive is that the unit ball of X be weakly 
compact. A convenient criterion for the reflex- 
ivity of X is that any bounded sequence {x,} of 
X contains a subsequence weakly convergent 
to a point of X (Eherlein-Shmul’yan theorem). 
In this connection, a Banach space X is reflex- 
ive if and only if each X' E X’ attains its norm, 
i.e, there is an xo € X such that ||xq || = 1 and 
|xo, x ^| zx || Bames's theorem). 

A normed linear space is said to be uni- 
formly convex if for any ¿> 0 there exists a 6 > 
0 such that |x] <1, | y & 1 and [x —y|| >e 


165 


implies | x + y| <2 = 6, A normed linear space 
is said to be uniformly smooth if for any ¢ > 0 
there is a 6>0 such that | x| 2 1 and | y| <ô 
imply |x-- yl + |x— yl &2- e| yl. A Banach 
space X is uniformly convex (resp. uniformly 
smooth) if and only if the dual X" is uniformly 
smooth (resp. uniformly convex). If a Banach 
space X is uniformly convex or smooth, it 

is isomorphic to a Banach space that is 
uniformly convex and uniformly smooth 

(P. Enflo). The space L, with 1 < p « mis 
uniformly convex and uniformly smooth. 
Any uniformly convex or uniformly smooth 
Banach space is reflexive (Milman’s theorem). 


H. The Resonance Theorem 


Let {T,} be a sequence of bounded linear 
operators from a Banach space X into a 
normed linear space Y. The uniform bounded- 
ness theorem (resonance theorem or Banach- 
Steinhaus theorem) states that sup, — | T,l < oo 
if sup,,>; || T;xl| < oo for every xe X. As a corol- 
lary, we have sup, — ||x,|| < oo for any weakly 
convergent sequence of X. Another corollary 
states that the set (xe X limsup,..,, | Tx « 
oo] either coincides with X or is a subset of X 
of the first category. This implies the so-called 
principle of condensation of singularities, which 
gives a general existence theorem for func- 
tions exhibiting various kinds of singularities, 
for example, a continuous function whose 
*Fourier expansion diverges at every point of a 
*perfect set of points having the cardinal num- 
ber of the continuum. 


I. The Closed Graph Theorem 


Let T be a bounded linear operator from a 
Banach space X into a normed linear space Y. 
If the image of the unit ball of X under T is 
dense in the unit ball of Y, then for any £ 20 
and any y, € Y the equation Tx =y, has a 
solution x with |x | X (1 + c) //y, | (Banach's 
theorem). By using the "Rare category theo- 
rem, we can derive from this the open map- 
ping theorem: Every bounded linear operator 
from a Banach space X onto a Banach space 
Y maps each open set in X onto an open set 
in Y. As an application of the open mapping 
theorem, we can prove the closed graph theo- 
rem: A linear operator T from the whole of a 
Banach space into a Banach space is Con- 
tinuous if and only if T is a closed operator, 
Le, s-lim, ,,, x, =x and s-lim,, TX, = y imply 
Tx = y. This theorem plays an important role 
in modern treatments of linear partial differen- 
tial equations. 


noo 
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J. The Closed Range Theorem 


Let X and Y be Banach spaces and Ta linear 
closed operator with domain D(T) dense in X 
and with range R(T) in Y. Under these con- 
ditions, the following four propositions are 
mutually equivalent. (1) R(T) is a closed set in 
Y. (2) R(T’) is a closed set in X’. (3) R(T) = 
{yeY|<y, y*> 20 for all y*eD(T’) such that 
T'y* =0}. (4) R(T) 2 [x* e X' | (x, x*) =0 for 
all e D( T) such that Tx = 0}. These four 
propositions, as a whole, are called the closed 
range theorem. This theorem implies (5) R(T) 
= Y if and only if T' has a continuous inverse; 
and (6) R(T’) = X’ if and only if T has a con- 
tinuous inverse. The following theorem is of 
similar nature: the following three propositions 
on two closed linear subspaces M and N of a 
Banach space are mutually equivalent. (7) M + 
N is closed. (8) M? -- N? is strongly closed. 

(9) M? + N° is weak*-closed. 

The Hahn-Banach theorem, the resonance 
theorem, the open mapping theorem, the 
closed graph theorem, and the closed range 
theorem can be extended to various classes of 
tlocally convex topological linear spaces. By 
virtue of this extension, we are able not only to 
treat various fundamental problems of analysis 
from a unified viewpoint but also to develop 
the theory of functional analysis itself in a new 
direction (— 424 Topological Linear Spaces; 
concerning linear operators on a Banach space 
— 68 Compact and Nuclear Operators, 251 
Linear Operators, 390 Spectral Analysis of 
Operators). 


K. Differential and [ntegral Calculus of 
Functions with Values in Banach Spaces 


Calculus involving functions from a set to a 
Banach space is also an effective tool in vart- 
ous problems. A function x(t) detined on an 
interval [a, b] with values in a Banach space X 
is said to be strongly (weakly) continuous if x(t) 
converges strongly (weakly) to x(to) as tty. 
For a strongly (weakly) continuous function 
x(t), the Riemann integral can be defined in a 
standard way, using strong (weak) convergence 
of the Riemann sum 


b 
| x(t)dt 2 lim x(t/)(t., — tj). 
d 
The fundamental theorem of calculus, 1.e., 
strong (weak) differentiability of the indefinite 
integral, remains true. Various defmitions of 
integrals of a Banach space-valued function 
on a fmeasure space are discussed elsewhere 
(- 443 Vector-Valued Integrals). 

Now let x(A) be defined on a domain Q in 
the complex plane with values in a complex 
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Banach space X. x(A) is said to be holomorphic 
if f(x(A)) is tholomorphic in () for every fe X’. 
If x(A) is holomorphic, then there exists an X- 
valued function y(4) on Q such that 


IC G0-x0)-y0))902a s 60. 
Therefore there is no difference between 
"strong" and “weak” in analyticity. *Cauchy’s 
integral theorem remains true for a holo- 
morphic function x(A) with values in X, and 
the "Laurent expansion 


oo 


x(4)— Y, a(4— Ag", 


n- -o 
1 "- 
sui SL — Ag) "^! dA, 
ani Jc 


is valid with the integral taken in the Rieman- 
nian sense. Banach space-valued holomorphic 
functions on complex (or real) analytic mani- 
folds of higher dimension can be defined in a 
natural way by means of power series expan- 
sion. A function @(A) defined on a domain in 
the complex plane with values in the Banach 
space B(X, Y) of bounded linear operators 
from X to Y becomes holomorphic if fi (O(A)x) 
is holomorphic for every ve X and fe Y’. An 
operator-valued holomorphic function is often 
called an analytic operator function. 


L. The Approximation Property 


A Banach space X is said to have the approxi- 
mation property if there is a family {S,} of 
bounded linear operators of !finite rank in X 
such that inf, | (S, — I) T | - 0 for all tcompact 
linear operators T in X. There is a Banach 
space that fails to have the approximation 
property (Enflo [9]). More surprisingly, the 
space of all bounded linear operators on an 
infinite-dimensional Hilbert space fails to have 
the approximation property (A. Szankowski). 
The approximation property plays a deci- 
sive role in the theory of ttensor products of 
Banach spaces. A Banach space X is said to 
have the hounded approximation property if a 
family {S,} in the definition of the approxi- 
mation property can be taken bounded, Le, 
sup, ||S,|| < oo. The bounded approximation 
property does not follow from the approxi- 
mation property. The bounded approximation 
property is closely related to the existence of a 
basis. A sequence {e,} in a Banach space X is 
called a Schauder hasis or simply a hasis (or 
hase) for X if to each XE X there corresponds 
a unique sequence of numbers (cc,] such that 
bm, x — Xt-, o6; || 50. Most separable 
Banach spaces appearing in analysis have 
bases. A separable Banach space has the 
bounded approximation property if and only 
if it is isomorphic to a complemented linear 


166 


subspace (— Section N) of a Banach space 
with a basis (A. Pełczyński). 


M. Injective and Projective Banach Spaces 


Banach spaces of the type *C(Q) with compact 
Q and of the type tl (D) on a set Q play a spe- 
cial role in the theory of Banach space. This 

is already seen in the fact that every Banach 
space is isometrically isomorphic to a subspace 
of a space C(Q,) as well as to a quotient space 
of a space L (0,). A Banach space X is said to 
be injective if for any Banach space Y and 

its linear subspace M, each bounded linear 
operator T from M to X can be extended to a 
bounded linear operator T from Y to X, that 
is, Tx = Tx for gl] XE M. A Banach space is jn- 
jective if and only if it is isomorphic to a com- 
plemented linear subspace of the space C(R) 
for a compact Hausdorff space Q with the 
property that the closure of every open set is 
open. Such a topological space is called tex- 
tremally disconnected. The *maximal ideal 
space of the tBanach algebra L,.(Q) is extrem- 
ally disconnected. Hence the Banach space 

L ,(Q) is injective, Whether every injective 
Banach space is isomorphic to a space C(0) is 
still an open problem. However, a Banach 
space is isometrically isomorphic to a space 
C(Q) with Q extremally disconnected if and 
only if it is injective with the property of norm- 
preserving extension, Le, || TI -|T| is always 
possible — (Nachbin-Goodner-Kelley ` theorem). ` In 
this connection, the following propositions on 
a Banach space X are mutually equivalent. (1) 
X’ is isometrically isomorphic to a space 

tL, (x). 2) For any Banach space Y, its linear 
subspace M, and e> 0, each compact linear 
operator T from M to X can be extended to a 
compact linear operator T from Y to X with 
| T\|<(1+8)||T|| Q. Lindenstrauss). A Banach 
space X is said to be projective if for any 
Banach space Y and its closed linear subspace 
M, each bounded linear operator § from X to 
the quotient space Y/M is lifted to a bounded 
linear operator § from X to Y, ie., p(Sx)= Sx 
for all xe X, where q is the quotient mapping 
from Y to Y/M. Projectivity of X is character- 
ized by its being isomorphic to the space /,(Q) 
on a set UL A Banach space is isometrically 
isomorphic to a space l} (R) if and only if it 

is projective with the property of norm- 
preserving lifting, Le. (S| = |S || is always 
possible (- [8]). 


N. Complemented Subspaces Problems 


A linear subspace M of a Banach space X is 
complemented, i.e., there is a closed linear 
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subspace N such that M AN = {0} and M + 

N = X if and only if M is the range of a 
bounded projection P, i.e., P? = P and R(P) = 
M. Each nonzero closed hnear subspace of a 
Hilbert space is complemented, or more pre- 
cisely, it is the range of a projection of norm 
one (- 197 Hilbert Spaces). This property 
distinguishes Hilbert spaces from general 
Banach spaces: (1) A Banach space of more 
than 3 dimensions is isometrically isomorphic 
to a Hilbert space if each nonzero closed linear 
subspace is the range of a projection of norm 
one (Kakutani’s theorem). (2) A Banach space 
is isomorphic to a Hilbert space if each closed 
linear subspace is complemented (J. Linden- 
strauss and L. Tzafriri [ 10]). 


0. Quasi-Banach Spaces and Fréchet Spaces 


Let X be a linear space over the real (or com- 
plex) number field. Suppose that a real-valued 
function | x|| on X satisfies (i) and (ii) of Sec- 
tion B and (iii) |x + y| <k(llx||+ | yll with a 
constant k > 1 independent of x and y. Then 
||x|| is called the quasinorm of x, and X equip- 
ped with a quasinorm is called a quasinormed 
linear space. Let 0 < p < 1 be the root of the 
equation k = 2/9)! Then there is a distance 
d(x, y) = d(x -y) depending only on x -y such 
that d(x—y)< |x—yl]?<2d(x—y). Hence a 
quasinormed linear space is a metric space in 
which a sequence x, converges to x if and 
only if ||x, —x | 0. If a quasinormed linear 
space X is complete under this metric, then X 
is called a quasi-Banach space. The "function 
space L, is a quasi-Banach space for 0 < p « 1. 

If we denote by |x||the distance d(x-0) of 
a quasinormed hnear space, then it satisfies (1) 
and (iii) of Section B and (ii^) | ^ x| = || x|| and 
Im, ||u,x,—ax|| =0 whenever a, and 
Dm, |x, — x|| =O. A functional |x|| satisfying 
(i), (0), and (iii) is called a pseudonorm. If a 
hnear space X equipped with a pseudonorm is 
complete, then X is called a Fréchet space (in 
the sense of Banach). The ‘function space $(Q) 
is a Fréchet space. Quasinormed linear spaces 
and Fréchet spaces are ttopological linear 
spaces, but they need not be tlocally convex. 
Hence it is possible that there is no continuous 
linear functional except for zero. However, the 
open mapping theorem and the closed graph 
theorem hold for linear operators from a 
Fréchet space into a Fréchet space. 
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38 (XXLl1 A 
Bernoulli Family 


The Bernoullis, Protestants who came origi- 
nally from Holland and settled in Switzerland, 
were a signiticant family to the mathematics 
of the 17th Century. In a single Century, the 
family produced eight brilliant mathemati- 
cians, all of whom played important roles in 
the development of calculus. 

The brothers James (1654-1705) and John 
(1667— 1748) and Daniel (1700— 1782), John’s 
second son, were especially outstanding. James 
and John were close friends of G. W. tLeibniz, 
with whom they exchanged the correspon- 
dence through which it might be said that 
calculus developed. James studied problems 
related to the ttautochrone and ?brachisto- 
chrone, as well as problems in geometry, dy- 
namics, and other fields, including the 'isoperi- 
metric problem. He was the first to change the 
name calculus summatoris to calculus integralis 
(1690). His Ars conjectandi was published after 
his death in 1713; in it is found the Tlaw of 
large numbers, which made his name promi- 
nent in the theory of tprobability. James had 
little guidance, learning mathematics on his 
own. He was a professor of experimental 
physics at the University of Basel and later 
became a professor of mathematics. He taught 
mathematics to his brother John, who gue, 
ceeded him as professor at the University. 
John's many achievements appeared in such 
publications of the time as Acta eruditorum 
and Journal des savants. In 1701, the begin- 
nings of the *calculus of variations were seen in 
his solution to the isoperimetric problem. He 
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was the first to use the term functio, the root of 
the present term function (1714). 

Despite discord between the brothers and 
also between fathers and sons, the Bernoullis 
were ardent teachers and brilliant researchers, 
who instructed not only their sons but also 
such mathematicians as tEuler. Their achieve- 
ments were numerous in consolidating the 
content and form of calculus and also in ex- 
panding its application. Daniel was especially 
outstanding in the theory of probability; he 
also made contributions to the field of thydro- 
dynamics and to the tkinetic theory of gases. 
The eldest John’s eldest son Nicolas (1695- 
1726) achieved distinction as a professor of 
mathematics in St. Petersburg. Daniel's 
youngest brother, John (1710—1790), suc- 
ceeded his father, John Sr., as a professor at 
the University of Basel. The son of John, Jr., 
also named John (1744— 1807), was the chair- 
man of mathematics at the Academy of Berlin. 
His brother, another James (1759-1789), was 
a professor of experimental physics at the 
University of Basel. Nicolas (1687—1759), a 
grandson of the founder Nicolas (162331708) 
and son of Nicolas the painter (1662-1716), 
held Galileo's old chair of mathematics at 
Padua from 1716 to 1719. 


Nicolas 
1623-1708 
James Nicolas (painter) John 
1654-1705 1662-1716 1667-1748 
Nicolas Nicolas Daniel John 


1687-1759 169551726 1700-82 1710-90 


John James 
1744-1807 — 1759-89 


In this article, the first names have been 
given in English. The German names corre- 
sponding to James, John, and Nicolas are 
Jakob, Johann, and Nikolaus, respectively. 
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39 (XIV.8) 
Bessel Functions 


A. General Remarks 


Bessel functions were first introduced in order 
to solve *Kepler's equation concerning plane- 
tary motions and were systematically investi- 
gated by F. W. Bessel in 1824. Since then they 
have appeared in various problems and have 
become important. 


B. Bessel Functions 


Separating variables for the Helmholtz equa- 
tion AV + EI = 0 in terms of cylindrical 


coordinates, we obtain  Bessel's differential 
equation 

dw ld w vi 

—+-—+{ 1—— ]}w=0 1 
dz? zdz ( =) (0) 


for the component of the radius vector. The 


following two linearly independent solutions 
of(D: 


1 E 
ie: | gum 
H L, 


1 ME 
H'?)(z) -f Eet, (2) 
L; 


Tt 


are called the Hankel functions of the first and 
second kind, respectively, where the contour L i 
of the first integration is a curve from ( = g + 0) 
+i% to —0— ioo, and L, is a curve from +0 
— ioo to (1 — 0) -- ioo. If both z and v are real, 
we have 


Hj (z)- Hy(2), 
where Z is the complex conjugate of z. Hence 
Jc) =(Hy” (2) - HP(2)2, 

N,(2) SY,(2)=(H0(z) = HP (2))2i (4 


H?)(z) = Ht? (2), (3) 





are real functions, If both z and v are complex, 
the functions J,(z) and N,(z) defined in (4) are 
also called Bessel functions and Neumann 
functions, respectively. The other names for 
J,(z), N,(z), and H,(z) are Bessel functions of 
the first, second, and third kind, respectively. 
Each of them satisfies the following recurrence 
formulas: 

4C,(z) 


See = aa (z) um C, (z), 


Qv/z) C.) =C,_,(z) + C44 G). (5) 


In general, functions satisfying the simulta- 
neous 'differential-difference equations (5) are 
called cylindrical functions. Every cylindrical 
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function C,(z) is represented in the form C,(z) 
-a, (9) Hz) all HPG), where a,(v) and 
a,(v) are arbitrary periodic functions of period 
1 with respect to v. 

If v = n (an integer) we have 


J_,(Z)=(—1)"Jn(2), 

N_,(z)=(—1)°N, (2), (6) 
which show the linear dependency of J_, and 
J,, and N_,, and N,, respectively. If v #n(an 
integer), as the fundamental solutions of (1) we 
can take a pair J, and J_,, or N, and N_,. In 
(2) if we take as a contour of integration a 
curve from (— z 4-0) - ioo to (x—0)+i00, we 
obtain an integral representation for J,(Z), 
which yields the relations 


Je = em J (7), 
d ize o mt (2. (7) 


If v = n (an integer) and Re z > 0, we obtain 


1 f7 
na=z | eicaint + ing dt (8) 
-1 
1 (* 
4| cos(zsin 6 — nC) dc, (9) 
( 


which is called Bessel’s integral. These repre- 
sentations imply the following expansions by 
means of generating functions: 


genie Y Agent, (10) 


n—-—oo 


cos(zsin 2) =Jo(z)-+2 Y. Jos (z)cos2nt, 
n-1 


x (11) 
sin(zsin () 2 2 ), Jy ,4,(z)sinQnt+ IN. 
n=0 
Making a change of variable y= exp( — it) in 


(2), we obtain 


1 Z l Me 
no-z] orl la “i du, (12) 
L 


where L is a contour starting at the point at 
infinity with the argument = 7, encircling the 
origin in the positive direction, and tending 
to the point at infinity with the argument 7, 
From (12) we obtain a power series expansion 


z "9 (—1)" gm 


Aire en mifte Am 13" 


(13) 
obtained also from (1) by a power series ex- 
pansion at z = 0, which is a tregular singular 
point of (1). Substituting (13) into 


N,(z) = (os vzJ,(z) = J_,(z))/sin vz, (14) 


we obtain a power series expansion for N,(z). 
A power series expansion for N,(z) for an inte- 
ger n is obtained by taking the limit yn (— 
Appendix A, Table 19). In particular, if v — 
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n + 1/2 (n is an integer), we have 
(22 M5 d” [sinz 
Jn dy 





J n+12(z)=(—1)” ’ 
Z 


n-0,1,2,..., 


which is represented by elementary functions 
and is sometimes called simply the half Bessel 
function. Bessel functions for half-integers have 
appeared also as radius vector components 
when the variables in the Helmholtz equation 
are separated by spherical coordinates. The 
function 


ja(z)= n/2z Jn+12(2) 
is called the spherical Bessel function. 
We have the following addition theorem: 
H(kp)e"’ = Y J (kr) HY, (kr leit, 
u=1,2, 


where 
p— /riri—2r,ricosq, 


pcosw=r,—r,cosg, —psinwy —r;sin q. 


C. Zero Points of the Function) ,(z) 


From the differential equations satisfied by 
J,(az), We have 


1 
(p | 2J,(a2) J,(Be)dz 
DN 


— BJ. (HIB) 24948) (15) 
By letting fia in (15), we have 


|, z(J,(az))* dz 


A1 ur +00?) (16) 


If à and f are distinct roots of J,(z) = 0, we 
have from (15) 


] 
| zJ,(az)J,(Bz)dz=0, | Rev» -1. (17) 
0 


The integral formulas (15), (16), and (17) are 
called Lommel’s integrals. 

As for the zero points of J,(Z), the following 
facts are well known: J(0) 2 0 if v >0. J,(z) has 
no multiple zero points other than z = 0. J,( —«) 
= 0 if J (x) = 0. Every zero point of J (Z) is real 
if v > -1. Between two adjacent zero points 
that are positive, there exists one and only one 
zero point of J, ,(z) and J,,,(z), respectively. 
LS) has a countably inlinite set of zero points 
on the real axis. When v z 0 is rational, every 
zero point of J,(z) except z = 0 is a ttranscen- 
dental number. The transcendentality of 7 is 
a special case of this result for v = 1/2. 
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D. Expansion by means of Bessel Functions 


Let f(r, oi be defined for 0<r <1, —1« «m, 
and Xn do Du 2> ZI m ` E < An, s < EE S= 


1, 2, for every n) be zero points of J,(x) (n = 
0, 1,2,. .. ) Then we have an expansion 


(a, ,cosno +b, ,sinno)J, (x, st), 
(18) 


which is called the Fourier-Bessel series. The 
coefficients a, . and b, , are determined by the 
properties of tFourier series and (16) and (17) 
as follows: 


ei - En 
b. s zt, (0, ))? 


Ion COS no 
J, n,s d dr; 
X | NC 9)J, (a. JA ^ r dq dr 


£y — 1, Eje mm 
The integral transformation 
o 
g(y)- | xf G)J, (xy) dx (19) 
0 


is called the Fourier-Bessel transform. If f(x) is 
sufficiently smooth and tends rapidly to zero 
as X oo, the following inversion formula 
holds: 


f(x) = | yg(y)J, (xy) dy. (20) 


There are other types of series expansions in 
terms of Bessel functions as follows: Dini's 
series 


8 


y JA. X) 
1 


(åm is the mth positive root of xJ,(x) -- HJ,(x) 
= 0, where H is a real constant); Kapteyn's 
series 


oo 


» a, J , all T m)x); 


m-1 

Schlómilch's series 
d oo 

2y 2 a, Jo(mx); 
2 m=0 


and the generalized Schlómilch series 


i ay J (MX) T b,H,(mx) 


1 ag 
dean 2. (mx/2)* ) 





where H,(mx) is the Struve function (- 
Section F). 
E. Asymptotic Expansion 


If z|or v is sufftciently large, the asymptotic 
representation for Bessel functions is obtained 
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by applying the tmethod of the steepest de- 
scent for (2). If |z|>|v|, we have 

2 T mn 
HG). |—expi| z——v—— ], 
d nz dii d 3 4 
—n<argz<2n, 
2 T mn 
H?(z)~_ [—exp| len 
EE EEN 
—2n<argz<7, 
2 
J\(z)~ /—cos pepe ] 
TZ 2 4 
—n<AargZ <T, 


2 
N (z) ~ LZale-ZeZl 
TZ 2 4 


Hence H{?(z) tends to zero as |z| oo in the 
upper half-plane, and becomes large exponen- 
tially as |z| 2 oo in the lower half-plane. The 
results for Hf? are obtained by interchanging 
“upper half-plane” with “lower half-plane” in 
this statement. 

If both |z| and |v| are sufficiently large, we 
have the Debye asymptotic representations. 
For example, if z = v sec fj (v 2 0, f» 0), we have 


HE: ”?(v sec B) ~ (nv tan f/2) !? 
x exp( + i(v(tan fi — f) — 7/4)). 
Ifz=vsecha (v>0,a>0), we have 
J, (v sech a) 
^ (2nvtanha) !? exp v(tanh a — a), 
N, (v sech a) 
~ (nv tanh q/2) !? exp via = tanh aj, 
If |v| ^ |z|, we have 


anf 


t 
HU: ?(vsec f) - ——- 


B 


xexp( +i Ri v (ian B — tan? p all 


x H(5?((v/3)tan? f) - 0(v^!), 


which is called Watson’s formula. 


F. The Wagner Function 


As an application of Bessel functions to the 
theory of nonstationary aircraft wings, T. 
Theodorsen introduced the function 


C(z) = HPHP) + AY(z)) 


[6], and H. Wagner considered the function 


1 2C(—i 
k,(s) al ee) 
Br 


2ni W 


[5], where Hf? (z), H(z) are Hankel func- 
tions, and for means a Bromwich integral 
giving the inverse Laplace transform. Then 
C(z) and k,(S) are called the Theodorsen func- 
tion and Wagner function, respectively. The 
function k,(S) is equal to the lift coefficient 
when a 2-dimensional flat wing suddenly 
proceeds forward a distance s at an angle of 
incidence 1/7. 


G. Functions Related to Bessel Functions 


The following functions are closely related to 


Bessel functions (- Appendix A, Table 19.IV). 


(1) Modified Bessel functions. 
Hai = e "7? J(e'*? 2), 
wë 

(2) Kelvin functions. 
ber, (z) +i bei,(z) = J,(e +3" z), 
her,(z) 4- i hei,(z) = HP (e* "^ 7), 
ker,(z)= — (n/2)hei,(z), 
kei,(z) z (1/2)her, (z). 

(3) Struve function. 

vo TT TE 
Nel È mame 
(4) Anger function. 


J,(zj= 1 cos(vO — z sin 0) d0. 


0 


When v is an integer n, we have J,(z)= J,,(z). 
(5 H. F. Weber function. 


1 n 
Eau; | sin(v0 — z sin 0) d0. 
XJ 


The last three functions satisfy certain in- 
homogeneous Bessel differential equations. 
Many other functions, such as *Airy's integral, 
Can be represented by Bessel functions. 
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A. General Remarks 


Biometrics is the branch of science that ap- 
plies mathematical and statistical methods 
to biological problems, and deals with all the 
phenomena that affect the physical, social, and 
psychological well-being of human beings. 
These phenomena involve the relationships 
of groups of human beings to other human 
beings, to animals, microbes, and plants, and 
to physical and chemical elements in the en- 
vironment. In dealing with these problems the 
biometrician encounters such theoretical tasks 
as analyzing autocorrelated data in time series 
and such practical undertakings as cost-vs- 
benefit evaluations of health programs. 
Biometrics began in the middle of the 17th 
century when Sir William Petty and John 
Graunt developed a new method of analyzing 
the London Bills of Mortahty. Petty and 
Graunt essentially invented the field of vital 


H 


statistics by studying the reports of christen- 
ings and causes of death and proposed a 
method called “political arithmetic.” 

Some fields to which biometries is relevant 
are given below. 


B. Statistical Genetics 


After early development in vital statistics, 
statistical genetics was founded on the new 
ideas emerging in statistics. Major contri- 
butions were made by Charles Darwin (1809— 
1882), Francis Galton (1822-1911), Karl 
Pearson (1857- 1936), and Ronald A. Fisher 
(1890-1962). 

Galton was the first to use the term "regres- 
sion" in statistics (> 403 Statistical Models 
D), when he observed that sons regressed 
linearly on their fathers with respect to stature. 
He called the phenomenon a "regression to 
mediocrity" because the deviations of the 
stature of sons were less than those of fathers. 
This gave rise to the measurement of COT- 
relation in the bivariate normal distribution 
by means of the coefficient of correlation 
(Pearson, 1897). Pearson is credited with the 
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creation of the discipline of biometry (bio- 
metrics), and he established the journal Bio- 
metrika to promote studies in the tield. Fisher's 
major contributions were to genetics and 
statistical theories. His General theory of nat- 
ural selection appeared in 1930. This Jandmark 
book, along with earlier and later publica- 
tions, represented Fisher’s attempts to give 
quantitative form to Darwin’s views and to 
frame a statistical theory of evolution. 


C. Bioassay 


Bioassay is a set of techniques for evaluating 
the effectiveness of dosages of drugs by moni- 
toring biological responses. It entails the use of 
special transformations, such as probits and 
logits, as well as the application of regression 
to the estimation of dosages that are p percent 
effective within stated confidence limits. Prob- 
lems to be solved include measuring relative 
potency, slope-ratio assays, and quantal re- 
sponses vis-à-vis tolerance distributions. 


D. Demography 


Demography, which includes traditional vital 
statistics, rates and ratios, life tables, com- 
peting risks, actuarial statistics, and census 
enumeration techniques, is a part of bio- 
metrics. In this category, many tabulations of 
data consist of time series of events or rates 
classitied by age. For the analysis of such data, 
the cohort analysis techniques described in 
Hastings and Berry [3] are employed. 


E. Epidemiology 


The quantitative description of an epidemic 
should state the sensitivity and specificity of 
any diagnostic tests, as well as the true in- 
cidence or prevalence of the epidemic from 
survey results. Within an epidemiological 
theory, a disease is studied by the use of deter- 
ministic and stochastic models, wherein the 
theory of Markov chains can be applied (- 
260 Markov Chains). Differential equations 
involving probability-generating functions or 
tmoment-generating functions can be solved to 
yield the tprobability distribution, tmean, and 
tvariance of the number of infected individuals 
as functions of time [4, 5]. Fundamental to 
this whole field of application is a clear under- 
standing of causality and association. 

When clinical trials are possible, two groups 
of persons, “treated” and “untreated,” are 
monitored over a period of time with regard to 
the incidence or recovery from the disease 
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under study. The techniques in this procedure 
include compiling reports on persons to be 
observed, using double-blind techniques, and 
combining multiple-response variables by use 
of multivariate analysis (- 280 Multivariate 
Analysis). 

If clinical intervention is forbidden on 
ethical grounds, e.g., studying congenital mal- 
formation by infecting pregnant women with 
German measles, the relative risk of expo- 
sure is estimated from retrospective studies. 
In practice, almost al] statistical studies in 
epidemiology are retrospective with the sole 
exception of clinical trials. The research is ex 
post facto because investigators are mostly 
confined to describing and analyzing sudden 
and/or obvious events in the etiology of the 
disease. 


F. Clinical Trials 


In clinical trials, many problems have arisen 
for which biometricians have had to develop 
special techniques. One such technique takes 
account of unexpected adverse effects of drugs, 
and the consequent early termination of a 
trial. Moreover, when data demonstrate a 
trend earlier than expected, investigators will 
desire to end the accession of patients and to 
stop further treatment with what may be an 
inferior regimen. This means that the bio- 
metrician must be familiar with the problems 
of multiple examinations of data, multiple 
comparisons, and other adjustment procedures 
required by the ex post facto dredging of data. 


G. Future Trends 


There are two areas in which the biometrician 
has played a leading role recently. These are 
pertinent in many different applications and 
problems, and considerable methodological 
research has been devoted to the two areas. 
The areas are "mathematical modeling" and 
"effects of hazardous substances." 


Mathematical Modeling. The relationship 
between a set of independent variables and the 
dependent or response variable(s) is usually 
referred to as a mathematical model. The 
model may take the form of a standard multi- 
ple regression analysis with a single response 
variable or with multiple response variables as 
in multivariate analysis (- 280 Multivariate 
Analysis). 

It is generally assumed that specialists with 
substantive knowledge of the specitic field 
of applications (epidemiology, toxicology, 
pharmacology, radiology, genetics, etc.) play a 
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crucial role in determining a model or rela- 
tionship between a set of independent vari- 
ables and response variable(s). However, it is 
mainly the biometrician who finally selects the 
specific model establishing the functional 
relationship and who attempts to measure the 
strength or influence of the independent vari- 
ables therein. The biometrician is also ex- 
pected to contribute substantially to the deci- 
sion as to whether the relationship is causal, or 
merely one of association or correlation. For 
example, in the measurement of carcinogenic- 
ity of a food additive or drug, questions arise 
as to whether a substance can be judged harm- 
ful if it “accelerates” the appearance of a tumor 
even though it does not increase the incidence 
of the abnormal growth. In general, the answer 
to this question is in the affirmative when an 
unequivocal dosage-response relationship 

is indicated between the substance and the 
tumor. 


Effects of Hazardous Substances. With the 
successful conquest of most of the infectious 
diseases that have plagued mankind through- 
out history, health authorities have recently 
been concentrating on two chronic diseases 
whose etiology is yet to be determined: cardio- 
vascular disease and cancer. In both cases 
there is no disagreement with the thesis that 
heredity exercises a determining influence, but 
the role of the environment in causing many 
cases is also unquestioned. Measurement of 
the risk due to potentially toxic substances in 
the environment, principally with respect to 
these two diseases, represents the greatest chal- 
lenge to the biometrician today. The social 
benefits of success make this a tantalizing area 
of research, though exceptional complexities 
are involved. 

For related topics -> 263 Mathematical 
Models in Biology. 


References 


[1] D. J. Finney, Statistical method in biolog- 
ical assay, Griffin, second edition, 1964. 

[2] D. R. Cox, The analysis of binary data, 
Methuen, 1970. 

[3] D. W. Hastings and L. G. Berry (eds.), 
Cohort analysis: A collection of interdiscipli- 
nary readings, Scripps Foundation for Re- 
search in Population Problems, Oxford, Ohio, 
1979. 

[4] N. T. J. Bailey, The mathematical theory of 
epidemics, Griffin, 1957. 

[5] M. S. Bartlett, Stochastic population mod- 
els in ecology and epidemiology, Methuen, 
1960. 


41 B 
Boltzmann Equation 


41 (XX.20) 
Boltzmann Equation 


A. Introduction 


The Boltzmann equation is an equation of 
motion of a rarefied gas given by L. Boltzmann 
in 1872 [1]. He used it successfully in his 
pioneering work on the kinetic theory of gases 
and on the more general statistical mechan- 
ics (- 402 Statistical Mechanics), but the 
equation was held to be inconsistent with the 
classical mechanics used in its derivation; 
objections were raised by, e.g., J. Loschmidt 
(1895) and E. Zermelo (1896). Through the 
controversies it became widely recognized that 
the Boltzman equation should be justiiied by 
means of the *theory of probability, and many 
such justifications have been proposed. Also, 
this equation has been studied extensively 

as a significant nonlinear partial differential 
equation. 


B. Boltzmann Equation 


Let f= f(t, x, č) be the density of gas molecules 
having position x € R? and velocity č € R° at 
time t. The Boltzmann equation is a conser- 
vation law for f of the form [1-3] 


f7 - VS -alt x): Vif * QLf]. (1) 


Here a(t, x) denotes the external force and Q is 
a quadratic nonlinear integral operator in é- 
space describing binary collisions of the mole- 
cules. The integral kernel of Q, called the colli- 
sion cross section, depends on the intermole- 
cular forces. The two classical examples are the 
hard ball model (a gas of rigid spheres) and the 
inverse power law potential (one proportional 
tor °, where ris the intermolecular distance 
and 5 > 1). The latter gives rise to singularities 
in the kernel, and a cutoff is customarily em- 
ployed to avoid this difficulty. Grad’s hard 

(s 2 5) and soft (s « 5) cutoff potentials may be 
effectively used in the study of (1) [3]. 

If the gas is contained in a vessel (domain), 
then f must also satisfy boundary conditions 
determined by the assumed law of reflection 
at the walls (boundaries) (specular reflection, 
random reflection, etc.). For spatially homo- 
geneous gases with no external forces, (1) re- 
duces to 


4 QLf]. (2) 


with f = f(t, č). The Maxwellians, those solu- 
tions satisfying the tMaxwell-Boltzmann dis- 
tribution law, are the only stationary solutions 
of (2); they describe equilibrium states. 
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C. Justification by tbe  Tbeory of Probability 


Consider (2). The collision process is a tMar- 
kov process, and the tmaster equation ob- 
tained from the tChapman-Kolmogorov equal- 
ity is equivalent to (2) under the assump- 
tion of “propagation of molecular chaos” (G. 
Uhlenbeck (1942)). This assumption becomes 
valid in the limit as N (total number of mole- 
cules) > oo (M. Kac (1955), H. McKean 
(1967), F. Griinbaum (1971)) [4]. Also, non- 
linear Markov processes defined by (2) (with 
probability density f/N) have been studied 
(McKean (1967), H. Tanaka [5]). No such 
results have yet been established for (1). 


D. Existence of Solutions 


T. Carleman [2] gave the first solutions of the 
Boltzmann equation, solving globally in time 
the *Cauchy problem for (2) for the hard ball 
model. His result has been extended to a wide 
class of potentials. 

The spatially inhomogeneous case (1) is also 
known to have global solutions if a(t, x) =0 
and if the initial data for f are nearly Max- 
wellian. This was first proved by S. Ukai [6] 
assuming periodicity in x, and then proved for 
the Cauchy problem by Ukai [7] and T. 
Nishida and K. Imai [8], and for the initial 
boundary problem by J. P. Guiraud [9] (ran- 
dom reflection) and Y. Shizuta and K. Asano 
[ 10] (specular reflection) for bounded domains, 
and by Ukai and Asano [ 1 1] for exterior 
domains, all assuming Grad’s hard cutoff 
potentials. The case of soft cutoff potentials 
was also solved (Ukai and Asano, R. Caflisch 
(1980)). All the solutions are unique and 
tend to Maxwellians as t> 00, with certain 
decay rates. There are some results also on 
1-dimensional shock wave solutions (B. Nico- 
laenko (1975)) and stationary solutions (J: P. 
Guiraud (1972) (bounded domains) Ukai and 
Asano (1980) (exterior domains)). 

For initial data far from the Maxwellian, (1) 
remains unsolved even locally in time for hard 
cutoff potentials. As for noncutoff potentials, 
the existence theorem in [5] is the only result 
known So far for either (1) or (2). 


E. Hilbert Expansion 


Put f €= UN with g- 1/N.If f* has a power 
series expansion in €, then (1) gives an infinite 
system of equations for the coefficients. D. 
Hilbert (19 12) proposed a method of solving 
this system as an application of his theory of 
integral equations, initiating the attempt to 
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solve the Boltzmann equation. No proofs of 
convergence exist, but his expansion and the 
improved one by S. Chapman (1917) and D. 
Enskog (19 17) made possible the applications 
of (1) to hydrodynamics. The zeroth-order 
approximation to the expansion gives rise to 
the compressible Euler equation, and the first- 
order approximation yields the compressible 
Navier-Stokes equation. The solutions f* of 
the Cauchy problem for (1) with nearly Max- 
wellian initial data converge to those of the 
compressible Euler equation locally in tas 
£0 (Nishida [12], Caflisch (1980), and 

to those of the compressible Navier-Stokes 
equation asymptotically as t + oo when => 0 is 
fixed [13]. 
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A. Boolean Algebras 


Boolean algebra was introduced by G. Boole 
to study logical operations (- 411 Symbolic 
Logic). It is now included within the more 
general concept of tlattice or lattice-ordered 
set (- 243 Lattices) and appears not only in 
logic but also very often in analysis in the form 
of a particular lattice of sets, e.g., the lattice of 
measurable sets. 

Let L be a given set and suppose that to any 
pair of its elements x, y there correspond two 
elements x U y, x N y of L (called join and meet 
of x and y, respectively) such that the follow- 
inglaws arevalid: (1) x Up=yUx, xNy=yfx 
(commutative law); (2) x U (y Uz) = (x Uy) Uz, 
xN(yNz)=(xNy)Nz (associative law); (3) xU 
(y Nx) =(x U y Nx =x (absorption law); (4) x U 
(yAz)=(xUy)N(xUz), xn(yUz) S(xn will 
(x N z) (distributive law); (x, y, ze L). From 
(1), (2), and (3) it follows further that x U x = 
x Nx = x (idempotent law). If x < y is defined 
to mean x U y = y, L becomes an *ordered set 
with respect to the ordering € Now suppose, 
moreover, that the following law holds: (5) 
there exist a least element O and a greatest 
element J, and for any element x there exists an 
element x’ satisfying x Ux = ],x N x’ =0 (law of 
complementation). Then L is called a Boolean 
algebra (or *Boolean lattice). In this case x’ is 
uniquely determined by x and is called the 
complement of x. The binary operations 
(x, y)-x U y, x Ny together with the operation 
Xx are called Boolean operations. These 
operations obey de Morgan’s law (xU y)’ = 
x Ny xN =x u y. 


B. Generalized Boolean — Algebras 


Suppose that a < b holds for two given ele- 
ments a, b of an ordered set. Then the set of all 
elements x satisfying a <x <b is denoted by 
[a, b] and is called an interval. An interval of a 
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Boolean algebra is also a Boolean algebra with 
respect to the induced operations U and N, 
where the least and greatest elements are a and 
b, respectively, and the complement of x in 

[a, b] is equal to a U (x N b) =(a Ux’) N b. More 
generally, if a set L with two operations U, f) 
satisfying (1)-(4) above has a least element 0 
and if each interval of L satisties (5) (1e., is a 
Boolean algebra), then L is called a generalized 
Boolean ` algebra. 


C. Boolean Rings 


A ring L satisfying the condition xx = x for all 
x € L (i.e., all of its elements are 'idempotent) is 
called a generalized Boolean ring, and if it has 
a unity element then it is called a Boolean ring. 
A generalized Boolean ring L satisfies X +x = 0 
for all x € L and is necessarily a commutative 
ring. A (generalized) Boolean algebra L be- 
cornes a (generalized) Boolean ring if for any 
elements x, y of L the sum x + y is defmed to 
be the complement of x Ny in the interval 

[0, x U y]. and the product xy is delined to be 
X[1y. A nonempty subset J of a (generalized) 
Boolean algebra L is an ideal with respect to 
the corresponding structure of the ring if and 
only if xU yeJ for x, yeJ and xf1yeJ for 
X€J, ye L. More generally, in any lattice, a 
nonempty subset that satisfies these conditions 
is sometimes called an ideal of the lattice. 


D. Representation of a Boolean Algebra 


Any Boolean algebra L is isomorphic to a 
Boolean lattice of subsets in a set X. If Lis of 
finite "height, then L is isomorphic to the 
Boolean lattice P(X) of all subsets of X. In 
general X can be taken to be the set of all 
maximal ideals of L, Let ae L and let O(a)= 
im|me X, a¢m}. The isomorphism is obtained 
by the mapping a-+O(a). If we define a topol- 
ogy in X such that (O(a) ae L} is the topen 
base, then X is a compact, totally discon- 
nected T, space and O(a) is characterized as 

a compact open set in X. Such a space X is 
called a Boolean space (M. H. Stone [3,4]). 

In any complete Boolean algebra L, the 
complete distributive laws hold: (sup; xj)f1y = 
sup;(x;f! y) and its dual. These are equivalent 
to the stronger relations: (sup; x;)PY(sup; yj) = 
sup,,(x; N yj and its dual. In order for a 
Boolean algebra L to be isomorphic to the 
Boolean algebra P(X) of all subsets of X, it is 
necessary and sufficient that the following 
strongest complete distributive laws hold: 
infí(sup;x;) = supy(infjx; o) (where F is the 
set of all functions @ assigning to each ie I a 
value o(i)eJ(i)) and its dual. 
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A. General Remarks 


A complex-valued function defined on a subset 
E of the complex z-plane is called a bounded 
function defined on E if its range f(E) is 
bounded, that is, if there exists a positive 
constant M such that | f(z)| < M on E. How- 
ever, when studying the theory of bounded 
functions, we usually restrict ourselves to the 
consideration of tanalytic or tharmonic func- 
tions. On the other hand, the classes of func- 
tions f(z) satisfying conditions such as Re f (z) 
»0 or a «arg f(z) « B rather than | f(z) x M 
are studied by a method similar to that ap- 
plied to the study of bounded functions. 

tSchwarz’s lemma, tLiouville’s theorem, and 
*Riemann's theorem on the removability of 
singularities (which will be explained later) are 
among the classical theorems in the theory of 
bounded functions. 


B. Maximum Principle 


When a function f(z) 1s holomorphic and not 
constant in a domain D of the complex plane, 

| f (z)| never attains its maximum in the interior 
of D. In particular, when f(z) is continuous on 
the bounded closed domain D= D V op. the 
maximum of | f(z)| on D is taken on its bound- 
ary QD. This fact is called the maximum (mod- 
ulus) principle. 

As a direct application of the maximum 
principle, we can deduce Schwarz's lemma: If a 
holomorphic function f(z) in |z| « R satisfies 
|f(z) & M and f(0) 20, we have |f(z) & M: 
|z|/R (|z| « R). The equality at zo 0 « |z;o| « R, 
occurs only for the functions f(z) = e?M z/ R 
(where A is a real constant). 
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C. Lindelóf's Principle 


E. Lindelof extended Schwarz's lemma and 
obtained various extensions of the maximum 
principle, from which he, together with E. 
Phragmén, deduced several useful theorems on 
the behavior of a function that is single-valued 
and holomorphic in a neighborhood of the 
boundary. We mention some representative 
theorems: 

Let z = q (C) and w = y(C) both be *mero- 
morphic and tunivalent functions in |{]< 1 that 
map |{|< 1 onto D, and D, , respectively. Set 
Q(0) 2 z, and w(0)=w,. Let Dip) and D,(p) 
denote the images of |£] € p (0 < p< 1) under 
the mappings @ and y, respectively. Under 
these circumstances, if a function f(z) that is 
holomorphic in D, satisfies f(D,) C D, and f(zọ) 
= Wo, then f(D,(p)) C D,(p. Furthermore, 
unless f(z) maps D, onto D,, univalently, 
f(DAp)) is contained in the interior of D,(p) 
(an extension of Schwarz's lemma). 

Let f(z) be analytic in a bounded domain D 
but not necessarily single-valued. Suppose that 
| f(z)| is single-valued. Suppose, furthermore, 
that there is a positive constant M such that, 
for each boundary point ¢ of D, except for a 
finite number of boundary points and for each 
£ > 0, the inequality | f(z)| < M holds on the 
intersection of D with a suitable neighborhood 
of C, and suppose also that each of the excep- 
tional points has a neighborhood such that 


f(z) is bounded on the intersection of D with 


this neighborhood. Under these assumptions 
we have | f(z)| < M. Moreover, if | f(z9)| - M at 
a point zo of D, then f(z) is a constant (an 
extension of the maximum principle). 

Let f(z) be holomorphic in an angular 
domain W: arg z| « ax/2. Suppose that there is 
a constant M such that, for each ¢ > 0, each 
finite boundary point has a neighborhood 
such that | f(z)| < M +¢ on the intersection of D 
with this neighborhood, and that for some 
positive number f > g and for sufftciently large 
|z| the inequality | f(z)|<exp|z|'/* holds. Under 
these assumptions we have | f(z) < M in D 
(Phragmén-Lindelóf theorem). 

Let f(z) be a function that is holomorphic 
and bounded in a closed angular domain W: 
& < arg z < fl except for the point at infinity. 
Suppose that f(z)a as z oo along a side of 
the angle and that f(z) b as zoo along the 
other side of it. Then we have q= band f(z)>a 
uniformly as z oo in W (Lindelöf ’s asympto- 
tic value theorem). 


D. Bounded Functions in a Disk 


If f(z) is a bounded holomorphic function in 
the unit disk |z| « 1, it has a limit at every 
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point Ze on the circle C: |z|= 1, except for a set 
of 1 -dimensional measure zero, as z tends to Zo 
from within an angle with vertex at zo and çon- 
tained in |z| « 1 (or along a *Stolz’s path at zp) 
(Fatou's theorem). Under the same assump- 
tion, if the boundary value function f(e") = 
lim, ,,. 9 f(re?), which exists by Fatou's theo- 
rem, is equal to a constant a for a set of posi- 
tive measure on the circle C, then f(z) =a in 
|z| < 1 (F. and M. Riesz theorem). These theo- 
rems are valid for some kinds of meromorphic 
functions in |z| < 1 (- 272 Meromorphic 
Functions D). 


E. Three-Circle Theorem and Related 
Theorems 


Let f(z) be a function that is single-valued, 
holomorphic, and not identically zero in an 
annulus p< |z| « R. Set M(r)= max, | f(z) 
(p <r < R) for f(z). Then log Mtr) is a tcon- 
vex function of log r in log p < log r<log R 
(Hadamard's three-circle theorem). The same 
assertion holds for a function f(z) that is not 
necessarily single-valued, as long as | f(z)| is 
single-valued. When f(z) is single-valued, a 
stronger assertion can be obtained (0. Teich- 
müller) The following theorems are regarded 
as the analog of Hadamard's three-circle 
theorem for the respective basic regions: 

Set L(c) sup. real flo + it)| (<0 « p) 
for a function that is bounded and regular in a 
strip x < Re z < fl. Then log L(a) is a convex 
function of g in a < g < f (Doetsch's three-line 
theorem). 

Set Jel- lim sup, „œ | f(o + it)| (x < 0 < P) for 
a function that is holomorphic and bounded in 
a half-strip x < Rez < f, Imz > 0. Then log Wal 
is a convex function of g in a < c < fi (Hardy- 
Littlewood ^ theorem). 

Set 


1 2n 
L(r)=—— IP qg 
ER | irre”) 
for a holomorphic function in a disk |z| < R. 
Then for every p» 0,log wo 1s an increasing 
convex function of logr for —oo < logr < log R 
(Hardy's theorem). 


F. Hardy Class 


Hardy's theorem motivates us to introduce a 
class of functions. An analytic function f in the 
unit disk is said to belong to the Hardy class 
HP’ (0 « p « oo) if I,(r, f) - Qn) ff" | fre”) d0 
remains bounded as r1. For the case p= oo, 
fe H” if I, (r, f)=max,,,-,| f(z)| is bounded. 
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An analytic function fis said to be of the class 
N if (2"log* |f(re^)| d0 is bounded for r< 1. 

f €Nifand only if f = qi, where ode", 

f € H? if and only if | f|? has a tharmonic majo- 
rant. If 1 <p € oo, H” is atBanach space with 
the norm || f ||, — (sup, I,(r, f))"? for p< oo 
and || f| , sup, L,(r, f) for p= œ, and if 0< 
p <1, H’ isa complete metric space with the 
metric sup, <; lpr, f —g). For 0«p«q« oo the 
inclusion relation N > H? > H* 2 H? holds, so 
that Fatou's theorem and that of F. and M. 
Riesz are valid for functions of HP (p> 0). 

For a nonnegative integer p and a sequence 
of nonzero complex numbers a, |a,| « 1, the 
infinite product B(z) = z^ 17, (o, (o, —z)/|a,| (1 

&,z)) converges locally uniformly in the unit 
disk if and only if 2, (1 —|a,]) « oo. Then 
B(z) is called a Blaschke product and {a,} a 
Blaschke sequence. If the sequence f, }U {0} is 
the set of zeros of an analytic function f (an m- 
tuple zero appears m times in (oj, and z = 0 is 
a p-tuple zero), B(z) is said to be generated by 
the zeros of f. If f e N, then 1, (1—|a,|)< oo 
for the set {a,} of zeros off: An analytic func- 
tion f in the unit disk is called an inner func- 
tion if its modulus is less than 1 and its non- 
tangential limit on the unit circle is of modulus 
1 almost everywhere. A Blaschke product is an 
inner function. An inner function f without 
zeros is said to be singular if f(0) > 0. A sin- 
gular inner function f can be represented by 
f(z)=exp{ — fó"(e" + z)(e" —z)du(0)} with a 
positive measure p tsingular with respect to 
Lebesgue measure. An outer function F(z) for 
HP (p > 0) is an analytic function that can be 
represented by 


; 1 2n pi? +7 
F(z) = eer | 8,108 V (0) ao} 
0 Se 


where r is a real number and y/(0) is a non- 
negative function such that V(0)e L’? and 
logy(0)e L+. An H?-function f (p» 0) is fac- 
torized uniquely as f(z) = B(z)S(z) F(z), where 
B(z) is a Blaschke product (generated by the 
zeros off), S(z) a singular inner function, and 
F(z) is an outer function for H? (with y(0)= 
1/(e9)]) (factorization theorem). Conversely, 
such a product belongs to H”. The inter- 
polation prohlem raised by R. C. Buck asks 
whether or not there exists a sequence of dis- 
tinct points {z, |z,| « 1) such that, given any 
bounded sequence of complex numbers Ju.) 
there is a function f € H® for which f(z,) Wa 
The sequence {z,} in the problem is called an 
interpolating sequence. L. Carleson proved that 
a necessary and sufficient condition for {z,} to 
be an interpolating sequence is that there be a 
620 such that IL,z,l(z, —zJ/(1 —z,7,2ó 
(k=1,2,...), An interpolating sequence is a 
Blaschke sequence, and any sequence {z, || z, 
= 1} contains an interpolating subsequence 
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tending to the unit circle. An analogous theo- 
rem is obtained for a function of H? (p> 1). 


G. Corona Problem 


The Hardy space H® on the unit disk D is 
furthermore a commutative *Banach algebra 
with identity. Let M denote the *maximal ideal 
space of H" It is a compact Hausdorff space 
with respect to the tGel’fand topology. To 
each point ze D, there correspond a homomor- 
phism 9;: 9,(f) =f(z) for f e H^, and a mazi- 
mal ideal M, = 1 fe H® | f(z) = 0} as its kernel. 
This correspondence gives a continuous injec- 
tion of D into the maximal ideal space M of 
H”. By identifying z and M,, D is regarded as 
an open subset of M, and the corona problem 
then asks: Are there points of M outside the 
closure of D? The negative answer was given 
by L. Carleson, that is, D is dense in A4 (corona 
theorem). The corona theorem is obtained as a 
direct consequence of the following theorem 
and is equivalent to it. For functions Lef", 
i=1,2,..,n,satisfying X|fi(z)| äm D with 
some 6 >Q, there exist gge H”, i=1,...,n, such 
that > f(z)g,(z) = 1 in D. As a trivial conse- 
quence of the corona theorem, we have the 
following cluster value theorem: Let C( f, {) 
denote the ‘cluster set of f € H” at ( with 

|| * 1; then C(f,Q)=f(M), where f is the 
Gel’fand transform off and M, is the tfiber of 
M over ¢, M is decomposed into pairwise 
disjoint *Gleason parts. A Gleason part is a 
connected open set or a singleton according as 
each point of it is captured in the closure of an 
interpolating sequence or not. Each point of 
the *Shilov boundary I of M forms by itself a 
trivial Gleason part, and Į is *adherent to a 
sequence S in D with the property that the set 
of its nontangential limit points covers the unit 
circle (Brown-Shield-Zeller theorem). Let D 

be, in turn, a general bounded domain. The 
corona theorem is proved for some classes of 
such domains. A Shilov boundary I lies only 
over the set of points on OD which are not 
*removable for bounded analytic functions. It 
is adherent to a sequence S in D on which 
sups| f(z)] — |f |... For any fe H” and CeóD, 
the boundary of fi (M,) is contained in fi T) if T, 
= IMM, is not empty. This, together with the 
cluster value theorem, has an important im- 
plication in the theory of cluster sets. The ex- 
tremal problem of maximizing f '(zj)l, Zap D, 
among all fe H?” satisfying II fll in 1 has the 
unique normalized solution G such that Gel 
= max | f Gell, G(z) is called the Ahlfors func- 
tion. The Ahlfors function has unit modulus on 
the Shilov boundary (Fisher's theorem). The 
theory of H* is also applied to the problem of 
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'rational approximation of bounded analytic 
functions. 


H. Applications of the Maximum Principle 


Theorems of the following type are useful for 
some problems of holomorphic functions: 

Let D be a domain. Suppose that there 
exists an arc of angular measure g that is on a 
circle of radius R centered at a point z of D 
and not contained in D. Let C denote the 
intersection of the boundary of D with the disk 
|z 7 zo| « R. If f(z) is a single-valued holom- 
orphic function that satisfies f(z)| € M, and if 
lim sup, ,; f(z)|& M for every (e C, then the 
inequality f(z9)| < M! miir holds for every 
positive integer n satisfying 2z/n < a (Lindeléf’s 
theorem). 

Let D be a domain bounded by two seg- 
ments OA, OB both starting from 0 and mak- 
ing an angle zo, and a Jordan arc ÁB, and 
let R be the maximal distance between 0 and 
the points on ÁB. Suppose that f(z) is holo- 
morphic in D and that lim sup] f(z)| as zo 
Ce ðD with ze D is not greater than M for 
(€ OAU OB and mfor (e KB. Then we have 
L/(z)| M! Ami (where 4 —(|z|/R)') at every 
point on the bisector of the angle / AOB in D 
(Carlemann's theorem). 


I. Holomorphic Functions with Positive Real 
Parts 


Holomorphic functions with positive real parts 
are intimately connected with bounded func- 
tions. Concerning these functions we have the 
following classical result, which is equivalent 
to Schwarz's lemma: If f(z) is holomorphic in 
z|< R, Re f (z) > 0 in the same domain, and 
J(0)= 1, then (R—|zI(R t |zl) & Re f(z) <(R + 
\z\)/(R =|z\)(\z|< R). The right or left inequality 
becomes equality for some z}, 0 < |zg| < R, only 
i fl2)=(Rzp Fzoz)/(R +292), respectively. 

In order to prove various results for the 
class of functions with positive real part, 
Herglotz's integral representation, which is 
based on Poisson's integral representation and 
unique to this class, can be used effectively. It 
is given by 


25 Q9. 7 
f(z | = 


0 e? — 7 





dp(o), |z| « R, 


where p(q)is monotone increasing (real- 
valued) with total variation 1 and is deter- 
mined uniquely up to an additive constant by 
f(z). An analogous integral representation is 
introduced for a holomorphic function in an 
annulus. 

Similar to the case of holomorphic functions 
is the notion of Hardy classes to harmonic 
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functions, that is, a harmonic function u in the 
unit disk is said to be of the class? (0< p< 
oo) if I, (r, u) is bounded for r < 1. u €h” if and 
only if y+ ive H”, where vis conjugate to u. h! 
is equal to the vector space of functions which 
are representable as a difference of two non- 
negative harmonic functions. The Herglotz 
theorem stated above is then a corollary of the 
theorem of integral representation for ht- 
functions. 


J. Coefficient Problems 


There are many classical results for partial 
sums and coefficients of the Taylor expan- 
sion of bounded functions in a disk. Let f(z) 
= Änt be the Taylor expansion of a 
bounded function in |z| € 1. Set its partial sum 
Sz) E otcz (n=O, 1,...). and let t,(z)= 
(L/(n - 1) 37-95, (z) (n=0, 1, ... ), which is the 
sequence of the arithmetic Means of the par- 
tial sums (the Fejér sums). Then | f(z)| € 1 in 
Iz| « 1 if and only if |t,(z)| <1 for |z| 2 1 (n= 

0, 1, ) (L. Fejér). Thus the sequences {t,(z)} 
for bounded functions f(z) are uniformly 
bounded, whereas the sequences {s,,(z)} are not 
uniformly bounded. Indeed the maximum 
value of |s,(1)| over the set of function f satis- 
fying | f(z)| € 1 for|z| « 1 is 


an {—1/2\? 
1+) ( e 
dk J 

= y (= oy 

j=0 2-4... 2j 
(G,~x 'lognas n>). 

The following result is decisive for coeffi- 
cient problems: Set h,, = 3/5 oc, jc, j(u & v), 
h,, = h,, for f(z) = Y. o c,z", let m; be the 
maximal eigenvalue (a nonnegative real num- 
ber) of the Hermitian matrix ( —h,,) and 
let m=lim m,( 2 0). 

Let H be a Hermitian form in inlinitely 
many variables given by 


G, 


n 
n,v—-0» 


2 


o0 o0 
=m) b 
y=0 H 


-0 








oo 
5 CyXu+v 
v=0 


Then a necessary and sufficient condition for 

| f(z)| € 1 when |z| < 1 is that H be positive 
semidetinite, i.e., the sequence of the ‘principal 
minor determinants of H, 


1 2 m 
A = An 
EH 


is all positive or positive for an initial finite 
number of them and zero for the remainder 
(I. Schur). 
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A corresponding result for functions with 
positive real part in a disk can be stated in a 
simpler form: A holomorphic function f(z) = 
1/242 7.,¢,2" in |z| « 1 satisfies Re f(z) 20 if 
and only if 


D ie iss gc. 
1 d 

i “1150, n=1,2,... 

Elfen 


(C. Carathéodory). Furthermore, for n — 
1,2, , when we regard (c,, . , c,) as a point 
of complex n-dimensional Euclidean space, we 
can determine the domain of existence of 
points satisfying this criterion by Carathéo- 
dory. This result is generalized for coefficients 
of the Laurent expansion of a function that is 
holomorphic and single-valued in an annulus. 
Next, there are some results for a function 
that omits two values: If f(z) = a, + a, z 
... (a4 0) is holomorphic in |z| « R and 
f(z) Z0, 1 in the same domain |z| < R, then 
there exists a constant Lige, a.), depending 
only on a, and a,, such that R < L(ag, a4) 
(Laudau's tbeorem). Under these circum- 
stances, | f(z)| « Side, 0) in |z|< OR for 0 «0 « 1, 
where S(a,, 0) is a constant depending only on 
a, and f (Scbottoky's tbeorem). These theo- 
rems have applications in value distribution 
theory. On the other hand, coefficient prob- 
lems have been investigated as extremal prob- 
lems in. H?-spaces under general conditions. 


K. Angular Derivative 


Let f(z) be holomorphic in |z| « 1. If f(z) wo 
uniformly as z 29 along Stolz paths with end 
point at Zo and if the limit lim... (( f(z) — wo)/(z 
— Zoll = D exists, we call D the angular deriva- 
tive of j"(z) at za, In the case of the half-plane 
Re z > 0, the angular derivative at the point Zu 
on the imaginary axis is similarly defined. It 
should be noted that f(z) = wọ is replaced by 
1/f(z) for wọ = oo and 1/(z = zo) by z for zo = œ. 
In the latter case, a Stolz path is a path con- 
tained in an angular domain |argz| <a ( < 2/2) 
and tending to oo, The study of angular de- 
rivatives was initiated by G. Julia (1920) and 
J. Wolff (1926) and was further advanced by 
Carathéodory (1929) and E. Landau and G. 
Varilon (1929). 

A fundamental theorem for angular deriva- 
tives can be stated as follows: If a holomorphic 
function f(z) in Re z> 0 satisfies Re f(z) 2 0, 
there exists a constant c (0 € c < +00) such that 
f(z)/z 5c and f(a uniformly as z— oo 
along every Stolz path. Moreover, the pth 
derivative of f(z) for an arbitrary positive 
p, denoted by D?f(z), has the property that 
z^! DPf(z)S c/T (2 — p) uniformly. Further- 
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more, the inequality Re f(z) > c Re z holds 
everywhere in Re z > 0. An analogous theorem 
is valid for the unit disk. 

An important problem in the theory of 
conformal mapping is to find some condition 


for a mapping w — f(z) of the unit disk (or half- 


plane) G onto a simply connected domain B to 
have a nonzero and finite angular derivative at 
a boundary point ze, that is, the condition for 
conformality at the boundary point. Cara- 
théodory showed that a sufftcient condition is 
the existence of two circles that are mutually 
inscribed and circumscribed at the boundary 
point wọ = f(zo) Æ oo of B and that lie inside 
and outside of B, respectively. Ahlfors later 
established a necessary and sufftcient condi- 
tion for the existence of the angular derivative 
by making use of his tdistortion theorem for a 
strip domain. The angular derivative was 

used by Wolff in his research on the iteration 
of conformal mappings. 
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A. General Remarks 


A brancbing process is a mathematical model 
for random motion of a family of particles 
each of which is in an isolated process of multi- 
plication and death. Examples of such ran- 
dom motions are population growth, miosis of 
genes, growth of the numbers of neutrons in an 
atomic chain reactton, and cascade showers of 
cosmic rays. The simplest and most fundamen- 
ta] branching process is discussed in Section 
B. For a historical introduction to the study 

of branching processes — Kendall [2] and 
Harris [3]. 


B. Galton-Watson Processes 


Although there is a similar process with con- 
tinuous parameter t€ R, we consider here 
only the case of a discrete time parameter 
(i = 0, 1,2,. ). Suppose that we are given a 
family of particles of the same kind. Each 
member of the family splits into several par- 
ticles according to a given probability law 
independently of the other members and its 
own past history. Let Z, be the number of 
particles of the family at a moment (or gener- 
ation) n; then {Z,} gives rise to a "Markov 
chain. This is called the Galton-Watson pro- 
cess. A. precise mathematical description of the 
process is as follows: A Galton-Watson pro- 
cess is a Markov chain (Z, n=0, 1, .. . } on the 
nonnegative integers with ftransition proba- 
bility defined in terms of a given probability 
distribution (p, k=0, 1, }, p, 20. X op, = L 
by 

; 4 fpř if ist, jž0, 
py= PUZa EE if i=0, j20, 


ij 
where the probability distribution { nl k= 

1, . } denotes the i-fold tconvolution of 
the probability distribution { p, K=0,1,...} 
and ó; is the fKronecker delta. Then p, is 
interpreted as the probability that an object 
existing in the nth generation has k children in 
the (n + 1)th generation. We assume that Zo E 
1 a.s. for the rest of this section. An impor- 
tant tool in the analysis of the process is a 
tgenerating function f(s) = Zj2o pel, p. 
Denote the generating function Y? 9 ise - 
k)s* (S< 1) of Z, by f, (s). Then we have fo(s)= 
S, fi (s) - f(s) fois) =f) (5j — 0, 1,2, .), 
and the texpectation of Z, is given by E(Z,) = 
m", where m= f ( 1 - E (Z,) is the expecta- 
tion of Z,. A Galton-Watson process is said 
to be subcritical, critical, or supercritical if 
m«l1,-l,or» 1, respectively. If we have Z,7 


0 for some n, then Z,,,= Z,,,7..- 0. The 
probability q = P (lim, Z, = 0) is called the 
extinction probability. The case fj(s) = s is 
excluded in the following. The extinction pro- 
bability q of the process {Z,} is the smallest 
nonnegative solution of the equation s = f(s). 
Itis lifm<1 and <1 if m» 1. Moreover, 
P(lim, Z, = oo) 2 1 —q when m» 1, and hence 
the process {Z, is transient. 

If we put W, = Z,/m" when m < oo, {W,} 
gives rise to a *martingale, and the limit W= 
lim, ,,, W, exists with probability 1. If m> 
1, then the tmoment-generating function 
q(s)  E(exp( —sW)) of W satisfies the Königs- 
Schröder equation (ms) = f(q(s)), Re s > 0. If 
m <1, then P(Z, = 0 for some n) = 1. The fol- 
lowing theorem on the conditional distribu- 
tion of Z,, given that Z, z 0, was first proved 
by Yaglom [6] under moment restrictions. If 
m «1, there exist b, = lim, P(Z, - k Z, £0) k = 
1,2 ,.... Then (b,|k2 1,2 ,... Jisaprobability 
distribution and its generating function g(s)= 
Y, b,s*, |s| < 1, is the unique solution of 
the equation g( f(s))=mg(s)+ / —m, |s| <1, 
among generating functions vanishing at 0. 
Furthermore, when m < 1, zc jb; « œ if and 
only if 2/3; p(jlogj) < o (ie. E[Z, log Z,] < 
oo). 

If m= 1 and c? = variance of Z, « oo, then 
lim, P[Z,/n> u| Z, £0] =expl —2u/o7], uz: 0. 
If c? = oo, then this result is still correct with 
the interpretation that the limit in the left- 
hand side equals 1. 

For further details regarding the theory of 
Galton-Watson processes — [3, S]. 


C. Multitype Galton-Watson Processes 


These are generalizations of the Galton- 
Watson process, involving k types of particles 
(k > 2), say, Tj, Ta, , T,. Let p'(r,, T2., rj) be 
the probability that a particle of type T, splits 
into 7, particles of type T, (m = 1,2,. . , k), and 
set Ter, S2,- S)-7 X, p'(ri rs. Si Sh? 

. S. The number of particles Z, =(Z}, Z2, 
...,Z,) at a moment n gives rise to a (Markov 
chain over Z* , and its transition probability 
is determined by p'(r,, r2, . r,) as for Galton- 
Watson processes. Here Z^ denotes the space 
of all k-dimensional tlattice points whose 
components are nonnegative integers. We 
now define a mathematical model of the 
process mentioned above. A Markov chain 
{Z,,|n=0,1,...} on Z^ is called a multi (k)- 
type Galton-Watson process if its transition 
probability is given by 


Pj = P[Z,4,=j Z,= i] = coefficient of 
k 


| | s in 


=1 


Z> 


(f!(5,, 55, 


MET 


~ 
H 
— 
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for i - (1,15, .. cn i) 9j GJ» ee je Z^ n= 
0, 1,... and s—(s,,55,..., $)€ D,, where D, = 
(s (51.55, S S)eR* [Is <1, [= 1,2 ! eng k} 
When mj-—óf'(1,. 1)/0s;« oo (i,j= 1,2, 
. , k), the *conditional expectation of Z,,,., 
given Zm is given as E[Zm+n Z,]- Z,, M", 
where M is the matrix (mj). From now on we 
assume that m, < oo (i,j = 1,2,., k). A multi- 
type Galton-Watson process {Z, n = 0, 1,. } 
is said to be singular if the generating func- 
tions Tex, es Bh [2 1,2. ,..., karealllinear 
in 5,, $5,, S With no constant terms. We 
assume nonsingularity throughout. If there is 
a positive integer N such that every compo- 
nent of M is positive, the process Z,is said 
to be positively regular. In this case, M hasa 
positive teigenvalue 4 that is simple, and 4> 
|u| for all other eigenvalues u. The eigenvalue 
A plays the role of min the case of the Galton- 
Watson process. For every i, IX i <k, set qt = 
P[Z, —0 for some n|Z, — ej], where e; — (e, ei, 
EC ed = dn, Then q' is called the extinction 
probability of the process (Z, n = 0, 1, .} 
starting with a single particle of type T,; let 
q — (q!, q?,.. . q*). Suppose that the process 
{Z, n=0, 1,... } is positively regular and not 
singular. Then q=(1,1,.... D) if A< 1, and 0< 
q'<1(i=1,2,...,k) f 4» 1, and q satisfies the 
equation q=(f'(q), f7(q), . Talk Further- 
more, the process {Z, n= 0,1,. . . yis ttran- 
sient, Le, P[Z, = j inlinitely often] = 0 for any 
j #0, je Z*. Moreover, if 4» 1, then W, = 
Z,,/A" converges with probability 1, and 
lim ZA -vW as. 
where v denotes the positive left teigenvector 
v=(v,,., 4) for A of the matrix M, and W is 
a nonnegative *random variable. P[ W> 0] > 
O holds if and only if 


E[Zi)logZt)] «oo forall 1 &ij«k. (1) 


Here Z{) denotes the number of type T; par- 
ticles in the first generation for a process with 
Zo = e, 1&i&k. If condition (1) holds, the 
*moment-generating functions of W, @,(a) = 
E|exp[ ^ 4W ]| Zo - ej; i=1,2,...,k, a>0, 
satisfy dal — f (p, (2/4), 9o(a/4), . , 9.0/2). 
Ixixk. 

When A « 1, a theorem analogous to Ya- 
glom’s holds. For further information = Joffe 
and Spitzer (J. Math. Anal. Appl. 19 (1967)) 
and Athreya and Ney [S]. 


D. Markov Branching Processes 


The branching processes dealt with in Sections 
B and C are limited in the sense that gener- 
ation times are lixed. We now formulate a 
continuous-time version of branching pro- 
cesses. The treatment in this section is limited 
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to Markov processes that are extensions to 
continuous time of the Galton-Watson pro- 
cess. A Markov process {X,, P;} on Z}, is 
called a Markov branching process if the tran- 
sition probability P(t, i,j)= P,[X,=j]@jeZ\, 
t 2 0) satislies 


È P(ti, Ae Pt, uw) 
j-0 j=0 


for all ie Zl, and |s| 1. Then there exist 

a positive constant a and a probability 

distribution ( p; K= 0,2,3,.} such that 

Im, al 1 — P(t, i. i))/t = iaif ie Z}, and 

lim, o PU. i, j)/t =iap,_;4,if j2i— 1, jt i. The 

+Kolmogorov equations are 

d j*1 

— P(t, ij)= —jaP(t,i,j)+a In, P(t, i,l) 

di J jaP(t, 5j ON Pj-i+1 
(‘forward equation) 


and 


d 
dp b ij)- — iaP(t, ij) 


oo 


tia Y 


l-i-l.izi 


Dn Plt, Lj) 


(tbackward equation). 


Then a and { p, k=0,2,3,. } are interpreted 
as follows. An object existing at f has a proba- 
bility adt of dying in the time interval [t,t+ «o 
of length dt. If it dies at any time t, the proba- 
bilities are po, P2, p3, . that it is replaced by 

0, 2, 3,. objects. When ZZ, P(t, i, K) < 1, the 
number of particles attains +20 in a finite time 
interval with positive probability. In order that 
Eo P(t, i, K) = 1 for allie Z}, it is necessary 
and sufticient that for each 1 £7 0, f 1 (flu) 
—u) ! du — oo, where Dat 3o. ja 1 pjs for 
Is|& 1. 

Most of the theory of the Galton-Watson 
process carries over to the continuous case. In 
particular, concerning the flimit distribution of 
X,, more precise results have been obtained. 
(For detailed discussion of Markov branching 
processes — [3,8].) 

Some work has also been done on the 
asymptotic behavior for temporally inhomo- 
geneous cases of multitype Markov branching 
processes. 


E. Branching Markov Processes 


We now give a systematic treatment of gen- 
eral branching processes in which an object is 
characterized by a parameter x in a *compact 
*Hausdorff space $ with a countable open 
thase. The results are formulated in terms of 
strong Markov processes. Set S? = (0), where 
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Q is an extra point. For every positive integer 
n, let "=S x Sx... x S/ », where ~ isthe 


n 

tequivalence relation given by the tpermu- 
tations of coordinates. S” is compact with 
respect to the *quotient topology, and its 
ttopological direct sum S = 37, S" is locally 
compact. Let $-SU {A} be the one-point 
tcompactification of S. A point x eS is de- 
noted by x 2 [x,, x5, Kal if x is the tequiv- 
alence class containing (x4, X25.. , x, e$"). 

x = [x] is denoted simply by x. Set B(S) = 

Lf 1f a bounded Borel measurable function on 
S}, B*(S)- {FEBO |I fll «0. C()={flfa 
bounded continuous function on S}, and 
C*(S) = C(S) N B*(S). B(S), C(S), . are defined 
similarly. For f eB*(S), define fe B(S) by f(x) 
Lifx-à, Tli feo) if x= [x xax ]E 
8", and = 0 if x =A. A ‘strong Markov pro- 
cess X = {X,, P,} on is called a branching 
Markov process if its semigroup {T,} (- 261 
Markov Processes) satisfies 


^ ~~ GG 
T,f(x) -(T,f)s(x) for every xeS and 
fe B*(S), 


where for oe B(S), de is the restriction of 

gon S [10-12]. We set(i) Z,=n if X,e S", 
n=0,1,...,00, where §” = {A}, and (ii) T= 

inf{ t Z, * Zo}. Z, and q are called the number 
of particles and the first splitting time, respec- 
tively. Let X = {x,, P,] be a strong Markov 
process on S. We assume that the semigroup 
{H,} of X is a strongly continuous semigroup 
on C(S). Let o be a nonnegative continuous 
additive functional of X. Let p(X), n =0, 2, 
3,. be a sequence of nonnegative functions 
in B(S) such that E£ 9,4, P(X) 21. Con- 
sider a sequence of stochastic kernels z,(x, dy), 
n-—0,2,3,.... on S x S", i.e., for lixed A € B(S"), 
T,(x, A) is a measurable function of x, and 

for lixed x it is a probability on (S", 8(S")), 
where B(S") is the topological c-field on S". 
Set n(x, D) = Ln=o,n# pO) (x, DN S”) for 
De®B(S), xeS, where S(S) is the topological g- 
field on S, Then there exists a unique branch- 
ing Markov process X = {X,, P,} on $ satisfy- 
ing the following: (1) The Markov process {X,, 
t « x, P,j on S obtained from X by shorten- 
ing its tlifetime is equivalent to the Markov 
process obtained by ‘killing X at a rate dg, 

(- 261 Markov Processes). (ii) For any DE 
S(S) and 4-0, E,[exp[ - 4v]; X;eD|X, ]- 
E,[exp[ —4At]|X,_ Te ., D) as. on {t< œ} 
for every x ES. (iii) With probability one, X, = 
A for allt > lim,;..t,, where t, denotes the nth 
splitting time [10]. This process X = {X,, P,} is 
referred to as the {x,, (y, ™}-branching Markov 
process. 


Example 1. Let $—1a,,45,...,a,j; then S can 
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be identified with Z* and $ with ZF = Z^ U 

{ œ}. Therefore a branching Markov process X 
on § is a Markov process on 2k such that its 

transition probability P(t, i, j), i, je Z^ satisfies 


t, LU, E i) St se. si 


k 
SS I] b» P(t, Er (ji Jas. JJ) 
i=1 (fe eet 
. D D d 
x dei éi 


for i=(i,iz, este (ei, 
-SE D,. 


Di, o. o)=1 
$—(5,,82,-. 


Then the process X is called a multi (Q-type 
Markov  brancbing process. 

Example 2. Let $ = [0, co], k(x) be a non- 
negative locally integrable function on [0, 00), 
and p,(x), n = 0, 1, , be a sequence of non- 
negative measurable functions on [0, oo) such 
that 3 p,(x)= 1 and p, (x) 0. We extend 
k(x) and p,(x) (n Z2) as functions on [0, co] by 
setting them {0 0 at oo, and set p,(oo) = 1. 
Define a stochastic kernel (x, dy) on $ x § by 


> Diät. o, ... 3) (DN S”) 
n=0 Sv 
= n 
n(x, D) = if x ell, 00), 
dite, eq) (DNS?) if x= oo. 


Let X = Ixa Pl be the uniform motion on § 
with the PUE (H,j such that for f e C(S) 
H,f(x) = fox t if xe[0, oo) and = f(ao) if 

SE Set p, = Ge ,) ds. Then we have a 
ix, oo. nj- peus RA process, and we 
call it an age-dependent brancbing process. 
We now discuss some fundamental prop- 
erties of {x,, o, n}-branching Markov pro- 
cesses. For the sake of simplicity we assume 
the following: (i) 9, — fo k(x,)ds,keC*(S)= 
(fe C(S)) f >0}, (ii) p,(x)e C* (S) and 


TAX, Diät, vil 
ee ped 


for De B(S"). Then {T,} is a strongly con- 
tinuous semigroup on C,(S) = {fe CIS) 

lim, ,4 f(x) 20]. Let A be the tinfinitesimal 
generator of {H,} (- 261 Markov Processes) 
and YA) be the domain of A. If fe 2(A)N 
C*(S), then u(t, x) - (T, f) (x)e P(A), and it 
satisfies ĝu/ôt = Au - k(F(;u) u),u(0+,x)= 
f(x), where F(x; g)= [aó(y)n(x, dy) [10]. 

Let .@ be the space of all nonnegative 
"Radon measures on § and Z, be the sub- 
space of .# consisting of al] probability Radon 
measures on $ endowed with the topology of 
tweak convergence. Set # = M U {6}, where ó 
is an extra point and = [0, oo] x Mg. Define 
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a mapping o". 3(4, A) (A, Ag)e Af by 
p(A, Ap) = Ady if À < oo and = dif 4 = 00, and 
define the topology of .@ as the strongest of all 
the topologies rendering p continuous. Set 
C**(S)={feC(S)|f>0}, and for feC**(S), 
detine a function ®,(4) on æ by ®,{4) = 
Gg (A, d if Ae M and =0 if 420, where 
) = fs f(x)A(dx). Consider a mapping y : $5 
p a w(x)=0 if x =6,= 2-19 px if 
x= E X3, s X JES", and = @ ifx=A. Let 
M, be the subspace of d consisting of a11 non- 
negative integer-valued Radon measures on S, 
i.e., &,={W(x)|xeS}. We denote by 2 the 
space of all right-continuous mappings v: 
IO. 20) M, whose discontinuities are at most 
of the first kind and such that v, =ô for t > 
$ if v = 6. Consider a branching Markov 
process ( X,, P,}, and let Pi be the tprob- 
ability law on Z of the stochastic process 
{v,=w(X,;0< t< œ} with X,=x, where Uz 
w(x). Then the branching property can be 
rewritten as follows: For every tz0, feC**(S) 
and Uu, i5 € Mp, E, +a [expL 7 (v. 119 
E, [expl — (v, f£) ] E, LexpE— (v 1] (> 
Silverstein, Z. Wahrscheinlichkeitstheorie und 
Verw. Gebiete, 9 (1968); for a treatment of 
branching processes in the framework of 
‘martingale problems — Holley and Stroock, 
Publ. Res. Inst. Math. Sci., 14 (1978)). 

There are analogous processes which pre- 
serve the basic features of the branching 
property. A Markov process { p, P,} on M 
with ó as a *trap is called a continuous-state 
branching process if jt satisties the property 
that for every t 20, f eC ' (S), and pr, j, € M, 
E, enl eXP E Qu. f)]] 7 E, [expE— Qu. ZU 
E, [exp[ — (u, f)]]. This concept was intro- 
duced by Jiiina [ 15] for some special cases. 
For further information 2 Jifina (Third 
Prague Conf., 1964), Lamperti (Bull. Amer. 
Math. Soc., 73 (1967)), Watanabe (J, Math. 
Kyoto Univ., 8 (1968)), and Fujimagari and 
Motoo (Kédai Math. Sem. Rep., 23 (1971)). 
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A. General Remarks 


R. Brown, an English botanist, observed in 
1827 that the minute particles comprising the 
pollen of plants, when suspended in water, 
exhibit peculiarly erratic movements [ 1]. The 
physical explanation of this phenomenon is 
that haphazard impulses are given to the sus- 
pended particles by collisions with molecules 
of the fluid. Let X(t) be the x-coordinate of a 
particle at time t, Then X(t) is treated as a 
trandom variable, and the distribution of X(t) 
-X(s) is a tnormal distribution N (0, D|t —s|), 
with tmean 0 and *tvariance D t-s |, where D 
is a positive constant. To be more exact, such 
a family of random variables {X(t)} is now 
considered as the family of random variables 
determining a tstochastic process. Various 
aspects of the theory were analyzed by A. 
Einstein [2], L. Bachelier, N. Wiener, P. Lévy, 
and others. 
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B. Wiener Processes 


Let T be the real line Rt or a subinterval. A 
stochastic process {X(t)},,, defined on a 
*probability space (Q, B, P) is called a Wiener 
process on R if it satisfies the following three 
conditions: (1) X (t, o) eR" (te T, oe, (2) The 
R"-valued random variables X(t,), X(t)— 
X(t;-1), j= 2,3,, n, are tindependent for 
any f «t; «€... «t, tie T (j=1,2,...,n), where 
n is an arbitrary positive integer. (3) If X (t) is 
the ith component of the vector X(t), then the 
IK (1 & i€ d) are independent as sto- 
chastic processes and every increment X (t) 

— X (s) is normally distributed with mean 0 
and variance (—s| A Wiener process is also 
called a Brownian motion. A Wiener process 
is a temporary homogeneous additive pro- 
cess. A fseparable Wiener process has continu- 
ous paths with probability 1. Conversely, if 
{X(t)},e7 is a temporary homogeneous addi- 
tive process on R! whose tsample function is 
continuous with probability 1 and the incre- 
ment X(t) = X(s) has mean O and variance 

t —s|, then it is a Wiener process (- 407 Sto- 
chastic Processes, 406 Stochastic Differential 
Equations). Let 1255,49 be an tincreasing 
family of o-subfields of 23. We assume that 
EA is right continuous. A d-dimensional 
continuous process X=(X(t)),,,,, is called a 
d-dimensional 1.2; )-Brownian motion if it is 
t{ F -adapted and satisfies 


ElexpLi<é, X (0 — X(35]] F] 
=exp[—(t—s)|¢|?/2] 


a.s. for every Zebi, O<s<t, 


where |é| denotes the tnorm of €€R?[9]. Then 
X satisfies conditions (1)-(3) mentioned above, 
and hence it is a Brownian motion on R’ (- 
406 Stochastic Differential Equations). 

Let {X,(@)} (k 20, 1, . ..) be a sequence of 
independent random variables defined on a 
probability space (Q, B, P) such that each X, 
has the normal distribution N (0, 1) with mean 
0 and variance 1. Then the series 


t 2 2 sin nt 
eso [25 xo SCH 


converges uniformly in t € T with probability 1, 
and its limit, denoted by X (t, W), is a Wiener 
process [3, 10]. 

Let W° = C( [0, oo) 5 R’) be the space of all 
continuous functions w: [0, oo) 5 Ri endowed 
with the topology of uniform convergence on 
finite intervals and 8( W^) be the ttopological 
o-field. Let X2(X(0),,,, be a Brownian 
motion on Bi and H be the probability law of 
X(0). The probability law P* of the Brownian 
motion X on ( W^, 8( W4)) is called the d- 





te[0,x]z T, 
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dimensional Wiener measure with the initial 
distribution 4 [9, 11, 12]. For a probability p 
on (R?, *B(R?) the d-dimensional Wiener mea- 
sure P, with the initial distribution I isa prob- 
ability on ( W^, ®( W^)) characterized by the 
property that for every 0 = tg « t, « « t, and 


Ae BUBI j= 1,2, uM, 
P,[w(ti)e 41, w(t5)e A5, ....w(t,)e A,] 


n 1 4 
“le lo 
RM 
xexp| —————— | (dx) dx, ...dx,, 
d HOER oJ4X, 


where 8(R‘) denotes the topological g-field on 
Rd. 


C. Brownian Motion as a Diffusion Process 


In terms of the general framework of *Markov 
processes, Brownian motion is a typical exam- 
ple of a diffusion process (— 115 Diffusion Pro- 
cesses). Let X = (X,(w), Hi Pj be a continuous 
Markov process on R* (—- 26 1 Markov Pro- 
cesses) with the ftransition probability 


d 1 
(2nt)"Pe x { ETA 


Be URL 


P(t, x, B)= | 


B 
t>0, xeR4, 


For each xE R^, the process {X,(w), Bj isa 
Wiener process in the sense mentioned above. 
The Markov process X, which is a collection 
of Wiener processes {X,, P, starting at x, is 
said to be a d-dimensional Brownian motion. A 
d-dimensional Brownian motion possesses the 
‘strong Markov property. Let (5 be the tgen- 
erator of the tsemigroup T, corresponding to 
X. A bounded uniformly continuous function 
f defined on R? belongs to the domain of (5 if 
its partial derivatives ĝf/ôx; and 0? f/0x;Ox;, LI 
= ],2,. d, are bounded and uniformly con- 
tinuous. For such a function f we have (f(x) 
=(1/2)Af(x), where A is the fLaplacian[8, 12]. 


D. Brownian Motions and Potentials 


For g > 0, the function GJ x) detined by 


Eit -an ES 2 
oua-Al e “(2nt) ol F ix] Ja 
is said to be the cc-order Green's function. 
Since Brownian motion is fnonrecurrent for 
d > 3, the limit Go ,(x)=lim, | G,(x), x e R, 
exists for > 3, and G,+(x) is equal to K,(x) = 
(F(d/2 -1)/4n"?)|x ~**?, which is the kernel 
for the *Newtonian potentials. Brownian 
motion is trecurrent when d < 2 and G,+(x) = 
+0, xe R2. In this case, K,(x) is defined by 
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Ko(x) =lim, )9(G,(x) = G,(xo)), and Ko(x) = 
(1/2n)log(1/|x|), when d=2 and |x,|- 1. 

This is the kernel for the tlogarithmic poten- 
tials. When d = 1 and x, = 0, K,(x) = —|x|/2. 
Using this relationship, we can express many 
concepts of classical potential theory in an 
elegant form in probability language. Let 

X = {X(t), P,) be the d-dimensional Brown- 
ian motion. Given a set A, set c4 =inf{t {> 

0, X(t)e A}, where the infimum over the 
empty set is understood to be +00. Then 0, is 
called the thitting time of X for the set A (— 
261 Markov Processes). For a Green domain 
Oe, a domain which is a Green space) D in 
R’ (dz 2), set 2g”(t, x, y)dy =P (X (t)edy, o; 
t), X, ye D. Then the right-hand side of this 
equation is the *transition probability of the 
Brownian motion on D with the tabsorbing 
barrier QD, Then GP(x, y) =|? g(t, x, y)dt is 
*Green's function of D. If B is a compact sub- 
set of a Green domain D or an open subset 
with compact closure B c D, then the hitting 
probability p(x) = P,(ag < dap) is the equilib- 
rium potential of B relative to D. 

Suppose A is an tanalytic subset of Ri. 
*Blumenthal’s O-1 law implies that Bio, =0) 
= 1 or 0. The point x is said to be regular for 
A if this probability is 1 and irregular for A 
otherwise. Let B be a compact subset in Ri 
(d 2 2). Then *Wiener's test (— 120 Dirichlet 
Problem) states that x is regular or irregular 
for B according as the following series diverge 
Or converge: 


where C(B,) is the *Newtonian capacity (the 
logarithmic capacity relative to a bounded 
domain when d =2) of the set B, 2 ly 2 ^* < 
ly-x| «2 4 1B [12]. Suppose D is a bounded 
domain in R (d 2 2). A point x e QD is regular 
or irregular for R^ — D according as x is regu- 
lar or irregular in the sense of the *Dirichlet 
problem for D. Given a continuous boundary 
function f on QD, u(x) —E,( f(X (o55)) is the 
solution of the generalized Dirichlet problem 
for D. Given x€D, the distribution h(x, B)= 
P.(X(o,;5)€ B) (Be98(0D)) used in the solution 
u(x) = fap (y) R(x, dy) of the generalized Diri- 
chlet problem is the tharmonic measure of OD 
as viewed from x. 


E. 1-Dimensional Brownian Motion 


In his monograph [6], P. Lévy gave a pro- 
found description of the fine structure of the 
individual 1-dimensional Brownian path 


45 F 
Brownian Motion 


(X (t, oi, o. Let us set 
c, = the hitting time to the point a in R’, 


m(t,@)= min X(s,o), 


go &s&t 


M(t,@)= max 


Gg SSK1 


(s, ol, 


Yo(t, ol i | X (t, c), 


Y, (t, o) Ze? w), Fdo (0) 
i(f, 0) = M(t,@)—X(t,w), tzog(o), 
and 
DN t< Go (e), 
Y,(t, o) = E o)-mí(t,o), t2oo(w). 


Then we have (1) Po(M(t)>a)=2P)(M(t)> 

a, X (t) - a) - 2P(M(t)» a, X(t) «a)— 
P,(|X(t)| >a) (reflection principle of D.Andreé). 
(2) The stochastic process {a,, 0< a< oo, Po} 
is a tone-sided stable process with exponent 
1/2, that is, it is additive and homogeneous 
with the law 


Polop — o, & t) = Pol-a & t) 
' b-a 
=uo /2ns? 


eg 5-225 ds, 


O<a<b, t20. 


(3) Let o ^! (t, c) be the right continuous in- 
verse function of 


e(t, ul Xto, XX (S, c9) ds, 


0 


where Xto, «)(*) is the *indicator function of the 
interval [0, oo). Set Y3(t,@)=X(p (t, e), œ). 
Then the four processes { Y,(t, w), O<t «oo, P,j 
(xe[0, œ), 0<i<3) on [0, oo) have the same 
probability law. Each of them is a diffusion 
process with transition probability 


P(t, x, B) 


1 
Zeg sc e ges y? 20 dy, 
| 


Bett, eh t>0, xe[0, 0), 


and is said to be a Brownian motion on [0, oo) 
with a freflecting barrier at the origin. (4) As a 
consequence of (3), if X(0, œ) = 0 a.s., then, for 

fxed t, M(t), -m(t), and Y(t) (0 &ix3) have a 
common distribution. For example, 


Po(M(t) 2 a) = Po(c, < t) = 2Po(X (t) a) 


S 2 
Z Mr GX Pt dx 


a» 0, 
nt |, 


P (X (t)eda, M (t)edb) 


S le 79? dadb, 0«a«b. 
T 
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(5) The diffusion process {X (t, w), 0 &t« oo, P.) 
(x € (0, 00)) obtained from a 1-dimensional 
Brownian motion by shortening its flifetime is 
called a Brownian motion on (0, oo) with an 
'absorbing barrier at the origin, and its tran- 
sition probability is given by 


P(t, x, B) 


(e "e" X dent kexapën dy, 





1 
JL 
Be B((0, ch 


The arcsin law is valid for many functionals 
of 1-dimensional Brownian motion. For 
example, 


2 0 
du 4o. rts ode Je arcsin [^ 
0 n t 


0«0 «xt, 


t»0, xe(0, 00). 


Deet, 


Po(t,(@) < s) =(2/n)arcsin at ; 


where 1,(o)=sup{s| X(s, o) 2 0, 0« s« t]. 
The visiting set Z(c)- (IX, iss Di of a 
Brownian path is a ttotally disconnected 
set. Its *Lebesgue measure is 0, and the 
tHausdorff-Besikovich dimension number of 
Z(o)is 1/2. 
Consider a -dimensional Wiener process 
{X(t)}o<;<. Starting at the origin. Let us set 


0 ied 
SEN t0, 
and 
B,(t)=cX(t/c?), t20, (c>0). 


Then the stochastic processes {B,(t)}o<:<. 
and [B,(t)] o <j<q are likewise 1-dimensional 
Wiener processes starting at the origin. Hence 
the properties of the Wiener process starting at 
the origin in a neighborhood of t = 0 (t = œ) 
can be obtained from those in a neighborhood 
of t= oo (t=0). 


F. d-Dimensional Brownian Motion 


Almost all paths of d-dimensional Brown- 
ian motion are continuous but are not of 
tbounded variation on any finite interval. 
Accordingly, they cannot have lengths. A 
positive, continuous, increasing function o 
defined on (to, oo) with tọ > 0 is said to belong 
to tbe upper (lower) class witb respect to local 
continuity if P,((inf{t || X(£,@)|> Ji (t/t), t» 
0) > 0) = 1 (0) Kolmogorov's test states that 
o belongs to the upper class or to the lower 
class with respect to local continuity according 
as 

9 

tH one 2 dr 


to 
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converges or diverges. For example, 
p(t)=(2logi, t - (d -2)logit + 2logayt +. 
*-2log,.,, t (2 Ó)log,, t)? 


belongs to the upper or lower class with re- 
spect to local continuity according as 6 > 0 or 
ó x0, where log, t =loglog,,_;)¢ and log,,,t= 
log t. The law of the iterated logarithm for ]- 
dimensional Brownian motion, 


( | X (t, 00) — X (s, @)| 
P| imsup ————————————- =1, 
ros ,/2|t—s|loglog 1/|t—s| 


is a special case of this example. For the case d 
= 1, consider the space-time Brownian motion 
{(—t, X(t), Po}. Set D, = {(s, X)| —1/to <s <0, 
| —5 q( —1/s) & x « oo). Then O=(0,0) is a 
regular point of D, or an irregular point of D, 
for the space-time Brownian motion accord- 
ing as 9 belongs to the lower or upper class 
with respect to local continuity. (- 261 Mar- 
kov Processes). Thus Kolmogorov's test is the 
Wiener test for space-time Brownian motion. 
Let y be a positive, continuous decreasing 
function defined on To, oo) with tọ > 0. Then 


d IX (to) « t Va/.cso) »0) 


=0 or 1. 


We have 
T. 
| ,UOY at mot <% when d > 3, 
lo 
and 
a dt 
———————-— 00 Of « 00 when d = 2. 
I (og gi 


To describe uniform continuity of a path on 
the interval [0, 1], take a positive, continuous 
increasing function y defined on Tra, oo) with 
to 7 0 and set g(t) = JE V(1/t). Then y is said 
to helong to the upper class with respect to 
uniform continuity if almost all paths X(t, œ) 
(0x t € 1) satisfy the *Lipschitz condition rela- 
tive to Q, that is, for almost all œ there exists 
an E(w) > 0 such that 0 < t—s|«&(o) implies 
|X (t, )) X (s, a) « o |t — s). And y is said to 
belong to the lower class with respect to uni- 
form continuity if almost all paths X(t, œ) 

(0 €t € 1) do not satisfy the Lipschitz condition 
relative to y. Then y belongs to the upper or 
lower class with respect to uniform continuity 
according as 


DO 
| dir "Ze von dr 
t 


o 


converges or diverges [ 16]. For example, 
v(t)  Qlogt «(d -4)logo t +21088t +. 
*2loga,t-- (2+ d)log,, t)? 
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belongs to the upper or lower class with re- 
spect to uniform continuity according as § > 0 
or § <0. The following theorem on the uni- 
form continuity of 1-dimensional Brownian 
motion is a special case of this criterion: 


B, limsup | X (t,o) — X (s, ol = | a]. 


Kat, 2I -sllog LAC-s| 


Now we state some other properties of 
Brownian paths. Let A be a set of zero *outer 
capacity in R° (d> 2). Then BON (t)e A for 
some t > 0) = 0 for any x e Ri Let A be a plane 
set with positive tinner capacity. Then P,(X (t) 
€ A for intinitely many t larger than any given 
§ > 0) = 1 for anyx c R?. With probability 1, 
the Lebesgue measure of the set (X (t œ) |0 < 
t « oo) in Riis zero for dz 2. With proba- 
bility 1 this set is everywhere dense in R? 
when d = 2 and nowhere dense in R^ when 
d>3[ 13,15, 17]. Almost all 2-dimensional 
Brownian paths have k-fold multiple points 
for any integer K> 2. In the 3-dimensional 
case, almost all paths have "double points but 
cannot have any triple point. In the d( >4)- 
dimensional case, almost al] paths have no 
double point [18—20]. 


G. Itó's Formula and Brownian Local Time 


The following formula of Itó [21] is of funda- 
mental importance in the theory of stochastic 
processes. 

Itó's formula. Let X={X(t) AXX,(t), X,(t), 
KAL Pj be a d-dimensional Brownian 
motion and f(x) f(x,. x2... , xj) bea C?- 
function delined on Ri Let us set <p(t) = 
f(X(0)— AX (0). Then Jett, a is a continu- 
ous tadditive functional of the Brownian mo- 
tion X (- 261 Markov Processes), and we 
have 


t 


d 
foc racon- | 


0 


Q 
Laun 
Xi 


«5 | 'anocenas tz 0, 


where the first term in the right-hand side is a 
stochastic integral with respect to the Brown- 
ian motion X [22] (- 406 Stochastic Differ- 
ential Equations). In particular, if f is a C?- 
function defined on R4, Itó's formula can be 
rewritten as 


where the right-hand side is a stochastic inte- 
gral of the *Stratonovich type with respect to 
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the Brownian motion X [9] (- 406 Stochastic 
Differential Equations). 

Consider a 1-dimensional Brownian motion 
X = (X(t), P,). By the local time or the sojourn 
time density of X we mean a family of non- 
negative random variables Jett, x)} (te [0, oo), 
xéR’) such that, with probability 1, the fol- 
lowing holds: 

(a) the mapping [0, oo) x R! a(t; x) 5 o(t, x)e R! 
is continuous, 
(b) for every Borel subset A of Ri and f >0, 


I x (X (s))ds= d p(t, x)dx. 


The notion of the local time of Brownian 
motion was first introduced by Lévy [6]. The 
family of random variables { g(t, x)} (t > 0, 

XE R!) defmed by 


(t,x) - (X (t) —3)* —(X(0)—x)* 
l 
- l Xix, XX (5) dX (s) 


satisties properties (a) and (b) mentioned above 
and hence it is a local time [9, 22]. Here a+ is 
the bigger of a and O. It is clear that the local 
time g(t, x) of X is given by 
1 t 
e(t, x) = lim Ae | X(x~e,x+0)(X (s)) ds 
for every xER',t>0. 

Furthermore, we have 
lott, b) — p(t, a)| 

ólog1/ó 


« 2, / max oral l, 


P ( limsup 


b-a=6|0,a<b 


aeR! 


and 


t,0)— lt, 
n (imsuol )- ett.0) 


— —« 2 «tt, dÉ l. 
510 dloglog 1/6 


Lévy [6] studied the fine structure of the local 
time. Define the visiting sets Z += (t| Yo(t, c») 
=0} and Z^ ={t| Y,(t,@)=0}, where Y,(t, c») 
and Y, (t, œ) are the stochastic processes de- 
fined in Section E above. Then we have 


: TE 
Po (um E. X [the number of flat stretches of 
M(s,o) (0€sxt) of length 2c] = M(t, c), 
120) =I. 


Since the two diffusion processes X * = 
{Y)(t),0<t< oo, land X ={Y,(t),0<t< 
oo, P,} define the same probability law on 

W( [0, 00)), there exists a functional q* (t, œ) of 
xt corresponding to M(t, w) of X ~. Here, 
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W( [0, oc)) denotes the space of all continuous 
functions w: [0, oo) [0, oc). The flat stretches 
of the graph of o *(t, œ) are the open intervals 
Z,(nz1)such that X,., 2, —[0,00)— Z+ 
Then we have 


P; (ip E [the number of intervals 
D 


Z c [0,t) of length »d-e't.o.20) Ed. 


Furthermore, we have 


P, (im fe x [the total length of the inter- 
l0 N 2e 


vals Z, c [0, t) of length « c] 2 9 ' (t, w), 


i20)-: 


and 


n (impo | Su dE ds 


0 


=9"(t0)t20)= 1 


Let d,(t, w) be the number of times that the 
reflecting Brownian path Y,(s, ol crosses down 
from £> 0 to 0 before time f, Then 


Po (im ed, (t, q)- CND w), E = l. 


H. Flows and Random Distributions 


The flow derived from the 1-dimensional 
Wiener process IX laten 18 'Kolmogorov's 
flow. It has tmixing properties of all orders 
and is *ergodic (- 136 Ergodic Theory, 39.5 
Stationary Processes). The fstationary process 
with independent values at every point corre- 
sponding to the tcharacteristic functional 


exp(—3| p(t)? i) 


on the *Schwartz space Y defines the same 
probability law with the stationary process 
obtained by differentiation of the Wiener 
process in the tdistribution sense [23] (- 
395 Stationary Processes, 407 Stochastic 
Processes). 


I. Generalizations of Brownian Motion 


In addition to the Brownian motion described 
above, there are several stochastic processes 
that are also called Brownian motion. A Gauss- 
ian system (X (a)}aer~ defined on a proba- 
bility space (Q, B, P) is said to be a Brownian 
motion witb an N-dimensional time parameter 


189 


if&i) E(X (a))=0, Gi) E(X (a) X (b) 23(la] + 

IN Jo ML (iii) P(X(0)=0)=1. Leta*bethe 
tspherical inversion of a € R“ with respect to 
the unit sphere. Set X *(a)— |a| X (a*) (a Z0) 
and X*(O) = 0. Then (X *(a) taer" defines the 
same probability law with a Brownian mo- 
tion with an N-dimensional time parameter. 
Almost all paths of a Brownian motion with 
an N-dimensional time parameter are con- 
tinuous. À positive, continuous, increasing 
function o defined on [to, oo) with ty » 0 is said 
to belong to the upper (lower) class with re- 
spect to local continuity if the probability 

that the closure of the set (a |X(a, w)| > /\al 
oi 1/|al), |a| > 0} contains the origin 0 is equal 
to 0 (1). Then @ belongs to the upper or lower 
class with respect to local continuity according 
as the integral 


t 


0 


Lem" zu 
t 


converges or diverges. For example, 


q(t)- 2log5,t--2N *-  1)logat2logat 4 


--2log,,.,,t-- (2 d)log,, t)? 


belongs to the upper or lower class with re- 
spect to local continuity according as 6 > 0 or 
ô <0. As a special case, we have 


d limsup ESC 


————M———-|j-21. 
a0 ./2|a|loglog/|a| | 


Take a positive, continuous increasing func- 
tion 9 defined on [tg, 00) with to 0, and set 
y(t) - Jt o(1/t). If almost all paths X (a, c») 
(|a| < 1) satisfy the Lipschitz condition relative 
to w(t), then @ is said to belong to the upper 
class with respect to uniform continuity. It is 
said to belong to the lower class with respect 
to uniform continuity if, with probability 1, 
these paths do not satisfy the Lipschitz con- 
dition relative to w(t). Then o belongs to the 
upper or lower class with respect to uniform 
continuity according as 


0 
| oti te ee dt 
t 


0 


converges or diverges (T. Sirao, Nagoya Math. 
J., 17 (1960)). For example, 


p(t) = QNlog t+ (4N + (log, att 2 log,s)t 
+.. +2loge1)t+(2+ Slog, t)" 


belongs to the upper or lower class with re- 
spect to uniform continuity according as à > 0 
or 6< 0. As a special case, 


i 1X (a, 0) — X (b, m)! 
P lim su o 
H stellen ,/2N |a — b|log Ulo — b] 


=1)=1 
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(For general information about Brownian 
motion with a multidimensional time para- 
meter — P. Lévy [6] and H. P. McKean, 
Theory cf Prob. Appl. 8 (1963).) 

Let X(t) be a Wiener process. L. S. Ornstein 
and G. E. Uhlenbeck based their investigation 
of the irregular movements of small particles 
immersed in a liquid on Langevin's equation 


dU (t) - — aU (t)dt + Bd X (t), 


where U (t) is the velocity of a particle. The 
first term on the right-hand side of this tsto- 
chastic differential equation (- 406 Stochastic 
Differential Equations) is due to frictional 
resistance or its analog, which is thought to be 
proportional to the velocity. The second term 
represents random external force. The station- 
ary solution of this equation is given by 


U(t) 3i fe **? dX (u). 


The stochastic process {U(t)} a is a 
stationary (Gaussian Markov process with 
covariance function y(t) = (f?/2a)e *"l. This 
process is called ` Ornstein-Uhlenheck Brownian 
motion [24] (— 176 Gaussian Processes). 

Brownian motion has been defined on state 
spaces that are *Riemannian spaces or +Lie 
groups, and its properties are being investi- 
gated (— 5 Additive Processes, 115 Diffusion 
Processes). 
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A. General Remarks 


One of the first objects of differential calculus 
was to systematize the theory of the extrema 
of functions of a linite number of indepen- 
dent variables. In the calculus of variations 
we consider functionals De. real-valued or 
complex-valued functions delined on a func- 
tion space {u} consisting of functions delined 
on a certain domain B), originally so named 
by J. Hadamard. Such a functional is denoted 
by J[u B] (or simply J[u]). A function u, 
which is considered an independent variable of 
the functional, is called an argument function 
(or admissible function). 

Concrete examples of functionals are the 
length of a curve y = f(x) and the area of a 
surface z = z(x, y), which are expressed by 


L[y] -| IA Q9) dx 


and 
S[z] =f V Lo zd zi dxdy, 
B 


respectively. Furthermore, consider a curve y — 
y(x) connecting two given points (x,, yg) and 
(x4, y.) with y, >y,. The time in which a parti- 
cle slides down without friction from (x,, y,,) 
to (x4, y,) along this curve under constant 
gravity acting in the direction of the positive 
y-axis is expressed by the functional 


ich (1L y) (y— yo) dx, 
Xo 
k constant. (1) 


Let F(...) be a known real-valued function 
that depends on a certain number of indepen- 
dent variables and on argument functions of 
these variables, together with derivatives of 
these functions up to a certain order. Then a 
typical problem in the calculus of variations is 
formulated as an extremal problem of a func- 
tional that is expressed by the integral with 
F(...)as the integrand, for example, 


J[u]= |: F(x, u(x), u(x), . . ,u?(x))dx, 


J{u, v] = F(x, y u(x, y), Uy, Hax 
[u, v] ut (xX, Y), us, Uy; 


v(x, y. Ux; Uy, nn ) dx dy. 


For instance, a curve minimizing (1) is the 
curve of steepest descent. 

In extremal problems, suitable boundary 
conditions can be assigned to argument func- 
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tions. On the other hand, there are so-called 
conditional problems in tbe calculus of vari- 
ations. A typical example of this sort is the 
tisoperimetric problem, ie, the determination 
of the curve that bounds a domain with maxi- 
mal area among all curves on a plane with 
given length. In general, an extremal problem 
of a functional under a subsidiary condition 
that the value of another given functional 
remain lixed is called a generalized isoperi- 
metric problem. In addition to these, there are 
Lagrange's problem, in which a finiteness 
condition is imposed, and Hilbert's problem, in 
which a condition consisting of differential 
equations is imposed. 

The birth of the calculus of variations was 
almost simultaneous with that of differential 
and integral calculus. Johann Bernoulli, Jakob 
Bernoulli, L. Euler, and others had dealt with 
several concrete problems of the calculus of 
variations when in 1760, J. L. Lagrange intro- 
duced a general method of dealing with vari- 
ational problems connected with mechanics. 
Then an equation bearing the name of Euler 
or Lagrange was introduced. 


B. Euler's Equation 


As an example, consider the simplest vari- 
ational problem 


x 

J[y]= | F(x, yx), y'(x))dx = min. (2) 
Xo 

Let the boundary condition viel = y.. y(x4) 

=y, be assigned to the argument function 

y(x). Consider a family of admissible functions 

Y (x; €)= y(x)+ n(x), where n(x) is any lixed 

function vanishing at both endpoints and 

& is a parameter. If y(x) gives the minimum 

of J[ y], then the function of ¢, JẸ Y], must 

attain a minimum for ¢ = 0. The condition 

(OT YJ/& ),, = 0 is written in the form 


0 -| (F,n  F,m)dx 


Xi d 
-| de Zen 


by taking into account the boundary con- 
dition. By making use of the arbitrariness of 
n(x), we conclude that 


0= P, Eje E Bye YE - Y Eyy (3) 
holds, in view of the following lemma: Let q(x) 
be continuous in [x9, x,] and y(x) be a func- 
tion of class CH that satisfies n(x) = n'(xo)=- - 

- n? (x9) 20, n(x1) =n (x)=... 9 1'?(x,) 20 

(0 q« p). If 2: nG)o(x)dx —0 holds for any 
such n(x), then o(x)—0. (Here class C? can be 
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replaced by class C? when q is supposed finite. 
If ne C°”, then q = oo is admitted.) This is called 
the fundamental lemma in the calculus of vari- 
ations. We call (3) the Euler-Lagrange differ- 
ential equation (or Euler's equation for the 
extremal problem). Since this equation is of 
the second order, y(x) can be determined by 
means of the boundary condition. 


d 
The quantity [F], — F, =e F; contained in 
x 


equation (3) is called the variational derivative 
of F with respect to y. Furthermore, dy = nde 
and Al = (6J [ Y\/&),,, de are called the first 
variations of the argument function y and of 
the functional J[ y], respectively. If n(x) is not 
subject to the condition that it must vanish at 
the endpoints, then the first variation of J 
becomes 


AI- | [F] ôydx+ EF, à] 


In comparison with an ordinary extremal 
problem T (x4, s Xp) = min in differential 
calculus, [F], and Al correspond to gradfand 
df, respectively. In general, a solution of the 
Euler-Lagrange differential equation [F], =0 
is called a stationary function for the vari- 
ational problem, and its graph is called a 
stationary curve. 

For a variational problem involving several 
argument functions, we have only to write the 
system of Euler-Lagrange differential equa- 
tions corresponding to them. For a problem 


x 
=| "F(x, pt... ¥™)dx=min 


*o 
whose integrand involves derivatives of higher 
orders of an argument function, the Euler- 
Lagrange differential equation is 


m d" 
[F], 32 LU y =O. 
For a problem involving a double integral 
J[u] = F(x, y, Uu, Ux, yy) dx dy - min, 
S Sg 


the equation is 


Ó d 
Oz [F],— F,—-F, 


n 
Ox 


* By y" 

For a generalized isoperimetric problem, for 
example J[ y]- min, K [y] = c, the Euler- 
Lagrange equation becomes 


[F+4G],=0 


with the so-called Lagrange multiplier A, where 
F and G denote the integrands of J and K, 
respectively. Two integration constants con- 
tained in a general solution of this differential 
equation of the second order and an undeter- 
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mined constant 4 can be determined, for in- 
stance, by the boundary condition y(X9) = yo, 
y(x,) = y. and a subsidiary condition K [y] = c. 
As an example, for the classical proper isoperi- 
metric problem F = y, G = ,/1 + y?, the equa- 
tion is 1 AU, / 1 4- y = 0, which after 
integration leads to (x —a)? + (y = ff 2 4? 
(— 228 Isoperimetric Problems). 

Besides the case of fixed endpoints, there is 
a boundary condition, for instance, that an 
endpoint (x, y.) of the argument function 
y = y(x) must lie on a given curve T(x, y) = 0. 
For the case of such a movable endpoint, the 
extremal function is subject to the condition of 
transversality, 


(F—-y F,)T, — F, T, 20, 


X=X1. 


C. Sufficient Conditions 


A. M. Legendre introduced the notion of the 
second variation, corresponding to differential 
quotients of the second order in differential 
calculus, in order to discuss sufficient con- 
ditions. Concerning the simplest problem (2), 
the inequality F,,,.(x, yo{x), yo(x)) 2 0 is neces- 
sary in order for yo(x) to give the minimum. 
Conversely, the inequality Fy, > 0 and Jacobi’s 
condition (which is stated below) imply that 
y = yo(x) gives a weak minimum. Here “weak 
minimum” means the minimum when a family 
of admissible functions { yey, <E y —-ygl« 
e} is considered a neighborhood of y,. 
Jacobi's condition: Let y be a solution of a 
linear ordinary differential equation of the 
second order, 


d du d 
ES Ven = Fy ax Dy u —0, 
u(xo) = 0; 


then the smallest zero of ų that is greater than 
Xo (i.e., the conjugate point of xX) is greater 
than the right endpoint x; 

K. Weierstrass derived suflicient conditions 
for a strong minimum by extending the range 
of admissible functions to { y — yo «£j. Re- 
sults that were obtained until about that time 
constitute the content of what is usually called 
the classical theory of the calculus of variations. 

If for the variations problem (2) there exists 
a unique curve through every point in a do- 
main on the xy-plane that belongs to a one- 
parameter family of stationary Curves of the 
functional 


x 
J[y]= | F(x, y, y)dx, 
Xo 


then the domain is called a field of stationary 
curves. Let the parameter value of the curve 
through a point (x, y) in such a family of 
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stationary curves y = @(x; a) be denoted by X 

= a(x, y The slope p(x, y) = [9'(x; 2) Ja=acx,yy is 
called the slope of the tield at the point (x, y) 
or, by regarding x, y as variables, the slope 
function of the field. The value of a curvilinear 
integral 


Ic— ILy] 
dE 
C 


is then determined and depends only on the 
two endpoints of the curve C. We call I, 
Hilbert's invariant integral. In view of the 
property mentioned above, we can denote the 
value of the functional J for a function y = y(x) 
representing a curve C by Je. Then any admis- 
sible curve C that passes through a tield em- 
bedding a stationary curve C, satisfies 


0€ AJ —Jc— Jc, — | é(x, y; p, y )dx. 

Here i 

(x,y; p, Y) 

= F(x, y. y) - F(x, y. p) —Q' —p)F,(x, y, p) 


is the &-function introduced by Weierstrass. 
For C, to give the minimum of J[ y], it suffices 
that ë > 0 hold for every point (x, y) in. the field 
and every value y’ (— 279 Morse Theory). 


D. Optimal Control 


Let a system of differential equations 
dx;/dt = fí(x,, . - s Xy; Dr, uy), 
(ui) u)eQ; x(t)ox, i=1,..) n, (4) 


be given, where u,,.., y, are parameters. In 
general, a problem of optimal control is to 
determine u; = u,(t) (to St  t,) such that the 


value of a functional 


md: Un a nde 


assumes a minimum, where x(t) are the solu- 
tons of (4) and are considered functions in 
Ui,- H and t. Such a problem is a kind of 
conditional variational problem. But since the 
existence region of y is restricted, certain con- 
ditions in the form of inequalities are imposed, 
and furthermore y is not necessarily continu- 
ous, and in many cases the problem cannot 
be treated within the classical theory of the 
calculus of variations (— 86 Control Theory). 


E. The Direct Method in the Calculus of 
Variations 


In mathematical physics tvariational principles 
are derived from discussions of formal corre- 
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spondence between a functional J[u] to be 
minimized and Euler's equation for J [u]. This 
is certainly one of the important methods in 
the calculus of variations, but it is also possible 
to investigate a stationary function Va on the 
basis of its stationary character and indepen- 
dently of Euler’s equation. This is called the 
direct method in the calculus of variations. It 
plays an important role in the theoretical 
treatment of the existence and uniqueness of 
solutions, and it is also significant as a tech- 
nique for approximate or numerical solutions. 
When a differential equation is given inde- 
pendently of the calculus of variations, it is 
possible to apply the direct method if a func- 
tional whose Euler’s equation is the given 
differential equation can be constructed. 

Let D be a bounded domain in m- 
dimensional space and feL.,(D) be a real- 
valued function. Consider the variational 
problem of minimizing the functional 


s= idu ds-2] fu dx. 
D D 


Here we suppose the set of admissible func- 
tions, denoted by Aj, to be the Hilbert space 
obtained by completing the function space 
Cg (D) with respect to the norm 


1/2 
N(u) d ladutd | ar) 
D D 


Utilizing F. Riesz’s representation theorem in 
Hilbert spaces, it can be shown that there 
exists a minimum value | in A, which is 
uniquely realized by certain ug € A;. Since the 
function uj belongs to A,, it can be shown that 
the boundary condition 


Ul ap =9 (5) 


is satisfied in a generalized sense. Furthermore, 
in view of J[ug] € J[ug + o] being valid for 
any oc C$ (D), it can be verified that the 
equation 


-Au-f (6) 


is satistied in D in the sense of differentiation 
of tdistributions. In other words, the station- 
ary function ug is a solution in the wide sense 
(a tweak solution) of the classical boundary 
value problem for *Poisson’s equation for- 
mulated by (6) and (5). If a function space A, 
with 4, 2 A, > C%(D) is taken as the set of 
admissible functions, the value |= J [ugo] be- 
cornes the greatest lower bound of J in A J 

In this case, if (u,1 z; is any minimizing se- 
quence from A,, that is, ifu,éA,,n=1,2,..; 
J[u,] >l (n oo), then it converges to ug in the 
sense that 


N(u,—Up) 70 (n0) (7) 


In other words, the solution in the wide sense 
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ug of the boundary value problem can be 
constructed as the limit of a minimizing se- 
quence that consists of sufficiently smooth 
functions vanishing on the boundary. In the 
proof of the fact that a solution in the wide 
sense uo coincides with the classical solution of 
the boundary value problem under an assump- 
tion of suitable smoothness for f and 0D, a 
standard argument has been established for 
proving the regularity of a solution in the 
wide sense. 

The technique of obtaining the solution of 
the boundary value problem as the limit of 
a minimizing sequence was proposed by B. 
Riemann concerning the classical ?Dirichlet 
problem and was completed by D. Hilbert. 
This pioneering work led to the recent treat- 
ment of boundary value problems by utilizing 
Hilbert spaces. For +Self-adjoint boundary 
value problems, the method stated for the 
above example has been generalized almost 
directly to the cases of differential operators of 
higher order and with variable coefficients (- 
323 Partial Differential Equations of Elliptic 
Type). By making use of some auxiliary argu- 
ments, this technique can be applied exten- 
sively to the construction of several kinds of 
mapping functions in the theory of functions 
of a complex variable, to the solution of 
"integral equations of the second kind, and 
also in other fields [3,4]. 

The eigenvalue problem, which is formu- 
lated by 


Hu-iu, u#0, (8) 


with a +Self-adjoint operator H in a Hilbert 
space, can also be transformed into a vari- 
ational problem for the *Rayleigh quotient 


R[u] =(Hu, uu]? (9) 


(= 298 Numerical Computation of 
Eigenvalues). 


F. Solution of Differential Equations by the 
Direct Method 


In view of the convergence shown in (7), a 
minimizing sequence can be regarded as an 
approximating sequence for a solution of the 
boundary value problem or a stationary func- 
tion ug. Let a function 


usa x5 S Wr Crest) (10) 


involving an n-vectorc =(c,,...,¢,) asa para- 
meter be admissible for any c. If J[u,(-; c)] = 
F(c), obtained by substituting u, into J, is 
minimized at c = c?, then u,( *; c?) is considered 
as a function that approximates ug most pre- 
cisely within the family u,( *; c). This vector c? 
is obtained, in general, by solving the simulta- 
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neous equations 
0J[u,(;c0)]/0670, |. j2 1, ..., n. (11) 


The function u, appearing in (10) is often taken 
to be a so-called linear admissible function. 
For instance, in the example above, let {o}: 
be a system of independent functions complete 
in A,, and set 


u,— C1Q1T C40, (12) 


The method that constructs a minimizing 
sequence uj, u,,. by determining the value of 
the c, in (12) by (11) is called Ritz's method, 
and o, is called a coordinate function in this 
method. As for the rate of convergence in 
approximation by Ritz's method, as well as 
for the estimation of errors, there are several 
results by the Soviet school in addition to 
those of E. Trefftz [3, 5]. (Other methods of 
constructing minimizing sequences are stated 
in detail in [3]; concerning a connection with 
*Galerkin's method — 304 Numerical Solution 
of Partial Differential Equations.) 

Ritz’s method applied to eigenvalue prob- 
lems is called the Rayleigh-Ritz method. Since 
a stationary value of the Rayleigh quotient is 
itself an eigenvalue in this case, the precision of 
approximation is far better for eigenvalues than 
for eigenfunctions, so that it is a convenient 
method for the approximate computation of 
eigenvalues. 
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47 (XXI1 5) 
Cantor, Georg 


George Cantor (March 3, 1845-January 6, 
1918), the founder of set theory, was born in 
St. Petersburg into a Jewish merchant family 
that settled in Germany in 1856. He studied 
mathematics, physics, and philosophy in Zu- 
rich and at the University of Berlin. After re- 
ceiving his degree in 1867 in Berlin, he became 
a lecturer at the University of Halle and served 
as a professor at that university from 1879 to 
1905. In 1884, under the strain of opposition 
to his ideas and his efforts to prove the fcon- 
tinuum hypothesis, he suffered the lirst of 
many attacks of depression which continued 
to hospitalize him from time to time until 

his death. 

The thesis he wrote for his degree concerned 
the theory of numbers; however, he arrived at 
set theory from his research concerning the 
uniqueness of *trigonometric series. In 1874, he 
introduced for the lirst time the concept of 
Cardinal numbers, with which he proved that 
there were “more” ttranscendental numbers 
than talgebraic numbers. This result caused 
a sensation in the mathematical world and 
became the subject of a great deal of con- 
troversy. Cantor was troubled by the oppo- 
sition of L. tKronecker, but he was supported 
by J. W. R. tDedekind and G. Mittag-Leffler. 
In his note on point-set theory, he wrote, in 
connection regard with his concept of infinity, 
“The essence of mathematics lies in its free- 
dom!” In addition to his work on cardinal 
numbers, he laid the basis for the concepts of 
torder types, ttransfinite ordinals, and the 
theory of real numbers by means of ffunda- 
mental sequences. He also studied general 
point sets in Euclidean space and defined the 
concepts of faccumulation point, ‘closed set, 
and topen set. He was a pioneer of the point- 
set theory that led to the development of 
general topology. 
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48(X.31) 
Capacity 


A. General Remarks 


The electric capacity of a conductor in the 3- 
dimensional Euclidean space R? is defined as 
the ratio of a given positive charge on the 
conductor to the value of the potential on the 
surface. This definition of capacity is indepen- 
dent of the given charge. The capacity of a set 
as a mathematical notion was defined lirst by 
N. Wiener (1924) and was developed by 0. 
Frostman, C. J. de La Vallée Poussin, and 
several other French mathematicians in con- 
nection with tpotential theory. 


B. Energy 


Let Q be a tlocally compact Hausdorff space 
and Q(x, y) be a flower semicontinuous func- 
tion on Qx Q such that —oo « $ € oc. A mea- 
sure jt will mean a nonnegative "Radon mea- 
sure with compact 'support S,. Denote by 
(x, u) the tpotential { B(x, y)du(y) of a mea- 
sure u with kernel and by ( Di ul the energy 
[f dudp of u. Let X be a set in Q, and denote 
by Uy the class of normalized measures u G.e., 
of measures H satisfying u(Q) — 1) with S, c X. 
Let K be a nonempty compact set in Q, Set 
W(K)=inf(p, ul for ue Yg, and W(@)= oo for 
the empty set GI. For (x, y) = 1/|x — y|in Q 

= R^ the general solution u(x) of the tDirich- 
let problem (texterior problem) for the bound- 
ary function 1 in the unbounded component of 
R? — K is equal to the potential of an tequilib- 
rium mass-distribution. Therefore, if S is a 
smooth surface surrounding K and normals 
are drawn outward to S, the integral -( 1/4 r): 
[s(0u/On)de of the normal derivative is equal 
to 1/W(K). This is the capacity of K delined 
by N. Wiener (J. Math. Phys., MIT, 3 (1924)) 
when K is a closed region. Vallée-Poussin 
(Ann. Inst. H. Poincaré 2 (1932)) called the 
supremum of u(R?°) the Newtonian capacity of 
a bounded *Borel set E, where o runs through 
the class of measures p with S, c E whose 
*Newtonian potentials are not greater than 1 
in R?. If E is compact, the Newtonian capacity 
coincides with Wiener's capacity. For the 
tlogarithmic potential in R*, e "9 is called 
the logarithmic capacity. When tGreen's func- 
tion g(z, oo) with the pole at the point at infin- 
ity exists in the unbounded component of R? 
— K, lim, ,.(g(z, oo) ^ log|z|) is called Rohin's 
constant and can be shown to be equal to 

W (K). (For the relation between Robin's con- 
stant and treduced extremal distance — 143 
Extremal Length.) In the case of a general 
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kernel it is difficult to define capacity as above 
by means of W(K), and hence the value of 
W(K) itself instead of the capacity of K is often 
used. When W(K)= co, we can say that K is of 
capacity zero. The minimum value of the 
"Gauss integral (u, u) — 2 f f duis a generaliza- 
tion of W(K), where ue 4/, and f is an upper 
semicontinuous function bounded above on K. 


C. Minimax Value 


Suppose that we are given the kernel «b as 
above. For a set X CH and a measure Jt, 

set U (u; X) = Sup, y P(x, u) and V(p; X)- 
inf, y P(x, u). Next, for Y c Q, set U(Y) = 

inf U(u;S,), V(Y) = supV(j Sek U(X, Y) = 

inf U( u, X), and V(X, Y) = sup V( y; X), where 
ue ty. If dia, y) = @(y, x) is taken as a kernel 
instead of ®(x, y), then the notations u/(K), 
U(u; X), V(u; X), are used correspondingly. 
For any compact set K the following relations 


hold: 
W(K)= W(K)« CI(K)= ü(K) 


U(Q, K) 3 V(K,O) 
<{0(Q, K)= V(K,Q) 


S E K)=V(K, 2 
V(K)= V(K). 


U(K, K)=V(K, K) 


Examples show that all the inequalities can be 
strict. The tminimax theorem in the *theory of 
games plays an important role in the proof of 
these inequalities [7]. Even if the kernel is 
symmetric, the inequalities can be strict except 
for the equality W(K)= U(K). When the kernel 
is positive, we can define the quantities which 
correspond to U(Y), V(Y), U(X, Y), V(X, Y) 
by considering the class of u with S, c Y and 
(u, 4) = 1 instead of Wy. 


D. Transfinite Diameter 


As k—oo, 


D,(K)-k !(k—1) ! mt S Y, tes, xj) 
Kyser XEK ZY 

decreases and the limit D (K) is equal to W (K). 
For the logarithmic kernel in R?, M. Fekete 
defined D (K) and called e72% the transfinite 
diameter of K (1923). F. Leja and his school in 
Poland studied relations between transfinite 
diameter and *conformal mapping. Next, set 


k 
kR,(X, Y)2 sup inf}, O(x, xj). 
Kips tees ef xeX i=l 
Then R(X, Y) =lim R,(X, Y) exists as k— oo, 
and we have R(K, Y)= V(K, Y). 

Fekete introduced R(K)=R(K, K) in R? 
(1923). G. Polya and G. Szegó computed D(K) 
and R(K) for special K and cc-kernel r * (y 2 0) 
in R? and R? [12]. The equality D(K)=R(K) 
holds for the logarithmic kernel in R? and the 
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Newtonian kernel in R?. The maximum of the 
absolute value on K of *Chebyshev's poly- 
nomial (— 336 Polynomial Approximation) 
of order k with respect to K in R? is equal to 
exp( - kR,(K)). 


E. Evans's Theorem 


In order that K be of Newtonian capacity 
Zero, it is necessary and sufficient that there 
exist a measure j| on K such that the New- 
tonian potential of 4t is equal to % at every 
point of K. This result was proved by G. C. 
Evans and H. Selberg independently (1935) 
and is called Evans's theorem (or the Eyans- 
Selberg theorem). The corresponding theorem 
in R? is often applied in the theory of functions 
[ 15,16]. A similar potential exists in case of a 
general kernel if and only if R(K, K)= œ. 


F. Nonadditivity of Capacity 


Many kinds of capacity satisfy the inequality 
cap( Us X.) < X, cap X,, where a capacity is 
denoted by cap. Even the Newtonian capacity 
C is not necessarily additive, but it satisfies 
C(K,UK;)- C(K,0 KJ) & C(K,) - C(K;) (G. 
Choquet [2]). Choquet [3] proved that X can 
be divided into mutually disjoint sets X, and 
X, such that C,(X) = C,(X,) = C,(X,), where 
C,(X) is the Newtonian inner (or interior) ca- 
pacity delined to be sup, - y C(K) if X 4 Ø 

and 0 if X = Ø. 


G. Relation to Hausdorff Measure 


There are many studies of relations between 
capacity and *Hausdorff measure [ 1]. Frost- 
man [6] introduced the notion of capacitary 
dimension and observed that it coincides with 
the Hausdorff dimension. The capacity of 
product sets has been evaluated from above 
and below [ 11]. For compact sets K c R”, K’ 
c R”, their dimensions a, f. and the dimension 
yof K x K’, we have the relation a + <y € 
min(m + a, n+), where the equalities are 
attained by general Cantor sets. There are 
also works on the evaluation of capacities 

of general Cantor sets [10,15]. If K is a con- 
tinuum of logarithmic capacity 1 in a plane, 
then its diameter d satisfies 2 <d < 4, and its 
area A satisfies A«n [S]. Consider the sum K 
={z,+...+2,|z,€K,,1<k<n} ofcontinua 
K,,.... K,ina plane. The logarithmic capacity 
of K is strictly greater than the sum of the 
logarithmic capacities of. K,, , K, except 
when all K, are convex and mutually similar 
[14]. By various fsymmetrizations the log- 
arithmic capacity decreases in general (— 228 
Isoperimetric Problems; also [ 133). 
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H. Capacitability 


The Newtonian outer (or exterior) capacity 
C,(X) is defined by inf C;(G), where G ranges 
Over an open set containing X. The inequality 
C, (X) € C,(X) holds, in general. When the 
equality holds, X is called capacitable. Cho- 
quet (1955) [2] proved that all fanalytic sets 
and hence tBorel sets are capacitable but there 
exists an analytic set whose complement is not 
capacitable. He himself generalized his result 
on capacitability in the following way [4]: Let 
Q be an abstract space, 9 a nondecreasing 
function defined on the family of all subsets of 
Q, and # some family of subsets of Q that is 
closed under the formation of finite unions 
and countable intersections. Assume that 
q(H,)|o(H) as H, in # decreases to H and 
that o(X,)fo(X)as X,] X. When p(X) is equal 
to sup{p(H)| HEX, H c Xj, X is called 

(o, #)-capacitable. Choquet detined 2 - 
Suslin sets and showed that they are (p, #)- 
capacitable. M. Kishi [9], Choquet [5], and B. 
Fuglede [7] investigated capacitability with 
respect to several kinds of capacity more gen- 
eral than Newtonian capacity. We can discuss 
capacitability with respect to quantities de- 
fined in connection with the *Gauss varia- 
tional problem. 
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Cardinal Numbers 


A. Definition 


The general concept of cardinal number is an 
extension of that of natural number (Cantor 

[ 1]). When there exists a tone-to-one corre- 
spondence whose tdomain is a set A and 
whose trange is a set B, this set B is said to be 
equipotent (or equipollent) to A, and this rela- 
tion is denoted by A-B. The relation ~ is 

an *equivalence relation, and each equivalence 
class under this relation is said to be a cardinal 
number. The class of all sets equipotent to a set 
A is denoted by A (or A|) and is said to be the 
cardinal number (power, cardinality, or potency) 
of tbe set A. When A is a finite set, A 1s said to 
be finite, and when A is an infinite set, A is 
said to be infinite (or transfinite). When the 
cardinal number of a set A is m, A is also said 
to consist of m members (or m elernents). In 
this sense, O and the natural numbers are 
considered to express finite cardinal numbers. 
For example, 0= @, 1 = (0), 2 = {0,1}, etc. 
Examples of infinite cardinal numbers: A set A 
which is equipotent to the set N of all natural 
numbers is said to be countably infinite, and 
the cardinal number of the set N is denoted by 
a. A set A which is finite or countably infinite 
is said to be countable. The cardinal number of 
the set of all real numbers is denoted by c and 
is called the cardinal number of tbe continuum. 
Moreover, the cardinal number of the set of all 
real-valued functions whose domain is the 
interval [0, 1] is denoted by f. These three 
cardinal numbers are known to be distinct. 
Henceforth in this article, lower-case German 
letters denote cardinal numbers. For a defi- 
nition of cardinal numbers using the concept 
of ordinal numbers — 312 Ordinal Numbers. 
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B. Ordering of Cardinal Numbers 


m2>n or n<m will mean that there exist sets 
A and B such that m A, n= B, and A >B. 

A x B does not necessarily imply m z n. For 
example, the cardinal number of the set of all 
positive even numbers is also a. m z n and 
n>m imply m =n (Bernshtein’s theorem). 
Since the ?reflexive and ‘transitive laws for the 
relation € between cardinal numbers are 
obvious, the relation is an tordering relation. 
The ‘well-ordering theorem imples that > is a 
*total ordering (comparability theorem for 
cardinal numbers). m >n means that m zn 
and m z n. When B x mi, B is said to be at 
most nt. 


C. Sum, Product, and Power of Cardinal 
Numbers 


For cardinal numbers m and n, choose sets A 
and B so that nt— A, 1 — B, and AQ B- Qj, 
and put s= AU B. Then s is uniquely deter- 
mined by m and n. The s is said to be the sum 
of the cardinal numbers m and n and is de- 
noted by m +n. If the sets A, B are chosen as 
described above, the cardinal numbers of the 
*Cartesian product A x B and of the set of 
functions A? are called the product of m and n 
and the nth power of m, denoted by mn and 
m", respectively. These operations are also 
determined by m and n. For these three oper- 
ations, the following laws are valid: commuta- 
tive laws m+n=n+m, mn = nm; associa- 

tive laws (m +n) + p =m + (n + p), (mn)p = 
m(np); distributive law p(m +n)= pm + pn; 
exponential laws m" *? = m"m?, m™ =(m"}, 
(mn)" = mi". In particular, if A =m, then 2" 
is the cardinal number of the tpower set ‘B(A) 
of A. 

Addition and multiplication of more than 
two cardinal numbers can be defined as fol- 
lows. Let A be any set, and suppose that to 
any element 4 of A there corresponds a unique 
cardinal number m,. Let M, be a set such that 
M,=m,, and M, M^ DI for As A. Then 
the cardinal number of the fdisjoint union 
1, M, is said to be the sum of all m, and is 
denoted by 2, m,. The cardinal number of the 
Cartesian product [], M, is said to be the 
product of all m,(A€A) and is denoted by 
II, m;. The axiom of choice can be stated as 
follows: If nt; #0 for all AeA, then IT; m, 40. 





D. The Continuum Hypothesis 


For a, c, and f defined as before, [225 c2 2^» 
a. In general, 2" >m holds for any cardinal 
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number m (Cantor). The hypothesis which 
asserts that for any m, there does not exist an 
n such that 2" — n ntis called the gener- 
alized continuum hypothesis. In particular, this 
hypothesis restricted to the case where m =a 
is called the continuum hypothesis. After Can- 
tor stated this hypothesis (J. Reine Angew. 
Math., 84 (1878)), it remained an open ques- 
tion for many years. In particular, Cantor 
himself repeatedly tried to prove it, and W. 
Sierpinski pursued various related hypotheses. 
Finally, the continuum hypothesis and the 
generalized continuum hypothesis were proved 
to be independent of the axioms of set theory 
by K. Godel (1940) [3] and P. J. Cohen (1963) 
[4] (— 33 Axiomatic Set Theory). 


E. Cardinality of Ordinal Numbers 


Lower-case Greek letters will stand here for 
tordinal numbers. The cardinal number of 
{€|E<a} will be denoted by x, which is called 
the cardinality of the ordinal number o or the 
cardinal number corresponding to x. When a 
cardinal number m corresponds to some 
ordinal number, the minimum among ordinal 
numbers x with x — m 1s called the initial or- 
dinal number corresponding to m. An initial 
ordinal number corresponding to an infinite 
cardinal number is called a transfinite initial 
ordinal number. There exists a unique corre- 
spondence fj c, from the class of ordinal 
numbers onto the class of all transfinite initial 
ordinal numbers such that fj >y implies œg > 
w,. In particular, wg — o, and an ordinal 
number č such that č « c, is called a count- 
able ordinal number. c is called the fth 
transfinite initial ordinal number. The cardi- 
nality of c is denoted by N; (N is the Hebrew 
letter aleph). In particular, a is denoted by No 
(aleph zero). We have He >X, if and only if 

f 2: y, and in this case Ng +N, 2 N,, NN, = 
Se The axiom of choice implies that every 
infinite cardinal number is an N,. Hence, in 
this case, the continuum hypothesis can be 
formulated as 2^» 2 N,, and the generalized 
continuum hypothesis can be formulated as 
25" — N54 for every ordinal number $. 


F. Finiteness and Infiniteness 


Dedekind [5] defined a set A to be infinite if A 
is equipotent to a proper subset of itself, and 
to be finite otherwise. It 1s also possible to 
define finiteness and infiniteness of sets as 
follows: A set A is finite if there exists a *well- 
ordering of A such that its *dual ordering ts 
also a well-ordering, and A is infinite other- 
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wise. If a set is finite in the latter sense, then it 
is also finite in the sense of Dedekind. Under 
the axiom of choice, these two definitions can 
be shown to be equivalent. 
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Cartan, Elie 


Elie Cartan (April 9, 1869—May 6, 1951) was 
born at Dolomieu in the French province of 
Isére. He entered the Ecole Normale Supé- 
rieure in Paris in 1888 and graduated in 1891, 
having at the same time qualified in the agrége 
examination. Beginning his research immedi- 
ately, he completed his thesis on the structure 
of continuous transformation groups [2] in 
1894 at the age of 25. 

Cartan was a professor first at the Univer- 
sity of Montpellier, later at the University of 
Lyon, then the University of Nancy, and fi- 
nally in 1912 at the University of Paris. He 
freely used the tmoving coordinate system 
introduced by J. G. Darboux, and contri- 
buted to many areas, such as the theory of 
tLie groups, the theory of tPfaffian forms, the 
theory of tinvariant integrals, ttopology, tdif- 
ferential geometry (especially the geometry 
of tconnections), and theoretical physics. His 
doctoral thesis is still an object of interest 
among young researchers today, and the con- 
cept of connection introduced by him is funda- 
mental in the field of differential geometry. 
Henri Cartan (1904—) is his eldest son. 
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Catastrophe Theory 


A. General Remarks 


Catastrophe theory was originally proposed 
by R. Thom [1,2] in late 1960. This theory 
provides certain mathematical models for the 
evolution of forms in nature, in particular in 
biology and the natural languages. In recent 
years, E. C. Zeeman and others have devel- 
oped and applied the theory to various fields, 
including the physical sciences, medicine, 
economics, and sociology; a detailed bibli- 
ography may be found in [3,9, 10]. 


B. Static Models 


The basic concept of the theory is the static 
model. This is a family of potential functions 
f: X AR, where X is a subset of R”, containing 
a neighborhood of the origin, and the para- 
meter u lies in a neighborhood U of the origin 
in R”. 

We regard R" as the internal space or state 
space, which is parametrized by the various 
variables that are relevant to the process under 
study, and R’ is the external space or control 
space, which can be either the physical space- 
time continuum in which the process under 
consideration takes place or a space of control 
parameters that govern the process. As a rule, 
we assume that the dimension r <4, although 
the dimension n can be arbitrary large. 

The static model is local in nature, and any 
one of the local minima of f,, called a local 
regime at ue U, is a candidate for the state 
of the model corresponding to the control 
point u. 

Mathematically, a static model is a germ of 
C*-functions f: R" x RR at 0 that is an 
unfolding (r-dimensional extension family) of a 
germ of C?-functions y = f |R" x (0): R"5R at 
0; further details are given in Section D. 
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C. Classification of Singularities 


Let ó(n, m) be the vector space of *germs of 
C”-functions f: R" 5 R" at 0, and let Z(n)c 
&(n, n) be the subset of invertible germs R"— 
R" mapping 0 to 0. 

The germs y, £e &(n, m) are called right 
equivalent if there exists an he Z(n) such that 
noh=€. Let &(n) 2 &(n, 1). Then it follows in 
the usual manner that &(n) is a local algebra 
with the unique maximal ideal Æ =.@(n)= 
ine&(n)|n(0) 20]. A germ e &(n) such that 
(0) = D (0) OU is called a singularity. For any 
singularity 7, we define the codimension of n by 


codim y = dimg(.Z//(0n/0x;5 sim) 


where (0n/0x;»;,, is the ideal of &(n) gen- 
erated by 0n/0x,,...,6n/0x, and x =(x,,...,X,) 
denotes the coordinate system of R". 

The following result was proved by J. N. 
Mather [4]: Up to the addition of a nondegen- 
erate quadratic form in other variables, and up 
to multiplication by +1, a singularity of codi- 
mension «4 and 21 is right equivalent to one 
of the y appearing in Thom's list of the seven 
elementary catastrophes below. 


D. Unfoldings 


Let 4 be a singularity. An r-unfolding of y is a 
germ fe.@(n+r) such that f|R" x {0} =n; this 
unfolding is denoted by (r, f). Let (r, f) and 
(s, g) be unfoldings of 7. A morphism (o, ®, £): 
(r, f ) (s. g) consists of (i) a germ o e&(n 4r, 
n +s) such that o|R" x (0j — identity, (ii) a 
germ Oe ó(r, s) such that z,0 9 — 600, (iii) a 
germ £e Zirl such that f —go«q--zon,, where 
T,:R'"x R'3R' is the projection. In this case 
we say that the unfolding (r, f) is induced by 
(o, ®, £) from (s, g). A morphism is an isomor- 
phism if o and © are diffeomorphic germs. 
The addition of unfoldings (r, f) and (s, g) of 
5 ts defined by (r, f) - (s, g) 2 (r- s, f - g — 1), 
where the last term on the right is given by 
(f+g—n)(x, u, v) = f(x, u) Y g(x, v) - n(x). 
Thus if the constant unfolding (s, 4) of 7 is 
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defined by n(x, u) 2 q(x), we have (r, f) -- (s. — 
(r^ s, f). 

An unfolding (r, f) of y is said to be versal if 
any unfolding of nis induced by (r, f) and a 
suitable morphism. A versal unfolding (r, f) 
with minimal r is said to be universal. The 
following facts have also been proved by 
Mather [4]: 

A singularity ye Zu) has a versal unfolding 
if and only if codim nis finite. Any two r-versal 
unfoldings of n are isomorphic. Every versal 
unfolding is isomorphic to (r, f) 4- constant, 
where r 2 codim and (r, f) is a universal 
unfolding defined as follows: If {b,,...,b,} < 
Mn) is a system of representatives for a basis 
of nl nix, Aert, then the unfolding f of 
n is defined by f(x, u) 2 g(x) - bu (x)u, +... + 
b (x)u,. 


E. The Seven Elementary Catastrophes 


Let f and g be germs in &(n+r). We say that f 
and g are equivalent as r-unfoldings if there 
exist he Z(r), a family of H € Z(n), where ue 
UcR', and an se æ (r) such that f(x, y) 
g(H,(x), h(u)) + e(u). 

We say that a static model (r, f) is stable if 
any small perturbation (r, g) of (r, f) in é(n +r) 
(with the Whitney C”-topology) is equivalent 
to (r, f). The following is the main result ob- 
served by Thom and proved by Mather and 
others (for references see the bibliography in 
[3]). Suppose that r x 4. Then the set of stable 
static models (r, f) is an open dense subset of 
&(n+r), and up to the addition of a nondegen- 
erate quadratic form and multiplication by 
+1, any stable static model (r, f) is equivalent 
to one of the models with the standard poten- 
tials F, which are the universal unfoldings 
of singularities y in Table 1 (x, y denote inter- 
nal variables, and u, v, w, t denote external 
variables). 

The static models with these standard po- 
tentials are known as the elementary catas- 
trophes and can serve as qualitative models 
of various natural processes. 


Table 1. Thom’s List of the Seven Elementary Catastrophes 


r Singularity n Standard Potential F Name 

1 x? x? + ux Fold 

2 x? x4 + ux? + ox Cusp (Riemann-Hugo-Niot) 
3 x x? + ux? + ox? + wx Swallowtail (Dovetail) 

3 x? +y? x? +y? + uxy + vx + wy Hyperbolic umbilic 

3 x!- xy? x? — xy? + u(x? + yp?) + vx + wy Elliptic umbilic 

4 xa x8 tx* ux? vx? + wx Butterfly 

4 x? yc y* x?y t y* ux? oy? +wx + ty Parabolic umbilic 
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F. Catastrophe Sets 


A process for the static model feé&(n+r) is a 
subset s of X x U, where X and U are neigh- 
borhoods of the origin in R" and R', respec- 
tively. If s is a process and ue U, we define s, = 
sN(X x (uj). We say that ue U is a regular 
point for s if there exists a neighborhood V of 
uin U and a homeomorphism h: X x V X x 
V such that 7,0 ha, on X x V and h(s N 

(X x V)) 25, x V. A catastrophe point is a non- 
regular point in U. The set of all catastrophe 
points is called the catastrophe set. 

There are several conventions with regard to 
the definition of a process for the unfolding 
(r, f). One of these is the Maxwell convention, 
which requires that s, be a least local regime at 
ue U. The catastrophe set of the model (r, f) 
according to the Maxwell convention consists 
of those points u of the control space R” where 
the least minimum of f, is attained for at least 
two points or where this minimum is attained 
at a unique point but is not stable. 

Another is the perfect delay convention, 
which assigns to each path t in U a mapping 
(possibly discontinuous) m,:t— X x x such that 
n,(m,(u)) =u, (m,(u), u) is a local regime, and m, 
remains continuous for a maximum interval 
on the path t. Consider the set 


A= ((x, ue R" x R’|dz f(x, u) is degenerate} 


and its image B — z,(A), called the bifurcation 
set, under the projection z,: R” x R’>R’. Then 
the points of the bifurcation set B are impor- 
tant candidates for catastrophe points of the 
static model with respect to the perfect delay 
convention. 

For geometrical studies of the elementary 
catastrophes — [6—8]. The static models have 
been generalized to metabolic models, the 
structure of which largely remains an open 
question [2]. In this connection, the bibli- 
ographies of [3,9, 10] appear to be fairly 
complete. 
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Categories and Functors 


A. Categories 


Consider the family of all tgroups. Given two 
groups X and Y, denote the set of all thomo- 
morphisms of X to Y by Hom(X, Y). If X, Y, 
and Z are groups and if f: X > Y and g: Y^ 
Z are homomorphisms, we can compose 
them to get a homomorphism go f: X >Z. 

In general, suppose that we are given, as in 
this example, (1) a family IN of mathematical 
objects, and (2) for every pair (X, Y) of objects 
in W, a set Hom(X, Y) whose elements are 
called morphisms; and let fe Hom(X, Y) 
and ge Hom(Y, Z) determine a morphism 
gofeHom(X, Z), which is called their com- 
posite. A morphism f eHom(X, Y) is also 
written f: X > Y. Suppose further that these 
morphisms satisfy the following axioms: (1) if 
f: X5 Y,g:YoZ,and h: ZW are mor- 
phisms, then (ho g)of —ho(go f); (2) for 
each object X EM there exists a morphism 
1,:X—X such that for any f: X — Y and 
g:ZX we have f oly—f and 140g2g; 
(3) Hom(X, Y) and Hom(X’, Y?) are disjoint 
unless X = X' and Y= Y'. Then we call the 
whole system (i.e., the family of objects Mt, the 
morphisms, and the composition of mor- 
phisms) a category. The elements in IN are 
called the objects of the category. 

By axioms (1) and (2), the set Hom(X, X) is 
a semigroup (with respect to the composition 
of morphisms) which has 1, as the identity 
element. Hence 1, is determined uniquely by 
X. On the other hand, axiom (3) implies that 
a morphism f determines the objects X and 
Y such that fe Hom(X, Y). From these facts 
we can give an alternative definition of cate- 
gory using only the morphisms and their 
composition. 

The totality of the objects (or morphisms) in 
a category € is denoted by Ob(£) (or Fl(€); 
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the notation Fl comes from the French word 
flèche). The relation xe Ob(@) is often ab- 
breviated to xe €, while Hom(X, Y) is written 
Hom,(X, Y) if necessary. A subcategory of a 
category € is a category €' with Ob(€")c 
Ob(@), such that for X, Ye €" we have 
Hom,.(X, Y) c Hom,(X, Y) and the compo- 
sition in €" is the restriction of € to €". If 
Hom,.(X, Y) - Hom,(X, Y) for all X, Ye, 
we say that €' is a full subcategory of €. 

We define the product category €, x 6, of 
two categories in the canonical way, using the 
pairs of objects and the pairs of morphisms. 


B. Examples of Categories 


(1) Taking all sets as the objects, all mappings 
as the morphisms, and the composition of 
mappings as the composition, we obtain a 
category called the category of sets, denoted by 
(Sets) (or (Ens) from the French ensemble). For 
the empty set we make the convention that 
Hom(@, Y) contains just one element for any 
Y and that Hom(Y, @) is empty if Y z Ø. 

(2) As we have seen, taking all groups as the 
objects and the homomorphisms as the mor- 
phisms, we get the category of groups, written 
(Gr). If we limit the objects to *Abelian groups, 
we get the category of Abelian groups (Ab) as a 
full subcategory of (Gr). 

(3) Fix a ring R. The left R-modules and 
their R-linear mappings define the category of 
left R-modules, which we denote by 47. The 
category of right R-modules, Æg, is defined 
similarly. When R is tunitary, we usually 
limit the objects of 4/7 and Mpg to *unitary 
modules. If R is commutative we can identify 
rM With Mg. When R —Z (the ring of ra- 
tional integers), 47 can be identified with 
(Ab). When R is a field, 4. Z7 is also called the 
category of linear spaces over R. 

(4) Taking rings as objects and homomor- 
phisms of rings as morphisms, we obtain the 
category of rings. The subcategory consisting 
of unitary commutative rings and unitary 
homomorphisms is called the category of 
commutative rings and is denoted by (Rings). 

(5) If we take ‘differentiable manifolds as 
objects and differentiable mappings as mor- 
phisms, we obtain the category of differentiable 
manifolds. Similarly, for tanalytic manifolds 
and analytic mappings we obtain the category 
of analytic manifolds. 

(6) Taking topological spaces as objects and 
continuous mappings as morphisms, we get a 
category called the category of topological 
spaces and denoted by (Top). On the other 
hand, if we take the thomotopy classes of 
continuous mappings as morphisms and 
define their composition in the natural way, 
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we obtain another category, which is called the 
homotopy category of topological spaces. 

(7) Fix a tpreordered set 7. Taking the ele- 
ments of J as the objects and the pairs (x, y) of 
elements of J with x € y as the (unique) mor- 
phism from x to y, we get a category, in which 
we define the composite of the morphisms 
(x, y) and ( y, z) to be (x, z). 

In examples (1) through (6), the totality of 
the objects Ob(@) is not a set, but a fclass 
(— 381 Sets G; for the logical foundation of 
category theory — [3,9]). 


C. Diagrams 


If a set of arrows {A,} and a set of points {Bp} 
are given in such a way that each arrow A, has 
a unique initial point and a unique endpoint, 
then we say that {A,, B,} is a diagram. (Usu- 
ally, we consider the case where each point B, 
is the initial point or the endpoint of at least 
one A, (Fig. 1).) Let € be a category and 

{A,, Bs} a diagram. If we associate a mor- 
phism f, in € with each arrow A, and an 
object Z; € € with each point Be so that 

f, c Hom(Z;, Z,) whenever A, has the initial 
point B, and the endpoint B,, then we say that 
{ Ja Zg} is a diagram in the category @ (Fig. 2). 
Suppose, furthermore, that the following con- 
dition is satisfied: For any pair of points B, 
and B,, and for any sequence of adjacent 
arrows A. A, starting at B, and ending 
at B, (i.e., the initial point of A. is Bẹ, the 
endpoint of A, is the initial point of A... and 
the endpoint of A, is B,), the composite 

fa © Sem 10 Ofa, (c Hom(B,, B,)) depends 
only on B, and B,. Then the diagram in @ is 
said to be a commutative diagram. For exam- 
ple, commutativity of Fig. 2 is equivalent to 


fao fi fao h= fs- 


B, A, B 
oe X, a X, 
Ås d 
dëi, Ah 
Än X 
B, B Pm d 
Fig. 1 Fig. 2 


D. Miscellaneous Definitions 


A morphism f: X > Y in a category € is called 
an isomorphism (or equivalence) if there exists a 
morphism g: Yo X such that fog=ly,gof= 
1,. In this case, g is determined uniquely by 

f and is itself an isomorphism. We call g the 
inverse morphism of f. Then the inverse of g 

is f. An isomorphism is sometimes written 
f: X 5Y. Two objects X and Y are said to be 
isomorphic if there is an isomorphism X >Y, 
and then we write X = Y. The composite of 
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isomorphisms is again an isomorphism. In 
particular, an isomorphism X >X is an inver- 
tible element of the semigroup Hom(X, X), 
and is called an automorphism of X. Examples 
of isomorphisms are bijections in (Sets), group 
isomorphisms in (Gr), the R-isomorphisms in 
gM, ring isomorphisms in the category of 
rings, tdiffeomorphisms in the category of 
differentiable manifolds, and homeomorphisms 
in (Top). 

A morphism f: X > Y is called a monomor- 
phism (or injection) if for any object Z and for 
any morphisms u, v: Z ^ X(u # v) we have 
fouzfov.Dually, f: X —Y is called an epi- 
morphism (or surjection) if for any u, v: Y 
Z(u#v) we have uo f #vo f. In the category 
of sets the monomorphisms and the epimor- 
phisms coincide, respectively, with the injec- 
tions and the surjections as mappings (— 381 
Sets). A monomorphism which is at the same 
time an epimorphism is called a bijection. An 
isomorphism is always a bijection, but the 
converse is false in some categories. 

Two monomorphisms f; : X, X and 
TL X55 X (with the same X) are said to 
be equivalent if there exist g,: X, — X, and 
d X5 X, such that f, = f; og, and f; = 
f,0 9g; (Fig. 3). An equivalence class with 
respect to this equivalence relation is called a 
subobject of X. Similarly, we define a quotient 
object of X as an equivalence class of 
epimorphisms X > X;. 


Fig. 3 


An object e of a category € is called a 
final object of € if for every object Y of €, 
Hom(Y,e) contains one and only one element. 
Dually, an object e ts called an initial object 
(or cofinal object) if Hom(e', Y) contains one 
and only one element for every Ye. If e, and 
e, are final objects, then there is a unique iso- 
morphism e, Se, and similarly for initial 
objects. A set with only one element is the 
final object in (Sets), and a space with only 
one point is the final object in (Top). In the 
category (Gr) (resp. (Ab)), the trivial group {1} 
(or {0}) is the final object and the initial object 
at the same time. In the category of commuta- 
tive rings, the zero ring {0} is the final object 
and the ring of rational integers Z ts the initial 
object. 


E. Product and Coproduct 


Let X, and X, be objects of a category €. We 
say that a triple (P, p,, p;) consisting of an 
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object P and morphisms np P X, (i= 1,2) is 
the product (or direct product) of X, and X, if 
for any pair of morphisms f;: X X; (i= 1,2), 
there exists a unique morphism f: X P with 
pio f = f; (i— 1,2) (Fig. 4). If (P", p4, p) is an- 
other product of X, and X,, then by virtue of 
this definition there is a unique morphism 

f: P-P' such that p;o f =p; (i— 1,2), and f is 
an isomorphism. The product is unique in this 
sense. The product (or any one of the prod- 
ucts) of X, and X, is denoted by X, x X, or 
by X, II X;. 


Fig. 4 


The product in the categories of sets, of 
groups, of rings, and of topological spaces 
coincides with the notion of *direct product in 
the respective systems. In a general category, 
the product does not always exist. Suppose the 
product X x X exists for an object X; then 
there is a unique morphism Ay: X > X x X such 
that 1y 2 p, 0 Ay — p; 0 A,, which is called the 
diagonal morphism of X. Let f: X ^ X; (i— 1.2) 
be morphisms and assume that the products 
(X4 x X3, pi, P2) (X5 x X5, P1, p?) exist. Then 
there is a unique morphism f: X, x X, 

X| x X5 satisfying p;o f — f;o p; (i— 1,2). This 
f is denoted by f, x f2. On the other hand, if 
gi: X 5 X,(i— 1,2) are given, the unique mor- 
phism g: X o X, x X, with pjog—g;(i—1,2) is 
denoted by (g,,92). We have (g;,g;) -(g1 X 
g;) O Ay if X x X exists. 

The dual notion of product is coproduct. 
We say that a triple (S, j,,j,) of an object S and 
morphisms j;: X; S (i= 1,2) is the coproduct 
(or direct sum) of X, and X, if for any mor- 
phisms f;: X, X (i— 1,2) there exists a unique 
morphism f: S X with foj;— f; (i= 1,2) 

(Fig. 5). The coproduct, like the product, is 
uniquely determined up to canonical iso- 
morphisms. It is denoted by X, + X, or by 

X, U X5. The coproduct in (Gr) is the "free 
product. In (Ab), or more generally in wë. the 
product of two objects can be identified with 
the coproduct (= direct sum) (^ 277 Modules 
F). The coproduct in the category of com- 
mutative rings is the "tensor product over Z. 


X 
YEN 
LASN 


Xi h Ié X» 


Fig. 5 


Product and coproduct can also be defined 
for a family (X;j;., of objects. Namely, the 
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product of {X;};.; is an object P together with 
a family of morphisms p;: P X; (i€ I) having 
the property that for any family of morphisms 
f: X 5 X; (ied), there exists a unique morphism 
f: X 9 P such that p;o f = f; (ie /). The product 
is unique up to canonical isomorphisms, and 
similarly for the coproduct (— Sections F, L). 


F. Dual Category 


In the theory of categories we often encounter 
the dual treatment of notions and propo- 
sitions. To be precise, we may define the no- 
tion of the dual category 4? of a category € as 
follows: The objects of €? are those of €, Le, 
Ob(@°) = Ob(@); for any objects X and Y we 
put Hom,.(X, Y) 2 Hom,(Y, X); if f: Xo Y 
and g: Y>Z in €? (1e, f: YO X and g:Zo 

Y in €), then the composite go f in €? is de- 
fined to be fog in €. It is clear that @° then 
satisfies the axioms of a category. Quite gener- 
ally, given a proposition concerning objects 
and morphisms we can construct another 
proposition by reversing the directions of the 
morphisms, and we call the latter the dual 
proposition of the former. The dual propo- 
sition of a proposition in € coincides with a 
proposition in €^. For instance, a monomor- 
phism (epimorphism) in € is an epimorphism 
(monomorphism) in €^, and the final (initial) 
object in € is the initial (final) object in @°. 
The product (coproduct) in € is the coproduct 
(product) in €^. Although the notion of the 
dual category is defined quite formally, it is 
useful in describing relations between specific 
categories. The dual category of (Ab), for 
instance, is equivalent to the category of 
commutative compact topological groups 
(tPontryagin’s duality theorem). 


G. Categories over an Object 


Fix a category € and an object See A pair 
(X, f) of an object X €@ and a morphism f: X 
—$ is called an object over S or an S-object, 
and f is called its structure morphism. We 
often omit f and simply say "an S-object X" if 
there is no danger of misunderstanding. If 
(X, f) and (Y, g) are S-objects, a morphism 
h: X 5 Y such that f=goh is called an S- 
morphism from (X, f) to (Y, g). The category 
whose objects are the S-objects and whose 
morphisms are the S-morphisms is called the 
category of S-objects in €, and is denoted by 
eis It has (S, 1,) as the final object. The prod- 
uct of two S-objects X and Y, taken in @/S, is 
called the fiber product of X and Y over S (in 
€), and is denoted by X xs Y or XIIs Y. The 
dual notion of the fiber product is called fiber 
sum (or amalgamated sum). Thus for two 
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morphisms f:S—X and g:$ Y, the fiber 
product of X and Y over S in @°/S is the fiber 
sum of X and Y (with respect to S); it is de- 
noted by X |], Y. 

Let € be the category of commutative rings 
and Kee Then the family of K-objects in @° 
is precisely the family of commutative K- 
algebras. The fiber product A x, B in €^, i.e., 
A [I« B in €, is the tensor product A @x B of 
algebras. 


H. Functors 


Let @ and @’ be categories. A covariant functor 
F from € to €" is a rule which associates (1) 
with each object X in €, an object F(X) in @’, 
and (2) with each morphism f: X > Y in €, 

a morphism F(f): F(X)— F(Y) such that 
F(gof)- F(g)o F(f), F(1x) 2 ln. A con- 
travariant functor is defined dually, by mod- 
ifying this definition to F( f): F(Y) F(X), 
F(gof)- F(f)oF(g). Thus a contravariant 
functor from € to €' is the same as a covariant 
functor from the dual category €? to @ (or 
from € to €'?). Functor is a general term for 
both covariant functors and contravariant 
functors, but some authors use the word ex- 
clusively in the sense of a covariant functor. 

A functor in several variables is defined to be 
a functor from the product category of the 
categories in which the variables take their 
values. 

A covariant functor F:€ —€' is said to be 
faithful (fully faithful) if for any X, Yeg, the 
mapping Hom(X, Y) Hom(F(X), F(Y)) in- 
duced by F is injective (bijective), and simi- 
larly for contravariant functors. A faithful 
covariant functor F:@-»@’ which maps dis- 
tinct objects of € to distinct objects of €" is 
called an embedding, and in this case @ can 
be identified with a subcategory of @” by F. 

A fully faithful covariant functor F:€ €' is 
called an equivalence (between the categories) if 
it satisfies the condition that for any object X' 
of 6”, there exists an object X of € such that 
F(X)z X'. In this case we can consider the 

two categories essentially the same. A con- 
travariant functor from € to @’ which defines 
an equivalence from €? to €' is called an 
antiequivalence. 


I. Examples of Functors 


(1) Let € be the category of groups (or rings). 
For any X e€ let F(X) be the underlying set of 
X (i.e., the set obtained from X by “forgetting” 
its structure as a group or ring), and for any 
homomorphism f put F( f) — f. Then we get 

a faithful covariant functor (often called the 
forgetful functor) E :@ — (Sets). 
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(2) Let € be any category and fix an object 
X of €. Then we get a covariant functor hy:@ 
—(Sets) as follows: With each Ye@ we associ- 
ate the set Hom(X, Y), and with each mor- 
phism f: Y Y' in € the mapping f o(where 
f o:Hom(X, Y)>Hom(X, Y?) is defined by 
(f o)(g) 2 f og). Similarly we define a con- 
travariant functor h*:@ 5 (Sets) by h*(Y)= 
Hom(Y, X) and ^*(f)2 0 f. 

(3) Let p: A B be a homomorphism of 
rings. With each left A-module M associate 
the tscalar extension p*(M)= Be, M, and 
with each A-homomorphism f associate the 
B-homomorphism p*( f)— 15 © f. Then we get 
a covariant functor of: ,@—>,.%. 

(4) Let R be a ring. With each left R-module 
M associate its dual module M* =Hom,(M, 
R), and to each R-linear mapping f associate 
its tdual mapping 'f =o f. Then we get a con- 
travariant functor pM — fg, and similarly for 
Mp pM. 

(5) For each differentiable manifold X let 
F(X) denote the commutative ring of the dif- 
ferentiable functions on X, and for each dif- 
ferentiable mapping f: X > Y let F( f) be the 
ring homomorphism o f: F(Y) F(X). Then F 
is a faithful contravariant functor. 

(6) Fix an Abelian group A. By associating 
with each topological space X the cohomology 
group H(X, A) and with each continuous map- 
ping f: X > Y the homomorphism H(Y, A) 
H(X, A) induced by f, we obtain a contrava- 
riant functor from (Top) to (Ab). 

(7) Fix a topological space X, and let T(X) 
be the set of the open sets in X. Then T(X) is 
ordered by inclusion, so it is a category (— 
Section B, no. 7). The contravariant functors 
from T(X) to (Ab) are precisely the *pre- 
sheaves of Abelian groups over X. We can use 
any category instead of (Ab) to define a pre- 
sheaf over X (— 383 Sheaves). 


J. Natural Transformations 


Let € and e be categories, and denote by 
Hom(@%, @’) the collection of all covariant 
functors € €'. Let F, GeHom($, €"). A nat- 
ural transformation (or functorial morphism) 
from F to G is a function which assigns to 
each object X of € a morphism (X): F(X)5 
G(X) in €' such that for any morphism f: X ^ 
Y in €, the equation G(f)o g(X)=@(Y)oF(f) 
holds; in other words, the accompanying dia- 
gram is commutative: 


o) 
X | F(X)  G(X) 


f F(f) GCf) 
EL Z5 l 
Y | F(Y) GY) 
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A natural transformation between contrava- 
riant functors is defined similarly. For in- 
stance, let A and B be Abelian groups and let 
H'(:, A) and HI: . B) be the contravariant 
functors of tcohomology viewed as functors 
(Top)-*(Sets). Then the natural transform- 
ations between them are the tcohomology 
operations. 

Let o: FG be a natural transformation, 
and suppose that o(X): F(X) G(X) is an 
isomorphism for every X €@. Then the inverse 
transformation G— F of ọ exists, and q is 
called a natural equivalence (functorial isomor- 
phism or isomorphism) and is written o: FSG. 

Suppose that Ob(€) is a set. Then the collec- 
tion Hom(F, G) of all natural transformations 
FG is also a set, and hence we can consider 
Hom(@, €") a category in which the objects are 
the covariant functors € 5 €", the morphisms 
are the natural transformations, and the com- 
position of morphisms is the natural one. Then 
Hom(€°, €") is the category of contravariant 
functors from € to €'. In particular, the cate- 
gory Hom(@°, (Sets)) is sometimes denoted 
by €. 

Given a category €, a covariant (resp. con- 
travariant) functor E (Sets), and an object 
X €€, we can define a canonical bijection 
®,:Hom(h,, Ei F(X) (resp. Hom(h^, HG 
F(X)) by 6,(o) = o(X)1,y. (The functors hy 
and h* were defined in Section I.) The inverse 
mapping of ®, assigns to če F(X) the natural 
transformation ọ:hy—>F defined by og(Y)u— 
F(u)&(Y€€). In particular, if we take F = 
hy(h*), we obtain a canonical bijection 
Hom(h,, hy) SHom(Y, X) (Hom(h*, bi 
Hom(X, Y)). It follows that there is a fully 
faithful contravariant (covariant) functor € > 
Hom(€, (Sets)) (€ +Hom(@°, (Sets)) = €) 
which associates h,(h*) with X e€. 


K. Adjoint Functors 


Let F: 2 €' and F':€' € be covariant 
functors. Suppose that there is a rule which 
assigns to each pair of objects M e € and 
M'e€' a bijective mapping Oy, m : Hom,(M, 
F’(M'))SHom,.(F(M), M’) such that for any 
pair of morphisms N 5 M in € and M'—N' 
in @’, the following diagram induced by the 
morphisms is commutative: 


Home( M, F'( M^) = Home( F(M), M’) 


| | 


Hom,(N,F’(N’)) *3 Home(F(N),N?) 


Then we say that F is a left adjoint functor of 
F' and that F' is a right adjoint functor of F. 
We can regard Hom,.(F(M), M") as a functor 
from € x €' (contravariant in the variable 
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Mee covariant in the variable M'€@’) to the 
category of sets, and similarly for Hom,(M, 
F'(M')). This commutativity of the diagram 
means that these two functors are isomorphic 
(— Section J). 

For instance, let A and B be rings and La 
fixed B-A-tbimodule. Let F: 4,7 4.7 and 
F': pM — 4M be the functors defined by 


F(M)=L®,M, F'(M’)=Hom,(L,M’), 


where the assignment for morphisms is defined 
in the natural way. Then F is the left adjoint of 
F' and F' is the right adjoint of F. In parti- 
cular, let p: A>B be a homomorphism and 
consider the case L — B. Then F is the functor 
p* : 44 — and F’ is the functor py: gM 3 
aM, so that p* is the left adjoint of p, (and 

p, is the right adjoint of p*) (— 277 Modules, 
K, L; for more examples of adjoint functors — 


[11]). 


L. Representation of Functors 


We begin by discussing an example. Let T be 
a set, and consider the following problem: Is 
it possible to find a group X and a mapping 
€:T—X such that, for any group Y and for 
any mapping 4: T Y, there exists a unique 
homomorphism u: X > Y with uo č =n? The 
answer Is yes; it is enough to take the "Tree 
group X generated by T and the canonical 
injection č: T5 X (Fig. 6). On the other hand, 
let F(Y) be the set of all mappings T> Y, and 
for each group homomorphism f: Y Y' 
define the mapping F( f): F(Y) F(Y’) by 
F(f)n=fon (ye F(Y)). Then we get a co- 
variant functor from the category @ of groups 
to the category of sets, F : € (Sets). We can 
now reformulate the condition on X €@ and 
£c F(X) as follows: 

For any Ye@ and for any ne F(Y), there 
exists a unique morphism u: X > Y such that 
F(u)é — n. 


T. 
| ? 
48 


Fig. 6 


Proceeding to the general case, let 6 be an 
arbitrary category and let F: € (Sets) bea 
functor. If there exist an object X of @ and an 
element Z of F(X) satisfying the condition just 
stated (with the modification u: Y — X in the 
contravariant case), then we say that the pair 
(X, č) represents the functor F, or less specifi- 
cally, that X represents F, and we call č the 
canonical element of F(X). We also say that F 
is representable. The condition stated above is 
a formulation of the so-called universal map- 
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ping property. If (X’, ¢’) also represents F, the 
unique morphism u: X > X' (or X'2 X) with 
F(u)€=€' is necessarily an isomorphism. 

When (X, £) represents F, the natural trans- 
formation q:hy  F (h* 5 F in the contrava- 
riant case) which corresponds to € by the 
canonical bijection Dy: Hom(hy, F)2x F(X) is 
an isomorphism. Conversely, if there is a func- 
torial isomorphism o: hy F (or h* > F) for 
some X €@, then the object X represents F, 
with the canonical element of F(X) the element 
which corresponds to o by the canonical bijec- 
tion Dy, ie, — o(X)ly. 

We have already seen the example of a free 
group; here we list a few more examples. (1) 
Let ( X;];,, be a family of objects in a category 
€. For each Yeg we put F(Y)- [T,.; Hom(Y, 
Xj), and for each morphism f: Y Y’ we define 
the mapping F(f): F(Y')S F(Y) by F(f)(f) = 
(fio f). Then we get a contravariant functor 
F:«€ (Sets). A pair (X, č) which represents 
F (where £e F(X)— II; ,, Hom(X, X;)) is the 
product of {X;}. Thus representability of F is 
equivalent to the existence of the product of 
1 Xj], and similarly for the coproduct. 

(2) Let R be a ring, M a right R-module, and 
N a left R-module. For each Abelian group Y 
let F(Y) denote the set of the R-balanced map- 
pings M x N >Y (— 277 Modules J). Since a 
homomorphism f: Y Y' induces a natural 
mapping F( f): F(Y) F(Y") by composition, 
we obtain a covariant functor F:(Ab)— (Sets). 
This functor is representable: the pair consist- 
ing of the tensor product M Ge N and the 
canonical mapping M x N>M Gy N repre- 
sent it. 

(3) Let R be a commutative ring and S a 
subset of R. For each commutative ring Y, let 
F(Y) denote the set of homomorphisms R >Y 
that map the elements of S to invertible ele- 
ments of Y. As in the preceding example, we 
obtain a covariant functor F: (Rings) (Sets). 
This functor is represented by the tring of 
quotients S ! R and the canonical homomor- 
phism R>S7'R. 


M. Groups in a Category 


Let @ be a category with a final object e, and 
assume that a finite product always exists in @. 
If an object Ge € and morphisms «:G x G^ 

G, B: GG, £:e—G are given such that the 

the diagrams of Fig. 7 are commutative, then 
(G, a, B,c) is called a group in € (group object in 
€ or €-group). 

If € is the category of sets, then « defines a 
law of composition in the set G, and the image 
of e by e is the identity element and f(x) is the 
inverse of x, so that G is an ordinary group. If 
€ is the category of topological spaces (analy- 
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tic manifolds, algebraic varieties, 'schemes) 
then G is a *topological group (*Lie group, 
falgebraic group, *group scheme). 

We can also define the €-group by lifting 
the group concept in (Sets) to the category € 
by means of the functor h*. Namely, let G be 
an object of €, and suppose that for each Ye € 
the set A (Y) - Hom(Y, G) is equipped with a 
group structure and that for each morphism 
f: YY’ the induced mapping h9?(Y') h9(Y) 
is a group homomorphism. In other words, 
suppose that bi is a contravariant functor 
from € to the category of groups. Then the 
object G with the additional structure on A is 
called a €-group. This definition is equivalent 
to the one given above. 





GXGXxG--—-6GxG 
l5Xa o 
GxG z G 
EK (15,2) 
GX eG XG G——GXG 
195 2d | L 
G e S G 





N. Additive Categories 


A category € is called an additive category if 
for each pair X, Y e€, the set of morphisms 
Hom(X, Y) has the structure of an additive 
group such that (1) the composition of mor- 
phisms is distributive in both ways: ho( f +9) 
=hofthog,(ft+tg)oh=foh+goh; (2) 
there exists an object 0’ with Hom(0’,0’) = {0}; 
(3) the product (or the coproduct) of any two 
objects exists. Then the object 0’ in (2) is a final 
and initial object, and is called the zero object. 
Both the product and the coproduct of any 
two objects exist and can be identified. The 
dual category of an additive category is also 
an additive category. A functor F from an 
additive category to another is called an addi- 
tive functor if F( f+g)=F(f)+F(g) holds 

for morphisms. In an additive category @, 
Hom(X, Y) is an additive functor from € to 
(Ab) in each variable. 

For any ring R, the category of left (or right) 
R-modules is an additive category. The follow- 
ing definitions are generalizations of the corre- 
sponding concepts in the theory of modules. 
The kernel of a morphism f: A B is a pair 
consisting of an object A' and a monomor- 
phism i: A'— A with foi=0, such that any 
morphism u: X >A with fou=0 is divisible 
by i (that is, u=iov for some v: X >A’). Dual- 
ly, the cokernel of f is a pair consisting of 
an object B' and an epimorphism j: B B' 
with jo f =0 which divides any morphism 
u: Bo X with uo f 20. We write A' = Ker f, B' 
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= Coker f. The kernel of j: B Coker f is 
called the image of f and is denoted by Im f; 
the cokernel of i: Ker f> A is called the 
coimage of f and is denoted by Com f. If all 
these exist, it follows from the definitions that 
there is a unique morphism Com f ^5 Im f 
such that the composite of A Coim f ^5Im f 
B is equala to f. 

An additive category € is called an Abelian 
category if it satisfies the following conditions: 
(1) every morphism has a kernel and a coker- 
nel, (2) for every morphism f, the morphism 
Com f — Im f just mentioned is an isomor- 
phism. The dual category of an Abelian cate- 
gory is also Abelian. The categories of Abel- 
ian groups, of R-modules, and of sheaves 
of @-modules on a tringed space (X, @) are 
important examples of Abelian categories. 
Many propositions which are valaid in (Ab) 
remain valid in any Abelian category. In parti- 
cular, the notion of an *exact sequence is de- 
fined in an Abelian category in the same way 
as in (Ab), and the fiber product and fiber sum 
of a finite number of objects always exist in an 
Abelian category. A functor between Abelian 
categories which carries exact sequences into 
exact sequences is called an exact functor; 
(such a functor is automatically additive). If € 
is a category of which Ob(@) is a set and if €" 
is an Abelian category, then Hom(@, €") is an 
Abelian category. Given an Abelian category 
€ and a subcategory "e" which satisfies certain 
conditions, one can construct an Abelian 
category @/@' which is called the quotient 
category (Serre's theory of classes of Abelian 
groups; — [8]). 

If € is an Abelian category of which Ob(@) 
is a set, there is an embedding of € into the 
category wë of modules over some ring R by 
a fully faithful flat exact covariant functor (full 
embedding theorem, B. Mitchell, Amer. J. 
Math., 86 (1964)). This remarkable theorem 
enables us to extend results obtained for 
modules to the case of Abelian categories. 

The notions of category and functor were 
introduced in [7] and were applied first in 
topology and then in homological algebra and 
algebraic geometry (— 200 Homological 
Algebra). 
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53 (XXI.17) 
Cauchy, Augustin Louis 


The French mathematician Augustin Louis 
Cauchy (August 21, 1789-May 25, 1857) 
graduated from the Ecole Polytechnique in 
1807 and from the Ecole des Ponts et Chaus- 
sees in 1810, to become a civil engineer. In 
1816, his mathematical works were recognized, 
and he was appointed a member of the Aca- 
démie des Sciences while a professor at the 
Ecole Polytechnique. After the July revolution 
in 1830, he refused to pledge loyalty to Louis- 
Philippe and fled to Turin; he later moved to 
Prague. He returned to France after the revo- 
lution of 1848 and became a professor at the 
University of Paris, where he remained until 
his death. He was a Catholic and a Royalist all 
his life. 

His scientific contributions were numer- 
ous and covered many fields. In algebra, he 
did pioneering work in *determinants and in 
the theory of ‘groups. He also made notable 
achievements in theoretical physics, optics, 
and the theory of elasticity. His main field 
was analysis. He was interested in making 
analysis rigorous by giving calculus a solid 
foundation in such works as Cours d'analyse 
de l'Ecole Polytechnique (1821). In his paper 
"Memoire sur les intégrales définies prises 
entre les limites imaginaires” (1825), he proved 
the main theorem of the theory of functions of 
a complex variable. Another important work 
is his proof of the existence theorem for the 
solutions of tdifferential equations in the cases 
of real and complex variables 
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Cayley Algebras 


Let Q be a !quaternion algebra over a field K 
of characteristic zero. A general Cayley algebra 
€ 1s a 2-dimensional Q-tmodule Q + Qe with 
the multiplication (q + re)(s + te) 7 (qs + ytr) 4- 
(tq 4- rs)e, where q, r, s, te Q, y is a given ele- 
ment in K, and t, s are the tconjugate quater- 
nions of t, s, respectively. The elements of € 
are called Cayley numbers; € is a nonassocia- 
tive, talternative algebra of dimension 8 over 
K (— 231 Jordan Algebras). The map a= 
q+re=>aā =q —re is an *antiautomorphism 

of €. Define two maps € —5 K by N(a) - aa — aa 
(norm of a) and T(a) 2a +4 (trace of a). Then 
every a in € satisfies the equation x? — T(a)x + 
N(a) —0. Furthermore, N(ab)= N (a)N(b) for 

a, b in €. The tquadratic form N (x)= T(xx)/2 
characterizes €. In particular, any two (non- 
associative) general Cayley algebras over the 
same field K which are not falternative fields 
are isomorphic. 

In order for € to be an alternative field, 
either of the following two conditions is neces- 
sary and sufficient: (1) N(a) 20 implies a —0; 

(ii) Q is a noncommutative division algebra 
and y cannot be expressed in the form c? — 
ÀE? — un? + Au? (o, En, Ce K). (For the mean- 
ing of A, u with respect to Q — 29 Associative 
Algebras D.) Every alternative field of finite 
dimension is a general Cayley algebra. 

In particular, when Q is the *quaternion 
field over the real number field with 4 = u = 
— 1, the general Cayley algebra over Q with 
y= —1 is called the Cayley algebra. When K 
is an talgebraic number field of finite degree, 
there are only a finite number of nonisomor- 
phic general Cayley algebras over K. 

The Lie algebra D(@) of all tderivations of a 
general Cayley algebra € is a *simple Lie 
algebra of type G,. If K is the real number 
field, the identity component of the group of 
all tautomorphisms of the Cayley algebra @ is 
a compact simply connected tsimple Lie group 
of type G,. The Cayley algebra @ is the unique 
alternative field over the real number field K. 
This last fact is important because of the fol- 
lowing proposition: In the theory of *non- 
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Desarguesian projective planes, the field which 
gives rise to the coordinates is an alternative 
field. Let QP, be the set of all 3 x 3 fHermitian 
matrices A over the Cayley algebra @ such 
that tr A— 1, A? = A. Then we can define a 
structure of a projective plane on €P,, which 
with this structure is called the Cayley projec- 
tive plane. Furthermore, let 3 be the set of all 
3x3 Hermitian matrices over €, with a multi- 
plication in "3 defined by A: B—(1/2)(BA + 
AB). The identity component G of the group 
of all automorphisms of 3 is a compact simply 
connected simple Lie group of type F,. This 
group G acts on LP, transitively, and 9P, = 
G/Spin(9) (— 249 Lie Groups; Appendix A, 
Table 5.IIT). 
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55 (XX.6) 
Celestial Mechanics 


A. General Remarks 


The motions of planets, comets, the moon, and 
satellites in our solar system are the main 
topics in celestial mechanics (— 392 Spherical 
Astronomy). However, studies in this subject 
can also include motions of fixed and binary 
stars in our galaxy, equilibrium figures of 
celestial bodies, and rotational motions of the 
earth and the moon. 

Celestial mechanics is usually based on 
*Newtonian mechanics, with the effects of 
*general relativity sometimes taken into ac- 
count to determine corrections to the orbits of 
celestial bodies. Therefore the main task of 
celestial mechanics is to solve differential 
equations of motion based on Newtonian 
mechanics. However, since the equations for 
the problem of nz 2 bodies cannot be solved 
rigorously (— 420 Three-Body Problem), 
appropriate methods have been devised where- 
by we may obtain approximate solutions of 
the equations with accuracy comparable to 
that of observations. 

The two-body problem, which concerns the 
behavior of two celestial bodies regarded as 
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points exerting mutual interactions, can be 
reduced to a one-body problem with reference 
to a central force, since integrals of motion of 
the center of gravity for the system exist. The 
*Hamilton-Jacobi equation for the one-body 
problem is of *separable type and can be 
solved completely. The orbit for the two-body 
problem is a *conic with one of its *foci at the 
center of gravity. The majority of celestial 
bodies in the solar system actually perform 
elliptic motions. *Kepler’s orbital elements for 
elliptic motion are functions of the integration 
constants in the solution of the Hamilton- 
Jacobi equation and are determined by the 
initial conditions. 


B. Perturbations 


In studying the tn-body problem, we first solve 
certain two-body problems and then apply the 
method of *perturbations, i.e., the method of 
variation of constants, in order to obtain solu- 
tions developed as tpower series of small 
parameters. The parameters are ratios of the 
masses of planets to that of the sun for plane- 
tary motions and the ratio of the geocentric 
lunar distance to the solar distance for lunar 
motion. 

Electronic computers have made it possible 
to compute planetary coordinates for long 
intervals of time by solving numerically dif- 
ferential equations of motion including all 
possible interactions. However, in discuss- 
ing the stability of the solar system, analytic 
methods are more effective, particularly the 
method of obtaining secular perturbations by 
eliminating short-periodic terms by canonical 
transformations. This is one of the averaging 
methods of solving differential equations. 
However, as the solutions obtained by means 
of perturbation methods are not always con- 
vergent, most important problems related 
to the stability of motion have not yet been 
solved rigorously. Secular perturbations for 
planetary motions can be derived by solving 
differential equations that are linearized by 
neglecting cubic powers of orbital eccentric- 
ities and inclinations to the ecliptic (— 309 
Orbit Determination), which are small quan- 
tities. The teigenvalues of these linear differen- 
tial equations correspond to mean angular 
velocities of the perihelion and the ascending 
node. The equations for the eigenvalues are 
called secular equations. 


C. Artificial Satellites 


To discuss motions of artificial satellites close 
to the earth, the latter cannot be regarded as a 
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point or as a sphere but must be assumed to 
be an oblate spheroid, i.e., an tellipsoid of 
revolution. The effects of oblateness on the 
motion of satellites can be derived as pertur- 
bations of the theoretical elliptic motions ob- 
tained as the solutions of this two-body prob- 
lem under the assumption that the earth is 
spherical. Also, by utilizing a special potential 
very close to the geopotential, we can find a 
Hamilton-Jacobi equation of separable type 
which is solvable. This special potential ap- 
pears in the problem of two fixed centers with 
equal masses situated on an imaginary axis. 
When the geopotential is assumed to be axi- 
ally symmetric, the equations of motion for the 
satellites have two degrees of freedom; there- 
fore there appear two fundamental frequencies 
related to the special potential. When these 
two frequencies are equal, the problem is 
called a critical inclination problem and is 
important from the mathematical point of 
view. Theories for satellites can be applied to 
the motions of fixed stars in the galaxy. 


D. Equilibrium Figures 


There is a large literature concerning equilib- 
rium figures and the stabilities of celestial 
bodies assumed to consist of spinning fluids. 
The two-body problem with tidal interactions 
is particularly important; problems concerning 
the evolution of the earth-moon system are 
special cases of such a problem. 

The theory of rotation of the earth as it is 
affected by tprecession, ‘nutation, and latitude 
variations is also a part of celestial mechanics; 
for this theory, elastic theory and geophysics 
are applied. 

For the n-body problem — 420 Three-Body 
Problem. 
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Characteristic Classes 


A. General Remarks 


The theory of characteristic classes arose from 
the problem of whether or not there exists a 
ttangent r-frame field on a tdifferentiable 
manifold (E. Stiefel [5]) (— 105 Differentiable 
Manifolds). The importance of the character- 
istic class as a fundamental invariant of *vec- 
tor bundle structure is now fully recognized 
(— 114 Differential Topology, 147 Fiber 
Bundles, 427 Topology of Lie Groups and 
Homogeneous Spaces). 


B. Stiefel-Whitney Classes 


Let € =(E, B, R") be an n-dimensional real 
*vector bundle (called an R"-bundle) with 
tparacompact Hausdorff tbase space B, fiber 
R”, and *orthogonal group O(n) as the fstruc- 
ture group. Then the element w;(£) of the i- 
dimensional tcohomology group H'(B; Z,) 
(i=1,2,...,n) of the base space B with coeffi- 
cient in Z, = Z/2Z, called the i-dimensional (or 
ith) Stiefel-Whitney class, and the element w(¢) 
=1+w,(&)+...+w,(€) of the Àcohomology 
ring H*(B; Z.), called the total Stiefel- Whitney 
class, are defined as follows. First, we deal with 
the case n= 1. We call the infinite-dimensional 
real projective space the tinductive limit P^(R) 
= lim P"(R) of the finite-dimensional real *pro- 
jective space P"(R). The nontrivial tline bundle 
y, over the infinite-dimensional real projective 
space P”(R) is a universal R'-bundle (univer: 
sal bundle for the orthogonal group O(1)); 
therefore any R'-bundle é =(E, B, R!) is equiv- 
alent to an *induced bundle from the universal 
bundle y, by a fclassifying mapping f: B5 
P^(R):£z f;*y,. We define the 1-dimensional 
universal Stiefel- Whitney class w,(y,) to be 

the generator of H'(P^(Ry; Z;) and set w,(£)— 
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fw,(y,). For general n, we consider the prin- 
cipal O(n)-bundle (P, B, O(n)) *associated with 
the given R"-bundle č =(E, B, R”). Let Q, be 
the subgroup of O(n) consisting of all diag- 
onal matrices. Then the orbit space Y= P/Q, 
is the base space of the tprincipal Q,-bundle 
5n —(P, P/Q,,Q,). Let p: Y>B=P/O(n) be the 
natural projection. Then the R"-bundle p*é 
over Y induced by p is associated with y and 
is equivalent to the "Whitney sum of n line 
bundles (F. Hirzebruch [4]), p*£z£,  ... ON 
&,. Moreover, the homomorphism p*: H*(B; 
Z,) 5 H*(Y, Z,) is injective (A. Borel [1]). 
Therefore we can uniquely define the total 
Stiefel- Whitney class wë) of the R"-bundle by 
the relation p*w(Z) 2 w(£,)... w(£,). The Stiefel- 
Whitney classes defined above are compatible 
with bundle mappings f:w(f*é)= f *w(£) (— 
147 Fiber Bundles M). The Stiefel- Whitney 
class w,(y,) € H'(Bo,; Z2) (1 <i<n) of the 
universal R"-bundle y, over the ‘classifying 
space Bom is called the i-dimensional univer- 
sal Stiefel-Whitney class. For the Whitney sum 
of vector bundles, we have w(é ® yn) = w(£)w(). 
In order for an R"-bundle Z to be forient- 
able, namely, for č to have an SO(n)-structure, 
it is necessary and sufficient that w,(€)= 
0. For an oriented R"-bundle Z the Euler- 
Poincaré class X, (c) is defined to be the tpri- 
mary obstruction X,(£)e H"(B; Z) for con- 
structing a *cross section of the *associated 
(n — 1)-sphere bundle. In particular, the Euler- 
Poincaré class of the universal bundle for 
SO(n) is called the universal Euler-Poincaré 
class. X,,(€) mod 2 is equal to w,„(č). If n is odd, 
we have 2X,(£) — 0. 


C. Chern Classes 


We consider an n-dimensional complex vector 
bundle c — (E, B, C") (called C"-bundle in the 
following) with a paracompact Hausdorff base 
space B, fiber C", and unitary group U (n) as 
the structure group. The cohomology class 
ci(co)e H?'(B; Z) (i— 1,2, ..., n), called the 2i- 
dimensional (or ith) Chern class, and the total 
Chern class c(w)=1 +c (0) 4- ... - c,(v)e 
H*(B; Z) are defined as follows. In the case 
n=1, let C^ =lim C" (the inductive limit of 
the *complex Euclidean spaces C"), S” be 

its unit sphere, and P^(C) be the infinite- 
dimensional complex projective space consist- 
ing of all complex lines through the orgin O of 
C”. Then the natural mapping S? P*(C) 
defines a universal principal U(1)-bundle 

(S*, P*(C), U(1)). Letz be its associated 
universal C!-bundle. Then we define the 1- 
dimensional universal Chern class c,(7,)é 
H?(P*(C); Z) to be the cohomology class that 
takes the value —1 on the cycle S?(zP!(C)c 
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P^(C)) with the natural orientation. Since a 
general C'-bundle €=(E, B, C!) is induced 
from y; by a classifying mapping f.: B5 
P"(C), we set c, (C) f-*c,(y,). When n» 1, let 
(P, B, U (n)) be the principal U (n)-bundle asso- 
ciated with the given C"-bundle £ —(E, B, C"). 
Let T, be the subgroup of U (n) consisting of 
all diagonal matrices (which is a *maximal 
torus of U(n)). Then the quotient space Y= 
P/T, is the base space of the principal T,- 
bundle y =(P, P/T,, T,). Let p: Y B— P/U (n) 
be the natural projection. Then the C"-bundle 
p* € over Y is associated with y and is equiva- 
lent to the Whitney sum of n complex line 
bundles: p* £z č, OG...  £,. Moreover, 

p* : H*(B; Z) 5 H*(Y; Z) is a monomorphism 
(Borel [1], Hirzebruch [4]). Therefore we can 
uniquely define the total Chern class c(£) of the 
C"-bundle Z by the relation p*c(é)=c(é,)... 
c(£,). The Chern classes, as defined above, 
are compatible with bundle mappings f (i.c., 
c(f *&)-—f*c(£)) (— 147 Fiber Bundles N). 
The Chern class c,(y,)€ H*!(Bygy; Z) (1 «i n) 
of the universal C"-bundle 7, over Bym is 
called the 2i-dimensional universal Chern class. 
Let č, y be complex vector bundles over B. 
Then we have c(& Q 1) 2 c(C)c(r). By the nat- 
ural inclusion U(n) c SO(2n), we can identify 
a C"-bundle o with an oriented R?"-bundle 
Og. Then we have c;(c)mod 2 = w,(«g), 

Wai (Op) 0 (1—0, LL c (0) = X, og). 


Examples. Let (S*"*!, P"(C), U(1)) be the *Hopf 
bundle, and let yf be the associated complex 
line bundle. Then the classifying mapping of y? 
is the natural inclusion P"(C)5 BU(1) « PC 
and c(y1) 2 1 — g,, where g,¢ H*(P"(C); Z) is the 
cohomology class dual to the homology class 
represented by the hyperplane P" ^! (C). On the 
other hand, for the complex line bundle £1 = 
(P"^! (C)) determined by the tdivisor (— 72 
Complex Manifolds) P" ! (C) c P"(C), we have 
e(€{)=1+4+g,, and £1, y1 are dual to each other. 
CH is called the canonical line bundle over the 
complex projective space. Moreover, the Whit- 
ney sum t Q e, of the tangent bundle «(P"(C)) 
of the n-dimensional complex projective space 
P"(C) and the trivial C’-bundle e, is equivalent 
to the Whitney sum of (n+ 1) copies of £1. 
Therefore we have c(«(P"(C))) 2 c(£t 0 7 @ 
Gp) (10g. 


D. Pontryagin Classes 


Utilizing the inclusion mapping O(n) c U (n) of 
structure groups, we can associate a C"-bundle 
£c £O / —1é with an R"-bundle é. We 
define the 4i-dimensional (or ith) Pontryagin 
class of the R"-bundle Z by p,(€)=(—1)'c,(é¢) 
(e H*'(B; Z)) (i= 1,2, ..., [n/2]) (Hirzebruch 
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[4]). In particular, the Pontryagin classes of 
the universal bundle for O(n) are called the 
universal Pontryagin classes. We also define 
the total Pontryagin classes p(c) — 1+ p,(£) 4 
<- + Dyia (6). We have 2c5;,, (£c) 20. For R"- 
bundles č and y, p(£ n) & p(&)p(r) modulo 2- 
torsion elements. We have p;(£) (mod 2)= 
(wa,(£))? and p,(&) - (X;, (2)? for the oriented 
R?"-bundle €. Moreover, for a complex vector 
bundle c, we have (Wu [8]) 


( —1Y pilor) = 2 ( —lY'ci(o)c3,. (e), 


where we put c)(w)=1. 
All such classes as defined in Sections B-D 
are called characteristic classes. 


E. Other Definitions of Characteristic Classes 


Axiomatic Definition. (1) For a C"-bundle c 
over a paracompact Hausdorff base space B, 
Chern classes c;(£)e H?(B; Z) (i20) are de- 
fined, and we have co(&)= 1, c;(£) 20 (i n). (2) 
For the total Chern class c(£) 2 X5, c;(£), we 
have c( f*¢)= f *c(£) for each bundle mapping 
f. (3) For the Whitney sum, we have c(é Q 4) = 
c(č): e(n). (4) Normalization condition: For 
the canonical line bundle £f, we have c(é")= 1 
+g, (— Section C). We can verify the existence 
and uniqueness of c;(&) satisfying these four 
conditions, so Chern classes can be defined 
axiomatically by these conditions (Hirzebruch 
[4]). We can similarly defined Stiefel- Whitney 
classes axiomatically. 


Definition in Terms of Obstruction Classes. 
When the base space is a "CW complex, we 
can define the Chern class of a C"-bundle é 
over B as follows: Let, ,,(C) Ui .,,, 
x U(q— 1) be the tcomplex Stiefel manifold of 
all orthonormal (n — q + 1)-frames in C" with 
Hermitian metric. Then V, ,..,,, (C) is *(2q—2)- 
connected, and its (2g — 1)-dimensional 
*homotopy group a, Ab, ,.,(C)) =Z. Let 
& be the tassoctated bundle of E with fiber 

ka all Then the tprimary obstruction 

(c H?*(B; Z)) to constructing a ‘cross section of 
& coincides with the Chern class c,(£). Anal- 
ogously, we can interpret w,(€) for an R"- 
bundle Z as an obstruction class (— 147 Fiber 
Bundles). 


Definition in Terms of Schubert Varieties. 

We denote by C* the subspace defined by 

Zk+1 =Zk+2 = --- = Zp}y 50 of the space C"*"— 
{Zis -> Znan) lz:€C,i=1,...,n+ N}, and fix 
the sequence of subspaces C! c C? c... c C"*N, 
The set of all complex n-planes X through 

the origin O in C"** forms the tcomplex 
Grassmann manifold AM. e ,(C). We denote 
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by E(y,’) the set of all pairs (X, v), where 
X€M,, y. ,(C) and vis a vector in X. Then 
we can define a 2N-universal complex n- 
dimensional vector bundle y" with base space 
M, y. (C), with total space E(y") and projec- 
tion (X, v)> X. Let o —(o(1), ..., c(n)) bea 
sequence of integers satisfying the condition 
O<a@(1)<...<@(n)<N. Then the set e, of all 
n-planes X c C"*" through the origin O satis- 
fying dim(X NC) =i, dim(X  C* 9971) = 
i—1,i=1, 2,...,n, forms a real (2 ZER, ox(i))- 
dimensional open 'cell. The set of all these 
open cells e, gives a tcellular subdivision of 
M, y, (C) as a CW complex. The closure e, 
of e, is a cellular subcomplex of M, JC 
called a Schubert variety. This is a tpseudo- 
manifold with canonical orientation and 
represents a (237, w(i))-dimensional inte- 
gral cycle, called a Schubert cycle. We denote 
€,, by (w(1), ..., «x(n)). All these homology 
classes form the basis of the homology group 
HAM. u,(C); Z). The cocycle dual to the cycle 
(0, ...,0 1,..., 1) represents the Chern class 

—— cL—— 

n—q q 
c (ys E H^*(M,, v (C); Z). For the real Grass- 
mann manifold M, , ,(R), we can analogously 
define the universal Stiefel- Whitney classes. 


Thom's Definition. Let é be an R"-bundle over 
B, B, be its "Thom space, and UE H"(B,; Z;) 
be the "fundamental class of B;. Let j: B B, 
be the inclusion induced from the zero cross 
section and o: H*(B; Z;) x H**"(B.; Z,) be the 
*Thom-Gysin isomorphism. Then we have j* U 
=w,(¢), 9 '(Sq'U)- w(&) (0 & ix n), where 

Sq' is the *Steenrod square (R. Thom [6]). 


Definition in Terms of Differential Forms. 

Let B be a tdifferentiable manifold and £— 
(P., B, U (n)) be a differentiable principal U (n)- 
bundle over B. Let O —(Q,j be the tcurvature 
form corresponding to the tconnection form 

0 — (0), i, j — 1, ...,n, on P.. Then Q,, is a 
complex-valued 2-form, and Q,,= —Q,,. For 
the matrix Q, we consider the differential form 


w=) w,=det|1+(2x./—-1)'QO}, 


where I is the unit matrix, the multiplication 
in the determinant is the texterior product, 
and y, is the part of degree 2q in y. Then y is 
defined as a real form independent of the con- 
nection c. We have dy, —0, and the coho- 
mology class of ( — 1)*y, in H?*(B; R) is the 
Chern class c,(£) with real coefficients (Borel 
and Hirzebruch [2], S. S. Chern [3]). 


Definition in Terms of Symmetric Polynomials. 
(— 427 Topology of Lie Groups and Homoge- 
neous Spaces.) 
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F. Characteristic Classes of Manifolds 


For a differentiable (complex or almost com- 
plex) manifold M, the characteristic classes of 
its tangent bundle are called characteristic 
classes of the manifold M. We shall denote 
Stiefel- Whitney classes, Pontryagin classes, 
Euler-Poincaré classes, and Chern classes of M 
by w,(M), p;(M), X,(M), and c;(M), respec- 
tively. These are invariants of differentiable 
structures, orientations, or (almost) complex 
structures of a manifold M if M is a differ- 
entiable, oriented differential, or (almost) com- 
plex manifold. By the Stiefel-Whitney num- 
bers of an n-dimensional manifold M, we 
mean the values of n-dimensional monomials 
of Stiefel- Whitney classes of M on the fun- 
damental homology class ((w,(M)' w(My?... 
w,(M)")[M]&€Z;, where r, - 2r; +... + nr,— 
n, r Z0. We can define integer-valued Pon- 
tryagin numbers and Chern numbers similarly. 
These numbers are called generally character- 
istic numbers of the given manifold. In partic- 
ular, X,(M)[M]- x(M)is the *Euler-Poincaré 
characteristic. 

In the case of topological manifolds, we 
can define characteristic classes in the follow- 
ing sense. Let M be a closed n-dimensional 
topological manifold and X" the genera- 
tor of H"(M; Z,). By defining X'(Y")- 
(XiY" )[M]eZ, for X'eH(M;Z,) Y" ‘ec 
HS (M; Z;), we have an isomorphism 
H'(M;Z,)=Hom(H" ‘(M, Z,); Z,). The ele- 
ment ve H'(M; Z,), corresponding to the 
homomorphism VS Sai Y" [M] under this 
isomorphism is called the Wu class of M, 
where Sq' is the tSteenrod square. Moreover, 
we call w,— 31.9 Sq! 'u;e H'(M; Z3) the Stiefel- 
Whitney class of the topological manifold M. 
Then for any tdifferentiable structure Z, we 
have w(M, 2) — w;. Therefore Stiefel- Whitney 
classes of differentiable manifolds are topolog- 
ical invariants (more precisely homotopy type 
invariants) (Thom [6], W. T. Wu [8]). J. W. 
Milnor [9] proved that Pontryagin classes of 
differentiable manifolds are not topological 
invariants. The image of p,(M) by the homo- 
morphism H*'(M; Z)5 H*'(M; Q) induced by 
the inclusion Z c Q (the rational number field) 
is called the rational Pontryagin class. In 1966, 
S. P. Novikov [10] proved the topological 
invariance of the rational Pontryagin class 
(— Section H). 


G. Index Theorem for Differentiable 
Manifolds 


Let M be an oriented closed manifold of di- 
mension 4k. Putting f(x, y) 2 x: y[ M] for 
elements x, y of the 2k-dimensional real coho- 
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mology group H?*(M; R), we obtain a bilinear 
form on H?*(M; R). The 'signature of the 
quadratic form f(x, x) (namely, (number of 
positive terms) — (number of negative terms) in 
its canonical form) is a topological invariant 
(homotopy type invariant) of the manifold M. 
We call it the index or the signature of the 
manifold M and denote it by t(M). If the di- 
mension of M is not divisible by 4, we define 
t(M)=0. For the product of manifolds we 
have t(M x N)=1(M)-1(N). Also t(M) is an 
invariant of the tcobordism class of M (Thom 
[6]). 

The index t of a differentiable manifold 
gives a homomorphism of the tcobordism ring 
Q into the ring Z of integers. Hirzebruch inves- 
tigated the multiplicative property of t and 
gave its expression by means of Pontryagin 
numbers. Let P be the ith telementary sym- 
metric function of indeterminates f,,..., f. 
Then a homogeneous part of the formal power 
series 


o 


i=1 tanh, f, 


of f,,..., B, is a symmetric polynomial of 
B,,---,B,, and therefore a polynomial of P, 
with rational coefficients. For k n, we de- 
note the homogeneous part of degree k by 
L,(P,,..., P). Specifically, if P; are the Pontrya- 
gin classes p;(M**) of a 4k-dimensional closed 
differentiable manifold MI. then L,(P,, ..., P) 
is a 4k-dimensional cohomology class of M**. 
Then we have the formula 


t(M**)=L,(P,, <- B)EM**], 


called the index theorem of differentiable mani- 
folds (or Hirzebruch index theorem). For exam- 
ple, L, =(1/3)P,, La — (1/45) (7P, — P2), and 

L, (1/945) (62P, — 13P, P, - 2P3), ... (^ 114 
Differential Topology). Later this index theo- 
rem was generalized to the tAtiyah-Singer 
index theorem (— 237 K-Theory). 


H. Combinatorial Pontryagin Classes 


Let K be an oriented n-dimensional compact 
thomology manifold, and let 2” be the bound- 
ary of an oriented (r+ 1)-simplex, namely, the 
combinatorial r-sphere. Let f: K 9 X" H bea 
tpiecewise linear mapping. Then for almost all 
points y of £" *, f ! (y) is an oriented 4i- 
dimensional compact homology manifold, and 
its index t( f (y)) is independent of y. We 
denote this by t(f). The (f) is an invariant of 
the homotopy class of f. Let o be the funda- 
mental class of H" *'(x" ^' Z). Then for n2 8i 
+2, there exists a unique cohomology class 

I, - l(K)e H*(K; Q) such that for any piece- 
wise linear mapping f: K — H. we have 
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(l;: f *e) [K] 2 1( f). We can remove the restric- 
tion n 2 8i - 2 if we take K x Z" for K and 
define /,(K) to be LK x 2”) for sufficiently 
large m. If K is a *C!-triangulation of a dif- 
ferentiable manifold M, l;(K) coincides with 
the class L;(P,, ..., P) defined by Hirzebruch 
(R. Thom [7]; V. Rokhlin and A. Shvarts), 
where P; — p((M) is the Pontryagin class of 

M. Since the variable P; can be expressed as 

a polynomial with rational coefficients of 
Lj(B,, ..., P, j&i, we define the combina- 
torial Pontryagin class p,(K) of a homology 
manifold K as the polynomial of [,(K) with 
rational coefficients. Therefore, if K is a C}- 
triangulation of a differentiable manifold M, 
we have p,(K)= p;(M) (modulo torsion ele- 
ments). The class LIKEN and consequently p;(K) 
are important combinatorial invariants of K. 
These classes are topological invariants [10]. 
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Chinese Mathematics 


A. Mathematics in the Chao, Han, and Tang 
Dynasties (3rd Century B.c.- 10th Century A.D.) 


In ancient China, the art of divination, called 
yi, was used in government administration. 
This was a kind of calculation that used pieces 
called tse. The book embodying it, called the 
I-Ching, is still popularly used. It shows that 
"numbers" or mathematics was seriously uti- 
lized in China at that time. The multiplica- 
tion table for numbers up to nine (called the 
Pythagorean table in the West) was known in 
China from the legendary period. However, 
mathematics in the Greek sense, that 1s, math- 
ematics as a logically systematized science, was 
unknown in ancient China. 

Suanching-Shihshu, or the Ten Books on 
Arithmetic— namely, Choupi-Suanching, 
Chiuchang-Suanshu, Haitao-Suanching (edited 
by Liu Hui), Suntzu-Suanching, Wutsao- 
Suanching, Hsiahouyang-Suanching, 
Changchiu-Suanching, Wuching-Suanshu, 
Chiku-Suanching (edited by Wang Hsiao- 
Tong), and Shushu-Chiyi (edited by Hsu Yue) 
—came into being between the 2nd century 
B.C. and the 6th century A.D., from the Chao 
to the Han eras, with the exception of the 
Chiku-Suanching compiled in the Tang era. 
These are the only mathematical texts from 
this early period whose authors and times of 
publication are known. They were used in the 
civil service examination for selecting adminis- 
trators up to the beginning of the Sung era 
(960 4.p.). The most important among them is 
Chinchang-Suanshu, or the Book of Arithmetic, 
which contains nine chapters. It treats positive 
and negative fractions with laws of operations 
on signed numbers, equations, and the elemen- 
tary mathematical knowledge of daily life. The 
Chiku-Suanching contains a number of prob- 
lems reducible to equations of the 3rd and 
4th degrees. 

There were also two works called San Tung 
Shu (edited by Tong Chuan) and Chui Shu 
(edited by Tsu Chung-Chih), but no copies of 
them are extant. Later works, one from the Sui 
era (published in 636) and another from the 
Tang era, tell us that the latter contained the 
result 3.1415927 > 1 > 3.1415926 and the ap- 
proximate values 355/113 and 22/7 for mn. 

In the 1st century a.D. Buddhism was intro- 
duced from India, and paper was invented. 
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However, despite the communication with 
India, neither the Indian numeration system, 
written calculation, nor the abacus was at that 
time widely used in China. The extraction of 
square or cube roots was done with calculat- 
ing rods. 


B. Mathematics in the Sung and Yuan 
Dynasties (10th—14th Centuries) 


In the Sung and Yuan periods contact was 
made with the Arab world. In the 13th cen- 
tury, a mechanical algebra utilizing calculating 
rods made remarkable progress; this can be 
attributed to Arab influence. Toward the 

end of the Sung era appeared the Shushu- 
Chiuchang by Ch'in Chiu-Shao and the Yiku- 
Yentan by Li Chih. The former gives a method 
like Horner's for approximate solution of 
equations, and the latter gives the principle of 
tienyuan-shu, i.e. the mechanical algebra of 
this period. The principle of tienyuan-shu was 
further expounded in the Suanhsueh Chimeng 
(1295) and the Suyuan Yuchien (1303) by Shih 
Shih-Chich, the Yanghui Suanfa by Yang Hui, 
and other works. These were introduced into 
Japan and they influenced the wasan (Japanese 
mathematics) of early times. Until recently, no 
further original mathematical ideas appeared 
in China. 


C. Mathematics after the Ming Era (15th 
Century) 


In this epoch, European renaissance civiliza- 
tion began to influence the Orient. In 1607, 
Matteo Ricci (1552—1610) translated Books I- 
IV of Euclid's Elements into Chinese with the 
aid of Hsu Kuang-Chi. In 1592 Suanfa Tang- 
tsung by Ch'éng Ta-Wei appeared, which 
dealt with the use of the abacus. This book 
had great influence upon wasan. 

No development was seen in the indigenous 
mathematics of the Ching era, that is, after 
the 17th century, but science and technology 
were imported by Christian missionaries. This 
brought about calendar reform from the lunar 
to the solar method. On the other hand, new 
editions of classical works such as the Ten 
Books on Arithmetic began to appear in this 
period. Emperor Kang Hsi-Ti (1655-1722), 
who was in correspondence with Leibniz, 
asked Ferdinand Verbiest (renamed Nan Huai 
Jen in Chinese) to compile Shuli-Chingwen 
(Advanced mathematics), whose 53 chapters 
were completed in 1723. This book dealt with 
European-style algebra and trigonometry. In 
the latter half of the 19th century, Alexander 
Wylie translated a number of Western mathe- 
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matical books into Chinese, including Books 
VII-XIII of Euclid's Elements and some works 
on calculus. Many current Chinese and Japa- 
nese mathematical terms originated with this 
translation. 
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C*-Functions and Quasi- 
Analytic Functions 


A. General Remarks 


An example of a *C?-function is a treal ana- 
lytic function, which is defined to be a function 
that can be expressed as a power series that 
converges in a neighborhood of each point 

of the domain where the function is defined. 
Many examples, however, show that real 
analytic functions form a rather small subset 
of the C*?-functions. Sometimes C?-functions 
(not real analytic functions) play essential roles 
in the development of theories of analysis (— 
105 Differentiable Manifolds S). On the other 
hand, there is a subfamily of C*-functions 
having some remarkable properties in com- 
mon with the family of real analytic func- 
tions. This family is called the family of quasi- 
analytic functions. It has been an important 
object of study since the beginning of the 20th 
century. On the other hand, the Gevrey class 
functions, which are no longer quasi-analytic 
in general, also constitute an important subset 
of C*-functions. The first part of this article 
deals with C?-functions, the second part with 
quasi-analytic functions, and the third part 
with Gevrey class functions. 


B. C°-Functions 


Let Q be an open set of the n-dimensional real 
Euclidean space R". A real-valued function 
f(x, ..., Xp) defined on Q is called a function 
of class C? on Q (or C?-function on Q) if 

f(x, ..., Xn) is continuously differentiable up to 
any order. The totality of C®-functions defined 
on Q is denoted by C?(O). It is an *associative 
algebra over the real number field R. A contin- 
uous function f defined on some closed set F 
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of R” is called a C?-function on F if there exist 
an open neighborhood U of F and a function 
ge C^(U) such that f — g|F. This definition is 
equivalent to the following (H. Whitney [6]): 
For any multi-index a —(a,, ...,2,), we can find 
a continuous function f"(x,,...,x,) on F such 
that (i) f ?(x,, ..., x,) 9 f(xi, ..., x,) and (ii) for 
any positive integer r and for every multi- 
index a with |x| <r, 


lim PAS IO 
IER 








Sg Y peso le 


la+Bl<r p! 


where || || denotes the Euclidean norm of R”. 


C. Local Theory of C”-Functions 


We shall now introduce an equivalence rela- 
tion ~ in C*(R"), defined as follows: f ~g 
<> f |U =g|U for some open neighborhood 

U of the origin. Let £, denote the quotient set 
C” (R")/ ~, which naturally inherits the struc- 
ture of an associative algebra from C? (R"). An 
element of &, is called a germ of a C*-function 
at the origin. We denote the germ of fe C^(R") 
by f. To feg, we assign the formal Taylor 
expansion 23(D^f(0)/2!)x? around 0, where 

D*f means (0^ * - *"f)(01x, ...0'"x,) for «= 
(r4, ...,r,). This assignment induces a homo- 
morphism « from 6, to the tring of formal 
power series R[[x;, ..., x,]] of n variables. 
The homomorphism : is surjective but not 
injective. Put A,—: ^! (0) € &,. A function f 
whose germ f belongs to A, is called a flat 
function. The function g(x) defined by g(x)= 
exp( —1/x?) when x 40 and 9(0)—0 is an 
example of a flat function on R!. A close study 
of the relationship between &, and R[[x;, 
...,X4]] leads to the preparation theorem for 
C^-functions, which can be stated as follows: 
Let F(x,, ..., x,)e £, satisfy F(0,...,0, x,) 
xPg(x,) (ge 4,,g(0) 40). Then any feg, can 

be expressed as f — FQ + R, where Õe g, and 


RE Ären, x, ,)xi with rje&, (B. 
Malgrange [3]). 
Let f(x,,..., x,) be a symmetric function in 


(x4, ..., x,) of class C”. Then there exists a 
germ ĝe é, such that fox. wig Ry m doas 
ol where o,, ...,0, denote elementary sym- 
metric functions with respect to x,, ..., x, (G. 
Glaeser, Malgrange). Let Ted, satisfy f(x) — 
f(— x). Then there exists a germ de 6, such 
that f(x) 2 (x?) (H. Whitney). 


D. Global Results 


Case of n Variables. C* (Q) becomes a *Fréchet 
space when it is endowed with the topology of 
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uniform convergence on compact sets for all 
partial derivatives. Let J and J, be two closed 
"ideals of C°(Q). Then we have J =J, if and 
only if t,(J)=1,(J,) for each x EQ, where t, is 
the mapping from C? (Q) to the ring of the 
formal power series that assigns the formal 
Taylor series of f around x (Whitney). 


Case of One Variable. In the case of one vari- 
able, further information can be obtained from 
various points of view. In the following, f 
denotes a C?-function defined on the unit 
interval I —[0, 1]. If f satisfies f(1)— 1, f (0) 
=... = TI 9(0)-0, then we have 


1 1 1 
SC SC + SS k, 
where m; ^ sup(| f(x)||xe/} and where k 
is some constant independent of the choice 
of f and r (E. Borel). Similar kinds of inequal- 
ities were obtained by A. N. Kolmogorov, 
A. Gorny, and H. Cartan. Let A be an arbi- 
trary countable set of real numbers. If for any 
xel we can find an integer r(x) such that 
f "9 (x)e A, then such a function f is neces- 
sarily a polynomial. The interval J can be 
divided into three disjoint subsets: SV, S^, 
and S$’. These are characterized as follows: 
For xeS the formal Taylor series c,( f) of f 
around x converges to f in some neighbor- 
hood of x. For xe SY”, t (f) diverges. And for 
xe SQ), t ( f) converges in some neighbor- 
hood of x but does not tend to f. Then Sf is 
an open set and S¥ is a G,-set, while SY” is 
an F,-set of the tfirst category. Conversely, let 
I— S8, +S, +S, be any partition of J into an 
open set $,, a G;-set $5, and an F,-set S, of the 
first category. Then there is some f eC" (1) 
with S; — S(^ (i= 1,2, 3) [5]. 


E. Relations between C*-Functions and Real 
Analytic Functions 


Let C?(I) be the set of real analytic functions 
on I. Then C*(I) is a subalgebra of C”(I). 
Applying the above result in the case of S, = 
Ø, we find a function feC”(/) that admits 
no real analytic function coinciding with f 

in a subinterval of I. Actually, functions with 
such a property are distributed densely in 

C" (I). A necessary and sufficient condition for 
a function fe C*(I) to belong to C?(I) is that 
for suitable constants A and k, | f "(x)| € 
Ak"n!, xeI, n=0, 1, 2, ..., be valid (Pring- 
sheim's theorem). If f (x) 20 for all xeJ and 
n —0,1,2, ..., then fe C*?(I) (S. N. Bernshtein). 
For any open set Q (c R”), the set C^(Q) of 
real analytic functions on Q is dense in C”(Q) 
(polynomial approximation theorem). This re- 
sult is true even when the topology of C^(Q) 
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is replaced by a stronger one (Whitney [6]). 
Let f € C*(Q) and pe C”(Q). Then we can 
find ge C?(Q) satisfying f =gọ if and only if 
for any xEQ, t,( f) is divisible by 1,(@) in the 
ring of formal power series (S. Lojasiewicz, 
Malgrange [3]). 


F. Quasi-Analytic Functions 


The investigation of quasi-analytic functions 
began with the attempt to obtain an intrinsic 
characterization of analytic functions. Borel 
defined monogenic functions as functions 
differentiable on their domains of definition, 
which can be any subset of the complex plane, 
not necessarily assumed to be open (— 198 
Holomorphic Functions Q). Similar to com- 
plex analytic functions, monogenic functions 
are uniquely determined by their values on 
any curve. While quasi-analyticity can be 
defined by such properties, it is customary to 
approach quasi-analytic functions from an- 
other aspect, that is, the behavior of higher 
derivatives of C?-functions. 

Generally, a subset B of C*(I) is called a set 
of quasi-analytic functions if the mapping t,: 
B—R[[x]] defined in Section D is injective at 
each point x eJ. The functions belonging to B 
are called quasi-analytic (with respect to B). 
Here, an important problem is to characterize 
a set of quasi-analytic functions B by specializ- 
ing the image of c, (B). 

Now let {M,,} be a sequence of positive 
numbers. Let C(M,) be the subset of C^(TI) 
consisting of f such that 


[f(x)|<Ak"M,, xel, n=0,1,2,..., 


where A= A( f) and k — k( f) are constant. 
Then Pringsheim's theorem simply asserts that 
C(n!) z CCD). 

In 1912, J. Hadamard raised the problem of 
determining the condition that the sequence 
{M,} should satisfy so that C(M,) becomes a 
set of quasi-analytic functions [9]. A. Denjoy 
showed that if 


M, — (nlog! nlog?n...log? ny", 
where 
log! n=logn, log? n —log(log? ! n), 
pec 


then C(M,) is a set of quasi-analytic functions 
[10]. Later he derived an improved result that 
the condition X M, "= œ is sufficient. T. 
Carleman first gave a necessary and sufficient 
condition for C(M,) to be a set of quasi- 
analytic functions, and later A. Ostrowski and 
T. Bang gave another version of the same 
condition [11—13]. The condition states essen- 
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tially the following: A necessary and sufficient 
condition for C(M,) to be a family of quasi- 
analytic functions in the interval (a, b) is given 
by either (i) X f, ! = +00, where fj, 2 inf, Mj’* 
(Carleman), or (ii) f” (log T(r))/(r?) dr = oo, 
where T(r) — sup, , (r"/M,) (Ostrowski, Bang). 
S. Mandelbrojt and T. Bang also gave another 
condition [13, 14]. (The simplest proof of these 
results is found in [13] or [15], where the 
proof follows Bang's idea.) 

Related to the above theorem, we also have 
the following: Let ( M,) be a sequence of posi- 
tive numbers with X(M,/M,,,) « oco. For a 0 
we can find f € C(M,) defined on ( —oo, oo) such 
that f(0) 0, f (- à) 20. Moreover, for 0 « 

a « fi there exists fe C(M,) such that f(0) — 0, 
f™=0(a<x<f,n=0,1,2,...) [16]. 

Suppose that we are given an interval J and 
increasing sequences {v,} and {M,} of positive 
numbers. Then we have the problem of finding 
suitable conditions on {v} and {M,} under 
which the mapping f  ( f"(xo)} gives an 
injective mapping from C(M,) to the sequences 
above. When (v,) and (M, satisfy the above 
conditions, then a function belonging to C(M,) 
is called quasi-analytic (v,) in the generalized 
sense. The study of the inclusion relation be- 
tween two families C(M,) and C(M,) also 
deserves attention. In [15] the relation be- 
tween C(M,) and C(n!) — C*(I) is discussed in 
detail. There are many open problems con- 
cerning the relationship between C(M,) and 
C(M,) in general. 

Quasi-analytic functions are closely related 
to problems in various branches of analysis, in 
particular the theories of complex analytic 
functions, Fourier series, Fourier integrals, 
Dirichlet series, and asymptotic expansions 
[8, 15, 17]. 


G. Gevrey Class Functions 


This class of functions has its origin in the 
study of tparabolic equations. A C®-function 
f(x) defined in a domain of R" is called of 
Gevrey class s (1 &s « oo) if for every com- 
pact set K in that domain there exist positive 
constants A, and C, such that 


max |D*f(x)|< Ax Cf a!" 


for all multi-indices a (|eJ=a,+...t+4a,, a!= 
a,!a5!...a,!). A typical example is 


ot exp(—I/x) for x>0, 
gt 0 for x «0, 


which is of Gevrey class 2. Let us denote the 
function space of a Gevrey class s by y. Evi- 
dently y cy? if s « s'. Let f(x)e y? and 
F(y)ey85 then xt F(f(x))ey. In particular, 
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the sum and the product of two functions in 
y® belong also to 79. Furthermore, the tim- 
plicit function theorem holds in this class. 
Unlike the class of analytic functions, we can 
use this class of functions of class s (> 1) fora 
tpartition of unity, and several problems in 
partial differential equations can be treated 
within this function space. Gevrey class func- 
tions are discussed in a complete form in [20] 
(— also [19]). 
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A. History 


The notion of a class field was first introduced 
by D. Hilbert (1898). Let k be an talgebraic 
number field and K a *Galois extension of k. 
Hilbert called such a field K a class field over k 
(or K/k was called a class field) when the fol- 
lowing property was satisfied: A *prime ideal 

p of k of absolute degree 1 (i.e., a prime ideal 
whose 'absolute norm is a prime number) is 
decomposed in K as the product of prime 
ideals of K of absolute degree 1 if and only if 
p is a *principal ideal. (Such a field K is now 
said to be an absolute class field over k in 
order to distinguish it from a class field defined 
later more generally by T. Takagi as explained 
below.) Hilbert conjectured the following theo- 
rems (1)-(4) together with the principal ideal 
theorem (— Section D), and proved them in 
some special cases. (1) For any algebraic num- 
ber field k there exists one and only one class 
field K/k. (2) A class field K/k is an tAbelian 
extension whose *Galois group is isomorphic 
to the "deal class group of k. Hence the degree 
n=(K:k] is equal to the *class number h of k. 
(3) The trelative different of a class field K/k is 
the principal order; thus K/k is an tunramified 
extension. (4) Let p be a prime ideal of k, and 
let f be the smallest positive integer such that 
p/ is a principal ideal. Then p is decomposed 
in the class field K/k as p=, P... Py, 

Nga (88) — p^, fg —n. 

Hilbert was led to these conjectures by the 
analog to the theory of talgebraic functions in 
one variable. Theorems (1)-(4) were proved by 
P. Furtwängler (Math. Ann., 63 (1907)), but 
these results were subsumed under the class 
field theory of Takagi, who generalized the 
notion of class field and proved that every 
Abelian extension of k is a class field over k (J. 
Coll. Sci. Imp. Univ. Tokyo, (9) 41 (1920)). Since 
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then, the arithmetic of Abelian extensions of k 
has developed through this theory. In Takagi's 
paper, L. Kronecker's problem concerning 
Abelian extensions of an imaginary quadratic 
field (2 73 Complex Multiplication) was 
solved simultaneously; this had been an open 
problem since the 19th century. Later, E. Artin 
proved the general law of reciprocity (Abh. 
Math. Sem. Univ. Hamburg, 5 (1927)) which put 
class field theory into its complete form. The 
original proof by Takagi was rather com- 
plicated, and H. Hasse, Artin, J. Herbrand, C. 
Chevalley, and others tried to simplify it. In 
particular, Chevalley introduced the notion 

of tideles and gave a purely arithmetic proof. 
On the other hand, attempts are also being 
made to generalize this theory to non-Abelian 
extensions. We mention here the results of 

G. Shimura [15] and Y. Ihara [16]. Today, 
class field theory is considered one of the 

most beautiful theories in mathematics. 


B. Definition of a Class Field 


Let k be an algebraic number field. For the 
definition of a general class field over k, we 
need a generalization of the ideal class group 
of k (2 14 Algebraic Number Fields H). Let m 
be an tintegral divisor of k, and let (n) be the 
multiplicative group of all tfractional ideals of 
k which are trelatively prime to m. For the 
rest of this article, we mean by an ideal of k 

a fractional ideal of k. Denote by S(m) the 
tray modulo m. Let H(m) be an fideal group 
modulo m, that is, a subgroup of (m) con- 
taining S(m). A Galois extension K of k is said 
to be a class field over k for the ideal group 

H (m) if the following property is satisfied: A 
prime ideal p of k of absolute degree 1 which is 
relatively prime to m is decomposed in K as 
the product of prime ideals of K of absolute 
degree 1 if and only if p belongs to H(m). The 
absolute class field of Hilbert is the case where 
m —(1) and H (m) ts the group of all principal 
ideals of k. 

A class field K/k for an ideal group H is 
uniquely determined by H (uniqueness theo- 
rem). The fconductor f of H is said to be the 
conductor of the class field for H. The ideal 
group H corresponding to the class field K/k is 
determined by K as follows: H(f)/S(f) is the 
union of all cosets C of 3(f) modulo S(f) such 
that C contains a frelative norm Nx,(W) of 
some ideal d of K which is relatively prime to 
f. In general, let K/k be a Galois extension and 
nt be an integral divisor of k. Let H(m) be the 
union of all cosets C of 3(m) modulo S(m) such 
that C contains a relative norm Nx (91) of 
some ideal W of K which is relatively prime to 
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m. Then H(m) is a multiplicative subgroup of 
Sm), and the index h=(3(m): H(m)) is not 
greater than the degree n=[K:k]. We have 
h=n if and only if K/k is the class field for 

H. Hence a class field K/k can be defined as 
a Galois extension of k such that h=n for a 
suitable integral divisor m of k. 


C. Fundamental Theorems in Class Field 
Theory 


(1) Main theorem: Any Abelian extension K/k 
is a class field over k for a suitable ideal group 
H. 

(2) Existence theorem: For any ideal group 
H (m) there exists one and only one class field 
for H (m). 

(3) Composition theorem: Let K, and K, be 
class fields for H, and H,, respectively. Then 
the composite field K, K, is the class field over 
k for H, N H}. Consequently, K , >K, if and 
only if H, c H,. 

(4) Isomorphism theorem: The Galois group 
of a class field K/k for H(m) is isomorphic to 
S(n)/H (m). In particular, every class field 
K/k is an Abelian extension of k. 

(5) Decomposition theorem: Let f be the 
conductor of the class field for H. If p is a 
prime ideal of k relatively prime to f and f is 
the smallest positive integer with p/ eH, then 
p is decomposed in K/k as p — 3B, 3B, ... P, 
Ne, B) = p^, fg n. 

(6) Conductor-ramification theorem: Let f be 
the conductor of a class field K/k. Then f is not 
divisible by any prime divisor that is unrami- 
fied for K/k, and f is divisible by every prime 
divisor that ramifies for K/k. Let f=[Tf,, 

T, = p°. Then f, coincides with the tp-conduc- 
tor of K/k, and the exponent c can be explicitly 
expressed by the order of the tramification 
groups and the tramification numbers of p for 
K/k (— 14 Algebraic Number Fields P). 

(7) Let p be a prime ideal of k that rami- 
fies for K/k. Let H, be the ideal group of k 
such that (i) the conductor of H, is relatively 
prime to p and (ii) H, is the minimal ideal 
group of k containing H with property (1). 

Let nz [K:k], e - (H,: HL and p/ € H,, where 
p^(d « f )€ H,. Then p is decomposed in K/k as 
p =(P; 3, ...58)5, Nka B) = p^, n=efg. 

(8) Translation theorem: Let K/k be the class 
field for an ideal group H(m), and let Q be an 
arbitrary finite extension of k. Then KQ/Q is 
the class field for H*, where H* is the ideal 
group of Q consisting of all ideals b of Q with 
Noj(b)e H(m). In particular, the conductor of 
KQ/Q is a divisor of the conductor of K/k. 

(9) Artin’s general law of reciprocity: Let K/k 
be the class field for an idea group H with the 
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conductor f. We denote the *Artin symbol of 
an ideal a of k that is relatively prime to f by 


(K/a) — (=) , 
a 
Let a mapping ® from (f) to the Galois 
group G of K/k be defined by (a) 2 (K/a) for 
ae (f). Then induces the isomorphism 
3(D/H(f) z G. Namely, the isomorphism men- 
tioned in (4) is explicitly given by the Artin 
symbol. Also, the ideal group H(f) is character- 
ized as the set of all ideals a such that ae %(f) 
and (K/a)=1. From this theorem we can prove 
all the known laws of reciprocity for power- 
residue and norm-residue symbols (— 14 
Algebraic Number Fields O, Q, R). 

From the general results of class field theory 
we can systematically derive all the known 
theorems concerning the arithmetic of qua- 
dratic fields, cyclotomic fields, and Kummer 
extensions. 


D. Principal Ideal Theorem 


Let K/k be an absolute class field. Then the 
extension of any ideal of k to K is a principal 
ideal of K. This theorem is called the principal 
ideal theorem. It was conjectured by Hilbert, 
formulated by Artin as a theorem of group 
theory, and proved by Furtwängler (Abh. 
Math. Sem. Univ. Hamburg, 7 (1930)). Later a 
simple proof was given by S. Iyanaga (Abh. 
Math. Sem. Univ. Hamburg, 10 (1934)). This 
theorem was also generalized to the following 
general principal ideal theorem (Iyanaga, 
Japan. J. Math., 7 (1930)): Let K/k be the class 
field for the ray S(f), and let f= 4$, where D is 
the relative different of K/k. Then the exten- 
sion to K of any ideal of k that is relatively 
prime to f belongs to S(%). Put & [I W”. 
Then v is equal to the ramification number 

v, - 1 (—^ 14 Algebraic Number Fields K). For 
an absolute class field K/k, let the extension of 
an ideal a to K be (O(a)). Then we can choose 
QO(a)e K such that O(a)O(b)"* O(ab) `! ek, 
where o(a) — (K/a) is the Artin symbol for a 
(T. Tannaka, Ann. Math., 67 (1958)). This 
result can also be generalized for the class 
field for S(f). 


E. Theory of Genera 


Let K/k be a Galois extension and let H(m) be 
an ideal group of k. The set of all ideals d of K 
relatively prime to m such that Ne lU be- 
longs to H(m) forms an ideal group of K. This 
ideal group is said to be the principal genus for 
H. Each coset of 3(m) modulo the principal 
genus for H is said to be a genus for H. In 
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particular, let K/k be a cyclic extension with 
the conductor f, and let H(f) be the ideal group 
of k generated by Nx,(A) (4e K) and S(f). 
Then the principal genus for H(f) is the ideal 
group formed by the ideal classes of K of 

the form C! ^, where c is a generator of the 
Galois group of K/k (— 347 Quadratic Fields 
F). In general, let K/k be an Abelian extension, 
and let j be the conductor of K/k. Then for the 
ideal § =I] 38" of K defined in Section D, 

Nx (S(m)) = S(mi) for an arbitrary integral 
ideal m of k. In particular, let K/k be a cyclic 
extension, and let H = S(mf). Then the prin- 
cipal genus for H is the ideal group consist- 
ing of all cosets of time) modulo Sim) of 
the form B! * (Herbrand; Iyanaga, J. Reine 
Angew. Math., 171 (1934)). 


F. Class Field Tower Problem and 
Construction Problem 


Furtwangler considered the following prob- 
lem: Let k be a given algebraic number field, 
k —ko c k, c k; ... be the sequence of fields 
such that k; is the absolute class field over k;_,, 
and K , be the union of all the k;. Is K, a 
finite extension of k? The answer is yes if and 
only if k, is of class number 1 for some n. This 
problem is called the class field tower problem. 
Artin remarked that if for every algebraic 
number field F of degree n we have the inequal- 
ity | Dp| > (1/4)? ':(n"/n!? > (ne?/Ay /2ne V6") 
for the *discriminant Dp, then K ,,/k is always 
finite [1, p. 46]. E. S. Golod and I. R. Shafa- 
revich (1964) solved the class field tower prob- 
lem negatively; they proved that K ,,/k is 
infinite if k; (i= 1,2, ...) is the maximal unrami- 
fied Abelian p-extension of k;_, for a fixed 
prime number p and if the inequality y 2 34 
24 / p +2 holds, where y is the minimal num- 
ber of generators of the p-component of the 
ideal class group of k and p is the rank of the 
unit group of k. (We call an extension K/k a p- 
extension if the degree [K :k] is a power ofa 
prime number p.) For example, the class field 
tower K „/k is actually infinite if k is an imagi- 
nary quadratic field (p — 1) and y 2 7 for p=2, 
for example, k = Q(,/ —3:5:7-11-13:17-19). 
Construction problem. Let k be a given alge- 
braic number fteld and G a finite group. The 
construction problem asks us whether there 
exists a Galois extension K/k such that its 
Galois group Gal(K/k) is isomorphic to G. If G 
is Abeltan the problem can be solved affirma- 
tively by using class field theory. This problem 
was also solved affirmatively for p-groups by 
A. Scholtz and H. Reinhardt in 1937, and for 
general solvable groups by Shafarevich in 1954 
(Izv. Akad. Nauk SSSR, ser. mat. 18). 
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G. Class Field Theory for Algebraic Function 
Fields and Local Class Field Theory 


F. K. Schmidt developed an analog of class 
field theory for Abelian extensions over an 
algebraic function field in one variable with 
finite coefficient field (1930; — [8]). An arith- 
metic proof was given by M. Moriya (1938). 
An analog of class field theory for local fields 
with finite residue-class fields, called local class 
field theory (— 257 Local Fields) was first 
developed by Hasse, and later Chevalley gave 
an algebraic derivation (1933). 


H. Cohomology of Groups and Class Field 
Theory 


For the purpose of simplifying the proof of the 
main theorems in class field theory, the theory 
of *Galois cohomology was developed by T. 
Nakayama, G. Hochschild, A. Weil, Artin, J. 
Tate, and others. In particular, Artin and Tate 
[9] constructed class field theory on the basis 
of the cohomology theory of finite groups as fol- 
lows: Let G be a finite group, A a *G-module 
(or a multiplicative commutative group with 
the operator domain G), and H"(G, A) the nth 
tcohomology group (n=0, +1, +2,...) of 

G with coefficients in A (— 200 Homolog- 

ical Algebra N). Then we have H°(G, A) z: 
A®/N,(A), where A? is the set of all G-invariant 
elements in A and N,(A) is the set of all ele- 
ments of the form NG(a) 2 Zeng (ae A). We 
can consider Z a G-module by defining on =n 
(ne Z,o €G). Let A, B, C be G-modules such 
that a G-bilinear mapping (A, B) C is de- 
fined. Then we can define the tcup product 
(a, B) a ~ B (we H'(G, A), Be H*(G, B), a~ Be 
I *(G, C)) for r, se Z with the usual prop- 
erties. Let 4 be a G-module and H a sub- 
group of G. Then the restriction homomor- 
phism R4: É"(G, 4) FI" (H, A) and the 
tinjection homomorphism Inj,,,¢:H"(H, 4) 
É"(G, A) are defined for ne Z. If H isa 

normal subgroup of G, then the l'inflation 
homomorphism Inf; 44: HIGH, AP) 
H"(G, A) can be defined for n> 1 (— 200 
Homological Algebra M). 

Let k be an algebraic number field, and let 
K be a Galois extension of k of degree n with 
the Galois group G — G(K/k). The multiplica- 
tive group K* =K — {0}, the tidele group Jy of 
K, and the idele class group Cx of K are multi- 
plicative commutative groups with G as their 
operator domain. The fundamental formulas 
in Galois cohomology for class field theory are 


H'(G, C,) 20, (1) 
AP (G, C) = Z/nZ. (2) 
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It is possible to realize the isomorphism of 
(2) by the invariant inv, ,,: Ê? (G, Cx) > {(r/n) 
(mod Z)|r=0, 1, ...,n— 1j in such a way that 
the following properties hold, where the canon- 
ical cohomology class for K/k is the element 
Ex, of H?(G, Cx) such that invy,, £j, —(1/n) 
(mod Z): (i) for kc le K, G=G(K/k), H = 
G(K/l), the relation Resg,, £,;, = &,; holds; 
(11) 1f //k 1s also a Galois extension with F — 
G(l/k), then we have Int, Čik = EK, (m= 
CK :1]); (iit) for a cyclic extension K/k we 
have Ivv Čr = Ly Nol Kl (mod Z), where 
p runs over all prime divisors of k and inv, is 
the invariant in the local theory (— 257 Local 
Fields E). By these properties, the canonical 
cohomology class £y, is uniquely determined. 
After these preliminaries we can state Tate's 
theorem, from which the fundamental theo- 
rems in class field theory follow. 

Tate's theorem. Let K/k be a Galois exten- 
sion with the Galois group G. Then we have the 
isomorphism ®,: H"^?(G, Z) « H"(G, C4) (n= 
0, +1, +2,...) that is given explicitly by 
Q,(x) Eu — x, where £j, € HO, Cx) is the 
canonical cohomology class for K/k (Ann. 
Math., (2) 56 (1952)). 

Corollary 1. Since B^ ?(G, Z) : G/[G, G] and 
À? (G, Ck) x C Ny (Cy), we have the isomor- 
phism ®,:G/[G, G] = C/N&, Let f(t, o) 
(t,o € G) be a 2-tcocycle belonging to £g. 
Then by the explicit expression for the cup 
product we obtain the isomorphism 


®:a( moa G])> 


I] Te, ol ! (mod N62) . 


teG 


This is an analog of the result in local theory 
that was proved earlier by T. Nakayama and 
Y. Akizuki (Math. Ann., 112 (1936)). 

Corollary 2. For an Abelian extension K/k 
we have the isomorphism 0,:G Œ C,/Nxj(Cx). 
O5! has the property of being the norm- 
residue symbol for C,, and from this iso- 
morphism we can prove immediately Artin's 
law of reciprocity. Thus we can prove the main 
theorems in class field theory by cohomology- 
theoretic methods [9]. 

We can also see, by generalizing this iso- 
morphism to infinite Abelian extensions, that 
the Galois group of the maximal Abelian 
extension of k over the ground field k with 
tKrull topology is algebraically and topologi- 
cally isomorphic to C,/D,, where D, is the 
connected component of the unit element in 
C,. The structures of D, and C,/D, were ex- 
plicitly determined by Artin and T. Kubota, 
respectively (— 6 Adeles and Ideles D). 

If we assume the fundamental formulas (1) 
and (2) stated above and several other simple 
assumptions as axioms for an infinite ex- 
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tension of a fixed ground field, we can de- 
velop the results stated in this section purely 
cohomology-theoretically. Such a system is 
called a class formation (Artin; — [9]). In ad- 
dition to the cases of algebraic number fields, 
algebraic function fields in one variable with 
finite coefficient fields, and local fields with 
finite residue-class fields, which we have men- 
tioned already, we also know several other 
cases for which analogies of class field theory 
are valid. These analogies can be explained 
systematically by using class formation the- 
ory (Y. Kawada, Duke Math. J., 22 (1955)). 
Examples are (1) the theory of unramified 
Abelian extensions of an algebraic function 
field in one variable with algebraically closed 
constant field of characteristic 0 (Tate and 
Kawada, Amer. J. Math., 77 (1955)); (2) the 
theory of Kummer extensions over a field k 
such that (1) the characteristic of k is O, (ii) k 
contains all the roots of unity, and (iii) for any 
Galois extension K/k, Nka(K )= k; (3) the 
theory of Abelian p-extensions of a field of 
characteristic p (E. Witt, J. Reine Angew. 
Math., 176 (1963); I. Satake and Kawada, J. 
Fac. Sci. Univ. Tokyo, 7 (1955)); (4) the theory 
of unramified Abelian p-extensions of an 
algebraic function field in one variable with 
algebraically closed constant field of char- 
acteristic p (Hasse and Witt, Monatsh. Math., 
43 (1936), H. L. Schmid, I. R. Shafarevich, 
Kawada, T. Tamagawa); and (5) the theory 
of Abelian extensions of a local field with 
algebraically closed residue-class fields (J.-P. 
Serre, Bull. Soc. Math. France, 89 (1961)). 

An analogy of class field theory for infinite 
Abelian extensions was considered by Her- 
brand, Moriya, M. Mori, and Kawada. 
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A. Introduction 


The general linear groups, unitary groups, 
orthogonal groups, symplectic groups, etc., 
that are described below are all called classical 
groups (— 13 Algebraic Groups, 151 Finite 
Groups, 248 Lie Algebras, 249 Lie Groups). 


B. General Linear Groups 
Let V be a "linear space of dimension n over a 


*field K, and let GL(V) denote the set of all 
‘linear mappings of V onto V (hence they are 
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all bijections). Then GL(V) is a group under 
the composition of mappings. This group is 
called the general linear group (or full linear 
group) on V. Let e,,...,e, bea basis of V over 
K, and let (Gi be the matrix associated with an 
element A of GL(V): Ae; 2 3;2je;. Then the 
mapping A (xj) is an isomorphism of GL(V) 
onto the multiplicative group GL(n, K) of all 
nxn 'invertible matrices over K. We can 

thus identify the group GL(V) with GL(n, K). 
GL(n, K) is called the general linear group of 
degree n over K. Consider the homomorphism 
A—|A|(|A] is the determinant of A) of GL(V) 
onto the multiplicative group K* = K — {0}. Its 
kernel SL(V) is a normal subgroup of GL(V) 
and is called the special linear group (or uni- 
modular group) on V. The subgroup SL(n, K) 
—(A|AeGL(n, K), | A| 2 1) of GL(n, K) corre- 
sponds to SL(V) under the above isomorphism 
GL(V)z GL(n, K). SL(n, K) is called the special 
linear group of degree n over K. Unless n=2 
and K is the "inte field F, = GF(2), SL(n, K) is 
the tcommutator subgroup of GL(n, K). The 
tcenter 3 of GL(n, K) coincides with the set 

of all scalar matrices a! (xe K*), and the 

center 39 of SL(n, K) is a finite group given by 
30 SL(n, K) 2 (a1]|ue K, a" — 1). 

Now let P(V) be the projective space of 
dimension n — 1 obtained from a linear space 
V of dimension n. Namely, P(V) is the set 
of all linear subspaces of dimension 1. Then 
there exists a natural homomorphism q of 
GL(V) into the group of all projective trans- 
formations of P(V), and the tkernel of o 
coincides with the center 3 of GL(V). Hence 
o(GL(V))=GL(V)/3. This group is written as 
PGL(V) and is called the projective general 
linear group on P(V). Similarly, PGL(n, K) 
= GL(n, K)/5 is called the projective general 
linear group of degree n over K. The quotient 
group SL(n, K)/3o of SL(n, K) by the center 3, 
is called the projective special linear group and 
is written as PSL(n, K) or LF (n, K) (linear 
fractional group). 

The groups GL(n, K), SL(n, EL etc., are also 
written as GL,(K), SL,(K), etc. In particular, 
when K is the ffinite field F,, these groups 
are denoted by GL(n, q), SL(n; q), PGL(n, q), 
PSL(n, q), LF(n, q). 


Simplicity of PSL(n, k). When n=2 and K = 
F,, PSL(2,2) x G, (the tsymmetric group of 
degree 3). When n=2 and K =F;, PSL(2,3)= 
9I, (the alternating group of degree 4). Except 
for these cases, the group PSL(n, K) (n 2 2) is 
a noncommutative *simple group (— 151 
Finite Groups I). 

Suppose that K is the finite field F,, and let 
a(n, q), Din, q), y(n, q), ó(n, q) denote the orders 
of GL(n, q), SL(n,q), PGL(n, q), PSL(n, q), res- 
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pectively. Then we have 

a(n, q) - (q" — 1) (q" — 9)...(a" — a" A 
B(n, q) — y(n, q) a(n, q)/(q — 1), 

din, q) — y(n, q)/d, 


where d — (n, q — 1) (the greatest common divi- 
sor of n and q— 1). 


C. Properties as Lie Groups 


If the ground field K is the field R of real num- 
bers (the field C of complex numbers), the 
above groups are all *Lie groups (*complex Lie 
groups). In particular, SL(n, C) is a tsimply 
connected, tsimple, and fsemisimple complex 
Lie group of type A,_,, and PSL(n, C) is the 
*adjoint group of the complex simple Lie 
algebra of type A, ,. 


D. Determination of the Rational 
Representations of GL(V) 


In Sections D and E, the field K is assumed to 
be of characteristic 0. Let p be a homomor- 
phism of GL(V) 2 GL(n, K) into GL(m, K} (p: 
A — (o5) ^ B —(B2)). Then if each fi? is a rational 
function (or polynomial or analytic function) 
in (x1,a1, ..., a7) over K, p is called a rational 
representation (or polynomial or analytic repre- 
sentation) of degree m of GL(V). (We suppose 
that K is R or € when we consider analytic 
functions.) For example, every rational repre- 
sentation of degree 1 can be expressed as A 
| A|° (e is an integer). In particular, if K is the 
field C of complex numbers, every analytic 
representation of GL(n, C) is a rational repre- 
sentation. Since GL(n, C) is the tcomplexifica- 
tion of the tunitary group U (n), there exists a 
one-to-one correspondence between the com- 
plex analytic representations of GL(n, C) and 
the continuous representations of U(n); this 
correspondence preserves equivalence, irre- 
ducibility, ttensor product, and direct sum 

of the representations (— 249 Lie Groups). 
Hence, determining the rational representa- 
tions of GL(n, C) is equivalent to determining 
the continuous representations of U(n). In the 
general case, the rational representations of 
GL(V)=GL(n, K) are all completely reducible. 
For any rational representation p of GL(V), 
there exists a natural number e such that the 
representation p': A—]|A]J*p(A) is a polynomial 
representation. Hence in order to determine 
the rational representations of GL(V), it is 
sufficient to determine the irreducible poly- 
nomial representations of GL(V), which, as 
described below, can be obtained by decom- 
posing the representations on the *tensor space 
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Vy"—yG...G V of degree m (m copies of V) 
(m=1,2,...). For AeGL(V), define D,(A)e 
GL(V") as the tensor product 


D,(A)— AQ ... © A (m copies of A). 
Namely, for v,, ..., v, € V, we have 
D,(A)(v, © ... © v,)- Av, ©... G9 Av, 


The mapping GL(V)3 A—D,(A)e GL(V")isa 
polynomial representation of degree n" of 
GL(V). Now let (H be the tassociative 
algebra of all linear mappings of V" into V" 
(total matrix algebra), and let 9I be the sub- 
algebra of €(V") generated by (D,(A)| Ae 
GL(V)}. Next, for an element o of the symmet- 
ric group G,, of degree m, define B,e GL(V") 
by B,(v, @ ... @ Vm) = 05-11)  ... 051. 
Then the mapping o B, is a representation 
of €, on V". Thus we obtain a representation 
t of the *group ring K[G,,] of S,, over K on 
V":K[G, ] L(V”). Set (K[G,]) = 8. Then 
A and 3B are fcommutors of each other in 
(V^) ie, A= (Xe €(V")| XB = BX (for all 
Be$8)j, B={XeEL(V")| AX = XA (for all 
Jet. 

Now for a right ideal r of B, let (Hm be the 
subspace of V" composed of all the finite sums 
of the form È Bx (Ber, x e V"). Then the fol- 
lowing statements hold: 

(1) «(V") is invariant under d hence it is a 
subspace of V" invariant under GL(V). Con- 
versely, for any subspace U of V" invariant 
under GL(V), there exists a unique right ideal r 
of B such that U —-r(V"). 

(2) Let r,, r, be right ideals of B, and put 
U, =r; (V^), U; 2 r,(V"). Then r; Sr: (as right 
&,,- modules) if and only if U, = U, (as repre- 
sentation spaces of GL(V)). 

(3) The mapping t —r(V") is a lattice iso- 
morphism of the tlattice of right ideals of B 
onto the lattice of GL(V)-invariant subspaces 
of V". Hence if r =r; +r, (direct sum), then 
U = U, + U, (direct sum). Also, r(V") gives an 
irreducible representation of GL(V) if and only 
if r is a minimal right ideal of B. 

Since the algebra K[G,,] is a tsemisimple 
algebra, B can be considered as a two-sided 
ideal of K[S,,,]. Hence a minimal right ideal r 
of B is also a minimal right ideal of K [Sn], 
and the tidempotent element e which generates 
r is a !primitive idempotent of K[ S, From 
the theory of symmetric groups (— 362 Repre- 
sentations H) the primitive idempotents of 
K[G,,] are all given (up to isomorphism) by 
' Young's diagrams T(f,,....f) (fi 2 f22...2 
f, 50, mo f, ... + f). In this setting, we have 

(4) Let & — e( fi, ..., Ją) be the primitive 
idempotent determined by Young's diagram 
T( fis ..., f). Then eK [S,,] c 88 if and only if 
k <n. In this case, put eK[S,] =r, (V?) — 
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e(V")— V"(T(f,,...,f)) and denote the 
irreducible representation of GL(V) on 
V"(T(fi, ...,f)) by AP D(A; fis ....f,). We call 
(fi... f) the signature of this irreducible 
representation. 

(5) The representation D(A; f,,..., fa) is an 
irreducible polynomial representation of 
GL(V). Furthermore, for any irreducible poly- 
nomial representation p of GL(V), there exists 
a unique D(A; fi, ..., f.) equivalent to p. For ex- 
ample, if k= 1, then f, =m, e- (m!) E csn, 
and V"(T(m)) is the space of 'symmetric ten- 
sors of degree m. If f, —...— f= 1, then k=m, 
£— (m!) t. Dc, (sgno)c, and V"(T(L, ...,1)) 
is the space of talternating tensors of degree m. 

(6) Let x(4; f,,..., fa) be the tcharacter of the 
irreducible representation D(A; f,, ..., f,). Then 


X(A; fi. suos) 


gt e? t gn 
ei el SN B 
(uL um & i 
— DREI P 
g^ 1 g^ ? £ H 


where £,, ...,£, are the feigenvalues of A and 
l, 5 f, -(n— 1), L 2 f; 4 (n—2), ..., 1, — f, (set 
Luz... =f, =0). Hence the degree d of 
D(A; fi» ..., f) is expressed as 


d=D(I,,...,1,)/D(n—1,..., 1,0), 


where D(x;, ..., x,) = ITi- (x; ch 

(7) In particular, denote the character 
of D(A; m) by p, — p, (A). Then they satisfy 
I —zA| ! 2 po piz t pz? +... and 


XA; fis Ji) 


Dr, Dr, +t Dr, +in-1) 
| Pr.-1 Py, Pf +in-2) 
Pj-m-1 Pfaz-2) ^" Ph 
where we put fe+1 =Jk+2 =- = fa =9, Bu 
p.57... —0. This matrix is simply written as 


Ip nun Pil With the convention that in 
each row, we set /, — fi c (n— 1), ...,1, 4 = 
fact lm fa 


E. Determination of the Rational Represent- 
ations of SL(V) 


The rational representations of SL(V) are 
completely reducible. By restricting any 
irreducible representation D(A; fi, .... fa) 
(f, 2f;2..2f,20)of GL(V) to SL(V), we 
get an irreducible rational representation 
D(A; f,, ..., f) of SL(V). Furthermore, any 
irreducible rational representation of SL(V) 
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can be obtained in this way. D(A; f,, ..., LI 
and D(A; f, Care equivalent representa- 
tions of SL(V) if and only if f, — firi — f! Ei 
(i—1,...,n— 1). 


F. Unitary Groups 


The set U (n) of all n x n tunitary matrices with 
complex elements is a group under multipli- 
cation (— 269 Matrices). This group U(n) is 
called the unitary group (or unitary transfor- 
mation group) of degree n. The subset of U (n) 
consisting of all matrices of determinant 1 is a 
normal subgroup of U (n). This group is called 
the special unitary group and is denoted by 

SU (n). 

U (n) and SU (n) are subgroups of GL(n, C) 
and SL(n, C), respectively, and can be obtained 
from these groups through the tunitary restric- 
tion. Hence they are both compact, connected 
Lie groups; in particular, SU(1) is composed 
only of the identity and U(1) is the multiplica- 
tive group of all complex numbers of absolute 
value 1. The center 3 of U (n) is the set of all 
diagonal matrices AI(4€C, |A|= 1), and we 
have 


SH, 3: SU(n)- U(n) 


U (n)/SU (n) & UO) 


Moreover, for n z2, SU (n) is a simple, semi- 
simple, and simply connected Lie group, which 
gives one of four infinite series of simple com- 
pact Lie groups. 

U (n)/3 is denoted by PU (n) and is called the 
projective unitary group. We have the relations 
PU (n) z SU (n3 SU (n) 3N SU (n) z Z/nZ. 
Hence PU (n) is locally isomorphic to SU (n). 


G. Irreducible Representations of U (n) 


Restricting the irreducible representation 
D(A; fi, ..., fg of GL(n, C) on SU(n), we obtain 
a continuous irreducible representation of 

SU (n), and conversely, all continuous irreduc- 
ible representations of SU (n) are obtained in 
this manner. Similarly, any continuous irre- 
ducible representations of U (n) are given by 
A-|A|*D(A; f,, ..., f), where e is an integer. 
Since both U (n) and SU (n) are compact, any 
continuous representation of these groups 
can be decomposed into a direct sum of the 
irreducible representations mentioned above 
(^ 69 Compact Groups). 

The representation theory of U (n) and SU (n) 
is important as the most typical and concrete 
example of the representation theory of gen- 
eral compact Lie groups (— 69 Compact 
Groups, 248 Lie Algebras, 249 Lie Groups). 
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H. Unitary Groups over General Fields 


A unitary matrix and the unitary group can 
also be defined over some fields other than the 
field C of complex numbers. Namely, let P be 
a field and K a quadratic extension field of P; 
for an element € of K, let Ë be the tconjugate 
of € over P. Then a matrix of degree n with 
entries in K is called a unitary matrix of K 
(relative to P) if it leaves invariant the tHer- 
mitian form č č, - £,& +... + £,£,. The multi- 
plicative group consisting of all unitary mat- 
rices is called the unitary group over K (relative 
to P) and is denoted by U (n, K, P); its sub- 
group consisting of all unitary matrices of 
determinant 1 is called the special unitary 
group over K and is denoted by SU(n, K, P). 
The quotient group of SU(n, K, P) by its 
subgroup consisting of all AI (4" 2 1,|4| — 1) is 
called the projective special unitary group over 
K and is denoted by PSU (n, K, P). In partic- 
ular, when K and P are the finite fields F, 

and F, (4 — p"), U (n, K, P), SU (n, K, P), 

PSU (n, K, P) are written simply as U (n, q), 

SU (n, q), PSU (n, q). Then for n> 3, each 

PSU (n, q) is a noncommutative simple group, 
except for PSU(3,2) (— 151 Finite Groups I). 


I. Orthogonal Groups 


The set of all torthogonal matrices of degree n 
(with real entries) forms a group under multi- 
plication. This group O(n) is called the ortho- 
gonal group (or orthogonal transformation 
group) of degree n. The subset of O(n) consist- 
ing of all orthogonal matrices of determinant 1 
forms a normal subgroup of O(n) of index 2. 
This group SO(n) (also denoted by O; ) is 
called the rotation group (special orthogonal 
group or proper orthogonal group) of degree n. 
Geometrically, O(n) is the set of all orthogonal 
transformations leaving a point in Euclidean 
space of dimension n fixed, and SO(n) is com- 
posed of all rotations around the point. 

Both O(n) and SO(n) are compact Lie 
groups, and SO(n) coincides with the con- 
nected component of O(n) which contains the 
identity. For n=3 or nz 5, each SO(n) is a 
simple and semisimple Lie group. Following 
the theory of Lie algebras, we divide the set 
of all SO(n) (nz 3 but nz 4) into two classes 
according as n is even or odd, and we thus get 
two of the four infinite series of simple and 
semisimple compact Lie groups (for SO(4), for 
example, see [1]). 

Although SO(n) (n> 3) is a connected Lie 
group, it is not simply connected. The simply 
connected Lie group which is locally isomor- 
phic to SO(n) is called the spinor group and is 
denoted by Spin(n). SO(n) is isomorphic to the 
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quotient group of Spin(n) by a normal sub- 
group of order 2. Let 3 be the center of Spin(n). 
Then 3z Z/2Z for odd n, 3z Z/AZ for nz2 
(mod 4), and 3 = (Z/2Z) @ (Z/2Z) for n=0 
(mod4) (— 61 Clifford Algebras). 

The group O(n, C) of all complex ortho- 
gonal matrices is called the complex ortho- 
gonal group, and the group SO(n, C) of all 
matrices in O(n, C) of determinant 1 is called 
the complex special orthogonal group. SO(n, C) 
(n2 3,n 4) is a simple and semisimple com- 
plex Lie group. 


J. Irreducible Representations of Orthogonal 
Groups 


In the same way as for GL (n, K), the irreduc- 
ible representations of O(n) can be obtained 
by decomposing the tensor product D,,(A) 
— AG ...69 A of m copies of an orthogonal 
matrix A using Young's diagram. Namely, con- 
sider the Young's diagram T( fi, fo, ...5f,) 
such that the sum of the lengths of the first 
column and of the second column is not 
greater than n, and call it an O(n) diagram. 
Then to any O(n) diagram T— T(f,, ..., f). 
there corresponds an absolutely irreducible 
representation D?(A; fi, fo, ..., fa), and the 
representations D°(A; f,, f2, ..., f.) correspond- 
ing to two distinct O(n) diagrams are mutually 
inequivalent. D,,(A) can be decomposed into 
the direct sum of those representations D°(A; 
fis fos f) such that f= f, +... + f, takes the 
values m, m — 2, m—4, .... Furthermore, any 
continuous irreducible representation of O(n) 
is equivalent to a D?(A; fi, f2, ..., f.) obtained 
from some O(n) diagram T= T( fi, f2, --- fa) 
In general, two O(n) diagrams T and 7” are 
called mutually associated diagrams if the sum 
of the lengths of their first columns is equal to 
n and if the lengths of each column other than 
the first one coincide. In particular, if T= 
T( fi. ft, f and 2k=n, then T is said to 
be self-associated. The set of all O(n) diagrams 
can be divided into pairs of mutually asso- 
ciated T, T’ (and self-associated T= T"). 
Suppose that we are given mutually asso- 
ciated diagrams T and T’ and that the length 
k of the first column of T= T(f,, fo, .... f.) IS 
not greater than n/2. Then the character y4(A) 
of D?(A; fi, f2, ..., f) corresponding to T and 
the character y 7.(A) of the irreducible repre- 
sentation corresponding to T" are given by 


x1(4) — FEH — Pi-(os1y» 
Pi-(o-2) — Di- (o2) + Pi — pi- asl 
Xr(A) S |Alxr(A) | v— [n/2], 


where p; and |pi(, 4, — Pi-(o41» -| have the 
same meaning as in the formula for the char- 
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acters of irreducible representations of 
GL (n, K). 

The irreducible representations of SO(n) can 
be obtained immediately from those of O(n). 
Namely, if T= T( f,, fo, ..., f) 1s not self- 
associated, D?(A; f,, fo, ..., f,) is irreducible as 
a representation of SO(n), and the represen- 
tations of SO(n) derived from T and the asso- 
ciated T" coincide. If T is self-associated, 
D°(A; fis fos ..., f.) can be decomposed into 
two irreducible representations of SO(n) of the 
same degree over the field of complex num- 
bers. Furthermore, the irreducible represen- 
tations of SO(n) obtained in this way from 
different pairs of associated diagrams are 
mutually inequivalent, while any continuous 
irreducible representation of SO(n) is equiva- 
lent to one of these representations. For the 
representations of SO(3) (the rotation group of 
degree 3) — 353 Racah Algebra. 

Since SO(n) is isomorphic to the quotient 
group of Spin(n) by a normal subgroup N of 
order 2, a continuous representation of Spin(n) 
which is not the identity representation on N 
can be considered as a double-valued repre- 
sentation of SO(n). This representation is called 
the spin representation and is important in the 
field of applied mathematics. 

The orthogonal group O(n) consists of all 
n xn real matrices which leave invariant the 
quadratic form E: + ... + £2, while the group of 
all n x n real matrices which leave invariant the 
quadratic form £2 +... + £2 — £2,, —... of 
signature (r, n — r) is called the Lorentz group 
of signature (r,n—r). The case for n=4 and r= 
3 is used in special relativity (— 359 Relativ- 
ity). Let Gy be the connected component of the 
identity of the Lorentz group of signature 
(3, 1). Then Gg is called the proper Lorentz 
group. For c =(g;;)€G, we have |a|= +1 and 
g44 = | Or g44 € —1. Moreover, we have Go 
={o||ol=1, g4,2 1}, G/Go =(Z/2Z) 6 (Z/2Z) 
('four group), and Gy = SL(2, C)/{ +1}. 


K. Orthogonal Groups over General Fields 


Orthogonal groups can also be defined over 
other general fields than the field of real num- 
bers as follows: Fix a tquadratic form Q(¢, č) 
=" ,=1%6;€; (In #0) over a field K. Then a 
linear transformation of &,(i=1, 2, ..., n) over 
K which leaves Q invariant is called an ortho- 
gonal transformation with respect to Q. The set 
of all orthogonal transformations forms a 
group. This group is denoted by O(n, K, Q) or 
simply O(Q) and is called the orthogonal (trans- 
formation) group over K with respect to Q. In 
particular, the normal subgroup of all trans- 
formations in O(n, K, Q) of determinant 1 is 
called the special orthogonal group over K with 
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respect to Q and is denoted by SO(n, K, Q) (or 
simply SO(Q)). O(n) and SO(n) are special cases 
of O(n, K, Q) and SO(n, KOL where K is the 
field R of real numbers and Q(£, £) is the unit 
quadratic form £2 + £2 4 ... €?. 

Let Q(n, K, Q) be the tcommutator subgroup 
of O(n, K, Q). Then this subgroup coincides 
with the commutator subgroup of SO(n, K, Q). 
If K is of characteristic 42, and if n 2 5 and 
the tindex v of Q > 1, then O(n, K, Q)/3 (3 is the 
center of Q(n, K, Q)) is a simple group, where 3 
= (I) or 3={ EIN (L. Dickson, J. Dieudonné). 
Suppose that K is a finite field F, (of character- 
istic #2). Then we have v=m if n=2m+ 1, and 
v=m or m—1 ifn=2m. Hence vz2 if nz 5. If 
v=0 and K =R, we have Q(n, R, Q) 2 SO(n) 
and, as mentioned before, SO (n)/5 is simple for 
nz 5. The same proposition also holds when 
K is an *algebraic number field (M. Kneser, 
1956). If K is of characteristic 2, then O(n, K, Q) 
= SO(n, K, Q), 3= (I), and Q(n, K, Q) is a sim- 
ple group in many cases (Dieudonné [5]). For 
the case where K is a finite field (Dickson) 

— 151 Finite Groups I. 


L. Symplectic Groups 


Let či, Ča, ..., 65, and Mu, 1, ... Nan be two sets 
of variables, and suppose that the same linear 
transformation A over a field K acts on them 
(from the left). If A leaves the tbilinear form 
Zlatan invariant, this linear 
transformation (or the corresponding matrix) 
A is called a symplectic transformation (sym- 
plectic matrix) of degree 2n. The set of all sym- 
plectic transformations (or matrices) of degree 
2n over K forms a group denoted by Sp(n, K) 
and called the symplectic group (symplectic 
transformation group, complex group, or 
Abelian linear group) over K. 

Any matrix in Sp(n, K) is always of determi- 
nant 1, and the center 3 of Sp(n, K) consists of 
I and —I. The quotient group PSp(n, K) of 
Sp(n, K) by 3 is called the projective symplectic 
group over K. Except for the three cases n= 1, 
K=F,;n=1, K =F;; and n=2, K=F,, the 
group PSp(n, K) (n> 1) is always simple. 


Properties of Symplectic Groups as Lie Groups. 


When K is the field C of complex numbers or 
the field R of real numbers, Sp(n, K) is a Lie 
group. The intersection of the complex sym- 
plectic group Sp(n, C) and the unitary group 
U (2n), namely, the unitary restriction of 
Sp(n, C), is denoted by Sp(n) and is called the 
unitary symplectic group (or simply symplectic 
group). Sp(n, C) is a simple and semisimple 
complex Lie group, and both Sp(n, R) and 
Sp(n) are simple and semisimple Lie groups. 
Moreover, Sp(n) is compact and simply con- 
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nected and gives one of the four series of 
simple, semisimple and compact Lie groups 
(— 249 Lie Groups). 

Let H" be the linear space of dimension n 
over the *quaternion field H. Define the inner 
product of two elements x —-(x,, ..., x,) and y 
—(Y1, Kal in H” by (x, y)e xiyi Fab, 
(y; is the tconjugate quaternion of y;), and 
consider the group of all linear transforma- 
tions which leave this inner product invar- 
iant. Then this group is isomorphic to Sp(n). 
Sp(n) is thus compared with the orthogonal 
group O(n), which leaves invariant the inner 
product of a linear space over the field R of 
real numbers and with the unitary group U (n), 
which has the same property over the field C 
of complex numbers (C. Chevalley [4, ch. 1]). 


M. Irreducible Representations of Symplectic 
Groups 


In the same way as for GL(n, K), the represen- 
tation D,(A) — A ® ... ® A (tensor product of 
m copies of A) of Sp(n, C) can be decomposed 
into irreducible components using Young's 
diagram. Namely, for any Young's diagram 
T-—T(fi.f;...,f,) (k <n) such that the num- 
ber k of rows is not greater than n, an irreduci- 
ble representation D*(A; f,,..., f) of Sp(n, C) 
is determined. These D*(A; f, ..., f,) are mutu- 
ally inequivalent, and D,,(A) can be decom- 
posed into the direct sum of representations 
D'(A; fis f», ..., f) such that f= f, +... + f, is 
equal to any of the values m, m—2, m— 4, .... 
The character of D(A; fi, fa, ..., f.) is given by 


X1(4) 9 i-i Pi-n+2 T Pim Dir t Pi- 2n 2]. 


where p; and |Pi-n+1sPi-n+2 LP. have the 
same meaning as in the formula for the char- 
acters of the irreducible representations of 
GL(n, K). 

For the matrices A in Sp(n), D'(A; fis fo, ..., 
fi) gives rise to a continuous irreducible 
representation of Sp(n). Furthermore, any con- 
tinuous irreducible representation of Sp(n) is 
equivalent to a representation D(A; fi, fo, ..., 
fj) corresponding to some diagram T. 


N. Relations among Various Classical Groups 


There are some isomorphisms (homomor- 
phisms) among the classical groups mentioned 
above. For general fields K — [1, 5]. For finite 
fields K — 151 Finite Groups I. For K =R or 
C, the following isomorphisms hold: SO(3)z 
SU (2)/{ +1}, SU(2)= Sp(1), SO(S) = Sp(2)/ 
{+I}, SO(6)=SO(4)/{ +1} (— 248 Lie Alge- 
bras, 249 Lie Groups). 
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O. Classical Groups over Noncommutative 
Fields 


Let V be a right linear space over a non- 
commutative field K. Then the set of all linear 
transformations of V forms a group under the 
multiplication defined by the composition of 
mappings. This group GL(V) is called the 
general linear group on V. It is isomorphic to 
the multiplicative group of all n x n invertible 
matrices with entries in K. The commutator 
subgroups SL(V) and SL(n, K) of GL(V) and 
GL(n, K), respectively, are called the special 
linear group of degree n on V and over K, 
respectively. Now, suppose that an element A 
of GL(V) leaves each element of a subspace U 
of dimension n — 1 of V fixed. Choose an ele- 
ment x of V which does not belong to U, and 
set Ax = xa (mod U); xe K depends not only on 
A but also on the choice of x. However, the 
conjugate class ¢= [4x4 ! | Ae K*] of ain the 
multiplicative group K* of K is determined 
only by A. In particular, if à — (1) and Az I, 
then A is called a transvection. For a *matrix 
unit Ej, B;(a) I - Ej; is a transvection if 
ij and «40. SL(V) coincides with the sub- 
group of GL(V) generated by all transvections. 
This fact also holds when K is a commutative 
field, except when n=2 and K —F,. In this 
case, transvections generate the whole GL(2, 2), 
which is isomorphic to the symmetric group 
©, of degree 3 and does not coincide with 

the commutator subgroup. The center 3 of 
GL(n, K) consists of all scalar matrices corre- 
sponding to nonzero elements in the center of 
K. Let C be the commutator subgroup of the 
multiplicative group K* of K. Then for nz 2, 
GL(n; K)/SL(n, K) is isomorphic to K*/C. This 
isomorphism can be obtained by appropriately 
defining, for Ae GL(n, K), an element det A of 
K*/C which is called the determinant of A 
[6,9]. The center 3o of SL(n, K) is {al |a" eC]. 
The quotient group PSL(n, K) — SL(n, K)/3 is 
called the projective special linear group of 
degree n over K. If K is a noncommutative 
field, then PSL(n, K) (nz 2) is always a simple 
group [5, 8]. 

Next, let K be any field (commutative or 
noncommutative), and let V be a right linear 
space of dimension n over K. Consider a Her- 
mitian form f(x, y) (2 256 Linear Spaces) on 
V relative to an tinvolution J of K. If for a 
fixed element e in the center of K we have 
f(x, y)  ef( y, x), then f is called an c-Hermitian 
form. For the rest of this article, f is assumed 
to be an e-Hermitian form on V. Let W bea 
subspace of V. If f(x, y) 0 for any x, ye W, 
then W is called a totally isotropic subspace. 
The largest dimension m of the totally iso- 
tropic subspaces of V is called the index of f. 
We always have 2m <n. If f(Ax, Ay) — f(x, y) 
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for any x, ye V, then A is called a unitary trans- 
formation relative to f. 

The set U (n, K, f) of all unitary transforma- 
tions relative to f forms a subgroup of GL(V). 
This group is called the unitary group relative 
to f. Also, the group SU (n, K, f)= U(n, K, f) 
SL(n, K) is called the special unitary group. 
When J — 1 and e=1, a unitary transformation 
(unitary group) is called an orthogonal trans- 
formation (orthogonal group), and U(n, K, f) is 
written as O(n, K, f). Also, when J— 1 and 
€= —1, a unitary transformation (unitary 
group) is called a symplectic transformation 
(symplectic group), and U (n, K, f) is written as 
Sp(n, K). In fact, in these cases, for arbitrary 
choice of f, the corresponding groups are 
mutually isomorphic. 

An e-Hermitian form f is called an ez-trace 
form if for any xe V, there exists an «e K 
which satisfies f(x, x) 2a +ga". If J — 1, = —1 
(hence K is commutative) or £= 1 and K is of 
characteristic Z2, then any e-Hermitian form 
is an -trace form. If f is an e-trace form, a 
linear mapping B of any subspace W of V into 
V such that for any x, ye W, f(Bx, By)— f(x, y) 
can be extended to an element A of the unitary 
group U (n, K, f) relative to f (Witt’s theorem). 
In particular, U (n, K, f) acts transitively on the 
maximal totally isotropic subspaces, and their 
dimensions are equal to the index m of f. Now, 
let P be a Pythagorean ordered field (an or- 
dered field which contains square roots of 
any positive element). If K 2 P and J— 1, or if 
K= P(/-1 ) or if the noncommutative field 
K 1s a *quaternion algebra over P and J is the 
operation of tconjugation of K, then for two 
Hermitian forms f, f’, their unitary groups 
U (n, K, f) and U(r’, K, f") are isomorphic if 
and only if n=n’, and the indices of f and f’ 
are equal. In this case, U (n, K, f) can be writ- 
ten as U (n, m, K), where m is the index of f. If 
the field K is a tquaternion algebra over P and 
f is an tanti-Hermitian form, there exists an 
orthogonal basis (e;) of V such that f(e; ej) 2j 
(quaternion unit), 1 x i € n. Hence, in this case, 
the,unitary group U (n, K, f) relative to f is 
determined only by n and K. 

Suppose that we are given an e-trace form f 
over a general field K whose index m is not 
equal to 0. We exclude the case where J —1 
and = 1. Then the unitary group U(n, K, f) 
contains transvections. Let T(n, K, f) denote 
the subgroup of U (n, K, f) generated by trans- 
vections which are unitary transformations. If 
mz 2, then T(n, K, f) is the commutator sub- 
group of U (n, K, f). The center W, of T(n, K, f) 
coincides with the intersection of T(n, K, f), 
and the center 3 of GL(n, K). If n 23 and K 
contains more than 25 elements, then the quo- 
tient group T(n, K, f/)/W, is a simple group [6]. 
Also, if K is commutative and nz 2, mz 1, 
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J x 1, then T(n, K, f) - SU (n, K, f), except for 
the case where n=3, K =F}. 

If K is the field R of real numbers, the field 
C of complex numbers, or the quaternion field 
H, then GL(n, K), SL(n, K), and U(n, K, f) are 
all Lie groups. In particular, SL(n, K) and 
U (n, K, f) are simple Lie groups except in the 
following three cases: (1) n= 1, K =R or C; (2) 
n=2, K=R, J=1, €=1; (3)n=4, K-RorC, 
J=1,¢=1, m=2. In cases (1) and (2) they are 
commutative groups, and in case (3) they are 
locally direct sums of two noncommutative 
simple groups. 

Suppose that K =H. Since H contains C as 
a subfield, a vector space V of dimension n 
over H has the structure of a vector space of 
dimension 2n over C. From this fact, GL(n, H) 
can be considered as a subgroup of GL(2n, C) 
in a natural way. 

Each of the complex classical simple groups 
G — SL (n, C), SO (n, C), Sp(n, C) has the struc- 
ture of an talgebraic group defined over R 
(— 13 Algebraic Groups). The real forms of 
G, i.e., the algebraic subgroups of G whose 
scalar extension to C is G, can be realized as 
SL(n, K), U (n, K, f) corresponding to K =R, C, 
H. Namely, a real form of a complex classical 
group G is conjugate in G to one of the follow- 
ing groups: (i) The real forms of SL(n, C): 
SL(n, R) (type AI); SL(k, H) only for n=2k 
(type AH); and the special unitary group 
SU (n, m, C), O m x [n/2], relative to a Her- 
mitian form f of index m (type AIID). (ii) The 
real forms of SO(2n + 1, C): the proper ortho- 
gonal group SO(2n + 1,m, R), Ox m xn, relative 
to a quadratic form of index m on a space of 
dimension 2n + 1 (type BI and BII). (iii) The 
real forms of SO(2n, C): SO(2n, m, RL O<m<n 
(type DI and DII); and U (n, H, f) relative to an 
anti-Hermitian form f on H (type DUT (iv) 
The real forms of Sp(n, C): Sp(n, R) (type CI); 
the unitary group U (2n, m, H), O€ mx n, rela- 
tive to a Hermitian form f of index mon H 
(type CII); and Sp(n) corresponds to the special 
case m — 0. The quotient groups of these real 
forms by their centers can all be realized as the 
groups of automorphisms of semisimple alge- 
bras with involutions J which commute with 
J (A. Weil [10]). 
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A. Definitions and Basic Properties 


Let V be an n-dimensional tlinear space over a 
field K, and let Q be a tquadratic form on V. 
Denote the ttensor algebra over V by T(V), the 
tensor multiplication by ®. Let /(Q) be the 
two-sided ideal of T(V) generated by the ele- 
ments x & x —Q(x): 1 (xe V). The resulting 
tquotient associative algebra T(V)/I(Q) is then 
denoted by C(Q) and is called the Clifford 
algebra of the quadratic form Q. The elements 
of C(Q) are called Clifford numbers. 

The composite of two canonical mappings 
1: V9 T(V), o: T(V) C(Q) is a linear injection 
cot:VC(Q). Hence we can regard V asa 
linear subspace of C(Q) via cot. Then C(Q) 
is an associative algebra over K generated 
by 1 and V. Furthermore, x? = Q(x): 1 for 
every x in V. 

Indeed, C(Q) is the universal associative 
algebra with these properties. That is, let A be 
any associative algebra with a unity element, 
and let f: VA be a linear mapping such that 
f(x)? - Q(X): 1 for every x in V. Then f can be 
extended uniquely to an algebra homomor- 
phism Í :C(Q)— A with f (1)— 1. Furthermore, 
let be the tsymmetric bilinear form associ- 
ated with 0: (x, y) - Q(x + y) - Q(x) - Q(y), 

x, ye V. Then xy yx 2 Ó(x, y): 1 for every 
x, y in V. C(Q) is of dimension 2" over K. If 
€, ...,€,is a basis of V, then 


1, ei, eje; (i<j), ..., eg, Cn 


form a basis of C(Q). In particular, if {e;} is an 
orthogonal basis relative to Q, we have 


e? —Q(ej: E 
ij=1,...,7, iÆj. (1) 


e;ej= —ejei, 
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In this case, C(Q) can be defined as an as- 
sociative algebra (with a unity element) gen- 
erated by the {e,} together with the defining 
relations (1). In particular, for Q =0, C(Q) is 
the exterior algebra (tGrassmann algebra) 
over V. 


B. The Principal Automorphism and the 
Principal Antiautomorphism of C(Q) 


There exists a unique automorphism « of the 
algebra C(Q) such that a(x) 2 —x for every x in 
V. This automorphism o is called the principal 
automorphism of C(Q), and we have a? = 1. 
Also, there exists a unique antiautomorphism 
p of the algebra C(Q) such that f(x) — x for 
every x in V. This antiautomorphism f is 
called the principal antiautomorphism of C(Q), 
and we have fi? — 1. 

For the rest of this article we assume that 
the discriminant of Q is 40. We also assume 
for the sake of simplicity that the characteristic 
of Kis #2. Let C* -C*(Q--K:1- V? V^ 
-..,andC 2C (Q)=V+V34+V%H+.... 
Then C(Q) is the direct sum of the linear 
subspaces C* (Q) and C7 (Q). Furthermore, 
CC cC ("C oC Uc .6C and 
C'C cC*.Thus C(Q) - C* - C^ has the 
structure of a tgraded algebra with the index 
group {+1}, and C* is a subalgebra of C(Q). 
The elements of C*(Q), C7 (Q) are called even 
elements and odd elements, respectively. We 
have dim C*(Q)=dimC (Q)=2""'. 


C. The Structure of C(Q) and C*(Q) 


C(Q) and C * (Q) are both tseparable, tsemi- 
simple associative algebras over K. Suppose n 
is even: n 2 2r. Then C(Q) is a tsimple algebra 
with K as its center; the center Z of C ' (Q) is 2- 
dimensional over K. Let e,, ...,e, be an ortho- 
gonal basis of V. Then 1 and z 22'e, ... e, form 
a basis of Z, and we have 


2? - 2" ( —1yQ(ej)...Q(e) -(-1YD, 


where D is the tdiscriminant of ® relative to 
the basis {e,}. Thus if (—1YD has a square root 
in K, Z= K @ K (direct sum), and so C * (Q) is 
decomposed into the direct sum of two simple 
algebras. If (—1)’D does not have a square 
root in K, then Z is a field and C+ (Q) isa 
simple algebra. In particular, if the tindex of Q 
(Oe, the dimension of a maximal ttotally sin- 
gular subspace of V (— 348 Quadratic Forms)) 
is r, C(Q) is isomorphic to the *total matrix 
algebra of degree 2” over K, and C * (Q) is 
isomorphic to the direct sum of two copies of 
the total matrix algebra of degree 2" ! over K. 
Now suppose that n is odd: n —2r + 1. Then 


61 E 
Clifford Algebras 


C * (Q) is a simple algebra with K as its center. 
(In particular, if Q is of index r, then C * (Q) is 
isomorphic to the total matrix algebra of 
degree 2’ over K.) The center Z of C(Q) is 2- 
dimensional over K, and we have C(Q)= 

Z @,x C" (Q). If e,,..., e, is an orthogonal 

basis of V, then 1 and z— e, ...e, form a basis 
of Z. Putting z' 22'*!z, we have z? =2(—1)'D, 
where D is the discriminant of ® relative to 
(ej). Thus if 2(—1)'D has a square root in K, 
C(Q) is the direct sum of two 2?'-dimensional 
simple algebras. If 2( — 1)D has no square root 
in K, then C(Q) is a simple algebra. 


D. The Clifford Group 


Let G be the set of all invertible elements s in 
C(Q) such that sVs ! = V. Then G forms a 
group relative to the multiplication of C(Q). 
This group G is called the Clifford group 

of the quadratic form Q. The subgroup CG" 

= Gf C* (Q) is called the special Clifford 
group. The linear transformation o(s): xo 
sxs~' of V induced by seG belongs to the 
torthogonal group O(Q) of V relative to Q. 
Moreover, the mapping s— ¢(s) is a homomor- 
phism from G into O(Q). Thus q is a frepre- 
sentation of G on V. This representation q is 
called the vector representation of G. The tker- 
nel of o consists of invertible elements in the 
center Z of C(Q). If xe GNV, then Q(x) z0 and 
— q(x) is the reflection mapping of V relative 
to the hyperplane orthogonal to x. If n=dimV 
is odd, 9(G) 2 9(G ^) 2 SO(Q). If n is even, p(G) 
= 0(Q), e(G *) - SO(Q). 

Exploiting the principal antiautomorphism 
D of C(Q), we obtain a homomorphism N:G* 
— K* (the multiplicative group of K) defined 
by N(s)= B(s)s (seG ^), and N(s) is called the 
spinorial norm of se G^. The normal subgroup 
of G* defined as the kernel of N is denoted by 
Go and is called the reduced Clifford group (of 
Q). The subgroup (Gù ) of SO(Q) is denoted 
by Og (Q) and is called the reduced orthogonal 
group. 

In particular, when the ground field K is the 
real number field R, O5 (Q) coincides with the 
"identity component of the tLorentz group 
O(Q). Furthermore, if Q is definite, Of (Q)= 
SO(n), so that the identity component Spin(n) 
of Gj is a tsimply connected fcovering group 
of SO(n) via the covering homomorphism o 
(with each point in SO(n) covered twice). The 
group Spin(n) is called the spinor group of 
degree n. 


E. Spin Representations 


In this section we assume that the ground field 
K is the complex number field C and that n 
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= dim Vz3. Then we have 0$ (Q) z SO(n, C), 
so Gg is a simply connected covering group of 
SO (n, C) via the covering homomorphism q. In 
this section we denote Gg by Spin(n, C) and 
call it the complex spinor group of degree n. 
Spin(n, C) is the tcomplexification (— 249 Lie 
Groups) of the compact Lie group Spin(n) and 
is a complex analytic subgroup of the tcom- 
plex Lie group C(Q)* consisting of all inver- 
tible elements of C(Q). With the bracket 
operation [x, y] 2 xy — yx, C(Q) becomes the 
tLie algebra of C(Q)*. Furthermore, the Lie 
subalgebra of C(Q) associated with the com- 
plex analytic subgroup Spin(n, C) is given by 
2;.,Ce;e;, where e;,...,e, is an orthogonal 
basis of V. The spin representations of the 
group Spin(n, C) are defined as follows: 

(1) When n is odd: n=2r + 1. Since C * (Q) is 
isomorphic to a total matrix algebra of degree 
2' over C, C ' (Q) has a unique (up to equiva- 
lence) tirreducible representation p, which is of 
degree 2’. The restriction of 6 on Spin(n, C) (on 
Spin(n)) defines an irreducible representation p 
of degree 2” of Spin(n, C) (of Spin(n)); p is called 
the spin representation of the group Spin(n, C) 
(of Spin(n)). The elements in the representation 
space of p are called spinors. Thus we can say 
that a spinor is a quantity with 2’ components 
which obey the transformation law according 
to the spin representation (— 258 Lorentz 
Group). This representation p defines a rep- 
resentation of the Lie algebra so(n, C) of 
Spin(n, C) (note that so(n, C) is a tcomplex 
simple Lie algebra of type B,). This represen- 
tation of so(n, C) is also called the spin repre- 
sentation of so(n, C). Note that p is not well 
defined on SO(n, C) or on SO(n); p is of valence 
2 on SO(n, C) or on SO(n). 

(2) When n is even: n —2r. Since C(Q) is 
isomorphic to a total matrix algebra of degree 
2' over C, C(Q) has a unique (up to equiva- 
lence) irreducible representation p, which is of 
degree 2". The restriction of p on Spin(n, C) (on 
Spin(n)) defines a representation p of degree 2" 
of Spin(n, C) (of Spin(n)) p is called the spin 
representation of the group Spin(n, C) (of 
Spin(n)). This representation p is, however, not 
irreducible; p is decomposed into the direct 
sum of two irreducible representations p, and 
p_. They are not equivalent to each other, and 
both are of degree 2" ^!. By taking a suitable 
minimal left ideal L of C(Q) as the representa- 
tion space of the representation f, we obtain 
the representation spaces L', L^ of p4, p-, 
respectively, by putting L* = LN C* (Q) and 
L =LNC (Q) The representation o" (or p`) 
is called the half-spin representation of the 
group Spin(n, C) or of the group Spin(n). The 
elements in the representation space of p* (or 
p )are called half-spinors. Again, p^ and p^ 
are not well defined on SO(n, C) or on SO(n). 
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They are of valence 2 on these groups. The 
representations of the Lie algebra of the Lie 
group SO(n, C) (note that this Lie algebra is a 
tcomplex simple Lie algebra of type D,) asso- 
ciated with p^, p^ are also called half-spin 
representations of this Lie algebra. 
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A. Cluster Sets of Functions Meromorphic in 
an Arbitrary Domain 


Let D be an arbitrary domain in the complex 
z-plane, I its boundary, and E a *totally dis- 
connected closed set contained in I'. Let w= 
f(z) be a single-valued *meromorphic func- 
tion defined in D. Then for each point z, in F, 
we can define the following sets related to the 
mapping w= f(z) in the complex w-plane (or 
on the complex w-sphere CL 


The Cluster Set. A value x is called a cluster 
value of f(z) at zo if there exists a sequence of 
points (z,) such that 


f(z,)a. 


The totality Cp( f. zo) of all the cluster values of 
f(z) at ze is called the cluster set of f(z) at zo 
or, more precisely, the interior cluster set. It is 
a nonempty, closed, but not necessarily con- 
nected set. 


Z,€D, | z,—zg, 


The Boundary Cluster Set. The set of all values 
a such that there exist a sequence of points 
feaz of T — {Zo} (resp. F — {zo} — E) and a se- 
quence of points {w,} in the complex w-plane 
satisfying 
EnZo — WeCp(f e Wa 


is called the boundary cluster set of f(z) at zo 
and is denoted by Cj f Zo) (resp. Cp. &( f, zo)). 
These are closed sets, and 


Cr &( f, 20) < Cc Cf, Zo) € Col f Zo). 


If zoe I — E or zo is an isolated point of E, then 
Co ell zo) = Ci f, zo). Furthermore, Cr f, zo) 
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(resp. Cr. eil, zo)) is empty if and only if zy is 
an isolated boundary point (resp. Ze is an 
exterior point of I — E). 


Range of Values. The set of values « such that 


LEO kt 


is called the range of values of f(z) at zy and is 
denoted by R,( f; zo). In other words, Rp( f, Zo) 
is the set of values « assumed by f(z) infinitely 
often in any neighborhood of zo and is a tG;- 
set. 


Zz,€D, | z,—ze, 


The Asymptotic Set. Let z, be an taccessible 
boundary point of D. If f(z) converges to a 
value « as z tends to ze along a simple arc in D 
terminating at zo, then o is called an asymp- 
totic value of f at zo, The totality Ap{f zo) 

of asymptotic values of f at za is called the 
asymptotic set of f at Zo. If zo is an inacces- 
sible boundary point, we let Ap{ f. Zo) be the 
empty set. 


B. Iversen-Beurling-Kunugui Theorems 


Suppose now that E is empty, and put 
Q= Cpl f; zo) — Cr Cf, zo). 


tIversen’s theorems in the case where D is the 
unit disk |z|< 1 and Zp is a point on |z| — 1 are 
generalized as follows. 
(1) Q— Rpl f, zo) c Ap( f, zo) (K. Noshiro, 1936). 
(2) First Beurling-Kunugui theorem: If zo is 
not an isolated boundary point, then Q is an 
open set. 
(3) Second Beurling-Kunugui theorem: Sup- 
pose that the open set Q is not empty. Then 
f(z) assumes every value belonging to each 
component Q,, of Q infinitely often, with two 
possible exceptions belonging to O,, that is, 
Q, (€ — Ry( f, Zo)) consists of at most two 
values (an extension of tPicard’s theorem on 
an isolated essential singularity). 

Next suppose that E is not empty and of 
tlogarithmic capacity zero, and put Q= 
Colf, zo) — Cre f, zo). If «eO — Rolf, Zo), then 
either « is an asymptotic value of f(z) at zo or 
there exists a sequence of points ¢,¢ E (n= 
1,2, ...) converging to ze such that « is an 
asymptotic value of f(z) at each C, (Noshiro, 
1937). Furthermore, if zy is contained in the 
closure of I — E, then Q is an open set (which 
may be empty), and Q— Rp( f, zo) is at most of 
logarithmié capacity zero (M. Tsuji, 1943). If E 
is contained in a single component I of the 
boundary T, zua is contained in the closure of T 
— E, and Q is nonempty, then w= f(z) assumes 
every value belonging to each component Q, 
of Q infinitely often, with two possible excep- 
tions belonging to Q, (Noshiro, 1950). In par- 
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ticular, if f is bounded in a neighborhood of 
Zo, the number of such exceptional values is at 
most 1. This is still true if each point of E is 
contained in a boundary component that is a 
*continuum containing at least two points and 
Q is not empty (M. Herve, 1955). However, 
this conclusion does not hold if we remove the 
hypothesis on the set E (K. Matsumoto, 1960). 


C. Cluster Sets of Functions Meromorphic in 
the Unit Disk 


Let D be the unit disk (|z| « 1), zo =e" bea 
fixed point on the unit circumference T, A be 
an open arc of F containing zo, and E be a set 
of tlinear measure zero such that zoe Ec A. 
With every e? e A — E we associate an arbi- 
trary simple arc A, in D terminating at e? and 
the curvilinear cluster set C, ( f, e”), defined as 
the set of all values a such that z, € As, Z, >€", 
f(z,)-5a. We put 
Ch gf zo) N M,, 

r>0 
where M, denotes the closure of the union 
Cy, (fe) for all ef in the intersection of A — E 
with |z — zo| <r. 

By using the above cluster set C#_;-(f zo) 
instead of the boundary cluster set Cy. &( f, Zo), 
we obtain results similar to those in the pre- 
ceding section (M. Ohtsuka, 1950; Noshiro, 
1955). Many interesting results have been 
obtained by F. Bagemihl and W. Seidel, E. F. 
Collingwood, O. Lehto and K. I. Virtanen, 
and others concerning the cluster sets of func- 
tions meromorphic in the unit disk. They 
studied functions of Seidel’s class U, normal 
meromorphic functions, and other functions 
where the class U is the totality of regular 
bounded functions in the unit disk possessing 
almost everywhere on |z|=1 radial limits of 
the constant modulus 1; a nonconstant mero- 
morphic function f(z) in |z|< 1 is normal if 
the family { f(T(z))} is tnormal in the sense 
of Montel, where T(z) is an arbitrary con- 
formal mapping of |z| « ! onto itself. 


D. The More General Case 


The definitions of cluster sets are also available 
for arbitrary functions for which neither ana- 
lyticity nor continuity is assumed. If there exist 
two simple arcs A, and A, in the unit disk D 
terminating at a point z=e” such that 


CA. Cf. ef" Crh e?) Ø, 


then z =e” is called an ambiguous point of f. 
Bagemihl proved the following: The set of 
ambiguous points of an arbitrary complex- 
valued function defined in the unit disk D is at 
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most countable [5]. Under the same hypoth- 
esis, the set of points e'? such that Co f, e?) # 
C-(f, e?) is at most countable (Collingwood, 
1960). This result shows the importance of 
introducing the cluster set CZ vi Zo) that was 
previously mentioned. 


E. History 


The theory of cluster sets originated from the 
tvalue distribution theory of analytic functions 
in the neighborhood of their essential singular- 
ities. The first systematic results were those 

of F. Iversen and W. Gross, obtained about 
1920. Subsequent significant contributions 
were made by Seidel, J. L. Doob, M. L. Cart- 
wright, A. Beurling, and others. Since 1940, 
some important results have been obtained 

by K. Kunugui, S. Irie, Y. Tóki, Y. Tumura, 

S. Kametani, Tsuji, Noshiro, and other Japa- 
nese mathematicians. Many results have been 
extended to tpseudoanalytic functions. As can 
be seen from the Bagemihl ambiguous point 
theorem, some properties of cluster sets are 
not intrinsic to analytic mappings [2, 5]. On 
the other hand, it seems to be an interesting 
problem to extend the theory of cluster sets to 
the case of analytic mappings between open 
Riemann surfaces. 
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A. General Notions of Coding Theory 


When we wish to store, to search for, or to 
send information in the presence of noise 
efficiently and with the least error, we can 
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apply various bounds to the efficiency (— 213 
Information theory). Sophisticated coding 
operations are required in order to achieve 
efficiencies as close as possible to the bounds. 
In the abstract sense, the information is gener- 
ally understood to be a choice of an element 
from a finite set X. For implementation, we 
take a set K of q elements called alphabets. 
Each element of K is called a letter. We con- 
sider the direct product K", i.e., the set of all 
sequences of n letters. An injection y from X 
into K” is called encoding. The sequence (x) 
for xe X is called a code word, and the image 

V (X) (a collection of code words) is called a 
code. Such a code is also called a block code. 
The noise is represented by a mapping c: K" 

— K", but usually it is restricted to a certain 
subset Q of the set of such mappings with 
special properties. For example, we usually 
assume that the sequences x e K” and (x) are 
different only at less than d letters, where d is a 
preassigned constant. The inverse mapping 9, 
ie, a mapping o K"5 X satisfying pow(x)=x 
for all xe X, is called the decoding of y. A code 
y satisfying the property o ow(x)év(X) for all 
xe X and for all o €Q is called error-detecting 
with respect to the noise Q. If y has the decod- 
ing 9 satisfying poc owy(x)- x for all xe X 
and for all oe, y is called error-correcting 
with respect to the noise Q. To discuss such 
properties, we can assume X —y(X)(c K”) ` 
without restricting the generality, so hereafter 
we assume this condition. Also, we take q— 2 
unless explicitly stated otherwise. Because 

in many communication systems q —2 is 
commonly adopted, and the generalization 

for other prime powers q can be obtained 
naturally. 


B. Bounds for the Size of Codes 


Let x 2(x,, ..., x), y 2 (yp, Yn E K". The 
Hamming distance d(x, y) = dy(x, y) between 
the elements x and y is the total number 

of unequal bits (x; z yj). We put dmin = 

min (dy(x, y)| x, ye X, x £ yj. When e= 

max (dy(x, o(x))| o € Q, xe X] is less than dain» 
the errors due to Q can be detected, and if 
dnin = 2e + 1, the errors can be corrected. The 
maximal integer t satisfying d min > 2t + 1 is 
often called the error-correcting capability. 
Several important relations are known among 
daim t n (the length of the code), and |X| (the 
size of the code word) as follows. Hamming 
bound: | X |< 2"/>'_, (7). A code satisfying the 
equality here is called a perfect code [6]. Plot- 
kin bound: d... € n| X /2(| X| — 1)). On the 
other hand, if the Varsharmov-Gilbert-Sacks 
bound 2” *> Yi» ?(*:!) is satisfied, there 
exists an (n, k)-linear code (Section C). 


235 


C. Linear Codes 


Let K 2 GF(2)— (0, 1}, K” be an n-dimensional 
vector space over K, and X be a k-dimensional 
linear subspace. Then X 1s called a group code 
or (n, k)-linear code. In the present case, we can 
take a suitable basis of K for which there exists 
a k x (n— k) matrix P over K such that every 
vector x —(x,, ..., x,) representing a code word 
is expressed as x — (zP, z) by a suitable vector zZ, 
and conversely, every vector of this form is a 
code word, i.e., a vector x e X is a code word if 
and only if 


,x,) P — 0. (*) 


(X4; Dune) KEE n 


Introducing the matrix H —[71,.,, — PT], where 
I, , is the unit matrix of order n—k and T 
indicates the transpose, the condition (*) is 
equivalent to xH” =0. Therefore, H is called 
the parity check matrix, x, ,,,,..., x, are 
called the information bits, and x,, ..., x, A 

are called the check bits. If x is deformed to 

y — ox by a noise c, Le, the original signal x 

is transmitted as signal y, we call s= yH? = 

(y — x HT the syndrome of the transmitted 
signal y. If we have an algorithm to determine 
the error vector e= y — x from the syndrome s, 
we have a decoding with error-correcting 
property. For the linear code, it is evident that 
dmin 7 min (the number of nonzero elements of 
x|x (z0)e Xj. 

The Hamming code is given by n — 2"— 1 (m 
being an integer 22), k—n— m, where the (i, j)- 
component h; (i— 1, ..., mj L, ..., n) of the 
parity check matrix H is given by the ith bit of 
the number j — 1 expressed in the binary (2- 
adic) number system. The check bits are x; 
(j=2°,2!,...,2"71), and all other bits are the 
information bits. This code has the error-cor- 
recting property with respect to the noise w 
for which the nonzero component of e = wx — 
x is at most 1. In fact, from the syndrome s= 
(Sy, ...,$,), we compute j=s,+2s,+...4 
2"^1s,, and put e;= 1, e,=0 for all jz j; if j— 0, 
we can put e=0. The Hamming code is a per- 
fect code. 


D. Cyclic Codes 


The cyclic code is a special case of a linear 
code. This is the object of one of the most 
important applications of the theory of finite 
fields. By using this theory, we can actually 
construct the codes with high for error- 
correcting capability in which the encoding 
and the decoding operations are performed 
algebraically. 

An (n,k)-linear code X over GF(2) is called a 
cyclic code if x 2 (x,, KAS X implies (x,, 


Xis E EEN X, oo Xis -> EN rosas 
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., X4, X4)e X. To each xe X, we can associate 
a polynomial of one variable x(Z) 2 x, - x, 
s XQ,$" ! over GF(2). Let gy(é) be the poly- 
nomial of the least degree n — k among the 
polynomials corresponding to xe X. g«(£) is 
called the generator of X, because the property 
xe X and the property that x(€) is divisible by 
g x(&) are equivalent. g,(£) also divides £"— 1. 
Let the quotient be h,(£) 2 (£" — 1)/g4(£). Then 
x€ X is equivalent to x(£)h,(£) z0 (mod(£" — 
1)). h,(£) is called the parity check polynomial. 
The BCH code (Bose-Chaudhuri-Hocquenghem 
code) defined below is a typical example of a 
cyclic code. 

Let « be an element of order n in GF(2") 
(the finite field with 2" elements), and let g,(Z) 
be the polynomial of the lowest degree in 
GF(2) [4] for which ai is a root (i— 1,3, ..., 
2t — 1). The BCH code is given by the least 
common multiple g(é) of the polynomials 
gı (é), 93(€), ---, 921-1 (č). The BCH code has at 
least n— mt information bits and satisfies 
Amin 2 2t + 1, and algebraic decoding methods 
are known [3,5]. The Hamming code is a 
BCH code when t— 1, nz 2" — ]. 


E. Other Codes 


Other important types of codes include con- 
volutional codes (not block codes) for correct- 
ing errors in consecutive digits (burst errors) 
and Goppa codes (an extension of the BCH 
class). For correcting burst errors, special 
cyclic codes or their mixtures are also used. 
This field of research is closely connected to 
information theory, algebra, and various ap- 
plications of combinatorial analysis, such as 
texperimental design [3]. 
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A. General Remarks 


The notion of cohomology operations was 
introduced by L. S. Pontryagin and N. E. 
Steenrod in order to solve homotopy classifi- 
cation problems (— 202 Homotopy Theory). 
Since then, numerous works have proved the 
importance of cohomology operations as 
applied to thomotopy theory, ‘differential 
topology, and other branches of topology. In 
fact, the use of cohomology operations is 
indispensable in studying problems related to 
thomotopy groups, characteristic classes of 
manifolds, etc. 

We denote by the symbol H*(X; A)= 
2, H"(X; A) the tsingular cohomology ring of 
a topological space X with coefficients in an 
tAbelian group A. 


B. Primary Cohomology Operations 


A (primary) cohomology operation (or simply 
an operation) ¢ is a tnatural transformation 


ell A,)>|[H™( ;B,) 


between the cohomology functors defined on 
the tcategory of topological spaces and con- 
tinuous mappings. That ts, ọ is a family of 
mappings satisfying the following conditions: 
(1) For each space X, ọ defines a mapping 


e:[[H^(X; A)o [] H"4X; B) 
A ` u 


that is not necessarily additive. 

(2) For each mapping f: X >Y, the commu- 
tativity f*oo- oo f* holds in the diagram 
[| HX; All W(X; B) 

A ts Hu U* 
[[ 84(Y; 95 [T H”(Y; B). 
H u 


We list here two trivial examples. 

(I) Addition of cohomology groups deter- 
mines an operation o: H'(X; A) x H'(X; A)> 
H'(X; A). 

(II) The *cup product determines an 
operation 


o: HUN: Aj) x HO: A))2 H'* (X; A, ® A2) 
denoted by 


ia, B) - a — p. 

The composite of two cohomology opera- 
tions is defined in the obvious way. Among 
cohomology operations the most important 
ones are operations of one variable. A co- 
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homology operation of type (A, l; B, m) is a 
natural transformation 


oe H ; A)JO H"( ; B). 


D(A, l; B, m) denotes the Abelian group consist- 
ing of all such operations. 

Denote by H"(A,[; B) the mth tcohomology 
group of an fEilenberg-MacLane space K(A, I), 
and let ue H'(A, I; A) be the "fundamental class 
of K(A, 1). If X is a CW complex, by assigning 
f*uto each f: X 9 K(A,1), we obtain a one- 
to-one correspondence between the set of the 
homotopy classes z(X, K(A,1)) and the co- 
homology group H'(X; A) (> 70 Complexes 
F). Hence by utilizing condition (2), it can 
be shown that the value of o on H'(X; A) is 
uniquely determined by its operation on 
H'(A, l; A). Thus the assignment o — qu defines 
the isomorphism O(A, l; B, m) = H"(A, [; B). 
Here we should remark that the isomorphism 
n(K(A,l), K(B,m))z H(A, l; B) defines a one- 
to-one correspondence O(A, l; B, m) 5n(K(A, D), 
KOR m)). In some cases H(A, |; B) vanishes, 
for example, when 0< m<l;, l=] <m, A=Z; 
l=2, m=2w +1, A=Z;1=2I'<m=2m' +1, 
A-—Z:B-Q;lz2l'- 1 «m, A=Z, B=Q; A 
is finite, B = Q; etc. 

The following four types of operations to- 
gether with the two above are called elemen- 
tary operations: 

(IIT) Homomorphisms induced by a coeffi- 
cient homomorphism: There are homomor- 
phisms 5, : H'(X; A) H'(X; B) induced by a 
homomorphism y: A B. 

(IV) Bockstein (cohomology) operation: This 
operation is given by the tconnecting homo- 
morphisms ó*: H'(X; C) 9 H'*! (X; A) asso- 
ciated with a short exact sequence 05.4 B 
C —0 of coefficient groups (— 200 Homo- 
logical Algebra). For example, the coefficient 
sequence 05 ZZ Z, —>0 (Z, = Z/nZ) defines 
a Bockstein operation (or Bockstein homomor- 
phism), which is usually denoted by (1/n)ó or 
As 

(V) Steenrod (or reduced) square operations 
Sq' (12:0): Sal are sequences of operations 
defined by the following five axioms [2, 5]: 

(V1) For each pair of integers i2 0 and />0, 


Sq: H'(X; Z,)> H'*(X;Z,) 


is a natural transformation of functors that is a 


homomorphism. 

(V2) Sq? « 1. 

(V3) If degx =i, then Sq! x = x — x (cup 
product). 

(V4) If degx <i, then Sq'x —0. 

(V5) (Cartan formula) 


qx —y)- 2 Sq!x — Sq' iy. 
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These five axioms imply the following two 
formulas: 

(V6) Sq! is the Bockstein operation f), of the 
coefficient sequence 05 Z, 2 Z,—Z,0. 

(V7) (Adem relations) If 0 « i « 2j, then 


Solo Sqi= 53 a l "ern Sq* 
Ko Ni —2k 
The binomial coefficient is taken mod 2. 

We can extend the definition of the Sq! so 
that they operate on the relative cohomology 
groups. Now (V1), (V2), and (V5) imply: 

(V8) If ô: H'(Y; Z2)> H''! (X, Y; Z;) is the 
tcoboundary homomorphism, then ôo Sq' = 
Sqio6. 

(V') Steenrod pth power operations 7 (i> 0): 
Let p be an odd prime. Then #' is a sequence 
of operations defined by the following five 
axioms [5]. 

(V'1) For each pair of integers i2 0 and />0, 


P: H(X;Z,)o H'*?e (x: Z ) 


is a natural transformation that is a homo- 
morphism. 

(V'2) 29 — 1. 

(V'3) If deg x 22k, then 2x = x”, 

(V'4) If deg x «2k, then *x —0. 

(V'5) (Cartan's formula) 


(x —y)2 Y Pix Piy. 
j=0 

These axioms imply the Adem relations for 
# and the Bockstein homomorphism £, asso- 
ciated with the coefficient sequence 02 Z,— 
Z,2>Z, 0 (— Appendix A, Table 6.II). 

We can extend the definition of to the 
relative cohomology groups too, and we ob- 
tain d0oP=F'06. 

(VI) Pontryagin pth power operations D. 
Let p be a prime. *B, is a system of operations 
satisfying the following five conditions [3]: 

(VI1) For each pair of integers />0 and 
hz, 


B UTC Z,) H"(X; Z p+) 


is a natural transformation. 
(VI2) If 7: Z «i Z, is a homomorphism 


defined by (1) — 1, then we have n, o B,x =x”. 


(VI3) If p: ZZ, Zi is a homomorphism 
defined by p(1)— p, then we have 


P, (x+y) 


p-1 
=P, x+ TE n (MD ~ yr) 


(VI4) B(x — y) 2 S, x — By. 

(VIS) If p» 2 and degx —2k 4- 1 (odd), then 
3x =0. 

Let A, B be finitely generated Abelian 
groups. Then the computation of H*(A,1) 
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shows that each element of O(A, I; B, m) can be 
written as the composite of a finite number 

of the operations of the types (I)- (VI) (H. 
Cartan). 

Here we show some examples. Let u be the 
fundamental class of K(Z,,2). Then the ele- 
ments of H?(Z,,2;Z)) x Z4, H(Z5,2; Z4) 
Z, H*'(Z,,2; Z)& Z4, and H5(Z,,2; Z4) 
Z,+Z, correspond to the cohomology opera- 
tions A2,, 4n, 0(0/2), P2, and pn, 0 (6/4) 0 
P, +12, o Sq? o Sq!, respectively, where A, u 
denote integers satisfying O4« 1, De 
Lx3 and ņ:Z>Z,,2:Z, >Z, stand for the 
homomorphisms defined by n(1)= 1, 2(1) — 2, 
respectively. 

Suppose that we are given sets of integers l; 
and m;. We define the stable (primary) coho- 
mology operation o with respect to these sets 
of integers /; and m; as a system of natural 
transformations 


e: [[ H"^( ; Ayo [[H"*"«( ;B,) 
À H 


satisfying the following condition, for all 
integers n>0; 

(3) Let S: H'*! (SX; A)> H'(X; A) denote the 
tsuspension. Then the commutativity Sog= 
pos holds in the diagram 


I] H"*4(X; A) 5T] H"*r«(Xx; B) 

: ts * 18 

[ [ah {SX; EIER He SX: B). 
A H 


This condition is equivalent to the commuta- 
tivity with the coboundary homomorphisms. 
For example, the cohomology operations 


(71 B,: H'(X; Z)> H"" (X; Z) 


define a stable cohomology operation £. Sq’, 
däi are also examples of stable cohomology 
operations. 

A stable cohomology operation o of type 
(A, B) and of degree q is a sequence of coho- 
mology operations of type (A, n; B, n +q) de- 
fined for all integers n z 0. (A, B), denotes the 
Abelian group consisting of all stable coho- 
mology operations of type (A, B) and of degree 
q. When A= B, 3((4) - DF» UA, A), is a 
*graded algebra, where multiplication of two 
operations is given by their composition. Let p 
be a prime. Then 9l(Z,) is called the Steenrod 
algebra mod p and is denoted by 9I(p). 91(2) is 
the augmented graded algebra over Z, gen- 
erated by Sq! subject to the Adem relations. 
Suppose that we are given a sequence of non- 
negative integers I —(i,, i5, ..., i). We call Z 
an admissible sequence if i,_, > 2i, holds for 
2<s<k. We write Sq! - Sq oSq?o...oSq'*. If 
I is an admissible sequence, we say that Sq! is 
an admissible monomial. The admissible mono- 
mials form an additive basis for 9I(2), which 
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has the structure of a *Hopf algebra whose 
tcomultiplication y : 91(2) — 9L (2) &) 91(2) is 
given by WSq' = Xi. Sq & Sq' ?. The dual 
space OI = Homz (91(2), Z;) gives a Hopf 
algebra that is the polynomial algebra gen- 
erated by € of degree 2' — 1, where €; is the dual 
element of Sq? "Sai ^... Sq! with respect 
to the additive basis. The comultiplication 
Q* :3L(2)* + 91(2)* @ 9L(2)* is given by 9*2; 
e 5 5,0 er 

9I(p) has properties similar to 9I(2) (p is an 
odd prime). In particular, däint is a *Hopf 
algebra generated by #! and f subject to the 
Adem relations with comultiplication y given 
by WB=6@14+1@ and dät ul Q 
4^). An additive basis of W(p) is given by 
(frog: f^i... 9,593, wheree -0 or 1, iz 
Pim+1 £4, (m= 1) and i, 2 1. Such a sequence 
I — (£g, i4, £1, ..., i4, &) is also called an admis- 
sible sequence. Denote the dual element of 
PPE? P and 9» on". P! B by či 
and 7;, respectively; then the dual algebra 
QI(p)* is isomorphic to the tensor product of 
the polynomial ring over Z, generated by ¢,, 
€,,... and the exterior algebra over Z, gen- 
erated by t,, T2, .... The comultiplication q* is 
given by GR = Èi- 5,0 and o*(z)— 
7,@ 14+ Eu EP; @ r [5] (— Appendix A, 
Table 6.IIT). 


C. Secondary Cohomology Operations 


Here we restrict our attention to a special 
type of operation treated by J. F. Adams [4] 
that has been proved to be powerful in 
applications. 

For a specified prime p, we write H *(X)— 

£4 H(X; Z,). We can regard H '(X)asa 
*graded left module over dipl, Now let C, 

(s — 0, 1) be a pair of tleft free modules over 

9I (p), and let d: C, —5 C, be a graded homomor- 
phism over dp), Suppose that we are given an 
element ze C, such that dz 20. For future pur- 
poses, C, and C, are assumed to have bases 
{co,,} and {c,_,} with degcy ; —1; and dege, , 
=m, respectively, in terms of which d has the 
representation dc, ,, — $a, ;co, ;. Then z is 
expressed in the form z 2 X), DACH, 

We say that ® is a stable secondary coho- 
mology operation associated with the pair (d, z) 
if it satisfies the following four axioms: 

(1) Let D"(d, X) be the module consisting of 
homomorphisms e: Cy H * (X) over 9I (p) of 
degree n such that ed —0. Putting &(co, ;) ^ X;, 
we can assume that D"(d, X) 2 {e= [T;x;e 
IL H"*^(X)|3, a, ;x, —0]. Let Q"(z, X) be 
the submodule of H ' (X) consisting of ele- 
ments of the form £(z), where č is an din. 
homomorphism: C, >H * (X) of degree n— 1. 
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In other words, Q"(z, X) - X, b, H""« ! (X). 
Then for each nz 0 and each X, Pisa 
mapping 

Q:D"(d, X) H * (X/Q"(z, X). 


(2) For each mapping f: X 5 Y, the commu- 
tativity £f *oo— doo f* holds in the diagram 


D"(d, X) SH * (X)/Q"(z, X) 
Ir T/* 
D'(d, VH *(YyQ"(z, Y). 


(3) Let S: H"*! (SX) H"(X) denote the 
suspension. Then the commutativity Soo = 
QoS holds in the diagram 


D"(d, X) H * (Xy/Q"(, X) 
TS Ts 
D^" (d, SX) H*(SX)/O"*"(z, SX). 


(4) Let i: YX be an injection such that 
i*oe=0. That is, i*x, 20. We can then find 
homomorphisms (over 9I(p))g: Cy H +(X, Y) 
of degree n and (: C, >H * (Y) of degree n— 1 
such that the following diagram commutes: 
HIV Hr n (x, Y) H* (Y) H*(X) 

tn ti 
tel ert e mq 
Then for any such pair (y, ) we have i* o Pe = 
€zmod i*Q"(z, X), where ó* denotes the homo- 
morphism defined by ó*y =(— 1)dim»óy, 

For each pair (d, z), there is at least one 
associated operation ®. Existence is proved by 
means of a tPostnikov system. Let and dr be 
two operations associated with the same (d, z). 
Then they differ by a stable primary operation 
in the sense that there is an element oe Co/dC, 
such that d'zz e + pe mod Q(z, X), where 
Q 2 date: moddC, means pe= 3, a;x,;. 

For example, define the action of 9I(2) on 
Z, by the rule a-v=0 if dega» 0 and 1: vv 
for each ve Z,. We consider a minimal projec- 
tive resolution 


E d d 
0-Z, -CC 4C,- ab 


(— 200 Homological Algebra). First we take 
Cy = 3L(2) and define e(1)=1. Next we take C, 
to be free over WO) with generators c;, where 
0x i and d,c; Sai, Furthermore, we take C, 
to be free over A(2) with generators c; ; with 
O<i<j,j#i+1 and 


d(c; ) - Sq? ¢j+ Y bij. 
O<k<j 


with b; ; x€ A(2). Here the d,c;=Sq” form 

a minimal set of generators of 9I(2), and 

the equations 0— di dzc; ,— Sq? o Sq? + 
Lo<x<jbj,;,x5q" form a minimal set of gen- 
erators over 2[(2) of the Adem relations. Let 
C,(j) be the submodule over 9I(2) generated 
by c, with Ox k <j in C,, and let d,(j) be the 
restriction of d, on C UL We write z; ,—d;c; j. 
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Let 6; ; denote an operation associated with 
(d, (j), zi, ). Then 6, ; is defined on the sub- 
module D; ((X) consisting of elements xe 
H"(X) such that Sq?'x =0 for O« k <j and 
takes values in H"*?'* "^! (X) modulo the 
submodule 


PAX \= Sq? Hr? x) 
ge X bij H*? (X). 


O<k<j 
For example, , o: Ker f; ^ Cok fi, is the 
generalized Bockstein homomorphism defined 
by 6/2? and 6, , is the operation discovered 
by J. Adem that appears as the ftertiary ob- 
struction of S" for n>2. If k>3, we have the 

relation 


(moa y 2491,00). 
O<i<j<k 


J#it1 


These formulas can be applied to prove the 
nonexistence of an element with fHopf- 
invariant 1 in z,,_,(S") unless n= 1,2,4,8 
(Adams [4]). Analogous formulas also hold 
for 9I(p) with p> 2. We have no satisfactory 
theories concerning cohomology operations of 
orders higher than 2. For cohomology oper- 
ations in generalized cohomology theory — 
237 K-Theory. 
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Combinatorial Manifolds 


A. Introduction 


First of all, we explain some notions in geo- 
metric combinatorial topology, also called PL 
(piecewise linear) topology, in order to distin- 
guish between the usual topological view- 
point and the combinatorial (or PL) one. By 
an n-dimensional simplicial complex K we 
mean an n-dimensional, tlocally finite and 
trectilinear simplicial complex in the Euclidean 
N-space R, i.e., an n-dimensional *Euclidean 
simplicial complex (= 70 Complexes). The 
subspace P =|K| of RY covered by all sim- 
plexes of K is called an n-dimensional poly- 
hedron, and K is referred to as a simplicial 
division or simply a division of P. A poly- 
hedron covered by a fsubcomplex of a division 
of P is called a subpolyhedron of P. A poly- 
hedron contained in a polyhedron P is a sub- 
polyhedron of P if and only if it is a closed 
subset of P. An open subset of a polyhedron P 
is always a polyhedron but not necessarily a 
subpolyhedron of P [13]. For arbitrary divi- 
sions K, and K, of a polyhedron there is a 
common "subdivision of K, and K,. Thus a 
property of a simplicial complex K which is 
invariant under subdivision is called a com- 
binatorial property of K or of a polyhedron 
|K|. Two polyhedra P and Q are said to be 
combinatorially equivalent if they have tisomor- 
phic divisions K and L, respectively. Then a 
homeomorphism f': P Q which is induced 
from an isomorphism f: K >L is called a 
combinatorial equivalence. Since a combina- 
torial equivalence f: PQ is an isomorphism 
in the category of polyhedra and PL (*piece- 
wise linear) mappings, (PL maps) which is 
called the PL category, it is also called a PL 
homeomorphism or a PL isomorphism, and P 
and Q are said to be PL homeomorphic or PL 
isomorphic. A ttopological space X which 

is homeomorphic with an n-dimensional 
polyhedron P is said to be triangulable and 

is referred to as an n-dimensional topological 
polyhedron. Then a division K of P, or a 
homeomorphism t: P X, is called a triangula- 
tion of X. A division K of P is a triangulation 
of P. However, there is a compact polyhedron 
of dimension n (> 5) which has a triangulation 
K not combinatorially equivalent to its divi- 
sion ( — 70 Complexes; for triangulation prob- 
lems on manifolds — Section C below). The 
notion of a simplicial complex or cell complex 
had been introduced as a tool to describe 
topological invariants of triangulable mani- 
folds (H. Poincaré [1]). The notion of an n- 
dimensional combinatorial manifold (i.e., a 
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polyhedron locally combinatorially equivalent 
to an n-dimensional simplex) was established 
as an important geometric object in com- 
binatorial topology by means of J. H. C. 
Whitehead's theory of regular neighborhoods 
[2, 1939, 1940]. The 1960s saw remarkable 
results in the study of combinatorial manifolds 
together with the topological study of dif- 
ferentiable manifolds, which gave rise to con- 
temporary combinatorial or PL topology 

[3, 4]. The fundamental conjecture (Haupt- 
vermutung) for combinatorial manifolds and 
the combinatorial triangulation problem of 
manifolds were negatively solved by R. C. 
Kirby and L. C. Siebenmann [5, 1969]. In the 
1970s, important results about general trian- 
gulations were obtained by T. A. Chapman [6] 
R. D. Edwards, and J. W. Cannon [7]. 


B. Pseudomanifolds and Homology Manifolds 


In this section we explain some topological 
properties of triangulable manifolds which can 
be defined by means of incidence relations 
among their triangulations. A tpolyhedron M 
=|K| is called an n-dimensional pseudomani- 
fold if K satisfies the following three con- 
ditions: (i) Every simplex of K is either an n- 
simplex or a face of an n-simplex; (ii) each 

(n — 1)-simplex is a face of at most two n- 
simplexes; (iii) for any two n-simplexes c, t of 
K, there exists a finite sequence of n-simplexes 
0 —09,0,,...,0,— t such that o; and o;,, have 
an (n — 1)-face in common. 

Consider the set S of (n — 1)-simplexes of K, 
each of which is a face of only one n-simplex. 
Then the set of all ceS together with their 
faces forms a subcomplex L of K. The poly- 
hedron |L| of L is called the boundary of the 
pseudomanifold M =|K| and is denoted by 
OM as well as by ôK = L. The boundary 
of a pseudomanifold is not necessarily a 
pseudomanifold. 

Let |K| be an n-dimensional polyhedron (or 
*cell complex). A point p in |Kl is called a 
regular point of |K | if it has a neighborhood in 
|K| homeomorphic to an n-dimensional sim- 
plex; otherwise, p is called a singular point. A 
pseudomanifold without singular points is a 
topological manifold, called a triangulated 
manifold. An n-dimensional polyhedron |K | is 
a pseudomanifold if and only if the set of all 
regular points in |K | is dense and connected 
and the set of all the singular points is of di- 
mension less than n — 1. Thus a connected 
triangulated manifold is a pseudomanifold 
whose boundary coincides with the boundary 
as a topological manifold: 

Let t,, 7; be oriented n-simplexes of an n- 
dimensional pseudomanifold M 2 |K |. Suppose 
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that they have an (n — 1)-face o in common. If 
the tincidence numbers of these simplexes with 
c satisfy the relation [1,,0] 2 — [15.6], then t, 
and t, are said to be coherently oriented. We 
call M —|K| with all its n-simplexes oriented 
an oriented pseudomanifold if any two n- 
simplexes t, and t, are coherently oriented 
whenever they have an (n — 1)-face in common. 
For an oriented pseudomanifold, the formal 
sum of all its oriented n-simplexes forms an 
integral cycle of the pair (M, 0M), which is 
called the fundamental cycle of (M, ôM). Its 
homology class, or fundamental class, gener- 
ates the homology group H,(M,0M; Z)=Z. 
When M =|K| is nonorientable, the funda- 
mental class with coefficient Z, (Z, = Z/2Z) is 
similarly defined. A pseudomanifold is orient- 
able if and only if the topological manifold 
consisting of all its regular points is orientable. 
If for each point p of a polyhedron M 2|K| 
the local homology groups are H;(p)=0 (i 

n) and H,(p) —Z or 0, we call M=|K| an n- 
dimensional homology manifold, where H;(p) — 
H,(M, M — p; Z). If M is connected, it is also 

a pseudomanifold. The set OM of points p of 
M =|K| with H,(p)=0 is called the boundary 
of M and is an (n — 1)-dimensional homology 
manifold without boundary. 

Let M 2|K| be an n-dimensional homology 
manifold. Given simplexes c, t of K, o «t 
means that øg is a face of t and o x. By defi- 
nition, a vertex of the tbarycentric subdivision 
Sd K of K is the barycenter of a simplex of K, 
and a k-simplex t of Sd K is defined to be 
1696, ...G,|, where Go, G1, ..., 0, are simplexes 
of K such that og «6, <... « o, and 6, is the 
barycenter of o;. In the following we denote 
Sd K by K'. For a q-simplex o of K, we de- 
note by K (ø) the union of all (n — q)-simplexes 
166, ...6, | of K' such that e «e, <... < 6, ,. 
Then K(o) is an (n — q)-dimensional homology 
manifold and pseudomanifold contained in M. 
K (o) is called the (n — q)-dual cell or simply the 
dual of c. Let K, be the subcomplex of K’ 
consisting of simplexes of K' contained in 
K(a). Thus we have |K,|=|K(c)|. We denote 
by (K(c)) the union of simplexes of K, which 
do not intersect with c. The set of dual cells 
K*={K(o)|cEK} of simplexes of K is called 
the dual complex of K; and K* is also called 
the dual subdivision of | K|. K* satisfies the 
following conditions: 
(i) LJ; ex K(o)=|K|. Each K(a) is the "on 
ó *(K(c)) of the barycenter € and (K(o)). 
(ii) ¢<t<>(K(o)) > K(t) (o, te K), (K(o)) = 
a<rex K(1). 
(iii) If K(o) K(1) z Ø, then there exist sim- 
plexes of K with c and t as faces, and for the 
least simplex p of K with c and 7 as faces, it 
holds that K(c)'| K(t)= K(p). 

The boundary 6(K(o)) of the (n — q)- 
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dimensional homology manifold K (ø) satisfies 
((K(e)) 2 (K (o)) U ((OK)(o)), where (0K)(o) 
denotes the dual cell of c in 0K, and we under- 
stand that (OK)(c) — Ø if o¢ 0K. K*U(0K)* is 
called the dual complex of (K, 9K) and is de- 
noted by (K,0K)*. For an r-simplex o (0r < 
n — 1), it holds that 


HÁ(K(o)),(0K(0));Z)-0, — izn—r—l, 
H,-,-1((K(9)), (0K(0)); Z) = Z. 


Let |K*|? be the union of i-dual cells of 

(K, 0K)* = K* U(0K)* such that ix q. Let 
C,(K*) = ff, (|K*|4/|K* |"), and let 6,:C,(K*) 
—C,_,(K*) be the tconnecting homomorphism 
0,: A (I K* M K* A, KSE? 
in the treduced homology exact sequence 

of ((K*]1,|K*|1 1, |K*|* ?) (2 201 Homol- 
ogy Theory F). Then C,(K*) is a free Abe- 
lian group generated by the q-dual cells 

of (K, 0K)*, and C(K*)={C,(K*), @,} isa 
tchain complex. It follows from the property 
of H,((K(0)) , (OK (c)) ; Z) as above that 
H,(C(K*))= H,(|K]) as in the case of a *CW 
complex. 

Now suppose that K is oriented. For a q- 
simplex o — |aga, ... a,| we give an torientation 
oo, gi, ..., a4]. Furthermore, we choose a,,,, 
0442, , 04 SO that [a5,4,, ...,a,] is the orien- 
tation of |aga, ... a,| induced from the orien- 
tation of K, and we give K (øo) the orientation 
determined by [6,6,,...,6, ,], where o;= 
Läd: Gar We define o K(o)=1 for o and 
K(o) thus oriented, and r K(o) 20 if t zo. 
Let cz 2:4;o;€ C (K) and c* = È u;K(oj. Then 
we define c: c* € Z by c: c* — 3; ;Auj(o;: K(oj)). 
The integer c: ch is called the intersection 
number of c and c*. For homology classes 
[z] e H(K) and [z* ] e H,..,(K*), the intersec- 
tion number z: z* is independent of the choice 
of representing cycles, and thus we define the 
intersection number [z]: [z*] =z: z*. 

When X is compact, the isomorphism of 
the Poincaré-Lefschetz duality theorem 
H,(X,éX;R)=H" *(X; R) is given in terms of 
intersection numbers. 


C. Combinatorial Manifolds and PL Manifolds 


A polyhedron which is PL homeomorphic 
with a k-dimensional simplex or its boundary 
is called a PL k-ball or a PL (k-1)-sphere and is 
denoted by B*, S*~!, respectively. A simplicial 
complex K or a polyhedron M =|K] is called 
an n-dimensional combinatorial manifold if 
the tstar of each vertex in K is a PL k-ball. If 
M is an n-dimensional combinatorial mani- 
fold, then so is an open subset of M. An n- 
dimensional PL manifold is a topological 
manifold X with a distinguished maximal 
coordinate system z = {(U,,h,)}, called a PL 
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structure on X, which consists of homeomor- 
phisms h;: U; V, from open subsets U; onto 
open subsets V, of an n-dimensional simplex 
such that h;o hj ! MED U)  h(U; U) are 
PL homeomorphisms. For a PL manifold 
(X,n) there is a triangulation t: M X from a 
combinatorial manifold M such that for each 
(U; bien h;ot|t ! (U) is a PL homeomor- 
phism from an open subset t ! (U;) of M onto 
h;(U;). Thus the notions of combinatorial mani- 
fold and PL manifold are essentially the same. 
For a tsmooth manifold (X, o), there is a 
unique PL homeomorphism class of a com- 
binatorial manifold by means of a tsmooth 
triangulation. In this sense, smooth manifolds 
are combinatorial manifolds; but the converse 
of this is not true (— 114 Differential Topol- 
ogy C). A triangulation of a topological mani- 
fold which is a combinatorial manifold is 
called a combinatorial triangulation. The trian- 
gulation problem for n-dimensional topolog- 
ical manifolds, especially the existence and 
uniqueness problem of their combinatorial 
triangulations, a long-standing problem, is 
stated as follows. CT, (Combinatorial Triangu- 
lation Problem): Is every n-dimensional topo- 
logical manifold homeomorphic with a com- 
binatorial manifold? CH, (Hauptvermutung for 
combinatorial manifolds): Homeomorphic n- 
dimensional combinatorial manifolds should 
be PL homeomorphic. When n <3, CT, and 
CH, hold; this was shown for n —2 by T. Rado 
[15] and for n2 3 by E. E. Moise [16]. More- 
over, in this case, topological manifolds admit 
unique smooth structures, and homology 
manifolds are combinatorial manifolds. For 

n Z 5, tsurgery theory was used and developed 
as an obstruction theory for "homotoping" 

a homeomorphism h: M >M’ between n- 
dimensional combinatorial manifolds to a PL 
homeomorphism by D. Sullivan (1967). By the 
discovery of the torus-unfurling method for 
"isotoping" h: M 9 M' to a PL homeomor- 
phism, Kirby reduced CT, and CH, for n2 5 
to CH, for some specific manifolds, such as 
thickened tori modulo the boundary, for 
which surgery methods had been sufficiently 
developed (for isotopy — Section D). The 
solutions (1)-(9) can be stated as follows. 

(1) Classification of combinatorial trian- 
gulations. Assume n > 5. Existence: An n- 
dimensional closed topological manifold X 
admits a combinatorial triangulation if and 
only if an obstruction A(X)e H*(X; Z,) van- 
ishes. Uniqueness: A homeomorphism h: M ^ 
M' between n-dimensional closed combina- 
torial manifolds is isotopic (— Section D) to a 
PL homeomorphism if and only if an obstruc- 
tion A(h)e H*(X; Z,) vanishes (Kirby and 
Siebenmann [5]). 

In fact, for each nz 5, there is an n- 
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dimensional combinatorial manifold which is 
homeomorphic to a torus S! x ... x S! but is 
not PL homeomorphic, and there is an n- 
dimensional closed manifold which is not 
homeomorphic to a combinatorial manifold 
(Siebenmann [18]). 

As a special case, the following conjecture is 
proved for nz 5. 

(2) The n-dimensional annulus conjecture. 
The closed region bounded by two mutually 
disjoint locally flat (n — 1)-dimensional topo- 
logical spheres S, and S, in an n-dimensional 
sphere S" is homeomorphic with an n- 


dimensional annulus $"^! x [0,1] (Kirby [19]). 


Regarding the uniqueness problem for (not 
necessarily combinatorial) triangulations of 
topological manifolds, we mention the follow- 
ing deep result. 

(3) Double suspension theorem for homology 
spheres. The double suspensions of homology 
n-spheres are homeomorphic with S"*?, where 
a homology n-sphere is a closed topological 
manifold with the same homology groups as 
those of $” (Cannon [7]). 

For each n 23, there is a homology n-sphere 
P" which is a combinatorial manifold with 
the nontrivial fundamental group (for n — 3, 
see Poincaré [1, 1904] and for n 24, see M. 
Newman, 1948). The suspension of P" is an 
example of an (n + 1)-dimensional homology 
manifold which is not a topological manifold. 
The double suspension of P" is not a com- 
binatorial manifold but is homeomorphic 
with $"*?. The problem of whether every n- 
dimensional topological manifold is homeo- 
morphic to a polyhedron is still open for nz 
4. This problem is reduced to a problem in 
homology 3-spheres (T. Matumoto; D. E. 
Galewski and R. J. Stern). 

In the following we list some results on 
combinatorial manifolds focusing on White- 
head’s theory of regular neighborhoods [2]. 
Let M =|K| be an n-dimensional combina- 
torial manifold without boundary. The dual 
complex K* consists of PL balls. Thus the 
division K of M gives rise to a decomposition 
into PL thandles H(K) of M in such a way 
that for each k-dimensional simplex o € K, 
the star Ste. (6) of its barycenter ĉin the sec- 
ond barycentric subdivision K" of K is a PL 
handle of index k. If L is a subcomplex of K or 
K*, then the star St,.(|L]) of|L| in K” isa 
subhandlebody of H(K) or of its dual han- 
dlebody H*(K)= H(K*), denoted by H&(L) 
or Hy«(L), respectively. In particular, if M 
is closed, orientable, and of dimension 3, 
then U=H,(K"”) and V = Heil kä are 
PL homeomorphic with a PL boundary- 
connected sum of some copies of a solid torus 
S! x B?, and M=UUV, UN V=6éU —0V, 
where KIT and Kf, stand for the k-*skeletons 
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of K and K*, respectively. This decomposition 
(U, V; GU =V) of M is called a Heegaard 
decomposition (or splitting) of M [12]. We say 
that a simplicial complex L collapses elemen- 
tarily to a subcomplex L, of L, referred to as 
an elementary collapse L N L,, if L- L| = 
16, 1j, where o is a k-dimensional simplex of L 
and z is a (k — 1)-dimensional face of g which is 
not a face of any simplex in L — (6, 1). We say 
that a polyhedron P collapses to a subpoly- 
hedron Q of P or Q expands to P, referred to 
as a collapse P N Q or expansion Q 7 P, if there 
is a division L of P containing a subcomplex 
L, dividing Q so that there is a sequence of 
elementary collapsings LN L, v... v L,. Fora 
subpolyhedron P of a combinatorial manifold 
M,asubpolyhedron U of M is called a regular 
neighborhood of P in M if (1) U is a combina- 
torial manifold which is a closed neighbor- 
hood of P in M and (2) UN P. In general, for a 
subcomplex L of a simplicial complex K, the 
‘star St, (|L]) of |L| in the second barycentric 
subdivision K" of K is called a second barycen- 
tric derived neighborhood or simply a derived 
neighborhood of | L| in | K| (E. C. Zeeman [4]). 

(4) Regular neighborhood theorem. Let P be 
a compact subpolyhedron of a combinatorial 
manifold M. Existence: Every derived neigh- 
borhood of P in M is a regular neighborhood. 
Uniqueness: Any two regular neighborhoods 
of P in M are PL homeomorphic by a map- 
ping keeping P pointwise fixed [2, 1939]. On 
one hand, this theorem is regarded as a com- 
binatorial counterpart to the ‘tubular neigh- 
borhood theorem of differential topology. For 
a topological manifold X with boundary 0X, 
there are a neighborhood U of 0X in X anda 
homeomorphism h:0X x [0,1] —U, called a 
collar of 0X in X, such that h(x,0) 2 x (xeoX). 
In particular, for a combinatorial manifold 
M — X wecan take U as a derived neighbor- 
hood of 6M in M and has a PL homeomor- 
phism, called a PL collar of ôM in M. (For 
more about normal bundle theory — 147 
Fiber Bundles Q). On the other hand, it is 
also regarded as a combinatorial deforma- 
tion theorem of a handlebody decomposition 
H«(L) of a derived neighborhood St, III, 

Two polyhedra P and Q are said to be 
simple homotopy equivalent if Q is obtained 
from P by a finite sequence of collapsings and 
expansions. A composite mapping f: PQ of 
the inclusion mappings and deformation re- 
tractions associated naturally to the expan- 
sions and the collapsings from P to Q is called 
a simple homotopy equivalence. Moreover, if P 
and Q are subpolyhedra of a combinatorial 
manifold M and those collapsings and expan- 
sions take place in M, then their regular neigh- 
borhoods are PL homeomorphic [2, 1940]. 

(5) A simply homotopy equivalence is clear- 
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ly a homotopy equivalence. Conversely, we 
have the simple homotopy theorem: For the 
homotopy class of a homotopy equivalence 
f: P—Q between compact connected poly- 
hedra P and Q, there is a well-defined element 
W(f), called the Whitehead torsion of f, in the 
*Whitehead group Wh(G) of the fundamental 
group G— z,(P) such that f is homotopic to a 
simple homotopy equivalence if and only if 
W(f)-0, ([2, 1950]; — also J. W. Milnor 
[20]). 

Let W be a compact combinatorial mani- 
fold of dimension n+ 1. If 9W consists of two 
connected components M and N such that 
the inclusion mappings M c. W and N c. W are 
simple homotopy equivalences, then (W; M, N) 
is called an s-cobordism of dimension n+ 1. 
The following theorem of D. Barden, B. Mazur, 
and J. Stallings is the nonsimply connected 
PL version of Smale's th-cobordism theorem. 

(6) s-Cobordism theorem. For nz 5, an s- 
cobordism (W; M, N) of dimension n+ 1 1s PL 
homeomorphic with (M x [0,1]; M x0, M x 1) 
(B. Mazur [21]; M. Kervaire [22]; C. Weber 
[23]; J. F. P. Hudson [8]). 

(7) Topological invariance of simple homo- 
topy types. Two homeomorphic compact 
polyhedra are simple homotopy cquivalent 
(T. A. Chapman [6]). 

The famous Poincaré conjecture that a sim- 
ply connected closed manifold of dimension 3 is 
homeomorphic to a 3-sphere is still unsolved 
despite much effort by many mathematicians. 
This conjecture is generalized for dimension n 
as follows. 

(8) Generalized Poincaré conjecture in dimen- 
sion n. Any homotopy n-sphere 2” is homeo- 
morphic to an n-sphere, where a homotopy 
n-sphere is defined to be an n-dimensional 
topological manifold homotopy equivalent 
to an n-sphere. 

This has been affirmatively answered for 
nz 5. The first proof was given by S. Smale 
(— 114 Differential Topology F, K). From the 
viewpoints of collapsing and general position 
(— Section D), J. Stallings [3, 1960] and E. C. 
Zeeman [11,198—204] proved the following. 

(9) Engulfing lemma. For a compact poly- 
hedron P of dimension m in the interior M — 
M —0M of an n-dimensional combinatorial 
manifold M, there is a PL n-ball in M contain- 
ing P in its interior De, engulfing P), provided 
that M is m-connected De, 1,(M) —O (i & m)) 
and n—m 23. (For the topological version — 
M. H. A. Newman [24].) If a combinatorial 
manifold M =|K| is a homotopy n-sphere 
(n> 5), then by the engulfing lemma, there are 
PL n-balls B, and B, engulfing H,.(K >) 
and Hy-(K$5,) so that B, U B, = M. By the 
generalized Schoenflies theorem (— Section 
G), B, — B, = M — B, is homeomorphic to 
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an n-ball. It follows that by the Alexander 
trick (— Section D) M ts homeomorphic to an 
n-sphere. Since CH, holds for S" (nz 5), the 
homotopy PL n-sphere M is actually PL 
homeomorphic with S". 


D. Embeddings of Combinatorial Manifolds 


An injective PL mapping f: P—Q between 
polyhedra is called a PL embedding if f(P) is a 
subpolyhedron of Q (— Section A). Namely, 

a PL embedding f: P— Q is a PL mapping 
which is induced from a simplictal injection 

f: KL, called a division of f, for some divi- 
sions K and L of P and Q, respectively. In the 
following we explain PL embeddings from an 
m-dimensional combinatorial manifold M into 
an n-dimensional combinatorial manifold N. 
For a PL embedding f: M>N, c=n—mis 
called the codimension of f. If f ! (ON) 2 0M, 
f 1s said to be proper. Since an injective PL 
mapping f: MN is a PL embedding if and 
only if it is proper as a mapping between topo- 
logical spaces M and N (Le the preimages of 
compact sets are compact), this term "proper" 
for PL embeddings should not be confused 
with the term "proper for mappings. Two PL 
embeddings f: MN and f/': M' o N' are said 
to be (PL) equivalent if there is a PL homeo- 
morphism h: N 5 N', called an equivalence 
from f to f’, such that ho f(M)=f'(M’). When 
M = M', f and f' are said to be (PL) isomor- 
phic if there is an equivalence h: NN’ from 

f to f' such that ho f= f'. A submanifold of 
N 1s defined as a subpolyhedron of N which 

is a combinatorial manifold. Thus the equiv- 
alence class of a PL embedding f: M >N is 
nothing but the PL homeomorphism class of a 
pair (N, f(M)) of N and its submanifold f(M ). 
For a submanifold of N, the same terms as 
those for the inclusion mapping are used. For 
every division f: KL of a (proper) PL em- 
bedding f: MN and for xe M we have a 
(proper) PL embedding f|St,(x) from a PL m- 
ball St,(x) into a PL n-ball St, ( f(x)) which 
represents the *germ of f at xe M. A PL em- 
bedding of a PL m-sphere into a PL n-sphere 
is called a PL (n, m)-knot or (n, m)-sphere pair 
and a proper PL embedding of a PL m-ball 
into a PL n-ball is called a PL (n, m)-ball knot 
or (n, m)-ball pair. A PL (n, m)-knot (a PL 

(n, m)-ball knot) is said to be trivial or un- 
knotted if it is equivalent to the inclusion 
mapping 01"*! x Oc.0I"*! (I^ c. I"), where I= 

[ —1, 1]. Then we have the following state- 
ments (1)-(5). 

(1) Zeeman's unknotting theorem. Every PL 
(n, m)-knot and every PL (n, m)-ball knot are 
unknotted, provided that n — mz 3. 

In general, an injection g: X > Y between 
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topological manifolds is said to be locally flat 
if for each point xe X, there is a neighborhood 
U of x in X so that the local germ of f at x is 
equivalent to the inclusion mapping U x Oc U 
x I°. Then f is necessarily proper, i.e., f ! (OY) 
=0X.A PL embedding f: M—N is said to be 
locally flat if it is locally flat in the PL cate- 
gory. Namely, f|St,(x) is equivalent to I" x 
Oc. I". Conversely, Zeeman’s unknotting theo- 
rem implies the following. 

(2) Every proper PL embedding of codimen- 
sion Z3 is locally flat. 

Since every locally flat PL embedding f: M 
>N admits a normal tblock bundle, the classi- 
fication of F: M >U up to isomorphism is 
reduced to the classification of block bundles 
(— 147 Fiber Bundles Q), where U is a derived 
neighborhood of f(M) in N. (For the general 
classification of locally flat PL embeddings by 
surgical methods — C. T. C. Wall [25, Corol- 
lary 11.3.1]). 

In the codimension 2 case, there exist non- 
trivial knots (the classical knots) (— 235 Knot 
Theory) and there is a proper PL submanifold 
which is not locally flat. For example, a com- 
plex projective curve V defined by x"+ y" !z 
— 0 can be regarded as a PL 2-sphere in the 
complex projective plane CP?, and for the 
inclusion mapping i: V c. CP?, at 05 (0:0: 1), 
i| St, (0) is PL equivalent to a cone of a torus 
knot k:$! c. S? of type (n, n— 1). For further 
results in higher-dimensional knot theory (— 
235 Knot Theory) we mention the works of M. 
Kato and Y. Matsumoto [26] and S. Capell 
and J. Shaneson [27]. 

Here we consider only proper PL embed- 
dings of a fixed m-dimensional combinato- 
rial manifold M into a fixed connected n- 
dimensional combinatorial manifold N, so 
that a codimension 0 PL embedding is a PL 
homeomorphism. Two PL embeddings fj, 

TI MAN are said to be pseudoisotopic if there 
is a proper PL embedding F: M x [0,1] N 

x [0, 1], called a pseudoisotopy from f, to 

fi, such that F(x,0) 2 (f(x), 0) and F(x, 1)— 
(fix), 1) (x € M). Moreover, if F satisfies the 
level-preserving condition F(M x (t]) c N x {t} 
(0x t« 1), then f, and f, are said to be isotopic, 
and F or a homotopy { f,} defined by F(x,t) 
(f(x), t) (o, t)e M x [0, 1]) is called an isotopy 
from f, to fi. When M =N, a PL homeomor- 
phism which is isotopic to the identity, or the 
isotopy of the identity itself, is called an am- 
bient isotopy of N. Two PL embeddings from 
M to N are said to be ambient isotopic if they 
are isomorphic by an ambient isotopy of N. 

Since J* (I 2 [ —1,1]) is a k-dimensional 
tconvex cell and is a cone 0 » 0I* of Ofk from 
the center 0, each point x of /*— {0} can be 
written uniquely as x=t-u for 0<t<1 and 
uedl*. Thus a PL embedding f:6I" 0I" can 
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be extended to a PL embedding F:1"— I", 
called a cone extension or simply a cone of f, 
by setting F(0)=0 and F(t- u)=t: f(u) for 

t: ue I" — (0). This method of cone extension is 
often referred to as Alexander's trick. By using 
Alexander's trick and the uniqueness of regu- 
lar neighborhoods, one can show without 
much difficulty that (i) the isotopy classes of 
PL homeomorphisms of PL spheres or PL 
balls onto themselves are classified by their 
tdegrees (= +1)(V. K. A. M. Gugenheim, 
1953), and (ii) every PL (n, m)-knot or PL 

(n, m)-ball knot is equivalent to the standard 
one 0I"*! x 0c.0I"*! or I" x Oc!" up to iso- 
topy, provided that n —mz 1. This is why we 
define the notion of ambient isotopy in a dif- 
ferent way from the differentiable category, 
where the smooth isotopy is always covered by 
the ambient isotopy. However, for codimen- 
sion Z3, because of Zeeman's unknotting 
theorem, the following holds. 

(3) The pseudoisotopy class, the isotopy 
class, and the ambient isotopy class of a PL 
embedding of codimension z 3 are the same 
(J. F. P. Hudson and E. C. Zeeman, 1964). 

In order to construct a PL embedding from 
a continuous mapping within its homotopy 
class, the notion of general position is useful. 
A PL mapping o: MN is said to be in gen- 
eral position if dim !(y) D (ye N) and 
dim Stolz 2m — n, where S(q) is the closure of 
(xeM|o !(p(x)) s (x11 in M (J. F. P. Hudson 
[8]). 

(4) General position theorem. Every con- 
tinuous mapping y: M >N can be approxi- 
mated by a PL mapping o: MN which is in 
general position. 

The following theorem can be proved by 
sharpening the general position theorem and 
the engulfing lemma. 

(5) Irwin's embedding theorem. Let y: M—N 
be a continuous mapping from a compact 
combinatorial manifold of dimension m into a 
combinatorial manifold of dimension n such 
that v |OM is a PL embedding from ôM into 
ON. Assume that (i) M is d-connected, (ii) N is 
(d+ 1)-connected, and (iil) n —m 23, where d= 
2m — n. Then y: MN is homotopic to a 
proper PL embedding f: M —>N relative to 
0M. Moreover, if M and N are closed and if 
(1*) M is (d+ 1)-connected and (i1*) N is (d+ 2)- 
connected, then two homotopic PL embed- 
dings from M into N are ambient isotopic (M. 
C. Irwin, 1965). 


E. 3-Manifolds 


There has been a great deal of research done 
on 3-manifolds, and we have, among others, 
the following results (1)—(4). 
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(1) Sphere theorem. Let M be an orientable 
3-manifold with z,(M) #0. Then there exists a 
PL embedding f of a 2-sphere into M such 
that f is not homotopic to 0 in M (C. Papa- 
kyriakopoulos [28 ]). 

(2) Dehn's lemma. Let D be a 2-cell with 
self-intersections in a 3-manifold M, with its 
boundary a simple polygon C. If some neigh- 
borhood U of C in D has no singularities, then 
there exists a 2-cell in M without singularities 
whose boundary is C. In 1910, M. Dehn as- 
serted this lemma, but his proof was incom- 
plete. In 1957, Papakyriakopoulos and T. 
Homma proved the lemma independently 
[28, 29]. 

(3) Loop theorem. Let N be the boundary 
of a 3-manifold M and U be an open set 
in a component of N. If there is a closed 
curve in U which is homotopic to 0 in M 
but not in N, then there exists a simple closed 
curve in U homotopic to 0 in M but not 
in N (Papakyriakopoulos [30], J. Stallings 
[31)). | 

(4) Unique decomposition theorem for a 3- 
manifold (J. W. Milnor [32], H. Kneser). We 
assume that all manifolds are connected, 
oriented, and triangulated 3-manifolds without 
boundaries and that all homeomorphisms are 
piecewise linear. Two manifolds M, M' are 
said to be isomorphic (M = M) if there exists 
an orientation-preserving homeomorphism of 
M onto M'. Removing an open 3-cell from 
each of two 3-manifolds M, M' and identifying 
the boundaries of these removed cells, we 
obtain a 3-manifold M # M’, called the con- 
nected sum of M and M’. A manifold that is 
not isomorphic to the 3-sphere S? is called 
nontrivial. A nontrivial manifold P is called 
prime if P cannot be decomposed as P = 
M, # M, with M, and M, both nontrivial. 

A manifold M is called irreducible if every 
2-sphere in M bounds a 3-cell. Then from 
the sphere theorem the following results can 
be deduced: If a compact 3-manifold M is 
nontrivial, then M is isomorphic to a con- 
nected sum P, # P; # ... # P, of prime mani- 
folds P; (i— 1,2, ..., k), where Pi, P,, ..., P, 
are determined uniquely up to order. Every 
irreducible 3-manifold M has 7,(M)-0. Con- 
versely, if a 3-manifold M has z,(M) —0, then 
M is irreducible, provided that the Poincaré 
conjecture is correct. (For further results 

on 3-manifolds — F. Waldhausen [10] and 
J. Hempel [12].) 

Irreducible sufficiently large 3-manifolds 
[10] can further be decomposed into Seifert 
fibered spaces and “simple manifolds” which 
contain no essential tori (W. Jaco and P. B. 
Shalen, K. Johannson). Existence of hyperbolic 
structures on “simple manifolds” has been 
extensively studied by W. Thurston. 
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F. Wild Spaces 


Nonclosed 3-manifolds behave very differently 
from closed 3-manifolds. For example, there 
exists an open 3-manifold U that has the same 
thomotopy type as an open 3-cell but is not 
homeomorphic to it (M. H. A. Newman and J. 
H. C. Whitehead, 1937). The construction of 
this and similar examples generally involves 
infinite processes. Such examples, which do 
not exhibit the properties we expect, are com- 
monly termed pathological (or wild) spaces. 
For the manifold U above, the product U x R! 
is found to be homeomorphic to the Euclidean 
4-space R*. R. Moore proved that the quotient 
space of the Euclidean 2-space R? by any 
upper semicontinuous decomposition consist- 
ing of continua not separating R? is homeo- 
morphic to R?. On the other hand, R. H. Bing 
(1959) gave an example of a similar upper semi- 
continuous decomposition of R? for which the 
quotient space B is not homeomorphic to 

R? and is not even a manifold. This quotient 
space has the property B x R! zx R*. Bing 
(1957) also proved that at most countably 
many copies of a wild closed surface can 
simultaneously be embedded in R?. On the 
other hand, Stallings (1960) proved that there 
exists a wild disk of which a set of copies with 
the power of the continuum can simulta- 
neously be embedded in R? [11]. 

The study of the quotient space of n- 
dimensional manifolds by cell-like decompo- 
sitions has been fully generalized by Edwards 
and Cannon [7] for nz 5 to prove the double 
suspension theorem of homology spheres (— 
Section C (3)). 


G. The Schoenflies Theorem 


In 1906, A. M. Schoenflies proved that every 
tsimple closed curve in the plane is the bound- 
ary of a 2-cell. This result is sharper than 

the tJordan curve theorem and is called the 
Schoenflies theorem. However there was a gap 
in the proof, and the complete proof was given 
by L. E. J. Brouwer in 1910. 

The following higher-dimensional analog of 
the above theorem is called the Schoenflies 
problem: Is every (n — 1)-sphere embedded in 
the n-dimensional Euclidean space R” (n> 3) 
the boundary of an embedded n-disk? 

For the case where the embedding is topo- 
logical, L. Antoine constructed a counter- 
example, called Antoine's necklace, for n —3 in 
1921 and solved this problem negatively. Also, 
J. W. Alexander constructed another counter- 
example, called Alexander's horned sphere, in 
1924 (Fig. 1). A similar example was given by 
R. H. Fox and E. Artin in 1948 by making use 


65 Ref. 
Combinatorial Manifolds 


of knot theory (Fig. 2). The study of wild em- 
beddings began with these examples, and 
many interesting results were obtained (Bing). 

For the case where the embedding is PL or 
smooth, the Schoenflies problem has been 
solved affirmatively for n — 3 (Alexander). 

M. Brown (1960) and B. Mazur (1959) 
proved the following: Let h: S"! x [—1, 1] S" 
be a topological embedding; then S" — A(S" ^! x 
{0}) consists of connected open sets D, and 
D of S", and the closures of D, and D are 
both homeomorphic to the n-disk. This result 
solves the Schoenflies problem affirmatively 
for nz 5 when the embedding is PL and lo- 
cally flat or smooth. 





Fig. 2 
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A. General Review of Combinatorics 


Combinatorics is concerned mainly with prob- 
lems of discrete sets, such as the enumeration 
of subsets satisfying certain conditions, or their 
actual construction and the selection of an 
optimal subset with respect to a suitable crite- 
rion. Generally speaking, both the theoretical 
analysis and the construction of discrete sets 
are much more difficult than those problems in 
analysis concerning infinite sets with suitable 
topologies. 

The main emphasis and the name of this 
field have changed from time to time and from 
person to person. Other names such as com- 
binatorial analysis, combinatorial theory, or 
combinatorial mathematics are commonly used 
and mean almost the same thing. Recently, 
the expression “discrete mathematics” has 
also been used to describe the same field. In 
many cases a problem originally conceived as 
number-theoretic, algebraic, analytic, or geo- 
metric has eventually turned out to be com- 
binatorial (— 102 Design of Experiments, 241 
Latin Squares). 
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tGraph theory, originally a combinatorial 
study of 1-dimensional topological tcomplexes, 
has developed into an important branch of 
mathematics. 

In this article we discuss various topics not 
yet considered to be established branches or 
organized fields of mathematics. The topics 
treated under the title “combinatorics” are 
at present not unified, and consist of a mix- 
ture of isolated techniques. However, recent 
progress with electronic computers is play- 
ing an important role in the solution of 
various combinatorial problems arising from 
large systems, and the study of combinatorics 
is now quite active both in theoretical develop- 
ment and application. We now have special 
journals in this field, such as the Journal of 
Combinatorial Theory (Academic Press), Com- 
binatorica (Elsevier), the European Journal 
of Combinatorics (Academic Press), Discrete 
Mathematics (North-Holland), and Discrete 
Applied Mathematics (North-Holland). 


B. Three Fundamental Principles 


The most fundamental among the various 
principles in combinatorics are the following 
three principles for enumeration problems. Let 
Q be a finite set and let |A| denote the number 
of elements in a subset A of Q. 

(1) Rule of sums: If A N B= Ø, then |A + B| = 
|A| + IBI. 

(2) Rule of products: | A x B|=|A| x |B], where 
A x B is the !direct product of the two sets A 
and B. 

(3) Principle of inclusion and exclusion: 


IA UU ASU EA 
-5 4l- X14; Al + » | A; A;N A] 


i<j i<j<k 
EME e EA Cos E CE 


These principles can be generalized as state- 
ments concerning various measures. 


C. Mobius Inversion Formula 


Let P be an ordered set whose arbitrary inter- 
val is a finite set. Let [P x P] = ((x, y)eP x 
P|x € yj be the set of intervals of P. A func- 
tion satisfying the following two conditions is 
determined uniquely and is called the Móbius 
function over P: 
(1) u(x, x) 2 1 for all xe P; 
(il) sen U(x, z) ^0 for all x, ye P with x< y 
and x x y. 

When K is a field, for a mapping f:[P x P] 
>K, we can define another mapping g:[P x 
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P]>K by 
g(x y= Y f(x,z). 


ze[x, y] 


Then we have the Möbius inversion formula 


foy) , g, zul, y). 

ze[x. y] 
This formula implies, as its special cases, many 
important formulas, such as the *Móbius in- 
version formula in number theory, the prin- 
ciple of inclusion and exclusion, and the in- 
verse relation of difference and summation. 


D. Some Special Sequences 


In the enumeration problem, many important 
sequences f{a,} are given by the !generating 
function f(x)= a,x". Typical examples are as 
follows: (1) *Binomial coefficients: (1 +x)" = 
È) x". (2) Multinomial coefficients: (x, +... 
Tx" X, n) X1 Xp”. (3) Multiple combi- 


nation a,=,,H,=("*""'):(1 x) = Y, (, H,)x". 


(4) Bell numbers: exp(e* — 1) 2 X B, x"/n! (5) 
The Stirling number of the second kind S% is 
the total number of bijections from N, to N,, 
divided by m!, where N,— (1,2, ..., nj. This is 
equivalent to the total number of partitions of 
N, into m nonempty blocks: (e* — 1)"/m! — 
Eom WE et 8 

In general, when the generating function 
f(x) is an analytic function of one or several 
variables as above, we can obtain recurrence 
formulas for the sequence from the functional 
equations or the differential equations which 
f(x) satisfies; or we can get asymptotic esti- 
mates by using the tsaddle point method. 


E. Pólya's Enumeration Theorem 


Let U be a finite set, V be a (finite or infinite) 
set, G be a transformation group operating on 
U, and w be a function assigning to each ele- 
ment of V a “weight.” A weight is an element 
of a ring and is usually defined by a suitable 
generating function. The weight of a function 
f:U-V is defined by w(f)=T1,-y w(f(u)). 
Two functions f, and f, are considered iden- 
tical ( f, = f;) if there exists a t€ G such that 

fi hon. Put F= VY (E), the set of all equiv- 
alence classes of the relation =. wi f ) is well- 
defined for f € F, because f, = fa implies w( fi) 
=w( f2). Then we have the equality E zep w( p 
= PX, ey W(0), Doey WO), ae WO). 
Here, Pc(y,, y5, ..., Ym) is the cycle index of G, 
which is given by n Je yh? y0 ,, , vii), 
where n=|G|, m=|U|, and j,(x) is the number 
of cyclic permutations of length k contained in 
x. This is called Pólya's enumeration theorem. 
When G=S,, (the tsymmetric group), Ps (y;, 
oes Yn) = A 20 zia -m(4412545! ... m^74,1) ! yt 
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... yan. When G = Ae (the talternating group), 
P4 Melen Za 0a m £3 7even2(44 22 A;! 
mh!) | yp... Kan, When G=C,, (the tey- 
clic group), Pc, (Yis -s Yn) 9 m^! Erm Plk) y, 
where ø(k) is the tEuler function, i.e., the num- 
ber of integers less than k and coprime to k. 
This theorem is widely applicable to various 
problems with suitable choices of weights. 
Several generalizations have also been studied 


(see, e.g., [3]). 


F. The Notion of Polymatroids 


The theory of matroids was originally intro- 
duced as an abstraction of the notion of flinear 
dependence in a vector space or of some prop- 
erties of tgraphs or of the notion of 'alge- 
braic dependence in the theory of fields (see, 
e.g., [4, 5]). But recently, as many important 
applications were discovered, it has become 
an important branch of combinatorics (see 
[6-8]). 

Let u bé a function from a lattice L to a 
totally ordered module R. x is called sub- 
modular if (x) - u(y) 2 u(x v y) + u(x ^ y) for 
every pair of elements x, ye L. The set of ae L 
such that u(a) — min,., u(x) constitutes a sub- 
lattice in L. Let L be the family of all subsets 
of a finite set E. The sublattices K in L —2* 
and the pseudo-orders over E correspond in a 
one-to-one manner as follows (here pseudo- 
order means a binary relation taken as an 
*ordering without assuming the *antisym- 
metric law): (1) The class of a partition P of E 
corresponds to the difference of two subsets 
which are the immediate predecessor and 
immediate successor in K, or the complement 
with respect to E of the maximum element in 
K, or the complement of the minimum ele- 
ment. (ii) The complement of the maximum 
element in K is maximum in the order of P, 
and the minimum element in K is minimum in 
the order of P. (iii) Two classes £,, č, in P, 
given as the difference of two elements in K, 
satisfy the relation č, 2 č, with respect to the 
order of P if and only if all elements of K such 
that x 24, in E satisfy x >€,. 

Now let E be a finite set, L —2F, and let R be 
the real number field with the usual addition 
and order relations. If a submodular function 
uv 2^ —R satisfies the following conditions, 
then P — (E, n) is called a polymatroid over E, 
and u is called the rank function over P. (i) 
u(@)=0, where @ is the empty set. (ii) x c y 
(c E) implies u(x) € u(y). A function u: ER is 
automatically extended to u:275 R by u(x)= 
Loex ule) (x c E), which is also denoted by 
the same symbol u. If a function u satisfies 
Ox u(x) € p(x) for every xe E, then u is called 
an independent vector over P. An independent 
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vector u with u(E) — u(E) is called a base of P. 
A function u: EO R is identified with an ele- 
ment in R”, where n — |E|. Then a set of inde- 
pendent vectors in P forms a tconvex poly- 
hedron in R”. If we denote the polyhedron by 
P, then such a P is characterized by the follow- 
ing two conditions: (P1) If uc P then each v 
satisfying O0 x v(e) x u(e) for every ec E also 
belongs to P. (P2) If two u, ve P satisfy u(E) < 
v(E), then there exists we P, wu, satisfying 
u(e) € w(e) « max[u(e), v(e)] for every ee E. If 
there are two polymatroids P, — (E, ul and 
P, — (E, u) over the same finite set E, then we 
have max (u(E)|ueP, (0 P,j 2 min( gu, (x) - 

A4 (E— x)|x c E}. According to tinformation 
theory, an example of a polymatroid (E, u) ts 
given by the following: u(x), xe E, is the simul- 
taneous tentropy of a subset x of the infor- 
mation source E with correlation. 


G. Matroids 


A polymatroid P — (E, u) is called a matroid if 
the values of u are always integers and u(x) € 
|x|=the cardinality of the set x. A set x CE is 
called an independent set if (x) =|x|, and 

a dependent set otherwise. An independent 
set with maximal cardinality is called a base. 
A minimal dependent set is called a circuit. 
The set y with max{|y|| y >x, u(x) 7 u(y)] is 
uniquely determined by x, which is called 

the closure of x and denoted by cl x. Several 
axioms stated in terms of these notions exist, 
and can be used to characterize matroids. 
Some of them are: 

(1) The axiom for the family of independent 
sets £: (i) Zef; (it) xe.£ and yc x implies 
yeI; (in) If x, ye.7 and |x| «|y|, then there 
exists an ec E in y— x such that xU fetes. 

(2) The axiom for the family of bases 2: (1) Z 
is not empty; (ii) If b, b'e 2 and b z b, then for 
every eeb — there exists an e' such that ee 
b'—b and (b—{e})U fete. 

(3) The axiom for the family of circuits @: 
(i) @ € €; (it) For every pair x, ye€, y is nota 
proper subset of x; (111) If x, ye € and x Z y, 
then for every ee xy there exists a ze € such 
that z c xU y (ej. 

(4) The axiom for the closure function cl: 
2F 52F: (i) xc el x — cl(cl x) for every xe E; (ii) 
xc y implies cl x c cl y; (iii) If eecl(x U {e'})— 
cl x, then e'ecl(xU ej) — cl x. 

The following are typical examples of ma- 
troids: (1) Let E be the family of finite set of 
vectors in a vector space V and .7 be the 
family of sets of linearly independent vectors. 
(2) Let E be the set of edges of a tgraph and 
€ be the set of edges constituting a fundamen- 
tal circuit. (3) Let E be a finite set and S be a 
family of subsets of E. x={e,,...,e,} c E is 
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defined to be independent if there exist differ- 
ent elements s,, ...,5, in S such that ees, 
for every i— l, ..., r. 

Let a weight function w: E R be defined 
for every element ec E in a matroid (E, i). To 
obtain a base b of the matroid with minimal 
weight w(b) 2 Ze w(e), we can apply the fol- 
lowing procedure, called the greedy algorithm: 
(1) Arrange the elements e,, ..., e, of E in such 
a manner that wiele... « w(e,). (2) Put 59 — 
Ø, and i:=0. (3) If x 2 b? U fe; e.7, then 
p6*U:— x; otherwise, b+) := b. (4) Repeat 
process (3) for i0, 1,...,n— 1. Then the final 
b™ is the base of minimal weight. 


H. Operations on Matroids 


For a matroid M —(E, u) with the family 4 of 
its base, * = {E — x|xe 44j also satisfies the 
axioms of the base. The matroid M* =(E, u*) 
with Z/* as its base is called the dual of M. 
Here we define u*(x) 2 |x| - u(E — x) — (E). 
When F c E, M|F =(F, uy) is called the reduc- 
tion of M —(E, u) to F, where up is the restric- 
tion of u on 2°. A matroid M x F=(M*| F)* is 
called the contraction of M to F. À matroid 
given by the form (M | F) x F' is called a minor 
of M. From two matroids M, =(E, u,) and M, 
—(E, u) over the same set E, we can construct 
a matroid M = M, v M, =(E, u) whose rank 
function u(x) is defined by the minimum of 
Un) ua) * Ix — y |y & xj. This is called 
the union of M, and M,. 

Let M —(E, u) be a matroid and a be a posi- 
tive constant. The function fi(x) = u(x) —«|x| is 
submodular. The set L of x which minimizes 
fi(x) is a sublattice of 2*. The partition of E 
and the order relation of its classes defined by 
L are called the principal partition of M with 
respect to the parameter a. When the value of 
the parameter « is not specified, æ is taken to 
be 2. 

As was seen in example (1) at the end of the 
preceding section, the family of vectors in 
a vector space V over a field K is a matroid 
M. A matroid isomorphic to such an M is 
called linearly representable over K. If K — 
GF(p), the finite field with p elements, then it 
is called a p-ary matroid. A matroid linearly 
representable over any field is called regular. If 
[G] is a matroid isomorphic to the matroid 
defined by the family of edges of a graph as 
shown in example (2), it is called graphic. If the 
dual M* is graphic, M is called cographic. 
Graphic and cographic matroids are always 
regular. There exist matroids not linearly 
representable over a field K, or linearly repre- 
sentable over a particular field K but not 
regular, or regular but neither graphic nor 
cographic. If [G] is a matroid isomorphic to 
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the matroid of the type in example (3), it is 
called transversal. Such a matroid is linearly 
representable when the cardinality of the field 
K is sufficiently large. 
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A. General Remarks 


A ring R (— 368 Rings) whose multiplication 
is commutative is called a commutative ring. 

Throughout this article, we mean by a ring a 
commutative ring with unity element. 


B. Ideals 


Since our rings are commutative, we need not 
distinguish right or left ideals from *ideals. A 
subset a of a ring R is an ideal of R if and only 
if a is an R-submodule of R (— 277 Modules), 
if and only if a is the *kernel of a ring homo- 
morphism from R into some ring (except for 
the case a — R). Given an ideal a of a ring 

R, the set of elements which are tnilpotent 
modulo a, i.e., (xe R|x"ea (3n)j, is called the 
radical of a and is often denoted by Ja. The 
radical of the zero ideal is called the radical 
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of R, or more precisely, the nilradical of R 
(— 368 Rings H). 

For a subset S of a ring R, the smallest ideal 
a containing S is called the ideal generated by 
S, and S is called a basis for a; if S is a finite 
set, then S is called a finite basis. When a, 
(AeA) are ideals of R, the sum of these ideals, 
denoted by È a,, is defined to be the ideal 
generated by the union of the ideals a, (4€ A). 
If A is a finite set, say {1,2,...,n}, then the sum 
is denoted also by a, +... + a,, and in this case, 
the sum is also called a finite sum. Note that 
a,+...+4,={a,+...+4a,|a;€a;}. The product 
a,...a, of a finite number of ideals a,,...,a, is 
defined to be the ideal generated by the set 
{a,...a,|a;€a;}. The intersection of an arbi- 
trary number of ideals is an ideal. When a 
is an ideal of a ring R and S is a subset of R, 
the quotient a: S is defined to be the ideal 
[xe R|xS ca). If a, b, c, b; (AEA) are ideals, we 
have (a:b):c —a:bc, a: X b; = (^; (a:b;). 


C. Prime Ideals 


An ideal p of a ring R is called a prime ideal if 
R/p is an tintegral domain; p is a prime ideal if 
and only if p z R and also abep (a, be R) im- 
plies aep or be p (some literature includes the 
ring R itself in the set of prime ideals). Let S be 
a multiplicatively closed subset of a ring R, i.e., 
a nonempty tsubsemigroup of R with respect 
to multiplication. A maximal member among 
the set of ideals which do not meet S is called a 
maximal ideal with respect to S. Such a mem- 
ber is necessarily a prime ideal; when S — {1}, it 
is called a maximal ideal of R. An ideal m is a 
maximal ideal of R if and only if R/m is a field. 


D. Jacobson Radical 


The intersection J of all maximal ideals of a 
ring R is called the Jacobson radical of R; in 
some cases, this intersection J is called the 
radical of R (— 368 Rings H). Let N be an R- 
submodule of a finite R-module M. If MJ + 
N =M, then M =N (Krull-Azumaya lemma 
or Nakayama lemma). 


E. Krull Dimension 


For a prime ideal p of a ring R, the maximum 
of the lengths n of tdescending chains of prime 
ideals p=po 2 Pı F ... ZP, which begin with p 
(or oo if the maximum does not exist) is called 
the height or rank of the prime ideal p. For an 
ideal a, the minimum of the heights of prime 
ideals containing a is called the height of the 
ideal a. The maximum of heights of prime 
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ideals of R is called the Krull dimension (or 
altitude) of the ring R. For an ideal a of R, the 
Krull dimension of R/a is called the Krull 
dimension (or depth) of the ideal a. (The mean- 
ings of the terms rank, dimension, and depth 
now depend on the writings in which they 

are found; standardization of definition of 
these terms is becoming more and more of a 
necessity.) 


F. Primary Ideals 


For an ideal q of R, if q#R and every zero divi- 
sor of R/q is nilpotent, then q is called a pri- 
mary ideal. (If R is included in the set of prime 
ideals, then R is regarded as a primary ideal.) 
In this case, p = a is a prime ideal, and q is 
then said to belong to p or to be a p-primary 
ideal. The intersection of a finite number of pri- 
mary ideals belonging to the same prime ideal p 
is p-primary. Assume that an ideal a is ex- 
pressed as the intersection a—q, (1...(13, ofa 
finite number of primary ideals q,,...,q,,. If 
this intersection is irredundant, that is, if none 
of the q; is superfluous in the expression of a, 
then the set of Ja; (i=1,...,n) is uniquely 
determined by a. The SE a; are called prime 
divisors (or associated prime ideals) of the ideal 
a; a minimal one among these is called a mini- 
mal (or isolated) prime divisor of the ideal a, 
and those which are not minimal are called 
embedded prime divisors of the ideal a. A max- 
imal one among the prime divisors of a is 
called a maximal prime divisor of the ideal a. 
(For definitions of these concepts in the case 
where a is an arbitrary ideal, see [4].) If, in the 
expression of a as above, n is the smallest 
occurring in similar expressions, then the 
expression is called the shortest representation 
of the ideal a by primary ideals. In this case, 
each q; is called a primary component of the 
ideal a; if ./q; is isolated, then q; is called an 
isolated primary component of the ideal a, and 
otherwise q; is called an embedded primary 
component of the ideal a. Isolated primary 
components are uniquely determined by a, but 
embedded primary components are not. 


G. Rings of Quotients 


Let R be a ring. Then the set U of elements of 
R which are not zero divisors is multiplica- 
tively closed. In the set R x U = ((r,u)|re R, 
ue U}, we define a relation = by (r,u)= 
(r’,u')<>ru'=r'u. Then = is an equivalence 
relation, and the equivalence class of (r, u) is 
denoted by r/u. In the set Q of these r/u, addi- 
tion and multiplication are defined by r/u+ 
ru =(ru'+r'u)/uu’, (r/u)(r' /uw) 2 rr Zug. Then 
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Q becomes a ring, and r/1 can be identified 
with r. Thus Q is a ring containing R, gen- 
erated by R and inverses of elements of U. This 
property characterizes Q, which is called the 
ring of total quotients of the ring R. If R is an 
integral domain, then Q is a field, called the 
field of quotients of the integral domain R. Let 
S be a multiplicatively closed subset of R such 
that O¢S, and let n be (xe R|xs=0 (3se S)] 
and ¢ be the natural homomorphism R Hin. 
Then none of the elements of (S) is a zero 
divisor. The subring of the ring of total quo- 
tients of R/n, generated by R/n and inverses of 
elements of ~(S), is called the ring of quotients 
of the ring R with respect to S, and is denoted 
by Rs (R[S ! ] or RS ^ !). When M is an R- 
module, M &, Rs is called the module of quo- 
tients of the R-module M with respect to S. It 
is significant that Rg is * R-flat. There is a one- 
to-one correspondence between the set of pri- 
mary ideals q of R which do not meet S and 
the set of primary ideals Q of R, such that q 
corresponds to Q if and only if Q2 qR; (q= 
QN R). When p ts a prime ideal of R, the com- 
plement R — p is multiplicatively closed, and 
Rpg-, is called the local ring of p or the ring of 
quotients of the ring R with respect to the prime 
ideal p, and is denoted by R, (— 284 Noe- 
therian Rings C, D). A ring of quotients is also 
called a ring of fractions. 


H. Divisibility 


In a ring R, if a —bc (a, b, ce R), then we say 
that b is a divisor (or factor) of a, and that a is 
a multiple of b, or a is divisible by b. We denote 
this by blo This relation between a, be R is 
called divisibility relation in R. If, in this situ- 
ation, c has its inverse in R, we say that a is an 
associate of b. A factor b of a is called a proper 
factor if b is neither an associate of a nor tin- 
vertible. An element which has no proper fac- 
tor is called an irreducible element. A nonzero 
element which generates a prime ideal is called 
a prime element. 

If in an integral domain R every nonzero 
element is a product of prime elements (up to 
invertible factors), then we say that the unique 
factorization theorem holds in R, and that R is 
a unique factorization domain (or simply u.f.d.). 

Let A={a,,...,a,} bea set of nonzero ele- 
ments of a ring R. A common divisor of A is an 
element which is a factor of a;. A common 
multiple of A is defined similarly. The greatest 
common divisor (G.C.D.) of A is a common 
divisor which is a multiple of any common 
divisor; the least common multiple (L.C.M.) of 
A is a common multiple m which is a factor of 
any common multiple. Thus, the G.C.D. and 
L.C.M. exist if R is a u.f.d. 
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I. Integral Dependence 


Let R be a subring of a ring R" sharing a unity 
element with R". An element oe R" is said to 
be integral (or integrally dependent) over R if 
there are a natural number n and elements c; 
of R such that a" c a" '+...+c,=0. If every 
element of a subset S of R" is integral over R, 
we say that S is integral over R. (When R has 
no unity element, a similar definition is given 
under an additional condition that ce R*. An 
important special case is where R is an ideal. 
See D. G. Northcott and D. Rees, Proc. Cam- 
bridge Philos. Soc., 50 (1954); M. Nagata, Mem. 
Coll. Sci. Univ. Kyóto, 30 (1956).) The set R of 
elements of R" which are integral over R is a 
ring and is called the integral closure of R in 
R". If R=R, then R is said to be integrally 
closed in R”. If R is integrally closed in its ring 
of total quotients, we say that R is integrally 
closed. An integrally closed integral domain is 
called a normal ring. (In some literature, an 
integrally closed ring is called a normal ring.) 
An element ac R” is called almost integral over 
R if there is an element b of R such that b is 
not a zero divisor and a"be R for every natural 
number n. If an element a of the ring of total 
quotients of R is integral over R, then a is 
almost integral over R. R is said to be com- 
pletely integrally closed if its ring of total quo- 
tients contains no elements which are almost 
integral over R except the elements of R itself. 


J. Group Theorem 


Let Q be the ring of total quotients of a ring R. 
An R-submodule a of Q is called a fractional 
ideal of R if there is a non-zero-divisor c of R 
such that cac R. The product of fractional 
ideals 1s defined similarly as in the case of 
products of ideals. The inverse o of the 
fractional ideal a is defined to be [x eQ xa 
c R}. If a contains an element which is not a 
zero divisor, then a! is also a fractional ideal. 
When R is completely integrally closed, we 
define fractional ideals a and b to be equiva- 
lent if a ! 2b !. This gives rise to an equiva- 
lence relation between fractional ideals. The 
set of equivalence classes of fractional ideals 
which contain non-zero-divisors forms a 
group. This result is called the group theorem. 
An integral domain R is called a Krull ring if 
(1) for every prime ideal p of height 1, the ring 
R, is a tdiscrete valuation ring; (ii) R is the 
intersection of all the valuation rings R, and 
(ili) every nonzero element a of R is contained 
in only a finite number of prime ideals of 
height 1. In a Krull ring R, for an arbitrary 
nonzero fractional ideal a, there is a uniquely 
determined product of powers of prime ideals 
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of height 1 which is equivalent to a in the sense 
stated above (— 439 Valuations). 


K. Dedekind Domains and Principal Ideal 
Domains 


A ring R is called a Dedekind domain if (i) R 

is a *Noetherian integral domain, (ii) R is a 
normal ring, and (iii) the Krull dimension of R 
is 1. For an integral domain R which is not a 
field, R is a Dedekind domain if and only if the 
set of all nonzero fractional ideals is a group, 
if and only if every nonzero ideal of R is ex- 
pressed as the product of a finite number of 
prime ideals and such.expression is unique up 
to the order of prime factors. An important 
example of a Dedekind domain is the ring of 
all ‘algebraic integers, i.e., the tprincipal order 
of an falgebraic number field of finite degree. 
In general, if R is a Dedekind domain with 
field of quotients K and L is a finite algebraic 
extension of K, then the integral closure of R 
in L and any ring R' such that R c R'£ K are 
Dedekind domains. 

An ideal generated by an element is called 
a principal ideal; a fractional ideal generated 
by an element is called a principal fractional 
ideal (or simply principal ideal). In a Dedekind 
domain, the set P of nonzero principal frac- 
tional ideals is a subgroup of the group J of 
nonzero fractional ideals; J/P is called the ideal 
class group of R, and a member of it is called 
an ideal class. The *order of I/P is called the 
class number of R (— 14 Algebraic Number 
Fields). There are many Dedekind domains 
whose class numbers are infinite. 

À ring R is called a principal ideal ring if 
every ideal is principal; furthermore, if R is an 
integral domain, then R is called a principal 
ideal domain. A principal ideal ring is the 
direct sum of a finite number of rings of which 
each direct summand is either a principal ideal 
domain or a flocal ring whose maximal ideal is 
a principal nilpotent ideal. A principal ideal 
domain which is not a field is a Dedekind 
domain and a u.f.d. We consider, for an arbi- 
trary natural number n and a principal ideal 
ring R, the set M (n, R) of all n x n matrices 
over R. Given an element A of M (n, R), there 
exist elements X, Y in M (n, R) such that (i) 

X LY ^! are in Min, R), and (ii) denoting by 
bj the (i, j)-entry of XAY, we have b,, R2 b,;R 
>...2b,,R and bj «0 if ij. The nonzero 
members of the set (b, 1, b22, ..., bang are called 
the elementary divisors of the matrix A. Apply- 
ing this to a finite module M over the principal 
ideal ring R, we see that M is the direct sum of 
m, R, ...,m,R (mee M) such that, with a;— 

{xe R|m;x 0), we have a, ca, c ...ca, (2 2 


Abelian Groups B; 269 Matrices E). 
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L. Euclid Rings 


À ring R is called a Euclid ring if there is a 
map ọ of R — {0} into a twell-ordered set W 
(the set of natural numbers is a special case) 
satisfying the condition that if a, be R, a#0, 
then there are r, qe R such that b=aq +r and 
either r=0 or g(r) « o(a). 

Every Euclid ring is a principal ideal ring. 
Besides the ring Z of rational integers, there 
are several familiar examples of Euclid rings, 
such as Z[/ —1], ZE/ —2], Z[o] (o? = 
1, 0 z 1), and the polynomial ring of one 
variable over a field. 
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A. General Remarks 


Let X be a finite-dimensional linear space. 
Then a linear operator in X ts surjective 

if and only if it ts injective. If X is infinite- 
dimensional, this is no longer the case in 
general. For an operator of the form ] + K 
with an integral operator K of continuous 
kernel, I. Fredholm [1] developed the the- 

ory of determinants and retrieved the above 
equivalence (— 217 Integral Equations). Later, 
F. Riesz [3] simplified the proof and showed 
that the equivalence holds if K is a compact 
operator in a Banach space. I. Ts. Gokhberg 
and M. G. Krein [4] reformulated the result as 





68 C 
Compact and Nuclear Operators 


the stability of indices of operators. The con- 
cept of compact operators itself was introduced 
by D. Hilbert (who used the terminology, “com- 
pletely continuous operators"). The Hilbert- 
Schmidt class is also due to him. These classes 
of operators are employed in the spectral 
theory of integral operators [2]. The trace 
class of operators in Hilbert spaces is defined 
as the class of operators for which the trace 
has meaning. A. Grothendieck [5] introduced 
nuclear operators, extending the definition of 
trace class operators to general Banach spaces. 
Nuclear operators and related classes of 
operators, such as integral operators and 
absolutely summing operators, play important 
roles in the theory of topological tensor prod- 
ucts [5], determinants [6], vector measures, 
and measures in linear spaces [7]. 


B. Compact Operators 


A ‘linear operator T from a tBanach space X 
to a Banach space Y is said to be compact (or 
completely continuous) if T maps any tbounded 
set of X to a trelatively compact set of Y. In 
other words, T is compact if, for any bounded 
sequence {x,} in X, the sequence {Tx,} in Y 
contains a fstrongly convergent subsequence. 
A compact operator is necessarily *bounded 
and hence continuous. A compact operator 
from X to Y maps any tweakly convergent 
sequence in X to a strongly convergent se- 
quence in Y. If X is treflexive, the converse is 
also true. 

In this article the set of all bounded (resp. 
compact) linear operators from X to Y are 
denoted by B(X, Y) (resp. B(X, Y)). 


C. Examples of Compact Operators 


(1) Degenerate operators. An operator 

Te B(X, Y) is said to be degenerate or of 
finite rank if the frange R(T) of T is finite- 
dimensional. A degenerate operator is neces- 
sarily compact. The tidentity operator in X is 
compact (Y= X) if and only if X is finite- 
dimensional. (2) *Integral operators with con- 
tinuous kernel. Let E and F be bounded closed 
regions in the Euclidean spaces R" and R", 
respectively, and let k=k(t,s), te F, se E, bea 
continuous function defined on F x E. Then 
the integral operator T with tkernel k, 


roo | k(t,s)x(s)ds, te F, (1) 


is a compact operator from the Banach space 
C(E) to C(F). (For the notation for various 
function spaces — 168 Function Spaces.) (3) 
Integral operators of Hilbert-Schmidt type. 

In example (2) let E and F be tLebesgue mea- 
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surable sets, and let ke L,(F x E). Then the 
integral operator T determines a compact 
operator from the "Hubert space L,(E) to the 
Hilbert space L,(F). (4) Let Q be a bounded 
open set in R”, s « t two real numbers, and 
H*(Q) the tSobolev space of order s (— 168 
Function Spaces). Then the natural injection 
from H'(Q) into H*(Q) is compact (F. Rellich’s 
lemma). 


D. Properties of Compact Operators 


Any linear combination of compact opera- 
tors is again compact. If a sequence {T,} of 
compact operators converges in the norm 
De, in the tuniform operator topology), then 
the limit T is compact. Thus B?(X, Y) is a 
closed subspace of the Banach space B(X, Y) 
with the *operator norm. Any product of 

a compact operator and a bounded opera- 
tor is compact. Namely, Ae B?(X, Y), Be 
B(Y, Z), and Ce B(Z, X) imply BAcB(X, Z) 
and ACeBO(Z, Y). In particular, B?^(X)— 
B(X, X) forms a closed ttwo-sided ideal 

of B(X) - B(X, X). An operator TeB(X, Y) 

is compact if and only if its *dual operator 
T'eB(Y', X^) is compact. The range of a com- 
pact operator is always tseparable. Let X and 
Y be Hilbert spaces. Then for any Te BO(X, Y) 
there exist forthonormal sets (4) in X and 
{Way in Y and a sequence {c,} of nonnegative 
numbers with lim c, —0 such that 


Tu-Y cuo),  ueX. 


Consequently, any compact operator between 
Hilbert spaces X, Y can be approximated by 
a sequence of degenerate operators in the 
operator norm. However, there are Banach 
spaces X and Y for which the statement is no 
longer true. In fact, a Banach space X (resp. 
the dual Y' of a Banach space Y) has the tap- 
proximation property (— 37 Banach Spaces L) 
if and only if every Te B“(Y, X) is the limit in 
norm of a sequence of degenerate operators 
for any Banach space Y (resp. X). 


E. The Riesz-Schauder Theorem 


Let Te B?(X) and consider a pair of linear 
equations 


u—Tu=f, fex, (2) 


oq—T'o-g, gek, (3) 


where T'eB(X^) is the dual operator of T in 
the tdual space X' of X. Put M — (ue X |u— 
Tu} and æ’ — (oe X'|o — Tei Then one 

and only one of the following two cases (i) and 
(ti) occurs. (i) Æ — (05, M’ = {0}; for any feX 
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equation (2) has a unique solution; and for any 
gc X' equation (3) has a unique solution. (ii) 
dim Æ —dim.ZJ'—m, 1 m« oo; (2) has a 
solution if and only if f is orthogonal to M’ 
(e, p(f)=0 for any oe); and (3) has a 
solution if and only if g is orthogonal to æ 
(i.e., g(u) 20 for any ue AJ), This is called the 
Riesz-Schauder theorem. In particular, when T 
is an integral operator in a suitable function 
space, this theorem is also called Fredholm's 
alternative theorem for integral equation (2). 


F. Fredholm Operators and Their Indices 


Let T be a ‘closed linear operator from a Ba- 
nach space X to a Banach space Y. T is called 
a Fredholm operator or an operator with index 
if both Ker T (2 null space N(T)) and Coker T 
(= Y/R(T)) are finite-dimensional. Then the 
integer ind T=dim Ker T— dim Coker T is 
called the index of T. If a closed linear opera- 
tor T has Coker T of finite dimension, then 
the range R(T) is closed. Moreover, if R(T) is 
closed and the domain D(T) is dense, then 
dim Coker T= dim Ker T", where T’ is the dual 
of T, and hence ind T— dim Ker T — dim Ker T". 
À linear operator K from X to Y is said to be 
T-compact if the domain D(K) contains D(T) 
and if K from D(T) with the *graph norm into 
Y is compact, i.e., for any bounded sequence 
{x,} in D(T) with sup || Tx, || < oo, the se- 
quence {Kx,} contains a strongly convergent 
subsequence. 

The following are basic properties of Fred- 
holm operators [4, 11]: (1) If T is Fredholm 
and K is T-compact, then T+ K is Fredholm 
and ind(T 4- K)— ind T. (2) If T is Fredholm 
and another linear operator S is sufficiently 
close to T in (graph) norm, then S is also Fred- 
holm and ind S 2 ind T. (3) If T is a Fredholm 
operator from X into Y and S is a Fredholm 
operator with dense domain in Y into Z, then 
ST is Fredholm and ind ST — ind S + ind T. If 
D(T) is dense, then D(ST) ts also dense. (4) 

Let T be a closed linear operator with dense 
domain; then T is Fredholm if and only if the 
dual T’ is Fredholm, and then ind T’ = —ind T. 

Fredholm operators and their indices were 
first studied by Russian mathematicians in 
connection with boundary value problems of 
differential equations and singular integral 
equations. M. F. Atiyah and I. M. Singer have 
proved that an felliptic linear differential 
operator P of order m on a compact differenti- 
able manifold M is a Fredholm operator from 
the Sobolev space H*(M) into Hr "(M) for 
any s and that its index is computed from the 
tsymbol of P and the fcharacteristic classes of 
M (— 237 K-Theory). Similarly, the indices 
of linear ordinary differential operators (and 
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more generally of maximally overdetermined 
systems of linear partial differential operators) 
are computed in various spaces of functions 
and generalized functions (H. Komatsu, B. 
Malgrange, J.-P. Ramis, M. Kashiwara). 


G. Spectra of Compact Operators 


The following structure of the *spectrum of 
compact operators Te B^(X) is derived from 
the Riesz-Schauder theorem. The spectrum 
o(T) of T consists of at most countably many 
points and has no accumulation points except 
possibly for 0. Any nonzero point of o(T) 

is an 'eigenvalue of T. When X is infinite- 
dimensional, 0 always belongs to c(T) but is 
not necessarily an eigenvalue of T. Each non- 
zero eigenvalue of T has finite (algebraic) 
*multiplicity, and hence the teigenspace .Z,(T) 
= {u| Tu= 2y}, AAO, is finite-dimensional. If 2 
is an eigenvalue of T, then 4 is an eigenvalue 
of the *adjoint operator T* of T with the same 
(either algebraic or geometric) multiplicity as 
that of A. 


H. Spectral Representations of Compact 
Normal Operators 


Let T be a compact tnormal operator in a 
Hilbert space H. Then we can find a tcomplete 
orthonormal set consisting solely of eigen- 
vectors of T. Namely, for each nonzero eigen- 
value 4; of T, take an orthonormal basis (qf?) 
of the eigenspace associated with 4;. Rearrange 
all the qf? into a sequence To, and add to it, 
if 0 is an eigenvalue, a complete orthonormal 
set of the eigenspace associated with 0. Then 
we obtain a desired complete orthonormal 

set of H. Let u, be the eigenvalue associated 
with o, Then the sequence Tu, is precisely 
an enumeration of nonzero eigenvalues of T 
with repetitions according to multiplicity. 

In terms of {¢g,} and {u„}, a tspectral repre- 
sentation of T is given as 


Tu= Y n(wo)e,  ueH. 
n-1 


The eigenvalue problem of a compact non- 
negative ‘self-adjoint operator T can be solved 
by means of the following Rayleigh prin- 
ciple. Consider Rayleigh's quotient R(x) = 
(Tx, x)/||x||*, x #0. The largest eigenvalue u, 
is obtained as ij, 2 max,, y R(x) and any vector 
x, Which attains this maximum is an eigen- 
vector. When the largest n — 1 eigenvalues 
Ui, Hn-1 and eigenvectors x,,..., X, ., are 
determined, the nth eigenvalue p, is the maxi- 
mum of R(x) on the subspace orthogonal to 
X4, ...,X,., and any vector which attains the 
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maximum is an eigenvector. À more direct 
characterization of y,, involving no previous 
eigenvectors, is given by 


foeeefaaceX | (ef 
Ti fa #0 \ Jj71,..un- 


Uz min ( max SEH 


This formula is referred to as the minimax 
principle. 


I. Classification of Compact Operators in 
Hilbert Spaces 


Let H and K be Hilbert spaces. Then a Te 
B(H, K) is compact if and only if (Te,, f,) 0 
for any orthonormal sequences e, in H and 

J, in K. J. von Neumann and R. Schatten 
(Ann. Math., 49 (1948)) classified operators 
Te B(H, K) in the following way. The opera- 
tor A —(T*T)'? is a compact nonnegative 
self-adjoint operator. Let a, za, z ... be the 
enumeration in decreasing order of the posi- 
tive eigenvalues of A, with each repeated ac- 
cording to its multiplicity. The «, — «,(T) are 
sometimes called the characteristic numbers of 
T. For any p>0 the set of all Te B'?(H) such 
that 


oo 1/p 
m,-($ ar) « Lo 
n=1 
is denoted by B,(H) (or simply B,). Among 
these classes, B, and B; are most important. 
B, is called the trace class and B, the Hilbert- 
Schmidt class. Correspondingly, | T'||, and 
I T ||, are called the trace norm and Hilbert- 
Schmidt norm of T, respectively. B, is also 
called the nuclear class and any operator 
T&€B, a ‘nuclear operator. The norm ||T ||, can 
also be defined more directly as follows. (1) If 1 
«p « oo, | T ||, —sup |(Te,, fj), ; (i) If0 px 2, 
IT |, inf (I Te, I)h; Gii) I2 p « oo, ITI, — 
sup ||( || Teil, where e, and f, range over 
the orthonormal sequences in H and K, and e, 
over the orthonormal bases in H. The class B, 
is a two-sided * ideal in the Banach algebra B. 
Moreover precisely, Te B, and ReB imply 
IRTI,&IRI ITI, ITRI, «IRI ITI, and 
| T*|, TH, When 1 € p € oo, the class B, 
becomes a Banach space with the norm ||7'||,. 
It is always a *quasi-Banach space, in which 
the set of all degenerate operators is dense. 
The norm ||T ||, (T#0)is a decreasing func- 
tion of p. Hence B, c B, if p<q. Also, Te B, 
and ScB, imply TS€B,, where l/r — 1/p 4- 1/q. 
Let T€B,, and let (jj be an enumeration of 
eigenvalues of T with repetitions according to 
(either geometric or algebraic) *multiplicit y. 
Then 


Y Iul«lTI. 
J 
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Hilbert-Schmidt class. For an arbitrary 
complete orthonormal set {u,}, we have 


ITI 7, || Tull’. 


Thus B, can be defined as the set of all Te B 
such that the sum on the right-hand side is 
finite for a certain complete orthonormal set 
{u,}. A linear operator T: L,(E)>L,(F) is of 
Hilbert-Schmidt class if and only if it is an 
integral operator of Hilbert-Schmidt type 
(example (3) in Section C). The space B, with 
the norm ||T ||, becomes a Hilbert space with 
the inner product defined by 


(T, S); = 3 (Tu Su,), 


where {u,} is as above. 
Trace class. For an operator T'€B,, the 
trace tr(T) of T is defined as 


tr(T) - Y (Tu, u,). 


Here the right-hand side converges absolutely 
and does not depend on the complete ortho- 
normal set {u,}. The trace is a bounded linear 
functional on the Banach space B,. The prod- 
uct of two operators of Hilbert-Schmidt type 
belongs to the trace class, and the converse is 
also true. If 1 <p, q « oo satisfy 1/p+1/q=1, 
then the inner product (T, S> of Te B,(H, K) 
and SeB,(K, H) is defined by <T, SX =tr(ST). 
Under this inner product the dual of the 
Banach space B,(H, K) is identified with 
B,(K, H). Similarly, the dual of B?(H, K) 

is isomorphic to B,(K, H), and the dual of 
B,(K, H) to B(H, K) (J. Dixmier, Schatten). 


J. Volterra Operators 


A compact operator T in a Hilbert space is 
said to be a Volterra operator if it is tquasi- 
nilpotent, i.e., its fspectral radius is 0. The 
integral operator 


b 


(Tx)(t)= | k(t,s)x(s)ds, xeL,(a,b), 


t 

appearing in the integral equation of Volterra 
type is a Volterra operator. Conversely, a 
Volterra operator satisfying a suitable con- 
dition is unitarily equivalent to such an inte- 
gral operator (M. S. Livshits, Gokhberg and 
Krein [13]). A Volterra operator admits an 
abstract triangular representation in a manner 
similar to Jordan's canonical form (— 390 
Spectral Analysis of Operators H). 


K. Nuclear Operators 
Extending the definition of trace class to 


operators in Banach spaces, A. Grothendieck 
[5] defined a nuclear operator (or Fredholm 
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operator according to [6]) from a Banach 
space X into a Banach space Y to be a linear 
operator T: X 2 Y that is represented as Tx = 
2,2; x, aj? b; with a sequence 4,20 in /,,a 
bounded sequence a; in X', and a bounded 
sequence b; in Y. In other words, a linear 
operator T: X >Y is nuclear if and only if it is 
decomposed as the product 


X 51,51, 5 Y, (4) 


where A and B are bounded linear operators 
and A is multiplication by 4; in /,. The in- 
fimum of || A|| LA || B|| is called the nuclear 
norm of T and is denoted by | T |,. When 
X and Y are Hilbert spaces, this coincides 
with the trace norm. The integral opera- 
tor T defined by (1) is nuclear with || T |], = 
fsup|K(t, s)|ds. The totality B,(X, Y) of nuclear 
operators T: X » Y forms a Banach space 
under the nuclear norm. If T is nuclear and 
A is bounded, then || TA||, € | T ||, || A|| and 
|AT ||, SIAI IT, If T is nuclear, then the 
dual T" is nuclear, and | T'|, € | T||,. Suppose 
that X and Y are Banach spaces satisfying one 
of the following conditions: (1) Y is the dual of 
a Banach space; (11) X' has the approximation 
property; (iii) Y" has the approximation prop- 
erty. Then conversely a Te B(X, Y) is nuclear 
if the dual T” is nuclear, and ||T||,=||T"||,. 
However, this is not necessarily the case in 
general (T. Figiel and W. B. Johnson). 
Replacing l, by /,, we obtain the definition 
of operators T: X ^ Y of summable pth power. 
Grothendieck [5] considered the case Oc px1 
and showed that if Tj, ..., T, are of summable 
Dis Path power, then the product T, ... T, is 
of summable rth power, where r is given by 


l/r - (X 1/p) — (n+ 1)/2. 


L. Traces and Determinants of Operators 


Let T be a linear operator of ‘finite rank in a 
linear space X. There are a finite number of 
elements de X’ and b;e X such that 


Tx= y (x, abi. (5) 
i=l 


Then tr(T) 2 <b; a;» and det(1 — T)= 
det(à; ; — (bj, aj?) are independent of the repre- 
sentation (5). Let X be a Banach space. If 

T € B, (X) is represented as Tx =È 4;(x, aj» bj, 
then it seems reasonable that the trace tr(T) 
be defined by X 4;(5;, aj», but the sum may 
depend on the representation. It is known that 
a Banach space X has the approximation 
property if and only if the sum does not de- 
pend on the representation for any nuclear 
operator T. However, if T is of summable 
(2/3)rd power, the trace tr(T) is always de- 
fined uniquely. If X or T satisfies the afore- 
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mentioned condition, then the determinant 
det(1 — T) is also defined uniquely as the limit 
of det(1 — 7,), where T, is the nth partial sum. 
det! —zT) is an entire function of z and its 
zeros are exactly the reciprocals of the nonzero 
eigenvalues of T. If det(1 — z T) Z0, then the 
resolvent (1 —z T) ! is given by Fredholm's 
formula extending tCramer’s rule. Eigen- 
vectors are also computed explicitly [6]. 

Let H be a Hilbert space, and let T € B,(H). 
Then det(1—z T) — IL I(1 —z4j), where 4; are the 
nonzero eigenvalues of T, and is an entire 
function of genus 0. Moreover, let pz: 2 be an 
integer. Then the modified determinant 


det, (1 —zT)- |] fa SE ko} ei) | 
k-1 


J 


is defined for Te B,(H) and is an entire func- 
tion of genus p — 1. This type of determinant 
was introduced by T. Carleman (Math. Z., 9 
(1921)) for the Hilbert-Schmidt class of oper- 
ators (— also Hilbert [2]). It is utilized to 
prove the completeness of the troot vectors of 
T and many other facts [9, 12, 13]. 


M. Weakly Compact Operators 


A linear operator from a Banach space X to 
a Banach space Y is said to be weakly com- 
pact if T maps any bounded set of X to a rela- 
tively weakly compact set. Weakly compact 
operators are bounded. They have proper- 
ties similar to those of compact operators. 
The following are equivalent conditions for a 
bounded linear operator T: X Y. (1) T is 
weakly compact; (2) T’: Y' — X' is weakly 
compact; (3) T": X" 2 Y" maps X" into Y; 
(4) There is a reflexive Banach space Z and 
bounded linear operators $: X >Z and R: 
Z— Y such that T= RS. The last characteri- 
zation is due to W. J. Davis, Figiel, Johnson, 
and A. Pełczyński. Any linear combination of 
weakly compact operators is weakly compact. 
The uniform limit of a sequence of weakly 
compact operators is weakly compact. The 
product of a weakly compact operator and a 
bounded operator is weakly compact. 
Dunford-Pettis theorem ([9]; R. S. Phillips, 
Trans. Amer. Math. Soc., 48 (1940)). Let (Q, ul 
be a o-finite measure space and X a Banach 
space. If T: L(Q)- X is a weakly compact 
linear operator, then there exists a bounded 
strongly measurable function g on Q with 
values in X such that 


7/- Juan 


Then T maps weakly convergent sequences in 


L,(Q) into strongly convergent sequences in X. 


In particular, the product of two weakly com- 
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pact linear operators in L,(Q) is compact. 
Similarly, Grothendieck and R. G. Bartle, N. 
Dunford, and J. Schwartz (— [9]) have proved 
the following: Let Q be a compact Hausdorff 
space and X a Banach space. If T: C(Q) X is 
a weakly compact linear mapping, then there 
is a fvector measure y defined on the *Borel 
sets in Q with values in X such that <u, x» isa 
"Radon measure for any x'e X’ and 


Tf= 1 (t)du(t). 


T maps weakly convergent sequences in C(Q) 
into strongly convergent sequences in X. The 
product of two weakly compact linear opera- 
tors in C(Q) is also compact. 


N. Absolutely Summing Operators 


A linear operator T from a Banach space X to 
a Banach space Y 1s called an integral operator 
if there is a positive "Radon measure u on the 
product A x B of the unit ball A in X' and 

the unit ball B in Y" both equipped with the 
weak* topology such that 


(Tx, y) = | CD W'S") du(x', y^), 


or equivalently if T: X > Y" is decomposed 

as (4) with /,, and /, replaced by L,,(M) and 
L,(M) for a suitable compact space M with a 
Radon measure. Nuclear operators are clearly 
integral. Integral operators are weakly com- 
pact, but they are not necessarily compact or 
nuclear. However, the product of an integral 
operator and a weakly compact operator is 
nuclear, and hence every integral operator in 
a reflexive Banach space is nuclear. 

If a compact (resp. weakly compact) opera- 
tor T from a Banach space X to a Banach 
space Y maps a closed linear subspace X, of 
X into a closed linear subspace Y, of Y, then 
the restriction T,:X,— Y, and the induced 
operator T! : X/X, — Y/Y, are also compact 
(resp. weakly compact). The corresponding 
result does not hold for nuclear operators and 
integral operators. The following conditions 
are equivalent for a bounded linear operator 
T:X— Y: (i) There is a Banach space Y > Y 
such that T: X > Y is integral; (ii) there is a 
positive Radon measure u on the unit ball 
A of X' such that 


| Tx | cl x'>| dulx’); 


(iii) there is a constant C such that 


ȘI Txil €Csup(Y. ls, xL Ix l1) 


holds for any finite set {x,,...,x, in X; (iv) if 
X x; is an tunconditionally convergent series 
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in X, then È |[Tx;|| converges. Grothendieck 
called such an operator a right semi-integral in 
view of property (i), and A. Pietsch called it 
absolutely summing in view of (iv). Similarly, 
a bounded linear operator T: X — Y is called a 
left semi-integral if there is a Banach space X 
such that X = X/X, for a closed subspace X,, 
and the product T: X 5 Y is integral. The 
product of two right (resp. left) semi-integral 
operators is nuclear. A bounded linear opera- 
tor in a Hilbert space is right (or left) semi- 
integral if and only if it is of Hilbert-Schmidt 
class. A bounded linear operator T from a 
Banach space X into a Banach space Y is 
integral if and only if the dual T' is integral. 
It is a right (resp. left) semi-integral if and only 
if T’ is a left (resp. right) semi-integral. 

For other related classes of operators — 
Grothendieck's paper in Boletin Soc. Mat. Sáo 
Paulo, 8 (1953). 
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A. Compact Groups 


A ‘topological group G is called a compact 
group if the underlying topological space of G 
is a *compact Hausdorff space. The *torus 
group T"— R"/Z" (n— 1,2, ...) (commutative 
group), the forthogonal group O(n), the tuni- 
tary group U(n), the tsymplectic group Sp(n), 
and the additive group Z, of tp-adic integers 
are compact groups (— 60 Classical Groups; 
for other compact Lie groups — 249 Lie 
Groups, 248 Lie Algebras). Let C(G) be the 
‘linear space formed by all the complex-valued 
continuous functions f, g, h, ... defined on a 
compact group G; C(G) is a *Banach space 
with the norm 


iI fll =sup | f(x)]. 
xeG 

Since a compact group G is "locally com- 
pact, there exists a right-invariant *Haar mea- 
sure on G. Because of the compactness of G, 
the total measure of G is finite, and the mea- 
sure is also left-invariant. By the condition 
that the total measure is 1, such a measure is 
uniquely determined. The integral of f in C(G) 
relative to this measure is called the mean 
value of f. Since for f, ge C(G), f(xy Toi y) is 
continuous in two variables x, y, the tconvo- 
lution f x g(x) f f(xy Tat y)dy also belongs 
to C(G). C(G) constitutes a ring under the 
multiplication defined by the convolution. 
This ring can be considered as an extension of 
the notion of a *group ring for finite groups; it 
is called the group ring of the compact group 
G. The function x f(x !) will be denoted by 
f *, and the inner product in the "function 
space L(G) will be written as ( f, g). 





B. Representations of Compact Groups 


Let G(E) be the group of units of tbounded 
linear operators on a Banach space E, and 
suppose that we have a homomorphism U ofa 
topological group G into G(E). The homomor- 
phism U is called a strongly (weakly) contin- 
uous representation on E of G if, for any ac E, 
the map x U(x)a of G into E is continuous 
with respect to the strong (tweak) topology on 
E. When E is a *Hilbert space, a strongly con- 
tinuous representation U such that every 
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U (a) is a unitary operator is called a *uni- 

tary representation. Let U be any strongly 
continuous representation of a compact 

group G on a Hilbert space E; for a, b in E, let 
«a, b» be the mean value of the tinner product 
(U (x)a, U (x)b) on E. Then U is a unitary repre- 
sentation on the Hilbert space E with the new 
inner product <a, bY. 

A representation U of G on a Banach space 
E 1s said to be irreducible if E contains no 
closed subspace other than {0} and E, which is 
invariant under every U (x)(xe G). If a weakly 
continuous representation of a compact group 
on a Banach space E ts irreducible, then E is 
finite-dimensional. Moreover, any unitary 
representation U of a compact group on a 
Hilbert space E can be decomposed into a 
discrete *direct sum of irreducible representa- 
tions. Namely, there exists a family | E,j,., of 
irreducible (hence finite-dimensional) invariant 
subspaces E, of E which are orthogonal to 
each other such that E = X. 4 E,. In particular, 
any continuous representation of a compact 
group on a finite-dimensional space is tcom- 
pletely reducible. In L,(G) with respect to the 
*Haar measure on a *locally compact group G, 
the representation U defined by (U(x) f)( y) « 
fx (x, ye G, f e L,(G)) is a unitary repre- 
sentation of G. This representation U is called 
the (right) regular representation of G. Decom- 
position of the regular representation of a 
compact group G into irreducible represen- 
tations is given by the Peter-Weyl theory, 
described later. 

In the rest of this article, the representations 
under consideration will be (continuous) repre- 
sentations by matrices of finite degree. Let 
D, (x) 2 (dij (x)) and D,(x) — (df? (x)) be irredu- 
cible unitary representations which are not 
mutually tequivalent. Then from *Schur’s 
lemma follow the orthogonality Pom 
(dP, di?) 20 and (,/n, di, due dP) = Sim Sin 
(where n, is the degree of the EE 
DL From each ‘class D, of irreducible repre- 
sentations of G, choose a unitary representa- 
tive D,(x) = (di;(x)), and let n, denote its de- 
gree. Then from the orthogonality relations it 
follows that the n, di, (x) form an tortho- 
normal system of L,(G). 

Let he C(G) and consider the map H: f 
hx f of C(G) to C(G). Then H is a compact 
operator in C(G). Since C(G) is contained in 
L(G), we can define the inner product in C(G). 
By (hx f,g) =(f,h* x g), h=h* implies (Hf, o 
=( f, Hg); that is, H is a *Hermitian operator. 
For a given f in C(G), there exists h(=h*) in 
C(G) such that h x f is uniformly arbitrarily 
near f. From the theory of compact Hermitian 
operators, Hf can be uniformly approximated 
by linear combinations of the *eigenfunctions 
of H. Since the teigenspace of H is finite- 
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dimensional and invariant under U (a), the 
eigenfunctions of H are linear combinations of 
a finite number of n, di; (x). Hence any func- 
tion f in C(G) can be uniformly approximated 
by linear combinations of a finite number of 
n, dix). This fact, like the similar result in 
tFourier series, is called the approximation 
theorem. From this, it follows that the or- 
thogonal system (In; dto) } is tcomplete; i.e., 
if an element of C(G) is orthogonal to each 
element in this system, then it is 0. 

Since C(G) is dense in L(G), (n, dz(x)] is 
a ‘complete orthonormal system of the Hil- 
bert space L(G). Hence for any f € L5(G) its 

"Fourier series" $, Xi; ct / n, d; (x) (where cj; 
-(f n, d;)) converges to f in the mean ob. 
order 2 (1.e., with respect to the tmetric of 
L,(G)). In particular, if G is a compact "Lie 
group and f is sufficiently many times dif- 
ferentiable, then this series converges uni- 
formly to f. 

The space V7 of dimension n, spanned by 
the elements d7(x) (1 <j<n,) in the ith row of 
the matrix D,(x) is invariant under the right 
regular representation U. The representation 
on V? given by U can only be D,. Then the 


fact that (n, d?(x)} is an orthonormal system 
of L(G) means that the regular representation 
U of a compact group G is decomposable into 
a discrete direct sum of finite-dimensional 
irreducible representations. Each irreducible 
representation D, is contained in U with multi- 
plicity equal to its degree n,. 

If a function (x) in C(G) satisfies o( y ! xy) 
= (x) for any x, y, then it is called a class 
function. The set K(G) of all the continuous 
class functions coincides with the tcenter of the 
group ring C(G). The tcharacter of an irredu- 
cible representation of G is a class function, 
and the set (y, (x); of all characters plays 
the same role as the orthogonal system 
(n, del) in C(G). Namely, (y,(x)) is a 
complete orthonormal system in (the tcom- 
pletion of) K(G), and any class function can be 
uniformly approximated by linear combi- 
nations of a finite number of these characters. 

The preceding paragraphs give a brief de- 
scription of the Peter-Weyl theory. If G is the 
1-dimensional torus group T! = R/Z, namely, 
the compact group of real numbers mod 1, 
then this is actually the theory of Fourier 
series concerning periodic functions on the 
line. (For concrete irreducible representations 
of O(n), U (n), Sp(n), and formulas for charac- 
ters — 60 Classical Groups. For represen- 
tations of compact Lie groups — 249 Lie 
Groups, 248 Lie Algebras.) The theory of 
compact groups was completed by F. Peter 
and H. Weyl (Math. Ann., 97 (1927)), and J. 
von Neumann's theory concerning almost 
periodic functions in a group (1934) united the 
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theory of compact groups with H. Bohr's 
theory of almost periodic functions (— 18 
Almost Periodic Functions). 


C. Structure of Compact Groups 


Let a be an element of a compact group G 
different from e. Since the underlying space of 
a topological group is a tcompletely regular 
space, there exists a function in C(G) such that 
f (a) * f (e). Hence there exists a representation 
D(x) of G such that D(a) is not equal to the 
unit matrix. This means that any compact 
group G can be expressed as a tprojective limit 
group of compact Lie groups. Beginning with 
this fact, von Neumann (1933) showed that a 
"locally Euclidean compact group is a Lie 
group (— 423 Topological Groups N). 


D. Set of Representations 


The set G' = {D} of representations of G by 
matrices admits the following operations: (i) 
D, & D, (tensor product representation); (ii) 
E 0 ) e 
D OD. (direct sum representa- 
0 D, 
tion); (iii) P^! DP (equivalent representation); 
and (iv) D (complex conjugate representation). 
Let M be a subset of G' such that De M when- 
ever De M and the irreducible components 
of D, & D, are in M whenever D. D, e M. 
Then M is called a module of representations 
of G. There is a one-to-one correspondence 
between closed normal subgroups H of G and 
modules M formed by all the representations 
of G/H. 

A representation of G' is a correspondence 
which assigns to each D a matrix A(D) of the 
same degree as that of D and preserves the 
operations of G’: A(D, & D;) 2 A(D,) & A(D,), 
A(D, ® D;) - A(D,) € A(D;), A(P"! DP)- 

P^! A(D)P, and A(D) = A(D). Let G” be the 

set of all the representations of G'. Define 

the product of 4,, A,eEG” by A, A,(D)= 
A,(D)A,(D) and a topology on G” by the 
tweak topology of the functions A(D) of D. 
Namely, a typical neighborhood of A, is of the 
form U(Ao; D, D; £)= {A| || A(D;)— Ao(D)) || 
<e,i=1,...,s}. G” is a topological group 
under this multiplication and topology. Then 
the Tannaka duality theorem states that G” = 
G holds (T. Tannaka, Tóhoku Math. J., 45 
(1939)). Let R(G) be the 'algebra over the 
complex number field C formed by the set of 
all the linear combinations of a finite number 
of df,(x), and let Aut R(G) be the automor- 
phism group of this algebra R(G). Let G* 

be the set of all the elements o in Aut R(G) 
which commute with every left translation 
L(x) ((L(x)f)(y) = f(xy)) and which satisfy 
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c(f )=o(f). Then G* is a topological group 
with respect to the weak topology, and the 
Tannaka duality theorem implies that the 
correspondence which assigns to each xeG 
the right translation R(x) (the restriction of 
U (x) to R(G)) is an isomorphism of G onto 
G* as topological groups. For the case where 
G is a compact Lie group, C. Chevalley re- 
stated the Tannaka duality theorem as one 
giving a relation between compact Lie groups 
and complex algebraic groups (— 249 Lie 
Groups U). 
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A. General Remarks 


The notion of complexes was introduced by H. 
Poincaré to study the topology of tmanifolds 
by combinatorial methods (— 65 Combina- 
torial Manifolds). Various kinds of complexes 
have been introduced in the course of the 
development of topology. These will be dealt 
with individually in the sections that follow. 


B. Euclidean Simplicial Complexes 


Let R^ be the N-dimensional tEuclidean 
space, and let a, 41, ..., a, be points of R” 
with coordinates a; — (af), a!”’, ..., a) for i— 0, 
1, ...,m. For real numbers An, 4,, ..., Ams we 
denote by Zoo - 41a, +... + Amam the point 
(Zioa, Lio Ajal”, ..., LiLo 4af) of RP. 

A set of n+1 points do, a,,...,a, of RY is 
said to be independent or in general position if 


vectors Ajay, dà, ..., dgd, are linearly inde- 
pendent. For given independent points do, 
4,, ..., Oe let |aga, ... a,| denote the subset of 
R“ given by 


[aoa] ... Gelz (4949 - 44,0, +... 
+A, Ay|AgtA,+...+4,=1,4; 20}. 


laga, ...a,| is called an n-simplex with vertices 
do, 44, ..., d. Simplexes are denoted by A, A’, 
6, t, etc. For example, a 0-simplex Joel is a 
point dy, a 1-simplex |aga,| is a segment aga;, 
and a 2-simplex |aga,a;| is a triangle Aaga,a;. 
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We attach to Aoao 4- 4,8, +... + 4,8, the co- 
ordinates (Ao, 4,, ..., n) called the barycentric 
coordinates. If loug: ... a,| is an n-simplex, 
every subset {a; > d; , ..., Ai} Of (a9,0,, ..., An} 
is independent. A q-simplex |a;,a;, ...a; | is 
called a q-face of laga, ...a,|. 

A set 8 of simplexes in a Euclidean space R“ 
is called a Euclidean (simplicial) complex in R“ 
if & satisfies the following three conditions: (i) 
Every face of a simplex belonging to & is also 
an element of &. (ii) The intersection of two 
simplexes belonging to & is either empty or a 
face of each of them. (itt) Each point of a sim- 
plex belonging to & has a neighborhood in Ri 
that intersects only a finite number of sim- 
plexes belonging to v A Euclidean complex is 
also called a geometric complex (or rectilinear 
complex). Each 0-simplex in & is called a ver- 
tex in &. We define the dimension of & to be 
n if & contains an n-simplex but no (n+ 1)- 
simplex, and oo if R contains n-simplexes for 
all nz: 0. 

By a subcomplex of a Euclidean complex & 
we mean a Euclidean complex that is a subset 
of &. For a Euclidean complex &, its r-section 
(or r-skeleton) is defined to be the subcomplex 
of & consisting of all n-simplexes (n <r) in H. 

If & is a Euclidean complex in RP, we de- 
note by |K] the set of points in RN belonging to 
simplexes in K. This set || is called the Eucli- 
dean polyhedron of &. 

By subdivision &' of a Euclidean complex & 
we mean a Euclidean complex such that |8'] 
=|] and each simplex in &' is contained in a 
simplex in &. 

Specifically, we can construct a subdivision 
MI of & utilizing tbarycenters of simplexes in 8; 
namely, we let &’ be the set of all r-simplexes 
whose vertices consist of barycenters of the 
series A, C A, c... c A, of simplexes in &. Then 
& is a subdivision of 8, called the barycentric 
subdivision of and denoted by Sd &. 

Given a Euclidean complex & and a subset 
A of |K], we define the star of A in & to be the 
subcomplex of & that consists of simplexes {A} 
and their faces such that AN A-G), Further- 
more, we define the open star of A in & as 
the union of topen simplexes (the interiors of 
simplexes) of & whose closures intersect 4. We 
denote by Stg(A) the star of A in & and by 
O,(A) the open star of A in &; then O4(A) is an 
open set whose closure is |Stg(A)|. 

The notion of Euclidean simplicial com- 
plexes can be generalized to that of Euclidean 
cell complexes; this is done by replacing the 
term simplex by tconvex cell in the definition of 
Euclidean simplicial complex. For a Euclidean 
cell complex K, the notions of vertex, dimen- 
sion, subcomplex, r-section, and subdivision 
are defined similarly as in the case of Eucli- 
dean simplicial complexes. 
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C. Simplicial Complexes 


Given a Euclidean simplicial complex K, let K 
denote the set of all the vertices in &, and let 
4 denote the set consisting of those subsets 
(to; Dh of K for which there exist simplexes 
A in K such that (ve, ..., v, coincide with the 
set of vertices of A. Then we have (1) if se X 
and s>s’, s' # @, then s'e X; (2) every set con- 
sisting of a single element in K is in X, and the 
empty set is not in 2. 

In general, if a pair (K, 2) of a set K and a 
set X consisting of finite subsets of K satisfy (1) 
and (2), then the pair (or the set K) is called an 
abstract simplicial complex (or simply sim- 
plicial complex). If K is a simplicial complex, 
each element of the set K is called a vertex in 
K, and each set of Z is called a simplex in K. A 
simplex consisting of n+ 1 vertices is called an 
n-simplex. We say that a simplicial complex K 
is finite if it consists of a finite number of ver- 
tices; it is locally finite if every vertex of K 
belongs to only finitely many simplexes in K. 
We define similarly countable simplicial com- 
plexes and locally countable simplicial com- 
plexes. The dimension and r-section of a sim- 
plicial complex are defined as in the case of 
Euclidean complexes. By a subcomplex of a 
simplicial complex K we mean a simplicial 
complex K, such that each simplex of K, is a 
simplex in K. 

Let K and L be simplicial complexes. A 
mapping o: KL is called a simplicial map- 
ping (simplicial map) of K to L if the following 
condition is satisfied: If vg, v,, ..., v, are ver- 
tices of a simplex of K, then @(v9), (v), ..., 
Q(v,) are vertices of some simplex of L. Two 
simplicial complexes K and L are said to be 
isomorphic if there exist simplicial mappings 
o KOL, y: LK such that y oo and gow 
are the identity mappings. 

Given a simplicial complex K, let |K| denote 
the set of all functions x from the set of ver- 
tices of K to the closed interval J =[0, 1] satis- 
fying the following conditions: (i) The set 
{ve K | x(v) 40} is a simplex of K. (ii) E ex x(v) 
= |. The value x(v) is called the barycentric 
coordinate of the point x e|K| with respect to 
the vertex v. Each vertex v of K is identified 
with the point of |K| whose barycentric co- 
ordinate with respect to the vertex v is 1 and is 
called a vertex in |K|. For a simplex s= (vo, v,, 
Dal in K, we define |s| - (x e|K|| x(v) 2 0 
(v¢s)}, which is called a simplex in |K |. We 
call (xe|s|| x(v;) » 0 (i—0, 1,...,n)} an open 
simplex of | K| or the interior of |s|. We remark 
here that for an arbitrary simplex sin K a 
point xe|s| can be written in the form x= 
Xe, Ell: v. We can define a metric d on |K] 
by d(x, y) 2 (Ze x(v) — y(v))?)"”. However, |K | 
is usually supplied with a tstronger topology 
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defined as follows: (1) Each simplex |s| in |K | 
has the topology given by the metric d. (2) A 
subset U of |K| is open if and only if UN|s| 

is an open subset of |s| for each simplex s of 
K. Henceforward, by the topology of |K| we 
mean the topology just defined unless other- 
wise stated. The set |K| with such a topology 
is called the polyhedron of K. The topology 
of |K| coincides with the above metric topol- 
ogy if and only if K is locally finite. If sim- 
plicial complexes K and L are isomorphic, 
then |K| and |L| are homeomorphic. If K is 
the simplicial complex defined by a Euclidean 
simplicial complex K, then | Kj and |&| are 
homeomorphic. When K is finite, there exists 
an Euclidean simplicial complex & whose sim- 
plicial complex is isomorphic to K. Accord- 
ingly |K| is homeomorphic to the Euclidean 
polyhedron |& |. 

If K and L are simplicial complexes, a map- 
ping f:|K|—>|L] satisfying the following con- 
dition is said to be linear: If s= (v, v,,...,0,} 
is a simplex in K and x 2 Agvg t+... + A,v, (Ag + 
e A,2 1, 4; 20), then f(vo), ..., f(v,) belong 
to a simplex in L and f(x) — Ao f(vo) +... + 
As f (v,). The linear mapping determined by 
a simplicial mapping o: KL is denoted by 
e|: | K||L| and is also called a simplicial 
mapping and denoted by the same letter o. 
Let K and K' be simplicial complexes. If 
there exists a linear mapping /:|K'|2|K |, 
which is a homeomorphism, then we identify 
|K| and |K’| by / and call K’ a subdivision of 
K. The barycentric subdivision Sd K of a sim- 
plicial complex K 1s defined as in the case of 
Euclidean complexes. We also have notions of 
star St,(A) and open star O,(A) for a simpli- 
cial complex K and a subset A of |K|. If K 
and L are simplicial complexes, a mapping 
f :|K| L| is called a piecewise linear map- 
ping if there exist subdivisions K' and L' of 
K and L, respectively, such that f:|K’|>|L’| is 
linear. 

Given an open fcovering M={M,} ex of a 
topological space X, the index set K becomes 
a simplicial complex if we consider each finite 
nonempty subset s of K such that (pes M, # 
(Z (empty) to be a simplex. The resulting sim- 
plicial complex K is called the nerve of the 
open covering M. Furthermore, if Pt={M,} <x 
and 9t IN. Let, are open coverings of a set X, 
N is a trefinement of the covering W, and L is 
the nerve of N, then a simplicial mapping o: 
LK is defined by sending each vertex w in 
L to a vertex v in K such that N c M,. 

Given two disjoint simplicial complexes K 
and L, a simplicial complex K * L, called the 
join of K and L, is defined by the following: (1) 
the vertices of K » L are the vertices of K and 
the vertices of L. (2) A nonempty subset of 
vertices is a simplex of K « L if and only if its 
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subsets in K and L are empty or simplexes 
there. In particular, the join of a simplicial 
complex K and a single point is called the cone 
of K. 

A simplicial complex K is said to be ordered 
if a tpartial ordering is given in the set of ver- 
tices in K such that the set of vertices of each 
simplex is ftotally ordered. Given ordered 
simplicial complexes K and L, an ordered 
simplicial complex K x L, called the Cartesian 
product of K and L, is defined by the following: 
(1) The vertices in K x L are pairs (v, w), where 
v and w are any vertices in K and L, respec- 
tively. (2) A set of vertices (v9, wo), ..., (Un, Wp) 
such that pose... «v, and wo € ... Sw, is a 
simplex in K x Lif (vo, ..., v,) and (Wo, ..., Wp) 
are simplexes in K and L, respectively; all 
simplexes in K x L are obtained in this man- 
ner. (3) (vı, w,) € (v2, w2) if and only if v, € v; 
and w, xw,. 

Assume that either K or L is locally finite or 
that both K and L are locally countable. Let 
X and Y be topological spaces and 7 be the 
closed interval [0, 1]. We define an equivalence 
realtion ~ in the topological space X U(X x Y 
x DU Y by x~(x, y,0) and y ^ (x, y, 1), where 
ek, yc Y. The quotient space of XU(X x Y 
x I)U Y by this relation is called the join of X 
and Y and is denoted by X « Y. Then the poly- 
hedron |K x L| is homeomorphic to the prod- 
uct space |K| x |L| of the topological spaces 
|K| and |L]. And the polyhedron |K « L| is 
homeomorphic to |K| x |L]. 

By a triangulation T of a topological space 
X we mean a pair (K, t) consisting of a sim- 
plicial complex K and a homeomorphism 
t:| K|2 X. A triangulation is also called a 
simplicial decomposition. If T —(K, t) is a trian- 
gulation of X, the various concepts defined for 
K can be transferred to X by means of the 
mapping t. For example, by a simplex of the 
triangulation T we mean the image of a sim- 
plex of | K| under t. We say that a triangula- 
tion T —(K, t) is finite if K is a finite simplicial 
complex. If T=(K, t) is a triangulation and 
(K', I) is a subdivision of K, then T’=(K’,tol) 
is called a subdivision of T. If T, =(K,,t,), T, 
=(K,,t,) are triangulations of topological 
spaces X,, X,, respectively, a mapping f: X, 

— X, is called a simplicial mapping relative to 
T, and T, if t;! of ot, :|K,|9|K;| is a sim- 
plicial mapping. The following two problems 
on triangulations are famous: (1) Under what 
topological conditions is it possible for a given 
topological space to be supplied with a trian- 
gulation? (2) Given two triangulations T}, T; 
of a space X, are there subdivisions T; = 
(Kiti) T; - (K5,t5) of T, and T5, respec- 
tively, such that K; and K; are isomorphic? 
Concerning the second problem, the conjec- 
ture asserting the existence of subdivisions T, 
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and 7; as above is known as the fundamental 
conjecture (Hauptvermutung) in topology. 
Every 3-dimensional manifold is triangulable, 
and any two of its triangulations admit sub- 
divisions satisfying the condition in (2) (E. E. 
Moise [7]). 

In 1961, J. Milnor showed that the funda- 
mental conjecture is not true for polyhedra 
[8]. The triangulation problem and the funda- 
mental conjecture for topological manifolds 
were negatively solved by R. Kirby and L. 
Siebenmann [9] (— 65 Combinatorial Mani- 
folds). Any ‘differentiable manifold is triangu- 
lable, and the fundamental conjecture holds 
for its *C’-triangulations (rz 1) [6] (— 114 
Differential Topology). 

Let T;, T; be triangulations of topological 
spaces X,, X,, respectively, and let f: X, X, 
be a continuous mapping. Then a simplicial 
mapping o: X, X, relative to T, and T, is 
called a simplicial approximation to f if, for 
each xe X, the image (x) lies on the simplex 
of T, whose interior contains f(x). The follow- 
ing existence theorem is called the simplicial 
approximation theorem: For every continuous 
mapping f: X,  X,, there exist a subdivision 
T; of T, and a simplicial mapping o: X, X; 
relative to T; and T, that is a simplicial ap- 
proximation of f. If the triangulation T, is 
finite, then for a sufficiently large n we can 
choose Sd" T, as the T; above (where Sd? T — 
T and Sd" T 2 Sd(Sd" ! T) (nz 1)). If o: X, 
X, is a simplicial approximation to a continu- 
ous mapping f, then f and q are thomotopic. 


D. Cell Complexes 


Let V" be the tunit n-disk, S"! be the tunit 
(n — 1)-sphere, and X be a Hausdorff space. For 
a subset e of X,let e be the closure of e in X, 
and let é—e — e. A subset e of the space X is 
called an n-cell, or open n-cell in X if there is a 
relative homeomorphism q:(V", S" 1)—(e, ê), 
ie. a continuous mapping o: V"—e such that 
o(s" !)c é and q:V"—S" 1 2 —é is a homeo- 
morphism. For example, S" — (pj (pe S") is an 
n-cell. A set fe; | 4€ A] of cells in the Haus- 
dorff space X is called a cellular decomposition 
of X if the following three conditions are satis- 
fied: (i) e; Ne, is empty if 4# p; (ii) X 2| Jien ez; 
(11) If the dimension of e, is n+ 1, then é; c 
X", where X" is the union of all the cells e, 
(u€ A) whose dimensions are not greater than 
n. For example, the n-sphere S" has a cellular 
decomposition consisting of a single O-cell 
and a single n-cell. 

A Hausdorff space X together with its cellu- 
lar decomposition {e,} is called a cell com- 
plex, and each e, is called a cell in the cell 
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complex X. For a cell complex, the notions of 
vertex, n-section, and dimension are defined in 
the same way as the corresponding notions in 
Euclidean complexes. Let X be a cell complex 
and A a topological subspace of X such that 
the closure of each cell of X intersecting A is 
contained in A. Then the set of cells e such 
that eN A z Ø forms a cellular decomposition 
of A. The set A together with this cellular 
decomposition is called a subcomplex of the 
cell complex X. A cell complex X with its cells 
{e,} is said to be finite if the number of e; is 
finite. If each point in a cell complex X is an 
interior point of some finite subcomplex of X, 
then X is said to be locally finite. We define 
similarly a countable cell complex and a locally 
countable cell complex. If the closure of each n- 
cell of a cell complex X is homeomorphic to 
V" (nz 0,1, ...), X is said to be regular. If X 
and Y are cell complexes, a continuous map- 
ping f: X 2 Y such that f(X")c Y" (n20, 1, ...) 
is called a cellular mapping. If X and Y are cell 
complexes, the set of ttopological products 

€, Xe, Where e,, e; run over all cells of X, Y, 
respectively, is a cellular decomposition of the 
product space X x Y. The resulting cell com- 
plex X x Y is called the product complex of the 
cell complexes X and Y. 

A cell complex X is said to be closure finite 
if each cell in X is contained in a finite sub- 
complex of X; and X is said to have the weak 
topology if a subset U c X is open if and only if 
U e is relatively open in e for each cell e of 
X. We call a cell complex a CW complex if it is 
closure finite and has the weak topology. The 
cellular decomposition of a CW complex is 
called a CW decomposition. A locally finite cell 
complex is a CW complex. 

Fundamental properties of CW complexes 
are as follows: (i) A CW complex is a tpara- 
compact (hence *normal) space and is tlocally 
contractible. (ii) A subcomplex A of a CW 
complex X is a closed subspace of X, and A 
itself is a CW complex. (iii) A mapping f: X 
Y of a CW complex X to a topological space 
Y is continuous if and only if the restriction 
f\e is continuous for each cell e of X. (iv) If 
X and Y are CW complexes and f: X — Y is 
any continuous mapping then there exists a 
cellular mapping of X to Y that is homotopic 
to f (cellular approximation theorem). (v) A 
pair (X, A) consisting of a CW complex X and 
its subcomplex A has the *homotopy extension 
property for any topological space. (vi) A CW 
complex has the tcovering homotopy property 
for any ‘fiber space. (vii) The product complex 
X x Y of two CW complexes X and Y is not 
necessarily a CW complex, but it is thomo- 
topy equivalent to a CW complex. (viii) If 
either X or Y is locally finite, or if both X and 
Y are locally countable, then the product 
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complex X x Y is a CW complex. (ix) For CW 
complexes X and Y, the mapping space Y* 

is homotopy equivalent to a CW complex. 

(x) The tcovering space of a CW complex has a 
CW decomposition. 

If K is a simplicial complex, the polyhedron 
|K| is a regular CW complex whose cells are 
all open simplexes in |K]. A simplicial complex 
K is (locally) finite if and only if the CW com- 
plex |K] is (locally) finite. In particular, the 
Euclidean polyhedron of a Euclidean sim- 
plicial (or cell) complex is a locally finite CW 
complex. A polyhedron |K| generally admits a 
CW decomposition whose cells are far smaller 
in number than the simplexes constituting a 
simplicial decomposition of K. For any CW 
complex X, there exists a polyhedron |K | 
that is homotopy equivalent to X. In partic- 
ular, if X is an n-dimensional finite (coun- 
table) CW complex, we can choose as K an 
n-dimensional finite (locally finite and coun- 
table) simplicial complex. 


E. Semisimplicial Complexes 


By an ordered simplex in a simplicial complex 
K we mean a finite sequence (vo, v,, ..., Up) 
(n20) of vertices in K, contained in the set of 
vertices of a simplex in K. Let O(K), be the 
set of all ordered simplexes of K of length n+ 
1, and define mappings 0;: O(K),— O(K), , 
and s:O(K),— O(K),,, for i=0, 1,..., by 
Oj(Uo, ... , Vn) = (to, ... ,Uj.,, Drais, Dn) and 
Si(to, ... Del = (Vos ... , Vj, Vis Vis Ute DA 
Then the following relations hold: 


0;,00,—0,,00; (i<j), 


J J 
SOSSE SOS (i<j), 
0O S= 5.1 0 0; (i<j), 
0,0 5,= 5,0 Ôi- (i j - D), 
0,0 5; — 0;,4 O S; — identity. (1) 


Let A" be the n-dimensional simplex in R" 
with vertices eg —(0,0, ...,0), e, —(1,0, ...,0), 
Gass, ..., 1). By a singular n-simplex in a 
topological space X we mean a continuous 
mapping T:A"— X. Let S(X), be the set of all 
singular n-simplexes in X, and define map- 
pings 0;: (X), S(X),., and s:S(X),— S(X),., 
for i=0, 1,..., n by 0;T(Ao, ..., 4,) 2 T (åo; ..., 
Ài-1, Ais ..., An) and s T (åo, ..., 4,44) = T(Ag, 
dan Ait Apes daan so An+1) Where (Ao, 

... An) is the point 27.65 4;e;, 4; 20, Lais. 
Then relation (1) holds between 0; and s,. 

Because of the importance of relation (1), 
which is basic in defining thomology of sim- 
plicial complexes and topological spaces (— 
201 Homology Theory), S. Eilenberg and J. A. 
Zilber gave the following definition: A semi- 
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simplicial complex K consists of a sequence of 
sets K, (n —0, 1, ...) together with mappings 
0;: Ka >Kan- S: K, K,,4 (10,1, ..., n) satis- 
fying relations (1). An element of K, is called 
an n-simplex in K, and 6,, s; are called the ith 
face operator and the ith degeneracy operator, 
respectively. A simplex is said to be degenerate 
if it is the image of a simplex under some s. A 
semisimplicial complex is abbreviated as s.s. 
complex. The s.s. complexes O(K)= (O(K),, 
i, Si} and S(X)= (S(X),,0;, 5] are called the 
ordered complex of K and the singular complex 
of X, respectively. 

Let K be an s.s. complex, and let L, be 
a subset of K, for n=0, 1, .... If G((L,)cC L,., 
and s(L,) c L,., for each i, then L= (L,, 
0| Ln Si} Ln} is an s.s. complex, and L is called 
a subcomplex of the s.s. complex K. If A isa 
subspace of a topological space X, S(A) is 
a subcomplex of S(X). If K is an ordered sim- 
plicial complex, a subcomplex O'(K) of O(K) is 
obtained by considering the set of all ordered 
simplexes (Vvo, v,, ..., Del such that vo € v, € 
... Xv, (0-0, 1,...). If K and L are s.s. com- 
plexes, a sequence f= ( f,} of mappings f: 
K„—> L, defined for each n is called an s.s. map- 
ping if 0;o f, = f, ,00; and so f,=f,4,05; 
(O0<i<n). If f: X 2 Y isa continuous mapping 
of topological spaces, then f determines an s.s. 
mapping S(f):S(X)S(Y) by S(f)(T)=fo T. 
Two s.s. complexes K and L are said to be 
isomorphic if there is a bijective s.s. mapping of 
K to L. For two s.s. complexes K and L, we 
define the Cartesian product K x L to be the 
s.s. complex given by (K x L) =K, x Ln, 0,(o, 7) 
— (0,6, 0;1), Sieg, 1) — (sio, s;t) (oe Kp, «e L,). If K 
and L are ordered simplicial complexes, the s.s. 
complexes O'(K) x O'(L) and O'(K x L) are 
isomorphic. If X and Y are topological spaces, 
the s.s. complexes S(X) x S(Y) and S(X x Y) 
are isomorphic. 

Given an s.s. complex K, we construct a 
topological space |K] as follows: First, we 
provide K, with the fdiscrete topology and 
consider the topological space K — Uno K, X 
A". Next we consider simplicial mappings 
£': A" 1 +A" and nf: A"! S A" defined by pd 
=p; G «i) e(p) pj. G2 i) and q'(p)— p; 
(j <i), n'(p;) — pj, (j » i), where po, ..., p, are 
the vertices of A". The topological space |K] is 
defined to be the tquotient space of K with 
respect to an equivalence relation ~ that is 
defined by the following: (0,0, y) — (o, e'(y)) 

(ce K,, ye A"), (so, y) - (o, 1'(Y)) (EK, 

ye A"*!), where i—0, 1, ..., n. The space |K| is 
called the (geometric) realization of the s.s. 
complex K. Given an s.s. mapping f: KL, 
we obtain a continuous mapping |f|:|K|1L| 
defined by Le, yl) 2 [/(e), yl, where |o, y| is 
the point in |K| represented by (c, y)e K. We 
call | f| the realization of the s.s. mapping f. 
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The realization |K | of an s.s. complex K is a 
CW complex whose cells are in one-to-one 
correspondence with the nondegenerate sim- 
plexes in K. For a topological space X,a 
tweak homotopy equivalence p:|S(X)| X is 
defined by p(|T, y — TO) (Te S(X),, ye A"). 
This mapping gives rise to a homotopy equiv- 
alence when X is a CW complex. 

The singular complex S(X) of a topological 
space X has the following property: Given 
simplexes 9$, ...,0, ,, 0444, ..., 044,4, € K, with 
00; = 0;_, 0; (i« j, i, j#k), there exists a simplex 
c €K,., with 6,0 — o; (i k). An ss. complex K 
with this property is called a Kan complex. If, 
in addition, 0,0 =0,0' (o,0 € K,, 1# k) imply 
0,0 —0,0', we call K a minimal complex. For 
every Kan complex K, there are minimal sub- 
complexes M of K that are isomorphic to each 
other. Moreover, |M| is a fdeformation retract 
of |K|. For a Kan complex K, the thomotopy 
group can be defined combinatorially. 

Two CW complexes X and Y are homotopy 
equivalent if and only if the minimal sub- 
complexes of S(X) and S(Y) are isomorphic. 


F. Eilenberg-MacLane Complex 


Given an integer n 2 1 and a group z (Abelian 
if n> 2), there exists an tarcwise connected 
topological space X for which the thomotopy 
groups z;(X) are trivial for iz n and 2,(X)&z. 
Such a space is called an Eilenberg-MacLane 
space of type (z, n). Let Q(X; X, +) be the tpath 
space over X, and let py:O(X; X, x) X be 
the natural projection. If X 1s an Eilenberg- 
MacLane space of type (z, n), then (Q(X; X, +), 
Po, X) gives rise to a standard 'contractible 
‘fiber space whose fiber is an Eilenberg- 
MacLane space of type (z, n — 1). Assume 

that X is an Eilenberg MacLane space of type 
(n, n) with Abelian group z. Then X is (n — 1)- 
connected; hence the tHurewicz theorem can 
be utilized to show the existence of an isomor- 
phism h:z,(X)z H,(X), while the universal 
coefficient theorem can be utilized to show 
that H"(X;x)zHom(H,(X), z). Since in this 
case we have z,(X)z n, the element h ! e 
Hom(H,(X), z,(X)) can be regarded as an 
element of Hom(H,(X), n). Now the funda- 
mental class of X is defined to be the coho- 
mology class ue H"(X ; t) corresponding to 
h`. Let Y be a *CW complex and z(Y; X) be 
the set of *homotopy classes of continuous 
mappings from Y to X. Then there exists a 
one-to-one correspondence n(Y; X) H"(Y;n) 
given by the assignment [ f] f *u (— 305 
Obstructions). Let S(X) be the singular com- 
plex of X, and let M(X) be a minimal complex 
of S(X). If X is an Eilenberg-MacLane space 
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` of type (z,n), M(X) is isomorphic to a certain 


complex determined uniquely by x and n. This 
complex is called the Eilenberg-MacLane 
complex of type (z, n) and is denoted by 
K (n, n). The notation K (n, n) is also used to 
mean the space X itself. 

Let A(q) be a simplicial complex whose sim- 
plexes are all nonvacuous subsets of {0, 1, 
..., qt. Let e: A(q — 1) A(q) be the simplicial 
mapping defined by z(j) -j (0 &j &i— 1), eJ) 
—j41(ixj&q-1) and let 4;: A(q + 1) 2 A(q) 
be the mapping defined by 5,;(j) 2j (0 €j <i), 
"ní(j)-j—1(--1xj&q-1) Now K(z,n)is a 
Kan complex defined by K (z, n), = Z"(A(q); n), 
0,0 — 00€; $0 —001/, where Z°(A(q); n) — 7, 
Z'(A(q); n) is the set of z-valued functions 
defined on the set of pairs (i,j) such that 0 x 
i«j«q and satisfying the equality o(j, k): 
o(i,k)'-o(i,j)=1 for 0<i<j<k<q, and 
Z"(A(q); n) (n> 2) is the group of *oriented 
cocycles of the simplicial complex A(q). If z is 
Abelian, the structure of the Abelian group 
Z"(A(q); 2) gives K (n, n) the structure of an 
Abelian group in the s.s. category. This struc- 
ture yields a one-to-one correspondence 
K(x, n), K(n,n—1)-, x... x K(n,n—1)g for 
n 2 | and leads to the expression of te K (m, n), 
in the form (o, ,,...,09? with o,¢ K(t, n— 1). 
The W-construction of K(z,n— 1) for nz 1 
is a Kan complex W(z,n — 1) defined by W(z, 
n—1),— K(r,n— 1), x K(1, n), and Goalg, x 1) = 
(099,): 041 X 091, Ela X T4) — 0,0, X 0j, for 
1 «ixq, 8(a, X tj) - 50, X s1,, where o, e K(r, 
n — 1), and tes (o,.,, ..., 09» € K(x, n),. Let p: 
W(n,n— 1) K(n, n) be a natural projection. 
Then (W(n, n — 1), p, K (n, n)) plays the role of 
the *universal bundle for K (z, n — 1) in the s.s. 
category in the following sense: Let L be an 
s.s. complex, and let f: L>K(z,n) be an s.s. 
mapping. We define f * W(z,n— 1) to be the 
subcomplex of W(z,n— 1) x L generated by 
simplexes (o, x T4) x p, such that t, = f(p,), 
where c, x tE W(r, n— 1), and Geet, Let 
p: f *W(z,n— 1) L be the natural projection. 
Then ( f * W(z, n — 1), p, L) is called the prin- 
cipal fiber bundle induced from W(mn, n — 1) by 
f. Any principal bundle over L with group 
K(r,n— 1) can be expressed as an induced 
bundle. This property means that (W(x, n — 1), 
p, K(z,n)) is universal. On the other hand, 
we have an algebraic analog of the univer- 
sal bundle for the chain group of K(z, n— 1), 
called the bar construction [12]. Both these 
concepts were defined by Eilenberg and Mac- 
Lane in order to determine the structure of the 
(co)homology of K (z, n), which is denoted by 
(H*(n, n)) H, (n, n). This object was later 
achieved by H. Cartan, who introduced an 
improved notion called Cartan construction 
(— Appendix A, Table 6.III). Let z(L, K(n,n)) 
be the set of s.s. homotopy classes of s.s. map- 
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pings from L to K (z, n). If z is Abelian, there 
exists a one-to-one correspondence z(L, K (n, 
n))— H"(L; n) given by the assignment [f ]> 
f*u, where ue H"(zx, n; n) is the fundamental 
class of K(z,n). By virtue of this correspon- 
dence, k= f *ue H"(L; n) determines an in- 
duced bundle f * W(x, n — 1) uniquely up to 
equivalence, which is denoted by K(z, 
n—1)x,L. 

Let X be an arcwise connected topological 
space. Let X" be the Cartesian product of n 
copies of X. Clearly, the symmetric group G, 
of degree n operates on X". The n-fold sym- 
metric product SP" X of X is defined to be the 
quotient space of X" under the action of &,. If 
we specify a reference point of X, we have a 
natural inclusion SP"! X C SP" X and can 
consider the inductive limit space | Jı <,SP"X, 
denoted by SP? X. Then the Dold-Thom 
theorem [15] shows that M(SP?X)zx 

i£ 1 K(H,(X), i). In particular, we have 
M(SP?S")z K(Z,n)fornz 1. This result can 
be applied to obtain a direct relationship be- 
tween the axiomatic definition of tcohomology 
operations using K(z,n) due to Eilenberg and 
Serre and the constructive definition using the 
symmetric groups due to Steenrod (A. Dold 
[16], T. Nakamura [17 ]). For a detailed study 
of the (co)homology of SP"X — [18]. 


G. Postnikov Complex 


Let X be an arcwise connected topological 
space. For the sake of simplicity, throughout 
this section we assume that X is tsimple (— 
202 Homotopy Theory). Then the tPostnikov 
system of X can be defined as an inverse sys- 
tem (X,, p,) (n=0, 1,2, ...) consisting of topo- 
logical spaces X,, continuous mappings p,: 
X, X,.,, and a system (X, q,) (n—0, 1,2, ...) 
consisting of continuous mappings d, X >X, 
such that p,o q,— de and satisfying the fol- 
lowing three properties: (1) Xo is one point. (2) 
(X, Pn» Xn—-1) is a tfiber space induced from a 
standard contractible fiber space over an 
Eilenberg-MacLane space of type (z,(X), n+ 1) 
by a mapping corresponding to a cohomology 
class k"* eH" (X, ;n,(X)). (3) gan ni( X) 
n;(X,) gives an isomorphism for O i & n. 
These cohomology classes k"*! are called 
Eilenberg-Postnikov invariants (or simply k- 
invariants). Corresponding to the above facts, 
the minimal complex M(X) can be obtained 
as the inverse limit of a certain inverse sys- 
tem (K (n), p(n)) consisting of Kan com- 

plexes K (n) and s.s. mappings p(n): K(n)> 
K(n—1) defined by K(0) 2 K(0,0) and K(n)— 
K(x, n) x »«iK(n—1)for n2 1 with natural 
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projections p(n), where z, —z,(X) and k"t! e 
H?*' (K (n — 1); z,). This system is determined 
uniquely up to s.s. homotopy equivalence by 
its limit, called the Postnikov complex and 
denoted by K(z,, Kin... k"*,m,, ...). As 
yet we are ignorant of an effective method of 
computing the cohomology of a Postnikov 
complex from z, and k"*!. 
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A. Measures for Complexity of Computation 


Intuitively, complexity of computation means 
the amount of computing efforts measured on 
some suitable scale. When a problem can be 
solved by means of any one of several algo- 
rithms, it is highly desirable to compare the 
complexities of those algorithms. For example, 
the power x!? can be evaluated with a pocket 
calculator by using either of the algorithms 


xlÜ-xxxxxxxxxxxxxxxxxxx, (A) 


H-—XXX, V=UXU, W-—UXU 


xl9— ux w. (B) 


The complexities of these algorithms can be 
measured by various quantities: (T 1) time in 
seconds, (T2) the number of times keys and 
buttons are touched, (T3) the number of basic 
operations (here, multiplications), and (S) the 
number of values to be stored in the calcu- 
lator. The quantities (T 1), (T2), and (T3) are 
called time complexities, and (S) is called the 
space complexity. Obviously, algorithm (B) is 
preferable with respect to time complexity, 
although (A) is better with respect to space 
complexity. (If the calculator has only one 
memory register, then only (A) can be executed 
without recording numbers by hand.) For 
many calculators, the following algorithm 

is best with respect to both time and space 
complexities: 


x19 2 ((x?)? x x}. (C) 


In the general theory of complexity, the num- 
ber of basic operations (13) is often taken as 
the basis of time complexity, since it represents 
the intrinsic complexity of the algorithm, 
rather than of extrinsic factors such as human 
skill or mechanical performance. 

For the evaluation of a power x" in general, 
the complexity depends on the value of n: such 
a parameter dominating the complexity is 
called the size of the problem. Let Ty(n) be 
the number of multiplications required for 
evaluating x" by an algorithm X. Let 7 be the 
set of all algorithms for evaluating x". The 
time complexity T(n) for evaluation of the 
power is then defined by 


T(n)= Min, ¢ T,(n). 
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This is satisfied when [6] 
logan < T(n) <2log,n 
and 


im O 


=; 
n>% log, n 





In a problem concerning a tgraph, the num- 
ber m of nodes in the graph, the number n of 
its edges, or their sum m+n are chosen as 
the size of the graph. However, the complex- 
ity is not determined by such a size, because 
there are many different instances (particular 
graphs) of the same size. So the complexity of 
this problem is defined in the following man- 
ner. Let T(I) be the complexity of solving an 
instance I of the problem and L(n) be the set 
of all possible instances of size n. Then 


W(n) = Max: ui T(I) 
and 
A(n) - p) T), 


where p(I) denotes the relative frequency of 
the instance J. The value W(n) is called the 
worst-case complexity, and A(n) is called the 
average complexity of the problem. 

In some cases the complexity of a circuit is 
measured by the number of its building blocks 
[1], and the complexity of a program is mea- 
sured by its length [2, 3]. 


B. Complexity of a Decision Problem 


A decision problem and its complexity are 
rigorously formulated in terms of f Turing 
machines. Let X be a finite set of symbols and 
A" be the whole set of nonempty strings of 
symbols in X. Every instance of the problem is 
assumed to be represented by a string in X^. A 
decision problem is a triple (2, L, P) of the set £ 
of symbols, a subset L of X*, and a mapping P 
from L to {0 (false), 1 (true)}. The set L repre- 
sents the set of all possible instances of the 
problem. The problem is said to be solvable if 
and only if there exists a Turing machine M 
satisfying the following condition for every 
string « in the set L: When a tape containing 
the string a is given to the machine M whose 
head is initially put on the leftmost symbol of 
a, the machine writes the value of P(a) on the 
tape and stops after a finite number of steps. 
Such a Turing machine is said to compute the 
function P. The number 7,,(«) of steps is the 
time complexity of solving the instance « by 
the machine M, and the length S,(x) of the 
used area of the tape is the space complexity. 
The worst-case complexity T,,(n) is defined by 


Ty(n) = Max,., vc Ty(a). 


71€ 
Complexity of Computations 


The space complexity S,,(n) for the size n is 
defined in a similar way. The complexities 
T(n) and S(n) of the problem are not defined, 
because improvement by a constant factor is 
always possible for any Turing machine [5]. 

A problem is said to be solved in linear or 
polynomial time if its time complexity T,,(n) is 
bounded by a linear or polynomial function of 
the size n. 


C. Objectives of Complexity Theory 


The objectives of complexity theory are as 
follows: 

(i) Analyze an algorithm X for a problem P 
and evaluate its complexity. When exact evalu- 
ation is hard, the order of magnitude O(T,(n)) 
is investigated. 

(ii) Construct better algorithms for the prob- 
lem P. This gives a better upper bound to the 
complexity T(n) of the problem. 

(iii) Make clear the limitation of improving 
algorithms for the problem P. This is done by 
establishing a lower bound to the complexity 
T(n) of the problem. 

A common approach to constructing better 
algorithms is to partition the problem into 
smaller parts, apply algorithms to the parts, 
and then combine the solutions for the parts 
into a solution for the whole (divide and con- 
quer, [5]). This approach often yields an effi- 
cient algorithm for the problem, especially 
when the partitioning can be repeated recur- 
sively. For demonstrating the limitations of 
algorithms of a certain class, diagonalization 
[5], determination of information-theoretic 
lower bounds [6], or the oracle method [6] 
are often utilized. 


D. Elementary Results on Time Complexity 


(1) Number of arithmetic operations. A poly- 
nomial of degree n with one variable can be 
evaluated in about 35/2 arithmetic operations 
if preconditioning on coefficients is allowed. 
Otherwise, 2n — 1 operations are necessary and 
sufficient for evaluating a polynomial of degree 
n. For computing the product of two square 
matrices of degree n, O(n?) operations are 
necessary and O(n?-5) operations are sufficient. 
The inverse matrix and the value of the deter- 
minant of a square matrix of degree n are 
computed with the same order of operations 
as the product. Discrete Fourier transforma- 
tion of n points is executed in O(nlogn) arith- 
metic operations (fast Fourier transformation). 
(2) Number of comparisons and data trans- 
fers. Rearrangement of n items in increasing 
order is realized in O(nlogn) comparisons and 
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data transfers. Selection of the kth largest item 
among n items is done in O(n) operations. The 
position of an item in a list containing n items 
is found in O(n) comparisons by linear search, 
in O(logn) comparisons by binary chopping, 
and in O(1) comparisons on average by the 
thashing method. 

Many other results can be found in [5]-[7]. 


E. NP-Completeness 


A problem is easily solved if its complexity is 
O(n) or less. The problem is very hard if its 
complexity is O(2"), unless the size n is small. 
Between these types of problems there is a 
class of problems, each of which can be solved 
in polynomial time by a Turing machine. 

A tnondeterministic Turing machine is said 
to solve a decision problem (X, L, P) if it can 
detect the case when P(x) — 1 by exercising 
good choices; more precisely, if it can stop 
after a finite number of steps and write the 
value 1 of P(a) on the tape, starting from an 
initial state with the head on the leftmost sym- 
bol of a string « on the tape, provided that 
P(a) — 1. If P(x) 0, then the machine may 
never stop. For a string a such that P(a)— 1, 
NT,(x) and NS,,(«) represent the minimum 
number of steps and the minimum length of 
tape used in computing P(«). Nondeterminis- 
tic complexities NT, (n) and NS, (n) for the 
size n are defined in a similar way as before. A 
problem is said to be solvable in NP-time or 
NP-space if it can be solved by a nondeter- 
ministic Turing machine M whose time com- 
plexity N T, (n) or space complexity NS,,(n) is 
bounded by a polynomial function of size 
n. The class of all problems solvable in NP- 
time is denoted by NP, while that of the prob- 
lems solvable in polynomial time by ordinary 
(deterministic) Turing machines is denoted by 
P. Obviously, the class NP contains the class 
P. Nevertheless, whether NP — P or not re- 
mains one of the biggest unsolved problems. 

A decision problem (2, L, P) is polynomially 
transformable to a problem (Z, L’, P’) if there 
exists a mapping h from E* to X'* satis- 
fying the following conditions: (1) h(L) c L'; (2) 
for every string « in L, P(a)= P’(h(a)); (3) the 
mapping h is computed by a Turing machine 
in polynomial time. If a problem C is poly- 
nomially transformable to another problem C' 
in P or NP, then the problem C is also in P 
or NP. A problem C is said to be NP-hard if 
every problem in NP is polynomially transfor- 
mable to C. It is called NP-complete if it is in 
NP and NP-hard. Many problems that have 
been known empirically to be very hard have 
recently been shown to be NP-complete [8]. 
For instance, integer linear programming is 
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NP-complete. Linear programming has re- 
cently been shown to be in P. The satisfiability 
problem of a conjunctive normal form was 
proven to be NP-complete in 1971 [9]. 

The complement of a decision problem 
(2, L, P) is the problem (2, L, P"), where P'(x) — 
1 — P(a). If a problem C is in P, then its com- 
plement C' is also in P. When C is in NP, it is 
not guaranteed that C' is in NP. The class 
of all problems whose complements are in NP 
is denoted by co-NP. The intersection of NP 
and co-NP contains the class P, but it is an 
open problem whether this containment is 
proper or not. The problem PRIME, which 
asks whether a number N is prime or not, the 
size n being the number of digits for represent- 
ing the number N, belongs to both NP and co- 
NP [10]. However, it is not known whether it 
is in P. 


F. Other Topics 


The notion of completeness is introduced in 
many other classes of problems solvable in 
linear space, polynomial space, exponential 
time, etc. Comparisons among these classes 
have been done, although many problems still 
remain open. The fundamental conjecture in 
this field is the nonequality NP z P, the prov- 
ing of which seems to be extremely difficult. 
This conjecture can be relativized in ways such 
that it can be either true or false, so that it is 
not provable in some formal system. 

Some tricks may be used for hard problems. 
For decision problems, some algorithms have 
been proposed for guessing the correct answer 
with high probability [11]. Some algorithms 
give nearly optimal solutions efficiently, in- 
stead of constructing optimal ones over too 
long a time [7]. 

A general theory of computational com- 
plexity would include investigations on rela- 
tions among various complexity measures, 
the complexity hierarchy, and an axiomatic 
approach independent of any machine models 


[12]. 
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A. Definitions 


A Hausdorff topological space X is called a 
complex manifold (or complex analytic mani- 
fold) of complex dimension n if there are given 
an open covering {U;},., and a family Join of 
homeomorphisms of U; onto open sets in the 
n-dimensional complex affine space C" such 
that in case UU, # Ø, the mapping 9,09; ': 
e(U,N Uj - 9;(U; U;) is biholomorphic (i.e., 
9,0 9j ! and its inverse are both tholomorphic 
functions when expressed in terms of coordi- 
nate functions in C"). We call X the underlying 
topological space of this complex manifold, 
and we say that an open covering {U;};er and 
a family Lol define a complex analytic 
structure (or simply complex structure) on X. 
A complex-valued function f defined on an 
open set U in X is called a holomorphic func- 
tion on U if for any i the function fo 9j! on 
g(U N Uj is holomorphic. When we express 
the mapping 9; as q;(p) — (z! (p). ....2"(p)) on 
U, in terms of the coordinates in C”, each z* 
is a holomorphic function on U;. We call 
(z',...,2") a holomorphic local coordinate sys- 
tem on U;. Given two complex manifolds 
Y, X, a mapping o: YX is said to be holo- 
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morphic if for any open set U in X and any 
holomorphic function f on U, fog is holo- 
morphic on o !(U)c Y. When a mapping o: 
Y X is bijective and both o and o are 
holomorphic, we say Y and X are isomorphic 
by ọ as complex manifolds. 

As in the case of tdifferentiable manifolds of 
class C”, we can define concepts such as com- 
plex analytic submanifolds, holomorphic tan- 
gent vectors, holomorphic vector fields, and 
holomorphic differential forms of degree k (or 
simply holomorphic k-forms). Meromorphic 
functions on complex manifolds can also be 
defined as in the theory of analytic functions of 
several complex variables (— 23 Analytic 
Spaces D). 

Let X be a complex manifold and p a point 
of X. Take a holomorphic local coordinate 
system (z!, ..., z”) with center p De, z*(p)=0 
for all x). A holomorphic function defined on 
a neighborhood of p can be expressed as a 
holomorphic function in (z!, ..., z"), hence as a 
power series in (z+, ..., z”) absolutely conver- 
gent in a neighborhood of p. If we denote by 
© =O, the tsheaf of germs of holomorphic 
functions on X, the fstalk ©, of © at p is iso- 
morphic to the *local ring of convergent power 
series in n variables z!, ...,z". At a point p, 
(0/0z!),, ...,(0/0"), form a basis of the holo- 
morphic tangent vector space at p. A holomor- 
phic k-form o defined on a neighborhood of 
p can be expressed as o — ZA, «fi, i, dZ" ^ 
dE, where f; ...; is a holomorphic func- 


i 


tion for each (i,, ..., Jl. 


B. Almost Complex Structures 


Let X be a complex manifold, and let 

{Ui Pitie be its complex analytic structure, i.e., 
a covering of X by holomorphic local coordi- 
nate systems with o — (z], ..., z7). Express zf in 
the form z? 2 x? -- J/ —1 y?, where x7 and y; 
are the real and imaginary parts of z}, respec- 
tively. Then x? and y? real-valued functions 
on the open set U; of X, and the mapping 

V;: U RI" defined by du pl — Ga (p), y; (p). -~> 
x?(p), y?(p)) is a homeomorphism of U; onto 
an open set of R?". This (U;, rh defines on 
X a ‘differentiable structure of class C® (in 
fact, a treal analytic structure). Thus a complex 
manifold of complex dimension n admits 
canonically a C®-structure of real dimension 
2n. For every point p of X there is a real co- 
ordinate system on a neighborhood of p, such 
as (x1, y!, ..., x", y”), where (z', ... , Z") (z*— x^ 
+ am] y*, x — 1, ..., n) forms a holomorphic 
coordinate system in X. The real tangent 
vector space at a point p of X has ((0/0x!),, 
(0/0y!),, ..., (0/0x"),, (0/0y"),) as its basis. 
Define a linear endomorphism J, by (0/0x?), 
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—(0/0y*),, (0/0y*), —(O/0x*), (a= 1, ... 1); 
then J2 = —1, and J, does not depend on the 
choice of holomorphic coordinate system at p. 
Considering J, as a tensor of type (1, 1), we 
thus obtain a tensor field J of type (1, 1) of 
class C? on X, which is called the tensor field 
of almost complex structure induced by the 
complex structure of X. 

More generally, when a real differentiable 
manifold X is provided with a tensor field J of 
type (1, 1) of class C? such that J? = —1 (con- 
sidering J as a linear transformation of vector 
fields), we say that X admits an almost com- 
plex structure or that X is an almost complex 
manifold. In this case, for contravariant vector 
fields x and y on X, we define a tensor field S 
of type (1,2) by S(x, y) = — Do y] + [J (x), J(y)] 
—J([J69, y — J(Dx, J(y)]). S is called a Nijen- 
huis tensor. An almost complex structure J 
is induced by a complex analytic structure if 
and only if its Nijenhuis tensor S vanishes 
identically [49]. A differentiable manifold X 
of dimension 2n admits an almost complex 
structure if and only if the structure group 
GL(2n, R) of the bundle of tangent 2n-frames 
of X can be treduced to GL (n, C). Almost 
complex manifolds are torientable. 


C. Types of Differential Forms 


Let X be a complex manifold, and let (z!, 
...,2") be a holomorphic local coordinate 
system in a neighborhood of a point p with z* 
=xt4 ./-1 y* (x—1, ...,n). On the complexi- 
fied real tangent vector space 7,(X) &) € at p, 
we define 0/dz%, 0/0z^ by 


(0/027), — (1/2) (0/0x*), — / —1 (0/0y*),j. 
(0/027), — (1/2) ((0/0x^), + / —1 (0/0y*),j. 


It is easy to see that the operation 0/0z^ on 
holomorphic functions coincides with that of 
the holomorphic tangent vector 0/0z? defined 
in Section A. A function f is holomorphic at 
pif and only if (0/02?) f 20 (a= 1,2, ...,n). 
T,(X) @ C is the direct sum of the subspace 
spanned by {0/dz',...,6/0z"} and the subspace 
spanned by (0/0z!, ..., 0/02"). Moreover, this 
decomposition is independent of the choice of 
holomorphic coordinate system. Elements of 
the two subspaces are respectively called tan- 
gent vectors of type (1,0) and tangent vectors of 
type (0, 1). Similarly, the complexified space of 
real differentiable 1-forms can be decomposed 
into the direct sum of two subspaces spanned 
by (dz, ..., dz") and {dz',...,dz"}, where dz“ 
—dx* 4- / —1 dy* and dz* - dx* — / —1 dy”. 
We say that the elements of the former sub- 
space are of type (1,0) and those of the latter 
are of type (0, 1). Thus the space of differential 
forms of arbitrary degree can be written as 
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the direct sum of subspaces of type (r, s). Here 
the subspace of differential forms of type (r, s) 
has as a basis (dz^' ^ ... A dz% A dz*1 ^ ... A dE 
(1€a, «...«a, n, 1 « f, <... <b n). This 
decomposition is independent of the choice of 
a local coordinate system, hence the concept of 
type can be defined globally on X. 


D. 6-Cohomology 


In the rest of this article we consider only 
complex differential forms. For every differen- 
tial form o of type (r,s) on X, its fexterior 
derivative dc) decomposes into a sum of dif- 
ferential forms of types (r+ 1, s) and (r,s+ 1), 
which we denote by 2c (or d'œ) and 8c (or 
d'w), respectively. We have d — + ð, (d? =0, 
(0 =0, and 00 + 00 =Q. In terms of a local 
coordinate system, we write 


Q 
boe Y det A de" A.. 


Y 


^ dz* ^ dzP1 0... ^ dz*s, 


SS Q 
fo =( IS dz" o... 


y 
^ dz* ^ dz! ^ dzP1 a... A dzhs, 


for w= fdz™ ^... A dz% a dZ’! a... AdzPs A 
differential k-form « is holomorphic if and 
only if œ is of type (k, 0) and óc — 0. 

For the operator ô Dolbeault's lemma holds: 
Let c be a differential form on a neighborhood 
U of a point p. If dw =0, there is a neighbor- 
hood V of p contained in U and a differen- 
tial form 0 on V such that o- 00 on V. 

Let A? and Op be the tsheaf of germs of 
differential forms of type (r, s) and the sheaf of 
germs of holomorphic p-forms on X, respec- 
tively, and let I'(X, A“) be the set of tsections 
of A^? on X. T (X, A^?) is the set of differen- 
tial forms of type (r,s) on X, and X, (X, A(^?) 
forms a tcochain complex with respect to 6. 
This complex is called the 6-complex or the 
Dolbeault complex, and its *Àcohomology 
groups are called the 6-cohomology groups or 
the Dolbeault cohomology groups. The qth 
cohomology group is denoted by H*'*(A, 6). It 
follows easily from Dolbeault’s lemma that 0— 
Q?— At 05, APD.. is an texact sequence of 
sheaves. From this we get Dolbeault's theorem: 


H(X, QP) = H?*(A, ô), 
where the left-hand side is the cohomology 
group with coefficient sheaf OP. 

More generally, for any tcomplex analytic 
(holomorphic) vector bundle E on X, we can 
define the 6 cohomology groups of the dif- 
ferential forms on X with values in E, and they 
can be shown to be isomorphic to the coho- 
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mology groups with coefficient sheaves of 
germs of holomorphic forms with values in E 
(^ 194 Harmonic Integrals E). 


E. Analytic Coherent Sheaves 


The structure of a complex manifold X is 
determined by the tsheaf © of germs of holo- 
morphic functions on X, and @ is a tcoherent 
sheaf (of rings) (Oka's theorem). Sheaves of 0- 
modules are called analytic sheaves, and coher- 
ent sheaves of @-modules are called coherent 
analytic sheaves. Many properties of X can be 
expressed in terms of coherent analytic sheaves 
and their cohomology groups; some exam- 
ples appear later in this article (also — 366 
Riemann-Roch Theorems B). 

It is important to know whether an analytic 
sheaf on a complex manifold is coherent. To 
this question, not only does Oka's theorem 
apply but so does Cartan's theorem: The tsheaf 
of ideals defined by an analytic subset of a 
complex manifold is coherent. (We say that a 
subset Y of X is an analytic subset if it is a 
closed subset and each point of Y has a neigh- 
borhood U such that UN Y is the set of com- 
mon zeros of a finite number of holomorphic 
functions on U .) Also relevant is Grauert's 
theorem: If z: X ^ Y is a tproper holomorphic 
mapping of complex manifolds (De. the inverse 
image of any compact subset of Y for a holo- 
morphic mapping z is also compact), then for 
any coherent analytic sheaf F on X its *direct 
images Rz, (F) (q —0, 1,2, ...) are also coher- 
ent [16]. (In fact, this theorem holds for ana- 
lytic spaces; — 23 Analytic Spaces.) 

For an analytic coherent sheaf F on a tStein 
manifold X, we have the following fundamen- 
tal theorems of the Stein manifold. Theorem 
A: H°(X, F) generates the stalk F, (as an O,- | 
module) at every point x of X. Theorem B: 
H*(X, F)=0 for all g>0. Conversely, a Stein 
manifold X is characterized by the following 
property: For any coherent analytic sheaf F of 
ideals of €, H'(X, F)=0. If a complex manifold 
X 1s compact and F is a coherent analytic 
sheaf on X, then H*(X, F) is a complex vector 
space of finite dimension. If X is an open sub- 
manifold of another complex manifold and its 
closure is compact, then H*(X, F) is finite- 
dimensional for some q that depends on vari- 
ous properties (convexity or concavity) of 
the boundary of X [3]. 

Let E be a tcomplex analytic (holomor- 
phic) vector bundle on a complex manifold 
X of dimension n, and let E* be the tdual 
vector bundle of E. Then H*(X, 0?(E)) and 
HY *(X,0" °(E*)) (where H, denotes the 
cohomology group with compact support) are 
dual as topological vector spaces under suit- 
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able conditions. The duality is given by the 
integration on X of the exterior products of 
the differential forms representing the re- 
spective elements of the cohomology groups. 
The duality holds, for example, when dim 
H*(X, (*(E)) « oo. If X is compact, we need 
not distinguish H, from H (Serre's duality 
theorem) [50]. 


F. Compact Complex Manifolds 


On a compact and connected complex mani- 
fold X, there are no holomorphic functions 
except constants (by the *naximum principle 
of holomorphic functions). The field K(X) of 
meromorphic functions on X is finitely gen- 
erated over the complex number field, and its 
ttranscendence degree d does not exceed the 
complex dimension n of X. d is said to be the 
algebraic dimension of X and is denoted by 
a(X). For elements of K(X), functional inde- 
pendence and algebraic independence are 
equivalent [52]. When n=1, X is a compact 
tRiemann surface and the classical theory of 
algebraic functions shows that K(X) is an 
talgebraic function field of one variable and X 
is a *projective algebraic variety. When n — 2, 
a(X)-— 2, 1, and 0 can all occur. If a(X)— 2, 
X is a projective *algebraic surface (Chow- 
Kodaira theorem). If a(X)— 1, there exist an 
talgebraic curve A and a surjective holo- 
morphic mapping o: X >A such that K(A) is 
isomorphic to K(X) under ot, and o! (x) is 
an telliptic curve for all but a finite number of 
x€ A. K. Kodaira investigated the structure of 
compact complex surfaces in detail [34, IH]. 
On a compact complex manifold X, the free 
Abelian group generated by the set of irre- 
ducible analytic subsets of codimension 1 is 
called the divisor group of X, and an element of 
it is called a divisor of X. For an analytic sub- 
set Y of codimension 1, the sheaf of ideals (Y) 
defined by Y is a sheaf of locally principal 
ideals of 0. For a divisor D =} a, Y,, the sheaf 
of locally principal fractional ideals 3(D)= 
IL SY, is called the sheaf of ideals of D. 
The set of nonzero coherent sheaves of locally 
principal fractional ideals corresponds bijec- 
tively to the set of divisors. An element f 40 of 
K(X) generates a sheaf of principal fractional 
ideals and therefore defines a divisor, which 
is denoted by ( f). The divisor group has 
an ordering defined by D = È a, Y,> 0 if and 
only if all a, 20, under which it becomes an 
tordered group. For a divisor D, let 


L(D) - (fe K (X)|f#0, (£) + D>0} U (0). 


Then L(D) is a C-module of finite dimension. 
This submodule of K(X) is easy to handle and 
exhibits various analytic properties of D. The 
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*Riemann-Roch theorem is used to calculate 
the dimension of L(D) in terms of other factors 
(— 366 Riemann-Roch Theorems C). Here is 
an example of how L(D) exhibits a property of 
D: We call two divisors D and D' linearly 
equivalent if there is a 0# Fe K(X) such that 
(iz D — D'. This is an equivalence relation 
finer than thomological equivalence. If D and 
D' are linearly equivalent, then L(D) and L(D’) 
are isomorphic by the mapping L(D)s f 2 
fF&€L(D^); therefore dim L(D) — dim L(D’). 

(The latter equation does not follow from the 
homological equivalence.) A holomorphic 
vector bundle with fiber C and structure group 
C* is called a complex line bundle. For the 
sheaf of ideals %(D) of a divisor D, we can take 
a suitable open covering {U;} of X such that 
for each U, there is an R;e F (U;, S(D)) which 
generates (D), for any xeU;. Also gy — R;/R, 
is a holomorphic function nowhere vanishing 
on U;N U,. With {gy as the system of *coordi- 
nate transformations, we define the complex 
line bundle determined by D and denote it by 
[D]. It is easy to see that [D] is independent 
of the choice of {U,} or {R,}. Moreover, [D] 

is determined only by the linear equivalence 
class of D. If we denote by @([D]) the sheaf of 
germs of holomorphic sections of [D], the 
mapping H?(X,6([D])239 7109) 5f 7 

9;/ Rj— o, / R,e L(D) is an isomorphism of these 
modules. On an algebraic variety in a projec- 
tive space, any complex line bundle comes 
from a divisor De, it can be expressed in the 
form [D] for some divisor D) [35, 51], but this 
is not necessarily true on general compact 
complex manifolds. However, the importance 
of complex line bundles in the theory of com- 
plex manifolds lies in the relation L(D) x 
H°(X, 6([D])), which replaces “things with 
poles" with *things holomorphic." 

Any analytic subvariety X of the projec- 
tive space P" is an algebraic variety (Chow's 
theorem) [51]. Suppose that X is an analytic 
submanifold of P^, and let Y be a general 
*hyperplane section of X. Then Y is a divisor 
on X, and the tChern class of the complex line 
bundle [ Y ] corresponds to the canonical 
Hodge metric on X (— 232 Kahler Manifolds 
D). When [Y] is represented by the system of 
coordinate transformations {g,} with respect 
to an open covering {U;}, we can associate 
with a coherent analytic sheaf F sheaves F(n) 
(n —0, +1, +2,...) as follows. Denoting by F, 
the restriction of F to U;, we glue F; and F, 
together on U,N U, with the relation f; f, > f; 
= gr; f, (where f;e F;c F, f, € F, c F) and obtain 
a sheaf (denoted by F(n)) that is locally iso- 
morphic to F. The following theorems for F(n) 
hold. For each coherent analytic sheaf F there 
exists an integer n such that for any n2 n, the 
following fundamental theorems A, B of projec- 
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tive algebraic varieties hold [17]. Theorem A: 
I' (X, F(n)) generates F, (as an @,-module) for 
every xe X. Theorem B: H*(X, F(n)) — 0 for all 
q>0. This means that if we permit “poles” on 
Y of sufficiently high order, then F has suffi- 
ciently many sections and the higher coho- 
mology groups vanish. 

On a (nonsingular) algebraic variety X in 
P" we have the sheaf © of germs of holomor- 
phic functions (the tstructure sheaf as a com- 
plex manifold) and the sheaf €,,, of germs of 
holomorphic rational functions (the structure 
sheaf as an algebraic variety). Therefore we 
have two kinds of coherent sheaves, coher- 
ent analytic sheaves and coherent algebraic 
sheaves. In fact, the cohomology theories de- 
rived from them are isomorphic. More pre- 
cisely, for any coherent algebraic sheaf F, F 
=F Gy, © is a coherent analytic sheaf. The 
correspondence F  F gives an equivalence 
between the tcategory of coherent algebraic 
sheaves and that of coherent analytic sheaves, 
thus giving an isomorphism of their cohomol- 
ogy groups. In other words, as far as the prop- 
erties that can be expressed by cohomology- 
theoretic terms of coherent sheaves are con- 
cerned, there is no difference between the 
analytic and algebraic theories of projective 
algebraic varieties [51]. 


G. Deformations of Complex Structures 


The deformation theory of complex structures 
was initiated by Kodaira and Spencer [36] in 
order to explain various phenomena in the 
theory of (compact) complex manifolds. A 
triple (X, 7, S) is called a family of compact 
complex manifolds if X and S are connected 
tanalytic spaces and z is a tproper holomor- 
phic mapping of X onto S such that (i) it is 
smooth, i.e., is locally identified with the pro- 
jection S’ x US’, where AS and U are open in 
S and C”, respectively, and (ii) every fiber V, = 
x ^!(s) of x is connected. V, is then a compact 
complex manifold. We sometimes write {V,} 
instead of (X, z, S). S is called the parameter 
space of the family. Take a point oeS and fix 
it. We say that V, is a deformation of V, for any 
seS. There are a neighborhood S of o in S and 
a diffeomorphism z !(S')S' x V, (Kuranishi 
[39]). Thus every V, seS', determines a tcom- 
plex structure on a fixed differentiable mani- 
fold. We say that the complex structure of V,, 
seh, is a deformation of that of V}. 

Now (X, z, S) - (V; ),.s is said to be complete 
at o if for any family (Y, on, T)={W},.7 with a 
point o'e T and a holomorphic isomorphism 
1: W; > V, there is a neighborhood T’ of o' in 
T and holomorphic mappings f: T' ^ S and 
h:u ! (T) X such that (i) fu- xh, (ii) f(o') - o, 


seS 
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and (iii) h=: on W,.. In this case, if T’ is suffi- 
ciently small, then h induces a holomorphic 
isomorphism h,: W, Vj, for any te T’. Hence 
{V} ses contains all small deformations of Vo. 
On the other hand, {V,},.5 is said to be effec- 
tively parametrized at o if the Kodaira-Spencer 
mapping (Kodaira-Spencer map) p,: T,S— 
H'(V,,@) is injective, where T,S is the Zar- 
iski tangent space to S at o and @ is the sheaf 
of germs of holomorphic vector fields on Vy. 
Here, the linear mapping p, is defined by 

p, (0/0s) = Lg (Gel, al, Where 0/0se T, S and 
gix are the coordinate transformations (z;, s) = 
(gi (zi. 5), $), using the smoothness of z. po(6/0s) 
is called the infinitesimal deformation to the 
direction 0/0s. 

Kuranishi's fundamental theorem [39, 40] 
states: For any compact complex manifold V, 
there exists a family {V,},-5 with a point oe S 
such that (i) it is complete at every point of S, 
(it) is effectively parametrized at o, and (iii) 
Vi=V. 

The parameter space A in the theorem is 
called the Kuranishi space or the local moduli 
space of V. It is given as the zeros of a holo- 
morphic mapping f: U  H?(V, ©) with f(0) —0, 
where U is a neighborhood of 0 in H! (V, ©). 
Hence (1) if H?(V, ©) 20, then S = U is nonsin- 
gular (Kodaira, Nirenberg, and Spencer [37]; 
(2) if H! (V, 9) «0, then S = {0}, i.e., one point 
(e.g., V= P"(C), the complex projective space). 

Kuranishr's theorem was generalized to a 
compact analytic space V by Grauert [18] and 
A. Douady [8]. 

There are Kuranishi-type theorems in the 
deformation theory of other objects: (1) com- 
pact analytic subvarieties of an analytic space 
(Douady [9]; — 23 Analytic Spaces G), (2) 
holomorphic mappings (Y. Miyajima [41]), 
(3) germs of analytic spaces with tisolated 
singularities [7], etc. 

So far, only the local theory of deformations 
has been developed. The global theory is not 
yet in a satisfactory state. Its final purpose is 
to construct moduli spaces and to understand 
them. For a compact differentiable manifold V, 
we denote by M(V) the set of all isomorphism 
classes of complex structures on V. It is diffi- 
cult in general to determine the set M(V). As 
yet unsolved problems are: (1) Is M(S°) non- 
empty? (2) Does M(P"(C)) (n> 3) consist of one 
point? (M (P?(C)) is known to consist of one 
point (S. T. Yau; — 232 Kahler Manifolds C). 

If M(V) has a reasonable structure (e.g., an 
analytic space structure) and a universal prop- 
erty, then we call it the moduli space (— 11 
Algebraic Functions F, 16 Algebraic Varieties 
W). Kodaira [34, IIT] constructed the moduli 
space of Hopf surfaces. Only a few examples of 
moduli spaces are known. It is to be noted 
that moduli spaces cannot in general exist. 
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One of the reasons for this is that there may 
exist jumpings of structures (Kodaira and 
Spencer [36]). 

As for talgebraic manifolds (varieties), there 
are two known methods for the moduli prob- 
lem: (1) Griffiths’ period mapping (— 16 Alge- 
braic Varieties V) and (2) Mumford's geomet- 
ric invariant theory (— 16 Algebraic Varieties 
V, W). 


H. Monoidal Transformations 


Let Y be an analytic subspace of a complex 
manifold X, which is defined by a nonzero 
coherent sheaf of ideals 3. Then X has an 
open covering such that for each member U of 
the covering, there are elements $9, ..., @n€ 
T(U, 3) that generate the stalks 3, at all xe U. 
Let W' be the graph of the holomorphic map- 
ping xe»(qi(x):95(x):...:9,(x)) from U-Un 
Y to P"^!, and denote by W the closure of Wi 
in U x P"^!, Then W is an analytic space, pos- 
sibly with singularities, which is independent of 
the choice of the generators (jj and is deter- 
mined uniquely by U and 3. Therefore all the 
W's can be glued together to form an analytic 
space X and to determine a holomorphic map- 
ping p: X X. We call p the monoidal transfor- 
mation (or blowing-up) of X with center 3 or 
with center Y. When Y is a point, p is also 
called a locally quadratic transformation or a 
o-process. When Y is an analytic submanifold, 
X also is a manifold, Y 2 p ! (Y) is a nonsin- 
gular divisor on X, and p ' (y) z p* ! for all 
y€ Y, where k is the codimension of Y. More- 
over, the line bundle [Y] restricted to each 
fiber p ' (y) is isomorphic to the Hopf bundle 
L on P*^!, i.e., the line bundle associated to 
the natural C* bundle Ch PR). Conversely, 
suppose that we are given VS X and a holo- 
morphic mapping po: Y Y with the property 
as above so that, in particular, py (y) z P*^! 
and [F] = L on pg ! (y) for all ye Y. Then there 
exists a complex manifold containing Y asa 
submanifold such that X is obtained by the 
monoidal transformation of X with center Y 
as above (S. Nakano [47 ]). Let f: X ^Y bea 
proper modification of complex manifolds (or 
analytic spaces). Then there exists a proper 
holomorphic mapping h: Y'— Y such that 

f ‘oh: Y’—X is holomorphic, where over 
any relatively compact subdomain of Y, h is 
obtained by a finite succession of monoidal 
transformations with nonsingular centers 
(Hironaka). The result is called Chow’s lemma. 


I. Fiber Spaces 


A triple of compact complex manifolds V, W, 
and a surjective holomorphic mapping f: 
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VW is called a fiber space if general fibers 
f (w)are irreducible. In addition, if both V 
and W are algebraic, it is called an algebraic 
fiber space. 

For a compact complex manifold V, letting 
K, denote the canonical line bundle of V, 
define a subfield & of K(V) to be Joie: 
where w,,@,€ H9(V,Q) for some m» 0). The 
transcendental degree of & over C is denoted 
by «(V), which we call the Kodaira dimension 
of V. 

If & 20, then define k«(V)— —oo. If «(V) = 
dim V, then V is said to be of general type, 
and there exist a projective manifold X and a 
bimeromorphic holomorphic mapping u: X > 
V. If «(V) 20, there exists a fiber space f:V* 
W such that (1) V* is bimeromorphically 
equivalent to V, (2) dim W = k(V), and (3) some 
general fibers f! (w) satisfy vi f !(w)) 0. In 
general, for a fiber space f: VW, we have 
k(V)&x(f !(w))4- dim W, where f! (w) is a 
general fiber. Moreover, if it is algebraic, an 
inequality of the form x(V) z x(f !(w))-- k(W) 
is called the conjecture C,, n being dim V. 

C, has been verified by K. Ueno, E. Viehweg, 
T. Fujita, and Y. Kawamata [13, 14, 32, 33, 54— 
57] in the following cases: (1) n x 3, (2) when 
general fibers are curves, (3) dim w = 1, x(w) — 1, 
(4) «(V) Z0, and W is of general types, etc. By 
using the results of case (4), Kawamata proves 
that an algebraic compact complex manifold V 
is birationally equivalent to an Abelian variety 
if and only if k(V)=0 and the irregularity of V 
equals the dimension of V [32]. 


J. Analytic Surfaces 


In what follows, an analytic surface means a 2- 
dimensional compact complex manifold. For 
an analytic surface S, an exceptional curve on 
S and a (relatively) minimal model, etc., are 
defined with respect to bimeromorphic map- 
pings in analogy with the corresponding con- 
cepts for an algebraic surface (— 15 Algebraic 
Surfaces). Let C c S be an irreducible curve on 
S. Then there exists a holomorphic mapping o 
from S onto another surface S’ such that o (C) 
is a point and such that o induces the isomor- 
phism $ — C 5$' — (C) if and only if C? = —1 
and C is a nonsingular rational curve (Grauert 
[17]). S has a minimal model if and only if S is 
not a ruled surface (Kodaira [34, III]). The 
irregularity q — h?'!, the geometric genus p,, i- 
genus P. etc., are also defined in the same way 
as in the case of algebraic surfaces. Note that, 
in general, h?:! #h'-°. The Riemann-Roch 
theorem and M. Noether's formula are valid 
also for an analytic surface (Atiyah and Singer; 
— 366 Riemann-Roch Theorems C). 
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K. Classification of Surfaces 


The classification of analytic surfaces by the 
aid of their numerical invariants was com- 
pleted by Kodaira and includes as a special 
case Enriques’s classification of algebraic 
surfaces [34, INI]. By an elliptic surface we 
mean a surface from which there exists a sur- 
jective holomorphic mapping ¢ onto an alge- 
braic curve A, such that for a general point p 
on A, o ! (p) is an irreducible nonsingular 
elliptic curve. If a(S) 2 1 or «(S) 2 1, S has a 
unique structure as an elliptic surface. The 
image by ¢ of the points on S at which ¢ is 
not of maximal rank is a finite subset {a,, 
...,@,} of A. Let t; be a local coordinate on 

A around a; with t;(aj) 2 0. We call a singular 
fiber of ọ the divisor on S defined by {t;0@ 

— 0). The structure and the construction of 
singular fibers of elliptic surfaces have been 
completely determined by Kodaira. By an 
elliptic surface of general type we mean a 
surface with Kodaira dimension 1. If «(S)= 2, 
S is projective algebraic and is called a surface 
of general type. If a(S) —0, then there exists 
only a finite number of irreducible curves on S. 
By a Hopf surface we mean a surface whose 
universal covering is C? — (0,0). If an analytic 
surface S is homeomorphic to S! x $°, S isa 
Hopf surface. Let b,(S) be the vth Betti num- 
ber of S. If a(S)2 0, b,(S) - 1, b,(S) 20, and S 
contains a curve, then S is a Hopf surface 
(Kodaira). By a surface of class VII, we mean 
a minimal surface S with b,(S)=1. M. Inoue 
[26] constructed three families of surfaces of 
class VII, with b, =0 which contain no curves. 
These are Sy) (M eSL(, Z)) and SI, and 
SV ar (N eSLQ, Z), p, q, re Z, teC) which 
have H x C as their universal covering sur- 
faces, H being the complex upper half-plane. 
These have a line bundle L such that H°(Q' e 
((L))=0. This property characterizes these 
Inoue surfaces among VII, surfaces with b, = 
0 which contain no curves. A couple of new 
surfaces of class VII, with b, >0 were con- 
structed and studied by Inoue, Ma. Kato, 

T. Oda, I. Nakamura, and I. Enoki [10, 27—29, 
46]. Some of these have close connections 
with cusp singularities of Hilbert modular sur- 
faces, torus embeddings, and global spherical 
shells. Enoki's surface is denoted by S, , , 

(n2 0,0 « |a| « 1, te C") and has the following 
properties: (1) class VII, and b, =n, (2) there 
exists a connected curve D with D? =0, (3) 
$,,4,,— D is an affine bundle over an elliptic 
curve. Properties (1) and (2) characterize S, , , 
(Enoki). 

Let K, denote the canonical line bundle of 
an analytic surface S. An analytic surface S is 
said to be a K3 surface, if K; is trivial, i.e., if 
there exists a nonvanishing holomorphic 2- 
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form c and if q —0. K3 surfaces are simply 
connected and are deformations of a nonsin- 
gular quartic surface in P? (Kodaira). We shall 
define the period mapping of K3 surfaces. Let 
L be a free Abelian group of rank 22 with the 
pairing 4 > which is the direct sum of the two 
copies of — Eg and three copies of U = Ze, + 
Ze, with €e,,e,5 = ez, e2) =0, (e, e2) = 
<€5,€,>=1, where — Eg denotes the lattice with 
the pairing corresponding to the Dynkin 
diagram Eg with the opposite signs. H?(S, Z) 
with the intersection form is such a pair, con- 
sisting of S and ( >. A marked K3 surface is 
defined to be a pair (S, Y), where S is a K3 
surface and y: H?(S, Z) o L is an isomorphism 
preserving < >. The class [œw] of c is a base of 
H”? and satisfies Zock [o]? =0 and ol, [o] 
» 0. Let P(L,) denote the 21-dimensional 
projective space associated with Lc — L 697 C. 
Z —i(a)e P(Lc)| <a, a» «0, <a, a» >0} is an 
open set of the quadric Z,— ((a)e P(Lc)| <a, a> 
=0}. 2 is the set of all Hodge structures on 
Lc. The period of a marked K3 surface (S, V) is 
defined to be (Vc[o]). Every point of 2 is the 
period of some marked Kahler K3 surface. If 
the periods of two marked Káhler K3 surfaces 
(S, v) and (S’, Y’), coincide then S is isomorphic 
to f. 

An analytic surface with p, — q—0 and 
K$? —0 is called an Enriques surface, which 
has a K3 surface as its universal covering. An 
Enriques surface is an algebraic elliptic surface. 
An analytic surface with g=1 and K?!? «0 is 
said to be a hyperelliptic surface, which has an 
Abelian surface as an unramified covering. It is 
an algebraic surface and is an elliptic bundle 
over an elliptic curve. The classification of 
minimal surfaces is given in Table 1. The fol- 
lowing relations hold among these invariants: 
Let b* (b ) be the number of positive (nega- 
tive) eigenvalues (counted with multiplicities) 
of the intersection matrix on H?(S, R) and c; be 
the ith Chern class of S. Then 

(D b* —b^ —i(cj —2c;) (Hirzebruch signa- 
ture theorem); 

(2) if b, is even, q2 h^? 15, and b* = 
4p 1, 

(3) if b, is odd, g=h'° + 1=3(b, +1) and 
b* —2p,. 

Let c? and c, denote the Chern numbers of 
an analytic surface S, i.e., c? =(K§) and c, is 
the Euler number of S. Then 3c, 2 c7 [43], and 
if equality holds, then Ks is ample (Y. Miyaoka) 
(— 232 Kahler Manifolds C). If S is a surface 
of general type which is minimal, then c? > 
0, the bigenus P, 2 2 and the m-genus Pp is 
mim — 1)c1/24- 1 — q +p, for mz 2. For each 
mz 5, the mth canonical mapping of S is a bi- 
rational holomorphic mapping onto its image 
(Kodaira, E. Bombieri [5]). In general, c? > 2p, 
— 4, and if the canonical mapping, i.e., the 
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rational mapping associated with Ky, is bira- 
tional, then c} >3p,—7. For certain kinds of 
surfaces of general type, E. Horikawa suc- 
ceeded in determining completely the structure 
of surfaces obtained as deformations of a given 
surface. Among others, every minimal surface 
with p, —4, q —0, and c? —5 is a deformation of 
a nonsingular quintic surface [22, 23]. 


Table 1. 


Classification of Minimal Surfaces 










Structure 








algebraic surface 
of general type 


elliptic surface 
of general type 


HET TE femeror 


THEE elliptic surface 
3 


with a trivial 
ORRA hyperelliptic surface 


canonical bundle 
elliptic surface 


belonging to 
















K3 surface 


Enriques surface 











rational surface 


of genus q 
surface of class 
VII 


For a, b>0, let Z, be the set of isomor- 
phism classes of all minimal surfaces with c? =a 
and 1—q+p,=b. D. Gieseker [15] proved the 
existence of N and constructed the mapping 
h: X, P" so that the following hold: (1) h is 
injective; (2) if f: X 2 Y is a smooth holomor- 
phic mapping such that all ^! (y) represent 
some classes e X,,, then the set-theoretic map- 
ping f: Y>P induced from h is a morphism 
of schemes; (3) ^(ZZ,,) is a locally closed sub- 
variety of P". Hence, h(Z,,) is a moduli variety 
of surfaces of general type with c? =a and 1— 
qt p, b. 

For each m with 1 x m«9, there exists a 
minimal surface of general type with p, —q —0 
and c? =m. 

If there exist a compact analytic surface S 
and a curve C on S such that S — C is biholo- 
morphic to a complex manifold M, then we 
say that S is a compactification of M with 
boundary C. Every compactification of C? is a 
rational surface (Kodaira, J. Morrow [45]). 
Every compactification of C x (C— {0}) is also 
rational (T. Ueda [53]). However, all compac- 
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tifications of (C— (01)? are rational surfaces, 
certain kinds of Hopf surfaces, or P'-bundles 
over elliptic curves as constructed by Serre 
(Ueda). 
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A. Classical Theory 


If the ratio c/c, of two periods w,, œ, of an 
telliptic function f belongs to an imaginary 
*quadratic field K, then there exists an alge- 
braic relation between f(z) and f(Az) for any 
^ in K, and such an f is said to have complex 
multiplication. This phenomenon for the tsn 
function with modulus Bea was discovered 
by C. F. Gauss and was applied to the prob- 
lem of dividing a tlemniscate into five arcs of 
equal length. More generally, N. H. Abel 
showed that the special dividing equation of 
an sn function with complex multiplication 

is algebraically solvable. From a number- 
theoretic point of view, L. Kronecker conjec- 
tured that every *Abelian extension of an 
imaginary quadratic number field K is deter- 
mined by a transform equation of an elliptic 
function with complex multiplication by a 
number of K (1880). This is an analog of the 
fact, announced by Kronecker and proved by 
H. Weber, that every Abelian extension of the 
rational number field is a subfield of a tcy- 
clotomic field. Kronecker's work was con- 
tinued by Weber [2], and his conjecture was 
proved by T. Takagi (1903) for K =Q(./—1), 
by T. Takenouchi (1916) for K = Q(e?"/?), and 
by Takagi (1920) for the general case using 
tclass field theory. H. Hasse [5] and M. Suga- 
wara simplified the theory of complex multi- 
plication, and Hasse noticed a relationship 
between complex multiplication and tcon- 
gruence zeta functions. Working from Hasse's 
idea, M. Deuring constructed the theory of 
complex multiplication purely algebraically 
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and determined tHasse’s zeta function of an 
elliptic curve with complex multiplication. 
For the rest of this article, K always stands 
for an imaginary quadratic field. Let L be a 
"lattice group on the complex plane C gen- 
erated by w,, c, and let z be a complex vari- 
able. Define functions 9, g2, g3 as follows: 
plz L)= 9(z:0,,0,) 72? - X(—0) ? — 
o7?) g2(L) = g5(0,,0,) 603.0 *, g4(L) — 
d3(0,, 05) - 140X c $, where the sum X; is 
over the elements c of L except 0. Let oi be 
the derivative of o then z (1, (o(z), o'(z)) is a 
one-to-one correspondence between the points 
on the complex torus C/L and those on the 
telliptic curve E; X,X224X1—9g,X2X,— 
ag, X5 in the projective plane. If the quotient 
w/w generates K, then the ring of analytic 
endomorphisms of C/L (i.e., the ring of endo- 
morphisms of E) is isomorphic to a subring of 
the *principal order o of K. In particular, if the 
lattice group L is an fideal of K (for K c C; — 
347 Quadratic Fields), then the ring of endo- 
morphisms coincides with o. The function 
J(t)=J(E) - J(L) 2223? g,(LP/A(Q)) (A(L) = 
g; (LP —27g4(LY)) of 1— 0,/0;, Im x» 0, is 
a tmodular function of level 1, and J(E) is 
called the invariant of the elliptic curve E. 
If E has a complex multiplication, then J(E) 
is an algebraic integer. Then the three main 
theorems of the classical theory of complex 
multiplication can be stated. 


Theorem 1. Let h be the tclass number of K, 
and let a,,...,a, be a set of representatives of 
ideal classes of K. Then J(a,), ..., J(a,) are 
exactly the conjugates of J(a,) over K, and 
K(J(a,)) is the maximal unramified Abelian 
extension (the tabsolute class field) of K (— 59 
Class Field Theory). 

Next we define the function f by 


f: L)=9293A *: p(z L), 


K4Q(./-1), Q(e?”), 
-gjA t p(z LP, K=Q(./—1), 


-g,A g(zLy, K=Q(e?™). 
Theorem 2. Let o be the tprincipal order of K, 
m be an integral ideal of K, and a be an arbi- 
trary ideal of K. Choose a number Z of K such 
that m={2e0|4ćea}. Then K(J(a), f(&;a)) is 
the tclass field for the tray modulo m. 

The number č in this theorem is obtained by 
a !m(Z)-—b, where b is an integral ideal be- 
longing to the ideal class of a ! ni. Thus f(é; a) 
is a "special value" of an elliptic function as 
well as of a modular function. 

The tgeneral law of reciprocity, the *prin- 
cipal ideal theorem, and the ramification in the 
class field in theorem 2 can also be described 
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in terms of elliptic functions or elliptic curves. 
In general, the ring of endomorphisms of an 
elliptic curve defined over a field k of charac- 
teristic 0 is either Z or an order of an imagi- 
nary quadratic field. On the other hand, if the 
characteristic of k is not zero, then the ring of 
endomorphisms may be an order in a definite 
tquaternion algebra. 


B. Complex Multiplication of an Abelian 
Variety 


Following Kronecker's idea, D. Hilbert posed 
in his lecture at Paris (1900) the so-called 12th 
problem: to find an analytic function whose 
special values generate Abelian extensions 
over a given algebraic number field (— 196 
Hilbert). E. Hecke constructed unramified 
Abelian extensions of an imaginary biquadra- 
tic field using *Hilbert modular functions 
[10]. After this, there was no notable devel- 
opment concerning the problem until the 
theory of complex multiplication was gen- 
eralized to the case of tAbelian varieties, which 
was made possible by progress in algebraic 
geometry, in particular A. Weil's geometric 
theory of Abelian varieties (G. Shimura and 
Y. Taniyama (11]). The following results 
have been obtained: Consider a triple (A, X, t), 
consisting of an Abelian variety A defined over 
C, a fpolarization X of A, and a point t of A. 
Call two such triples (A, X, t) and (A', X', t) 
isomorphic if an isomorphism of A onto A' 
maps X onto X' and t onto t’. Then there exists 
one and only one subfield ko of C with the 
following property: In order for (A, X, t) and 
(A7, X^, t?) to be isomorphic for an automor- 
phism c of C, it is necessary and sufficient that 
o fix all elements of ky. We call kọ the field of 
moduli of (A, X, t). If A is an elliptic curve E 
and if t=0, then kọ = Q(J(E)). In the higher- 
dimensional case, the field of moduli is gen- 
erated by special values of a 'Siegel modular 
function. 

If F is a totally imaginary number field 
that is a quadratic extension of a totally real 
field Fy of degree n, then there exists a set 
(91. ..., Ony of n different isomorphisms of F 
into C such that @;=@, never occurs for i xj, 
where the bar denotes complex conjugation. 
We call (F; (9,3) a CM-type. 

If a is an ideal of F, then L= ((o, (a), ..., 
Q,(x))e C" |a€a] is a lattice group in the n- 
dimensional complex linear space C", and 
C"/L is analytically isomorphic to an Abelian 
variety A of dimension n in a complex pro- 
jective space. The tendomorphism ring 9I(A) 
of A contains a ring that is isomorphic to 
the principal order o of F. Conversely, every 
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Abelian variety of dimension n such that 9I(A) 
contains a ring isomorphic to o can be con- 
structed in this way. 

Let M be a normal extension of Q contain- 
ing F. Denote the Galois group of M by G 
and the subgroup of G corresponding to F 
by H, and put 5—| J; o, H. where o; stands 
for a prolongation of the previous o, to G. 
Then the following three conditions are equiv- 
alent: (i) 91(4) = o; (ii) A is simple; (iii) H = 
{yeG|Sy=S}. If H* ={5eG|5S=S}, then we 
can choose w,€G such that S= |), H*y,. 

If F* is the subfield of M corresponding to H*, 
then (F*; {w,,}) is also a CM-type, and F* = 
Q(x, ¢,(«)|ae F). Moreover, for an ideal r of 
F*. IL is an ideal of F. 

Now let H,, be the group of ideals r of F* 
that are relatively prime to the norm N(m) of 
an integral ideal m of F* and for which there 
exists a number é of F with 


[I5 02(&. Ne=], €=1 (mod* m). 


Then H,, is an *ideal group modulo m in F*. 


Theorem 3. Assume that 9I(A4)- o, and let X be 
an arbitrary polarization of A. Denote a point 
of A with m—(4€0|4t —0] by t, and let k,, be 
the field of moduli of (A, X, t). Then k,, F* is the 
‘class field over F* corresponding to Hp- 

The point t in theorem 3 always exists. In 
particular, if nt — o, then t=0. When A is an 
elliptic curve E, F is an imaginary quadratic 
field and we have F* — K, and theorem 3 coin- 
cides essentially with the content of theorems 1 
and 2. If n5 1, F=F* holds only in special 
cases. 

The theory of complex multiplication of A is 
closely related to the tHasse zeta function of A 
(— 450 Zeta Functions). 
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A. Algebraic Properties of Complex Numbers 


À complex number is an expression of the form 
a t ib with arbitrary real numbers a and b and 
the imaginary unit i. Writing «=a + ib, B= 

c id, we define «=f if and only if a=c and 

b — d. As regards algebraic operations with 
complex numbers, we define a+ f — (a 4- c) 
i(b +d), «y — B -(a—c) - ib — d), «B —(ac — bd) 
+i(ad+bc), and for f £0, i.e., c? +d? 0, 

a/ PB — (ac + bd)/(c? +d?) + i((bc — ad)/(c? + d?)). 
Then the addition and multiplication thus 
defined obey commutative, associative, and 
distributive laws, and complex numbers form 
a fcommutative field with O=0+i0 and 12 

1 4- i0 as its zero element of addition and iden- 
tity element of multiplication, respectively. The 
set of all complex numbers is usually denoted 
by C. 
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By assigning to each real number a a com- 
plex number a+ i0, algebraic operations on 
real numbers are carried into those of the 
corresponding complex numbers. That is to 
say, the field R of all real numbers is mapped 
isomorphically into the field C of all complex 
numbers. By identifying a with a+i0, we are 
taking R as a subfield of C. Also, 0 - i1 will 
be denoted simply by i. From the previous 
definition of algebraic operations, it follows 
that i? = —1. Furthermore, since « =a + ib 
— (a *- i0) - (b -- i0) (0 -- i1), a+ib is not a mere 
symbolic expression but can also be regarded 
as the outcome of algebraic operations on 
a, b, iin C. The real and imaginary parts of a 
complex number a — a4- ib are, by definition, 
a and b, denoted by Rec and Imo, respec- 
tively. A complex number that is not a real 
number is sometimes called an imaginary 
number; in particular, a complex number « 
with Rea —0 is called a purely imaginary 
number. For each complex number «=a +ib, 
we define its conjugate complex number as 
a — ib, and denote it by à. We then have 2+ f 
—à-- B and af — afi. The mapping «3 is an 
*automorphism of € which leaves each ele- 
ment of R invariant. Also, the following rel- 
ations hold: Rex 2 (x - €)/2 and Ima = (x — 
EA 

Regarded as an overfield of R, C is an 
'extension field of R of degree 2 obtained by 
the adjunction of i, which is a root of an irre- 
ducible equation x? + 1 20. The important 
algebraic property of C is that it is talgebra- 
ically closed. Namely, for any polynomial f(x) 
with coefficients in C, the equation f(x) 2 0 
possesses at least one root in C (*Gauss's 
fundamental theorem of algebra). 


B. Topology of C 


The absolute value (or modulus) of a complex 
number «=a +ib, denoted by |«|, is by defi- 
nition |x| 2 / a? +b? = Ja. If æ is real, then 
the absolute value of « in the sense of complex 
numbers is identical to the one in the sense of 
real numbers. It always holds that |x| 2 0 and 
|x| 20a 0. It follows further that |a + f| x 
la] -- |B], (8| — |o]1 Bl, and |a] — [a]. 

For each pair of complex numbers « and f, 
define p(a, f) 2 |a — B|. Then with p(a, p) as the 
*distance function, C satisfies the axioms for a 
tmetric space, and in particular, lim,..,, %,= 
dy S lim p(o,, xo) =0<>lim|a,, — Goals 0 
(lima, =a, and lim b, = bọ) (where «, =a, + ibp, 
dg — dg t ib). From this it is easily seen, as in 
the case of the set R of all real numbers, that C 
also becomes a ‘locally compact and *com- 
plete metric space. 

With respect to this topology, the four oper- 
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ations (except for division by zero) are con- 
tinuous: %,>%, and fj, Bo imply a, + fj, — 

Go + Bos 0, — Buxo — Bos 0t, Xo Bo; and 
&u/p,— Gelle (where in the last case we assume 
B, #0 and fi, #0). Thus C becomes a topo- 
logical field. Furthermore, the assignment 

X x gives a continuous mapping C >C, a 
homeomorphic automorphism of C. 


C. The Complex Plane 


If in a plane to which is assigned rectangular 
coordinate axes a complex number a — a + ib 
is represented by a point (a, b), then the plane 
is called the complex (number) plane (Gauss- 
Argand plane or Gaussian plane) (Fig. 1), and 
the point representing « is called simply the 
point x. The abscissa and ordinate axes are 
called the real and imaginary axes, respec- 
tively. A point x—a +ib can be represented by 
*polar coordinates r, 0 with the origin and the 
real axis as the pole and the generating line, 
respectively, where rz, /a? +b? is the absolute 
value |a| of « and 0 is the argument (or ampli- 
tude), denoted by argo, of x. The argument 

of æ is uniquely determined mod 27 if a40 
and is an arbitrary real number if à — 0. 


imaginary 
axis 





real axis 


Fig. 1 


The absolute value |«| of a complex number 
x, regarded as a vector from the origin to the 
point a, is the length of this vector. For com- 
plex numbers « and f, to the sum of the vec- 
tors a and f corresponds the sum a + £ of the 
complex numbers. A complex number o, in 
terms of its absolute value r and argument 0, 
is expressed as « —r(cos 0 -- isin 0), which is 
called the polar form of x. For polar forms 
the following hold: x 2r(cos0 —isin 0)= 
r(cos( —0) -- isin(—0); a~! «a/|a|? = 
r^ !(cos( —0) - isin( —0)) (x #0); and aa, = 
rirj(cos(0, -- 0;) - isin(0, + 05)), where |aj| — r; 
and arga; — 0; for j — 1, 2. This last relation 
when r; =r, =| is called De Moivre's formula. 
The nth roots of unity in C are given by p;— 
cos 2zj/n 4- isin 27j/n (j —0, 1, ...,n — 1) (Fig. 2). 

In the complex plane, the mapping « > a 
corresponds to the freflection of the plane in 
the real axis, x > «+ to the parallel *trans- 
lation along a vector f, « > «p (B #0) to the 
trotation through the angle arg f followed by 
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a thomothetic transformation with center 
0 and constant ratio |f|, and «> «^! to 
the tinversion with respect to the unit circle 
fa||a| 2 13. 

The distance p(x, B) 2 |a — f| between a and 
f in € coincides with their Euclidean distance, 
provided that « and f are regarded as points 
in the Euclidean plane, so that the complex 
plane is tisometric, and accordingly homeo- 
morphic, to the Euclidean plane. 





Fig. 2 


D. The Complex Sphere 


In the rest of this article, P denotes the com- 
plex plane and X denotes the sphere of radius 
1, with 0, the origin of P, as its center. The 
points N(0,0, 1) and 5(0,0, —1) of X will be 
called the north and south pole, respectively 
(Fig. 3), where the Ist and 2nd coordinate axes 
are the real and imaginary axes of P, respec- 
tively, and the 3rd coordinate axis is ortho- 
gonal to P. A straight line from N through a 
point z (a complex number) in P intersects 

X at a point Z —(x,, x5, x4) different from 

N, where z=(x, ix;)/(1 — x3), x4 2 (z - z)/ 

(1 +[z|?), x5 G —2)/G + ls and x3 = 

(Iz? —1)/(|z|? + 1). The mapping z > Z is called 
a stereographic projection from N, by means of 
which P and X — {N} become tconformally 
equivalent to each other. Consequently z can 
be represented by a point Z of Y—{N}, and 

2 thus used is called a complex sphere (or 
Riemann sphere). Let us adjoin to the complex 
plane P a new element, denoted by oo, called 
the point at infinity, which corresponds to the 
only exceptional point N of X. The topology 
of the complex plane with oo can be intro- 
duced by the corresponding topology of the 
Riemann sphere. Indeed, the family of all 

the sets (z||z| » M] u {œ} for M>0 forms a 
*local base around oo. By introducing local 
complex coordinates ( = 1/z into the neigh- 
borhoods of oc, each element of this local base 
is represented as (ICT « M! 3, in which the 
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convention ¢ —0 is adopted for z = oo. The 
complex sphere thus defined can be regarded 
as a ‘Riemann surface De, a 1-dimensional 
tcomplex manifold). 

The complex plane (complex sphere) whose 
points are represented by a variable z or w is 
called a z-plane or a w-plane (a z-sphere or a w- 
sphere). 





Fig. 3 


E. Linear Fractional Functions 


Given complex numbers a, b, c, and d with 
ad — bc £0, we define a linear fractional func- 
tion (or simply linear function) by 

az+b 
w= . 

cz d 





(1) 


As a mapping from the z-sphere into the w- 
sphere, this linear function is called a Móbius 
transformation (linear fractional or simply 
linear transformation). The usual linear trans- 
formation, i.e., the one with c=0 in the pre- 
sent case, is sometimes distinguished as an 
entire linear transformation. Since (1) depends 
only on the proportion a:b:c:d, we can 
assume ad—bc=1 without loss of generality. 
The transformation (1) is *holomorphtic and 
*univalent on the whole z-sphere with only 
one exceptional tpole at — d/c (oo if c — 0) of 
order 1, and the inverse of (1) is also a linear 
fractional function. The set of all linear trans- 
formations forms a ‘group with composition 
of transformations as the group operation. 
One of its subgroups is the *modular group. 
Linear transformations carry any circle 
of the complex plane (or of the Riemann 
sphere) into a circle of the same plane (or of the 
same sphere) if we adopt the convention that 
straight lines are a special kind of circle. (In 
the case of a Riemann sphere, no such conven- 
tion is necessary.) Given on a plane a circle 
with center o and radius r and two points p 
and p' on a half-line issuing from o satisfying 
op-op' —^r?, the points p and p' are called 
symmetric points (or reflection points) with 
respect to the circle. The transformation pp’ 
is called the inversion with respect to this cir- 
cle. In the complex plane, let z and z' be sym- 
metric points with respect to a circle C. Sup- 
pose that by a linear transformation, z, z' and 


C are carried to points w, w and a circle D, 
respectively; then w and w' become symmetric 
with respect to the circle D (principle of reflec- 
tion). Thus symmetricity is invariant under 
linear transformations. Also the tanharmonic 
ratio of any four points z,, Z2, Z3, and Z4, 

(Z1, Z2; Z3, 24) - (z1 —23)(21 — za): (za —23)/ 

(z5 — Z4), is invariant under a linear transfor- 
mation; Le, (Z1, Z2; Z3, 24) - Dun, W2; W3, W4) 
holds, where w; is the image of z; under a linear 
transformation (j= 1,2, 3, 4). 


F. Normal Forms of Linear Transformations 


There exist fixed points of the transformation 
(1) on the Riemann sphere, i.e., points satisfy- 
ing z —(az + b)/(cz + d). The number of fixed 
points is 2 or 1, except when w =z. If the trans- 
formation has two fixed points, they will be 
denoted here by p and q. The natural conven- 
tion p —q is adopted if the transformation has 
one fixed point. If c=0, then p or q is oo, and 
furthermore if c — a — d —0, then p and q are 
both oo. 

For unequal finite p and q, (1) can be re- 
written in the following normal form: 
w—p z=p a—cp 


———-—Q ;, Ge #1, 
w—p  z—dq a—cq 








in which, according as arga=0, |«|=1, or 
otherwise, (1) is called a hyperbolic (Fig. 4), 
elliptic (Fig. 5), or loxodromic transformation, 
respectively. This classification can be applied 
also for finite p and infinite q, i.e., to the trans- 
formation w—p=a(z—p). Furthermore, for 
p=q#, (1) is rewritten in the following 
form: 


XA ( 





Fig. 4 





Fig. 5 
Elliptic transformation. 
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In this case (1) is called a parabolic transfor- 
mation (Fig. 6). For p=q = oo, i.e., if w—z 4 f), 
(1) is also called parabolic. We can easily de- 
termine to which class (1) belongs from the 
discriminant D — (a +d} —4 of the quadratic 
equation cz? —(a—d)z—b=0 obtained from 
z «(az + b)/(cz +d) with ad — bc 21 by multi- 
plying both sides by cz +d. If a 4- d is real, 
then according as D>0, «0, or =O, (1) is 
hyperbolic, elliptic, or parabolic, respectively, 
and if a 4- d is not real, then the transforma- 
tion is loxodromic. 


X den 
- Oe SR 


xS 
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NG SE, 
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Fig. 6 
Parabolic transformation. 


Let D and D' be two arbitrary circular disks. 
Then there always exists a linear transforma- 
tion which gives a one-to-one tconformal 
mapping from D onto D'. Conversely, any 
mapping with this property is given only by 
linear transformations (provided that the half- 
plane having a straight line together with the 
point at infinity as its boundary is regarded as 
a closed disk), which are uniquely determined 
by giving three points a, b, c from the bound- 
ary of D and as their corresponding points, 
three arbitrary points a’, b', c' from the bound- 
ary of D'. 


G. The Poincaré Metric 


Since conformal mappings from the domain 
|z| « 1 onto |w|<1 are given by the trans- 
formations w — £(z —zg)/(1 — zoz) ([e| — 1, 

[zo| « D) (^ Appendix A, Table 13), for corre- 
sponding z and w it holds that 


Idw| ` |dz| 5 
jw? Foe o 





|dz|/(1 —|z|?) is called Poincaré’s differential 
invariant. With a metric having ds — |dz|/(1— 
|z|?) as its tline element, the unit disk |z| < 1 
becomes a *non-Euclidean space in the sense 
of Lobachevskii, and the metric is called the 
Poincaré metric. Furthermore, since the trans- 
formations (2) leave the length of curves invar- 
iant, they can be regarded as tmotions in this 
space, where the *geodesic through two points 
zı and z, is the circular arc orthogonal to the 
unit circle. If we denote the intersections of the 
arc with the unit circle by z, and z4, then the 
tnon-Euclidean distance between two points 
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z, and z, along the geodesic is given by 
(1/2)log(z,,23; 23,24), provided that the points 
Z4» Z4, Z5, Z3 are arranged on the arc in this 
order (— 285 Non-Euclidean Geometry). 
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A. History 


Since the beginning of civilization, people 
have utilized tools for aiding computation. In 
Japan, bamboo computing rods were used in 
the 7th century; before the close of the 16th 
century, the abacus was imported from China 
(— 230 Japanese Mathematics (Wasan)). The 
first calculator capable of performing the four 
arithmetic operations automatically was de- 
signed by W. Schickard of Tübingen Univer- 
sity (1623). After that, B. Pascal independently 
made his famous adding machine; this was 
improved by G. W. Leibniz so that it was able 
to execute multiplications and divisions. 

Most calculators manufactored today are 
electronic; however, many of these machines 
are not "programmable" and hence require at 
every step of computation a manual operation 
for specifying the machine operations to be 
executed. On the other hand, modern auto- 
matic computers can execute automatically a 
sequence of operations according to a given 
program without any manual intervention. 

The automatic computer was conceived by 
the English mathematician C. Babbage in the 
19th century, but mechanical engineering at 
that time was not advanced enough to allow 
the construction of such a computer. His idea 
was first realized by the relay computers Z3 of 
K. Zuse (1941) and Mark I of Harvard Uni- 
versity (1944). In 1947, the first electronic 
computer, ENIAC, appeared, in which vacuum 
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tubes were utilized instead of mechanical com- 
ponents. Since then, the capabilities of com- 
puters have increased rapidly and it can be 
said that we now live in the “computer age." 


B. Principles of Modern Computers 


Information processing in a computer is based 
on communication among its constituents by 
electric signals. In digital computers, informa- 
tion is encoded as a sequence of binary num- 
bers 0, 1, whereas continuous values are al- 
lowed in tanalog computers. The minimum 
quantity of information in digital computers 
is therefore a binary digit, called a bit. Since 
continuous values fluctuate on account of 
electric noise in the circuits, digital computers 
have the advantage of maintaining high preci- 
sion during computation. 

The binary numbers 0, 1 are represented in 
practice by two distinct electric signals: two 
distinct voltages, two distinct phases of alter- 
nating current, the existence or nonexistence 
of a pulse, etc. A system of circuits is called 
synchronous when it contains a clock, a gener- 
ator of periodic pulses, which synchronizes 
the transmission and transformation of in- 
formation. In an asynchronous system, circuits 
execute each step of information processing 
independently and advance to the next step 
after verifying the termination of the preceding 
step. 

In ENIAC, a program was inserted into the 
computer by carefully connecting many con- 
trol lines on plugboards. Since this caused 
much trouble in inserting the different pro- 
grams needed for different jobs, J. von Neu- 
mann proposed encoding programs in se- 
quences of binary digits and storing these 
programs in the memory unit of the computer. 
His principle, the stored program principle, was 
realized by EDSAC (1949) and has since been 
widely used. 

An automatic computer in general con- 
sists of the following five units: the arithme- 
tic unit, memory, control, input, and output 
(Fig. 1). These units are interconnected by 
wires that exchange information in the course 
of computation. 
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Construction of electronic computers. 
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The arithmetic unit consists mainly of sev- 
eral memory registers and operational circuits 
associated with them. Each register stores a 
binary number of n bits (usually 16 <n « 64). 
An important building block of the opera- 
tional circuit is the basic adder of 1-bit num- 
bers. A parallel adder of n-bit numbers can 
be obtained by connecting n copies of the 
basic adder. Alternatively, a sequential adder 
can be composed of a single basic adder which 
is utilized repeatedly to sum up bit by bit two 
binary numbers from the lowest bit, Subtrac- 
tion is usually carried out by adding the com- 
plement of the subtrahend. Multiplication is 
realized by shifting the multiplicand to the 
left and adding it to the intermediate sum; 
this addition may be omitted, depending on 
the relevent bit of the multiplier. In division, 
we shift the dividend to the left and if possible 
subtract the divisor from it. By recording at 
each step whether or not the subtraction is 
possible, we obtain the quotient. 

The memory unit stores the instructions 
and given data as well as the necessary data 
obtained in the course of computation. It is 
divided into many registers, each of which 
contains a number and is referred to by means 
of a serial number called an address. At present, 
high-speed memory units are usually made 
up of magnetic cores or integrated circuits. As 
auxiliary large-scale memory, there are also 
magnetic drums, disks, and magnetic tapes, 
etc. Other elements under investigation are 
extreme-low-temperature elements, chemical 
elements, optical elements, etc. 

The control unit repeats the following oper- 
ations consecutively: (1) takes an instruction 
from the memory location indicated by the 
sequential control counter, (2) gets a data 
word from the memory according to the ad- 
dress part of the instruction, (3) decodes the 
function part of the instruction and sends 
control signals to appropriate circuits, and (4) 
increases the content of the sequential control 
counter by 1 and, after receiving end signals 
from the arithmetic unit, returns to step (1). 
For these purposes the control unit contains a 
counter, a decoder, an encoder to send control 
signals, and a register to store the instruction 
to be executed. 

The arithmetic, control, and memory units 
form the central processor of the computer. In 
contrast to the central processor, the input and 
output units are called peripheral devices. An 
input device receives the necessary information 
(a program and data); a card reader reads 
punched cards, and a teletypewriter sends 
signals directly into the computer. An output 
device presents the results obtained by the 
computer. The results are usually printed by a 
teletypewriter or line printer. Magnetic tape 
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units can be considered as auxiliary input- 
output devices, since tape reels are removable. 
The tape units can accept prepared input data 
as well as record the results, the output. 

These devices are operated according to 
given input-output instructions under the 
control of the central processor. However, 
since every input-output device contains me- 
chanical components and is extremely slow in 
comparison with the central processor, a large- 
scale computer is often accompanied by sat- 
ellite computers which undertake the control 
of input-output devices. 


C. Instructions and Programming 


In stored-program operation, a computer 
performs a sequence of calculations according 
to given instructions. There are various types 
of instructions, but those most frequently used 
are single-address instructions, each of which 
contains a single-address part designating an 
operand. 

À program is a finite sequence of instruc- 
tions arranged suitably for the required com- 
putation. Programming, or making a program, 
is therefore the task of decomposing the re- 
quired computation into elementary steps each 
of which corresponds to an instruction. 

Every instruction is represented in a com- 
puter by a number, a numeric code, which is 
determined in a definite way by the construc- 
tion of the control unit. Before starting com- 
putation, instructions thus encoded are stored 
in the memory. In this sense, a program is a 
sequence of numbers. This sequence is called a 
machine-language program. 

A program is usually divided into several 
blocks, called subprograms or subroutines. 
Some subroutines are made in advance, espe- 
cially those for frequently required jobs such as 
evaluating elementary functions and manipulat- 
ing input-output devices, etc. The system of 
these ready-made programs is called software, 
in contrast to the hardware (1.e., mechanico- 
electronic equipment) of the computer. Quite 
often basic routines are microprogrammed, (e. 
written in a simple code and stored in a fast 
read-only memory (ROM). In this case the set 
of built-in programs is called firmware. The 
handliness of a computer depends mainly on 
close matching of the software and hardware. 

Programs are usually written in certain 
forms called external languages which are easy 
to master. A problem-oriented language is an 
advanced external language in which ordinary 
arithmetic expressions are available with slight 
modifications. Programs written in these lan- 
guages are translated into machine languages 
by program input routines. The translator for 
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a problem-oriented language is called the 
compiler. 

A compiler accepts several macroinstruc- 
tions. Moreover, it is equipped with the fol- 
lowing facilities: (i) the ability to translate 
arithmetic expressions into machine language; 
(ii) the ability to generate linkages to various 
ready-made subroutines according to certain 
simple indications; (iii) the ability to auto- 
matically allocate programs, subroutines, and 
data in the memory; (iv) the ability to check 
automatically the syntactic correctness of 
programs. Thus it accepts an external form 
such as 


if x 2 0 then printreal (SORT(x)) 
else printstring (‘negative’). 


Another important translator is the as- 
sembler, which translates mnemonic codes of 
instructions (add for addition, etc.) into their 
numeric codes according to a given table. It 
allows us to utilize symbols for specifying 
addresses. It also converts decimal numbers 
into binary and generates certain segments of 
the program from rather simple indications. 
Compilers and assemblers are important con- 
stituents of software. 

The large-scale high-speed computers that 
have recently appeared have made software 
systems inevitably more complex. There is 
now software, so-called monitors or operating 
systems, that supervises the uninterrupted 
processing of many programs. Some operating 
systems coordinate several assemblers and 
compilers so that several languages can be 
mixed in writing a program. Examples of 
other important software are mathematical 
(numerical) software and database manage- 
ment systems. 


D. Mathematical Models of Computers 


An operational circuit in the arithmetic unit is 
usually constructed from basic elements (log- 
ical gates), each of which performs a certain 
operation on the binary signals. Thus the 
construction of a circuit from basic elements 
is represented by a composition of a logical 
function defined over the set (0, 1] from a 
given set of basic functions. Post [3] consid- 
ered functional composition without feed- 
back loops and established a general crite- 
rion for a given set of logical functions to be 
complete, i.e., to be capable of generating the 
whole set of logical functions. Strictly, how- 
ever, a logical gate takes a definite time delay 
to perform its operation. Therefore Kudryav- 
tsev [4] proposed as a model of a logical gate 
a pair (f, d) of a logical function f and a non- 
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negative integer d: f represents the operation 
of the gate and d represents its delay. He de- 
fined the feedback-free composition of such 
functions with delays and gave a completeness 
criterion for a set of logical functions with 
delays. His study has been extended in various 
directions by Loomis [5], Nozaki [6], and 
Rosenberg [7]. 

When a circuit contains memory elements 
or feedback loops, its function is suitably 
represented by an tautomaton. Hence the 
design and analysis of circuits with memory 
elements, state-minimization, and equivalence 
checking [8], decomposition into simpler 
components [9, 10], verification of complete- 
ness [11], etc., can be studied in terms of auto- 
mata (— 31 Automata). A computer itself can 
be considered to be a finite automaton, since 
it has a finite number of memory elements and 
its behavior is completely determined by the 
content of the memory (its internal state) and 
the inputs. However, since a modern computer 
has an enormous memory and contains re- 
placeable parts, such as magnetic tapes, it is 
more adequately represented by an infinite 
model, such as a * Turing machine. 

Turing machines are capable of simulating 
many intellectual activities governed by for- 
mal rules. It is believed that any well-defined 
finite algorithm can be simulated by a Turing 
machine (Church's thesis.) Thus any compu- 
ter can be simulated by a Turing machine, 
and hence it is unable to resolve those deci- 
sion problems which are unsolvable for Tur- 
ing machines. For instance, no program can 
decide in finite steps whether a given pro- 
gram written in a computer language, say 
FORTRAN or PASCAL, eventually stops or 
not. On the other hand, a modern computer 
can simulate a universal Turing machine pro- 
vided that its memory can be extended un- 
boundedly by supplying magnetic tapes. 
Hence any well-defined finite algorithm can 
be simulated by a computer unless the limi- 
tation of memory capacity hampers its 
accomplishment. 


E. Mathematical Theory of Programming 
Languages 


Along with the development of software, pro- 
gramming techniques have gradually accumu- 
lated. For instance, we now have an almost 
satisfactory method of translating arithmetic 
expressions into machine language. However, 
we still lack a general theory to cover effec- 
tively a wide array of programming problems. 
The design of an adequate metalanguage 
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is still an important problem, if the word 
"adequate" implies complete description of 
syntax and semantics of problem-oriented 
languages. Àn interesting problem is to define 
and suitably classify grammars with respect to 
their capabilities of forming languages. Such 
research is an important branch of mathema- 
tical linguistics. N. Chomsky [12] has inves- 
tigated this problem, starting from research on 
natural languages, and has given formal defini- 
tions of the grammars (— 31 Automata D). 

*Context-free grammars (type-2 languages) 
play an important role in the theory of soft- 
ware as well as in Chomsky's syntax, since 
they are powerful enough to describe the pa- 
renthesis structure and simple and easy to 
manipulate. Some variants of context-free 
grammar have been proposed for attaining 
high efficiency in the automatic appraisal of 
programs [13, 14] or for increasing the pro- 
grams' generative power [15]. 


F. Branches of Information Science 


Many fields related to computers now make 
up the information sciences or informatiques. 
Important mathematical theories born or 
developed in the information sciences are: 

(1) Design and analysis of hardware devices: 
*Boolean algebra [16], switching theory [17], 
and theory of tautomata [18, 19]. 

(2) Design and analysis of programming lan- 
guage; theory of formal languages [14, 21]. 

(3) Design and analysis of algorithms; tnumer- 
ical analysis, theory of *complexity of compu- 
tation, theory of tdata processing. 

(4) Mathematical foundation of programming; 
logical verification of correctness and equiva- 
lence of programs [22], frecursive function 
theory, *decision problems. 

Applications of the information sciences are 
found in diversified fields, such as statistics, 
operations research, mathematical psychology, 
econometrics, jurimetrics, and behaviorimetrics. 
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A. Mobius Geometry 


We represent an n-dimensional sphere S” as 
the tquadric hypersurface An: cx - x24... 
x? —2xox, —Ü in an (n+ 1)-dimensional real 
*projective space P"*!, where the (x,) are 
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thomogeneous coordinates in P"*!. We denote 
by M (n) the group of all tprojective transfor- 
mations of P"*! that leave S" invariant. Then 
the transformation group M (n) acts on S". The 
pair (S", M (n)) is called the conformal geom- 
etry or Möbius geometry. We call S” an n- 
dimensional conformal space, a transformation 
belonging to M(n) a Mobius transformation, and 
M (n) the Möbius transformation group. Every 
point of the projective space P"*! corresponds 
to a hypersphere on S”. For example, if a point 
A lies outside of An, then the intersection of S" 
and the tpolar hyperplane of A with respect to 
S" is an (n — 1)-dimensional sphere $" !, and 
the point A corresponds to this real hyper- 
sphere 5" `. Similarly, if a point A lies on S", it 
corresponds to a point hypersphere, and if a 
point A lies inside of S", then A corresponds 
to an imaginary hypersphere. We sometimes 
identify the point A with the corresponding 
hypersphere. For any two points A =(a,) and 
B=(b,), we put AB—a,b, t ab, ^ ...- a,b, 
— (agb, -- a, bo) and call it the inner product of 
the two hyperspheres A and B. The angle 6 
between two intersecting real hyperspheres A 
and B is defined by cos0 = A BIL A? - / B?). 
This angle is invariant under the Móbius 
transformation. 

In the projective space P"*!, we take a 
"frame (Ae, A, ..., Ans A.) that satisfies the 
conditions 


Ag = do A;= A;A,, = A2 =9, Ag A, = —1, 


AjÁj- gi, Ljol2,...,n, 
where (g;;) is a positive definite matrix. We see 
that Ao and A,, are points on $" and each A, is 
a real hypersphere passing through these two 
points. Every hypersphere X of S" can be 
written as a linear combination of the A;: X = 
UgAg+u,A,+...+uU,A,+U,,A,,. That is, X 
is represented by tprojective coordinates (u,) 
with respect to the above frame. We call these 
homogeneous coordinates (u,) (n + 2)- 
hyperspherical coordinates of the hypersphere 
X. Mf we use these coordinates, the inner prod- 
uct of two hyperspheres X =(u,) and Y= (v,) is 
given by XY— 7 i, giu;v;— (uot, + u Dal 
The Móbius transformation group M (n) is a 
topological group with two *connected com- 
ponents. If we denote by Main) the maximal 
connected subgroup of M (n) and by H the 
subgroup of M (n) that leaves invariant a real 
hypersphere of S", then the set E of all real 
hyperspheres of S” can be identified with the 
thomogeneous space M,(n)/H. The group H 
also consists of two connected components. 
If we denote by H, the maximal connected 
subgroup of H, the homogeneous space E = 
M,(n)/ Ho is a two-fold tcovering space of E. 
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For each real hypersphere Ac E, an element 

A € E over A is called an oriented real hyper- 
sphere. The Móbius transformation group con- 
tains as its subgroups ones that are isomor- 
phic to the group of fcongruent transforma- 
tions of a Euclidean space and ones that are 
isomorphic to the group of congruent trans- 
formations of a non-Euclidean space. That is, 
the subgroup of M (n) that leaves a point 
hypersphere invariant is isomorphic to the 
group generated by congruent transformations 
and thomotheties of the Euclidean space E". 
The subgroup of tindex 2 (the factor group by 
a *cyclic subgroup of *order 2) of the subgroup 
of M (n) that leaves invariant a real (imaginary) 
hypersphere is isomorphic to the group of 
congruent transformation of n-dimensional 
hyperbolic (elliptic) *non-Euclidean space. 

In the n-dimensional Euclidean space E", 
consider a hypersphere of radius r with center 
O. For each point P of E", mark a point Q on 
the tray OP such that OP- OQ =r’. We call 
the point transformation that sends P to Q an 
inversion with respect to the hypersphere. A 
tsymmetry with respect to a hyperplane, con- 
sidered as an extreme case of inversions, is 
also called an inversion. We adjoin a point at 
infinity to the space E" to construct an n- 
dimensional sphere S". Each inversion can be 
extended to a transformation of S", which we 
also call an inversion. Then each Móbius 
transformation is generated by a finite number 
of inversions. By a Móbius transformation 
of S", each hypersphere is transformed to a 
hypersphere. Any angle between two curves 
that intersect at a point of E" is invariant 
under Móbius transformations. Conversely, 
if n> 3, each local transformation of E" that 
leaves invariant the angle of each pair of inter- 
secting curves is a 'restriction of a Möbius 
transformation. However, for n=2 this is not 
true in general; any transformation that leaves 
angles invariant is called a tconformal map- 
ping. Any Mobius transformation z—w on 
the tcomplex sphere $? 2 CU {00} can be ex- 
pressed by an equation of the form w — (az + f) 
(yz +0) or w - (az + B)/(yz + ô), where o, B, y, 
and ô are complex numbers such that «ô — py 4 
0 and z denotes the tcomplex conjugate of z. 


B. Laguerre Geometry 


Let T be an oriented smooth curve in a Euclid- 
ean plane E?. The tangent line at a point p 

of T is supplied with an orientation that is 
induced by the orientation of the curve TI in 
an obvious manner. The oriented line / thus 
obtained is called the oriented tangent line 

of I at p. 
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Let S be the set of oriented lines in E? and T 
be a given set of oriented smooth curves in E?. 
Consider a bijection y of the direct product S 
x T to itself satisfying the following condition: 
If / is an oriented tangent line of a curve I 
belonging to T and y sends (l, T) to (F, T^, then 
l' is an oriented tangent of the curve I". The 
set of such bijections forms a group G. Sup- 
pose that we have le S for which there exist 
two curves I’, and T, in T such that lis a 
common tangent line of I', and T, at p, and 
p2, respectively. Further suppose that the 
element ye G sends (LI;) to (I, I7) (i= 1,2). 
Then the element y of G is called an equilong 
transformation if the following conditions (i) 
and (ii) are satisfied: (i) I’ is tangent to I; at 
points p; (i= 1, 2), (it) the distance between p, 
and p; is equal to the distance between p, and 
p2. The set of equilong transformations forms 
a subgroup H of G. In particular, if T is the set 
of oriented circles (including point circles), the 
elements of H are called Laguerre transfor- 
mations. In an obvious manner, we can divide 
the set of oriented circles into two classes, 
those with “positive” and those with “nega- 
tive” orientations. With an oriented circle F 
we associate the pair o(I) 2(P, r), where P is 
the origin of the circle and r is a real number 
whose absolute value is equal to the radius of 
the circle and whose signature coincides with 
that of the orientation of T. The mapping 
o T E? x E! thus defined is called Lie's 
minimal projection. An example of a Laguerre 
transformation, called a dilatation, is given by 
a bijection y of S x T to itself satisfying the 
following condition: Let y(l, I) 2 (l', ^); then /' 
is parallel to J, the distance between / and I’ is 
a given number, and q(I") Z(P, r +c), where 
o(T)=(P,r) and c is a given constant. We note 
here that the action of a Laguerre transfor- 
mation 7: (I, I) 2(l, I") is determined by its 
action on S (y acts on S by y(/)=!’). Another 
example of a Laguerre transformation y, called 
a Laguerre inversion, is determined by means 
of a given oriented circle O and a line p that is 
not tangent to O; given an oriented line |, its 
image /' under the action of y is determined 
as follows (here we describe the case where | 
is not parallel to p). There exists a uniquely 
determined oriented tangent line g of the circle 
O parallel to |. Now we have a uniquely deter- 
mined oriented tangent line g' of O that passes 
through the point of intersection of the lines g 
and p such that g' Z g. The image l is the line 
parallel to g' and passing through the point of 
intersection of the lines / and p (Fig. 1). Each 
Laguerre transformation can be written as a 
product of a finite number of Laguerre inver- 
sions. We denote the group of Laguerre trans- 
formations by L. The pair (L, S) 1s, by defini- 
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tion, a model of Laguerre geometry. Notions 
such as Laguerre inversions, dilatations, and 
transformations can be generalized to cases of 
higher dimension by utilizing oriented hyper- 
spheres and oriented hyperplanes. 





Fig. 1 


C. Sphere Geometry 


Let S be the set of oriented circles (including 
point circles and oriented lines) in the Eucli- 
dean plane E?. Two oriented circles C, and 

C, are said to be in contact with each other 

if they have a point p and an oriented tan- 
gent line passing through p in common. (An 
oriented circle C and an oriented line / are in 
contact with each other if and only if l is an 
oriented tangent line of C.) In this case, we call 
the pair (C,, C;) a contact pair. A bijection y of 
S to itself is called a Lie transformation if it 
sends any contact pair to another. (The Lie 
transformation is a special case of the tcontact 
transformations.) An inversion with respect to 
a circle determines in an obvious manner a Lie 
transformation, which is also called an inver- 
sion. We denote the group of Lie transforma- 
tions by G. Any element y of G can be written 
as the product of a finite number of inversions 
and Laguerre inversions, and G contains the 
group of Móbius transformations and the 
group of Laguerre transformations as sub- 
groups. The pair (G, S) is called a model of 
circle geometry. The notion of circle geometry 
can be generalized to that of hypersphere geom- 
etry for the case of higher dimensions. Speci- 
fically, when we replace E? by E? and circles 
by spheres, we have sphere geometry. 

Let V be a complex 3-dimensional space, 
and let M, N be the sets of oriented lines and 
oriented spheres in V, respectively. Then M, N 
have the structure of 4-dimensional complex 
manifolds that are homeomorphic to each 
other. The homeomorphism is given by the 
Lie line-sphere transformation that induces a 
bijection from the set of pairs of intersecting 
oriented lines onto the set of pairs of oriented 
spheres that are in contact with each other. 
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D. Group-Theoretic Considerations 


Here we discuss the preceding three kinds of 
geometries from the group-theoretic point of 
view (— 137 Erlangen Program). Let us con- 
sider the quadratic form Q defined by Q(x)= 
— x% 4 X2 x2 xi-—xiin a real projective 
space P^, where x 2 (xo, X1, X2, X3, x4) are 
homogeneous coordinates. We denote by G 
the set of all projective transformations of P* 
that leave Q invariant. The group G consists of 
the set of matrices A of order 5 such that det A 
— ] and Q(Ax) — Q(x) holds for all x in P*; we 
denote by L? the set of all points x in P^ that 
satisfy Q(x) 20. Then G acts transitively on L?. 
Hence if we denote by H, the set of all ele- 
ments of G that fix a point a in L?, for example 
a=(—1,1,0,0,0), we may assume that L? = 
G/H,. The circle geometry that belongs to 

the group of Lie transformations of circles is 
exactly the geometry of the homogeneous 
space G/H,. The group G, of all transforma- 
tions of G that leave the hyperplane x,=0 
invariant acts transitively on L°N{x,=0} (G, 
is isomorphic to M(2)). The geometry of the 
homogeneous space G,/H,1 G is plane con- 
formal geometry. Next, the group G, of all 
transformations of G that leave invariant xo 
+x,=0 coincides with H, and acts transitively 
on L3N {xo +x; =0}. The geometry of the 
homogeneous space L? N {xo +x; =0} (on 
which G, acts transitively) is plane Laguerre 
geometry. In this sense, the circle geometry 
that belongs to the group of Lie transforma- 
tions contains the other circle geometries as 
subgeometries. 

We now describe how plane Laguerre geom- 
etry can be realized as the geometry of the 
space L? N {xo +x; 20]. Let E? be a plane in a 
Euclidean space E?. We fix a Cartesian co- 
ordinate system (yo, Y1, y;) in E? so that E? is 
given by y, —0. To each point y of E? we can 
associate an oriented circle in E? with center 
(0, y,. Y2), radius | yo], and positive (negative) 
orientation if yg is positive (negative). If y lies 
on E?, the corresponding circle is the point 
circle y itself. Now let us consider the group G, 
of all affine transformations of E? whose rota- 
tion parts leave the quadratic form Q'(y) — 

— yà + y? + y} invariant. (G; is an isometry 
with respect to the tmetric defined by Q'.) Each 
element of G; induces a transformation of the 
set of oriented circles in E? (including the 
oriented lines and point circles) onto itself. 
The mappings of E? into L? defined by xy = 

(1+ Q'(y))/2, x, =(1—Q'(y))/2, x5 — y1, X3 7 Va, 
X4 = yo is a one-to-one correspondence of E? 
onto the subset of L? such that xo +x; #0. 
This correspondence induces an isomorphism 
of G; onto G,. In Laguerre geometry, there are 
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no essential distinctions between points and 

oriented circles on E?; and the group G, acts 
on a 3-dimensional space of oriented circles 

(including point circles). 
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A. General Remarks 


Let a function w= f(z) that maps a *domain D 
on the *complex z-sphere homeomorphically 
onto a domain A on the complex w-sphere 
satisfy the following two conditions: (1) Every 
curve C, :z(t) (Oo & t € 1) that starts at any point 
zo in D and possesses a tangent there has an 
image curve C,:w—w(t)— f(z(t)) (0 € t« 1) that 
also possesses a tangent at the image point wo 
= f (zo). (2) The angle between any two curves 
Ci? and CC? possessing tangents at zg is equal 
to the angle between their image curves CCP 
and CP, where the direction of the angle is 
also taken into account. Then the mapping 
from D onto A is said to be conformal, and A 
is called conformally equivalent to D. It has 
been proved that w= f(z) is necessarily a func- 
tion *analytic in D (D. Men'shov, 1931). 

Consequently, the theory of conformal 
mapping is a branch of the theory of analytic 
functions. That a function w= f(z) maps a 
domain D conformally onto a domain A 
means that it is a tmeromorphic function 
*univalent in D and its range is A. Then f '(z) z 
0 holds at every (finite) point in D, and the 
ratio of the lengths of the segments between 
two points wo = w(0), w(t) on C, and between 
two points zo — z(0), z(t) on C, tends to a fixed 
nonvanishing limit | /'(z9)| as t 20 indepen- 
dently of the choice of C,. Hence if z, and z, 
lie on CH) and CO, respectively, and w; and 
w, are their image points lying on the image 
curves CU! and C), respectively, then the 
two triangles Az,zgz; and Aw, wow, are nearly 
similar in the positive sense, provided that z, 
and z, are near enough to Zo. This justifies 
the word “conformal,” which means “of the 
same form” (— Appendix A, Table 13). 


B. Conformal Mapping onto the Unit Disk 


A fundamental theorem in the theory of con- 
formal mapping is Riemann's mapping theo- 
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rem, which states that any tsimply connected 
domain D with at least two boundary points 
can be mapped conformally onto the interior 
A of the unit circle. This theorem is equivalent 
to the assertion of the existence of tGreen’s 
function of D and can be proved in various 
ways. B. Riemann (1851) gave a proof, based 
on an idea of C. F. Gauss, by assuming the 
existence of a solution for a variational prob- 
lem minimizing the tDirichlet integral. The 
logical incompleteness implied by this as- 
sumption was later removed by D. Hilbert 
and others. The proof that is now regarded 
as simplest is due to L. Fejér and F. Riesz's 
method (T. Radó, 1922, 1923), which applies 
‘normal family theory. On the other hand, 
the osculating process due to P. Koebe (1912) 
is a purely constructive method of proving 
existence that is also applicable to the case 

of *multiply connected domains. The map- 
ping function w= f(z) in Riemann’s mapping 
theorem is uniquely determined under the 
normalization condition f(zo) — 0, arg f '(z;9) = 
0, at a point Zo in D, where 6, is a given angle. 

The other types of simply connected do- 
mains are the Riemann sphere less one point 
and the sphere itself. In both cases every con- 
formal mapping of these domains is a linear 
transformation. A simply connected domain 
is called hyperbolic, parabolic, or elliptic if 
it is conformally equivalent to the unit disk, 
the complex plane, or the Riemann sphere, 
respectively. 

Let w= f(z) map a simply connected domain 
D on the z-sphere conformally onto a simply 
connected domain A on the w-sphere. If a 
sequence {z,} in D tends to a boundary point 
¢ of D, then the corresponding sequence {w,} 
(w, — f(z,)) has no faccumulation point in A 
and does not necessarily tend to a boundary 
point of A. 

If for any sequence {z,} tending to C, { f(z,)} 
tends to a unique point w on the boundary of 
A, it is said that f(z) possesses a boundary value 
c at ¢. To investigate the behavior of {w,' on 
a hyperbolic domain D, we can assume, by 
using a suitable mapping if necessary, that D is 
bounded. Then the problem is reduced, in view 
of Riemann's mapping theorem, to the case 
where D is a bounded simply connected 
domain and A is the unit disk |w| « 1. 


C. Correspondence between Boundaries 


Concerning the correspondence between 
boundaries under the conformal mapping w= 


f(z) of a bounded simply connected domain 


D onto the unit disk |w| « 1, we have the fol- 
lowing three theorems: 
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(1) To any taccessible boundary point zc of 
D there corresponds a unique point on |w| — 1, 
and to any distinct accessible boundary points 
zc, and zc, of D there correspond distinct 
points on the unit circumference. Furthermore, 
the set of all points on |w| 2 1 that correspond 
to accessible boundary points of D has tangu- 
lar measure equal to 27. 

(2) There is a one-to-one correspondence 
between tboundary elements of D and points 
on |w| 2 1 (C. Carathéodory). 

(3) Let w= f(z) map the interior D of a fJor- 
dan curve C conformally onto the unit disk 
A:|w| « 1. Then it possesses a boundary value, 
say UL at every point ¢ on C that satisfies 
| f(C)| LL Hence f(z) is continuous on the 
closed domain D — DUC and maps D bijec- 
tively onto the closed disk A:|w| x 1. Similarly, 
the inverse function z= (w) has an analogous 
property and maps A bijectively and continu- 
ously onto D; that is, a conformal mapping of 
the interior D of a Jordan curve onto the unit 
disk A can be extended into a homeomor- 
phism of the closure D to A (Carathéodory). 

In this case, if the Jordan curve C contains 
a tregular analytic arc I, then the mapping 
function w — f(z) can be prolonged analytically 
beyond F (except at the endpoints of I). Hence 
the mapping w= f(z) is conformal at interior 
points of I. 

Problems on the correspondence of angles 
at the boundary are closely related to *angular 
derivatives. These problems have been at- 
tacked by Carathéodory, S. Warschawski, J. 
Wolff, and others. 


D. Schwarz-Christoffel Transformation 


The problem of determining the form of an 
analytic function that maps the interior of a 
circle or a half-plane conformally onto the 
interior of a polygon was first dealt with by 
H. A. Schwarz and E. B. Christoffel (1869) 
(— Appendix A, Table 13). 

Let P be a polygon in the complex w-plane 
with vertices b, (= 1, ..., m) and interior 
angles a, at b,. Then a function w= f(z) that 
maps a circular disk or a half-plane in the z- 
plane conformally onto the interior of P is 
given by 


fi0- cl f (a, — z» ! dz & C. 
wal 


Here b, = f(a,), and C, C' are constants de- 
pending on the position and magnitude of the 
polygon P. If instead of a disk we consider a 
half-plane in the z-plane, and if one of the a, is 
the point at infinity, then, modifying this for- 
mula by deleting the factor of the integrand 
corresponding to a, — oo, we obtain a formula 
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for w= f(z). This representation is called the 
Schwarz-Christoffel transformation formula. 
This formula was originally derived by Chris- 
toffel to solve a problem of 2-dimensional 
distribution of stationary temperature. It 
then found extensive application to several 
problems of conformal mapping concerning 
polygonal domains and to problems of deter- 
mining force lines or stream lines and equi- 
potential lines in 2-dimensional electrostatics 
or hydromechanics. 

An analogous formula is also derived for a 
function that maps the interior of a circle or a 
half-plane conformally onto the exterior of a 
polygon. In connection with these formulas, a 
third-order differential equation that is satis- 
fied by a function mapping the interior of a 
circle onto a domain bounded by a circular 
polygon is found useful in the theory of auto- 
morphic functions. Moreover, a representation 
analogous to the Schwarz-Christoffel trans- 
formation formula is obtained for a function 
that maps the interior of a circle onto a curvi- 
linear polygonal domain bounded by arcs of 
logarithmic spirals with the origin as asymp- 
totic point. 


E. Conformal Mapping of Multiply Connected 
Domains 


It is also important to consider problems 
concerning one-to-one conformal mapping of ` 
a multiply connected domain on the z-sphere 
onto a suitable multiply connected domain D 
on the w-sphere. The two domains D and D 
are then homeomorphic, but the converse ts 
not true; i.e., there does not necessarily exist a 
one-to-one conformal mapping between D and 
D even when they are homeomorphic. Now let 
D and D be multiply connected domains on 
the z- and w-planes, respectively, both possess- 
ing at least three boundary points. The tuni- 
versal covering surfaces D and $ of D and D 
are hyperbolic and they can be mapped onto 
the unit disks. The groups of their ‘covering 
transformations form the fFuchsian groups G 
and ©. Then in order for D to be mapped 
one-to-one and conformally onto 2, it is neces- 
sary and sufficient that the group © be trans- 
formed into the group G by a suitable linear 
transformation. 

To a domain of finite tconnectivity having 
only continua for its boundary components, 
we can associate conformal invariants (namely, 
moduli) expressed by one real parameter in the 
doubly connected case and by 3n — 6 real 
parameters in the n(>2)-connected case. A 
one-to-one conformal mapping is possible 
only within a class of domains having the 
same invariants (— 416 Teichmüller Spaces). 
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While a circular disk ts taken as a canonical 
domain in the simply connected case, an fan- 
nulus is often taken as a canonical domain in 
the doubly connected case. In the latter case, 
the logarithm of the ratio ( 1) of the radii of 
two concentric boundary circles is usually 
called the modulus. There are various types of 
n( > 2)-connected canonical domains, for in- 
stance, the whole plane, a circular disk or 
annulus slit along concentric circular arcs or 
radial segments, a parallel slit plane, etc. The 
possibility of a one-to-one conformal mapping 
of a given domain onto a canonical domain of 
such a type was proved by Hilbert, Koebe, and 
others in a tpotential-theoretic way and by E. 
Rengel, R. de Possel, H. Grunsky, and others 
in a purely function-theoretic way. Such ca- 
nonical domains are characterized by some 
extremal properties. For example, the horizon- 
tal parallel slit mapping function po(z, zo) — 
(z—z9) ! - alpol(G —z9) ^ ..., zo € D, has 
the extremal property that it is the unique 
function maximizing Real f ] within the 
family of univalent functions f(z) 2(z —zg) ! + 
a[ f ] (z ^ zo) - ...in D. For a general domain 
D, po(z, zo), called the extremal horizontal slit 
mapping, is defined as the limit function of the 
sequence of horizontal slit mappings p{”(z, Zo) 
of D. zo € D,. Here {D,}., is a canonical ex- 
haustion of D. It has the same extremal prop- 
erty as in the case of finite connectivity. The 
boundary components of the image domain of 
D under p,(z, Zo) consist of a horizontal slit 
and a point. The parallel slit mapping p,(z, Zo) 
in the direction of 0 is defined similarly to the 
one that maximizes Ree ??a[ f] [8]. 

For a domain of infinite connectivity, by 
accumulation of boundary components, a 
pointlike boundary component can be mapped 
onto a continuum. A boundary component y 
of a domain D is called weak if its image f(y) is 
a point under every conformal mapping f on 
D. y is called strong if f(y) always consists of 
more than one point. À boundary component 
y that is neither weak nor strong is called 
unstable (L. Sario, J. Analyse Math., 5 (1956)). 
Criteria of weakness, etc., by means of ex- 
tremal length are given in [8]. Moreover, in 
terms of extremal length, a generalization of a 
boundary element is given by Oikawa and 
Suita [9]. On the other hand, the existence 
of one-to-one and conformal mapping of a 
domain bounded by a finite number of curves 
onto the whole plane with mutually disjoint 
circular disks removed was proved by Koebe 
and later derived by J. Douglas and R. Cou- 
rant as a particular case of the existence of a 
solution of tPlateau’s problem [10]. M. Schif- 
fer showed that the mapping function is a 
solution of an extremal problem involving 
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Fredholm eigenvalues [11]. L. Bieberbach and 
H. Grunsky showed the possibility of mapping 
an n-connected domain onto an n-sheeted 
disk. Concerning doubly connected domains, 
detailed investigations were made by O. Teich- 
müller, Y. Komatu, and others. For a domain 
D bounded by n analytic Jordan curves, as 

an extension of the Schwarz lemma, Ahlfors 
showed that there exists a unique extremal 
function folz) maximizing Re f '(zj), zo € D, 
within the family of analytic functions f(z) 
satisfying | f(z)| « 1 in D and that f(z) maps D 
onto the n-sheeted unit disk (1947). fo(z) is 
called the Ahlfors function at z,. For a general 
domain the uniqueness of the extremal func- 
tion in nontrivial cases was established first by 
S. Havinson (1961) and later by L. Carleson 
(1967) and S. Fisher (1969). 

Most *kernel functions of a plane domain D 
have connections with conformal mappings. 
For example, the Bergman kernel U(z, ¢) of 
exact differentials is equal to (2x) (pts, C) 

— ps5 (2, CH. The adjoint kernel V(z, C) is de- 
fined by (22)! (po(z, E) + pi j2(Z, 0). M. Schiffer 
showed that the integral of V(z, C), S(z, i) — 

fz V(z, ¢)dz, is univalent and maps D onto a 
domain bounded by n analytic convex curves 
if D is bounded by n Jordan curves [13]. 

For a general domain D, this is true in the 
sense that each boundary component of the 
image domain under S(z, C) is a convex set (K. 
Oikawa and N. Suita, Kódai Math. Sem. Rep., 
16 (1964)). It is known that S(z, č“) maximizes 
the area of the complementary set of the image 
domain under f(z) within the family of univa- 
lent functions f(z)-(n(z—0)) ! - b,(z— Q4 .... 
The maximum value multiplied by 47 is called 
the span of the domain D; it is equal to a[ po] 
— a[ P2]. For the *Szegó kernel function 
k(z,¢) of a domain D bounded by n analytic 
curves, the adjoint kernel /(z, C) is defined, and 
the Ahlfors function f(z) at ¢ is expressed by 
k(z, Hits C) (P. Garabedian, Trans. Amer. 
Math. Soc., 67 (1949)). 


F. Universal Constants 


Among various universal constants appearing 
in the theory of conformal mapping, Bloch's 
constant is especially famous. A. Bloch (1924) 
showed that a covering surface over the w- 
plane obtained from a mapping w= F(z)— 
z+... that is one-to-one, conformal, and 
holomorphic in |z| « 1 always contains a tuni- 
valent (schlicht) disk whose radius B is a posi- 
tive number independent of the function F 
(Bloch's theorem). The supremum $8 of such 
constants B is called Bloch’s constant. The true 
value of 88 is yet unknown, but estimates have 
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been given by Ahlfors (Trans. Amer. Math. 
Soc., 43 (1938)) and by Ahlfors and Grunsky 
(Math. Z., 42 (1937)) in the form 


age S. 





T (1/4 T (1/12) 
=0.4719.... 


M. Heins showed that the equality sign can be 
deleted from the left-hand side (Nagoya Math. 
J., 21 (1962)) and C. Pommerenke gave a 
simpler proof for the same fact (J. London 
Math. Soc., 2 (1970)). It is conjectured that the 
correct value of $ is equal to the upper bound 
of Ahlfors and Grunsky. Landau's constant 2 
corresponding to the case where the image 
disks are not necessarily univalent satisfies 
0.5x £ «0.55. The lower and upper bounds 
are due to Ahlfors (cited above) and Landau 
(Math. Z., 30 (1929)). Pommerenke showed 
0.5 « € (1970, cited above). For the family of 
univalent functions these constants coincide, 
and the value is called the schlicht Bloch 
constant, denoted by 9I. Its lower and upper 
bounds are known to be 0.5705 (J. Jenkins, J. 
Math. Mech., 10 (1961)) and 0.6565 (R. Robin- 
son, Bull. Amer. Math. Soc., 41 (1935)). (For 
distortion theorems and coefficient problems 
— 438 Univalent and Multivalent Functions.) 
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A. General Remarks 


Suppose that we are given two straight lines l 
and m intersecting at V (but not orthogonally) 
in the 3-dimensional Euclidean space E?. By 
rotating the line m around l, we obtain a sur- 
face %. We call this surface Ẹ a circular cone 
with vertex V and axis l; a straight line on the 
surface passing through V is called a generat- 
ing line of o. 

A section C of & by a plane z not passing 
through V (C 2x13) is called a conic section 
(or simply a conic). This C is a plane curve on 
the plane z. The point set Ẹ— V consists of 
two fconnected components jy, and $Y;. Let 7; 
(i= 1, 2, 3) be planes not passing through V. If 
the conic section C, —z, N & is bounded, then 
C, is contained either in $$, or in %, and is 
teonnected. We call such a C, an ellipse. When 
C =m, N ğ is not bounded but is connected, 
then z, is parallel to one of the generating 
lines of %, and C, is contained either in §, or 
in 2. We call such a C, a parabola. When z, 
intersects both of S, and 3, then C3 =r, N F 
has two connected components and is not 
bounded. We call such a C, a hyperbola. These 
three types exhaust all possible types of conic 
sections. In particular, if the plane z is per- 
pendicular to the axis /, then C =n N ğ be- 
comes a circle. Thus a circle is a special kind 
of ellipse. 


B. Foci and Directrices 


Let C=21%, be an ellipse. The Euclidean 
space E? is divided by z into two thalf-spaces 
Ej, ES (two “sides” of zl, If we put S, N E? 

= ui, 01 N ES = 8,5, we can construct a 
sphere S that is contained in Ej, tangent to 
du along a circle K, and tangent to z at a 
point F. Similarly, we can construct a sphere 
S' that is in E3, tangent to %,, along a circle 
K', and tangent to z at a point F'. We call F, 
F' the foci of the ellipse (Fig. 1). 
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Let x, x’ be the planes containing K, K’. 
Straight lines d 2 x (17, d' =x Nr are called 
directrices of C. Unless C ts a circle, we have 
F x F', and x, x (and x’, z) actually intersect; 
hence d, d' exist. When C=21% is a parabola 
or a hyperbola (Figs. 2 and 3), we can similarly 
define foct (a parabola has only one focus, F, 
while a hyperbola has two foci, F, F’) and 
directrices (a parabola has only one directrix, 
d, while a hyperbola has two directrices, d, d"). 











Fig. 3 


Let X be a point on the plane z, let D(X) 
be the distance between the point X and a 
focus F, and let D,(X) be the distance between 
X and a directrix d. Then the curve C is the 
locus of the points X satisfying the condition 
D,(X)- e: D,(X), where e is a constant. We call 
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e the eccentricity of the conic section C. Ac- 
cording as C is an ellipse, a parabola, or a 
hyperbola, we have ecl, e-1, ore» 1.A 
circle is an ellipse whose eccentricity is zero. 
An ellipse is also characterized as the locus of 
X such that FX + F'X —2a; a hyperbola is the 
locus of X such that | FX — F'X|— 2a, where a 
is a positive constant. When there are two foci, 
the straight line FF’ is perpendicular to direc- 
trices d and d. 


C. Canonical Forms of Equations 


When C is a hyperbola or an ellipse that is not 
a circle, C has two foci, F and F’. In this case, 
the midpoint O of the segment FF' is the cen- 
ter of symmetry of C (when C is a circle, its 
center O is, of course, the center of symmetry 
of C). We call O the center of C; an ellipse or 
a hyperbola is called a central conic. If we 
choose a rectangular coordinate system (x, y) 
having O as the origin and FF' as x-axis, then 
the equation of C can be expressed in the 
form 


x?/a? * y*/b? 21, a,b>0. (1) 


According as C is an ellipse or a hyperbola, 
we take the + or — of the double sign. If 

C is an ellipse, we have a b. Furthermore, 
e — 4 / a? —b?/aif C is an ellipse and e = 

</a? +b?/a if C is a hyperbola. We also have 
F —(ae, 0) and F' =(—ae,0); the equations of 
directrices are x = 4 a/e. 

On the other hand, if C is a parabola, the 
straight line that is perpendicular to the direc- 
trix d and passes through F becomes the axis 
of symmetry of C. We call this straight line the 
axis of C; the intersection O of the axis and C 
is called the vertex of C. If we choose a rectan- 
gular coordinate system (x, y) having O as the 
origin and having the axis of C as the x-axis, 
the equation of C can be expressed in the form 


y*=4ax, a>0. (2) 


We call (1) and (2) the canonical (or stan- 
dard) forms of the equation of C. We call the 
associated coordinate system the canonical 
coordinate system. Suppose that C is an ellipse 
(hence a> b). Let A, A’ be points of intersec- 
tion of the x-axis and the ellipse and B, B’ be 
the points of intersection of the y-axis and the 
ellipse. We call AA’ the major axis of C and 
BB’ the minor axis of C. If C is a hyperbola 
and (x, y) is the canonical coordinate system, 
we call the x-axis the transverse axis and the y- 
axis the conjugate axis. If C is a central conic, 
the x- and y-axes of the canonical coordinate 
system are called the principal axes; if C is a 
parabola, the x-axis is sometimes called the 
principal axis. 
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D. Properties of Ellipses 


An ellipse may be considered the image of a 
circle under a *parallel projection. Conse- 
quently, the section of a circular cylinder by a 
plane is an ellipse. Also, if we are given a circle 
C and a fixed diameter D of C, an ellipse is 
obtained as the locus of the points X lying on 
lines PM, which are perpendicular to D, with 
PeC, M eD, satisfying the condition that the 
ratio PM: XM is constant. 

Suppose that we are given two concentric 
circles having the center at the origin O and 
with radii a, b. Let P, Q be points of intersec- 
tion of moving half-lines through O and the 
two circles. Then the locus of points X of 
intersection of the ordinates (lines parallel to 
the y-axis) passing through P and the ab- 
scissae (lines parallel to the x-axis) passing 
through Q is an ellipse (Fig. 4). Suppose that 
the equation of an ellipse is given by x?/a? 4- 
y?/b? — 1, with a b. Then the lengths of its 
major axis and minor axis are 2a and 2b, 
respectively. 





Fig. 4 


Given an ellipse and its center O, the circle 
with center O and diameter equal to the major 
axis of the ellipse is called the auxiliary circle 
of the ellipse. Given an ellipse C and its focus 
F, the auxiliary circle of C is the locus of the 
points X satisfying the condition that the line 
FX is perpendicular to a tangent line to C 
passing through X. Suppose that X is a point 
on an ellipse with foci F, F’. Let TT’ be the 
line tangent to the ellipse at X (X lies between 
T and T"). Then the angle 4 TXF' is equal to 
L T'XF (Fig. 5). Consequently, the rays start- 
ing from one focus of an ellipse and "reflected" 
by the ellipse converge on the other focus of 
the ellipse. Also, the product of the distances 
from two foci of an ellipse to an arbitrary tan- 
gent is constant and is equal to b>. 
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The ellipse C: x?/a? + y?/b? — 1 is expressed 
parametrically in the form 


x=acos@, y=bsiné. (3) 


We call the parameter 0 the eccentric angle of 
a point (x, y) on C. Consequently, C is a tJor- 
dan curve and divides the plane into two parts, 
the inside and the outside. The inside is the set 
of points (x, y) satisfying x?/a? + y*/b? <1, and 
the outside is the set of points (x, y) satisfying 
x?/a? + y?/b? > 1. The inside is a tconvex set. 
From a point Q outside C, two tangents to C 
can be drawn. The locus of points Q such that 
these two tangents are orthogonal is the circle 
x? + y? =a? +b?. We call this circle the direct 
circle. The area of the "sector" OAX formed by 
two points A(a, 0), X (acos0, b sin 0) (0 — 0) and 
the origin O is ab0/2 =(ab/2) Arccos(x/a); the 
length of the arc ÁX of the ellipse is repre- 
sented by the value of the felliptic integral 


8 
af J/ 1 —e?cos? 0 d0 — aE(n/2 — 0, e). 
o 


In particular, the area inside an ellipse is equal 
to zab, and the whole length of the ellipse is 
4aE(0, e). 

With respect to a polar coordinate system 
(r, 0) having the focus F(ae, 0) as the origin and 
the ray directed positively along the x-axis as 
the initial line, the equation of the ellipse C is 


l b? 
[=— 


Fe . (4) 
1+/cos@ a 


Here l is equal to half of the length of the 
chord that is perpendicular to the major axis 
and passes through the focus. (This chord is 
called the latus rectum of the ellipse.) Suppose 
that F is a fixed point and that X is a moving 
particle attracted toward F by a tcentral force 
inversely proportional to the square of the 
length of FX. Suppose further that X begins 
with an initial velocity whose direction is 
tangent to the ellipse C with focus F. Then X 
always moves on C, and the areal velocity 
described by the radius FX is constant 
(Kepler's second law). 


E. Properties of Hyperbolas 


Two straight lines x?/a? — y?/b? =0, that is, 
y/x = t b/a, are tasymptotes of the hyperbola 
C:x^[a? — y? [b? «1. The hyperbola C':x?/a? — 
y?/b? = —1 is called the conjugate hyperbola 

of C. When a=b, the asymptotes are ortho- 
gonal to each other, and C, C' are congruent. 
In this case, we call C a rectangular hyperbola 
(or equilateral hyperbola). When we draw 
parallels to asymptotes from a point X on C, 
the area of the parallelogram formed by these 
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lines and two asymptotes is constant. (In par- 
ticular, when C is a rectangular hyperbola, the 
equation of C becomes xy — K?/2 if we take 
two asymptotes as coordinate axes. The seg- 
ment cut off by the asymptotes on the tangent 
to C at X is divided equally at X. In the case 
of the hyperbola as well, the product of the 
distances from two foci to an arbitrary tangent 
is constant and is equal to b?, and the angle 
between two straight lines joining two foci to a 
point X on C is divided equally by the tangent 
at X. 

A hyperbola C is represented parametrically 
by 


x-—asecÜ, y=btané. (3) 


In this case also, we call 0 the eccentric angle 
of (x, y). If we use the thyperbolic functions 
and the parameter u, the equation of a hyper- 
bola can be written as 


x=acoshu, y=bsinhu (37) 


instead of (3). The area of a "sector" OAX 
formed by two points A(a, 0), X(x, y) on the 
hyperbola and the origin O is, in this case, 


abu ab ab x+./x*-a? 


X 
——-——A h—2—l 
5 75 Arecosh- —-, log S 


The length of the arc ÁX of the hyperbola is 
given by the elliptic integral 


x e? x? — a? 
dy, 
0 X —aà 


With respect to a polar coordinate system 
(r, 9) having the focus F(ae, 0) as origin and the 
ray directed positively along the x-axis as the 
initial line, the equation of the hyperbola C 
becomes 


l=—, (4) 


where / is equal to half the length of the chord 
passing through the focus and perpendicular 
to the principal axis. (This chord, too, is called 
the latus rectum.) 


F. Properties of Parabolas 


The curve described by a particle attracted by 
“gravitation” in a fixed direction and affected 
by no other force is a parabola (G. Galilei). 
The tangent at a point X on a parabola makes 
equal angles with the straight line joining F 
and X and the direction of the principal axis 
(Fig. 6). Consequently, if the rays starting from 
the focus of a parabola are "reflected" by the 
parabola, they all become rays parallel to the 
principal axis. Let X' be 
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Fig. 6 


the point of intersection of the tangent at 

X (xo, yo) and the x-axis, X" the point of inter- 
section of the normal at X and the x-axis, and 
X, the foot of the perpendicular from X to the 
x-axis. Then FX = FX’, AFXX' is an isosceles 
triangle, and X X' is divided equally by the y- 
axis. Consequently, the locus of the foot of a 
perpendicular from F to a tangent is the y- 
axis. Also, the length of subtangent X’X)= 
2xo, and the length of subnormal X" X, —2a— 
20F. Conversely, a curve whose length of 
subnormal is constant is a parabola. The locus 
of the midpoints of parallel chords of a para- 
bola is a straight line parallel to the principal 
axis. 

With respect to a polar coordinate system 
having the focus as origin and the ray directed 
positively along the x-axis as the initial line, 
the equation of a parabola is 


l 


F-———,  l-2a. (4^) 
1—cos@ 


The area bounded by a chord BC and an arc 
BC ofa parabola (Fig. 7) is equal to 4/3 the 
area of AABC, where A is the point of contact 
on the tangent of the parabola parallel to BC 
(Archimedes). Also, the length of the arc OX of 
parabola (2) is 


GEN 
Yo ta taiog 9 297 7. zo 


4a 2a , 


X having the coordinates (xo, yo). 





Fig. 7 


G. Conjugate Diameters 


The diameter is a straight line passing through 
the center of a central conic. The locus of the 
midpoints of the chords parallel to a diameter 
d is another diameter d', called conjugate to d. 
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Then the diameter conjugate to d' is d. The x- 
axis and y-axis of a canonical coordinate sys- 
tem form a set of conjugate diameters. Let 

2a’, 2b' be the lengths of the segments (some- 
times called conjugate diameters) cut off from 
d, d' by the curve or by the curve and the con- 
jugate one for hyperbo!as and by c, the angle 
between d and d'. Then the following relations 
hold (as to the double signs +, we take + in 
the case of an ellipse and — in the case ofa 
hyperbola): a’? +b’ =a? +b’, a/b' sin œ — ab. 
The product of the slopes of d, d’ is equal to 
+b?/a?. With respect to an oblique coordinate 
system having d and d' as axes, the equation of 
the curve is x?/a? + y?/b'? — 1. 


H. Confocal Conic Sections 


The set of ellipses and hyperbolas having two 
fixed points F, F' as foci is called the family of 
confocal central conics with foci F and F' (Fig. 
8). The family of confocal central conics con- 
taining the ellipse x?/a? + y?/b? — 1 is repre- 
sented parametrically by 


x? y? 


Euer ce 
eA bed 


There exist only one ellipse and only one 
hyperbola that pass through a point inside 
each quadrant (for example, a point (Xo, yo), Xo 
7 0, yo 7 0, inside the first quadrant) and be- 
long to the family of curves. The ellipses and 
hyperbolas of the same family cut cach other 
orthogonally. Thus parameters corresponding 
to ellipses and hyperbolas belonging to a 
family of confocal central conics define an 
orthogonal curvilinear coordinate system, 
called an felliptic coordinate system (— 90 
Coordinates). 





Fig. 8 


The set of parabolas having a fixed point F 
as focus and a straight line passing through F 
as axis is called a family of confocal parabolas 
(Fig. 9). The family of confocal parabolas 
containing y? = 4ax is the set of curves 


y! 24(a * A) (x * 4). 


Such families also give rise to orthogonal 
curvilinear coordinate systems. 
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Fig. 9 


I. Curves of the Second Order 


With respect to a rectangular coordinate sys- 
tem, a curve represented by an equation with 
real coefficients of the second degree with two 
variables x, y, 


ax? + 2hxy + by? - 2gx - 2fy - c -0, (5) 


where (a, h, b) 4 (0, 0,0), is called a curve of the 
second order. A curve of the second order is 
either an empty set, one point, one or two 
straight lines, or a conic section. For equation 
(5), we put 


a h 


Dach b 








ah o 
H D=|h b f (6) 
gf c 


and call D the discriminant of the curve of the 
second order. If D, #0, D #0, and the curve is 
not an empty set, then the curve is a central 
conic. If Dy >0, then the curve is an ellipse or 
an empty set. If Dy <0, then the curve is a 
hyperbola. If Dy =0, D #0, then the curve is a 
parabola. If D=0, D, >0, then the curve con- 
sists of one point. If D=0, D, <0, then the 
curve is two intersecting straight lines. If D= 
D, —0, then the curve is an empty set, one 
straight line, or two parallel straight lines. 


J. Poles and Polars 


Let F(x, y) » ax? + 2hxy + by? + 2gx - 2fy - c 
— 0 be the equation of a conic C and (xe, yo) 
the coordinates of a point P on the plane. À 
straight line P* having the equation 


axoX- h(xoy - xyg) - byo y *- g(x + Xo) 
+ f(y yg) c20 


is called the polar of P with respect to C (Fig. 
10). When the polar of a point P is J, we call P 
the pole of / and denote it by /*. In general, /* is 
uniquely determined by L and P** = P, [** = 

l. If Qe P*, then PeQ*. If P'el, then [*e P'*. 
When a straight line passing through P inter- 
sects C at X, Y and intersects P* at P’, then P, 
P' are tharmonic conjugate with respect to 

X, Y. In particular, if Pe P*, then PeC, and 
P* becomes the tangent of C at P. Givena 
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triangle APQR on the plane of C, we call 

the triangle with sides P*, Q*, R* the polar 
triangle of APQR. Let Q* N R* =P’, R* N P* 
=Q, and P* (1 Q* = R'. Then the three straight 
lines PU P', QUQ', RUR' meet at a point (M. 
Chasles). When the polar triangle of APOR 
coincides with itself, then APQR is called a 
self-polar triangle. The polar of a focus is a 
directrix. 





Fig. 10 


K. Curves of the Second Class 


When the coefficients u, v, w of a straight line 
ux +vy+w=0 satisfy an equation with real 
coefficients of the second order, 


Aw? + 2Huv + Bv? + 2Guw4+ 2Fow + Cw? =0, 
(5) 

where (A, H, B) z (0,0,0), the curve enveloped 
by these straight lines is called a curve of the 
second class. Let A be the discriminant defined 
analogously to D in (6) by using A, HB 
instead of a, h, b,.... A curve of the second class 
(with A z 0) is essentially the same as a curve of 
the second order with D #0. In order for the 
curve (5) with A #0 to coincide with the curve 
(5) with D #0, it is necessary and sufficient that 
A, B, C, F, G, H be proportional to the tco- 
factors of a, b, c, f, g, h in the determinant D 
given by (6). If A =0, then (5’) represents either 
the empty set, or a point (regarded as the set of 
straight lines passing through the point), or 
two points. 

From a projective point of view, a curve 
of the second order is defined as a locus of 
the point of intersection /(1/' =X of corre- 
sponding lines | and I’ when two tpencils of 
lines A(I, m, ...), A‘, m',...) passing through 
two different centers A and A’ are in corre- 
spondence under a *projective mapping f (J. 
Steiner) (Fig. 11). From this it can be proved 
that three points of intersection of three pairs 
of opposite sides (AB, DE), (BC, EF), (CD, FA) 
of a hexagon inscribed in a curve of the second 
order are on the same straight line (Pascal's 
theorem, Fig. 12). In particular, if the curve 
of the second order in this theorem consists 
of two straight lines, the theorem coincides 
with Pappus's theorem (Fig. 13). We call this 
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straight line / the Pascal line of ABCDEF. 
Given a set of six points A, B, C D E,Fona 
curve of the second order, by considering all 
possible combinations of the points, we get 60 
Pascal lines. A configuration consisting of 
these 60 lines is called Pascal's configuration, 
and has been studied by Steiner, Kirkman, 
and others. As a tdual to Pascal’s theorem, 
Brianchon's theorem holds: Three diagonals of 
a hexagon with a curve of the second class 
inscribed meet at a point (Fig. 14). 





Fig. 13 


Fig. 14 
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A. General Remarks 


A ttopological space X is said to be connected 
if there are no proper closed subsets A and B 
of X such that AN B= Ø and AU B— X (C. 
Jordan, Cours d' Analyse I, 1983). A subset S of 
X is connected if S considered as a tsubspace of 
X is connected. 

If a subset S of X is connected, then the 
tclosure S is also connected. Let {A,} be a 
family of connected subsets of X such that 
either A, Ap GI or ATI Aas GI for any 
pair A, and Aș. Then the union |), A, is con- 
nected. The continuous image of a connected 
set is connected. The *product space [T, X, of 
a family of connected spaces ( X,] is also con- 
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nected. Let {A,} be a family of connected 
subsets of X and A, be a connected subset of 
X. If A91 A, x Ø for every A, then the union 
A,U\ ), A, is also connected. For a point p of 
a topological space X, the union of all connec- 
ted subsets containing p is connected and is 
called the connected component of p (F. Haus- 
dorff, 1927). 

The n-dimensional Euclidean space E" 

(n 2 0) and the n-dimensional unit sphere S” 
(n> 1) are connected, whereas S°, consisting 
of two points, is not connected. E" (0) is 
connected for n 2 2, whereas E'\{0} is not 
connected. This fact implies that E! is not 
homeomorphic to E" (nz 2). A connected 
open subset of a topological space X is called 
a domain (or region) in X. 

A topological space X is said to be locally 
connected at a point p if for every open set U 
containing p, there is an open set V containing 
p and contained in the connected component 
of p in U. X is said to be locally connected if it 
is locally connected at each point of X. A 
space X is locally connected if and only if 
every connected component of every open 
subset is open in X. 

There are connected spaces that are not 
locally connected. For example, the comb 
space ((x, y)eR?|x 41/n and 0<y<1 (n= 
1,2,3,...) or0 x x« 1 and y=0} is not locally 
connected at (0, 1) (see Fig. 1). 


Wii : | 
Wl 
ili E 
Ir i 
Ili 



































OUR 





Fig. 1 
Comb space. 


B. Arcwise Connectedness 


Two points a and b of a topological space X 
are said to be joined by an arc in X if there is a 
continuous map f(t) of the closed interval J = 
[0, 1] into X such that f(0)=a and f(1)— b. 

A topological space X is said to be arewise 
connected (or path-connected) if every two 
points of X are joined by an arc in X. For 

a point p of X, the union of all arcwise- 
connected subsets in X containing p is arcwise 
connected and is called the arewise-connected 
component (or path-component) of p. A topo- 
logical space X is said to be locally arcwise 
connected at a point p if for every open subset 
U containing p, there is an open subset V 
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containing p such that every two points in V 
are joined by an arc in U. X is said to be lo- 
cally arcwise connected if it is locally arcwise 
connected at each point of X. 

An arcwise-connected space is connected, 
but the converse is not true. The tsinusoid 
{(x, y)eR?| y=sin 1/x and Oc x«1 or x «0 
and —1«y«l1]j is connected but not arcwise 
connected. À complete metric space is arc- 
wise connected if it is connected and locally 
connected. 


C. Simple Connectedness and n-Connectedness 


Denote by S” the n-dimensional unit sphere 
and by D"*! the (n+ 1)-dimensional unit disk. 
A topological space X is said to be n-connected 
if every continuous map f from S" to X is 
extendable over D"*! (m=O, 1,2, ..., n). 0- 
Connectedness is equivalent to arcwise con- 
nectedness. A 1-connected space is also called 
simply connected. (For simple connectedness 
and n-ply connectedness of plane domains — 
333 Plane Domains) A topological space X is 
said to be locally n-connected at a point p if for 
every open subset U containing p, there is an 
open subset V containing p and contained in 
U such that any continuous map f: $"— V is 
extendable to a continuous map f: D"*! — 
U (m —0, 1,2, ..., n). X is said to be locally 
n-connected if it is locally n-connected at 
each point of X. A space X is said to be w- 
connected (or locally «-connected) if X is n- 
connected (or locally n-connected) for every n. 
S” is (n—1)-connected but not n-connected. 
Similarly for E"*'\{0}. If X is n-connected 
(n> 1), then the tsuspension SX is (n+ 1)- 
connected and the tloop space QX is (n— 1)- 
connected. The Hawaiian earring | (x, vie 
R?|(x — 1/n y? 2 1/n? (n21,2,3, ...)] is 
arcwise connected but not locally simply con- 
nected. The ‘cone over the Hawaiian earring 
is simply connected, but not locally simply 
connected (Fig. 2). 





Fig. 2 
Hawaiian earring and its cone. 


A topological space X is said to be contrac- 
tible if the identity map 1, is thomotopic to a 
constant map c, to a point xo of X (K. Bor- 
suk, Fund. Math., 24, 1935). A topological 
space X is said to be locally contractible at a 
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point p of X, if for every open subset U con- 
taining p, there is an open subset V containing 
p and contained in U such that the inclusion 
i: V c U is homotopic to a constant map to a 
point in U. X is said to be locally contractible 
if it is locally contractible at each point of X. 

A contractible or locally contractible space 
is co-connected or locally w-connected, res- 
pectively. The n-dimensional simplex, the n- 
dimensional disk D", and the n-dimensional 
Euclidean space E" are contractible. More 
generally, a tconvex set in E" is contractible. 
*Topological manifolds and tpolyhedra are 
locally contractible. 


D. Continua and Discontinua 


A topological space is said to be totally dis- 
connected if each connected component con- 
sists of one point. Divide the closed interval I 
=[0, 1] into three equal parts, and let J,, and 
I,, be the closed intervals obtained from I by 
removing the middle open interval. As the next 
step, divide I,, and I, into three equal parts, 
respectively, and remove the middle open 
intervals. Inductively we obtain 2"*! closed 
intervals I,,, ; (i= 1, ..., 2"**) from 2" closed 
intervals I, ; by removing open intervals lying 
in their middles. Let C? =| JZ, 1, ; and C 

= (5, C9. Then C is called the Cantor dis- 
continuum or simply the Cantor set or the 
ternary set (G. Cantor, Math. Ann., 21 (1883)) 
(Fig. 3). 








by NO Ing In 
Fig. 3 
Cantor discontinuum. 


C is a subset of J consisting of points with 
coordinates t — (n,/3) - (n;/32) &- ... +(n,/3') 
+..., where n,=0 or 2. As a topological space 
C is homeomorphic to the Cartesian product 
of countably many copies of the discrete space 
D — (0,1). C has the power of the continuum c 
and is a compact, totally disconnected, tperfect 
set. The Cartesian product of infinitely many 
(countably or not) copies of D is called the 
general Cantor set. The continuous image of a 
general Cantor set is called a dyadic com- 
pactum. Compact metric spaces and compact 
groups are examples of dyadic compacta. 

A continuum is by definition a connected 
compact metric space consisting of more than 
one point. A metric space X is said to be well- 
chained if for every two points a, b and ¢>0 
there are points x,, X5, ..., X, , such that 
d(xj, X;44) <€ (Xp =a, x, =b). A well-chained 
compact metric space is a continum. 

Let K be a continuum containing two 
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points a and b. Then K is said to be irreducible 
between a and b if there is no proper subcon- 
tinuum of K containing a and b (L. Zoretti, 
Ann. Sci. Ecole Norm. Sup., 26 (1909)) (— 93 
Curves). 

A continuum K is said to be indecomposable 
if there are no proper subcontinua K ,, K, 
such that K=K,UK, (L. E. J. Brouwer, Math. 
Ann., 66 (1910)). Simple examples of inde- 
composable continua have been given by 
A. Denjoy, C. R. Acad. Sci. Paris, 151 (1910); 
K. Yoneyama, Tóhoku Math. J., 12 (1917); and 
B. Knaster, Fund. Math., 3 (1922). 

C. Kuratowski conjectured the following: 
If a plane continuum K ts homogeneous (that 
is, for any two points a, be K there exists a 
homeomorphism h: K 2 K with h(a)=b) then 
K is homeomorphic to the circle. A counter- 
example for this conjecture has been found by 
R. H. Bing and E. E. Moise. It is an indecom- 
posable, homogeneous, plane continuum and 
is called the pseudo-arc [8 ]. 
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Connections 


A. History 


The geometric notion of connections orig- 
inated with T. Levi-Civita’s parallelism (Rend. 
Circ. Mat. Palermo, 42 (1917)) and was later 
generalized to the notion of connections of 
differentiable fiber bundles. Notions such as 
affine connections, Riemannian connections, 
projective connections, and conformal connec- 
tions can be described in terms of bundles 
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constructed from the tangent bundles of dif- 
ferentiable manifolds. They are also standard 
examples of the Cartan connections formu- 
lated by E. Cartan and C. Ehresmann. 


B. Connections in Principal Bundles 


Let P —(P, x, M, G) be a differentiable *prin- 
cipal fiber bundle. (For the sake of conve- 
nience, we assume that differentiability always 
means that of class C?.) The total space P and 
the base space M are tdifferentiable manifolds, 
and the projection z is a differentiable map- 
ping. The tstructure group G is a tLie group 
and acts on P from the right as a transforma- 
tion group. On each fiber, G acts transitively 
without fixed points. For elements a, x in G, P, 
we write R,(x)= xa. The mappings induced on 
*tangent vector spaces by R, and z will be 
denoted by the same letters, namely R,: T,(P) 
> TA (P), n: T,(P) Tel ML The tangent vec- 
tor space T,(P) at each point x of P is mapped 
by the projection z onto the tangent vector 
space T,(M) at the point p=2(x) of M. The 
kernel of this mapping is denoted by V.(P), 
and each vector in V,(P) is said to be vertical. 
The kernel V,(P) is the totality of elements of 
T,(P) that are tangent to the fiber. 


C. Connections 


We say that a connection is given in P if for 
each point xe P, a subspace Q, of the tangent 
space 7,(P) is given in such a way that the 
following three conditions are satisfied: (i) 
T.(P) — V(P) * Q, (direct sum); (ii) R,(Q,)= 
Q,,(Q is invariant under G); and (iii) the map- 
ping x Q, is differentiable. A vector in Q, 

is said to be horizontal. 

Now suppose that X is an arbitrary tvector 
field on P. By condition (1), the value X, of X 
at each point x of P can be expressed uniquely 
as X, — Y, + Z,, where Y, e V,(P) and Ze, 
The vector fields Y and Z defined by Y, and Z, 
(x € P) are called the vertical and horizontal 
components of X, respectively. Condition (iii) 
implies that if X is a differentiable vector field, 
then its horizontal and vertical components 
are also differentiable vector fields. Let X be a 
vector field on the base space M. Since z de- 
fines an isomorphism of Q, and T,(M) (p= 
n(x)), we have a unique vector field X* on P 
such that (a) z(X*) — X and (b) X*eQ,. We 
call X* the lift of X, and it is invariant under 
G by condition (ii). 

Suppose that a connection is given in P. If C 
is a piecewise differentiable curve in the base 
space M, we can define a mapping ¢ that 
maps the fiber over the initial point p of C 
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onto the fiber over the endpoint q of C as 
follows: Take an arbitrary point x on the fiber 
at p. Then we have a unique curve C* in P 
starting at x such that (a) (C?) 2 C, and (b) 
each tangent vector to C? is horizontal. (C¥ is 
called a lift of C that starts at x.) The endpoint 
y of the curve C* belongs to the fiber over q. 
We set o (x) — y. Because C£, = R,(C*), the 
mapping o commutes with transformations of 
G. We call this mapping ¢ the parallel dis- 
placement or parallel translation along the 
curve C. 


D. Holonomy Groups 


Fix a point p in the base space. If C is a closed 
curve in M starting from p, the parallel dis- 
placement along C maps the fiber over p onto 
itself. So 1f we fix a point x on the fiber over p, 
x is transformed by the parallel displacement 
to a point xa (ae G). Thus each closed curve C 
starting from p determines an element a(x, C) 
of G. If C varies over the set of closed curves 
that start from p, the totality of such elements 
of G forms a subgroup of G. This subgroup is 
called the holonomy group of the connection 
defined over P with the reference point x. If M 
is connected, holonomy groups with different 
reference points are conjugate. In the above, if 
we choose as the closed curves C starting from 
p only those curves that are null-homotopic, 
the elements a(x, C) form a subgroup of the 
holonomy group. This is called the restricted 
holonomy group. The holonomy group is a tLie 
subgroup of the structure group, and its con- 
nected component containing the identity 
coincides with the restricted holonomy group. 
Holonomy groups are useful in the study of 
the behavior of connections. 


E. Connection Forms 


Let g be the Lie algebra (— 249 Lie Groups) of 
the structure group of G of a principal fiber 


-bundle P —(P, 2, M, G). For each A in g, the 1- 


parameter subgroup exptA (—o0 «t « oo) of G 
defines a *one-parameter group Rexpra of trans- 
formations on P, and it determines a vector 
field A* on P (— 105 Differentiable Mani- 
folds). Each element of the vector field A* 
is vertical at each point x on P, and the A* 
(A€g) at x generate V,(P). Moreover, for each 
element a of G we have R,(A*) — (ad(a !) A)*. 
For a connection in P, we define the connec- 
tion form @ on P with values in g by the fol- 
lowing: (i) @,(A*)=A (A€9), and (ii) o,(X) -0 
(X €Q,). The connection form w thus defined 
satisfies (iii) R*(@)=ad(a~')w (ae G), where 
Rž (œ) is the ‘differential form induced by the 
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transformation R, from the differential form 
w. Conversely, given a 1-form o with values in 
g that satisfies conditions (i) and (it), we can 
define a connection in P by defining vectors X 
such that w(X)=0 as the horizontal, and its 
connection form coincides with w. Thus giving 
a connection in P is equivalent to giving a 
connection form in P. 

In particular, when a principal fiber bundle 
P is trivial, i.e., when P= M x G, we can iden- 
tify the tangent vector space T,(P) at a point x 
=(p,g) of P with the direct sum of T,(M) and 
T,(G). If we set Q, = T,(M), then Q defines a 
connection in P= M x G. Such a connection is 
called flat. When a connection can always be 
expressed as above locally, it is called locally 
flat. Since each principal fiber bundle is locally 
a product fiber bundle, we see that locally 
there exists a connection. If the base space M 
is paracompact, we can show the existence of 
connections on P. 


F. Extension and Restriction of Connections 


When a principal fiber bundle P =(P, n, M, G) 
has a treduced fiber bundle P', we shall con- 
sider the relation between the connections of P 
and of P Let G' be a Lie subgroup of G and o 
its Lie algebra. We shall denote by j both the 
injection of G' into G and also the injection of 
o into g. If there exist a differentiable principal 
fiber bundle P 2 (P', x’, M, G’) and a differenti- 
able embedding f of P' into P such that xo f 
=n and f o R, — Rj4,0 f (ae G') are satisfied, 
then (P', f) is said to be a reduced fiber bundle 
of P. Then we have f, (4t) —j(A)7. for each 
Aég and xeP'. 

Suppose that a connection is given in P’; we 
denote the horizontal space at the point x of 
P' by Q}. At the point f(x) of P, we take f(Q) 
as the horizontal space and transform it by 
right translations of G. Thus we obtain a con- 
nection on P. Let oi and o be the correspond- 
ing connection forms. Then we have jow = 
f * (cv) on P'. Conversely, suppose that we 
are given a connection in P with the connec- 
tion form c. If the induced form f *(c) on 
P' has values always in j(q’), we can write 
f *(o)—joco', and o defines a connection in 
P'. In this case the connection in P is called 
an extension of the connection in P’, and the 
connection in P' is called the restriction of the 
connection in P. 


G. Curvature Forms 


Suppose that a principal fiber bundle P — 

(P, n, M, G) has a connection. Let F be a finite- 
dimensional vector space and « be a differen- 
tial form of degree k on P with values in F. We 
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define the covariant differential Da of a by 


(Da)(X;, o X1) (da)y(hX,, w+, AX, 41), 


where the X; are vector fields on P and h de- 
notes the projection to the horizontal compo- 
nent. Da is a differential form of degree k + 1 
on P with values in F. 

Let p: G— GL(F) be a trepresentation of a 
Lie group G on F. A differential form « on P 
with values in F is called a pseudotensorial 
form of type p if a satisfies R*(x) — p(a "lo 
(ae G). In particular, if a pseudotensorial form 
a satisfies ((A*)a — 0 for any Aeg (— 105 Dif- 
ferentiable Manifolds Q), it is called a tensorial 
form of type p. For each representation p of G, 
we can construct an associated vector bundle 
E over M with fiber F. A tensorial form of 
type p is identified with a differential form on 
M with values in E. If « is a pseudotensorial 
form of type p, then Da is a tensorial form of 
type p. 

For a connection form o on P, the co- 
variant differential Dw =Q of c is called the 
curvature form of the connection. Since @ is a 
pseudotensorial form of type ad, Q is a ten- 
sorial form of type ad. For the connection 
form we have the structure equation dc — 

— [o.c] 4- € [4,6]. Let X and Y be vector 
fields on M, and let X* and Y* be their lifts, 
respectively. Then we have o([X*, Y*])— 
Q(X*, Y*), which shows that the curvature 
form Q for X*, Y* gives the vertical compo- 
nent of [ X*, Y *]. 

It is known that the following three con- 
ditions for a connection are equivalent: (i) The 
connection is locally flat. (it) The curvature 
form vanishes. (iii) The restricted holonomy 
group is trivial (e, the identity group). 

The following two theorems are fundamental: 

(1) Suppose that a connection is given in a 
principal fiber bundle P =(P, n, M, G). Then 
the structure group of P can be reduced to the 
holonomy group [4,6]. In fact, for xe P, let 
P(x) be the set of points y in P that can be 
connected to x by a piecewise horizontal curve 
in P. Then P(x) gives a reduced fiber bundle of 
P, and the connection in P is an extension of a 
connection in P(x) [4,6]. 

(2) The Lie algebra of the holonomy group 
with a reference point x in P coincides with the 
vector subspace of g spanned by {Q,(X, Y)| 
ye P(x), X, Ye (P); [4,6]. 

The curvature form Q is used to express the 
tcharacteristic classes of the bundle P [1,2] 
(— 56 Characteristic Classes). 

In some cases, a connection in the principal 
fiber bundle induces a connection in an tas- 
sociated fiber bundle. In particular, when G is 
GL(n, R) or GL(n, C), we can define a connec- 
tion in any associated vector bundle. The 
notion of connections in vector bundles can be 
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defined more algebraically (M. F. Atiyah, 
Trans. Amer. Math. Soc., 85 (1957)) and can 
also be defined as a kind of differential oper- 
ator on vector bundles [8]. 


H. Affine Connections 


Let M be a differentiable manifold of dimen- 
sion n and P be the tbundle of tangent n- 
frames over M. Then P has the structure 
group GL(n, R), and it is the principal bundle 
associated with the tangent vector bundle of 
M, which consists of all tangent vectors of M. 
A connection in the bundle of tangent n- 
frames is called an affine connection (or linear 
connection) on M. An affine connection on M 
defines (as well as the curvature form Q) a new 
form O called the torsion form on P, which is 
given as follows: Let F be an n-dimensional 
vector space with a fixed basis (£,, £,, ..., Ča). 
Since the bundle of tangent n-frames P is 

the set of all bases (i.e., n-frames) (e,, ...,e,) 

in T,(M) at each point p of M, every point 

X —(e,, ..., €n) of P is given as a mapping x 

of F onto T,(M) (p =n{x)) defined by x: Zoe, 
We define differential form 0 of degree 1 with 
values in F on P by 6,(X)=x t(n, (X)) (Xe 
T,(P)). 0 is called a canonical 1-form of the 
bundle of tangent n-frames of the manifold M 
and has the following property: Any diffeo- 
morphism ¢ of M onto itself induces a bundle 
automorphism ó of P onto itself, and @ pre- 
serves 0, that is, 6*(0)=6. Conversely, we can 
show that any bundle automorphism of P that 
preserves 0 is induced by a diffeomorphism of 
the base space M. 

For an affine connection on M, we define 
the torsion form © by © = D9. © is a differen- 
tial form of degree 2 on P with values in F and 
satisfies R,O —a ^! - © (ae GL(n, BIL Further- 
more, we have the structure equation for ©, 
d0=[m,0]+ © [2,4,6]. 

For each element é in F, there exists a 
unique horizontal vector field B(£) on P such 
that 0(B(Z)) — C. B(£) is called the basic vector 
field corresponding to E At each point xc P, 
B(é,),,---, B(C,), form a basis of Q,. Let 
{A,,.--; Am} (m2 1?) be a basis of g — gl(n, R). 
Then at each point xe P, {(A#),, ...,(A*),, 
B(61),, .... B(5,),] is a basis of the tangent 
vector space T,(P). Thus the bundle P of 
frames is a tparallelizable manifold. The pro- 
jection to M of any integral curve of a basic 
vector field is a geodesic, which is defined in 
Section I [4]. 

An affine connection on M gives a parallel 
displacement of the tangent vector space of M 
as follows: Let C2 p, (0<t<1) bea curve in M 
and C* — x, be a lift of C to P. The parallel 
displacement of the tangent n-frame x, at Po 
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along the curve C is x,, and the mapping x,o 
Xo: T, (M) T, (M) is called the parallel dis- 
placement of the tangent space T, (M) onto 

T, (M) along C. It is easily seen that the map- 


ping is independent of the choice of lifts. 


I. Covariant Differentials 


Let C  (p,] (0 €t € 1) be a differentiable curve 
in M. If we have a vector Y, in T, (M) for each 
t and the correspondence t— Y, is differenti- 
able, then {Y,} is called a vector field along the 
curve C. For {Y,} we set 


Y, —lim (1/0 (92 Kean) Y), 


where q, , is the parallel displacement of 

T; (M) onto T, ,,(M) along the curve C. The 
vector field ( XL along C thus obtained is 
called the covariant derivative of { Y,}. ( Y,} is 
parallel along C; that is, ¥,=@po_,,( Yo) if and 
only if Y/ =0. In particular, if the tangent 
vectors to a curve C are parallel along C itself, 
then C is said to be a geodesic. 

Let X and Y be vector fields on a manifold 
M with an affine connection. The covariant 
derivative V, Y of the vector field Y in the 
direction of the vector field X is defined as 
follows: Let po be a point in M, C={p,} 
(—é<t<e) be an integral curve of X through 
Po, and {@,} be the parallel displacement along 
C. We set 


(Vx Y), =lim (1/) (o7 (Y,) — Ypo). 


Then Vy Y is also a vector field on M. 

The mapping (X, Y) V, Y satisfies the 
following three conditions: (i) V, Y is linear 
with respect to X and Y; (ii) Vj, Y= f- Vx Y; 
and (iii) Vx(fY)=f- V, Y -(Xf)- Y, where f is 
a differentiable function on M. Conversely, if a 
mapping satisfying conditions (i)- (iii) above is 
given, then there exists a unique affine connec- 
tion on M whose covariant derivative coin- 
cides with the given mapping [4, 6]. 

Fix a vector field Y. Then the mapping X > 
Vy Y defines a ttensor field of type (1, 1). This 
tensor field is called the covariant differential 
of Y and is denoted by VY. Now fix a vector 
field X. Then the mapping Y V, Y can be 
naturally extended to tensor fields of arbitrary 
type, and it commutes with the tcontraction of 
the tensors. For a tensor field K this is denoted 
by Ko V,K. Furthermore, the mapping X > 
Vy K is called the covariant differential of K 
and is denoted by VK. We call V, K the co- 
variant derivative of K in the direction of X. A 
tensor field K is invariant under parallel dis- 
placements if and only if VK 20 (— 417 Ten- 
sor Calculus). 
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J. Curvature Tensors and Torsion Tensors 


For an affine connection on M, the curvature 
tensor R and the torsion tensor T are defined 
by 


R(X, Y)(Z) a Vx(Vy Z) m. Vy(Vx Z) = Vix, y}(Z), 
T(X: Y)- VY — ,X —[X, Y], 


where X, Y, and Z are vector fields on M, and 
R and T are tensors of types (1, 3) and (1, 2), 
respectively. Also, in terms of the curvature 
form Q and the torsion form © on the bundle 
of tangent n-frames P over M, they can be 
defined by 


R(X, Y)(Z)ex '-Q,(X*, Y*): x(Z), 
D(X, Y)- x '(,(X*, Y*)), 


where z(x)— p, X, Ye T,(M), and X*, Y* are 
lifts of X, Y, respectively. The curvature tensor 
and the torsion tensor satisfy the relations 
R(X, Y)= — R(Y, X), T(X, Y)= — T(Y, X). 
Moreover, Bianchi's identities hold: 


S(R(X, Y)(Z)) 2 G(T(T(X, Y), Z) 
+(Vx T)(Y, Z)) 

and 

GON, R(Y, Z)+ R(T(X, Y), Z)) 20, 


where © denotes the sum of terms that are 
obtained by cyclic permutations of X, Y, Z 
[4]. For instance, in the case of a Riemannian 
connection (Section K), we have T «0, and 
Bianchi's identities reduce to 


R(X, Y)(Z) - R(Y, Z)(X) + R(Z, X)(Y)-0, 
(Vy R)(Y, Z) - (V, R(Z, X) - (V; R)(X, Y) «0. 


We now consider a system of coordinates 
(xl, ..., x") in an n-dimensional linear space M 
=R". The vector fields (X,, ..., X,) (X;=6/0x') 
form a basis for vector fields on M. If we set 
Vy, X;—0, we get an affine connection qn R”. 
For such a connection we have R —0, T — 0, 
and any straight line in R" is a geodesic with 
respect to this connection. The connection is 
called the canonical affine connection on R”. 
An affine connection on a manifold M satisfies 
R=0 and T=0 if and only if the connection 
on M is locally isomorphic to the canonical 
affine connection of R’. 

Let ọ be a diffeomorphism of a manifold M 
with an affine connection onto itself. We call o 
an affine transformation of M if the induced 
automorphism @ on the bundle of tangent n- 
frames preserves the connection. In terms of 
covariant differentials, this condition is equiva- 
lent to the condition V4, 9(Y) = (Vx Y) for 
any vector fields X, Y. An affine transforma- 
tion of the canonical affine connection on R” 
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is an ordinary affine transformation. For an 
affine connection on a manifold M, the set of 
all affine transformations forms a Lie group 
and acts on M as a Lie transformation group 
[4,5]. 

Let M be a manifold with an affine connec- 
tion. M is called an affine locally symmetric 
space if VR 20 and VT=0 are satisfied. These 
conditions are satisfied if and only if at each 
point p of M, there exist a neighborhood U of 
p and an affine transformation o of U such 
that 9? — 1 and p is the isolated fixed point of 
om If for each point p of M there exists an 
affine transformation of M such that o9? —1 
and p is an isolated fixed point, then M is 
called an affine symmetric space. A symmetric 
Riemannian space is a special case of this type 
(— 413 Symmetric Spaces). 

At each point p in a manifold M with an 
affine connection, we can choose local coordi- 
nates (x!, ..., x") such that x'(p) « 0 and the 
curve x'=a't (—ó «t «ó) is a geodesic for each 
(a!, ...,a") with D(a‘)? — 1. Such local coordi- 
nates are called geodesic coordinates at p [4]. 
With respect to geodesic coordinates, we have 
(Vy (X5), —0 (0/0x' X;). 


K. Riemannian Connections 


When a Riemannian metric g (— 364 Rieman- 
nian Manifolds) is given on a manifold M, it 
defines a metric on the tangent space T, (M) at 
each point p of M, and we can take ortho- 
normal bases in T,(M). The set P' of all ortho- 
normal bases of tangent spaces is a subset of 
the bundle P of tangent n-frames of M and 
forms a subbundle of P; its structure group is 
the *orthogonal group O(n), and P' gives a 
reduction of the bundle of tangent n-frames. 
Conversely, when a reduction of the bundle 
of frames to O(n) is given, we can define a 
Riemannian metric on M such that the re- 
duced bundle consists of all orthonormal 
frames. 

For a Riemannian metric g on M, there 
exists a unique affine connection on M such 
that (i) Vg =0, and (ii) the torsion tensor T 
vanishes [4]. This connection is called the 
Riemannian connection corresponding to g. 
The first condition is equivalent to the invar- 
iance of the Riemannian metric g under paral- 
lel displacement. Thus the affine connection 
transforms orthonormal bases on M to ortho- 
normal bases and induces a connection in the 
bundle P’. It is known that the restricted holo- 
nomy group of any Riemannian connection is 
a closed subgroup of O(n) [4]. An affine con- 


nection on a manifold M is called a metric 
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connection if it preserves a Riemannian metric 
g on M, Le, if it satisfies the condition (i). 


L. Representations in Local Coordinates 


(1) Let (x!, ..., x") be a local coordinate system 
in a manifold M and consider vector fields 
X;— 0/6x,. For an affine connection on M, 

the covariant derivative can be expressed as 


Vx (6) -Y KE? 


The Ij, are called coefficients of the affine 
connection with respect to the local coordinate 
system (x!, ..., x"). We denote by I5, the coeffi- 
cients of connection with respect to another 
local coordinate system (y!, ..., y"). Then on 
the intersection of their coordinate neighbor- 
hoods, we have 


p 2 
ef Y oy! 2 D GA ^x m.) 
ap dy! dy dyloy" 

Conversely, if the F} are given in each local 
coordinate system of M and satisfy this rela- 
tion on each intersection of their coordinate 
neighborhoods, then there exists a unique 
affine connection such that the coefficients of 
the connection are given by Ij, 

(2) The coefficients of the Riemannian con- 
nection corresponding to a Riemannian metric 
gen X gjdx'dx! on a manifold M are given by 


t» il SH 0gy, Ügg, 
Lu CU au 


and are called the Christoffel symbols. 

(3) With respect to each local coordinate 
system (x!, ..., x"), we express the torsion ten- 
sor T and the curvature tensor R of an affine 
connection by 


T=} Tj dx! & dx* @ X;, 


ijk 


R= X Rig dx! ®@ dx* Gi dx! Gi X;. 


ijkl 
The components T} and R} are given by 
Ti. = H CC D, 
wll Alëxf — OT Zdëc) 
+) Tm Ti T im). 
(4) Let K — (Kj!) be a tensor field of type 


(r, s). Then the covariant differential VK — 
(Kiir) is given by 


OTTEN: 
KA ies OK 7 /Ox 


In, 


(rein) 


ler 


B-1 


+ 


R 
Desi 
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(5) A curve x'— x'(t) is a geodesic if and only 


d? xi dxidx* 


=0 Iz], Zeg 
dt? ae di d ý 


(— 178 Geodesics, 417 Tensor Calculus). 


M. Cartan Connections 


Let M be a differentiable manifold of dimen- 
sion n. Consider a homogeneous space F = 
G/G' of the same dimension n, where G is a Lie 
group and G' is a closed subgroup of G (— 199 
Homogeneous Spaces). Let B=(B, M, F, G) be 
a fiber bundle over M with fiber F and struc- 
ture group G, and P=(P, M, G) be the prin- 
cipal fiber bundle associated with B. Suppose 
that there exists a cross section f over M to B. 
Then the structure group of P can be reduced 
to G'. We denote this reduced fiber bundle by 
P' =(P’, M, Gand the injection of P' into P 
by j (— 147 Fiber Bundles). 

Suppose that a connection is given in P. Its 
connection form w is a differential form of 
degree 1 on P with values in g, and the in- 
duced form oi —j*(co) is also a differential form 
of degree 1 on P' with values in g. We call the 
connection in P a Cartan connection on M 
with the fiber F 2 G/G' if at each point x of 
P’, o», gives an isomorphism of T,(P") onto g 
as linear spaces. Such a connection in P is 
equivalently defined as a 1-form œw on P' with 
values in g satisfying the following three con- 
ditions: (i) w’(A*)= A (A€a' (Lie algebra of 
G’)); (ii) R(o») - ad(a ^ "leo (ae G’); and (iii) œ 
gives an isomorphism of T(P’) onto g at each 
point xe P'. For such o, we can take a con- 
nection form w in P such that o =j* (w); o 
defines a Cartan connection. 


N. Soudures 


A cross section f over M to B gives a vector 
bundle T'(B) on M defined as follows: For 
each point p of M, the projection B M de- 
fines a mapping T;,,)(B)— T, (M). The kernel of 
this mapping is denoted by V,,,)(B). Then 
T'(B) - Vp (B) forms a vector bundle over 
M, and the dimension of its fibers is equal to 
n=dim F. 

A Cartan connection in P gives a bundle 
isomorphism between T'(B) and the tangent 
vector bundle T(M) of M as follows: Let x be 
an arbitrary point in P’, and put p— (x). The 
projection z: P'— M induces an isomorphism 
of T,(P^/V.(P") onto T, (M). On the other hand, 
w, gives an isomorphism of T,(P’)/V,(P’) onto 
g/g’. As a point in P’, x gives a mapping of F 
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— G/G' onto the fiber in B over p and sends 
the point {G’} in F to f(p). By this map- 

ping, T, (F) 2 a/g' is mapped isomorphically 
onto V;,,)(B). Combining these isomorphisms, 
we get an isomorphism between T,(M) and 
Kall that is independent of the choice of 
xe P over p. The set of such isomorphisms for 
p€ M defines a bundle isomorphism of T(M) 
and T'(B). If a fiber bundle B over M has an 
isomorphism such as above through a cross 
section, then B is said to have a soudure [3]. 

Conversely, if a fiber bundle B over M has a 
soudure with respect to a cross section f, then 
there exists a Cartan connection in P such that 
the soudure given by the connection coincides 
with the original one [3]. There are many 
Cartan connections in P with the given sou- 
dure. However, when F — G/G' is a symmetric 
space of compact type such that G is noncom- 
pact and contains the identity component of 
the group of isometries, we can determine 
uniquely the so-called normal Cartan connec- 
tion among the Cartan connections which 
gives rise to the given soudure [10]. 

For the tangent vector bundle T(M) of M, 
the fiber F is an n-dimensional linear space 
and can be expressed as F — G/G', where G is 
the taffine transformation group of F and G' 
= GL(n, R). Then T(M) has the 0-section over 
M, and there exists a natural soudure. Fur- 
thermore, an affine connection on M canoni- 
cally induces a Cartan connection on M with 
the fiber F = G/G' [3]. 

For a Cartan connection on M, we can 
introduce the notion of development of a curve 
in M into the fiber and also the notion of 
completeness [3]. 


O. Projective Connections 


Let F, =G,/G, and F, =G,/G, be homoge- 
neous spaces with dim F, 2 dim F, =n. Sup- 
pose that G, ts a Lie subgroup of G, and G; is 
contained in G; by the injection. Then we have 
a canonical injection F, >F, (F; is an open 
subset of F, by the assumption). 

Suppose that a fiber bundle B, with fiber F, 
over M has a cross section f,. Using f;, we 
can construct a bundle B, with fiber F; over M 
which also has a cross section f}. The prin- 
cipal bundle of B, is given by extending the 
structure groups from the principal bundle 
of B,. We can show that if B, has a soudure 
with respect to the cross section f}, then B, 
also has a soudure with respect to the cross sec- 
tion f,. A Cartan connection in the principal 
fiber bundle P, associated with B, that is com- 
patible with a given soudure on B, induces 
a Cartan connection in the principal fiber 
bundle P, associated with B,, which then 
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induces a soudure on B,. The latter is called a 
Cartan connection induced from the former. 

Let F, be an n-dimensional linear space and 
F, be the real *projective space of dimension n. 
Then the affine transformation group of F, 
can be embedded into the projective transfor- 
mation group of F,. Thus the tangent vector 
bundle of a manifold M induces a fiber bundle 
over M with the n-dimensional projective 
space as its fiber. A Cartan connection in this 
fiber bundle is called a projective connection on 
M. By the argument in this section, we see that 
every affine connection on M induces a projec- 
tive connection on M. 

Given two affine connections on M, we 
denote by V and V' their corresponding covar- 
iant differentials. The two affine connections 
on M induce the same projective connection 
on M if and only if there exists a differential 
form ọ of degree 1 on M such that V, Y — 

Vy Y =e(X)Y¥+ o(Y)X for any vector fields X, 
Y{7]. If a diffeomorphism q of M preserves 
the projective connection induced by an affine 
connection in M, then o maps geodesics of M 
into geodesics. 


P. Conformal Connections 


Let F, be an n-dimensional Euclidean space 
and F, an n-dimensional sphere (a *Àconformal 
space). We can embed the group of tisometries 
of F, canonically into the group of tconformal 
transformations of F,. A Riemannian metric of 
the tangent vector bundle of a manifold M of 
dimension n gives a fiber bundle over M with 
fiber F,. A Cartan connection in this fiber 
bundle is called a conformal connection on M; 
a Riemannian connection on M induces a 
conformal connection on M. 

Two Riemannian metrics g}, ga on M in- 
duce the same conformal connection on M if 
and only if there exists a positive function f 
on M such that g, = fg,. Thus for a Rieman- 
nian manifold M with metric tensor g,, a dif- 
feomorphism q of M such that o*(g)— fg, 
leaves invariant the conformal connection 
induced by g,. Such a q is called a conformal 
transformation of M with respect to the given 
Riemannian metric g,. 

For a Riemannian manifold M with metric 
tensor g, we define Weyl's conformal curvature 
tensor W by 


1 : 
Wia= Rau +o 5 (Rat — Rak + gy Ri — gj Ri) 


R 


€: (n-Dm—2 EI à E gi 99). 


where the RÄ, and R; are components of the 
curvature tensor and Ricci tensor, respectively, 


307 


and R is the scalar curvature (— 364 Rieman- 
nian Manifolds, 417 Tensor Calculus). When 
dim M 23, the conformal connection induced 
by g on M is locally flat if and only if the 
conformal curvature tensor vanishes [7]. 


Q. Yang-Mills G-Connection 


Let P —(P, n, M,G) be a differentiable principal 
fiber bundle over a compact oriented Rieman- 
nian manifold M with group G. Then fiber 
bundles Gp =P x ,G and gp=P x 449 asso- 
ciated with P are induced naturally from the 
group conjugation c: G— Aut(G) and the ad- 
joint representation Ad: G— Aut(o), respec- 
tively. A (local) section of Gp is called a (local) 
gauge transformation of P. The set of all global 
gauge transformations, which is denoted by 
G,, has a group structure. 

A locally faithful representation p of G to 
an n-dimensional complex vector space F with 
a fixed basis (£,, ..., £,) defines a differenti- 
able complex vector bundle E =P x „F asso- 
ciated with P. Every point x of P is identified 
with a linear mapping x of F onto the fiber 
Exx defined by ¥:č;—>e;, where e; denotes the 
equivalence class of (x, čp eP x F, 1 «in. 

In a manner similar to the case of an affine 
connection, a connection in P with connection 
form o gives a notion of parallel displacement 
of E as follows: Let c =p, (O €t € 1) be a curve 
in M and c* =x, bea lift of c to P. The map- 
ping X, oX9! : E,, E, is called the parallel 
displacement of E,, onto E, along c. 

Let X be a vector field on M and o bea 
differentiable section of E. The covariant de- 
rivative Vy q of ọ in the direction of X is de- 
fined as follows: Let py be a point of M, c— p, 
(—e<t< —&) be an integral curve of X through 
po and c* — x, be a lift of c to P. We set 


Z denE 2L 
(Vx Oil = lim Pac O Xo : (Pp) UA Pp,)- 


Then V, is also a differentiable section of E. 
The mapping (X, 9) V, o satisfies the fol- 
lowing conditions: (i) Vy q is linear with respect 
to X and 9; (ii) V. — f Vy; and (iii) Vy(fo) = 
LTA f Vy 9, where f is a differentiable func- 

tion on M. From these conditions it is seen 
that the mapping X 9 V, o for a fixed section 
Q of E defines a differential form of degree 
one with values in E, denoted by Mo V is 
called a G-connection on E (induced from the 
connection in P). A linear operator d": (A? & 
E)S T (A?*! Q E), defined by d" (a & g)=da & 
Q -(—1) aA Vo for a differential form a of 
degree p and a differentiable section q of E is 
called a covariant exterior differentiation. 
Here I' (A? & E) denotes the set of all differ- 
ential forms of degree p with values in E. 
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The curvature form RY of V is defined by 
R(X, Y)o Mell — VY(Vx 9)— Vix. yi. 
where X and Y are vector fields on M and 
Q is a differentiable section of E. In terms of 
the curvature form Q of the connection in P, 
it can be defined also by 


RY(X, Y)p 2 xoQ,(X*, Y). x !(), 


where X and Y are in T, M and ver ‘(p) and 
X*, Y* are lifts of X, Y to P, respectively, and 
ọ is an element of E,. Since Q is a tensorial 
form of type ad and the differential p, of 

p induces a faithful representation of g to 
al(F), RV can be regarded as a differential 
form of degree two with values in the bundle 
Gp — P X a49. 

The curvature form R satisfies Bianchi's 
identity: dV RV =0, where d is a covariant 
exterior differentiation with respect to the G- 
connection on gp canonically induced from 
the connection in P. 

We denote by € the set of all G-connections 
on E Now let G be a compact semisimple Lie 
group. A functional ^:€ R defined by V 
+ S(V)= — Al tr(RV ^ * RY) is called the 
action integral (Yang-Mills functional) of V, 
where * is Hodge's star operator, given by the 
fixed orientation of M. 

The group £, acts on € by Vo fl oVof 
for f o 4, and then the curvature form is trans- 
formed by this action as RY Ad(f ')RY. 
Thus . is 4,-invariant. 

A connection V is called a Yang-Mills G- 
connection if V is a critical point of ^. The 
Euler-Lagrange equation of 7 is given by 
dV RY «0 by the aid of the formal adjoint 
operator d'V" of d". This equation, called the 
Yang-Mills equation, is a system of non- 
linear second-order elliptic partial differential 
equations. 

When M is a 4-dimensional vector space R* 
with Minkowskian metric and G is the Abelian 
group U(1), the Yang-Mills equation coincides 
with Maxwell's equations for an electromag- 
netic field. Thus the Yang-Mills equation for a 
non-Abelian group G is a natural extension of 
Maxwell’s equations. In fact, the theory of 
Yang-Mills connections has its origin in the 
field theory of physics [11]. 

In the case of dim M =4, special Yang- 
Mills G-connections occur. A G-connection 
V satisfying the condition * RY = RY (resp. a 
RY = — RY) as a differential form of degree 
two is called a self-dual (resp. anti-self-dual) 
G-connection. From Bianchi's identity and 
the expression of d di = —«od" o «, it fol- 
lows that every self-dual (anti-self-dual) G- 
connection gives a solution to the Yang-Mills 
equation. Since the first Pontryagin number 
p, (E) is given by p,(E) 2 — (1/412) f tr(RY A 
RY) by virtue of the Chern-Weil theorem, the 
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action integral satisfies (V) z 2x?|p,(E)| for 
every V in € and the equality holds if and 
only if V is self-dual or anti-self-dual. 

Explicit forms have been obtained for (anti-) 
self-dual connections over the 4-sphere S* by 
many interesting methods [12-15]. And 
it has been shown that moduli space of self- 
dual G-connections (i.e., the set of all solu- 
tions to the Yang-Mills equation modulo 4,;) 
over S* has the structure of a Hausdorff mani- 
fold of dimension p, (g5) — dim G for every 
principal bundle P with group G [16, 17]. It is 
not yet known whether there exists a Yang- 
Mills G-connection over ST whose holonomy 
group is an open subgroup of G and which 
is neither self-dual nor anti-self-dual [18]. 

The following is one of the few known facts 
concerning the properties of Yang-Mills G- 
connections: If a Yang-Mills G-connection, 

G — SU), SU(3), or O(3), over S* is weakly 
stable, i.e., if the second variation of Z is posi- 
tive semidefinite, then it is self-dual or anti- 
self-dual [19]. 
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81 (1.11) 
Constructive Ordinal 
Numbers 


A. General Remarks 


To extend the theory of frecursive functions to 
transfinite ordinal numbers, A Church and S. 
C. Kleene [1] considered the set of effectively 
accessible ordinal numbers and defined the 
concept of constructive ordinal numbers as 
explained later in this article. Their work 
became the basis of fruitful research by Kleene 
W. Markwald, C. Spector, and others [2-5]. 
A constructive ordinal number was origin- 
ally introduced as an "expression" in a tformal 
system utilizing the A-notation. Since such a 
system is “effective,” we can arithmetize it 
utilizing *Gódel numbers and assume from the 
outset that each ordinal number is represent- 
able by a natural number. The notations, 
terminology, and theorems mentioned in this 
article are mainly those for constructive 
ordinal numbers of the second number class. 


* 


B. Definition and Fundamental Properties 


We call a set of natural numbers satisfying 
conditions (I) and (II) a system of notations for 
ordinal numbers, and an ordinal number a 
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constructive ordinal number when it is represent- 
able by a natural number belonging to such 

a system of notations: (I) No natural number 
represents two distinct ordinal numbers. (II) 
There are three tpartial recursive functions 
K(x), P(x), and Q(x, n) defined as follows: (i) for 
any natural number x representing X, K(x) 
takes the value 0, 1, or 2 according as X is 
zero, an tisolated ordinal number, or a tlimit 
ordinal number, respectively; (ii) when X is the 
ordinal number timmediately after an ordinal 
number Y, P(x) represents Y for any natural 
number x representing X; (iii) when X isa 
limit ordinal number, for any natural number 
x representing X there exists an increasing 
sequence {Y,} of ordinal numbers such that 

X — lim, Y, and Q(x, n) represents Y, for each 
natural number n. 

The system called S, by Kleene is the most 
useful and convenient among systems of nota- 
tions for ordinal numbers. Let ng be a tprimi- 
tive recursive function of the variable n defined 
by 09=1, (n+ 1)o = 2". The fundamental 
notion acO and relation a < ob of the system 
S, are introduced by the following inductive 
definition: (1) 1 e0; (2) if y€O, then 2”€O and 
y € o2*; (3) if a sequence {y,} of natural num- 
bers has the property that for each n, Ge 
and y, « o y,,,, and if y is a Godel number 
that defines y, recursively as a function of 
Ng (1.€., y, S (y) (ng) (— 356 Recursive 
Functions)), then 3: 5" € O, and for each n, 
ya € 93: P; (A) if x, y, zeO, x « oy, and 
y<oz, then x< gz; (5) a€ O, a < ob hold only 
when they follow from (1)-(4). 

Now, for brevity, we write a « ob for 
(a « ob) v (a=b). The following propositions 
hold for S4: (1) If a < ob, then b¥ 1; (2) If 
a< ob, then a, beO; (3) If a < 92", then ax oy; 
(4) If a< 53: 5", then there is a natural number 
n such that a € o Ven where y, = ( y] (noy; (5) If 
a€O, then 1 € ga; (6) If acO, then for any 
*number-theoretic function a such that a(0) = 
a and Vn(a(n) z 1 5a(n-- 1) « ga(n)), there 
is a k such that a(k)— 1; (7) For each a, 1 
(a « ga); (8) If ceO, a € oc, and b € oc, then 
a« ob or a=b or b « ga. 

Each member a of O represents an ordinal 
number |a] as follows: |1| 20; |2"| 2 |y|-- 1 for 
yeO; |3- 5"|z lim, |y,| for 3: 5" €O, where y, = 
{y} (no). Let b be a member of O. Then |a| < 
|b| when a < ob; and conversely, for each o < 
|b], there is a number a such that |a| 2 « 
and a < ob. Hence the set (a|a « ob] isa 
twell-ordered set with respect to <Q, and its 
*order type is |b|. The least number £ greater 
than lol for every member a of O is the least 
ordinal number that is not constructive. It is 
denoted by oX* (Church and Kleene denoted 
it by œ). There is a subsystem of S, that is well 
ordered with respect to eo and contains a 
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unique notation for each constructive ordinal 
number a. For such a subsystem we can take 
a IT} set K such that K is trecursive in O (S. 
Feferman and Spector, R. O. Gandy) (— 356 
Recursive Functions H). 


C. Constructive Ordinals and the Kleene 
Hierarchy 


Let R(x, y) be a *predicate on natural numbers. 
We write x € yy for any natural numbers x, y 
for which R(x, y) holds. We consider only the 
case where «gis a ‘linear ordering on the set 
Dg — Íx|3y(R(x, y) v RCy, x))}. If D, is a well- 
ordered set with respect to <p, we denote its 
torder type by |R|. (1) For each (constructive) 
ordinal number x « o, there is a tgeneral 
recursive (more strictly, *primitive recursive) 
predicate R such that |R| = « (Markwald, 
Spector, Kleene [3]). (2) Conversely, if R is a 
*hyperarithmetic predicate (e.g., R is general 
recursive), then |R| « oX£* (Markwald, Spec- 
tor). The following theorems are the most 
fruitful ones in the theory of constructive 
ordinal numbers, and they fully support the 
validity of Kleene's idea of tanalytic hierarchy. 
(3) The set O is I7] (— 356 Recursive Func- 
tions H), and so is the predicate a < gb. Name- 
ly, for O, there is a primitive recursive predi- 
cate Rio, x, a) such that 


ac O <> Va3xR(a, x, a) 


(Kleene [3]). (4) For each ordinal number a < 
oX*, the set fa| xz lol) is a hyperarithmetic 
set (Spector). (5) O is a tcomplete set for 771. 
That is, for any HI set E, there is a primitive 
recursive function @ such that ae E e o(a)e O 
(Kleene [3]). Accordingly, O is not a X1 set 
(— 356 Recursive Functions H). 


D. Relativization and Extension 


Given a (number-theoretic) predicate Q of one 
variable (or a set of natural numbers), we can 
trelativize to Q the notion of constructive 
ordinal numbers. The least ordinal that is not 
constructive relative to Q is denoted by o. 
The relativization to Q of the fundamental 
notion acO and relation a < gb of the system 
S, of notations are denoted by aeO? and a « 
o9b, respectively. Then we can relativize the 
results of the preceding paragraphs to Q. For 
example, as the relativization of (3), we have 
the following: There is a predicate R (a, x, x) 
which is primitive recursive tuniformly in Q 
such that 


ac 09 «» Vx3x R9(a, x, a). 


When Q is hyperarithmetic, we have no gen- 
eralization of the constructive ordinal numbers 
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by relativizing them to Q, that is, w = œf" 
holds (Spector). Now by relativizing to O the 
concept of constructive ordinal numbers, we 
obtain a proper extension of it (wf* < c2), and 
then, performing such extensions successively, 
we have a (transfinite) sequence O, 0°, 
OT... On the other hand, we can extend the 
constructive ordinal numbers to those corre- 
sponding to any like higher number class, 
beyond the second, in which partial recursive 
functions are used at limit levels to provide an 
"accessibility" mapping from previously de- 
fined number classes. There are several exten- 
sions done by Church and Kleene, H. C. 
Wang, D. L. Kreider and H. Rogers, Jr., H. 
Putnam, W. Richter, A. Kino and G. Takeuti, 
and others. Richter [9] has shown that these 
two ways of extending the constructive 
ordinals are equivalent, provided the sets of 
notations for the higher number classes satisfy 
certain natural conditions. Specifically, the 
ordinals of the Addison and Kleene [6] con- 
structive third number class are exactly the 
ordinals less than c? and the set O;, of nota- 
tions for those is recursively isomorphic to 

O° (Richter). 


E. Constructive Ordinals in Higher Number 
Classes 


Putnum [8] has defined a system C of nota- 
tions for constructive ordinals of the Cantor 
higher number classes, improving that of 
Kreider and Rogers [7], as follows: N, is the 
set of notations for the ordinal number a; if 
for some a, xe N, let |x| 2 ué [xe N], which 

is called a hyperconstructive ordinal; and 

let C, - UT N;|é « aj. Define i to be a Gödel 
number of the tidentity function, and n to be 
an index in C, if 3”: 5'e C, for some t. There 
are four cases: 

Case 1. a=0. Then N, = {1}. 

Case 2. «= ß + 1, where N; is already defined. 
Then N, — (2*|xe Np}. 

Case 3. « is a limit ordinal such that N; is 
already defined for all €<«a, and there exists 
an ordinal fj «a such that 3°: 5'e N; for some 
a, and a partial recursive function f giving an 
order-preserving *cofinal mapping (o.p.c.m.) 
from C, into C,. Then N, is taken to be the set 
of all numbers 3^: 5" such that 3%: S'e N, (where 
B is any ordinal with the above property) and 
{n} is an o.p.c.m. from C, into C,. 

Case 4. a is a limit ordinal such that N; is 
already defined for all č < o, there is no f as in 
Case 3, but there is a number ae C, which is 
not an index in C,. Let fij be the least f for 
which such a’s belong to N,. Then N,— 

(3^: 5"|ae Ng, and {n} is an o.p.c.m. from C, 
into C,}. 
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We have that xe N, only as required by Cases 
1—4. Finally, let wç be the least ordinal num- 
ber which is not hyperconstructive, and let 
C = Coc 

Extending Kleene's H, for ye O (— 356 
Recursive Functions H), Putnum has defined a 
hierarchy H, for xe C, and shown that hyper- 
constructive ordinal numbers are uniqueness 
ordinals, i.e., for x, ye C, if |x| = |y|, then H, 
and H, are of the same tdegree of (recursive) 
unsolvability. E, is the type-2 object (— 356 
Recursive Functions F) introduced by T: 
Tugué such that E, («)=0 if VB3x[a(f(x)) - 0]; 
otherwise, E, (x) — 1. It is known that coc — 
c*:, where «c: is the least ordinal which is 
not the order type of any well-ordering recur- 
sive in E, (Richter [10]). 
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82 (XIII.18) 
Contact Transformations 


A. General Remarks 


A transformation of 2n + 1 variables z, x;, p; 
—1,2,...,n) 
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P;— Pí(z, xi, X5, ... 


is called a contact transformation in the (n + 1)- 
dimensional space R"*! with the coordinate 
system (Z, x,, ..., x,) if the ttotal differential 
equation 

dz —p, dx, —pa3dx4 —...—p,dx,—0 (2) 
is invariant under the transformation, i.e., if 
the equality 

= p(dz — p, dx, — p; dx; — ...— p, dx,) (3) 


holds identically for a suitable nonzero func- 
tion p of z, x, pj. Here we assume that (1) has 
an inverse transformation. Using Lagrange's 
bracket 


a= E) SEA 


din o 
dx) dx, Pr 





we see that (1) is a contact transformation if 
and only if [X;, X,]=LX;, Z] 2 [ P, P,] - 0, 
IP X,] =p, LP, Z] = pP, where 6, is *Kro- 
necker's delta. 

From this fact it follows that the composite 
of two contact transformations and the inverse 
transformation of a contact transformation are 
also contact transformations. Since the iden- 
tity transformation Z =z, X;—x;, P,—p;isa 
contact transformation, the set of all contact 
transformations forms an infinite-dimensional 
*topological group. Given a set of scalars 
Di», Da a pair consisting of a point (z, xj) 
and an n-dimensional hyperplane z* —z — 
ZE, p(xf — xj in an (n+ 1)-dimensional space 
is called a hypersurface element, and the set 
of hypersurface elements satisfying (2) is called 
a union of hypersurface elements. Using these 
concepts, we can state that a transformation 
(1) of coordinates z, xj, pj (j 2 1,2, ...,n)isa 
contact transformation if it transforms each 
union of hypersurface elements into another 
one. Consequently, if two n-dimensional 
hypersurfaces are tangent at a point (z, xj) in 
the (n+ 1)-dimensional space, their images 
under a contact transformation, which are 
again two n-dimensional hypersurfaces, are 
tangent at the image point of (z, xj. The name 
“contact transformation” is derived from this 
fact. 

For instance, the tcorrelation with respect 
to a hypersurface of the second order gives a 
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contact transformation. In fact, from the re- 
lation between ‘poles and tpolar lines with 
respect to the parabola x? +2y=0 in a plane, 
we have Legendre's transformation X — — p, 
Y=xp—y, P= —x(p=—x). 

In general, an invertible transformation 
defined by the three relations Q(x, y, X, Y)=0, 
0Q/0X + POQ/OY =0, 6Q/éx + peQ/dy=0 
derived from a function Q(x, y, X, Y) is a con- 
tact transformation. The function Q is called 
the generating function of this transformation. 
In this transformation, to each point (Xo, yg) 
there corresponds a curve Q(X9, Yo, X, Y)= 
0. These results are valid also in the case of 
several variables. For instance, in an (n+ 1)- 
dimensional space, a transformation Z =z — 
XQpi — Xy Xi = Py, Xy = Bu Kya = 
EE E Xp tr = E PS X, 
Pi4,=Pyi1,---,P, =p, represents a contact 
transformation. Here v is an integer between 
1 and n. In the case n=2, v=2, this transfor- 
mation reduces to a Legendre transforma- 
tion; and in the case n=2, v— 1, it is called 
Ampéere's transformation (— Appendix A, 
Table 15.IV). 


B. Canonical Transformations 


A transformation of 2n variables x,, p; (j= 
1,2,...,n) 


Xj Xj(x1, X5, E Dis P2 ++- Pn) 
Pj2 Ben, Ka Xm P1; P2 -<-s Pn) 

]j2L2,..,n, (4) 
is called a canonical transformation if the tdif- 
ferential form 27.., (Pd X;— p;dx;) is exact in 


Xj, pj, Le., if there exists a function U of x, p 
such that 


2, 4X, — p,dx)- dU. (5) 
[- 


Let (1) denote a contact transformation, and 
let 2 denote a new variable which is different 
from zero. Set xy —z, Po =A, Xj— xj, Dj= — AD; 
(j— 1,2, ..., n) and define a transformation of 
(2n+ 2) variables by 





Xo(X, p) 9 Z(z, x. p), 
Po(X, p) = Po/p(z, x, p), Xj p) = Xj p), 
P(X, p) = — Po Pilz, x, p)/p(z, x, p) 
j=1,2,...,n, (6) 


where x, p, X, and p denote points (x,, X2, 


a Xn) (Dis P25 s Pn) (Xos X55: Xn) and (Da. 
Diss BA respectively. Then the identity (3) 
when multiplied by 4 on both sides becomes 


P,dX, +P, dX,  ...-- B. dX, 


—podxo t p,dx, t ... + p,dx,. (7) 
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Thus the transformation (6) represents a 

canonical transformation (U — constant). 

Moreover, from the definition the relation 

(6) is homogeneous, i.e., for all 4240, 

XjG, up) =X; (xP) BS, up) = us, 
j=9,1,...,n. 


Conversely, let a transformation of (2n + 2) 
variables 


X,=X,(X,p), Be PAX, B), 


j=0,1,...,n, (8) 


be a homogeneous canonical transformation. 
Define a transformation of (2n 4- 1) variables 
Z, Xj Dj (j=1,2, sf) by 


Z(z, x, p)» Xo(z, Xy, us A —AP 15 «+> ~APn)s 


l/p(z. x. p) 


Xp As —Ap;, mE 


cË Z, Xise’ =A ` 
i o(Z, X1 Pn) 


X j(Z, x, p) Xj(Z, X15 ++ Xns A dÉ —ADn)s 


P,(z, x. p) 


= -Ê B(z x,, sae 
A 


4 


Xa —Ap;, o — Àp,). 


j2L2,..,n. (9) 


Since the necessary and sufficient condition for 
(8) to be a homogeneous canonical transfor- 
mation is that (7) hold, it follows that 


À 
—(dZ — P,dX,— ...— P,dX,) 
p 


—A(dz — p, dx, J eee — p, dx,). 


Thus the transformation (9) represents a con- 
tact transformation. Therefore the most general 
contact transformation of (2n + 1) variables 
and the most general homogeneous canonical 
transformation are identical concepts, differing 
only in the choice of notation. 

The necessary and sufficient condition for 
(4) to be a canonical transformation is ex- 
pressed by the relations 


(Xj, X) 0, (X;, P) = din; (P, P,)=0, 


j,k=1,2,...,n, (10) 


where (^, +) is Poisson's bracket, i.e., for a pair 
of functions f(x, p), g(x, p), 

"(0f 0g Of Oh 
G=} E , 

jai \Ox; Op; 

Denote Poisson’s bracket in the coordinate 
system X,, Kinn X,, Pi, Bas B, by (^, rt, 
and for f(x, p) and g(x, p) denote their trans- 
formations by f'(X, P) and g'(X, P). Then the 
relation (10) is equivalent to 


(fg) - (f',g') for all f and g. 
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From this fact it follows that the set of all 
canonical transformations forms an infinite- 
dimensional topological group. 

Suppose that we can take x,, x5. ..., Xn, X, 
X», ..., X, as 2n independent variables. Denote 
by Q (x, X) the function U of (5) represented 
by x, X. By means of this Q,, 2n functions 
which give a canonical transformation (4) are 
derived from the relations 


oO, oO, 
= — = , KN e 5 . 
n ee Pax a S 





2 
Conversely, for Q(x, X) such that det ( ox; ai 
#0, a transformation determined by the 
relations 


mof ix 
Dr Ôx; ër J= 3 Zen IO 


represents a canonical transformation. For 
a general canonical transformation, x4, X5, 
a, Xp X1, X5, ..., X, are not necessarily 2n 
independent variables. But for a canonical 
transformation there exists n new variables 
Xip Xi, X, Pj P, , such that x,, 


IJ 
E E PE vasis By EE P A 


jo 
are 2n independent variables, where (i,, i5, 
sl) Glas Jai) Is a partition of the set 
(1,2, ..., n) to two disjoint subsets. Denote 
by Q, the function U — Li} X; P. Then the 
transformation (4) is derived from relations 


EL =] 2 n 
j x," $t yf 
oo, 
kV op: l=1,2, ,n-—k, 
o0, 
pul. caus Uk 
" 0X 


These functions Q,, Q, are called the generat- 

ing functions of a canonical transformation. 
Consider a transformation of 2n variables 

depending smoothly on a parameter t: 

X; Xj(t, X4, X5, e» Ans Dis Po 35s Pah 

Xm Pio P25 <--> Pab 


j=1,2,...,m. (11) 


P; P(t, X, X5, .. 


If (11) represents a canonical transformation 
of 2n variables for each t, then X; and P,, 
j=1,2,...,n, satisfy 'Hamilton's canonical 
equations 


dX, óH dP, ôH 
AGANT AURA X, P), ————— ,X, P), 
d gp. pce IC 


j212,..,n, (12) 


for some *Hamiltonian function H(t, x, p). 
Conversely, for any solution X;, P, of Hamil- 
ton's canonical equations (12) depending on 
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parameters x,, X5, ..., Ku Di; p2, Da if the 
transformation (11) is a canonical transforma- 
tion for t=0, (11) represents a canonical trans- 
formation for each t. By a canonical trans- 
formation (4), the solutions of Hamilton's 
canonical equations 

Pus sy Oba cU 
i p oH ; 


j-21,2,..,n, (13) 


are transformed to the solutions of 





4X, OK xp, dÉ. Key py 
d óPB C^" d OX C^ 


j=1,2,...,n, (14) 


where the new Hamiltonian function is given 
by K(t, X, P)= H(t, x, p). 


C. Applications to the Integration of 
Differential Equations 


Contact transformations have applications to 
the integration of differential equations since 
they transform each union of surface elements 
into another one. 

As an example, we shall describe an outline 
of their application to a partial differential 
equation of the first order 


F(x, y, z p, q) - 6; 
p=0z/0x, q=0z/éy. (15) 


For this purpose, we regard (15) as an equation 
defining unions of surface elements and trans- 
form it into a simpler equation by means of a 
contact transformation. If the transformed 
equation can be solved, then the solution of 
the original equation can be obtained by 
means of the inverse transformation. Now, let 
z= (x, y, a, b) be a tcomplete solution of (15). 
Then (15) is reduced to Z — c =0 by the trans- 
formation generated by the function 


QzZ-—z-cro(x,y, X, Y)—c=0, 


where c is a constant. In this equation the 
solution X =a, Y=b, Z =c, aP + BQ =0 (a, b, 

c, &, B are constants) plays an important role, 
and thís line element will be called a charac- 
teristic line element. The characteristic line 
element satisfies equations that can be trans- 
formed by means of the inverse transformation 
into Charpit's subsidiary equations for (15): 





dx dy ` dz 
OF/Óp OF/dq pOF/Op-4-qOF/0q 
—dp —dg 


= = 16 
QF/0x--poF[O0z OF /0y+qeF/dz (18) 


Consequently, if we have p= p(x, y, z; a), 4 = 
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q(x, y, z; a) from the solution G(x, y. z, p,4)=a 
of (16) and F —0, then the total differential 
equation dz = pdx + qdy is tcompletely inte- 
grable, and the tgeneral solution of this equa- 
tion is a complete solution of (15). Also, if we 
know two independent solutions G(x, y, z, p, q) 
=a, H(x, y, z, p,q) - b of (16) such that [G, H] 
— 0, we can obtain a complete integral of (15) 
by eliminating p, q among the three equa- 
tions F «0, G=a, H =b. This method is called 
the tLagrange-Charpit method, which is ap- 
plicable also to the equation F(z, x, ..., Xy; 
Pis o p)70(— 322 Partial Differential Equa- 
tions (Methods of Integration)). 


D. Applications to Analytical Dynamics 


Consider Hamilton's canonical equations (13). 
An example of system (13) is the equation of 
motion for a dynamical system which is de- 
rived from Al L(x;, xj) dt =0. Here the x; denote 
generalized coordinates, p; - 0L/Ox; and H = 
Ya jXj— L. 

Let S(x,, x2. ..., X,, X4, X2, ..., Xn) bea 
solution depending on parameters X,, X;, 
..., X, of the *Hamilton-Jacobi equation 








os os os 
Hit, x1,X5,.... Xp, 3 E 
OX, OX, OX, 
= K(X, X>,...,Xp) (17) 
2 
such that det ( SS x) #0. Then by a canon- 


ical transformation of a generating function $, 
ie., by a transformation determined by the 
relations 


oS os 
P7 T Bx," Weer j=1,2,.. DO 


the system (13) is transformed to (14). Here 


(x) 
K(t,X, P)=H| t,x, —— |- K(X). 
ôx 


Then (14) becomes 

dX, dP, OK 

c ae 5 JEL; 2a ,n, 
dt dt Óx; 


and the solutions are 


OK 
P(t) = P(0)—t— 


X;(t)= KA), OX;\x=x0) 
jlx= 


In particular, a transformation that makes 
K =0 is called a transformation to an equilib- 
rium system. Thus solving Hamilton’s canon- 
ical equations reduces to finding a tgeneral 
solution of the Hamilton-Jacobi equation (17). 
When the variables of the Hamilton-Jacobi 
equation (17) are totally separable we can 
obtain a general solution of (17) by tquadra- 


82 Ref. 
Contact Transformations 


tures. Then one can integrate by quadratures 
the system of equations (13). For many im- 
portant problems exact solutions are obtained 
by this method (— 271 Mechanics). 

Furthermore, every mapping in an optical 
system is a canonical transformation; in optics, 
the quantity corresponding to S is called an 
eikonal (— 180 Geometric Optics). 
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A. The Notion of Continued Fractions 
Let {b,} (n=0,...,m) and {c,} (n=1,...,m) be 


finite sequences of elements in a ffield F. A 
fraction of the form 





C5 





Cm-1 
wot 


c 
bm- +#— 
E Së 


m 


is called a finite continued fraction. It expresses 
an element in the field F unless devision by 0 
occurs in the process of reduction. Symboli- 
cally, it is also written in the forms 








bonded ore Cm-1 Cm 
P by by ab, 
€, cl Cm-1l Cm 
$+ fe LI 
0 , 
lb; |b, lbm-1 Ip 
C Oups 
b4 34 24 lI, 





etc., or more briefly, 


bo + Gel: 
S b, €: 
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If (5, (n=0,1,...) and {c,} (n— 1,2, ...) are 
infinite sequences, the expression 


Cy 








Ca 








boven 


is called an infinite continued fraction. By 
analogy with the finite case, it is expressed by 


BS €5 Cn 
byt by te +b, tes: 
€ oo 
or by bo + H . 
b, n=1 
For an infinite continued fraction, the quantity 
C, Cn 
k, = bg +— —,  kg-b, 
n 0 b, ZE E +b, 0 0 


is called its nth convergent, b, is called the 
initial term, and c, and b, (n> 1) are called the 
partial numerator and partial denominator, 
respectively. If F is a ttopological field (e.g., 
the real or complex number field) and the se- 
quence {k,} of its elements converges, then the 
infinite continued fraction is said to converge, 
and the limit is called its value. 

A finite or infinite continued fraction in 
which c, (n> 1) are all equal to 1, bọ is a ra- 
tional integer, and b, (nz 1) are all positive 
rational integers is called a simple continued 
fraction. It is expressed by [bp,b,,... ]. b, is 
often called the nth partial quotient. In the 
following paragraphs we shall mostly discuss 
simple continued fractions. 

For a real number x we mean by [x] the 
greatest integer not exceeding x. [ ] is called 
the Gauss symbol. Let « be any given real 
number, and put 


1 
Q—bg4—, bg-[o] 
e 


l 
0, = b, + ] 
Oni 





D.SIoL.8—h2i 
Then an expansion of o into a simple con- 
tinued fraction 


1 l 


in 0 bite tb, + 


is obtained. If œ is irrational, this expansion is 
determined uniquely. Conversely, any infinite 
simple continued fraction converges to an 
irrational number. If œ is rational, the process 
is interrupted at a finite step (c, — bm) result- 
ing in 

b 1 1 
w= a pede 
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An alternative representation of a rational 
number by a simple continued fraction is given 
by replacing b,, above by (b, — 1) - 1/1. 

Examples of infinite simple continued frac- 
tions are 


eR. " 1 1 
3p+ Qno Dp 





el 
where p is a natural number (J. H. Lambert), 
and 

1 1 1 1 1 1 


e=2+- z = — c > 
122-14 14204 b 


(L. Euler). 


B. Convergents 


Let the nth convergent of a simple continued 
fraction be expressed in the form of an irreduc- 
ible fraction 


P, b jl 20 
T m, m 4 * nc , 
Q, ° byte +b, 


and for convenience put P. , —0, P. ,—1, 
Q_,=1, Q_,=0. Then we have the recurrence 
relations 


P,—bP a EP. Qn=PnQn-1 + On-a5 
nz 0, 

whence follows 

P,Qn-1— best, n> —1. 


Any simple continued fraction represents a 
real number œw which satisfies 


DOnt Q, + Q,-1) 


in terms of the notation defined in this and the 
previous section. 

In particular, let w be an irrational number. 
Then each of the fractions 


Q9 —(0,,, Pa + P,- 


PP Kaz: k=1,2,... 
OF Q,-2 + kQn-1 

which are inserted between two conver- 

gents P, 20. .; and P,/Q, (P, Ab AM 
(Q,-.+,Q,-,), is called an intermediate con- 
vergent, while the original convergent P,/Q, is 
called a principal convergent. 

If a fraction P/Q approximating an irra- 
tional number o satisfies |œ — P/Q| « |o — p/q| 
for any other fraction p/q with q « Q, then it is 
said to give the best approximation. The frac- 
tion giving the best approximation of w is 
always a principal or intermediate convergent 
P®©/Q of œw with k» b,/2 or k=b,/2, Q, 
Q,-105. 

The convergents satisfy the relation P,/Q, — 
P,-1/Q,-1=(—1)""!Q,Q,-15 hence the se- 
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quence |P5,/05,; (resp. {P2n+1/Qon+1}) is 
monotonically increasing (decreasing). Ap- 
proximation by convergents is shown in the 
relations 























| P, 1 | n Pi 
= < , gebei H 
Q, Qn+1Qn Q, Qi 
P, o (—1y 
oz Wm — -by-4 Y 
noo Qn n= o0QuaQ, 


In particular, |o — P,/Q, | « 1/02. On the other 
hand, if P, Q are relatively prime positive 
integers with |œ — P/Q| < 1/2Q", then P/Q isa 
convergent to œw (A. M. Legendre). There are 
several results concerning the measure of 
approximation by convergents. Any c satisfies 

— P/Q! « 1/592) for infinitely many n, 
while there exists an œ which satisfies |œ — 
P,/Q,1 « 1/(4Q2) for only a finite number of n 
provided that A> J5 (e.g., o — SE A. Hur- 
witz); at least one of two adjacent convergents 
satisfies the inequality |w— P,/Q,| < 1/(202) (K. 
T. Vahlen); at least one of three consecutive 
convergents satisfies the inequality |œ — P,/Q, | 
< 1/4/5092) (E. Borel); |o — P/Q| <1/,/8Q? 
has infinitely many rational solutions P/Q 
whenever o is not tequivalent to Sc +1)/2 
(Hurwitz). One calls an irrational number c 
badly approximable if there is a constant c= 
c(@)>0 such that |o — P/Q |» c/Q? for every 
rational P/Q. œ is badly approximable pre- 
cisely if the partial quotients of the continued 
fraction expansion of œ are bounded. In partic- 
ular, real quadratic irrationals have bounded 
partial quotients (— Section C; 182 Geometry 
of Numbers F). 


C. Quadratic Irrationals 


If an infinite simple continued fraction 

[bo, b,, ... ] satisfies bh, = Dan V= 

0, 1,2, ...), it is called a periodic continued 
fraction and is denoted by the symbol 

[bo. bis ..., bh, Kl According as m=0 
or m> |, the periodic continued fraction is said 
to be pure or mixed, and the sequence bp, 
bis Dank 1 18 called a period. In order for 
the continued fraction of œ to be periodic, it is 
necessary and sufficient that œ be a quadratic 
irrational number, i.e., a root of ax? +bx+c 

— 0 with rational integral coefficients a, b, c 
and nonsquare discriminant b? — 4ac (J. L. 
Lagrange). In order for œw to be represented by 
a purely periodic continued fraction, it is neces- 
sary and sufficient that c be an irreducible 
quadratic irrational number, i.e., that c satisfy 
021 and 07 o > —1, where w’ is the conju- 
gate root of œ (E. Galois). In order for œw to be 
equal to the square root of a nonsquare ra- 
tional number, it is necessary and sufficient 
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that its continued fraction be of the form 
[bo, by, ..., b, .,, 209 ] (Legendre). 


D. Application to Pell's Equation 


Let ax — by — 1 ((a, b) 2 1) be a tDiophan- 

tine equation of the first degree, and a/b — 

[55, bis ..., bm] = Ball, Since PnQm-1— 
P,.,Q,—7(—1)" t, a solution of the equation 
is given by xy—(—1)" ! Q,, Vo=(— 1)" ! P. 
The general solution is then represented in the 
form xo + bt, yo tat (te Z). This method of 
obtaining a solution is essentially the same 

as the method which uses the tEuclidean 
algorithm. 

Pell's equation x? — Dy? —1 (D is a non- 
square integer > 1) was solved by Lagrange in 
terms of continued fractions. If the length of 
the period of D is k, all positive solutions of 
Pell's equation are given by x= Ban, y= 
Q5, if k is odd, and by x=Py-1, Y= Qu, 
if k is even (v2 1,2, ...), where P,/Q, denotes 
the nth convergent of the continued fraction 
expansion of JD. Incidentally, x = P, 14-4, 
y2Qo, i-i V= L2, ...) are the positive 
solutions of x? — Dy? = —1 provided that k 
is odd. There are no solutions of x?^ —Dy^ = 
+1 other than x,, y,(v— 1,2, ...) given by 
(x, FD yy 2x, D y, where x,, y, is 
the least positive solution. For instance, the 
least positive solution of x? —211y?21 is x= 
278,354,373,650, y = 19,162,705,353. 

Lagrange made further use of continued 
fractions in order to obtain approximate 
values of roots of algebraic equations. The 
method is especially useful for precise compu- 
tation of neighboring roots. 

The theory of continued fractions may be 
investigated geometrically making use of lat- 
tices ([2, 3]; — 182 Geometry of Numbers) (F. 
Klein, G. Humbert). For instance, a measure 
of approximation of P,/Q,, to œw in Diophantine 
approximation is represented by the closeness 
of a lattice point (P,, Q,) to the straight line y 
— ox on the plane. 


E. Continued Fractions with Variable Terms 


There are few results on continued fractions 
with variable terms. It is noteworthy that 
from the expansion of tanz into a continued 
fraction 

Z 2 SE — 


tanz —— 
[r3 3E 354 E 











(Lambert), the irrationality of z and of tanz 
for rational z (40) can be deduced (A. 
Pringsheim). 

Among continued fractions with variable 
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terms, those of the form 


do Ou? AyZ 


qa eb Tg 





Loo, al 


are called normal continued fractions. Let the 
convergent of such a continued fraction be 
P.(z) Q,(z), and for convenience put P_,(z)= 
DO ,(z)— 1. Then we have the recurrence 
formulas 


P,(z)— Let (z) - a,z P, zt 
Q,(z) = Q, ,.(z) + a.zQ, 22). nz l. 


There are the further relations 


P0,- ei — P,.,00,() - (- 1" UA, 


[a á z]?= Y (—1)"2" o ^ 
0» jv udi c AD v 
" n=0 Q, 1(2)0,.) v=0 
where the latter is formal. Let the tpower series 
expansion of the nth convergent of [ag, anz] i 
be 


Then bm, =bn, (0<v<m<n). If [ag,a,z]t has a 
power series expansion about the origin, then 


90 
[ao, anz] 7 Y banz”. 
n=0 


If the supremum g of (|a,]]f is finite, then 
[a9,a,z]f converges uniformly for |z| (1/4) g, 
and hence it represents an *analytic function 
which is holomorphic in |z| « (1/4)g. 
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84 (X.2) 
Continuous Functions 


A. General Remarks 


The notion of continuity is defined for a map- 
ping or a function f: X 5 Y from a topolog- 
ical space X to a topological space Y (— 425 
Topological Spaces G). In the present article, 
however, we are concerned mainly with the 
case where both X and Y are tmetric spaces 
with the distances py and py, respectively. The 
most usual case is X = R" (Euclidean space), 
Y=R (real numbers). 

A function f: X — Y is said to be continuous 
at a point xo € X if for every positive number e, 
we can select a suitable positive number 6 
(depending on e and also on xo) such that 
px (X, xo) « ó implies py( f(x), f(xo)) « e. This is 
equivalent to the condition that x xo implies 
f(x) f (Xo) (— 87 Convergence). We call f 
continuous (on X) if it is continuous at every 
point x, of X. If for every positive number e, 
we can select a suitable positive number ó 
independent of x and y such that py(x, y) « ó 
implies py( f(x), f(y)) « e for all x, ye X, we call 
f uniformly continuous on X. 

The tsupremum (6) of py( f(x), f(y)) for 
x, ye X satisfying py(x, y) « ó is called the 
modulus of continuity of the function f in X. 
Uniform continuity means that w(d) O0 for 
ó0. 

If o6) x Mô" for suitable constants M, «> 
0, that is, if the inequality Gei f(x), f(y) € 
M(py(x, y))* holds for x, ye X, then f is said 
to satisfy the Hólder condition of order o, also 
known as the Lipschitz condition of order o. 

If «— 1, this condition is called simply the 
Lipschitz condition. A function satisfying one 
of these conditions is uniformly continuous. 
The family of functions satisfying the Lip- 
schitz condition of order « is sometimes de- 
noted by Lipa. 

In general, the composite function go f: 

X >Z is continuous if both functions f: X Y 
and g: YZ are continuous. If the ranges of f, 
g are both the real field R (or the complex field 
C, or more generally a *topological field), then 
fxg, fg are continuous if f and g are contin- 
uous; and f/g is continuous provided that 
g(x) 0. If R is the range of both f and g, then 
min( f, g) and max( f, g) are continuous when f 
and g are continuous. If X is tconnected (for 
example, an interval J in R) and if f is contin- 
uous, the image f(X) is also connected. 


B. Continuity from One Side 


In this section, we always assume that the 
domain X is an interval J in R and f is a func- 


84 C 
Continuous Functions 


tion from I to a metric space Y. A point xo 

of X is called a discontinuity (point) of the 

first kind of f if both limits lim,;, f(x) and 
lim, f(x) exist in Y and are different. Then 
we say also that f has a jump (or gap) at zo, If 
these two limits exist and have the same value, 
then f is continuous at xy. 

We say that f has a discontinuity of at most 
the first kind at x, if f is continuous at xy or if 
Xo is a discontinuity of the first kind for f. 
(Sometimes the phrase "discontinuity of the 
first kind" is used to mean “discontinuity of at 
most the first kind.") A discontinuity point of f 
(i.e., a point at which f is not continuous) that 
is not of the first kind is called a discontinu- 
ity point of the second kind. When lim, f(x) 
= f (x9), we call f right continuous (or con- 
tinuous from the right) at xo. In this case, 
Im, f(x) need not exist. Replacing x| xo by 


~ xftxX 9, we can similarly define the concept of 


being left continuous (or continuous from the 
left). If a function f has a finite number of 
discontinuity points of the first kind in the 
interval [a, b] and is continuous at all other 
points, we call f a piecewise continuous func- 
tion in [a,b]. 


C. Semicontinuous Functions 


In this section, we assume that the domain of 
the functions is a subset E of a metric space X, 
and that the range is the set of real numbers 
extended to include 4-oo. Let x be a point in 
the closure of E. We denote by M(x, ô) and 
m(x, 6), respectively, the supremum and the 
infimum of the values of a given function f in 
the ó-neighborhood of x. We put 


M(x) =lim M(x,6), m(x) -lim m(x, à), 

and call them the upper limit function and 
lower limit function, respectively. We have 

-—oo €«m(x) < M(x) « +00. If M(xg) — f(xg) at 
xo€ E, then f is called upper semicontinuous at 
Xo. If m(xo) = Tal at xo € E (Le., if —f is upper 
semicontinuous at x), then f is called lower 
semicontinuous at x). The function with one 
of these two properties is said to be semicon- 
tinuous at Xo. 

Either of the following two conditions is 
necessary and sufficient for the function f to 
be upper semicontinuous at xo € E: (1) f(xo)= 
+, or for every constant 4 such that f(xo)< 
2, there exists a ó-neighborhood such that 
M (xo, ô) « 4. (2) For every sequence x, of E 
converging to xg we have limsup, ,4 J (Xn) = 
f(xo). 

A function f is called upper (lower) semi- 
continuous in E if it is upper (lower) semicon- 
tinuous at every point xe E. A necessary and 
sufficient condition for the upper semicontinu- 
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ity of the function f(x) in E is that [x|f(x) «aj 
be a ?relatively open set in E for every real 
number «. We can define semicontinuity for 
functions on a topological space by using this 
latter property. 

A real-valued function f(x) is continuous at 
xg € E if and only if it is upper and lower semi- 
continuous at xo and f(x;) is finite. A function 
f(x) is continuous on E if and only if it takes 
finite real values on E and both (x|f(x) «aj 
and (x| f(x)» aj are relatively open in E for 
any real number a. When E is tcompact, an 
upper (lower) semicontinuous function on E 
attains its (supremum) infimum at a point in 
E. In particular, a continuous function on a 
compact set E is bounded and assumes its 
maximum and minimum on E (Weierstrass’s 
theorem). Furthermore, if E is connected (e.g., 
the interval J in R), it follows from the con- 
nectedness of the image f(E) that if o, Be 
f(E) and y lies between a and fi, then ye f(E) 
(intermediate-value theorem). 

A real-valued function f(x) on a set E of R 
satisfies the Lipschitz condition if it is *differ- 
entiable and the derivative is bounded. Such a 
function is also tabsolutely continuous, con- 
tinuous, and of tbounded variation. (For the 
polynomial approximation of real continuous 
functions — 336 Polynomial Approximation.) 

The limit function f(x) of a monotone de- 
creasing sequence of upper semicontinuous 
functions f,(x) is also upper semicontinuous. 
The limit function f(x) of a uniformly converg- 
ing sequence of continuous functions is con- 
tinuous. (Regarding the *equicontinuous family 
of functions — 435 Uniform Convergence.) 


D. Baire Functions 


The limit function of a pointwise converging 
sequence of continuous functions defined on a 
metric space X is not necessarily continuous. 
R. Baire (Ann. Mat. Pura Appl., (1899)) intro- 
duced the notion of Baire functions as follows: 
He named continuous functions the functions 
of class 0. Then he called a function that is a 
pointwise limit of a sequence of continuous 
functions a function of at most class 1. A func- 
tion is said to be of class 1 if it is of at most 
class 1 and is not of class 0. He similarly de- 
fined the notion of class n for arbitrary natural 
number n. 

Further, a function is said to be of at most 
class o if it is a pointwise limit of a sequence of 
functions of class n, for a sequence of natural 
numbers n,. À function is said to be of class 
œ if it is of at most class œw and is not of class 
n for any finite number n. In general, using 
ttransfinite induction, we can define the notion 
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of functions of class Z for an arbitrary *ordinal 
number Z [4]. 

All these functions are called Baire functions. 
Actually, there is no function of class Z for an 
uncountable ordinal number €. If X is a tper- 
fect set in Euclidean space, then there is actu- 
ally a function defined on X of class Z for an 
arbitrary countable ordinal number Z Here- 
after, we shall be concerned with this case only. 

If X has the cardinality of the fcontinuum, 
then the set of all Baire functions on X has the 
cardinality of the continuum. On the other 
hand, the cardinality of all functions on X is 
actually greater than that of the continuum. 
Hence there exist functions that are not Baire 
functions on X. A function is a Baire function 
if and only if it is Borel measurable (H. Le- 
besgue). Therefore a necessary and sufficient 
condition for a function f to be a Baire func- 
tion is that the set (x| f(x)» o, xe Xj be a Borel 
set for any real number « (— 270 Measure 
Theory J). The limit of a countable sequence of 
Baire functions is also a Baire function. If f(x) 
and g(x) are of at most class x on X, then the 
following functions are also of at most class 
ac |f Gol, FOE g), Tal: g), and f(9/g(9 
(provided that g(x) ZO on X). 

The condition that a function f is of at most 
class 1 on X is equivalent to either of the 
following two conditions: (1) For any closed 
subset F of X, the restriction f * of f to F has 
a continuity point in F. (2) For every real 
number a, the set (x|f(x) «a, xe X] is an tF, 
set (Baire). In a fcomplete metric space, a 
necessary and sufficient condition for a func- 
tion f to be of at most class 1 is that the set of 
continuity points be dense in X. 

For example, the Dirichlet function, which 
takes the value 1 at rational points and 0 at 
irrational points, is expressed as 


lim (im (cosy! my"), 
you \ k— oo 

which is of class 2. A function f(x, y) of two 
real variables that is continuous in each vari- 
able x and y separately is a function of at most 
class 1. 
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85 (VI.19) 
Continuous Geometry 


A. General Remarks 


The structure of a tprojective geometry is 
determined by the tlattice (lattice-ordered set) 
of subspaces of the projective space. For this 
reason, this lattice itself is sometimes called 

a projective geometry. The concept of con- 
tinuous geometry was introduced by J. von 
Neumann as an abstraction of lattice-theoretic 
properties from a special class of tlattices 
(lattice-ordered sets) which he encountered in 
his research on *operator rings in Hilbert 
spaces [1]. Continuous geometry contains 
projective geometry as a special case when the 
dimension is discrete; but more usually the 
lattices are of continuous dimension. 

A continuous geometry is a tcomplete and 
tcomplemented tmodular lattice L (— 243 
Lattices F) that satisfies the following prop- 
erty and its dual (both called properties of 
continuity): For any element a of L and any 
subset W of L which is twell ordered with 
respect to the ordering in L, we have aN supw 
=sup(a N w) (we W). The tcenter Z of the 
lattice L is called the center of the continuous 
geometry L, which is said to be irreducible 
when Z has no elements other than the tleast 
element 0 and the fgreatest element J; other- 
wise L is said to be reducible. A reducible 
continuous geometry is isomorphic to a sub- 
lattice of a tdirect product of irreducible con- 
tinuous geometries. 

On any continuous geometry L, there can 
be defined a function d(x) whose values belong 
to a complete lattice-ordered linear space M 
and which satisfies the following four con- 
ditions: (1) d(x) > 0; (2) d(x)=d(y) implies the 
existence of a common complement of x and y; 
(3) d(x U y) - d(x y) 2 d(x) + d(y); (4) supd(w) = 
d(sup w) (we W) for any subset W of L which 
is well ordered with respect to the ordering in 
L. Such a function d(x) is called a dimension 
function on L. Irreducibility of L is equivalent 
to the property that a real-valued dimension 
function can be introduced; in this case, if d(w) 
takes only a finite number of values, then L is 
a finite-dimensional projective geometry; on 
the other hand, if d(w) takes every number in 
the closed interval [0, 1], then L is called a 
continuous geometry in the strict sense. An 
example of the latter can be constructed as a 
limit of a sequence of projective geometries of 
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increasing dimensions; another example is the 
lattice of projection operators of the tvon 
Neumann algebra of type II,. If a group G of 
*automorphisms of L is given, there can be 
introduced a generalized dimension function 
which is invariant under G satisfying slightly 
weaker conditions (T. Iwamura [3]). 


B. Representation of Continuous Geometry 


A ring R is called a regular ring if it has a unity 
element and if, for any element a of R, there 
exists an element x in R such that axa =a. A 
continuous geometry L is isomorphic to the 
lattice (with > as its ordering) of principal left 
ideals of a regular ring R provided that d(I) 
=n: d(x) (n 2 4) for some natural number n 
and some element x of L. The decomposition 
of R into a tdirect sum of ideals corresponds 
to the decomposition of L into a direct prod- 
uct of lattices. The condition that L is irreduc- 
ible and finite-dimensional is equivalent to 

the condition that R is a matrix ring over a 
tskew field; when these conditions hold and L 
is considered as a projective geometry, then 
the coordinates are given by this skew field. In 
continuous geometries, join and meet are often 
denoted by the symbols for sum and product, 
respectively, and sometimes a direct product of 
continuous geometries is called their direct 
sum. Sometimes, the requirement that a con- 
tinuous geometry be complete is weakened to 
the requirement that it be tconditionally ø- 
complete. 
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Control Theory 


A. General Remarks 


The classical theory of automatic control 
mostly deals with linear feedback control 
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systems with single input and single output. 
Mathematical structures of such systems must 
be, in principle, described in terms of ordinary 
linear differential equations with constant 
coefficients. Hence control engineers use block 
diagrams to describe systems, and operational 
calculus based on tLaplace transforms to ob- 
tain response characteristics (— 240 Laplace 
Transform). Thus the input/output relation of 
a system is described in terms of transfer func- 
tions. The main objectives of control theory 
are to ensure system stability, to maximize 
closed-loop response characteristics by choos- 
ing the best feedback, and to obtain a desir- 
able transient response to an impulse or step 
input. One of the remarkable contributions to 
classical control theory is Nyquist's criterion 
for stability testing of linear feedback systems 
[1]. The test consists of plotting the Nyquist 
diagram of a transfer function in the frequency 
domain (complex plane), and differs essentially 
from the tRouth-Hurwitz stability test for 
linear differential equations with constant 
coefficient. Classical control theory was almost 
complete by the end of World War II. 

Revolutionary technological innovations in 
electronics and computers and the invention of 
new control instruments and systems devel- 
oped after World War II have opened the way 
to modern control theory. Around 1960, three 
remarkable contributions were made con- 
currently; they are dynamic programming 
(proposed by R. E. Bellman [2]), Pontryagin’s 
maximum principle (L. S. Pontryagin et al. 
[3]). and linear system theory (R. E. Kalman 
[4]). The first two give rise to mathematical 
tools to solve optimal control problems and to 
design optimal controllers or regulators. In 
contrast to the classical theory of control, 
optimal control problems are formulated in 
terms of the system of linear or nonlinear 
multivariable differential equations with multi- 
input forcing terms called control variables. 
This leads to the state-space approach, which 
has become ubiquitous in modern control 
theory. Linear system theory derives from the 
concepts of controllability and observability. 
The former was introduced by Pontryagin et 
al. [3] and later modified by Kalman [5], and 
the latter was introduced by Kalman [5]. 
Controllability and observability are con- 
cerned with the interrelation between internal 
states of a system and its inputs and outputs. 
Hence linear system theory, profoundly related 
to the state-space approach, remains a prin- 
cipal theme of modern control theory. More 
advanced control theories, such as adaptive 
control or learning control, are mostly de- 
scribed and formulated in the framework of 
linear system theory. 

Modern control theory has stimulated 
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the development of cybernetics, the science 
of control and communication, and con- 
trol theory is important in the information 
sciences. In fact, current control theory has 
many features that might be considered to 
belong as well to other areas, such as mathe- 
matical programming (— 264 Mathematical 
Programming), operations research (— 307 
Operations Research), game theory (— 173 
Game Theory), prediction and filtering theory, 
digital signal processing, circuit theory, and 
computer or microprocessor technology. 
Current control theory embodied in modern 
instrumentation and computer technology has 
a vast range of applications. We do not con- 
sider these applications here; we confine our- 
selves to some important theories that con- 
tinue to be major topics in control theory. 


B. Linear Dynamical System 


Let the state of a physical process to be con- 
trolled be represented by a real n-dimensional 
column vector x(t)=(x,(t),...,x,(t))'€R", 
where T denotes the transpose of a vector or 
matrix. If the state is determined by a differen- 
tial system with input u(t) — (u; (t), ..., u,(2)) € 
R’ and output y(t) 2 (y,(t), ..., y, (t) ER”, 


© x(t)= A(t)x(t) + B(t)u(t), 


y(t) - C()x(t), (1) 


then the process is called a linear dynamical 
system or simply a linear system. The dimen- 
sions of the coefficient matrices are given by 


A()eR""", B()eR"", C()eR""". 


If A(t), B(t), and C(t) are constant matrices, the 
system described by 


d 
X=—x=Ax+4+ Bu, 
dt 


y=Cx (2) 


is called a linear time-invariant system. Corre- 
spondingly, the system described by equation 
(1) is called a linear time-varying system. 
Given an initial state x(t 9) and input func- 

tion u(t) for tÈ to, the state of system (1) is 
represented by 

t 
x(t) D, (ED ut. :)B(x)u(x) dr, 

to 
where Q(t, t) is a fundamental solution to the 
matrix differential equation 


EI 


O(t,t1)=! (nxnidentity matrix). 
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Hence the output of the system is described by 


y() — C(t)e(t, to) x(to) 


t 
d C(t) P(t, x) B(x)u(x) dt. 
to 

It is seen from this equation that the input- 
output relation of the system is governed by 


Wit, d C(t)b(t, 2) B()eR""*, 


which is called a weighting matrix. As a spe- 
cial case, the weighting matrix for the time- 
invariant system (2) becomes 


W(t, ds W(t —1)2 Ce^* 9B, 
where e^' is a matrix exponential function 
defined by 
t? p 
Aat SE p vA 
e =I+tA 4A +74 T... 


The Laplace transform of the weighting matrix 
is 


G(s)= PR e *"W(t)dt— C(sl —A)^1B, 
(U 


which is called a transfer function matrix of the 
system (2). First we note that the weighting 
matrix W(t, 1) for the system (1) is invariant 
under any linear transformation X(t) — P(t)x(t) 
of the state, and also that the transfer func- 
tion matrix G(s) for the system (2) is invariant 
under X = Px. For example, the transformation 
X= Px yields 


&-PAP^! X4 PBu, 
y-CP^£. (3) 


In fact, the transfer function matrix for the 
system (3) becomes equivalent to that for the 
system (2), that is, 


G(s)- CP"! (sI — PAP^!)  PB2 C(sI A) ! B 
= G(s). 


As a converse to this, there arises the problem 
of whether any linear time-invariant systems 
with the same transfer function matrix are 
similar to each other. To solve this problem, it 
is necessary to introduce the concept of con- 
trollability and observability for the system (1) 
or (2). 


C. Controllability and Observability 


The concept of controllability and observa- 
bility plays a fundamental role in linear system 
theory. Roughly, controllability tmplies the 
possibility of steering the state from the input. 
Definition of controllability: The linear sys- 
tem (1) is said to be controllable at time t, if 
for any state x(tọ)e R" and x! eR" there exists a 
control input u(t), t€ [to, t, ], that transfers the 
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state x(to) to the state x! at a finite time t, >to. 
Otherwise, the system (1) 1s said to be un- 
controllable at time fo. 

The controllability theorem states: The 
system (1) is controllable at time t if and only 
if there exists a finite t, (> tel such that the 
matrix 


D(to,tı)= | (ts, t)B(t)B'()e'(tg,t)dr — (4) 


is nonsingular. Similarly, the linear time- 
invariant system (2) is controllable if and only 
if the n x (nr) matrix 


D=[B, AB, ..., A! B] (5) 


has rank n. The matrix D is called the con- 
trollability matrix of the system (2). 

The concept of observability is dual to that 
of controllability. It implies the possibility of 
estimating the state from the output. 

Definition of observability: The linear system 
(1) is said to be observable at t, if for any state 
x(tg)e R” there exists a finite t, (>to) such that 
the knowledge of the input u(t) and the output 
y(t) over the time interval [t5, t, ] suffices to 
determine the state x(t9). Otherwise, the sys- 
tem (1) is said to be unobservable at tọ. 

The observability theorem states: The sys- 
tem (1) is observable at time tọ if and only 
if there exists a finite t, (>to) such that the 
matrix 


M(to,t,)— f D(t, to) "Cut, todt (6) 


is nonsingular. Similarly, the linear time- 
invariant system (2) is observable if and only if 
the n x (nm) matrix 


M =[C7, ATC", ... (4T 1 CT] (7) 


has rank n. The matrix M is called the ob- 
servability matrix of the system (2). 

We deal only with linear time-invariant 
systems for simplicity, although some of the 
main results to be stated here can be extended 
to linear time-varying systems. If the controlla- 
bility matrix D has rank n, (<n), there exists a 
linear transformation x= Px with nonsingular 
matrix P which transforms the system (2) into 


X, ken Ae Ai5 Xe + B, 
Cie ke abs eel to” 


(8) 
y= IC; CG Xe 
Xe 
so that the n,-dimensional subsystem 
X,=A,x,+ B.u, 
(9) 


yzCx, 


is controllable. Clearly this subsystem has the 
same transfer function matrix as the system (2). 
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If the observability matrix M has rank n, (<n), 
there exists a linear transformation X = Qx 
that transforms the system (2) into 


X, A, O0 Ix, B, 
= -H u, 
H E 215] ba 
y-2[C, 0]| x, | 
Xo’ 


so that the n,-dimensional subsystem 


X = AXo t Bu, 

(11) 
yc Cx, 
is observable. This subsystem also has the 
same transfer function matrix as the system (2). 
Combination of these two properties yields the 
canonical decomposition theorem (found by 
Kalman [6]), which states: There is a nonsin- 
gular transformation X = Px by which the 
system (2) can be transformed into the canon- 
ical form 


Xes Aco 0 A13 0 Xco 
New ue Az, Aan Aan Ara || Xe 
Xeo 0 0 A55 0 Xorg 
Xero! 0 0 A43 Asa Xero 
Dey 
B 
«ou (12) 
0 
ye e, 0 G OJfxo |, 
Xeo 
Xeo 
X 


where the subvector x,, of x is controllable 
and observable, A, is controllable but not 
observable, x., is not controllable but obser- 
vable, and x, is neither controllable nor 
observable. Furthermore, the transfer function 
matrix of the system (2) is equal to that of the 
system (12), which is described by 


G(s) E C.(sI d Aco) FS B.o» 


that is, the transfer function matrix of the 
system (2) depends solely on the controllable 
and observable part of the state. 

A linear system (2) is said to be reducible if 
and only if there exists a linear time-invariant 
system of smaller dimension that has the same 
transfer function matrix. Otherwise, the system 
is said to be irreducible. The canonical decom- 
position theorem implies that a linear time- 
invariant system is irreducible if and only if it 
is controllable and observable. 


D. Realization Theory 


Realization theory is concerned with determin- 
ing (1) a linear time-invariant dynamical sys- 
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tem that has a prescribed rational function 
matrix or (2) a linear time-varying system that 
has a prescribed weighting matrix. Here we 
consider only the former, although there are 
many interesting results for the latter as well. 

The rational function matrix G(s) is said to 
be strictly proper if and only if G(oo) 20. A 
linear time-invariant system with a prescribed 
G(s) as a transfer function matrix is called a 
realization of G(s). A realization of G(s) with 
the least possible dimension is said to be mini- 
mal. A minimal realization should be irreduc- 
ible by definition. The fundamental theorem 
on minimal realization, which answers the 
question raised at the end of Section B, states: 
For a given strictly proper rational matrix 
G(s), any two minimal realizations of G(s) are 
mutually similar in the sense that there exists a 
linear transformation X — Px that transforms 
one of the systems into the other. 

A minimal realization of a scalar transfer 
function 

bs" t+... b, usb, 


JOE — ———À—————— 13 
ats) s"+a,s" '+...44, 48 0, Se 


is given by 
0 1 0 0 
0 0 1 0 
x= x 
0 0 0 1 
“an An- WEE "Or 
: (14) 
0 
+]: Ju, 
0 
1 


y=Lb, b, b, b, ]x. 


It is easy to see that this system is controllable. 
Hence system (14) is called a controllable 
canonical realization. Kalman [4] showed that 
this realization is observable and therefore 
minimal if and only if there is no common 
factor between the denominator and the nu- 
merator of g(s). An observable canonical re- 
alization of g(s) can be similarly written down 
as a dual of the system (14). 

For a given strictly proper matrix G(s), the 
degree of the least common denominator of 
all entries of G(s) is called the degree of G(s), 
provided that there is no common factor be- 
tween the denominator and the numerator 
of each entry. It was shown by Kalman [7], 
together with a realization procedure, that 
the dimension of minimal realization of G(s) 
is equal to the degree of G(s). Other impor- 
tant realization procedures for rational func- 
tion matrices were proposed by B. L. Ho and 
Kalman [8], W. A. Wolovich and P. L. Falb 
[9, 10], J. Rissanen and T. Kailath [11], and 
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B. Dickinson et al. [12]. Procedures for time- 
varying system realization for prescribed 
weighting matrices were presented by Kalman 
[4], D. C. Youla [15], and L. M. Silverman 
[16]. Realization problems for a given impulse 
response matrix were first discussed by C. A. 
Desoer and P. P. Varaiya [17], and a fast 
algorithm for discrete-time system realiza- 
tion was proposed by C. T. Mullis and R. A. 
Roberts [18]. 


E. State Estimation 


When a linear system is observable, it is pos- 
sible to construct a device, called a state esti- 
mator, that approximates the state vector. For 
the given system (2), define a linear dynamical 
system 


ż=Dz+ Ey + Gu, 
X=L,z+Ly3y (15) 


with imputs ye R” and uc R', state ze R?, and 
output X eR". If the system (2) is observable, 
rank (C) 2 m, and qz n— m, there exist matrices 
D, E, G, L,, La, and M that satisfy 


MA-—DM=EC, 


L,M+L,C=I, (nxn identity matrix), 


G= MB. 
Then it follows that 
X(t)— x(t) = L,eP'[z(0) — M x(0)]. 


Furthermore, it is possible to choose a matrix 
D which is asymptotically stable, that is, every 
eigenvalue of D has negative real part. Thus 
X(t) x(t) as t oo, and system (15) yields a 
state estimator. This result was first shown by 
D. G. Luenberger [19, 20], and thus the system 
(15) is called a Luenberger observer. 

The Kalman filter proposed by Kalman [21] 
for discrete-time processes and by Kalman and 
R. S. Bucy [22] for continuous-time processes 
is an optimal state estimator for a stochastic 
system. Let 


x(t) 2 A(t)x(t)+ B(t)u(t), 
y(t) — C(t)x(t) + v(t), (16) 


in which u(t) is a white Gaussian noise (tBrown- 
ian motion) (— 45 Brownian Motion) such 


that 
Eu(t)=0, ` Eu(t)u'(s) 2 U(t)ó(t — s), 


and v(t) is also a white Gaussian noise such 


that 
Ev(t)=0, | Ev(t)v'(s) - V(t)ó(t — s), 


where E and ' denote the texpectation and the 
transpose, respectively. In addition, it is as- 
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sumed that 

Ex(tg) 20, | Ex(tg)x' (to) — Xo, 


and that x(t), u(t), and v(s) are uncorrelated. 
Under these conditions, the process x(t) be- 
comes a wide-sense Gauss-Markov process 
(J. L. Doob (23]). The Kalman-Bucy filter is a 
linear dynamical system described by 


Zil EA()— LA CHJA + L(0)y(t). 
where 
L(t)=P(t)}C™(t)V (t) 


and P(t) is a solution to the matrix Riccati 
differential equation 


P(t) — A(t) P(t) 4- P(t) AT (t) + B(t) U (t) B'(t) 
-PMC HAV TC) P(t), 
P(to) Ka. 


The Wiener filter proposed in 1949 [24] is 
a special case of the Kalman filter in which 
the signal process is stationary and of single 
input and single output. Extensions to non- 
linear filtering were investigated first by 
H. J. Kushner [25] and subsequently by 
W. M. Wonham [26], R. S. Liptzer and A. N. 
Shiryaev [27], and M. Fuzisaki et al. [28]. 


F. Optimal Control 


Consider a nonlinear dynamical system de- 
scribed by a system of differential equations 


xz f(x, u), (17) 


where xe R" and uc R'. For a given set U in 
R’, a piecewise continuous function u(t) de- 
fined over [to, t, ] is said to be an admissible 
control function if u(t)e U for every te[t,, t, ]. 
When a functional 


js | " fo(x(t u(t)dt 


is given as a performance index of control 
and two points x? and x! are given in R", the 
optimal control problem consists of finding an 
admissible function u(t) that minimizes J and 
simultaneously transforms the state from x(f9) 
— x? to x(t,)  x!. To solve this, a necessary 
condition for optimality was developed by 

L. S. Pontryagin and his colleagues [3] and 
termed the maximum principle. It states: In 
order for an admissible control u(t) and a 
corresponding solution trajectory of equation 
(17) to be optimal, it is necessary that there 
exist a nonzero vector Y(t) —(V(), ... Y, (DY 
such that (1) y(t) satisfies 


6H (y(t), x(t), u(t)) 


v(t) Zeie f 
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where 


H(, x,u)= — fox, u) + V" f(x, u), 


(ii) the function H (y(t), x(t), u) of the variable 
ue U attains its maximum at the point u — u(t) 
almost everywhere in [t,, t, ], and (iii) at the 
terminal time t, it holds that 


Hb (tı), x(t), u(t,)) - 0. 


When f(x, u) is linear in x and u, fo(x, u)& 1, 
and t, is free, we have the time-optimal control 
problem investigated by Bellman et al. [29] 
and J. P. LaSalle [30]. When f(x, u) is instead 
a quadratic function of x and u, we have the 
optimal regulator problem solved by Kalman 
[31]. The relation between the maximum 
principle and the calculus of variations (— 46 
Calculus of Variations) has been pointed out 
by L. D. Berkovitz [32]. Generalizations of 
the maximum principle and other results 
on optimal control problems are found in 
[33-35]. 


G. Miscellany 


There are many other areas of control theory. 
For details, we refer the reader to [36] for 
adaptive control, [37] for stability theory, and 
[38] for system identification. 
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87 (11.19) 
Convergence 


A. Introduction 


The notion of convergence was first intro- 
duced in the real number system to deal with 
sequences of numbers, functions, series, or 
definite integrals (— 379 Series; 216 Integral 
Calculus). The notion was then extended to 
the case of generalized sequences where the 
index moves over a directed set, and the terms 
are in a topological space. 


B. Convergence of Sequences of Numbers 


A sequence {a,} of numbers is said to be con- 
vergent to a number a or to converge to a, 

written lim, ,, a,— d Or a,>a as n> oo, if for 
any positive number £ we can choose a (suffi- 
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ciently large) natural number n, such that for 
every n larger than n, the inequality |a, — a| 
<e holds. Then a is called the limit (or limit 
point) of the sequence {a,}. If (a,] has a limit, 
it is unique. A sequence which is not conver- 
gent is said to be divergent or to diverge. 

A set A of real numbers is said to be 
bounded from above if there is a real number b 
such that a <b for all ae A, bounded from 
below if there is a real number c such that 
à Z c for all ae A, and bounded if it is bounded 
from above and below. A sequence {a,} of real 
numbers is said to be monotonically increasing 
(monotonically decreasing), written a, f (a, |), 
ifa, «aX... $A, Sans, €...(a,2a4 2...2 
d, > d444 Z ...). A monotonically increasing or 
decreasing sequence is called a monotone 
sequence. 


C. Criteria for the Convergence of Sequences 
of Numbers 


Every bounded monotone sequence of real 
numbers is convergent; its limit is sup{a,} 
(inf {a,,}) (— 355 Real Numbers B) if it is 
monotonically increasing (decreasing). For 
any bounded sequence {a„} of real num- 
bers, setting a, — inf(a,, a,,,, ...] and f,— 
sup1a,, 4,44, ... ], we have xf, f, |, and 
a, <a, S B,. Hence Im... %,=« (—supío,]) 
and lim, ,, ff, — f ( —inf( B,}) exist. a is called 
the inferior limit (or limit inferior) of {a,}, 
written lim inf, — a, or lim, ,,, a,, while f 
is called the superior limit (or limit supe- 
rior), written lim supp» a, or lim,_,,, a,. If 
lim sup,,_,.o a, =liminf,_,,, a, — o, then lim, ,,, a, 
exists and equals «. The limit of a conver- 
gent subsequence of a sequence {a,} of num- 
bers is called an accumulation point of the 
sequence. Here we should distinguish an ac- 
cumulation point of a sequence (a,) from 
an accumulation point of {a,} viewed as 
a set (2 425 Topological Spaces O); for ex- 
ample, if a, — 1 for every n, then the former 
is 1 and the latter does not exist. For any 
bounded sequence of real numbers, its supe- 
rior (inferior) limit is the maximum (minimum) 
of its accumulation points. Moreover, if $ is 
the superior limit of a sequence {a,}, then for 
any positive number e, there exist only a finite 
number of n’s for which a, is greater than f +e, 
while there may exist an infinite number of n’s 
for which a, is less than fj — e. The inferior limit 
of the sequence has a similar property. 
Suppose that we are given a sequence {a,} 
of real numbers and that there exist two se- 
quences (u,) and {v,} such that u, < a, S Vy, 
lim(u, —v,} =0, {u,} is monotonically increas- 
ing, and {v,} is monotonically decreasing. 
Then lim a, exists and is equal to lim u, = 
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lim v, (principle of nested intervals). In partic- 
ular, if lim sup a, 2 lim inf a,, then lim a, exists. 
The converse also holds. 

If {a,} is convergent, then |a, — a,,|0 as n, 
m- oo, and vice versa; that is, {a,,} is conver- 
gent if and only if for any positive number e 
there exists a positive integer n, such that 
|a, — a4| « € for all n, mz ng (Cauchy's 
criterion). 


D. Infinity 


For a set A of real numbers, the expression 
sup A= +% means that A is not bounded 
from above; inf A = —oo means that A is not 
bounded from below. For a sequence {a,} of 
real numbers, lim a, = +00 means that for any 
real number b there exists a positive integer no 
such that a, b for all nèng; the notation 

lim a, = —oo has a similar meaning. The sym- 
bols +œ and —o are called positive (or plus) 
infinity and negative (or minus) infinity, res- 
pectively. We say that the limit of {a,} is 

+00 (—00) if lim a, = +œ (—0o). In these 
cases, we customarily say that {a,} diverges (or 
is divergent) to -- oo (—00), or that a, becomes 
positively (negatively) infinite as n oo. We 
also define lim sup a, = +00 (liminfa, = —oo) 
to mean sup{a,} = +00 (inf{a,} = —oo). A se- 
quence {a,} is said to oscillate if lim supa, > 
lim infa,,. 

We now have the following propositions 
concerning sequences of numbers: If lim a, =a 
and lim b, = b, then lim(aa, + Bb,) 2 «a+ Bb, 
lim(a,b,) =ab, and lim(a,/b,) = a/b (provided 
that b, #0, b z 0). For sequences of real num- 
bers, these formulas also hold when a or b is 
infinity. In those cases we set à: (4-00) — to 
(a 2 0), a: (-£o0) 2 Fæ (a «0), x+% = Foi, 
a/(+00)=0 for a real number a. The cases 
0-(+00), + 00 +(~-00), +00/(+00) are 
excluded. 


E. Convergence of Sequences of Points in a 
Topological Space 


A sequence {a,} of points in a topological 
space (— 425 Topological Spaces) is said to 
converge to a point a if for any tneighborhood 
U of a there exists a positive integer ny such 
that a,€ U for all nz ng. The point a is called 
a limit (or limit point) of {a,} and we write 
bm, oa, =a OF A, a as n— oo. A sequence 
is said to diverge if it does not converge to any 
point. 

In particular, the set R of all real numbers is 
a topological space in which the set of inter- 
vals (a — 6e, a + €) for some &7 0 is a *base for 
the neighborhood system of a point a, so that 
the notion of limit in R explained previously is 
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a special case of the same notion in a topolog- 
ical space. By adding the symbols +00 and 
—oo to R, we obtain the topological space R, 
in which any set containing {x|x>a,xeR} 
((x|x «o, xe R]) for some «eR is a neighbor- 
hood of +œ ( —0o0), where the ordering is 
defined as —oo <a< +œ (xe R). Then lima, = 
+æ (—00) is interpreted as convergence in 
the topological space R. The elements of R are 
called extended real numbers. 

In the case where the topological space 1s a 
*metric space (— 273 Metric Spaces) with 
metric p, a,-»a is equivalent to p(a,, a). 

For convergence of sequences of points in a 
topological space, the following properties (S) 
hold: (S) (1). If a, =a for all n, then lima, =a; 
(2) if a,4a, then a, 0 for any subsequence 
{an 3; (3) if there is a point a such that any 
subsequence {an} of {a,} has a suitable sub- 
sequence converging to a, then a,a. In a 
*Hausdorff space (e.g., a metric space), the 
additional property (S*) holds: If a sequence 
{a,} has a limit, this limit is unique, and can be 
denoted lim a, 


F. Limits of Functions 


Let a real-valued function f(x) of a real vari- 
able x be defined for x za belonging to a 
neighborhood of the point a. We say that the 
limit of f(x) is b as x tends to a, and write 
lim, ,, f(x) 2 b or f(x) ob as xa, if for any 
positive number e there exists a positive num- 
ber ô such that 0z|x — a| « ó implies | f(x) — 
b| «s. Replacing 0z|x —a| «ó bya<x< 
actó(a—ó«x«a), we define f(x) b as 
x4 0 (x—4a-—0) and say that b is the 

limit on the right (left) of f(x) as x tends to a. 
We define f(x) +œ or f(x)» —oo as xa 
analogously to the case of sequences. The 
expression f(x)-b as x +œ means that for 
any positive number e there is a real number k 
such that | f(x) — b| «e for any x» k. There 

are similar definitions for x —^ —oo and b= 
too. When f(x) +0 as xa, we often say 
that f diverges definitely at a. 

In general, for a mapping f from a subset D 
of a topological space X into a topological 
space Y, with a point a in the closure of D and 
a point b in Y, lim, ,, f(x) 2 b or f(x) b as 
xa means that any neighborhood V of b 
contains f(U N D — (aj) for some neighbor- 
hood U of a. If Y is a Hausdorff space, b is 
unique (if it exists) for given f and a. This 
point b is called the limit (or limit value) of f(x) 
as xad. 

It is easy to see that this definition of 
lim, ,, f(x) 2 b is a generalization of the cases 
where the topological spaces are R or R. Let N 
be the set of all natural numbers, and let N= 
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NU(-oo) be supplied with the trelative 
topology as a subspace of R. A sequence {a,} 
of real numbers or points can be identified 
with a mapping f from N into R (NR) 
defined by f(n) — a,. It converges to a if and 
only if lim, ,,, f(n) — a. 

Suppose, in particular, that f is a mapping 
from a metric space (X, p) into a metric space 
(Y, o). Then, f(x)-»b as xa means that 
for any £0 there exists a 0 >0 such that 
ot f(x), b) « e for all xe D such that 0< p(x, a) 
« à. Thus f(x)b as xa if and only if f(x,) 
—b for any (x,) in D with x,—4a. If we set 
p(z1.z5) * |z4 — z5| for complex numbers z,, 
Z5, the function p supplies the set of all com- 
plex numbers € with a metric and C becomes 
isometric to the plane R? (C 74 Complex 
Numbers). Thus the cases X 2 € or Y=C are 
particular cases of the above generalization. 
Furthermore, we introduce the *Riemann 
sphere C=CU {co} by adding the tpoint at 
infinity oo to C. We can define a topology 
on Č such that any set containing {00} U 
{z||z|>r} for some positive number r is a 
neighborhood of oo. Thus we can define the 
notions f(z)-»b as zo oo, f(x) oo as xa, 
etc., for a complex-valued function f by con- 
sidering f as a mapping from the topological 
space Č into itself. Then f(x) oo is equiva- 
lent to 1/f(x)-0. 


G. Orders of Infinities and Infinitesimals 


Let f be a complex-valued function defined 
on a topological space X and a a point of X. 
Then f is called an infinity (at a) or an in- 
finitesimal (at a) if f(x) oo as xa or f(x) 
0 as xa, respectively. Suppose that f and 


g are infinities and f/g is an infinitesimal. Then 


f is said to be of lower order than g, and g is 
said to be of higher order than f. If both f/g 


and g/f are bounded, then f is said to be of the 


same order as g. This last relation, written f — 
g, is an equivalence relation. An infinity f is 

said to be of the nth order with g if f ~g". For 
two infinitesimals f and g, f is called of higher 
order than g and g of lower order than f if f/g 


is an infinitesimal. For infinitesimals, the terms 


of the same order and of the nth order are 
defined similarly as above. In particular, when 
X =C and a= œ, we usually omit the phrase 
"at oo." Also, for such a function f, we cus- 
tomarily say that the order of an infinity (in- 
finitesimal) is n if f —z" (z "). 

To describe the order of an infinity or an 


infinitesimal simply, the following notions, due 


to E. Landau [10], are in common use. Let 
f and g be two functions. If | f(x)/g(x)| is 
bounded as xa, then f is called at most of 
the order of g as x a, and we write f(x) — 
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O(g(x)) as xa. Second, if f(x)/g(x) is an in- 
finitesimal at a, then f is said to be of lower 
order than g as xa, and we write f(x)— 
o(g(x)) as xa. The symbols O, o, indicat- 
ing the word "order," are called Landau's 
symbols. The notation f(x) — h(x) - O(g(x)) 
means f(x) — h(x) 2 O(g(x)). When we use the 
symbols O, o we should indicate clearly the 
phrase “as xa," which is sometimes omitted 
when no confusion is to be feared (eg. for the 
case of a complex variable with a= oo). These 
symbols are employed for sequences as well, to 
describe their behavior as n oc. 


H. Convergence of Nets (Moore-Smith 
Convergence) [7,8] 


Let 9I be a (preordered) tdirected set. A family 
of points in a set X with index set 2 (namely, a 
mapping from 9I to X) is called a net in X. A 
net is denoted by (x,],.4 ((x,]q or (x,]). A net 
(vele in X is called a subnet of {x,},, if there 
exists a mapping o:98 9I such that (1) y; — 
Xp) and (2) for any a) € 9f there exists a fj, e 8 
such that f z fy implies o(p) z as. In particu- 
lar, if 8 is a cofinal directed subset of 9I, (x,),, 
is called a cofinal subnet of {x,},,. A net is 
called a universal net if either (x|x,e Y} or 

{a| x, € X — Y] is residual in A for any subset 
Y of X. For any net there is a universal subnet. 

For a net [x,], in a topological space X, 
{Xa} is said to converge to a point x in X if for 
any neighborhood U of x there is an ao such 
that {x,|& > x9) c U. Then a is called a limit 
of the net {x,}. We then write x, x (ae 9I) 
(or simply x, x). The convergence of se- 
quences of points is the special case where 
3I — N. The notion of convergence using nets 
was introduced by E. H. Moore and H. E. 
Smith. 

Concerning this convergence, we have the 
following propositions (D): (D) (1) If x,— x for 
all æ, then x, x. (2) If x, x and {y,} isa 
subnet of {x}, then y,—x. (3) If for a net {x,} 
there is a point x such that any subnet {y,} of 
{x,} has a suitable subnet converging to x, 
then x, x. (4) Suppose that x,x (xe W) 
and y,, x, (B €3B,) for each a. Then let C= 
A x FI B, be the direct product of directed 
sets with projections p: € 2 9I and p,:C>%,, 
and for ye € define z, — y,,, where «= p(y) and 
B — p,(y). Then z,—x (ye ©). Furthermore, the 
space X is a tHausdorff space if and only if 
we have the condition (D*): Any net in X has 
at most one limit. 

A limit of {x,} is denoted by lim x, or 
lim,,4 x,. Then x,x if and only if x is con- 
tained in the closure of any subnet {y| Pe B} 
of {x,}. (We may consider this to be a defini- 
tion of x, x.) 
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Convergence 


I. Convergence of Filters [9] 


Let X be a set. A set of subsets of X is called 
a filter if the following conditions are satisfied: 
(i) OED (Gi is the empty set); (i) AC Bc X 
and Ae® imply Be; (iii) A, Bec imply 

AN Be®. Let B be a set of subsets of X and 

Q be the collection of subsets of X such that 
each element A of ® contains a subset belong- 
ing to 38. If is a filter, then 9B is called a filter 
base which generates ®. 98 is a filter base if and 
only if (i) Ø ¢B; (ii) A, BEB implies that there 
is a Ce% with AN B>C. A filter ® is called 

an ultrafilter (or maximal filter) if there exists 
no filter which contains ® properly. For any 
filter there exists an ultrafilter containing it. If 
(0,14 is a family of filters, then the intersection 
(95, is a filter. If % is a filter base in the index 
set A, then | Ju. U \aem ®,) is also a filter. 

We denote by U(x) the tneighborhood sys- 
tem of a point x in a topological space X. A 
filter in X is said to converge to a point a, 
written Pa, if (x) c ®. A filter base B is 
said to converge to a if the filter generated by 
B converges to a. 

The convergence of filters just defined has 
the following fundamental properties (L): (1) 
for a point a in X the filter b, (AJae Ac 
X} converges to a; (2) for two filters ® and 
YF, d 5 a and OcY imply Ya; (3) if 6, 

a for all members in a family (45, of filters, 
then (6; — $a; (4) suppose that filters 
®,—>y are assigned for all points y in a subset 
Y of X, and that we are given a filter V in X 
which converges to a, generated by a filter 
base B in Y; then | J s.s ({\yex®,) >a. Fur- 
thermore, the space X is Hausdorff if and only 
if we have the condition (L*): Each filter in X 
has at most one limit. 


J. Relations among Various Definitions of 
Convergence 


Convergence of sequences of points is a special 
case of that of nets. Properties (1), (2), and (3) 
of (D) imply (1), (2), and (3) of (S), respectively, 
and (D*) implies (S*). Consider a net (x,, in 
X. Then the set ( {x |ue M, a> a5] | ao € A} of 
subsets of X is a filter base in X which gener- 
ates a filter 6, and 0x if and only if x, x. 
In this situation, (L) implies (D), and (L*) 
implies (D*). Suppose that we are given a 
function f: X ^ Y with the domain D and a 
point ae X. Let U(a) be the neighborhood 
system of a and assume that, for any Ue l(a), 
UND—{a}#@. Then the set ( f(U N D — 
(a))| U eM(a)] is a filter base. Let © be the 
filter generated by it. Then f(x) b as xa if 
and only if 6 —b. Consequently, the various 
types of convergence described previously can 
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be expressed by means of convergence of 
filters. 


K. Convergence and Topology 


In a topological space X, the concept of con- 
vergence of nets and that of filters can be 
defined. Conversely, convergence of nets in X 
defines a topology of X. In fact, let us assume 
that we are given a set X and a definition of 
convergence of filters which satisfies the prop- 
erties (L). Then convergence of nets that satis- 
fies (D) can be introduced as above. If A is 
defined as the set of limits of all nets contained 
in a subset A of X, then A satisfies the axiom 
of closures (— 425 Topological Spaces), and a 
topology can be defined on X. Then we have 
the following propositions: (i) ae A if and only 
if there is a net {x,} with x,e A converging to 
a; (ii) U is a neighborhood of a if and only if 
X, >a implies that there exists an o such that 
x,€U for all «zo. Thus, if X is a topological 
space, it carries a “new” topology defined by 
way of convergence of nets. But this “new” 
topology coincides with the original one. 
Similarly, starting from convergence of filters 
(or nets), we can obtain a “new” definition of 
convergence of filters (or nets), which coincides 
with the initial one. In conclusion, defining a 
topology on a space X is the same thing as 
defining convergence of filters in X or of nets 
in X. 

We shall describe here a few notions on 
topological spaces in terms of convergence. 
The fact that a topological space X is compact 
is equivalent to the fact that every universal 
net in X converges, and to the fact that every 
ultrafilter in X converges. Also equivalent is 
the fact that every net has a convergent subnet. 
A mapping f from a topological space X into 
a topological space Y is continuous at a if and 
only if one of the following conditions is satis- 
fied: (1) for any net {x,} in X converging to 
ae X, we have f(x,) f(a) in Y; (2) for any filter 
® in X converging to ae X, we have f(®)= 
(f(M)| Meo 5 f(a) in Y; (3) f(x) f(a) in Y as 
xa in X (in the sense of the limit of a func- 
tion at a). 

M. Fréchet [6] gave a definition of a topol- 
ogy on a space using the notion of conver- 
gence as a foundation. A set is called an L- 
space (or Fréchet L-space) if convergence of 
sequences of points in it is defined so as to 
satisfy conditions (1) and (2) of (S) and (S*) 
(1906). Such convergence is called star conver- 
gence if it also satisfies (3) of (S), and in that 
case the space is called an L*-space. For any 
subset A of an L-space X, define A as the set 
of all points a such that x, — a for some se- 
quence {x,} contained in A. Then the axioms 
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A 2A, AUB =AUB, and Ø = Ø are satis- 
fied, so that X is a *generalized topological 
space (the axiom A = A is not necessarily 
satisfied). For a Hausdorff space X with the 
‘first countability axiom and convergence of 
sequences defined by means of its topology, 
the closure operation defined above gives the 
same topology as the initial one. 


L. (o)-Convergence 


A sequence (a,; of elements of an ordered set 
S is said to be (o)-convergent to an element a of 
S if there exist two sequences Ju, and {vp} 
such that u, € a, € v, Ha S u,,4,, and v, 20,41; 
and a=supu, —infv,. When we write this 

à, a, properties (1) and (2) of (S) and (S*) con- 
cerning the convergence of sequences hold. 
Next, a sequence {a,} is said to be (o)-star 
convergent to a if any subsequence of {a,} has 
a suitable subsequence which converges to a. 
Then (o)-star convergence satisfies the prop- 
erties (S) and (S*). 

For any set X the set P(X) of all subsets of 
X is an ordered set under the inclusion rela- 
tion. The fact that a sequence (A, of subsets 
is (o)-convergent to a subset A is equivalent to: 


The set A is also equal to lim A,, which is the 
"limit of a sequence {A,} of subsets. 
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A. Convex Functions 


A real-valued function f(x) defined on a tcon- 
vex set D in a linear space over R is called a 
convex function if for every x, ye D and Ox 
A<1 we have 


fx c (1 — y) SAf(x) + (1 — 2)f(y). (1) 


The function f(x) is called a strictly convex 
function if the sign < in (1) is replaced by 
< for x Z y except when J is either 0 or 1. If 
— w(x) is convex (strictly convex), the function 
V (x) is called a concave function (strictly con- 
cave function). The notion of convex function 
was introduced by J. L. W. V. Jensen [1] for 
the case where the domain D is an interval on 
the real line R. 

Sometimes the condition for a convex func- 
tion is weakened in such a way that (1) is as- 
sumed only for 4— 1/2. However, if D is a 
*topological linear space and f is continu- 
ous, then the weakened condition implies the 
original one. Hereafter, we mainly consider the 
case where D is an interval on the real line. In 
this case, a convex function f(x) (in the weaker 
sense) is continuous in the interior of the inter- 
val if f(x) is *measurable or bounded from 
above on a set of positive measure (the latter 
was proved by A. Ostrowski [2]). In particu- 
lar, suppose that f(x) is defined in the interval 
I and is bounded from below. Then either f(x) 
is continuous or its graph is dense in the set 
{(x, y)xeL y 2 g(x)], where g(x) is a suitable 
convex continuous function (Hukuhara [3]). 
We note here that the original definition of a 
convex function f(x) implies the continuity of 
f(x) in the interior of the interval. In such a 
case, f(x) always has fright and tleft deriva- 
tives and satisfies f (x) x fi(x)« f^ (y)« 

f (y) for x< y. Hence it is differentiable ex- 
cept for at most countably many points. 

A function f(x) is a continuous convex 
function in a« x <b if and only if it is expre- 
ssible in the form 


x 


fei fta» | «(t) dt, 


a 
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where g(t) is a monotone increasing function. 
If f(x) is twice differentiable, then f"(x) 20 

(a « x « b) is a necessary and sufficient condi- 
tion for f(x) to be convex in (a, b). 


B. Convex Functions and Inequalities 


If f(x) is convex (in the original sense), we 
have, for a, >0, 


(Es Es )«E«/feo/ Ya. H 


Similarly, we have, for o 0, 


1 | ova] | ots) | oras] f oax (3) 


The functions x^ (a>1 ora«0), —x*(O<a< 
1), —logx, xlogx are strictly convex for x » 0, 
and the functions x?" (n> 1), exp x, log(1 +e”), 
./a* +x? (a 0) are strictly convex in —oo x 

x « +00. Applying the inequalities (2) or (3) to 
these functions, we obtain various inequalities, 
including the inequalities on means (— 211 
Inequalities). 

A continuous convex function f(x) over a 
topological linear space satisfying the relation 
f(x) — af(x) for an arbitrary positive number g 
is called a subadditive functional and is often 
utilized in functional analysis. 


C. M. Riesz's Convexity Theorem 


Let x =(€,,...,¢,) be an n-tuple of com- 
plex numbers, and let v>0. We put N,(x)= 
(25-, 16, for v>0 and No(x)  sup|£jl. 
Let (ol be an m x n complex matrix, x = 
(čis <5 o Z= (Cis ees eck VO, and 420. 
We put 


n 


Y » Gol, 


i-1j-1 


M(v,w)= ` sup 


NX) <1, Ny(z) <1 








Then log M (v, ul is a convex function of (v, u) 
in the following sense: Let 0 « v, € 1, 0 « u;« 1, 
and v,+ u;Z 1 (i— 1,2). Then log M((1— t)v, + 
tvz, (1 —t)u, - tu;) is a convex function with 
respect to t for0 x t« 1 [4,5]. These results are 
called M. Riesz's convexity theorem. Famous 
inequalities such as the tHdlder inequality or 
the Minkowski inequality follow from this 
theorem. For example, let T be an tadditive 
operator from the ‘function space L,(Q) into 
L,(Q) for all 1 <p< œ. If T is a continuous 
operator for p — 1 and p= oo, and the norm 
of T is <C for p=1 and p= oo, then T is con- 
tinuous for all p (1 « p « oo), and its norm is 
always « C. 
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D. Subdifferentials of Convex Functions 


Currently, convex analysis is playing an im- 
portant role in the study of nonlinear evolu- 
tion equations. This analysis treats convex 
functions on infinite-dimensional spaces [9]. 

In convex analysis, it is sometimes conve- 
nient to consider a proper convex function o 
on a Hilbert space X. A mapping ọ from X 
into (—00, +00] is called a proper convex 
function if o is not identically equal to +00 
and if 


p(Ax +(1—A)y) <AG(x) + (1 — A p(y) 
for every x, ye X and Ox A4« 1. The convex set 
Dill ixeX|o(x) «ooj c X 


is called the effective domain of o. The subdif- 
ferential Ze of o is defined to be a multivalued 
function which assigns to each x in D(ọ) all 
elements ye X satisfying o(£) 2 o(x)- (y, £— 
x» for any če X. Then Gg is monotone in 

X in the following sense: If y, €@@(x,) and 

y; €0o(x,), then 


[Yi — Y2: X1 —- Xn) 20. 


Under some conditions on q, Oo becomes a 
single-valued function, as is seen in the follow- 
ing example: Let A be a nonnegative self- 
adjoint operator in a Hilbert space X. For the 
"fractional power JA of A, define ọ by 


1 
sl Axl?, xe D(A), 


+00, otherwise. 


Then D(g)= D(./A) and 0o = A. 


o(x)- 


E. Convex Functions and Nonlinear 
Semigroups 


Let o be a lower semicontinuous proper con- 
vex function on a Hilbert space X. Since A= 

— 09 is a maximal 'dissipative operator in X, 
it generates a nonlinear semigroup (1,|tz 0] 

on the closed convex set D(A): 


Tx- lim (I—tA) "x 


(— 286 Nonlinear Functional Analysis, 378 
Semigroups of Operators and Evolution Equa- 
tions). It can be shown that for any ae D(A), 
T,ae D(A) whenever t>0. Thus the tabstract 
Cauchy problem 


du 
d -*0Q(u)80 (t0) 


u(+0)=a, 


is considered to be “parabolic.” Furthermore, 
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we have 


d 
d 


for all ae D(A) (H. Brézis [10]). 


«-la-Ta| (¢>0) 
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A. General Remarks 


A nonempty subset X of the n-dimensional 
Euclidean space R" is called a convex set if 
for any elements x, y in X and any number a 
such that 0<a<1, the element ax -- (1 — a)y 
of R" is also contained in X. The 'interior 
and the *closure of a convex set are also con- 
vex. A point x of a convex set X is called an 
extreme point of X if x cannot be expressed as 
(x, +x,)/2 in terms of a pair of distinct points 
cu, X2 in X. A bounded closed convex set is 
called a convex body if it has tinterior points. 
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Given an arbitrary nonempty subset X of R", 
the minimum convex set containing X exists, 
called the convex hull of X and denoted by 
[X]. Each point x of [X] can be expressed as 
Ae ZZ ax; where x; belongs to X and the a; 
are nonnegative numbers such that $7771 a; 1. 
When X is a finite set, [.X] is called a convex 
polyhedron. If X denotes the set of extreme 
points (also called vertices) of a convex poly- 
hedron X, then X 2 [X]. 

For elements x, y of R", denote the inner 
product by (x, y). Given a nonzero element v of 
R" and a fixed number a, the thyperplane H = 
(x|(v, x) 2a] divides the space R” into two 
thalf-spaces (x|(v, x)<a} and (x]|(v,x) z a], 
each of which is a closed convex set. If a con- 
vex set X is contained in one of the half-spaces 
S determined by the hyperplane H and the 
boundaries of X and H intersect, then we say 
that H is a supporting hyperplane of X and S is 
a supporting half-space of X. A closed convex 
set X is the intersection of its supporting half- 
spaces. A boundary point of a convex set X is 
contained in a supporting hyperplane of X. 
Given mutually disjoint convex sets X and 
Y, the separation theorems (1) and (2) hold. 

(1) If X has inner points, then there exist a 
nonzero element v of R" and a number a such 
that X is contained in the set [x |(x, v) 2 a] and 
Y is contained in the set (x|(x,v) <a}. 

(2) If X and Y are closed and X is bounded, 
then we can replace the signs < and > in (1) 
by « and >, respectively (when the separa- 
tion of convex sets X and Y is described by 
strict inequalities, we say that X and Y are 
strongly separated). 

As an immediate consequence of the separa- 
tion theorems, we obtain the following pro- 
position: Suppose that A is an m x n matrix 
with real entries. For an element z in a Eu- 
clidean space, we write z > 0 (7 0) if each 
component of z is >0 (>0). Now if 'Ay- 0 
never holds for an m-dimensional vector y » 0, 
then there exists a nonzero n-dimensional 
vector x >0 such that Ax «0 (— 173 Game 
Theory). 

The definitions given previously for subsets 
of R" can be naturally extended to the case of 
treal topological linear spaces (— Section G). 
Also, in the theory of analytic functions of 
several complex variables, various notions of 
convexity of the subsets of C" are considered 
(— 21 Analytic Functions of Several Complex 
Variables). 


B. Helly's Theorem 


Suppose that we are given an index set A of 
cardinality greater than n+ 1, and bounded 
closed convex sets C; (4e A) in R”. If any n+1 
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sets of the C; have nonempty intersection, then 
the intersection of all the C; is nonempty 
(Helly's theorem). 

This theorem has a wide range of applica- 
tions. For example, we have propositions (1)— 
(4). 

(1) If a convex set X of R" is covered by a 
finite number of half-spaces, then X can be 
covered by no more than n+ 1 half-spaces 
among them. (2) Let X and Y be finite subsets 
of R". X and Y are strongly separated by a 
hyperplane if for an arbitrarily chosen subset S 
of X UY consisting of at most n +2 points, the 
sets SN X and SN Y are strongly separated by 
a hyperplane. (3) If the tdiameter of a subset X 
of R" is not greater than 2, then X is contained 
in a tball of radius (2n/(n + 1))!2. (4) Let X be 
a convex body in R”. There exists a point x 
in X such that [x —u||/lv —ul| n/(n 4 1), 
where u, v are points of intersection of an arbi- 
trary straight line passing through x with the 
boundary of X, and ||x|| denotes the length 
(x, x)? of x. 

Helly's theorem can also be applied to 
problems of approximation of functions. 


C. Ovals 


The boundaries of convex bodies in R? and R? 
are called ovals and ovaloids, respectively. An 
oval E is a tJordan curve C(t) which admits at 
every point P, = C(to) left and right “tangents” 
L and I, , where I, is a straight line expressed 
as the set of points P(A), AeR such that P(A) 

— Py=Aa*, with a* —lim, , , 9(C(t)— P5)/|C(t) 
— Pol). There may exist exceptional points P 
on £ for which left and right tangents do not 
coincide, but the set of such points is at most 
countable. Each tangent L shares a point or a 
segment with E. A line satisfying this condition 
is called a supporting line of the oval. It is also 
a supporting hyperplane in R? of the convex 
body [E] in the sense of Section A. If we fix an 
interior point O of a convex body X and take 
an arbitrary point P different from O, then the 
boundary E of X admits one and only one 
supporting line /(P) which is perpendicular to 
the line OP and meets the half-line OP. Take a 
rectangular coordinate system with the origin 
O, and let (x, y) denote the coordinates of P. 
Then the points (E. n) on I(P) satisfy the equa- 
tion čx 4- py — H(x, y), where H(x, y) is a func- 
tion determined for all (x, y) in R? and satisfy- 
ing the following conditions: (i) HI, 0) — 0; (ii) 
H (tx, ty) - tH (x, y), for t>0; (ili) H(x, + x5, y, 
+ y3) < H(xi, yi) - H(x;, y2). The function 
H(x, y) is called the supporting line function of 
E. The magnitude and shape of E are deter- 
mined by H, and any function satisfying con- 
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ditions (1)-(iii) is a supporting line function of 
an oval. An oval E has a finite length L = L(E), 
and the convex body [E] has a finite area F = 
F(E). If OP' denotes the half-line with direc- 
tion opposite to that of OP and I’ denotes the 
supporting line /(P^, the distance between the 
parallel lines / and I’ is called the breadth of E 
in the direction PP’. Let D= D(E) and A=A(E) 
be the maximum and minimum of the breadth 
of E, respectively. D is the tdiameter of E (or 
[E ]), and A is called the thickness of E (or 

[E ]). In particular, if D =A, then the oval E is 
called a curve of constant breadth. In the fol- 
lowing inequalities, equality holds only when 
E is one of the figures mentioned in paren- 
theses: (1) L? 2 4xF (circles, J. Steiner (1838)) 
(— 228 Isoperimetric Problems); (2) xD? 2 4F 
(circles, L. Bieberbach (1915)); (3) L xD 
(curves of constant breadth, W. Blaschke 
(1916); (4) Fz A?/,/3 (regular triangles, J. Pal 
(1921)). See T. Kubota, Tóhoku Sci. Bull., I, 

12, 13; Tóhoku Math. J., 24, 49. 


D. Linear Combinations of Ovals 


Let H, and H, be supporting line functions of 
ovals E, and E,, and let t, and t, be positive 
numbers. Since the function t, H, +t, H, satis- 
fies conditions D) (iii), given before, it is a 
supporting line function of an oval E(t,, t). In 
this case, we can also write E(t,,t;) — t, E, + 
t; E, and call it a linear combination of E, 

and E- In particular, the oval (E, + E,)/2 is 
called the mean oval of E, and E,. In general, 
there exists a quantity M, called the mixed 
area of E, and E,, such that F(E(t,,t,))= 
F(E,)tî - 2Mt,t; + F(Ej)t2. M does not 
depend on the choice of t, and t,, and M? > 
F(E,)F(E,). Here, the equality holds if and 
only if E, and E, are homothetic and situated 
in a position of homothety. Furthermore, if 
Oxt«l, then the square root of F(E(t, 1 — t)) 
is a tconvex function of t (H. Minkowski). 


E. Specific Ovals 


Suppose that we are given an equilateral 
triangle ABC. Draw three circles C,, C,, and 
C, with centers A, B, and C and radii equal to 
the length of the sides of ABC. The minor arcs 
AB, BC, and CA of the circles form an oval 
which is called a Reuleaux triangle. This oval 
is of constant breadth. Furthermore, given a 
fixed breadth D, the area F(E) of an oval E of 
constant breadth D attains its minimum when 
E is a Reuleaux triangle. A Reuleaux triangle 
obtained from a triangle ABC revolves freely 
within the square of side AB and touches each 
side. In general, an oval which revolves touch- 
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ing the sides of a convex polygon from the 
inside is called an inrevolvable oval. Any such 
oval revolves inside some regular polygon (M. 
Fujiwara, S. Kakeya). 

Various properties of an oval already de- 
scribed can be generalized to the case of a 
boundary of a convex body in R". For the 
volumes of subsets A, B in R” and A+ B={x+ 
y|xeA, ye B] in R", the Brunn-Minkowski 
inequality 


[vol(A + B)]'" > [vol(A) ]"" + [vol(B)]'" 
holds. 


F. Convex Cones 


A nonempty subset X of R" is called a convex 
cone if for any elements x, y of X and a non- 
negative number a, ax and x + y are contained 
in X. A convex cone is a convex set. Given any 
nonempty subset X of R", the minimum con- 
vex cone K(X) containing X exists. Given two 
convex cones X and Y, a convex cone X + Y, 
called the sum of X and Y, is defined as the set 
of elements x + y, where x, y are elements of X, 
Y. The intersection of convex cones X and Y is 
also a convex cone. Given a convex cone X, 
the subset of R" consisting of the elements y 
such that (x, y) x 0 for any element x in X isa 
convex cone which is called the dual convex 
cone (or conjugate convex cone) of X, denoted 
by X*. If X is a finite set, K(X) is called a 
convex polyhedral cone. For example, if v 

is a nonzero vector, then the half-line (v) = 
(x|x—av, az: 0j or the half-space (v)* = 

{x|(v, x) <0} is a convex polyhedral cone. A 
convex polyhedral cone is closed. A convex 
cone X is a convex polyhedral cone if and only 
if X is the sum of a finite number of half-lines. 
Given convex cones X and Y, we have pro- 
positions (1)—(3): (1) If Y, c X,, then X* c 

Xf: Q) (X, + X* = XF OXF; (3) Xf + AF 
(X, X,)*. If X, and X, are convex poly- 
hedral cones, then Xf + X7 =(X, N X;)*. Gen- 
erally, X c (X*)* = X** for a convex cone X. 
If X is a closed convex cone, then X — X**. 
Namely, the duality principle holds for closed 
convex cones. A linear subspace of R" isa 
convex polyhedral cone. Also, if A is an m x n 
real matrix, the subsets [x | 4x 20, x 20] and 
(x| Ax 20} are convex polyhedral cones. Since 
the duality principle holds for convex poly- 
hedral cones, we obtain the "Minkowski- 
Farkas theorem (e, if A is an m x n real 
matrix and v is an element of R", then the 
equation Ay —v has a solution y>0 in R” if 
and only if (v, x) 20 for all xe R” such that 
‘Ax 2 0). (For linear inequalities — 255 Linear 
Programming.) 
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G. Convex Sets in Function Spaces 


The definitions of convex sets and convex 
cones in R" can be naturally extended to the 
case of any real linear space. Some of their 
properties can be generalized and applied to 
the case of function spaces. 

(1) Let E be a flocally convex real topolog- 
ical linear space satisfying Hausdorff's tsepara- 
tion axiom. Let A and B be convex sets in E, 
and assume that B has interior points and 
A! B is empty. Then A and B are separated by 
a hyperplane. Namely, there exists a nonzero 
tcontinuous linear functional f on E such that 
sup f(A) « inf f(B). 

(2) Let E be as in (1), and let C be a convex 
set in E. If a boundary point x of C admits a 
nonzero continuous linear functional f such 
that f(x) — sup f(C), we call such a point x a 
supporting point of C, and f a supporting func- 
tional of C. If C has interior points, then any 
boundary point x of C is a supporting point of 
C. 

(3) Let C be a closed convex set of a 
*Banach space E. The set of supporting points 
of C is dense in its boundary. 

A convex set C contained in the dual space 
E* of a real topological linear space E is called 
a regularly convex set if for any f, in E* not 
contained in C, there exists a point x, in E 
such that sup{ f(xo)| f € C] « fo(xo). 

Let E be a real topological linear space 
satisfying Hausdorff's separation axiom, and 
let C be a closed convex cone having 0 as its 
extreme point. Furthermore, assume that CN 
(— C) 2 (0). If we set x & y when y xeC, 

a partial ordering < is defined in E. For 
example, if E is R", then the positive orthant 
C (x -(x)|x; Z0, i=1,...,n} satisfies these 
requirements, and the partial ordering x < y 
defined by means of C is equivalent to the 
relation x; « y; for all i. 

Some of the properties of matrices of posi- 
tive entries or tintegral operators whose tker- 
nel functions are positive-valued can be gen- 
eralized to properties of mappings f: E E 
such that f(C) c C (— 255 Linear Program- 
ming; for the fKrein-Milman theorem — 424 
Topological Linear Spaces). 
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A. General Remarks 


Suppose that we are given a Euclidean plane 
E? and two lines X'X and Y'Y in E? per- 
pendicular to each other. Let O be the point of 
intersection of X'X and Y'Y. We identify each 
of the straight lines X'X and Y'Y with the set 
of real numbers R; the point O on each line is 
identified with zero. Let P be an arbitrary 
point in E?. We draw lines PQ, PR parallel to 
Y'Y, X'X, where Q, R are on X'X, Y'Y, respec- 
tively. Let x and y be the real numbers corre- 
sponding to Q and R. Thus we obtain a map- 
ping sending the point P to the ordered pair 
(x, y) of real numbers. This mapping gives a 
one-to-one correspondence between the points 
P of E? and the ordered pairs (x, y) of real 
numbers in R?. The numbers x and y are 
called the coordinates of P. 

In general, given a set of mathematical 
objects, if we have a mechanism that assigns 
quantities to each element of the set, then such 
a mechanism is called a coordinate system on 
the set, and the quantities corresponding to 
each element are called its coordinates. In the 
previous example, the mechanism is called a 
rectangular coordinate system. Coordinate 
systems are also useful in expressing quantita- 
tive concepts by geometric ones which are 
intuitively easier to grasp, c.g., diagrams of 
train schedules and tnomograms. "Map projec- 
tion, *graphical calculation, tdescriptive geom- 
etry, etc., may be viewed as applications of the 
concept of coordinate systems. 

In many cases, when we introduce a coor- 
dinate system in a space, it is determined 
uniquely by fixing a basic figure in the space. 
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In the case of a rectangular coordinate system 
on a plane E?, the basic figure consists of X'X 
and Y’Y, which are called coordinate axes (the 
point O is called the origin). Sometimes it is 
convenient to consider real-valued functions f 
and g on R and a coordinate system on the 
plane E? determined by the function that 
sends an arbitrary point P to (f(x), g(y)), 
where (x, y) are the coordinates of P in the 
rectangular coordinate system. Logarithmic 
papers, tprobability papers, and tstochastic 
papers (binomial probability papers), etc., are 
constructed in this way to fit their respective 
purposes. 

In various branches of mathematics there 
are many varieties of coordinate systems. In 
this article we deal with frames and coordi- 
nates, curvilinear coordinates, and local 
coordinates. 


B. Frames and Coordinates 


Suppose that we are given a space M and a 
*transformation group G acting on M. It is 
desirable to introduce a coordinate system 
that best represents the geometric structure of 
M. Let G, be a set of figures in M such that G 
acts tsimply transitively on G,. Each element 
of G, is called a frame. Utilizing each frame as 
basic figure, we introduce a coordinate system 
that is “G-invariant” in the following sense: 
Let ReG,, Xe M, and Cp(X) be the coordi- 
nates of X with R as basic figure. Then the 
coordinate system is G-invariant if C,(X)— 
C,r(gX) for any element g in G. If we have 
such a coordinate system for each ReG,, then 
the expressions of geometric properties of M in 
terms of the coordinates are independent of 
the choice of frames. 


(1) Projective Coordinates. Let M be an n- 
dimensional *projective space P" over a field 
K, and let G be the tprojective transformation 
group of P". As a frame we can take the system 
of n+1 points (Ag, A4,, ..., Ap) in general po- 
sition. The *homogeneous coordinates of an 
arbitrary point X € P" are given by the (n+ 1)- 
tuple (x9, x, ..., x,) satisfying the equation X 
= Lj=0 X;A;, x;e K. They are called tprojec- 
tive coordinates. In fact, if (xo, x4, ..., x,) Æ 
(0,0, ...,0), then (xo, x,, ..., x,) and Le, Ax,, 
4 AX4 (A #0) represent the same point in P". 
A thyperplane z of P" is expressed as the set 
of points whose coordinates (xo, x,, ..., Xp) 
satisfy a linear homogeneous equation 
2j-oXju; —0, u;e K. Therefore, the hyperplane 
1 is represented by the homogeneous coor- 
dinates (uo, u4, ... 
coordinates of z. 


,u„), called thyperplane 
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(2) Affine Coordinates. Let M be an n- 
dimensional taffine space E", and let G be the 
tgroup of affine transformations of E". As a 
frame we can take the system (O;e,,e,, ...,e,), 
where O, called the origin, is a point in E", and 
the set of vectors {e;} is a basis of the tstan- 
dard vector space of E". Then tinhomogeneous 
coordinates of an arbitrary point X e E" are 
given by the n-tuple (x1, X2, ---, Xp) where X = 
O + 3, x,e;. They are also called taffine 
coordinates of X in E". Sometimes we replace 
G above by the group of fequivalent affinities 
and consider the frames (O;e,,e;, ..., e,) such 
that the volume of [e,, ...,e,] — 1. 
Furthermore, if E" has the structure of a 
tEuclidean space, we sometimes replace G 
by the *group of motions and consider a sys- 
tem of rectangular coordinates determined 
by an torthogonal frame, that is, a frame 
(0; e,, e; ..., e,) such that the inner product 
(e; ej) — ój, where ô is the *Kronecker delta. By 
contrast, the general affine coordinate system 
of a Euclidean space is called a system of 
oblique coordinates. In this case the inner prod- 
ucts (e;, ej) — gj; are invariants of Euclidean 
geometry, and the distance p between two 
points (x;) and (y;) is given by p=(2? jar gin, 
—x,)(yj—x;,))’”. We sometimes consider an 
oblique coordinate system satisfying (e ej) = 1 
(i— 1, ..., n). In such cases the angle 6,; between 
two basis vectors e; and e; is determined by 
dij ^ cos 6j. 


(3) Barycentric Coordinates. In an n- 
dimensional affine space E", we take n+ 1 
linearly independent points Ap, 4,,..., A, and 
denote the position vectors from a point O to 
these points by ap, a,, ..., a,, respectively. 
Then for any point X € E" there exists a unique 
set of numbers (Ay, 1, ..., 4,) such that X =O 
+ Dye 4j8j, 2j-9À;— 1. We call these numbers 
barycentric coordinates of X in E". They are 
independent of the choice of the point O. 


(4) Plücker Coordinates. Let V (n, m) be the 

set of all m-dimensional subspaces in an n- 
dimensional projective space P". Then V (n, m) 
has the structure of a tGrassmann manifold. In 
order to introduce a coordinate system on 

Hi, m), we fix a projective coordinate system 
on P". An m-dimensional subspace ze V (n, m) 
in P" is spanned by m+ 1 independent points 
Bo, B,, ..., B,,e P". We denote projective co- 


ordinates of these points by (b9,), (bj), ..., (bj) 
and construct the determinants 
boja boj, wee boj, 
Po. Wo uh he Jr Deeg ie oe S US io, ln SN. 
bj Pmj, Ad bj, 


Then the subspace z can be represented by 
homogeneous coordinates (..., p; ;... j , +++): 
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These coordinates are independent of the 
choice of m+ 1 points that span x and are 
called Plücker coordinates (or Grassmann 
coordinates) of z in Hin, m). In these coordi- 
nates, the p; j,...j„ are alternating and satisfy 
the Plücker relations 


m+1 


where j, means that j, is removed. In partic- 
ular, when n=3 and m=1, we have only one 
Plucker relation Q: po; P23 — Po2P13 + Po3P12= 
0, which is a homogeneous equation of the 
second degree. In other words, the set V(3, 1) 
of all lines in a 3-dimensional projective space 
P? is realized as a quadric surface Q in a 5- 
dimensional projective space P? that has 
(Po1:Po2s Pos: P12» P13» P23) as projective co- 
ordinates. Moreover, when P? is a complex 
projective space, we put 
Porz=Sitics, Dos 65 ies, 


Pi3 = — 6, + ity, 


Poi = Go t ids, 


P23 7 6o — i65, D12765— i65 


and obtain a relation 
Got et + 4 F462 +62 =0 


corresponding to the Plücker relation. Thus 
every line in P? can be represented by homo- 
geneous coordinates (čo, £,, ..., Čs), which we 
call Klein’s line coordinates. 


(5) (n + 2)-Hyperspherical Coordinates. Let 

(Xos Ei Xn Xo) be projective coordinates in 
an (n + 1)-dimensional real projective space 
P"*!, An n-dimensional tconformal space S" 

is realized as a quadric hypersurface S” in 
PITT: j=1 GyjXiXj— 2xox., =0, where (gj) is a 
positive definite symmetric matrix. A general 
point in P"*! represents a thypersphere of S". 
That is, a hypersphere represented by a point 
X €P"*! is realized as the intersection of S" 
with the tpolar hyperplane of X with respect 
to S"; according as X lies outside of S", on S", 
or inside of S", it represents a real hypersphere, 
a point hypersphere, or an imaginary hyper- 
sphere. Therefore any hypersphere of S" in 
P"*! is expressed by homogeneous coordinates 
(Xo, X4, ..., X4, Xœ) called (n + 2)-hyperspherical 
coordinates. When n —2, they are called tetra- 
cyclic coordinates, and when n= 3, pentaspher- 
ical coordinates. Therefore, if we restrict (n + 2)- 
hyperspherical coordinates for points on S", 
then they satisfy the quadratic relation stated 
before. In the frame (Ay, 4,, ..., Ans Aœ) of P^*! 
which defines the (n + 2)-hyperspherical co- 
ordinates, Ay and A,, are points on S”, and the 
other A, are real hyperspheres passing through 
the points A and A. It is possible to choose 
a frame (Ap, A,, ..., An, Aœ) such that the equa- 
tion for S” becomes E, x? - 2xox,, — 0 (i.e., 
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gi 79). Among hypersurfaces in S”, one that 
is expressed by a homogeneous equation of 
the second degree with respect to (n+ 2)- 
hyperspherical coordinates is called a cyclide. 
It is an algebraic surface of the fourth order 
and is an enveloping surface of the family of 
hyperspheres that are tangent to n fixed 
hyperspheres. 


(6) Moving Coordinates. When we study the 
differential geometry of an m-dimensional 
surface W in a space M on which a transfor- 
mation group G acts, it is often preferable to 
take a frame or frames at each point of W and 
consider a tconnection among them. These 
frames are called moving frames, and the set of 
coordinate systems with respect to moving 
frames is called a moving coordinate system 
(— 111 Differential Geometry of Curves and 
Surfaces). 


C. Curvilinear Coordinates 


Let (x4, x52, ..., x,) bea rectangular coordinate 
system on an n-dimensional Euclidean space 
E". If x;=x,(u,,U2,...,u,), 1— 1, ..., n, are func- 
tions of n variables (u,, us, ...,u,) of class 

C (rz 1) and the tfunctional determinant 
D(x,, ..., x,)J/D(u,, ...,u,) is not equal to zero 
in some open domain, then (u,,u5,...,u,) are 
considered local coordinates in E". We call 
them curvilinear coordinates of E". A hypersur- 
face u; — constant (obtained by fixing the value 
of one of the variables u,) is called a coordinate 
hypersurface, and a curve u;— constant (j zi) is 
called a coordinate curve. The line element ds 
of a Euclidean space E" is given by 


Thus Ei is equipped with a *Riemannian 
metric. However, as E" is tflat, its tcurva- 
ture tensor satisfies Rj,, — 0. If the metric is 
diagonal, namely, if ds? = 37. , gidu}, the co- 
ordinates are called orthogonal curvilinear 
coordinates. Moreover, if g, =...=g,, the 
coordinates are called isothermal coordinates. 
The metric is diagonal if and only if coordi- 
nate curves are mutually perpendicular at the 
points of intersection. Actually, curvilinear 
coordinate systems that are often used practi- 
cally are diagonal. The concept of curvilinear 
coordinates has been generalized to the case of 
tdifferentiable manifolds and is utilized to 
determine their local coordinates. 

On any 2-dimensional Riemannian manifold 
there always exist isothermal coordinates in a 
neighborhood of any point [7]. 
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(1) Curvilinear Coordinates on Planes or Spaces 
(— Appendix A, Table 3.V). Let (x, y) be rect- 
angular coordinates of a point in a Euclidean 
plane E?. We have the following coordinate 
systems on E?: 

Polar coordinates (r, 0), where 
x—rcosÜ, y=rsiné. 

Elliptic coordinates (A, 4), where 
x? 2 (4 a?) (u - a*)/(a? — b?), 
y! 2G - b?)(u- RW —a?), 
a>b>0, s>—b?>py>—a?. 
Parabolic coordinates (o, 8), where 


x=—(a+f), y=./—4a0f, x20». 


Equilateral (or rectangular) hyperbolic co- 
ordinates (u, v), where 
x=uv, y=(u? —v?)/2. 

Bipolar coordinates (¢, 7), where 
x=asiné/(cosh ë + cosy), 
y=asinn/(cosh č + cosy), 
—0o<€<0, Oxmx2m. 


Next we consider the case of a 3- 
dimensional Euclidean space E? and let (x, y, z) 
be rectangular coordinates on E?. The follow- 
ing systems of coordinates on E? are some- 
times useful. 

Cylindrical coordinates (r, 0, z), where 
x-rcosÜ, y=rsin@, z=z. 

Spherical coordinates (r, 0, o). where 
x-—rsinÜcoso, y=rsinésing, z=rcosé. 
Ellipsoidal coordinates (A, o. v), where 
(A a?) (i-a?) (v a?) (a? — b?) (a? — c?), 
y =(A+b*)(u+b?)(v + b?)/(b? — c?)(b? — a”), 


z? =(A+c?)(u+c7)(v +c7)(c? — a”) (e? A 


x? 


a>b>c>0, A> —cà»-u-—b-v-a. 


These coordinate systems are all systems of 
orthogonal curvilinear coordinates. Suppose 
that we are given two rectangular coordinate 
systems (£, y, C) and (x, y, z) sharing the same 
origin. The correlation of the two is given by 
Euler's angles (0, o, Y), where 0, o, and y are 
the angles between the z-axis and ¢-axis, zx- 
plane and z¢-plane, and €¢-plane and ¢z-plane, 
respectively. The Euler angles 0, o, and y are 
subject to the inequalities 0 x 0 x x and Ox q, 
y « 2n. They are often utilized in the dynamics 
of rigid bodies. 


(2) Multipolar Coordinates. Let P,, P. P, 
be m points in general position in an n- 
dimensional Euclidean space E", m x n. If we 
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denote by p; (20) the distance between a point 
X of E" and P,, then (p,, p;, ... Pm) can be re- 
garded as coordinates of a point X contained 
in a suitable domain of E". They are called 
multipolar coordinates. In particular, if m — 2 
they are called bipolar coordinates, and if m — 3, 
tripolar coordinates. When m n, these coor- 
dinates satisfy m — n relations. Next let «,, 

Az, ..., Xm be m hyperplanes in general position 
in E", mx n. For an arbitrary point X of E", we 
denote by č; the directed distance of X from 
each hyperplane a, The m-tuple të, 55, ..., al 
provides coordinates of X that are called 
multiplanar coordinates in E". When m^ n, 
these coordinates satisfy m — n relations. In 
particular, when n —2 and m=3, they are 
called trilinear coordinates. In this case, if we 
denote by S the area of the triangle defined by 
three lines o. 45, x4, and by a,, a5, a, the 
lengths of the three sides of the triangle, then 
the trilinear coordinates (£,, €,, €) satisfy a 
linear relation a; č; +a,€, - a4044 228. 


(3) Tangential Polar Coordinates. In a Eu- 
clidean plane E?, we take a directed line lo 
passing through a point O. For an arbitrary 
directed line g, let p be the directed distance 
between O and g, and let 0 be the angle be- 
tween lo and g. Then (p, 0) are called tangential 
polar coordinates (Fig. 1). They are useful for 
representing tangent lines to curves in E?. Let 
C be an foval in E?. A line is called a tsupport- 
ing line of C if its intersection with C consists 
of a point or a line segment. In this case we 
take the origin O inside C and consider the 
coordinates (p, 0) of the supporting lines of C. 
Then the equation of C can be represented as 
p= p(0), where p(0) is a periodic function of 
period 27. The coordinates (p, 0) are especially 
useful when the function p(0) can be expanded 
in a tFourier series. In the case of Euclidean 
space E?, the notion of tangential polar co- 
ordinate system can also be defined by using 
tangent planes. 





Fig. 1 
Tangential polar coordinates. 


(4) Normal Coordinates. Let M be an n- 
dimensional tRiemannian manifold, and let 
T4(M) be the tangent space to M at a point A. 
For each tangent vector ve Ti(M), we draw a 
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*geodesic through A with the initial direction v 
and take a point P on the geodesic such that 
the distance from A to P is equal to the length 
of v. Then the correspondence that sends v to 
P is a !diffeomorphism of a neighborhood of 
the zero vector 0 of T,(M) with a neighbor- 
hood of A in M. Therefore the components 
(cl. v?, ..., v") of v with respect to a basis of 
T,(M) give the coordinates of the points P 
contained in a suitable neighborhood of A. We 
call them normal coordinates about the point 
A of M. In these coordinates, each geodesic 
passing through A is given by equations v' 

— ar (i— 1,2, ..., n), where the («') are compo- 
nents of the unit vector in the direction of v 
and r is the parameter that represents the arc 
length from A to the point (t!, ..., v"). In partic- 
ular, when n —2, we fix a tangent vector v, at 
A and denote the angle between v and v, by 0. 
Then (r, 0) are coordinates of P called geodesic 
polar coordinates. The notion of normal co- 
ordinates can also be defined for tLie groups 
or differentiable manifolds with taffine 
connections. 


D: Local Coordinates 


Suppose that we have a space M that has a 
covering by a family of open neighborhoods 
with coordinate systems. If, for each pair of 
neighborhoods with nonempty intersection, 
the coordinate transformation in the intersec- 
tion satisfies certain specified conditions, then 
a mathematical structure on M can be defined. 

Let E be a *topological space. Suppose that 
® is a family of open sets in E such that the 
union of any number of open sets in and the 
intersection of any finite number of open sets 
in ® also belong to ®. A set I of fhomeomor- 
phisms is called a pseudogroup of transforma- 
tions on E if T satisfies the following three con- 
ditions: (i) Any homeomorphism fer is de- 
fined on an open set U ed, and f(U)e 6. (ii) 
When an open set U eO is expressed as the 
union of a family (U;) of open sets U;e®, a 
homeomorphism f defined on U belongs to T 
if and only if its restriction to each U; belongs 
to F. (iii) For any open set U eg, the identity 
mapping on U belongs to I, and if f, gel, 
then the inverse f~" and the composition go f, 
if it exists, belong to T. 

Let E and M be topological spaces. A 
homeomorphism o: U —V of an open set U in 
E to an open set V in M is called a local co- 
ordinate system of M with respect to E. For 
two local coordinate systems o U, — V, and 
(5: U, — V5, the homeomorphism 
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is called a transformation of local coordinates. 
Let T be a pseudogroup of transformations 
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on E. A set & of local coordinate systems of M 
with respect to E is said to define a T -structure 
on M if & satisfies the following two con- 
ditions: (1) the totality of the images of local 
coordinate systems belonging to X covers M; 
(2) if two local coordinate systems o, and ọ, of 
X have a transformation of local coordinates, 
it belongs to I. Now suppose that two sets X 
and 2” of local coordinate systems define T- 
structures on M. If the union of X and X" 
defines a T structure, then we say that the first 
two I -structures are equivalent. Let T bea 
pseudogroup of tdiffeomorphisms, each de- 
fined from an open subset of the n-dimensional 
space R” onto another open set. If a T- 
structure is defined on a space M, then M is 

an n-dimensional ‘differentiable manifold. 

On the other hand, let be a pseudogroup 
of complex analytic homeomorphisms in an n- 
dimensional complex number space C". If a T- 
structure is defined on a space M, then M is an 
n-dimensional tcomplex analytic manifold. 
Locally homogeneous spaces, tfoliated mani- 
folds, the tfiber bundles are all equipped with 
local coordinate systems with suitable T- 
structures. 
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A. General Remarks 


A continuous mapping p: Y Y of an farcwise 
connected topological space Y onto a con- 
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nected topological space Y is called a covering 
mapping (covering map) if the following con- 
dition (C) is satisfied: (C) Each point of Y has 
an open neighborhood V such that every 
*connected component of p! (V) is mapped 
homeomorphically onto V by p. Here we note 
that Y is in fact arcwise connected. 

If there is a covering mapping p: Y Y, we 
call f a covering space of Y and (Y, p, Y) a 
covering. In particular, for a tdifferentiable 
manifold Y, if Y is also a differentiable mani- 
fold and p is differentiable, then F is called a 
covering (differentiable) manifold of Y. (In the 
theory of *Riemann surfaces, a tcovering sur- 
face may have some tbranch points violating 
condition (C). Upon removing such points, we 
obtain a covering space as defined above.) 

For each tpath w:I> Y (I-[0,1]) of Y, a 
path W:I— Y with po w=w is uniquely deter- 
mined by the point Ww(1)ep '!(w(1)) and a 
bijection w,:p '(w(1))-»p !(w(0)) is deter- 
mined by w, (W(1)) — w(0). Thus there exists a 
one-to-one correspondence between p~! (y) 
and p ‘(y’) for every pair of points y, y' of Y, 
and (T, p, Y, p^! (y)) is a flocally trivial fiber 
space with discrete fiber p ^! (yo). When the 
cardinal number of p! (y) is a finite number n, 
we call (Y, p, Y) an n-fold covering. In this case, 
for a tloop wl, J) (Y, yo) with base point yo, 
w,:p (Vo) >p !(yo)is a permutation of the n 
elements in p ! (yy), and we obtain a homo- 
morphism of the "fundamental group z,(Y)— 
T, (Y, yo) of Y into the tsymmetric group G,, 
given by the correspondence w— w,. The 
permutation group Wt, which is the image of 
this homomorphism, is called the monodromy 
group of the n-fold covering. 

Two coverings (Y, p;, Y) (i — 1,2) are said to 
be equivalent if there is a homeomorphism 
9: Y, Y, with p,og=p,; such a o is called 
an equivalence. In particular, a self-equivalence 
9: Y Y of a covering (Y, p, Y) is called a cover- 
ing transformation. The set z of all covering 
transformations forms a group by the compo- 
sition of mappings, which is called the covering 
transformation group of Y. We call (Y, p, Y) 

a regular covering if for each ye Y and ji, 
Dep (y), there exists a unique covering 
transformation that maps j, to ĵ,. In this 
case, the forbit space Y/z is homeomorphic 
to Y, (Y, p, Y, n) is a tprincipal bundle, and 
the monodromy group Mi is isomorphic 

to m. 

For a covering (T, p, Y), we call Ÿ a covering 
group of Y if Y and Y are topological groups 
and p is a homomorphism. Then (F, p, Y) is a 
regular covering, and its covering transfor- 
mation group is isomorphic to p! (e) (e is the 
identity element of Y), which is a discrete 
subgroup lying in the center of Y (— 423 Topo- 
logical Groups O). 
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B. Universal Covering Spaces 


For a covering (Y, p, Y), we have the following 
relations of the thomotopy groups: p,:n(Y)5 
n;(Y) is isomorphic (monomorphic) for i22 
(i— 1), and z, (Y)/p,(z, (Y)) is in one-to-one 
correspondence with p! (yg) (yge Y). If f 

is tsimply connected, Y is called a universal 
covering space of Y; if in addition Y is a cover- 
ing group, Y is called a universal covering 
group of Y. 

For a flocally arcwise connected space Y, 

a covering (Y, p, Y) is regular if and only if 
p,(,(¥)) is a normal subgroup of z, (Y), and a 
universal covering space of Y is a covering 
space of any covering space of Y. Moreover, if 
Y is a topological group, any covering space Y 
of Y can be given a unique topological group 
structure with which Y is a covering group of 
Y. 

Let Y be an arcwise connected, locally arc- 
wise connected, and flocally simply connected 
space. Then the following classification theo- 
rem of coverings holds: The set of equivalence 
classes of coverings of Y is in one-to-one cor- 
respondence with the set of conjugate classes 
of subgroups of the fundamental group z,(Y); 
in particular, the equivalence class of a cover- 
ing (Y, p, Y) corresponds to the conjugate 
class of the subgroup p, (Y). Also, there is a 
unique universal covering space Y of Y up to 
homeomorphism. If in addition Y is a topolog- 
ical group, then Y is a unique universal cover- 
ing group of Y up to isomorphism of topolog- 
ical groups. Such a space Y is obtained as 
follows: Consider the tpath space Q(Y; yo, Y) 
of all paths in Y starting from a fixed point 
yo€ Y, and define two paths wo, w,:(1,0)— 

(Y, Yo) such that wo(1) 2 w,(1) to be equiva- 
lent if and only if there is a *homotopy w: 
(1,0) (Y, yo) with w,(1) 2 wo(1) (0 € t € 1). 


Then we obtain the identification space Y of ` 


Q(Y; yo, Y) by this equivalence relation and 
the mapping p: Y^ Y by p{w} = w(1); this P is 
the universal covering space of Y. 

Let (Y, p, Y) be a regular covering with the 
covering transformation group z. Then there is 
a "locally trivial fiber space (Y', q, B, Y) such 
that the total space Y' has the same (co) ho- 
mology groups as Y, and the base space B is 
an tEilenberg-MacLane space K (z, 1). The (co) 
homology spectral sequence of this fiber space 
is called that of the given regular covering 
(Y, p, Y), E, is a bigraded module tassociated 
with a certain filtration of the fsingular (co) 
homology module H(Y), and E, is the (co) 
homology module H(z; H(Y)) of the group z, 
where z operates on the coefficient module 
H(Y) via the induced homomorphisms of 
covering transformations (— 148 Fiber 
Spaces). 
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For any group 7, there is a regular covering 
(E, p, B,) with the covering transformation 
group z such that E, is tcontractible; B, is an 
Eilenberg-MacLane space K(z, 1). We can take 
S! (1-sphere) as B; of the infinite cyclic group 
Z, and the following infinite lens space as Bz, 
of the finite cyclic group Z, (— 70 Complexes). 


C. Lens Spaces 


Let k be a positive integer and /,, ...,1, be 
integers prime to k. Let S?"*! = {(zo,...,z,)€ 
C"! [|z]? 4 ... - |z, ^ 9 1) be the unit sphere 
in the (n+ 1)-dimensional complex linear space 
CTT, and define the rotation y by y(zo, z,, 

<-s Zn) — (zo exp 2ni/k, z , exp 271, i/k, ..., 

z, exp 2rl,i/k). Then the torbit space S?"* ! /Z, 
— L(k; lj, ...,L), where Z, = Z/kZ is interpreted 
as the cyclic group generated by y, is called 

a lens space. It is an orientable (2n + 1)- 
dimensional tdifferentiable manifold. Also, the 
infinite lens space L®(k)= L(k; 1, ..., 1, ...) 

is defined by taking n= oo; the infinite sphere 
S°” is a k-fold covering space of L®(k), and 
L®(k)= Bz, = K(Z,, 1). Its tcohomology ring 

is given as follows. 

(1) For integral coefficients, H?'*! (L^(k)) — 
0, H?'(L*(k)) 2 Z, (i> 0), and the *cup prod- 
uct of generators of degree 2i and 2j is a gen- 
erator of degree 2(i +j). 

(2) Let k 2 pk' (p is a prime). If p#2, or p=2 
and K' is even, H*(L®(k); Z,)= A(e)) © Z,Le;]. 
If p=2 and K' is odd, H*(L” (k); Z;) - Z;[e,] 
(e; is an element of degree i). Here A indicates 
the texterior algebra over Z,, and Z,[ ] the 
*polynomial ring over Z,. 

Two lens spaces L(k;1,,...,1,) and L(K'; li, 
... h) are of the same homotopy type if and 
only if k=k’ and there is an integer m prime 
to k with 


l... = m", ... (mod k) 


[8]. Furthermore, the condition k=k’, l= 

t I'*! (mod k) holds if and only if L(k; I) and 
L(k';l') are homeomorphic. (Sufficiency is 
shown in [1]; necessity follows from the fact 
that the *Hauptvermutung is valid for com- 
binatorial 3-manifolds and that the condition 
holds if the polyhedra L(k; l) and L(k;l') have 
isomorphic subdivisions [6].) Also — [7,11] 
and 65 Combinatorial Manifolds. — 
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A. Space Groups and Point Groups 


Let G be a discrete subgroup of the group of 
tmotions © in the real n-dimensional Eu- 
clidean space V. If G contains n linearly inde- 
pendent translations, then G is called an n- 
dimensional crystallographic space group, a 
space group, or a crystallographic group. The 
following assertions are known to be equiva- 
lent to each other: (i) A discrete subgroup G of 
© is a space group; (ii) 6/G is compact; (iii) 
V/G is compact; (iv) There is a compact subset 
P of V such that V- GP—| J,.;gP [1]. The 
subgroup T consisting of all *translations in G 
is called the lattice group of G. T is a normal 
subgroup of G and is generated by n linearly 
independent translations, say, t,,t5, ...,t,. 
Take a point x of V. Then the T-orbit of the 
point x is called a lattice of G (or T). t,, ...,t, 
can be identified with a basis of V. Let K be 
the quotient group G/T and k=gT an element 
of K. Then k gives rise to a linear transforma- 
tion k on T via the formula k(t) gtg ! (the 
product in G), te T, and K can be regarded 

as a subgroup of the torthogonal group of 

V via the above identification of T with a 
lattice in V. That is, the set of orthogonal 
transformations stabilizing x, (g|(g(y) =x +. 
g(y) - g(x), ye V, ge Gj forms a group isomor- 
phic to K. The quotient group K is a finite 
group and is called the point group of G. A 
group isomorphic to a point group of an n- 
dimensional space group is sometimes called 
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an n-dimensional crystallographic group. An 
algebraic characterization of space groups 
among abstract groups is given by “A group G 
is isomorphic to an n-dimensional space group 
<= G has a normal tfree Abelian subgroup of 
trank n, which is maximal Abelian and has 
finite index” [1]. 

By representing the action of K on T in 
terms of a basis of T, K can be regarded as a 
subgroup of GL(n, Z). Hence we obtain the 
following crystallographic restriction: If K 
contains an element of order m, then n> o (m). 
Here ¢ is the tEuler function. For example, 
if n=2, 3, then m=1, 2, 3, 4 or 6; and if n= 
4, 5, then m= 1, 2, 3, 4, 5, 6, 8, 10, or 12. The 
number of nonconjugate finite subgroups of 
GL(n, Z) is finite (Jordan-Zassenhaus theorem; 
— [2, 3,21]). 

Two space groups G, and G, are called 
equivalent if they are conjugate via an ‘affine 
transformation f of V, ie. G, = fG,f |. Let T, 
and K; be the lattice group and the point 
group of G; (i— 1,2), respectively. Then T, = 
hT, and K, — hK,h ! for the lincar trans- 
formation h induced by f. Moreover, there is 
an orthogonal transformation k such that 
K,=kK,k +; this k can naturally be consid- 
ered as an element of GL(n, Z). G, and G, are 
equivalent <> G, and G, are isomorphic (as 
abstract groups) [1]. In applications it is often 
required that in the definition of equivalence f 
be an forientation-preserving affine transforma- 
tion. Under this definition, there can be a pair 
G,, G, such that G, and G, are mutually iso- 
morphic but not equivalent, since they are 
conjugate only by means of an forientation- 
reversing affine transformation. In this case, 
they are called enantiomorphous to each other, 
or the pair is called an enantiomorphic pair 
(Table 1 below). For a given dimension n, 
there is only a finite number of equivalence 
classes of space groups [1]. We also conclude 
from this that point groups have finitely many 
equivalence classes up to conjugacy in the 
orthogonal group O(V) or in GL(n, Z). 


B. Crystal Classes, Bravais Types 


Let T be an n-dimensional lattice in n- 
dimensional Euclidean space V, and K a finite 
subgroup of the orthogonal group O(V). De- 
note by a pair (T, K) a tfaithful linear trepre- 
sentation of K on T, i.e., a *monomorphism 
of groups K 5 Aut(T). A space group G deter- 
mines a pair (T, G/T). Conversely, for any pair 
(T, K) the following holds: “Any group texten- 
sion of K over the tkernel T is isomorphic to a 
space group" (— the algebraic characteriza- 
tions in Sections A and C). Thus to each pair 
(T, K) there corresponds a set of space groups. 
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Two pairs (T; , K,) and (T,, K;) are called 
arithmetically equivalent (this is denoted by 
(Tj, K)) &(T,, K,)) if there exists an invertible 
linear transformation ge GL(V) such that T, 
—gT,, Kj,—gK,g !. These pairs are called 
geometrically equivalent or simply equivalent 
(this is denoted by (T,, K,)~(T), K,)) if there 
exists a ge GL(V) such that K,=gK,g~'. The 
relation ~(~) is an equivalence relation, and 
equivalence classes are called arithmetic (geo- 
metric) crystal classes. The set of geometric 
crystal classes is in one-to-one correspondence 
with the set of equivalence classes of point 
groups up to conjugacy, and is sometimes 
called the set of crystal classes. The geometric 
crystal class of (T, K) is usually denoted by K. 

Now let T be a lattice. Then the group of all 
orthogonal transformations that leave T in- 
variant is called the Bravais group of T and 
is denoted by B(T), i.e., B(T)-(geO(V)|gT— 
T}. The group B(T) is finite and determines 
a pair (T, B(T)). Two lattices T, and T, are 
called arithmetically equivalent (this is denoted 
by Tj = T,) if (T,, B(T,)) and (T,, B(T,)) are 
arithmetically equivalent. An equivalence class 
is called a Bravais type, and an arithmetic 
crystal class determined by (T, B(T)) is called a 
Bravais class. For each Bravais type, a repre- 
sentative is said to be its Bravais lattice (Fig. 3 
below). 

Define an torder relation on the set of lat- 
tices belonging to an arithmetic crystal class as 
follows. When (T,, K,) «(1,, K;), define T; < T, 
if there exists a ge GL(V) such that T, =gT,, 
K =gK g, and B(T;) cgB(T,)g !. Note that 
T; = T; «» T, € T, and T, <T. In each arith- 
metic crystal class, there is a pair (T, K) with a 
tminimal T with respect to this order relation. 
In this case the Bravais type of T is referred to 
as the Bravais type of the class of (T, K). This 
lattice T is, intuitively, of the most general type 
appearing in the class. Now define a relation 
T, ~s T; if there are pairs (Ti, Kj), i— 1, 2, such 
that (T;, K,) - (T, K;) and T; is minimal in the 
class of (T;, K;). The equivalence class of the 
equivalence relation generated by the relation 
~, is called a crystal family. 

Summarizing, we have defined the following. 
Let F, 4, 9, 2, € 7 be the set of equivalence 
classes of space groups, arithmetic crystal 
classes, geometric crystal classes, Bravais 
types, and crystal families, respectively. Then 
we have the relations shown in Fig. 1. (All 
arrows in Fig. 1 are surjective.) @F is nothing 
but the tcoproduct of 4 and B over 4, GF 
c 94[I44. 


S— A — G Hu» G 
|] Se 
B—->CF CSL —>CF 

Fig. 1 Fig. 2 
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Moreover, there are injective mappings 2 c,.c 
and X & 7 such that the composites Z c.» 
B and « & ad are identities. The first is 
the mapping sending a Bravais type T to the 
arithmetic crystal class defined by (T, B(T)). 
The second one sends a class of (T, K) to 
a 'semidirect product of T and K (— 190 
Groups N). A space group belonging to a class 
of the image of .»/ c, is said to be symmorphic 
or symmorphous. À class which belongs to the 
image # of the composite Z c.»/ —4 is called 
a holohedral or holosymmetric class, or a holo- 
hedry. Define a mapping M of 4 to the power 
set P(B) by M — (4 4): (4 >F) !. For two 
elements C, and C, of Y, define C, ~,C, if 
M(C,)2 M(C,). Each equivalence class is 
called a crystal system, and the set of crystal 
systems is denoted by €. Two elements C 
and C' of 4 belong to the same crystal family 
if and only if there exists a sequence of ele- 
ments of Z, C,, C, ..., C, such that C, =C 
and C, — C' and M(C)N M(C;,) z for i= 
1, ..., k— 1. Therefore there is a surjective map- 
ping € >@F (Fig. 2. When n <4, the com- 
posite X c, 4 — € is bijective. It should be 
remarked that when n > 3, there is no map- 
ping 44€ such that ./ 24 5 € and A —> 
BCF coincide. 

The numbers of elements of these sets are 
shown in Table 1. 


Table 1 

n = | 2 3 4 
EF , crystal 
families 1 4 6 23 
5^? crystal . 
systems l 4 7 33 
2, Bravais types 1 5 14 64 
G, point groups 
(isomorphic 
classes)" 2(2) 10(9) 32(18) 227 (118) 
W, arithmetic 
crystal 
classes 2 13 73 710 
J^ space groups 2 17 219* 47833 


"When n x4, the number of elements of J£ coincides 
with that of @F. 
* The number in parentheses denotes the number of equiva- 
lence classes under algebraic isomorphism. 
*4The number of equivalence classes under orientation- 
preserving transformations is 230 and 4895, respectively. 


C. Construction of Space Groups 


Take an element (T, K) of an arithmetic crystal 
class. Since K is finite and T is finitely gen- 
erated, the cohomology groups H'(K, T) are 
finite groups (— 200 Homological Algebra G). 
Let « be a *second cocycle representing an 
element [x] of H?(K, T). Let (T, K), be the 
extension of K with kernel T correspond- 

ing to [x] (— 190 Groups N). Then the set 

of elements of (T, K), is given by (t, k5| 
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te T, ke K}, and the product is defined by 
GIE k2) = Cty thytz+a(ky, k2), kı k2). 
Since H'(K, V)=0, iz 1, there is a first co- 
chain fe C'(K, V) such that a(k,, k,) 2 B(k,)-- 
kı B(k,) — B(k, k2), and the tfirst cocycle f on 
V/T defined by f induces an isomorphism 
HI. HIT H*(K, T), [B] [x]. Define an 
action of (t, k» on V by 


(t, k»(x) kx t--B(k, xev. 


Then (T, K), acts on V as a space group. Con- 
cerning their equivalence, this result holds: 
"The equivalence classes of (T, K),, [x] e 
H?(K, T) are in one-to-one correspondence 
with the orbits of N(K), the normalizer of K 
in Aut(T) acting on H?(K, T)" [4]. The action 
of he N(K) is given by "a(k) - ha(h ! kh), ze 
C*(K, T). 

For a pair (T, K) the weight lattice or the 
weight group 7* of (T, K) is defined by (ve V | 
v—kve T, Vke Kj. Let R be an irreducible re- 
duced *root system, Q(R) its lattice, and W(R) 
its *Weyl group. Then for a pair (Q(R), W(R)), 
Q(R)* is nothing but the usual weight lattice 
P(R). (— 248 Lie Algebras). Suppose that K of 
(T, K) does not contain the central inversion 
—I,:ve —v, ve V. Denote the group KU 
(—L)K by +K. Then there is an isomorphism 


H'(XK,V/T)S[H (K, V/T)], 
x (T*/T)/2(T*/T). 


Here [-], denotes the subgroup of elements 
of order not greater than 2 [5]. This isomor- 
phism provides powerful means for the con- 
struction of space groups. 

In constructing a pair (T, K), it is useful to 
take a basis of T in a special form. Let (t,,t;, 
lui be a basis of a lattice T, and set a; = 
(t; tj), where (,) is the inner product in V. The 
n x n matrix A —(a;j) is symmetric and tposi- 
tive definite, and the numbers a; are called 
the lattice constants of T. A basis {t,,...,t,} is 
called a reduced basis if the tquadratic form 
(Ax, x)= X agx;x; is reduced (> 348 Qua- 
dratic Forms). A lattice admits at least one re- 
duced basis, and in this case t 2aj aj; & Oe 
i<j [6, Table 5.1; 7]. The totality of an n- 
dimensional reduced basis forms in a nat- 
ural manner a semialgebraic set of dimension 
(n LG 2)(n — 1)/2 in R"*!"7, Its closure, say Q,, 
is compact. On the other hand, the set L, = 
(R, x O(R"))\GL(n, R)/GL(n, Z) is identi- 
fied with the set of equivalence classes of n- 
dimensional lattices under the relation of 
"same shape." There is a natural inclusion 
L,U...UL,<@Q,, and L, is dense in Q,. More- 
over, the inclusion gives a bijective homeo- 
morphism when n «4 [8]. 

A finite subgroup K of the orthogonal 
group O(V) is said to be fully transitive if there 
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is a set S={e,,...,e,} that spans V on which K 
acts ttransitively and if K has no *invariant 
subspace in V. In this case one can choose S as 
either of the following: (i) the primitive hyper- 
cubic type, S= lei, .... en}, (e, e) = 0; (Kro- 
necker delta); (1i) the primitive hyperbolic type, 
S-(fi. dob Ue Dec, Em Lass nl, 

(fo f)» —1/n, i, j=1,...,n+1, ij (especially, 
Y kp [10]. 


D. Color Symmetry Groups, Twinning 


A color symmetry group, or a colored symme- 
try group, associated with a space group G is a 
pair (G, G^) such that G' is a subgroup of G 
with finite index. The index r=[G:G’] is called 
the number of colors. We call (G, G’) a white 
group when r — 1, a black and white group or a 
magnetic group when r — 2, and a polychroma- 
tic group when r 23. G' is also a space group 
with the lattice group T' 2 G'f1 T, where T is 
the lattice group of G. Two color symmetry 
groups (G,, Gi) and (G,, G5) are called equiva- 
lent if there exists an taffine transformation f 
such that G, = fG, f^! and G} — fG, f !. Then 
(G,, Gi) and (G,, G3) are equivalent if and only 
if there is an isomorphism o:G, -G, such that 
6(G1)— G5. A color symmetry group (G, G’) is 
called lattice equivalent if T' = T, K' c K, and is 
called class equivalent if T' c T, K' =K. Take a 
pair (G, G’), and let G” be the tinverse image of 
K' under the canonical tepimorphism Go K. 
Then (G, G”) is lattice equivalent and (G", G’) is 
class equivalent. Fix the color number r. Then 
(i) the equivalence classes of lattice equivalent 
color symmetry groups are in one-to-one 
correspondence with the conjugacy classes of 
subgroups K, of index r, and (ii) the number of 
equivalence classes of class equivalent color 
symmetry groups is finite. When r is prime to 
the order of K, the classes are in one-to-one 
correspondence with K-invariant sublattices 
T, of index r in T. Therefore, when the number 
of colors is given, the set of equivalence classes 
of n-dimensional color symmetry groups is 
finite. 

In case (i), a pair (K, K,) is called a color 
point group, and in case (ii), (T, Tj) is called a 
color lattice. In particular, when r —2, they are 
called a black and white point group and a 
black and white lattice, respectively. Their 
equivalence is defined in a similar way. 

Take a color lattice (T, T,) of color number 
r. Let m be the least natural number such that 
mT c T, then m is a divisor of r. Let m= 
pi Dt, D; p; (ij) be the decomposition 
of m into its prime factors. Then there is a 
sequence of K-invariant sublattices of T, T;, 
i=2,...,k—1, such that (i) T, 7 75... c le 
Jus T; (ij) p Ti, c Tj, i— 1, ..., k; (iii) the 
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submodules T;/p?;T;,, of i=1,...,k, Z" @ 
Z/p;'Z are uniquely determined independently 
of the choice of such a sequence {7;}. Thus 

the existence of color lattices is related to the 
tmodular representation of K (— 362 Repre- 
sentations G). For example, if r is a prime 
number, then the representation of K on T/rT 
~Z" © Z/rZ is the reduction modulo r of 
that of K on T. 

A mathematical treatment of twinning is 
given as follows. Let V be an n-dimensional 
Euclidean space and T an n-dimensional lat- 
tice in V. The volume of T, vol(T), is defined 
by vol(T)=(|det A|)'7, where A = (a;j) is the 
matrix of lattice constants. This definition is 
independent of the choice of a basis of T. If 
T’ is a sublattice of index r in T, vol(T’)=r- 
vol(T) follows. Let K be a finite subgroup 
of O(V), T, and T; K-invariant lattices, and 
U a hyperplane in V. Then the quadruple 
(T,, T;, K, U) is called a twinning structure if 
it satisfies the following: (i) vol(Ti) = vol(T;); 
(ii) the lattice T; is minimal in the arithmetic 
crystal class of (7;, K), i— 1, 2; and (iii) T; Q U = 
T; YU - T,, and T, is an (n — 1)-dimensional 
lattice in U. Two arithmetic crystal classes are 
said to be twinable if there exist representa- 
tives (Tj, K), (T5, K), and a hyperplane U such 
that (7,, T5, K, U) is a twinning structure. Let 
(T,, K) and (T7, K) be two arithmetic crystal 
classes. If there exists a K-invariant hyper- 
surface U such that 7; — T, (1 U 2 T;(YU is 
an (n — 1)-dimensional lattice in U, then two 
classes (T,, K) and (15, K) are twinable. 


E. Three- and Two-Dimensional 
Crystallography 


Three-dimensional crystal classes are listed in 
Appendix B, Table 5.IV. To name these crystal 
classes, both Schoenflies’ notation and the 
international notation are used. 

Fix an torthogonal tright-handed system 
(x, y, z) in V. For each crystal system, define 
three axes as follows. The first axis is the z- 
axis. The second is the line x = y =z for the 
cubic system and the x-axis for the others. The 
third is the y-axis for orthorhombic, trigonal, 
and hexagonal systems, and the line x 2 y, z=0 
for the others. Then a crystal class is expressed 
by its generators with respect to these axes. 
The symbol k (k= 1,2, 3,4, 6) denotes the rota- 
tion by the angle 2x/k around the axis; k is a 
composition of the rotation k and the central 
inversion (v> —v,ve V), and k/m (k =2, 4, 6) 
means the composition of k and the reflection 
about the hyperplane perpendicular to the 
above axis. The symbol m denotes 2. Usually, 
1 is omitted. This is the full international nota- 
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tion and a set of generators can be read from 
it. The short international notation is also used. 

Schoenflies’ notation consists of the letters 
C,, D,, Sn, T, O, and the subscripts h, v, d. The 
letters denote the group and order: C, is the 
*cyclic group of order n; D, is the dihedral 
group of order 2n; T is the *tetrahedral group 
isomorphic to the fourth *alternating group; O 
is the *octahedral group isomorphic to the 
fourth ‘symmetric group; and S,=1, $,—4, S, 
— 3. The subscripts h, v, d mean that the group 
is generated by reflections in the hyperplane 
which is horizontal, vertical, or perpendicular 
to a diagonal, respectively. The orientation is 
appropriately given. For example, in C,,, and 
D,, there is an nth order rotation around the 
z-axis and the horizontal plane is the xy-plane. 

The above notations are also applied to 
noncrystallographic finite subgroups of the 3- 
dimensional orthogonal group [13]. 

The symbol for Bravais type is specified by 
the name of its crystal system and one of the 
letters P, A, B, C, F, I. Six kinds of simple 
Bravais lattices are given as follows. A simple 
lattice is generated by {a,b,c} and satisfies the 
following condition. Denote the lengths of 
vectors by a= [ja|, b= [b], c= ||e|, and let x = 
L (b,c), B= 7 (c, a), y= Z (a, b) be the angles 
between the indicated pairs of vectors. For 
triclinic P, azZbzcza,«zfxyzoxfor mono- 
clinic P, az bzcza,«—y-90 z fi; for or- 
thorhombic P, az bzcza,x—f-—y-905; 
for tetragonal P, a =b zc, x — fj =y =90°; for 
hexagonal P or trigonal P, a =b zc, a=f=90", 
y= 120°; for cubic P, a =b =c, c= B=y=90°. 
Let T be one of the above lattices generated by 
{a,b,c}. Then the lattices generated by (a, b, 
(b+c)/2}, {(c +a)/2, b, cj, {a,(a+b)/2,c} {(a + 
b+ c)/2, b, cj, {(a+b)/2, (b+ ¢)/2, (c +a)/2} are 
called A, B, C, I, F lattices determined by T. 
The original lattice T is called the primitive 
lattice. The trigonal R lattice is generated by 
{a',b’,c’} satisfying a =b' =c, oa ss Bi zi < 
120°, 490°. Set a=a’—b’, b=b’—c’", c=a’ + 
b +c’. Then the hexagonal P lattice gener- 
ated by {a,b,c} is defined to be the primitive 
lattice of the trigonal R lattice. The fourteen 
Bravais lattices are illustrated in Fig. 3. 

An arithmetic crystal class (T, K) is denoted 
by the symbol of the Bravais lattice T and the 
symbol of K with respect to the action of K on 
T. Let the primitive lattice of T be generated 
by {a,b,c}. Put the vector a on the x-axis, and 
b on the xy-plane. The action of K ts then 
determined by the international notation for 
K, except when K = 42m, 32, 3m, 3m, 6m2, 
and T is the primitive or I lattice. In these 
cases two kinds of actions interchanging the 
second and the third symbols are not equiva- 
lent. If such is the case, 1 is inserted into the 
third position for trigonal systems. Let (K) be 
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the symbol determined as above and (T) be 
the symbol of T. Then the action (T, K) is 
expressed by ((T), (K)). For example, when K 
= 32, there are three arithmetic crystal classes 
(P, 321), (P, 312) (R, 32). 





P C 
Triclinic Monoclinic 
c | 
b b IN 
fa (e / 
P C I F 
Orthorhombic 
A AA ` ve 
} 1 
bi KN | 
1 
e NA tt 
Cl 
P I R P 
Tetragonal Trigonal Hexagonal, 


Trigonal 





Fig. 3 
The conventional unit cell for each of the 14 Bravais 
lattices. 


The notation of space groups determined by 
(T, K) is given by writing both the symbol (T), 
and the symbols of actions of elements on V 
which appear in the symbol (K) defined above. 
The symbol (T)(K) denotes the tsemidirect 
product of T and K by the action (T, K), and 
is the symmorphic space group determined 
by (T, K). Other nonsymmorphic groups are 
designated by replacing symbols in (K). If k is 
replaced by k; (j — L, ..., k — 1), then k; is a k- 
fold screw glide with pitch j/k, that is, the com- 
posite of the rotation k and the translation 
jf/k, where f is the vector of minimum length 
of T along the axis of k. If m is replaced by 
a, b, c, n, d, then it is a glide reflection, that 
is, the composite of the reflection m and the 
translation in the reflecting hyperplane with 
the direction of a, b, c, a face diagonal and 
a diamond, respectively. See [6] for the pre- 
cise meaning of the notation discussed in this 
paragraph. 

The notations for 2-dimensional point 
groups and space groups can be adopted from 
the ones for 3-dimensional ones by setting the 
reference plane as the xy-plane and the z-axis 
perpendicular to that plane. In this setting, the 
notation of a plane point group is given by the 
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corresponding point group in 3-space, and 
similarly for plane space groups. There is only 
one glide operation, denoted by g. The motion 
g is the composite of the line reflection about 
an axis and half the translation along the axis 
with the vector of minimum length of T. In 
[13], the precise group-theoretic description of 
space groups can be found. It should be noted 
that the notations for the two groups p3ml 
and p31m are frequently interchanged in the 
literature (including earlier editions of [13]). 


F. History 


The first mathematical study of the structure 
of crystals was done by the mineralogist J. F. 
C. Hessel (1830). He enumerated the finite 
subgroups of the 3-dimensional orthogonal 
group. Afterward, his result was rediscovered 
many times. A. Bravais (1850) showed that in 
3-dimensional Euclidean space there are only 
14 different lattices. All space groups consist- 
ing of orientation-preserving transformations 
were determined by C. Jordan (1868, 1869) and 
L. Sohnke (1879). Finally, almost all the space 
groups were determined independently by E. 
S. von Fedorov (1885, 1889) and A. Schoenflies 
(1887, 1889). Probably as a result of com- 
parison of each other's lists, they established in 
1891 the existence of the 230 space groups. 
Then A. Barlow (1894, 1896) derived these 
space groups by adding reflection operations 
to Sohnke's 65 groups. 

In 1900 in his eighteenth problem D. Hilbert 
raised the question of whether the number 
of equivalence classes of space groups of a 
given dimension is finite. This was answered 
affirmatively by L. Bieberbach (1910 [1]). An 
algorithm for determining space groups was 
given by H. Zassenhaus (1948 [4]). Their 
results, including [2, 3], and work by C. Her- 
mann (1949 [9], 1952 [10]) gave a solid 
foundation for n-dimensional crystallography. 

Then followed a concrete treatment of 4- 
dimensional crystallography. A. C. Hurley 
(1951), using an earlier work of M. E. Goursat 
(1889) on finite subgroups of the 4-dimensional 
special orthogonal group, determined 221 of 
the 4-dimensional point groups. In the 1960s 
A. L. Mackay and G. S. Pawley (1963) and 
others gave 56 of the 4-dimensional Bravais 
types. After E. C. Dade (1965) determined 
9 maximal finite subgroups of GL(4, Z), R. 
Bülow (1967) and H. Brown, J. Neubüser, and 
H. Zassenhaus (1968) independently deter- 
mined 710 arithmetic crystal classes. They 
reconfirmed the result with H. Wondratschek, 
and as a result, 64 Bravais types and 227 
geometric crystal classes were established 
(1971 [15]). They ran Brown's computer 
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program for the determination of space groups 
based on [4] and obtained 4,783 space groups. 
Their work culminated in a book (1978 [16]). 
There have been several attempts to establish 
a unified treatment of n-dimensional crystal- 
lography [5, 8, 14, 17] and crystallography 

in complex Euclidean spaces. 

Two-dimensional space groups were em- 
pirically known in ancient times, as demon- 
strated by artistic decorations. All the 17 
2-dimensional space groups appear in the 
tile patterns of the Alhambra in Granada. A 
discrete subgroup of the plane motion group 
whose translation subgroup is only rank one 
is called a frieze group. 

The notion of black and white groups was 
introduced by H. Heesch (1930), H. J. Woods 
(1935), and A. V. Shubnikov (1951). Only with 
the introduction of the use of neutron diffrac- 
tion techniques did it become apparent that 
these groups could be used in the description 
of magnetically ordered structures [12]. Color 
symmetry groups were defined by B. L. van 
der Waerden and J. J. Bruckhardt (1961 [11]) 
for an arbitrary number of colors. A mathe- 
matical treatment of the twinning structure 
is found in papers by T. Ito (1938) and R. 
Sadanaga (1959). 
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A. Introduction 


In the beginning of his Elements, Euclid gave 
definitions such as: A line is a length having no 
width; an end of a line is a point. However, he 
left notions such as width and length unde- 
fined. Thus his definitions were far from satis- 
factory. Actually, it was only during the latter 
half of the 19th century that efforts were made 
to obtain exact definitions of lines and curves. 
Euclid, among others, distinguished two kinds 
of curves: straight lines and curves. Nowadays, 
however, lines in the sense of Euclid are called 
curves, and a straight line is considered a 
curve. À first effort to give an exact definition 
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of a curve using analytic methods was made 
by C. Jordan in his Cours d'analyse I (1893). 


B. Jordan Arcs and Jordan Curves 


Following Jordan, we define a continuous plane 
curve C to be the image of a fcontinuous 
mapping sending the interval [0, 1] into the 
Euclidean plane E?. Namely, C is the set of 
points (x, y) in E? such that 


x= f(t), 


with continuous functions f, g defined on 
[0,1]. A continuous curve is also called a 
continuous arc. We call ( f(0), g(0)) and ( f(1), 
g(1)) the ends of the arc. Given continuous 
functions f, g defined on (0, 1), the set {(x, y)| 
x= f(t), y 2 g(t, 0<t< 1} is called an open 
arc. More generally, the image of a continuous 
mapping of [0, 1, (0, 1), [0, 1), or (0, 1] is called 
an arc (or curve). Suppose that C is an arc that 
is the image of an interval I and P=(x, y) is.a 
point on C to which there correspond two 
elements t,, t; (t, «t;) of I such that P is the 
image of both t,, t,. In this case, the point P is 
called a multiple point on C. An arc having no 
multiple point is called a simple arc or Jordan 
arc. 

An arc with one and only one multiple point 
P —(f(0), g(0)) 2 (f(1), g(1)) is called a Jordan 
curve or simple closed curve (— Section K). A 
Jordan curve can be regarded as a topological 
image in a plane of a circle. Let C be a curve 
that is the image (J) of an interval. Then C is 
said to be of class C* (analytic) if the mapping 
q is of tclass Ch (tanalytic). In general, if S isa 
topological space, then the image (I) in S of 
an interval is called a curve in S. In particular, 
if S has the structure of a differentiable (ana- 
lytic) manifold, we can define the notion of 
curve of class C* (analytic curve) in S. 


y=g(t), O<t<l, 


C. Ordinary Curves 


A tconnected subset of E? that is the union of 
a finite number of simple arcs meeting at a 
finite number of points is called an ordinary 
curve. An ordinary curve is called a tree if it 
does not contain a subset that is homeomor- 
phic to a Jordan curve. Let p be a point on an 
ordinary curve C. The tboundary of a suffi- 
ciently small tneighborhood of p meets C 

at a finite number of points, and this number 
is independent of the choice of the small 
neighborhood. We call it the order of p in C. 
A point of order 1 is called an endpoint of C, 
a point of order 2 an ordinary point, and a 
point of order >3 a branch point. If we can 
represent an ordinary curve C as a continuous 
curve tracing each simple arc of C just once, 
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we say that C is unicursal (Fig. 1). A necessary 
and sufficient condition for C to be unicursal is 
that the number of points of odd orders in C 
be less than or equal to 2 (L. Euler). 


Fig. 1 


D. Further Consideration of Definitions 


Although the set of ordinary curves as defined 
above contains most familiar curves, 1t does 
not contain a point set defined by y=sin 1/x in 
O0<x<land —1<x<0O, and by —1<y<1at 
x=0 (Fig. 2). This point set is called a sinusoid, 
and it is desirable to obtain a definition of 
curves wide enough to contain a sinusoid. On 
the other hand, the notion of continuous 
curves is, in a sense, too wide, because a curve 
such as a Peano curve (— Section J), which 
covers a whose square, is among such curves. 
The notion of simple arcs is too narrow, 
because even a circle is not a simple arc. As a 
point set in E?, a continuous arc is character- 
ized as a flocally connected tcontinuum and 

is sometimes called a Peano continuum (H. 
Hahn, S. Mazurkiewicz). On the other hand, 
A. Schoenflies, inspired by the statement of the 
Jordan curve theorem (— Section K), consid- 
ered a closed set that divides the plane into 
two parts, forming the common boundary of 
both domains, and called it a closed curve. 
According to this definition, however, a simple 
curve is not a closed curve. Thus as a general 
definition of curves it is not appropriate. 





Fig. 2 


To give a general notion of curves on a 
plane (containing sinusoids), we may define a 
curve as a continuum that is tnowhere dense 
in E? Ge, a continuum that is a boundary of 
open sets on the plane) (G. Cantor). Further- 
more, to deal with the curves on a topological 
space, P. S. Uryson and K. Menger defined a 
general curve to be a 1-dimensional continuum 
(Menger, 1921—1922 [1]). In E?, the latter 
notion coincides with the notion of curves 
defined by Cantor, while in E? a general curve 
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is a continuum that does not divide any 
'domain (— 79 Connectedness). 


E. Universal Curve 


Consider a 3-dimensional cube 1? (I = [0, 1]). 
Draw two planes parallel to each face so that 
the two planes parallel to a face trisect the 
edges of the cube meeting the planes. Thus I? 
is divided into 3? cubes. Let M, be the closure 
of the subset of I? that is obtained from 1? by 
deleting the cube I? (I, =[1/3, 2/3]) and the 6 
cubes having common faces with I>. Then M, 
consists of 20 cubes (Fig. 3). We apply to each 
of the 20 cubes forming M, the same operation 
that we applied to I? and denote by M, the 
union of point sets thus obtained (consisting of 
20? cubes, the length of whose edges equals 
1/3?). Repeating this process, we obtain a 
point set M, consisting of 20" cubes, the length 
of whose edges equals 1/3". Thus we obtain the 
sequence M, > M,>M,>.... The set U 

— (5, M, is a general curve in the sense of 
Uryson and Menger. Moreover, we can prove 
that an arbitrary general curve is homeomor- 
phic to a subset of U. Hence we call U the 
universal curve. 





Fig. 3 


F. Length of a Curve 


In this section, by a curve we mean a continu- 
ous curve in a Euclidean space E". Let C be 

a curve in E" defined by x;= f,(t) (i2 1,2, ...,n; 
axtxb;the f; are real-valued continuous 
functions defined in [a, b]). (We sometimes 
write this simply as X —f(t), where X = 

(x4, ..., x,).) We divide [a,b] arbitrarily and 
denote the dividing points by az to «t, «t, < 
Xt, =b. Let X,=f(t,), k=0,...,r, and let 
X,.., X, be the length of the straight line seg- 
ment joining X,. and X,. If the length l 

= Mi Xz- X, of the broken line (X, X, ... X,) 
(Fig. 4) is bounded for any subdivision of 
[a,b], C is called a rectifiable curve, and the 
upper limit of I with respect to the subdivisions 
is called the length of C. For C to be rectifi- 
able, it is necessary and sufficient that the 

fi (i2 1,2, ..., n) be of thounded variation 
(Jordan). Thus if C is a rectifiable curve, then 
each f;(t) (i— 1,2, ..., n) is almost everywhere 
differentiable (H. Lebesgue). In particular, if C 
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is of class Cl then C is rectifiable, and its 
length can be represented by 


LEO 


(— 246 Length and Area). 





Fig. 4 


G. Shapes of Curves 


In this section by a curve we mean the image 
(J) in a Euclidean space E" of an interval I 
(bounded or unbounded), where ¢ is a continu- 
ous mapping. When a curve C of class C* is 
given in a Euclidean space E", it often becomes 
necessary to examine the shape of C globally. 
The determination of the global shape of C 
from the equation of the curve is called the 
curve tracing of C. The problem has been 
thoroughly studied in the particular case in 
which n=2 and the equation of C is given by 
f(x, y) - 0 in a rectangular coordinate system 
(by F(r, 0) 20 in a fpolar coordinate system), 
where f (or F) is an analytic function. The 
problems in the case of a rectangular coordi- 
nate system are as follows: 

Let ọ be a single-valued analytic function 
and I a (bounded or unbounded) interval on 
the x-axis. If a curve C, represented by y= 
p(x) (x€1) is a subset of C, then C, is called 
a branch of C. According as I is bounded or 
unbounded, C, is said to be a finite branch or 
infinite branch of C. When a curve represented 
by x2 y(y) (yeJ) (V is a single-valued analytic 
function and J is an interval on the y-axis) is a 
subset of C, it 1s also called a branch of C. If it 
is necessary to distinguish these two branches, 
we call the former the x-branch and the latter 
the y-branch. C consists of an at most de- 
numerable set of branches. If P(x,, yg)e C and 
f, 0f/0y #0 at P, then there exists an x- 
branch containing P; if f, = 0f/Ox #0, there 
exists a y-branch of C. If 0f/0x =0, of/éy=0 at 
P, then P is called a singular point of C. Points 
on C that are not singular points are called 
ordinary points of C. 

When P is an ordinary point of C, a branch 
C, of C containing P is determined, and the 
tangent line and normal line to C at P are 
the same as those to C, and are uniquely 
determined. The equations of these are (x — 


Xo) fi (Xo, yo) - (y — yo), (Xo; yo) =O and (x — 
Xo) f, (Xo, yo) — (Y — YoMs (Xo; yo) 7 9, respec- 
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tively. If we choose a coordinate system with P 
as origin and the tangent line and normal line 
as €-axis and y-axis, respectively, then the 
equation of C with respect to this coordinate 
system is of the form n =c, 4? - c4£? +... in the 
neighborhood of P. If we denote by p the 
'curvature of C at P, then p=2c,= —(fi fy — 
2f f f, S DU +S. When e = p= 

0, P is a stationary point. A stationary point 
of a curve of class C? on C is also called a 
point of inflection. When P is not a point of 
inflection and (£, y) are points on C in a neigh- 
borhood V of P, the sign of 5 is definite if V is 
small enough (Fig. 5). However, if P is a point 
of inflection and c4 #0, then C is of the shape 
shown in Fig. 6. At a point of inflection, if c, = 
7 0,4 =0, c, X 0, and v is even, C is of the 
shape shown in Fig. 5, and if v is odd, C 1s of 
the shape shown in Fig. 6. 





Fig. 5 Fig. 6 


In a neighborhood of a singular point, C 
takes various shapes. For example, consider a 
curve represented by y? = x?(x +a), and let P 
be the origin (0,0). If a>0, then there are two 
branches of C passing through P, and they 
have different tangents at P (Fig. 7). As in this 
case, if there are a finite number of different 
branches passing through P with different 
tangents, P is called a node of C. If a «0, then 
PéC, but there is no other point of C in the 
neighborhood of P (Fig. 8). Such a point is 
called an isolated point of C. If a —0, then there 
are two branches of C starting from P, and the 
tangents to these at P are the same (Fig. 9). 
Such a point is called a cusp of C. When C is 


Fig. 7 


Fig. 8 


Fig. 9 
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an 'algebraic curve, we can examine the shape 
of a curve in a neighborhood of a singular 
point using the *Puiseux series. 

When C has an infinite branch C, (for 
example, when Cy: y= o(x) (xe I) is an x- 
branch of C and I — [a, 00)), if the tangent to 
C, at P(xo, Yo) (xg € 1) has a limiting line for 
X oo, then the limiting line / is called an 
asymptote of C,. In this case, the distance from 
a point P(Xo, yo) of Co to I converges to zero 
when xo oo. An asymptote of an infinite 
branch of C is also called an asymptote of C. 


H. Special Plane Curves 


The following are the well-known curves (for 
ellipses, parabolas, and hyperbolas — 78 
Conic Sections). 

Among curves of the third order, those 
having an equation of the form 


y? = f(x)/(x—a) (1) 


(f(x) is a rational expression of at most the 
third order in x) are symmetric with respect to 
the x-axis and have x =a as an asymptote. In 
particular, if a>0, f(x)= — x? in (1), then the 
curve is as shown in Fig. 10 and the origin is a 
cusp. Let a half-line starting from the origin 
meet the curve, the circle with diameter [0, a], 
and the straight line x =a at points X, Y, and 
A, respectively. Then we have OX — YA. This 
curve is called a cissoid of Diocles. 

If a=0, f(x) 2 c?(c — x) (c > 0) in (1), then the 
curve takes the shape shown in Fig. 11. Let A, 
C be the points whose coordinates are (a, 0), 
(c, 0) (0 — a « c), respectively, and let X, Y be 
the points in the first quadrant at which the 
straight line parallel to the y-axis and passing 
through A meets the curve and the circle with 
diameter OC, respectively (Fig. 11). Then we 
have AX: AY - OC:0OA. This curve is called a 
witch of Agnesi. 





Fig. 10 


Fig. 11 


If a «0, f(x) — x?(x/3 +a) in (1), then the 
curve takes the shape shown in Fig. 12a. If we 
rotate it by 1/4 and put it in the position 
shown in Fig. 12b, then the equation of the 
curve takes the form x? + y? 2 3exy (c= — Ee 
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a). If we take as parameter t = y/x, then we get 
the parametric representation x = 3ct/(1 + t°), 

y 23ct?/(1 LL This curve is called a folium 
cartesii (or folium of Descartes). A curve that 
has a parametric representation of the form 
x— q(t), y ^ v(t), where o. v are rational func- 
tions, is called an (algebraic) unicursal curve (or 
rational curve). Such a curve is an algebraic 
curve of tgenus 0. 





Fig. 12 


Let r= f, (0), r= f,(0), ..., r = f. (0) be equa- 
tions of curves C,, C,, ..., C, with respect to a 
polar coordinate system with origin O. A curve 
C having equation r — 4, f (0) 4- ... -- A, f, (0) 
(the A; are constants, usually +1 or —1) in the 
same coordinate system 1s called a cissoidal 
curve with respect to O. (Fig. 13: r= — fi (0) 
f;(0)). In Fig. 10, let C, be the circumference 
of the circle with the diameter TO. a], let C, be 
the straight line x =a, and put 4, = —1, 4, — f. 
Then we have a cissoid of Diocles. We can 
regard the folium cartesii as a cissoidal curve 
obtained from a straight line and an ellipse. 
When C, is a circle with center at O, we call C 
a conchoidal curve of C, with respect to O. In 
particular, when C, is a straight line and O is 
not on C,, the conchoidal curve is called a 
conchoid of Nicomedes. 





Fig. 13 


As shown in Fig. 14, when C, is perpendic- 
ular to the initial line of the polar coordinate 
system, the equation of the conchoid is r= 
asec +b (b is the radius of C,), and the Car- 
tesian equation of the curve is (x — a (x? + 
y?) « b?x?. According as a b, a=b, or a <b, 
the curve has a node, cusp, or isolated point, 
respectively. When C, is a circle and O is on 
C,, the conchoidal curve of C, with respect to 
O is called a limaçon (or limaçon of Pascal) 
(Fig. 15). The equation of a limagon C with 
respect to a polar coordinate system having 





93H 
Curves 


the diameter of a circle passing through O as 
its initial line is r=acos0+b, while the equa- 
tion of C with respect to a Cartesian coordi- 
nate system is (x? + y? — ax)? =b? (x? + y?). In 
this case, if a >b, O is a node of the curve; if 
a=b, O is a cusp. When a=b, the curve is 
called a cardioid (the curve shown by a dashed 
line in Fig. 15; see also Fig. 26 below). 





Fig. 14 


Fig. 15 


The locus of a point X having a constant 
product of its distances from two fixed points 
A, B is called Cassini's oval (Fig. 16). The 
equation of this curve with respect to the 
Cartesian coordinate system whose origin O is 
the midpoint of the segment AB and whose 
x-axis is the straight line AB is (x? + y? — 
2a? (x? — y?) « k* — a* (where AB=2a, k? = 
AX: BX). In particular, if a? =k’, then O is a 
nodal point of the curve. In this case, the curve 
(shown by the dashed line in Fig. 16) is called a 
lemniscate (or Bernoulli's lemniscate) (Jakob 
Bernoulli). 





Fig. 16 


The locus of the foot of the perpendicular 
drawn from a fixed point O to the tangent of a 
fixed curve C at each point of the curve is 
called the pedal curve of C with respect to O. 
The pedal curve of a rectangular hyperbola 
with respect to its center is a lemniscate (Fig. 
17), and the pedal curve of a circle with respect 
to a point is a limagon. 
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Fig. 17 


When a curve C' rolls on a fixed curve C 
without slipping and is always tangent to C, 
the locus I of a point X kept fixed with re- 
spect to the curve C' is called a roulette whose 
base is C, rolling curve is C', and pole is X. In 
particular, when C is a straight line, C’ is a 
circle, and X is on C', TF is called a cycloid (Fig. 
18). When X is not on C’, F is called a trochoid 
(Fig. 19). A trochoid is represented parametri- 
cally by the equations x 2 a0 — bsinO, y=a— 
bcos @, where the parameter 0 is the angle of 
rotation of C’. When a=b, the equation repre- 
sents a cycloid. The tevolute and finvolute of a 
cycloid are also cycloids (Fig. 20). 





Fig. 20 


Suppose that we are in a gravitational field 
with a given path represented as a cycloid, as 
is shown in Fig. 21. Assuming that there is no 
friction, the time necessary for a particle to 
slide down the path from a point X on the 
curve to the lowest point C of the curve is 
independent of the initial position X (C. Huy- 
gens). Because of this property, the cycloid is 
also called a tautochrone. Suppose that a par- 
ticle starts from a point A in the space and 
slides down to a lower point B along a curve I 
(without friction) under the effect of a gravita- 
tional force. To minimize the elapsed time, 
we simply take I' as a cycloid that lies in the 
vertical plane containing AB and has a hori- 
zontal line through A as the base (Fig. 22). 
Because of this property, the cycloid is called 
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also a brachistochrone, i.e., the line of swiftest 
descent (Johann Bernoulli and others). 


B 
Fig. 22 


When the base curve C and the rolling curve 
C’ are both circles and X is on C’, we call T an 
epicycloid if C, C' are externally tangent (Fig. 
23), and a hypocycloid if C, C' are internally 
tangent (Fig. 24). When X is not on C’, cor- 
responding to these two cases, we have an 
epitrochoid and a hypotrochoid, respectively. 
Let a, b be radii of C, C’, respectively, c the 
distance from the center of C' to X, and 0 
the angle of rotation of C'. Then the para- 
metric equations of these curves are x =(a+ 
b)cos0 F ccos((a X b)/b)0, y 2 (a X: b)sin0— 
csin((a + b)/b)0. (Take the upper signs when 
the curve is an epicycloid and the lower signs 
when the curve is a hypotrochoid. When b=c 
the equations are equations of epicycloids and 
hypocycloids.) When the ratio a:b is a rational 
number p/q (p, q are mutually prime), then C' 
returns to its initial position after rotating q 
times around C; in this case each I becomes 
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an algebraic curve. In particular, when a= 
4b — 4c, the hypocycloid is called an astroid 
(Fig. 25). Its equation (with respect to a Car- 
tesian coordinate system) is x2? + y?? =a”. 
The envelope of line segments of length a 
whose endpoints are on the x-axis and y-axis, 
respectively, is an astroid (Fig. 25). When a— 
b —c an epicycloid is a cardioid (Fig. 26). 





Fig. 26 


When the base C is a straight line, the roll- 
ing curve C' is an ellipse or a hyperbola, and 
the pole is a focus of C', then the roulette is 
called a Delaunay curve (Fig. 27). 


C! 


WKS 


Fig. 27 


When C is a straight line, C’ is a parabola, 
and X is the focus of C’, the roulette is called a 
catenary (Fig. 28). When we hold two ends of a 
string of homogeneous density in the gravita- 
tional field, the string takes the form of this 
curve. The equation of the catenary with re- 
spect to a Cartesian coordinate system is y= 
acosh x/a — a(e""^ -- e^ */^)/2. The involute 
starting at the point A(0, a) of this curve is 
called a tractrix (Fig. 29). Let Q be the point of 
intersection of the tangent at P to the tractrix 
and the x-axis; then the length of PQ is con- 
stant and is equal to a. Consequently, when we 
drag a weight at A by a string of length OA 
along the x-axis, the curve described by the 
weight is the tractrix. The parametric equa- 
tions of the tractrix are x =a(logtant/2 + 
cost), y=asint. 





Fig. 28 Fig. 29 


Suppose that a point Q moves with constant 
velocity on the x-axis and another point P also 
moves with constant velocity always toward Q. 
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The locus of the point P is called a curve of 
pursuit (Fig. 30). When the velocity of Q is « 
times that of P, the equation of the curve of 
pursuit is 2(x — a) — y! */c(1— a) — cy! **/(1-- o) 
if à x 1 and 2(x — a) - (1/c)log y ^ cy?/2 if « — 1. 
We can consider similar problems when Q 
moves on a general curve instead of on the x- 
axis. 





Fig. 30 


Many plane curves that are called spirals 
can be expressed by r= f(0) (f monotonic) in 
polar coordinates (r,0). An Archimedes spiral 
is a curve having the equation r — a6 (Fig. 31). 
Archimedes found that the area bounded by 
two straight lines 0 —0,, 0 —0, (0, <0,) and the 
curve is a? (03 — 63)/6. A logarithmic spiral 
(equiangular spiral or Bernoulli spiral) is a 
curve having the equation r — ke?? (Fig. 32). 
The angle between the straight line 0 —con- 
stant and the tangent to the curve is constant. 
Johann Bernoulli found that the involute and 
evolute of this curve are congruent to the orig- 
inal curve. A curve having the equation r — 

a/0 is called a hyperbolic spiral (or reciprocal 
spiral), and the one having the equation r?0 =a 
is called a lituus. These two spirals are shown 
in Figs. 33 and 34, respectively. Let p = (s) be 
the tnatural equation of a curve. Properties of 
the curves for which the functions ois) are 
simple have been investigated. Specifically, a 
curve having the natural equation p — ks (k is a 
constant) is a logarithmic spiral. A curve hav- 
ing the equation p —a?/sis called a Cornu 
spiral (or clothoid; — 167 Functions of Con- 


Pd 


Fig. 31 Fig. 32 
Fig. 33 Fig. 34 
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fluent Type). Its parametric representation is 
t zt? 
x -a/s | Cos — dt, 
0 2 
t ! at? 
y= a /n | gn dë 
0 


(tFresnel integral). M. A. Cornu used this 
curve in the representation of diffraction in 
physical optics. 

There are also curves that appear as graphs 
of *elementary functions. For example, a curve 
having the equation y —sin x is called the sine 
curve, and the graphs of equations y — e* and y 
—]ogx are called the exponential curve and the 
logarithmic curve, respectively, although they 
are congruent. In contrast to algebraic curves, 
these analytic curves that are not algebraic are 
called transcendental curves. (Regarding the 
differential geometric properties of plane and 
spaces curves — 111 Differential Geometry 
of Curves and Surfaces; for plane algebraic 
curves — 9 Algebraic Curves.) 


I. Envelopes 


Let f(s, t) be a function of class C! of real vari- 
ables s, t. If we fix t— t9, then r= f(s, to) is the 
equation of a curve C,, with a parameter s. If 
s(t) is a function of t, then f(s(to), to) represents 
a point P, on C, Let E be the locus of P. 
when t, moves. If s(t) is a function of class C}, 
then E is a curve of class C. If E and C, are 
always tangent at each point P, , we call E the 
envelope of the family of curves C,. When i(s, t) 
is given, to find E we need only determine the 
function s(t). We note that 0f/és=Aéf/ét is a 
condition that must be satisfied by the func- 
tion s(t). When n=2 and the equation of C, is 
given in the form f(x, y, t9) 2 0, the point of 
intersection of C, and f(x, y, to) =0 (f, — 0f/0t) 
is P The equation R(x, y) -0 obtained by 
eliminating t from f(x, y, t) 20 and f(x, y, t) Z0 
is called the discriminant of f(x, y, t) - 0. The 
set of points (x, y) satisfying the discriminant 
R=0 is the union of E and the locus of the 
singular points of C, . 


J. Peano Curve 


A tcontinuous curve in the Euclidean plane E? 
De, the image f(I) of a segment I — [0,1] 
under a continuous mapping f:I— E?) may 
cover a square. We call such a curve a Peano 
curve after G. Peano, who gave the first exam- 
ple. D. Hilbert simplified the example and 
constructed a Peano curve as follows (Math. 
Ann., 36 (1890), 38 (1891)). 

Divide a square and a segment into four 
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equal parts (Fig. 35) and let each square D, 
correspond to the segment T; (i —0, 1, 2, 3). 
Then divide each D, into four equal parts (Fig. 
36) and let Dj; correspond to T; (j=0, 1,2, 3), 
and continue this process (Fig. 37). A sequence 
of squares D; 2 D; >D; 2 ... has the unique 
common point Du, , and we let this point 
correspond to the unique common point f; 
of the sequence of segments T;2 Tj > Tijk 
The correspondence tijk... Piję... is a continu- 
ous mapping of the segment [0, 1] onto the 
square D, and this continuous curve has 
double points, triple points, and quadruple 
points. The set of multiple points has the car- 
dinal number of the continuum and is a 
tdense set in the square. We can improve this 
method so that there are no multiple points 
other than double and triple points, but it is 
impossible to eliminate triple points 
altogether. 


Darrie 








Fig. 36 
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Fig. 37 





















We can also construct Peano curves as 
shown in Figs. 38, 39, and 40. That is, we 
“bisect” a triangle and a segment [0, 1] suc- 
cessively and build a correspondence as fol- 
lows: Let a point represented by a binary 
number t=0.ijk ... (ij, k, ... — 0, 1) correspond 
to the unique common point of the sequence 
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Ai;2 Aj2 Aj, 2 ... and obtain a continuous 
mapping from the segment [0, 1] onto the 


triangle. 


Fig. 40 


K. The Jordan Curve Theorem 


The Jordan curve theorem states: A Jordan 
curve J in the plane R? separates R? into inner 
and outer fregions (C. Jordan, Cours d'analyse, 
2nd ed., 1893). More precisely, R? — J is the 
disjoint union G, UG, of two regions G, and 
G, whose common boundary is J. Let p bea 
point of J. Then there is a Jordan arc with p as 
an endpoint such that all points of the Jordan 
arc are contained in G; (i= 1 or 2) except for p 
(A. Schónflies), that is, J is accessible from G;. 
Conversely, let J be a compact subset of R? 
such that R? —J =G, UG, and G,NG,=@, 
where the G; are regions such that J is acces- 
sible from both G, and G,. Then J is a Jordan 
curve (Schónflies, 1908). A homeomorphism 
between a Jordan curve J and the circle C 
extends to a homeomorphism (more precisely 
to a tconformal mapping) between a plane 
containing J and a plane containing the circle 
C (— 65 Combinatorial Manifolds G; 77 Con- 
formal Mappings). 
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94 (X.11) 
Curvilinear Integrals and 
Surface Integrals 


A. General Remarks 


The integral of a function (or more precisely, a 
tdifferential form) along a tcurve (surface) is 
called a curvilinear integral (surface integral). 
Because a curvilinear integral is a special case 
of the Stieltjes integral, we shall first explain 
this notion, formulated by T. J. Stieltjes (1894) 
as a generalization of the *Riemann integral. 
The notion was introduced in connection with 
Stieltjes's study of tcontinued fractions, and 
led to the idea of integrals with respect to 
general measures. 


B. The Riemann-Stieltjes Integral 


Suppose that f(x), x(x) are real-valued 
bounded functions defined on [a,b]. Take 

a partition of the interval a 2 xy < x, < x; 
«xx, ,€«x,—7b(— 216 Integral Calculus) 
and consider the Riemann sum with respect 
to a(x): 


X, f.) 7269). apnea 


Suppose that the Riemann sum tends to a 
fixed number as max(x;,, — x;) tends to zero. 
Then the limit is called the Riemann-Stieltjes 
integral (or simply Stieltjes integral) of f(x) 
with respect to a(x) and is denoted by 
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f? f(x)da(x). The Riemann integral of f(x) is a 
special case, where a(x) — x. 

The Riemann-Stieltjes integral has the ele- 
mentary properties, such as linearity, of the 
usual Riemann integral. We also have the 
following theorem: The integral f f(x)da(x) 
exists for every continuous function f(x) if and 
only if a(x) is of "bounded variation. Hence 
when we consider the Stieltjes integral of f(x) 
with respect to a(x), we usually assume that 
f(x) is continuous and «(x) is of bounded 
variation. However, the Stieltjes integral can 
be defined if f(x) is of bounded variation (not 
necessarily continuous) and a(x) is continu- 
ous (not necessarily of bounded variation). If 
a sequence f,(n) (n— 1,2, ...) of uniformly 
bounded continuous functions defined on the 
interval [a, b] converges to a continuous func- 
tion f(x) on the interval [a,b], we have 


b b 
im | In(x) d(x) = | I (x)da(x), 


where «(x) 1s a function of bounded variation. 
Furthermore, if a(x) and «,(x) (n— 1,2, ...) 

are functions of bounded variation whose 
Total variations are uniformly bounded and 
bm, o, (x) 2 a(x) at every point of continuity 
of a(x), then we have 


b b 
im | y oM | f(x) da(x) 


for every continuous function f(x) on [a,b] 
(Helly's theorem). 

Let a(x) be a strictly monotone increasing 
continuous function, and let B(y) be its inverse 
function. Then we have 


b a(b) 
| f(x)da(x)- |  f(BGy) dy, (1) 
a ala) 
where the right-hand side is the usual Rie- 
mann integral. A function «(x) of bounded 
variation is represented as the difference of two 
strictly monotone increasing functions a, (x) 
and a(x). If we denote by f(y) the inverse 
function of «;(x) (i2 1,2), we have 


b HA 
[oww] JI dy 


a a (a) 


aj(b) 


= F(B2(y)) dy. (2) 


azla) 


It «'(x) exists and is continuous, we have 


| f oer fGx)a'(x) dx. (3) 


C. The Lebesgue-Stieltjes Integral 


Suppose that a(x) is a monotone increasing 
and right continuous function and I =(x,, x3]. 
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We define an finterval function U(I)=«(x,)— 
a(x,). It is nonnegative and countably addi- 
tive. Hence by utilizing U(I) we can con- 
struct the outer measure and also a completely 
additive measure (— 270 Measure Theory; 380 
Set Functions). The Lebesgue integral with 
respect to this measure is called the Lebesgue- 
Stieltjes integral (or Lebesgue-Radon integral) 
and is denoted by |? f(x) da(x). If a(x) is a 
strictly monotone increasing continuous func- 
tion and f(y) its inverse function, then formula 
(1) is true if the left-hand side is a Lebesgue- 
Stieltjes integral and the right-hand side is a 
Lebesgue integral. If «(x) is a function of 
bounded variation, decomposing «(x) into the 
difference of two strictly monotone increas- 
ing functions, we also have formula (2). If 
a(x) is tabsolutely continuous, formula (3) is 
valid, where the right-hand side is a Lebesgue 
integral. 

The Stieltjes integral has the following two 
properties. 

Integration by parts: In the interval [a,b], 
we have 


b b 
[cave] VdU =U(b)V(b)— U (a)V(a) 


a a 


if one of U(x), V(x) is continuous and the other 
is of bounded variation. 

Second mean value theorem: If U(x) is mono- 
tone increasing and V(x) is continuous, then 
there exists a č in [a,b] such that 


b 
[vav 


=U (a)(V(€)— V(a)) + U (b)(V(b)— V(&)). 


D. The Curvilinear Integral 


A continuous mapping from an interval a € 
t<b in R! into R":o(t) - (o,(t), ..., p(t) is 

an oriented curve. Suppose that a function 
f(x. ..., x,) is defined in a neighborhood U of 
the image C of the mapping ¢(t) or merely on 
C. The Stieltjes integral 


b 
| L(A (t), $9 SIDERO) i= 1, es, (4) 


is called the curvilinear integral of the function 
f(x,,.-.,X,) along the curve C with respect to 
x, and is denoted by fc fdx;. The curve C is 
called the contour (or path) of the integration, 
o(a) is called the initial point (or lower end), 
and o(b) is called the terminal point (or upper 
end) of the integration. Let C be a rectifiable 
curve, and denote by s(t) the arc length of C 
from the initial point to the point g(t). Then 
the Stieltjes integral 


b 
| f(o(t)) ds(t) 


a 
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or simply fe fds is called the curvilinear inte- 
gral with respect to the line element. Here, 
the line element, denoted by ds, means 

(o, (t)? +... + (o (2)? dt when q(t) is of class 
Cl If the integrand in (4) is of bounded vari- 
ation as a function of t, the curvilinear integral 
is well defined. If C is a trectifiable curve, the 
curvilinear integral is defined for an arbitrary 
continuous function. In the usual case, we are 
concerned mainly with this sort of situation. 
For a differential form œ f, dx,  ...-- f, dx, 
defined on U, the curvilinear integral fc% is 
defined by X7, fc fidx;. 

The curvilinear integral is linear with re- 
spect to its integrand. If the terminal point of 
C, is the initial point of C,, we can construct 
the joint curve C=C, + C;, and we have 
additivity for the contours 


[s=] saxa | fdx; 
C C, C; 


(a similar formula holds if we replace dx; by 
ds). Monotonicity, which asserts that [c fdx;< 
[cg dx, whenever f <g, holds if ¢,(t) is mono- 
tone increasing, and monotonicity also holds 
for the curvilinear integral with respect to the 
line element. 

If n 22, R? can be identified with the tcom- 
plex plane C— (z 2 x t iy), and we define 
fc.f(2)dz by 


d EE stp 
C C 
ll uge «dy 
C C 


where f(z) 2 u(z) 4- iv(z). The integral is then 
said to be an integral in the complex domain. 
(For the application of integrals in the com- 
plex domain to complex analysis — 198 
Holomorphic Functions.) 


E. The Surface Integral 


By an m-dimensional smooth surface § we 
mean the image S of a fregular mapping of 
class C! from a domain G in R” into R" (m< 
n), E(u) 2 (6 (Uy, Hal eus oue). 
Given a continuous function f(x,, ..., Xp) 
defined in a neighborhood U of S in R’, the 
multiple integral 


^f f (5 (u), ..., Gell 
G 


D(x; > , D 
Duis cie) du, ...du,, 
lis csi {l.n}, (5) 


is called the surface integral of f along S 
with respect to x; ,...,x;, and is denoted by 
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[s fdxi, dx; or f... fs fdxi, ... de, This 
definition depends on the choice of the para- 
meters (t4, ..., Hal in the following way. Let 
(ik... Hal be another parametrization of S. 
Since 


E DU sd 


D(u,,...,u,) | D(u,, ...,u,) D(u,,...,u,) 
and 
Di... Uy 
DEE Rin tae, 
D(u,, ...,u,) 








thus according as 


D(u,, ... 


"el - 
Di, Hal 


the foregoing expression remains the same or 
changes its sign. Usually we assign to the 
parameters (u,, ..., Um) positive or negative 
orientation, in which case S is called oriented. 
If we replace the Jacobian D(x, ,..., x )/ 
D(u,, ...,u,,) in (5) by the quantity 


Db ee KEE 
eg E E 
iK. Kim D(u,, ..., Um) 


which corresponds to the surface element of S, 
the integral is called a surface integral with 
respect to the surface element and is denoted 
by f, fdS or fs fdo. The surface integral of a 
differential form of degree m in R" is similarly 
defined. In the case m= 1, the surface integral 
reduces to a curvilinear integral. Just as the 
Stieltjes integral is a generalization of the 
curvilinear integral, there are several ways 

to generalize the notion of surface integral 
without assuming that the mapping ¢(u) is of 
class C!. 


F. The Stokes Formula 


Let S be an m-dimensional smooth surface in 
R" (m € n) and @S be the (m — 1)-dimensional 
surface corresponding to the boundary of S. 
Let w be a differential form of class C! of 
degree (m — 1) and do be its texterior deriva- 
tive. Then we have [5,0 [s do, which is called 
the Stokes formula (or the Green-Stokes for- 
mula). (For the Stokes formula on a general 
differentiable manifold — 105 Differentiable 
Manifolds.) As special cases of the Stokes 
formula, we have the following three classical 
theorems: 

(1) The case of a plane domain: Let D be a 
bounded domain on the xy-plane bounded by 
a finite number of smooth curves C with posi- 
tive directions. If œ = P dx +Q dy is a differen- 
tial form of class C! on D, we have 


| pax+ody=| | (29-5 uas (6) 
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since dw =(6Q/0x — 0P/éy)dx ^ dy. This is 
called Green’s formula (or Green’s formula on 
the plane). Equality (6) is true if P, Q are totally 
differentiable and the integrand on the right- 
hand side is continuous (even if the functions 
0Q/Ox, 0P/éy themselves are not continuous) 
(E. Goursat). This formula remains true under 
the following weaker assumptions: (i) C is 
rectifiable; (ii) P and Q are continuous in D 
and 6P/dy and 0Q/Ox are continuous and 
summable (in the sense of Lebesgue) in D. 

(2) The case of a domain in a 3-dimensional 
space: Let D be a bounded domain in xyz- 
space surrounded by a finite number of 
smooth surfaces S. For a vector field V = 
<P, Q, R» of class C! on D, we put o — Pdy ^ 
dz--Qdz ^ dx - Rdx ^ dy. Then since dw = 
(OP/Ox + 0Q/0y + OR/Oz)dx ^ dy ^ dz, we have 


\| Pdydz+Qdzdx+Rdxdy 
S 


JIL div V dx dy dz 
CP SC 
d xjeee 0 


Equality (7) is called the Gauss formula (Ost- 
rogradskif's formula or the divergence theo- 
rem). The left-hand side of (7) is equal to the 
surface integral f (,(V, gid, which is the tvec- 
tor flux through S (where n means the outer 
unit normal vector of the surface S). This 
formula remains true under the following 
weaker assumptions: (i) S is piecewise of class 
C!; (ii) P, Q, R are continuous in D and OP/óx, 
0Q/Oy, and OR/0z are continuous and sum- 
mable in D. 

(3) The case of a bordered surface in a 3- 
dimensional space: Let $:x 2 x(u, v}, y= y(u, v), 

= x(u, v) ((u, v)e G) be a smooth surface in 

xyz-space, and suppose that the boundary F of 
the domain of the parameters G consists of a 
finite number of smooth curves with positive 
direction. The boundary C of the surface S is 
the image curve of I. Now let V = (P, OR: be 
a vector field of class C! on S, n be the unit 
normal vector of S (its direction being canoni- 
cally assigned by the parameter (u, v)), and t be 
the unit tangent vector, and set w = Pdx+Qdy 
+ Rdz. Then since do — (OR/0y — 0Q/0z)dy ^ dz 
t (0P/0z —OR/Ox)dz ^ dx +(€Q/éx —0P/0y): 
dx ^dy, we have 


HEET 
Eee (2) 


-Í (Pav + Ody + Rds)= | (V,t)ds. (8) 
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Equation (8) is called the Stokes formula (— 
Appendix A, Table 3.IIT). 
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The term cybernetics was invented in 1947 by 
Norbert Wiener [1] to denote a field of science 
that treats the system of control and communi- 
cation in animals and machine. The term was 
derived from a Greek word kvfepvytye, also 
the source of the word “governor.” The stimu- 
lus for establishing such a new area of science 
came from studies of automatic computation, 
automatic control, and information process- 
ing. These fields had given rise to technological 
innovations such as the high-speed electronic 
computer and automatic control instruments. 
Such machines have had a profound influence 
on the information sciences, as well as on 
theoretical investigations in biology. However, 
even now, we can hardly say that cybernetics 
has been formly established as a systematic 
branch of science or of applied mathematics. 
Nevertheless, it has had far-reaching influence 
on both biology and machine engineering as 

a methodology and as a philosophy. In the 
Soviet Union and in European countries, the 
word corresponding to cybernetics is still 
widely used for the *Grenzgebiete" between 
biology and machine engineering in the wider 
sense. Nonetheless, it is difficult to say that 
cybernetics as a whole has undergone system- 
atic development. Systematic theories have 
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been established and developed for separate 
parts of the field, and these have become in- 
dependent disciplines; they are, for example, 
tcontrol theory, tinformation theory, the 
theory of artificial intelligence, the theory of 
tautomata, mathematical biology, genetics, 
and ecology. 

The contributions of Wiener to cybernetics 
include not only the invention of the concept 
itself, but also prediction theory or the theory 
of the Wiener filter [2]. The latter was sub- 
sequently formalized by R. E. Kalman from 
a different point of view as an estimation 
problem within a linear system, working from 
finite observation data, and including the non- 
stationary case (— 405 Stochastic Control and 
Stochastic Filtering G). Another contribution 
by Wiener is the input-output identification of 
a nonlinear system using statistical time series 
analysis of the outputs when the inputs are 
white noises [4]. This is achieved by expan- 
sion of the output function in terms of the 
convolution of input functions, which corre- 
sponds to the expansion of functions in terms 
of tHermite interpolation polynomials. The 
kernel of the expansion is called a Wiener 
kernel. Recently, this method has been applied 
to input-output identification for nervous 
systems; it also finds application in nonlinear 
system theory [5]. Wiener also studied brain 
waves [6,9]. 

Cybernetics in the wider sense, although not 
necessarily called by this name, may include 
investigations of the following “Grenzgebiete”: 
information processing of the nervous sys- 
tem [7]; self-organizing systems in the non- 
equilibrium thermodynamics of Prigogine and 
Haken; the self-reproducing machines of von 
Neumann; and the theory of pattern formation 
in biological or chemical systems [8]. 

For related topics — 176 Gaussian Process 
I, 395 Stationary Processes D. 
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A. General Remarks 


With the development of electronic computers, 
effective systems for data transmission and 
processing have been created on a large scale, 
and there is now a large literature concerning 
data-processing methods and techniques for 
such systems. 

Research on data processing encompasses 
both specific techniques and entire systems of 
processing, for example, the system of pro- 
gramming techniques, as well as mathematical 
problems originating from them, such as the 
complexity of computations (— 71 Complexity 
of Computations). As domains of application, 
we have, for example, information retrieval, 
stock management, program evaluation, and 
review techniques. We denote the set of data 
by D and assume that D is finite. Depending 
on the properties of D, we have various suit- 
able representations and processing problems. 


B. The Notion of Data 


In recent applications of computers, the main 
task has been arranging and scarching for 
items or attributes in storage, rather than 
numerical computation. Individual informa- 
tion is called a record. Records arranged and 
stored in the memories of computers are usu- 
ally called data. A collection of such data is 
called a file. A complex, large-scale data collec- 
tion is often called a data base. 

In an abstract sense, the record consists of a 
string of letters, but it is usually convenient to 
view it as consisting of its identifying mark 
followed by a finite sequence of items. The 
contents can be classified according to their 
properties, such as topological relations, order 
relations, or items representing numerical 
values. For each case, there may be different 
suitable representations and operations. 

We often store information after a suitable 
process of information compression. In some 
cases the operations are reversible, and com- 
plete recovery is possible. In other cases the 
operations are not completely reversible, and 
we must throw away part of the information 
in order to compress the rest. An example of 
recoverable information is the replacement 
of a run of 1’s (or 0’s) in a binary code by its 
length. An example of lost information often 
occurs in the graphical manipulation of num- 
bers by means of hashing. In this process we 


360 


select a suitable function s= f(a,, ..., a,), where 
the vector (d,, ...,a,) corresponds to the repre- 
sentation of the record in such a way that the 
value s can be monomorphic for the universe 
of the records as often as possible, and the 
data (a,, ...,a,) is kept in the storage corre- 
sponding to the value s. Generally, the vari- 
able representing the location of data is called 
a pointer. 

In the abstract sense, data bases are simply 
sets of items, but in most cases a base will also 
exhibit some mathematical structure, such as 
order relations. In order to handle a data base 
efficiently, we must take its particular structure 
into account, and suitably represent that struc- 
ture. Such structures and their representations 
are called data structures. 


C. Linear Structures 


The component of a vector or the stations 
along a single railway line form linearly 
ordered sets. Their most essential feature is the 
notion of immediate predecessor or immediate 
successor. Such a data structure is called a 
linear structure, and the aligned sequence is 
called an array. Multidimensional arrays, such 
as the elements of a matrix, are stored in the 
form of 1-dimensional arrays in the memory of 
a computer. 

When the set of data D is a linearly ordered 
set, it is usually represented as a suitable array. 
Here, the problems of ordering and table look- 
up are fundamental. Ordering in this sense 
means putting the given elements of D into the 
order defined for D. Since, historically, sort- 
ing machines were used to put punched cards 
in order, this process is also called sorting. 

The process of arranging several individually 
sorted data packs into one sorted pack is 
called merging. 

A fundamental data-sorting operation is 
"comparison" with respect to the order for 
D. There have been many investigations of 
the estimation of the lower or upper bounds 
for the number of comparisons and of devel- 
oping efficient algorithms. Asymptotically, 
O(nlogn) is the theoretical lower bound for 
n elements, and some algorithms are known 
to achieve this bound (— 71 Complexity of 
Computations). 

If there is given a univalent correspon- 
dence f: DD’ and the correspondence table 
is stored in the memory in a suitable form, the 
problem of table look-up arises, which requires 
finding f(d) for a given d e D. In this process, a 
fundamental operation is a comparison of 
deD with some xeD in the table. There have 
been many investigations of efficient arrange- 
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ments and of algorithms with minimal num- 
bers of comparisons. 


D. List Representation 


If the given finite set D has an order satisfying 
the reflexive and transitive laws, addressing is 
often used as a medium for representing the 
order in the memory. Let d,, ...,d, be all the 
immediate successors to an element de D. We 
can represent them by the following sequence 
of triplets: 


If n20, we take (d,0,0); 
Ifn=1, we take (d,df,0); 
If n=2, we take (d,dt,47); 
If n=3, we take (d, df, e*), (e,,d3, 43); 
If nz4, we take (d,dT, eT), (e,,d%, e%), 
(€2, d$. ef), .... (64-2. di 1, di), 


where d* means the pointer to d,, e; is an ele- 
ment introduced for convenience, and e£ is the 
pointer to e;. This is called a list representa- 
tion. If we denote by d d,, the fact that d, is 
an immediate successor of d, then the total set 
can be regarded as a fdirect graph. Usually, 
this graph is a ttree, and then the graph is 
called a tree structure and its representation a 
tree representation. The advantage of this 
representation is that addition or deletion is 
quite easy. 

The set of logical formulas is partially 
ordered, with the order given by the rules of 
inference. The set consisting of a series of 
inferences forms an ordered subset. Thus, if 
a tree representation can be automatically 
treated, so can the process of inference. 

In dealing with linguistic data (words or 
sentences, for example), it is often natural to 
consider a noncommutative Tree semigroup D 
generated by a finite number of generators (the 
alphabet or vocabulary). In this case, if there 
exists a natural order for the generators, it 
determines in D a lexicographic partial order- 
ing. Then the tree representation can be used 
for representing a dictionary whose entires are 
elements of D. This method is not efficient with 
respect to speed of table look-up and economy 
of memory, but it sometimes has the advan- 
tage of simplifying the treatment of compli- 
cated data. 


E. Memory Devices for Processing 


In dealing with algebraic formulas or lan- 
guages with parentheses, data maintenance 
methods such as tree representation or push- 
down storage are often convenient as auxiliary 
memory-controlling methods. The character- 
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istic of the push-down storage method is that it 
returns the data in reverse order with respect 
to the time of acceptance and remittance. 

It is sometimes called a stack or a first-in-last- 
out memory. Contrary to a stack, there is a 
storage that returns the data in the same order 
as the acceptance. It is called a queue or a 
first-in—first-out memory. 


F. Information Retrieval 


A request to obtain all the records qualified 
by some property in a data base Q is called a 
query. The part of the collection of records Q 
qualified for a query, therefore, can be identi- 
fied by its characterístic subset of the space of 
all possible records. A procedure that deter- 
mines the subset Q, of Q composed of all the 
records relevant to a query q is called the 
information retrieval of the query. A system 
which provides such a procedure for every 
query q€Q is called an information retrieval 
system organized for Q with respect to Q. 

It is essential to design the system in such a 
way that records can be retrieved quickly for 
queries in a certain class. Such a selected set of 


` queries can be composed of all the first-order 


queries specifying an item and asking for all 
the records containing the attribute value or 
the key characterizing the item. It can, in some 
cases, include second- or higher-order queries 
specifying a number of items and asking for all 
the records in which attribute values or keys 
characterizing the item occur simultaneously. 

In an information retrieval system, the mas- 
ter file of a data base Q is usually organized by 
way of auxiliary memories on data structures 
such as sequential files, indexed sequential 
files, virtual storage files, or direct access files, 
using the magnitude of the accession number 
or primary key of each record. Various man- 
agement systems, called SAM (Sequential 
Access Method), ISAM (Index Sequential 
Access Method), VSAM (Virtual Storage 
Access Method), or DAM (Direct Access 
Method), exist. 

In addition to the organization of such a 
master file, various directory files or indexes 
are organized in order to retrieve each query 
q€Q quickly, because there are many relevant 
keys or combination of keys other than the 
primary key. An inverted filing scheme (IFS) is 
a typical scheme for organizing such indexes 
or directory files. A bucket B, or an address- 
able set of secondary memories is provided 
for each canonical query q,€Q in a one-to-one 
way. An index or a list of accession numbers of 
pertinent records is organized in each of the 
buckets contiguously so as to make it possible 
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to retrieve all accession numbers of relevant 
records quickly. The essentials of an IFS is 

to define a one-to-one QAT (Query to Ad- 
dress Transformation) from the inverted set of 
queries Q= (q,j to the set of buckets B= {B,}. 

An IFS has efficient retrieval performance 
with respect to the inverted canonical queries. 
The scheme, however, may in some cases 
require a large number of buckets. Moreover, 
the scheme requires quite redundant storage 
of accession numbers by a number of buc- 
kets because a record may be pertinent to a 
number of canonical queries simultaneously. 
Although higher retrieval performance can be 
expected by including higher-order queries in 
the inverted set, the space and machine time 
needed for such organization is prohibitive. 
This is one of the reasons why an IFS for first- 
order queries is preferable in almost all prac- 
tical cases. Another reason is that if the first- 
order queries are inverted, every retrieval can 
logically be performed by certain Boolean 
operations executed among a certain number 
of retrieved sets of accession numbers. A trade- 
off of space and time needed for the organiza- 
tion of a scheme and its retrieval performance 
might be the determining factor for the selec- 
tion of the set of canonical queries to be 
inverted. 

An attempt to overcome the limitations 
inherent in the inverted scheme can be found 
in the work on the balanced file organization 
scheme due to Abraham et al. [7]. By extract- 
ing the essentials, Yamamoto et al. [9, 10] 
defined a BFS (Balanced File-organization 
Scheme) in a wider sense as follows: 

(i) Buckets are organized in such a manner 
that every bucket is associated with more than 
one query. 

(ii) Every canonical query is associated with a 
unique bucket. 

(iii) The accession number of a record with 
some additional information is stored in a 
bucket once if and only if it is pertinent to at 
least one of the associated queries. 

The essentials of a BFS is to define a many- 
to-one transformation from the set of canon- 
ical queries Q to the set of bucket addresses B, 
or an MQAT (Multiple Queries to Address 
Transformation). An MQAT defines a parti- 
tion of Q into mutually disjoint subsets. It is 
a generalization of a QAT which defines an 
IFS. If c is the number of queries to be asso- 
ciated simultaneously with a unique bucket 
in a BFS, then the number of buckets to be 
prepared is 1/c, which is a drastic reduction 
from that for an IFS. This reduction makes 
it possible to extend the feasible range of 
canonical queries. Actually, in the system 
HUNDRED (Hiroshima University New 
Documents REtrieval and Dissemination), a 
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51-to-1 MQAT is used for the organization of 
indexes. 

Although some space overload may occur 
by storing some additional information in 
order to tell which record is pertinent to which 
query in a bucket, reduction of redundancy 
can be expected, because a record may be 
pertinent to more than one query simulta- 
neously. The reduction in the number of buc- 
kets and redundancy would contribute much 
in saving time and space needed for the or- 
ganization of indexes. 

Among those BFS's defined by MQAT's 
with a given data base Q, Q, and the number c 
of queries associated with the same bucket, 
one BFS can be called the best if its redun- 
dancy is the least under some reasonable as- 
sumptions imposed on the distribution of keys. 
Several combinatorial problems have been 
raised and solved in this connection [10]. 
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Suppose that we are given a set $ and a propo- 
sition P(x,, x5, ..., x,) for elements x, of S. 
Then we have the problem of universal validity 
of P, which is the problem of finding a general 
algorithm (i.e., a finitary procedure) by which 
we can discern whether P(x,,..., x,) is true for 
all n-tuples (x,, ..., x,). The problem of finding 
an algorithm by which we can discern the 
validity of P(x,,..., x,) for some specifically 
chosen n-tuples (x,, ..., x,) is called the prob- 
lem of satisfiability of P. These two problems 
are customarily called decision problems. The 
problems are such that affirmative solution of 
one of them implies negative solution of the 
other. 

To give a precise definition of decision prob- 
lems, let us note that a tfree semigroup with 
countable generators can be identified with a 
subset of the set N of natural numbers (by 
virtue of *Gódel numbering; — 185 Gódel 
numbers). On the other hand, if S is a given 
tformal system with countably many symbols, 
the set of all *formulas in G is a subset of the 
free semigroup generated by the symbols in G. 
Thus the set of all formulas in G is identified 
with a subset of N. A subset M of N (or N x 
N x ... x N) is (general) recursive if its tre- 
presenting function is general recursive (— 356 
Recursive Functions). By using the concept of 
recursive function, a precise definition of the 
decision problem can be given as follows: The 
decision problem of M is solved affirmatively if 
and only if we can obtain effectively a proce- 
dure defining the representing function of M, 
and the function is general recursive. The 
decision problem of M is solved negatively if 
and only if we can obtain a proof that M is 
not recursive. 

For a set A of formulas in G, we let g(A) be 
the set of all Gódel numbers corresponding to 
the elements of A. Let A' be the set of formulas 
in A that are deducible in G, and let t(A)= 
g(A'). The decision problem of the set A of 
formulas is said to be solved affirmatively 
(negatively) if the decision problem of 1(A) is 
solved affirmatively (negatively). By refining 
this concept we arrive at the notion of the 
degree of (recursive) unsolvability. Let A and B 
be subsets of N. The relation “A is recursive in 
B and B is recursive in A” (— 356 Recursive 
Functions) is reflexive, symmetric, and transi- 
tive. Hence this relation decomposes the class 
of all subsets in N into disjoint nonempty 
equivalence classes. A and B are defined to 
have the same degree of unsolvability if they 
belong to the same equivalence class. Thus the 
degrees of unsolvability can be identified with 
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the equivalence classes. The degree of recursive 

sets is 0. 

The relation a<b ts defined between the 
degrees a of A and b of B to mean “A is recur- 
sive in B." Clearly, for any degree a, we have 
0 «a. The partially ordered system of degrees 
constitutes an tupper semilattice. 

Research on the decision problem has been 
done mostly in areas related to the tfirst-order 
predicate calculus L! and the formal systems 
on it. We now list some important results. 

(I) Results concerning L!. The decision 
problem has been solved negatively for the sets 
of formulas of the following forms. (Here it is 
assumed that no function symbols appear and 
that 9I represents a formula involving no 
occurrence of V, 3, or free variables.) 

(1) All formulas in L! (A. Church, A. M. 
Turning), 

(2) 3x,3x; ... dx, Vy, Vy, ... Vy, A (T. Skolem), 

(3) dx,3xj43x4 Vy, Vy, ... Vy,3z A (K. Gödel), 

(4) 3x, 3x, Vy, Vy, ... Vy,3z A (L. Kalmar), 

(5) 3x,3x4 Vy 3z, 3z, ... 3z, M (J. Pepis), 

(6) Vx3yVzdu, du; ... Ju, MA (W. Ackermann), 

(7) 3x,3xj,3x4Vy A or 
3x, dx; Vy 3z A (J. Surányi), 

(8) 3x Vy, Vy dz, 3z, A or 
Vx dy Vz Ju, Ju, A (Surányi). 

The decision problem has been solved 
affirmatively for the sets of formulas of the 
following forms, where it is assumed again 
that no function symbols appear and that 9I 
is as above. 

(1) All formulas involving variables only on 
predicates with one argument. (L. Lówen- 
heim, Skolem, H. Behmann), 

(2) Vx, Vx; ... Vx, 9L (P. Bernays, M. Schónfin- 
kel, Ackermann), 

(3) Vx, Vx5 ... Vx, dy, Jy; ... dy, M (Bernays, 
Schónfinkel, Ackermann), 

(4) Vx, Vx, ... Vx, dy4dy3 Vz, Vz; ... Vz, A 
(Gódel, Kalmar, K. Schutte). 

(II) Results concerning forma! systems on 
LI Throughout the rest of this article we 
assume that no tfunction variables appear. 
Predicate constants, function constants, and 
object constants may appear. By the decision 
problem for a formal system € we mean the 
decision problem for all tclosed formulas in €. 
Most of the results obtained so far concerning 
the decision problem for formal systems have 
been negative. Such results include those for 
formal systems formalizing natural number 
theory, the theory of rational integers, the 
elementary theory of tgroups, rings, tfields, 
tlattices, and the like, and axiomatic set theory 
(A. Tarski et al.). 

The word problem for groups was solved 
negatively by P. S. Novikov (— 161 Free 
Groups B). In connection with this deci- 
sion problem, there are some investigations 
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by W. W. Boone, G. Higman, and others 
[14-17]. 

The decision problem for a formal system 
formalizing the elementary theory of tAbelian 
groups has been solved affirmatively (W. 
Szmielew). Little is known about the decision 
problem concerning partial systems of for- 
mulas of given formal systems except the fol- 
lowing: (1) the decision problem for the set of 
formulas of the form Vx, Vx, ... Vx, M in a 
formal system (— 161 Free Groups); (2) the 
Hilbert-type problem, which is the decision 
problem for the set of formulas of the form 
dx, 3x, ... Ix,, (t= s) in a formal system. In 
particular, the Hilbert-type problem in a 
formal system formalizing natural number 
theory is called Hilbert's tenth problem (— 
196 Hilbert). The latter is the problem of find- 
ing an algorithm for deciding whether a *Dio- 
phantine equation has an integral solution. 

This decision problem was studied by M. 
Davis, H. Putnam, J. Robinson, and others, 
and finally Yu. V. Matiyasevich solved it neg- 
atively by showing that every recursively 
enumerable relation is Diophantine [9]. (A 
relation R(m,,...,m,) is called Diophantine if 
there is a polynomial P(x,, ..., Xj, y, .... yy) 
with integer coefficients such that R(m,, ..., mj) 
holds if and only if P(m,, ..., mj, Yis ..., Y) 20 
has a solution for y,, ..., y, in natural num- 
bers.) In addition, some investigations have 
been made about the *second-order predicate 
calculus L?, tintuitionistic logic, etc. [ 1, 2]. 
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Dedekind, Julius Wilhelm 
Richard 


Julius Wilhelm Richard Dedekind (October 6, 
1831—February 12, 1916) was born in the city 
of Braunschweig in central Germany and 
studied at the University of Góttingen under 
C. F. *Gauss, who was then in his later years. 
He received his doctorate at Góttingen with a 
thesis on the *Euler integral. He was professor 
of mathematics from 1858 to 1862 at Zürich 
and from 1863 to 1894 at the Technische 
Hochschule in Braunschweig. During his early 
twenties, he wrote papers concerning analysis 
and the theory of probability, but in 1857 he 
began publishing papers on the theory of 
numbers. He edited *Dirichlet’s lectures on 
number theory (Vorlesungen tiber Zahlen- 
theorie, first edition 1863, fourth edition 1899) 
and concentrated on research in arithmetic 
and algebra. The theory of *ideals, which he 
founded, was originally set out in a supple- 
ment (1863) to Dirichlet's Vorlesungen. 
Dedekind treated subjects ranging from the 
axiomatic foundations of the theory of ideals 
to tlattices and tgroups as algebraic systems. 
He was a pioneer of the abstract algebra of the 
20th century. Among his notable achievements 
are the {Dedekind zeta functions of talgebraic 
number fields, Dedekind cuts in the theory of 
real numbers, the algebraic theory of talge- 
braic functions (of which he was a coauthor 
with H. Weber), and the theory of natural 
numbers. He was one of the first to support 
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*Cantor's set theory. His theory of natural 
numbers was founded on the concept of sets 
and included the idea of "recursive functions. 
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Degree of Mapping 


A. Degree of Mapping 


Let M" and N" be n-dimensional tclosed 
toriented tC’ manifolds (or tcombinatorial 
manifolds). For example, M" = N” = S" (the tn- 
sphere). Their nth thomology groups with 
integral coefficients H (MT: Z) and H,(N"; Z) 
are infinite cyclic groups generated by the 
fundamental homology classes [ M"] and [N"], 
respectively (— 201 Homology Theory). A 
continuous mapping f: M"— N" induces a 
homomorphism f, : H,(M"; Z)— H,(N"; Z), and 
there exists an integer d, such that f,([ M"])— 
d, [N"]. This integer d, is called the degree of 
mapping (or the mapping degree) of f. When 
M" = N", d, does not depend on the orienta- 
tion of M”. 

If a continuous mapping g: M" — N" is 
*homotopic to f (f =g), then we have d, — d,. 
If f is homotopic to a tconstant mapping 
(f ~0), then d, —0, while if f is a homeomor- 
phism, then d, — +1. When M" = N”, a homeo- 
morphism f: M"— M" is called an orientation- 
preserving mapping if d, — 1 and an orientation- 
reversing mapping if d, = —1. 

Suppose that M" and N" are closed oriented 
n-dimensional combinatorial manifolds and 
that f: M" N" is a tsimplicial mapping. Let 
Yo; and 257 (oj, s; are n-simplexes of M", 
N”, respectively) represent [ M"], [ N"], and let 
p; (resp. q;) be the number of n-simplexes oi 
such that f(o7) is equal to s (resp. —5;). Then 
p;— 4j is independent of the index j and equal 
to dy. 

Suppose that f, g: S" — S" are continuous 
mappings (nz 1). Then f ^g if and only if d, = 
d, (Brouwer mapping theorem). This implies 
that z,(S") z Z (— 202 Homotopy Theory). 
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B. Local Degree of Mapping 


Suppose that M" and N" are n-dimensional 
oriented C’ (or combinatorial) manifolds and 
f: M"5N" is a continuous mapping. Suppose 
further that a point p of M" has a neighbor- 
hood U such that f(p) z f(q) for any point d 
contained in U — (pj. Then f induces a homo- 
morphism f,: H,(U, U — (p]) 2 H,(N, N — 

{ f(p)}) of n-dimensional flocal homology 
groups with integral coefficients that are both 
isomorphic to Z. If u and v are generators of 
the groups H,(U,U — (pj) and H,(N, N — 

{ f(p)}) corresponding to orientations, re- 
spectively, then there exists an integer k such 
that f,(u) — kv. We call this integer k the local 
degree of mapping f at p. If M" and N" are 
closed oriented C" manifolds (r 2 1) and f: M" 
— N" is a C" mapping, then there exists a point 
r of N” such that the set f! (r) is a discrete 
subset (p, ..., P} of M", and each p; has a 
neighborhood U, satisfying the foregoing con- 
dition ('Sard's theorem). If k; is the local de- 
gree of f at p, then d; - X ko 


C. Linking Numbers 


Given two mutually disjoint smooth closed 
curves C, and C, in Euclidean 3-space, a quan- 
tity Lk(C,, C;) indicating how closely they 

are interlinked with each other was given by 
Gauss as follows: Let C; be expressed by the 
parameters x;— x;(t;) (i= 1, 2), where x;(t;) are 
*continuously differentiable. Then the quantity 


I 1 
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dx, d 
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is an integer called the linking number of C, 
and C}. 

More generally, let M" be an n-dimensional 
oriented tcombinatorial manifold (or C" mani- 
fold (r z 1)) and K and K* its fcellular decom- 
positions such that K* is dual to K. Let 27 and 
z5 (r+s=n—1) be boundaries belonging to 
the complex K and K*. Suppose that C'*! is 
any tchain of K whose boundary is z}. Then 
the *intersection number [C'*! ]- [z5] does not 
depend on the choice of such a chain C’*’. We 
set Lk(z1,z3) - [C'*! ]- [z3] and call it the link- 
ing number of z{ and z3. The linking number 
Lk(27,25) of tsingular boundaries 21, 25 (r - s— 
n — 1) of M" is similarly defined by consider- 
ing the approximations z{, z5 of 21, Z3 belong- 
ing to a suitable cellular decomposition K and 
its dual K*. The number Lk(Z}, 25) is bilinear 
with respect to 21, 2$, and we have Lk(21,25) — 
( —1)^"  Lk(25, 21). In the example in 3-dimen- 
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sional Euclidean space R? shown in the left 
half of Fig. 1, we have Lk(21,22) — 1, while 
Lk(2!, 23)=2 for the example shown in the 
right half of the same figure. In particular, if 7 
is homologous to 0 in M" —|25], then we have 
Lk(21, 25) 20 (Fig. 2). Generally, if Z; and Z7 
are homologous in M" —|25], then Lk(21, 27) 
-—Lk(21,22) 


Fig. 1 


D. Order of a Point with Respect to a Cycle 


Let M" be an n-dimensional oriented com- 
binatorial manifold (or C” manifold (r> 1)) 
with the nth Betti number b, =0, 2"! an 

(n — 1)-dimensional singular boundary of M", 
and o a point of M" that is not contained in 
ISL We set ord(2" !, o) 2 Lk(2"^!, 0) and call 
it the order of the point o with respect to Sr, 
For example, when M" — R? and 2! ={ f(t)|O < 
(el, f(0)— f(1)), where f is a continuous 
function, the order ord(2!, o) is equal to the 
rotation number around o of a moving vector 
of (t) as t varies from 0 to 1. This ord(zZ!, o) 
stays invariant as the point o moves in a *con- 
nected component of the complement R? — 2! | 
(Fig. 3). On the other hand, if 22 = ( f(]0 < 

t & 1, f:(0)= fj (1); (i=0, 1) are closed curves in 
R? and the distance ot fo(t), fi (©) is smaller 
than oi folt), 0) for all t in the interval [0,1], 
then we have ord(Z1, o) - ord(21, o) (Rouché's 
theorem). 


Fig. 2 Fig. 3 
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Denjoy Integrals 


A. History 


For a real-valued function f(x) of a real vari- 
able to be tLebesgue integrable, it is necessary 
and sufficient that there exist an *absolutely 
continuous function F(x) such that F'(x) 2 f(x) 
at talmost all points x (— 221 Integration 
Theory D). In general, the derivative of a func- 
tion is not necessarily Lebesgue integrable. A 
function f(x) is Lebesgue integrable if and only 
if | f(x)| is integrable. Hence a function which 
has an improper Riemann integral is not neces- 
sarily Lebesgue integrable (— 221 Integration 
Theory A). For this reason, it is desirable to 
extend the concept of Lebesgue integrals. In 
1912, A. Denjoy constructively defined a new 
concept of integrals (Denjoy integral in the 
restricted sense; — Section D), which is an 
extension of both Lebesgue and Riemann 
integrals. Later, N. N. Luzin provided the 
descriptive theory of this integral. Indepen- 
dently, and nearly simultaneously, A. J. Khin- 
chin and Denjoy defined a more general in- 
tegration (Denjoy integral in the wide sense 
(1916); — Section D). 

In 1914, O. Perron, independently of Den- 
joy, defined a concept of integrals (Perron 
integrals) that is equivalent to that of Denjoy 
integrals in the restricted sense. To establish 
this concept, Perron considered the differen- 
tial equation y'= f(x) and utilized a method 
similar to the one used in the proof of the 
existence theorem for the solution of the dif- 
ferential equation y'= f(x, y). However, the 
concept of Denjoy integrals is inadequate to 
treat unbounded functions. Thus to extend 
the concepts of Riemann and Lebesgue inte- 
grals, various ideas have been introduced; for 
example, Denjoy (1921), J. C. Burkill (1951), 
and R. D. James (1950) introduced new con- 
cepts as byproducts of investigations concern- 
ing the coefficients of trigonometric series 
[2,3]. The A-integral concept devised by A. N. 
Kolmogorov was meant to deal with the pro- 
blem of the conjugate function of Fourier 
series [4]. As a certain completion of the space 
of functionals of step functions, K. Kunugui 
defined the notion of E. R. integrals (1956), 
which coincides with that of A-integrals in a 
special case [5,6]. 

What has been stated so far deals only with 
functions of a real variable. Concerning the 
extension of Denjoy integrals to the case of 
several variables, research has been done by 
M. Loomis, S. Kempisty, S. Nakanishi (née 
Enomoto), and others [7,8]. 
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B. Approximate Derivative 


If we have lim, ax ,om( EQ (č — h, £ +k) (ht 
k)=1 at a point č of a *measurable subset E 
of the real line, where m is the 1-dimensional 
Lebesgue measure, the point é is called a point 
of density for E. Almost all the points of E are 
points of density for E ( Lebesgue's density 
theorem). Let E be a measurable set having xo 
as a point of density, and let F(x) be a measur- 
able function on E. If there exists a number / 
such that for each £7 0, x, is a point of density 
for the set {x|!—e<(F(x)— F(xg))/(x x9) & 

l 4- e x€ E), then l is called the approximate 
derivative of F(x) at x, and is denoted by 

AD F(x,). If AD F(x) exists, F(x) is said to 

be approximately derivable at x,. If F(x) is ap- 
proximately derivable at each point of E, then 
F(x) is said to be approximately derivable in 
E. If F'(x) exists at a point x, then AD F(x) 
exists at x, and we have AD F(x)= F'(x). How- 
ever, there exists a continuous function F(x) 
that is approximately derivable at almost all 
points of an interval and yet not differentiable 
at any point of a set of positive measure. 


C. Generalized Absolute Continuity 


Let E be a set in R, and let F(x) be a real- 
valued function whose domain contains E. If 
for each £2 0, there is a ô >0 such that for 
every sequence ([a,, b,]) of nonoverlapping 
intervals whose endpoints belong to E the 
inequality ZO, — a,) « ó implies $| F(b,) — 
F(a,)| « £&, then the function F(x) is said to be 
absolutely continuous on E. We denote by AC 
the set of all functions that are absolutely 
continuous on E. If F(x) is continuous on E 
and E is the union of a countable sequence of 
sets E, on each of which Fe AC, then F(x) is 
called a generalized absolutely continuous 
function, and we write FeGAC. If FeGAC, 
AD F(x) exists almost everywhere. 

If, for each £0, there is a 00 such that for 
every sequence {[a,, b,]] of nonoverlapping 
intervals whose endpoints belong to E the in- 
equality X(b, —a,)<e implies X, O1F; [a,, b,]] 
<e (O(F; [a,, b,]] denotes the oscillation of 
the function F(x) in [a,, b, ], i.e., the difference 
between the least upper bound and the great- 
est lower bound of the values assumed by F(x) 
on [a,, b,]), then F(x) is said to be absolutely 
continuous in the restricted sense (or abso- 
lutely continuous (*)) on E; and we write Fe 
AC(*). Just as we defined the notions of gen- 
eralized absolute continuity and absolute con- 
tinuity, so we define the notions of generalized 
absolute continuity in the restricted sense and 
generalized absolute continuity (+). Thus Fe 
GAC(*) means that F(x) is a generalized ab- 
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solute continuous (*) function on E. If Fe 
GAC(*), then F'(x) exists almost everywhere. 


D. Definitions of Denjoy Integrals 


Let f(x) be a real-valued function defined on 
I — [a,b]. If for f(x) there exists a function 
F(x) that belongs to GAC on I and for which 
AD F(x)= f(x) holds almost everywhere, then 
f(x) is said to be Denjoy integrable in the wide 
sense (or D-integrable) on J. We call F(b)— 
F(a) the definite D-integral of f(x) over I, 

and denote the value by (D)f? f(x) dx. The 
function F(x) is called an indefinite D-integral 
of f(x) on I. Similarly, we obtain the definition 
of Denjoy integral in the restricted sense (or 

Dia integral) by replacing GAC by GAC(x) 
and AD F(x) by F'(x) in the definition of the 
D-integral. If a continuous function F(x) satis- 
fies the equality AD F(x)2 f(x) # +00 (F'(x) — 
f(x)s* +00) for all except countably many 
points in 7, then F(x) is an indefinite D- 
integral (D(x )-integral) of f(x). A Lebesgue- 
integrable function is D(*)-integrable, a D( + )- 
integrable function is D-integrable, and a D- 
integrable function that is almost everywhere 
nonnegative is Lebesgue integrable. 


E. Constructive Definition of Integrals 


Let S be a functional whose domain | J, K(S; 1) 
consists of the union of sets K(S; I) of real- 
valued functions defined on closed intervals 

I — [a, b]. If f belongs to K(S; I), we denote 
the value S( f) by S( f; I). Such a functional S 
is called an integral operator if the following 
three conditions are satisfied: (1) If fe K(S; Ip) 
and J is an arbitrary interval contained in Ip, 
then the trestriction f; to I of f also belongs 
to K(S; I). Also, S( f; I) is a fcontinuous addi- 
tive function of the interval I c I. (2) Let I, = 
[a.b], I, =[b, c], and I 2 [a, c] (a « b « c). If for 
a function f defined on J, f, € K(S; 1j) and 

f; € K(S; L), where f, — f;, and f; = fi, then 

f € K(S; I). (3) If f is identically 0 on I, then 

f € K(S; I) and S( f; I) 20. For two integral 
operators S, and S,, we say that S, includes S, 
(or S, is weaker than $5) if K(S,; I) c K(S;; 1) 
for every I and S,( f; I) - S;( f; I) for every 

f € K(S,; I). The D-integral (D( »)-integral) is 
the weakest integral operator containing the 
Lebesgue integral and satisfying the following 
two conditions, (C) and (H) (resp. H(*)): (C) 
Cauchy's condition. If, for every function f 
defined on lọ, we have fe K(S; I) for any I= 
[a-- à, b — £| ẸṢ lo — [a, b], and also if the finite 
limit lim; ,o ,.,9 S( f; I) exists, then fe K(S; Ip) 
and S( f; Jul coincides with the foregoing limit 
value. (H) Harnack’s condition. Let E be a 
closed subset of Ip, (1,) be a sequence of inter- 
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vals contiguous to the set consisting of the 
points of E and the endpoints of lọ, and f bea 
function on Jy satisfying the following three 
conditions: (i) fce K(S; I9), where fz(x) = f(x) 
whenever xe E and f, —0 otherwise; (ii) f, = 
J, € K(S; 1) for each k; and (iii) £,1S(f,; Il « 
+% and lim,.,,, O(S; f,; I,) =0 when the se- 
quence {/,} is infinite. Then it follows that 

fe K(Si lo) and S(f; 15) 2 S(fg; fo) + Lx S( Je; 1). 
(Here O(S; fx; 1,) denotes the variation of S( f,) 
on Jp, that is, the least upper bound of the 
numbers |5( fj; J)|, where J denotes any sub- 
interval of 7,.) We obtain condition (H(*)) by 
replacing condition (iii) in (H) with a more 
restrictive condition: ©, O(S; f; Ij) « +œ. The 
constructive definition of the Denjoy integral 
in the wide sense (the Denjoy integral in the 
restricted sense) is obtained by 'transfinite 
induction starting with the Lebesgue integral 
and using two methods, (C) and (H) (resp. 
(His, of extensions. 


F. Perron Integrals 


Given a function f(x) defined on an interval 
(a, b], suppose that F(x) is a function defined 
on the same interval such that (1) F(x) z f(x); 
(2) F(x) # —oo (resp. (1) F(x) & f(x); Q^) F(x) # 
+00) at every point x, where F(x) (resp. F(x)) 
denotes the tlower (upper) derivative of F(x). 
In this case, F(x) is called a major (minor) 
function of f(x). If for any ¢>0 there is a major 
function w(x) and a minor function g(x) of 
f(x) such that y(b) — o(b) « e then f(x) is 

said to be Perron integrable. We denote by 
(P); f(x) dx the value inf, (V (b) — v(a); = 

sup, (9b) — o(a)]. 


G. Properties of Integrals 


If { fi} is a nondecreasing sequence of func- 
tions that are D-integrable on an interval 
[a,b] and whose D-integrals over [a,b] consti- 
tute a sequence bounded from above, then the 
function f(x) — lim, ,,, f, (x) is D-integrable on 
[a, b], and we have 


b b 
Éf fœ)dx= lim | f, (x) dx. 


If F(x) is a function of tbounded variation 
and g(x) is a D-integrable function on an 
interval [a, b], then F(x)g(x) is D-integrable on 
[a, b]; moreover, denoting by G(x) the inde- 
finite D-integral of g(x), the following formula 
is valid: 


b 
ml F(x)g(x)dx 


a 


b 
= G(b)F(b) — cora- | G(x)dF(x), 


a 
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where the last term is the tStieltjes integral 
(integration by parts). 

If F(x) is a nondecreasing function and g(x) 
is D-integrable on [a,b], there is a point č in 
[a,b] for which the following formula is valid: 


of g(x) F(x) dx 


č b 
- Fia) (D) | go) dx + F(b)- (D) | g(x)dx 
a $ 
(the second mean value theorem). 
The foregoing theorems remain valid if D 
is replaced by Dia) in the hypotheses and 
conclusions. 
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101 (XXI.19) 
Descartes, René 


René Descartes, (March 31, 1596- February 
11, 1650), philosopher, mathematician, and 
natural scientist, was born in the province of 
Touraine in France. He became dissatisfied 
with his studies of scholastic philosophy in the 
Jesuit Academy in La Flèche, and later, in 
1619, while stationed in Ulm during a tour of 
duty in the army, he underwent a philosoph- 
ical conversion. He had an idea of methodo- 
logically unifying the various fields of interest 
to him using mathematics as a model. He 
returned to Paris in 1621, but moved to Hol- 
land in 1628 to concentrate on his work. Swe- 
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den's Queen Christina invited him in 1649 
to that country, where he died the next year, 
evidently from a combination of cold and 
overwork. 

Descartes, often considered the founder of 
modern philosophy, discarded early the tra- 
ditional theological view of the world and 
stated that all knowledge should be recognized 
as logical only after it has been submitted to 
rational criticism. This ushered in the modern 
view of the world based on mathematics and 
physics. In 1637, he published Géométrie as an 
appendix to his Discours de la méthode, which 
also contained his works on optics and meteo- 
rology. In it, he promoted F. tViéte’s symbolic 
algebra, which he applied to geometric prob- 
lems. His idea that algebra could be used as a 
general method for geometry established him 
as the founder of tanalytic geometry. 
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102 (XVIII.12) 
Design of Experiments 


A. General Remarks 


The design of experiments is a part of the 
statistical planning required to collect the data 
appropriate to the purpose of statistical in- 
ference (— 401 Statistical Inference) in various 
fields of scientific research and application. 
The main purposes of the design of experi- 
ments are (1) to analyze a given statistical 
linear model (— 403 Statistical Models) and 
(2) to devise a "good" statistical linear model. 
Sometimes this term also refers to a statistical 
method including the fanalysis of variances. 
Thus the purpose of designing an experiment 
is to provide the most efficient and economical 
methods of reaching valid and relevant conclu- 
sions from that experiment (— 403 Statistical 
Models). 

R. A. Fisher, whose contributions to statis- 
tical theory were remarkable and far-ranging, 
propounded three required principles to con- 
trol the experimental field in order to guaran- 
tee the validity of statistical methods and to 
increase the sensitivity of experiments: (i) re- 
plication, or the repetition of the set of all the 
phenomena to be compared in the experiment, 
for the evaluation of experimental error var- 
iance; (ii) randomization, or the procedure 
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allotting various experimental treatments at 
random, in order to change systematic errors 
into random errors; and (iii) local control, 
or the procedure which makes the variation 
within each experimental block as small as 
possible, in order to minimize or remove sys- 
tematic errors. These are called Fisher's three 
principles. A design satisfying principles (i) and 
(ii) is called a completely randomized design; it 
enables us to attach a probability statement to 
estimated treatment differences by obtaining 
valid estimate of experimental error variance. 
Let an n-dimensional frandom variable Y = 
(Y;, ..., KI be represented by a linear model 


Y=XéE+W, (1) 


where X is a given n x s real matrix, €=(€,, 
<... É) is an s-vector, and W Z(W,,...,W;) isa 
random vector with the texpectation E(W)— 
0. Then Y is called the observation vector, W 
the error vector, é the effect vector of Y, and 
X the design matrix. 

According to the properties of the effect 
vector č, the linear model (1) is separated 
into three classes: (i) The class of fixed-effects 
models for which £ is a fixed unknown para- 
meter. In this case, the component £; of é is 
called a fixed effect, and a linear function z= 
HE of £, with a given coefficient vector F is 
called a linear parameter or parametric func- 
tion. (ii) The class of random-effects models for 
which the components ©, of € are random 
variables. In this case, each component .5; is 
called a random effect, and E is denoted by x. 
(iii) The class of mixed models for which there 
are both fixed effects č; and random effects 5; 
in é. In this case, the model (1) becomes 


Y-2X,£ X, +W, (2) 


where E) —(£,, ..., £y is a fixed-effect vector 
and €? 2(Z,,,,..., Ef is a random-effect vec- 
tor. The conditions frequently assumed for the 
tdistribution law of Y are: (a) The errors W, 
(i— 1, ...,n) are uncorrelated and E(W,)=... 
= E(W,) 20. (b) The errors W, (i2 1,...,n) have 
a common unknown variance o". (c) The 
errors W, (i=1,...,n) have the tnormal distri- 
bution. (d) The random effects =; are uncor- 
related and independent of the error vector W. 
(e) The random effects =, have a common 
unknown variance o?. (f) The random effects 
5; have the normal distribution. 

Let L(X) be a ‘linear subspace of R” span- 
ned by the column vectors of X. The linear 
model 


Y=Xé+W (3) 


is called a hypothesis on the linear model (1) if 
L(X)cL(X). 
The main issues of the theory of design of 


experiments are concerned with (I) statis- 
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tical inferences, such as estimation or testing 
hypotheses, under models (1), (2), (3), (1), (11), 
(iii); (II) determination of the design matrix X 
satisfying certain optimal conditions; (IIT) 
construction of a theoretical foundation that 
can explain the validity of the statistical treat- 
ment of the observed data by means of the 
above models. 


B. Block Design 


The design of experiments is described here in 
terms of the so-called block design. There are n 
experimental units x= 1, ...,n called plots, and 
an observation Y, is assigned to each plot x. A 
block is constructed with several plots under 
Fisher's principle (iii), and the number of plots 
in a block is called the block size, the jth one 
being denoted by k;, j — 1, ..., b, with 25; k;— 

n. One of v operations, called treatments or 
varieties, is applied to each plot. [t is assumed 
that the observation Y, at the plot « in the 

jth block under the ith treatment has the 
structure 


Y =é ++ W, 
The £j, i=1,...,v, are called treatment effects, 
and the rj, j, ..., b, block effects. It is also as- 


sumed that X; £; «0. In this case, Y is repre- 
sented in matrix notation as 


Y-d£ + Yy+W, (4) 
where ®=(p,) x21, ...,n, i= 1, ...,v, with 


when the ith treatment is applied 


1 
Pai E 
0 otherwise, 


to the plot o, 
and V —(y,j, a — L ...,n, ja L, ..., b, with 


1 
n-d 
0 


Here it is assumed that X; pu = 1, Xa Qai = 
n21, Dinan, Xj, l, and LW; —k;z 1. 
We call r; the number of replications of the ith 
treatment. We set N —(n;) - "V. Then n; is 
the number of observations in the jth block to 
which the ith treatment is applied. The matrix 
N is called the incidence matrix of the block 
design. 


when the plot « belongs to 
the jth block, 
otherwise. 


In any experiment, each plot has its own - 
effect. The blocks are constructed so that this 
plot effect in each block becomes as homoge- 
neous as possible, although it is impossible 
to eliminate the effect completely. For this 
purpose, randomization is adopted. Suppose 
that we are given k plots in a block and v 
treatments (k <v). Then randomization is 
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utilized to select a treatment out of v treat- 
ments to be allocated to each plot so that the 
selection is "at random." Then the plot effects 
are random, and the error term in model (4) 
can be considered to be the sum of a plot effect 
and an original error. 

A block design satisfying Fisher's three 
principles is called a randomized block design. 
Blocks that can accommodate all the treat- 
ments to be studied are called complete blocks. 
Those that do not contain all the treatments 
are called incomplete blocks. Blocking can be 
considered to be an extension of pairing. In 
the terminology of block design, we can say 
that many experiments are block experiments, 
or even (tautologically, if we allow those with 
only one block) that all experiments are block 
experiments. 

In a block design N — (nj), treatments io 
and i, are said to be connected if there exists a 
chain 
lo]iliJa dia Jil 
of integers such that 1 <i, & v (p—0, 1,2, ..., I), 
1 &j, & b (q=1,2,...,), and n; j, 70, nij, 0, 
n; j, > 0, ..., nj > 0, n> 0. If all pairs of 
treatments are mutually connected, then the 
design is said to be connected. In this case, the 
rank of the matrix C defined tn Section C is 
v — 1. If the design is disconnected, then the 


incidence matrix N can be partitioned into 
two or more connected portions, e.g., 


N- 
0 N, 


Thus without loss of generality we can restrict 
ourselves to the connected case. 


C. Estimation under the Fixed-Effects Model 


Conditions (a) and (b) of Section A are as- 
sumed here. The *normal equation which gives 
the "least square estimates é and 5 of é and y, 
respectively, is 


ese) o)" | 
y ( H ) H Ee y » (5) 


Set ®'@ -diag(r,,...,r) - D, VP = 
diag(k,,...,k,)=D,, C=D,— ND; 'N', Q= 
(D' — ND, ! V^Y, where diag(...) means a 
diagonal matrix with the diagonal elements 
.... Then (5) reduces to 


Cé=Q, 
Let L be an orthogonal matrix that transforms 
the matrix C to a diagonal form; that is, L'CL 


= diag(p,, ..., 9, 1,0) — A, p; O for all i. Set 
A* — diag(p, !, ..., 9, 4,,0), C* = LA*L'. Then Ê 


=D; (V'Y - N'É). (6) 


371 


= C*Q is a particular solution of (6) and £; Ê 
—0. A parametric function z= HE with coeffi- 
cient vector F=(F,, F,,..., F,) is called a treat- 
ment contrast if the sum È; F; of coefficients 
vanishes. A treatment contrast x = F'é is called 
a normalized contrast if F'F — 1. It is called the 
elementary contrast if F has only two nonzero 
elements 1 and —1. Elementary contrasts of 
treatment effects show the comparison of 
treatments involved in them. When a design is 
connected, any contrast z is festimable, and 
the tbest linear unbiased estimate of z is #= 
FE Furthermore, if f; is the eigenvector of 

the matrix C with unit length corresponding to 
an eigenvalue p; and F = Xajf;, then the var- 
iance of the estimate 7 is given by o7,a?/p;. 
The following properties are equivalent: (1) A 
design is connected. (ii) Any treatment con- 
trast is estimable. (iii) The rank of the matrix 
C is v — 1. (iv) The minimum eigenvalue 0 of 
D; ^ CD.? is simple and other eigenvalues 0, 
satisfy 0< 0, <1. (v) The maximum eigenvalue 
1 of D; ^ ND! N'D; ?? is simple and other 
eigenvalues 0, satisfy 0 x: 0, < 1. (vi) There 
exists a positive integer p such that each ele- 
ment of (D7 ! ^ ND; ! N'D; !?y is positive. It 


r 


holds that 0, — 1 —6,. 





D. Test of a Hypothesis H : č, — ... = č, in the 
Fixed-Effects Model 


Conditions (a), (b), and (c) of Section A are 
assumed here. The hypothesis H:é,=...=€, is 
represented by 


Y=Py+W. (7) 


Consider a direct sum decomposition R” = 
L(V) + LE(V) + Lët, V) - Lá y of R”, where 
Li(A) and Lj stand for the *orthocomple- 
ments of L(B) with respect to L(A) and R”, 
respectively, and F —(1, 1, ..., 1 eR". The 
projection operator matrices for the decom- 
posed subspaces LOL, LE(¥), Lat, Y), and 
La. y are denoted by P. P}, P}, and P}, respec- 
tively. Then we have 


Pen Ens P,- VD V'—n E 


P,— (1, WD,  W)oC*o'(I, — YD; ! V, 

P,=1,—P, —P,—P, 

where E,, is an a x b matrix whose entries are 

all unity and J, is the n x n identity matrix. 
The analysis of variance for the hypothesis 

(7) in model (4) is given by 

YY=Y P Y+Y P, Y+Y'P,Y+Y PY. 


This is called an intrablock analysis. A usual 
test for the hypothesis H is given by a critical 


102 E 
Design of Experiments 


region with 





EE Y’P,Y 
el YPY 


> constant 


(— 403 Statistical Models). 


E. Optimal Block Design 


A block design is said to be optimal when it 
minimizes the variance of the estimate # of a 
normalized contrast z. Suppose that the num- 
ber v of treatments, the number b of blocks, 
and each block size k, j — 1, ..., b, are given. 
Under each of the following criteria the corre- 
sponding block design is optimal in the sense 
indicated: For positive eigenvalues, p,, p;, 

S Boi Of the matrix C in Section C, (T) 
II] p; is maximal (D-optimality); (IT) min p, is 
maximal (E-optimality); (IIT) 7-1 pj ! is mini- 
mal (or the average variance of the estimates 
of all normalized contrasts of the parameters 
č; is minimal) (A-optimality). 

If p, =...= =(1—b)/(v— 1) (=p, say) 
and nj; is either 1 or 0, then the design is opti- 
mal for each of the optimality criteria (I), (IT), 
and (IIT). In this case, we have 


C-p(I,—v E,). 


Such a design is called a variance-balanced 
block design in which every normalized con- 
trast is estimable with the same variance. If 
D,'ND!N'—n E, r —y(1,—n ! E,,r)(or 
C «(1 — u)(D, —n ! rr)), every normalized con- 
trast is estimable with the same efficiency 1 — p, 
where r D E,, and 0< u « 1. Such a design 
is called an efficiency-balanced block design. 
For a block design, any two of the following 
properties imply the third: (1) The design is 
variance-balanced; (ii) the design is efficiency- 
balanced; (iii) the design is equireplicated. 
When all block sizes k; equal some number 
k independent of j, all numbers r; of replica- 
tions equal some number r independent of i, 
and A;r = 2;;njn,; (number of times that both 
treatments i and i’ are applied to the same 
block) equals some number 4 independent of 
i and i', then the design is combinatorially 
balanced. The design is usually called the 
balanced incomplete block design (BIBD) if 
these three conditions are fulfilled and k « 
v. A BIBD is both variance-balanced and 
efficiency-balanced. If all the treatments are 
replicated the same number of times and the 
blocks are of the same size in a block design, 
then the only variance-balanced design is a 
BIBD, provided such a design exists. A BIBD 
is often denoted by BIBD(v, b, r, k, A) and we 
have the relations vr = bk, Air — 1) — r(k — 1) 
and v € b, among the parameters; the last 
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relation is called Fisher's inequality. The design 
is said to be symmetric when v — b. Further- 
more, if v — b is even, then r —4 must be a 
perfect square. If v — b ts odd, then the equa- 
tion x? =(r—A)y? - ( C1 ?? 2z? must have a 
solution in integers x, y, z not all zero. This is 
called the Bruck-Ryser-Chowla theorem. Neces- 
sary conditions for the existence of a BIBD 
have been obtained. One of these conditions 
is stated in terms of the Hasse-Minkowski p- 
invariant C,(A)=(—1, —1), IT. (D, — D; ) 
where (m, m'), is the "Hilbert norm-residue 
symbol, D, — 1, and D; is the principal minor 
of the n x n matrix A. Another is described in 
terms of the embedding of a quasiresidual 
design into the corresponding symmetric 
BIBD. No effective necessary and sufficient 
condition for the existence of a BIBD has been 
obtained. In general, it is conjectured that for 
a positive integer k, with finitely many excep- 
tions, BIBD (v, b, r, k, Ays exist for all pairs v, A 
of positive integers satisfying v> k, 4(p — 1) 20 
(mod k— 1) and Av(v — 1) 2 0 (mod k(k — 1)). 
The known methods of constructing block 
designs are of two main types, direct and re- 
cursive. Recursive methods are a way of con- 
structing designs from smaller ones. Direct 
methods yield easier constructions, but are 
applicable only for special values of the para- 
meters. A direct method of constructing a 
BIBD designates the treatments and blocks, 
respectively, as the points and subspaces of the 
*projective space and the taffine space over a 
*finite field. To explain another method of 
constructing a BIBD, we let G be an 'additive 
group of order n and x‘), ..., x be m treat- 
ments corresponding to each element x“ of the 
group (i— 1,2, ..., n). The treatment x“ is said 
to belong to the ath class (x — 1,2, ..., m), and a 
pair (xf, xj?) of treatments is called a dif- 
ference of type (o, f, x!) if xX —x = xP (40). 
We can form t blocks of size k 


p? 


B, If. x69]. 


ai prey 
Bí {xg sXe ; 
such that each block B; contains exactly r 
treatments belonging to the ath class (x — 
1,...,m) and among all pairs of treatments 
in the same block there are 2 differences of 
each type (a, B, x"). Such a set of t blocks is 
called a difference set. The ¢ blocks in a dif- 
ference set are called initial blocks. Given 
such a difference set, we can obtain nt blocks 
by joining elements of G to the elements of 
each B, (s— 1,2, ...,t). These nt blocks form 
a BIBD(v2 mn, b —nt, r,k 2 rm/t, A) (^ 66 
Combinatorics). 

As a generalization of Fisher's inequality, 
for an unequal-replicated block design with 
unequal block sizes, b z v — ô holds, where ó is 
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the multiplicity of the maximum eigenvalue 1 
of the matrix D, !^ CD, !?, The equality sign 
holds if and only if the projection operator cor- 
responding to zero eigenvalue of D; 1? CD; !? 
is a zero matrix. 


F. Estimation in a Mixed Model 


Consider a block design (4), where every block 
has the same size k and every treatment has 
the same number r of replications. Let £ be a 
fixed effect and y a random effect denoted by 
H, and assume that W satisfies conditions (a) 
and (b) of Section A and that H satisfies con- 
ditions (d), (e), and (f) (where the £, are re- 
placed by the coordinates H, of H). If E(Hj) =y, 
j=1,...,5, then, changing the notation H — 
E,,y to H, we can rewrite (4) as 


Y -Ty-- 0£ 4 VH 4 W, (8) 


with E(H) — 0. The normal equation that gives 
the least square estimate of c is 


(C - à? (o? -- ko2) ! C,)é 
—Q-o?(o? ko) 'Q,, (9) 


where C and Q are the same as in (6) and 
C,=ND,'N'—rvE,,, Q, =(ND W' — 

v 1 E,,T")Y. Equation (9) cannot be solved 
unless the ratio oi oi is given. When 07:4? is 
not known, substituting in o? + ko its tun- 
biased estimate given by analysis of variance, 
one obtains a solution of (9) which tends to a 
tconsistent estimate of E as the number of 
blocks tends to infinity. 


G. Estimation in a Random-Effects Model 


Let € (denoted by E) and H in the model (8) be 
random effects. Suppose that =, H, and W are 
mutually independent, and that the distribu- 
tions of =, H, and W are N(0,031,), N(0,62 I,), 
and N(0, o? I,), respectively. The distribution of 
Y in (8) contains four parameters y, ol, o2, o2. 
When K; « v, the ‘minimal sufficient statistic is 
generally incomplete, and therefore the op- 
timal estimate of y, o”, o2, and o2 cannot be 
determined. As an example, the minimal suffi- 
cient statistic for the random-effects model of a 
BIBD (v, b, r, k, A) is 


(E, Y, Y Pa, Y, Y P), Y, Y P, Y, Y PLY, Y PY), 
where 
P,-k!(r—-4)  BTB—kr(r—24) ^n! E 
Pj, ke 2) r 

x(T—k"! BT(T—k^! TB), 


nn? 


P,—k^1B—k^!(r— 4)! BTB 
+0A(r—A) !n ! E 


nn? 


nn? 
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with T=0@’ and B — YY". In this case, 
E(Y'P, Y) 2 (n—v— b + 1)o?, E(Y'P,Y) - (b— 

v) (c? + ko2). From these equations unbiased 
estimates of o? and o? can be derived, but their 
optimality is not guaranteed (— 396 Statistic). 


H. Factorial Experiments 


Suppose that there are h factors F,,..., F, 
which affect Y, and each factor F, has s; levels 
(i=1,...,h). It is assumed that v=s, X $5 X... X 
s, treatment combinations are derived by all 
the combinations of the levels of h factors, and 
that v treatment effects are represented by the 
sum of subeffects called main effects and inter- 
actions, or factorial effects covering both. Such 
an experiment is called a factorial experi- 
ment or factorial design; it allows us to ex- 
amine the effects of two or more factors, each 
factor being applied at two or more levels, 

by testing all possible treatment combinations 
formed from the factors under study. Specifi- 
cally, suppose that we have the case h — 2, 
called a two-way layout. Let main effects be 
denoted by ET 2 (51, čs) Zitt, ES, 
and interaction by £^ (612, EI, Ča) 
where 2,47 20, 3,02 20, 2,65? 2 3,617 =0. 
When there is no restriction on the number of 
observations, each of v — s, x s; treatment com- 
binations is replicated t times. Components Y; 
of the observation vector Y are represented 
by a linear model 


Y= +E] 6 + EL? + Wa, 


E D Je bense. eL. 


or, in vector notation, 
Y=Ty+X,é! +X,€74+-X,,€17+ W. 
The analysis of variance in this case 1s given by 


Y'Y -Y YPY, 


where P, is the projection operator matrix 
on the subspace derived by a direct sum 
decomposition 


R" = L(V) Lr(X,) + LE(X5) Ly, x (X1) 
+Ly,,, 


where n —s, $,t. Denoting by Y; the arith- 
metic mean of Y; over the subscript k and 


using similar notation Y,.., Yos and Y..., we 
have Y'P, Y 2nY2, Y'P,Y « s,tY (Y... HF, 
Y P,Y 2s,tX(Y; — Y.P, YP,Y -tX( Y, E 
VK, YP,Yz (Y, — Kal, The 
tanalysis of variance table is given in Table 1. 
For hz 3 similar models can also be con- 
sidered for dealing with interactions up to h- 
factor. The factorial experiment is said to be 
symmetric if s, =s, = ... = S, = s and is called 
the s^ factorial experiment. Otherwise it is 
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Table 1 Analysis of Variance 








Sum of Degrees of 
Source Squares Freedom 
F, main effects Y'P,Y s,—1 
F, main effects Y'P,Y $,—1 
F, F, interactions Y’P,Y (s, —1)(s; — 1) 
Error Y'PY $,$5(t— 1) 





said to be asymmetric and is called the 
S1 X $5 X ... X Sp factorial experiment. 


I. Fractional Factorial Designs 


Factorial designs require at least an experi- 
ment for all the combinations of levels of 
factors under consideration. When there are a 
large number of treatment combinations re- 
sulting from a large number of factors to be 
tested, it will often be beyond the resources of 
the investigator to experiment with all of them. 
For such cases Finney (Ann. Eugen., 12 (1945)) 
proposed a method in which only a fraction of 
the treatment combinations are experimented 
with. A design of this type is called a fractional 
factorial design. One reason for the usefulness 
of fractional factorial designs in preference to 
factorial designs is that they involve a smaller 
number of treatment combinations, since, in 
most scientific experiments, it is usually found 
that a large number of the higher-order inter- 
actions are negligible. The crucial part of the 
specification of fractional factorial designs is 
the suitable choice of the defining or identity 
relationship. Equating the nonestimable fac- 
torial effects for the selected fraction of treat- 
ment combinations with J gives the identity 
relation. Each factorial effect is not estimable 
after selecting a fraction of treatment com- 
binations, and any contrast of the selected 
treatment combinations represents more than 
one factorial effect. All factorial effects repre- 
sented by the same treatment combinations 
are called aliases. In aliases, by assuming that 
other interactions are negligible when com- 
pared to the one of interest, estimation can 
be made by means of the corresponding con- 
trast of the selected treatment combinations. 
When all factorial effects of order higher than 
l are assumed to be zero, a fractional factorial 
design is said to be of resolution 2/+ 1 if it 
satisfies the condition that under the usual 
model all factorial effects up to order / are 
estimable, whereas a fractional factorial design 
is said to be of resolution 2/ if it satisfies the 
condition that under the usual model all fac- 
torial effects up to order | — 1 are estimable. 

In the beginning, the theory was developed 
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for an orthogonal fractional factorial design in 
which the estimates of various effects of inter- 
est are all uncorrelated. However, these are 
available only for special values of the number 
n of treatment combinations and are in general 
uneconomical in the sense that they require a 
large value of n in comparison with the num- 
ber of unknown effects. As a generalization of 
orthogonal fractional factorial designs, I. M. 
Chakravarti (Sankhya, 17 (1956)) introduced 
the concept of balanced fractional factorial 
designs. In these designs, the estimates of 
effects are not always uncorrelated. However, 
the tcovariance matrix of the estimates ts 
invariant under any permutation of factors. 
Balanced fractional factorial designs are flex- 
ible in the number of treatment combinations, 
with the result that more experimental situ- 
ations can be handled. These two kinds of 
fractional factorial designs can be constructed 
by using orthogonal arrays and balanced 
arrays, defined in Section L. All alias relations 
can be derived from the identity relation in 
orthogonal designs. 

In the model (1) for a design, the variance of 
the estimates of estimable linear functions of é 
depends on the matrix X’X which is called the 
information matrix. For eigenvalues of X'X, 
the D-, E-, and A-optimalities of fractional 
factorial designs can be defined similarly to (I), 
(II), and (III) given in Section E. An ortho- 
gonal fractional 2" factorial design is D-, E- 
and A-optimal. A fractional factorial design is 
said to be saturated if the number of treatment 
combinations is equal to that of parameters in 
the model to be estimated. However, since 
saturated designs do not provide an estimate 
of experimental error, their use should, in 
general, be confined to those situations in 
which a prior estimate of experimental error 
is available. The existence of a symmetric 
BIBD(4t — 1,4t — 1,2t —1,2t— Ll, t — 1) implies 
that of an orthogonal saturated fractional 
2“! factorial design of resolution III. 


J. Application of Algebras 


In the theory of design of experiments, the 
ideas of association algebra and relationship 
algebra play an important role. Let A; be a 

v xv symmetric matrix with entries 0 or 1 (i= 
0, L, ..., m). If a set ( 41i —0, 1, ..., m] satisfies 
the conditions 


Ao — IL, 3 A; — Ën 
=0 


and there is a nonnegative integer p;, for every 
i, j, k such that 


m 
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then the A; are called association matrices. In 
this case, there exists a natural number n, for 
every i such that A;E,, — n; E,, holds. If an (o, f) 
component a, of A; is unity, then the treat- 
ments « and f are said to be the ith associates. 
The talgebra . over the real number field 
generated by the matrices Ao, A,, ..., Am is 
called the association algebra. .o/ is commuta- 
tive and tcompletely reducible. A4, A, —(p;,) 
is the tregular representation of «7. There is a 
nonsingular matrix U = (u;;) that transforms 
all Z into diagonal matrices simultaneously: 


UPUT! =diag(Zo;, «.., Zm i-0,1,...,m. 


m 


m —1 
Af = ( 2 2 2. uj A, 
f=0 j 


-0 


i=0,1,...,m, 


are mutually orthogonal idempotent elements 
of £. For example, consider the case where 
Ag =]; ® oe A, = (B s = I.) Gi Is A, e 
I © [E bg I A, = (E, s, e) e (E. s — 1,) 
with m —3, where ® means the *Kronecker 
product. The algebra .» generated by Ao, A,, 
A, A, is called an F, type association algebra. 
The association matrices Áo, A,, 42, and A, 
correspond to the relationships between the 
treatments in a two-way layout. The mutu- 
ally orthogonal idempotent elements in this 
case are AF =s] "Es s, Gs; EL, , AP =U, — 
Si Eo) @ 83 E is Ay zm SP E, e (s, 2» 
s3 EA and AŤ = (L, =s Es.) G9 LU. 
S; ! E, , ). For the factorial experiment with h 
factors (h > 3), association matrices can be 
constructed in a similar way. If h —3, the num- 
ber m of association matrices is 7. Many types 
of association schemes are known, and some 
of them for m —2 are classified as group divis- 
ible, triangular, Latin square, cyclic type, and 
so on. For a group divisible type of v=s; s3, 
Ao — L, Ay =1, ® gelen A2= Ey, — Ag— 
Ay; Ad =v Ew, Af =v"! (s, 1) (Ag + A))— 
A}, Af — 53! ((s5 —1)49 — Aj}. 

An experimental design consists of a set of 
n experimental units called plots. Define a 
relationship R between the plots as a set of 
ordered pairs (i,j) of plots. A relationship R 
among a set of n plots can be expressed as a 
symmetric n x n matrix (rjj) of 0’s and 1’s: 


1 ifthe ith plot ts related to the jth 
r= plot by the relationship R, 
0 otherwise, 


and this matrix is also denoted by R. If there 
are k types of relationships R,,..., R, among n 
plots, the algebra 4? over the real number field 
generated by the matrices R,,..., R, is called 
the relationship algebra of the design. 2 is a 
tsemisimple algebra. 

Example (1). The relationship algebra & of 
the factorial experiment with h=2 and t repli- 
cations is generated by the following matrices 


375 


over the real number field: B, =A, © I, B; = 
Ao Q (E, — L), Bs = A, Q Ey, B, — A; ® En, 
B; = A4 Q E,,, where A;, i=0, 1, 2, 3, are the 
association matrices of F, type. The mutually 
orthogonal idempotents that correspond to 
the two-sided ideal decomposition of 2 are 
BÝ = AŽ ®t 'Em, BŽ = Ař Ot E,BI-— 
Af Gt E, BŽ = Af Gt ! E,, and BÝ = 
Ao & (I, t^! E,). These are the same as the 
projection P, in the two-way layout design 
given in Section H. 

Example (2). Consider a block design with v 
treatments, each having the same number r of 
replications, and with b blocks, each having 
the same size k ( « v). Suppose that association 
matrices A;,i=0, 1,..., m, are given, by which 
the associations among the treatments are 
defined. Let Ai, be the number of blocks to 
which the ith associate treatments a and f are 
applied. The design is called the partially bal- 
anced incomplete block design (PBIBD) if 4j, 
= Ae 2 0 independently of oa and f. When m — 1, 
the design is a BIBD. Let the observation 
vector be represented by (4). The relationship 
algebra 2 of a PBIBD is generated by n x n 
matrices I, Enn, B= P'Y', T 2 DA, i=], 

2, ...,m, where NN' =E oA A= 3o p; Af, 

p; 7 AAA (0€ p, & rl; i=0, 1, ..., m), and po 
—rk-Y" on; Write T;* = Af ®’. Accord- 
ing as p; —rk, 0 « p; « rk, or p; —0, L(T;^) is said 
to be confounded with the blocks, partially 
confounded with the blocks, or orthogonal to 
the blocks. 2 is noncommutative, completely 
reducible, and isomorphic to the algebra of 
matrices of the type shown in Fig. 1. 


Wi 0 
* 
kk 
kk 
*k 
** 
d 
9 * 
Fig. 1 


The analysis of variance of a PBIBD is 
given by a decomposition of I, into mutually 
orthogonal idempotent elements of 2. For a 
PBIBD, the matrix C in (6) is of the form C — 
Eg AŽ, where z—r—k ip For a connected 
PBIBD, all A¥ č, i— 1,2, ...,m, are estimable. 
If there exists a group divisible PBIBD(m = 2) 
with 2, — 4, +1, then it is E- and A-optimal. 


K. Design for Two-Way Elimination of 
Heterogeneity 


Consider a design with v treatments in a u x w 
rectangular block. The row effect and the 
column effect of this block are denoted by y 
and v, respectively. Thus the observation vec- 
tor Y 1s of the form 


Y-FPy-c o£ + Vy - Hv W, (10) 
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where the definitions of F, ®, ¥ and IT are 
similar to those for block designs. Set L = 0'TT, 
M=90'Y, D,=0'O, F-D,.—w !LL'-u 'MM' 
c (uw) ! LE,,L'. The matrix F plays a role 
similar to that of the matrix C in (6). When the 
rank of F equals v — 1, the design is said to be 
connected. If 


F=t(I,—v 'E,,), (11) 


then the design satisfies the optimum criteria 
(I), (ID), and (III) given in Section E. When u = 
w —v and (11) holds, the design is called the 
Latin square. When u —v and (11) holds, the 
design is called the Youden square. When uv 
and (11) holds, the design is called the Shrik- 
hande square. In a Youden square design treat- 
ments are grouped into replications in two 
different ways, i.e., rows and columns, where 
rows constitute a BIBD, whereas columns are 
complete blocks. The existence of a Youden 
square design is equivalent to that of a sym- 
metric BIBD. 

If the associations among v treatments are 
defined in terms of association matrices, the 
partially balanced design for two-way elimi- 
nation of heterogeneity can be defined in a way 
similar to the PBIBD. In this case, the equa- 
tion F—X;«; Af holds, and if = (1,, E,, — I), 
then (11) holds, hence the optimum criteria 
are fulfilled. The definition of the relationship 
algebra 2 of a partially balanced design for 
two-way elimination of heterogeneity is similar 
to that used for the PBIBD. 2 is isomorphic 
to the algebra of matrices shown in Fig. 2. 


Fig. 2 


The analysis of variance of this design is 
given by a decomposition of I, into mutually 
orthogonal idempotent elements of 2. 


L. Balanced Array and Orthogonal Array 


Suppose that T' is an n x h matrix with entries 
from a set A of s (22) distinct elements. Con- 
sider the s' 1 x t matrices X —(x,,X;, Al 
that can be formed by giving different values 
to the x,(e A), i— I, 2, ..., t. Suppose that asso- 
ciated with each 1 x t matrix X there is a non- 
negative integer A(x,, x5, ..., x,) which is invar- 
iant under any permutation of a given set 

(x1, X3, ..., X]. If, for every t-columned sub- 
matrix of T, the s' 1 x t matrices X occur as 
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rows A(X,, X5, ..., Xj) times, then the matrix T 
is called the balanced array (BA) of size n, h 
constraints, s levels, and strength t having the 
index set (4(x,, ..., x,)]. Such an array is de- 
noted by BA (n, h, s, t; n is also called the num- 
ber of treatment combinations. In particular, 
when A(x,, ..., x) 2 A for every x,,...,X,, such 
an array is called an orthogonal array (OA) of 
size n, h constraints, s levels, strength t, and 
index 4, which is denoted by OA (n, h, s, t). 
We have n= 4s'. Balanced arrays have the 
advantage that they can be constructed 
with fewer treatment combinations than 
the orthogonal arrays for given h, s, t para- 
meters. The transpose of the incidence ma- 
trix of a BIBD(v, b, r, k, 4) is, for A = {0, 1}, a 
BA(b, v, 2, 2) with 4(0,0) 2 b — 2r + 4, A(1, 0)= 
A(0, t) - r — 4, and A(t, D) 2 4. There is a close 
relation between the existence of an OA and 
that of a BIBD. For example, the existence 
of an OA (s?, h, s, 2) is equivalent to that of 
h —2 mutually *orthogonal Latin squares of 
order s that is for h=s+ 1 equivalent to the 
existence of a BIBD(s?, s(s 4- 1), s -- 1,5, 1). The 
existence of an OA(4t, 4t — 1,2, 2) is equivalent 
to that of a symmetric BIBD(4t — 1, 4r — 1, 
2t— 1,2t — 1, t — 1) which is also equivalent to 
the existence of an OA(8t, 4t, 2, 3). 

Balanced arrays and orthogonal arrays play 
a vital role in the construction of symmetric 
and asymmetric confounded factorial experi- 
ments and fractional factorial designs. In an 
OA (n, h, s, t), regarding columns and entries as 
h factors and levels of a factor of the column, 
respectively, each row corresponds to a treat- 
ment combination. In this case, this ortho- 
gonal array gives an orthogonal fractional s' 
factorial design of resolution t -- 1 with n treat- 
ment combinations. A necessary and sufficient 
condition for a fractional s" factorial design 
of resolution 2/+ 1 to be orthogonal (resp. 
balanced) is that the design be an orthogonal 
(resp. balanced) array of strength 2l, provided 
that the information matrix of the design is 
nonsingular. 


M. Response Surface 


If all the factors represent quantitative vari- 
ables, such as time, temperature, amount of 
ingredients, etc., it is natural to think of the 
yield or response Y of the experimental results 
as a continuous function of the levels of these 
factors. We can write a functional relationship 


Y, = f (Xia X20 -3 Xna) t Wo €—h2.n 


where Y, represents the «th observation in the 
designed experiment and x;, represents the 
level of the ith factor in the ath observation. 
The function f or the surface defined in (h + 1)- 
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dimensional *Euclidean space by f is called the 
response surface. The residual W, measures the 
experimental error of the ath observation. A 
knowledge of the function f gives a complete 
summary of experimental results and also 
enables us to predict the response for values of 
the x;, that were not tested in the experiment, 
or to determine such combinations of the 
values of variables x;, that give the max- 

imum (or minimum) value of f. When the 
mathematical form of f is not known, the 
function can sometimes be approximated 
satisfactorily within the experimental region, 
which is relatively small, by flexible graduating 
functions, such as polynomials of degrees 1 
and 2 in the variables x;,: 


h 
Y, — Bot 3 B:Xia+ Wo 
i=1 


h h h 
Y,— fo Y, BiXiat Y, Bux 2 BijXiaXja t Wa 
izl i=1 i<j 


The coefficients fo, B,,... are parameters to be 
estimated from the data. Thus, (1) when the 
form of the true f is assumed known, the ob- 
ject is to estimate the parameters; (ii) when the 
form of the true f is unknown, the object is to 
approximate the f by some graduating func- 
tion. Designs appropriate for (i) and (it) are 
called designs for estimating parameters and 
designs for exploring a response surface, respec- 
tively. Some experimental designs that have 
been developed for fitting polynomials of the 
first and second degrees are called first-order 
designs and second-order designs, respectively. 
The problem here is to increase the precision 
of fitting response surfaces by appropriately 
choosing n points in a given experimental 
region A of variable x. That is, how do we find 
the experimental region that interests us in the 
sense of finding optimum conditions and, 
having found it, how do we design experiments 
to map f over the region? As optimum criteria 
of allocations, corresponding to (I) and (II) 
given in Section E, consider the following: 
(IV) the *generalized variance of the estimates 
of coefficients is minimal; (V) the supremum of 
the variance of estimates of the expectation 
E(Y) of response Y is minimal for all variables 
eh For example, when an observation Y is 
given by a polynomial regression of a variable 
x as Y, — flo [i x, baxa +... FB t+ W,, if 
for the *Legendre polynomial P,(x) we allocate 
an experiment replicated at x= +1 and h— 1 
roots of P,(x) -0 the same number of times, 
then it is an optimum design in the sense of 
(IV) and (V). 

As a design for fitting first-order response 
planes, we can use a 2" factorial design (or its 
fraction) of resolution III, since it is sufficient 
to consider only linear effects for each of h 
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factors. In comparing first-order designs, we 
take as an optimum criterion the minimum of 
the average variance of the estimates of the 
coefficients, corresponding to (III) of Section 
E. For a design for fitting second-order re- 
sponse surfaces, we need at least three levels 
of each factor for estimating all effects up to 
quadratic ones; then we can use a 3" factorial 
design (or its fraction) of resolution V for esti- 
mating the linear x linear component of inter- 
actions of two factors. In this case, treatment 
combinations are too large in number, so we 
can adopt central composite designs, which are 
constructed by adding further treatment com- 
binations to those obtained from a 2" factorial 
design (or its fraction). If the coded levels of 
each factor are —1 and +1 in the 2" factorial 
design, the (2h + 1) additional combinations 
are (0,0, ...,0), (- 4,0, ...,0), (0, 4 d,0, ...,0), 
...,(0,...,0, +d). The total number of treat- 
ment combinations to be tested is 2"+ 2h+ 1 
(«3^ for h>3). The value of d can be chosen 
to make the coefficients in the quadratic poly- 
nomials as orthogonal as possible to each 
other or to minimize the bias that 1s created 

if the true form of the response surface is not 
quadratic or to give the design with the prop- 
erty of being rotatable. If Y, is the estimated 
response at (X46, X20» ..., X49)€ S, the response 
surface design is said to be rotatable if and 
only if the variance of f, is a function only 

of the distance p — (x9 + x29 +... + x29)!? of 
(Xios Kane. X49) from the origin, so that the 
variance contours in the experimental region 
of the variables are circles, spheres, or hyper- 
spheres centered at the origin. Now, when Y- 
fot Pixi Baxs t Br 1X7 + B22x3 +2B12X1 X2 + 
W with [x,| € 1 for i=1, 2, an optimum experi- 
ment in the sense of (V) is given by allocating 
0.0960 at the origin, 0.0802 at experimental 
points (1,0), (—1, 0), (0, 1), (0, —1), and 0.1458 
at experimental points (1, 1), (1, —1), (—1, 1), 
(—1, —1), respectively. However, this design is 
not rotatable. Instead of an optimum criterion 
(V), we can consider a region $ which is not 
always equal to the original experimental 
region A and a criterion: (V’) The supremum 
of the variance of estimatesof the expectation 
E(Y) of response Y is minimal for all variables 
xeS. Within x? -- x «c?, an optimum alloca- 
tion depends on the value of c, and it is shown 
that a rotatable design is obtained only if c 

is in a certain range. 

There are many topics in the theory of ex- 
perimental design besides the ones mentioned 
in this article (— [4] for multiple comparison, 
[9] for confounding designs that are factorial 
experiments in which the block size is reduced 
and in which for each block a fractional of all 
the treatment combinations are tested, [10] for 
split-plot designs in which certain main effects 
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are confounded with blocks, [15] for weighing 
designs). 
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A. Definition 


The determinant of an n x n tmatrix A = (aip) 
in a tcommutative ring R is defined to be the 
following element of R: 


3. (sgn Pla, Arp, +++ np,» 


where 


( Ju ah | 
P= 
Pı P2 e Dn 


is a permutation of the numbers 1,2, ...,n, 
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sgn P denotes the sign of the permutation P 
(that is, sgn P=1 if P is an even permutation 
and sgn P= —1 if P is an odd permutation), 
and the summation extends over all n! per- 
mutations of 1,2,...,n. The determinant of A 
is denoted by 


duu 012 din 
Gz, 055 An 
dni 4,2 Ann 


It is written |a;,] or |A| and is also denoted by 
det A. Usually we suppose that R is the field R 
of real numbers or the field C of complex 
numbers, but the following theorems are also 
valid for cases in which R is any commutative 
ring, unless otherwise stated. 


B. Relation to Exterior Algebras 


Consider an fextertor algebra (*Grassmann 
algebra) of a linear space (*free module) of 
dimension n over R with a basis (e;, e», Gel, 
Set 


+ ' 
€; — 0j,€, + dj595 t ... + djs, 


where gece R. Then we have ei ^e? ^ ... ^e, 
—laj|e4^e5^...^e,. Conversely, we can 
define the determinant |a,,| by this relation. 
The properties of determinants can be easily 
deduced from those of exterior algebras. 


C. Fundamental Properties of Determinants 


(1) The determinant of the *transposed matrix 
‘4 of a matrix A is equal to the determinant of 
A. Hence the theorems stated for rows are also 
valid for columns. 

(2) If the elements of one row (column) of a 
matrix are multiplied by a factor c, the deter- 
minant of the matrix is also multiplied by c. If 
the elements of one row (column) of a matrix 
are zero, its determinant 1s equal to zero. 

(3) If from a matrix A =(a;,), we obtain two 
matrices A' and A" by replacing one row, for 
instance the ith row, by den, aj, and by 
dj, t 45, ..., Ain + ain, respectively, then | A"| — 
|A| - | A'|. This relation is equally valid for a 
column. 

(4) If we obtain Ag by a permutation Q on 
the rows of a matrix A, then Aal = (sgn Q)IA |. 
In particular, if two rows (columns) of a ma- 
trix are interchanged, then the determinant 
changes sign. 

(5) The determinant of a matrix is zero if 
two rows (or columns) are identical. 

(6) The determinant of a matrix is not 
changed if the elements of any row (column), 
each multiplied by the same factor, are added 
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to the corresponding elements of another row 
(column). 

(7) Suppose that R has unity element. Let x;, 
(i, k 9 1, ..., n) be n? variables in R, and denote 
a function (having its values in R) of these 
variables by p(X), X =(x,,). Assume that ok) 
has the following properties: (1) if the elements 
of one row of X are multiplied by a factor 4, 
the value of ¢ is also multiplied by 4; (11) if we 
obtain two matrices X' and X" by replacing 
one row of X, for instance the ith row, by 
ais, Xin and by xj Xj, ..., Xin + Xin, respec- 
tively, then o(X")— @(X)4+ o(X’); and (iii) if 
two rows of X are equal, p(X)=0. Then o(X) 
—c|X|for some constant c (in R). 

(8) Suppose now that R is a field K. Assume 
that a function (X) (in K) has the following 
properties: (i) if the elements of one row in 
X are multiplied by 4, the value of ¢ is also 
multiplied by /; and (ii) the value of o is not 
changed if the elements of any row are added 
to the corresponding elements of another row. 
Then ~(X)=c|X| for some constant c (in K). 


D. The Laplace Expansion Theorem 


Let A — (a4) be an n x n matrix. Take r-tuples 
(i4, ...,i,) and (k,,...,k,), where i, and k, be- 
long to {1,...,n} and i, <...<i,,k,<...<k,. 
Let (4,,,, ..., ij) and (k,,,, ..., Kn) be (n—r)- 
tuples such that i,,,<...<i,,k,4;<...<k, and 
E FE dey pe excu ly = Wis pes Mya EE Ay), 
={1,...,m}. Let gu, Am, vu be the deter- 
minant of an r x r matrix whose (p, q)- 
component is the (i,, k,)-component of A 

for each p and q. We call this determinant 

a minor of degree r of the matrix A. (The 
corresponding submatrix of A is sometimes 
also called a minor of A.) In particular, if 
(i,,.--,1,)=(k,,...,,), then it is called a princi- 
pal minor. Furthermore, we define the cofactor 
of the minor au, A1, ku OF A to be 


E (qtu 
du, Auge: kd nr RICE 


where A=i,+...4+i, and w=k,+...4+k,. In 
the particular case r = 1, the cofactor of aj, is 
dj, —(—1)^* A4, where Aj, is the determinant 
of the (n — 1) x (n— 1) matrix obtained from A 
by eliminating its ith row and kth column. For 
simplicity, we abbreviate (i, , ..., i), (k,, ... , k,), 
and (ji, ...,,) as (i), (k), and (j), respectively. 
Then we have 


SENE 
e (0G) 09 0) 0 if (i)#(k), 
ll TH 
Ce (ELC bed O01 08) 0 if (i) (Kk), 


where Zum means that the sum is taken over all 
combinations (j). This is called the Laplace 
expansion theorem. If a matrix A has the form 
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B O0 B x 
A= or A= i 


and B and C are square matrices, then by this 
theorem we have |A| 2|B]||C]. If we number 
the combinations (i) =(i,, ...,i,) and (k) = 

(k,, ..., k,) appropriately (for instance, in lexi- 
cographical order) and regard the numbers 
assigned to them as row numbers and column 
numbers, respectively, to form a matrix (dig) 
then the Laplace theorem can be expressed as 


(aa) (&agu)) (A ay X03) 


|A| 0 


0 IAL 
In the particular case r=1, we have 


IA} ifi=k, 
0 ifizk, 


n 
2 djd— 
Zei 


3 d |A| ifi-k, 
d d: = 
Ate LO iis 


E. Product of Determinants 


Let A =(a;,) and B=(b,,) be two n x n matrices. 
For the product AB— C — (cj), where cj, = 
2, agby (lk 1,..., n), we have |AB|= 
|A||B|. The tinverse matrix A~! exists for an 

n xn matrix A = (a) if and only if |A| 40, and 
then A ^! 2 (b) with elements bj —à,;/| A]. 
Moreover, we have |A ! |2|A|"!. (In the case 
where the elements a; are in the commutative 
ring R with unity element, A ^! exists if and 
only if |A| is a tregular element of R.) 


F. Theorems on Determinants 


(1) Let d, be the cofactor of a; in the deter- 
minant of an n x n matrix A —(a,). Then the 
determinant |d;,| is equal to | A|" '. In general, 
EA 
la, occ yey seek l= IAL A 
" (52) 
CORRET ball : 
(2) The determinant of a submatrix of the 
matrix (d;,), composed of the i,th, ..., ith rows 
and k,th,...,k,th columns of (d;,) is equal to 


|A I! dj, pees dp) (Key sees eg) 


TENET 


Let A 
SES Put 


E the determinant 


of the (n— r) x (n —r) matrix obtained from an 
n xn matrix A by eliminating its i,th, ..., i,th 
rows and k,th,...,k,th columns. Then 


NH WER WI 


i<j, k<l. 
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(4) Sylvester's theorem. Let bj, (i, k— 1, ..., 
n — r) denote the minor a em, et 
of an n x n matrix A=(a,). Then 


G1, -. dq, 
EE) 








a ve ad 


(5) Let A be an n x m matrix and Banmxn 
matrix. Then AB is an n x n matrix. If n>m, 
then |AB| =0. If n m, let (i) 2 (i, ...,i,) (i, « ... 
« i,) be a combination of 1, 2, ..., m, taken n at 
a time. Let Au, be the n x n matrix composed 
of the ith, ..., i,th columns of A, and B,, the 
n x n matrix composed of the i,th, ..., i,th rows 
of B. Then |AB| 2 34A, | Bj], where the 
summation extends over all possible combi- 
nations (i). 

(6) Determinant of a *Kronecker product. If 
A is an m x m matrix and B is an n x n matrix, 
then |A © B|=|A|"|B|”. 

(7) Let H be ann x n *Hermitian matrix, and 
let H, denote the matrix composed of its first k 
rows and columns. Then H is positive definite 
if and only if | H,| 0 for all k — 1, ...,n. 


G. Special Determinants 


(1) A determinant of the form 


| 1 
1 X5 Xn 
2 2 2 
n=l n-1 n-—i 
X1 X2 Xn 


is called a Vandermonde determinant. It is 
equal to the *simplest alternating function 
IL (x; =X): 


(2) A cyclic determinant is one of the form 


Xo An X3 7c Xai 
Xn-1 Xo An c7 Xa-2 


hl 
el Leen EE ag oh x 
i=0 


where ¢ is a *primitive nth root of unity. 

(3) Consider the vectors &; = ln, ges, Gin) 
(i— 1,2, ..., n), and let (x;, &;) denote the tinner 
product of o; and a. Then the following deter- 
minant is called the Gramian of these vectors: 


(%1.%1) (01, %) ETH 
(2;,0,) (25,05) (x5, 04) 
(Xn X) Lëns 0:5) (o, Xn) 
gun di; cc yy |? 
82i: "ae e cod 


ui an2 Uo 40 


nn 
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(4) For an talternating matrix (namely, a 
square matrix X such that 'X — — X), we have 
the identity 


0 X12 Xiz 7 Xin 
— X2 0 X233 °° Xan 
—Xi13  — X53 0 Uto Xan 
— Xin Xon Kan d 
Ex ifn is even, 
0 if n is odd, 


where P,(..., xij, ...) is a polynomial of vari- 
ables x,,, which (equipped with appropriate 
sign) is called the Pfaffian of these variables. 
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A. General Remarks 


Let y be a function of a real variable x defined 
on an interval J and let Ax be a fixed quantity. 
When two points x and x + Ax are in I, we 
define the difference Ay(x) of y at x by Ay(x)— 
y(x + Ax) — y(x) and the difference quotient 

by Ay(x)/Ax; Ax is called the difference of x. 
Without loss of generality, we can take Ax = 
1, for otherwise there is a constant b such 

that Ax’=1 for the new independent vari- 
able bx =x’. If Ax — 1, the second difference 
A?y(x) 2 A(Ay(x)) is given by 


A? y(x)= Ay(x + 1) - Ay(x) 
= y(x + 2)—2y(x + D^ y(x). 
Similarly, the difference of the nth order is 


defined by A"y(x)=A(A" ! y(x)), and 


WE ph SS y(x+k). 
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Conversely, y(x +n) is expressed by differences 
as 


STE UI Aty(x) 


k=0 


(— 223 Interpolation). 


B. Summation 


Given a function g(x) and Ax, a function y(x) 
that satisfies Ay(x)/Ax = g(x) is called a sum 
of g(x). Summation of g(x) is to find a sum of 
g(x). Given a sum y(x) of g(x), an indefinite 
sum of g(x), written as Sg(x)Ax, is given by 
Sg(x)Ax = y(x)+ c(x), where c(x) is an arbitrary 
periodic function of period Ax. In many cases, 
c(x), which corresponds to an arbitrary con- 
stant in an indefinite integral, is omitted. For 
example, a sum of g(x) « nx" ! for Ax=1 is the 
nth-order tBernoull: polynomial B,(x) for 
nz: 0; a sum of x`! is W(x), given by y(x)— 
dlogI (x)/dx (— 174 Gamma Function). 

When the series — Ax £go g(x + kAx) or 
Ax È, g(x — kAx) converges, both can be 
sums of g(x). Since the requirement of conver- 
gence for these series was found to be too 
strict, the following requirement was given by 
N. E. Nórlund instead: Let x be a real variable 
and g(x) be continuous for x 2 b. Define A(x) 
by A(x) 2 x"(log x)? (p> 1,9 2:0). Then if for a 
positive 


oc 


F(x, Ax, n- | g(z)e "^? dz 


a 


—Ax bi g(x + kAx)e etka) 
k=0 


is convergent for a b, F satisfies AF (x, Ax, 
n)/Ax — g(x)exp( —94(x)). Accordingly, if 
F(x, Ax, n) approaches a limit F(x) as 40, 
F(x) is a solution of AF(x)/Ax = g(x). F(x) 

is called the principal solution of AF(x)/Ax = 
g(x). 


C. Difference Equations 


Let Ax = 1. An equation F(x, y(x), Ay(x), 
...,A”y(x)) =0 in x and differences of an un- 
known function y(x) is called a difference 
equation. If the substitution y= q(x) satisfies 
the equation for x in some interval, (x) is a 
solution of the equation. Because of the rela- 
tion between y(x), y(x+ 1), ..., y(x +n) and the 
differences of y at x, we can transform the 
given difference equation in the form G(x, y(x), 
y(x + 1),...,¥(x+n))=9. This form appears 
more often in applications and is called the 
standard form of a difference equation. 
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If the equation is linear in y(x), y(x + 1), 
.., Y(x 4 n), namely, if it is given by 


pí(x)y(x t i) 7 q(x), 


itas 


i 


the difference equation is said to be linear. 
When q(x)=0, it is homogeneous; otherwise, it 
is inhomogeneous (or nonhomogeneous). 


D. Linear Difference Equations 


Assume that po(x), ..., p,(x) are single-valued 
analytic functions without poles and common 
zeros in some domain. Consider the linear 
difference equation 


n B(9)yGx- i) - 0. (1) 


If p(X), P(x), ..., n(x) are solutions of (1), 
then a linear combination og, Glo, (x) 4 
a»(x)5(x) - ... + a (x), (x) with arbitrary 
periodic functions a, (x), a;(x), ..., a, (x) of 
period 1 is also a solution of (1). 

Let £i, B,,... be singular points of p,(x), 
DG... p, (X), %1 %2,... be the zeros of po(x), 
and y,, y;, ... be the zeros of p,(x +n). Then the 
set of singular points of the linear difference 
equation (1) is the set {a;, fj;, y;}. 

A function o,,(x) is said to be linearly de- 
pendent on the functions o: LGL ox)... 

Q4 -, (X) with respect to the difference equa- 
tion (1) if o, (x) 2 a; (x) 91 69 + as(x) p» (x) + 
c day a (X)0, 4 (x), where a,(x), a;(x), ..., 
a4, (X) are functions of period 1, every one 
of which takes a nonzero finite value at least 
at one point not congruent (mod Z) to any of 
the singular points, where Z is the additive 
group of integers. 

A set of m functions is called linearly inde- 
pendent if none of the functions is dependent 
on the other m — 1 functions. When a set of n 
solutions of equation (1) is linearly indepen- 
dent, it is a fundamental system for (1). Any 
solution of (1) can be expressed as a linear 
combination of n solutions of a fundamental 
system. 

The determinant 


- PAX) 


p(x) z 
. OAX+ 1) 


p2(x+ 1) 


p(x) 
pı(x+1) 


go (x+n—1) pa(x+n-1) ... o,(xt+n—-1) 


formed from n functions o, GL @2(x), ..., o, (x) 
is called Casorati’s determinant and is denoted 
by Dia, (x), 95(x), ..., 9, (x)). A necessary and 
sufficient condition for a given set of n func- 
tions to be independent is that Casorati's 
determinant be nonzero at every point except 
those which are congruent to singular points 
of (1). Casorati's determinant 1s used to deter- 
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mine whether a given set of solutions is 
fundamental. 

Let y(x) be a solution of a nonhomogeneous 
linear difference equation 


P9) Y pGdyG D - 02. Q) 


If o,(x), Q5(x), ..., 9, (x) are n linearly inde- 
pendent solutions of (1), then an arbitrary 
solution of (2) is given by 


Y= 4, (x) Q1 (x) + a3099;(x) +... 
+ a), x) + Vx), 


where a(x), ..., a,(x) are arbitrary periodic 
functions of period 1. Then the expression for 
y is called a general solution of (2). If we ab- 
breviate Casorati's determinant of a funda- 
mental system of solutions o: (GL o, (x), ..., 

9, (x) of (1) by D(x) and write p(x) as the quo- 
tient of the cofactor of g(x +n) of D(x + 1) by 
D(x + 1), we have 


iG) È SONA 


assuming that the summation S on the right- 
hand side is known. This is the analog of 
Lagrange’s tmethod of variation of constants 
in the theory of linear ordinary differential 
equations. 


E. Linear Difference Equations with Constant 
Coefficients 


If all the coefficients in 


piy(x t i) — 0, po #0, p, 7-0, (3) 


it 


i 
are constants, n linearly independent solutions 
are obtained easily. Indeed, if A is a root of the 
algebraic equation 17. , p;4' — 0, 4* is a solu- 
tion of (3). This algebraic equation is called 
the characteristic equation of (3). If it has n 
distinct roots 4,, 45, .... An, then AT, 45, ..., AX 
are n linearly independent solutions. In gen- 
eral, if A is an m-tuple root of the characteristic 
equation, then 4*, x4*, ..., x "^12* are solu- 
tions of (3). Accordingly, if A; is a root of multi- 
plicity m; (25, m;— nj — 1,2, ..., s) then 47, 
XA, ..., X71 A7 (j— 1, ..., s) constitute a set of 
n linearly independent solutions. 

Even if all the p; are real, the characteristic 
equation may have complex roots. In such a 
case real solutions are obtained as follows: 
When 4 — u+ iv is a root of multiplicity m, 7 = 
LH — iv is also a root of the same multiplicity. 

If we write p — J/ u? -- v?, tan o gin, then 

p* cos qx, p*sin ox, XP* cos PX, xp*sin MX, ..., 
x" 1g*cos ox, x" !p*sin ox are 2m indepen- 
dent real solutions. 

Nonhomogeneous equations with con- 
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stant coefficients can be generally solved by 
Lagrange's method with these solutions. How- 
ever, when the nonhomogeneous term has a 
special form such as 


È pint +i) asi, 


where p(x) is a polynomial in x and 4 is a root 
of multiplicity m of the characteristic equation, 
we can use the method of undetermined coeffi- 
cients. In this particular case, the substitution 
(Aot A x ... + A,xP)x"4* with undetermined 
coefficients Ao, A,,..., A; gives solutions, k 
being the degree of p(x). 


F. Difference and Differential Equations 


The differential operator d/dx acts on the 
family of functions (x"|m-—0, +1,...} accord- 
ing to dx"/dx — mx", just as the difference 
operator A acts on the family (x? =I (x+ 
D/T(x ^m -- 1)|m=0, +1,...} according to 
Ax 2 mx'"7D. Hence by using the factorial 
series > a,,x'” and its similarity to the power 
series 3a, x", we may obtain some analogies 
with the theory of differential equations. For 
example, the Frobenius method in the theory 
of tregular singular points can be applied to 
the system of difference equations 


(z- DA wiels 3 ay (2)w((2), 


k=1,2,...,n. 


However, there are certain essential differences 
between functions defined as solutions of 
differential and difference equations. For ex- 
ample, Hölder’s theorem states that no solu- 
tion of the simple difference equation y(x+ 1) 
— y(x) 2 x ! satisfies any talgebraic differential 
equation. Consequently, the gamma function, 
which is related to a solution of the equation 

V (x) 2 dlogI (x)/dx, cannot be a solution of 
any algebraic differential equation. For the 
numerical solution of ordinary differential 
equations by difference equation approxima- 
tion — 303 Numerical Solution of Ordinary 
Differential Equations. 


G. Geometric Difference Equations 


For an arbitrary complex number q, an equa- 
tion of the form y(qx) — f(x, y(x)) is called a 
geometric difference equation. For example, the 
ordinary difference equation (1) can be trans- 
formed into 


pz) U(zq^) = B(z) (1’) 


x 
Lis 


by the change of variable z 2 q*. Although it is 
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possible to transform an equation of the type 
(1) into that of the type (1), there are theories 
developed specifically for the type (UL since 
the coefficients of the equation may become 
more complicated by such a transformation. 
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A. General Remarks 


The rudimentary concept of n-dimensional 
manifolds can already be seen in J. Lagrange's 
dynamics. In the middle of the 19th century n- 
dimensional *Euclidean space was known as a 
continuum of n real parameters (A. Cayley; H. 
Grassmann, 1844, 1861; L. Schláfli, 1852). The 
notion of general n-dimensional manifolds was 
introduced by B. Riemann as a result of his 
differential geometric observations (1854). He 
considered an n-dimensional manifold to be a 
set formed by a 1-parameter family of (n— 1)- 
dimensional manifolds, just as a surface is 
formed by the motion of a curve. Analytical 
studies of topological structures of manifolds 
and their local properties were initiated and 
developed by Riemann, E. Betti, H. Poincaré, 
and others. To avoid the difficulties and dis- 
advantages of analytical methods, Poincaré 
restricted his consideration to those topo- 
logical spaces X that are *connected, ttri- 
angulable, and such that each point of X 
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has a neighborhood homeomorphic to an n- 
dimensional Euclidean space. We often refer to 
such spaces as Poincaré manifolds; Poincaré 
called them n-dimensional manifolds. In 1936, 
H. Whitney published a monumental paper 
[14] on differentiable manifolds in which the 
various fundamental concepts on differentiable 
manifolds were established. This and sub- 
sequent papers written by Whitney during 
nearly twenty years greatly influenced the 
rapid advance of the theory of differentiable 
manifolds since 1950. 


B. Topological Manifolds 


An n-dimensional topological manifold M is by 
definition a tHausdorff space in which each 
point p has a neighborhood U(p) homeo- 
morphic to an open set of R”. 

Let M' be a Hausdorff space in which 
each point p of M' has a neighborhood U(p) 
homeomorphic to an open set of H", where H" 
is the half-space ((x,, x2, ..., x,JeR"|x, 20]. 
Let 6M’ denote the set consisting of points p 
of M' such that p corresponds to a point of 
Hg = { (X1; ..., x,)e H"| x, 20] c H” under the 
homeomorphism from U(p) to an open set of 
H". M' is called an n-dimensional topological 
manifold with boundary if CA" z Ø, and 0M' is 
called the boundary of M'. On the other hand, 
M defined as above or Mi with 0M' = @ is 
called an n-dimensional topological manifold 
without boundary. The interior of M' is the 
complement Mo = M' — 0M' of the boundary. 
The boundary of an n-dimensional topological 
manifold is an (n — 1)-dimensional topological 
manifold. A topological manifold without 
boundary is called closed or open according as 
it is compact or has no connected component 
which is compact. There exist connected topo- 
logical manifolds that are not *paracompact; 
among them, the 1-dimensional ones are called 
long lines. A connected paracompact topolog- 
ical manifold M has a tcountable open base 
and is tmetrizable. 


C. Local Coordinates 


Let M be an n-dimensional topological mani- 
fold. A pair (U, y) consisting of an open set U 
of M and a homeomorphism y of U onto an 
open set of R" is called a coordinate neighbor- 
hood of M. If we denote by (x!(p), ..., x"(p)) 
(pe U) the coordinates of the point w(p) of R”, 
then x!, x?, ..., x" are real-valued continuous 
functions defined on U. We call these n func- 
tions the local coordinate system in the coordi- 
nate neighborhood (U, y) and the n real num- 
bers x! (p), ..., x"(p) the local coordinates of the 
point pe U (with respect to (U, di 
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A set S= ((U,, Wa) }ac4 of coordinate neigh- 
borhoods is called an atlas of M if {U,} 
forms an topen covering of M. 


acA 


D. Differentiable Manifolds 


Let S 2 ((U,, Ya) }aca be an atlas of an n- 
dimensional topological manifold M. For each 
pair of coordinate neighborhoods (U,, w,) and 
(Us, V) in S such that U,NU,# Ø, V; o V, | is 
a homeomorphism of the open set y,(U, N Up) 
of R” onto the open set d, N Up) of R”. Let 
x —(x!, ..., let, N Up). Then we can 

write (js oV, =A), .... fg). If the 

n real-valued functions fj, ..., f, defined in 
w,(U, Up) are of tclass C" (1 <r < oo) (resp. 
treal analytic) for any a, B in A such that 

U, N U, # Ø, then we call S an atlas of class C” 
(resp. C?) of M. When an n-dimensional topo- 
logical manifold M has an atlas S of class 

C (1&r«o), we call the pair (M, S) an n- 
dimensional differentiable manifold of class C’ 
(or C'-manifold). A C?-manifold is also called 
a smooth manifold, while a C?-manifold is 
called a real analytic manifold. We call M the 
underlying topological space of (M, S), and 

we say that S defines a differentiable structure 
of class C” (or C’-structure) in M. 

In particular, a C?-structure is called a real 
analytic structure. A C'-manifold whose under- 
lying topological space is compact (tparacom- 
pact) is called a compact (paracompact) C’- 
manifold. A coordinate neighborhood (U, y) of 
M is called a coordinate neighborhood of class 
C" of (M, S) if the union SU {(U, y/)] is also an 
atlas of class C’ of M. In particular, each co- 
ordinate neighborhood of M belonging to S is 
of class C”. The set $ of all coordinate neigh- 
borhoods of class C" of (M,S) is an atlas of M 
containing S, and we call $ the maximal atlas 
containing S. Let S and AS be two atlases of 
class C of M. If $2 S, then we say that S and 
S’ define the same differentiable structure of 
class C” on M and that the differentiable mani- 
folds (M, S) and (M, S’) of class C" are equiva- 
lent. In particular, (M, S) and (M, $) are equiv- 
alent C'-manifolds. Let S and S’ be atlases 
of class C” and class C^, respectively, where 
l<r<s<wo. Since sr, we can consider S’ 
an atlas of class C". If S and S' define the same 
C’-structure in M, then we say that the C*- 
structure defined by S’ is subordinate to the 
C"-structure defined by S. If M is paracom- 
pact, then there exists a C?-structure subordi- 
nate to a C'-structure of M (Whitney [14]). 


E. Differentiable Manifolds with Boundaries 


Let U and U' be open sets in the half-space H", 
and let o: U— U' be a continuous mapping. If 
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there exist open sets W and W' in R" contain- 
ing U and U', respectively, and a mapping 
V:WW:' of class C” that extends o. we call o 
a mapping of U into U' of class C'. Let M be 
a Hausdorff topological space. A structure of 
a C’-manifold on M is defined by a set S = 
{(U,, Wl a where {U,}ac4 is an open covering 
of M, and, for each a, y, is a homeomorphism 
of U, onto an open set of H" such that for 

any a, Be A with U,'1U, z Ø, Wow, isa 
mapping of class C” from V,(U, 1 U;) onto 
V5(U, N Up). Let 0M denote the set consisting 
of points p of M such that pe U, and j,(p)e Hë = 
{(x1,...,X,)€H"|x,=0} for some «e A. If 

OM z Ø, the pair (M, S) is called an n-dimen- 
sional differentiable manifold with boundary of 
class C' (or C'-manifold with boundary), and 
ôM is called the boundary of M. OM forms an 
(n — 1)-dimensional C’-manifold. If we put 

U, 2 U,T16M and denote the restriction of y, 
to U; by Ya, then S’ = ((U7, Ya) }aca is an atlas 
of class C” of 0M. If ôM is empty, then (M, S) 
is a C’-manifold. In this sense a C’-manifold 

is sometimes called a C'-manifold without 
boundary. 


F. Orientation of a Manifold 


Let S 2 ((U,, Ya) }aca be an atlas of class C" in 
M, and for each « let {x}, ..., x?) be the local 
coordinate system in a coordinate neighbor- 
hood (U,, W,). If U, and U, intersect, then there 
exist n real-valued functions F! (i— 1, ...,n) 
defined on V,(U, 1 Up) such that xj(p) = 
F'(x1(p)...,xz(p)) for pe U,N Ug and i= 

l, ...,n. The Jacobian D,, — D(F!, ..., F")/ 
D(x}, ..., x?) is different from zero at each 
point (x7, ..., Xa) of V,(U,f1 Up). If we can 
choose an atlas S of M so that, for any a, P 
such that U, N U, is nonempty, the Jacobian 
D, is always positive, then we say that the 
C’-manifold M is orientable, and we call S an 
oriented atlas. 

Let S={(U,, Ya) aca and S' = { (Vz, 9;)] ;cA 
be two oriented atlases of a connected C’- 
manifold M. If M is connected, then the sign of 
the Jacobian D,,(p) of the transformation of 
local coordinates is independent of the choice 

_of ae A, 4e ^, and peU,NV,. We say that S 
and S’ define the same (opposite) orientation if 
D,, is always positive (negative). Hence if M is 
connected, the set of all oriented atlases of 
class C’ is composed of two subsets such that 
atlases belonging to one of them have the 
same orientation, while two atlases belonging 
to different ones have the opposite orientation. 
Each of these subsets is called an orientation 
of the connected C’-manifold M. When we 
assign to M one of two possible orientations, 
M is called an oriented manifold; the assigned 
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orientation is called its positive orientation 
and the other its negative orientation. If S = 
{(U,,W,) taca belongs to the positive orien- 
tation, S and (U,, w,) are called an atlas and 
local coordinate system, respectively, compa- 
tible with the positive orientation. 


G. Differentiable Functions 


Let f be a real-valued function defined in a 
neighborhood of a point p of a C?-manifold 
M. Let (U, y) be a coordinate neighborhood of 
class C* such that pe U. If the function foy ! 
is of class C’ (1 <r € oo) in a neighborhood of 
the point v (p) in R”, then the function f is 
called a function of class C" at p. This defi- 
nition is independent of the choice of a coor- 
dinate neighborhood of class C”. If we de- 
note the local coordinate system in (U, y) by 
(x, ..., x"), there exists a function f(x!, ..., x") 
of n variables defined in a neighborhood of 
V(p) in R" such that f(q) f(x! (q), ..., x"(q)) for 
each point q in the neighborhood of p. Here 
we use the same symbol f for the function f 
defined in a neighborhood of p in M and for 
the function f o y ^! defined in the image of the 
neighborhood by y in R”. The function f is 

of class C" at p if and only if f(x!, ..., x") is 

of class C” in a neighborhood of the point 

(x! (p), ..., x"(p)) of R”. A function of class C" 
(or C’-function) in M is a real-valued function 
in M that is of class C” at every point of M. 


H. Tangent Vectors 


Let M be a C?-manifold, and let (M) be the 
real vector space consisting of all C*-functions 
in M. (For the sake of simplicity, we denote a 
manifold (M, S) by M.) A tangent vector L at a 
point p of M isa linear mapping L: §ğ(M)>R 
such that L( fg) ^ LCf)g(p) - f(p.(g) for any f 
and g in (M). For any two tangent vectors 
L,, L, and any pair of real numbers /,, 4; we 
define 4,L,+4,L, by (A L; +4, L (f les 
ALi (P)+A2La(f), f EFM). 

Thus tangent vectors at p form a real vector 
space T,, which we call the tangent vector 
space (or simply the tangent space) of M at 
the point p. The dimension of the tangent 
vector space T, equals the dimension of M. 
The set of all tangent vectors of M forms a 
tvector bundle over the base space M, called 
the tangent vector bundle (or tangent bundle) 
of M. 

By a tangent r-frame (r <n) at p we mean an 
ordered set of r linearly independent tangent 
vectors at p. The set of all tangent r-frames 
also forms a fiber bundle over M called the 
tangent r-frame bundle (or bundle of tangent 
r-frames) (— 147 Fiber Bundles F). 
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I. Differentials of Functions 


For a C?-function f in M and a point p of M 
we can define a linear mapping df,: T, 5 R by 
df, (L) - L(f) for all Le T,, and we call df, the 
differential of f at p. The totality of differen- 
tials at p of C?-functions in M forms the *dual 
vector space of the tangent vector space T,. 


J. Differentiable Mappings 


Let o be a continuous mapping of a C®- 
manifold M into a C?-manifold M'. We call o 
a differentiable mapping of class C’ (or simply 
a C'-mapping) (1 <r < oo) if the function fo 
is of class C" for any C'-function f on M’. If 
ọ is a homeomorphism of M onto M' and o 
and o | are both of class C”, then we call g a 
diffeomorphism of class C". If there exists a 
diffeomorphism of class C^ of a C?-manifold 
M onto a C?-manifold M’, then M and M’ are 
said to be diffeomorphic. 

Let M and M’ be C?-manifolds and o be 
a C?-mapping of M into M’. For a tangent 
vector L of M at p, a tangent vector L' of M' 
at o(p) is defined by L'(g) - L(go o), ge (M). 
The mapping L— L' defines a linear mapping 
(dq), of the tangent vector space T, of M at p 
into the tangent vector space T,(,, of M' at 
o(p). The linear mapping (dq), is called the 
differential of the differentiable mapping ¢ at p. 
If (de), is surjective, p is called a regular point 
of o A point on M which is not a regular 
point is called a critical point of o. A point q 
on M' which is an image of a critical point is 
called a critical value of o, and a point on M’ 
which is not a critical value is called a regular 
value. In R", the diffeomorphic image of a set 
of Lebesgue measure zero has Lebesgue mea- 
sure zero. So the set of Lebesgue measure 
zero is well defined on a (paracompact) C^- 
manifold. Then Sard's theorem states: Let 
Q9: M ^ M' bea C?-mapping; then the set of 
critical values of ọ has Lebesgue measure zero 
in M’. 


K. Immersions and Embeddings 


Let M and M’ be C?-manifolds and ọ be a 
C?-mapping of M into M’. If (do), is injective 
at every point p of M, then ¢ is called an im- 
mersion of M into M'. If o is an immersion, 
then for some neighborhood U, of any point 
p of M the restriction ọ |U, gives rise to a 
homeomorphism from U, into M'. If an im- 
mersion o is injective, then o is called an 
embedding (or an imbedding) of M into M'. An 
alternative definition is often used, which says 
that o is an embedding if, in addition to the 
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above conditions, o gives a homeomorphism 
from M onto @(M), where o(M) has the rela- 
tive topology of M'. If the former definition is 
adopted as embedding, then a mapping o 
satisfying the conditions of the alternative 
definition is sometimes referred to as a regular 
embedding. If M is compact, the two defini- 
tions coincide. In Sections L and M, embed- 
ding always means regular embeddings. 

The theory of embeddings and immersions 
is mainly concerned with ways to embed and 
immerse a given manifold M into a manifold 
M' of a particular type with lowest possible 
dimension. M' is usually the Euclidean space 
R”, the projective space DIR or a certain stan- 
dard manifold. The theory was initiated by 
Whitney (1936). He proved by "general posi- 
tion” argument that an n-dimensional C?- 
manifold M with countable basis can always 
be immersed in the 2n-dimensional Euclidean 
space and can always be embedded in the 
(2n 4- 1)-dimensional Euclidean space as a 
closed set (Whitney's theorem). 


L. Submanifolds 


A C?-manifold M is said to be a submanifold 
of a C?-manifold M’ if M is a subset of M’ 
and the identity mapping of M into M' is an 
immersion. If the identity mapping of M into 
M' is an embedding, then M is called a regular 
submanifold of M'. A regular submanifold M 
of M’ is called a closed submanifold if M is a 
closed subset of M'. 

Let o be a C?-mapping from M into M’ 
and M" be a submanifold of M’. Then for each 
qc M" the tangent space T; of M” at qisa 
linear subspace of the tangent space 7; of M’ 
at q. Denote by z, the projection of quotient 
vector space onto T/T. A C*-mapping ¢ is 
called transverse to M" if for each peo ! (M") 
the composite 1,,,,0d9,: T, > T,(,/ Toim is 
surjective. If @ is transverse to M" then 
o ^! (M") is a submanifold of M. For any C®- 
mapping o: M — M' and any submanifold M” 
c M', we can find a C?-mapping 9': M M' 
which is transverse to M" and arbitrarily close 
to ọ (transversality theorem). Let M, and M, 
be submanifolds of M'. Then we say M, in- 
tersects transversely to M, if the inclusion 
M, &M' is transverse to M;. 

A C?-mapping is called a submersion if it 
has no critical point. Let g be a submersion 
from M into M'. Then for each point qe M', 

o !(q)isa regular submanifold of M, and M is 
covered by a mutually disjoint family of sub- 
manifolds: M =( J emp *(q). 

Let M be a submanifold of an n-dimensional 
Euclidean space R". We can identify the tan- 
gent vector space T, of M at p with the geom- 
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etric tangent space of M at p in the Eucli- 
dean space R”. A vector in Rn that is ortho- 
gonal to the tangent vector space T, of M at p 
is called a normal vector to M at p. The set of 
all vectors normal to M forms a vector bundle 
over M, which we call the normal vector bundle 
(or normal bundle) of M. If M is compact, then 
the totality of vectors normal to M whose 
length is <e (where e is a sufficiently small 
positive real number) forms a neighborhood 
N(M) of M in R" which we call a tubular 
neighborhood of M. Int N(M) is called an open 
tubular neighborhood. 


M. Vector Fields 


Let N bea subset of a C?-manifold M. By a 
vector field on N we mean a mapping X that 
assigns to each point p of N a tangent vector 
X, of M at p. We can consider X as a tcross 
section over N of the tangent vector bundle of 
M. Let X be a vector field in M, and let f bea 
C^-function in M. Then we can define a func- 
tion Xf in M by (Xf)(p — X,f. We call X a 
vector field of class C" if the function Xf is 

of class C” for any C?-function f in M. Let 
(xl, ..., x") be the local coordinate system in 

a coordinate neighborhood (U, y), and let 
(0/0x*),f —(0f/0x')(p) for pe U and fe (M). 
Then 0/Ox' (i— 1, ..., n) are vector fields in U, 
and the (0/0x'), form a basis of T, at every 
point pe U. A vector field X in U is written 
uniquely as X, — Y, £'(p)(0/0x"), at each point 
peU. Then ët, ..., £" are real-valued func- 
tions defined in U, called the components of X 
with respect to the local coordinate system 
(x!, ..., x"). A vector field X in M is of class C" 
if and only if its components CT with respect to 
each coordinate system are functions of class 
C" (0 € r < oo). Let (x+, ..., x") be another local 
coordinate system in a neighborhood U of 

p, and let (£3, ..., Em be the components of 

X with respect to (Xt, ..., x"). Then we have 
E'(q) = X(0x'/0x)(q)£/(q) at each point qe U. 

For the rest of this article we mean by a 
vector field in M a vector field of class C”, and 
we denote by X(M) the set of all vector fields 
in M. Then X(M) is an %(M)-tmodule, where 
(M) denotes the algebra of all C?-functions 
in M. In fact, for f, ge (M) and X, Ye X(M), 
we can define a vector field fX +gY by (fX 
tgY),— f(p) X, g(p) Y, and this defines an 
S (M)-module structure in X(M). 

In a coordinate neighborhood (U, y), we can 
write X = 3;6'(0/0x'). The right-hand side of 
this equation is sometimes called the symbol of 
the vector field X. A vector field X can also be 
interpreted as a linear differential operator 
that acts on (AL 

Let X and Y be vector fields in M. Then 
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there exists a unique vector field Z in M such 
that Zf — X(Yf)— Y(Xf) for any C?-function f 
in M. We denote Z by [ X, Y ] and call it the 
Poisson bracket (or simply bracket) of X and 
Y. If EI and ai denote the components of X 
and Y, respectively, in a coordinate neighbor- 
hood (U, y), then the components £' of [. X, Y] 
are given by C — Y, (8 (0n'/0x*) — n*(0£/0x^)j. 
The bracket of vector fields has the following 
properties: (i) [X, Y ]f 2 X(Yf) — Y(Xf), i) 
[/X,gY]— fo[X, Y] - f(Xg)Y — q(Yf) X, (iii) 
[X + Y,Z]=[X,Z]+LY, Z], (iv) [X, Y]= 
—{Y, X], and (v) [[X, Y], Z] - [LY. Z], X] + 
[LLZ. X], Y ] 20 (Jacobi identity). These iden- 
tities show that X(M) is a Lie algebra (— 248 
Lie Algebras) over R. 

If ọ is a diffeomorphism of M onto Mi. 
then for any vector field X in M we can define 
a vector field o, X in M' by the condition 
(9, X), — do, (X,), p — plq). Then p, is an iso- 
morphism of the Lie algebra X(M) onto the 
Lie algebra ¥(M’). 


N. Vector Fields and One-Parameter Groups 
of Transformations 


A one-parameter group of transformations of 
M is a family 9, (te R) of diffeomorphisms 
satisfying the following two conditions: (i) the 
mapping of R x M into M defined by (t, p) 
q,(p) is of class C^; and (ii) 9,0 o, = 9,,, for 
S,teR. 

Let p, be a one-parameter group of trans- 
formations of M. Then we can define a vector 
field X by X, f —lim,. (Go) T'SI), 
where pe M and f e (M). The vector field X 
thus defined is called the infinitesimal trans- 
formation of o, We also say that q, is gen- 
erated by X, and sometimes we denote q, by 
the symbol exptX. In this case, if (x+, ..., x") is 
a local coordinate system, then at each point p 
of the coordinate neighborhood, we have X p^ 
Xi(dx'(@,(p))/dt),=0(0/0x'),. 

If M is compact, then every vector field in M 
is the infinitesimal transformation of a one- 
parameter group of transformations; that is, 
every vector field generates a one-parameter 
group of transformations. For M noncompact, 
this is not always true. Nevertheless, for each 
vector field X we have the following result 
concerning local properties of X: For each 
point p of M, there exist a neighborhood U of 
p, a positive real number e, and a family (|t| 
« £) of mappings of U into M satisfying the 
following three conditions. (1) The mapping 
of (—&,£) x U into M defined by (t, q) 5 o,(q) 
is of class C”, and for each fixed t, ¢, is a dif- 
feomorphism of U onto an open set ¢,(U) 
of M. (2) If isl, |t|, and |s4- t| are all smaller 
than c and q and q,(q) both belong to U, then 
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Ps(P:(9)) = 9,..(4). (3) X (q)f — Dm, al feig) 

— f(q))/t for qe U and f e (M). We call o, the 
local one-parameter group of local transfor- 
mations around p generated by X. 

Let X and Y be vector fields in M, and let 
y, be the local one-parameter group of local 
transformations around p generated by X. 
Then [X, Y ], — lim, el Fa — ((9), Y)p)/t, where 
(ol, Y is a vector field defined as follows: Let 
U be a neighborhood of the point p where q, 
(|t| « &) is defined. Then (9), Y is the vector 
field on o,(U) that is the image of Y under the 
diffeomorphism o, In particular, if X gener- 
ates a one-parameter group of transforma- 
tions of M, then we have [X, Y]—- lim, .4(Y — 
(ol, Y )/t for any vector field Y in M. 


O. Tensor Fields 


Let T7 (p) be the vector space consisting of all 
r-times contravariant and s-times covariant 
tensors over the tangent vector space T,ofa 
C*-manifold M, that is, 


'o»-(é,e(é1:). 


where T;* denotes the dual linear space of T, 
(— 256 Linear Spaces). A tensor field (more 
precisely, contravariant of order r and covariant 
of order s, or simply a tensor field of type (r, s)) 
on a subset N of M is a mapping K that as- 
signs to each point p of N an element K, of the 
vector space T; (p). In particular, if r=s=0, K 
is a real-valued function on N, and we call K a 
scalar field. If r 2 1 and s=0, K is a vector 
field, called a contravariant vector field. When 
r=0 and s— 1, we call K a covariant vector 
field (or differential form of degree 1). If r #0, 
s=0 or r=0, s40, we call K a contravariant 
tensor field of order r or a covariant tensor 

field of order s, respectively. A contravar- 

iant or covariant tensor field K is said to be 
symmetric (alternating) if K, is a symmetric 
(alternating) tensor at every point p of M. 

Let (x, ..., x") be the local coordinate system 
in a coordinate neighborhood (U, y). Then at 
each point p of U, the (0/0x!), (i—1, ..., n) form 
a basis of the tangent vector space T,, the 
differentials (dx), (i— 1, ..., n) form a basis of 
the dual space T*, and these bases are dual to 
each other. A tensor field K of type (r, s) de- 
fined on M is written at any point p of U in 
the following form: 


K=} Ki75(0)(0/0x')), ® ... 6 (0/0x^), 
G (dxi), @ ... & (dx*),. 


The functions Kj defined in U are called 
the components of the tensor field K of type 
(r,s) with respect to the local coordinate sys- 
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tem (sf, ..., x"). If Kj'7* are the components 
of K with respect to the local coordinate sys- 
tem (x!, ..., x") in another coordinate neigh- 
borhood (U", /) such that UN U’ # @, then for 
each qe U N U’, the following relations hold: 


Katia 2 Y p, (0x^/0x*:),...(0x*/0x"9), 
x (Ox :/0x71), .. (Ox »Jgx^) Kris 


A tensor field K in M is called a tensor field of 
class Œ (0 € t < oo) if the components are func- 
tions of class C' for any coordinate neighbor- 
hood of M. 

The sum K + L and the tensor product 
K ® L of two tensor fields K and L are defined 
by the rules (K + L), — K,-- L, and (K &® L), 

— K, ® L, respectively. The contraction of 
two tensor fields is also defined by taking the 
contraction pointwise. 

Let ọ be a diffeomorphism of M into M’. 
Then the differential (dọ), is an isomorphism 
of T, onto T, (p= (q)) for each qe M and 
hence induces an isomorphism @, of the vector 
space T7(q) onto the vector space 7; (p) (— 256 
Linear Spaces). For any tensor field K in M we 
can define a tensor field K in M’ by (K), = 
(Kı), P= 9(q), qe M. Then (K  L)- GK + 
PL, PIK © L)=@K G @L, and the mapping 
@ commutes with contraction. 

Let K be a tensor field and X be a vector 
field (both of class C^) in M. We define a 
tensor field Ly K by (Ly K), —lim, ,,(K,— 
(@,K),)/t, where q, denotes the local one- 
parameter group of local transformations 
around p generated by X. We call L,K the 
Lie derivative of K with respect to the vector 
field X. The operator Ly: K  L,K has the 
following six properties: (i) Ly(K + K) = 
LyK+L,K’; (ii) Ly(K © K)-(LyK)G K' 
K @L,K’; (iit) the operator Ly commutes 
with contraction; (iv) Ly f= Xf for a scalar 
field f and Ly Y=[X, Y] for a vector field Y; 
(v) Lyx, y= LyLy~LyLy, that is, Lo nK = 
Ly(Ly K) — Ly(Ly K); and (vi) K is invariant 
under p, Le, @ K =K for all t, if and only if 
LyK=0. 

Let K be a covariant tensor field of order r 
in M. We always assume that K is of class C®. 
The value K, of K at pe M is an element of the 
vector space T,* ® ... © T,* (r times tensor 
product of 7;*); hence we may consider K, an 
r-linear mapping of T, into R (— 256 Linear 
Spaces). If X,,..., X, are vector fields in M, 
we define a C?-function K(X,,..., X,) by 
K(X;,..., X)(p) - K,((X1),, ....(X,)5). Then 
the mapping that assigns to each r-tuple 
(X, ..., X,) of vector fields the C?-function 
K(X,,...,X,) is an r-linear mapping on the 
W(M)-module X(M) consisting of all vector 
fields of class C? in M into (My, that is, 
K(X,, ..., fX; gY, ..., X) - fK(X,, ..., Xj, 
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o, X) gK(X,, ..., Kb, X) (= L, ...,r) for 
f. g€ (M). Conversely any r-linear map- 
ping of the §(M)-module X(M) into (M) 
can be interpreted as a covariant tensor field 
of order r in M. If the tensor field K is sym- 
metric (alternating), the corresponding r- 
linear mapping K(X,,..., X,) is symmetric 
(alternating) with respect to X,,..., X,. For 
the Lie derivative Ly K of a covariant tensor 
field of order r of K, we have the following 
formula: (Ly K)(X,,..., X,) 2 X(K(X,,..., X,)) 
—Di-, K(X1,..., LX, X], .... X,). 


P. Riemannian Metrics 


A symmetric covariant tensor field g of order 
2 and of class C? in M is called a pseudo- 
Riemannian metric if the symmetric bilinear 
form g, on the tangent vector space T, is non- 
degenerate at each point pe M; and g is called 
a Riemannian metric if g, is positive definite 
for all p. If g is a Riemannian metric, the 
length ||L|| of a tangent vector Le T, is defined 
by "LIZ 2 g,(L, L). On a paracompact C®- 
manifold there always exists a Riemannian 
metric. A pair consisting of a differentiable 
manifold and a Riemannian metric on it 

is called a Riemannian manifold (— 364 
Riemannian Manifolds). 


Q. Differential Forms 


An alternating covariant tensor field in M of 
order r and of class C' (0 € t € oo) is also called 
a differential form (or exterior differential 
form) of degree r. A differential form of degree 
1 is sometimes called a Pfaffian form. Let œ be 
a differential form of degree r. Since each alter- 
nating covariant tensor of order r at a point p 
is an element of A'T;*, the r-fold texterior 
product of T*, the form œ is a mapping that 
sends each point p of M to an element œ, of 
/\'T,*. We can also regard o as an alternating 
r-linear mapping of X(M) into Ẹ(M). Let 
(xl, ..., x") be the local coordinate system in a 
local coordinate neighborhood (U, y). Since 
(dx*), (i — 1, ..., n) is a basis of T;* at each point 
p of U, we can express œ, (pe U) uniquely in 
the form 
o= Y, a, (p) (dx), ^... ^ (dx9),, 

iQ €... xi, 
where the sum extends over all ordered r- 
tuples (i,,...,i,) of indices such that 1 <i, <i, 
<...<i,<n. For an ordered r-tuple (i,, ...,i,) 
of indices with repeated indices we put a; . , =0, 
and for (i,, ..., i,) with r distinct indices we put 
ai i =(sgno)a; j, where (jj, .... lU 
<j,) is a permutation of (i,,...,i,) and sgno 
denotes the sign of the permutation o:i, j, 
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(k=1,...,r). Then we can write 


1 n 
OT yb 


WEE 


a; (p) (dx), Aw A (dx*^),. 


The functions dg, are the components of the 
tensor field c, and w is of class C' if these 
components arc of class C' for any coordinate 
neighborhood. By the support (or carrier) of a 
differential form œw we mean the closure of the 
subset of M consisting of all p such that c, 40. 

In the rest of this article, differential forms 
are always of class C”, and we denote by 
$'(M) the real vector space consisting of all 
differential forms of degree r and of class C*. 
In particular, D?(M) 2 &(M) and $'(M)- {0} 
for r»n, n- dim M. 

For differential forms we have the following 
five important operations. 

(1) Exterior product. Let c and 0 be dif- 
ferential forms of degree r and s, respectively. 
The exterior product w ^ 0 of œw and 0 is the 
differential form of degree r -- s defined by 
(o ^0), — 0, ^0,, pe M. Let X,,..., X,., be 
r+s vector fields in M. Then we have 
(c ^ OR, Xp45) 


=) sgn(i;j)o(X; , iie , X;)0(X; , RR Xj). 


Js 


where the summation runs over all possible 
partitions of (1,2, ...,r-- s) such that i, <i, < 
...«i, andj, <j,<...<j,, and sgn(i;j) means 
the sign of the permutation (1,2, ...,r + s) 
(Ulf, Lal, In particular, if w,,...,@, 
are differential forms of degree 1, then we have 
Io: ^... ^o,)(X, ..., X) = det(w,(X;)). 

(2) Exterior differentiation. Let œ be a dif- 
ferential form of degree r, and let c =(1/r!) 
Xa, dx" ^... adx" in a coordinate neigh- 
borhood (U, y). Then we can define a differen- 
tial form do of degree r+ 1 by the condition 
dc — (1/r!) dee ^ dx! ^... ^ dx" in U. The 
differential form do is called the exterior de- 
rivative (or exterior differential) of a. A dif- 
ferential form œ satisfying the condition deu — 0 
is called a closed differential form, and a dif- 
ferential form y that can be expressed as y= 
dw for some w is called an exact differential 
form. If a, be R and o, o € (M), then we 
have d(aw + bo») 2 ado) + bdw’. Therefore the 
set €’(M) of all closed differential forms of 
degree r and the set €'(M) of all exact differ- 
ential forms of degree r are linear subspaces 
of $'(M). 

For the exterior derivative dc, we have the 
following formula: 


(do)(X,, ..., X,) 


rtl 
== 2: (—1Y? Xi(o(X,, ses Xi, wit Xa) 


+ Y CD o(DX, X3] X, 


UIS OPNS NEN. m 
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where the variables under the sign ^ are to be 
omitted. 

(3) Interior product with vector field. Let 
o be a differential form of degree r and X a 
vector field. When r 2 1, we can define a dif- 
ferential form i(X)o of degree r — 1 by the 
formula (1(X)@)(X,,...,X,-,)=@(X, X,, ..., 
X, .,) for any r— 1 vector fields X,,..., X, ,; if 
r —0, we put i(X)co —0. The differential form 
1(X)q is called the interior product of œ with 
X. 

(4) Lie derivative. The Lie derivative L,« of 
a differential form of degree r with respect to 
a vector field X is a differential form of the 
same degree. For any r vector fields X,,..., X, 
we have, by definition, (Lyc) (X,,..., X) = 
X(o0(X,, ..., X) - Liz, @(X,,..., LX, Xi], 

.., X,). 

(5) Let o be a C?-mapping of M into M’, 
and let ož: Täpp T,* be the *transpose (or 
'dual) of the linear mapping (do ),: T, > Typ) 
for pe M, i.e., the mapping defined by the 
condition ((de),L, x) — (L, ofa) for each Le T,, 
xE Tjj. We denote the linear mapping of 
A Totp into AT% induced by ož by 5 also. Let 
w be a differential form of degree r in M. Then 
a differential form o*« in M, the pullback by 
9 of œ, is defined by (o*o), = GF gp), pe M. 

The operations defined previously satisfy 
the following six important relations: (i) d? — 

0, that is, d(dw) =0; (ii) d(w ^ 0) 2 doo ^ 0 + 
(—1Y o ^ d0, where œ is of degree r; (iii) 

ole A Hl ien ^ 9*0, o*(do)-— d(o*o), (iv) 
Llw A 0)=(Ly@) A04+@ ^(Ly0); (v) Ly = 
(X): d - d: i(X), Ly(do) 2 d(L,cy; and (vi) 
Lix,y= Lx: Ly - Ly Ly, (LX, Y 2 Ly (Y) 
(Y) Ly. 


R. De Rham Cohomology 


Let D(M)= 37.9 1$' (M), where n=dim M. 
Then D(M) is a tcochain complex with 
*coboundary operator d. We denote by H'(1) 
the r-dimensional cohomology group of 

this cochain complex, and we call it the r- 
dimensional de Rham cohomology group of the 
differentiable manifold M. If we denote by 
C(M) and €'(M) the subspaces of D"(M) con- 
sisting of closed differential forms and exact 
differential forms, respectively, then H’(D) 

— C'(M)/&'(M) (0x r x n) by definition. If 

c €€'(M) and 0€ €'(M), then w ^ 0eC'*"(M), 
and if oe € (M) and 0€ €'(M) (or oe E'(M) 
and 0€ €" (M)), then w a Ge E'*"(M). So if we 
put H(2)—27., H'($) (direct sum), we can 
define a product in H(D) by [co]: [0] — [c ^ 0] 
for each [w]e H'(D), [0] e H"(D). With respect 
to this product, H(D) forms an algebra over R 
called the de Rham cohomology ring of M. 
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S. Partitions of Unity 


Let M be a paracompact C?-manifold, and let 
(El be a "locally finite open covering of M 
such that the closure of V, is compact for each 
index i. Then there exists a C?-function f, in 
M for each i satisfying the following three 
conditions: (1) 0 « f; € 1, (ii) the support of f; is 
contained in V, and (iti) 3;., f(x) =1 for every 
x€ M. The family of C?-functions f;, ie J, is 
called a partition of unity of class C? subordi- 
nate to the open covering {V;} <7. 


T. Integrals of Differential Forms 


Integrals over an Oriented Manifold. Let M be 
an n-dimensional paracompact oriented C®- 
manifold and o a differential form of degree n 
in M with compact support. We can choose a 
positively oriented atlas S = ((U,, Wa) łasa such 
that (U,),., is a locally finite covering; U, is 
compact for each a. Suppose first that the 
support of w is contained in U, for some index 
a. Then (V, Te, is a differential form of de- 
gree n in w,(U,), and we can express (y, ')*w, 
in the form adx! ^ ... a dx", where (x!, ..., x") 
are the coordinates in R” such that xi 2 x'oy, 
(i— 1, ..., n) give a local coordinate system 
compatible with the orientation of M and a is 
a C?-function with compact support. Then we 
define the integral of w over M by 


| o-[ adx! ...dx". 
M WalUa) 


For the general case let TC be a partition 
of unity of class C” subordinate to (U,],. ,. 
Then the support of f œ is contained in U,, 
and except for a finite number of the indices o, 
Lo vanishes identically. Therefore we may 
define the integral of œ over M by 


[rz] 


and we can show that this definition of the 
integral is independent of the choice of or- 
iented atlas S and of a partition of unity sub- 
ordinate to S. 


Integrals over a Singular Chain. We fix rectan- 
gular coordinates in R”. Let d, be the origin 
and d; be the unit point on the ith coordinate 
axis. Let S’ denote the oriented r-simplex 

(do, d,, ..., dj) with vertices do, d,, ..., d,. When 
we regard AT as a point set, we denote it by 
[$"|. An oriented singular r-simplex of class C^ 
in M is, by definition, a pair (S’, 9) consisting 
of AT and a C*-mapping ¢ of an open neigh- 
borhood of |S’| into M. An element of the free 
Z-module generated by singular r-simplexes of 
class C” is called an integral singular r-chain 
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of class C” in M. We define a real singular r- 
chain of class C? analogously. Let œw be a 
differential form of degree r and (S’, ol be an 
oriented singular r-simplex of class C? in M. 
Then q*o is a differential form of degree r ina 
neighborhood of |S"|, and we can express p*œ 
in the form p*w=adx! ^ dx? a... a dx". We 
define the integral of c over (S’, 9) by 


| o- adx! ...dx', 
(S'. p) IS” 


and the integral of œ over a singular r-chain C 
of class C” by 


f oz 3. m; | C, 
C i KEN 


where C — Xmj;(S', qj), m;e Z (or m;e R). 
When r=0, then o is a function in M, and 
S? is a point o. In this case we put fcœ = 
HEH 

Let C,(S, Z) (C,(S, R)) be the Z-module (vec- 
tor space over R) of integral (real) singular 
r-chains of class C? in M, and let w(C) be 
the value of the integral of œ over a chain C. 
Then o is a linear function in the vector space 
C,(S, R), and hence we can consider o a sin- 
gular r-cochain of class C*. 


U. Stokes's Formulas 


(1) Let D be a *domatn in an n-dimensional 
C*?-manifold M, and let ôD and D be the 
boundary and the closure of D, respectively. 
Let S 2 ((U,, Al 4 be an atlas of class C^ of 
M, U! = U,N D, wW be the restriction of y, to 
LU. and T — ((U;, WO, 4. If the pair (D, T) isa 
C*-manifold with boundary under a suitable 
choice of S, then the domain D is called a 
domain with regular (or smooth) boundary. 
The boundary 0D of (D, T) is then an (n— 1)- 
dimensional closed submanifold of M, and if 
M is orientable, 0D is also orientable. Now let 
M be a paracompact and oriented manifold 
and D be a domain with regular boundary. 
Let C be a characteristic function of D in M, 
i.e., a function defined by the condition C(p)= 
1 for peD and C(p)=0 for péD. Let 0 bea 
differential form of degree n in M with com- 
pact support. We define the integral of 0 over 
D by 


[es 


Let w be a differential form of degree n— 1 in 
M with compact support. We then have 
Stokes's formula: 


| da -Í i*w, 
D CD 


where i denotes the identity mapping of the 
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submanifold 0D into M with ôD having the 
orientation induced naturally from that of M. 
(2) Let C be a singular r-chain of class C? in 
M, and let ðC be the boundary of C. Then for 
any differential form o of degree r — 1, we have 


| ào- | o 
C ac 


This formula is also called Stokes’s formula. 


V. De Rham’s Theorem 


Let M be a connected paracompact C®- 
manifold. If we consider c as a singular co- 
chain, we have (do)(C) 2 o(0C); by Stokes's 
formula, this means that the exterior differen- 
tial dw of w is equal to the coboundary of the 
singular cochain c. Let œw and C be a closed 
differential form of degree r and a singular r- 
cycle of class C^, respectively, and let [œ] and 
[C] be the de Rham cohomology class and the 
singular homology class represented by w and 
C. Using Stokes's formula, we can define the 
inner product (ol, [C]) by 


testen- Q. 


Through this inner product, it follows that 
the de Rham cohomology group H’(D) is 
isomorphic to the rth singular cohomology 
group H'(M, R), the dual space of the rth 
homology group of the complex of real sin- 
gular chains of class C^. Moreover, the de 
Rham cohomology ring H(D) is isomorphic to 
the singular cohomology ring H*(M, R) (de 
Rham’s theorem). 


W. Divergence of a Vector Field 


Let M be an n-dimensional oriented C*- 
manifold, and let $ be an oriented atlas. Let 

o be a differential form of degree n, and let 
(xl, ..., x") be the local coordinate system in a 
coordinate neighborhood in S. Then we can 
express o in the coordinate neighborhood 
uniquely in the form co —adx! a... ^ dx". If the 
function a is positive for any coordinate neigh- 
borhood in S, we call w a volume element of 
M. In a paracompact oriented manifold, there 
always exists a volume element. (We remark 
that an n-dimensional differentiable manifold 
M is orientable if and only if there exists an 
everywhere nonvanishing differential form of 
degree n.) Let f be a C*?-function in M with 
compact support. Then f-« is a differential 
form of degree n with compact support, and so 
the integral 


ME 


391 


is defined. We call this integral the integral of 
the function f with respect to the volume ele- 
ment œ. 

Let g be a Riemannian metric in M and g; 
the components of g with respect to the local 
coordinate system (x!, ..., x") as before. Then 
we can define a volume element c in M by 
putting œ= JG dx! ^ ... ^ dx", G=det(g,;) in 
each coordinate neighborhood. The volume 
element thus defined is called the volume ele- 
ment associated with the Riemannian metric g. 

Let c be a volume element and X a vector 
field in M. Then the Lie derivative Lyc is also 
a differential form of degree n, and we can 
express Lyw in the form Lyco = fy: o, where fy 
is a scalar field, i.e., a function in M. We call fy 
the divergence of the vector field X with re- 
spect to the volume element c and denote it 
by div X. If c is associated with a Riemannian 
metric, then div X is called the divergence of X 
with respect to the Riemannian metric. 

If M is compact, we have 


| div X-w=0 
M 


for any vector field X. This result is known as 
Green’s theorem. 


X. Jets 


Let M and N be C?-manifolds. We define 
an equivalence relation in the set of all C?- 
mappings of M into N. Let f and g be such 
mappings and p be a point of M. Choosing 
local coordinate systems, we write f(p) — 
O, s f. 09). (p) — (930). -+ 9,0). x — 
(x4, ..., x4). We say f and g are equivalent at 
p if f (p), g(p), and the values at p of all the 
partial derivatives of f; and g; up to the order r 
(r an integer, r 20) are equal (i— 1, ..., n). An 
equivalence class with respect to this equiva- 
lence relation is called a jet of order r at p. A jet 
of order r at p represented by a function f is 
denoted by j; f, and the points p and f(p) are 
called the source and the target of the jet j} f, 
respectively. We denote by J5(M, N) the set of 
all jets of order r with source at p and target in 
N and let PO. N) - UJ, 44 J; (M, N). For any 
jet j, let z (j) and z,(j) denote the source and 
the target of j, respectively. We can introduce 
the structure of a C?-manifold in J'(M, N) 
in a natural way such that the projections 
1,:J'(M, N) M and OM. N)—>N are both 
of class C” and J'(M, N) is a fiber bundle over 
M(N) with projection z,(z,). As examples, we 
have: 

(1) JE (R, N) is identified with the tangent 
vector bundle of N. 

(2) The set J'(£) of all jets of order r deter- 
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mined by the sections of a vector bundle € of 
class C? is also a vector bundle of class C^. 

Let f: MON and g: NL be C?-mappings. 
We define a composition of jets by Tue ` 
j=j (go f). A jetj;f e J'(M, N) is inver- 
tible if there exists a mapping a N M such 
that ua "ZS zs fl), where 1,, denotes the 
identity mapping of M onto itself. We de- 
note by I’(M, N) the set of all invertible jets 
in J'(M, N), and put I (M, N)  I'(M, NI) 

J (M, N). 

(3) Let G" (n) be the set of all invertible jets 
in F(R”, R”) whose source and target are the 
origin of R”. Then with respect to the compo- 
sition of jets, G'(n) forms a Lie group that is an 
textension of G!(n) - GL (n, R) by a simply 
connected nilpotent Lie group. The projec- 
tion G'(n)2 G! (n) is a special example of the 
natural projection J'(M, N) J*(M, N) (rz s), 
which is defined in general. 

(4) We can identify I; (R", M) (m — dim M) 
with the tangent m-frame bundle over M. 
More generally, I; (R", M) is a G'(m)-bundle 
over M. 


Y. Pseudogroup Structure 


Let X be a topological space, and let I be a 
set consisting of homeomorphisms f: U, Vp, 
where Up, V; are open subsets of X. We call l'a 
pseudogroup of topological transformations if 
F satisfies the following four conditions: (i) F 
contains the identity mapping of X onto X; (ii) 
if f €T, then the restriction of f onto any open 
subset U of U, is also contained in T; (iii) if f 
and g are in I and V; C U,, then go f is con- 
tained in T; and (iv) if f €T, then f? : V, U, is 
also in I. 

Following the definition of differentiable 
manifolds we define a pseudogroup structure of 
M (or, more precisely, a T -structure of M) asa 
set A of bijections, with each member « defined 
on a subset U, of M onto an open set V, of X, 
satisfying the following three conditions: (i) 
| Ja U= M; (ii) if x, Be A, then xol! eT, 
where the domain of definition of xo f * is 
BLU. "Det and (iii) A is the maximal set of 
bijections that satisfies conditions (i) and (ii). 
We introduce in M the weakest (coarsest) 
topology such that every bijection « is a 
homeomorphism. If I" is a pseudogroup of X 
containing F and A’ isa T'-structure of M 
such that Ac A’, then we say that A’ is sub- 
ordinate to A. If X =R" (or H”, a half-space of 
R”), F is the totality of diffeomorphisms of 
class C of open sets of X onto open sets of 
X, and M is a space with Hausdorff topol- 
ogy, then the I-structure is the C’-structure 
with or without boundary which we have 
already defined. We give three examples of T- 
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structures subordinate to I’, where I" is the 
totality of local transformations of class C" 
(rz 1) in R”. 

(1) When n is even, we identify R" with C"? 
and denote the totality of holomorphic trans- 
formations of connected open domains by I. 
The I’-structure in this case is called a complex 
structure. 

(2) When n is odd, we define F as the totality 
of transformations of connected open domains 
in R" that leave invariant a Pfaffian form 
Xf, x'dx"*!  dx?n*! (n=2m-+ 1) up to scalar 
factors. The I -structure in this case is called a 
contact structure. 

(3) We consider R” =R? x R"? and define F 
as the set of all diffeomorphisms U — V (where 
U, V are open in R") such that each set of the 
form U N(R? x ( y]) is mapped onto a set of the 
form VM(R? x ( y']). The I -structure in this 
case is called a foliated structure. 

The problem of determining whether there 
exists a I'-structure for given I and M involves 
widely ranging problems of topology and 
analysis. The classification of I with reason- 
able conditions is another important open 
problem. 

Haefliger has constructed the classifying 
space BT for I'-structures. 


Z. Infinite-Dimensional Manifolds 


Let B and B' be Banach spaces and o be a 
mapping from an open set of B to B'. Then, 
using the notion of Fréchet derivatives, we can 
define o to be smooth. For a smooth mapping 
Q, the Jacobian J(q) at each point x is a linear 
mapping from B to B', for which the inverse 
function theorem holds true as a straightfor- 
ward extension of the corresponding one in 
the finite-dimensional case. Similar extension 
also holds for the existence and the unique- 
ness theorems of solution of ordinary differen- 
tial equations with value in B. These facts 
permit us to generalize the notion of differ- 
entiable manifolds to infinite-dimensional 
ones. Actually, infinite-dimensional manifolds 
can be defined in a way similar to the finite- 
dimensional case, taking open sets of a certain 
Banach space as local coordinate neighbor- 
hoods. Such a manifold is called a Banach 
manifold. Various formal definitions of dif- 
ferentiable manifolds can also be stated for 
Banach manifolds in extended form. However, 
while differentiable manifolds are locally com- 
pact and admit a partition of unity by smooth 
functions subordinate to a locally finite open 
covering, Banach manifolds generally lack 
these properties. Actually, local compactness 
gives a criterion for whether a manifold is 
finite-dimensional or not. Banach manifolds 
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often provide a basic functional-analytic view 
to nonlinear analysis and global analysis. 

The most important category of Banach 
manifolds is provided by the Hilbert manifold, 
that is, a Banach manifold whose local coordi- 
nates are modeled on a Hilbert space. For a 
Hilbert manifold, a partition of unity subordi- 
nate to a locally finite open covering can be 
taken from smooth functions. In the follow- 
ing, we refer to a separable Hilbert manifold 
simply as a Hilbert manifold. An infinite- 
dimensional Hilbert manifold M can be 
smoothly embedded as an open set of a Hil- 
bert space and thus covered by a single coordi- 
nate neighborhood. Hence, in particular, the 
tangent bundle of M is trivial. Historically, 
this fact was the first instance showing that 
the distinguishing properties are shared by 
infinite-dimensional manifolds; this was first 
recognized as a consequence of the theorem 
stating that the unitary group of an infinite- 
dimensional Hilbert space is contractible. Two 
infinite-dimensional Hilbert manifolds are 
diffeomorphic if and only if they are homo- 
topically equivalent. A typical example of a 
Hilbert manifold is provided by the space of 
L?-loops on a compact Riemannian manifold. 
Morse theory can be extended in a suitable 
way to a Riemannian Hilbert manifold under 
the Palais-Smale condition, which makes it 
possible for the integral curve of grad f to tend 
to a critical point, where f is a Morse function. 


AA. GePfand-Fuks Cohomology 


The space X(M) consisting of all the smooth 
vector fields on a smooth manifold M has the 
structure of a Lie algebra under the bracket 
operation [X, Y]=X Y— YX, where the vector 
fields X and Y are regarded as derivations on 
the algebra C”(M) of smooth functions on M. 
X(M) is a topological Lie algebra when en- 
dowed with the topology defined by uniform 
convergence of the components of vector fields 
and all their partial derivatives on each com- 
pact set of M. When X(M) acts continuously 
on a topological vector space V, the continu- 
ous cohomology H*(X(M), V) of X(M) with 
coefficients in V is the cohomology of the 
cochain complex @ (C? = C'(X(M), V), d}. 
Here C? = V, C"(pz 1) is the space of all the 
alternating p-linear continuous mapping ¢ of 
X(M)x ... x X(M)(p-times) into V, and d: C? 
— C?*! is defined for pe C? and X;e X(M) by 
do(X;)— X, 9 (p=0) and do(X,, es Xu) 
El- D o(EX;, Xj]; Xi, ec, E pales 
Aalt Zi( 17 Eet, Xi, -s X p41) 
(pz 1). When V is a topological algebra and 
the elements of X(M) act on V as derivations, 
the exterior multiplication of cochains induces 
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a graded algebra structure in H*(X(M), V). 
When V=R is the trivial X(M)-module, 

then H*(X(M)) 2 H*(X(M),R) is called the 
Gel'fand-Fuks cohomology of M. Gel'fand and 
Fuks proved that, for any compact oriented 
manifold M, we have dim H"(X(M)) +œ 
for all p and H?(X(M))=0 for 1 xpxn(n— 
dim M). For example, if M is the circle S', 
then the algebra H*(X(S!)) is generated by 
two generators «e H?, fie H?, which are expli- 
citly described as cochains in the following 


way: 
d d 
dr dr kl 


d d d EMEN 
irs. d ra= g g g'|dt. 
S1 kK h h" 


f f” 
, n 


dt, 
g 








Localization of the concept of Gel’fand- 
Fuks cohomology naturally yields the co- 
homology of formal vector fields. Here a 
formal vector field means the expression 
Zä, x,)0/0x,, f, being formal power 
series in x,, ..., X,. The set of all the formal 
vector fields forms a Lie algebra a, and the 
continuous cohomology of a, with respect 
to the Krull topology is denoted by H*(a,). 
Let By be the universal tclassifying space of 
the unitary group U (n), let (Bi), be its 2n- 
skeleton, and let P,, be the canonical principal 
U (n)-bundle restricted to (By),,. Then there is 
an algebra isomorphism H*(a,) = H*(P,,; R). 
This cohomology and its variants play im- 
portant roles in the theory of foliations. 

An important subcomplex @ (C5, d} of 
@ (C7, d], the diagonal complex, is defined as 
CkK={peC?|o(X,,...,X,)=0 if supp X, 

... supp X,— Ø}. Here, supp X; denotes 
the support of X;,, that is, {x | X,(x) #0}. Let 
Py be the principal U(n)-bundle associated 
with the complexified tangent bundle of M. 

U (n) acts freely on the product P,, x BD and 
the quotient space Ey = Py x P,,/U(n) isa 
fiber bundle over M with fiber P,,,. Then, if 
M is a compact oriented manifold, the co- 
homology H7 (X(M)) of the diagonal com- 
plex is completely determined by the isomor- 
phisms HZ(X(M)) z H?""(E,,; R) for all p. In 
particular, if all the Pontryagin classes of M 
vanish, then HZ(X(M)) Zenn H(M; R) @ 
H(a,). 

The Gel'fand-Fuks cohomology has a topo- 
logical interpretation: H*(X(M))z H*(I (Ey), 
R) as graded algebras, where T(E) denotes 
the space of all the continuous cross-sections 
of E,,>M with the compact open topology. 
Moreover the differential graded algebra 
C*(X(M)) has a model in the sense of Sullivan 
constructed purely algebraically from a model 
of the de Rham algebra of M and the Pon- 
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tryagin classes of M. This model shows in 
particular that H*(X(M)) is not necessarily 
finitely generated as a graded algebra. 

The cohomology theory of X(M) in the case 
where the representation is nontrivial has also 
been investigated. The natural representa- 
tion on C*(M) is a typical example. There is 
also a topological interpretation: H*(X(M), 
C*?(M))z H*(Y,, RL where Yy is the fiber 
product of the evaluation mapping M x T(E) 
— E, and the inclusion Py G E,, correspond- 
ing to a fiber inclusion U (n) S Pzp. 
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A. First-Order Derivatives 


Let y= f(x) be a real-valued function of x 
defined on an interval I of the real line R. If for 
a fixed x, el, the limit 

i I (Xo +h)—f(Xo) 

im ————————— 
SE h 
exists and is finite, then f is called differenti- 
able at the point x), and the limit is the deriva- 
tive (differential coefficient or differential 
quotient) of f at the point x. If f is differenti- 
able at every point of a set Ac I, then f is said 
to be differentiable on A. The function that 
assigns the derivative of f at x to xe A is called 
the derivative (or derived function) of f(x), 
which is denoted by dy/dx, y', y, df(x)/dx, 
(d/dx)f (x), f (x), or D, f(x). The process of 
determining f' is known as the differentia- 
tion of f. The derivative of f at the point x, is 
written f'(xo), (df/dx)(xo), D, (Xo), [dy/dx ],—.... 
etc. We say that f is right (left) differentiable or 
differentiable on the right (left) if the limit on 
the right, Jm, ,o(f(xo + h) — f(xo))/h (the limit 
on the left, lim, , .9( (xo — h) — f(xo))/h), exists 
and is finite. This limit is called the right (left) 
derivative or derivative on the right (left) and is 
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denoted by D; f(xo) or f4 (xo)(D, f(xo) or 
f. (xo)). For instance, if f is defined on I= 
[a, b), then D, f(a) is identical to D,' f(a). 


B. Differentials 


In the definitions given above, neither dx nor 
dy in dy/dx has a meaning by itself. In the 
following, however, we give a definition of dx 
and dy, using the concept of increment, so that 
dy = f (x)dx. Let Ay denote the increment 

f(x 4- Ax) — f(x) of f corresponding to the in- 
crement Ax of x. Suppose that f(x) is differen- 
tiable at x. We set Ay/Ax = f (x) -- e. Then we 
have Im, ops D. This can be written utiliz- 
ing *Landau's notation as Ay — f'(x)Ax + 
o(|Ax|) (Ax —0); in other words, Ay is the 

sum of two terms, of which the first, f'(x)Ax, is 
proportional to Ax and the second is an tin- 
finitesimal of an order higher than Ax. Here 
the principal part f'(x)Ax of Ay is called the 
differential of y — f(x) and is denoted by dy. 
The differential dy thus defined is a function of 
two independent variables x and Ax. In partic- 
ular, if f(x) 2 x, from the definition we get 

dx —1: Ax — Ax. Hence, in general, we have 

dy — f (x)dx and f (x) — dy/dx. 

With respect to the rectangular coordi- 
nates (x, y), the straight line with slope f'(x,) 
through a point (xo, f(x9)) on the graph of y= 
f(x) is the ttangent line of the graph at the 
point (xo, f(xo)). A function is continuous at a 
point where the function is differentiable, but 
the converse of this proposition does not hold. 
In fact, Weierstrass showed that the function 
defined by the infinite series 3:2. , a" cos b" nx, 
where 0 «a « 1 and b is an odd integer with 
ab > (3/2) 4- 1, is continuous everywhere and 
nowhere differentiable on (—oo, oo) [3]. 


C. Differentiation 


For two differentiable functions f and g de- 
fined on the interval I, the following formulas 
hold: (af + Bg) —af' + Bg’, where x and f are 
constants; ( fg) = f'g + fg’; and (f/g) —(f'g — 
Jo Vo (at every point where g #0). Let y= 

f(x) be a function of x defined on the interval 
(a, b) and x= ¢(t) a function of t defined on 

(a, B). If (t) e(a, b) whenever t e(a, f), then the 
composite function y= F(t)= f(q(t)) 2 (f o o)(t) 
is well defined. Assume further that f and o 
are differentiable on (a, b) and (a, fj), respectively. 
Then the composite function F(t) 2(f o 9)(t) is 
differentiable on (a, f), and we have the chain 
rule, F'(t) — f'(x)o'(t) (x= e(t), or dy/dt = 
(dy/dx)(dx/dt). Assume that a function y= 

f(x) is tstrictly increasing or decreasing and 
differentiable at x. If furthermore we have 
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f (x) 40, then the inverse function x = f! (y) is 
also differentiable at yg (= f(x9)) and satisfies 
(dx/dy),-, (dy/dx),-., = 1. However, if f'(xo)— 
0, then even though f ^! ( y) is not differenti- 


able at yo, lim4, ol (yo + Ay) — f. ' (yg))/Ay 
exists and is +00 or —oo. 


D. Higher-Order Derivatives 


If the derivative f (x) of a function y= f(x) is 
again differentiable on Z, then ( f (x)) = f "(x) is 
well defined as a function of x on J. In general, 
if f "^ (x) is differentiable on I, then f(x) is 
called n-times differentiable on J, and the nth 
derivative (or nth derived function) f(x) of 
f(x) is defined by f" (x) z( f "^" (x))' and is 
also denoted by d" y/dx" or D™ y. The nth 
derivative for nz 2 is called a higher-order 
derivative. 

Concerning the nth derivative of the product 
of two functions, Leibniz's formula holds: 


Uo he (T) mra Eu 


lee, pm 


Analogous to dy 7 y'Ax, which is a function of 
x and Ax, we can define d? y in the notation 

d? y/dx? by d? y « d(dy)  d(y Ax) -(y AxY Ax 

= y'Ax?. Since Ax — dx, it follows from the 
above that d? y = y” dx?. Similarly, d" y = y? dx" 
and is called the nth differential (or differential 
of nth order) of f(x). 


E. The Mean Value Theorem 


Let f(x) be a continuous function defined on 
[a,b], and suppose that for every point x; on 
(a, b) there exists a limit lim, al f(xo +h) — 
J (Xo))/h, which may be infinite. (These condi- 
tions are satisfied if f(x) is differentiable on 
[a, b].) Then there exists a point č such that 
EC um a<é<hb. 

—a 
This proposition is called the mean value 
theorem. A special case of the theorem under 
the further condition that f(a)= f(b) is called 
Rolle’s theorem. If we put b—a=h, £—a-4- 0h, 
then the conclusion of the theorem may be 
written as f(a-- h) - f(a) -- h- f '(a 4- 0h) (0 « 
0<1). 

This theorem implies the following: Let 
f(x) be a function as in the hypothesis of the 
mean value theorem, and assume further that 
A< f (x) € B holds for all x witha<x<b. 
Then A «(f(b) — f(a))/(b — a) « B. (French 
mathematicians sometimes call this the “théo- 
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réme des accroissements finis.") Using the 
mean value theorem, the following theorem 
can be proved: If f(x) is continuous on [a,b] 
and f'(x) exists and is positive on (a, b), then 
f (b) » f(a). Accordingly, if f'(x)>0 at every 
point x of an interval J, then f(x) is tstrictly 
increasing on that interval. (If f'(x) «0 on J, 
then f is strictly decreasing.) The converse of 
the previous statement does not always hold 
(f(x) 2x? is a counterexample, since f (0) — 0). 
Furthermore, from the mean value theorem 
it follows that if f’(x)=0 everywhere in an 
interval, then f(x) is constant on that inter- 
val. Consequently, two functions with the 
same derivative on an interval differ only by 
a constant. 

Suppose that f(x) is n-times differentiable on 
an open interval J. For a fixed ae I and an 
arbitrary xe I, we put 





fto fia. D (x—a)+... 
f" Xa) i 
Em A +R,- 


Then R, — f (5) (x — a'/n! for some é be- 
tween a and x. This is called Taylor's formula, 
where R, is the remainder of the nth order 
given by Lagrange. We also have several other 
forms for R, (= Appendix A, Table 9). If 

f (x) is continuous at x =a, then £a as 
x—a, and accordingly, f (5) f (a). Hence 
f(x) = Ek- f (a)/k))(x — a) + o((x — a)"). If 
f(x) is continuous at x =a, then, by Tay- 
lor's formula, the value of the polynomial 
Yt oCf V (ayk!)(x — a) can be considered an 
approximate value of f(x) for x near a. This 
approximation is called the nth approximation 
of f(x), and its error is given by |R,,,|. By 
applying this formula, it is sometimes possible 
to calculate a limit such as A=lim,.., f(x)/g(x), 
where f(x)—0 and g(x)—0 as xa. For in- 
stance, if f'(x) and g'(x) are both continuous at 
x-a and g'(a) ZO, then by taking the first 
approximations of f(x) and g(x) it is easily 
seen that A — f (a)/g'(a). A limit of this type is 
often called a limit of an indeterminate form 
0/0. Similarly, we can calculate limits of such 
indeterminate forms as 0- oo or 0” (for limits 
of indeterminate forms — [6]). 


F. Partial Derivatives 


Let w= f(x, y,...,z) be a real-valued function of 
n independent real variables x, y, ...,z defined 
on a domain G contained in n-dimensional 
Euclidean space R". We obtain a function of a 
single variable from f by keeping n — 1 vari- 
ables (say, (x, 9, ..., Z), Le., all except y) fixed. If 
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such a function o(y) — f(x,, y, ..., zo) is dif- 
ferentiable, that is, if 


goj- lim UNE 45) 7 eG 
0 An Ay 
= Jim Lese AY, zo) feros Yor 20) 
AycO Ay 


exists and is finite, then f is called partially 
differentiable with respect to y at (Xo, Vo, 

.. ,Z9), and the derivative is called the partial 
derivative (or partial differential coefficient) 
of f(x, y, ..., z) with respect to y at (xo, yo, 

---» Zo). It is denoted by [0w/0y],—.,.. 
(0/0y)f (Xo. Yo. «++» Zo), zo, Yo» ---, Zo) OF 

D, f (Xo, yo. +- zo). etc. We usually assume that 
the point (x, y, ..., z) where partial derivatives 
are considered is an interior point of the 
*domain of the function. Since in a space of 
dimension higher than 1, the tboundary of a 
domain may be complicated, partial deriva- 
tives at boundary points are usually not con- 
sidered. If a function f possesses a partial de- 
rivative with respect to x at every point of an 
open set G, then f, is a function on G and is 
called a partial derivative of f with respect to 
x. The process of determining partial deriva- 
tives of f is called the partial differentiation of 


f. 


ënn 


G. Total Differential 


Let w= f(x, y, ..., z) be a function defined 

on a domain G, and let P=(x, y, ..., z) bean 
interior point of the domain G of a function 
w= f(x, y, ...,z). Put Aw= f(x9 + Ax, yo + 

Ay, ..., Zo +Az)— f(X9, Yos ---, Zo). If there 

exist constants a, f), ..., y such that Aw = 
&Ax + BAy 4... yAz - o(p) (p 0), where 

p= M Ax? Ay? +... - Az?, then f is called 
totally differentiable (or differentiable in the 
sense of Stolz) at P. In this case, f is partially 
differentiable at P with respect to each of the 
variables x, y, ..., z, and «= f,(X9, yo, .... zo). f 
= Jl ro, Yos Zo. Y= ln, Yo» «++» zo). The 
principal part of Aw as p>0 is aAx+ BAy +... 
+ yAz, which is called the total differential of w 
at P. If f is totally differentiable at every point 
of G, then f is said to be totally differentiable 
on G. The total differential of w is denoted by 
dw. Since the total differentials of x, y, ..., z are 
dx — Ax, dy = Ay, ..., dz = Az, respectively, we 
can write dw — f. dx 4 f,dy+...+f,dz; and 

dw is a function of independent variables x, 
y... Z, dx, dy, ..., dz. The total differentiability 
of f implies the continuity of f, whereas the 
partial differentiability of f with respect to 
each variable does not imply that f is continu- 
ous. (Example: Define f(x, y) 2 xy/(x? + y?) 

for (x, y) (0,0), and /(0,0) 20; then the func- 
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tion f is not continuous at (0,0), even though 
both f, and f, exist at (0,0).) The function f is 
totally differentiable on G if allt fp, ..., fz 
exist and are continuous on G, or, more weak- 
ly, if all f, fys ..., fz exist and, with possibly 
one exception, are continuous. Suppose that 

w — f(x, y) is totally differentiable at (x, y), and 
let Ax 2 pcos, Ay— psin0. As p0, for a 
fixed 0 there exists the limit lim, ..(Aw/p) = 

f. Gc y) cos 0 f(x, y) sin 0. This limit is called 
the directional derivative in the direction @ at 
(x, y). The partial derivatives f, and f, are 
special cases of the directional derivative for 
0 —0 and z/2, respectively. Suppose that we 
are given a curve lying in the tinterior of the 
domain of f and that the curve passes through 
the point (x, y), where the curve is differenti- 
able. Then the partial derivative of w= f(x, y) 
in the direction of the normal to the curve at 
(x, y) is called the normal derivative of w at the 
point (x, y) on the curve and is denoted by 
éw/én. Analogous definitions and notations 
have been introduced for functions of more 
than two variables. 

To see the geometric significance of the total 
differentiability of w= f(x, y), we consider the 
graph of the function w= f(x, y) and a point 
(a, b, f(a, b)) on the graph. Then the plane 
represented by w— f(a, b) 2 a(x — a) - B(y — b) is 
the ‘tangent plane to the surface at (a, b, f (a, b)) 
if and only if «= f.(a, b) and B= f,(a, b). The 
existence of f, and f, depends on the choice of 
coordinate axes, while the total differentia- 
bility of f does not. 


H. Higher-Order Partial Derivatives 


Suppose that a partial derivative of a function 
w= f(x, y, ..., z) defined on an open set G again 
admits partial differentiation. The latter partial 
derivative is called a second-order partial 
derivative of f. We can similarly define the nth 
order partial derivatives. Higher-order partial 
derivatives are denoted as follows: 


0 (OÓwN Gin 
( ) eu sss Z), 


Ox \ ax Ox? 


Q (Ow On D , 
—|— |=— = SR né 
QyVóx) axay oi? 


ô ( 0^w w " , 
= Sf sess 
Ox\dxdy] dxdydx "77770777077 


In general, fx, and f,, are not equal. (Peano’s 
example: Let f(x, y) 2 xy(x* — y?)/(x? + y?) for 
(x, y) z (0,0) and f(0,0)=0. Then f= —1, 
f. 1 at (0,0).) However, if both fe, and ie 
are continuous on an open set G', then they 
coincide in G'. Furthermore, if f., fp, and f,, 
exist in a neighborhood U of a point P belong- 
ing to the domain of f and f,, is continuous 
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at P, then f,, exists at P and fa, = fyx (H. A. 
Schwarz). If f, and f, exist in U and are totally 
differentiable at P, then fy, = f,, at P (W. H. 
a Similarly, if the partial derivatives of 

Pd =f yx... Hence we can change thie’ 
order of differentiation if all the derivatives 
concerned are continuous. 


I. Composite Functions of Several Variables 


Let w be a function of x, y, ..., z, and let each 
X, y, ...,Z be a function of t. Suppose that the 
range of (x(t), y(t), ..., z(t)) is contained in the 
domain of w. Then w is a function of t. If fur- 
ther w is totally differentiable and x, y, ...,z 
are all differentiable, then w as a function of t 
is differentiable, and we have 


dw  Owdx OY 
dt dx dt 


e, E dz 
widi ôzdt ` 








If partial derivatives of order >2 are totally 
differentiable, then d? w/dt?, d? w/dt?, ... are 
obtained by repeating the above procedure. A 
similar consideration is valid when x, y, ...,z 
are functions of several variables. 


J. Taylor's Formula for Functions of Several 
Variables 


Suppose that f(x, y) is defined on an open 
domain G, f(x, y) has continuous partial de- 
rivatives of orders up to n, and the line seg- 
ment (a - (x —a)t, b -- (y — b)t) (O € t & 1) is 
contained in the domain G. Then there exists a 
number d (0 «0 « 1) such that 


f(x,y) 
é Ó 

= f(a, UE b) 
x Qy 


1 Ó dv 
ta giu cR ERO f(a, b) * ... 


1 
Faiy 


1 0 bie 
EC rU d 
x f(a-- (x — a)0, b 4- (y — b)0), 


ô ĝ n-l 
EI fa,b) 





where, for instance, the third term ((x — a): 
(6/0x) + (y — b) (0/0 y))" f(a, b) means 

(x — ay (0^ f/0x?)(a, b) + 2(x — a)(y — b): 

(0? f/Gx0y)(a, b) - (y — b) (0^ f/0y?)(a, b), 

with (0?f/Ox?)(a, b), (0^ f/OxO y)(a, b), and 

(0? f/Oy?)(a, b) denoting the values of (02f/0x?), 
(0^ f/Ox y), and (0? f/0 y?) at (a, b), respectively. 
The displayed formula is called Taylor's for- 
mula for a function of two variables. A similar 
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formula is valid for a function of n variables 
(n z 3). As in the case of functions of one vari- 
able, we can derive approximation formulas 
for f from Taylor's formula. 


K. Classes of Functions 


If all the partial derivatives of order n of f(P) 
are continuous on an open set G, then f is said 
to be a function of class C" (or n-times contin- 
uously differentiable) on G. The set of all n- 
times continuously differentiable functions is 
denoted by C" (n— 1,2, ...). A continuous 
function is of class C°. A function of class C! is 
also called a smooth function. It is obvious 
that CÓ 3 C! 3C?2 .... A partial derivative of 
order r € s of a function belonging to class C* 
does not depend on the order of the differenti- 
ation. A function belonging to C? = AC is 
said to be of class C? or infinitely differenti- 
able. We sometimes say that a function has a 
certain “nice” property or is “well behaved” if 
it belongs to some C (r>1). 


Let w= f(x, y, ..., Z) be a function defined on 
an open set G in R” and P —(a, b, ...,c)e G. If 
f(x,y, ...,z)= f(a, b. isse) 


(z— c)" 


itas 


holds in some open neighborhood U of P, 
where the right-hand side of the equality is an 
absolutely convergent series, then f is said to 
be real analytic at P. In this case, f is r-times 
differentiable at P for any r, and we have 


rns -apiy- by: ee 


riirj!...r, 
(ra r4... rn) 
Qnin 
" A 
Ox" y"... 0z" 


le A mE 
éd 





(a, b, ...,c). 


If fis real analytic at every point P of the 
domain G, then f is called a real analytic func- 
tion on G. Sometimes, a real analytic function 
is called a function of class C®. A real analytic 
function belongs to C”, but the converse is not 
true (— 58 C?-Functions and Quasi-Analytic 
Functions E). 


L. Extrema 


Let f be a real-valued function defined on a 
domain G in an n-dimensional Euclidean 
space R” that has the point P, in its interor. 
If there exists a neighborhood U of P, such 
that for every point P ( Z P;) of U we have 
f(P) 2 f(P3), then we say that f has a relative 
minimum at P), and f(P,) is a relative mini- 
mum of f. Replacing > by <, we obtain the 
definition of a relative maximum. /(P,) is 
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called a relative extremum if it is either a rela- 
tive maximum or a relative minimum. 
To find relative extrema, the following facts 


concerning the sign of the derivative are useful. 


Suppose that a function of a single variable is 
differentiable on an interval J. Then we have 
the following: (1) If f has a relative extremum 
at an interior point x, of J, then f'(x;) — 0. 

(2) If (x9) 2 and f (x) changes its sign at x, 
from positive (negative) to negative (positive), 
then f has a relative maximum (minimum) at 
Xo. (3) If f'(x9) 2 and f is twice differentiable 
on some neighborhood of xo, then f has a 
relative maximum or minimum according as 
f' (xg) «0 or >0. If f"(x9) 20, then nothing 
definite can be concluded about a relative 
extremum of f at xo. In general, if there exists 
a neighborhood of x, in which f is r-times 
differentiable (r is even) and f” is continu- 
ous, and if f'(x) 2 f" (xo) =... =f (x9) 20, 
f 9 (xg) » 0 (or <0), then f has a relative mini- 
mum (maximum) at xo. On the other hand, if 
this condition holds with odd r, then f(x) is 
not a relative extremum. If f’(x))=0, then 
f(xo) is called a stationary value of f. 

If a function f on n variables x, y, ...,z has a 
relative extremum at (xo, yo, ..., Zo), then we 
have f.(Xo, Yo» +++» o) 7 0, f, (Xo. Yo. -...z9) = 
0, <., fa (Xo. Yos ---, Zo) 2 O, provided that the 
partial derivatives of f exist. Assume that for a 
function f of class C? of two variables x and y, 
we have f, (xo, yo) =0 and f,(xo, yo) =9, and let 
ô= fi (Xo. Yo) fy, (Xo. Yo) — f (Xo; yo). Then we 
have the following: (1) If 0 —0, then according 
as f. (xo, yo) «0 or 70, f has a relative maxi- 
mum or minimum at (xo, yo). (2) If ô «0, then 


f does not have a relative extremum at (xo, yo). 


(3) If 9 20, then without further information 
nothing definite can be said about a relative 
extremum of f at the point. 

Let x,, ..., x, be independent variables. If a 
function f of variables x,,..., x, has a relative 
extremum at a point Pj — (x9, ..., x2), then 
fi 9 f, (Po) 0 (i 1, ..., n), provided that all 
the partial derivatives of f exist. In general, a 
point P, where f is totally differentiable and 
these conditions are satisfied is called a critical 
point of f. The value f(P,) at a critical point is 
called a stationary value. If further f is of class 
C?, then consider a tquadratic form of n 
variables Q — Q(X,, ..., X,) - Din f. XiX,, 
where fi, = Le (P9). Suppose that | fil 40. 
Then according to whether Q is tpositive 
definite, tnegative definite, or findefinite, f 
has a relative minimum, relative maximum, or 
no relative extremum at P, If | | 2 0, then 
nothing can be said in general. À critical point 
P of f is said to be nondegenerate if | f| 40 
and degenerate if | f;, | — 0. 

We can also apply the method of differenti- 
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ation of timplicit functions to find relative 
extrema of functions defined implicitly. Given 
functions 91, ..., 9, (m HL the problem of 
finding a relative extremum of f(x,, ...,x,) 
under the condition that o,(x,, ..., Selz. 

<-s Pm(X1, ..., xX,) 2 01s called the problem 
of finding a conditional relative extremum. 
This problem can be reduced to the problem 
of finding a relative extremum of an implicit 
function. Actually, if the functions f, ox. 

.., Øm are of class C! and the !Jacobian 
O(@ 15 .... Pm)/O(Xp—m41> +++» Xn) does not vanish 
in the domain considered, then y, — x, ,,.,, 

-., Ym =X, can be regarded as implicit func- 
tions of x,, ..., x, (I2 n—m). Hence we can set 


TiX3o3Xi Ys E * (X4, ---, xj. Then f 
has a relative extremum at (x9, ..., x9) under 
the condition o, =...=@,,=0 if and only if 


f* has a relative extremum at P, —(x9, ..., xp). 
The latter condition implies that all 0f */Ox; 
(j— 1, ..., I) vanish at P5, which holds if and 
only if for arbitrary constants 4,, ..., Am the 
function F(x,,...,x,) fF A404... Ap Qu 
satisfies OF/Ox; —O0 (i — 1, ..., n), and further o, 
—0, ..., 94,70 at (x2, ..., x). From this system 
of equations we can often find the values of 
x$, ..., x0. This method of finding conditional 
relative extrema is called Lagrange's method of 
indeterminate coefficients or the method of 
Lagrange multipliers (— 208 Implicit Func- 
tions; 216 Integral Calculus H; 379 Series H). 
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A. Ordinary Differential Equations 


It was Galileo who found that the accelera- 
tion of a falling body is a constant and thence 
derived his law of a falling body x(t) 2 gt?/2 as 
what we would now view as a solution of the 
differential equation x"(t) 2 g, where x(t) de- 
notes the distance the body has fallen during 
the time interval t and g is the constant gravi- 
tational acceleration. This pioneering work 
may be regarded as the first example of solu- 
tion of a differential equation. Also, the fequa- 
tions of motion, proposed by I. *Newton as 
the mathematical formulation of the law of 
motion, including Galileo's law as a special 
case, are differential equations of the second 
order. Thus differential equations appeared, 
simultaneously with differential and integral 
calculus, as an indispensable tool for the uni- 
fied and concise expression of the laws of 
nature. Such laws are generally called dif- 
ferential laws. 

Newton completely solved the equations of 
the ttwo-body problem proposed by himself; 
G. W. *Leibniz also succeeded in solving many 
simple differential equations. 

In the 18th century, many mathematicians, 
such as the *Bernoullis, A. C. Clairaut, J. F. 
Riccati, L. *Euler, and J. L. tLagrange, at- 
tacked and solved differential equations of 
various types independently. In that period, 
the emphasis was on solution by quadrature, 
that is, applying to *elementary functions a 
finite number of algebraic operations, trans- 
formations of variables, and indefinite inte- 
grations. It was toward the end of the 18th 
century that new methods, such as integration 
by infinite series, came to be discussed. A 
method of variation of constants for the solu- 
tion of linear ordinary differential equations 
was invented by Lagrange in 1775. At the 
beginning of the 19th century, C. F. tGauss 
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initiated the study of differential equations 
satisfied by thypergeometric series. 

The problem of existence of solutions, which 
supplies a foundation of modern differential 
equation theory, was first treated by A. L. 
*Cauchy. His proof of the existence theorem 
was later improved by R. L. Lipschitz (1869). 

Pioneers in the function-theoretic treatment 
of differential equations were C. A. A. Briot and 
J. C. Bouquet, who investigated the singular 
points of a function defined by an analytic 
differential equation. Also, B. Riemann pro- 
posed a new viewpoint which influenced L. 
Fuchs in his development of the theory of 
linear ordinary differential equations in the 
complex domain (1865). Works of A. M. Le- 
gendre on telliptic functions and of H. *Poin- 
caré on tautomorphic functions should also be 
mentioned in this connection. 

After the Cauchy-Lipschitz existence the- 
orem for the equation y'= f(x, y) was known, 
efforts were directed toward weakening the 
conditions imposed on f(x, y). G. Peano first 
succeeded in giving a proof under the continu- 
ity assumption only (1890), and his results 
were sharpened by O. Perron (1915). 

Regarding the uniqueness of solutions of 


initial value problems, there are various re- 


sults by W. F. Osgood (1898), Perron (1925), 
and many Japanese mathematicians. In the 
course of this work the necessary and sufficient 
condition for uniqueness was successfully 
formulated in a concise form (— 316 Ordi- 
nary Differential Equations (Initial Value 
Problems)). 

For linear differential equations with peri- 
odic coefficients, investigations were carried 
out by C. Hermite (1877), E. Picard (1881), 

G. Floquet (1883), G. W. Hill (1886), and 
others. For instance, solutions satisfying y(x + 
w) — Ay(x) were found to exist, where w is 

the period of the coefficients. Analogous re- 
sults followed in the case of doubly periodic 
coefficients. 

Techniques of factorization of linear dif- 
ferential equations developed by G. Frobenius 
(1873) and E. Landau (1920) should also be 
noted. Picard (1883), J. Drach (1898), and 
E. Vessiot (1903, 1904) established a remark- 
able result on the solvability (in the sense of 
solution by quadrature) of linear differential 
equations, successfully extending the *Galois 
theory in this new direction. 

The concept of fasymptotic series, which in 
a sense approximate the solution of differential 
equations, was introduced by Poincaré (1886) 
and extended by M. A. Lyapunov (1892), J. C. 
C. Kneser (1896), J. Horn (1897), C. E. Love 
(1914), and others. Poincaré was also the 
founder of topological methods in differential 
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equation theory, and his ideas were developed 
extensively by I. Bendixson (1900), Perron 
(1922, 1923), G. D. Birkhoff, and others (— 126 
Dynamical Systems). 

In 1890, Picard invented an ingenious tech- 
nique of tsuccessive approximation for the 
proof of existence theorems, and his technique 
is now widely used in every application of 
functional equations. The technique of reduc- 
ing linear differential equations to linear 
"integral equations of Volterra type was also 
developed. 

On the *boundary value problems and 
feigenvalue problems that appear in many 
areas of physics, there was extensive research 
by mathematicians such as J. C. F. Sturm 
(1836), J. Liouville, L. Tonelli, Picard, M. 
Bócher (1898, 1921), Birkhoff (1901, 1911), 
and others. In this connection the problem 
arises of expanding a given function by an 
torthogonal system of functions obtained as 
teigenfunctions of a given boundary value 
problem. Those problems were brought into 
unified form by D. tHilbert (1904) in his theory 
of 'integral equations. Subsequently boundary 
value problems of ordinary and partial dif- 
ferential equations came to be discussed in this 
framework. 

Finally, it should be mentioned that the 
calculus of variations created by Euler and 
Lagrange gave rise to the study of a certain 
class of differential equations bearing the name 
of Euler (— 46 Calculus of Variations). 


B. Partial Differential Equations 


The origin of partial differential equations can 
be traced back to the study of hydrodynamic 
problems by J. d'Alembert (1744) and Euler. 
However, perhaps Lagrange and P. S. tLa- 
place were the first to investigate the general 
theory. Subsequently, during the 18th and 
19th centuries, it was developed by G. Monge, 
A.-M. Ampère, J. F. Pfaff, C. G. tJacobi, 
Cauchy, S. "Lie, and many other mathema- 
ticians. The fundamental existence theorem 
for the initial value problem, now called the 
Cauchy-Kovalevskaya theorem, was proved 
by S. Kovalevskaya in 1875 (— 321 Partial 
Differential Equations (Initial Value Problems) 
B). 

Because of their close connection with prob- 
lems of physics, linear equations of the second 
order have been a chief object of research. Up 
to the 19th century, classification into felliptic, 
thyperbolic, and tparabolic types and the 
study of boundary and initial value problems 
for each of these types constituted the main 
part of the theory. 

In the 20th century, more complicated 
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problems—'nonlinear problems appearing in 
the study of viscous or compressible fluids, or 
the study of equations of tmixed type in con- 
nection with supersonic flow—have emerged 
as important topics; and the newly developed 
techniques of functional analysis have brought 
about remarkable changes. Especially in the 
study of the !'Schródinger equations of quan- 
tum mechanics and of more general tevolution 
equations , this method has proved to be a 
powerful tool. 

Finally, we should not fail to mention that 
the development of electronic computers has 
made it possible to obtain numerical solu- 
tions and to discover many important facts. 
*Numerical analysis is now becoming an indis- 
pensable part of the theory (— 304 Numerical 
Solution of Partial Differential Equations). 
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A. Introduction 


The study of differential games arose from the 
study of pursuit and evasion problems and 
various tactical problems. The first work was 
done by R. P. Isaacs [1] in a series of RAND 
Corporation memoranda that appeared in 
1954. He applied to many illustrative examples 
the method of Hamilton-Jacobi differential 
equations (— eq. (4) below). The heuristic 
results of Isaacs were made rigorous by W. H. 
Fleming [2], L. D. Berkovitz [3], A. Friedman 
[4,5], and others. 


B. Zero-Sum Two-Person Games 


Suppose that there are two antagonists, each 
exerting partial control over the state of a 
system. One wishes to maximize a given payoff 
that 1s a functional of the state and the control 
exerted, while the other wishes to minimize 
this payoff. Let the state of a differential game 
at time t be represented by an n-dimensional 
vector x(t) eR". In a zero-sum differential game 
between two players I and II, we are given a 
system of n differential equations 


dx/dt = f(t, x, u, v) (1) 


with an initial condition x(t)=ée¢R", where 
uc R? is chosen at each instant of time by 
player I and v eR? is chosen by player II. We 
assume that the function f(t, x,u, v) is con- 
tinuous in t and continuously differentiable 
on the entire (x, u, v)-space. 

It is usually assumed that both players 
know the present state of the game and that 
they know how the game proceeds; that is, 
they know the system (1). Each player can take 
the state of the game into account in making 
his choice. Thus player I can let his choice of u 
be governed by a vector function u(t, x) defined 
on D, where D c [0, oo) x R” is a fixed region of 
the (t, x)-space. Similarly, player II can let his 
choice of v be governed by a vector function 
v(t, x) defined on D. 

A finite collection of subregions D,, ..., D, of 
a region D is said to constitute a decomposi- 
tion of D whenever the following conditions 
hold: (i) Each D, (i= 1, ...,r) is connected and 
has a piecewise smooth boundary; (1i) D; 

D;2 o if ij. A function defined on D is said to 
be piecewise C! in x on D if there is a decom- 
position of D such that on each D, the function 
and all its derivatives with respect to x are 
continuous in (t, x) on Dj. Let U and V be 
fixed closed subsets of R?” and Ri. respectively. 








108 B 
Differential Games 


Let S, denote the class of functions u(t, x) that 
are piecewise C! in x on D and have their 
range in U. Similarly, let S, denote the class of 
functions v(t, x) that are piecewise C! in x on D 
and have their range in V. 

Let weS, and ve S,, and consider the dif- 
ferential equation 


dx/dt = f(t, x, u(t, x), v(t, x)), (2) 
subject to the initial condition 
x(t)=6¢. (3) 


We say that a pair (u, v)e S, x S, is playable 
if for every (t, č) in D every solution of (2) 
satisfying (3) stays in D and reaches a terminal 
manifold F in finite time, where F is a smooth 
manifold contained in D. Let Q, c $,, O,c S, 
be the maximal pair of subclasses such that 
each pair (u, v)e Q, x Q, is playable. We call the 
functions in Q, and Q, the strategies for the 
players. 

For each strategy pair (u, v) we can define a 
functional 


J(t, ¢, u,v) 


= g(ty,x(t,)) + | " h(t, x(0), u(t, x(0), e(t, x(0))) dt 


to 


where x(t) is a solution of (2) and (3) and t, is 
the first time that (t, x(t)) reaches the terminal 
manifold F. The functional J is called the 
payoff. 

Let (u*, v*) eO, x Q, be a strategy pair. 
Suppose that for any ueQ, and veQ,, the 
inequalities 


I(t, 6, u, v*) < J (t, Č, v*) < J (t, č, u*, v) 


hold for all (t, £)e D. We say that (u*,v*) is a 
saddle point relative to the classes Q, and Q,. 
The function 


Wt, x) 2 J (t, x, u*,v*) 


defined on D is called the value function of 
the game. Berkovitz [3] proved that the 
value function W(t, x) is continuous on D and 
continuously differentiable on each D; and 
satisfies 


max [A(t, x, u,v") WA, x) ft, x, u, v*)] 
-min [A(t, x, u*, v) + W, (t, x) f(t, x, u*,v)] 


— h(t, x,u*,v*) -- W(t, x) f(t, x, u*, v*) 
= — W(t, x). (4) 


Equation (4) is called the Hamilton-Jacobi 
equation. 

Let x*(t; t, č) be the optimal trajectory corre- 
sponding to the saddle-point strategies (u*, v*) 
and resulting from an initial point (t, £)e D. 
Then there exists an n-dimensional continuous 
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vector function A(t; t, č) such that the following 
hold [3]: 

(i) The functions x* and 4 satisfy the system 
of differential equations 


dx/dt = f(t, x, u*(t, x), v*(t, x)), 
dA/dt = — H(t, x, A, u*(t, x), v*(t, x)), 
where 
H(t, x, A,u, v) - h(t, x, u,v) + Af (t, x, u, v). 
(ii) If x 2 x*(t; c, č), t2 v, then 
Wu, x)— 4(t; t, 8). 
(iii) At t — t,, the transversality condition 


ôT 0góT 0g0X ðX 


hd =0 
Oc Org Gx 0c Oo 





holds, where the terminal manifold F is given 
parametrically by the relations 


t=T(o), x=X(o), 


c ranging over a cube in some finite- 
dimensional Euclidean space. 
(iv) For all t<t<t,, 


max min H(t, x*(t), A(t), u, v) 
ve veV 


=minmax H(t, x*(t), A(t), u, v) 
veV uc 


= H(t, x*(t), A(t), u*(t, x*(t)), v*(t, x*(t))). 


P. Varaiya and J. Lin [6] and Friedman 
[4,5] have defined certain special classes of 
differential games, and have shown that under 
their definitions the games have nonzero value 
functions. 


C. N-Person Differential Games 


In a differential game between many players, 
the state vector x(t)e R" is governed by 


dxjdt — f(t,x,u,,...,uy) — Zirka, 


where uch is chosen at each instant of time 
by player i. Each u; is constrained to lie in a 
fixed closed subset U; of Ri Let S; be the class 
of functions u;(t, x) that are piecewise C' in x 
on D and have their range in U;. 

We say that an element u = (u,, ..., uy) of 
S, x... x Sy is playable if, for every (t, £)e D, 
every solution of the differential equation 


dx/dt = f(t, x, u; (t,x), ...,uy(t, x), — x(1)- 6, 
(5) 


stays in D and reaches a terminal manifold 

F in finite time. Let O; c S, (i— 1, ..., N) be 

the maximal subclasses such that each ele- 
ment u —(u,, ...,uy)e Q, x ... x Oy is playable. 
We call the functions u;e Q; (i— 1, ..., N) the 
strategies. For each strategy N-tuple we define 
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a payoff 


Jit, Č, Hi, S Ux) =Gi(t,, X(t,)) 


ti 
-| ht, x(t), u; (t, x(t)), ..., uy(t, x(t))) dt, 
to 

where x(t) is a solution of (5) and t, is the first 
time that (t, x(t)) reaches the terminal manifold 
F. Each player i is to choose his strategy u;eEQ; 
so as to maximize his own payoff J;. 

There are many definitions of “solution” for 
games involving more than two players. A 
strategy N-tuple u* — (u,*, ...,uy*) is called an 
equilibrium point for the game if the 
inequalities 
Fu, *. ss 


* * 
2, Ua T Uj, paa...) 


& Jiu, *, eng ap ay se) (i= I, vae N) 


hold for any u,€Q,,...,uyeQy. J. H. Case [9] 
has shown that the conclusions drawn for 
zero-sum two-person games also hold for N- 
person differential games. 
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In differential geometry in the classical sense, 
we use differential calculus to study the prop- 


403 


erties of figures such as curves and surfaces in 
Euclidean planes or spaces. Owing to his 
studies of how to draw tangents to smooth 
plane curves, P. *Fermat is regarded as a 
pioneer in this field. Since his time, differential 
geometry of plane curves, dealing with cur- 
vature, ‘circles of curvature, tevolutes, ten- 
velopes, etc., has been developed as a part of 
calculus. Also, the field has been expanded to 
analogous studies of space curves and surfaces, 
especially of tasymptotic curves, ‘lines of cur- 
vature, tcurvatures and ‘geodesics on surfaces, 
and truled surfaces. C. F. *Gauss founded the 
theory of surfaces by introducing concepts of 
the tgeometry on surfaces (Disquisitiones circa 
superficie curvas, 1827). Gauss recognized 
the importance of the intrinsic geometry of 
surfaces, and it is generally agreed that differ- 
ential geometry as it is known today was ini- 
tiated by him. Thus differential geometry came 
to occupy a firm position as a branch of 
mathematics. The influence that differential- 
geometric investigations of curves and surfaces 
have exerted upon branches of mathematics, 
physics, and engineering has been profound. 
For example, E. Beltrami discovered an in- 
timate relation between the geometry on a 
*pseudo-sphere and tnon-Euclidean geometry. 
The study of *geodesics is a fertile topic deeply 
related to dynamics, the calculus of variations, 
and topology, on which there is excellent 
work by J. Hadamard, H *Poincare, P. Funk, 
G. D. Birkhoff, M. Morse, R. Bott, W. Klin- 
genberg, and M. Berger, among others. The 
theory of minimal surfaces initiated by J. L. 
Lagrange was an application of the calculus 
of variations. At the early stages of develop- 
ment, G. Monge, J. B. M. C. Meusnier, A. M. 
Legendre, O. Bonnet, B. Riemann, K. Weier- 
strass, H. A. Schwarz, Beltrami, and S. Lie 
contributed to the theory. Weierstrass and 
Schwarz established its relationship with the 
theory of functions. J. A. Plateau showed 
experimentally that tminimal surfaces can be 
realized as soap films by dipping wire in the 
form of a closed space curve into a soap solu- 
tion (1873). The Plateau problem, i.e., the prob- 
lem of proving mathematically the existence 
of a minimal surface with prescribed bound- 
ary curve, was solved by Tibor Rado in 1930 
and independently by J. Douglas in 1931. 
Although the relationship to function theory 
is lost for higher-dimensional minimal sub- 
manifolds, their study is intimately related to 
the calculus of variations and topology. 
Euclidean geometry is a geometry belonging 
to F. Klein's Erlangen program (— 137 Erlan- 
gen Program). For other geometries in the 
sense of F. Klein we may also consider the cor- 
responding differential geometries. For instance, 
in tprojective differential geometry we study 
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by means of differential calculus the properties 
of curves and surfaces that are invariant under 
projective transformations. This subject was 
studied by E. J. Wilczynski, G. Fubini, and 
others; *affine differential geometry and tcon- 
formal differential geometry were studied by 
W. Blaschke and others (— 110 Differential 
Geometry in Specific Spaces). 

Influenced by Gauss's geometry on surfaces, 
in his inaugural address at Góttingen in 1854 
Riemann advocated an intrinsic differential 
geometry completely independent of embed- 
dings (Über die Hypothesen, welche der Geo- 
metrie zugrund liegen; Werke, 2nd ed. 1892, 
272-287) (— 364 Riemannian Manifolds). 
Removing the restriction to two dimensions 
and considering abstract manifolds of dimen- 
sion n, he introduced what is now known as 
the Riemannian metric; actually he considered 
the more general metrics that had formed the 
subject matter of the dissertation of P. Finsler 
in 1918 (— 152 Finsler Spaces). Riemann- 
ian geometry includes Euclidean and non- 
Euclidean geometry as special cases, and is 
important for the great influence it exerted 
on geometric ideas of the 20th century. Under 
the influence of the algebraic theory of invar- 
iants, Riemannian geometry was then studied 
as a theory of invariants of quadratic tcovar- 
iant tensors by E. B. Christoffel, C. G. Ricci, 
and others. Riemannian geometry attracted 
wide attention after A. Einstein applied it to 
the tgeneral theory of relativity in 1916. 

In the same year, T. Levi-Civita introduced 
the notion of tLevi-Civita parallelism, which 
contributed greatly to the clarification of geo- 
metric properties of Riemannian spaces. Ob- 
serving parallelism to be an affine-geometric 
concept, H. Weyl and A. S. Eddington devel- 
oped a theory of Riemannian spaces “affinely” 
based on the notion of parallelism without 
using metrical methods. Such a geometry is 
called a geometry of an affine connection (— 
80 Connections). 

Every straight line in a Euclidean space has 
the property that all tangents to the line are 
parallel. In a space with an affine connection, 
we may define a family of curves called tpaths 
as an analog of straight lines. Such curves are 
solutions of a system of ordinary differential 
equations of the second order of a certain type. 
Coefficients of such differential equations 
determine a parallelism and hence an affine 
connection. H. Weyl discovered transforma- 
tions of coefficients that leave the family of 
paths invariant as a whole, namely, projective 
transformations of an affine connection. A 
geometry that aims to study properties of 
paths or affine connections that are invariant 
under these transformations is called a projec- 
tive geometry of paths. Such geometry was 
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studied by L. P. Eisenhart, O. Veblen, and 
others. The concept of projective connections 
was an outcome of such studies. Similarly, the 
concept of conformal connections was devel- 
oped from the consideration of tconformal 
transformations of Riemannian spaces. 

These geometries cannot in general be re- 
garded as geometries in the sense of Klein. 
Actually, any one of these geometries generally 
has no transformations that correspond to 
‘congruent transformations of geometries in 
the sense of Klein; even if it has such trans- 
formations, they do not act transitively on the 
space. Thus geometries are naturally divided 
into two categories, one consisting of geome- 
tries in the sense of Klein (based on the group 
concept) and the other of geometries based on 
Riemann's idea. Under such circumstances, 

E. Cartan unified the thoughts of Klein and 
Riemann from a higher standpoint and con- 
structed his theory of connections in a series of 
papers published between 1923 and 1925. He 
developed the theory of affine, projective, and 
conformal connections from a viewpoint con- 
sistent with that of Klein. Just as each tangent 
space of a Riemannian manifold is viewed as a 
Euclidean space, an affine connection regards 
the tangent space at each point as an affine 
space and develops it onto the tangent space 
at an infinitesimally nearby point. In discuss- 
ing projective connections Cartan attached a 
projective space to each point of a manifold as 
an infinitesimal approximation, and similarly 
for conformal connections. More generally, he 
attached to each point of a manifold a fixed 
Klein space, i.e., a homogeneous space of a Lie 
group, called the structure group. Thus Cartan 
introduced the concept of fiber bundle (— 147 
Fiber Bundles). Then he defined a connection 
as a development of the fiber, i.e., the gen- 
eralized tangent space, at each point onto the 
fiber at an infinitesimally nearby point (— 80 
Connections B). If G is the group of congruent 
transformations in Euclidean space, a mani- 
fold with connection having G as its structural 
group is called a manifold with Euclidean con- 
nection. Among manifolds with Euclidean 
connection, Riemannian manifolds are charac- 
terized as those without ttorsion. If we take 
the group of congruent transformations of 
projective (conformal) geometry as G, we have 
manifolds with projective (conformal) connec- 
tion in the sense of Cartan. Among these, there 
are remarkable ones called manifolds with 
normal projective (or conformal) connections, 
which are essentially the same as the ones 
studied by Veblen and others. Cartan's idea 
had a profound influence on modern differen- 
tial geometry. The method of moving frames, 
created by G. Darboux and extensively used 
by Cartan in his theory of connections, was a 


404 


forerunner of the theory of fiber bundles. Com- 
bining *Grassmann algebra with differential 
calculus, Cartan developed a powerful com- 
putational tool known as calculus of differen- 
tial forms (— 105 Differentiable Manifolds Q). 
Differential forms have become indispensable 
in topology, algebraic geometry, and in studies 
of functions of several complex variables, as 
well as in differential geometry. 

The work of Lie on transformation groups 
also had a profound influence on Cartan. The 
latter’s work on ‘Lie groups, particularly on 
simple Lie groups, and on differential geome- 
try culminated in 1926 in his discovery of 
*Riemannian symmetric spaces. These spaces 
are natural generalizations of the spherical 
surface and the unit disk in the complex plane 
with Poincaré metric, and play essential roles 
in unitary representation theory and in other 
areas of mathematics. 

A tangent line to a curve C at a point P of C 
is the limit line of the line PQ, where Q is a 
point on C aporoaching P; hence we can de- 
fine it locally. A concept (or property), such as 
this, that can be defined in an arbitrary small 
neighborhood of a point of a given figure or a 
space is called a local concept (or local prop- 
erty) or a concept (or property) in the small. 
In the early stages of the development of dif- 
ferential geometry, differential calculus was 
the main tool of study, so most of the results 
were local. On the other hand, a concept (or 
property) that is defined in connection with a 
whole figure or a whole space is called global 
or in the large. In modern differential geome- 
try, the study of relations between local and 
global properties has attracted the interest of 
mathematicians. This view was emphasized 
by Blaschke, who worked on the differential 
geometry of tovals and *ovaloids. The study of 
trigidity of ovaloids by S. Cohn-Vossen be- 
longs in this category, and many works on 
geodesics and minimal surfaces were done 
from this standpoint. 

From the viewpoint of modern mathemat- 
ics, the basic concepts on which we construct 
Riemannian geometry and geometries of con- 
nections are global concepts of !differentiable 
manifolds. However, in Riemann's time the 
theory of tLie groups and topology were not 
yet developed; consequently, Riemannian 
geometry remained a local theory. In 1925, H. 
Hopf began to study the relations between 
local differential-geometric structures and the 
topological structures of Riemannian spaces. 
However, except for the work of Cartan, Hopf, 
and a few others, differential geometry in the 
1920s was still largely concerned with surfaces 
in the 3-dimensional Euclidean space or local 
properties of Riemannian manifolds, and with 
affine, projective, and conformal connections. 
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Gradually the concept of differentiable mani- 
folds was clarified, the global theory of Lie 
groups made progress, and topology devel- 
oped; and the trend toward global differential 
geometry began slowly in the early 1930s. The 
dissertation of G.-W. de Rham published in 
1931 showed that the cohomology of a mani- 
fold can be computed in terms of differential 
forms (— 105 Differentiable Manifolds R). His 
theorem provides the theoretical foundation 
for expressing cohomological invariants of a 
manifold in terms of differential geometric 
invariants. In a series of papers immediately 
following de Rham's, W. V. D. Hodge es- 
tablished that, on a compact Riemannian 
manifold, every r-dimensional cohomology 
class can be uniquely represented by a har- 
monic form of degree r (— 194 Harmonic 
Integrals). 

An important class of complex manifolds 
with compatible Riemannian metric was dis- 
covered by J. A. Schouten, D. van Dantzig, 
and E. Kahler around 1929—1932. This class of 
manifolds, called Kahler manifolds today, 
comprises the projective algebraic manifolds. 
Hodge’s theory of harmonic integrals is most 
effective when applied to compact Kahler 
manifolds, (— 232 Kahler Manifolds). 

The most celebrated global theorem in 
classical differential geometry of surfaces is the 
Gauss-Bonnet formula (1848) (— 364 Rie- 
mannian Manifolds D). The formula was gen- 
eralized to closed hypersurfaces of Euclidean 
space by Hopf in 1925, to closed submanifolds 
of Euclidean space by C. B. Allendoerfer and 
W. Fenchel in 1940, and finally to arbitrary 
closed Riemannian manifolds by Allendoerfer 
and A. Weil in 1943. But the simple proof 
given by S. S. Chern in 1944 contained the 
notion of transgression, which has become 
essential in the theory of characteristic classes 
(— 56 Characteristic Classes). The discovery of 
Pontryagin classes for Riemannian manifolds 
(1944) and Chern classes for Hermitian mani- 
folds (1946) culminated in the tindex theorem 
and the tRiemann-Roch theorem of F. E. P. 
Hirzebruch, and finally in the tAtiyah-Singer 
index theorem. 

A simple but fruitful idea of S. Bochner, 
relating harmonic forms to curvature, es- 
tablished tvanishing theorems for harmonic 
forms of Riemannian manifolds and for holo- 
morphic forms of Kahlér manifolds under 
suitable positivity conditions for curvature. 
His idea has led to the vanishing theorems of 
K. Kodaira and others (— 232 Kahler Mani- 
folds D). 

The work of C. Ehresmann in 1950 on con- 
nections in principal fiber bundles established 
a solid foundation to Cartan’s theory of con- 
nections. Gauge theory in physics ts largely 
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based on the theory of connections in prin- 
cipal fiber bundle. 

R. H. Nevanlinna’s value distribution theory 
and its subsequent generalization by Chern 
and others can be best described in differential- 
geometric terms. 

Differentiable manifolds are currently ob- 
jects of research in both differential geometry 
and differential topology. While topology 
studies manifolds per se, differential geometry 
may be considered as the study of differenti- 
able manifolds equipped with geometric struc- 
tures, such as metric tensors, connections, 
(almost) complex structures, and various other 
tensors. Through these geometric structures, 
differential geometry enjoys close contact 
with many branches of mathematics. From 
its early days, differential geometry has had 
close ties to topology (as exemplified by the 
Gauss-Bonnet formula) and to partial dif- 
ferential equations and analytic functions 
(through, e.g., the study of minimal surfaces). 
The bonds with topology were strengthened 
by Morse theory and, more recently, by the 
theory of characteristic classes. In the most 
recent proof of the Atiyah-Singer index theo- 
rem, differential geometry is an important 
intermediary between topology and analysis. 
Differential geometry and algebraic geometry 
have enriched each other through Kahler 
manifolds. The theory of functions of several 
complex variables also has points of contact 
with differential geometry, such as value distri- 
bution theory and Cauchy-Riemann struc- 
tures. Contact and symplectic structures are 
basic to mechanics. Lorentz manifolds and 
connections in principal bundles are essential 
mathematical tools in the general theory of 
relativity and in gauge theory. Topics such as 
minimal submanifolds, manifolds of positive 
curvature, and closed geodesics are active and 
important areas of research belonging to Rie- 
mannian geometry proper; at the same time, 
differential geometry provides a language and 
methods that are important in wider areas of 
mathematics. 
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A. The Method of Moving Frames 


The main theme of this article is the theory of 
surfaces De. submanifolds) in a differentiable 
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manifold V on which a tLie transformation 
group G acts. 

If a *Lie group G of dimension r acts ftransi- 
tively on a space G, and the 'stability group 
for any point of G, consists of the identity 
element only, then G, is called the group mani- 
fold of G, and an element of G, is called a 
frame. If f, is a fixed frame, then the mapping 
a af,€ G, (ac G) gives a tdiffeomorphism of 
G to G,. Let I(G) be the set of all tdifferen- 
tial forms @ of degree 1 on G, that are invar- 
iant under transformations of G. Then Ice) 
is a linear space of dimension r and is the 
tdual space of the Lie algebra g of G. A basis 
{@,|1<A<r} of I(G) is called a set of rela- 
tive components of G. The structure equations 
hold: 


r 


KS 
H.v71 


do, == 


where c;,, are tstructure constants of the Lie 
algebra g. 

Let G be a Lie transformation group of a 
space E. Then a G-invariant submanifold of E 
on which G acts transitively is called an orbit. 
Each point ye E determines an orbit contain- 
ing y. When there exist parameters k; (1 &j « t) 
such that any G-invariant on E is a function of 
k,,...,k,, then these parameters are called the 
fundamental invariants of E. Let H (c G) be the 
stability group at a point y, on an orbit M; 
then M is identified with the thomogeneous 
space G/H by the diffeomorphism oq: M G/H, 
(ayo) - aH (ae G). Furthermore, a tprincipal 
fiber bundle (G,, M, H, x) is determined by the 
projection t: G,— M, t(afo)= ayo (ae G). The 
'fiber H, on a point ye M is a group manifold 
of H. H, is called the family of frames on y, 
and an element of H, 1s a frame on y. Local 
coordinates 6, (1 < u <s) of the group H are 
called the secondary parameters and are used 
to indicate frames in H,. When H is not con- 
nected, let H? be the connected component 
of the identity of H and M be the !covering 
manifold G/H? of M. An element je M over 
ye M is called an oriented element. Now as- 
sume that the group H is connected. Then the 
family of frames H, on each ye M is given as 
an tintegral manifold of a tcompletely inte- 
grable system of total differential equations 7; 
=0(1<i<r—s, n;e1(G)) on the group mani- 
fold G,. Here the x, are linearly independent 
and are called the horizontal components of M. 
The 7z; are linear combinations of the relative 
components w, of G, and their coefficients are 
generally functions of the fundamental invar- 
iants k;. For simplicity, we assume that the 
relative components To, are chosen such that 
the horizontal components z; and components 
o. (r —s «a &r) are linearly independent. Then 
the c, (1 « fi <s) are called the secondary com- 
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ponents. The differentials d0, of the secondary 
parameters are linear combinations of the oe. 
Furthermore, let {x,} (1 <ø <n) be local co- 
ordinates of E; then the differentials dx, are 
linear combinations of the differentials dk; and 
the horizontal components z;. 

Let G be a Lie transformation group of a 
space V. We regard two m-dimensional sur- 
faces W, and W, passing through xe V as 
equivalent if they have a tcontact of order p at 
x. Then an equivalence class of submanifolds 
is called a contact element of order p at the 
point x. Let E, be the set of all contact ele- 
ments of order p at x, where x runs over all 
the points in V. À contact element of order p 
naturally determines a contact element of 
order p — 1, and we denote this correspon- 
dence by V: E,—E, ,. Thus we obtain the 
series of correspondences 


V-—E,€ Ee... Ep- E,c... , 


where a contact element of order 0 is identified 
with a point of V. Since a transformation on 
the space V induces a transformation on E,, 

G is also a Lie transformation group of E,, 
and this transformation commutes with the 
mapping y. The fundamental invariants k; of 
E, are said to be of order p. We use similar 
terminology (such as frames of order p, etc.) 
throughout this article. The fundamental in- 
variants k; of order p (1 <j<t,) can be chosen 
such that they contain the fundamental invar- 
iants k; (1 <i<t,_,) of order p— 1. The ad- 
ditional t, —t,-, invariants k, (tj, «a «t,) are 
called the invariants of order p. The family H7 
(y€ E,) of frames of order p can be chosen 
such that H? is contained in the family H?~' 
(z= yy€ E, 4) of frames of order p — 1. If neces- 
sary, the family H7 of frames of order p can be 
made connected by defining an orientation of 
contact elements of order p. Furthermore, the 
horizontal components 7; (1 <j<r—s,) of 
order p can be chosen such that they contain 
the horizontal components z; (1 «i &r— s, ,). 
The additional s,.., — s, components 7, (r 
—55-, SASF —5,) are called the principal 
components of order p. 

Let W be an m-dimensional surface of a 
space V. The contact element of order p (20) 
is determined at every point of W and ex- 
pressed by the family of frames of order p and 
the values of invariants of orders less than or 
equal to p. Let (uj) (1 & ix m) be local coordi- 
nates on W. Then the differentials du; are given 
as linear combinations of linearly independent 
differential forms z; (1 <i & m), where her, 
called the basic components of W, are certain 
linear combinations of the differentials of the 
fundamental invariants of V and of the horizon- 
tal components of orbits of V. Let F?(W) be 
the set of all families of frames of order p; then 
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F?(W) depends on m parameters u; and s, 
secondary parameters 0, of order p. On the 
space F?(W), the differentials of invariants of 
orders less than or equal to p — 1 and the prin- 
cipal components of orders less than or equal 
to p—1 are linear combinations of the basic 
components, whose coefficients are functions 
of the invariants of orders less than or equal 
to p. The differentials of invariants of order 

p and the principal components of order p are 
linear combinations of the basic components: 


dka h.n Pauca peg SOS Es 


RNa = E D. F—Sp-1 SASF — Sp 


where the coefficients h,,, b,; are functions of 
the invariants of orders less than or equal to p 
and, in general, the secondary parameters 0, of 
order p. These coefficients are called the coeffi- 
cients of order p. Let I, be a subgroup of G 
preserving a family of frames of order p and D, 
be a space whose coordinates are coefficients 
(h,;, bai) of order p. Then T, acts on D, asa 
transformation group. Knowledge of the prop- 
erties of contact elements of order less than 

or equal to p can be utilized to obtain infor- 
mation about the invariants of order p+ 1, etc. 
In fact, if we can choose in the I-space D, a 
subspace C, that intersects each orbit in D, at 
one and only one point, then in general the 
secondary parameters of order p associated 
with the points in C, correspond to the frames 
of order p, and the parameters associated with 
the points in C, are the invariants of order p + 
1. The restrictions of the coefficients of order 

p to C, are functions of the invariants of orders 
less than or equal to p+ 1; they are indepen- 
dent of the secondary parameters of order p. 
Thus the frames of order p+ 1 and the in- 
variants of orders less than or equal to p+ 1 
determine the contact elements of order p of W 
and their differentials; generally, the latter can 
be utilized to determine the contact elements 
of order p+ 1. 

This process of obtaining information of 
"order p+ 1" utilizing a suitable subspace C, 
of D, is the so-called general method of moving 
frames. However, the surface W may contain 
points for which the general method does not 
apply. Actually, there are surfaces W for which 
the method does not apply for any point in W. 
Thus various methods of moving frames are 
necessary to cope with different kinds of sur- 
faces. In the actual application of the method 
of moving frames, we use certain devices that 
help to simplify the calculations. In fact, an 
infinitesimal transformation (6h,,, ób,;) of the 
group I, acting on the space D, is expressed as 
a linear combination of the secondary compo- 


ai’ 


nents of order p; this expression is easily ob- 
tained by means of the structure equations of 
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G. The group 1,,, is a subgroup of T, fixing 
every point of the subspace C,, and its in- 
finitesimal transformation is such that 6h,;=0, 
ób,; — 0. The secondary components of order 
p 1 are immediately obtained from the 
equations for dk, and z,. Furthermore, when 
m Z 2, the condition for the principal compo- 
nents of every order to satisfy the structure 
equations of G is essential to the problem of 
the existence of (m-dimensional) surfaces. 

As we apply the method of moving frames 
consecutively to a surface W, we eventually 
arrive at the order q having the following 
properties: The families of frames of order 
q+ 1 coincide with those of order q, and the 
invariants k, of order q+ 1 are expressed as 
functions ¢,(k,) of invariants k, of order less 
than or equal to q. In this case, the families of 
frames of orders q +j (j 2 1) are all equal, and 
the invariants of orders q +j are partial ( j — 1)- 
derivatives of functions g,{k,). The family of 
frames of order q is called the Frenet frame. 
The differential invariants on a surface are 
defined to be differential forms generated by 
the basic components and the invariants of 
each order. 

Specifically, assume that the group G is an 
analytic transformation group of V, and the 
m-dimensional surfaces W,, W, are analytic. 
Then there exists an element g of G such that 
gW, =W, if and only if W, and W, are of the 
same kind and have the same relations among 
the invariants of orders less than or equal to 
q 4 1. These relations are called the natural 
equations of the surface. The theory of sur- 
faces based on the analysis of the natural 
equations of surfaces is called natural geome- 
try. The reduction formula can be obtained by 
utilizing the Frenet frame; it gives the equation 
of the surface in the form of power series con- 
taining the invariants of each order. 

Various results are known concerning the 
theory of surfaces of the spaces V,, V, sharing 
the same transformation group G. We also 
have a theory of special surfaces whose in- 
variants satisfy specific functional relations. 
Furthermore, we have problems concerning 
the deformation of a surface (preserving some 
differential invariants). Actually, the theory of 
surfaces of dimension m other than curves and 
hypersurfaces is in general quite difficult. The 
methods of tensor calculus can be applied to 
the study of surfaces. The theory of fconnec- 
tions can be considered to be an outgrowth of 
the study of surfaces by means of the method 
of moving frames and tensor calculus. 


B. Projective Differential Geometry 


The rudiments of differential geometry sub- 
ordinated to the *projective transformation 
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group, or projective differential geometry, can 
be found in the Theory of surfaces by J. G. 
Darboux. The subject has been systematically 
studied by H. G. H. Halphen, E. J. Wilczynski, 
and G. Fubini. The Fubini theory was en- 
riched substantially by E. Cartan, E. Cech, E. 
Bompiani, and J. Kanitani. 

In this section we consider a surface S in a 
3-dimensional projective space. Let A(u!, u?) 
(u',u? are parameters on S) be a point of S, 
and associate with A all the frames TA. A,, 
Az, A3] (|A, A1, Az, A3| 2 1), where A,, Aa, A, 
are points of the tangent plane to S at A.A 
family of such frames is called the family of 
frames of order 1, and we express its differen- 
tial by 


3 
dA,— 3 wf Ag, & — 0, 172.3. Ag — A. 
g=0 


The œf are tPfaffian forms that depend on two 
principal parameters determining the origin A 
and ten secondary parameters determining the 
frame. We have c +w! +3 4- 03 — 0, 0$ — 0. 
Furthermore, c! =a, o? — c$ are indepen- 
dent of each other and depend on the principal 
parameters only. Let zl z?, z? be tnonhomo- 
geneous coordinates with respect to a frame of 
order 1. Then in a neighborhood of the origin, 
S is expressed by z? 2X2, f,, where the f, are 
homogeneous functions of degree r with re- 
spect to z}, z?. 

If we write f; =(a9(z')* + 2a,z!z? + a,(z”)*)/ 
2, then it follows from the structure equations 
of the projective transformation group that 
vi zs dag) +4,07, 03 —a,0! t a0. If we 
put p, 2 ag(c! H + 2a, w! o -- a4(c?Y,, then 
a curve on S defined by p, =0 is called the 
asymptotic curve and its tangent the asymp- 
totic tangent. At any point of this curve, the 
plane tangent to S is in contact of order 2 
with this curve, and there are in general two 
asymptotic curves through any point of S. 
Equations of the asymptotic tangent at A 
are given by z? 20, f; —0. A point of S at 
which the asymptotic tangents coincide is 
called a parabolic point. If every point of S is 
parabolic, then S is a *developable surface, and 
the general theory is not applicable to such a 
surface. 

Among the family of frames of order 1, a 
frame satisfying ay =a, —0, a, = 1 is called the 
frame of order 2. For this frame, the straight 
lines AA,, AA, are asymptotic tangents. With 
respect to this frame, if f3 = —(bp(z')? + 
3b, (z')?2? + 3b,z!(z?Y? +b; (z7)°)/3, then œ? = 
boc! tbe, — o0 o1 T 02—o$-2(b,o0! + 
b,0?), e) bam -- b40?, and the quadric 
surface z? — z!z? — z*(b,z! +b,z? + pz?) (with p 
arbitrary) is called Darboux's quadric at A, an 
especially interesting one among contact quad- 
rics of S. Darboux's curve is a curve on $ such 
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that Darboux's quadric is in contact of order 3 
at any point of it. Its tangent is called Dar- 
boux's tangent and is given by z? —0, bo(z!)? + 
b4(z?y) —0. 

We have bob, £0, except in the case of 
truled surfaces. We take special frames of 
order 2 determined by b, =b, =0, by =b3=1 
and call them the frames of order 3. If a frame 
of order 3 satisfies f4 = —(co(z!)* + 4c, (z1)? z? 
+ 6(c, — 1) (z!z?y? +4c3z! (z2? - c4(22)*)/12, 
then oi — 201 - «2 — c0! t c40?, o0$— o 
=c,0'+c,07, 01 — 09 — c0! t c40?, o0 - 0] 
— 202 — c40! kam", With respect to this 
family of frames of order 3, (ef -- (»?)*)/ 
2o! w is an invariant associated with two 
neighboring points of S, called the projective 
line element. Also with respect to this frame, 
two straight lines A4,, A, A, are polar with 
respect to Darboux's quadric. 

Among the families of frames of order 3, a 
frame satisfying c, =c, =c¢3 =0 is called a 
frame of order 4. With respect to this frame, 
there exist A, u, v, p such that w? 2 Ao! + pw?, 
099 2 vo! + pw, w = po! + Ao. Hence if we 
put c9 = — 3a, c4,— — 3b, it follows that 


x o0 co 0 
ol th o! o? 
(o)=| à 3 
a 0 2 4 1 ]> 
o9 o tt w 
Of o3 Of —% 


t? = —(3/2)(ao! --bo?), «12 (1/2)(ao! + be’). 


Thus the frame of order 4 is the Frenet frame 
and is attached to every point of S. This frame 
is called the normal frame, and the invariants 
a, b, A, u, v, p are called the fundamental differ- 
ential invariants. The straight lines AA, and 
A, A, associated with the normal frame are 
called directrices of Wilczynski of the first and 
second kind, respectively. With respect to the 
normal frame, S is expressed by 


2 DEEL —((z1)* +(z7)9)/3 
+ (a(z')* + b(2?)*)/4 + (21 2?)2/2 4- .... 


A necessary and sufficient condition for two 
surfaces S, S to be projectively equivalent is 
that there be normal frames having the same 
ol, o»? and the same six fundamental differen- 
tial invariants. For six quantities a, b, A, 4, v, p 
to be fundamental differential invariants of a 
surface, they must satisfy a certain condition of 
existence [6]. 

A frame of order 1 such that AA, and A, A, 
are polar with respect to Darboux's quadric 
is called Darboux's frame. With respect to 
this frame also, a theory of surfaces has been 
established. 

Consider a pointwise correspondence be- 
tween two surfaces S, S, and denote by AeS 
the point corresponding to A€5. If there exists 
a projective transformation q that transforms 
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A into A and the image ¢(S) is in contact of 
order 2 with S at A, then the pointwise corre- 
spondence is called a projective deformation. 
A necessary and sufficient condition for the 
existence of a projective deformation between 
two surfaces is that these surfaces have the 
same projective line element [6]. A ruled sur- 
face is projectively deformable only to a ruled 
surface. Given an arbitrary surface S, it is 
generally impossible to find a surface that is 
different from S and projectively deformable to 
S; some conditions must be satisfied [6, 8 ]. 

Let p°, p°?, p93, p12, p13, p?3 be tPlücker 
coordinate of a straight line in a 3-dimensional 
projective space P?. Then we have p?! p?* — 
pO? p+? +p% p!? — 0, and there is a one-to- 
one correspondence between the ratios of { p" 
and straight lines (2 90 Coordinates B). If the 
p? are regarded as homogeneous coordinates 
of a 5-dimensional projective space P", then 
the previous equation defines a hyperquadric 
Q in P?. Thus there is a one-to-one correspon- 
dence between points of Q and straight lines in 
P?. A curve on Q corresponds to a set of one- 
parameter families of straight lines, or a ruled 
surface. Sets of 2-parameter or 3-parameter 
families of straight lines corresponding to 
surfaces of 2 or 3 dimensions on Q in P? are 
called congruences of lines or complexes of 
lines, respectively. Thus by using a theory of 
surfaces in P?, it is possible to establish the 
theory of congruences and complexes [2, 6, 8], 
which is an important part of projective dif- 
ferential geometry. 

Specifically, if the surface is either a curve or 
a hypersurface, there are numerous interesting 
results [2, 4, 6]. 


C. Affine Differential Geometry 


The theme of general affine differential geome- 
try is the study of differential-geometric prop- 
erties of a point or set of points in a space 

that are invariant under the action of the 
taffine transformation group. Affine differen- 
tial geometry is the study of the properties 
invariant under the action of the tequivalent 
affine transformation group, Le., a subgroup of 
the affine transformation group formed by 
elements sending (x;) to (x;) such that 


X-ajt$ agx, i=1,...,n,  det(aj)-l. 
ju 


The latter transformation leaves invariant the 
volume surrounded by an oriented closed 
hypersurface. The method of moving frames is 
effective in affine differential geometry. 

Let C be a plane curve, and associate with 
any point A — A(t) of C a family of frames 
[A, e, e; ], where the area of the parallelogram 
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determined by the two vectors e,, e, is equal 
to 1. This frame is called the frame of order 0. 
Its differential is expressed by 


2 2 

dA-Y œe, de-2Yoe, r=1,2, 
s=1 s=1 

wi 02-0. 


The frames of order 1, 2, and 3 are character- 
ized by œ? =0; o? =0, w? — 0! ; and w?=0, 

c? — o), o) =0, respectively. Then a frame of 
order 3 can be associated with each point of C 
and coincides with the Frenet frame. We call 
o! — do the affine arc element; the affine curva- 
ture x is defined by œt = — do. Then the 
Frenet formula is given by 


dA=doe,, de,=doe,, de,=—Kdeoe,. 


With respect to this frame, C is expressed as 
y—x?/2-r kx*/8 + (dk/do)x5/40 4- .... 
Further, do and x are given analytically by 
dc —|dA,d^A|'2, — x — |d? A/do?, d? A/do?|, 


where |M, N| 2 det(M, N). M, N are column 
vectors with two entries. We call o the affine 
arc length. The straight line on which e, is 
situated is called the affine normal, the dia- 
meter of the parabola osculating C at A. If x is 
constant, then C is a conic section. Further- 
more, C is an ellipse, hyperbola, or parabola 
according as the constant x is positive, nega- 
tive, or zero. In affine geometry, parabolas play 
a role similar to that played by straight lines in 
Euclidean geometry. 

There are numerous results concerning the 
theory of skew curves and surfaces [1]. Con- 
cerning the theory of skew curves, results on 
affine length, affine curvature, affine torsion, 
affine principal normals, and affine binormals 
are similar to those in Euclidean geometry. 
The affine transformation group is situated 
between the projective transformation group 
and the congruent transformation group and 
hence has properties analogous to theirs. The 
theory of surfaces has a character similar to 
that of projective differential geometry [1]. 
We may also consider the variation of the 
affine area of a surface surrounded by a closed 
skew curve C. We call the extremal surface the 
affine minimal surface. W. Blaschke and others 
obtained many results on the global properties 
of such surfaces. 


D. Conformal Differential Geometry 


Let S" be a tconformal space of dimension 

n, and associate with each point A5 € $" a 
frame R[ Ap, A,, ..., Ans A] of the t(n + 2)- 
hyperspherical coordinates with origin Ao 
(— 76 Conformal Geometry). Then denoting 
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by A: B the tinner product of hyperspheres 
A, B, we obtain 


A, Ap = Gap; a, B=0,1,...,n, 00, 
where 
0 0 -1 
(Gag) = 0 Jij 0}, 
—] O0 0 


ij-1l,...,T, Jj = jie 


Let z* be homogeneous coordinates with 
respect to R. Then the tMObius transforma- 
tion zZ of S" is characterized by z* —c5z^, 
where g,sczc? = gan |cf| #0. The differential of 
the family of the frames is defined by 


dA, zs c As, (1) 
p=0,1 
where 
we ox, 0 
(02) = |E ga ok, oi X Giz , 
0 o.  —Ob 


x (Jof T gg Ot) = di. 


There are (n+ 1)(n 4- 2)/2 linearly independent 
forms among œ, and this is the number of 
parameters of the Móbius transformation 
group. The structure equations of this group 
are 


dof =) oi ^f. (2) 


The theme of conformal differential geometry 
is the properties of Pfaffian forms o satisfying 
(1) and (2). 

Consider a transformation c: 3z*A,— 
Y z*(A, +dA,). (1) If all œ vanish except c9, 
then all the circles through Ap, A. are in- 
variant, and any point P is transformed to a 
neighboring point P on the circle, such that 
the cross ratio (P, P; Ao, A») is constant. This 
transformation is called the homothety with 
centers Ao, A». (2) If all œ vanish except Qj, 
wo” =} gi 0€, then all the circles tangent to a 
fixed direction at A,, are transformed into 
themselves, and any hypersphere through A,, 
and orthogonal to those circles is transformed 
into a hypersphere having the same property. 
This transformation is called the elation with 
center A. (3) If all œ vanish except w? = 
Y gg 0, o, then the transformation is an 
elation with center Ap. (4) If all œ vanish 
except wf, then the transformation is an in- 
finitesimal rotation with center Ao, with A,, 
regarded as a point at infinity. Thus any in- 
finitesimal Móbius transformation is de- 
composed into the previous four types of 
transformation. 

To study the theory of curves and hypersur- 
faces in ST, we again utilize the Frenet frame 
chosen from a family of frames associated with 
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Ag. For example, the Frenet formula of a curve 
in S? is given by 


(oj) de 0 0 «do Q 
—tda 0 0 
0 kdo —do 0 0 


We call do, x, and t the conformal arc element, 
conformal curvature, and conformal torsion, 
respectively. There are many results on con- 
formal deformation [5]. 

Concerning Laguerre differential geometry, 
we have results dual to those in conformal 
differential geometry (a point is replaced by a 
straight line and an angle by a distance be- 
tween the points of contact of the common 
tangents of two oriented circles). 


E. Contact Manifolds 


Consider a (2n + 1)-dimensional differentiable 
manifold M?"*! with a 1-form 5 such that 

n A(dn)" #0, where dy is the exterior derivative 
of ug and ^ denotes exterior multiplication. 
(Note that this is true for the 1-form in the left- 
hand side of eq. (2) in 82 Contact Transforma- 
tions A.) Such a manifold is called a contact 
manifold with contact form y. The structure 
group of the tangent bundle of a contact mani- 
fold M?"*! reduces to U (n) x 1, where U(n) is 
the unitary group; hence every contact mani- 
fold is orientable. Simple but typical examples 
are given by the unit sphere S?"*! in Euclidean 
space E?"*? and the tangent sphere bundle of 
an (n+ 1)-dimensional Riemannian manifold 
M"*!, both with natural contact forms (S. S. 
Chern [9]). Every 3-dimensional compact 
orientable differentiable manifold is a contact 
manifold (J. Martinet [12]). 

Now a differentiable manifold M?"*! is said 
to be an almost contact manifold if it admits a 
tensor field @ of type (1, 1), a vector field £, and 
a 1-form y such that 


GRELL EE (3) 


where X is an arbitrary vector field on M?"*! ; 
and the triple (o, £, n) is then called an almost 
contact structure. (3) implies that oc =0 and 
n(q X)-0 (S. Sasaki [14, I]). The structure 
group of the tangent bundle of an almost 
contact manifold M?"*! reduces to U (n) x 1. 
Indeed, J. W. Gray [10] took this property as 
his definition of almost contact structure. For 
any pair of vector fields X and Y on M?"*!, let 


N(X, Y)2[X, Y] o[oX, Y] 
to[X,oY]—-L[oX. oY] 
—(X n(Y)- Y: n(X)j 6 
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where [ , ] is the Poisson bracket; then N is a 
tensor field of type (1,2) over M?"*!, which we 
call the torsion tensor of the almost contact 
structure (9, č). When N vanishes identically 
on M?"*., we say that the almost contact 
structure is normal. 

An almost contact structure (o, č, n) on 
M?*** induces naturally an almost complex 
structure J on M?"*! x R (resp. M?"*! x S1), 
which reduces to a complex structure if and 
only if (o, €,4) is normal. A similar statement is 
also valid for the product space of two almost 
contact manifolds (A. Morimoto [13]). 

If M?"*! is an almost contact manifold 
with structure tensor (9, €,7), we can find a 
positive definite Riemannian metric g so that 
g(oX,oY)-— g(X. Y) —n(X)n(Y) for any pair 
of vector fields X and Y, and the set (9, £, y, g) 
is then said to be an almost contact metric 
structure. 

When M?"*! is a contact manifold with 
contact form 1, there exists a unique vector 
field é which satisfies dyn(X, £) 0, q(£) 2 1 for 
any vector field X. We can then find a tensor 
field o of type (1, 1) and a positive definite 
metric tensor g so that (i) dy(X, Y)=g(X, oY) 
is satisfied for any pair of vector fields X and 
Y, and (ii) (o, Z 1, g) is an almost contact metric 
structure. The almost contact metric structure 
determined in this way by a contact form 7 is 
called a contact metric structure. A differenti- 
able manifold with normal contact metric 
structure is called a normal contact Rieman- 
nian manifold or a Sasakian manifold. Bries- 
korn manifolds are examples of such mani- 
folds. They include, besides the standard 
sphere $?"*!, all exotic (2n + 1)-spheres that 
bound compact oriented parallelizable mani- 
folds. An almost contact manifold is said to be 
regular or nonregular according as the integral 
curve of € is regular or not as a submani- 
fold. A compact regular contact manifold is a 
principal circle bundle over a symplectic mani- 
fold, and it admits a normal contact metric 
structure if and only if the base manifold is 
a Hodge manifold (Boothby and Wang [8], 
Hatakeyama [11]). 

Many research papers on the topology 
and differential geometry of manifolds with 
the structures defined above have been pub- 
lished by S. Tanno, S. Tachibana, D. E. Blair, 
M. Okumura, K. Ogiue, S. I. Goldberg, and 
others. 
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A. General Remarks 


Let f be an timmersion of an m-dimensional 
tdifferentiable manifold M of class C' into an 
n-dimensional Euclidean space E". More pre- 
cisely, f is a differentiable mapping of class C" 
such that the ‘differential df, is injective at 
every point p of M. The pair (M, f) is called 





an immersed submanifold (or a surface) of E". 
When m 1, we call it a curve of E", and when 
m —n-— 1, a hypersurface in E". The cases of 
n=2 and n=3 have been the main objects of 
study in differential geometry of curves and 
surfaces. The differential-geometric properties 
for the general case of immersion are discussed 
in 365 Riemannian Submanifolds. 


B. Frames in Er 


Every fEuclidean motion in E" can be ex- 
pressed as the product of a parallel translation 
and an *orthogonal transformation that keeps 
the origin of E" fixed. The set of all parallel 
translations is a commutative group that can 
be identified with R”. It is a normal subgroup 
of the group of motions I(E") of E". So we see 
that I(E”) is a tsemidirect product of R” and 
the torthogonal group O(n). The Lie algebra 
of I(E") is the direct sum of R" and the Lie 
algebra o(n) of the orthogonal group, where 
both are regarded as additive groups. Corre- 
sponding to this decomposition, we can write 
the fMaurer-Cartan differential form over 
I(E") as o + Q, where o belongs to R” and Q 
to o(n). The tstructural equation d(a+Q)= 
—(1/2)(c +9) ^ (o +9) can be divided into 
the following two parts: dw =Q ^ c; dO — 
—(1/2)Q ^ Q. These are known as the structure 
equations of E". By an orthogonal frame in 

E" we mean an ordered set (x, e,, ...,e,) con- 
sisting of a point x and a set of *orthonormal 
vectors e,, e;, ..., e,. We denote by O(n) the set 
of all orthogonal frames in E". If we denote 
the translation identified with x eR" by T;, 
then there is a one-to-one correspondence o: 
I(E")— € (n) given by of. A) — (x, Ae;. ..., Ae,) 
(AeO(n)). We can make O(n) into a differ- 
entiable manifold so that q is a tdiffeomor- 
phism. We denote the differential forms over 
€ (n), which are images of w and Q under the 
‘dual mapping of o '!, by the same letters œ 
and Q, respectively. For (n) as a *principal 
fiber bundle over R" with the projection z: 
z(x,e,,..., e) - x and n vector-valued func- 
tions q;: P(X, e,, ..., e,) - e; over C (n), we have 


w=) wie, Q=} QUE 
i i«j 
o — (dn, p), QË — (do, oj. (1) 


do'zY OI al, dQ9—-Y Q*AQU, 

j k 
where {E;;} is a basis of o(n) defined by E,e;=e;, 
E,e;=—e,, Eje,—O0 (k4#i,j) and ( , ) is the 
scalar product of vector-valued forms induced 
from the scalar product of E". Any diffeomor- 
phism of C (n) onto itself preserving œw and Q 
must be a Euclidean motion. 
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C. Theory of Curves 


Let (M, f) be an immersion of a 1-dimensional 
differentiable manifold M into E". We identify 
the tangent space of E" at each point with E" 
itself. Then df, maps the origin of the tangent 
space M, to f(x), and the image df (M) of M, 
by df, is a straight line passing through f(x) 

in E", called the tangent line of f(M) at f(x). 
By O,(M) we mean the set of all ordered sets 
(x, €,, ..., e,), where xe M and {e;} is an ortho- 
normal basis of E" such that e, e df, (M,). Then 
€,(M) can be naturally immersed in C (n) by 
the mapping f: f(x, e,, ..., e,) - (f(x), ei, ..., €n) 
We can pull back the differential forms c, Q, 
o. QU over O(n) to €, (M) f* and denote them 
by 6, ©, 64, and OU respectively; then we have 
0! —0 (i7 1). Let f, and f, be two immersions 
of M into E". Then in order for there to exist a 
Euclidean motion a of E" such that f, =a0 fz, 
it is necessary and sufficient that there exist a 
diffeomorphism oe of ©, (M) onto €, (M) such 
that 6, = p*(0;,), Or, — 9*(8, ). Letz, be the 
projection of the fiber bundle €, (M), and let o 
be naturally defined vector-valued functions 
over 0,(M). Then we have d( f on) — 0! @,, 

do, — È$- 87 oj. If we put ds? (X) = df, XO? 
(X e M,), then we have (0)? = z*(ds?). For each 
point xe M there are two possibilities for the 
choice of e, corresponding to two orientations 
of the curve. But since (dq, do) = n7 (p? ds’), 
p? depends only on the point x of M. We call 
p (2:0) the absolute curvature. We now choose 
an orientation of the curve and then e, in 
accordance with the orientation. Thus we get a 
submanifold of ©,(M), which we again express 
by the notation €, (M). If we define the form ds 
by ds(X) — (df(X), e,), we have 0! — x (ds), and 
ds is called the line element. Any tlocal cross 
section R:z, o R=1 of the bundle COM) is 
called a moving frame. Putting 


R*(0!)—ds, | R*(Q"U)— p ds, 


we see that the following equation holds over 
M: 


J 


For two immersions (M, f) and (M, f;), we 
have f, =«0 f, (x is a Euclidean motion) if and 
only if they have the same ds and p? for some 
moving frames. 


D. Frenet's Formulas 


In order to study local properties of curves it 1s 
sufficient to consider them on ‘Jordan arcs of 
class C". With respect to orthogonal coordi- 
nates (x!, ..., x") in E", such a curve is repre- 
sented parametrically by x‘= f'(t) (te [a,b], 
Y(dxi/dtY! > 0) or by a vector representation 
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x — x(t). If o is a diffeomorphism of a closed 
interval To, b^] onto [a,b], then fog and f 
are representations of the same arc in E", and 
Q is called a transformation of the parameter. 
Any curve of class C! is trectifiable, and its 
arc length is given by s = (2(277-, (dx'/dt)?)'? dt. 
We may choose the arc length s measured 
from a point on the arc as a parameter, called 
the canonical parameter of the arc. Consider 
an arc C of class C" given by the vector repre- 
sentation x — x(s), se[a, b]. We assume that its 
*Wronskian |x'(s), ..., x? (s)| is not identically 
zero (we denote by ' the derivative with respect 
to the canonical parameter), which means 
that the arc C is not contained in a hyper- 
plane in E". A point at which the Wronskian 
vanishes is called a stationary point, and we 
assume that there exists no stationary point 
on C. By the *Gram-Schmidt orthonormaliz- 
ing process we obtain an orthonormal basis 
ei, ..., e, ([e,, ..., e,] >0) from n vectors x'(s), 
..., X" (s) at each point of C. We call the frame 
thus determined the Frenet frame. With re- 
spect to the Frenet frame, (2) is rewritten as 


ei(s) 2 — K;_1(s)e;_,(s) + K;(s)e;+1 (S), 


Ko(s) — (s) — 0; (3) 
Kj(s) > 0, 


These are called Frenet's formulas (or the 
Frenet-Serret formulas). We call x,, &;, ..., 
K,.. the first, second, ..., (n— 2)nd curvature, 
respectively, while we call x, , the torsion for 
n23. For a curve in a lower-dimensional sub- 
space E" c E", we set k; —O (i m). The curva- 
tures and the torsion of a straight line are 
zero. To get Frenet's formulas in these special 
cases, we fix e, (i7 m) in the subspace ortho- 
gonally complementary to E" in E" and pro- 
ceed as in the general case. Suppose that C,, 
C, are arcs such that both of their Frenet 
frames are of class CT If there exists a diffeo- 
morphism of C, to C, that preserves arc length 
and the x; (i— 1,...,n— 1) are equal at corre- 
sponding points, C, and C, are mapped onto 
each other by a motion of E". This is the fun- 
damental theorem of the theory of curves. 
Given n — 1 functions of class C! «,(s) z 0, ..., 
K, (s) 20 (we assume that the equality signs 
occur at most at a finite number of points) 
and x, ,(s) for O< s< L, there exists an arc 
that has K,,..., K4 3, K,-, as its first, ..., 

(n — 2)nd curvatures and its torsion, respec- 
tively. The equations x; «;(s) are called the 
natural equations of the curve. 


E. Plane Curves 


Let x = x(s) be a curve of class C? in E?, and 
(x(s), e,, e;) its Frenet frame. The tangent and 


111 F 
Differential Geometry of Curves and Surfaces 


the normal of this curve at x(s) have para- 
metric representations x(s) - te,, x(s)-- te,, 
respectively (with parameter t). Frenet's for- 
mulas are written as x —e,, x" — e, =Ke,, and 
p is called the curvature of the curve C. The 
natural equation is given by x= «(s). If k(s)= 
constant 40 along C, C must be a portion of 
a circle. Another way of defining the curvature 
is as follows: We take a fixed direction (for 
example, the positive direction of the x-axis on 
E?) and denote by 0(s) the angle made by the 
tangent 7; of the curve C at x(s) with the direc- 
tion. Then we have x(s) — d0/ds. If n —2, the 
curvature can take both positive and nega- 
tive values. Figs. 1 and 2 suggest a geometric 
meaning of x » 0 and x «0, respectively. The 
circle with center x 7 x(s) - (1/x)e; and radius 
Liv has a contact of higher order than any 
other circle in E?. We call this circle the oscu- 
lating circle (or circle of curvature) at the point 
x = x(s), its center the center of curvature, and 
1/« the radius of curvature. The locus C’ of the 
center of curvature of a curve C is called an 
evolute of C. Conversely, C is called an involute 
of C’, the fenvelope of the family of normal 
lines of C. When a curve is given in terms of its 
canonical parameter s, the curvature is given 
by [x'(s), x" (s); when the curve is given by 
another parameter t as x = x(t), the curvature 
is given by x(t)— |x'(t), x'(0)|/|x'(0|*, where ' 
means d/dt. 





Fig. 1 


Fig. 2 
K>0. K «0. 


The facts we have just stated concern local 
properties of plane curves. We shall now dis- 
cuss the global theory of curves, which deals 
with properties of each curve as a whole. Let 
C:x-x(s, a<s<b, be a closed curve. Let 
0(s), 0 x 0(s) « 2n, be the angle that e(s) makes 
with the x axis. Put © = (70'(s) ds. Intuitively, 
O measures the total rotation of e,(s) as we 
run along the curve C from a to b. Since C is 
closed, © is an integer multiple J of 2x. The 
integer I is called the rotation number of C, 
and is equal to (1/2z) f? k(s)ds. Let D be a 
closed domain consisting of points in the inte- 
rior and on the boundary of a simple closed 
curve C. C is called a closed convex curve or 
an oval if D is tconvex in E?. Among all ovals 
of given length, the circle has the maximum 
area. Various generalizations of this theorem 
have been obtained, and the collection of 
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problems of this kind is called the isoperimet- 
ric problem. This problem has intimate con- 
nections with fields such as integral geometry. 
The oval has a convenient parameter other 
than the arc length parameter s. Given a num- 
ber t, O& t « 2z, there exists a unique point 
x(t) in the oval such that e, — (cost, sint) at 
x(t). When we describe the oval in terms of 
the parameter t, the tangent vector at x(t) is 
parallel to that at x(t + x), and we can define 
the width W(t) at x(t). W(t) is called the width 
of the oval. A curve is called a curve of con- 
stant width if the curve is an oval whose width 
W(t) does not depend on t. The circle is a 
typical example of a curve of constant width. 
Reuleaux's triangle is another well-known 
example of a curve of constant width (Fig. 3). 
For a curve of constant width of width W 
and length L, we have L - zxW. 


aN 


Fig. 3 


There are also some results concerning the 
relations between local properties (for exam- 
ple, curvature) and properties of the whole 
figure. An example is given by the four-vertex 
theorem. A vertex on a curve C is by defini- 
tion a point where dx/ds=0. Then there are at 
least four vertices on an oval of class C?. A 
simple closed curve with x >0 (<0) must be 
convex (— 89 Convex Sets). 


F. Space Curves 


Let x 2 x(s) (se[a, b]) be a curve C of class C? 
in E? defined in terms of the canonical para- 
meter s. Let (x(s), e,, e;, ej) be Frenet frames 
along C. Then we have the Frenet formulas 


e, =K,e2, e3 = —k,e, - K;e;, e3 = —Kk;5e,. 


We call 1/x,, 1/x, the radius of curvature and 
the radius of torsion, respectively. The line 

X — x(so) - te, is the tangent of C at x(sp). 
The two straight lines through the point x(sọ) 
defined by x 2 x(so) - te; and X 2 x(sg) - te, 
are called the principal normal and the binormal 
of C at x(so), respectively. The three planes 
through x(so) defined by x 2 x(so) - te; + tes, 
X — x(so) - te3 +te;, and x 2 x(sg) - te; + te; 
are called the normal plane, thc rectifying 
plane, and the osculating plane, respectively. 
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At a point x(so) of a curve x 2 x(s) of class C?, 
we take e; (So), e;(so), and e4(so) as unit vectors 
of the coordinate axes. Substituting the Frenet 
formulas into the Taylor expansion of x(s), we 
see that the new coordinate x; (s$), x,(s), X3(s) of 
C are given by 


x1 —($— 59) — (K1(s9)/6) (s — 59)?  ..., 
X2 — (k1(59)/2)(5— so)? + (4 (59)/6) (S — 59)? - ..., 
X3 — (K1(so) 2(s9)/6) (s— so)? +... 


These are called Bouquet's formulas. Utilizing 
these formulas we can see the nature of the 
curve with given x, and x. A curve and its 
osculating plane at a point on it have contact 
of order higher than any other plane through 
that point. The family of osculating planes 

of C tenvelops a tdevelopable surface S and 
coincides with the locus of tangent lines to C. 
We call S the tangent surface of C, and C the 
line of regression of S. The family of rectifying 
planes of C also envelops a developable sur- 
face called the rectifying surface, and C is a 
*geodesic on this surface. The family of normal 
planes of C envelops either a cone or a tangent 
surface of another curve C. When the natural 
equation of a space curve has a special form, 
the shape of the curve is simple. For example, 
K,(s) 2 constant, «;(s) — constant represent a 
curve, called an ordinary helix, on a cylinder 
which cuts all the generators of the cylinder at 
a constant angle. More generally, it is known 
that if k/k, — constant, the tangent at each 
point of the curve makes a constant angle with 
a fixed direction. Such a curve is called a gen- 
eralized helix or a curve of constant inclination. 
Each curve satisfying ax, + bk; — c (ab 40) is 
called a Bertrand curve. For a Bertrand curve 
there exists another curve C and a corre- 
spondence of C onto C such that they have a 
common principal normal at corresponding 
points. Conversely, this property is also a 
sufficient condition for C to be a Bertrand 
curve. A Mannheim curve is defined analo- 
gously as a curve having a correspondence 
with another curve C such that the principal 
normal of C and the binormal of C coincide at 
corresponding points. When a correspondence 
of C and C has the property that tangents at 
corresponding points are parallel, then the 
correspondence is called a correspondence of 
Combescure. 

We have stated mainly local properties of 
space curves. There are also several results 
about global properties of curves in E? analo- 
gous to the case of plane curves. For a sim- 
ple closed curve C of length L, we call K = 
fó xı (s) ds the total curvature of C. Generally 
we have K «2n, while K —2z if and only if C 
is a closed convex curve lying in a plane (W. 
Fenchel) [5, 6]. The total curvature is deeply 


111 G 
Differential Geometry of Curves and Surfaces 


related to the properties of tknots. If a simple 
closed curve in E? is knotted, then the total 
curvature is at least 4z [7,8]. We fix an origin 
O in E? and draw a unit tangent vector with 
initial point O parallel to the unit tangent 
vector at each point of a space curve C; then 
the endpoint of this vector traces a curve C on 
the unit sphere with the center O. We call C 
the spherical indicatrix of C and the corre- 
spondence of C to C a spherical representation. 
The total curvature K of a curve C is equal to 
the length of C. Consequently, we have K — 

$c d0, where 0 is the angular deflection of the 
tangent line along the closed curve C. 


G. Theory of Hypersurfaces 


Let (M, f) be an immersion of an (n— 1)- 
dimensional differentiable manifold M of class 
C" into E". Then we can define on the hyper- 
surface M a positive definite differential form 
g of degree 2 induced from the inner product 
of EN ok, X) 2 (df. (X), df (X)), X € M,. Then 
M becomes a *Riemannian manifold with 
*Riemannian metric g. We call g the first fun- 
damental form of (M, f). The Riemannian 
geometry on a surface with its first funda- 
mental form as Riemannian metric is called 
geometry on a surface (— 364 Riemannian 
Manifolds). 

By ©,(M) we mean the set of all the ordered 
sets (x, ei, ..., e,)e O(n), where xe M and [ej] 
(i— 1,2, ...,n) is an orthonormal system of E" 
such that e;e df, (M) (i— 1, ...,n— 1). Then 
€,(M) with natural projection z, and natural 
differentiable structure is a tprincipal fiber 
bundle over M and has a natural immersion 
f:f(x e, ee 6) S (f(x), e,, ..., e,) in the prin- 
cipal fiber bundle @(n). We can tpull back the 
forms on € (n) to ©,(M) by f* and put 0— 

f *(w), 9 = f *(Q); then the structural equa- 
tions of E" are transformed to d0 — O A 0, 

dO —(—1/2)0 ^ ©. Furthermore, if we put 

05 — f *(o), 99! — f (D), then 0" 20 and 07, 
O"(i, j « n) depend only on the first funda- 
mental form of (M, f). Let f, and f, be two 
immersions of M into E". Then in order that 
there exist a Euclidean motion « of E" such 
that f, —a0o f;, it is necessary and sufficient 
that there exist a diffeomorphism ¢ of ©, (M) 
onto €, (M) such that 6, — 9*(0, ) and O,, = 
Q*(O, ). Suppose that M is forientable and 
oriented. Then the unit vector field normal to 
df (M,) at every point xe M in E" defines a 
mapping of M into the unit sphere in £” called 
the spherical representation of M or the Gauss 
mapping (Gauss map). Regarding the unit 
normal vector field č of M as a vector-valued 
function over M, we can define a symmetric 
product of df and dé by — (df, d&)(X, Y)= 
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(1/2) [(af (X), dč(Y))+ (df (Y), dč(X))], called 
the second fundamental form of (M, f). 

Two immersions f, and f, of M that induce 
the same first and second fundamental forms 
have a Euclidean motion « such that f, = 
ao fa; and the converse is also true. This fact 
is called the fundamental theorem of the theory 
of surfaces. 


H. Theory of Surfaces in E? (— 365 Riemann- 
ian Submanifolds; Appendix A, Table 4.I) 


A surface in E" is locally expressed by para- 
metric equations x;— x;(u,) (i2 1,...,nm a= 
L,...,m) or by a single vector equation x = 
x(u,). We are mainly concerned with the case 
n=3,m=2, and we express the surface by a 
vector representation x — x(u, v). The first and 
second fundamental forms are written as 


Edu? +2F dudv+Gdv’, 
Ldu? +2M dudv+N dv. 


If we use the usual notation of tensor analy- 
sis, then the first and second fundamental 
forms are also denoted by g,,du*du* and 
H,,du* du, respectively, where u* (x= 1,2) are 
parameters (with } omitted by tEinstein’s 
convention). We call Igel, {H,,} the first and 
second fundamental quantities, respectively (— 
365 Riemannian Submanifolds). At the point 
Po = X(ug, vo) on a surface S that corresponds 
to parameter values (uo, Vo), the curves ex- 
pressed by v2 vy and u =u are called a u- 
curve and a v-curve through po, respectively. 
Let x,, x, denote the tangent vectors éx/éu, 
Ox/Ov at po to the u-curve and v-curve, respec- 
tively, through the given point p, and c de- 
note the unit vector orthogonal to x, and x,. 
Then £ is called the normal vector of S at p, 
and (x,, x,, Cl the Gaussian frame of S at po. 
Although a Gaussian frame is not in general 
an orthogonal frame, it is intimately related 
to local parameters. The plane that passes 
through the point p, and is spanned by x,, x, 
is called the tangent plane to S at py. The 
coefficients of the second fundamental form 
L(u, v), M(u,v), N (u, v) are expressed by the 
inner products L-(—x,,,, M -(—x, €,), 

N =(— X, Šo) (6, — 06/0u, Č, — 05/0v). 

Let (X, Y) be the coordinates of a point on 
the tangent space at pg with respect to the 
Gaussian frame. We call the curve of the sec- 
ond order defined by LX? -2MXY - NY?—& 
(e is a suitable constant) the Dupin indicatrix. 
The point p, is called an elliptic point or a 
hyperbolic point on S according as the Dupin 
indicatrix at the point is an ellipse or a hyper- 
bola. If pọ is an elliptic point, then points near 
po on the surface lie on one side of the tangent 
plane at po, whereas if p, is a hyperbolic point, 
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points near p, on the surface lie on both sides 
of the tangent plane at p, (Figs. 4, 5). A hyper- 
bolic point is also called a saddle point, since in 
a neighborhood of the point the surface looks 
like a saddle. A point that is neither elliptic 
nor hyperbolic is called a parabolic point; at a 
parabolic point we have LN — M? —0. If at 
least one of L, M, N does not vanish at po, 
then there is a neighborhood of p, of the sur- 
face that lies on one side of the tangent plane 
at po (Fig. 6). If a vector (X, Y) on the tangent 
plane of the surface at p, satisfies the equation 
LX? -2MXY - N Y? —0, then the direction of 
the vector is called an asymptotic direction. If 
the point p, is elliptic, such a direction does 
not exist; if py is hyperbolic, the direction is an 
tasymptotic direction of the Dupin indicatrix 
on the tangent plane at pọ. A curve C ona 
surface such that the tangent line at each point 
of the curve coincides with an asymptotic 
direction of the surface at the point is called an 
asymptotic curve. 


Fig. 4 
Elliptic point. 





Hyperbolic point. 





Fig. 6 
Parabolic point. 


Let C:x — x(u(t), v(t)) be a curve through pg 
on the surface x — x(u, t). Then the curvature xK 
of C as a space curve is given by 


Ldu?+2M dudv + N dv? 


par ae 
weoi e Edu FaF dudo Gd? 
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where dudv is the direction of C on the surface 
at pg and @ is the angle between the normal of 
the surface at pg and the principal normal of C 
at pg. The center of curvature at a point po of 
a curve C of class C? on a surface of class C? is 
the projection on its osculating plane of the 
center of curvature of the section C* (of the 
surface) cut by the plane determined by the 
tangent to the curve at py and the normal of 
the surface at the point (Meusnier's theorem). 
The curvature of the curve C* at p, is called 
the normal curvature of the surface at the point 
for the tangent direction. Since the normal 
curvature for a direction at a point is a con- 
tinuous function of this direction that can be 
represented as a point on a unit circle, there 
exist two directions that realize the maximum 
and minimum of the normal curvature. These 
directions are given by the equation 


Edu+Fdv F du 4- G dv = 
Ldut+tMdv Mdu+Ndv| ` 


When this quadratic equation in du/dv has 
nonzero discriminant, it determines two direc- 
tions defined by its two roots. These directions 
are called principal directions at the point. A 
curve C on a surface such that the tangent line 
at each point of the curve coincides with a 
principal direction at the point is called a line 
of curvature. When all lines of curvature of 

a surface are circles, the surface is called a 
cyclide of Dupin. The two normal curvatures 
corresponding to two principal directions are 
given by 1/R, satisfying the following second- 
order equation: 


1V EN-GL-2FM | LR-M?_ 
R EG-F? R EG-F) ` 


They are called principal curvatures, and each 
of their inverses is called a radius of principal 
curvature. The mean value H =(x,+«,)/2 of 
two principal curvatures x; UR, (i= 1,2) is 
called the mean curvature (or Germain's curva- 
ture), and the product K — x, x, is called the 
total curvature (or Gaussian curvature). These 
are given by 


AEN  GL-2FM 
2 EG-F ^" 


_LN-M? 
" EG- F^" 


A point on a surface is elliptic, hyperbolic, or 
parabolic according as K >0, K «0, or K Z0 
at the point. A point where the second fun- 
damental form is proportional to the first 
fundamental form is called an umbilical point, 
and a point where the second fundamental 
form vanishes is called a flat point or a geo- 
desic point. If a surface consists of umbilical 
points only, the ratio (L(du)* - 2M dudv + 

N (dv?) (E(duY + 2F dudv + G(du)’) is a con- 
stant, and the surface is either a sphere or a 
portion of it. If a surface consists of flat points 
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only, the surface must be either a plane or a 
portion of it. The mean curvature and the 
Gaussian curvature of a sphere are constant, 
and those of a plane are both equal to zero. If 
we use the spherical representation of a surface 
stated in Section G, we can give to the Gauss- 
ian curvature the following geometric mean- 
ing: Let A be the area of the domain enclosed 
by a closed curve C around a point pọ on a 
surface, and let A* be the area of the domain 
on the unit sphere enclosed by the curve that 
is the image of C under the spherical represen- 
tation of the surface. Then the limit of A*/A 
as the closed curve C tends to the point pọ is 
equal to K at po. 

Let us denote by (g^?) the inverse matrix of 
the matrix (g,,) whose elements are coefficients 
of the first fundamental form g,ş du” du’. We 
easily see that g!! = G(EG — F?), g+? =g?! = 
— F (EG — F?), g?? = EK(EG — F?). We intro- 
duce the symbols 


log, Og, 89 
GE) x zx) 





Qu Qu? Ow 


Ge = ad A 
E g” Epy, 6], 


which are called the Christoffel symbols of the 
first and second kinds, respectively. Suppose 
that a surface is given by the vector represen- 
tation X =x(u;, u3), and put x,=0x/du", Xag = 
Ox,/0u?. Then for the derivatives of the Gaus- 
sian frame, we obtain 


y 
Xag = M Stuf, 6,7 —9" HX, 


We call the former Gauss's formula and the 
latter Weingarten's formula. The integrability 
conditions of these partial differential equa- 
tions are 


R” by = Hap H; — H, H, H =g" Hs, 


OH. oH, gd glo 20 
ôu? uf lap) "leif" 


where 


ERAN 
le 


are components of the curvature tensor. The 
former are called the Gauss equations, and the 
latter the Codazzi-Mainardi equations. In 
connection with these equations, Bonnet's 
fundamental theorem states the following: 
Suppose that a positive definite symmetric 
matrix (g,g) and a symmetric matrix (H,,) are 
given that are functions of class C? and C!, 
respectively, defined over a *simply connected 
domain D in R?. If they satisfy the Gauss 
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equations and the Codazzi-Mainardi equa- 
tions, then there exists a surface x = x(u,,u;) 
with the given (g,5) and (H,,) as coefficients of 
its first and second fundamental forms, respec- 
tively. Such a surface is determined uniquely 
if, for an arbitrary fixed point (u?, u$) of D, 

we assign an arbitrary point pg and a frame 
(x?, x9, £9) at po so that x2, x? are orthogonal 
to the unit vector €° and (x2, x7) 2 Gel, u9) as 
the Gaussian frame at pọ. On the other hand, 
we can investigate surfaces as Riemannian 
manifolds defined by the first fundamental 
form. 

A tdiffeomorphism between two surfaces 
preserving arc length is called an isometric 
mapping. The condition of preserving arc 
length is equivalent to the condition that the 
first fundamental quantities of the surfaces 
coincide at each pair of corresponding points, 
provided that we have introduced parameters 
on the two surfaces so that corresponding 
points have the same parameter values. In 
such a case two surfaces are said to be iso- 
metric. From the Gauss equation we can see 
that the total curvature depends only on the 
first fundamental quantities. So K is a quantity 
that is preserved under isometric mappings 
(Gauss's theorema egregium). 

A vector field 4*(t)x, defined along a curve 
u* — u*(t) on a surface is said to be parallel in 
the sense of Levi-Civita along the curve if its 
*covariant derivative along the curve vanishes, 
i.e., if 


5i%/dt d" dt + f ` Io du’/dt=0. 

By 
The length of a vector belonging to a vector 
field that is parallel along a curve C is constant 
along C. The angle of two vectors both be- 
longing to vector fields that are parallel along 
C is also constant along C. Choose two vector 
fields A(,, parallel along a curve C on a surface 
that satisfy 9,541, 4f» = ôa. Then the tangent 
vector to C is expressed by du*/dt = Atv" (t). 
Take a 2-plane and fix an orthogonal coordi- 
nate system on it; then the integral curve C of 
a set of ordinary differential equations dx*/dt 
= Civ" (t) (CZ = A4 (P9)) is called the develop- 
ment of C (— 80 Connections). We denote by 
k the curvature of a curve C of class C? on a 
surface S of class C? and by ø the angle be- 
tween the binormal of C and the normal of 
S at the same point. Then the quantity x,— 
K cos c, belonging to the geometry on the sur- 
face, is called the geodesic curvature of the 
curve at the point. À curve with vanishing 
geodesic curvature is called a geodesic. It satis- 
fies the differential equations 


aut (a) dub du’ 
E gg siecle) 
ds? {By} ds ds 
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The development of a geodesic is a straight 
line (— 364 Riemannian Manifolds). 

Let us consider a simply connected, orient- 
able bounded domain D on a surface such 
that the boundary of D is a simple closed curve 
C that consists of a finite number of arcs of 
class C?. If we denote by a; (i=1,2,...,m) the 
external angles at vertices of the curvilinear 
polygon C (Fig. 7), we have 


| k,ds-- Y Get ll K do — 2n. 
C i=1 D 


This is called the Gauss-Bonnet formula. In 
particular, if all the arcs of C are geodesics, we 
have 


» at || Kdo=2n. 
i=1 D 


This formula implies as special cases the 
following well-known theorems in Euclidean 
geometry and spherical trigonometry: (1) The 
sum of interior angles of a triangle is equal to 
n. (i1) The area of a spherical triangle is pro- 
portional to its spherical excess. The formula 
also implies the following theorem: On any 
closed orientable surface we have f f K do = 
2ny, where y is the tEuler characteristic of the 
surface. We call f f K do the integral curvature 
(or total Gaussian curvature). 





Fig. 7 


I. Special Surfaces in E* 


A surface is called a surface of revolution if it 1s 
generated by a curve C on a plane z when x 
is rotated around a straight line / in z. Then 

l and C are called an axis of rotation and 

a generating curve, respectively. A surface 

of revolution having the x,-axis as the axis 

of rotation is given by the equations x, — 
rcos@, x, =rsin 0, x,= o(r); its first funda- 
mental form is (1 + @’?)dr? +r? d0?. The sec- 
tion of a surface of revolution by a half-plane 
through its axis of rotation is called a meri- 
dian. According as the meridian is a straight 
line parallel to the axis of rotation or a straight 
line intersecting the axis nonorthogonally, 

the surface of revolution is called a circular 
cylinder or a circular cone, respectively. If the 
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meridian is a circle that does not intersect the 
axis of rotation, it is called a torus. 

A surface of class C? whose mean curvature 
H vanishes everywhere is called a minimal 
surface (— 275 Minimal Submanifolds). A 
surface of class C! realizing a relative mini- 
mum of areas among all surfaces of class C! 
with a given closed curve as their boundaries 
is an tanalytic surface such that H =0. Con- 
versely, a surface of class C? with vanishing 
mean curvature is an analytic surface. The 
equation of a surface of revolution with a cate- 
nary as its generating line is given by x? + x— 
a(e*** + e "ale? This surface is called a cate- 
noid and is a minimal surface. Conversely, a 
minimal surface of revolution is necessarily a 
catenoid. For a surface obtained by rotating a 
*Delaunay curve around its base line, the mean 
curvature H is equal to a constant (#0). Con- 
versely, a surface of revolution with nonzero 
constant mean curvature must be such a sur- 
face. A surface with constant Gaussian curva- 
ture is called a surface of constant curvature, 
and is a 2-dimensional Riemannian space of 
constant curvature (— 364 Riemannian Mani- 
folds). A non-Euclidean plane can be repre- 
sented locally as a surface of constant curva- 
ture (^ 285 Non-Euclidean Geometry). Two 
surfaces of the same constant curvature are 
locally isometric to each other. 

Surfaces of revolution of constant curvature 
are classified. The simplest surface of constant 
negative curvature is a pseudosphere, which is 
a surface of revolution obtained by rotating a 
'tractrix x, —acos o, x,=alogtan((@/2)+ 
(1/4)) — asing ( — 1/2 « o « 1/2) around the 
x-axis. A surface generated by a 1-parameter 
family of straight lines is called a ruled surface; 
a hyperboloid of one sheet, a hyperbolic para- 
boloid, a circular cylinder, and a circular cone 
are examples. The first two can be regarded 
as ruled surfaces in two ways. Each of the 
straight lines that generate a ruled surface is 
called a generating line. A surface consisting of 
straight lines parallel to a fixed line and pass- 
ing through each point of a space curve C is 
called a cylindrical surface with the director 
curve C. A surface generated by a straight line 
that connects a certain point o with each point 
of a curve C is called a conical surface. Both a 
cylindrical surface and a conical surface are 
ruled surfaces such that K =0 everywhere. For 
ruled surfaces we have K <0. In particular, a 
surface such that H #0 and K =0 everywhere 
is called a developable surface. A developable 
surface must be either a cylindrical surface, a 
conical surface, or a tangent surface of a space 
curve. There exist ruled surfaces that are not 
developable, for example, hyperboloids of one 
sheet and hyperbolic paraboloids. A nondevel- 
opable ruled surface is called a skew surface. A 
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ruled surface generated by a straight line that 
moves under a certain rule intersecting a 

fixed straight line | orthogonally is called a 
right conoid. If we take | as the x3-axis, the 
surface is given by the equations x, =ucos», 
x, =usinv, x, = f(v). A surface generated by a 
curve C (C may be chosen as a plane curve) 
that moves in the direction of a fixed line / 
with constant velocity and turns around / with 
certain constant angular velocity is called a 
helicoidal surface. If we take | as the x-axis, 
the surface is given by the equations x, = 
UCOSU, X; — usinv, x4 — f(u) - kv, where k is a 
constant and x, — f(x,) is the equation of C. In 
particular, if C is a straight line that intersects | 
orthogonally, then f(u) — 0, and the surface is 
called a right helicoid (or ordinary helicoid). 

A right conoid is both a ruled surface and a 
minimal surface. Conversely, a ruled surface 
that is also a minimal surface is necessarily a 
right conoid. A helicoidal surface with a trac- 
trix as the curve C is called a Dini surface and 
is a surface of constant negative curvature. On 
the normal of a surface S two points q; (i= 1,2) 
are centers of principal curvature at p. The 
locus of each of these points is a surface called 
a center surface of S. When S is a sphere, two 
center surfaces degenerate to a point; if S is a 
surface of revolution, one of the center surfaces 
degenerates to the axis of revolution and the 
other is a certain surface of revolution. If S is 
general, each of the center surfaces is the locus 
of an edge of regression of the developable 
surface generated by normals of S along a line 
of curvature. 

When a 1-parameter family of surfaces S, is 
given by the equation F(x,,x,,x3,t)=0,a 
surface E that does not belong to this family is 
called an enveloping surface of the family of 
surfaces {S,} if E is tangent to some S, at each 
point of E, that is, if E and S, have the same 
tangent plane. The equation of E is obtained 
by eliminating t from F(x,,x,,x3,t)=0 and 
(OF/0t)(x,, x4, x4, t) - 0. In general, if we de- 
note by o(x,, x2, x3) 2 0 the equation ob- 
tained by eliminating t from F 20 and 6F/ét= 
0, then the surface defined by o =0 is either 
the enveloping surface of (S) or the locus 
of singular points of S,. The intersection éi 
of the enveloping surface E of {S,} and S, 
is a curve defined by F(x,,x5,x4,t9) — 0, 
(OF/0t)(x,, x5, x3, t9) - 0. We call C, a charac- 
teristic curve of {S,}. Since {C,} is a family of 
curves on the enveloping surface E, there may 
exist an envelope F on E. In such a case, F is 
called the line of regression of {S,}. The equa- 
tion of F is obtained by eliminating t from F = 
0, 6F/ét=0, and 6?F/ét? =0. In particular, 
the enveloping surface of a family of planes is a 
developable surface, and their characteristic 
curves are straight lines. Moreover, the line of 
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regression coincides with the line of regression 
of the tangent surface. 

If there exists a diffeomorphism between 
two surfaces such that first fundamental forms 
at each pair of corresponding points are pro- 
portional, then the surfaces are said to be in a 
conformal correspondence. In particular, when 
the proportionality factor is a constant, they 
are said to be in a similar (or homothetic) 
correspondence. There exists a local conformal 
correspondence between any analytic surface 
and a plane. Namely, if we choose suitable 
parameters, we can reduce the first funda- 
mental form of any analytic surface to the 
form A(£, n) (d£? + dn’). Such parameters are 
called isothermal parameters. From the exis- 
tence of isothermal parameters we can see 
that there exists a local conformal correspon- 
dence between any two analytic surfaces. The 
assumption of analyticity in these theorems is 
not necessary [10]. If there exists a diffeomor- 
phism between two surfaces under which geo- 
desics are mapped to geodesics, then the sur- 
faces are said to be in geodesic correspondence. 
A surface has a locally geodesic correspon- 
dence with a plane if and only if it is a surface 
of constant curvature. If two surfaces are in 
geodesic correspondence, then with respect 
to parameters with the same values at corre- 
sponding points, we have the relation 


x x a a 
By um EE 


for coefficients of connections of the two 
surfaces. 

By the *Alexander-Pontryagin duality theo- 
rem, a submanifold M in E? that is homeo- 
morphic to $° divides E? into two domains, 
and two points belonging to different domains 
cannot be connected by a broken segment 
unless the segment meets the surface. Such a 
manifold M is called a closed surface. One of 
the two domains consists of those points with 
bounded distance from a point belonging to 
the domain. Such a domain is called the inte- 
rior of the closed surface M. If the set M* con- 
sisting of M and its interior is convex in E?, 
the surface M is called a closed convex surface 
(or ovaloid). 

The Gaussian curvature of an ovaloid can- 
not be negative at any point. A closed surface 
with K >0 must be an ovaloid. Moreover, it is 
known that on any closed surface there exists 
at least one point where K >0 (J. Hadamard). 
If there exists no umbilical point and K is 
strictly positive in a domain on a surface, then 
the two principal curvatures regarded as con- 
tinuous functions on the domain cannot takc 
their local maximum and local minimum 
values at the same point (D. Hilbert). A com- 
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pact, connected surface of class C^ with con- 
stant Gaussian curvature is a sphere. A closed 
surface with K >0 and H —constant is a 
sphere (H. Liebmann). 

A problem proposed by H. Hopf asks 
whether an orientable compact surface with 
constant mean curvature is a sphere. In con- 
nection with this problem, Hopf showed that a 
closed orientable surface of class C? of genus 
zero with constant mean curvature is a sphere. 
If there is a certain relation W(k,,k,)=0 be- 
tween the two principal curvatures k,, k, 

(k, 2 k;) of a surface, the surface is called a 
Weingarten surface (or W-surface). There are 
many interesting results for W-surfaces. As an 
extension of the convex surface, tight immer- 
sions have been studied (— 365 Riemannian 
Submanifolds). 

The Gaussian curvature K is invariant 
under isometries. Hence a sphere is trans- 
formed to a sphere by each isometry. This 
fact is sometimes described as the rigidity of a 
sphere. More generally, if two ovaloids are iso- 
metric, then they are congruent (Cohn-Vossen’s 
theorem). It is known that if we remove a small 
circular disk from a sphere, then the remaining 
portion of the sphere is isometrically deform- 
able. On the existence of closed geodesics on 
ovaloids, G. D. Birkhoff proved the follow- 
ing theorem: There exist at least three closed 
geodesics on any ovaloid of class C?. It is also 
known that there exist surfaces of revolution 
that are not spheres but whose geodesics are 
all closed (— 178 Geodesics). On a hyperbolic 
non-Euclidean compact tspace form of genus 
p (p> 2) there exists a geodesic whose points 
are everywhere dense in it (E. Hopf) (for the 
ergodicity of flows along geodesics on this 
surface — 136 Ergodic Theory; also 126 
Dynamical Systems). 


J. Singular Points of a Surface 


Suppose that a neighborhood of a point p, of 
a surface S in E? is given by a certain vector- 
valued function f of class C” as r= f(u, v). Then 
a point po where two vectors (Of/du),,, (0f/Ov),. 
are linearly independent is called a regular 
point. A point on S that is not regular is called 
a singular point. If for suitable parameters we 
have (6f/du),, =O but (01/0v),, , (07 f/éu’),,. 

(67 f/0u ðv), are linearly independent, then 
such a singular point is called a semiregular 
point. In general, shapes of neighborhoods of 
singular points are extremely complicated. 
However, we note the following: (i) by a small 
deformation of the function f (and its deriva- 
tives of orders at most r) we can reduce py to a 
regular or semiregular point of the deformed 
surface; (ii) if pọ is semiregular, we can choose 
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suitable parameters and curvilinear coordi- 
nates of class C' in E? near p, so that the 
surface S in the neighborhood of the origin po 
is expressed by the equations x,—u?, x = 

v, X4 — ut (H. Whitney's theorem [15]). (The 
higher-dimensional case has also been consid- 
ered (Whitney [16]).) 
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A. Definition 


A mapping (or an operator) A of a function 
space F, to a function space F, is said to be 
a differential operator if the value f(x) of the 
image f = Au (ue F,, f €F;) at each point x is 
determined by the values at x of u and a finite 
number of its derivatives. If u and f are 'distri- 
butions, the definition applies with the deriva- 
tive interpreted in the sense of distributions 
(— 125 Distributions and Hyperfunctions). In 
this article we restrict ourselves to the case of 
linear differential operators and consider only 
those of the form 
P(x, D)= 3 a(x)D*, (1) 
la| m 
where « denotes n-tuples (a, , o, ..., Gel of 
nonnegative integers, called multi-indices; | «| 
the length of «:|a]=%; +2 +... +æ, and D* 
the differential operator D* = Df}: D}... Dis, 
with D; —(—i)0/6x;. The coefficient (—i) is 
sometimes omitted. The coefficients a,(x) 
are functions defined on an open set Q in n- 
dimensional space. We call P(x, D) an ordinary 
differential operator if the dimension n of Q is 
1 and a partial differential operator if n> 2. 
Ordinary differential operators and partial 
differential operators behave quite differently 
in many respects. 
We set 
P(x, 9= d ax), =% Ca, 
where €=(€,,&,,...,¢,)€R” or C". The order 
of P(x, D) is the greatest integer |«| for which 
a, (x) O0. In expression (1) m is assumed to be 
equal to the order, and in that case 
Ris D)= 3 a,(x)* 
la| 2m 
is called the principal part of P(x, D), and the 
corresponding polynomial P,,(x, £) the charac- 
teristic polynomial. 
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Differential operators have been investt- 
gated for a long time in connection with the 
linear differential equations 


P(x, Dju(x)= f(x)  xeQ. (2) 


Except for ordinary differential operators, 
however, it is rather recently that the prop- 
erties of such operators have been studied 
from the general viewpoint. 

We denote a differential operator with 
constant coefficients by P(D). In general, 

P(x, D) is assumed to be a linear differen- 

tial operator with coefficients that are C^- 
functions. However, many of the results for 
C?-coefficients also hold when the coefficients 
are sufficiently differentiable. (For tfunction 
spaces BQ), B(Q), &(O), &(Q), CQ), C^(Q), 
Co (Q), L,(Q), (9), BQ), ete., —^ 125 Distri- 
butions and Hyperfunctions, 168 Function 
Spaces). 

Differential operators are classified accord- 
ing to their properties. The most important are 
the elliptic, hyperbolic, and parabolic types. 

A differential operator P(x, D) is said to be an 
elliptic operator if the characteristic poly- 
nomial P„(x, č) has no real zero except for 
€=0 for each xeQ. Typical examples are 

the Laplacian A= —(D? + ... + DŻ) and the 
Cauchy-Riemann operator 0/0z —(1/2)(0/0x + 
ið/ðy). 

A differential operator is said to be hyper- 
bolic if the associated Cauchy problem is well 
posed (— 325 Partial Differential Equations of 
Hyperbolic Type). The d'Alembertian D2 — 
(D +... + D2) is an example. A differential 
operator of the form 0/0t + P(t, x, D,) is called 
parabolic if P(t, x, D,) is strongly elliptic (— 
Section G) in x. The heat operator iD,,, — A is 
typical. 

These three types of operators appear most 
often in applications, and if n — 2, then any 
operator of order 2 with real coefficients in 
(6/0xj* belongs to one of them at a generic 
point. In other cases, however, there are dif- 
ferential operators that do not belong to any 
of them. 


B. Fundamental Solutions 


If a differential operator P(x, D) with AQ) as 
its domain has a left inverse F that is expressed 
as an tintegral operator with tkernel distri- 
bution (in Z', ,; = 125 Distributions and 
Hyperfunctions F), then the kernel is said to be 
a fundamental solution (or elementary solution). 
F is usually a right inverse of the weak exten- 
sion (— Section F) of P(x, D) and maps 2(Q) 
into &(£2). The image is mapped to the original 
function by P(x, D). Nevertheless, F is not a 
genuine right inverse, and hence the funda- 
mental solution is not unique if it exists. 
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When P(D) is a differential operator with 
constant coefficients, we call a distribution 
E(x) a fundamental solution if it satisfies 


P(D)E(x) Ate) (3) 


where ó(x) is *Dirac's distribution (ô function). 
If E(x) is a fundamental solution in this sense, 
then F(x, y) 2 E(x — y) is the kernel of a left 
inverse of P(D) and is a fundamental solution 
in the sense of the preceding paragraph. 
Every differential operator P(D) with con- 
stant coefficients has a fundamental solu- 
tion in the sense of (3) (Ehrenpreis-Malgrange 
theorem; see L. Hórmander [4] for a proof). 
General operators P(x, D) with variable 
coefficients do not necessarily have funda- 
mental solutions. However, if P(x, D) belongs 
to one of the classical types of operators (ellip- 
tic, hyperbolic, or parabolic), then it has a 
fundamental solution at least locally. (See 
F. John [8] for elliptic operators, J. Leray 
[11] for strongly hyperbolic operators, and S. 
Mizohata [12] and S. D. Eidel’man [9] for 
parabolic operators.) Leray has generalized 
John's method to strongly hyperbolic opera- 
tors in an enormous work [13]. 


C. Ranges of Differential Operators 


Let P(D) be a differential operator with 
constant coefficients. Then it follows from 

the Ehrenpreis-Malgrange theorem that 
P(D)2'(Q)> BQ) holds for any open set Q. 
However, there are differential operators 

P(x, D) with variable coefficients such that for 
any OQ, P(x, D)2'(Q) 5 BQ). H. Lewy first 
devised such an example: 


P(x, D)- —iD, + D —2(x, ix) D,. 


Let C5, ,(x, D) be the homogeneous part of 
order 2m — 1 of the commutator 





P(x, D) P(x, D) — P(x, D) P(x, D). 


Then in order that P(x, D) Z'(Q) 5 A), it is 
necessary that 


Pail ¢)=0 imply Cami ¢)=0 


for all xeQ, ¢e R” (Hórmander's theorem [4]). 
When P(x, D) is a differential operator that 
does not satisfy this condition (e.g., Lewy's 
operator), choose an f(x)e 2(Q) that is not in 
P(x, D) Z'(Q). Then the differential equation (2) 
has no distribution solutions at all. P. Scha- 
pira extended this result to the case of thyper- 
functions (also — 274 Microlocal Analysis). 
Concerning the ranges of differential oper- 
ators P(D) with constant coefficients, we have 
the following detailed results due to L. Ehren- 
preis [28], B. Malgrange [14], and Hórmander 


[4]. 
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An open set Q is said to be P-convex for a 
differential operator P(D) if for each compact 
set K <Q there exists a compact set KE cO 
such that o € Cj (Q) and supp P(—D)gc K 
imply supp o c K’. Convex sets are P-convex 
for any P(D). All open sets are P-convex if and 
only if P(D) is an elliptic operator. 

Theorem: The following conditions are 
equivalent: (i) Q is P-convex; (ii) P(D)2'(0)2 
&(Q); (iii) P(D)6(Q) = &(Q). Property (iii), the 
Mittag-Leffler theorem, and the solvability of 
*Cousin's first problem for the solutions of 
P(D)u=0 are equivalent. 

An open set Q is said to be strongly P- 
convex if for each compact set K <Q there 
exists a compact set K’ such that ue 4" (OQ) and 
supp P( —D)uc K imply supp uc K’; and 
ue &'(Q) and sing supp P( —D)u c K imply sing 
supp c K^; where the singular support of u is 
the closure of the set of all points at which p is 
not a C?-function. Convex sets are strongly P- 
convex, and strongly P-convex sets are P- 
convex. 

Theorem: Q is strongly P-convex if and only 
if P(D) Z'(Q) 2 2'(Q). 

R. Harvey has shown that every domain Q 
is P-convex in the sense of hyperfunctions, i.e., 
the equation P(D)u — f always has a hyper- 
function solution u on Q for any hyperfunction 
f on Q. For the treal analytic functions .e(Q), 
however, P(D).s/(Q) = gi does not hold for 
convex open set Q in general. Hórmander [15] 
gave a necessary and sufficient condition for 
P(D) and Q in order that this hold. 


D. Hypoellipticity 


A differential operator P(x, D) is called hypo- 
elliptic in Q if for any distribution u(x)e Z'(Q), 
Pue C”(Q) implies ue C*(Q). Further, a differ- 
ential operator with real analytic coefficients 
P(x, D) is called analytically hypoelliptic in Q 
if P 1s hypoelliptic and if for any distribution 
u(x)e 2'(Q), Pue.sZ(Q) implies u(x)e (9). 

There are two fundamental facts about 
such operators. Let P have constant coeffi- 
cients; then P(D) is hypoelliptic if and only if 
P(£ +in)=0 and |ë + in| oo imply that |n| 5 oo 
(Hórmander's theorem [4]). Furthermore, P(D) 
is analytic hypoelliptic if and only if P is ellip- 
tic (Petrovski?'s theorem [16]). The heat opera- 
tor is not elliptic, but hypoelliptic. If P(D) is 
elliptic, then actually any hyperfunction u(x)e 
ABQ) such that Pue (Q) is real analytic (R. 
Harvey, G. Bengel). On the other hand, if P(D) 
is not elliptic, there is a hyperfunction solution 
u of Pu=0 that is not a distribution. 

Strictly speaking, the notion of the hypo- 
ellipticity for general differential operators 
was first formulated explicitly by L. Schwartz 
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[27 ]. Before that time, D. Hilbert, E. E. Levi 
and K. O. Friedrichs, and others investigated 
this problem for some elliptic operators, and 
the hypoellipticity was called WeyPs lemma 
(— 323 Partial Differential Equations of Ellip- 
tic Type). 

Similarly to the constant coefficient case 
the following two theorems are fundamental: 
Elliptic and parabolic operators P(x, D) with 
C* coefficients are hypoelliptic (Schwartz [27], 
Mizohata [12]). Elliptic operators P(x, D) 
with real analytic coefficients are analytic 
hypoelliptic (I. G. Petrovskii [16], C. B. Mor- 
rey and L. Nirenberg). The latter holds also 
for hyperfunction solutions (M. Sato and 
Schapira). Moreover, the following result is 
known: If for each compact set K in an open 
set Q, there exists a constant C such that 
| P*u|| e < C**! (mk)! then ue.(Q), where ||-|| x 
denotes the L,-norm on K (H. Komatsu, T. 
Kotake, and M. S. Narasimhan). 

However, as seen by the example D, +ix{ D, 
in R? (k is even), the analytic hypoellipticity 
also holds for nonelliptic operators. Such an 
operator is called a subelliptic operator; subel- 
liptic operators have been investigated by 
Hórmander, Yu. V. Egorov, F. Treves, and 
others [19]. 

Hórmander has obtained a fairly complete 
result on the hypoellipticity of the operators of 
the form 


L=} X? X94 c(x), 

i=1 
where X,,..., X, are homogeneous first-order 
differential operators with real coefficients 
([17]; O. A. Oleinik and E. V. Radkevich 
[18]). Hypoellipticity was investigated exten- 
sively after the introduction of pseudodifferen- 
tial operators and Fourier integral operators 
(— 345 Pseudodifferential Operators). 


E. Differential Operators in Banach Spaces 


We consider differential operators P(x, D) de- 
fined on a domain Q as operators in the func- 
tion spaces C(O) or L,(Q). Differential oper- 
ators of order mè 1 are always unbounded 
operators in the Banach space X — C(Q) or 
L (0). Moreover, their domains of definition 
as operators in X are not generally determined 
uniquely by the expressions P(x, D) as differen- 
tial operators. 

P(x, D) is a linear operator that maps 

o (Q) into X. This operator has a closed 
extension. The minimal closed extension P is 
called the minimal operator of P(x, D) in X. 
We have ue 2(P,) and Pgu— f if and only if 
there exists a sequence o, ECG (Q) such that 
P, >u, P(x, D)o, > f. On the other hand, the 
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closed linear operator P, whose domain is 

the set of all ue X such that P(x, D)ue X in 
the sense of distribution is called the maxi- 
mal operator (or weak extension) of P(x, D). 
We have ue Z(P,) and P,uz- f if and only if 
Qu, ‘P(x, Dp) — €f, p> for any pe Cz (0), 
where "Div, D) is the transposed operator 

‘P(x, D)o(x) SS » ( —DY'(a(x)o(x)). 
Integration by parts shows that P, is an exten- 
sion of P, and that when X is the *dual space 
of a space Y, the weak extension P, in X is the 
*dual of the minimal operator of 'P(x, D) in 

Y. Let X = L,(Q) (1 <p < oo), € be a bounded 
open set with smooth boundary, and P(D) 
have constant coefficients. Then P, coincides 
with the smallest closed extension of the oper- 
ator P(D) having as its domain the set of all 
ue C?(Q)6 X such that P(D)ue X. The latter 
closed extension is called the strong extension. 
The difference between the weak and the 
strong extension is not obvious in the vari- 
able coefficient case. 

P, coincides with P, when Q is the entire 
space and P(x, D) is an elliptic operator whose 
coefficients are constants or close to constants 
(J. Peetre, Medd. Lunds Univ. Mat. Sem., 16 
(1959); T. Ikebe and T. Kato, Arch. Rational 
Mech. Anal., 9 (1962)). In general, we have 
P, #P,. Let P(x, D) be an ordinary differential 
operator with bounded coefficients such that 
la, (x)| 2ó >0 and Q be the bounded interval 
(a, b). Then the domain of P, coincides with the 
set of all (m — 1)-times continuously differenti- 
able functions u such that the (m— 1)st deriva- 
tive is absolutely continuous and P(x, D)ue X, 
while the domain of P, is the set of all func- 
tions u which satisfy in addition the boundary 
conditions 


u(a) 2 u(a)- ...— uf" (a) 
=u(b)=...=u'"")(b)=0. 


(Moreover, u"?(q) 2 u" (b) 20 when X = 
C(a, b).) 

Let G(P5), G(P,)( € X x X) be the tgraphs of 
Po, P,. Then the quotient space # = G(P,)/ 
G(P5) is called the boundary space, and an 
element of the dual #’ of &, i.e., a continuous 
linear functional on G(P,) which vanishes on 
G(P,), is called a boundary value relative to 
P(x, D). For the ordinary differential opera- 
tors discussed above, the boundary space is 
the set of all linear combinations of u(a) and 
u)(b). When P(x, D) is an ordinary differen- 
tial operator, we can explicitly determine the 
boundary values also in the case where the 
interval (a, b) is infinite, the coefficients a,(x) 
are not bounded, or a, (x)0 (xa, b); and we 
can show that 2 is finite-dimensional. When 
P(x, D) is a partial differential operator, 2 is 
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generally of infinite dimension, and the con- 
crete forms of the elements of Z and Z' are 
not known. However, we have some informa- 
tion by M. I. Vishik (Amer. Math. Soc. Transl., 
(2) 24 (1963)) about the boundary values of 
elliptic operators of the second order. Combin- 
ing this with the results by J.-L. Lions and E 
Magenes (J. Anal. Math., 11 (1963)), we can 
obtain information for elliptic operators of 
higher order. 


F. Differential Operators with Boundary 
Conditions 


A closed operator between the minimal opera- 
tor P, and the maximal operator P, is deter- 
mined by designating a closed subspace B 

of the boundary space 2. This operator is 
called the operator with the boundary condition 
B. Particularly important are boundary condi- 
tions expressed in the form 


Q;(x,D)u(x)20, xeoóQ, i=1,...,k, (4) 


with differential operators Q;(x, D) (i=1, ..., k) 
defined on the boundary O of Q. 

When P(x, D) is an ordinary differential 
operator defined on a finite interval and the 
orders of Q; are at most m — 1 (or m), (4) always 
has a definite meaning. However, for partial 
differential operators, we need an interpreta- 
tion of (4), i.e., (4) does not necessarily deter- 
mine the subspace B of # uniquely. 

Let P, be the smallest closed extension of 
P(x, D) with (ue C*(Q)n X |Q;(x, D)u(x) — 0, 
ect P(x, D)ue Xj as its domain. P, is called 
the strong extension of the differential operator 
P(x, D) with boundary condition (4). 

On the other hand, when Q, P, and Qj 
satisfy suitable conditions, we can define the 
transposed differential operator 'P(x, D) with 
the transposed boundary operators R,(x, D) 
(j— 1, ...,I). Namely, there arc differential 
operators R,(x, D) on the boundary such that 
a necessary and sufficient condition for ue 
C*(Q) to satisfy (4) and Pu(x) — f(x) is 


| net! u(x) Pv(x)dx (5) 
Q o 

for all v(x)e C” (Q) with the boundary con- 
ditions Rjv=0, xeoQ. Then the operator P, 
defined by P, u(x) 2 f(x) for the pairs u(x), 
f(x)eX satisfying (5) is called the weak exten- 
sion of the differential operator P(x, D) with 
boundary condition (4). As in the case of 
operators without boundary conditions, the 
weak extension is an extension of the strong 
extension, and generally is the dual of the 
strong extension in the dual space X’ of the 
transposed differential operator with the trans- 
posed boundary condition. 
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Regularity up to the boundary. The funda- 
mental problems for the differential operator 
P(x, D) with the homogeneous boundary 
condition (4) are to determine, for both the 
strong and the weak extensions, the spaces 
of solutions of the homogeneous equations 
Pu — 0 and their ranges. The problems mostly 
reduce to determining when the strong and 
the weak extensions coincide and, including 
this, also to the problem of regularity on the 
closed domain €) containing the boundary 
of the solutions u of the equation P,u- f. 

This problem was solved by Nirenberg for 
strongly elliptic operators in L?(Q) with the 
Dirichlet boundary condition 


Q^'uw(xyOon ! 20, j=1,2,...,m/2, 


and generalized later by F. Browder, M. 
Schechter [20], S. Agmon, Lions, and others 
for elliptic operators in L,(Q) with a kind of 
coercive boundary condition (— Section H). 
Consequently, when Q is bounded and 
smooth, P,, is equal to P, for those operators. 
Write P for P. Then the space N(P) of the 
solutions of Pu — 0 is a subspace of finite di- 
mension, and the range R(P) is a closed sub- 
space of finite codimension. In particular, 
it follows that the *index of P, dim N(P)— 
codim R(P), is finite. 


G. Strongly Elliptic Operators 


A differential operator P(x, D) is said to be 
strongly elliptic if its characteristic polynomial 
satisfies 


ReP,,(x, c)2C|c(">0, č#0. 


Many of the elliptic operators, such as the 
Laplacian, that appear in applications are 
strongly elliptic. L. Garding treated the 
boundary value problem with the Dirichlet 
condition (in the generalized sense) for strong- 
ly elliptic operators. His work initiated the 
general study of differential operators (— 323 
Partial Differential Equations of Elliptic 
Type). His theory is based on the following 
inequality, called Gárding's inequality: 


ez, <C( Re | Puri jul, ), 


uEeCe(Q). 


H. Coercive Boundary Conditions 


The boundary condition (4) is said to be coer- 
cive if 

iV"ul| <C(| Pul + ull) (6) 
holds for any ue C*(Q) that satisfies (4). In 
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order that a differential operator P have a 
coercive boundary condition, it is necessary 
that it be a special type of elliptic operator. In 
this case, Agmon, N. Aronszajn, Schechter, 
and others found conditions under which (4) is 
coercive. Agmon, A. Douglis, and Nirenberg 
show that the inequality (6) holds in L,(Q) and 
in the normed spaces of Hólder continuous 
functions under a suitable condition. The 
classical boundary conditions au + béu/én=0 
(a>0,b>0, a+ b=1) are coercive for elliptic 
operators of the second order (— 323 Partial 
Differential Equations of Elliptic Type). How- 
ever, problems remain when the coefficients 

of Q,(x, D) are discontinuous. In order to 

have coincidence of the strong and weak exten- 
sions or regularity up to the boundary, it is 
necessary neither that P be elliptic nor that the 
boundary condition be coercive. But it is not 
known to what extent these conditions can be 
weakened. At present, major contributions are 
Hórmander's work (Acta Math., 99 (1958)) 
dealing with operators with constant coeffi- 
cients and flat boundaries, and works by J. J. 
Kohn [21], Nirenberg, and Hórmander con- 
cerning noncoercive boundary conditions. 
The latter works are connected with the theory 
of several complex variables, and have at- 
tracted much attention. 


I. Self-Adjoint Extension 


One of the fundamental problems in the case 
X =L,(Q) is whether the minimal operator P, 
has a self-adjoint extension. P, is symmetric 
if and only if P(x, D) is formally self-adjoint: 
P(x, D) -'P(x, D). Under this condition the 
boundary space 47 turns out to be the direct 
sum of two subspaces B, = {(x, P, x)| x e 2(P,), 
P,x — tix) + G(P)). The numbers n, —dim 2, 
are called the deficiency indices of P), and P, 
has a self-adjoint extension if and only if n4 
GH 

H. Weyl gave a method for computing n, 
for the Sturm-Liouville operators: 





d d 
P(x, D)= — (nena ka x e(a, b). 


We say that a (resp. b) is of limit circle type 

if the solutions u(x) of P(x, D)u(x)-- lu(x) 20 
(le C) always belong to L, in a neighborhood 
of a (resp. b) and of limit point type if a solu- 
tion does not belong to L,. This classification 
does not depend on the choice of le C. (1) If 
both a and b are of limit point type, then 

n, —n. =Q and hence P, is self-adjoint. (2) If 
a is of limit circle type and b is of limit point 
type, then n, =n_ — 1, and the self-adjoint 
extensions of P, are the operators P, that are 
obtained from P, by assigning the boundary 
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condition 
p(a)u'(a) cosa -- u(a)sin g — 0. 


The same is true when a and b are inter- 
changed. (3) If both a and b are of limit circle 
type, then n, —n.. —2, and we can impose two 
boundary conditions to obtain the self-adjoint 
extensions. 

These results have been extended by K. 
Kodaira [23] and N. Dunford and J. Schwartz 
[3] to the case of ordinary differential opera- 
tors of order m. There are formally self-adjoint 
operators that have no self-adjoint extensions. 
For example, the operator —id/dx in L,(0, oo) 
has deficiency indices n, —1 zn. —0. 

For partial differential operators, it is dif- 
ficult to determine explicitly all self-adjoint 
extensions of a given formally self-adjoint 
operator because the boundary space is com- 
plicated. If the boundary condition (4) is for- 
mally self-adjoint and coercive, then it follows 
from the results of Schechter and others that 
the differential operator with boundary condi- 
tion (4) is self-adjoint. Furthermore, condi- 
tions under which P, is self-adjoint or has a 
self-adjoint extension are known. The follow- 
ing theorem is often used as a condition of the 
latter type. If a tsymmetric operator defined on 
a dense subspace of a Hilbert space X is posi- 
tive definite: 


(Tx,x)20,  xe92X(T), 


then there is a positive definite self-adjoint 
extension T (Friedrichs's theorem). The self- 
adjoint extension obtained by this theorem is 
called the Friedrichs extension. 


J. Generators of Semigroups 


From the point of view of probability theory, 
W. Feller investigated the extensions of the 
Laplacian d?/dx? and similar operators that 
are the generators of order-preserving semi- 
groups. Recently various attempts have been 
made to generalize his results to the multi- 
dimensional case (— 115 Diffusion Processes; 
378 Semigroups of Operators and Evolution 
Equations). 

P. D. Lax and A. N. Milgram proved that 
if P(x, D) is a strongly elliptic operator, then 
— P(x, D) with the Dirichlet condition in L,(Q) 
is the generator of a semigroup [1]. 


K. Boundary Value Problems 


There are two methods of solving the inhomo- 
geneous boundary value problem 


P(x, D)u(x)= f(x), | xeQ, 
Qiíx,D)u(x)2gí(x)  xeOQ, i=1,...,k. 


426 


In the first method, we take a function v(x) 
that satisfies Q;(x, D)v(x) ^ g;(x) and reduce 
the problem to the homogeneous one for ug = 
u — v. In the second method, we consider the 
pair P(x, D) and Q,(x, D) as an operator that 
maps a function u to the pair of functions 
(Pu, Q;u) and investigate it directly. The latter 
method was adopted by Peetre and Hór- 
mander [4]. 


L. Estimates in Weighted Spaces 


J. F. Treves, Hórmander [4], and H. Kumano- 
go obtained estimates similar to (6) in L,- 
spaces relative to the weighted measure 

w,(x) dx instead of the usual L,-spaces, and 
applied them to the proof of the uniqueness 
of Cauchy problems for differential equations 
with variable coefficients (— 321 Partial Dif- 
ferential Equations (Initial Value Problems)). 
Hórmander [22] applied similar estimates to 
the proof of the "fundamental theorems of 
Stein manifolds. 


M. Eigenfunction Expansions 


When a self-adjoint operator P in the Hilbert 
space L,(Q) is a self-adjoint extension of a 
differential operator P(x, D), the tspectral 
decomposition of P is concretely expressed by 
the expansion of functions ue L,(Q) into eigen- 
functions of P(x, D). 

If Q is bounded, P(x, D) is an elliptic oper- 
ator defined on a neighborhood of Q, and 
the boundary condition is coercive, then the 
tspectrum of P is composed solely of eigen- 
values, and the eigenvectors of P are eigen- 
functions of P(x, D) in the classical sense and 
are of class C? up to the boundary. 


N. Asymptotic Distribution of Eigenvalues 


If P= —A, the number v(A) of eigenvalues less 
than 4 satisfies the asymptotic relation 


An? A 


t , À 
tute: € 479 


v(A) 
regardless of the shape of the domain and the 
boundary condition, where n is the dimension 
and A is the volume of Q [2]. This was first 
proved by Weyl and extended by T. Carleman, 
Gárding, and others to the case of operators 
of higher order with variable coefficients (— 
323 Partial Differential Equations of Elliptic 


Type). 
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O. Weyl-Stone-Titchmarsh-Kodaira Theory 


When Q is unbounded or the coefficients of 
P(x, D) have singularities near the boundary of 
Q, the spectrum of P may have a continuous 
part. 

Let P(x, D) be an ordinary differential oper- 
ator on an interval (a, b). Then for each Je 
C the equation (P(x, D) — 4)p(x) 20 has m 
linearly independent solutions q,(x, A) (k= 
1, ..., m), and any solution is represented as 
a linear combination of them. Weyl and M. H. 
Stone obtained the spectral decomposition of 
P (of the second order) in the form 


u(x)- Y | p(x, A) dp, (A) 


jk-l 


b 
«| ol, A)u(y) dy, 


a 


Pu(x)= H 1 Aj, 4) dp (4) 


b 
d ay, A) uy) dy, 
where the p,,{4) are functions of bounded 
variation and their variations represent the 
spectral measure. This formula shows that 
linear combinations of the ¢;(x, 4) form gen- 
eralized eigenfunctions even when 4 belongs to 
the continuous spectrum. Later, E. C. Titch- 
marsh and Kodaira gave a formula to obtain 
the density matrix p;(4) and completed the 
theory (Titchmarsh [6], Kodaira [23], Dun- 
ford and Schwartz [3]). This expansion theo- 
rem makes it possible to deduce in a unified 
manner expansion theorems for classical 
special functions, such as the t Fourier series 
expansion theorem, the expansions by "Her. 
mite polynomials and tLaguerre polynomials, 
the tFourier integral theorem, and various 
expansions in terms of tBessel functions [6, 7]. 

The relation between the coefficients of the 
differential operator and the spectral distri- 
bution of P is important in applications and is 
the subject of many papers [3, 5,6]. 

For example, let P(x, D)= —d?/dx? + q(x) 
and Q —(—oo, oo). If g(x) oo as |x| 00, then 
P(x, D) is essentially self-adjoint, the spectrum 
is entirely composed of the point spectrum, 
and a detailed estimate of the jth eigenvalue 4; 
is also known [6]. If q(x) converges rapidly to 
0 as |x| oo, then P(x, D) is essentially self- 
adjoint, and there is only a continuous spec- 
trum for 40 and eigenvalues for 4 «0 with 
at most 0 as accumulation point. If q(x) is a 
periodic function of x, then P(x, D) is again 
essentially self-adjoint, and the spectrum con- 
sists of a continuous spectrum decomposed in 
a sequence of nonoverlapping intervals. The 
converse problem of determining q(x) when 
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the spectral measure is given has been studied 
by I. M. Gel'fand and B. M. Levitan (Amer. 
Math. Soc. Transl., (2) 1 (1955); original in 
Russian, 1951). 


P. Eigenfunction Expansion for Partial 
Differential Operators 


The theory of eigenfunction expansion for 
partial differential operators with continuous 
spectra is not complete, as it is for ordinary 
differential operators. What causes difficulties 
is that the solutions of (P(x, D) — 4)u —0, which 
should be the generalized eigenfunctions, form 
an infinite-dimensional space, and except for 
special cases it is impossible to introduce 
convenient parameters in it. Many proofs are 
known for a general result, saying that any 
self-adjoint elliptic operator has an eigenfunc- 
tion expansion into generalized eigenfunctions 
of the form 


© k(A) 


3. ois, A)dp,(A) I o;(y, A)u(y) dy, 


-œ j=l 


u(x) = 


[10]. There are few operators, however, for 
which we know how to construct (x, 4) and 
the measure dp,(A). The Fourier transform 
gives the expansion for operators with con- 
stant coefficients defined on the whole space. 
By means of a generalized form of the Fourier 
transform, Ikebe gave an eigenfunction expan- 
sion for the Schrödinger operator —A+q(x) 
in R? under the condition that q(x)e L, and 
O(|x| ?7*) as |x| oo [24]. Y. Shizuta, Mizo- 
hata, Lax and R. S. Phillips, N. A. Shenk, 
Ikebe, D. K. Fadeev, and others proved the 
same results for similar operators defined 
on an exterior domain with a bounded set 
deleted from a higher-dimensional Euclidean 
space. These theories are closely related to the 
'scattering theory of the Schródinger equa- 
tion (—id/dt — P)u=0 or the wave equation 
(d?/dt? + P)u=0 associated with those opera- 
tors. Lax and Phillips have developed the 
latter scattering theory [26] (— 375 Scattering 
Theory). 

Many of the problems in quantum mechan- 
ics reduce to finding the spectral distribution 
of self-adjoint partial differential operators. 


Q. Expansion Theorems for Non-Self-Adjoint 
Operators 


À kind of eigenfunction expansion theorem 
may hold for non-self-adjoint differential 
operators or for non-Hilbert spaces. (See 
papers by the Russian school for ordinary 
differential operators and those by Browder, 
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Agmon, and others for partial differential 
operators.) 


R. Systems of Differential Operators 


We have so far dealt with single differential 
operators that map functions u to functions 
J. A linear differential operator that maps a 
p-tuple (u,, ..., up) of functions to a q-tuple 
(fi... Ja) of functions can be written as 


fíx)- » P(x, D)uj(x), i= l, ECH 


where P(x, D) « (P;(x, D)) is a matrix of single 
differential operators. Such a matrix is called 
a system of differential operators. A system 
P(x, D) is said to be underdetermined if p >q, 
determined if p =q, and overdetermined if p < 
q. Many propositions that hold for single 
operators hold for determined systems under 
appropriate conditions. 

However, there is a fundamental difference 
between overdetermined (underdetermined) 
systems and determined systems, as is seen 
from the theory of several complex variables, 
which is the theory of a typical overdeter- 
mined system 0/0z —(0/0z;). The theory is 
much more difficult for overdetermined (under- 
determined) systems. The general theory of 
overdetermined and underdetermined systems 
with constant coefficients has been constructed 
by Ehrenpreis [28], Malgrange, V. Palamodov, 
and Hormander [22] for C*-functions and 
distributions. It has been extended by H. Ko- 
matsu to the case of hyperfunctions. T. Miwa 
has discussed the same problem for real ana- 
lytic functions. 


S. Symmetric Systems of the First Order 


Determined systems of the first order are 
important in applications. Many problems in 
mathematical physics are formulated in terms 
of them. Also, single equations of higher order 
can be reduced to determined systems of the 
first order by regarding the derivatives as 
unknown functions. In some cases determined 
systems of the first order are easier to handle 
than single operators of higher order. In partic- 
ular, a system of differential operators 


P(x, D) 3 Ai(x)0/0x; - B(x) 


is said to be a symmetric positive system if the 
matrices A;(x) and B(x) satisfy the following 
conditions: A* = A; B+ B* + > 06Aj/0x; is 
positive semidefinite. Symmetric positive sys- 
tems have been studied in detail by Friedrichs 
[25], Phillips, C. S. Morawetz, Lax, and 
others. 
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Let R be a tcommutative ring with a unity 
element 1. Ifa map 6 of R into R is such that 
for any pair x, y of elements of R, (i) 6(x + y) 
=6dx + ôy, and (il) ó(xy) 2 Óx- y -- x: dy, then 6 
is called a derivation (or differentiation) in R. A 
ring R provided with a finite number of mutu- 
ally commutative differentiations in R is called 
a differential ring. In this article we consider 
only the case where R contains a subfield that 
has the unity element in common with R. In 
particular, if R is a field, we call it a differential 
field. 

In the above definition of differential ring, it 
is not necessary to mention the *charactertstic 
of the subfield contained in R. However, to 
make it more effectively applicable in the case 
of nonzero characteristics, we may define 
differential rings using higher differentiation in 
place of the differentiation defined above. If a 
sequence ó — {ô,} of maps dp, Ai, 62,... of R 
into R satisfies the following conditions (i)- (iv) 
for any pair x, y of elements of R and any pair 
^, H of nonnegative integers, then ô is called a 
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higher differentiation in R: (i) 9; (x + y) 2 ó;x 
+ Ó; y; (i) ô (xy) È 0,x : òy (the addition is 
performed over all pairs a, D of nonnegative 
integers that satisfy «+ f = A); (iii) 6,(6,x) = 


À 

ferentiations ô= (0,) and A 2 (5, are said to 
be commutative if and only if 6, and A, com- 
mute for all pairs A, u of nonnegative integers. 
Higher differentiations were introduced by H. 
Hasse (1935) for the study of the field of *alge- 
braic functions of one variable in the case of 
nonzero characteristics. 

These two definitions of differential rings 
coincide if the characteristic of R is zero. For 
simplicity, we shall restrict ourselves to that 
case. 

Let à, ..., à, be the differentiations of the 
differential ring R. If x is an element of R, 

OP OF ... Ópm X(S,, 55, ..., Sm are nonnegative 
integers) is called a derivative of x. We call x 
constant if and only if ó,x — ... = „X —0. An 
"ideal a of R with ója c a (i— 1,2, ..., m) is 
called a differential ideal of R. If it is a *prime 
ideal (semiprime ideal (1.e., an ideal containing 
all those elements x that satisfy x’ ea for some 
natural number g)), then a is called a prime 
differential ideal (semiprime differential ideal). 
A subring S of R with ôS C S can be regarded 
as a differential ring with respect to the differ- 
entiations ó,, ..., Ôm. We call S a differential 
subring of R and R a differential extension ring 
of S. 

Let X,,..., X, be elements of a differential 
extension field of a differential field K with the 
differentiations ô}, ..., m, and let ôẸ 05 ... 
ôm X; (s, 20,...,5,20, 1<i<n) be talgebra- 
ically independent over K. The totality of their 
polynomials over K, which forms a differential 
ring, is called the ring of differential polyno- 
mials of the differential variables X,,..., X, 
over K, and is denoted by K (X,,..., X,j. Its 
elements are called differential polynomials. 
For this ring of differential polynomials we 
have an analog of !Hilbert's basis theorem in 
the ring of ordinary polynomials, Ritt's basis 
theorem: If we are given any set Nt of differen- 
tial polynomials of X,,..., X, over K, we can 
choose a finite number of differential polyno- 
mials P,, ..., P, from IR such that each element 
Q of Mi has an integral power Q? equal to a 
linear combination of P}, ..., P, and their de- 
rivatives, where the coefficients of the linear 
combination are elements of K [ X,, ..., Xa} 
This theorem implies that in the ring of differ- 
ential polynomials, every semiprime differential 
ideal can be expressed as the intersection of a 
finite number of prime differential ideals; if this 
expression is tirredundant, it is unique (— 67 
Commutative Rings). 

The equation obtained by equating a dif- 


A 
( S j 0, 4X; (iv) ó9x =x. Two higher dif- 
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ferential polynomial to zero is called an alge- 
braic differential equation. Concerning these 
equations, we are able to use methods similar 
to those used in talgebraic geometry in study- 
ing the usual algebraic equations. J. Ritt made 
interesting studies on solutions of algebraic 
differential equations by such methods, prin- 
cipally in the case when the ground field K 
consists of fmeromorphic functions. 

Since that time, basic study of differential 
rings and fields has been fairly well organized 
and has developed into theories such as the 
following two: 

(1) Picard-Vessiot theory. This is a classical 
theory of flinear homogeneous differential 
equations originated by E. Picard and E. 
Vessiot; it resembles the *Galois theory con- 
cerning algebraic equations. The Galois group 
in this case is a linear group, and its structure 
characterizes the solution of the differential 
equation. E. Kolchin introduced the general 
concept of the Picard-Vessiot extension field of 
a differential field and studied in detail the 
group of differential automorphisms (i.e., the 
group of all those automorphisms that com- 
mute with the differentiations and fix elements 
of the ground field), thus making the classical 
theory more precise and more general. 

(2) Galois theory of differential fields. Gen- 
eralizing the concept of the Picard-Vessiot 
extension, Kolchin introduced the notion of 
the strongly normal extension field and es- 
tablished the Galois theory for such exten- 
sions. In this theory, we see that the Galois 
group is an 'algebraic group relative to a 
tuniversal domain over the field of constants 
of the ground field. Conversely, every algebraic 
group is the Galois group of a strongly normal 
extension field. We also see that a strongly 
normal extension can be decomposed, in a 
certain sense, into a Picard-Vessiot extension 
and an Abelian extension (i.e., an extension 
whose Galois group is an tAbelian variety) 

(— Kolchin [2-5], Okugawa [6 ]). 
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A. General Remarks 


Differential topology can be defined as the 
study of those properties of tdifferentiable 
manifolds that are invariant under ‘diffeo- 
morphisms. The basic objects studied in this 
field are the topological, combinatorial, and 
differentiable structures of manifolds and the 
relationships among them. Some of the re- 
markable contributions, such as H. Whitney's 
embedding theorem [1], the triangulation theo- 
rems of J. H. C. Whitehead and S. S. Cairnes 
[2, 3], and Morse theory [4], were made in 

the 1930s. In the late 1950s, outstanding re- 
sults were obtained by R. Thom, J. W. Milnor, 
S. Smale, M. Kervaire, and F. Hirzebruch, 
among others. Differential topology thus be- 
came a new, fascinating branch of mathematics. 


B. Differentiable Structures 


We assume that all manifolds are tparacom- 
pact. Let M be a ftopological manifold. A 
C’-equivalence class of tatlases of class C” 

(1 rx oo) on M is called a C’-structure on M. 
Any C’-structure on M contains an atlas of 
class C? on M, and its C*-equivalence class is 
uniquely determined (Whitney [1]). This C^- 
structure is called a differentiable structure on 
M compatible with the given C’-structure. 
Moreover, any C?-structure admits a treal 
analytic structure compatible with it in this 
sense (Whitney [1]). A C?-manifold is also 
called a smooth manifold, and a differentiable 
structure a smooth structure. Let Do, D, be 
differentiable structures on M. If two C®- 
manifolds (M, Do) and (M, D,) are not tdif- 
feomorphic, we say that the differentiable 
structures Do, D, are distinct (— 105 Differen- 
tiable Manifolds). Milnor defined a certain 
invariant of differentiable structure using the 
Hirzebruch index theorem (— 56 Character- 
istic Classes) and proved that there are several 
distinct differentiable structures on the 7- 
dimensional sphere S” (Milnor [5]). Milnor's 
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example of such structures was: Let f,: $?— 
SO(4) be the mapping defined by f,(c): = 
o"to/, where k is an odd integer, h, j are inte- 
gers determined by h cj « 1, h—j — k, and 
c, t are tquaternions of norm 1. Let Mj be 
the total space of the S?-bundle over $* cor- 
responding to the mapping f,. This is an 
oriented closed manifold with the naturally 
defined differentiable structure. Moreover, 
for each k, Mj is homeomorphic to the 7- 
dimensional sphere S". But if k, | are odd and 
k?  ? (mod 7), M, is not diffeomorphic to 
M? . Differentiable manifolds (such as Mẹ) that 
are homeomorphic, but not diffeomorphic, to 
the natural sphere are called exotic spheres. 
After the discovery made by Milnor, many 
topological manifolds other than the 7- 
dimensional sphere and possessing several 
distinct differentiable structures have been 
found (N. Shimada, I. Tamura). Moreover, 
topological manifolds admitting no differen- 
tiable structures have been constructed (M. 
Kervaire [6], Smale, Tamura [7], J. Eells and 
N. Kuiper, C. T. C. Wall [8]). On the other 
hand, manifolds of dimension <3 admit 
unique differentiable structures. 

We can introduce a *Riemannian metric 
on a C?-manifold. Let M{ and M3 be C*- 
manifolds (k-dimensional and n-dimensional, 
respectively) and f: Mi M3 be an timmer- 
sion. We fix a Riemannian metric on M5. For 
each point p of M$, let N,( f) be the linear 
space of all tangent vectors of M5 at f(p) that 
are orthogonal to the ttangent space of f(U(p)), 
where U(p) is a small open neighborhood of p 
in Mj. Then (N,Cf)|pe M11 forms an (n—k)- 
dimensional tvector bundle v, over M$, called 
the normal bundle of the immersion f. The 
equivalence class of v, is independent of the 
choice of Riemannian metric on M5. When f 
is an fembedding, we denote the submanifold 
f(M5) by L*. Let N,(L*) be the set of all points 
whose distances (with respect to the Riemann- 
ian metric) from L* are <e. If L* is compact 
and e is sufficiently small, N,(L*) is an n- 
dimensional submanifold of M5 with bound- 
ary and is uniquely determined independently 
of the choice of up to diffeomorphism. This 
submanifold is called a tubular neighborhood of 
L in M5 and is denoted by N(L^. The interior 
of N(L*) is called an open tubular neighbor- 
hood. The total space E(v,) of the normal 
bundle v, of the embedding f is diffeomorphic 
to the open tubular neighborhood of L* in Mi. 


C. C’-Triangulation and the Smoothing 
Problem 


Let (K, f) be a ttriangulation of the C’- 
manifold M" (1 <r < oo) ( f:| K|-5 M") satisfying 
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the following conditions: (i) For any closed n- 
simplex o of K, f [o:o — M" is a C'-mapping; 
(ii) for any point p of c, the tJacobian matrix 
of f |c has rank n at p. Then we say that (K, f) 
is a C’-triangulation of the C'-manifold M", 
and the C’-structure of M” is compatible with 
the triangulation (K, f). Concerning C’- 
triangulations of C’-manifolds, we have the 
following results (S. Cairns [2], J. H. C. White- 
head [3]): (i) any C'-manifold (1 <r < oo) has a 
C’-triangulation, and any C’-triangulation of 
the boundary can be extended to the whole 
manifold; (ii) for a C’-triangulation (K, f) of 

a C'-manifold M", the triangulated space 

(K, f; M") is a !combinatorial manifold; (iii) for 
two C’-triangulations (K,, f;) and (K3, f2) of 

a C'-manifold M", the triangulated spaces 

(K,, fi; M") and (K,, f2; M") are combinatori- 
ally equivalent. 

Conversely, for a combinatorial manifold 
(K, f; M"), a differentiable structure on M that 
is compatible with the triangulation (K, f) is 
called a smoothing of M. The question of 
whether a smoothing of a combinatorial mani- 
fold exists is called the smoothing problem. We 
have some criteria for its solvability. One is 
expressed in terms of transverse fields, and 
another in terms of *microbundles (— 147 
Fiber Bundles P). We also have the following 
method using the theory of *obstruction (J. 
Munkres [9]; M. W. Hirsch and B. Mazur 
(10]). Let I, be the group of oriented differen- 
tiable structures on the tcombinatorial k- 
sphere (an exact definition of this group is 
given in Section I). Let Ki be the i-*skeleton 
of a combinatorial n-manifold (K, f; M") and 
a a smoothing of a neighborhood U of f(K‘) 
in M”. Then there exists an obstruction cocycle 
C,e Z'™ (K,T, satisfying the following con- 
ditions: (i) if C,(a'*')=0 for an (i+ 1)-simplex 
o'*? of K, the smoothing a can be extended 
over a neighborhood of f(K'U a'*!), and vice 
versa; (ii) if there exists another smoothing a 
of a neighborhood of f(K') that coincides with 
the smoothing « of a neighborhood of f(K*^!), 
then C, — C, is a tcoboundary; conversely, for 
any (i+ 1)-coboundary b with coefficients in 
I; there exists a smoothing a of a neighbor- 
hood of f/(K?) such that ba C, — C,. Using 
this method we can prove that for n € 7 all 
combinatorial manifolds of dimension n are 
smoothable. 


D. Embedding and Immersion Theorems 


In the following, M" and X" are C?-manifolds 
of dimensions n and m, respectively. Two 
timmersions fo, fı: M" X" are said to be 
regularly homotopic if there exists a thomotopy 
f: M" X", 0<t <1, such that f, is an immer- 
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sion for each t and the induced mapping (tdif- 
ferential of (di, T(M")  T(X") on the tan- 
gent spaces naturally gives rise to a continuous 
mapping over T(M") x I. Two tembeddings are 
isotopic if they are regularly homotopic and 
the homotopy f, is an embedding for each t. 
Given M" and X", a fundamental problem of 
embedding (immersion) theory is to classify the 
embeddings (immersions) of M" in X" accord- 
ing to their isotopy (regular homotopy) classes. 
Whitney proved that a continuous mapping 

f: M" X" can be approximated by an immer- 
sion if mz 2n and by an embedding if m z 2n + 
1 [1]. The following results are also due to 
Whitney [1]: Any two immersions of M" in X" 
that are homotopic are regularly homotopic if 
m 2 2n - 2, and any two embeddings of M" in 
X" that are homotopic are isotopic if m 2 2n 

t 3. The range m z 2n 4- 3 is called the stable 
range of embeddings. 

In [11,12] (1944) Whitney improved his 
classical theorems and showed that M" can 
always be immersed in R?" ! for n» 1 and M" 
can always be embedded in R?". The methods 
used in his proof have played an important 
role in the subsequent development. Classifi- 
cation of immersions of the n-sphere S” in R” 
was determined by Smale [13]. Let p: T(M")5 
M" and p': T(X") X" be the projections of 
the tangent bundles of M" and X", respec- 
tively. A mapping o: T(M") T(X") is a linear 
fiber mapping (linear fiber map) if, for each 
xe M", o(p ! (x)) is contained in a fiber of 
T(X") and o|p ! (x) is a linear mapping of 
rank n. A linear homotopy o. ` T(M") T(X") 
(0x t & 1) is a homotopy such that each o is a 
linear fiber mapping. The following theorem 
of Hirsch [14] is fundamental to immersion 
theory: Assume n «m. If o: T(M") T(X") is 
a linear fiber mapping, the mapping 9: M" 
X" induced by ¢ can be approximated by an 
immersion f: M" X" such that df and o are 
linearly homotopic. Two immersions f, g: 
M"— X" are regularly homotopic if and only if 
df and dg are linearly homotopic. If M" is im- 
mersible in R"*', where m» n with r linearly 
independent fields of tnormal vectors, then M" 
is immersible in R”. In particular, if M" is a z- 
manifold (— Section I), M" is immersible in 
Krk 

Let &(M") be the set of isomorphism classes 
of real tvector bundles over M", and consider 
d GMT KO(M") (— 237 K-Theory). An 
element Ze KO(M") is said to be positive if Z 
is in the image of 0. If &yeKÜ(M TL the geo- 
metric dimension of €), written g(čo), is the 
least integer k such that čo +k is positive. Then 
Hirsch's theorem [14] can be expressed as 
follows: M" is immersible in R"** (k >0) if and 
only if g(n — «(M")) <k, where t(M") is the 
tangent bundle of M". 
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A. Haefliger [15] obtained the following 
important result: Let M" be compact and 
*(k — 1)-connected, and let X" be k-connected. 
Then any continuous mapping of M" in X" is 
homotopic to an embedding if 2k « n, mz 2n— 
k+1, and any two homotopic embeddings 
of M" in X" are isotopic if 2k « n-- 1, mz 2n— 
k+2. Thus if m» 3(n - 1)/2, any two em- 
beddings of S” in R” are isotopic. The range 
m» 3(n+ 1)/2 is called the metastable range. 
Haefliger further classified the embeddings of 
S^"! in R^" and showed the existence of em- 
beddings of S^" ! in R*" that are not isotopic 
to the natural one. More complete results for 
the classification of embeddings of thomotopy 
n-spheres in S" were obtained by J. Levine 
[16]. Furthermore, Levine proved the follow- 
ing unknotting theorem in higher-dimensional 
knot theory [16]: Let f:S" ? 5 S" bea C”- 
embedding for n 2 6; then f(S"^?) is unknotted 
if and only if S" — f(S" ?) is homotopy equiva- 
lent to $! (— 235 Knot Theory G; also 65 
Combinatorial Manifolds D). 

We list some results about embeddings and 
immersions. If M" is noncompact, M" can 
always be embedded in R?" !; if M" is a non- 
compact z-manifold, M" can always be im- 
mersed in R”; if M" is compact and orientable 
and n2 4, M" can always be embedded in 
R?^^1. 


E. Nonembedding and Nonimmersion 
Theorems 


We denote the *total Stiefel- Whitney class 

of M" by w(M") and the total Pontryagin 
class of M" by p(M") (— 56 Characteristic 
Classes). Then (w(M")) ! (e H*(M"; Z;)) and 
(p(M")) ! (e H*(M"; Z)) can be written as 
W(M") = E s (M") (se HM"; Z,)) and p(M") = 
> pM") (p:e H*'(M"; Z)). Then the property 
of characteristic classes for the Whitney sum 
implies the following theorem: If M" can be 
immersed in R"**, then w,(M")=0 for i» k and 
p,( M")-0 for i> [k/2]. Furthermore, if M" can 
be embedded in R"**, then w,(M")=0. As an 
application, these results yield the nonembed- 
ding (nonimmersion) theorem for projective 
spaces ( Appendix A, Table 6.VII). Sharper 
theorems were obtained subsequently. In 
particular, Atiyah proved the following: Let 

4! (i2 0, 1, ...) be texterior power operations 
(— 237 K-Theory), and let af be the operations 
defined by the formal power series 0/2, ytti = 
(Ze 4/t)(1—1t) !. Then »'(n—1(M"))=0 for 
i7 k (iz k) if M" can be immersed (embedded) 
in R"**, Furthermore, we have an interesting 
result for the differentiable case. For any posi- 
tive integer q, there exists a differentiable 
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manifold M" such that M" is immersible in R* 
but not embeddable in R**4. 


F. Handlebodies 


Let W be a C?-manifold with boundary ôW. 
Then the boundary OW has a neighborhood U 
that is homeomorphic to OW x R, , where R, 
=[0, oo). Let W,, W, be C?-manifolds with 
boundaries and f: 0W, —0W, be a diffeomor- 
phism. Then the quotient space M of W,UW, 
obtained by identifying points corresponding 
under f has a natural differentiable structure. 
This construction of the C^-manifold M from 
the C?-manifolds W,, W, is called pasting 
together the boundaries. 

Now we consider the ftopological product 
W, x W, of manifolds W,, W, with boundaries 
OH, OM. W, x W, —0W, x OW, has a natural 
atlas of class C^. By introducing a suitable 
atlas of class C? in a neighborhood of dW, x 
OW, in W, x W, we obtain a C^-manifold 
with boundary homeomorphic to W, x W,. 
More generally, let M be an n-manifold with 
boundary, let N be a finite union of submani- 
folds of dimension € n—2 in 0M, and suppose 
that M has a corner along N. Extension of the 
C?-structure of M — N over M as in this para- 
graph is called straightening the angle. 

Let D" be the oriented unit *n-disk in the 
n-dimensional Euclidean space R", Mj and 
M3 be oriented compact C?-manifolds, and 
f: D* — M? (i= 1,2) be tC®-embeddings, f, 
*orientation-preserving and f, forientation- 
reversing. Then, pasting together the bound- 
aries of Mj — Int f, (D") and M5 — Int f,(D") by 
the mapping f, 0 f, !, we obtain an oriented 
C?-manifold, called the connected sum of M? 
and M3 and denoted by M? # MI The con- 
nected sum Mj # MI has the orientation in- 
duced from those of MI and M3, and its dif- 
ferentiable structure is uniquely determined 
independently of the mappings f;. Let S" be 
the natural n-dimensional sphere. Then we 
have M" # S" œ M" (here = means diffeomor- 
phic), (M, # M,)# M, ~ M, #(M,#M,), M, # 
M,=M,#M,. 

Let M" be a manifold with boundary and 
f:(0D') x D" * 20M" be a C?-embedding. 
Then we call the quotient space X (M"; f; s) of 
M"U(D? x D" 5) obtained by the identification 
of corresponding points under f the manifold 
with a handle attached by /. Also, we call the 
construction of X(M"; f; s) from M" attaching 
a handle and call D° x D”~* an s-handle. After 
straightening the angle, X(M"; f; s) is consid- 
ered naturally a C?-manifold with boundary. 
Let f,:0D? x D 20M" (i 1,..., k) be em- 
beddings whose images are mutually disjoint. 
Then similarly, using embeddings f,, ..., fp, we 
can define a C?-manifold with handles X(M"; 
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fi fs) In particular, X(X(...(X(X(D"; 
fuis £25 82) ) fiz sj) is called a handlebody. 

Let M" ^! be an oriented (n — 1)-mani- 
fold and f:0D* x D"*S(M"'! —0M" Ian 
orientation-preserving C?-embedding. Then, 
by straightening the angle, the quotient space 
of (M""! —Int f(dD* x D" S)U(D* x 0D" S) 
obtained by the identification of points cor- 
responding under f | 0D? x 0D" is a C*- 
manifold y(M"'^!; f). We say this manifold is 
obtained by a spherical modification (or sur- 
gery) of type (s, n — s) from Mr T The manifold 
X(M"^1; f) has a natural orientation, and 
Oy(M"^!; f) 0M"! . The process of spherical 
modification has the following relation to 
that of attaching a handle: Let W be an n- 
dimensional manifold and f:éD* x D" *0W 
an embedding. Then 0X(W; f; s) (OW; f). 
When WM" ! x[0,1] and f:0D* x D" 53 
M x (1, 0X(W; f; s) - x(M x (15; £)UM x {0}, 
and therefore M is cobordant (— Section H) 
to (M ; f). Conversely, let M, be cobordant 
to M,. Then we can obtain M, from M, by . 
a finite sequence of spherical modifications 
(A. Wallace [17], Milnor). Let M" be an n- 
dimensional C?-manifold and f: M*—R! bea 
C*?-function. If f satisfies the following con- 
ditions, then it is called a nice function: (i) All 
‘critical points of f are tnondegenerate; (ii) for 
any critical point p of f, the *index (— 279 
Morse Theory) of f at p is equal to f(p). We 
have the following theorems. 

1. Let M be a compact C?-manifold. Then 
there exists a nice function on M (M. Morse, 
Smale [18], Wallace [17]). 

2. Let M be a compact C?-manifold and 
f: MoR! a C?-function all of whose critical 
points are nondegenerate. Suppose that the 
number of critical points on f! [—6,c] is k 
and that they are all contained in f^! (0). Sup- 
pose further that the indices of these critical 
points are all equal to s. Then f^ !(—00,c] is 
diffeomorphic to the manifold with handles 
X(f !(—oo, —e] fi, (Morse, Thom, 
Smale [18]). 

3. tGeneralized Poincaré conjecture. Let M" 
be an n-dimensional thomotopy sphere of class 
C? (n> 5). Then M" can be obtained by past- 
ing together the boundaries of two n-disks. 
Consequently, M" is homeomorphic to the n- 
dimensional sphere S" (Smale [18], H. Yama- 
suge [191). 

4. Let M" be a tcontractible compact n- 
dimensional manifold (n 5), with OM con- 
nected and simply connected. Then M" is dif- 
feomorphic to the n-disk D" (Smale [18]). 

5. Let Mt, M3 be oriented, compact, simply 
connected n-dimensional manifolds (n > 4). If 
M, is h-cobordant (— Section I) to M,, then 
M, is diffeomorphic to M, (h-cobordism theo- 
rem; Smale [20]). 
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6. If Mi x R* is diffeomorphic to M3 x R*, we 
say that Mf is k-equivalent to M}. Let M,, M, 
be compact n-dimensional manifolds of the 
same thomotopy type. Then M, is k-equivalent 
to M, if and only if for a thomotopy equiva- 
lence f: M, — M5, t(M,) @ e, is equivalent to 
f *x(M;3) @ &,, where t(M,) is the ttangent 
bundle of M; and e, the ‘trivial vector bundle 
of dimension k (Mazur). 

7. In some special cases the classification of 
manifolds by diffeomorphism is completely 
determined. (i) The classification of simply 
connected 5-dimensional manifolds M with 
vanishing second 'Stiefel- Whitney class w,(M) 
is determined by H,(M). (ii) A 2-connected 
compact 6-dimensional manifold is dif- 
feomorphic to either S? or the connected sum 
of a finite number of copies of S? x S? (Smale 
[20]). Besides these, the classifications of 
(n — 1)-connected 2n-dimensional manifolds 
(Wall [8]) and (n — 1)-connected (2n + 1)- 
dimensional manifolds (Tamura [21], Wall 
[22]) have been obtained. 


G. Thom Complexes 


Let € be a real n-dimensional vector bundle 
over a paracompact space X, A; be the total 
space of the associated bundle of € with fiber 
the closed n-disk D", and A, be the total space 
of the associated bundle of č with fiber the 
(n—1)-sphere ôD". Then the quotient space 
X= ERER obtained from A; by contracting 
A, to a point, is called the Thom space of the 
vector bundle č. If X is a *CW-complex, then 
the Thom space X; of € has the homotopy 
type of a CW-complex and is called a Thom 
complex. The Thom space X, has the canon- 
ical base point p, corresponding to A " 

For a coefficient group G, denote by G; the 
"local system of coefficient groups with stalk G 
associated with the forientation sheaf of an R"- 
bundle Z Then we have the Thom-Gysin 
isomorphism: 


H*(X;Gjz H"*«(X., Pe; G), 
H(X; G)= H,i (Xe, Pes G). 


Let G be a closed subgroup of the ortho- 
gonal group O(n) and BG be the base space of 
the universal n-dimensional vector bundle y, 
with structure group G. Then we can take a 
connected CW-complex as BG. We denote the 
Thom complex of the vector bundle yg by MG 
and call it the Thom complex associated with 
(G, n). The Thom complex associated with 
(G, n) is (n—1)-tconnected. If G is connected, 
then we have n, (MG) SZ, z,(MO(n)) = Z,, 
H"(MG; Z)=Z, and H"(MO(n); Z,)=Z, (Z, 
stands for the quotient group Z/2Z). The 
generator U of H"(MG; Z) is called the funda- 
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mental (cohomology) class of the Thom com- 
plex MG. For a general Thom space X,, we 
can also define the fundamental class of X; 
using the Thom-Gysin isomorphism. 

A C?-submanifold W”? of a compact mani- 
fold V" is a tsupport of a !singular cycle and 
represents a homology class ze H,(V"; G) 
(G — Z, or Z). In this case we say that the 
homology class z of V" is realizable by a sub- 
manifold. A homology class ze H, ,(V"; G) is 
realizable by a submanifold if and only if there 
exists a mapping f: V" MO(k) (or MSO(k)) 
such that f *(U) is the dual cohomology class 
of z (Thom [23]). 

The homotopy group z, ,,(MO(n)) (resp. 
n, 4M SO(n)) (n> k) is determined only by 
k for any n up to isomorphism. It ts called 
the k-dimensional stable homotopy group of 
the Thom spectrum MO = {MO(n)} (MSO 
= {MSO(n)}) and is denoted by z,(MO) 
(1,(MSO)). For the tunitary group U (n) and 
the 'symplectic group Sp(n), we define Thom 
complexes MU (n) and MSp(n) as the Thom 
complexes associated with (U (n), 2n) and 
(Sp(n), 4n) by the canonical inclusions U (n) c 
O(2n) and Sp(n) c O(4n), respectively. The 
k-dimensional stable homotopy groups 
n, (MU)- limz;, (MU (n)), t,(MSp)— 
lim z4, ,,(MSp(n)) of Thom spectra MU ={..., 
MU (n, SMU(n) MU(n+ 1),...}, MSp= 
{..., MSp(n), SMSp(n), S?MSp(n), S* MSp(n), 
MSp(n+ 1),...) can be defined similarly to the 
case of 1,(MO), where SMU (n) denotes the 
treduced suspension of MU (n). The stable 
homotopy groups of Thom spectra are cal- 
culated in connection with the cohomology 
groups utilizing the Thom-Gysin isomorphism 
(— 202 Homotopy Theory T) (Thom [23], 
Milnor [24]). 


H. Cobordism 


Cobordism theory is a theory of classification 
of differentiable manifolds initiated by L. S. 
Pontryagin and V. A. Rokhlin, who called it 
intrinsic homology. The theory was brought to 
maturity by Thom [23]. Its fundamental prob- 
lem is to determine whether a given compact 
manifold is the boundary of another manifold. 
Corresponding cobordism theories for com- 
binatorial and topological manifolds are being 
developed (Wall, R. Williamson). 

We consider only compact C”-manifolds 
that are not necessarily connected. Let D 
be the set of all diffeomorphism classes of 
C*-manifolds, and let Dy be the set of all 
orientation-preserving diffeomorphism classes 
of oriented C?-manifolds. For an oriented 
manifold V, we write — V for the manifold 
with reversed orientation. 
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For two compact k-manifolds V, WeDo, we 
say that V is cobordant to W if there exists a 
compact (k + 1)-manifold X € $, such that 0X 
= VU(— W). In this definition, considering D 
instead of Do, we say that V is cobordant to W 
mod 2. The equivalence class of V* with respect 
to the cobordism relation (the cobordism 
relation mod 2) ts called an oriented (un- 
oriented) cobordism class and is denoted by 
[V*] ([V*];). The set of all oriented (un- 
oriented) cobordism classes [ V*] ([V*],) of k- 
manifolds forms an Abelian group Q, (9t,) by 
the natural addition [V*] +[W*] 2 [V*U W*] 
([V*]; - EW*], 2» [V* U W*],). This is called 
the k-dimensional oriented (unoriented) cobor- 
dism group. Define the product [V*] x [W°] = 
LV x W'] (LV*], x EW']; =[V* x WII 
Then the direct sum Q— 2:0, (N 3. 91,) forms 
an anticommutative (commutative) tgraded 
algebra, which is called the cobordism ring 
or Thom algebra. We have the following 
theorems. 

1. Q,, 9t, are isomorphic to the stable homo- 
topy groups z,(MSO), n,(MO) of the Thom 
spectra MSO, MO, respectively (Thom's funda- 
mental theorem) [23]. 

2. For a natural number k not of the form 
2: — 1, there exists a compact manifold P(k), 
and 9t is a polynomial ring over Z, with gen- 
erators ([P(k)];|k z 2/— 1]. Q6 Q (Qis the 
field of rational numbers) is a polynomial ring 
over Q with generators ([PC?"]|m z 1], where 
PC?" is the complex 2m-dimensional projec- 
tive space (Thom [23]). Moreover, Milnor 
[24] proved that the p-component of Q, is 
zero for an odd prime p. Wall proved that 
the 2-component of Q, contains no element 
of order 4, using a certain exact sequence 
which contains the natural homomorphism 
Q, 5 9t, [25]. 

3. Let T be the ideal of Q consisting of all 
torsion elements in Q. Then O/T is a poly- 
nomial ring over Z with generators {[Y,,]| 
k21}, where we can take for Y}, a complex 
2k-dimensional nonsingular algebraic variety 
(Milnor). 

tCharacteristic numbers are invariants of 
cobordism classes. Combining the results 
stated in 2, we have the following theorem: 

4. Let V, W be manifolds. V is cobordant to 
W mod2 if and only if they have the same 
corresponding 'Stiefel- Whitney numbers. Let 
V, W be oriented manifolds. V is cobordant to 
W if and only if they have the same corre- 
sponding Stiefel- Whitney numbers and tPon- 
tryagin numbers. The findex of an oriented 4k- 
dimensional compact manifold is a cobordism 
class invariant and can be expressed as a linear 
combination with rational coefficients of fPon- 
tryagin numbers (— 56 Characteristic Classes). 

A manifold whose tstable tangent bundle 
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has a complex structure is called a stably (or 
weakly) almost complex manifold. Let V, W be 
2n-dimensional compact stably almost com- 
plex manifolds (n 2 1). We say that V is C- 
equivalent to W if they have the same tChern 
numbers. The set of all C-equivalence classes 
[ V ]c forms the complex cobordism group IL. 
as in the (real) case of cobordism groups (the 
existence of the inverse of an element is not 
trivial). Introducing multiplication into the 
direct sum U, — 3 M, by the product of mani- 
folds, we obtain the complex cobordism ring. 
We have 15, 2:2,,(MU) and U, is a poly- 
nomial ring over Z with generators ([Y7, ]« | 
kz 1], where we can take for Y;, a complex 
k-dimensional nonsingular algebraic variety 
(Milnor [24]). 


I. h-Cobordism Groups of Homotopy Spheres 


A manifold M is said to be parallelizable if its 
tangent bundle t(M) is trivial, and almost 
parallelizable if there exists a finite number of 
points x; in M such that M — | ) {x;} is paral- 
lelizable. A manifold M is called stably paral- 
lelizable (or s-parallelizable) if the Whitney 
sum t Q e, of its tangent bundle t(M) and the 
trivial line bundle e, is trivial. A manifold M" 
is called a z-manifold if M" has a trivial nor- 
mal bundle when it 1s embedded into a Eucli- 
dean space R“ (N » 2n). A manifold M" is a z- 
manifold if and only if M" is s-parallelizable. 
The concepts defined in this paragraph are 
related by inclusions as follows: parallelizable 
& s-parallelizable CG almost parallelizable. 
For a connected manifold with boundary, 
these three concepts are equivalent. *'Group 
manifolds are parallelizable. An n-dimensional 
manifold homeomorphic to the n-dimensional 
sphere is parallelizable if and only if n= 1, 3, 7 
(Milnor [26]). Suppose that we are given an 
(n — 1)-dimensional sphere $"^! (n even). We 
can consider the problem of determining the 
maximal number r such that there exists a 
tangent r-frame field over S" !. J. F. Adams 
solved this problem using *K-theory (— 237 
K-Theory) as follows: Let n=(2a+1)2°, b= 

€ t Ad, where a, b, c, d are integers and O € c < 
3. Put p(n)=2°+ 8d. Then r= p(n) — 1. On the 
other hand, homotopy spheres are z-manifolds 
[27]. 

Let V,, V; be oriented compact manifolds. 
Then we say that V, is h-cobordant to V, if 
there exists an oriented manifold W with 
boundary 0W= V, U( — V3) such that V; (i= 1,2) 
is a deformation retract of W. The set of all h- 
cobordism classes of oriented homotopy n- 
spheres forms an Abelian group 0, with con- 
nected sum as addition. This is called the (h- 
cobordism) group of homotopy n-spheres. We 
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denote by 0,(8n) the subgroup of homotopy n- 
spheres that are boundaries of z-manifolds. 
Kervaire and Milnor gave certain exact se- 
quences that contain the groups 6,, the stable 
homotopy groups G, of spheres, and the stable 
homotopy groups z,(SO). They clarified rela- 
tions among these groups and proved that 
0,(0n) =0 for n even, 0,(0n) — finite group for n 
odd #3, 0,/0,(0n) c Coker J,, etc. [27], where 
J,:1,(S0) G, is the *J-homomorphism. From 
these results it follows that the 0, (n 3) are 
finite Abelian groups, 0, —0 for n « 7, 43, 

-= Zg, etc. (— Appendix A, Table 6.I). 

By pasting together the boundaries of two 
n-disks Df and D3, we obtain an oriented 
manifold which is considered as a smoothing 
of the combinatorial n-sphere. The set of all 
orientation-preserving diffeomorphism classes 
of smoothings of the combinatorial n-sphere 
forms an Abelian group T, with connected 
sum as addition. This is called the group of 
oriented differentiable structures on the com- 
binatorial sphere. By the generalized Poincaré 
conjecture and the h-cobordism theorem, we 
obtain I, — 0, (n 43, 4). Furthermore, I, 2 0 
and I, =0 (J. Cerf [28]). The group I, can also 
be defined as follows: Let Diff * D", Diff * S^! 
be the groups of orientation-preserving diffeo- 
morphisms of D", S" !, respectively, where 
multiplication is defined by composition. Let 
r: Diff * D* 2 Diff * S"! be the homomor- 
phism induced by the restriction D"— S" !. 
Then the image of r is a normal subgroup, and 
I, Diff * S" !/r (Diff * D"). 


J. Surgery Theory 


A process of modifying a manifold into an- 
other by a sequence of spherical modifications 
is called surgery on the manifold (— Section 
F). The technique of surgery proved to be a 
powerful tool for the development of differen- 
tial topology in the 1960s. Kervaire and Mil- 
nor exploited this technique in their study of 
homotopy spheres [27]. W. Browder [29] and 
S. P. Novikov [30] used surgery to construct 
differentiable manifolds with the same homo- 
topy type as that of a given Poincaré complex 
in dimension greater than 4. 

To explain the main points of surgery the- 
ory, we introduce some terminology. A pair 
of finite *CW complexes (X, Y) is called a 
Poincaré pair of formal dimension n if there 
exists a class ue H,(X, Y; Z) called the funda- 
mental class such that the fcap product u^: 
H*(X; £Z) H, (X, Y; Z) is an isomorphism 
for each q. When Y= Ø, X is called a Poin- 
caré complex. Let M be an n-dimensional 
smooth manifold. Consider an embedding g 
of M into a Euclidean space R"**, where k is 
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large enough Ge kz n--2). Then the isomor- 
phism class of the tnormal bundle of g is inde- 
pendent of the choice of embedding, and it 
depends only on M. Any representative of the 
isomorphism class is called the normal k-vector 
bundle of M and is denoted by v. Let (X, Y) 
be a Poincaré pair of formal dimension n, 

and let Z be a real k-vector bundle over X. A 
normal mapping (normal map) ( f; b) consists of 
a degree 1 mapping f:(M,0M)(X, Y) and a 
*bundle mapping b: vf 5 € which covers f. 

A normal mapping (f, b):(M, 0M)—(X, Y) is 
normally cobordant to a normal mapping 
(f, b): (M'; 0M") 2(X, Y) if OM «0M' and 
there exist a smooth (n+ 1)-manifold W and 
a mapping F: Wo X such that 9W =M U 
(—-M)UOM xL, F is covered by a bundle 
mapping B:vj €, and (F, B)| M, 6M) -( f, b), 
(F, B)|(M', 0M) - (f^, b). 

The fundamental theorem of surgery theory 
is formulated as follows: Let (X, Y) be a Poin- 
caré pair of formal dimension n. Suppose that 
X is simply connected and nz 5. Let ( f; b): 
(M,6M)(X, Y) be a normal mapping that 
restricts to a homotopy equivalence on the 
boundary f |0M:0M — Y. Then ( f, b) is nor- 
mally cobordant to a normal mapping ( f’, b^): 
(M', 6M") S(X, Y) with f': M'2 X a homotopy 
equivalence if and only if a well-defined ob- 
struction ot f, b) vanishes. oi f; b) is called the 
surgery obstruction. When n is odd, o( f, b) 
always vanishes. When n z 0 mod 4, o( f, b) is 
an integer. If Y= @, it is given by (I(M)— 
I(X))/8, where I( ) denotes the tindex of the 
manifold or of the Poincaré complex. If nz 2 
mod 4, o( f, b) is an integer mod 2, called the 
Arf-Kervaire invariant. For a thorough devel- 
opment of simply connected surgery — [31]. 

In the PL or even in the topological cate- 
gories, surgery theory can be developed simi- 
larly (Browder and Hirsch, R. C. Kirby and 
L. C. Siebenmann [32]). 

In his study of Hauptvermutung for simply 
connected manifolds, D. Sullivan reformu- 
lated surgery in terms of the "surgery exact se- 
quences" involving the classifying spaces G/PL 
or G/O. (These spaces are “homotopy theoretic 
fibers" of BPL— BG or BO BG respectively.) 
In the special case where X is a closed simply 
connected PL n-manifold, the surgery exact 
sequence can be formulated as follows (n 2 5): 


Z (nz0mod4) 
0oAT(X) [X, G/PL]S 4 Z, (n22 mod4), 
O0 (nodd). 


The set hT(X) is the totality of equivalence 
classes of pairs (M, f), where M is a closed PL 
n-manifold and f is a homotopy equivalence 
M — X. Two such pairs (M’, f^) and (M", f") 
are defined to be equivalent if there exists a PL 
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homeomorphism h: M' ^ M" such that f"oh is 
homotopic to f’. The set of homotopy classes 
[ X, G/PL ] of mappings X 5 G/PL is appro- 
priately identified with the set of all normal 
cobordism classes of normal mappings with 
target X or with the set of all PL reductions 
of the Spivak normal fiber space of X asa 
Poincaré complex [31]. The mapping 5 can 
be defined naturally, and the mapping 0 cor- 
responds to the assignment of the surgery 
obstruction. The image n(M, f)e[ X, G/PL] 
is sometimes called the normal invariant of 
(M, f). Sullivan reduced the classification 
problem of manifolds within a given homo- 
topy type to a homotopy theoretic problem 
of the classifying spaces G/PL, G/O [33] (also 
— [31, 34]). 

In the latter half of the 1960s, surgery theory 
was extended by Wall to cover all compact 
non-simply connected manifolds which are 
not necessarily orientable. He introduced a 
certain Abelian group L,,(z; w), now called the 
Wall group, which functorially depends on the 
fundamental group z — z,(X), with the orienta- 
tion character w:z,(X)— Z;, and which is of 
period 4 with respect to the formal dimension 
n of X. In Wall’s theory, the surgery obstruc- 
tion o( f, b) takes its value in this group. The 
group structure of Wall groups has been cal- 
culated in many cases. Sullivan's exact se- 
quences are extended to the non-simply con- 
nected cases [34]. Making use of the extended 
exact sequences, Wall and W. C. Hsiang and 
J. L. Shaneson classifield homotopy tori. The 
result played an important role in the work of 
Kirby and Siebenmann on the tannulus con- 
jecture and stable homeomorphisms which led 
to the solution of tHauptvermutung and trian- 
gulation problems on topological manifolds in 
1969 (— 65 Combinatorial Manifolds). Sur- 
gery theory has many applications to other 
geometric problems. Among them are find- 
ing missing boundaries for open manifolds, 
equivariant surgery, homology surgery [35], 
and surgery on codimension two submanifolds 
[36, 37]. 


K. 4-Dimensional Manifolds 


The results in differential topology discussed 
above are mainly concerned with manifolds of 
dimension È> 5. For 4-manifolds, however, 
because of their peculiar nature which is not 
observed in other dimensions, many funda- 
mental problems had remained unsolved until 
M. H. Freedman's epoch-making paper [54] 
appeared in 1982. His paper, together with 

S. K. Donaldson's theorem [55] which was 
published a little later, was a breakthrough in 
the theory of 4-manifolds. 
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It was Rokhlin who first discovered a 
strange property of 4-manifolds [38]. Ro- 
khlin’s theorem states: Let M be a closed 
oriented smooth 4-manifold. If M is almost 
parallelizable (or equivalently, if M is a spin 4- 
manifold), then the index of M is divisible by 
16. Milnor and Kervaire gave an alternative 
proof of this theorem from the differential- 
topological point of view [39]. Freedman and 
Kirby and Y. Matsumoto gave geometric or 
elementary proofs. 

Until 1981, Rokhlin’s theorem had been a 
constant source of many curious phenomena 
in 4-dimensional topology. The list contains, 
for example: (1) the class (4m 2, 4n + 2)e 
H,(S? x S*)=Z@ Z cannot be represented by a 
smoothly embedded 2-sphere in S? x S? (Ker- 
vaire and Milnor). This result was improved 
by W.-C. Hsiang and R. H. Szczarba, A. G. 
Tristram, and Rokhlin. (2) The h-cobordism 
theorem fails to hold in 4 or 3 dimensions 
(T. Matumoto, Siebenmann [40]). (3) There 
exists a closed smooth 4-manifold M that is 
homotopy equivalent to the real projective 
space RP^, but M zZ RP^ (Cappell and Shane- 
son). (4) There exists an open 4-manifold Wi 
properly homotopy equivalent to S? x R but 
distinct (Freedman). 

Closed, connected, simply connected 4- 
manifolds M and N are homotopy equivalent 
if and only if the intersection forms defined on 
the 2-dimensional (co)homology groups are 
equivalent as inner product spaces over Z 
[41]. Wall [42] proved that if closed simply 
connected smooth 4-manifolds M and N 
are homotopy equivalent, then they are h- 
cobordant. Moreover, if M and N are h- 
cobordant, then there exists an integer k z 0 
such that M #k(S? x S?) is diffeomorphic to 
N #k(S? x S?). 

About 1973, using a certain infinite repeti- 
tion process, A. Casson constructed a family 
of noncompact smooth 4-manifolds which are 
properly homotopy equivalent to the open 2- 
handle D? x R?. A 4-manifold belonging to the 
family is called a Casson handle. He observed 
that if all the Casson handles are diffeomor- 
phic to D? x R?, then theories analogous to 
surgery and the h-cobordism theorem in 
higher dimensions can also be developed in 
dimension 4. 

In his 1982 paper [54], Freedman proved 
that each Casson handle is homeomorphic to 
D? x R?. (It was proved later that Casson 
handles are not, in general, diffeomorphic 
to D? x R?.) This result and the proper h- 
cobordism theorem, also due to Freedman, 
proved many fundamental results on the topo- 
logical structure of 4-manifolds: (1) If closed 
simply connected smooth 4-manifolds M, N 
are h-cobordant, they are homeomorphic. In 
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particular, a 4-dimensional homotopy 4-sphere 
is homeomorphic to the 4-sphere S*. (Proof of 
the 4-dimensional Poincaré conjecture.) (2) A 
topological 4-manifold properly homotopy 
equivalent to Rt is homeomorphic to R*. (3) 
Given a nonsingular symmetric bilinear form 
œw over Z, there exists a closed, connected, 
simply connected topological 4-manifold 
whose intersection form is equivalent to w. 
(Therefore Rokhlin's theorem cannot be ex- 
tended to topological 4-manifolds. From this, 
it follows that there exist simply connected 
topological 4-manifolds which are nonsmooth- 
able or even nontriangulable.) (4) The homeo- 
morphism class of a closed connected simply 
connected 4-manifold is determined by the 
intersection form and the Kirby-Siebenmann 
class. (This statement is an improved one by 
F. Quinn [56]. 

Donaldson [55] revealed a sharp contrast 
existing between smooth and topological 4- 
manifolds. Donaldson's theorem states: If the 
intersection form of a closed, connected, sim- 
ply connected smooth 4-manifold is positive 
definite, then the form is equivalent to the 
standard one (15 C1» @... ® <1). This 
theorem, together with Casson's theory and 
Freedman's result (2) stated above, implies 
that there exists an exotic differential struc- 
ture on a 4-dimensional Euclidean space R4. 
Moreover, as an application of Donaldson's 
theorem, the problem of representing a 2- 
dimensional homology class of $? x S? by a 
smoothly embedded 2-sphere was completely 
solved (K. Kuga [57]): the class (p, q)e H,(S? x 
S?) is represented by a smoothly embedded 
S? if and only if |p| x 1 or |q| <1. 

Many problems concerning differential 
structures on 4-manifolds remain unsolved. It 
is not known whether an exotic smooth 4- 
sphere exists. 

Among other results, the proof of the 4- 
dimensional annulus conjecture by Quinn [56] 
is remarkable. 


L. Miscellaneous Results 


The Browder-Livesay invariant. Let X" be a 
homotopy n-sphere, T an involution on X", 
that is, a smooth mapping T: 2" 2" with 
To T —id»». Assume that T is free from fixed 
points. The involution T is said to desuspend 
if there exists a homotopy (n — 1)-sphere 2” 
smoothly embedded in £” which is invariant 
under the action of T. Browder and G. R. 
Livesay defined an obstruction c(T) in the 
group Din: even), Z (nz 3mod4), Z, (n= 

1 mod 4) such that o(7)=0 if and only if T 
desuspends (provided that n > 6) [43]. The 
invariant c(T) is now called the Browder- 
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Livesay invariant. The technique used there 
was equivariant surgery under the action of T. 

De Rham homotopy theory. A new approach 
to topology of manifolds was invented by 
Sullivan [44], who considered the exterior 
algebra of differential forms (with polynomial 
coefficients) on a simplicial complex. Through 
the construction of the minimal model for the 
algebra of differential forms, he recovers the 
rational homotopy type of the complex under 
some reasonable condition on the fundamental 
group. Following upon the classical de Rham 
theorem which recovers the (real) cohomology 
algebra from the algebra of differential forms, 
this approach is called the de Rham homotopy 
theory. This method has proved to be useful 
for the explicit calculation of the algebraic 
topological invariants of *Káhler manifolds, 
tloop spaces, cross-section spaces, talgebraic 
varieties, etc. 

Kirby calculus. Kirby [45] initiated a link- 
theoretic approach to 3- and 4-manifolds. Let 
L be a link in S°. Suppose that each compo- 
nent of L is given a framing, which means a 
trivialization of a tubular neighborhood. Such 
a link is called a framed link. A framing of 
a component is determined by the ‘linking 
number between the component and its re- 
placement along the framing. By attaching 
2-handles to the 4-disk D^ along a framed link 
in S? CD. we obtain a handlebody; here we 
denote it by W,. It is known that each closed 
connected oriented 3-manifold is obtained as 
the boundary of such a handlebody (W. B. R. 
Lickorish, Wallace). Kirby proved that for 
framed links L and L’, the boundaries dW, 
and CHA. belong to the same orientation- 
preserving diffeomorphism class if and only if 
L is transformed into L’ by a sequence of the 
following two kinds of elementary operations: 
(1) adding or deleting a trivial knot (separated 
from L) with framing +1, (ii) band summation 
of components corresponding to *handle 
sliding." Kirby's approach is called Kirby 
calculus on framed links. For applications — 
[45,53]. 
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A. General Remarks 


Let (Q, B, P) be a tprobability space. A tMar- 
kov process {X,}o << on a topological space 
S with fcontinuous time parameter t is called a 
diffusion process if the tsample function X,(w) 
is continuous in t with probability 1 until a 
random time C(o), called the tterminal time. 
After the terminal time, X,(c) stays at the 
terminal point ô. Such a process is said to be 
1-dimensional or multidimensional according 
as S is an interval or a manifold (possibly with 
boundary) with dimension 22. Brownian ` 
motion is a typical diffusion process (— 45 
Brownian Motion). 

To give conditions for a Markov process to 
be a diffusion process, let us assume that S is a 
‘complete metric space with metric p. Let {X,} 
be a Markov process on S with time para- 
meter t ranging over a finite interval [t], t3] 


and satisfying 
sup P(s, x,t, S — U,(x)) 2 O(h) (h}0) (1) 


for every £7 0, where U,(x) is the e-neighbor- 
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hood of x and sup is taken over all xe S, and 
s, te[t,, t2] such that 0<t—s<h. Then {X,} 
is a diffusion process if and only if 


E P(p(X,, X...) e)dt—o(h) (lu 


for every £2 0 [5]. This includes the following 
result due to E. B. Dynkin (1952) and J. R. 
Kinney (1953) as a special case: If in (1) we can 
replace O(h) with o(h), then {X,} will be a 
diffusion process. Specifically, when {X,} is a 
1-dimensional *strong Markov process, the 
latter condition is also necessary for the pro- 
cess to be a diffusion process. 

Diffusion processes are intimately related to 
a certain class of tpartial differential equations. 
Let S be the real line. Assume that the ttran- 
sition probability P(s, x, t, E) (s « t) of the pro- 
cess {Xj} <,<, satisfies 
1—P(s,x,s+h,(x—e,x+8))=o(h)  (h|O) 

(2a) 


for every ¢>0 and that the following limits 
exist: 


1 xt+e 
` Se EE 
Jm a (y — x) P(s, x, s +h, dy) 
=2a(s,x)>0, (2b) 


. 1 xte 
Jim Al (y—x)P(s,x,s+h,dy)=b(s,x), (2c) 


1 
lim —(P(s,x,s +h, S)—1)=c(s, x) « 0. (2d) 
h>0+ h 


Assume further that the transition probability 
is tabsolutely continuous with respect to Le- 
besgue measure: P(s, x, t, dy)= p(s, x, t, y)dy. 
Then under some suitable additional con- 
ditions, p(s, x, t, y) satisfies 


0 o? ð 

ALT aS) 5 — b(s x) els, x)p, 

p(t—0, x, t, y) 2 ó(x — y), (3) 
and 

op 2 ô 

MALTA UA. > ae Ta b , H , 

à: Säll? y)p) ay! (t, y)p)+c(t, y)p 

p(s, x,s+0, y)=d(y—x), (4) 


where ó is the *Dirac delta function. The co- 
efficient c vanishes if ( X) is tconservative. 
Equations (3) and (4) are called Kolmogorov’s 
backward equation and forward equation, re- 
spectively. They are also called the Fokker- 
Planck partial differential equations. 

A. N. Kolmogorov [1] derived equations (3) 
and (4) in 1931, and W. Feller proved that (3) 
(or (4)) has a unique solution under certain 
regularity conditions on the coefficients a, b, 
and c, and that the solution p(s, x, t, y) is non- 
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negative, has an integral with respect to y that 
does not exceed 1, and satisfies the *Chapman- 
Kolmogorov equation 


IL x, t, y)p(t, y. u, z) dy = p(s, x, u, Z) 


for every 0« s «t « u. Hence p(s, x, t, y) deter- 
mines a "Markov process analytically. How- 
ever, rigorous proof establishing the sufficiency 
of condition (2) for the Markov process {X,} to 
be a diffusion process did not appear until the 
1950s. 

In the temporally homogeneous case, 
p(s, x, s +h, y) does not depend on s and can be 
written as p(h, x, y). Then a(s, x), b(s, x), and 
c(s, x) are also independent of s, and p(t, x, y) 
satisfies 


Op 6? p Op 
SE a(x)- + Walz" c(x)p, 
p(0  , x, y) 2 ó(x — y). (5) 


Feller made an intensive study of this case and 
completely solved the problem of existence 
and uniqueness of the solution of (5) assuming 
that p(t, x, y) is nonnegative and that its inte- 
gral with respect to y does not exceed 1 [7]. 
In particular, when t varies in [0, +00) and S 
is an interval [r,,r,], Feller used the Hille- 
Yosida theory of tsemigroups of operators to 
determine the conditions that should be satis- 
fied by r, and r, in order that the differential 
equation (5) (with the initial condition and his 
additional assumptions) yield one and only 
one solution. Feller also introduced the notion 
of generalized differential operators, which 
expresses the differential operator in the right- 
hand side of (5) in the most general form [8]. 
The probabilistic meaning of his results was 
clarified by Dynkin, H. P. McKean, K. Itó, 

D. B. Ray, and others, and all 1-dimensional 
diffusion processes with the tstrong Mar- 

kov property have now been completely 
determined. 

Since not much research on temporally 
nonhomogeneous diffusion processes has 
been done so far, we restrict our explanation 
to temporally homogeneous ones. Let M = 
(X,, W, P.| xeS) be a Markov process, where 
S is a tstate space, W is the tpath space con- 
sisting of all paths w:[0, --oo] SU (0) which 
are continuous in t for 0 x t « C(w) (w(t) for 
tz C(w) while w(t)e S for Ot « C(w)), and P, is 
a probability measure on W under the con- 
dition that the process starts from x at t=0 
(— 261 Markov Processes). We can actually 
identify W with the tbasic space Q and set 
X,(w) = w(t) for weQ. Assume that W has the 
*strong Markov property. It follows from 
*Dynkin’s formula for tinfinitesimal generators 
(— 261 Markov Processes) that the infini- 
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tesimal generator (5 of a diffusion process has 
the local property stating that if u and v be- 
long to the domain of © and coincide in a 
neighborhood of xo, then Gu(x_) = Dol, 


B. 1-Dimensional Diffusion Processes 


Let S be a straight line. A point xe S is called 
a right singular point if X,(w) 2 x for all te 
[0, €(w)) with P,-probability 1. A left singu- 
lar point is defined analogously, with > re- 
placed by «. A right and left singular point 
is called a trap, while a right singular point 
which is not left singular is called a right shunt 
(a left shunt is defined analogously). A point is 
called a regular point if it is neither right nor 
left singular. 

The set of all regular points is open. Let 
(ri, r2) be a connected component of this open 
set. One of the most important results con- 
cerning this situation is the proof of the exis- 
tence of a strictly increasing function s(x) de- 
fined on (r,, r;) and two measures m and k 
on (r,, r;) such that the infinitesimal generator 
© of M is represented as 
Gulx) | u (dx)—u(x)k(dx) l 6) 

m(dx) 

where u*(dx) is the measure du * (x) induced by 
the fright derivative u ' (x) of u(x) with respect 
to s(x) (Le., u* (x) 2 lima. , +o {u(x + Ax)— 
u(x)}/{s(x + Ax) —s(x)}). Equation (6) gives a 
generalization of second-order differential 
operator au" + bw - cu (a 0,c <0) [12]. Here 
m is positive for nonempty open sets, both m 
and k are finite for compact sets in (r,, r;), and 
s, m, and k are unique in the following sense: If 
there are two sets of values of s;, m;, and k; (i— 
1, 2), then s (x) 2 cs, (x) -- constant, m, (dx) = 
c ! m,(dx), and k,(dx) -c ! k,(dx) for some 
positive constant c. We call s, m, and k, respec- 
tively, the canonical scale, canonical measure 
(or speed measure), and killing measure for Wi. 
They determine the behavior of X,(w) belong- 
ing to M inside the interval (r;, r;). Conversely, 
given any such set of s, m, and k, we can find a 
1-dimensional diffusion process Wè such that s, 
m, and k are, respectively, the canonical scale, 
canonical measure, and killing measure of W. 
If X,(w) is nonvanishing in (r;, r;) with proba- 
bility 1, the killing measure k is identically 
zero, and the canonical scale s satisfies the 
equation 


s(x) —s(x,) 


Piles, 0) ab 


for x, « x « x, where o, is the thitting time of 
the point y. 

The motion X,(w) belonging to the process 
M and contained in (r;, r;) can be constructed 
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from the Brownian motion by means of a 
topological transformation of the state space 
(interval) based on s, a ttime change based on 
m, and a tkilling based on k. More precisely, 
we first transform the interval (r,, rj) by x 
s(x) into the interval (s(r, +0), s(r, —0)) so 
that the diffusion process on this new interval 
has a canonical scale coincident with x. The 
speed and killing measures are transformed 
accordingly. We can, therefore, assume that 
the canonical scale is x. Let us consider the 
case (r,, r;) 2» (—oo, +00) for simplicity. Let 
t(t, x) be the flocal time of Brownian motion at 
x (— 45 Brownian Motion). Next, we apply 
the ftime change to the Brownian motion by 
means of the tadditive functional 


TEN ta, x)m(dx), 


and finally tkill the latter process by means of 
the tmultiplicative functional 


+% 
a(t) exp( — | tlo (t), dx) f 
Thus we obtain the process M in (—oo, 00). In 
particular, if m(dx)=a(x) ! dx and k(dx)= 
|c(x)| dx, we have 


ott | aa)" a 


0 


a(t)— exp (| c(X,-1) i) : 
0 


and Gu — au" 4- cu. 

At a shunt the infinitesimal generator (5 has 
a form that is a generalization of the first- 
order differential operator bu’ + cu (c <0), 
with b>0 or b «0 according as it is a right 
shunt or a left shunt. At a trap we have Gu(x) 
= —u(x)/ EQ). 

When S is an interval with endpoints r, and 
r, and all interior points are regular, the left 
endpoint r, is classified into the following 4 
types, according to the bebavior of W near r,: 
Take an arbitrary fixed point re(r,, r2), and set 
n(dx) 2 m(dx) 4- k(dx) and 


a= | (s(r) ~ s(x))n(dx), 
(ri.r) 


p= | n((x, r))s(dx). 
(rir) 


Then r, is a regular boundary if x < co, f < 

oo; an entrance boundary if x < oo, B= œ; an 
exit boundary if x = œ, fj < oo; and a natural 
boundary if x= oo, fj 2 oo. This classification 

is independent of the choice of r. A similar 
classification of r, can be established. X,(w) 
approaches r, in finite time with positive or 
null probability according as £ is finite or infi- 
nite. If «= oo, it never happens that X,(w) 
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starts from r, and reaches the interior of the 
interval S even if r; € S, whereas if x < oo we 
can construct (adjoining r, to S if necessary) a 
diffusion process that enters the interior from 
r, and whose motion in the interior coincides 
with that of X,(w). 

If r, is a regular boundary for W and r, €S, 
then there are various possibilities for the 
behavior of X,(w) at pr. They are expressed by 
the boundary conditions satisfied by the func- 
tions u belonging to the domain of the in- 
finitesimal generator (5. The condition is in 
general of the form 


yu(r,) + ó6u(ri) -- uu * (r,)=0, 


where y, ô, and u are constants, y, ó € 0, p>0, 
and |ó| 4- u » 0. If y 2 ó —0, then r; is said to be 
a reflecting barrier. If r, is regular for W and 
does not belong to S, then X,(w) vanishes 
exactly as X,(w) reaches r,, and r, is called an 
absorbing barrier. This case corresponds to the 
boundary condition u(r,) 2 0. Whatever the 
boundary condition may be, 9Ji is constructed 
from the Brownian motion with reflecting 
barrier by topological transformation of the 
state space, time change, and killing. Here if 
y #0, then killing may occur at r,; if 640, the 
set of visiting times of r, has positive Lebesgue 
measure; and if 40, the trace of the motion 
may go beyond the point r, and reach the 
interior points of S [2, 7-9]. 

If we weaken the assumption of continuity 
of paths and admit jumps from r,, the general 
boundary condition becomes 


yu(ri) - dGu(r,)+ pu’ (ri) 
d (u(x) —u(r,)) v(dx) — 0, 
(ri; rl 


where v is a measure with respect to which 
min(1, s(x) — s(r, +0)) is integrable. 

When S — (r,, r;), the transition probability 
is absolutely continuous with respect to the 
canonical measure, the density p(t, x, y) has an 
teigenfunction expansion 


0 
p(t, x, y) | e^ e(dÀ x, y), 
-o 
and p(t, x, y) is positive, jointly continuous in 3 
variables, and symmetric in x and y. A similar 
result is also known when S is half-open or 
closed [9]. 

If M is trecurrent, i.e., P.(o,< +00)=1 for 
every x and y in S, then there exists a unique 
(up to a multiplicative constant) *invariant 
measure for S that is finite for all closed inter- 
vals in the interior of S. If Mt is conservative 
and all the interior points of S are regular, 
then the canonical measure is an invariant 
measure, provided that the endpoints are 
either entrance, natural, or regular reflecting. 
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C. Multidimensional Diffusion Processes 


Let the state space S be a domain or the 
closure of a domain in the n-dimensional 
Euclidean space R". Consider a temporally 
homogeneous diffusion process {X,}5<;<., on 
S. Under suitable regularity conditions the 
infinitesimal generator © coincides, for suffi- 
ciently smooth functions u in its domain, with 
the following 'elliptic partial differential oper- 
ator A: 


2 





A= y aÏ{(x)— 


i, j=1 Ox'Qx! 


xui b! Ia zi CODO. 
cx0; (7) 


where S has a boundary, u satisfies a boundary 
condition of the form 


n ig 
2 wx 


i,j=1 


"27 








+5 ps 


Ou e) 





+ y(x)u(x) + d(x) Au(x)+ Sek =0, (8) 
where, for simplicity, we assume that S is the 
closed half-space defined by x" z 0, (a?) is a 
symmetric nonnegative definite matrix, y <0, 
ó «0, uz 0, and 0/On is the inward-directed 
*conormal derivative associated with al This 
boundary condition was discovered by A. D. 
Venttsel’ [13]. Conversely, given an operator A 
such as (7) and a boundary condition (8) (if S 
has a boundary), the existence and uniqueness 
of the corresponding diffusion process are 
known for several special cases. If S — R" and 
A has continuous coefficients, there exists at 
least one diffusion process corresponding to A 
[14-16]. 

A probabilistic approach to constructing 
diffusion processes corresponding to the oper- 
ator A with c=0 is given by Itó's method 
of tstochastic differential equations (— 406 
Stochastic Differential Equations E). A some- 
what different approach was introduced by D. 
W. Stroock and S. R. S. Varadhan [20] under 
the name of martingale problems (— 261 Mar- 
kov Processes C). Let W” = C([0, oo) 5 R”) 
be the space of all continuous functions w: 

[0, oo) 2 R" endowed with compact uniform 
topology and 8(W") be the topological o- 
field. Given x c R", a solution to the martingale 
problem for the operator A with c=0 start- 
ing from x is a probability measure P, on 
(W^, PALA satisfying P.(w(0) 2 x) 2 1 such 
that f(w(t)) — f$ Af(w(s)) ds is a P,-martingale 
for all fe es (R"), where C$ (R") denotes the set 
of all C?-functions on R” having compact 
support. If a — (a?) is uniformly positive de- 
finite, bounded, and continuous and if b — (b?) 
is bounded and Borel measurable, the martin- 
gale problem for the operator A with c=0 is 
well posed, i.e., for each x eR", there is exactly 
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one solution starting from x. (For further 
information related to the theory of martingale 
problems — [15,20]. 

Suppose next that S is the closure of a 
bounded domain with a sufficiently smooth 
boundary and A is given with sufficiently 
smooth coefficients. If the boundary condition 
is yu + uQu/On =0, 40, then there exists a 
unique diffusion process on S corresponding 
to this situation. Moreover, if y —0, then the 
process is said to have a reflecting barrier. S. 
Watanabe [22] gave a probabilistic condition 
which characterizes the reflecting diffusion 
processes in the normal direction among all 
reflecting diffusion processes in oblique direc- 
tions. Suppose that a general boundary con- 
dition (8) is given. We write the left-hand side 
of (8) as Lu. Write u= Hf for the solution of 
Au —0 with boundary value f. Under some 
natural additional conditions, T. Ueno proved 
that if LH *generates a Markov process on 
the boundary, then there exists a diffusion 
process for A with boundary condition (8) 
[21]. If {X,} has a reflecting barrier, the Mar- 
kov process on the boundary is, conversely, 
obtained from {X,} through time change by a 
nonnegative continuous 'additive functional 
which increases only when the value of X, is 
on the boundary. Stochastic differential equa- 
tions are also used in constructing diffusion 
processes with boundary condition (8) [19] 
(— 406 Stochastic Differential Equations). An 
elegant method for constructing diffusion 
processes with boundary condition (8) has 
been introduced by Watanabe [22]. It consists 
of piecing together excursions from the bound- 
ary to the boundary. In this construction, 
the stochastic integrals for *Poisson point 
processes play an important role. There are 
various other results on multidimensional 
diffusion processes with general boundary 
conditions which cannot be covered by the 
method mentioned above (— [23] and M. 
Motoo, Proc. Intern. Symp. Stochastic Dif- 
ferential Equations, K yoto, 1976). 

When we have a diffusion process on S$ 
with infinitesimal generator of the form A, we 
can obtain a probabilistic expression for the 
solutions of various partial differential equa- 
tions involving A. Let c be the hitting time of 
the boundary of S. Then Hf(x) can be ex- 
pressed as E,( f(X, all The solution of Au = 
— f with boundary value 0 is given by u(x)— 
E (fo f (X) dt), while the solution u(t, x) of 
Qu/Ot = Au with boundary value 0 and initial 
value u(0, x) 2 f is E,(f(X); t « o). The first 
case gives the solution of a tDirichlet problem; 
and in this case, the condition for a boundary 
point to be tregular relative to the Dirichlet 
problem can also be expressed probabilisti- 
cally (— 45 Brownian Motion, 261 Markov 
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Processes). Cen if f( X) is replaced by 
f(X)exp(f5 k(X,) ds) in the expressions for u(x) 
and u(t, x), A is Sege by A +k (M. Kac 
[24]). When k <0, this replacement gives rise 
to a killing of the process (— 261 Markov 
Processes E). 

By using the theory of tDirichlet forms we 
can investigate a general class of tsymmetric 
multidimensional diffusion processes which are 
not in the framework of the classical diffusion 
processes, i.e., diffusion processes whose in- 
finitesimal generators are not necessarily dif- 
ferential operators (M. Fukushima, Dirichlet 
Forms and Markov Processes, 1980)). 


D. Diffusion Processes on Manifolds 


Let M be a tconnected foriented !c-compact 
C* tmanifold of dimension n. Let M be M or 
M U {6} (= the one-point fcompactification 
of M) according as M is tcompact or non- 
compact. Suppose that we are given a system 
of C?—tvector fields Ag, A1, ..., A, on M. 

We consider the following stochastic differen- 
tial equation on M: 


dX,— Y. A(X)odw*(t) - Ag(X,dt (9) 
k=1 


(— 406 Stochastic Differential Equations), 
where w(t)=(w!(t), w?(t), ..., w"(t)) denotes an 
r-dimensional Brownian motion and the first 
term of the right-hand side is understood in 
the sense of the *Stratonovich stochastic dif- 
ferential. Let X (t, x, w) be the solution of (9) 
with Xy — xe M defined on the r-dimensional 
Wiener space (Wj, P") (— 406 Stochastic 
Differential Equations) and P, be the proba- 
bility law on W(M) of (X (t, x, w))} >o, where 
W(M) is the space of all continuous map- 
pings w:[0, 00) M with ô as a trap. Then 
{P,.|xeM} defines a diffusion process W on 
M which is generated by the second-order 
differential operator 37 ., A? /2+ Apo, ie. a 
diffusion process W with the infinitesimal 
generator © such that 


(Sf (x)= D Az f(x) 4- Ag f(x) for f e Cg; (M), 
where C$ (M) denotes the space of all C®- 
functions on M with compact support. By 
appealing to the analytical theory of partial 
differential equations we can discuss regularity 
properties of the transition probability of M 
([15]; L. Hórmander, Acta Math., 119 (1967)). 
Recently, P. Malliavin [25] also suggested a 
probabilistic method for proving elliptic regu- 
larity results (see also [19]). 

We now assume that all linear sums of 
An, A,, ..., A, are tÀcomplete. Then the terminal 
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time of W is infinite a.s. Let W(M) be the space 
of all continuous mappings w:[0, o0) 3 M 
endowed with compact uniform topology, and 
set W.(M)={w|we W(M), w(0) 2 x]. We de- 
scribe the *topological support (P,) of the 
probability P, on W,(M), i.e., the smallest 
closed subset of W,(M) that carries the mea- 
sure P Let Z be the set of all piecewise con- 
stant mappings u:[0, oo) R'. For a given 

u — (u! (t), w? (t), ... ,u'(t)) of F, we consider a 
system of ordinary differential equations 


d r 
qz 0 Alo) + 5, A,(p(t))u*(2), (10) 


i.e., for every C?-function f on M with com- 
pact support, 


d r 
4; (90 = Aof lo) LE (Af )(o())u*(r). 


Then for every ue F and xe M, we obtain a 
curve o = DL, u)=(¢,(x, u)) on M by solving 
(10) with 9(0)— x. Set * —(e(x,u)ue P}. 
Then we have 


SY(P)J=Sf* for every xe M. 


Let ¥ be the *Lie algebra generated by A,, A;, 

., A,, and set Z(x)  (V.|Ve$j. If dim Z(x)— 
n for every xe M, then Y(P,) - WAM) for every 
xe M. For further information — Stroock and 
Varadhan (Proc. 6th Berkeley Symp. Math. 
Statist. Prob. III, 1972) and H Kunita (Proc. 
Int. Symp. Stochastic Differential Equations, 
Kyoto, 1976). 

Let A be a smooth tnondegenerate second- 
order telliptic differential operator on M which 
is expressed in local coordinates as 


n e 


1 
Amt dai 








where (a?(x)) is symmetric and ‘strictly posi- 
tive definite. Then there exist a *Riemannian 
metric g and a C?-tvector field b on M such 
that 


1 
A= Ay +b, (11) 


where Ay is the *Laplace-Beltrami operator 
on the Riemannian manifold (M, g). We now 
construct the diffusion process generated by 
the operator A, introducing a stochastic dif- 
ferential equation on the tbundle O(M) of the 
‘orthonormal frames. There exists an taffine 
connection V fcompatible with the Riemann- 
ian metric g such that for every C*-function f 
on M, 


n 


1 
SH &Cfon)(r)- (ios +b) Maton 


re O(M), 
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where (L,, L3, ..., La) is the system of tcanon- 
ical horizontal vector fields (tbasic vector 
fields) on O(M) corresponding to the affine 
connection V and z:0(M)—M is the natural 
projection [19]. We now consider the follow- 
ing stochastic differential equation on O(M): 


dr, Y L,(r,)odw*(0). (12) 
k=1 


Let r(t,r, w) be the solution of (12) with rp = 

r€ O(M) defined on the n-dimensional Wiener 
space (Wg, P"). Now a stochastic curve 

X (t, r, w) on M is defined by X(t,r, w)= 

n(r(t,r, w)). Set X(t,r,w) 20 fort 2 C, where C 

is the *explosion time of r(t, r, w). Then the 
probability law of (X (t, r, w)),>9 on W(M) 
depends only on x= z(r), and it defines a diffu- 
sion process Jt on M which is generated by 
the operator A of (11). (For details — [19,26].) 
When A=A,,/2, i.e., b=0 in (11), the diffusion 
process Wis called the *Brownian motion on 
the Riemannian manifold M (— 45 Brownian 
Motion). 

Next consider the case when (L4, L2, ..., L,) 
in (12) is a system of canonical horizontal 
vector fields corresponding to the *Riemann- 
ian connection, and let b be a C®—vector 
field on M. Let b be the scalalization of b, 

b=(b', b?, ..., b"), b'(r) - X1, bi(x) fi, and 
(fj) 2 (ej)! for r—(x es, e5, LT EUM 
where b=}; b'0/0x' and e; 237, elO/0x? 
in local coordinates. Suppose Ee bi(r) is 
bounded on O(M) for i— 1,2, ..., n. Then 


ul È In r,)dw'(s) 
g di 2d 
"32 K (r,)) s] 


is an texponential martingale. The diffusion 
process generated by the operator A,,/2 +5 is 
obtained from the Brownian motion on M 
constructed above by the ttransformation of 
drift, i.e., the transformation by the tmultiplica- 
tive functional M(t) (— 261 Markov Processes). 
(G. Maruyama (Nat. Sci. Rep. Ochanomizu 
Univ., 15 (1954)) studied this for 1-dimensional 
processes, I. V. Girsanov [27] and M. Motoo 
(Ann. Inst. Statist. Math., 12 (1960—1961)) for 
multidimensional cases.) 

Consider a diffusion process W= (P,|xe Mj 
on M generated by the operator A of (11). Let 
œ, be the tdifferential 1-form defined from the 
vector field b by 


ex Vy = gx(b,. V.) 


for every xe M and every C*—vector field V 

on M. Then Mt is symmetrizable if and only if 
@, is texact, i.e., if there exists a function F on 
M such that w,=dF. The tinvariant measures 
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are then of the form const. x exp[2F(x) ]m(dx), 
where m(dx) is the *Riemannian volume (E. 
Nelson, Duke Math. J., 25 (1958); Kolmogorov 
Math. Ann., 113 (1937)). The diffusion process 
Mè is said to be locally symmetrizable if for 
every tsimply connected domain D c M there 
exists a Borel measure vP(dx) on D such that 


> 


| Tf (x)g(x)v? ay=f F(x) Tg (xv? (ax) 
D D 


for all bounded continuous functions f and g, 
where 


Tf (x)= fiw 


W(M) 
tp(w) 2 inf(t|w(t)é D}. 


Then Mt is locally symmetrizable if and only if 
œ, is tclosed, 1.e., den, - 0. R. Z. Khas’minskii 
[28] proved a pair of useful tests for explo- 
sions of diffusion processes on M =R", similar 
to Feller's test for n— 1 mentioned in Section B 
(— [18]). In case of a general manifold we can 
investigate the possibility of explosions for 9 
by appealing to the comparison theorems for 
*curvatures in the theory of differential geome- 
try [19,29]; S. T. Yau, J. Math. Pures Appl., 57 
(1978)). The diffusion process 9t on M is said 
to be trecurrent if P,[ X, € U for some t>0]=1 
for any open subset U of M; otherwise it is 
called *transient. It is well known that an n- 
dimensional Brownian motion on R” is recur- 
rent if n € 2 and transient if n> 3 [9]. There 
also are some results for the criterion which 
determines whether W is recurrent or transient 
([19, 28]; A. Friedman, Stochastic Differential 
Equations and Applications I, II, 1975, and K. 
Ichihara, Publ. Res. Inst. Math. Sci., 14 (1978)). 
Explicit formulas for the transition probability 
of the Brownian motion on a hyperbolic space 
are given in [29]. For information related to 
the asymptotic behavior of diffusion processes 
as t|0 we refer the reader to Varadhan (Comm. 
Pure Appl. Math., 20 (1967)) and S. A. Mol- 
chanov (Russian Math. Surveys, 30 (1975)). 
Diffusion approximations for suitably nor- 
malized random sequences have been studied 
extensively beginning with the work of A. J. 
Khinchin [30]. The theory of tlimiting distri- 
butions of sums of independent (or weakly 
dependent) random variables is among the 
best-known examples (— 250 Limit Theorems 
in Probability Theory). For information re- 
lated to the theory of diffusion approxima- 
tions, see Yu. V. Prokhorov (Theory of Prob. 
Appl., 1 (1956)), A. V. Skorokhod [16], and 
Stroock, Varadhan, and G. Papanicolaou 
(Proc. 1976 Duke Turbulence Conference). 
Recently, many interesting examples of multi- 
dimensional diffusion processes have also been 


t) uon P x (dw), 
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introduced to describe probabilistic models in 
physics, biology, etc. (e.g., R. L. Stratonovich, 
Topics in the Theory of Random Noise I, II, 
1963; J. F. Crow and M. Kimura, An Introduc- 
tion to Population Genetics Theory, 1970; and 
K. Sato, Proc. Int. Symp. Stochastic Differential 
Equations, K yoto, 1976). 

In general, if a diffusion process 1 X,] is 
given on a noncompact space S and X, has 
no limit points in S as (1C, then some natural 
compactification should be induced by (Xj. 
The notion of a *Martin boundary for Markov 
processes is introduced in this connection. 
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116 (XX.2) 
Dimensional Analysis 


The system of units for physical quantities is 
derived from a certain set of fundamental 

units. If the fundamental units are denoted 

by 0, 9, W, etc., any other unit a (called a de- 
rived unit) can always be expressed in the form 
a=cOlo™y” ... (c, 1, m,n, ... are constants), by 
definition or by physical laws. The exponents l, 
m,n,... are called the dimensions of «, and the 
content of the previous statement is expressed 
as [x] 2 [0! o" y" ... ], which is called the di- 
mensional formula. The usual practice is to 
take as fundamental units length, time, mass, 
temperature, and energy, which are denoted by 
L, T, M, 0, and H, respectively. Dimensional 
analysis investigates the relation between 
physical quantities by use of the z theorem 
and the law of similitude given below. 


A. The z Theorem 


If a relationship f(a, B, ...) 2 0 holds among n 
physical quantities a, f, ... independently of 
the choice of fundamental units, the equation 
f (a, B, ...) - 0 can always be transformed into 
F(n;,n,,...)20, where the z; are n—m dimen- 
sionless quantities (m is the number of funda- 
mental units) of the form z,;— «^fi... . If we 
choose the x; so that zt; =f "1y 7! ... and 73, 
715, etc. do not contain a, then f —0 implies 
a= p'y^ ...,(n,,7,,...), which clearly shows 
the manner in which the quantity « is related 
to other quantities fi, y, .... 


B. The Law of Similitude 


In general, if two physical systems of the same 
kind have the same values of the z;, then the 
physical states of the systems are similar. If 
we are given a family of mutually similar sys- 
tems, it is sufficient to observe a particular one 
among them (a “model”) in order to estimate 
physical values attached to any one of the 
given systems. 

Consider, for example, the case of the drag 
D acting on geometrically similar bodies 
placed in the flow of a viscous incompressible 
fluid. If v is the velocity, | the representative 
length of the body, p the density of the fluid, 
and u the viscosity (which has the dimensional 
formula ML !T'!) then the z theorem gives 
D/pv? I? = f(pvl/u). Hence the drag coefficient 
as given by the left-hand side can be obtained 
by the experiments performed on a geometri- 
cally similar model. The dimensionless quan- 
tity R=vl/v (v= u/p) is called the Reynolds 
number. If the wave resistance due to gravity 
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as well as the effect of compressibility are 
taken into account, gravitational acceleration 
g and sound velocity a must be included, so 
that we have D/pv? P? = f(vl/v, v? /lg, v/a, C,, 
C,,...), where C,, C,,... are other dimension- 
less quantities depending on the physical 
properties of the fluid. Fr — v?/lg is called the 
Froude number, and M — v/a the Mach number. 
Next, consider the case of heat transfer 
between a solid surface and a flowing fluid. 
Let the area of the solid surface be denoted 
by S, the heat transferred per unit time by 
Q (HT )), the thermal conductivity of the 
fluid by k (HL! T^! 07!), the specific heat by 
C (HM 16°), the two representative tempera- 
tures by T, and Tj, and the representative 
length by L where expressions in parentheses 
represent dimensional formulas. Then we have, 
as dimensionless quantities, the Nusselt num- 
ber Nu — Q/(kS(T, — To)/1), the Prandtl number 
Pr — v/k (K= k/pc), the Grashoff number Gr = 
d?g(T, — To)/v? To, and R, so that from the z 
theorem we have the relation Nu— f(R, Pr, Gr, 
C,,C,,...). Furthermore, Pe— vl/k = Pr R is 
called the Péclet number. 
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A. Introduction 


Toward the end of the 19th century, G. Cantor 
discovered that there exists a one-to-one corre- 
spondence between the set of points on a line 
segment and the set of points on a square; and 
also, G. Peano discovered the existence of a 
tcontinuous mapping from the segment onto 
the square. Soon, the progress of the theory of 
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point-set topology led to the consideration of 
sets which are more complicated than familiar 
sets, such as polygons and polyhedra. Thus it 
became necessary to give a precise definition 
to dimension, a concept which had previously 
been used only vaguely. In 1913, L. E. J. 
Brouwer [9] gave a definition of dimension 
based on an idea of H. Poincaré. In 1922, the 
foundations of dimension theory for separable 
metric spaces were established by K. Menger 
[11] and P. Uryson [10]. Subsequently, P. S. 
Aleksandrov and W. Hurewicz contributed 
much to the development of the theory. The 
foundations of dimension theory for general 
metric spaces were established independently 
by M. Katétov amd K. Morita. More general 
theory for *normal spaces has also been inves- 
tigated; the same results as in metric spaces, 
however, do not always hold. 


B. Definition of Dimension 


Let X be a normal space. If any finite open 
*covering of X has an open covering of forder 
<n + 1 as its refinement (— 425 Topological 
Spaces R) (i.e., if for any open sets G; (i= 1, 
...,8) such that X =G, U...UG,, there exist 
open sets H; (i— 1,...,s) such that H,<G,, X = 
H,U...UH, and any n 4-2 of the H; have no 
point in common), then we write dim X <n. If 
dim X <n but dim X € n— 1 does not hold, 
then we define X to be n-dimensional and 
write dim X =n. We call dim X the cover- 

ing dimension, (or Lebesgue dimension) of 

X. The idea behind this definition is due to 
H. Lebesgue. 

There are other definitions of dimension 
that are given inductively. Let us define 
Ind X = —1 if X is empty. If for any pair 
consisting of a closed set F and an open set G 
with F c G in X there exists an open set V 
such that Fc Vc G and Ind(V — V) xn— 1, 
then we define Ind X <n. Next, we define 
ind @ = —1. For any point p of X and any 
neighborhood G of p, suppose that there exists 
an open neighborhood V of p such that Vc 
G and ind(V — V) n — 1. Then we define 
ind X <n. As before, we set Ind X =n (ind X 
=n) if Ind X <n (ind X <n) but Ind X <n 
— 1 (ind X € n — 1) does not hold. (The defi- 
nition of ind X is due to Menger.) We call 
Ind X (ind X) the large inductive dimension of 
X (the small inductive dimension of X). 

If dim X <n does not hold for any n, then 
X is called infinite-dimensional, written 
dim X = oo; we define Ind X = oo and ind X = 
oo similarly. These dimensions are invariant 
under thomeomorphisms. 

The set of irrational points in a Euclidean 
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space, the tCantor discontinuum, and tBaire 
zero-dimensional spaces are all 0-dimensional. 
The set of rational points in a separable 
*Hilbert space is 1-dimensional. 


C. Dimension of Metric Spaces 


The following theorems hold for the dimension 
of metric spaces (M. Katétov, Czechoslovak 
Math. J., 2 (1952); K. Morita, Math. Ann., 128 
(1954)). Let X and Y be metric spaces. The 
equality dim X = Ind X holds. If Y c X, then 
dim Y x dim X. If X is a union of a countable 
number of closed sets F, (i2 1,2, ...), then 
dim X = max(dim F;) (sum theorem for dimen- 
sion). The inequality dim(X U Y) x dim X + 
dim Y+ 1 holds. If dim X =n, then X isa 
union of n+ 1 0-dimensional subsets (decom- 
position theorem for dimension). We have 
dim(X x Y) dim X + dim Y, where X z Ø 
(product theorem for dimension). 

Each of the following 1s a necessary and 
sufficient condition for dim X <n: (i) There 
exists a subspace A of a Baire zero- 
dimensional space B (t) and a continuous 
closed mapping f of A onto X such that f !(x) 
consists of at most n + 1 points for each point 
x of X (K. Morita, Sci. Rep. Tokyo Kyoiku 
Daigaku, 5 (1955)); (11) there exists a metric of 
X which gives the same topology on X such 
that for any positive number c, any point x of 
X, and any n 4 2 points x; (i2 1,...,n4- 2) 
at a distance less than e from the (¢/2)- 
neighborhood of x, there are at least two 
points x; and x; (ij) with distance <e (J. 
Nagata, Fund. Math., 45 (1958)). 

Hurewicz’s problem asked whether the 
equality dim X =n +m (m 0) implies the 
existence of an m-dimensional space A and a 
mapping f of A onto X having property (i). It 
was solved affirmatively for separable metric 
spaces by J. H. Roberts and for general metric 
spaces by K. Nagami (Japan. J. Math., 30 
(1960)). 

If X is the union of a countable number of 
closed tstrongly paracompact subspaces, in 
particular if X is separable, then Ind X = 
ind X [1,2]. However, it was shown by P. 
Roy (Bull. Amer. Math. Soc., 68 (1962)) that 
this equality does not hold in general. 


D. Euclidean Spaces and Dimension 


The n-dimensional tEuclidean space R” is 
exactly n-dimensional in the sense mentioned 
above; thus this concept of dimension agrees 
with our intuition. The proof of dim R"zn 
comes from Lebesgue’s theorem: If cach mem- 
ber of a finite closed covering of an n-cube has 
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sufficiently small diameter, then the order of 
the covering is not less than n+ 1. (The proof 
of dim R" <n is easy.) Let X be a subset of R" 
and f a homeomorphism from X onto a sub- 
set f( X) of R". If x is an interior point of X, 
then f(x) is an interior point of f(X). Also, if 
an open set A of R" is homeomorphic to a 
subset B of R", then B is open in R" (Brouwer's 
theorem on the invariance of domain [8 ]). 
This theorem holds for any manifold but not 
for general separable metric spaces. By the 
theorem of invariance of domain it can be 
shown that R” and R”, mz n, are not homeo- 
morphic (theorem on invariance of dimension 
of Euclidean spaces). Any n-dimensional 
separable metric space is embedded in a 
Euclidean space R?"*!, or, more precisely, in 
the subset of R?"*! consisting of all points x 
of which at most n coordinates are rational 
(Menger-Nóbeling embedding theorem, G. 
Nóbeling, Math. Ann., 104 (1930)). Thus, from 
the topological point of view, any finite- 
dimensional separable metric space can be 
identified with a subset of a Euclidean space. 
Moreover, it is known that any n-dimensional 
separable metric space is homeomorphic to a 
subset of some n-dimensional compact metric 
space. 

If F 1s a bounded closed subset of R", then 
dim F <n if and only if for any positive num- 
ber e, there exists a continuous mapping f 
from F into an n-dimensional polyhedron in 
R" such that the distance between x and f(x) 
is less than e for each point x of F. 


E. Dimension of Normal Spaces 


Let X be a normal space. Then Ind X 2 dim X 
and Ind X z ind X, but the equalities do not 
necessarily hold here. The following theorems 
were obtained by E. Cech, Aleksandrov, C. H. 
Dowker, E. Hemmingsen, and Morita [1]. If 
dim X <n, then any locally finite open cover- 
ing of X has an open covering of order <n+ 1 
as its refinement; if A is an tF, subset of X 

or A is strongly paracompact, then dim A x 
dim X; if X has a ta-locally finite closed cover- 
ing {F,\, then dim X = max(dim F,). 

In order that dim X <n, it is necessary and 
sufficient that any continuous mapping from a 
closed subset of X into an n-sphere S" can be 
extended continuously to X. If X and Y are 
tparacompact and X is tlocally compact, 
or if X x Y is strongly paracompact, then 
dim(X x Y)<dim X + dim Y, where X z Ø; if 
X is a tCW complex, then the equality holds 
[14]. M. L. Wage (Proc. Nat. Acad. Sci. US, 
75 (1978)) proved under the continuum hypo- 
thesis (CH) that dim(X x Y) x dim X + dim Y 
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does not hold in general even if X x Y is local- 
ly compact and normal, and dim X = dim Y= 
0; T. Przymusinski (Proc. Amer. Math. Soc., 
76 (1979)) noted that CH can be avoided by a 
modification of Wage's construction. Katétov 
(Casopis Pést. Mat. Fys., 75 (1950)) proved 
that dim X is determined by the ring C*(X) of 
bounded real-valued continuous functions 

on X. 


F. Homological Dimension 


Aleksandrov contributed much to the develop- 
ment of dimension theory in introducing the 
concept of homological dimension (Math. 
Ann., 106 (1932)). The homological dimension 
of a compact Hausdorff space X with respect 
to an Ábelian group G is the largest integer n 
such that the n-dimensional *Cech homology 
group H,(X, A; G) is nonzero for some closed 
subset A of X. The cohomological dimension 
D(X; G) is defined similarly by using the tCech 
cohomology group H"(X, A; G). If dim X < oo, 
then dim X = D(X; Z) (Z is the additive group 
of integers). The cohomological dimension of 
X with respect to an arbitrary Abelian group 
is determined by the cohomological dimension 
with respect to some specified groups, and the 
cohomological dimension of the product space 
X x Y is expressed in terms of those of X and 
Y (M. F. Bokshtein, [5]). A compact Hausdorff 
space X has the property that dim(X x Y)= 
dim X + dim Y for any compact Hausdorff 
space Y if and only if dim X 2 D(X; Q(p)), 
where Q(p) is the additive group of rationals 
mod 1 of the form m/p? for any prime number 
p (V. Boltyanskii, [5]); this result holds also 
when X is paracompact (Y. Kodama, J. Math. 
Soc. Japan, 18 (1966)). 


G. Dimension and Measure 


Let X be a separable metric space. Then 
dim X <n if and only if X is homeomorphic 
to a subset of a Euclidean space R?"*! whose 
(n+ 1)-dimensional tHausdorff measure is 0 
(E. Szpilrajn. Fund. Math., 28 (1937); also — 
[1,2]). The infimum of «>0 such that the 
Hausdorff measure A,(X) of dimension « 
vanishes is called the Hausdorff dimension of 
X. 


H. Dimension Type (Fréchet's Definition) 
In analogy to the theory of tcardinal numbers 


in set theory, M. Fréchet (1909) defined the 
dimension type of topological spaces as fol- 
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lows: Two spaces X and Y are said to have the 
same dimension type if X is homeomorphic to 
a subset of Y and Y is homeomorphic to a 
subset of X. 


I. Infinite-Dimensional Spaces 


If X is a metric space with 0 « dim X < œ, 
then for each positive integer m with m< 

dim X, X contains a (closed) subset S with 
dim $ =m. Tumarkin asked the following 
question: For an infinite-dimensional com- 
pact metric space X and for each positive 
integer m, does X contain a closed subset 

5 with dim S =m? D. Henderson (Amer. J. 
Math., 89 (1967)) answered this question in the 
negative. Furthermore, J. Walsh (Bull. Amer. 
Math. Soc., 84 (1978)) constructed an infinite- 
dimensional compact metric space X such 
that if S is an arbitrary subset of X with 
dim S — 0 then dim S = oo. 
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Diophantine Equations 


A. General Remarks 


A Diophantine equation is an *algebraic equa- 
tion whose coefficients lie in the ring Z of 
rational integers and whose solutions are 
sought in that ring. The name comes from 
Diophantus, an Alexandrian mathematician of 
the third century a.D., who proposed many 
Diophantine problems; but such equations 
have a very long history, extending back to 
ancient Egypt, Babylonia, and Greece. As 
early as the sixth century B.C., Pythagoras is 
said to have partially solved the equation x? + 
y?=27 by x=2n+ 1, y=2n? 4 2n, z y 4 1. 

A general solution is given by the Pythagorean 
numbers x ^m? —n?, y -2mn, z o m? +n’. 
*Fermat’s problem also concerns a Diophan- 
tine equation. 

Systematic studies of Diophantine equations 
over Z have been made for the linear equation 
DL, a,x; =a (aj, a€ Z) and for the quadratic 
equation ax? - bxy - cy? =k (a, b, c, keZ) in 
two unknowns. The latter forms a principal 
topic of C. F. Gauss's Disquisitiones arithme- 
ticae and can be regarded as a starting point of 
modern algebraic number theory. The special 
quadratic equation t? — Du? = +4 (De Z) is 
called Pell’s equation. If D <0, then Pell’s 
equation has only a finite number of solutions. 
If D >0, then all solutions t,, u, of Pell's equa- 
tion are given by +((t, +u DYDY - (t, + 
us / D)/2, provided that the pair t4. u, isa 
solution with the smallest t, +u, JD» 1 [15]. 
Using continued fractions (— 83 Continued 
Fractions), we can determine t,, u, explicitly. 
A general quadratic Diophantine equation 
ax? + bxy - cy? =k with two unknowns can 
be solved completely if we use solutions of 
Pell's equation; this 1s an application of the 
arithmetic of quadratic fields (— 347 Qua- 
dratic Fields) [1]. On quadratic Diophantine 
equations of several unknowns, thére are deep 
studies by C. L. Siegel (— 348 Quadratic 
Forms). 

Diophantine problems consist of giving 
criteria for the existence of solutions of alge- 
braic equations in rings and fields and even- 
tually determining the number of such solu- 
tions. The fundamental ring of interest 1s 
Z and the fundamental field of interest is Q. 
One discovers rapidly, however, that to have 
all the technical maneuverability necessary for 
handling general problems, one must consider 
rings and fields of finite type over Z and Q. 
Furthermore, one is led to consider finite fields 
and local fields when one deals with a locali- 
zation of the problems under consideration. 
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Techniques from various fields of mathematics, 
e.g., algebraic number theory, algebraic geom- 
etry, analysis, Diophantine approximation, 
etc., have been successfully applied to solve 
Diophantine problems. However, much re- 
mains unsolved today. Yu. V. Matiyasevich 
(1970) showed that Hilbert’s tenth problem is 
unsolvable; there is no general method of 
telling whether a Diophantine equation has a 
solution. This theorem in a sense indicates the 
complexity of Diophantine problems. For 
many centuries, no other topic has engaged 
the attentions of so many mathematicians, 
both professional and amateur, or resulted 

in so many published papers. For these mis- 
cellaneous results — Dickson [1] and Mordell 


[2]. 


B. Equations over Finite Fields 


Let k be a finite field of characteristic p con- 
sisting of q (= p^) elements. Chevalley’s theo- 
rem: Let f be a tform of degree d in n vari- 
ables with coefficients in k such that d<n. 
Then f=0 has a nontrivial solution in k. A 
generalization is Warning’s theorem: Let 
Jis- f, be polynomials with coefficients in k in 
n variables of degrees d,, ...,d,, respectively, 
and suppose that d=d,+...+d,<n. Then the 
number N of common zeros of f,, ..., f, satis- 
Des N =0 (mod p). Warning's second theorem 
asserts that if N >0 then N za d Warning's 
theorem was also improved by J. Ax (1964) to 
the effect that N =0 (mod q^) for any integer 

b « n/d [3]. For equations over finite fields, 
counting the number of solutions is important. 
Let f(x, y) be an absolutely irreducible poly- 
nomial in x and y over k. Let N denote the 
number of zeros in k of f(x, y). A. Weil proved 
IN —q| KIRCH c(d), where g is the genus of 
the curve f(x, y) 2 0 and c(d} is a constant 
depending on d. Weil's proof requires the 

use of deep results from algebraic geometry. 
This theorem is equivalent to the *Riemann 
hypothesis for algebraic curves over finite fields 
[4]. Later, using Stepanov's method, W. M. 
Schmidt and E. Bombieri (1973) independently 
gave new proofs which do not depend on 
algebraic geometry [3]. P. Deligne [5] proved 
a far-reaching generalization of Wels theorem 
to tnonsingular absolutely irreducible equa- 
tions in n variables. He showed | N — 9"! | = 
Oto "17 This is a part of the *Weil conjec- 
ture for zeta functions of algebraic varieties 
over finite fields (— Section E; 450 Zeta Func- 
tions Q). Schmidt obtained in an elementary 
manner a weaker estimate |N — q" ! |  O(q" 37) 
but without the assumption of nonsingularity 
(— Section F). 
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C. Equations over Local Fields 


A method of solving problems in number 
theory by use of embeddings of the ground 
field into its tcompletions is called a local 
method. Such methods have important conse- 
quences when applied to Diophantine equa- 
tions. Let f be a polynomial in n variables 
with rational integer coefficients. The con- 
gruence f =0 (mod p*) is solvable for all k> 1 if 
and only if f =0 is solvable in tp-adic integers. 
This is an easy consequence of the com- 
pactness of the ring of p-adic integers. We 

can solve f —0 in p-adic integers provided 
that we can solve an infinite sequence of con- 
gruences. It is generally difficult to tell when 
we may limit our consideration to only a finite 
number of these. In this respect, the following 
lemma is most useful. Hensel's lemma: Let 

f (x4, ..., x,) be a polynomial whose coeffi- 
cients are p-adic integers. Let y,, ..., y, be p- 
adic integers such that for some i (1 <i<n) and 
an integer 6>0, we have f(7,,...,7,)=0 

(mod p??*!) and 0f/Ox;(y,, ..., Yn) : 0 (mod p°), 
+0 (mod p?*1). Then there exist p-adic integers 
0,, ..., 0, such that f(0,,...,0,) 20 and 6;z y 
(mod p?*!) for i=1,...,n. The case 6=0 is 
often useful; it implies that a nonsingular 
solution mod p can be extended to a p-adic 
solution. Generalization to simultaneous equa- 
tions is also known [6]. Skolem's method is 
sometimes useful when we investigate certain 
types of equations over flocal fields. This 
method is based on some simple properties of 
local analytic manifolds over local fields [7]. If 
a quadratic form has zeros in each local field, 
then it has a rational zero (‘Minkowski-Hasse 
theorem). When a theorem of this type holds, 
we say that the tHasse principle holds (— 348 
Quadratic Forms). For forms of higher degree, 
the Hasse principle no longer holds even if the 
forms are absolutely irreducible and nonsin- 
gular. Counterexamples were first found for 
cubic (E. S. Selmer, 1951) and quintic (M. 
Fujiwara, 1972) forms. Asymptotic formulas in 
Waring’s problem (— 4 Additive Number 
Theory E) can be regarded as an analytic form 
of the Hasse principle. As to the quantitative 
formulation of the Hasse principle, there are 
deep results of Siegel for quadratic forms 

and their generalization by T. Tamagawa. R. 
Brauer (1945) showed that forms in sufficiently 
many variables represent zero in all p-adic 
fields. Forms of odd degree represent zero in 
the field of rational numbers if the number of 
variables is sufficiently large compared with 
the degree (B. J. Birch, 1957). Let f be a poly- 
nomial with p-adic integer coefficients and c,, 
(mz 0) be the number of solutions to the con- 
gruence f =0 (mod p"). The series y(t) = 

=o Cmt™ is called the Poincaré series of f. 
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J. Igusa (1975) proved, by using his theory of 
asymptotic expansions together with Hiro- 
naka's tresolution theorem, that (t) is a 
rational function of t [8]. 


D. Integral Solutions of Some Diophantine 
Equations 


In this section we are concerned with those 
equations for which some “theory” exists. For 
isolated results > [1,2]. 


(1) Binary Forms. Thue’s theorem (1908): If 
f(x) 2 X89 a,x"(a,eZ, n> 2) has distinct roots, 
then the number of rational integral solutions 
of Shp a,x’ y" "=a (Zəa #0) is finite. This 
theorem is a direct consequence of Thue’s 
theorem on Diophantine approximation, 
which says that there are only a finite number 
of rational numbers p/q (p,qeZ,q>0) with 

Io — p/q| « 1/4 ??*! for a given algebraic num- 
ber « of degree n (n 2) [9, p. 122]. K. F. Roth 
proved that (n/2) 4- 1 in this formula can be 
replaced by 2 +e (e is an arbitrary positive 
number independent of n) (Mathematika, 2 
(1955), 1-20). Roth's theorem was generalized 
to some cases of number fields and function 
fields (2 182 Geometry of Numbers) and is 
applied to Diophantine equations [9, 10]. 

A. Baker (1968), using a completely different 
method, has given explicit upper bounds for 
the solutions of Thue's equations, thus enabl- 
ing one to compute effectively all the solu- 
tions. More precisely, if f in Thue's equation 
is irreducible over the rationals, then every 
integer solution (x, y) of the equation satisfies 
max(|x|,|y]) « exp((nH)' 9" + (loga)?"*?), 
where H is the *height of f. The proof of this 
remarkable theorem is based on his deep result 
concerning the lower bound for the linear 
forms in the logarithm of algebraic numbers 
(Mathematica, 13 (1966); 14 (1967)). Baker's 
method has been applied to elliptic, hyperellip- 
tic, and other curves (Baker, H. Stark, J. 
Coates, V. G. Sprindzhuk, etc.). 


(2) Higher-Degree Forms. A natural generali- 
zation of binary forms is a norm form. Let K 
be an algebraic number field of degree t >3 
and «,,...,a, be elements of K. Then the norm 
N(a, x4... a,x,) = Ii-, (oP x, +... +ax,), 
where ail denotes a conjugate of a, is a form of 
degree t with rational coefficients. It is easy to 
see that every form which has rational coeffi- 
cients and is irreducible over Q but which is a 
product of linear forms with algebraic coeffi- 
cients is a constant multiple of a norm form 
[7]. A module M in an algebraic number field 
K is called degenerate if M has a submodule N 
such that, for some ae K, aN is a full module 


452 


in some subfield K' of K, where K' is neither Q 
nor an imaginary quadratic field. The most 
basic in the theory of norm forms is (W. M.) 
Schmidt’s theorem: Let o,, ...,, be linearly 
independent over Q and suppose that the 
module generated by 2, ...,«, is nonde- 
generate; then N(a,x,+...+4%,X,)=c (CEQ) 
has only finitely many solutions in integers 

X1, ..., Xn. (Math. Ann., 191 (1971)). The proof 
Is based on his remarkable result on *simulta- 
neous approximation which generalizes Roth's 
theorem (— 182 Geometry of Numbers G). 
There are investigations on special norm forms 
(T. Skolem, N. I. Fel'dman, K. Ramanathan, 
K. Gyóry, M. Fujiwara, etc.). For general 
forms of higher degree, not much is yet known 
except for the additive forms (— 4 Additive 
Number Theory E). H. Davenport [11] proved 
that if f(x) is a cubic form with rational in- 
teger coefficients in n variables, then f(x)- 0 
has a nontrivial integral solution, provided 
that nz 16. This theorem was proved by 
means of an exquisite application of the tcircle 
method together with some geometry of num- 
bers. A well-known conjecture is that nz 10 
instead of n > 16. It is known that over local 
fields, any cubic form in 10 variables has a non- 
trivial zero. There are various results of this 
type for simultaneous additive, quadratic, and 
cubic forms (Davenport, D. Lewis, R. Cook, 
Schmidt, etc.) [17]. A satisfactory theory of 
forms of higher degree, like that of quadratic 
forms, is not yet known but is quite desirable. 
In this vein, Igusa has obtained some new 
results of considerable interest, e.g., a tPoisson 
summation formula for higher-degree forms, 
using his theory of asymptotic expansions [8]. 


(3) Algebraic Curves. The fundamental re- 

sult is Siegel's theorem (1929): Assume that 

the equations fj(X,,..., X,) 20 (1x im) 
determine an algebraic curve with a positive 
*genus in an taffine space of dimension n. 

Then the number of rational integral solutions 
of f(X,,..., X,) -O (1 «i m) is finite. This 
theorem was generalized by S. Lang in the fol- 
lowing form: Let K be a finitely generated field 
over Q and / a subring of K that is finitely 
generated over Z. Furthermore, let C bea 
nonsingular projective algebraic curve with a 
positive genus defined over K, and let o bea 
rational function on C defined over K. Then 
there are only a finite number of points P on C 
with o(P)e1 [10]. The proof of this theorem is 
based on a generalization of Roth's theorem in 
the above sense and on the weak Mordell-Weil 
theorem (— Section C). A. Robinson and P. 
Roquette gave another approach to Siegel's 
theorem from the standpoint of nonstandard 
arithmetic (J. Number Theory 7 (1975)). On the 
other hand, a necessary condition for the exis- 
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tence of infinitely many solutions of f(X, Y)— 
0 with rational integral coefficients was given 
by C. Runge (J. Reine Angew. Math., 100 
(1887)). 


(4) Elliptic Curves. An elliptic curve E is an 
*Abelian variety of dimension 1, or what is the 
same, an irreducible nonsingular tprojective 
algebraic curve of *genus 1 furnished with 

a point O as origin. The *Riemann-Roch 
theorem defines a group law on the set of 
*divisor classes of E. Actually, if P, P’ are 
points of E, then there exists a unique point 
P" such that (P) - (P^) -(P") 4- (O), where ~ 
means linear equivalence, i.e., the left-hand 
side minus the right-hand side is the divisor of 
a rational function on the curve. The group 
law on E is then defined by P+ P' = P". If the 
characteristic #2 or 3, using the Riemann- 
Roch theorem one finds that the curve E can 
be defined by a Weierstrass equation y? =x? + 
ax +b with a, b in the ground field over which 
the curve is defined. Conversely, any homo- 
geneous nonsingular cubic equation has genus 
1 and defines an elliptic curve in the projective 
plane once the origin has been selected. If both 
the curve and the origin are defined over a 
field k, then the group law is also defined over 
k, and it becomes a 1-dimensional Abelian 
variety defined over k. If the ground field k is 
the field of complex numbers, the group law 

is the same as that given by the *addition 
theorem of the *Weierstrass (o-function with 
invariants g, = —4a and g4— —4b through the 
parametrization x= p (u), y=} (u). Much of 
the Diophantine theorems on elliptic curves 
are generalized to Abelian varieties. Here we 
shall deal mainly with elliptic curves defined 
by Weierstrass equations over Q. Extension to 
algebraic number fields usually causes no trou- 
ble. For more general elliptic curves — [19]. 
The Lutz-Mattuck theorem (— Section E) 
obviously implies that the points of finite order 
in E, (k 2 Q,) form a finite group. This torsion 
group is computable. In case a and b are in Z 
then any point of finite order in E, has coordi- 
nates (x, y) in Z and, if y z0, y?|4a* - 27b? 
(Lutz-Nagell). The WC group (Weil-Chátelet 
group) of E over k is the birational class of 
principal homogeneous spaces over k. The 
extent of validity of the Hasse principle for 
elliptic curves can be measured by the Tate- 
Shafarevich group, which is defined as the set 
of elements of the WC group that are every- 
where locally trivial. This group is conjectured 
to be a finite group. For other results and 
interesting conjectures — [12-14]. The num- 
ber of integral points on elliptic curves is finite 
according to Siegel's theorem on algebraic 
curves. Explicit bounds for the size of these 
points have been given for several types of 
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elliptic curves by using Baker's method. For 
example, if f(x, y) is an absolutely irreducible 
polynomial with coefficients in Z such that the 
curve f =0 has genus 1, then max(|x|, |y) « 
expexpexp((2H)"), where m= 10°, d — deg f, 
and H is the height of f (Baker and Coates, 
1970. The method of proof was to reduce it to 
the Weierstrass equation case, which had been 
treated earlier by Baker, with a better bound.) 
By the tMordell-Weil theorem (— Section E), 
Aq X Z' x finite torsion group. Here r is called 
the rank of E over Q. There is a rather doubt- 
ful conjecture to the effect that the rank r is 
bounded. The rank r is conjectured to be equal 
to the order of the zero of Lis E) at s = 1 (Birch- 
Swinnerton-Dyer conjecture). Much numerical 
and theoretical evidence supports this famous 
conjecture [13]. 


E. Rational Points of Algebraic Varieties 


Let V be an 'abstract algebraic variety defined 
over a field k, and let P be a point of V. Then 
P is called a rational point over k of V if the 
coordinates of the trepresentative P, contained 
in an "affine open set V, of V are in k (2 16 
Algebraic Varieties D). This definition is inde- 
pendent of the choice of the representative P. 
In particular, if V is a tprojective variety, the 
point P given by the thomogeneous coordi- 
nates (xo, x4, ..., x,) is rational if and only if 
x//xy€k (0 in, x, 0). In the following we 
state main results concerning rational points 
of algebraic varieties, especially results con- 
cerning ‘Abelian varieties, restricting k to be 
either an talgebraic number field of finite 
degree, a tp-adic number field, or a *finite field. 


Mordell-Weil Theorem. Let A be an Abelian 
variety of dimension n defined over an alge- 
braic number field k of finite degree. Then 

the group A, of all k-rational points on A is 
finitely generated. This theorem was proved 
by L. J. Mordell (1922) for the case of n=1 
and by Weil (1928) for the general case [10]. 
The assertion that A,/mA, is a finite group 

for any rational integer m is called the weak 
Mordell-Weil theorem; this theorem is basic in 
the proof of the Mordell- Weil theorem and is 
used in the proof of Siegel's theorem, too. A 
generalization of the Mordell-Weil theorem is 
obtained when k is a field (of arbitrary charac- 
teristic) finitely generated over the tprime field 
[10]. 

If A is defined over a finite algebraic number 
field k, we have the following conjectures of 
Birch, Swinnerton-Dyer, and Tate on the rank 
of A,. Let p be a prime ideal of k at which A 
has a good 'reduction, and denote by A, the 
reduced variety. Let 2°)’, ..., 7” be the eigen- 
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values of the N(p)th power endomorphism of 
A, with respect to an l-adic representation, 
where N(p) denotes the norm of p (— 3 Abelian 
Varieties E, N), and put L,(s, A)= IZ, (1— 

n N(p) "VT The L-function of A defined by 
Lis A) - IT Lis A), where the product ranges 
over all good primes, is the principal part of 
the zeta function of A (— 450 Zeta Functions 
S). Birch and Swinnerton-Dyer conjectured 
that if k=Q and A is of dimension 1, then 
there exists a constant C #0 such that L(s, A) 
— C(s— IR as s>1. Tate generalized this con- 
jecture to any A and k. Moreover, the constant 
C, appropriately modified by factors corre- 
sponding to the bad primes and the infinite 
primes, is thought to be expressible in terms of 
certain arithmetic invariants of A [13]. 


Lutz-Mattuck Theorem. The group of rational 
points of an Abelian variety A of dimension n 
over a tp-adic number field k contains a sub- 
group of finite index isomorphic to the direct 
sum of n copies of the tring o of p-adic integers 
in k (E. Lutz, J. Reine Angew. Math., 177 
(1937); A. Mattuck, Ann. Math, 62 (1955)). 


Mordell's Conjecture. In his 1922 paper, in 
which the above theorem on the set of rational 
fields on I-dimensional Abelian varieties (1.e., 
on elliptic curves) was established, Mordell 
stated the conjecture: Any algebraic curve of 
genus g 2 2 defined over Q has only a finite 
number of rational points. The same can be 
conjectured for such curves defined over any 
algebraic number field k of finite degree. This 
had remained as a conjecture until 1983. In 
1961 I. R. Shafarevich conjectured: Let k be 
any algebraic number field of finite degree, S a 
finite set of finite prime spots of k, and g any 
natural number 22. Then there are, up to 
k-isomorphism, only a finite number of non- 
singular algebraic curves of genus g defined 
over k having good reduction at every finite 
prime spot outside S. 

In 1973, A. N. Parshin showed that Mor- 
dell's conjecture followed from this conjec- 
ture, which was finally proved in 1983 by 
G. Fallings [7]. Mordell’s conjecture was thus 
settled in the affirmative. Analogs of these 
conjectures on algebraic function fields over 
finite fields are easier than the original ones 
and had been proved in the 1960s for Mordell’s 
conjecture by Yu. I. Manin, H. Grauert, and 
M. Miwa, and for Shafarevich's conjecture by 
S. Alakelov, A. N. Parshin, and L. Szpiro. 


F. C;Fields 


Let F be a field, and let i20, d> 1 be in- 
tegers. Let f be a homogeneous polynomial 
of n variables of degree d with coefficients 
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in F. If the equation f =0 has a solution 

(x4, ..., x,) Z(0, ...,0) in F for any f with n» d, 
then F is called a C;(d)-field. If F is a C,(d)-field 
for any d 2: 1, then F is called a C;-field. In 
order for F to be a C,-field, it is necessary and 
sufficient that F be an talgebraically closed 
field. A C,-field is sometimes called a quasi- 
algebraically closed field. There exists no non- 
commutative algebra over a C,-field F. A 
finite field is C, (C. Chevalley (1936)). If F is 
algebraically closed, then F = F(X ) (rational 
function field of one variable) is a C,-field 
(Tsen’s theorem). A homogeneous polynomial 


f of n=d' variables of degree d with coeffi- 


cients in F such that f =0 has no solution in F 
except (0, ..., 0) is called a normic form of 
order i in F. If a C;-field Fy has at least one 
normic form of order i, then (1) Fo(X,,..., Xx) 
is a C,,,-field; and (ii) an extension of Fo of 
finite degree is a C,-field. A complete field F 
with respect to an fexponential valuation is a 
C,-field whenever its residue field F, is alge- 
braically closed. The field F of power series of 
one variable over a finite field Fo is a C;-field 
(Lang). E. Artin conjectured that a p-adic field 
Q, is a C;-field. It was proved by H. Hasse 
(1923) that Q, is a C;(2)-field and by D. Lewis 
(1952) that Q, is a C; (3)-field. However, G. 
Terjanian (1966) [17] gave a counterexample 
to Artin's conjecture; that is, he gave a quartic 
form of 18 variables with coefficients in Q, 
having only trivial zero in Q,. Ax and S. 
Kochen (1965) [18] proved that for any inte- 
ger d> 1 there exists an integer po(d) such 
that Q, is a C,(d)-field for p» po(d) (— 276 
Model Theory E). 
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Peter Gustav Lejeune Dirichlet (February 13, 
1805- May 5, 1859) was born of a French 
family in Düren, Germany. From 1822 to 1827 
he was in Paris, where he became a friend of 
J.-B. tFourier. In 1827, he was appointed 
lecturer at the University of Breslau; in 1829, 
lecturer at the University of Berlin; and in 
1839, professor at the University of Berlin. In 
1855, he was invited to succeed C. F. *Gauss 
at the University of Góttingen, where he spent 
his last four years as a professor. 

His works cover many aspects of mathemat- 
ics; however, those on number theory, analy- 
sis, and potential theory are the most famous. 
He greatly admired Gauss and is said to have 
kept Gauss's Disquisitiones arithmeticae at his 
side even when traveling. 

In number theory, he created the tDirichlet 
series and proved that a sequence in arithmetic 
progression contains infinitely many prime 
numbers, provided that the first term and the 
common difference are relatively prime. Also, 
using his *drawer principle," which states that 
if there are n+ 1 objects in n drawers then at 
least one drawer contains at least 2 objects, he 
clarified the structure of tunit groups of falge- 
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braic number fields. In tpotential theory he 
dealt with the *Dirichlet problem concerning 
the existence of fharmonic functions. He also 
gave tDirichlet’s condition for the convergence 
of trigonometric series. 
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A. The Classical Dirichlet Problem 


Let D be a bounded or unbounded fdomain in 
R” (nz 2) with compact boundary S. The class- 
ical Dirichlet problem is the problem of finding 
a *harmonic function in D that assumes the 
values of a prescribed continuous function on S. 
This problem is also called the tfirst boundary 
value problem (— 193 Harmonic Functions 
and Subharmonic Functions). In this article f 
always stands for a boundary function given 
on S. The problem is called an interior problem 
if D is bounded and an exterior problem if D 
is unbounded. In an exterior problem, it is 
further required that when an *inversion 
with center at an exterior point P, of D is per- 
formed on D and a tKelvin transformation 
is performed on the solution in D (when the 
solution exists), the function thus obtained on 
the inverted image of D be harmonic at P, 
(nz 3). When n=2, the solution in D, which 
is already regarded as a function on the in- 
verted image of D, is required to be harmonic 
at Py. Thus an interior problem can be trans- 
formed to an exterior problem, and vice versa. 
We now explain the history of the classical 
problem. 

Let D be a bounded domain with boundary 
S in R?. G. Green (1828) asserted that if S is 
sufficiently smooth, 


OG(P, Q) | 
u(P)= -El f(Q) — dol) (1) 
Ong 
is the solution for the Dirichlet problem, where 
f is prescribed on S, G(P, Q) is *Green's func- 
tion with the pole at Q in D, ng is the outward- 
drawn normal to S at Q, and do is the tsur- 
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face element on S. He took for granted the 
existence of Green's function from physical 
consideration of the problem. Thus his dis- 
cussion was not quite rigorous. This defect was 
corrected by A. M. Lyapunov (1898) under a 
certain condition on S. Denote by u, the New- 
tonian potential of a measure with density 
mz: 0 on S. Assume that a continuous function 
f on S and a positive constant a are given. In 
1840, C. F. Gauss investigated the existence of 
a density m, Z0 on S of total mass a which 
satisfies [(u,,, — 2f)m, de = min, fs(u,, — 

2f ym do, where the total mass of m is equal 

to a. He asserted also that u,,,—f is equal to a 
constant b on S. If f=0, then um, must be 
equal to a positive constant c on $, and hence 
Um, — DC Um, must be a solution of the ex- 
terior problem for the boundary function f on 
S. However, his discussion was incomplete 
because we cannot always ensure the existence 
of a density that gives a measure minimizing 
the integral. Moreover, even in the case where 
D is a ball, there exists a continuous function 
f 20 on S such that there is no Newtonian 
potential that is equal to f on S up to a con- 
stant (M. Ohtsuka, 1961). Gauss (1840), W. 
Thomson (Lord Kelvin) (1847), and G. L. 
Dirichlet solved the Dirichlet problem by 
making use of the so-called Dirichlet principle, 
which is explained in detail in Section F. After 
K. Weierstrass (1870) pointed out that there is 
a case where no minimizing function exists, D. 
Hilbert (1899) gave a rigorous proof of the 
Dirichlet principle under a certain condition. 
Meanwhile, C. G. Neumann (1870) solved 

the Dirichlet problem rigorously for the first 
time, although he assumed that D is a con- 
vex domain with a smooth boundary. First, 
he considered the potential W, —(1/2z): 

fs f(r !/On)dc of a double layer in D; 

then he formed the potential W, —(1/2z): 

[si (0r! /On)de of a double layer with the 
values f, of W, on S and defined W,, W}, ... 
similarly. The series W, — W, + W, — W, +... 
plus a suitable constant gives a solution to 

the exterior Dirichlet problem for the bound- 
ary function f. In 1887, H. Poincaré also used 
(1) to solve the Dirichlet problem. He ob- 
tained Green's function with the pole at O 

in a bounded domain D in the following man- 
ner: Let D' be the image of D by an inversion 
with center O, Sọ be a spherical surface sur- 
rounding the boundary OD of D. and u bea 
uniform measure on Sọ such that the potential 
of u is equal to 1 inside Se, By ‘sweeping out u 
to €D’, the solution in D’ of the exterior prob- 
lem for the boundary function 1 is obtained. 
A *Kelvin transformation of this solution 
yields the solution h in D of the interior prob- 
lem for the boundary function 1/OP. Now 
1/O P — h(P) is Green's function in D. In 1899 
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Poincaré used another method (without utiliz- 
ing (1)) to solve the Dirichlet problem [8]. He 
observed that it is sufficient to consider the 
case where f is equal to the restriction to S of 
a polynomial g and that g is expressed in D as 
the sum of the Newtonian potential of a signed 
measure t with density — Ag/(4x) and a func- 
tion that is harmonic in D and continuous on . 
DUS. If it is possible to sweep out x to 0D, 
then the solution is obtained. He showed that 
this is in fact the case if at every point P of S 
there exists a cone that is disjoint from D and 
has its vertex at P. This condition is called 
Poincaré's condition. In 1900, I. Fredholm 
discussed the Dirichlet problem by reducing it 
to a problem of tintegral equations. A domain 
D is called a Dirichlet domain if the (classical) 
Dirichlet problem is always solvable in D. H. 
Lebesgue (1912) showed that a solution is 
obtained by the method of iterative averaging 
in every Dirichlet domain. 


B. The Dirichlet Problem in a General Domain 


It has been believed that the classical Dirichlet 
problem is always solvable in every domain 
until S. Zaremba observed in 1909 that the 
problem is not always solvable for a punctured 
ball. In 1913, Lebesgue gave a decisive exam- 
ple in which the domain is homeomorphic to 
a ball and bounded by a surface sufficiently 
smooth except at one point. Thus the central 
interest shifted to finding a harmonic function 
in D that depends only on a continuous func- 
tion f given on OD and coincides with the 
classical solution when D is a Dirichlet do- 
main. Extend f to a continuous function in 
the whole space, and denote it by fo. Approxi- 
mate D by an increasing sequence {D,} of 
Dirichlet domains, and denote by v, the solu- 
tion in D, of the Dirichlet problem for the 
boundary function fọ. N. Wiener proved in 
1924 that u, coverges to a harmonic function 
that is independent of the choice of the exten- 
sion of f and {D,}. The problem of deciding 
where on OD the general solution assumes the 
given boundary values is treated in Section D. 
O. D. Kellogg (1928) found a general method 
that includes Poincaré's method of sweeping 
out, Schwarz's alternating method, and the 
result of Wiener. Both Poincaré's method of 
sweeping out and Lebesgue's method of itera- 
tive averaging yield Wiener's general solution. 


C. Perron's Method 
We explain O. Perron's method (1923) by 


considering the improved method of M. Brelot 
[1]. For simplicity we assume that the domain 
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D is bounded in R?. Let U be the family of 
subharmonic functions u bounded above and 
satisfying limsupp ,, u(P) < f(M) for any 
boundary point M. Define H,(P) as sup, u(P), 
where u runs through U, if this family is not 
empty; otherwise, set H, & —oo. Call H, a 
hypofunction. Define H; by — H and call it a 
hyperfunction. If H; — H,, the common func- 
tion is denoted by H,; if H,(P) « oo, then H; is 
harmonic. This is called a Perron-Brelot solu- 
tion (Perron-Wiener-Brelot solution or simply 
PWB solution). The method of defining H. is 
called Perron's method (the Perron-Brelot 
method or the Perron-Wiener-Brelot method). 

Wiener showed in 1923 that the tDaniell- 
Stone integral can be regarded as a general 
solution if a !Daniell-Stone integrable function 
f is given on the boundary of a Dirichlet 
domain; in 1925, he showed that the same is 
true for a general domain (not necessarily 
Dirichlet). He showed also that his solution 
coincides with the Perron-Brelot solution H, if 
f 1s continuous. Unfortunately, however, from 
a wrong example he concluded that H, +H, 
can hold even for a simple discontinuous f, 
and so he lost interest in Perron’s method. 
Brelot (1939) corrected Wiener’s erroneous 
conclusion and proved that the Daniell upper 
and lower integrals are equal to H ; and H. 
respectively. To any continuous f there corre- 
sponds an H,, and there exists a "Radon mea- 
sure pip satisfying H,(P) — f fdup. This measure 
is called a harmonic measure or harmonic 
measure function. Brelot showed that H, = H f 
if and only if f is zp-integrable for one (or 
every) P. In particular, If D is a Dirichlet 
domain and E is a closed set on the boundary 
OD, then the harmonic measure function up(E) 
takes the value 1 at an inner point (in the 
space CD) of E and vanishes on 6D — E. We 
note that pp is equal to the measure obtained 
by sweeping out the unit mass at P to OD. 


D. Regular Boundary Points 


If H,(P)o f(M) as P2 M eD for any con- 
tinuous function f on 0D, then M is called 

" regular. The regularity of a point is a local 
property. À boundary point that is not regu- 
lar is called irregular. The regularity of M is 
equivalent to the convergence of up as P M 
to the unit mass at M with respect to the 
*vague topology. There are many sufficient 
conditions and necessary conditions for a 
boundary point to be regular. The existence 
of a barrier is a qualitative condition that is 
necessary and sufficient for a boundary point 
to be regular. It was used by Poincaré and so 
named and used effectively by Lebesgue. A 
barrier is a continuous superharmonic func- 
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tion in D that assumes the boundary value 0 
at M and has a positive lower bound outside 
every ball with center at M. À positive super- 
harmonic function defined in the intersection 
of D and a neighborhood of M and taking the 
boundary value 0 can be used as a barrier. A 
necessary and sufficient condition for a bound- 
ary point M to be regular is the existence of a 
Green's function in D assuming the value 0 at 
M. This condition was given by G. Bouligand 
(1925), and it follows from the existence of a 
barrier. Another necessary and sufficient con- 
dition of a quantitative nature was obtained 
by Wiener. It is equivalent to the requirement 
that the complement of D is not Thun at M 
(— 338 Potential Theory G). Kellogg conjec- 
tured that the set of irregular boundary points 
is of capacity zero and verified this in R? 
(1928). The conjecture was proved first by G. 
C. Evans (1933) in R?, and different proofs 
were given by F. Vasilesco (1935) and O. 
Frostman (1935). The conjecture is also true 
in R" for n>4. 


E. The More General Dirichlet Problem 


So far, we have been concerned with R". More 
generally, Brelot and G. Choquet [3] obtained 
the following result in a Green space & (— 193 
Harmonic Functions and Subharmonic Func- 
tions): Consider a metric space that contains & 
and in which @ is everywhere dense, and de- 
note by A the complement of & with respect to 
the space. Let {F} be a family of ffilters on & 
such that each F converges to a certain point 
of A. Suppose that u x 0 whenever u is sub- 
harmonic and bounded above on & and 
limsupu <0 along every F. Assume the exis- 
tence of a barrier v in a neighborhood in & 

of the limit point Q of every F; that ts, v is to 
be positive superharmonic, to tend to 0 along 
F, and to have a positive lower bound outside 
every neighborhood of Q. Under these as- 
sumptions, we obtain the PWB solution on & 
as in R?. There are various examples of A and 
F that satisfy these conditions. In particular, L. 
Naim [6] investigated in detail the case where 
A is the *Martin boundary. More generally, 

it is possible to treat the Dirichlet problem 
axiomatically (c 193 Harmonic Functions 
and Subharmonic Functions). 


F. The Dirichlet Principle 


Let D be a bounded domain with a sufficiently 
smooth boundary in R" and f be a piecewise 
C'-function on D with finite Dirichlet integral 
Il f£ |? — fplgrad f? dt, where dt is the volume 
element. Suppose that f has a continuous 
boundary value o on OD The classical Dirich- 
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let principle asserts that the solution of the 
Dirichlet problem for 9 has the smallest Dir- 
ichlet integral among the functions that are 
piecewise of class C! in D and assume the 
boundary value ø. In a general domain, H, 
minimizes ||u— f | among harmonic functions 
u in D. Brelot [2] discussed the principle for a 
family of competing functions that are defined 
in a domain in & and whose boundary values 
cannot be defined in the classical manner. 
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A. Dirichlet Series 


Forz-x- iy, 4,7 0, and 4,7 4- oo, the series of 
the form 


f= Š, ayexp(— 22) (1) 


is called a Dirichlet series (more precisely, a 
Dirichlet series of the type {/,}). If 4,— n, then 
(1) is a power series with respect to e 7. If 4,— 
logn, the series (1) becomes 


E ale, (2) 


which is called an ordinary Dirichlet series. If 
a, = 1, then (2) is the *Riemann zeta function. 
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Series of the form (2) were introduced by 

P. G. L. Dirichlet in 1839 and utilized in an 
investigation of the problems of fanalytic 
number theory. Later J. Jensen (1884) and 
E. Cohen (1894) extended the variable z to 
complex numbers. The Dirichlet series is not 
only a useful tool in analytic number theory, 
but is also investigated as a generalization of 
power series. The tLaplace transform is the 
generalization of the Dirichlet series to the 
integral, and similar formulas often hold for 
both cases. 


B. Convergence Regions 


If the series (1) converges at z = z,, then it 
converges in the half-plane Rez > Rez,. There- 
fore there is a uniquely determined real num- 
ber S such that (1) converges in Rez » S and 
diverges in Rez « S. If (1) always converges 
(diverges), we put S= —oo (+00). We call S the 
abscissa of convergence (or abscissa of simple 
convergence). Similarly, there is a uniquely 
determined real number A such that (1) con- 
verges absolutely in Rez » A and is not ab- 
solutely convergent in Rez « A. We call A the 
abscissa of absolute convergence. Furthermore, 
there is a uniquely determined real number U 
such that (1) converges uniformly in Rez » U’ 
for every U' >U and does not converge uni- 
formly in Rez » U" for every U” < U. The 
number U is called the abscissa of uniform 
convergence. Among these abscissas we always 
have the relations 


—ooXxSxUzxA-o, 


l 
A-S<limsup =. 


noo n 





The latter was proved by Cohen (1894). The 
numbers S, A, U are determined from a,, 4, by 
means of the formulas 








1 

S-limsup-log| 3 a,|. (3) 
X00 [x] & A, €x 
i 1 

4 =limsup—og( Y D (4) 
xo X [x] S A,«x 
f 1 

U =limsup—log T,, 
xo X 

T,= sup È a,exp(—iA,y)|, (5) 
che +o |[x] S4, <x 








where [ ] is the tGauss symbol. Formulas (3) 
and (4) were proved by T. Kojima (1914) and 
(5) by M. Kunieda (1916). In particular, when 
bm, (logn)/A, — 0, we have 


log|a,,| 





S=U=A=limsup 


n> oo 


(6) 


n 


(O. Szasz, 1922; G. Valiron, 1924). 
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The series (1) converges uniformly in the 
angular domain (z||arg(z —zo)| « « « n2], 
where the vertex z, lies on the line Rez, = S. 
Hence it represents a holomorphic function in 
the domain Rez » S, but it is possible that 
there is no singularity on the line Rez =S. For 
example, if a, — ( — 1)", then the series (2) has 
S =O, but the sum is an tentire function (2! * 
— 1) £(z). Taking the analytic continuation f(z) 
of the series (1), the infimum R of p such that 
f(z) is holomorphic in Rez» p is called the 
abscissa of regularity. It is still possible that 
there is no singularity on the line Rez 2 R. We 
always have R € S, and R is given by 
R= sup im (loglog*| g(x + iy)| +x), 

—wo<y<0 EEN 
2 a,exp(—A,z) 


PE d 


where log* a= max(loga, 0) (C. Tanaka, 1951). 
The infimum B of p such that f(z) is bounded 
in Rez p is called the abscissa of boundedness. 
We always have R« B x A. H Bohr proved 
the following three theorems concerning these 
values: (1) If {/,} are linearly independent over 
the ring of integers, then A — B (1911). (2) If 
(44441 — del ! = O(expe**) for every e — 0, then 

U =B (1913). (3) If Iimsup(log ni, — 0, then S 
= U = A =B (1913). In the final case, the values 
are given by (6). 


C. Properties of Functions Given by Dirichlet 
Series 


The coefficients a, in (1) are given in terms of 
the function f(z) by 


n 1 ctio oz 
Ya | ft) ds, (8) 





v=1 2ni c~ico 


where c > max(S, 0), 4, « o < 4,,,, and the 
integration contour does not pass through 
{4,}. Hoss ån, then the term a, in the sum of 
the left-hand side of (8) is replaced by a,/2 (O. 
Perron, 1908). Furthermore, if S « x, then we 
have 


a9 T 
a, = lim sl ` feceiyexpDAc--iy)]dy. (9) 
T5oT à 
(J. Hadamard, 1908; C. Tanaka, 1952). 

If x a Rez» S, then f(z)—o(|yl) (1y| ^ oo). In 
order to investigate its behavior more pre- 
cisely, Bohr introduced 


log] f(x 4 iy) 

log|y| 
in his thesis (1910) and called it the order over 
Rez — x. The function u(x) is nonnegative, 
monotone decreasing, convex, and continuous 
with respect to x. Bohr later found that there is 


u(x)-limsup 
lyl- t oo 
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a kind of periodicity for the values of f(z) over 
Rez — x; this was the origin of the theory of 
talmost periodic functions. 

As for the zeros of the function f(z), the 
following theorems are known: If f(z) is not 
identically zero, it has only a finite number of 
zeros in x 2 $e, e V* x y eV" for arbitrary 
positive numbers £, M (Perron, 1908). If we 
denote by N(T) the number of zeros in x » S 
+e, T« y « Tt 260log T, then limsup, ,,, N(T)/ 
(log T)? « ó/e (E. Landau, 1927). 

There have been many investigations into 
the connection between the singularities of f(z) 
and the coefficients a,. If the a, are real and 
positive, the point z=S is always a singular- 
ity of f(z). Moreover, if S=0, Rea, 20, and 
lim,,_,,. (cos(arga,))!^ — 1, then z=0 is a sin- 
gularity of f(z) (C. Biggeri, 1939). Furthermore, 
if A, /n— oo, lim inf, Län — An) > 0, then the 
line Rez=S is the natural boundary of f(z) 
(F. Carleson and Landau, 1921; A. Ostrowski, 
1923). If S=0, liminf, Dan — 4,) 24 7 0, then 
there always exist singularities on every inter- 
val on the imaginary axis with the length 2z/q 
(G. Polya, 1923). S. Mandelbrojt (1954, 1963) 
gave some interesting results concerning the 
relations between the singularities of (1) and 
the Fourier transform of an entire function. 

If U = —oo, the function f(z) is an entire 
function. Its *order (in the sense of entire func- 
tion) p is given by 

log* log* M(x) 


p=limsup à 
xo ao |x| 


[f(x +iy)l. 





M(x)= sup 
—o0«y«oo 
There have been many investigations into the 
tJulia direction of f(z) and related topics by 
Mandelbrojt, Valiron, and Tanaka. 


D. Tauberian Theorems 


As in the case of power series, if the series 
Xa, converges to s, then f(4- 0) — s (Abel's 
continuity theorem). The converse is not neces- 
sarily true. The converse theorems, with ad- 
ditional conditions on a, and 4,, are called 
Tauberian theorems, as in the case of power 
series. Many theorems are known about this 
field. The most famous additional conditions 
are lim, ,, 4,a,/(À, — 44-1) = 0 (Landau, 1926) 
and a, =O((A, — Ae HAN (K. Ananda-Rau, 
1928). 

For the summation of Dirichlet series (es- 
pecially *Riesz's method of summation) — 379 
Series R. For Tauberian theorems (especially 
the Wiener-Ikehara-Landau theorem) of the 
ordinary Dirichlet series — 123 Distribution 
of Prime Numbers B. 
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E. Series Related to Dirichlet Series 
A series of the form 


» 


ai z(z - 1)(z+2)...(z+n)’ 


nia, 


z £0, —1, —2,... 
is called a factorial series with the coefficients 
{a,,}. It converges or diverges simultaneously 
with the ordinary Dirichlet series 3 a,,/n7 
except at z — 0 and negative integers. The series 


Y Getae. Y al) 


n! 





is called a binomial coefficient series. It con- 
verges or diverges simultaneously with the 
ordinary Dirichlet series £( — 1)" a, /n* except 
at z=0 and positive integers. 
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A. Definitions [1-4] 
Suppose that a group T acts continuously on a 
tHausdorff space X, that is, for every y eI' and 
x€ X, an element yx of X is assigned in such 
a way that the mapping xx is a homeo- 
morphism of X onto itself and that we have 
35x) 9 4v) x, Lx =x, where 1 is the identity 
element of I. Two points x, ye X are said to 
be I -equivalent if there exists a y eI such 
that y 2 yx. (T-equivalence for subsets of X is 
defined similarly.) 

We consider the following conditions of 
discontinuity of T. (1) For every xe X and any 


460 


infinite sequence {y;} consisting of distinct 
elements of T, the sequence {y;x} has no telus- 
ter point in X. (ii) For every xe X, there exists 
a neighborhood U, such that yU, 1 U, = @ for 
all but finitely many yer. (ii) If x, ye X are 
not I -equivalent, there exist neighborhoods 
U,, U, of x, y, respectively, such that yU, U, 
= (7j for all yeT. (iit) For any compact subset 
M of X, yM N M = Ø for all but finitely many 
yer. 

It is easy to see that (ii) = (i), (11) 4- (ii^) = (iii); 
if, moreover, X is tlocally compact, we also 
have (iii) = (11), (ii). When (i) holds, T is called 
a discontinuous transformation group of X, 
and when (11) holds, I is called a properly 
discontinuous transformation group. In partic- 
ular, when X can be identified with a thomoge- 
neous space G/K of a locally compact group 
G by a compact subgroup K, the conditions 
(i), (ii), and (iti) for a subgroup I of G are all 
equivalent, and they are also equivalent to the 
condition that F is a *discrete subgroup of G. 

For a discontinuous group T acting on X, 
the fstabilizer T, — (yeT |yx x x} of xe X is 
always a finite subgroup. When I’, = {1} for 
all xe X, I is said to be free (or to act freely 
on X). If T, — (Lx L,— {1}, T is said to act 
teffectively on X. A point xe X is called a fixed 
point of F if r, sl, In the following, we 
assume for simplicity that I’ acts effectively on 
X, unless otherwise specified. 

Since I -equivalence is clearly an fequiva- 
lence relation, we can decompose X into I- 
equivalence classes, or I -*orbits. The space of 
all I’-orbits, called the quotient space of X by 
I, is denoted by IX. When T satisfies the 
conditions (it) and (ii), the space T\ X becomes 
a *Hausdorff space with respect to the topol- 
ogy of the quotient space. If, moreover, I is 
free, X is an (unramified) tcovering space of 
IX with the *covering transformation group 
I. (Conversely, a covering transformation 
group is always a free, properly discontinuous 
transformation group.) In general, X may be 
viewed as a covering space of F\X with rami- 
fications, and the ramifying points (in X) are 
nothing but the fixed points of T. 


B. Fundamental Regions 


A complete set of representatives F of T\X in 
X (that is, a subset F of X such that IF = 
X,yFüOF- for yel,yz 1)1s called a fun- 
damental region of I' in X if it further satisfies 
suitable topological or geometrical require- 
ments. Here we assume that F, the closure 

of F, is the closure of its interior F'. (In this 
case, F or F! is sometimes called the funda- 
mental region of T instead of F itself.) Such 

a fundamental region exists if I’ satisfies the 
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conditions (ii), (ii), and the set of fixed points 
is nowhere dense in X (R. Baer, F. W. Levi, 
1931). A fundamental region F is called normal 
if the set {yF} (y eT) is locally finite, that is, if, 
for every xe X, there exists a neighborhood 
U, such that yF N U, — Ø for all but a finite 
number of yer. If X is tconnected and F is 
normal, then T is generated by the set of ye F 
such that yF N F # Ø. Thus it is useful to have 
a fundamental region in order to find a set of 
generators of T and a set of tfundamental 
relations for them. When X has a T-invariant 
*Borel measure u and T is countable, then the 
measure HUE) of F is independent of the choice 
of F. Hence it is legitimate to put u(IVX)— 
u(F); Y is called a discontinuous group of the 
first kind (C. L. Siegel [2]) if T is a discontinu- 
ous transformation group which has a normal 
fundamental region F such that {y|yF NF = 
Ø} is finite and (F) « oo. For instance, if X is 
locally compact and F is compact (e T\ X: 
compact), then I is of the first kind. 

When we are concerned only with the quali- 
tative properties of T, it is sometimes conve- 
nient to relax the conditions for a fundamental 
region, replacing it by a fundamental (open) set 
Q of [, that is, an (open) subset Q of X such 
that TO= X and yQNQ= Ø for all but a finite 
number of yer [5-9]. 


C. The Case of a Riemann Surface 


Let F be a discontinuous group of analytic 
automorphisms of a *Riemann surface X. In 
virtue of *uniformization theory, it is enough, 
in principle, to consider the case where X is 
'simply connected. Thus we have the following 
three cases: 

(1) X 2 CU {oo} (Riemann sphere). I is a 
finite group. Since I can also be considered as 
a group of motions of the sphere, it is either a 
cyclic, *dihedral, or tregular polyhedral group 
[10]. 

(2) X =C (complex plane). F is contained in 
the group of motions of the plane. The sub- 
group consisting of all parallel translations 
contained in T is a *free Abelian group of rank 
y x2. If v=0, then F is a finite cyclic group. 
When v0, T consists of the transformations 
of the following form: 


When v=1,  zoz'z4mo (k,meZ), 


When v=2,  ze*z+m,a,+m,@, 


(k, M; E Z), 


where w, w,, €), are nonzero complex numbers 
with Im(o;/«,)7 0, and £= +1 in general, 
except in special cases when v —2 and œ, /«, = 
C4 (resp. Ca or Ge, where Goss exp(2zi/l)), in 


' point 
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which cases we may put €= Ga (resp. Go or Gel 
For the fundamental regions corresponding to 
these values of e, see Fig. l. [n the cases v= 1 
and 2, the *automorphic functions with respect 
to T are essentially given by exponential func- 
tions and elliptic functions, respectively (— 
134 Elliptic Functions). 





5 





(e) (d) 


Fig. 1 
(a) v22,£—1. (b) o, =1, œ =i, £— i. (c) w — 1, 
O —0,, Costa, (d) [a zu @ — 63, E= Ce. 


(3) X = {|z| « 1] (unit disk) [3, 10, 11]. By a 
*Cayley transformation, the unit disk can be 
transformed to the upper half-plane = {z= 
x+iy|y>0}. Any analytic automorphism of § 
is given by a real tlinear fractional transforma- 
tion (Möbius transformation) z (az + b)(cz + 
d) ! (a,b, c, de R, ad — bc — 1). The totality of 
real linear fractional transformations acts tran- 
sitively on $. Hence § can be identified with 
the thomogeneous space G/K of G=SL(2,R) 
by K =SO(2) (which is the stabilizer of the 
— 1). Hence discontinuous groups I 
of analytic automorphisms of $ are obtained 
as discrete subgroups of G. Actually, every 
element of G defines an analytic automor- 
phism of the whole Riemann sphere, which 
leaves the real axis RU {oo} invariant. For any 
ze CUooj and a sequence {y;} consisting of 
distinct elements of I, a cluster point of the 
sequence (y;z] in CU (oc] is called a limit 
point of IT. When only one or two limit points 
exist, can easily be transformed to one of 
the groups given in (2). Otherwise, the set £ of 
all limit points of T is infinite, and either $ = 
RU {oo} or £ is a perfect, *nowhere dense 
subset of RU {00}. When £ is infinite, T is 
called a Fuchsoid group. 

Since $ has a G-invariant *Riemannian 
metric ds? = y ?(dx? + dy?) (called the *Poin- 
caré metric), by which § becomes a hyperbolic 
plane (‘non-Euclidean plane with negative 
curvature), we can construct a fundamental 
region F of T which is a normal polygon 
bounded by geodesics, that is, the arcs of cir- 
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cles orthogonal to the real axis. A set of gen- 
erators of I and the fundamental relations for 
them are easily obtained by observing the 
correspondence of the equivalent sides of the 
fundamental polygon. (Conversely, starting 
from a normal polygon satisfying a suitable 
condition, one can construct a discontinuous 
group F having F as a fundamental region. In 
this manner, we generally obtain a (nontrivial) 
continuous family of discrete subgroups of G.) 
A Fuchsoid group I is finitely generated if 
and only if the fundamental polygon F has a 
finite number of sides, and in that case T is 
called a Fuchsian group. (More generally, a 
finitely generated discontinuous group of 
linear fractional transformations acting on 
a domain in the complex plane is called a 
Kleinian group.) A Fuchsian group T is of the 
first kind if and only if 9 2 RU 100]; otherwise, 
it is of the second kind. It is also known that a 
discontinuous group F is a Fuchsian group of 
the first kind if and only if i(IA$) « oo [12]. 
For a real point xeRU {œ}, we also denote by 
I, the stabilizer of x (in I). The point x is 
called a (parabolic) cusp of I if T, is a free 
cyclic group generated by a fparabolic trans- 
formation (# +1). Cusps of are represented 
by vertices of the fundamental polygon on the 
real axis. On the other hand, if a fixed point z 
of T lies in $, then the stabilizer T, is always a 
finite cyclic group generated by an 'elliptic 
transformation. Hence such a point z is also 
called an elliptic point of I. For a Fuchsian 
group T of the first kind, let {z,,...,z,} be 
a complete set of representatives of the F- 
equivalence classes of the elliptic points of I” 
(which can also be chosen from among the 
vertices of the fundamental polygon), and let e; 
be the order of I; furthermore, let t be the 
number of the I'-equivalence classes of para- 
bolic cusps of I. Then the quotient space I$ 
can be compactified by adjoining t points at 
infinity, and the resulting space becomes a 
compact Riemann surface R, if we define an 
analytic structure on it in a suitable manner. 
The area u(R,) measured by the Poincaré 
metric is given by the *Gauss-Bonnet formula: 


dxdy 
(Ry) = 2 
F Y 


š 1 
E A ( "hi 
i=1 €i 


where g is the fgenus of the Riemann sur- 
face Rp. It is known that there exists a lower 
bound (— z/21) for u(9Rj) [12]. Automorphic 
functions (or Fuchsian functions) with respect 
to a Fuchsian group T, which are essentially 
the same thing as algebraic functions on the 
Riemann surface Ry, have been objects of 
extensive study since Poincaré (1882). 
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D. Modular Groups [10,13] 


The group 
L-SLQ,Z) 


Jr" 


(or the corresponding group of linear frac- 
tional transformations) is called the (elliptic) 
modular group. The modular group T is a 
Fuchsian group of the first kind acting on $, 
and its fundamental region together with the 
correspondence of the equivalent sides is 
shown in Fig. 2. Fig. 3 illustrates the trans- 
formations under I of the fundamental tri- 
angle, where I is regarded as acting on the 
unit disk. From Fig. 2 we obtain the gen- 





a,b,c,deZ,ad — bc = ] 


1 0 
erators of T (mod( £1) (L ef ) and the 


fundamental relations: 


0; —(0,0,) = exl. 


There are two I'-equivalence classes of elliptic 
points of T, which are represented by ¢4 =i 
and ¢3, with [I;:{ +1} ]2 2; [T5:tt5j]9 3 
and only one I'-equivalence class of parabolic 
cusps, which coincides with QU {co}. The 
corresponding Riemann surface 9i, is analyti- 
cally equivalent to the Riemann sphere. 





Fig. 2 


Fig. 3 


For a positive integer N, the totality T(N) 
of elements in T satisfying the condition 


a by (1 GP dN)f 
MEL mo orms a norma 


subgroup of I, called a principal congruence 
subgroup of level N. (For the case N — 2, see 
Fig. 4.) In general, a subgroup I’ of I contain- 
ing I (N) for some N is called a congruence 
subgroup of I. (It is known that there actually 
exists a subgroup I” of I with a finite index, 
which is not a congruence subgroup.) For 

N z3, —L, éT(N), so that T(N) is effective. 
(For N «1, 2, we have T(N) — ( E 5j.) If 

N Z2, T(N) has no elliptic point. The number 
t(N) of the equivalence classes of cusps of l'(N) 
and the genus g(N) of the corresponding Rie- 
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mann surface Ry are given as follows: 


t(I)21,  £(2)23, 
t(N)—(0/2N)[E:T(N)] | (N23) 
g(1)=g(2)=0, 


gq(N)- I--(N —6)24N)[T:T(N)] (N23) 


where [T:T(N)] 2 N? IT, (1 — t/p?). Auto- 
morphic functions with respect to T(N) are 
called *modular functions of level N. 


sos 


GN 
t 
be 
y 
C 





Fig. 4 
A fundamental region of I'(2) that consists of six 
fundamental regions of I'(1). 


E. The Case of Many Variables 


Up to the present time, discontinuous groups 
I and the corresponding automorphic func- 
tions have been studied only in the following 
cases: (2) X = C", T = Z?" (the free Abelian 
group of rank 2n, consisting of parallel trans- 
lations) [14] (— 3 Abelian Varieties); (3') X is 
a bounded domain in C" and T is a discon- 
tinuous group of analytic automorphisms of 
X. (In this case, conditions (i), (ii), (i11) are 
equivalent.) 

In the case (3), the group (X) of all (com- 
plex) analytic automorphisms of X, endowed 
with its natural (tcompact-open) topology, 
becomes a tLie group, which has F as a 
discrete subgroup. When T\X is compact, 
it is known by the theory of automorphic 
functions (or by a theorem of Kodaira) that 
IX becomes a tprojective variety, which is a 
*minimal model [3, 14]. In particular, when X 
is a tsymmetric bounded domain, Le. when X 
becomes a tsymmetric Riemannian space with 
respect to its Bergman metric, the connected 
component G of the identity element of G' — 
(X) (which incidentally coincides with that 
of the group I(X) of all tisometries of X) is a 
tsemisimple Lie group of noncompact type 
(i.e., without compact simple factors), and X 
can be identified with the homogeneous space 
of G by a maximal compact subgroup K of G. 
The theory of discontinuous groups of this 
type, initiated by Siegel (especially in the case 
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where X = $, — Sp(n, R)/K, *Siegel’s upper half- 
space; = Sp(n, Z), !Siegel's modular group of 
degree n), O. Blumenthal, H. Braun, and L.-K. 
Hua, and continued by those in the German 
school such as M. Koecher, H. Maass, and 
others, has undergone substantial develop- 
ment in recent years under the influence of the 
theory of algebraic groups [5, 8, 14-17] (— 32 
Automorphic Functions). 

On the other hand, for a symmetric Rieman- 
nian space X of negative curvature, the group 
of isometries G=1(X) is a ‘semisimple Lie 
group of noncompact type with a finite num- 
ber of connected components and with a finite 
center, and X can be identified with the homo- 
geneous space of G by a maximal compact 
subgroup. Therefore the study of discontinu- 
ous groups of isometries of X can be reduced 
to that of discrete subgroups of a Lie group 
G of this type. A typical example is the case 
where X is the space of all real positive def- 
inite symmetric matrices of degree n with 
determinant 1; this space can be identified with 
the quotient space SL(n, R)/SO(n) (Ae SL(n, R) 
acts on X by X3S-'ASA). The unimodular 
group l'2 SL(n, Z) is a discontinuous group of 
the first kind acting on this space X, and a 
method of constructing a fundamental region 
of in X is provided by the Minkowski reduc- 
tion theory [6, 7]. 


F. Discrete Subgroups of a Semisimple Lie 
Group 


Two subgroups T, I’ of a group G are called 
commensurable if I NT” is of finite index in 
both I and I. For a real tlinear algebraic 
group G c GL(n, R) defined over Q, a subgroup 
I commensurable with G} = GN GL({n, Z) is 
called an arithmetic subgroup (in the original 
sense) of G (examples: SL(n, Z), Sp(n, Z)). An 
arithmetic subgroup F is always discrete, and 
when G is semisimple, the quotient space ING 
is of finite volume (u(I VG) « oo) with respect 
to an invariant measure u. Moreover, F\G is 
compact if and only if G is *Q-compact (or tQ- 
anisotropic), that is, if Gg or Gz consists of 
only *semisimple elements (A. Borel, Harish- 
Chandra, G. D. Mostow, and T. Tamagawa 
[6. 7] the same results remain true if G is 
*Zariski connected and has no ‘character 
defined over Q. The proofs of these facts (and 
the compactification of the quotient space 
TAX for the noncompact case) depend on a 
construction of fundamental open sets that 
generalizes the reduction theory of Minkowski 
and Siegel [5, 8, 15-18]. 

For a connected semisimple Lie group G 
of noncompact type and a discrete subgroup 
I with u(I AG) « oo, the following density 
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theorem holds (Borel, Ann Math., (2) 72 (1960)): 


(1) For any linear representation p of G, the 
linear closure of p(I) coincides with that of 
p(G); (1) if G is algebraic, I is tZariski dense 
in G. Furthermore, suppose that G is a direct 
product of simple groups G; and the center of 
G is finite; F is called irreducible if its projec- 
tion on any (proper) partial product of {G;} is 
not dicrete. For instance, if G is a *Q-simple 
algebraic group then T = G; is irreducible. In 
general, there exists a partition of the set of 
indices {i} such that F is commensurable with 
a direct product of irreducible discrete sub- 
groups of the partial products corresponding 
to this partition, and these irreducible compo- 
nents are unique up to commensurability. 

The method of constructing a discrete sub- 
group F of G=SL(2,R) in a geometric manner 
using the upper half-plane can be generalized 
to some extent to the construction of discrete 
subgroups of certain groups using hyperbolic 
spaces of low dimensions (E. B. Vinberg). 
Except for these few cases, today it is known 
that a discrete subgroup T of a semisimple Lie 
group G (of R-rank 22) with u(IG) « oo is 
arithmetic in a certain sense (— Section G). 
This implies there are only very few discrete 
subgroups for a semisimple Lie group of 
higher rank. Actually a number of facts sug- 
gesting this result were already known in the 
1960s. First, the only subgroups of SL(n, Z) 
(nz 3), Sp(n, Z) (n> 2) with finite index are 
congruence subgroups (H. Bass, M. Lazard, 
and J.-P. Serre; this result has been generalized 
to the case of an arbitrary *Chevalley group 
over an algebraic number field by C. C. Moore 
and H Matsumoto). Second, Gz = SL(n, Z), 
Sp(n, Z) are maximal in G [8]. Finally, it is 
known (rigidity theorem) that if a connected 
semisimple Lie group G with a finite center 
does not contain a simple factor which is 
locally isomorphic to SL(2, R), then any dis- 
crete subgroup T of G with compact quotient 
T\G has no nontrivial ‘deformation (i.e., all 
deformations are obtained from inner auto- 
morphisms of G) (A. Selberg, E. Calabi and E. 
Vesentini, and A. Weil [19]). This last result 
amounts to the vanishing of the cohomology 
group H'(T, X, ad), and in this connection an 
extensive study has been made by Y. Mat- 
sushima, S. Murakami, G. Shimura, and K. G. 
Raguanathan to determine the "Ber numbers 
of IX, and more generally the cohomology 
groups of the type H(T, X, p) with an arbi- 
trary representation p of G. These cohomol- 
ogy groups are closely related to automor- 
phic forms with respect to I’ [8,20].) For a 
*nilpotent or tsolvable Lie group, a general 
method of constructing discrete subgroups is 
known; see, for example, M. Saito, Amer. J. 
Math., 83 (1961).) 
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G. Rigidity and Arithmeticity 


A discrete subgroup T of a Lie group G with 
B(ING) « œ is usually called a lattice of G. A 
lattice I` of G is said to be uniform if IG is 
compact. By a theorem of Borel and Harish- 
Chandra [7], an arithmetic subgroup of a real 
linear group (defined over Q) is a lattice. Using 
this result, Borel showed further, by a con- 
structive method, that any semisimple Lie 
group has a lattice, especially a uniform one 
(Borel, Topology, 2 (1963)). 

For a long time there were no known ex- 
amples of nonarithmetic irreducible lattices 
in semisimple Lie groups other than those 
locally isomorphic to SL ,(R). This naturally 
led to Selberg's conjecture that any irreducible 
(nonuniform) lattice in a semisimple Lie group 
G not locally isomorphic to SL,(R) is arith- 
metic (Selberg, International Colloquium on 
Function Theory, Bombay, 1960). The con- 
jecture seemed to be well-grounded by the 
rigidity theorem of Weil and Selberg [19]. 

However, in 1966—1967, V. S. Makarov 
and Vinberg constructed nonarithmetic non- 
uniform lattices in S0(n,1) (n23,4, 5) by a 
geometric method; the lattices are generated 
by reflections [21]. Thus rigidity and arith- 
meticity do not necessarily coincide, and the 
conjecture should be considered under stronger 
conditions. 

As for rigidity of uniform lattices, Mostow 
established in 1970 the following strong rigid- 
ity theorem [22]: If G, G' are semisimple Lie 
groups with trivial center and without com- 
pact factors, and are not locally isomorphic 
to SL,(R), and if I, I" are irreducible uni- 
form lattices, then any isomorphism OI 
I" extends to an analytic isomorphism 6: G> 
G' (namely, OUT — 0). The previous rigidity 
theorem is now implied by Mostow's. If X isa 
simply connected symmetric space, then a 
tlocally symmetric space M covered by X is 
expressed as a quotient of X by a fixed-point- 
free properly discontinuous group I in the 
group F(X) of total isometries (when the Lie 
algebra of I(X) does not have a compact fac- 
tor, this condition for I is equivalent to saying 
that it 1s a torsion-free discrete subgroup of the 
semisimple Lie group ICH and the funda- 
mental group z,(M) of M IX ts isomorphic 
to T. The strong rigidity theorem implies that 
compact locally symmetric spaces M, and 
M, (of higher dimensions) are isomorphic as 
Riemannian spaces if and only if 1, (M,) and 
71,(M;) are isomorphic as abstract groups. 

On the other hand, in 1973, G. A. Margulis 
proved the arithmeticity of irreducible non- 
uniform lattices in semisimple real linear 
groups of R-rank greater than 1 (Russian 
Math. Surveys, 29 (1974) (original in Russian, 
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1974); Functional Anal. Appl. 9 (1975) (original 
in Russian, 1975). M. S. Raghunathan also 
proved independently the same fact under 

a slightly stronger condition. Their results, to- 
gether with the rigidity of nonuniform lattices 
in (higher-dimensional) semisimple Lie groups 
of R-rank 1 established by H. Garland, Rag- 
hunathan, and G. Prasad (Inventiones Math., 
21 (1973)), imply that the strong rigidity theo- 
rem holds similarly for nonuniform lattices. 

The results of Margulis and Raghunathan 
show that the Selberg conjecture in the orig- 
inal sense is affirmative for the case of R-rank 
greater than 1. But neither argument is appli- 
cable to uniform lattices, for they depend deeply 
on the fact (proved by D. A. Kazdan and 
Margulis) that a nonuniform lattice contains a 
nonidentity unipotent element. Previously, in a 
lecture at the international congress of mathe- 
maticians at Moscow, 1966, I. I. Pyatetskii- 
Shapiro generalized the definition of arith- 
meticity and suggested that arithmeticity of 
lattices should be investigated without the 
distinction of whether they are uniform or 
nonuniform. His definition is equivalent to the 
following [9, 24]: For a connected semisimple 
algebraic group G defined over R, a subgroup 
T c G= Gr is an arithmetic subgroup (of G) 
if there is an algebraic group H defined over 
Q and a surjective homomorphism o: H5 
Ad G defined over R such that the Lie group 
(Ker o)g is compact and that 9(H7) and AdT 
are commensurable. The uniform lattice in G 
that is constructed by the method of Borel is 
arithmetic in this sense. In 1974, Margulis 
finally established the following arithmeticity 
theorem [23, 24]: If the R-rank of G is not 
less than 2, an irreducible lattice T in G is an 
arithmetic subgroup of G (even if it is uni- 
form). In the same lecture, Pyatetskii-Shapiro 
also extended the Selberg conjecture to such 
"semisimple Lie groups" as those containing 
p-adic Lie groups as factors. Margulis proved 
this Pyatetskii-Shapiro conjecture affirmatively 
by showing that an analog of the strong rigid- 
ity theorem holds for such groups. 

As for semisimple Lie groups of R-rank 1, 
besides the lattices constructed by Makarov 
and Vinberg there are only the few examples 
of nonarithmetic lattices in SU(2, 1) presented 
by Mostow (Proc. Nat. Acad. Sci. US, 75 
(1978); Pacific J. Math., 86 (1980)). The prob- 
lem of arithmeticity still remains open for 
groups locally isomorphic to SO(n, 1) (nz 6), 
SU (n, 1) (nz 3), Sp(n, 1), or F4. 


H. Geometric Discontinuous Groups [1,25] 


The study of discontinuous groups T acting on 
a Euclidean or projective space X as a trans- 
formation group of a given structure is a class- 
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ical problem. All possibilities for such T have 
been enumerated in low-dimensional cases. 
For instance, there are 230 kinds of discontinu- 
ous groups of tEuclidean motions acting on 
3-dimensional Euclidean space without fixed 
subspaces, which are classified into 32 tcrystal 
classes (A. Schónflies and E. S. Fedorov, 1891— 
1892; — 92 Crystallographic Groups). All 
discontinuous groups of a Euclidean space 
generated by ‘reflections have also been enu- 
merated (H. S. M. Coxeter, 1934 [25, 26]). 


I. Kleinian Groups 


The last decade has seen considerable research 
on (finitely generated) Kletnian groups. This 
research is closely related to the theory of 
tquasiconformal mappings and the *moduli of 
Riemann surfaces. 

Making use of Eichler cohomology and 
tpotentials, L. V. Ahlfors established his finite- 
ness theorem and L. Bers his area theorem. 
Bers and B. Maskit investigated the bound- 
aries of *Teichmiiller spaces and discovered 
Kleinian groups with the property that the 
complement of the set £ of limit points is 
connected and simply connected. 

Numerous mathematicians have subse- 
quently discussed the classification, defor- 
mation, and stability properties of the set Q, 
uniformization and deformation of Riemann 
surfaces with or without nodes, and other 
geometric properties. In their discussions, the 
theory of quasiconformal mappings has played 
an important role. The discontinuous groups 
of motions of hyperbolic 3-space have also 
been studied. 
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A. General Remarks 


Given a real number x, we denote by z(x) the 
number of primes not exceeding x. A. M. 

Legendre (1808) obtained empirically the for- 
mula z(x) « x/(log x — B) for some constant B, 
and C. F. Gauss (1849) obtained the formula 


I du 
x)= , 
,logu 


assuming the average density of primes to be 
1/log x. The Bertrand conjecture, which asserts 
the existence of at least one prime between x 
and 2x, was proved by P. L. Chebyshev (1848), 
who introduced the functions 


0(x)— y 3 logp 


m=1 psx 





and 


W(x)= 3 logp 
pu sx 


= 0(x) 0G x) - 6G x) +o... 
(In this section, p represents a prime number.) 
He thereby proved 


Ax + O( / x) « 0(x) < W(x) < (6/5) Ax + O(,/x), 


where A —1og(2!23!3 515307130). G. F. B, 
Riemann (1858) considered the function Ciel 
(where s— + it is a complex variable), ex- 
pressed by the *Dirichlet series Xpan ?, which 
is convergent for g > 1. He found relations 
between the zeros of £(s) (— 450 Zeta Func- 
tions) and z(x). F. Mertens (1874) obtained the 
formulas 


l 
OLP Log e 4-O(1), 
p 


psx 


SS loglogx 1 B-- O 
—=loglo — J 
SE logx 


psx 


1 eo" 1 
1—- }=-——_[1+0 ; 
UG) ois) 


where c is the Euler constant and B is some 
constant. 
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B. Prime Number Theorem 


The prime number theorem 


. m(x)ogx 
lim ———— = 


x>w X 


l, or m(x)—- 





logx’ 


was proved almost simultaneously (1896) by J. 
Hadamard and C. J. de La Vallée-Poussin. 
Without using the theory of tentire functions, 
E. Landau (1908) established the formula 


n(x) = Lix + O(xe"*v/let), 


where 


1-6 x 
Lix= lim (| d SR 
SEA el i+5/ logu 


is the ‘logarithmic integral. It can be shown by 
integration by parts that 





Li X F 1!x 
gx log? x 


(k— Dx oí 3 
log**! x E 


log* x 
For example, by taking x = 107, we get n(x)= 
664,579, Li x = 664,918, and x/log x = 620,417. 

If the Dirichlet series f(s)2 E; a,n ^ ` 
satisfies the condition X, . a, — cx, then its 
abscissa of convergence is 1, and we have 
bm. ,.o(s— 1)f(s) 2 c. The converse is known 
as the tTauberian theorem. If F(s)= Èr a,n^ 
(a, > 0) converges absolutely for ¢>1 and F(s) 
— c/(s — 1) is analytic for o 2 1, then we obtain 
Lin<x4,~ ex (Wiener-Ikehara-Landau theorem, 
1932). Specifically, if we put —C'(s)/C(s) — 

Zu A(n)n 5, then the conditions of the theo- 
rem are satisfied, and we obtain 2. , A(n) — 
yx) x. 

It is easily seen that the prime number 
theorem is equivalent to (x) — x or 0(x) — x. 
The *number-theoretic function A(n) (Man- 
goldt's function) introduced above satisfies 
Lain A(d) =logn. It follows from the *Móbius 





S 


inversion formula that A(n)= Łan u(d)log (n/d ). 


Hence A(n)=log p (n= p") and =0 other- 
wise. Thus we obtain y(x) Än, A(n)— 

O(x). When f(x) 2 0(x) or y(x), it is easy 

to show that [7(/(t)/t?)dt 2log x + O(1) and 
lim mt... f(x)/x <1<limsup,..,, f (x)/x. How- 
ever, it is not easy to prove f(x) — x. To do 

so, introduce the number-theoretic function 
M (n), which satisfies ZA, M (d) - log? n. As 
before, we have M (n) — Za, u(d) log? (n/d); 
hence 


Qi-1logp (n=p',1>1), 
M(n)=< 2logplogqg —(n—p'q",I21,mz 1), 
0 (otherwise). 


Thus we obtain X, M (n) 2 2xlog x + O(x). 
This leads to A. Selberg's well-known formula 
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(1949): 
O(x)logx+ 3 6(x/p)logp-—2xlogx-4- O(x), 


psx 
which enabled him to prove 0(x)~ x. Thus 
he obtained for the first time a proof of the 
prime number theorem that does not use com- 
plex analytic methods. The simple formulas 
Èra u(n)/n 0 and X,..u(n) 2 o(x), obtained 
by H. von Mangoldt (1897), were revealed by 
Landau to have a deep meaning concerning 
the prime number theorem. Let z,(x) denote 
the number of integers not exceeding x that 
can be expressed as the product of r distinct 
primes. In generalizing the prime number 
theorem, Landau (1911) proved that 





1 x(loglogx)"! 
(r — 1)! log x l 


m(x) ~ 


Let us write &(x)= E2; e" "ry. Riemann 
proved that 


d zo 


1 oo 
= «| Bop" erg 
SUS — 





and obtained the well-known functional equa- 
tion for the zeta function (— 450 Zeta Func- 
tions B) 


n PT (s/2) Els) =n 2*3? T (1/2 — s/2)C(1— s). 


This enables us to extend Ziel as a meromor- 
phic function to the whole complex plane. 
Utilizing this extended Ziel and the following 
result of O. Perron on Dirichlet series, we can 
estimate y(x). Let co (4 oo) be the abscissa of 
convergence of F(s) — 2:5, f (n)n ^, and let 
a0, a>, and x>0. If 


atiT x 


lim — F(s) — ds 

T>% 2mi |, im S 

exists, then the limit is equal to ÈX; <, f (n), 
where >’ means that in the summation the last 
term f(x) is replaced by f(x)/2 if x is an in- 
teger. In many cases, F(s) has a pole at s— 1, 
and the principal part of the sum is obtained 
from the residue at s=1, whereas the residual 
part is given by a certain contour integral. To 
estimate w(x), we use —C'(s)/C(s) as F(s); hence 
the problem arises of determining the zeros of 
€(s). Riemann conjectured that all the zeros 
of C(s) in the strip Oo <1 must be situated 
on the vertical line ø= 1/2. If this so-called 
*Riemann hypothesis (— 450 Zeta Func- 
tions) is true, then it follows that z(x) - Li x 

t O(,/x log x). The ultimate validity of 
Riemann’s hypothesis remains in doubt. 
Concerning this, the most recent major 

result is the following formula, obtained 

by I. M. Vinogradov (1958): 2(x)=Lix+ 
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O(xexp( — clog?? x/loglog!?^x)). Without 
using Riemann's hypothesis, J. E. Littlewood 
(1918) proved that 


: n(x)—Lix 
lim sup———— ————— — — > 6, 
ue ( /x /log x)logloglog x 
—lLi 
lim EE <0. 


SCC (x /log x)logloglog x 
If we denote by N(T) the number of zeros of 
C(s) in the domain 0o «1,0 «t « T, then we 
have 


1 
N(T)=— TlogT— T + O(log T). 


1+log2z 

2n 
Let N4(T) denote the number of zeros of C(s) 
on the interval o = 1/2, 0 «t < T. Selberg (1942) 
obtained the impressive result 


No(T)» cTlog T. 


E. C. Titchmarsh (1936) showed that there 
exist 1041 zeros of C(s) in the domain 0 «c « 1, 
0«t«1468 and that all lie on the line o = 1/2. 
Computers have provided further results that 
seem to justify the Riemann hypothesis. For 
example, it has been calculated that there are 
75,000,000 zeros of Ziel in the domain 0 «c « 1, 
0 « t « 32,585,736.4 and that all are simple 
zeros and lie on the line c — 1/2 (R. P. Brent, 
Math. Comp., 33 (1979)). N. Levinson proved 
in 1974 by another method that at least one- 
third of the zeros of the Riemann zeta function 
are on the line c = 1/2. The minimum of the 
modulus of the imaginary part of the zeros 
with c — 1/2 is t— 14.13 .... 


C. Twin Primes 


Let p, be the nth prime. We know from the 
prime number theorem that p, ~nlogn; more 
precisely, p, =nlogn+nloglogn+ O(n). A pair 
of primes differing only by 2 are called twin 
primes. It is still unknown whether there exist 
infinitely many twin primes. There exist in- 
finitely many n satisfying p,,, — p, «clogp, (P. 
Erdós, 1940). Suppose that £(1/2 + it) 2 OU), 
A. E. Ingham (1937) proved that 


Pati 7 Pap, O=(1+4e)/(2+4c)+¢, 


by using the following density theorem related 
to the zeros of £(s): If we denote by N(a, T) 
the number of zeros of £(s) in the domain 
axxoxl,0xt«T(1/2xax]1) then there is 

a positive constant c such that N(a, T)= 
O(T?0 *29172]6g5 T), The Lindelöf hypothe- 
sis asserts that the constant c can be made 
arbitrarily small. If the Riemann hypothesis 
holds, then the Lindelóf hypothesis also holds. 
It is clear that we can substitute 1/6 4- e (2 0) 
for c, e being arbitrarily small. W. Haneke 
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(1963) showed that c can be replaced by 6/37 
+e, and consequently that © «61/98 +e. D. R. 
Heath-Brown and H. Iwaniec (1979) proved 
that © x 11/20 4- by using the *sieve method 
and the zero density theorem of L-functions 
(Section E). R. A. Rankin (1935) proved that 


Pn+1 —Pn> clogp,loglogp,loglogloglog p, 
x (logloglog p,) ? 


holds for infinitely many n. If we denote by 
n(x) the number of primes p € x such that p 
+2 is also a prime, then it has been conjec- 
tured by Hardy and Littlewood (Acta Math., 
44 (1922)) that 


il du 
allen C | —- 


,log?u 


as x oo, where 


1 
C al d ms ei 1:32032 2. 
The numerical evidence provided by the com- 
putation 7,(10°) « 3424506 (Brent, Math. 
Comp., 28 (1974)) tends to indicate the truth of 
this conjecture. At present, 76: 3!?? +1 seems 
to be the largest known pair of twin prime 
numbers (H. C. Williams and C. R. Zarnke, 
Math. Comp., 26 (1972)). 


D. Prime Numbers in Arithmetic Progressions 


Let k be a positive integer, y(n) be a residue 
character modulo k (— 295 Number-Theoretic 
Functions D), and Lis, y) 2 22, x(n)n ? (o> 1) 
be the *Dirichlet L-function. The function 
L(s, y) of s defined by this series can be ex- 
tended to an analytic function in the whole 
complex plane in the same way as the Rie- 
mann zeta function. In particular, when y is 
the principal character, then L(s, y), thus ex- 
tended, is a meromorphic function whose only 
pole is situated at s= 1 and is simple; otherwise 
the function L(s, y) is holomorphic on C. 
Using this function Lis, x) and in connection 
with his research concerning the tclass num- 
bers of quadratic forms, P. G. L. Dirichlet 
(1837) proved that there exist infinitely many 
primes in the arithmetical progression l, l+ k, 
1+2k,..., where l is the initial term and k a 
common difference relatively prime to /. This 
result is called the Dirichlet theorem (or prime 
number theorem for arithmetic progressions). 
Suppose that a runs over all integral ideals 
in a tquadratic number field K of discriminant 
d. Then the tDedekind zeta function £,(s) of K 
is defined by (Na) ? for c» 1. By virture of 
the decomposition law of prime ideals (— 347 
Quadratic Fields C), we have £&(s) — C(s)L(s, y), 
where y(n) — (d/n) is the *Kronecker symbol. 
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Utilizing ¢,(s), we obtain formulas concerning 
the *class number h(d) of the field K. If d — 0, 
then h(d) — (,/d/2log c) L(1, x), where e is the 
*fundamental unit of K. On the other hand, if 
d «0, then h(d)=(w./—d/2n)L(1, y), where w 
denotes the number of the roots of unity 
contained in K. It follows that L(1, y) > 
2log((1 t /5)2)/ A]. Let y be a character 
modulo k induced by a primitive character 

X^. Since we have L(s, y) - Lis y?) E, (1 — 

X9 (p)p ^), it can be shown that L(1, x) zO for 

a real character y. It is easy to prove that 

L(1, x) x0 for a complex character y. These 
statements then lead to the Dirichlet theorem. 
The proof was simplified by H. N. Shapiro 
(1951). Besides Landau's three proofs for 

L(1, x) £0 for real character y (1908), there 

are elegant proofs by T. Estermann (1952), 
Selberg (1949), and others. For a character 

y mod k, we always have L(1, y) 2 O(logk), while 
L(1, x) ! = O(log k) with one possible excep- 
tion, which may occur only if y is a real char- 
acter. Even in this case, we have L(1, y) ! 

= O(k*) (where £>0 is arbitrary, but O depends 
on £). This result was obtained by K. L. Siegel 
(1934) from his study concerning class num- 
bers of imaginary quadratic number fields. His 
proof was simplified by Estermann (1948) and 
S. D. Chowla (1950). The importance of the 
prime number theorem for arithmetic pro- 
gressions was revealed when it was applied to 
the Goldbach problem (— 4 Additive Number 
Theory C). Concerning this problem, the man- 
ner in which the remainder term depends on 
the modulus k became an object of investiga- 
tion. The Page-Siegel-Walfisz theorem is 
convenient to use: Denote by x(x; k, l) the 
number of primes not exceeding x and of the 
form ky +1, where (k, 1)=1. If x > exp(k*) 
(where ¢>0 is arbitrary), then we have 


TRIP Ee 
Ke) 0| — — }. 
(SUD olk) 


Further research on the distribution of zeros 
of L(s, y) is necessary for the study of z(x; k, J) 
when x takes smaller values. If y is a nonprin- 
cipal real character, then L(s, y) may have at 
most one real zero fj, around 1; this is called 
Siegel's zero. Because of this fact, when x is 
small we are unable to obtain any formula to 
indicate the uniform distribution of primes. 
However, we have the following deep result, 
obtained by E. Fogels (1962). For a given 
positive e, there exist coll and ciel such that 
n(x; k, I)» c(e)x/o(k)k*log x, provided that 

x 2k, On the other hand, Titchmarsh (1930), 
using the tsieve method, obtained x(x; k, 1) 

= O(x/q(k)log x) for x z k^». A theorem of this 
type is called the Burn-Titchmarsh theorem (— 
Section E). Fogels's theorem is based on the 
following theorem by Yu. V. Linnik (1947) and 
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K. Prachar (1957), which is an extension of 
Page's theorem (1935): We let ó be the function 
defined by ô= 1 — fj, if L(s, y) has an excep- 
tional real zero D. and 6=c,/log(k(|t|+2)) 
otherwise (where c, is a suitably small num- 
ber). If we denote by £ the real part of any zero 
(7 B1) of L(s, y), then the theorem states that 


P €4 i ( Ce 
log(k(t|3-2) "" Volog(k(t--2)/" 


provided that ólog(k(|t| -- 2) x c,. Linnik 
called this result the Deuring-Heilbronn phe- 
nomenon. Using this theorem and the zero 
density theorem, Linnik (1944) proved that 
p(k, D) «k^, where p(k, 1) is the least prime in 
the arithmetic progression l, l+ k, [-- 2k, ... and 
Lis a constant. We call L Linnik's constant. M. 
Jutila (1977) and Chen Jing-Run (1979) proved 
that L<80 and Lx 17, respectively. 

Let s be positive, bj, z; (1 <j & s) be complex, 
and l, m be real numbers satisfying max |z;| 7 1, 
Is, and mz 0. Under these conditions P. 
Turán (1953) obtained the following funda- 
mental theorem, which ts called the power sum 
theorem: 


B<l 





max |bjz,+...+b,2% 
m<r<li+m 


l n 
lie min |, +... dE 


This theorem is effective in research on the 
distribution of zeros of zeta functions. Based 
on this new method, Turan (1961), S. Knapow- 
ski (1962), and Fogels (1965) reached the re- 
sults cited above. 

Another method of research, considered to 
be a new sieve method, on the distribution of 
primes was introduced by Selberg and A. I. 
Vinogradov. This method was followed by W. 
B. Jurkat and H. E. Richert (1965). Linnik, A. 
Rényi (1950), and E. Bombieri (1965) founded 
still another method, called the large sieve 
method, by which P. T. Batemann, Chowla, 
and P. Erdos studied the value of L(1, y). 


E. Sieve Method 


Let A be a set of integers, and P a set of 
primes. For each pe P, let O, be a set of resi- 
dues mod p, and cp) the number of residues 
belonging to Q,. The sieve method is a device 
for estimating (from above or below) the 
number of integers n belonging to the set 
S(A, P, O,) — in|ne A,nmod p€Q, for pe Pj. 
The combinatorial methods of the Brun, Bus- 
tab, and Richert sieves are interesting and 
efficient but quite complicated. Here, we shall 
briefly describe the Selberg sieve. As an ex- 
ample, denote by S(x; q,1) the number of n 
satisfying n=/ modg, n <x, (n, D) 2 1, where q 
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is a prime not exceeding x,z x x, D— TI 
and (I, q) 2 1; then 


S034.) = 2: en? 
E ^ 


2 
< A ( x ia) 
nek d|(n, D) 

nzimod 


odq 


p<zPs 


where A, — 1 and 4, is arbitrary for d>1. Thus 
the problem reduces to the optimization of 4,. 
Proceeding in this manner, C. Hooley (1967) 
and Y. Motohashi (1975), using analytic 
methods, obtained certain deep results relating 
to the Brun-Titchmarsh theorem. Using the 
large sieve, H. L. Montgomery and R. C. 
Vaughan (1973) expressed this theorem in a 
precise form: z(x, q,1) « 2x/o(q)log(x/q) for all 
q<x. 

Let n,,n,,...,n, be Z natural numbers not 
exceeding N, and Z(p, a) the number of nie 
such that n; a mod p. Rényi (1950) proved 
that 


p-1 Z)? 
ME 5I [zma -2} <2NZ. 
ze AM? aci p 


Set a, —1 for n —n; and a, D otherwise, and 
set S(a)= X, y a, exp(2nina); then 


a 2 
d | 
p 
In view of this simple fact, Bombieri (1965) and 


P. X. Gallagher (1968) extended the problem 
and proved that, in general, 


a 
Y  a,exp (ns n) 
M<n<M+N q 


<(N+20*) P Jo, 


M<n<M+N 


Bech 


SC) Z? 
PEO 


a=1 








2 








qXQ a=l 
H (kel 


Similar results can be obtained for character 
sums Dy cncm+n4nX(n). In this connection 
Montgomery proved that 


S(i M<n<M+N};P,Q,) 


UP 





ae pla P — cp) 


g|P(z) 
where 
P(z)= I] p. 
peP 
pz 


Using these methods, the following estimate 
was obtained by Bombieri: 


n(y, q, 1) 





max max 
zi XX 
loga)? ^7 uaa 


le 
ela) J. logu 


where A is arbitrary and B is a certain func- 


qsx"?( 


«x(log x) ^, 
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tion of A (Vaughan). These methods, known 
collectively as the large sieve, were first di- 
rected toward proving the Rényi theorem 
stating that every sufficiently large even integer 
can be represented as the sum of a prime and 
an almost prime integer (M. B. Barban). After- 
ward, combining Richert's sieve with this 
large sieve, Chen Jing-Run (1973) improved 
this result to a remarkable degree (— 4 Addi- 
tive Number Theory C). Several applications 
of Bombieri's theorem have been demon- 
strated by P. D. T. A. Elliot and H. Halber- 
stam (1966): e.g., the estimation of the num- 
ber of representations of n as p - x? + y? 
(Hooley, Linnik) and the estimation of 

2, «5 d(n— p) (Linnik, B. M. Bredihin). 

Let N(a, T, y) denote the number of zeros of 
L(s, y) in the rectangle x xo <1, |t| T. Com- 
bining the large sieve with new Fourier in- 
tegral techniques and the Turán *power-sum 
method, Gallagher (1970) proved that there 
exists a positive constant c satisfying 


3 N(x T,y)«(QTY" 9, 


xmodQ 


Y X Næ T, yT”. 

q<Qxmodq 
Similar results were also obtained by G. Ha- 
lasz and Montgomery (1969) using another 
method, which was further exploited by Jutila, 
M. H. Huxley, and Iwaniec. In particular, 
Heath-Brown and Iwaniec (1979) deduced that 
n(x+y)—2(x)=cy(logx)! if yz x! 29*s 
Combined with the Deuring-Heilbronn 
phenomenon, the zero density theorem above 
not only establishes the Linnik theory, but 
also yields the following result, due to K. A. 
Rodoskii, T. Tatuzawa, and A. I. Vinogradov: 


1 * du 
n(x; q,1)=—— 
(gq) J2 logu 


l x ,B-1 1 e 
gt | S dut dl x) 
olaq) Ja logu (4) 
if x >exp(logqgloglogq), where E=1 or 0 


according as Siegel's zero f) exists or not, and 
where 








A= Max(log q, (log x)?? (loglog x)"°). 


Throughout these researches, estimates of 
the type 


Y (eia) 


xmodq 
« e(q)(1t| + Zog" (a(t]-- 2)} 


and 
L Ge 
—] it, 
` X 


T 
A 
ymodq J -T 


« p(q)( T + 2))og? q(T + 2) 


4 








4 


dt 
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are of great importance and have been studied 
by A. F. Lavrik (1968), Linnik (1961), Huxley 
(1972), and K. Ramachandra (1975). 


F. The Prime Ideal Theorem in Algebraic 
Number Fields 


In an algebraic number field of finite degree, 
the prime number theorem is replaced by the 
prime ideal theorem (T. Mitsui, 1956, Fogels, 
1962), which is based on the theory of the 
Hecke tL-function (E. Hecke, 1917; Landau, 
1918). Let K be a finite Galois extension over 
an algebraic number field k of finite degree. 
Suppose that p is a prime ideal of k and is not 
ramified in K. The tFrobenius automorphism 
of a prime divisor of p in K determines a 
conjugate class C of the Galois group of K/k. 
Let z(x; C) denote the number of prime ideals 
in k associated with the class C in the above 
sense and whose norm does not exceed x. 
Then we have 


h( €) 
[K:k] 





n(x, C)= Lix+ O(xe ceV/logx), 

where h(C) is the number of elements con- 
tained in C and c is a positive constant de- 
pending on K/k. This is an extension of Che- 
botarev's theorem (E. Artin, 1923). 
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124 (XI.8) 

Distribution of Values of 
Functions of a Complex 
Variable 


A. General Remarks 


Suppose that we are given a function f: AB 
and that the variables z, w take on values in A, 
B, respectively. A value distribution of f(z) is a 
set of points z where f(z) takes on a certain 
value w (called w-points of f(z)). Value distri- 
bution theory is usually concerned with the 
study of value distributions of complex tana- 
lytic functions. Value distribution theory has 
been developed extensively and deeply for the 
case where A is the finite plane |z| « oo or the 
unit disk |z| « 1 and B is the extended complex 
plane |w|< oo, and there are many interest- 
ing results in this case (— 272 Meromorphic 
Functions). Value distributions for analytic 
functions on general domains or on Riemann 
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surfaces or of several complex variables have 
also been studied. 


B. The Case of |z| < R x co 


For a transcendental entire function f(z), 
every value (including oc) is a value of the 
tcluster set of f(z) at the point at infinity 
(*Weierstrass's theorem). This theorem was 
improved in the following way by E. Picard in 
1879: A transcendental entire function f(z) has 
an infinite number of w-points for any finite 
value w except for at most one finite value 
(*Picard's theorem). A value w for which the w- 
points are at most finite is said to be a Picard's 
exceptional value. E. Borel gave a precise form 
of this theorem, taking into consideration the 
order of a function, and G. Julia proved the 
existence of Julia’s directions (= 272 Mero- 
morphic Functions; 429 Transcendental En- 
tire Functions). After other results in value 
distribution theory had been obtained by J. 
Hadamard, G. Valiron, and others, R. Nevan- 
linna published an important work in 1925 in 
which he established the so-called Nevanlinna 
theory of meromorphic functions in |z|<R< 
oo, unifying results obtained up until that 
time, and which became the starting point of 
the subsequent value distribution theory (— 
272 Meromorphic Functions). T. Shimizu and 
L. V. Ahlfors gave a geometric meaning to the 
Nevanlinna tcharacteristic function T(r). Ahl- 
fors established the theory of covering surfaces 
by metricotopological methods in 1935, and 
applied it to obtain the Nevanlinna theory 
and many other results on meromorphic func- 
tions. This theory revealed that the topological 
meaning of the number 2 of Picard's excep- 
tional values is closely related to the *Euler 
characteristic 2 of the sphere. H. Selberg es- 
tablished the value distribution theory of 
talgebroidal functions and gave a precise form 
of G. Rémoundos's theorem, which corre- 
sponds to Picard's theorem for algebroidal 
functions. 

Moreover, as an extension of the value 
distribution theory of meromorphic functions, 
there is the theory of holomorphic curves or of 
systems of entire functions. Let fo, f,, ... f, 
(n> 1) be entire functions without common 
zeros for all and for which fo: f, :...:f, is not 
constant. Put f —( fo, fis ..., fa). This is a re- 
duced representation of a nonconstant holo- 
morphic curve f: C P"(C). For a =(&o, a, 

,a,)e C"*! — {0}, considering the zeros of 


(x, f ) — Xo fo t 6 fi Le (40), 


we can extend Picard’s theorem and the Ne- 
vanlinna theory for meromorphic functions to 
holomorphic curves. 
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We pe the characteristic function 
T(r)(— Tir, f)) as 


r 


bd 


1 2n 
EI log| f |d0 


0 





fo 


DES (ol? E LAT. t dé we 


where rọ is a fixed positive number and Art, 
= A(r) — Aral, Using a bivector pu -( 
— fifi) of f and f' «(fo fi. .... Ja), we have 


another representation of See 


EEN p [our t dt do. 
T jr, 5 JoJo Ifl 


f is transcendental when lim T(r)/logr = oo 
and the number 





,C)e C"! cofo -- c, fit 
teQf.í— 


is the degeneracy index of f. It holds that 0< 
A<n—1. As an extension of Picard's theorem, 
J. Dufresnoy stated that, for transcendental 
holomorphic curves f and among « in gen- 
eral position, the zeros of (a, f) are infinite 
except for at most n c A- 1. X (C C"*! — (0]) 
is in general position when any p( x n4 1) vec- 
tors in X are linearly independent. In connec- 
tion with this result, there are many Picard- 
type theorems, and when 2>0, there are some 
particular results for holomorphic curves. H. 
Cartan, H. and J. Weyl, and Ahlfors extended 
the Nevanlinna theory to holomorphic curves 
as follows. For a —(x9, 0, ...,0,)e C"*! — {0}, 
we put 


Ass däm lc, c,, — 


|x| 2 the length of a, 
Hof Ho AWS) (a f) x0. 


and define the proximity function 


1 (?7 1 
asaz Dër 


and the counting function 





1 2n r 
EEN logs. f)|d01 
T jo To 
Then we have the first main theorem: For any 
a for which (a, f ) #0, 


T(r) 4 m(ro, o) — N (r, a) + m(r, ak 


and the second main theorem: When 4—0, for 
any 9,,05, ...,&, in general position, 


(q—n—t!) T(r) « » N(r, aj) *- S(r), 


where S(r) — O(logrT(r)) except for r in a set e 
of finite logarithmic measure, (De, fedlogr < 
oo). For 47 0, it is conjectured that “qg —n— 
1” can be changed into “q—n—A—1.” This 

is unsolved except for some special cases. 


0j 
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Some results show relations between excep- 
tional values and the order as in the case of 
meromorphic functions. Nevanlinna theory 
has been extended to the associated curves f? 
(p=1,2,...,m f! 2 f) for 4-0, and there have 
been attempts to extend this theory of holo- 
morphic curves even further by generalizing 
domains or ranges. 


C. The Case of General Domains 


The value distributions of meromorphic func- 
tions defined in a general domain or an open 
Riemann surface depend on the function- 
theoretic "size" of the set of singularities (— 
169 Function-Theoretic Null Sets) or the type 
of the Riemann surface (— 367 Riemann Sur- 
faces). For instance, we have the following 
theorem of Picard type: A single-valued mero- 
morphic function with a set of singularities of 
tlogarithmic capacity zero takes on every 
value infinitely often in any neighborhood 
of each singularity except for at most an F,- 
set of values of logarithmic capacity zero 
(Hállstróm-Kametani theorem). For the study 
of value distribution at general singularities, it 
is useful to investigate cluster sets (— 62 Clus- 
ter Sets). In order to generalize the Nevan- 
linna theory to the case of general domains or 
Riemann surfaces, we take their exhaustions 
depending on a real parameter r and define the 
*counting function and so on (— 272 Meromor- 
phic Functions). G. J. Hállstróm established 
the value distribution theory of meromor- 
phic functions defined in the complementary 
domain D of a compact set E of logarithmic 
capacity zero by taking D, = {z|v(z)<r} as the 
exhaustion of D, where v(z) denotes the Evans 
potential for E, Le, v(z) is the potential corre- 
sponding to a positive mass distribution u 
on E of total mass 1 which tends to +00 as 
z tends to any point of E. Thus the number 
of Picard's exceptional values is not greater 
than 2 4- č, where č —limsup, ,,, F(r)/T(r) 
with —n(r)=the Euler characteristic of D, and 
F(r) — f; n(r)dr (Hállstróm-Tsuji theorem). J. 
Tamura, L. Sario, and others studied the value 
distributions of meromorphic functions de- 
fined on Riemann surfaces. Sario succeeded in 
extending the Nevanlinna theory to analytic 
mappings of a Riemann surface R into another 
Riemann surface G by introducing a suitable 
metric in G to define the tproximity function. 
In the Nevanlinna theory on general do- 
mains, we must sometimes impose condi- 
tions that the functions must satisfy in order 
to obtain a concrete conclusion, for instance, 
the condition that £ be finite in the Hállstróm- 
Tsuji theorem. But it is also important to 
determine the domains where some result can 
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be obtained without imposing any additional 
conditions on the functions. The Hällström- 
Kametani theorem is an example. Although 
the set of exceptional values in this theorem 
cannot be replaced generally by a smaller set 
than an F,-set of logarithmic capacity zero, we 
have the following theorem: Let E be a tCan- 
tor set with successive ratios č, = 21,/l, .,, 
where l, denotes the length of the segments 
that remain after repeating n times the process 
of deleting an open segment from the middle 
of another segment. Then any single-valued 
meromorphic function with E as the set of 
singularities has at most 3 Picard's excep- 
tional values if lim, ,,, £, — 0 and at most 2 if 
€,+1 =0(&?) (L. Carleson, K. Matsumoto). By 
weakening the conditions on E. one can get 
several improvements of this result. 
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125 (XII.7) 
Distributions and 
Hyperfunctions 


A. History 


The advancement of analysis, particularly in 
the field of partial differential equations and 
harmonic analysis, necessitated the generali- 
zation of the notion of functions and deriva- 
tives. For instance, "functions" such as Dirac’s 
*delta function and tHeaviside’s function were 
used by physicists and engineering scientists 
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even though the former is not a function and 
the latter is not a differentiable function in the 
classical sense. The finite parts of divergent 
integrals, used by J. Hadamard to investigate 
the fundamental solutions of wave equations 
(1932), and the Riemann-Liouville integrals 
due to M. Riesz (1938) were the notions that 
eventually led to the theory of generalized 
functions. The rudiments of the idea of distri- 
bution, however, can also be found in other 
earlier works. S. Bochner (1932) and T. Carle- 
man (1944) discussed the Fourier transforms of 
locally integrable functions on the reals with 
growth as large as a polynomial. S. L. Sobolev 
introduced the notion of generalized derivative 
and also of generalized solution of differential 
equations by means of integration by parts in 
studying the Cauchy problem for hyperbolic 
equations (1936); J. Leray (1934), K. O. Fried- 
richs (1939), and C. B. Morrey, Jr. (1940) also 
discussed generalized derivatives. On the other 
hand, L. Fantappié (1943) investigated analytic 
functionals that are elements of the dual of the 
space of analytic functions and applied them 
to the theory of partial differential equations. 
Based on a systematic generalization of these 
investigations, L. Schwartz [1] established the 
theory of distributions (1945), which not only 
provided a mathematical foundation for a 
number of formal methods that had been used 
in mathematical physics but also gave new and 
powerful tools for the theories of differential 
equations (L. Hórmander [2]) and *Fourier 
transforms. Furthermore, it has been applied 
to trepresentation theory of locally compact 
groups, the theory of probability, and the 
theory of manifolds (G. de Rham [3]). As will 
be seen in Section B, distributions are defined 
as continuous 'functionals on a certain func- 
tion space, and it is essential to select a func- 
tion space appropriate to the problems con- 
cerned. For this reason, I. M. Gel'fand and G. 
E. Shilov defined various classes of general- 
ized functions [4] as a natural extension of 
Schwartz's theory. In this direction there are 
also various classes of ultradistributions intro- 
duced by C. Roumieu [5] and A Beurling. 
Another but completely different approach 
was given by M. Sato [6, 15] in the form of the 
theory of hyperfunctions (1958). Intuitively a 
hyperfunction is the sum of (ideal) boundary 
values of holomorphic functions at the real 
axis. We obtain in this way an ultimate class 
of localizable generalized functions. Sato em- 
ployed the relative (or local) cohomology 
theory to define the “boundary values" and to 
prove their localizing property. Such an alge- 
braic approach to generalized functions led 
naturally to an algebraic treatment of systems 
of linear partial differential operators [7 ], 
called algebraic analysis by comparison with 
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algebraic geometry. Independently, Hórman- 
der established a similar theory for distri- 
butions using Fourier analysis (— 274 Micro- 
local Analysis). 


B. Definition of Distributions 


Let o(x) be a complex-valued function of x = 
(x,,...,X,) defined on an open set Q in the n- 
dimensional Euclidean space R". By the sup- 
port (or carrier) of o, denoted by supp o, we 
mean the *closure of (x|9(x) 40} in Q. For 
multi-indices p, i.e., n-tuples p=(p,,..., p,) of 
nonnegative integers, we set |p| 2 p, +... - p,. 
For a function ọ(x) of class Cl. we write 


QI? p(x) 


D? —— — —. 
Ox?! ... Oxf" 


(1) 
In particular, D- = ø. To indicate the 
variables x we shall adopt the notation DÉ 

P(Q) denotes the set of all complex-valued 
functions of class C* defined on Q with tcom- 
pact support, which is a ‘linear space under 
the usual addition and scalar multiplication in 
function spaces. A sequence (9, in Z(Q) is 
said to converge to 0 (the function identically 
equal to zero) as m oo, denoted by Pm = O, if 
there exists a compact set E in Q such that E 
contains supp Pm for every m, and for every p, 
{DP Pm} converges uniformly to 0 as m— oc. 

We sometimes abbreviate AQ) either as Y 
or, when we want to indicate the variables x, 
as 2. 

A complex-valued "near functional T de- 
fined on 2(Q) is called a distribution on Q if it 
is continuous on Z(Q), i.e., Pm = 0 implies 
T(q,,) ^0. The set of all distributions on Q is 
denoted by 2'(Q) (or &’). For distributions S 
and T, the sum S 4- T and scalar multiple «T 
are defined by (S + T)(q) 2 S(q) 4- T(q) and 
(o T) (q) — o Tel, respectively, which are also 
distributions. Hence Z'(Q) is a flinear space. 


C. Examples of Distributions 


(1) Let f(x) be a tmeasurable and flocally 
integrable function on Q. Then a distribution 
T, is defined by T,(~) — f o(x)f(x) dx. Here dx 
is the *Lebesgue measure and the domain of 
integration is Q (in fact, supp o). If T, = T}, 
then f(x)=g(x) *almost everywhere. Thus we 
can identify the distribution T, with the cor- 
responding function f, and sometimes T, will 
be denoted simply by f. 

(2) Let u be a *Radon measure on Q, i.e., 
a complex-valued tregular completely ad- 
ditive set function on the ‘Borel sets in Q. 
Then a distribution T, is defined by T/(9) — 
f o(x)u(dx). Example (1) is a special case where 
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pdx) = f(x)dx. Two Radon measures u and v 
coincide if T, = T,. If the measure is concen- 
trated at the origin, then T,(9) —- co(0), denoted 
by có, and à is called Dirac's distribution. 
Sometimes 6 is denoted by 6,, A, or ó(x) to 
indicate that it operates on functions of x. A 
distribution Te 2'(Q) is a measure if and only 
if T(o,,) 50 whenever the supports supp Pm of 
a sequence ¢,,€ YQ) are in a fixed compact 
set in Q and g,, converges uniformly to zero. 
À distribution T is called a positive distribu- 
tion if T(o) 20 for any oe 2(Q) that has non- 
negative values at all points x in Q. Every 
positive distribution is equal to a T, corre- 
sponding to a positive measure p. 

(3) For given p the distribution ó'? is defined 
by à? (o) -( —1)? D? o(0) . 900-0 A 

(4) Let g(x) be a function defined but not 
integrable on an interval (a, b), and assume 
that for any positive number e it is integrable 
on (a +e, b). Moreover, assume that 


qix)- Y. A xa) "cl 


where Re 4, > 1, A, is not an integer, and h(x) is 
integrable on (a, b). Then 


b 
| g(x) dx — Y, A, — 1) ~'e! >= F(e) 
ate 

tends to a finite value as £0. This limit is 
called the finite part (in French partie finie) of 
the integral (^g(x)dx, denoted by Pf f? g(x) dx: 


b 
zl g(x)dx 


a 


b 


--—y Ai b-ais | h(x)dx. 


v A,—1 a 





In the same way, for every pe A(R’), T(o)— 
Pf (2g(x)o(x)dx is defined, and T is a distribu- 
tion which will be denoted by Pfg, and which 
is frequently called a pseudofunction. This 
notion is extended to the n-dimensional case 
and used to express the fundamental solutions 
(^ Section EE; Appendix A, Table 15.V) of 
thyperbolic partial differential equations. 


D. Localization of Distributions 


Let (H be an open subset of Q. Every function 
pe BQ’) can be extended to a function pe 
AQ) by setting o(x) 20 for x EQ’. Thus if 

Te 2'(Q), then a distribution Se Z'(QY) is de- 
fined by S(o) = T(@) for every pe FQ), and S 
is called the restriction of T to €". Two distri- 
butions T and S are said to be equal in Q' if 
their restrictions to Q are equal. If every point 
in Q has a neighborhood where T= S, then 

T —S. In this sense a distribution is determined 
completely by its local data, although the 
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notion of pointwise value, having exact mean- 
ing for functions, has no meaning for distri- 
butions. Moreover, we can construct a distri- 
bution with given local data in the following 
way: Suppose that an topen covering {Q,} of Q 
and a set of distributions T;e Z'(Q are given 
and that T; and T, are equal in QN Q, for any 
j and k. Then there exists a unique distribu- 
tion Te Z'(0) such that T= T; in Q; for each j. 
(Define T(o) = T(x) for a *partition of unity 
{a;} subordinate to {Q,;}). Namely, the distri- 
butions Z' form a tsheaf of linear spaces over 
R”. It is not fflabby but is soft, i.e., for any 
distribution T defined on a neighborhood of a 
closed set F in R", there is a distribution on R" 
that coincides with T on a neighborhood of F. 

For each distribution Te Z'(Q) there is a 
largest open subset (H of Q on which T van- 
ishes. Its complement is called the support (or 
carrier) of T and is denoted by supp T. The 
support of the distribution given in example (1) 
coincides with that of the function f. The sup- 
port of 6” is the origin. 


E. Derivatives of Distributions 


In example (1) in Section C, if f(x) is a func- 
tion of class C*, then by integration by parts 
Tye f(9) 2 ( 1)" TD” 9) for |p|  k. The right- 
hand side defines a distribution even if f is not 
differentiable. In view of this example, we 
define derivatives D? T of any distribution T by 


(D'T)(o)-(-D" T(D"9), PEZ, (2) 


Any distribution is infinitely differentiable. 
Any locally integrable function is infinitely 
differentiable in the sense of distributions, and 
its derivatives D? T, are called distribution 
derivatives (or generalized derivatives or weak 
derivatives). 

For example: (1) D^ó = ô”; (2) (1-dimen- 
sional case) dx, /dx — 1, dl/dx =ô, where x, = 
max(x,0) and 1(x) is Heaviside's function, 
which is equal to 1 for x >0 and to 0 for x «0. 


F. The Operation of Linear Differential 
Operators on Distributions 


Let T be a distribution and g(x) a C”-function 
on Q. Then we can define the product xT by 
(xT)(@)= T(a@), where the right-hand side is a 
continuous linear functional on AQ) since the 
multiplication o — aq is continuous in Z(Q). 
We define the dual operator (or conjugate 
operator) P'(x, D) of a linear differential opera- 
tor P(x, D) 2 Za,(x)D" by 


P(x, D)o =} (—1)"D?(a,(x) g(x), (3) 


where the a,(x) are C*-functions on Q. Com- 
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bining differentiation of distributions, multi- 
plication by functions, and addition, we can 
apply partial differential operators to distribu- 
tions, and we have (P(x, D) T)(q) = T(P'(x, D)q). 

Linear differential operators commute with 
restriction mappings. In particular, we have 
supp P(x, D)T c supp T. In other words, linear 
differential operators are *sheaf homomor- 
phisms on the sheaf Z' of distributions over Q. 
On the other hand, every continuous sheaf 
homomorphism on 2 is a linear differential 
operator whose order is finite on each com- 
pact set in Q. Even when no continuity is 
assumed, a sheaf homomorphism on 2' is a 
linear differential operator except on a discrete 
subset of Q (J. Peetre). 


G. The Topology of 2 and Z' 


Let Ø — K,€ K;€... bea sequence of com- 

pact sets in O that exhausts Q. For every 

nondecreasing sequence of positive numbers 

{a} = {ap,42,... } and nondecreasing sequence 

of nonnegative integers {k} = (ko, k,,...] we 

set 

Pia}, i) (0) - sup sup supaj| D^ p(x)| (4) 
j20 |pl&k; x¢K; 

for oe 2(Q). We define a flocally convex topol- 

ogy of the space Z(Q) by taking the totality 

of the tseminorms ou as a fundamental 

system of continuous seminorms. Then &(Q) is 

a ‘nuclear *(LF)-space, and the convergence 

Pm => 0 of a sequence Pp is identical to the 

convergence q,, 0 in this topology. A sub- 

set Bc 2(Q) is 'bounded in the topology 

if and only if there exist a compact set Ec Q 

such that supp o c E for every ge B, and 

positive numbers M, for each p such that 

sup|D?@(x)|< M, for every oc B. 

The topology of the space Z'(Q) is the 
strong topology of the tdual of A(Q): the 
ttopology of uniform convergence on every 
bounded set in 2(O). Under this topology 
YQ) is a *nuclear freflexive linear topo- 
logical space. With respect to this topology, 
the linear differential operators of Section F 
are continuous in Oil. 

By virtue of the following convergence 
theorems, various limiting processes concern- 
ing distributions can be treated easily. If the 
limit lim Til = T(¢) exists for every pe 2, 
where {7;} is a sequence of distributions, then 
Te Z' and {T} is convergent to the distribu- 
tion T (convergence theorem). Moreover, for 
any p, D’? T; is convergent to D” T (theorem of 
termwise differentiation). Any bounded set in 
£ or Gi is totally bounded. Thus weak conver- 
gence of a sequence {T} implies strong conver- 
gence (convergence in the topology of the 
space 2’) (strong convergence theorem). 


476 


H. Distributions Depending on a Parameter 


Consider distributions T; depending on a 
parameter A, where 4 ranges over the real line, 
the complex plane, or more generally an open 
set in Euclidean space. The convergence theo- 
rem and the strong convergence theorem also 
hold in the case of a continuous parameter A. 
Thus T, is continuous (differentiable) with 
respect to the parameter / if T,(@) is continu- 
ous (differentiable) with respect to 4 for any 
q € Z. If T, is continuous with respect to 4, 
then D?T, is also continuous with respect to 
2. If T; is differentiable with respect to a real 
variable 4, then D? T; is also differentiable, and 
OD? T, /[0À = D' (OT, /02). The same facts hold for 
the case of several real variables. For a com- 
plex parameter 4, T; is holomorphic with re- 
spect to 4 if T; (o) is holomorphic for any oe 2. 
The fundamental properties of holomorphic 
functions also hold in this case. 

If T; 1s defined and continuous in an interval 
[a,b], then the integral T= f T; då with respect 
to 2 exists in Z' and we have 


ro- | L(e)d4 oe, 


a 


I. Distributions with Compact Support 


We denote by &(Q) (abbreviated to &) the 
space of all complex valued C?-functions 

on Q. &(Q) is a nuclear *Fréchet space with 

the locally convex topology defined by the 
seminorms 

Dp. k(0) — sup | D? p(x)| (5) 
as p ranges over all multi-indices and K ranges 
over all compact sets in Q. 

We denote by &"(Q) (abbreviated to &") the 
tstrong dual of &(Q), i.e., the set of all continu- 
ous linear functionals on 6(Q) equipped with 
the topology of uniform convergence on 
bounded sets in 6(Q). &(Q) is a nuclear (DE), 
space. If Te &’(Q), then its restriction to A(Q) is 
a distribution with compact support. Con- 
versely if T is a distribution with compact 
support in Q, then choosing ae AQ) which is 
identically equal to 1 on a neighborhood of 
the support of T, we define a linear functional 
S on &(Q) by S(~)= T(aq). Then SE &(Q), and 
S is independent of the choice of «. In this 
sense, we can consider that S is the extension 
of T to &(Q) and identify S and T. thus &(Q) 
coincides with the set of all distributions with 
compact support in Q. 


J. Structure Theorems for Distributions 


A distribution T €Z'(Q) is said to be of order at 
most k if | T(q)| X Dia) ua (0) for {k} =fk, k, k, doe ! 


4T] 


and for some {a}. A distribution of order 0 is a 
measure. Every distribution of finite order can 
be represented as a finite linear combination 
of derivatives (in the sense of distributions) of 
measures or locally integrable functions. The 
restriction of any distribution to a relatively 
compact open subset 9’ can be represented 
in this way because it is of finite order. There- 
fore the distributions form the smallest class 
of generalized functions that contains all 
locally integrable functions, is stable under 
differentiation, and forms a sheaf. Further- 
more, for any distribution T therc exists a 
sequence 19; c Z that converges to T in the 
distribution sense. 

A closed set F is said to be regular if for 
every point a in F we have a neighborhood U 
of a and constants w and 1 >«>0 such that 
every pair of points x and y in FNU is con- 
nected by a curve contained in F with length 
less than or equal to w|x — y|*. If the support 
of a distribution T is contained in a regular 
closed set F, then 


T=} DT, 


(the expression is not necessarily unique, and 
the sum is locally finite), where the x, are 
complex-valued measures with support con- 
tained in F. In particular, if the support of a 
distribution T contains only one point a, then 
T can be represented uniquely as a finite linear 
combination 
T= Y a,DFà 


|p| m 


(x—a) 


of derivatives of the distribution ó defined 


by à, (9) = Pla). 


(x-a) 


K. Tensor Products of Distributions 


Let Q, c R" and Q, c R” be open sets. If Te 
2'(Q.) and Se Z'(Q,), then there is a unique 
distribution T & Se Z'(Q. x Q,), called the 
tensor product (or direct product), such that 
TG S(oG)v(y) = T(e)S(y) for any oe 2(0,) 
and y e Z(O,). More directly we have Fubini's 
theorem: T & S(e(x, y) = T.(Sj(oGx, y) = 
S,(T(e(x, y))) for any pe BQ, x Q,). If F and 
G are linear spaces of distributions on Q, and 
Q, respectively, then the ‘tensor product 

F ® Gis identified with the linear combina- 
tions of tensor products T & S of Te F and 
SeG, which form a linear space of distribu- 
tions on Q, x O,. When F and G have locally 
convex topologies, the tcompleted tensor 
products F & G and F QG are also usually 
identified with subspaces of Z'(Q, x Q,). For 
example, we have Z'(Q.) & 2'(Q,) = BQ, x 
Q,),6'Q,) ® 4(Q) - &(Q, x Q) and 6Q,)® 
6(Q,) = 6(Q, x Q,) (including the topologies). 
Similarly, we have 2(0,) & AQ,) = AQ, x Q,), 
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but the left-hand side has a strictly weaker 
topology than the right-hand side. Spaces of 
distributions with parameters (— Section H) 
are often identified with completed tensor 
products of spaces of distributions [8-11 ]. 


L. The Kernel Theorem 


Let Q, and Q, be as above. Every distribu- 
tion K on Q, x Q, induces the continuous 
linear mapping Ly: A(Q,)> 2Z'(O,) defined by 
(datts Klo Y), v e 2(9,). 
Conversely every continuous linear mapping 
L:2(0,)5 B(Q,) is equal to Lx fora Ke 
P(Q, x Q,) and the correspondence K> Ly is 
a topological isomorphism of the locally con- 
vex space Z'(Q. x Q,) onto L(Z(Q.), Z'((Q,)) 
equipped with the topology of uniform conver- 
gence on the bounded sets (L. Schwartz's 
kernel theorem, [8-11]). K is called the kernel 
(distribution) of the mapping Ly. 

Lx is a continuous linear mapping Z(Q,)— 
&(Q,) (resp. 2(Q,)— &'(Q,)) if and only if Ke 
BQ.) È E(Q,) (resp. 2'(O,) & &'(Q,)), in which 
case K is said to be regular (resp. compact) in 
y. Lg can be extended to a continuous linear 
mapping &'(Q,)- 2'(Q,) (resp. 6(Q,) 2(Q,)) 
if and only if K e£(Q,) & 2'(Q,) (resp. &'(Q.) & 
Z'(Q,)), in which case K is said to be regular 
(resp. compact) in x. K is said to be regular 
(resp. compact) if it is regular (resp. compact) 
both in x and y. Lg can be extended to a con- 
tinuous linear mapping 6(Q,)>6(Q,) (resp. 
6(Q.) 6'(Q,)) if and only if K €6(Q. x Q,) 
(resp. dÄ, x Q,)). Then K is said to be regu- 
larizing (resp. compactifying) [8, 10]. 


M. Convolution 


For distributions S and T on R", assume that 
either S or T has a compact support or more 
generally that supp TN ({x} — supp S) is locally 
uniformly bounded with respect to x. Then the 
correspondence 


pE P> Six Ty (p(x Ty) 


defines a distribution called the convolution of 
S and T and is denoted by S « T. In particular, 
Tr* T, — T,, ,, where f*g is the tconvolution of 
functions f and g, and T;, T;, and 7;,, denote 
the distributions corresponding to f, g, and 

f * 9, respectively. For example: T+6=6* T= 
T, D? Tx S= T«(D^S)- D'(T«S). 

Thus a solution of the partial differential 
equation P(D)T=S with constant coefficients 
is given by the convolution S« E of S with a 
"fundamental solution E of P(D), whenever 
the convolution exists. 

If Te 2' and oe, the convolution T 
is equal to a function f of class C* (or the 
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distribution corresponding to f), and f( y) — 
Tuy (p(y — x)). f is called the regularization of 
T. For distributions S and T, assume that 
|f(x)g(y — x)| is integrable on R” for any regu- 
larization f =S x and g — T* vy. Then there 
exists a unique distribution V such that 


V*(os D= [fent dx. 


This distribution V is called the general- 
ized convolution and is denoted by S« T 
(C. Chevalley). 


N. Tempered Distributions zc" 


We denote by (R") (abbreviated to S) the 
space of all trapidly decreasing functions of 
class C” on R". ¥ is a nuclear Fréchet space 
with the locally convex topology defined by 
the seminorms 


Pp, (9) = sup |x’ D4 p(x), (6) 


as p and q range over all multi-indices, where 
xP= xP... xl". We denote by #’(R") (abbre- 
viated to S’) the strong dual of Y(R"). Its 
elements are called tempered distributions (or 
slowly increasing distributions) and are identi- 
fied with their restrictions to Z(R"). A distri- 
bution T'e Z' can be continuously extended to 
4 if and only if any regularization f =T*@ 
of T is a slowly increasing continuous func- 
tion (e, there is a polynomial P(x) such that 
f(x) € | PG9]). ^ (R") is a nuclear (DF)-space. 
Let 1 « p « oo, and let q be the tconjugate ex- 
ponent. The strong dual of the function space 
2, (R") is denoted by G(R”). Since Z is dense 
in Z, , Z; is identified with a linear subspace 
of F’. The strong dual of 2; (R") is equal to 
9, (R"). Let A(R") be the closed linear sub- 
space of Z(R") consisting of all functions o 
such that D” e C,(R”) for all p ( 168 Func- 
tion Spaces). It is also the closure of SZ in 2. 
The strong dual of Z is denoted by 7; (R"). 
Its elements are called integrable distributions 
because the strong dual of 2; (R") is equal to 
4(R") so that the integral T(1) has a meaning. 
Here 1 is the function identically equal to 1. 
tSobolev spaces W,(R") and *Besov spaces 
RB 4(R") are also linear subspaces of "'(R"). 


O. Fourier Transforms 
The Fourier transform 
F(x)=(V/ à" | etüenpt- ixé)dé (7) 


is an isomorphism of 7; onto A. where xà 
— X46, t... X,6,. The Fourier transform 
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FTES of Te F is defined by 


(FT)(p)=T(FQ), oe, (8) 


The inverse Fourier transform is defined simi- 
larly, except that — i 1s replaced by i. For 
example: (1) F1 -(/2ny'à, where 1 is the 
distribution corresponding to the function 1. 
(2) £(D?T)— i" x? FT. 

A function q of class C? is called a slowly 
increasing C?-function if @ and its derivatives 
of any order are slowly increasing continuous 
functions. The space Oy is the set of all such 
functions. Its image under the Fourier trans- 
form F (Cu), denoted by Og, coincides with 
the set of all distributions T such that any 
regularization f = T * o is a rapidly decreas- 
ing C^-function. A member of the space O 
is called a rapidly decreasing distribution. If 
ae Oy (ffe €) and Te X", then «T e F’ and 
F (xaT) - (2n) " Fax £T (fl« Ter and 
Z(fi* T) -(J/2ny BAT). Yf T is a distri- 
bution with compact support in R", then its 
Fourier transform is the tentire function f({)= 
( 2n)" T(exp(— ixc)), where ( 2 č + ine C". 
For a convex compact set K in R”, define its 
supporting function H, by H, (nl sup, y xq. 

Paley-Wiener theorem. Àn entire function 
f(&C) on C" is the Fourier transform of a distri- 
bution (resp. a function of class C^) with sup- 
port in K if and only if (i) for any ¢>0 there is 
a constant C, such that | f(| x C,exp(H,(y)+ 
&|C|) on C", and (ii) there are constants C and 
N such that | (|< C(1 +é)” (resp. for any 
N there is a constant C such that | f(£)| x 
C(1+ |é|)~%) on R”. 


P. Fourier Series and Distributions on Tori 


The n-dimensional *torus T" is the quotient 
space of R" with respect to the equivalence 
relation x;z y;mod Z ( j — 1, ..., n). The space 
£'(T") of distributions on T" is defined to be 
the strong dual of Z(T") = &(T"). The volume 
element dx of T" is defined from that of R". 


. Thus, for an integrable function f, we can 


define a distribution T; by 


Trí(o)— | fe)e(x)dx, pe G(T"). 
T 


Consider the family of functions f,(x)= 
exp(2zipx), where p ranges over all n-tuples 
of integers. Then any distribution T on T" has 
the Fourier series expansion T = X c, f, (x), 
where the Fourier coefficients c,— T(f_,) are 
slowly increasing, i.e., |c,|  C(1 4- |p[)* for 
some k and C. Conversely if {c,} is a slowly 
increasing sequence, then È c, f, converges to 
a distribution T on T”. 
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Q. Substitution 


Let f —(fi,...,f,) bea C*-mapping from an 
open set Q in R” into an open set © in R”, and 
assume that the *rank of the Jacobian matrix 
(0f;/0xj (i— 1, ..., m; j= L ..., n) is equal to m in 
a neighborhood of the *inverse image E of the 
support of a distribution S = Bug Z'((). In a 
neighborhood U of E, we can choose u, = 
gut... He 7 Ou (X) SO that the trans- 
formation (y, u)=( f(x), g(x)) has the inverse 
transformation x = y(y, u) of class C*. If the 
support of oe AQ) is contained in U, then 
defining @ by 


y= | QV y, u))J(y, u) du, 
ae 


we have $e AQ), where J(y, u) is the abso- 
lute value of the tJacobian of ett, u), and o 

is independent of the choice of g, ..., 9, ,,. 
Now we define T(q) — S(@) if supp ọ c U, and 
T(q) —0 if supp @ does not intersect E. Since a 
distribution can be determined by its local 
behavior, we have a distribution Te 2'(O) with 
support E, which is denoted by T—- Sof —S(f) 
= Bro and is called the substituted distribution 
of Sy by y= f(x). It is also called the pullback 
of S by f and is denoted by f*S. The chain 
rule for derivatives of composites 





Ó m of, ( 08 
—(Sof)= -[— 9 
ax! of) 2 SE (9) 
also holds. 
For example: (1) If S = (2 (ye R”), then E is 
the surface fi (x)= f; (x) 2 ... — f, (x) 20. Assum- 


ing that (fi, ..., fm) satisfies the condition in the 

previous paragraph, we obtain the distribution 
ó 9X f,, ..., fa) with support contained in a sur- 

face. From the fact 6 = D26, we can write 


"E ouo f,, ics faa 
| (0f)... (0f, 


(Gel'fand-Shilov notation). (2) For the map- 
ping f(x) — Ax -- b, where A is ann xn regular 
matrix and b is an n-vector, we can define the 
substituted distribution of S by f= f(x) for any 
S € Z'(R"). For instance, à (9) = o (b). A, 2; 
—(2c) Të, a 0544) (C>0, x eR). 





ó«X f, , ars 


R. Distributions and Currents on a 
Differentiable Manifold 


Let M be an n-dimensional tdifferentiable 
manifold of class C^. Let {(U,, W,)} be an 
tatlas. Then a distribution T on M is defined to 
be a collection of distributions T, on w,(U,)< 
R” such that 7; on W,(U,/ Uz) is equal to T; o 
(V, o V, !) for any 3 and f. Since the distribu- 
tions obey the same transformation law as 
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functions under coordinate transformation, 
this definition has an invariant meaning, and 
every locally integrable function on M is re- 
garded as a distribution. More generally, a 
distribution *cross section of a tvector bundle 
of class C? is defined in the same way. The 
most important is the case of the tp-fold ex- 
terior power of the tcotangent bundle. Its 
distribution cross sections, i.e., *exterior dif- 
ferential forms of order p with distribution 
coefficients, are called currents of degree p. If 
M is *oriented, then the space of currents of 
degree p on M is identified with the dual space 
of the locally convex space 2" P(M) of all 
differential forms of degree n — p, of class C^ 
and with compact support in M. For example, 
if C is an (n — p)-dimensional singular chain, a 
current To of degree p is defined by T-(a) — fca. 
If M is not *orientable, we have to consider 
either the double covering M of M or the cross 
sections of the tensor product of the bundle of 
(n — p)-covectors and the orientation bundle 
[1, 3]. Currents were introduced by de Rham 
to prove his celebrated isomorphism theo- 
rem of the *de Rham cohomology groups de- 
fined by differential forms and the 'singular 
cohomology groups defined by singular co- 
chains (— 105 Differentiable Manifolds). 


S. Gel'fand-Shilov Generalized Functions 


Let E and F be *locally convex spaces for 
which a continuous linear injection i: E>F 
with dense range is defined. Then the dual 7’: 
F'—E' is a continuous linear injection with 
weak*-dense range. Therefore if we shrink the 
function space E, we obtain a larger space E' 
of continuous linear functionals. Gel'fand and 
Shilov [4] introduced many function spaces 
E, called fundamental spaces or test function 
spaces, and defined corresponding spaces E' of 
generalized functions. Their motivations were 
in applications to the theory of partial dif- 
ferential equations, where Fourier transforms 
play an essential role and Schwartz's frame- 
work of tempered distributions is too restric- 
tive. They considered a pair of function spaces 
E and E such that the Fourier transformation 
F : EE is an isomorphism of locally convex 
spaces. Then the Fourier transformation de- 
fined to be the dual of F gives an isomor- 
phism E E' of spaces of generalized func- 
tions. The function spaces E and E are often 
spaces of entire functions, so that the Gel'fand- 
Shilov generalized functions are not neces- 
sarily localizable in R". As typical examples of 
spaces E and E we shall mention only spaces 
of type S in the next section. 

L. Ehrenpreis's analytically uniform spaces 
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[12] are also a class of generalized function 
spaces introduced from a similar point of view. 


T. Spaces of Type S 


Let a, B, A, B, A, and B be n-dimensional 
vectors. By A> B we mean Aj;7 B,(j=1,...,n). 
We use notations A? = AM... AP, pP* = phi... 
pp. (i) For «20, A>0 the space S, , consists 
of all C?-functions o such that seminorms 


|x? D' o(x)] 
Pa, (P) - supsup = — (10) 


p At" 

are finite for all A > A and q. For example, 
So, a is the space of all functions of class C^ 
with support in {x||x,|< 4j.j 5 1, .... n). (ii) 
For f z 0, B»0 the space $^? consists of all 
C*?-functions o such that the seminorms 


|x? D* o(x)] 
Go ll -SuDSUP "gir (11) 
are finite for all B >B and q. (iii) For x, f 20 
and A, B 7 0, the space SË? consists of all C”- 
functions such that the seminorms 

|x” D" o(x)] 


Pa,B(p)=supsupsup 


x p 4 APBIprtgt ES 


are finite for all A > A and B >B. The topol- 
ogies for these spaces are given in terms of 
the seminorms (10), (11), and (12), respectively. 
These spaces are generically called spaces of 
type S. 


FS, a) =S% (x20) 
F(SP*=Sp (f»0) 
EECHER 
(So, 4) - 5^4, 

F (S^) = Sop, 

F(a =e (atb 


where A' = Aexp(1/A) and B’ = Bexp(1/B). 
We set S, | JS, 4, Sf = |_] SPP, and S? = 
UU S22, where A and B range over all positive 
n-dimensional vectors. SU + {0} if and only if 
one of the following conditions is satisfied: (1) 
a+fp>t,a>0, B>0; (ii) «> 1, fi 20; (iii) «—0, 
f 1. In such a case the space SP of general- 
ized functions contains the tempered distribu- 
tions (RTL 


U. Ultradistributions 


The localizing property of distributions is 
proved mainly from the existence of fpartitions 
of unity by functions in Z. Therefore if a test 
function space admits partitions of unity, the 
corresponding class of generalized functions 
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forms a sheaf. Roumieu [5] took the space 
Zw, (Q) of *ultradifferentiable functions of 
class {M,} with compact support. We can also 
use the space Z, (Q) of class (M,). The cor- 
responding generalized functions are called 
ultradistributions of class ( M,j and of class 
(Mj), respectively. Here M, is a sequence of 
positive numbers satisfying the logarithmic 
convexity M; < M,.., M,., and the Denjoy- 
Carleman condition Z M,/M,.; < oo. Zi (Q) 
(resp. Zu (Q)) is the space of all functions 

oe Z(Q) such that there are constants k and 

C (resp. for any k 0 there is a C) for which 
we have |D*o(x)| € CK" AM. In particular, 

the Gevrey classes {s} = ( p^] and (s) 2 (p) 
defined for s 1 are important, and they ap- 
pear often in theory of differential equations. 
Almost all results for distributions have been 
extended to ultradistributions under appropri- 
ate conditions on M,, which are satisfied by 
Gevrey sequences pP (— [14] in particular). 
Closely connected are A. Beurling's test func- 
tion spaces 2,(Q) (— [13]), where c is a func- 
tion on R" continuous at the origin, satisfying 
0— o(0) <o t n) <@(¢)+ ex) for all č and y 
in R” and such that f o(£)(1--|£]) " ! d£ < oc. 
G.,(Q) is defined to be the space of all pe Z(Q) 
such that the Fourier transform ¢ satisfies 
fé) exp(4o(£)) dé < oo for any A>0. If o£) 
=log(1+|é|), then 2, = 2; if o(£) « |£|"5, then 
D. Hs. Let É) =w — é). The space 2; (Q) 
of Beurling's generalized distributions is defined 
to be the strong dual of 2,(Q). 


V. Hyperfunctions 


In this and the following sections we mean by 
a cone l'C R” a convex open cone with vertex 
at 0. For two cones T, A, we write ACT if 
ADS" ! is relatively compact in l'1S" !, where 
S"! = {|x|=1}. By a wedge we mean an open 
subset of C" of the form Q -- iT, where OQcR" is 
an open subset and T c R” is a cone. F is called 
the opening of the wedge and Q its edge. By an 
infinitesimal wedge (0-wedge for short) or a 
tuboid of opening T and edge Q, we mean a 
complex open set U such that U c Q 4-il' and 
that for any AET, U contains the part of 
Q 4- i^ which is contained in some complex 
neighborhood of the edge Q. The symbol Q + 
ilO will represent any one of such open sets, 
and ©(Q + iO) (the inductive limit of) the total- 
ity of functions holomorphic on some of them. 
A hyperfunction f(x) on an open set Oc R” 
is an equivalence class of formal expressions of 
the form 


fog Y. Re+, (13) 


where Fi(z)e C(Q + i150). {F;(z)} is called a set 
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of defining functions of f(x). Here the equiva- 
lence relation is given as 


F(x 4- i0) + F, (x -iT,0) -(F;-4- F,) (x+ iT; T, 0) 
(14) 


if I; T, #2, that is, we can contract two terms 
into a single term as above and, conversely, 
can decompose, if possible, a term in the 
inverse way. These are considered to be modi- 
fications of the expression of the same hyper- 
function. The totality of hyperfunctions on Q 
is denoted by A(Q). It is a linear space by 
virtue of the linear structure naturally induced 
from that of holomorphic functions, combined 
with the above equivalence relation. 

The symbol F,(x + i10), which represents by 
itself a hyperfunction, is called the boundary 
value of F(z) to the real axis. It is merely 
formal and does not imply any topological 
limit, though there is some justification for the 
terminology as will be seen in Section Z. 

In the case of one variable, we have only 
two kinds of wedges Q+iR*, hence a hyper- 
function can be expressed by two terms: 


F(x + i0) — F (x — i0). (15) 


Some examples of hyperfunctions of one vari- 
able are ó(x) 2 —(2zi) !((x-- i0) ! —(x— i0) y; 
1(x)= —(2zi) '(log( — x — i0) — log( — x + i0)); 
Pfx ! =((x+ i0) ! +(x —i0) !y/2. 


W. Localization of Hyperfunctions 


HO c O is an open subset, the restriction 
mapping 2(9)— AO’) is induced from that for 
holomorphic functions. With this structure the 
correspondence Q- 220 D becomes a *presheaf. 
It is in fact a sheaf, because it can be expressed 
by the terminology of relative (or local) co- 
homology as follows: Let H&(C", F) denote 
the kth relative cohomology group of the pair 
(C^, C"~Q) (also called the kth local coho- 
mology group with support in Q) with coeffi- 
cients in a sheaf F on C". (It is by definition 
the kth tderived functor of F GC", F)= 
the totality of sections of F defined on a 
neighborhood of Q and with support in Q and 
is calculated as the kth cohomology group of 
the tcomplex TA, Z), where Z denotes any 
flabby resolution (i.e., tresolution by tflabby 
sheaves) of F.) Let #kn(F) denote the kth 
derived sheaf of F to R”. (It is by definition the 
sheaf on R" associated with the presheaf 

Qo H&(C", F ).) Then the cohomological 
definition of the sheaf of hyperfunctions is # 
= HAC) (the orientation being neglected). A 
fundamental theorem by Sato says that R" 

c C" is purely n-codimensional with respect to 
€ (i.e., AK(O)=0 for k zn), and moreover : 
H&(C", 0) 20 for k zn for any open set Oc R”. 
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Then by the general theory it can be shown 
that the remaining HCH, ©) agrees with the 
section module A(Q) of 2£q.(0). Since H"(V, ©) 
=0 for any open set V c €" (B. Malgrange), it 
follows further that the sheaf Z is flabby, i.e., 
its sections on any open set can always be 
extended to the whole space. 

If U is a tStein neighborhood of Q, then 
H¢(C", ©) can be expressed using the covering 
cohomology as the quotient space 


eu «y Y C(U #;Q), (16) 


where 
Us Q-izeUl|pr,(z)épr,(Q) for all kj, 


Us,O-ízeU|pr,z)épr(Q) fork#j}, (17) 


and pr, is the projection from CH to the kth 
coordinate. Then the isomorphism H$(C", 6) — 
BQ) is induced by the correspondence 


OU €Q)3 F(z) 5 [ F(z)] 
=} sgn oF(x+il, Ne AQ), (18) 


where T, is the c-orthant (ye blous 0,j — 
1,...,n} and sgnao=o, ...6,. F(z) is called a 
defining function of the corresponding hyper- 
function. For one variable, any complex neigh- 
borhood U >92 is Stein, and the above isomor- 
phism reads O(U\.Q)/O(U) = Z(Q), from which 
the naturality of the sign in (15) follows. 

Thus the notion of support is also legitimate 
for hyperfunctions. The sheaf of hyperfunc- 
tions Z does not admit partitions of unity as 
for Z'. It is, however, flabby. Consequently, 
given a decomposition of a closed set into 
locally finite closed subsets E —| J;., E; and 
a hyperfunction f with support in E, we can 
always find hyperfunctions f; with support in 
E, such that f — X;.4 f}. For distributions this 
property holds only under some regularity 
assumption for the decomposition. 

There are several practical criteria to deter- 
mine whether or not a hyperfunction is zero in 
some open set Q. These are called the edge of 
the wedge theorem. A hyperfunction F(x + iIO) 
with single expression is zero if and only if F(z) 
itself is zero. F,(x - i,0) 2 F(x 4-iD50) if and 
only if they stick together to a function in O(Q 
t i(T, 4 15)0) (Epstein type). (Note that I'; +T, 
is equal to the convex hull of I, UI5, e.g., I + 
(—T)=R" (Bogolyubov type).) SZ, F(x + iT;0) 
=0 if and only if there exist G,(z)e@(Q + i(1; + 
T,,)0), j, k— 1, ..., N, such that G&(z) — G,,(z) 
and E Di, Gy, j— 1, ..., N (A. Martineau 
[19]). These are interpretations of cohomology 
in terms of coverings and have global variants 
concerning the envelope of holomorphy. 

The real analytic functions o €. (Q) on Q 
are naturally included in A(Q) via the ex- 
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pression (x + iF0) for any F or with r =R”. 
They form a subsheaf. Let f be a hyperfunc- 
tion on Q. The complement in Q of the largest 
open subset Q' <Q, where f(x) is real analytic, 
is called the singular support of f(x) and is 
denoted by singsupp f. If Q is bounded and 
singsupp f c K, then we can choose an ex- 
pression of the form (13), where F;(z) can be 
continued analytically to ON K. If supp f c K, 
then these F;(z) satisfy X F(x) 20 on QNK in 
the usual sense. 


X. Operations on Hyperfunctions 


The derivatives of a hyperfunction f(x) with 
the expression (13) are defined through the 
defining functions as D^f(x) = X(D? F)(x + 
i] ;0). The product by a real analytic function 
w(x) is defined by v(x)f(x) 2 X(V F) (x + i10). 
Combining these, we have the operation of a 
linear partial differential operator with real 
analytic coefficients P(x, D) on hyperfunctions. 
It is a tsheaf homomorphism. 

Let fe BQ) and D cQ be a compact 
set with piecewise smooth boundary. If 
singsupp f N ôD = @, then by means of the 
special expression (13) mentioned at the end of 
Section W the definite integral is defined as 


| "de Y | DEI 


where D; is a path deformed from D in such a 
way that 0D; — 0D and F,(z) is holomorphic on 
D,. The result is independent of the choice of 
deformations or of the boundary value ex- 
pression employed. 

If supp f € D, the result is also indepen- 
dent of D; hence it can be written as Je, f(x)dx. 
If f(x, t) is a hyperfunction of the two groups 
of variables (x, t)eQ x V such that singsupp fN 
0D x V= Ø, then the integral fp f(x, t)dxe 
BV) is defined by the same method. It com- 
mutes with differentiation cr integration with 
respect to f. 

If f(x) 2 X F(x + i150), g(x) 2 3 G,(x + iA,0), 
then we can define the product by f(x)g(x) 2 
X(F;G,)(x + il; 1 A,0) under the assumption 
that I; A, # Ø for every pair j, k. Especially, 
the product f(x)g(t) is always legitimate for 
two hyperfunctions depending on different 
groups of variables. (It can also be interpreted 
as the tensor product.) 

A real analytic coordinate transformation 
x = @(%):Q-9 extends naturally to a holo- 
morphic coordinate transformation z= (2) 
on a complex neighborhood. A 0-wedge Q + 
iI'0 is transformed by ®~' to a twisted wedge 
containing, for each xeQ and ACT, a 0- 
wedge $$. + i(D®~'), AO with some (real) neigh- 
borhood ©, of x « ^! (x), where (D ^!), is the 
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derivative or the Jacobian matrix of the map- 
ping $^! at x. Thus the pullback $*f(3)— 


f(b(3))e AQ) of f(x)e AQ) by the transforma- 


tion x = (X) can be defined by substituting 

z= (2) into the defining functions. It is consis- 
tent with the definition of the coordinate trans- 
formation for real analytic functions. Also, the 
law of the change of variables for the definite 
integral is the same as that for real analytic 
functions. This can be extended to the general 
operation of substitution as mentioned in Sec- 
tion Q for distributions or even to much more 
general operations (— Section CC). 

The convolution ( f * g)(x) — le, f(x — t)g(t) dt 
is defined under the same assumption on sup- 
port as in the case of distributions. It can be 
defined either literally as the composition of 
the above operations, or directly by way of the 
defining functions: For example, if g has com- 
pact support € D and È G,(x + i^,0) is an 
expression as mentioned at the end of Section 
W, then by choosing a suitable deformation D, 
of D, we have 


vos KËSSEN 


Seet Hier Aalt 


Y. Hyperfunctions and Analytic Functionals 


The totality Z[K] of hyperfunctions with sup- 
port in a compact set K becomes a nuclear 
Fréchet space. It is the dual of the nuclear 
(DF)-space .(K) of real analytic functions de- 
fined on a neighborhood of K. Thus Z[K] is 
the space of analytic functionals with *porter 
in the real compact set K. The duality is given 
by the definite integral €f, p> = le f(x) p(x) dx 
for f e BLK] and oe.(K). A sequence { f,(x)} 
in ALK] converges if and only if it admits an 
expression (13) for a common fixed set of 0- 
wedges Q+ iT;0, j — 1, ..., N, such that the 
defining functions Ee &(Q + i1;0) are also 
holomorphic on a fixed complex neighbor- 
hood U of QNK and converges locally uni- 
formly in Q+ iT;0U U, where Q is a (real) 
neighborhood of K. 

BLK] is isomorphic to HCH, ©), and 
the above duality is a special case of the 
Martineau-Harvey duality HZ(C", ©) = O(K)’ 
for a Stein compact set K c C". In the 1- 
dimensional case this is due to G. Kóthe. If 
K -[a,,b,] x ... x [a,, b,], then HCH, ©) can 
be represented, including the topology, by the 
quotient space 


J 


nen OU #;K), 

E 

where U is a Stein neighborhood of K and 
U#K, U#;K are defined in the same way as 
(17). If we choose U=U, x... x U,, then by 
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way of a defining function F(z)e €(U # K) for 
f € 8 [ K] the inner product is given by the 
contour integral 


| f(x)e(x)dx 
" 


md Ab F(z)o(z)dz, ...dz,, 


where y; c U; is a closed path surrounding 

[a;, b;] once in the positive sense. Similar inte- 
gral formulas are known for some special K 
of various types. 

Starting from analytic functionals we can 
reconstruct the sheaf of hyperfunctions. For 
example, we can put Z(Q) =the totality of 
locally finite sums of analytic functionals with 
porter in Q modulo the rearrangement of 
supports by decomposition (Martineau [17 ]). 
If Q is bounded, we can also put Z(Q) = Z[O]/ 
BN] (Schapira [18]). The proof of local- 
izability and/or flabbiness is based on the 
decomposability of support (which is the dual 
of the exact sequence 0 —.«/(K U L).«/(K) € 
AL) AKN L)0) and the denseness of 
Z[K]c Z[L] (which is the dual of the unique 
continuation property 0—.4(L)> A (K)) for a 
pair K c L with the same family of connected 
components. Note that in no way is the topol- 
ogy of Z[K] localizable, or equivalently, A(Q) 
does not admit a reasonable topology. 


Z. Embedding of Distributions 


As the dual of the natural mapping ./(K)—> 
&(K) we have the topological embedding 
&'(K)o s (K) = Z[K]. This embedding con- 
serves the support and hence gives rise to an 
embedding of sheaf 2' c, 27 (R. Harvey). 

For a distribution T with compact support 
a set of its defining functions as a hyperfunc- 
tion is given by F,(z)= T,(W(z — x, I,)), o being 
the multisignature, where W(z, I) = fs«ior, 
W(z,w)dw and W(z, w) is the component of a 
Radon decomposition of ó(x) (^ Section CC). 
If supp T c K =[a,,b,] x... x [a,, b,], then 
as a hyperfunction it is represented by the 
following element of €(C" X K}: 


F(z)- d : d 
QniJ'(x; — 21)... Lë" Zn) 
It is in fact in €((P')" # K) and vanishes at 
infinity, where P! = C' U {co} is the Riemann 
sphere. These formulas are valid also for hyper- 
functions, and they give defining functions of 
some canonical types. Especially, the one given 
by (19) is called the standard defining function 
and is characterized by the foregoing prop- 
erties. 
For the above-mentioned defining functions 





(19) 
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for a distribution T, we have the following 
convergence in the sense of distributions on 
R”: 


T, =) sgno lim F,(x-izy,) for y,cI,. 
d E 
More generally if the limit 
> lim F(x+iey) for yjel; (20) 


exists in Z'(Q), then the distribution defined as 
this limit admits {F;(z)} as a set of defining 
functions when it is considered as a hyper- 
function. However, for an arbitrary set of 
defining functions for a distribution, (20) does 
not necessarily converge in Z'(Q). (This is be- 
cause we can add terms with any bad behavior 
which cancel each other formally.) If a distri- 
bution admist the boundary value expression 
with only one term, or if the dimension is one, 
then the convergence of (20) in Z'(Q) neces- 
sarily holds. 

The convergence of (20) in Z'(Q) is equiva- 
lent to the locally uniform estimate for the 
defining functions of the type Fj(z) - O(|y| ") 
for some M » 0. These assertions can be gener- 
alized to ultradistributions. For 2; m, (Q) 
(resp. Zim (Q)) the last growth condition for 
the defining functions reads as follows: F(z) 
O(exp M*(L/|y|)) for some L (resp. every L0), 
where M*(p)—sup,log(p?p!Mo/M,J; espe- 
cially for M,— p we have M*(p)~ pS". 


AA. Hyperfunctions on a Real Analytic 
Manifold 


Sticking the hyperfunctions on coordinate 
patches by the transformation law mentioned 
in Section X, we can define the sheaf of hyper- 
functions on a real analytic manifold. More 
generally, for a real analytic vector bundle 
over a real analytic manifold, we can consider 
the sheaf of its hyperfunction local cross sec- 
tions, which is also flabby. Thus, especially 
on a real analytic manifold M, we can obtain 
a concrete flabby resolution of the constant 
sheaf C,, of length dim M by the sheaves of 
differential forms with hyperfunction coeffi- 
cients: 0>Cy BO 5 40D 5... —, dim M), Q, 
With this sequence we can calculate the rela- 
tive cohomology groups of open pairs with 
coefficients in C by an analytic method. This 
is an extension of the de Rham theory for 
distributions [16]. 

If M is a compact manifold equipped with a 
nowhere vanishing real analytic density glob- 
ally defined on M, then we have the topolog- 
ical duality Z(M)— »/(MY. The inner product 
is given by the definite integral with respect to 
the density. 
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The Fourier series is an example of hyper- 
functions on real analytic manifolds. The series 
È c exp(2nipx) converges in Z(T") and defines 
a periodic hyperfunction if and only if c, is of 
infra-exponential growth, i.e., c, = O(e"?!) for 
any £20. T" has the global complex neighbor- 
hood (Pip. and f(x)e A(T") has the corre- 
sponding boundary value expression 


F(z)e (Iz |" « L, ...,|z, [7  1]), 


which represents the terms in the Fourier 
series such that o;p; 0. 


BB. Fourier Hyperfunctions 


In place of Y we take as the basic space the 
space Y, of exponentially decreasing real ana- 
lytic functions in the sense of M. Sato [15]: 
f(x)e Z, if and only if there exist ô>0 and 

£2» 0 such that for any ð <ô and e' <e, f(x) 
extends holomorphically to the neighborhood 
{Imz| <ô} of the real axis and has order 

O(e "ect there. A, is endowed with the struc- 
ture of nuclear (DF)-space via the inductive 
limit for 6>0, £2 0. The classical Fourier 
transform F acts isomorphically on 28. (In 
fact, ô and ¢ change their roles under F.) The 
strong dual of 2, is called the space of Fourier 
hyperfunctions and is denoted by 2. It is a 
nuclear Fréchet space. It contains ^ as a 
dense subspace in view of the continuous 

and dense inclusion Z, c, F. It also contains 
classical locally integrable functions of infra- 
exponential growth, i.e., of order ell for any 
é>0. Thus by the duality we obtain a wider 
extension of Fourier transformation on 2. 

In the following a 0-wedge of the form R"4- 
ilO will be called a 0-wedge of the form D" + 
ilO at the same time if it is a tubular domain 
(i.e., with fixed imaginary part TOL Then an 
element f(x)e 2 can be expressed in the form 
(13), where each F,(z) is holomorphic in a 0- 
wedge D" + iI;0 and is of infra-exponential 
growth there locally uniformly in Imz. The 
inner product of such f(x) with o e£, is given 
by the definite integral 


N 

I (x)e(x)dx = d F(z) p(z)dz, 

J= Imz-y; 
where the y;€ I0 are fixed. Given a cone A we 
define its dual cone by A^ = {ye R"| <, y» zO 
for all ye Aj. If F(z) are all of exponential 
decrease in Rez locally uniformly with respect 
to ImzeI,0 and Rez/|Rez|¢A’, then the de- 
finite integral 


SQ | eene 
N 
Zur 


j Imz=y; 


e "S F(z)dz (21) 
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converges locally uniformly for ¢ in some 0- 
wedge D" — iA0, and defines there a holo- 
morphic function of infra-exponential growth. 
Thus we obtain a Fourier hyperfunction 

G(č — iA0) that agrees with Ff calculated by 
the duality. For a general f(x)e 2, the Fourier 
transform in the manner of Sato is calculated 
as follows: First we decompose f(x) into the 
sum È f(x) for which the defining functions 
of f,(x) decrease exponentially outside A7. 
Then we calculate G,(¢) by (21) and put Zf = 
È G, (ë — i^,0). An example of such decom- 
position is given by multiplication by z,(x)— 
IIj-, 1/(1 +expo;x;), which decreases expo- 
nentially outside I, — {0;x; > 0}. 

The relation between 2 and 4 is more com- 
plicated than the relation S’ c Z'(R"). The 
growth condition for 2 is interpreted as a 
condition concerning germs at infinity. Thus 2 
can be considered to be a sheaf on the direc- 
tional compactification D" — R"US":! such 
that 2|. — Z. Just as the sheaf Z is obtained 
from €, the sheaf 2 is obtained as the nth 
derived sheaf 240) from the sheaf @ on 
D" +iR" consisting of germs of holomorphic 
functions of infra-exponential growth with 
respect to Rez. We have HAD (Bn. ()=0 
for k#n for any open set Qc D". Especially, 2 
is flabby, and the decomposition of support is 
available to calculate the Fourier transform. 
The symbol 2 employed at the beginning to 
express the global Fourier hyperfunctions 
corresponds to 2(D"), and 2(R")= A(R”) by 
definition. The canonical restriction mapping 
J(D")— A(R") is surjective but not injective. 

As for tempered distributions s”, we can 
introduce various subclasses of Fourier hyper- 
functions, e.g., exponentially decreasing Fourier 
hyperfunctions | J;;, exp( — ó,/ x? +1) 2, real 
analytic functions of infra-exponential growth 
€(D^, etc. We can also consider operations 
such as convolution and multiplication be- 
tween adequate pairs, and apply differential 
operators with suitable coefficients. Concern- 
ing these we can avail ourselves of the same 
formulas as given in Section O. 

A hyperfunction with compact support ts 
naturally considered as a Fourier hyperfunc- 
tion, and its Fourier transform agrees with the 
inner product < f(x), (,/21) "exp(— ix£)», 
which gives an entire function of exponential 
type. 

Paley-Wiener theorem. An entire function 
f(E) is the Fourier transform of a hyperfunc- 
tion with support in a compact convex set K 
in R" if and only if it satisfies condition (1) of 
Section O. 

The theory of Fourier hyperfunctions de- 
scribed above is mainly due to Sato and T. 
Kawai [20]. They are not the largest class of 
generalized functions stable under the Fourier 
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transformation. Following a suggestion of 
Sato, S. Nagamachi and N. Mugibayashi, Y. 
Saburi, and Y. Ito have extended them to the 
modified Fourier hyperfunctions in which the 
radial compactification D?" of C" is employed 
instead of the horizontal compactification 
D" 1 iR" in the above theory. If we discard 
the localizing property, they can be extended 
further to the Fourier ultrahyperfunctions or 
ultradistributions of J. Sebastiaó e Silva (1958), 
M. Hasumi (1961), and M. Morimoto (1973). 


CC. Micro-Analyticity of Hyperfunctions 


The boundary value expression (13) for a 
hyperfunction f(x) can be interpreted con- 
versely as the description of the state of ana- 
lytic continuation of f(x) to the complex do- 
main. Thus we say that f is micro-analytic at 
(xo, čo) if on a neighborhood of x, it admits 
the analytic continuation into the half-space 
(Imz, čo <0 in the sense that it admits an ex- 
pression (13) satisfying I; {<Imz, £j» <0} 7z 
@ for every j. The set of points (Xo, &))EQ x 
S". where fe Z(Q) is not micro-analytic, is 
called the singularity spectrum or singular 
spectrum of f and is denoted by S.S... We have 
by definition S.S.F(x + i0) c Q x (F^ 18" ^1). 
Micro-analyticity can be characterized by the 
Fourier transformation as follows: f is micro- 
analytic at (xo, čo) if and only if there exists 

a Fourier hyperfunction g(x) such that the 
Fourier transform oC) decreases exponentially 
on a fconical neighborhood of čo and that 

f—g is real analytic on a neighborhood of 

Xo. A hyperfunction f(x) is real analytic in a 
neighborhood of x, if and only if it is micro- 
analytic at (x9, č) for any če S" !. 

With this notion we can clarify the oper- 
ations on hyperfunctions. (In the following 
S"! is identified with (R”\{0})/R* and + 
denotes the sum in the latter.) We have 


S.S.(fg) c {(x, 4£ +0 —A)n) (x, £)eS.S. f, 
(x, )eS.S.9,0€ A& 11 US. S. f US.S.g, 
SS (ttc) ((3,'Do£)|(b(3), HES.S.f}, 


and these operations are legitimate if and only 
if 0 does not appear in the direction compo- 
nent of the right-hand side. We also have 


S.S. | f(x Ddx f(x, Gs t, & 0)eS.S.f for a t}, 


SIT «g) c {(x y, OO, NESS Ff (y, eS. bai 


under suitable conditions for support. 
The classical decomposition formula of 
Radon (or the plane wave decompostion of ô) 


MEN dw 
099 | (—2niy I» (xc +10)" e 
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can be interpreted as the decomposition of 

ó into hyperfunctions with S.S. in the single 
direction œ. By the convolution, this formula 
supplies similar decomposition for a general 
hyperfunction (called the singular spectrum 
decomposition). (22) can be generalized to 


KT det(grad,, (x, œ)) 
S^! 


000) = ni (D(x, ol +10)" 








where the twisted phase D(x, œ) satisfies (i) 

OQ (x, 0) 1s a real analytic function of positive 
type (ie., Re D(x, c) 0 implies Im $(x, w) z 0) 
and (ii) (x, w) is homogeneous of order 1 in o 
and (0, w)=0, grad, (0, w) =a; and the 
vector V (x, œ) is such that <Y¥ (x, œ), x» = 

(x, c). If (x, c) further satisfies O(x, c) #0 
for x Z0, then the component becomes a 
hyperfunction (even a distribution) of x whose 
S.S. is precisely one point (0, w), and this fact is 
useful in theoretical applications [7,21] (— 274 
Microlocal Analysis). 


DD. Structure Theorems of Hyperfunctions 


A hyperfunction whose support is concen- 
trated at the origin is expressed as the infi- 
nite derivative J(D)ó(x) = a,D?0(x) of the 
Dirac measure, with the coefficients satisfying 
lim(|a,|p!)/? 20. Such an operator J(D) is 
called a local operator with constant coeffi- 
cients and acts on 2 as a sheaf homomor- 
phism. Its total symbol J(C) ts an entire func- 
tion of infra-exponential growth or of type 
[1,0]. By way of such an operator, a general 
(Fourier) hyperfunction can be written in the 
form J(D)g(x) with a continuous function on 
R" (of infra-exponential growth). 

If Gesupp f c (4v, x? >0}, then SS f 3(0, +v) 
(Holmgren type theorem of Kashiwara and 
Kawai), and furthermore, the direction compo- 
nent of S.S.f at 0 has the form of ( c v)Up TUE) 
with some Ec S" ?, where p: S" ! N( tvj 
S" ? is the projection to the equator (the 
watermelon-slicing theorem of Morimoto, 
Kashiwara, and K. Kataoka). E is called the 
reduced S.S. of f at 0. These theorems have 
many applications in the theory of linear par- 
tial differential equations and also in physics. 

A hyperfunction f(x) with support in the 
hyperplane x, =0 has several further remark- 
able properties in x,. It admits a formal ex- 
pansion of the form X2, f(x) (x,), where 
X exse) and fox) ES d Oda, /k dx, 
The sum converges in the sense of the topol- 
ogy if f has compact support. It reduces lo- 
cally to a finite sum if f is a distribution, and 
the kth term represents the k-ple layer distri- 
bution of mass, electric charge, etc. For a 
general f the coefficients { f,(x’)} do not neces- 


do, (23) 
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sarily determine f, though they are determined 
by f. 


EE. Complex Powers of Polynomials 


Among examples of special generalized func- 
tions the most important are those of the form 
TZ. where f, 2 max( f(x), 0} (or more generally 
it can be replaced by zero on some connected 
components of { f(x)» 0]), and Ae denotes a 
holomorphic parameter. (The discussion is the 
same for f_ = max ( — f(x), 0).) The simplest 
example, x5, is defined as the analytic con- 
tinuation of the locally integrable function x2 
for ReA> —1 by repeated use of the formula 
x} =(A+ 1) ! D,x2*!, and becomes meromor- 
phic in 4 with simple poles at 4— —1, —2, .... 
As a hyperfunction we have x2 = ((—x4- i0) — 
(—x—i0)*}/2isin rå. At a negative integer À = 
—n, x1 has residue (—1)" ^! ó"^ P (x)/(n — 1)! 
and finite part [ —(2zi) !z "log(—z)]. The 
latter is often denoted by x". In general, for 

a germ of a real-valued real analytic func- 

tion f(x) we can find a differential operator 
P(A, x, D,) with polynomial coefficients in 4 
and a monic polynomial b(å) of minimum de- 
gree such that 


P(A, x, DATT «b(f? (24) 


(Sato, LN Bernshtein, Kashiwara, J.-E. Bjork 
[22]). This formula gives the analytic con- 
tinuation of f? just as for x2. The polyno- 
mial b(A) is called the b-function or the Sato- 
Bernshtein polynomial and contains valuable 
information regarding the singularity of f. It 
has only negative rational roots (Kashiwara). 

We have f- f? 2 f?*'; hence f7! — f^! (suit- 
ably interpreted as above) gives a solution of 
the division problem u: f — 1. Thus if f is a 
polynomial, its inverse Fourier transform gives 
a tempered fundamental solution of f(—iD). 
Furthermore, when f is the relative invariant 
of a *prehomogeneous vector space, we can 
calculate b(4) explicitly by way of the holo- 
nomy diagram. Also, the Fourier transform of 
f? can be calculated explicitly by way of the 
real holonomy diagram as a linear combina- 
tion of the corresponding objects for the dual 
prehomogeneous vector space with coefficients 
similar to the *Maslov index. The simplest 
example is 


SE x4 -( 2n) (Fije "^ T (A+ (EFI)! 
=(,/22) (ENP A+) fetes 
-gtmAn emai), 
Among practical examples are the following 
classical formulas: Let P(x) be a nondegener- 


ate real quadratic form and Q(€) its tdual 
form, and let q denote the number of negative 
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eigenvalues of P(x). Then 
FP} = —( 2r) "22a + 1)T(4 + n/2) 
x |det P|"? {sin z(4-- 9/2)Q ;^ ^"? 
—sin z((n —)/2)Q ^^". 


Here the arguments in the I -factor (A+ 1)(4 + 
n/2) give the b-function of P(x). If q 2 n— 1, 
we further have, letting P2. = P2 1(-- (x, v5) 
for an eigenvector v corresponding to the 
unique positive eigenvalue, 


F Pi, = (27) "22249171 en 1p (4 4 1) 
x TL A n/2)|det P| V2 (e * ning -1-»2 
quee MO A CE TN 


From these formulas (taking the finite part if 
necessary) we obtain the fundamental solution 
of the wave equation, the Laplacian, and their 
iterations. These are exactly the distributions 
introduced by Hadamard, M. Riesz, and 
others, as mentioned in Section A (— Appen- 
dix A, Table 15.V). 


References 


[1] L. Schwartz, Théorie des distributions, 
Hermann, revised edition, 1966. 

[2] L. Hórmander, Linear partial differential 
operators, Springer, 1963. 

[3] G. de Rham, Variétés différentiables, Her- 
mann, 1955. 

[4] I. M. Gel'fand and G. E. Shilov, Gen- 
eralized functions. I, Properties and oper- 
ations; II, Spaces of fundamental and gen- 
eralized functions; III, Theory of differential 
equations; IV (I. M. Gel'fand and N. Ya. Vil- 
enkin), Applications of harmonic analysis; V 
(I. M. Gel'fand, M. I. Graev, and N. Ya. Vilen- 
kin), Integral geometry and representation the- 
ory, Academic Press, 1964, 1968, 1967, 1964, 
1966; VI (I. M. Gel'fand, M. I. Graev, and I. I. 
Pyatetskii-Shapiro), Representation theory 
and automorphic functions, Saunders, 1969. 
(Originals in Russian.) 

[5] C. Roumieu, Sur quelques extensions de la 
notion de distribution, Ann. Sci. Ecole Norm. 
Sup. Paris, 77 (1960), 41-121. 

[6] M. Sato, Theory of hyperfunctions I, II, J. 
Fac. Sci. Univ. Tokyo, sec. I, 8 (1959), 139— 
193, 387—437. 

[7] M. Sato, T. Kawai, and M. Kashiwara, 
Microfunctions and pseudo-differential equa- 
tions, in [16, pp. 265—529]. 

[8] L. Schwartz, Espaces de fonctions différ- 
entiables à valeurs vectorielles, J. Analyse 
Math., 4 (1954—1955), 88—148. 

[9] A. Grothendieck, Produits tensoriels topo- 
logiques et espaces nucléaires, Mem. Amer. 
Math. Soc., 16 (1955). 


487 


[10] L. Schwartz, Théorie des distributions 

à valeurs vectorielles, Ann. Inst. Fourier, 7 
(1957), 1-141; 8 (1958), 1-209. 

[11] F. Treves, Topological vector spaces, 
distributions and kernels, Academic Press, 
1967. 

[12] L. Ehrenpreis, Fourier analysis in several 
complex variables, Wiley-Interscience, 1970. 
[13] G. Bjórck, Linear partial differential 
operators and generalized distributions, Ark. 
Mat., 6 (1966), 351—407. 

[14] H. Komatsu, Ultradistributions. I, Struc- 
ture theorems and a characterization; IT, The 
kernel theorem and ultradistributions with 
support in a submanifold; III, Vector-valued 
ultradistributions and the theory of kernels, J. 
Fac. Sci. Univ. Tokyo, sec. IA, 20 (1973), 25— 
105; 24 (1977), 607—628; 29 (1982), 653-718. 
[15] M. Sato, Theory of hyperfunctions (in 
Japanese), Sugaku, 10 (1958), 1—27. 

[16] H. Komatsu (ed.), Hyperfunctions and 
pseudo-differential equations, Lecture notes in 
math. 287, Springer, 1973. 

[17] A. Martineau, Les hyperfonctions de 

M. Sato, Sém. Bourbaki, 13 (1960-1961), 

no. 214. 

[18] P. Schapira, Théorie des hyperfonctions, 
Lecture notes in math. 126, Springer, 1970. 
[19] A. Martineau, Le “edge of the wedge 
theorem" en théorie des hyperfonctions de 
Sato, Proc. Intern. Conf. Functional Analysis 
and Related topics, Tokyo, 1969, Univ. Tokyo 
Press, 1970, 95-106. 

[20] T. Kawai, On the theory of Fourier 
hyperfunctions and its applications to partial 
differential equations with constant coeffi- 
cients, J. Fac. Sci. Univ. Tokyo, sec. IA, 17 
(1970), 467—517. 

[21] K. Kataoka, On the theory of Radon 
transformations of hyperfunctions, J. Fac. Sci. 
Univ. Tokyo, sec. IA, 28 (1981), 331—413. 
[22] J.-E. Bjork, Rings of differential opera- 
tors, North-Holland, 1979. 


126 (1X.22) 
Dynamical Systems 


A. History 


The theory of dynamical systems began with 
the investigation of the motion of planets in 
ancient astronomy. Qualitative investigation 
of mechanics in antiquity and the Middle Ages 
culminated in the work of J. Kepler and G. 
Galilei in 17th century. At the end of that 
century, I. Newton founded his celebrated 
Newtonian mechanics, by means of which 
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Kepler's law on the motion of planets and 
Galileo's observations of movement can be 
explained theoretically. Following this, L. 
Euler, J. L. Lagrange, P. S. Laplace, W. R. 
Hamilton, C. G. J. Jacobi, and others devel- 
oped the theory using analytical methods, and 
founded analytical dynamics. From the end of 
the 18th century through the 19th century, the 
*three-body problem attracted the attention of 
many mathematicians. At the end of the 19th 
century, H. Bruns and H. Poincaré found that 
general solutions of the three-body problem 
could not be obtained by tquadrature, and this 
gave rise to a crisis of analytical dynamics. But 
this was resolved by Poincaré himself. He 
pointed out the importance of the qualitative 
theory based on topological methods, and 
obtained many fundamental results. A. M. 
Lyapunov with his theory of stability and 

G. D. Birkhoff with his many important con- 
cepts of topological dynamics established 
foundations of the new qualitative theory. 

In 1937 A. A. Andronov and L. S. Pontryagin 
introduced the concept of structural stability, 
which attracted the attention of S. Lefschetz. 
Lefschetz's school investigated structural sta- 
bility and *nonlinear oscillations, and obtained 
many important results (H. F. de Baggis, L. 
Markus, M. M. Peixoto, and others). In about 
1960, S. Smale initiated study of differentiable 
dynamical systems under the influence of Lef- 
schetz's school. Smale and his school founded 
a new theory of differentiable dynamical 
systems using ‘differential topology. D. V. 
Anosov generalized the work of E. Hopf and 
G. A. Hedlund on tgeodesic flows of closed 
surfaces of tconstant negative curvature and 
established the concept of Anosov systems, 
which played an important role in Smale's 
theory. The work of Hopf, Hedlund, and 
Anosov is closely related to tergodic theory. 
Ya. G. Sinai and R. Bowen obtained impor- 
tant results in ergodic theory. The concept of 
structural stability and its generalization are 
essential in the tcatastrophe theory of R. Thom 
(— 51 Catastrophe Theory); the theory of bi- 
furcation of dynamical systems is another 
essential part of catastrophe theory. D. Ruelle 
and F. Takens proposed a new mathematical 
mechanism for the generation of turbulence 
using Smale's theory and Hopf bifurcation. 
The new theory of dynamical systems devel- 
oped by Smale and others is now applied to 
the mathematical explanation of chaotic phe- 
nomena in many branches of science. Finally, 
we mention that in the 1960s A. N. Kolmo- 
gorov, V. I. Arnold, and J. Moser obtained 
remarkable results on the existence of quasi- 
periodic solutions for the n-body problem, 
which turned out to solve the long-standing 
problem of the stability of the solar system. 
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B. Definitions of Dynamical Systems 


In the study of the evolution of physical, bio- 
logical, and other systems, we construct math- 
ematical models of the systems. Usually, the 
state of a given system is completely described 
by a collection of continuous parameters, 
which may be related in some cases. Thus the 
space X of all possible states of the system can 
be regarded as a Euclidean space or a subset 
of a Euclidean space defined by some equa- 
tions. [n general, we assume that the space X 
of all possible states of the system forms a 
*topological space, and we call it a state space 
or a phase space. Second, we assume that the 
law of evolution of states in time is given, by 
which we can tell the state x, at any time r, if 
we know the state x, at time to. Assigning x, 
to xo, we have a mapping zit, . to): X o X for 
any times tọ and t,, which satisfies the follow- 
ing conditions: (i) x(t5, tj) o x(t,, tg) 9 n(t5, to); 
(it) 2(tg, to) = 1 y, the identity mapping of X. 
Finally, we assume that the mapping z(t,, to) 
depends only on t —t, — to. Writing z,= 

n(t,, tg) if t=t, — to, we have the following 
conditions from (i) and (ii) above: (i’) z, o x, = 
7,44, S, CER; (i) n9 — 1x. 

In general, the theory of topological dynam- 
ics can be regarded as the study of topological 
transformation groups (— 431 Transformation 
Groups) originating in the topological investi- 
gations of problems arising from classical 
mechanics. Here, we restrict our attention to 
some important special cases. 

(1) Let X be a topological space and R the 
additive topological group of real numbers. 
Let o: X x R-» X bea continuous mapping. 
For each teR, we define a mapping ọ,: X X 
by (x)= (x, t), xe X. If the family of map- 
pings Jo, ip satisfies the following conditions, 
we say that (X, o) is a (continuous) R-action, a 
(continuous) flow, or a (continuous) dynamical 
system on X, and that X is the phase space: 

(i) 9,0 9, — 9,., for all s, t€ R; (ii) o9 = 1x- 

Let (X, 9) be a flow. Then q,: Xo X isa 
thomeomorphism with (ol ! =, for each 
t€ R. For each xe X, define a mapping y*:R~— 
X by o" (t)  o(x, t) teR. The mapping q* is 
called a motion through x, and its image C(x) 
= (o*(t)| teR] is called the orbit or the trajec- 
tory through x. An orbit is nonempty, and two 
orbits are either identical or mutually disjoint. 
The family of orbits fills up the phase space X 
and is called the phase portrait. 

Let R, be the additive tsemigroup of all 
nonnegative real numbers. If we replace R by 
R, in the definition of a (continuous) flow, we 
obtain a definition of a (continuous) semiflow. 
For a semiflow (X, o), the mapping o: X >X, 
teR, is in general not a homeomorphism but 
a continuous mapping. 
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Let (X, 9) and (Y, V) be flows. A homeo- 
morphism A: X — Y is called a topological 
equivalence from (X, p) to (Y, y) if it maps 
each orbit of o onto an orbit of y preserving 
orientations of orbits Ge. there exists an in- 
creasing homeomorphism a. RR for each 
xe X such that ho(x, t) 2 v (h(x), «,(t)) for all 
t€ R). Two flows are topologically equivalent if 
there exists a topological equivalence from one 
to the other. If two flows are topologically 
equivalent, their phase portraits have the same 
topological structure. Two flows (X, 9) and 
(Y, y) are flow equivalent if there exist a c 0 
and a homeomorphism h: X > Y such that 
ho(x,t) - v(h(x), ct) for all teR. Such an h isa 
topological equivalence from (X, o) to (Y, y). 

(2) Let X be a topological space and Z the 
additive group of integers. If we replace R by 
Z in the definition of a continuous flow, we 
obtain a definition of a (continuous) Z-action, a 
discrete flow, or a discrete dynamical system on 
X. If (X, o) is a discrete flow, then f=@,:X > 
X is a homeomorphism and 9, — f" for all 
neZ. Conversely, for a given homeomorphism 
f: X 5 X, define a mapping o: X x Z X by 
Q(x, n) — f"(x), xe X and neZ. Then (X, o) is 
a discrete flow such that o, =f" for neZ. So 
we identify a homeomorphism with a discrete 
flow. Thus the orbit of a homeomorphism 
f: X-X through x is C(x)={ f"(x)|ne Z]. 

Let Z , be the additive semigroup of all 
nonnegative integers. If we replace R by Z, in 
the definition of a continuous flow, we obtain 
a definition of a discrete semiflow on X. For a 
discrete semiflow (X, o), the mapping o, X > 
X, neZ, is in genéral not a homeomorphism 
but a continuous mapping. We can identify 
a continuous mapping f: X 9 X with a discrete 
semiflow (X, @) in a natural way as above. 

Let f: X +X and g: Y Y be two homeo- 
morphisms (continuous mappings). A homeo- 
morphism A: X — Y such that ho f=goh is 
called a topological conjugacy from f to g. 

And f and g are called topologically conjugate 
if there exists a topological conjugacy from 

f to g. Topologically conjugate homeomor- 
phisms have the same phase portrait in a topo- 
logical sense. 

(3) Let M be a tdifferentiable manifold of 
class C (1 <r < oo or r2 o). A continuous flow 
(M, ¢) is a flow of class C", a C’-flow, a differ- 
entiable dynamical system of class C", or a 
one-parameter group of transformations of 
class C’, if 9: M x RA M is of class C". A semi- 
flow of class C" is defined similarly. 

Let (M, 9) and (N, y) be C’-flows. A topo- 
logical equivalence h: M >N from (M, o) to 
(N, V) is called a C'-equivalence if it is a *C'- 
diffeomorphism. Two flows are C’-equivalent if 
there is a C’-equivalence from one to the other. 
Classification of C’-flows by C’-equivalence is 
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dificult and sometimes too unwieldy to work 
with. On the other hand, there are many prob- 
lems which can be solved by the knowledge 

of the topological structure of the phase por- 
trait of C'-flows. 

(4) Let (M, n) be a discrete flow on a dif- 
ferentiable manifold M of class C". If x: M x 
Z-— M is of class C”, then (M, z) is called a Z- 
action of class C”, a discrete flow of class C", a 
discrete C’-flow, or a discrete dynamical system 
of class C". If (M, x) is a discrete C’-flow, then 
fn1,:M—M is a C’-diffeomorphism. Con- 
versely, a C’-diffeomorphism f: M — M defines 
a discrete C’-flow in a natural way on M, and 
we identify a C’-diffeomorphism f: M >M 
with the discrete C’-flow on M defined by f. 

A discrete semiflow of class C” is defined simi- 
larly, and a *C'-mapping f: MM is identi- 
fied with a discrete semiflow of class C" on M 
defined by f. 

Let /: M—M and g: N >N be C- 
diffeomorphisms (C’-mappings) of differ- 
entiable manifolds M and N of class C". A 
topological conjugacy from f to g is called a 
C"-conjugacy if it is a C'-diffeomorphism. f and 
g are C'-conjugate if there is a C'-conjugacy 
from f to g. 


C. Examples and Remarks 


(1) Let D be an open set of R” and f: D5R"a 
continuous mapping. Consider the tautono- 
mous system of ordinary differential equations 


dx/dt— f(x, | xeD. (1) 


We assume that for each xeD there exists a 
unique *nonextendable solution q(x, t) with 
the initial condition g(x, 0) 2 x defined on a 
maximal interval (a,, b), —oo «a, «0 « b, < oo 
(— 316 Ordinary Differential Equations (Ini- 
tial Value Problems)). The set {@(x, t)|a,<t< 
b,i is called the trajectory through x. By the 
uniqueness assumption, we have o(q(x, t), s)= 
(x, t +s) whenever both sides of the equality 
are defined. When (a,, b,)=R for all xe D, 
then equation (1) is called complete. If (1) is 
complete, the mapping o: D x RD defined 
by the solutions @(x, t) determines a continu- 
ous flow (D, o). Furthermore, if f is of class 
C’, then (D, o) is of class C". If (1) is not com- 
plete, then there exists a continuous positive 
scalar function a: DR such that 


dxjdt = x(x) f(x), 


is complete. The trajectories of (1) and (2) 
through x coincide for all x € D, and thus the 
phase portraits of (1) and (2) are the same. 

(2) Let M be a differentiable manifold of 
class C”. A tvector field of class C” on M 
(1 &r € oo) gives rise to an autonomous system 


xeD, (2) 
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of ordinary differential equations in a coordi- 
nate neighborhood of each point of M, and it 
generates a !local one-parameter group of 
local transformations of class C". If M is tcom- 
pact, this local one-parameter group of local 
transformations is uniquely extended to a one- 
parameter group of transformations of class 
C (— 105 Differentiable Manifolds). Thus 

a vector field of class C' on M generates a 
unique C’-flow on M if M is compact. There- 
fore, sometimes we identify a vector field of 
class C” with the C’-flow generated by it. 

(3) Let (M, o) be a C'-flow. Then q,4:M 
M is a C’-diffeomorphism, which we call the 
time-one mapping (time-one map) of (M, ol. 
Thus every C’-flow (M, o) induces a C’- 
diffeomorphism as a time-one mapping. But 
the set of C!-diffeomorphisms that are time- 
one mappings of C!-flows is of the first cate- 
gory in the space of all C!-diffeomorphisms 
with C! topology (J. Palis). Thus most C'!- 
diffeomorphisms are not expressed as time-one 
mappings of C!-flows. 

(4) Let M be a compact differentiable mani- 
fold of class C” with dimension m and (M, o) a 
C"-flow (1 <r < oo). An (m — 1)-dimensional 
closed submanifold Z of M is called a cross 
section of (M, ọ) if the following conditions are 
satisfied: (i) For any xe X, there exist t, >0 
and t; <0 such that o, (x), o, (x)e £; (ti) Every 
orbit intersects X ttransversally whenever it 
meets X. Let X be a cross-section of (M, o) and 
xe. Let t, be the least positive number with 
Q9, lei. Such a t, exists for every xe X, and 
f: Z defined by f(x)= o, (x), xeZ is a C- 
diffeomorphism. We call this diffeomorphism f 
the first-return mapping (first-return map) or 
the Poincare mapping (Poincaré map) for X. 

(5) Let N be a compact differentiable 
manifold of class C? and f: N>N a C’- 
diffeomorphism (1 € r € oo). Define an equiva- 
lence relation ~ on N x R generated by (x, t + 
I) - (f(x), t) for xe X, teR. Then the quo- 
tient space N(f) 2 N x R/~ has a natural 
tdifferentiable structure of class C”, and a C’- 
flow (N( f), v) is defined by v([x,t], s)= [x t + 
s] for xeN, t, seR, where [x,t]e N(f) is the 
equivalence class of (x, t). The flow (N( f), y) 
thus obtained is called the suspension of f. The 
suspension (N( f), y) has a cross section X = 
{Lx,0]|xeN}, and the Poincaré mapping 
for X is C’-conjugate to f. Conversely, if (M, o) 
has a cross section X and the Poincaré map- 
ping for J is f: Z 5 X, then the suspension 
(X( f), v) is C’-equivalent to (M, ol 

(6) Let U be an open set of R” and f: U >R" 
a continuous mapping. Consider the tdiffer- 


ence equation 
Xm = f(x). XmE U. (3) 


For each xe U, let o(x,m) be the solution of (3) 


126 D 
Dynamical Systems 


with (x, 0) 2 x. If /(U) c U, then o(x, m) exists 
for all meZ, and xeU, and q defines a dis- 
crete semiflow on U. If f: U—U is a homeo- 
morphism, then o(x, m) exists for all me Z 

and xe U, and q defines a discrete flow on U 
(— 104 Difference Equations). 


D. Basic Concepts 


For simplicity we assume that the phase 
spaces of dynamical systems are metric spaces, 
and we denote their metrics by d. 

(1) Let (X, o) be a continuous flow on a 
metric space X and xe X. A subset A of X is 
called invariant if o,(A) c A for all teR. A 
subset of X is invariant if and only if it is a 
union of orbits. A subset A of X is positively 
(resp. negatively) invariant if (A) c A for all 
t Z0 (resp. t<0). The subset C, (x)= (9^(0|tz 
0} (resp. C_(x)= {¢*(t)|t <0}) is called the 
positive (resp. negative) semiorbit or the posi- 
tive (resp. negative) half-trajectory starting 
fromex. A subset is positively (resp. negatively) 
invariant if and only if it is a union of positive 
(resp. negative) semiorbits. A subset is invari- 
ant if and only if it is both positively and nega- 
tively invariant. 

The union and the intersection of invariant 
sets are invariant. If A is invariant, then its 
closure A, its interior Int A, its boundary 0A, 
and its complement A* — X — A are invariant. 
If A is invariant, then @(A x R)c A and the 
restriction mapping ~|A x R defines a continu- 
ous flow (A, 9| A x R) on A. The flow thus 
obtained is called the restriction of (X, 9) on A. 

(2) A point xe X is a singular point, an equi- 
librium point, a critical point, a rest point, or 
a fixed point if C(x) 2 (xj (— 290 Nonlinear 
Oscillation). A point is regular or nonsingular if 
it is not a singular point. The set of all singular 
points is a closed invariant set, and the set of 
all nonsingular points is an open invariant 
set. If A is a positively invariant set which is 
homeomorphic to the closed unit ball in R”, 
then there exists a singular point in A (tBrou- 
wer fixed-point theorem). 

A point xe X is periodic if there exists a 
T #0 such that 


mix t- T)= (x,t) (4) 


holds for all te R. If x is periodic, the motion 
o" and the orbit C(x) are said to be periodic. A 
point x is periodic if and only if there exists a 

T 40 with g(x, T) — x. A singular point is 
periodic. For a periodic point x, a number T 
satisfying (4) is called a period of x. If x is 
nonsingular and periodic, then there exists 

a smallest positive period T, of x, and any 
period is an integral multiple of Tọ. An orbit of 
a nonsingular periodic point is called a closed 
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orbit. If C is a closed orbit, then all points of C 
are nonsingular periodic points, and their 
smallest positive periods coincide. A closed 
orbit is compact. 

(3) Let x be a point of X. A point ye X is 
called an w-limit (resp. a-limit) point or a posi- 
tive (resp. negative) limit point of x if there 
exists a sequence {t,} of real numbers such 
that (1) t, — oo (resp.t,— —oo) as n—oo and 
(ii) (x, t,) ^ y as n> oo. The set of all o-limit 
(resp. a-limit) points of x is denoted by w(x) 
(resp. «(x)) and is called the w-limit (resp. g- 
limit) set of x. For each xe X, w(x) and «(x) 
are closed invariant sets, and the following 
equalities hold: C, (x)= C, (x) Uox(x), C. (x) 
C. (x)Ua(x), and C(x) 2 C(x)Ua(x)U w(x). 

If x is a periodic point, C(x) = C(x)=a(x)= 
(x). If C is a closed orbit, then C = C = C(x) = 
a(x) = ox) for all xe C. If A is a compact in- 
variant set and xe A, then «(x) and w(x) are 
nonempty. 

Assume that X is tlocally compact and 
xe X. Then w(x) is connected if it is compact, 
and none of the connected components of w(x) 
is compact if a(x) is not compact. 

(4) Let x be a point of X. Let J, (x) (resp. 
J_(x)) be the set of all points y satisfying the 
following condition: There exist a sequence 
{ta} of numbers and a sequence (x,) of points 
in X such that (i) t, oo (resp. t, —oo) as n> 
oo, (il) x,—x as n oo, and (iii) o(x,, t,) y 
as n— oo. The set J , (x) (resp. J. (x)) is a closed 
invariant set containing (x) (resp. «(x)), called 
the first positive (resp. negative) prolongational 
limit set of x. If X is locally compact, then 
J (x) is connected if it is compact, and none of 
the connected components of J, (x) is compact 
when J., (x) is not compact. 

Notions of higher prolongations have been 
defined and investigated by T. Ura, J. Aus- 
lander, and P. Seibert. 

(5) For a discrete flow, we can similarly 
define basic notions such as an invariant set, 
fixed point, periodic point, and so on defined 
in Sections D(1)- D(4) by replacing R by Z. 
The propositions and theorems stated above 
hold for discrete flows, except those concern- 
ing connectedness. 





E. Recursive Concepts and Dispersive Concepts 


(1) Let (X, o) be a flow on a metric space X. 
Let xe X be a point such that there exist a 
neighborhood U of x and a T >Q satisfying the 
condition U (1o,(U) = @ for tz T. Then x is 
called wandering. The set of all wandering 
points is an open invariant set. A point is 
nonwandering if it is not wandering. The set O 
of all nonwandering points is a closed invar- 
iant set and is called the nonwandering set. 
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The following conditions are equivalent: (i) x 
is nonwandering, (ii) x eJ, (x), (iii) xeJ. (x). 
The nonwandering set Q contains all singular 
points, closed orbits, and (x) and a(x) for all 
xeX. 

(2) A point xe X is positively (resp. nega- 
tively) Poisson stable if xe w(x) (resp. xea(x)). 
It is Poisson stable if it is both positively and 
negatively Poisson stable. A positively Poisson 
stable point and a negatively Poisson stable 
point are nonwandering. The following con- 
ditions are equivalent: (1) x is positively Pois- 
son stable, (ii) C, (x) = (x), (iii) C(x) w(x), (iv) 
for any neighborhood U of x and T» 0, there 
exists a t» T with o(x,t)e U. If X is tcomplete 
and xe X is positively Poisson stable but not 
periodic, then cox) — C(x) is dense in a(x). 
This implies that if X is complete, then C(x) is 
periodic if and only if C(x) 2 ox). 

(3) If all the points of the phase space X are 
wandering, then (X, ol is called completely 
unstable. If all the points of X are nonwander- 
ing, then (X, ¢) is called regionally recurrent. If 
A is an invariant set such that the restriction 
of (X,@) on A is regionally recurrent, then 
(X, @) is said to be regionally recurrent on A. If 
(X, 9) 1s regionally recurrent and X is locally 
compact, then the set of all Poisson stable 
points is dense in X. 

For a given flow (X, o), we obtain a se- 
quence of invariant sets (Q,) and a sequence 
of restriction flows {(Q,, @,)} such that (i) 

(Q5, 99) =(X, p); (1) O,,, is the nonwandering 
set of (O,, ¢,), n2 0; and (ii) (QO,,,,,@,4,) is the 
restriction of (X, o) on Q,,,. Then X 209520, 
25... 250,5.... Put Q, 2 (,Q,. Then Q, is an 
invariant set of (X, ol, and we denote the re- 
striction of (X, o) on Q, by (Qu, Pa). Starting 
from (Q,,, Po), we obtain similarly a sequence 
of invariant sets {Q,,,,} and a sequence of 
flows ((0,.,, Po+n)}- If we obtain an ordinal 
number y such that Q,=Q,,,#@ by continu- 
ing this process, then we call Q, the set of 
central motions. In this case, the flow (Q,, o,) is 
regionally recurrent, and every invariant sub- 
set of X on which (X, ¢) is regionally recurrent 
is contained in Q,. When X is fseparable and 
Q, is compact and nonempty, then the mini- 
mum of such ordinal y is an fordinal of at 
most the second number class. 

(4) Let & be the set of all xe X satisfying the 
following condition: For each e, T 7 0, there 
exist a sequence (xo =X, x,, ..., xy =X} of 
points in X and a sequence (to, t,, ..., t, ,] of 
numbers such that t; 2 T and dien, (xj), xi,1) «€ 
for 0<i<k—1. The set & is a closed invar- 
iant set containing the nonwandering set Q 
and is called the chain recurrent set. If X = 2, 
then (X, p) is called chain recurrent. The re- 
striction (2, o|2? x R) of (X, 9) on & is chain 
recurrent (C. C. Conley). 





126 E 
Dynamical Systems 


(5) A nonempty closed invariant set is called 
a minimal set if none of its nonempty proper 
closed subsets is invariant. For a nonempty 
compact subset A of X, the following con- 
ditions are equivalent: (i) A is minimal, (ii) C(x) 
= A for all xe A, (iii) C, (x) — A for all xe A, 
(iv) C_(x)=A for all xe A, (v) w(x)=A for all 
x€ A, (vi) a(x) — A for all xe A. A point xe X is 
positively (resp. negatively) Lagrange stable if 
C, (x) (resp. C (x)) is compact. If C(x) is com- 
pact, then x is called Lagrange stable. Every 
nonempty compact invariant set contains a 
compact minimal set. If x e X is positively 
(resp. negatively) Lagrange stable, then a(x) 
(resp. «(x)) contains a compact minimal set. 

A closed invariant set A is called a saddle set 
if there exists a neighborhood U of A such that 
every neighborhood of A contains a point x 
such that C, (x) U and C (x)  U. Otherwise 
A is called a nonsaddle set. For a point x of X, 
let E(x) be the subset of X consisting of the 
points y satisfying the following condition: 
There exist a sequence {x,} of points in X and 
two sequences {t,}, {s,} of numbers such that 
(i) x, x, t, — 00, S> —00 as n>% and (ii) 
Q(x,, ta) 9 y, P(X, $,)— y as n oc. For a subset 
B of X, put E(B) — | )xes E(x). Let {S,} be the 
family of all saddle minimal sets and {F,} the 
family of all nonsaddle minimal sets. If the 
phase space X is compact, then the nonwan- 
dering set Q—(( Jj FAU) E(S)) le Fp) U 
E(| J, Sa) (T. Saito). 

(6) A point xe X is said to be recurrent if for 
any £20 there exists a T — 0 such that the tz- 
neighborhood U of g*([t, t-- T ]) contains C(x) 
for all te R. If x is recurrent, the motion o? 
and the orbit C(x) are said to be recurrent. 
Every orbit of a compact minimal set is recur- 
rent, and if the phase space is complete, then 
the closure of a recurrent orbit is a compact 
minimal set (Birkhoff). 

A set D of real numbers is called relatively 
dense if there exists a T>0 such that DN 
(t, t- T) Ø for all teR. Assume that xe X is 
Lagrange stable, then x is recurrent if and only 
if for every e 0 the set {t|d(x, p(x, t)) <e} is 
relatively dense. 

(7) A flow (X, q) is dispersive if for any x, 
yeX there exist neighborhoods U, of x and U, 
of y and a T>0 such that U, 1 o,(U,) = Ø for 
all £z T. The following conditions are equiva- 
lent: (i) (X, o) is dispersive; (ii) For any x, ye X, 
there exist neighborhoods U, of x and U, of y 
and a T>0 such that U.N o,(U,) =Ø if |t|> T; 
(itt) J (x) 2 Ø for all xe X. 

A flow (X, ọ) is parallelizable if there exist 
a subset S of X and a homeomorphism h: 

X >S x R such that (i) g(S x R)=X and (ii) 
ho q(x,t) (x, t) for all xeS and teR. A flow 
(X, 9) 1s parallelizable if and only if there is a 
subset S of X satisfying the following con- 
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ditions: (i) For each xe X there exists a unique 
t(x)ER with 9(x, x(x))e S; (ii) c: X OR is con- 
tinuous. A parallelizable flow is dispersive. A 
flow on a locally compact separable metric 
space is parallelizable if and only if it is disper- 
sive (V. V. Nemytskii, V. V. Stepanov). 

An open set U of X is called a tube if there 
exist a T» 0 and a subset S of U satisfying the 
following conditions: (i) e(S x (— T, T)) =U, (ii) 
For each xe U there is a unique z(x)e R with 
|t(x)|< T such that p(x, t(x))e S. The set S in 
the above definition is called a local section. If 
x€ X is a regular point, then there exists a tube 
containing x (M. Bebutov, H. Whitney). The 
notion of a local section is a generalization 
of Poincare's "surface sans contact" [1] or 
Birkhoff's "surface of section" [4], and it is 
related to the notion of the cross section. 

(8) For discrete flows (homeomorphisms), 
basic notions, such as a nonwandering set, 
Poisson stability, regional recurrence, central 
motion, and minimal set, are defined similarly, 
and many of the propositions and theorems in 
sections E(1)- E(5) hold similarly for discrete 
flows (homeomorphisms). 


F. Stability 


(1) Let (X, o) be a continuous flow on a metric 
space X. A point xe X is called orbitally stable 
if for any ¢>0 there exists a ô >0 such that 
C,(y) is contained in the e-neighborhood of 

C, (x) for all y with d(x, y) «ó (— 394 Sta- 
bility). A nonempty set A is called stable if 
every neighborhood of A contains a positively 
invariant neighborhood of A. If A is compact 
(in particular, if A is a periodic orbit), then 
orbital stability and stability for A are equiva- 
lent. A nonempty set A is called asymptotically 
stable if A is stable and there exists a neighbor- 
hood V of A such that c(x)c A for any xeV. 
If A is stable and o(x)c A for all xe X, then A 
is called globally asymptotically stable. A point 
x€ X is Lyapunov stable if for any e>0 there 
exists a ô >Q such that d(q,(x), o,(y)) « e for all 
t20 and y with d(x, y) « à. Lyapunov stability 
implies orbital stability. A point x is uniformly 
Lyapunov stable if for any ¢>0 there exists a ô 
7 D such that for ze C(x) and y with d(y, z) « ó 
we have d(q,(z), 9,(y)) <e for all t>0. Uniform 
Lyapunov stability implies Lyapunov stability. 
For a singular point, the notions of uniform 
Lyapunov stability, Lyapunov stability, orbital 
stability, and stability are equivalent. Assume 
that the phase space X is locally compact and 
A is a nonempty compact subset of X. Then A 
is asymptotically stable if and only if there 
exist a neighborhood N of A and a continuous 
real-valued function L on N such that (i) L(x) 
—0 if xe A and L(x)>0 if x é A; (ii) L(@(x, t)) 
< L(x) if xé A, t» 0, and p({x} x [0,t]) c N. 


492 


Such a function L is called a Lyapunov func- 
tion for A (— 394 Stability). Assume that X is 
locally compact and A is nonempty, stable, 
and invariant. Then A is asymptotically stable 
if and only if there exists a neighborhood U 
of A such that any invariant set in U is neces- 
sarily contained in A (Ura). Let A be a non- 
empty closed invariant set. A is called an at- 
tractor if it has an open neighborhood U satis- 
fying the following conditions: (i) U is posi- 
tively invariant; (ii) For each open neighbor- 
hood V of A, there is a T>0 such that q,(U)c 
V for all t> T. Condition (ii) implies that 
(\:209(U)=A and o(x)c A for all xe U. 
Thus an attractor is asymptotically stable. If A 
is an attractor, the basin of A is the union of 
all open neighborhoods of A satisfying (i) and 
(ii). 

(2) Assume that the phase space X 1s com- 
plete. A motion oe (xe X) is called almost 
periodic if for any ¢>0 there exists a rela- 
tively dense subset To, in R such that d(g*(t), 
Q" (t 4- 1,)) <e for all te R and t, (— 18 Almost 
Periodic Functions). If @* is almost periodic, 
then @” is almost periodic for all ye C(x). If A 
is a compact minimal set and if 9* is almost 
periodic for some x in A, then every motion 
oi, ye A, is almost periodic. An almost periodic 
motion is recurrent. The converse is not true. 
But if x is recurrent and Lyapunov stable in 
C(x) (i.e., in the restriction flow (C(x), o | C(x) 
x R)), then q* is almost periodic. If x is uni- 
formly Lyapunov stable in C(x) and negatively 
Lagrange stable, then o" is almost periodic 
(A. A. Markov). 


G. Singular Points and Closed Orbits 


In this section we assume that the phase space 
is a tparacompact differentiable manifold of 
class C? with metric d. 

(1) Let E be a finite-dimensional real vector 
space and L: E E a linear automorphism. L 
is called hyperbolic if it has no teigenvalues of 
absolute value 1. If L: EE is hyperbolic, 
there are unique vector subspaces E" and E" 
satisfying the following conditions: (i) E = 
E* @ E", (it) L(E*) - E! and L(E") = E", (iii) if 
-|| is a tnorm on E, then there exist constants 
c»0and0«4«1 such that, for any positive 
integer m, || L"(v)|| x c^" ||v|| when ve E* and 
|L "(v)| x cA" llv|| when ve E". The zero 0 of E 
is the only fixed point of a hyperbolic linear 
automorphism. Put s=dim E? and u = dim E". 
Then s-- u — dim E, and s (resp. u) is the num- 
ber of eigenvalues of absolute value « 1 (resp. 
7 1) counted with multiplicity. A topological 
conjugacy class of a hyperbolic linear auto- 
morphism L: EE is determined by s, u, and 
the signs of det(L| E^) and det(L | E"), where 
det(L | E^) is the tdeterminant of the restriction 
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L|E*: E'O E* of L on E? (o=s, u). Further 
investigations of topological classification of 
linear automorphisms have been carried out 
by N. H. Kuiper and J. W. Robbin. 

(2) Let f: Ro R" be a C’-diffeomorphism 
(1 &r € oo). Assume that the origin Oc R” is a 
fixed point of f. Then the differential dfo: R” 
>R” of f at 0 is a linear automorphism. It is 
given by dfo(x) 2 Jo( f)x, xe R", where Jo( f) is 
the tJacobian matrix of f at 0 and x is ex- 
pressed as a column vector. If df, is hyperbolic, 
then f is topologically conjugate to df, in a 
sufficiently small neighborhood of 0 (P. Hart- 
man). Assume that f is of class C”, and let 
Au, An (possibly repeated) be the eigenvalues 
of dfo. If A; AA... Ap" for all 1 <i<n and for 
all nonnegative integers m,,...,m, with 2< 
M, +... +M, then in a sufficiently small 
neighborhood of 0, f is C?-conjugate to df, 
(S. Sternberg). 

(3) Let f: M —^M be a C’-diffeomorphism 
(1 &r & oo) of a differentiable manifold M of 
class C*. Let pe M be a fixed point of f. Then 
the ‘differential df, of f at p is a linear auto- 
morphism of the ttangent space T, (M) of M at 
p. If df, is hyperbolic, then p is called a hyper- 
bolic fixed point of f. Let p be a hyperbolic 
fixed point of f and T,(M)= E* @ E" be the 
direct sum decomposition with respect to df,. 
Put W*(p)2 (xeM | "(x) 5p as n o0] and 
W"(p)— (xe M|f""G)p as noc]. W*(p) 
and W"'(p) are invariant sets and images of 
tinjective immersions of class C” of vector 
spaces E° and E", respectively. At the point p, 
W*(p) and W"(p) are tangent to E? and E", 
respectively [21]. We call W*(p) (resp. W"(p)) 
the stable (resp. unstable) manifold of f at p. In 
a sufficiently small neighborhood of a hyper- 
bolic fixed point p of f, f is topologically 
conjugate to its differential df,. Therefore a 
hyperbolic fixed point is an isolated fixed point 
(i.e., there are no fixed points in its sufficiently 
small neighborhood except itself). A hyperbo- 
lic fixed point p is called a source if dim W*(p) 
— 0 and a sink if dim W"(p) 2 0. Otherwise it is 
a saddle point. The number of topological 
conjugacy classes of hyperbolic fixed points on 
an n-dimensional manifold is 4n. 

Let pe M be a periodic point of f and m the 
smallest positive period of p. Replacing f by 
f", we obtain notions of hyperbolicity, stable 
manifold, and so on for the periodic point p. 
We also obtain propositions and theorems 
similar to those stated above for periodic 
points. 

(4) Let A be a real n x n matrix. Consider the 
system of linear differential equations 


dx/dt=Ax,  xeR". (5) 


This equation generates a C?-flow (R", o), and 
om. RR" is given by @,(x)=e'4x, where e is 
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the *exponential of the matrix tA. Thus o, is a 
linear automorphism for all t e R. The origin 
Oc R" is a singular point of (R", o). If none of 
the real parts of the eigenvalues of A are zero, 
we call Oc R” a hyperbolic singular point of (5). 
If Oc R" is a hyperbolic singular point of (5), 
then there exist two vector subspaces E? and 
E" of R" satisfying the following conditions: (i) 
R” = EG E"; (ii) A(E*)= E* and A(E") — E"; 
(ii) E? and E" are invariant sets of (R", ot (iv) 
there exist positive constants c and 4 such 
that for all £z 0, |o,(x)] «ce ^ xl when xe E 
and |o. ol «ce ll when xe E", where ||: || 
is the norm on R". The origin 0 is a hyperbolic 
singular point of (5) if and only if the time- 
one mapping o, —e^:R"—R" of o is a hyper- 
bolic linear automorphism. If 0 is a hyperbolic 
singular point of (5), the above direct sum 
decomposition R” = E? @ E" coincides with the 
one with respect to g,. Put s=dim ET and u = 
dim E". Then s+u=n, and s (resp. u) is the 
number of all eigenvalues of A with negative 
(resp. positive) real parts counted with multi- 
plicity. Also we obtain the following proper- 
ties: (i) xe E* (resp. E" « o,(x) 50 as t— oo 
(resp. t —oo), (ii) x € E (resp. E") = Loisi 
— 0o as t oo (resp. t— — oo), (iii) the origin 0 
is the only singular point. 

Let A and B ben xn real matrices. Let 
(R", g,) and (R", pp) be the flows determined 
by the equations dx/dt = Ax and dx/dt = Bx, 
respectively. Assume that Oc R" is a hyperbolic 
singular point for both of the equations. Let s 
and u (resp. s' and v) be the integers defined 
for p, (resp. qp) in the above paragraph. Then 
the following conditions are equivalent: (1) 
(R^, o4) and (R", pg) are flow equivalent, (ii) 
(R", @,) and (R", oy) are topologically equiva- 
lent, (iti) s=s’, (iv) u2w', (v) (p4), and (og), are 
topologically conjugate. Further investigations 
of the phase portrait of the equation dx/dt — 
Ax without the assumption of hyperbolicity 
were done by Kuiper. 

(5) Let D be an open set of R” and f:D5R" 
a continuous mapping. Consider the system of 
differential equations (1), which we write down 
again here: 


dx/dt= f(x) xeD. (6) 


A point pe D is a singular point of (6) if the 
trajectory through p consists of a single point 
p. If (6) generates a flow (D, 9), then p is a 
singular point of (6) if and only if p is a sin- 
gular point of (D, o). A point p is a singular 
point of (6) if and only if f(p) —0. 

Suppose further that f is a C'-mapping 
(1 &r « oo). A singular point p of (6) is called 
hyperbolic if none of the real parts of the eigen- 
values of the Jacobian matrix J,( f) of f at p is 
zero. Let p be a singular point of (6). We as- 
sume for simplicity that p=0 and (6) generates 
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a flow (D, o). Denote the Jacobian matrix of f 
at 0 by A, and let (R", o) be the flow generated 
by the equation dx/dt = Ax. If 0 is a hyperbolic 
singular point, then in a sufficiently small 
neighborhood of 0, the flows (D, ol and (R", o4) 
are flow equivalent and hence topologically 
equivalent (Hartman, D. M. Grobman). Let 
An, Ån (possibly repeated) be the eigen- 
values of A. If f is of class C? and 4,z m,4,4- 
c mQA, for all 1 <i<n and for all nonnega- 
tive integers m,,...,m, with 2<m,+...+m,, 
then in a sufficiently small neighborhood of 0, 
two flows (D, o) and (R", o,) are C?-equivalent 
(Sternberg). 

(6) Let M be a paracompact differentiable 
manifold of class C* and V a vector field of 
class C' (1 <r < oo) on M. For simplicity we 
assume that V generates a C’-flow (M, o). For 
each point p of M, take a coordinate neighbor- 
hood (U, a) of class C? around p, where U is 
an open neighborhood of p in M and a: UR" 
is a homeomorphism onto an open set D in R". 
Using (U, x), we can express the vector field V 
as a system of ordinary differential equations 
of the form (6) in D. The eigenvalues 4,, ...,À, 
(possibly repeated) of the Jacobian matrix of f 
at a(p) are independent of the choice of local 
coordinates (U, x) around p. Thus /,,...,2, are 
invariants of the C'-equivalence, but they are 
not invariants of the topological equivalence. 

A singular point p of a flow (M, o) (ora 
vector field V) is called hyperbolic if «(p) is a 
hyperbolic singular point of the corresponding 
equation (6) (i.e., none of the real parts of the 
above eigenvalues 4,, ..., 2, are zero). A sin- 
gular point of (M, ~) is hyperbolic if and only 
if it is a hyperbolic fixed point of the time-one 
mapping o, of o. A hyperbolic singular point 
p is an isolated singular point. 

Let p be a hyperbolic singular point of 
(M, o) and T,(M) — E? @ E" the direct sum 
decomposition with respect to o,. Put s= 
dim E? and u 2 dim E". Then s (resp. u) is the 
number of all eigenvalues 4; of negative (resp. 
positive) real parts. Put W*(p) - ÍxeM | o,(x) 
p as t2 oo) and W'(p)- xeM|o ,(x)op 
as t ooj. W*(p) and W"(p) are invariant sets 
and images of injective immersions of class C" 
of vector spaces E* and E", respectively. At 
the point p, W^(p) and W"(p) are tangent to 
E* and E", respectively. We call W*(p) (resp. 
W"(p)) the stable (resp. unstable) manifold of f 
at p. The stable manifold and the unstable 
manifold of (M, @) at a hyperbolic singular 
point p coincide with those of the time-one 
mapping ¢, at p. A hyperbolic singular point 
p is called a source if dim W*(p)=0 and a sink 
if dim W"(p) 20. Otherwise it is called a saddle 
point. If p is a source (resp. a sink), then W“(p) 
(resp. W*(p)) is a neighborhood of p. A sin- 
gular point p is a source (resp. a sink) if and 
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only if all real parts of the corresponding eigen- 
values 2,, ..., 4, are positive (resp. negative). 

(7) Let V be a vector field of class C” on a 
C-differentiable manifold M of dimension n 
z 2 and (M, p) the C’-flow generated by V. Let 
C be a closed orbit of (M, o), p a point of C, 
and T>0 its smallest positive period. Then 
p is a fixed point of the C’-diffeomorphism 
Q1:M ^ M. Let V(p)e T, (M) be the value of V 
at p and (do ),: T,(M)— T,(M) the differen- 
tial of o, at p. Then (do), (V(p)) — V(p), and 
V(p) #0. Therefore 1 is an eigenvalue of (do). 
The other eigenvalues 4,, ..., 4, , (possibly 
repeated) of (dør), do not depend on a choice 
of pe C and are called the characteristic multi- 
pliers of C. If none of the characteristic multi- 
pliers of C is of absolute value 1, we call C a 
hyperbolic closed orbit. If C is a hyperbolic 
closed orbit, then there exist vector subspaces 
E and E for each peC satisfying the follow- 
ing conditions: (i) T,(M)= L(V(p)) 6 tot, 
where L(V(p)) is the 1-dimensional subspace 
generated by V(p); (1i) (do) (Ep) Elan (o = 
8, u) for all te R, where (do), : T, (M) T, (M). 
In particular, (dor), (E5) — Ei (a =s, u) (iii) 

dim E; (resp. dim E?) is independent of pe C 
and is equal to the number of 4; of absolute 
value «1 (resp. 71). Put W*(C)={xeM| 
doch C) 0 as t 00; and W"(C)  (xeM | 
d(p_,(x), C) 30 as too]. Then W*(C) (resp. 
W"(C)) is an invariant set and is an injectively 
immersed C’-submanifold of M which is tan- 
gent to L(V(p)) ® E; (resp. L(V(p)) ® E") for 
each pe C. W*(C) (resp. W"(C)) is called the 
stable (resp. unstable) manifold for C. 

Let p be a point of a closed orbit C. An 
embedded (n — 1)-dimensional disk D of class 
C” in M containing p is called a cross section 
for a closed orbit C if V is transverse to D (i.e., 
T.((M) = L(V(x)) ® T,(D) for each xe D) and 
DC —(pj. For a given cross section D for C, 
there exists a neighborhood U of p in D such 
that for any xe U there exists a t(x)>0 such 
that (x, «(x))eD and o(x,t)£D for0«t« 
t(x). A mapping f: UA D, defined by f(x) — 
(x, t(x)), xe U, is called a Poincaré mapping 
for D. The C’-conjugacy class of the tgerm of 
a Poincaré mapping f: V D at p is indepen- 
dent of the choice of pe C and the cross section 
D for the closed orbit C. The point p is a fixed 
point of f, and the eigenvalues of df, coincide 
with the characteristic multipliers of C includ- 
ing multiplicity. Therefore C is hyperbolic if 
and only if pe C is a hyperbolic fixed point of 
f. In a sufficiently small neighborhood of C, 
the flow is C'-equivalent to the suspension of f. 
The topological equivalence of the flow in a 
sufficiently small neighborhood of C is deter- 
mined by the topological conjugacy class of f. 
The number of topological equivalence classes 
of hyperbolic closed orbits which appear in a 
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flow on an n-dimensional manifolds is 4(n — 1) 
(M. C. Irwin). 

(8) Let V be a vector field of class C" (1 <r < 
o0) on a compact C?-manifold M. If M has 
a nonempty boundary, we assume that V 
points outward at all boundary points. Let p 
be an isolated singular point of V and denote 
by i(p) the tindex of the vector field V at the 
singular point P. If all singular points are 
isolated, then the sum X, i(p) is independent of 
V and is equal to the *Euler-Poincaré charac- 
teristic (M) of M (Poincaré, H Hopf) (— 153 
Fixed-Point Theorems). A vector field (or a 
flow) is called nonsingular if it has no singular 
points. If y(M) #0, then any vector field (and 
hence any flow) on M has a singular point. If 
y(M)=0, then there exists a nonsingular vector 
field on M. If M is 2-dimensional and without 
boundary, then M admits a nonsingular vector 
field if and only if M is a ftorus or a tKlein 
bottle. 

There are many directions in which gen- 
eralizations of the Poincaré-Hopf theorem can 
be made. For example, an index of an isolated 
closed orbit of a flow has been defined and 
investigated by F. B. Fuller. 

The phase portrait near a singular point or 
a closed orbit which is not hyperbolic is com- 
plicated. Many results in general situations 
have been obtained for planer flows, but there 
are only a few for higher-dimensional flows. 


H. Generic Properties and Structural Stability 


(1) Let M be a compact C?-manifold without 
boundary. The set 2"(M) of all vector fields of 
class C” (1 <r < oo) on M forms a real vector 
space in a natural way. We can give a norm 

IF ||, (called a C'-norm) for Ve Z'(M) using its 
expressions in a given suitable system of local 
coordinates of class C? on M and their par- 
tial derivatives up to order r. By virtue of 

this norm, Z'(M) is a Banach space with the 
topology of uniform C’-convergence (— 168 
Function Spaces). À subset of a topological 
space is called a residual set or a Baire set if it 
is the intersection of a countable number of 
dense open sets. A residual set in Z'(M)1s a 
dense set. À proposition P concerning a vector 
field of class C" is called generic if the set {Ve 
4"(M)| P(V)} contains a residual set for any 
M. The following properties are generic prop- 
erties: (i) All singular points and all closed 
orbits are hyperbolic [21, 22]; (ii) any stable 
manifold (of a hyperbolic singular point or a 
hyperbolic closed orbit) meets transversally 
with any unstable manifold (of a hyperbolic 
singular point or a hyperbolic closed orbit) at 
any point of their intersection [21, 22]; (iii) the 
set of all periodic points (e, the union of all 
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singular points and all closed orbits) is dense 
in the nonwandering set Q for Ve 4! (M) [23]; 
(iv) there is no regular first integral for ve 

4 (M), where a regular first integral of a vec- 
tor field V on M is a C!-function f: MOR 
such that f is not constant on any open set of 
M and is constant along any orbit of (the flow 
generated by) V (R. C. Robinson). 

Let V (resp. V") be an element of 27(M) 
(resp. Z"'(M")), (M, q) (resp. (MT, ¢’)) the flow 
generated by V (resp. V^), and Q (resp. Q’) the 
nonwandering set of (M, o) (resp. (M’, oa 
V and V' are called topologically equivalent 
(resp. Q-equivalent) if (M, 9) and (M’, q^) (resp. 
the restrictions (Q, o |Q x R) and (0’, o'|Q' x 
R)) are topologically equivalent. A vector 
field VE 2"(M) (or the flow generated by V) is 
called C’-structurally stable (resp. C’-Q-stable) 
if there exists a neighborhood of V in 
4" (M) such that any V’ in M is topologically 
equivalent (resp. Q-equivalent) to V. If Ve 
4"(M) is structurally stable (resp. Q-stable), 
then the topological structure of the phase 
portrait of (the flow generated by) V in the 
whole space (resp. the nonwandering set) re- 
mains invariant under a sufficiently small C’- 
perturbation of V. The generic properties (1)- 
(iv) above hold for C'-structurally stable 
vector fields (Markus, Thom, Peixoto, J. Ar- 
raut). The generic properties (1) and (iii) above 
hold for C'-O-stable vector fields. 

(2) Let M be a compact C”-manifold with- 
out boundary. Let F'(M) be the set of all 
C'-mappings of M into itself with the topol- 
ogy of uniform C’-convergence. Let Diff'(M) 
be the subset of F'(M) consisting of all C’- 
diffeomorphisms of M onto itself. Then F'(M) 
is a complete metric space, and Diff'(M) is 
open in F'(M). Thus Diff'(M) is a *Baire space 
and also a topological group with respect to 
this topology. A proposition P concerning 
f € Diff'(M) is called generic if the set { f € 
Diff'(M)| P( f)] contains a residual set. The 
following properties are generic: (1) Every 
periodic point is hyperbolic [21]; (ii) for each 
pair of hyperbolic periodic points p and q, the 
stable manifold W^(p) intersects the unstable 
manifold W"(q) transversally [21]; (iii) for a 
C'-diffeomorphism f € Diff! (M) the set of all 
periodic points is dense in the nonwandering 
set ([23], Robinson). 

Let f:M—M and g: NON be C’- 
diffeomorphisms and Q( f) and Q(g) their 
nonwandering sets. f and g are called Q- 
conjugate if TI f):Q(/)Q(f) and g|O(g): 
O(g)— O(g) are topologically conjugate. A 
diffeomorphism Te Diff'(M) is called C’- 
structurally stable (resp. C’-Q-stable) if there 
exists a neighborhood A of f in Diff'(M) such 
that any g in A is topologically conjugate 
(resp. Q-conjugate) to f. Generic properties (i) 
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(iii) above hold for C'-structurally stable dif- 
feomorphisms, and (i) and (iii) above hold for 
C'-Q-stable diffeomorphisms. 


I. Low-Dimensional Systems 


(1) Let $! = (zeC||z| 2 1] be the unit circle in 
the complex plane C and p: RS! the tcover- 
ing projection defined by p(x) 2 e?"*, xeR. 
Let f:S! S! be an torientation-preserving 
homeomorphism. Then f can be lifted to a 
mapping F: RR satisfying the following 
conditions: (i) po F = f op; (ii) F is monotone 
increasing; (ili) F — lp is a periodic function of 
period 1, where 1g is the identity mapping of 
R. The limit oi f) 2 Dm, F"(x)/n exists for all 
x€ R, and its tresidue class modulo Z is inde- 
pendent of F and x. We call p( f) the rotation 
number of f. Let f, g:$! 5 S! be orientation- 
preserving homeomorphisms. If f and g are 
topologically conjugate by an orientation- 
preserving (resp. treversing) homeomorphism, 
then oi f)— p(g) (resp. p(f) — p(g)) modulo 
Z. An orientation preserving homeomorphism 
f:S! 5S? has a periodic point of the smallest 
positive period s if and only if p( f)=r/s, where 
r and s are relatively prime integers. In this 
case all periodic points are of the smallest 
positive period s. If p( f) is irrational, then 
the w-limit set w(x) of xeS! is independent 

of x, and E — ox) is either tperfect and tno- 
where dense or the whole space ST. If p( f) is 
irrational and E — o(x) — S}, then f is called 
transitive. If f is transitive, then it is topologi- 
cally conjugate to the rotation rn El 5$! 
defined by r,,,,(e77%) =e?" 1 PY, xeR. Let 
f:S' 58! be of class C! with p( f) irrational. If 
its derivative f’ is of *bounded variation, then 
f is transitive (A. Denjoy). In particular, if f 
is of class C? with p( f) irrational, then f is 
topologically conjugate to the rotation ren, 
However, there are C!-diffeomorphisms of S! 
onto itself whose nonwandering sets are not 
the whole space. Those C!-diffeomorphisms 
are never topologically conjugate to C?- 
diffeomorphisms. M. R. Herman gave a suffi- 
cient condition for a diffeomorphism of S! 
onto itself to be differentiably conjugate to a 
rotation. 

A C’-diffeomorphism f:S!—S! is structur- 
ally stable if and only if Q(/) is finite (hence 
Q( f) consists of a finite number of periodic 
points), and all periodic points are hyper- 
bolic. The set of all structurally stable C’- 
diffeomorphisms of S! onto itself is a dense 
open set in Diff’(S’) (Peixoto). 

(2) Let D be an open set in R? and f:D—R? 
a continuous mapping. We assume that for 
each xe D there exists a unique nonextend- 
able solution g(x, t) with the initial condition 
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(x, 0)=x defined on (a,, b,) for the equation 
dx/dt — f(x), x e D. Suppose that for a given 
point xe D there exists a compact set K in D 
containing the positive semiorbit C, (x)— 
tex, t) |O& t « b.) of x. Then b, — oc. Further, 
we assume that the w-limit set w(x) of x con- 
tains no singular points. Then we have either 
(i) C, (x) 2 (x) and it is a closed orbit, or (ii) 
C, (x) € (x) and w(x) is a closed orbit. In case 
(ii), (x) is a *simple closed curve and C, (x) 

is a spiral which tends the closed orbit w(x) 
(Poincaré, I. Bendixson). We call such a closed 
orbit w(x) a limit cycle. Let f, and f; be poly- 
nomials of two variables and m the maximum 
of degrees of f, and fọ. Let f —( fi, /;): R25 R? 
be the mapping defined by f, and fz. The 
equation dx/dt = f(x), xeR?, defined by such 
an f is called a polynomial system of degree 
m. The following is Hilbert's 16th problem: Is 
there a number N (m) depending only on m 
such that the number of limit cycles for any 
polynomial system of degree m is bounded by 
N (mp? 

Let M be a closed De, compact, without 
boundary) C?-manifold of dimension 2 and 
(M, o) a C?-flow on M. Then a minimal set of 
(M, ọ) is either (i) a singular point, (ti) a closed 
orbit, or (iii) the whole space M. For case (iii), 
M is a torus (Poincaré, Denjoy, C. L. Siegel, A. 
J. Schwartz). Let T be a 2-dimensional torus 
and (T, o) a C’-flow. Suppose that (T, 9) has a 
cross section X which is C!-diffeomorphic to 
S'. Let f: X5 X be the Poincaré mapping for 
2. Then (T, o) has a closed orbit if and only if 
the rotation number pí( f) of f is rational. If 
p(f) is irrational and (T, @) is of class C?, then 
T is a minimal set. 

Let M be an orientable 2-dimensional 
closed C?-manifold and VeZ^! (M). Then V is 
structurally stable if and only if the following 
conditions are satisfied: (i) There are only a 
finite number of singular points, all hyperbolic; 
(ii) There are only a finite number of closed 
orbits, all hyperbolic; (iii) There are no orbits 
which connect saddle points; (iv) The «(x) and 
w(x) for any xe M are singular points or 
closed orbits (Peixoto). The above theorem 
was first proved by Andronov and Pontryagin 
for analytic vector fields on a 2-dimensional 
disk which are transverse to the boundary of 
the disk at any boundary point. The set of all 
structurally stable vector fields of class C! on 
M is a dense open set in 4! (M) (Peixoto). 


J. Axiom A Systems 


In this section we assume that phase spaces 
are closed C*-manifolds with metric d and 
| <r <œ unless stated otherwise. 
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(1) A vector field Ve Z"(M) (resp. a C'-flow 
(M, o)) is called a Morse-Smale vector field 
(resp. a Morse-Smale flow) if the following 
conditions are satisfied: (i) The nonwandering 
set is the union of a finite number of singular 
points and a finite number of closed orbits; (11) 
The singular points and closed orbits are all 
hyperbolic; (iit) The stable manifolds and the 
unstable manifolds of the singular points and 
closed orbits intersect each other transversely. 
If dim M —2, the Morse-Smale vector fields on 
M are exactly the structurally stable ones 
discussed in Section I(2). A Morse-Smale 
vector field is structurally stable [25]. The set 
of all Morse-Smale vector fields on M is open 
but not dense in Z'(M) if dim M > 2 (Palis, 
Smale). However, it contains a dense open 
subset of the set of all tgradient vector fields 
with respect to a given ‘Riemannian metric 
(Smale). In particular, any closed manifold 
admits Morse-Smale vector fields and hence 
structurally stable vector fields. For a Morse- 
Smale vector field, the *Morse inequalities 
hold as they do in the tcalculus of variations 
in the large [24]. 

(2) A vector field Ve Z'(M) (or the C’-flow 
(M, ~) generated by V) is called an Anosov 
vector field (or an Anosov flow) if the following 
conditions are satisfied: (1) There is a direct 
sum decomposition T,(M)-2 L(V(x)) Q ES @ E! 
of the tangent space T,(M) for each xe M 
which depends continuously on xe M; (ii) 
(dej) (E5) = Et and (de), (Et) = Eat for all 
xe M and te R; (ii) There are a Riemannian 
metric on M and constants c, 47 0 such that, 
for all t 0 and xe M, ||(dg,),(w)|| &ce ^ |w|| 
when we EX, and ||(dg_,),(w)|| & ce ^' |w] 
when we E%, where ||: || is the norm induced 
by the Riemannian metric. The suspensions of 
Anosov diffeomorphisms and the *geodesic 
flows on Riemannian manifolds of negative 
curvature are important examples of Anosov 
flows (J. Hadamard [8]). There are exam- 
ples of Anosov flows other than the ones 
stated above (M. Handel and W. P. Thurston, 
J. Franks and R. F. Williams). The follow- 
ing have been proved by Anosov [8]: (i) An 
Anosov flow is structurally stable; (ii) there 
are countably many closed orbits for an 
Anosov flow; (iit) 1f there exists a smooth in- 
variant measure (i.e., an invariant measure 
which has a tsmooth density with respect to 
the measure associated with the Riemannian 
metric), then the set of all closed orbits is dense 
in M. If we assume further that the flow is of 
class C^, then it is tergodic; (iv) the set of all 
Anosov vector fields is open in Z'(My, (v) (Ei), 
(EX) (xe M) define tfoliations on M, which are 
called Anosov foliations. 

(3) A vector field Ve Z"(M) (or the C’-flow 
(M, p) generated by V) is called an Axiom A 
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vector field (or an Axiom A flow) if the follow- 
ing conditions are satisfied: A(a) The non- 
wandering set consists of a finite set F of sin- 
gular points, all hyperbolic, and the closure A 
of the union of closed orbits, and FN A = Ø; 
A(b) The conditions (1)- (ui) in the definition of 
Anosov flow in which we replace the terms 
"xe M" by “ xeA.” For each xe M, put W*(x) 
={yeM|d(9,(x), o,(y)) 50 as t o0j and 
W"(x)={yeM |d(p_,(x), 9.,(y))0 as t2 00}. 
We call W*(x) (resp. W"(x)) the stable (resp. 
unstable) manifold of V at x. For a subset A 
of M, put W*(A)  LJ..,W*(x) (o =s, u). 

For xe M, put W"*(x) 2 W*(C(x)) (o —5,u), 
where C(x) is the orbit of x. If the flow satis- 
fies Axiom A, then W?(x), W"(x), W"5(x), and 
W ""(x) are injectively immersed submanifolds 
for all xe M (M. W. Hirsch and C. C. Pugh). 
If xe M is a hyperbolic singular point, then 
W*(x) and W"(x) defined above coincide with 
those defined before. If C(x) is a hyperbolic 
closed orbit, then W?(C(x)) and W"(C(x)) 
defined above coincide with those defined 
before. 

For an Axiom A flow, there is a decomposi- 
tion of the nonwandering set QN=Q, U...UQ,, 
(disjoint union), where each Q; is closed, invar- 
iant, and transitive (e. has a dense orbit), 
and M — |J, W*(Q) — UI, W"(Q (disjoint 
union) [7]. This decomposition is called the 
spectral decomposition of Q, and each Q; is 
called a basic set. Let (0 - OU ... UG, be the 
spectral decomposition of the nonwandering 
set for an Axiom A flow. Denote O; « O; if 
W*(Q)f1W"(Q;)z (7j. A sequence of basic sets 
Q, Qi- (K> 1) is called a cycle if O; < 
Q; €... 0,0, =Q; and otherwise Q, 7 
Q, for p#q. An Axiom A flow which has no 
cycles in the above sense is said to satisfy the 
no cycle condition. The Q-stability theorem: 
An Axiom A flow with the no cycle condition 
is Q-stable [27]. Q-explosion: If an Axiom A 
flow has a cycle, then it is not Q-stable (Palis). 
An Axiom A flow is said to satisfy the strong 
transversality condition if W(x) and W""(x) 
intersect transversely at any xe M. Àn Axiom 
A flow with the strong transversality condi- 
tion satisfies the no cycle condition [7]. The 
structural stability theorem: An Axiom A flow 
of class C! with the strong transversality con- 
dition is structurally stable [29, 31]. Morse- 
Smale flows and Anosov flows are Axiom A 
flows with the strong transversality condition. 
There are Axiom A flows other than Morse- 
Smale flows and Anosov flows that satisfy the 
strong transversality condition [7]. Stability 
conjecture: A C’-flow is structurally stable 
(resp. Q-stable) if and only if it ts an Axiom A 
flow with the strong transversality condition 
(resp. the no cycle condition). S. Newhouse, 
V. A. Pliss, Robinson, R. Maie, S. D. Liao, 
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and A. Sannami have made important contri- 
butions to the study of the stability conjecture. 

Neither the set of all Axiom A flows nor the 
set of all O-stable flows nor the set of structur- 
ally stable flows is dense in Z"(M) if dim M >2 
(R. Abraham and Smale, Newhouse). How- 
ever, the set of all structurally stable flows is 
dense in 2"(M) in the C? topology (M. Shub, 
M. M. C. de Oliverira). 

(4) A diffeomorphism f € Diff'(M) is called a 
Morse-Smale diffeomorphism if the following 
conditions are satisfied: (i) The nonwandering 
set Q is a finite set, and hence it consists of 
periodic points; (ii) all periodic points are 
hyperbolic; (111) for each pair p, qe Q, W*(p) 
intersects W"(q) transversally. The Morse- 
Smale diffeomorphisms on the unit circle S! 
are exactly the structurally stable ones de- 
scribed in Section I(1). A Morse-Smale dif- 
feomorphism is structurally stable [25]. The 
set of all Morse-Smale diffeomorphisms on M 
is open but not dense in Diff'(M) if dim M z2 
(Palis, Smale). However, it contains a dense 
open subset of the set of all time-one mappings 
of the flows generated by gradient vector fields. 
In particular, any closed manifold admits 
Morse-Smale diffeomorphisms and hence 
structurally stable diffeomorphisms. For a 
Morse-Smale diffeomorphism, the Morse 
inequalities hold [24]. 

Let A be a closed invariant set of fe 
Diff'(M). ^ is called hyperbolic if the follow- 
ing conditions are satisfied: (1) There is a split- 
ting T,(M)— E; & EX of the tangent space 
T,(M) for each xe A, which depends continu- 
ously on xe; (ii) df, (Ex) = ES and df (Ex) = 
E w for all xe A; (iti) There is a Riemannian 
metric on M and constants c» 0,0 «4 « 1 such 
that, for any integer m 0 and xeA, ldfz"(w)| 
x cA" ||wll when we Ef and Vd "(w)| < 
cA" ||wl| when we EX. A diffeomorphism fe 
Diff'(M) is called an Anosov diffeomorphism 
if M itself is hyperbolic for f. Manifolds which 
admit Anosov diffeomorphisms are restricted 
(Hirsch, K. Shiraiwa). Examples of Anosov 
diffeomorphisms are given as follows: Let 
L:R"5R" be a hyperbolic linear automor- 
phism with L(Z") 2 Z", where Z" is the dis- 
crete subgroup of R” consisting of all elements 
with integral coordinates. Then L induces an 
automorphism f: T" T" of the n-dimensional 
torus T" - R"/Z", which is an Anosov diffeo- 
morphism. There are similar constructions 
using hyperbolic automorphisms of simply 
connected ‘nilpotent Lie groups and their uni- 
form discrete subgroups (Smale). A homeo- 
morphism h: X +X of a metric space X is 
called expansive if there is a constant ¢>0 
such that x, ye X and d(h"(x), h"(y)) « e for all 
neZ imply x — y. An Anosov diffeomorphism 
is expansive. For Anosov diffeomorphisms, 
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theorems similar to those stated in Section 
J(2) hold. Franks, Newhouse, A. Manning, 
and J. N. Mather obtained important results 
concerning Anosov diffeomorphisms. 

A diffeomorphism fe Diff'(M) is called an 
Axiom A diffeomorphism if the following con- 
ditions are satisfied: A(a) The nonwandering 
set Q is hyperbolic; A(b) the set of all periodic 
points is dense in Q. There are examples that 
satisfy Axiom A(a) but not Axiom A(b) (A. 
Dankner, M. Kurata). For xe M, put W'*(x) 

- (ye M |d(f"G), f(y) 20 as noc] and 
W*(x)7 (ye M |d( f "G9. f "()) 20 as n> 00}. 
We call W*(x) (resp. W"(x)) the stable (resp. 
unstable) manifold of f at x. For a subset A of 
M, put W*(A4)— UJ. 4W"(x) (c =s, u). If f is an 
Axiom A diffeomorphism, W*(x) and W"(x) 
are injectively immersed submanifolds of M. If 
x is a hyperbolic periodic point, W*(x) and 
W(x) defined here coincide with those defined 
before. For Axiom A diffeomorphisms, notions 
such as spectral decomposition, basic sets, the 
strong transversality condition, and the no 
cycle condition are defined similarly, and 
theorems similar to those stated in Section J(3) 
hold. 

Let h: X +X be a homeomorphism of a met- 
ric space X and a, fj 0. A sequence {x;};.7 of 
points in X is an z-pseudo-orbit if d(h(x;), x,,,) 
«a for all ie Z. We say that {x;},.7 is f- 
shadowed (or fi-traced) by a point xe X if 
d(h'(x), x) « f for all ie Z. We say that h has 
the pseudo-orbit tracing property if for any f 
>0 there exists an a 0 so that any a-pseudo- 
orbit is fj-shadowed by some point. If f is an 
Axiom A diffeomorphism, then f|O( f):O( f) 
—Q( f) has the pseudo-orbit tracing property 
(Bowen). Axiom A diffeomorphisms with the 
strong transversality condition (in particular, 
Anosov diffeomorphisms) have the pseudo- 
orbit tracing property (Bowen, K. Sawada, K. 
Kato, A. Morimoto). 

(5) Let S be a discrete topological space with 
n elements (n 2 2) and Z(S) 2 IT;.z S; the dou- 
bly infinite product of copies S; of S with the 
product topology. A point x =(x,);-7 of 2(S) is 
a doubly infinite sequence of points in S. Z(S) 
is a *totally disconnected, perfect, compact, 
*metrizable space De, a *Cantor set). Let 
oe (S) £Z(S) be a mapping defined by o(x)= 
y, x= (x), Kell Y:=X (i€ Z). Then o is a 
homeomorphism of Zi and is called the shift 
automorphism with n symbols. 

Define a diffeomorphism f:$?— S? of the 2- 
dimensional sphere S? as follows. We consider 
S? as R?U {00}. Let U be a region of R? con- 
sisting of the rectangle R (= ABCD) and an 
upper and a lower cap as shown in Fig. 1. f 
maps U into itself as in Fig. 1 (f(A)= A', f(B) 
= B', f(C) - C', f(D) = D', and so on). Here, 

Rf f(R)- PUQ is the union of two rectangles, 
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Daf !(P)and Q'— f ^ ' (Q) are rectangles, and 
f is taffine on each of P' and Q’ (stretching in 
the vertical direction and contracting in the 
horizontal direction). The lower cap is con- 
tracted into itself, and every point in it tends to 
a sink xo. The upper cap is mapped into the 
lower one and stays there. The point oo is the 
only source, and lim,,,, f "(x)— oo for all 
xeR? — U. Thus the nonwandering set of f 
consists of a sink xy, a source oo, and A= 
(ez /" (R). The mapping f constructed here 
is called the horseshoe diffeomorphism. It is 

an Áxiom À diffeomorphism with the strong 
transversality condition (and hence structur- 
ally stable). The restriction f|A of f ona 

basic set A is topologically conjugate to the 
shift automorphism with two symbols. It is 
neither a Morse-Smale diffeomorphism nor an 
Anosov diffeomorphism [7]. 





Fig. 1 


A restriction of a shift automorphism on a 
closed invariant set is called a subshift. Let A 
=(A;,) be an n x n matrix with (i, j)-component 
Aj; —0 or 1 for all i, j— 1, ..., n. It is irreducible 
if for each i, j there is a positive integer m such 
that A” has a nonzero (i, j)-component. As- 
sume that A is irreducible and S — (1, ..., n]. 
Put Z,— ix —(xj)e Z(S)| A, —1 for all 
ie Z). Then X; is a closed invariant set of X(S). 
The restriction c,—6|X,: Z4, X, is called a 
subshift of finite type or a Markov subshift, and 
A is called the transition matrix. A topological 
classification of the subshifts of finite type has 
been investigated by Williams. 

Let fe Diff'(M) be an Axiom A diffeomor- 
phism and A its basic set. Bowen constructed a 
"Markov partition for f|A by generalizing the 
Sinaí construction for Anosov diffeomor- 
phisms. The Markov partition connects f|A 
with a suitable subshift of finite type and is 
applied to the study of Axiom A diffeomor- 
phisms, especially to the ergodic theory of 
Axiom A diffeomorphisms (Bowen, Ruelle). A 
similar theory for flows was developed by 
Bowen, Ruelle, and M. E. Ratner. 

(6) Let xe M be a hyperbolic fixed point of 
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f €Diff'(M). A point pe W*(x)N W"(x) (p z x) is 


called a homoclinic point. If W*(x) and W“(x) 
intersect transversally at a homoclinic point p, 
then p is called a transversal homoclinic point. 
In a neighborhood of a transversal homoclinic 
point, there is a closed invariant set A of f” 
for some positive integer m such that f"|A is 
topologically conjugate to the shift automor- 
phism with two symbols (Smale). There are 
generalizations of this theorem for semiflows 
by F. R. Marotto, Shiraiwa, and Kurata. 


K. Topological Entropy and Zeta Functions 


(1) The notion of topological entropy was first 
defined by R. L. Adler, A. G. Konheim, and 
M. H. McAndrew as an analog to measure- 
theoretic *entropy. Let X be a compact topo- 
logical space and « an topen covering of X. 
Let N(x) be the minimum number of members 
of a tsubcovering of a Let f: X > X be a con- 
tinuous mapping and « an open covering of X. 
Then lim,,,(1/n)log N(xv f !av...vf "'!a) 
exists, where av f !av...vf "tta is the open 
covering (A590 f HADNA... Af" (A, (Jl Ag, 
A,,...,A,-,€%}. We denote the above limit by 
h( f, x) and call it the topological entropy of f 
with respect to «. The topological entropy bt f) 
of f is defined as the sup h( f, a), where the 
supremum is taken over all open coverings « 
of X. Now assume that X is a compact metric 
space. For an open covering a of X, put diam « 
=sup{diam A| A&a]j, where diam A is the 
diameter of A. If (2,],., is a sequence of open 
coverings of X such that diam a, 0 as n oo, 
then (f, a,) h( f) as n oo. The topological 
entropy of the shift automorphism with n 
symbols is equal to logn. Let L:R"5R" bea 
linear mapping with L(Z") c L(Z") and 4,, ..., 
A, its eigenvalues. Let f: T"— T" be the in- 
duced endomorphism of the n-dimensional 
torus T" R"/Z". Then the topological entropy 
is given by h( f)=%),,>, log|A(l- 

Let f: X 2 X be a continuous mapping 
and M( f) the set of all *f-invariant probabil- 
ity measures on the fBorel sets of X. For 
each ue M(f), denote by h,( f) the (measure- 
theoretic) entropy of f with respect to u. Then 
h(f)=sup{h,(f)| ue M(f)} (E. I. Dinaburg, 
T. N. T. Goodman, L. W. Goodwyn). The 
following properties hold: (1) If £f: X +X isa 
homeomorphism, then bt f") 2 |n|h( f) for all 
neZ. (ii) If (X, o) is a continuous flow, then 
h(o,) — |t|h(q) for all teR. (iii) Let f;: X, X; 
(i— 1, 2) be continuous. Then bt, x f,)= 
h(f,) 4- h( fa). (iv) Let f: X; >X; (i= 1, 2) be con- 
tinuous. If there is a continuous mapping 
g: X, X; such that g(X,) 2 X, and go f, — 
f50g, then h( f) 2 h( f;). In particular, if f; and 
f; are topologically conjugate, then h( fi) — 
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h( f;). (v) If f£: X +X is a homeomorphism and 
Q is the nonwandering set of f, then h( f) — 
h(f|Q). In particular, Q-conjugate homeo- 
morphisms have the same topological entropy 
(Bowen). (vi) If f: MM is an Axiom A diffeo 
morphism of a closed C*-manifold M, then 

h( f)=limsup,,..,,,(1/n)log N,( f), where N,( f) is 
the number of all periodic points of period n. 
And h(f)—0 if and only if the nonwandering 
set 1s finite (Bowen). 

Bowen gave alternative definitions of the 
topological entropy and generalized it for 
uniformly continuous mappings of (not neces- 
sarily compact) metric spaces. If f: M >M 
is a C'-mapping of an n-dimensional Rie- 
mannian manifold M, then h( f) x max10, 
nlogsup(|df,||| xe Mj j, where |df,|| is the 
norm of df,: T,(M)9 fol ML and hence A( f) 
is finite (Bowen, S. Ito). 

(2) Let f: MM bea continuous mapping 
of a closed C?-manifold M and f,: H,(M)— 
H,(M) (resp. Ly Hí(M) H,(M)) the *induced 
homomorphism if f on the fhomology group 
H,(M) (resp. the tfirst homology group 
H,(M)) with coefficients in R. The spectral 
radius s(L) of a linear mapping L: E E ofa 
real vector space E is the maximum of the 
absolute values of the eigenvalues of L. The 
following entropy conjecture is still open: 

If f: MM is a C!-diffeomorphism (C!- 
mapping), then h( f) 2 logs( LL Concerning the 
entropy conjecture, the following are known: 
(i) The conjecture holds for a dense open set of 
Diff'(M) in the C?-topology (Shub); (ii) for 

any continuous mapping f, h( f) 2logs( fy) 
(Manning); (tii) the conjecture fails for a 
homeomorphism (Pugh); (iv) the conjecture 
holds for an Axiom A diffeomorphism with 
the no cycle condition (Shub and Williams); 
(v) the conjecture holds for any continuous 
mapping of the n-dimensional torus (M. 
Misiurewicz, F. Przytycki); (vi) for a C!- 
mapping f:M—>M, h(f)>log|deg f|, where 
deg f is the *mapping degree of f (Misiurewicz, 
Przytycki). 

(3) M. Artin and B. Mazur first defined the 
zeta function of a diffeomorphism by analogy 
to *Well's zeta function. Let f: X X bea 
homeomorphism of a compact metric space 
X. Assume that the number N,, — N,( f) of all 
periodic points of period m is finite for all m. 
Put £(t) Zexp(25-, N,t"/m) and call it the 
zeta function of f. For any closed C^-manifold 
M, there is a dense set E of Diff'(M) such that 
N, (f) &ck" for f cE, where N,(f) is the 
number of isolated periodic points of period m 
and c and k are positive constants depending 
only on f (Artin, Mazur). Hence, for such f € E, 
the series exp(X.z.., NA f )t"/m) has a positive 
radius of convergence. Originally, Artin and 
Mazur called this the zeta function of f. 
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Let 04:24, X, be a subshift of finite type 
with transition matrix A; then £(t)— 1/det(E — 
tA), where E is the unit matrix (Bowen and O. 
E. Lanford III). The zeta function of an Axiom 
A diffeomorphism has a positive radius of 
convergence and is a rational function (K. 
Meyer, J. Guckenheimer, Manning). 

Let (M, 9) be a nonsingular C’-flow on a 
closed C*-manifold M. Let T be the set of 
all closed orbits and /(y) the smallest posi- 
tive period of zel. Smale defined the zeta 
function of (M, 9) by Z(s)= Iler Mol — 
Lexp/(y)] "FL If (M, o) is the geodesic flow on 
a surface of constant negative curvature, Z(s) 
reduces to the tSelberg zeta function. 

There are generalizations and modifications 
of the notion of zeta function by Ruelle and 
Franks. 


L. Classical Dynamical Systems 


(1) Let M be a C*-manifold without bound- 
ary. A C'-flow (M, o) or a C’-diffeomorphism 
f: MM with a smooth invariant measure is 
called a classical dynamical system. Most 
important classical dynamical systems are 
Hamiltonian or Lagrangian systems. In the 
modern formulation, a Hamiltonian system 
consists of M, a symplectic form w on M De a 
*nondegenerate ‘closed 2-form), and the vector 
field Xy on M defined by a C’*!-function 
H:M >R. We call X, the Hamiltonian vector 
field with energy function H. Let (M, œ, Xy) be 
a Hamiltonian system. Then the following 
hold: (i) M is of even dimension and there is a 
system of local coordinates (4! , ..., q", p,, ..., 
p,) such that w= 17-, daf ^dp; (J. G. Darboux). 
In these coordinates X y is expressed by 
*Hamilton's equations dal id = OH /Op;, 
dp;/dt = — 0H /0q', i— 1, ..., n; (ii) the smooth 
measure defined by the *volume element O — 
(( —1)?21/nt)c" is an invariant measure for 
the flow generated by Xj, (J. Liouville); (iii) 
the energy function H is constant along any 
trajectory of Xy. Especially, H ! (e) (ee R) is 
an invariant set for each e and is called an 
energy surface. Energy surfaces are submani- 
folds of codimension one for almost all ech. 
Important examples of Hamiltonian sys- 
tems are given as follows. Let Q be an n- 
dimensional manifold and T*(Q) the tcotan- 
gent bundle of Q. Then T*(Q) has a canonical 
symplectic form cy. We call Q a configuration 
space and T*(Q) a momentum phase space. For 
any C’*!-function H on T*(Q), we have a 
Hamiltonian system (T *(Q), coy, Xy) in which 
Xy 1s of class C". The Hamiltonian formalism 
is translated into Lagrangian formalism by 
using the tangent bundle T(Q) instead of 
T*(Q). T(Q) is called a velocity phase space. 


501 


For a given differentiable function L on T(Q), 
the energy function E on T(Q) and the Lagran- 
gian vector field X; on T(Q) are constructed. If 
L satisfies a certain condition, there is a system 
of local coordinates (q!, ...,q", d! , ..., d") such 
that X, is expressed in these coordinates by 
the *Euler-Lagrange equations dq'/dt — a) 
d/dt(0L/éq') = 0L/0q!, i— 1, ..., n. The projec- 
tion of an integral curve (Le, an orbit) of Xg 
into Q is called a base integral curve. Under a 
suitable condition, the base integral curve of a 
Lagrangian (or a Hamiltonian) system is the 
geodesic of the Jacobi metric on Q up to a 
reparametrization, and the restriction of the 
flow generated by X , on an energy surface is 
the geodesic flow described below. 

(2) Let M be a fcomplete Riemannian mani- 
fold of class C^ and S(M)= (ve T(M)| Iris 1]. 
Let 1: S(M)— M be the projection defined by 
n(v) 2 x for ve T,(M). Then S(M) is a tsphere 
bundle over M, which is called the unit tangent 
sphere bundle over M. Each ve $(M) deter- 
mines a unique !geodesic C,: R— M such that 
C, (0) 2 n(v) and the *tangent vector C;(0) to C, 
at t=0 is equal to v. Let v, be the tangent 
vector C;(t) to C, at t for teR. Then ||v,|| =1 
and z(v,) ^ C,(t) for all te R. Define a map- 
ping 9: S(M) x ROS S(M) by q(v, t) 2 v,. Then 
(S(M), o) is a C?-flow, which is called the 
geodesic flow on M. By the classical Liouville 
theorem, a geodesic flow has a smooth invar- 
lant measure. 

(3) Let T" = R"/Z" be the n-dimensional 


torus and w,,...,@, real constants. Define a 
mapping o: T" x R5 T" by o([x;, ..., x,], t) — 
[xi 0,1, ..., x, - t], where [x,,...,x,]€ 


R"/Z" = T" is the residue class of (x,,..., x,)e 
R” modulo Z”. Then (T", o) is a C?-flow. We 
call it the translation flow with frequencies 
Ois On I 0, ..., 0, are linearly indepen- 
dent over Z, they are called independent. Every 
orbit of the translation flow is dense if and 
only if its frequencies are independent (Poin- 
caré, H. Weyl). A translational flow with inde- 
pendent frequencies is called quasiperiodic. 

Let (M, o Xy) be a Hamiltonian system on 
a 2n-dimensional manifold M. Under a certain 
condition, there exist an open set U of M and 
a diffeomorphism f: U — T" x R” such that the 
following holds: Identify U with T" x R" by 
f, and let (q!, ..., q", p,, ..., p,) be the coordi- 
nates of T" x R". Then the energy function 
H is independent of q —(q!, ...,q") so that 
Hamilton's equations becomes dq'/dt = 0H /Op,, 
dp,/dt = — 9H/0q' 0 for i— 1, ..., n. Therefore 
the solutions are given by q'(t) 2 (OH/Op;(c))t + 
q'(0) modulo Z, p,(t)=p,(0)=c;, i— 1, ...,n, 
where c —(c,, ...,c,). Therefore N,— f !(T" x 
{c\) is an invariant torus (i.e., an invariant 
set diffeomorphic to T”) of (the flow generated 
by) Xj for all ce R”, and the restriction flow on 
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N. 1s the translational flow with frequencies 

w, =0H/ép,(c), ...,@,=0H/dp,(c) (Arnold). As- 
sume further that the Hessian det(0? H/ép,0p,) 
does not vanish at c and o, ..., c, are inde- 
pendent. Let H be an energy function obtained 
by adding a sufficiently small perturbation to 
H. Then for almost all c' near c, there exists an 
invariant torus Ñ of Xg near N, such that the 
restriction flow on N is differentiably equiva- 
lent to the translational flow with the same 
frequencies (Kolmogrov, Arnold, Moser). 

(4) Generic properties for Hamiltonian 
systems were investigated by M. Buchner, 
Markus, Meyer, Pugh, Robinson, Takens, and 
Newhouse. 


M. Bifurcation 


(1) Consider a differential equation with a 
parameter. For example, let X be a domain 
of R”, J=(-1, 1), and f:J x XOR"aC'- 
mapping. For each pe J, define f,: X +R" by 
SAX) = f(u x), xe X. Consider the differential 
equation 


dx/dt = f,(x), 


As u varies, the topological structure of the 
phase portrait of (7) may change. Suppose that 
there exists j4(,€ J such that the topological 
structure of the phase portrait of (7) changes at 
Uz ug but remains the same when 49 —£« u< 
Ho OF Ho € H< uo +E for some 2» 0. Then Ho 

is called a bifurcation point of (7). 

Hopf bifurcation: Assume that X = R? and 
the origin Oc R? is a singular point of (7) for 
all ueJ. Assume further that the Jacobian 
matrix of f, at Oc R? has two distinct complex 
conjugate eigenvalues Au) and A(z) such 
that the real part ReA(gy) of A1 is positive 
when u 0, zero when x =0, and negative 
when u <0. Then Oc R? is a sink for u «0 and 
a source for u» 0. Now assume further that 
d/du(Re A(u))| „=o is positive and Oc R? is a 
“vague attractor.” Then 0€&J is a bifurcation 
point, and there exists an asymptotically stable 
closed orbit for (7) near and around 0c R? 
which depends continuously on u for ui 0 
[37]. Thus a sink of (7) (u « O0) changes to a 
source and an asymptotically stable closed 
orbit (i 2 0) when u changes its sign. The Hopf 
bifurcation theorem can be generalized to a 
higher-dimensional case, and there is a dif- 
feomorphism version of the theorem. 

(2) Bifurcations in more general settings 
have been investigated by many mathema- 
ticians, including Thom, Arnold, R. J. Sacker, 
G. R. Sell, D. H. Sattinger, G. Iooss, Ruelle, 
and Takens. Generic bifurcations of dynamical 
systems have been investigated by J. Soto- 


xeX and peJ. (7) 
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mayer, Meyer, P. Brunovsky, and others; 
bifurcations of Morse-Smale systems by New- 
house, Palis, Peixoto, and S. Matsumoto; 

and bifurcations of Axiom A diffeomorphisms 
by Newhouse and Palis. 


N. Miscellaneous Topics 


(1) Let S* be the 3-dimensional sphere and 
Ved'(S?). H. Seifert proved that if V is suffi- 
ciently close (in C? topology) to a nonsingular 
vector field tangent to the fibers of the tHopf 
fibration S?  S?, then V has a closed orbit. He 
conjectured that every nonsingular vector field 
Ve" (S?) had a closed orbit. (Seifert conjec- 
ture; — 154 Foliations D). If a nonsingular 
vector field V e 4^! (5?) is transverse to a codi- 
mension 1 foliation of class C?, then it has a 
closed orbit (S. P. Novikov). Let M be a 3- 
dimensional C?-manifold. Then there exists a 
nonsingular vector field Ve 47! (M) with no 
closed orbit in any thomotopy class of a non- 
singular vector field on M (P. A. Schweitzer). 
Thus the Seifert conjecture fails for a vector 
field of class C!, but the conjecture for a vector 
field of class C” (r 22) is an open problem. 
Related work has been done by Fuller, H. 
Chu, and A Weinstein. 

(2) Let M be a closed connected C?- 
manifold. A C’-flow (M, q) (resp. f € Diff'(M)) 
is a minimal flow (resp. a minimal diffeomor- 
phism) if M itself is a minimal set. If M admits 
a *locally free S'-action of class C^, then it 
admits a minimal C?-diffeomorphism, and if 
M admits a locally free special (in particular, a 
tfree) T?-action of class C”, then it admits a 
minimal C?-flow (A. Fathi, Herman, A. B. 
Katok). Open problems: What are the topo- 
logical properties of the manifolds admitting 
minimal flows? Does S? admit a minimal flow? 

(3) E. N. Lorenz studied numerical solutions 
of the following nonlinear equations in R? 
which arose from the convection equation: 
dx/dt = —ox-F oy, dy/dt = —xz - rx — y, dz/dt = 
xy/bz. When o 210, r 228, and b — 8/3, he 
found irregular behavior in this dynamical 
system. R. M. May studied numerical solu- 
tions of the following difference equation in 
connection with the growth of biological popu- 
lations with nonoverlapping generations: 
X,44—ax,(1 — x,), x E LO, 1] (1 «a«4) He 
found that the dynamical structure of the 
above difference equation was delicate and 
complicated. Y. Ueda and H. Kawakami also 
found similar phenomena in their numerical 
study of Duffing's equation of the type d? x/dt? 
+kdx/dt + x? = Bcost. The phenomena ob- 
served in these investigations were called 
chaos, which exhibits strange attractors. 

These investigations have attracted the 
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attention of many mathematicians and scien- 
tists. For 1-dimensional semidynamical sys- 
tems such as May's equation, T. Y. Li, J. A. 
Yorke, A N. Sharkovskii, J. W. Milnor, Thur- 
ston, and many others have obtained notable 
results, while for Lorenz's equation we have 
results by Ruelle, Guckenheimer, Williams, 
Sinai, and many others. Chaos arising from 
discretization of differential equations has been 
studied by M. Yamaguti, S. Ushiki, and others 
(— 433 Turbulence and Chaos). 
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A. General Remarks 


There are two types of multistage tdecision 
processes. In one of them, an outcome of the 
whole process is determined at the final stage 
without any consideration of the outcome for 
each intermediate stage. The textensive form 
of a game is of this type. In the other type, 
an outcome is assigned at each stage of a 
multistage decision process. The theory of 
dynamic programming, dealing with this latter 
type, has been developed by R. Bellman and 
others since 1950 and is now one of the fun- 
damental branches of mathematical program- 
ming, along with the theories of linear and 
nonlinear programming. The following exam- 
ples illustrate some features of multistage deci- 
sion processes. 

Multistage allocation process. We are given 
a quantity x >Q that can be divided into two 
parts y and (x — y). From y we obtain a return 
g(y), and from (x — y) a return h(x — y). In so 
doing, we expend a certain amount of our 
original resources and are left with a new 
quantity, ay + b(x — y), 0 «a, b «1, with which 
the process is continued. How do we proceed 
so as to maximize the total return obtained in 
a finite or unbounded number of stages? 

Multistage choice process. Suppose that we 
possess two gold mines A and B, the first of 
which contains an amount x of gold, while the 
second contains an amount y. In addition, we 
have a single gold mining machine with the 
property that if used to mine gold in the mine 
A, there is a probability P, that it will mine 
a fraction r, of the gold there and remain in 
working order, and a probability 1 — P, that it 
will mine no gold and be damaged beyond 
repair. Similarly, the mine B has associated 
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with it the corresponding probabilities P, and 
1 — P, and fraction r,. How do we proceed in 
order to maximize the total amount of gold 
before the machine is defunct? 

These two processes have the following 
features in common: (1) In each case we have a 
physical system characterized in any state by a 
small set of parameters, the state variables. (2) 
In each state of either process we have a choice 
of a number of decisions. (3) The effect of a 
decision is a transformation of the state vari- 
ables. (4) The past history of the system is of 
no importance in determining future actions 
(*Markov property). (5) The purpose of the 
process is to maximize some function of the 
state variables. 

A policy is a rule for making decisions that 
yields an allowable sequence of decisions; an 
optimal policy is a policy that maximizes a 
preassigned function of the final state vari- 
ables. A convenient term for this preassigned 
function of the final state variables is criterion 
function. One of the characteristic features of 
Bellman's methodology of dynamic program- 
ming is the appeal to the principle of optimal- 
ity: An optimal policy has the property that 
whatever the initial state and initial decision 
are, the remaining decisions must constitute 
an optimal policy with regard to the state 
resulting from the first decision. 

In the multistage allocation process the state 
variables are x (the quantity of resources) and 
z (the return obtained up to the current stage). 
The decision at any stage consists of an alloca- 
tion of a quantity 0< y x x. This decision has 
the effect of transforming x into ay+ b(x — y) 
and z into z+ g(y) -- h(x — y). The purpose of 
the process is to maximize the final value of z. 
Denote by f(x) the N-stage return obtained 
starting from an initial state x and using an 
optimal policy. Then we have 


fi(x)= max [g() * h[x — y)], 
f(x) max [gCy) + h(x — y) 


+ f,-1(ay+b(x—y))],  nz2 


This recurrence relation yields a method for 
obtaining the sequence { f,(x)} inductively. 

In the stochastic gold-mining process, the 
state variables are x and y (the present level 
of the two mines) and z (the amount of gold 
mined to date). The decision at any stage con- 
sists of a choice of A and B. If A is chosen, 

(x, y) goes into ((1—r,)x, y) and z into z+r,x, 
and if B is chosen, (x, y) goes into (x, (1 —r;)y) 
and z into z 4 r; y. The purpose of the process 
is to maximize the expected value of z ob- 
tained before the machine becomes defunct. 

Denote by f(x, y) the expected amount of 
gold obtained using an optimal sequence of 
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choice. Then we have 


piirix+ f( — rx, Hi 
pilriy f(x (01—739)) | 


The optimal policy can be described in the 
following way. We choose A or B according as 
pari X/(1 — p) is greater or less than p rz y/(1— 
p2). We can choose either A or B if equality 
holds. After an operation according to such a 
choice, the machine may become defunct and 
terminate the process. If the machine is usable, 
then we can apply our policy to a new com- 
bination of the amounts of gold in A and B. 


B. Discrete Deterministic Processes 


By a deterministic process we mean a process 
in which the outcome of a decision is uniquely 
determined by the decision. We assume that 
the state of the system, apart from time de- 
pendence, is described in any stage by an M- 
dimensional vector p constrained to lie within 
some region D. Let T={T,} (where q runs over 
a set S) be a set of transformations with the 
property that pe D implies that T,(p)e D for all 
gek, i.e., any transformation T, carries D into 
itself. The term "discrete" signifies here that we 
have a process consisting of a finite or de- 
numerably infinite number of stages. A policy, 
for the finite process which we consider first, 
consists of a selection of N transformations 

in order, P=(T,, T3, ..., Ty), yielding succes- 
sively the sequence of states p; — Ti(p; .,) (i= 
2,3, ..., N) with p, = T, (p). These transfor- 
mations are to be chosen to maximize a given 
function R of the final state py. Observe that 
the maximum value of R(p,), as determined 
by an optimal policy, will be a function of the 
initial vector p and the number N of stages 
only. Let us then define our basic auxiliary 
functions Tel pls max R(py)=the N-stage re- 
turn obtained starting from an initial state p 
and using an optimal policy. This sequence is 
defined for N —1,2, ...,and pe D. The essential 
uses of the principle of optimality can be ob- 
served from the following two features. The 
first is the use of the embedding principle. The 
original process is embedded in a family of 
similar processes. In place of attempting to 
determine the characteristics of an optimal 
policy for an isolated process, we attempt to 
deduce the common properties of the set of 
optimal policies possessed by the members of 
the family. The second feature is the derivation 
of recurrence relations by which the functional 
equations connecting the members of the se- 
quence { f,(p)} are established. Assume that 
we choose some transformation T, as a result 
of our first decision, obtaining in this way a 
new state vector T, (p). The maximum return 
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from the following k — 1 stages is, by definition, 
fia). It follows that, if we wish to maxi- 
mize the total k-stage return, d must now be 
chosen to maximize this (k — 1)-stage return. 
The result is the basic recurrence relation 
Jelp) - max,es f, i (1, (p)), for k2 2, with fi(p) — 
max,.s R(T,(p)). For the case of an unbounded 
process, the sequence ( f,(p)} is replaced by a 
single function f(p), the total return obtained 
by using an optimal policy starting from state 
p, and the recurrence relation is replaced by 
the functional equation f(p)— max, f(T,(p)). 


C. Discrete Stochastic Processes 


We again consider a discrete process, but one 
in which the transformations are stochastic 
rather than deterministic. The initial vector p 
is transformed into a stochastic vector z with 
an associated distribution function dG,(p, z) 
dependent on p and the choice of q. We as- 
sume that z is known after the decision has 
been made and before the next decision is to 
be made. We agree to measure the value of 

a policy in terms of some average value of 
the function of the final state. Let us call this 
expected value the return. Beginning with 

the case of a finite process, we define f,(p) as 
before. The expected return as a result of the 
initial choice of T, is therefore 


| fi d Gp, 2). 


Consequently, the recurrence relation for the 
sequence ( f,(p)} is 


fp) - max | f,-1(2)dG,(p, 2), k22, 


with fi (p) =MaXges Jeep R(z) dG,(p, z). Con- 
sidering the unbounded process, we obtain the 
functional relation 


foma [ f(z)dG,(p, z). 


D. Continuous Deterministic Processes 


A number of interesting processes require that 
decisions be made at each point of a con- 
tinuum, such as a time interval. The simplest 
examples of processes of this character are 
provided by the tcalculus of variations. Let 

us denote by f(p; T) the return obtained over 
a time interval [0, T] starting from the ini- 
tial state p and employing an optimal policy. 
Although we consider the process as one con- 
sisting of choices made at each point t on 

(0, T], it is better to begin with the concept of 
choosing policies (functions) over intervals, 
and then pass to the limit as these intervals 
shrink to points. Let d be an allowable deci- 
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sion made over the interval [0, S], and let p, be 
the state at S starting from the initial state p 
and employing d. The application of the prin- 
ciple of optimality suggests that 


f(p S CEET) (1) 


where the supremum is taken over the set D of 
all allowable decisions d. 

The limiting form of (1) as $20 is a non- 
linear partial differential equation (Bellman 
partial differential equation). This expression 
is important for use in actual analysis. For 
numerical purposes, S is kept nonzero but 
small. R. Bellman showed that it is possible 
to avoid many of the quite difficult rigorous 
details involved in this limiting procedure if 
we are interested only in the computational 
solution of variational processes. 


E. Markovian Decision Processes 


Consider a physical system which at any of 
the times t=0, A, 2A, ... must lie in one of the 
states $,, $5, ..., Sy. Let y;(n) be the probability 
that the system is in S; at times nA, and let P 
be the probability that the system is in state S; 
at t+A if it is in state S; at time t. We suppose 
that the transition probabilities P; are inde- 
pendent of t. We assume that the P;; depend 
on à parameter q, which may be a vector, and 
that at each stage of the process q is to be 
chosen so as to maximize the probability that 
the system is in the state $,. We obtain the 
nonlinear system 


yı(n+1)= max) pyl )y(n) 


yi(n)pa (a*); 


Il 
ia 


2 


y(n + 1)— 2. pi(q*) yin), 


i=2,3,...,N 


where g* = q*(n) in the remaining N — 1 equa- 
tions is one of the values of q that maximize 
y,(n+ 1). There are similar processes that can 
be considered as continuous analogs of this 
type of decision process. These are called Mar- 
kovian decision processes and were discussed 
by Bellman. There is, however, another type 
of Markovian decision process in which a re- 
ward is given at each stage. For each state S; 
of the system there are k alternatives 1, 2, 3, 
.,k. If we choose the alternative h among 
these k alternatives, then the transition prob- 
abilities pi! (j=1,2,...,n) are determined, and 
a reward r; is associated with each state A. 
Let us denote by v;{n) the total expected re- 
turn obtained at the nth stage by appealing 
to an optimal policy when the initial state is S;. 
Then the principle of optimality in the theory 
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of dynamic programming yields 
N 
v(n4- 1) max ( x piri; + sin) 
E 


A policy-iteration method invołving a 
value-determination operation with a policy- 
improvement routine was given by R. A. 
Howard [4]. 


F. Dynamic Programming and the Calculus of 
Variations 


Problems in the calculus of variations can be 
viewed as multistage decision problems of a 
continuous type. It can be shown that the 
dynamic-programming approach yields formal 
derivations of classical necessary conditions 
for the calculus of variations. Let us consider 
the problem of minimizing the functional 


b 
J(y)- | L(x, y(x), y (x) dx, 
where the function y is subject to y(a) 

We embed this problem within the family of 
problems generated by allowing a and c to be 
parameters with the ranges of variation —oo < 
a«b, —oo «c« oo. Now we define the optimal 
value function S(a, c) 2 min, J( y). Then the 
principle of optimality yields the functional 
equation 


S(a, c) 


a 


—min (|. Aide: Stc) 


where the minimization is taken over all func- 
tions defined over To, a + A] with y(a) 2 c and 
c(y)— y(a 4- A). Then, writing v= y'(a), we get 


os os 
zo nm (re Cs nto). 
This yields the Euler equation, the Legendre 
condition, the Weierstrass condition, and 
the Erdman corner conditions. Furthermore, 
it can be shown that the functional equa- 
tion characterization yields the Hamilton- 
Jacobi partial differential equation of classical 
mechanics. 

The dynamic-programming approach can 
be applied to more general problems in the 
calculus of variations. 


G. Dynamic Programming and the Maximum 
Principle 


In general, the method of dynamic program- 
ming carries a more universal character than 
the maximum principle of optimal control 
theory. However, in contrast to the latter, this 
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method does not have a rigorous logical foun- 
dation. V. G. Boltyanskii [6] has presented a 
justification of the dynamic programming 
method. 

Let fi(x, u) (i=0, 1, ..., n) be defined for xe 
VcR"and ueU c R', where V is an open set, 
and continuously differentiable on V x U. 
Suppose that two points x? and x! are given in 
V. Among all the piecewise continuous con- 
trols u(t) — (u, (t), ...,u,(t)) eU which transfer 
the phase point moving in accordance with 


dx 


d: =fi(x,u(t)),  i-L...,n, 





from x? — x(t) to x! = x(t,), find the control 
u(t) for which the functional 


s= |” fs Jolx(t), u(t))dt 


takes the smallest value. 

A continuous function o(x) — o(x,, ..., Xn) is 
called a Bellman function relative to a point 
ac V if it possesses the following properties: (1) 
«(a) — 0; (2) there exists a set M (the singular 
set of w{x)), which is closed in V and does not 
contain interior points, such that the function 
w(x) is continuously differentiable on the set 
V—M and satisfies the condition 


ueU 


sup( $ e Ee u) — f(x, ol , xeV—M. 


The following theorem gives a sufficient opti- 
mality condition. 

Theorem: Assume that for dx/dt — f(x, u(t)) 
given in a region VCR" there exists a Bellman 
function q(x) relative to the point ae V with a 
piecewise smooth singular set. Assume, further- 
more, that for any point x?& V there exists 
a control u(t) which transfers the phase point 
from x? = x(t,) to a = x(t,) and satisfies the 
relation 


| fot) u(t))dt— — a(x, 


Then any such control u(t) is optimal in V. 

Recently, Vinter and Levis [7] obtained the 
following general result in this connection. The 
sufficient condition is given in terms of a solu- 
tion to the Bellman partial differential equa- 
tion. It is shown that if this equation is modi- 
fied so that it is actually an inequality, and if 
this inequality is required to be satisfied in a 
limiting sense only, then the condition is also 
necessary for optimality. 


H. Characteristic Features of Dynamic 
Programming 


The characteristic features of the dynamic- 
programming approach can be summarized in 
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the following five points: (1) the advantage of 
lower dimensionality in comparison with the 
enumeration approach; (2) the possibility of 
finding maxima and/or minima of functions 
defined over restricted domains for which 
differential calculus may not work well; (3) the 
availability of numerical solutions in recursive 
form; (4) the possibility of formulating certain 
problems to which classical methods do not 
apply; and (5) the applicability of the method 
to most types of problems in mathematical 
programming, such as finventory and produc- 
tion control, optimal searching, and some 
optimal and adaptive control processes. 
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A. General Remarks 


The term econometrics can be interpreted in 
various ways. In its widest sense, it means 

the application of mathematical methods to 
economic problems and includes mathematical 
economics, *mathematical programming, etc. 
However, here we use it to mean statistical 
methods applied to economic analysis. 

The object of econometrics is to provide 
methods to analyze relationships between 
economic variables. We classify these methods 
into four categories according to the types of 
relationships involved: (1) Analysis of causal 
relations: If a set of variables X,,..., X, affects 
an economic variable Y, we can estimate the 
direction and extent of those effects on Y. 

(2) Analysis of equilibrium: When a set of 
economic variables Y,, ..., Y,, is determined 
through a market equilibrium mechanism, we 
can analyze the structure of relationships 
that determines the equilibrium. (3) Analysis 
of correlation: When a set of economic vari- 
ables is affected simultaneously by some (un- 
known) common factors, we can analyze the 
correlation structure of the variables. (4) Ana- 
lysis of time interdependence: A process of 
development in time of a set of economic 
variables can be analyzed. 

There are two types of economic data: (a) 
macroeconomic data, representing quantities 
and variables related to a national economy as 
a whole, usually based on national census; and 
(b) microeconomic data, representing informa- 
tion about the economic behavior of individ- 
ual persons, households, and firms. Macro- 
economic data are usually given as a set of 
time series (— 421 Time Series Analysis A), 
while microeconomic data are obtained main- 
ly through statistical surveys and are given as 
cross-sectional data. These two types of data, 
related to macroeconomic theory and micro- 
economic theory, respectively, require differ- 
ent approaches; and sometimes information 
obtained from both types of data has to be 
combined; obtaining macroeconomic infor- 
mation from microeconomic data is called 
aggregation. 


B. Regression Analysis 


The most common technique for the first 

category of problems is tregression analysis, 
which is applied to both microeconomic and 
macroeconomic analysis. However, there are 


510 


problems peculiar to economic analysis, where 
a factor can seldom be controlled; and usually 
there are too many highly related independent 
variables. In such cases, if all possible indepen- 
dent variables are taken into the model, the 
accuracy of the testimators of the coefficients 
becomes extremely poor. Such a phenomenon, 
called multicollinearity, brings up the problem 
of selection of independent variables (— 403 
Statistical Models), to which no satisfactory 
solution has been given. Also, assumptions 
about the error terms may be dubious, the 
error terms may be correlated, or the variances 
may be different. If the tvariance-covariance 
matrix of the errors is given, the tgeneralized 
least squares method can be applied, but 
usually such a matrix is not available. 


C. Systems of Simultaneous Equations 


The second category of problems is peculiar 
to economic analysis and applies mainly to 
macroeconomic data. Suppose that Y=(Y;,, 
..., Yo) is a vector consisting of G economic 
variables, among which there exist G relation- 
ships that determine the equilibrium levels of 
the variables. We also suppose that there exist 
K variables Z=(Z,,...,Z,)’ that are indepen- 
dent of the economic relations but affect the 
equilibrium. The variables Y are called endog- 
enous variables, and the Z are called exog- 
enous variables. If we assume linear relation- 
ships among them, we have an expression 
such as 


Y=BY+TZ +u, (1) 


where B and T are matrices with constant 
coefficients and u is a vector of disturbances or 
errors. (1) is called the linear structural equa- 
tion system and is a system of simultaneous 
equations. By solving the equations formally, 
we get the so-called reduced form 


Y-IHZ-4v, (2) 


where 77 «(I — B) tI, v-(I — B) !u. The 
relation of Y to Z is determined through the 
reduced form (2), and if we have enough data 
on Y and Z we can estimate JI The problem 
of identification is to decide whether we can 
determine the unknown parameters in B and 
I uniquely from the parameters in the re- 
duced form. A necessary condition for the 
parameters in one of the equations in (1) to be 
identifiable is that the number of unknown 
parameters (or, since known constants in the 
system are usually set equal to zero, the num- 
ber of variables appearing in the equation) not 
be greater than K +1. If it is exactly equal to 
K +1, the equation is said to be just identified, 
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and if it is less than K + 1, the equation is said 
to be overidentified. 

If all the equations in the system are just 
identified, for arbitrary J there exist unique B 
and T that satisfy /7 — (I — B) ! I. Therefore, if 
we denote the tleast squares estimator of 77 by 
H. we can estimate B and T from the equation 
(I — B)ff — f. This procedure is called the 
indirect least squares method and is equivalent 
to the *naximum likelihood method if we 
assume normality for u. 

When some of the equations are over- 
identified, the estimation problem becomes 
complicated. Three kinds of procedures have 
been proposed: (1) full system methods, (2) 
single equation methods, and (3) subsystem 
methods. In full system methods all the para- 
meters are considered simultaneously, and if 
normality is assumed, the maximum likelihood 
estimator can be obtained by minimizing |(Y — 
ITZ) (Y — IZY |. Since it is usually difficult to 
compute the maximum likelihood estimator, a 
simpler, but asymptotically equivalent, three- 
stage least squares method has been proposed. 
The single equation methods and the sub- 
system methods take into consideration only 
the information about the parameters in one 
equation or in a subset of the equations, and 
estimate the parameters in each equation 
separately. There is a single equation method, 
called the limited information maximum like- 
lihood method, based on the maximum likeli- 
hood approach, and also a two-stage least 
squares method, which estimates J first by 
least squares, computes Y — /7Z, and then 
applies the least squares method to the model 


Y=BY4+TZ+a. 


These two and also some others are asymptot- 
ically equivalent. Among asymptotically 
equivalent classes of estimators corresponding 
to different information structures it has been 
established that the maximum likelihood 
estimators have asymptotically higher-order 
efficiency [5] (— 399 Statistical Estimation) 
than other estimators, and Monte Carlo and 
numerical studies show that they are in most 
cases better than others if properly adjusted 
for the biases. 

In many simultaneous equation models 
which have been applied to actual macro- 
economic data, the values of endogenous 
variables obtained in the past appear on the 
right-hand sides of equations (1). Such vari- 
ables are called lagged variables, and they can 
be treated, at least in the asymptotic theory of 
inference, as though they were exogenous. 
Hence exogenous variables and lagged endog- 
enous variables are jointly called predeter- 
mined variables. When many lagged variables 





128 Ref. 
Econometrics 


appear over many time periods and when 
some structure among the coefficients of those 
lagged variables can be assumed, such a model 
is called a distributed lag model. 

Sometimes it is necessary to include some 
nonlinear equations in the simultaneous equa- 
tion model. Such nonlinear simultaneous 
equation models are difficult to deal with, 
partly because the solution of the equation 
may not be unique, and in practical applica- 
tions ad hoc procedures are applied to obtain 
estimates of the parameters. 


D. Multivariate and Time Series Analysis 


Problems in the third category can be ap- 
proached by tmultivariate analysis techniques. 
Sometimes tprincipal component analysis and 
*canonical correlation analysis have been 
applied to analyze the variations of a large 
amount of data. However, the practical mean- 
ing of the results obtained is often dubious. 

The fourth category is the problem of time 
series analysis. Sophisticated theories of sto- 
chastic processes have little relevance for 
economic time series, because usually the time 
series do not satisfy such conditions as being 
stationary or having the tMarkov property, 
etc. Recently, however, autoregressive moving 
average ((ARMA) and multivariate ARMA 
models [4] have been applied to macro- 
economic data, especially for the purpose of 
prediction and for determining the direction of 
causal relations (— 421 Time Series Analysis). 
Traditionally, fluctuations of economic time 
series have been thought to consist of trend, 
cyclic variation, seasonal variation, and error. 
Various ad hoc techniques have been used to 
separate or eliminate such components, but 
the theoretical treatment of such problems is 
far from satisfactory. 
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Albert Einstein (March 14, 1879—A pril 18, 
1955) was born of Jewish parents in the city of 
Ulm in southern Germany. He became a Swiss 
citizen soon after graduating from the Eid- 
genóssische Technische Hochschule of Zürich 
in 1900. Afterward, he obtained a position as 
examiner of patents at Bern, and while at that 
post, he published his theories on light quanta, 
tspecial relativity, and Brownian motion. 
After briefly holding professorships at the 
University of Ziirich and the University of 
Prague, he became a professor at the Univer- 
sity of Berlin in 1913. His general theory of 
relativity was announced in 1916, and in 1921 
he won the Nobel Prize in physics for his 
contributions to theoretical physics. To escape 
Nazi persecution, he fled to the United States 
in 1933, and until his retirement in 1945 he 
was a professor at the Institute for Advanced 
Study at Princeton. He advised President 
Roosevelt of the feasibility of constructing the 
atomic bomb, but after World War II, along 
with others who had been connected with the 
bomb, he was active in promoting the nuclear 
disarmament movement and the establishment 
of a world government. 

The theory of relativity raises fundamental 
epistemological problems concerning time, 
space, and matter. The results of the general 
theory were verified in 1919 by observations of 
the solar eclipse. 

Through his latter years, Einstein continued 
to work on tunified field theory and on the 
generalization of relativity theory. 
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A. Maxwell’s Equations 


Mathematical formulation of electromagnetics 
leads to tinitial value and *boundary value 
problems for Maxwell’s equations according 
to the geometric nature of the medium. Max- 
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well’s equations for a vacuum are written in the 
form 


£9 OE/0t 2 rot H — J,, £o div E — p,, 


Ho06H/Gt 2 —rotE—J,, — u9divH— p,, (1) 


where E and H are the electric and magnetic 
field vectors, p, and p,, the electric and mag- 
netic charge densities, J, and J„ the electric 
and magnetic current densities, £ and Ho con- 
stants, and the quantity 1/,/£949 =c the speed 
of light in vacuum (2.99797 x 109 m/s). Charge 
and current densities must satisfy the equations 
of continuity 


0p,/0t +div J, — 0, 
0p, / 0t + div J, — O. (2) 


Following upon the observation that, appar- 
ently, p,,=0 and J,, — 0 in nature, we hence- 
forth set them equal to zero. This causes an 
asymmetry between the electric and magnetic 
quantities. On the other hand, the proposition 
*p,, U and J,, — 0" cannot be deduced from 
the classical theory itself. 

In the presence of matter, additional charge 
and current appear due to the electric and 
magnetic polarization P and M of the material. 
Therefore, in this case, it is necessary to make 
the following substitutions in (1): 


p, p — p, — div P, 
J,J —J, - OP/8Gt -- rot M, 
HAH CM. (3) 


Moreover, if we define the electric flux density 
(or electric displacement) D and the magnetic 
flux density (or magnetic induction) B by 
D-«&E-P, B-j4j(H-«M) (4) 
then Maxwell’s equations (1) are transformed 
into | 
OD/ot-rotH —J,,  divD-p,, 
OB/0t = —rot E, div B 2 0. (5) 


In the electromagnetic field in a vacuum 
there is energy with a density 


u — (co/2) E? + (uo/2)B?, (6) 


and energy flux with a density expressed by 
the Poynting vector 


S=ExH. (7) 


Between these quantities the following relation 
holds: 


du/dt + divS=0. (8) 


An electric charge g moving with velocity v 
in an electromagnetic field is subject to the 
Lorentz force 


F=qE+ qvx B. (9) 
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This force can be interpreted as being caused 
by the Maxwell stress tensor 


Ty, =(€9/2)(— E, E, + 26, E?) 
 (uo/2)( — H; H, + 204, H?). (10) 


By introducing the scalar potential V and 
the vector potential A, we can express the field 
vectors as 


B-rotA, E=-—gradV—éA/ét. (11) 


Furthermore, if we impose an auxiliary con- 
dition (Lorentz condition) 


then we obtain from (1) the wave equations 
[]V— —p,/£s, ODAS = Hode, (13) 


where [] = A — £o Ho0%/ôt? is called the d' Alem- 
bertian and is sometimes written as ©7. From 
(13) we conclude that the electromagnetic 

field can propagate in a vacuum as a wave 
with speed c = 1/\/ £o Ho. In tquantum theory 
the potentials V and A are regarded as being 
more fundamental than E and H themselves. 
However, they are not uniquely determined, in 
the sense that the gauge transformation 


Vo9V-Foy/ot, A—A-grady (14) 


with an arbitrary function v of the space and 
time variables does not affect the fields. 
Maxwell's equations are invariant under the 
tLorentz transformation. Therefore they can 
be written in 4-dimensional tensor form (— 
359 Relativity C). 
Maxwell’s equations can be regarded as 
the twave equations for tbosons with spin 1 
(*photons). The equations of quantum electro- 
dynamics are obtained if we regard the field 
quantities as quantum-mechanical vari- 
ables (g-numbers) and then perform *second 
quantization. 


B. Concrete Problems 


In solving Maxwell's equations concretely, 
we usually make additional assumptions for 
polarizations, electric current, and field vectors 


P-y,E M=y,H, J.=oE, (15) 


where Ye, Xm, and c are called the electric sus- 
ceptibility, magnetic susceptibility, and conduc- 
tivity, respectively. Then we have 


Deck Bau, (16) 


where £= £p + Xe is the dielectric constant and 
U= Holl + Xm) is the magnetic permeability. 
Therefore the equations (5) become identical to 
(1) if £% and Ho are replaced by e and y, respec- 
tively. (p,, and J,, are set equal to zero.) 
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Some cases of practical importance are 
given below. 

(1) Electrostatics. If the fields are time- 
independent and there is no electric current, 
then E and H are mutually independent. The 
static electric field is calculated from the solu- 
tion of the boundary value problem of the 
*Poisson equation AV = — p,/c. Specifically, V 
takes a constant value in each conductor. 

(2) Magnetostatics. For zero electric current, 
the problem of magnetostatics is solved in the 
same way as in electrostatics. For the case of 
nonvanishing stationary electric current, the 
problem is reduced to that of solving 


AA=—pJ,,  divA-Q. (17) 


(3) Electric current in a conductor. A station- 
ary electric current in a conductor is governed 
by the equation of continuity div J =0, Ohm's 
law J — cE, and a special case of Maxwell's 
equation rot E — O0. The electric current pro- 
duces heat (Joule heat) proportional to J- E 
per unit volume per unit time (Joule's law). 
For certain substances, the specific resis- 
tance o! suddenly becomes negligibly small 
below a critical temperature. This is called 
superconductivity. 

(4) Quasistationary electric circuit. The 
problem appearing most often in electrical en- 
gineering is that of a quasistationary circuit. 
Its characteristic feature is that the electric 
currents exist only in the circuit elements 
(inductors, capacitors, and resistors) and in the 
lines connecting them. The current (both J, 
and 0D/ét) can be neglected in all other parts 
of the system. (This could be compared to 
the situation in dynamics where we consider 
systems of material points or of rigid bodies 
having a finite number of degrees of freedom, 
although every material body is essentially 
a continuum.) The system network is con- 
structed as a "linear graph with the circuit ele- 
ments as its branches. Topological tnetwork 
theory deals with the relation between the 
structure of the linear graph and the electri- 
cal characteristics of the network, whereas 
function-theoretic network theory deals with 
the relation between current and voltage at 
each part of the network. In the latter theory, 
current and voltage are considered as func- 
tions of the frequency of the sinusoidal alter- 
nating voltage applied to some point of the 
network. Together these constitute a unique 
theoretical system in engineering mathe- 
matics for designing system networks (— 282 
Networks). 

(5) Theory of electromagnetic waves. The 
theory of electromagnetic waves deals with the 
case where the changes of all field quantities 
are proportional to el. and in addition the 
frequency o is so large that the term 0D/ót in 
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(5) is of the same order as or larger than J,. In 
such a situation the electromagnetic field 
behaves like a wave. 

Problems of various types arise depending 
on the geometry of the conducting and dielec- 
tric substances, on the type of the energy 
source, etc. Important problems are: (1) radi- 
ation of a wave from a point source into free 
space; (ii) scattering of a plane wave by small 
bodies or cylinders; (iii) diffraction of a wave 
through holes in a conducting plate; (iv) reflec- 
tion and refraction of a wave at the boundary 
between different media; (v) wave propagation 
along a conducting tube (wave guide); and 
(vi) resonance of the electromagnetic field in a 
cavity surrounded by a conducting substance. 
Theoretical treatment similar to that for ordi- 
nary networks is possible for microwave cir- 
cuits consisting of wave guides, cavities, etc. 

(6) Wave guides. For electromagnetic waves 
with a harmonic time dependence e "77 propa- 
gating inside a hollow tube of uniform cross 
section extending in the z-direction with per- 
fectly conducting walls (called a wave guide), 
the Maxwell equations become 


rot E = ioB, div B — 0, 


rotB-— —iueoE, div E-0, (18) 
with the boundary condition 


nxE-0, n-B=0, 


where n is a unit normal at the boundary 
surface. A further reduction is gained by 
Fourier analysis in the z-variable. For har- 
monic z-dependence eh", the transverse com- 
ponents E, — (e, x E) x e, and B, — (e, x B) x e, 
are determined from the z-components E, = 
e, E and H,=e,-H by the following part of 
the Maxwell equations: 


ikE, 4- ice, x B, — grad, E., 
ikB, — iuge, x E, — grad, B, (19) 


(where grad, is the transverse components of 
the gradient), up to the solutions for E,— B, 
— 0, called transverse electromagnetic (TEM) 
waves, for which k= c4 ue, B,= x / ue e, x 
E,, and E, is a solution of the electrostatic 
problem in two dimensions rot E,=0, div E, 
— 0. To have E,#0 for the TEM solution, it is 
necessary to have two or more surfaces, such 
as a coaxial cable (region between two con- 
centric cylinders) or a parallel-wire trans- 
mission line. Nonzero longitudinal compo- 
nents E, and B, are determined from the 
2-dimensional equations 
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with boundary conditions E, 20 and 0B, /ón 
— 0 on the wall, where (é/én) denotes the 
partial derivative in the normal direction. The 
solutions with B, — 0 are called transverse 
magnetic (TM) waves (or electric (E) waves) 
and those with E, =0 transverse electric (TE) 
waves (or magnetic (M) waves); for each of 
these cases the equations determine an eigen- 
wave number k for a given angular frequency 
œ (typically in a waveguide situation) or 
eigenfrequency v — c/(2x) for a given wave- 
length 4 — 2z/k (typically in a resonant cavity 
situation). 
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A. Definition 


A function of a finite number of real or com- 
plex variables that is algebraic, exponential, 
logarithmic, trigonometric, or inverse trig- 
onometric, or the composite of a finite number 
of these, is called an elementary function. Ele- 
mentary functions comprise the most common 
type of function in elementary calculus. 

J. Liouville [1] defined the elementary func- 
tions as follows: An algebraic function of a 
finite number of complex variables is called an 
elementary function of class 0. Then e" and 
logz are called elementary functions of class 1. 
Inductively, we define elementary functions of 
class n under the assumption that elementary 
functions of class at most n— 1 have already 
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been defined. Let g(t) and g(w,,...,w,) (1 < 
j&m) be elementary functions of class at most 
1 and f(z,, ...,z,) be an elementary function 
of class at most n — 1. Then the composite 
functions ot f(z,, ...,z,)) and f(gi(w,, ...,w,), 

gg W1. ..., w,)) (and only such functions) 
are called the elementary functions of class at 
most n. An elementary function of class at 
most n and not of class at most n — 1 is called 
an elementary function of class n. A function 
that is an elementary function of class n for 
some integer n is called an elementary function. 
In this article, we explain the properties of the 
most common elementary functions. 


B. Exponential and Logarithmic Functions of a 
Real Variable 


Let a0, a1. A function f(x) of a real vari- 
able satisfying the functional relation 


fix y-fo9fQ). — f()-a, (1) 


satisfies f(n) — a" for positive integers n and 
f(— n) Lia for negative integers — n. In gen- 
eral, f(n/m) — “at for every rational number 
r=n/m. If we assume that f(x) is continuous, 
then there is a unique strictly monotone func- 
tion f(x) defined in (—oo, oo) whose range is 
(0, oo). The function f(x) is called the exponen- 
tial function with the base a and is denoted by 
a*, read “a to the power x” and also called a 
power of a with exponent x. Its inverse function 
is called the logarithmic function to the base a, 
and is denoted by log, x. The specific value 
log, x is called the logarithm of x to the base a. 
If g(x) =log, x, we have 


g(a)=1. (2) 


Hence we have xy= f(g(x) 4- g( y)). Therefore 
we can reduce multiplication to addition by 
using a numerical table for the logarithmic 
function. 


g(xy) 7 g(x) + g(y). 


C. Logarithmic Computation 


The logarithm to the base 10 is called the 
common logarithm. If two numbers x, y ex- 
pressed in the decimal system differ only in the 
position of the decimal point De, y 2 x - 10" for 
an integer n), they share the same fractional 
parts in their common logarithms. The in- 
tegral part of the common logarithm is called 
the characteristic, and the fractional part is 
called the mantissa. (We note that the word 
“mantissa” is also frequently used for the frac- 
tional part a in the floating point representa- 
tion x=a- 10", 10 ! <a<1, or 1<a<10.) The 








131 D 
Elementary Functions 


common logarithms of integers have been 
computed and published in tables. 


D. Derivatives of Exponential and Logarithmic 
Functions 


The function f(x) — a* is differentiable, and 
f (x) 2 k, f(x), where k, is a constant deter- 
mined by the base a. If we take the base a to 
be 


lim { 1 2m 2.71828 
e= im ( +) Se ; wie 
then we have k,=1. The number X(1/v!) is 
usually called Napier’s number and is denoted 
by e after L. Euler (his letter to C. Goldbach 
of 1731; — Appendix B, Table 6). In 1873, 

C. Hermite proved that e is a transcendental 
number. We sometimes denote e* by expx; the 
term exponential function usually means the 
function exp x. The function e? is invariant 
under differentiation, and conversely, a func- 
tion invariant under differentiation necessarily 
has the form Ce*. The logarithm to the base e 
is called the Napierian logarithm (or natural 
logarithm), and we usually denote it by log x 
without explicitly naming the base e (some- 
times it is denoted by In x). The derivative 

of log x is 1/x, hence we have the integral 
representation 


logx= |“ (3) 


1 X 


The constant factor k, in the derivative of a* 
is equal to loga. The graphs of y=e* and 
y=logx are shown in Fig. 1. The functions e” 
and log(1 +x) are expanded in the following 
Taylor series at x 20: 


É (4) 


y 


logt += Y, (71) 1. 5) 


v=1 y 


The power series in the right-hand sides of (4) 
and (5) are called the exponential series and 
the logarithmic series, respectively. The radii 
of convergence of (4) and (5) are oo and 1, 
respectively. 
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E. Trigonometric and Inverse Trigonometric 
Functions of a Real Variable 


The trigonometric functions of a real variable 
x are the functions sin x, cos x (— 432 Trig- 
onometry) and the functions tan x = sin x/cos x, 
cot x — cos x/sin x, sec x = 1/cos x, and cosec x 
=1/sinx derived from sin x and cos x. The 
derivatives of sin x and cos x are cos x and 
—sin x, respectively. They have the following 
Taylor expansions at x —0: 


a (Om 








: — 2v+1 

sin x= 2 VE D" ; (6) 
eGP a 

GE Ga (7) 


The radii of convergence of (6) and (7) are both 
00. 

The inverse functions of sin x, cos x, and 
tan x are the inverse trigonometric functions 
and are denoted by arcsinx, arccos x, and 
arc tan x, respectively. (Instead of this nota- 
tion, sin ! x, cos ! x, and tan ^! x are also 
used). These functions are infintely multiple- 
valued, as shown in Fig. 2. But if we restrict 
their ranges within the part shown by solid 
lines in Fig. 2, they are considered single- 
valued functions. To be more precise, we re- 
strict the range as follows: —z/2xarcsinx < 
n/2, 0& arccosx € x, — 7/2 « arctan x « 1/2. 


y y—arctanx 


Larcsin x 
A 














Fig. 2 


The functions having these ranges are called 

the principal values and are sometimes denoted 

by Arcsin x, Árccos x, and Arctan x, respec- 
tively. The derivatives of these functions are 
(1—x2) 12, —(1— x?) 1, (1-- x?) !, respec- 
tively (— Appendix A, Table 9.1; for the Tay- 
lor or Laurent expansions of tan x, cot x, 
sec X, COSeC x, arcsin x, arccos x, arctan x, etc., 
— Appendix A, Table 10.IV). 


F. Hyperbolic Functions 


Let P be a point on the branch of the hyper- 
bola x? — y? = 1, x » 0, and let O be the origin 
and A the vertex (1,0) of the hyperbola. De- 
note by 6/2 the area of the domain surrounded 
by the line segments OA, OP, and the arc ÁP 
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of the hyperbola. Then we define the coordi- 
nates of P to be (cosh 0, sinh 0) as functions of 
0. We have 


cosh x 2 (e* +e *)/2, 
sinh x z(e* —e *)/2, (8) 


called the hyperbolic cosine and hyperbolic sine, 
respectively. As in the case of trigonometric 
functions, we define the hyperbolic tangent by 
tanh x = sinh x/cosh x, the hyperbolic cotangent 
by coth x =cosh x/sinh x, the hyperbolic secant 
by sech x = 1/cosh x, and the hyperbolic cose- 
cant by cosech x = 1/sinh x. They are called 

the hyperbolic functions. The graphs of sinh x 
and cosh x are shown in Fig. 3. The trigono- 
metric functions are sometimes called circular 
functions. 





We now introduce the Gudermannian (or 
Gudermann function): 


0—gdu -2arctane" — n/2, 


1 4- sinO 


1 
u—gd ! 0=log|tan 0 + sec0| 2 2log —— —. 
2 "1—sinO 


Then the hyperbolic functions can be ex- 
pressed in terms of the trigonometric func- 
tions. For example, 


sinhu=tan@, | coshu-secÓ,  tanhu-sinf. 


G. Elementary Functions of a Complex 
Variable (— Appendix A, Table 10) 


(1) Exponential function. The power series (4) 
converges for all finite values if we replace x by 
the complex variable z and gives an fentire 
function of z with an 'essential singularity at 
the point at infinity. This is the exponential 
function e* of a complex variable z. It satisfies 
the addition formula (1), e?:*?: —e?:e72, and it 
is also the tanalytic continuation of the ex- 
ponential function of a real variable. For a 
purely imaginary number z — iy, we have the 
Euler formula 


e? — cos y + isin y. (9) 


The function w — e? gives a tconformal map- 
ping from the z-plane to the w-plane, as shown 
in Fig. 4, which maps the imaginary axis of 
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the z-plane onto the unit circle of the w-plane 
(w=u+ iv). For z=x + iy (x, y are real num- 
bers), we have e7 — e*e"'; hence e is a tsimply 
periodic function with fundamental period 2i. 






JY 





w-plane 


(2) Logarithmic function. The logarithmic 
function logz of a complex variable z is the 
inverse function of e" It is an infinitely 
multiple-valued analytic function that has 
tlogarithmic singularities at z 20 and z = œ. 
All possible values are expressed by logz 
+ 2nni (n is an arbitrary integer), where we 
select a suitable value logz. The principal value 
of logz is usually taken as logr + iĝ, where z 
— re? (r=|z|, 0 is the argument of z) and 0« 6 
« 2n. (Sometimes the range of the argument is 
taken as —z «0 <n.) The principal value of 
logz is sometimes denoted by Logz. The 
power series (5) gives one of its tfunctional 
elements. The integral representation (3) holds 
for a complex variable z. The multivalency of 
logz results from the selection of a contour of 
integration; the integral of 1/z around the 
origin is 22i, which is the increment of log z. 

(3) Power. The exponential function a* for 
an arbitrary complex number a is defined to 
be exp(zloga). Similarly, z^ is defined to be 
exp(alogz). The function z^ is an algebraic 
function if and only if a is rational. In other 
cases, the function z^ is an elementary function 
of class 2. 

(4) Trigonometric functions, inverse trig- 
onometric functions, hyperbolic functions. 

The trigonometric, inverse trigonometric, and 
hyperbolic functions of a complex variable are 
defined by the analytic continuations of the 
corresponding functions of a real variable. For 
example, sin z and cosz are defined by the 
power series (6) and (7), respectively. They are 
entire functions whose zero points are nz and 
(n —1)z (n is an integer), respectively. They are 
also represented by *Weierstrass's infinite 
product (— Appendix A, Table 10.VI). 

The functions tanz, cotz, secz, and cosecz 
are tmeromorphic functions of z on the com- 
plex z-plane, and they are expressed by 
*Mittag-Leffler partial fractions (— Appendix 
A, Tables 10.IV, 10.V). As can be shown from 
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(8) and (9), we have 


cosz—-(e"-re Hi,  sinz-(e"*—e^7y2i, 


coshz=cosiz,  sinhz (sin iz)/i. (10) 


Each of these formulas (10) is called an Euler 
formula. For a complex variable, the trig- 
onometric and hyperbolic functions are com- 
posites of exponential functions, the inverse 
trigonometric functions are composites of 
logarithmic functions, and all of them are 
elementary functions of class 1. The definition 
of elementary functions by Liouville described 
in Section A refers, of course, to the functions 
of a complex variable. We remark that the 
inverse function of an elementary function is 
not necessarily an elementary function. For 
example, the inverse function of y 2x — asinx 
is not an elementary function (— 309 Orbit 
Determination B). 

The derivative of an elementary function is 
also an elementary function. However, the 
tprimitive function of an elementary function 
is not necessarily an elementary function. The 
primitive function of a rational function or an 
algebraic function of tgenus 0 is again an 
elementary function. Similar properties hold 
for rational functions of trigonometric func- 
tions. Liouville [1] carried through a deep 
investigation of the situation where the in- 
tegral of an elementary function is also an 
elementary function. 
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A. Introduction 


The word *atom" is derived from the Greek 
word for indivisible. It turns out that an atom 
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is divisible into its constituent nucleus and 
electrons. The nucleus, in turn, consists of 
protons and neutrons (together called nu- 
cleons). Photons (quanta of electromagnetic 
waves), electrons, protons, and neutrons (de- 
noted by y, e, p, and n, respectively), along 
with many other subsequently discovered sub- 
nuclear particles, are called elementary parti- 
cles, while nuclei, atoms, and molecules are the 
composite particles composed of these elemen- 
tary particles. 

States of a particle form an irreducible (uni- 
tary) representation of the proper inhomoge- 
neous Lorentz group with positive energy (or 
a finite direct sum of such representations). 
Thus a mass (m 2 0) and a spin (j —0,1, 1,...) 
are assigned to each particle (— 258 Lorentz 
Group). For example, e, p, and n have spin 4 
and nonzero masses, while y has spin 1 and 
zero mass. 

Many elementary particles are unstable, 
decaying into other particles. The average 
lifetime is denoted by zq and its inverse is called 
the half-width. The time in which half of many 
samples of the same particle decays is called 
the half-life, given by tlog2. For example, 
electrons, protons, and photons are supposed 
to be stable (or at least to have very long life- 
times), while a neutron decays into a proton, 
an electron, and a neutrino v (fi-decay) with a 
lifetime of about 15 minutes. 

From the study of the relativistic equations 
(Dirac equations) for wave functions of an elec- 
tron, P. A. M. Dirac predicted the existence of 
particles with the same mass as the electron, 
but of opposite electric charge (Dirac’s hole 
theory, 1930). These were discovered in 1932 
and called positrons. Every elementary particle 
is now believed to be associated with an anti- 
particle characterized by the opposite sign of 
the particle's additive quantum numbers, the 
two being connected by the tPCT theorem. 
Hence the positron is the antiparticle of the 
electron. The antiproton, theoretically ex- 
pected for a long time and experimentally 
found in 1955, is the antiparticle of the proton. 
Antiprotons, antineutrons, and positrons are 
constituents of antimatter. The particles whose 
additive quantum numbers are all invariant 
under change of sign, such as photons (y), neu- 
tral pions (7°), etc., seem to be antiparticles of 
themselves and are said to be self-conjugate. 

Elementary particles have four distinct types 
of interaction: gravitational, weak, electro- 
magnetic, and strong, in increasing order of 
interaction strength. Gravitational and electro- 
magnetic interactions were recognized in 
earlier centuries because these interactions are 
of long range. A. Einstein put forward the idea 
of a light quantum or photon as a lump of 
electromagnetic energy behaving like a par- 
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ticle. Whether a corresponding quantum (grav- 
iton) exists for gravitational interactions is a 
question related to the existence of gravita- 
tional waves themselves, and is not yet settled. 

The nuclear force is an example of a strong 
interaction and is studied to elucidate nuclear 
structure and to derive the tcross sections of 
various collision processes involving nuclet. H. 
Yukawa predicted in 1935 the existence of a 
particle associated with the nuclear force, just 
as photons are associated with the electro- 
magnetic interaction. Its mass was predicted, 
from the range of the nuclear force, to be 
about 200 times the electron mass, which is 
intermediate between the masses of electrons 
and nucleons, and hence the particle was 
named a mesotron or a meson. These were 
found in cosmic rays in 1947 and, in fact, it 
was found that the mesons relevant to the 
nuclear force (now called pions and denoted 
by x*, x, n? according to their electric charge) 
decay into other kinds of particles called 
muons (denoted by u*, u~) with a charged- 
pion lifetime of 2.6 x 1078 sec (n+ —u* 4+ v(vy; 
the neutral pion z? decays, over a shorter life- 
time, into photons). The muons then decay 
into electrons and neutrinos (u* —e* 4- v 4 v, 
v indicating antineutrinos) with muon life- 
times of about 2.2 x 1076 sec. Weak inter- 
actions are relevant to these decays as well as 
to the f-decay of the nucleus and the neutron. 
Since 1962 electron neutrinos v, and muon 
neutrinos v, have been distinguished in these 
decays, so that electron and muon numbers 
may be conserved. 

Since 1949, many new particles (unstable 
under weak interactions) have been gradually 
found in cosmic rays; these are called strange 
particles. Some of the early ones are hyperons 
(^, SL of spin 1, and kaons (K, K), of spin 0. 
For such strange particles, the strangeness 
quantum number, which is preserved in the 
strong interaction, has been introduced, and 
the Nakano- Nishijima - Gell-Mann formula 
concerning this number is known to hold. This 
says that Q = I, +4(B + S) for each elementary 
particle, where Q is the electric charge in units 
of that of the positron, I, is the third compo- 
nent of the isospin, B is the baryon number, 
and S is the strangeness. 

Quantum-mechanical wave functions for a 
system of identical particles seem to be either 
totally symmetric (Bose statistics) or totally 
antisymmetric (Fermi statistics) under permu- 
tations of particles. Accordingly, particles are 
either bosons (or Bose particles) or fermions (or 
Fermi particles). All bosons seems to have 
integer spins and all fermions half-odd-integer 
spins. This is the connection of spin and statis- 
tics and follows from certain axioms in quan- 
tum field theory (— 150 Field Theory). There 
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has been some discussion on intermediate 
statistics (parabosons and parafermions). 


B. Families of Elementary Particles 


Elementary particles are classified into families 
of leptons, photons, and hadrons. 

The family of leptons now consists of elec- 
trons (e), muons (u), and tau-leptons (1), their 
accompanying neutrinos (ve, v,, v,), and their 
antiparticles; t was discovered in the late 
1970s. Experimentally, v, 1s not well estab- 
lished nor has the possibility v, = v, yet been 
excluded. Leptons have spin J. They are char- 
acterized by having no strong interactions. 

The family of photons consists of photons 
and the recently discovered intermediary weak 
vector bosons. Gluons (— Section C (4)), if 
they exist, also belong to this family. 

The family of hadrons has a large number of 
members which are either mesons or baryons, 
the former being bosons and the latter fer- 
mions. We now have, in addition to pions and 
kaons, many resonant mesons (unstable under 
strong interaction) such as p-mesons and w- 
mesons. Át present we have, in all, more than 
20 species of mesons. This number does not 
count spin, charge, and antiparticle degrees of 
freedom. The baryon subfamily includes nu- 
cleons, hyperons, and excited states, now con- 
sisting of more than 30 species, again not 
counting spin, charge, and antiparticle degrees 
of freedom. We have nucleonic resonances 
with spin as high as 11/2. 

Hadrons are now well understood as com- 
posites of subhadronic constituents called 
quarks (and antiquarks), although free quarks 
have not been observed. (Hence the problem of 
quark confinement has been discussed ex- 
tensively.) Mesons are systems made up of 
a quark and an antiquark. Baryons are sys- 
tems made up of three quarks. Therefore, at 
our present level of knowledge, the elemen- 
tary particles might be leptons, photons, and 
quarks (and the corresponding antiparticles). 


C. Methods in the Theory of Elementary 
Particles 


(1) tQuantum Field Theory. Application of 
the ideas of quantum mechanics to electro- 
magnetic fields and their interaction with elec- 
trons resulted in the formulation of quantum 
electrodynamics (and more generally quan- 
tum field theory). Application of quantum- 
mechanical perturbation theory to quantum 
electrodynamics with the fine-structure con- 
stant «= e?/(hc) (about 1/137) as an expansion 
parameter resulted in divergent expressions— 
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the so-called divergence difficulties. The ultra- 
violet divergence comes from integration over 
high momenta (of virtual particles), and the 
infrared divergence is due to the zero mass of 
photons. It was later found that the ultraviolet 
divergence can be combined with a small 
number of parameters of the theory (i.e., elec- 
tron mass, (possibly nonzero) photon mass, 
and electromagnetic coupling constant e) into 
a revised set of constants (called the renormal- 
ized masses and coupling constant), which are 
then equated to the observed finite values of 
these constants—a procedure called renor- 
malization. Physically, this renormalization is 
pictured to be effected by virtual photons (and 
electron-positron pairs) surrounding (bare) 
electrons and photons. The infrared divergence 
is supposed to be a reflection of the fact that 
electrons can be accompanied by infinitely 
many photons with negligibly small total 
energy (a situation made possible by the zero 
mass of photons); this cannot be experimen- 
tally analyzed (and is indistinguishable from a 
single electron). 

The relativistically covariant formulation 
of the renormalized perturbation theory of 
quantum electrodynamics proposed by S. 
Tomonaga, J. S. Schwinger, and R. P. Feyn- 
man (independently and in different forms), 
and in particular the Feynman rules and 
Feynman diagrams that lead to the Feynman 
integrals (— 146 Feynman Integrals), made 
possible detailed theoretical computations, 
and the computed values (such as the Lamb 
shift of hydrogen and the anomalous magnetic 
moment of an electron) fit marvelously well 
with observed values—an achievement con- 
sidered a great success of quantum electro- 
dynamics. F. J. Dyson more or less showed 
that the renormalization procedure really 
absorbs all the divergences in all orders of the 
perturbation expansion in terms of renormal- 
ized constants, though there were later refine- 
ments and elaborations of the proof. This 
work also leads to the division of quantum 
field theories into two classes: renormalizable 
theories, where (infinitely many) ultraviolet 
divergences can be absorbed into a finite 
number of constants by renormalized pertur- 
bation theory, and unrenormalizable theories. 
The question of whether perturbation series 
converge in some sense is an unsolved ques- 
tion of quantum electrodynamics. 

In quantum field theory, the central role is 
played by quantum fields, which are operator- 
valued generalized functions of a space-time 
point. Particle interpretations of any state at 
infinite past and infinite future are obtained in 
the theory from the asymptotic behavior of the 
fields (tasymptotic fields) at time —oo and time 
+œ, and from their relation, expressed by the 
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*S-matrix, describing how particles scatter by 
collision. Thus any model of quantum fields 
makes, in principle, a prediction about what 
particles appear and how they behave (asymp- 
totically) in mutual collisions. 

After the success of quantum electrodynam- 
ics, the perturbation theory of quantum fields 
was applied to systems of pions and nucleons; 
this proved to be unsuccessful, possibly due to 
the lack of an appropriate expansion param- 
eter. This has led some people to study the 
mathematically rigorous consequences of 
quantum field theories; these mathematical 
consequences do not require any perturbation 
calculation, simply following from a small 
number of mathematically formulated axioms 
believed to be satisfied by a large class of 
quantum field theories. This approach, re- 
ferred to as taxiomatic quantum field theory, 
has yielded a few physically meaningful con- 
sequences of general nature: analyticity of 
some S-matrix elements, the *PCT theorem, 
and the connection of spin and statistics (— 
Section A). 

While the axiomatic approach provides a 
general framework, the tconstructive field 
theory developed later provides examples of 
quantum field theories that fit into such a 
framework. Because of its concrete nature, it 
can make statements about detailed properties 
of the model, such as the (non-)existence of 
composite particles, the establishment of per- 
turbation theory as an asymptotic expansion, 
and phase-transitions phenomena and the 
related broken symmetry as the coupling con- 
stant varies. It has, however, been successful 
for only 2 and 3 space-time dimensions. 


(2) Analytic S-Matrix Approach. Due to the 
failure of the perturbation approach in quan- 
tum field theory, a new approach was devel- 
oped based on assumptions about the analy- 
ticity properties of the S-matrix elements. The 
assumed analyticity properties were surmised 
from examination of Feynman integrals and of 
nonrelativistic potential scattering, and par- 
tially follow from axiomatic field theory. In 
this approach, the information that scattering 
amplitudes, or S-matrix elements, possess 
certain analytic properties with respect to 
energies, scattering angles, and so on is ex- 
pressed by means of integral representations. 
For example, the forward two-body scattering 
amplitude f(s) as a function of s = (energy in 
the center of mass system)? is written as 
f9- [7 as () 


S ~S 





~o 


Im f(s) is related to the total cross section by 
the optical theorem, which is a statement of 
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the unitarity of the S-matrix. Hence equation 
(1), called a dispersion relation, gives a relation 
among observable quantities. An integral rep- 
resentation for the general two-body scattering 
amplitude f(s, t, u) (2 incoming and 2 outgoing 
particles) has been proposed by S. Mandelstam 
and is called the Mandelstam representation, 
where s-(p, 4 p3)), t=(p, DA, and u= 
(p, + p4)* (squares in a Minkowski metric) 
and p,, P2, P3, P4 are the 4-momenta of the 
incoming and outgoing particles, with sign 
reversed for the latter. (Some relations hold 
among variables: p? = m? with m; the mass, 
> p;=0, s+t+u= > m.) Under the inter- 
change of incoming and outgoing particles, 
the corresponding f’s are related by analytic 
continuation. This is called the crossing sym- 
metry. The study of the S-matrix directly 
from its analyticity and unitarity is called the 
S-matrix approach (— 386 S-Matrices). 

The analyticity of the two-body scatter- 
ing amplitude f as a function of the angular 
momentum / with a fixed s was investigated by 
T. Regge for nonrelativistic potential scatter- 
ing, and later the idea was applied to the S- 
matrix approach. The poles l= (s) of f (fis 
considered to be a function of / for each fixed 
s) are called Regge poles. Regge trajectories | — 
l(s) for variable s <0 have been shown to play 
important roles in the high-energy behavior of 
scattering amplitudes for small values of the 
variable t (four-momentum-transfer squared). 

An approximate expression of S-matrix 
elements, with Regge poles and satisfying the 
crossing symmetry, was introduced by G. 
Veneziano and is called the Veneziano model. 
It has developed into the so-called dual reso- 
nance model (dual in the sense that s-channel 
poles are dual to t-channel poles) and has 
subsequently evolved into the string model 
of hadrons, according to which hadrons are 
viewed as systems composed of strings joining 
quarks (and antiquarks). 


(3) Group-Theoretical Approach. In connec- 
tion with the symmetry properties shown by 
the spectra and reaction patterns of hadrons, 
a group-theoretical approach has been de- 
veloped. For example, the similarity of the 
behavior of neutrons and protons in nuclei, 
apart from the difference in their electro- 
magnetic properties, was formulated as isospin 
invariance (under the group SU (2)). The sym- 
metry properties are sometimes understood in 
terms of new additive quantum numbers that 
are conserved or nearly conserved, and these 
properties are made more concrete in the final 
stage by the introduction of fundamental 
constituents carrying these quantum numbers. 

Based on the canonical formalism of quan- 
tum field theory, (Noether) currents, as 
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quantum-mechanical generators of the sym- 
metry, are introduced in association with con- 
served quantum numbers. The commutation 
relation of these currents, referred to as a cur- 
rent algebra, shows the structure of a Lie alge- 
bra expressing the symmetry of the Lagran- 
gian of the system of hadrons; for example, 
SU (3) x SU (3) for three species (flavors) of 
massless quarks (the first factor for vector 
currents and the second for axial-vector cur- 
rents). The approach showed remarkable suc- 
cess when combined with the hypothesis of 
the partially conserved axial-vector currents 
(PCAC), which requires the divergence of the 
axial-vector currents to be proportional to the 
pseudoscalar meson fields. 

Even if a theory has a symmetry under a 
group G in its formulation, a vacuum state of 
the theory might not have a symmetry under 
G. If that occurs, we speak of spontaneously 
broken symmetry. Under some assumptions, a 
particle of zero mass (which is connected to the 
vacuum by the current for the spontaneously 
broken symmetry) is associated with the spon- 
taneously broken symmetry. This statement is 
called Goldstone's theorem, and the relevant 
particle of zero mass is called the Nambu- 
Goldstone boson. The PCAC hypothesis is 
believed to be connected with the spontaneous 
breakdown of the axial SU(3) symmetry with 
pions, kaons, and eta-mesons as Nambu- 
Goldstone bosons. This approach produced 
the Adler-Weisberger sum rule, which relates 
the weak axial-vector coupling constants to 
pion-nucleon scattering cross sections. 

À group-theoretical attempt to treat fer- 
mions and bosons on an equal footing resulted 
in the introduction of super Lie algebras (Z,- 
graded Lie algebras). The basic new ingredient 
in this approach ts a special class of generators 
roughly interpreted as the square root of the 
four-momentum, whose anticommutators 
(instead of commutators) are linear combina- 
tions of ordinary generators. À supermultiplet, 
which is an irreducible representation of a 
super Lie algebra, consists of both fermions 
and bosons. Extensions to local super Lie 
algebras (and also incorporation of gravitons 
into the framework) have been tried, with no 
realistic model emerging so far. 


(4) Non-Abelian Gauge Field Theory. Recently, 
quantum field theory has been revived as non- 
Abelian tgauge theory, resulting in what are 
considered to be successful qualitative and 
semiquantitative predictions. The quantization 
of the theory was at first carried out in terms 
of the *Feynman path integral, where fictitious 
particles, called Faddeev-Popov ghosts, appear 
through the precise definition of the functional 
measure of the path integral. The canonical 
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quantization has also been formulated with 
the explicit introduction of Faddeev-Popov 
ghosts from the beginning. Non-Abelian 
gauge field theory exhibits the very important 
property of asymptotic freedom, which states 
that the interaction at asymptotically high 
energies, or at very short distances, approaches 
that of free (noninteracting) theory. This prop- 
erty is required for the description of hadronic 
systems made of quarks in view of the experi- 
mental observation of the scaling behavior of 
deep inelastic inclusive structure functions. 
Thus gauge theory is believed to describe the 
dynamics of systems of quarks in hadrons. It 
is called quantum chromodynamics, and the 
quantum of the gauge field is called the gluon. 
The recent development of gauge field theory 
has been accompanied by many technical 
refinements of quantum field theory, which 
include the methods of dimensional regulariza- 
tion and renormalization groups. 

Dimensional regularization starts with 
Feynman integrals defined for n-dimensional 
momenta with n 4, in order to give meaning 
to integrals divergent for n —4. Then Feynman 
integrals have poles at n —4, which are ab- 
sorbed into the unrenormalized constants by 
means of the renormalization procedure. This 
method is particularly suited for non-Abelian 
gauge theory, since it is the regularization 
method that keeps gauge invariance at each 
step of the calculation. l 

The trenormalization-group equation has 
been known for a long time. It results from 
the requirement that physical quantities 
should be dimensionally covariant under re- 
normalization of constants in the theory. The 
renormalization-group equation is usually 
written as a differential equation for a Green’s 
function, expressing the fact that a change of 
scale of momenta is balanced by changing 
coupling constants and masses. Further refine- 
ments are due to C. G. Callan, K. Symanzik, S. 
Weinberg, and G. ’t Hooft. 


D. Models of Elementary Particles 


Hadrons are now considered to be made of 
more fundamental constituents; at present, at 
least 15 species of subhadronic, fermionic 
constituents (quarks) have been proposed. 
Attempts to understand subhadronic structure 
have a long history, the main landmarks of 
which include the Fermi-Yang model (where 
m-mesons are supposed to be made of protons 
and neutrons), the Sakata model (where all 
hadrons are supposed to be made of protons, 
neutrons, and A-hyperons), and a variety of 
quark models developed from the eightfold 
way of M. Gell-Mann and Y. Ne'eman (new 
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assignments of representations of SU(3) to 
particles somewhat different from the Sakata 
model; for example, the octet representation 
for mesons and low-lying baryons, and the 
decuplet for excited baryons). Originally, 
quarks were supposed to come in 3 species 
(u-quarks, d-quarks, s-quarks), each carrying 
its own quantum number, now called the 
flavor quantum number. Then it was suggested 
that each flavored quark has three additional 
degrees of freedom, i.e., three new quantum 
numbers (called color quantum numbers) in 
order to account for experimental data: (1) the 
spin-statistics problem of baryonic ground 
state wave functions, (2) the decay rate of 
1 27, (3) the Drell ratio (= total cross section 
for je +e* anything], divided by the cross 
section fore" +e* ou^ +u”). Recently, two 
new flavor degrees of freedom besides the old 
3 flavors (u, d, s) have been discovered, the 
carriers of which are c-quarks and b-quarks, c 
being the constituents of J/i/-particles, b of 
Y-particles. The combination of 5 flavors and 
3 colors results in 15 quarks, as stated earlier. 
The so-called standard model is a quantum 
field theory based on a local, non-Abelian 
gauge group SU(3) x SU(2) x U(1). The group 
SU(3) is called the color SU(3) group, which 
is supposed to be strictly unbroken and ex- 
presses the invariance of the theory under the 
local SU(3) transformation of the three color 
degrees of freedom. Quarks having spin 1 
transform as its 3-dimensional fundamental 
representation, and the vector gauge bosons 
transforming as its 8-dimensional regular 
representation are gluons. This part of the 
theory is quantum chromodynamics (QCD). 
The remaining part of the theory, based on 
the local gauge group SU (2) x U(1), is called 
the Glashow-Weinberg-Salam model or its 
hadronic extension quantum flavor dynamics 
(QFD), and unifies the electromagnetic and 
the weak interactions. This gauge group is 
supposed to be spontaneously broken with the 
only unbroken subgroup U(1) corresponding 
to the electromagnetic gauge transformation. 
The underlying mechanism for the spontaneous 
breakdown of the gauge group SU(2) x U(1) is 
not well understood, but conventionally it is 
assumed to occur through the so-called Higgs 
mechanism. This is a mechanism proposed by 
P. W. Higgs, whereby the Goldstone boson 
acquires a nonzero mass if the broken sym- 
metry occurs in the presence of an associated 
massless vector field (called a gauge vector 
field), which also becomes associated with 
massive bosons. The gauge vector bosons are 
identified with photons (y) (unbroken sym- 
metry and hence massless) and weak inter- 
mediary bosons (broken symmetry and hence 
massive), usually denoted by W and Z. Quarks 
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and leptons transform under the group SU(2) 
as doublet or singlet representations. The 
renormalizability of the spontaneously broken 
gauge field theory has been established by 

't Hooft. Major predictions of the Glashow- 
Weinberg-Salam model, including the existence 
of the gauge bosons W and Z, have been 
borne out experimentally. 

Grand unified models attempt to unify 
QCD and QFD, employing a larger Lie group 
containing SU(3) x SU (2) x U(1) as a sub- 
group. The most popular ones are those based 
on the groups SU(5), SO(10) and on some of 
the exceptional groups. Super grand unified 
models attempt to unify QCD, QFD, and 
the gravitational interaction, with super Lie 
groups as a possible basis. Recently, there 
have been attempts to search for subquark 
structures. 
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A. Ellipsoidal Coordinates 


If a» bc, then for any given (x, y, 2 e R?, the 
three roots of the cubic equation in 0 


2 y? z? 


X 
= —1=0 
E EE i 





F(0) 
are real and lie in the intervals 0> —c?, —c? > 
0» —b?^, and — b?» 0» —a?. Denoting these 
three roots by A, u, and v (they are labeled 

so as to satisfy the inequalities 4» —c?>p> 
—b-v»-—a?, F(A)=0, F(u) -0, and F(v)=0 
represent an ellipsoid, a hyperboloid of one 
sheet, and a hyperboloid of two sheets, re- 
spectively. They are confocal with the ellipsoid 


x? y? z? 


PEITO e 1 =0, 


pass through the point (x, y, z), and mutually 
intersect orthogonally. 

The quantities 4, 14 v are called the ellip- 
soidal coordinates of the point (x, y, z). Rect- 
angular coordinates (x, y, z) are expressed in 
terms of ellipsoidal coordinates (A, u, v) by the 
formula 


> (a? * A) (a? +p)(a? +y) 
(a? — b?)(a? — c?) 


(1) 


and two others obtained from (1) by cyclic 
permutations of (a, b, c) and (x, y, z). 


B. Ellipsoidal Harmonics 


When a tharmonic function y of three real 
variables is constant on the surface 4 — con- 
stant, 4 — constant, or v=constant in ellipsoidal 
coordinates, the function v is called an ellip- 
soidal harmonic. A solution of Laplace's equa- 
tion Ay =0 in the form y = A(4)M(j)N (v) 

can be obtained by the method of separation 
of variables. The equation Ay =0 is written in 
the form 


à ow L 
Donnas (ace) —0, 


where the summation is taken over the even 
permutations of (A, u, v), and 


A;— JJ (a? - A)(D? +A) (c? +A). 
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The ordinary differential equation 


^djV dh 


is satisfied by A and also by M and N if we 
replace À by u and v, respectively. Equation (2) 
is called Lamé's differential equation, with K 
and C the separation constants. 

Let K =n(n+ 1) for n=0, 1, 2, .... Then 
equation (2), for a suitable value (the eigen- 
value) of C, has a solution that is a polynomial 
in A or a polynomial multiplied by one, two, or 
three of |, /a? -- A,  / b? +4, and ./c? +4. 

Among these solutions 2n + 1 are linearly 
independent. We denote these solutions by A 
= fa (4A) (m- 1,2, ...,2n- 1). They are essen- 
tially equivalent to the Lamé functions, to be 
defined at the end of this section. To be pre- 
cise, by setting 


À t (a? +b? - c?)/35 €, 
C - B n(n4- D(a? +b? +c7)/3, 
e, — (b? +c? —2a’)/3, ...; 


ry, (^ Zoch (2) 


€1 +é€,+e;=0, 














we have 

cl 1/2 " 1/2 n 1/2 Xs 

dé? č—e; Sex £04) de 

" n(n4- 1) +B 3) 
4(6— e) ($— e3)($ — es) 
This can also be written in the form 

ZA 

— - (n(n + 1)(u)-- B)A (4) 


du? 


by the change of variable £ = (o(u) with the 
Weierstrass ! (o-function. 

The differential equation (3) has £— e,, e;, 
e3, © as !regular singular points. A solution of 
(3) that is a polynomial in Z or a polynomial 
multiplied by one, two, or three of ./E—e,, 
~ 6— e5, and ,/ £— e, is called a Lamé function 
of the first kind. 


C. Classification of Lamé Functions 


The 2n 4- 1 linearly independent solutions 
f(A) of (2) are classified into the following 
four families. If n is an even number 2p, then 
p+1 solutions f,"(4) among the 2n+ 1 solu- 
tions are polynomials in 4 of degree p, and the 
other 3p functions are polynomials in A of de- 
gree p —1 multiplied by 


JG? AMA, — (c? 4- A) (a? 4- 4), 
or 
JJ (a? -- A) (b? +4). 


Since all these polynomials are products with 
real factors of degree 1, the solutions belonging 
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to the first family are of the type 
Jn" 0) (4 —0,4 — 0)... (4 — 0,5). (5) 


while solutions of the latter kinds are of the 
type 


b? Aer Al 
f" 0) 4 Je? tr Ay(a? +4) 
(a? + a)(b? 4- 2) 

set BE. DH BA (6) 





The functions (5) and (6) are called Lamé 
functions of the first species and of the third 
species, respectively. On the other hand, if n is 
an odd number 2p + 1, then 3(p+ 1) solutions 
among the 2n + 1 functions f;"(4) are of the 


type 
EE b rà 
x(4—0,)(4—0,)...(4— 8, 123) (7) 


and the other p functions are of the type 





IP 0) (a? +A (b+ A) (c? +) 
x(4—01)(4—05)...(4—04-35. (8) 


The functions (7) and (8) are called Lamé 
functions of the second species and of the fourth 
species, respectively. Hence in either case 
we have 2n 4- 1 linearly independent Lamé 
functions. 

When n is even, we obtain an ellipsoidal 
harmonic 


n{2 
Vy = I] (4 —6)(u — 8,)(v — 6,) 


by multiplying LGL f;"(u), and f" (v) belong- 
ing to the first family. Also, in this case, by 
setting 
x? y? z? 
9,—— — ta a kal 
" a?+0, b?+0, vi, 


= (4—8)(u— 0,)(v — 6,) 
(a? + 0,) (b? - 0) (c? + d 








we have 
V, —0,09,...9,, (9) 


up to constant coefficients. Utilizing Lamé 
functions of the third species (instead of func- 
tions of the first species) and formula (1), we 
find that 


Wr =(yz or zx or xy) x 910,... 0,5 ,. (10) 


For even n, every ellipsoidal harmonic ex- 
pressible in terms of polynomials in x, y, z of 
degree n can be written as a linear combina- 
tion of the functions (9) and (10), which are 
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called the ellipsoidal harmonics of the first and 
of the third species, respectively. Similarly for 
odd n, the ellipsoidal harmonics of the second 
and of the fourth species 


V. —(xoryorz)0,0,...0,.15. (11) 
V; —xyz0,09,...95 55 (12) 


are composed of the Lamé functions of the 
second and fourth species, respectively. 

For odd n, these forms are a complete sys- 
tem of ellipsoidal harmonics that are linearly 
independent and expressible in terms of poly- 
nomials in x, y, z of degree n. 

The zeros €,, €3,...,¢, of the Lamé func- 
tions are real, and ¢;4¢, (i#j). They never 
coincide with any one of e,, e,, and e;. If e, > 
€57 65, then ¢,,...,¢, all lie between e, and 
ez. If mis an integer such that 0O x m € p, we 
have one and only one Lamé function (with 
the species given) with m of its zeros lying 
between e, and e; and the remaining p —m 
zeros between e, and e, (Stieltjes's theorem). In 
this way, a complete system of linearly inde- 
pendent Lamé functions of the specified type 
may be obtained, since m assumes p+ | differ- 
ent values. When the constant B appearing 
in the differential equation (3) takes specific 
values so that the equation has Lamé func- 
tions of the first kind as its solution, (3) also 
yields a solution A such that A II as 
&£— oo. This function A is called the Lamé 
function of the second kind. 


D. Ellipsoids of Revolution (Spheroids) 


When the fundamental ellipsoid is a spheroid 


x-y? z 





a C 


it is convenient to use the spheroidal coordi- 
nates (č, n, 9) given by 


x=- D-n’) cos o, 
y-L/(G? — 1) —9?)sing, (13) 
z=lčņ egg 








—a 
for à? «c? (prolate) and 

xz (GP + 1)(1—n7) cosg, 
yal /(2°+)(U—n’)sing, 
z=léy, l= /a?—c? (14) 


for a? >c? (oblate). The solutions of Laplace's 
equation, which are regular at all finite points, 
are given by 


V = Pr (C) Py (sin mo 
in the prolate case, and 


y = Pr e) Py Gps MP 


525 


in the oblate case. Here P7 is the tassociated 
Legendre function of the first kind. Solutions 
which are regular outside a finite ellipsoid 
can be composed of the *associated Legendre 
functions of the second kind, Q7'(¢) or Qr (ič) 
instead of P”(é) or PP (ié), respectively. 


E. Spheroidal Wave Functions 


Transforming the tHelmholtz equation in 
prolate spheroidal coordinates (13), we have 


1 , on? ov 
uu. E E A 
E- s Ge Ch E) ) =) 

1 1 8? Y 

O then E 


(45) 





By separating variables in the form Y = 
X (éE) Y(n) me, we have the equations 








which X and Y, respectively, must satisfy. The 
only difference between equations (16a) and 
(16b) arises from the fact that the domain of 
(16a) is given by 1<€ whereas the domain of 
(16b) is given by —1 <y <1. For the oblate 
spheroid, utilizing formula (14) we have 


1 j9 OP 9 T) 
eant es SU d 


1 1 Qv 2 
FONS INR Y —0. 1 
E E 0 (17) 


By separating variables as before, Y(r) satisfies 
the same equation as (16b), while X(€) satisfies 
equation (16a) with č replaced by i£. All these 
equations are of the type 








(18) 


A solution of (18) is known as a spheroidal 
wave function. The equation (18) has +1 as 
regular singular points and oo as an irregular 
singular point of class 1. Hence spheroidal 
wave functions behave like ! Legendre func- 
tions in the interval [ —1,1] and like *Bessel 
functions in the neighborhood of oo. 


F. The Functions pe; (x) and gez (x) 


When we write a solution of (18), it is custom- 
ary to write x instead of z when z is contained 
in the interval [ —1, 1]. We denote solutions of 
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(18) which are regular on the whole domain 
—1<x<1 by pe™(x), and the corresponding 
eigenvalues by 4, ,, (assuming the boundary 
condition stated in this paragraph concern- 


ing singularities). In particular, when x0, 


equation (18) reduces to tLegendre’s associ- 
ated differential equation, the eigenvalues of 4 
become n(n + 1) (n is a positive integer), and 
the corresponding eigenfunctions become the 
associated Legendre functions of the first kind: 


d" P, 


Pro) Nr, 


(19) 


Hence pe”(x) is a solution which tends to a 
constant multiple of P7'(x) as x0. Using a 
system of orthogonal functions P% (x), we can 
expand pe; (x) as 


m(x)= 2. Am Pr 
||—n| - even number. (20) 


The coefficients A satisfy a recurrence 
formula 


(Het 1)+K? 





27 421—-1—2m?\ |. 
(21—1)(214+ 3) 


,(—-m-1)—m) ,, 
OI3)0L—1). " SS 


att Wt mt2) m 
(21+ 3)(21+5) sch «en 


The functions pe7'(x) and pe;"(x) are ortho- 
gonal in the domain —1 « x « 1. 

Another solution of (18) exists which corre- 
sponds to the same eigenvalue A, m, is inde- 
pendent of pez (x), and has the opposite parity: 
Ap Qr (x) 


I2 —m,l-n-even 


duo 


jzm,j—n=odd 


Ba Pi (x), (22) 


where the A7', are the same as in (20) and are 
determined by the recurrence formula (21), 


while for j 2 m +2, the B; ; satisfy the recur- 
rence formula 


2i? -2i—1—2m? 
Leet m) e, 


Qj- Qj 3) 
Um DU ml e 
Qj-30j-) H" 


Utmkilitmtil. 


Qj + 3)(j - 5) mj+2=0. 2) 


Since the associated Legendre function of 
the second kind 
QP(x)=(1 2x2)? d™Q,/dx™ 
is of the form 
Qr'(x) = P(x)log./(1 + x)/(1— x) 


(1 — x?) "? x (a polynomial in x) 
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for !z m, the qez (x) have x= +1 as singular 
points. 

By expressing the solution of equation (18) 
in integral form we find that pe?" (x) satisfies an 
integral equation 


i^ "v, spes (X) 


1 
-| (1—x?)"?(1— Syme pem(E)de, ` Gu 
-1 


where the coefficient v, „ is related to P” (0) or 
P?" (0). 

In order to extend the domain of defini- 
tion of pez (x) and qe7 (x) beyond the interval 
[ 71,1], we adopt, in the domain G obtained 
by deleting the interval [ —1, 1] from the com- 
plex plane, the Heine-Hobson definition of the 
associated Legendre function, 


P™(z) — (z2 — 1"? q" p, Jdz", (25) 


instead of N. M. Ferrers’s definition (19), and 
construct a solution of (18) in G: 


pen(z)= A. Ani Pi"(z), 


izm 
|| —n| 2 even number, (26) 
which is like (20) and again satisfies the inte- 


gral equation (24). From this we can obtain 
the expansion formula 


"( E (z? — 1y"? 


CIS 
d v, wc 


x J, (piel E am Zeta) 


12m (1— m)! KZ 
|| — n| even number. (27) 


Multiplying this by a constant, we define 


Kl n (z2—1)"? 
Jen (Z)= a 
Z 


x Y (yo ge y E". 
izm i /Kz D ' 


(l+m)! 
(=m! ^^ 


m 
n, 


LS 
This expression asymptotically assumes the 

form 

jen (z) - sin(x«z — nn/2)/kz 


for |z|>>1. In a similar manner we find a 
solution 


i 1 (z? — Um" 
us EC 


2 TEL 
x A (DOr? Fa, TK FER 
E Izm 


Jm KZ 
having the asymptotic form 


ne; (z) - cos(kz — nn/2)/xz. 
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Disregarding a constant factor, this coincides 
with the function defined by (22) with Q"(z) — 
(z? — 1"? d™Q,,/dz™ in place of the associated 
Legendre function of the second kind. 
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A. Elliptic Integrals 


Let (z) be a polynomial in z of degree 3 or 4 
with complex coefficients and R(z, w) a rational 
function in z and w. Then R(z, 4/ o(z)) is called 
an elliptic irrational function. An integral of the 
type [ R dz is called an elliptic integral. The 
origin of the name comes from the integral 
that appears in calculating the arc length of an 
ellipse. Any elliptic integral can be expressed 
by a suitable change of variables as a sum of 
elementary functions and elliptic integrals of 
the following three kinds: 


| dz 
Ja—-z2)-kz) 
[= 252 
k*z E 
1—z? 


and 











| dz 

(1 —a?z?),/(1 —2?)(1 —k?z?) 

(— Appendix A, Table 16.1). These three kinds 
of integrals are called elliptic integrals of the 


first, second, and third kind, respectively, in 
Legendre-Jacobi standard form. This classifica- 
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tion corresponds to that of tAbelian integrals. 
The constant k is called the modulus of these 
elliptic integrals, and a is called the parameter. 
Let the four zeros of (z) be «,, a, 44, «4 (we 
take one of them as oo when ¢(z) is of degree 
3). The *Riemann surface R corresponding to 
the elliptic irrational function has the zeros 
1, X2, X3, t4 as ! branch points with degree 
of ramification 1, and is of two sheets and 

of tgenus 1. If the integrand does not have 

a pole with a tresidue, then the integral is 
multivalued only because the value (called 
the periodicity modulus) of the integral taken 
along the normal section (basis of the homol- 
Ogy group) is not equal to zero. 


B. Elliptic Integrals of the First Kind 


When R is a function without singularities 
other than branch points, only the topological 
structure of R gives rise to the multivaluedness 
of the integral of R. The standard form is 





7 dz 
o /(1—22)(1—k2z?) 


p do 
-| ——— = WHF UE ok (1) 
o ./1—k? sin? o 


where z —sin o. This integral is the inverse 
function of sn w (— Section J). The periodicity 
moduli are 2iK' and 4K, where 


i dz 
ee ee 
ó a 


Lyte) 
RS o 2 


=K(k), k?-1—-K. 


We call K(k) a complete elliptic integral of the 
first kind and F (k, @) an incomplete elliptic 
integral of the first kind (— Appendix A, Table 
16). Setting 


1 (1 - )sin o cos o (ET? 
sin dë) =— ky =, (2) 
./1—k?* sin? o I+k 


we have the relation 
F(k,p)=(+k,)F(k,, 91)/2, 


which is called Landen's transformation. Since 
k, « k when 0 « k « 1, this transformation 
reduces the calculation of elliptic integrals 

to those with smaller values of k. For two 
given positive numbers a and b, put a) =a, 

bo — b, and 4,4; —(a, FRA, b, =V a,b, 
Then the sequences {a,} and {b,} converge 
rapidly to the common limit, which is called 
the arithmetico-geometric mean of a and 

b, and is denoted by ag(a, b). The complete 
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elliptic integral satisfies the relation K (k) — 


x/[2ag(1, ,/ 1 — k2)]. 


C. Elliptic Integrals of the Second Kind 


When R has poles with residue zero, its inte- 
gral has no singularities other than poles. 
The standard form is 


FO z 1—k?2? 
z= ———— 
9 
-| ./i—k?* sin? o dọ = E(k, 9), (3) 
o 
where z=sin q. We have 
? Q'(u E 
ra- | dn?udu2——— 4—u 
0 O(u) 


if we set z ^ snu. Here, O is Jacobi's theta func- 
tion, and 


ew - 5 (3 u X) 
2K’ K 


where 9, is a theta function to be described in 
Section I and K, K’ are the same as in the case 
of elliptic integrals of the first kind. The 
quantity 


1— k?z? 


1—z 


n/2 T 
d V1=# sin? odo= E) 
0 2 


is called a complete elliptic integral of the 
second kind. 


D. Elliptic Integrals of the Third Kind 


When R has poles with nonzero residue its 
integral has logarithmic singularities. In this 
case, residues also contribute to multivalued- 
ness of the integral. The standard form is 


Z dz 
ro=f 
o (1—a?z?) /(1 —z?)( 


e do 
o (1—a?sin? ọ) 


and it is expressed as 


sna 1 
F(z) 2 —— ———1 = 1 
e Ge S 


if we set z=snu and à? =k? sn? a (— 
A, Table 16). 





1— k?z?) 


1 — k? sin? o 





9(u—a) , Sa 
oura em) 


Appendix 


E. Elliptic Functions and Periodic Functions 


Historically the elliptic function was first intro- 
duced as the inverse function of the elliptic 
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integral. However, since it has been realized 
that elliptic functions are characterized as 
functions with double periodicity, it is now 
customary to define them as doubly periodic 
functions. 

If f(x), defined on a linear space X, satisfies 
the relation f(x + w) = f(x) for some coe X and 
all xe X, the number o is called a period of 
f(x), and f(x) with a period other than zero 
is called a periodic function. The set P of all 
periods of f(x) forms an additive group con- 
tained in X. If a basis œ, ...,c, of the additive 
group P exists, its members are called funda- 
mental periods of f(x). 

Any continuous, nonconstant, periodic 
function of a real variable has only one posi- 
tive fundamental period and is called a simply 
periodic function. The ttrigonometric functions 
are typical examples: sin x and cos x have the 
fundamental period 27; tan x and cot x have 
the fundamental period z (— 159 Fourier 
Series). 

A single-valued nonconstant tmeromorphic 
function of n complex variables cannot have 
more than 2n fundamental periods that are 
linearly independent on the real number field. 
A function of one complex variable with two 
fundamental periods is called a doubly periodic 
function. 

Let o, w be the fundamental periods of a 
doubly periodic function. For a given number 
a, the parallelogram with vertices a, a+ c, 

à oy, a 4- c o- of is called the fundamental 
period parallelogram. The complex plane is 
covered with a network of congruent parallel- 
ograms, called period parallelograms, obtained 
by translating the fundamental period parallel- 
ogram through mo 1 no* (m,n —0, +1, +2,...). 

A doubly periodic function f(u) meromor- 
phic on the complex plane is called an ellip- 
tic function. For simplicity, we usually denote 
the fundamental periods of an elliptic function 
by Zon and 20, and introduce œw, defined by 
the relation o +œ, 4- cj — 0. The first, and 
therefore also higher, derivatives of any elliptic 
function are elliptic functions with the same 
periods. The set of all elliptic functions with 
the same periods forms a tfield. The number of 
poles in a period parallelogram is finite. The 
sum of the orders of the poles is called the 
order of the elliptic function. An elliptic func- 
tion with no poles in a period parallelogram is 
merely a constant (Liouville's first theorem). 
The sum of the residues of an elliptic function 
at its poles in any period parallelogram is zero 
(Liouville's second theorem). Hence there can 
be no elliptic function of order 1. An elliptic 
function of order n assumes any value n times 
in a period parallelogram (Liouville's third 
theorem). The sum of the zeros minus the 
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sum of the poles is a period (Liouville's fourth 
theorem). 


F. Weierstrass's Elliptic Functions 


Weierstrass defined 


o1 


as the simplest kind of elliptic function. Here 
Q —2mo, + 2nw3, with m, n integers. The sum- 
mation } extends over all integral values (pos- 
itive, negative, and zero) of m and n, except 

for m=n=0. (u) is an elliptic function of 
order 2 with periods 2c, and 2a , called a 
Weierstrass (?-function. The following func- 
tions C(u) and oa(u) are called the Weierstrass 
zeta and sigma functions, respectively: 


u 1 
EE LL 0" o *a) 
and 


soci 5)enle zl 


These have quasiperiodicity, expressed by 


Clu + 20) — C(u) + 2n;, (4) 





o(u+2m,)= — e? tod g(y), (5) 
Nit+n2+n3=0, 4,=C(o,), i=1, 2, 3; 

and they satisfy the relations 

(u) - —C'(u) (6) 
and 

E LAMB () 


du ` ein 


The function £o(u) is an even function of u, 
and ¢(u) and c(u) are odd functions of u. By 
considering the integral [C(u)du once around 
the boundary of a fundamental period par- 
allelogram, we have 


1023 — 3, x 
50, —1,0, >= t—i, wl? dE (8) 
2 0i 
130 — 5035 


which is called the Legendre relation. 
The derivative 


g'(u)= —25 l/(u—Qy 


of a g-function is an elliptic function of order 
3 and bears the following relation to (o(u): 


('(u))’ —4(9()? —g @(u)—g3 
=4(9(u)—e,)(@(u) — e3)(o(u) — e3), 
4,—60Y'1/O*,  g,—140Y"1/05, 


e=olo),  i-1,2,3. (9) 
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By differentiating this relation successively, 
we see that o" (u) is expressed as a polynomial 
in (u) if n is an even number and as a prod- 
uct of polynomials in g(u) and $o'(u) if n is an 
odd number. 

In particular, writing (o(u) —z in (9) we find 
that the -function is the inverse function of 
the elliptic integral 





| E dz 

u= 

c / 4z? — g4z2—g4 

(— Appendix A, Table 16.IV). 

Any elliptic function can be expressed in 
terms of Weierstrass's functions. Specifically, 
let the poles of f(u) and their orders be a,, 
45, ..., äm and hi, ha, ..., hm, respectively, and 
let the principal part in the expansion of f(u) 
near the pole a, be 


hy Ay 


j=1 (u—a,) i 





k21,2, ...,m. (10) 
Then we obtain 
f(u)-C- DE 

k-1 


hk (—1) Ay; 
4 > CUA 

ja (j— 1)! 
where C is a constant depending on f(u). This 
can be reduced to 


f(u)— A-- Be) 


by using the addition theorems (— Appendix 
A, Table 16) for (o and zeta functions, where A 
and B are rational functions of D(u). Therefore, 
given two elliptic functions with the same 
periods, after expressing them as rational 
functions of o and ø’ in the above form and 
eliminating o and o, we obtain an algebraic 
equation with constant coefficients. In par- 
ticular, for any elliptic function f(u) we ob- 
tain an falgebraic differential equation of the 
first order by using this method, with f'(u) an 
elliptic function with the same periods. Fur- 
thermore, the functions f(u 4- v), f(u), and f(v) 
satisfy an algebraic equation. Thus for any 
elliptic function an algebraic addition theorem 
holds. 


ua). (11) 


G. Elliptic Functions of the Second Kind 


As an extension of the definition of elliptic 
functions, if a *meromorphic function f satis- 
fies the relations 


f(u* 20.) - u, f(y), ZU Vets) (12) 


(u; and u, are constants) with the fundamental 
periods Zon, 20, we call f(u) an elliptic func- 
tion of the second kind. What we have called 
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simply an elliptic function can now be called 
an elliptic function of the first kind. For con- 
stants p and v, the function 


f(u) — e" o(u —vy/o(u) (13) 


is an example of an elliptic function of the 
second kind. In this case 


seit Zon, j=1,2, (14) 


Furthermore, for given constants u, and H3, 
any elliptic function of the second kind is 
expressed as the product of an elliptic function 
of the first kind and the function (13) with p 
and v determined by (14). 


H. Elliptic Functions of the Third Kind 


If a meromorphic function f satisfies 
f(u--20)-e*"*5f(u,  i-1,3 (15) 


(a; and b; are constants) with periods 2o, 20, 
we call it an elliptic function of the third kind 
(— 3 Abelian Varieties I). 

The Weierstrass sigma function o(u) is an 
example of an elliptic function of the third 
kind. The functions o,, 63, and o3, defined by 
the equations 
o,(u)= ————_—_——.,__ i= 1, 2, 3, (16) 
are also elliptic functions of the third kind. In 
the case of elliptic functions of the second and 
third kinds, 20, and 2%, are not periods in 
the strict sense defined earlier, but are conve- 
niently referred to as the periods. The func- 
tions o; (i= 1,2, 3) are called cosigma functions. 


I. Theta Functions 


The theta functions, or more strictly, elliptic 
theta functions, are defined by 


Air, :) 22 Y. (—1)'q*!?* sin(2n + 1) x0, 
n=Q 

9, (v, t) 22 Y. q*!2* cos(2n + 1)nv, 
n=0 


Ai, t) 2 1-- 2 Y. q” cos2nnv, 


n=1 
9, (v, c) 21-2 Y. (—1)"q" cos 2nnv, (17) 
n=1 


where q— e'*, Imt>0. We call (17) the q- 
expansion formulas of the theta functions, and 
we sometimes write % in place of 94. A theta 
function is an elliptic function of the third kind 
with periods 1 and z. Any elliptic function can 
be expressed as a quotient of theta functions 
(— Appendix A, Table 16 for specific exam- 
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ples). The q-expansion formula is quite suit- 
able for numerical computation because of 

its rapid convergence; its terms decrease as the 
n? powers of q as n> oo. 

An elliptic function with the fundamental 
periods 2m, and Ze, can also be viewed as 
having the periods 2c; 220, 205 = —2a,. 
Consequently, theta functions formed with the 
parameter t= ,/w, can be expressed in terms 
of the parameter d 20/0, = —0,/04— —l/t, 
and we have 


d, (v, t) 149, (v/v, —1/1), 
Hie, 1) = A34 (v/v, —1/1), 
33(v, x) = A9, (v/T, —1/1), 
9, (v, 1) = A3, (v/v, —1/1), 
A — JJ ift exp( — niv? /1). (18) 


These are called Jacobi's imaginary trans- 
formations. If Im(—1/t)> 1, then |q| a 1, and 
therefore the series in the q-expansion for- 
mula converges slowly. However, by an imag- 
inary transformation we get Im(t)> 1 and 
\q|=0, so that computations become much 
easier. 

Each of the theta functions satisfies, as a 
function of two variables u and t, the follow- 
ing partial differential equation of the heat- 
conduction type: 


6? G(u, t)/du? = 4ni8.9(u, x)/0x (19) 
(— Appendix A, Table 16.IT). 


J. Jacobi’s Elliptic Functions 


C. G. J. Jacobi defined elliptic integrals as 
inverse functions of elliptic integrals of the first 
kind in the Legendre-Jacobi standard form (1). 
They are, in the above notation, 


E alu) 3,(0)9,() 
SWEN ees u ORO) SE 
` ent 95(0) 92(0) 
oul 3, (0) 3, (v) í 
020). LORN) 
05 (u) 34 (0)3, (v) ? 


where w—4/e, Uu and ps Milon, 


These functions satisfy the relations 








(21) 





(22) 


sn?w+cn?w=1, k?sn?w+dn?w=1, (23) 








where 
p 62783 00) (24) 
€, —€3 (9,(0))* 


The constants k and k’=./1—k? are called 


the modulus and the complementary mod- 
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ulus, respectively. Furthermore, the relation 
dsnw/dw — cn wdn w holds. The function z 
—snw is the inverse function of the elliptic 
integral (1) (— Appendix A, Table 16.III). 
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A. General Remarks 


Suppose that we are given a relation R be- 
tween elements of a set X such that for any 
elements x and y of X, either xRy or its nega- 
tion holds. The relation R is called an equiva- 
lence relation (on X) if it satisfies the following 
three conditions: (1) xRx, (2) xRy implies yRx, 
and (3) xRy and yRz imply xRz. Conditions 
(1), (2), and (3) are called the reflexive, sym- 
metric, and transitive laws, respectively. To- 
gether, they are called the equivalence prop- 
erties. Condition (1) can be replaced by the 
following: (1') For each x there exists an x’ 
such that x Rx'. The relation “x is equal to y" 
is an equivalence relation. If xRy means that x 
and y are in X, then R is also an equivalence 
relation. An equivalence relation is often de- 
noted by the symbol ~. The relations of con- 
gruence and similarity between figures are 
equivalence relations. If X is the set of integers 
and x= y means that x — y is even, then the 
relation = is an equivalence relation. 
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B. Equivalence Classes and Quotient Sets 


Let R be an equivalence relation. “xRy” is 
read: *x and y are equivalent" (or "x is equiva- 
lent to y"). The subset of X consisting of all 
elements equivalent to an element a is called 
the equivalence class of a. By (1), (2), and (3), 
each equivalence class is nonempty, the equiv- 
alence class of a contains a, and different 
equivalence classes do not overlap. Namely, 
X is decomposed into a fdisjoint union of 
equivalence classes. This tpartition is called 
the classification of X with respect to R. For 
example, the set of integers is classified into the 
equivalence class of even numbers and that of 
odd numbers by the relation =. Conversely, 
since the relation “x and y belong to the same 
member of a partition" is an equivalence re- 
lation, we can regard any partition as a classi- 
fication. Àn element chosen from an equiva- 
lence class is called a representative of the 
equivalence class. In the example we can take 
0 and 1 as the representatives of equivalence 
classes of even and odd numbers, respectively. 

X/R denotes the set of equivalence classes of 
X with respect to R, and is called the quotient 
set of X with respect to R. The mapping p: X ^ 
X/R that carries x in X into the equivalence 
class of x is called canonical surjection (or 
projection). The idea of equivalence relations 
can be generalized to deal with the case when 
X is a !class. 


C. Stronger and Weaker Equivalence Relations 


Let R and S be two equivalence relations on 
X. If xRy always implies xSy, then we say that 
R is stronger than S, S is weaker than R, the 
classification with respect to R is finer than the 
one with respect to S, or the classification with 
respect to S is coarser than the one with re- 
spect to R. The relations “x is equal to y" and 
“x and y are in X" are the strongest and the 
weakest equivalence relations, respectively. 
Any two equivalence relations on X are 
ordered by their strength, and the set of 
equivalence relations on X forms a tcomplete 
lattice with respect to this ordering. 
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A. General Remarks 


The origin of ergodic theory was the so-called 
ergodic hypothesis, which provided the foun- 
dation for classical statistical mechanics as 
created by L. Boltzmann and J. Gibbs toward 
the end of the 19th century (— 402 Statistical 
Mechanics). Attempts by various mathemati- 
cians to give a rigorous proof of the hypoth- 
esis resulted in the recurrence theorem of 

H. Poincaré and C. Carathéodory and the 
ergodic theorems of G. D. Birkhoff and J. von 
Neumann, which marked the beginnings of 
ergodic theory as we know it today. As the 
theory developed it acquired close relation- 
ships with other branches of mathematics, for 
example, the theory of dynamical systems, 
probability theory, functional analysis, number 
theory, differential topology, and differential 
geometry. 

The principal object of modern ergodic 
theory is to study properties of tmeasurable 
transformations, particularly transformations 
with an invariant measure. In most cases, the 
transformations studied are defined on a Le- 
besgue measure space with a finite (or c-finite) 
measure. À Lebesgue measure space with a 
finite measure (o-finite measure) is a *measure 
space that is measure-theoretically isomorphic 
to a bounded interval (to the real line) with the 
usual tLebesgue measure, possibly together 
with an at most countable number of atoms. It 
is known that any separable complete tmetric 
space with a complete regular Borel tproba- 
bility measure is a Lebesgue measure space 
with a finite measure. We assume, unless stated 
otherwise, that the measure space (X, 4,m) is a 
Lebesgue measure space. All the subsets of X 
mentioned are assumed measurable, and a pair 
of sets or functions that coincide almost every- 
where are identified. We use the abbreviation 
“a.e.” to denote “talmost everywhere.” 


B. Ergodic Theorems 


Let (X, Z, m) be a o-finite measure space. 

A transformation o defined on X is called 
measurable if for every BEB, !(B)e.A 
*bijective transformation o on X is called 
bimeasurable if both o and o ! are measur- 
able. À measurable transformation q is called 
measure-preserving (or equivalently, the mea- 
sure m is invariant under ¢) if min !(B)) 2 m(B) 
holds for every B. It is called nonsingular if 
mio !(B)) 20 whenever m(B) —0, and ergodic 
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if m((p ! (B)JU B)—(o * (B) B)) 20 implies 
either m(B) 2 0 or m(X — B)=0. 

The mean ergodic theorem of von Neumann 
(Proc. Nat. Acad. Sci. US, 18 (1932)) states that 
if o is a measure-preserving transformation or 
(X, 42, m), then for every function f belonging 
to the tHilbert space L;,(X)— L,(X,22,m) (— 
168 Function Spaces), the sequence 


HECKEN 


converges in the tnorm of L,(X) as noo to a 
function f* that satisfies f *(ox) = f *(x) a.e. The 
individual (or pointwise) ergodic theorem of 
Birkhoff (Proc. Nat. Acad. Sci. US, 17 (1931)) 
states that for every f belonging to the tfunc- 
tion space L,(X), the sequence A, f(x) con- 
verges a.e. to f *. From either of these theo- 
rems it follows that for any set E satisfying 

o !(E)—E and m(E) « co, the limit function f * 
satisfies |, f * dm= f; fdm. In particular, if m(X) 
— 1 and 9 is ergodic, then the limit f * equals 
the constant | fdm a.e. This fact therefore gives 
a mathematical justification to the ergodic 
hypothesis, which states that the *time mean" 
(ZC f(o* x))/n of what is observable over a 
sufficiently long time can be replaced by the 
“phase mean" f fdm. Both von Neumann's 
theorem and Birkhoff's theorem were sub- 
sequently generalized in various directions by 
many authors. 

(1) Mean ergodic theorems are concerned 
with the tstrong convergence of the sequence 
of averages A, — (Zi T")/n of the iterates of a 
*bounded linear operator T on some tBanach 
space. À generalization of von Neumann's 
theorem due to F. Riesz, K. Yosida, and S. 
Kakutani dispenses with the assumptions that 
the linear operator T is induced by a measure- 
preserving transformation o and that T acts 
on the Hilbert space L,(X). A version of this 
generalization states that if a linear operator T 
defined on a Banach space & satisfies the 
conditions 

Int ; 
(i) supl- Y, T* | «oo, 
Ry-o 


nzi 














(ii) lim IT" =0, 
noH 


then for an element f € 4 the sequence of 
averages A, f converges strongly to an element 
f*e if and only if there exists a subsequence 
converging *weakly to f *. From this theorem 
of Riesz, Yosida, and Kakutani follows the 
L,-mean ergodic theorem (1 < p< oo) for so- 
called Markov operators: If T is a linear oper- 
ator defined on each of the Banach spaces 

Ly (X) (1 <p & oo) by means of the formula 
Tf(x) 2 Lf()P(x, dy), where P(x, B) is the ttran- 
sition probability of a Markov process on 
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(X, 2) leaving the measure m invariant (i.e., 

| P(x, B)dm=m(B) for every Be 2) (— 261 
Markov Processes), then for every f belonging 
to L (X) (1 « p « oo) the sequence A,,f con- 
verges in the norm of L,(X) to a limit function 
f*. 

(2) Birkhoff's ergodic theorem has been 
extended to the following individual ergodic 
theorem by E. Hopf (1954): If T is a tpositive 
linear operator mapping L,(X) into L,(X) 
and L,,(X) into L,,(X) with | T], <1 and 
| Tli, <1, then for every f in L,(X) the se- 
quence A, f converges a.e. to a limit f *. If 
T is a Markov operator, then T maps each 
Lj (X) into itself and satisfies | T ||, & 1 for 
each p (1 <p < oo), and therefore Hopf’s er- 
godic theorem applies to such T. Special cases 
of this theorem were proved earlier by J. 
Doob and by Kakutani. Later, N. Dunford 
and J. Schwartz showed that the assump- 
tion of the positivity of T can be dispensed 
with in Hopf's theorem. For a positive linear 
operator T on L,(X) satisfying ||T ||, € 1, R. 
Chacon and D. Ornstein (1960) proved that 
the ratio ergodic theorem holds: For every pair 
of functions f and g in L,(X) with g>0 a.e., 
Wm, 22-6 T*f(x)/X2-1 T*g(x) exists and is 
finite a.e. on the set (xl T*g(x) » 0). This 
theorem extends earlier results of Hopf and 
W. Hurewicz dealing with special classes of 
operators arising from measurable transfor- 
mations. Hopf's ergodic theorem can be de- 
duced from the Chacon-Ornstein theorem, 
while it is known that there are positive oper- 
ators T on L,(X) satisfying ||T ||; 1 for which 
lim, ,, A, f fails to exist on a set of positive 
measure for some fe L,(X). This shows that 
the assumption | T | ,, € 1 is crucial in Hopf's 
theorem. 

(3) As was the case in the original proof by 
Birkhoff of his ergodic theorem, every known 
proof of an individual ergodic theorem de- 
pends crucially on the so-called maximal 
ergodic lemma (or maximal inequality). For 
the case of a positive linear operator T on 
L (X) with | T ||, € 1, Hopf proved the rele- 
vant maximal ergodic lemma: If E( f) is the set 
{x|sup,>,A,f(x)>0} for each f in L’(X), then 
feg) f dmz 0. Hopf's original proof of this 
lemma was quite intricate, but A. Garcia 
(1965) succeeded in giving an extremely sim- 
ple and elegant proof. From the maximal er- 
godic lemma the following so-called domi- 
nated ergodic theorem can also be obtained: 
Let T be a linear operator mapping each 
L,(X) into L,(X) satisfying | T ||, «1 for each 
p (I &p& oo). If for f in L,(X) we let f(x)— 
sup, s 11A, f(x), then if 1 «p « oo we have 


5 2P 
| fo^ dm «E [uearan 
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while if p=1 and m(X) « oo we have 
[Foamea] [mco | uates! sid | 


This theorem was obtained first by N. Wiener 
(1939) for the special case of T induced by a 
measure-preserving transformation. M. Ak- 
coglu (1975) proved an isometric “dilation” 
theorem for positive contractions (1.e., positive 
linear operators T for which |[T || <1) in the 
space L,(X) for 1 « p « oo, by means of which 
he was able to deduce a dominated ergodic 
theorem for an arbitrary positive contraction 
in L,(X) from the corresponding theorem for a 
positive linear isometry in L,(X) proved earlier 
by A. Ionescu-Tulcea. From this theorem of 
Akcoglu one can obtain the following indi- 
vidual ergodic theorem: If T is a positive con- 
traction in L,(X) for some p with 1 « p « oo, 
then for every f in L,(X), the sequence A, f 
converges a.e. 

(4) Both mean and individual ergodic theo- 
rems can be extended without difficulty to 
a continuous time parameter semigroup 
(1,|£20] of bounded linear operators such 
that 7, 7; — T, (1, — I), under a suitable con- 
tinuity assumption on T, with respect to t, by 
replacing the discrete time average (31-3 T*)/n 
with (f6 T; ds)/t. Further extensions to n- 
parameter semigroups were obtained by N. 
Wiener and by N. Dunford and A. Zygmund. 
For mean ergodic theorems, even further 
extensions were possible to tamenable semi- 
groups of bounded linear operators. For 1- 
parameter semigroups, the behavior of the 
mean at zero, (fo T,ds)/t as t|O (local ergodic 
theorem) or Affe ^ T;fds as 4|0 or Atco 
(Abelian ergodic theorem), has also been in- 
vestigated by Wiener, U. Krengel, E. Hille, 
Yosida, and others. Abelian ergodic theorems 
are related to properties of the tresolvent of 
the semigroup {7;} (— 378 Semigroups of 
Operators and Evolution Equations). Further 
extensions of all these theorems in various 
directions have been given by many authors; 
for these extensions and related topics — 
[5-7]. 

(5) J. F. C. Kingman (1968) proved an inter- 
esting and useful extension of (both mean and 
individual) ergodic theorems, called the sub- 
additive ergodic theorem. A real-valued 'sto- 
chastic process (X; ,|O<i<k, k=1,2,3,...} 
is called a subadditive process if it satisfies the 
following conditions: (i) Whenever i<j<k, 
Xj, € X; j- X; ,. (ii) The tjoint distribution of 
{Xj+1,j+1} is the same as that of | X; ;}. (iii) The 
Texpectation g, = E(X, ,) exists, and satisfies 
gZ — Ak for some constant A and for all 
k> 1. Kingman proved that if ( X; ,] is a sub- 
additive stochastic process, then the limit £— 
lim, ,, (1/n) X, , exists a.e. and in the mean, 
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and E(£) — lim, ,, (1/n)g,. Note that if o isa 
measure-preserving transformation on a finite 
measure space (X, Z, m) and if f is an element 
of L,(X), then X, ,— 275] fo o! for 0<i<k, 
where k — 1,2,3... defines a subadditive pro- 
cess (in fact, X, =X; ,-- X; , fori «j « k). The 
multiplicative ergodic theorem, proved by V. 
Oseledec, which plays a significant role in 
ergodic theory of tnonhyperbolic smooth 
dynamical systems and which has found ap- 
plications also in *algebraic groups, can be 
derived from the subadditive ergodic theorem 
[8,9]. 


C. Recurrence and Invariant Measures 


In this section we assume that the measure 
space (X, 2, m) is nonatomic. A nonsingular 
measurable transformation ¢ defined on 

(X, B,m) is called recurrent (infinitely recurrent) 
if for every set B and for almost all xe B, there 
exists an ne Z/ (infinitely many ne Z *) such 
that o"(x)e B. A set W is called wandering 
under Q if o "(W)0 o *(W)- Ø fornzk. The 
transformation q is called conservative if no 
sets of positive measure are wandering under 
9, and incompressible if B ' B implies 
m(B— o !B)-0. The following statements 
about a nonsingular measurable transforma- 
tion ọ are equivalent: (i) ọ is recurrent; (ii) @ is 
infinitely recurrent; (iii) o is incompressible; 
(iv) ọ is conservative. An immediate con- 
sequence of this is the following recurrence 
theorem of Poincaré (in the form formulated 
by Carathéodory): A measure-preserving 
transformation on a finite measure space is 
infinitely recurrent. In fact, in order for a non- 
singular measurable transformation q to be 
recurrent (and hence infinitely recurrent) it 

is sufficient that there exist a finite measure 

H invariant under o and equivalent to (i.e., 
mutually *absolutely continuous with) the 
given measure m. 

The invariant measure problem is one of the 
basic problems in ergodic theory and is for- 
mulated abstractly in the following way: Given 
a nonsingular measurable transformation p on 
a o-finite measure space (X, #, m), find neces- 
sary and sufficient conditions for the existence 
of a finite (or o-finite) measure invariant under 
om and equivalent to m. The given measure m 
specifies only the class of equivalent measures 
among which an invariant measure is to be 
found. Therefore we can assume without loss 
of generality that m is a finite measure. For the 
remainder of this section, unless we explicitly 
state otherwise, we always mean by an invar- 
iant measure the one that is equivalent to m. 

The Poincaré recurrence theorem states that 
gy being recurrent is necessary for the existence 
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of a finite invariant measure. The recurrence of 
9 is, however, not sufficient, since an ergodic 
transformation with an infinite but o-finite 
invariant measure is recurrent and has no 
finite invariant measure. A necessary and 
sufficient condition for the existence of a finite 
invariant measure was given by a theorem of 
A. Hajian and Kakutani (1964) which states: o 
has a finite invariant measure if and only if « 
has no weakly wandering sets (a set W is called 
weakly wandering under ¢ if there exists an 
infinite subset {n,} of Z* such that o "Wn 
o "W- OG, kj) Hajian also proved that a 
bimeasurable transformation q has a finite 
invariant measure if and only if ¢ is strongly 
recurrent in the following sense: For every set 
E with m(E) 0, there exists a positive integer 
k — k(E) such that 

max m(o" JEN E)>0 

O<j<k 

for every ne Z. 

H. Furstenberg [11] obtained the following 
striking extension of the Poincaré recurrence 
theorem: If a bimeasurable transformation 
Q possesses a finite invariant measure, then 
for every set E with m(E)>0 and for every 
integer k >2, there exists n>1 such that 
MEN "En g?"En...o* Y"£)>0. From this 
theorem one can deduce a difficult theorem 
of E. Szemerédi on tarithmetic progressions 


which states: Any subset of the integers having ` 


positive tupper density contains arithmetic 
progressions of arbitrary length. In fact, it is 
not difficult to show that the theorems of 
Furstenberg and of Szemerédi are mutually 
equivalent, and for this reason the theorem of 
Fürstenberg is sometimes referred to as the 
ergodic Szemerédi theorem. 

For a nonsingular bimeasurable transfor- 
mation q, a pair of sets A and B are said to 
be countably equivalent under ¢ if there exist 
countable decompositions (A,|ke Z * | and 
(B,|ke Z^) for A and B, respectively, and an 
infinite subset {n,' of Z such that 9A4,— B, 
for each k. A and B are said to be finitely 
equivalent under ¢ if finite decompositions 
{A,} and {B,} can be chosen. It was proved by 
Hopf that (i) is recurrent if and only if no set 
of positive measure is finitely equivalent under 
gy to one of its proper subsets, and (ii) g has a 
finite invariant measure if and only if no set of 
positive measure is countably equivalent under 
ọ to one of its proper subsets. It can be shown 
that if o is ergodic, then @ has no o-finite 
invariant measure if and only if every pair of 
sets of positive measure are countably equiva- 
lent under q. 

The first example of a transformation ad- 
mitting no o-finite invariant measure was 
constructed by Ornstein in 1960. Since then, a 
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number of simpler examples have been ob- 
tained by L. Arnold, Brunel, and others. It is 
now known that there are many different types 
of transformations having no o-finite invariant 
measure (— Section F). Furthermore, it was 
shown by Ionescu-Tulcea that in the group of 
all nonsingular bimeasurable transformations 
with a suitable metric, those having a o-finite 
invariant measure form a subset of the first 
category. 

Various extensions of results on the invar- 
iant measure problem for nonsingular trans- 
formations to the case of Markov processes 
having nonsingular transition probabilities 
were obtained by Y. Ito, J. Neveu, S. Foguel, 
and others [12]. 

For investigation of detailed properties of 
particular transformations arising from classi- 
cal dynamical systems, problems in number 
theory, and so on, rather than the existence of 
invariant measures it is more important to 
determine a specific form of an invariant mea- 
sure with desirable properties and to develop 
methods to decide when such a measure is 
unique. Various people have considered spe- 
cial classes of transformations; these workers 
have been able to obtain explicit descriptions 
of invariant measures with nice properties for 
the transformations in question and have 
derived interesting consequences. Most im- 
portant among these is the so-called Gibbs 
measure, introduced and investigated by Ya. 
Sinaí. He obtained this notion by generalizing 
the concept of the equilibrium Gibbs distri- 
bution, which plays a prominent role in tstatis- 
tical mechanics. It is defined in the following 
way: Let X be a compact metric space, o a 
homeomorphism on X, and Go a probability 
measure on X invariant under o. For a func- 
tion g belonging to L (X) and for m, n>0 let 


«atu | ew( $ veau 


X 


and form a sequence of probability measures 
Lm,.n(g) absolutely continuous with respect to 
Ho, for which the *Radon-Nikodym derivative 


du, (x|g) _ EXP Ek- gies 
duo(x) Ge, „(g | Ho) 


holds. A measure that is a limit point, in the 
sense of *weak convergence, of the sequence of 
measures Ha ,(g) is called a Gibbs measure 
constructed from jg and g. It is clear that a 
Gibbs measure is invariant under o. For mix- 
ing topological Markov shifts (— Section D), 
Sinai showed that if one starts with the invar- 
iant measure jọ of maximal entropy (— Sec- 
tion H), the existence of which was earlier 
shown by W. Parry, one can determine a class 
of functions g for which the Gibbs measure 
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is unique, i.e., u(g) — Daa. Da. n(g) exists. 
Sinaí further investigated the properties of the 
unique measure u(g) in detail. R. Bowen and 
D. Ruelle, defining the Gibbs measure some- 
what differently, investigated the existence 
and uniqueness of such measures, thereby 
recapturing the results of Sinaí in the case of 
mixing topological Markov shifts. With the 
aid of Markov partitions (— Section G), these 
results on Gibbs measures were carried over 
to the case of tAnosov and “taxiom A" diffeo- 
morphisms, and they provided essential tools 
in the investigation of the ergodic behavior of 
these transformations. For details — [13, 14]. 

The explicit form of the density of the in- 
variant measure with respect to the Lebesgue 
measure for the transformation associated 
with fcontinued fraction expansion was al- 
ready known to Gauss, and one can draw 
from it numerous conclusions about metric 
properties of continued fractions. The way in 
which Gauss was able to determine this invar- 
iant measure, however, was never explained. 
Recently, Sh. Ito, H. Nakada, and S. Tanaka 
(Keio Eng. Rep., 30 (1977)) developed an inter- 
esting method to describe the mechanism used 
to arrive at this density function for the invar- 
iant measure for the continued fraction trans- 
formation. They employed similar methods in 
subsequent work to determine explicit density 
functions for invariant measures for other 
related number-theoretic transformations and 
for certain classes of continuous mappings 
over an interval; by means of the explicit forms 
of invariant measures they were able to de- 
scribe the metric properties of these transfor- 
mations in detail. 


D. Examples and Construction of Measure- 
Preserving Transformations 


Examples of measure-preserving transforma- 
tions appear in many different contexts. We 
describe some of the important ones. 

(1) Let G be a *locally compact Abelian 
group satisfying the second axiom of *counta- 
bility (— 423 Topological Groups), Z a a- 
algebra of Borel subsets of G, and m its tHaar 
measure (normalized if G is compact). Then 
(G, Z, m) is a Lebesgue measure space. For a 
fixed element g,€ G, define the transforma- 
tion o, :GG by 9,.(g) 2 g * go. Then @,, is 
a bijective measure-preserving transforma- 
tion on (G, Z, m) and is called the rotation 
on G by the element go. If G is compact, then 
the rotation 9, is ergodic if and only if the 
cyclic subgroup generated by the element go is 
dense in G. If this happens, the element g, is 
called the topological generator of G. A group 
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is called monothetic if it has a topological 
generator. 

(2) If ọ is a group *endomorphism of a com- 
pact Abelian group G, then o preserves the 
Haar measure m. If ọ is a group tautomor- 
phism, thén it is a bijective measure-preserving 
transformation on (G, Z, m). A continuous 
group automorphism q induces a group auto- 
morphism ø* of the character group G*. 

The measure-preserving transformation o is 
ergodic if and only if every character except 
the identity has an infinite orbit under the 
induced automorphism q*. When the group 
is the n-dimensional torus T", a continuous 
group automorphism q is uniquely repre- 
sented by an n x n matrix with integer entries 
and with determinant +1. In this case, ¢ is 
ergodic if and only if no roots of unity appear 
among the feigenvalues of the representing 
matrix. 

(3) Let (Y, .«7) be a measurable space, let 
(Y,, %,) 2 (Y, 7) for each ne Z, and define 
(Y*, .27*) to be the *product measurable space 
(lez Yn Inez Sna). The transformation o 
defined on (Y *, 5) by 


y*z(...,y-p Yo Y1) 
29(y*)-(.... Y 5 Mos Yis) 


with y, — y,,, for each n is called the shift 
transformation. Let o be a probability measure 
on (Y, £) such that (Y, 7, u) is a Lebesgue 
measure space, let 1, — u for each n, and define 
u* to be the tproduct measure [T,.z Hn on 
(Y*,.27*). Then (Y*, /*, u*) is a Lebesgue 
measure space, and the shift transformation @ 
is a bijective measure-preserving transforma- 
tion. Considered with the product measure, o 
is called a generalized Bernoulli shift. When the 
set Y is at most countable and the measure p 
on Y is given by a sequence {p;} of positive 
numbers with £ p;— 1, ¢ is called a Bernoulli 
shift. Suppose that P(y, A) is a Markov tran- 
sition function on (Y, land z is a proba- 
bility measure invariant under P(y, A). Then 
we can define the Markov measure z* on the 
product space (Y *,.«7*) by setting 


en | E |, sts) Pos dri) 


x P(yi. dy5)... P(y, 1, 4s), 


for a cylinder set 


em Cn.) 
j<i i<k<its n>its 


and extending it to all of ./*. The shift trans- 
formation o preserves the Markov measure 
1*. Considered as a measure-preserving trans- 
formation on (Y*, «7*, 1*), o is called a Mar- 
kov shift. 
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A generalized Bernoulli shift is always 
ergodic. A Markov shift is ergodic if and only 
if the corresponding Markov process is tirre- 
ducible, which is the case if and only if the 
following property is satisfied: For every pair 
of sets A and B with z(A)x(B) » 0, there exists 

an neZ* such that f4 P"(y, B)dz — 0. 
There are other measures besides the prod- 
uct measure and the Markov measure that 
can be defined on the product space (Y *, .«7*) 
and are invariant under the shift transforma- 
tion g. For example, if Y is a Borel subset of 
R and .x is the c-algebra of Borel subsets of Y, 
then any tstationary stochastic process taking 
values in Y induces such a measure on 
(Y*,.27*). When considered with a measure of 
this type, the shift transformation q is called 
the shift associated with the stationary process. 
Properties of the shift associated with a 
'stationary Gaussian process have been inves- 
tigated by G. Maruyama, I. Girsanov, H. 
Totoki, and others. In particular, it is known 
that the shift is ergodic if and only if the tspec- 
tral measure for the tcovariance function of 
the associated Gaussian process is continuous 
[15]. 

(4) Other important examples of measure- 
preserving transformations arise from classical 
dynamical systems, which will be described in 
Section G. 

(5) There are several ways of constructing 
new measure-preserving transformations from 
given ones. We describe important cases. 

(i) Let o be a nonsingular, measurable, 
recurrent transformation (not necessarily 
measure-preserving) on a o-finite measure 
space (X, Z, m), and let A be a set of positive 
measure. For xe A, let n(x)=min{neZ* | 
q"(x)e A}. The transformation o," A A de- 
fined by o,(x) 2 o"? (x) is a nonsingular mea- 
surable transformation on the measure space 
(A, ANA, mj), where m,(B)=m(AN B)/m(A), 
and it is measure-preserving if qo is. We call 
p4 the transformation induced by ¢ on A. It is 
ergodic if @ is ergodic. 

(ii) Let ọ be a nonsingular measurable 
transformation on a c-finite measure space 
(X, 22, m), and suppose that {A,} is a countable 
(possibly finite) partition of X. Define a func- 
tion f: X 9Z* by setting f(x)=n for xe A,, 
and let X 2 ((,j)|xe X, 1«j« f(x)] bea 
subspace of the product measure space (X x 
Z*,Bx€,mx u), where € is the c-algebra 
of all subsets of Z*, and u is the measure on 
(Z+, €) defined by u((n]) —1 for each get", 
The transformation $ defined on X by $(x,j) 
=(x,j+1) if 1«j«f(x) and =(¢(x), 1) if j= 
J(x) is a nonsingular measurable transforma- 
tion on the measure space (X,(@ x @).X, (m x 
H)s), and it is measure-preserving if @ is. We 
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call @ the transformation built from o with the 
ceiling function f. If is ergodic and m(A,)0 
as n oo, then @ is also ergodic. 

(iii) Suppose that y is a measure-preserving 
transformation on (X, Z, m) and o, isa 
measure-preserving transformation on 
(Y, £, u) for each xe X. Assume that the map- 
ping (x, y) 2 o, (y) is measurable with respect 
to the c-algebras Z x a and a The trans- 
formation 0 defined on the product space 
(X x Y, Bx sf, mx u) by 066 y) - (V(x), P(Y) 
is measure-preserving and is called the skew 
product of y and {¢,}. If o, — o for all xe X, 
then we get a direct product transformation 
A(x, y) = (W(x), oy). 

(iv) A measure-preserving transformation p 
on (Y, £, ui) is said to be a factor transforma- 
tion (or a homomorphic image) of a measure- 
preserving transformation y on (X, 2, m) if 
there exists a measurable transformation 5 
from X onto Y such that mon ! = u and py = 
ny. If Z' is a c-subalgebra of 2 and y leaves 
R invariant De, y ! 8B c 4), then y induces a 
factor transformation o on the measure space 
(X, B’,m). Conversely, if a measure-preserving 
transformation q on (Y, £, u) is a factor trans- 
formation of y on (X, Z,m) via a mapping rm, 
then Z' =n | oA is a o-subalgebra of 4 invar- 
iant under y. 

(6) A one-parameter family («,|te R) of 
bijective measure-preserving transformations 
on a measure space (X, 28. m) is called a flow. 
A flow is called continuous if the mapping t 
T, is *weakly continuous where {T;} is the 
one-parameter family of tunitary operators 
on L?(X) induced by the flow {¢,}. A flow is 
called measurable if the mapping (t, x) q,(x) 
is a measurable transformation of R x X into 
X. A measurable flow is continuous. A. Ver- 
shik and Maruyama proved that for any con- 
tinuous flow there exists a measurable flow, 
unique in a specified sense, which is spatially 
isomorphic (in the sense specified in Section E) 
to the given flow. 

Important examples of flows are given by 
classical dynamical systems (— Section G), 
and by continuous-time stationary stochastic 
processes. 

An important tool in the study of flows is 
provided by the theorem of W. Ambrose and 
Kakutani: Every measurable ergodic flow 
without a fixed point is spatially isomorphic to 
an S-flow. A measurable flow [4] is called an 
S-flow (special flow or flow built under a func- 
tion) if there exist a measure-preserving trans- 
formation @ of a measure space (X, 28. m) and 
an R*-valued function f on (X, 4, m) such 
that each ¢, is a measure-preserving trans- 
formation on the subspace X = ((x,u)| xe X, 
Oxuxf(x)] of the product measure space 
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(X x R^, x Af, m x 4) given by 
p(X, u) 


=(x,u+t) if —u<t<—u+ f(x), 


n-1 
"(eec Y een) if 
n-1 n 
—ut Y. f(p*(x))<t<—ut+ ¥ f(o*(x)), 
k=0 k=0 
=f 
eier: Y so) if 


—1 -1 
s f(e*G))«t« c 3. Toto 
=- =-n+1 
nz 1. Here æ is the c-algebra of Borel subsets 
of R*, and 4 is the usual Lebesgue measure. 


E. Isomorphism Problems 


In this section we assume that the Lebesgue 
measure spaces considered are probability 
spaces. For simplicity, following the common 
usage among Russian mathematicians, we 
call a measure-preserving transformation on 
(X, 2, m) an endomorphism and a bijec- 
tive measure-preserving transformation an 
automorphism. 

An automorphism q, (a flow (9f!)) on 
(X4, 2,, m,) is said to be spatially isomorphic 
(or metrically isomorphic) to an automorphism 
Q9; (a flow (o?1) on (X,, B, m;) if there exist 
sets N, and N, with m,(N,) 2 m,(N,)—0 anda 
bijective measurable transformation 0 from 
X, — N, to X, — N, such that m; 00 —m, and 
09, — 9,0 (Oe! = 9? 0 for each t). 

Classification of automorphisms and flows 
into isomorphism classes constitutes the cen- 
tral problem of modern ergodic theory. Prop- 
erties of automorphisms and flows that are 
preserved under spatial isomorphisms are 
called isomorphism invariants (or metric in- 
variants). There are several isomorphism tn- 
variants that are essential to the study of the 
isomorphism problem. We describe these 
below for the case of automorphisms. There 
are corresponding invarianis for flows as well. 


(1) Spectral Invariants. Two automorphisms o 
and o, are said to be spectrally isomorphic if 
the tunitary operators T, and T, induced by 
o and o, on the Hilbert spaces L,(X,) and 
Lj(X,), respectively, are unitarily equivalent 
(Le, there exists an isometric isomorphism V 
of L,(X,) onto L,(X,) such that VT, = T; V). 
Properties preserved under spectral isomor- 
phisms are called spectral invariants (or spec- 
tral properties). If o, and ọ, are spatially iso- 
morphic, it is clear that they are spectrally iso- 
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morphic; but the converse is not true in 
general. 

(i) The property of o being ergodic is a 
spectral property since @ is ergodic if and only 
if the number 1 is a simple eigenvalue of the 
induced unitary operator T. If @ is ergodic, 
then the set of all eigenvalues of the induced 
operator T forms a subgroup of the circle 
group, each eigenvalue is simple, and each 
teigenfunction has constant absolute value. If 
the tspectrum of the induced operator T con- 
sists entirely of eigenvalues, @ is said to have 
discrete spectrum (or pure point spectrum). A 
theorem due to von Neumann and P. Halmos 
—the first theorem on the question of isomor- 
phism-—states that two ergodic automor- 
phisms p, and o; with discrete spectra are 
spatially isomorphic if and only if they are 
spectrally isomorphic, which is the case if and 
only if the induced operators T, and T, have 
the same set of eigenvalues. Furthermore, 
every ergodic automorphism with discrete 
spectrum is spatially isomorphic to an ergodic 
rotation on a compact Abelian group [2]. 

Analogous results were obtained by L. 
Abramov for a bigger class of automorphisms, 
namely, for ergodic automorphisms having so- 
called quasidiscrete spectra. 

(ii) An automorphism o is ergodic if and 
only if for every pair of sets A, B, 

n-1 
lim Al Y, m(o*(4)n al — m(A)m(B). 
n>% N \k=0 
Strengthening this condition, we can define p 
to be weakly mixing if for every pair of sets A, 
B 


H 


n> n 


lim (5 ten Ai" SEENEN 


strongly mixing if 


lim m(q*(A)0 B) =m(A)m(B); 


and k-fold mixing if for arbitrary choice of sets 
Aj, j=0,1,...,k, 


lim m(Ag 9": A, NA, ... (0 Q9" A,) 
—-m(Ag)m(A,) D m(A,), 


where the limit is taken as n4, nz, ...,n,— 

oo in such a way that n; <n, <... « n, and 
min, «;«,(n;— n;.,) oo. The property of an 
automorphism q being weakly mixing or 
strongly mixing is a spectral property. For 
instance, o is weakly mixing if and only if the 
number 1 is a simple eigenvalue and is the 
only eigenvalue of the induced operator T. It 
is also known that ọ is weakly mixing if and 
only if the direct product automorphism o x @ 
is ergodic. The set of all weakly mixing auto- 
morphisms forms a dense tG;-set in the group 
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of all automorphisms on (X, Z, m) considered 
with the so-called weak topology (Halmos's 
theorem). On the other hand, it was shown 
by V. Rokhlin that the set of all strongly mix- 
ing automorphisms is a set of first category 
with respect to the weak topology. However, 
there are only a few known examples of auto- 
morphisms that are weakly mixing but not 
strongly mixing. 

(iii) An automorphism q is said to have 
countable Lebesgue spectrum if the maximal 
spectral type of the induced unitary operator 
T restricted to the *orthocomplement of the 
subspace of constant functions in L, (X) is 
equivalent to the Lebesgue measure and its 
*multiplicity is countably infinite. 

(iv) An automorphism ọ is called a K- 
automorphism (or Kolmogorov automorphism) 
if there exists a o-subalgebra Z4, of 2 such that 
(a) Po D By and PB # Bo, (b) VnezP" Bo = B, 
and (c) Anez 9" 24, =, where M is the o- 
subalgebra of # consisting of null sets and 
their complements. The notion of a K-flow 
(or Kolmogorov flow) is defined similarly. K- 
automorphisms are k-fold mixing for all orders 
k and have countable Lebesgue spectra. 

Generalized Bernoulli shifts are all K- 
automorphisms. An ergodic Markov shift is a 
K-automorphism if and only if it is strongly 
mixing, which is the case if and only if the 
corresponding Markov process is tirreducible 
and taperiodic (— 260 Markov Chains B). A 
continuous group automorphism of a compact 
Abelian group is a K-automorphism if and 
only if it is ergodic. In particular, a continuous 
group automorphism of the n-dimensional 
torus T" is a K-automorphism if and only if no 
roots of unity appear among the eigenvalues of 
the representing matrix. Automorphisms and 
flows arising from classical dynamical systems 
also provide examples of K-automorphisms 
and K-flows. In particular, a tgeodesic flow on 
a surface of negative curvature is a K-flow, 
and each automorphism (except the identity) 
of this flow is a K-automorphism. For the shift 
transformation q associated with a stationary 
Gaussian process, it was shown by Maruyama 
[15] that ọ is (a) weakly mixing if and only if 
it is ergodic, (b) strongly mixing if and only 
if the covariance function of the associated 
Gaussian process tends to 0 as n— oo, and (c) 
a K-automorphism if and only if the fspec- 
tral measure of the covariance function is 
absolutely continuous with respect to the 
Lebesgue measure. 

(v) Examples of automorphisms having 
various types of spectra have been constructed 
by a number of authors by using stationary 
Gaussian processes and the theory of approxi- 
mation developed by A. Katok and A. Stepin 


[16]. 
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(vi) An ergodic automorphism with quasi- 
discrete spectrum has a mixed spectrum, that 
is, the spectrum of the induced unitary opera- 
tor T has a continuous component and eigen- 
values in addition to 1. Anzai (1951) con- 
structed a special class of skew product auto- 
morphisms having mixed spectra and showed 
that in this class there are automorphisms that 
are spectrally isomorphic but not spatially 
isomorphic. However, the question of whether 
two spatially nonisomorphic automorphisms 
exist among automorphisms having the same 
purely continuous spectrum remained un- 
answered for a long time, until in 1958 Kol- 
mogorov (Dokl. Akad. Nauk SSSR, (5) 119 
(1958)) settled it affirmatively by using a new 
isomorphism invariant called entropy. 


2. Generators and Entropy. (i) By a partition c 
={A,} of the space X we mean a collection of 
sets A, such that A, N A; = f whenever Az A 
and | ] A; — X. We denote by e the partition of 
X into individual points, and by v the trivial 
partition (Xj. A partition into a finite (count- 
able) number of sets is called a finite (count- 
able) partition. A partition Ets said to be 

finer than another partition C (or ¢ is coarser 
than č) if for every Acé there is a set Bec such 
that Ac B. For a collection (£,] of partitions 
of X, we denote by V,£, the coarsest partition 
that is finer than each £,, and by A,£, the 
finest partition that is coarser than each £,. If 
€, = d An is a sequence of countable (or finite) 
partitions, then \/,¢, is precisely the parti- 

tion of X into nonempty intersections of the 
form (, A,,, with A, , € Ča for each k. With 

a partition č of X we associate a o-subalgebra 
B(é) of B which is the c-algebra of all 2- 
measurable sets that are a union of elements in 
€. Two partitions c and C are said to coincide 
a.e. if Z(£) — All) a.e. De for every Ae BE) 
there exists a set B in (C) such that m(AU 

B — AN B)=0 and conversely). 

(ii) Suppose that q is an endomorphism of 
(X, 24, m) and £ a partition of X. By o ! £ we 
mean the partition (o !(4)| Ae}. If o is an 
automorphism, we also define o£ = lot AN 
Aeé}. A partition £ is called a generator for 
an endomorphism ọ if Vo "£- e a.s. If 
Viz -w 9^ 6 =£ a.s., € is called a two-sided gen- 
erator for an automorphism q. 

An endomorphism o is said to be periodic at 
a point xe X if there exists a positive integer n 
such that o"(x) — x and aperiodic if the set of 
points of periodicity has measure zero. If the 
measure space (X, 2, m) is nonatomic, then 
every ergodic endomorphism on it is aperi- 
odic. A theorem of Rokhlin states that every 
aperiodic automorphism ¢ has a countable 
two-sided generator. This implies that every 
such ¢ is spatially isomorphic to the shift 
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transformation on the infinite product space 
(ONT. PI considered with some invariant mea- 
sure uf, where each coordinate space Y; — Y 
has at most a countable number of points. 
Krieger improved this result by showing that if 
an ergodic automorphism q has finite entropy, 
then ¢ has a finite two-sided generator [17]. 
(iii) For a finite or countable partition č = 
{A,}, define the entropy H(¢) of the partition 
to be — X,m(A,)log(m(A,)) (the logarithms 
here and below are natural logarithms). We 
denote by Z the set of all partitions Z with 
H(&) « oo. If Ce Z', then for any endomorphism 
ọ the limit 


Wa, ¢)= lim — Ju (V. ex) 


exists and is finite. The entropy bio) of the 
endomorphism ¢ is defined to be sup(A(q, &)| 
če Z^) and is an isomorphism invariant. Prop- 
erties of entropy have been investigated exten- 
sively since the notion was introduced by 
Kolmogorov. We cite a few results. 

(a) If a partition če ¥ is a generator for an 
endomorphism q or a two-sided generator for 
an automorphism q, then A(q) — h(q, č) (Sinai’s 
lemma). (b) For every integer n, h(o")— |n|h(q), 
and for a measurable flow {@,}, hlo) =|t|h(@,) 
for every real number t. (c) If an automor- 
phism q is periodic, then h(@)=0. (d) If o, is a 
factor transformation of p, then bie, le h(q,). 
(e) bio, x q;) 2 h(q,) + h(q;). (There is a more 
complicated formula (due to Rokhlin) for the 
entropy of a skew product automorphism.) 

(f) If ọ is a recurrent automorphism and o, 

is the automorphism induced by o on a sub- 
set A with m(A) 0, then h(g,)=h(@)/m(A) 
(Abramov's formula). (g) If o is a Bernoulli 
shift with probability distribution {p,}, then 
h(q)- — X, p,log p,. (h) If o is a Markov shift 
based on the Markov transition probability P, 
(defined on a countable or finite state space) 
and an invariant measure "e, then 


h(p)= -Lim Plog P; 


(i) If ọ is ergodic and has a pure point spec- 
trum, or more generally, has a quasidiscrete 
spectrum, then h(g)=0. (j) For an ergodic 
group automorphism q on an n-dimensional 
torus, h(@)= > log{A|, where the sum is taken 
over all eigenvalues 4 of modulus » 1 of the 
representing matrix. (k) If an automorphism o 
has positive entropy, then in L;( X) there exists 
a subspace invariant under the induced uni- 
tary operator T such that the spectrum of T 
restricted to this subspace is countable Le- 
besgue (Rokhlin's theorem). It follows from 
(k) that automorphisms with ‘singular spectra 
or spectra of finite multiplicity must have zero 
entropy. In proving assertion (g), Kolmogorov 
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established for the first time the fact that there 
are uncountably many spatially nonisomor- 
phic Bernoulli shifts. 

(iv) An automorphism ø is said to have 
completely positive entropy if h(q, €)>0 for 
every partition Z z v. It was shown by Rokhlin 
and Sinai that an automorphism ¢ has com- 
pletely positive entropy if and only if 9 is a K- 
automorphism. M. Pinsker proved that for 
every automorphism ¢ there exists a partition, 
called the Pinsker partition, that is invariant 
under o and such that the factor transforma- 
tion of o with respect to this partition has 
zero entropy and is the largest among the 
factor transformations of o with zero entropy. 

(v) Rokhlin showed that h(q) —0 for an 
endomorphism q if and only if all of its factor 
transformations are automorphisms. An endo- 
morphism o is called exact if Ao Q "e— v a.e. 
Rokhlin introduced a way to associate with 
each endomorphism a certain automorphism, 
called the natural extension, which reflects the 
properties of the endomorphism. For example, 
an endomorphism and its natural extension 
are simultaneously ergodic or nonergodic, are 
mixing of the same order, and have equal 
entropy. The natural extension of an exact 
endomorphism is a K-automorphism. 

(vi) Automorphisms p, and @, are said to be 
weakly isomorphic if each of them is a factor 
transformation of the other. Sinai proved 
(1964) that for each ergodic automorphism 
with positive entropy, there exists a factor 
automorphism having the same entropy and 
isomorphic to a Bernoulli shift, and hence it 
follows in particular that Bernoulli shifts with 
the same entropy are weakly isomorphic. 
Ornstein (Adv. in Math., 4 (1970)) went further 
and succeeded in proving the following re- 
markable result: Two Bernoulli shifts with 
equal entropy are spatially isomorphic. Par- 
tial results in this direction were obtained 
earlier by L. Meshalkin, J. Blum, and D. Han- 
son. In the proof of Ornstein's theorem, essen- 
tial use was made of the following theorem of 
C. Shannon and B. McMillan, which plays a 
fundamental role in information theory (— 
213 Information Theory): Suppose that ¢ is 
an ergodic endomorphism on (X, Z, m) and č 
is a partition of X. For a point xe X, let A,(x) 
denote the element in the partition Vào *é 
that contains x. Then for almost all x, 


lim ( log nts) 
exists and equals h(ọ, £). 

(vii) Techniques and ideas developed by 
Ornstein in his proof of the isomorphism 
theorem have been refined and extended fur- 
ther by himself, B. Weiss, N. Friedman, M. 
Smorodinsky, and others, and numerous re- 
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sults were subsequently obtained on the iso- 
morphism problem. We describe below some 
of the main results and basic concepts; for 
more detailed accounts — [21-24]. 

A sequence {€,} of partitions is said to be 
independent if for every choice of sets A,,..., A, 
with A,€€, , m((i-, Aj) DI m(A,) when- 
ever n,,75, ...,n, are all distinct. For a fixed 
€ 0, two partitions € and ¢ are said to ber, 
independent if 


Y |m(ANB)—m(A)m(B)| « «. 


Ae6, Bec 


A pair (9, č), where o is an automorphism 
of (X, 22, m) and € is a finite (or countable) 
partition of X, is called a process (on X). 
(V, ~~ 9") is a e-subalgebra of Z invariant 
under o, and ¢ restricted to this o-subalgebra 
is a factor automorphism of o and is isomor- 
phic to a shift transformation on an infinite 
product space, where each coordinate space 
has the same number of points as the number 
of atoms in č. A process (o, č) is called a Ber- 
noulli process (or an independent process) if the 
sequence of partitions (o"£|ne Z} is indepen- 
dent, and a weak Bernoulli (W.B.) process if 
for every ¢>0 there exists a k 0 such that 
the partitions VE" oi E and V9. .,, o£ are e- 
independent for all n 20. Let J be a finite 
set and a, f be two functions from the set 
{1,2,...,N} to J. By dy(a, B) we denote (1/N) 
z in|a(n)z f(n)), where # denotes the car- 
dinality. dy defines a metric on the space 
Jii? called the Hamming distance. Now, 
for a process (p, €) on X with €={A,|jeJ}, 
we define for x, ye X, dz(x, y) to be equal to 
dy (EN (x), £5 (y), where for xe X, €*(x) is the 
point (ta Metal, Hof Q9))e JU-2 Al 
and j(x) is the index jeJ for which xe A;. A 
process (o, č) is called very weak Bernoulli 
(V.W.B.) if for any ¢>0 there exists an No 
such that whenever N > N, there is a set G 
with m(G)> 1 —e belonging to Z (V9... "£) 
satisfying the following condition: for any 
Aen 9"€) with ACG and m(A)» 0, 
there exists a probability measure v on X x X 
satisfying (1) v(E x X) 2 m(E) and v(X x F)= 
m,(F) for all E, FeB and (ii) fy. xdi(x, y) dv < 
e. It is not difficult to show that W.B. pro- 
cesses are V.W.B. For processes (o, č) on 
(X, 4, m) and (9, č) on (X, Z2, m), where both č 
and £ are indexed by the same finite set J, we 
define dall, 2). (@, £)) to be inf(jy dl Dia 
E*(x))dp(x, x)}, where inf is taken over all 
probability measures p on X x X satisfying 
p(E x X) 2 m(E) for all Ee 2 and p(X x E)— 
m(E) for all Ee 42. dy decreases with N, and 
we define d((@, E (@, €)) to be Wm, — dy((@, €), 
(9, £)). Finally, a process (o, č) is said to be 
finitely determined (F.D.) if for any e 0 there 
exists a 07 0 and N such that if (@, £) is any 
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process satisfying (i) ë and č are indexed by 
the same set J, (ii) bie, £) — h(Q, £) « ó, and 

(iii) dy((g, č), (p, €)) <4, then d((¢, č), (9, €)) <€. 
Let us call a partition č an F.D. generator for 
an automorphism q if £ is a two-sided gen- 
erator for o and the process (q, c) is finitely 
determined. A general isomorphism theorem 
proved by Ornstein states: If automorphisms o 
and @ have the same entropy and both have 
F.D. generators, then o and @ are isomorphic. 
Ornstein and Weiss proved further that the 
process (o, č) is F.D. if and only if it is V. W.B. 
From these basic results a number of results 
can be deduced. (a) Nontrivial factors of Ber- 
noulli shifts are Bernoulli. (b) Strongly mixing 
Markov shifts have two-sided weak Bernoulli 
generators and hence are isomorphic to Ber- 
noulli shifts. (c) Every Bernoulli shift can be 
embedded in a flow, which implies among 
other things that Bernoulli shifts have roots of 
all orders. (d) It was shown by Y. Katznelson 
that every ergodic group automorphism of an 
n-dimensional torus has a two-sided generator 
which can be shown to be V.W.B., and hence 
is also isomorphic to a Bernoulli shift. R. Adler 
and B. Weiss earlier proved by entirely differ- 
ent methods that on a 2-dimensional torus, 
two ergodic group automorphisms having the 
same entropy are spatially isomorphic. They 
have done this by constructing a two-sided 
generator Z for such automorphisms which is 
also a Markov partition (— Section G). 

(viii) Among further positive results there 
are the following: (a) Except for a trivial 
change in the time scale, any two Bernoulli 
flows are isomorphic (Ornstein). (b) Every 
ergodic group automorphism of a compact 
group is isomorphic to a Bernoulli shift 
(Thomas and Miles; Lind). (c) A number of 
automorphisms and flows arising from classi- 
cal dynamical systems are shown to be Ber- 
noulli (— Section G). (d) Examples of exact 
endomorphisms arise in connection with prob- 
lems in number theory; for example, tcon- 
tinued fraction expansion and f}-expansion. 
The natural extensions of the exact endomor- 
phisms associated with continued fraction 
expansion and f-expansion are now known 
to be spatially isomorphic to Bernoulli shifts. 

(ix) Since many of the automorphisms that 
have been shown to be K-automorphisms are 
now known to be isomorphic to Bernoulli 
shifts, it is natural to expect that every K- 
automorphism is in fact Bernoulli. However, 
Ornstein (1973) constructed an example of a 
K-automorphism that is not a Bernoulli shift. 
A lot of work has since been done to investi- 
gate how bad K-automorphisms can be. It 
turns out that K-automorphisms share almost 
none of the finer properties of Bernoulli shifts. 
For example: (a) There are uncountably many 
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nonisomorphic K-automorphisms of the same 
entropy (Ornstein and P. Shields). (b) There is 
a K-flow that is not Bernoulli, and there are 
uncountably many nonisomorphic K-flows 
having the same entropy at time one (Smoro- 
dinsky). (c) There exists a K-automorphism 
not isomorphic to its inverse (Ornstein and 
Shields). (d) There exists an automorphism 
that cannot be written as a direct product of a 
K-automorphism and an automorphism with 
zero entropy. This example shows that a con- 
jecture made earlier by Pinsker was false (Orn- 
stein). (e) There are weakly isomorphic K- 
automorphisms that are not isomorphic (Polit 
and Rudolph). 


F. Weak Equivalence and Monotone 
Equivalence 


(1) Weak Equivalence. In order to construct 
examples of ffactors in the theory of von Neu- 
mann algebras (— 308 Operator Algebras), 

F. Murray and von Neumann considered 
various ergodic groups of bimeasurable non- 
singular transformations on a finite measure 
space. In this context a group 4 = {g} of trans- 
formations is called ergodic if m((g ! (B)JU B)— 
(o !(B) B)) «0 for every ge implies either 
m(B) 20 or m(X — B)— 0. A measure u is said 
to be invariant under the group = {g} if u is 
invariant under every transformation g in 4. 
Murray and von Neumann’s construction (the 
so-called group measure space construction) 
gives a type II, tfactor if the group admits a 
finite equivalent invariant measure, a type II, 
factor if it admits an infinite (but o-finite) equiv- 
alent invariant measure, and a type III factor if 
it has no o-finite equivalent invariant measure. 
In this connection we note that Hajian and Y. 
Itó extended the theorem of Hajian and Kaku- 
tani and proved that an arbitrary group = 
{g} of nonsingular bimeasurable transforma- 
tions admits a finite invariant measure if and 
only if no set of positive measure is weakly 
wandering under the group 4 (a set W is said 
to be weakly wandering under a group 4 if 
there exists an infinite subset (g,|ne Z' ) of 4 
such that g,(W)N9,(W)= @ for nz k). Analo- 
gously to the terminologies used in the theory 
of von Neumann algebras, we define ergodic 
countable group 4 (or an ergodic bimeasurable 
nonsingular transformation q) to be of type 
I, IL, or IH if 2 (or o) has a finite equivalent 
invariant measure, a o-finite infinite equivalent 
invariant measure, or no o-finite equivalent 
invariant measure, respectively. 

For a bimeasurable nonsingular transfor- 
mation o, define the full group [4] to be the 
group of all bimeasurable nonsingular trans- 
formations y such that, for some n=n(x), W(x) 
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— Q"(x) for almost all x. Two transformations 
o and @, are said to be weakly equivalent if 
there exists a bimeasurable nonsingular trans- 
formation @ such that 0[9,]0 ! =[@,]. H. Dye 
proved that any pair of type II, transforma- 
tions are weakly equivalent. It easily follows 
from this that the same is true for type II, 
transformations. Krieger showed, on the other 
hand, that among type III transformations 
there are uncountably many weakly non- 
equivalent ones. In fact, Krieger [28] intro- 
duced an invariant for weak equivalence, 
called the ratio set, by means of which he 
classified type III transformations into mutu- 
ally weakly nonequivalent subclasses III, for 
O<A<1. Krieger showed further that (a) for 
0 «Ax 1, every pair of transformations in the 
class HI, is weakly equivalent; (b) in the class 
III, there are uncountably many mutually 
weakly nonequivalent transformations; and 
(c) two ergodic transformations are weakly 
equivalent if and only if the corresponding 
factors constructed via the group measure 
space construction are tx-isomorphic. An 
ergodic flow of nonsingular transformations 
(not necessarily measure-preserving) called the 
associated flow was constructed independently 
by Krieger [29] and by T. Hamachi, Y. Oka, 
and M. Osikawa [30], and was shown to give 
another effective invariant for weak equiva- 
lence. Krieger showed that the mapping which 
assigns to each ergodic transformation its 
associated flow gives a tbijective mapping 
between the set of all weak equivalence classes 
of ergodic type III, transformations and the 
set of all isomorphism classes of aperiodic 
conservative ergodic flows. In this connection 
the theorem of U. Krengel and I. Kubo on the 
representation of ergodic flows, extending the 
theorem of Ambrose and Kakutani mentioned 
in Section D, plays a significant role. 

A bimeasurable nonsingular transformation 
0 is called a normalizer of another transforma- 
tion ¢ if it satisfies 0[@]=[@]9. The set of all 
normalizers “M (p) of a transformation qo forms 
a group called the normalizer group which 
contains the full group [4] as a subgroup. One 
can introduce a suitable topology to A (ol 
to make it a complete separable metrizable 
group. Hamachi [31] has shown that, for type 
III transformations om. the quotient group 
-(e)/[o] , where [qo] denotes the closure of 
[€]. is algebraically and topologically isomor- 
phic to the tcommutant of the associated flow. 

Results obtained by Krieger and others 
mentioned above were motivated in part by 
corresponding developments in von Neumann 
algebra theory due mainly to A. Connes (— 
308 Operator Algebras). From the recent deep 
result of Connes on the uniqueness of tapprox- 
imately finite-dimensional factors of type II, 
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it follows that every approximately finite- 
dimensional factor with the exception of type 
III, is *-isomorphic to a factor constructed 
from an ergodic transformation via the group 
measure space construction. 


(2) Monotone Equivalence. The theory of weak 
equivalence discussed above deals with the 
structure of orbits of a transformation or of 
groups of transformations. In fact, transfor- 
mations p and y are weakly equivalent if and 
only if there exists a bimeasurable nonsingular 
transformation 0 mapping the -orbit of 
almost all x onto the W-orbit of 0(x). A some- 
what more stringent notion of equivalence 
dealing with orbit structure is called mono- 
tone equivalence or Kakutani equivalence. We 
say that measurable flows {¢,} and {@,} of 
measure-preserving transformations on finite 
measure spaces (X, 8. m) and (X, 2, m), respec- 
tively, are monotonely equivalent if there 
exists a bimeasurable measure-preserving 
transformation 8 on X to X such that for 
almost all xe X and all te R, 09,(x)= Pra, (x), 
where c(t, x) is a monotone increasing function 
of t. Two S-flows (— Section D) built over the 
same base transformation with different ceiling 
functions are monotonely equivalent, and it 
can be shown that monotonely equivalent 
ergodic flows are isomorphic to S-flows built 
over the same base transformation. This in- 
duces an equivalence relation on transforma- 
tions, also called monotone equivalence or 
Kakutani equivalence: Two transformations o 
and @ are monotonely equivalent if they can 
serve as base transformations of the same (or 
equivalent) flow. This notion of equivalence 
was introduced by Kakutani in [32], where he 
showed that g and @ are monotonely equiva- 
lent if and only if there are sets Ec X and 

Ec X such that the induced transforma- 

tions (— Section D) o, and ër are isomorphic. 
Abramov's formula mentioned in Section E 
implies that there are at least three classes of 
transformations (and flows) that are mutually 
nonequivalent: transformations (flows) of zero, 
finite, and infinite entropy. Nothing much was 
done in this equivalence theory for a num- 

ber of years until in 1975 J. Feldman and E. 
Satayev independently showed that there are 
monotonely nonequivalent transformations 
within each class of zero, positive, and infinite 
entropy. Since then extensive work has been 
done by Feldman, Satayev, A. Katok, Orn- 
stein, Weiss, D. Rudolph, M. Ratner, and 
others, and numerous results have been ob- 
tained. For detailed accounts of this devel- 
opment — [22, 24, 33, 34]. The main idea 
employed by Feldman and Satayev was to 
introduce a new metric called an f-metric 

and then to define a notion corresponding to 
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V.W.B. of the isomorphism theory described 
in Section E by using an f- instead of a d- 
metric. They then showed that this property, 
called loosely Bernoulli (L.B.) by Feldman and 
monotonely very weak Bernoulli (M.V.W.B.) by 
Satayev, stays invariant under monotone 
equivalence, and that there exist transforma- 
tions with and without this property. An f- 
metric is defined by starting off with an f,- 
metric on the space J^? instead of the 
Hamming distance d, and proceeding in 
exactly same way as for the definition of d, 
namely, by extending fy to f? and fy and 
finally to f. For « and f in J! i25, fi (a, B) 
is defined by setting 1 — fy(«, B) equal to 1/N 
times the maximal integer n for which there 
are positive integers j,<j,<...<j,,k,<k,< 
. <k, with a(jj) = B(k,), i— 1, ..., n. Ornstein 
and Weiss discovered that by substituting f 
for d one can develop a theory of monotone 
equivalence that parallels the isomorphism 
theory described in Section E. One can define 
a process (o, č) to be finitely fixed (F.F.) by 
substituting f for d in the definition of F.D. 
process, and, as was mentioned earlier, to be 
L.B. by doing the same in the definition of the 
V.W.B. process. Ornstein and Weiss proved: 
(a) If o and @ have zero entropy, finite posi- 
tive, or infinite entropy and if both have F.F. 
generators, then they are monotonely equiv- 
alent. (b) Let ọ have an F.F. generator. If 
h(p)=0, then o is monotonely equivalent 

to an irrational rotation of the circle. If 0 — 
h(q) « oo, then q is monotonely equivalent 

to a Bernoulli shift of finite entropy. If h(@)= 
oo, then o is monotonely equivalent to a 
Bernoulli shift of infinite entropy. (c) If o has 
an F.F. generator, then (o, £) is F.F. for all 
nontrivial partitions Z (d) (o, £) is F.F. if and 
only if it is L.B. It is known further that within 
each entropy class there exist uncountably 
many monotonely nonequivalent transforma- 
tions. A measurable flow {@,} is called L.B. if 
it can be represented as an S-flow built over 
an L.B. transformation. The L.B. flows of 
zero entropy are those monotonely equivalent 
to a *Kronecker flow on a 2-dimensional 
torus, and L.B. flows of finite positive (infinite) 
entropy are those monotonely equivalent to 
the Bernoulli flow of finite (infinite) entropy. 
The direct product of an L.B. flow and a Ber- 
noulli flow is L.B., while it was shown by M. 
Ratner that the horocycle flow (— Section G) 
is L.B. but its direct product with itself is not. 


G. Classical Dynamical Systems 
By a classical dynamical system we mean a 


tdiffeomorphism or a flow generated by a 
smooth vector field on some fdifferentiable 
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manifold M". Such a system is nonsingular 
with respect to a measure defined by any 
*Riemannian metric on M". For a fixed Rie- 
mannian metric, we call measures smooth 

if they have a smooth density with respect 
to the measure given by the metric. 

(1) Among classical dynamical systems, geo- 
desic flows have been investigated most exten- 
sively. Let £,(M) be the unitary ttangent bun- 
dle over the manifold M". A point (x, e)e 
JA (M) defines a unique *geodesic through x 
in the direction of e. The geodesic flow on 
A, (M) is the flow defined by ¢,(x, e) =(x,, ej), 
where x, is the point in M" reached from x 
after time t under a motion with unit speed 
along the geodesic determined by (x, e), and e, 
is the unit vector at x, tangent to the geodesic. 
The classical tLiouville theorem in this con- 
text implies that the measure on /7, (M) that is 
the product of the measure on M" induced by 
the metric and the Lebesgue measure on the 
(n — 1)-dimensional sphere gives a smooth in- 
variant measure for the geodesic flow. A wide 
class of systems arising from mechanics can 
be described as geodesic flows. 

Hopf and G. Hedlund proved that if the 
manifold M" is compact and has constant 
negative curvature, then the geodesic flow is 
strongly mixing. Later, by using the theory of 
group 'representations, I. Gel’fand and S. 
Fomin proved that the spectrum of a geodesic 
flow on a compact manifold of constant nega- 
tive curvature is Lebesgue, and is even count- 
able Lebesgue in the case where the mani- 
fold is of dimension 2. F. Mautner and later L. 
Auslander, L. Green, and F. Hahn extended 
this algebraic method to flows obtained under 
the action of some one-parameter subgroup of 
a *Lie group acting on its thomogeneous space 
and obtained extensive results for the case of 
‘nilpotent and some fsolvable Lie groups [35]. 

(2) The flow on an n-dimensional torus de- 
fined by 


Q(X4. X2, E Ses) 
=(x,+@,t,x,+@ ,t,...,X,+@,8) 


is called a translational flow or a Kronecker 
flow. The numbers w,,@,,...,@, are called 
frequencies. Every orbit of {¢,} is dense in the 
torus if and only if the frequencies are linearly 
independent over Z. The motion under a 
translational flow with independent frequen- 
cies is called a quasiperiodic motion. A trans- 
lation flow for a quasiperiodic motion has 
discrete spectrum. 

(3) Sinai obtained a useful criterion for a 
classical dynamical system to be a K-system. 
Let M" be compact, and suppose that {@,} is a 
flow on M" defined by a smooth vector field 
and preserving some smooth measure u. A 
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one-parameter group (V4) of transformations 
of the space M", given by a vector field, is 
called the flow transversal to the flow {¢,} if 
(i) the decomposition of the space M" into the 
trajectories of the flow {w,} is invariant under 
Io: (ti) the limit Im, ,o lim, „o(W,(t, x) — t)/ts = 
a(x) exists for the function W(t, x), which is 
defined to be the time length of the segment 
to, (x) JO uxtj of the trajectory of the flow 
{w,}. Sinai’s fundamental theorem states that if 
a flow {@,} is ergodic and has a transversal 
ergodic flow {y} for which { a(x) dy «0, then 
{@,} is a K-flow. If «(x)<0, then we can even 
drop the assumption that (qj is ergodic. If 
fax) dp 7 0, the theorem holds for the flow 
(0-)- 

A geodesic flow on a 2-dimensional mani- 
fold of constant negative curvature always has 
a transversal flow, called a horocycle flow. The 
ergodicity of a horocycle flow was proved by 
Hedlund. It follows from Sinaf's fundamental 
theorem, therefore, that a geodesic flow on a 
surface of constant negative curvature is a 
K-flow. Sinaí proved even more: A geodesic 
flow on any surface of negative curvature is 
a K-flow. There is an extension of the notion 
of transversal flow to higher dimensions, 
called transversal field. Using this notion Sinai 
proved that a geodesic flow on a manifold 
(of any dimension) of constant negative cur- 
vature is a K-flow. Finally, Ornstein and 
Weiss established that geodesic flows on com- 
pact manifolds of negative curvature are 
Bernoulli. 

(4) D. Anosov considered a class of flows 
and diffeomorphisms satisfying a condition 
that characterizes unstable motions such as 
geodesic flows on a manifold of negative cur- 
vature. They are now called Anosov flows (or 
Y-flows) and Anosov diffeomorphisms (or Y- 
diffeomorphisms) (— 126 Dynamical Systems). 
Anosov proved that if an Anosov flow has a 
smooth invariant measure, then it is ergodic 
and either it has a continuous nonconstant 
eigenfunction or it is a K-flow. Anosov diffeo- 
morphisms with smooth invariant measures 
are K-automorphisms. Sinaí constructed for 
*transitive Anosov diffeomorphisms (and for 
Anosov flows) a special partition of the under- 
lying manifold M called a Markov partition 
having desirable properties. The importance of 
such partitions lies in the fact that they enable 
one to represent such diffeomorphisms as 
Markov shifts. Starting with a measure invar- 
iant for such a Markov shift having the maxi- 
mal entropy (— Section H), Sinai constructed 
a Gibbs measure (— Section C), which turned 
out to be unique in this case and gave rise to 
a natural invariant measure u for the corre- 
sponding diffeomorphism. He proved further 
that the diffeomorphism q considered as an 
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automorphism of the measure space (M, 2, u) 
is a K-automorphism. It was shown subse- 
quently by R. Azencott that ois in fact Ber- 
noulli with respect to u. Sinai investigated 
further the uniqueness of the invariant mea- 
sure attaining the maximal entropy for transi- 
tive Anosov diffeomorphisms, and he was able 
to show among other things that the set of 
transitive C^-Anosov diffeomorphisms that 
do not have an invariant measure absolutely 
continuous with respect to the measure in- 
duced by the Riemannian metric on M con- 
tains an open dense subset. The methods em- 
ployed by Sinai in these investigations were 
extended further by D. Ruelle, Bowen, and 
others. Bowen, in particular, was able to con- 
struct Markov partitions for a wider class of 
diffeomorphisms, namely, those satisfying the 
so-called axiom A introduced earlier by S. 
Smale, and characterized the Gibbs measures 
for diffeomorphisms in this class by means of 
the variational principle (— Section H). For a 
more detailed account of the results of Sinai 
and Bowen — [13, 14]. 

(5) An important example of a system that is 
neither an Anosov nor an “axiom A” system 
because of nonsmoothness has been studied by 
Sinai: the simplest mechanical model due to 
Boltzmann and Gibbs of an ideal gas, which is 
described as a system generated by tiny rigid 
spherical pellets moving inside a rectangular 
box and colliding elastically. Sinai succeeded 
in proving that this is a K-system, thereby 
giving an affirmative answer to the classical 
question of the ergodicity of the basic model 
of statistical mechanics [38]. It was shown 
subsequently by M. Aizenman, S. Goldstein, 
and J. Lebowitz that this system is Bernoulli. 
Sinai also investigated another nonsmooth 
system of classical importance: the system 
describing the motion of a billiard ball on a 
square table with a finite number of convex 
obstacles. He showed that such a system, is a 
K-system. Ornstein and G. Gallavotti then 
showed that Sinai’s methods in fact show that 
the system is Bernoulli. Sinaf's methods were 
extended further by L. Bunimovich and I. 
Kubo to study properties of billiard systems in 
more complicated domains. For more detailed 
account of these matters — [24]. The results 
obtained by Sinai and Bowen for Anosov 
and “axiom A" systems were extended to 
even wider class of systems by A. Stepin, R. 
Sacksteder, M. Brin, and Ya. Pesin (— [24]). 


H. Miscellany 


(1) An arbitrary aperiodic automorphism can 
be approximated by periodic automorphisms. 
More precisely, if o is an aperiodic automor- 
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phism, then for any positive integer n and 

6 7 0, there exists a periodic automorphism y 
of period n such that m((x|e(x) Zv(x)]) « 
(1/n) 4- ô (theorem of Halmos and Rokhlin). 
The question as to how quickly this approxi- 
mation can be carried out has been investi- 
gated in detail by Katok, Stepin, and others. 
The rate of this approximation was shown to 
have a close relationship with the entropy and 
spectral properties of the automorphism ø. By 
utilizing this relationship, various examples of 
automorphisms with specified spectral prop- 
erties have been constructed [16]. 

(2) When an arbitrary automorphism q is 
given on a Lebesgue measure space (X, Z, m), 
there exists a unique decomposition č = {A,} 
of X such that (i) each A, is invariant under 
y and (it) except for a negligible set (in a 
specified sense) of A,, each A, is turned (in a 
natural manner) into a Lebesgue measure 
space, and the restriction of o to A, is an 
ergodic automorphism. This decomposition is 
called the ergodic decomposition of X with 
respect to o. There is a corresponding decom- 
position with respect to a flow. A formula also 
exists that enables us to compute the entropy 
h(q) in terms of the entropies of ergodic com- 
ponents of o. 

(3) Let X be a compact metric space and 
o X X a *'homeomorphism. N. Krylov and 
N. Bogolyubov showed that there always 
exists on X a Borel probability measure y that 
is invariant under o. Let 2 be the collection of 
all Borel probability measures on X, and let 
P, be the subset of P consisting of those invar- 
iant under o. Then 2 and Z, are both convex 
sets compact with respect to the tweak « topol- 
ogy. If & is the set of all extreme points in 
Z,, then by the tKrein-Milman theorem 6, is 
not empty. A measure u in Z, belongs to &, 
if and only if q is ergodic with respect to u. 
When the set &, consists of a single element, o 
is called uniquely ergodic; ¢ is called minimal if 
for every point xe X, the orbit of x under o = 
orb, (x)= 1o" (x)|ne Z} is dense in X. ¢ is 
called strictly ergodic if it is both minimal and 
uniquely ergodic. A theorem of J. Oxtoby 
states that q is strictly ergodic if and only if for 
every continuous real-valued function f on X, 
the sequence of averages (X7_} f(o*(x)))/n 
converges uniformly to a constant M( f). 
There are homeomorphisms that are minimal 
or uniquely ergodic but not strictly ergodic. 

(4) R. Jewett proved that any weakly mix- 
ing measure-preserving transformation on a 
Lebesgue space is spatially isomorphic to a 
strictly ergodic transformation. Krieger ex- 
tended the result by showing that if the en- 
tropy of an ergodic transformation q is finite, 
then ¢ is spatially isomorphic to a strictly 
ergodic transformation. Similar results were 
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obtained for flows by K. Jacobs, M. Denker, 
and E. Eberlein (— [39]). 

(5) A topological analog of the notion of 
entropy, called topological entropy, was in- 
troduced by Adler, A. Konheim, and J. 
McAndrew. This is defined as follows: For 
every open covering . of a compact topolog- 
ical space X, let N(.o7) be the number of sets 
in the minimal subcovering of s. For open 
coverings . and Z, let «/ v 2 be the open 
covering {AN B| Aes, BEB}. For any open 
covering .¥ and a continuous mapping p 
on X, the limit lim, (og N(«Z vo ! «Z v 
Vp C 7 an)n- hao.) exists. Topo- 
logical entropy h,,,(@) of the continuous 
transformation ¢ is now defined by h,,,(~) = 
sup {h,.,(@, )| an open covering of X}. 

L. Goodwyn showed that h,(9) z h,(q) 
for any q-invariant probability measure u, 
where h,(q) is the measure-theoretic entropy 
of o regarded as a j- preserving transforma- 
tion. T. Goodman went further and succeeded 
in proving that bell - supih,(o)| n a 9- 
invariant probability measure}. In connec- 
tion with these results there is interest in the 
question of the existence and uniqueness of 
an invariant measure for o with maximal 
entropy, i.e., a g-invariant measure u for which 
h,(9) — hy (9). Such a measure does not 
always exist, and even if it does it may not 
be unique. The notions of topological en- 
tropy and of measures with maximal entropy 
are generalized by Ruelle in the following 
way. For an open covering d, a continu- 
ous mapping qo, and a real-valued con- 
tinuous function g on X, let Z (£, ,g) be 
equal to Inf! Yo, p sup. o exp Io gie tal, 
where the inf is taken over all subcover- 
ings I of the covering vo ! s v ... v 
o " Us. Then a finite limit Pio, p, g)= 
bm... 1/nlog Z,(. , o, g) exists, and the 
quantity Pio, g) - sup( PAZ, o, g)|./ an open 
covering of X) is called topological pressure. 
When g —0, Pio, g) reduces to h,,,(@). Ruelle 
proved for fexpansive mappings ¢ that P(q, g) 
—sup(A,(o) - [gdu|u is a g-invariant proba- 
bility measure]. This assertion is called the 
variational principle for the topological pres- 
sure. The variation principle was proved for 
general continuous mappings o by P. Walters. 
If a o-invariant measure u satisfies P(q, g) 
—h,(o)-- [gdu, then x is called an equilibrium 
state for g with respect to o. It is known that 
for expansive mappings o every continuous 
function g on X has an equilibrium state. 

(6) Application of ergodic theory to problems 
in analytic number theory has been made by 
several authors. Ergodic or mixing properties 
of particular measure-preserving transforma- 
tions that arise in connection with various 
problems in number theory have been ex- 
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ploited to give answers to these problems. 
New and more striking applications of ideas of 
ergodic theory to different types of questions 
in number theory have been started by Yu. 
Linnik, Fürstenberg, W. Veech, T. Kamae, and 
others (— [40,41]). 

7. Most of the results discussed in this arti- 
cle dealt with the action of a cyclic group of 
transformations or of one-parameter flow. 
There are significant extensions of many of 
these results to different types of group ac- 
tions. For recent developments — [22]. 
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When F. Klein succeeded K. G. C. von Staudt 
as professor at the Philosophical Faculty of 
Erlangen University in 1872, he gave an in- 
auguration lecture entitled “Comparative Con- 
sideration of Recent Geometric Researches," 
which later appeared as an article [1]. In it he 
developed a penetrating idea, now called 

the Erlangen program, in which he utilized 
group-theoretic concepts to unify various 
kinds of geometries that until that time had 
been considered separately. 

The concept of transformation is not new; it 
was, however, not until the 18th century that 
the concept of transformation groups was 
recognized as useful. The theory of tinvariants 
of linear groups and the tGalois theory of 
algebraic equations attracted attention in the 
19th century. In the same century, tprojective 
geometry made remarkable progress, for 
example, when A. Cayley and E. Laguerre 
discovered that metrical properties of Eu- 
clidean and tnon-Euclidean geometries can be 
interpreted in the language of projective geom- 
etry. Cayley proclaimed, *All geometry is 
projective geometry." After learning geometry 
under J. Plücker, Klein made the acquaintance 
of S. Lie. Both men understood the impor- 
tance of the group concept in mathematics. Lie 
studied the theory of continuous transforma- 
tion groups, and Klein studied discontinuous 
transformation groups from a geometric stand- 
point. Klein was thus led to the idea of the 
Erlangen program, which provided a bird's- 
eye view of geometry. 

Klein's idea can be summarized as follows: 
A space S is a given set with some geometric 
structure. Let a transformation group G of S 
be given. A subset of S, called a figure, may 
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have various kinds of properties. The study of 
the properties that are left invariant under all 
transformations belonging to G is called the 
geometry of the space S subordinate to the 
group G. Let this geometry be denoted by 

(S, G). Two figures of S are said to be congruent 
in (S, G) if one of them is mapped to the other 
by a transformation of G. The geometry (S, G) 
is actually the theory of invariants of S under 
G, with the term invariants to be understood in 
a wider sense; it means both invariant quan- 
tities and invariant properties or relations. 

Replacing G in (S, G) by a subgroup G' of G, 
we obtain another geometry (S, G^). A series of 
subgroups of G gives rise to a series of geom- 
etries. For instance, let A be a figure of S. The 
elements of G leaving A invariant form a sub- 
group G(A) of G that operates on $' — S— A. 
We thus obtain a geometry (S’, G(A)) in which 
A is called an absolute figure. In this way, 
many geometries are obtained from projective 
geometry. Klein gave numerous examples. 

It is noteworthy that he mentioned even the 
groups of frational and thomeomorphic 
transformations. 

Klein's idea not only synthesized the geom- 
etries known at that time, but also became a 
guiding principle for the development of new 
geometries. 

In 1854 G. F. B. Riemann published his 
epochmaking idea of Riemannian geometry. 
This geometry has a metric, but in general 
lacks congruence transformations (isometries). 
Thus Riemannian geometry is a geometry 
that is not included in the framework of the 
Erlangen program. The importance of Rie- 
mannian geometry was acknowledged when it 
was used by A. Einstein in 1916 as a founda- 
tion of his general theory of relativity. H. 
Weyl, O. Veblen, and J. A. Schouten dis- 
covered geometries that are generalizations of 
affine, projective, and tcanformal geometries 
in the same way as Riemannian geometry is 
a generalization of Euclidean geometry. It 
became necessary to establish a theory that 
reconciled the ideas of Klein and Riemann; 

E. Cartan succeeded in this by introducing the 
notion of tconnection (— 80 Connections). 
However, the Erlangen program, which gave 
an insight into the essential character of clas- 
sical geometries, still maintains its role as one 
of the guiding principles of geometry. 
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A. General Remarks 


The data obtained by observations or mea- 
surements in astronomy, geodesy, and other 
sciences do not usually give exact values of the 
quantities in question. The error is the dif- 
ference between the approximation and the 
exact value. The theory of errors originated 
from systematic work with data accompanied 
by errors, and the statistical treatment of ex- 
perimental data was the main concern in the 
beginning stages (— 397 Statistical Data Anal- 
ysis). However, due to the recent development 
of high-speed computers it has become pos- 
sible to carry out computations on a tremen- 
dously large scale, and the detailed analysis of 
errors has become an absolute necessity in 
modern numerical computation. Hence the 
analysis of errors in relation to numerical 
computation has become the center of re- 
search in error theory. 


B. Errors 


One rarely makes a mistake in counting a 
small number of things; therefore the exact 
value of the count can be determined. On the 
other hand, the exact value in decimals is 
never obtainable for a continuous quantity, 
say length, no matter how fine measurements 
are made, and a large or small stochastic error 
is thus inevitable in measuring a continuous 
quantity. A discrete finite quantity is a digital 
quantity, and a continuous quantity is an 
analog quantity. The natures of these two 
quantities are quite different. The values of 

a digital quantity are distributed on some 
discrete set, while the values of an analog 
quantity are distributed with a continuous 
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probability. Thus there is the possibility of 
error even for treatment of digital quantities, 
although checking the results for these quan- 
tities is easy. It is preferable to regard digital 
quantities as being analog quantities if the 
possible values are densely distributed. 

On the other hand, when an appropriate 
tanalog computer is not available, analog 
quantities receive treatment similar to digital 
quantities. They are expressed as x times some 
unit, and x is expanded in the decimal or 
binary systems. An approximation to such 
an expansion is obtained by rounding off a 
numeral at some place, the position depending 
on the capacity for computation by available 
methods. There are two ways of rounding off 
numbers, the fixed point method and the float- 
ing point method. The former specifies the 
place of digits where the rounding off is made, 
and the latter essentially specifies the number 
of significant digits. 

Classification of errors. (1) Errors of input 
data are the errors included in input data 
themselves. Such input-data errors include 
the errors that occur when we represent con- 
stants such as 1/3, wes zx by finite decimals. 

(2) Truncation errors occur in approximate 
expressions for the computation formulas 
under consideration. (3) Roundoff errors occur 
in taking some finite number of digits from the 
earlier digits in the numerical value at each 
step. If the computation of an infinite number 
of digits were actually possible, no errors of 
this type would appear. Recently, it has been 
considered more preferable to call this “com- 
putational error.” 

The difference between fixed point and 
floating point rounding off is that the former 
is better suited for operations of addition and 
subtraction and the latter is better for multi- 
plication and division. In fixed point rounding 
off, if a number is multiplied many times by 
numbers less than 1, a so-called underflow may 
Occur, and many digits may disappear; a great 
deal of information can thus be lost. In com- 
putation for scientific research that involves 
frequent multiplication and division, floating 
point rounding off is preferable. It should be 
noted that rounding off for addition and sub- 
traction may also cause a critical loss of in- 
formation. This phenomenon is called cancel- 
ing digits. For instance, in the subtraction 
7.6325071 — 7.6318425 = 0.0006646, where the 
subtrahend and minuend share several early 
significant digits, the difference loses those 
digits. Thus, relative errors may be magni- 
fied tremendously. By taking a large number 
of significant digits, such a situation may 
be avoided to some extent. So-called high- 
precision computation shows its effectiveness in 
such cases. Similarly, when a small number b is 
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added to a large number a, the result may be 
just a and the information of b could be lost 
completely. This kind of loss of information 
can often cause serious trouble. 


C. Methods of Error Analysis 


In order to analyze the propagation of errors, 
let us assume that all numbers are carried to 
infinitely many digits so that no roundoff error 
occurs. Suppose that we are to evaluate the 
function y= f(x,, ..., x,) when x,, x5, ..., x, are 
assigned. Let y be the truncation error of an 
approximate expression. If an input error ó; for 
x; exists, then the corresponding error for y is 


Moreover, suppose that at the final step we 
round off to get a result with a finite number 
of digits, by which an error e is introduced. 
Then the final error 6 for y is 

n of 
EC Sc ó;. 
This procedure is performed for each step 
needed in the computation. If y 2 f(x,, ..., x,) 
is a specified step, then the input error ó; for 
that step involves all the errors arising before 
that step, i.e., ó; is an accumulated error. For- 
ward analysis is a method to estimate the total 
accumulated error from the initial input data. 
It is usually quite difficult to obtain precise 
estimates by means of this method. In con- 
trast, J. H. Wilkinson proposed the following 
backward analysis. Here, the computational 
value y is considered as the exact result for 
the modified initial data X,,..., X,, say y= 
f(*,, ..., X), and the estimates for |x;— X;| 
are given. For example, in the binary floating 
point arithmetic of u bits, we always have the 
relation: 


computational value of a+b 
=a(1+6)+b(1 +6), 


with |ô], |e] 2 ", even when cancellation or 
loss of information occurs. Wilkinson has 
made a deep investigation of error analysis for 
linear computation, algebraic equations, and 
eigenvalue problems by means of backward 
analysis [4, 5]. 


D. Proliferation of Errors 


The phenomena usually called “accumulation 
of errors” should more appropriately be called 
the proliferation of errors, where the algorithm 
itself includes a particular mechanism to in- 
crease a small error indefinitely. An example is 
the recursion formula for *Bessel functions 
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stated as 
Jas (x) zr (2n/x)J,(x) —Ja-1 (x). 


It is customary to compute J,(x) by this for- 
mula, starting with the values of Jy(x) and 
Jj (x), with x given. By putting J -1 (x) 2 Yn; 
J,(x) — z,, the recursion formula can be re- 
garded as a linear transformation of the 
point P,(y,, z,) in a plane into another point 
Pari Qa Zia), where 


Yn+1 =2n> Zeus — y, Qn/x)z,. 


The *eigenvalues 4,, 4; of this *difference equa- 
tion satisfy the following: As long as n<|x|, we 
have |A,|=|A,|=1, while if n |xl, 4, is greater 
than 1 and increases rapidly as n tends to 
infinity. Consequently, even the slightest dis- 
crepancy in the position of P, gives rise to a 
greatly magnified error in the result [3]. 

Many studies have been made of the propa- 
gation of errors and of instability phenomena 
in the numerical solution of ordinary differen- 
tial equations (— 303 Numerical Solution of 
Ordinary Differential Equations; [2]). 
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Euclidean Geometry 


A. History 


Attempts to construct axiomatically the geom- 
etry of ordinary 3-dimensional space were 
undertaken by the ancient Greeks, culminating 
in Euclid's Elements (— 187 Greek Mathemat- 
ics). The fifth postulate of Euclid's Elements 
requires that two straight lines in a plane that 
meet a third line, as shown in Fig. 1, in angles 
a, D whose sum ts less than 180°, have a com- 
mon point. In the Elements, two straight lines 
in a plane without a common point are said 
to be parallel. It can be proved from other 
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axioms in the Elements that if x + 8 — 180^, the 
two lines / and I’ in Fig. 2 are parallel. Hence 
given a line / and a point P not lying on J, 





Fig. 1 Fig. 2 


there exists a line /' passing through P that is 
parallel to L The fifth postulate ensures the 
uniqueness of the parallel l passing through 
the given point P. For this reason, the fifth 
postulate is also called the axiom of parallels. 
Utilizing this axiom, we can prove the well- 
known theorems on parallel lines, the sum of 
interior angles of triangles, etc. The axiom 
plays an important role in the proof of the 
Pythagorean theorem in the Elements. The 
axiom is also called Euclid's axiom. 

However, Euclid states this axiom in a quite 
complicated form, and unlike his other axioms, 
it cannot be verified within a bounded region 
of the space. 

Many mathematicians tried in vain to de- 
duce it from other axioms. Finally the axiom 
was shown to be independent of other axioms 
in the Elements by the invention of non- 
Euclidean geometry in the 19th century (— 
285 Non-Euclidean Geometry). 

The term Euclidean geometry is used in 
contrast to non-Euclidean geometry to refer to 
the geometry based on Euclid's axiom of par- 
allels as well as on other axioms explicit or 
implicit in Euclid's Elements. It was in the 19th 
century that a complete system of Euclidean 
geometry was explicitly formulated (— 155 
Foundations of Geometry). From the stand- 
point of present-day mathematics, it would be 
natural to define first the group of motions by 
the axiom of free mobility due to H. Helm- 
holtz (— Section B) and then, following F. 
Klein, to define Euclidean geometry as the 
study of properties of spaces that are invariant 
under the groups of these motions (— 137 
Erlangen Program). 


B. Group of Motions 


Let P be an tordered field and A” the n- 
dimensional *affine space over P. Let B' be an 
r-dimensional affine subspace of A", B' ' an 
(r — 1)-dimensional subspace of B", B"? an 
(r — 2)-dimensional subspace of B" !, etc. In 
the sequence of subspaces B", B' !,..., B®, 
each Dk — B**! consists of two thalf-spaces (k = 
r,r — 1,...,1). Let H* be one of these half- 
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spaces. Then the sequence of half-spaces H', 
H"! ..., Hl is called an r-dimensional flag, 
denoted by 8’ (nz rz 1), and B' and H” are 
called the principal space and the principal 
half-space of $”, respectively. If f is a tproper 
affine transformation of A", f(H"), f(H" !), 
..., /(H!) form an r-dimensional flag 8’. We 
write f($7) — MI. 

Let A” be the group of all proper affine 
transformations of A". The subgroup $8" of 9I" 
with the following two properties is called the 
group of motions, and any element of $8" is 
called a motion (or congruent transformation). 
(1) Let r be an integer between 1 and n, and let 
$H", HI be any two r-dimensional flags. Then 
there exists an element f of B” that carries 5’ 
to $':f($7) 2 MI. (2) Let f, g be two elements of 
B” with f($") 2 MI, g($") 2 8", and let p be any 
point on the principal space of $”. Then f(p)— 
g(p), that is, f, g have the same "effect" on 
the principal space. In particular, when r — n, 
then f =g. That 9I" possesses a subgroup 8" 
with properties (1) and (2) is called the axiom 
of free mobility. 

When n= 1, it is easy to see that the ele- 
ments of B! are only those elements f of W! 
that can be expressed in the form f(x)= + 
x--a (ae P). When nz 2, P must satisfy the 
following condition in order that a subgroup 
B” with properties (1) and (2) exists in W": If a, 
beP, then P contains an element x such that 
x? =a? 4- b?. When this condition is satisfied, 
the ordered field P is called a Pythagorean 
field. Every treal closed field (e.g., the field R of 
real numbers) is Pythagorean. If B” exists, its 
uniqueness is assured by (1) and (2). Further- 
more, if P contains a square root of every 
positive element (this condition is satisfied, for 
example, by R), then conditions (1) and (2) are 
reducible to the case r=n only, i.e., conditions 
(1) and (2) for other values of r follow from (1) 
and (2) with r 2 n. Hereafter, we assume the 
existence of 98". 

Suppose that we have A" > B'2 B* (nz 
r>k>0), and let $' be a flag with the prin- 
cipal space B':$' —(H', ..., H*, ..., H’). Let 
Hi be another flag with the same principal 
space B": R" «(K', ..., K*,..., K"), where we 
suppose that H/— K? for kzj 2 1, whereas for 
r>i>k-+1, we suppose that Hi and KI are 
different half-spaces on B' divided by B' !. The 
flag gr is denoted by $;. An element f of B” 
with f($/) EN is called a symmetry (or reflec- 
tion) of B” with respect to B*. It leaves every 
point on B* invariant, and its effect on B” is 
determined only by B* independently of the 
choice of half-spaces in $' and S” (subject to 
the conditions mentioned above). In partic- 
ular, the symmetry of A" with respect to a 
point A? — p is called a central symmetry with 
respect to the center p; and the symmetry of A" 
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with respect to a hyperplane A"! = h is called 
a hyperplanar symmetry. They are uniquely 
determined by p and h, respectively, and are 
denoted by S, and S(h), respectively. If H(A") is 
the set of all hyperplanes of A", then B” is 
generated by (S(h)|he H(A")]. Furthermore, if 
p, q are two points of A", the composite S,S, is 
a parallel translation by 2: pq (Fig. 3). The 
parallel translations generate a normal sub- 
group €" of B". For p, qe A", the element of 3" 
that carries p to q is denoted by t,,. The 


x'—Sy(x) 


x x^ =Sq(z') 


Fig. 3 


subgroup of 38" that leaves a point p of A" 
invariant is denoted by Of. Obviously, we have 
O! = tip, 05154 . Thus all the OF (for pe A") are 
isomorphic. We call 05 the orthogonal group 
around p and any element of O7 an orthogonal 
transformation around p. More generally, any 
element of B” that leaves a subspace A* of A" 
invariant is called an orthogonal transforma- 
tion around the subspace A*. 

An element of 8” that preserves the orienta- 
tion of A”, i.e., is represented by a *proper 
affinity with a positive determinant, is called a 
proper motion. Proper motions form a sub- 
group Bg of B". Rotations are, by definition, 
orthogonal transformations belonging to $85. 
Sometimes 8% is called the group of motions; 
then 38" is called the group of motions in the 
wider sense. In this article, however, we shall 
continue to use the terminology introduced 
above. 

The study of the properties of A" invariant 
under 8" is n-dimensional Euclidean geometry. 
Since W” > 38", every proposition in affine 
geometry (— 7 Affine Geometry) can be con- 
sidered a proposition in Euclidean geometry, 
but there are many propositions that are 
proper to Euclidean geometry. Sometimes the 
subgroup of 9I" generated by 8” and the 
homotheties of A", i.e., elements of 9I" repre- 
sented by ‘scalar matrices, is called the group 
of motions in the wider sense, and the study of 
properties of A" invariant under this group is 
called n-dimensional Euclidean geometry in the 
wider sense. 


C. Length of Segments 


Two figures F, F' in A" are said to be congru- 
ent if there exists an Ten such that f(F)— F. 
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Then we write F = F'. The congruence relation 
is an tequivalence relation. Let s — pq, s' — p'd' 
be two tsegments in A”. We say that s, s' have 
equal length when sz s'. Length is an attri- 
bute of the equivalence class of segments. The 
length of s is denoted by |s|. All segments of 
the form pp are congruent, and we define |pp| 
— 0. If we are given a length and a thalf-line 
starting from a point p, we can find a unique 
point q on it such that |pg| =the given length 
(Fig. 4). Let r be a point on the extension of 
pq. The length |pr| is then uniquely deter- 
mined by |pq| and [qr|. It is defined as the 


Fig. 4 


sum of the lengths: |pr| 2 |pg| + lo! With 
respect to the addition thus defined, lengths of 
segments in A" form a fcommutative semi- 
group with the cancellation law, which can be 
extended to an *Abelian group M with 0 as 
the identity element (— 190 Groups P). 

Let |s| 40 and |s'| be any length. On a half- 
line starting from p, we can find points q, r 
with |pg| = Isl, |pr| 2 |s'|. Then the element 
pr/pq=/eP (— 7 Affine Geometry) is a posi- 
tive element of P uniquely determined by |s| 
and |s']. We call A the measure of |s'| with the 
unit |s| and denote it by |s’]:|s|. If P is tArchi- 
medean, 4 can be represented by a real num- 
ber (— 149 Fields NL We have (|s'| - |s" |): |s| 
=(Is"|:]s]) + 5H: IsI} (S71 Is As E Is" D = 
Is" |:|s| (if |s| 40). Thus the mapping |s'| 
|s’|:|s| sends the additive semigroup of 
lengths to that of the positive elements of P. 
This is actually an isomorphism, which can be 
extended to an isomorphism of M onto the 
additive group of the field P. 

Let ok X’, ..., X?) be a "homogeneous 
rational function of a finite number of vari- 
ables X, X’, ..., X*. If a relation q(A, 4^, ..., 2?) 
=0 holds for A=|s|:|so|, Assel sel... A” 
=|s*|:|so|, where |sp| is a length 40, then 
9(44, 44, ..., 41) 2 0 holds also for 4, =|s|:|s,], 


A —|s'|:|s,], ..., 25 =s": Isil, where |s,] is any 
other length 40. Hence, in this case, the ex- 
pression ¢(|s|, |s], ...,|s*|)=0 is meaningful. 


D. Angles and Their Measure 


An angle / AOB is a figure constituted by two 
half-lines OA, OB starting from the same point 
O but belonging to different straight lines. The 
point O is called the vertex, and the two half- 
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lines OA, OB are called the sides of / AOB 
(Fig. 5). Two congruent angles are said to have 
the same measure, denoted by | /. AOB| or 
sometimes simply by a. Let H? =(H?, H!) (H! 
— half-line QR) be a given 2-dimensional flag 
and the given measure of an angle. Then we 
can find a unique half-line QP in the given 
half-plane H? such that | Z POR|=« (Fig. 6). 
The angle / POR is said to “belong” to 9°. 
Let K? be the half-plane separated by the 
line PUQ containing the half-line QR. Then 
H?N K? is called the interior of the angle 


A 

















0 B 


Fig. 5 





L.PQR. Let / POR be another angle belong- 
ing to $°. If the interior of the latter angle is 
a subset of the interior of / POR and QR 
OP’, then | / PQR|is said to be greater than 
| L P’OR|, and we write | Z POR|>|Z P'OR|. 
Actually, it can be shown that > is a relation 
between the measures of / POR and / POR 
and that the set of measures of angles forms a 
‘linearly ordered set with respect to the rela- 
tion 2 defined in the obvious way. When 
| 4 POR|>|ZP’OR|, we write | / POR|= 
|Z POP’|+|Z P’OR|. Actually, these are rela- 
tions between measures of angles. Further- 
more, if the measure « of an angle is given, the 
set of measures of angles x « forms a linearly 
ordered set order-isomorphic to a segment 
and satisfying: (i) If f <y then there exists a ô 
such that fl -- ó 2 y; (ii) B+6=6+ f; (B, - B5) -- 
f4 Pi (5 + B4) if all these sums exist; and 
(iti) B, --ó — f; +ô implies £, = f;. When P is 
Archimedean, these properties imply that the 
measure of angles «| / PQR| can be repre- 
sented by positive real numbers <k (k is any 
given positive number) such that the relations 
of ordering an addition are preserved. 

This one-to-one correspondence between 


the measures of angles and a subset S of the 


interval (O, k] of real numbers can be extended 
to a correspondence between the measures of 
general angles and the subset of R obtained 
from S. When P =R, then we have S =(0,k], 
and any real number appears as a measure of 
a general angle. We can choose / POR and 
the positive number k arbitrarily, but it is 
customary to choose them as follows. Suppose 
we are given an angle /. AOB. Let the exten- 
sions of the half-lines OA and OB in the oppo- 
site directions be OA’ and OB’, respectively. 
The angles / AOB and / A'OB are called 
supplementary angles of each other, and so are 
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¿L AOB and / AOB'. The angles / AOB and 

4. A'OB' are called vertical angles to each other 
(Fig. 7). Any angle is congruent to its vertical 
angle, and an angle that is congruent to its 
supplementary angle has a fixed measure. 





Fig. 7 


Such an angle (or its measure) is called a right 
angle. In the description of the measurement of 
angles, we usually consider the case where the 
special angle /. POR is a right angle, and we 
set k = 1/2. (The existence and uniqueness of 
the right angle can be proved.) An angle that is 
greater (smaller) than a right angle is called an 
obtuse (acute) angle. A general angle whose 
measure is twice (four times) a right angle is 
called a straight angle (perigon). Sometimes we 
choose as the “unit angle” 1/90 of a right 
angle, which is called a degree (hence a right 
angle = 90 degrees, denoted by 90°); 1/60 of a 
degree is called a minute (1° = 60 minutes, 
denoted by 60’), and 1/60 of a minute ts called 
a second (1’= 60 seconds, denoted by 60"). If, 
as usual, we put the right angle equal to 2/2, 
then the unit angle is (2/z)(right angle). This is 
called a radian, and 1 radian = 1807/1 = 
57^17'44.806 ..." =57.3°. 

If a straight line m intersects two straight 
lines J, I’, eight angles a, f, y, 6, a’, By’, éi 
appear, as in (Fig. 8). In this figure, a and a’, f 
and P. y and y', and 6 and ó' are called corre- 
sponding angles, while « and »', f and 6’, y and 
a’, and 6 and f’ are called alternate angles to 
each other. When / and I are parallel, each of 
these angles is congruent to its corresponding 
or alternate angle. 

The Pythagorean theorem asserts that if a 
triangle AABC is given for which / ABC is a 
right angle (Fig. 9), then | AB? +|BC|? =|CA|? 
(which makes sense since X? + Y—Z? isa 
homogeneous polynomial). 





Fig. 9 


Fig. 8 


E. Rectangular Coordinates 


When two straight lines /, m intersect, two 
pairs of vertical angles appear. If one of these 
angles is a right angle, then all are. Then we 
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say that / and m are orthogonal (or perpendic- 
ular) to each other, and write (lm Let l be a 
line and A’ an r-dimensional subspace of A" 
(1<r<n-— 1) intersecting (at a point O — A" Y. 
If | is orthogonal to all lines on A” passing 
through O, then / is said to be orthogonal to 
A’, and we write ILA” (Fig. 10). If A"! is any 
hyperplane in A", then there exists a unique 
line / through a given point P of A" that is 
orthogonal to A” '; this l is called the per- 
pendicular to A" ' through P, and the 


Fig. 10 


intersection IN A"^! is called the foot of the 
perpendicular through P. When A"^! is given, 
the mapping from A" to A"! assigning to 
every point P of A" the foot of the perpendic- 
ular through P is called the orthogonal projec- 
tion from A" to Al. 

Let $" —(H", H""',...,H') bean n- 
dimensional flag of A” and O the initial point 
of the half-line H!. Then we can find a point E; 
in H' (i2 1,2, ..., n) such that OU E; LOUE; 
(iz j, i,j 21,2, ..., n). Moreover, if |e| is any 
unit of length, then E; can be chosen uniquely 
so that |OE;| —le| (i2 1,2, ..., n). Then O, 

E,, ..., E, are tindependent points in A", and 
we have A" OU E,U ...UE,. Thus we have a 
'frame X — (0; E, ..., E,) of A" with O as origin 
and the E; as unit points. Such a frame is 
called an orthogonal frame. À coordinate 
system with this frame, called an orthogonal 
coordinate system adapted to $”, is uniquely 
determined by $”. A motion is characterized 
as an taffinity sending one orthogonal frame 
onto another or onto itself. 

Utilizing an orthogonal coordinate system, 
the lengths of segments and the measures of 
angles can be expressed simply. Let (x,,..., Xp) 
be the coordinates of X with respect to such a 
coordinate system. Then the length of the 
segment |OX | (with |e| as unit) is equal to 
(X7. SCHT, and when Y is another point, with 
coordinates (y,, ..., y, Oz X, OF Y, then we 
have 


Mi Xii 
(24 xD) (Zi yore 


In particular, we have OU X. LOU Y if and 
only if ZE, x;y;—O. 

We may write x — OX for the tlocation 
vector of X. Then the 'affinity Ax +b is a 
motion if and only if A is an *orthogonal ma- 
trix. Thus the tinner product (x, y) is invariant 





cos| 2 XOY|= 
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under motions; it therefore has meaning in 
Euclidean geometry. If we put |x| — (x, x)!?, 
then the right-hand sides of the formulas for 
[OX | and cos| /. XOY| can be written as |x| 
and (x, y)/(|x|-|y|). More generally, we have 
|X Y|=|y —x|. This is the Euclidean distance 
(or simply distance) between X and Y. Then A" 
becomes a tmetric space with this distance, i.e., 
a Euclidean space. Historically, the notion of 
metric spaces was introduced in generalizing 
Euclidean spaces (— 273 Metric Spaces). 


F. Area and Volume 


The subset I” of A" consisting of points 

(x4, ..., x,) with respect to an orthogonal co- 
ordinate system, with 0 x x; « 1, i— 1,...,n, is 
called an n-dimensional unit cube. A function 
m that assigns to tpolyhedra in the wider sense 
P, Q, ... in A" nonnegative real numbers m(P), 
m(Q), ... is called an n-dimensional volume if it 
satisfies the following four conditions: (1) m(@) 
—0. (2) m(PUQ)+m(PNQ)=m(P)+m(Q). (3) 
If P is sent to Q by a translation, then m(P) — 
m(Q). (4) m(I") — 1. It has been proved that 
such a function is unique and has the property 
that P=Q implies m(P)=m(Q). Thus the con- 
cept of volume can be defined in the frame- 
work of Euclidean geometry. More generally, 
if the affinity f(x) 2 Ax +b sends P onto Q, 
then m(Q) — cm(P), where c is the absolute 
value of the determinant | A|. If P is covered by 
a finite number of hyperplanes, then m(P) — 0, 
and if P is a tparallelotope with n independent 
edges a,,...,a,, then m(P)=abs|a,,...,a,]|, 
where |a,, ...,a,| is the determinant of the 

nx n matrix with a; as the ith column vector, 
and abs x is the absolute value of the real 
number x. If P is an tn-simplex whose vertices 


have location vectors xo, X;,...,X,, then we 
have 

1 ts "le ent, i 
m(P) — — abs 

n! Xo X X, 








The volume of any polyhedron can be ob- 
tained by dividing it into n-simplexes and 
summing their volumes. If P is an r- 
dimensional polyhedron in A", then the r- 
dimensional volume of P is obtained by divid- 
ing P into r-simplexes and summing their 
r-dimensional volumes (in the respective r- 
dimensional Euclidean spaces containing 
them). In particular, when r= 1, we speak of 
length (e.g., the length of a broken line), and 
when r — 2, of area. If V is the r-dimensional 
volume of an r-dimensional parallelotope with 


r edges a,,...,a,, we have the formula 
(a;,a;) (a;,a,) 

y2- ! 
(a,,a,) (a,, a,) 








139 G 
Euclidean Geometry 


The notion of measure of point sets other than 
polyhedra is a generalization of the notion 

of volume of polyhedra (— 270 Measure 
Theory). 


G. Orthonormalization 


Let O, A,,..., A, be n+ 1 independent points 
in A". Then the n vectors OA, =a,, i=1,...,n, 
are independent. The points O, A,,..., A, 
determine an n-dimensional flag §" of A" as 
follows. Let H, be the half-line OA,, H, be the 
half-plane on the plane OU A, U A, separated 
by the line OU A, in which A, lies, ..., H, be 
the half-space on A" separated by the hyper- 
plane OUA,U...UA,_, in which A, lies. Let 
b,,.-.,b, be the unit vectors of the rectangular 
coordinate system adapted to $”. Suppose 
further that we are given a rectangular coordi- 
nate system. Then b,, ...,b, can be obtained 
from a,,...,a, by the following procedure, 
called orthonormalization (E. Schmidt): First 
put b, =a,/|a,|, so that |b, |= 1. Then e; = 
a5 — (a5, b,)b, satisfies (b,,c,)=0, c; #0. 
Put b, —c;/|c;|. Then we have (b,,b,)=0, 
|b,|=1. When b,, ..., b; , are obtained in 
this way, so that (b;, b,)=6, for 1 <j, k« i— 1, 
then c; =a; —(a;,b,)b, — ... — (a; b; Ab 
satisfies (b;, c;) — 0, c; #0. Hence b; =¢,/Ie;| 
added to b,, ..., b; , retains the property 
(b;, b) = ô; for 1<j, k <i, and this procedure 
can be continued to i— n. 

Two vectors u, v are called orthogonal (de- 
noted u.1 v) if (u, v) 0, and u is called normal- 
ized when |u| = 1. Thus any two of the vectors 


b,,...,b, are orthogonal, and each of them is 
normalized. Between given vectors a,, ...,a, 
and b,, ..., b, we have the relation {a,,...,a;} 


(=the linear space generated by a,,... 
Disses b}, i=1,...,7. 

Let M, , M, be two subspaces of the linear 
space M of the vectors of the Euclidean space 
A". If any element of M, is orthogonal to any 
element of M,, then Mt, and Mt, are called 
orthogonal and written M, 1 99t,. For any 
proper subspace Wi, of Mi, it can be shown by 
the method of orthonormalization that there 
exists a unique proper subspace M, of IR such 
that 9t — Mt, UM, M, LM. Such a subspace 
M, is called the orthocomplement of W, (with 
respect to M). Then M, NM, = {0} follows, 
and hence i= Wt, + M,. Every element a of 
Mt is therefore written uniquely in the form 
a,+a,,a,EMt,, a EM; we call a, the M- 
component of a and a, the orthogonal compo- 
nent of a with respect to M. The mapping 
from M to M, assigning a, to a is called the 
orthogonal projection from Wi to 99t,; it isa 
linear and *idempotent mapping. 


RE 
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H. Distance between Subspaces 


Since the Euclidean space A" is a metric space, 
the distance 1s defined between any two non- 
empty subsets of A" (— 273 Metric Spaces). 
Let A’, B° be two subspaces of dimensions r, s 
of A", and let d be the distance between them. 
Then it can be shown that there exist points 
p€ A’, qe B° such that d 2 pq, and if p'e A’, 
q'e B5 are any other points with d — p'q', then 
pq —p'd. In particular, when r 20 and s=n— 1 
De, when A’=p is a point and B5— B" isa 
hyperplane), the distance d can be obtained as 
follows: If (a, x) b is an equation of B"! and 
p is the location vector of p, then d= |(a, p) — 
b|/|a]. If |a| — 1 in this equation of Dr), 

then d is given simply by |(a, p) — b|. An equa- 
tion (a, x)=b of a hyperplane is said to be in 
Hesse’s normal form if |a| = 1. 


I. Spheres and Subspaces 


The set of points in a Euclidean space lying at 
a fixed distance r from a given point is called 
the sphere of radius r with center at the given 
point. If p is the location vector of the center of 
this sphere with respect to a given rectangular 
coordinate system, then the equation of the 
sphere is |x — p|=r or (x, x) - 2(p, x) +(p, p) — 
r? =0. The set of points lying at equal dis- 
tances from k+ 1 points with location vectors 
Po: Di... p; (KÈ 1) is a linear subspace of the 
space (which may be Qj or the entire space). If 
these points are independent, then the sub- 
space has dimension n — k, where n is the di- 
mension of the entire space. In particular, if 
these points are vertices of an n-dimensional 
tsimplex, then there is a unique sphere passing 
through them, called the circumscribing sphere 
of the simplex. In this case, the simplex is said 
to be inscribed in the sphere. If po, P4, .... p, 
are location vectors of the vertices of the sim- 
plex, then the equation of the circumscribing 
sphere of the simplex is given by 


1 1 1 1 
Po Pi Pa x|=0. 
Po Pi Box 


When n=2 or 3, there are many classical 
results concerning the circumscribing circle of 
a triangle, the circumscribing sphere of a sim- 
plex, and other figures related to a triangle or 
a simplex. 
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A space satisfying the axioms of Euclidean 
geometry is called a Euclidean space. An ‘affine 
space having as *standard vector space an n- 
dimensional Euclidean tinner product space 
over a real number field R is an n-dimensional 
Euclidean space E". In an n-dimensional Eu- 
clidean space E", we fix an torthogonal frame 
X —(0, E,, ..., Ej, e; - OE, (ej, e) ën, The 
frame X determines frectangular coordinates 
(X4, X5, ..., x,) of each point in E". We can thus 
establish a one-to-one correspondence be- 
tween E" and R"— ((x,, ..., x,)| x;e R]. In this 
sense we identify E" and R" and usually call R" 
itself a Euclidean space. The 1-dimensional 
space RI is a straight line, and the tCartesian 
product of n copies of R! is an n-dimensional 
Euclidean space (or Cartesian space). Given 
points x «(x,, x5, ..., x, and y2 (yi, yz. .... Yn) 
in the Euclidean space R”, the ‘distance d(x, y) 
between them is given by 


(y1—Xx1Y +... + (y,— x". 


Thus the distance d(x, y) supplies R” with the 
structure of a tmetric space. We call x; the ith 
coordinate of the point x, the point (0, ...,0) 
the origin of R", and the set of points (x| —oo 
« x; € 00; xj 0, j ij the x-axis (or ith co- 
ordinate axis). For an integer m such that 

—] <m <n, we define m-dimensional !sub- 
spaces in R”; a —1-dimensional subspace is 
the empty set, a 0-dimensional subspace is 

a point, and a 1-dimensional subspace is a 
straight line. If we take an orthogonal frame, 
an m-dimensional subspace is represented 

as an R" (— 139 Euclidean Geometry; 7 Affine 
Geometry). 

As a ttopological space, R" is flocally com- 
pact and fconnected. A bounded closed set in 
R" is fcompact (Bolzano-Weierstrass theorem). 

Given a point a —(a,, ...,a,) in R” and a real 
positive number r, the subset {x |d(x,a)<r} of 
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R" is called an n-dimensional solid sphere, solid 
n-sphere, ball, n-ball, disk, or n-disk with center 
a and radius r, its "interior (x|d(x,a) «rj an n- 
dimensional open sphere, open n-sphere, open 
ball, open n-ball, open disk, or open n-disk, 

and its *boundary (x|d(x, a) 2 r] an (n— 1)- 
dimensional sphere or (n — 1)-sphere. In partic- 
ular, a 2-dimensional solid sphere is called a 
circular disk, its interior an open circle, and its 
boundary a circumference. A disk or a circum- 
ference is sometimes called simply a circle. 
The family of n-dimensional open spheres 
with center a gives a base for a neighborhood 
system of the point a. Suppose that we are 
given a sphere S and two points x, y on S. 

The points x, y are called antipodal points on 
the sphere S if there exists a straight line L 
passing through the center of S such that SN 
L — (x, y). The segment (or the length of the 
segment) whose endpoints are antipodal points 
is called the diameter of the solid sphere (or 

of the sphere). The notion of tdiameter (— 273 
Metric Spaces) of a solid sphere or of a sphere 
considered as a subset of the metric space 

R" coincides with the notion of diameter of 
the corresponding set defined above. When 
n> 3, the intersection of a sphere and a 2- 
dimensional plane passing through the center 
of the sphere is called a great circle of the 
sphere. For m such that 1 x m x n, we consider 
an m-dimensional solid sphere or an (m — 1)- 
dimensional sphere in an m-dimensional 

plane R". These spheres are also called m- 
dimensional solid spheres or (m — 1)-dimen- 
sional spheres in R”. 

In particular, the solid sphere of radius 1 
having the origin as its center is called the unit 
disk, unit ball, or unit cell, and its boundary is 
called the unit sphere. (In particular, when we 
deal with the 2-dimensional space R?, we use 
the term circle instead of sphere, as in unit 
circle.) The points (0, ...,0, 1) and (0, ...,0, —1) 
are called the north pole and south pole of the 
unit sphere, respectively. The (n — 2)-dimen- 
sional sphere, which is the intersection of the 
unit sphere and the hyperplane x, — 0, is called 
the equator; the part of the unit sphere that is 
“above” this hyperplane (e, in the half-space 
x, = 0) is called the northern hemisphere, and 
the part that is “below” the hyperplane (i.e., in 
the half-space x, « 0) the southern hemisphere. 

Let a;, b; be real numbers satisfying a; < b; 
(i— 1,2, ..., n). The subset (x |a; « x; « b, i= 
1,2, ..., n] of R” is called an open interval of 
R”, and the subset {x| a; <x; & bj) a closed 
interval. They are sometimes called rectangles 
(when n — 2), rectangular parallelepipeds, or 
boxes. Àn open interval is actually an open set 
of R", and a closed interval is a closed set. In 
particular, the closed interval [x |0 € x; & 1, 

i— 1,2, ..., n] is called the unit cube (or unit n- 
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cube) of R". We can take the set of open inter- 
vals as base for a neighborhood system of R". 
All the *convex closed sets (for example, 
closed intervals) having interior points in 
R" are homeomorphic to an n-dimensional 
solid sphere. A topological space J” that is 
homeomorphic to an n-dimensional solid 
sphere is called an n-dimensional (topological) 
solid sphere, (topological) n-cell, or n-element. A 
topological space Sr ! homeomorphic to an 
(n — 1)-dimensional sphere is called an (n — 1)- 
dimensional topological sphere (or simply 
(n — 1)-dimensional sphere. The spaces I” and 
S"! are torientable ttopological manifolds 
whose orientations are determined by assign- 
ing the generators of the (relative) thomology 
groups H,(I", I") and H,_,(S"~'), respectively 
(both are infinite cyclic groups). By means 
of the tboundary operator 0: H,(I", [") ^ 
H. 4(S" +), the orientation of 1" or S" ^! deter- 
mines that of the other. 


References 
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Leonhard Euler (April 15, 1707-September 18, 
1783) was born in Basel, Switzerland. In his 
mathematical development he was greatly 
influenced by the Bernoullis (— 38 Bernoulli 
Family). He was invited to the St. Petersburg 
Academy in 1726 and remained there until 
1741, when he was invited to Berlin by Fre- 
derick the Great (1712—1786). Euler was active 
at the Berlin Academy until 1766, when he 
returned to St. Petersburg. Already having 
lost the sight of his right eye in 1735, he now 
became blind in his left eye also. This, how- 
ever, did not impede his research in any way, 
and he continued to work actively until his 
death in St. Petersburg. 

Euler was the central figure in the mathe- 
matical activities of the 18th century. He was 
interested in all fields of mathematics, but 
especially in analysis in the style of * Leibniz, 
which had been passed down through the  . 
Bernoullis and was developed by him into a 
form that led to the mathematics of the 19th 
century. Through his work analysis became 
more easily applicable to the fields of physics 
and dynamics. He developed calculus further 
and dealt formally with complex numbers. He 
also contributed to such fields as *partial dif- 
ferential equations, the theory of telliptic func- 
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tions, and the tcalculus of variations. He con- 
tributed much to the progress of algebra and 
theory of numbers in this period, and also did 
pioneering work in topology. He had, how- 
ever, little of the concern for rigorous founda- 
tions that characterized the 19th century. He 
was the most prolific mathematician of all time, 
and his collected works are still incomplete, 
though some seventy volumes have already 
been published. 
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A. History 


By the evaluation of a function f(x) we mean 
the application of algorithms for obtaining the 
approximate value of the function. The evalu- 
ation methods are classified roughly into two 
groups: (1) evaluation of functions using ap- 
proximation formulas, and (2) evaluation using 
microprogramming techniques. The advent 
of high-speed computers has brought about 
drastic changes in the evaluation of functions. 
Before the introduction of high-speed com- 
puters in the 1940s, mathematical tables had 
played a prominent role. The first issue (pub- 
lished in 1943) of the journal Mathematical 
tables and other aids to computation (MTAC), 
the predecessor of the journal Mathematics of 
computation, was primarily concerned with 
tables of mathematical functions. One of the 
aims of this journal was to facilitate the ex- 
change of information on errors in the tables. 
The tables, obtained in the past by tedious 
hand calculation, can now be easily prepared 
by high-speed computers, and there was a 
period when more accurate and extensive 
tables were published one after another. It 

is ironic, however, that high-speed comput- 
ers revolutionized numerical analysis and 
prompted a shift of emphasis in the field, 
beginning in the 1950s, away from the use of 
numerical tables and toward exploration of 
the most efficient methods of approximation of 
the functions, thus causing a rapid decrease in 
the need for tables. 

Recently, a significant trend in computer 
design has replaced the conventional logic 
control section with "stored logic," or micro- 
programmed control, stored in high-speed, 
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nondestructive Read-Only Storage (ROS). For 
example, the microprogrammed control used 
for the unified COordinate Rotation DIgital 
Computer (CORDIC) algorithm is effective in 
calculating elementary functions because of its 
simplicity, accuracy, and capability of high- 
speed execution via parallel processing. It is 
not clear whether the applications of approxi- 
mation formulas heretofore in use will be 
superseded by microprogramming techniques 
in all kinds of computers. However, the value 
of the approximation formulas recognized in 
the 1950s has been declining insofar as elemen- 
tary functions are concerned. Recently, as a 
method for the evaluation of functions based 
on a new viewpoint, some "unrestricted" al- 
gorithms have been proposed by Brent [1], 
which are useful for the computation of ele- 
mentary and special functions when the re- 
quired precision is not known in advance or 
when high accuracy is necessary. It is expected 
that methods for the evaluation of functions 
using approximation formulas will be further 
developed as microprogramming techniques 
and unrestricted algorithms see wider use. 


B. Evaluation of Functions Using 
Approximation Formulas 


Suppose we approximate a function f(x) by a 
function g(x) using the following class of func- 
tions p;(x) and q,(x). Let continuous functions 
p(x), ..., p, (X) and q,(x), ..., q,(x) defined on 
a closed interval [a,b] satisfy the following 
conditions: (i) p,, ..., p, and q,, ...,q,, are both 
linearly independent; (ii) there exist at most a 
finite number of zeros for 37, b.q,(x) in [a,b] 
for any choice of b,,b,, ..., bm, except for the 
case b, =b, =... = b, — 0; (iii) there is a con- 
tinuous function g(x) with a nonzero denomi- 
nator in [a,b] representable as 


gi) Y api) | roi (1) 


where a;,i=1,...,n, and b, j—0, 1, ..., m, are 
constants. Then g(x) is called a generalized 
rational function based on a class of functions 
{p(x)},i=1,...,n, and {q,(x)}, j=1,...,m. If 
pí(x) 2 x^! and q,(x)=x/"+, (1) is reduced to a 
rational function. If m=1 and q,(x)= 1, (1) 

is a linear combination of p,(x) and is called 
an approximation of linear type to f(x); and 
further, if pi(x) 7 x'^!, (1) is reduced to a poly- 
nomial. The crux of the approximation prob- 
lem lies in the criterion to be used in choosing 
the approximate constants in (1). There are 
three methods for choosing them, which lead 
to three types of approximation of major 
importance: (1) interpolatory approximations, 
(ii) tleast-squares approximations, and (iii) 
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min-max error approximations (sometimes 
called best approximations). As a major aim of 
computer approximation of a function is to 
make the maximum error as small as possible, 
the third type of approximation has been used 
for digital computers. 

For every continuous function f(x), there 
always exist min-max error approximations of 
the form (1). In generating a min-max error 
approximation in practice, we essentially 
depend upon the following conditions and 
theorems. Let a function space F be a d- 
dimensional linear space. If for fe E which is 
not identically zero, there exist at most (d — 1) 
zeros of f(x) in [a,b], then F is called the 
unisolvent space or the Haar space. 

Let one of the best approximations to 
a continuous function f(x) be g(x). If the 
linear space D of all the functions of the form 
{Z a.p(x)+ X 5,9(x)q,(x)j is unisolvent, then 
the best approximation is unique. For the 
case where g(x) is an approximation of linear 
type to f(x), Haar [2] proved the following 
theorem. 

Let g(x) be the best approximation to f(x) 
and g(x)= D7", a;p,(x). A necessary and suffi- 
cient condition for the best approximation to 
be always unique is that the m-dimensional 
linear space generated by the functions p,, p>, 

.-, Pm be unisolvent. In particular, we have 
the best unique polynomial approximation 
(one variable) of f(x) defined on [a, b]. 

Let the necessary and sufficient condition 
of the previous theorem be satisfied in a d- 
dimensional linear space D. À necessary and 
sufficient condition for g(x) given by (1) to be 
the best approximation of f(x) is that there 
exist points aX xo € x, €... «X, , «x4 &b, 
called deviation points, such that (—1)( f(x) — 
g(Xj)) =p (i—0, 1, ...,d). 

À great number of algorithms are known 
by means of which one can calculate the best 
approximation g(x) of a function f(x) for the 
given values of n and m and for p((x) 2x! ! 
and g(x)=x/~' in (1). However, the following 
three kinds of algorithms are used most fre- 
quently for generating min-max approxima- 
tions on computers: Remes's second algorithm 
[3], the differential correction algorithm, and 
Yamauti's folding-up method. 


C. Evaluation of Elementary Functions Based 
on Microprogramming 


Chen [6] has given a general algorithm for 
calculating an elementary function z = f(x) as 
follows. Let F(x, y) 2 yg(x) - h(x), where y is 
some parameter for evaluating zo = f (xo) = 
F(xg, Yo). We assume that (xo, Yọ) has been 
given and that z, is unknown. Suppose that a 
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new point Dan, y,,4) 1s obtained from (x,, Yy) 
according to a pair of transformations, x,,, = 
9 (X, y) and pn — V(x,, y,), keeping the 
value of F(xs, yo) invariant. If x, is forced to 
converge to x,, and if g(x,,)=1 and h(x,,)=0, 
then zo = f(xo) = F(xo, yo) = F(X y) 2... = 
F(X, yo) — yo. In this procedure it is necessary 
to determine g(x), h(x), o(x), and y(x) for 

the given f(x). Most of the elementary func- 
tions can be evaluated by Chen's algorithm, 
which is essentially identical with Specker's 
Sequential Table Look-up (STL) method 
based on addition formulas [7], e.g., logx = 
log(xx) —loga or e*=e* fe’, The iteration 
equations (2) of CORDIC given later can also 
be derived by the STL method with complex 
numbers. 

The use of coordinate rotation to evaluate 
elementary functions is not new. In 1956 and 
1959 Volder [8] described the CORDIC for 
the calculation of trigonometric functions, 
multiplication, division, and conversion be- 
tween the binary and the decimal and r-adic 
number systems. Daggett, in 1959, discussed 
the use of CORDIC for decimal-binary con- 
version. It was recognized by Walther [9] in 
1971 that these algorithms could be merged 
into one unified algorithm. Consider coordi- 
nate systems parameterized by m in which the 
radius R and angle A of the vector P=(x, y) 
are defined as R 2 (x? + my?)'? and A =m"? 
arctan(m!^ y/x). The basis of Walther’s algo- 
rithm is the coordinate rotation in a linear 
(m —0), circular (m= 1), or hyperbolic (m= —1) 
coordinate system, depending on which func- 
tion is calculated. Iteration equations of 
CORDIC are as follows. The point D, Ven, 
Z;+1) 18 obtained from the point (x;, Y; z;) by 
means of the transformation 


Xj44 =X; + my,d;, 

Yi+ı = Ji — Ern (2) 
Zou = Zi 4%, 

where m is a parameter for the coordinate 
system, x; — m !? arctan (m!? AL and à; is a 
suitable value, e.g., +2~' The angle A;,, and 
radius R;,, are A;,, =A;—a,; and R;,,=R; x 
K;z R; x (1+m6?)'?. After n iterations we find 
A,— Ao —2 and R, 2 Ro x K, and then 


x, = K(xgocos(um!?)-- yom!? sin(am??)}, 
y, = K(yocos(xm?) — xom '? sin(xm!?)), 
Zn — Zo + a, 


where a= 7-34, and K 2 II/-À K;. These 
relations are summarized in Table 1 for m- 1, 
m=O, and m= —1 in the following special 
cases. (1) The value of A is forced to converge 
to zero; y, 0. (ii) The value of z is forced to 
converge to zero; z,0. 
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Table 1. Input and Output Functions for CORDIC 

Input Quantity 
Function m Xo Yo Zo to be 0 Output 
sint 1 1/K 0 t z,0 y, sint 
cost 1 1/K 0 t z,0 xX, Cost 
tan !t 1 1 t 0 EW z,otan !t 
XZ 0 x 0 z z,0 Y XZ 
y/x 0 x y 0 ESA z,— y/x 
sinht —1 1/K_, 0 t z,—0 y,—sinht 
cosht —| 1/K_, 0 t Z, 70 x,—cosht 
tanh !t —1 1 t 0 Va z,tanh !t 


n-1 


K ,-[[a-97)^ 


n-1 
K,=[] (+622, 
j=0 


D. Fast Fourier Transform (FFT) 


When a function f(x) can be taken to be 
periodic, it is advantageous to use trigono- 
metric polynomials as least-squares approxi- 
mating functions. The summations arising in 
least-squares approximations based on the 
trigonometric polynomials play an important 
role in various applications, and a quite effi- 
cient algorithm for the evaluation of such 
sums was developed by Cooley and Tukey 
[10] in 1965, known as the fast Fourier trans- 
form (FFT). Let Se, (0 € m  N — 1) be a set of 
complex numbers, and consider 


X, (UN) Y. x,exp(- 2zimn/N) 
0 


(0cnzN-—1) (3) 


Equation (3) is often called the discrete 
Fourier transform (DFT) of the sequence xm, 
it being analogous to the continuous tFourier 
transform, 


T 


x,-um| Sons — 2nint/ T) dt 


0 
(O<n<N), (4) 


where x(t) is a periodic function of t with 
period T. X, is called the nth Fourier coeffi- 
cient of a set of N equally spaced samples of 
size N for the function x(t) (t ZjT/N,Oxj« 
N). In the same way as for the continuous 
Fourier transform, the discrete transform can 
be inverted to yield 

N-1 


x,— Y, X,expQninm/N) 


m=0 


(0<n<N-1). 


Here x, is called the coefficient of the inverse 
Fourier transform, and X,, and x, thus form a 
transform pair. 

The FFT algorithm for a finite sequence of 
length N in (3) is based on the fact that the 
calculation of (3) can be performed in stages by 
using the direct product decomposition if N — 
N, N3, with N, and N, relatively prime. For 


JH 


example, the 2-dimensional Fourier transform 
coefficient 1s given by 


Xan = LAN, N2)] x 

NA N-1 

3 Dd exp —2ni(n, m,/N, t n3m;/N;)) Xs, m, 
m,-0m,-O 


(0x n, <N,-1, Oxn,xN,— 1). 


If by an elementary operation we mean one 
complex multiplication and one complex ad- 
dition, we can evaluate X, ,, through (N, N3)? 
such operations using Horner's scheme. By 
the direct product decomposition method, 
however, the (N, N,)” operations can be re- 
duced to only N, N5(N, + N3) operations. 
Because the matrix corresponding to the trans- 
formation mentioned above is a direct product 
of N, x N, and N, x N, matrices, we can per- 
form the calculations in two stages: first to 
obtain Čm,n, forü0&m, <N, — 1 and Om; <N, 
— 1 and then to obtain X,, n, forü£n; « N;—1 
and Oxn, x N, — 1. We have 


N5-1 

En, n, =(1/N2) 2 exp( —2nin; m; /N5) Xu m, 
m,=0 
N,-1 

X, 5, 7 (/N1) 3. exp( —2nin, m,/Ni)6,, ,,.. 
m,=0 


This direct product decomposition method is 
well known for 2- or 3-dimensional Fourier 
transforms. Even for a 1-dimensional Fourier 
transform of length N = N, N3, if N, and N, are 
relatively prime, one can use the method of 
direct product decomposition. Even when N, 
and N, are not prime, we can use a method 
of "pseudo" - direct product decomposition to 
reduce the number of operations. Namely, we 
can put m— m, 4- Nm, (O m, « N,,0x m, < 
N,);n=Nj,n, +n, (O<n, <N,,0<n,<N)). 
Then, similarly as before, X, in (3) can be 
rewritten as 

N,-1 
X,=(1/N,) 3. exp(—2zin,m,/N,) 


m,=0 


x exp( —2nim, n/N, N;))6 


m;,.n5? 
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where 
Ny-1 
BUG =(1/N3) p» exp( —2nin,m;/N;)x,, «v m, 
m,=0 
(5) 
which is a DFT of length N,. If we put 
e, n, =EXp(—2rim; n3 NN, NA, s. (6) 
then 
N,-1 . 
X, —(1/N)) 3. exp( -2nin m,/Ni)6, ny: 
m,=0 
(7) 


Formula (7) is nothing but a DFT of length 
N,. Thus an FFT of length N — N, N, can 

be calculated by decomposing it into three 
stages as follows: (1) obtain N, transforms 

of length N, in (5); (it) multiply čm,,n, by 

exp( —2nim, n;/N) in (6) (phase rotation); (iii) 
calculate N, transforms of length N, in (7). 

If either or both of N, and N, can be factored 
further so that, e.g, N 2 N,N,- N,N,,N5,— 
..., an FFT of length N, can be calculated 
similarly by decomposing it, and so on, and 
in this way one can reduce the total number 
of operations. This is the principle of FFT 
pointed out by H. Takahasi. When N is a 
power of two, it can be shown that the FFT 
algorithm requires approximately N log, N/2 
operations. 


E. Padé Approximation 


Let f(x) co - c x cx? +... be a formal 
power series. For any pair of nonnegative 
integers (p,q), we define the (p, q)th Padé ap- 
proximation of f(x) as follows: The Padé ap- 
proximation is a rational function 


(ao - a,x c a5 x? +... c a,x?) 
[bo -- bi x t b, x? +... +b,x°) 


satisfying the condition that all terms in the 
formal power series 


(bo +bix +... box?) (ese x s.) 
—(ag t a x... a,xP) 


should vanish up to the term x?**. An infi- 
nite matrix whose (p, q)th entry is the (p, q)th 
Padé approximation is called the Padé table 
for f(x). The Padé approximation is uniquely 
determined, provided that every Hankel 
determinant 





Deia Cptqti c 





never vanishes. 
When we expand f(x) into a continued 
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fraction of Stieltjes type, say, 


cog] aix| œx] 
E a a " 








then its 2pth and (2p + 1)th approximate frac- 
tions are the (p, p)th and (p+ 1, p)th Padé 
approximation of f(x), respectively. 
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A. General Remarks 


The notable relation between the lengths of 
certain families of curves in a plane domain 
and the area of the domain has long been 
recognized and utilized in function theory. 
L. V. Ahlfors and A. Beurling formulated this 
relation by introducing the notion of extremal 
length for families of curves [1]. Although 
there are various definitions of extremal 
length, they are essentially the same except for 
one due to J. Hersch [3] and A. Pfluger [2]. 

The image of an interval under a continuous 
mapping is called a curve. We say that it is 
locally rectifiable if every ‘continuous arc 
of the curve is trectifiable. Let C be a finite 
or countable collection of locally rectifiable 
curves in a plane and p, 0 € p < oo, be a !Baire 
function defined in the plane. Represent C in 
terms of arc length s (— 246 Length and Area), 
and set (C, p? = fc p ds. For a family T of finite 
or countable collections C, p is called admis- 
sible if (C, p» 21 for every Cer. If no CeT 
consists of a finite or countable number of 
points, then p = oo is always admissible for 
I. Call inf {f (p? dxdy], where p runs over 
admissible Baire functions, the module of I, 
and denote it by M(T). The reciprocal A(L)— 
1/M (T) is called the extremal length of I. If 
no Baire functions are admissible for I', we 
set A4(L) 20. The extremal length is defined 
equivalently in two other ways as follows: 
Let p be a nonnegative Baire function, and 
put L(T, p) z inf (C, p» |CeT j, then A(T) = 
sup LO, p)^/] | p? dx dy, where p runs over 
nonnegative Baire functions. Next, let be 
the collection of nonnegative Baire func- 
tions p such that f f p? dxdy&1; then A(T)— 
sup(L(I, p} |pe o]. We obtain the same 
value for A(T) if we-require an admissible p to 
be flower semicontinuous. If p is required to 
be continuous, then the extremal length de- 
fined by Hersch and Pfluger is obtained. As is 
shown in example (1) of Section B, there is a 
case where the two definitions actually differ. 

If an admissible p yields M(T) — f f o? dx dy, 
then p|ds| is called an extremal metric. Beur- 
ling gave a necessary and sufficient condition 
for a metric to be extremal [4]. 

We list four properties of extremal length: 
(1) 41) 2 A(T) if T, c F. 2) M(U), D) S 
34, MO AO) Let (I, and F be given. Suppose 
that there are mutually disjoint measurable 
sets (E,) such that each C, cT, is contained 
in E,. If each element of (Lt, contains at 
least one CeT, then M(T)>%, M(I,) and 
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hence 
M (U r,) = » M(I,). 


If each CeT contains at least one C,cT, for 
every n, then A(T) 2 X, A(T). (4) Let f be an 
analytic function in a domain Q and {C} be 
given in Q. Denote by f(C) the image of C by 
f. Then 2({C}) € A(1/f(C)]). The equality holds 
if f is one-to-one. This shows that A({C}) is 
conformally invariant. 


B. Extremal Distance 


Let Q be a domain in a plane, 6Q its bound- 
ary, and X,, X, sets on QUOO. The extremal 
length of the family of curves in Q connecting 
points of X, and points of X, is called the 
extremal distance between X, and X, (relative 
to Q) and is denoted by A4(X,, X2). 

Example (1). Let Q={z||z| <2}, X, 200, 
and X, be a countable set in |z|< 1 such that 
the set of accumulation points of X, coincides 
with |z| 2 1. Then A,(X,, X5) — oo, but the 
extremal distance in the sense of Hersch and 
Pfluger is equal to (2x) !log2. 

Example (2). In a rectangle with sides a and 
b, the extremal distance between the sides of 
length a is b/a. 

Example (3). Let Q be an annulus r; < 
[z| € r4. The extremal distance A between 
the two boundary circles of Q is equal to 
(2x)~' log(r,/r,). The extremal length of the 
family of curves in O homotopic to the bound- 
ary circles is equal to 1/4. 

Example (4). Let Q be a domain in the ex- 
tended z-plane such that oo e Q. Let z; € Q and 
iz —zo| 2r] <Q, and denote by A, the ex- 
tremal distance between (|z — zo| rj and a set 
X c 0Q relative to Q. Then 4, —(2z) '!logr 
increases with r. We call the limit the reduced 
extremal distance and denote it by Ao(X, oo). 
tRobin's constant for *Green's function in Q 
with pole at z= oo is equal to 21A9(0Q2, oo). 

Extremal length is also defined on Riemann 
surfaces. Some classical conformal invariants 
can be given in a generalized form in terms of 
extremal length. The notion of extremal length 
has applications in various branches of func- 
tion theory, such as tconformal and tquasi- 
conformal mappings, the * Phragmén-Lindelóf 
theorem, the tcoefficient problem, the ttype 
problem of Riemann surfaces, and studies of 
the boundary properties of functions of finite 
Dirichlet integrals. It is also applied to prob- 
lems in differential geometry. Extending the 
notion of extremal length, M. Ohtsuka con- 
sidered extremal length with weight, and B. 
Fuglede introduced the notion of generalized 
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module in higher-dimensional spaces [8]. 
These notions have useful properties and 
applications. 
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Pierre de Fermat (August 20, 1601—January 
12, 1665) was born into a family of leather 
merchants near Toulouse, France. He became 
an attorney and in 1631 a member of the 
Toulouse district assembly. When not engaged 
in such work, he did research in mathematics, 
so that he consigned his results only to his 
correspondence or to unpublished manu- 
scripts. The manuscripts were published post- 
humously by his son in 1679 and are known as 
Varia opera mathematica. His research into 
number theory, stimulated by Bachet’s (1581- 
1638) translation of the Arithmetika of Dio- 
/phantus (published in 1621), made Fermat's 
name immortal and initiated modern num- 
ber theory. He posed the famous Fermat's 
Problem, which has yet to be solved (— 145 
Fermat's Problem). He began analytic geome- 
try by studying the theory of conic sections of 
Apollonius, and utilizing this theory he dealt 
with the notions of tangent lines, maximal 
(minimal) values of functions, and quadrature, 
which made him a pioneer in calculus. He also 
wrote a precursory work in the theory of prob- 
ability in the course of his correspondence 
with *Pascal. *Fermat's principle is important 
in the field of optics, where it is known as the 
law of least action. Unlike *Descartes, he em- 
phasized the revival rather than the criticism 
of Greek mathematics. 
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The last theorem of Fermat (c. 1637) asserts 
that if n is a natural number greater than 2, 
then 


x" y"=2" (1) 


has no rational integral solution x, y, z with 
xyz #0. In the case n=2, equation (1) has 
integral solutions called Pythagorean numbers 
(— 118 Diophantine Equations). Fermat read 
a Latin translation of Diophantus’ Arith- 
metika, in which the problem of finding all 
Pythagorean numbers ts treated. In his per- 
sonal copy of that book, Fermat wrote his as- 
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sertion as a marginal note at the point at which 
n=2 in equation (1) is treated and added the 
famous words, “I have discovered a truly 
remarkable proof of this theorem which this 
margin is too small to contain.” It is not 
known whether Fermat actually had a proof. 
Fermat’s problem asks for a proof or disproof 
of this conjecture, which itself has not been 
solved despite centuries of efforts by many 
mathematicians; but its study has promoted 
remarkable advances in number theory. In 
particular, E. E. Kummer's theory of ideal 
numbers and the development of the theory of 
teyclotomic fields were originally conceived in 
treating Fermat's problem. 

In this article, we consider only those in- 
tegral solutions x, y, z of equation (1) with 
xyz #0 that are relatively prime. We also 
restrict ourselves to the cases n=] (odd prime) 
and n=4, without loss of generality. 

For smaller values of n, the nonsolvability of 
equation (1) was proved long ago, for n=3 by 
L. Euler (1770), and later again by A. M. Le- 
gendre; for n=4 by Fermat and Euler; for n=5 
by P. G. L. Dirichlet and Legendre (1825); and 
for n=7 by G. Lamé (1839). S. Germain and 
Legendre found some results on more general 
cases, but the most remarkable result was ob- 
tained by Kummer (J. Reine Angew. Math., 40 
(1850), Abh. Akad. Wiss. Berlin (1857)). 

Let I be an odd prime, Z a primitive Ith root 
of unity, and h the fclass number of the cyclo- 
tomic field Q(¢). Then the class number h, of 
the real subfield Q(t 1 C^!) of Q(C) divides h. 
We call h, =h/h, and h, the tfirst and second 
factors of h, respectively. 

(1) If / is tregular, that is, if (^, 1) 2 1, then 
x' 4 y! ^ z! has no solution (Kummer, 1850). 

There are infinitely many irregular primes 
[3]; those under 100 are 37, 59, and 67. There 
are 7,128 regular primes and 4,605 irregular 
primes between 3 and 125,000. It is not yet 
known whether there are infinitely many regu- 
lar prime numbers, although the beginning 
part of the sequence of natural numbers con- 
tains a larger number of these than the num- 
ber of irregular prime numbers. The condition 
(i, h) 21 is equivalent to saying that the numer- 
ators of *Bernoulli numbers B,,, (m — 1,2, ..., 
(1— 3)/2) are not divisible by / (Kummer, 1850). 

Kummer obtained a result on irregular 
primes (1857) which was improved later as 
follows. Note that if | is not regular then A, is 
divisible by / (Kummer, 1850) (— 14 Algebraic 
Number Fields). 

(2) If (h5, 1) and the numerators of Ber- 
noulli numbers B,,, (m= 1,2, ...,(/—3)/2) are 
not divisible by 1°, then vi y'— z! has no 
solution (H. S. Vandiver, Trans. Amer. Math. 
Soc., 31 (1929)). By computation Vandiver 
confirmed that x'+ y' 2 z' has no solution for 
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| « 619. At present, this procedure has been 
continued for /< 125,080 using computers by 
the method of D. H. Lehmer, E. Lehmer, and 
H. S. Vandiver, Proc. Nat. Acad. Sci US, 40 
(1954) (S. S. Wagstaff, Math. Comp., 32 (1978)). 

When the condition (xyz,/)—1 or (xyz, I) -1 
is added, we speak of Case I or Case II, respec- 
tively. The following theorems hold for Case I. 

(3) If x'+ y! « has a solution in Case I, 
then 


By fi 2s (050 (modl), m21,2,...,(1—3)/2, 


Q) 


holds for —t=x/y, y/x, y/z, z/y, x/z, and z/x, 
where f, (t) — X1-lr" Ip, and B, is the mth 
Bernoulli number. This is called Kummere 
criterion (D. Mirimanov, 1905). 
A simplification of the above result is 
(4a) If x! 4- y! « z' has a solution in Case I, 
then 


(2^! —1)/1=0 (mod!) 


(A. Wieferich, J. Reine Angew. Math., 136 
(1909)). This result created a sensation at the 
time of its publication. It was first shown that 
1093 and 3511 are the only primes with | « 
3700 for which the above congruence holds; it 
is presently known that no other / with 1<6 x 
10? satisfies this congruence. The criterion 
(4a) was gradually improved by Mirimanov 
(1910, 1911), P. Furtwàngler (1912), Vandiver 
(1914), G. Frobenius (1914), F. Pollaczeck 
(1917), T. Morishima (1931), and J. B. Rosser 
(1940, 1941). For example: 

(4b) If x! 4- y! « z! has a solution in Case I, 
then 


(m! — 1120 (mod!) (3) 


holds for all m with 2x m <43. By means 

of this result, Rosser (1941) showed for l< 
41,000,000, and D. H. Lehmer and E. Lehmer 
(Bull. Amer. Math. Soc., 47 (1941)) showed for 
1« 253,747,889 that x! 4- y' « z! has no solution 
in Case I. 

We have hitherto been concerned with 
rational integral solutions of x! + y/— z'. We 
may also consider the problem of proving or 
disproving that al + f' =y" has no solution 
a, B, y with «fy z 0 in the ring of algebraic 
integers of Q(C). Case I means the impossibil- 
ity of 


al + B'--y «0, (apy, D) — 1, (4) 
and Case II means the impossibility of 
a+ f! —&A" y, (ay, 1) 5 1, (5) 


where n is a natural number, e is a funit in 
OCL, and 4— 1 — C. We have the following 
results: 

(1*) If (h, 1) « 1, then neither equation (4) nor 
equation (5) has a solution (Kummer, 1850). 
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(2*) Under the same conditions as in state- 
ment (2), equation (4) has no solution. If we 
additionally restrict o, fi, y to relatively prime 
integers of OC - £^!) and replace 4 by (1— 
CO(1— CT then equation (5) also has no solu- 
tion (Vandiver, 1929). 

(4b*) If equation (4) has solutions o, f), y in 
Q(¢), then congruence relation (3) holds for all 
m with 2« m x: 43 (Morishima, 1934). 

When / is sufficiently large, we have the 
results of M. Krasner (C. R. Acad. Sci. Paris 
(1934)) and Morishima (Proc. Japan Acad., 11 
(1935)). 

Bibliographies are given in Vandiver and 
Wahlin [1] and Vandiver [2]. 
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A. Introduction 


As the S-matrix or *Green's function in quan- 
tum field theory is usually prohibitively dif- 
ficult to calculate, perturbative expansions in 
terms of coupling constants have been em- 
ployed since the beginning of the theory (— 
386 S-Matrices). R. P. Feynman (Phys. Rev., 76 
(1949)) invented a way of calculating the series 
in terms of Feynman integrals. His method 
drastically simplified the preceding method 
due to S. Tomonaga and J. S. Schwinger, even 
though, as was later shown, the two methods 
are theoretically equivalent (F. J. Dyson, Phys. 
Rev., 75 (1949)). A Feynman integral is an 
integral associated with a Feynman graph 
according to the Feynman rule explained 

in Section B. Feynman integrals inherit the 
troublesome problem of divergence, and some 
recipe which systematically provides them 
with a definite meaning is needed. Such a 
recipe is given by the renormalization the- 

ory of Tomonaga, Schwinger, Feynman, and 
Dyson. A mathematically rigorous renormali- 
zation theory was given by N. N. Bogolyubov 
and O. S. Parasiuk (Acta Math., 97 (1957)), 
later supplemented by K. Hepp (Comm. Math. 
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Phys., 2 (1966)). See also W. Zimmermann 
(Comm. Math. Phys., 15 (1969)) and S. A. 
Anikin et al. (Theor. Math. Phys., 17 (1973)). 
Furthermore, E. R. Speer [1] gave a mathe- 
matically convenient recipe of renormalization 
(under the condition that massless particles 
are irrelevant). Although the series expansion 
in coupling constants is a divergent series 
even after renormalization (— 386 S-Matrices), 
the study of Feynman integrals has given 
much insight into the qualitative aspects of the 
S-matrix, and in particular, into its analytic 
structure (e.g., R. J. Eden et al. [2]). In this 
respect the discovery of the Landau-Nakanishi 
equations, which describe the location of sin- 
gularities of Feynman integrals, was crucially 
important (L. D. Landau, N. Nakanishi, and 
J. Bjorken, 1959; — Section C). Later, R. E. 
Cutkosky found a formula which gives impor- 
tant information concerning the ramification 
of Feynman integrals near their singularities 
(— Section C). It gave impetus to J. Leray's 
mathematical study of Feynman integrals 
from the viewpoint of integration of multi- 
valued analytic functions (Leray, Bull. Soc. 
Math. France, 87 (1959)). Such studies were 
subsequently carried out by D. Fotiadi, M. 
Froissart, J. Lascoux, F. Pham, etc; — [3-5] 
and references cited there for this topic. An 
extensive study by G. Ponzano, T. Regge, 
Speer, and J. M. Westwater on the mono- 
dromy structure of Feynman integrals is 
closely related to the studies by Pham and 
others (— Regge in [6] and references cited 
there). On the other hand, the progress of 
microlocal analysis has thrown new light on 
the Feynman integrals and has given a unified 
foundation to these various other studies (— 
Section C; also Pham, M. Kashiwara, and T. 
Kawai in [6], M. Sato et al. in [6] and refer- 
ences cited there). 


B. Definitions 


First, the notion of Feynman graphs is intro- 
duced. A Feynman graph is sometimes called a 
Feynman diagram. A Feynman graph G con- 
sists of finitely many points (called vertices) 
{Vj} j-1,.... finitely many 1-dimensional seg- 
ments (called internal lines) {L,};-,,...y and 
finitely many half-lines (called external lines) 
FLE}, =1.....n all of which are located in a 4- 
dimensional affine space. Each of the end- 
points W;* and W, of L, and the endpoint of 
Lt coincide with some vertex V.. A four-vector 
D,— D, o Pr, 15 Do 2 Pr,3) 18 associated with each 
external line L? and a constant m,>0 is as- 
sociated with each internal line L;. For sim- 
plicity, we usually suppose, in addition, that 
each internal line and each external line are 
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oriented, that W,* Æ W, and that G is con- 
nected. The orientation of a line is indicated by 
the symbol > . Given an orientation, we de- 
fine the incidence number [ j:/] to be +1 or 
—1 according to whether L, ends or starts from 
V,. In other cases, [ j:/] is defined to be zero. 
The incidence number [ j:r] is defined in the 
same way but with L; replacing L, (Fig. 1). 





Fig. 1 

In this example of a Feynman graph, the internal 
lines L, and L, do not intersect and this diagram 
should be drawn in R*, not in R?; this is indicated 
by 7X. . For convenience, multiple lines such as L, 
and L, are usually drawn in a curvilinear manner, 
as shown. 


The Feynman rule associates the following 
integral F(p) with each Feynman graph G: 


ra= | a 9 a Cir p+ Dea LAD 


N i(k?-m +, /-1 0) 
N 
xY dtk, (1) 
1-1 
where k? =k?) — X3., 2,. Here 1/(k? —mj + 
d —1 0) means lim, o(1/K? —m? + /—1 8) 
(— 125 Distributions and Hyperfunctions). 

Here we consider the case where the interac- 
tion Lagrangian density does not contain 
differential operators (i.e., direct coupling) and 
all relevant particles are spinless. In general, 
we should multiply the integrand of F;(p) by a 
matrix of polynomials of the p, and k;. 

F;(p) has the form (X, Lj:r]pJfa(b); we 
often investigate f,(p) instead of F,(p). The 
function fs(p) is studied on M = (pe R^" 
XL :r]p, 70], and is called a Feynman 
amplitude. The integral (1) is not well defined 
as it stands because of the following prob- 
lems: (a) Is the product appearing in the inte- 
grand well defined? (b) Is the integral conver- 
gent? The first problem is not serious if m, #0 
(— 274 Microlocal Analysis E) However, the 
second problem, called the ultraviolet diver- 
gence, is serious. The renormalization proce- 
dure is intended to overcome this difficulty. 
When some mj is equal to zero, even the first 
problem, called the infrared divergence, is 
serious. See D. R. Yennie et al. (Ann. Phys., 13 
(1961)) and T. Kinoshita (J. Math. Phys., 3 
(1962)) for analyses of the infrared divergenee. 
In this article we always assume for simplicity 
that every m, is strictly positive, even though 
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such an assumption is too restrictive from the 
physical viewpoint. 

Calculations of Feynman integrals are often 
done by means of the parametric representa- 
tion of the integral (1), of the form 


(zum.)| yir 


Ji ó(1— Xj, af doy 
o U (x (V(p, a) + / —] Q)-N*2w-2' 


where U(x) and V(p, a) are determined by the 
topological structure of the graph G. See [7] 
for the derivation of this formula and the 
treatment of the integra] written in this form. 
It is useful not only for the study of the sin- 
gularity structure of F,(p) but also for the 
study of spectral representations, etc. (—, e.g., 
[7. 8]). Note that several different notations 
are used in the literature for the parametric 
representation of the integral. Hence one 
should be careful in referring to papers which 
use parametric representations. 





C. Analytic Properties of Feynman Integrals 


The celebrated result of Landau (Nuclear 
Phys., 13 (1959)), Nakanishi (Prog. Theor. 
Phys., 22 (1959)) and Bjorken (thesis, Stanford 
Univ., 1959) asserts that the singularities of a 
Feynman amplitude f; (p) are confined to the 
subset L * (G) (called a positive-« Landau- 
Nakanishi variety) of M — (peR^"| Y; Urin, 
— 0), defined by the following set of equations 
(called Landau-Nakanishi equations), where u,, 
w;, and k; are real four-vectors and a is a real 
number, all of which are to be eliminated to 
define relations among the p, (note, however, 
that a positive-« Landau-Nakanishi variety is 
not strictly a subvariety of M, because of the 
constraint (2e): 


ki 


Uu, = 
j 


Ur (ro b ast). (2a) 


i 


n N 
2, LU lest H [:]k 20  (jzl....m) 
r=1 i= 


(2b) 
[:]w;ak  (l-L...,N) (2c) 
j=l 
a(k?7—m7)=0  (I-1,...,N), (2d) 
a, = 0, (2e) 
with some 
a #0. (2f) 


Usually a Landau-Nakanishi variety (resp., 
equation) is called a Landau variety (resp., 
equation) for short. The equation (2a) is 
conventional; (2b) represents the energy- 








146 C 
Feynman Integrals 


momentum conservation law at the vertex V; 
(2d) corresponds to the mass-shell constraint 
(if «a 2: 0) on the internal line L,. Since (2c) 
entails X, e(C)aj;k, =0 for a closed circuit 

( loop) C of G for some set of values &(C) = 
+1 or 0 with z (C) being 0 if L, does not be- 
long to C, the equation (2c) is usually called 
the closed-circuit condition. By attaching w, to 
the vertex V; and associating «,k, to the inter- 
nal line L,, we get a diagram representing a 
multiple collision of classical point particles, 
since the relations (2a)- (2f) are just the class- 
ical conditions for such a collision (S. Coleman 
and R. E. Norton, Nuovo Cimento, 38 (1965)). 
The fact that the Landau-Nakanishi equations 
admit such an interpretation is neither acci- 
dental nor superficial in view of the tmacro- 
scopic causality of the S-matrix. Note also that 
there is another interpretation of the Landau- 
Nakanishi equations, which emphasizes their 
resemblance to *Kirchhoff's law (Nakanishi, 
Prog. Theor. Phys., 23 (1960)). Such a resem- 
blance can be used to study the structure of 
Feynman integrals from the viewpoint of 
*graph theory. See [7] and the references cited 
there for this topic. 

An important observation by Pham and 
Sato (1973), which opened a way to the micro- 
local analysis of Feynman integrals and the 
S-matrix, is the following: If we consider the 
Landau-Nakanishi equations to define a sub- 
variety of S*M, the fspherical cotangent bun- 
dle of M, by eliminating only w, k, and a, 
then the resulting variety describes the tsin- 
gularity spectrum of f; (p). More precisely, 
S.S.f¢(p) is confined to the set (p; Jc u), 
where (p; u) satisfies the Landau-Nakanishi 
equations. The rigorous proof of this state- 
ment was given by Sato et al. in [6]. The 
subset of S* M or /-1S*M thus obtained 
is denoted by ¥ * (G) and is also called a 
positive-a Landau-Nakanishi variety. It is 
noteworthy that the microlocalization of the 
classical result of Landau, Nakanishi, and 
Bjorken had essentially been achieved in a less 
sophisticated manner by D. Iagolnitzer and H. 
P. Stapp (Comm. Math. Phys., 14 (1969)) in the 
framework of S-matrix theory. The variety 
defined by (2a)—(2d) and (2f) is denoted by 
L(G) or .Z(G) and is a Landau-Nakanishi 
variety. In a neighborhood of p? in L*(G), 
Leni is the boundary value of a holomorphic 
function f;(p) whose domain of definition is 
determined by u-vectors (— 274 Microlocal 
Analysis E). Furthermore, in simple cases one 
can verify that fc(p) can be analytically con- 
tinued to define a holomorphic function on 

mm 


the universal covering space U — L(G)€ of U — 
L(GY for a complex neighborhood U of p°, 
where L(G)* denotes a complexification of 
L(G). Hence we can discuss the difference of 
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fo(yp) and Lin where y denotes a loop en- 
circling L(G)€. Cutkosky (J. Math. Phys., 1 
(1960)) observed that it can be expressed 

(in simple cases) by the integral obtained by 
replacing 1/(k? —m? + J—10) in the right- 
hand side of (1) by — An, d —18*(k? —m?) = ger 
— 2r, / —1Y(k, g)ó(k? — m2), where Y(k, o) 
denotes the *Heaviside function. A result of 
this type is called a discontinuity formula and 
is now obtained for the S-matrix itself in a 
suitably modified manner, i.e., the discontinu- 
ity formula holds beyond the framework of 
perturbation theory (— 386 S-Matrices). As is 
mentioned in 386 S-Matrices, the discontinuity 
formula is closely related to Sato's conjecture 
on the tholonomic character of the S-matrix 
(T. Kawai and Stapp in [6]). Actually, M. 
Kashiwara and Kawai (in [6]) proved that 
fc(p) satisfies a tholonomic system of linear 
differential equations whose characteristic 
variety is confined to the extended Landau- 
Nakanishi variety L(G). They further proved 
that the system has regular singularities (— 
274 Microlocal Analysis G). Their result gives, 
on one hand, a precise version of Regge's 
statement to the effect that f,(p) is a generaliza- 
tion of a thypergeometric function (in Battelle 
Rencontres, C. DeWitt and J. A. Wheeler (eds.), 
Benjamin, 1968), and, on the other hand, a 
rigorous proof of the fact that fel p) is a Nils- 
son class function. This fact is closely related 
to the works of D. Fotiadi, J. Lascoux, Pham, 
and others. Kashiwara and Kawai (Comm. 
Math. Phys., 54 (1977)) also showed that the 
holonomic character of the Feynman ampli- 
tude is an important clue for understanding 
the so-called hierarchical principle, which had 
been proposed and studied in connection 

with the *Mandelstam representation by the 
Cambridge group (Eden et al. [2]). Further- 
more, Kashiwara et al. (Comm. Math. Phys., 
60 (1978)) gave a useful expression of f; (p) at 
several physically important points by analyz- 
ing the microlocal structure of the holonomic 
system that f,,(p) satisfies. Thus the use of 
*(micro-) differential equations in analyzing 
Feynman amplitudes has turned out to be 
effective in understanding their singularity 
structures in a unified manner. 
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A. General Remarks 


E. Stiefel [2] introduced certain tdiffeomor- 
phism invariants of tdifferentiable manifolds 
by considering a field of a finite number of 
linearly independent vectors attached to each 
point of a manifold; and H. Whitney [3] ob- 
tained the notion of fiber bundles as a com- 
pound idea of a manifold and such a field of 
tangent vectors. S. S. Chern [4] emphasized 
the global point of view in differential geo- 
metry by recognizing the relation between the 
notion of fconnections (due to E. Cartan) and 
the theory of fiber bundles. The theory of fiber 
bundles is also applied to various fields of 
mathematics, for example, the theory of tLie 
groups, 'homogeneous spaces, tcovering 
spaces, and general vector bundles, vector 
bundles of class C’, or analytic vector bundles. 
Homological properties of fiber bundles are 
studied by means of *spectral sequences, and 
cohomology structures of several homoge- 
neous spaces and several characteristic classes 
are determined explicitly by means of fcoho- 
mology operations. Also, the group K(X), 
formed by equivalence classes of vector bun- 
des over a finite (CW complex X, is a *gener- 
alized cohomology group, treated in * K-theory, 
in which further development is expected 
(— 237 K-Theory). 


B. Definitions 


Let E, B, F be topological spaces, p: E25 Ba 
continuous mapping, and G an teffective left 
topological ttransformation group of F. If 
there exist an topen covering {U,} (xe A) of B 
and a homeomorphism 9,:U, x F p ‘(U,) for 
each «e A having the following three prop- 
erties, then the system (E, p, B, F, G, U,, q,) is 
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called a coordinate bundle: (1) po, (b, y) - b 

(be U,, ye F). (2) Define ox Fzzp (b) (be U,) 
by 9, (y) — o. (b, yy; then gj, (b) — 95,10 Pap EG 
for be U, N Ug. (3) gga: Ua N Us 5 G is continu- 
ous. We say that this bundle is equivalent 

to a coordinate bundle (E, p, B, F, G, U,, Py) 

if g,,(b) = 9, e 0 Pab EG (be U, N U;) and 

Gua? UN U, — G is continuous. An equivalence 
class č — (E, p, B, F, G) of coordinate bundles is 
called a fiber bundle (or G-bundle), and E is 
called the total space (or bundle space), p the 
projection, B the base space, F the fiber, and G 
the bundle group (or structure group). Also, U, 
of a coordinate bundle (E, p, B, F, G, U,, oi 
belonging to the class £ is called the coordinate 
neighborhood, o, the coordinate function, and 
ge, the coordinate transformation (or transition 
function). 

Let €=(E, p, B, F, G) and & =(E’, p’, B', F,G) 
be two fiber bundles with the same fiber and 
group. A continuous mapping V; E E' is 
called a bundle mapping (bundle map) from £ to 
& if the following two conditions are satisfied: 
(1) There is a continuous mapping y: B5 B' 
with p'oW- Pop. (2) V, (b) 29; 10V odo, ,€ 
G (be UN y (Vj), b= (b), and Ypa: ULN 
V 1 (V,)5G is continuous, where (U,, pa} and 
IK. 9,) are pairs of coordinate neighbor- 
hoods and functions of £ and č’, respectively. 
Moreover, if y is a homeomorphism, then VP is 
also a homeomorphism and V^! is a bundle 
mapping. 

Let €=(E, p, B, F, G) and &' =(E", p', B, F, G) 
be two fiber bundles with the same base space, 
fiber, and group. If there is a bundle map- 
ping V: E E' such that y: Bo B as described 
before is the identity mapping, then we say 
that ¢ is equivalent (or isomorphic) to č and 
write £z C. Take the same coordinate neigh- 
borhoods {U,}, and let g,, and gg, be the co- 
ordinate transformations of č and E. respec- 
tively. Then £z £' if and only if there are 
continuous mappings 4,: U,— G with gj, (b) = 
Allge AKT (be U, N Up). 

For a system {g,,} of coordinate transforma- 
tions of a fiber bundle, we have g,,(5) gj, (b) = 
g,,(b) (be U, Us N U,). Conversely, given an 
open covering {U,} and a system of gga: Us N Ug 
6G satisfying, this condition, there is a unique 
G-bundle (E, p, B, F, G) with {gpa} as a system 
of coordinate transformations. Actually, E is 
the "identification space of Ê= ((b, y, x) | be U,} 
C B x F x ^ obtained by identifying two points 
(b, y, a), (b', y’, B) with b — b', y’=g,,(b)- y, and p 
is defined by p{(b, y,a)] =b, where the index 
set A= (aj is considered a discrete space. 


C. Principal Fiber Bundles 


A fiber bundle 5 — (P, q, B, G, G) is called a 
principal fiber bundle (or simply principal 








147 E 
Fiber Bundles 


bundle) if G operates on G by left translations. 
This is also defined by the following con- 
ditions: G is a right ttopological transfor- 
mation group of P, and there exist an open 
covering {U} of B and homeomorphisms ø: 
U xt sa (U) with qo(b, g) — b, o(b, g): g' = 
ob, gg") (be U; g,g' €G). A bundle mapping 
V: PP’ between two principal bundles y = 
(P, a B, G) and g =(P’, q', B', G) is also defined 
as a continuous mapping V with V(x:g)— 
Y(x): g. 


D. Associated Fiber Bundles 


Let n — (P, q, B, G) be a principal bundle, and 
let F be a topological space having G as an 
effective left topological transformation group. 
Then G is a right topological transformation 
group of the product space P x F by (x, y): g— 
(x-g,g ! y) (xeP, yc F,geG). Consider the 
‘orbit space P x, F =(P x F)/G, and define the 
continuous mapping p: P xF >B by p{(x, y)] 
= q(x). Then y x; F Z(P xc F, p, B, F,G) is 

a fiber bundle, called the associated fiber 
bundle of y with fiber F. On the other hand, n 
is called an associated principal bundle of ¢ = 
(E, p, B, F, G if €=n x, F. A principal bundle 

n having the same coordinate transformations 
as € is an associated principal bundle of £, and 
two fiber bundles are equivalent if and only if 
their associated principal bundles are equiva- 
lent. Therefore, given a fiber bundle č, there 
exists a principal bundle y such that £ —5 xc F. 


E. Examples of Fiber Bundles 


(1) Product bundle. (B x F, p,, B, F, G), where 
pı» the projection of the product space, is 
called a product bundle if there is just one 
coordinate neighborhood B and the coordi- 
nate function is the identity mapping of B x F. 
A bundle that is equivalent to a product bun- 
dle is called a trivial bundle. 

(2) A tcovering (Y, p, Y) is a fiber bundle 
whose fiber is the discrete space p ! (yo) 

(yo € Y), and the structure group is a factor 
group of the "fundamental group z,(Y, yo). In 
particular, a tregular covering is a principal 
bundle. 

(3) Hopf bundle. Let A be the real number 
field R, the complex number field C, or the 
quaternion field H, 4— dimg A, and A"*! the 
(n+ 1)-dimensional linear space over A. Iden- 
tify two points (zo, ...,Z,), (Zo, ..., Z4) of the 
subspace A"*! — {0} (0 is the origin) if there is a 
ZEA such that z;=z;z (i=0,...,n). Then we 
obtain the identification space P"(A), called the 
n-dimensional projective space over A. Let 57 
(2 S*"*0^1. the (2(n-- 1)— D-sphere) be the 
unit sphere in A"*!. Then S? is the topological 
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transformation group of $4 by the product of 
A, and the orbit space S% /SÌ is P"(A). Fur- 
thermore, (S^, q, P"(A), $2) (q is the projection) 
is a principal bundle called the Hopf bundle (or 
Hopf fibering). These comments are valid also 
for n= oo. When n 5 1, P' (A) is homeomorphic 
to S^, and the Hopf bundle is (S?*7', q, $^, $^!) 
(A=1,2,4). A Hopf bundle is defined similarly 
for 4-8 using the tCayley algebra, and the 
projection q:$?^^! +S? (A=2, 4, 8) is the Hopf 
mapping (Hopf map). 

(4) Let G be a topological group, H its 
closed subgroup, and r:G—G/H, r(g)=gH the 
natural projection. If there exist a neighbor- 
hood U of r(H)e G/H and a continuous map- 
ping f: UG such that ro f is the identity 
mapping, then we say that H has a local cross 
section f in G, and (G/K, p, G/H, H/K, H/Ko) is 
a fiber bundle for any closed subgroup K of H 
(where p is the natural projection gK >gH and 
Kj, is the largest normal subgroup of H con- 
tained in K). The associated principal bundle 
of the latter fiber bundle is (G/K,, p, G/H, 
H/K,). Any closed subgroup H of a *Lie 
group G has a local cross section in G; hence 
the above bundles can be obtained. 


F. Vector Bundles 


A system €=(E, p, B) of topological spaces E, B 
and a continuous mapping p: E B is called an 
n-dimensional real vector bundle if the follow- 
ing two conditions are satisfied: (1) p ! (b) is a 
real vector space for each be B. (2) There exist 
an open covering {U,} (xe A) of B and a co- 
ordinate function o, U, x Fz P! (U,) for each 
xe ^, where F =R"; furthermore, the 9, ,: 

R" =p !(b)are isomorphisms of vector spaces. 
In this case, gga(b)= Pki O Pap: R" ©R" ( be 
U,N Up) is an element of the "general linear 
group GL(n, R). Hence a vector bundle is a 
fiber bundle with fiber R” and group GL(n, R), 
and the converse is also true. A 1-dimensional 
vector bundle is called a line bundle. A vector 
bundle E =(E’, p’, B) is called a subbundle of a 
vector bundle £ - (E, p, B) if E’ C E, p| E —p,, 
and p' (b) is a vector subspace of p! (b) for 
each be B. 

Let €, and č, be two vector bundles of 
dimension n, and n, with the same base space 
B. Let E be the union of the direct sum p, ! (b) 
4- p; ! (b) for be B, and define p: EO B by 
p(pi (b) - p?! (b)) 2 b. Take the same coordi- 
nate neighborhoods U, for č, and £;, and 
define p,: U, x Ri» p ' (U,) by o,(b, y) — 
ei, ei lt (ye Ri "rs =R" +R"), where 
the qi: U, x R" ~p; ! (U,) are the coordi- 
nate functions of £,. Then E is topologized by 
taking the family (9,(0)] (O is open in U, x 
R": *":) as the topen base, and we obtain 
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an (n, +n,)-dimensional real vector bundle 

(E, p, B), denoted by £, ® č, and called the 
Whitney sum of č, and £,. Similarly, we can 
define the tensor product č; ® €,, the p-fold 
exterior power AP č (or bundle € of p-vectors), 
and the bundle of homomorphisms 

Hom(£,, £;), of dimension n,7;, (7), and 

nn, respectively, using the tensor product 
R'"GR" = R":",, the p-fold texterior power 
NR'- n(? (the space (RP of p-vectors 

in R"), and the space of homomorphisms 
Hom(R";, R^») - R":";, (For the last one, we use 
Hom((o4 A ', oi d instead of Hom(o1 o?) 
Hom(£,£!) 2 £* is called the dual (vector) 
bundle of £, where e! is the trivial line bundle. 
If we use coordinate transformations, ®, ®, 
AP, and £* are obtained by the direct sum, 
*Kronecker product, matrix of ?p-minors, and 
*transpose of matrices, respectively. If č, is 

a subbundle of £,, the quotient bundle ¢, /¢, 

of dimension n, —n, is defined by using the 
quotient vector space R':/R'? = R" "2, and 

č ® (£,/£5) is equivalent to ¢,. These oper- 
ations preserve the equivalence relation of 
bundles. Also, & and (9 are commutative up 
to equivalence and satisfy the associative and 
distributive laws. For each € having a finite- 
dimensional "CW complex as base space, there 
is a such that € @ C' is trivial. 

Using the complex number field C or the 
quaternion field H instead of the real number 
field R, we can definc similarly the complex 
vector bundle or the quaternion vector bundle 
and the operations ®, Q, etc. For a complex 
vector bundle £, the complex conjugate bun- 
dle Z is defined by the complex conjugate of 
matrices. 

(5) Tangent bundles, tensor bundles. Let M 
be an n-dimensional tdifferentiable manifold of 
class C". Consider the ?*tangent vector space 
T,(M) at pe M, set T(M)=|)pem T, (M), and 
define x: T(M) M by n(T,(M)) — p. Fora 
*coordinate neighborhood U, of p with local 
coordinate system (x,, ..., x,), each point of 
n^! (U,) is represented by 27. f;0/0x;, and 
n (U,) has a coordinate system (x4, .... X, 
fy; f). Hence T(M) is a C"! -manifold, and 
X(M)-«(T(M), 1, M, R", GL(n, R)) is an n- 
dimensional real vector bundle. X(M) is called 
the tangent (vector) bundle, its dual bundle 
X*(M) the cotangent (vector) bundle, and the 
tensor product T(M)@ ... 9 X*(M)Q ...a 
tensor bundle of M. The line bundle A" X*(M) 
is called the canonical bundle of M. 

For a tcomplex manifold M, T(M)isa 
complex manifold and X(M) is a complex 
vector bundle. Therefore these bundles are 
defined as complex bundles. 

(6) Tangent r-frame bundle. In the preceding 
example, the space of all tangent r-frames of 
M is a bundle space with base space M and 
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group GL(n, R). It is called the tangent r-frame 
bundle (or the bundle of tangent r-frames) of 
M. 


G. The Classification Problem 


For a fiber bundle €=(E, p, B, F, Gand a 
continuous mapping w: B' B, consider the 
subspace E' = f(x, b')e E x B'|p(x)  v(b')] of 
E x B' and the projections p': E'2 B' and V: 
EE Then y* € «(E', p', B', F,G) is a fiber 
bundle, and ¥ is a bundle mapping from y^ é 
to č; v*£ is called the induced bundle of č by 
y. Let {U,} and {gga} be the systems of coordi- 
nate neighborhoods and transformations of é. 
Then (y ? (U,)] and (gj, 0 V] are correspond- 
ing systems of y ^£. If V: E'2 E is a bundle 
mapping from č’ to č having y: B' B as the 
mapping of base spaces, then E =” €. Also, 
we have dr ifé mé (yoy) ¿= 
y'* (p? €). If € is a principal bundle, then V ^£ 
is also principal, and (n xg F)=(W*n) xg F. 
For a tparacompact space B', yf £e y č if 
V1, V3: B'— B are *homotopic. 

For a topological group G, a principal 
bundle £(n, G) - (E(n, G), p, Bin, G), G) is called 
an n-universal bundle if E(n, G) is *n-connected 
(n € oo); its base space B(n, G) is called an n- 
classifying space of G. In particular, £(oo, G) 
— 6g — (Eg, p, Bg, G) is called simply a universal 
bundle and B, a classifying space of G. Then 
we have the classification theorem: Let B be a 
CW complex with dim B x n; then the set of 
equivalence classes of principal G-bundles with 
base space B is in one-to-one correspondence 
with the *homotopy set 2(B; Bin. G)) of con- 
tinuous mappings of B into B(n, G). Such a 
correspondence is given by associating with 
the induced bundle w* £(n, G) a continuous 
mapping y: B B(n, G), called the character- 
istic mapping or classifying mapping (charac- 
teristic map or classifying map) of v * Ein, G). 
Furthermore, if G is an effective left topo- 
logical transformation group of F, the set of 
equivalence classes of G-bundles with base 
space B and fiber F is in one-to-one corre- 
spondence with 2(B; B(n, G)). The correspon- 
dence is given by associating v *(£(n, G) x F) 
to y. 


H. Construction of Universal Bundles 


For an arbitrary topological group G, J. W. 
Milnor [6] constructed a universal bundle (Eg, 
p, Bg, G) in the following manner. The join 

Eg; 2 Go...0Go... of countably infinite 

copies of G is defined as follows: A point e of 
Eg is the symbol t; o, ® ... t,.g, (1 & i, < 
ij« ...«i,,m-—1,2,3,...), where Li are 


H im 


real numbers satisfying t; 20, t; +. +t, = L, 
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and o, is an element of the i,th copy of G, 

k 1, ...,m. Here we can omit t; g; if t; — 0. 
Regard E, as a topological space with a weak 
topology such that the coordinate functions 
€ t, e g; are continuous. Define the right 
action of G on Eg by (t; gi, ©... 1,9): 9 = 
t; (gi, g) -O tingin g). Let B; and p: Ego 
Bg be the identification space of E, by this 
action of G and the identification mapping. 
Then (Eg, p, Bg, G) is a universal bundle for G, 
and B, is called the classifying space for G. Eg 
and B, are sometimes written as EG and BG. 
In particular, a classifying space B, is a count- 
able CW complex for any countable CW 
group G (i.e., a topological group that is a 
countable CW complex such that the mapping 
gg ! of G into G and the product mapping 
G x GG are both cellular). The following 
examples of classifying spaces for Lie groups 
are also useful. Note that every CW complex 
Bg of a given G has the same thomotopy type. 


I. Examples of Universal Bundles 


(1) G is either O(n), U(n), or Sp(n): Let A and å 
be as in (3) of Section E. According as A is R, 
C, or H, we let U(n, A) be the torthogonal 
group O(n), the *unitary group U(n), or the 
*symplectic group Sp(n). Then the *Stiefel 
manifold Kan (A) 2 U(m n, AJ/L, x U(n, A) 
(L, 1s the unit element of U (m, A)) is A(n+ 1) 
— 2)-connected. Hence the principal bun- 
dle £(4(n 4- 1) 2, U(m +n, A)) (V, (A), 
M, a, S S), U (m, A)) from (4) of Section E is a 
(A(n+ 1) — 2)-universal bundle of U (m, A), 
where the base space M,,,, (^) - U(m-F n, A)/ 
U (m, A) x U(n, A) is the tGrassmann manifold. 
(2) G is either O(oo), U(oo), or Sp(oo). The 
examples in (1) are valid for m, n= oo. Con- 
sider the tinductive lim#t group U(oo, A) — 
[J U(n, ^) under the natural inclusion 
U (n, A) C U (n +1, A), and supply the infinite 
classical group U(oo, A) with the weak topol- 
ogy (this means that a set O of U(co, A) is 
open if and only if each ON U (n, A) is open in 
U (n, A)). Then the infinite Stiefel manifold 
Vn+n,m(A) and the infinite Grassmann manifold 
Ma n) (m — oo or n= oo) are defined as 
before, and we have 


M, m(A) == U M min n X). 


and so on. Furthermore, these manifolds are 
CW complexes, and V,, (A) (m € oo) is oo- 
connected. Although U (co, A) is not a Lie 
group, U (m, A) x U(n, A) has a local cross 
section in U(m +n, A) for m, n< oo [7]. There- 
fore, setting n= oo in (1), £(oo, U(m, A)) is a 
universal bundle of U (m, A), and the infinite 
Grassmann manifold AM. ,(A) is a classifying 
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space By, an. Also, ¢(A(n + 1) — 2, U(oo, A)) 
in (1) is a (4(n + 1) — 2)-universal bundle of 
U (oo, A) (n € oo). 

(3) G is either SO(m) or a genera! Lie group. 
For the rotation group SO(m), we have £(n — 
1, SO(n) = (Vins m(R), P, H SO (m)) and 
Baatz Mo, m, Where Ma, 4,7 SO(m4 n)/ 
SO(m) x SO(n) is the oriented Grassmann 
manifold. For any compact Lie group G, we 
have €(n— 1, G) (V, m(R), p, O(m + n)/G x 
O(n), G), where G c O(m). For any connected 
Lie group G, we have £(n, G) - (n, Gi) Soe 
where G, is the maximum compact subgroup 
of G (since G/G, is homeomorphic to a Eu- 
clidean space, £(n, G) reduces to £(n, G,)). 


J. Reduction of Fiber Bundles 


Let G be a topological group and H its closed 
subgroup. We say that the structure group of a 
G-bundle £ is reduced to H if € is equivalent to 
a G-bundle whose coordinate transformations 
take values in H. For a principal H-bundle 

Yo — (P, q, B, H), the associated H-bundle no xy 
G - (P x 4G, p, B, G) with fiber G is defined, 
where H operates on G by the product of G; 
it is also a principal G-bundle if we define 

an operation of G on P x4G by {(x,g9)} g = 
1G, gg). For a principal G-bundle y, we say 
that 7 is reducible to an H-bundle if there is a 
principal H-bundle ge with y=, x pG, and 
we call no a reduced bundle of y. It is easy to see 
that the group of a G-bundle č is reducible to 
H if and only if the associated principal G- 
bundle of & is reducible to H. Also, if yo is a 
reduced bundle of y, then w*n, is a reduced 
bundle of y * s. 

Now, assume that H has a local cross sec- 
tion in G and G/H is co-connected. Then 
for an n-universal bundle £(n, H) of H, 

é(n, H) x G is an n-universal bundle of G (n- 
connectedness of Ein, H) xy G is shown by the 
thomotopy exact sequence of ffiber spaces). 
Therefore, by the classification theorem, the 
group of any G-bundle is reducible to H, and 
the equivalence classes of G-bundles are in 
one-to-one correspondence with those of H- 
bundles. 

(1) A G-bundle is trivial if and only if its 
group is reducible to e (identity element). A 2n- 
dimensional differentiable manifold M of class 
C* has an talmost complex structure if and 
only if the group of the tangent bundle T(M) is 
reducible to GL(n, C), i.e., X(M) is considered 
as an n-dimensional complex vector bundle. 

(2) Since GL(n, R) O(n) x R""*? and 
GL(n, C) z U (n) x R”, n-dimensional real (com- 
plex) vector bundles can be considered as O(n) 
(U (n))-bundles with fiber R"(C"). 
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K. Homotopy and Homology Theory of 
Bundles 


Since a fiber bundle is a tlocally trivial fiber 
space, the exact sequence and the spectral 
sequence of fiber spaces (— 148 Fiber Spaces) 
are applicable to fiber bundles. For example, 
the cohomology structures of homogeneous 
spaces of classical groups have been deter- 
mined by A Borel, J.-P. Serre, and others. 

(1) Characteristic class. For a classifying 
space B, of a topological group G, we have an 
isomorphism z,(B,) zz, ,(G) of homotopy 
groups and the following classification theo- 
rem of fiber bundles over the n-sphere S". The 
set of the equivalence classes of principal G- 
bundles or G-bundles with fiber F over the 
base space S” is in one-to-one correspondence 
with the set z,,_,(G)/29(G) of equivalence 
classes under the operation of G on z,,_,(G) 
given by the inner automorphisms of G; such a 
correspondence is given by associating with 
each principal G-bundle y — (P, q, S", G) the 
class (called the characteristic class of n) con- 
taining the image A(z,) of a generator i,e z,(S") 
by the homomorphism A z,(S") = x, (P, G) 
7, 4(G). Take U, and U, (the open sets of 
S" such that the last coordinates t,,, are > 
—1/2 and « 1/2, respectively) as coordinate 
neighborhoods of y. Then the restriction T = 
g12|$" ^! represents the characteristic class 
of n, where g2: U, (1U5 —G is the coordinate 
transformation and S"! is the equator of S". 

(2) For the principal bundle n =(SO(n+ 
1), q, S", SO(n)), the mapping TS" ! —^SO(n) 
is given by 


Jet 0 
Tit, t) 7 2649 0 4) 


(I, is the unit matrix of degree n). Hence the 
*mapping degree of the composite q'o T:S" ! 
>S"! (of T and the natural projection q': 
SO(n)— S"^!) is equal to 0 if n is odd and 2 if 
n is even. From this fact and the homotopy 
exact sequence, we have 


"el Vin+n.m(R)) 


ifm=1 or nis even 


Z 
Ne ee if m» 1 and nis odd 


for the real Stiefel manifold V,,,, ,,(R), which is 
(n — 1)-connected. 

(3) Sphere bundles. An O(n + 1)-bundle with 
fiber S” is called an n-sphere bundle. The set of 
equivalence classes of n-sphere bundles with 
base space S” is in one-to-one correspondence 
with z,,-;(O(n+ 1))/z9(O(n+ 1)). For example, 
any l-sphere bundle over S" (mz: 3) and any 
n-sphere bundle over S? is trivial. Every 3- 
sphere bundle over S* is equivalent to one of 
fém nlm an integer, n a positive integer}, where 
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č m.n is defined as follows: Let p, a: S° —O(4) be 
defined by p(q)q' 7 qq'q `, c(a)q' —qq' (d. d are 
tquaternions of norm 1). Then these mappings 
represent generators of z, (0(4)) = x4(S? x 
S?)= Z +Z, and £, , is the 3-sphere bundle 
over S* corresponding to the element m{ p] + 
n{o}en;(O(4)) (i.e., to the mapping f,,,,:S°> 
O(4) defined by f, ,(a)(4') 2 4" Ida TL, (Here 
we use the fact that the operation of the ele- 
ment re O(4) (r(q) 2 q !) is given by rpr ! — p, 
ror ! «po |) 


L. Cross Sections 


For a fiber bundle €=(EF, p, B, F, G}, a cross 
section f: By E over a subspace By (c B)is a 
continuous mapping such that po f is the 
identity mapping of B,; a cross section over B 
is called a cross section of Z A bundle č ts 
trivial if and only if the associated principal 
bundle of € has a cross section. More gener- 
ally, given a principal bundle 5 =(P, q, B, G) 
and a closed subgroup H having a local cross 
section in G, y is reducible to H if and only if 
the associated bundle mt x ((G/H) = (P/H, o, B), 
G/H) with fiber G/H has a cross section. 

Suppose that the base space B of the fiber 
bundle €=(E, p, B, F, G) is a tpolyhedron. We 
denote the tr-skeleton of B by B' and consider 
the problem of extending cross sections f,: B" 
— E successively for r — 0, 1, .... Clearly, there 
is a cross section fo. For each r-simplex o of B, 
we have (p~! (c), p,o, F) &(o x F, p,,c, F) since 
c is ‘contractible. Hence there is a bundle 
mapping o. ox Fzp (o) with poo, —p,. 
Assume the existence of a cross section f,.,: 
B"? 5 E, and consider the mapping A; = p; o 
9, o(f,_-,|6):¢>F (p,:o x FF is the pro- 
jection and d is the boundary of c). Then if 
h, 1s extensible to h,:o — F, an extended cross 
section f,:c — E of f,_,|¢ 1s defined by f, (b) — 
,(b, h,(b)) (bec), and the extension f,: B'^ 
E of f,_, is defined by f.jo=f,, f,| B^! = 
f.. If 2,_,(F)=9, for example, there is an 
extension h, of h; since (a, 6) ~(V", S'^!), and 
f,-1 is extensible to a cross section f,. 

Now assume that the base space B of a G- 
bundle £ 2 (E, p, B, F, G) is an tarcwise con- 
nected polyhedron and F is t(n — 1)-connected. 
Then there is a cross section f: B"— E con- 
structed by the stepwise method of the previ- 
ous paragraph. But if z,(F) 0, we have an 
obstruction to extending f over B"*!. Now we 
explain how to measure this obstruction. Sup- 
pose that F is tn-simple. Then for each (n+ 1)- 
simplex o of B, the mapping h,:6 — F, defined 
by f as in the previous paragraph, determines 
a unique element c( f )(c) of the homotopy 
group z,(F). Hence we have a !cochain c( f)e 
C"*! (B; ,(F)), and f is extensible to a cross 


147 M 
Fiber Bundles 


section over B"*! if and only if c( f) 20. Thus 
there is a cross section over B"*! if and only 

if the set of c( f) for every cross section f over 
B" contains the cochain 0; {c(f)} is considered 
as a measure of the obstruction. 

Let w:I— B be a tpath. We consider the 
space I x F and the canonical projection p, : 
Ix FI. Then there is a bundle mapping Q: 
Ix FGE with poQ=wopy,, since w* f= 
(I x F,p,,1,F); and a homeomorphism w,:F x 
F is defined by wy 2994,00900,'09; ,,, 
where b, = w(e) (£20, 1), p.: U, x Fx p !(U,) is 
a coordinate function of a coordinate neigh- 
borhood U,3b,, and Q,((2Q|e x F):Fzp !(b,). 
The homeomorphism w, induces an isomor- 
phism w, :z,(F) z x, (F), and n,(F) forms a 
tlocal coefficient on B. Then the cochain c( f) 
is a tcocycle with the local coefficient z,(F), 
called the obstruction cocycle of f. Further- 
more, the set Jet f)} for every cross section 
f: B" E is a cohomology class c"*! (£)e 
H"*! (B; x,(F)) (local coefficient), and c"*!(£) 
is called the primary obstruction to the con- 
struction of a cross section. There is a cross 
section over the (n+ 1)-skeleton B"*! if and 
only if c"*! (£) « 0. The local coefficient z,(F) 
is trivial if B is tsimply connected or, for 
example, if the structure group G of ¢ is con- 
nected (in which case Z is called an orientable 
fiber bundle); when this is true, c"*! (£) is an 
element of H"*! (B; z,(F)), where z,(F) is not a 
local coefficient. Furthermore, if c"*! (£) 20 
and z,(F)=0 (n «i « m), then the secondary 
obstruction O"*!(£) c H"*! (B; z,(F)) is de- 
fined similarly (— 305 Obstructions). 


M. Stiefel-Whitney Classes 


Let €=(E, p, B, F, O(n)) be an O(n)-bundle over 
an arcwise connected polyhedron B and let 

Z be its associated principal bundle. Con- 
sider the Stiefel manifold V, , , = V, , ,(R)— 

O (n)/L, .,, x O(k), which is (k — 1)-connected, 
and the associated bundle & = £9 x ous , A 
with fiber V, ,.,. The primary obstruction 
Wësch (Ee H* (Bim, n-i) (k= 

0, 1, ...,n— 1) is called the Stiefel- Whitney 
class of £. We have 2W,,,(£) 20 unless k=n— 
| and k is odd. Hence we usually consider 
W,.4(£)e H**' (B; Z). č is orientable, i.e., the 
group of € is reducible to SO(n), if and only if 
W,(¢)=0. The Stiefel- Whitney classes of an 
n-dimensional tdifferentiable manifold M 

are defined to be those of the tangent bundle 
X(M). Since the orientability of M coincides 
with that of X(M), M is orientable if and only 
if W,(M)=0. The condition W,,,(M)=0 is 
necessary for the existence of a continuous 
field of orthonormal tangent (n — k)-frames 
over M (if k=n—1 this condition is also suffi- 
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cient). Also, when M is closed, W,(M) is equal 
to (MI, where n is the fundamental coho- 
mology class of M and y(M) is the tEuler 
characteristic of M (— 56 Characteristic 
Classes B). 


N. Chern Classes 


For a U(n) — bundle €=(E, p, B, F, U(n)), the 
primary obstruction C,,,(£) - c?** ?(£*e 
H?**?(B; Z) (K—0, L, ...,n— 1) of the associ- 
ated bundle £^ = €° x uin V, n-x(C) is called the 
Chern class of č. If we consider & as an O(2n)- 
bundle by U(n) c O(2n), then W,,,,(£) 2 0 and 
W,,(¢) = C,(€) (mod 2). The Chern classes of a 
real 2n-dimensional almost complex manifold 
are defined to be those of the tangent bundle 
X(M)(— 56 Characteristic Classes C). 


O. Bundles of Class C', Analytic Bundles 


A fiber bundle € - (E, p, B, F, G) is called a fiber 
bundle of class C (r—0,1,...00, œ) if E, B, F 
are differentiable manifolds of class C", G is a 
Lie group and a transformation group of F of 
class C', and p and the coordinate functions 
are differentiable mappings of class C". Bun- 
des of class C? are usual G-bundles, and those 
of class C? are real analytic fiber bundles. 
Similarly, complex analytic fiber bundles are 
defined by the notions of complex manifolds, 
*complex Lie groups, and *holomorphic map- 
pings. For example, the universal bundles 
€(n— 1, O(m)) and (2n, U (m)) are real and com- 
plex analytic principal bundles, respectively, 
and the tangent bundle X(M) of a C'*! (or 
complex) manifold is a C" (or complex ana- 
lytic) vector bundle. The operations of the 
Whitney sum, etc., are defined analogously for 
these vector bundles. 

The equivalence of C" (complex analytic) 
bundles is defined by means of bundle map- 
pings that are C’-differentiable (holomorphic). 
Bundles of class C" (r < oo) are classified by C” 
mappings into a classifying space in the same 
manner as for bundles of class CH. Also, the 
connection of class C” (— 80 Connections) in 
C” bundles is an important notion. 

For complex analytic bundles, a similar 
classification has been obtained for restricted 
spaces by K. Kodaira, Serre, S. Nakano [8], 
and others. The classification of complex ana- 
lytic bundles over a tStein manifold is reduced 
to that of bundles of class C? (Oka's principle 
[9]), and similar results are valid for C?- 
manifolds. The complex analytic (or holo- 
morphic) connection does not necessarily exist, 
and M. F. Atiyah [10] found the condition 
for its existence and its relation with Chern 
classes. 
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P. Microbundles 


A system x: B EB of topological spaces E, 
B and continuous mappings i, j is called an n- 
dimensional microbundle over B if for each 

be B, there exist a neighborhood U of b, a 
neighborhood V of i(U), and a homeomor- 
phism h:Vz U x R” with hoi|U=i,, j|V=p,o 
h (ij :;UzU x0cUxR",and p,:U x R"SU is 
the projection). Let Han) be the topological 
group of all homeomorphisms of R" onto itself 
fixing the origin with compact-open topology. 
Then the equivalence classes of n-dimensional 
microbundles over B are naturally in one-to- 
one correspondence with the equivalence 
classes of Hj(n)-bundes with base space B and 
fiber R^ [13]. 

In the category of polyhedra and PL (tpiece- 
wise linear) mappings, the notion of a PL 
microbundle can be defined in the same man- 
ner. The structural group of n-dimensional PL 
microbundles is defined, in a generalized sense, 
as a complete 'semi-simplicial complex [11]. 
The tangent PL microbundle is defined for any 
topological (PL) manifold. J. Milnor classified 
fsmoothings of PL manifolds by means of PL 
microbundles [11] and then showed that the 
ttangent vector bundles and its tPontryagin 
classes of smooth manifolds are not topologi- 
cally invariant [12]. 

For a *PL embedding f: MN between PL 
manifolds, if there is a neighborhood E of 
f(M) in N and a PL mapping p: E^ M so that 
a diagram v: M S EM isa PL microbundle, 
then v is called a normal PL microbundle of f. 
In this case, f is "locally flat. 

There is a locally flat PL embedding be- 
tween PL manifolds which admits no normal 
PL microbundle [14]. 


Q. Block Bundles 


As the normal bundle theory for locally flat 
PL embeddings of PL manifolds, the concept 
of block bundle was introduced independently 
by C. P. Rourke and B. J. Sanderson [15], M. 
Kato [16], and C. Morlet [17]. Let E bea 
*polyhedron, and let K be a cell complex. A 
set (E,|c € Kj of PL balls in E indexed on K is 
called a q-block structure of E if the following 
three conditions are satisfied: (1) | Jsex E, = E; 
(2) for each c € K, there is a *PL homeomor- 
phism h,:o x I*5 E, such that h,(t x I?) - E, 
for each face t of c, where J =[—1, 1]; and (3) if 
E, E, (Z, then E, E, — E,, where t=aN p. 
For c € K, E, is called the block over c, and 
h, in (2) is called a trivialization of E,. Then a 
triple (E, K, [E,|c e K} is referred to as a q- 
block bundle over K and is denoted by ¢/K. 
Another block bundle Z/K =(E’, K, {E,|oe 
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K]) over the same complex K is said to be 
isomorphic with £/K if there is a PL homeo- 
morphism g: E E', called an isomorphism, 
such that g(E,)= E; (ce K). A PL embedding 
i:| K[2 E is a zero-section of ¢/K if for each 

c € K, there is a trivialization h,:0 x I* E, 
such that h,(x, 0) 2 i(x) (xeo). In this case 

we say that £/K is a block bundle with a zero- 
section i:| K| E. There is a unique zero- 
section of Z/K up to isomorphism of £/K onto 
itself. For every "locally flat PL embedding 
between PL manifolds M and W of codimen- 
sion q and for any cell division K of M, a 
derived neighborhood N of f(M) in W admits 
a unique q-block bundle v/K =(N, K, {N,|ae 
K]) with f: MN as a zero-section up to 
isomorphism respecting the zero-section [15]. 
The block bundle v/K is called a normal block 
bundle of f: MW. 
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A. General Remarks 


J.-P. Serre [1] generalized the concept of fiber 
bundles to that of fiber spaces by utilizing the 
covering homotopy property (— Section B). 
He applied the theory of tspectral sequences, 
due to J. Leray, to the (cubic) tsingular (co)- 
homology groups of fiber spaces. These are 
quite useful for determining (co)homology 
structures and homotopy groups of topo- 
logical spaces, and are now of fundamental 
importance in algebraic topology. 


B. Definitions 


Let p: E B be a continuous mapping between 
topological spaces, and let X be a topological 
space. Then we say that p has the covering 
homotopy property with respect to X if for any 
mapping f: X 2 E and thomotopy g,: X >B 
(0€ tx 1) with po f — go, there is a homotopy 
fi XE (O<t<1) with f; — f and po f, 2 g,. 
We call (E, p, B) a fiber space (or fibration) if p 
has the covering homotopy property with 
respect to every cube [”={(x,,...,x,)|O<x;< 
1}, n—0, 1, ... (then p has the covering homo- 
topy property with respect to every (CW 
complex). Then E is called the total space, 
p the projection, B the base space, and F,— 
p |(b)the fiber over bc B. 

Let E, B, F be topological spaces and p: E> 
B a continuous mapping. We call (E, p, B, F) 
a locally trivial fiber space if for each bc B, 
there exist an open neighborhood U of b and 
a homeomorphism @:U x Fop !(U) with 
poq«(b', y) - b' (et ye F). In this case, p has 
the covering homotopy property with respect 
to each tparacompact space; hence (E, p, B) is a 
fiber space. A tfiber bundle is clearly a locally 
trivial fiber space. 
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C. Path Spaces 


Another important example of a fiber space is 
a path space. A path in a topological space 

X is a continuous mapping w:15 X (I= 

IO. 1 ]). Given subsets A, and A, of X, the 
path space Q(X; Ao, A,) is the space of all 
paths w:1— X satisfying w(e)e A, (e=0, 1) 
topologized by tcompact-open topology. De- 
fine p,:Q(X; Ao, A,)> A, by p,(w) — w(c) (e= 

0, 1). Then (Q(X; Ap, A4), Pa, A,) is a fiber space; 
in fact, p, has the covering homotopy property 
with respect to every topological space. In 
particular, the total space of the fiber space 
(Q(X; X, *), po, X) («€ X) is tcontractible, and 
the fiber pj! («)- Q(X; +, x) - OX over x is the 
tloop space of X with base point x. For a 
continuous mapping f: Y » X, consider the 
space E,={(y, we Y x Q(X; X, X)| f(y) 2 w(0)] 
and the continuous mapping p: E, X de- 
fined by p(y, w)=w(1). Then Y C Ej, and Y is 
a tdeformation retract of E,; furthermore, 

(E y, p, X) is a fiber space with f =p] Y. 


D. Homotopy Groups of Fiber Spaces 


For thomotopy groups of fiber spaces, the 
Hurewicz-Steenrod isomorphism theorem 
holds: Let (£, p, B) be a fiber space and F — 

p |(*)the fiber over the base point ze B. 

Then p, :z,(E, F) n,(B) is an isomorphism for 
n>2 and a byection for n — 1. By this theorem, 
we have the homotopy exact sequence of a fiber 
space: 


ma (B) n (F) Sn, (E) 5n,(B)—.... 
Furthermore, the more general exact sequence 
...2n(Z; QB), n(Z; F) 5 n(Z; Eg S n(Z; B), 


is valid for each CW complex Z, where 
n(Z; X)y denotes the thomotopy set of map- 
pings from Z to X relative to the base point. 

Example (1). A cross section of a fiber space 
(E, p, B) is a continuous mapping f: B E with 
pof —1. If (E, p, B) has a tcross section or the 
fiber F is a tretract of E, then z,(E) e n,(B) 
n,(F). If F is contractible in E, then z,(B) x 
n,(E) + n,- (F) (n> 2). 

Example (2). (E, p, B) is called an n- 
connective fiber space if B is tarcwise connec- 
ted, E is tn-connected, and p, :7;(E)—>n;(B} is 
an isomorphism, for i» n. For each arcwise 
connected space B and integer n, there is such 
a fiber space. 

Example (3). For a CW complex X, there are 
topological spaces X, and continuous map- 
pings f: Kaka dun" X444 5 X, (n-0,1,...) 
with the following four properties: (i) X, (0x 
nxm)is a point if X is m-connected; (ii) fa»: 

n (X) = ad Kl n); (iii) (Xp. dys X, .,) is a fiber 
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space, and its fiber is an tEilenberg-MacLane 
space K (z,( X), n); (iv) q,0 f, is 'homotopic to 
Jr, Such a system (X,, fas qa} is called the 
Postnikov system of X and is in a sense con- 
sidered a decomposition of X into Eilenberg- 
MacLane spaces. 


E. Spectral Sequences of Fiber Spaces 


The cohomological properties of fiber spaces 
are obtained mainly from the following results 
(which are valid similarly for homology except 
for properties of products). Assume that the 
base space B of a given fiber space (E, p, B) is 
*simply connected and the fiber F = p~!(«) is 
arcwise connected, and let R be a commuta- 
tive ring with unit. Then the spectral sequence 
(of singular cohomology) of the fiber space 

(E, p, B) (with coefficients in R) is defined to be 
a sequence (E,, d,) satisfying the following 
properties: 

(i) E, 2 X, , EP* are bigraded R-modules, and 
d, — X, ,d/* are R-linear differentials such that 
E em. 

(ii) E^:3 2KerdP*/Imd? "**"^!, which means 
that E,,, = H(E,). 

(iii) E?-4=0 for p «0 orq«0, EP* = EPA =... 
= EZ for r>max(p,g + 1). 

(iv) E, has a product for which EP? ER? c 
Era" and d (u: v) 2 (du): v -( 1)?**u- 

d,v (uc E73). Furthermore, the induced prod- 
uct in H(E,) coincides with the product in 
EE 

(DIE. is the tbigraded module associated with 
some filtration of the cohomology module 
H*(E; Ry; that is, H'(E; R)- D" >D"! >... 
> p"95p"*t^.71 50 and EE pPs/pp*t«t, 
Furthermore, the fcup product — in H*(E; R) 
satisfies D^* — D?^4 c D?*»'4** and coincides 
with the given product in E. 

(vi) EF* = H*(B; H*(F; R)), and the product 

in E, coincides with the cup product in 

H*(B; H*(F; R)). 

(vii) The composition of H"(B; R)= E79 > ES? 
>... Eb f = Ev = D"? c H"(E; R) is equal to 
p*, and the composition of H"(E; R) - Din 
E! = E?” c... c ED” c ED” = H”(F; R) is 
equal to i* (i: F c E), where each —> is the 
projection onto the quotient module. In the 
sequence 


H'^(F; RS Hatt, F; R)2-H"(B; R), 


we have 0*^! (Im p*) — Ein! Coimp* = Ez, 
and d,: E" ! — E"? is equal to the trans- 
gression t* = p* ! o0*:0* ! (Imp*)— Coim 
p*. Each element of 0*^! (Im p*) is called 
transgressive. 

In the following examples, we assume that R 
is a principal ideal ring. 

Example (4). Let k be a commutative 
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field, and assume that dim, H,(B; k) and 

dim, H,(F; k) are finite. Then for the *Poincaré 
polynomial P,(t) — X, b,t", b, — dim, HX k), 
we have P,(t)= PO P(t) — (1 + t)p(t), where 
p(t) is a polynomial with nonnegative coeffi- 
cients (Leray). In particular, for the tEuler 
characteristic (X) — P4(—1), we have y(E) 

= x(B)y(F). Also, if i, : H,(F; k)  H,(E; k) is 
monomorphic for each n z 0, then P,(t)= 
Py(t) P, (0). 

Example (5). Isomorphism theorem: If 
H,(B; R)=0 (0<n<r) and H,(F; R)=0(0<n 
<s), then p,: H,(E, F; Ria H,(B; R) is isomor- 
phic for 0<n<r-+s and epimorphic for n=r + 
s, and we have the following homology exact 
sequence: 


5 HAE; R)S HG; BU H,(B; R) 
5H, (F;R)oO.., n<rts 


(similarly for the cohomology). 

Example (6). Assume that H"(F; R)= 
H"(S"; R) (S" is the r-sphere, r > 1). Let E- M, 
be the *mapping cylinder of p: Eo B and p: 
E B be the continuous mapping defined 
by p. Then the Thom-Gysin isomorphism à: 
HIR R)& H"(E, E; R) (nz0) with g(a)— 
p*(a)— g(1) holds (— 114 Differential Top- 
ology G). Also, we have the Gysin exact 
sequence: 


5 H'(B; RS H(E; R)S H"^'(B; R) 
5 H"* (B; R)>..., 


where g satisfies g(a) =a ~ Q=Q~ a (Q= 
g(1)e H"*'(B; R)). Here Q is equal to the 
image of a generator of H'(F; R)=R by the 
transgression t*, and 2Q=0 if r is even. (These 
results hold also for r=0 and R=Z,.) 
Example (7). Assume that H"(B; R)= 
H"(S"; R) (r2 2). Then we have H" '(F; RS 
H"(E, F; R) (n2 0) and the Wang exact 
sequence: 


5 H'(E; R) S H'(F; R) 
S H" (F; R)OH"* E; R)>..., 


where @ satisfies O(a — f) — O(a) — B+ 
(—1y* Ua — 0(B) (x, Be H"(F; R)). 

Example (8). For a field k of odd character- 
istic, if H"(E; k) 20 (n7 0) and the algebra 
H*(F;k) is generated by a finite number of 
elements of odd degree, then H*(F;k)z 
AA, Xi) (the fexterior algebra) and 
H*(B;k)zk[y,, ..., y;], where y,=t*(x,) (A. 
Borel). 
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A. Definition 


A set K having at least two elements is called a 
field if two operations, called *addition (+) 
and *multiplication (-), are defined in K and 
satisfy the following three axioms. 

(1) For any two elements a, b of K, the sum 
a+b is defined; the associative law (a + b) 4- 
c=a+(b+c) and the commutative law a +b = 
b +a hold; and there exists for arbitrary a, b 
a unique element x such that a 4- x — b; that is, 
K is an tAbelian group with respect to the ad- 
dition (the tidentity element of this group is 
denoted by 0 and is called the zero element of 
K). 

(2) For any two elements a, b of K, the prod- 
uct ab (— a: b) is defined; the associative law 
(ab)c = a(bc) and the commutative law ab = ba 
hold; and there exists for arbitrary a, b with 
a#0 a unique element x such that ax =b, that 
is, the set K* of all nonzero elements of K is 
an Abelian group with respect to multiplica- 
tion. K* is called the multiplicative group of 
K, while the identity element of K* is denoted 
by 1 and is called the unity element (unit ele- 
ment or identity element) of K. 

(3) The distributive law a(b -- c) 2 ab - ac 
holds. In other words, a field is a tcummuta- 
tive ring whose nonzero elements form a 
group with respect to multiplication. 

A noncommutative ring whose nonzero 
elements form a group is called a noncom- 
mutative field (skew field or s-field). It should 
be noted that sometimes a field is defined as a 
ring whose nonzero elements form a group 
without assuming the commutativity of that 
group, and in this case our "field" defined 
before is called a commutative field. (The term 
“skew field" is sometimes used to mean either 
a commutative or a noncommutative field.) In 
this article we limit ourselves to commutative 
fields (for noncommutative fields — 29 As- 
sociative Algebras). 
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B. General Properties 


Since a field K is a commutative ring, we have 
properties such as a0=0a=0, (—a)b —- a( — b) 
= — ab for elements a, b in K. If a tsubring k of 
K is a field, we say that k is a subfield of K or 
K is an overfield (extension field or simply 
extension) of k. If a field K has no subfield 
other than K, K is called a prime field. 

A map f ofa field K into another field K' is 

called a (field) homomorphism if it is a ring 
homomorphism, i.e., if it satisfies f(a + b) — 
f (a) 4- f(b), f(ab) = f(a) f (b). Since a field is 
*simple as a ring, every (field) homomorphism 
is an injection unless it maps everything to 
zero. A homomorphism of K into K' is called 
an isomorphism if it is a bijection, and K and 
K' are called isomorphic if there exists an 
isomorphism of K onto K’. An isomorphism of 
K onto itself is called an automorphism of K. 

If there is a natural number n such that the 

n 
sum ni=1+...+1 of the unity element 1 is 0, 
then the minimum of such n is a prime number 
p, called the characteristic of K. On the other 
hand, if there is no natural number n such that 
nl —0, we say that the characteristic of K is 0. 


C. Examples of Fields 


The rational number field Q consisting of all 
rational numbers, the real number field R 
consisting of all real numbers, and the complex 
number field C consisting of all complex num- 
bers, are all fields of characteristic 0. A subfield 
of the complex number field C is called a num- 
ber field. The rational number field is a prime 
field, and every prime field of characteristic O is 
isomorphic to the rational number field. For 
the ring Z of all rational integers, the residue 
class ring modulo a prime number p is a field 
Z/pZ — (0,1,2, ...,p—1 (mod p)} of character- 
istic p, called the residue class field for p. Thus 
Z/pZ is a prime field, and every prime field of 
characteristic p is isomorphic to Z/pZ. If the 
number of the elements of a field K is finite, K 
is called a finite field. Z/pZ is an example of a 
finite field. 


D. Extensions of a Field 


In order to express that K is an extension field 
of k, we often use the notation K/k. Subfields 
of K containing k are called intermediate fields 
of K/k. Consider two extensions K ,/k, and 

K ;/k;, and let o: K,—>K, be an isomorphism 
which induces an isomorphism dek: K;. 
Then we call o an extension of y. Suppose that 
k,, K, are given fields and K, contains a 
subfield k, isomorphic to k,. Then there exist 
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an extension field K, of k, and an isomor- 
phism o: K,— K, which is an extension of the 
given isomorphism yk, —k,; construction of 
the field K, is often called the embedding of 
k, into K,. When K, and K, are extensions 
of k, an isomorphism: K , O K, ts called a k- 
isomorphism if it leaves every element of k 
invariant. 

In an extension K/k, let S be a subset of K. 
The smallest intermediate field of K/k contain- 
ing S is called the field obtained by adjoining S 
to k or the field generated by S over k, denoted 
by k(S). The field k(S) consists of those ele- 
ments in K each of which is a rational ex- 
pression in a finite number of elements of S 
with coefficients in k. An extension field k(t) 
obtained by adjoining a single element t to k is 
called a simple extension of k, and in this case t 
is called a primitive element of the extension. 
The rational function field k(X) with coeffi- 
cient field k is a simple extension of k with a 
primitive element X. 

When subfields k, (4€ A) of a field K are 
given, the smallest subfield of K containing all 
these subfields exists and is called the com- 
posite field of the k,. 


E. Algebraic and Transcendental Extensions 


An element « of an extension field K of a field 
k is called an algebraic element over k if « is a 
tzero point of a nonzero polynomial, say, f(X) 
—üàg-F a, X t... a, X" with coefficients in k. 
If x is not algebraic over k, then « is called a 
transcendental element over k. An algebraic 
element « is always a root of an irreducible 
polynomial over k which is uniquely deter- 
mined up to a constant factor (ek?) and is 
called the minimal polynomial of « over k. K is 
called an algebraic extension of k if all elements 
of K are algebraic over k; otherwise we call K 
a transcendental extension of k. If K, is an 
algebraic extension of K and K is an algebraic 
extension of k, then K, is also an algebraic 
extension of k. In an arbitrary extension field 
K of k, the set of all algebraic elements over 

k forms an algebraic extension field of k. A 
simple extension k(t) with a transcendental 
element t is isomorphic to the rational func- 
tion field of one variable with coefficient field 
k. If tis an algebraic element over k, k(t) is 
isomorphic to the tresidue class field of the 
polynomial ring k[ X] modulo the minimal 
polynomial f(X) of t over k. 


F. Finite Extensions 


An extension field K of a field k is called a 
finite extension if K has no infinite set of ele- 
ments that are tlinearly independent over k, 
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i.e., if K is a finite-dimensional linear space 
over k. The dimension of the linear space over 
k is called the degree of K over k and is de- 
noted by (K :k) (or [K : k]). If K is a finite 
extension of k and L is a finite extension of K, 
then L is also a finite extension of k and (L: K) 
(K:k)=(L:k). Every finite extension field of k 
is an algebraic extension of k and is obtained 
by adjoining a finite number of algebraic 
elements to k. Conversely, every field obtained 
by adjoining a finite number of algebraic 
elements to k is a finite extension of k. If K= 
k(a) with an algebraic element o, then (K : k) is 
equal to the degree of the minimal polynomial! 
of a over k, also called the degree of « over k. 
Every element of k(x) is expressed as a poly- 
nomial in x with coefficients in k. On the other 
hand, for any nonconstant polynomial f(X) of 
k[ X] there exists a simple extension k(x) such 
that « is a root of f(X). 


G. Normal Extensions 


An algebraic extension field K of a field k is 
called a normal extension of k if every irreduc- 
ible polynomial of k[ X ] which has a root in 
K can always be decomposed into a product 
of linear factors in K[X]. An extension field K 
of k is called a splitting field of a (nonconstant) 
polynomial f(X)ek[X] if f( X) can be decom- 
posed as a product of linear polynomials, i.e., 
f(X)=c(X —a,(X —a,)...(X —4,), cek, ae K. 
A splitting field K of f(X) (f(X)ek[X]) is 
called a minimal splitting field of f(X) if any 
proper subfield L of K (K 2 L5k)is nota 
splitting field of f(X). A minimal splitting field 
of f(X) 1s obtained by adjoining all the zero 
points of f(X). A finite extension field of k is a 
normal extension if and only if it is a minimal 
splitting field of a polynomial of k[ X]. For 
any given (nonconstant) polynomial f(X)e 
kX], there exists a minimal splitting field of 
f(X), and all minimal splitting fields of f(X) 
are k-isomorphic. 


H. Separable and Inseparable Extensions 


An algebraic element x over k is called a sepa- 
rable element or an inseparable element over k 
according as the minimal polynomial of x over 
k is tseparable or tinseparable. An algebraic 
extension K of k is called a separable extension 
of k if all the elements of K are separable over 
k; otherwise, K is called an inseparable exten- 
sion. An element « is separable with respect 

to k if and only if the minimal polynomial of 

æ over k has no double root in its splitting 
field. If ais inseparable, then k has nonzero 
characteristic p, and the minimal polyno- 

mial f(X) of « can be decomposed as f(X)— 
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(X —o4' (X —a3)" ...(X — am)”, r> 1, where 
941,00, ..., Dan are distinct roots of f(X) in its 
splitting field; between the degree n of f(X) 
and the number m of distinct roots of f(X), 
the relation n 2 mp' holds. In particular, if 
a?” ek for some r, we call 3 a purely inseparable 
element over k. An algebraic extension of 

k is called purely inseparable if all elements 

of the field are purely inseparable over k. In 
an algebraic extension K of k the set of all 
separable elements forms an intermediate 
field Ky of K/k. The field Kg is called the 
maximal separable extension of k in K. If K is 
inseparable over k. i.e., if K # Ko, then the 
characteristic of k is p #0, and K is purely in- 
separable over Ko. The degrees d=[K,:k] 
and p' - [K:K,] are denoted by [K:k], and 
[K :k];, respectively. A separable extension 
of a separable extension of k is also separable 
over k, and every finite separable extension 
of k is a simple extension. 

If no inseparable irreducible polynomial in 
kl X ] exists, we call k a perfect field; otherwise, 
an imperfect field. Every field of characteristic 
0 is a perfect field. A field of characteristic p 
(#0) is perfect if and only if for each aek the 
polynomial X? — a has a root in k. Every alge- 
braic extension of a perfect field is a sepa- 
rable extension and a perfect field. Any imper- 
fect field has an inseparable, in fact purely 
inseparable, proper extension. 


I. Algebraically Closed Fields 


If every nonconstant polynomial of k[ X] can 
be decomposed into a product of linear poly- 
nomials of k[ X ], or equivalently, if every 
irreducible polynomial of ET X ] is linear, k is 
called an algebraically closed field; & is alge- 
braically closed if and only if k has no alge- 
braic extension field other than k, and hence 
every algebraically closed field is perfect. 

For any given field k there exists an algebrai- 
cally closed algebraic extension field of k 
unique up to k-isomorphisms (E. Steinitz); 
hence we call such a field the algebraic closure 
of k. To proceed further, suppose that we are 
given a field k and its extension K. If there is 
no algebraic element of K over k outside of k, 
Le. if k is the intersection of K and the alge- 
braic closure of k, then we say that k is alge- 
braically closed in K. The complex number 
field is an algebraically closed field (C. F. 
Gauss's fundamental theorem of algebra; 

— 10 Algebraic Equations). 


J. Conjugates 


Let k be a field and K an algebraic extension 
of k. Two elements a, f of K are called conju- 
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gate over k if they are roots of the same irre- 
ducible polynomial of KI X] (or equivalently, 
if the minimal polynomials of a and f) with 
respect to k coincide), in this case we call the 
subfields k(a), k(f) conjugate fields over k. The 
conjugate fields k(x) and k() are k-isomorphic 
under an isomorphism ø such that c(x) — ff. In 
particular, if K is a normal extension of k, the 
number of conjugate elements of an element « 
of K is the number of distinct roots of the 
minimal polynomial f(X) of x, which is inde- 
pendent of the choice of a normal extension K 
containing k. The element « is separable if and 
only if the number of conjugate elements in K 
is the same as the degree of f(X). On the other 
hand, k(x) is normal over k if and only if k(a) 
coincides with all its conjugate fields. 

Let « be a separable algebraic element over 
k, and let à, =a, X2, ..., &, be conjugate ele- 
ments of a over k. The product A=a,4,...4, 
and sum B=a,+a,+...+«a, are elements of k. 
Indeed, if f(X) 2 X"-- c, X"!  ... c, is the 
minimal polynomial of x with respect to k, we 
have A=(—1)"c,, B= —c,, and A and B are 
called the norm and the trace of o, respectively, 
denoted by A= N(a), B = Tr(a). Let K bea 
finite separable extension of degree n over k, 
and let « be an element of K. Then the degree 
m of the minimal polynomial of a is a divisor 
of n; that is, n2 mr with a positive integer r. 
We define the norm and the trace of « with 
respect to K/k by Ngala) =N (ay, Trg,(x) = 
rTr(a), respectively. Then these quantities 
satisfy Ny, (xf) = Nu (X) Ny (B), Treal + B) — 
Trga (a) + Treal P) for o, Be K. (For the Galois 
theory of algebraic extensions — 172 Galois 
Theory.) 


K. Transcendental Extensions 


Let K be an extension of k and u,,...,u, be 
elements of K. An element v of K is said to 

be algebraically dependent on the elements 
U,,U>,...,uU, if v is algebraic over the field 
k(u,,u5, ..., u,). A subset S of K is called 
algebraically independent over k if no ue S is 
algebraically dependent on a finite number 

of elements of S different from u itself; S 1s 
called a transcendence basis of K over k if S is 
algebraically independent and K is algebraic 
over k(S). Furthermore, if K is separable over 
k(S), then S is called a separating transcendence 
basis of K over k. There always exists an alge- 
braically independent basis of K over k, and 
the tcardinal number of S depends only on K 
and k; this cardinal number is called the tran- 
scendence degree (or degree of transcendency) 
of K over k. (When S is an infinite set, we 
sometimes say that the transcendence degree is 
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infinite.) In particular, if K =k(S) with an 
algebraically independent S, K is called a 
purely transcendental extension of k. 

An extension K of k is called a separably 
generated extension, or simply a separable 
extension, if every finitely generated intermedi- 
ate field of K/k has a separating transcendence 
basis over k. If K itself has a separating tran- 
scendence basis over k, then K is separably 
generated, but not conversely. 

A purely transcendental extension field of k 
having a finite transcendence degree n is also 
called a rational function field in n variables 
over k, and a finite extension of such a rational 
function field is called an algebraic function 
field in n variables over k. 

Let K and L be extension fields of k, both 
contained in a common extension field. We 
say that K and L are linearly disjoint over k if 
every subset of K linearly independent over k 
is also linearly independent over L, or equiva- 
lently, if every subset of L linearly independent 
over k is also linearly independent over K. An 
algebraic function field K — k(x,, x5, ..., Xn) 
over k (whose transcendence degree is <n) is 
called a regular extension of k if K and the 
algebraic closure k of k are linearly disjoint. In 
order that K be regular over k it is necessary 
and sufficient that k be algebraically closed in 
K and that K be separably generated over k. 


L. Derivations 


A map D of a field K into itself is called a 
derivation of K if it satisfies D(a-- b) 2 D(a) + 
D(b) and D(ab) 2 aD(b) + bD(a) for all a, be K. 
The set of elements c of K for which D(c)- 

0 is a subfield. If the characteristic of K is 

p( #0), then D(x’)=0 for all xe K. Let k bea 
subfield of K. A derivation D of K is called a 
derivation over k if D(c)=0 for all cek; the 
totality of derivations over k is a tk-module. If 
K — k(x,, X5, ..., x,) is an algebraic function 
field over k, then the k-module of derivations 
over k has finite dimension s( <n) and we 

can choose s suitable elements u,,u,,...,u, of 
K such that K is separably algebraic over 
k(u, ,u5, ..., u,). Generally, the transcendence 
degree r of K over k does not exceed s, and K 
is separably generated over k if and only if 
r=s. 


M. Finite Fields 


Finite fields were first considered by E. Galois 
(1830), so they are also called Galois fields. 
There is no finite noncommutative field (Wed- 
derburn’s theorem, J. H. M. Wedderburn, 
Trans. Amer. Math. Soc., 6 (1905)). A simple 
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proof for this was given by E. Witt (Abh. Math. 


Sem. Univ. Hamburg, 8 (1931)). The character- 
istic of a finite field is a prime p, and the num- 
ber of elements of the field is a power of p. 
Conversely, for any given prime number p and 
natural number «, there exists a finite field 
with p? elements. Such a field is unique up to 
isomorphism, which we denote by GF(p?) or 
F,(q — p^). For any positive integer m, GF(p™*) 
is an extension field of GF(p?) of degree m and 
a *cyclic extension. Every element a of GF(p*) 
satisfies a?" =a; hence a has its pth root in 
GF(p?). Therefore every finite field is perfect. 
The multiplicative group of GF(p^) is a cyclic 
group of order p*— 1. 


N. Ordered Fields and Real Fields 


A field K is called an ordered field if there is 
given a ‘total order in K such that a b im- 
plies a - c» b c for all c and a» b, c» 0 im- 
plies ac bc. The characteristic of an ordered 
field is always 0. An element a of K is called a 
positive element or a negative element accord- 
ing as a0 or a «O0. For an element a of K the 
absolute value of a, denoted by lol is a or —a 
according as a 20 or a <0. If we define neigh- 
borhoods of a by the sets {x|a—e<x<a+e} 
with positive elements £, K becomes a tHaus- 
dorff space. If, for any two positive elements 

a, b of K, there exists a natural number n such 
that na b, then we call K an Archimedean 
ordered field. Two ordered fields are called 
similarly isomorphic if there exists an isomor- 
phism between them under which positive 
elements are always mapped to positive ele- 
ments. The rational number field and the 

real number field are examples of Archime- 
dean ordered fields, while every Archimedean 
ordered field is similarly isomorphic to a sub- 
field of the real number field. (For the struc- 
ture of non-Archimedean ordered fields, see 
[8].) A field k is called a formally real field (or 
simply real field) if —1 (1 is the unity element 
of k) cannot be expressed as a finite sum of 
squares of elements of k. The real number field 
is a model of formally real fields. More gener- 
ally, every ordered field is a formally real field. 
A formally real field is called a real closed field 
if no proper algebraic extension of it is a for- 
mally real field. The real number field is a real 
closed field. The algebraic closure of a real 
closed field is obtained by adjoining a root of 
the polynomial X? 4- 1. If a is a nonzero ele- 
ment of a real closed field, then either a or —a 
can be a square of an element of the field. 
Every real closed field can be made an ordered 
field in a unique way, namely, by defining 
squares of nonzero elements to be positive 
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elements. Since it is known that every formally 
real field is a subfield of a real closed field, it 
follows that every formally real field is an 
ordered field and therefore is of characteristic 
0. The problem of constructing an ordered 
field out of a formally real field is closely re- 
lated to the existence of valuations of a certain 
type [8]. The notion of formally real fields 
was introduced by E. Artin (Abh. Math. Sem. 
Univ. Hamburg, 5 (1927)). By making use of the 
theory of formally real fields, Artin succeeded 
in solving affirmatively fHilbert’s 17th prob- 
lem, which asked whether every positive 
definite rational expression (i.e., a rational 
expression with real coefficients that takes 
positive values for all real variables) can be 
expressed as a sum of squares of rational ex- 
pressions. More precisely, it was shown by A. 
Pfister that every positive definite function in 
R(X,,..., X,) is a sum of at most 2" squares. 
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A. History 


When a quantity w(x), such as velocity, is 
defined at every point x in a certain region of 
a space, we say that a field of the quantity v 
is given. This general concept is used in many 
branches of science. Here we confine ourselves 
to some branches of physics, in particular to 
the quantum theory of fields, which describes 
telementary particles. 

The ‘theories of elasticity and thydro- 
dynamics (in particular, concerning fEuler’s 


150 B 
Field Theory 


equation of motion) deal with displacement 
and velocity fields, respectively. However, a 
field in a vacuum (ether), which is quite differ- 
ent from a field in a space filled with matter, 
first became a subject of physics in telectro- 
magnetism. M. Faraday (1837) introduced the 
electromagnetic field and discovered its funda- 
mental laws, and J. C. Maxwell (1837) com- 
pleted the mathematical formulation. On the 
basis of this formalism, A. Einstein (1905) 
established the theory of trelativity and later 
developed the general theories of relativity and 
of gravity. 

Although quantum theory originated from 
the problem of blackbody radiation, the quan- 
tum theory of the electromagnetic field was 
first developed by P. A. M. Dirac (1927) after 
the development of tquantum mechanics. 
Along similar lines P. Jordan and E. P. Wigner 
(1927) quantized the matter wave (electron 
field), and W. Heisenberg and W. Pauli (1929) 
developed the quantum theory of wave fields 
in general. Subsequently, Jordan and Pauli 
(1927), Pauli (1939), S. Tomonaga (1943), J. 
Schwinger (1948), and others reformulated the 
theory in a relativistically covariant manner. 
Quantum electrodynamics, dealing with an 
electromagnetic field interacting with electrons 
(and positrons), has given excellent agreement 
with experimental measurements when for- 
mulated in this way. Divergence difficulties 
inherent to quantized field theory were by- 
passed by the renormalization procedure of 
Tomonaga, Schwinger, R. P. Feynman, and F. 
J. Dyson (1947). 

On the other hand, H. Yukawa (1934) ap- 
plied the concept of the quantized field to the 
interpretation of nuclear force and predicted 
the existence of z-mesons. Many kinds of new 
particles have since been found, including z- 
mesons and muons. The field theory of z- 
mesons can explain the qualitative features of 
the meson-nucleon system. Similar theories 
can be formulated for other types of unstable 
particles that have been found in cosmic rays 
since 1949. 

During the progress of meson theory, vari- 
ous types of fields were investigated, and a 
general theory of elementary particles was 
developed. Dirac (1936) proposed a general 
wave equation for elementary particles, and 
Pauli and M. Fierz (1939) proved the con- 
nection of spin and statistics (— the end of 
Section D). Schwinger (1951) derived quantum- 
mechanical equations of motion and commu- 
tation relations from a unified variational 
principle. For the cases where no interaction is 
present, a general theory of elementary par- 
ticles consistent with the requirements of rela- 
tivity and of quantum theory was established 
as the theory of free fields. 
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B. Relativistically Covariant Classical Fields 


Relativistically covariant fields f(x) on the 
level of classical theory are functions of the 
space-time point x with either real or com- 
plex values depending on the index o (which 
distinguishes different fields) [1]. A finite- 
dimensional representation D* of SL(2, C) (— 
258 Lorentz Group) on either a real or com- 
plex vector space is assigned to each o, and 
the index r of (x) refers to components in 
this representation space. Each model is 
specified in terms of a Lagrangian density 
L(x) that is a function (typically a polynomial) 
of F(x), 0, 97 (x) (ô, denoting 0/Ox", u — 0, 
...,3) and their complex conjugates. is 
taken to be invariant under the replacement 
of p(x) and ô p(x) by X, D*(A),, o7 (x) and 
2, ,D'(CA,,ACAY,0,0; (x) for all AeSLQ, C) 
(called the Lorentz invariance of Z) and pos- 
sibly under the replacement of f(x) and 
0,92(x) by Ep U of (x) and Ep U*^0,o/(x) 
(called an internal symmetry). 

The fields are supposed to satisfy the follow- 
ing partial differential equation, called the field 
equation, and are obtained as the tEuler equa- 
tion of the variational problem for the action 
integral I = f Z(x)dx: 


3 
6L(x)/0Q(x)— » (0/0x")(0-2^(x)/0(0" y (x) 
=0, 


3 
BL(xOGAX)— 3. (6/0x") (.L(x)/0(0" Le) 


A70 
=0 


(the bar indicates the complex conjugate), 
where the second equation is identical to the 
first for a real field. For complex fields, A(x) is 
chosen such that J =T (possibly except for a 
surface term) in order to ensure that the above 
two equations are complex conjugates of each 
other. 

The invariance of J under a Lie group of 
(local and/or volume-preserving point) trans- 
formations of fields implies a conservation law 
for a certain quantity (Noether’s theorem; — 
e.g., E. L. Hill, Rev. Mod. Phys., 23 (1951)). For 
translational invariance (x^ x" +a“), the 
energy-momentum (stress) tensor 


T" (x) 
= —ó,, Z (x) », 10 (x)/0(0,07 (x))0, Gr (X) 


t 0S (x)/0(, OF (x))0, 9p G9] 


(the complex conjugate terms in all equations 
to be suppressed for real fields) satisfies the 
differential conservation law 27-90, T^,(x) -0 
as a consequence of the field equations, which 
in turn implies the time-independence of the 
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following quantity provided that T*, (x) 
vanishes sufficiently fast at spatial infinity: 


= [rsen EISES (x? =t). 


These are the total field Hamiltonian (energy) 
and momentum. 

In a general situation, T> (=E, gT”; 
where g"" is the Minkowski metric tensor with 
the signature 1, —1, —1, —1) is not necessarily 
symmetric in u and v. F. J. Belinfante (Physica, 
6 (1939); 7 (1940)) has given a formula (by a 
change of (x) via so-called 4-divergence) for 
obtaining a symmetrized energy-momentum 
tensor 0". for which H and P, are equal to 
those defined from the above T", and for 
which the differential conservation law is 
satisfied. 

Lorentz invariance implies the conservation 
law 3,0, M""^(x) -«0 for the angular momen- 
tum density 


M" (x) =x T(x) — x^T'"(x) 


+ p (07 /0(0, 97 (X) SE" of (x) 


+ 0210(0, 07 (x)) S^ o7 ()) 
= x"O#4(x) —x*@**(x) 


and the time-independence of the total angular 
momentum 


iri | mdy (x =t), 


Kess SZ?" is antisymmetric in 4,4 v and D*(A),, 
dE Xu v Spe £,,/2 for A(A),,— 9,, + Euy with 

an ac iniaitesimal £,,. (We have (A(A)x)* = 

E,A(Ayx* and A(4),, = E, Gap A(AY^,.) 

A continuous one-parameter group U(p) 
of internal symmetries implies the conserva- 
tion law 3,0, J"(x) —0 for the 4-current den- 
sity (charge (u —0) and current (u= 1,2, 3) 
densities) 


J”(x)= A {0L/0(6,,02(x))A7? oF (x) 


+ OL /0(8, 0 (x)) A"P o (x)) 


and the time-independence of the charge 
[J?() ?x (x? =t), where U'(0)% =A". An 
example of U(p) is the multiplication by 
exp iQ* p (called a gauge transformation of the 
first kind), where Q* is an integer with Q^— 
for real fields o‘. 

Examples of Lagrangian densities for non- 
interacting fields (called free Lagrangian den- 
sities) are: 


real scalar field: 


(Z, g” 0, 9(x)0,9(x) - m? o(x)*)/2, 
complex scalar field: 


(2,,9" 6, P(x) 6, p(x) — m P(x) p(x), 
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EK EE — 0, V, CO yrs Ws (x) 
mal) 
(La = cht, 7" S 05) are fDirac's y- 
matrices), 


massive real vector field: 


-(1/4) » g" g^ (0, A(x) — 6; A Q9) 


KÁuv 


x (6, A (x) 0, A (x) Hey g" A, (xX) A (x), 
uv 

where D* is trivial for scalar fields, [1,0] ® 
[0, 1] for the Dirac field, and [ 1, 1] for vector 
fields. 

For interacting fields, some interaction parts 
are added to the sum of such free Lagrangian 
densities of relevant fields. Some examples are: 


P(o)(or gq^) interaction: 

P(o(x)) (or ge(x)*), 
Yukawa-type interaction: 

g Bours V XY yr rs (x) A, (x), 
Fermi interactions: 

g Yu. Gu (E, WONO) ` 

x (Lys Vp Gore. vf (x)) 

(O* = 1 (scalar, no index k), y” (vector, Gy = Juw) 
y*y” (tensor, Gyw Ce guy), Y^ 7" (pseudovector, 
Gum un, 27 =iy?y' yy), y? (pseudoscalar, no 
index k). 


C. Heuristic Theory of Quantized Fields 


For free fields, a quantization procedure 
similar to the usual quantum mechanics (— 
351 Quantum Mechanics; 377 Second Quan- 
tization) leads to the following type of canon- 
ical commutation or anticommutation rela- 
tions among fields (and their first time deriva- 
tives) at time 0; for example, 


real scalar field: 

Lem, x), $0, y)] - — ió*(x — y), 

Lou, x), oi, y)] - 2 L6(0, x), o(0, y)] - =0, 
Dirac field: 

[V (0, x), V4,(0, y] =5,,0°(x — y), 

[y,(0, x), V, (0. y)].. =LW,(0, x), v. (0, y)] + =0. 
([A, B], = AB + BA, 6,,=0 for r#s and 6,,= 1, 
ô? (x —y) = IZ. äich — y*).) The free-field equa- 
tions then lead to the following 4-dimensional 
commutation relations: 


real scalar field: 

[0(x), o(y)] - = — iA (x — y). 
Dirac field: 

[V (x). Vy y). AS 740, 
massive real vector field: 

[A,(x), AQ) ] - = (gu, +m ?0,0) A, (x — y). 


Here A,, is the finvariant distribution. There is 
a unique representation of such relations for a 


—im By) A, (X SH y) 
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field (called the Fock representation) with a 
vector Q (called the free vacuum vector) which 
is annihilated by f o(x)f(x)dx, by f(x) f(x) dx 
and f ,(x)f(x)dx, or by f A, (x) f(x) dx when- 
ever [e'?*f(x)dx «0 for p? >0 (here, p: x= 

È g,, p" x"), and which is cyclic (i.e., polynomials 
of (smeared-out) fields generate a dense subset 
of Q). The free fields in the Fock represen- 
tation satisfy the Wightman axioms (— Sec- 
tion D). 

In the Fock representation of canonical 
fields a translationally invariant vector must 
be a free vacuum vector, up to multiplication 
by a complex number. In order to construct a 
model of a translationally invariant interaction 
among canonical fields with a unique vector of 
minimal energy (a vector which is called the 
true or interacting vacuum and which must 
be translationally invariant if unique), one 
must look for some other suitable representa- 
tion. Such a no-go theorem is called Haag's 
theorem. 

The earliest formulation of interacting quan- 
tized fields was developed heuristically by 
formal manipulation in the Fock representa- 
tion, described in textbooks of quantum field 
theory [2-6]. It can be mathematically justi- 
fied if the so-called cutoff is introduced by 
limiting the space to a finite volume, possibly 
changing the space into a lattice and smooth- 
ing out fields (effectively cutting off the high- 
energy part of the interaction) (A. M. Jaffe, O. 
E. Lanford III, and A. S. Wightman, Comm. 
Math. Phys., 15 (1969)). The full theory is then 
expected to be obtained by taking a limit of 
the true vacuum expectation values as various 
cutoff parameters are removed. This is the aim 
of constructive field theory (— Section F), 
which has been achieved for some space-time 
models of dimension 2 and 3. 

In quantum field theory, the S-matrix is 
given in terms of (the mass-shell restriction of 
the Fourier transform of) the vacuum expec- 
tation value of the time-ordered product of 
fields, called t-functions (— Section E). In the 
heuristic approach, it is given in terms of the 
following Gell-Mann- Low formula (its imagi- 
nary time version being mathematically used 
in constructive field theory): If the field g(x) — 
e oa. x))e "(RS with H =H, +H, (Hp is 
the free Hamiltonian as the generator of the 
time translation for the free field g(x) and H, 
is the interaction Hamiltonian, such as P(q,)) 
and x? >... >x, then 


(Q, SIE ch 9(x,)Q) 


= lim (e! TO, p(x)... p(x,)e FTO) 


gc 


where Q, is the free vacuum. The numerator 
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can be written as 
(Qo, U(T, XD osx) U(x?, Ke axe 
Qo(x,) Ee: T)Qo) 


in terms of U (t, s) =e" "oe 10 79H o7 isHo and the 
canonical field o, at time 0. The covariant 
perturbation series due to Tomonaga, Schwin- 
ger, Feynman, and Dyson is obtained by sub- 
stituting the following expansion, which is the 
iteration of the Duhamel formula: 


xy ca Una Hu) 


Here, H,(t) 2 e" eH,e "io Each term can be 
represented by connected tFeynman diagrams 
(the denominator canceling out all discon- 
nected graphs) and computed according to the 
*Feynman rule, yielding tFeynman integrals. 
Each expression so obtained (formally) may 
be a divergent integral (in the absence of the 
cutoff), in which case one tries to cancel it out 
by modifying the original Hamiltonian with 
the alteration of parameters such as mass 

and coupling constant (by an amount called 
the renormalization constant, which is diver- 
gent in the absence of the cutoff) or possibly 
by the addition of terms again involving 
renormalization constants. If this can be 
achieved in terms of a finite number of renor- 
malization constants, the model or the Hamil- 
tonian ts called renormalizable [7]. If the 
divergent integral appears only in a finite 
number of graphs (counting the same sub- 
graph of an infinite number of different dia- 
grams as one graph), the model is called super- 
renormalizable. 


D. Axiomatic Quantum Field Theory 


Relativistically covariant fields ọ,(x) on a Hil- 
bert space # are called Wightman fields if the 
following four Wightman axioms are fulfilled 
[8-10]: 

(1) The fields o, (x) are operator-valued distri- 
butions: For each C”-function f of rapid de- 
crease on the Minkowski space (f € F(R), 


_@,(f) is an operator defined on a common 


domain D dense in # and satisfying o,( f)Dc 
D (the domain of (f )* contains D), o,( f)*|D 
= p;( f) for another index & and (¥, o,( f) b) 
for any V^, and Ẹ in D is linear and continuous 
in f relative to the topology of the Schwartz 
space Y(R*). 

(2) Relativistic covariance: There exists a 
continuous unitary representation U (a, A) 
(ae R* and AeSL(, CH of the universal cover- 
ing group Ž! of the frestricted inhomogeneous 
Lorentz group 28. on # such that U(a, A)D c 
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D and 


U (a, A)Pa( f) U (a, A)* = H D(A)g Pp fa, 4). 


where f, 4(x)  f(A(A) ! (x —a)) and (D(A),,) is 
a finite-dimensional representation of SL(2, C). 
The index a usually consists of tundotted and 
dotted indices, interchanged in x, and of other 
indices, interchanged among them in a.) 

(3) Locality: If the supports of f and g are 
mutually spacelike, then 


Pal I) Pp(9) = €, P) Pl) p. Cf) 


on D, where e(a, f) - +1. If e(a, B) = —1 exactly 
when D(A),, — D(A)55— —1 for A= —1 (i.e., 
when both p, and o; are Fermi fields), they 
are said to satisfy the normal commutation 
relations. 

(4) Spectrum conditions: Let U(a, 1) 2 e/^?. 
The joint spectrum of P" (5 —0,1,2,3) is in the 
forward cone V, 2 (peR*|p:pz0, p? 20}, 
with a point spectrum of multiplicity 1 at p — 0. 
A vector © belonging to the point spectrum 0 
of P" is called the true (or interacting) vacuum 
and is required to be in D. 

Usually D is taken to be minimal, namely, D 
is the linear hull of Q and o, (fi)... 9, (f£,)O 
with all possible n, o, ...,9,, fi, ..., f,. By means 
of the nuclear theorem, it is possible to define 
a linear operator o. (f) for f(x,, ..., Xp) in 
4 (R^"), linear and continuous in f on D such 
that it coincides with o, ( fi) ... Pa, (Jn) if f(x) 
= [ Ij, tech If such operators are introduced, 
then the linear hull of Q and o. , (£)O is 
taken to be D. 

Under the foregoing axiom, the vacuum 
expectation values of the products of field 
operators define tempered distributions W,(x) 
—W, Leien Xn} called Wightman functions, 
such that 


WATT fi) = (Q, Pa, Si) Pana) Q). 

The notion of a connected graph in pertur- 
bation theory corresponds to the following 
notion of truncated Wightman functions W7: 


WT EA in 10 Y T] Wildy, 


IN k=1 


WAD= > dT] WZüg. 

m {Ty} k=1 
where J indicates a set of variables x;e M, jel, 
{I,} is a partition of I into m subsets, with the 
order of the x's in each I, remaining the same 
as in I, and the sum over (1,] extending over 
all possible partitions. If the spectrum of ( Pr) 
in Q^ is contained in V" (pe M |p: p2 m?, 
p? 2 m) for some m 0 (the mass gap), then 
WI has an exponential clustering property at 
spatial infinity. For example W/ (x, —- a,, ..., 
x, -- a,)e" H 50 as a distribution in x if m' <m, 
aj =0 for all j and R 2 max |a; — a,| oc. 
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If the mass operator (P - P)"? has an iso- 
lated point spectrum at m with the eigenspace 
JF, (m), then there are sufficiently many n, «= 
(a,,...,%,), and f(x,, ..., x,) such that p, f)QE 
Jf, (m), and the linear hull of such vectors are 
dense in #,(m). In terms of the f; satisfying 
Papl f)) Qe 2€, (mj) (some of the ms may coin- 
cide) and the solutions g(x) = [gi(p)exp i(px — 
co,(p)x°)d* p, co(p) — (p? + m7)", of the Klein- 
Gordon equation, then the following limits, 
called out and in states, exist: 


out 


yee (hy very h,)= lim Q(t, gı) Q(t, Gn) Q, 


Q(t, gj) = [out + (t, x), TS Xn; T (t, x)) 


XP Teac Self, x)dx, ... dx, dx, 
h,- Qn)! J (P), (Die X (m). 
It defines the S-matrix elements as follows: 
S(h, ... hi hs... RJ 
=(W"(h, .. Rh), (h... KAL 


This is called Haag-Ruelle scattering theory, 
and the existence proof is based on the cluster- 
ing property of MI and an asymptotic esti- 
mate for the behavior of the g's for large t 
and all x (R. Haag, Phys. Rev., 112 (1958); D. 
Ruelle, Helv. Phys. Acta, 35 (1962)). The out 
and in states can be interpreted in the limit of 
infinite future and past as the state where n 
particles are moving at velocities v; related to 
the spectrum p" of P" through the relation p? — 
mj/(1 —v?) ^, po m;vj/(1 —v;)'? (with probabil- 
ity amplitude proportional to h;(p)) (H. Araki 
and R. Haag, Comm. Math. Phys., 4 (1967)). 
Since 


Qo" (h, . AL eat, KA 


= Onn’ 2 II (hj, hp), 
P j=1 


where the sum is over all permutations P, VP?" 
can be viewed as a unitary mapping from the 
"Fock space J£? over LE a, (m) to the closed 
subspace #°™ spanned by WV?"'(h, ...h,), n= 
0, 1,2,... (Q for n=0, h, itself for n=1). The 
free fields on 2£?"' and 2€?" are called (out and 
in) asymptotic fields. Likewise, V" is a unitary 
mapping from #° to sis If 4^" =H, then 
the matrix element S can be viewed as a ma- 
trix element of a unitary operator on #°, 
called an S-matrix. Its unitary transform by 
Yt and by Y™ coincide and define a unitary 
operator, sometimes called an S-operator, on 
Jf" = f™. The equality H = H = H™ is 
called the completeness of the scattering states 
or asymptotic completeness. 

If the scattering states are complete, then 
the following LSZ asymptotic condition, due 
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to Lehmann, Symanzik, and Zimmermann 
(Nuovo Cimento, 1 (1955)), holds: 


weak- lim Qt, g)= gll 


in 


where a*(h) is the tcreation operator (a*(h)- 
Wh, ...h,) = (hh, ... h,) for either out or 

in states), Papl QEA (mj) is not required, 
and (Q, Papl f)Q)=0 is assumed instead. This 
leads to an explicit expression for the S-matrix 
elements in terms of t-functions, which takes 
the following simple form if all particles have 
spin 0, all fields o, are scalar (D(A), =6,,), and 
all n; can be taken to be 1. First define the 
connected part S.(h, ... h,; h', ... hh) from S by 
exactly the same equation as truncated Wight- 
man functions with S to be set to 0 if n—0, 1 
or n' —0, 1 except when n2 n' =1 (S(h, h) = 

(h, h^)). Then 


S.(hy EX hy; his Ké h,) 


-| (fr (Dj: pj— vil GE 


Pitts sees ail Il (—iZ; "^ h(pj dQ,(p,)), 


j=l 


where p? =(p; +m?)'? (the set of such p is 
called the positive mass shell; there the vanish- 
ing factors (p; pj — m?) cancel the poles of 27), 
dQ, „(p)i is the Lorentz invariant measure 
EIST mj )d* p) on the posi- 
tive mass shell, he J£, (m) is represented by a 
measurable function h(p) with the inner prod- 
uct (h, h^) — [h(p)h'(p) dO, (p), Z; is defined by 
(h, pa Cf)0) = ZO, f) for all hem) and 
fp Ser P Fei * f(x) dx, and 17 is the 
Fourier transform of the truncated time- 
ordered function (:-function) defined by 


T pu | Son. ien È »x) 
Zei 


xT] Gi ??^4*x;, 


j=1 


Ty (x) z^ GEN Cp) We. (x) (x = (X, 4 ens Xn))s 


We (x) =(Q, Papp En pe Papin Xr) 2)" » 


in which the sum is over all permutations P, T 
indicates the truncated Wightman functions, 
0(x°; Cp) is the characteristic function of the 
CONE x? > X9 2 --- > Xp (the time ordering) 
possibly smeared out by convolution with a 
C?-function of compact support (so that it can 
multiply a distribution), where the formula 
does not actually depend on the smearing 
functions. (Usually fields o, are normalized so 
that Z; — 1.) 

The r-functions (O. Steinmann, Helv. Phys. 
Acta, 33 (1960); D. Ruelle, Nuovo Cimento, 19 
(1961); H. Araki, J. Math. Phys., 2 (1961)) are 
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defined by 
r(x)= > 0(x*; Cp/C;) We a(x) 
P 


(- 2,8665 Gell 


(often with an additional factor (—i)"~!), where 
the Fourier-Laplace transform 6(q°; C ») of 
0(x9; C,) is a rational function and 0(x?; C,/Ci) 
is the inverse Fourier transform of its bound- 
ary value as Im al eC, tending to 0, and the 
cone C; and hence r; is specified by a consistent 
choice of signs of ©, Im q? for all nonempty 
proper subsets 7 of (1, ..., n). The Fourier 
transform of r,(x) is a boundary value of an 
analytic function common to all r; and coin- 
cides with £/ for p? e C;. Making use of this 
relation, some analyticity properties of the S- 
matrix (including TCP symmetry) have been 
proved ([11, 12]; J. Bros, H. Epstein, and 

V. Glaser, Comm. Math. Phys., 1 (1965); 
Epstein, Glaser, and A. Martin, Comm. Math. 
Phys., 13 (1969); Epstein, J. Math. Phys., 8 
(1967)). 

The two-point function for scalar fields has 
the following simple expression in terms of 
"invariant distributions, sometimes called 
the Kállen-Lehmann representation with the 
Kállen-Lehmann weight p: 


(Q, o(x)o(y)Q) = | A,(x — y) dp(x?). 


Under the Wightman axiom (with the 
normal commutation relations), there exists 
an antiunitary operator ©, called the TCP 
operator, which satisfies the relations 


@Q=2, Ula, A\@~!=U(—a, A), 
= U(0, —1), 
Oo,(x)O'! =(—1)"(— i" o; =X); 


where y is the number of undotted indices in o, 
N=0Oif D(A),, = 1 for A= —1 (Bose fields), and 
N =1 if D(A,,— —1 for A= —1 (Fermi fields). 
The choice of +1 for (o, x) in the locality 
axiom cannot be opposite to the normal com- 
mutation relations for any a except for the 
trivial case ~,(x)=0 ([8-10]; G. Lüders and B. 
Zumino, Phys. Rev., 110 (1958); N. Burgoyne, 
Nuovo Cimento, 8 (1958)). This is called the 
connection of spin and statistics. For a general 
choice of +1 for e(a, f), the Wightman axioms 
imply existence of a certain number of even- 
oddness conservation laws (i.e., the existence of 
a unitary representation u of a group (Z,)' for 
some l such that u(g)O —O, u(g)o,( f )u(g)* = 
Y«(g) o. (f) for some characters y, on the group) 
so that the Klein transforms $,(x) — u(g,) o, (x) 
(for some choice of g,) of the original fields 
satisfy the Wightman axioms with the normal 
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commutation relations ([8—10]; H. Araki, 
J. Math. Phys., 2 (1961)). 


E. Theory of Local Observables 


Except for the technical assumption of 
operator-valued distributions, physical con- 
tents of the Wightman axioms have been for- 
mulated in terms of the von Neumann algebra 
(0) for bounded open sets €, generated by 
those observables which can be measured in 
the space-time region 0, as follows: 

(1) Isotony: If €, > €», then .»7(00,) 2 (0, ). 

(2) Covariance: U(a, A).o(@)U (a, A)* = 
A(A(A)O +a). 

(3) Locality: If Q;e «Z(0) and €, is spacelike to 
0, then [Q;, Q;] —0. (No signal can propa- 
gate faster than the speed of light.) 

In addition, the spectrum condition is as 
before (the stability of the vacuum) and, since 
we restrict our attention to the closed span of 
A(0)Q for all ©, Q is assumed to be cyclic for 
(Jos (€). (Then the latter is irreducible.) By 
treating Q(x) - U(x, 1)QU (x, 1)* for deg 
as a (noncovariant but localized) field, Haag- 
Ruelle scattering theory and the analyticity 
properties of the S-matrix described above for 
Wightman fields hold in exactly the same way 
in the theory of local observables. The notion 
of an algebra of local observables has been 
a concern of R. Haag since the late 1950s, 
with the consequent analogy to Wightman 
fields being demonstrated by Araki in his 
Zürich lectures of 1961—1962. Hence the 
above axioms are sometimes called Haag- 
Araki axioms. 

With the help of the additional axiom 
((€,)U ot = S(O, U 02), called the addi- 
tivity, the vacuum vector Q is cyclic and sep- 
arating for (0) for any bounded open 0 
(Reeh-Schlieder theorem) and .«Z(0) — (Ô), 
where Ó is the double cone [x ||x?| - |x| <L} if 
O={x||x°|+|x|<L, |x|<e} for any £0, for 
example (Borchers theorem). 

A merit of the Haag-Araki axioms is that 
these axioms have direct physical interpre- 
tation. In particular they always imply the 
commutativity at spacelike separation of sup- 
ports, in contrast to the anticommutativity for 
Fermi fields. This then necessitates the con- 
sideration of the representations associated 
with some other states, such as states with an 
odd number of fermions and representations of 
the C*-algebra d of quasilocal observables 
(generated by all ./(()), which are nonequiva- 
lent to the vacuum representation and are 
called superselection sectors. This viewpoint 
was introduced by Haag and D. Kastler (J. 
Math. Phys., 5 (1964)) and the C*-algebra 
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version of the Haag-Araki axioms ts called the 
Haag-Kastler axioms. 

If ©’ denotes the causal complement of Ó 
(e, the set of all points spacelike to ©), then 
the assumption ./(@) =. (C) is called duality 
and is proved for a certain type of region, 
which includes the double cone in the case of 
free fields. With the assumption of the duality 
for double cones in the vacuum sector, S. 
Doplicher, Haag, and J. Roberts (Comm. Math. 
Phys., 13 and 15 (1969); 24 (1971); 35 (1974)) 
succeeded in the analysis of superselection 
sectors and clarified the connection of spin and 
statistics in a much more satisfactory fashion, 
as well as the anticommutativity of intertwin- 
ing operators for superselection sectors for 
Fermi statistics. 


F. Constructive Field Theory 


An effort to make mathematical sense out of 
the heuristic theory of quantized fields and to 
produce examples of Wightman fields and the 
associated system of local observables has 
been pushed forward by J. Glimm and A. Jaffe 
since the mid 1960s and is known as construc- 
tive field theory. Since 1972, the Euclidean 
methods, already known in some sense, have 
become extremely powerful central tools, are 
collectively known as Euclidean field theory 
[13-16]. 

The Wightman function W(z, ...z,) is ana- 
lytic at the Schwinger points z;— (ix? x,) (xe R^) 
if x; x, for j #k, and its value S(x; ... x,)— 
W(z, ...z,) is called the Schwinger function. 
The axioms for Schwinger functions equiv- 
alent to Wightman axioms are known as 
Osterwalder-Schrader axioms (Comm. Math. 
Phys., 31 (1973); 42 (1975)). The positivity 
axiom reflecting the positive definite metric 
of the Hilbert space for Wightman fields is 
known as O-S positivity (or T -positivity or 
reflection positivity). 

Since the Schwinger function is symmetric 
in its variables, it can be viewed as the expec- 
tation value of the product of (commuting) 
random fields, called Euclidean fields, if an 
additional positivity holds. This idea was put 
forward by K. Symanzik in the 1960s. E. Nel- 
son then realized that Euclidean fields for free 
fields have the Markov property, and he devel- 
oped Euclidean Markov field theory. A work 
of Guerra in 1972 utilizing Nelson symmetry 
revealed the extreme power of this approach, 
and the whole of constructive field theory has 
been studied in Euclidean formulation with 
remarkable results for super-renormalizable 
models in space-time of dimension 2 and 3. 
The key point is the Feynman-Kac-Nelson 
formula, which expresses Schwinger functions 
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as functional integrals and reveals a mathe- 
matical connection between Euclidean field 
theory and classical statistical mechanics. 


G. Gauge Theory 


*Electrodynamics in terms of the 4-vector 
potential A, (x) is invariant under the local 
gauge transformations A, (x) A,(x) 4 €, A(x) 
(and the associated transformation of the 
charged fields) for A satisfying the wave equa- 
tion JA =0. This leads on one hand to com- 
plication in the canonical quantization of the 
fields A, (x) and, on the other hand, to the 
necessity of an indefinite inner-product space 
as exemplified by the Gupta-Bleuler formalism 
of quantum electrodynamics. In such a formal- 
ism, the physical Hilbert space (with positive 
definite metric) is introduced by considering 

a specific subspace (physical subspace) of a 
semidefinite metric and taking the quotient by 
its null subspace. 

The corresponding theory with a non- 
commutative gauge group is known as the 
theory of Yang-Mills fields. In order to restore 
the formal unitarity of the S-matrix in the 
perturbation series in terms of Feynman dia- 
grams, L. D. Faddeev and V. N. Popov (Phys. 
Lett., 25B (1967)) introduced fictitious particles, 
called Faddeev-Popov ghosts. T. Kugo and I. 
Ojima (Prog. Theoret. Phys., Suppl., 66 (1979)) 
developed a canonical quantization scheme for 
the Yang-Mills field which naturally intro- 
duces additional fields corresponding to the 
Faddeev-Popov ghosts and specifies the phy- 
sical subspace by means of the simple condi- 
tion that it be the kernel of the generator of 
BRS transformations, earlier introduced by C. 
Becchi, A. Rouet, and R. Stora. 

Gauge theory can be formulated on a lattice 
of finite volume as a kind of classical statistical 
mechanics. This is known as lattice gauge 
theory, and the important issue currently being 
investigated is whether or not its limit, as the 
lattice interval tends to 0 (the continuum limit) 
and the volume tends to infinity, produces a 
nontrivial quantum theory of gauge fields. 
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A. The Number of Finite Groups of a Given 
Order 


A group is called a finite group if its order is 
finite (C^ 190 Groups). Since the early years of 
the theory of finite groups, a major problem 
has been to find the number of distinct isomor- 
phism classes of groups having a given order. 
It is almost impossible, however, to find a 
genera] solution to the problem unless the 
values of n are restricted to a (small) subset of 
the natural numbers. Let f (n) denote the num- 
ber of isomorphism classes of finite groups of 
order n. If p is a prime number, then f(p)=1 
and any group of prime order is a cyclic group. 
If p is prime, then any group of order p? is an 
tAbelian group and f(p?) - 2. If p and q are 
distinct primes and p q, then f(pq)=2 or 1 
according as p is congruent to 1 modulo q or 
not. If pz 1 (mod q), there is a non-Abelian 
group of order pq as well as a cyclic group of 
order pq. For small n, the value of f (n) is as 
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follows: 


n 8 12 16 18 20 24 27 28 30 32 60 
mai a 514 5 515 5 4 451 13 


For any n, f (n) z 1. When p is prime, f(p") is 
known for mx 6: f (p?) 5, f(p*) - 15 if p» 2. 
For f(p5) see O. Schreier, Abh. Math. Sem. 
Univ. Hamburg, 4 (1926). For f (25) see [11]. 
Set f( p") o pl and l= Am?. Then A—2/27 as 
m oo (G. Higman, Proc. London Math. Soc., 
10 (1960); C. C. Sims, Symposium on Group 
Theory, Harvard, 1963). 


B. Fundamental Theorems on Finite Groups 


The following are some fundamental theorems 
useful in studying finite groups. 

(1) The order of any subgroup of a finite 
group G divides the order of G (J. L. La- 
grange). The converse is not necessarily true. If 
a finite group G contains a subgroup of order 
n for any divisor n of the order of G, then G is 
a tsolvable group. Furthermore, if G contains 
a unique subgroup of order n for each divisor 
n of the order of G, then G is a cyclic group. 

Let p be a prime number. Let the order of a 
finite group G be p"m, where m is not divisible 
by p. A subgroup of order p" of G is called a p- 
Sylow subgroup (or simply a Sylow subgroup) 
of G. The importance of this concept may be 
seen from the next theorem. 

(2) A finite group contains a p-Sylow sub- 
group for any prime divisor p of the order of 
the group. Furthermore, p-Sylow subgroups 
are conjugate to each other. The number of 
distinct p-Sylow subgroups is congruent to 1 
modulo p. In general, the number of distinct p- 
Sylow subgroups of G that contain a given 
subgroup whose order is a power of p is con- 
gruent to 1 modulo p (Sylow's theorems). 

(3) A p-group is a fnilpotent group (a finite 
group is called a p-group if its order is a power 
of p). Thus any finite p-group G of order >1 
contains a nonidentity element in the tcenter 
of G. Furthermore, any proper subgroup of G 
is different from its tnormalizer. A paper by P. 
Hall (Proc. London Math. Soc., (2) 36 (1933)) is 
a classic and fundamental work on p-groups. 

A group G of order 8 with 2 generators o, t 
and relations o^ —1, rer) 1 g?—4? is 
called the quaternion group. This group is 
isomorphic to the multiplicative group consist- 
ing of {+1, +i, +j, +k} in the tquaternion 
field. A generalized quaternion group is a group 
of order 2" with 2 generators c, t and rela- 
tions oi =1, tot! Lasel A non- 
Abelian group all of whose subgroups are 
normal subgroups is called a Hamilton group. 
A Hamilton group is the direct product of a 
quaternion group, an Abelian group of odd 


=o 


=o 
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order, and an Abelian group of exponent 2 
(i.e., p? — 1 for each element p). 

Let G be a finite group, and let G;— G2 G, 
2... 2G, — (1) be a tcomposition series of G. 
The set of isomorphism classes of the simple 
groups G;..,/G;, i— 1, 2, ...,r, is uniquely deter- 
mined (up to arrangement) by the tJordan- 
Holder theorem. Thus the two most funda- 
mental problems of finite groups are (i) the 
study of the simple groups and (ii) the study of 
a group with a given set of composition fac- 
tors. The first is one of the leading problems of 
the theory, although it has been in a state of 
stagnation until rather recently (— Section J). 
As to the second problem, initial works by H. 
Wielandt and others are under way (partic- 
ularly in the direction of various generaliza- 
tions of Sylow's theorems). For the class of 
finite solvable groups, the first problem has a 
rather trivial solution; only the second prob- 
lem is important, and even in this case the 
theory seems to leave something to be desired. 


C. Finite Nilpotent Groups 


A finite group is nilpotent if and only if it is 
the "direct product of its p-Sylow subgroups, 
where p ranges over all the prime divisors of 
the order. Any maximal subgroup of a nilpo- 
tent group is normal. The converse holds for 
finite groups; that is, a finite group is nilpotent 
if and only if all its maximal subgroups are 
normal. 


D. Finite Solvable Groups 


One of the most profound results on finite 
groups, an affirmative answer to the long- 
standing Burnside conjecture, is the Feit- 
Thompson theorem (Pacific J. Math., 13 
(1963)): A finite group of odd order is solvable. 
The index of a maximal subgroup of a finite 
solvable group is a power of a prime number 
(E. Galois). But the converse is not true. The 
unique simple group of order 168 has the 
property that all maximal subgroups are of 
prime power index. A finite solvable group 
contains a self-normalizing nilpotent subgroup 
(i.e., a nilpotent subgroup H such that Ng(H) 
=H), and any two such subgroups are conju- 
gate (R. W. Carter, Math. Z., 75 (1960); cf. W. 
Gaschütz, Math. Z., 80 (1963), for a generali- 
zation). Such a subgroup is called a Carter 
subgroup and is an analog of a Cartan subal- 
gebra of a Lie algebra. But unlike Cartan 
subalgebras, most simple groups do not con- 
tain any self-normalizing nilpotent subgroups. 
A finite solvable group of order mn (m, n are 
relatively prime) contains a subgroup of order 
m; two subgroups of order m are conjugate; if l 
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is a divisor of m, then any subgroup of order / 


is contained in a subgroup of order m (P. Hail). 


The converse of the first part of this theorem is 
also true: A finite group is solvable if it con- 
tains a subgroup of order m for any decompo- 
sition of the order in the form mn, (m, n) 2 1 (P. 
Hall’s solvability criterion). This generalizes 
the famous Burnside theorem asserting the 
solvability of a group of order p*q?, where 
both p and q are prime numbers. If the se- 
quence of the quotient groups of a tprincipal 
series of a finite group G consists of cyclic 
groups, then the group G is called supersolv- 
able. A finite group is supersolvable if and 
only if the index of any maximal subgroup is a 
prime number (B. Huppert, Math. Z, 60 (1954)). 
If p is the largest prime divisor of the order of 
a finite supersolvable group G, then a p-Sylow 
subgroup of G is a normal subgroup. 


E. Hall Subgroups 


A subgroup is called a Hall subgroup if its 
order is relatively prime to its index (see the 
theorems of P. Hall on finite solvable groups). 
There is no general theorem known on the 
existence of a Hall subgroup. If a finite group 
G has a normal Hall subgroup N, then G con- 
tains a Hall subgroup H that is a complement 
of N in the sense that G= NH and NOH-—1; 
furthermore, any two complements are conju- 
gate (Schur-Zassenhaus theorem). The analog 
of Hall's theorem on finite solvable groups 
fails for nonsolvable groups. But if a finite 
group, solvable or not, contains a nilpotent 
Hall subgroup H of order n, then any sub- 
group of an order dividing n is conjugate to a 
subgroup of H (Wielandt, Math. Z., 60 (1954); 
cf. P. Hall, Proc. London Math. Soc., 4 (1954), 
for a generalization). There are some gener- 
alizations of these results for maximal 7- 
subgroups which may not be Hall subgroups. 


F. z-Solvable Groups 


Let z be a set of prime numbers. Denote the 
set of prime numbers not in z by z'. A finite 
group is called a z-group if all the prime divi- 
sors of the order belong to z. A finite group is 
called z-solvable if any composition factor is 
either a z'-group or a solvable z-group. If z= 
{p} consists of a single prime number p, we 
use terms such as p-group or p-solvable (in- 
stead of { p}-solvable). Let G be a 1-solvable 
group. A series of subgroups P- 1c Ny 
P,E£N; € ... EP, N,—G defined by the prop- 
erties that P,/N;_, is the maximal normal z- 
subgroup of G/N; , and N;/P; is the maximal 
normal z'-subgroup of G/P, is called the n- 
series of G, and the integer / is called the z- 
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length of G. A solvable group is z-solvable for 
any set x of prime numbers. A z-solvable 
group contains a Hall subgroup which is a z- 
group and also a Hall subgroup which is a z'- 
group; an analog of Hall's theorem on finite 
solvable groups holds. Hall and Higman (Proc. 
London Math. Soc., 7 (1956)) discovered deep 
relations between the p-length of a p-solvable 
group and invariants of its p-Sylow subgroup. 
For example, the p-length is 1 if a p-Sylow 
subgroup is Abelian. 


G. Permutation Groups 


The set of all permutations on a set Q of n 
elements forms a group of order n! whose 
structure depends only on n. This group is 
called the symmetric group of degree n, de- 
noted by S,. Any subgroup of S, is a permuta- 
tion group of degree n. When it is necessary to 
mention the set on which permutations 
operate, $, may be denoted as S(Q), and a 
subgroup of S(Q) is called a permutation 
group on Q. The set of n elements on which S, 
operates is usually assumed to be (1,2, ..., nj, 
and an element o of S, is written as 


LZ ccm 
o= : 
1^ WE oue wr 


; afl 22a Ss 
exc , 
ual 3. TEX 


where i’ is the image of i by o: i' = a(i). The 
element o may be written as (...)(abc...z)(...), 
which means that c cyclically maps a into b, b 
into c, and so on, and finally z back into a. In 
the example, o =(1 2 3)(4 5)(6). It is customary 
to omit the cycle with only one letter in it, 
such as (6) in the example. With this conven- 
tion, o —(123)(4 5) may be an element of S, for 
any n È 5, leaving all the letters i>6 invariant. 
A cycle of length / is an element c of S, which 
moves / letters a,, ...,a, cyclically and fixes 

all the rest; i.e., o —(a,, ...,aj). Then an ex- 
pression such as o — (12 3)(4 5) is the same as 
the product of two cycles (1 2 3) and (4 5). In 
general, any permutation can be expressed as 
the product of mutually disjoint cycles (two 
cycles (a,,...,a,) and (b,, ..., bm) are said to be 
disjoint if a; # b; for all i and j). Furthermore, 
the expression of the permutation as the prod- 
uct of mutually disjoint cycles is unique up to 
the order in which these cycles are written. A 
cycle of length 2 is called a transposition. Any 
permutation may be written as a product of 
transpositions. This expression is not unique, 
but the parity of the number of transpositions 
in the expression is determined by the permu- 
tation. À permutation is called even if it is the 
product of an even number of transpositions 
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and odd otherwise. The symmetric group S, 
contains the same number of even and odd 
permutations. 

The totality of even permutations forms a 
normal subgroup of order (n!)/2(n 2 2), called 
the alternating group of degree n and usually 
denoted by A,. An even permutation is the 
product of cycles of length 3. The alternating 
group A, of degree 5 is the nonsolvable group 
of minimal order. This fact was known to 
Galois. If n4, then the alternating group A, 
is a simple group and a unique proper normal 
subgroup of S,. If n=4, A, contains a normal 
noncyclic subgroup V of order 4. In this case 
A, and V are the only proper normal sub- 
groups of S4. A noncyclic group of order 4 is 
called a (Klein) four-group. If n > 5, the sym- 
metric group S, is not a solvable group. This 
is the group-theoretic ground for the famous 
theorem, proved by Ruffini, Abel, and Galois, 
which asserts the impossibility of an algebraic 
solution of a general algebraic equation of 
degree more than four. If n <4, S, is solvable. 
The group S, has a composition series with 
composition factors of orders 2, 3, 2, 2, and 
S, is realized as the group of motions in 3- 
dimensional space which preserve an octahe- 
dron. Hence S, is called the octahedral group. 
Similarly, A4,(A,) is realized as the group of 
motions in space which preserve a tetrahedron 
(icosahedron); thus A, is called the tetrahedral 
group and A, the icosahedral group. 

These groups have been extensively studied 
in view of their geometric aspect. The group of 
motions of a plane which preserve a regular 
polygon is called a dihedral group. If a regular 
polygon has n sides, then the group has order 
2n. Sometimes a Klein four-group is included 
in the class of dihedral groups (for n— 2). The 
dihedral groups, octahedral group, etc., are 
called regular polyhedral groups. A finite 
subgroup of the group of motions in 3- 
dimensional space is either cyclic or one of 
the regular polyhedral groups. A dihedral 
group is generated by two elements of order 2. 
Conversely, a finite group generated by two 
elements of order 2 is a dihedral group. This 
simple fact has surprisingly many conse- 
quences in the theory of finite groups of even 
order [15, ch. 9]. A dihedral group of order 
2n contains a cyclic normal subgroup of order 
n, and hence is solvable. 

If nz: 6, every automorphism of S, is inner. 
The order of the group of automorphisms of 
Ss is twice the order of Sẹ. The index (S,: H} of 
a subgroup H of S, is at least equal to n unless 
H = A,. H (S,:H)=n, then H is isomorphic to 
S, .,. lfn #6, S, contains a uniaue conjugate 
class of subgroups of index n. But S, contains 
two such classes, which are exchanged by an 
automorphism of S, (— Section I). 
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H. Transitive Permutation Groups 


A permutation group G on a set © is called a 
transitive permutation group if for any pair 

(a, b) of elements of Q, there exists a permuta- 
tion of G which sends a into b. Otherwise G is 
said to be intransitive. Let G be a transitive 
permutation group on a set Q, and let a be an 
element of Q. The totality of elements of G 
which leave a invariant forms a subgroup of G 
called the stabilizer of a (in G). The index of 
the stabilizer is equal to the number of ele- 
ments of Q, the degree of G. Thus the degree of 
a transitive permutation group G divides the 
order of G (a fundamental theorem). 

The concept of orbits is important. Let G be 
a permutation group on a set Q. A subset T of 
Q is called an orbit of G if it is G-invariant and 
G acts transitively on I. In other words, a 
subset T of Q is an orbit of G if the following 
two conditions are satisfied: (1) If ae l'and 
g€ G, the image g(a) also lies in T; and (ii) if 
a and b are two elements of T, there exists 
an element x of G such that b — x(a). Thus 
each element x of G induces a permutation 
o, (x) on T. The set of all these permutations 
q,(x)(xeG) forms a permutation group on T, 
which may be denoted by p (G). Then o, (G) is 
transitive on T, and o, is a homomorphism of 
G onto ¢,(G). Thus the number of elements in 
an orbit T is a divisor of the order of G. It is 
clear that the set Q on which G acts is the 
union of mutually disjoint orbits I',, ..., ot 
G. This implies that the degree of G is the sum 
of the numbers of elements in the orbits I;. 
The resuiting equation often contains non- 
trivial relations. If v; denotes the homomor- 
phism ¢,, defined before, then G is isomorphic 
to a subgroup of the direct product of the 
groups ll, i— 1, 2, ...,r. 

A transitive permutation group is called 
regular if the stabilizer of any letter is the 
identity subgroup {1}. A transitive permuta- 
tion group is regular if and only if its order 
equals its degree. Any group can be realized 
as a regular permutation group (Cayley’s 
theorem). A transitive permutation group 
which is Abelian is always regular. 

Let G be a transitive permutation group on 
a set Q. If the stabilizer of an element a of Q is 
a maximal subgroup, G is called primitive, and 
otherwise imprimitive. A normal subgroup, 
which is #{1}, of a primitive permutation 
group is transitive. An imprimitive permuta- 
tion group induces a decomposition of the set 
Q into the union of mutually disjoint subsets 
A,,...,A, (s> 1) such that each A, contains at 
least two elements, and if xe G maps an ele- 
ment a of A; onto an element b of A,, then x 
maps every element of A, into A. x(A;)=A,. 
The set {A,,...,A,} is called a system of im- 
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primitivity. A subset A of Q is called a block if 
x(A)f1A equals A or the empty set for all x in 
G. A block is called nontrivial if Az O and A 
contains at least two elements. Each member 
of a system of imprimitivity is a nontrivial 
block. A transitive permutation group is 
primitive if and only if there is no nontrivial 
block. 

A permutation group G on a set Q is called 
k-transitive (or k-ply transitive, where k is a 
natural number) if for two arbitrary k-tuples 
(ai, ..., a4) and (b,, ..., by) of distinct elements of 
Q, there is an element of G which maps a; into 
b; for all i2 1, 2, ..., k. If k 22, G is called multi- 
ply transitive. A doubly transitive permutation 
group is always primitive. The symmetric 
group A, of degree n is n-transitive, while the 
alternating group A, is (n— 2)-transitive for 
n> 3. Conversely, an (n —2)-transitive permu- 
tation group on (1,2, ...,n] is either S, or A,- 
For multiply transitive permutation groups 
which are simple, see the list in Section I. If 
k26, no k-transitive groups are known at 
present except S, and A,,. If the Schreier con- 
jecture (— Section I) is true, then there are no 
6-transitive permutation groups except S, and 
A, (Wielandt, Math. Z., 74 (1960); H. Nagao, 
Nagoya J. Math., 27 (1964); O'Nan, Amer. 
Math. Soc. Notices, 20 (1973)). 

Two 5-transitive permutation groups other 
than S, and A, are known: the groups M,, and 
M4 of degrees 12 and 24, respectively, dis- 
covered by E. L. Mathieu in 1864 and 1871. 
The stabilizer of a letter in M,3(M>,) is a 4- 
transitive permutation group of degree 11 (23), 
denoted by M,,(M,,). No 4-transitive permu- 
tation groups other than S,, A,, M; (i2 11,12, 
23, and 24) are known. The groups M,,, Mirs 
M54, M33 and the stabilizer M3, of a letter 
in M,, are called Mathieu groups. They are 
simple groups which have quite exceptional 
properties. For Mathieu groups, see E. Witt, 
Abh. Math. Sem. Univ. Hamburg, 12 (1938). A 
k-transitive permutation group G on Q of 
degree n and order n(n—1)...(n—k+1) has 
the property that no nonidentity element of G 
leaves k distinct letters of Q invariant. If k>4, 
such a group is one of the following: S, A 
M2, and M,, (C. Jordan). For k=2 and 3, 
see H. Zassenhaus, Abh. Math. Sem. Univ. 
Hamburg, 11 (1936). 

A multiply transitive permutation group G 
contains a normal subgroup S such that S is a 
non-Abelian simple group and G is isomorphic 
to a subgroup of the group Aut S of the auto- 
morphisms of S, except when the degree n of G 
is a power of a prime number and G contains 
a regular normal subgroup of order n which is 
an telementary Abelian group (W. S. Burn- 
side). Furthermore, in these exceptional cases, 
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G is at most 2-transitive if n is odd, while G is 
at most 3-transitive if n is even but more than 
4. The symmetric group S, of degree 4 is the 
only 4-transitive group that contains a proper 
solvable normal subgroup. 

A transitive extension of a permutation 
group H on Q is defined as follows. Let oo be a 
new element not contained in Q. A transitive 
extension G of H is a transitive permutation 
group on the set {Q, oo} in which the stabilizer 
of oo is the given permutation group H on Q. 
Transitive extensions do not exist for some H. 
Suppose that a permutation group H admits a 
transitive extension G that is primitive. If H is 
simple, then G is also simple unless the degree 
of G is a power of a prime number. Construct- 
ing transitive extensions has been an effec- 
tive method for constructing sporadic simple 
groups. 

Permutation groups of prime degree have 
been studied extensively since the last century, 
partly because of their connection with al- 
gebraic equations of prime degree. Let p be a 
prime number. A transitive permutation group 
of degree p is either multiply transitive and 
nonsolvable or has a normal subgroup of 
order p with factor group isomorphic to a 
cyclic group of order dividing p — 1 (Burnside). 
Choose two cycles x and y of length p in S,. If 
y is not a power of x, then the subgroup (x, y» 
generated by x and y is a multiply transitive 
permutation group which is simple. The struc- 
ture of <x, y? is not known despite its simple 
definition. More attention has been paid to 
groups of degree p, where p is a prime number 
such that (p — 1)/2 =q is another prime number. 
The problem is to decide if such nonsolvable 
groups contain the alternating group A,. The 
Mathieu groups M,, and M,, are the only 
known exceptions for p 7. The search for 
additional exceptions has been aided by the 
development of high-speed computers. It is 
known that there ts no exceptional group 
of degree p=2q+ 1 for 23<p<4079 (P. J. 
Nikolai and E. T. Parker, Math. Tables Aids 
Comput., 12 (1958); see N. Ito, Bull. Amer. 
Math. Soc., 69 (1963), for further results in this 
direction). 

A Frobenius group is a nonregular transitive 
permutation group in which the identity is the 
only element leaving more than one letter 
invariant. A Frobenius group of degree n 
contains exactly n—1 elements which displace 
all the letters. These n— 1 elements together 
with the identity form a regular normal sub- 
group of order n. This is a theorem of Frobe- 
nius; all the existing proofs depend on the 
theory of characters. The regular normal 
subgroup of a Frobenius group is nilpotent 
(J. G. Thompson, Proc. Nat. Acad. Sci. US, 45 
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(1959); [15, ch. 10; 16, ch. 3]). A Zassenhaus 
group is a transitive extension of a Frobenius 
group. 


I. Finife Simple Groups 


All simple groups of finite order were com- 
pletely classified in February 1982 (— Section 
J; [23]). These are divided into the following 
four classes: (1) cyclic groups of prime order, 
(2) alternating groups of degree 75, (3) sim- 
ple groups of Lie type, and (4) other simple 
groups. 

The subclass (1) consists of cyclic groups of 
prime order p for any prime number p. Abelian 
simple groups belong to this subclass. The sub- 
class (4) consists of twenty-six sporadic simple 
groups including five Mathieu groups. All 
sporadic groups, other than the five Mathieu 
groups, are of recent discovery. 

Simple groups of Lie type are analogs of 
simple Lie groups, and include the classical 
groups as well as the exceptional groups and 
the groups of twisted type. 

Classical groups are divided into four types: 
"linear, tunitary, tsymplectic, and *orthogonal 
(— 60 Classical Groups). Let q =p” be a power 
of a prime number p. Consider a vector space 
V of dimension n> 2 over the field F, of q 
elements, except in the unitary case where V 
is a vector space of dimension nz 2 over the 
field F, of 4? elements. Let f be a nondegener- 
ate form on V which is *Hermitian in the uni- 
tary case (with respect to the automorphism 
of order 2 of F, over F,), *skew symmetric 
bilinear in the symplectic case, and *quadratic 
in the orthogonal case. In the orthogonal case, 
the dimension of V is assumed Z3. Consider 
the group of all linear transformations of V 
(linear case) of determinant 1, or the group of 
all linear transformations of determinant 1 
which leave the form f invariant (in other 
cases). In the orthogonal case, take the com- 
mutator subgroup. With each of these groups, 
the factor group of it by its center is a simple 
group with a few exceptions. 

There are several notations to denote these 
groups. E. Artin's notation for simple groups, 
which is reasonably descriptive and simple, 
follows the name of the simple group: n and q 
are as described in the preceding paragraph, g 
is the order of the simple group, and (a, b) 
denotes the greatest common divisor of two 
natural numbers a and b. 

Linear simple group, L,(q): 


g=gh 7 TV (q'—1)/d,  d=(n,q—1). 
i=2 
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Unitary group, U,(q): 
g=q"" Y? [[u'- C74, d-( a D 
i-2 


The structure of unitary groups does not de- 
pend on the form. 

Symplectic group, S,(q), n — 2m: 
g=q™ [[(a?—1))d, 4-(Q.q-. 
In the symplectic case, the dimension n of the 
space V must be even, so n — 2m, and the struc- 
ture does not depend on the form. 


Orthogonal group in odd dimension n — 
2m 1, Orm+1(Q): 


g-q"[[(q?—1yd, 42Q,4-1). 
i=1 


The structure does not depend on the form in 
odd dimension. l 

Orthogonal groups in even dimension n = 
2m: There are two inequivalent forms, one 
with tindex m (which is maximal) and the other 
with index m— 1. The two orthogonal groups 
are denoted by O,, (c, q), c= + 1, where e— 1 if 
the form is of maximal index and —1 other- 
wise. Then 


m-1 
g=q™ Yq" —s) | | (q?'— 1/4, 
i=1 


d — (4, q" — £). 


The value of e is determined by the form f: 
e=1 if f is equivalent to X74, x5; ., x;;, and 
£——liffexityxix3 xi Lie Xzi-1 X2 
where the polynomial t? + yt + 1 is irreducible 
over F;. 

There are other ways to denote these 
groups. Let X = X (x, x) be a group of nonsin- 
gular linear transformations of a vector space 
V. Two asterisks indicate two invariants, such 
as the dimension of V and the number of ele- 
ments in the ground field. The notation SX 
stands for the subgroup of X consisting of 
linear transformations with determinant 1, and 
the notation PX stands for the factor group of 
the linear group X by its center. Thus PX is 
a subgroup of the group of all projectivities 
of the projective space formed by the linear 
subspaces of V. The following list is self- 
explanatory, except the last term in each row, 
which is the notation of L. E. Dickson [1]: 


L,(q)= PSL(n, q) - LF(n, a 
U,(q) = PSU (n, q) - HO(n, q?) 
S5,(q) = PSp(n, q) = A(2n, q). 


(LF: linear fractional group; HO: hyperortho- 
gonal group; A: Abelian linear group.) If f isa 
nondegenerate quadratic form, then the sub- 
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group of GL (n, q) consisting of all the elements 
leaving the form f invariant may be denoted 
by O(n, q, f). Let O(n, q, f) denote the commu- 
tator subgroup of O(n, q, f). Set e=1 if f is of 
maximal index, and s= —1 otherwise. Then 


Oe, q) = PQ(n, q, f). 


Dickson's notation for orthogonal groups is 
complicated and seldom used. 

Finite simple groups corresponding to 
Lie groups of some exceptional type were 
studied by Dickson early in this century, but 
C. Chevalley (Tóhoku Math. J., (2) 7 (1955)) 
proved the existence, simplicity, and other 
properties of groups of any (exceptional) type 
over any field by a unified method. Simple Lie 
algebras over the field C of complex numbers 
are completely classified, and according to the 
classification theory they are in one-to-one 
correspondence with the tDynkin diagrams. 
Let L be a simple Lie algebra (over C) corre- 
sponding to the tDynkin diagram of type 
X (— 248 Lie Algebras). Let L=Ly>+ XL, 
be a Cartan decomposition of L, where a 
ranges over the troot system A of L. It is pos- 
sible to choose a basis B of L with the following 
properties (*Chevalley's canonical basis): 
B consists of e, € L,(xe A) and a basis of Lo; 
the structure constants of L with respect to B 
are all rational integers; the automorphism 
x,(€) in the adjoint group defined by 


X4(S)=exp(Gade,) (Ge 


maps each element of B into a linear combi- 
nation of elements of B with coefficients which 
are polynomials in Z with integer coefficients. 
Thus the matrix A,(£) representing the trans- 
formation x,(£) with respect to B has coeffi- 
cients which are polynomials in £ with integer 
coefficients. 

The elements of B span a Lie algebra L, 
over the ring Z of integers. Let F be a field and 
form L,— F & ;L;. Then L, is a Lie algebra 
over F, and the set B may be identified with a 
basis of Lp over F. Let t be any element of F, 
A,(t) be the matrix obtained from A,(£) by 
replacing the complex variable č by the ele- 
ment t, and finally x,(t) be the linear trans- 
formation of Lp represented by the matrix 
A,(t) with respect to B. The group generated 
by the x,(t) for each root « and each element t 
of F is called the Chevalley group of type X 
over F. The commutator subgroups of the 
Chevalley groups are simple, with a few excep- 
tions which will be stated after the complete 
list of simple groups of Lie type. Suppose that 
X = Anm Dn, or E; (— 248 Lie Algebras S). 
Then the Dynkin diagram of type X has a 
nontrivial symmetry; let it be a fl. Suppose 
that the field F has an automorphism o of the 
same order as the order of the symmetry of the 
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Dynkin diagram. Let 0 be the automorphism 
of the Chevalley group which sends x,(t) to 
X(t’). Let U (resp. V) be the subgroup of the 
Chevalley group generated by x,(t) with «>0, 
te F (x(t), B <0), and let U'(V!) be the sub- 
group consisting of all the elements of U(V) 
which are left invariant by 0. The group gen- 
erated by U! and Hi is called the group of 
twisted type. If the order of c is i, this group is 
said to be of twisted type 'X. In all but one 
case, the group of twisted type is simple (see 
R. Steinberg, Pacific J. Math., 9 (1959)). The 
value i is 2 except when X — D,. Since D, 
admits symmetries of orders 2 and 3, there are 
two twisted types. If X = B,, G,, or F,, then 
the diagram has a symmetry. If the character- 
istic p of the ground field F is 2, 3, or 2 accord- 
ing as X = B,, G, or F, and the field F has an 
automorphism o such that (t°) =t? for any 
t€ F, then a procedure similar to the one de- 
scribed before is applicable, and the group of 
twisted type X' is obtained (R. H. Ree, Amer. 
J. Math., 83 (1961)). The group of twisted type 
is simple if the field F has more than three 
elements. 

The following list contains all the simple 
groups of Lie type. For each classical group, 
we list the type followed by identification: 


A, — Lat (Q) (n2 1) 
^A,7 Ups (0) (nz 1) 
B, Dall (n? 1) 
C, = Son(Q) (n> 2) 
D, = O2,(1, q) (n> 3) 
"Della  (n>3) 


For other groups the type of the group is 
followed by the customary name or notation, 
if any, and the order g: 


B, Suzuki group, Sz(q), q - 22"*! 
g —4^(q —1)(q? +1) 
"D,  g-q'* (q*  q* - 1)(q* — 1) (4? — 1) 
G, g-q*(q9—1)(q? — 1) 
G, Ree group, Re(q), g=32"*? 
g — q*(q? + 1)(q— 1) 
F,  g-q'*(q'* — Y) (q* — 1)(q9 — 1) 
x (q? — 1) 
F, q-2?"" g=q'?(q°+1)(q*—1) 
x (q? + D(q— 1) 
E;  dg-q^*(q'* — Y) (q? — 1)(g* — 1) 
sief — Ui —1)(q? — 1) 
d—(3,q— 1) 
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"E — dg q?*(q'* — V) (q? rie — 1) 
x (qf — D (a? + D) (a? — 1) 
d=(3,q+1) 
E, dg-q9*(q*—1)(q'*—1)(q?—1) 
x (q!? — 1)(q? — (Daf — t) (a? — 1) 
d-—(2,q— 1) 
E g—q'^(q?? — 1)(q?* — 1)(q?? — 1) 
x (q*—1)(q!* - 0(q — 1) 
x (q* — D) (q? — 1). 


B^: M. Suzuki, Proc. Nat. Acad. Sci. US, 46 
(1960); G, and F;: Ree, Amer. J. Math., 83 
(1961): G4: Dickson, Trans. Amer. Math. Soc., 2 
(1901), Math. Ann., 60 (1905); other Chevalley 
groups: Chevalley, Tóhoku Math. J., (2) 7 
(1955); twisted types: Steinberg, Pacific J. 
Math., 9 (1959), J. Tits, Séminaire Bourbaki 
(1958), Publ. Math. Inst. HES (1959), D. Hert- 
zig, Amer. J. Math., 83 (1961), Proc. Amer. 
Math. Soc., 12 (1961). 

Nonsimple cases: L(2), L,(3), U4(2), and 
Sz(2) are solvable groups of orders 6, 12, 72, 
and 20, respectively. The groups 0,(2), GO 
G5(3), and F4(2) contain normal subgroups 
of indices 2, 2, 3, and 2, respectively. These 
normal subgroups are simple and identified as 
L4(9), U3(3), L,(8) in the first three cases. The 
normal subgroup of F4(2) is not in the list of 
simple groups of Lie type and is quite excep- 
tional (Tits's simple group, Ann. Math., (2) 80 
(1964)). 

Isomorphisms between various simple 
groups: L(q)= U2(q)=S,(q) = 03(q); Os(q) = 
Sala); HA, aq) = Lala) x Lla); O4(—1,q) = 
L2(q’); Og(1, q) = La(q); Oo( —1, 4) Halet 
Oon+1(Q) = S2,(q) if q is a power of 2; L,(2)=S,; 
L,(3)= A4; L2(4)=L2(5)= As; L(7) 2 L2); 
L4(9) - Ag; L4(2) 2 Ag; U,(2) 5 S4(3). If q is odd 
and 2nz 6, then S,,(q) and O,,..,(q) have the 
same order but are not isomorphic. L,(4) and 
L4(2) have the same order but are not isomor- 
phic. There is no other isomorphism or coin- 
cidence of orders among the known simple 
groups (Artin, Comm. Pure Appl. Math., 8 
(1955)). 

The groups of the automorphisms of simple 
groups belonging to subclasses (1), (2), and 
(3) are known. For the simple groups of Lie 
type, see Steinberg, Canad. J. Math., 10 (1960). 

The following list of twenty-six groups con- 
sists of all the simple groups that belong to 
class (4): 

Five Mathieu groups whose orders are 


.Myyiige-7920z2*-3?.5- 11 
M, 9:9 =95,040 = 25-33: 5:11 
M5919 =443,520=27-3?-5-7-11 
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M3:g = 10,200,960 —27-32-5-7- 11:23 
My 41g =244,823,040 = 219-33 5 7- 11-23. 


The other twenty-one groups have been 
discovered since 1964. Each group is identified 
by the symbol (x);, indicating that it is the ith 
group discovered in the year 19x. The list 
continues with the name or names of dis- 
coverers, the order of the group, and a brief 
description. 

(64),: Z. Janko, g= 175,560 = 2°-3-5-7- 
11-19. A subgroup of the Chevalley group 
G,(11). See J. Algebra, 3 (1966). 

(67),: M. Hall and Z. Janko, g = 604,800 = 
27-33-5?-7, a transitive extension of U,(3) 
of degree 100. 

(67),: D. G. Higman and Sims, g= 
44,352,000 = 2? - 3? - 55 -7- 11, a transitive 
extension of M,, of degree 100. It is a normal 
subgroup of index 2 in the group of automor- 
phisms of a certain graph with 100 vertices. 

(67),: Suzuki, g = 448,345,497,600 223 - 37- 
5?.7- 11-13, a transitive extension of G,(4); 
defined from the automorphism group of a 
graph of 1782 vertices. 

(67),: J. McLaughlin, g = 898,128,000 = 
27-3°-53-7-11, a transitive extension of 
U,(3); defined from a graph of 275 vertices. 

(68),: G. Higman, Z. Janko, and J. McKay, 
g = 50,232,960 = 27 -3°- 5-17-19, a transitive 
extension of the group which is obtained from 
L,(16) by adjoining the field automorphism of 
order 2. The existence was verified by using a 
computer. 

(68),, (68),, (68),: J. H. Conway, 


g 221-3955. 7211-13-23 
= 4,157,776,806,543,360,000, 

g=2'8-36-53-7-11-23, 

4—219.37-55.7- 11-23. 


The big group is obtained from the automor- 
phism group of a lattice in 24-dimensional 
space, and the two smaller ones are subgroups 
of it. The lattice was defined by J. Leech in 
connection with a problem of close packing of 
spheres in 24 dimensions (Canad. J. Math., 19 
(1967)). 

(68),: B. Fischer, g 22!7- 3?- 52-7. 11: 13— 
70,321,751,654,400, a transitive extension of 
U,(2) derived by means of a certain graph. 

(69),: D. Held and others, g - 21? 33-52. 7?- 
17 = 4,030,387,200. 

(69),: B. Fischer, g 2219-32-52. 7.11: 13- 
17:23 = 4,089,470,473,293,004,800. 

(69),: B. Fischer, 9 7 22! 316-52. 75-11-13: 
17: 23-29 = 1,255,205,709,190,661,721,292,800. 

(71),: R. N. Lyons and C. C. Sims, g = 
25:35:59. 14 11-3137-67. 

The existence of (69), and (71), was verified by 





151 J 
Finite Groups 


using computers; (69), and (69), were derived 
by means of certain graphs. For a more de- 
tailed account of these simple groups, see J. 
Tits, Séminaire Bourbaki (1970), No. 375, and 
the references [18, 19, 20.] 

(72),: A. Rudvalis, g — 2!*- 33 - 557-13. 19, 
a transitive extention of Tits's simple group, 
i.e., a normal subgroup of F4,(2) of index 2. 
Concerning this group (72),, see the article 
of J. H. Conway and D. B. Wales, J. Algebra, 
27 (1973). 

(73),: M. O'Nan, g 2 2? 3*-5. 5-11. 19.31. 
This group (73), was discovered by O'Nan and 
the existence was verified by C. Sims, using a 
computer. 

(73),: B. Fischer, g 22*! 313-55. 72-11-13: 
19: 23-31-47. 

(73),: B. Fischer and R. Griess, g — 2*9 -37° - 
5?.76.11?- 13? 17-19-2329: 31-41: 47: 
59-71. 

(74),: J. G. Thompson, g=2!5-3!10. 53.72- 
13:19-31. 

(74): K. Harada, g=2!4-36-56.7-11-19. 

The existence of (73), was suggested by B. 
Fischer, and then that of (73), by B. Fischer 
and R. Griess. Shortly after this, the exis- 
tence of (74), and (74), was suggested by J. G. 
Thompson, and then Thompson and Harada 
proved the existence of these groups with the 
aid of P. Smith and S. Norton, using com- 
puters. The existence of (73), was established 
in 1976 by Leon and Sims, using a computer. 
Very recently (July 1980), R. Griess has an- 
nounced that the group (73), is realized as 
a group of automorphisms of a 196,883- 
dimensional commutative nonassociative 
algebra over the rational numbers, 

(75),: Z. Janko, g— 2?! 35-5. 7. 11? 23:29. 
31-37-43. This group (75), was discovered 
by Z. Janko. The existence was verified by 
using a computer. 

For a more detailed account of the twenty- 
six sporadic simple groups, see [21 ]. 

Simple groups of order «1000 are A, 

(g = 60), L(7) (168), A6(360), L.(8) (504), 
and L,(11) (660). All simple groups of order 
<20,000 are known. 

Among the known simple groups, the fol- 
lowing multiply transitive permutation repre- 
sentations are known: Alternating groups A, 
(degree n), Ag and A; (degree 15), A, (degree 
10), A; (degree 6), Mathieu groups M; (degree 
i), My; (degree 12), L,(q) (degree (q”—1)/ 
(q—1)), L(p) (degree p for p=5,7, 11), U3(q) 
(degree 1 4- q?), Sz(q) (degree 1 + q?), Re (3) 
(degree 1 + 35"), S, (2) (degrees 2" *(2" + 1)), 
and the Higman-Sims group (67), (degree 176) 
and, the Conway group (68), (degree 276). 
Among them, A, (degree n, n 2 5), M, (degree i), 
M,, (degree 12), L,(2") (degree 1 -- 2"), and 
L ,(5) (degree 5) are triply transitive. 
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There are several remarkable properties of 
known finite simple groups which have been 
conjectured to hold for arbitrary finite simple 
groups. One of the most famous is the Schreier 
conjecture, which asserts that the group of 
outer automorphisms of a simple group is 
solvable. This has been verified for all known 
cases. Another conjecture says that a finite 
simple group is generated by two elements. 
This has also been verified for almost all 
known groups. In many cases, there is a gen- 
erating set of two elements, one of which has 
order 2. There is no counterexample known to 
disprove the universal validity of this property. 
Except for Sz(q), the orders of known simple 
groups are divisible by 12. 


J. Classification of Finite Simple Groups 


The objective of classification theory is to find 
the complete list of finite simple groups; this 
was accomplished in February 1982, following 
the series of important works mentioned 
below. 

The order of a finite non-Abelian simple 
group is divisible by at least three distinct 
prime numbers (W. S. Burnside; — Section D). 
The order of a finite non-Abelian simple group 
is even (W. Feit and J. G. Thompson; — Sec- 
tion D). These theorems are special cases of 
the following theorem: If G is a finite non- 
Abelian simple group in which the normalizer 
of any solvable subgroup # {1} is solvable, 
then G=L,(q)(q > 3), Sz(2?"*)(nz 1), A;, 
L4(3), U;(3), M,,, or Tits's simple group. In 
particular, a minimal simple group is isomor- 
phic to L(p)(p=2 or 3 (mod 5), p» 3), L,(2?), 
L,(3”), Sz(2?), or L4(3), where p is a prime 
number (a finite non-Abelian simple group is 
called a minimal simple group if all proper 
subgroups are solvable). This theorem is 
proved in a series of papers by J. G. Thomp- 
son (Bull. Amer. Math. Soc., 74 (1968), Pacific 
J. Math., 33 (1970), 39 (1971), 48 (1973), 50 
(1974), and 51 (1974)). The method which 
Thompson used in these papers has since been 
generalized in various ways by many authors 
to establish a number of important theorems. 
There are also some interesting consequences 
of this theorem concerning solvable groups. 
For example, a finite group is solvable if and 
only if every pair of elements generates a sol- 
vable subgroup. 

Let G be a non-Abelian simple group of 
even order and S one of its 2-Sylow subgroups. 
Then S is neither cyclic nor a generalized 
quaternion group (W. S. Burnside, R. Brauer, 
and M. Suzuki). If S is a dihedral group, then 
G — A; or Lj(q) (q odd 25) (D. Gorenstein and 
J. H. Walter). These theorems deal with the 
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cases where S is "small." The study in this 
direction has culminated in the classification of 
finite simple groups all of whose 2-subgroups 
are generated by at most four elements (D. 
Gorenstein and K. Harada, Mem. Amer. Math. 
Soc., 147 (1974)). 

If a 2-Sylow subgroup of a finite non- 
Abelian simple group G is an Abelian group, 
then G=L,(r) (r=0, 3 or 5 (mod 8), r> 3), or 
else G possesses an element of order 2 whose 
centralizer is isomorphic to Z, x L;(q) (q 3 or 
5 (mod 8), 4> 3) (J. H. Walter). In the latter 
case, G is called a group of Janko-Ree type 
(J-R type for short), and if g 45, it is called a 
group of Ree type. If q—5, then G =(64),, the 
Janko's simple group of order 175,560. The 
Ree groups Re(q) are groups of Ree type. Since 
the discovery of Re(q), it has long been an 
open problem to show that there are no other 
groups of Ree type. Very recently, the com- 
bined work of E. Bombiere, J. G. Thompson 
and others settled the problem (Inventiones 
Math., 58 (1980)). 

A surprisingly short proof of the Walter's 
theorem above is given by H. Bender (Math. 
Z., 117 (1970)). Bender's method applies to a 
much larger class of groups. A subgroup A of 
a finite group G is said to be strongly closed if 
A?) NG(A) c A for each geG. D. M. Gold- 
schmidt proved (Ann. Math., 99 (1974)) that if 
A is a strongly closed Abelian 2-subgroup, 
then the subgroup Go generated by the con- 
jugates of A possesses a normal series Gy > 
G, >G, with the properties: G, is of odd order, 
G,/G, is a 2-group and is contained in the 
center of G5/G,, and either Go =G, or Gp/G, 
is the direct product of simple groups on the 
following list: L5(q) (q=0, 3 or 5 (mod 8), 
q7» 3), Sz(2?" 1), U,(2") (n> 1), and the groups 
of J-R type. Furthermore, AG,/G,>G,/G, 
and AG,/G, is the center of a 2-Sylow sub- 
group of G5/G,. This theorem generalizes 
an earlier result of G. Glauberman (the so- 
called Z*-theorem) which states that if A is a 
strongly closed subgroup of order 2, then the 
image of A in the quotient group G/K by the 
maximal normal subgroup of odd order is a 
normal subgroup and hence is contained in 
the center of G/K. These theorems of Glauber- 
man and Goldschmidt are of fundamental 
importance in the study of finite simple groups 
since they provide an effective tool for showing 
that a given group has a normal Abelian 2- 
subgroup. 

Glauberman obtained a criterion for the 
existence of a strongly closed Abelian p- 
subgroup # {1} for some prime p [15,22]. For 
any prime p, the quadratic group Qd( p) is 
defined to be the semidirect product of the 2- 
dimensional vector space V(2, p) over the field 
of p elements by the special linear group 
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SL(Q, p) where the action of SL(2, p) on V(2, p) 
is taken to be the natural one. A finite group G 
contains a strongly closed Abelian p-subgroup 
As (1j if no section of G is isomorphic to 

Qd( p) (a section is a quotient group of a sub- 
group). Furthermore, if p is odd, then we can 
choose as A a characteristic subgroup of a p- 
Sylow subgroup S of G which is determined 
only by the structure of S. Therefore a finite 
non-Abelian simple group has a section iso- 
morphic to Qd(2) — S, except when it is one of 
the simple groups mentioned in Goldschmidt's 
theorem. This theorem generalizes an un- 
published result of J. G. Thompson to the 
effect that 3 divides the order of finite non- 
Abelian simple groups except Sz(2?"*!), 

Let G be a 2-transitive permutation group 
on n+ 1 letters, and assume that the stabilizer 
H of a letter contains a normal subgroup K 
which is regular on the remaining n letters. 
Then G contains a normal subgroup N such 
that G is isomorphic to a subgroup of the 
automorphism group of N and either N = 
L»5(q), U3(q), Sz(q) or a group of Ree type, 
or else N is 2-transitive on the n+ 1 letters and 
no nonidentity element of N leaves two dis- 
tinct letters invariant (the structure of N in the 
latter case is also known (H. Zassenhaus; — 
Section H). This theorem is proved by E. 
Shult for n even (Illinois J. Math., 16 (1972)) 
and by C. Hering, W. M. Kantor, and G. M. 
Seitz for n odd (J. Algebra, 20 (1972)). Its proof 
depends on the work of many authors who 
considered various special cases, especially the 
work of H. Zassenhaus, W. Feit, N. Ito, and 
M. Suzuki on the classification of Zassenhaus 
groups, and the work of M. Suzuki on the case 
where n is even and H/K is of odd order (Ann. 
Math., 79 (1964)). In this special case which 
Suzuki handles, the stabilizer H is of even 
order and H N H” is of odd order for any ge 
G — H. If a proper subgroup H of an arbitrary 
finite group G has this property, then H is 
called a strongly embedded subgroup. Extend- 
ing the work of Suzuki, H. Bender proved the 
following theorem (J. Algebra, 17 (1971)): If a 
finite group G possesses a strongly embedded 
subgroup, then either (1) a 2-Sylow subgroup 
of G is cyclic or a generalized quaternion 
group, or (ii) G possesses a normal series G — 
Go > G, >G, such that G,/G, and G, are of 
odd order and G,/G, = L,(2”), U,(2"), or 
Sz(2?"^!) (n> 1). This theorem generalizes 
another theorem of Suzuki who reached the 
same conclusion under the assumption that 
two distinct 2-Sylow subgroups have only the 
identity element in common. Bender's theorem 
is of fundamental importance in the classifica- 
tion theory of finite simple groups, since a 
strongly embedded subgroup often appears as 
an obstacle to the proofs of classification 
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theorems. For a generalization of Bender's 
theorem, see a paper by M. Aschbacher (Proc. 
Amer. Math. Soc., 38 (1973)), which also con- 
tains an alternative proof of Shult's theorem. 

The theorem of Shult, Hering, Kantor, and 
Seitz may be interpreted as a classification of 
finite groups having a split (B, N )-pair of rank 
l. Let G be a finite group and let B and N be 
subgroups of G such that (1) B and N generate 
G, (ii) T= BNN is a normal subgroup of N, 
and (iii) W= N/Tis generated by a set S of ele- 
ments of order 2 such that sBs # B and sBwc 
BwBU BswB for each seS and each we W. 
The subgroups B and N are called a (B, N )- 
pair of G (the quadruplet (G, B, N, S) is called 
a Tits system), and the cardinality of the set 
S is called the rank of the (B, N )-pair. The 
(B, N )-pair is said to be split if B has a normal 
subgroup U such that B= TU and TAU ={1}, 
and is said to be saturated if T= ( ,., B". If a 
finite group G has a split saturated (B, N )-pair 
of rank 1, and if Z = (\,.¢ B*, then G/Z is a 2- 
transitive permutation group satisfying the 
assumption of the theorem of Shult, Hering, 
Kantor, and Seitz, and information is obtained 
on the structure of G. In general, the simple 
groups of Lie type are characterized as simple 
groups with certain (B, N )-pairs. For (B, N )- 
pairs of rank 2, see papers by P. Fong and G. 
M. Seitz (Inventiones Math., 21 (1973), 24 
(1974)). J. Tits has developed a satisfactory 
theory on finite groups having a (B, N )-pair of 
rank at least 3 (Lecture notes in math. 386, 
Springer). 

Let G be a finite group generated by a 
conjugate class D of elements of order 2, and 
let z be the set of positive integers consisting of 
the orders of the products of two distinct 
elements of D. Furthermore, assume that G 
has no nontrivial solvable normal subgroup. 
B. Fischer proved (Inventiones Math., 13 
(1971)) that if z= (2, 3], then G contains a 
normal subgroup isomorphic to one of the 
following groups: A,, S>,(2), O2,(+1, 2), 
O,(+1, 3), U,(2), and the three Fischer's simple 
groups (68);, (69),, (69),. For a generalization 
of this theorem, see papers by M. Aschbacher 
(Math. Z., 127 (1972), J. Algebra, 26 (1973)). 
The most powerful result in this direction is 
given by F. Timmesfeld (J. Algebra, 33 (1975), 
35 (1975)): Suppose z consists of 2, 4, and odd 
positive integers. Furthermore, assume that if 
d and e are in D and de is of order 4, then 
(de e D. Then G = Ag, U;(3), the Hall-Janko 
group (67),, L,(q)(n > 3), O,,.,(q)(nz 3). 
O2,(+1,q)(n>4), G5(q), *Dalq), Fala), ^Es(q). 
E6(q), E;(q), or Es(q), where q—2". 

Let G be a simple group of even order and 
let H be the centralizer of an element of order 
2. Then the order of G is bounded by a func- 
tion f of the order h of H (for example, we can 
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choose f(h) 5 {h(h+ 1)\!; R. Brauer and K. A. 
Fowler, Ann. Math., 62 (1955)). In particular, 
there exist only finitely many isomorphism 
classes of finite simple groups which contain 
an element of order 2 with a given centralizer 
H. This fact is a ground for Brauer's program 
of studying simple groups of even order in 
terms of the structure of the centralizers of 
elements of order 2. There are a number of 
important results concerning Brauer's pro- 
gram [20]. For example, nine sporadic simple 
groups were discovered in related works. Since 
1973, Brauer's program has been improved 
greatly by M. Aschbacher, D. Gorenstein, and 
others, and the classification of finite simple 
groups was finally completed in February 1982 
[21,23]. 
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152 (VII.18) 
Finsler Spaces 


A. Definitions 


Let T(M) be the ‘tangent vector bundle of an 
n-dimensional tdifferentiable manifold M. An 
element of T(M) is denoted by (x, y), where x is 
a point of M and y is a tangent vector of M at 
x. Given a tlocal coordinate system (x!, ..., x") 
of M, we can obtain a local coordinate system 
of T(M) by regarding (x!, ..., x", yl, ..., y") — 
(xi, y?) as coordinates of the pair (x, vie T(M), 
where (x!, ..., x") are coordinates of a point x 
of M and y « y/0/éx/. A continuous real- 
valued function L(x, y) defined on T(M) is 
called a Finsler metric if the following condi- 
tions are satisfied: (1) L(x, y) is differentiable at 
y £0; (ii) L(x, Ay) - |A| L(x, y) for any element 
(x, y) of T(M) and any real number 4; and 

(iii) if we put dal, y) - (1/2)0? L(x, y?/0y'6y, 
the symmetric matrix (g,;(x, y)) is positive de- 
finite. A differentiable manifold with a Finsler 
metric is called a Finsler space. There exists 

a Finsler metric on a manifold M if and only 
if M is tparacompact. We call F(x, y)= L(x, y)? 
the fundamental form of the Finsler space. 
When F(x, y) is a quadratic form of (y!, ... , y"), 
L(x, y) is a *Riemannian metric, and F(x, y) — 
Y, gu y' y). Therefore a Finsler space is a 
Riemannian space if and only if g; does not 
depend on y. The matrix g;; is also called the 
fundamental tensor of the Finsler space (i, j = 
1,... 7). 

Thus the notion of a Finsler metric is an 
extension of that of Riemannian metric. The 
study of differentiable manifolds utilizing such 
generalized metrics was considered by B. 
Riemann, but he stated that a Riemannian 
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metric is more convenient for the purpose 
since “only nongeometrical results can be 
obtained” by using Finsler metrics [7]. P. 
Finsler initiated the systematic study of Finsler 
metrics and extended to a Finsler space many 
concepts and theorems valid in the classical 
theory of curves and surfaces [5]. 


B. The Finsler Metric 


In a Finsler space, the arc length of a curve 
x=x(t) (a<t<b) is given by f? L(x, dx/dt) dt. 
Therefore a *geodesic in a Finsler space is 
defined as a stationary curve for the problem 
of tvariation ô f? L(x, dx/dt)dt =0, and the 
differential equation of the geodesic is given by 


ax dx! dx* 
es Am uc 


where y;,(x, y) is the "Christoffel symbol of g;;, 
1.€., 


i 1 i Og, Og REI 
AIX, y)29- ja + - " 
Yao y) 52.8 (2: Ox! ax? 





where (g"(x, y)) is the inverse matrix of 
(g(x, y)). 

The distance between two points in a Fin- 
sler space is defined, as in a Riemannian space, 
as the infimum of the lengths of curves Joining 
the two points. Many properties of Riemann- 
ian spaces as metric spaces can be extended 
to Finsler spaces. The topology defined by the 
Finsler metric coincides with the original 
topology of the manifold. A Finsler space M is 
said to be tcomplete if every Cauchy sequence 
of M as a metric space is convergent. The 
following three conditions are equivalent: (i) M 
is complete; (11) each bounded closed subset 
of M is compact; (iii) each geodesic in M is 
infinitely extendable. In a complete Finsler 
space, any two points can be joined by the 
shortest geodesic. 

A diffeomorphism q of a Finsler space M 
preserves the distance between an arbitrary 
pair of points if and only if the transforma- 
tion on T(M) induced by o preserves the 
Finsler metric L(x, y). Such a transformation is 
called an tisometry of the Finsler space. In the 
tcompact-open topology the set of all isome- 
tries of a Finsler space is a tLie transformation 
group of dimension at most n(n + 1)/2. If a 
Finsler space admits the isometry group of 
dimension greater than (n(n — 1) -- 2)2, it isa 
Riemannian space of constant curvature [8]. 


C. The Theory of Connections 


An important difference between a Finsler 
space and a Riemannian space relates to their 
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properties with respect to the theory of fcon- 
nections. In the case of a Riemannian space, 
the Christoffel symbols constructed from the 
fundamental tensor are exactly the coefficients 
of a connection, whereas in the case of a Fin- 
sler space, the Christoffel symbols yi (x, y) do 
not define a connection, for the fundamental 
tensor g;; depends not only on the points of the 
space but also on the directions of tangent 
vectors at these points. 

When we consider notions such as tensors, 
etc., in a Finsler space M, it is generally more 
convenient to take the whole tangent vector 
bundle T(M) into consideration rather than 
restricting ourselves to the space M. For ex- 
ample, let P be the ttangent n-frame bundle 
over a Finsler space M and Q— p" (P) be the 
tprincipal fiber bundle over T(M) induced 
from P by the projection p of T(M) onto M. 
We call the elements of fiber bundles asso- 
ciated with Q tensors. In this sense, the fun- 
damental tensor g; in a Finsler space is the 
covariant tensor field of order 2. Therefore it is 
natural to consider a connection in a Finsler 
space as a connection in the principal fiber 
bundle Q. The connection in a Finsler space 
defined by E. Cartan is exactly of this type [3]. 
Namely, he showed that by assigning to a 
connection in Q certain conditions related to 
the Finsler metric, we can determine uniquely 
a connection from the fundamental tensor so 
that the covariant differential of the funda- 
mental tensor vanishes. 

Cartan's introduction of the notion of con- 
nection produced a development in the theory 
of Finsler spaces that parallels the develop- 
ment in the theory of Riemannian spaces, and 
many important results have been obtained. 
O. Varga (1941) succeeded in obtaining a 
Cartan connection in a simpler way by using 
the notion of osculating Riemannian space. 

S. S. Chern (1943) studied general Euclidean 
connections that contain Cartan connections 
as a special case. Noticing that the tangent 
space of a Finsler space is a *normed linear 
space, H. Rund (1950) obtained many notions 
different from those of Cartan. However, as far 
as the theory of connections is concerned, the 
two theories do not seem to be essentially 
different. The theory of curvature in a Finsler 
space is more complicated than that in a 
Riemannian space because we have three 
curvature tensors in the Cartan connection. 
Using the fact that, in a local cross section of 
the tangent vector bundle of a Finsler space, a 
Riemannian metric can be introduced by the 
Finsler metric, L. Auslander (1955) [2] ex- 
tended to Finsler spaces the results of J. L. 
Synge and S. B. Myers on the curvature and 
topology of Riemannian spaces. A. Lichnéro- 
wicz extended the tGauss-Bonnet formula to 
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Finsler spaces by considering an integral on 
the subbundle of the tangent vector bundle 
satisfying L(x, y) 2 1 [6]. M. H. Akbar-Zadeh 
studied fholonomy groups and transformation 
groups of Finsler spaces by using the theory of 
fiber bundles. 

Connections of Finsler spaces have been 
investigated by many geometers, but most of 
them used methods considerably different from 
those of the modern theory of connections in 
principal fiber bundles. J. H. Taylor and Synge 
(1925) defined the covariant differential of a 
vector field along a curve. L. Berwald (1926) 
defined a connection from the point of view of 
the general geometry of paths. A curve on a 
manifold satisfying the differential equation 


d? x! Ae dx 0 

Ai ` (s dt ) F 

is called a path. The theory was originated by 
O. Veblen and T. Y. Thomas and generalized 
as above by J. Douglas. Characteristically, 
with respect to a Berwald connection, the 
covariant differential of the fundamental ten- 
sor does not vanish. 

A Finsler space is a space endowed with a 
metric for line elements. As a dual concept, we 
have a Cartan space, which is endowed with a 
metric for areal elements [4]. A. Kawaguchi 
(1937) extended these notions further and 
studied a space of line elements of higher order 
(or Kawaguchi space). 
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A. General Remarks 


Given a mapping f of a space X into itself, a 
point x of X is called a fixed point of f if f(x) 
— x. When X is a topological space and f 
is a continuous mapping, we have various 
theorems concerning the fixed points of f. 


B. Fixed-Point Theorems for Polyhedra 


(1) Brouwer Fixed-Point Theorem. Let X bea 
*simplex and f: X X a continuous mapping. 
Then f has a fixed point in X (Math. Ann., 69 

(1910), 71 (1912)). 


(2) Lefschetz Fixed-Point Theorem. Let H,(X) 
be the p-dimensional thomology group of a 
‘finite polyhedron X (with integral coeffi- 
cients), T,(X) the ttorsion subgroup of H,(X), 
and B,(X)— H,(X)/T,(X). The continuous 
mapping f: X X naturally induces a homo- 
morphism f, of the free Z-module B,(X) into 
itself. Let a, be the *trace of f, and A, = 

p-o(— 1*8, (n — dim X). We call this integer 
A, the Lefschetz number f. 

We have the Lefschetz fixed-point theorem: 
(1) Let f, g be continuous mapping sending X 
into itself. If f, g are thomotopic (fœ g), then 
A,=A,. (ii) If A, #0, then f has at least one 
fixed point in X (Trans. Amer. Math. Soc., 28 
(1926)). 

The condition A, #0 ts, however, not neces- 
sary for the existence of a fixed point of f. The 
Brouwer fixed-point theorem is obtained im- 
mediately from (1) and (ii). in particular, if the 
mapping f is homotopic to the identity map- 
ping 1x, then a, is the pth tBetti number of X, 
and A, is equal to the *Euler characteristic 
y(X) of X. Hence, in this case, if y(X) 40, then 
f has a fixed point. 

When X is a compact oriented manifold 
without boundary the Lefschetz number A, of 
f can be interpreted as the fintersection num- 
ber of the graph of f and the diagonal of X. 

More generally, let X and Y be compact 
oriented n-dimensional manifolds without 
boundary. If f and g: X ^Y are continuous 
mappings, a point x of X such that f(x)— 
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g(x) is called a coincidence point of f and g. 
The intersection number A, , of the graph 

of f and that of g is called the coincidence 
number of f and g. If A; 40, then f and g 
have at least one coincidence point. The coin- 
cidence number A , , is also expressed as 
Zp=0(—1)"tr(f* o g| H'(X)), where g: 
H?(X)—> H*(Y) is the tGysin homomorphism 
of g. 

Suppose that a finite group G acts on the 
manifolds X and Y. If f: X ^ Y is a mapping, a 
point x of X such that tf(x)= ft(x) for all ve G 
is called an equivariant point of f. When G is a 
group of order 2 and acts on X nontrivially, 
the equivariant point index A y is employed. 
This index was introduced by Nakaoka 
(Japan. J. Math. 4 (1978)), using the 'equivar- 
iant cohomology. It has the property that 
A 7 #0, implying that f admits an equivariant 
point. The prototype of this theorem is the 
Borsuk-Ulam theorem (Fund. Math. 20 (1933)), 
which states that a continuous mapping f: 
S" 5 R" always admits a point xe S” such that 


Sx) = ft- x). 


(3) Lefschetz Number and Fixed-Point Indices. 
Suppose that |K| is an n-dimensional homo- 
geneous polyhedron (i.e., any simplex of K 
that is not a face of another simplex of K is of 
dimension n), and f:|K|-—|K| is a continuous 
mapping. Then there exists a continuous 
mapping g:|K|—|K| homotopic to f and 
admitting only isolated fixed points (4,, ..., 
q,}, each of which is an inner point of an n- 
dimensional simplex of K. The tlocal degree 
À; of a mapping g at q; is called the fixed- 
point index of g at q;. Then J; = X. 4; does 
not depend on the choice of g and is equal to 
(—1)"A,. 


(4) Singularities of a Continuous Vector Field. 
Let X be an n-dimensional tdifferentiable 
manifold and F a fcontinuous vector field on 
X that assigns a tangent vector x, to each 
point p of X. A point p is called a singular 
point of F if x, is the zero vector. The vector 
field F induces in a natural manner a continu- 
ous mapping f: X 5 X that is homotopic to 
the identity mapping 1,. Then a fixed point of 
f is a singular point of F, and vice versa. When 
such a singular point p is isolated, there exists 
a tcoordinate neighborhood N of p that is 
homeomorphic to an n-dimensional open ball 
such that x, is nonzero for every point q in N 
except for g=p. Let N' be the boundary of N. 
Then we may consider N'z S"! and a map- 
ping F|N' from S"! to R"«(0] S"! The 
tdegree of a mapping 5"! L5 R"~ {0} a 
is called the index of the singular point p. This 
index is equal to the fixed-point index 4, of f 
at p. Hence, when X is compact and has no 
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boundary, the sum of indices of (isolated) 
singular points of F is equal to (—1)" Y(X). In 
particular, a compact manifold X without 
boundary admits a continuous vector field 
with no singular point if and only if 7(X)=0 
(Hopf's theorem, Math. Ann., 96 (1927)). 


(5) Poincaré-Birkhoff Fixed-Point Theorem. In 
certain cases, a continuous mapping f: X >X 
of a finite polyhedron X into itself has fixed 
points even if A, —0. For example, let X be the 
annular space ((r,0)| x &r « fi) ((r,0) are the 
polar coordinates of points in a Euclidean 
plane) and let f: X 2 X be a homeomorphism 
satisfying the following conditions: (i) there 
exist continuous functions g(0), h(0) such that 
gll « 0, h(8) 0, f(a, 0) — (a. 9(0)), f(B. 0) = 

(B, h(0)); (ii) there exists a continuous positive 
function p(r, 0) defined for «<r « f such that 


[f vrai et Tir. 0)) dr d0 
D D 


for all measurable sets D. 

Then f has at least two fixed points. This 
theorem was conjectured in 1912 by Poincaré, 
who hoped to apply it to solve the frestricted 
three-body problem. The theorem was later 
proved by G. D. Birkhoff (Trans. Amer. Math. 
Soc., 14 (1913); see also M. Brown and W. D. 
Neumann, Michigan Math. J. 24 (1977)) and 
is called the Poincaré-Birkhoff fixed-point 
theorem or the last theorem of Poincaré. 


C. Atiyah-Bott and Atiyah-Singer Fixed-Point 
Theorems 


There is a far-reaching generalization of the 
Lefschetz formula given by Atiyah and Bott 
(Ann. Math., (2) 86 (1967), 88 (1968)). Let M be 
a compact differentiable manifold without 
boundary and f: M M a differentiable map- 
ping with only simple fixed points; that is, it is 
assumed that dert! — df) #0 for each fixed 
point pe M of f, where df, is the differential of 
f at the point p. The fixed points of f are finite 
in number. Suppose that an *elliptic complex 
over M 


£:02T(E)ST(E)3... S T(E)50 


and a sequence of smooth vector bundle 
homomorphisms 9;: f * E, E; (i— 0, ...,1) 

are given such that d; T; — T; ,, d; for each i, 
where T;: l'(E)-I (Ej) is defined by T;s(x) — 
9;s( f(x)) for sel (Ej. The sequence T «(T)) 
induces endomorphisms H'T of the homology 
groups H'(&) of the elliptic complex d. We 
define the Lefschetz number L(T) by L(T)= 
Di-o(—1)'tr H'T. On the other hand, for a 
fixed point p of f, let o; p: E; p> E; p denote 
the restriction of o, on the fiber E, , of E; over 
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p. Under the circumstances mentioned above 
the Lefschetz number L(T) is given by the 
formula 


M- 


(—1)'tro,; , 


I 


co 


L(T)=y £9 — — —— 
v z |det(1 —df,)| ' 


where the summation is over the fixed points 
of f. 

Here are some examples of the above for- 
mula. First, take as & the tde Rham complex 
and as o, the obvious one, i.e., the ith exterior 
power of the transpose of df. In this case, the 
formula reduces to the classical Lefschetz 
formula. As a second example consider the 
*Dolbeault complex 


02 A99 (M) S A! (M) ... S A9 (M)-0 


of a compact complex manifold M and a 
holomorphic mapping f: M M with only 
simple fixed points. In this case the formula 
above reduces to one giving the Lefschetz 
number of the induced endomorphisms 
f*: H9 *(M)5 H?-*(M) of the tDolbeault 
cohomology: 


1 
*) — 
LU" 2 ded —df,)’ 
where df, is regarded as a holomorphic 
differential. 

If the assumption that the mapping f has 
only simple fixed points is replaced by the one 
that f is a diffeomorphism of M contained in 
a compact *transformation group G, then 
there is also a generalized Lefschetz formula, 
given by Atiyah and Singer (Ann. Math., (2) 87 
(1968)). The fixed-point set of such a diffeo- 
morphism is a closed submanifold of M (con- 
sisting of several components). Suppose that 
we are given an elliptic complex 4 over M and 
a lift of the G-action on M to &. The latter 
implies that, if we define TL l'(Ej)) 5T (E;) by 
Tis(x) - f ! s(f(x)) for ser (E, then d; T,= 
T;,,4; holds for each i. Under these circum- 
stances, the Lefschetz number L(T) can be 
expressed in the form 
L(T)= 2 Wiel 
where the summation is over the components 
{ F;\ of the fixed-point set M^ of f and where 
the number v(F;) is written explicitly in terms 
of the tsymbol of the elliptic complex & with 
G-action, the characteristic classes of the mani- 
fold F,, the characteristic classes of the normal 
bundle of F, in M, and the action of g=f~' on 
the normal vectors. The formula is essentially 
a reformulation of the tAtiyah-Singer index 
theorem. In fact, L(T) is the tanalytic index of 
& evaluated on g and the number v(Fj) is de- 
duced from the *topological index of & using 
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the localization theorem. The most useful 
elliptic complexes are de Rham complexes, 
Dolbeault complexes, signature operators and 
Dirac operators. In the case of Dolbeault 
complexes, f is assumed to be an analytic 
automorphism, and the number v(F;) takes the 
form 


_ [LZE 


TU" ded ap 


[F;]. 

Here, the normal bundle N of F, has a decom- 
position N =, N(0) into the sum of complex 
vector bundles such that g acts on N (0) as e”, 
and 4/? is the characteristic class defined by 


1 —e i? 


pd e 
where the *Chern class of N(0) is written as 
c(N(0)) - IT(1 + xj; moreover 7 (F;) denotes 
the tTodd class of the complex manifold F,, 
and N* denotes the dual bundle of N (— 237 
K-Theory H). 


D. Fixed-Point Theorems for Infinite- 
Dimensional Spaces 


Birkhoff and O. D. Kellogg generalized 
Brouwer's fixed-point theorem to the case of 
function spaces (Trans. Amer. Math. Soc., 23 
(1922)). Their result was utilized to show the 
existence of solutions of certain differential 
equations, and has led to a new method in the 
theory of functional equations. 

J. P. Schauder obtained the following theo- 
rem: Let A be a closed convex subset of a 
Banach space, and assume that there exists a 
continuous mapping T sending A to a tcount- 
ably compact subset T(A) of A. Then T has 
fixed points (Studia Math., 2 (1930)). This 
theorem is called the Schauder fixed-point 
theorem. 

A. Tikhonov generalized Brouwer's result 
and obtained the following Tikhonov fixed- 
point theorem (Math. Ann., 111 (1935)): Let R 
be a locally convex ttopological linear space, 
A a compact convex subset of R, and T a con- 
tinuous mapping sending A into itself. Then 
T has fixed points. 

This theorem may be applied to the case 
where R is the space of continuous mappings 
sending an m-dimensional Euclidean space E" 
into a k-dimensional Euclidean space EF to 
show the existence of solutions of certain 
differential equations. For example, when m — 
k — 1, consider the differential equation 


dy/dx = f(x, y), 


We set T(y) — yo + fs, f(t, y(t))dt to determine a 
continuous mapping T: Ro R. Then the fixed 
points of T are the solutions of the differential 


y(xo) = yo. 
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equation. Now we can apply the theorem of 
Tikhonov to show the existence of solutions. 

On the other hand, when we are given prob- 
Jems of functional analysis, Schauder’s fixed- 
point theorem is usually more convenient to 
apply than Tikhonov’s theorem. 

The following theorem, written in terms of 
functional analysis, is useful for applications: 
Let D be a subset of an n-dimensional Eucli- 
dean space, F the family of continuous func- 
tions defined on D, and T: FF a mapping. 
Suppose that the following three conditions 
are satisfied: (i) For fj, f,eF, 0<A<1 implies 
Af, - (1 — 2)f5 € F. (ii) If a series { fp} of func- 
tions in F converges uniformly in the wider 
sense to a function f, then fe F; and further- 
more, the series ( Tf,) converges uniformly in 
the wider sense to Tf. (iii) The family T(F) is a 
tnormal family of functions on D. Then there 
exists a function fe F such that Tf— f. 

Let R be a topological linear space and T a 
mapping assigning a closed convex subset T(x) 
of R to each point x of R. A point x of R is 
called a fixed point of T if x € T(x). The map- 
ping T is called semicontinuous if the condition 
X >a, y,—b (y E T(x,)) implies that be T(a). In 
particular, if K is a bounded closed convex 
subset of a finite-dimensional Euclidean space 
R and T a semicontinuous mapping sending 
points of K into convex subsets of K, then 
T admits fixed points (Kakutani fixed-point 
theorem, Duke Math. J., 8 (1941)). This result 
was further generalized to the case of locally 
convex topological linear spaces by Ky Fan 
(Proc. Nat. Acad. Sci. US, 38 (1952)). 
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A. Introduction 


A foliation is a kind of geometric structure on 
manifolds, such as a differentiable or complex 
structure. The study of foliations evolved from 
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investigation of the behavior of torbits of a 
vector field and also of the solutions of *total 
differential equations. Through the early 
works of C. Ehresmann, G. Reeb, and A. Hae- 
fliger, together with the development of mani- 
fold theory in 1960s, it became an established 
field of mathematics. Since then, great progress 
has been made in this field, especially in its 
topological aspects. At the same time it turns 
out that many problems in foliation theory are 
deeply related not only to the geometry of 
manifolds but also to various other branches 
of mathematics, such as the theory of differen- 
tial equations, functional analysis, and group 
theory. 


B. Definitions and General Remarks 


A foliation on a manifold can be defined 
within various categories: topological, C’- 
differentiable (1 <r < oo), real analytic, and 
holomorphic. For definiteness, however, we 
restrict ourselves to the C’-differentiable cate- 
gory in what follows. Furthermore, all mani- 
folds are assumed to be paracompact. 

Let M be an n-dimensional C”-manifold, 
possibly with boundary. A codimension q, C’- 
foliation of M (0€qxn, 0x r oo) is a family 
F —ÍL,|xe A] of arcwise connected subsets, 
called leaves, of M with the following prop- 
erties: (i) LL, = Ø if x # x'; (ii) (Jaca La= 
M; (iii) Every point in M has a local coordi- 
nate system (U, y) of class C” such that for 
each leaf L, the arcwise connected components 
of UN L, are described by x" **! = constant, 
...,X" «constant, where x!, x?, ..., x" denote 
the local coordinates in the system (U, y). 

In particular, every leaf of F is an (n— q)- 
dimensional fsubmanifold of M. The totality 
of integral submanifolds of a tcompletely 
integrable nonsingular system of * Pfaffian 
equations on R”, œ; = gr (x)dx, t aj; (x) dx; + 
c 4;,(x)dx, =0 (i= 1,2,...,q) forms a co- 
dimension q foliation, and the totality of inte- 
gral curves of a nonsingular vector field of 
class C” on M (r2 1) constitutes a codimension 
n — | C’-foliation. 

Let F be a codimension q, C’-foliation of M 
(r z 1). Then M admits a C" ! tp-plane field 
consisting of all vectors tangent to the leaves 
of F, and, dually, a C"! q-plane field (p +q = 
n). Denote the former by t(¥) and the latter 
by v( F), and call them the tangent bundle and 
the normal bundle of F, respectively. v(F) is 
isomorphic to the quotient bundle T(M)/t(.7 ). 
F is called transversely orientable if v(.7) is an 
orientable vector bundle. A C’-mapping f: 
NM is said to be transverse to the foliation 
F if the composite mapping T(N)S T(M)5 
T(M)/t(F) is epimorphic at each point of N. 
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In this case, f induces a codimension q, C’- 
foliation f ^! (.7) of N whose leaves are the 
arcwise connected components of f^! (L) 
(LEF). 

In particular, if Q is a q-dimensional C’- 
manifold and f: N 5 Q is a C'-submersion, f 
induces a codimension q, C’-foliation of N 
whose leaves are the arcwise connected com- 
ponents of f~!(x) (xeQ). 

A C p-plane field X on M is called involu- 
tive if, for any C" vector fields X, Y on M such 
that X,, Kei, (xe M), the tLie bracket LX, Y] 
satisfies [ X, Y ],e Z,. This condition is known 
as the Frobenius integrability condition for 
A. If X is defined locally by q Pfaffian equa- 
tions œ; =... — 0, —0, the above condition is 
equivalent to the condition that there are 
CT 1-forms 0, (i,j —1,..., q) such that dru — 
235.1 0j^c;. A C p-plane field % is said to be 
completely integrable if it is a tangent bundle 
of some foliation. When r 2 2, Z is completely 
integrable if and only if it is involutive (Frobe- 
nius theorem) (— 428 Total Differential Equa- 
tions). There is a topological obstruction to 
the complete integrability of Z (— Section F). 

A closed C*-manifold M admits a codi- 
mension 1, C’—plane field if and only if the 
Euler number of M vanishes. In 1944, Reeb 
constructed a codimension 1, C®-foliation of 
the 3-sphere S? as follows [1]. Let f(x) bea 
C-even function defined on the open interval 
(—1, 1), such that 


| d® | 
E dx9 fo) o (k=0,1,2,...). 
The graphs of the equations y= f(x)-c (—1 < 
x< 1,ceR) together with the lines x= +1 
constitute a codimension 1, C”-foliation of 
[—1, 1] xR. Then by rotating it around the y- 
axis in R?, we obtain a codimension 1, C®- 
foliation of D? x R, where D? denotes the 
closed 2-disk. The foliation is invariant under 
vertical translations and therefore defines a 
codimension 1, C®-foliation of D? x S'. This 
foliation is called the Reeb component of D? x 
S'. Since S? is a union of two solid tori inter- 
secting in the common toroidal surface, the 
Reeb components of each solid torus con- 
structed above define the so-called Reeb foli- 
ation of S? (Fig. 1). 





Fig. 1 
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Generalizing the above construction into a 
differential topological method, one obtains 
the following results: Every closed 3-dimen- 
sional manifold admits a codimension 1, C^- 
foliation (S. P. Novikov [3], W. Lickorish, J. 
Wood); every odd-dimensional sphere admits 
a codimension 1, C®-foliation (I. Tamura, A. 
Durfee, B. Lawson). On the other hand, every 
open manifold has a codimension 1, C*- 
foliation induced by a submersion over R (— 
Section F). 

Let M be a total space of a C’-bundle over 
a C*-manifold B with fiber F. If F is a C*- 
manifold and the *structure group reduces to a 
totally disconnected subgroup of Diff" F, the 
group of all C’-diffeomorphisms of F, then the 
local sections, which are defined in an obvious 
manner using the local triviality of this bundle, 
fit together to give leaves of a codimension q, 
C’-foliation (q = dim F). In this case, M is 
called a foliated bundle or a flat F-bundle over 
B. Each leaf of this foliation is diffeomorphic 
to a covering manifold of B and transverse to 
the fibers of the bundle M >B. Foliated bun- 
dies exhibit a class of foliations; this is espe- 
cially important in connection with the char- 
acteristic classes of foliations (— Section G). 


C. Holonomy 


The notion of the holonomy of a leaf, given by 
Ehresmann, is a generalization of the *Poin- 
caré mapping in *dynamical systems. Let ¥ be 
a codimension q, C’-foliation of M and L be a 
leaf of F. Let N(L) denote the total space of 
the normal disk bundle of L in M. Choose a 
C’-immersion i: N(L) M such that i restricted 
to the zero section of N(L) is the natural inclu- 
sion and i maps the fibers of N(L) transversely 
to the foliation F. Then the induced foliation 

i !(7) of N(L) has the properties that the 
leaves are transverse to the fibers of N(L) and 
the zero section of N(L) is a leaf. If y is an 
oriented loop in L based at a point x, € L, then 
there is a neighborhood U of 0 in the fiber 
over x, satisfying the following: for each point 
x€U there is a curve y,.:[0, 1] —^ N(L) having 
the properties: (i) »,(0) 2 x, (11) Im(y,) lies on a 
leaf of i! (.Z), and (iii) xo y,(t) - (t) for any 
rel. 1], where z: N(L).L is the bundle pro- 
jection. The family of curves (y,|xeUj gives a 
C"-diffeomorphism H, from U to another open 
set of 1! (xo), which assigns y,(1) to x. Let G; 
denote the group of tgerms at 0 of all local 
C’-diffeomorphisms of R? which fix 0. The 
germ at 0 of the mapping H, depends only on 
the thomotopy class of y, and, by identifying 

x (xg) with R*, we obtain a homomorphism 
h, : n, (L, xo) 2 G;. h, is determined by L up to 
conjugacy and is called the holonomy homo- 











154 D 
Foliations 


morphism, or simply the holonomy, of the leaf 
L. The image of h, is called the holonomy 
group of L. For rz 1, by differentiating each 
element of G;, one has a homomorphism 

dh, : n,(L, x9) 2 GL(q; R), called the linear 
holonomy of L. The holonomy of a proper leaf 
(— Section D) completely characterizes the 
foliation of a neighborhood of it (Haefliger). 


D. Topology of Leaves 


Let be a codimension q foliation of M. The 
leaf topology is a topology of M defined by 
requiring each connected component of the set 
of the form U N L to be open, where U is an 
open set in M and Le F. Leaves are nothing 
but the connected components of M with 
respect to this topology. A leaf Le F is called 
a compact leaf if L is compact in the leaf top- 
ology. In general, L is called proper if two 
topologies on L induced from the original and 
the leaf topologies on M coincide. Any com- 
pact leaf is proper. A leaf L 1s called locally 
dense if Int Lz Ø. If a leaf is neither proper 
nor locaily dense, it is called exceptional. Since 
we are assuming that M is paracompact, a 
leaf that is a closed subset of M is proper 
(Haefliger). There exists a codimension 1, C!- 
foliation of the 2-torus T? that contains 
exceptional leaves. But in the C" category 
(rz 2), such a foliation does not exist on T? (A. 
Denjoy, C. Siegel). In higher dimensions, there 
are examples of C*-foliations with exceptional 
leaves (R. Sacksteder). The following result is 
called the Novikov closed leaf theorem [3]: 
Any codimension 1, C’-foliation (r > 2) of a 
closed 3-dimensional manifold M contains a 
Rech component if either z; (M) is finite or 
1>(M)#0(M xS x S?2,S! x RP?) In partic- 
ular, every C?-foliation of S? contains a com- 
pact leaf homeomorphic to T?. The question 
of whether every codimension 2, C’-foliation of 
S? has a compact leaf is known as the Seifert 
conjecture. There is a counterexample in the 
C! case (P. Schweitzer [7]), but it remains 
unsolved for r z2 (— 126 Dynamical Systems 
N). 

A compact leaf L is said to be stable if it has 
an arbitrarily small open neighborhood that 
is a union of compact leaves. The following 
results are called the Reeb stability theorems: 
(1) Let L be a compact leaf of a C’-foliation 
(r 2 0). If the holonomy group of L is finite, 
then it is stable (Reeb [1]). (2) Let F bea 
transversely orientable codimension 1, C’- 
foliation (r > 1) of a compact manifold M 
(tangent to the boundary). If there exists a 
compact leaf L with H'(L; R) —0, then M isa 
fiber bundle over S! or [0,1], and the leaves of 
F are the fibers of this bundle. In particular, 


154 E 
Foliations 


every leaf is compact and stable (Reeb [1], 
Thurston [8]). 

The generalization of the stability theorem 
for proper leaves has been investigated by T. 
Inaba and P. Dippolito. 


E. Haefliger Structures 


Let 4; be the *pseudogroup consisting of all 
C"-diffeomorphisms g of an open subset of Ri 
to another open subset of RI. We write I7 for 
the set of all germs [g], of g at x, xedomain of 
g, g€ $;. The sheaf topology of F} is the to- 
pology whose open base is the family of sub- 
sets of the form |), caomainor g4 L9]x}- With this 
topology and the multiplication induced from 
the composition of 4, I7 is a topological 
groupoid, i.e., a *groupoid whose multiplication 
and inverse mappings are continuous. If one 
identifies a point x in R? with [idg],, L7 con- 
tains R? as a subspace. 

A codimension q, C’—Haefliger structure or 
a I 7-structure # on a topological space X isa 
maximal covering of X by open sets {U,|ieJ}, 
such that for each pair i, jeJ, there is a con- 
tinuous mapping 7;;: U;f1 U,S T7 satisfying 


Vilx) = yu (X) o ya (x) for xeU;f UN U,. (*) 


Since yi,(x) is the germ of the identity mapping 
for xe U;, the correspondence A y;;(x) defines 
a continuous mapping f;: U;2 R*c I7. A co- 
dimension q, C’-foliation of a C*-manifold M 
is the same as a I 7-structure on M such 
that each f, is a C'-submersion (Haefliger [5]). 

If f: YX is continuous and J is a 
I7-structure on X, there is an induced I7- 
structure f! XX on Y which is defined by 
IT U) ygoflijeJ). Two I%-structures % 
and Jf on X are said to be homotopic if there 
exists a I 7-structure # on X x [0,1] such that 
H | y «qj =H, (t=O, 1). 

Let I7(X) be the set of homotopy classes of 
I 7-structures on X. There exists a space BT}, 
called the classifying space for 17-structures, 
such that there is a natural one-to-one corre- 
spondence between [7(X) and LX, BT7] for 
any paracompact space X, where [ A4, B] de- 
notes the set of homotopy classes of continu- 
ous mappings from A to B. By condition (*) 
above, if r — 0, the differentials (dy;(x)|xe 
U;N U, i, jeJ} define a q-dimensional vector 
bundle v(#) over X, which is called the nor- 
mal bundle of #. The correspondence # ^ 
v(#) gives a continuous mapping v: BI? 
BGL(q; R) among classifying spaces. If r=0, 
there is also a similar mapping v: BI? 
BTop,. Let BI’, be the homotopy fiber of the 
mapping v. BI”, is a classifying space for the 
I?-structures with trivialized normal bundles. 

There is a tight connection between the 
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topology of BT? and the group structure 

of Diff' (R9), which is stated below. Let 

Diff (R?) be the topological group of all C’- 
diffeomorphisms of R‘ that are identities 
outside some compact sets with the C” to- 
pology. Let Diffz ;(R?) denote Diff (R?) with 
the discrete topology and B Diffz (R*) denote 
the homotopy fiber of the natural mapping 

B Diffz ;(R?)5 B Diff; (R^). Then there exists a 
continuous mapping B Diff; (R> QBI? 
that induces an isomorphism in the ho- 
mology group with integer coefficient for 
O<r<oo and qz 1l, where Q* denotes the qth 
"loop space functor (J. N. Mather [12], Thur- 
ston [10]). Further, it has been proved that 
Diffz ;,(R?) is a tsimple group (if r#q+1) and 
that BI” is t(q + 1)-connected for r#q+1 
(Mather and Thurston, Haefliger [5]). The 
group Homeo, RI is tacyclic (Mather), and 
hence BI ? is contractible. 


F. Existence and Classification of Foliations 


Not every plane field on a manifold is isomor- 
phic to a completely integrable field (R. Bott). 
Thus, in general, the existence of a plane field 
does not guarantee the existence of a foliation 
of a manifold. Haefliger and Thurston solved 
the existence and classification problems in 
foliation in terms of Haefliger structures as 
follows. Two codimension q foliations Ay and 
F, of a C?-manifold M are said to be concor- 
dant if there is a codimension q foliation ¥ on 
M x [0,1], that is transverse to M x {t} (t= 

0, 1) and induces there the given foliation F, 

(t —0, 1). They are said to be integrably homo- 
topic if one further requires that the foliation 
F be transverse to M x {t} for all te[0,1] in 
the definition above. Similarly, two subbundles 
čo» 6; of T(M) are said to be concordant if 
there is a subbundle č of T(M x [0, 1]) such 
that ¢| m x1 — 6, for t —0, 1, and they are said 
to be homotopic if one further assumes that 
|m xin is a subbundle of T(M x (tj) for all 

t€ [0,1]. The following theorem is of funda- 
mental importance. 

Theorem: Let M be an open (resp. closed) 
C*-manifold. Then for each r=0, 1,..., 00, the 
integrable homotopy classes (resp. concor- 
dance classes) of codimension q, C’-foliations 
of M are in a natural one-to-one correspon- 
dence with homotopy classes of I7-structures 
H on M together with homotopy classes (resp. 
concordance classes) of subbundles of T(M) 
isomorphic to v(.#). (M. L. Gromov, A. 
Phillips, Haefliger [5], Thurston [9]). 

The following are consequences of the 
theorem: (1) A closed manifold M admits a 
codimension 1, C?-foliation if and only if the 
Euler number of M vanishes. (ii) If a manifold 
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admits a *q-frame field, then the associated q- 
plane field is homotopic to the normal bundle 
of a codimension q foliation of M. (iii) Every 
dimension q plane field on a C*-manifold is 
homotopic to the normal bundle of a C°- 
foliation with C” leaves. 


G. Characteristic Classes of Foliations 


Let BT be the classifying space of I7- 
structures. An element of the cohomology 
group H*(BI7; R) is called a (real) character- 
istic class of codimension q, C’-foliations. If 2 
is a codimension q, C’-foliation of M and f: 
M > BY, is the classifying mapping for F, 
then an element «(.Z) — f *«e H*(M; R, ae 
H*(BY%; R), is called the characteristic class 
of ¥ corresponding to «. The first nontrivial 
characteristic classes of foliations are known as 
the Godbillon-Vey classes (C. Godbillon and 
J. Vey [13]) and can be defined as follows. Let 
F bea transversely oriented, codimension q, 
C®-foliation of M. Then there exists a *q-form 
Q on M such that on a neighborhood U of 
each point of M, Q is written as o ^... A cL, 
where cf, ..., c are linearly independent 1- 
forms that vanish on leaves of F. By the inte 
grability condition, there is a 1-form 5 such 
that d) — 5 AQ. Then the Godbillon-Vey class 
T> of F is the de Rham cohomology class in 
H?**! (M; R) represented by the closed 2q + 1 
form 4 A (dn. 

The following construction provides a wide 
class of characteristic classes of foliations (Bott 
and Haefliger [14], I. Bernstein and B. Rosen- 
feld [15]). Let J, be the set of tk-jets at 0 of 
local C?-diffeomorphisms of Ri keeping 0 
fixed. The set {J,}2o forms an "inverse system 
of tLie groups with respect to the natural 
homomorphism p,: J,,, —J,, and each J, 

(k> 1) contains O(q) as a tmaximal compact 
subgroup. Let P, be the differentiable fiber 
bundle of k-jets at 0 of local diffeomorphisms 
of R? whose domains contain 0. It is a *prin- 
cipal J,-bundle over Ri Denote by A(P,,) the 
‘direct limit of the de Rham complexes of 
IRIZ a, and let A be the subcomplex of A(P,,) 
consisting of invariant forms with respect to 
the natural action of £7. A is canonically 
isomorphic to the tcochain complex A(A,) of 
continuous alternating forms on A,, where A, 
is the topological Lie algebra of fformal vector 
fields on R? (— 105 Differentiable Manifolds 
AA). 

Now let F be a codimension q, C®- 
foliation of M. Let P,(F) denote the differ- 
entiable fiber bundle over M whose fiber 
over xe M is the space of k-jets at x of the C”- 
submersion f: U  R* from an open neighbor- 
hood U of x to R*, satisfying (i) f(x) — 0, (i1) 
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Kerdf —:(7 )| y. This is a principal J,-bundle 
over M, and its restriction to U is isomorphic 
to the pullback by f of the bundle P,. Hence 
there are homomorphisms from the set of in- 
variant forms on P, to A(P,(¥)) that induce 
a homomorphism A = A(A,)>lim A(P,(F)). 
This homomorphism is compatible with the 
action of O(q) and hence induces a homomor- 
phism of O(q)-basic subcomplexes. Thus one 
obtains a homomorphism 97: H*(A,;O0(q))> 
lim H*(A(P,(F )); O(q)) = H*(M;R). In fact, 
P; depends only on the 2-jet of the foliation 
4 , and one can think of it as a homomor- 
phism o: H*(4,; O(q)) > H*(BT7; R) (r2 2). 
The elements in Im ọ are called the smooth 
characteristic classes of foliations. 

Let WO, be the differential graded algebra: 


WO, — E(u, Uus, DEE s U2iq+1)/2]-1) G9 R[c,, t. TAF 


where du;=c;, dc; =0, deg(u;)=2i— 1, deg(c;) = 
2i, and E denotes the texterior algebra over 

R generated by the u;s, and R denotes the 
*polynomial algebra of the c;'s truncated by 
the tideal generated by elements of degree 

> 2q. There exists a homomorphism of dif- 
ferential graded algebras WO, A(A,; O(q)) 
which induces an isomorphism in cohomology 
(I. M. GePfand and D. B. Fuks). For a codi- 
mension q foliation Z of M, the cohomology 
class determined by c,; corresponds to the ith 
Pontryagin class of the normal bundle v( F) 
of F, and the cohomology class u, c} corre- 
sponds to the Godbillon-Vey class of F. In 
particular, the subring of the cohomology 

ring H*(M; R) generated by the tPontryagin 
classes of v( F) is trivial for degree — 2q (Bott's 
vanishing theorem). 

Let N**! be a closed (q + 1)-dimensional 
Riemannian manifold of tconstant negative 
curvature. The total space T, N of the unit 
tangent sphere bundle of N admits a codi- 
mension q, C*-foliation associated with the 
*geodesic flow of T, N (tAnosov foliation). It 
has been shown that the Godbillon-Vey class 
of this foliation is nontrivial (R. Roussarie, 

F. Kamber and P. Tondeur, K. Yamato). It is 

known that many of the smooth characteristic 
classes are also nontrivial (Bott and Haefliger, 
Thurston, J. Heitsch). 

A smooth characteristic class xe H*(BI 7; R) 
is called rigid if for any smooth one-parameter 
family {.¥,} of codimension q foliations on 
a C”-manifold M, d(a(.2,))/dt =0 holds. 

The elements in the image of the natural 
homomorphism 


H*(WO,,,)> H*(WO,)> H*(BI*; R) 


are rigid (Heitsch). On the other hand, the 
Godbillon-Vey class is not rigid. In fact, Thur- 
ston constructed a one-parameter family 

of codimension q foliations of a certain ` 
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(2q + 1)-dimensional manifold for which the 
Godbillon-Vey class varies continuously. The 
characteristic classes of a simple foliation are 
often trivial. For example, the Godbillon-Vey 
class of a codimension 1 foliation of a closed 
manifold vanishes if it is almost without holo- 
nomy (i.e., no noncompact leaves have non- 
trivial holonomy) (M. Herman [16]; T. Mizu- 
tani, S. Morita, and T. Tsuboi [17]). 


H. Further Topics 


(1) Transverse structures. Let B be a q- 
dimensional manifold and Z a geometric 
structure on it, and let 4,, denote the tpseudo- 
group generated by the local diffeomorphisms 
that preserve the structure S. Replacing 9 

by Y, in the definition of a C’-Haefliger struc- 
ture, one obtains definitions of a I ;-structure 
and a I ;-foliation. A T ;-foliation is called a 
Riemannian foliation, a transversely real ana- 
lytic foliation, or a transversely holomorphic 
foliation if Y is a Riemannian, real analytic, or 
complex structure on R? (C?”), respectively. 
The theories for many such foliations are 
analogous to those for C’-foliations. For 
example, many results are known about the 
characteristic classes of Riemannian or holo- 
morphic foliations. Haefliger showed that there 
is no codimension 1 real analytic foliation on a 
simply connected closed manifold and that the 
classifying space BI ? for codimension 1 trans- 
versely oriented transversely real analytic 
foliations has the homotopy type of a *K (n, 1)- 
space for some uncountable tperfect group z 
[5]. 

(2) Foliated cobordism. Two closed oriented 

n-dimensional C”-manifolds My and M, with 
codimension q, C’-foliations are said to be 
foliated cobordant if there exist a compact 
oriented (n+ 1)-dimensional C*-manifold W 
with boundary ôW = M, U(— Mo) and a codi- 
mension q, C"-foliation of W which is trans- 
verse to CH and induces the given foliations of 
Mpo and M,. The resulting foliated cobordism 
classes {F } form a group F OQ; , with respect 
to the disjoint union. It is known that Z7 , 
= (0j and that the Rech foliation of S? is 
cobordant to zero. The characteristic classes 
of foliations provide invariants of foliated 
cobordisms. In particular the Godbillon-Vey 
number I [M] is an invariant of 705,,, , 
(r 2 2, q 2 1), and a result of Thurston men- 
tioned in Section G implies that the homo- 
morphism ZU. ,R defined by {F }—> 
Uz[M] is surjective. 

(3) Growth of leaves, transverse invariant 
measure. Let Z be a C”-foliation of a com- 
pact manifold. Fix a 'Riemannian metric 
on M. Then each leaf L of 7 has the in- 
duced metric, and one has a function f, (r)= 
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vol(D(x, r)), where D(x, r) is the set of points 
y€ L whose distance along L from a fixed 
point xe L is not greater than r. The growth 
type of the function f; is determined only by L. 
Many papers have been published that deal 
with the relation between the behavior of 
leaves and their growth types in codimension 1 
foliations (J. Cantwell and L. Conlon, G. Hec- 
tor, T. Nishimori, N. Tsuchiya). On the other 
hand, as a generalization of the notion of 
asymptotic cycles in dynamical systems, the 
notion of foliation cycles, or equivalently a 
transverse invariant measure, has been defined 
(J. Plante [18], D. Ruelle, D. Sullivan [19]). 
The existence of a transverse invariant mea- 
sure for a foliation is closely connected to the 
growth types of the leaves. The example of 
Denjoy in Section D leads to the study of 
*minimal sets of foliations and the structures 
of foliations (Hector, Cantwell and Conlon). 
The structure of a codimension 1 foliations 
which are almost without holonomy has been 
fairly well investigated (Sacksteder, Hector, 
H Imanishi, R. Moussou, Roussaire). 

(4) Compact foliation. A foliation whose 
leaves are all compact is called a compact 
foliation. D. Epstein proved that if F isa 
codimension 2, C?- compact foliation of a 
closed 3-manifold, then the leaves of are the 
fibers of a 'Seifert fibration of M. In higher 
dimensions, the situation is more complicated 
(Sullivan, R. Edwards and K. Millett). 

(5) Foliated bundles. There are many results 
on foliated bundles. In particular J. W. Milnor 
[20] and J. Wood obtained a condition for a 
circle bundle Z over a closed surface X to have 
a foliation transverse to fibers. More precisely, 
if £ and X are orientable, then £ admits such a 
foliation if and only if |X(£)| € —min{0, x(2)], 
where X denotes the Euler number and y the 
Euler-Poincaré characteristic. Kamber and 
Tondeur made an extensive study of charac- 
teristic classes of foliated bundles [21]. 

(6) Transverse foliations. Two foliations F 
and 4 of M are said to be transverse to each 
other if any two leaves K and L of F and 4 
are transverse to each other. A foliated bundle 
has such foliations. D. Hardorp proved that 
on every orientable closed 3-manifold, there 
exists a triple of codimension 1 foliations 
that are pairwise transverse. Tamura and A. 
Sato classified the codimension | foliations 
that are transverse to the Reeb component 
of D? x S$}. 
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A. Introduction 


Geometry deals with figures. It depends, there- 
fore, on our spatial intuition, but our intuition 
lacks objectivity. The Greeks originated the 
idea of developing geometry logically, based 
on explicitly formulated axioms, without re- 
sorting to intuition. From this intention re- 
sulted Euclid's Elements, which was long con- 
sidered the perfect model of a logical system. 
As time passed, however, mathematicians came 
to notice its imperfections. Since the 19th cen- 
tury especially, with the awakening of a more 
rigorous critical spirit in science and philoso- 
phy, more systematic criticism of the Elements 
began to appear. Non-Euclidean geometry was 
formulated after reexamination of Euclid's 
axiom of parallels; but it was also discovered 
that even as a foundation of Euclidean geom- 
etry, Euclid's system of axioms was far from 
perfect. Various systems of axioms for Eucli- 
dean geometry were proposed by mathemati- 
cians in the latter half of the 19th century, 
among them one by D. Hilbert [1], which 
became the basis of far-reaching studies. 


B. Hilbert's System of Axioms 


Hilbert took as undefined elements points 
(denoted by A, B, C, ...), straight lines (or sim- 
ply lines, denoted by a, b, c, ...), and planes 
(denoted by x, f, y, ...). Between these ob- 
jects there exist incidence relations (expressed 
in phrases such as “A lies on a,” “a passes 
through A,” etc.); order relations (“B is be- 
tween A and C^); congruence relations; and 
parallel relations. The relations are subject to 
the following five groups of axioms: 

(I) Incidence axioms: (1) For two points A, 
B, there exists a line a through A and B. (2) 
If A z B, the line a through A, B is uniquely 
determined. We write a= AU B and call a the 
join of A, B. (3) Every line contains at least 
two different points. There exist at least three 
points that do not lie on a line. (4) If A, B, C 
are points not on a line, there exists a plane « 
through A, B, C. (We also say that A, B, C lie 
on a.) For every plane o, there exists at least 
one point A on «. (5) If A, B, C are points 
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not on a line, the plane « through A, B, C is 
uniquely determined. We write x= AUBUC 
and call « the join of A, B, C. (6) If A, B are 
two different points on a line a and if A, B lie 
on a plane a, then every point on a lies on a. 
(We say that a lies on « or « passes through 

a.) (7) If a point A lies on two planes «, f, 
there exists at least one other point B on « and 
f. (8) There exist at least four points not lying 
on a plane. 

(II) Ordering axioms: (1) If B is between A 
and C, then A, B, C are three different points 
lying on a line; also, B is between C and A. (2) 
If A, C are two different points, then there 
exists a point B such that C is between A and 
B. (3) If B is between A and C, then A is not 
between B and C. 

We define a segment as a set of two different 
points A, B, denoted as AB or BA, and we call 
A and B ends of this segment. The set of points 
between A, B is called the interior of AB, and 
the set of points of AU B that are neither ends 
nor interior points of AB is called the exterior 
of AB. 

(4) Let A, B, C be three points not lying on a 
line. If a line a on the plane AU BUC does not 
pass through A, B, or C, but passes through a 
point of the interior of AB, then it also passes 
through a point of the interior of BC or CA 
(Pasch's axiom). 

The following propositions are proved 
from the above axioms. Given n points A,, 

A5, ..., A, on a line (n 2), we can rearrange 
them, if necessary, so that the point A; is be- 
tween A; and A, whenever we have | <i<j< 

k <n. There are exactly two ways of arrang- 
ing the points in this manner (theorem of linear 
ordering). Let O be a point on a line a, and let 
A, B be two points on the line different from O. 
Write A ~B when A=B or O is not between A 
and B; write A ~ B otherwise. Then ~ is an 
equivalence relation between points on the line 
different from O; from A ~ B, A~C it follows 
that B — C. We say that A, B are on the same 
side or on different sides of O on a depending 
on whether A~ B or A ~ B. Two subsets a’ 

and a" of a defined by a2 (A'|A— Ah, a^ = 
1A" | A~ A”} are called half-lines or rays on 

a with O as the extremity (or starting from O). 
Denoting by a, for simplicity, the set of points 
on a, we have a=a’U{O} Ua" (disjoint union). 

Using axiom II.4, we can also prove the 
following: Let a be a line on x, and let A, B be 
two points on x not lying on a. If A— B or if 
the interior of the segment AB has no point in 
common with a, we say that A, B are on the 
same side of a on a, and write A ~ B. Other- 
wise, we say that A, B are on different sides of 
a on « and write A^ B. Then ^ is an equiva- 
lence relation between points on « not lying on 
a, and from A^ B, A ~C follows B~ C. The 
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subsets a = (A'| A~ A'3, a" ={A"| A~ A"] of 

x are called half-planes on x bounded by a. 
Again, denoting by « and a the set of points on 
x and the set on a, respectively, we obtain «= 
x UaUa" (disjoint union). 

(IIT) Congruence axioms: Two segments AB, 
A'B' can be in a relation of congruence, ex- 
pressed symbolically as AB = A'B'. (Segments 
AB and A'B' are then said to be congruent. 
Since the segment AB is defined as the set 
(A, B}, the four relations AB = A'B', BA = 
AB. ABz B'A', BAz B'A' are equivalent.) 
This relation is subject to the following three 
axioms: (1) Let A, B be two different points on 
a line a, and A, a point on a line a, (a, may or 
may not be equal to a). Let a, be a ray on a, 
starting from A,. Then there exists a unique 
point B, on a, such that AB = A, B,. (2) From 
A, B, = AB and A,B, = AB follows A, B, = 
A; B,. (Hence it follows that = is an equiv- 
alence relation between segments.) (3) Let 
A, B, C be three points such that B is between 
A and C, and let A,, B,, C, be three points 
such that B, is between A, and C,. Then from 
AB= A, B,, BCz B,C, follows AC= A,C,. 

Now let h, k be two different lines in a plane 
x and through a point O, and let h’, k' be the 
rays on h, k starting from O. The set of two 
such rays h’, K' is called an angle in «, denoted 
by CO, K) or 7 (K',; h’). This angle is also de- 
noted by / AOB, where A, B are points of h’, 
k', respectively. The rays h’, K are called the 
sides and the point O is called the vertex of 
this angle. Then h’ is a subset of a half-plane 
on a bounded by k, and K' is a subset of a half- 
plane on « bounded by h. The intersection of 
these two half-planes is called the interior of 
this angle, and the subset of a — O consisting of 
points belonging to neither the inside nor the 
sides of the angle is called the exterior of the 
angle. Between two angles / (A, EL (hi, ki) 
there may exist the relation of congruence, 
again expressed by the symbol =, as in the 
case of segments, and subject to the following 
two axioms: (4) Let / (h’,k’) be an angle on a 
plane « and h, be a line on a, (x, may or may 
not be equal to x). Let O, be a point on h,, bh 
a ray on o, starting from O,, and a a half- 
plane on x, bounded by bh, Then there exists a 
unique ray K| starting from O, and lying in 
x, such that / (W, k')z / (h,, EL Moreover, 
L(K, k)= L(h,k’) always holds. (Hence it 
follows that = is an equivalence relation be- 
tween angles.) (5) Let both A, B, C and A,, B,, 
C, be triples of points not lying on a line. Then 
from AB=A,B,, AC=A,C,, and / BAC= 
L B, A, C,, it follows that / ABC= / A,B,C,. 

(IV) Axiom of parallels: Suppose that a, b 
are two different lines. Then it follows from 
axiom I.2 that if a and b share a point P, such 


` a point is the unique point lying on both a and 
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b. In this case we say that a, b intersect at P 
and write aM b= P. On the other hand, if a and 
b have no point in common and if a, b are on 
the same plane, we say that a, b are parallel 
and write a // b. If A, a are on a plane « and A 
is not on a, we can prove (utilizing axioms I, 
II, and IH) that there exists a line b passing 
through A in « such that a //b. The axiom of 
parallels postulates the uniqueness of such a b. 
(V) Axioms of continuity: (1) Let AB, CD be 
two segments. Then there exist a finite number 
of points A,, 45, ..., 4, on AUB such that 
CD = AA, =A,A,=...=A,_,A, and B is be- 
tween A and A, (Archimedes' axiom). (2) The 
set of points on a line a (again denoted for 
simplicity by a) is “maximal” in the following 
sense: It should satisfy axioms IT.1—II.3, HI. 1, 
V.1, and the theorem of linear ordering. If 
à is a set of points satisfying these axioms 
such that à a, then à should be =a (axiom 
of linear completeness). Hence follows the 
theorem of completeness: the set of points, 
lines, and planes is maximal in the sense that it 
is not possible to add further points, lines, or 
planes to this set with the resulting set still 
satisfying axioms I-IV and V.1. 





C. Consistency 


In formulating the above axioms and proving 
their consistency, Hilbert assumed the consis- 
tency of the theory of real numbers (— 156 
Foundations of Mathematics). To prove con- 
sistency, Hilbert constructed a model for the 
above axioms using the method of analytic 
geometry. He defined points as triples of real 
numbers (x4, x», X3), lines and planes as sets of 
points satisfying suitable systems of linear 
equations, and relations of ordering, con- 
gruence, and parallelism in the usual way. It is 
easy to verify that such a system satisfies all 
the axioms I-V. Thus the consistency of these 
axioms is reduced to the consistency of the 
theory of real numbers (— 35 Axiom Systems). 
A model for I-IV and V.1 can be obtained 
in the countable field R, of all real talgebraic 
numbers instead of R. Then Rg can be further 
restricted to its subfield P, defined as follows: 
Let F be an arbitrary field. An textension of F 


of the form F(,/14- 4?) with Ae F is called a 
Pythagorean extension of F, and F is said to be 
a Pythagorean field if any Pythagorean exten- 
sion of F coincides with F (e.g., Ry and R are 
Pythagorean). It is easily verified that I-IV are 
satisfied in the “analytic geometry over any 
Pythagorean field." On the other hand, we can 
construct a minimal Pythagorean field con- 
taining a given field (the Pythagorean closure 
of the field) in the same way as we construct 
the *algebraic closure of a field. The field P, is 
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defined as the Pythagorean closure of the field 
Q of rational numbers. 


D. Independence of Axioms 


In Hilbert's system, the axioms I and II are 
used to formulate further axioms. On the other 
hand, it can be shown that each of the groups 
III, IV, and V is independent from other 
axioms. 

The independence of IV is shown by the 
consistency of non-Euclidean geometry (— 285 
Non-Euclidean Geometry). The following 
model shows the independence of IIL.5: In the 
analytic model for I- V, we replace the defi- 
nition of distance between two points 


(X1, X2. X3), Din, Ha, Y3) by 
((x, =y +X — pa t (x4 — y3) + (x3 — y3 Wei 


Then II.5 does not hold, while all other 
axioms remain satisfied. The independence of 
V.2 is shown by the geometry over Ra or P}. 
The independence of V.1 follows from the 
existence of the non-Archimedean Pythag- 
orean field: the Pythagorean closure of any 
tnon-Archimedean field (e.g., the field of ra- 
tional functions of one variable over Q with a 
*non-Archimedean valuation) is such a field. A 
geometry in which V.1 does not hold is called 
a non-Archimedean geometry. 


E. Completeness of the System of Axioms and 
Relations between Axioms 


The tcompleteness of the system of axioms I- 
V can be shown by introducing coordinates in 
the geometry with these axioms and represent- 
ing it as "Euclidean geometry of three dimen- 
sions. Axiom group V is essential for the intro- 
duction of coordinates over R. Moreover, we 
have the following results: 

(i) The geometry with the axioms I-IV can 
be represented as "Euclidean geometry" of 
three dimensions over a Pythagorean field, 
and the geometry with axioms L.1-1.3, I-IV 
can be represented as "Euclidean geometry" of 
two dimensions over a Pythagorean field. 

(ii) The geometry with II, and a stronger 
axiom of parallels IV* (given a line a and a 
point A outside a, there exists one and only 
one line a' passing through A that is parallel to 
a) can be represented as an taffine geometry 
over a field K that is not necessarily 
commutative. 

(iii) The field K is commutative if and only if 
the following holds: Suppose that in Fig. 1 
A'UB// AUB', BUC// BUC’. Then it follows 
that A'UC // AUC' (Pascal’s theorem). 
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Fig. 1 


(iv) The "two-dimensional geometry" with 
1.1-1.3, HI, and IV* can be embedded in the 
"three-dimensional geometry" with axioms I, 
II, and IV* if and only if the following holds: 
Suppose that in Fig. 2 we have AU B// A'U B', 
BUC/B'UC'. Then it follows that CU AA 
C'U A' (Desargues's theorem). 





Fig. 2 


(v) From 1.1-1.3, IT, IV*, and Pascale theo- 
rem follows Desargues's theorem. 

(vi) Desargues's theorem is independent of 
T.1-1.3, IT, IT.1—IILA, IV*, and V; that is, we 
can construct a non-Desarguesian geometry (a 
geometry in which Desargues's theorem does 
not hold) in which these axioms are satisfied. 

Axioms I, II, and IV*, as well as the theo- 
rems of Pascal and Desargues, are propositions 
in affine geometry. Each has a corresponding 
proposition in *projective geometry, and the 
results concerning them can be transferred to 
the case of projective geometry (— 343 Projec- 
tive Geometry). 


F. Polygons and Their Areas 


Suppose that we are given a finite number of 
points A; (/—0, 1, ..., r) in the geometry with 
axioms I and II. Then the set of segments (or, 
more precisely, the union of segments together 
with their interiors) A;A;,, (120,1, ...,r— 1) ts 
called a broken line joining A, with A. In 
particular, if A; = A,, then this set is called a 
polygon with vertices 4; and sides 4,A4,,,. A 
polygon with r vertices is called an r-gon. (For 
r — 3,4,5,6, r-gons are called triangles, quad- 
rangles, pentagons, and hexagons, respectively.) 
A plane polygon is a polygon whose vertices all 
lie on a plane. A polygon is called simple if any 
three consecutive vertices do not lie on a line, 
and two sides A, Aen and A,A;,, (i#j) meet 
only when j=i+1 or i-j4- 1. In this article, 
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we consider only simple plane polygons, and 
refer to them simply as polygons. 

tJordan's theorem implies that a polygon in 
the sense just defined divides the plane into 
two parts, its interior and its exterior. This 
special case of Jordan's theorem can be proved 
by L1-L.3 and II only. A polygon P is divided 
into two polygons P,, P, by a broken line 
joining two points on sides of P and lying in 
the interior of P (Fig. 3). In this case, we say 
that P is decomposed into P,, P; and write P 
= P, - P. We may again decompose P,, P, 
and thereby arrive at a decomposition of the 
form P=P,+...+ P,. Axiom ITI is used to 
introduce the congruence relation = between 
polygons. Two polygons P, Q are called de- 
composition-equal if there exist decomposi- 
tions P=P,+...+P,,Q=Q,+...+Q, such 
that P, =Q,,...,P,=Q,. This is expressed by 
PzQ. We call P, Q supplementation-equal if 
there exist two polygons P’, Q' such that (P+ 
P')z(Q + Q^), P'zQ'. This will be expressed by 
PeQ. If we assume IV, we can use result (1) of 
Section E. Let K be the ground field of the 
geometry (K is Pythagorean, hence *ordered). 
The area of polygon P is defined as the posi- 
tive element m(P) of K assigned to P such that 
m(P -- Q) - m(P) 4- m(Q), and m(P) 2 m(P") if 
P = P'. From PzQ or PeQ, it follows that m(P) 
— m(Q). Under these axioms, it is proved that 
m(P)=m(Q) implies PeQ. If we also assume 
V.1, then m(P) 2 m(Q) implies PzQ. Thus 
the theory of area of polygons can be con- 
structed without assuming axiom V.2, though 
this result cannot be generalized to higher- 
dimensional cases. For the case of three di- 
mensions, we can construct two solids of the 
same volume that are not supplementation- 
equal [2, 7]. 


Fig. 3 


G. Geometric Construction by Ruler and a 
Transferrer of Constant Lengths 


The geometry with I-IV can be represented 
as 3-dimensional Euclidean geometry over 

a Pythagorean field. Conversely, all these 
axioms are valid in 3-dimensional Euclidean 
geometry over any Pythagorean field. Thus 
the minimal system of "quantities" whose 
existence is assured in geometry with these 
axioms is the field P), the Pythagorean closure 
of Q. Hilbert noticed that the existence of a 
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geometric object under axioms I-IV can be 
expressed as its constructibility by ruler (i.e., an 
instrument to draw a straight line joining two 
points) and a transferrer of constant lengths. 
The latter, for a constant length x, is an instru- 
ment that permits finding the point X on the 
given ray AB such that AX — x. It is not pos- 
sible to construct by ruler and transferrer all 
the points that can be constructed by means 
of ruler and compass (— 179 Geometric Con- 
struction). However, it is possible to construct 
all the lengths Ax, where 4 is any element of P}. 
Hilbert conjectured that an element of P, can 
be characterized as a ‘totally positive algebraic 
number of degree 2", ve N. This conjecture was 
proved by Artin (3]. 


H. Related Topics 


While Hilbert's foundations are concerned 
with 3-dimensional Euclidean geometry, it is 
easy to generalize these results to the case of 
n-dimensional Euclidean geometry (— 139 
Euclidean Geometry). Also, for affine and pro- 
jective geometries, there are well-organized 
systems of axioms (at least for the case of di- 
mensions > 3). Hilbert [1, Appendix IIT] 
showed that plane thyperbolic geometry can 
be constructed on a modified system of 
axioms, but for other non-Euclidean geome- 
tries (in particular, ‘elliptic geometries) there 
are no known systems of axioms as good as 
Hilbert's for the Euclidean case. On the other 
hand, Hilbert (1, Appendix IV] gives another 
method of constructing Euclidean geometry 
in characterizing the group of motions as the 
topological group with certain properties. 

G. Thomsen [4] rewrote Hilbert's system of 
axioms in group-theoretical language utiliz- 
ing the fact that the group of motions is gen- 
erated by symmetries with respect to points, 
lines, and planes. Finally, Hilbert's study of 
the foundations of geometry led him to re- 
search in the foundations of mathematics. 
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A. General Remarks 


The notion of tset, introduced toward the end 
of the 19th century, has proved to be one of 
the most fundamental and useful ideas in 
mathematics. Nonetheless, it has given rise to 
well-known *pargdoxes. Based on this notion, 
R. Dedekind developed the theories of natural 
numbers [2] and real numbers [3], defining 
the latter as "cuts" of the set of rational num- 
bers. Thus set theory served as a unifying 
principle of mathematics. 

It has been noted, however, that some of the 
most commonly utilized arguments in set 
theory, which are at the same time the most 
useful in mathematics and belong almost to 
the basic framework of formal logic itself, 
resemble very much those which give rise to 
paradoxes. This fact has caused many critical 
mathematicians to question the very nature of 
mathematical reasoning. Thus a new field, 
foundations of mathematics, came into being at 
the beginning of this century. This field was 
divided at its inception into different doctrines 
according to the views of its initiators: logicism 
by B. Russell, intuitionism by L. E. J. Brouwer, 
and formalism by D. Hilbert. In set theory, 
which was the origin of this controversy, it was 
pointed out that the "definition" of set as given 
by G. Cantor was too naive, and axiomatic 
treatments of this theory were proposed (— 33 
Axiomatic Set Theory). 


B. Logicism 


Russell asserted that mathematics is a branch 
of logic and that paradoxes come from ne- 
glecting the “types” of concepts. According to 
his opinion, mathematics deals formally with 
structures independently of their concrete 
meanings. Science of this character has been 
called logic from antiquity. According to him, 
logic is the youth of mathematics, and math- 
ematics is the manhood of logic. To construct 
mathematics from this standpoint, asserted 
Russell, ordinary language is lengthy and 
inaccurate, and some proper system of sym- 
bols should be used instead. Thus he tried to 
reconstruct mathematics using tsymbolic logic. 
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Attempts to reorganize mathematics using 
logical symbols had formerly been made by G. 
Leibniz, who wrote Dissertatio de arte com- 
binatoria in 1666, as well as by A. de Morgan, 
G. Boole, C. S. Peirce, E. Schróder, G. Frege, 
G. Peano, and others. Symbols used by the 
last two authors resemble those of today. 
Russell studied these works and published his 
own theory in a monumental joint work with 
A. N. Whitehead: Principia mathematica (3 
vols., 1st ed. 1910—1913, 2nd ed. 1925-1927), 
in which the theories of natural numbers and 
real numbers as well as analytic geometry are 
developed from the fundamental laws of logic. 

If this work had been completely successful, 
it could have eliminated any possibility of the 
intrusion of paradoxes into mathematics. 
However, the authors were forced to postulate 
an "unsatisfactory" axiom in order to con- 
struct mathematics. They introduced the notion 
of type as follows: An object M defined as the 
set of all objects of a certain type belongs to a 
higher type than the types of the elements of 
M. This serves to eliminate certain paradoxes 
but brings about inconveniences such as the 
following. Suppose that we are trying to con- 
struct the theory of real numbers from that of 
rational numbers. Each real number can then 
be considered a tpredicate about rational 
numbers. If this predicate contains only tquan- 
tifiers relating to variables running over all 
rational numbers, then the corresponding real 
number is said to be predicative, otherwise 
impredicative. According to Russell, the latter 
should have a higher type than the former, 
which makes the theory of real numbers 
exceedingly complicated. To avoid this dif- 
ficulty, Russell proposed the axiom of reduci- 
bility, which says that every predicate can be 
replaced by a predicative one. With this rather 
artificial axiom Russell himself expressed 
dissatisfaction. Russell also postulated the 
taxiom of infinity and the *axiom of choice, 
which are also problematic. After examining 
the philosophical background of the book, H. 
Weyl wrote about Principia mathematica, 
“Mathematics is no more based on logic than 
the utopia built by the logician." Nevertheless, 
logic as formulated in this book, as well as the 
theory of types as developed by F. P. Ramsay 
in the school of Russell and Whitehead, is still 
an important subject of mathematical logic. 


C. Intuitionism 


The intuitionist claims that mathematical 
objects or truths do not exist independently 
from mathematically thinking spirit or intui- 
tion, and that these objects or truths should be 
directly seized by mental or intuitional activ- 
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ity. The philosophical standpoints of mathe- 
maticians such as L. Kronecker and H. Poin- 
caré in the 19th century or E. Borel, H. 
Lebesgue, and N. N. Luzin at the turn of this 
century can be assimilated to intuitionism, but 
those of the latter three are often said to be- 
long to semi-intuitionism or to French empiri- 
cism. Brouwer took a narrower standpoint, 
strongly antagonistic to Hilbert's formalism. 
Today the word "intuitionism" is generally 
interpreted in Brouwer's sense. 

Brouwer sharply criticized the usual way of 
reasoning in mathematics and claimed that 
indiscriminate use of the law of excluded mid- 
dle (or tertium non datur) P v 7P cannot be 
permitted. According to him, the proposi- 
tion “Either there exists a natural number 
with a given property P, or else no such num- 
ber exists" is to be regarded as proved only 
when an actual construction of a natural num- 
ber with the property P is given or when the 
absurdity of the existence of such a natural 
number can be constructively proved. When 
neither of these two results can be shown, then 
one can say nothing about the truth of the 
above proposition. Thus the usual method of 
proof, known as the method of reductio ad 
absurdum, i.e., of proving a proposition P by 
proving its double negation 1 1P, is not 
generally considered valid. It is a difficult but 
important problem of mathematical logic to 
determine which parts of usual mathematics 
can be reconstructed intuitionistically, though 
it does not seem easy to reconstruct any part 
of mathematics elegantly from this standpoint. 


D. Formalism 


To eliminate paradoxes, Hilbert tried to apply 
his axiomatic method. From Hilbert's stand- 
point, any part of mathematics is a deductive 
system based on its axioms. In the deductive 
development, however, "logic," including set 
theory and elementary number theory, is used. 
Paradoxes appear already in such logic. Hil- 
bert's idea was to axiomatize such logic and 
to prove its consistency. Thus one must first 
formalize the most elementary part of mathe- 
matics, including logic proper. 

Hilbert proved the consistency of Euclidean 
geometry by assuming the consistency of the 
theory of real numbers. This is an example of a 
relative consistency proof, which reduces the 
consistency proof of one system to that of 
another. Such a proof can be meaningful only 
when the latter system can somehow be re- 
garded as based on sounder ground than the 
former. To carry out the consistency proof of 
logic proper and set theory, one must reduce 
it to that of another system with sounder 
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ground. For this purpose, Hilbert initiated 
metamathematics and the finitary standpoint. 

The finitary standpoint recognizes as its 
foundation only those facts that can be ex- 
pressed in a finite number of symbols and only 
those operations that can be actually executed 
in a finite number of steps. Essentially, it does 
not differ from the standpoint of intuitionism. 
The methods based on this standpoint are also 
called constructive methods. 

Metamathematics is also called proof theory. 
Its subject of research is mathematical proof 
itself. Hilbert was the first to insist on its im- 
portance. The theory is indispensable for con- 
sistency proofs of mathematical systems, but it 
may also be used for other purposes. In fact, 
the same idea can be seen in the tduality prin- 
ciple of projective geometry, which dates from 
long before Hilbert's proclamation of formal- 
ism. This is not a theorem of projective geome- 
try deduced from its axioms; rather, it is a 
proposition about the theorems in projective 
geometry, based on the type of axioms and 
proofs in this subject. 

According to Hilbert's method, one must 
develop proof theory from the finitary stand- 
point with the aim of proving the consistency 
of axiomatized mathematics. For this purpose, 
one must formalize the mathematical theory in 
question by means of symbolic logic. A theory 
thus formalized is called a formal system. 


E. Some Results of Formalist Theory 


One of the most remarkable results hitherto 
obtained with Hilbert's method is the con- 
sistency proof of pure number theory by G. 
Gentzen [7]. This consistency proof covers 
the largest domain for which an explicit 
consistency proof has so far been obtained. 
However, the methods of formalist proof 
theory have proved to be most effective in 
studying the logical structure of mathematical 
theories and have led to various results on the 
consistency of formalized mathematical sys- 
tems, on symbolic logic, and on axiomatic set 
theory. We give some examples. 


(1) Gédel’s Incompleteness Theorem. K. Gódel 
[6] showed that if a system obtained by for- 
malizing the theory of natural numbers is 
consistent, then this system contains a tclosed 
formula A such that neither A nor its negation 
71A can be proved within the system. He 
originally proved this under the assumption 
that the system is c-consistent. This is a stron- 
ger condition for the system than simple con- 
sistency, but J. B. Rosser [13] succeeded in 
replacing this by the latter. This result shows 
the incompleteness not only of the usual 
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theory of natural numbers but of any consist- 
ent theory (from the finitary standpoint) con- 
taining the theory of these numbers. 

At the same time, Gódel also obtained the 
following important result: Let S be any con- 
sistent formal system containing the theory of 
natural numbers. Then it is impossible to 
prove the consistency of A by utilizing only 
arguments that can be formalized in S. This 
means that a consistency proof from the fini- 
tary standpoint of a formal system S inevitably 
necessitates some argument that cannot be 
formalized in S. 


(2) Consistency Proofs for Pure Number 
Theory. Gentzen [7] called pure number theory 
the theory of natural numbers not depend- 
ing on the free use of set theory (differing con- 
sequently from the usual theory of natural 
numbers based on tPeano axioms; — 294 
Numbers) and proved its consistency. W. 
Ackermann [14] proved the consistency of a 
similar theory admitting the use of Hilbert's 
*e-symbol. G. Takeuti [15] showed that Gen- 
tzen's result can be obtained as a corollary 

to his theorem extending tGentzen’s funda- 
mental theorem on tpredicate logic of the first 
order to a ftheory of types of a certain kind. 

According to the result of Gódel mentioned 
in (1) above, some reasoning outside pure num- 
ber theory must be used to prove its consis- 
tency. In all consistency proofs of pure number 
theory mentioned above, ttransfinite induction 
up to the first te-number e, is used, but all the 
other reasoning used in these proofs can be 
presented in pure number theory. This shows 
that the legitimacy of transfinite induction up 
to ze cannot be proved in this latter theory. A 
direct proof of this fact was given by Gentzen 
[13]. On the other hand, the legitimacy of 
transfinite induction up to an ordinal num- 
ber < £ can be proved within pure number 
theory. 

Again, transfinite induction is not the only 
method by which to prove the consistency of 
pure number theory. Actually, Gódel [17] 
carried out the proof utilizing what he called 
computable functions of finite type on natural 
numbers and what we call primitive recursive 
functionals of finite type. 

By restricting pure number theory further, 
one obtains weaker theories of natural num- 
bers whose consistency can be proved with 
finitary methods without recourse to such 
methods as transfinite induction up to £y. M. 
Presburger [18] proved the consistency of a 
theory in which only the addition of numbers 
is considered an operation. Ackermann [19], 
J. von Neumann [20], J. Herbrand [21], and 
K. Ono [22] proved the consistency of theo- 
ries in which some restrictions are placed 
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on the use of the axiom of tmathematical 
induction. 

On the other hand, K. Schütte [23] gave a 
consistency proof for number theory including 
what he called "infinite induction" from a 
stronger standpoint than Hilbert's finitary one; 
he attempted to find a basis that makes such a 
proof possible. 


(3) The Consistency of Analysis. No definitive 
result has yet been obtained from the stand- 
point of formalism, though many attempts are 
being made, among which a recent one by C. 
Spector [24] should be mentioned. 


(4) Axiomatic Set Theory. There are different 
kinds of axiom systems (— 33 Axiomatic Set 
Theory). To give a consistency proof for any of 
these systems is considered a very difficult 
problem today, but many interesting results 
are known concerning the relative consistency 
or independence of these axioms. 


(5) The Skolem-Lówenheim Theorem. The 
. metamathematical Skolem-Lówenheim 
theorem states: Given a consistent system of 
axioms stated in the first-order predicate logic 
whose cardinality is at most countable, there 
always exists an *object domain consisting of 
countable objects satisfying all these axioms. 
For example, axiomatic set theory is stated 
in predicate logic of the first order, and the 
cardinality of its axioms is countable. Thus 
there exists an object domain consisting of 
countable objects satisfying all these axioms, 
provided that they are consistent. Such a do- 
main is called a countable model of axiomatic 
set theory. On the other hand, from the axioms 
of this theory one can prove that there exists 
a family of sets that is more than countable. 
This should also hold in a model of the theory, 
in which each object represents a set. This 
situation is known as the Skolem paradox. 
This does not imply, however, the inconsis- 
tency of axiomatic set theory. In fact, the term 
"countable" is to be interpreted in its math- 
ematical sense when one says "there exists a 
family of sets that is more than countable,” 
while it should be interpreted in its metamath- 
ematical sense when one speaks of a countable 
model of the theory. It is the confusion of these 
two different interpretations that leads to the 
"paradox." 


(6) Skolem's Theorem on the Impossibility of 
Characterizing the System of Natural Numbers 
by Axioms. T. Skolem [27] proved that it is 
not possible to characterize the system of 
natural numbers by a countable system of 
axioms stated in the predicate logic of the first 
order. More precisely, given any consistent 
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countable system of axioms satisfied by the 
system of natural numbers, there always exists 
another ‘linearly ordered system satisfying all 
these axioms and yet not isomorphic to the 
system of natural numbers as an ordered 
system. 

Gódel's incompleteness theorem and the 
Skolem paradox, as well as this result, seem 
to indicate that there is a certain limit to 
the effectiveness of the formalist method. On 
the other hand, nonstandard analysis has 
originated in this result. 
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A. Brief History 


It is obvious that four colors are necessary to 
color some geographical maps on a sphere (or 
plane), but are four colors sufficient to color 
every map? This is the so-called four-color 
problem. The precise formulation of this prob- 
lem will be given in Section B. The conjecture 
was made by Francis Guthrie and communi- 
cated through his brother Frederick Guthrie 
to A. De Morgan in 1852 [4]. A. Cayley called 
attention to the problem in 1878. It became 
famous after J. P. Heawood pointed out a 
mistake in the proof by A. B. Kempe (1879). 
Heawood also studied the problem of coloring 
maps on arbitrary surfaces (— Section E). 
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Though there have been various approaches 
to the four-color problem, the main stream of 
investigation has concentrated on obtaining an 
unavoidable set consisting of reducible ar- 
rangements (— Section D) in order to correct 
the mistake made by Kempe. G. D. Birkhoff 
[7] first discovered the simplest nontrivial 
reducible arrangement, nowadays called 
Birkhoff's diamond (Fig. 1). Using all reduc- 
ible arrangements known up to that time, P. 
Franklin showed that every map with up to 25 
countries must contain a reducible arrange- 
ment, so that such a map is four-colorable [8]. 
This limit has been gradually increased; in 
1975 a reducible arrangement for 25 countries 
was found. 


Fig. 1 
Birkhoff's diamond. 


H. Heesch invented the method of discharg- 
ing (— Section D), found criteria for reduci- 
bility, and finally conjectured the existence of 
an unavoidable set of reducible arrangements 
with several thousand elements, but this was 
too large to construct by hand. W. Haken and 
K. Appel with J. Koch worked with high- 
speed computers for a total of 1,200 CPU 
hours over a period of four and a half years 
and finally succeeded in constructing and 
checking an avoidable set of reducible ar- 
rangements with 1,834 elements, which proved 
the four-color problem affirmatively (1976; 

[9, 10]). For early investigations of the four- 
color problem — [5]; for results up to the 
1930s — [3]. 


B. Precise Formulation of the Problem 


To formulate the problem precisely, we must 
state the following two conditions: (i) Every 
country on a map is a tconnected domain; a 
connected part of the sea is considered to be a 
country. (it) Two countries sharing boundary 
lines must be colored differently. On the other 
hand, if two countries share only a finite num- 
ber of points, then they may share the same 
color. 

Actually we can modify the map so that no 
more than three countries meet at the same 
point. A map with this condition is called a 
trivalent or cubic map. In the study of the four- 
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color problem, we can restrict ourselves to 
trivalent maps without two-sided countries. In 
the remainder of this article, we shall assume 
that the maps considered have these 
properties. 

In recent investigations, it has become 
customary to convert maps into their fdual 
graphs, where each country is replaced by its 
capital lying inside, and for each adjacent pair 
A, B (Fig. 2), the boundary is replaced by a 
line connecting the representative points of A 
and B in such a way that it meets the bound- 
ary only once. 





Fig. 2 
Dual graph. 


The formal extension of the four-color prob- 
lem to higher dimensions is trivial, since we 
can easily construct arrangements needing 
arbitrary many colors for coloring. By con- 
verting to a dual graph, one can see that this is 
not surprising, since the planarity of a graph 
is a strong restriction, while the condition 
of representability of a linear graph in 3- 
dimensional space imposes no restriction. 


C. Tait’s Algorithm 


Suppose that a planar trivalent map M is four- 
colored. Denote the four colors by 0, 1, 2, and 
3. In Fig. 3, we use A, B, C, and D instead of 0, 
1, 2, and 3, respectively). Here we take the 
operation a Q bc. Represent the integers 

a, b 20, 1, 2, or 3 in the binary (2-adic) number 
system as 2-digit numbers a, a, and b, bo, 
where a; and b; are 0 or 1. For each digit we 
take the operation a; b; — c; to be binary 





Fig. 3 

An example of Tait's algorithm. At the vertices, e 
stands for + and © for —. Within the domains, A, 
B, C, and D stand for four colors corresponding to 
0, 1, 2, and 3, respectively. 
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addition without carrying, i.e., the operation is 
that of the logical (exclusive) *or." The number 
€ — cC in the binary system represents the 
result of the operation. To each boundary line, 
we give the number corresponding to a Q b, 
where a and b are the colors for the two coun- 
tries meeting at the boundary. Then we have 
the numbering 1, 2, and 3 for each edge, where 
1, 2, and 3 are labeled once and only once for 
each triple of edges emanating from each 
vertex. Such numbering for edges is called Tait 
coloring for edges. Then we give the signature 
+ or — to each vertex, according as the order 
of the edge numbering is counterclockwise or 
clockwise. Then the algebraic sum of the signa- 
tures along the boundary of each country in 
the map is always a multiple of 3. Conversely, 
if we give the signature + or — to each vertex 
in such a way that the algebraic sum of the 
signatures is always a multiple of 3 along the 
boundary of each country, then we get a four- 
coloring of the map by reversing the above 
procedure. This is called Tait's algorithm, 
which shows that the four-coloring of a planar 
map is an *NP problem. As for five-coloring, it 
is known that an algorithm of polynomial 
complexity exists. 

Tait's algorithm also shows that the four- 
color problem is equivalent to the following 
apparently elementary geometric proposition: 
“For any convex polyhedron, we can always 
cut near some of its vertices in such a way that 
the resulting polyhedron has only faces such 
that the number of sides is a multiple of 3.” 
Many other equivalent statements of the four- 
color problem are known [1,2]. 


D. Solution of the Four-Color Problem 


For a given planar trivalent map, denote by 
V, the number of the countries with n sides 
(n> 3). From Euler’s theorem on polyhedra, 
we have immediately the relation 

Y (6-n)V,-12. (1) 
nzà 

We easily see from this that every planar map 
must contain 3-, 4-, or 5-sided countries. A 
family F of the arrangements of countries with 
the property that every planar map must 
contain at least one arrangement belonging to 
F is called an unavoidable set. The family con- 
sisting of 3-, 4-, and 5-sided countries is the 
simplest example. In order to obtain an un- 
avoidable set, Heesch invented the method of 
discharging. Let us assume the existence of a 
map M that contains no arrangement of a 
family F. We give a signed charge of (6 — n) to 
every n-gon in M. Next we divide and move 
the charges so that the pluses and the minuses 
cancel out. If all positive charges disappear, 
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according to the assumption of the existence of 
the map M, then we have a contradiction of 
(1), so we can conclude that no such map M 
exists. 

A 3-sided country can be ignored in four- 
coloring. As for a 4-sided country A, Kempe 
proved (by means of a definite modification 
procedure) that after four-coloring outside A, 
we also have a total four-coloring including A. 
An arrangement of a country or countries A is 
called reducible if, as in the above case, after 
four-coloring outside A, we get a total four- 
coloring including A by suitable modifications. 
If we have an unavoidable set consisting only 
of reducible arrangements, then by definition, 
every planar map will be four-colorable. 

Kempe believed that he had proved the 
reducibility of a pentagon (5-sided country), 
but unfortunately, he missed a particular case. 
Even though the pentagon is not reducible, 
much effort has been given to finding other 
reducible arrangements. Many useful criteria 
for reducibility have also been studied. If we 
have enough reducible arrangements, then 
we can either eventually obtain an unavoid- 
able set, which proves the four-color problem 
affirmatively, or find a “minimal counter- 
example" without reducible arrangements, 
which disproves it. 

Haken obtained an unavoidable set consist- 
ing of arrangements without certain kinds of 
reduction obstructions. (He called these "geo- 
graphically good arrangements." ) He firmly 
believed that, applying certain "probabilistic 
conjectures" to such arrangements, he would 
conquer the problem by considering arrange- 
ments of up to 14 countries. With repeated 
improvements on the discharging process and 
on the criteria for reducibility, he finally was 
able to conclude that his speculation was 
correct, as recounted in Section A. His inves- 
tigation is not only an example of the use of 
high-speed computation in pure mathematics, 
but also one inviting reassessment of the 
meaning of mathematical proof. 


E. Coloring Maps on Arbitrary Surfaces 


On a ftorus, seven colors are sufficient to color 
any map and that many colors are necessary 
to color some maps. Heawood (1890) inves- 
tigated the problem of coloring maps on 
closed surfaces with *Euler characteristic y « 2, 
orientable or not. The least number of colors 
sufficient to color any map on a surface is 
called the chromatic number of the surface. 
Heawood proved that the chromatic number 
is less than or equal to 


p=[(7+./49 — 24y)/2] 


(x <2), (2) 
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where [x] means the largest integer a < «. After 
this was proved in various special cases, J. W. 
T. Youngs and G. Ringel finally proved in 
1968 that the number p equals the chromatic 
number except for a Klein bottle (a non- 
orientable surface with y= 0) [6]. Franklin 
proved in 1934 that the chromatic number for 
a Klein bottle is 6. 
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Jean Baptiste Joseph Fourier (March 21, 
1768—May 16, 1830) was born in Auxerre, 
France, the son of a tailor, and was orphaned 
at the age of eight. In 1790, he was appointed 
professor at the Ecole Polytechnique. In 1798, 
Napoleon took him on his Egyptian campaign 
together with G. Monge. On his return to 
France, he was made governor of the depart- 
ment of Isére. With the downfall of Napoleon, 
he lost his position; however, he was later 
appointed to the French Academy of Science 
as a result of his research on the transmission 
of heat. In 1826 he was elected a member of 
the Académie Frangaise. 

His research on heat transmission was 
begun in 1800. In 1811, he presented a prize- 
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winning solution to a problem put forth by the 
Academy of Science. He solved the equation 
for heat transmission under various tboundary 
conditions. Fourier also stated (without rigor- 
ous proof) that an arbitrary function could be 
represented by ftrigonometric series (— 159 
Fourier Series), a statement that gave rise to 
subsequent developments in analysis. 
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Throughout this article, we assume that f(x), 
g(x), ... are real-valued functions and that 
integrals are always Lebesgue integrals. 


A. Introduction 


The set of functions 


1/./2n, cos x/,/n, sinx//n,..., 
coskx/,/n, sin bet, e, ..., 


which is called the trigonometric system, is 
an torthonormal system in (— z, z) (2 317 
Orthogonal Functions). Let f(x) be an ele- 
ment of L,( —z, x) (i.e., (Lebesgue integrable 
in (—7, z)). We put 


1 [4 
d Si f(t) cos kt dt, 


1 n 
h-.| f(t)sinktdt, k=0,1,..., (1) 
T = 


and call a,, b, the Fourier coefficients of f. The 
formal series 


1 oo 
540+ Y. (a,cos kx + b, sin kx) (2) 
k=1 


is called the Fourier series of f and is often 
denoted by (f). To indicate that a formal 
series G( f), as above, is the Fourier series of a 
function f, we write 


1 o0 
f(x)~ 540+ Y. (a, cos kx + b, sin kx). 
k=1 


The sign ~ means that the numbers a,, b, are 
connected with f by the formula (1); it does 
not imply that the series is convergent, still less 
that it converges to f. Generally, trigonometric 
series are those of form (2), where the a,, b, are 
arbitrary real numbers. Since the trigono- 
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metric series have period 2z, we assume that 
the functions considered are extended for 

all real x by the condition of periodicity 

f(x 4- 2n) — f(x). The study of the properties 
of the series S( f) and the representation of f 
by G(f) are major objects of the theory of 
Fourier series. Since e’* — cos x +isinx, if we 
set 2c, =a, — ib,, CA =C; (k—0,1,2, ...), we 
have 


LEE 
ax] fe" dt,  k-0, +1,.... 


Then &( f) is represented by the complex form 
Y. Ce, and fe**} (k 20, +1,...) is an 
orthogonal system in (— z, z). In this complex 
form, we take symmetric partial sums such as 
33. _,c,e'™ (n—1,2,...). 

Consider the power series 3a) + Eg (a4 — 
ib,)z* on the unit circle z=e!* in the complex 
plane. Its real part is the trigonometric series 
(2), and the imaginary part (with vanishing 
constant term) is 


Ms 


(a, sin kx — b, cos kx), (3) 
k 


H 


which is called the conjugate series (or al- 
lied series) of f and is denoted by S( f). In 
complex form, the conjugate series is 
—iXE...(sgnk)c,e'**. 

If f and g belong to L,(— z, 2) and 


fo)e Y ee, 


kon 


ga~ È de", 
then 

1 2n oo . 
xl f(x—1)g(t)dt Y cde. 
2x 0 kon 


The function 
1 2n 

Teil | fix 0g(tdt 
R Jo 


is called the convolution of f and g. 
If f is tabsolutely continuous, then the de- 
rivative f'(x) satisfies 


E: Y k(b, cos kx — a, sin kx). 


k=1 
If F(x) is an indefinite integral of f, then 
=, Ck ikx 
F(x)—cox~ C+ 5 —e 
k=- 00 ik 


€ g,sinkx —b,coskx 
-CcY k k : 
k=1 k 





where C is a constant of integration and the 
symbol ' indicates that the term k =0 is 
omitted from the sum. 

If f e Li(— 7,2), then the Fourier coefficients 
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c, converge to 0 as n— oo (Riemann-Lebesgue 
theorem). If f satisfies the tLipschitz condition 
of order a (0 « « « 1), then c, — O(n ^), and if f 
is of "bounded variation, then c, — O(n TL 
When feL,(—7, n), 


1 f?* o0 
| fco dx-2 3. lal 
T k=- %0 


0 


1 oo 
= 503+ Y at eb). 
k=1 


which is called the Parseval identity. If È |c)? 
< œ, then there exists a function fe L,(—7, n) 
which has the c, as its Fourier coefficients. - 
This converse is implied by the *Riesz-Fischer 
theorem (— Appendix A, Table 11.I). 


B. Convergence Tests 


The nth partial sums s,(x)=s,(x; f) of the 
Fourier series G( f) can be written in the form 


TE f(x +D, (dt, 


where 


D,(t)= {sin(n + 1/2)t}/2 sin(t/2). 


The function D,(t) is called the Dirichlet kernel. 


For a fixed point x we set o.(t) 2 f(x - t) - 
f(x —t) — 2f(x)y; then 


so-so] Q9. (t) D,(t) dt. 
0 
Hence if the integral on the right-hand side 
tends to zero as n> oo, lim,..,, s (x) = f(x). If f 
vanishes in an interval I= (a, b), then G&( f) 
converges uniformly in any interval F — (a + 
£, b — e) interior to J, and the sum of G( f) is 
0. This is called the principle of localization. 
Here we give four convergence tests. (1) If f 
is of bounded variation, S( f) converges at 
every point x to the value ( f(x -- 0) - f(x — 
0)}/2. In addition, if f is continuous at every 
point of a closed interval J, &( f) is uniformly 
convergent in J (Jordan’s test). As a special 
case of this test, bounded functions having a 
finite number of maxima and minima and no 
more than a finite number of points of dis- 
continuity have convergent Fourier series 
(Dirichlet's test). (2) If the integral [5|ox(t)l/tdt 
is finite, then S( f) converges at x to f(x) 
(Dini's test). (3) If 


h 
| |p,(t)|dt = o(h) 


0 


and 


ig [O eo 





0, 
13:0] t 
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then S(f) converges at x to f(x) (Lebesgue's 
test). Jordan's and Dini's tests are mutually 
independent, and both are included in Le- 
besgue's test, which, although not as conve- 
nient in certain cases, is quite powerful. (4) If 
f(x) is continuous in (a, b) and its modulus of 
continuity satisfies the condition o(5): log(1/ó) 
—0 as 6-0 in this interval, then S( f) con- 
verges uniformly in (a+ e, b — €) (Dini-Lipschitz 
test). 


C. Summability 


Let s,(x) be the nth partial sum of the Fourier 
series G( f), and o,(x) — o,(x; f) be the first 
arithmetic mean ((C, 1)-mean) of s(x) (Le., o,(x) 
= (so(x) + s, (x) +... +5,(x))/(n+ 1)). Then we 
have 


T 


1 
BEEN (t) K,(t) dt, 


0 


where 


1 (ane 1 ell 


PJ Cl sur? 


The expression K,,(t) is called the Fejér kernel, 
and the o,(x) are often called Fejér means. If 
the right and left limits f(x +0) exist, &( f) is 
*(C, 1)-summable at the point x to the value 
(f(x +0)+ f(x — 0))/2. If f is continuous at 
every point of a closed interval I, S(f) is 
uniformly (C, 1)-summable in J (Fejér theorem, 
1904). As we explain in Section H, there exist 
continuous functions whose Fourier series 
are divergent at some points. Thus the sum- 
mability of G( f) is more important than its 
convergence. Fejér's theorem remains true 

if we replace (C, 1)-summability by (C, a)- 
summability (x 0). More generally, if fe 
L,(—7,z), then G(f) is (C, «)-summable for 
a 2-0 to the value f(x) almost everywhere (H. 
Lebesgue). Since (C, x)-summability (x 0) 
implies tsummability by Abel’s method, the 
result of Fejér's theorem is valid for tA- 
summability. However, the direct study of A- 
summability is also important. Let f(r, x) be 
Abel’s mean of G( f); that is, 


1 Es 
f(r. x) "e + Y. (a,coskx + b,sinkx)r* 
Ki 


gi f(x+OP(r, t)dt, 
where P(r, t)=(1 —r2)/2(1—2rcost - r?), 0< 
r « 1. We call P(r, t) the Poisson kernel. The 
function f(r, x) is harmonic inside the unit 
circle and tends to f(x) as r>1 almost every- 
where. Hence f(r, x) gives the solution of the 
*Dirichlet problem for the case of the unit 
circle. 
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D. The Gibbs Phenomenon 


Let f(x) be of bounded variation and not 
continuous at 0: f(0) 20, f(--0) 21-0, f(—0)2 
— |. Then the partial sum s,(x) converges to 
f(x) in the neighborhood of 0, but not uni- 
formly. Moreover, 


lim s, (5) 216: 
noo n 


2 ("sint 


7T Jo 


Hence as x tends to 0 from above and n tends 
to oo, the values y=s,(x) accumulate in the 
interval TO. /G], while s( — x) 2 — s,(x) in the 
neighborhood of 0. This phenomenon is called 
the Gibbs phenomenon. If f is of bounded 
variation, then G( f) exhibits the Gibbs phe- 
nomenon at every point of simple discontinuity 
of f. However, the (C, 1)-means of &( f) do not 
exhibit this phenomenon. 


E. Conjugate Functions 


For any integrable fe L,( — n, n), the integral 


fe9- 


imt Unsen As m2; a 
kä Tja 2 


exists almost everywhere. The function f(x) 

is called the conjugate function of f(x). The 
conjugate series &( f) is (C, a)-summable («> 
0) to the value f(x) at almost every point, 

and a fortiori summable by Abel's method. 
Even if f € L,( —7, 2), f does not always be- 
long to the class L,(—7, z). For example, 
D2 cosnx/logn is the Fourier series of a 
function fe L,(— n, x), but its conjugate series 
=~, sinnx/logn is not the Fourier series of a 
function in L,(—2, x). However, if both f and 
f are integrable, $( f) 2 G(f). If fe L, (p> 1), 
then fe L, and |], & A, || fll; also, &(f) 

= &(f). If |f|log"|f] is integrable (such a 
function is said to belong to the Zygmund 
class), then f is integrable and S(f)=G(/f). 
Moreover, in this case there exist constants A 
and B such that 


2n 2n 
| Jaxa | [fllog*|fldx + B. 


0 0 


If f is merely integrable, so is |f|” for any 0< 
p«1, and |f lp s Bp ISI (0 p « 1). If feLip« 
(0 « a « 1), then f € Lipa, but the theorem fails 
for «20 and x= 1. The conjugate function is 

important for convergence of partial sums of 
Fourier series. 
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F. Mean Convergence 


The theorems on conjugate functions enable 
us to obtain some results for the tmean con- 
vergence of the partial sums s, of C( f). If fe 
L, (p? 1), then || f —^s,||, 0; if f e L,, then 

If 5,150. If — 3,0 for every 0<p< 

1. Also, if |f|log ^ |f|& L,, then || f —s,| —0, 
(IS —0. As a corollary of this result, we 
obtain the following theorem, which is a gen- 
eralization of the Parseval identity: If the 
Fourier coefficients of functions fe L, and ge 
L, (V/p - 1/q — 1) are a,, b, and ap, bp, respec- 
tively, we have the Parseval formula 


pre 1 o , 
-| fgdx= agayt > (a,a,+5,b,), 
H 2 n=1 


0 


where this series is convergent. 


G. Analytic Functions of the Class H, 


Let p 0. A complex function (z) holomor- 
phic for |z| « 1 is said to belong to the class 
H, (Hardy class) if there exists a constant M 
such that 


2n 
fae | lo(re'9)|? 0 < M. 
rz) 2n 0 

When o(z)e H,, the nontangential limit q(e'^) 
— lim, q(z) exists for almost all 0. We write 
this as (e?) = f(0) + if(0), where f(0), f(0) 
belong to the class L,. Also, f(0) coincides 
with the conjugate function of f(0) for pz 1. 
For 1 « p« oo, H, is isomorphic to L,, but 

for p=1 and p= oo, H, and L, are different 
classes. Using the theory of functions of H,, we 
can discuss some properties of Fourier series. 
If p(e*) = f(0) + if(0) is of bounded variation, 
then q(e'?) is absolutely continuous and its 
Fourier series converges absolutely. We set 


1 1/2 
g(0)— (| (a notre ar) , 


g*(0)— 


1 2x 1/2 
(| (1—r)dr | |o'(reX9^9)|? P(r, pat) 3 
0 0 


where P(r, t) is the Poisson kernel. Then g(0) € 
2g*(0), and there exist constants A,, B, C, 
and A, such that 


21 2n 
| perdo a, | lo(e^)"d0, ` p»0, 


0 0 


0 0 


2n 2n 
| l'ai, | lo(e^)?dÓ, pl 


2n 2n 
| emäreg) |o|log*1o|46 +C, 
0 0 
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2m 2n A 
| lg* (0) d6 « A, (f ena) 
0 D 


O0<yp<i. 


Denote by s,(0) and o,(6), respectively, the 
partial sums and arithmetic means of the 
Fourier series of q(e'*), and set 


A 0)— o.(0)2 N12 
i LN onl i 


n 

Then 0 z A, <g*(0)/y(0) < A, # oo. From these 
relations, we can prove that if the indices 

n, satisfy the conditions f n,,, /n, 42 

1, $, (0) converges almost everywhere to 

(e^) for o(e'^)e H, (1 <p). If we set A,(0) — 
dE, itl ce", 6(0) - (Zo |A,(0)|7)", where 
ole”) — 3.5 c,e"*, then ||5(A)||,< A, loll, 
(p> 1). If o(z)e H, (0 « p « 1), then X c,e"? is 
(C, p! — 1)-summable to (e'?) almost every- 
where. These functions and relations were 
introduced mainly by J. E. Littlewood and 

R. E. A. C. Paley and were later generalized by 
A. Zygmund. There are more precise results 
by E. Stein [7], G. Sunouchi [8], S. Yano [9], 
and others. 


H. Almost Everywhere Convergence and 
Divergence 


P. du Bois Reymond (1876) first showed that 
there exits a continuous function whose 
Fourier series diverges at a point, but the 
problem of whether Fourier series of continu- 
ous functions converge almost everywhere 
(the so-called du Bois Reymond problem) re- 
mained unsolved for many years. At last in 
1966, L. Carleson [10] proved that the Fourier 
series of a function belonging to L, converges 
almost everywhere; hence the du Bois Rey- 
mond problem was solved affirmatively. Using 
Carleson’s method, R. A. Hunt [12] proved 
that 


2n p Qn 
| BEEN ix<A,| Lf (x) dx, 
0 n 0 


1<p<o, 


which implies that the Fourier series of fe L, 
(1 « p « oo) converges almost everywhere. Hunt 
also proved that 


2n 
| (sup ts dx 
0 n 


2n 
<A | Lf) log* [f(o)))2dx-+.4. 


0 


Moreover, P. Sjolin proved that if 


2x 
| \f\-log* |f|-log* log* |f|dx « oo, 


0 
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then the Fourier series of f(x) converges 
almost everywhere. 

On the other hand, A. N. Kolmogorov 
gave an integrable function with Fourier 
series diverging everywhere (more precisely, 
lim sup,,..,, [55 (x)| = 0o almost everywhere). 
Using this example, J. Marcinkiewicz showed 
that there is an fe L, such that s,(x; f) oscil- 
lates boundedly almost everywhere. Moreover, 
there exists an integrable function with inte- 
grable conjugate and almost everywhere diver- 
ging Fourier series [3]. For any given tnull set 
E, we can construct a continuous function 
whose Fourier series diverges on every xe E. 


L Absolute Convergence 


The convergence of the series (1) X (|a,| + | b,]) 
implies the absolute convergence of the trig- 
onmometric series (2) a,/2 + 2(a, cos nx + 
b,sin nx). Conversely, if the series (2) con- 
verges absolutely in a set of positive mea- 
sure, the series (1) converges (Denjoy-Luzin 
theorem). For the absolute convergence of 
Fourier series, we have the following tests: If 
feLipa (x 1/2), then G( f) converges ab- 
solutely, but for «= 1/2, this is no longer true. 
If f(x) is of bounded variation and belongs to 
Lip« (x7 0), G( f) converges absolutely. 

Suppose that the Fourier series of a function 
f(x) is absolutely convergent and the value of 
f(x) belongs to an interval (a, b). If o(z) is a 
function of a complex variable holomorphic 
at every point of the interval (a, b), the Fou- 
rier series of ol f(x)) converges absolutely 
(Wiener-Lévy theorem). As a corollary we 
obtain that if S( f) converges absolutely and 
f(x) Z0, then S(1/f) converges absolutely. The 
converse of the Wiener-Lévy theorem was 
proved by Y. Katznelson [6]. For a given ox) 
defined in (—1, 1], if the Fourier series of 
oi f(x)} converges absolutely for every f(x) 
with absolutely convergent Fourier series 
(| f(x)| € 1), then (z) is holomorphic at every 
point of the interval | —1, 1). 

Many problems concerning this topic still 
remain unsolved. In particular, the determi- 
nation of the structure of the functions with 
absolutely convergent Fourier series has not 
been completed. 


J. Sets of Uniqueness 


If a, cos nx +b,sinnx converges to 0 on a set 
of positive measure, then a,, b, 0 (Cantor- 
Lebesgue theorem). A point set E c (0, 27) is 
called a set of uniqueness (or U-set) if every 
trigonometric series converging to 0 outside E 
vanishes identically. A set that is not a U-set is 
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called a set of multiplicity (or M-set). G. Can- 
tor showed that every finite set is a U-set, and 
W. H. Young showed that every denumerable 
set is a U-set. It is clear that any set E of posi- 
tive measure is an M-set, but D. E. Men'shov 
showed that there are tperfect M-sets of mea- 
sure 0. Moreover, N. K. Bari showed that 
there exist perfect sets of type U. However, the 
structure problem of sets of uniqueness has not 
yet been solved completely. 

A set E is said to be of type H (or an H-set) 
if there exists a sequence of positive integers 
n, «n; <... and an interval I such that for each 
x€ E, no point of [nx], is in J (mod 27). H- 
sets are sets of uniqueness, a fact given by A. 
Rajchman. I. I. Pyatetskii-Shapiro generalized 
H-sets to H™-sets [3]. 

Lacunary trigonometric series are series in 
which very few terms differ from zero. Such 
series can be written in the form 


(a, cos n,x Fb sinn x)= 3. A, (x). 


kal? 


k 


li 
= 
Se 

ll 
- 


S. Sidon established some of the characteristic 
properties of such series; he generalized them 
further and obtained the notion of Sidon sets 
(— 192 Harmonic Analysis). We often define a 
lacunary series more specifically as a series for 
which the n, satisfy Hadamard's gaps; that is, 
n, +1/M%>q> 1. Then if E£ (a2 + b2) is finite, 
the series ZZ: A,, (x) converges almost every- 
where. Conversely, if ZZ: A, (x) is convergent 
in a set of positive measure, then X, (a? + b2) 
converges. This theorem is related to the 
Rademacher series and random Fourier series 


[4]. 


K. Multiple Fourier Series 


Routine extensions to multiple Fourier series 
from the case of a single variable are easy, but 
significant results are difficult to obtain. Re- 
cently, however, there have been several im- 
portant contributions in this field. 
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A. Fourier Integrals 


In this article we assume that f(x) is a 
complex-valued function defined on R = 
{—00, œ) and t(Lebesgue) integrable on any 
finite interval. If the integral 


a B 
| f(x) '* dx= lim | f(x)e "* dx 
GE Bow a 


exists, it is called the trigonometric integral 

or Fourier integral. We have a general result: 
If f(x)e L,(—00o, oo), K(x) is bounded on 

(Go, oo), and (5 K(x) dx — o(T) (T> +0), then 
f2 f(x) K (xt) dx exists and 


lim p f(x)K(xt)dx=0. 


t too 
— 06 


In particular, it follows that if f(x)e L,(— oo, 
oo), then f2 f(x)e"* dx exists and 


lim | fixe dx =0 


(Riemann-Lebesgue theorem). 


625 


B. Fourier's Integral Theorems 


Suppose that f(x) is of bounded variation in 
an interval including x, or more generally 
satisfies the assumption for any one of the 
convergence tests for Fourier series (— 159 
Fourier Series B). Then Fourier's single inte- 
gral theorem 


1 
ox 9 fix -9) 


jJ. mm EE (1) 


= lim — 


A-co TE —X 


holds if one of the following three condi- 
tions is satisfied: (1) f(x)/(1 + |x|) belongs to 
L,(—00, oo); (2) f(x)/x tends to zero mono- 
tonically as x +00; (3) f(x)/x 2 g(x)sin(px + 
q), where g(x) tends to zero monotonically 

as x +00 (S. Izumi, 1934). The right-hand 
side of (1) is called Dirichlet's integral. 

Let f(x) be of bounded variation in an inter- 
val including x (or satisfy some other conver- 
gence test for Fourier series). Then Fourier's 
double integral theorem 


1 
zA +0) +f(x—0)) 


T oo 
sl «| f(u)cost (u — x)du (2) 


holds if one of the following three conditions 
is satisfied: (4) f(x)e L,(—0oo, 00); (5) f(x)/(1 

+ |x|)eL,(—oo, oo), and f(x) tends to zero 
monotonically as x +; (6) f(x)/(1--|x|)e 
L,(—90, oo) and f(x) — g(x)sin(px + q), where 
g(x) tends to zero monotonically as x > +o. 
If f(x)e L,(—00, oo), the formula 


sin? A(t — x) 


d 
A(t — xy : 


1 oc 
fei- tim [^ f(t) 
holds almost everywhere, and in particular at 
any x where f(x) is continuous. More gener- 
ally, the formula 


(x)= lim al f)K(A(t—x))dt 


= lim IT dea (3) 


holds at any point x where 4 (x+0) and f(x — 
0) exist, if K(t)e L1(—00, oo), (*,, K(t)dt = 

1, |K(t)|<M, K(t)— (t !) as |t| ^ oo, and 
Met (— o0, œ), or if K(t)e L,(—0o, oc), 
17. K(t)dt=1, and f(x) is bounded. Similarly, 
the formula 


im | DE axe "n K (x)dx 


holds if M{ f£] —lim,.,, T ^! f f(t)dt exists, 
K(x) is differentiable, |x? K(x)|< : C (1 & x), and 
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x ! ($f(t)|dt € D, where C, D are constants 
(Wiener's formula). 


C. Fourier Transforms (— Appendix A, 
Table 11.IT) 


Let f(x)e L,(0, 


rom D uwen 
0 


is called the (Fourier) cosine transform of f(x). 
Under the same condition as for the validity of 
(2), the inversion formula of the cosine transform 
f(x) o J/2/n f$ F(t)cos xt dt holds where we 
suppose that f(x) -1( f(x +0)+ f(x — 0)). If we 
define f(— x)= f(x), then this is equivalent to 
the formula (2). Analogously, 


G(t) JT f(u)sinut du 


is called the (Fourier) sine transform of f(x). 
Under the same condition as for the validity 
of (2), we get the inversion formula f(x)— 
Janse G(t)sin xt dt. More generally, for any 
f(x)e L,( —oo, o0), 


"D f(x)e "* dx 


is called the Fourier transform of f(x). Under 
the same condition as for the validity of (2), the 
Fourier inversion formula 


oo). Then 


f(x)= 





sm lim. rer dt 
holds. The cosine transform and the sine 
transform coincide with the Fourier trans- 
form when f(x)— f(— x) and — f(x) 7 f(— x), 
respectively. 

If for any f(x), F(t)e Lj(—9o0, o0) (1 & q « oo) 
exists, for which 


Ee F 
1 Cl Anen mu dio 


(To) 
is valid (i.e., (1/,/2n) {7 Title -ixt dt converges 
to F(x) as T oo in the mean of order q), then 
we say that f(x) has the Fourier transform F(t) 
in L,(—oo, oo). If f(x)e L,( —00, 00) (1 <p <2), 
then f(x) has the Fourier transform F(t) in 
L, (1/p + 1/q — 1), and F(t) has the Fourier 
transform f(— x) in L, (E. C. Titchmarsh). 
Moreover, 


8 |F(Df* dx 


sg 


1 oc l((p-1) 





160 D 
Fourier Transform 


If f(x), G(x)e L,( —o0, oo) (1 « p € 2) and their 
Fourier transforms in L, are F(t), g(t), respec- 
tively, then the Parseval identity 


1 F()G())dt= | "fer 


holds. The Fourier inversion formula holds, in 
the sense that 


x 1 oo et) 
| rue: 7 |" fe SH dt, 


x 1 oo eg" —1 
| f(tdt— x." z— du. 


The theory of Fourier transforms is valid for 
cosine and sine transforms. Specifically for p — 
2, if f€ L,(0, oo), then \/2/n (7 f(x): cos xt dt 
tconverges in the mean in L,(0, oo) to the 
cosine transform F(t) as T— œ, and con- 
versely, the cosine transform of F(t) in L, is 
f(x). The transforms f(x), F(x) are connected 
by the formulas 


ih Padus e | ` En 
0 7 Jo t 
i fid fe | Mri 
o T Jo u 


| reas | | F(t)|? dt. 


0 0 




















The theory of Fourier transforms was gen- 
eralized as follows by G. N. Watson (Proc. 
London Math. Soc., 35 (1933)). We suppose 
that x(x)/x e L5(0, oo) and 
Ji IO a — miní(x, y). (4) 
If f(x)e L,(0, oo), then there exists an F(t)e 
L,(0, oo) such that 


| roa | Voie 


0 o u 


and the inversion formula 


[rdus 2279, 
0 


0 t 


and the Parseval identity 


| ka= | |F(0 dt 


0 0 


hold. F(t) is called the Watson transform of 
f(x). For any feL,, equality (4) is necessary 
for the existence of the Watson transform F(t) 
for which the inversion formula holds. S. 
Bochner (1934) generalized this theory further 
to *unitary transformations in L, (— 192 
Harmonic Analysis). 
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D. Conjugate Functions 


Corresponding to Fourier’s double integral 
theorem (2), the integral 


1 A Es 
e?) d sin t(u— x)f(u)du 
Ao T 0 —o0 


is called the conjugate Fourier integral of the 
integral in the right-hand side of (2). Formally, 
this is written Im, t! fo (1 — cos åt)t (f(x 
t t)— f(x — t))dt. If f(x) is a sufficiently regular 
function, the part involving cos At tends to 0 as 
4 oo. Now let 
P T lini TE, 
T 4 » t 
For any fe L,(0, oo), the integral exists almost 
everywhere, and g(x) is called the conjugate 
function or Hilbert transform of f(x). If feL, 
(p> 1), then g(x)eL, also and we have 
fije fm [tte 


T Ao 
£20 


dt 





and (?, lgx) dx <M, f2 |f(x)|’dx, where 
M, is a constant depending only on p. In 
particular, 


| feo dx | \g(x)|? dx for p=2. 


~ o — 0 


E. Boundary Functions of Analytic Functions 


Suppose that a complex-valued function f(z) 
(z=x+iy) is tholomorphic for y>0, f(x 4- iy) 
converges as y—0 for almost all x to f(x) 
(which is called the boundary function), and 
fG)e L,(—oo, oo) (p> 1). Moreover, suppose 
that f(z) is represented by *Cauchy's integral 
formula or *Poisson’s integral of f(x) on the 
real line. If either f(x)e L, (pz 1) and has F(t) 
as its Fourier transform or f(x) is an L,- 
Fourier transform of F(t)e L, (q> 1), then a 
necessary and sufficient condition for the 
function f(x) to be the boundary function of 
an analytic function is that F(t) be 0 almost 
everywhere for t 0 (N. Wiener, R. E. A. C. 
Paley, E. Hille, J. D. Tamarkin). 


F. Generalized Fourier Integrals 


Let |f()|/(1 - |x|9 € L,(—oo, oo) for a positive 
integer k and 





L, = L(t, x)= x y! 8 


627 


The function E,(t) or one that differs from 
E,(t) by a polynomial of degree at most k is 
called the kth transform of f(x). We write 
formally f(x) 2 (^. e'*' d* E, (t). Actually, if we 
give an appropriate meaning to the integral, 
this formula itself is valid (H. Hahn, Wiener 
Berichte, 134 (1925); S. Izumi, Tóhoku Science 
Rep., 23 (1935)). (For the theory and applica- 
tions of kth transforms — [2,ch. 6].) 


G. Applications of Fourier Transforms 


Suppose that f(x)e L,(—oo, oo), F(t) is its 
Fourier transform, and f(x)  o(e 9*?), where 
0(x) is positive and increasing. If f7 0(x)x ? dx 
= oo and F(t) vanishes identically in an 
interval, then F(t) «0 over (—oo, œ). If 

J? 0(x)x ?dx « oo, then there exists a function 
f(x) such that F(t) vanishes identically in an 
interval, but F(t) does not vanish identically in 
(—26, œ) (Wiener, Paley, N. Levinson). These 
results are applicable to the theory of fquasi- 
analytic functions. 

Let f(x)e L,(—0o0, oo). Then a necessary 
and sufficient condition for any function in 
L,(—00, oc) to be approximated as closely as 
we wish by linear combinations of the trans- 
lations EX, a, f(x +h,) of f(x) with respect 
to the L,-norm is that the Fourier transform 
of f(x) does not vanish at any real number. 
When f(x)e L,( —o0, oo), a necessary and suffi- 
cient condition that an arbitrary function in 
L?(—00, oo) can be approximated as closely 
as we wish by X2. , a, f(x +h,) with respect to 
the L,-norm is that the zeros of the Fou- 
rier transform of f(x) have measure zero 
(Wiener). This result was used by Wiener to 
prove the generalized Tauberian theorem: 
Suppose that g,(x)e L,(—0oo, oo) and its Fou- 
rier transform never vanishes. Moreover, let 
ga(x)e L,(—00, oc) and p(x) be bounded over 
(—oo, oo). Then lim,..,, (7. gı (x — t)p(t)dt = 
AT? g,(0dt implies that lim, .. f^, g;(x 
t)p(t)dt — Al". g; (t) dt. Another type of 
Wiener theorem is concerned with tStieltjes 
integrals. Suppose that 


oo 


Y sup [giG9| «oo 


n=— oo ngx<nt+1 


(hence g,(x)e L,) and that the Fourier trans- 
form of g,(x) never vanishes. Moreover, let 


Kë 


3 sup |g2(x)|< co 


n—-—on&Exc«ntl 
and let (z^! |da(z)| be bounded. Then 


Ka 


im | vis msn) g,(t)dt 


—00 








160H 
Fourier Transform 


implies 


lim | g»(x —t)da(t) — A | g(t)dt 

From these general theorems, we can prove 
various tTauberian theorems about the sum- 
mation of series. Also, these results were applied 
to the proof of the tprime number theorem by 
S. Ikehara and E Landau (Wiener [1]). In the 
general Tauberian theorem, the boundedness 
of p(t) can be replaced by one-sided bounded- 
ness (H. R. Pitt, 1938). In fact, the first form of 
the theorem still holds if we replace the con- 
dition by the following one: g, and g, are 
continuous, g,(x) 20, the Fourier transform of 
g(x) does not vanish, g,(x) satisfies the con- 
ditions of the second theorem, and p(x) > C. 
(Concerning Fourier transforms on the topo- 
logical groups — 192 Harmonic Analysis; 

and concerning Fourier transforms of the 
distributions — 125 Distributions and 
Hyperfunctions). 


H. Fourier Transforms of Distributions 


The Fourier transform is defined in higher- 
dimension R" by 


Z(f)e (/2xy" | e 5f(x)dx 


THE E 


f€L,, 


One denotes it by f(¢). The inverse transform 
is defined by 


D- d GER 


Differentiation under the integral sign gives 


DFE - (./2ny" | e 8(— ix)*f(x)dx 


D* =(6/6x 4)" ...(0/0x,", 


under the assumption (1 4-|x|)"f(x)e L, (|a| 
X +... - &,). Roughly speaking, the decreas- 
ing order of f(x) when |x|-» oo is reflected in 
the differentiability of f(£). In the same way, 


GAE =(,/20)" [ena dx, 


which shows that the differentiability of f(x) 
is reflected in the decreasing order of f(£). 
The same statements can evidently be made 
for the relation between g(é) and its inverse 
Fourier transform. 

Let ¥ be the space of rapidly decreasing 
functions ois), i.e., such that for all positive 
integers k and 4 20, (1 + |x|) D* p(x) remains 
bounded (— 125 Distributions and Hyperfunc- 
tions). From the facts above, it follows that the 
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linear mapping f(x) (f) is a topological 
isomorphism from Z, onto A Let S’ be the 
dual space of Z. Usual function spaces in 
classical analysis are contained in 4", for 
instance, the L,-space and that of functions 
increasing at most of polynomial order at 
infinity. 

Let Te“. Then the mapping o € 7; 
T(F o) is continuous, and F Te% is defined 
by ¥ T(~)=T(F p). F is also a topological 
isomorphism from “ onto AC Take as an 
example the distribution p.v.1/x. This is no 
longer a function; however, its Fourier trans- 
form can be calculated as follows: 


d 2i 
lim (4 / 2n)! | gri e = ! 
pns e«|x|« A x 2m 


n 
hc. zi UO 

E [eta d 
a. i 
m j^ cs , 


where the limit is taken with respect to the 
topology of F”. 

Using the definition above, classical results 
can be extended to A" almost automatically. 
For example, 4(D* T) — (i£ F(T), 4 ((—ixY T) 
= D* £ (T). Moreover, if Teg’, then F T= 
Jn 7" T(e~**). In particular, F ô —(2z) "2. 

The Plancherel theorem says that if for 
f(x)e L, one defines its Fourier transform by 


fb - Lim. Jm 


ls A 








e "*f(x)dx, 


then the correspondence f(x) f(E) is a uni- 
tary mapping from L, onto L;, Le, 


| If)? dx = | LIED dé. 


This result can be extended. For any non- 
negative integer m, the element of H” (— 

168 Function Spaces) is characterized by its 
Fourier transform: f(x)e H" if and only if 
(1-4&p"ft&)e L;. Furthermore, for arbitrary 
real s, the space H? can be defined as the set of 
all elements of S’ whose Fourier transform 
f(&) satisfies (1 -|£| f(£)e L}. H* and H ^? are 
dual to each other. 

For feL,,ge€L,, or f; ge La, the convo- 
lution makes sense as a function, and it holds 
that F(f *g)=F(f)F (g). This relation can be 
extended to distributions, to state 


Z(S* T)-(ZSYZ T). 


This holds for (S, T)e 8 x ^, 2, x 2, (Gz, 
is the dual of Z, etc. 
Fourier transforms are also often defined by 


fley= efas or de 
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In the former case, the inversion formula takes 
the form 


fio - ex)" fed 


and the Parseval identity becomes 


[uota am [ioa 
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A. General Remarks 


A group F is called a free group if it is the tfree 
product (— 190 Groups M) of "infinite cyclic 
groups G,,...,G, generated by a,,...,a,, TE- 
spectively. Then n is called the rank of F. A 
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free product of tsemigroups is defined similarly 
to that of groups, and the free product of 
infinite cyclic semigroups G;2 (1,a;,a?, ...] 
(i— 1, ...,n) is called a free semigroup gener- 
ated by n elements a; (i= 1, ..., n). 

If a group G is generated by subgroups 
H; (i— 1, ...,n) isomorphic to G;, then G isa 
homomorphic image of the free product of 
the groups G;. A subgroup # {e} of the free 
product F of groups G; is itself the free prod- 
uct of a free group and several subgroups, 
each of which is conjugate in F to a subgroup 
of some G; (A. G. Kurosh, 1934). Notably, a 
subgroup z {e} of a free group is itself a free 
group (O. Schreier, Abh. Math. Sem. Univ. 
Hamburg, 5 (1927)). A subgroup of index j ofa 
free group of rank n is a free group of rank 
1+j(n— 1) (Schreier). 

Let F be the free group generated by n ele- 
ments a,,...,a,, and let G be a group gen- 
erated by n elements b,, ..., b,. Then there is a 
homomorphism of F onto G. Let N be its 
kernel. If the class of a tword wí(a,, ...,a,) 
belongs to N, then we have w(b,,...,5,)=1. 
We call w(b,,...,5,)=1 a relation among 
the generators b,,..., b,. If N is the minimal 
normal subgroup of F containing the classes 
of words w,(a,, ...,a,), ..., w,(a,, ..., a,), then 
the relations w,(b,, ...,5,) — 1, ..., w4(b,, ..., 
b,) — 1 are called defining relations (or funda- 
mental relations). If generators a,,...,a, and 
words W (a1, ...,a,), ..., Walt, ap) are 
given, then there is a group generated by 
d, ...,0, with defining relations w;(a,, ..., 
à,) — 1, ...,w(d4, ...,a,) 5 1. In fact, let F be the 
free group generated by a,,...,a, and N the 
minimal normal subgroup containing the 
classes of words w,(a,,..., anh -<s Win(@q5 -5 
à,). Then the factor group F/N is such a group. 
A free group is a group with an empty set of 
defining relations. In the preceding discussion, 
n and m are not necessarily finite. If both n 
and m are finite, then G is called finitely 
presented. 


B. The Word Problem 


If a finitely presented group G is given, then a 
general procedure has to be determined by 
which it can be decided, in a finite number of 
computational steps, whether a given word 
equals the identity element as an element of G. 
This is called the word problem (— 190 Groups 
M). A solution to the word problem does not 
always exist (P. S. Novikov [8], 1955); in fact, 
there is a group with two generators and 32 
defining relations for which the word problem 
cannot be solved (W. Boone [7]). However, it 
was shown by V. A. Tartakovskii that the 
problem can be solved for a large class of 
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groups. W. Magnus (1931) showed that it is 
solvable for any group with a single defining 
relation. The word problem is an example of 
decision problems (— 97 Decision Problem). 
The word problem for groups is closely related 
to that for semigroups (A. M. Turing, 1937; E. 
L. Post, 1947; A. A. Markov, 1947). Similar 
problems for other algebraic systems can also 
be considered. The problem of determining a 
general procedure by which it can be decided, 
in a finite number of steps, whether two given 
words interpreted as elements of G can be 
transformed into each other by an (inner) 
automorphism of G is called the transforma- 
tion problem. 

Let F be a free group of rank n and F — F, 
>... DF,>F,,,2>... be the lower central 
series of F. Then F,/F,,, is a tfree Abelian 
group of rank u,(r)=(1/r) 4 u(r/d)n*, where u 
is the tMobius function (E. Witt). The intersec- 
tion of all subgroups of F of finite index is the 
identity element. 


C. The Burnside Problem 


The original problem of Burnside is: If every 
element of a group G is of finite order (but not 
necessarily of bounded order) and G is finitely 
generated, is G a finite group? E. S. Golod [6] 
(1964) showed that this problem for p-groups 
has a negative solution. The following is the 
more usual form of the Burnside problem: If a 
group G is finitely generated and the orders of 
elements of G divide a given integer r, is G 
finite? Let F be a free group of rank n, N be 
the normal subgroup of F generated by all the 
rth powers x” of elements of F, and B(r,n) — 
F/N. Then the problem is the same as the 
question of whether B(r, n) is finite. For r= 

2, 3, 4, 6 the group is certainly finite (I. N. 
Sanov, M. Hall). The restricted Burnside prob- 
lem is the question whether the orders of 
finite factor groups of B(r, n) are bounded. It 
was solved affirmatively for r a prime (A. I. 
Kostrikin [5], 1959). 

A group generated by two generators x, y 
and satisfying the relations x" — y"—(xy)"—1 
(where u, v, w are integers) is infinite if 1/u + 1/v 
t 1/w—1 «0, and is of order g if 0< 1/u + 1/v 
+1/w—1=2/g. 

There is also a finitely presented group 
which is isomorphic to its proper factor group 
(B. H Neumann). 
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The origin of functional analysis can be traced 
to the 1887 work of V. Volterra. He stressed 
the important notion of operations or opera- 
tors, that is, generalized functions of which 
the domains as well as the ranges are sets of 
functions. A typical example is the operator 
assigning to a function f its derivative f’. An 
operator is called a functional if its values are 
numbers, as in the case of the operator assign- 
ing to a function f the value f'(a) or the value 
LU f(t)dt. In 1896, Volterra considered the 
operator mapping a continuous function f to 
a continuous solution ¢ of the integral equa- 
tion f(x)— p(x) — Jz K(x, y)o(y) dy, where 
K(x, y) is a continuous function. Defining 
Io — 9 and (Ko)(x) » [7 K(x, y) (y)dy, he 
showed that ¢ is given by p=(I—K) ' f= 

f * Kf t K?f & ..., where K"f — K(K"^! f). 
Following this lead, I. Fredholm studied in 
1900 the integral equation f(x) e(x) — 

AR K(x, y)e(y)dy containing a parameter 4. 
He proved the so-called *alternative theorem: 
For a given åo, the operator equation (J — 
AoK)p=f, (Ko)(x) — f? K(x, y) o(y)dy, either 
admits a uniquely determined continuous 
solution q for every continuous function f or 
else (I — 4, K) py — 0 admits a nontrivial con- 
tinuous solution o, #0. D. Hilbert discussed 
(1904—1910) a *continuous linear operator K 
defined on the tHilbert space L, with values 
in L3, and he called a complex number Zo a 
*spectrum of K if (I — 4, K) does not have a 
continuous linear inverse. He proved that if 
K is Hermitian, then K admits a *spectral 
resolution with real spectra only. One of his 
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outstanding contributions was the discovery of 
the ‘continuous spectrum. In 1918, F. Riesz 
proved that Fredholm’s alternative theorem 
holds for a tcompletely continuous linear 
operator in the space of continuous functions, 
and later the result was extended to Banach 
spaces. 

In 1932, three important books by S. Banach 
[1], J. von Neumann [2], and M. H. Stone [3] 
were published. These books treated closed 
linear operators that are not necessarily con- 
tinuous. The notion of Banach space was 
introduced: a *normed linear space complete 
with respect to the distance dis(x, y) 2 ||x — 
y||. By making use of the *Baire-Hausdorff 
theorem and the *Hahn-Banach theorem, 
Banach proved, for closed linear operators 
in Banach spaces, the fundamental principle 
consisting of the topen mapping theorem, the 
‘closed graph theorem, the *uniform bounded- 
ness theorem, the tresonance theorem, and the 
closed range theorem. These theorems were 
modified to be applicable in locally convex 
topological linear spaces by the Bourbaki 
group beginning in the late 1940s. 

In 1929, von Neumann proved, as a mathe- 
matical foundation of quantum mechanics, 
that a closed linear operator T in a Hilbert 
space admits spectral resolution with real 
spectra if and only if T is a tself-adjoint opera- 
tor (— also Stone [3]). The condition that a 
closed linear operator is a tfunction of a self- 
adjoint operator was given by von Neumann, 
F. Riesz, and Y. Mimura (1934-1936). K. Frie- 
drichs (1934) proved that a tsemibounded 
linear operator admits a self-adjoint extension. 
T. Kato [4] (1950) proved that a Schrödinger- 
type Hermitian operator is tessentially self- 
adjoint. 

Von Neumann’s tmean ergodic theorem 
in the Hilbert space (1932) was extended to 
Banach spaces by K. Yosida, S. Kakutani, and 
Riesz in 1936. G. D. Birkhoff es tpointwise 
ergodic theorem (1931) was extended by N. 
Wiener (1939), Yosida (1940), E. Hopf (1954), 
N. Dunford (1955), R. V. Chacon and D. S. 
Ornstein (1960), and others in various ways. 
The fAbelian ergodic theorems were discussed 
by E. Hille and R. S. Phillips [5] and Yosida 
[6]. 

The notion of tBanach algebra was intro- 
duced by M. Nagumo in 1936. I. M. Gel'fand 
proved that a commutative Banach algebra 
with multiplicative unit e satisfying |e| = t 
(=the normed ring) admits a representation 
by an algebra of complex-valued continuous 
functions (1941). 

The treflexivity as well as the tduality of 
Banach spaces were studied by S. Kakutani 
(1939), V. L. Shmul'yan (1940), and W. T. 
Eberlein (1941). 
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The notion of !vector lattice (= tRiesz space) 
was introduced in analysis by Riesz (1930). This 
was followed by the work of L. V. Kantrovitch 
(1935) and H. Freudenthal (1936). Kakutani 
gave two standard types of tBanach lattice: 
the *abstract (M) space (1940) and the tab- 
stract (L) space (1941). M. Krein and S. Krein 
(1940) and Yosida and M. Fukamiya (1940) 
discussed the (M)-type vector lattices, and 
Yosida (1941), the (L)-type vector lattices. H. 
Nakano (1940-1941) and T. Ogasawara and 
F. Maeda (1942) studied the spectral resolu- 
tion of the Banach lattice. 

The connection between ‘Brownian motion 
and tpotential theory was clarified by Kaku- 
tani (1942), and extended by J. L. Doob (1956) 
and G. A. Hunt (1957-1958). 

The tone-parameter semigroup of continu- 
ous linear operators in Banach spaces was 
studied by Hille and Yosida, and they gave in 
1948 a characterization of the tinfinitesimal 
generator of such semigroups. Its dual was 
given by Phillips (1955). The one-parameter 
semigroup of nonlinear tcontractive operators 
in Hilbert spaces was studied by Y. Komura 
(1967), who obtained a nonlinear version of 
the Hille-Yosida theorem. This has been ex- 
tended considerably in Banach spaces by 
many scholars, e.g., Kato (1967), M. G. Cran- 
dall and A. Pazy (1969), Crandall and T. Lig- 
gett (1971), I. Miyadera and S. Oharu (1970), 
H. Brezis (1973), P. Bénilan (1973), and others. 

In 1936, S. Sobolev gave a generalization of 
the notion of functions and their derivatives 
through integration by parts. This general- 
ization has been extended by L. Schwartz 
(1945—) [9] to the notion of tdistributions, 
which are continuous linear functionals defined 
on the function spaces Z(R") and "(R"), and 
this extension gives, e.g., a reasonable inter- 
pretation of Dirac's 6-function. Since 1959, 
Gel'fand [10] has been publishing, with his 
collaborators, books on the distribution the- 
ory pertaining to function spaces other than 
Z(R") and A(R"). M. Sato introduced (1959— 
1960) [11] the theory of *hyperfunctions, as a 
generalization of distribution theory. In the 
case of one independent variable, a hyper- 
function f may be defined as a generalized 
boundary value on the real axis R of a holo- 
morphic function F defined in C! — RI. Hyper- 
function theory has been refined to tmicro- 
local analysis and studied extensively by Sato, 
A. Martineau, H. Komatsu, P. Schapira, T. 
Kawai, M. Kashiwara, M. Morimoto, A. Ka- 
neko, and others (— [12]). 
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Functional-Differential 
Equations 


A. General Remarks 


In many models, it is assumed that the future 
behavior of a system under consideration is 
governed only by its present state and not by 
past states. However, for various systems 
arising in practical problems we cannot ignore 
the effect of the past on the future. Such a 
phenomenon is often observed in population 
problems, epidemiology, chemical reactions, 
system engineering, and so on. 

The description of such phenomena may 
involve difference-differential equations 


X(t)= f(t, x(t), x(t — hi), ..., x(t —h,)), (1) 


or integrodifferential equations 


t 


eate xt» | f (t. s, x(t), x(s)) ds. (2) 


0 


An enormous variety of equations is discussed 
in the literature, but most of them can be 
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expressed in the form 
X(t) — f(t, x(s)), (3) 


which is usually called the functional differen- 
tial equation; here s varies in an interval I, and 
f depends on the function x of 1,. If t is the 
right or left endpoint of the interval J,, then 

(3) is said to be of retarded or advanced type, 
respectively. Most of the results obtained have 
been from work on equations of retarded type. 
In this case, the maximal length of the interval 
I, is called the retardation (delay, lag, deviation, 
etc.), and (3) is often called the retarded dif- 
ferential equation, delay differential equation, 
or differential equation with lag (retardation, 
deviating argument). 


B. Historical Remarks 


Functional differential equations of a sort were 
studied by Johann Bernoulli in the 18th cen- 
tury in connection with the string problem. 
Since then, a good deal of work has been done 
in this field by many people; some of this work 
was done before the beginning of this century. 
Among the investigators, Volterra [1, 2] is 
noteworthy for his systematic study on rather 
general equations related to problems of 
predator-prey populations and viscoelasticity, 
though his results were ignored by his con- 
temporaries. In the early 1940s, Minorsky 
[3], in his famous study of ship stabilization, 
pointed out the importance of delay effects in 
control theory, with many of the modern 
issues first appearing in his work [4]. Mishkis 
[5] studied linear systems extensively, and 
Driver [6] gave a unified representation for 
functional differential equations. Important 
achievements were given by Krasovskii [7] 
and Bellman and Cooke [8]. They laid the 
foundation for the qualitative theory of func- 
tional differential equations. Inspired by these 
works, many books (such as [9-14]) were 
published. Owing to these books together with 
many articles on this field, the theory of func- 
tional differential equations has become an 
important branch in the theory of differential 
equations. 

Equations of advanced type are treated in, 
e.g., [5,8], but qualitative theory for them has 
hardly been established. The equation 


X(t)2ax(t)--bx(4t) on  Oxt«oo (4) 


is of advanced type if 2» 1, and its analytic 
solution has been studied in detail. Equation 
(3), whose right-hand side involves differential 
operators, e.g., x(t) — f(t, x(t), x(t — h), X(t — h)), 
is said to be of neutral type [5,8, 14], and is 
generally considered to be of neither retarded 
nor advanced type because of its distinctive 
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features. However, Hale [14] originated the 
study of a class of equations of neutral type for 
which the general qualitative theory can be 
developed in the same manner as for equations 
of retarded type. 

For the case of infinite retardation, proper 
choice of the space of functions x contained 
in the right-hand side of (3) is of great signifi- 
cance. À general treatment of the phase space 
for equations with infinite retardation in an 
axiomatic setting is given in [15]. 


C. Phase Space 


For simplicity, let J, always be the interval 

[t — ^, t] with a finite retardation h 0. A 
solution of (3) starting at t — 1 is a function 
defined on [t — h, a) for a>r such that the 
solution coincides with a preassigned function 
(the initial function) on [:— h, x] and is con- 
tinuously differentiable and satisfies (3) on 

[t, a). Since a solution starting at t — t must be 
continuous for t Z2 t, it is quite natural, as is 
done in much of the literature in order to 
develop a qualitative theory, to choose the 
space C([t — ^, t], R”) of continuous R"-valued 
functions on [t — ^, t] as a space of functions x 
involved in the right-hand side of (3). Intro- 
ducing a symbol x, which represents an ele- 
ment of C 2 C([ — 4,0], R”) defined by x,(s)= 
x(t+s) (se[ —^,0]), we can rewrite (3) in a 
more convenient form: 


X(t) — f(t, xj), (E) 


where the function f(t, o) is defined on Rx C 
(or on its subspace). Here C is considered 

to be a Banach space with the norm ||9|| = 
Dat. o] 9 (5)], |: | being a norm in R”. The 
space C is said to be the phase space of (E), 
and the initial condition can be written as 


x=% (1, OeRxC. (5) 


D. Initial Value Problem 


Let f(t, o) be a continuous functions defined 
on an open domain DcR x C. The initial value 
problem is to find a solution of (E)-(5) for a 
given (t, £) e D. Many fundamental results hold 
as for ordinary differential equations: (a) There 
always exists a solution of (E)-(5) under the 
continuity of f on D. Here, the solution means 
the one to the right. No general existence 
theorem is given for the solution to the left. (b) 
The solution of (E)-(5) is unique, if f satisfies 
the Lipschitz condition: | f(t, 9) — f(t, )| x tie 
—! in a neighborhood of each point of D, 
where L is a constant depending on the neigh- 
borhood. (c) If f varies in the space C(D, R") 
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equipped with the compact open topology and 
if the solution of (E)-(5) is unique for every 
(t, č)e D, then the solution is continuous as a 
function of (t, £, f). (d) If f is completely con- 
tinuous on D, that is, f is a continuous map- 
ping of bounded subsets of D into bounded 
sets in R", then every solution can be con- 
tinuable to the right as long as it remains 
bounded or stays away from the boundary of 
D. This assertion is no longer true in the 
absence of complete continuity. 

To prove the existence theorem, Schauder's 
‘fixed point theorem is utilized; the Picard 
successive method is also effective under the 
Lipschitz condition. Consider the case when 
f(t, 9) can be written as f(t, 9) — g(t, (0), o), 
where g(t, x, o) is defined for (t, x, g)eR x R” x 
C, and g(t, x, 9) — g(t, x, W) if o(s) - v(s) on 
[ — ^, —6], 6>0. Equation (1) is one such case, 
where h=maxh,, ô= min hy. Then, g(t, x, x,) is 
a function of (t, x) alone on [t,t +ô] under the 
given condition (5), that is, (E) is reduced to 
an ordinary differential equation. This makes 
it possible to find a solution of (E)-(5) by 
matching successively the solutions of ordi- 
nary differential equations on [t,t + ô], [t+ 
ô, t - 26], .... This is the step-by-step method, 
which is effective even for equations of neutral 
type with the same property. 

Under uniqueness, the solution x(t) of (E) 
induces a mapping T(t, x): G(t, 7) >C, tz x, 
which maps x,€ G(t, x) to xe C, where G(t, x) c 
C is the set of € for which the solution of (E)- 
(5) is continuable up to t. Then 


T(t,s)T(s,t)=T(t,t) for tsr, (6) 


and T(t, x) is strongly continuous in t. If (E) is 
autonomous, that is, f(t, o) is independent of t, 
then there exists a mapping T(t), t 2 0, satisfy- 
ing T(t — 1) — T(t, x), and (T(t)), on becomes a 
one-parameter semigroup. 


E. Linear System 


When f(t, o) is continuous on J x C for an 
interval J and is linear in ọ, equation (E) is 
said to be a linear system (denoted by (L)). In 
this case, f(t, o) satisfies the Lipschitz con- 
dition with L= L(t) on I x C for a continuous 
function L(t) of I, which also implies that f is 
completely continuous. Thus the solution of 
(L) — (5) uniquely exists over Te, 00)N J for any 
(7, $)eI x C, and the mapping (the fundamental 
or solution operator) T(t, t) is a *bounded 
linear operator on C for any t, rel t2 x, and 
‘compact if tz x 4- h. T(t,t) corresponds to the 
fundamental matrix for ordinary linear dif- 
ferential equations, but it is not invertible in 
general. 

A continuous function f(t, g) linear in o 
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may be expressed in the form 


f(t,9)— | [d,n(t, s) ] (s), ?) 
-h 


where n(t, s) is an n x n matrix defined on I x 
[ —h,0], which is measurable in (t, s) and of 
*bounded variation in s with a ttotal variation 
less than L(t), and the integration is that of 
Stieltjes with respect to s. From this fact it 
follows that the range of the initial functions 
can be extended to the space of fpiecewise- 
continuous functions, and the solution x(t) of 
the nonhomogeneous linear system X(t) = 
S(t, x,)+ p(t) through (t, £)e1 x C can be rep- 
resented by the constant variational formula 


x(t)  [T(t, he lu 
F ik [T(.s](0)p(s)ds, t2, 


where I'(s) — E (the unit matrix) for s=0, and 
I (s) - 0 (the zero matrix) for s « 0. 


F. Autonomous Linear System 


Let (L) be an autonomous linear system 
X(t) — f(x). 


In this case, (7) is not different from the Riesz 
representation theorem f(~)= f?., [dn(s)] o(s). 
For (AL) the fundamental operator T(t) plays 
an extremely important role. As was seen, 
{T(t)},50 is a one-parameter semigroup of 
bounded linear operators T(t) which is strong- 
ly continuous in t>0 and compact for t>h. 
Thus the asymptotic behavior of the solutions 
of (AL) are determined by the distribution of 
the spectra o(A) of the infinitesimal generator 
A of T(t), which is given by Ag=@ with the 
domain D(A)= (o€eC|9€C and $(0)— f(o)]. 
The properties of T(t) assert that: (a) ¢(A) 
consists of point spectra alone. (b) The number 
of A, — (4€c(A)| Re AZ a} is at most finite for 
any «€ R. (c) For every A4€0(A) the dimension 
of the generalized eigenspace of 4 is finite. (d) 
The spectra of A coincide with the roots (char- 
acteristic roots) of the characteristic equation of 
(AL), 


0 
det E — | e^ ants) =0, 
—h 


together with their multiplicities. (e) e% e 
o(T(t)), t 20, if and only if 4eo(A). 

Let P,, xe R, be the linear space spanned by 
the generalized eigenfunctions corresponding 
to a 4€ A,. Then, (a) P, is invariant under T(t), 
that is, T(t) P, c P, for t 2 0; (b) the restriction 
of T(t) to P, is invertible and hence extend- 
able over te R; (c) if £e P,, then [T(E] (0)= 
X ea, Palt, é)e™, where p,(t, č) are polynomials 


(AL) 
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in t and linear in Z (d) there is a direct-sum 
decomposition C = P, 4- Q, such that Q, is in- 
variant under T(t), the projection along Q,z,: 
C— P, is bounded, and there exist positive 
constants K, e for which || T(t)£|| x Ke'** *' | £ |, 
tz0 if €€Q,. Hence for any čeC the solution 
x(t) of (AL) through Z at t =0 satisfies 


Mcr palt, Taje” |< K(1 + |nl)e* "Ié, 


tz0, (8) 


where ||z,|| denotes the operator norm of z,. 
The set Q = ( Last, may contain an element E 
other than the zero element, but T(t)é must be 
identically zero for t 2 hn. 

For a linear system (L) with f(t, ) œ- 
periodic in t, similar conclusions result, with 
Ten, 0) in place of A; and relation (8) holds, 
where A, is defined similarly by replacing o(A) 
by ((1/o)logu| ue o(T(o, 0))} and p,(t, £) are 
polynomials with c-periodic coefficients. This 
corresponds to *Floquet's theorem for ordi- 
nary periodic linear systems. ue ot fie, 0)) and 
(1/o)1og u are said to be the characteristic 
multiplier and the characteristic exponent, 
respectively. 


G. Stability Problem 


The concept of stability can be defined and 
studied in the same spirit as for ordinary dif- 
ferential equations, and the Lyapunov second 
method also turns out to be effective. For 
instance, the zero solution of (E) with f defined 
on Dy —[0, oo) x (pee C | lol] <H} for H>0 is 
said to be uniformly asymptotically stable if 
there are a constant a 0 and positive func- 
tions ó(c) and c(s) of £0 such that any solu- 
tion x(t) of (E) satisfies 


|x(t)| « e as long as x(t) exists, (9) 


whenever ||x,|| « ó(cz) and t2 1 or ||x,|| <a and 
tZ 1 4- o(c) for t>0. In the above definition, if f 
is completely continuous on Dy and if0<ée< 
H, then the phrase "as long as x(t) exists" is 
redundant, and (9) is equivalent to the claim 
that x(t) exists for all t2 x and |x(t)| <£. Under 
the Lipschitz condition and the complete 
continuity on f in (E), the zero solution of (E) 
is uniformly asymptotically stable if and only if 
there exists a continuous R-valued function 
(Lyapunov function) V(t, 9) defined on Dy, for 
O« H, <H such that 


(i)  a(|e(Q)) € Vit, ois bla) 
(ii) | D'*V(tx)z —cV(t, x) 


as long as (t, x,)€ Dy, for a solution x(t) of (E), 
where a(r), b(r) are continuous functions with 
a(r)>0 for r>0, b(0) 20, c>0 is a constant, 
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and D* denotes the upper-right Dini deriva- 
tive. Furthermore, the Lyapunov function can 
be endowed with a Lipschitz condition |V(t, o) 
— V(t,U)| € L, o —v | for some constant L, if 
f in (E) is uniformly Lipschitzian, that is, the 
coefficient L is constant over the domain. The 
Lipschitz condition together with (ii) makes it 
possible for a solution y(t) of the perturbed 
equation 


y(t) — f(t, y) + P(t, y) (10) 
to satisfy 
DË V(t, y) < —cV(t y) + Li [P(t y]. 


By using this fact, the stability of (10) can be 
discussed. 

However, for functional differential equa- 
tions it is quite a difficult problem to con- 
struct a suitable Lyapunov function for a 
given (E). 

Many of the attempts to improve the suffi- 
ciency condition that have been made up to 
now are such that a stability property can be 
verified by means of a simple Lyapunov func- 
tion. The main effort has been devoted to 
replacing condition (ii) by another type of 
condition. One of them is 


(ii*)  D^V(tx)« —c(Ix(t)]) 


under the uniform boundedness of f(t, ol, 
where c(r) is a continous function with c(r) 0 
for r>0. Another one is for the case when 

V(t, q) = W(t, p(0)), where W(t, x) is defined for 
(t, X)e R x R”. In this case, (ii) can be replaced 
by 
(i**) | D^V(tx)« —cV(t, xj) 

whenever HU +s, x,,,) « F (V(t, x,)) for se 

[—h, 0], where c» 0 is a constant and F(r)isa 
continuous function satisfying F(r)>r for r>0. 
The condition (11**) was given by B. S. Razumi- 
khin and provides an easier way to construct 

a Lyapunov function. 

For a linear system, uniform asymptotic 
stability implies || T(t, t)|| < 1/ó(1) for t z x and 
|| T(t, t)|| € 1/2 for t2 t+ o(x/2) - h. These facts 
together with (6) show that the zero solu- 
tion is exponentially stable, that is, |x(t)| < 
Ke "^? lx. ll for tz z, where v = (log 2)/(o(a/2) + 
h), K 2 2/ó(1). From (8) it follows that the zero 
solution of (AL) is uniformly asymptotically 
stable if and only if 


(4€o(A)| ReA20] =. (11) 


H. Equations of Neutral Type 


To deal with an equation of neutral type, such 
as 


X(t) — f(t, x,, Xj), (12) 
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a phase space such as C! 2 C! ([ — ^, 0], R") of 
continuously differentiable functions should 
be chosen. The solution of (E) will become 
smooth as time elapses if f in (E) 1s sufficiently 
smooth. However, this property cannot be 
expected for (12), and the solution of (12) — (5) 
need not belong to C! even if £e C! without a 
specific condition on & such as &(0) = f(z, é, 8). 
Such a restriction on initial functions is an 
obstruction to the development of the general 
theory. Equations of the form 


{Die x) m fit x) (N) 


cover a fairly general class of equations of 
neutral type, where D(t, o) and f(t, o) are 
defined on an open set D cR x C and a solu- 
tion of (N)-(5) means a continuous function 
x(t) defined on [t — h, a), a>t, which satisfies 
(5) and 


D(t, x)= DIS, £) 4 E x)ds on [t,a). 


(E) corresponds to the case where D(t, 9) — 
Q(0), while X(t)= G(t, X,) - g(t, x,) is reduced 
to the form (N) if G(t, o) is linear in o and 
continuously differentiable with respect to t by 
setting D(t, o) = o(0) — G(t, o) and f(t, o) — 
g(t, v) — (0/01) G(t, p). 

A continuous function D(t, o) linear in 9 is 
said to be atomic at O if in the representation 


0 
D(t, ol | [d,u(t, s) | e(s) 
-h 


(see equation (7)) P(t)= p(t, 0) — u(t, — 0) exists 
and is nonsingular, which is equivalent to 


D(t, o) = P(t)o(0) 3 | [d,uo(t, s) ] o(s) (13) 


with a nonsingular matrix P(t) and a matrix 
Holt, s) whose total variation with respect to s 
on [— 6,0] tends to 0 as o0 locally uni- 
formly in t. A nonlinear function D(t, p) is said 
to be atomic at 0 if D(t, p) has a continuous 
*Fréchet derivative D, (t, 9) with respect to o 
and D,(t, o) is atomic at 0. In equation (N), 
D(t, o) is always assumed to be atomic at 0, 
and many of the fundamental theorems for (E) 
are also valid for (N). Let T(t) be the operator 
solution of the autonomous linear equation 


“ Dix) = f(x). (14) 


Then {T(t)},>o is a strongly continous semi- 
group of bounded linear operators, and the 
corresponding tinfinitesimal generator A has 
the domain D(A)={geC|@eEC and D(ġ)= 
f(q)]. The properties of the spectra o(A) are 
the same as for (AL), except that the character- 
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istic equation is now given by 


0 0 
cet un | eduts) | canis) |=0 


—h =h 


and the property (b) is true if x> ap, where 
D(q), f(@) are assumed to be of the forms (13) 
and (7), respectively, without the argument f, 
and where dp is given by 


0 
det] P+ |” eus |=0. 
-h 


Similarly, the decomposition C = P, +Q, is 
possible if x» ap. Hence, to obtain the uniform 
asymptotic stability of the zero solution of (14), 
the condition ap <0 should be assumed in 
addition to (11). The linear function D(q) is 
said to be stable if a5 «0. If D(«) is stable, then 
the Lyapunov method is also applicable, as a 
sufficient condition, to the stability problem of 
(N) with linear D(q) instead of general D(t, o), 
but in this case condition (i) for V(t, o) should 
be replaced by. 


a(|D(o))) < V(t, e) « (lo). 


dp - max {Re A 





I. Infinite Retardation 


There are some basic differences between cases 
of finite retardation and of infinite retardation. 
For example, in the definition of stability, the 
inequality |x(t)| « € in (9) can be replaced by 
Ile with no difference in the case of finite 
retardation, but this replacement yields a 
different concept deeply connected with the 
choice of the phase space in case of infinite 
retardation. 

There are several ways to define a phase 
space for a functional differential equation 
with infinite retardation. One of those phase 
spaces generally used is a linear space X of 
functions: (—00,0] - Ri with a seminorm ||- ||, 
such that if a function x:( —oo, a) ^ R" satisfies 
x,€ X and it is continuous on [t, a], then 


(i) x,€ X for all tet, a), 
(ii) ^ x,is continuous as a function [t, a) X, 
(d) — m|x()| & |x My < K max, ep, | X(5)] 

+ M(t—1)|lx,Ilx, 


where m and K are positive constants and 
M(t) is a continuous function. 

If X satisfies the foregoing conditions and if 
f(t, @) is continuous on an open domain Dc 
R x X, then the local properties (a)—(c) in Sec- 
tion D hold, and so does (d) when D=R x X. 
On the other hand, if X has a fading memory, 
namely, 


(iv) | M(t) Me " in (iii) for positive con- 
stants M, u, 
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then the procedure in Section F is applicable 
by restricting o(A) to o,(4)— (4€eo(A)| Reiz 

— u} and, hence the zero solution of (AL) with 
infinite retardation is uniformly asymptotically 
stable under (11). 

For 0<h<@ and 0€ y « oo, both the space 
C, , of continuous functions @:(—h,0]—R’ 
with a finite limit lim, ._,e”@(s) and the space 
M, , of measurable functions o:(— A4, 0] o R" 
with (9. e^*|o(s)| ds < co satisfy the foregoing 
conditions, where the norms are given by 
lellc, SUP 1,oje"lo(s)| and lolim, , — 100) 
+ f? e^*|o(s)| ds. These spaces have a fading 
memory if h< oo or y » 0. 

In much of the literature in which the equa- 
tion of infinite retardation has a form like 
(2) or (4), the space CB of bounded continu- 
ous functions or the space C, of continuous 
functions with compact supports are suffi- 
cient as a phase space under the norm To = 
Sup, « o| 9 (s). However, it is to be noted that 
C, is not complete when CB does not satisfy 
condition (ii). When CB is chosen as the phase 
space, in order to show the existence theorem, 
f(t, x,) should be continuous for any bounded 
continuous function x in addition to the cont- 
inuity in (t, 9). This condition is satisfied if J, 
— [g(t), t] in (3) for a continuous function 
g(t) € t. Equation (2) or (4) with 0 x 4 « 1 give 
rise to such a case, but g(t) must be equal to 0 
in (2) and (4) with 4—0. If g(t) 5 oo as t oo, 
the Razumikhin condition (ii**) D * V(t, xj) < 
—cV(t, x,) whenever His, x,) « F (V(t, x,)) 
(se[g(t), t]) for a Lyapunov function is effec- 
tive, but one can conclude only that x(t) 0 as 
t— oo without uniformity. 
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A. Definition [1—3] 


Let C(X) be the tBanach algebra of all con- 
tinuous complex-valued functions on a com- 
pact Hausdorff space X with pointwise oper- 
ations and *uniform norm. A function algebra 
(or uniform algebra) on X is a closed sub- 
algebra A of C(X) containing the constant 
functions and separating the points of X, i.e., 
for any x, ye X, x# y, there exists fe A with 
f(x) z f(y). Ho xe X, denotes the evaluation 
mapping f — f(x) of A, the correspondence x^ 
Q9, is a homeomorphism of X into the tmaxi- 
mal ideal space M(A) of A. Since f (9) = f(x) 
for any xe X and fe A, the fGel’fand represen- 
tation of A is an isometric isomorphism. By 
identifying x with ọ,, we regard X as a closed 
subset of Mt(A) and the Gelfand transform f 
Ted, as a continuous extension of f to 9t(A). 


B. Examples [1,3,4] 

For a compact plane set K let P(K) (R(K)) be 
the subalgebra of all functions in C(K) that 
can be approximated uniformly on K by poly- 
nomials (rational functions with poles off K). 
A(K) denotes the subalgebra of all functions in 
C(K) that are analytic in the interior of K. 
These are function algebras on K and P(K)& 
R(K)& A(K). When K is the unit circle T= 
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{|z|=1}, P(T) is called the disk algebra, the 
most typical concrete example. 

The theory of function algebras emerged 
from attempts to solve, by means of functional 
analysis, certain problems in complex analysis, 
especially problems of uniform approximation, 
e.g., when does A(K)= P(K) or A(K)=R(K) 
hold? 

Another important example is given by the 
algebra H,,(U) of all bounded analytic func- 
tions on a bounded plane domain U with the 
supremum norm || f ||, =sup{|f(z)||ze¢U} [5]. 
Since the Gel'fand representation is an iso- 
metric isomorphism of H,,(U) into the algebra 
COUOH (U))), H,,(U) is viewed as a function 
algebra on the space M(H (U)). Further 
examples result if we take K or U in a Rie- 
mann surface or in the complex n-space. 

We list a few abstract function algebras that 
reflect certain relevant properties possessed 
by concrete examples. A is called a Dirichlet 
(resp. logmodular) algebra if the set Re A= 
{Re f|feA} (resp. log| A7! |- (log|fl|f, fe 
A1) is dense in Cy(X), the space of all con- 
tinuous real-valued functions on X. It is called 
hypo-Dirichlet if the closure of Re A has finite 
codimension in Cp(X), and the linear span of 
log|A !| is dense in Cp(X). 

In the following, A denotes a function alge- 
bra on X unless otherwise specified. 


C. Boundary and Representing Measure 


[1-4] 


A subset E of X is a boundary for A if for any 
f€4A there exists xe E such that | f(x)|= l| f ||. 

A closed boundary is a boundary closed in 

X. G. E. Shilov proved that there is a smallest 
closed boundary, 0A, which is called the Shilov 
boundary for A. A positive Borel measure u on 
X is a representing measure for o € 9R(A) if To) 
=| fox) du(x) for all fe A. Each oe MA) has a 
representing measure supported by 0A. The 
Choquet boundary, c(A), consists of all xe X 
such that the evaluation ọ, at x has a unique 
representing measure. Then c(A) is a bound- 
ary, whose closure is 0A. If X is metrizable, 
c(A) is a G; set in X and supports a represent- 
ing measure for every member of M(A). 

For ge Wt(A), M, denotes the set of repre- 
senting measures for o It is a tweak* com- 
pact convex subset of the space of measures 
on X. M, is a singleton if A is Dirichlet or log- 
modular. It is finite-dimensional if A is hypo- 
Dirichlet. The case dim M, « 4-oo has been 
studied in detail. Extensive studies for the case 
dim M, — +œ have been done only for con- 
crete examples related to polydisks, infinitely 
connected domains, etc. 

The notion of boundary reflects the tmaxi- 
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mum modulus principle for analytic func- 
tions, and representing measures come from 
*Poisson's integral formula. The most relevant 
maximum principle is H. Rossi's local maxi- 
mum modulus principle: For any closed set K 
in M(A), we have [f [lk = |f llsakuineA) for all 
f €A, where bd K is the topological boundary 
of K and |f ll; —sup(lf(s)] | seS]. A corre- 
sponding result for function spaces was con- 
sidered by Y. Hirashita and J. Wada. Closely 
related to representing measures are the or- 
thogonal measures for A, i.e., complex Borel 
measures u on X such that f f du =0 for all 

f € A; they are often useful in studying func- 
tion algebras by means of the duality tech- 
nique. The set of orthogonal measures is 
denoted by A+. 


D. Peak Sets [1—3] 


A subset K of X is a peak set for A if there 
exists f € A such that f(x) 2 1 for xe K and 
If(y)] « 1 for ye X — K. K isa generalized peak 
set if it is the intersection of peak sets. A point 
x€ X is a (generalized) peak point if the set 

{x} is a (generalized) peak set. The set of 
generalized peak points equals the Choquet 
boundary for A. If X is metrizable, the peak 
sets and generalized peak sets coincide. There 
exist X and A such that X is metrizable: X — 
3 (A) — c(A), but A z C(X) (B. Cole). A sub- 
set E of X is interpolating for A if for any 
bounded continuous function u on E there 
exists f c A with f| E— u. Then a closed G; set 
K in X is an interpolating peak set if and only 
if ue At implies |u|(K) -0 (E. Bishop). 


E. Antisymmetric Decomposition [1-3] 


A subset F of X is a set of antisymmetry for A 
if every function in A which is real-valued on F 
is constant on F. Bishop’s antisymmetric de- 
composition then appears as an extension of 
the *Weierstrass-Stone theorem on uniform 
approximation. It is a refinement of Shilov's 
decomposition and reads as follows: If (E, is 
the family of maximal sets of antisymmetry for 
A, it Is a partition of X into generalized peak 
sets such that fe C(X) with f|E,e A|E, for all 
x belongs to A. An interesting connection was 
found by J. Tomiyama between the maximal 
antisymmetric decomposition of X relative to 
A and that of MA) relative to A. 


F. Parts and Analytic Structure [1—4] 


A. M. Gleason defined an equivalence relation 
~ in M(A) by setting o ~y if sup{| f(o)— 
SWI SEA, If | <1} <2. Each equivalence 
class for this relation is a part (or a Gleason 
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part) for A. Two points q, V belong to the 
same part if and only if there exist mutually 
absolutely continuous representing measures u 
for ọ and v for y such that c! « du/dv <c for 
some constant c 0 (Bishop). An analytic 
structure in 9c (A) is a pair (V, x), with an tana- 
lytic set V in some open subset of C" and a 
nonconstant continuous mapping tT: V>M(A) 
such that fot is analytic on V for all fe A. 
For such a structure, z(V) is always within a 
part. But there can be a nontrivial part with 
no analytic structure, as was shown by G. 
Stolzenberg. A topological characterization 

of parts was obtained by J. Garnett. Sample 
results in the positive direction are: (1) If the 
ideal A,={ fe A fiel — 0j is finitely generated, 
there ts an analytic structure (V, t) such that + 
is a homeomorphism of V onto an open neigh- 
borhood of o (Gleason); (2) if g has a unique 
representing measure and if the part P of ọ is 
not a singleton, there is a bijective continuous 
mapping t of the open unit disk onto P such 
that Z o1 is analytic for fe A (J. Wermer, K. 
Hoffman and G. Lumer); (3) if A is hypo- 
Dirichlet and if a part P is not a singleton, P 
can be made into a 1-dimensional tanalytic 
space so that each fe A is analytic on P (J. 
Wermer and B. V. O'Neill). Analytic structures 
in *polynomially convex hulls of curves in C" 
have also been studied [3, 4]. 


G. Abstract Function Theory [1,6-9] 


Let o €99i(4), and choose me M,, which is 
fixed. The generalized Hardy class H,(m), 0 

« p € oo, associated with A is the closure 
(weak* closure, if p= oo) of A in the İL, space 
L (m) on the measure space (X, m). Under 
suitable restrictions on A, o. or m, we can 
recapture some of the important classical facts, 
most of which have their origins in the works 
of A. Beurling, R. Nevanlinna, F. and M. 
Riesz, and G. Szegó. In this area, H. Helson 
and D. Lowdenslager came up with a powerful 
method using orthogonal projections in Hil- 
bert space and gave together with S. Bochner's 
remark, a strong influence for subsequent 
development. The modification argument was 
then devised by Hoffman and Wermer, in- 
spired by F. Forelli. After Hoffman's detailed 
study of logmodular algebras, Lumer observed 
that most results remain valid when o has 

a unique representing measure. T. P. Srini- 
vasan and J.-K. Wang (— [8]) introduced the 
notion of weak* Dirichlet algebra and showed 
that some major theorems are mutually equiv- 
alent and are in fact measure-theoretic. With 
the Hoffman-Rossi complement, their result 
now states that for fixed me M, the following 
are equivalent: (i) A+ A is weak* dense in 
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L (m), i.e., A is a weak* Dirichlet algebra on 
(X, m); (ii) if ue M, is absolutely continuous 
with respect to m, then u =m; (iii) if a closed 
subspace M of L,(m) is simply invariant in the 
sense that 4, M € M and A,M is not dense in 
M, then M =qH,(m) with qe L mt |q|=1 
a.e.; (iv) the set log| H, (m) ! | coincides with 

L (mr R), the set of real-valued elements in 

L ,, (m); (v) for every we L,(m), w>0, we have 
inf(f11 — f| wdm]| fe A.) 2 exp(flogw dm); (vi) 
the linear functional h>{hdm on H. (m) has a 
unique positive extension to L..(m). Further 
extensions were subsequently made by use of 
the conjugation operator (— Section K) by 

T. Gamelin, H Kónig, Lumer, K. Yabuta, 
and others. Some more properties of weak* 
Dirichlet algebras have been obtained by T. 
Nakazi. 


H. Generalized Analytic Functions [1,10] 
Let I be a dense subgroup of the additive 
group R of the reals with discrete topology, 
and let G be the character group of I’. Each 
gel, as a character of G, defines a continuous 
function y, on G. Let A be the closed sub- 
algebra of C(G) generated by {y,|aeT,a>0}. 
A is a Dirichlet algebra but is far more difficult 
to describe than the disk algebra. The study of 
this algebra, especially that of invariant sub- 
spaces, has evolved from papers by Helson 
and D. Lowdenslager. Let o be the *Haar 
measure of G and H,(o) the closure of A in 

L (e). A closed subspace M of L;(o) is called 
invariant if y, M S M for all ae, az 0. M is 
called doubly invariant if y, M € M for all ae 
I. Otherwise, it is called simply invariant. In 
fact, only the latter is interesting. Let e,€ G 
with teR be the character of I defined by 
e,(a) ^ e^. Then the mapping te, is a faith- 
ful representation of R into G. A cocycle on 

G is defined to be a Borel function B on G x 
R such that (i) | B(x, )|= 1, (ii) B(x +e,, t) = 
B(x, s) B(x, s-- t) for xeG and s, teR. Two 
cocycles are identified if they differ only on 

a null set in G x R. For a cocycle B, let Mp 

be the set of fe L;(c) such that B(x, t) f(x + 
eje H,(dt/(1+ t^)) for almost all xe G, where 
H, (dt/(1 + t?)) is the closure, in the space 

L ,(dt/(1 4- t?)) on R, of the set of boundary 
value functions on R of bounded analytic 
functions on the upper half-plane. Then the 
mapping B Mj is a bijection from the set of 
cocycles onto the set of simply invariant sub- 
spaces M of L,(a) such that M=(\{y,M|ae 
I,a «01. Moreover, M, =qH,(c) for some 

qe L,,(0), |q| (ae, if and only if B(x, t)= 
q(x) q(x + ej), i.e., B is a coboundary. When T # 
R, there is a cocycle that is not a coboundary. 
Further studies have been done by Helson, 
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Gamelin, J. Tanaka, and others. On the other 
hand, F. Forelli observed that tflows in com- 
pact spaces give rise to a kind of analyticity. 
In the above case, the algebra A consists of all 
f €C(G) that are analytic with respect to the 
flow T/(x) 2 x - e, for xeG and te R. General- 
ized analytic functions induced by flows in 
general have been studied by Forelli and P. S. 
Muhly. 


I. The Unit Disk [1,6,7, 11, 12] 


Let A be the disk algebra P(T), D the open 
unit disk {|z|< 1}, and m the normalized Le- 
besgue measure on T. The algebra A has been 
the most important model in the theory of 
function algebras, and we find here the origin 
of many abstract results. Some typical results: 
(1) every orthogonal measure for A is abso- 
lutely continuous with respect to m (F. and M. 
Riesz); (11) a closed set E in T is an interpolat- 
ing peak set for the tGel’fand transform A 

of A if and only if m(E) 20 (W. Rudin and L. 
Carleson); (iti) A is maximal among the closed 
subalgebras of C(T) (Wermer); (iv) a function 
f €C(CI D) belongs to A if f is analytic at every 
zeD with f(z) Z0 (T. Rado). 

The generalized Hardy class H,(m), 0< p< 
oo, associated with A is viewed as the set of 
nontangential boundary value functions of 
elements in the classical *Hardy class H,(D). 
Here we find the origin of invariant subspace 
theorems: A closed subspace M (+ (0j) of 
H,(m) is invariant, i.e., AM € M, if and only 
if M =qH,(m) with qe H, (m), |g| ^ 1 a.e. 
(Beurling). 

The algebra H = H,, (m) is a weak* Diri- 
chlet algebra, whose Shilov boundary is identi- 
fied with the maximal ideal space X of L, (m). 
We have L,,(m;R)=log|(H”) ! |, and a fortiori 
H. is logmodular on X. The mapping z o, 
embeds the disk D in M(H) as an open set. 
The structure of M(H) was studied in detail 
by I. J. Schark, Hoffman, and others. We finish 
with three remarkable results: (1) D is dense in 
9Xt(H,.) (Carleson). This is the corona theorem 
and was proved in the following equivalent 
form: For any f,, ..., f, H,(D) with | f,|+...+ 
| f,|2e>0 on D, there exist g,,...,9,€ Ho (D) 
with f;g,+...+f,9,= 1. A simple proof was 
discovered by T. Wolff [12]. (ii) The convex 
combinations of tBlaschke products are uni- 
formly dense in the unit ball of H,, (D) (D. 
Marshall) [12]. (iii) Every closed subalgebra B 
between H. and L (m) is a Douglas algebra, 
Le, B is generated by H. and the complex 
conjugates of a family of inner functions (S.-Y. 
Chang and Marshall) [11]. The proof of (iii) is 
an interesting application of the theory of 
tbounded mean oscillation. 
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J. Rational Approximation [1,2] 


The problem of rational (polynomial) approxi- 
mation on a compact plane set K asks when 
A(K)— R(K) (A(K)= P(K)) holds. An impor- 
tant tool for such problems is Cauchy's trans- 
form of measure u: A) — [(z — C)! du(z). Using 
this, one can show, for instance, the following: 
(1) A(K) — P(K) if and only if K has a con- 
nected complement (S. N. Mergelyan); (2) fe 
C(K) belongs to R(K) if each point ze K hasa 
closed neighborhood V with flave RIKANO V) 
(Bishop); (3) A(K) 2 R(K) if the diameters of 
the components of C — K are bounded away 
from zero (Mergelyan); (4) C(K)= R(K) if and 
only if almost all points of K are peak points 
for R(K) (Bishop). The last result cannot be 
extended much because there is a set K such 
that A(K) z R(K), while A(K) and R(K) have 
the same peak points (A. M. Davie). 

A complete characterization for A(K)— 
R(K) was obtained by A. G. Vitushkin: (5) 
The following are equivalent: (1) A(K)= 
R(K); (ii) for any bounded open set D in C, 
a(D —K)=a(D— K°); (iti) for any zebd K, 
there exists r>1 such that limsup; ,. «(A(z;ó) 
— K9y/a(A(z; ró) — K) « +00, where A(z; ô) is 
the disk (weC||w— z| «ó] and a(E), for any 
bounded set E in C, ts the continuous ana- 
lytic capacity of E, which is the supremum of 
| f (o0)! for all continuous functions f on the 
Riemann sphere CU {æ} such that |f| x1, — 
f(o0) 0, and f is analytic off a compact sub- 
set of E. As for uniform or asymptotic ap- 
proximation on noncompact closed subsets 
in C, Carleman's classical study has recently 
been extended by N. U. Arakelyan, A. A. 
Nersesyan, A. Stray, and others in an interest- 
ing way. 

In connection with rational approximation, 
we should note detailed studies on pointwise 
bounded approximation in H (UL U being 
a bounded open set in C, by O. J. Farrell, L. 
Rubel and A. Shields, Gamelin and J. Garnett, 
and Davie. 


K. Further Topics 


(1) Conjugation operator [9, 13]. Take any 

me M,, pe WA). The conjugation operator 
then associates with each ue Re A the unique 
element Zu in C4(X) such that u+i*ue A and 
[*udm —0. After the classical inequalities of 
Kolmogorov and M. Riesz, we consider the 
following conditions with constants c, and d. 
(K) (f |*u]? dm)? € c, (|u| dm for 0 « p < 1; (M) 
(I*ul? dm)? <d (f u|? dm)"? for 1 < p< oo. Let 
m be arbitrary. Then the inequality (K) is valid 
for ue Re A, u>0. The inequality (M) is valid 
for all ue Re A if p is an even integer 22; it is 
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valid for all ue Re A, u>0, if p is not an odd 
integer. All remaining cases have counter- 
examples. On the other hand, M, always con- 
tains an m such that log] f(9)| « log] f| dm for 
Ted (Bishop). Such a representing measure is 
called a Jensen measure for q. If m is Jensen, 
(K) is valid for all ue Re A and all 0 « p « 1, 
and (M) is valid for all ueRe A and all 1 « 
De, 

(2) Riemann surfaces. For a compact bor- 
dered Riemann surface R, let A(R) be the 
algebra of functions, continuous on CIR and 
analytic on R. Then A(R) is hypo-Dirichlet on 
the border bd R of R, and many results for the 
disk algebra are extended to A(R). Most of the 
basic results are described in [14]. The maxi- 
mality of A(R) in C(bd R) was obtained by H. 
Royden; extreme points of related Hardy 
classes were discussed by Gamelin and M. 
Voichick; and invariant subspaces were deter- 
mined by Forelli, M. Hasumi, D. Sarason, and 
Voichick. A further extension to infinitely 


connected surfaces has been obtained by C. W. 


Neville, Hasumi, and M. Hayashi in the case 
of open Riemann surfaces R of Parreau- 
Widom type, which is defined as follows: Let 
G(a, z} be *Green's function for R, and let 
B(a, a), «>0, be the first tBetti number of 
the domain (ze R|G(a, z) » a); then R is of 
Parreau-Widom type if Bio, o) da < +00. For 
such surfaces the situation looks favorable: 
For instance, the Cauchy-Read theorem is 
valid, and the Brelot-Choquet problem con- 
cerning Green's lines is solved affirmatively. 

As for the generalization of approxima- 
tion theorems of Mergelyan and Arakelyan, 
we refer to the work of Bishop and L. K. 
Kodama for compact sets and to that of S. 
Scheinberg for noncompact closed sets. 

(3) Higher-dimensional sets. Much attention 
has been paid to algebras of analytic functions 
on domains in C", n z 2, e.g., polydisks, unit 
balls, and general pseudoconvex domains. 
Polydisk algebras and ball algebras have been 
studied extensively by W. Rudin, P. Ahern, 
Forelli, and many others [15, 16]. Approxi- 
mation theorems of Mergelyan type were ob- 
tained by G. M. Henkin, N. Kerzman, and I. 
Lieb for strictly pseudoconvex domains with 
smooth boundary and by L. Hórmander and 
Wermer and L. Nirenberg and R. O. Wells, Jr., 
in the case of totally real manifolds. Further 
improvements have been obtained by R. M. 
Range, A. Sakai, and others. 
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A. History 


Leibniz used the term function (Lat. functio) in 
the 1670s to refer to certain line segments 
whose lengths depend on lines related to 
curves. Soon the term was used to refer to 
dependent quantities or expressions. [n 1718, 
Johann Bernoulli used the notation ox, and by 
1734 the modern functional notation f(x) had 
been used by Clairaut and by Euler, who 
defined functions as analytic formulas con- 
structed from variables and constants (1728) 
[1]. *Cauchy stated (1821) [2]: “When there is 
a relation among many variables, which deter- 
mines along with values of one of them the 
values of the others, we usually consider the 
others as expressed by the one. We then call 
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the one an ‘independent variable,’ and the 
others ‘dependent variables.’” tDirichlet consi- 
dered a function of xe[a, b] in his paper (1837) 
[3] concerning representations of “completely 
arbitrary functions" and stated that there was 
no need for the relation between y and x to be 
given by the same law throughout an interval, 
nor was it necessary that the relation be given 
by mathematical formulas. A function was 
simply a correspondence in which values of 
one variable determined values of another. 


B. Functions 


Today, the word "function" is used generally 
in mathematics in the same sense as a tmap- 
ping (— 381 Sets C) or, which is the same 
thing, a tunivalent correspondence (— 358 
Relations B). But this word is sometimes used 
in a wider sense, to mean a general (not nec- 
essarily univalent) correspondence, called a 
many-valued (or multivalued) function; in that 
case a univalent correspondence is called a 
single-valued function. 

Specialists in each branch of mathematics 
have their respective ways of using the word. 
In analysis, values of a function are often 
considered real or complex numbers; such 
functions are called real-valued functions or 
complex-valued functions, respectively. Fur- 
thermore, if the domain of the function is also 
a set of real or complex numbers, then it is 
called a real function or a complex function, 
respectively (— 131 Elementary Functions; 84 
Continuous Functions; 198 Holomorphic 
Functions). If the domain of a real- or 
complex-valued function is contained in a 
function space, the function is often called a 
functional; the tdistribution is an example. In 
algebra we often fix a tfield, tring, etc., and 
consider functions whose domains and ranges 
are in such algebraic systems. Special names 
are given to functions having special prop- 
erties, which can be defined according to the 
structures of the domain and the range. For 
example, when both domain and range of a 
function f are sets of real numbers, f is called 
an even function if /(t) — f( — t), and an odd 
function if f(t)= —f(— t). A function f that pre- 
serves the order relation between real num- 
bers, i.e., such that t, <t, implies f(t,) « f(t;), 
is called a tnonotone increasing function. 

A mapping from a set I to a set F of func- 
tions, Q:1— F, is called a family of functions 
indexed by J (or simple family of functions), 
and is denoted, using the form f, instead of 
Pl), by { fa}aey or ( fa} (4€ I). In particular, if 
I is the set of natural numbers, the family is 
called a sequence of functions. 
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C. Variables 


A letter x, for which we can substitute a name 
of an element of a set X, is called a variable, 
and X is called the domain of the variable. An 
element of the domain of a variable x is called 
a value of x. In particular, if the domain is a set 
of real numbers or complex numbers, the 
variable is called a real variable or a complex 
variable, respectively. On the other hand, a 
letter that stands for a particular element is 
called a constant. 

When the domain and range of a function f 
are X and Y, respectively, a variable x whose 
domain is X is called the independent variable, 
and a variable y whose domain is Y is called 
the dependent variable. Then we say y is a 
function of x, and write y — f(x). When a con- 
crete method is given by which we make a 
value of y correspond to each value of x, we 
say that y is an explicit function of x. When a 
function is determined only by a tbinary re- 
lation such as R(x, y) 2 0, we say that y is 
an implicit function of x (— 208 Implicit 
Functions). 

Given functions f, g with an independent 
variable t, suppose that y is regarded as a 
function of x defined by relations x — f(t), 

y — g(t). Then we say that y is a function of x 
with the variable t as a parameter. A function 
whose range is a given set C with variable t as 
its independent variable is often called a para- 
metric representation of C by t. 

If the domain of a function fis contained in 
a Cartesian product set X, x X, x... x X,, 
the independent variable is denoted by 
(X1, X5, ..., X,), and fis often called a function 
of n variables or a function of many variables 
(when n z 2). 


D. Families and Sequences 


A function whose domain is a set J, 9:1 X, is 
called a family indexed by 7 (or simply family), 
and I is called the index set. In the case ol 
X; (4€ I), the family is denoted by [x;],,, or 
1x4) (A€I). If the range X of a function o is a 
set of points, a set of functions, a set of map- 
pings, or a set of sets, then the family (x;);,; is 
called a family of points, a family of functions, 
a family of mappings, or a family of sets, res- 
pectively. If the set J is a tdirected set, the 
family is called a directed family. Generally, if 
J is a subset of I, the family {x} ;., is called a 
subfamily of (x;] ;.,. In particular, if J is a 
finite or infinite set of natural numbers, the 
family indexed by 1 is called a finite sequence 
or infinite sequence, respectively. Sequence is a 
generic name for both, but in many cases it 
means an infinite sequence, and usually we 
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have I 2 N. Then the value corresponding to 
n€N is called the nth term or, generally, a 
term. For convenience, the Oth term is often 
used as well. If each term of a sequence is a 
number, a point, a function, or a set, the se- 
quence is called a sequence of numbers, a se- 
quence of points, a sequence of functions, or a 
sequence of sets, respectively. A sequence is 
usually denoted by 1a, ). If it is necessary to 
show the domain of n explicitly, the sequence 
is denoted by {a,},,.,. If J is a subset of I, a 
sequence (4,1 at is called a subsequence of the 
sequence {a,},-;. And if I =N, the composite 
{ar} Of (a4) and a sequence {k,,} of natural 
numbers with k; <k, « k, ... is usually called a 
subsequence of (a, ). 
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Variation 


A. Monotone Functions 


A function (or mapping) f from an tordered 
set X to another ordered set Y is called a 
monotone increasing (monovone decreasing) 
function if 


x,«x, implies 


f(x) & fixi) 
fx) f(x). (1) 


A monotone increasing (decreasing) function is 
also called a nondecreasing (nonincreasing) 
function. In either case, the function f is called 
simply a monotone function. If X and Y are 
totally ordered sets and the inequality < (>) 
holds in (1) instead of < (>), then f is called a 
strictly (monotone) increasing (strictly (mono- 
tone) decreasing) function. In either case, f is 
called simply a strictly monotone function. 

In particular, when X and Y are subsets of 
the real line R, a monotone function is contin- 


(x;,«x4 implies 


642 


uous except for at most a countable number 
of points. Hence it is *Riemann integrable in a 
finite interval provided that it is bounded. A 
continuous real function f(x) defined on an 
interval in R is tinjective if and only if it is 
strictly monotone. In such a case, the range of 
the function f(x) is also an interval, and the 
inverse function is also strictly monotone. 
Furthermore, a differentiable real function f 
defined on an interval is monotone if and only 
if its derivative f’ is always 20 (monotone 
increasing) or always <0 (monotone decreas- 
ing). If f’>0 (<0), f is strictly monotone 
increasing (decreasing). 


B. Functions of Bounded Variation 


Let f(x) be a real function defined on a closed 
interval [a, b] in R. Given a subdivision of the 
interval a— xo «€ x, « x; <... « x,—b, we de- 
note the sum of positive differences f(x;)— 
f(x.) by P and the sum of negative differ- 
ences f(xj)) — f(x, .,) by — N. Then we easily 
obtain 


P —N —f(b) — f(a), 
PEN-Y f(x) —f(x;-1)I- 


The suprema of P, N, and P +N for all pos- 
sible subdivisions of [a, b] are called the posi- 
tive variation, the negative variation, and the 
total variation of the function f(x) in the inter- 
val [a,b], respectively. If any of these three 
values is finite, then all three values are finite. 
In such a case, the function f(x) is called a 
function of bounded variation. Every function 
of bounded variation is bounded, but the con- 
verse is not true. The positive and negative 
variations z(t), v(t) of the function f(x) in the 
interval [a, t] are monotone increasing func- 
tions with respect to t, and we have 


f69— fla) = n(x) — v(x) (2) 


if f(x) is a function of bounded variation. 
Hence every function of bounded variation has 
both left and right limits at every point. A 
monotone function is a function of bounded 
variation, and the sum, the difference, or the 
product of two functions of bounded variation 
is also a function of bounded variation. Hence 
f(x) is a function of bounded variation if and 
only if it is the difference of two monotone 
functions. The representation (2) (representing 
a function of bounded variation as the differ- - 
ence of two monotone increasing functions) 

is called the Jordan decomposition of the func- 
tion f(x). A function of bounded variation is 
Riemann integrable, continuous except for at 
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most a countable number of points, and dif- 
ferentiable talmost everywhere. 

A continuous function defined on a closed 
interval is bounded but not necessarily of 
bounded variation (eg. f(x) - xsin(1/x) (xe 
(0, 1]), 0 (x 2 0)). A discontinuous function 
may be a function of bounded variation on 
a closed interval (e.g., sgn(x)). However, an 
tabsolutely continuous function, a differenti- 
able function with bounded derivative, or a 
function satisfying the * Lipschitz condition is a 
function of bounded variation on a closed 
interval. 

The notion of functions of bounded vari- 
ation was introduced by C. Jordan in connec- 
tion with the notion of the length of curves (— 
246 Length and Area). 


C. Lebesgue-Stieltjes Integral 


Let f(x) be a right continuous function of 
bounded variation on a closed interval [a, b], 
and f(x) — n(x) — v(x) the Jordan decomposi- 
tion of f(x). Then z(x) and v(x) are monotone 
increasing right continuous functions and hence 
define bounded measures dz(x) and dv(x) on 
[a,b], respectively (— 270 Measure Theory L 
(v)). The difference dx — dv of these two mea- 
sures is a tcompletely additive set function 

on [a, b] which is often called the (signed) 
Lebesgue-Stieltjes measure induced by f, written 
df. For every function g integrable with respect 
to the measure du — dz + dv, we define [4g df 

to be equal to f;g dn — [cg dv and call it the 
Lebesgue-Stieltjes integral. [n this case, h(x) 
cl ug df is of bounded variation on [a, b] 
and the Lebesgue-Stieltjes measure dh(x) 

is denoted by g(x)df(x). If f, and f, are of 
bounded variation on To b], then so is the 
product f = f, fa, and we have 


df (x) — fi (x € 0) dfa(x) + fo(x FO) df, (x). G) 


The Lebesgue-Stieltjes integral for a continu- 
ous integrand is often called the Riemann- 
Stieltjes integral, because it can be defined in 
an elementary way similar to the definition of 
the Riemann integral. 

The notion of bounded variation can also be 
defined for interval functions on R" and set 
functions on an abstract space (— 380 Set 
Functions). 
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Functions of Confluent Type 


A. Confluent Hypergeometric Functions 


If some singularities of an ordinary differential 
equation of tFuchstan type are confluent to 
each other, we obtain a confluent differential 
equation whose solutions are called functions 
of confluent type. The equations that appear 
frequently in practical problems are the con- 
fluent hypergeometric differential equations 


d^w 


m dw E 
i2 (y ye aw = (1) 


and related equations. Equation (1) corre- 
sponds to the thypergeometric differential 
equation for which a tregular singular point 
coincides with the point at infinity and is an 
tirregular singular point of class 1. For (1), 
z=O0is a regular singular point, and a series 
solution (radius of convergence oo) is given by 


F(a, y; z) & ,Fi(0, zz 


" E zs zo (leo) Q) 
nzo n'y(y 4 1)...(y -n— 1) 
where y is not a nonpositive integer. The func- 
tion ,F, in (2) is a tgeneralized hypergeometric 
function due to Barnes and is called a hyper- 
geometric function of confluent type or Kum- 
mer function. If y is not equal to a positive 
integer, the other solution of (1) independent of 
(2) is given by z! "F(1--ca—y,2—y;z) (^ Ap- 
pendix A, Table 19.I). 


B. Whittaker Functions 


Equation (1) with w-e?2z ^"? W, y Ans Ak, 
y? Ans 4m? — 1 reduces to Whittaker's dif- 
ferential equation 


d'H ( 1 
— +| —-+ 


TEELLA, 3 
dz? 27. NF E e) 


If 2m is not equal to an integer, (3) has two 
series solutions for any finite z: 

M, m(Z) = z002*nme7*? FS 4m—k, 1 - 2m; z), 
My -m(Z) =z "e= F( —m—k,1—2m;z). 


If 2m is an integer, since the functions M, ,, 
and M, .,, are linearly dependent, E. T. Whit- 
taker considered a solution of the form 


1 
Wie, m(Z) = -zp (cho mte th 


(0+) pe amem 
d (aarti +£) edit 
E 


oo 
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If k —3— m is equal to a negative integer, this 
integral does not exist. The function 


ks E? 
= axe | p K-0/2)*m 
2 


t k—(1/2)*m 
x ( +£) e ‘dt 
z 


for Re(k —3— m) « 0 is defined for any m, k, 
and for any z except when z is a negative real 
number. We call M, m and W, m the Whittaker 
functions. 'Bessel functions are particular cases 
of these functions, and the relation 


z i2 


nO — Justia (s + 1) 


Mo, (iz) 

is satisfied. In Whittaker's differential equation, 
since W_, m(— z) is also a solution and 

W, 4 (2/W 4, wl — z) is not equal to a constant, 
W, m(z) and W_, ,( —z) can be considered a 
pair of fundamental solutions (— Appendix A, 
Table 19.11). 


C. Parabolic Cylinder Functions 


Putting x 2 (£? —n?)/2 and y =n, the curves 
corresponding to =constant and to y = 
constant, respectively, constitute families of 
orthogonal parabolas. The curvilinear coordi- 
nates (£, 7, z) in three dimensions are called 
parabolic cylindrical coordinates. By using 
parabolic coordinates, separating variables in 
Laplace's equation into the form f(é)g(n)e'™, 
and making a simple transformation, we find 
that f and g satisfy a differential equation of 
the form 


ar 





+(n+45—427)F =0. (4) 


By means of the Whittaker function W, ,(z), a 
solution D,,(z) of (4) is represented by 


-1/2 2 
p.22" 70g S Wri2+c1jay,-1/4(22 ). 


Equation (4) is called Weber’s differential 
equation or the Weber-Hermite differential 
equation, and D,(z) the Weber function. An- 
other solution of (4) is D_,,_, (iz) or D., 4,(— iz). 
The solutions of (4) are called parabolic cyl- 
inder functions. In particular, if n is equal to 

a nonnegative integer, then 


H,(z) - 2 "^ exp(12?) D (2 z) 


is the *Hermite polynomial of degree n. Solu- 
tions of differential equations for harmonic 
oscillators in quantum mechanics are of this 
form. 

In general, suppose that three regular sin- 
gular points are confluent to the point at infin- 
ity, and that they are reduced to an irregular 
singular point of class 2. Suppose further that 
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there are no other singularities. Then differen- 
tial equations of order 2 with these conditions 
are transformed into the form (4), whose solu- 
tions are represented by parabolic cylinder 
functions. Differential equations of the form 
(4) are reduced to confluent hypergeometric 
differential equations if z? is chosen as an 
independent variable (— Appendix A, Table 
20.III). 


D. Indefinite Integrals of Elementary 
Functions 


Since exponential and trigonometric functions 
can be represented by particular types of 
Kummer functions, their indefinite integrals 
that cannot be represented by elementary 
functions, e.g., tincomplete T -functions and 
the error function Erfz = (5exp(— t?) dt, can be 
represented by Kummer or Whittaker func- 
tions. They are included in a family of tspecial 
functions of confluent type. The functions 
defined by 


ies | SOS Td 


are called Fresnel integrals, which are also 
represented in terms of the Whittaker function 
as 


C(z) — iS(z) 


1—i e 78 
4 —z?j2 3 
= 1 Ju E e 2 W 44, 14 (iz) . 
T 





Fresnel integrals first appeared in the theory of 
the diffraction of waves. More recently they 
have been applied to designing highways for 
high-speed automobiles. Furthermore, the 
functions 


ctu | cos($s* as 

Mae 

S6 cim eas 
0 2 


(obtained by a change of variables z = nu? /2) 
are also called Fresnel integrals. Numerical 
tables are available for them. The curves x= C 
and y=S with a parameter z or u are called 
Cornu's spiral (Fig. 1). The functions 


x dt x et 
Lix2| —, Eix= SE 


where a tprincipal value must be taken at t=0 
if x >0, 


` *sint 
sie 9t 


0 


: ? cost 
and cix-- —— dt 
"EET 
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Fig. 1 


are called the logarithmic integral, exponential 
integral, sine integral, and cosine integral, or 
integral logarithm, integral exponent, integral 
sine, and integral cosine, respectively. They 
satisfy the relations 


Eix=Lie*, Eiix=Cix+iSix+(n/2)i. 


They have important applications: Ei x in 
quantum mechanics, Si x and Ci x in electrical 
engineering, and Lix in estimating the number 
of *primes less than x (— 123 Distribution of 
Prime Numbers). Lix is also denoted by lix 
(^ Appendix A, Table 19.II). 


E. Stokes's Equation 


Consider a linear differential equation of the 
second order with five regular singular points 
including the point at infinity such that the 
difference of the characteristic indices at every 
singularity is equal to 1/2. Such equations are 
called generalized Lamé’s differential equa- 
tions. F. Klein and M. Bécher have shown 
that every linear differential equation that is 
commonly treated in mathematical physics is 
represented by a confluent type of generalized 
Lamé's equation. Among these equations, if 
all five singularities are confluent to the point 
at infinity, the resulting equation is called 
Stokes’s differential equation, which is applied 
to the investigation of diffraction. This is re- 
duced to *Bessel's differential equation of order 
1/3 by suitable transformations of the inde- 
pendent and dependent variables. 
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168 (XII.6) 
Function Spaces 


A. General Remarks 


It is a general method in modern analysis to 
consider a set X of mappings of a space Q into 
another space A as a space (— 381 Sets) and 
its elements (namely, mappings of Q into A) as 
points of the space X, and to investigate them 
as geometric objects. In particular, it is impor- 
tant to consider the case where Q is a ftopo- 
logical space, a ‘measure space, or a fdiffer- 
entiable manifold and X is a set of real- or 
complex-valued functions defined on Q and 
satisfying certain conditions, such as continu- 
ity, measurability, and differentiability. Such 
spaces are generally called function spaces; 
they usually form ttopological linear spaces 
(— 37 Banach Spaces; 197 Hilbert Spaces; 424 
Topological Linear Spaces). 


B. Examples of Function Spaces 


The following are important examples of func- 
tion spaces. Throughout this section, all func- 
tions are real- or complex-valued, and two 
functions on a measure space are identified 
whenever they are equal to each other *almost 
everywhere. 


(1) The Function Spaces C(Q), C, (£9), and 
C,(Q). The totality of continuous functions 
f(x) defined on a compact tHausdorff space Q 
is denoted by C(Q). Let f+g and of (a a real 
or complex number) be the functions f(x) - 
g(x) and af(x), respectively. Then C(Q) forms 

a ‘linear space. Furthermore, define the norm 
of f by || f| =supxeo| J (x)|. Then C(Q) becomes 
a *Banach space since it is complete in (the 
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metric defined by) the norm. The norm is 
called the supremum norm or uniform norm 
because Jm, || f, — f || =O means that f,(x) 
converges to f(x) uniformly on Q as n œ. 
Define f-g to be the function f(x)g(x). Then 
clearly || f-g|| € || Viol Hence C(Q) is also a 
*Banach algebra. 

Suppose that a subset R of C(Q) satisfies the 
following three conditions: (1) R is an talgebra 
over the complex number field with respect to 
the addition and multiplication defined above 
and contains the function identically equal to 
one. (it) For any two distinct points x and y 
of Q, there exists a function fe R satisfying 
f(x) z f(y). (iii) For any fe R, there exists an 
f*€R such that f *(x) f(x) on Q. Then R is 
dense in C(Q) with respect to the supremum 
norm (namely, in the sense of uniform conver- 
gence). This fact is known as the Weierstrass- 
Stone theorem (or Stone-Gel'fand theorem). 

A subset E of C(Q) is tprecompact (i.e., any 
sequence of functions in E contains a sub- 
sequence that converges uniformly on Q) if and 
only if E is tuniformly bounded and fequicon- 
tinuous (Ascoli-Arzelà theorem). Since C(Q) is 
not a freflexive Banach space except for trivial 
cases (— Section C), precompact sets and 
relatively compact sets are different in the 
tweak topology. For important characteriza- 
tions of the latter sets — [2, 5]. 

When Q is a topological space that is not 
necessarily compact, the totality of bounded 
continuous functions on Q (denoted by BC(Q)) 
is also a Banach space with respect to the 
supremum norm || f || 2 sup,.o|f(x)]. Let Q be 
a locally compact Hausdorff space. Then the 
space C(Q) of all continuous functions on Q is 
endowed with the topology of uniform conver- 
gence on the compact sets, i.e., the flocally 
convex topology defined by the tseminorms 
Supyex | f(x)]| as K ranges over the compact sets 
in Q. C(Q) is always a complete locally convex 
space. It is a tFréchet space if Q is c-compact 
De, Q is the union of a countable family of 
compact sets). We denote by C,,(Q) the sub- 
space of all functions f(x)e C(Q) that converge 
to zero as x tends to infinity (e, given an €> 
0, there is a compact set K such that | f(x)| « 
€ for x K). C,,(Q) is a Banach space with the 
norm sup,.<o|f(x)|. It can be regarded as a 
closed linear subspace of BC(Q). 

The totality of continuous functions with 
compact support is denoted by C,(Q) or 2£'(Q), 
where the support (or carrier) of a function f 
is the closure of the set {x| f(x) 40} in Q and 
is usually denoted by supp f. If O is not com- 
pact, C)(Q) is not complete with respect to the 
supremum norm, but when Q is o-compact 
CQ(Q) is complete with respect to the strongest 
locally convex topology with the property 
that for each compact set K in Q the embed- 
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ding in Co(Q) of the linear subspace X% of all 
functions with support in K equipped with 
the supremum norm is continuous. Usually 
X (Q) denotes the space CU equipped with 
this topology. 


(2) The Lebesgue Spaces L,(Q) (0 « p « oo). Let 
(Q, u) be a *measure space. We denote by L,(Q) 
the totality of *measurable functions f(x) on Q 
such that | f(x)? is integrable. A function f de- 
fined on (Q, u) is called square integrable if fe 
L,(Q, n). If O is the interval (a, b) equipped 
with the Lebesgue measure, it is sometimes 
denoted by L,(a, b). We define the norm 17, 
by 


1/p 
Ac) is) Pduts) 


If 1 € p « oo, then L,(Q) is a Banach space. The 
triangle inequality for this norm is precisely 
the tMinkowski inequality. If 0 « p « 1, the 
norm no longer satisfies the triangle inequal- 
ity but does satisfy the quasinorm inequality 
If +glp  2"* (|f, lgl), and if 21,5 < 
oo, then X f, converges unconditionally in 

L (0). Hence L,(Q), 0 « p « 1, is a tquasi- 
Banach space. If lim, ,,, || f, — f ||, 70, we say 
that the sequence { f,} converges to f in the 
mean of order p (or in the mean of power p), 
and write l.i.m., ,, f, — f. If ( fa} converges to 
f in the mean of order 2, we simply say that 

{ fa} converges to f in the mean. (The nota- 
tion l.i.m. means the limit in the mean and is 
used mostly when p=2.) For any f, ge L,(Q), 
(f.g) fa Tote) du(x) is well defined, by the 
*Schwarz inequality, and has the properties of 
the tinner product. Hence, L,(Q) is a *Hilbert 
space. If 1 « p « oo, then L,(Q) is a tuniformly 
convex Banach space and is in particular 
treflexive. Deepest results on L,(R"), 1 « p « oo, 
are often derived from the Littlewood-Palay 
theory due to J. E. Littlewood and R. E. A. C. 
Paley, A. Zygmund [6], and E. M. Stein [7]. 
Its starting point is the inequality 


A, f ly S lafe Apllf lp» 
where g( f) is the function 
oo 1/2 
ste-(| |grad, u(x, prar) 
0 


obtained from the Poisson integral 
u(x, t) 


_ T((n-- 12) 


Tee | ta? ls yl) OP fo)ay, 
R^ 


where [x| «(x2 + ... + x2)! ?. 

The L, spaces, 1 « p « oo, are generalized 
in the following way. Let (s) be a convex 
and nondecreasing function on [0, oo) satisfy- 
ing (0) 2 0 and @(s)/s— oo as s oc. Denote 
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by Lei (L$(Q)) the set of all functions f(x) 
such that ®(| f(x)]) is integrable (®(k| f(x)|) is 
integrable for some k 0). L4(0) 2 L$(Q) if 
@(2s) < C(s). L§(Q) is a Banach space, called 
the Orlicz space, under the norm 


£I wire? 





fou ISOJI duo) < ) 


L,(Q), 1 « p « oo, is the Orlicz space for (s) 
=s?, 


(3) The Function Space M(Q)= LOL Let Q 
and y be as in (2). A measurable function f(x) 
on Q is said to be essentially bounded if there 
exists a positive number a such that | f(x)|<« 
almost everywhere on 2. The infimum of such 
a is called the essential supremum of f, denoted 
by esssup..o | f(x)|. The totality of essentially 
bounded measurable functions on Q (denoted 
by M(Q)) is a Banach space with respect to 
the norm ||f || (TI. =€88SUP seal Tall, If 
HI oo, then M(O) c L,(Q) for any p 0, and 
Lf ||, Dm, | fll, for any fe M(Q). From 
this point of view, M(Q) is also denoted by 

L (Q) even when p(Q)= oo. This is also the 
reason why the notation ||: || is used for the 
norm in M(Q). 


(4) The Lorentz Spaces L,, p(Q) (0 p, q < 0). 
The Lebesgue spaces L,(Q), 0 « p & oo, are 
rearrangement invariant. Namely, define for a 
measurable function f on a measure space 

(Q, ol the distribution function u,(s) 2 {xe 
Q|| f(x)|>st, s» 0, and the rearrangement 
f*(t)=inf{s>0| uj (s) & t), t0. Then feL,(Q) 
if and only if f*€L (0, oo) and f], — If * lp 
Another important class of rearrangement in- 
variant spaces are the Lorentz spaces L,, ,,(Q), 
0<p, qx oo (G. G. Lorentz, 1950; R. A. Hunt 
[8]), which is defined to be the quasi-Banach 
space of all measurable functions f on Q such 
that 


If llo. = It" f O< oo, 


where L7 is the L,-space on (0, oo) relative 

to the measure dt/t. Lip, »(Q)=L,(Q) with 
equal norms. If 1 « p « oo and 1 <q < oo, then 
L5, 4&3) is a Banach space under the equiva- 
lent norm |t? fof * (5) dsll i. Except for 
these cases, Lip, p(Q) is not equivalent to 

a normed space in general [8]. If qo « q,, then 
L5, 4,9) c Luna, with continuous embed- 
ding. In case u(Q0) « oo, Lip, , (Q) c Li, all 
for po > p, and any qo, q,. The Lorentz spaces 
play an important role in interpolation and 
approximation theory (— 224 Interpolation of 
Operators). 


(5) The Function Space S(Q). Let (Q, ul bea 
measure space with u(Q) « oo. Denote by S(Q) 
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the totality of measurable functions on Q that 
take finite value almost everywhere. Then || f || 
= fall FONA - 1f 091) du(x) for fe S(Q) has 
the properties of the fpseudonorm and S(Q) 
is a *Fréchet space (in the sense of Banach) 
that is not locally convex in general. We have 
lim, ,, || f, — f£ || =0 if and only if 


lim p({x| fa) S00 >2})=0 


for any positive number £. Convergence of this 
type is called convergence in measure (or as- 
ymptotic convergence) and is the same notion 
as tconvergence in probability of a sequence 
of frandom variables (— 342 Probability 
Theory). If { f} € L,(Q) converges to f e L,(Q) 
in the mean of order p, then | f,} converges to 
f asymptotically, but the converse is not true 
in general. If a sequence | f,j € S(Q) converges 
to f eS(Q) almost everywhere, then { f,} con- 
verges to f asymptotically. Any sequence { f,} 
that converges to f asymptotically contains a 
subsequence | f, } that converges to f almost 
everywhere. 


(6) The Sequence Spaces c, co, |, (0 « p « oo), 
m — 1, , and s. The totality c (resp. co) of se- 
quences x — (£,) that converges (resp. con- 
verges to zero) as n— oo forms a Banach space 
with respect to the norm ||x|| 2 sup|£,l. co 
(resp. c) is the space C,,(Q) (resp. C(Q)) when Q 
is (resp. the one-point compactification of) the 
discrete locally compact space (1,2,3,...). The 
sequence space l,, 0 « p « oo (resp. m=1,,), is 
defined to be the spaces L,(Q) (resp. M(Q)), ` 
where Q is the space (1,2,3, ..., n,... }, of 
which each point has unit mass, while s de- 
notes the space S(O), where Q— (1,2, ...,n,...! 
provided with the measure assigning mass 
1/2” to the point n. s is the set of all sequences 
equipped with the topology of pointwise con- 
vergence (s is also used to denote the space of 
rapidly decreasing sequences; — Section (16)). 
Assume that the space L,(Q) mentioned in 
(2) is tseparable and that {¢,} is a tcomplete 
orthonormal set in L,(Q). Then putting 


5 


g= | Yoda du) 


(tFourter coefficients) for any fe L,(Q), we have 
fén EL and E |é, = LIT. Conversely, for 
any {€,}el, there exists an f — 37, 6,0,€ 
L,(Q) whose Fourier coefficients are the given 
&, (Riesz-Fischer theorem). By means of this 
correspondence, separable spaces L (Q) and 
l, are mutually isomorphic as Hilbert spaces. 
Sometimes we denote by /,(€2) the function 
space L,(Q), where Q is an arbitrary set en- 
dowed with the measure assigning mass 1 to 
each point. 
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(7) The John-Nirenberg Space BMO. A locally 
integrable function f(x) on R" is said to be of 
bounded mean oscillation if 


EE | f(x) — fs| dx < oo, (1) 
S 


where the supremum is taken over all (solid) 
spheres S in R”, f, is the mean |S| ! fs f(x)dx, 
and |S| denotes the measure of S (F. John and. 
L. Nirenberg, 1961). The set BMO(R") of all 
functions on R" of bounded mean oscillation 
forms a Banach space under the norm men- 
tioned above if two functions f and g are iden- 
tified whenever f —g is equal to a constant 
almost everywhere. Condition (1) is equivalent 
to 


l/p 
sup( isi" | uta ras) «00 
S 


for any 1 € p « oo. There are constants B and 
K >0 such that 


(vest) — ls Ai 
< B|S|exp( — KAL lsmo) 


for any sphere S and 4» 0. BMO is a slightly 
larger space than L, (e.g. log|x|e BMO) 

and has better properties. For example, the 
*Calderón-Zygmund operators are bounded in 
BMO(R"). We have 


BMO(R")=L,,(R")+ Y RjL a(R"), 


where R; are the tRiesz transforms [9]. This is 
called the Fefferman-Stein decomposition. The 
Riesz transforms can be replaced by more 
general families of singular integral operators 
(A. Uchiyama, Acta Math. (1982)). 


(8) The Hardy Spaces H, (0 — p « oo). The 
classical theory of Hardy classes (— 159 
Fourier Series G) has been reconstructed by 
the real-analysis method and extended to 
higher-dimensional cases by E. M. Stein, G. 
Weiss, C. Fefferman, and others. According to 
their terminology the elements of the Hardy 
space H, are (the complex linear combinations 
of) the real parts of the boundary values of 
holomorphic functions of the Hardy class. 

Let fe "'(R") be a ttempered distribution. 
For a o e F(R") define the radial maximal 
function M; f and the nontangential maximal 
function M; f relative to o by 
M; f(x) =suplo, * f(x), 


t>0 


Mj f(x)- sup |o f(y), 

Ix-y|&t 
where ¢,(x)=t "o(x/t) and * denotes convo- 
lution. Then the Hardy space H,(R"), 0 c p< 
oo, is defined to be the space of all tempered 
distributions f which satisfy the following 
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equivalent conditions: (i) Mj feL, fora o 
with f o(x)dx #0; (ii) M* fe L, for a ọ with 
[o dx 40; (iii) M; feL, for any o; (iv) Mž fe 
L, for any. ọ. If a distribution f satisfies (one 
of) these conditions, then its Poisson integral 
u(x, t)= q, * f(x) is a function, where (x)= 
const(1 4- |x|?) "+02, and its radial maximal 
function u ' (x) 2 sup,. o |u(x, t)| and nontangen- 
tial maximal function u*(x) — sup... |u(y, 0 
both belong to L,. Conversely if u(x, t) is a 
harmonic function on the upper half-space 
t>0 and if either u* or u* belongs to L,, 
then its boundary value f —u(:,0) exists in the 
sense of tempered distribution and f satisfies 
the above conditions. Define the norm of an 


f € H,(R") by [u*||,. Then H,(R") becomes 


a quasi-Banach space. If 1 « p « oo, then 
H,(R")= L (R") with equivalent norms. H,(R") 
is a Banach space strictly smaller than L,(R"). 
An f € L,(R") belongs to H,(R") if and only if 
all the Riesz transforms R,f are in L,(R"), and 
IS lz, is equivalent to | f|, +£ | Rif ||. Simi- 
lar characterizations of H,(R") are also known 
for p» 0 (Fefferman and Stein [9]). 

Let Ij, j — 1, ...,m, be tproper convex open 
cones in R” such that the tpolars I7? cover 
R". Then an fe "(R") belongs to H,(R") if 
and only if there are holomorphic functions 
F(x +iy) on R"-- iT; such that sup{||F,(- + 
iylll,lyerg <% and f X F(-  iT0) (D. L. 
Burkholder, R. F. Gundy, and M. L. Silver- 
stein for n=1 and L. Carleson for n» 1). Let 0 
«px 1. A measurable function a on R” is said 
to be a p-atom if there is a sphere S such that 
suppac $ and Tal, « |S| Hr and if [a(x)x* dx 
— 0 for all multi-indices « with |a| xn(p ! — 1). 
Here a multi-index o is an n-tuple (o, ...,0,) 
of nonnegative integers, |x| 25, +... - x, and 
x*— x4?! ... Zen, A distribution f belongs to 
H,(R"), 0 c px 1, if and only if there are a 
sequence of p-atoms a; and a sequence of 
numbers 4;>0 in /, such that f= 4;a; in the 
sense of distributions, and the norm Ilf lu, is 
equivalent to the infimum of "Ah, (R. R. Coif- 
man for n— 1 and R. H. Latter for n» 1). The 
theory of H, and BMO has been generalized 
to more general situations (— Coifman and 
Weiss, Bull. Amer. Math. Soc. (1977)). 


From now on we assume that Q is a domain 
in the n-dimensional Euclidean space R" (or 
more generally a differentiable manifold). D* 
stands for D,“ ... DN, where D;=6/0x;. 


(9) The Function Spaces C'(Q) and C}(Q) (1— 
0,1,2, ..., 00). The totality of /-times continu- 
ously differentiable functions in Q (namely, 
differentiable functions of fclass C' in Q) is 
denoted by C'(Q). We say that a sequence 

{ f,} of functions in C'(Q) converges to 0 in 
C'(Q) if | Df, (x)| converges to 0 uniformly on 
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every compact subset of Q for every o satisfy- 
ing 0 « |x| «1 (0 « |a| « oo if l= oo). C'(Q) is a 
Fréchet space. The totality of functions in 
C'(Q) whose supports are compact subsets of 
Q is denoted by C$(Q). We say that a sequence 
{ f,} of functions in Cl converges to 0 in 
C}(Q) if supp f, (v2 1,2,...) is contained in a 
compact subset of Q independent of v and ( f,] 
converges to 0 in CO, C4(Q) is an t(LF)- 
space. 

When Q is a locally closed set in R" (or a 
differentiable manifold with boundary), we 
denote by C'(Q) the totality of functions f(x) 
on Q together with their continuous formal 
derivatives D* f(x), |x| <l (|x| « oo if 12 oo) on 
Q such that for every « and m with |a|<m<l 
(m « oo if 1— oc), 


D'fe)- E PSOE- 
alle 


locally uniformly in Q as |x — y| tends to 0. 
Convergence in C'(O) is defined in the same 
way as above. 

If Q is a closed set in R” with a finite number 
of connected components in each of which . 
two points x and y are connected by an arc of 
length < C|x — y| with a constant C indepen- 
dent of x and y, then every f in C'(Q) can be 
extended to an f in C'(R") (Whitney's extension 
. theorem). 


(10) The Lipschitz Spaces A‘. Let s 0, and 

let k be the least integer greater than s. The 
Lipschitz (or Holder) space A*(R") is the totality 
of functions f(x) on R" which satisfy 


BER 


When s « 1, this is exactly the tLipschitz (or 
Holder) condition of order s. But when s=1, it 
is strictly weaker than the tLipschitz condition. 
A function f € A! is said to be smooth in the 
sense of A. Zygmund [6]. Suppose 0 « h «s is 
an integer. Then a function f belongs to A if 
and only if it is h times continuously differenti- 
able and all the derivatives D*f of order h are 
in A5 ^, A((R") is a Banach space of functions 
modulo the polynomials of degree « k— 1. 

Suppose that 1 <q < oo and f is a measur- 
able function on R" such that 


k 


k 
2, (eie +jy) 


j=0 


lf ls - sup 


xy 








Lg 
IER fea Foods) « 00, 
S P S 


where the supremum is taken over all spheres 
in R" and the infimum over all polynomials of 
degree «s. Then f(x) is equal to a function 

f (x) in A*(R") almost everywhere and the su- 
premum is equivalent to the norm || f llas. 
Conversely every f e A‘(R") satisfies the above 
inequality (S. Campanato). 
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(11) The Sobolev Spaces W/(O), H'(O), and 
Hi(Q) (1=0, 1,2, ...,.1&p& oo or —oo «1« oo, 
I «p«oo). Let [2 0 bean integer and 1 &p« 
oc. The Sobolev space W;(Q) is the totality of 
functions f(x) such that for all satisfying 

jal <l, the derivatives D*f(x) in the sense of 
distribution (— 125 Distributions and Hyper- 
functions) belong to L,(Q) with respect to 
Lebesgue measure in Q. W/(Q) is a Banach 
space with the norm 


mme) 53 


O<lal<I JO 


Up 
|D*f rax) . 


Clearly Win = L,(Q). W7(Q) is a Hilbert 
space with respect to the inner product 


D=] A. D'fo9):D*g(x)dx. 

Q0xla|«t 
Sometimes WJ(Q) is denoted by H'(Q). Its 
closed linear subspace obtained as the comple- 
tion of Cg (Q) is denoted by Hi(Q). We have 
Hà (Q) = W?(Q) — L,(Q). However, if 12 1, we 
have Hi(Q) c W1(Q), and identity does not 
hold unless Q — Rn. 

The definition of Sobolev spaces has been 
extended to those with fractional and also 
negative order —oo <s < oo in many different 
ways. When 1 « p « oo, it is natural to define 
W,(R") to be the space of all tempered distri- 
butions f on R” such that (1 — Ay2f =F ^! (1 
EP? Zf)e L,(R"). If s 0, then W;(R") thus 
defined coincides with the space of all fe 
L,(R") whose Poisson integral u(x, t) satisfies 


oo 1/2 
d Paatu Aral 
0 


for some (and any) integer k > s/2. 

The Poisson integral can be replaced by 
other regularizations, and thus the definition 
of Sobolev spaces of fractional orders is ex- 
tended to arbitrary open set Q with the cone 
condition (T. Muramatu [13]). Here Q is said 
to satisfy the cone condition if there are a 
bounded and uniformly continuous mapping 
V: R"5SR" and an e>0 such that for any xeQ 
the convex hull of the -ball with center at x + 
(x) and {x} is included in Q. 


« DO 
[4 








(12) The Besov Spaces B}, (—o0 «s« oo, 1 & p, 
q <œ). The effort to make the Sobolev em- 
bedding theorem [10] more precise led S. M. 
Nikol’skii and O. V. Besov [11] to the other 
classes of "Sobolev spaces" of fractional order. 
Let s» 0 and 1 <p, q & oo. The Besov space 

B (R") is the totality of functions fe L,(R") 














such that 
k fk i d du 1/4 
s = . JC7AY f C jy) sex] 
If ls (| à LI f JY MIL + 
«oo 
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for some (and any) integer k> s. In terms of 
the Poisson integral an fe L,(R") belongs to 
RB (R") if and only if 
q l/p 
a) <% 
P 


oo 
| [tQ 
0 


for some (and any) integer k>s/2. The first 
definition is obviously extended to an arbi- 
trary domain Q in R". If Q satisfies the cone 
condition, then the functions in Bj ,(Q) are 
characterized similarly as above by using a suit- 
able regularization u(x, t) of f(x) (Muramatu 
[13]). Let € be a domain with the cone condi- 
tion. When 0<h<s is an integer, an f belongs 
to B;.,(Q) if and only if fe W/(Q) and all 

the derivatives D*f of order h are in B5, An 
RB (Q) is a Banach space with the norm III, 
+| fps, If p—2, then B5 ;(Q) coincides with 
the Sobolev space W7(Q). But in the other 
cases, B5, ,(Q) is different from W;(O) for any q. 
However, By, ,(Q) or By, ts often called a 
Sobolev space of order s and denoted by 
W;(Q). Clearly we have BE, o = La NAS. 

The Besov space B, ,(Q) of order s « 0 is de- 
fined to be the totality of distributions f which 
can be represented as f = Zanen Df, with 
Jac B5. 1(Q) for some (and any) integer k 
such that s+ k>0. The norm is defined to be 
inf È, || fall astro. In terms of the Poisson in- 
tegrals or other regularizations the same char- 
acterization holds as above. 

If Q is a domain in R” with the cone con- 
dition, then the restriction B} ,(R")— B5, ,(Q) 
(resp. W;(R").— W;(Q) for 1 «p«oo)isa 
bounded linear surjection with a bounded 
right inverse [13]. 

From now on we denote by Bj, ,, etc., 

Bj, q(Q), etc. for a domain Qc R" with the cone 
condition. If q <r, then B; , B5, If 1<p<2, 
then BR p= W; By». If 2x p oo, then B5 c 
W, c By, If s» t, then B5, < By, 

Sobolev-Besov embedding theorems: (i) Let p 
<p’ and s—n/p=s'—n/p’. Then B5 ,c Bs. ,, 
Bech, W; BS, p and, if p «oo, then W;c 
W; . (it) Let 0 « s « n/p and n/p' 2n/p — s. 

Then BR CL a and WP Lip, (C Lp). (iii) 
Let s=n/p. Then Bj, , c BC for any p and B¥, ,, 
c BMO for p « œ. (iv) LetO n' «n and 0< 

s —s— (n— n')/p. Then there is a bounded trace 
operator Tr: B; ,(R")> Bs, ,(R") that extends 
the restriction mapping A (R")5 A(R"). Tr 

is surjective and has a bounded linear right 
inverse [11-13]. 


A*u(-, t) 














(13) The Function Spaces 2, &, d, 22, and F. 
The spaces of infinitely differentiable functions 
C$ (Q) and C*(Q) are also denoted by 2(9) 
and 6(O), respectively. The totality of func- 
tions f(x) in C*(Q) such that, for all «, D^f(x) 
belongs to L,(Q) with respect to *Lebesgue 
measure is denoted by 2, (Q). The neighbor- 
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hood V,,, of 0 in 2, (Q) is defined to be the 
totality of functions f(x) such that ||D*f ||, <€ 
for any « satisfying |x| € In particular, 2, (Q) 
is also denoted by A(Q). (A(Q) is also used to 
denote the space of hyperfunctions.) 

A function f(x) is called a rapidly decreas- 
ing C^-function if it belongs to C^(R") and 
satisfies 


Am |x|*|D*f(x)|=0 


for any « and any integer k>0. The totality of 
rapidly decreasing C?-functions is denoted by 
SF. The neighborhood V, , , of 0 in Z is defined 
to be the totality of functions f(x) such that 
(1 4 |x D |D*f(x)| <e for any « satisfying |a| x I. 
The spaces in this section are tnuclear except 
for Z, and Z, and are employed in the theory 
of distributions [14] (— 125 Distributions and 
Hyperfunctions). When Q =R”, we usually 
omit (R”); for example, Z(R") and £, (R") are 
denoted by 2 and &, , respectively. 


(14) The Function Spaces 7, y Mis Za,» 61M 
óm,» and d — C". Let (M, be a sequence of 
positive numbers satisfying the logarithmic 
convexity My < Mp-ı Mp+1- Em (Q) (resp. 

d m,)(2)) denotes the totality of C®-functions 
f on Q such that for any compact set K in Q 
there are constants k and C (resp. for any k>0 
there is a constant C) satisfying 


|D*f(x)| < CK" My, ek |a| z 0. 


6,5, (Q) is the totality of treal analytic func- 
tions on Q and is denoted by (Q) or C^(Q). 
If {M,} satisfies the Denjoy-Carleman condi- 
tion 2, M,/M,,, < oo, then Du, (Q)= AQ) 
ém, (Q) and Ay )(Q)= HYN Ey, (Q) are 
dense i in 2(Q). Conversely, if Dry „Qi is differ- 
ent from {0}, then {M,} satisfies the Denjoy- 
Carleman condition. In this case an fe 
Zi m,}(Q) (resp. & M, (3) is sometimes called 
an ultradifferentiable function of class (M,j 
(resp. (M,)). The most important is the case 
where M, — p! for an s 1. Then an fe 
St m, (9) (resp. & a )(82)) is called a function 
of Gevrey class {s} (resp. (s)). The topological 
properties of (Q) (resp. im (Q), etc.) have 
been discussed by A. Marine (resp. H. 
Komatsu). 

For function spaces of S type — 125 Distri- 
butions and Hyperfunctions. 


(15) The Function Spaces €(O), €, (0) — A,(Q), 
and A(Q). Let Q be an open set in C". The 
totality of tholomorphic functions on Q is 
denoted by O(Q). @(Q) is a tnuclear Fréchet 
space with the topology of uniform conver- 
gence on compact sets. It is a closed linear 
subspace of C(Q) and also of C?(Q). 

For any pÈ 1, the totality of functions f 
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holomorphic in Q and satisfying | f(z)? dx dy 
<œ (z=x+iy;dxdy is Lebesgue mea- 

sure), denoted by €,(Q) or A,(Q), is a Ba- 

nach space with respect to the norm |f], — 
(fal.f(2)]" dx dy)". In particular, it is a Hil- 
bert space when p —2. 

The totality of functions bounded and con- 
tinuous on the closure of O and holomorphic 
in Q (denoted by A(Q)) is a Banach space 
with respect to the norm || f || 2 sup,.g| f(z)]. 


(16) The Kóthe Spaces (  4('?) and 24" (a'?). 
Let {x} be an increasing sequence of se- 
quences «™ — (o), a, ...) of positive numbers. 
The echelon space (  4(*'?) of G. Kóthe [15] 
is the totality of sequences x= (£,, £,,...) 

such that p(x) — X a |é; « oo for any k. 

It is a Fréchet space with the topology deter- 
mined by the seminorms p“. The co-echelon 
space X A" (x®) is the totality of sequences y= 
(4.115, ...) such that |7,|< Co for some C 
and k. The space s of rapidly decreasing se- 
quences and the space s' of slowly increasing 
sequences are the echelon space and the co- 
echelon space for the sequence a(? = i*. If «= 
k', then we obtain the space of power series 
with infinite radius of convergence and the 
space of convergent power series. More gener- 
ally, let x —(x;) be a sequence of positive num- 
bers. The echelon spaces for af? = exp(ka;) and 
exp(—k~'a,) are called the infinite type power 
series space and the finite type power series 
space and are denoted by A (x) and A, (a), 
respectively. Echelon spaces and co-echelon 
spaces have been employed to construct exam- 
ples and counterexamples in the theory of 
tlocally convex spaces by K othe [15], Grothen- 
deck. Y. and T. Komura, E. Dubinsky, D. 
Vogt, and others. 


C. Dual Spaces 


When Q is a compact Hausdorff space, any 
bounded linear functional ® on C(Q) is ex- 
pressed by the *Stieltjes integral 


o(f)- | f69de(x), fe C(Q), (2) 
Q 


with a *Radon measure o. i.e., a (real- or 
complex-valued) tregular fcountably additive 
set function defined on the Borel sets in Q. 
Since q is of bounded variation, the totality of 
such o is denoted by BV(Q). Conversely, any 
ye BV(Q) gives a bounded linear functional on 
C(Q) defined by (2), and ||®|| =the total varia- 
tion of g over Q. Hence the dual space of C(Q) 
is isomorphic to the Banach space BV(Q) with 
the norm || |. 

Let Q be a locally compact Hausdorff space. 
Then the dual space of C,,(Q) is again the 
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Banach space BV(Q) of Radon measures of 
bounded variation. On the other hand, the 
dual space of C,(Q) is the space of all Radon 
measures on Q. N. Bourbaki takes this fact as 
the definition of measure. 

Any bounded linear functional ® on L,(Q) 
is expressed as 


D(f -Í S(x)o(x)du(x), — feL,(Q), G) 
Q 


with a suitable pe M (Q); and ||®jj = ell. 
Conversely, any oe M (Q) defines a bounded 
linear functional on L,(Q) by means of (3). 
Accordingly, the dual space of L,(Q) is isomor- 
phic to M(Q). 

The dual space of L,(Q) (1 « p « oo) is iso- 
morphic to L,(Q), where q is the real number 
defined by (1/p) - (1/9) 2 1 (accordingly, 1 <q < 
o0) and is called the conjugate exponent of p. 
Any bounded linear functional on L,(Q) is 
expressible by the formula in (3) (where fe 
LA with ge L,(Q), and ||! = lll: 

The dual space of M(Q) is isomorphic to the 
normed linear space of all (real- or complex- 
valued) finitely additive set functions o defined 
on all measurable sets in Q, of bounded vari- 
ation over Q, and absolutely continuous with 
respect to the measure u given in Q Ge, u(N)= 
0 implies p(N)=0). If 1 «p«oo and l<q< 
oo, then the dual space of L,, 4)(Q) is isomor- 
phic to Liya (Q), where p' and q' are conjugate 
exponents of p and q, respectively. 

If Q is nonatomic (e. Q has no set of posi- 
tive measure that cannot be decomposed into 
two subsets of positive measure), then no con- 
tinuous linear functionals exist other than 
zero on S(Q) and on L,(Q) and L,, 4)(Q) for 
Ocp«l. 

The sequence spaces co, I, (1 & p « oo), m, 
and s (the space defined in Section B (6)) are 
special cases of C,,(Q), L,(Q), M (Q), and S(Q), 
respectively. Hence their dual spaces can be 
described explicitly. For example, the dual 
space of c, (resp. /,) is 1, (resp. m), and if 1 < 
p< œ, the dual space of l, is 1, (where (1/p) 4- 
(1/q) 2 1). The coupling of x =(€,) and y= 
(y,) 1s given by È 6,5,. The dual space of s is 
the totality of sequences (y,,) such that 5, =0 
except for a finite number of n. 

Let VMO(R") denote the closure of C;(R") in 
BMO(R’). Then the dual space of VMO(R") is 
identified with H, (R"). This can be considered 
to be a generalization of the tF. and M. Riesz 
theorem. On the other hand, the dual space of 
H,(R") is BMO(R") [9]. Hardy spaces H,(R"), 
O0 «p « 1l, are not locally convex but have suffi- 
ciently many bounded linear functionals, and 
their dual spaces are identified with Lipschitz 
spaces A‘(R"), where s-n(p ! — 1). 

Let 1 <p, q< œ. Then the dual space of 
W,(R") is isomorphic to W,,*(R") and that of 
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RB ,(R") to B2, (R"), where p' and a are conju- 
gate exponents of p and q, respectively. The 
dual space Z' of 2 is defined to be the space of 
tdistributions; the dual spaces of &, Z, , and 
fF are (algebraically) linear subspaces of Z'. 
Similarly, the dual spaces of 2; m) and Zm, 
are called the spaces of tultradistributions 
of classes {M,} and (M,), respectively. The 
dual space of æ is identified with the space of 
thyperfunctions with compact support (— 125 
Distributions and Hyperfunctions). A continu- 
ous linear functional on @(Q) is called an 
analytic functional. For each analytic func- 
tional ® there is a compact set L <Q such that 
It /)| € C supel f(z)). A compact set K is 
called a porter of ® if every compact neighbor- 
hood L of K satisfies this condition. Porters 
are similar to supports of generalized func- 
tions, but an analytic functional does not 
necessarily have a smallest porter. 

The dual space of an echelon space is the 
corresponding co-echelon space. 
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A. General Remarks 


By a function-theoretic null set we mean an 
exceptional set that appears in theorem as the 
one asserting that a certain property holds 
with a “small exception.” We give below some 
of the more important examples of exceptional 
sets. For simplicity, we limit ourselves to the n- 
dimensional Euclidean space R” (n z 2). 


B. Sets of Harmonic Measure Zero 


Denote by ze the characteristic function of 

a set E on the boundary OD of a bounded 
domain D in R". We call the thypofunction 
H,, and thyperfunction H. (— 120 Dirichlet 
Problem) the inner and outer harmonic mea- 
sures of E (with respect to D), respectively. 
When they coincide, the function is called the 
harmonic measure of E. A necessary and suffi- 
cient condition for E to be of inner harmonic 
measure zero is that u<0 hold in D whenever 
a tsubharmonic function u bounded above in 
D satisfies limsupu(P)<0 as P tends to any 
point of ôD — E. This theorem implies the 
following uniqueness theorem: If h is bounded 
and harmonic in D, if E 1s a set of inner har- 
monic measure zero on OD, and if h(P) 0 as P 
tends to any point of 9D — E, then hz 0. A 
necessary and sufficient condition for E to be 
of outer harmonic measure zero is that there 
exist a positive tsuperharmonic function v in D 
such that v(P)— oo as P tends to any point of 
E. (Concerning the existence of a limit for a 
subharmonic or tharmonic function at every 
boundary point except those on a set of har- 
monic measure zero, — 193 Harmonic Func- 
tions and Subharmonic Functions.) 


C. Sets of Capacity Zero 


Although there are many kinds of capac- 
ity (^ 48 Capacity), here we consider only 
tlogarithmic capacity and a-capacity (x > 0). 
Let K be a nonempty compact set in R”. Set 
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W(K)=inf, f ( PO *du(P)du(Q), where p runs 
through the class of nonnegative "Radon mea- 
sures of total mass 1 supported by K, and 
write C,(K) -(W(K)) Us, Define C,(@) «0 

for an empty set @. For a general set ECR", 
define the inner capacity by supy_,C,(K) and 
the outer capacity by the infimum of the inner 
capacity of an open set containing E. When 
the inner and outer capacities coincide, the 
common value is called the o-capacity (or 
capacity of order x) of E and is denoted by 
C,(E). We denote the logarithmic capacity of E 
by Co(E). In order that Cy(K)=0 (n —2) or the 
*Newtonian capacity C,. ,(K)20 (n>3) fora 
compact set K, it is necessary and sufficient 
that the harmonic measure of K with respect 
to G— K vanish for any bounded domain G 
containing K. Then K is removable for any 
harmonic function that is bounded or has a 
finite tDirichlet integral in G— K. In general, 
K is said to be removable for a family F of 
functions if for any domain G containing K 
and feF defined in G — K, there exists ge F 
defined in G such that g = f in G— K. Let K be 
a compact set in R? with C,(K)=0 and G be a 
domain containing K. Let f be a holomorphic 
function defined in G — K for which every 
point of K is an tessential singularity. Then the 
set of texceptional values for f at every point 
of K is of logarithmic capacity zero. If f is 
tmeromorphic in |z] « and 


| 
acre aya? 


then f has a finite limit in any angular domain 
with a vertex at every point of |z|= 1 except for 
those belonging to a set of «-capacity zero 
(logarithmic capacity zero if «=0). 


|z|)*dxdy<oo (O<a<1), 


D. Hausdorff Measure 


Let «>0 and a set E be given in Rn Denote by 
c a covering of E by a countable number of 
balls with radii d,,d,,..., all of which are 
smaller than e (7 0). As £20, inf, >, d? in- 
creases. The limit is called the Hausdorff mea- 
sure of E of dimension « and is denoted by 
A,(E). In order for a compact set K to be 
removable for the family of harmonic func- 
tions defined in a bounded domain and satisfy- 
ing the *Hólder condition of order a, it is 
necessary and sufficient that A,_,,,(K)=0. 
Next, suppose that K is a compact set in a 
plane and the complement G of K with re- 
spect to the plane is connected. Set || f |, — 

(f fal. fit dx dy)! for f holomorphic in G and 
q, 1<q<«, and ||f ||, ^supg|f]. Denote by 
H” the family of f#0 with || f ||, « oo. If p is 
defined by i/p+1/q=1, then A; ,(K) « oo 
implies H*— @ for a, 2«q« oo, and A,(K)=0 
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implies H” = @. Moreover, H? = Ø if and 
only if C,(K)—0, and H* = Ø implies C,_,(K) 
—0 for q, 2«qx oo [1]. 


E. Null Sets Defined with Respect to Families 
of Functions 


Conversely, L. V. Ahlfors and A. Beurling 
characterized the size of sets in a plane by 
means of families of functions [2]. Let D bea 
domain, and let f represent a holomorphic 
function in D. Fix a point zo in D. Set 


8-(fllfi«t 


>= iri || IEN 


€ — ( f |the area of R^z x], 


where R° is the complement of the trange R 
of (f(z) — f(z9)) !. Denote by SB, SD, SE 
the families consisting of constants and funiva- 
lent functions in 8, D, €, respectively. Use 
the notation $y to represent any one of these 
six families, and define M4 — Ma (zo; D) by 
sup(1f (9| fe). Then Ma-Mez My= 
Mag 2 Meg — Mop, and Ma(zo; D) - 0 im- 
plies Miz D)=0 for any zeD. 

Denote by N; the class of compact sets K 
such that the complement K* of K is con- 
nected and M,(z; K*) 20. We call KeN& a 
null set of class Nx. In order for K to be re- 
movable for B or D, it is necessary and suffi- 
cient that K € Ng or € Np, respectively. We in 
general have 


If A, (K) —0, then K e Ng. There exists a set 

K € Ng with A,(K)- 0 (A. G. Vitushkin, Dokl. 
Akad. Nauk SSSR, 127 (1959); J. Garnett, Proc. 
Amer. Math. Soc., 21 (1970)). When K isa 
subset of an analytic arc A, K e Ng implies 
A,(K)=0, Ns is equal to Neg, and K be- 
longs to Np if and only if C,(A)=C,(A — K). 

If an tanalytic function has an essential sin- 
gularity at every point of K € Na, then any 
compact subset of the set of exceptional values 
at every point of K belongs to Na. A necessary 
and sufficient condition for K € N, is either 
that the complement of any one-to-one fcon- 
formal image of K* be of plane measure zero 
or that any *univalent analytic function in Kr 
be reduced to a 'linear fractional function. If 
the union of at most a countable number of 
Ng or Ny sets is compact, it belongs to the 
same class. But it not true for Neg sets [3]. 


F. Analytic Capacity 


For a compact set K, let Dg be the unbounded 
connected component of K*. The quantity 
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Maio", Dx) is called the analytic capacity of K 
and is denoted by a(K). If «(K)- 0, there exists 
an extremal function f;(z) - a(K)z '+..., 
called the tAhlfors function, that maps D, 
onto a covering surface of the unit disk. In 
general, «(K) is not greater than the logarith- 
mic capacity C,(K). If K is a continuum, then 
a(K)=C)(K), and «(K) is attained by and only 
by fo(z), which maps Dx onto |w| « 1 conform- 
ally and z 2 oo to w=0. For a linear set K, 
a(K) is equal to a quarter of its length (C. 
Pommerenke, Arch. Math., 11 (1960)). The 
concept of analytic capacity is basic for the 
theory of rational approximation on compact 
sets. 
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A tcontinuous mapping f from the interval 
I={t|0<t<1} into a topological space Y is 
called a path connecting the initial point f(0) 
and the terminal point f(1). In particular, a 
path satisfying f(0) 2 f(1)— y, is called a loop 
(or closed path) with y, as the base point. For a 
path f, the inverse path f of f is defined by f(t) 
= f(1— t). When the terminal point of f and 
the initial point of g coincide, the path F de- 
fined by F(t)— ft) for0xtx 1/2 and F(t) — 
gQt—1)for 1/2<t<1 is called the product 

(or concatenation) of f and g, and is denoted 
by f: g. With [ f ] standing for the equivalence 
class of a path f under the relation of fhomo- 
topy relative to 0, 1 (EJ) (i.e., by homotopy 
with 0 and 1 being fixed), the inverse [ f] ! — 


[f] and the product [f]: [g] 2 Lf: 9] are 
defined. In particular, in the set of homotopy 
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classes of loops with one common base point 
yo, the product is always defined, and the set 
forms a group z,(Y, yo). This group is called 
the fundamental group (or Poincaré group) (H. 
Poincaré, 1985) of Y (with respect to yo). If Y is 
tarcwise connected, then z,(Y, yo) zt, y,) 
for an arbitrary pair of points yọ, y,, and the 
structure of the group is independent of the 
choice of the base point. This group is de- 
noted simply by z,(Y). A continuous mapping 
Q:(Y, yg) 2(Y', yo) induces a homomorphism 
9,4 :74(Y, yo) 27, (Y', yo) by sending [f] to 

9x Lf] [vof], and (p'o 9), 2 9, o Py holds 
for the composite o o of mappings. Thus 
n,(Y)is a ttopological invariant of Y. If x, (Y) 
consists of only one class (the class of the 
constant path), we say that Y is simply con- 
nected. For example, cells and spheres S" 

(n> 2) are simply connected. The famous 
*Poincaré conjecture states that a simply con- 
nected 3-dimensional compact tmanifold is 
homeomorphic to the 3-dimensional sphere. 
We have van Kampen's theorem: Let P be a 
connected polyhedron, P, and P, be its con- 
nected subpolyhedra such that P, N P, is con- 
nected, and P= P, U P,. Then z, (P) is isomor- 
phic to the group (famalgamated product) 
obtained from the "free product of z,(P,) and 
1, (P5) by giving the relations that the images 
of each element of z; (P, N P5) in x, (P,) and in 
1, (P5) are equivalent. Also, the fundamental 
group of the *product spaces is the direct 
product of the fundamental groups of the 
spaces involved. Any group is the fundamental 
group of some {CW complex. The Abelization 
1,/[1,,2,] of the fundamental group x, = 
n,(Y)(Y arcwise connected) is isomorphic to 
the 1-dimensional integral thomology group 
H,(Y). For example, the fundamental groups 
of a circle S! and a ttorus T" are an infinite 
cyclic group and a free Abelian group of rank 
n, respectively; the fundamental group of a 1- 
dimensional CW complex is a free group; and 
the fundamental group of an orientable 2- 
dimensional closed surface of *genus p is a 
group having 2p generators (a, ...,ay, b,, ..., 
bp} and a relation a,b,a;'b;'...a,b,a,'b, 1 = 
1. If xg is a fixed point of the circle $+, then 
the fundamental group can be defined as the 
set of all homotopy classes of continuous map- 
pings f:(S', xo) (Y, yo). 

Extending the definition of the fundamental 
group by replacing I, ST with I”, S", we obtain 
the n-dimensional homotopy group (— 202 
Homotopy Theory; 91 Covering Spaces). 
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171 (XX1.25) 
Galois, Evariste 


Evariste Galois (October 25, 1811—May 31, 
1832) was born in Bourg-la-Reine, a suburb of 
Paris. In 1828, while still in junior high school, 
he published a paper on periodic tcontinued 
fractions. Although he published four papers, 
his most important works were submitted to 
the French Academy of Science and either lost 
or rejected. He was unsuccessful in his attempt 
to enter the Ecole Polytechnique and instead 
entered the Ecole Normale Supérieure in 1829. 
Active in political affairs, he was expelled from 
school, imprisoned, and died in a duel soon 
after his release. 

The night before the duel, he left his re- 
search outline and manuscripts to his friend, 
A. Chevalier. These were published by J. 
Liouville in J. Math. Pures Appl., first series, 
11 (1846). The contents include the concept 
of groups and what essentially became the 
tGalois theory of algebraic equations. The 
manuscript also contained expressions such as 
"theory of ambiguity," which seems to indicate 
that Galois intended to study the theory of 
algebraic functions along the same lines. 
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A. History 


After the discovery of formulas giving the 
general solutions of algebraic equations of 
degrees 3 and 4 in the 16th century, efforts to 
solve equations of degree 5 remained unsuc- 
cessful. Early in the 19th century P. Ruffini 
and N. H. tAbel showed that a general alge- 
braic solution is impossible. Shortly after- 
ward, E. *Galois established a general prin- 
ciple concerning the construction of roots of 
algebraic equations by radicals. The principle 
was described in terms of the structure of a 
certain permutation group (the Galois group) 
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of the roots of the equation. Even in this 
original form of the theory (Galois theory), 
Galois not only completed the research started 
by J. L. Lagrange, P. Ruffini, and Abel, but 
also made an epochal discovery that opened 
the way to modern algebra. J. W. R. Dedekind 
(Werke III, 1894) interpreted this result as a 
duality theorem concerning the automorphism 
groups of a field. It was shown later that 
Galois theory plays an important role in the 
general theory of commutative fields estab- 
lished by E. Steinitz. In the 1920s, W. Krull 
generalized the idea of Dedekind, using the 
concept of topological algebraic systems, and 
obtained the Galois theory of infinite alge- 
braic extensions (Math. Ann., 100 (1928)). The 
Galois theory gives a model for a successful 
theory summarizing the essentials of separable 
algebraic extensions and has led to analo- 
gous theories for other algebraic systems. For 
example, E. R. Kolchin constructed an analo- 
gous theory for differential fields where the 
Galois groups are algebraic groups (— 113 
Differential Rings, Galois theory of differential 
fields). Another line of development of this 
theory, also originated by Dedekind (Werke 
III, 1876/77), led to the Galois theory of rings, 
an object of active research by N. Jacobson 
(Ann. Math., 41 (1940)), T. Nakayama, and 
others since the 1940s. Also, recently, the 
Galois theory for some general algebraic sys- 
tems containing inseparable fields has been 
constructed by M. E. Sweedler, U. S. Chase, 
and others in which Galois groups are re- 
placed by Hopf algebras or bialgebras (— 203 
Hopf algebras). 


B. Definitions 


Given a group G of tautomorphisms of a given 
tfield L, the subfield F(G) - (ae L|a" 2a,c€G] 
is called the invariant field associated with G. 
For any extension fields L, L' of K, an isomor- 
phism of L into L' whose restriction to K is 
the identity is called a K-isomorphism. If L is a 
*normal extension of K, any K-isomorphism 
of L into L' is a K-automorphism of L. If an 
algebraic extension L of K is normal, the 
group G(L/K) of all K-automorphisms of L is 
called the Galois group of L/K. À !separable 
normal algebraic extension of K is called a 
Galois extension of K. Let L/K be a finite 
normal extension; then there exist intermedi- 
ate fields M and N of L/K such that M/K is a 
Galois extension and N/K is a ?purely insep- 
arable extension and L= M Gy N (— 277 
Modules J). Further, G(L/K)— G(L/N) = 
G(M/K) and the order of G(L/K) is equal to 
the tseparable degree of L/K, i.e. [M: K]. A 
necessary and sufficient condition for L/K to 
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be a Galois extension is that the invariant field 
associated with G(L/K) be K. A Galois exten- 
sion L/K is called an Abelian extension or a 
cyclic extension when G(L/K) is tAbelian or 
teyclic, respectively (— 149 Fields). 


C. Fundamental Theorem of Galois Theory 


Let L/K be a finite Galois extension and G its 
Galois group. Then there exists a fdual lattice 
isomorphism between the set of intermediate 
fields of L/K and the set of subgroups of G, 
under which an intermediate field M of L/K 
corresponds to the subgroup H = G(L/M); 
conversely, a subgroup H of G corresponds to 
M =F(H). The degree of extension [L: M] is 
equal to the order of the corresponding sub- 
group H (in particular, [L:K] is the order 

of G), and [M : K] coincides with the index 

(CG HL If subfields M and M' are tconjugate 
over K, then the corresponding subgroups 
G(L/M) and G(L/M’) are conjugate to each 
other in G, and vice versa. In particular, M/K 
is a Galois extension if and only if the sub- 
group H corresponding to M is a *normal 
subgroup of G, and in this case, the Galois 
group G(M/K) is isomorphic to the factor 
group G/H. 


D. Extensions of a Ground Field 


Let L/K be a finite Galois extension, K’/K any 
extension, and L’ the tcomposite field of L and 
K’. Then L'/K' is also a Galois extension, and 
its Galois group is isomorphic to G(L/LN K’) 
by the restriction map. 


E. Normal Basis Theorem 


Let L/K be a finite Galois extension with 
Galois group G. Then there exists an element u 
of L such that (u*|a € Gj forms a basis for L 
over K called a normal basis. If we denote by 
KEG] the *group ring of G over K, a *K[G]- 
module structure can be introduced in L by 
the operation Xa,o(x) 2 a,x’; the existence 
of a normal basis implies that L is isomorphic 
to K[G] itself as a K[G]-module, or in other 
words, that the K-linear representation of G 
by means of L is equivalent to the tregular 
representation of G. 


F. Examples of Galois Extensions 


(1) tCyclotomic Fields. Let m be a positive 
integer not divisible by the tcharacteristic of K; 
C a *primitive mth root of unity, and L= K(C). 
Then L/K is an Abelian extension, and its 
Galois group is isomorphic to a subgroup of 
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the treduced residue class group (Z/mZ)*; 

in particular, if K 2 Q, the subgroup coin- 
cides with (Z/mZ)*, by the irreducibility of 
tcyclotomic polynomials. Hence the degree 
[Q(O:Q1] is equal to (m), where ¢ is *Euler's 
function. 


(2) Finite Fields. A ‘finite field K has nonzero 
characteristic p, and the number q of elements 
of K is a power of p. Also, K is uniquely deter- 
mined by q (up to isomorphism), hence it is 
denoted by GF(q) or F,. Thus GF(q") is the 
only extension of GF(q) of degree n; moreover, 
it is a cyclic extension. 


(3) Kummer Extensions. Assume that K con- 
tains a primitive mth root ¢ of unity and the 
characteristic of K is 0 or is not a divisor of m. 
Denote by K* the multiplicative group of K. 
An extension L of K can be expressed in the 
form L=K(2/a,, sas Gs) (a;€ K) if and only if 
L/K is an Abelian extension and all oe G(L/K) 
satisfy o" = 1; in this case L/K is called a Kum- 
mer extension of exponent m. There exists a 
one-to-one correspondence between Kum- 
mer extension L of exponent m over K and 
finite subgroups H/(K*)" of the factor group 
K*/(K*)", given by the relations H 2 L"(Q K*, 
L=K QYH ). Moreover, there exists a canon- 
ical isomorphism between H/(K*)" and the 
*character group of G(L/K), so that H/(K*)" is 
isomorphic to G(L/K). Let L = K(0) be a cyclic 
Kummer extension of degree m of K, and let o 
be a generator of the Galois group G(L/K). 
Then the Lagrange resolvent (£, 0) = 6+ £07 
+...4+67 107" satisfies (5,0) — C! (6,0), 

(C, 0)" e K, and 0 and its conjugates can be 
expressed in terms of (6,0). In particular, L is 
generated by (6,0) over K. 


(4) Artin-Schreier Extensions. Assume that K is 
of characteristic p #0. For any element a of an 
extension of K, we denote by Za the element 
a? —a and by (1/P)a a root of PX —a=0. 

A finite extension L of K is of the form L= 
K((1/2)a,, ...,(1/2)a,) (a;€ K) if and only if 
L/K is a Galois extension whose Galois group 
is an Abelian group of texponent p; in this 
case, L/K is called an Artin-Schreier extension. 
There exists a one-to-one correspondence 
between Artin-Schreter extensions L over K 
and finite subgroups H/AK of the additive 
group K/AK, given by the relations H = 
PLANK, L=K((1/P)H); moreover, H/PK is 
isomorphic to the character group of G(L/K) 
(therefore also to G(L/K) itself). More gen- 
erally, for Abelian extensions L of exponent p" 
De, Galois extensions whose Galois groups 
are Abelian groups of exponent p"), we obtain 
similar descriptions by using the additive 
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group of fWitt vectors of length n instead of 
K (— 449 Witt Vectors). 


G. Galois Group of an Equation 


L/K is a finite Galois extension if and only if L 
is a *tminimal splitting field of a fseparable 
polynomial f(X) in K[X]. In this case, we call 
G(L/K) the Galois group of the polynomial 
f(X) or of the algebraic equation {(X)=0. The 
equation f(X)=0 is called an Abelian equa- 
tion or a cyclic equation if its Galois group is 
Abelian or cyclic, respectively, while f(X)=0 is 
called a Galois equation if L is generated by 
any root of f(X) over K. Generally, G(L/K) 
can be ffaithfully represented as a permutation 
group of roots of f(X)=0. If this group is 
tprimitive, then f(X)=0 is called a primi- 

tive equation. The index of the group in the 
group of all permutations of roots ts called 

the affect of the equation f(X) —0; if the affect 
is 1, the equation f(X)=0 is called affect- 

less. Let u,,...,u, be *algebraically indepen- 
dent elements over K. Then for the polyno- 
mal F(X)7 X"—u, X"! -...-(—1)'u, in 
K(u,, ...,u) [X], the equation F,(X)=0 is 
called a general equation of degree n. The 
Galois group of F,( X) —0 is isomorphic to the 
tsymmetric group ©, of degree n, and if K is 
not of characteristic 2, then the quadratic 
subfield corresponding to the talternating 
group A, is the field K(,/D) obtained by 
adjoining the quadratic root of the *discrimi- 
nant D of F(X). 


H. Solvability of an Algebraic Equation 


Assume that K is of characteristic 0, f(X)e 
K[X], and L is the minimal splitting field of 
f(X). We say that the equation {(X)=0 is 
solvable by radicals if there is a chain of sub- 
fields K - Lyc L,c...c L,—L such that L;= 
Lus KÉN with some deel, ,, and this is the 
case if and only if the Galois group of f(X) 

is fsolvable (Galois). In particular, Abelian 
equations are solvable by radicals. Cyclic 
equations are solved by using the Lagrange 
resolvent, and theoretically the general solv- 
able equation can be solved by repeating this 
procedure. Since S, is solvable if and only if 
n x4, it follows that a general equation of de- 
gree n is solvable only if n— 1, 2, 3, 4 (Abel). 
(For a method of solving these equations 

— 10 Algebraic Equations D.) Also, a poly- 
nomial is solvable by square roots if and only 
if the order of the Galois group is a power of 
2. This fact enables us to answer some ques- 
tions concerning geometric construction prob- 
lems such as trisection of an angle or division 
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of a circumference in equal parts (— 179 
Geometric Construction). 


I. Infinite Galois Extensions 


If a Galois extension L/K ts infinite, then its 
Galois group is an infinite group. Let {M,} be 
the family of intermediate fields of L/K that 
are finite and normal over K, and put H, 

— G(L/M,). Then by taking {H,} as a tbase of 
a neighborhood system of the unity element, G 
becomes a ‘topological group. This topology is 
called the Krull topology (Krull, Math. Ann., 
100 (1928)). G is then isomorphic to the *pro- 
jective limit of the family of finite groups 
{G/H} and is 'totally disconnected and *com- 
pact. There is a one-to-one correspondence 
between the set of intermediate fields of L/K 
and the set of closed subgroups of G given by 
the map (Galois group) e (invariant field), and 
thus we have a generalization of Galois theory 
for finite extensions as described in Section C. 
Various theories, including the theory of Kum- 
mer extensions, can be generalized to the case 
of infinite extensions (— 423 Topological 
Groups). 


J. Galois Cohomology 


Let L/K be a finite Galois extension and G its 
Galois group. Then both the additive group 

L and the multiplicative group L* have G- 
module structures. The tcohomology groups 
of G with coefficient module L are 0 for all 
dimensions because of the existence of a 
normal basis {— 200 Homological Algebra). 
As for the multiplicative group L*, we have 
H°(G, L*)= K*/N(L*) (N is the tnorm N, x), 
H'(G, L*)=0 (Hilbert’s theorem 90 or the 
Hilbert-Speiser theorem). Jn particular, if G is a 
cyclic group with generator oc, then every 
element a such that N(a) — 1 can be expressed 
in the form a— b! *. H?(G, L*) is isomorphic 
to the "Brauer group of 'central simple alge- 
bras over K which have L as a ‘splitting field. 
In the case of number fields, a number of 
G-modules arise, such as *principal orders, 
tunit groups, "ideal groups, tidele groups, and 
so on, whose cohomological considerations 
are important (— 6 Adeles and Ideles; 59 Class 
Field Theory). Further, let A be a group (not 
necessarily commutative), and suppose that G 
acts on A. We denote by ^a the action of ce G 
on a€ A. A is called a G-group if "(ab) —"^a?b 
for all a, be A and o eG. Then, in the same way 
as for G-modules, we can define the Oth coho- 
mology group H?(G, A) to be the subgroup A9 
of A consisting of all elements of A left fixed by 
G. The map a:ora, from G into A such that 
a,,=4,°d, (c, v € G) is called a 1-cocycle with 
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values in A. Denote by Z!(G, A) the set of 

the 1-cocycles of G with values in A. Two 1- 
cocycles a and a’ in Z! (G, A) are called co- 
homologous if there exists an element be A 
such that a; 2b !a,"b for all ceG. This is an 
equivalence relation in Z!(G, A) and the 1- 
cohomology set H!(G, A) of G with values in 

A is defined to be the set of cohomologous 
classes of Z!(G, A). H'(G, A) does not have the 
structure of a group in general, but there does 
exist an element called an identity, namely, the 
cohomologous class containing the trivial 1- 
cocycle. In general, a set X with an element p 
of X is called a pointed set; for any two pointed 
sets (X, p) and (Y, q), a map f: X + Y is called 

a morphism of pointed sets if /(p) — q. For 
pointed sets (X, p), (Y, q), and (Z,r), a sequence 
of morphisms X 4,Y4Z of pointed sets is 
called exact if Im f = Kerg. The 1-cohomology 
set H!(G, A) with the identity can be regarded 
as a pointed set. Therefore, exact sequences of 
these sets make sense and possess some of the 
properties of cohomology groups in the com- 
mutative case. For example, let 1-34 B 
C1 be an exact sequence of G-groups (and 
A is central in B); then 


H°(G, A)> H°(G, B) > H*(G, C) 
+ H'(G, A)> H'(G, B) > H' (G, C) (2 H2(G, A)) 


is an exact sequence of pointed sets. For 

a Galois extension L/K with Galois group 
G, a linear algebraic L-group defined over 
K (— 13 Algebraic Groups) has naturally 
the structure of a G-group, and we have 
H'(G,GL,(L))=0. Applying the above exact 
sequence to 1 SL,(L)  GL,(L)88 L* — ], 

we have H!(G, SL,(L)) «0. Also, we have 
H'(G, Sp,,(L))=0 for any integer nz 1. It is 
difficult to define higher cohomology sets 
naturally, but various methods to define them 
have been obtained. 

In many cases, we are more concerned with 
the *category of Galois extensions of K with 
K-isomorphisms between them than with a 
single extension L/K. In other words, we con- 
sider a tfunctor L> (L) of the category of 
Galois extensions of K into the category of 
(Abelian) groups, and study the cohomology 
related to G(L/K)-module or G(L/K)-group 
structures derived from (L). In the case of 
infinite algebraic extensions, we consider the 
inductive limit of cohomology of subfields 
of finite degrees, making use of continuous 
cocycles of Galois groups relative to the Krull 
topology [8,9]. 

Let L/K be a Galois extension with Galois 
group G. Consider objects X, Y defined over K 
on which the extension of the ground field is 
defined (such as K-linear spaces with certain 
tensors on them, algebras over K (associative 
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or not), K-varieties, or K-algebraic groups 
defined over KL When X and Yare isomor- 
phic over L, Y is called a L/K-form of X. Let 
E(L/K, X) be the set of all isomorphism classes 
of X over L. Let A be the group of all auto- 
morphisms over L of X. For a L/K-form Y 
of X and an isomorphism f: X > Y over L, 
since G acts on X and Y, one can define the 
isomorphism ^f : X ^Y over L that satisfies 

"( f(x)) 2 ?f(*x). In particular, A is a G-group 
by this action. Further, for eet, define a, 
=f !-%eA, then a:o a, is a 1-cocycle of G 
with values in A that corresponds to a L/K- 
form Y of X. This induces a bijection from 
E(L/K, X) onto H!(G, A). Thus L/K-forms 
of X can be classified 1f one can determine 
H'(G, A). For example, H' (G, Sp,,(L)) 20 is 
equivalent to saying that an L/K-form of a 
skew-symmetric bilinear form on a K-linear 
space of dimension 2n is unique up to K- 
isomorphisms. L/K -forms of a semisimple 
Lie algebra g over K can be classified by the 
1-cohomology set of the algebraic group 
Aut(g @ gL). (For the L/K-forms of algebraic 
groups — 13 Algebraic Groups M.) This de- 
scent theory can also be discussed for more 
general categories. 


K. Galois Theory of Rings 


The theory of tcentralizers in simple algebras 
can be interpreted as the theory of a certain 
Galois correspondence with respect to inner 
automorphism groups. Also, by using *crossed 
products, we can deduce from the theory of 
centralizers in simple algebras the Galois 
theory of commutative fields. On the other 
hand, Jacobson obtained a Galois theory of 
‘division rings with respect to finite groups of 
outer automorphisms that is similar to the 
commutative case. Since then, many alge- 
braists have proceeded with investigations that 
aim either at unifying these two theories by 
admitting inner automorphisms in the group 
of automorphisms, at extending the theory 
from division rings to general rings such as 
simple rings, *primitive rings, or *semiprimary 
rings, or at weakening the finiteness con- 
ditions. One principal method in these theories 
lies in considering first the endomorphism ring 
Hom,(S, S) for an extension S/R and then the 
roles of endomorphisms (or derivations) in it 
[6] (— 29 Associative Algebras). 

Let K be a field of characteristic p >0 and 
L/K be a finite purely inseparable extension 
such that L” c K. The set of all tderivations of 
L/K forms a restricted Lie algebra D(L/K) 
over K. Then there exists a one-to-one and 
dual lattice isomorphic correspondence be- 
tween the set of intermediate fields M of L/K 
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and the set of restricted Lie subalgebras H of 
D(L/K), given by the relations H — D(L/M) 
and M = (aeL|d(a) «0, de H} (Jacobson [7]). 
Namely, in the case of purely inseparable ex- 
tensions, derivations or higher derivations play 
the role of automorphisms, and the bialge- 
bras defined by such derivations correspond 
to Galois groups of Galois extension. Now, 
the group algebra KG of a group G over K 
and the (restricted) universal enveloping alge- 
bra of a (restricted) Lie algebra over K both 
have the structure of a *Hopf algebra. From 
this fact, unifying the Galois theory of Galois 
extensions and Jacobson's theory for purely 
inseparable extensions and using bialgebras 
or Hopf algebras, one can construct Galois 
theories of more general objects containing 
certain nonseparable field extensions (see M. 
E. Sweedler, Ann. Math., 87 and 88 (1968); S. 
U. Chase and Sweedler, Lecture notes in math. 
97, Springer, 1969). 
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A. Introduction and Historical Highlights 


Game theory consists of mathematical models 
used in the study of decision making in situ- 
ations involving conflict and cooperation. A 
conflict arises when each player in a game 
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selects from a list of alternatives one which, 
possibly together with chance and random 
events, leads to various outcomes over which 
the players have different preferences; thus the 
behavior of one player aiming at his own 
favorable outcome might induce unfavorable 
outcomes for others. Although the potential 
outcomes usually bring about conflicts among 
the players, there may be room for cooper- 
ation among some of them. Game theory 
attempts to extract that which is common and 
essential to such situations, to handle them by 
means of mathematical methods, and to pro- 
vide a normative guide to rational behavior 
for cach of the players. Game theory thus goes 
beyond classical theories of probability and 
decision making that are sufficient to solve 
games involving just one player and chance. 
Modern game theory started in 1944 with 
the publication of the monumental book by 
von Neumann and Morgenstern [1]. These 
authors presented many logical classifications 
of games, including a distinction between two- 
person and n-person games, between constant- 
sum (zero-sum) and general-sum games (de- 
pending upon whether the sum of payoffs 
to the players is constant (zero) or not), and 
between noncooperative and cooperative games 
(depending upon whether any collaboration 
among the players is prohibited or allowed). 
The second edition included a remarkable 
expected utility theory, which has become a 
mainstay of game theory. Von Neumann and 
Morgernstern also gave three representations 
of games. The first representation is the so- 
called extensive form; this representation has 
been slightly modified by Kuhn [2]. The sec- 
ond representation is the normal-form game; 
this is the form which the minimax theorem 
for two-person zero-sum games was estab- 
lished [3]. This theorem was generalized to n- 
person general-sum noncooperative games by 
Nash [4]. Finally, the characteristic-function 
form was the one in which the authors devel- 
oped the theory of stable sets. Several other 
solution concepts, such as the core, the Shap- 
ley value, and the bargaining set, have since 
been defined for games in characteristic- 
function form. Historically, the development of 
game theory has been closely related to vari- 
ous areas of pure mathematics, such as analy- 
sis, topology, geometry, and the foundations of 
mathematics. A survey of game theory up to 
1957 was presented by Luce and Raiffa [5]. 
In the 1950s and early 1960s, several major 
papers appeared in five issues of the Annals of 
Mathematics Studies [6]. Lucas [7] has pre- 
sented a good survey of developments to 1972, 
and Shubik [8] has surveyed the development 
of the field through 1981. Current articles 
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appear mainly in the International Journal 

of Game Theory, and also in the journals in 
fields such as operations research, manage- 
ment science, economics, political science, and 
psychology. 


B. The Extensive Form 


An n-person game in extensive form is repre- 
sented by a game (tree De, a connected graph 
with no cycles (— 186 Graph Theory)) having 
the following properties: There is one special 
vertex corresponding to the starting point of 
the game. Each nonterminal vertex corre- 
sponds either to a move of one of the n players 
or to a chance move. The edges ascending from 
a vertex denote the alternatives of the player at 
this vertex. For each terminal vertex there is 
an n-dimensional vector whose components 
represent the payoffs to each player. The state 
of a player's information at any stage can be 
described by certain subsets of the set of all his 
vertices, called information sets. At each of his 
moves, player i knows which information set 
he is in, but not which vertex he occupies 
within this set. A local strategy for player i is a 
tprobability distribution over the set of all 
alternatives at each of his information sets. A 
behavior strategy for player i is a function 
which assigns a local strategy to all of his 
information sets. À pure strategy is a special 
behavior strategy that assigns a particular 
choice to each information set. Kuhn [2] 
showed the existence of pure optimal strategies 
for n-person general-sum noncooperative 
games with perfect information (i.e., games in 
which all information sets contain a single 
vertex) as a generalization of the result for 
two-person zero-sum games given in [1]). 
Kuhn also proved the existence of equilibrium 
behavior strategies for games with perfect 
recall (i.e., games in which player i at any of his 
information sets remembers all his prior moves 
but is not aware of the prior choices of the 
other players). The definition of “equilibrium 
strategy" is given in the next section. Because 
of the complexity of n-person general-sum 
games, research on them has been minimal 
since the work of Kuhn. Recently a new attack 
on them has begun; for example, the Nash 
equilibrium point in extensive forms has been 
reexamined by Selten [9]. 


C. The Normal Form (or Strategic Form) 


An n-person game in normal form is specified 
by iN, AEN (Cien) where N — (1, Sak, nj 
is a set of players, X' is the set of player i's 

strategies, and F' is a real-valued function on 
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the Cartesian product []?_, X^, called player i’s 
payoff function. Chance is incorporated by 
invoking an extra set X? of chance moves and 
a probability distribution on X?. A two-person 
zero-sum game or matrix game is the simplest 
case in which the existence of an equilibrium 
point has been established. Equilibrium fol- 
lows from von Neumann's minimax theorem: 
Let M! 2 (1,...,m,], M? (1,...,m;] be the 
sets of pure strategies that may be chosen by 
two players. Let a;; be player 1’s payoff when 
strategies i and j are taken by players 1 and 2. 
A mixed strategy for player i is a probability 
distribution on M'. Sets of mixed strategies 
for players 1 and 2 are thus given by X! = 
(xe R" (Em, x, 21, x; z0 "ieM!) and X?— 
{xe R^ | DM, x, 21, x; z 0 "ie M?}. If players 

| and 2 use the mixed strategies x! and x?, 
respectively, the expected payoff for player 1 
is F'(x!, x*) - Y; X;xl a,x}, which is a pay- 
off function for player 1. Von Neumann [3] 
proved 

max min F!(x!,x?)= min max F!(x!, x?). 
xleX! x?e X? x2eX? xiex! 

We say that a pair (£t, X?) satisfying the 
foregoing minimax theorem is an equilibrium 
point. This pair (£!, €?) turns out to be a saddle 
point (— 292 Nonlinear Programming A) of 
F!(x!, x?). The duality theorem (— 255 Linear 
Programming B) is mathematically equivalent 
to this minimax theorem. A generalization of 
this equilibrium to n-person general-sum non- 
cooperative games was presented by Nash [4]. 
An n-tuple (£+, ..., x") (£'e X^) is a Nash equilib- 
rium if, for each i, 


F(21, ..., $571, A EL gm 
ER, cp xi $t, x) forall et 


That is, no player can improve his payoff by 
changing his strategy if all other players con- 
tinue to use the same strategies. Nash demon- 
strated the existence of this equilibrium for n- 
person general-sum noncooperative games 
with finite pure strategies. The existence of 
Nash equilibria for wider classes of nonco- 
operative games is proved by means of fixed- 
point theorems (— 153 Fixed-Point Theorems). 
Recently, multistage games, such as super- 
games and stochastic games in which games 
are played repeatedly, have become a major 
research topic of noncooperative game theory 
(with work being done on both extensive and 
normal forms). In two-person general-sum 
games or bimatrix games, there arise possibil- 
ities of cooperation (or bargaining) between 
two players. A solution concept for such situa- 
tions, the Nash bargaining solution, was pro- 
posed by Nash [10]; this solution was sub- 
sequently discussed by Luce and Raiffa [5]. 
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D. The Characteristic-Function Form (or 
Coalitional Form) 


An n-person cooperative game in characteristic- 
function form is given by a pair (N,v), where N 
— (1L ..., nj is a set of players and v is a real- 
valued function on a set of all subsets of N 
with v(@)=0, called a characteristic function. 
Sometimes v is assumed to be superadditive, 
ie,v(SUT)zvw(S) - v(T) 'S, TEN with $n 
T#@. v(S) represents the worth or the power 
achievable by the subset (coalition) $ when 

its members cooperate regardless of the be- 
havior(s) of the players in the complement of 
S. An n-dimensional vector x 2(x,, ..., x,) is 
said to be an imputation if it satisfies (i) x; > 
v({i}) "ieN (individual rationality) and (ii) 
Dien X; — v(N) (group rationality). The set of all 
imputations, denoted by A, represents all 
reasonable or realizable ways of distributing 
the available gains among the n players. An 
imputation x is said to dominate another im- 
putation y if there is some nonempty coalition 
S such that (i) x; y; "ie S and (ii) X, x, « 
v(S) (the effectiveness of S with respect to x). 
The original solution concept for games in 
characteristic-function form given by von 
Neumann and Morgenstern [1] is now called 
the stable-set or von Neumann- Morgenstern 
solution. A subset K of A is a stable set if (i) 
no dominance relation exists between any 
two elements of K (internal stability) and (ii) 
any imputation outside K is dominated by 
some imputation of K (external stability) The 
existence of stable sets was settled negatively 
by the ten-person example of Lucas [11]. This 
example, however, is rather specialized, and 
thus the existence of stable sets may yet be 
proved for a large class of games. Stable sets 
have been characterized for several classes 

of games, and these sets accurately reflect 

the coalition-forming processes among the 
players. Thus stable-set theory remains a ma- 
jor research topic in the theory of games in 
characteristic-function form. Many other 
solution concepts have been developed since 
that of the stable set. One of these is the core, 
defined by Gillies in [6] (its naive idea had 
already appeared in [1]). The core is the set 

C —(xeA|Xsx;z v(S) SSN}. This says that 
no coalition can protest against or block an 
imputation in the core on grounds that the 
coalition can expect more. For superadditive 
games the core coincides with the set of un- 
dominated imputations, and thus the core is a 
subset of any stable set if both exist. The con- 
dition for nonemptiness of the core was de- 
rived by Shapley [12] using the duality theo- 
rem. Another solution concept, defined by 
Shapley in [6], is known as the Shapley value. 
The Shapley value is a function q(v) 2 (q, (v), 
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.--5@,(v)) sending an arbitrary characteristic 
function v on N into an n-dimensional Eucli- 
dean space satisfying the conditions (i) q, (zv) 
= qi(v), where z is any permutation of N and 
nv(nS) - v(S) “SEN; (ii) Dies (v) = v(S) "SEN 
such that o(T)=v(SN T) "T€ N; and (iii) 

lr + w) = e, (v) + o;(w) "ie N and for any two 
games v and w. These three axioms uniquely 
determine the value 





Daes 
p= y, SIE (o(8) (5 i) 
en 


for each i, where s is the number of players in 
S. The Shapley value is an imputation, and 
can be viewed as a fair-division solution since 
the three axioms are desirable properties for 
any equitable allocation scheme. From the 
above formula, the Shapley value can also be 
interpreted as the average of the marginal 
contributions of the players in a coalition. 

A bargaining-set concept was proposed by 
Aumann and Maschler in [6]. This set de- 
scribes what payoffs are stable once a partic- 
ular coalition structure (a tpartition of N) has 
formed. Briefly, a payoff associated with a 
coalition structure is stable or in a bargain- 
ing set if there is no objection to it from any 
player; or, even with an objection, if there 
exists a counterobjection to such an objection 
from other players. For details — [6]. Since 
there are many different ways to define objec- 
tion and counterobjection, there are various 
types of bargaining sets. Some of them are 
known to be nonempty. Two additional solu- 
tion concepts, the kernel and the nucleolus, 
derive from investigations into particular 
bargaining sets. The kernel, introduced by 
Davis and Maschler [13], is always a non- 
empty subset of its bargaining set. More impor- 
tant is the nucleolus defined by Schmeidler 
[14], which is a unique imputation in the 
kernel and thus in the bargaining set. It is 
also in the core if the latter is nonempty. The 
"excess" of any nonempty S € N for an imputa- 
tion x is defined by e(x, S) 2 v(S) aas, The 
excess represents the dissatisfaction or the 
complaint of the coalition S with respect to x. 
The nucleolus is the imputation which mini- 
mizes the largest excess. If we have a tie, i.e., 
if the maximum excess attains a minimum 

at several imputations, then the next largest 
excess is to be compared, and so on. That 

is, the nucleolus is the tlexicographical mini- 
mum in the ordering of these arrangements. It 
can be computed by solving a series of linear 
programs. 

Several generalizations and variations of 
the classical von Neumann- Morgenstern for- 
mulation of games in characteristic-function 
form have also been investigated, such as the 
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games without side payments studied by 
Aumann and others (— e.g., M. Shubik (ed.), 
Essays in Mathematical Economics: In honor of 
Oskar Morgenstern, Princeton Univ. Press, 
1967); the games in partition-function form 
proposed by Thrall and Lucas (R. M. Thrall, 
and W. F. Lucas, Naval Res. Logistics Quart., 
10 (1963), 281—298); and the games with in- 
finitely many players, in connection with which 
the Shapley value theory has become a major 
research topic [18]. 


E. Applications; Related Areas of Mathematics 


Game theory has been applied to many fields, 
such as economics, political science, manage- 
ment science, operations research, information 
theory, and control theory, as well as to pure 
mathematics. Games in extensive form are 
now important tools for analyzing the effects 
of information, and thus for solving many 
decision problems with uncertainty. Non- 
cooperative games in normal form and Nash 
equilibria have been used in the study of many 
phenomena, including oligopolistic markets 
(Friedman [15]), bidding processes, electoral 
competition, resource allocation, and arms 
control. Cooperative games have been success- 
fully applied to economics, and the relation 
between the core and competitive equilibria 
sheds further light on the theory of competi- 
tive economy. It is generally true that com- 
petitive equilibria are contained in the core. 
Debreu and Scarf [16] demonstrated that 

if the number of players approach infinity 

in a certain manner, the core shrinks to the 

set of competitive equilibria. By working in 
measure-theoretic terms, Aumann [17] was 
able to identify the core with the set of com- 
petitive equilibria. It was also demonstrated 
by Aumann and Shapley [18] that the set of 
competitive equilibria (and hence also the 
core) converges to the Shapley value under 
such formulations, provided certain conditions 
are satisfied. Another major application of 
cooperative game theory has arisen in pcittical 
science, wherein value-type solutions, such as 
the Shapley value, are widely used as indices of 
the power of each participant in various voting 
situations. Major applications are to problems 
of cost allocation for public goods such as 
water resources [19], public transportation 
systems, and telephone systems; in such appli- 
cations the core, the Shapley value, and the 
nucleolus have all been employed. The books 
[8, 20-22] are good references to the most 
recent applications of game theory. 

Game theory also has many close relations 
with various areas of pure mathematics. The 
following are typical examples. The study of 
certain types of games in extensive form has 
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increased our understanding of the axiom of 
choice (— 34 Axiom of Choice and Equiva- 
lents A) and other foundational questions. 
Nash equilibrium is closely related to fixed- 
point theorems (— 153 Fixed-Point Theorems) 
and to separation theorems (— 89 Convex Sets 
A). Cooperative game theory also has many 
connections to functional analysis and to 
convex analysis. 

Finally, we mention that the study of dif- 
ferential games (— 108 Differential Games) is 
also highly developed and widely used in areas 
such as economics and control theory. 
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Gamma Function 


A. The Gamma Function 


The function T(x) was defined by L. Euler 
(1729) as the infinite product 


1 e [XE xA 
ET ia) et 


Legendre later called it the gamma function or 
Euler’s integral of the second kind. The latter 
name ts based on the fact that for positive real 
x, we have 


rwi| ee Tae 


0 


This function satisfies the functional relation 
DI (x1) xI(x), 


and hence for positive integral x, we have 

I (x - 1)2 x!. C. F. Gauss denoted the function 
I'(x 4 1) by II(x) or x!, even when x is not a 
positive integer. The function x! is also called 
the factorial function. The gamma function 
can also be defined as the solution of the func- 
tional equation I (x+ 1) 2 xI (x) satisfying the 
conditions 


T(x+n) n 
T(n)n* 





I(1)21, lim 
Furthermore, we have 


1 = x 
= Cx 1+ — Sold 
loj xe I ( + j e 


This expression is known as the Weierstrass 
canonical form. Here C is Euler's constant 


1 1 
Cotim( 1454. esten] 
2 n 


ro 


=0.57721566490153286060651209 ..., 
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which is conjectured to be ftranscendental, 
but as yet even its irrationality remains un- 
proved. However, it is known, that if C were 
rational, the numerator and the denominator 
would be integers of more than 30,000 digits 
(R. P. Brent, 1980). 

The numerical value of C was calculated 
by Adams (1878) to 260 decimal places, and 
recently it has been calculated to more than 
20,000 decimal places by means of an elec- 
tronic computer. Seven thousand digits have 
been computed by W. A. Beyer and M. S. 
Waterman (Math. Comp., 28 (1974)), and 
20,000 digits have been computed by Brent 
et al. (1980). 

I (x) is tholomorphic on the complex x- 
plane except at the points x 20, —1, —2,..., 
where it has simple tpoles. When Rex >0, we 
have Hankel’s integral representation 


1 
r(x)= ——— | even 
2isinzx Jo 


x z integer, 





where the contour C lies in the complex plane 
cut along the positive real axis, starting at oo, 
going around the origin once counterclock- 
wise, and ending at oo again. 

Among various properties of this function 
(— Appendix A, Table 17.1), the following two 
formulas are especially useful for numerical 
calculations: Binet's formula 





E log2x 
logIT (x)= E logx—x+ 
S tan(t 
+2 | MAU D. Rex: 
o €e"—1l 


and the tasymptotic expansion formula that 
holds when |arg x| € (1/2) —ó (6 > 0), 


Log Za 
2 





1 
ier)» (35 )logx—x4 


m Aa Bs 
+ —— —ÀMÀ——E- 
à. 2n(2n — 1)x?"^! 


(Stirling’s formula), where the B, are ! Bernoulli 
numbers. This last formula can be rewritten as 


I(x*-1)2xl'—x*e *./2nx, 


which is used for large positive integers x. 
The integrals 


À Eé 
| e lige | put dt, 
0 À 


are known as the incomplete gamma functions 
and are used in statistics, the theory of molec- 
ular structure, and other fields. The texponen- 
tial integral and the terror function (— 167 
Functions of Confluent Type D) are special 
cases of the incomplete gamma function. 


Rex 0, 
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B. Polygamma Functions 


The derivatives of the logarithm of the gamma 
function are named the digamma function (or 
psi function) v (x) —- dlogI (x)/dx; the trigamma 
function V/(x); the tetragamma function y"(x); 
the pentagamma function (x), etc. These 
functions are called polygamma functions. In 
particular, w(x) is the solution of the functional 
equation 


VGorl)-v()-i/x ` Wilz —C, 
lim (f(x +n)— (1 +n))=0. 


C. The Beta Function 


Euler’s integral of the first kind 


1 
SEN (10-10 dt, 


0 


Rex>0, Rey>0, 


is called the beta function and is an analytic 
function of two variables x, y. This function is 
related to the gamma function as follows: 


gya O 


r(x+y) 
If the upper limit 1 in the integral is replaced 
by «, the result is called the incomplete beta 
function B,(x, y). 
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Gauss, Carl Friedrich 


Carl Friedrich Gauss (April 30, 1777— 
February 23, 1855) was born into a poor 
family in Braunschweig, Germany. From 
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childhood, Gauss showed genius in mathe- 
matics. He gained the favor of Grand Duke 
Wilhelm Ferdinand and under his sponsorship 
attended the University of Góttingen. In 1797, 
on proving the "fundamental theorem of alge- 
bra, he received his doctorate from the Univer- 
sity of Halle. From 1807 until his death, he 
was a professor and director of the Observa- 
tory at the University of Góttingen. 

On March 30, 1796, he made the discovery 
that it is possible to draw a 17-sided tregular 
polygon with ruler and compass, which moti- 
vated his decision to devote himself to mathe- 
matics. The publication of his Disquisitiones 
arithmeticae in 1801 opened an entirely new 
era in number theory. In pure mathematics, 
he did excellent research on tnon-Euclidean 
geometry, thypergeometric series, the theory of 
functions of a complex variable, and the whole 
theory of felliptic functions. 

In the field of applied mathematics he made 
outstanding contributions to astronomy, ge- 
odesy, and electromagnetism; he also studied 
the tmethod of least squares, the theory of 
surfaces (— 111 Differential Geometry of 
Curves and Surfaces), and the theory of *po- 
tential. He considered perfection in papers for 
publication to be of utmost importance; thus 
his published works are few relative to his 
amount of research. However, the scope of his 
work can be seen in his diary and letters, some 
of which are included in his complete works, 
comprising 12 volumes. He ts generally consid- 
ered the greatest mathematician of the first 
half of the 19th century. 
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Gaussian Processes 


A. Gaussian Systems 


A system X — (X,(o)| 4€ Aj of real-valued 
random variables on a probability space 
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(Q, B, P) is said to be Gaussian, or X is a Gauss- 
ian system, if any finite linear combination 

of elements X, of X is a ‘Gaussian random 
variable. 

Any subsystem of a Gaussian system is 
again a Gaussian system. In particular, the 
joint distribution of (X,, X5, ..., X,) for any 
finite subsystem {X,|1<j<n} of a Gaussian 
system X is a multidimensional Gaussian 
distribution. This distribution is supported on 
the whole of R" or on a hyperplane (at most 
(n — 1)-dimensional). Let m; be the expectation 
of X;, and let V —(V, ,) be the (positive definite) 
'covariance matrix of {X;} given by 


V,-EQX,-m)(X,—mj)j,  1«j Een 


Suppose that the rank of the matrix Vis r. 
Then the distribution of (X,, X, ..., X,) is 
concentrated on an r-dimensional hyperplane 
of R”: 


m+VR", mc-(m,m,,...,m,). 


When V is nondegenerate, that is, when r — n, 
the distribution is supported by the whole of 
R" and has a density function of the form 


(2n) "Py exp] äs mv e | 


where x —2(x,, X5, ..., x,)e R^, |V| and V! are 
the determinant and the inverse, respectively, 
of V, and where (x —m) denotes the (column) 
vector transpose to the (row) vector (x — m). 
The above expression is the general form of 
the density of an n-dimensional Gaussian 
distribution. The characteristic function q(z) of 
this distribution is given by 


q(z) - exp D z) us a| s 


z — (21,25, v5 Z JER; 


where ( , ) denotes the inner product on R”. 
This distribution is denoted by N (m, V). If, in 
particular, m=0 and if V is the identity matrix, 
then it 1s called the n-dimensional standard 
Gaussian distribution. 

For a general Gaussian system X= {X, | 
AeA} we are given the mean vector m; = 
E(X,), A€ A, and the covariance matrix V; „= 
E{(X,—m,)(X,—m,)}, 4, we A, which is posi- 
tive definite; for any n> 1, complex numbers 
94,05, ..., X, € C, and /,,/,,...,4,€A, we have 


n 
2, ex V; SH 
jk- 


1 

Conversely, given m}, 4€ A, and a positive 
definite V —(V, „|4, ue A), there exists a Gaus- 
sian system X = (X,|A€ A], the mean vector 
and the covariance matrix of which coin- 

cide with (m,) and V —(V, ,), respectively. If 
there exists another system X' with the same 
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property as X, then X and X' have the same 
distribution. 

À necessary and sufficient condition for the 
X; in a Gaussian system to be independent is 
that V, , —0 for every pair 2# u. For a Gauss- 
ian system X = ( X,|nz 1], the tconvergence 
in probability of the sequence {X,,} is equiv- 
alent to tconvergence in the mean square. 
The limit of the sequence in this case is again 
Gaussian in distribution. Also for this system, 
the *almost sure convergence of the sequence 
implies the convergence in mean square. 

There are many characterizations of Gauss- 
ian distributions and of Gaussian systems. 

(1) A necessary and sufficient condition for a 
distribution to be Gaussian is that cumulants 
(tsemi-invariants) y, of all orders exist and 
satisfy y, — 0 for all k > 3. (ii) Gaussian distri- 
butions have a self-reproducing property: If 
X and Y are independent Gaussian random 
variables, then their sum S= X + Y is also 
Gaussian. A converse to this property holds: If 
the sum is Gaussian, then, assuming that they 
are independent, X and Y are both Gaussian 
as well (P. Lévy, H Cramer). (iti) If X, and X, 
are independent and if Y, =aX,+5X, and 

Y; =cX,+dX, are also independent, then 
both X, and X, are Gaussian except for the 
trivial case b=c=0 or az d =Q. (iv) If for X 
and Y there exist U independent of X and V 
independent of Y satisfying Y=aX + U, X= 
bY+V for some constants a, b, then there are 
only three possibilities: (1) (X, Y) is Gaussian, 
(2) X and Y are independent, (3) there is an 
affine relation between X and Y. (v) Suppose 
that a distribution has a finite mean m and a 
density function of the form f(x —m). If the 
maximum likelihood estimate of the mean is 
always given by the arithmetic mean of the 
samples, then the distribution is Gaussian 

(C. F. Gauss). 

For any element X, of a Gaussian system X 
and for a subsystem X' of X, the conditional 
expectation E(X,/B’) is the orthogonal projec- 
tion of X, onto X’, where B' is the smallest ø- 
field with respect to which all the X, in X' are 
measurable and where X’ is the closed linear 
subspace of L?(Q, P) spanned by X’. 

A system X= (X,|A4e A], X,=(X},...,X9, 
of d-dimensional random variables is said 
to be Gaussian if the collection {Xj|ZeA, 

1 < j<d} is Gaussian. 


B. Complex Gaussian Systems 


Let Z be a complex-valued random variable 
with mean m, and denote it in the form Z = 

X +iY¥+m,i=,/—1, X, Y real. If X and Y 
are independent and have the same Gaussian 
distribution with zero mean, then Z is called a 
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complex Gaussian random variable. A system 
Z={Z,|AEA} of complex-valued random 
variables is said to be complex Gaussian if any 
finite linear combination X c;Z ap GE C, is 
complex Gaussian. A complex Gaussian sys- 
tem has properties similar to those of a Gaus- 
sian system discussed above. For instance, two 
complex Gaussian random variables are inde- 
pendent if and only if they are uncorrelated. 
Convergence properties are also similar. Fur- 
thermore, one can define complex Gaussian 
systems consisting of higher-dimensional ran- 
dom variables. 


C. Gaussian Processes 


A real-valued stochastic process (X, is called 
a Gaussian process or a normal process if it 
forms a Gaussian system. If {X,} is a complex 
Gaussian system, it is called a complex Gauss- 
ian process. The most important example 

of a Gaussian process is Brownian motion 
{B,|t>0} with the properties E(B,) 2 0 and 
E(B,— B)? «|t — s|. There are several gener- 
alizations of this process, among them: (i) 
‘Brownian motion with a d-dimensional time 
parameter, (ii) a Wiener process with d- 
dimensional time parameter, which is a Gaus- 
sian system (X, |a —(a,, ...,a,), all a;2 0) with 
E(X,)=0 and E(X, X,;) - IIjminíaj , aj), 

d (41. s d i= 1, 2. 

Since a multidimensional Gaussian distri- 
bution is completely determined by its tmean 
vector and the tcovariance matrix, a Gaussian 
process is strongly stationary if it is weakly 
stationary (— 395 Stationary Processes); and 
such a process is called a stationary Gaussian 
process. The mean value m and the spectral 
measure F(dA) are associated with a weakly 
stationary process. The measure F(d4) is sym- 
metric with respect to the origin since each X, 
is real-valued. Conversely, given such F(dA) 
and m, we can construct a real weakly station- 
ary process {X,} with mean value m and spec- 
tral measure F(d4). Generally, such a process 
{X,} is not determined uniquely; however, if 
{X,} is Gaussian, then there exists only one 
stationary Gaussian process with given m and 
F(d4). In view of this fact, stationary Gaussian 
processes can be regarded as being typical 
among weakly stationary processes. 

The discussions so far on stationary Gauss- 
ian processes are generalized to the case of 
stationary complex Gaussian processes, for 


which symmetry of F(d4) need not be assumed. 


The trandom measure ( M(A)] in the 'spectral 
decomposition of a complex Gaussian process 
{X,} again forms a complex Gaussian system. 
Weakly stationary Gaussian random distri- 
butions can also be introduced. 
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D. White Noise and Gaussian Random 
Distributions 


A real-valued weakly stationary process with 
discrete parameter is called a white noise if the 
mean m=0 and the tcovariance function p(t) 
= 1 (t=0); =0 (t #0). Obviously the tspectral 
measure F (dA) is the tLebesgue measure dA. 

In the continuous parameter case, a weakly 
stationary random distribution (— 407 Sto- 
chastic Processes C) is called a white noise if m 
— 0 and p=6 (6 is *Dirac's delta function: ó(q) 
= q(0)). The spectral measure is, therefore, 

the Lebesgue measure. In both cases Gauss- 
ian white noise {X,} or {X,} is determined 
uniquely. It has independent values at every 
point in the following sense: In the discrete 
parameter case, X, , X, ,..., X, are mutually 
independent for any mutually distinct t4, 

t5, ...,t,. For a continuous parameter Gauss- 
ian white noise, often called simply white 
noise if no confusion occurs, X, , X, ,.... X 
are mutually independent if the supports of 
Q1; 95, ..., 9, are disjoint. In the latter case 
Gaussian white noise is realized by taking the 
derivative of a Brownian motion. The !charac- 
teristic functional c(q) is given by 


1 
sto)-ew| Aer) 


where || || is the L"^(R!)-norm. 

Characteristic functionals of general Gauss- 
ian random distributions are expressed in the 
form 


Pn 


c(p) — exp | mto bie ol 


where m is the mean functional and K is the 
covariance functional. 


E. Representations of Gaussian Processes 


The family of tstochastic integrals X, = 

fa F(t.u)dB,, t>a(t>a if a= —oo), based 

on Brownian motion, defines a Gaussian 
process ( X,|tz aj). The converse problem 

is, however, not obvious. Given a Gaussian 
process {X,|t>a} with E(X,)=0, the problem 
is to find a Brownian motion {B,} and a kernel 
F that give a representation of the above form 
(P. Lévy [10]). A more specific problem dis- 
cussed below is important. If, in addition, the 
representation is formed in such a way that 
B,(X) — B,(B) holds for every t, then the repre- 
sentation is called canonical, where B,(X) is the 
smallest c-field with respect to which all the 
X,,s«t, are measurable. The canonical repre- 
sentation, if it exists, is unique up to equiva- 
lence, i.e., EI, vil is unique. The existence of 
the canonical representation is, however, not 
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always guaranteed. Mean continuous, zero 
mean, purely nondeterministic stationary 
Gaussian processes have tbackward moving 
average representations, which are nothing but 
the canonical representations. Once the canon- 
ical representation of ( X,) is obtained, the 
kernel, together with known properties of {B,}, 
tells us important properties of the given pro- 
cess, e.g., sample function properties, Markov 
properties, and so forth. 

Let {X,|t>a} be a general Gaussian process 
with E(X,)=0, and let M,(X) be the subspace 
of L?(Q, P) spanned by the X,, s<t. Assume 
that (i) ( X, is separable, Le, M „ (X) is sepa- 
rable and (ii) ( X,) is purely nondeterministic 
in the sense that (\,M,(X)= {0}. Then X, is 
expressed as a sum of stochastic integrals: 


X,-Y | F(t, u) dBi, 
J Ja 


where the {Bi}, j= 1,2, ..., N (N can be oo), 
are mutually independent tadditive Gaussian 
processes. In addition, the B/ can be taken in 
such a way that the measures dv/(u) = E(dB/)?, 
j=1,2,...,N is decreasing in j (Le., dv! >> 

dv? >>...) and that B,(X) — V; B,(B/) holds for 
every t. Such a representation is called a gen- 
eralized canonical representation of ( X, [3]. It 
always exists under the above assumptions (i) 
and (ii), although it is not unique when N z 2. 
The number N, called the multiplicity of {X,}, 
is independent of the choice of a generalized 
canonical representation. It is a good question 
to ask when {X,} has a simple unit multiplicity 
or when {X,} has the canonical representation. 
No interesting answer to this question has 
been given so far except for stationary Gauss- 
isn processes. 


F. Gaussian Markov Properties 


A simple Markov Gaussian process ( X,|t >a} 
has the canonical representation if it is sepa- 
rable and purely nondeterministic and if its 
covariance function never vanishes. It is of the 
form 


X, -fo| g(u)dB(u), 


t 


f(t) 0, | ow duo for t»a. 


a 


A generalization of the simple Markov prop- 
erty of a Gaussian process is given in the 
following manner (suggested by P. Lévy [10]). 
If the tconditional expectations E (X, /B.(X)), 
for any choice of distinct t,,t,,...,tysj>t 

(j Z0) span an N-dimensional subspace of 
L?(Q, P), then the process ( X,) is called an N- 
ple Markov Gaussian process. If the canonical 
representation of an N-ple Markov Gaussian 
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process exists, then it is expressed in the form 
N t 

Ass A sof g(u) dB, 
j-1 a 


where det( f,(t;)) never vanishes for any choice 
of distinct t; (>a), 1 «ix N, and where the 

g; are linearly independent vectors in L?([a, t]) 
for any t a. Conversely, a Gaussian process 
given by the above expression is N-ple Mar- 
kov. A stationary N-ple Markov Gaussian 
process has a 'spectral density function f(A) of 
a specific form, namely, it is expressed in the 
form [3] 


f(2) -1Q(A)/PGA)^; P, Q polynomials, 
degP=N and degQ <N 


(rational spectral density function). Y. Okabe 
[14] proved that the roots of P(x)=0 are all 
real, and introduced a multiple Markov prop- 
erty of a stationary Gaussian process {X,} in 
a much wider sense to prove that {X,} enjoys 
this property if and only if it has a rational 
spectral density. 

A somewhat restricted definition of multiple 
Markov properties for a Gaussian process 
uses differential operators. Assume that X, is 
(N — 1)-times differentiable with respect to the 
L?(Q, P)-norm. If there is an Nth order dif- 
ferential operator L= X o a,(t) D" * with 
D — d/dt such that 


LX,=B,,  B,—dB,/dt white noise, 


then {X,} is called an N-ple Markov Gaussian 
process in the restricted sense. Such a process is 
naturally N-ple Markov, and the canonical 
representation always exists. The kernel of the 
representation is the *Riemann function of the 
differential operator L that was used in the 
definition. The spectral density function corre- 
sponding to this process is of restricted form, 
namely, it is expressed in the form 1/| P(iA)|?, 
where P is a polynomial of degree N, and 
P(i4) —0 has no root in the lower half-plane. 
Many attempts have been made to define 
a Markov property of a Gaussian random 
field, namely, a Gaussian system with a multi- 
dimensional time parameter; however, only 
two significant approaches are mentioned here. 
Let (X,|ae R?! be a Gaussian random field 
with d-dimensional time parameter. H. P. 
McKean [12] gave a Markov property in the 
following manner. Let A, be the o-field gen- 
erated by the X,, xe U, U(c R^) open. For 
a closed set C, we set F-=(), Fc, where C, is 
the e-neighbourhood of C. If, for any open 
set U, Fy and Fy: become independent under 
the assumption that Fy is known, then {X,} 
is called a Markov Gaussian random field in 
the McKean sense. It has been proved that 
a Brownian motion with d-dimensional time 
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parameter is Markov in this sense if d is odd, 
while it is not Markov if d is even. Further 
detailed investigations of Markov properties 
for Gaussian random fields have been given 
by L. Pitt, S. Kotani, Y. Okabe [17], and 
others. 

Another definition of Markov properties 
of Gaussian trandom fields, in fact those of 
Gaussian random distributions, has been given 
by E. Nelson [13] in connection with Eucli- 
dean fields (invariant under Euclidean mo- 
tion) in quantum theory. Let X, be a Gauss- 
ian random field. For a smooth surface I in 
Ri let 7, be the c-field generated by X4, D= 
V G ó(n), Y being a smooth function with 
compact support and n being normal to T. If 
Fp and 4p. become independent under the 
assumption that Fp, I = 0D, is given, then 
(X, is called a Markov Gaussian random 
field in the Nelson sense. Given a Gaussian 
Euclidean field which is Markov in the Nelson 
sense, then under some reasonable assump- 
tions, one can form a field (in the sense of 
tquantum field theory) satisfying the *Wight- 
man axiom by changing imaginary time to 
real time. 


G. Sample Function Properties 


Some finer results about the smoothness of 
sample functions have been obtained for station- 
ary Gaussian processes. G. A. Hunt proved that 
if the spectral measure F(dA) of a stationary 
Gaussian process ( X,) has finite moment of 
order o, then almost all its sample functions 
satisfy the following *Hólder condition of 
order « [5]: For every finite interval J and 
every constant C, there exists a sufficiently 
small hy = Dell, C) such that 


(Kn Xi « Cl h (log t/|h]) 


for any tel and |h| <ho. The following result is 
due to Yu. K. Belyaev: Sample functions of a 
stationary Gaussian process are with proba- 
bility 1 either continuous or unbounded on 
every finite interval. 

Other conditions for continuity of sample 
functions are given in terms of the fcovariance 
function. Let v(s, t) be the covariance func- 
tion of a Gaussian process ( X,|te[0, 1]}. If v 
satisfies 


Iris, t) —v(s5, t)| &e| s, — s21" 


mae, v(0,s)=0, s,,s,€[0,1]), 
then 
X, — X, 
d limsup e e oe. A T 
ao, -elsalti — talt |log|t, — tall 


«aceti - L 
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where c, is a constant. This result is due to Z. 
Ciesielskt. 

Conditions for continuity of sample func- 
tions have also been given for Gaussian pro- 
cesses with multidimensional time parameters. 
For {X,|te[0, 1]"}, set dis, t)= E[(X,— XY T7. 
If there exists a function @ monotone increas- 
ing on some interval 0 « u «a such that 


ds, t) € e(]s — t |l), 


where || || is the metric in R", and such that 


too 
| vie "dx « oo, 


then almost all sample functions are continu- 
ous (X. Fernique [2]). Conversely, if there 
exists a monotone increasing function ¢ satis- 
fying d(s, t)= o(|s — t|) and if sample functions 
are continuous with positive probability, then 
the above integral should be finite. 

Suppose that ( X,) is a stationary Gaussian 
field. R. M. Dudley [11] proved that if there 
exists a number q > 1 and a neighborhood 
V=V(0) in R” such that 


Sq *(log N(V,q WD < co, 
k 


where N (V, £) is the minimum number of £- 
balls (relative to the metric d) that cover V, 
then almost all sample fields of ( X,) are con- 
tinuous, and Fernique [18] proved that the 
converse statement is also true. Thus the 
problem of continuity of the sample fields of 
stationary Gaussian fields has been completely 
solved. 


H. Gaussian Measures 


Let (X, te T), T an interval, be a Gaussian 
process. It gives a probability distribution on 
the measurable space (RI. 2), where 4 is the 
c-field of Borel subsets of RI. Such a mea- 
sure P is called a Gaussian measure. Let Di. 
i— 1, 2, be Gaussian measures induced by 
Gaussian processes {X}, te T], i=1, 2, respec- 
tively. Then for these measures concepts such 
as tabsolute continuity, fequivalence, and 
'singularity are introduced in the usual man- 
ner. The most significant property for Gauss- 
ian measures is that two measures P! and P? 
are either equivalent or singular. A powerful 
criterion for testing this dichotomy is the 
tentropy of P? with respect to P!, given by 


P?(A)) 
P'(A)J' 





H(P?|P*)= sup, {Pr adtog 


where the sup is taken over all finite parti- 
tions a= {A,,...,A,}, A;eB, | ) AZ RT (Yu. 
A. Rozanov [15]). The measures P! and P? 
are equivalent (resp. singular) if and only if 
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H(P?|P')< (resp. = oo). This statement can 
be paraphrased in terms of the Hilbert spaces 
H(X), i— 1, 2, spanned by {X$}, i— 1, 2, respec- 
tively. For simplicity, assume that E(X/)=0. 
Then, P! and P? are equivalent if and only if 
the following two conditions hold: (i) A map- 
ping A determined by AX} = X? is extended 
to an invertible linear transformation from 
H(X?) to H(X”); (ii) There exists a symmetric 
operator B of tHilbert-Schmidt type such that 
(Ag, An); — (U — B)5. n), where (č, n), — E(čn) is 
the inner product in H(X’). Next, suppose that 
{Xt}, i— 1, 2, are not centered. Set E(X} 2 mj, 
i— 1, 2. Then P! and P? are equivalent if and 
only if (Xi) with Xi = Xi — mi satisfy the fol- 
lowing condition: (iii) in addition to conditions 
(i) and (11) above, there exists an element č in 
H(X') such that m] —m? = E[E,X/], te T. 
Given equivalent Gaussian measures P! 
and P? on (R7, 4), T=[0, to], it is in general 
not easy to obtain the fRadon-Nikodym de- 
rivative; however, if one of them, say Pt, is the 
"Wiener measure, then one can obtain o = 
dP! /dP? explicitly (M. Hitsuda [4]). Assume 
that P? is derived from {X,}. By assumption, 
X, has the tcanonical representation 


x-5-| i SEHR te T, 
OD o 


where {B,! is the *Brownian motion from 
which P! is assumed to be derived and where 
le L'(T?), ae L'(T). With this notation, we can 
write 


el |" d l(s, EEN dB, 
1 fto s 2 
4l (| EEN d 
2Jo NJo 


This result yields a remark about general 
Gaussian measures: If P! and P? are equiva- 
lent, {X},te T} has a canonical representation 
if and only if ( X?, te T] does. 


I. Nonlinear Functionals 


Start with nonlinear functionals of a Brownian 
motion {B,,teR'}, and call them Brownian 
functionals. If {¢,} is a complete orthonormal 
system in L*(R'), then the collection (X, = 

|o, (u)dB,] of tstochastic integrals forms a 
Gaussian system of independent standard 
Gaussian random variables. A Fourier- 
Hermite polynomial based on {¢,} is a poly- 
nomial in X, expressible as a finite product in 
the form 


ch) H,(G/A/2), c à constant. 
J 


The sum n= È n; is the degree of this polyno- 
mial. If n — 0, its mean is zero and the variance 





672 


is [e]? IL (n;! 2"). Taking c to be [],(n! 2") 78. 
the collection of all Fourier-Hermite polyno- 
mials based on o, forms a complete ortho- 
normal system in the Hilbert space (L?) con- 
sisting of all Brownian functionals with finite 
variance. Denote by #, the subspace of (L?) 
spanned by all the Fourier-Hermite polyno- 
mials of degree n. Then the Hilbert space (L?) 
admits a direct-sum decomposition: 


(L’)= D Hw 
n=0 


which is called the Wiener-Itô decomposition 
[7]. The subspace #%, is eventually indepen- 
dent of the choice of a complete orthonormal 
system (9,]. A member X of #%, is referred to 
as a multiple Wiener integral of degree n and is 
expressed in the form 


x- |. [sos spat, dB, dB, 
R” 


where f is a symmetric L?(R")-function. In ad- 
dition, E(X?)=n!||f\|?, || || being the L?(R")- 
norm. 

Brownian functionals can be expressed as 
functionals of white noise, so that the Hilbert 
space L?(y) is viewed as a realization of (L?), 
where u is the probability distribution of the 
Gaussian white noise introduced in Section D. 

Nonlinear functionals of a general Gaussian 
process can be dealt with in a similar but 
somewhat more complicated manner. If the 
process has a canonical representation, then 
the functionals can easily be rewritten as 
Brownian functionals. 


J. Applications to Prediction Theory 


If a Gaussian process {X,|t >a} has the canon- 
ical representation 


x-| F(t,u)dB,, 
then the best tpredictor E(X,/B,(X)), s « t, is 
given by 


d 
| F(t, u)aB,. 
a 
This is in agreement with the best "near 
predictor. 

No systematic approach for nonlinear pre- 
diction theory has been established so far. We 
illustrate this theory only in a typical case that 
arises from a stationary Gaussian process. Let 
{X,} be a real-valued stationary Gaussian 
process. Without loss of generality, we can as- 
sume that E(X,) 2 0. We consider the continu- 
ous parameter case, since the discrete param- 
eter case is easier and can be inferred from it. 

Let M,(X) be the subspace of L?(Q, P) de- 
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fined in the same way as in Section E, and let 
L, = L?(Q, B,(X)). If the process is purely non- 
deterministic in the sense that (, M;(X)— {0}, 
then X, has the backward moving average 
representation 


X,- | — fü—sB, 


where {B,} is a Brownian motion. Further- 
more, this representation is canonical, Le, 
B,(X)=B,(B). Since B is assumed to be \/,B,, 
every Y in (L?) can be expressed in the form 


Y= X ] Si | fof S1. 82; 
p=0 J —o — o0 4J —o0 


85) dB, dB, ...dB, , 


in terms of multiple Wiener integrals. The 
predictor of Y based on the known values 

X,, 5x0, that is, the projection of Y on Lg is 
given by truncating the domain of the integral 


o0 0 S3 $5 
KS | Si | fp 51.52, 
p=0 J -o —o0 4 —o0 


<-s $5) GB, dB, ... dB, . 


K. Extrapolation and Interpolation 


Extrapolation of a stationary Gaussian pro- 
cess is used to find the best estimate of the 
unknown values of a given process when we 
have been given the only some of the past 
values of the process. M. G. Kreín [8] ob- 
tained some results by using the theory of the 
inverse spectral problem discussed by I. M. 
Gel'fand and B. Levitan. 

In contrast to extrapolation is the interpola- 
tion of a stationary Gaussian process. There 
is an important contribution to this prob- 
lem by H. Dym and H. P. McKean [1] who 
developed the Krein theory by applying the 
de Branges method to the theory of entire 
functions. 
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A. General Remarks 


A power series g(t) — 2, ,a,t" in t that con- 
verges in a certain neighborhood of t=0 de- 
fines the sequence of numbers {a,}. The func- 
tion g(t) is called the generating function of 
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the sequence. Similarly, the series K(x, t) -- 

X oft" that is convergent for x and t 

in a certain domain in (x, t)-space is called the 
generating function of the sequence of func- 
tions { f,(x)}. Sometimes the function g(t) — 
Dr o(a,/nl)t" is called the exponential generat- 
ing function of the sequence Lo, For exam- 
ple, the generating functions of the tbinomial 
coefficients and tLegendre polynomials 

are (1 +4)" and (1—2tx +t?) !, respectively. 
When a generating function of {a,' or { f,(x)} 
is given, we can obtain a, or f(x) by integral 
expressions; for example, in the latter case we 
have 


K(x, t) 
dt, 
Aal ~ 2ni xil pu 


where the contour C is a sufficiently small 
circle going counterclockwise around the 





origin. A generating function can be continued 


beyond the domain of convergence of the 
power series. Simple generating functions are 
known for many important orthogonal sys- 
tems of functions (— 317 Orthogonal Func- 
tions). Generating functions are widely used 
because they enable us to derive analytically 
the properties of sequences of numbers or 
functions. For a system of numbers or func- 
tions depending on a continuous parameter 
instead of the integral parameter n, we define 
the generating function in the form of a tLa- 
place or * Fourier transform. In particular, 
for a probability ‘distribution function F(x), 
the exponential generating function f(t) — 
fee dF(x) for the moments {a,} of F(x) 
is called the moment-generating function of 


F(x). There is another kind of generating func- 


tion for the sequence a,:2,a,n ? in the form 
of tDirichlet series; it is frequently used in 
number-theoretic problems. 


B. Bernoulli Polynomials 


A system of polynomials 


B) - Y H DER 
k=0 


is defined by the generating function 


B,(x) is called the Bernoulli polynomial of 
degree n. Since B,(0) is the coefficient of 
t*/k! in 


t Se t” 
= Y, Bum, 
1 2 ( P 


e — 


te xe. 








we have Bo(0)= 1, B,(0) — 


nz]. The nth Bernoulli number B, (— Ap- 











—1/2, B,(0) — 1/6, ...; 
B,,4,(0)=0 for nz 1; and (—1)""' B,,(0) » 0 for 
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pendix B, Table 3) is usually defined by |B,,(0)]. 
Sometimes other definitions, such as B, — B,(0) 
or B, — B;,(0), are used. Bernoulli polynomials 
satisfy the relations 

B,(x- 1) B,(x) 2 nx"^!, 
dB,(xy/dx E nB, Ch 


which are used in finterpolation. For example, 
a polynomial solution of the tdifference equa- 
tion UNA f(x) 2 i-o a,x" is given by 
f(x) = X29 a, Bn GO/(n + 1) 4- (arbitrary con- 
stant). In ipu we have 1"--2" E... p^— 
(B, (p 1) — Bri (Dnt 1). 


C. Euler Polynomials 
A system of polynomials 


Eo Y MIT 


is defined by the generating function 


20. e 
=}, E, 


e el n=0 


TM 


We call E,(x) the Euler polynomial of degree n. 
Here a, is defined by a, = E,(0) and 


SR 


e =, n=0 


t" 


so that we have a; —1, a, = —1/2, a4—1/4, ...; 
a>, =0 for nz 1. The nth Euler number is 
sometimes taken as a,, but more often it is 
defined by 


n n 
E,—(—1) 2^ 
EE (s. 


i.e., by 





oo E, ‘ 
E 1) S x 

(— Appendix B, Table 3). All the E, are inte- 
gers, E,,,4;=0(m=0,1,...) and Ej,» 0 (m= 
0,1, ... yk in the decimal expressions for E, the 
last digit is 5 for E (mz 1) and 1 for E4m+2 
(m 2 0). Sometimes E;, is denoted by E,. We 
have the relations 





E,(x) + E,(x + 1) 2 2x", 
E,(1 x) - ( CI E(x), 


dE,(x) 


dx = nEn- (x), 
and in particular, 
—1" +2" —3"+4"—...+(—1} p" 


=((—1) E,(p 1)— E„(1))/2. 


675 


D. Application to Combinatorics 


Let p(n) denote the number of ways of dividing 
n similar objects into nonempty class. This 

is called the number of partitions of n. Euler 
noticed that the following formula is valid for 
the generating function of p(n): 


t+ Š porst| flan] 


whence we obtain p(n) = p(n—1)+ p(n —2)— 
p(n—5)—p(n—7) + p(n— 12) + p(n—15)— 
Die (- 1)" [p(n— (3k? — k)/2) + p(n — (3k? + 
k)/2)] (— 328 Partitions of Numbers). Let 
B(n) denote the number of ways of dividing n 
completely dissimilar objects into nonempty 
classes. These are called Bell numbers. The 
first few are 1, 1, 2, 5, 15, 52, 203, 877, 4140, 
21147, ... [2]. For this sequence with B(0)- 1, 
the egent generating function 3. Za B(n)x" 
does not converge except at x 2 0. However, 
the exponential generating function g(x)= 

2o B(n)(n!)! x" is convergent for all complex 
numbers x and is equal to e? !. The differen- 
tial equation g' — e*g gives rise to a recursive 
formula 


B(n+1)= EC Jan 
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A. General Remarks 


In the study of global differential geometry, 
geodesics play an essential role because the 
behavior of geodesics on a complete Riemann- 
ian m-manifold (mz 2) M without boundary 
heavily influences its topological structure. A 
significant merit of using geodesics is that one 
can make elementary and intuitive observa- 
tions on M (as in Euclidean geometry), which 
often yields fruitful results. 

Riemannian structure induces in a natural 
way a distance function d on M, and hence M 
is a metric space. A smooth curve y of constant 
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speed is by definition a geodesic if and only if 
every point on y is contained in a nontrivial 
subarc whose length realizes the distance be- 
tween its endpoints. M is equipped with the 
Levi-Civita connection V (— 80 Connections) 
by means of which a geodesic is characterized 
as an autoparallel curve. In local coordinates, 
y is obtained as a solution of an ordinary 
differential equation of order two (— 80 Con- 
nections L). It follows from the properties of 
solutions of the equation for geodesics that 
every point pe M has a neighborhood in which 
every two points can be joined by a unique 
shortest geodesic that depends smoothly on 
both of the endpoints. A classical result of J. 
H. C. Whitehead (Quart. J. Math., 3 (1932)) 
states that every point pe M has a metric ball 
B centered at p such that every two points in B 
are joined by a unique shortest geodesic whose 
image is contained in B. Let M, be the tangent 
space to M at p, and let M, c M, be the set 

of all vectors ve M, such that the solution of 
geodesic y with initial condition 7(0) — p, 5(0) = 
v is well defined on [0,1]. M p contains an 
open neighborhood of the origin and is fstar- 
shaped with respect to the origin. The ex- 
ponential mapping exp,: M, M is defined to 
be exp, v=y(1). This mapping exp, is smooth 
and has the maximal rank at the origin. Small 
balls around p are obtained as the image 
under the exponential mapping of the corre- 
sponding balls in M, centered at the origin, 
and the restriction of exp, to these balls are 
diffeomorphisms. Thus the topology of M asa 
metric space is equivalent to the original one 
of M. The fundamenta! Hopf-Rinow theorem 
(Comment. Math. Helv., 3 (1931)) states that (1) 
M is complete as a metric space if and only if 
M,=M, for some pe M; (2) M is complete if 
and only if every closed metric ball is compact; 
and (3) if M is complete, then every two points 
can be joined by a shortest geodesic, namely, a 
geodesic with length realizing the distance 
between the endpoints. 

In the following discussion M is assumed to 
be connected, complete, and without boundary. 
Let »:[0, 1] — M be a geodesic. A piecewise 
smooth 1-parameter variation V along y is a 
continuous mapping V:[0,1] x(—5,?)—5 M 
with a finite partition 02 to «t, «€ ...«t,—1 
such that V |[t;, ti+1] x (—é, £) is smooth and 
V(t, 0) — y(t) for all te[0, 1]. Then the first and 
second variation formulas for V are 


L'(0)=<Y, 35 lo/L() 


and 


L'O)- > li (Y - CRG IS Y))dt 


+ Maa Y, ý> re t Vue 
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respectively, where Y;(t) 2 OV(t, s)/0s| -o for 
te[t; ,,t;] is the variation vector field as- 
sociated with V|[t; ,,t;] x (—6&, €), L(s) is the 
length of the variation curve t V(t, s), and 
Y; = V; Y; and R is the curvature tensor. 

A smooth vector field J along a geodesic 
y:[0,1]—>M is called a Jacobi field if and only 
if it satisfies J” + R(J, ))) =0. The set of all 
Jacobi fields along y forms a vector space iso- 
morphic to Mo) X Myo by the natural corre- 
spondence J (J (0), J'(0)). Every Jacobi field 
is associated with a 1-parameter geodesic varia- 
tion V along y. Namely, every variation curve 
of V is a geodesic, V is smooth, and J(t)= 
OV(t, s)/6s]| s-o for all te[0, 1]. Conversely, 
the variation vector field of any 1-parameter 
geodesic variation V along y is a Jacobi field. 
Especially, if A is a tangent vector to M, at 
2001, then d(exp,);(o, A — J(1), where J is the 
Jacobi field associated with the 1-parameter 
geodesic variation V(t, s) - exp, t(7(0) 4- 5A), 
and A is identified under the canonical parallel 
translation in M,. A point y(f9) is said to be 
conjugate to p=7(0) along y if and only if there 
exists a nontrivial Jacobi field along y that 
vanishes at 0 and tọ. A theorem of Morse and 
Schoenberg states that if the sectional curva- 
ture K of M satisfies 0< A < K < B< oo, then 
every unit speed geodesic y:[0,7] — M has a 
point conjugate to (0) if I2n/ / A and has no 
point conjugate to y(0) if 1< n/ / B . Especially, 
every geodesic y on M with K <0 contains no 
conjugate point on it, and exp, is locally regu- 
lar for any pe M. Thus a complete and simply 
connected M with K <0 is diffeomorphic to 
R" (Hadamard and Cartan). 

Let y be a unit speed geodesic with y(0) — p. 
If t, >0 satisfies d(p, y(t))=t for all te [0, t4) 
and d(p, y(t)) « t for all t t;, then the point 
y(t,) is called a cut point to p along y. It ap- 
pears no later than the first conjugate point to 
p along y. The set C(p) of all cut points to p is 
called the cut locus of p. Let U be the set of 
vectors tve M,, 0<t<1, where exp, is the cut 
point to p along y(t)=exp, tv. U is a nonempty 
open set and exp, | U: U M — C(p) is a diffeo- 
morphism, where M — C(p) is diffeomorphic 
to an m-disk. The cut locus possesses essential 
information on the topology of M. 

The metric comparison theorems of H. 
Rauch (Ann. Math., (2) 54 (1951)) and M. Ber- 
ger (Illinois J. Math., 6 (1962)) are stated as 
follows: Let inf,, K =d > —oo and sup, K =A < 
oo, and let y:[0, 1] —^ M be a nontrivial geo- 
desic. Let M(c) be a complete and simply 
connected space form of constant curvature 
c. Fix geodesics y5, y4:[0, 1] ^ M (ô), M(A), re- 
spectively, with the same speed as y. If J, J;, 
and J, are Jacobi fields along y, ys, and y4 
respectively such that J(0) 2 J,(0) — J4(0) 20 
and |J'(0)/| = |J5(0)]| = ||J4(0)]|, then ||JA(0)]| < 
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i JG) < |[Js(t)|| holds for all te[0, t4], where 
Yalta) is the first conjugate point to y4(0) along 
74 (Rauch). If the Jacobi fields satisfy ||J(0)]| = 
I ¥5(0)|| = IJA(0)]] and J'(0) = J5(0) = J4(0) =0, 
then |J4(t)| € JON < || J5(t)|| holds for all 
te[0, ti], where rt, is the first zero point of 
J, (Berger). They are often applied to compare 
curve lengths as follows. Let c: I — R" be a 
piecewise smooth curve with ||c(t)|| t4. Fix 
points pe M, p€ M (ô), and pae M(A) with 
fixed isometric identifications of the tan- 
gent spaces with R”. Then L(exp, oc)« 
L(exp,oc)« L(exp,, oc). If P, P;, and P, are 
unit parallel fields along corresponding geo- 
desics c, 65, 04:1 — M, M(Ó), M(A), respec- 
tively, which have the same speed, and if 
(P,65 CP 65> =< Py, 045, then for every 
piecewise smooth function f:1—[0, 14] the 
curves u(s) — exp, f (s) P(s) and u;(s), u(s) de- 
fined in the same way on M (ô), M (A), respec- 
tively, have lengths L(u4) < L(u) < L(u;). They 
play important roles in the theory of geodesics. 

A geodesic triangle is a triple (yo, Yı, y;) of 
shortest geodesics (for convenience they are 
parametrized on [0, 1]) such that y;(1)= y;.,(0) 
for all i=0, 1, 2 with mod 3. The vertices po, 
Di. p; and the angles 05, 0,, 0, of a geodesic 
triangle (yo, 7, 75) are defined by p;,, =y;(0) 
and 0;,, «cos ! (Oji(0), — Yi- (D/P: lP- I}, 
where angles are always taken in TO. x]. The 
triangle comparison theorem of Toponogov 
(Amer. Math. Soc. Transl., 37 (1964)) is stated | 
as follows. Let K 2ó- —oo be satisfied on 
M. For any geodesic triangle (yo, y1, y;) on M 
there is a geodesic triangle (75, 7,, 7) on M (ô) 
such that L(},)= L(y,) and the angles d. of this 
triangle satisfy 0, « 0; for i=O, 1, 2. It turns 
out that if 6>0 then the circumference of 
any geodesic triangle on M does not exceed . 
Zei, /à. 

For details of the basic facts stated above we 
refer the reader to [1-3]. 


B. Curvature and Fundamental Groups 


The fundamental group x,(M) of M(— 170 
Fundamental Groups) is influenced by the 
curvature of M. A basic idea, going back to 
Hadamard (J. Math. Pures Appl., 4 (1898)) 
and Cartan, states that every nontrivial "free 
homotopy class óez,(M) of loops on a com- 
pact M contains a closed geodesic of minimal 
length whose preimage under the covering 
projection z: M — M in the universal Riemann- 
ian covering M is either closed (when ô is of 
finite order) or a straight line (when ó is of 
infinite order) that 1s translated along itself by 
the "deck transformation 6. 

(1) By using the second variational formula, 
an even-dimensional compact M with K >0 
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either is simply connected or has z,(M)=Z, 
(Synge, Quart. J. Math., 7 (1936)). A beautiful 
result of Myers (Duke Math. J., 8 (1941)) states 
that if the Ricci curvature Ric of M is bounded 
below by 6>0, then the diameter d(M) is not 
greater than GE and hence M is compact 
and z,(M) is finite. It is proved in [3] that if 
K 20, then there exists a finite normal sub- 
group ® of z,(M) such that z,(M)/Ó is a 
Bieberbach group (— 92 Crystallographic 
Groups). The splitting theorem (— Section 
F(2)) is used in the proof. 

(2) When K <0 is satisfied on M, M is dif- 
feomorphic to R”, and hence z;(M)- 0 for all 
iZ 2. A basic tool used in the study of z,(M) in 
this case is that the displacement function X» 
d(x, 6(X)), Xe M, of every isometry ô on M is 
convex [4], where a function on M is said 
to be convex if and only if its restriction to 
every geodesic is convex. A classical result of 
Preissmann (Comment. Math. Helv., 15 (1943)) 
states that every nontrivial Abelian subgroup 
of z,(M) of a compact M with K «0 is an 
infinite cyclic subgroup. It is proved in [4] that 
if K <0 holds on M and if every deck trans- 
formation on M translates some geodesic 
along itself, then z,(M) is a disjoint union of 
infinite cyclic subgroups, and any two com- 
muting elements belong to the same cyclic 
subgroup. Moreover, if M is compact and 
K <0 and if 1, (M) is a direct product TI, x I, 
such that z,(M) is tcenterless, then M is iso- 
metric to the Riemannian product M, x M,, 
and z,(M;) - T; holds for i— 1, 2 [5,6]. It is 
shown in [7] that the fundamental group of 
M with K <0 occurs as that of an (m+ 1)- 
dimensional M’ with K x c <0, which is diffeo- 
morphic to M x R. The warped product [4] 
is used to construct such a metric on M x R. 


C. Cut Locus 


The injectivity radius i(M) of M is defined to 
be the infimum of the continuous function 
x—d(x, C(x)), xe M. As is seen in Section D, 
the estimate of injectivity radii provides many 
fruitful results on the topology of Riemannian 
manifolds. It follows from Synge's result that 
an even dimensional compact and orientable 
M with 0<K <1 has its injectivity radius 
i(M)Z n [3]. However, such an estimate can- 
not be obtained in odd dimensions. The ex- 
amples discussed in [8] show that there are 
infinitely many homotopically distinct homo- 
geneous Riemannian 7-manifolds of posi- 

tive curvature. The injectivity radii of such 
examples are estimated precisely in [9], ac- 
cording to which there is no positive lower 
bound for them. The sphere theorem of Kling- 
enberg (Comment. Math. Helv., 35 (1961)) 
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states that if M is compact and simply con- 
nected and if 1/4 « K <1, then i(M) 2 x, and 
M is a topological sphere. This extends the 
pioneering work by H. Rauch (Ann. Math., (2) 
54 (1951)). A rigidity theorem of M. Berger 
(Ann. Scuola Norm. Sup. Pisa, 14 (1962)) states 
that if dim M is even, M is simply connected, 
and 1/4 K <1, then M is homeomorphic to 
S" (if d(M) > 1) or isometric to one of the sym- 
metric spaces of compact type of rank 1 (if 
d(M)=72). A slight generalization of the trian- 
gle comparison theorem is used to prove that 
for given positive constants d, V, and S, there 
exists a constant C,,(d, V, S) - 0 such that if K, 
d(M) and the volume v(M) satisfy |K| « S, 
d(M) «d, and v(M) V, then i(M)Z C, (d, V, S) 
[10]. 


D. Finiteness Theorem 


Finiteness theorems are a natural extension 

of the sphere theorem. They provide a priori 
estimates for the number of various topolog- 
ical types of manifolds (for instance, homology, 
homotopy, homeomorphism, diffeomorphism 
types, etc.), which admit certain classes of 
Riemannian metrics characterized by geo- 
metric quantities. The basic idea of the esti- 
mates is to find a constant c>0 that depends 
only on geometric information which char- 
acterizes the class of manifolds so that if M be- 
longs to the class, then i(M) >c. Then a number 
N is found from the information given a priori 
such that every element M of the class has an 
open cover of at most N balls whose radii are 
all less than c. It is proved in [11] that for 
given Ae, 1) and m, there exists a number 
N(6,m) such that there are at most N (ô, m) 
homotopy types for the class of all simply 
connected 2m-dimensional manifolds with 

ó« K «1. Furthermore, for given positive 
numbers d, V, and S, there are at most finitely 
many homeomorphism (or diffeomorphism) 
types for the class of all m-manifolds M, each 
of which has the property that d(M) «d, v(M) 
> V, and |K| « S [10]. This result is applied to 
obtain a result in [12], which states that there 
exists for given m, VO and an e>0 such that 
if M is compact, -1 —£« K < —1, and v(M)> 
V, then M admits a metric of constant nega- 
tive curvature. 

For an £ £0, M is said to be e-flat if and only 
if sup, K  d(M)! «c is satisfied for M. Every 
compact flat manifold is ¢-flat for all £20. M is 
called a nilmanifold if and only if it admits a 
transitive action of a tnilpotent Lie group. It 
is shown in [13] that for a compact M with 
dim M =m, there exists a number e(m) 0 such 
that if M is e(m)-flat, then (1) there is a maxi- 
mal nilpotent normal divisor N c z,(M), (2) 
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the order of z,(M)/N is bounded by a constant 
which depends only on m, and (3) the finite 
cover of M which corresponds to N is diffeo- 
morphic to a nilmanifold. Moreover, if M is 
e(m)-flat then M is diffeomorphic to R”. If M is 
e(m)-flat and if 1, (M) is commutative, then M 
is diffeomorphic to a *torus. 


E. Uniqueness Theorem 


Uniqueness of topological structures (as in the 
sphere theorem) of certain classes of compact 
Riemannian manifolds is discussed here. 

The discovery of texotic 7-spheres by J. 
Milnor (Ann. Math., (2) 64 (1956)) gave rise to 
the question of whether in the sphere theorem 
the conclusion (homeomorphism to the sphere) 
could be replaced by diffeomorphism. Since 
the number of differentiable structures on a 
topological sphere depends on its dimension 
(— 114 Differential Topology), it has been 
thought that in order to get the standard 
sphere in the sphere theorem the best possible 
restriction for the curvature might also depend 
upon dimension. The appropriate differenti- 
able pinching problem is to find a sequence 
{Am} so that if M is compact and simply con- 
nected and if ô< K <1 for some 6>A,,, then 
M is diffeomorphic to S", and A,, is the least 
possible with this property. D. Gromoll and Y. 
Shikata proved independently [14, 15] that A,, 
<1 holds for all mz 2. Later it was proved 
that there exists a à, €(1/4, 1) such that A, < ôo 
holds for all mz 2 [16]. The diffeotopy theo- 
rem, which plays an essential role in the proof 
of finding such a Au, provides a sufficient con- 
dition for a diffeomorphism on S”~' to be 
"isotopic to the identity mapping. 

A different idea is put forth in [17], which 
imitates the Gauss normal mapping of a 
closed convex hypersurface in R™*?. It is 
proved that if the curvature operator is suffi- 
ciently closed to the identity, then M is diffeo- 
morphic to S". This idea is used to obtain a 
diffeomorphism between M and a spherical 
space form. A generalization of the sphere 
theorem states that if M is compact and if 
(d(M)/ny inf, K 7 1/4, then M is a topological 
sphere [18]. 


F. Noncompact Manifolds 


Let M be noncompact. It is due to the nature 
of noncompactness that through each point on 
M there passes a ray y:[0, 0)>M, e.g, y isa 
unit speed geodesic any of whose subarc is 
minimizing. 


(1) Busemann Functions. A Busemann function 
F,:M—>R with respect to a ray y is defined by 
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F,(x) « lim, ,, [t — d(x, y(t) The original defi- 
nition of it was used by H. Busemann to define 
a parallel axiom on straight G-spaces (— Sec- 
tion H). F, has the property F; !((—o0,1,]) 
[xe F;  ((—o0, tI) d OF, 1 (—99,1,])) > 

t —1,j for any t; 2 t,, and hence F,(x)=t, — 
d(x, ôF; ! ((—o0,t5])) for all t; and x with 
F,(x) & t5. It has been proved, by using the 
second variation formula, that if Ric>0, K > 

0 [19] or if, in the case where M is Káhlerian, 
the holomorphic bisectional curvature [20] 

is nonnegative, then F, is subharmonic (sh), 
convex or !plurisubharmonic (psh), respec- 
tively, where the holomorphic bisectional 
curvature is defined as R(X,JX,JY, Y) for the 
complex structure J and orthonormal vectors 
X and Y. If the holomorphic bisectional curva- 
ture [20] is positive, then F, is strictly psh. If 
K 20, then the triangle comparison theorem 
implies that F =sup{F,|y(0)=p} is convex and 
is an exhaustion [21], where the sup is taken 
over all rays emanating from a point pe M, 
and a function f is said to be an exhaustion if 
and only if f! (( —o0,a]) is compact for all 
acR. À well-known theorem of H. Grauert 
(Math. Ann., 140 (1960)) states that if M admits 
a strictly psh exhaustion function, then M isa 
*Stein manifold (— 21 Analytic Functions of 
Several Complex Variables). In this context, 
various conditions for curvatures on Kahler 
manifolds under which they become Stein 
manifolds have been found [20]. 

Let H be complete and simply connected, 
and K <0. Busemann functions on H are 
differentiable of class C?, and the thorosphere 
F,"({t}) is a C?-surface [22]. On a parabolic 
visibility manifold (— Section F(3)) the nega- 
tive of every Busemann function is C!-convex 
without minimum [7]. 


(2) Ends and Splitting Theorems. Ends of a 
noncompact M are defined as follows: If A, 
and A, are compact subsets of M with A, c 
A,, then any component of M — A, is con- 
tained in a unique component of M — A,. An 
end is the limit of an inverse system (compo- 
nents of M — A; A} directed by the inclusion 
relation as indicated above and indexed by 
{A|A compact}. 

It has been shown that if Ric >0 [23] (or 
Z0 [19]), then M has exactly one end (or at 
most two ends). A visibility manifold has at 
most two ends if it is not Fuchsian [7]. If M 
admits a locally nonconstant convex function, 
then M has at most two ends [24]. 

If M has more than one end, then there 
exists a straight line y: R9 M which is by 
definition a nontrivial geodesic any of whose 
subarcs is minimizing. A classical result of 
Cohn-Vossen (Mat. Sb., 1 (1936)) states that if 
dim M =2, K >0 and if M contains a straight 
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line, then the total curvature is 0 and hence M 
is isometric to either a plane or a cylinder $! x 
R. Generalizations of this result state that if 
M admits k independent straight lines and if 
K z0 (Toponogov, Amer. Math. Soc. Transl., 
37 (1964)) (or if Ric z 0 [19]), then M is iso- 
metric to the Riemannian product N x R*. 


(3) Structure Theorems. The structure theorem 
of 2-dimensional M with K >0 was proved by 
Cohn-Vossen (Compositio Math., 2 (1935)). It 
has been shown that M is diffeomorphic to R" 
if K >0 [23] and that if K 20, then there 
exists a compact totally geodesic submanifold 
S without boundary such that M is homeo- 
morphic to the total space of the *normal 
bundle over S [21]. Here, homeomorphism 
can be replaced by diffeomorphism, and if 
K >0 outside a compact set of M, then M 
is diffeomorphic to R" [26]. 

For a ray y and a point x on H, ifc isa 
ray with c(0) 2 x and if d(y(t), o(t)), t — 0, is 
bounded above, then c is called an asymptotic 
ray (a ray asymptotic) to y. The asymptotic 
relation is an equivalence relation on the set 
of all geodesics on H. A point at infinity of H 
is an equivalence class on geodesics on H, 
and the set of all points at infinity of H is 
denoted by H(oo). With a suitable topology, 
the set of all points at infinity of H constitutes 
a bounding sphere such that H = HU H(oc) 
is a closed m-cell. For a properly discontin- 
uous group D of isometries acting on H, a 
closed D-invariant limit set L(D) is obtained 
in H(oo) as the set of all accumulation points 
of 6(p), 9c D. M = H/D is a visibility manifold 
if and only if H satisfies the following axiom: 
any two distinct points on H(oo) can be joined 
by at least one geodesic. If K «c «0 is satisfied 
on M, then M is a visibility manifold [7]. 
By investigating limit sets, visibility manifolds 
are classified into three types [7]. (1) M is 
parabolic; M is diffeomorphic to N x R and 
is characterized by the fact that it has a convex 
function without minimum. (2) M is axial; M 
is a vector bundle over ST. and hence it is 
diffeomorphie to either S! x Bn") or the prod- 
uct of a Möbius strip with Bn" 7 (3) M is 
Fuchsian; M has more than two ends, and a 
strong algebraic restriction is imposed on the 
fundamental group of M. 


(4) Convex Functions. Some elementary prop- 
erties of C” convex functions on M have 
been investigated and it was proved in [4] 
that if M admits a C” convex function with- 
out minimum and if K «0, then M is diffeo- 
morphic to N x R, where N is a level hyper- 
surface. When K » 0, F can be replaced by a 
strongly convex exhaustion function. Namely, 
for every compact set A c M there is a 00 
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such that the second difference quotient along 
every geodesic at any point on A is bounded 
below by ô [20]. A standard tconvolution 
smoothing procedure yields smooth convex 
approximations in neighborhoods of compact 
sets for every strongly convex function such 
that their second derivatives along every geo- 
desic is positive. Global approximations can 
be constructed [27]. Let o: M—R be a convex 
function which is not constant on any open 
set. It has recently been proved that (1) if b> 
inf, > —oo, then there is a homeomorphism 
H:o !((b]) x (inf, p, o0) 5 M — {minimum 

set of o, if any} such that o o Hip, «)=4« for all 
rem ^! ((b]) and for all xe(inf, o, oo); (2) if @ 
takes a minimum, then the continuous exten- 
sion H -Dnt p, oo) 2 M of H is proper and 
surjective [24]. 


G. Manifolds All of Whose Geodesics Are 
Closed 


As is well known, every symmetric space of 
compact type of rank 1 (it is abbreviated 
CROSS as in [28]) has the property that all 
geodesics are simply closed and of the same 
length. Let M be a compact Riemannian mani- 
fold with the property that all geodesics on M 
are simply closed and that they have the same 
length (SC property). The problem discussed 
here is whether such an M is isometric (or at 
least the topology of such an M is equivalent) 
to some CROSS, or if it is possible to classify 
all such manifolds. 

An example of a nontrivial SC-manifold 
was first constructed by Zoll (Math. Ann., 57 
(1903)) on S? as a surface of revolution in R?, 
which is not isometric to a standard sphere. 
Blaschke conjectured that every SC-structure 
on PR" is the standard real projective space. 
This has been solved affirmatively by L. W. 
Green (Ann. Math., 78 (1963)). In higher di- 
mensions it has been proved that every in- 
finitesimal deformation of the standard SC- 
structure on PR” is trivial [29]. A general 
result for a SC-manifold M states that the 
volume of M (with dim M =m) with period jn 
is the integral multiple of the volume of the 
standard unit m-sphere and this integer is a 
topological invariant [30]. For a point pe 
M, if every geodesic segment with length / 
emanating from p is a simple geodesic loop at 
p, then M is called a SC?-manifold. It has been 
proved that the *integral cohomology ring of 
every SC?-manifold is isomorphic to one of the 
CROSS [31], which is a generalization of 
Bott's theorem (Ann. Math., 60 (1954)). 

An essential difference between the metrics 
of a Zoll surface and a standard sphere is seen 
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on the cut locus and the conjugate locus of a 
point. The (tangent) cut locus of any fixed 
point of a CROSS coincides with the (tangent) 
first conjugate locus; however, this does not 
hold on a Zoll surface. This observation gives 
rise to the definition of a Blaschke manifold: 
For distinct points p, qe M, let A(p, q) c M, be 
the set of all unit vectors tangent to the mini- 
mizing geodesics from q to p. M is said to be 
a Blaschke manifold at a point p if for every 
q€ C(p), A(p, q) is a great sphere of the unit 
hypersphere in M, centered at the origin. M is 
called a Blaschke manifold if it is so at every 
point of it. The following statements are equiv- 
alent [28]: (1) M is a Blaschke manifold at 

p, (2) the tangent cut locus C, at p is spher- 
ical in M,, (3) along every geodesic starting 
from p, the first conjugate point p appears 

at a constant length and the multiplicity of 

it is independent of the initial direction. Then 
such an M can be exhibited as DU, E, where 
D is a closed ball with center p, E a C*?-closed 
k-disk bundle over an (m — k)-dimensional 
C?-compact manifold with boundary GE dif- 
feomorphic to S" !, and a:0D— OE an tattach- 
ing diffeomorphism. Conversely, if M is ex- 
hibited as DU, E then there exists a metric g 
on M such that (M, g) is a pointed Blaschke 
manifold at p, where p is the center of D. A 
generalized Blaschke's conjecture states that 
every Blaschke manifold is isometric to a 
CROSS. A partial solution to this conjecture 
has been obtained by M. Berger and states 
that if (S", g) is a Blaschke manifold, then it is 
isometric to a standard sphere [28]. 


H. G-Spaces 


G-spaces were created by Busemann to show 
that many global theorems of differential 
geometry are independent of the Riemannian 
character of the metric and also of smooth- 
ness. This approach naturally led to novel 
questions and results. (Many facts hold even 
when the distance is not symmetric; — [34].) 
The symbol G suggests that the principal 
property of G-spaces is the existence of geo- 
desics with the same properties as in complete 
Riemannian manifolds without boundary 
except for differentiability. A G-space X is 
defined as follows: (1) X is metric with (sym- 
metric) distance d; (2) every bounded infinite 
set has at least one point of accumulation; (3) 
given two distinct points p, re X, thereisa 
point qe X such that pz q zr and d(p,q) 
d(q, r) — d(p, ry; (4) to each point xe X a posi- 
tive number p, is assigned such that for any 
p, qe X with d(p, x) « p,, d(q, x) « p, there is 

a point r with d(p, q)+ d(q, r) 2 d( p, r); (5) if 
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dp, q) *- d(q, r) - d(p, r) and if d(p, q)+ d(q, ) — 
d(p, r'), then d(q, r) - d(q, r) implies r— r'. 

From (2) and (3) it follows that any two 
points on X can be joined by a curve called a 
segment whose length realizes the distance 
between them. A geodesic arc (or for simplicity 
geodesic) »:[a, b] — X is a curve such that any 
subarc contained in a p,-ball around some 
point xe X is a segment. From (4) and (5) it 
follows that every geodesic arc has a unique 
infinite extension to both sides. If a geodesic 
arc is extended infinitely to both sides, then it 
is called a geodesic line. 

The absence of smoothness causes an essen- 
tial difference in the fact that the distance 
function to a fixed point on X is not necessar- 
ily convex in a small ball around it, in contrast 
to the Whitehead theorem (— Section A) for 
Riemannian manifolds (or *Finsler spaces). 
Here a function on X is called convex if and 
only if its restriction to every arc length— 
parametrized geodesic line is convex. 

Every two points on X have neighborhoods 
which are homeomorphic to each other. Thus 
the topological dimension of X(— 117 Dimen- 
sion Theory) is well defined. A 1-dimensional 
X is either a circle S! or the whole real line. If 
dim X =2 [33] or if dim X =3 [36], then X isa 
topological manifold. 

It should be noted that G-space theory not 
only proves known Riemannian theorems 
under weaker conditions, but also leads to 
many facts which were either not considered 
previously or well understood in (or thought 
to be very different from) the Riemannian case. 
Among many fruitful topics discussed by 
Busemann [33], only two are included below. 


(1) G-Surfaces. For each point p on a 2-dimen- 
sional G-space Z, which is called a G-surface, 
let $, 2 ixe 7 |d(x, p) - pj, where 0 « p < p,/4 is 
a fixed number. S, is homeomorphic to a circle 
and bounds a 2-disk. An angle A at p is the set 
of all segments emanating from p and passing 
through the points on a connected subarc of 
Sp. An angular measure |:| for the angles at 
pis a nonnegative function which satisfies: 

(1) | A| 2x if and only if p is the midpoint of 
the segment joining two points on S, which 
bounds A; (2) if the intersection*of two angles 
A, and A, is a unique segment, then | A, U A, | 
=|A,|+|A,]|. If a triple of points (po, p,, p2) on 
4 are contained in a sufficiently small ball, 
then the three segments joining them define 

a triangle, and each vertex p; has the angle 
|Pi-iPiPis1l determined by the two segments. 
An excess £(pop; P2) of a triangle (po, p4, p;) is 
defined to be &(poP1 P2)=|PoP1P2|+1P1P2Pol 
+|P2Pop,|—7. A degenerate triangle has _ 
excess 0. If a triangle (a, b, c), which consists 

of geodesic arcs, is simplicially decomposed 
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into a finite number of triangles (a;, bi, c;), i= 
1,...,k, then (abc) = £$- e(ajbjc;), and this 

is independent of the choice of finite decompo- 
sition. If Y is compact and orientable, the 
total excess &(.7) of Z is defined by e(Y)= 
Et e(a;b;,c,)), where ¥ is simplicially decom- 
posed into nondegenerate finite triangles 

(a;, bj cj), i= 1, ..., k. Then (9) - 2n P), 
where y(4^) is the tEuler characteristic of /. 
The results obtained by Cohn-Vossen [35] on 
the *total curvature of complete open surfaces 
have also been generalized to G-surfaces with 
angular measure uniform at x. Although no 
angular measure leading to a true analog of 
the tGauss-Bonnet theorem exists even in G- 
surfaces [34], many of its applications, in 
particular the results by Cohn-Vossen, can be 
proved by using angular measures which are 
uniform at z [33]. 

If every two points on X can be joined by a 
unique geodesic line, then all geodesic lines on 
X are simultaneously either straight lines or 
circles of the same length /. If in the latter case 
dim X » 1, then X has a two-fold universal 
covering space X whose geodesic lines are all 
circles of the same length 21, and they have the 
property that all geodesics through a point on 
X meet at the same point at length /. For the 
Riemannian case it is a Blaschke manifold (— 
Section G). If every two points on Z can be 
joined by a unique geodesic line, then 4 is 
either a plane or a projective plane. 


(2) G-Spaces with Nonpositive Curvature. Let 
(Po. P1; P2) be a triple of points on X which are 
contained in a small ball. X, by definition, is of 
nonpositive curvature if and only if for any such 
triple of points l 


d(pi,pis1) 22d(pi, pl for i=0,1,2, — (*) 


where p; is the midpoint of the unique segment 
joining p;_, to p;.,. If the above inequality is 
strict for any nondegenerate small triangle, 
then X is said to be of negative curvature. X of 
curvature Q is defined in the same way. It 
should be noted that a G-space of curvature 0 
is a locally Minkowskian space [33]. The 
sectional curvature of a Riemannian manifold 
M is nonpositive if and only if (*) holds for all 
small triangles on it. Because of d(po, Po) < 
d(Po, pi) + d(pi, po) € (d(po. P1)  d(po. Pa) }/2s 
the distance function to a point xe X of non- 
positive curvature is convex on a small ball 
around x. X is called straight if and only if all 
nontrivial geodesic lines on X are straight 
lines. If X has nonpositive curvature, then 

the universal covering space X is straight. 
Moreover, for any two geodesic lines y, c: R^ 
X, the function t d(o(t), y(t)) is convex. In 
particular, the distance function to every fixed 
point on X is convex, and (*) holds for any 
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triangle on X. Thus X is contractible. Every 
nontrivial element of the homotopy class of 
loops on X with base point xe X contains a 
geodesic loop at x which has the minimum 
length. This fact and the strict convexity of 

t -»d(o(t), y(t)) for any two arc length para- 
metrized geodesic lines y, o on the universal 
covering X of X with negative curvature imply 
that if X is compact, then every Abelian sub- 
group of the fundamental group z,(X) of X is 
infinite cyclic. This is a generalization of the 
Preissmann theorem (— Section B). 
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A. General Remarks 


A geometric construction problem is a problem 
of drawing a figure satisfying given conditions 
by using certain prescribed tools only a finite 
number of times. If the problem is solvable, 
then it is called a possible construction problem; 
if it is unsolvable, even though there exist 
figures satisfying the given conditions, then it 
is an impossible construction problem. If there 
does not exist a figure satisfying the given 
conditions, then we say that the problem is 
inconsistent. 

Among problems of geometric construction, 
the oldest and the best known are those of 
constructing plane figures by means of ruler 
and compass. In this article, we call these 
problems simply problems of elementary geo- 
metric construction. The following are some of 
the more famous problems of this kind; the 
first four are possible construction problems. 

(1) Suppose that we are given three straight 
lines ], m, n and three points P, Q, R in a plane. 
Draw a triangle ABC in such a way that ver- 
tices A, B, C lie on l, m, n and sides BC, CA, 
AB pass through P, Q, R (Steiner's problem). 

(2) Suppose that we are given a circle O and 
three points P, Q, R not lying on O. Draw a 
triangle ABC inscribed in O in such a way that 
the sides BC, CA, AB pass through P, Q, R 
(Cramer-Castillon problem). 

(3) Draw a circle tangent to all of three 
given circles (Apollonius! problem). 

(4) Suppose that we are given a triangle. 
Draw three circles inside this triangle in such a 
way that each is tangent to two sides of the 
triangle and any two of the circles are tangent 
to each other (Malfatti's problem). 

(5) Let n be a natural number. For the divi- 
sion of the circumference of a circle into n 
equal parts (consequently, the construction of 
a *regular n-gon) to be a possible construction 
problem, it is necessary and sufficient that 
the representation of n as a product of prime 
numbers take the form n — 2?p, ... p,, where 
AZO, Pi, ..., p, are all different prime numbers 
of the form 2*+ 1 (*Fermat number) (C. F. 
Gauss, 1801). 

(6) The following are three famous impos- 
sible construction problems of Greek origin: (1) 
divide a given angle into three equal parts 
(trisection of an angle); (ii) construct a cube 
whose volume is double that of a given cube 
(duplication of a cube or the Delos problem); 
and (iii) construct a square whose area is that 
of a given circle (quadrature of a circle). P. L. 
Wantzel (1837) proved that problems (1) and 
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(ii) are impossible except for the special cases 
of 2/2, 7/4, etc. for (i); and C. L. F. Lindemann 
(1882) proved the impossibility of (iii) while 
proving that the number z is *transcendental. 


B. Conditions for Constructibility 


A problem of elementary geometric construc- 
tion amounts to a problem of determining a 
certain number of points by drawing straight 
lines that pass through given pairs of points, 
and circles having given points as centers and 
passing through given points. Let (a,, b,), 

(a5, b5), ...,(a,, Dal be rectangular coordinates 
of given points, and let K be the smallest 
*number field containing the numbers a,, 
...,0,. Straight lines that join given pairs of 
points and circles that have given points as 
centers and that pass through given points are 
represented by equations of the first or second 
degree with coefficients belonging to K. Con- 
sequently, the coordinates of points of intersec- 
tion of these straight lines and circles be- 

long to a quadratic extension K’=K (Vd) of 
K. Let A be the set of coordinates of the points 
that are to be determined. Then the problem is 
solvable if and only if any number a in A is 


contained in a field L—K( d, , da, d 
where dn e K(/ d, , ..., / d; (i0, 1, ..., r— 1). 
Thus L is a *normal extension field of K whose 
degree over K is a power of 2. Using this 
theorem we can prove the impossibility of 
trisection of an angle and duplication of a 
cube. 

Since the 18th century, besides the problem 
of construction by ruler and compass, prob- 
lems of construction by ruler alone or by 
compass alone have also been studied. We 
state here some of the more notable results: 

(1) If by drawing a straight line we mean the 
process of finding two different points on that 
line, then we can solve all the problems of 
elementary geometric construction by means 
of compass alone (G. Mohr, L. Mascheroni). 
(2) If by drawing a circle we mean the process 
of finding its center and a point on its circum- 
ference, and if a circle and its center are given, 
then we can solve any problem of elementary 
geometric construction by means of ruler 
alone. (3) It is not possible to find the center of 
a given circle by ruler alone (D. Hilbert). (4) It 
is impossible to bisect a given segment by ruler 
alone. (5) When two intersecting circles or 
concentric circles are given, we can find the 
centers of these circles by ruler alone. When 
nonintersecting and nonconcentric circles are 
given, it is not possible to find their centers by 
ruler alone (D. Cauer). 

Cases have been considered in which the 
radius of a circle we can draw by compass or 
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the length of a segment we can draw by ruler 
is required to satisfy certain conditions. Also, 
various considerations have been made con- 
cerning cases in which we can use tools other 
than ruler and compass. For example, it is 
known that although not all possible elemen- 
tary geometric construction problems are 
solvable by ruler alone, all these problems are 
possible if we have either a pair of parallel 
rulers, a square, or a triangle having a fixed 
acute angle. If we use a square and a compass, 
then the trisection of an angle and the dupli- 
cation of a cube are possible (L. Bieberbach). 
Also, when a conic section other than a circle 
is given, the trisection of an angle and the 
duplication of a cube become possible by ruler 
and compass (H. J. S. Smith and H. Kortum). 
By ruler and *transferrer of constant lengths, 
we can solve Malfatti's problem but not Apol- 
lonius' problem (Feldblum). 

Even when a problem is possible, the 
method of construction may be rather com- 
plicated and impractical. In these cases, vari- 
ous methods of highly accurate approximate 
construction have been investigated. 
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A. General Remarks 


Geometric optics is a mathematical theory of 
light rays. It is not concerned with the prop- 
erties of light rays as waves (e.g., their wave- 
length and frequency), but studies their prop- 
erties as pencils of rays that follow three laws: 
the law of rectilinear propagation, the law 

of reflection (e. angles of incidence and re- 
flection on a smooth plane are equal (Euclid)), 
and the law of refraction (i.e., if 0 and 0' are 
angles of incidence and refraction of a light 
ray refracted from a uniform medium to a 
second uniform medium and if n, n' are the 
refractive indices of the first and the second 
medium, respectively, then nsin 0 =n sin 0' (R. 
W. Snell, Descartes)). These three laws follow 
from Fermat's principle, which states that the 
path of a light ray traveling from a point A' to 
A in a medium with refractive index n(P) at P 
is such that the integral (4. n(P)ds attains its 
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extremal value, where ds is the line element 
along the path. This line integral is called 

the optimal distance from A’ to A. Therefore 
Fermat's principle can be taken as a foun- 
dation of geometric optics and is, in a way, 
similar to the tvariational principle in particle 
dynamics (Maupertuis's principle), 


d /2h —2U(P)ds —0, 


which is satisfied by the path of a particle of 
unit mass having constant energy h passing 
through a field of *potential U(P). The quan- 
tity ,/ 2h —2U corresponds to the refractive 
index n. 

In an optical system, express the position 
of a point on the path of a light ray by ortho- 
gonal coordinates (x, y, z), and define the tLa- 
grangian L=n,/1 +x? - y? (xs dx/dz, p= 
d y/dz), optical direction cosines p = 0L/OX, q = 
OL/Oy, and the fHamiltonian H — Xp + yq— 
L=—./n* — p? — q?. Then the *canonical 
equations of the path are obtained as in parti- 
cle dynamics; x, y and p, q are called tcanon- 
ical variables. Due to the variational prin- 
ciple, the integral of the linear differential form 
pdxdt qdy—H dz 2o along a light path is a 
function S(A’, A) of the endpoints A’, A of the 
path, and the optical direction cosines and the 
Hamiltonians of the system at A and 4’ are 
given by 
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Hence we obtain *Hamilton-Jacobi differential 
equations 
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As a corollary to these relations we obtain 
Malus's theorem, which states that a pencil of 
light rays perpendicular to a common surface 
(locally) at a given moment is also perpendic- 
ular to a common surface (locally) after an 
arbitrary number of reflections and refractions. 
Suppose that light rays travel from an ob- 
ject space into an image space through an 
optical apparatus. If all the rays starting from 
any one point of the object space converge to 
a point of the image space and if the mapping 
given by this correspondence is tbijective, then 
we say that this imaging is perfect. Examples 
of perfect imaging systems are realized by 
optical apparatus such as Maxwell's fisheye 
(having refractive index n(r)=a/(b+r7), where 
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r denotes the distance from the center of the 
system) and Luneburg's lens (n(r) = ,/ a— r°). 
Perfect imaging conserves optical distance, 
yields the relation n(A")ds' = n(A)ds, and gives 
a tconformal mapping, with the magnification 
inversely proportional to the refractive index. 


B. Gauss Mappings 


Consider an optimal system with a symmetri- 


cal axis of rotation, its optical axis. A ray of 


light that is near the optical axis and has a 
small inclination to the axis is called a paraxial 
ray. A mapping realizable by paraxial rays 
where the canonical variables x, y, p, q can be 
considered to be infinitesimal variables whose 
squares are negligible, is called a Gauss map- 
ping (Gauss map). When the positions of an 
object point and its image under a Gauss 
mapping are represented by homogeneous co- 
ordinates, the mapping is represented as a 
linear transformation, i.e., a *collineation, 
which maps a point to a point and a line to a 
line. À point in one space corresponding to 
the point at infinity in the other space is called 
a focus. If we take a focus as the origin of a 
coordinate system in each space and use the 
thomogeneous coordinates x; such that x = 

X41 /X4, Y= X5/X4, Z= X3/X4, then a Gauss 
mapping can be represented as x, — x, 

X4 =X}, X42 fx4, f'x4— x4. The ratio of x to 
x’, ie., the lateral magnification, is x/x' 2z/f = 
f '/z , where x’ is the length of an object ortho- 
gonal to the axis and x is the length of its 
image. The distance f between a focus and a 
point where the lateral magnification is 1 (such 
a point is called a principal point) is called 
focal length in each space. The telescopic 
mapping, i.e., x, — X4, X2 — X5, X3 ax5, 

X4 — bx4, is also a Gauss mapping, in which 
the lateral magnification is constant. 


C. Aberration 


When a mapping is realized not only by par- 
axial rays but also by rays having larger incli- 
nations, a departure from the Gauss mapping 
arises. This departure ts generally called aber- 
ration. Suppose that a light ray that passes 
through the point (x’, y’,z’) of a plane perpen- 
dicular to the optical axis at a fixed z’ and has 
optical direction cosines p’, d is transformed 
by the optical apparatus into a light ray that 
passes through the point (x, y, z) of a plane 
perpendicular to the optical axis at a fixed z 
and has optical direction cosines p, q there. 
Then by the variational principle, pdx + qdy — 
p dx —q' dy =dW (dW is an !exact differen- 
tial). Therefore the transformation (x’, y', p', q’) 
—(x, y, p,q) is a canonical transformation. The 
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mapping can be described by 
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in terms of W and can also be represented in 
terms of V=W+p’'x'+q'y or U=W+ p’x’+ 
q'y — px — qy. For a given optical system, 

one of these functions W, U, V (called a char- 
acteristic function or eikonal) can be used to 
estimate the aberration. By developing such a 
characteristic function in power series of 
canonical variables and observing its terms 

of less than the fifth power, we can single out 
five kinds of aberration: spherical aberration, 
curvature of image field, distortion, coma, and 
astigmatism in a rotationally symmetric opti- 
cal system. To eliminate these aberrations an 
optical system must satisfy Abbe's sine con- 
dition (the elimination of spherical aberration 
and coma), Petzval's condition (the elimination 
of astigmatism and curvature of image field), 
and the tangent condition (the elimination of 
distortion). 

The path of a charged particle in an electro- 
magnetic field can be treated in the same way 
as the path of a light ray. Let e represent the 
specific charge of the particle, h the energy, 
Ay the electrostatic potential, and A,, A,, A, 
vector potentials. Then the index of refrac- 
tion is J/2(h—zA&) + £(A,dx/ds + A,dy/ds + 
A, dz/ds). Yn this case, the index of refraction 
shows the anisotropy caused by the existence 
of the magnetic field. The paths of paraxial 
rays are determined by a set of linear differ- 
ential equations of the second order, and the 
Gauss mapping is realized as in geometric 
optics. 
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The Greek word for geometry, which means 
measurement of the earth, was used by the 
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historian Herodotus, who wrote that in an- 
cient Egypt people used geometry to restore 
their land after the inundation of the Nile. 
Thus the theoretical use of figures for practical 
purposes goes back to pre-Greek antiquity. 
Tradition holds that Thales of Miletus knew 
some properties of congruent triangles and 
used them for indirect measurement, and that 
the Pythagoreans had the idea of systematiz- 
ing this knowledge by means of proofs (— 24 
Ancient Mathematics; 187 Greek Mathemat- 
ics). tEuclid’s Elements is an outgrowth of this 
idea (1]. In this work, we can see the entire 
mathematical knowledge of the time presented 
as a logical system. It includes a chapter (Book 
V) on the theory of quantity Oe, the theory of 
positive real numbers in present-day termi- 
nology) and chapters on the theory of integers 
(Books VII-IX), but for the most part, it treats 
figures in a plane or in space and presents 
number-theoretic facts in geometric language. 

Geometry in today's usage means the 
branch of mathematics dealing with spatial 
figures. In ancient Greece, however, all of 
mathematics was regarded as geometry. In 
later times, the French word géomètre or the 
German word Geometer was sometimes used 
as a synonym for mathematician. In a fragment 
of his Pensées, B. Pascal speaks of the esprit de 
géométrie as opposed to the esprit de finesse. 
The former means simply the mathematical 
way of thinking. 

Algebra was introduced into Europe from 
the Middle East toward the end of the Middle 
Ages and was further developed during the 
Renaissance. In the 17th and the 18th cen- 
turies, with the development of analysis, geom- 
etry achieved parity with algebra and analysis. 

As R. Descartes pointed out, however, fig- 
ures and numbers are closely related [2]. 
Geometric figures can be treated algebraically 
or analytically by means of fcoordinates (the 
method of analytic geometry, so named by 
S. F. Lacroix [3]; conversely, algebraic or ana- 
lytic facts can be expressed geometrically. Ana- 
lytic geometry was developed in the 18th cen- 
tury, especially by L. Euler [4], who for the 
first time established a complete algebraic 
theory of tcurves of the second order. Previ- 
ously, these curves had been studied by Apol- 
lonius (262—200? s.c.) as *conic sections. The 
idea of Descartes was fundamental to the 
development of analysis in the 18th century. 
Toward the end of that century, analysis was 
again applied to geometry. For example, G. 
Monge's contribution [5] can be regarded as a 
forerunner of tdifferential geometry. 

However, we cannot say that the analytic 
method is always the best manner of dealing 
with geometric problems. The method of treat- 
ing figures directly without using coordinates 
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is called synthetic (or pure) geometry. In this 
vein, a new field called *projective geometry 
was created by G. Desargues and B. Pascal in 
the 17th century. It was further developed in 
the 19th century by J.-V. Poncelet, L. N. Car- 
not, and others. In the same century, J. Steiner 
insisted on the importance of this field (— 343 
Projective Geometry). 

On the other hand, the taxiom of parallels 
in Euclid's Elements has been an object of 
criticism since ancient times. In the 19th cen- 
tury, by denying the a priori validity of Eu- 
clidean geometry, J. Bolyai and N. I. Loba- 
chevskii formulated non-Euclidean geometry, 
whose logical consistency was shown by 
models constructed in both Euclidean and 
projective geometry (— 285 Non-Euclidean 
Geometry). 

In analytic geometry, physical spaces and 
planes, as we know them, are represented as 
3-dimensional or 2-dimensional Euclidean 
spaces E?, E?. It is easy to generalize these 
spaces to n-dimensional Euclidean space E". A 
“point” of E" is an n-tuple of real numbers (x,, 
..., X), and the distance between two points 
Dei, Xah Quis Yn) 18 (Yi eat (y, 
—x,)*)"?. The geometries of E?, E? are called 
plane geometry and space (or solid) geometry, 
respectively. The geometry of E" is called n- 
dimensional Euclidean geometry. We obtain 
n-dimensional projective or non-Euclidean 
geometries similarly. F. Klein [7] proposed 
systematizing all these geometries in group- 
theoretic terms. He called a "space" a set S on 
which a group G operates and a “geometry” 
the study of properties of S invariant under the 
operations of G (— 137 Erlangen Program). 

B. Riemann [6] initiated another direction 
of geometric research when he investigated n- 
dimensional *manifolds and, in particular, 
*Riemannian manifolds and their geometries. 
Some aspects of Riemannian geometry fall 
outside of geometry in the sense of Klein. It 
was a starting point for the broad field of 
modern differential geometry, that is, the geom- 
etry of fdifferentiable manifolds of various 
types (— 109 Differential Geometry). 

The reexamination of the system of axioms 
of Euclid's Elements led to D. Hilbert's tfoun- 
dations of geometry and to the axiomatic ten- 
dency of present-day mathematics. The study 
of algebraic curves, which started with the 
study of conic sections, developed into the 
theory of algebraic manifolds, the algebraic 
geometry that is now developing so rapidly (— 
12 Algebraic Geometry). Another branch of 
geometry is topology, which has developed 
since the end of the 19th century. Its influence 
on the whole of mathematics today is con- 
siderable (— 114 Differential Topology; 426 
Topology). Geometry has now permeated all 
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branches of mathematics, and it is sometimes 
difficult to distinguish it from algebra or anal- 
ysis. The importance of geometric intuition, 
however, has not diminished from antiquity 
until today. 
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A. History 


H. Minkowski introduced the notions of lat- 
tice and convex set in the talgebraic theory of 
numbers. He developed a simple yet powerful 
method of arithmetic investigation using these 
geometric notions to simplify the analytic 
theory of tDiophantine approximation, which 
had been developed by P. G. L. Dirichlet and 
C. Hermite. His theory, the geometry of num- 
bers, has continued its development and con- 
tributed to various fields of mathematics (— 
83 Continued Fractions). 


B. Lattices 


Let E" be an n-dimensional Euclidean space 
identified with the linear space R". For a point 
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P in E", we denote the corresponding vector in 
R” by v(P) (xi, ..., x,). A subset A of E" is 
called an n-dimensional (homogeneous) lattice 
if there exists a basis {v,,...,v,} of R" such that 
A={PEeE|v(P)= ZIL, Ajvj, AiE Z]. The set of 
points {X}, ..., X,] such that v((X;) — v; (i= 

1, ...,n) is called a basis of the lattice A. A 
typical example of a lattice is the point set 
corresponding to Z” in Rn The "tree module 
generated by v, (i— 1, ..., n) is denoted by A* 
and is called the lattice group of A. We have A* 
= {veR"|v=v(P), Pe A). If {u,,...,u,} is an- 
other basis of the free module A*, then there 
exists an element (2j) of GL(n, Z) (i.e., aj; € Z 
and |det(«;)|= 1) such that u; — 37, Gef, 
Hence the quantity |det(v,, ..., v,)| is inde- 
pendent of the choice of the basis {v,,...,v,}. 
We denote this quantity by d(A) and call it the 
. determinant of the lattice. We denote the mini- 
mum distance between the points belonging to 
A by ó(A). 

A subset L of the space E" is called an in- 
homogeneous lattice if there exists a homo- 
geneous lattice A in E" and a point P, in E" 
such that L2 (Pe E"|v(P) — v(P,)e A*). Thus 
an inhomogeneous lattice is obtained from a 
homogeneous lattice by translation. In this 
article we restrict ourselves to the case of 
homogeneous lattices and henceforth omit the 
adjective “homogeneous.” 

Suppose we are given a sequence of lattices 
A,, Ag,..., in E" with bases {X}, {XP}... 
If the sequence of points Kr) converges to 
X; (i= 1, ..., n) and the set {X,,..., X,} forms 
a basis of a lattice A, we call A the limit of 
the sequence {A,}; we also say that the se- 
quence {A,} converges to the lattice A. In this 
case we have d(A,)—4d(A), 6(A,)6(A). The 
notion of convergence of lattices gives rise to a 
topology of the space M, of all the lattices in 
E. A sequence IA.) of lattices is said to be 
bounded if there exist positive numbers c and 
c' such that d(A,) € c, (^,) 2 c' for all v. A 
bounded sequence of lattices has a convergent 
subsequence. 

Let S be a subset of the space E". A lattice A 
is called S-admissible if we have AN S'= 101, 
where S' is the interior of S and O is the origin. 
We denote the set of S-admissible lattices by 
A(S). Given a closed subset M of M,, we put 
A(SVM)— infe (su d(A) if A(S) M is non- 
empty, while if A(S)f! M is empty, we put 
A(S\M)= œ. When M = Mp, we write A(S\M) 
= A(S) and call it the critical determinant of S. 
Generally, a lattice A in M is said to be critical 
in M with respect to S if Ac A(S) and d(A)— 
A(S\M). Suppose that we have 0<A(S\M)< 
oo. Then for a lattice critical in M with respect 
to S to exist it is necessary and sufficient that 
there exist a bounded sequence {A,} such that 
A, E M A(S) and d(A,)>A(S\M). 
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C. Successive Minima and Minkowski's 
Theorem 


A subset S of the space E" is called a bounded 
star body (symmetric with respect to the 
origin) if there exists a continuous function F 
defined on the space E satisfying the following 
four conditions: (i) F(0) 2 0; (ii) if X 40, then 
F(X )>0; (iii) for an arbitrary real number t 
and a point X, we have F(tX )=|t|F(X); (iv) 
S={X|F(X)<1}. A bounded closed *con- 
vex body that is symmetric with respect to the 
origin is a bounded star body. If we are given 
a star body S, the associated function F, and a 
lattice A, there exist a set of points (P, ..., P] 
in A and a set of positive numbers (pi, ..., Pa} 
satisfying the following four conditons: (1) 
v(P,), ... , v(P,) are linearly independent; (2) 
F(P,)=p; (i= 1, ..., n 3) pi < <- & py; (A) if P is 
a point in A and v(P) is not contained in the 
subspace spanned by [v(P;), ..., v(P, .,)], then 
F(P)>p,. The set {p,,...,p,} is uniquely 
determined by S and A. The numbers p; are 
called the successive minima of S in A; the 
points P, are the successive minimum points of 
S in A. 

Minkowski's theorem: Let A be a lattice in a 
Euclidean space E" and S a bounded subset of 
E". Then we have the following: 

(D If the volume V(S) is larger than d(A), 
then there exist points X, and X, in $ such 
that X, z X, and v(X,) — v(X,)e A*. Suppose, 
moreover, that S is convex and symmetric with 
respect to the origin. Then, if V(S) — 2"d(A), 
there exists a point X in SNA different from 
the origin. Hence we have 2"A(S) > V(S) (n= 
dim E"). 

(II) Let S be a bounded closed convex body 
that is symmetric with respect to the origin, 
and let p,, ..., p, be the successive minima of S$ 
in A. Then we have p, ... p, V(S) x 2"d(A). 


D. Minkowski-Hlawka Theorem 


Suppose that we are given a subset S of the n- 
dimensional Euclidean space E" such that the 
characteristic function y(X ) of S is tintegrable 
in the sense of Riemann. Then we have the 
Minkowski-Hlawka theorem: (i) If nz 2 and S 
is open, then A(S) « V(S), and (ii) if, moreover, 
S is symmetric with respect to the origin, 

then 2A(S) < V(S); (iii) if S is a symmetric 

star body with respect to the origin, then 
2C(n)A(S) « V(S), where C(n) is the *Riemann 
zeta function. 

A proof for the theorem was given by E. 
Hlawka (1944); (tii) and (iv) were conjectured 
by Minkowski. C. L. Siegel obtained another 
proof (1945), and C. A. Rogers simplified the 
original proof by Hlawka (1947). There are 
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results concerning the estimation of A(S)/V(S) 
for various subsets S. 


E. Siegel’s Mean Value Theorem 


In an attempt to obtain a proof for the latter 
half of the Minkowski-Hlawka theorem, Min- 
kowski observed the necessity of establishing 
the arithmetic theory of the linear transfor- 
mation groups. Siegel was inspired by this ob- 
servation and obtained the following theorem, 
Siegel's mean value theorem, which implies the 
Minkowski-Hlawka theorem: Let F be a tfun- 
damental region of the group SL(n, R) with 
respect to a discrete subgroup SL(n, Z). Let œw 
be the tinvariant measure on SL(n, R) such 
that f dw=1 (— 225 Invariant Measures). Let 
f be a bounded Riemann integrable function 
with compact tsupport defined on the space 
R". Note that the lattice Z" is stabilized by the 
subgroup SL(n, Z). We have 


> ze ndoto=| f(x)dx, 
F xez" R" 


x 


where the right-hand side of the equation is 
the usual Riemann integral of the function f. 
A. Weil considered this theorem in a more 
general setting (Summa Brasil. Math., 1 (1946)). 


F. Diophantine Approximation 


Minkowski initiated the notion of Diophan- 
tine approximation in reference to the prob- 
lem of estimating the absolute value | f(x)| of a 
given function f, where x varies in Z or ina 
given ring of 'algebraic integers. (A fDiophan- 
tine equation is an equation f(x)=0, where x 
varies in Z.) Today Diophantine approxima- 
tion (in the wide sense) refers to the investiga- 
tion of the scheme of values f(x), where x 
varies in a suitable ring of algebraic integers. 
The geometry of lattices is a powerful tool in 
this investigation. A typical problem in this 
field of study is that of approximating irra- 
tional numbers by rational numbers; here 
tcontinued fractions play an important role (— 
Section G). For the problem of uniform distri- 
bution considered by H. Weyl, the analytic 
method, especially that of trigonometric series, 
is useful (— Section H). Dirichlet's drawer 
principle (to put n objects in m drawers with 
n>m, it is necessary to put more than one 
object in at least one drawer) is one of the 
basic principles used in the theory of Diophan- 
tine approximation. Recently, the theory has 
been applied to the theory of ftranscendental 
numbers and the theory of tDiophantine 
equations. 


688 


G. Approximation of Irrational Numbers by 
Rational Numbers 


Given an irrational number 0, we have the 
problem of finding rational integers x (> 0) 
and y such that |0 — y/x| « c/x, where € is a 
given positive number. Suppose that we are 
given a positive integer N. Using Dirichlet's 
drawer principle we can show the existence of 
x(x N) and y such that |0— y/x| « I/xN. Let 
M (0) be the supremum of positive numbers 
M such that the inequality |0— y/x| < 1/M x? 
holds for infinitely many pairs of integers x, y. 
We have 1 < M(0) (€ oo). Two irrational num- 
bers 0 and @ are said to be equivalent if there 
exists an element (a;)e GL(2, Z) such that 

0' =(a,,0+4,,)/(a,,;9+a,,). In this case we 
have M(0) 2 M'(0). 

If the irrational number 6 satisfies the qua- 
dratic equation a0? +b0+c=0 (a, b, c are 
rational integers), then we have M(0)= 
ET. /b* —4ac, where k 2 min(ax? + bxy + 
cy?|x, yeZ, x #0, y 0}. In general, for an 
irrational number 0 of degree two, we have 
M(0)>./5. The equality M(0)=./5 holds if 0 
is equivalent to 0, =(1+,/5)/2. If 0 is not 
equivalent to 0,, then M(0)z ,/8; the equality 
holds if 0 is equivalent to 0; = 1 4- /2. Simi- 
larly, we have 0,, 04, ...; and M(0,)—3 (no oo). 
If M (0) <3, there exists a 0, such that @ is 
equivalent to 0,. The set of irrational numbers 
0 satisfying M(8) —3 is uncountable (A. A. 
Markov [13]). We have no information about 
M (0) for the general algebraic irrational num- 
ber 0. Let u(0) be the supremum of real num- 
bers u such that the inequality |0 — y/x|<1/x* 
holds for infinitely many pairs of integers x, y. 
Given a number x 72, it can be shown that the 
tLebesgue measure of the set of real numbers 0 
such that u(0) 2 x is zero. If @ is a real alge- 
braic number of degree n, then u(0) « n (J. 
Liouville). Concerning u(0), results have been 
obtained by A. Thue, Siegel, A. O. Gel'fond, 
and F. J. Dyson. K. F. Roth (1954) proved that 
u(0)=2 (Roth's theorem [12]), which settled 
the problem of (8). Roth’s theorem means 
that if x is larger than 2, then there exist only a 
finite number of pairs x, y satisfying |0— y/x| 
< l/x*. This can be generalized to the case of 
the approximation of an element 0 that is 
algebraic over an A-field k by an element of 
the field k (S. Lang [9]). (An A-field is either an 
algebraic number field of finite degree or an 
algebraic function field in one variable over a 
finite constant field.) 

In 1970 W. M. Schmidt [20] obtained 
theorems on simultaneous approximation 
which generalize Roth's theorem. Thus, if 
Xj ...,0, are real algebraic numbers such that 
1, %,,...,%, are linearly independent over the 
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field of rational numbers, then for every e zU 
there are only finitely many positive integers q 
with 

"ol... lla a! 7* <A, 


where !|£|| denotes the distance from a real 
number Z to the nearest integer; in particular, 
we have 


la;—p;/q|i«q WIR, i-1,...,n, 


for only finitely many n-tuples of rationals 


D1/q, ---, Dag, A dual to this result is as follows. 


Let 0,,...,0,, € be as before. Then there are 
only finitely many n-tuples of nonzero integers 
di, qd, With 


TEE Eq rdi du «1. 


This last theorem can be used to prove that if 
a is an algebraic number, k a positive integer, 
and e 0, then there are only finitely many 
algebraic numbers o of degree at most k such 
that [x —o| « H(w) Ir. where H(o) denotes 
the height of œ. See also [16,22]. 

The work of Thue, Siegel, and Rotli had the 
basic limitation of noneffectiveness. A. Baker 
(1968) succeeded in proving that for any alge- 
braic number 0 of degree n 2 3 and any k> 
n, there exists an effectively computable 
number c—c(0, x) » 0 such that |8 — y/x| > 
cx "exp(log x)!/" for all integers x, y (x0) 
[15]. This result is an immediate consequence 
of the following effective version of a classical 
theorem on binary Diophantine equations 
(Thue, 1909): Let f= f(x, y) be an irreducible 
binary form of degree n 2 3 with integer coeffi- 
cients, and suppose that x » n. Then for any 
positive integer m, all integer solutions x, y of 
the equation f(x, y) 2 m satisfy max(|x], |y] < 
cexp(log m)*, where cO is an effectively com- 
putable number depending on n, x, and the 
coefficients of f. Baker obtained this result by 
making use of his theorems which give effec- 
tive estimates of moduli of linear forms in the 
logarithms of algebraic numbers with alge- 
braic coefficients. A typical theorem reads as 
follows: Let «,,...,%, be nonzero algebraic 
numbers with loga,,...,loga, linearly inde- 
pendent over the rationals, and let Ba... f, be 
algebraic numbers, not all 0, with degrees and 
heights at most d and H, respectively. Then for 
any k2 n4 1, we have |f, + Blogo, +... + 
p,loga,|» cexp(— (log H)*), where c» 0 is an 
effectively computable number depending only 
on n, K, loga,,...,loga,, and d (14,197. 

Results of this kind have many important 
applications in number theory. For instance, 
we obtain a generalization of the Gel'fond- 
Schneider theorem on transcendental num- 
bers. Furthermore, the imaginary quadratic 
number fields of class number 1 can be com- 
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pletely determined on the basis of Baker's 
result. This was actually done by Baker (1966) 
and independently by H. M. Stark (1966) 

(— 347 Quadratic Fields). 

Refinements and generalizations of Thue's 
theorem on the finiteness of solutions of bi- 
nary Diophantine equations have been ob- 
tained by Baker and his collaborators. (— 
[17], and also [16]). Also, p- and p-adic ana- 
logs of Baker's results are known [18]. 

There are a number of unsettled problems 
on the irrationality of particular numbers, 
such as the *Euler constant C, z*, or C(2k 4- 1) 
with k a positive integer. R. Apéry (1978) 
proved that there exist a positive number & 
and a sequence of positive integers {q,} such 
that 0< [4,£(3)I| «e ™ for all nz 1, so that €(3) 
is irrational. 


H. Uniform Distribution 


Let 0 be a real number, x a positive integer, 
and [0x] the maximum integer not larger than 
0x. We write (0x) — 0x — [0x ] z 0x(mod 1). 
Jacobi showed that if 0 is irrational, then 

the set ((0x)| xe Nj is densely distributed in 
the interval (0, 1) (N is the set of positive in- 
tegers). In general, let f be a real-valued func- 
tion defined on N. We say that f(x) (mod 1) is 
uniformly distributed in the unit interval, or 

f (x) is uniformly distributed (mod 1), if the 
following condition is satisfied: Let o, f be an 
arbitrary pair of real numbers such that 0 € 

x « f « 1, and let N be a given positive integer. 
Let T(N) be the number of positive integers x 
such that x < N, a «(f(x)) « B, where (f(x)) 2 
foe) - LfG9)). Then lim, ,4, T(N)/N = fi a. In 
order for f(x) (mod 1) to be uniformly dis- 
tributed, it is necessary and sufficient that 
limy ,4, NIEN e?" — 0 for any non- 

zero integer h (Weyl’s criterion, 1914). Weyl 
proved that if Ó is an irrational number, then 
0x (mod 1) is uniformly distributed. 

The following theorem, given by J. G. van 
der Corput, is often useful: Let f be a real- 
valued function defined on N. Consider the 
function f,(x) — f(x + h) — f(x) for an arbitrary 
positive integer A. If f,(x) (mod 1) is uniformly 
distributed (mod 1) for all such h, then f(x) 
(mod 1) is also uniformly distributed (mod 1). 

Utilizing this theorem, it can be shown that 
if f(x) lat x" ! +...4+6,x, where at 
least one of the coefficients 0; is irrational, then 
f(x) (mod 1) is uniformly distributed. 

The notion of uniform distribution of se- 
quences of real numbers has an analog in 
compact Hausdorff spaces and in various top- 
ological groups. A systemmatic treatment of 
such generalized notions of uniform distri- 
bution can be found in [21]. 


182 Ref. 
Geometry of Numbers 


References 


[1] H. Minkowski, Geometrie der Zahlen, 
Teubner, 1910 (Chelsea, 1953). 

[2] H. Minkowski, Diophantische Approxi- 
mationen, Teubner, second edition, 1927 
(Chelsea, 1957). 

[3] H. Minkowski, Gesammelte Abhandlun- 
gen I, II, Teubner, 1911 (Chelsea, 1967). 

[4] J. F. Koksma, Diophantische Approxima- 
tionen, Erg. Math., Springer, 1936 (Chelsea, 
1950). 

[5] H. Weyl, C. L. Siegel, and K. Mahler, 
Geometry of numbers, Mimeographed notes, 
Princeton, 1950. 

[6] J. W. S. Cassels, An introduction to Dio- 
phantine approximation, Cambridge Univ. 
Press, 1957. 

[7] J. W. S. Cassels, An introduction to the 
geometry of numbers, Springer, 1959. 

[8] T. Schneider, Einführung in die trans- 
zendentalen Zahlen, Springer, 1957. 

[9] S. Lang, Diophantine geometry, Inter- 
science, 1962. 

[10] C. L. Siegel, Über einige Anwendungen 


diophantischer Approximationen, Abh. Preuss. 


Akad. Wiss., no. 1 (1929) (Gesammelte Ab- 
handlungen, Springer, 1966, vol. 1, 209-266). 
[11] C. L. Siegel, A mean value theorem in 
geometry of numbers, Ann. Math., (2) 46 
(1945), 340—347 (Gesammelte Abhandlungen, 
Springer, 1966, vol. 3, 39—46). 

[12] K. F. Roth, Rational approximations to 
algebraic numbers, Mathematika, 2 (1955), 1— 
20. 

[13] A. A. Markoff (A. A. Markov), Sur les 
formes quadratiques binaires indéfinies, Math. 
Ann., 15 (1879), 381—407. 

[14] A. Baker, Linear forms in the logarithms 
of algebraic numbers I, II, III, IV, Mathe- 
matika, 13 (1966), 204—216; 14 (1967), 102— 
107; 14 (1967), 220—228; 15 (1968), 204—216. 
[15] A. Baker, Contributions to the theory of 
Diophantine equations I, II, Philos. Trans. 
Roy. Soc. London, A 263 (1967—1968), 173— 
191, 193—208. 

[16] A. Baker, Transcendental number theory, 
Cambridge Univ. Press, 1975. 

[17] A. Baker and J. Coates, Integer points on 
curves of genus 1, Proc. Cambridge Philos. 
Soc., 67 (1970), 595—602. 

[18] A. Brumer, On the units of algebraic 
number fields, Mathematika, 14 (1967), 121— 
124. 

[19] N. I. Fel'dman, Linear form of logarith- 
mic algebraic numbers (in Russian), Dokl. 
Akad. Nauk SSSR, 182 (1968), 1278-1279. 
[20] W. M. Schmidt, Simultaneous approxi- 
mation to algebraic numbers by rationals, 
Acta Math., 125 (1970), 189—201. 


690 


[21] L. Kuipers and H. Niederreiter, Uniform 
distribution of sequences, Interscience, 1974. 
[22] W. M. Schmidt, Diophantine approxi- 
mation, Lecture notes in math. 785, Springer, 
1980. 

[23] W. M. Schmidt, Approximation to alge- 
braic numbers, Enseignement Math., 17 
(1971), 188—253. 

[24] C. F. Osgood (ed.), Diophantine approxi- 
mation and its applications, Academic Press, 
1973. 


183 (VII.23) 
Global Analysis 


Mathematics that treats, by using functional 
analytical techniques, various problems con- 
cerning the tcalculus of variations, tsingular- 
ities, infinite-dimensional Lie groups, or tnon- 
linear partial differential equations, such as 
equations of fluids or of gravitation in general 
relativity, may be called global analysis if it 
uses as a main tool an infinite-dimensional 
version of differential geometry and topology 
analogous to that for finite-dimensional mani- 
folds. The term global analysis therefore has no 
precise definition. However, it can be said that 
it is analysis on manifolds, and the concept of 
tinfinite-dimensional manifolds is the central 
abstract idea in it. 

Suppose one considers a nonlinear differen- 
tial operator on a finite-dimensional manifold. 
Then by using functional analytical techniques 
it often happens that its domain is neither a 
linear space nor its open subset but an infinite- 
dimensional manifold, and that such a non- 
linear operator can be regarded as a tdifferenti- 
able mapping between infinite-dimensional 
manifolds. The tdifferential at a point in that 
source manifold is called a linearized operator, 
to which one can apply various theories of 
linear functional analysis. 

"Global analysis" seems to have first ap- 
peared in the literature in the late 1960s [1, 2]. 
However, the phrase “infinite-dimensional 
manifold” has been used widely since about 
1960. By that early date, local theories of 
infinite-dimensional manifolds, sometimes 
called “general analysis,” such as definitions of 
differentiability, the timplicit function theorem, 
tTaylor’s theorem, the existence and unique- 
ness of tordinary differential equations, and 
the *Frobenius theorem, had already been 
established in Banach spaces [3]. Therefore, 
with these regarded as a local theory, the con- 
cept of infinite-dimensional manifolds could be 
defined naturally, and the concept of infinite- 
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dimensional Lie groups as well [4]. A few 
years later, R. Palais and S. Smale [5] and J. 
Eells and J. Sampson [6] showed that such 
concepts are useful in the calculus of varia- 
tions, and R. Abraham and J. Robbin [7] re- 
marked that *transversality theorems initiated 
by R. Thom can be easily proved by using an 
infinite-dimensional version of tSard’s theorem 
[8]. 

The so-called tAtiyah-Singer index theorem 
[9]. announced in 1963, gave impetus to the 
field as people sought the theorem's most 
natural expression; the work finally resulted in 
the theorem classifying separable *Hilbert 
manifolds by homotopy type (— 105 Differen- 
tiable Manifolds). 

After the appearance of these theorems an- 
nouncing the nonexistence of differential topol- 
ogy on separable infinite-dimensional Hilbert 
manifolds, global analysis moved toward 
more concrete problems and applications to 
various branches of mathematics. However, 
many of these applications are formulated not 
on * Banach or Hilbert manifolds, but on mani- 
folds modeled on Fréchet spaces (*Fréchet 
manifolds) or on *nuclear spaces. For such 


situations we have in general no local theories. 


Neither the implicit function theorem nor the 
Frobenius theorem holds on such manifolds. 
However, since these theorems are crucial for 
nonlinear problems, various kinds of sufficient 
conditions for the validity of these theorems 
are being studied by many people (— 286 
Nonlinear Functional Analysis). 

As for the calculus of variations, Yamabe’s 
problem is being studied extensively, for this 
seems to be a typical problem not satisfying 
the so-called *Condition C. The original paper 
of H. Yamabe [13], insisting that every com- 
pact Riemannian manifold can be conformally 
deformed into a manifold of constant scalar 
curvature, contains a serious gap, and the 
problem is still open, though many cases are 
known where the statement holds (— 364 
Riemannian Manifolds H). The harmonic 
mappings defined in [6] are also being studied 
extensively (— 195 Harmonic Mappings). 

In differential geometry or the fgeneral 
theory of relativity definitions of various cur- 
vatures, such as Riemannian, Ricci or scalar, 
or Gauss, can be sometimes regarded as non- 
linear differential equations on manifolds. 
Proof of the global existence of solutions to 
these equations has long been sought, and 
several theorems have recently appeared [14— 
16]. 

Infinite-dimensional groups such as GL(E), 
or GL(E) with uniform topology, are called 
tBanach-Lie groups, and they have been 
studied in the operator calculus. On the other 


183 Ref. 
Global Analysis 


hand, infinite groups studied by S. Lie and 
E. Cartan were in fact infinite-dimensional 
germs of transformation groups defined on a 
neighborhood of a point in a manifold. These 
were not groups in the strict sense. Recently, 
H. Omori [17] has given a definition of ab- 
stract infinite-dimensional Lie groups that in- 
cludes Banach-Lie groups and many infinite- 
dimensional transformation groups studied 
by Cartan. An application of these groups to 
fluid dynamics can be found in [18]. More- 
over, tunitary representation theories of these 
groups are now being constructed [19, 20]. 
Though global analysis consists at pre- 
sent of a rather disorganized combination of 
many nonlinear problems in analysis on mani- 
folds and in mathematical physics, it is never- 
theless one of the most active branches of 
mathematics. 
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184 (XXI.27) 
Gödel, Kurt 


Kurt Gödel (April 28, 1906-January 14, 1978) 
was born in Brno, Czechoslovakia (at that 
time Brünn, Austria-Hungary). He studied 
mathematics and physics at the University 

of Vienna, where he took the Ph.D. degree 

in 1930. After he had taught mathematics at 
the University of Vienna from 1933 to 1938, 
he was invited to the Institute for Advanced 
Study at Princeton, where he became professor 
in 1953. In 1976, he was named professor 
emeritus; he died in Princeton in 1978. 

Gödel contributed important fundamental 
results covering all aspects of mathematical 
logic. Among his famous works are the proof 
of the !completeness of the first-order predi- 
cate calculus, the incompleteness of the con- 
sistent axiomatic system containing Peano's 
arithmetic (incompleteness theorem), and the 
tconsistency of the axiom of choice and the 
generalized continuum hypothesis. He also 
introduced the notion of recursive functions 
and found the Gódel solution of Einstein's 
equations of relativity. In addition to mathe- 
matical works, he left philosophical papers on 
set theory and the foundations of mathematics. 
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185 (1.8) 
Gódel Numbers 


A. General Remarks 


K. Gódel [1] devised the following method to 
prove his incompleteness theorems (— Section 
C). 

Let S be a tformal system. In this article, we 
call its basic symbols, *terms, tformulas, and 
formal proofs the “constituents” of S. Let g be 
an tinjection from the constituents of G into 
the natural numbers satisfying the following 
two conditions: (1) Given a constituent C, we 
can compute the value g(C) in a finite number 
of steps. (2) Given a natural number n, there 
exists a finitary procedure to find out whether 
there exists a constituent C of S such that 
g(C) =n; furthermore, when such a C exists, it 
can actually be specified in a finite number of 
steps. 

If such a mapping g is given for the system 
G, then the mapping g is called a Gódel num- 
bering and the number g(C) is called the Gódel 
number of the constituent C (with respect to g). 


B. An Example of Gódel Numbers 


(1) Let go, 24, ... be the basic symbols of G. 
With each o; we associate a distinct odd num- 
ber q;: glx) =q; (i=9, 1, ...). (2) Let F be a con- 
stituent of S. If F is constructed from any 
other constituents Fo, Fi, ..., F, of S by a rule 
peculiar to S (for convenience we write this 

F =(Fo, Fi, ..., F,)), and if, for each F,, g(Fj) is 
already defined, then we put g(F)=<g(F), 
g(Fi), ..., g(F) 5», where (ao, ay, ..., apò 
denotes the number pt pt! ... pz* (p; is the . 
(i+1)st prime number). For example, suppose 
that S contains 0, =, v; (variables), and 

7] (negation) among the basic symbols, and let 
their Gödel numbers be 7, 9, 11/*!, and 13, 
respectively. Since the formula 7 (0— vj) can 
be analyzed in the form (7, (0, =, vj), its 
Gödel number is (13, 47,9, 117*! 55. For 
details — [1,4]. 


C. Gódel's Incompleteness Theorems 


By means of a Gódel numbering any meta- 
mathematical notion about the constituents of 
a formal system G can be interpreted as a 
number-theoretic notion. For example, the 
notion "formula" is interpreted as the number- 
theoretic predicate Form(x) which means 

that x is the Gódel number of a formula. The 
provability of a formula is interpreted as the 
number-theoretic predicate Prov(x), which 
means that x is the Gódel number of a prov- 
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able formula; accordingly, for any formula A 
of G, the proposition Prov(g(A)) means that A 
is provable. This interpretation is called the 
arithmetization of metamathematics. 

Let the formal system G include formal 
elementary number theory. Then many meta- 
mathematically useful number-theoretic predi- 
cates can be expressed by the respective for- 
mulas of G; for example, there exist formulas 
Form(x) and Prov(x) expressing the predicates 
Form(x) and Prov(x), respectively. Further- 
more, Gódel proved the existence of a closed 
formula U such that the formula 
"]Prov(g(U)) — U is provable. This closed 
formula U is one of the so-called formally 
undecidable propositions, and in fact it is shown 
that neither U nor TIU is provable in G if G is 
consistent in a strong sense. This result is 
called Gédel’s first incompleteness theorem. 

By use of the formulas Form(x) and Prov(x) 
the consistency of the formal system © is 
expressed by the formula Consis which is an 
abbreviation of 4x(Form(x)A 7 Prov(x)). 
Godel obtained the result that the formula 
Consis is not provable in S if G is consistent, 
on the basis of the following three facts: (1) U 
is not provable in G if G is consistent; (2) 
Consis ^ '1Prov(g(U)) is provable in S; (3) 
"1Prov(g(U)) 5 U is not provable in S. This 
result is called Gódel's second incompleteness 
theorem. 

The method of arithmetization is important 
and useful in the study of mathematical logic. 
The notion of Gódel numbers of recursive 
functions is one of its applications (— 356 
Recursive Functions). 


D. Tarski's Theorem Concerning Truth 
Definitions 


Let a consistent formal system S and a tmodel 
of S be given. By means of a Gódel number- 
ing, the truth notion of closed formula is inter- 
preted as the number-theoretic predicate 

Tr(x) which means that x is the Gódel number 
of a true closed formula; accordingly, for any 
closed formula A of S, the proposition 
Tr(g(A)) means that A is true. 

In relation to the foregoing fact, if there 
exists a formula Tr(x) of a single variable and 
the formula Tr(g(A))< A is provable for every 
closed formula A, then that formula Tr(x) is 
called a truth definition. A. Tarski proved the 
fact that there is no truth definition in G if the 
formal system G is consistent. 


References 


[1] K. Gödel, Über formal unentscheidbare 
Sátze der Principia Mathematica und ver- 








186 B 
Graph Theory 


wandter Systeme I, Monatsh. Math. Phys., 38 
(1931), 173-198; English translation, On form- 
ally undecidable propositions of Principia 
mathematica and related systems, Oliver & 
Boyd, 1962. 

[2] A. Tarski, Der Wahrheitsbegriff in den 
formalisierten Sprachen, Studia Philosophica, 
1 (1936), 261—405, translated from the Polish 
original 1933; English translation, The concept 
of truth in formalized languages, logic, seman- 
tics, metamathematics, Oxford Univ. Press, 
1956. 

[3] J. B. Rosser, Extentions of some theorems 
of Godel and Church, J. Symbolic Logic, 1 
(1936), 87-91. 

[4] S. C. Kleene, Introduction to metamath- 
ematics, North-Holland and Noordhoff, 1952. 
[5] S. C. Kleene, Mathematical logic, Wiley, 
1967. 

[6] J. R. Shoenfield, Mathematical logic, 
Addison-Wesley, 1967. 


186 (XVI.12) 
Graph Theory 


A. Overview of Graph Theory 


Two aspects of graphs are the object of graph 
theory. One is that a graph expresses a binary 
relation over a set V, and the other is the fact 
that a graph is a CW-complex of 1 dimen- 
sion, an object of study in algebraic topology. 
Because of its special natural as an object of 1 
dimension, we can consider various concrete 
properties in detail. Hence graph theory has 
close connection with other areas, such as 
network theory, system theory, automata 
theory, and the theory of computational pro- 
cesses, and it has many useful applications. 
The notion of a “graph” as currently used in 
graph theory was first discussed by L. Euler 
[1]. It is said that J. J. Sylvester coined the 
word "graph" as we presently understand it. 
However, up to now, there were few unifying 
principles, and graph theory seemed like a 
large collection of miscellaneous problems and 
ad hoc techniques. The terminology has not 
yet been standardized; different usages prevail 
in different schools. The resulting confusion 
can be seen in references books such as [2]- 
[5]. Recently, infinite graphs have also been 
studied. But here we restrict ourselves to finite 
graphs, since only they give a typical theory. 


B. Definition of Graph 


The notion of a graph G -(V, E,»0*,0 )isa 
composite notion of two finite sets V and E 
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and two maps 0 :E>V and 9. : ES V. An 
element of V ts called a vertex, and an element 
of E is called an edge. The maps 0* are called 
incidence relations. The terms point, or node 
instead of vertex, and arc, line, or branch in- 
stead of edge have also been used (sometimes 
with slightly different meanings). For an edge 
ec E, tee V is called the initial vertex and 

Ó eeV is called the terminal vertex. Both are 
called the end vertices of e. The inverse map 6+ 
of ðt: V2 is defined by tv = (ec E|0*e— 

v} and has the following properties: (i) If 

vv, then ó'v(1ó* v 2ó vNd v = Ø. (ii) 
ver O*v=| yey Ó v= E. Conversely, if the 
maps AT" V—2F have the properties (i) and (ii), 
then there exist corresponding maps ô+. For 

a vertex ve V, |Óó* v| is called the outdegree or 
positive degree, |Ó v| is called the indegree or 
negative degree, and the sum |ó* v| - |ó v| is 
called the degree. We always have the relation 
Lev lO vl=Zrev 1d v|-2|E|, and Z,,,(Ió* v| + 
|S v])=2|E£|. The number of vertices with 

odd degree is always even. Two end vertices of 
an edge are called mutually adjacent, and two 
edges with at least one common end vertex are 
also called mutually adjacent. An edge satisfy- 
ing 0^ e—0 eis called a self-loop, and a vertex 
satisfying ó* v —ó ^ v= Ø is called an isolated 
vertex. 

When permutation groups P, operating 
over V and P; operating over E are given, we 
can naturally define the permutation (zy, zp) 
over the graph G(nzy, € Py, nge Pp). A graph is 
classified into equivalence classes by Py, Pr. 
When P,, Pg are all the permutations of V, E, 
respectively, each equivalence class is called an 
unlabeled graph, whereas when Py, P; consist 
only of an identity transformation, G is called 
a labeled graph. 

For a given graph G-(V, E,0*,ó )anda 
subset E' c E, we can define the reoriented 
graph of edges in E by G' (V, E, 0'*,0'-), 
where 6’ *e=6*e if e£E' and 0'*e—0*e if 
ec E'. We have an equivalence relation by 
identifying the reoriented graphs. Each class 
by this equivalence relation is called an un- 
directed graph or an unoriented graph. In con- 
trast, the graph in the original sense is called a 
directed graph or an oriented graph. 


C. Examples of Special Graphs 


(1) A complete graph is a graph such that there 
exists one and only one edge with end vertices 
v and v' for every different pair of vertices v 
and v’. (2) A bipartite graph is a graph with a 
partition V UV~ =V such that V: NVT — gj 
and for every ec E, we have 0^ ee V^ and 
dech . If there always exists an edge e with 
S 


C'est, e-v'for every pair veV* and 
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v€V ,itis called a complete bipartite graph. 
(3) A regular graph is a graph whose degree 

at each vertex is the same. (4) A partial graph 
is defined as follows: Let E' be a subset of E 

in a graph G =(V, E,0*,0^). The graph G'— 
(V', E',0'* O ) is called a partial graph of 

G, where V'—0* E'U E. and 0'* are the 
restrictions of 0^ on E, respectively. When 
E' — E, the partial graph is the graph obtained 
by deleting all isolated vertices from G. Simi- 
larly, for a subset V’ of V, the graph G” = 

(V^, E", 0"* ,0"-), defined by E" éi V"Uó- V" 
and with 0"* being the restriction of d+ on E", 
is called a section graph of G. 


D. Representation of a Graph 


When we want to process by computer a 
problem concerning a graph, we must repre- 
sent a graph in a suitable form (— 96 Data 
Processing). A commonly used method is the 
following list representation: Let V — {v,, v3, 
.,Uy} and E={e,,e,,...,e,}. Foreachv,eV 
(a= 1, ..., M), we arrange the edges in 6*v, 
and in ó v, in suitable orders. Then (i) for 
each v, € V, record |ó* v,, |Ó v,], B}, B], B, , 
and B, , where BI and BI are the first and 
the last index of the edges in ó*v,, respectively. 
We define the corresponding values to be 0 if 
the ó*v, are empty. (ii) For each e, € E, record 
ô, ôr, EZ, E; , E, , and Ez, where d+ are 
the number of vertices 0*e,, and EZ, Ei are 
the number of edges immediately after or 
before e, among the edges with the same initial 
(for +) or terminal (for ~) vertex as that of e,. 
If e, is the last or the first among such edges, 
we define the corresponding values to be 0. 


E. Deformation of a Graph 


Let G —(V, E,0* ,0 ) bea graph. A graph ob- 
tained by opening an edge e is the graph G' — 
(V, E— 1e], 0'* ,0'^), where 0’* is the restric- 
tion of ô+ on E — {e}. The graph obtained by 
deleting all isolated vertices from G' is a partial 
graph of G over E— (ej. A graph obtained by 
shortening an edge e is the graph G” —(V", E — 
{e}, 09" *,0" ^), where V" 2(V — (0*e,0 ei 
(61, 6 being a new vertex not contained in V, 
and 6”* =@-0+, 9: V V" being defined by 

q (v) v for v#0*e, and =6 when v=" e, 

or v= e,. A graph obtained by opening 

and shortening several different edges is simi- 
larly defined, and the result is independent 

of the order of opening or shortening process. 
A graph obtained by opening edge(s) or by 
shortening edge(s) or by both processes is 
called a subgraph, a contraction, or a sub- 
contraction, respectively. 
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F. Connectedness 


Let G=(V, E,0*,0 ) bea graph, and let v, and 
Ge be two vertices. A path from v, to v, of 
length / is a sequence P —(v, — t, £1 €x,» Dan 


65 €,,, Éiere Va = Ug), Where &; are +1 or —1, 

and for every i— 1, ...,], we have ô” e, —v,, ,, 
PRAE ME AN Ge 

O e, —v, ife;- *l,andO e, = Vap 0 e, —v, , 

if ¢;= —1. When v, — vg, it is called a closed 


path considering P as a cyclic sequence. If 

in the sequence P of a path no edge appears 
more than once, it is called a simple path. 
Similarly, if no vertex appears more than once, 
it is called elementary. If all c; — +1, it is called 
a direct path. Direct closed paths, etc., are 
similarly defined. 

If we define v —v' by the existence of a path 
from v to v’, this ~ is an equivalence relation. 
Let us denote the equivalence classes by Vj, 
...,V,. The section graph G; —(V,, E;, 0; ,0; ) 
determined by V; is called a connected com- 
ponent, or simply a component, of G. The sets 
E,,..., E, are mutually disjoint and the union 
is E. If we denote by vv’ the existence of a 
direct path from v to v', the relation — isa 
*pseudo-order. The relation vv defined by 
vv' and v/v is an equivalence relation in V, 
and the equivalence classes V,,..., V or the 
section graphs G,=(V,, E,, 6*, ô; ) determined 
by E are called strongly connected components 
of G. E,,..., E, are mutually disjoint, but their 
union is not always E. Among the sets H. 

..., V, we can define an order relation V,->V; 
by the existence of ve V. ve P. v—v'. Classi- 
fication by is a refinement of that by —. 
When s= 1, the graph G is called connected, 
and when t— 1, G is called strongly connected. 

For a different pair v, v'e V, a set SC V — 

(v, v'} is called a separator of v and v if every 
path from v to v' contains at least one vertex of 
S. When every separator S for any pair v, v' 
has at least k elements, the graph G is called k- 
connected. For k > 3, there are several vari- 
ations of the definition of k-connectedness. 1- 
connectedness is equivalent to connectedness 
in the sense defined above. 

A simple path containing all edges of a 
graph is called an Euler path. Euler direct 
paths, etc., are similarly defined. A graph with 
an Euler path is called an Euler graph. A graph 
is an Euler graph if and only if (i) it is con- 
nected, and (ii) the number of vertices with 
odd degrees is 0 or 2 (Euler’s unicursal graph 
theorem [1]). Similar results are known for 
Euler closed paths or Euler direct paths. An 
elementary path containing all vertices of 
a graph is called a Hamilton path. Hamil- 
ton closed paths. etc., are similarly defined. 
The criteria for the existence of a Hamilton 
path for any given graph is unknown. This is 
known to be an *NP-complete problem (— 71 
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Complexity of Computations). Various suffi- 
cient conditions or necessary and sufficient 
conditions for special graphs have been given 
(— e.g. [6]). 


G. Tieset, Cutset, Tree, and Cotree 


For a graph G—(V, E, 0*,0 ), we define its 
incidence matrix (D? (x —1,..., M (=|V|),«= 
l, ...,n (2|E|))] by defining D% to be 1 if v,= 
O'e,z0 e,, —lifv,-0 e, Z0* e, and 0 if 
0*e,-0 e, Or e fv, UÂ v,. Similarly we 
define its adjacement matrix [17 (o, f — 1, ..., 
M)] by defining 15 to be 0 if v, and v, are not 
mutually adjacent and 1 if v, and v; are mutu- 
ally adjacent. 

A set of edges forming a closed path is called 
a tieset. A set of edges of the form (e|0* ee W, 
0 eek-WiUlelë eeW,0'eeV—Wjfora 
partition (W, V — W) of V is called a cutset. A 
maximal subset of edges containing no tiesets 
is called a tree, and a maximal subset of edges 
containing no cutsets is called a cotree. A tree 
is sometimes called a spanning tree of G. Every 
tree is the complement (with respect to E) of a 
cotree, and vice versa. 

The number of elements of a tree is always 
the same and equal to the trank m of the in- 
cidence matrix. This value m is called the rank 
of the graph G. Similarly, the number k of 
elements of a cotree is always the same, which 
is called the nullity or the cyclomatic number of 
the graph G. Always, k-n— m. 

Let K be a field, K", K™ be vector spaces 
over K of dimensions n and M, respectively, 
and K"*, K™* be the tdual spaces of K”, KM. 
The incidence matrix defines two mutually 
contragradient linear mappings 0: K" O KM 
and ó: KV* 5 K"* with respect to their canon- 
ical bases. A minimal set among the family of 
supports (in E) of nonzero vectors in the kernel 
Ker is a tieset corresponding to an elemen- 
tary closed path. Similarly, a minimal set 
among the family of supports (in E) of nonzero 
vectors of the image Im6 is a minimal cutset. 

Let T (C E) be a tree and T2 E— T bea 
cotree. Let us renumber the edges so that T — 
lei, fk, T m {emt -En For each e, 
T, there is a unique vector R*(p— 1, ..., k(—n— 
m)) in Ker whose support is in {€em+p UT 
and whose e,,,- equals +1. Similarly, for 
each e,€ T (a- 1, ..., m), there is a unique vec- 
tor Dé (a2 1, ...,m) in Im whose support is in 
{e,} U T and whose e,- component equals +1. 
(Rt, ..., Rg} is a basis of Ker ð, and (Dj,..., 
Dz} is a basis of Im ô. Both matrices R¥(k x n) 
and Dz(m x n) are totally unimodular, i.e., every 
minor determinant is 0, +1, or —1. Further- 
more, the following relations hold: R7'^— 

OF (p,q— 1, ..., k), Di =ô; (a,b — 1, ..., m, Rz 
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D5,,,—70(a-1,...,m p=1,...,k). The matrices 
Rj, and D; are called the fundamental tieset 
matrix and the fundamental cutset matrix, 
respectively, with respect to the tree-cotree 
pair (T, T). The minor of the matrix R¥ con- 
sisting of all rows and k columns k,,..., k, 

is +1 or —1 if and only if {e,,,...,e,,} isa 
cotree, and 0 otherwise. Similarly, the minor 
of the matrix D? consisting of all rows and m 
columns «,,...,K,1s +1 or —1 if and only if 
ss Gel IS a tree, and 0 otherwise. 


H. Planarity of a Graph 


Let there be a natural one-to-one correspon- 
dence between the sets of edges of two graphs 
G; — (V, Ej, 67 , 0; ) (i— 1,2), and suppose that 
under the correspondence a tree T, in G, cor- 
responds to a tree T, in G, and that the fun- 
damental cutset matrices D, and D, with re- 
spect to (T;, E; — T;) are mutually equal. Then 
G, and G, are said to be 2-isomorphic. The 
definition is equivalent to the one given by 
the equality of fundamental tieset matrices. 
2-isomorphism is an equivalence relation. If 
G, and G, are 2-isomorphic, the families of 
trees, cotrees, tiesets and cutsets are mutu- 
ally corresponding. The coincidence of one 
of the families is a sufficient condition for 
the 2-isomorphism of G, and G, as undi- 
rected graphs. A 3-connected graph has no 
2-isomorphic graph other than itself. 
Similarly, when a tree T, of G, corresponds 
to a cotree T, of G, and if the fundamental 
cutset matrix of G, with respect to (Tj, E, — Tj) 
is equal to the fundamental tieset matrix of G, 
with respect to (E; — T5, T,) as matrices, then 
we say that G, is dual to G,. In this case, G, is 
dual to G; also. If G, is dual to G, and G, is 
dual tot, then G, and G, are 2-isomorphic. 
The duality is a relation among the equiva- 
lence classes of graphs by 2-isomorphisms. 
Any graph G =(V, E,0*,0 ) can be “drawn” 
in 3-dimensional Euclidean space in the fol- 
lowing sense: Each vertex is a (distinct) point, 
and an edge e is an arc connecting two points 
é*eand 0 e with a direction pointing from 
0*eto O^ ein such a way that no two arcs 
intersect. A graph representable on a plane or 
on a 2-dimensional sphere in the above sense 
is called a planar graph. A graph G is planar if 
and only if it has a dual graph (H. Whitney 
[7]). Another necessary and sufficient con- 
dition is that as an undirected graph, neither 
the complete five-point graph K, nor the bi- 
partite complete graph of three-three points 
K, 3, appear in any subcontraction of the 
graph G. (This is a version of the criterion of 
C. Kuratowsk: [8].) 
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I. Coloring of Graphs 


A coloring of the vertices of a graph G— 

(V, E, 0,0 )isa mapping v from V to the set 
of integers N satisfying the condition v(v) 4 

V (v') for all adjacent vertices v and v’. »(G)= 
min{|w(V)||w is a coloring of the vertices of 
G} is called the chromatic number of G. If 

the graph G is drawn over a 2-dimensional 
closed surface with a 1-dimensional "Be 
number b, we have »(G)<[(7+ ,/1 4- 245)/2 |, 
where | | denotes the integral part. Except 
for the *Klein bottle (with b —2), where »(G) € 
6, this is the best possible, i.e., there exists a 
graph whose y(G) equals the upper bound on 
the right-hand side. As for b> 1, the inequality 
was first proved by P. J. Heawood (1890), 

and final results were established by J. W. T. 
Young and G. Ringel [10]. When b 20, »(G)< 
5 was shown by A. B. Kempe (1879), but the 
four color conjecture: “y(G) x 4?" has remained 
unsolved for more than a hundred years. The 
conjecture is believed to have been solved 
affirmatively recently through checking a 
huge number of cases on a large computer 
[11,12]. 

A subset Wc V is called an independent set 
or an internally stable set if no two vertices in 
W are mutually adjacent. x(G) 2 max ((W|| W 
is an independent set of G} is called the num- 
ber of independence of G. A subset W of V is 
called a dominating set or externally stable 
set of G if every vertex ve V is either ve W or 
adjacent to a vertex of W. B(G)  min(|W|| W 
being a dominating set of G} is called the num- 
ber of domination of G. For every graph G, 
we have a(G) > fi(G) and ail: a(G) z | V]. 


J. Decision Problems and Graphs 


There are many interesting topics in decision 
problems concerning graphs, especially from 
the standpoint of *complexity of computa- 
tions (— e.g., [13]). The following are some 
typical problems: (1) Problems for which algo- 
rithms of polynomial order are known: Is 
the given graph k-connected, strongly con- 
nected, a Euler graph, or a planar graph? (ii) 
*NP-complete problems: Is the given graph a 
Hamilton graph? Do we have «(G) — k, B(G)= 
k, or y(G) 2 Ki 

Let G, and G, be two graphs. The problem 
of whether they coincide as unlabeled graphs 
is called the isomorphism problem, and has 
been studied for many years in connection 
with problems concerning the structure of 
chemical compounds. Unfortunately, no al- 
gorithm of polynomial order is known; nor 
do we know whether this is an NP-complete 
problem. As for the isomorphism problem for 
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planar graphs, algorithms of polynomial order 
are known. 


K. Perfectness Theorem 


Let the number of independence and the chro- 
matic number of a graph G=(V, E,0*,0^) be 
a(G) and »(G), respectively. Let V,, ..., V. be 
a disjoint decomposition of V. We denote by 
0(G) the minimal number of r such that every 
section graph of G over V, contains a com- 
plete graph. Furthermore, we denote by œ(G) 
the maximum value of |W| for a subset Wc 
V such that the section graph of G over W 
contains a complete graph. We always have 
a(G) « 0(G) and »(G) € o(G). G is called a- 
perfect if every section graph H of G satisfies 
a(H)=0(H). Similarly, G is called y-perfect 

if y(H)= (A) for every section graph H of 

G. The conjecture that y-perfectness and a- 
perfectness are equivalent has recently been 
solved affirmatively [14]. This is called the 
perfectness theorem. 
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It is generally believed that theoretical mathe- 
matics originated with the Greeks. The Greeks 
learned the arts of land surveying and com- 
mercial arithmetic from earlier civilizations; 
but they developed theoretical mathematics 
themselves, toward the middle of the 4th cen- 
tury B.C. The creation of a mathematics that 
transcends practical purposes was one of 

the most remarkable events in the history of 
human culture, and one that had an immense 
impact on the development of all branches of 
science. We owe the reestablishment of impor- 
tant Greek mathematical texts and the re- 
construction of the development of Greek 
mathematics to the historians of the 19th 
century (the oldest extensive exposition of the 
history of mathematics before Euclid is due to 
Proklos (Proclus) (410—485)). 

The earliest known Greek mathematicians 
are Thales of Miletus (c. 639—546 s.c.) and 
Pythagoras of Samos (fl. 510? B.c.). Both were 
Ionians, but the latter went to what is now 
southern Italy and founded a semireligious 
school whose members called themselves 
Pythagoreans. Their motto was "Everything 
is number”; their studies were called mathema 
(“what is learned”) and consisted of music, 
astronomy, geometry, and arithmetic (the 
subject group called the quadrivium, which 
formed the core of medieval and later higher 
education) "for the purification of soul." Their 
research delved into the theories of proportion 
(in relation to music) and tpolygonal numbers 
(triangular numbers, square numbers, etc.), 
and more generally into the theory of numbers 
and geometric algebra. Tt is said that they 
knew of the irrationality of Ra though no 
evidence of this has been found. Even after the 
demise of the Pythagorean school, its followers 
continued to promote mathematics in col- 
laboration with the Academy of Plato. 

Another significant school was the Eleatic. 
Among its members Zeno (c. 490—c. 430 B.C.) is 
especially important. *Zeno's paradoxes are 
arguments leading to absurdity. Some see 
within them the origin of logical reasoning 
and, consequently, of theoretical mathematics 
[3]. It is chronologically difficult to attribute 


187 
Greek Mathematics 


to Zeno the consideration of the continuum 
and irrational numbers, but we can find in him 
the impetus toward atomistic reasoning. The 
computation of the volume of pyramids (by 
dividing them into “atomistic” laminae) by 
Democritus (fl. 430? B.c.) and the atomistic 
calculation of the area of circles by Antiphon 
(fl. 430 p.c.) came shortly after the time of 
Zeno. 

The middle decades of the 4th century B.c. 
are known as the Age of Pericles, the Golden 
Age of Athens. The !trisection of an angle, the 
tduplication of a cube, and the *quadrature of 
a circle, known at that time as the "three big 
problems" (— 179 Geometric Construction), 
were studied by the Sophists. Hippias of Elis 
(fl. 420 B.c.), Hippocrates of Chios (fl. 430 a.c. 
in Athens), Archytas of Taras (c. 430—365 B.c.), 
Menaechmus (fl. 350 &.c.), and his brother 
Dinostratus (fl. 350 p.c.) solved these problems 
using conic sections and the quadratrix (a 
transcendental curve whose equation is y — 
xcot(zx/2)). 

By 400 s.c. Athens had lost its political 
influence, but it remained the center of Greek 
culture. It was during this time that Plato's 
Academy flourished, and Plato (427—347 s.c.) 
and his followers laid particular importance 
on mathematics. Archytas, Menaechmus, and 
Dinostratus belonged to or were closely as- 
sociated with this school. During the first fifty 
years of the Academy, research in the follow- 
ing fields was pursued: methodology of mathe- 
matics or science in general (De. dialectics, 
analysis, synthesis); geometric reconstruction 
of Mesopotamian algebra; the theory of irra- 
tionals in relation to the geometrization of 
algebra (Theodorus of Cyrene (Sth century 
BCL who was Plato’s teacher, as well as Theai- 
tetus of Athens (4157-369 B.C.) contributed to 
this study, and the general theory of propor- 
tion by Eudoxus of Cnidos (c. 408—c. 355 p.c.) 
also belongs to this field); the method of ex- 
haustion (by Eudoxus); and studies of the 
"three big problems" and conic sections. It was 
this school in which the term mathema came 
to be used in its present sense of *mathemat- 
ics" rather than in the sense of disciplines in 
general. 

The conquests of Alexander the Great ac- 
celerated the already considerable cultural 
influence of Athens. Later, during the Ptole- 
maic period, the center of culture moved to 
Alexandria. The Mouseion at Alexandra, the 
combined library and university, is said to 
have possessed hundreds of thousands of 
volumes. 

At Alexandria, Euclid (c. 300 s.c.) com- 
piled his Elements, which became a model 
for scientific works for centuries to come— 
Newton's Principia as well as Spinoza's Ethics 
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are modeled on it. Most historians say that 
Euclid's method derives from Aristotle (384— 
322 B.c.), who, after studying at Plato's Acad- 
emy, founded a new school, the Peripatetics, 
whose doctrines are at many points opposed 
to those of the Academy. Others, however, see 
the origin of Euclid's axiomatic method in 

the Eleatics [5]; we can find prototypes of 
some parts of the Elements in both Oenopides, 
who lived during the time of Zeno, and in 
Hippocrates. 

The third century B.c. was the Golden Age 
of Greek mathematics. Archimedes of Syracuse 
(c. 282-212 B.C.) was the greatest mathema- 
tician, mechanic, and technician of antiquity. 
He did important work in mathematics, study- 
ing the exact quadrature of the parabola. 
According to his ephodos (method), he would 
obtain a result by mechanical experiments and 
then prove it by the method of "exhaustion." 
He also computed the value of z; studied 
spirals and other curves, spheres, and circular 
cylinders; contributed to the development of 
statics and optics and their application; and 
had a profound influence on later mathema- 
ticians. During the same period, Apollonius of 
Perga (fl. 210 B.c.) wrote Konikon biblia (Books 
on Conics) in eight books, of which the last 
has been lost. The geometric theory of *conic 
sections contained in this work is not much 
different from the one we know today; it had a 
great influence on 17th-century scientists es- 
pecially. Other mathematicians of this period 
worth noting are Eratosthenes (c. 275-195 
BCL who conceived the tsieve method of find- 
ing prime numbers and who measured the 
earth, and Hipparchus (fl. 150 B.c.), called the 
father of astronomy, who made a table of 
sines. 

Hellenistic influence began to decline in the 
first century B.c., and the influence of Alex- 
andria decreased. The Mouseion burned in 
48 B.C., but was rebuilt. Among the mathe- 
maticians of this time, we may count Heron 
(fl. 60? 4.p.; Menelaus (fl. 100 A.p.), who wrote 
Sphaerica; Theon of Smyrna (fl. 125 A.p.); 
Ptolemy (fl. 150 A.p.), the author of Almagest; 
Nicomachus (50?— 150? A.p.), the author of 
Arithmetike eisagoge; Diophantus (fl. 250? 
A.D.), whose career is not fully known but who 
wrote Arithmetika, of which six of the original 
thirteen books remained to influence tFermat; 
and Pappus (fl. 300 a.p.), the last creative 
mathematician in Greece, who left eight books 
of the Synagoge, which influenced *Descartes 
and which still exist today. 

The period following the fall of the Western 
Roman Empire was a difficult one for Greco- 
Egyptian science. The Mouseion was de- 
stroyed for the second time in 392 a.D. Theon 
of Alexandria (fl. 380) and his daughter, Hy- 
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patia (c. 370—415), were at that time working 
on commentaries on the classics. Among the 
few remaining works of the period is Proclus' 
(410—485) commentaries on the first book of 
Euclid's Elements. The Athenian Academy was 
closed in 529 by order of the Emperor Justi- 
nian; the last director was Simplicius, who 
commented on Aristotle. Soon afterward, 
Alexandria fell into the hands of the Moors, 
and many scholars fled as refugees to Constan- 
tinople, the capital of the Eastern Empire. 
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A. General Remarks 


Green's functions are usually considered in 
connection with *boundary value problems 
for ordinary differential equations and also 
with telliptic and tparabolic partial differen- 
tial equations. For example, consider bound- 
ary value problems for the tLaplacian in 3- 
dimensional space: L[u] —(0?/0x1 + 67/0x3 + 
0?/0x3)u. Let D be a bounded domain with a 
smooth boundary S and the boundary con- 
dition B on S be either u(x) 2 0 (x eS) (the first 
kind) or Qu/On + flu z 0 (xe S) (the third kind), 
where n is the outer normal of unit length, 
f(x) 2 0, and f(x) Z0. We say that the function 
g(X4, X5, X3; £4, 65, 54) is the Green's function of 
L (or the partial differential equation L[u]=0) 
relative to the boundary condition B, when (1) 


` g(x, č) satisfies L,[g(x, £)] =0 except for x = & 


(ii) g(x, €)= —1/Anr + w(x, ë), where r 2 (X2. (x; 
— &?)'? and w(x, ë) is a regular function, i.e., 
of class C" for a suitable value v; (1) g(x, €) 
satisfies the boundary condition B, i.e., g(x, €) 
=0, xes (the first kind), or (0/0n + B) g(x, ë) 
=0, xeS (the third kind). Conditions (i) and 
(ii) mean that g(x, č) is a 'fundamental solution 
of L, i.e., L.[g(x, €)] 2ó(x — &), where ó(x — č) 
is *Dirac's measure at the point x 2 Note 
that if g(x, č) is a fundamental solution, then 
by adding any solution u of the equation L[u] 
=0 to g, we obtain another fundamental 
solution g +u. Thus Green's function is the 
fundamental solution that satisfies the given 
boundary condition. To be more precise, in 
the boundary value problem, if the boundary 
condition is of the first kind, g(x, €) can be 
obtained by adding to a fundamental solution 
—1/4nxr the solution œ(x, č) of the following 
Dirichlet problem: A w(x, $) 20, w(x, à) 
1/4ar (xeS). We remark that there are slightly 
different definitions for Green's function. For 
example, there are cases where g(x, č) is de- 
fined by g(x, €)= 1/4ar+ o(x, č) or by g(x, lz 
1/r+ (x, č). 

In the case in the previous paragraph, 
Green’s functions satisfying the boundary 
conditions are uniquely determined. In gen- 
eral, if we are given a Green’s function, then 
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for any regular function v(x) the function 
=f g(x, Š)\o(č)dč 
D 


represents the solution of L[u] =v with the 
boundary condition B. More precisely, if v(x) 
satisfies the "Holder condition |v(x)—v(x’)|< 
L|x —x'|* (0 £a x 1) (L, x positive constants), 
then u(x) is of class C?. Conversely, if u(x) 
satisfies the equation L[u] =v and the bound- 
ary condition B, it is represented by the for- 
mula for u(x). This means that if we denote 
the operator that associates u to v by G, then 
G is the inverse operator of the Laplacian L 
with the boundary condition B, and Green's 
function is the tintegral kernel of the operator 
G. Using this property, the boundary value 
problem relative to L can be reduced to a 
problem of tintegral equations. For example, 
the differential equation with the boundary 
condition B containing the complex parameter 
A, L[u] - Au 7 f, is equivalent to the integral 
equation u + AG[u] - G[ f ], which is obtained 
by letting G act from the left on the above dif- 
ferential equation. In this way, the problem 
can be simplified. 

In the case of general boundary value 
problems for higher-order elliptic operators, 
Green's functions are defined in the same way 
as before (— 189 Green's Operator). The im- 
portant case is when L and the boundary 
condition B define a !self-adjoint operator. 

In this case, Green's function is symmetric 
(g(x, €) 2 g(£, x)). To obtain Green's function 
is not easy in general. However, in some cases 
such functions can be obtained fairly easily 
(— Appendix A, Table 15.VI). 


B. Self-Adjoint Ordinary Differential 
Equations of the Second Order 


Consider the operator L[u] =(p(x)u'Y + q(x)u 
(p(x)» 0) defined in the interval a € x <b, with 
boundary conditions of the form ou + flu- 0 
at the two endpoints. Then Green's function 
g(x, č) is defined in the following way: (i) For 
xz &, L[g(x, $)] 0; (ii) [ég(x, Q/0x-76 = 
1/p(€); (iii) for £ fixed, g(x, č) satisfies the homo- 
geneous boundary conditions at x =a and 
x=b. 

Conditions (i) and (ti) mean that L[g(x, £)] 
—ó(x — č). We can construct g(x, č) in the 
following way: Let u,(u,) be the solution of 
L{u]=0 satisfying the boundary condition at 
x-—a(at x—b). If u; and u, are linearly inde- 
pendent, we can satisfy Di, u, —u,u5)—1 by 
choosing the constants suitably. Then Green's 
function g(x, č) is defined by g(x, £) = u,(x)u4(€) 
for x <é, and g(x, £) 2 u,(£)u4 (x) for č <x. If u; 
and u, are linearly dependent, there exists no 


700 


Green's function that satisfies conditions (i), 
(11), and (iii). However, by modifying the defi- 
nition, we can get a generalized Green's func- 
tion playing a similar role [2]. This method 
can be applied to the case of ordinary dif- 
ferential equations of higher order. 


C. The Laplace Operator 


When the domain D is the n-dimensional 
sphere of radius a with center at the origin, 
Green's function of the Laplacian relative to 
the boundary condition u=0 is obtained in 
the following way. Let E(r) be the following 
fundamental solution of the Laplacian: E(r)— 
(21) !logr for n=2 and E(r)= —((n—2): 
o,r” ?) ! fornz3, where o, — 2n"? D(n/2) is 
the (n — I)-dimensional surface area of the n- 
dimensional unit sphere. Then Green's func- 
tion g(x, č} is defined by 


g(x, ) = E(r) — E(pr'/a), 


where p — (E Gr r= GZ. (x;— cl ue 
SEO 


D. Helmholtz's Differential Equation 


Let D be an exterior domain with a smooth 
boundary S in R?. In mathematical physics, 
the boundary value problem of finding a solu- 
tion u(x) of Helmholtz's differential equation 
(A+ K?)u(x) = f(x) (k » 0) satisfying u(x) 20 
(x€S) is of particular interest. In this case, 
concerning the behavior of u(x) at infinity, 

we usually assume Sommerfeld's radiation 
condition: 


When [x| +0, 


‘ae 
(Suma) 


where ĝ/ðr is the derivative along the radial 
direction. It is known that this condition en- 
sures the uniqueness of the solution (Rellich's 
uniqueness theorem). We can construct Green's 
function G(x, £) for any k(» 0) so that for 
smooth f(x) with bounded tsupport, 


u(x) = O(Ix| 7), 


-o(|x| *), 


|x| 7r 





u= [ G(x, €) f(¢)d¢ 


represents the solution satisfying u(x)=0 (xe 8S) 
and Sommerfeld's radiation condition. Then, 
with 

ik|x - 8l 


G(x, č)= ~ 4nix—e| 


+ K(x, č), 


where 


3 1/2 
»-a-(3 eir) > 


i= 


A 
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K.,(x, é) can be obtained by solving an integral 


equation of tFredholm type [3,4]. In this case, 


there exists no Green's operator in L, space, 
and G(x, č) can be considered to be a general- 
ized Green's function [4]. 


E. Stokes's Differential Equation 


Let D be a bounded domain in R? with 
smooth boundary S, and consider Stokes's 
differential equation in D 


Op 3 EI 
Au,=———pX;,, i=1,2,3, Y —=0, 
pm Óx; ES ' à Ox; 


where p, p are positive constants. In hydro- 
dynamics, we consider the boundary value 


problem of finding solutions (u; (x), u; (x), u5(x), 


p(x)) of Stokes's equation satisfying the bound- 
ary condition u;(x) —0, xeS (i= 1,2,3). In this 
case, Green's tensors Gi, č), g;(x, č) can be 
constructed, and for smooth functions X;(x) 
(i= 1,2, 3) the unique solution of this bound- 
ary value problem is represented by 


u(x)= o> | Gij(x, C)X (6) dé, 


3 
nol | g(x, SX (Ede 


j=l 
[5]. 
F. Parabolic Equations 


Consider the boundary value problem (the 
initial boundary value problem): 


^ 32 
Li] e e 5 f(x, t>0, a<x<h, 
u(x, 0) — (x), 


where at x =a and x — b, u(x, t) satisfies 
some homogeneous boundary conditions. 
In this case, we can construct the function 
g(x, t; č, x) (t2 x) satisfying the following con- 
ditions: (i) L[g] =0 except for x=€. t=1; 
(ii) 

exp( —(x— &/Ac?(t —1)) 


Je, / n(t — 1) 


cr (regular function) 





g(x, t; č, 1)= 


in a neighborhood of x= £, t— x; and (iii) 
g(x, t; č, t) satisfies the given homogeneous 
boundary conditions at x =a and x — b. Then 


t fb 
u(x, t)= | | gx, t; 6, x) f (5, x) dé dx 
Ova 


d g(x, t 6,0) (2) d 


a 


represents the solution of the problem stated 
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in this section for regular functions f, o. The 
function g(x, t; £, x) is called Green's function 
relative to the boundary value problem. De- 
tailed consideration of such elementary cases is 
found in [6]. 


G. Kernel Functions 


The kernel function is closely related to 
Green's function of A (the Laplacian) rela- 
tive to the first boundary value problem in a 
domain in R?. 

First we explain the general definitions of 
the kernel function. Let E be a general set, and 
let & be a tHilbert space of complex-valued 
functions defined on E with a suitable inner 
product ( f, g). Suppose that we are given a 
function K(x, y) defined on E x E satisfying 
the following conditions: (1) For any fixed y, 
K(x, y) regarded as a function of x belongs to 
& and (ii) for any f(x)e Y, (f(x), K(x, y).— 
f(y). Then K(x, y) is called a kernel function 
or reproducing kernel. The kernel function, if it 
exists, is unique and is tpositive definite Her- 
mitian; that is, 


n 


» Kit, y96e, 20. (1) 


Conversely, any positive definite function is 

a reproducing kernel of some Hilbert space. 
A necessary and sufficient condition for the 
existence of the kernel function is that for 

any yc E, the linear functional f f(y) be 
bounded. In this case, the minimum of || f || 
under the condition f(y)— 1 (fe) is attained 
by the element K(x, y)/K( y, y), and its value is 
K(y, y) !?. When § is a tseparable space, then 
by an *orthonormal system {@,(x)}, we can 
represent K(x, y) as 


K(x,y)- ` ETAO (2) 


As an example of kernel functions, the fol- 
lowing case is of particular importance. Let E 
be an n-dimensional tcomplex manifold, o and 
W be holomorphic fdifferential forms of degree 
n on E, and let the following inner product be 
given: 


woui=| paw. 


Now 8 is given by §={@|¢9, 9) € +00}, and 
the kernel function is called the kernel dif- 
ferential. When E is a domain in C", regarding 
the coefficients of the differential form as func- 
tions, we call the kernel function Bergman's 
kernel function. Moreover, if E can be mapped 
onto a bounded domain by a one-to-one 
holomorphic mapping, then 


ds? =) (0? log K (z, 2)/02,02,) dz, dz, 
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is positive definite and gives a Káhler metric 
which is called the Bergman metric. 


H. Kernel Functions for Domains in the 
Complex Plane 


Let E be a domain D in the complex plane 
(z—x-iy). Let K(z,¢) be Bergman's kernel 
function of D, and let G(z, C) be Green's func- 
tion of A relative to the first boundary con- 
dition with a pole at ¢. Then we have 


K(z,D- —-Q/n)e*G(z, 0)/d20€. (3) 


Next, let U(z, ¢) be the kernel function of 
the Hilbert space consisting of the holomor- 
phic differential forms whose integrals are 
single-valued, and let N (z, Cl be Neumann’s 
function of A, i.e., the function that is har- 
monic in D — (Cj, has the same singularity as 
G at C, and whose derivative in the normal 
direction ON /On is constant along the bound- 
ary. Then we have 


U(z, t) - Q/n)0*N (z, Dy0z8t. (4) 


Now the kernel H(z, €)=(N(z, ) — G(z, ))/2n 
is the kernel function relative to the Hilbert 
space consisting of all real harmonic functions 
whose integral mean value along the boundary 
I is 0 and having the inner product 


dp Oy x K 


The kernel H(z, £) is called a harmonic kernel 
function. 

Suppose that the boundary T is tpiecewise 
smooth, and consider the space of all holo- 
morphic functions in D that are continuous on 
the boundary of D. The inner product of such 
functions ¢ and y is given by (o, V) froy 
ds (ds is the element of the arc length of I). 
Hence we have a Hilbert space. Then the 
kernel function relative to this Hilbert space is 
called Szegó's kernel function, which has a 
close relation with *bounded functions. 

The kernel functions enable us to represent 
holomorphic mappings that map the domain 
D onto various canonical domains (— 77 
Conformal Mappings). 
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A. General Remarks 


Consider the first and the third boundary 
value problems for the elliptic equation 
Alu) = —Au+ Y a) SE = f(x) 

(^ 323 Partial Differential Equations of Ellip- 
tic Type). Let D be a bounded domain of R" 
whose boundary S consists of a finite number 
of smooth hypersurfaces. By the tmethod of 
orthogonal projection, we take the domain 
GA) as follows: (i) (u(x)| u(x)e H?(D) and 
u(x)=0 for xeS] or (ii) (u(x)| u(x)e H?(D) and 
Qu/On + B(x)u=0 for xeS] according as we are 
considering the first or third boundary value 
problem, where H?(D) is the Sobolev space 
(— 168 Function Spaces). If the operator A is 
a one-to-one mapping from &(A) onto the 
function space L,(D), we call the inverse 
operator A ! Green's operator relative to the 
boundary condition, and we denote it by G. In 
general, the existence of A~' is not guaranteed. 
However, if we take real t large enough, G, = 
(A ^ LI) ! exists. 

Consider the general case where 4 is a com- 
plex parameter: (AJ — A) [u] = f(x), f(x)e L;(D). 
Letting G, act from the left, we have (1 — (4 - 

dG) [u] 2 —G,f. Ad if ue L,(D) is 

a solution, clearly u(x)e YA), and u(x) satis- 
fies the first partial differential equation and 
the boundary condition. Since G, is a tcom- 
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pact operator in L,(D), the *Riesz-Schauder 
theorem can be applied (— 68 Compact and 
Nuclear Operators). In particular, if 2+¢ is not 
an ‘eigenvalue of G, u(x) -(A41 — A) 1f = —(I — 
(4 t)G)) ! G,f represents a unique solution. 

In the equations in the first paragraph of 
this section, if a;,(x) 2 0 and c(x) and f(x) are 
real, then G, is a tself-adjoint operator in 
L,(D), and therefore the tHilbert-Schmidt 
expansion theorem can be applied. Namely, let 
{A;} be the eigenvalues of A such that Aw;(x)= 
Zock, where {œw;(x)} is an torthonormal 
system in L;(D). Then for any f(x)e L;(D), f(x) 
=> ~,(f@,a;(x), where the right-hand side 
is taken in the sense of tmean convergence. 
Furthermore, for f(x)e 2(A), we have the 
expansion (Af )(x) 2 X, Al f. j)e,(x) in the 
same sense. 

When G, is not self-adjoint, let G* be the 
tadjoint operator of G, in L,(D). Then G* 
represents Green's operator relative to the 
equation 


n ke? 


(A* +1 [0] = —Av— sach 


t (c(x) - t)v 
—g(x), 


corresponding to the boundary conditions (i) 
v(x) 20, xeS (first boundary condition) and (ii) 
(0/0n)v + B'(x)v 20, xeS, where 





B(x) — BG) + ¥ ax) cos nx,, 
i-1 


with n the outer normal (third boundary con- 
dition) [2]. 


B. Elliptic Equations of Higher Order 


Green's operator can be defined for elliptic 
equations of higher order. Consider the 
equation 


A(x, 0/0x)u(x) = f(x), 
B(x, 6/0x)u(x) = 0, 


xeD; 
xeS, 
j=1,2,...,b(=m/2), (1) 


where A is an ‘elliptic operator of order m and 
the boundary operators (Bj) satisfy: (i) At 
every point x of S, the normal direction is not 
'characteristic for any B,; and (ii) the order m; 
of B; is less than m, and m; # m, (j #k). The 
domain Z(A) of A is defined by 


GA) = (u(x)|ue H"(D) and 
Bj(x, 6/éx)u(x)=0 for xeS, 
j=1,2,...,b}. 


When A is a one-to-one mapping from Z(A) 
onto L (D), the inverse G = A! is called 
Green’s operator. 
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This general boundary value problem, es- 
pecially the existence theorem, was treated 
under some algebraic conditions on A and 
(Bjj by M. Schechter [5] who showed that 
G[u]e H"(D) if u(x)e L;(D) and that G[u] 
depends continuously on u. In particular, if 
m» n/2, then by fSobolev’s theorem, G is a 
continuous mapping from L,(D) into C°(D), 
and G is represented by an tintegral operator 
of Hilbert-Schmidt type (L. Gárding [6]). 
Namely, for any f(x)e L,(D), 


no [ G(x, &)f(E)dé, 
D 


| IG(x, EI dx dé < +00. 


In general, the function G(x, £), obtained by 
the kernel representation of Green’s operator 
G, is called Green’s function. 

On the other hand, consider, for exam- 
ple, the *Dirichlet problem of A in R?. Then 
Green's function is defined in the following 
way (— 188 Green's Functions): G(x, €)= 
—(An|x — EI ! + u(x, č), where u(x, ë) satisfies 
(i) A,u(x, £) 20, and (ii) G(x, EN. 0. The 
function defined in this manner coincides with 
Green's function defined as a kernel represen- 
tation of Green's operator [1,2]. 

Suppose that in problem (1) the telliptic 
operator A(x, 0/0x) is independent of x, and let 
E(x) be a tfundamental solution, i.e., E(x) 
is a distribution solution of A(0/Ox) E(x) 2 ó(x) 
(ó(x) is *Dirac's ó-function; — 112 Differential 
Operators). Then E(x) is a C?-function away 
from the origin. Moreover, in a neighborhood 
of the origin the following estimates hold: For 
[x| « m, 


Lin 


where c is some positive constant. When prob- 
lem (1) has Green's operator G, we can say 

the following, using the fundamental solution: 
Green's function G(x, £) exists and can be 
written as G(x, €)= E(x — €) + u(x, é), where for 
any fixed £e D, u(x, č) satisfies (i) A(0/Ox)u(x, é) 
=0 and (ii) B(x, 6/0x)G(x, €) -0, ren, j= 

l2: b. 


ebe m —n-— |a| «0, 


<< clog|x| !, m—n—|a|=0, 





C, m-—n-— |x| 0, 


C. Hypoelliptic Operators 


Let 
AG dec Ai 
x, 6/ Se D ax] 


C 


Q NX el 
Ox]  OxBh...0xt 





189 Ref. 
Green's Operator 


be a general partial differential operator with 
C*-coefficients. If the kernel E(x, £) satisfies 
A(x, 0/Ox) E(x, €)=6(x — €), that is, if we have 
the relation (E(x, č), 'A(x, 6/6x)p(x)>, = e(£) 
for any o(x)e Z, then E(x, £) is called a funda- 
mental solution of A, where 'A is the transposed 
operator of A: 


ga Na 
'A(x,0/0x)v(x)- Y, (- P (&) (a,(x)v(x)). 
lal m ox 

Now if there exists a fundamental solution 
E'(x, €) of ‘A(x, 0/Ox) such that (i) E'(x, č) de- 
fines a kernel that gives rise to two continuous 
mappings, one of which maps the space 2; 
into &,, and the other of which maps the space 
2, into é, (— 168 Function Spaces), and (ii) 
for x# č, E'(x, č) a C*-function of (x, č), then 
any distribution u(x) satisfying A(x, 0/Ox)u(x) 
= g(x) is a C?-function, where g(x) is of class 
C”. In general, an operator A with the prop- 
erty that any solution u(x) of A(x, 6/éx)u(x) = 
g(x) is of class C* whenever g(x) is of class 
C", is called hypoelliptic. Elliptic and para- 
bolic operators are both hypoelliptic. L. Hór- 
mander characterized the hypoelliptic dif- 
ferential operators with constant coefficients 
[8] (— 112 Differential Operators). 

A kernel E(x, č) such that 


A(x, 6/0x) E(x, &) = d(x — E) + w(x, ¢), 


with w(x, é) a C*-function, is called a para- 
metrix of A. To prove the hypoellipticity of A, 
it suffices to show the existence of a parametrix 
E'(x, č) of the operator 'A having the prop- 
erties (i) and (ii) mentioned in the previous 
paragraph. 

To explain the notion of the fundamental 
solution for the tevolution equations, suppose 
that we are given an evolution equation 


am 


L[u] 7. uix, t) 


de DEIER 
a, dx, Ken Bp UY, FD HY, 
j<m RA " Óx ot! 

xER", ty «t& T. A kernel E(x, t; č, to) (to < 
t< T) is called a fundamental solution to the 
evolution equation if 


Ly (E(x, t; Gs t9)) « 0, L2 fg, 
and 
ato Ot! Geh ó(x—&) | jam-l. 
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190 (IV.1) 
Groups 


A. Definition 


Let G be a nonempty set. Suppose that for any 
elements a, b of G there exists a uniquely deter- 
mined element c of G, which is called the prod- 


‘uct of a and b, written c— ab. We call Ga 


group or multiplicative group if (1) the associa- 
tive law a(bc) —(ab)c holds, and (ii) for any 
elements a, bet: there exist uniquely deter- 
mined elements x, ye G satisfying ax =b, 
ya — b. Then the mapping (a, b) ab is called 
multiplication in G. Condition (ii) is equivalent 
to the following two conditions: (iii) There 
exists an element e (called the identity element 
or unit element of G) such that ae — ea— a for 
any element a of G; and (iv) for any element a 
of G there exists an element x such that ax — 
xa=e. 

The element x in condition (iv) is called the 
inverse (or inverse element) of a, denoted by 
a !. The uniqueness of the identity element e 
and the inverse a ^! follows readily from the 
axioms. The identity element of a multiplica- 
tive group is sometimes denoted by 1. If ab 
— ba, then we say that a and b commute. The 
commutative law, ab — ba for any elements a, 
be G, is not assumed in general. A group satis- 
fying the commutative law is called an Abelian 
group (or commutative group) in honor of N. 
H. Abel, who made use of commutative groups 
in his study of the theory of equations. The 
product in a commutative group is often writ- 
ten in the form a+ b, and in this case the 
mapping (a, b)>a+ b is called addition. The 
element a+b is called the sum of a and b, and 
G is called an additive group. In an additive 
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group the identity element is usually denoted 
by 0 and called the zero element, and the in- 
verse of a is denoted by —a (— 2 Abelian 
Groups; 277 Modules). To describe the *law of 
composition, we sometimes use notation differ- 
ent from multiplication or addition (— 409 
Structures). 


B. Examples 


A "linear space over a ffield K is an additive 
group with respect to the usual addition of 
vectors (— 256 Linear Spaces). A field is an 
additive group with respect to the addition, 
and the set of nonzero elements of a field 
forms a group with respect to the multiplica- 
tion, which is called the multiplicative group 
of the field (— 149 Fields). 

All invertible n x n matrices over a ring R 
form a group with respect to the usual multi- 
plication of matrices. This group is called the 
*general linear group of degree n over R (— 60 
Classical Groups). 

All one-to-one mappings from a set M onto 
itself (1.e., all permutations on M) form a group 
with respect to the composition defined by 
(fog) (x)= f(g(x)) (xe M). (Sometimes the 
product f og is denoted by gf and f(x) by xf. 
Then x(gf)=(xg)f) The group of all permuta- 
tions on M is called the symmetric group on 
M. A group G is called a permutation group 
(on M) if every element of G is a permutation 
on M. For instance, the general linear group of 
degree n over a field K may be regarded as a 
permutation group on the set of n-dimensional 
vectors, and it is also regarded as a permuta- 
tion group on a ftensor space. 

All ‘motions in a Euclidean space form a 
group with respect to the usual composition of 
motions. All invertible n x n matrices over K 
leaving a given *quadratic form invariant form 
a group with respect to the usual multiplica- 
tion of matrices. This group is called the 
torthogonal group belonging to the given 
quadratic form. If K is the fcomplex number 
field or the real number field, then these 
groups are Lie groups (— 13 Algebraic 
Groups; 151 Finite Groups; 161 Free Groups; 
249 Lie Groups; 423 Topological Groups). 


C. Fundamental Concepts 


If a group G consists of a finite number of 
elements, then G is called a finite group; other- 
wise, G 1s called an infinite group. The number 
of elements of G is called the order of G. A 
nonempty subset H of G.is called a subgroup of 
G if H is a group with respect to the multipli- 
cation of the group G. Hence a nonempty sub- 
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set H is a subgroup of G if and only ifa bei 
for any a, be H. For a family {H,} of sub- 
groups of G, the intersection ( , H, is also a 
subgroup. 

The associative law of multiplication says 
that elements a,, a5, a, of G determine the 
product a,a,a3, which is the common value of 
(a,a5)a4 and a,(a;5a,). This law can be gen- 
eralized to say that any ordered set of n ele- 
ments d,,4;, ..., d, (n 2) of G determines their 
product a,a,...a, (general associative law). 
When a, =a, =- =a, =a, we denote the 
product aa...a by a". If we define a” for n>0 
by a? =e and a^" ^ (a")'!, we then have a"a" 
=a"*™ (a"y"=a"" for any n, me Z. If there 
exists a positive integer n such that a" =e, then 
the smallest positive integer d with a^— e is 
called the order of the element a. If there is no 
such n, then a is called an element of infinite 
order. If a is of infinite order, then its powers 
a? (=e), a*!, a*?,..., are all unequal. If a is of 
order d, then the different powers of a are a? 
(=e), a, a°, ..., a^ !. All the powers of a form 
a subgroup <a> of G, called a cyclic subgroup. 
The order of an element a is the same as the 
order of the subgroup <a>. The group <a> 
itself is called a cyclic group and is an example 
of an Abelian group (— 2 Abelian Groups). 

Let S be a subset of a group G. Then the 
intersection of all subgroups of G containing S 
is called the subgroup generated by S and is 
denoted by <S>. It is the smallest subgroup 
containing S, and if S is nonempty, <S con- 
sists of all the elements of the form 
ata5? ... az" (a;€ $S,m;e Z). If ($5 —G, the 
elements of S are called generators of G. When 
G has a finite set of generators, G is said to be 
finitely generated. When S = (aj, then <S> 
coincides with <a>, and the element a is the 
generator of the cyclic group <a>. Suppose 
that elements a,,...,a, of G satisfy an equa- 
tion of the form aj:a5? ... a7" — 1. This equa- 
tion is then called a relation among the ele- 
ments a,, ...,a,. If we have a system of gen- 
erators and all relations among the generators, 
then they define a group (— 161 Free Groups). 
It is, however, still an open problem to find a 
general procedure to decide whether the group 
determined by a given system of generators 
and the relations among them contains ele- 


„ments other than the identity (— 161 Free 


Groups B). 

For a subset S and an element x of a group 
G, the set of all elements x~! sx (seS) is de- 
noted by x ! $x or S*, and S and S* are called 
conjugate. We have (ab )* — a*b*, (a !y 
—(a*) !. If H is a subgroup, then H* is also a 
subgroup. For a subset S, the set of all ele- 
ments x satisfying $* — 5 forms a subgroup 
N(S), called the normalizer of S. The set of all 
elements that commute with every element of 
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S forms a subgroup Z(S), called the centralizer 
of S. The centralizer Z of G is called the center 
of G. The set of all elements conjugate to a 
given element a of G is called a conjugacy class. 
A group G is the disjoint union of its conju- 
gacy classes. 

Let H be a subgroup of a group G and x an 
element of G. The set of elements of the form 
hx (he H) is denoted by Hx and is called a 
right coset of H. A left coset xH is defined 
similarly. G is the disjoint union of left (right) 
cosets of H. The cardinality of the set of left 
cosets of H equals that of the set of right cosets 
of H; it is called the index of the subgroup H 
and is denoted by (G: H). Given two subgroups 
H and K of G, the set HxK = {hxk|heH, ke K} 
is called the double coset of H and K, and G is 
the disjoint union of different double cosets of 
H and K. If the left cosets of a subgroup H are 
also the right cosets, i.e., if Hx =xH for every 
x€G, then H is called a normal subgroup (or 
invariant subgroup) of G. An equivalent con- 
dition is that H = H* for all xeG. The center of 
G is always a normal subgroup of G. If H isa 
normal subgroup of G, the set of all products 
of an element of Ha and an element of Hb 
coincides with Hab. Thus if we define the 
product of two cosets Ha and Hb to be Hab, 
then the set of cosets of H forms a group. This 
group is denoted by G/H and is called the 
factor group (or quotient group) of G modulo 
H. (When G is an additive group, G/H is also 
denoted by G — H and is called the difference 
group.) The group G itself and {e} are normal 
subgroups of G. If G has no normal subgroup 
other than these two, then G is called a simple 
group. A subgroup of finite index contains a 
normal subgroup of finite index. If H is a 
subgroup of finite index, then we can find a 
common complete system of representatives of 
the left cosets and the right cosets of H. If G is 
finitely generated, then so is any subgroup of 
G of finite index. 

Let R be an *equivalence relation defined in 
a group G. If xRx' and yRy' always imply 
(xy)R (x' y), then we say that R is compatible 
with the multiplication. The tquotient set G/R 
is a group with respect to the induced multipli- 
cation. This group is called the quotient group 
of G with respect to R. The equivalence class 
H containing e is a normal subgroup, and 
xRx' if and only if x ! x'eH, i.e., x and x’ are 
contained in the same coset of H. Thus G/R 
coincides with G/H. 

If G is a finite group of order n, then the 
order and the index of any subgroup of G, the 
order of any element of G, the cardinal number 
of any conjugacy class of G, and the number of 
different conjugate subgroups of any subgroup 
of G are all divisors of n. 
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D. Isomorphisms and Homomorphisms 


If there is a one-to-one mapping a «>a of the 
elements of a group G onto those of a group 
G' and if aa and b e» b' imply ab e ab', then 
we say that G and G' are isomorphic and write 
G x G'. If we put a' = f(a), then f:G-5G'isa 
*byection satisfying f(ab) — f(a) f(b) (a, be G). 
More generally, if a mapping f: G—G' satisfies 
f(ab) = f (a) f (b) for all a, be G, then f is called a 
homomorphism of G to G’. An tinjective (fsur- 
jective) homomorphism is also called a tmono- 
morphism (*epimorphism). If there is a surjec- 
tive homomorphism GG’, then we say that 
G' is homomorphic to G. The composite of two 
homomorphisms is also a homomorphism. If a 
homomorphism f': G— G' is a bijection, then f 
is called an isomorphism. In this case f7! is 
also an isomorphism, and we have GG, 

For a subgroup H of a group G, the in- 
jective homomorphism f: H 2 G defined by 
f(a)- a (ae H) is called the canonical injection 
(or natural injection). For a factor group G/R 
of G, the surjective homomorphism f:G—G/R 
such that ae f(a) (ae G) is called the canonical 
surjection (or natural surjection). 

Let f:G—G' be a homomorphism. Then the 
image /(G) of f is a subgroup of G, and the 
kernel H = (ae G| f(a)= e (the identity of G’)} 
of f is a normal subgroup of G. The equiva- 
lence classes of the equivalence relation given 
by f(x) 2 f(y) are just the cosets of H, and f 
induces an isomorphism f: G/H 9 f(G). The 
latter proposition is called the homomorphism 
theorem of groups. This theorem is extended in 
the following way: For simplicity let f: G—G' 
be a surjective homomorphism. (i) If H’ is a 
normal subgroup of G’, then the inverse image 
H = f~'(H')is a normal subgroup of G, and f 
induces the isomorphism f:G/H — G'/H'. (ii) if 
H is a subgroup and N is a normal subgroup 
of G, then HN = {hn| he H,neN} is a subgroup 
of G, and the canonical injection H-» HN 
induces an isomorphism H/HN NO HN/N . (iii) 
If H and N are two normal subgroups of G 
such that H 5 N, then the canonical surjection 
G—G/N induces an isomorphism G/H > 
(G/N)/(H/N). Propositions (i), (ii), and (iii) are 
called the isomorphism theorems of groups. 

A homomorphism of G to itself is called an 
endomorphism of G, and an isomorphism of G 
to itself is called an automorphism of G. The 
set of automorphisms of G forms a group with 
respect to the composition of mappings, called 
the group of automorphisms of G. Given an 
element a of G, the mapping xoa ! xa (xeG) 
yields an automorphism of G which is called 
an inner automorphism of G. The set of inner 
automorphisms of G forms a normal subgroup 
of the group of automorphisms of G, called the 
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group of inner automorphisms of G, which is 
isomorphic to the factor group of G modulo its 
center. The factor group of the group of auto- 
morphisms of G modulo the group of inner 
automorphisms of G is called the group of 
outer automorphisms of G. 

If a mapping f: GG' from a group G to 
another group G' satisfies f(ab) = f(b) f(a) 
(a, be G), then f is called an antihomomor- 
phism. A bijective antihomomorphism is called 
an anti-isomorphism. When G=G’, f is called 
an anti-endomorphism or anti-automorphism 
(e.g., £f: G— G defined by f(a) «a^! is an anti- 
isomorphism). 


E. Groups with Operator Domain 


Let Q be a set and G a group. Suppose that for 
each 0€Q and vert, the product 0x&eG is 
defined and satisfies 0(xy) = 0(x)(0y). Then Q is 
called an operator domain of G, and G 1s called 
a group with operator domain Q, or simply an 
Q-group. (We sometimes write x? instead of 
0x.) The mapping (0, x) 0x from Q x G to G is 
called the *operation of Q on G. If G is an Q- 
group, then any element 0 of Q induces an 
endomorphism 0, : x 0x of G. Conversely, if 
we are given a mapping 00, of Q to the set 
of endomorphisms of G, then we may regard G 
as an Q-group. Any group may be regarded as 
an Q-group with Q equal to the empty set or 
to the set consisting of the identity automor- 
phism of G. Thus the general theory of groups 
can be extended to the theory of groups with 
operator domain, and in some cases effective 
use of suitable operator domains can be fruit- 
ful in the investigation of the properties of 
groups themselves. (— 2 Abelian Groups; 277 
Modules). 

A subgroup H of an Q-group G is called an 
Q-subgroup (or admissible subgroup) if 0x € H 
for any (ei and xe H. In this case, H is also 
an Q-group. If an equivalence relation R de- 
fined in G is compatible with the multiplica- 
tion and also compatible with the operators, 
namely, if xRx' implies (0x) R(0x") for any 
0€Q, then the quotient group G/R is also an 
Q-group. The equivalence class containing e is 
an admissible normal subgroup. Conversely, if 
H is an admissible normal subgroup, then the 
equivalence relation defined by H is compat- 
ible with the operators, and the factor group 
G/H is an Q-group. A homomorphism f: G 
-G' of an Q-group G to an Q-group G' is 
called an Q-homomorphism (admissible homo- 
morphism or operator homomorphism) if f (0x) 
= 0f (x) for any 0€Q and ve If f is an iso- 
morphism, then f is called an O-isomorphism 
(admissible isomorphism or operator isomor- 
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phism). We have the homomorphism theorem 
and the isomorphism theorems of Q-groups if 
we consider only admissible subgroups and 
admissible homomorphisms. 


F. Sequences of Subgroups 


Let H,, H,... be an infinite sequence of 
(normal) subgroups of a group G. If H; $ H;., 
(i= 1,2,...), then the sequence is called an 
ascending chain of (normal) subgroups. If H, 

g Hj(i-1,2,...) then it is called a descend- 
ing chain of (normal) subgroups. If there is no 
ascending (or descending) chain of (normal) 
subgroups of G, we say that G satisfies the 
ascending (or descending) chain condition for 
(normal) subgroups. These conditions are the 
same as the ascending (or descending) chain 
condition in the ordered set of all (normal) 
subgroups of G (— 311 Ordering C). A group 
G satisfies the ascending chain condition for 
subgroups if and only if every subgroup of 

G is finitely generated. Also, for groups with 
operator domain we have similar results. The 
structure of Abelian groups satisfying the 
ascending (descending) chain condition is 
completely determined (— 2 Abelian Groups). 
It is not known whether there is an infinite 
group satisfying both the ascending and de- 
scending chain conditions for subgroups. A 
group satisfying the descending chain con- 
dition for subgroups has no element of infinite 
order, but the converse is not true. There is an 
infinite group which is finitely generated and 
has no element of infinite order (— 161 Free 
Groups C). 


G. Normal Chains 


A finite sequence G—- 6,2 6,26G,2...2G, 

= {e} of subgroups of a group G is called a 
normal chain if G; is a normal subgroup of G; , 
for i— 1,2, ...,r. We call r the length of the 
chain. The sequence G)/G,, G,/G,, ..., G, ,/G, 
is called the sequence of factor groups of the 
normal chain. A normal chain G=H,>H,> 
H,>...>H,= {e} is called a refinement of 

the chain G=G >G, >... 2 G, lei if every G; 
appears in this chain. Two normal chains with 
the same length are called isomorphic if there 
is a one-to-one correspondence between their 
sequences of factor groups such that corre- 
sponding factor groups are isomorphic. Any 
two normal chains have refinements which are 
isomorphic to each other (Schreier's refine- 
ment theorem). A normal chain is called a 
composition series (or Jordan. Holder sequence) 
if it consists of different subgroups of G and in 
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any proper refinement there appear two suc- 
cessive subgroups which are the same. The 
sequence of factor groups of a composition 
series is called a composition factor series, and 
the factor groups appearing in this series are 
called composition factors. Any composition 
factor is a simple group. As a direct conse- 
quence of the refinement theorem we see that if 
a group G has a composition series, then the 
composition factor series is unique up to iso- 
morphism and the ordering of the factors. 
(This theorem is due to O. Hólder. C. Jordan 
proved that if G is a finite group, then the 

set of orders of composition factors is inde- 
pendent of the choice of composition series. 
Hence we call the theorem the Jordan-Hólder 
theorem.) 

For an Q-group G, if we consider only Q- 
subgroups, we have definitions and theorems 
similar to those in this section. When we take 
the group of inner automorphisms of G as Q, 
then a composition series of the O-group G is 
called a principal series. If we take the group of 
automorphisms of G as Q, then a composition 
series is called a characteristic series. An in- 
finite group G does not always have a compo- 
sition series. Even if G has a composition 
series, G may have an infinite normal chain 
G, c G,c...cG such that each G; is a normal 
subgroup of G,,, and | J G; - G. In fact, there is 
a simple group which has such an infinite 
normal chain (P. Hall). Two groups which 
have isomorphic composition series are not 
necessarily isomorphic. A subgroup of a group 
G is called a subnormal subgroup of G if it may 
appear in some normal chain. The intersection 
of two subnormal subgroups is also sub- 
normal, but their join De, the subgroup gen- 
erated by both of them) is not necessarily 
subnormal in an infinite group. The set of 
subgroups and the set of normal subgroups of 
a group form ‘lattices with respect to the inclu- 
sion relation (for the relationship between 
these lattices and the group structure — [8]). 


H. Commutator Subgroups 


Given two elements a and b of a group G, we 
calla !b^'ab- [a,b] the commutator of a and 
b. The subgroup C generated by all commu- 
tators in G is called the commutator subgroup 
(or derived group) of G. The subgroup C is a 
normal subgroup of G, and the factor group 
G/C is Abelian. On the other hand, if B is a 
normal subgroup of G and G/B is Abelian, then 
B contains C. For two subsets A, B of G, the 
subgroup generated by the commutators [a,b] 
(ae A, be B) is called the commutator group of 
A and B and is denoted by [A, B]. If A and B 
are normal subgroups of G, then C' 2 [A, B] is 
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also a normal subgroup of G, and A/C' com- 
mutes with B/C' elementwise in the factor 
group G/C’. Furthermore, [A, B] is the mini- 
mal normal subgroup with the property. The 
subgroup [G, G] is the commutator subgroup 
of G. 

If the commutator subgroup of G is Abelian, 
then G is called a meta-Abelian group. If a 
group G has a normal chain G(— G9)2 G, > 
G,(= {e}) of length 2 and the factor groups 
G/G,, G,/G; are Abelian, then G is meta- 
Abelian. Meta-Abelian groups are special 
cases of solvable groups, discussed in Section I. 


I. Solvable Groups 


Suppose that we are given a series of subgroups 
G; (1—0,1,2,...) of G such that G = Go and 
[G;, G;] = Gj,,. Then we have a normal chain 
G=G)>G,>G,>.... If G, - fe] for some 

r, then G is called a solvable group. For the 
normal chain G (2 Gp) 2G, >...G, (= {e}) the 
factor groups G,/G,,, (i=0,1,...,r—1) are all 
Abelian. A finite group G is solvable if and 
only if G has a composition series G = Họ > 
H,>H,>...>H,={e} such that the factor 
groups H;/H,,, (i—0,1,...,s5— 1) are all of 
prime order. An "irreducible algebraic equa- 
tion over a field of *characteristic 0 is solvable 
by radicals if and only if its Galois group is 
solvable (— 172 Galois Theory). 


J. Nilpotent Groups 


The sequence of subgroups G=G,)>G,> 
G>... defined inductively by setting G, = 

[G, G,_,] (r=1,2,...) is called the lower 
central series of G. If G,= {e} for some n, then 
G is called a nilpotent group, and the least 
number n with G, = {e} is called the class of the 
nilpotent group G. A nilpotent group is solv- 
able. Let Z, be the center of G, Z,/Z, be the 
center of G/Z,, and so on. Then we have a 
sequence of subgroups Zus (ej c Zi c Z4 c ..., 
called the upper central series of G. A group G 
is nilpotent if and only if Z„= G for some m, 
and the least number m with Z,, — G is the 
class of G. For the subgroups G, and Z, (r= 
1,2,...), we have [G,_,;,Z,]={e}. If G isa 

*Lie group, then G is nilpotent if and only if 
the corresponding *Lie algebra g is nilpotent, 
i.e., g"=0. 


K. Infinite Solvable Groups 


The concepts of solvability and nilpotency are 
generalized in several ways for infinite groups. 
For instance, a group G is called a generalized 
solvable group if any homomorphic image of G 
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which is unequal to {e} contains an Abelian 
normal subgroup unequal to {e}, and G is 
called a generalized nilpotent group if any 
homomorphic image (+ {e}) of G has center 
unequal to {e}. These definitions coincide with 
the previous ones for finite groups but not for 
infinite groups [7]. 


L. Direct Products 


Let G,,...,G, be a finite number of groups. 
The set G of all elements (x,, ..., x,) with x;e G; 
(i=1,...,n) is a group if we define the prod- 
uct of two elements x =(x,,...,x,) and y= 
(Yis, Yn) to be Xy =(X1 Yis 4x, Del, WE 

call G the direct product of groups G,, ...,G, 
and write G=G, x ... x G,. Ife; is the identity 
element of G;, then e=(e,,...,e,) is the identity 
element of G. The mapping (x,, ..., Xa) >X; 
from G to G; is a surjective homomorphism, 
called the canonical surjection. The subgroup 
H; zie, < €) Ix; € G;} is 
isomorphic to G;. The subgroups H; (i= 

1, ...,n) satisfy the following conditions: (i) H; 
is a normal subgroup of G. (ii) H; commutes 
with H, elementwise if ij. (iii) Any element of 
G can be written uniquely as the product of 
elements of H,, ..., H,. Conversely, if a group 
G has subgroups H,,..., H, satisfying these 
three conditions, then G is isomorphic to H, 
x... X H,. In this case we also write G— H, 

X ... X H,, and we call this a direct decompo- 
sition of G. Each H, is called a direct factor of 
G. Conditions (1), (ii), and (iii) are equivalent 

to condition (i), (i) G = H,H,... H,, and 

H, ... Hj. ,0 H;= (e) (0-2, ...,n). 

A group G ts called indecomposable if G 
cannot be decomposed into the direct product 
of two subgroups unequal to {e}, and com- 
pletely reducible if G is the direct product of 
simple groups. If G satisfies the ascending or 
descending chain condition for normal sub- 
groups, then G can be decomposed into the 
direct product of indecomposable groups. 
Such a decomposition is not unique in general, 
but if G has two direct product decompo- 
sitions G=G, x ... X G,— H, x... x H,, where 
G; and H, are indecomposable and not equal 
to (ej, then m— n and the factors G; are iso- 
morphic to the factors H; for some j; moreover, 
if G, corresponds, say, with H,, then we have 
G=H,xG,x...xG,,. This fact was first 
stated by J. H. M. Wedderburn, and a com- 
plete proof of the theorem was given by R. 
Remak and O. Schmidt. Later W. Krull ex- 
tended it to more general groups (with opera- 
tor domain), and we call it the Krull-Remak- 
Schmidt theorem. O. Ore formulated it as a 
theorem on *modular lattices. 

For an infinite number of groups G, (4e A) 


a Éis Xi, €i41, n 
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we define the direct product J],., of these 
groups similarly. The set of all elements 
(...,X5,...) (x; €G;) such that almost all x, De, 
all except a finite number of 4) are identity 
elements is a subgroup of the direct product, 
called the direct sum (or restricted direct prod- 
uct) of (G;j. 


M. Free Products 


Given a family of groups {G,},.,, we define 
the most general group G generated by these 
groups, called the free product of {G,}, to- 
gether with canonical injections f;: G; — G. 

Let S be the disjoint union of the sets 
{G,},-q, and regard G, as a subset of S. A 
word is either void or a finite sequence a,, 
d5, ...,Q, of elements of S, and we denote the 
set of all words by W. The product of two 
words w and w' is defined by connecting w 
with w' so that the *associative law holds. We 
write w>w’ when two words w and w satisfy 
one of the following two conditions: (i) The 
word w has successive members a, b which 
belong to the same group G,, and the word w' 
is obtained from w by replacing a, b by the 
product ab. (ii) Some member of w is an iden- 
tity element, and w' is the word obtained from 
w by deleting this member. For two words w 
and w’, we write w=w’ if there is a finite se- 
quence of words w=Wo, w,,..., w,—w' such 
that for each i (1 & i n), either w,_, >w; or 
w,>w;,_,. This relation is an equivalence rela- 
tion and is compatible with the multiplica- 
tion. Thus we may define a multiplication for 
the quotient set G of W by this equivalence 
relation, and then G is a group whose identity 
element is the equivalence class containing the 
void word. Any xeG, is regarded as a word, 
and we have an injective homomorphism 
f,:G; G by assigning the corresponding class 
to each element of G;. The group G is called 
the free product of the system of groups 
IG and f, is called the canonical injection. 
The free product G of {G,},., is characterized 
by the following universal property: Given a 
group G' and homomorphisms f;:G; G' 

(4€ A), we can find a unique homomorphism 
g:G—G' such that go f, — f;. The free product 
is the dual concept of direct product and is also 
called the tcoproduct (— 52 Categories and 
Functors). If each G, is an infinite cyclic group 
generated by a,, then the free product of the 
G; is the free group generated by {a,} (— 161 
Free Groups). 

The concept of free product is generalized in 
the following way. Let H be a fixed group. We 
consider the family of pairs (G,j), where G is a 
group and j: H OG is an injective homomor- 
phism. A homomorphism of pairs f:(G,j)— 
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(G^,j) is defined to be a group homomorphism 
f:G—G' such that f oj —j'. For a given family 
of pairs {(G,,j,)}, we have the amalgamated 
product (G, j) of the family and the canonical 
homomorphism f; :(G;,j,) —(G, j), which is 
characterized by the following universal prop- 
erty: Given a pair (IG, land homomorphisms 
f1:(G,,4,) 9 (G ,j), we have a unique homo- 
morphism g:(G,j)—(G', j) such that go f; — 
fj. If H = {e}, then the amalgamated prod- 
uct is the same as the free product. Now f, is 
an injection. If we regard G, as a subgroup of 
G, then G is generated by the subgroups G, 
and G,NG,=j,(H)=j,(H) nl 

The notion of the amalgamated product is 
useful in constructing groups with interesting 
properties. For instance, we have a group 
whose nonidentity elements are all conjugate 
(B. H. Neumann and G. Higman), and a group 
generated by a finite number of elements such 
that its homomorphic image (  (e]) is always 
an infinite group (Higman) so that we have an 
infinite simple group generated by a finite 
number of elements. 


N. Extensions 


Let N and F be groups. A group G is called an 
extension of F by N if G has a normal sub- 
group N isomorphic to N and G/N = F. The 
problem of finding all extensions was solved 
by Schreier (Monatsh. Math. Phys., 34 (1926); 
Abh. Math. Sem. Univ. Hamburg, 4 (1928)). 
Suppose that (1) to each oe F there corre- 
sponds an automorphism s, of N; (2) there 
exist elements c, , (o, TEF) of N such that 
s (s(a)) —c, (5, (2))c;.. (ae N); and (3) c,, «c, 
= S (c, alte en, Then the set G of all symbols as, 
(ae N,o € F)is an extension of F by N if we 
define multiplication by as, : bs, — (as,(b)c, )s,,. 
In fact, the set of all elements à —ac, s, (ae N) 
is a normal subgroup N of G such that G/N 
=F. Any extension can be obtained in this 
way. A system (s,, c, .) satisfying (1), (2), and (3) 
above is called a factor set belonging to F. 
Two factor sets (s,, c, .) and (t,,d,,,) are said to 
be associated if there exist elements a, (c € F) 
of N such that t,(a) 2 s,(a,aa, !) and d, ,= 
a,(s,(a,))c, el. In this case, two extensions 
determined by these factor sets are isomorphic. 
If (5,, c, ,) is associated with (t, d, ,) (do, = 1 for 
any o, tE F), then we say that the correspond- 
ing extension is a split extension. In this case, 
the extension G contains a subgroup F iso- 
morphic to F, and G— FN, FAN « (e]. We 
call such an extension a semidirect product of 
N and F. 

If N is Abelian, then condition (2) is simply 
s,(s,{a)) = s, (a), since the only inner automor- 
phism of N is the identity mapping. The con- 
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ditions of associated factor sets and split ex- 
tension are also simplified. If N is contained 
in the center of G, then G is called a central 
extension of N. 


O. Transfers 


Let H be a subgroup of finite index in G and g; 
(i 1, ..., h) be representatives of the right 
cosets of H. For be Hg, we write g; — b. Then 
for xeG an element X= H'II^., g;x(g;x) ! of 
H/H' is determined uniquely (independent of 
the choice of representatives), where H' is the 
commutator subgroup of H. The correspon- 
dence G' x X yields a homomorphism of G/G' 
to H/H', which is called the transfer from G/G' 
to H/H'. 





P. Generalizations 


The concept of group can be generalized in 
several ways. A set S in which a multiplication 
(a, b) ab satisfying (ab)c = a(bc) (the associa- 
tive law) is defined is called a semigroup. If S is 
a commutative semigroup in which ax = bx 
implies a — b (the cancellation law), then S can 
be embedded in a group G so that the multi- 
plication in S is preserved in G and any xeG is 
the quotient of two elements of $:x 2a !b— 
ba ! (a,beS). Such a group G is determined 
uniquely by S. We call it the group of quotients 
of S. 

The notion of semigroup is obtained by 
taking only associativity from the group 
axioms. On the other hand, if Q is a set with a 
law of composition (a, b)-»ab which is not 
necessarily associative but satisfies the con- 
dition that any two among a, b, c in the equa- 
tion ab —c determine the third uniquely, then 
Q is called a quasigroup. A quasigroup with an 
identity element e such that ea — ae — a for 
every element a is called a loop. For loops, we 
have an analog of the structure theory of 
groups (R. H. Bruck, Trans. Amer. Math. Soc., 
60 (1946)). 

If we give up the possibility of forming 
products for all pairs of elements or the 
uniqueness of the product in the axioms for 
groups, then we have the following generaliza- 
tions of groups. A set M with multiplication 
under which to any elements a, be M there 
corresponds a nonempty subset ab of M is 
called a hypergroupoid. Moreover, if the asso- 
ciative law (ab)c — a(bc) holds and for any 
elements a, be M there exist x, ye M such 
that be xa, beay, then M is called a hyper- 
group. 

A set M is called a mixed group if (1) M can 
be partitioned into disjoint subsets My, M,, 
M,,...;(2) for ae Mo, be M; (i=0, 1,2,...), 
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elements ab, a\b of M; are defined such that 
a(a\b) = b; (3) for b, ce M;, an element b/c of 
Mo is defined such that (b/c): c = b; and (4) the 
associative law (ab)c — a(bc) (a,be Mg, ce M) 
holds (A. Loewy, 1927). 

A set M is called a groupoid if (1) M can be 
partitioned into disjoint subsets M; (i, j = 
1,2,...); (2) for ae M; and be M,,, an ele- 
ment abe M; is defined; (3) for ae M;; and 
be Mir an element a be My, is defined such 
that a(a b) = b; (4) for ae Mj; and be M,,, an 
element a/b e M;, is defined such that (a/b)b 
=a; and (5) for ae Mj, be M, and ce My, the 
associative law a(bc)=(ab)c holds (H. Brandt, 
1926). These generalized concepts also have 
some practical applications (— 2 Abelian 
Groups; 13 Algebraic Groups; 60 Classical 
Groups; 69 Compact Groups; 92 Crystallo- 
graphic Groups; 122 Discontinuous Groups; 
151 Finite Groups; 161 Free Groups; 243 
Lattices; 249 Lie Groups; 277 Modules; 362 
Representations; 422 Topological Abelian 
Groups; 423 Topological Groups; 437 Unitary 
Representations). 


Q. History 


The concept of the group was first introduced 
in the early 19th century, but its rudiments can 
be found in antiquity; in fact, it was virtually 
contained in the concept of motion or trans- 
formation used in ancient geometry. From the 
time it took explicit form in the late 19th cen- 
tury, it has played a fundamental role in all 
fields of mathematics. 

In their study of algebraic equations in the 
late 18th century, J. L. *Lagrange, A. T. Van- 
dermonde, and P. Ruffini saw the importance 
of the group of permutations of roots; using 
this idea N. H. Abel showed that a general 
equation of degree > 5 cannot be solved alge- 
braically. A. L. *Cauchy studied the group of 
permutations of roots for its own interest, but 
a complete description of the relationship 
between groups and algebraic equations was 
first given by E. *Galois. C. Jordan developed 
a detailed exposition of the theory given by 
Abel and Galois in his Traité des substitutions 
(1870) [1]. Up to that time, a group meant a 
permutation group; the axiomatic definition of 
a group was given by A. Cayley (1854) and L. 
Kronecker (1870). F. *Klein emphasized the 
significance of group theory in geometry in his 
*Erlangen program (1872), and M. S. "Lie 
developed the theory of tLie groups in the 
1880s. In 1897, W. Burnside published his 
Theory of groups [3], whose second edition 
(1911) is one of the classics in group theory 
and is still valuable. Since 1896, G. Frobenius 
[2] and others have developed the theory of 
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representation of groups by matrices (— 362 
Representations). By that time, the theory of 
finite groups had acquired all its essential 
features. Among the branches of abstract 
algebra, the theory of groups was the first to 
develop; it led to the progress of abstract alge- 
bra in the 1930s. Since the latter half of that 
decade, the theory of finite groups has been 
developed further; there has been increased 
interest in the theory, and many significant 
results have been obtained, especially since 
1955 (— 151 Finite Groups). 
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Differential geometry studies differentiable 
manifolds and geometric objects or structures 
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on them. Among the geometric structures, the 
Riemannian and complex structures, with their 
contacts with other fields of mathematics and 
with their richness in results, occupy a central 
position in differential geometry. Perhaps it is 
impossible to say precisely what a differential 
geometric structure is or should be. However, 
the notion of G-structure allows a unified 
description of many of the interesting known 
geometric structures, such as Riemannian and 
complex structures. 


A. The Notion of G-Structures 


Let M be an m-dimensional C*-manifold, and 
let t: T(M)— M be its tangent bundle. For 
xe M, T,(M)=t !(x)is the vector space of 
tangent vectors at x. Let zx: F(M) M denote 
the tframe bundle of M. It is a GL(m; R)- 
principal bundle on M and F,(M)- x (x) is 
the set of linear isomorphisms (called frames at 
x) of R” onto T,(M). Here GL(m; R) is the 
general linear group of R". Hence F,(M) can 
be naturally identified with the set of ordered 
bases of T,(M). Write 0 —(0!, ..., 0") for the 
canonical form of F(M), which is the R"- 
valued 1-form on F(M) defined by 0(v) — 

p ‘(x,{v)) for any ve T,(F(M)). Given a dif- 
feomorphism f:M =N of M onto another 
C*-manifold N, we can naturally define the 
bundle isomorphism f: F(M)& F(N) by 
SD) =f, op. 

Let G be a tLie subgroup of GL(m; R). A 
principal G-subbundle z,: P— M of the frame 
bundle z: F(M)— M is called a G-structure 
over M. Thus P is a regular submanifold of 
F(M) satisfying the following three conditions: 

(Al) x(P)=M. 

(A2) for pe P and ceGL(m; R), poe P if and 
only if c eG. 

(A3) for any xe M, there exist an open 
neighborhood U of x and a C*-mapping 
s: U >P such that z(s(y)) — y for any ye U. 
Conversely, if a regular submanifold P of F(M) 
satisfies the above three conditions (A1), (A2), 
and (A3), then z, — z| p: P->M is a G-structure 
over M. When G is closed, then condition (A3) 
is automatically satisfied. The restriction of 0 
onto P is called the canonical form of P and is 
also denoted by 0. For an open subset U of M, 
the restriction P| U — ze '! (U) is a G-structure 
over U. 

Let xy: P2 M and n9:Q—N be G-structures 
over M and N, respectively. A diffeomorphism 
f:MxN is called a G-isomorphism of P onto 
Q if f '(P) — Q. When such an f exists, we say 
that the G-structures P and Q are equivalent. A 
G-isomorphism of P onto itself is called a G- 
automorphism of P. Write Aut(M, P) for the 
group of G-automorphisms of P. 
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Let g be the Lie algebra of G. Since G acts 
on P from the right, we have the natural Lie 
algebra homomorphism :: 9T (P, T(P)). Here 
T(P, T(P)) is the Lie algebra of C?—vector 
fields on P. Actually z is injective, and we write 
A* for 1(A) (Aeg). We call A* the fundamental 
vector field corresponding to A. 

For xeR", define po(x)e F,(R") by 
po) ((0))) 2 LP, v/(0/0x),, where x!, ..., x" 
are the natural coordinates of R". Any open 
subset D of R" has the natural G-structure 
P(D, G) defined by P(D, G)= (po(x): o € F,(R"; 
xeD,oe€Gj. We say that a G-structure zp: 
PM over M is integrable if for each xe M, 
there exists a local coordinate system (U, o, D) 
around x such that ọ is a G-isomorphism 
of P|U onto P(D, G). Equivalently, if we set 
p —(x!, ..., x" then the frame ((0/0x!),, ..., 
(6/0x"),) belongs to P for each ye U. 


B. Examples 


The following examples (B1)-(B7) show that 
some classical geometric structures can be 
treated uniformly from the viewpoint of G- 
structures. 

(B1) An absolute parallelism on M is, by 
definition, a system (X,,..., Kal of C^—vector 
fields on M such that at each point xe M 
1X1(x), ..., X,(x)] is a basis of T(M),. Clearly 
this is equivalent to giving an {e}-structure 
Tp: P— M, where e stands for the identity 
matrix of GL(m; R). Then P is integrable if and 
only if [X;, X;] 0, 1 <i, j<m. In this case, 
Aut(M, P) is a finite-dimensional tLie trans- 
formation group on M. 

(B2) Let E be a k-dimensional ‘distribution, 
namely, E is a k-dimensional vector subbundle 
of T(M). Write R” = R* @ R™ * Set GL(k, m; R) 
—(ieeGL(m; R); o(R*) c R*) and P2 (pe F(My; 
p(R*)c E). Then P is a GL(k, m; R)-structure 
over M. This gives a bijective correspondence 
between the k-dimensional distributions on M 
and the GL(k, m; R)-structures on M. More- 
over, P is integrable if and only if E is tin- 
volutive (Frobenius theorem). 

(B3) Let g be a Riemannian metric on M. 
Set O(m)= (e e GL(m; R;'oc =e}. Define P(g) 
by P(g) = li: Dale F(MY g(v;, v) = à). 
Then P(g) is an O(m)-structure on M. This 
gives a bijective correspondence between the 
Riemannian metrics on M and the O(m)- 
structures over M. Then Aut(M, P(g)) is 
the group of tisometries of g, and a finite- 
dimensional Lie transformation group on M. 
Moreover, P(g) is integrable if and only if g is 
flat in the sense that the Riemannian curvature 
tensor of g is zero. 

(B4) Two Riemannian metrics g, and g, are 
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called conformally equivalent if there exists a 
C?-function p >0 with g, = pg. This is an 
equivalence relation among the Riemannian 
metrics on M. An equivalence class {g} is 
called a conformal structure on M. Set CO(m) 
— (aA; AeO(m), oeh — (01). Define P((g]) 
by P((g]) Dr, .... o) e F(MY g(01, 0) = 
aó;, ac R — {0}}. Then P({g}) is a CO(m)- 
structure on M. This gives a bijective cor- 
respondence between the conformal struc- 
tures on M and the CO(m)-structures over 

M. Then Aut(M, P({g})) is the group of tcon- 
formal transformations of (gj and is a finite- 
dimensional Lie transformation group. More- 
over P({g}) is integrable if and only if the 
conformal structure {g} is conformally flat in 
the sense that at each point xe M, there exists 
a local coordinate system (x!, ..., x") around x 
such that g(0/0x', 0/0x?) = pô; where p is a 
positive function. 

(B5) Assume that m is even, say m— 21. By an 
almost symplectic structure on M, we mean a 
differential 2-form Q of maximal rank, i.e., 
QA... AQ (I times) never vanishes. Let A be 
the standard skew-symmetric bilinear form on 
R” defined by A((u’), (ri - (u! v? —u2v!) +... 
r (u? 71 9?! — 2192171), The symplectic group 
Sp(l; R) is defined by Sp(l; R)= (ce GL(21; R); 
A(cu, ov) = A(u, v), u, ve R?^. Set P(Q)— 

{pe F(My p*A-Q,. x —np(p)]. Then P(Q) is 
an Sp(I; R)-structure over M. This gives a 
bijective correspondence between the almost 
symplectic structures on M and the Sp(I; Ru 
structures over M. Then Aut(M, P(Q)) is the 
group of symplectic transformations of Q, 
which is never a finite-dimensional Lie trans- 
formation group. Moreover P(Q) is integrable 
if and only if Q is a symplectic structure, i.e., 
dQ — 0. 

(B6) Assume that m is even, say m—21. We 
identify C' with R” as a real vector space. 
Then GL(m; C) is a Lie subgroup of GL(2I; R). 
Let J be an almost complex structure on M. 
Set P(J)— (tpe F(My; Jp(u) = p(iu), ue R” « C!). 
Then P(J) is a GL(l; C)-structure on M. This 
gives a bijective correspondence between the 
almost complex structures on M and the 
GL(l; C)-structures over M. Then Aut(M, P(J)) 
is the group of J-analytic transformations 
on M. Furthermore, Aut(M, P(J)) is a finite- 
dimensional Lie transformation group when 
M is compact. Moreover P(J) is integrable if 
and only if J is a complex structure. 

(B7) It is easy to see that there is a bijec- 
tive correspondence between the SL(m; R)- 
structures over M and the set of volume ele- 
ments on M. Any SL(m; R)-structure over M is 
always integrable and Aut(M, P) is the group 
of diffeomorphisms preserving Q, which is 
never a finite-dimensional Lie transformation 
group. 


ij? 
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C. Structure Functions 


Let G be a Lie subgroup of GL(m; R) and q the 
Lie algebra of G. For convenience, we write V 
for R" and V* for the dual space of V. Then 
V & A?V* can be considered as the space of 
skew-symmetric bilinear mappings from V x V 
into V, and g ® V* can be identified with the 
space of linear mappings from V into g. Define 
a linear mapping 0:9 Q V*—2 V & A?V* by 
(OT Wu, v) = — T(u)v + T(v)u 

for TegGV*, uver. 


Put 
H^'(g VG A?*V*/0(g @ V*). 


Let tp: PAM be a G-structure over M and 
0 the canonical form of P. Take any frame p in 
P. An m-dimensional linear subspace H of 
T; (P) is said to be a horizontal subspace of 
P at pif 0,: HO V(—R")is a linear isomor- 
phism. Then we write fy for 0, ': VH. Write 
$(T,(P)) for the set of horizontal subspaces of 
P at p. For He S(T,(P)), define c(p, H)e V © 
A V* by 


c(p, H)(u, v) — d0( falu), fu(v)) for 
Then we have 
c(p, Hi) Ger c(p, Hie ó(g Gi V*) 

for H,,H,€(T,{(P)). 


u,veV. 


Therefore we have a well-defined element 
c(p)e H^! (a) by c(p) = (c(p, H)}. We call the 
mapping c: P— H?'! (g) the structure function of 
the G-structure xy: P^ M. 

Let Q9 N be another G-structure over N 
and f: M =N be a G-isomorphism of P onto 
Q. Then we have c( f (p) 2 c(p) for pe P. In 
particular, it is easily seen that if P is inte- 
grable, then c z 0. However, the converse is not 
true in general. 


D. Prolongation of Linear Lie Algebras and 
Groups 


Let K denote either R or C. Let g be a Lie 
subalgebra of gl(m; K). For k=0,1,2,..., let g, 
be the space of K-symmetric multilinear 
mappings 

CK" x... x K"5K" 


—— 
(k+ 1) times 


such that for each fixed u,, ..., u, € K”, the 


linear transformation 
veK" 5 t(o,u,,...,u)eK" 


belongs to g. In particular, ge — g. We call g, 
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the kth prolongation of g. If a, —0, then g,,, 
—,4,5— ... =0. The first integer k such that a, 
=0 is called the order of o If g, #0 for all k, 
then g is said to be of infinite type. When g 
contains a matrix of rank 1 as an element, then 
g is of infinite type. When K — C, the converse 
is also true [3]. However, when K=R, the 
converse is not true in general. In fact, if we 
consider al(/; C) as a real Lie subalgebra of 
gl(2/; R) (example (B6)), then al(I; C) is of in- 
finite type, having no matrix of rank I in it. In 
general, a Lie subalgebra g of gl(m; R) is said 
to be of elliptic type if g contains no matrix of 
rank 1 as an element. We consider g, as an 
Abelian Lie subalgebra of gI(K" @ g) by the 
correspondence teg,  tegl(K" @ g), where t 
is defined by 


t(u)=t(-,u) for ucK", 


t(A)=0 for Aeg. 


More generally, we consider g,,, to be an 
Abelian Lie subalgebra of al(K" Oo, € ... ® 
g,) by virtue of the correspondence téq,,,+> 
tegl(K" Q qo O ... 9), where t is defined by 


t(u)—t(,...,",u)eg, for ueK", 


AED... Bg. 


Identifying gl((K” ® go ® ... & 8, OD g,) with 
gl(K" ® go ® ... ® g,), we have the important 
identity 


(Di) ` (oul zs On, 


Let G be a Lie subgroup of GL(m; R) and o 
the Lie subalgebra of gl(m; R) corresponding to 
G. For k- 1,2, ..., let G, be the connected 
Abelian Lie subgroup of GL(R" Q go Q ... O 
9-1) corresponding to the Abelian Lie sub- 
algebra g,; namely, G, consists of the linear 


mappings o(t) (t&9,) defined by 


t(4)20 for 


k=0, 1, 2,.... 


c(t)(u)u--t(^,...,",u) forueR", 


c(t(4)2A for Aeg, Q ...Qa,.,. 
We call G, the kth prolongation of G. We say 
that G is of finite (resp. infinite) type if g is of 


finite (resp. infinite) type. We say that G is of 
elliptic type if g is. From (D1), we have 


(D2) (6G); =Gert- 


We know all irreducible Lie subalgebras g of 
al(m; K) with order >2 [3,6]. 

(D3) An irreducible Lie subalgebra g of 
al(m; C) is of infinite type if and only if g is one 
of 


gl(m; C), sl(m;C), ev (Asc). (5c) 


where csp(m/2; C) € Q sp(m/2, C). 
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(D4) An irreducible Lie algebra g of gl(m; R) 
is of infinite type if and only if g is one of 


ilc). e "e 
a 27 H $ SC > csp 4’ > 
m 
—;C , 
»() 


gl(m; R),  sl(m;R), ev (iR). ln) 


where csp(m/2; R)=R © sp(m/2; R). 


(D5) Let S be an m-dimensional irreducible 
Hermitian symmetric space of compact type, 
which is different from the complex projective 
space. Let L(S) be the identity connected com- 
ponent of the group of biholomorphic trans- 
formations of S. Take any point oe S and set 
Lo(S)= {oe L(S);e(o) 2 0). Let a(S) be the 
linear isotropy Lie subalgebra of gl(m; C) cor- 
responding to L(S). Then g(S) is an irreduc- 
ible Lie subalgebra of al(m; C) of order 2. Con- 
versely, every irreducible Lie subalgebra g of 
gl(m; C) of finite type with order > 2 is equal to 
g(S) as given above. 

(D6) Let S be an m-dimensional irreducible 
symmetric space of compact type. Assume that 
S admits a finite-dimensional connected Lie 
transformation group L(S) which strictly con- 
tains the connected component of the group of 
isometries of S. Take any point oeh, and set 
L(S)= {0 € L(S); (0) o). Let g(S) be the 
linear isotropy Lie subalgebra of gl(m; R) cor- 
responding to L,(S). If (S, L(S)) Z (sphere, pro- 
jective transformation group) or (S, L(S)) 
(complex projective space, complex projective 
transformation group), then g(S) is an irre- 
ducible Lie subalgebra of gl(m; R) of order 2. 
Conversely, every irreducible Lie subalgebra 
g of al (m; R) of finite type with order 22 is 
equal to g(s) as given above. For example, if 
we take an m-dimensional sphere as S and the 
group of conformal transformations of S as 
L(S), then g(s)= co(m). 


E. Prolongation of G-Structures 


Let G be a Lie subgroup of GL(m; R) and let g 
be the Lie subalgebra of al(m; R) correspond- 
ing to G. We choose once and for all a linear 
subspace C of V & A?V* such that 


VA A2V*2a0(g 9 V*)O C. 


In general there is no natural way of choosing 
such a C. 

Let 1»: P—M be a G-structure on M. For 
each horizoantal subspace H of P at p (Le., 
HeO(T,(P)), we define the frame (p, H)e F,(P) 
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to be a linear isomorphism (p, H): R" @ g> 
T, (P) given by 


(p, H)(v ® A)= falv) A* for veR", Aeg, 


where A* is the fundamental vector field on P 
induced from Aeg. Let P, = ((p, H)e F(P); 
HeX (T, (P), c(p, H)e C}. Then P,>P isa 
G,-structure over P. We call P, P the first 
prolongation of P. The kth prolongation P, of 
P is defined inductively by P, - (P, .,), =the 
first prolongation of P,_,. From (D2), P, is a 
G,-structure over P,_,. 

Take any f'e Aut(M, P). It can be proved 
that f e Aut(P, P,). Since P, =(P,_,),, we can 
define f ? € Aut(P,. ,, P) inductively by f? = 
(f * 9)». By the correspondence f f™, we 
can consider Aut(M, P) as a subgroup of 
Aut(P,_,, P,). 

If G is connected, then 


(El) Aut(M, P)=Aut(P,_,,P,), k=1,2,.... 


Let QN be another G-structure. By a similar 
argument to that above, we see that P2 M 
and Q—N are equivalent if and only if P, 
P, and Q, Q,. , are equivalent. In [1], E. 
Cartan studied general equivalence problems 
of two G-structures. In that problem the above 
prolongation procedure plays an important 
role. 


F. Automorphisms of G-Structures of Finite 
Type 


S. Kobayashi proved the following: 

(F1) Theorem [4]. Let P>M be an {e}- 
structure over M. Then Aut(M, P) is a finite- 
dimensional Lie transformation group on 
M such that dim Aut(M, P) x dim M. More 
precisely, for any point xe M, the mapping 
c € Aut(M, P) c(x)e M is injective, and its 
image (o(x); o € Aut(M, P)} is a closed sub- 
manifold of M. The submanifold structure on 
this image makes Aut(M, P) into a finite- 
dimensional Lie transformation group on M. 

Now let G be a Lie subgroup of GL(m; R) 
of finite type, say G, = (e). Then B,2 P, , 
is an {e}-structure. From theorem (F1) 
above, it follows that Aut(P,_,, P) is a finite- 
dimensional Lie transformation group. As 
explained in Section E, Aut(M, P) is a sub- 
group of Aut(P, ,, P,). Clearly Aut(M, P) is 
closed in Aut(P,_,, P,). Since every closed 
subgroup of a Lie group is again a Lie group, 
we have the following: 

(F2) Theorem. Let P— M be a G-structure 
over M. If G is of finite type, then Aut(M, P) is 
a Lie transformation group of dimension 
xdim(R"$9g099,G ...99,.,) 
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G. Automorphisms of G-Structures of Elliptic 
Type 


The following theorem of R. Palais allows us 
to prove that the automorphism groups of 
many general geometric structures are finite- 
dimensional Lie transformation groups. 

(G1) Theorem [5]. Let L be a group of C*- 
transformations of a C?-manifold M. Let I be 
the set of all vector fields X on M which gener- 
ate global 1-parameter groups g,=exptX of 
transformations of M such that 9,¢ L. If the 
set I generates a finite-dimensional Lie algc- 
bra of vector fields on M, then L is a finite- 
dimensional Lie transformation group, and l is 
the Lie algebra of L. 

Let PM be a G-structure over M. For any 
representation p: G— GL(V), we write E(p) for 
the associated vector bundle of P with respect 
to p. Let o,:G— GL(al(m; R)) be the represen- 
tation defined by p,(c) A— oAc ' fora eG, 
Aegl(m; R). Let p,: G— GL(g) be the represen- 
tation defined by p,(a)A=aAo | for cEG, 
Aeg. Then p; is a subrepresentation of p,, 
since p,(c)A = p;(c) A for ce G, Aeg. We re- 
mark that E(p,) — Hom(T(M), T(M)). We 
write g(M) for E(p,). Then g(M) is a vector 
subbundle of Hom(T(M), T(M)). We write F 
for the quotient vector bundle Hom(T(M), 
T(M)y/a(M). Let ®: Hom(T(M), T(M))— F be 
the natural projection. We fix an affine con- 
nection V on M which preserves P. We write 
T for the torsion tensor field of V. For each 
vector field X er(M, T(M)), we define a C*- 
section VX + T(X,:) in TOM. Hom(T(M), T(M)) 
by 


uc T(M),t>V,X + T(X(x), ule T,(M). 


Then define a first-order linear differential 
operator 


D:T(M, T(M))>T(M, F) 


by D(X)=®(VX + T(X,:)). It is easy to see 
that 


(G2) I(M, P)ckerD, 


where I(M, P) is the Lie algebra of all vector 
fields X ef OM, T(M)) which generate global 
1-parameter groups q, — exptX of transforma- 
tions of M such that o, € Aut(M, P). Then we 
can show: 

(G3) D is an elliptic operator if and only if G 
is of elliptic type. 

Now suppose M is compact. Then from the 
standard fact on linear elliptic operators on 
compact manifolds, we know the dimension of 
ker D is finite if D is elliptic. Thus from (G2), 
(G3), and theorem (G1), we have the following: 

(G4) Theorem [7]. Let P2 M be a G- 
structure on an m-dimensional compact mani- 
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fold M. If G is of elliptic type, then Aut(M, P) 
is a finite-dimensional Lie transformation 
group. 


H. Local Equivalence Problem 


Let ze P2 M and n9:Q—N be two G- 
structures. We say that P9 M and Q>N 
are locally equivalent if the following holds: 

(H1) For arbitrarily given (p, q)e P x Q, there 
exists a local G-isomorphism f: U x V of P|U 
onto Q|V such that f (p) q, where U (resp. 
V) is an open neighborhood of zp(p) (resp. 
rola). 

From now on, we assume that the structure 
function c, (resp. Cg) of P (resp. Q) is constant. 
If such an f as in (H1) exists, we must have 
col f (p) - c,(p') for p'e P| U. Therefore cp 
and cg must be the same constant. Cartan 
reduced the local equivalence problem be- 
tween P and Q to that of certain differential 
systems on P x Q. In fact, let 0, — (0!, ..., 0) 
and 65 — (y/!, ..., V") be the canonical forms of 
P and Q respectively. Let a: Px QoS P, f: P x 
Q0 be the canonical projections such that 
a(p,q)- p, B(p,q) — q. Let X be the differen- 
tial system on P x Q generated by (a0! — 
Dei, zs — Buy". If X is the m- 
dimensional regular submanifold of P x Q 
given by the graph of f" in (Hl), ie, X = 
Ip, f(p))eP x Q;p'e P| U}, then X is an 
m-dimensional integral submanifold of X 
satisfying the following conditions: 


(H2) (pq)eX, 


(H3) «a*6',...,0*60” are linearly indepen- 
dent on T,,, (X). 


(p.q) 


Conversely any m-dimensional integral sub- 
manifold of X satisfying (H2) and (H3) is the 
graph of a local G-isomorphism f required in 
(H1). Therefore the local equivalence problem 
between P and Q is equivalent to the problem 
of finding an m-dimensional integral submani- 
fold X of X satisfying (H2) and (H3) for any 
(p.q)eP x Q. 

We say that G is involutive if there exist 
linear subspaces 02 V; c V, c... c V, 2R" 
such that 


(H4) dimV,=k (k=0,...,m), 
(H5) dimg, — Y, dimg(h), 
k=0 


where g is the Lie algebra of G and g(V,)= 
(Aeg; A(u)=0 for ue V,]. Now we have: 
(H6) Proposition. Let PM and QN be 
two G-structures such that their structure 
functions are constant and equal. The dif- 
ferential system X on P x Q defined as above 
is involutive at any m-dimensional integral 
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element E of X on which x*0!, ..., a 0" 
are linearly independent if and only if G is 
involutive. 

Combining proposition (H6) above with the 
classical tCartan-Kahler theorem, we obtain 
the following theorems: 

(H7) Theorem ([1], also see [8]). Let P2.M 
and QN be two real analytic G-structures 
over M and N, respectively, such that their 
structure functions are constant and equal. 
Suppose G is involutive. Then P— M and 
QN are locally equivalent. 

(H8) Theorem. Let P— M be a real analytic 
G-structure over M. Suppose G is involutive. 
Then P is integrable if and only if the structure 
function of P is zero everywhere. 

Finally, we remark that for a general Lie 
subgroup G c GL (m, R), G, is involutive for 
kz kg [8]. 


I. Cartan-Káhler Theorem on Differential 
Systems 


Let U be an open subset of R". We write 
A*(U) for the space of real analytic differential 
k-forms on U. Put A*(U) 2 A*(U) O ... 
A"(U). Then A*(U) is a graded R-algebra 
with respect to the usual exterior product. By 
a differential system on U we mean an ideal 
» of A*(U) such that 


(Il) X-A49?(UnxgeAu)nxzxe.. 

e xnA"(U) 
(I2) d(2)c 2, 
(I3 2 is finitely generated as an ideal. 


For convenience we write 2 = X A*(U). 
When 2°) = {0}, we call X a restricted dif- 
ferential system. 

Let X be a differential system on U c R”. By 
a k-dimensional integral manifold of 2, we 
mean a k-dimensional regular real analytic 
submanifold of U such that for any «e X, we 
have a| X =0. In the above definition, it is 
sufficient to know that «| X =0 for any ae X'9. 

For xe U, we write G,(T,(U)) for the set of 
all k-dimensional linear subspaces of T,(U). 
Set G,(T(U)) = (<u G,(T;(U)) (disjoint union); 
then G,(T(U)) is naturally a real analytic 
manifold of dimension m 4- m(m — k). Let X 
be a differential system on U. A k-plane Ee 
G,(T,(U)) is called a k-dimensional integral ele- 
ment of X at x if for any xe X9, we have «| E 
=0. Thus X is a k-dimensional integral sub- 
manifold if and only if T;(X) is a k-dimensional 
integral element of X for each xe X. We write 
I,(X) for the set of k-dimensional integral ele- 
ments of X. In general /,(2) ts a real analytic 
subset of G,(T(U)). For any k-dimensional 


IL 


integral element E at xe U, we define the polar 
space H(E) by 


H(E)= (ue T,(U); a(u, v,, ...,0,) -0 
for «e DT D pu, o, € E). 


Then E c H(E). Moreover FeG,.,(T,(U)) 

with FE is in J,,,(2) if and only if F c H(E). 
Let E be a k-dimensional integral element of 
X. We choose 9!, ..., o*e A! (U) such that 

o! |E, ..., 9*|E are linearly independent. Define 
U(p*,...,0") by 


(o, .... 9) - (Fe G(T(U)); o |F, ..., p*|F 
linearly independent}. 
Then 4/(o!, ..., ^) is an open neighborhood 


of E in G,(T(U)). For any element Fe(o!, 

.., 9^), define F}, ..., FLE F by o'(F,)=6). Thus 
(Fi, ..., F,] is the dual basis of o'|F,..., o*|F. 
Any element «e 2“) defines a real analytic 
function a, on 4/(o!, ..., 0") by a (F)  a(F;, 
..., F). We call E regular if there exist a, ..., 
ale X? and an open neighborhood ¥ of E in 
U(p',...,@*) such that 


(I LQ)nY ={FeV:01(F)=...=a!(F) 
=0}, 


(I5) dal,...,da} are linearly independent 
ony, 


(I6 dim H(F) is constant for any Fey. 


We say that Ee/,(2) is an ordinary element if 
E contains a (k — 1)-dimensional regular in- 
tegral element. 

The following is the well-known theorem 
due to Cartan and Káhler [7]. 

(17) Theorem. Let X be a restricted dif- 
ferential system on UCR”. Let X be a k- 
dimensional integral submanifold of Z. Sup- 
pose that T; (X) is regular for a point x e X and 
that there exists a (k + 1)-dimensional integral 
element F at x with F 2 T,(X). Then there 
exists a (k + 1)-dimensional integral submani- 
fold Y of X such that xe Y, T,(Y)=F, and 
XcY in a neighborhood of x. 

We say that Z is involutive at Ec I, (2) if 
there exist O2 E? C E! c... c EF c E such 
that E? is a j-dimensional regular integral 
element of X (j—0, 1, ..., k— 1). Applying 
theorem (G7) inductively, we obtain: 

Corollary. Let X be a restricted differential 
system on UCR” If E is a k-dimensional 
involutive integral element of X at x, there 
exists a k-dimensional integral submanifold of 
Z such that xe X and T,(X)— E. 

In view of this corollary, it is important to 
know when Z is involutive at Ee I, (X). 

(I8) Lemma [9]. Let X be a restricted dif- 
ferential system on U cR”, and E be a k- 
dimensional integral element of X. Then X is 
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involutive at E if and only if there exist 0 — E? 
cE'c...c EK! c E such that R(X) is a sub- 
manifold of G,(T(U)) near E with dimension m 
 k(m — k) — Eg t;, where tj,2 m — dim HUE, 
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A. Fourier Transforms 


Let f(x) be an element of the tfunction space 
L4,(—00, oo) and t a real number. Then the 
integral 


fo-Qz) "^ | i Tele "* dx (1) 


converges, and the function f(t) is continuous 
in (—oo, oo). We call f the Fourier transform 
of f. It f(x)e L (—œ, oo), then f(x)e L,(— a, a) 
for any finite interval ( — a, a), and if we set 


fa)-Qz) 17 f f(xje "dx, 


then f, tconverges in the mean of order 2 as 
a— oo to a function fin L,. In this case, we 
define f to be the fFourier transform of f 
(€ L,). Furthermore, in this case, if we set 


fa) = Qn) "P | f(t)e^^ at, 


then Lim., ,,, f(x) = f(x) (Plancherel theo- 
rem). Moreover, we have tParseval’s identity: 
[ef o9 dx = lz. Um dt (— 160 Fourier 
Transform). 

Suppose that f(x) is periodic with period 22 
and f e L,( — n, x). We set 


a,-—(2n) | i f(xje "* dx 


(*Fourier coefficients). The nth partial sum of 
the *Fourier series s (x) - X"... ,a,e^"* con- 
verges in the mean of order 2 to f(x), and 
Parseval's identity 


pn 2 
x | oras È Dk 


holds. On the other hand, if {a,} is a given 
sequence such that X7... |a,|? < oo, then 
$7. .,a,e'"* converges in the mean of order 2 
to a function f(x), the Fourier coefficients of 
f(x) are (a,), and Parseval’s identity holds 


(— 159 Fourier Series). 


B. Bochner's Theorem and Herglotz's Theorem 


A complex-valued function f(x) defined on 
(—co, oo) is said to be of positive type (or posi- 
tive definite) if 277 -1 f(x; — x,) 5,6, 2 0 for any 
finite number of reals x,, X3, ..., x, and com- 
plex numbers €,, &,, ..., ën If f(x) is measur- 
able on (—oo, oo) and of positive type, then 
there exists a ‘monotone increasing real- 
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valued bounded function a(t) such that 


fe9- | g e"*da(t) (2) 


for almost all x. If a( —co)=0 and a(t) is right 
continuous, then «(t) is unique (Bochner’s 
theorem). Conversely, if «(t) is nondecreasing 
and bounded and f(x) is defined by (2), then 
J (x) (called the Fourier-Stieltjes transform of 
a(t)) is continuous and of positive type. 

A sequence {a,} (—00 «n < oo) is said to 
be positive definite (or of positive type) if 
Zi dert 20 for finitely many arbitrarily 
chosen complex numbers €,, Čz, ..., ,. If {a,} 
is of positive type, then there exists a mono- 
tone increasing bounded function a(t) on 
[ ^7, z] such that 


T 
a,= | ei" datt 
(Herglotz’s theorem). Conversely, if x(t) is 
monotone increasing and bounded and a, is 
defined by the above integral, then the se- 
quence (a,j is of positive type. 


C. Poisson’s Summation Formula 


If f(x)e L,(—o0o, oo) is of "bounded variation 
and continuous and if f(t) is its Fourier trans- 
form, then we have 


Ja Y f(ak)- /b 2 f(bk), 
(EE =-0 

where ab = 2z (a> 0). This is called Poisson’s 

summation formula. 


D. Generalized Tauberian Theorems of Wiener 


Suppose that we are given a function f(x)e 
L,(—o0, oo) whose Fourier transform f(t) is 
never zero. Then the set of functions given by 


h(x)— | fix — y)g(y) dy, 


where ge L,( —00, oo), is dense in L,(—0o, oo). 
Hence we can deduce the tgeneralized Tauber- 
ian theorem of Wiener: If the Fourier trans- 
form ku of k,(x)e L,(—00, oo) does not van- 
ish for any real t and 


lim | enforce | ki (y)dy 


— 00 


for a function f(x) that is bounded and 
measurable on (—oo, œ), then for any k€ 
L,(—oo, ol, 


oo 


k,(y)dy 


— oa 


tim | s-nfoy-c | 


= 00 
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(— 160 Fourier Transform G). Hence we can 
deduce *Tauberian theorems of the Littlewood 


type [3]. 


E. Harmonic Analysis 


Let «(A) be a complex-valued function on 
(—00, oo) that is of bounded variation and 
right continuous. If we have the expression 


f()— | e^ da(4), Q2) 


then we say that the function f(t) is repre- 
sented by the superposition of harmonic oscil- 
lations e™™. Conversely, when f(t) is given, we 
have the problem of finding a function «(A) as 
above such that f(t) can be expressed in the 
form of (2). When such a function «(A) exists, it 
is also an important problem (in harmonic 
analysis) to find the amplitude «(4) — «(4 —0) of 
the component of a proper oscillation. Con- 
cerning this problem, we have the following 
three theorems: 

(I) A necessary and sufficient condition for 
f(x) to be representable in the form (2) is 


EMINET 


(II) For a function f(t) expressed in the form 
(2'), we have: (i) for any An, 


dÀ « oo. 











1 fT f 
a(2o) — (4o —0)— lim zl f(t)e dt, 
p -T 


and (ii) if x(4) is continuous at å = An —6 and 


À-Ag-- 6 (o 70), then 
a(o —0) 


sin Ze 


allo -0)— 


— lim E MN 
To 


(III) In (2^, suppose that the discontinuity 
points of (å) are 4,, 45, ..., and set a, —a(4,) — 


fe ^! dt. 





2(/, —0) (n— 1,2, ...); then 
5 EIE o0 
lim | f(t--sf(sds— Y |a, e^ 
S>% 0 n=1 


F. The Paley-Wiener Theorem 


In formula (1), if we change the variable from t 
to a complex variable 6 =t + io, then we have 


SEAN Tale "dx, G) 


which is called the Fourier-Laplace transform 
of f(x). In particular, if f(x) has bounded tsup- 
port, then F(Z) is an tentire function. Concern- 
ing Fourier-Laplace transforms, Paley and 
Wiener proved the following theorems: 
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(I) A necessary and sufficient condition for 
an entire function F({) to be the Fourier- 
Laplace transform of a tfunction of class C? 
having its support in a finite interval [ — B, B] 
is that for any N, there exists a constant C, 0 
such that IECH < Cat) +161) Tell for all (= 
t ic. 

(II) If g(t)e L,(0, oo), then its one-sided *La- 
place transform 


oo 


f(z2) - Qn) '? | g(tje “dt 


0 
satisfies: (i) f(z) is holomorphic in the right 
half-plane Rez>0, and (ii) 


sup | | f(x - iy? dy < oo. 
x20 — 00 

Conversely, if f(z) 2 f(x + iy) (x » 0) satisfies 
(i) and (ii), then the boundary function f(iy)e 
L (—œ, oc) exists and is such that 


im | Lf (iv) — f(x + iy)? dy =0, 


its Fourier transform in L, 


N 


f(iyje'? dy 


TN 


g(t)= Lim. (2x) !? | 


vanishes at almost all negative t, and f(z) is the 
one-sided Laplace transform of g(t). 


G. Harmonic Analysis on Locally Compact 
Abelian Groups 


The general theory of harmonic analysis on 
the real line was extended to a theory on lo- 
cally compact Abelian groups by A. Weil, I. 
M. Gelfand, D. A. Raikov, and others. The 
theory of normed rings was utilized for the 
development of the theory (— 36 Banach 
Algebras). This theory is called harmonic 
analysis on locally compact Abelian groups, 
and it has been developed as described in the 
following sections. 


H. Group Rings 


Let L, =L,(G) be the set of all integrable 
functions with respect to a tHaar measure on a 
locally compact Abelian group G. If we define 
the norm and multiplication in L,(G) by 


Bu -| [f(x)| dx, 
G 


f s= fixy *)g(y)dy, 
G 


respectively, then L, has the structure of a 
commutative *Banach algebra. (We call f -g 
the convolution (or composition product) of f 


192 I 
Harmonic Analysis 


and g.) If the topology of G is not ‘discrete, 
L(G) does not have a unity for multiplication. 
Hence, adjoining a formal unity 1 to L,(G), 
we set R — (al + f |a is a complex number, 
feL,(G)}, and 


Tei - f lh =lol+ Lf M 
(xl f) E (B1 g) - (x - B1 Cf +g), 
(x1 +f) (B1 g) — (a8)1 - (Bf - xg - f: 9). 


Then R is a commutative Banach algebra with 
unity. When G is discrete, R 2 L,(G). The 
Banach algebra R is called the group algebra of 
G. The group algebra R of G is *semisimple. R 
is algebraically isomorphic to a subalgebra of 
COU, which is the associative algebra of all 
continuous functions on the compact Haus- 
dorff space WM consisting of all maximal ideals 
in R (— 36 Banach Algebras). By this corre- 
spondence, if ge R corresponds to q(M), 
which is a function on W, then sup, go (M)| 
«ell. L,(G) belongs to W if and only if the 
group G is not discrete. 


I. Fourier Transforms 


According as the group G is discrete or not, we 
set 9t — 9t or 9t 2 99 — ([ L,(G)]. Then there 
exists a one-to-one correspondence between 
the elements Me and the elements y of the 
‘character group G of G such that the follow- 
ing formulas are valid: 


fw | x(x) f(x) dx, fel, , (4) 
x)= (M)/f(M), (5) 


where f(x) — f(xy ') and f is a function such 
that f(M) x0. This correspondence M — y 
gives a homeomorphism between the locally 
compact space 9t and G. Hence if we identify 
M with y and set f(M)— f(y), then f is a con- 
tinuous function on G, called the Fourier trans- 
form of f(x). Since f » f(M) is an algebraic 
isomorphism, (fg) '() = f(99(). If f= 
g(x), then f is equal to g in L,(G). This is the 
uniqueness theorem of Fourier transforms. 
From it we can deduce the *maximal almost 
periodicity of locally compact Abelian groups. 
If G is not discrete, then L,(G)eM, and 
f(L,(G)) for fe L,(G). Hence (elt zc) 
is a compact subset of G. This means that f is 
à continuous function vanishing at infinity on 
G. (This is a generalization of the *Riemann- 
Lebesgue theorem concerning the cases G — R! 
or T! 2 R/Z.) Any continuous function u(x) on 
G vanishing at infinity is approximated uni- 
formly by f(y), which is the Fourier transform 
of f e L,(G). 
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J. Positive Definite Functions 


A function (x) defined on G is said to be 
positive definite (or of positive type) if the in- 
equality 25 ,-1 9(xjx, !)ajà, 2: 0 holds for arbi- 
trary elements x,,...,x, in G and arbitrary 
complex numbers «,,...,a,. We denote by Pg 
the set of all positive definite functions on G. If 
$9 € P;, then o(e) 0 (e is the identity of G), 
Ioe(x)| € o(&), and w(x !)2 o(x). If G is locally 
compact, we further assume that o € P, is 
measurable with respect to the Haar measure 
of G. Then for any f € L,(G), 


J | ecochronrona: dy 2:0. 


Any e P; is equal almost everywhere to a 
continuous @, € Pg. (Concerning positive de- 
finite functions on locally compact groups and 
their relation with unitary representations — 
437 Unitary Representations B.) 


K. Harmonic Analysis and the Duality 
Theorem 


If G is a locally compact Abelian group, then a 
function (x) on G belongs to P, if and only 

if there exists a nonnegative measure u(Ĝ)< 

oo on G such that p(x) = fc x(x)du(x). (When 
G — R!, this theorem is tBochner’s theorem, 
whereas when G =Z, it is Herglotz’s theo- 
rem.) Hence we can prove a spectral resolu- 
tion of unitary representations of G: U(x)= 
fex(x) dE(y) (a generalization of *Stone’s 
theorem). If fe L,(G) Pg, then f(y) 20, fe 
L,(G), and the inversion formula of Fourier 
transforms f(x) — Le x(x) f(y) dx holds, provided 
that the Haar measure on G is suitably chosen. 
If fe L,(G)' L,(G), then fe L,(G), and Par- 
seval’s identity Le Ulf dx = fel fG)? dy holds. 
If we put Uf — f and Vf — f, then U is extend- 
able uniquely to an isometry of L,(G) onto 
L(G) and V is extendable uniquely to its 
inverse transformation, respectively (Plan- 
cherel’s theorem on locally compact Abelian 
groups). By the inversion formula and Plan- 
cherel's theorem, we can prove that the char- 
acter group G of G is isomorphic to G as a 
topological group. This is called the Pon- 
tryagin duality theorem of locally compact 
Abelian groups (— 422 Topological Abelian 
Groups). In particular, if G is compact, G is a 
discrete Abelian group. Then we can normal- 
ize the Haar measure of G and G so that the 
measure of G and the measure of each element 
of G are 1. Plancherel's theorem implies that 
the set of characters of G is a tcomplete tortho- 
normal set in L(G). 
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L. Poisson's Summation Formula 


Suppose that G is a locally compact Abelian 
group, H is its discrete subgroup, and G/H is 
compact. Then the tannihilator T of H is a 
discrete subgroup of G. For any continuous 
function f(x) on G, if £e f(xy) is convergent 
absolutely and uniformly (hence fe L,(G)) 
and Fere is convergent absolutely, then 
Zale hia O: where c is a constant 
depending on the Haar measures of G and G. 
This is called Poisson’s summation formula on 
a locally compact Abelian group (a generali- 
zation of tPoisson’s summation formula on 
G=R). 


M. Closed Ideals in L,(G) 


In the following, the group operations in G 
and G are denoted by +, and the value of the 
character y(x) (xe G, y e G) is written as (x, y). 

For a function f defined on G and any yeG, 
the translation operator q, is defined by t, f(x) 
— f(x — y). A closed subspace of L,(G) is an 
ideal in L,(G) if and only if it is invariant 
under all translations (N. Wiener). A closed 
ideal I coincides with L,(G) if and only if the 
set of zeros of I, i.e., Z(I) 2 (^ re, f ^ (0), is 
empty (Wiener's Tauberian theorem). A closed 
ideal J is maximal if and only if Z(I) consists of 
a single point. If the dual G of G is discrete, 
then the closed ideals in L,(G) are completely 
characterized by the zeros; that is, tspectral 
synthesis is possible, but this situation does 
not hold generally. P. Malliavin's theorem 
states that if G is not discrete, then there exists 
a set E in G and two different closed ideals I 
and J such that Z(I) — Z(J)- E. Such a set E is 
called a non-S-set. For example, if G — R?, the 
unit sphere is a non-S-set (L. Schwartz). 


N. Operating Functions 


Denote by A(G) the set of all Fourier trans- 
forms of the functions in L,(G). Let fe A(G) 
and ® an analytic function in a neighborhood 
of the range of f. Furthermore, assume that 
®(0)=0 if G is not discrete. Then there exists 
a ge A(G) such that 9(y)= (f(y) for ye G 
(Wiener-Lévy theorem). In general a function 
® defined on a set D in the complex plane is 
said to operate in a function algebra R or to be 
an operating function on R if ( f)e R for all 
f€R whose range lies in D. The converse of 
the Wiener-Lévy theorem holds in the follow- 
ing form. Let G be an infinite Abelian group 
and 6 a function on the interval [ —1, 1]. If ® 
operates in A(G), then 9 is analytic in a neigh- 
borhood of the origin if G is compact, and 
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analytic in a neighborhood of [ —1,1] if G is 
not compact [14, 15]. Let E be a subset of G, 
and I, the set of all fe A(G) such that f=0 on 
E. Let A(E)= A(G)/Ig be the quotient algebra. 
The set E is called a set of analyticity if every 
operating function on A(E) is analytic. For a 
characterization of such a set — [11]. 


O. Measure Algebras 


For a locally compact Abelian group G, let 

M (G) be the set of all tregular bounded com- 
plex measures. For 4 and um M(G), the 
convolution 4 u is defined by (A * p) (E)= 

Le AG — y): du(y), where E is a Borel set in 

G. Then M(G) is a semisimple commutative 
Banach algebra whose product is defined to be 
the convolution. The Fourier-Stieltjes trans- 
form of ue M(G) is defined by 


a=] (x,y)du(x),  yeG. 
G 


A continuous function on G is positive definite 
if and only if it is the Fourier-Stieltjes trans- 
form of a positive measure in M(G) (Bochner's 
theorem). Assume that G is not discrete. A 
function on the interval [ —1,1] that oper- 
ates in the Fourier-Stieltjes transforms of 
measures in M(G) can be extended to an entire. 
function, and a function on the whole complex 
plane that operates in the tGel’fand repre- 
sentation of M(G) is an entire function [15]. 
From this fact, it follows that M(G) is asym- 
metric and nonregular. Furthermore, there 
exists a measure ue M(G) such that fi(y) 2 1 
but 1/4 is not a Fourier-Stieltjes transform of 
M (G). (See also Wiener and Pitt [16], Shreider 
[17], and Hewitt and Kakutani [18]; for the 
general description of measure algebra, see 
Rudin [11] and Hewitt and Ross [12].) 


P. Idempotent Measures 


A measure ue M (G) is called idempotent if p  j 
=p, that is, fi(y)=0 or 1 for all yeG. Then fi is 
the characteristic function of the set {ye lëtz 
— 1). The smallest ring of subsets of G that 
contains all open cosets of subgroups of G is 
called the coset ring of G. The characteristic 
function of a set E in G is the Fourier-Stieltjes 
transform of an idempotent measure in M(G) 
if and only if E belongs to the coset ring of G 
[19]. A simple proof of this theorem is given 
by T. Ito and I. Amemiya (Bull. Amer. Math. 
Soc., 70 (1964)). When G is the unit circle, the 
coset ring consists of sequences periodic except 
at a finite number of points, and for this case 
the theorem was obtained by H. Helson. Let 
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Ny, n5, ..., n be distinct integers and du(x) — 
Zem: dx. Then u is an idempotent mea- 
sure on the unit circle. J. E. Littlewood conjec- 
tured that the norm of u exceeds clogk, where 
c is a positive constant not depending on the 
choice of (njj. A partial answer was given 

by P. J. Cohen [19] for compact connected 
Abelian groups and was improved by H. 
Davenport (Mathematika, 7 (1960)) and E. 
Hewitt and H. S. Zuckerman (Proc. Amer. 
Math. Soc., 14 (1963)). 


Q. Mappings of Group Algebras 


Let G and H be two locally compact Abelian 
groups and o a nontrivial homomorphism of 
L,(G) into M(H). Associated with ọ there is a 
mapping o? of a subset Y of H into G such 


^ 


that o(f)(y) 9 f(o*(y)) for ye Y and =0 for 
yé Y, or symbolically oC f) 2 f(o*). A con- 
tinuous mapping « of Y into G is said to be 
piecewise affine if there exist a finite number 
of mutually disjoint sets S, j— 1, ..., n, in the 
coset ring of H and mappings a, such that (i) 
Y — ( J., S; (ii) x; is defined on an open coset 
K; of H, where K;>S;; (iii) x; a on S; and. 
(iv) ay +y — y") & ag (y) + aly) — og") for all y, 
yy e Ky je 1 ...,n. P. J. Cohen’s theorem is: 
If o is a homomorphism of L,(G) into M(H), 
then Y belongs to the coset ring of H and ¢* is 
a piecewise affine mapping of Y into G. Con- 
versely, for any piecewise affine mapping o, 
there is a homomorphism q of L,(G) into 
M(H) such that o* =x. Related theorems have 
been studied by A. Beurling, H. Helson, J.-P. 
Kahane, Z. L. Leibenson, and W. Rudin [11]. 


R. Exceptional Sets 


Let G be a locally compact Abelian group. A 
subset E is said to be independent if n, x, + 
cb nQx, —0, where the n; are integers and ve 
E implies n;x, — 0, j— 1,...,k. A set E in G is 
called a Kronecker set if for every continuous 
function o on E of absolute value 1 and e>0 
there exists a ye G such that |g(x) — (x, y)| <E, 
xe E. Every Kronecker set is independent and 
of infinite order, but independent sets are not 
necessarily Kronecker sets. For a group G 
whose elements are of finite order p, a set E is 
called of type K, if for every continuous func- 
tion 9 on E with values exp(2zik/p), k —0, ...,p 
— |, there is a ye G such that y=) on E. If E is 
a compact Kronecker set in and yi is a mea- 
sure with support in E, i.e., ue M(E), then Wu? 
= UO. A compact set E is called a Helson set 
if there is a constant C such that [uf <C [| ,, 
for ue M(E). Every K, set is also a Helson set. 
For a Helson set E, C(E)— A(E). A discrete 


724 


analog of a Helson set is a Sidon set. A subset 
F of a discrete group G is called a Sidon set if 
there is a constant C such that X, |a,| < 
Csup,|X.,. a, (x, y)] for every polynomial 

È a(x, 7). For example, a flacunary sequence 
Ut]. ni zg 1, of integers is a Sidon set. 
These sets are deeply connected with har- 
monic analysis on groups, and measures con- 
centrated on these sets have some unexpected 
pathological properties (— e.g., [11, 13]). 


S. Tensor Algebras and Group Algebras 


Let X and Y be compact Hausdorff spaces, 
and denote by V(X, Y) the projective tensor 
product C(X) & C(Y) of continuous func- 
tion spaces C(X) and C(Y). The norm of 

P(x, Y= LZ, fj()g;(y) is defined by Vol 

—inf 27, | fill. gil, where the infimum is 
taken for all expressions of o. If G is an infi- 
nite compact group, then there exist two sub- 
sets K, and K, such that (1) K, and K, are 
homeomorphic to the *Cantor ternary set; (ii) 
the expression y, +y, of an element of E= K, 
+ K, is unique, where y, € K, and y, € K;; (iii) 
K, NK, x and (iv) K, UK, ts a Kronecker 
set or a set of type K, for some p. Varopoulos’ 
theorem states that the algebra V(K,, K;) is 
isomorphic to A(E) which denotes the algebra 
of restriction of functions in A(G) on the set E. 
By this theorem, the problems of spectral 
synthesis and operating functions of group 
algebras are transformed into problems of 
tensor algebras. For a more precise discussion 
— [20]. 
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193 (X.29) 
Harmonic Functions and 
Subharmonic Functions 


A. General Remarks 


A real-valued function u of tclass C? defined in 
a domain D in the n-dimensional Euclidean 
space R” is called harmonic if it satisfies the 
Laplace equation 


^u Cu 
Au(P) kk LP ten, Ell 
ÓX1 OX, 


in D. A harmonic function is, by definition, 
twice continuously differentiable, but turns out 
to be real analytic. It is not true, however, that 
the solutions of the Laplace equation are real 
analytic. For example, for the function u(x, y) 
=Reexp(—z *)(z=x+iy #0), u(0,0)=0, uxx 
and u,, exist everywhere, and u satisfies the 
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Laplace equation, but is not continuous at the 
origin. 

The fundamental properties of harmonic 
functions do not depend essentially on n. 

A real-valued function u of class C? satisfy- 
ing the inequality Au >0 is called subharmonic. 
For a more general definition of subharmonic 
functions and their properties — Sections P- 
U. 


B. Invariance of Harmonicity 


Harmonicity in R? is invariant under any 
tconformal transformation. Namely, when 
there exists a conformal bijection sending a 
domain D in the xy-plane onto a domain D' in 
the €y-plane, every harmonic function u(x, y) 
on D is transformed into a harmonic function 
of (¢,7) on D'. In R” for n> 3, harmonicity is 
not generally preserved under conformal 
transformations. However, harmonicity is 
preserved in the following special case: Let D 
be a domain in R” (n 2 3), and consider the 
inversion f :D— D' defined by f(x,,...,x,)— 
Di, x) = (a2 x, /F?, ..., a? x, r2) r  (x24- 

c X2), Let u(x,..., x,) be a harmonic 
function on D, and let v(x‘, ..., x;) be the func- 
tion on D' obtained by applying the Kelvin 
transformation to u. Namely, v(x}, ..., x;) — 
(a/r' Y" ^u(a? x'V/r^, ..., a? x, /r?), where r? = 
X? M... x2. Then the function v is harmonic 
on D'. A function u that is harmonic outside 

a compact set is called regular at the point 

at infinity if any Kelvin transform of u is har- 
monic in a neighborhood of the origin, in 
which case u(P)0 as OP co. Now let T: 

Xy = Xy(X3, ..., x, | X k <n, be a one-to-one 
analytic transformation of a domain D' onto 
another domain D. If there exists a posi- 

tive function q(x,, ..., x;) in D’ such that 
Qs iss X MC Qe Xh eor XUI E 
harmonic for any harmonic function u(x,, 
...,X,) in D, then T is conformal. A conformal 
transformation as it is known in differential 
geometry is either (i) a ‘similarity transforma- 
tion, (ii) an inversion with respect to a sphere 
or a plane, or (iii) a finite combination of trans- 
formations of types (i) and (ii). 


C. Examples of Harmonic Functions 


(1) If u is a polynomial in x,, ..., x, and har- 
monic in R”, then the terms of degree k in u 
form a harmonic function for each k>0. A 
harmonic homogeneous polynomial is said to 
be a spherical harmonic. (2) logr in R? and ri 
in R” (nz 3) are harmonic except at r =0. (3) 
Every tlogarithmic potential in R? and every 
*Newtonian potential in R” (n> 3) is harmonic 
outside the fsupport of the measure. Con- 
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versely, any harmonic function defined on a 
domain D is represented in an arbitrary rela- 
tively compact domain D' in D as the sum of 
a logarithmic (n= 2) or Newtonian (n> 3) 
potential of a measure on ôD’ and the poten- 
tial of a "double layer. (4) Both the real part u 
and the imaginary part v of an analytic func- 
tion of a complex variable are harmonic. We 
call v a conjugate harmonic function of u. If u is 
harmonic on a fsimply connected domain D, 
then the conjugate v of u is given by 


- (x,y) Q 5 
v(x, »-| Zu ese) + constant, 
(a,b) y x 


where (a, b) is a fixed point in D and the path 
of integration is contained in D. When D isa 
*multiply connected domain, v may take many 
values in accordance with the "homology 
classes of the paths of integration. 


D. Green's Formulas 


In the following one should substitute “curve” 
for the term “surface” when n=2. Let D bea 
bounded domain whose boundary S consists 
of a finite number of closed surfaces that are 
piecewise of class C!. Let u and v be harmonic 
in D, and suppose that all the first-order par- 
tial derivatives of u and v have finite limits 

at every boundary point. We call D the inside 
of S. Let n be a normal on S toward the out- 
side of D. Then the relation 


| (ui uso (1) 
s\ On ôn 


follows immediately from *Gauss’s formula, 
where do is the surface element on S. In par- 
ticular, when v is identically equal to 1, for- 
mula (1) gives 


| 2 de =0. (2) 
S 


on 


Equations (1) and (2) are called Green's and 
Gauss's formulas, respectively. Conversely, u is 
harmonic in D if u is a function of class C? 

in D and at every point pe D, there is a se- 
quence {r,' decreasing to zero and such that 
[sco (Ou/0n) de =0 (k= 1,2, ...), where S(P, r) 
is the spherical surface with center at P and 
radius r,. Another sufficient condition for u to 
be harmonic is that u is of class C' and, at 
every point Pe D, there be an rp — 0 such that 
Lan. (0u/0n)da =0 for every r, 0 «r «rp 
(Koebe, Bocher). 


E. Mean Value Theorems 


We assume that u is a harmonic function, D 
the domain of definition of u, and S the bound- 
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ary of D. The mean value of u on the surface 
or the interior of any ball in D is equal to the 
value of u at the center of the ball. Namely, 


u(P) 2 — ideo | udo, 
Laf JB(P,n Ont S(P.r) 


where 1, and oc, are the volume and surface 
area of a unit ball in R”, respectively, B(P, r) is 
the open ball with center at P and radius r, 
and d: is the volume element. These relations 
are called mean value theorems. Conversely, if 
v is continuous in D and at every point Pe D, 
there is a sequence {r,} decreasing to zero and 
such that the mean value of v over B(P,r,) or 
S(P, r,) is equal to v(P) for each k, then v is 
harmonic in D. This result is called Koebe's 
theorem. From the mean value theorems the 
maximum principle for harmonic functions 
follows: Any nonconstant u assumes neither 
maximum nor minimum in D. If both u and v 
are harmonic in D and have the same finite 
boundary value at every point on S, then uzv 
in D by the maximum principle. This is called 
the uniqueness theorem. 


F. Boundary Value Problems 


The first boundary value problem (or Dirichlet 
problem) is the problem of finding a harmonic 
function defined on D that assumes boundary 
values prescribed on S (— 120 Dirichlet Prob- 
lem). The second boundary value problem (or 
Neumann problem) is the problem of finding a 
harmonic function u whose normal derivative 
Ou/On is equal to a function f prescribed on the 
piecewise smooth boundary S. The solution, if 
it exists, is uniquely determined up to an addi- 
tive constant. In order for the solution to exist, 
f should satisfy the condition Le fdo —0. The 
third boundary value problem is the problem of 
finding a harmonic function u on D that satis- 
fies Qu/Ón — hu + f on S, where h and f are 
functions prescribed on S. All these problems 
can be reduced to certain *Fredholm integral 
equations. There is also the boundary value 
problem of mixed type, in which the boundary 
values are prescribed in a part of S and the 
normal derivatives are prescribed on the rest. 


G. The Poisson Integral 


Let D be a bounded domain with smooth 
boundary S and u a function harmonic in 
D and continuous on DUS. Let G(P, Q) be 
Green's function in D. Then (1) yields 


QG(P, 
! | TO 90^ 9 1,0) 


S ông 
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In particular, if D = B(0, r), then 
r?— 0P? | u(Q) 
s 





u(P) = LE do(Q). 


(0,r) PQ" 
Conversely, given an integrable function f on 
S(0, r), we set 


r? —OP? | f(Q) 
S(0,r) PQ" 


Then u(p) is harmonic in B(0, r) and converges 
to f(Q) as P tends to any point Q on S(0, r) 
where f is continuous. We call u a Poisson 
integral. Sometimes it is possible to represent a 
harmonic function u in B(0, r) in the following 
form, which is more general than the Poisson 
integral: 


o,f 





u(P)= do(Q). 


Opt 


2 np? 
u(p)= 2 | di) H 
du s, PQ 
where a is a signed "Radon measure on 
S(0, r). In order for u to admit such a repre- 
sentation, it is necessary and sufficient that 
Ís, lulde be a bounded function of r’ for 0< 
r' <r, or equivalently, that the tsubharmonic 
function |u| have a tharmonic majorant. Fur- 
thermore, if « is absolutely continuous, then 
the Poisson integral representation of u is 
possible, and vice versa. A necessary and suffi- 
cient condition for the function u to admit the 
Poisson integral representation is that there 
exist a positive convex function g(t) on t0 
such that o(t)/t— oo as t> oo and ¢(|u|) has a 
harmonic majorant. 

When D is a general domain in which 
Green's function exists, every positive har- 
monic function u(P) is represented uniquely as 
the integral | K(P, Q)du(Q), where K(P, Q) is a 
*Martin kernel and u is a Radon measure on 
the Martin boundary B whose support is 
contained in a certain essential part of B, each 
point of which is called a minimal point. A 
similar integral representation appears in the 
theory of Markov processes (— 260 Markov 
Chains I). The representation f| K(P, Q)du(Q) is 
a generalization of (3). In terms of a function 
similar to g(t), we can give a necessary and 
sufficient condition for u(P) to be represented 
in the form f K(P, Q)f(Q)dv(Q), which corre- 
sponds to the Poisson integral representa- 
tion and in which v is determined by 1 — 
[K(P, Q) dv(Q). This condition is equivalent 
to the condition that u(P) be quasibounded, 

Le, that there exist an increasing sequence of 
bounded harmonic functions that converges to 
u [7]. 

A positive harmonic function u is said to be 
singular if any nonnegative harmonic minorant 
of u vanishes identically. Every positive har- 
monic function can be expressed as the sum of 
a quasibounded harmonic function and a 
singular one. 
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H. Expansion 


Let Py —(x9,..., x?) be a point in D, and de- 
note the distance from P, to S by r. Then a 
harmonic function u is expanded uniquely into 
a power series 


0 n 
Ya, E x X1 — xq) (x, — xp) ; 
ki,- k 29, 


in B(Po, Se — j)r). Thus u is (real) analytic in 
D. M u vanishes on an open set in D, then u=0 
in D. If the power series is written as 2, A, with 
spherical harmonics h, of degree k — 1, 2, ..., 
then this series converges over all of B(P,, r). 


I. Sequences of Harmonic Functions 


In this section, {um} is a sequence of harmonic 
functions in a bounded domain D. First, if 
each u,, is bounded and continuous on DUS 
and {u,,} converges uniformly on S, then 

{um, converges uniformly in D, and the limit- 
ing function u is harmonic in D. Moreover, 
Qt thy, [0xli ... Ox converges to OT "eu 
ach)... ÓxF uniformly on any compact subset 
of D (Harnack's first theorem). Second, if u; < 
ve... in D and there is a point of D at which 
{um is bounded, then {u,,} converges uni- 
formly on any compact subset of D (Harnack's 
second theorem). The following Harnack's 
lemma is useful: If u is positive harmonic in D, 
P, is a point of D, and K is a compact subset 
of D, then there exist positive constants c and 
c', depending only on P, and K, such that 
cu(Po) € u(P) € c'u(P5) on K. 

Any family of (locally) uniformly bounded 
harmonic functions is *normal. A family of 
positive harmonic functions that is bounded 
at a point is also normal by Harnack's lemma. 
If [5|u, —u,|" dx 0 as k, moo for p>1, then 
*Hólder's inequality implies that {um} con- 
verges uniformly on any compact subset of D. 
It follows that if [5 |grad(u, —u,)|" dx —0 as k, 
m- oo and {u,} converges at a point in D, then 
there exists a harmonic function u in D such 
that f |grad(u,, — u)|? dz 50 as m> oo and um 
converges to u uniformly on any compact 
subset of D, where P, is any point in D. Finally, 
if | plu,’ dx (pz 1) are bounded, then {u,} 
forms a normal family. 


J. Level Surfaces and Orthogonal Trajectories 


The set {P |u(P) — constant] is called a level 
surface (niveau or equipotential surface). When 
ais given as the constant, the level surface is 
called the a-level surface. Assume that u is not 
a constant. A point where grad u vanishes is 
called critical. The set of critical points consists 
of at most countably many treal analytic 
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manifolds of dimension <n—2 (n=dimD, and 
a manifold of dimension 0 is understood to be 
a point). Any compact subset of D intersects 
only a finite number of such manifolds; we 
express this fact by saying that the manifolds 
do not cluster in D. Each of these manifolds is 
contained in a certain level surface. The 
complement of the critical points with respect 
to any level surface consists of real analytic 
manifolds of dimension n — 1 that do not 
cluster in D. 

For each noncritical point there exists an 
analytic curve passing through it such that 
gradu is parallel to the tangent to the curve at 
each point on the curve. A maximal curve with 
this property is called an orthogonal trajec- 
tory (or line of force). Along every orthogonal 
trajectory, u increases strictly in one direction 
and hence decreases in the other, so that none 
of the orthogonal trajectories is a closed curve. 
There is exactly one orthogonal trajectory 
passing through any noncritical point. There- 
fore no two orthogonal trajectories intersect, 
and no orthogonal trajectory terminates at a 
noncritical point. Moreover, the set of limit 
points of any orthogonal trajectory in each 
direction does not contain any noncritical 
point. When u is a Green's function G(P, Q), 
every orthogonal trajectory is called a Green 
line, and a Green line that originates at the 
pole Q and along which u decreases to 0 is 
called regular. For any sufficiently large a, the 
a-level surface X, is an analytic closed surface 
homeomorphic to a spherical surface. Let E be 
a family of orthogonal trajectories originating 
at the pole. If the intersection A of E and a 
closed level surface Z, is an (n — 1)-dimensional 
measurable set, then the tharmonic measure 
of A at Q with respect to the interior of 2, is 
called the Green measure of E. M. Brelot and 
G. Choquet proved that all orthogonal trajec- 
tories originating at the pole except those 
belonging to a family of Green measure zero 
are regular. Consider a domain D bounded by 
two compact sets, and denote by u the har- 
monic measure of one compact set with re- 
spect to D. Assume that u is not a constant. 
Then u changes from 0 to 1 along all ortho- 
gonal trajectories except those belonging to a 
family that is small with respect to a measure 
similar to the Green measure (see "flux" de- 
fined in Section K). 


K. Harmonic Flows 


Denote by 3. the a-level surface for a harmonic 
function u, and by E the complement of the 
set of critical points with respect to 2. Let o 
be an (n — 1)-dimensional domain in Z? such 
that the (n — 2)-dimensional boundary of ø is 
piecewise of class C?. Suppose that u assumes 
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the value b (>a) on each orthogonal trajec- 
tory passing through c. Consider the union of 
orthogonal trajectories that pass through o. 
The subset of this union on which u assumes 
values between a and b forms a set called a 
regular tube. The parts of the boundary corre- 
sponding to a and b are called the lower and 
upper bases of the tube; accordingly, o is the 
lower base. The integral ((0u/On)dc on any 
section (i.e., the part of a level surface in the 
tube) is constant and is called the flux of the 
tube. The family of orthogonal trajectories 
passing through an (n — 1)-dimensional domain 
(not necessarily bounded by a smooth bound- 
ary) in X2 is called a harmonic flow, and a 
subfamily is called a harmonic subflow if its 
intersection A with X? is measurable (in the 
(n — 1)-dimensional sense). The flux of a har- 
monic subflow is defined to be | ,(0u/On)do. 
Then the Green measure of family E of Green 
lines originating at the pole is equal to the flux 
of E divided by o,. We can compute the exact 
value of the *extremal length of any harmonic 
subflow. 


L. Isolated Singularities 


Let u be harmonic in an open ball except at 
the center O. It is expressed as the sum of a 
function h(P) harmonic in the entire ball and 
X2 HQ(PyOP?"*!, where H,, is a tspherical 
harmonic of degree m. If OP*u(P)—0 as P>O 
for az 0, then u(P) is equal to h(P) - c/OP +... 
+ H,(P/OP?"*! with m«a- 1. In particular, 
if u is bounded in a neighborhood of O, then O 
is a removable singularity for u. If u is bounded 
above (below), then u(P) — h(P) - c/O P" ?, 
where c <0 (c z 0). (When n —2, we have u(p) 
=h(p)+clog1/OP. For the removability of a 
set of capacity zero — 169 Function-Theoretic 
Null Sets). 

If u is harmonic near the point at infinity, 
i.e., outside some closed ball, then 


H,(P) 
Op? D 


u(P)— 2 
where the first sum is regular at the point at 
infinity and U,, is a spherical harmonic of 
degree m. If OP *u(P) 50 as OP— with 

22: 0, then U,,=0 for all m a. If u is bounded 
above or below, then U,,=0 for all mz 1. If u is 
harmonic in R” and OP *u(P)—0 as OP oo 
with x z 0, then u is a polynomial of degree m 
(<a). If u is harmonic and bounded above or 
below in R”, then u is constant. Brelot called a 
function u harmonic at the point at infinity if 


H,(P) 


u(P) — constant + 2 (opm 


(note that mz 1) near the point at infinity [1]. 
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M. Harmonic Continuation 


If u vanishes in a subdomain of D, then u=0 in 
D. If u, is harmonic in D, (k= 1,2), D; D; 4 
Qj, and u, =u, in D, N D,, then u, and u, de- 
fine a harmonic function in D, U D}. If the 
boundaries of mutually disjoint domains D, 
and D, have a surface S, of class C! in com- 
mon, u, is harmonic in D, (k 21,2), u; —u; 
on So, and 0u,/OÓn and — Ou, /ün exist and 
coincide on Sp, then u, and u, define a har- 
monic function in the domain D, U Sọ U D,. 
We express this fact by saying that one of u, 
and u, is a harmonic continuation of the other. 
It follows that u=0 in D if the boundary of 
. D contains a surface S, of class C! and u = 
Ou/On =0 on Sy. Consider the case n =2. If the 
boundary of a Jordan domain contains an 
analytic arc C and u (or du/én) vanishes on C, 
then a harmonic continuation of u into a cer- 
tain domain beyond C is possible. If n — 3, 
however, nothing is known except in the case 
where Sp is a part of a spherical surface or a 
plane and u 20 (or 6u/On — 0) on Sp. 
Boundary values of u do not always exist, 
but in some special cases, u has limits. For 
instance, a positive harmonic function in a ball 
has a finite limit at almost every boundary 
point Q if the variable is restricted to any 
angular domain with vertex at Q. 


N. Green Spaces 


As a generalization of *fRiemann surfaces, 
Brelot and Choquet introduced &-spaces [3]. 
It is required that & be a separable connected 
topological space and satisfy the following two 
conditions: (i) At each point P there exists a 
neighborhood V, of P and a homeomorphism 
between V, and an open set V; in the tAlexan- 
drov compactification R"U {00 }; (ii) if A = V» N 
V», 7 Ø and A, is the part of Vp, that corre- 
sponds to A (k= 1,2), then the correspondence 
between A} and A; via A is a conformal (pos- 
sibly with the sense of angles reversed) trans- 
formation when n=? and an 'isometric trans- 
formation (which keeps oc invariant) when 
nz 3. If a tGreen’s function exists on &, then 

6 is called a Green space. Harmonic functions 
and the fDirichlet problem on a Green space 
have been discussed from various points of 
view. 


O. Biharmonic Functions 


A function v is called polyharmonic if A*v —0 
(kz 2) and biharmonic if AA» — 0; sometimes, 
polyharmonic functions are also called bi- 
harmonic. A biharmonic function in a plane 
domain D is written as Re(zf(z)+ g(z)), where f 
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and g are complex analytic in D (Goursat's 
representation). Biharmonic functions are used 
in the theory of elasticity and hydrodynamics. 


P. Subharmonic Functions 


Let D be a tdomain in the n-dimensional Eu- 
clidean space R” (n 22). A real-valued function 
u(P) in D is called subharmonic if (1) —oo € 
u< +00, u# —o0; (2) u is tupper semicon- 
tinuous; and (3) at every point P, of D, the 
mean value of u over the surface of any closed 
tball in D with center at P, is not smaller than 
u(Pp), Le, 


1 
ul Po) € ———- fe do = L(P,,r), 


or 


where o, is the area of the surface of a unit ball 
in R”. Condition (3) can be replaced by: (3) 
The mean value A(P,, r) of u over the closed 
ball is >u(P,). In order that an upper semi- 
continuous function u be subharmonic it is 
necessary and sufficient that, for any sub- 
domain D' of D and for any harmonic function 
hin D. the maximum principle hold for u — h. 

We call —u superharmonic when u is sub- 
harmonic. A harmonic function is subhar- 
monic and superharmonic. The converse is 
also true (— Section E). 

When u is of tclass C?, then u is subhar- 
monic if and only if 


ENT u 

Au=—>+...42520 (Pz(x,...x) 

dx? x 
When u is an upper semicontinuous function 
that is not necessarily differentiable, u is sub- 
harmonic if and only if Au interpreted as a 
tdistribution is a positive tmeasure. 

Ifu,,...,u, are subharmonic and a,,..., a 
are positive constants, then a; u; +... + au, 
and max(u, (P), u (P), ... ,u,(P)) are subhar- 
monic. If a subharmonic function u is replaced 
by the * Poisson integral for the boundary 
function u inside a closed ball in D, then the 
resulting function in D is subharmonic. H f(t) 
is a monotone increasing convex function, 
then f(u) is subharmonic. If v>0 and logv is 
subharmonic, then v is subharmonic. If f(z) is 
a holomorphic function of the complex vari- 
able z and 470, then Alog|/(z)| and hence 
|/(z))^ are subharmonic. If h is harmonic, then 
|h| is subharmonic. Any tlogarithmic potential 
(n= 2) or *Newtonian potential (n> 3) is super- 
harmonic in R”. 


Q. Properties of Mean Values 


Condition (3) (resp. (3)) can be replaced by the 
condition that there exists an r(P5) — 0 at any 
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Py such that u(P,) € L(Ps, r) (u(Po) < A(Ps, r)) 
for every r, 0<r<r(Po). The relation —oo < 
A(Po, r) € L(Ps, r) always holds, and both 
A(Po,r) and L(Ps, r) decrease to u(P,) as r|O. 
On any ‘compact subset of D, u is tintegrable. 
Both A(P5, r) and L(P,, r) increase with r and 
are convex functions of --logr (n=2) and ri " 
(n> 3); hence they are continuous functions of 
r. If D' is relatively compact in D and rẹ is the 
distance between OD and 0D, then A(P,r) isa 
continuous subharmonic function of P in D', 
where r is fixed in the interval (0, rọ). By taking 
the average of A(P,r) k times, a subharmonic 
function of class C* is obtained that decreases 
to u as r|O. If o,((x? + ... -- x2)!?) is suitably 
chosen, then the fconvolution u * ọ, is a sub- 
harmonic function of class C” and decreases 
to u as r|O. 


R. Sequences of Subharmonic Functions 


The limit of a decreasing sequence or a 
downward-directed net of subharmonic func- 
tions is subharmonic or equal constantly to 
—oo. The limit of a uniformly convergent 
sequence of subharmonic functions is sub- 
harmonic. If u,,u,,... are subharmonic, then 
max(u,,...,u,) is subharmonic for every k, but 
sup(u, uz, ...) may not be subharmonic. Let U 
be a family of subharmonic functions in D that 
are uniformly bounded above on every com- 
pact subset of D. Then the upper envelope of 
U, i.e., the function defined by sup, uu in D, 
coincides with a subharmonic function except 
on a set of tcapacity zero. 


S. Harmonic Majorants and Riesz 
Decompositions 


Suppose that we are given a subharmonic 
function u in D. If there is a harmonic function 
h satisfying hz u in D, then h is called a har- 
monic majorant of u. When there is a harmonic 
majorant of u, there exists a least one among 
them, denoted by hp. For any relatively com- 
pact subdomain D' of D, hy, always exists and 
equals the *Perron-Brelot solution in D' for the 
boundary function u. As D' increases to D, hp 
increases to a function h that is either har- 
monic or equal constantly to oo. If h exists, it 
coincides with h, and hence h is harmonic. 
Conversely, if h is harmonic, then hj exists and 
equals h. Generally, there is a unique "Radon 
measure p in D with the following property: 
Let 6 be any subdomain of D such that h; and 
the tGreen’s function G; exist in 6 (6 may 
coincide with D). Then h;—u is equal to the 
potential f; G; du, and u —h; — f, G; dp. In gen- 
eral, a representation of a superharmonic 
(subharmonic) function as the sum (difference) 
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of a harmonic function and a potential is 
called a Riesz decomposition. 


T. Boundary Values 


Let D be a domain in which a Green’s function 
exists, and consider the tfine topology on the 
*Martin compactification of D. Any negative 
subharmonic function u in D has a finite limit 
with respect to the fine topology at every point 
of the *Martin boundary A except at the points 
of a subset of A of harmonic measure zero 
(J. L. Doob). When D is a ball, u has a limit 
in the ordinary sense along almost every 
radius. However, it may happen that even if u 
is bounded there exists no angular limit at 
any point of the boundary. If the mean value 
of |u| on every concentric smaller ball in D is 
bounded, then u can be decomposed into the 
sum of a nonnegative harmonic function and a 
negative subharmonic function by the Riesz 
decomposition, and hence u has a limit both 
radially and with respect to the fine topology. 

Let D be a domain and K be a compact 
subset of D of tcapacity zero. If a function u is 
subharmonic and bounded above in D — K, 
then u can be extended to be a subharmonic 
function in D. A function that is equal to a 
subharmonic function almost everywhere is 
called almost subharmonic, and an almost 
subharmonic function satisfying condition (3’) 
is called submedian. 

Subharmonic functions can be discussed in 
a space more general than R”, e.g., a *Riemann 
surface (n= 2), or more generally, an !&-space 
of dimension n ( 2 2) in the sense of Brelot and 
Choquet. 


U. The Axiomatic Treatment 


*Newtonian potentials were the main object of 
interest in the early stages of fpotential theory. 
A major part of potential theory can be dis- 
cussed on the basis of the theory of superhar- 
monic functions [8]. For example, a *polar set 
is defined as a set on which some superhar- 
monic function assumes the value oo, and a set 
X is ‘thin at a point P5 4 X if and only if P, has 
a positive distance from X or there exists a 
superharmonic function v(P) in a neighbor- 
hood of P, such that lim sup v(P)> v(P,) as 
Pe X tends to Py. Moreover, we can discuss 
balayage, define potentials, and obtain Riesz 
decompositions. Generalizing results of Doob 
(1954) and starting from a family of harmonic 
functions defined axiomatically in a locally 
compact Hausdorff space, M. Brelot defined 
superharmonic functions and potentials and 
discussed balayage, Riesz decompositions, and 
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the fDirichlet problem (1957). Further pro- 
gress in axiomatic potential theory has been 
made by Brelot, H. Bauer, C. Constantinescu, 
A. Cornea, and others [11, 14]. 
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194 (VII.11) 
Harmonic Integrals 


A. Introduction 


*De Rham's theorem shows that the coho- 
mology group with real coefficients of a tdif- 
ferentiable manifold of class C® is isomorphic 
to the cohomology group of the cochain com- 
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plex of *differential forms with respect to the 
exterior derivative d. Thus every element of the 
cohomology group can be represented by a 
class of closed differential forms. Harmonic 
forms enable us to choose one definite dif- 
ferential form in each cohomology class. The 
theory of harmonic forms, called the theory of 
harmonic integrals, is modeled after the theory 
of holomorphic differentials and their inte- 
grals (Abelian integrals) in function theory 
[2,4,5,8]. 


B. Definitions 


Let X be an oriented n-dimensional differ- 
entiable manifold of class C® with a *Rie- 
mannian metric ds? of class C® (— 105 Differ- 
entiable Manifolds, 364 Riemannian Mani- 
folds A). For every (C^) p-form @ on X we 
define an (n — p)-form » o on X as follows: 
First denote the tvolume element of X by 

dv. If we choose a basis {w,,...,@,} of the 
space of 1-forms on an open set U of X such 
that ds? 2 225,0? and dv — €, ^... ^o,, then 

9 can be expressed on U in the form ọ = 
(1/p!) Z 9i, .....i OA AO; If we let «o = 
(1/(n —p))Z(* oi, dg ph ^r AO where 
Look, en (1/pDX ERICH dnte y Pipin then 
* is an (n— p)-form on U that does not 
depend on the choice of (@,,...,@,,) and is 
determined only by o. Since X is covered by 
open sets as above, + defines a linear map- 
ping that transforms p-forms to (n — p)-forms. 
If we let ds? = X gy dx/dx* in terms of the 
local coordinate system (xt, ...,x") and o = 
(Up), .....;, dx^ ^... ^ die, then, in the nota- 
tion of tensor calculus, we have 


* Q =(1/(n—p)!)(* ol, del A... A dein. 
(* ol lr BEA 
(g = det(gij)). 


For two p-forms ọ and y, we define the 
inner product by (o, V) — fx 9 ^ «y if the right- 
hand side converges. In order for the inner 
product (o, dei to be defined, it suffices that 
either or y has compact fsupport. Then 
(op, V) is a symmetric, positive definite bilinear 
form. 

If we let ó2(— 1)?*"*! «dx operate on p- 
forms, where d is the texterior derivative, then 
d and 6 are adjoint to each other with respect 
to the inner product. That is, if either o or 
y has compact support, we have (dg, V) = 
(o, dw) (*Stokes’s theorem). We call A2 dó + 
od the Laplace-Beltrami operator, which is 
a self-adjoint 'elliptic differential operator. 
These operators satisfy relations such as 
x z(—1)9" P, dd 20,600, «A— Ax, 
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&ó—(—1)dx, and ó«—(—1)'?*! «d (when 
they operate on p-forms). 

A differential form o is said to be harmonic 
if do =0 and 6g =0. Then Ag=0. Since A is 
an elliptic operator, a tweak solution o of the 
equation Ag = is an ordinary solution of 


class C* on the domain where u is of class C*. 


Therefore, if o is harmonic (as a weak solu- 
tion), o is of class C^. 


C. Harmonic Forms on Compact Manifolds 


On a compact manifold X, any o with Ao =0 
is harmonic, since (p, Ao) — (dq, dq) + (69, d¢@). 
Let L,( X) be the linear space of p-forms of 
class C” on X, and denote by 2,(X) the com- 
pletion of L (X) with respect to the inner 
product (o, y). Then £,(X) is the Hilbert space 
of square integrable measurable p-forms. Then 
§,(X)= (oe $,(X)| Ao —0 (in the weak sense)] 
is a finite-dimensional subspace of 2,(X) and 
is contained in L,(X), as we have seen before. 
Also, $,(X) is closed in €,(X), and the tpro- 
jection operator H: $,(X) $,(X) is an tinte- 
gral operator with kernel of class C^. The 
orthogonal complement of $,(X) in 2,(X) is 
mapped onto itself by A and has the inverse 
operator G of A, which is a continuous oper- 
ator of the Hilbert space. By letting G=0 

on $,(X), we can extend G to an operator 
from £,(X) to 2,(X) that is called Green's 
operator. It is also denoted by G, maps L, into 
itself, commutes with d and ó, and satisfies 

GH = HG =0, H+ AG =1 (=identity map- 
ping). Therefore, for pe L, (X) we have p=Ho 
+ Gódq + dóGq, which shows that H is thomo- 
topic to the identity mapping of the tcochain 
complex (2, L,(X), d). From this we infer that 
every cohomology class of de Rham cohomol- 
ogy contains a unique harmonic form that 
represents the cohomology class. However, 
since products of harmonic forms are not 
always harmonic, it is not appropriate to use 
harmonic forms to study the ring structure of 
cohomology. G is also an integral operator 
with kernel of class C* outside the diagonal 
subset in X x X. 


D. Harmonic Forms on Noncompact 
Manifolds 


If X is a noncompact manifold, let L,(X) be 
the space of p-forms of class C” with compact 
support, and let 2,(X) be its completion. Let 
$8, (X) and BF(X) be the respective closures of 
dL, ,(X) and 6L,,,(X) in $,(X), and let 
3p(X) and 35 (X) be the respective orthogonal 
complements of B(X) and BF(X) in £,(X). 
Then 3,(X)) 35 (X) 9 $,(X) is a subspace of 
the square integrable harmonic forms, and we 
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have the direct sum decomposition £,(X)= 
38, (X) + B*(X)-- 5, (X). In this decomposition 
any component of a form of class C? is also of 
class C*. 

If X is an open submanifold of another 
manifold Y, X is compact, and OX — X — X isa 
closed submanifold of Y, then the theory in 
this section is just a generalized potential 
theory with boundary condition o =0 on 0X. 
We sometimes treat decompositions of other 
Hilbert spaces that correspond to other bound- 
ary conditions. 


E. Generalization to Complex Manifolds 


If X is a complex manifold, we consider 
complex-valued differential forms (— 72 Com- 
plex Manifolds C). Then the space L,(X) of p- 
forms is the direct sum of the spaces L, ,(X) of 
forms of type (r, s), and the exterior derivative 
d has the expression d =d +d”, where d' is of 
type (1,0) (Le., L, .(X)9 L, ,, ,(X)) and d" is of 
type (0, 1). If we are given a holomorphic vec- 
tor bundle E on X, we can define an operator 
d" on differential forms with values in E, and 
we have the generalized tDolbeault theorem. If 
X is compact and has a tHermitian metric, we 
can define a Hermitian inner product on E as 
follows: There is an open covering {U,} such 
that over each U; the vector bundle E is iso- 
morphic to U, x C*. A point of E over U, is 
represented by (x, ¢,), where xe U; and če C*. 
For xe U;N U, we have (x, 6j) = (x, čą) (the sides 
are the respective expressions over U; and U,) 
if and only if $,— g4(x)5,, where g(x) is a 
holomorphic mapping from U;M U, to GL(q, C) 
satisfying ga du =g; on U;f1 U, Y U,. A differen- 
tial form o with values in E is expressed as a 
family (oj; of differential forms on U, with 
values in C* such that o;(x) = gu (x)e,(x) on 

U; U,. If we take a positive definite Hermitian 
matrix h; whose components are C*-functions 
on U; such that ‘g,.h,9,,=h, on U;f1 Up, then 
{'¢;h,¢;} determines a Hermitian inner product 
on each fiber of E. We can also endow the 
space L, ,(£, X) (of forms of type (r,s) of class 
C* with values in E) with a Hermitian inner 
product by setting (P, V) 9 Zeng? ^ *y? 
for o, WEL, (E, X) (where the pf (x— L, ...,q) 
are the components of o). If we denote by b 
the adjoint operator of d” with respect to this 
inner product and let A =d” d 4- bd", then Aisa 
self-adjoint elliptic differential operator, and 
results similar to those for A mentioned above 
hold for A. For example, the space §, ,(E, X) 
of harmonic forms of type (r, s) 1s of finite 
dimension, and there is a continuous linear 
operator G on £, (E, X), the completion of 

L, (E, X), that satisfies 1 = H + AG, HG = 

GH —0, d" G = Gd", and dG = Gb. Here H 
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denotes the projection € $, which is an 
integral operator with kernel of class C*. 
Also, G maps L, ,(E, X) into itself. Therefore 
H is homotopic to the identity on the co- 
chain complex (2, L, AE X), d"), and any ele- 
ment of tDolbeault’s cohomology groups (d"- 
cohomology groups) is represented by a uni- 
que harmonic form (— 232 Káhler Manifolds 
B). 


F. Other Generalizations 


Even if a manifold X is not of class C”, if X is 
a manifold of class C1, we can develop the 
theory of harmonic forms [6]. We say that X 
is of class C! if it is of class C! and has a set of 
local coordinate systems whose transition 
functions have derivatives satisfying the * Lip- 
schitz condition. 

If X is a real analytic manifold with a real 
analytic Riemannian metric, then harmonic 
forms are also real analytic. Using this fact, we 
can embed real analytically a compact mani- 
fold with a real analytic Riemannian metric 
into a Euclidean space (P. Bidal and G. de 
Rham; this result is now included in the 
theorems of C. B. Morrey and H. Grauert). 

We can consider the theory of harmonic 
forms with singularities [4, 5], a generalization 
of the theory of differential forms of the second 
and third kinds. Here the notion of current is 
very useful. 


G. Cohomology Vanishing Theorems 


Since the operator A is closely related to the 
Riemannian metric, some metrics may admit 
no harmonic forms of certain degrees except 
zero. This is important since it means that the 
corresponding cohomology group of the mani- 
fold vanishes. The condition for this phenom- 
enon to occur can be described in terms of 

the curvature of the metric. This study has its 
origin in S. Bochner's results [2]. 

Here is an example of a vanishing theorem: 
Let B be a holomorphic line bundle on a com- 
pact complex manifold X of dimension n. If 
the *Chern class of B is expressed by a real 
closed differential form of type (1, 1) as œ = 
at È a,5 dz* ^ dz’, where the Hermitian 
matrix (a,,) is positive definite at every point of 
X, then H4(X,Q?(B))=0 for p+q>n. In this 
case, ds? - 23a, dz* ^ dz? is a Hodge metric 
on X (— 232 Kahler Manifolds D). 
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Harmonic Mappings 


A. General Remarks 


The theory of harmonic mappings between 
Riemannian manifolds has its origin in the 
study of tPlateau’s problem. The basic prob- 
lem in the theory is to deform a given map- 
ping into a harmonic one, which is a problem 
of the *calculus of variations and tglobal ana- 
lysis (— 46 Calculus of Variations, 183 Global 
Analysis). Recently, the theory of harmonic 
mappings has been applied to problems in 
various branches of geometry [7—9, 11]. 


B. Definitions and Examples 


Let (M, g) and (N, h) be *Riemannian mani- 
folds with metrics g = È g,,dx'dx/ and h= 

X hs dy*dyP, respectively. We define the energy 
of a C'-mapping f: MN by 


1 
gn-;| Wio" dx, 


where |df(x)| is the tHilbert-Schmidt norm of 
the differential dr ` T,(M)— Tz, (N) of f at 
xe M and dx is the canonical Lebesgue mea- 
sure defined by g on M (assumed compact). 
Thus E(f) can be considered to be a generali- 
zation of the classical fDirichlet integral for 
functions. The integrand ei f )(x) ^ |df(x)|? is 
called the energy density of f; it measures 

the sum of the squares of elements of length 
stretched on a complete set of mutually per- 
pendicular directions. 
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The *Euler-Lagrange differential equations 
of the energy functional E(f) yield a vector 
field «( f) along f, i.e., a section of the bundle 
f * T(N) induced from the tangent bundle 
T(N) of N by f. In fact, given a family f, of 
mappings depending differentiably on t with 
fo — f, we have 

d 
t=0 


EED- | (03 
where < , > denotes the inner product of 
tangent vectors along f. The vector field «( f) is 
called the tension field of the mapping f; it 
indicates the direction in which the energy of f 
decreases most rapidly. 

The Euler-Lagrange differential equations 
t( f) «0 are a system of *quasilinear elliptic 
partial differential equations of the second 


order. In local coordinates, these can be writ- 
ten in the form 





EE 


Ox! @xfi 


Af* +S g? T, Cf) 





where A is the *Laplace-Beltrami operator on 
M and the I5,( f) (x) are the tChristoffel sym- 
bols on N at f(x). (The f* are local coordi- 
nates of the point f(x), and (g?) is the inverse 
matrix of (g;).) A C?-mapping f: MN is said 
to be harmonic if its tension field t( f) vanishes. 
Thus, if M is compact, a C?-mapping f: MoN 
is harmonic if and only if it is an extremal of 
the energy functional E( f). 

Examples of harmonic mappings appear in 
various contexts of differential geometry. For 
instance: 

(1) If N =R, then the harmonic mappings 
MR are the tharmonic functions on M. 

(2) If M is the circle S', then a harmonic 
mapping ST 5 N is a closed tgeodesic of N 
parametrized by arc length. 

(3) Let f: MN be an *isometric immersion 
of M into N. Then f is harmonic if and only if 
it is a *minimal immersion. 

(4) If M and N are tKahler manifolds, then 
every tholomorphic or antiholomorphic map- 
ping M >N is harmonic, where by an anti- 
holomorphic mapping is meant a mapping 
whose differential mapping carries a differen- 
tial form of type (1,0) into that of type (0, 1). 
We note that each (anti-)holomorphic map- 
ping is an absolute minimum for the energy in 
its homotopy class. There are also examples of 
nonholomorphic (and nonantiholomorphic) 
harmonic mappings between Kahler manifolds 
(— Section D). 


C. Fundamental Properties 


(1) Regularity. Since a harmonic mapping is a 
solution of a second-order quasilinear elliptic 
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system of partial differential equations t( f) «0 
(— Section B), it is a smooth (i.e., of class C?) 
mapping. More generally, it is known that a 
continuous mapping that satisfies t(f)=0 in 
a weak sense is smooth [4, 6]. 

(2) Unique continuation property. The follow- 
ing unique continuation theorem is valid for 
harmonic mappings: If two harmonic map- 
pings of M into N agree up to infinitely high 
order at some point of M, then they are iden- 
tical (M being assumed connected). In partic- 
ular, a harmonic mapping that is constant on 
an open set is a constant mapping. 

The global natures of harmonic mappings 
are closely related to the curvatures of the 
manifolds under consideration. For instance, 
suppose that M and N are compact and that 
the sectional curvatures of N are nonpositive 
everywhere. Then we have: 

(3) Uniqueness. Let f: MN be a harmonic 
mapping, and assume that there is a point of 
f(M) where the sectional curvatures of N are 
negative. Then f is unique in its homotopy 
class unless f(M) is a closed geodesic y of N; 
and in this case we have uniqueness up to 
rotation of y, i.e., an isometry of y which moves 
each point of y a fixed oriented distance along 
d 

(4) Degeneracy. Suppose further that the 
tRicci tensor (R,) of M is positive semidefinite 
everywhere. Then the energy density e( f) is a 
‘subharmonic function for every harmonic 
mapping. This implies that any harmonic 
mapping f: MN is ftotally geodesic. More- 
over, if N is of negative sectional curvature, 
then f is either constant or maps M onto a 
closed geodesic of N; if (R;) is positive definite 
at some point, then f is constant. 

(5) Finiteness. Assume now that N is of 
negative sectional curvature. Then, for each 
K 21, there are only finitely many noncon- 
stant harmonic mappings f: M >N of dilata- 
tion bounded by K. Here, we say that the dila- 
tation of f is bounded by K if and only if at 
each point of M we have df «0 or (4,/A,)! < 
K, 4,2 A45 2 ...» 0 being the positive eigen- 
values of the pullback quadratic form f *h(x) 
on T,(M) induced from the metric h of N by f. 


D. Harmonic Mappings of a Surface 


Let M be a compact surface. Then the energy 
of a mapping M >N is the tDirichlet-Douglas 
functional, and harmonic mappings are closely 
connected with solutions of Plateau's problem 
(— 334 Plateau's Problem). In fact, if a tcon- 
formal mapping M >N minimizes the tarea 
functional, then it also minimizes the energy 
functional. 

Now let M and N be compact orientable 
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surfaces whose genera are denoted by p and q, 
respectively. Then the problem of existence (or 
nonexistence) of harmonic mappings is well 
understood. In fact: 

(1) When q Z0, for any metrics g and h on 
M and N, every homotopy class of mappings 
M ->N contains a harmonic mapping. 

(2) When q —0 (ie., N is the 2-sphere S°), 
every harmonic mapping whose tdegree d 
satisfies |d| 2 p is holomorphic or antiholo- 
morphic with respect to the complex struc- 
tures associated with g and h. For example, 
consider the homotopy classes of mappings 
from the 2-torus T? to S? with any metrics. 
Then all classes with degree |d| 2 2 have har- 
monic representatives, and any such is holo- 
morphic or antiholomorphic; and the classes 
with d= +1 have no harmonic representatives. 

(3) When g=0 and |d| € p — 1, we have, for 
every such p and d, a surface M of genus p and 
a metric h on S? such that there exists a har- 
monic nonholomorphic (and nonantiholo- 
morphic) mapping of degree d from M to S?. 


E. Existence Theorems 


The basic problem in the study of harmonic 
mappings is to prove their existence in general 
geometric contexts. 

(1) In regard to this problem, translating the 
problem of the elliptic system c( f) 20 into the 
"initial value problem of the corresponding 
nonlinear tparabolic system 0f/0t — «( f), J. 
Eells and J. H. Sampson [1] proved that if 
M and N are compact and if N has nonposi- 
tive sectional curvature everywhere, then every 
homotopy class of mappings M >N contains 
a harmonic mapping that minimizes the energy 
in that class. Subsequently, the uniqueness of 
these harmonic mappings was established 
by P. Hartman [2] in the form stated in Sec- 
tion C. 

(2) For harmonic mappings of surfaces, 
more general existence results have been 
known. 

First, by the *direct method of the calculus 
of variations, L. Lemaire (1977) and others 
proved that if M and N are compact and if M 
is 2-dimensional, then every conjugacy class of 
homomorphisms z,(M)— z,(N) of the funda- 
mental groups is induced by a minimizing 
harmonic mapping. It follows that if, in partic- 
ular, the second homotopy group z,(N) of N 
is zero, then every homotopy class of map- 
pings of a compact surface M to N contains 
a harmonic representative realizing the mini- 
mum of the energy in that class. 

On the other hand, by making use of the 
generalized *Morse theory for a perturbed 
energy functional, J. Sacks and K. Uhlen- 
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beck [5] succeeded in giving a satisfactory 
answer to the structure of z (N), which is a 

1, (N)-module, in terms of harmonic mappings. 
They proved that there exists a generating set 
for z (N) consisting of harmonic mappings of 
spheres that minimize energy and area in their 
homotopy classes. We note that these har- 
monic mappings are minimal immersions with 
*branch points. 

(3) Next, we mention the case of harmonic 
mappings of manifolds with boundary. In this 
case, we can naturally formulate the fDirichlet 
and the "Neumann boundary value problem 
for harmonic mappings. 

In his study of Plateau’s problem on Rie- 
mannian manifolds, C. Morrey (1948) dis- 
cussed the Dirichlet problem for harmonic 
surfaces with boundary. 

The problem in arbitrary dimensions has 
been studied by R. S. Hamilton [3], who 
extended the result of Eells and Sampson 
mentioned above to the case where M and N 
have boundaries. In fact, let M and N be com- 
pact Riemannian manifolds with boundary, 
and assume that N has nonpositive sectional 
curvature and that the boundary ON of N is 
tconvex (or empty). Then there exists a unique 
minimizing harmonic mapping in each trela- 
tive homotopy class determined by the pre- 
scribed Dirichlet boundary value. We note 
that if ON is not convex, then it is easy to 
formulate Dirichlet problems with no solu- 
tions. Hamilton also treated the Neumann 
problem. 

Subsequently, S. Hildebrandt, H Kaul, and 
K.-O. Widman [4] gave another existence 
proof of solutions of the Dirichlet problem 
that covers the case where N admits positive 
sectional curvature. 
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David Hilbert (January 23, 1862- February 14, 
1943) was born in Kónigsberg, Germany. He 
attended the University of Königsberg from 
1882 to 1885, when he received his doctoral 
degree with a thesis on the theory of invar- 
iants. It was there that he established a life- 


long friendship with H. Minkowski. In 1892 he | 


became a professor at the University, and in 
1895 he was appointed to a professorship at 
the University of Góttingen, a position he held 
until his death. He obtained his basic theorem 
on invariants between 1890 and 1893, and next 
began research on the foundations of geometry 
(— 155 Foundations of Geometry) and the 
theory of falgebraic number fields. Concerning 
the former, he published Grundlagen der Geo- 
metrie (first edition, 1899), in which he gave the 
complete axioms of Euclidean geometry and 

a logical examination of them. Concerning 

the latter, he systematized all the important 
known results of algebraic number theory in 
his monumental Zahlbericht (1897). In number 
theory, he enunciated his significant conjecture 
on 'class field theory. At the international 
congress of mathematicians held in Paris in 
1900, he put forth 23 problems as targets for 
mathematics of the 20th century (Table 1). 
Between 1904 and 1906 he conducted research 
on the fDirichlet principle of tpotential theory 
and on the direct method in the tcalculus of 
variations. Around 1909 he established the 
foundations of the theory of *Hilbert spaces. 
After 1910 he was chiefly involved in research 
on the tfoundations of mathematics, and he 
advocated the standpoint of tformalism. He is 
one of the greatest mathematicians of the first 
half of the 20th century. 
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Table 1. The 23 Problems of Hilbert 


(1) To prove the continuum hypothesis (— 33 
Axiomatic Set Theory D). 





(2) To investigate the consistency of the 
axioms of arithmetic (— 156 Foundations of 
Mathematics E). 


(3) To show that it is impossible to prove 

the following fact utilizing only congruence 
axioms: Two tetrahedra having the same alti- 
tude and base area have the same volume. 
Solved by M. Dehn (1900). 


(4) To investigate geometries in which the line 
segment between any pair of points gives the 

shortest path between the pair (— 155 Foun- 
dations of Geometry). 


(5) To obtain the conditions under which a 
topological group has the structure of a Lie 
group (— 423 Topological Groups M). Solved 
by A. M. Gleason and D. Montgomery and L. 
Zippin (1952) and H. Yamabe (1953). 


(6) To axiomatize those physical sciences in 
which mathematics plays an important role. 


(7) To establish the transcendence of certain 
numbers (— 430 Transcendental Numbers B). 
The transcendence of 2V2, which was one of 
the numbers put forth by Hilbert, was shown 
by A. Gel’fond (1934) and T. Schneider (1935). 


(8) To investigate problems concerning the 
distribution of prime numbers; in particular, to 
show the correctness of the Riemann hypoth- 
esis (— 450 Zeta Functions). Unsolved. 


(9) To establish a general law of reciprocity (— 
59 Class Field Theory A). Solved by T. Takagi 
(1921) and E. Artin (1927). 


(10) To establish effective methods to deter- 
mine the solvability of Diophantine equations 
(— 97 Decision Problem; 182 Geometry of 
Numbers). Solved affirmatively for equations 
of two unknowns by A. Baker, Philos. Trans. 
Roy. Soc. London, (A) 263 (1968); solved nega- 
tively for the general case by Yu. V. 
Matiyasevich (1970). 


(11) To investigate the theory of quadratic 
forms over an arbitrary algebraic number field 
of finite degree (— 348 Quadratic Forms). 


(12) To construct class fields of algebraic num- 
ber fields (— 73 Complex Multiplication). 


(13) To show the impossibility of the solu- 
tion of the general algebraic equation of the 
seventh degree by compositions of continu- 
ous functions of two variables. Solved nega- 
tively. In general, V. I. Arnold proved that 
every real, continuous function f(x,, x5, x4) 
on [0, 1] can be represented in the form 

ZZ, (gx, X2), x4), where h; and g; are real, 
continuous functions, and A. N. Kolmogorov 
proved that f(x,, x5, x3) can be represented 
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in the form 27-1 hí(gi (x4) + Gi2(X2) + gis(x3)). 
where h; and g;; are real, continuous func- 
tions and g,, can be chosen once for all inde- 
pendently of f (Dokl. Akad. Nauk SSSR, 114 
(1957), Amer. Math. Soc. Transl., 28 (1963)). 


(14) Let k be a field, x,,..., x, be variables, 
and f(x, ..., x,) be given polynomials in 
k[x;, ...,x,] (i= 1, ..., m). Furthermore, let 

R be the ring formed by rational functions 
F(X,,...,X,,) in k(X,,..., Xm) such that F( fi, 
fme k[x;, ..., x,]. The problem is to deter- 
mine whether the ring R has a finite set of 
generators. Solved negatively by M. Nagata, 
Amer. J. Math., 81 (1959). 


(15) To establish the foundations of algebraic 
geometry (— 12 Algebraic Geometry). Solved 
by B. L. van der Waerden (1938-1940), A. 
Weil (1950), and others. 


(16) To conduct topological studies of alge- 
braic curves and surfaces. 


(17) Let f(x,, ..., x,) bea rational function 
with real coefficients that takes a positive value 
for any real n-tuple (x,, ..., x,). The problem is 
to determine whether the function f can be 
written as the sum of squares of rational func- 
tions (— 149 Fields O). Solved in the affirma- 
tive by E. Artin (1927). 


(18) To express Euclidean n-space as a disjoint 
union | J; P,, where each P, is congruent to 
one of a set of given polyhedra. 


(19) To determine whether the solutions of 
regular problems in the calculus of variations 
are necessarily analytic (— 323 Partial Differ- 
ential Equations of Elliptic Type). Solved by 
S. N. Bernshtein, I. G. Petrovskii, and others. 


(20) To investigate the general boundary value 
problem (- 120 Dirichlet Problem; 323 Par- 
tial Differential Equations of Elliptic Type). 


(21) To show that there always exists a linear 
differential equation of the Fuchsian class with 
given singular points and monodromic group 
(— 253 Linear Ordinary Differential Equa- 
tions (Global Theory)). Solved by H. Róhrl 
and others (1957). 


(22) To uniformize complex analytic functions 
by means of automorphic functions (— 367 
Riemann Surfaces). Solved for the case of one 
variable by P. Koebe (1907). 

(23) To develop the methodology of the 


calculus of variations (— 46 Calculus of 
Variations). 
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A. General Remarks 


The theory of Hilbert spaces arose from prob- 
lems in the theory of "integral equations. D. 
Hilbert noticed that a linear integral equation 
can be transformed into an infinite system of 
linear equations for the *Fourier coefficients 
of the unknown function. He considered the 
linear space l, consisting of all sequences of 
numbers {x,} for which Z2; |x,|? is finite, 
and defined for each pair of elements x= {x,}, 
y={y,} €l, their inner product as (x, y) - 

X, Xn), The space l, can be regarded as 

an infinite-dimensional extension of the notion 
of a Euclidean space. F. Riesz considered the 
space of functions now termed L,-space and 
succeeded in giving a satisfactory answer to 
the Fourier expansion problem. In his book 
[3], J. von Neumann established a rigorous 
foundation of quantum mechanics employing 
Hilbert spaces and the spectral expansion of 
self-adjoint operators. The following axiomatic 
definition (— Section B) of Hilbert spaces is 
due to von Neumann. H. Weyl later justified 
the tDirichlet principle of Riemann by the 
method of orthogonal projection in a Hilbert 
space, and thus paved the way for the function- 
analytic study of differential equations. 


B. Definition 


Let K be the field of complex or real numbers, 
the elements of which we denote by x, f, .... 
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Let H be a linear space over K, and to any 
pair of elements x, ye H let there correspond a 
number (x, y)e K satisfying the following five 
conditions: (1) (x, +X, y) - (x1, y) (x2. y); (t) 
(ax, y) = (x, y); (iii) (x, y) 9 Cy, x); (iv) (x, x) 2 0; 
and (v) (x, x) 30 x 20. Then we call H a pre- 
Hilbert space and (x, y) the inner product of x 
and y. 

With the norm ||x||=./(x,x), H isa 
tnormed linear space. If H is *Àcomplete with 
respect to the distance ||x — y|| (i.e, ||x, — x4,]| 2 
0 (m, no co) implies the existence of lim x, =x), 
then we call H a Hilbert space. According 
as K is complex or real, we call H a complex 
or real Hilbert space. A Hilbert space is a 
*Banach space. 

A normed linear space with norm ||x|| can 
be made a pre-Hilbert space, by defining an 
inner product (x, y) so that ||x|| =./(x, x), if 
and only if the equality |x + yl? + |x — yl? = 
2(|xl|? + [| y||?) holds for any x, y. 





C. Orthonormal Sets 


Two elements x, ye H are said to be mutu- 
ally orthogonal if (x, y) 20. A subset X of H is 
called an orthogonal set (or system) if Oé X and 
every distinct pair x, ye X is mutually ortho- 
gonal. If every element of an orthogonal set X 
is of norm 1, then X is called an orthonormal 
set. Any orthogonal set 2 = (x;) can be normal- 
ized into an orthonormal set {x;/|x;||}. A 
maximal orthonormal set is called a complete 
orthonormal set or an orthonormal basis. All 
the complete orthonormal sets of a given H 
have the same cardinal number, which we call 
the dimension of H. Two Hilbert spaces are 
isomorphic if and only if they have the same 
dimension. 

Let X= {x;} be an orthonormal set. Then for 
every xe H, its Fourier coefficients (x, x;) van- 
ish for all but a countable number of i, and the 
Bessel inequality ||x||? > X; |(x, x,)|? holds. The 
following three statements are equivalent in a 
Hilbert space: (i) X is complete; (ii) Parseval’s 
equality |x||? =, |(x, x)|? holds for every x; 
(iii) every x can be expanded in a Fourier series 
x= Xx, x)x; (^ 317 Orthogonal Functions). 


D. Examples of Hilbert Spaces 


The space l, (— Section A) is a Hilbert space 
of dimension N,. The 'function space L, on a 
measure space (X, u) is a Hilbert space if the 
inner product of f, ge L, is defined by (f; g)— 
[x fGdu. In the case of the *Lebesgue mea- 
sure in a Euclidean space, L, is of dimension 
No, so that it is a Hilbert space isomorphic 
to Lj. Further examples of Hilbert spaces are 
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Ax(Q), W;(Q) (= H'(Q)), and Hy(Q) (— 168 
Function Spaces). 


E. Closed Linear Subspaces and Projections 


Let M be a closed linear subspace of a Hilbert 
space H, i.e., a linear subspace that is closed in 
the norm topology of H. It is a Hilbert space 
with respect to the restriction of the inner 
product in H. For a given M the set of all 

x€H such that (x, y) 20 for every ye M forms 
a closed linear subspace M+ called the orthog- 
onal complement of M. The orthogonal com- 
plement of M+ is M (i.e, M++ = M), and H is 
the direct sum of M and M' De every xe H 
can be uniquely represented as x= y -z, ye 
M, ze Mt, and [x|?— | yi^ tuell, Thus the 
quotient space H/M is isomorphic to M+ and 
is also a Hilbert space. The operator P, that 
maps x to y is called the projection (or ortho- 
gonal projection or projection operator) to M. 
A bounded linear operator P is a projection if 
and only if it is idempotent (P? — P) and self- 
adjoint ((Px, y) 2 (x, Py) for any x, ye H) (— 
251 Linear Operators). 


F. Conjugate Spaces 


A linear operator from H to K is called a 
linear functional. The set H' of all continuous 
linear functionals f on H forms a Hilbert 
space with norm || f || =sup{| f(x)|| |||] =1}. 
For every fe H' there exists a unique ye H 
such that f(x) —(x, y) for all xe H (Riesz's 
theorem), and the correspondence f y gives 
an 'antilinear isometric operator from H' onto 
H (for !linear operators on Hilbert spaces 

— 68 Compact and Nuclear Operators; 251 
Linear Operators; 390 Spectral Analysis of 
Operators). 
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A. Differentiation of Complex Functions 


Let f(z) be a tcomplex-valued function defined 
in an open set D in the tcomplex plane C. We 
say that f(z) is differentiable at z if the limit 


lim (f(z + h) — f(z)/h — f(z) (1) 


exists and is finite as the complex number h 
tends to zero. We call f’(z) the derivative of 
f(z) at z. This definition is a formal extension 
of the definition of differentiability of a func- 
tion of a real variable to that of a complex 
variable (— 106 Differential Calculus), but it is 
a much stronger condition than the differentia- 
bility of a real function, since z +h in (1) may 
be an arbitrary point in a 2-dimensional neigh- 
borhood of z. Hence many results essentially 
different from those for functions of a real 
variable follow from it. 

If a function f(z) is differentiable at each 
point of an open set D, it is said to be holo- 
morphic (or regular) in D, or f(z) is a holo- 
morphic function on D. (For the definition of 
holomorphy of a complex-valued function 
of several complex variables — 21 Analytic 
Functions of Several Complex Variables C.) 

Let E be an arbitrary nonempty subset of C. 
We say that f(z) is holomorphic on E if it is 
defined in an open set D containing E and is 
holomorphic on D. Some results valid for 
differentiable real functions also hold for 
holomorphic functions. For instance, the de- 
rivative of a sum, product, or quotient is given 
by the usual rules. The derivative of a com- 
posite function is determined by the chain rule. 
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The set of all functions holomorphic in a 
tdomain D forms a !ring. 

Suppose f(z) is holomorphic on D and 
f '(z9) #9, zo € D. Then two curves that form 
an angle at z) are mapped by f to two curves 
forming the same angle at Goal Because of 
this property, the mapping f is said to be 
conformal at all points z with f’(z) 40. 

The following four conditions are equivalent 
for a function f =u + iv defined on an open 
set D. (1) f is holomorphic in D. (2) u — u(x, y) 
and v — v(x, y) are ttotally differentiable at 
each point z 2 x 4- iy and satisfy the Cauchy- 
Riemann differential equations 


Qu/Ox — Ov/Oy, | Ou/Oy = — Ov/Ox. 


(3) f is represented by a tpower series 25,-o c,(z 
— aj" in a neighborhood of each point a of D; 
that is, f(z) is analytic in D. (4) f is continuous 
and [c f(z)dz=0 for every rectifiable Jordan 
closed curve C whose interior is contained, 
together with C, in D. The proposition that (1) 
implies (4) is called Cauchy's integral theorem, 
and the proposition that (4) implies (1) is called 
Morera's theorem. 

The hypothesis of Morera's theorem can be 
weakened as follows: Let f(z) be continuous in 
a domain D. If fe f(z) dz 2 0 for every rectangle 
C in D with sides parallel to the axes and 
whose interior consists of only points of D, then 
f(z) is holomorphic in D. In the statement of 
this theorem, if we let C be an arbitrary circle, 
we get the same conclusion. 

The following complex differential operators 
are often useful: 


d 1fé d p 1/3 ð 

BE EE SE BEN 
Generally, (0/0z) = 09/02. The Cauchy- 
Riemann equations above can be expressed in 
a single equation : 0f/0z — 0. If f is holomor- 
phic, 0f/0z = f '(z). 

In order to show that f=u-+iv is holomor- 
phic in D, assumption (2) can be weakened. 
Actually, we have the Looman-Men'shov 
theorem: Suppose that u and v are continuous 
in D, Ou/Ox, 0u/Oy, Ov/Ox, and 6v/Oy exist at 
every point of D except for at most a count- 
able number of points, and the Cauchy- 
Riemann equations hold in D except for a set 
of 2-dimensional tmeasure zero; then f =u + iv 
is holomorphic in D. D. E. Men'shov extended 
this theorem and obtained various conditions 
for holomorphy. For example, he proved the 
following theorem: If f is a topological map- 
ping of D and f is conformal in D (1.e., 


lim arg f(z +h) — f (z))/h 





exists) except for at most a countable number 
of points, then f is holomorphic in D. 
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As another type of sufficient condition for 
holomorphy, we have the proposition : If f 1s 
locally tLebesgue integrable in D and satis- 
fies the Cauchy-Riemann equation f; — 0 in 
the sense of ‘distribution, then there exists a 
holomorphic function g in D such that g — f 
talmost everywhere. 


B. Cauchy's Integral Theorem 


Cauchy's integral theorem can be stated as 
follows: If f(z) is a holomorphic function in a 
tsimply connected domain D in the complex 
plane, the equality fe f(z)dz=0 holds for every 
(rectifiable) closed curve C in D. In particular, 
the integral ff f(z)dz (a, Be D) is uniquely deter- 
mined by « and fi provided that its path of 
integration lies in D. The function F(z)= 

ESC) dt (ze D) is called the indefinite integral 
of f. F(z) is holomorphic in D and F'(z) — f(z). 

In the proof of his integral theorem, Cauchy 
assumed the existence and continuity of the 
derivative f’(z) in D. However, E. Goursat 
proved the theorem utilizing only the existence 
of f '(z), Actually, by virtue of the integral 
formula (2) in this section, the existence of f '(z) 
in D implies the continuity of TC This is 
sometimes called Goursat's theorem. 

Let C, C,, C, ..., C, be rectifiable Jordan 
curves. Suppose that C,, C5, ..., C, are in the 
interior of C and that each one lies in the ex- 
terior of the others. If f(z) is holomorphic in 
the region D bounded by these n+ 1 curves 
and continuous on DUCUC,U...UC,=D, 
then we have 


[sod f(z)dz. 
c & Jo, 


Here the curvilinear integrals are taken in the 
positive direction (i.e., we take the direction 
such that Lutz —a) ! dz = c, (z—a) ! dz —2ni for 
a point a in the interior of C or C,, respec- 
tively). Henceforth, an integral along a closed 
curve is taken in the positive direction unless 
otherwise noted. Cauchy's integral theorem 
under the assumption that f is holomorphic 
in D and is continuous on D is sometimes 
called the stronger form of Cauchy's integral 
theorem. 

Under the same assumptions as in the 
stronger form, we have Cauchy's integral for- 
mula for ze D: 


1 1 
ni €, 





DE Jitz 
l TU 
EE (2) 


. This integral formula expresses the value of 
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f(z) at a point z in the domain D in terms of 
the values of f on the boundary of D. In par- 
ticular, when n —0, the integral formula reads 
as 


fe-s. | fG) dt. 3) 





Furthermore, if C is a circle |z|= R (i.e., D is 
the disk |z| « RL we obtain Poisson’s integral 
formula: 





f(z)= 

1 2n R R?—r 

x f(Re EE zs 
f2- 





! (^ ne fei Ët TZ ao + "te f(0) 
2n Eua md 


z—re, O<r<R. OI 


Formula (3) is valid for a *harmonic function. 
Let C be a rectifiable curve and f(f) be a 
continuous function defined on C; then the 





integral of Cauchy type 
1 
rem | ie) d 
2ni Jol —z 


is holomorphic outside C. The nth derivative 
F of F is given by (n!/2zi) [c TECH — z)"*! dt; 
moreover, F can be expanded in a Taylor 
series about every point a outside C: 

eo Fo 
Fe Y ais. ai, a= —, 


n=0 n! 





which converges in |z—a|<p, p being the 
distance from a to C. In particular, formula (3) 
implies that a holomorphic function f is in- 
finitely differentiable and is expanded in a 
Taylor series about every point of D as above. 
Let C be a closed curve not passing through 
a point a. Then the integral (1/2zi) [c dz/(z — a) 
is an integer. It is called the winding number of 
C about a and is denoted by n(C;a). A cycle y 
(a finite sum of oriented closed curves) in an 
open set D is said to be homologous to zero 
in D if n(y; a) 20 for all points a in the com- 
plement of D. The general form of Cauchy's 
integral theorem is stated as follows. If f is 
holomorphic in D, then f, f(z)dz «0 for every 
cycle y which is homologous to zero in D (E. 
Artin). From this we have the general form of 
Cauchy's integral formula: if f is holomorphic 
in a domain D, then 


1 
n(y; afi Zu 


for every cycle y which is homologous to zero 
in D. 
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C. Zero Points 


Let f(z) be a holomorphic function not identi- 
cally equal to zero. If f(a) 20, we call a a zero 
point of f. Every zero point of f is an isolated 
point, and there exists a unique positive inte- 
ger k and a function h holomorphic at a such 
that 


f()- —af'g(z), 


We call k the order of the zero point a and a 
a zero point of the kth order. The equality (4) 
implies that the *Taylor series of f(z) at a 
begins with the term c,(z—a)*. Suppose that a 
is a zero point of f(z) — y of the kth order; then 
we call a a y-point of the kth order. 

For a function f(z) defined in a neighbor- 
hood of the *point at infinity, we set f(1/w)— 
g(w) (f(oo) 2 g(0)) and call f holomorphic at 
oo if g is holomorphic at 0; f is said to have a 
zero of order k at oo if g has a zero of order k 
at 0. If two functions f and g are holomorphic 
in D and f(z) 2 g(z) on a subset E that has an 
taccumulation point in D, then f is identically 
equal to g in D (theorem of identity or unique- 
ness theorem) since the zeros of holomorphic 
functions must be isolated. 


g(a) #9. (4) 


D. Isolated Singularities 


Let f(z) be holomorphic in an annulus D 
={z|R,<|z—a|<R,,0<R,<R,< +0}. 
Then f(z) is expanded in the "Laurent series 


Too 


f= Y, ode (5) 
This is called the Laurent expansion of f 
about a. The coefficients c, are given by c,— 
(1/2ni) fc. f(OdC/C —a)** with C= {z||z—al= 
r}, Ri «r « R;. In particular, if f(z) is holo- 
morphic in D={z|0<|z—a|<R} (or, if a= 

oo, in D— (z| R «|z| € +}) but not holomor- 
phic in DU {a}, we call a an isolated singular 
point (or isolated singularity) of f. By utilizing 
the "local canonical parameter t —z — a (or 
t=1/z for a= oo), the Laurent expansion (5) 

of f is then written as f(z) 2 3,3. Cat" + 

7 ¢,t". The second sum is an ordinary 
power series, called the holomorphic part of 
f(z). The first sum is a power series of 1/t with 
no constant term, called the singular part of 
f(z) at a or the principal part of the singularity 
(or of the Laurent expansion at a). 

If we have lim, a tf(z) 20, the Laurent 
expansion (5) of f(z) lacks its singular part, 
and the limit of f(z) exists as t>0 (za) and is 
equal to co. If we set f(a) — co, then the func- 
tion f(z) is holomorphic in DU (aj. In this 
case, the point a is called a removable singular- 
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ity. If f(z) is bounded in a neighborhood of a 
singularity a, then a is removable (Riemann's 
theorem). Usually, we assume that the remov- 
able singularities of a function have already 
been removed in this way. 

When the singular part of f(z) at a exists 
and consists of a finite number of terms, the 
point a is called a pole; when it consists of an 
infinite number of terms, the point is called- 
an essential singularity. If a is a pole, f(z) is 
represented by the Laurent series 225 p c,t" 

(c .,, #0) and /'(z)— oo as za. In this case, the 
index k is called the order of the pole a. Then a 
relation such as (4) holds, where the index k is 
replaced by — k. Hence the point a is some- 
times called a zero point of the — kth order. If a 
is an essential singularity, then for an arbitrary 
number c there exists a sequence z, converging 
to a such that lim, ,,, f(z,) 2c (the Casorati- 
Weierstrass theorem or simply Weierstrass's 
theorem). Related to Weierstrass's theorem, 

we have *Picard's theorem, which gives a de- 
tailed description of the behavior of a function 
around its singularities. 


E. Residues 


Let a( z oo) be an isolated singularity of f(z). 
Then the coefficient c. , of (z—a) ! in the 
Laurent expansion (5) of f(z) is called the 
residue of f(z) at a and is denoted by Res[ f ],, 
R(a; f), or R(a) if we need not indicate f. We 
have 


1 
R(a).-c-175 f( dK, 


Kaze 

where the integral is taken in the positive 
direction along a path for 0 « r « R. If f(z) is 
holomorphic at z=a, then R(a)=0. If f(z) has 
a pole of the first order at a, 


R(a)= lim (z—a)f(z). 


The residue at the point at infinity is defined to 
be —a_,, where a is the coefficient of 1/z of 
the Laurent expansion of f(z) at oo: f(z)= 
Eo 4,2", and we have 


—i 
Bü. | f(Qdl, R'<r<+o. 
T Jg 


Thus the notion of the residue of f(z) is actu- 
ally related to the differential form f(z)dz and 
not to f(z) itself. 

From the first formula in this section and 
the formula for —a. ,, the residue theorem 
follows (Cauchy, 1825): Let C be a rectifi- 
able Jordan curve in the complex plane. Let 
du, am be a finite number of points inside C, 
and let D be a domain containing C and its 
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interior. If f(z) is a function holomorphic in D 
— (a, ..., Oe, we have 


1 m 
I dz= » R(a,). 
x; | flende= E Ray 
Furthermore, if f(z) is holomorphic in the 
extended complex plane (including the point at 
infinity) except for a finite number of poles, the 
sum of all residues is equal to zero. 


F. Calculus of Residues 


The calculus of residues is a field of calculus 
based on application of the notion of residues. 
For example, we have methods for the calcu- 
lation of definite integrals. Actually, one of the 
reasons why Cauchy studied the theory of 
complex functions was that he believed that 
the theory would provide a unified method of 
computing definite integrals. For example, if 
q(z) is a rational function without poles on 
the real axis and with a zero point at infinity 
whose order is at least 2, then we have 


| p(x)dx=2ni Y, Ris: oz (6) 


— oo 1ma-0 


| e*p(x)dx=2ni Y R(me"o(z)) (7) 
= 69 Ima-0 

Here the sums are taken over all the poles in 
the upper half-plane. Formula (7) is valid also 
for a rational function @(z) with a simple zero 
at infinity. If @(z) has simple poles at a, (k= 

1, ...,n) on the real axis, then we take the 
principal values of the integrals at those poles 
and add ziR(a,) (k — 1, ...,n) to the terms on 
the right-hand side of (6) and (7). Sometimes 
we use the residue theorem to obtain the value 
of the sum of a series (e.g., the ‘Gaussian sum) 
by expressing it as an integral. 

Let f(z) be a single-valued function that is 
*meromorphic and not identically equal to 
zero in a domain D, and let (z) be a function 
holomorphic in D. Draw a rectifiable Jordan 
curve C in D such that the interior of C is 
contained in D and f(z) has neither zeros nor 
poles on C. Let o,, ..., «y and f,,..., Bp be the 
zeros and poles inside C, respectively (where 
each of them is repeated as often as its order). 
Then we have 





: SE ian es 
» p(a,)— 2 olb) = 25 I oz) TE dz. 


If o(z) 2 1, we get 

1 

xl darg f(z)- N — P. 
21 Je 


This is called the argument principle. Next, 
let f(z) be a function meromorphic for |z| « 
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R< +æ, f(0) 0, z oo, and set o(z) «logz. 
Take C as a closed curve consisting of the 
boundary of an annulus 0 « p «|z| «r « R 
(where p is sufficiently small) and two sides of 
a suitable *crosscut joining a point of |z| - p 
and |z| 2r. Then we obtain Jensen's formula: 


01,035 «Oy 


log 
Bis Ba, +--> Bp 


=log|f(0)|+(N — P)logr 








Poa ; 
SH log|f(re™)| dy. 
T Jo 


The argument principle can be utilized to 
prove Rouché’s theorem: Let f(z) and g(z) be 
functions holomorphic in a domain D that 
contains a rectifiable Jordan curve C and its 
interior. Suppose that f(z) 4- Ag(z) never van- 
ishes on C for any 4 with 0x 4x 1. Then the 
number of zeros of f(z) in the interior of C is 
equal to that of f(z) 4- g(z). If | f(z) » |g(z)] on C 
or arg f(z) — argg(z) 2n + 1)z (n is an integer), 
the hypothesis of Rouché's theorem is satisfied. 
This theorem is useful in proving the existence 
of a zero of a complex function (for example, a 
polynomial) and in finding its position. 


G. Analytic Continuations 


Let f(z) be a holomorphic function in a do- 
main D of the complex plane C and D* be a 
domain containing D as a proper subset. If 
there exists a function F(z) holomorphic in D* 
that coincides with f(z) in D, then F(z) is called 
an analytic continuation (or analytic prolonga- 
tion) of f(z) from D to D*. By the theorem of 
identity an analytic continuation of F(z) is 
uniquely determined if it exists. 

The function f,(z) defined by the power 
series P(z; a) - 3.9 a,(z — a)" with the radius 
of convergence r, >0 is holomorphic in the 
domain D, :|z — a| « r;, and at a point b of D, 
it can be expanded into a power series P(z; b) 
=$ <o b,(z—b)" with the radius of conver- 
gence r, (2r, —|b—a|). Ifr,>r,—|b—al, the 
domain D,:|z — b| <r, is not entirely contained 
in D,. Let f;(z) be the function defined in D; 
by P(z; b). Then the function F(z) that is equal 
to f;(z) in D, and to f;(z) in D, is an analytic 
continuation of f,(z) from D, to D,UD, (a 
direct analytic continuation by power series). 

We have the following classical theorems 
about analytic continuations: 

Let D, and D, be two disjoint domains, and 
suppose that their respective boundaries C, 
and C, are trectifiable simple closed curves 
and that the intersection of C, and C; contains 
an open arc T. If two holomorphic functions 
LL f;(z) defined in D, and D,, respectively, 
have finite common *boundary values at every 
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point of I, then there exists an analytic con- 
tinuation F(z) of f(z) and f;(z) to D, UTUD, 
(Painlevé's theorem). We sometimes call f,(z) 
a continuation of f, (2) beyond F. If I is not 
rectifiable, the continuation beyond T does not 
exist, in general. 

Let f(z) be holomorphic in a ‘Jordan do- 
main D lying in the half-plane on one side of 
the real axis and containing an open interval 7 
of the real axis in its boundary. If f(z) has 
finite real boundary values at every point of I, 
then it can be continued analytically beyond I 
to the other side of the real axis; there the 
continued function is given by f(z) (Schwarz's 
principle of reflection). This theorem can be 
generalized to the case where the real interval 
is replaced by an tanalytic curve. 

A harmonic continuation of harmonic func- 
tions is defined analogously to analytic con- 
tinuation. Let D be a Jordan domain lying in 
the half-plane on one side of the real axis and 
having an open interval / on the real axis as a 
part of its boundary. If u(z) is harmonic in D 
and has the boundary value 0 at every point 
of I, then u(z) has a harmonic continuation 
beyond 1. 

For other properties of holomorphic func- 
tions — 43 Bounded Functions; 429 Tran- 
scendental Entire Functions. 


H. Analytic Functions 


A real-valued function f(t) of a real variable t 
is said to be analytic at t — t, if it can be repre- 
sented by a fpower series in t — t; in a neigh- 
borhood of t, in R. If f(t) is defined on an 
open set of R at every point of which it is 
analytic, then f(t) is called an analytic function, 
or, more precisely, a real analytic function. 

Analogously, a complex-valued function f(z) 
of a complex variable z defined on a tdomain 
D of the complex plane C is said to be analytic 
at z «zo (c D) if it can be represented by a 
power series in z — zo in a neighborhood of zo 
in C, and f(z) is an analytic function in D if it is 
analytic at every point of D. In the remainder 
of this article, we are concerned with analytic 
functions in this sense. To distinguish them 
from the real case, they are also called complex 
analytic functions. A complex analytic function 
f(z) defined on D is tdifferentiable in D; there- 
fore it is "holomorphic in D. The converse is 
also true. Thus the term "analytic function" is 
synonymous with "holomorphic function" 
insofar as it concerns a complex function (i.e., 
a complex-valued function of a complex vari- 
able) on a domain, but in the theory of func- 
tions it takes on an additional meaning that is 
explained in the following section. 
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I. Analytic Functions in the Sense of 
Weierstrass 


Let a be a point of the !z-sphere and t the 
‘local canonical parameter at a; i.e., t—z— a if 
af oo andt-z !'ifa- oo. If a power series 
P(z; a) - 5o c,1" has a positive radius r, of 
convergence, we call P(z; a) a function element 
with center a on the z-sphere, after K. Weier- 
strass. P(z;a) - 5.9 c,(z — a)" if az oo, and 
P(za)-Xjoc,z "ifa-oo. These represent a 
holomorphic function in |z — a| <r, or in r; ! « 
|z| € 00, respectively. If b is a point inside the 
circle of convergence of the function element 
P(z;a), by the *Taylor expansion of P(z; a) at 

z — b, we obtain the power series P(z; b) in 

z — b, which is a direct analytic continuation 
of P(z; a). Let a and b be two points on the 
z-sphere, and let C:z=z(s) (Os 1,z(0) — 

a, z(1) 2 b) be a curve joining a and b. We say 
that P(z; a) is analytically continuable along 

C and that we obtain P(z; b) at the end point b 
by the analytic continuation of P(z; a) along C 
if the following two conditions are satisfied: (i) 
To every se[0, 1] there corresponds a function 
element P(z; z(s)) with center z(s); (ii) for every 
So € [0, 1], we can take a suitable subarcz = 
z(s) (|S —sg| <£, €» 0) of C contained inside 

the circle of convergence of P(z; z(so)) such 
that every function element P(z; z(s)) with 
|s—so| <e is a direct analytic continuation of 
P(z;z(So)). When P(z;a) and the curve C are 
given, the analytic continuation along C is 
uniquely determined (uniqueness theorem of 
the analytic continuation). 

Given a function element P(z; a) with center 
a, the set of all function elements obtained by 
every possible analytic continuation along 
every curve starting from a is called an ana- 
lytic function in the sense of Weierstrass deter- 
mined by P(z; a). In this definition, we can 
restrict the curves to polygonal lines. An ana- 
lytic function in this sense is completely deter- 
mined by a single arbitrary function element 
belonging to it, so two analytic functions are 
identically equal if they have a common func- 
tion element. 

A tgerm of a holomorphic function is iden- 
tical to a function element, and the set of all 
germs has the natural structure of a tsheaf d. 
In the terminology of sheaves, an analytic 
function is a connected component of ©, and 
an analytic continuation along a curve C is a 
continuous curve F in © whose projection is C. 


J. Values and Branches of Analytic Functions 


The value of an analytic function at a point b 
is, by definition, the value at b of its function 
elements with center b (whose existence is 
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assumed; there may be several such elements). 
An analytic function is, in general, a multiple- 
valued function because analytic continua- 
tions along different curves with the same end 
points may lead to different function elements. 
For a given analytic function f(z), if the max- 
imal number of its function elements with the 
same center is n, we say it is n-valued, and if 
nz2 we say it is multiple-valued (or many- 
valued). The number of function elements of 
f(z) with the same center is at most *countably 
infinite, so the value of f(z) at a point is a 
countable set (Poincaré-Volterra theorem). By 
introducing a fRiemann surface instead of the 
complex plane as the domain of definition of 
an analytic function, we can regard multiple- 
valued analytic functions as single-valued 
functions defined on a suitable Riemann sur- 
face (— 367 Riemann Surfaces). 

Let f(z) be an analytic function and P(z;a) be 
a function element belonging to f(z), where a is 
a point of a domain D. The set of all function 
elements obtained from P(z; a) by every pos- 
sible analytic continuation along all curves in 
D is called a branch of f(z) in D determined by 
P(z;a). When D coincides with the whole com- 
plex plane, the branch of f(z) in D is the func- 
tion f(z) itself. A function holomorphic in a 
domain D can be expanded in a power series 
with any point of D as its center, and the set of 
these power series (function elements) consti- 
tutes a branch of an analytic function. 

If analytic continuations of a function ele- 
ment are possible along all curves in D, then 
the analytic continuations along two tho- 
motopic curves in D lead to the same result 
(monodromy theorem). In particular, if D is 
‘simply connected and if analytic continua- 
tions of P(z; a) are possible along all curves in 
D starting from a, then the branch of f(z) in D 
determined by P(z; a) is single-valued. 


K. Invariance Theorem of Analytic Relations 


Suppose the following four conditions hold: (1) 
F(z, w) is a holomorphic function of two vari- 
ables for ze A, and weA,, where A,, A, are 
domains in the complex plane. (2) A curve C: 
z «z(s) (O<s<1,z(0)=a, z(1)=b) and two sets 
of function elements P(z; z(s)) and Q(z; z(s)) 
defined for every s (0x s 1) are given. (3) 

P(z; a) and Q(z; a) can be continued analyti- 
cally along C using P(z; z(s)) and Q(z; z(s)), 
respectively. (4) There exists a positive number 
R(s) for every s (O€ s « 1) such that, if |z — ziel 
< R(s), the values of P(z; z(s)) and Q(z; z(s)) 
belong to A, and A,, respectively. Under these 
conditions, if F(P(z; a), Q(z; a)) 2 0 holds for 

[z —a| « R(0), then F(P(z; b), Q(z; b)) =0 holds 
for |z —5| < ROL In other words, an analytic 
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relation between function elements belonging 
to two analytic functions that holds in a neigh- 
borhood of the starting point of a curve C is 
conserved for function elements with center at 
the terminal point b of C. This is called the 
invariance theorem of analytic relations. The 
same statement is valid for relations among 
more than two analytic functions and their 
derivatives (differential equations). 


L. Inverse Functions 


Suppose that P(z; a) (a x co) belongs to an 
analytic function f(z) and Pio a) #0. We 
consider the inverse function of P(z;a) in a 
neighborhood of a and let B(w; «) («= P(a; a)) 
be its expansion as the power series in w— a. 
We call 8(w; o) the inverse function element (or 
simply inverse element) of P(z; a) and the ana- 
lytic function determined by B(w; x) the inverse 
analytic function (or simply inverse function) of 
f(z). The inverse function is completely deter- 
mined by f(z) and is independent of the choice 
of P(z; a). For example, analytic functions rep- 
resented by Jw or log w are defined as the 
inverse function of z? or e7, respectively. 


M. Singularities of Analytic Functions 


Hereafter, when we speak of a curve C:z — z(s) 
(0x s € 1), it is always supposed that C is a 
curve in the complex plane starting at a and 
ending at w. Let K, be the open disk |z — œ| < 
r; we denote by C, the connected component 
of CNK, that contains c. If analytic continu- 
ations of P(z; a) are possible along any subarc 
of C with a terminal point arbitrarily near œ 
but impossible along the whole C, we say that 
the analytic continuation of P(z; a) along C 
defines a singularity Q of the coordinate œ, and 
that Q lies over œw. For example, if P(z; a) has 
a finite radius of convergence, for a suitable 
point œw on the circle of convergence the ana- 
lytic continuation of P(z; a) along the radius aw 
defines a singularity over w. Now take a point 
z, on C, and denote by F,(z) the branch of an 
analytic function determined by P(z;z,) in K,. 
Let €) be a singularity determined by C and 
P(z; a), and suppose that we are given another 
singularity Q* over w determined by C* and 
P*(z; a*). If they define the same branch F(z) 
for every K,, by definition, we put 0)— Q*. 
Thus F,(z) defines an tunramified covering 
surface W, of K, — {œ}, and it is single-valued 
on W.. 

Singularities are classified according to the 
geometric structure of W, and the value distri- 
bution of F,(z) on it. First, if W, has no trela- 
tive boundary over 0 « |z — | « r for a suitable 
r, then Q is called an isolated singularity of the 


745 


analytic function. In this case, the number k of 
points of W, lying over a point z in K,— {a} 

is constant. If k= oo, W, has a tlogarithmic 
branch point over c, and Q is called a loga- 
rithmic singularity. If k < oo, F,(z) can be repre- 
sented as a single-valued holomorphic func- 
tion in O « |t| « r^ by putting z ^c +t". In this 
case, if we introduce an additional point P, 
corresponding to z =w, then W.U {Po} has 
only an talgebraic branch point over w. Now, 
taking into account the value of w= F,(z), we 
call Q an algebraic singularity if lim w exists. In 
this case, we have F,(z) 2 37:,c,t", and if we 
admit analytic continuations in the wider 
sense (— Section O), P(z; a) is analytically con- 
tinuable along the whole C. 


N. The Natural Boundary 


Given a domain D and an analytic function 
f(z) holomorphic in D, if all boundary points 
of D are singularities of f(z) and f(z) is not 
continuable to the exterior of D, the boundary 
of D is called the natural boundary of f(z). This 
phenomenon was first discovered for felliptic 
modular functions. Many results are known 
about power series for which the circle of 
convergence is the natural boundary (— 339 
Power Series). For any given domain D in C, 
there exists an analytic function whose natural 
boundary is the boundary of D. The original 
proof of this fact, given by Weierstrass, con- 
tained a defect that was corrected by J. Besse. 


O. Analytic Continuation in the Wider Sense 


Let two ? Laurent series (with parameter t) z= 
P(t) - 22, a,t” and w=Q(t)= ZZ bt 

and / are integers, and a,b, #0) converge in 
0« [t| «r, and let (P(t;), Q(t)) 2 (P(t;), Q(t;)) 
if t; At,; then we say that the pair (P, Q) de- 
fines a function element in the wider sense. If a 
change of parameter t —r,t 4 rt? ... (ri 

0 and the radius of convergence — 0) gives 
P(t)= (1), Q(t) 2 K(t), we say that (/7, K) and 
(P, Q) define the same function element. By a 
suitable choice of parameter, any function 
element can be given in the form z=t*+ a (or 
a-t "),w=>~,b,t", and the elimination of t 
gives the representation of w as a tPuiseux 
series of z. So if k= 1 and />0, it reduces to a 
holomorphic function element. When k= 1, 
with !<0 not excluded, the above element is 
called a rational element. If k > 1 it is called a 
ramified element, and if /<0 it is called a polar 
element. 

If P’, Q' are the direct analytic continuations 
of P and Q at ty (0«|t5| <r), i.e., their Taylor 
expansions at to, the function element (P', Q’) is 
called a direct analytic continuation of (P, Q), 
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which is also considered its own direct analytic 
continuation. For a fixed r, the set of all direct 
continuations of (P, Q) thus obtained is called 
an analytic neighborhood of (P, Q), and these 
neighborhoods define a topology in the set of 
all function elements. A curve in this topolog- 
ical space is called an analytic continuation in 
the wider sense, and a tconnected component 
of this space is called an analytic function in 
the wider sense. An analytic function in the 
wider sense is a set of function elements in the 
wider sense, but it can also be regarded as a 
function w= f(z) (with an independent variable 
z and a dependent variable w) defined by each 
function element p(z, w): z = P(t), w— Q(t). 

An analytic continuation in the wider sense, 


p(s) = p(z, w; s); 


w= Y c,(s)t", 


n-—l(s) 


z ^ z(s) - t9, O0xsxl, 
is sometimes called an analytic continuation 
along the curve C:z 2 z(s) (Os« 1) in the 
complex plane. If all p(s) are holomorphic 
function elements, this coincides with the 
analytic continuation along C in the original 
sense, but if this is not the case, p(0) and C do 
not necessarily determine p(1) uniquely. Actu- 
ally, an analytic function in the wider sense is 
just an analytic function in the original sense 
with at most a countable number of ramified 
or polar elements added. 


P. Singularities of Analytic Functions in the 
Wider Sense 


Suppose the following three conditions hold: 
(1) For every point on C except o, that is, for 
z(s) (O s « 1), a function element in the wider 
sense p(z, w; s) is given. (2) For every A(« 1), 
p(z, w; s) (0 € s € A) constitutes an analytic con- 
tinuation in the wider sense. (3) It is impossible 
to find a function element p(z, w; 1) such that 
p(z, w;s) (O &sx 1) 1s an analytic continuation 
in the wider sense. When these three condi- 
tions are satisfied, we say that p(z, w; s) (O<s< 
1) defines a transcendental singularity O with 
w as its coordinate. The method of determin- 
ing a branch w= F,(z) in an open disk with 
center c is completely parallel to the case of 
holomorphic analytic functions. Because of the 
appearance of function elements in the wider 
sense in F,(z), the covering surface W, of K, 
defined by F,(z) may have algebraic branch 
points. If W, has a logarithmic branch point 
over o, Q is called a logarithmic singularity. If 
W, has no point over o for suitable r, Q is 
called a direct transcendental singularity; other- 
wise, it is called an indirect transcendental 
singularity. The logarithmic singularities are 
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direct singularities. The inverse function of z= 
wsinw has a direct singularity over z= oo 

that is not logarithmic, and the inverse func- 
tion of z=(sin w)/w has an indirect singularity 
over z 2 0. Taking into account the value of 
w= F(z), if the tcluster set of F, at Q: So = 
so (E (z)) consists of only one point, it is an 
ordinary singularity; if not, it is an essential 
singularity. 


Q. History 


A function of a complex variable is monogenic 
in the sense of A. L. Cauchy if it is differenti- 
able at every point of its domain of definition. 
It was B. Riemann who succeeded in develop- 
ing Cauchy's concept. Riemann considered an 
analytic function as a function defined on a 
tRiemann surface, that is, a 1-dimensional 
complex analytic manifold. On the other hand, 
Weierstrass constructed the theory of analytic 
functions starting from power series. When we 
speak of single-valued functions defined in a 
domain of the complex plane, the monogenic 
functions of Cauchy and the analytic functions 
of Weierstrass are identical. Although the 
analytic functions are very special functions, 
the study of complex analytic functions is 
usually called the theory of functions of a 
complex variable, or simply the theory of 
functions. 

By considering the following point set C, 
which is more general than a domain, E. Borel 
showed that a monogenic function on C is not 
necessarily holomorphic in the ordinary sense. 
Take a countable dense subset in a subdomain 
D' of a domain D and a double sequence of 
positive numbers (r^). Put Sz {z||z—z,|< 
dät and C? — D — | J;2, SP. By a suitable 
choice of r, we can suppose that the C? are 
connected and monotone increasing with 
respect to h. Put C 2| Ji, CH. A function 
defined in C is by definition monogenic if it is 
differentiable in C? for every h. For such a 
monogenic function, Cauchy's tintegral for- 
mula in a generalized form holds, and the 
function is infinitely differentiable. If f(z) and 
g(z) are monogenic in C and coincide on a 
curve in C, then they are identical in C. Let D 
be the set (z|0 « Rez « 1,0 « Imz « 1j and {z,} 
be all rational points in D (z, —(p + iq)/m). For 
a natural number h, we define C™ to be the set 
D minus the union of open disks with radius 
exp( — e"^)/h and center (p + ig)/m. The 
function 


WEEK 


is monogenic in C in the above-mentioned 
sense, but not holomorphic in C. The study of 


exp( —e"^) 
z —(p- ig)/m 
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these functions developed into the theory of 
tquasi-analytic functions. 

The concept of *analytic functions of several 
complex variables can also be defined analo- 
gously to the case of one variable. Then non- 
uniformizable singularities appear that lead to 
a generalization of the concept of *manifolds 
(— 23 Analytic Spaces). 
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A. General Remarks 


Let M be a ‘differentiable manifold. If a tLie 
group G acts ttransitively on M as a tLie 
transformation group, the manifold M is said 
to be a homogeneous space having G as its 
transformation group (— 431 Transformation 
Groups). The tstabilizer (isotropy subgroup) 
H, of G at a point x of M is a closed subgroup 
of G, and a one-to-one correspondence be- 
tween G/H, and M preserving the action of G 
is defined by associating the element sH, (se G) 
of G/H, with the point s(x) of M. This corre- 
spondence is a tdiffeomorphism between the 
manifold M and the quotient manifold G/H, if 
the number of connected components of G is 
at most countable. Under this condition we 
may therefore identify a homogeneous space 
M with the quotient manifold G/H of a Lie 
group G by a closed Lie subgroup H (— 249 
Lie Groups). However, H is not uniquely 
determined by M, and it may be replaced by 
Hy,—sH,s ! (se G). Each element h of the 
stabilizer H, at a point x induces a linear 
transformation h on the ttangent space V, of 
M at the point x. The set A, of all his called 
the linear isotropy group at the point x. 
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If we represent the homogeneous space M 
as G/H, we obtain the canonical map zn:s—sH 
of G onto M, which we call the projection of G 
onto M. Let g be the Lie algebra of G, and b 
the Lie subalgebra corresponding to the closed 
subgroup H. When we identify g with the 
tangent space at the identity element e of G 
and b with its subspace, the projection z in- 
duces a linear isomorphism of ob with the 
tangent space V, of M at the point x —z(e). 
The tadjoint representation of G gives rise to a 
linear representation A Ad(h) modulo b of the 
group H on the linear space g/b. Through the 
linear isomorphism between g/b and the tan- 
gent space V, defined by the projection z, this 
representation of H is equivalent to the one 
which associates with h the linear transforma- 
tion h defined by h on the tangent space V,. 

The homogeneous space G/H is said to be 
reductive if there exists a linear subspace m of 
g such that g=h+m (direct sum as linear 
spaces) and (Ad H )m c m. H is said to be re- 
ductive in g if the representation A Ad(h) of 
H in g is tcompletely reducible. 

If a *tensor field P on the homogeneous 
space M — G/H is G-invariant (namely, invar- 
iant under the transformations defined by the 
elements of G), then the value of P at the point 
x —n(e) is a tensor over the tangent space V, 
at x which is invariant under the linear iso- 
tropy group H. Conversely, such a tensor 
over V. is uniquely extended to a G-invariant 
tensor field on M. If G/H is reductive, then G- 
invariant tensor fields over M are in one-to- 
one correspondence with H-invariant tensors 
over m. For instance, if H is compact, then H 
is reductive in g and an A-invariant positive 
definite quadratic form on m defines a G- 
invariant *Riemannian metric on G/H. 

We say that the homogeneous space M — 
G/H is a Riemannian (linearly connected, 
complex Hermitian, Kahler) homogeneous 
space if there exists on M a G-invariant Rie- 
mannian metric (linear connection, tHer- 
mitian metric, *'Káhler metric). Concerning 
such homogeneous spaces, there are various 
results on their structures and geometric prop- 
erties [1-5] (— 412 Symmetric Riemannian 
Spaces and Real Forms; 427 Topology of Lie 
Groups and Homogeneous Spaces). 


B. Examples 


Stiefel Manifold. A k-frame (1 <k <n) in a real 
n-dimensional Euclidean vector space R" is an 
ordered system consisting of k linearly inde- 
pendent vectors. If we regard the real tgen- 
eral linear group of degree n, GL(n, R), as the 
regular linear transformation group of R", 
GL(n, R) acts transitively on the set V; ,(R) of 
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all k-frames in R". Therefore, if H denotes the 
subgroup of GL(n, R) consisting of the ele- 
ments which leave fixed a given k-frame v$, 
we may identify the set V; , and the quotient 
set GL(n, R)/H. Transferring the differenti- 
able manifold structure of GL(n, RH ok, 
through this identification, we see that V, (R) 
— GL(n, RJH becomes a homogeneous space 
(the differentiable manifold structure of V, , is 
defined independently of the choice of v$). 
The space V, ,(R) is called the (real) Stiefel 
manifold of k-frames in R". 

A k-frame is called an orthogonal k-frame 
if the vectors belonging to the frame are of 
length 1 and are orthogonal to each other. The 
set V, (R) of all orthogonal k-frames is a sub- 
manifold of V; (R). The *orthogonal group 
O(n) acts transitively on V, ,(R), which is a 
homogeneous space represented as V, ,(R) — 
O(n)/I, x O(n —k). The manifold V, ,(R) is 
actually the (n — 1)-dimensional sphere. We call 
V, ,(R) the (real) Stiefel manifold of orthogonal 
k-frames (or simply Stiefel manifold). The 
complex Stiefel manifold V, ,(C) = U (n)/I, x 
U (n — k) is defined analogously. 


Grassmann Manifold. Let M, ,(R) (1 <k <n) 
be the set of all k-dimensional linear sub- 
spaces of R". The group O(n) acts transitively 
on M, ,(R), so that we may put M, ,(R)— 
O(n)/O(k) x O(n — k). Here O(k) and O(n — k) 
are identified with the subgroups of O(n) con- 
sisting of all elements leaving fixed every point 
of a fixed (n — k)-dimensional subspace and of 
its orthogonal complement, respectively. In 
this way, M, ,(R) is a homogeneous space, 
which we call the (real) Grassmann manifold. 
The "proper orthogonal group SO(n) acts 
transitively on M, (R), and M, ,(R) may be 
represented as a homogeneous space having 
SO(n) as its transformation group. It follows 
that M, ,(R) is connected. The homogeneous 
space M, ,(R) = SO(n)/SO(k) x SO(n — k) is 
called the Grassmann manifold formed by 
oriented subspaces. M, (R) and M, (RI may 
be identified with the (n — 1)-dimensional real 
projective space and the (n — 1)-dimensional 
sphere, respectively. l 

Applying the above process for real Grass- 
mann manifolds to the complex Euclidean 
vector space C" instead of R", we see that the 
set M, ,(C) of all k-dimensional linear sub- 
spaces in C" is a homogeneous space with 
the *unitary group U (n) of degree n as its 
transformation group, and we represent it as 
U(n)/U(k) x U (n — Kk). This space is called the 
complex Grassmann manifold. The mani- 
fold M, (C) is a simply connected complex 
manifold and has a cellular decomposition 
as a tCW complex whose cells are ‘Schubert 
varieties (— 56 Characteristic Classes E). On 
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the other hand, M, ,(C) may be regarded as 
the set of all (k — 1)-dimensional linear sub- 
spaces in the (n — 1)-dimensional complex 
projective space. Then, by using the *Plücker 
coordinates of these subspaces, M, ,(C) can be 
realized as an talgebraic variety without sin- 
gularity in the projective space of dimension 


a —1 (— 90 Coordinates B). Sometimes 


M, ,(R) is denoted by G, ,(R) or Gin, k). In the 
same way, the homogeneous space represented 
as Sp(n)/Sp(k) x Sp(n — k) is called the tquater- 
nion Grassmann manifold and is denoted by 
M, (H). 


Flag Manifold. Let &,,..., k, be a sequence of 
integers such that n>k,>...>k,>0, and let 
F(k,,...,k,) be the set of all monotone se- 
quences V, >... > V,, where V,(i=1,...,r)isa 
k;-dimensional linear subspace in R". For the 
two sequences V, >... > V, and V; 2...5V 
belonging to F(k,, ..., k,), there exists an ele- 
ment se GL(n, R) such that s(V) - V; (i= 
1,...,r). Therefore F(k,, ..., k,) is a homogene- 
ous space with GL(n, R) as its transforma- 
tion group, and is called the proper flag mani- 
fold. Since the unitary group U (n) of degree n 
acts transitively on it, F(k,, ..., k,) is also re- 
garded as a homogeneous space admitting 

U (n) as its transformation group. In this case, 
putting F(k,,...,k,)=U(n)/H, H is isomorphic 
to the direct product U (k; —k,) x U(k, —k3) 

x ... X U(k,). In particular, when r=n—1, 
k;=n—i, the homogeneous space is the quo- 
tient space of the compact Lie group U(n) by a 
maximal *torus T. In general, the quotient 
space G/T of a compact connected Lie group 
G by a maximal torus of G is called a flag 
manifold. If G acts effectively on G/T, G is a 
*semisimple compact Lie group. The complex 
Lie group G° is a Lie transformation group of 
tbiregular transformations which acts transi- 
tively on the flag manifold G/T, a simply con- 
nected Kahler homogeneous space. Here G° is 
a complex Lie group having G as a maximal 
compact subgroup. If B is a maximal tsolvable 
Lie subgroup (tBorel subgroup) of G5, G/T is 
represented as G°/B. 
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A. General Remarks 


Homological algebra is a new branch of math- 
ematics that developed rapidly after World 
War II. The introduction of the theory was 
motivated by the observation that some alge- 
braic ideas and mechanisms that arose in 

the development of talgebraic topology, in 
particular, thomology theory, can provide 
powerful tools for treating from a unified 
viewpoint various problems in algebra that 
previously were treated differently. One of its 
characteristic features lies in emphasizing, 
from the standpoint of categories and functors 
(— 52 Categories and Functors), the functional 
structure of the objects to be investigated 
rather than their inner structure. Thus the 
theory of derived functors constitutes the main 
theme of homological algebra. This new 
theory turned out to have wide applications in 
other areas of mathematics, and the philos- 
ophy embodied in the theory has been influen- 
tial in the general progress of mathematics. 
For general references — [2, 5, 6]; for tsheaf 
cohomology — [3,4,8]. 


B. Graded Modules and Graded Objects 


Let A be a tring with unity element and X bea 
*unitary A-module. If we are given a sequence 
of A-submodules X, (ne Z) such that X — 
Ynez X, (direct sum), we call X a graded 
A-module and X, the component of degree n 

of X. Each element x of a graded A-module 

X has a unique representation x = ¥,,-7 x, 
(x, € X,); we call x, the component of degree n 
of x. An A-submodule Y of a graded A-module 
X is called homogeneous if x € Y implies x,¢ Y 
(ne Z). In this case, Y = X), Y, and the quotient 
module X/Y — X, X,/Y, are graded A-modules, 
where Y, — YN X,. Let X 2X, X, and Y=), Y, 
be graded A-modules and f: X > Y be an A- 
homomorphism. If there is a fixed integer p 
such that f(X,) c Y,,, for any neZ, f is called 
an A-homomorphism of degree p. In this case, 
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Ker f 2 X, Ker f, and Im f=, Im f,_, are 
homogeneous A-submodules of X and Y, 
respectively, where f,: X,— Y,,, is the restric- 
tion of f on X,. 

Sometimes, by a graded module we mean 
only a sequence {X,,} of A-modules X,, with- 
out considering the direct sum X, X,. Simi- 
larly, we have the notion of a graded object 
(C,] in any category €. 


C. Chain Complexes and Homology Modules 


By a chain complex (X,0) over A we mean a 
graded A-module X = Y, X, together with an 
A-homomorphism 0: X 5 X of degree —1 such 
that 900 20. Hence a chain complex over A is 
completely determined by a sequence 


Cnet [7 
eo Xn 9 X49 X, 42... 


of A-modules and A-homomorphisms such 
that 6,0 0,,, =0 for all n. We call ð the bound- 
ary operator. For a chain complex (X,0), we 
write Keró — Z(X), Ker ô, = Z,(X), Imo = B(X), 
Im 0,., — B,(X). Then Z(X) 2 X, Z,(X), B(X) 
=), B,(X) are homogeneous submodules of 
X, called the module of cycles and the module 
of boundaries, respectively. B(X) is a homog- 
eneous submodule of Z(X), and the quotient 
modules Z(X)/B(X), Z,(X)/B,(X) are denoted 
by H(X), H,(X), respectively. We call H(X)— 
X, H,(X) the homology module of the chain 
complex (X, ô). 

If (X, 0), (Y, 0") are chain complexes over 
A, an A-homomorphism f: X > Y of degree 
0 satisfying d'o f =f o0 (ie., 0, f, — f, 10,) 
is called a chain mapping of X to Y. For a 
chain mapping f, we have f(Z,(X)) c Z,(Y), 
f(B,(X)) c B,(Y), and hence f induces an A- 
homomorphism f,: H(X) H(Y) of degree 0, 
which is called the homological mapping in- 
duced by f. We have (15), = ln, and (go f), 
— g, O f, for chain mappings f: X — Y and g: 
Y>Z. 

Let f, g: X ^ Y be two chain mappings. If 
there is an A-homomorphism D: X > Y of 
degree +1 such that f —g— Doó--0'oD, we 
say that f is chain homotopic to g and write 
f ~g; Dis called a chain homotopy of f to g. 
If f is chain homotopic to g, we have f, — 
dg, H(X)- H(Y). For chain complexes X 
and Y, if there are chain mappings f: XY 
and g: Y2 X such that fog 1, and gof —1,, 
we say that X is chain equivalent to Y. In this 
case f,: H(X) H(Y) is an isomorphism and 
gą: H(Y) H(X) is its inverse. 

Let (X, 0) be a chain complex over A and 
Y= X, Y, be a homogeneous A-submodule 
of X such that 0Y c Y. Then Y and X/Y are 
chain complexes over A with the boundary 
operators induced by ô. Y is called a chain 
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subcomplex of X, and X/Y is called the quo- 
tient chain complex of X by Y or the relative 
chain complex of X mod Y. For a chain com- 
plex X and its subcomplex Y we have an 
texact sequence 0 Y 5 X 5 X/Y-0, where i is 
the *canonical injection and j the tcanonical 
surjection. 

Let (W, 0’), (X, 0), (Y, 0") be chain com- 
plexes over A, and f: WX, g: X — Y be chain 
mappings such that 02 WX 5 Y-0 is 
exact. Then an A-homomorphism 0,: H(Y)— 
H(W) of degree — 1, called the connecting 
homomorphism, is defined by ô (y + B(Y)) 
—fo0og '(y)+ B(W) (yeZ(Y)), and we 
have the exact sequence of homology: 


(W)5H,(X)5H,(¥) 
SHAW) SHE AIX) SE UPS 
For a ‘commutative diagram 
0—^W >X >Y 0 

ye d. du (1) 
02W'5X'2Y'20 


consisting of chain complexes and chain 
mappings in which each row is exact, we have 
0,0V,—9,00,: H(Y)O H(W). 

For the *inductive limit lim X, of chain com- 
plexes X; over A, we have 


H(lim X,)=lim H(X,). 


A chain complex X is said to be positive if X, 
— 0 for all n « 0. If X is a positive chain com- 
plex over A and M is an A-module, then we 
mean by an augmentation of X over M an A- 
homomorphism e: X, M such that the com- 
position X, Das is trivial:£00; 20. A 
positive chain complex X together with an 
augmentation ¢ of X over M is called an aug- 
mented chain complex over M. It is said to be 
acyclic if the sequence 


X SE eg py "hy pU 


is exact, namely, if H,(X)=0 (n 40) and e 
induces an A-isomorphism H,(X)= M. In this 
case X is also called a left resolution of M. 
Moreover, if each X, is a fprojective A- 
module, X is called a left projective resolution. 
For any A-module M, there exists a left pro- 
jective resolution of M. 

Let à: M  M' be an A-homomorphism of A- 
modules, and X, X' be augmented chain com- 
plexes over M, M' having augmentations e, £', 
respectively. Then a chain mapping f: X > X' 
satisfying £'o fy — 0 is called a chain map- 
ping over a. If X, X' are left projective reso- 
lutions of M, M', respectively, then there exist 
chain mappings of X to X’ over a, and any 
two such mappings are chain homotopic. In 
particular, a left projective resolution of an A- 
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module M is uniquely determined up to chain 
homotopy. 


D. Tor 


Given a right A-module M and a left A- 
module N, Z-modules Tor? (M, N) (n— 

0, 1,2, ...), called the torsion products (or Tor 
groups), are defined as follows: Let 


KEE, AY 5NS0 


be a projective resolution of N, and consider 
the chain complex 


(M & 4Y,1@0):... 
ER 


>M Q ,.Y, M Q 4,Y,-.,2... 
IS M Gi AYo 20 

obtained by forming the *tensor product of M 
and Y. Then we see that the homology module 
HAM ® 4Y) is uniquely determined for any 
choice of left projective resolution of N. We 
define TorZ(M, N) - H,(M @ 4Y). In partic- 
ular, we have Tord(M, N)z& M Q 4N. 


Properties of Tor. (1) If M is a tflat A-module, 
we have Tor4(M, N)=0 (n2 1,2, ...). 

(2) An A-homomorphism f: M, >M, in- 
duces a homomorphism f, : Tor? (M, N)> 
Tor? (M;, N). We have (14), — 1, and (go f), = 
Gx Of, for f:M,>M,, 9:M,>Ms3. 

(3) For an exact sequence 05 M, 4M,% 
M,-0, we have the following exact sequence 
of Tor: 


... 3Tor4(M,, N) S TorA(M,, N) 
% Tor4(M,, N)3Tor4 ,(M,, N)>... 
+Tor4(M;, N) 5M, 9 NM, G ,N 
>M, &® 4N —0, 


where 0, are the connecting homomorphisms. 
(4) For a commutative diagram 


025 M,—5 M,—5 M,—0 


Le } |e 
0—^ M1, M5 M30 


of A-modules and A-homomorphisms with 
exact rows, we have 0,0 V, — 0,006,. 

(5) Tor; (È, M;, N) E X; Tor; (M;, N) 

(6) Tor; (lim M,, N)=lim Tor4(M,, N). 

On the other hand, take a left projective 
resolution X of M and consider the chain 
complex X © ,N. Then we have H,(X & ,N) 
= Tor4(M, N) for n=0, 1,.... Therefore 
properties similar to (1)-(6) hold with respect 
to the second variable N of TorZ(M, N). 

(7) If A? is a ring tanti-isomorphic to A, 
then Tor4(M, N)z Toré (N, M). In particular, 
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if A is commutative, then Tor4(M, N) is an A- 
module and we have TorZ(M, N)z Torz(N, M). 
(8) Let A be a tprincipal ideal ring. Then 
Tor4(M, N) 20 (n22,3, ...) and Tor?(M, N) is 
also denoted by M x ,N. For an exact se- 
quence 05 M, — M; M,—0O, we have the 
exact sequence 05 M, * NS M,* N> 
M3* NOM, & ,NOM;G ANM, GN: 
0. In particular, Z*,N =0 and (Z/nZ)«,N z,N 
(={xeN|nx=0}). 


Universal Coefficient Theorem for Homology. 
If (X, 6) is a chain complex over A and N isa 
left A-module, then (X & ,N,é@ 1) is a chain 
complex. If A is a principal ideal ring and each 
X, is a ttorsion-free A-module, then we have a 
formula 


H,(X @ 4N) 
zH,X)G aN + H, (X) 4N, 


called the universal coefficient theorem. 


E. Double Chain Complexes 


By a double chain complex (X, ,, 0’, 0") over 
A we mean a family of left A-modules X, , 
(p.q€ Z) together with A-homomorphisms 
0,4 X pig? X5-1,, and 0 X, >X p.a- such 
that Clan aert 005,47 05,4 1005, 
+ 671,406, ,=0. We define the associ- 

ated chain complex (X,, 8) by setting X,= 

3 qon p,a On =Z p+q-n0p EN We call 0 
the total boundary operator, and 0’, o" the 
partial boundary operators. 

Given a chain complex X consisting of right 
A-modules and a chain complex Y consisting 
of left A-modules, a double chain complex 
(Zp, 0’, 0") is defined by setting Z, ,— 

X,G 4Y,, Gare DÉI I, Ong=(—1)71 e Ôg» 
where Ge, 6, ate the boundary operators of X, 
Y, respectively. It is called the product double 
chain complex of X and Y, and the homology 
module of its associated chain complex is 
denoted by H(X © 4Y). With respect to this 
homology module, the following facts hold. If 
X is a left projective resolution of a right A- 
module M and Y is that of a left A-module N, 
then H,(X @ ,Y)=Tor4(M, N). If A is a prin- 
cipal ideal ring and each X, is a torsion-free A- 
module, then we have the formula 

H,(X @4Y)= Y, H,(X)® 4H,(Y) 


ptq=n 


+ $ H,(X)* 4H,(Y), 


ptq=n-1 
the Kiinneth theorem. 
F. Cochain Complexes 


By a cochain complex (X,d) over A we mean a 
graded A-module X together with an A- 
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homomorphism d: X >X of degree +1 such 
that dod —0; d is called the coboundary oper- 
ator or differential. For a cochain complex 
(X,d), we denote by X" the component of 
degree n of X, and by d", X" X"*! the restric- 
tion of d on X". Then a chain complex (Y, ĉ) is 
defined by Y, = X " and 0,: Y, 5 Y, , is equal 
tod™:X na ER 

For a cochain complex (X, d), we write 
Kerd" = Z"(X), Kerd = Z(X) (Z(X)2 3 Z"(X)), 
Imd”! = B'(X), Imd = B(X) (B(X) e X, B'(X)), 
and Z"(X)/B'(X)— H'(X)), Z(X)/B(X) = 
H(X) (H(X)= X H'(X)). These modules 
Z(X)(Z"(X)), B(X) (B'(X)) and H(X) (H'(X)) 
are called the module of cocycles, the module of 
coboundaries, and the cohomology module of 
X, respectively. If we consider the associated 
chain complex (Y, ô) of (X,d), then H .,(Y) 
corresponds to H"(X). In this way, results on 
chain complexes give results on cochain com- 
plexes. Thus the concepts of cochain mapping, 
cochain homotopy, cochain equivalence, cochain 
subcomplex, and relative cochain complex can 
be defined as in the case of chain complexes in 
B, and we have corresponding results. In par- 
ticular, given an exact sequence 0 WSX45Y 
—0 of cochain complexes and cochain map- 
pings, the connecting homomorphism d, : H"(Y) 
— H"*! (W) is defined, and the exact sequence 
of cohomology 


SAW) SAX) Ha 
S Hw) gn xo) gv)... 
exists. For a commutative diagram 
0^5W >X >Y 50 
le Low 
0-W'> x’ Y'>0 





of cochain complexes and cochain mappings 
with exact rows, we have d, of, =9, od,. 

A cochain complex X is said to be positive if 
X"=0 for n<0. If X is a positive cochain 
complex over A and M is an A-module, we 
mean by an augmentation of X over M an A- 
homomorphism e: M — X? such that the com- 
position MX o4, X is trivial. If the sequence 


0 n 
oM SK Seas xeu. 


is exact, X is called a right resolution of M. 
Moreover, if each X" is an tinjective A- 
module, X is called a right injective resolution 
of M. For any A-module M, there exists a 
right injective resolution of M, and any two 
such resolutions are cochain homotopic. 


G. Ext 


Given left A-modules M and N, Z-modules 
Ext'4(M, N) (n=0, 1,2, ...), called the Ext 
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groups, are defined as follows: Let X :... X, 
—X,-12...— X9 M 50 be a projective reso- 
lution of M, and consider the cochain complex 


Hom ,(X, N): 
0>Hom,(Xo9, N)>...>Hom,(X,_,,N) 
+Hom,(X,, N)... 


obtained by forming the tmodule of A- 
homomorphisms. Then we can show that the 
cohomology module H"(Hom ,(X, N)) is 
uniquely determined for any choice of projec- 
tive resolution of M. We define Ext",(M, NI 
= H"(Hom (X, N)). This can also be defined 
as the cohomology module H"(Hom,(M, Y)) 
of the cochain complex Hom (A. Y):0> 
Hom,(M, Y9?)5 ...2Hom,(M, Y" )5 
Hom,(M, Y")5 ..., where Y:023N2Y?5... 
Y"! Y"... isa right injective resolu- 

tion of N. Furthermore, for a left projective 
resolution X of M and a right injective re- 
solution Y of N, we see that Ext',(M, N) is 
isomorphic to the cohomology module 
H"(Hom,(X, Y)) of the associated cochain 
complex of the double cochain complex 
Hom,(X, Y)=(Hom,(X,, Y°), d', d"), where 
d, a: Hom, (Xp, Y*) 5 Hom,(X,.,, Y?) and 

d; ,:Hom,(X,, Y?) Hom,(X,, Y**!) 

are given by d ,(u) HO Gan, dp 4(u)= 

(—1) +1 d* ou (ue Hom (X,, Y4)) by using 
the boundary operator ô of X and the co- 
boundary operator d of Y. 


Properties of Ext. (1) We have Ext9(M, N) = 
Hom ,(M, N). 

(2) If M is a projective A-module or N is 
an injective A-module, then Ext",(M,N)=0 
(n=1,2,...). 

(3) An A-homomorphism f: M, >M, 

(resp. f: N,— N5) induces a homomor- 

phism f *: Ext} (M2, N) S Ext'4(M,, N) (resp. 
f,'Ext4(M, N,) 2 Ext'4(M, NL We have 

1% =1 and (gof)*=f* og* for f: M,5M,, 
g: M; M, (resp. 1y, =1 and (go f),—9,0 f, 
for f: N,9 N5,g: N5 Ny). 

(4) For an exact sequence 0 M, — M,—5M, 
—0 (resp. O2 N, N,5 N,—0), we have the 
exact sequence of Ext: 


05 Hom,(M;, N) Hom,4(M;,, N) 
5Hom,(M,, N)S Extl(M,, N) 
—Ext)(M,,N)>... 
(resp. Oo Hom (M, N,)- Hom(M, N>) 
5Hom(M, N4) Exti(M, N,) 
+ Ext)(M,N,)>...) 


I] J - I] Ext'(M,, Ng). 


a B 


732 


(6) If A is a principal ideal ring, then 
Ext, (M, N) 20 (n—22,3, ...), and Extl(M, N) 
is also denoted by Ext ,(M, N). In particu- 
lar, Ext,(Z, N) 20, Ext,(Z/nZ, N) z N/nN, 
Ext,(M, Q/Z) =0, Ext; (M, Z/nZ) = M/nM, 
where M = Hom;(M, Q/Z). 


Universal Coefficient Theorem for Coho- 
mology. If X is a chain complex over a prin- 
cipal ideal ring A such that each X, is a free A- 
module, then for any A-module N we have the 
formula 


H"(Hom,(X, N)) 
= Hom ,(H,(X), N) a Ex AH, - (X), N), 


the universal coefficient theorem. This is gen- 
eralized as follows: Let X be a chain complex 
and Y a cochain complex, both over a prin- 
cipal ideal ring A. Assume that each X, isa 
free A-module or that each Y" is an injective 
A-module. Then we have the formula 


H'(Hom,(X, Y)) 
= Y Hom,(H,(X), H*(Y)) 


ptq-n 


+ YX Ext,(H,(X), H*(Y)) 


ptq-7n-1 


(— 201 Homology Theory). 


H. Complexes in Abelian Categories 


We mainly consider general Abelian cate- 
gories @. Consideration may, however, be 
restricted to the tcategory (Ab) of Abelian 
groups (whose fobjects are Abelian groups and 
whose tmorphisms are homomorphisms) or 
the tcategory g% of R-modules. 

A (cochain) complex C in an Abelian cate- 
gory € is a graded object ICH) in € with 
differentials d": C"— C"*! subject to the con- 
dition that d"*! od" —0 (ne Z). The nth coho- 
mology H"(C) of C is defined by the texact 
sequence 05 B"(C)— Z"(C)— H"(C)0, where 
B"(C) and Z"(C) are objects representing 
Im d" ! and Kerd", respectively. The complex 
C is called positive (negative) if C" —0 for n «0 
(n 0). We sometimes interchange positive 
superscripts and negative subscripts and write 
C. , instead of C". Then the differentials 
become d,: C, — C, ,, and C is then called a 
chain complex. The quotient of Kerd, =Z,, by 
Imd,,, — B, is called the nth homology H,(C). 
Negative complexes are usually described in 
this manner. When C", Z", B", and H” are 
sets, as in the category 47 of R-modules, 
their elements are called cochains, cocycles, 
coboundaries, and cohomology classes, respec- 
tively. Similarly, in the group C, of chains, 
residue classes of cycles (€Z,,) modulo bound- 
aries (cB,) are called homology classes (€H,). 
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A morphism (or chain transformation) f: C 
C' is a "natural transformation of the com- 
plexes considered as tfunctors Z9 €; i.e., 

f is a family of morphisms f": C" C" (neZ) 
satisfying f""' od" —d"o f". It induces a 
morphism of cohomology H"(C)> H"(C’). 

A subcomplex of C is an equivalence class 

of *tmonomorphisms DC, usually denoted 
by any representative D of the class. A (chain) 
homotopy between two chain transforma- 
tions f, g: CA C' is a family of morphisms 
h":C" 5 C"! (neZ) satisfying f"—g"— 

h"*! od"-- d"! oh". If there exists a homo- 
topy between f and g, then f and g induce the 
same morphism of cohomology. A morphism 
ICC is called a (chain) equivalence if there 
exists a morphism f':C'—C such that f'o f 
and fof’ are homotopic to the identities of C 
and C’, respectively. In this case we have 
H"(C)z H"(C"). An exact sequence of com- 
plexes 02 C' 5 C 2 C" 0 gives rise to the 
connecting morphisms H"(C")> H"*'(C') 
(neZ), and the resulting sequence ... ^ 

H" (C^) H"(C)5H"(C) H'(C").5 

H"*! (C')5 ... is exact (the exact cohomology 
sequence), and similarly for homology instead 
of cohomology. An object Ae@ defines a 
complex (also denoted by A) such that A? = A, 
d? —0. A positive complex C together with a 
morphism e: A C is called a complex over A, 
and e is the augmentation. A complex C over A 
is acyclic if 05 A C9 5 C! >... is exact. An 
acyclic positive complex over A is called a 
right resolution of A. Let IC e}, (C', el be 
complexes over A and A’, respectively, and x a 
morphism A A’. Then a morphism f: C C' 
satisfying foe-e' oats called a morphism 
over a. For a negative complex C, we define 
similarly augmentations c: C A, acyclicity, 
left resolutions, etc. 

A bicomplex (or double complex) C in € 
consists of objects C”! (p, qe Z) and two dif- 
ferentials dj: C^*— C?* 3, du: CP 4 Cp 
subject to d? = d? =0 and dén = dg d, (some- 
times replaced by anticommutativity, dydy + 
dydi UL Morphisms of bicomplexes are de- 
fined as for single complexes. A bicomplex 
C becomes a (single) complex if we put C"— 
2544-2 C"? (when the sum exists) and define 
the differential d to be d; -- ( — 1? dy on CP? 
Then d is called the total differential and d. dj 
the partial differentials. On the other hand, Cf 
={C™1(pe Z), dj) constitutes a complex for 
each q, whose cohomology H(CT) is denoted 
by H?(C*). Then dy induces morphisms Hf(C?) 
 Hf(C^*!), so that we obtain a complex 
H[(C). The cohomology of H?(C) is denoted 
by Hf(HP(C)). We define HP(H$(C)) similarly. 
The cohomology of C with respect to the total 
differential is denoted simply by H"(C). Similar 
constructions are applied to double chain 
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complexes {C, ,} and further to multiple 
complexes, as we shall show in the case of 
bicomplexes. 

Let T be a *bifunctor €, x €, €' and C; be 
complexes in @; (i= 1, 2). Then T(C,,C,) is a 
bicomplex in @’. For instance, Hom(C', C) is a 
positive (bipositive) complex if C(C’) is a posi- 
tive (negative) complex in €. If C, C' are com- 
plexes in Wp, pM, respectively, the *tensor 
product C ® gC’ is a complex in (Ab) (the 
product complex). There is a canonical mor- 
phism H,(C) & H,(C')5 H,,,(C & C). If C, 
and B, are "fat for all ne Z, we have the fol- 
lowing exact sequence (Künneth's formula): 
05 Y, H(CO G H(C)5H,(C & C) 


ptq-n 


> A Tor,(H,(C), H,(C’))+0 
ptq=n-1 

(for the definition of Tor — Section D). For C' 
=A€p.W, Künneth's formula reduces to the 
exact sequence 0 H,(C) ® A> H,(C & A) 
Tor, (H,_,(C), A)—>0 (universal coefficient 
theorem). The corresponding exact sequence 
for cohomology is 


0 Ext! (H, ,(C), A)2 H'(C, A) 
5 Hom(H,(C), A)50 


(— Section G; 201 Homology Theory). 


I. Satellites and Derived Functors 


Let € and @’ be Abelian categories. All func- 
tors in this section are tadditive. A tcovariant 
functor Tee" is called exact if T maps 
every exact sequence in € to an exact sequence 
in €". T is called half-exact, left exact, or right 
exact if for every short exact sequence 0— A 

5 B—C-60, the sequence T(A)— T(B) T(C), 
05 T(A)> T(B)> T(C), or T(A) > T(B) T(C) 
—0, respectively, is exact. Similar definitions 
apply for *contravariant functors. The functor 
Hom: @ x € (Ab) (which defines the category 
€) is left exact in both factors. An object P is 
projective if h,(-)= Hom(P,) is exact, while Q 
is injective if h2(-)= Hom(-, Q) is exact. If 
every object A admits an tepimorphism from 
a projective object P— A (resp. fmonomor- 
phism into an injective object A Q), € 

is said to have enough projectives (injec- 
tives). An object G is called a generator (cogen- 
erator) if the natural mapping Hom(A, B) 
Hom(h,(A), hg(B)) (Hom(h*(B), h*(A))) is 
one-to-one. 

An Abelian category € is called a Gro- 
thendieck category if (1) € has a generator, (2) 
*direct sums always exist, and (3) the identity 
(UU A90 B— | J(4; B) holds for any object A, 
its *subobject B, and a *totally ordered family 
(Ai) of subobjects. A Grothendieck cate- 
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gory has enough injectives (R. Baer, 1940, 
for (Ab); A. Grothendieck, 1957, for general 
€). A monomorphism into an injective ob- 
ject f: AQ is called an injective envelope if 
Im f Img #0 for any nonzero monomor- 
phism g: BQ. Every object A in a Grothen- 
dieck category admits an injective envelope, 
which is unique up to isomorphism (B. Eck- 
mann and A. Schopf, 1953, for p; B. Mit- 
chell, 1960, for general €). 

We say that a covariant d-functor € — €" is 
given if we have a sequence of covariant func- 
tors T- (T':€ ^") and the connecting mor- 
phisms 0: T'(4") T'*! (4^) for an arbitrary 
short exact sequence 0— 4'— A> A” 0 satis- 
fying the following conditions: (i) do T'( f") 
=T'*'(f')oé for a morphism f of short 
exact sequences; and (ii) the sequence ...— 

T (47) 5 TA) STA) T(A7) S Ti (4) 
... constitutes a complex. T= {T°} is called a 
covariant 0*-functor if instead of ô there are 
given O ` T'(A") TŻ! (A') satisfying similar 
conditions (i*) and (ii*). By taking duals, we 
define the notion of contravariant ð- and 0*- 
functors. They are also called connected se- 
quences of functors. A 0-(0*-)functor defined 
for —oo «i« +00 is called a cohomological 
functor (homological functor) if the sequence in 
condition (ii) (resp. (ii*)) is always exact. A 
morphism of -functors f: S — T consists of 
natural transformations f*:S'— T' that com- 
mute with the connecting morphisms. A ð- 
functor S defined for a i « b is called universal 
if for any 6-functor T defined in the same in- 
terval and any natural transformation o "A7 
T*, there exists one and only one morphism 
f:S—T such that f^— o. Let F:€ 5 €' bea 
covariant functor and b any positive integer. A 
universal covariant 6-functor S defined for 
Ox i «bis called a right satellite of F if S? = F 
(S is then denoted by (S'F]). If such an S 
exists, then it is unique and satisfies Si+! (F) = 
S'(S'F). If € has enough injective objects, 

the right satellites always exist, and if F is left 
exact, then {S'F} is a cohomological functor. 
The universality of 6*-functors is defined by 
reversing the arrows; the satellites {S;F} are 
then written as (S 'F] and called the left 
satellites. 

Let € be an Abelian category with enough 
injectives. An injective resolution of an object A 
is a right resolution Q = (Q"] such that all Q* 
are injective. Every A admits an injective reso- 
lution, which is unique up to chain equivalence 
(H. Cartan, 1950). For a covariant functor F: 
€ >’, the functor A—H'(F(Q),, called the ith 
right derived functor R'F of F, is independent 
of Q. {RİF} is a cohomological functor. By the 
universality of satellites, there exists a mor- 
phism of é-functors {S'F}—{R'F} which is an 
isomorphism if and only if F is left exact. The 
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left derived functors L;F of a contravariant 
functor F are defined similarly and are isomor- 
phic to the left satellites when F is right exact. 
If € has enough projectives (instead of injec- 
tives), we define left (right) derived functors of 
covariant (contravariant) functors via projec- 
tive resolutions. For a multifunctor, we define 
partial derived functors as well as (total) de- 
rived functors of the functor viewed as a func- 
tor defined in the tproduct category. For 
instance, let T(A, B):€, x €, €' be contra- 
variant in A and covariant in B. When €, has 
enough injectives, we obtain R} T(A, B) — 
H'(T(A, Q)) using an injective resolution Q 

of B. Suppose that T satisfies condition (i) 

A — T(A, B) is exact for any injective B. Then 
for a fixed injective B, R} T(A, B) is a cohomo- 
logical functor in A. When A has a projec- 
tive resolution P in €,, we obtain Ri T(A, B) 

— H'(T(P, B)) as well as the equation for the 
total derived functor R'T(A, B)  H'(T(P, Q)). 
We say that a functor T is right balanced if it 
satisfies (i) and also (ii) B T(A, B) is exact for 
any projective A. In this case, the three derived 
functors are isomorphic. The left balanced 
functors are defined similarly. When the right 
derived functors of the functor Hom (which. 
defines the category) exist, they are denoted by 
Ext'(A, B). 


J. Spectral Sequences 


In this section, we deal with cohomology in 
the category pH of R-modules. A similar 
theory for homology is obtained by modifying 
the theory in a natural way. Similar construc- 
tions are also possible for general Abelian 
categories [3, 8]. 

A filtration F of a module A is a family of 
submodules ( F"(A)|pe Z} such that F?(A)> 
F?*'(A). We say that the filtration F is con- 
vergent from above (or exhaustive) if | ), F?(A) 
— A, and F is bounded from below (or dis- 
crete) if F?(A)=0 for some p. The tgraded 
module G(A)= {G?(A)=F?(A)/F?*'(A)|peZ} 
is said to be associated with A. A morphism of 
filtered modules f: A> A’ is a module homo- 
morphism such that f(F?(A)) c F(A’). It in- 
duces a homomorphism of the graded modules 
G(A)>G(A’). A filtration of a complex C= 
{C",d} consists of subcomplexes F?(C)= 
(F*(C")] such that F'(C)2 F?'! (C). We 
assume that the complex C satisfies | ), F"(C) 
=C, and is bounded from below; i.e., for every 
n there exists some p such that F?(C")=0. In 
particular, if F?(C) — C, F?*! (C7?) «0, the com- 
plex C is called canonically bounded. Writing 
CP — G?(C?*4), we obtain a *bigraded module 
1C?1, in which p, q, and p+q are called the 
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filtration degree, the complementary degree, 
and the total degree, respectively. 

A spectral sequence | E,) with a graded 
module D — (D"| as its limit (denoted by 
E} 1 = ,D") consists of a family of doubly 
graded modules E, - (EP*|p, qe Z] (r>2 or 
sometimes r > 1) and differentiations d,: EP*— 
EP*"4a-'*! (p, qe Z) of degree (r, 1 —r) satisfy- 
ing d? =0 and satisfying the following two 
conditions: (1) H(E,) (with respect to d,) is 
isomorphic to E,,, (hence there exists a se- 
quence of graded submodules of E,:0= B, c 
B4c ...CZ4cZ; = E, such that Z,/B,z E,); 
and (ii) there are submodules Z,, and B,, such 
that (J|, B, C B, € Zo c(,Z,, and E, = 
Z ,/ B,, is isomorphic to the doubly graded 
module associated with a certain filtration F of 
D (ie., E81  G'(D?**)). We assume that Z,, = 
( Y, Zx and B,, = | J, B, (weak convergence). 
Suppose that F is convergent from above and 
bounded from below and that Z,(E£*) is 
stationary for every p,q. Then {E,} is called 
regular. ( E. 1 is bounded from below if for every 
n there exists a py such that EP" ?—0 for 
p € po. In particular, if E3-4=0 (p<0,q <0), 
then {E,} is called the first quadrant (or coho- 
mology spectral sequence). In the latter case, 
the edge homomorphisms EE" — EH. E94, 
E? are defined through base terms EP? and 
fiber terms E^, respectively. A morphism 
of spectral sequences f: (E,, Dj O (Ej, D'| con- 
sists of f.: E, E; of degree (0,0) and f:D>D' 
of degree 0 which preserve the mechanism of 
spectral sequences. When the spectral se- 
quences are regular, a morphism f is an iso- 
morphism if one of the f, is an isomorphism. 
Addition is naturally introduced in the set of 
morphisms so that spectral sequences form an 
additive category. An additive functor from 
an Abelian category € to this category is 
called a spectral functor. A filtered complex 
(C, F} gives rise to a spectral sequence E2" => 
G(H(C)) if we put Z? = (ae F"(C)|dae F"*'(C)] 
B? =dZ?, EP- ZIKZE + BP), E, X, EP. 
A double complex C = (C5, dy, dy} admits two 
natural filtrations Fj: Ff(C) - €,., 3, C** 
and Fy: FACC) Än €, C". By the pro- 
cedure above, these filtrations give rise to 
spectral sequences Hf(Hi(C)) > ,H"(C) and 
H$(Hf(C)) > ,H"(C), respectively. Compari- 
son of these sequences yields many useful 
results. Let T be an additive covariant func- 
tor from an Abelian category € to pM, C be 
a complex in €, and Q — (Q^! be an injec- 
tive resolution of C. The double complex Q 
gives rise to spectral sequences H?(R47T(C)) => 
H(T(Q)) and R? T(H*(C)) > H(T(Q)). The 
limit H(T(Q)) is independent of Q and is 
called the hypercohomology of T with respect 
to C [2, 8]. We can similarly define hyper- 
cohomology of multifunctors. The theory of 
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spectral sequences was initiated by J. Leray 
(1946), and suitable algebraic formulations 
were given by J. L. Koszul (1950). 


K. Categories of Modules 


The category gM (resp. Mpg) of left (right) R- 
modules over a tunitary ring R is not only an 
Abelian category but also a Grothendieck 
category (— 277 Modules). The "full embed- 
ding theorem permits us to deduce many pro- 
positions about general Abelian categories 
from the consideration of o. An object P of 
rM is projective if and only if it is isomorphic 
to a direct summand of a ffree module. Any 
projective module is the direct sum of count- 
ably generated projective modules (I. Kap- 
lansky, 1958). It follows that any projective 
module over a flocal ring is free. Finitely 
generated projective modules P, and P, 

are said to be equivalent if there exist finite- 
ly generated free modules F,, F, such that 

P, OF, =P, Q F,. The equivalence classes 
then form an Abelian group with respect to 
the direct sum construction called the projec- 
tive class group of a ring R. The category of 
complex vector bundles over a compact space 
X 1s equivalent to the category of projective 
modules over C(X), the ring of complex- 
valued continuous functions on X, and simi- 
larly for other types of spaces and bundles. 
Many investigations have been made involv- 
ing the problem of whether every projective 
module over a polynomial ring is free (J.-P. 
Serre, 1955). This problem was settled affirma- 
tively by D. Quillen [13] and independently 
by A. Suslin. It has been observed that “big” 
projective modules are often free: for example, 
nonfinitely generated projective modules over 
an tindecomposable weakly Noetherian ring 
are free (Y. Hinohara, 1963). 

The nth right derived functor of Hom,(A, B) 
is denoted by ExtR(A, B) (— Section G). This is 
a bifunctor së x pM — (Ab), contravariant in 
A and covariant in B. Ext? is isomorphic to 
and identified with Hom ,. An exact sequence 
Q0 4'—5 A A" 0 gives rise to the con- 
necting homomorphisms A": Ext; (A', B) 
Ext}! (A", B), and the following sequence 
is exact: ...— Ext ! (A', DÄ Extk(A”, B)— 

Ext (A, B) > Ext} (A', B) 3 Ext% (A", B)... 
(the exact sequence of Ext). Similarly, an exact 
sequence 0 B' ^ B5 B" 50 gives rise to 

A": Ext} (A, B") 5 Ext; (A, B’) and to an exact 
sequence of Ext. An extension of A by B (or of 
B by A) is an exact sequence (E):05 B5 X > 
A 0. The set of equivalence classes of exten- 
sions of A by B is in one-to-one correspon- 
dence with Ext,(A, B) by assigning to (E) its 
characteristic class y, = A?(1)e Ext}(A, B), 
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where 1 denotes the identity of Hom,(B, B). In 
this correspondence, the sum of two extensions 
is obtained by a construction called Baer's 
sum of extensions. Similarly, Ext,(A, B) is 
interpreted as the set of the equivalence classes 
of n-fold extensions 025 Bo X, ,2...5 X92 
A-0 (exact). This point of view permits us to 
establish a theory of Ext, etc., in more general 
(additive) categories lacking enough projec- 
tives or injectives (N. Yoneda, 1954, 1960). 

The tensor product A © gB is a right exact 
covariant bifunctor Wp ® pH —(Ab). If the 
functor tp(:) — GG P is exact, P is called a tflat 
module. A projective module is flat. In general, 
a flat module is the tinductive limit of finitely 
generated free modules (M. Lazard, 1964). A 
flat module P is called tfaithfully flat if P zmP 
for every maximal ideal m of R. The functors 
Gi and Hom are related by tadjointness (— 52 
Categories and Functors). From this view- 
point & can be introduced in more general 
categories. Left-derived functors of A & 4B are 
denoted by Tor®(A, B) and are called nth tor- 
sion products of A and B. Tor?(A, B) is often 
denoted by A * gB. The functor &, is left 
balanced, hence Tor is calculated by using 
projective resolutions of A, B, or both A and 
B. We have Tor = &,. An exact sequence 0— 
A' — A A" 50 gives rise to A,,:Tor®, ,(A", B) 
Tor®(A’, B) and the infinite exact sequence of 
Tor, and similarly for the second variables. 

From the various relations between Hom 
and ® follow the corresponding relations 
between their derived functors. When A and T 
are algebras over K and Q=A @T, we can 
define the external product ( T -product), which 
is a mapping 


T : Tor}(A, B) & Tor! (4’, B") 


Tor? (A & A’, B@ B^. 


In particular, if A and T are K-projective and 
Tor*(A, 4") « 0 (n7 0), then we can define the 
wedge product (\/-product) V: Ext? (A, B) & 
Ext£(A', B') > Ext2*4(A Q A’, B® B). The latter 
is described in terms of the composition of 
module extensions. When K —A- =Q, 

the T -product reduces to the internal product, 
called the n-product. If A is a *Hopf alge- 

bra over K, the tcomultiplication A—A @A 
induces Ext, G4 — Ext,. This, combined with 
the V-product, yields the cup product (—- 
product) —: Ext} (A, B) © Ext4(A’, B) 
Ext*4(A & A’, B® B'). We define similarly 

JL -product, ^-product, u-product, and — - 
product (cap product) [2]. Let A, T, and 2 be 
algebras over K, with A K-projective; let 

A€ Mf gr, BEAM z, CE Mrez and assume 
Tor}(A, B)=0 (n7 0). The natural isomor- 
phism Hom,gr(A, Hom;(B, C) = Homyg 5 
(A € ,B, C) then yields a spectral sequence 


756 


Ext? or (A, Ext$(B, C)) >, Ext x (A & AB, C) 
by the double complex argument in Section D. 
The homological dimension h dim, A, 
dh, A, or projective dimension proj dim, A of 
A€ gf is the supremum (<œ) of n such 
that Ext,(A, B) 40 for some B. The relation 
hdimg A <0 means that A is projective. The 
injective dimension inj dim, B of Be pM is 
defined similarly by means of the functor 
Extk(-, B), and the weak dimension w dim, C 
of Ce gM by the functor Tor®(-, C). The 
common value supíproj dim, A | Ae RM} 
—sup (inj dim, B| Be 4.7] is called the left 
global dimension 1 gl dim R of R. It is identical 
with the supremum of homological dimensions 
of tcyclic modules (M. Auslander, 1955). The 
right global dimension r gl dim R is defined 
similarly. The common value sup(w dim, A | 
Ac Af.) 2 sup(wdimg C| Ce 4] ] is called the 
weak global dimension w gl dim R of R. We 
have w gl dim R <1 gl dim R, r gl dim R. The 
equality may fail to hold (Kaplansky, 1958). If 
R 1s *Noetherian, the three global dimensions 
coincide (Auslander, 1955) and are called 
simply the global dimension of R:gldim R. The 
condition 1 gl dim R 20 (or r gl dim R —0) 
holds if and only if R is an tArtinian semi- 
simple ring, while w gl dim R =0 if and only 
if R is a tregular ring in the sense of J. von 
Neumann (M. Harada, 1956). A ring R is 
called left (right) hereditary if 1 gl dim R « 1 
(r gl dim R < 1), and left (right) semihereditary 
if every finitely generated left (right) ideal is 
projective. A left and right (semi)hereditary 
ring is called a (semi)hereditary ring. Since 
projectivity and tinvertibility of an ideal of a 
(commutative) integral domain R are equiva- 
lent, R is hereditary if and only if R is a *Dede- 
kind ring. In this case, the projective class 
group of R reduces to the fideal class group. 
An integral domain R is semihereditary if and 
only if w gl dim R < 1 (A. Hattori, 1957), and in 
that case R is called a Prüfer ring. A tmaximal 
order over a Dedekind ring is hereditary. A 
commutative semihereditary ring R is charac- 
terized by the property that flatness of R- 
modules is equivalent to torsion-freeness (S. 
Endo, 1961). A Noetherian ring R is left self- 
injective if and only if R is tquasi-Frobenius 
(M. Ikeda, 1952), and the global dimension of 
a quasi-Frobenius ring is 0 or oo (S. Eilenberg 
and T. Nakayama, 1955). A polynomial ring R 
=K[X,,...,X,] over a commutative ring K 
satisfies gl dim R — gl dim K +n. When K isa 
field, this is a reformulation of Hilbert's theory 
of syzygy sequences (— 369 Rings of Poly- 
nomials). In this sense, the study of the global 
dimension of rings and categories is sometimes 
called syzygy theory (Eilenberg, 1956). The 
homological algebra of commutative Noe- 
therian rings has been studied extensively and 
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is useful in algebraic geometry. Since gl dimR 
=sup,, gl dim Rn (Rn is the tring of quotients 
relative to m), with m running over the max- 
imal ideals of R, the problem of determining 
gl dim R reduces to the case of flocal rings. A 
finitely generated flat module over a local ring 
R 1s free. If K denotes the residue field R/m, 
where m is the maximal ideal of the local 

ring R, Tor®(K, K) has the structure of a 
Hopf algebra (E. F. Assmus, Jr., 1959). De- 
tailed results concerning the Betti numbers 
dim Tor?(K, K) of R have been obtained (J. 
Tate, 1957, and others). In particular, R is 
tregular if and only if gl dim R < oo (Serre, 
1955). A local ring R is called a Gorenstein 
ring if the injective dimension of R-module R 
is finite. This is a notion intermediate between 
regular rings and *Macaulay rings (— 284 
Noetherian Rings). 

Consideration of a ring R in relation to a 
subring S leads to relative homological algebra. 
Foundations for this theory were established 
by Hochschild (1956). An exact sequence of 
R-modules that *splits as a sequence of S- 
modules is called an (R, S)-exact sequence. 

An R-module P is called an (R, S)-projective 
module if Hom, (P,:) maps any (R, S)-exact 
sequence to an exact sequence. (R, S)-injective 
modules are defined similarly. Based on these 
notions, Ext, s, and Tor®® are defined as the 
relative derived functors of Hom, and ® pz, 
respectively. We also have a relative theory 
from a different viewpoint (S. Takasu, 1957). 
Relative theory is extended to general cate- 
gories from various viewpoints [6,14] (— 
Section Q). 


L. Cohomology Theory for Associative 
Algebras 


Let A be an talgebra over a commutative 
ring K and A a *two-sided A-module. Let 
C" be the module of all n-linear mappings 
of A into A called n-cochains (C? — A). De- 
fine the MD operator 9": C^ C"*! 


Ss (^f i... 4 n41) =A f (Aas Anr) + 
=1(—1)'f(A oo SE 
“YY fü. ....À, An+1- 


me thus obtain a E whose coho- 
mology is denoted by H"(A, A) and is called 
the nth Hochschild's cohomology group of A 
relative to the coefficient module 4 (Hochs- 
child, 1945). A cochain f is called normalized if 
Ti, 4,) 20 whenever one of the å; is 1. We 
obtain the same cohomology group H"(A, A) 
from RE obe od of normalized cochains. 
{H"(A,-)} is a cohomological functor from the 
category 4/4 of two-sided A-modules to the 
category gM. Using the enveloping algebra A" 
=A ® x A9, where A is an anti-isomorphic 
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copy of A, a, can be identified with ,eW% 
(and M,e). If A is K-projective, {H"(A,-)} is 
isomorphic to (Ext4«(A, -)]. (In [2], H"(A, A) is 
defined as Ext; «(A, A) in general.) We have 
H°(A, A) 2 ae A| 4a — aA, VAe A}. We call 1- 
cocycles derivations (or crossed homomor- 
phisms) of A in A and 1-coboundaries inner 
derivations. Thus H'(A, A) is the derivation 
class group and is related to the *tramification. 
When K is a field, H!(A,-)=0 if and only if A 
is a fseparable algebra. In general, an alge- 
bra A over a commutative ring K is called a 
separable algebra if A is A^-projective, 1.e., if 
Ext1e(A,-) 20 (Auslander and O. Goldman, 
1960). This is a generalization of the notion of 
*maximally central algebras (Nakayama and 
G. Azumaya, 1948). We have a one-to-one 
correspondence of H?(A, A) to the family of 
algebra extensions of A with kernel A De K- 
algebras I' containing A as a two-sided ideal 
such that T/A =A) satisfying A? —0. Any 
extension of an algebra A over a field K such 
that H?(A,:)=0 splits over a nilpotent kernel 
(J. H. C. Whitehead and G. Hochschild). This 
holds in particular for a separable algebra, and 
we obtain the 'Wedderburn-Mal'tsev theorem. 
There are some interpretations of H?(A. A) in 
terms of extensions. 

The supremum ( € oo) of n such that 
H"(A, A)40 for some A is called the cohomo- 
logical dimension of A and written dim A. For 
a finite-dimensional algebra A over a field K, 
dim A « oo if and only if A/N is separable and 
gl dim A < oo, where N is the tradical of A (N. 
Ikeda, H. Nagao, and Nakayama, 1954). 

The homology groups H,(A, A) of A relative 
to a coefficient module A are defined similarly. 
If A is K-projective, {H,(A,-)} is isomorphic to 
Feet 


M. Cohomology of Groups 


The pair consisting of an algebra A over K 
and an algebra homomorphism e: A K is 
called a supplemented algebra [2] (or aug- 
mented algebra [6]), of which e is the augmen- 
tation. The *group algebra Z[G] of a group G 
over the ring of rational integers is a supple- 
mented algebra, in which the augmentation is 
defined by e(x)= 1 (x eG). The category of left 
G-modules is identified with the category of 
left Z[G ]-modules. For a finite group G, a 
finitely generated projective G-module is not 
necessarily free (D. S. Rim, 1959) and is iso- 
morphic to the direct sum of a free module 
and a left ideal of Z[G]. It follows that the 
projective class group of Z[G] is a finite group 
(R. G. Swan, 1960). The cohomology groups 
and homology groups of G relative to A€ gM 
(Eilenberg and S. MacLane, 1943) are defined 
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by H"(G, A)=Ext},¢)(Z, A) and H,(G, A)= 
Tor“l°1(Z, A), respectively. Their concrete 

description is given usually via the Z[G]- 

standard resolution of Z. 

(1) Homogeneous formulation. The group of 
homogeneous n-chains is the free Abelian group 
with basis G x ... x G(n+1 times), on which 
G operates by x(xo, ..., x,) 2 (XXo, ..., XXa) 
and the boundary operator is defined by 
EE MNS Lab Go. 

(2) Nonhomogencous formulation. The group 
of nonhomogeneous n-chains is the Z[G ]-free 
module with basis G x ... x G (n times), and 
the boundary operator is defined by d(x,, 

a MEK Oa sx LES D Gees 
ehn: X) T (7 D)" (x4, ..., X44). A non- 
homogeneous 2-cocycle is sometimes called 
a factor set. H°(G, A) is the submodule A9 
of A consisting of the G-invariant elements, 
while H,(G, A) is the largest residue class 
module A, of A on which G acts trivially. 
Given two groups G and K, an exact sequence 
of group homomorphisms 1—>K >E>G—>1 is 
called a group extension of G over the kernel 
K. When K is Abelian, the extension canoni- 
cally induces a G-module structure on K, and 
the deviation of K from being a semidirect 
factor of E is measured by a factor set. The 
group H?(G, A) is thus in one-to-one corre- 
spondence with the set of equivalence classes 
of the tgroup extensions of G over A which 
induce the originally given G-module structure 
on A (— 190 Groups N). This point of view is 
essential in the proof of the 'Schur-Zassenhaus 
theorem (— 151 Finite Groups). H?(G, A) is 
interpreted as the set of obstructions for exten- 
sions (Etlenberg and MacLane, 1947). For a 
free group F, H"(F, 4) 20 (n> 1). If a group 
G is represented as a factor group F/R ofa 
free group F, we have a group extension 1— 
KoEoGo1, where K 2 R/[[R, R] and E= 
F/[R, R]. Let ëe H?(G, K) correspond to this 
extension. Then for any G-module A, the cup 
product 7 5 y —é followed by the pairing 
Hom(K, A) ® K —^ A provides isomorphisms 
H"(G, Hom(K, A)) = H"*?(G, A) (n7 0) (the cup 
product reduction theorem of Eilenberg and 
MacLane, 1947); similarly, we have the reduc- 
tion theorem for the homology. The Z-algebra 
H(G, Z)= Ło H"(G, Z) under the multipli- 
cation defined by the cup product is finitely 
generated if G is a finite group (B. B. Venkov, 
1959; L. Evens, 1961). 

The following are mappings relative to a 
subgroup H. 

(1) The inner automorphism by xeG in- 
duces an isomorphism of H"(H, A) and 
H"(xHx +, A) which reduces to the identity 
of H"(G, A) if H 2 G. Hence if H isa normal, 
subgroup, H"(H, A) has the structure of a G/H- 
module, and similarly for H,(H, A). 
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(2) When H is a normal subgroup of G, the 
mapping (x,, ..., Xn} 2(x, H, ..., x, H) of non- 
homogeneous chains induces the inflation 
(or lift) Inf: H"(G/H, AP) H"(G, A) and the 
deflation Def: H,(G, A) H,(G/H, Ay). 

(3) The embedding of nonhomogeneous 

chains induces the restriction Res: H"(G, A) 
— H"(H, A) and the injection Inj (or corestric- 
tion Cor): H,(H, A) H,(G, A). The theory of 
"induced representation gives another con- 
struction of these mappings; that is, if we put 
1°(A) = Homzrj(Z [G]. A), Res is obtained by 
the isomorphism H"(G,1°(A))= H"(H, A) com- 
bined with the homomorphism induced by A 
—1*(A); while if we put 1¢(A)= Z[G] zum, 
Inj is obtained by the isomorphism H,(H, A)= 
H,(G,i15(A)) followed by the homomorphism 
induced by i5;(4) A. 

(4) If (G: H) « oo, we have :%(A)=1,(A). 

The composition of H"(H, A) H"(G, ig{A)) 

— H"(G, A) defines on the cohomology groups 
the Inj: H"(H, A) > H"(G, A), while the compo- 
sition H,(G, A) H,(G,19(A)) HAH, A) gives 
the Res: H,(G, A) ^ H,(H, A). In particular, 
Res: H,(G, Z) H,(H, Z) coincides with the 
Transfer G/[G, G]  H/[H, H]. 

(5) Let H be a normal subgroup of G. 
Consider the additive relation p (the corre- 
spondence) between he Z"(H, A)8 and fe 
Z"*1(G/H, AP) determined by p(h, f) if and 
only if there exists a ge C"(G, A) such that h= 
Resg and Inf f = ôg. If the relation induces a 
homomorphism H"(H, Am >H"! (G/H, AP), 
then it is called the transgression. If H'(H, A) 
—0(0«i«n), the sequence 05 H"(G/H, AP) 
> H"(G, A)JO H'(H, A6  H"* (G/H, AP) 
H"*!(G, A) composed of inflation, restric- 
tion, and transgression mappings is exact 
(Hochschild and Serre, 1953) and is called 
the fundamental exact sequence. This exact 
sequence can be derived from a certain spec- 
tral sequence H"(G/H, H*(H, A)) >, H"(G, A) 
(R. C. Lyndon, 1948; Hochschild and Serre, 
1953). 

The relative (co)homology theory relative to 
a subgroup (I. T. Adamson, 1954) can be dealt 
with in terms of the relative Ext and the rela- 
tive Tor (Hochschild, 1956). Many results in 
the absolute case are generalized to the rela- 
tive case: for example, the fundamental exact 
sequence (Nakayama and Hattori, 1958). The 
relative theory is further generalized to the 
cohomology theory of *permutation represen- 
tations of G (E. Snapper, 1964). 


Non-Abelian Cohomology. For a non-Abelian 
G-group A, the cohomology “set” H!(G, A) 
(and H°(G, A)) is defined as in the Abelian case 
by means of the nonhomogeneous cochains 
(^ e.g., [9]). Some efforts are being made 
toward the construction of a more general non- 
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Abelian theory (J. Giraud, Cohomologie non 
abeliénne, Springer, 1971). 


N. Finite Groups 


Let G be a finite group and A a G-module. 
Define the norm N: A— A by N(a) 2 X,.g xa, 
and denote Ker N by pA. The kernel of the 
augmentation e: Z[G] Z is denoted by I. Put 
H"(G, A) - H"(G, A) (n 0), HOGG, A) - A9/N A, 
H ^" (G, A) - .A/IA, and HG A) - H, ,(G, A) 
(n> 1). Then (Éi"(G,-)) forms a cohomological 
functor, called the Tate cohomology (E. Artin 
and J. T. Tate), that can be described as the set 
of cohomology groups concerning a certain 
complex called a complete free resolution of Z. 
(Similar arguments arc valid more generally 
for quasi-Frobenius rings (Nakayama, 1957), 
and a theory of this kind is called complete 
cohomology theory.) We have H"(G, A) 0 
(ne Z) if and only if hdimz;g) A < 1 (Naka- 
yama, 1957). If A satisfies the conditions (i) 
A'(G,, A)=0 for any Sylow p-subgroup G, 

of G, and (ii) there exists a če Ê? (G, A) such 
that Res če Ĥ?(G,, A) has the same order as G, 
and generates all of H?(G,, A), then the homo- 
morphisms A"(H, B) H"*?(H, A @ B) (ne Z) 
defined by the cup product with Res are 
isomorphisms for every subgroup H and every 
G-module B such that Tor(A, B) - 0 (Naka- 
yama, 1957; for B — Z, Tate, 1952). If G is 
cyclic, the mappings H"(A)> H"*?(A) (neZ) 
defined by the cup product with a generator 
of H?(Z) are isomorphisms. (The notation is 
abbreviated by omitting G.) If the orders of 
°(A) and H!(A) are finite, their ratio is called 
the Herbrand quotient bt AN of A. If 05 4' A 
— A" 0 is exact, then h(A) — h(A')h(A"). If A is 
finite, then h(A)= 1. By combining these two 
facts we obtain Herbrand's lemma: If A' is a G- 
submodule of A of finite index and h(A’) exists, 
then h(A) also exists and h(A)=h(A’). The 
periodicity Hnt Alz "*"(A) (neZ, Ae s.) 
holds if and only if every Sylow subgroup is 
cyclic or a tgeneralized quaternion group 
(Artin and Tate; [2]). 

Let L/K be a finite tGalois extension with 
the tGalois group G. The cohomology groups 
of various types of G-modules related to L/K 
are called the Galois cohomology groups (— 
172 Galois Theory). Using continuous cocycles, 
a cohomology theory (Tate cohomology) is 
developed for infinite Galois extensions as well 
[9, 10]. By means of Galois cohomology (— 59 
Class Field Theory), the cohomology theory of 
finite groups and of ftotally disconnected 
compact groups (which are fprofinite groups) 
plays an important role in class field theory 
and its related areas. 
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O. Cohomology Theory of Lie Algebras 


Let g be a tLie algebra over a commutative 
ring K, and assume that g is K-free. The ten- 
veloping algebra U = U (g) is a *supplemented 
algebra over K. For a g-module (= U-module) 
A, Ext? (K, A) and Tor! (K, A) are called the 
cohomology groups H"(g, A) and homology 
groups H,(a, A), respectively, of g relative to 
the coefficient module A. They are usually 
described by means of the U-free resolution 
U ® Ax(g) of K (called the standard com- 
plex of g) constructed by C. Chevalley and 
Eilenberg (1948), where A (a) is the exterior 
algebra of the K-module g and (denoting 

1G (x4 A... Ax) by (x1, ..., x,)) the differenti- 
ation is given by 


d(x e Xd = Eh epo e) 
i=l 


+ A (G-A ZS x], x.. 


Ixi«jxn 


For n>[g:K], H"(g, A) — H,(g, A)=0. If gis a 
tsemisimple Lie algebra over a field K of char- 
acteristic 0, we have H'(g, A)=0, H? (g, A)=0, 
while H?(g, K) 40. H'(g, A)=0 is equivalent to 
Weyl’s theorem, which asserts the complete 
reducibility of finite-dimensional representa- 
tions (— 248 Lie Algebras E). H?(g, A) and 
H3(g, A) are interpreted by means of Lie 
algebra extensions as in the cohomology of 
groups. The theorem on *Levi decomposition 
is derived from H?(g, A)=0. Chevalley and 
Eilenberg constructed this cohomology by 
algebraization of the cohomology of compact 
*Lie groups. They also introduced the notion 
of relative cohomology groups H to b, A) 
relative to a Lie subalgebra b of g, which cor- 
respond to the cohomology of homogeneous 
spaces. H”(g, b, A) does not always coincide 
with Extt ug (K, 4) (Hochschild, 1956), but 
does so in an important case where K is a field 
of characteristic 0 and b is treductive in g (— 
248 Lie Algebras). 

For *transformation spaces of thnear alge- 
braic groups G over a field K, the rational 
cohomology groups are introduced using the 
notion of rational injectivity (Hochschild, 
1961). In particular, if G is a *unipotent alge- 
braic group over a field K of characteristic 0, 
then H(G, A) is isomorphic to Hí(g, A), where g 
is the Lie algebra of G. There 1s also a relative 
theory. 


P. Amitsur Cohomology 


Let R be a commutative ring, and F a co- 
variant functor from the category €, of com- 
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mutative R-algebras to the category of Abelian 
groups. For $e €, and n=0, 1,2, ..., we write 
S"—$(G...& S (n-fold tensor product over 
R). Let e: $"*U -> S”? (1-0, 1,...,n +1) be 
€-morphisms defined by e(xg 69 ... Q x,)— 
X9 ... GI ©1 Ox; 9 ... & x,. Defining 
d": F(S^*)5 F(S"*?) by d  Y25( Ur Pie 
we obtain a cochain complex (F(S"*), d"}. 
This complex and its cohomology groups are 
called the Amitsur complex and the Amitsur 
cohomology groups, and are usually denoted 
by C(S/R, F) and H"(S/R, F), respectively. 

If S/R is a finite Galois extension with 
Galois group G, then the group H"(S/R, U) 
of the unit group functor U is naturally iso- 
morphic to H"(G, U(S)). If S/R is a finite pure- 
ly inseparable extension, then H"(S/R, U)=0 
for n>3. The group H?(S/R, U) is related to 
the "Brauer group B(S/R) (— 29 Associative 
Algebras K). 


Q. Relative Theory 


In the course of the development of homolog- 
ical algebra, it has been recognized that the 
notion of projective (resp. injective) resolutions 
should be generalized ([14, 4]; Hochschild, 
1956). In the meantime, a method has been 
introduced that utilizes simplicial objects in 
order to define the derived functor of an arbi- 
trary functor with Abelian category as its range 
([15, 17]; J. Beck, 1967). As a consequence of 
these developments, there emerged a view- 
point, which we describe below, making it 
possible to unify various known definitions of 
(co-)homology theories that has been designed 
for particular cases. 

Let æ be a category and 2 a class of ob- 
jects in æ. In this section, we denote the 
set Hom (4. B) of morphisms by (A, B). 
A morphism f: A B in æ is called a P- 
epimorphism if the induced mapping (P, f) 
(=Hom(P, TI: (P, A) 5 A(P, B) is surjective 
for any Pe. The class F is called a projective 
class in æ if there exist an object Pe and a 
£-epimorphism f: P A for each object Ae.. 
To any category 8, we associate a preaddi- 
tive category Zæ, adding a zero object to æ if 
necessary: Put ObZ = Ob, Z.»/(A, B) free 
Abelian group generated by the set (A, B). £ 
is regarded as a subcategory of Za by the 
natural inclusion J :.«Z — Z. 7. Any functor T 
from . into an Abelian category 2 has a 
unique additive extension T: Z +B such 
that T= TJ. If æ is an additive category, 
there is a canonical projection 0: Za >. 
such that 8J = Id. If furthermore T is additive, 
then T = TO. 

Now suppose that a projective class 7 in £ 
is given. For deg, an augmented chain com- 
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plex X.— A in ZA, 


is called a P-projective resolution of A if (1) 
X,€ 2 (nz 0), and (ii) it is Z-acyclic (i.e., the 
sequence 


atb, X ) VL AP, X, 4)... 
5S Z (P, Xo) ZAP, A)>0 


is exact for any Pe 2 [16]). Note that the 
Z-acyclicity in this case implies the existence 
of a contracting homotopy 


h: ZA(P, X,)2 ZAP, Xn+1b 
n>—1,X_,=A, 


satisfying d,,,h, + h, .,d,— 1. 

Comparison theorem: Given two 2- 
projective resolutions X. A and Y. A, we 
have that the chain complexes X, and Y. are 
chain equivalent in Za. 

Let T: —^ 4 be a (covariant) functor with Z 
an Abelian category. The nth left derived func- 
tor L,T :.7 >Z (n2 0) of T, with respect to 
P, is defined by L, T(4) - H,(TX.) for a 2- 
projective resolution X. A. The derived func- 
tors L, T will remain unaffected if we replace 
the projective class 28 by its enlargement P= 
{objects in Z together with their retracts}. 

We can easily verify that (i) Lo T(P) - T(P) 
and L, T(P) 0 (n0) for Pe, and (ii) a 
short exact sequence 05 T'— T— T" 0 of 
functors:.e/ > induces a long exact sequence 
of derived functors 
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If . is preadditive and has a zero object 
and kernels, then it is routine to give a F- 
projective resolution of any object. If a has 
finite !limits Ge, finite products and tequal- 
izers), it can be proved that there exists a 
2-projective resolution for any object [17]. 
There is a standard functorial construction 
which provides canonically a projective class 
4 in a category o and a 2-projective reso- 
lution of any object in <7. Let (G, e, ô) bea 
cotriple (or comonad [18] or functor coalge- 
bra) in £. Here G: oa is an endofunctor, 
£: God and ô: Go G? — GG are natural trans- 
formations such that Geoó —-£Goó —1g and 
Góoó—ó0Goó. A cotriple comes usually from 
a pair (F, U) of tadjoint functors U:.7 — €, 
F:€— with natural bijection 4:.A~(F(C), A) 
X@(C, U(A)). Putting G= FU: A 9, &(A)= 
A" (Io): FU(A) >A, (C) 2 Allee: C 
UF(C), we have a cotriple (G = FU, £, ô = FqU) 
in &. Conversely, it is known that any cotri- 
ple in Z is induced from a suitable pair of 
adjoint functors. 
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Given a cotriple (G, e, 6) in . 7, we define a 
projective class A, = {G(A)| AeA} (or its 
enlargement Z) in zi. and an augmented 
simplicial object G, (A4) A for any object A as 
follows. Put G,(A) 2 G"*! (A4) (nz0), 6; 0? = 
G'sG" (4): G,(A)2 G, (A) (face operator) 
and ô; = ô” = G!óG" (4): G,(A)— G, (A) (de- 
generacy operator) for O «ix n, G ,(A)— A. 
Then G,(A)-A gives rise to a Z; (or equiva- 
lently Y,)-projective resolution of A in ZW 
with differentials d, = EI 9(— 1)6;. This is 
the bar resolution (or standard resolution) 
in a generalized sense, and there are many 
(co-)homology theories defined by means of 
such constructions. 

Most of the above definitions and construc- 
tions can be dualized so as to give injective 
classes .7, $ -injective resolutions, and right 
derived functors with respect to .7, triples (or 
manads), etc. (See also [19] for generalized 
(co-)homology). 
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A. History 


Homology theory is the oldest and most ex- 
tensively developed portion of talgebraic to- 
pology. Historically, it started with measuring 
the higher-dimensional connectivity of a space 
in the sense that the 0-dimensional connectiv- 
ity is the number of tconnected components of 
the space. For example, take a t2-sphere S? 
and a t2-torus T?. Then T? is distinguished 
from S? by the fact that on T? a closed curve 
can be drawn without forming a boundary, 
while this is not true for S°. In fact, a curve (c, 
or ch in Fig. 1) can be drawn on T? so that it 
does not form a boundary of an embedded 2- 
disk. On more complicated tsurfaces there are 
many kinds of such closed curves. The maxi- 
mum number of such closed curves is the 1- 
dimensional connectivity of the surface; this is 
a topological property of the surface. For 
example, the 1-dimensional connectivity of S? 
is 0, that of T? is 2, and that of the surface in 
Fig. 2 is 6. A more general consideration of the 
bounding properties of q-dimensional ‘closed 
submanifolds of a manifold led E. Betti (Ann. 
Mat. Pure Appl., 4 (1871)) to introduce the 
notion of the g-dimensional connectivity of the 
manifold, which was a precursor of homology 
theory. 





The foundation of homology theory was 
laid by fH. Poincaré [1]. He started his study 
of homology with analytic treatment of mani- 
folds, which led to a series of complications. 
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Poincaré then introduced a new method for 
the study, now called tcombinatorial topology: 
He decomposed the manifold into elementary 
pieces or tcells, which adjoin one another in 

a regular fashion; he then substituted the 
algebraic notions of tcycles and tboundary 
operators for the geometrical notions of closed 
submanifolds and boundaries. Thereby the 
notion of homology groups acquired an exact 
logical meaning, and the fundamental for- 
mulas, which are now called Poincaré for- 
mulas, were proved. 

After Poincaré, much of the development of 
homology theory centered around the ques- 
tion of the topological invariance of homology 
groups, that is, the independence of the ho- 
mology groups on the choice of *cellular de- 
composition. Through the development of 
tsimplicial complexes and their techniques, 

J. W. Alexander (Trans. Amer. Math. Soc., 28 
(1926)) gave the first fully satisfactory proof for 
topological invariance of the homology groups 
of polyhedra. In those days, the homology 
groups themselves were barely recognized; 
instead, one dealt with numerical invariants 
such as the "Bet numbers and the *torsion 
coefficients [2, 3]. 

During the period 1925-1935 there was a 
gradual shift of interest from the numerical 
invariants to the homology groups themselves, 
and homology theory developed intensively 
[4,5]. S. Lefschetz (Trans. Amer. Math. Soc., 28 
(1926)) added the theory of the tintersection 
products to the homology of manifolds. He 
also invented trelative homology theory (Proc. 
Nat. Acad. Sci. US, 13 (1927)) and generalized 
the duality theorems of Poincaré and Alex- 
ander (ibid., 15 (1929)). G. de Rham (J. Math. 
Pures Appl., 10 (1931)) obtained a duality 
theorem that relates the texterior differential 
forms in a manifold to the homology groups of 
the manifold. L. S. Pontryagin (Ann. Math., 

(2) 35 (1934)) proved the complete group- 
invariant form of the Alexander duality theo- 
rem. These duality theorems seemed to reflect 
the existence of a theory dual to homology 
theory, and the genesis of this dual theory, 
now called ftcohomology, occurred in 1935, in 
the work of Alexander and A. N. Kolmo- 
gorov. It was discovered subsequently by 
Alexander (Ann. Math., (2) 37 (1936)), E. Cech 
(ibid.), and H. Whitney (ibid., 39 (1938)) that 
the cohomology of a polyhedron can be made 
into a ring. 

On the other hand, after L. Vietoris (Math. 
Ann., 97 (1927)) and P. S. Aleksandrov (Ann. 
Math., (2) 30 (1928)), many devices were inven- 
ted to extend the homology theory of poly- 
hedra to general *topological spaces, and 
numerous variants of homology theory ap- 
peared at the hands of Cech (1932), Lefschetz 
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(1933), Alexander (1935), and Kolmogorov 
(1936). This development served to clarify the 
relations between combinatorial and set- 
theoretic methods in topology (— 426 Topol- 
ogy), whereas it produced complexity and 
confusion in homology theory [6]. S. Eilen- 
berg and N. E. Steenrod (Proc. Nat. Acad. Sci. 
US, 31 (1945) and [7]) cleared the air by treat- 
ing homology axiomatically. 

Roughly speaking, a homology theory as- 
signs *Abelian groups to ttopological spaces 
and thomomorphisms to fcontinuous map- 
pings of one space to another. In this way, a 
homology theory is an algebraic image of 
topology; it converts topological problems to 
algebraic problems. Starting from this view- 
point, Eilenberg and Steenrod selected some 
fundamental properties as axioms to charac- 
terize homology theory. This unified homol- 
ogy theories and allowed systematic treatment 
of homological problems which had previ- 
ously been done separately “by hand" in each 
case. Moreover, it motivated the birth of a 
new branch of mathematics, called thomo- 
logical algebra. 


B. Homology of Chain Complexes 


A chain complex C = {C,,6,} is a collection of 
(additive) Abelian groups C,, one for each 
integer q, and of homomorphisms 6,:C,>C,_, 
such that 0,00,,,=0 for each q. Elements of 
C, are called q-chains of C, and ô, is called the 
boundary operator. A subcomplex C’ = (C7,0;j 
of C is a chain complex such that C; c C, and 
0, = 0,| C; for each q. 

The *kernel of ô, is denoted by Z,(C), and its 
element is called a q-cycle of C. The timage of 
0,4, is denoted by B,(C), and its element is 
called a q-boundary of C. The relation 0,00,,, 
=0 implies B,(C) c Z,(C). The *quotient group 
Z,(C)/B,(C) is denoted by H,(C), the qth ho- 
mology group of C. Elements of H,(C) are 
called q-dimensional homology classes of C. 
Two cycles representing the same homology 
class are said to be homologous. The direct 
sum X, H,(C) is denoted by H,(C) and is 
called the homology group of C. 

If C={C,,0,} and C’={Cj, 0;} are chain 
complexes, a chain mapping (chain map) 9: C 
>C is a sequence of homomorphisms o, C, 
C, such that 0,0 9, — @,-; O 6, for each q. If 
Q:CC' is a chain mapping, then g, sends 
Z,(C) to Z,(C’) and B,(C) to B,(C'), and hence 
o induces a homomorphism of H,(C) to 
H,(C’). 

If H,(C) is ffinitely generated, it can be de- 
composed into the direct sum of a free Abelian 
group B,(C) and a finite Abelian group 7,(C). 
B,(C) and T,(C) are called the qth Betti group 
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of C and the qth torsion group of C, respec- 
tively. The trank p, of B,(C) is called the qth 
Betti number of C. T,(C) is isomorphic to the 
direct sum of t(q) finite cyclic groups of orders 
01,02, ..., 03). where 017 1 and 0? divides 02, 
for i2 1,...,1(q) - 1 (2 2 Abelian Groups). The 
numbers 07,04, ...,02,, are called the qth tor- 
sion coefficients of C. 

If H,(C) is finitely generated, then the num- 
ber y(C) 2 X,( — 1)!p, is called the Euler num- 
ber, the Euler characteristic, or the Euler- 
Poincaré characteristic of C. In this case a 
*polynomial 2, p,t* with variable t is called 
the Poincaré polynomial of C. 

Let C be a chain complex such that, for each 
q, C, is a tfree Abelian group of finite trank. 
Then the gth Betti number and the qth torsion 
coefficients are well defined for each q. Denote 
the ranks of C, and B,(C) by «, and f,, respec- 
tively. Then it holds that p, «,— B, — p, ,, 
and hence x(C)= £,( —1)'«,. (Euler-Poincaré 
formula). Moreover, there exists a set of bases, 
one for each C,, with the following properties: 
For each q, the base for C, is composed of 
five types of elements, a? (1& i < B, — t), b? 
(1<i<t,), c? (A <i<p,), d? (1 <i<t,_,), and e? 
(1 &ix f, , — Eë satisfies 0a? — 0, Ob? — 0, 
Oc£ —0, 0d? UN ! b^! , and 0e? — a? ! . Such a 
set of bases is called the canonical basis of C. 

Let C be a chain complex such that each C, 
is a free Abelian group with a given base (of). 
Then the incidence number [67:67 ']cZ is 
defined by 0,(o?) - 2;[o?:67 leg" This 
notion was commonly used in the early days 
of topology. 


C. Homology of Simplicial Complexes 


Let K be a tsimplicial complex. An oriented q- 
simplex c of K is a q-simplex se K together 
with an equivalence class of ttotal orderings of 
the vertices of s, two orderings being equiva- 
lent if they differ by an even permutation of 
the vertices. If ag,...,a, are the vertices of s, 
then [ay,4,,...,@,] denotes the oriented q- 
simplex of K consisting of the simplex s to- 
gether with the equivalence class of the order- 
ing ao «a, <... «a, of its vertices. For every 
vertex a of K there is a unique oriented 0- 
simplex [a], and to every q-simplex with qz 

1 there correspond exactly two oriented q- 
simplexes, which are said to be opposites of 
one another. 

Let C,(K) denote the Abelian group 
generated by the oriented g-simplexes of K 
with the relations o +0’ =0 if o’ is the oppo- 
site of o If we choose an oriented q-simplex 
of for each q-simplex s? of K, then each 
element of C,(K) is written uniquely as a 
finite sum È; go with integers g; #0, and 
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C,(K) is a free Abelian group generated by 
the set {0f}. We define a homomorphism 

d CK) C, (K) by 6,[4@0,44,-.-,4g1= 

3X. o( —ly'[ao, ....à;-1, Bas 4g]. Then 
6,064, =0 holds, and we have a chain com- 
plex C(K)={C,(K), 6,}(C,(K) = {0} if q <0), 
called the (oriented) simplicial chain complex. 
The homology group H,(C(K)) is denoted by 
H, (K) and is called the (integral) homology 
group of the simplicial complex K. 

Let K, and K, be simplicial complexes, and 
let f:K, >K, be a tsimplicial mapping. Then 
for each q, a homomorphism fy,:C,(K,)> 
C,(K2) can be defined by f, ,([ao,a,, ....a,]) — 
[ f (ao). f (a4), .... f(a;) ], where the right-hand 
side is understood to be 0 if f(a), f(a), .... 
f(a,) are not distinct. The sequence fy = { feel 
is a chain mapping of C(K,) to C(K,), and it 
induces a homomorphism f,: H,(K,)— H,(K)). 

If K is a ‘finite simplicial complex, then the 
Betti numbers, the torsion coefficients, and the 
Euler characteristic of K are defined to be 
those for the chain complex C(K). 

Let K, and K, be a subcomplexes of a sim- 
plicial complex K. Then we have the follow- 
ing fexact sequence which relates the homol- 
ogy group of K, UK, to the homology groups 
of K,, Kj, and K,K):...3H,(K,NK2)> 
HK) H,(K)>H,(K, UK2) H, (K, N 
KA, where a, fj, and 0, are defined as 
follows. Let i: K; NK >K; and j: Kio K(U 
K, (/=1, 2) be inclusion mappings; then a(a) — 
(i4. (8), — i2.(a)) and B(a,,a5) 9 J1.(a1) -J2«(a3). 
If z=c, +c, (c;E C(K))) ts a cycle of C(K, UK,) 
then 6(c,) — —0(c5) is a cycle of C(K, N K5); 6, 
sends the homology class of z to the homol- 
ogy class of 0(c,). The sequence is called the 
Mayer-Vietoris exact sequence of the couple 
{K,,K,}, and ô, is referred to as the connect- 
ing homomorphism. The prototype of the 
Mayer-Vietoris exact sequence was obtained 
by W. Mayer (Monatsh. Math. Phys., 36 (1929)) 
and L. Vietoris (ibid., 37 (1930)). The present 
form is due to Eilenberg and Steenrod [7]. 


D. Homology of Polyhedra 


If K is a tsimplicial complex and K’ is a tsub- 
division of K, then there exists a canonical 
isomorphism H,(K)z H,(K"). This proves that 
if K, and K, are tsimplicial decompositions of 
a polyhedron then H,(K,) and H,(K;) are 
isomorphic, because there exists a common 
subdivision of K, and K,. Thus we may de- 
fine the (integral) homology groups H. (X) of 
a polyhedron X to be the homology group 
H,(K) of a simplicial decomposition K of X. 
Let X and Y be polyhedra, and let f: X Y 
be a continuous mapping. Take a *simplicial 
approximation o Kat of f. Then a homo- 
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morphism of H,(X) to H,(Y) given by the 
induced homomorphism o, : H,(K) H,(L) is 
independent of the choice of o, and is denoted 
by f,.. The following properties hold: (1) 1,, 
— |: H,(X)— H,(X), where 1 is the identity; (11) 
If f: X 2 Y and g: Y OZ are continuous map- 
pings, then (go f), =9, 0 fa: H,(X)5 H,,(Z); 
(iii) If f, f': X > Y are homotopic, then f, = 
Ja: H(X)> H,(Y). These imply the homo- 
topy invariance of the homology group stated as 
follows: If X and Y are polyhedra which are 
thomotopy equivalent, then H,(X) and H,(Y) 
are isomorphic. Specifically, the homology 
group is a topological invariant. Thus if X is 
a ttriangulable space (for example, if X is a 
*differentiable manifold), then its homology 
group H,(X) can be defined to be the homol- 
ogy group H,(K), where (K, t) is a ftriangu- 
lation of X. This homology group is referred 
to as the simplicial homology group of X. 
Similarly, if X is a compact triangulable space, 
the Betti numbers p,(X) of X, etc. can be de- 
fined to be those for the chain complex C(K). 

If we denote by pt a single point, then H,(pt) 
=Z (the group of integers) and H,(pt)=0 if 
q#O0. For an n-sphere S”, a 2-torus T?, and a 
*real projective plane P?, the homology groups 
can be computed as follows by means of their 
triangulations: (1) H)(S")= H,(S") = Z, and 
H,(S")=0 if q 40, n; (2) (T?) = HT?) =Z, 
H,(T’?)=Z+Z, and H,(T’)=0 if q 40, 1, 2; 
(3) H(P?) = Z, H,(P?)=Z,, and H,(P?)=0 if 
q#0, 1, where Z, = Z/2Z. 

Two surfaces are homeomorphic if and only 
if their integral homology groups are isomor- 
phic (— 410 Surfaces). 


E. Singular Homology 


There are various devices for defining ho- 
mology groups of general topological spaces. 
A familiar one is the singular homology theory 
initiated by S. Lefschetz (Bull. Amer. Math. 
Soc., 39 (1933)) and improved by S. Eilenberg 
(Ann. Math., (2) 45 (1944)). 

The standard q-simplex is the convex set 
Atc R**! consisting of all (q + 1)-tuples (to, t,, 
...,¢,) of real numbers with t; 2 0, t5 +t, + 
... £7 1. Any continuous mapping of A* to 
a topological space X is called a singular q- 
simplex in X. The ith face of a singular q- 
simplex o: A*— X is the singular (q — 1)-simplex 
oog: AT ! oO X, where the linear embedding 
6:417! OA is defined by g; (to, ... 

isla) mens oe Fog Di titis fy). 

For each integer q, let S,(X) denote the free 
Abelian group generated by the singular q- 
simplexes in X (S,(X)=0 if q «0), and define a 
homomorphism 0,: $,(X) S, ,(X) by 0,(0) = 


fii, ti+1> 
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? o(—1Yo06s;. Then we have a chain com- 
plex S(X)={S,(X), 6,}, called the singular chain 
complex of X. The homology group H,(S(X)) 
is denoted by H. (X) and is called the integral 
singular homology group of the topological 
space X. 

Given a continuous mapping f: X Y,a 
chain mapping f, :S(X)-S(Y) is defined by 
sending each singular simplex o: A* X to the 
singular simplex f oo: A7 Y, and it induces 
the homomorphism f,: H,(X) H,(Y). The 
properties (i), (ii), (iii) of f, in Section D hold 
for continuous mappings of topological spaces, 
and hence the singular homology group is a 
homotopy invariant. 

The homology group H,(K) of a simplicial 
complex K is isomorphic to the singular ho- 
mology group H,(|K|) of the polyhedron |K]. 
Therefore the simplicial homology group of a 
triangulable space is isomorphic to the sin- 
gular homology group of the space. 

If | Xj) is the set of 'arcwise connected com- 
ponents of a topological space X, then H,(X) 
zc Y, HX). If X is arcwise connected, then 
H(X) =Z. If {A,} is the collection of all the 
compact subsets of X directed by inclusion, 
then H,(X) is isomorphic to the *inductive 
limit lim H,,(A,). It is not true that there is a 
Mayer-Vietoris sequence in singular homology 
for any couple | X,, X,} of subsets of X. How- 
ever, for certain couples ( X,, X,}, there is a 
Mayer-Vietoris exact sequence of {X,, X3}: 
HQ 0X3) 5 HG) H(X5) H(X, U 
X3) H, 4,(X,0 X5)5.... For example, this 
holds if X 2 Int X, U Int X,, where Int denotes 
the *interior. 

Let c: X 5 pt be the mapping of a topolog- 
ical space to a single point. Then the kernel 
of c, : H,(X)9 H,(pt) is denoted by Hi, (X) 
and is called the reduced homology group of 
X. It holds that H,(X)= H,(X)-- H, (pt). 
Regard the tsuspension SX as the union of 
two copies of the tcone over X. Then the 
connecting homomorphism in the Meyer- 
Vietoris sequence gives an isomorphism 
H (SX) = H,_,(X) for any q. The inverse of 
this isomorphism is called the suspension iso- 
morphism for homology. 

Let M be a *C?-manifold. A C*-singular q- 
simplex in M is a singular q-simplex o: A7 AM 
such that c extends to a C?-mapping from an 
open neighborhood of A" in 1(to, t, ...,t,)€ 
R+! |to t, 4... t,—1] to M. The totality 
of C” -singular simplexes in M generates a 
subcomplex of S(M), denoted by S"(M). The 
inclusion $7(M) c S(M) induces an isomor- 
phism H,(S°(M))= H,(M). 

Let M be an n-dimensional ftopological 
manifold. Then H,(M)=0 unless 0<q <n, and 
H,,(M) is finitely generated if M is compact. 
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F. Homology of CW Complexes 


Homology theory is tractable in the category 
of *CW complexes by virtue of the facts stated 
below. 

Let X be a topological space and A its sub- 
set. Then we denote by X/A the quotient space 
obtained from X by shrinking A to a point, 
understanding X/@ to be the disjoint union 
X Upt, If X,, X, are tsubcomplexes of a CW 
complex, then the Mayer-Vietoris exact se- 
quence of {X,, X,} and the excision isomor- 
phism i,: H, (X, AX, 0 X) & H,(X;U X,)/X;) 
(i: inclusion) are valid. If A is a subcomplex of 
a CW complex X, then we have the following 
reduced homology exact sequence of (X, A):... 
5 B (AS B OO) BOCA Bt, (5... 
where i, and j, are induced by the inclusion 
i: A- X and the collapsing j: X > X/A, and 
0, is given by a commutative diagram 


BA) 4 
~ th ~|s 
A (X UCA)S HA). 


Here CA is the cone over A, S is the sus- 
pension isomorphism, and h: X UCA>(X U 
CA) CA = X/A and À': XU CAS(XUCA)X— 
SA are collapsings. More generally, if A, B 
are subcomplexes of a CW complex X and 

A >B, then we have the following reduced 
homology exact sequence of (X, A, B): ... 3 
TL (A/B) S B, (CB) Bi (X A) S B, (AJB)S 
.... Furthermore, the homology group H,(X) 
of a CW complex X can be computed in the 
following manner. 

Let X* denote the q-skeleton of X, i.e., 
the union of all cells of dimensions <q. Put 
C,(X) = A (X8/X*7!), and let 0,: C (X) 

C, ., (X) be the connecting homomorphism 
0,: H,(X*/X* 3) H, (X1 !/X*?) in the re- 
duced homology exact sequence of (X4, XT !, 
X* ?). Then C(X)={C,(X), 6,} is a chain com- 
plex such that C,(X) is a free Abelian group 
with one generator for each q-cell of X. If X is 
a polyhedron | K|, then C(X) coincides with 
the simplicial chain complex C(K). The ho- 
mology group H. (C(K)) is called the cellular 
homology group of the CW complex X. This is 
isomorphic to the singular homology group 
H,(X). 

Since a CW decomposition frequently re- 
quires fewer cells than a simplicial decompo- 
sition, the cellular homology groups are use- 
ful in calculating the homology groups. For 
example, the tcomplex n-dimensional projec- 
tive space CP" has a CW decomposition with 
a single 2i-cell for each i=0, 1,...,n, and hence 
we see immediately that H,(CP") = Z if q— 2i 
(O<i<n) and =0 otherwise. 

If X is a finite CW complex and a, denotes 
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the number of q-cells of X, then we have the 
Euler-Poincaré formula y( X) = $,( —1)*a,. In 
particular, if X is homeomorphic to S? then we 
have the Euler theorem on polyhedra: x, — o, + 
0&5 —2. This was the first important result in 
topology (L. Euler, 1752). 


G. Homology with Coefficients in Abelian 
Groups 


Given a chain complex C and an Abelian 
group G, we have a new chain complex C ® G 
given by (C & G), 2 and ô (c & g)= 

ĉc Q g (ce C,,g € G), where @ is the tensor 
product of Abelian groups. For a topological 
space X, the homology group of the chain 
complex S(X) & G is denoted by H,(X; G) and 
is called the singular homology group of X 
with coefficients in G. The homology group 
H,(K; G) of a simplicial complex K with coeffi- 
cients in G and the cellular homology group 
H,(C(X); G) of a CW complex X with coeffi- 
cients in G are similarly defined. The ho- 
mology group with coefficients in Z is the 
integral homology group. 

The previous results for the integral ho- 
mology groups generalize in a straightforward 
fashion to homology groups with coefficients 
in G. 

We have the homomorphism x: H,(X) & 
G—H,(X;G)sending a ® ge H(X) Q G to the 
homology class of z ® ge Z,(S(X) & G), where 
z is a representative cycle of a. The follow- 
ing theorem is known as the universal coeffi- 
cient theorem for homology, since it expresses 
H(X; G) in terms of H,(X), H. ,(X), and G: 
There is an exact sequence 05 H,(X) @ GS 
Hjí(X;G)— Tor(H, , (X), G) 50, and this se- 
quence is split (— 200 Homological Algebra). 
Universal coefficient theorems of this type 
were first shown by S. Eilenberg and S. Mac- 
Lane (Ann. Math., (2) 43 (1942)). 

Let A be a fring with a unit 1. Then a chain 
complex over A is a chain complex C such that 
each C, is a *A-module and each 0, is a tA- 
homomorphism. The homology groups H,(C) 
of a chain complex C over A are A-modules. If 
C is a chain complex, C & A forms naturally a 
chain complex over A. In particular, if X isa 
topological space, then S(X) & A is a chain 
complex over A, and H,(X; A) are A-modules. 
In this case, the induced homomorphisms 
Ja: HX; A) H,(Y; A) are A-homomorphisms. 
The homology groups with coefficients in a 
field k are tvector spaces over k and are useful 
in applications. If H,(X) is finitely generated, 
then x(X)- X,( 1) dim, H,(X; k) holds for 
any field k. 
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H. Cohomology 


A cochain complex C = (C*,04] is a collection 
of Abelian groups C”, one for each integer q, 
and of homomorphisms ó*: C^ C**! such 
that 0**! 0 64=0. Elements of C? are called q- 
cochains, and ó? is called the coboundary 
operator. The notions of subcomplex of a co- 
chain complex, cocycle, coboundary, coho- 
mology group, and cochain mapping are de- 
fined as in chain complex. Let Hom(A, B) 
denote the tgroup of homomorphisms from an 
Abelian group A to an Abelian group B. Given 
a chain complex C and an Abelian group G, a 
cochain complex C* 2 Hom(C, G) is defined 
by C* - Hom(C,, G) and (ó*u)(c) 2 u(0,,, c) 

(ue C*, eet, al 

For a topological space X, the cochain 
complex Hom(S(X), G) is called the singular 
cochain complex of X with coefficients in G, 
and elements of Hom(S,(X), G) are called 
singular g-cochains of X. The cohomology 
group of Hom(S(X), G) is denoted by H*(X;G) 
and is called the singular cohomology group of 
X with coefficients in G. We write H*(X) for 
H*(X; Z); this is called the integral cohomol- 
ogy group of X. Similarly, the cohomology 
group H*(K; G) of a simplicial complex K 
with coefficients in G and the cellular coho- 
mology group H*(C(X); G) of a CW complex 
X with coefficients in G are defined. There are 
isomorphisms H*(K; G) z H*(| K|; G) and 
H*(C(Xy; G)& H*(X; G). 

If f: X > Y is a continuous mapping, then a 
cochain map f* :Hom(S(Y), G) > Hom(S(X), 
G) is defined by (f *u)(c) 2 u( f 4c) with ue 
Hom(S,(Y), G) and ce S, (X). Therefore f in- 
duces the homomorphism f *: H*(Y;G)— 
H*(X;G). The following properties hold: (i) 
1*=1; (i) (gof)* — f*og*; (iii) If f and f’ are 
homotopic, then f * = f'*. In particular, the 
singular cohomology groups are homotopy 
invariants. The *cokernel of c*: H*(pt; G) 
H*(X; G) induced by the mapping c: X 5 pt 
is denoted by H*(X;G) and is called the re- 
duced cohomology group of X. 

For $e H*(X;G) and ae H,(X), the Kro- 
necker index (£, a5 €G is defined naturally in 
terms of representatives of č and a. We have 
the following universal coefficient theorem for 
cohomology: There is an exact sequence 05 
Ext(H, ,(X), G)— H*(X; G) 5 Hom(H,(X), G)> 
0, and this sequence is split, where x is given 
by the Kronecker index (— 200 Homological 
Algebra). 

If A is a ring with 1, then a cochain complex 
over A is defined analogously to a chain com- 
plex over A. The singular cochain complex 
Hom(S(X), A) forms naturally a cochain com- 
plex over A, and H*(X; A) are A-modules. For 
Ce H*(X; ^) and ae H,(X; A), the Kronecker 
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index (C, a» €A is defined naturally. If k is 
a field, then the vector spaces H4(X;k) are 
identified with the dual space of H,(X,k) by 
means of the Kronecker index. 

If M is a C*?-manifold, then there is a co- 
chain complex D(M)={D%(M), d} over the 
field R of real numbers, where D*(M) is the 
real vector space consisting of the tdifferen- 
tial forms of degree q on M, and d:D4(M)—> 
$**!(M)is the texterior differentiation. The 
cochain complex D(M) is called the de Rham 
complex of M, and its cohomology group 
H*(D(M)) is called the de Rham cohomology 
group of M. A cochain mapping .4:D(M)—> 
Hom(S^(M),R) is defined by 


(7 ())(e) -| oo 
Af 


where we D4(M), c: A* M is a C? singular q- 
simplex in M, and o*q@ denotes the tpullback 
of o by o We have isomorphisms 


H*(D(M))2 H*(Hom(S"(M), BIN H*(M; R), 


where i* is induced by the inclusion S*(M)c 
S(M). This result is called the de Rham theo- 
rem on the cohomology of manifolds (— 105 
Differentiable Manifolds). 


I. Cohomology Rings 


Given a topological space X and a ring A, the 
cup product u ~ ve Hom(S, , (X), A) of co- 
chains ue Hom(S,(X), A) and ve Hom(S,(X), A) 
is defined by (u ——v)(c) 2 u(o o c)v(c oe), where 
a: A’*4-+X is a singular (p 4- q)-simplex, £: AP 
A’*4 and e: Ad A?** are given by c(t, t,, 

ens fp) m (fo, fie 15,0, ...,0) and e(t,, tpi, 
stg 4) (0, ...,0, tps tp+1> e £544). The prod- 
uct operation is bilinear, and the formula 
(uw v)=du— v+(—1)’u — óv holds. There- 
fore it gives rise to the cup product €~ ye 
H***(X; A) of cohomology classes Ze H?(X; A) 
and ne H*(X; A). This cup product operation 
makes H*(X; A) into a ring, which is called 
the singular cohomology ring of X with coeffi- 
cients in A. If A has 1, the cohomology class 
represented by the 0-cocycle taking the value 
1 on each singular 0-simplex serves as 1 of 
H*(X; A). If A is commutative, then £j = 
(—1)" n6 holds. The induced homomorphism 
f *: H*(Y; A) H*(X; A) preserves the prod- 
uct, and hence the cohomology ring is a ho- 
motopy invariant. 

If K is a simplicial complex, the cup prod- 
uct operation — : H'(K; A) & HYK; A) 
H?**(K; A) is induced from the opera- 
tion ~ : Hom(C,(K), A) & Hom(C,(K), A)> 
Hom(C,,,(K), A) defined as follows. Adopt- 
ing a "linear ordering of vertices of K, we write 
all oriented simplexes in this ordering. Then, 
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for ue Hom(C,(K), A) and ve Hom(C,(K), A), 
we define u ~ ve Hom(C,,,,(K), A) by (ur 

v) Ld. 4. -... deg] — (Lo, 41, -.. 45) (Lap. 
den, 0p«4]). The canonical isomorphism 
from the cohomology of K to the singular 
cohomology of | K| preserves the cup product. 

On the de Rham complex $S(M) of a C”- 
manifold M, we have the texterior product 
Q^0€3**(M) of we $(M) and 6e DYM). 
This makes H*(D(M)) into a ring, which is 
called the de Rham cohomology ring. The 
canonical isomorphism of H*(D(M)) to 
H*(M; R) preserves the product (— 105 Dif- 
ferentiable Manifolds). 

Examples. (1) Let T"=S! x ... x S! denote 
the n-dimensional torus, and let z;: T" S! 
denote the projection to the ith factor (1 < 
ixn). Take a generator £ of the A-module 
H' (S!; A), and put č; — z*(£)e H'(T"; A). Then 
H*(T"; A) is the exterior algebra over A gen- 
erated by £,,..., 6,. (2) If we denote by CP" 
the complex n-dimensional projective space, 
then H*(CP"; A) is A ifq—2i (O0 &ixn) and 0 
otherwise. If € is a generator of the A-module 
H?(CP"; A), then Zi generates the A-module 
H?'(CP"; A) (oO in). Thus H*(CP"; A) is the 
quotient ring A[£]/(£"*!) of the *polynomial 
ring A[£] by the tideal (£"*!). (3) If P" denote 
the real n-dimensional projective space, then 
H*(P";Z)zZ,[£]/(£"*!), where £ is the gen- 
erator of H!(P";Z,). 


J. Homology of Product Spaces 


If C and D are chain complexes, their tensor 
product C & D is a chain complex given by 
(C @ D),=Lpig-nC, Q D,, and 0,(c @ d)= 
6,(c)@d+(—1)’c Q 0,(d) (ce C,, d e D,). The 
following Eilenberg-Zilber theorem (Amer. J. 
Math., 75 (1953)) is the link between the alge- 
bra of tensor products and the geometry of 
product spaces: For the product space X x Y 
of topological spaces X and Y, there is an 
isomorphism p,: H,(X x Y; G)= H,(S(X)G 
S(Y)G9 G) induced from a chain mapping p: 
S(X x Y)>S(X) & S(Y) defined as follows: 
Given a singular n-simplex o: A"— X x Y, 
we define for each p (0x p <n) a singular p- 
simplex o; in X to be the composite A? A"-5 
Xx YS X, where £(to, t1, ..., fp) = (to, t4, ..., 
tp 0, ..., 0) and m, is the projection to the first 
factor. Similarly, we define for each q (0 « 
q <n) a singular q-simplex o; in Y to be the 
composite A* E A"-5 X x Y5 Y, where Esas 
else, ...,0, t, 4, ..., t.) and 7, is the pro- 
jection to the second factor. Then p is defined 
by p(c) 2 2:,.,-,0, Q 0; and is called the 
Alexander-Whitney mapping (Alexander- 
Whitney map). 

Given a ring A, a chain mapping p:(S(X) & 
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A) & (S(Y) & A) S(X) @ S(Y) @ A is de- 
fined by u((c & A) 6 (d 6 4) 2c d & 27"; it 
induces homomorphisms gu, ` H,(X; A) @ 
H,(Y; A) H,,,(S(X) @ S(Y) @ A). The cross 
product a x be H,,,(X x Y; A) of ae H,(X; A) 
and beH,(Y; A) is defined to be o, (ui, (a x b)). 
If A is a commutative ring with 1, then the 
cross product defines a A-homomorphism x : 
H,(X; A) G4 HY; A)> H, (X x Y; A), and 
satisfies (a x b) x c=a x (b xc), T (ax b)= ` 
(—1)™b x a, (f x g), (a x b)= f(a) x g, (b), 
where T: X x Yo Y x X is the mapping inter- 
changing factors, and f: X —^ X', g: Y— Y' are 
continuous mappings. If A is a *principal ideal 
domain, there is an exact sequence 

0> Y H,Q A) G4 HY: A) H,(X x Y; A) 


ptq-n 


> 3 Tor,(H,(X;A), HX: A))0, 
ptq=n-1 
and this sequence is split (— 200 Homolog- 
ical Algebra). In particular, if k is a field we 
have the following isomorphism of vector 
spaces: 
x: Y H,(X;k)  H(Y;k)& H,(X x Y;k). 
ptq-n k 
This is called the Künneth theorem, since the 
prototype was proved by D. Künneth (Math. 
Ann., 90 (1923); ibid., 91 (1924)). The present 
form was given by H. Cartan and S. Eilenberg 
[8]. 

The tensor product C © D of cochain com- 
plexes C and D is defined analogously to that 
of chain complexes. Given topological spaces 
X, Y and a ring A, a cochain mapping y: 
Hom(S(X), A) & Hom(S(Y), A)2Hom(S(X) 

@ S(Y), A) is defined by (uu © v))(c & d) 

— u(c)v(d), where ue Hom(S,(X), A), ve 
Hom(S,(Y), A), ce S(X), de S (Y), and u(c)v(d) 
is understood to be 0 if (p, q) Z (s, t). We then 
have the composite H?(X; A) & H*(Y; A)S 
H?**(Hom(S(X) Q S(Y), A) H?*4(X x Y; A), 
where p is the Alexander-Whitney mapping. 
For £e H"(X; A) and ne H*(Y; A), the cross 
product € x n€ H?'*(X x Y; A) is defined to be 
p* u, (€ @ n). The cohomology cross product 
satisfies the properties analogous to the homol- 
ogy cross product. 

The cup product and the cohomology cross 
product are given in terms of each other: ë~ 4 
=d*(¢ x n), č x n =n] (č) ~ nin) where d: X > 
X x X is given by d(x) 2(x, x), and z,: X x Yo 
X,n,:X x Yo Y are projections. 

We have the following Künneth theorem for 
cohomology: If A is a principal ideal domain 
and each H,(X; A) is finitely generated over A, 
then there is an exact sequence 
0> Y Hr: A) G, HY; A)- H'(X x Y; A) 


ptq=n 


> A Tor,(H?(X; A), H (Y; A))50, 


ptqą=n+i 
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and this sequence is split (— 200 Homological 
Algebra). For če H?(X; A, qe HY; A), Ze 
H*(X; A), n'€ H(Y; A), the formula (£ x 4) — 
G'xm)-(-1* (5 ~ Ze (n~ 17’) holds. There- 
fore, if k is a field and dim, H,(X; k) « oo for 
each q, then the cohomology ring H*(X x Y; k) 
is determined by the cohomology rings 
H*(X;k)and H*(Y;k). 

tFiber bundles can be considered as gen- 
eralized product spaces. Let E be the total 
space of a fiber bundle with base B and fiber F. 
The following Leray-Hirsch theorem (J. Math. 
Pures Appl., 29 (1950)) asserts that, under 
certain conditions, the cohomology of E is 
additively isomorphic to that of B x F: Let A 
be a principal ideal domain, and assume that 
H,(F; ^) is free and finitely generated over A. 
Furthermore, assume that there is a homo- 
morphism 0: H*(F; A) H*(E; A) such that the 
composite H*(F; A)-5 H*(E; AUS H*(p ! (by; A) 
is an isomorphism for each be B, where p: E> 
B is the projection and ip !(b)c E. Then 
an isomorphism 0: H*(B; A) ®, H*(F; Alz 
H*(E; A) is given by ®(¢ @ n) 2 p*£ ~ O(n), 
where £e H*(B; A), qe H*(F; A). 

A general connection between (co)homol- 
ogy of E and B x F is given by means of spec- 
tral sequences (— 148 Fiber Spaces). 


K. Cap and Slant Products 


There are other products closely related to the 
cup product or the cross product that involve 
cohomology and homology together. 

Given a topological space X and a ring 
A, the cap product v — ce S, (X) &) A of a co- 
chain ve Hom(S,(X), A) and a chain c= 2o; © 
4;€ $,, (X) & A is defined by v —c—2,9;0 
£Q v(o; o £)4;, where o; are singular (p + q)- 
simplexes in X, 4;€ A, and £: APS A?*4, e: Af 
A?** are the mappings used in the definition 
of cup product. For any ue Hom(S,(X), A), 
the formula <u 2v, c» = (u, v — c» holds. The 
cap product satisfies 6(v ~ c) 2(—1)^óv — c + 
va ĉc, and hence it induces the cap product 
$7. ae H,(X; ^) of a cohomology class Ze 
H'(X; A) and a homology class oc H,. (X; A). 
If A is a commutative ring with 1, then the 
cap product operation is bilinear and satisfies 
the following properties: (££) ~a=é — 
(Cn a), ff *0 a) fala) ~ a=a, 
(¢ xn) ^ (a x b)=(—1)P" TU ~ a) x (n 5), 
where Ze H?(X; A), € H” (X; A), ne H4(Y; A), 
ae H(X; A), be H(Y; A), and f: XY isa 
continuous mapping. 

Given topological spaces X, Y and a ring 
A, the slant product w/d € Hom(S,(X), A) of 
a cochain we Hom(S(X) @ S(Y)),,,, A) and 
a chain d $1; ® 4E SY) & A is defined 
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by (w/d)(c) 2 X;(w(c & 1) 4;, where o is a 
singular p-simplex in X, t; are singular q- 
simplexes in Y, and eeh, The slant operation 
satisfies ó(w/d) — (Ów)/d — ( —1)?w/0d. There- 
fore, under the identification H*(X x Y; A)— 
H*(Hom(S(X) & S(Y), A)), it induces the 

slant product ¢/be H?(X; A) of a cohomology 
class (e H?'*(X x Y; A) and a homology class 
be H (Y; A). For any ae H,(X; A), it holds that 
<¢/b, a> = (5, ax b). 

Let G, G' and G" be Abelian groups. Given a 
homomorphism G' &9 G" — G, we write g'g” for 
the image of g' © g" € G' © G” in G. Then the 
cap product ~ : H(X; G) & H,,,(X;G")> 
H,(X; G) can be defined in the same way as 
before. Similar definitions are valid for the cup 
product, the cross products for homology and 
cohomology, and the slant product. 


L. Relative Homology 


If C' is a subcomplex of a chain complex C, 
then we have a chain complex C/C' = (C,/C;, 
6,}, where C,/C, denotes the quotient group 
and 6, is induced from 0, by passing to the 
quotient. C/C' is called the quotient complex of 
C by C, 

A topological pair (X, A) is composed of a 
topological space X and its subset A. Given a 
topological pair (X, 4) and an Abelian group 
G, we have the chain complex (S(X)/S(A)) & G 
and the cochain complex Hom(S(X)/S(A), G). 
The homology group of (S(X)/S(A)) @ G is 
denoted by H,(X, A; G) and is called the rela- 
tive singular homology group of X modulo A 
with coefficients in G or the singular homology 
group of ( X, A) with coefficients in G. The 
homology group H,(X; G) - H,(X, Ø; G) is 
sometimes called the absolute homology group. 
Similar definitions are made for the coho- 
mology group H*(X, A; G) of the cochain 
complex Hom(S(X)/S(A), G). 

A simplicial pair (K, L) is composed of a 
simplicial complex K and its subcomplex 
L, and a CW pair (X, A) is composed of a 
CW complex X and its subcomplex A. The 
relative homology group H,(K, L; G) — 
H,((C(K)/C(L)) & G) of a simplicial pair 
(K, L) is isomorphic to H,(|K|,|L|; G). Fora 
CW pair (X, A), there is an isomorphism 
H,(X, A; G)= Fl, (X/A; G). Similar statements 
hold for the cohomology groups. 

A continuous mapping f:(X, A)(Y, B) of 
topological pairs is a continuous mapping f: 
X — Y such that f(A)c B. If f:(X, A)(Y, B) is 
a continuous mapping, then f,:S(X)5(Y) 
sends S(A) to S(B), and hence f induces homo- 
morphisms f,: H,(X, A; G)2 H,(Y, B; G) and 
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f*: H*(Y B; G)  H*(X, A; G). Since the prop- 
erties are analogous, we state them below 
only for the case of relative homology. 

The following six properties are fundamen- 
tal. (i) 1, = 1. (ii) (go f), — 9, o fẹ- (iii) Homo- 
topy property: If f, f’:(X, A)—>(Y, B) are tho- 
motopic, then f, = f,. (iv) Exactness property: 
There exists a homology exact Sequence of 
A Ar. S HS G) S HG G) H(X, 4; G) 5S 

WIER G)>.. , where i: Ac X, j:(X, 9f) 
d A), and 0, sends the homology class of a 
cycle of (S(X ys (4)) & G represented by a 
chain ce S(X) & G to the homology class of óc 
which is a cycle of S(A) @ G. 0, is called the 
boundary homomorphism or the connecting 
homomorphism. (v) Naturality of 0,: For any 
continuous mapping f:(X, A)—(Y, B), it holds 
that 6,0 f, —(f | A), 06,. (vi) Excision pro- 
perty: If U is a subset of X such that the clo- 
sure U is in Int A, then the excision isomor- 
phism H,(X — U, A—U;G)z H,(X, A; G) is 
induced by inclusion. 

The exactness property extends to the 
homology exact sequence of a triple (X, A, B): 

S HA, B; G)^H, (X, B; G) H(X, A; ae 
" ,(A, B;G)5.... A couple {X,,X,} of sub- 
sets of X is said to be excisive if H,(X;, X, N 
X) = H(X; U X,, X2) is induced by inclusion. 
If {X,, X2} is excisive, so is {X,, X,}. For ex- 
ample, if X — Int X, UInt X, or if X, and X, 
are subcomplexes of a CW complex, then 
{X,, X2} is excisive. If {X,,X,} and (4,, A;] 
are excisive couples such that A, c X, and 
A, c X,, then we have the relative Mayer- 
Vietoris exact sequence: ... >H (X, N X2, A, 
Ay; G)> HX, A5: G) + HG, dar G)> H(X, U 
X4, A,U A5; G)G H, UN X5, AO 45; G).... 
For the case of relative cohomology, we use 
terms such as cohomology exact sequence and 
coboundary homomorphism, correspondingly. 

The universal coefficient theorems are 
valid for the relative (co)homology groups. 
Given a homomorphism C © G" 5G, if 
(A, B} is excisive in X, then the cup product 
—:H'(X, A; G) Q H*(X,B;G')- H?*4(X, AU 
B; G) and the cap products ~ : H4(X, B; G') © 

H, (X, AU B; G^) H,(X, A; G) can be de- 
fined. The product (X, A) x (Y, B) is defined 
to be the pair (X x Y, A x YU X x B). Given a 
homomorphism G' &) G" 5G, the homology 
cross products x : H,(X, A; G) & H,(Y, B; G”) 
2 H,.4(X, A) x (Y, B); G) and the slant prod- 
ucts /: HP'*((X, A) x (Y, B); G) © H,(Y, B; G”) 
 H'(X, A; G) can be defined. If {A x Y, X x 
B} is excisive, then the cohomology cross 
products x :H?(X, A; G')® H4(Y, B; G")5 
H?*4((X, A) x (Y, By; G) can also be defined. 

If {A x Y, X x B] is excisive, the Künneth 
theorems are valid for the relative 
(co)homology groups [9, 10, 11]. 
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M. Cech Homology Theory 


Another homology theory is commonly used 
along with the singular theory. The theory was 
originated by E. Cech (Fund. Math., 19 (1932)) 
and was modified by C. H. Dowker (Ann. 
Math., (2) 51 (1950)). 

Given a topological space X and an Abelian 
group G, the Cech homology group Hi, (X; G) 
and the Cech cohomology group H*(X; G) are 
defined as follows. We take the family of all 
topen coverings of X directed by frefinement, 
and we consider the *nerve K(U) of each open 
covering H, that is, the simplicial complex 
whose simplexes are finite nonempty subsets 
of U with nonempty intersection. If W is a 
refinement of U, then a simplicial mapping 
n (1L, W): K (1)  K (11) is obtained by assign- 
ing to each U'elI' an element UeU such 
that U' c U. The induced homomorphisms 
n (1, M), : H, (KU); G)  H, (K(1D; G) and 
z(1L UI : H*(K(1D; G) > H*(K (Y); G) are inde- 
pendent of the choice of z(U, U’), and we have 
the tinverse system | H,(K(10); G), x (4L HL, 
and the direct system ( H*(K(10); G), x (1L, W’)*}. 
We now define H,(X; G)=lim H, (K (1I; G) 
and H*(X; G)—lim H*(K(W i" G). 

A continuous mapping f:X—Y induces 
homomorphisms f, : H,(X; G)>H,(Y;G) and 
f*:H*(Y;G)O H*(X; G) as follows. If 8 is an 
open covering of Y then a simplicial mapping 
fa: KIT 1 (83)) 5 K (3) is defined by Lat! "OCH 
=V (Fe The induced homomorphisms 
Sar: A, (K(f 7! (8) G) 5 A, (K(X); G) for 
all open coverings 38 of Y gives rise to f, : 
H,(X; G)> H,(Y; G). Similarly f induces f *. 

Another approach to Cech cohomology 
theory is called the Alexander-Kolmogorov 
construction (Proc. Nat. Acad. Sci., 21 (1935) 
and C. R. Acad. Sci. Paris, 202 (1936)). The 
approach was improved by E. H. Spanier 
(Ann. Math., (2) 49 (1948)), and the theory is 
now called the Alexander (or Alexander- 
Kolmogorov-Spanier) cohomology theory. 

The Alexander cohomology group H*(X; G) 
is defined as follows. Let $*(X; G) be the 
Abelian group of all functions from the (q+ 1)- 
fold product space X**! to G with addition 
defined pointwise. An element ye ®*(X; G) 
is said to be locally zero if there is an open 
covering U of X such that (xo, ...,x,) van- 
ishes if xo, ..., x, are contained simultaneously 
in some U eU. The subgroup of P(X; G) con- 
sisting of locally zero functions is denoted by 
D(X; G). We define a homomorphism 67: 
D(X; a G) by (640) (Xp, X1; ..., 
Xq44)7 So CD) 6(xo; Xi- p Kris co Xa). 
Then i G) = {®1(X; G), 6%} is a cochain 
complex, and P(X; G)= {O3(X; G), 64} is its 
subcomplex. We now define H*(X; G) to be 
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the cohomology group of the quotient com- 
plex P(X; G) - (X; G)/®,(X: G). 

If f: X > Y is a continuous mapping, then a 
cochain map f * : (Y; G)  4(X; G) is defined 
by (f * Po xi... x) = efl) f G) 
f(x,)), and it induces the homomorphism f'*: 
H*(Y;G)— H*(X; G). There is a natural iso- 
morphism H*(X; G) z H*(X;G). 

The (co)homology group of a simplicial 
complex K is isomorphic to the Cech (co)- 
homology group of |K]. If X is a manifold or 
a CW complex, then its singular (co)homology 
group and its Cech (co)homology group are 
isomorphic. However, even for compact met- 
ric spaces X, the singular (co)homology group 
of X is not necessarily isomorphic with the 
Cech (co)homology group of X. 

If {X,} is an tinverse system of compact 
Hausdorff spaces and X = lim X,, then there 
are isomorphisms lim H,CX j; G)z H,(X;G) 
and lim H*(X,; G) z H*(X;G). This is called 
the continuity property for Cech theory. If A is 
any closed subset of a manifold M, then there 
is an isomorphism lim H*(W; G) = H*(A; G), 
where W varies over neighborhoods of A in M 
directed downward by inclusion. If the tcover- 
ing dimension of X is n, H4(X;G)=0 for qn. 

The cup product in the Cech cohomology is 
introduced simply by passing to the limit with 
cup products in simplicial complexes, and the 
cup product in the Alexander cohomology is 
induced from the operation ~ :®?(X; GI 
P(X; G") +? '*(X; G) defined by (9 ~ W) (Xo, 
Xi, Xpt) = Q(Xo Xi; Xp V (Xp. Xpt 
56s EE 

The relative Cech homology group H,(X, 

A; G) and the relative Cech cohomology group 
H*(X, A; G) are defined as follows: An open 
covering of (X, A) is a pair (U, 9t) of an open 
covering U of X and an open covering 9t of 
A such that 9t c U. To such a pair (U, 9t) we 
assign a simplicial pair (K (4), K'(91)), where 
K'(90) is the nerve of (14 2 (NT A[N e90j. 
Considering the family of all open cover- 
ings of (X, A), we define now H,(X, A; G)= 
lim H, (K (4D, K' (81; G) and H*(X,A;G)— 
lim H*(K (U), KO G). 

The relative Alexander cohomology group 
H*(X, A; G) is defined to be the cohomology 
group of the cochain complex which is the 
kernel of the cochain mapping i*: ®(X; G)> 
(A; G) induced by inclusion. There is a nat- 
ural isomorphism /f*(X, A; G) z H*(X, A; G). 

The relative Cech (co)homology groups 
satisfy the properties analogous to the relative 
singular (co)homology groups except the 
exactness property for homology (— Section 
Q). In certain cases, the excision property is 
strengthened for Cech (co)homology. For 
example, we have the following theorem: As- 
sume that X and Y are compact Hausdorff 
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spaces, A and B are closed subsets of X and Y, 
respectively, and f:(X, A)—(Y, B) is a continu- 
ous mapping which maps X — A onto Y— B 
homeomorphically. Then f, : H,(X, A; G) 
H,(Y, B; G) and f *: H*(Y, B; G)& H*(X, A; G) 
hold. 


N. Fundamental Classes of Manifolds 


For a topological space X and a point x of X, 
the local homology groups H. (X, X — x) repre- 
sent a topological property of X around x. 

The notion of *orientation for differentiable 
manifolds and triangulable manifolds general- 
izes to ttopological manifolds by using local 
homology groups as follows. Let M be an 
n-dimensional (topological) manifold with 
*boundary 0M. If x is a point of the interior 
M, =M —0M,then H,(M, M —x)=H,(R", R” 
—0)is Z for g=n and is 0 for g#n. We define 
a local orientation o, for M at xe M, to bea 
choice of one of the two possible generators 
for H,(M, M — x), and we then define an orien- 
tation for M to be a function which assigns to 
each xe M, a local orientation o, which varies 
continuously with x in the following sense: For 
each x there should exist a compact neighbor- 
hood N c M, and an element op€ HAM. M — 
N) such that i. (oy) — o, for each ye N, where 
i.: (M, M — N) c (M, M — y). If there is an ori- 
entation for M, then M is said to be orient- 
able, and the pair of M and an orientation is 
called an oriented manifold. If M is a nonorient- 
able manifold without boundary, the set of 
local orientations for M forms an orientable 
manifold doubly covering M, called the orien- 
tation manifold of M. 

If M is an oriented n-dimensional manifold, 
then for any compact subset K of M there is a 
unique element oe € HOM (AM — K)U0M) such 
that i, (0x) —0, for each xe KN Mo, where 
i: (M,(M — K)UOM)C(M, M - x). The ele- 
ment o, is called the fundamental homol- 
ogy class around K. In particular, if M is it- 
self compact, o, € HOM. 6M) is usually de- 
noted by [M] and is called the fundamental 
homology class of M. A connected compact 
n-dimensional manifold M 1s orientable if 
and only if H,(M,0M)-ZO, and in this case 
H,(M,0M)is a free cyclic group generated by 
a fundamental class [M]. If M is an orientable 
compact n-dimensional manifold, then ôM is 
an orientable compact (n — 1)-dimensional 
manifold without boundary, and the boundary 
homomorphism 0, : H,(M, 6M)» H,.,(0M) 
sends a fundamental class [M] to a funda- 
mental class [0M]. 

An n-dimensional manifold M is orientable 
if and only if there exists an element Ue H"(M 

x M, M x M—dM) such that, for each xe Ma, 
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j*(U) is a generator of H"(M, M — x), where 
dM is the diagonal in M x M, and j,:(M,M 
—x)2(M x M, M x M —dM) is given by j,(y) 
=(x, y) (ye M). In fact, U corresponds to an 
orientation which assigns to each xe My a 
local orientation o, such that 4j*(U),0,5 — I. 
The element U is called the orientation coho- 
mology class of M. If M is a compact manifold 
without boundary, it holds that (d *(U), [M]5 
= x(M), where d*: H'(M x M, M x M —dM)—> 
H"(M) is induced by the diagonal mapping. 
The element d *(U)e H"(M) is called the Euler 
class of M. 

If we work with the (co)homology groups 
with coefficients in Z;, the fundamental classes 
are defined for an arbitrary manifold without 
making any assumption of orientability. If 
M is connected and compact, then H,(M, 

OM; Z,)=Z, is generated by [M]. 


O. Duality in Manifolds 


Let M be a compact n-dimensional manifold, 
and let M,, M, be compact (n — 1)-dimensional 
manifolds such that M, U M; — 6M and M, N 
M,=6M,=6M.,. Assume either that M is 
oriented or that G = Z;. Then for each q, an 
isomorphism D: H4(M, M,; G)z H, (M, 
M,;G) is defined by D(€)=€ —[M ]. In par- 
ticular, there are the isomorphisms D: H*(M, 
6M; G)= H, (M; G) and H*(M;G)x H,_,(M, 
0M; G), where the cap product is taken with 
respect to the homomorphism G ® ZG 
defined by multiplication. This theorem is 
called the Poincaré-Lefschetz duality theorem, 
and the special case for 0M = Qj is often 
referred to as Poincaré duality. 

Poincaré duality implies the following con- 
sequences for a compact n-dimensional mani- 
fold M without boundary. If M is orientable, 
then the gth Betti number is equal to the (n— 
q)th Betti number, and the gth torsion coef- 
ficients are equal to the (n —4— 1)th torsion 
coefficients. If n is odd, then y(M)=0, and if M 
is orientable and nz 2 mod A. then 7(M) is 
even. 

Poincaré duality generalizes to the following 
duality theorem. Let M be an n-dimensional 
manifold without boundary, and let K be a 
compact subset of M. Assume either that M is 
oriented or that G = Z;. Then there is an iso- 
morphism D: H*(K; G)= H, (M, M — K; G) for 
any q, which is given as follows: For each open 
neighborhood W of K, define Dy: HW; G)— 
H, (M, M — K; G) by Dy) =k, ky’ (og), 
where K, is the excision isomorphism induced 
by k:(W, W — K)c (M, M — K). Now D is de- 
fined to be the limit of Dy, where W varies 
over open neighborhoods of K. The inverse of 
D up to sign is given in terms of the slant 
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product with the fundamental cohomology 
class U of M as follows: For each open neigh- 
borhood W of K, we define a homomorphism 
Jw: H, (M, M — W; G)5 H*(W; G) by yy(a)= 
j*(U)y/a, where j*: H'(M x M, M x M—dM)> 
H'(W x (M, M —W) is induced by inclusion. 
Then, passing to the limit, these zw define the 
desired one. 

If we take an n-sphere S" as M in the above 
duality theorem and use the homology exact 
sequence of (S", S" — K), then we have the fol- 
lowing Alexander duality theorem (Trans. 
Amer. Math. Soc., 23 (1922)): If K is a closed 
subset of $", then the qth reduced Cech coho- 
mology group of K is isomorphic to the (n— 
q — 1)th reduced singular homology group of 
S" — K for any coefficient group G and any q. 
In particular, if K is a *neighborhood retract, 
Dok G)= H,_,-,(S"— K; G) holds. This shows 
that H,(S" — K) depends only on K and not on 
the way K is embedded in A" The Alexander 
duality theorem for n 22 and K — S! gives the 
classical tJordan curve theorem. 

In view of the duality theorems, certain 
classical definitions in the homology of mani- 
folds can be given in terms of cohomology. 
For example, if £f: M—M’ is a continuous 
mapping of oriented closed manifolds, then the 
Umkehr homomorphism or Gysin homomor- 
phism f': H,(M’; G)—> H,+4(M; G) (d dim M — 
dim M^) (W. Gysin, Comment. Math. Helv., 14 
(1941)) can be defined by Do f'=f* oD. In co- 
homology we have fi: HM; G)> H* *(M'; G). 
similarly, if M is an oriented n-dimensional 
closed manifold and ae H,(M), be H,(M), then 
the intersection product o ben... (M) of 
Lefschetz can be defined by a- ba D 'a-— ba 
D(D' !a.,.D !b). If p+q=n, the number a- 
be H)(M) = Z is called the intersection number 
of a and b. The classical definitions are still 
meaningful today, since they are closer to 
geometric intuition and therefore possess con- 
siderable heuristic value. For example, the 
following fact serves to compute cup products 
in manifolds. If M is an oriented closed differ- 
entiable manifold and a, be H,(M) are repre- 
sented by closed submanifolds N,, Na which 
intersect ttransversally, then +a-b is repre- 
sented by N, N N, [11]. See [12] for a rigor- 
ous discussion of classical intersection theory. 


P. Cohomology with Compact Supports 


Let X be a topological space. À subset V of X 
is said to be cobounded if X —V is compact. A 
singular q-cochain ue Hom(S,(X), G) is said 
to have compact support if there exists a co- 
bounded set V such that u(o)=0 for every 
singular q-simplex c in V. The singular co- 
chains with compact support form a subcom- 
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plex of the cochain complex Hom(S(X), G). 
The cohomology group of this subcomplex is 
denoted by H?*(X; G) and is called the singu- 
lar cohomology group of X with compact sup- 
ports. There is an isomorphism H*(X;G)= 
lim H*(X, V; G), where V varies over co- 
bounded subsets of X. 

Let K be a simplicial complex. A q-cochain 
uc Hom(C,(K), G) is called a finite cochain 
of K if u(c) «0 except for a finite number of 
oriented q-simplexes o of K. If K is a tlocally 
finite simplicial complex, then finite cochains 
of K form a subcomplex of the cochain com- 
plex Hom(C(K), G) whose cohomology group 
is isomorphic to H*(|K|; G). 

Let X be a "locally compact Hausdorff 
space, and let X U {20} denote the *one-point 
compactification of X. Then the Cech coho- 
mology group of X with compact supports, 
denoted by H*(X;G) is defined to be the re- 
duced Cech cohomology group of X U foc! 
with coefficients in G. There is an isomorphism 
H*(X; G)=lim H*(X, V; G), where V varies 
over cobounded subsets of X. If X is a mani- 
fold or a CW complex, then H*(X;G)= 
H*(X;G). If X is a compact Hausdorff space 
and A is closed in X, then H*(X — A:G)& 
H*(X, A; G). The Alexander-Kolmogorov con- 
struction gives a direct approach to H*(X;G) 
[10, 13]. 

A tproper continuous mapping f: X ^Y of 
locally compact Hausdorff spaces induces 
homomorphisms f *: H*(Y; G)— H*(X; G) and 
f*: H* (Y; G) H*(X5 G), and if f, f': X Y are 
properly homotopic, then they induce the 
same homomorphisms. 

The cohomology with compact supports is 
useful in order to extend results in the coho- 
mology of compact spaces to noncompact 
spaces. For example, the conclusion of the 
duality theorem on a compact set K c M in 
Section O generalizes to the case of a closed 
set K c M as follows: There is an isomorphism 
H3(K; G)& H, (M, M — K; G) for any q. This 
implies the following generalization of Poin- 
caré duality: H2(M; G)= H, (M; G) holds for 
an orientable n-dimensional manifold M with- 
out boundary. 

There are homology theories associated 
with the cohomology theories with compact 
supports [13]. 


Q. Eilenberg-Steenrod Axioms 


Let H* be a collection of the following three 
functions: (1) A function assigning to each 
topological pair (X, A) and each integer q an 
Abelian group H*(X, A). (2) A function assign- 
ing to each continuous mapping f:(X, A) 
(Y, B) and each integer q a homomorphism 
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f*: H*(Y, B)> H*(X, A). (3) A function as- 
signing to each topological pair (X, 4) and 
each integer q a homomorphism ó*: H*(4)— 
H**' (X, A). Then H* is called a cohomology 
theory on the category of topological pairs if 
the following seven axioms are satisfied [7]. 
(i) 1, — 1, where 1 is identity. (ii) (go f)* = 
f*og*:H*(Z, C)  H*(X, A) for continuous 
mappings f:(X, A)-(Y, B) and g:(Y, B)> 

(Z, C). (tit) Homotopy axiom: If f, f':(X, A4) 
(Y, B) are homotopic, then f, = f, : H*(Y, B) 
HX, A). (iv) Exactness axiom: The Begueiee 

S H(X, AS mx) S uA) mx, a)... 
is Ee where i: Ac X and j:(X, @)c(X, A). 
(v) f* oó* =6* o( f| A)*: H4(B) H** (X, A) for 
a continuous mapping f:(X, A)(Y, B). (vi) 
Excision axiom: If U is an open set of X such 
that U C Int A, then i*: H(X, A) H(X —U,A 
— U), where i is the inclusion. (vii) Dimension 
axiom: H^*(pt) 20 if q #0. Axioms (1)-(vii) are 
called the Eilenberg-Steenrod axioms, and the 
group H°(pt) is called the coefficient group of 
the cohomology theory H*. 

A cohomology theory on the category of 
pairs of compact Hausdorff spaces is defined 
similarly. A cohomology theory on the cate- 
gory of CW pairs (or finite CW pairs) is de- 
fined similarly except that axiom (vi) is re- 
placed by the following excision axiom: If 
{X,, X2} is a couple of subcomplexes of a CW 
complex, then i*: H4(X,U X,, X;))z H*(X,, 

X, N X5), where i is the inclusion. Two coho- 
mology theories H* and H'* on the same 
category are isomorphic if there is an isomor- 
phism h,: H(X, A) z H(X, A) for each (X, A) 
and each q, and they commute with f * and 
ó*. A homology theory on various categories 
is defined similarly by dualization. 

A singular (co)homology theory with coeffi- 
cients in G is an example of a (co)homology 
theory on the category of topological pairs. 
The Cech cohomology groups with coefficients 
in G can be made into a cohomology theory 
on the category of topological pairs. However, 
the Cech homology groups do not constitute a 
homology theory on the category of topolog- 
ical pairs; the homology sequence of any pair 
(X, A) is defined, but it can be proved only 
that the composite of any two successive 
homomorphisms is zero. The Cech homology 
groups with coefficients in a field constitute a 
homology theory on the category of compact 
Hausdorff pairs. The Alexander cohomology 
groups constitute a cohomology theory on the 
same category of topological pairs, and it is 
isomorphic to the Cech cohomology theory if 
their coefficient groups are isomorphic [10]. 

The Cech (co)homology constitutes a (co)- 
homology theory on the category of CW 
pairs, and it is isomorphic to the singular 
(co)homology theory on the same category if 
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the coefficient groups are isomorphic. (Co)- 
homology theories on the category of finite 
CW pairs are determined, up to isomorphisms, 
by their coefficient groups. This fact is called 
the uniqueness theorem of homology theory 

on the category of finite CW pairs. Coho- 
mology theories on the category of pairs of 
compact Hausdorff spaces which satisfy the 
following continuity axiom are determined, up 
to isomorphisms, by their coefficient groups: If 
{(X,, A,)} is an inverse system of pairs of com- 
pact Hausdorff spaces, then H *(lim Xi, lim A;) 
~lim H*(X;, A;). The Cech cohomology the- 
ory satisfies this axiom. 

During recent years, many (co)homology 
theories have been developed which satisfy the 
first six Eilenberg-Steenrod axioms but fail to 
satisfy the dimension axiom. These are called 
generalized (co) homology theories, and include 
various *K-theories, tbordism theories, and 
‘stable homotopy theories (— 202 Homotopy 
Theory). 


R. Homology with Coefficients in Local 
Systems 


N. E. Steenrod (Ann. Math., (2) 44 (1943)) in- 
troduced the (co)homology group with coef- 
ficients in a local system of Abelian groups, 
which is useful in *obstruction theory and in 
the homology theory of tfiber spaces. 

A local system (5 of Abelian groups on a 
topological space X is a set of Abelian groups 
G,, one for each xe X, together with an iso- 
morphism /*: Groy >G) for each fpath I: [0,1] 
X subject to the following conditions: (1) 

If two paths l and I are homotopic with 
endpoints fixed, then /* = /'*. (2) If | and m 
are paths such that /(1) 2 m(0), then (l - m)* = 
m* ol*, where l: m denotes the *product of | 
and m. An example is provided by the *homo- 
topy groups z,(X,x) fornz 2. Let M bean 
n-dimensional topological manifold. Then 

x H,(M, M —x) is a local system of infinite 
cyclic groups. It is called the orientation sheaf 
of M.'A local system 6 is said to be trivial if 
ID TS for any paths l, I’ with the same initial 
and final points. 

Given a local system © of Abelian groups 
on a topological space X, a chain complex 
S(X; 6) — (S,(X: G), 0,) is defined as follows: 
If q «0, then S,(X; 6) —0, and if 420, then 
S,(X; G) is the Abelian group of formal finite 
sums È go, where o:A?->X are singular q- 
simplexes in X and g,€ Gan o. ou the bound- 
ary operator 0,:S,(X; 5) S, ,(X; G) is given 
by &, (3,0) — I (8,0 0 & + X ( 714,005; 
where c oe, is the ith face of c, and I,: [0,1] 
>X is given by /,(t)=o(1 —t,t,0,...,0). The 
homology group of the chain complex S(X; ©) 
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is denoted by H,(X; (5) and is called the sin- 
gular homology group of X with coefficients in 
(5. 

Similarly, a cochain complex S*(X; 6)= 
1S*(X; ©), 6%} is defined as follows: If q «0, 
then S"(X; ©)=0, and if g>0, then S$4(X; 65) is 
the Abelian group of functions u assigning to 
every singular q-simplex o in X an element 
u(a)€ Go o... ou: the coboundary operator 
64:$4(X; 6)5 S**! (X; 6) for g>0 is given by 
(ó0*u)(c) =1*  u(c oe) + E221 ( —1Yu(o oe), 
where ø is a singular (q+ 1)-simplex in X. The 
cohomology group of the cochain complex 
S*(X; ©) is denoted by H*(X; ©) and is called 
the singular cohomology group of X with coef- 
ficients in ©. 

If © is trivial, then the (co)homology group 
with coefficients in © coincides with the (co)- 
homology group with coefficients in G2 G,- 
The various notions and theorems on the 
ordinary (co)homology can be extended to 
(co)homology with coefficients in ©. The Cech 
(co)homology group with coefficients in (5 is 
also defined [10]. The cohomology groups 
with coefficients in © are generalized to the 
cohomology groups with coefficients in a sheaf 
[10, 14] (— 383 Sheaves). 
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A. General Remarks 


Given a topological space X, we utilize the 
concept of homotopy to define the ‘funda- 
mental group, homotopy groups, and co- 
homotopy groups of X. These groups, to- 
gether with (co)homology groups, are useful 
tools in topology. 

Since the research of H. Hopf, W. Hurewicz, 
and H. Freudenthal in the 1930s, homotopy 
theory has made rapid progress and now plays 
an important role in topology. 


B. Homotopy 


If a family f: X > Y (tel={t|O<t<1}) of 
*continuous mappings from a ttopological 
space X into a topological space Y is also 
continuous with respect to t, that is, if the 
mapping F from the product space X x I into 
Y defined by F(x, t) — f(x) (xe X,telI) is con- 
tinuous, then { fj) or F is called a homotopy. In 
this case, fọ and f, are said to be homotopic. 
This relation between fọ and f, is indicated by 
fo fi: X 9 Y, or simply f; fi, and is called 
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the relation of homotopy. Denote by Y* the 
set of all continuous mappings from X into 

Y. The homotopy relation is an fequivalence 
relation on Y*, and the equivalence class [ f ] 
of a mapping f: X — Y is called the homotopy 
class (or mapping class) of f. The set of all 
homotopy classes of mappings of X into Y is 
called the homotopy set and is denoted by 
n(X; Y) or [ X, Y]. A function y of continuous 
mappings fe Y* is called a homotopy invariant 
if f 2: g implies y(f)=y(g). When X consists of 
a point x we write z(*; Y)=7,(Y). If all con- 
tinuous mappings in Y* are homotopic to each 
other, we write z(X; Y)=0; z)(Y)=0 means 
that Y is farcwise connected. A mapping f 
from a compact space into an n-dimensional 
sphere S” is called essential if any mapping g 
homotopic to f satisfies g(X) — S". A mapping 
is inessential if and only if it is homotopic to 
the constant mapping. 

These concepts are generalized as follows: 
Let A; and Bj (i— 1,2,...) be subspaces of X 
and Y, respectively, and denote by Y *(A,, A,, 
5 B4, B,,...) the set of continuous mappings 
f € Y* satisfying f(A) c B;. If a homotopy IO 
is such that f,¢ Y*(A,; Bj), then ( f.) is called a 
restricted homotopy with respect to A;, B; ora 
homotopy from a system of spaces (X, A,, 

A5, ...) into a system of spaces (Y, B, , B, ...). 
The notation fo ~fi :(X, A,, Aj, ...) G(Y, B,, 
B,,...) and the homotopy set z(X, 4,, Az, ...; 
Y, B,, B;,...) are defined accordingly. 

For the composite go f € Z*(A,; Cì) of 
f Y*(A; B) and ge Z'(B; C), f= f' and 
g^g' imply go f ~ g'of'. Thus the composite 
poa—[gof ]en(X, A; Z, C) of [f]=ae 
z(X, Aj; Y, B) and [g] Ben, Bj; Z, C) is 
defined. By putting g,[ f]=[go f]=f/*[g] 
we induce two mappings, 


g4 (X, Ai; Y, B) n(X, Aj; Z, Cj), 
Kë : n(Y, B;; Z, C)>n(X, Au Z, C 


Then f~ f" implies f * = f’* and g~g’ implies 
Ix 7 g4. Also (go f), —g,0f,.(gof)*—f*o 
g*, and h*og, —g, oh*, where he X" (Dj; A;). 
The category of pointed topological spaces 
is defined to be the tcategory in which each 
object X, which is a topological space, has a 
point fixed as a base point and each mapping 
X — Y carries the base point of X to. the base 
point of Y. In this category, we define a homo- 
topy set, denoted by z(X; Y)o or [ X; Y ],, as 
follows: Denoting the base points by *, we 
have z(X, Aj, *; Y, Bi, *) —n(X, A; Y, B)g. A 
continuous mapping f homotopic to the 
constant mapping X > +e Y is said to be 
homotopic to zero (or null-homotopic). This 
is indicated by f~0, and z(X; Y); =0 means 
that all continuous mappings are homotopic 
to zero. Let S° be a set of two points; then 
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n(X; S9), 20 means that X is connected. In 
contrast to these specific homotopies, the 
usual homotopy is sometimes called a free 
homotopy. 

Suppose that a homotopy TC (f: X > Y) is 
such that the restriction of f, to a subspace A 
of X is stationary, that is, f,(a)= fo(a) (ae A, 
tel). Then fo, f, are said to be homotopic 
relative to A, indicated by fọ ~ f, (rel. A). If a 
homotopy TC ( f,: X 5 Y) is such that each 
f, is a homeomorphism into Y, then { f;} is 
called an isotopy and f, is called isotopic to f, 
(— 235 Knot Theory). 

Research done by L. E. Brouwer, H. Hopf, 
W. Hurewicz, K. Borsuk, L. S. Pontryagin, 
and S. Eilenberg has contributed to the theory 
of homotopy, an important field of topology 
still in the process of development. 


C. Mapping Spaces 


We endow the set Y* of all continuous map- 
pings f: X 2 Y, with the tcompact-open topol- 
ogy. The topological space Y* is called a 
mapping space. In particular we denote Y’ (0, 1; 
*,*)(xeY) by Q(Y)=Q(Y, x) and call it the 
space of closed paths (or loop space) of Y. Two 
points f, g of Y* are connected by a *path 

in Y* if and only if f &«g: X >Y. Thus z,(¥*) 
—n(X; Y) and m9(Y*(A;; Bj) 2 n(X, Aj; Y, Bj). 


D. Retracts 


Let A be a subspace of a topological space X. 
If there exists an f € A* such that the restric- 
tion f | A is the identity mapping of A, then A 
is called a retract of X, and f a retraction. If A 
is a retract of X, any continuous mapping of A 
into any topological space can be extended to 
a continuous mapping of X. If A is a retract 
of some neighborhood U(A), A is called a 
neighborhood retract or NR of X. If for any 
'homeomorphism of a metric space A onto a 
closed subspace A, of any metric space X, Ao 
is a retract (neighborhood retract) of X, then A 
is called an absolute retract or AR (absolute 
neighborhood retract or ANR). For example, 
an n-dimensional simplex or an n-dimensional 
Euclidean space is an AR. If a retraction f is 
homotopic to the identity mapping of X (resp. 
U(A)), we call A a deformation retract (neigh- 
borhood deformation retract) of X. Moreover, 
if f — 1, (rel. A), then A is called a strong de- 
formation retract. In particular, if a point x, is 
a (strong) deformation retract of X, we say 
that X is contractible to the point xg. For 
example, any tpolyhedron P and any com- 
pact n-dimensional *topological manifold are 
ANRs; any polyhedron P, contained in P in a 
strong deformation retract of some neighbor- 
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hood in P. X is called locally contractible if 
each point x of X has a contractible neighbor- 
hood U of x. 


E. The Extension Property 


Let X, Y be topological spaces, Ac X, fo, 

fie Y*, and (g: A— Y! a homotopy such that 
g;— f| A (i— 0,1). We can extend {g,} to a 
homotopy { f;} of X if and only if the mapping 
F:(X x Q)U(A x DU(X x 1) Y defined by 
F(x, i) = f(x), F(a, t)=g,(a) can be extended to 
a continuous mapping sending X x / into Y. 
Therefore the problem of whether fọ ~ f, can 
be reduced to the problem of whether a con- 
tinuous mapping defined on a subspace can be 
extended to the whole space. If for any homo- 
topy (9: A9 Y] and any continuous mapping 
fo: X ^ Y into any topological space Y satisfy- 
ing f; | A — go there exists a homotopy | f,: X 
Y} satisfying f,| A=g,, then we say that (X, A) 
has the homotopy extension property. This 
occurs if and only if (X x 0)U(A x I) is a re- 
tract of X x I. A pair (X, A) of ANRs, where 

A is closed in X, and a pair (P, P5) with P a 
*CW complex and P, a subcomplex of P have 
this property. Given a continuous mapping 

h: B>A of a subspace B of a topological space 
Y into a topological space A, we identify be B 
with h(b)e A in the tdirect sum AU Y and 
obtain the *identification space denoted by 
AU, Y, which is called an attaching space 
under h. If (Y, B) has the homotopy extension 
property, then (Y x X, B x X) and (AU, Y, A) 
also have the same property. When A consists 
of a point *, we write x U, Y= Y/B and call 

the space Y/B a space smashing (shrinking or 
pinching) B to a point. If Y= Bx I, B=B x0, 
then we call AU,(B x I) a mapping cylinder of 
h, (AU,(B x I))/(B x 1) a mapping cone of h, 
and the mapping cylinder and mapping cone 
of h: B5 x the cone over B and suspension of B, 
respectively. 


F. Homotopy Type 


For systems (X, Aj), (Y, Bj) of topological 
spaces, if there exist fe Y* (A; B), ge X! (B; A4 
such that go f and fog are homotopic to the 
identity mappings of (X, A;) and (Y, Bj), respec- 
tively, then we say that (X, A,) and (Y, Bj) have 
the same homotopy type or are homotopy 
equivalent. Such mappings f and g are called 
homotopy equivalences. For a homotopy 
equivalence f, the induced mappings f, and 

f * are bijective. Therefore, in homotopy 
theory, systems of spaces having the same 
homotopy type are considered equivalent. 

If A is a deformation retract of X, then A 

and X have the same homotopy type, and the 
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injection of A into X and the retraction of X 
onto A are homotopy equivalences. A con- 
tractible space has the same homotopy type 
as a point. Spaces having the same homo- 
topy type have isomorphic homotopy groups 
and *(co)homology groups. Since the mapping 
cylinder Z, — YU,(X x I) of fe Y* contains Y 
as its deformation retract, it has the same 
homotopy type as Y. By this homotopy equiva- 
lence, f can be replaced by the injection of X x 
1 into Z,. If to each topological space there 
corresponds a value (which may be some ele- 
ment of R or some algebraic structure) and 
the values are the same for homotopy equiva- 
lent spaces, then the value is called a homotopy 
type invariant. A homotopy type invariant is a 
"topological invariant; for example, z(X; Y) is 
a homotopy type invariant of X. If a continu- 
ous mapping f: X — Y induces isomorphisms 
of the homotopy groups of each tarcwise con- 
nected component, then f is called a weak 
homotopy equivalence. Conversely, if X and Y 
are CW complexes, then a weak homotopy 
equivalence is a homotopy equivalence (J. H. 
C. Whitehead). 

Now we consider the category of pointed 
topological spaces. Let A and B be pointed 
topological spaces. Then the ‘direct sum in this 
category is the one-point union (or bouquet) 

A v B obtained from the disjoint union AU B 
by identifying two base points x4 and *p. 
Av B is identified with the subspace (A x 
*g)U(*4 x B) in A x B. The reduced join (or 
smash product) of A, B is the space obtained 
from A x B by smashing its subspace Av B 
to a point and is denoted by AA B. We call 
A ^S! the (reduced) suspension of A and de- 
note it by SA. Repeating the suspension n 
times, we have the n-fold reduced suspension of 
A. We call CA- AAI (I — [0,1]) the reduced 
cone of A (I has the base point 1). For a con- 
tinuous mapping f: X Y, the space obtained 
by identifying each point (x, 0) of the base of 
CX with f(x)e Y is called the reduced mapping 
cone and is denoted by C, — YU, CX. The 
reduced join (or smash product) of mappings 
f: Yo X and f': Y' o X' is the mapping f^ f’: 
YAY'2X ^ X' induced from the product 
mapping f x f': Y x Y'o X x X'. The reduced 
join of f: YX and 1:$' 5 S! (identity map- 
ping) is written as Sf — f ^1 and is called the 
suspension of f. 


G. Puppe Exact Sequences 


For f: X 2 Y and g: Y2Z, we have go f ~0 if 
and only if g can be extended to a continuous 
mapping from C, into Z. In other words, the 
sequence 


x(Cy; Zh o n(Y; Z) o n(X; Z) 


T16 


is exact Ge, Imi* = Ker f * = f * ^! (0), where 

i: YC, is the canonical inclusion and 0 is the 
class of the constant mapping). The inclusion 
i: Y 2 C, gives rise to the reduced mapping 
cone C;. We also have the canonical inclusion 
i': C, C,. Adding the term n(C;:Z o> to the 
left-hand side of the sequence above, we have a 
new exact sequence. Continuing this process, 
we obtain an exact sequence of infinite length. 
If X, Y satisfy a suitable condition (e.g., X, Y 
are CW complexes), then C; has the same 
homotopy type as the reduced suspension 

SX of X; i'* is equivalent to p*:2(SX;Z))—> 
m(Cy; Z)o induced by a mapping p: C, SX 
smashing Y to a point; and furthermore C, has 
the same homotopy type as SY, and the inclu- 
sion ig: SX >C, is equivalent to the suspension 
Sf :SX SY of f. Thus the following Puppe 
exact sequence is obtained: 


S n(SCy: Z) Sr n(SY; Z) ^ n($X; Z)g 
rn(C5; Zi o n(Y; Zi (Xs Z)o. 


In this exact sequence, if Y is a CW complex, 
X is a subcomplex of Y, and f is the inclusion 
i: X OY, then C; 2 C, YU Cy is homotopy 
equivalent to the space C;/CX = Y/X obtained 
by smashing CX to a point, and an exact 
sequence of the following type is obtained: 


n(SX; Zj 30 ¥/X; Z >n(Y; Z)o 
Ë n(X; Z)o- 


A sequence equivalent to the sequence 
x45yb5c ; is called a cofibering, for which a 
similar exact sequence is obtained. For a con- 
tinuous mapping f: X — Y, consider the sub- 
space E,={(x, 9)| f(x) 2 9(0)) of the product 
space X x Y'. By identifying X with (x, ol 
Q,(I) = f(x)}, we can regard X as a deforma- 
tion retract of E,. By putting p,(x, 9) 2 (1), 
we obtain a fiber space (E,, p1, Y). The fiber 
T, — p, '(*) is called a mapping track of f. 
Using the covering homotopy property, we 
see that the sequence 


n(W; (ST NEI Y)o 


is exact, where p(x, o) — x. This sequence is 
also extended infinitely to the left as 
2; OX)  ssw;QY) SW; TA. 
where i is the inclusion of the loop space QY 
into T, and Of: X 5 OY is the correspon- 
dence of the loops induced from f. 


H. Homotopy Sets that Form Groups 


If X 2 SX' or Y 2 OY' (or, generally, if Y isa 
'homotopy associative tH-space having a 
thomotopy inverse), then z(X; Y), forms a 
group. In the general case the product of the 
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loops induces the product of z(X; ONT, We 
represent a point of SX by (x, t) (xe X, tel) 
and define the mapping (2,9: X >QY for each 
g: SX >Y by Qog(x)(t)= g(x, t). Hence an 
isomorphism Q,:z(SX; Y); z n(X; QY), is 
obtained. Each of the following pairs of homo- 
morphisms is equivalent: f, :z(SX; Y)ọo > 
n(SX; HI Je and Of, :n(X; QY) 9 > a(X;QY"); 
and h*:2(X';QY),>2(X;QY), and 

Sh* :n(SX'; Y) n(SX; Y)s. 

If S” is an n-dimensional sphere, then z,(X) 
—n($"; X), is the n-dimensional homotopy 
group (— Section J). Let z"(X)2 x(X; S”). If 
X is a CW complex of dimension less than 2n 
— 1, 2"(X) is the cohomotopy group isomor- 
phic to z(X; Q$"*!), (— Section I). Let K, be 
an 'Eilenberg-MacLane space of type (11, n). 
Then we have K,— OK, ,,, and if (X, A) is a 
pair of CW complexes, then x(X/A; K,)s coin- 
cides with the cohomology group H"(X, A; I1). 
For the ‘classifying space Bo (By) of the infinite 
orthogonal group O (infinite unitary group U) 
(— Section V), 2(X/A; Bo) (1(X/A; By)) may be 
considered the KO-group KO(X, A) (K-group 
K(X, A)) (— 237 K-Theory). 


I. Cohomotopy Groups 


K. Borsuk defined a sum of mapping classes of 
X into $" (1936), which was named Borsuk's 
cohomotopy group by E. Spanier. Spanier also 
studied the duality of the cohomotopy group 
with the homotopy group and its relations to 
the usual cohomology groups. A cohomotopy 
group of (X, A) is defined to be z"(X, A)= 
n(X, A; S", «), which forms a group if dim X/A 
<2n—1. A mapping E X/A—S" x S" given 
by F(x) 2 (f(x), g(x)) with Lo X/AS" is 
homotopic to a mapping into S" v S". If we 
compose F with a folding mapping of S" v 
$" onto $", we obtain a mapping that repre- 
sents the sum [ f ] - [3]. With each homo- 
topy class of a continuous mapping f of an 
n-dimensional tpolyhedron K” into an n- 
dimensional sphere $", we associate the image 
f *(u) of the fundamental class ue H"(S"; Z) 
under the induced homomorphism f *: 
H"(S". Z)  H"(K"; Z). We then obtain a bijec- 
tive relation z"(K") 5 H"(K"; Z), called Hopf's 
classification theorem. 


J. Homotopy Groups 


Let X be a topological space with a base point 
$,1 1t s tis tOStu touts libe 
the unit n-cube, and /" its boundary. Write 
Q'(X, x) — X" (f", «) (in particular, Q! (X, +) is 
the loop space), and denote by z,(X, *) or 
simply z,(X) the set of arcwise connected 
components of O'(X, x), i.e., the homotopy 
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classes [ f ]. Using the notation of homotopy 
sets, we have z,(X, «) — n(I^, I"; X, «). If we 
choose the constant mapping as the base 
point s of O"(X, +), then O"(O"(X, «), x) = 
Q"*"(X. x), Thus z, (Q'(X,*),») 9 1, (X. *). 
Since n, is the tfundamental group, t, (X, *)— 
n,(Q' !(X), *) is also a group, called the n- 
dimensional homotopy group of X with base 
point «. “Multiplication” in homotopy groups 
is defined as follows: Given f,, f; cQ" (X,*) 
we define f, + f; € O" (X, *) by 

Tit2E8 itu 0 
TESA) Eeer N 
(Fig. 1). Then the product or sum of [ f, ] and 
[f2] is given by [ f; 4- f; ]. The identity is the 
class of the constant mapping (denoted by 0), 
and the inverse of [ f] is [f], represented by 
f(t) 9 f(1—t,,t, ..., tj). The space Q"(X, «) is 
an *H-space, where multiplication is given by 
the correspondence (f1, f;) f, +f. Since the 
fundamental group of an H-space is commuta- 
tive, z,(X,*) is an Abelian group for n 2 2. 





Q 1/2 1 


Fig. 1 


Fig. 2 


Let S*& (t—(f,...,1,44)] X: 12 =1} be the n- 
sphere, and take * —(1,0,...,0) as its base 
point. Suppose that we are given a continuous 
mapping y, :(I", |") 5 (S", «) such that yn: I" — 
[^ — S" — x is homeomorphic. Then the corre- 
spondence y? :z(S"; X)y 5 n,(X,*) determined 
by W*[g]=[g oy, ] is bijective. Thus we can 
identify the homotopy group z,(X, *) with 
a(S"; X)o. 


K. Relative Homotopy Groups 


Suppose that we are given a topological space 
X and a subspace A of X sharing the same 
base point +. Identify I"~' with the face t, = 
0 of I", and let J"^! be the closure of /"— 
I"! (Fig. 2). Denote by z,(X, A, *) the set of 
homotopy classes of continuous mappings 
f: (7, I", 773) (X, A, «). Let Q"(X, A, «) be 
the mapping space consisting of such map- 
pings f, and let z,(X, A, «)  ng(Q'(X, A, «)). 
Since QO"(Q"(X, A, *), «*) is homeomorphic to 
Q"*"(X, A, *), we have z, (Q'(X, A, *), #) = 
Rmin(X,A,*). Thus z,(X, A, *) is a group for 
n2Z 2 and an Abelian group for n z 3. This 
group is called the n-dimensional relative 
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homotopy group of (X, A) with respect to the 
base point *, or simply the n-dimensional 
homotopy group of (X, A). In the same man- 
ner as in Section J multiplication in this group 
can be defined using f, + f2. Since Q"(X, *, *) 
and Q"(X, «) are identical, we have z,(X, *, *) 
—n,(X,*). Hence homotopy groups are spe- 
cial cases of relative homotopy groups. 

Let g:(X, A, «) (Y, B, *) be a continu- 
ous mapping. Then a correspondence g4: 
n,(X, A, *)- n, (Y, B, +) is obtained by g,[ f]= 
[gof], with gẹ a homomorphism of homo- 
topy groups for n>2 and forn=1, A=*. We 
call g, the homomorphism induced by g. Let 
E"={t=(t,,...,t,)| X t2 =1} be the unit n-cell 
with boundary S"~'. Utilizing a suitable rela- 
tive homeomorphism y, :(I", J" !) G(E", «), 
V. (I^) 5 S"^!, we obtain a one-to-one corre- 
spondence y;* :n(E”, S"! : X, Ajo >n, (X, A, *), 
and Q"(X, A, «) is homeomorphic (via y;) to 
the mapping space XF'(S" 1, x; A, x). 


L. Homotopy Exact Sequences 


Given an element «=[ f ]ez,(X, A, *), and 
letting Ga =[ f |I"! Jen, ,(A, *), we obtain 

a homomorphism (n> 2) 0:n,(X, A, *)> 

T, (A, *), which is called the boundary 
homomorphism. Furthermore, we have the 
following exact sequence involving homomor- 
phisms i,, ją induced by two inclusions i:(A, *) 
>(X,*), FG *) (KX, A, * ): 


THE e) on (XS) In (X, A, *) 
2 n, (X,+) 51, (X, A, 8) n ( A) Sn (XO. 


This sequence is called the homotopy exact 
sequence of the pair (X, A). A system of topo- 
logical spaces X > A > Bə» is called a triple. 
In this homotopy exact sequence, if we replace 
(A, *), (X, *) by (A, B, x), (X, B, «), respectively, 
we obtain an exact sequence, called the homo- 
topy exact sequence of the triple (X, A, B). 

The homotopy group z,(A x B) of the prod- 
uct space is isomorphic to the direct sum 
n, (A) 4- 7,(B), and the projections p(p’):A x B 
— A(B) of the product space induce the pro- 
jections from z,{A x B) onto the direct sum- 
mands z,(A), z,,(B). This is a special case of the 
*Hurewicz-Steenrod isomorphism theorem in 
fiber spaces (— 148 Fiber Spaces). Setting 
A v B «(A x x)U(* x B), we obtain a direct 
sum decomposition z,(A v B) zz x, (A) 4- n,(B) 4- 
1,44 (À x B, Av B). Next we consider a fixed 
pair (X, A) and move the base point « to inves- 
tigate its effect on the elements of the homo- 
topy group. Suppose that we are given a path 
h:I—A with terminal point « — (1) and an 
element xez,(X, A,*) (x f ], f 07, 2,0" 4) 
(X, A, x)). By the homotopy extension prop- 
erty, we can construct a homotopy f,:(I", I") 
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(X, A) satisfying LGL 2 h(0) and f, =f. 
Then the homotopy class [ fo] of fo with re- 
spect to the base point «' — h(0) is determined 
only by « and the homotopy class w of the 
path h. We denote the homotopy class [ fọ] by 
a^ en,(X, Aal The correspondence xa? is 
a group isomorphism, and (ott =«°®. Thus 
if A is arcwise connected, z,(X, A, *) is isomor- 
phic to z,(X, A, *’). Hence, in this case, we may 
simply write z,(X, A) instead of z,(X, A, *). 
When * =x’, the correspondence «a? deter- 
mines the action of the group z,(A,*) on 
m,(X, A, *). Given an element xez,(X,*)anda 
class o of paths in X, we define «°e7,(X, x’) 
as for relative homotopy. Specifically, if we 

n4 (X, *), then x? —a coincides with the White- 
head product [@, x] (when n — 1, we have am, 

a '=[w,«4]=waw 'a~')(— Section P). 

A pair (X, A) consisting of a topological 
space X and an arcwise connected subspace A 
of X is said to be n-simple if the operation of 
1, (A) on z,(X, A) is trivial. Similarly, an arc- 
wise connected space X is called n-simple if the 
operation of z,(X) on z,(X) is trivial. For 
example, a pair (X, A) consisting of an H-space 
X and an H-subspace A is simple, i.e., n-simple 
for each n. If a topological space X satisfies 
m(X)=0 (0<i<n), then X is said to be n- 
connected. 0-connectedness coincides with 
arcwise connectedness and 1-connectedness 
means Temple connectedness. S$” is (n — 1)- 
connected. A pair (X, A) is said to be n- 
connected if z,(4) — x9 (X) — x; (X, Alz (1 < 
i x n), and (E", S" !) is (n — 1)-connected. 


M. Homotopy Groups of Triads 


Let (X; A, B, *) be a system, called a triad, 

of a topological space X and its subspaces 

A, B satisfying AM Bə% (base point). Let 
n,(X; A, B, *) 2n, ,(Q'(X, B, Q'(A, ANB), *) 
(nz 2); n,(X; A, B, *) is a group for nz 3 and an 
Abelian group for n z 4. We call SX: A, B, *) 
the homotopy group of the triad. From the 
homotopy exact sequence of the pair, we ob- 
tain the following homotopy exact sequence of 
the triad: 


Sn A, ANB, «) o n|(X, B, x) 
B n (X; A, B, «) 5n, (4, ANB, «)5... 


Assume for simplicity that AN B is simply 
connected, X — Int AUInt B (Int A is the tin- 
terior of A), (A, AN B) is m-connected, and 
(B, AN B) is n-connected. Then (X; A, B) is 
(m+ n)-connected, Le, SE: A, B, ) -0(2« 
j&m-4 n) (Blakers-Massey theorem). 
Furthermore, in this case we have a replica 
of the *excision isomorphism in homology 
theory for j « m - n; that is, we have the iso- 
morphism i,:zj(A, AN B, *)=7,(X, B, *) in- 
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duced by the inclusion i:(A, AN B) (X, B). On 
the other hand, 7,,,,.,(X; A, B, «) is isomor- 
phic to z,,4,(A, ANB, *) 9 2,,4,(B, ANB, «). 
This shows that the excision isomorphism 
does not always hold for homotopy groups, an 
important difference from homology theory. 
However, if we replace the excision axiom by 
the Hurewicz-Steenrod isomorphism theorem, 
which is valid for fiber spaces (— 148 Fiber 
Spaces), then we can construct homotopy 
theory axiomatically in the same manner as 
homology theory (— 201 Homology Theory). 


N. The Hurewicz Isomorphism Theorem 


The Hurewicz homomorphism : of z,(X, A) 
into the n-dimensional integral homology 
group H,(X, A) is defined by x([ f ]) — Lë 
(where e, is a generator of H,(I", I”)). Then we 
have the Hurewicz isomorphism theorem: Sup- 
pose that the pair (X, A) is n-simple (e.g., A 

= x) and (n — 1)-connected. Then we have 
H,(X, A) «0 (i « n) and the isomorphism t: 
n,(X, AJZE H,(X, A) (for n=1 — 170 Funda- 
mental Groups). Let X, Y be simply connected 
topological spaces, and let f: X — Y be a con- 
tinuous mapping. Then the following two con- 
ditions are equivalent: (1) f, : x; (X) z;(Y) is 
injective for i « n and surjective for i x n. (2) 
f: H(X)5 H((Y) is injective for i « n and sur- 
jective for i x n (J. H. C. Whitehead’s theorem). 

J.-P. Serre generalized these theorems as 

follows: A family € of Abelian groups satisfy- 
ing condition (i) is called a class of Abelian 
groups: (1) If a sequence F>G—H of Abelian 
groups is exact and F, He@, then Ge@. Fur- 
thermore, we consider the following condi- 
tions: (ii) The tensor product G © F of an arbi- 
trary Abelian group F with an element Ge € 
also belongs to @. (i) If both F, Ge@, then 

F & G, Tor(F, G)e €. (iii) If Geg, then its 
thomology group H;(G)e€ (i70). Condition 
(ii) is implied by (ii). A homomorphism f: 

F Gis called -injective if Ker fe €, €- 
surjective if Coker f= G/Im f e €, and a €- 
isomorphism if f is @-injective and @- 
surjective. Two Abelian groups G and G' 

are called €-isomorphic if there exist @- 
isomorphisms f: FG and f': F-G'. In 
particular, if the class G consists of only the 
trivial group 0, then concepts such as @- 
isomorphism coincide with the usual concepts 
of isomorphism, and so on. Let @, be the class 
of finite Abelian groups whose orders are 
relatively prime to a fixed prime number p. 
Here, instead of the terms @,-isomorphism and 
so on, we use the terms mod p isomorphism 
and so on. Let Z be a class of finitely gen- 
erated Abelian groups. Then €, satisfies con- 
ditions (i1) and (iii), and 2 satisfies (1i) and (iii). 
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We have the following generalized Hurewicz 
theorem: (A) Suppose that a class @ satisfies 
(i1) and (iii) and we are given a 2-connected 
pair (X, A) of simply connected spaces X, A. 
If xz, (X, A)e € (i « n), then H(X, A)e@, and 

t nQ(X, A)>H,(X, A) is a €-isomorphism. (B) 
Suppose that A = x, € satisfies conditions (ii’) 
and (ii), and X is simply connected. Then an 
assertion similar to (A) holds. In particular, a 
simply connected space X having finitely 
generated homology groups (e.g., a simply 
connected finite polyhedron) has finitely gen- 
erated homotopy groups. As a corollary to 
theorem (A), we obtain a generalized White- 
head theorem. In particular, applying the 
theorem to the class 9N@,, we obtain the 
following frequently used theorem: Suppose 
that we are given simply connected spaces X, 
Y whose homology groups are finitely gen- 
erated and f: X 5 Y satisfies f, n ,(X)— n;(Y). 
Then the following two conditions are equiva- 
lent: (1) f, :r;(X) x, (Y) is a mod p isomor- 
phism for i<n and a mod p surjection for i— n. 
(2) fx: H(X, Z)— HY, Z,) is an isomorphism 
for i « n and a surjection for i=n (where Z, = 
Z/pZ). The theory above, which makes use 
of the notion of class €, is an example of 
Serre's @-theory. Concepts such as tspectral 
sequences for fiber spaces and *n-connective 
fiber spaces are important tools in Serre's €- 
theory (— 148 Fiber Spaces). 

To calculate homotopy groups, we use 
notions such as exact sequences, fiber spaces, 
(co)homology groups of n-connective fiber 
spaces, and tPostnikov systems. Given an 
arbitrary group (more generally, a Postnikov 
system), there exists a CW complex having the 
given group (system) as its homotopy group 
(Postnikov system) (realization theorem of 
homotopy groups). For an arbitrary arcwise 
connected topological space X there exist 
topological spaces (X, al and continuous map- 
pings p,:(X,n4- 1)5(X, n) (n— 1,2, ...) satisfy- 
ing the following two conditions: (i) ((X, n 4- 1), 
Pa (X,n)) is a fiber space whose fiber is an 
tEilenberg-MacLane space. (ii) (X, 1) 2 X, and 
((X,n-- 1), p,0...Op,, X) is an n-connective 
fiber space. The method of obtaining the 
homotopy group z,(X)z H,((X,n)) by com- 
puting (co)homology groups of (X, n) is called 
a killing method. 


O. Homotopy Operations 


Let X, Y, X', Y' be topological spaces. If to 
each continuous mapping fe Y * there corre- 
sponds a homotopy class di f)ez(X' ; Y") that 
is a homotopy invariant of f (satisfying a 
certain naturality condition), then 6 is called 
a homotopy operation. More generally, we 
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may consider the case where is a mapping 
from z(X,; Yi) x ... x n(X,; Y,) into zk Y’). 
The naturality of ® is defined as follows: Con- 
sider the tcategory € of topological spaces (or 
its subcategory). Let Y= Y’ be an arbitrary 
*object of €, and fix X and X'. In this case, 
the naturality of ,:z(X; Y) z(X'; Y) is de- 
fined to be the commutativity of the diagram: 


n(X; rat Y) 
Ju, x Ja, 
n(X;Z)—>n(X’; Z) 


i.e., gą 00, =, 0g, for an arbitrary tmor- 
phism (De, continuous mapping) g: YZ of 
the category @. Similarly, when objects Y, Y’ 
of the category € are fixed and X = X’ is an 
arbitrary object of €, to say that a homotopy 
operation ®,:2(X; Y)z(X; Y?) is natural 
means that h* o, =y o h* for an arbitrary 
morphism h: WX. 

We have the following theorem: In the 
category of topological spaces and continu- 
ous mappings, the homotopy operations 6, : 
n(X; Y) n(X'; Y) and the elements of z(X'; X) 
are in one-to-one correspondence. The corre- 
spondence is obtained by associating a homo- 
topy operation ®(f)= foa (Ben(X; Y)) with 
each aen(X’; X). Similarly, the homotopy 
operations ®,:2(X; Y) z(X; Y") and the ele- 
ments of z(Y; Y^) are in one-to-one corre- 
spondence. This theorem holds also for the 
case involving several variables if we consider 
n(X X, v X,v...) or (Y, x Yo x ...; Y) in- 
stead of n(X'; Y) or x(X; Y^). The theorem 
remains valid if we replace the spaces X, Y by 
systems of spaces. 


P. Homotopy Operations in Homotopy Groups 


(1) If X, X' are spheres S", S? with base points 
and Y, Y' are topological spaces with base 
points, a homotopy operation Ó,:z,(Y) 
ny(Y) is said to be of type (n, p). By the theo- 
rem in Section O, the homotopy operations 
of type (n, p) are in one-to-one correspondence 
with the elements of the homotopy group of 
the sphere z,(S"). 

(2) As an example of the 2-variable homo- 
topy operations ,:z, (Y) x x, (Y) n,(Y) of 
type (m, n; p) we have the Whitehead product 
defined as follows: Suppose that «e z,,(Y), 
fen, (Y) are elements represented by f:(I”, I”) 
—(Y, x) and g:(I", [") 5(Y, x), respectively. 
Define a continuous mapping F from the 
boundary /"*" — (I" x j^) U (I? x I”) of "+" = 
I" x I" into Y by F(x, y)= f(x) for (x, y)e I" x 
I" and F(x, y) - g(y) for (x, y) e" x I". Since 
["*" is homeomorphic to $"*"^!, we can iden- 
tify them. The homotopy class represented by 
F is an element of z,,,, ,(Y) determined by « 
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and f, denoted by [«, f] and called the White- 
head product of « and f (J. H. C. Whitehead, 
Ann. Math., (2) 42 (1941)). The Whitehead 
product is a homotopy operation of type 

(m, n; m - n— 1). Let y, Un. [") S(S", «) bea 
mapping that smashes /" to a point. The pro- 
duct of Yn and Y, defines a mapping y, ,: 
AIR 1,8" y S" (S" x «)U(x x S"). Let ie 

n, (S" v S"), i en,(S" v S") be the homotopy 
classes of the natural inclusions of S$”, S" into 
S" v S”; then the homotopy class of Ym, n is 
[r]. G. W. Whitehead showed that a direct 
sum decomposition z,(S" v S") — rn, (S) + 
ARS") + [57],n, (7*9 (tys Les [517], are 
injective) holds for 1 < p <m +n + min(m, n) — 
3. Furthermore, P. J. Hilton showed that for 
general p» 1, n, (S" v S") is the direct sum of 
the images of injections 1,, t4, [5,1 ],.. [E 

r], ily, TDL, etc. The homotopy 
operations of type (m, n; p) are in one-to-one 
correspondence with the elements of n,(S" v S"); 
hence such operations can be constructed by 
means of composition and the Whitehead 
product. The last proposition is also valid for 
homotopy operations of type (m,, ...,m,; p). 
The Whitehead product [a, f] (xezx,,(X), Be 
1, (X)) is distributive with respect to « (resp. fj) 
for m» 1 (n> 1), and we have [f, x] -(—1)"" 
[x BJ and fo, f] Un, fab] for f:X Y. 
Moreover, for yez,(X) the Jacobi identity 
holds: (—1)"[[o, B1, y] +(—1)™ CLA, y]. 1 
(—1)"[Dy, «], B] - 0 (M. Nakaoka and H. Toda; 
H. Uehara and W. S. Massey; Hilton). 


Q. Suspensions and Generalized Hopf 
Invariants 


We denote by «^ Ben(X ^ X; YA Y), the class 
of the reduced join of f, g, where f represents 
xe€n(X; Y), and g represents f'en(X'; Y')o. 
We call x^ fi the reduced join of « and £. In 
particular, if Y= Y' 2 $!, fi is the identity 
mapping of S', and « is represented by f, then 
X ^ fi is called the suspension of « and is de- 
noted by Sa. Sa is the class of the suspension 
Sf of f and belongs to z(SX; SY),, where SX 
indicates the reduced suspension of X. The 
suspension Sa is often denoted by Ea in ref- 
erence to the German term Einhángung. The 
identity mapping 1 of SY gives rise to an in- 
jection i— Q,1 sending Y into the loop space 
(S Y) determined by the formula i(y)(t)=(y, t). 
Then we have 


i, —Qgy 0S:n(X; Y) n(SX; SY), 
Sn(X; OS Y), 


and S and i, are equivalent. Let Y, be the 
identifying space Y*/~, where Y* is the prod- 
uct space Y x ... x Y of k copies of Y and — 
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is the equivalence relation determined by 


(S Wis, Mäe Yk) ~ (Y1 Mäe Yk) ~- 


~ (Yis e Di, El 


Denote by Y,, = | J, Y, the limit space with 
respect to the injection Y, , — Y, given by 

(Yis o Yk- (Y1; -> Vy i *) and call it the 
reduced product space of Y. Let Y bea CW 
complex of 0-tsection «. The mapping i: Y= 
Y, 9QSYcan then be extended to i: Y, 
OSY, where i is a weak homotopy equiva- 
lence. If X is also a CW complex, then Q5! o 
i,:n(X; Y,,)o n(SX; SY), is bijective. By 
smashing the subset Y of Y), we have YA Y= 
Y;/Y. This smashing mapping can be extended 
toh: Y, (Y^ Y), (I. M. James). Utilizing 

hb, LE Y,)g n(X; (Y^ Y),)9 and the bijec- 
tion Qo! oi,, we obtain a correspondence 
H:n(SX; SY), n(SX; S(Y^ Y))o. We call H(a) 
the generalized Hopf invariant of x. When X — 
S?"? y-S"'. H is equivalent to the Hopf 
invariant y:z5,.,($") 5Z (— Section U). In 
general, we have H o S —0, and the exactness 
of A 5 holds under various conditions. 
Denote also by o the composition of homo- 
topy classes; then we have S(xo fj) 2 Sxo Sf 
and H(xoSp)- HaoSf. Also, H(Saof)— 

S(x ^a)o HB. Under the condition i « 3n — 3, 
we have (a, +a,)0oB=a,08+4,08+[a,, 

x] o H(fi) for «,, «;en,(X) and fien;(S") (G. 
W. Whitehead). Thus the composition «o fi is 
not always left distributive but is always right 
distributive, and ao f is left distributive if B= 
Sf’. The composition is defined over the stable 
homotopy groups G, of spheres (— Section U): 
ao BeG,,, (xe G,, PEG). It is distributive and 
satisfies f/oa —(—1)"* ao f. 

When Y and Y' are Eilenberg-MacLane 
spaces, Bo, and By, we have tcohomology 
operations on cohomology groups H"( ; I), 
KO groups, and K groups, respectively. As 
typical examples there are tSteenrod square 
operations Sq':H"(X; Z;) ^ H" (X; Z,), *Steen- 
rod pth power operations ': H"(X;Z,)—> 
H'^* "D(X: Z, ), *Chern characters ch": K(X) 
— H?'(X; Q) (Q:rational field), 'Adams opera- 
tions 6;: KO(X) KO(X) (K(X)5 K(X)). They 
are all homomorphisms (— 64 Cohomology 
Operations; 237 K-Theory). 


R. Secondary Compositions 


Suppose that xo f/ 20, Boy=0 for yex(W; 
X)o, Ben(X; Y)o, xen(Y; Z). In the com- 
mutative diagram of Puppe exact sequences 


D MSW; Yh EnC; Y) at: Yo 
——— o 


— (SW; Zaart Z)  x(X; 2,5 
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the set of elements fj in x(SW; Z) such that 
p*(B)ea,i* * (B) is denoted by (a, f, y) and 

is called a secondary composition or Toda 
bracket. If 0, y are elements of z(SW; Y)o, 
m(SX;Z)o, respectively, then we have {a, fl, y! 
--a,0— (a, B, y), (o, B, y] + Sy*n = (os B, y]. 
Hence we may consider the set (a, fj, y) to be a 
residue class modulo a submodule generated 
by a, z(SW; Y), and Sy*z(SX; Z)o- 

The secondary composition (a, $, y} has the 
following properties: (i) {«, f, y} is linear with 
respect to a, f, y (if the sum is defined); (ii) 
xo {B,y, 0} — (o, B. y} o(— Sô); (iii) S{«, B, y) = 
— (8o, SB, Sy}; (iv) ao (B, y, 6} & {ao f, y, ô}, 
{ao B,y,6}={a, Boy, d},...5(v) (o, B. y). 

Sò, Se} + (o, {B, y, ô}, Se} + (o, B, {y, 6, eh} = 

0. Suppose that the spaces X, Y, Z, W are 
spheres. Then by (iit) the secondary composi- 
tion fa, p, y} € Gp eger] (t O Ggtr+1 t yo6uui) 
is defined in the stable homotopy groups G,= 
lim, ,,, Zn+r(S") of spheres. From this we obtain 
(vi) (y, f, a] ^ (— 1n"! (a, B, y} and (vii) 
(—1 (a, B. 7] C79 (8,5, (1 (9,2. f] 
=(). 


S. Functional Operations 


Let ® be an operation corresponding to « and 
y be the class of f. We put ®,(2)={a, f, y) and 
call ®, a functional ®-operation. When ® is a 
cohomology operation, 6, is called a func- 
tional cohomology operation. Then ®,() is 
defined for fi satisfying f *(f) 2 (fi) 2 0, and 
$,(f) is determined modulo Im Sf * tr Im d, 
For f:$"** 5$" k —2i(p—1)— 1, we denote by 
H (f) ën e" Herl Z) the image 
of a generator e, of H"(S"; Z,) under the 
functional A; operation. Then the Hopf in- 
variant modulo p (or mod p Hopf invariant) 
H,:n,44(S") Z, is obtained (we use Sq”! for p 
— 2). The following statements are equivalent: 
(i) The mod2 Hopf invariant is not trivial 

(H, #0); (ii) there exists a mapping: S?**1 > 
S**! of Hopf invariant 1; (iti) S* is an H-space; 
(iv) the Whitehead product [1,7] of a genera- 
tor z of z,(S*) vanishes. Also, H, #0 if and 
only if k=2, 4, 8 (J. Adams), and for an odd 
prime p, H, £0 if and only if k=2p—3 (A. L. 
Liulevicius; N. Shimada and T. Yamanoshita). 


T. Stable Homotopy Groups and Spectra 


The homotopy set z(S" X, S"Y), for n-fold 
iterated suspensions $"X = X AS"—SS"!X 
and S"Y, forms a group (an Abelian group) if 
nz]l(nz2) The limit z,(X; Y)—-limz(S"X; 
$"Y), with respect to the suspension homo- 
morphisms S:z(S"X; S"Y),n(S"*! X; S"*! Y), 
is called a stable homotopy group of X and 
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Y. For an r-connected space Y and a CW- 
complex X, S:2(X; Y)y n(SX; SY), is bijec- 
tive if dim X «2r and surjective if dim X < 

2r 4- 1 (generalized suspension theorem). Thus, 
if X 1s a finite-dimensional CW-complex, 
m°(X, Y) is isomorphic to x(S"X, S"Y), for suf- 
ficiently large n. To discuss stable homotopy 
groups more generally, the following concept 
of spectra is used. A system E = {E}, ¢,} which 
consists of CW-complexes E, and continuous 
mappings Cu AE, E,,, is called a spectrum. 
When E,—$* and ës 1,4,:SS*>S**!, S= 
{S*, 1,,,] is called a sphere spectrum. When 
E, = K(G, k) (Eilenberg-MacLane complex) 
and e, induces a homotopy equivalence K(G, 
k) x OK (G, k +1), HG={K(G,k), &,] is called 
an Eilenberg-MacLane spectrum. As in the 
latter, a spectrum E in which e, induces a 
homotopy equivalence E, — OF, ,, is called an 
Q-spectrum. By Bott's periodicity, O-spectra 
KU={Z~xB,,U,Z x B,,U,...} and KO = 

{Z x B,, U/O, Sp/U, Sp, Z x Bsp U/Sp, SO/U, 
O, Z x B, ... } are obtained (— Section V). 
Also, using "Thom complexes, Thom spectra 
MU, MO, etc. are obtained. Given a spectrum 
E, by putting E"(X, 4) — lim 2(S*(X/A); Ex41)o 
for each pair (X, 4) of CW-complexes, we 
obtain a generalized cohomology theory with 
E-coefficient; and by putting E,(X, A)= 

lim z,(E, , ^(X/A)), we obtain a generalized 
homology theory with E-coefficient. *Gener- 
alized (co)homology theory on (finite) CW- 
complexes can be represented by a suitable 
spectrum (G. W. Whitehead, E. H. Brown, 
Adams). Corresponding to E=S, HG, KU, 
MU, etc., we have stable (co)homotopy 
groups, G-coefficient (co)homology groups, 
K-groups, '(co)bordism groups, etc., respec- 
tively (— 201 Homology Theory). 


U. Homotopy Groups of Spheres 


The spheres S” and their homotopy groups are 
basic objects in homotopy theory. Although 
much research has been done concerning these 
objects, there are still open problems. 

S" is (n — 1)-connected: z;(S") 20 (i « n). The 
fact that z,(S") zz Z (infinite cyclic group) was 
obtained from the Brouwer mapping theorem. 
Also, 1;($!) 2 0 (i> 1) follows from the fact that 
the tuniversal covering space of S! is contrac- 
tible. Suppose that we are given a continuous 
mapping f:$S?" ! 2 S". We approximate it by a 
tsimplicial mapping ọ. Then the inverse image 
9 !(x)ofa point x in the interior of an n- 
simplex of S" is an (n — 1)-dimensional *pseudo- 
manifold which is orientable by means of 
a suitable generator ce H, ,(o !(*)). The 
boundary isomorphism 0: H,(S?" 1, p^!(x))z 
H. to ! (*)) and the homomorphism o, : 
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H,(S?" 9 !(x))9 H,(S", x) give rise to an in- 
teger y(q) determined by the relation pô (s) 
— y(q)e, (e, is an orientation of S"). This inte- 
ger ts independent of the choice of o. so we 
can set y( f) — y(q). Then f ~g implies that 

y (f) 5 y(g). We call y( f) the Hopf invariant of 
CH. Hopf defined y and showed y:z4(S?)z Z 
(1931); (15, ,(5") 20 for odd n; y(1,, ,(8"))2 
2Z for even n; and y(1,,.,(8")) 2 Z for n=4, 8 
(1935). H. Freudenthal defined a homomor- 
phism E:7,(S")>7;,,(S""'), EL£] - [Sf], and 
proved the Freudenthal theorem: (1) E is an 
isomorphism for i « 2n — 1; (2) E is a surjection 
for i=2n—1; and (3) the image of E coincides 
with the kernel of y for i—2n. Furthermore he 
obtained z,,,(S") zz Z, (nz 3) (1937). For n= 
2, 4, 8, a mapping (SIS ! 2 S" (THopf map- 
ping) such that y(f)= 1 (given by Hopf) is the 
projection of a tfiber bundle S?"~! over the 
base space $", and the correspondence (a, f) 
Ea + ff gives an isomorphism z; ,($" ^!) 4- 
1,(S?"^!) (direct sum) = x;($"). Hence we ob- 
tain z4(S?) 2 Z,. It was shown by G. W. 
Whitehead and L. S. Pontryagin that z,. ,(S") 
(n> 3) is isomorphic to Z, (1949). Whitehead 
also defined a generalized Hopf homomorphism 
H :n((S")  n,(S?" !) for a range of i< 3n— 3, 
and this restriction on the dimension was 
removed by P. J. Hilton and I. M. James. 
Using H, many nontrivial results concerning 
1; (S") have been obtained. Serre obtained the 
following (1951—1953): n;(S”) is finite except 
when i=n or i=4m—1 and n=2m. Further- 
more, z4,, 4(S?") is the direct sum of Z and 

a finite group. Let p be an odd prime and 

n be even. Then 7,(S") is €,-isomorphic to 

1; 4(S" !) - n,(S?" !). Let n be odd. Then 

x, 4 (S")e €, (k « 2p —3), and 7,,,, ,(5") is 6,- 
isomorphic to Z,. Serre and H. Toda deter- 
mined z,,,,(S") for k= 3, 4, 5, and Serre further 
determined it for k — 6, 7, 8. Utilizing the re- 
duced product space of S", James gave the 
sequence 


Ee (S) LSC 
>n- (S>... 


and showed that it is an exact sequence 1f n is 
odd and an exact sequence mod 2 if n is even 
(1953). Using this exact sequence and the 
secondary composition, Toda determined 
T,+4(9") for k< 19 (— Appendix A, Table 6.VI). 
By the Freudenthal theorem (1), the 7,,,(S") 
(n2 k 4- 1) for a fixed k are isomorphic to each 
other. We call z,,,(S") (n> k + 1) the stable 
homotopy group of the k-stem of the sphere 
and denote it by G,. For Kk-0,1,2,...,15,..., 
G, 2 Z, Zz, Z2, 454,00, Z2, Z240; Z2 + Z5, 
Z+ Za + Z3, Le, Z504, 0, Z3, Z5 + Z2, Lago + 
Z3, .... For the computation of G,, the notion 
of n-connective fiber spaces is important. By 
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utilizing the Adams spectral sequence, we can 
show that G, is closely related to the cohomol- 
ogy of the *Steenrod algebra. Let p be an odd 
prime. There exist the following sequences 

of elements of order p: {a,€G, (k—2i(p — 1) — 
1. {Bre G, (k=2(ip +i— 1)(p—1)—2)} and for 
p»3(y;e G, (k= 2(ip? +(i— Ip +i—2)(p—1)- 
3)}. The p-component of G, is determined for 
k «2p?(p— 1)—3 by using Steenrod algebra. 
To compute G, for higher k, relations such as 
a, BP =0, B, p? =0 (i> 1) are necessary. In gen- 
eral, each element of G; (k #0) is nilpotent 

(G. Nishida). Let 2,(S": p) be the p-component 
of 2,(S"). To survey this group for the nonstable 
case (iz 2n — 1), we utilize Serre's mod p direct 
sum decomposition (for n even), and we have 
the following two exact sequences for the case 
of odd n: 


n4 S7) 5n, (7*2) n (Q2 (S"*2), SS... 
om (SPP : p) n (SP+ :p) 
—n(Q^(S"*?), S": p) n;, (S"*»*1:p)5..., 


where E? 2 Eo E and AE?(a)= px (— Ap- 
pendix A, Table 6.VI). 


V. Homotopy Groups of Classical Groups 


Consider the classical group U (n, A), which is 
either the orthogonal group O(n) (A =R); the 
unitary group U (n) (A =C); or the symplectic 
group Sp(n) (A 2 H). The infinite classical 
group U (%, A) is defined to be the inductive 
limit group of (U(n, A)| n— 1,2, ... ] with re- 
spect to the natural injection U (n, A) c U (n 
1, A). We call U(oo, A) the infinite orthogonal 
group, infinite unitary group, and infinite sym- 
plectic group for A =R, C, and H, respectively. 
The dimensions of the cells of U(oo, A) — 
U (n, A) are Z A(n- 1) — 1, where 4—dimg A 
(=1(A=R), z2(A-C) =4(A=H)). It 
follows that z,(U (n, A)) ts isomorphic to 
n, (U(oo, A)) for k « A(n+ 1)—2, which is called 
the kth stable homotopy group of the classical 
group. Let O = U(oo, R), U = U(o», C, Sp= 
U (oo, H). The homotopy groups of the clas- 
sical groups are periodic (k > 0): 
z,(U)zn,,,(U)u Z, k odd, 

~0, k even, 
1,(0) = n, (Sp) = n4. ell 

=Z, | kz3,7(mod8) 

~Z,, k=0,1(mod8), 
=0, k #0, 1,3, 7 (mod 8). 


This is called the Bott periodicity theorem. The 
relations are deduced from weak homotopy 
equivalences US O(By), By x Z7Q(U), Box Z 
2 Q(U/O), U/O 5O(Sp/U), Sp/U > O(Sp), Sp 


203 A 
Hopf Algebras 


2Q(Bs,, Bs, x Z>Q(U/Sp), U/Sp>Q(O/U), 
OU ^ O(O). This result is applied to non- 
stable cases; for example, z;,(U (n)) is a cyclic 
group of order n! (— Appendix A, Table 6.VI). 
The 2-dimensional homotopy group z,(G) of 
any Lie group is trivial. 

Let xez,(O(n)), where a=[ f ], f:S*— O(n). 
We define f: S* x S"! >S"! by f(x, y) f(x): y 
and identify S**" with the boundary (E**! x 
an *JU(S* x E") of E**! x E". We extend f to 
f:S**" S" so that it maps E**! x S" !, S* x 
E" into the upper and lower hemisphere of 
S"(S" ! =the equator), respectively. Let J(a)e 
75, AIST) be the class of the mapping thus ob- 
tained. This homomorphism J : z,(O(n))^ 
1, 44S") is called a J-homomorphism of Hopf 
and Whitehead. For the stable case, J:z,(O)— 
G, is injective for k 20, 1 (mod 8), and the 
order of the image of J is the denominator 
of B,,/4t (Ba, is a ‘Bernoulli number) or its 
double for k= 4t — 1 (Adams). 
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A. General Remarks 


The concept of Hopf algebras arose from two 
directions. First in the field of algebraic topol- 
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ogy the notion of Hopf algebras arose from 
the study of homology and cohomology of Lie 
groups or, more generally, H-spaces. It was 
introduced by H. Hopf [1], whose basic struc- 
ture theorem was generalized and applied to 
several problems by A. Borel [2]. These Hopf 
algebras are graded as algebras and coalge- 
bras, and they are now used as standard tools 
in algebraic topology. On the other hand, 
Hopf algebras without grading were studied in 
connection with affine algebraic groups and 
formal groups. The study of nongraded Hopf 
algebras as an algebraic system was initiated 
by M. E. Sweedler [3], and many results on 
Hopf algebras of this type have been applied, 
not only to the theory of algebraic groups but 
also to the Galois theory of field extension and 
to combinatorial theory. 

Although these two types of Hopf algebras 
have similar structures, and the same termi- 
nology is used to describe their properties, 
they are somewhat different from each other. 
So to avoid confusion in this article, we dis- - 
tinguish between graded Hopf algebras and 
Hopf algebras. 


B. Graded Algebras 


A !graded module A= 2,.94, over a field k is 
said to be of finite type when each A, is finite- 
dimensional, A is connected when an isomor- 
phism 5: kz A, is given. The *tensor product 
of two graded modules A and B is a graded 
module with A @ B—X,(A ® B),, (A @ B),— 
Ze B,. ,. We call A* = È Až (where Až 

is the *dual module of A,) the dual graded 
module of A. When A and B are of finite type, 
A G9 B and A* are also of finite type, and we 
have (A ® B)* = A* © B* and A** = A. When 
A and B are connected, A* and A (9 B are also 
connected. 

Let A be a graded module. If there exists a 
degree-preserving linear mapping 9: A @ A> 
A, we call (A, o) a graded algebra, whereas if 
there exists a degree-preserving linear mapping 
V:A—4A Q A, we call (A, y) a graded coalgebra. 
We call o a multiplication, and y a comultipli- 
cation (or diagonal mapping). Usually we write 
ola & b) - ab (the product of a, be A), and call 
(a) the coproduct of a. Multiplication and 
comultiplication are dual operations. If A is of 
finite type and (A, ọ} is a graded algebra, then 
(A*, o*) (where of is the dual mapping of 9) is 
a graded coalgebra, and vice versa. A multi- 
plication q is called associative (commuta- 
tive) if p(1 © 9) — mie 1) (po T= 9), where 
T:AGQ A>AQ Ais the mapping defined by 
T(a & b) -(—1)" b & a for ae A, and be A,. 
Associativity and commutativity of a comulti- 
plication are defined dually. Let (A, y) be a 
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graded connected coalgebra of finite type, 

and identify k with A, via n. Then the graded 
algebra (A*, w*) has the unity of k as unity if 
and only if y satisfies v(1)21 69 1 and (x)= 
1@x+x@1+X;x; 8x; (0«degx;«degx) 
for deg x >0. In this case we say that y has the 
unity of k as counity. For graded algebras 

(A, o) and (B, ¢’), if 9" =(9 & o')o(1& TG 1): 
AG) BG AG B—A G B,then (A @ B, 9") is 
also a graded algebra, which we denote by 

(A @ B, 9") 2 (A, 9) G9 (B, q^). The tensor prod- 
uct of graded coalgebras is defined as the 
dual notion of (A, o) © (B, oi 


C. Graded Hopf Algebras 


For simplicity we assume that graded mod- 
ules are defined over a field, connected, and 
of finite type. Let a graded module A be 

equipped with a multiplication q and a co- 


"multiplication y. If o and y have the unity 


of k as unity and y:(4, 9) (A, 9) & (A, o) is an 
algebra homomorphism, then we call (A, o, v) 
a graded Hopf algebra. The last condition for a 
graded Hopf algebra is satisfied if and only if 
9:(A,U) G9 (A, V) (A, v) is a homomorphism 
of graded coalgebras. The dual (A*, w*, *) is 
also a graded Hopf algebra, called the dual 
Hopf algebra of (A, o, y). 


D. H-Spaces 


Let X be a topological space. The tcoho- 
mology group H *(X) (homology group 

H, (X)) considered over a field k has a multi- 
plication d* (comultiplication d,), which is 
induced by the diagonal mapping d: X >X x X 
and becomes a commutative and associative 
graded algebra (coalgebra). The groups H*(X) 
and H, (X) are dual to each other (— 201 
Homology Theory I, J). When X is equip- 
ped with a base point x, and a base point- 
preserving continuous mapping h: X x Xo X 
such that ho~ 1, (thomotopic) for i=1 and 
2 (where i, (x) 2 (x, x9) and i5 (x) 2 (xo, x)), we 
call (X, h) an H-space, h a multiplication, and 
Xo a homotopy identity of X. Then h induces, 
through a *Künneth isomorphism, a co- 
multiplication A*: H*(X)— H*(X) 69 H*(X) 
(Hopf comultiplication) and a multiplication 
h: H,(X) @ H,(X)5 H, (X) (Pontryagin 
multiplication). Then h*(x) (xe H*(X)) is 
called the Hopf coproduct of o, and ^, (f © y) 
(B, y € H,(X)) is called the Pontryagin product 
of f and y. When X is tarcwise connected and 
H,(X) is of finite type, (H*(X), d*, h*) and 
(H,(X), hy, d,) are graded Hopf algebras dual 
to each other. In particular, when h is homot- 
opy associative, i.e., ho(h x 1) e ho(1y xh) 
(homotopy commutative, i.e., h~ ho T, where 
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Ten, x3) 9 (x5, x4) for x;e X), then h* and hy 
are associative (commutative). *Topological 
groups and ‘loop spaces are homotopy as- 
sociative H-spaces. If a continuous mapping 
g: X — X satisfies ho(1y xg) zho(g x ly)=c 
(constant mapping X 5 (xo), then g is called a 
homotopy inverse for X, h. 

Suppose that a graded Hopf algebra A is 
defined over a field k of characteristic p and 
equipped with associative and commutative 
multiplication, and A is generated by a single 
element ae A,. Then A is a tpolynomial ring 
k[a] (n is even when p Z2) or a tquotient ring 
k[a]/(a?) (n is odd when p 2) or k[a]/(a"') 
(only when p40; n is even when p #2). These 
are called elementary Hopf algebras. Every 
graded Hopf algebra over a tperfect field k 
with associative and commutative multiplica- 
tion is isomorphic (as a graded algebra) to a 
tensor product of elementary Hopf algebras 
(Borel’s theorem) [2]. In particular, the coho- 
mology algebra over a field of characteristic 0 
of a *compact connected Lie group is isomor- 
phic to a *Grassmann algebra generated by 
elements of odd degrees [1]. 


E. Steenrod Algebras 


The 'Steenrod algebra s, over Z, is generated 
by 'Steenrod operations Sq! (p=2), 2'(p»2), 
and the *Bockstein operation A, (p> 2), with 
composition of operations defined as multi- 
plication. Then a. is a connected associative 
graded algebra of finite type (not commuta- 
tive). Defining a comultiplication y of ./, by 
V(Sq") =E Sa Q Sq", PNE P OP", 
and W(A,)=1@A,+A, @ L, «7, becomes a 
graded Hopf algebra with an associative and 
commutative comultiplication. Thus its dual 
7 is a graded Hopf algebra with an associa- 
tive and commutative multiplication, and we 
can apply Borel's theorem to «77 in order to 
investigate the structure of A, [4]. 

Let (A, p, y) be a graded Hopf algebra 
with associative multiplication and comulti- 
plication. Putting c(1)2 1 and c(a) 2 —a— 
ÈZ a; c(a;) for dega >0 (where (a) 169 a4 
aG) 1-- Xa; aj), we obtain a linear map- 
ping c: AA satisfying co — o(c Q c) T. We 
call c the conjugation mapping of A. When the 
multiplication or comultiplication is commuta- 
tive, we obtain the relation c? 2 1, and c is a 
bijection. The conjugation mapping is utilized 
in studying Steenrod algebras [4, 5]. 


F. Coalgebras 


Now we turn to nongraded cases. Let A bea 
vector space over a field k, and let o: A 99,4 
A and »:k—A be linear mappings. Then the 
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triple (A, u,n) is said to be an algebra over k if 
uo(u& 1,)—uo(1,G yp) and no(1,G 1)— 
uo(n @ 1,)—1,, where 1,1s the identity 
mapping of A, and A & ,k and k @,A are 
identified naturally with A. We call u the multi- 
plication and 7 the unit mapping of the alge- 
bra. Dually a triple (C, A, e) with C a vector 
space over k, linear mappings A: C5 C @,C, 
and e: Ck is said to be a coalgebra over k if 
(1c € A)oA (AQ 1,)oA and (lc 699 2) oA — 
(e@ 1-)oA 2 1c. We call A the comultiplica- 
tion or the diagonal mapping and e the aug- 
mentation or the counit of the coalgebra. 

An algebra (A, yz, n) is called commutative 
if uo T= pu, where T is the twist mapping 
aG be» bG a. A cocommutative coalgebra is 
defined dually. Definitions of the tensor prod- 
uct of two algebras or coalgebras are similar 
to the definitions in the graded case. If D is a 
subspace of a coalgebra (C, A, c) over k satisfy- 
ing A(D) c D & ,D, then (D, A] p, Elp) is a co- 
algebra and is said to be a subcoalgebra of C. 
A subspace I of a coalgebra (C, A, £) over k is 
called a coideal of C if A(I) c C 69,1 +I 69,C 
and e(1) 2 Then the quotient space C/I has a 
coalgebra structure induced naturally from 
(C, A, €) and is said to be a quotient coalgebra 
of C. The intersection and the sum of sub- 
coalgebras of C are again subcoalgebras of 
C. If S is a subset of C, then the intersection 
of all subcoalgebras containing S is said to 
be the subcoalgebra generated by S. The 
subcoalgebra generated by any finite set or 
finite-dimensional subspace of C is finite- 
dimensional. 

If (C, A, £) is a coalgebra over k, then the 
dual space C* of C has an algebra structure 
over k with multiplication u and unit mapping 
n defined naturally from the tdual mappings of 
A and e respectively. (C*, u, n) is called the dual 
algebra of (C, A, e). Suppose that (A, p, n) is an 
algebra over k, and let A? be the subset of the 
dual space A* of A consisting of elements f 
whose kernel contains an ideal 7 such that A/I 
is finite-dimensional. Then (A5, A, £) is a co- 
algebra over k, where A and e are the linear 
mappings induced from the dual ones of u and 
5, respectively. (A°, A, c) is called the dual co- 
algebra of (A, u,n). The functors ( )* and ( )? 
are adjoint to one another in the sense that 
there is a natural bijective correspondence 
between the set of algebra homomorphisms 
of A to C* and that of coalgebra homomor- 
phisms of C to A? for any coalgebra C and 
algebra A, where coalgebra homomorphisms 
are defined as the dual notion of falgebra 
homomorphisms. 

A nonzero subcoalgebra D of a coalgebra C 
is called simple if D has no nonzero proper 
subcoalgebras, and the sum of all simple sub- 
coalgebras of C is called the coradical of C. If 
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C coincides with its coradical, then C is said to 
be cosemisimple. If C has only one simple 
subcoalgebra, then C is called irreducible. C is 
called pointed if all simple subcoalgebras of C 
are 1-dimensional. An element g of a coalgebra 
(C, ^, £) is called grouplike if A(g)=g ® g and 
£(g) — 1. The set G(C) of grouplike elements in 
C is linearly independent over k. 


G. Bialgebras 


A system (H, ug. A, c) with an algebra structure 
(H, u,n) and a coalgebra structure (H, A, e) is 
said to be a bialgebra over k if A and e are 
algebra homomorphisms. This last condition 
is equivalent to saying that u and y are coalge- 
bra homomorphisms. If K is a subspace of a 
bialgebra (H, u,n, A, c) which is simultaneously 
a subalgebra and a subcoalgebra of H, then 
we call K a subbialgebra of H. An ideal J of 
(H, u,n) which is also a coideal of (H, A, £) is 
called a biideal of H and the quotient space 
H/I has a bialgebra structure which is said to 
be a quotient bialgebra of H. A linear mapping 
between bialgebras is a bialgebra homomor- 
phism if it is simultaneously an algebra homo- 
morphism and a coalgebra homomorphism. A 
bialgebra (H, u,n, A, £) is called commutative 
(cocommutative) if (H, u, n) ((H, A, ell is com- 
mutative (cocommutative). 


Examples. Let kG be the vector space with a 
set G as free basis over a field k. If we define 
linear mappings A: kG— G Q ,kG by A(x)= 

x & x and e:kG—k by e(x)- 1 for x in G, 

then (kG, A, £) is a cocommutative coalge- 

bra over k such that the set G(kG) of group- 
like elements is equal to G. Moreover if G 

is a tsemigroup with unit element, then 

(kG, u,n, ^, £) is a cocommutative bialgebra 
over k, where u(n) is the multiplication (unit 
mapping) of the tsemigroup algebra kG. This 
bialgebra is called a semigroup bialgebra over 
k. If Lis a tLie algebra over k and U(L) the 
‘universal enveloping algebra of L with multi- 
plication p and unit mapping y, then Lie alge- 
bra homomorphisms xx Q x (L LG L) 
and x0 (L— (0]) induce algebra homomor- 
phisms A: U(L) U(L@ L)= U(L) & U(L) and 
e: U(L)k, respectively. Then (U (L), 4,1, A, £) 
is a cocommutative bialgebra over k, called the 
universal enyeloping bialgebra of L. 


H. Hopf Algebras 


Let (A, p,n) be an algebra over k and (C, A, £) 
a coalgebra over k. If f and g are in R= 
Hom,(C, A), then f*g=po(f & g)o^ is 
called the convolution of f and g. Defining 
K:RO,R>R and jg: koR by u(f g)—f*g 
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and s'(x) — anos, then (R, x,y’) is an algebra 
over k. If H is a bialgebra over k with underly- 
ing coalgebra H^ and algebra H^, then Ry = 
Hom,(H*, H^) is an algebra over k in the 
same manner as above. If the identity mapping 
ly of H has the inverse S in Ry under the 
multiplication yp’, then H is said to be a Hopf 
algebra over k with antipode S. Then S satisfies 
the following: S(gh) 2 S(h)S(g) for g, h in H, 
Sog-mn,zoS-eand To(S @S)oA=AoS, 
where T is the twist mapping a & bb G a. If 
H is commutative or cocommutative, then 
SoS=1,. 

If H’ is another Hopf algebra over k with 
antipode S’, then a bialgebra homomorphism 
f of H to H' such that S'f — fS is called a Hopf 
algebra homomorphism. If H and H' are both 
commutative (cocommutative), then any bi- 
algebra homomorphism of H to H' is a Hopf 
algebra homomorphism. The *category whose 
objects are commutative and cocommutative 
Hopf algebras over a field k and whose mor- 
phisms are Hopf algebra homomorphisms is 
an tAbelian category (A. Grothendieck). If H is 
a commutative Hopf algebra over a field of 
characteristic zero, then the underlying algebra 
H^ has no fnilpotent elements (P. Cartier). 

If G is a *group, then the group bialgebra 
kG given above has an antipode S defined 
by S(x) 2 x^! for x in G, and hence kG isa 
cocommutative Hopf algebra over k. Another 
example of Hopf algebras is the tcoordinate 
ring of an talgebraic group defined over k. 
More generally, let X =Spec(A) be an taffine 
group scheme over k. Then algebra homomor- 
phisms A: 42 A4 9,4, 6: A—k, and $: A A 
are naturally induced from the group structure 
of X, and (A, 1,53, ^, £, S) is a commutative 
Hopf algebra over k, where o and q are the 
multiplication and unit mapping of A, respec- 
tively. Conversely, if (A, u, n, A, e, $) is a com- 
mutative Hopf algebra over k, then X — 
Spec(A) is an affine group scheme over k 
with the group structure induced from A, e, 
and S. Hence a commutative Hopf algebra 
over k is nothing but a cogroup object of the 
category of commutative algebras over k (i.e., 
a tgroup object of the tdual category). Dually 
a cocommutative Hopf algebra over k is 
nothing but a group object of the category of 
cocommutative coalgebras over k. 


I. Hyperalgebras 


If (C, A, £) is a pointed irreducible coalgebra 
over k, then C contains a unique grouplike 
element g, and kg is the unique simple sub- 
coalgebra of C. An element a of C satisfying 
A(a)=a ®g+g® ais called primitive. The set 
P(C) of primitive elements in C is a vector 
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subspace of C. A cocommutative coalgebra C 
is called colocal if C is irreducible, i.e., if the 
dual algebra C* of C is a *quasilocal ring. The 
*dimension of P(C) of a colocal pointed coalge- 
bra C is finite if and only if the dual algebra 
C* of C is a tNoetherian complete local ring. 
A bialgebra (H, p,n, A, £) over k is said to be a 
hyperalgebra over k if the underlying coalgebra 
is colocal. Then the unique simple subcoalge- 
bra (grouplike element) of H is 4(k) (y(1)), and 
P(H) has a Lie algebra structure defined by 
[x, y] 2 xy — yx for x, y in P(A). 

The universal enveloping bialgebra U(L) of 
a Lie algebra L over k is a hyperalgebra over k 
such that the set P(U(L)) of primitive elements 
in U(L) is equal to L. Conversely, if the char- 
acteristic of k is zero, any hyperalgebra H over 
k is isomorphic to the universal enveloping 
algebra U(P(H)) of the Lie algebra P(H). But 
in positive-characteristic cases U(P(H)) is 
generally a proper subbialgebra of H. Another 
important example of hyperalgebras is the 
dual coalgebra hy(X)= A? of the tstalk A at 
the neutral point of the tstructure sheaf of an 
talgebraic group scheme X over k. In addition, 
hy(X) has an algebra structure defined from 
the group structure of X and is a hyperalgebra 
over k such that P(hy(X)) is equal to the Lie 
algebra L(X) of X. Although L(X) plays an 
important role in the infinitesimal theory of 
algebraic groups over a field of characteristic 
zero, it does not give any information on in- 
finitesimals of orders higher than p in the case 
of positive characteristic p. In the case of char- 
acteristic zero we see hy(X)= U(L(X)) and 
L(X)= P(hy(X)), and so hy(X) is a natural 
substitute for L(X) in positive-characteristic 
cases. From this viewpoint many interesting 
results on hy(X) of an algebraic group scheme 
X which are parallel to those on L(X) in the 
case of characteristic zero have been obtained 


[6-8]. 


References 


[1] H. Hopf, Über die Topologie der 
Gruppen-Mannigfaltigkeiten und ihre Verall- 
gemeinerungen, Ann. Math., (2) 42 (1941), 
22-52. 

[2] A. Borel, Sur la cohomologie des espaces 
fibrés principaux et des espaces homogénes des 
groupes de Lie compacts, Ann. Math., (2) 57 
(1953), 115-207. 

[3] M. E. Sweedler, Hopf algebras, Benjamin, 
1969. 

[4] J. W. Milnor, The Steenrod algebra and its 
dual, Ann. Math., (2) 67 (1958), 150-171. 

[5] J. W. Milnor and J. C. Moore, On the 
structure of Hopf algebras, Ann. Math., (2) 81 
(1965), 211—264. 


204 B 
Hydrodynamical Equations 


[6] J. Dieudonné, Introduction to the theory 
of formal groups, Dekker, 1973. 

[7] M. Takeuchi, Tangent coalgebras and 
hyperalgebras I, Japan. J. Math., 42 (1974), 1— 
143. 

[8] H. Yanagihara, Theory of Hopf algebras 
attached to group schemes, Lecture notes in 
math. 614, Springer, 1977. 

[9] E. Abe, Hopf algebras, Cambridge Univ. 
Press, 1980. (Original in Japanese, 1977.) 

[10] G.-C. Rota, Coalgebras and bialgebras in 
combinatorics, Lecture notes at the umbral 
calculus conference, University of Oklahoma, 
1978. 


204 (XX.10) 
Hydrodynamical Equations 


A. General Remarks 


Mathematical analysis of the motion of fluids 
(— 205 Hydrodynamics) gives rise to various 
kinds of mathematical problems; the equations 
that govern flows are amongst the most im- 
portant and most extensively studied of the 
nonlinear partial differential equations. Here 
we review only basic and noteworthy results. 
Sections B—E are concerned with incompress- 
ible fluids, while Sections F and G deal with 
compressible fluids. 


B. Nonstationary Solutions of the Navier- 
Stokes Equation 


Let Q be a bounded domain in R” (m=2 or 3) 
occupied by a fluid, with smooth boundary 
0Q. If the fluid is viscous and incompressible, 
its motion can be described by means of the 
velocity u — u(t, x) and the pressure p= p(t, x) 
with tz 0 and xeQ the time variable and the 
space variable, respectively. For simplicity, we 
assume that external forces are absent. Then u 
and p satisfy the Navier-Stokes equation 


Ou 
a t UN uev Vp, (1) 


and the equation of continuity 
divu=0, (2) 


where the positive constant v stands for the 
(kinetic) viscosity. On 0Q, u is subject to the 
boundary condition 


ulea = Blt, x). (3) 
The initial value of u 1s also prescribed: 


u|,-o = uo(x). (4) 
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If Q is unbounded, e.g., if Q is an exterior 
domain outside compact surfaces, then 


u(t,x)>Up(t) — ([x|-» o0) (3) 


is imposed in addition as the boundary con- 
dition at infinity. 

The problem of finding u and p that satisfy 
(1)-(4) for given fl and up is called the Navier- 
Stokes initial value problem (abbreviation, NS 
initial value problem). Pioneering mathemat- 
ical studies of this problem were initiated by 
J. Leray, and since the 1950s various contri- 
butions have been made by many authors, 
including E. Hopf, O. A. Ladyzhenskaya, H. 
Fujita, T. Kato, and S. Ito (2 [13,17,21]). We 
state here the result in terms of regular (class- 
ical) solutions under the simplifying assump- 
tion that Q is bounded, fj Z0, and ug is sole- 
noidal (div u — 0) and smooth. The situation 
depends nontrivially upon the dimension m. 
Namely, if m — 2, then a regular solution of the 
NS initial value problem exists uniquely and 
globally, i.e., for all time. If 23, we can prove 
only a local existence theorem (De, one hold- 
ing in a finite interval) of a regular solution. 
This solution is unique in the internal of its 
existence; however, it can be extended over 
the whole interval when the Reynolds number 
is sufficiently small. In other words, the ques- 
tion of well-posedness of the 3-dimensional 
NS initial value problem is open at present. 


C. Weak and Strong Solutions of the Navier- 
Stokes Equation 


In 1951 Hopf introduced the notion of weak 
solutions of the NS initial value problem and 
succeeded in proving their global existence 
(without uniqueness). We here give the defini- 
tion of Hopf's weak solution: Let C’, be the 
set of vector functions ue Cy (Q) with divu=0. 
By H we denote the closure of Co’, under the 
L?-norm, and by V the closure of CZ, under 
the W,!(Q)-norm (or equivalently, the Dirichlet 
norm for bounded Q). Then an H-valued func- 
tion u=u(t) is a weak solution of the NS initial 
value problem with f=0 in [0, T) (0€ T< 
+) if 

(i) ue L” (0, T; H)N L*(0, oo; V), 

(ii) u is weakly continuous from [0, T) to H, 
and 

(iii) u satisfies the weak equation 


T 
| d Pìr — v(Vu, Vol zo 


0 


T (wu: V)o, Diot dt — (uo, ll aen (5) 


for all pe Cg ([0, T) x Q) with div ọ =0. 
Note that the pressure p has been eliminated 
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in the weak equation. In general, the unique- 
ness of Hopf's weak solution is not known 
except for m — 2. However, when a regular 
solution does exist, then the weak solution 
coincides with it (a.e.), and is unique. Solutions 
of the NS initial value problem which are 
stronger than the weak solution to the extent 
that the uniqueness can be proved have been 
introduced, for instance, by A. A. Kiselev and 
Ladyzhenskaya [12] and Fujita and Kato 
[5]. Actually the existence and uniqueness 
theorems of the aforementioned regular solu- 
tions are proved using existence theorems 

of such strong solutions. Recently, Y. Giga and 
T. Miyakawa succeeded in generalizing the 
Fujita-Kato theory from L? to L”, and thus 
they have shown that if uo € L"(O) and is sole- 
noidal, a unique strong solution exists at least 
locally for any m. 

Those strong solutions that satisfy unique- 
ness, and hence are regular solutions of the NS 
initial value problem, turn out to be smooth 
for t — 0; they are analytic in t0 and xeQ. 


D. Stationary Solutions of the Navier-Stokes 
Equation 


If the flow is steady, u and p are solutions of 
the boundary value problem consisting of (1) 
with &u/Ot omitted, (2) and (3) with f indepen- 
dent of t (and, in addition, (3’) with a constant 
U, if Q is an exterior domain). The existence 
of solutions of this boundary value problem 
for the case of bounded Q was established by 
Leray as one of the earliest applications of 
his fixed-point theorem. For the case of un- 
bounded Q, Leray's study was completed and 
extended by R. Finn, H. Fujita, and others to 
yield theorems on existence, regularity, and 
asymptotic behavior in the wakes of solutions 
(— [4, 13,21]). These stationary solutions are 
unique if the Reynolds number is sufficiently 
small. On the other hand, under certain cir- 
cumstances that involve Quette flow, non- 
uniqueness or bifurcation of stationary solu- 
tions for large Reynolds numbers has been 
positively proved. 


E. Euler’s Equation 


If the fluid is inviscid and incompressible, the 
Navier-Stokes equation is reduced to Euler’s 
equation 


Ou 
S LI Vyu 2 — Vp. (6) 
ct 


vi 





Then the boundary condition is replaced by 
the frictionless boundary condition, which, in 
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the homogeneous case, takes the form 
Hu | he 0, (7) 


where u, is the normal component of u. Regu- 
lar solutions of the initial value problem con- 
sisting of (6) with (2), (4), and (7) have been 
proved to exist for all t if m=2 and in a finite 
time if m=3 [1,11]. 


F. The General Navier-Stokes Equations 


If the fluid is compressible, viscous, and heat- 
conductive, its motion is described in terms of 
the density p, the velocity u, and some thermo- 
dynamic quantity, say the absolute tempera- 
ture 0, and is governed by the following system 
of equations, sometimes called the general 
Navier-Stokes equations: 


d 

P 4 div(pu) - 0, (8) 
et 

Qu H Les l 

— y(u: Vu 2 —| A+—Vdiv ]u—— Vp, (9) 
et p 3 p 


^ 


0 
EE 
ot 


Cy p 





Us sc. v 
AU 4- — (div u)p, - —. 
cyp Cyp 
(10) 


Here the pressure p is regarded as a function 
of p and 0 through the equation of state. The 
viscosity coefficient u, the coefficient of heat 
conduction x, and cy, the specific heat at 
constant volume, are positive constants. We 
have for simplicity assumed that the external 
force is absent, and that the Stokes condition 
for viscosity is satisfied. Finally, ¥ is the dis- 
sipation function: 


dio om wpm fa g^ Y 
w- pidiv 5 Y [ur Lu]. 
j OIM ES. (& " s) 


If Q is the whole space, then the initial values 
fo; Up, Oo of p, u, 0 are given at t —0, and we 
obtain the Cauchy problem for the general 
Navier-Stokes equation. Mathematical study 
of this Cauchy problem has become active 
since J. Nash [19] and N. Itaya [8] proved the 
existence of unique regular solutions local in 
time. Following Itaya’s argument, A. Tant 
constructed a unique regular solution local in 
time for the initial boundary value problem 
consisting of (8)-(10) and boundary conditions 
imposed on u and 0. When it comes to the 
global existence of solutions, only restricted 
results are known. The global existence of a 
regular solution in the 1-dimensional version 
of the Cauchy problem has been established 
by Ya. Kanel’ [10] and Itaya under certain 
simplifying assumptions, such that the fluid 

is a barotropic gas, i.e., one obeying p= Cp", 
where C and y 2 1 are positive constants. The 
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1-dimensional initial boundary value problem 
can also be solved globally if the gas is ideal 
and polytropic, i.e., one for which p= Rp0, and 
with the internal energy proportional to 0 and 
[9]. For the 3-dimensional case, we can only 
refer to [18], where existence of global solu- 
tions of the Cauchy problem has been proved 
for initial data close to constants under the as- 
sumption that the gas is ideal and polytropic. 


G. Equations for Inviscid Ideal Gases 


When the gas under consideration is inviscid, 
ideal, and barotropic, we put u— 0 and p= Cp? 
in (9). The equation thus obtained is combined 
with (8) to yield the following quasilinear 
hyperbolic system, which admits conservation 
laws: 


L E cs 
e u}=Q, 
Ot S 


(11) 
d C 
fu V)u +—Vp’=0. 
ôt p 


It the initial data py and uy are smooth to 
some extent, then the Cauchy problem for (11) 
has a regular solution local in time. Generally, 
we cannot expect existence of regular solu- 
tions global in time, namely, discontinuity is 
likely to take place in a finite time, which cor- 
responds to the occurrence of shock waves. 
Therefore we have to introduce weak solutions 
that admit discontinuity. By definition a piece- 
wise continuous function {p, u} is a weak 
solution of (11) if it satisfies (11) in the distri- 
bution sense and if its discontinuity is sub- 
ject to a certain jump condition, called the 
Rankine-Hugoniot relation as well as another 
condition, called the entropy condition, which 
two conditions allow us to distinguish a physi- 
cally realizable solution among many possible 
discontinuous solutions [3, 14]. Global exis- 
tence of the weak solution has been proved 

so far only for the 1-dimensional problem with 
initial data close to constants in the sense that 
their oscillations and total variations are suffi- 
ciently small. Actually in this case we can 
apply J. Glimm's method [6] to construct 
weak solutions by means of a difference ap- 
proximation which involves random numbers. 
Little is known regarding the uniqueness of 
weak solutions. Finally, if we are concerned 
with steady-state solutions of an inviscid com- 
pressible fluid and assume that the flow is 
irrotational, then we are led to a quasilinear 
partial differential equation of mixed type for 
the velocity potential ®, which is elliptic in the 
subsonic region and hyperbolic in the super- 
sonic region. Classical results concerning these 
equations may be found in [2, 3]. 
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A. General Remarks 


Gases and liquids are easily deformed, and 
they share many kinetic properties. They are 
examples of fluids. By definition, a fluid is a 
continuous substance having the property that 
when it is not moving, any part of the sub- 
stance separated from the rest by a surface 
exerts an outward force that is perpendicular 
to the given surface. 

Hydrodynamics (or fluid dynamics) is con- 
cerned with the equilibrium and the motion of 
gases and liquids without considering their 
molecular structure. In particular, the branch 
of the theory concerning fluids in equilibrium 
is called hydrostatics, and hydrodynamics 
sometimes refers to the branch concerning 
fluids in motion. 

There are two methods of describing the 
motion of a fluid. One regards a fluid as a sys- 
tem consisting of an infinite number of par- 
ticles and discusses the motion of each parti- 
cle as a function of time. This is Lagrange's 
method. For example, suppose that a fluid 
particle with the coordinates (x, y, z)=(a, b, c) 
at the moment t=0 has coordinates x= 
fila, b, c, t), y= f(a, b, c, t), z= f3(a, b, c, t) at 
an arbitrary time t. Then the motion of the 
fluid is perfectly determined by the functions 
fi. fz, and fy. 

The other is Euler's method, which discusses 
the values of the velocity v(u, v, w), the density 
p, the pressure p, etc., of the fluid at arbitrary 
times and positions. From this standpoint 
each quantity of the fluid is regarded as a 
function of a space-time point (x, y, z, t). 

The rate at which any physical quantity F 
varies while moving with the fluid particle is 
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the Lagrangian derivative DF/Dt, which is 
related to the ordinary partial derivatives by 





DF GF OF OF CF 
= tuc 


e 
Ox Oy Oz 


The three components (u, v, w) and the two 
state quantities (p, p) (in general, other state 
quantities, for example, the temperature T and 
the tentropy S, are assumed to be determined 
by equations of state such as T= T(p, p), S= 
S(p, p)) are determined by five (=1+3+1) 
relations derived from the conservation laws of 
mass, momentum, and energy, namely, the 
equation of continuity, which corresponds to 
the conservation of mass, 


€p/8t + div(pv) — 0; (1) 


the equation of motion, which corresponds to 
the conservation of momentum, 


O(pv)/Ct + div(pv & v — p) - pK, (2) 


where K is the external force per unit mass, p 
is the stress tensor, and ® denotes the 'tensor 
product, while *divergence is applied to each 
row vector, and by virtue of (1), the equation 
(2) can be expressed component-wise as 





Du Op. ÓOp,, Op, 
PUE UL NES 


Dt ôx Oy Oz Fa 


cere ER 
Dt ox Oy @z 





+ pK, EN 





Dw 6 zx G z Óp;. 
Nd 


+pK,; 
Dt ôx dy à ” 


p 
and the energy equation, which corresponds to 
the conservation of energy, 


Q(pv?/2 + pE)/0t 
t div(pv(r^/24- E) - v-p--h) - pv-K, (3) 


or the equation of entropy production, which is 
another expression of (3), 


pTDS/Dt — — divh 4 Q, (3) 


where E is the internal energy per unit mass, Q 
the heat generated per unit time and volume, 
and h the heat flux. Here K, p, h, and Q or 
their relations with other quantities (e.g., h— 
—k grad T, where x is the thermal conductiv- 
ity) are assumed to be known. 


B. Perfect Fluids 


When there is a velocity gradient in the flow, 
a tangential stress appears which tends to 
make the velocity uniform, so that p is not a 
diagonal tensor (— p6,,, i.e., pressure). This 
property is called fluid viscosity. Generally, Q 
and h do not vanish in this case. However, in 
order to simplify the problem we consider a 
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nonviscous (sometimes also adiabatic) fluid, 
which is called a perfect fluid and is a good 
approximation in the large of actual fluids. 
The motion of a perfect fluid is determined by 
Euler's equation of motion 


p Dv/Dt = — grad p+ pK, (4) 


which is obtained from (1) and (2) by replacing 
Pix by the pressure only, and also by the ther- 
modynamic relation DS/Dt —0 obtained from 
(3) by putting Q —0 and h=0 or its integral 

S —constant in homentropic flow, which is 
governed by the adiabatic law poc p”, where y 
denotes the ratio of specific heat at constant 
pressure to that at constant volume. In partic- 
ular, for a liquid, the density variation can be 
neglected. Putting p — constant in (1), we have 


div v — 0, (5) 


which, in conjunction with (4), determines four 
unknowns (u, v, w, p) as functions of (x, y, z, t). 

A fluid of constant density is called an in- 
compressible fluid, and one of variable density 
a compressible fluid. Even though it might 
seem natural to consider gases as examples of 
compressible fluids, they can be treated as 
incompressible fluids if the speed of the flow 
of the gas q — |v| is small compared with the 
velocity c — J/ dp/dp of sound propagating in 
the gas. We call q/c = M the Mach number. 

The vector ot n, C), which is derived from 
the velocity vector v as o — rot v, is called the 
vorticity. A small part of the fluid rotates with 
angular velocity w/2. If œ =0, the flow is called 
irrotational, otherwise rotational. The curves 
dx:dy:dz=u:v:w and dx:dy:dz=€é:n:C are 
called, respectively, stream lines and vortex 
lines. The line integral $cv,ds along a closed 
circuit C is called the circulation around C. 

In irrotational flow, the velocity is expressed 
as v — grad 6, where 6 is called a velocity poten- 
tial. When the external force K has a potential 
Q (K — — grad Q) and p is a definite function 
of p, we have the pressure equation 
oo 


1 d 
- ege | cac constan, 
ot 2 p 


which is valid everywhere in the flow. In a 
steady flow, 


s+ [Zo constan (6) 
2 p 

is valid along each stream line or each vortex 
line; this is called the Bernoulli theorem. These 
two equations correspond to tenergy integrals 
of the equation of motion. Furthermore, cor- 
responding to the conservation of tangular 
momentum, we have Helmholtz’s vorticity 
theorem: When K = — grad Q and p= f(p), 
vorticity is neither created nor annihilated in 
the fluid. 
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For the irrotational motion of an incom- 
pressible fluid, *Laplace's equation Ao =0 is 
derived from (5). Hence the problem reduces to 
the determination of a tharmonic function ® 
under appropriate boundary conditions (e.g., 
for a fixed wall, normal velocity v, 2 0/0n = 
0). For the 2-dimensional problem a stream 
function VH is introduced to satisfy (5) by the 
relation u — 0P/Oy, v= — OH /Ox. Since the 
tCauchy-Riemann equations 0/0x — 0WV/Oy, 
05/0 y = — 0N'/Óx are valid in this case, f= 
Q 4 iV is an 'analytic function of z = x + iy. 
Therefore the theory of 2-dimensional irro- 
tational motion is essentially equivalent to the 
theory of complex tanalytic functions, and 
consequently the theory of tcomformal map- 
ping is a powerful method in the theory of 
such fluid motion. 

For irrotational steady flow of a compress- 
ible fluid in which Q —0, c is determined from 
(6) as a function of q. Then (1) and (4) yield a 
*nonlinear partial differential equation for d: 


i u? CH, i v? ges. 1 w2N0? c 
c? j ôx? c?) oy? c? } oz? 


ow €)! wu do _ uv 4 ET 


ac cure A 
c?0yüz | c?0züx — c?^óx0y 0? 


This equation is ‘elliptic or thyperbolic (— 326 
Partial Differential Equations of Mixed Type) 
according as M is less than 1 (subsonic) or 
greater than 1 (supersonic). 

For 2-dimensional flow, we can introduce a 
stream function ¥ from (1) by u = 0b/0x = 
(1/p) (C /6y), v  0o/0y = —(1/p) (OV/0x). By 
utilizing the idea of tLegendre transformation, 
this system of nonlinear equations for and Y 
can be reduced to a system of linear equations 
in the hodograph plane (q, 0): 


ab Topa 1—M? 9 


ôq "dqNoq) 00 pq 00 
0p gov 
00 pq 


(d(pq)/dq = p (1 — M?)), where the independent 
variables q and 0 are the magnitude and the 
inclination of the velocity, respectively. The 
treatment of 2-dimensional compressible flow 
on the basis of this system is called the hodo- 
graph method. For a flow of small M, there is a 
method of successive approximation (M?- 
expansion method) which starts from Laplace's 
equation, neglecting the terms of O(M?) in 
(7). For uniform flow (velocity U in the x- 
direction) past a thin wing or slender body 
where v and w are small, we have thin wing 
theory or slender body theory, whose first 
approximation is 

Ceo 6o oo 


qu s 8 
Por (8) 


1—M? 
( IL y? 
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If M «1 or M » 1 (although not too large) a 
linearization (Prandtl-Glauert approximation) 
is possible by replacing M by the Mach num- 
ber at infinity M, —U/c,. For M>1, (8) has a 
*characteristic surface, which is the Mach cone 
whose central axis makes an angle arcsinc/q — 
arcsin 1/M with the flow. This can be inter- 
preted also as an envelope produced by spher- 
ical sound waves with velocity c from a source 
drifting with velocity q. For M~ 1, we put b— 
Geht t Q9 (c, is the fluid velocity when q =c). 
Then for an adiabatic gas, (8) is approximated 
by a partial differential equation of tmixed type: 


Oo EE BEES ĉo 
du? dx Ox?” 








oz ` Cx e) 
Such a flow in which both domains M 2 1 co- 
exist is called the transonic flow, and exact 
solutions by the hodograph method are 
known. However, continuous deceleration 
from M »1 to M «1 generally tends to be 
unstable or impossible, and the appearance of 
a shock wave, i.e., a discontinuous surface of 
state quantities, is not unusual. This can be 
considered as the tweak solution of (1), (2) (3) 
for a perfect fluid. In particular, in the coordi- 
nate system fixed to the surface, its integrated 
form can be obtained as follows: [ pv, ] — 0, 

[ pó;,  pv;v,] =0, [q?/2-- E+ p/p] 2 0 ([ is 
the jump of the quantity at the surface, and n 
is the normal component). Supplemented by 
the entropy increase, these formulas give rela- 
tions between the fluid velocity and the state 
variables at the front and back of the shock. 
In an ideal gas they are called the Rankine- 
Hugoniot relation. Entropy is not uniform 
behind a curved shock, and the flow is not 
irrotational. For a weak shock starting from 
the tip of a pointed slender body, however, the 
discontinuity is small and approaches the 
*characteristic surface of (8), i.e., the Mach 
wave (compressive wave, in this case). Rare- 
factive Mach waves are found in the super- 
sonic flow of acceleration around a convex 
surface. Such waves contribute to the drag on 
an obstacle placed in supersonic flow. 


C. Viscous Fluids 


A body moving uniformly in a fluid at rest 
(with velocity less than that of sound) suffers 
no drag as long as the viscosity of the fluid 1s 
negligible and the flow is continuous (d'Alem- 
bert's paradox). Hence we must take the vis- 
cosity into account in order to discuss the 
creation and annihilation of vortices, the gen- 
eration and structure of shock waves, and 

the drag acting on obstacles. For this purpose, 
we extend Newton's law stating that frictional 
stress is proportional to the velocity gradient 
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and assume that the stress tensor p is a linear 
function of the rate-of-strain tensor e: 


2 
pss =p Hp divy + ue, — udivv 


Ou , 2 g 
= —p-F2u—-l gu ——u |divv, ..., 


ôx 3 
SS | (ew Op 
Dy; = Hey, =H TAT serre 


The proportionality constants p and u are 
called, respectively, the coefficients of (shear) 
viscosity and bulk viscosity. The bulk viscosity 
is sometimes neglected (Stokes’s assumption) in 
the usual hydrodynamics, and then the mean 
value of the normal stress components equals 
the pressure. When a fluid satisfies this linear 
relation between p and e, it is called a New- 
tonian fluid. Otherwise, it is called a non- 
Newtonian fluid. Except for a few cases, such 
as colloid solutions, fluids can be regarded as 
Newtonian. 

If we take the viscosity into account, the 
equation of motion of an incompressible fluid 
becomes 


p Dv/Dt = pK — grad p + pAv. (10) 


This is called the Navier-Stokes equation. A 
nondimensional quantity R= oU L/u formed 
by representative length L, velocity U, density 
p, and viscosity u of a flow is called the Rey- 
nolds number. In order for two flows with 
geometrically similar boundaries to share 
similar kinetic properties, their Reynolds num- 
bers must be equal. This is called the Reynolds 
law of similarity. 

For small R, we can approximate the equa- 
tion of motion (10) by replacing the accelera- 
tion Dv/Dt by év/ét (Stokes approximation) or 
by 0v/0t + Udv/dx (Oseen approximation) for a 
body placed in the uniform flow of velocity U 
in the x-direction. 

For large R, the flow can be regarded as 
that of a perfect fluid, since we can neglect pAv 
as long as the velocity gradient is not too 
large. In the vicinity of a fixed wall, however, 
the velocity gradient becomes large, because in 
a very thin layer the velocity decreases rapidly 
from the value U of a perfect fluid to zero at 
the wall. This layer is called the boundary 
layer. For the boundary layer, Prandtl’s bound- 
ary layer equation 





ou ou Ou QU | dU uu 

Qt "ës "dy ër ax póy" 

Qu dv 

0M. = 11 
ôx dy SR 


is valid, where x and y are the coordinates 
parallel and perpendicular to the wall, respec- 
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tively, and U is the velocity outside the bound- 
ary layer. 

If QU/Ox <0, it sometimes happens that the 
boundary layer separates from the surface of 
the body. In this case a vortex is generated in 
the flow, as large vorticities in the boundary 
layer are carried into the flow. For a body 
without separation of the boundary layer, the 
d'Alembert paradox holds and is no longer a 
“paradox,” and the drag is small. Such bodies 
are called streamlined. 

In compressible flow the new problem arises 
of the interaction of shock waves with the 
boundary layer. A rapid increase in pressure 
due to the shock wave formed on the surface 
of a body invalidates the assumption of a 
boundary layer and causes its separation. 1f 
the Mach number becomes sufficiently large 
(M = 5, hypersonic flow), the bow shock ap- 
proaches the body and interferes with the 
boundary layer. The generation of heat at the 
boundary layer (e.g., viscous dissipation in Q) 
requires the consideration of heat transfer as 
well as viscosity. In this manner, it becomes 
necessary to treat a complete system of equa- 
tions which take into account the energy 
equation (3) as well as the temperature de- 
pendence of x, H, and H. 


D. Laws of Similarity 


For such complicated systems, tdimensional 
analysis is often useful (— 116 Dimensional 
Analysis). As laws of similarity, we can con- 
sider not only those like the Reynolds law but 
also others for bodies which transform similar- 
ly by taffine transformations. Corresponding 
to equation (8), the Prandtl-Glauert law of 
similarity for subsonic flow is famous: The 
pressure coefficient (nondimensional pres- 
sure change) for a thin wing of chord (1.e., the 
length in the direction of flow) 1, span L, and 
thickness t is 


CAL, 1) 2 4C, (4/1 — M2, L,1/,/1— M2 4), 


when 4 is an arbitrary constant and C,, is C, 
for a body of scaled length and thickness 
placed in an incompressible flow. Correspond- 
ing to (9), an extension of the famous von 
Kármán transonic similarity is possible: 


C,(L, x) 2 1?P (y+ 1) 15 


x f(y |1- Mal L, (7+ leit MS) 


E. Turbulence 


For low Reynolds numbers, the flow generally 
has smooth streamlines. For high Reynolds 

numbers, however, extremely irregular motion 
in space and time appears. The former is called 
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laminar flow, and the latter turbulent flow (— 
433 Turbulence and Chaos). The transition 
from laminar to turbulent flow is considered to 
be due to the instability of the laminar flow, 
and this transition has been studied by the 
method of small oscillations. Recently, non- 
linear effects have also been examined. Regard- 
ing the internal structure of turbulence, statis- 
tical theories originated by T. von Kármán 
and G. I. Taylor (Proc. Roy. Soc. London, 

151 (1935)) and A. N. Kolmogorov (Dokl. 
Akad. Nauk. SSSR, 30 (1941)) are of central 


importance. 


F. Water Waves 


*Surface waves that occur on the free surface of 
water (or other liquids) are called water waves; 
their restoring forces are gravity and surface 
tension. If we consider waves generated on still 
water (assumed to be inviscid and incompress- 
ible) in equilibrium, we can regard the flow 
field associated with the wave motion as tirro- 
tational by virtue of *Helmholtz's vorticity 
theorem. Hence the flow velocity can be de- 
rived from the tvelocity potential ® which 
satisfies fLaplace’s equation AP =0, together 
with the boundary conditions 
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at z=2(x,y,t), (13) 
po-p 6 

p PRm 

at z=h(x,y,t), (14) 
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where the Cartesian coordinates x and y are 
taken in the undisturbed horizontal free sur- 
face, while the positive z-axis is vertically 
upward. The equations z= — H(x, y) and z= 
h(x, y, t) denote, respectively, the bottom sur- 
face (assumed to be known) and the elevation 
of the free surface measured from the un- 
disturbed level z 20, while g stands for the 
gravitational acceleration, o the surface ten- 
sion, po the atmospheric pressure, and 1/R,, 
the *mean curvature of the disturbed free 
surface expressed as 


Tt. (on yen 
Rn Ox} joy? 
(ON h 2808 CN 
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The conditions (12) and (13) imply, respec- 
tively, that the fluid does not cross the bottom 
and the free surface, while (14) expresses the 
fact that the difference between atmospheric 
and fluid pressures at the free surface is equal 
to the normal force (per unit area) due to the 
surface tension. Thus the problem is formu- 
lated as a nonlinear boundary value problem 
for Laplace's equation including the unknown 
boundary z=A(x, y, t). 

Let us first consider linear waves for which 
the wave amplitude of the surface elevation is 
much smaller than any other characteristic 
linear dimension such as the wavelength or 
the water depth H (for simplicity, we assume 
hereafter H — constant, i.e., a flat horizontal 
bottom). Linearizing the boundary conditions 
(12)-(14) with respect to h and grad ®, and 
assuming a sinusoidal wave proportional to 
exp[i(k*r— ct)], k(k,, k,) and w being respec- 
tively the (2-dimensional) *wave number vector 
and the *angular frequency and r — (x, y), we 
obtain the dispersion relation 


ck? 


In a layer of still water the waves are isotropic 
in the horizontal plane and the dispersion 
relation involves only the magnitude k of 

the wave number vector. It is readily seen 
from (15) that the water waves are typical 
*dispersive waves in which the tphase velocity 
Col — c)/k) depends on the wave number k or 
the wavelength A( = 27/k). It is also evident 
from (15) that the quantity ck?/(pg) measures 
the relative importance of surface tension and 
gravity. Hence for waves with wavelengths 
much larger than 4,,=22,/a/(pg) (1.7 cm 
for water), the effect of surface tension ts negli- 
gible, and we have gravity waves. Conversely, 
when 4 << Ae, the effect of surface tension 
becomes dominant, and we have capillary 
waves or ripples. When the water depth H is 
much larger than the wavelength 4, we can 
approximate (15) by o? ^ gk + ok?/p, since 
tanh(kH) — 1. We call such waves deep water 
waves. On the other hand, if H is much smaller 
than the wavelength (kH « 2z), we have shal- 
low (or long) water waves for which (15) can be 
approximated by o? 2 gHK?[1  (o/(pgH?) — 
1/3} (KH)?  ...] if c/(pgH?) - O(1). In partic- 
ular, if we neglect O(kH)?, we recover the 
well-known dispersionless long gravity waves 
whose phase velocity is simply ,/ gH. In all the 
cases mentioned above, the amplitude function 
of the velocity potential ® is proportional to 
— io cosh (k(z + H)}/{ksinh(kH)}, so that the 
flow velocity due to deep water waves de- 
creases exponentially as one proceeds verti- 
cally downward from the free surface. In the 
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limiting case of long gravity waves, however, 
the fluid motion is nearly horizontal through- 
out the fluid layer. 

When the wave amplitude becomes larger, 
nonlinear effects are no longer negligible. For 
such waves, various ‘singular perturbation 
methods provide powerful tools. For example, 
the basic system of equations for weakly non- 
linear (1-dimensional) shallow water waves can 
be reduced to a simple solvable nonlinear 
equation called the *Korteweg-de Vries equa- 
tion, whose solitary wave solution is known 
as a prototype of solitons (— 387 Solitons). 
Another classical example is a (1-dimensional) 
deep gravity wave called the Stokes wave, 
which can be obtained as a power series in the 
wave steepness (amplitude x wave number). 
The first term of the series is of the form of a 
linear sinusoidal wave, and the higher-order 
terms correspond to the higher harmonics, 
while the angular frequency is shifted from the 
linear case and depends not only on the wave 
number but also on the amplitude. Similar 
singular perturbation methods have also been 
applied to various kinds of resonant interac- 
tions such as nonlinear self-modulation, higher 
harmonic resonances, and multiwave interac- 
tions. Finally it should be mentioned that an 
exact solution representing a (1-dimensional) 
deep capillary wave was obtained by G. D. 
Crapper (J. Fluid Mech., 2 (1957) 532—540; 
extended later to the case of finite depth by 
W. Kinnersley, J. Fluid Mech., 77 (1976), 229— 
241). This is one of the few realistic exact solu- 
tions obtained so far; a famous exact solution 
of Gerstner's trochoidal wave does not satisfy 
the irrotationality condition. 
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A. Hypergeometric Functions 


The *power series 


ro) 2 T(a+nr(p+n) 

F(a, f. z)— en 
Toilen a!n) 

in the complex variable z is called the hyper- 

geometric series or Gauss's series. 

It is convergent for any o, f, and y if |z| « 1, 
and is convergent for Re(a+ i. — y) <0 if |z| — 1. 
If z 2 1, its sum is equal to T (p) (y —«— B)/T(y 
—a)I (y — f) (except when y is a nonpositive 
integer). The hypergeometric functions are 
obtained as analytic continuations of the 
functions determined by hypergeometric series 
that are single-valued analytic functions 
defined on the domain obtained from the 
complex plane by deleting a line connecting 
branch points z 2 1 and z= oo (— Appendix A, 
Table 18.1). 

A hypergeometric function is a solution of 
the differential equation 








2 


d^w dw 
BA el a soe en D2)4- — «Bw -0, 


(1) 


which is called the hypergeometric differential 
equation or Gaussian differential equation. This 
equation is a differential equation of tFuchsian 
type with fregular singular points at 0, 1, and 
oo, whose solutions are expressed, in terms of 
the *P-function of Riemann, by 


0 Ke 1 
w=P 0 o 0 EN 
In B wy-a-f 


If any one of the values of y, y —« — B, or a— 
f is integral, there exists a series containing 
logz, representing a solution of the differential 
equation (1) in a neighborhood of the corre- 
sponding singular points. When none of the y, 
y—a—f, or x — p values is integral, since the 
linear transformations z' =z, Z = 1/z, z’=1—z, 


206 B 
Hypergeometric Functions 


z’=2z/(z - 1), z =(z—1)/z, z’=1/(1 —z) permute 
singular points, there exist 24 particular solu- 
tions around the singular points. The latter 
fact was first proved by E. E. Kummer (1836). 

There exist various curves C for which the 
integral 


v=| u* 11—uy * !(1—zu) ?du 
C 


is a solution of (1). Among them we can take 
the segment [0, 1] when Rea>0, Re(y — a)» O. 
Then the corresponding solution is holomor- 
phic in the interior of the unit circle, and 


T(y) 


Fes pha) GIN 


x | u* (1 —uy-*^1(1—zu) ? du. 
C 


Since the integrand has branch points at 0, 1, 
and 1/z, we have the following expression 
when y is not an integer: 


F(a, D, y; z) 


" l ro) 
(Uert Tu — e?) P'(3)P() — a) 





(1+,0+,1-,0-) 
«4 u*(1—u)’ * !(1—zu) "du, 


where Rea 0, Re(y — 4) » 0; whereas if y is an 
integer, then 


1 I) 


F(a, B, y; z) ~ (1—e 2") P(x) Py —o) 


(1+,0+) 
«4 u*(1—uy * !(1—zu) f du, 


where the contour in the first expression en- 
circles successively each of 1, 0, 1, and 0 once 
with indicated directions. These expressions 
can be adopted as a definition of the hypergeo- 
metric functions for the general value of z. 
Other integral expressions are also known 

(— Appendix A, Table 18.1). 


B. The Ladder Method 


A linear ordinary differential equation of the 
second order having three regular singular 
points on the complex sphere is easily trans- 
formed into an equation of the form (1). To 
solve such an equation with a parameter, it is 
often useful to decompose, in two different 
ways, the main part of the equation into two 
factors of the first order, and find a recurrence 
formula involving the parameter, as we shall 
see in the following example. This method 
is called the ladder method or factorization 
method. 

For example, tLegendre’s differential 
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equation 
L,[w]=(1—27)(((1 —z2)wy +n(n4 1)w)— 
is decomposed as follows: 


L,-8, T +n’ = Tyi ‘Sa t(at i), 


$25) eas $,-(1—z?)— — nz 

T dz i R dz 
If w, is a solution of L,[w]=0, then multiply- 
ing both sides of S,- T [w,] 4- n^w,—0 by Th, 
we find that T, S.(T,[w,])  n^(T;[w,]) =0, 
that is, 7; [w,] is a solution of L,_,[w]=0. 
Similarly, we see that S,,,[w,] is a solution of 
L, 44 [w] 20. In this sense, S, and T, are called, 
respectively, the step-up operator, or up-ladder, 
and the step-down operator, or down-ladder, 
with respect to the parameter n. 

The above relation constitutes a recurrence 

formula for Legendre functions (— Appendix 
A, Table 18.II). 


C. Extensions of Hypergeometric Functions 
J. Thomas (1870) proposed the series 

py ee + (0), z", 

n=l ta fs (Bin 
(A), =A +1)... (A+n-—1) 


as an extension of the hypergeometric series. 
The sum of this series satisfies the hth-order 
differential equation 


d^w i. 
(1-277; t Uh cB) ur di^ 
h-2 
(A; E E e (A, — B,z)w —0, 


t —logz. 


When h=2 and f, — 1, it reduces to the ordi- 
nary hypergeometric series. The notation 


MCI %25 ++ t? Op; By, B2 -0 po z) 
e (qi(x3). (ag) 
z”, 2 
EO € 


which is due to L. Pochhammer and modi- 
fied by E. W. Barnes, is used to denote the ex- 
tended hypergeometric series, and the function 
defined by (2) is often called Barnes's extended 
hypergeometric function. For example, Gauss's 
series in this notation is F; (æ, f, y; z). 

Corresponding to Barnes's integral expres- 
sion for hypergeometric functions, it is known 
that the integral 


1 ctioo 
W(z)—-T— 
o xl. 


where 


K(b)—- K(E 1) 


K(Q)H(Q)z *dt, 
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T(E +a (525)... (C+ 24) 
I(C-1- BI (6 12 f)... 


is a solution of the hth-order differential equa- 
tion at the beginning of this section. The 
hypergeometric function expressed by the 
definite integral 


H(C)= 





[er — I*(C— zy dt 
has an obvious formal extension 
[c —a,) ^ ( — a3y^ ... (C — a," (C — zy c. 


On the other hand, the equation 


(II 


(h+m—2)...(h+ 1)h 


(rmn 
m-—v-—1 


Q(z) = 


UT Gë) 
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Po(z) - z — ai)(z — a3)... (Z — am), 


P (z)= st E- Bs e Bee Ae), 


— à, z—a 


called the Tissot-Pochhammer differential 
equation, has a solution 


«o- | (i-a oae tus 


X (E — am)" (-2"n "éi 


After Pochhammer (1870), this is sometimes 
called Pochhammer’s generalized hypergeo- 
metric function. 


As another extension of Gauss’s series, H. E. 


Heine (1846) introduced Heine’s series: 
C= 
(1—4)(1— q^) 
4 —4*)—4**)0 —45(1 SE 
(1—4a) —4)(1—4)(1—4*7?) 
Setting g=1+e, z=(1/e)log x, and letting £0, 


we obtain Gauss's series as the limit of Heine's 
series. 


Q(a, b, c; q;z)=1+ 





D. Hypergeometric Functions of Several 
Variables 


P. Appell (1880) formally extended Gauss's 
series to the case of two variables and defined 


r++ f) 
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four kinds of functions [3]: 
F (06 p, BS y; x. y) 


=> y (0), cs (DB), rl Bn em yn 


n=0 mInt(y), e 
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alo; By, y^; x, y) 
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m=o n=0 m!nl(y),(y'), 


They are called Appell’s hypergeometric func- 
tions of two variables. Each satisfies a cor- 
responding system of partial differential 
equations: 


x(1—x)r+ yd —x)s + (y — ex)p — Bya 


— afiz — 0, 
F, 4 y(1 — y)t * x(10 — y)sc (y —c'x)q— f'xp 
—ap’z=0, 


(x—y)s— B'p + Bq —0, 
x(1 — x)r — xys t (y — ex)p — Byq — ofz =0, 
F, t —y)t - xyst (y —c' y) D cp 
—ap’z=0, 


x)r + ys - (y —ex)p — afz = 
y(1— yt xs-ct(y—c"y)q—a' f'z 20, 


x(1—x)r— y?t -2xys - (y — ex)p — cyq 


—afz=0, 
Fy 
y(i — yt —x?r — 2xys 4 (y — cy)q ^ exp 
— xfiz — 0, 
where 
c=a+B+1, c-acf'-u-l c"=0'+P' +1, 
p=0z2/6x, q=0z/dy, r=072/dx’, 
s=0°z/ðxðy, t=6*z/dy?. 


Appell’s hypergeometric functions can also be 
represented by integrals: for example, 


rarer =i" 
" Të EI 


x(1—u--v) 9 9-!(1—ux—vy) *dudv, 


rm rro’) |, piura 
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x(1—uy 9 !(1—v)' "9-!(1—ux-—vy) *dudv, 
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x(1—u-—vy FF !(1—ux) *(1—vy) * dudv, 


where, for F, and F,, the domain of integra- 
tion is u>0, v>0, 1—u—v>0. E. Picard 
(1881) showed that F, can also be expressed 
by a single integral: 


; 1 
= TQ) | üt (1-4 
F&T (x—7) Jo 


x(1—ux) ^(1—uy) ^ du. 


G. Lauricella (1893) extended the foregoing 
functions to the case of more than two vari- 
ables. More general hypergeometric series of 
several variables were defined by R. Mellin, 
J. Horn, and J. Kampé de Fériet [3,4]. Every 
algebraic equation can be solved analytically 
in terms of the foregoing functions (Mellin, 
R. Birkeland) [4]. Also there are studies con- 
cerning *Riemann's problem and *automor- 
phic functions derived from F, (Picard; 

T. Terada [5]). 


E. Hypergeometric Functions with Matrix 
Argument 


For symmetric matrices Z of degree m, C. S. 
Herz defined hypergeometric functions with 
matrix argument as follows [5]: Denoting by 
etr Z the exponential exp(tr Z) of the ttrace of 
Z, let 


oFo(Z)=etr Z, 





pr Era S o hei) 
1 

= etr(— A) F (o, -3 De By, .... Bas 

Ial) Ja>o i p Pi à 


AZ)(det AP ?d4A,,dÀ55...dA,m. (3) 
E EE va Bp uA) 


_ Ist) "m | 
"Qaia ReZ-X Ge p Fats... fe 
ez-— o? 


Bises Bs AZ~')(det Z) *dz,,dz;,...dz,. (4) 
where 


ASU, 


alte e 

Z=((1  ài)zij/2),, ji-1..... m 

L,(y) =n” ^ T(y)T (y — 1/2)... 
x lF'(y — (m — 1)/2), 


and A >0 means that A is tpositive definite. 
The integral (3) converges for — Z >0 if Rey 
>(m—1)/2. If Rey is sufficiently large, then for 
suitably chosen Xo, (4) converges in a domain 
of the space of A and represents an analytic 
function of its argument. In particular, we 
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have 
1Fo(a; Z) - (det(E — Z)) *. 


Based on this definition, many special func- 
tions and formulas are extended to the case of 
a matrix argument. For example, 


A5(Z)= oFi(0 +(m + 1/2; — ZyT,QO t (m + 1)/2) 
(5) 


is an extension of the *Bessel function, and this 
reduces to 


(t/2) *Js(t) = As(t/2))) 


when m= 1. Formula (5) is applied to the 
tnoncentral Wishart distribution in mathe- 
matical statistics. 
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A. Ideal Boundaries 


For a given Hausdorff space R, a tcompact 
Hausdorff space R* that contains R as its 
dense subspace is called a compactification of 
R, and A= R* —R ts called an ideal boundary 
of R. In the present article, we deal mainly 
with properties (in particular, function- 
theoretic properties) of ideal boundaries of 
*Riemann surfaces R. 


B. Harmonic Boundaries 


By R we mean an open Riemann surface. 
The set I’ of points p* in A such that 

lim infp,,+p* P(p) —0 for every tpotential P (i.e. 
a positive fsuperharmonic function P for 
which the class of nonnegative "harmonic 
functions smaller than P consists only of the 


constant function 0) is a compact subset of R*. 


The set T is called the harmonic boundary of R 
with respect to R*. For an arbitrary compact 
subset K in A — T, there exists a finite-valued 
potential Py with Im, Px(p)= œ (p*e K). 
From this, various kinds of *tmaximum prin- 
ciples are derived. For instance, if u is a har- 
monic function bounded above for which 
limsup,..-u(p) « M holds, then u< M on R. 
There are infinitely many compactifications 
of R. For two compactifications R¥ (i= 1,2) 
of R, we say that RT is greater than R3 or, 
equivalently, lies over R7, if the identity map- 
ping of R can be extended to a continuous 
mapping of Rf onto R3. In order that deep 
function-theoretic studies of R* can be carried 
out, various conditions must be imposed on 
R*. A compactification R* is said to be of 
Stoilow type (or of type S) if for every tcon- 
nected open subset G* in R* whose boundary 
in R* is contained in R, G* — A is also con- 
nected. Next suppose that R¢O, (— 367 Rie- 
mann Surfaces E). For a given real-valued 
function f on A, let DÄ * OUR be the class 
of tsuperharmonic functions s bounded from 
below (tsubharmonic functions s bounded 
from above) such that lim infa. s(p) 2 f(p*) 
(lim SUPrap>p* s(p) € f(p*)) for every p*eA. If 
these classes are nonempty, then HP *'(p)— 
inf(s(p)| seU? ET and HR E (p) 2 sup(s(p)|se 
We" are harmonic on R, and HE < HRP 
In particular, if HÉR z H?-*", then the com- 
mon function is denoted by Hf-*", and the 
function f is said to be resolutive with respect 
to R*. A compactification such that every 
bounded continuous function on A is resolu- 
tive is called a resolutive compactification. In 
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such a case, a point p* in A is said to be regu- 
lar with respect to the fDirichlet problem if 
liM spp» HF?" (p)— f(p*) for every bounded 
continuous function f on A (— 120 Dirichlet 
Problem). The set A, of regular points in A 

is contained in T. If R* is a resolutive com- 
pactification, then there exists a unique posi- 
tive Borel measure y, such that HF:*'(p)— 
fa f(p*) du,(p*) for every bounded continu- 
ous function f on A. This measure is called 
the harmonic measure with respect to pe R. 
There exists a function P(p, p*) on R x A with 
du (p*)- P(p, p*) du,(p*) for an arbitrary fixed 
point o in R satisfying the following three 
conditions: (1) P(p, p*) is harmonic on R asa 
function of p; (ii) P(p, p*) is Borel measurable 
as a function of p*; (iii) k(o,p) ! < P(p,p*)« 
k(o, p), with the Harnack constant k(o, p) of 
to. p} relative to R [9]. 


C. Compactifications Determined by Function 
Families 


A family F of real-valued continuous functions 
on R admitting infinite values is called a sep- 
arating family on R if there exists an f in F 
such that f(p) z f(q) for any pair of given 
distinct points p and q in R. A compactifica- 
tion R* is called an F-compactification, de- 
noted by RS. if every function in F can be 
continuously extended to R* and the family 
of extended functions again constitutes a sep- 
arating family on R*. The correspondence 

o F5 Rf defines a single-valued mapping 

of all separating families F on R onto all F- 
compactifications of R. If F, > F,, then o(F) 
lies over q(F;). For any R*, o! (R*) contains 
infinitely many separating families, among 
which the separating families constituting 
tassociative algebras are important. The fol- 
lowing are typical examples of compactifica- 
tions determined by function families: 

(1) The Aleksandrov compactification is the 
U-compactification Rý with the family U of 
bounded continuous functions on R with 
compact support. It is the smallest compacti- 
fication of R and is often used in function 
theory in discussing Dirichlet problems for 
relatively noncompact subregions in reference 
to relative boundaries. 

(2) The Stone-Cech compactification is the 
G-compactification Rě with the family € of 
bounded continuous functions on R. It is the 
largest compactification of R. It is rarely used 
in function theory, but an application is found 
in the work of M. Nakai [8]. 

(3) The Kerékjártó-Stoilow compactification 
is the Z-compactification R¥ with the family 
S of bounded continuous functions f on R 
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such that there exist compact sets K, with the 
property that the f are constants on each 
connected component of R — K,. This is the 
smallest compactification of Stoilow type. 
Many applications of this compactification 
can be found in function theory, among which 
the investigation done by M. Ohtsuka on the 
Dirichlet problem and the theory of conformal 
mappings is typical. 

(4) The Royden compactification is the R- 
compactification R# with the family R of 
bounded C* functions f on R with finite 
Dirichlet integrals (le df ^ * df. It was intro- 
duced by H. L. Royden and developed further 
by S. Mori, M. Óta, Y. Kusunoki, Nakai, and 
others. This compactification has been used 
effectively in the study of HD-functions and 
the classification problem of Riemann surfaces 
(^ 367 Riemann Surfaces). 

(5) The Wiener compactification is the W- 
compactification RS with the family 9B of 
bounded continuous functions f on R such 
that ( Hf^] converges to a unique harmonic 
function independent of the choice of exhaus- 
tions {G,} of an arbitrary fixed subregion G¢ 
Og, where the G, are relatively compact sub- 
regions of G. It is the largest resolutive com- 
pactification, and compactifications smaller 
than Rẹ are always resolutive. This compac- 
tification was introduced independently by 
Mori, K. Hayashi, Kusunoki, and C. Constan- 
tinescu and A. Cornea and is useful for the 
study of HB-functions and the classification of 
Riemann surfaces. 

(6) The Martin compactification is the W- 
compactification Rj with the family W of 
bounded continuous functions f on R such 
that there exist relatively compact regions 
R; with the property that f = HA PRORA Rs 
H Rr Ra E: on R — Ry. Here f* coincides 
with f on R; and equals 0 on Ri — R, and 1* 
is similarly defined. The set RA — R is called 
the Martin boundary of R. If fGreen’s func- 
tion g exists on R, then the function m(p, q) — 
g(p, q)/g(o, q) for an arbitrary fixed oe R can 
be extended continuously to R x Ri, which ` 
is called the Martin kernel. By the metric 
dyy(q, r) =SUPper, MCP, OU) + mp, q)) — mp, r)/ 
(1+m(p,r))| with a parametric disk Rp in R, 
Ri is *metrizable. This compactification was 
introduced by R. S. Martin, and many appli- 
cations of it to the study of H P-functions, 
potential theory, Markov chains, and cluster 
sets were obtained by M. H. Heins, Z. Kura- 
mochi, J. L. Doob, Constantinescu and Cor- 
nea, and others. 

(7) For a function f on R, (R)df/én=0 
means that there exists a relatively compact 
subregion R, such that f is of class C” on R 
outside R, and the Dirichlet integral of f over 
R — R, is not greater than those of functions 
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on R—R, that coincide with f on the bound- 
ary of Ry. The Kuramochi compactification is 
the R-compactification Rë with the family & of 
bounded continuous functions f on R satisfy- 
ing (R)ôf/ôn =0. The continuous function 

k(p, q) on R such that (R)Ok/On =0 vanishes in 
a fixed parametric disk Ry in R and is har- 
monic in R — R, except for a positive tlogarith- 
mic singularity at a point g can be extended 
continuously to R x R$, which is called the 
Kuramochi kernel. By the use of this kernel, 

R$ is metrizable, as in the case of Martin 
compactification. This compactification was 
introduced by Kuramochi, and its important 
applications to the study of HD-functions, 
potential theory, and cluster sets were made 
by Kuramochi, Constantinescu and Cornea, 
and others. 

Among compactifications (1)-(7), no bound- 
ary point in (2), (4), or (5) satisfies the tfirst 
countability axiom (hence they are not metriz- 
able), while the others are all metrizable. In (4) 
and (5), A, =T. Fig. 1 shows the relationship 
among the seven examples. Here A B means 
that A lies over B, and A zx B means that in 
general neither A B nor B A. 


ui mS 
Rt—RS XX "RR, 
R&—R& 


Fig. 1 


D. Remarks 


In contrast to topological compactifications 
(1)-(3), (4)-(7) can be regarded as potential- 
theoretic and have enough ideal boundary 
points so that one can solve the Dirichlet 
problem and introduce various measures 
there. Utilizing Green's functions, R. S. Martin 
[6] deduced the first important compactifi- 
cation and gave the integral representation of 
positive harmonic functions (the extension of 
the tPoisson integral). Z. Kuramochi [3] ob- 
tained his compactification similarly by using 
N-Green functions introduced by himself 
instead of the usual Green’s functions. In this 
compactification, the ideal boundary points 
can be considered to be interior points of the 
surfaces in a potential-theoretic sense. In case 
of finitely connected domains with smooth 
boundaries, both the Martin and Kura- 
mochi boundaries coincide with the usual 
boundaries. 

The Royden and Wiener compactifications 
were introduced as the maximal ideal spaces 
of respective function algebras. Their ideal 
boundaries contain extremely many points. 
However, these compactifications have elegant 
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properties and many applications. An analytic 
mapping ọ of a Riemann surface R into an- 
other R' is called a Dirichlet (Fatou) mapping 
if ọ can be extended to a continuous map- 
ping of R&(Rẹ) to Rot (R). For example, a 
Lindelófian mapping (AD-function) is a Fatou 
(Dirichlet) mapping. The Dirichlet and Fatou 
mappings were investigated by Constanti- 
nescu, Cornea, and others. 

By using the Martin boundary, Z. Kuramo- 
chi and M. Nakai proved the extension of the 
Evans-Selberg theorem to parabolic Riemann 
surfaces [3,8]. The normal derivatives of HD- 
functions on the ideal boundaries and their 
applications were studied by Constantinescu, 
Cornea, F. Maeda, Y. Kusunoki, and others. 
The theory of compactification can be gen- 
eralized to domains in R", tGreen spaces, and 
harmonic spaces. 
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A. General Remarks 


Historically, a function y of x was called an 
implicit function of x if there was given a 
functional relation f(x, y) 20 between x and y, 
but no explicit representation of y in terms of 
x (^ 165 Functions). Nowadays, however, 
the notion of implicit function is rigorously 
defined as follows: Suppose that a function 
SXi -Xp y) is of fclass C! in a domain G in 
the real (n+ 1)-dimensional Euclidean space 
R^ and that f(x}, ..., x8, y°)=0, dX aes 
x, y9) 0 at a point (x9, ..., x, y9) in G. Then 
there is a unique function g(x,,...,x,) of class 
C! in a neighborhood of the point (x9, ..., x9 
that satisfies f(x,, ...,x,,g(x4, ..., x,)) =, 

y? 2 g(x9, ..., x2) (implicit function theorem). 
The function g is called the implicit function 
determined by f —0. The partial derivatives of 
g are given by the relation 


dg/Oxj= —(ðflðx Nfld), 


where y —g(x,, ..., x,). If the function f is of 
class C' (1 <r < oo or r=q), then the function 
g is also of class C". In particular, when n= 1, 
letting x, be x, we have dg/dx = — f; /f,. 


B. Jacobian Matrices and Jacobian 
Determinants 


A mapping u from a domain G in R" into R" 


ux) X ur X er sets X) 


XD oe XE 


is called a mapping of class C" if each compo- 
nent ui, ...,u, Is of class C (Or < oo or r=w) 
in G. Given a mapping u of class C! from G 
into R", we consider the following matrix, 
which gives rise to the differential du, of the 
mapping u (— 105 Differentiable Manifolds I): 


O(u)/0(x) — (0uj/0x4)4 «i«m.1 eben, (1) 


This matrix is called the Jacobian matrix of 
the mapping u at x. If there is another map- 
ping v of class C! from a domain containing 
the trange U of u into RB, then we have the 
law of composition: 


(6(v)/0(u)) (O(u)/6(x)) = 0(v)/0(x). 


When n =m, the fdeterminant of the matrix (1) 
is called the Jacobian determinant (or simply 
Jacobian), and is denoted by D(u)/D(x), 


D(u,, ...,u,J/D(x,, ..., x,) or 
D(u,, ...,u,) 
Ds xe)" 
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Sometimes the notation 8 is used instead of D; 
but in the present article we distinguish the 
matrix from the determinant, using ô for the 
matrix and D for the determinant. 

If m=n and D(u)/D(x) never vanishes at any 
point of the domain G, then u ts called a regular 
(or nonsingular) mapping of class CT. If the 
Jacobian D(u)/D(x) is 0 at x, we say that u is 
singular at x. A mapping that is singular at 
every point in a set S c G is said to be degen- 
erate on S. For a regular mapping u, the sign 
of the Jacobian is constant in a connected 
domain G. If it is positive, the mapping u 
preserves the orientation of the coordinate 
system at each point in G, while if it is nega- 
tive, the mapping changes the orientation. A 
point where u is degenerate is called a critical 
point of the mapping u, and its image under u 
is called a critical value. In general, the image 
of the mapping is "folded" along the set of 
critical points. The set of critical values of a 
mapping u of class C! (sending a domain in R" 
into R") is of Lebesgue measure 0 in R" (Sard's 
theorem). If u is a regular mapping, then each 
point in the domain G of u has a neighbor- 
hood V such that the restriction of u on V is a 
*topological mapping. Its inverse mapping x(u) 
is also a regular mapping of class C! and 
satisfies the relation 


D(u)D(x) ` 
D(x) Dlu) 


(inverse mapping theorem). If u is of class C" 
(I xrzxoc or r—o)) then so is its inverse 


mapping. 


C. Functional Relations 


A function F(u,, ...,u,) defined on a domain B 
in R" is called a function with scattered zeros if 
F has a zero point (i.e., there exists a point u 
for which F(u) — 0) and if every open subset of 
B contains a point u such that F(u) 40. Every 
tanalytic function #0 has scattered zeros. Let 
u(x) be a mapping from a domain G in R" into 
BCR". Suppose that there exists a function 
F(u) defined in B, of class C", with scattered 
zeros. If F(u(x)) «0 for every x in G, then we 
say that the components u,,...,u, of the 
mapping u have a functional relation of class 
C" or are functionally dependent of class C’. In 
such a case, we sometimes say simply that 
u,,...,U, are functionally dependent or that 
they have a functional relation. If the compo- 
nents u,,...,u, of a mapping u of class C! are 
functionally dependent of class CH. then the 
Jacobian D(u)/D(x) of u must vanish. Con- 
versely, if the Jacobian D(u)/D(x) of a mapping 
u of class C! is identically 0 in the domain G, 
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the components u,,...,u, are functionally de- 
pendent of class C” on every compact set in G 
(Knopp-Schmidt theorem) [1]. 


D. Implicit Functions Determined by Systems 
of Functions 


Suppose that the trank of the Jacobian matrix 
of (1) is r « m everywhere in G. Suppose that 
u(x) is a mapping of class C! from a domain 
G in R" into R", the Jacobian determinant 
D(u,,...,u,)/D(x,,...,x,) never vanishes in G, 
and further D(u,, ...,u,,uj)/D(x,, ..., Xps Xa) is 
identically 0 in G for each p, o with r «p <m, 
r «g xn. Then the values u, ,,(x), ..., u, (x) are 
determined by the values u,(x), ..., u,(x), and 
each u, is represented as a function of class C! 
OL ..., Up. 

Let u(x) be a mapping of class C! from a 
domain G in R” into R" and V the tinverse 
image of a point u?. To study the properties of 
the set V, we assume, for simplicity, that u? is 
the origin. Suppose that the rank of the matrix 
O(u)/O(x) is r for every point x in and each 
of u,,,, ..., Um 1s functionally dependent on 
u,,-..,U,. Then each u, (r « p <m) is a func- 
tion u,(u,,...,u,) of u,, .... u,. The set V is 
empty if there is a p such that u,(0,...,0) 40. 
On the other hand, if u,(0,...,0)=0 for all p 
(r € p m), then V is the set of common zero 
points of the functions u,(x), ...,u,(x). There- 
fore, to study the set V, we can assume that r — 
m «n. If m— n, V consists of isolated points 
only. If m « n, then, changing the order of the 
variables x,, ..., x, if necessary, we can assume 
that D(u,, ...,u,)/D(x,, ..., x,) #0 ata point 
(x9) in V. In this case, there is a unique func- 
tion £,(x,,,, ..., Xa) of class C! (1€ u xm) ina 
neighborhood of (x?) satisfying the following 
two conditions: (i) x? = 6,941. -.., x2); (ii) if 
the point (x,,,,, ..., x,) is in a neighborhood of 
(Gen, X9), then the point 


(Ei (Xm+1> nag Xa) SARS SX seis Me , Xp), 


due XD EV. 
Each function 6, is called an implicit function 
Of x,,,,, ..., x, determined by the relations 
u,=...=u,,=0. The ttotal derivatives of the 

¢, are determined from the system of linear 
equations 





Ou; n Qu; 
X —dé,+ Y —*dx,=0, j=1,...,m 
k=1 OX, l=mt+1 0X} 


The foregoing implicit function theorem is a 
local one. Among the global implicit function 
theorems, the following one, due to Hadamard, 
is useful: Let x — y(x) be a mapping of class C! 
from R” into R” such that the inverse d ^! (x) 
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of its Jacobian matrix ®(x) is bounded in R”; 
then the mapping is a tdiffeomorphism of class 
C! from R” onto R”. 

The implicit function theorem holds also in 
complex spaces. Let fí(x;,.... Xans yi... Yph 
1 <i<p, be a system of tholomorphic functions 
in a domain in the complex (n+ p)-dimensional 
space C"*?. If (i) tt, ..., xp, y9, .... yp) 
0, 1 <i<p, and (ii) D( fis ..., f) /D(y,, ..., 
Hehe, - (9, yo) € 0, then there exists a unique 
holomorphic solution y; — y,(x,, ..., x,) (1 € 
ix p) in a neighborhood of the point x? = 
(x9, ..., xe) that satisfies fí(x,, ..., X, Y1(X4, 

asso Mp puso A) = OTLSTS pP) 


E. Linear Relations 


Suppose that u(x) is a mapping of class C"! 
from Ri into R”. [ts components (uy, ..., Um) 
are functions of class C”~'. Then the 
determinant 


up7D o upon um 
is called the Wronskian determinant (or simply 
the Wronskian) of the functions u,,...,u,, and 
is denoted by W(u,,u5,...,u,,). If the func- 
tions u,,...,U, are flinearly dependent, i.e., if 
there exist constants c; not all zero satisfying 
ZE, cju(x) ^0 identically, then the Wronskian 
vanishes identically. Therefore, if W(u,,...,u,) 
#0, then the functions u,,...,u,, are linearly 
independent. Conversely, if W(u,, ...,u,) —0 
identically, and further if there is at least one 
nonvanishing Wronskian for u,, ..., Uu; 4, Han, 
Sus (1 <ix<m), then the functions u,,...,u,, 
are linearly dependent. The necessity of the 
additional condition is shown by the follow- 
ing example: u, = x? and u, 2 |x|? are of class 
C! in the interval [ —1,1] and linearly inde- 
pendent, but they satisfy W(x?,|x|*) 20 iden- 
tically. However, the additional condition is 
unnecessary if the functions u,,...,u,, are 
analytic. Similar theorems are valid in a do- 
main in a complex plane. 

Furthermore, if u(x) is a continuous map- 
ping from an interval [a,b] in R! into R”, the 
determinant 


(1) ... (lm) 
G(u,,.... Um) = (2,1) (2, m) , 


(m, 1) (m, m)| . 


b 
(j,k) -| uj(x)u,(x) dx 


is called the Gramian determinant (or simply 
Gramian). The Gramian is the fdiscriminant of 
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the quadratic form 


| (£u) dx 


of či, ..., £,, and is equal to 


b b 
E | 8 | (det(u,(x,)))? dx, ... dx,,. 
m! |, à 


We always have G(u,, ...,u,) 20, and G(u,, 
...,U,,)=90 if and only if u,,...,u,, are linearly 
dependent. The Gramian is defined if the 
functions u,,...,u,, are tsquare integrable 

in the sense of Lebesgue. In that case, the 
condition G(u,, ...,u,,) 2 0 holds if and only if 
Ui, ..., Uu, are linearly dependent talmost every- 
where, i.e., there are constants c,, ...,c,, not all 
zero such that the relation c,u,(x) +... + 
c, u, (X) — 0 holds except on a set of Lebesgue 
measure 0. 
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indian Mathematics 


India was one of the earliest civilizations, but 
because it has no precise chronological record 
of ancient times, it is said to possess no his- 
tory. Indian mathematics seems to have de- 
veloped under the influence of the cult of 
Brahma, as did the calendar. It may also have 
some relation to the mathematics of the Near 
East and China, but this ts difficult to trace. 
The word ganita (computation) appears in 
early religious writings; after the beginning of 
the Christian Era, it was classified into pati- 
ganita (arithmetic), bija-ganita (algebra), and 
ksetra-ganita (geometry), thus showing some 
systematization. The Buddhists (notably Na- 
garjuna) had a kind of logic, but it had no re- 
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lation to mathematics. Unlike the Greeks, the 
Indians had no demonstrational geometry, 
but they had symbolic algebra and a position 
system of numeration. 

Indian geometry was computational; Arya- 
bhata (c. 476—c. 550) computed the value of z 
as 3.1416; Brahmagupta (598—c. 660) had a 
formula to compute the area of quadrangles 
inscribed in a circle; and Bhaskara (1114— 
1185) gave a proof of the Pythagorean theo- 
rem. In trigonometry, Aryabhata made a 
table of sines of angles between 0? and 90° 
for every 3.75? interval. The name “sine” is 
related to the Sanskrit jya, which referred to 
half of the chord of the double arc. 

The Indians had a remarkable system of 
algebra. At the beginning they had no oper- 
ational symbols and described in words the 
rules for solving equations. Brahmagupta 
worked on the *Pell equation ax? +1 = y?. 
Bháskara knew that a quadratic equation can 
have two roots that can be positive and nega- 
tive, but did not assign any meaning to the 
negative root in such cases. Bhaskara also 
introduced algebraic symbols. 

The symbol 0 was used in India from about 
200 B.c. to denote the void place in the posi- 
tion system of numeration; 0 as a number is 
found in a book by Bakhshali published in the 
3rd century A.D. The number 0 is defined as a 
—a=Q in our notation, and the rules a 3-0— 
a, 0xa-0, /0=0, 0+a=0 are mentioned. 
Brahmagupta prohibited division by 0 in 
arithmetic, but in algebra he called the “quan- 
tity” a+0 taccheda. Bhaskara called it khahara 
and made it play a role similar to that of our 
infinity. 

Some historians assert that the Indians had 
the ideas of infinity and infinitesimal. Some 
hold that the Indian position system of numer- 
ation arose from the circumstance that the 
names of numbers differed according to their 
positions. The Indian numeration system was 
exported to Europe through Arabia, and 
had great influence on the development of 
mathematics. 
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Inductive Limits and 
Projective Limits 


A. General Remarks 


Inductive and projective limits can be de- 
fined over any *preordered set J and in any 
tcategory. We first explain the definition of 
these limits in the special case where J is a 
*directed set and the category is that of sets, of 
groups, or of topological spaces. The simplest 
case is when / is the ordered set N of the 
natural numbers. 


B. The Limit of Sets 


Let J be a directed set. Suppose that we are 
given a set X; for each ie I and a mapping 

Pi: X;2 X; for each pair (i,j) of elements of I 
with i<j, such that g; = 1x, (the identity map- 
ping on Xj) and Pi = 9,0 P; (i<j <k). Then 
we denote the system by (X;, ;;) and call it 
an inductive system (or direct system) of sets 
over I. Let S be the tdirect sum II, X; of the 
sets X; (ie I), and define an equivalence re- 
lation in S as follows: xe X; and ye X; are 
equivalent if and only if there exists a ke] 
such that i € k, j & k, o,;(x) 7 &,(y). Let D be 
the tquotient set of S by this equivalence rela- 
tion, and let f;: X; D (ieI) be the canonical 
mappings. Then we have I(1) fo oj fi (i<j); 
I(2) for any set X, and for any system of map- 
pings g;: Xj X (iel) satisfying gjo 9j; — gi 
(i<j), there exists a unique mapping f: DX 
such that f o f; — g; (ie I). We call (D, f;) the 
inductive limit (or direct limit) of the inductive 
system (X;, 9) over I, and denote it by lim X; 
or ind lim X; (more precisely, by limjcr X; or 
ind lim;,, X;). 

Suppose, dually, that we are given a set X; 
for each ie] and a mapping yj: Xj X; for 
each i<j, such that y;; — 1y, and y, — ijo Wy 
(i xj € k). Then we denote the system by (X;, 
V;;) and call it a projective system (or inverse 
system) of sets over 7. Let P be the subset 
of the Cartesian product [ LX; defined by P= 
{œl pyx )=x; (i <j)}, and let p;: PO X; be 
the canonical mappings. Then we have P(1) 
V; O p; = p; (i<j); P(2) for any set X, and for 
any system of mappings q;: X >X; satisfying 
V; O 4d; (i<j), there exists a unique mapping 
p: X OP such that p;o p=q; (ie I). We call 
(P, p;) the projective limit (or inverse limit) of 
the projective system (X;, dal over I and de- 
note it by lim X; or proj lim X;. 
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Note that we may replace I by any coftnal 
subset of J without changing the limits. 


C. The Limit of Groups and of Topological 
Spaces 


If, in the notation of Section B, X; is a group 
and qj; (V) is a homomorphism, then we say 
that (X;, p) ((X;, V/;)) is an inductive (projective) 
system of groups. The inductive limit (as a set) 
D =lim X; has the structure of a group for 
which the canonical mappings f; are homo- 
morphisms. With this group structure, D is 
called the inductive limit (group) of the induc- 
tive system of groups. It satisfies properties 
I(1) and I(2) with group X and homomor- 
phisms g; and f. Similarly, the projective limit 
(as a set) P— lim X; has a unique group struc- 
ture such that each p;: P— X; is a homomor- 
phism, namely, that of a subgroup of the direct 
product group []X;. The group P is called the 
projective limit (group) of the projective system 
of groups. When each X; is a module over a 
fixed ring A, we get entirely similar results by 
considering A-homomorphisms instead of 
group homomorphisms. 

Next, let X; be a topological space and oj 
and y/;; be continuous mappings. Then (X;, p) 
((X;, Y;)) is called an inductive (projective) 
system of topological spaces. If we introduce in 
D -lim X; the topology of a quotient space of 
the ttopological direct sum of the spaces X; 
(i€ I), then the f; are continuous, and I(1) and 
1(2) hold with sets and mappings replaced by 
topological spaces and continuous mappings. 
Similarly, if we view P —lim X, as a subspace 
of the tproduct space nx, then the p; are 
continuous and P(1) and P(2) hold with the 
same modification as before. The spaces D and 
P are called the inductive limit (space) and the 
projective limit (space) of the system of topo- 
logical spaces, respectively. The projective 
limit of Hausdorff (compact) spaces is also 
Hausdorff (compact). 

Furthermore, if the X; (ie I) form a topolog- 
ical group and oj, V; are continuous hom- 
omorphisms, then lim X; and lim X; are topo- 
logical groups, and properties 10, I(2), P(1), 
and P(2) are satisfied for topological groups 
and continuous homomorphisms (— 423 
Topological Groups). In particular, projec- 
tive limits of finite groups are *totally discon- 
nected compact groups and are called profinite 
groups; they occur, e.g., as the ring of *p-adic 
integers and as the *Galois group of an infinite 
*Galois extension. Conversely, the tgerms of 
continuous functions at a point x in a topolog- 
ical space X, and other kinds of germs (— 383 
Sheaves), are important examples of inductive 
limits of groups. 
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D. Limits in a Category 


Let I be a preordered set and € a category. If 
we are given an object X; of a category € for 
each iel and a *morphism oj: X; X; of € 

for each pair (i,j) of elements of J with i<j, 
and if the conditions oj; — lx, Pki = Pry Pi 

(i € j € k) are satisfied, then we call the system 
(X;, 9) an inductive system over / in the cate- 
gory €. A projective system in € is defined 
dually: It is an inductive system in the *dual 
category €^. If we view I as a category (— 52 
Categories and Functors B), then an inductive 
(projective) system in € over the index set 7 is 
a *covariant (tcontravariant) functor from 7 to 
€. Now if an object Deg and morphisms 

f: X; D (ie I) satisfy conditions I(1) and IO) 
with the modification that X is an object and 
gi, f are morphisms in €, then the system 
(D, fj) is called the inductive limit of (X;, o) and 
is denoted by lim X;,. Similarly, if an object 
Pe@ and morphisms p;: P X; (ie I) satisfy 
P(1) and P(2) with a similar modification, then 
(P, pj) is called the projective limit of (X;, y;;) 
and is written lim X;. By I(2) and POL these 
limits are unique if they exist. 

In the categories of sets, of groups, of 
modules, and of topological spaces, inductive 
and projective limits always exist. Note that if 
the ordering of / is such that i<j implies i=j, 
i.e., if there is no ordering between two distinct 
elements of J, then the inductive (projective) 
limit is the direct sum (fdirect product) (— 52 
Categories and Functors E). 

Let (X;, 95), (Xi, pq) be two inductive sys- 
tems over the same index set J, and let o: 
Xj2 Xi (ieI) be morphisms satisfying 950 p; 
= 9,0 P; (i<j). Then the system (oj) is called 
a morphism between the inductive systems. 
Such a morphism is a fnatural transformation 
between the inductive systems viewed as func- 
tors Te If lim X; and lim X; exist, then (qj) 
induces a morphism lim pi: lim X;olim X;in 
a natural way, and similarly for projective 
limits. 

For the more abstract notion of limit of a 
functor — [5]. For the theory of procategories 


— [4]. 
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A. General Remarks 


In this article we consider various properties 
of inequalities between real numbers. An in- 
equality that holds for every real number (e.g., 
x? z 0) is called an absolute inequality; other- 
wise it is called a conditional inequality. When 
we are given a conditional inequality (e.g., 
x(x — 1) « 0), the set of all real numbers that 
satisfy it is called the solution of the inequality. 
The process of obtaining a solution is called 
solving the conditional inequality. 


B. Solution of a Conditional Inequality 


Suppose that a conditional inequality is given 
by f(x) - 0 (or f(x) 2 0), where f is a continu- 
ous function defined for every real number. 

If the equation f(x) —0 has no solution, then 
we have either f(x)>0 or f(x) «0 for all x. 
On the other hand, if « and fi (x< fj) are adja- 
cent roots of the equation f(x) 20, the sign 

of f(x) is unchanged in the open interval (o, fj). 
Therefore the solution of the given inequality 
depends essentially on the solution of the 
equation f(x) — O0. If inequalities involve two 
variables x, y and are given by f(x, y) 0, 

g(x, y)» 0 for continuous functions f and g, 
the solution is, in general, a domain in the xy- 
plane bounded by the curves f(x, y) 0 and 
g(x, y) « 0. Similar results hold for the case of 


inequalities involving more than two variables. 


C. Famous Absolute Inequalities 


(1) Inequalities concerning means (or averages): 


Suppose that we are given an n-tuple a= 


(44, ...,a,), a, 2 0. We set 
1 n 1/r 

Mojo C a) | 
Hui 


If at least one a, is 0 and r «0, we put M,=0. 
In particular, we put 


A=M,, 
n Lin 
G —lim "- (TI a) : 
Weg v-1 
H-M.; 
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these are called the arithmetic mean, geometric 
mean, and harmonic mean of a, (v= 1, ..., n), 
respectively. Except when either all a, are 
identical or some a, is 0 and r «0, the func- 
tion M, increases 'strictly monotonically as r 
increases, and M, mina, (r> —oo), M,» 
max a, (r> +00). Therefore we always have 
mina, « M, x max a,. In particular, we have 
H «G«A if the a, are all positive and not 
all equal. 

Let p(x) (70), f(x) (20) be fintegrable func- 
tions on a *measurable set E. We put 


lir 
wi sas] pdx) , rÆ0. 


Furthermore, if M,(f) is strictly positive for 
some rz 0, we put 


Mo(f)= lim MAT) 


-(| plog fas] | pis) 
E E 


We call M,( f) the mean of degree r of the 
function f(x) with respect to the weight func- 
tion p(x). It has properties similar to those of 
M,(a). In particular, when the weight function 
p=1, the means MOL Moa( J), M_,(/) are 
called the arithmetic mean, geometric mean, 
and harmonic mean of f, respectively. 

(2) The Hólder inequality: Suppose that 
p#0, 1 and (p—1)(q—1)=1; that is, 1/p + 
1/q — 1, and a, >0, b, >0. Then, in general, 
we have the Holder inequality: 


1/p l/a 
GEN (xa) . wee 


where the inequality signs in the first inequal- 
ity are taken in accordance with p « 1 or p» 1. 
The summation may be infinite if the sums 
are convergent. The inequality sign is replaced 
by the equality sign if and only if there exist 
constant factors À and u such that Aa? = ub? 
for all v. The Hólder inequality for p=q=2 
is called the Cauchy inequality (or Cauchy- 
Schwarz inequality). 

For two measurable positive functions f(x), 
g(x), we have the Hólder integral inequality: 


l/p 1/q 
| naz, f prax) (| grax) e PSl, 
E E E 


except when there exist two constant factors 4 
and u ((A, u) z (0, 0)) such that Af? = ug* holds 
‘almost everywhere. The above inequality is 
replaced by equality if and only if we are in 
this exceptional case. The case where p=q=2 
is called the Schwarz inequality (or Bunyakov- 
skii inequality). 

(3) The Minkowski inequality: Suppose that 
p#0, 1 and a,» 0, b, — 0. Then we have the 
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Minkowski inequality: 
l/p 1/p l/p 
(Zarar) (zat)" dx, 


except when {a,} and {b,} are proportional. 
The inequality is replaced by equality if and 
only if we are in the exceptional case. 

The corresponding Ge inequality for 
positive functions f(x), g 


Lues! al Di 


grax) , psi, 
E 


psi, 


except when f(x)/g(x) is constant almost every- 
where. The inequality is replaced by equality if 
and only if we are in the exceptional case. 


D. Related Topics 


Absolute inequalities are important in anal- 
ysis, especially in connection with techniques 
to prove convergence or for error estimates. 
However, there seldom are general principles 
for deriving such inequalities, except for a few 
elementary theorems. 

For other famous inequalities ^ Appendix 
A, Table 8. For related topics — 88 Convex 
Analysis; for convex functions and their ap- 
plications — 255 Linear Programming; for 
linear inequalities — 212 Inequalities in 
Physics. 
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212 (XX.36) 
Inequalities in Physics 


A. Correlation Inequalities 


Let u be a probability measure on a space X 
(with c-field 2) and f; be measurable functions 
on X, and write 


Sd = Em 


A number of inequalities among such expec- 
tations under a variety of conditions on the 
measure y and functions f, are known as cor- 
relation inequalities, after their original occur- 
rence for correlation functions in the statistical 
mechanics of lattice gases. 

The earliest results were obtained by Grif- 
fiths (J. Math. Phys., 8 (1967)), for the case 
where X is the product TT, I; of two point 
sets I, — (1, —1}, the f are kth coordinate func- 
tions e, —o,(x) (= +1) of xe X, du(x) is the 
counting measure multiplied by a Gibbs factor 
Z !e 9) with a ferromagnetic Hamiltonian 


fax) dux). 


H(x)= c aar a(x)o(x), J 


i<j 


=J;;20, 


ij 
f 2:0, and the normalization factor Z = 
Xe fil His conclusions are 
<6,6,>20 (Griffiths’s first inequality), 
(0,0,0,0,? Z (0,01? Fm Fn > 


(Griffiths’s second inequality). 


These were extended by Kelly and Sherman 
(J. Math. Phys., 9 (1968)) as 


(04? z0 
(0408? 2 (04) Og» 


(GKS first inequality), 


(GKS second inequality), 


where A and B are subsets of (1,..., N}, o4= 


IL. os, and H— —3X,..J40, with J,>0.A 
further generalization (for example, 7; 2 R) can 
be found in [1,2]. 
Under the same situation with 
zs s -Xhe. Jet, h; z 0, 
i<j =1 


the following GHS inequality by Griffiths, 
Hurst, and Sherman (J. Math. Phys., 11 (1970)) 
holds: 


ër A — (5) X059.) — (Gj de 
EENEG EE KEO 
If ^; — 0 for all i, then 
(8,60, — (6,0, dee — (691) (6,9, 
— (0,6) (0,6) - 249,0) (66) (6,6,) <0. 
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Further generalizations can be found in [2, 3]. 
and the references quoted therein. 

Let X be a finite distributive lattice, u be 
a positive measure satisfying the condition 
His ^y)u(x v y) 2 n(x)u(y), and f and g be both 
increasing or both decreasing functions on X. 
Then the following FKG inequality by For- 
tuin, Kasteleyn, and Ginibre (Comm. Math. 
Phys., 22 (1971)) holds: 


(fav 2f». 


B. Inequalities Involving Traces of Matrices 


` Let tr A denote the trace of a matrix A. Some 
of the earlier results applied to statistical 
mechanics are as follows, where p 20, o z: 0, 
A* = A, B* =B: 


tr(plogp—ploga—p+oa)20 
(Klein mequality), 
tr(e4**)/tre4 z exp(tre^B/tr e^) 
(Peierls-Bogolyubov inequality), 
tr(e^eP) 2 tr(e4* P) 
(Golden-Thompson inequality). 


The following inequality is related to the last 
inequality and was proved for a general mz 0 
by Lieb and Thirring (Studies in Mathematical 
Physics, Lieb, Simon, and Wightman (eds.), 
Princeton Univ. Press, 1976): 


tr(pc)" «tr(p"o"), pz0, o0. 


For the Hilbert-Schmidt norm || A||y s. = 
(tr A* A)? and the trace-class norm TA = 
tr| Al, where | A| 2 (4* A)2, the Powers-Stormer 
inequality (Comm. Math. Phys., 16 (1970)) 
holds: 


lp? oi >l- olas) p20, o20. 


Araki and Yamagami (Comm. Math. Phys., 81 
(1981)) give 


ICI- IDI hus € 21 —Dllns.- 


If C* =C and D* =D, then 
removed. 

The entropy $(p) 2 —trplogp is a concave 
function of p zU satisfying the triangular 
inequality (Araki and Lieb, Comm. Math. 
Phys., 18 (1970): 


S(p1) + S(p2) 25(913) 2 I(91) — Stol 


and strong subadditivity (Lieb and Ruskai, J. 
Math. Phys., 14 (1973)): 


S(0153) —5(912) —S(p13) + S(p1) <9, 


where p, is a matrix on the tensor prod- 
uct space H, © H, Q Hs, py, —tr;ipizs Pi= 
tr;tr, p,55 ((5j, k) = 11,2, 3}) and tr; is a 
(partial) trace taken only on the space H;. The 


2 can be 
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last inequality is based on the concavity of 
the function f(p)=trexp(A + log p) in p z0 
(A* — A) proved by Lieb [4] (also see Epstein, 
Comm. Math. Phys., 31 (1973)), who also 
proved the joint concavity of tr(C*p*Co’) 
(r>0,s>0,r+s<1)in p>0 and o zO. This 
concavity for the case r—s— 1/2 was previ- 
ously proved by Wigner and Yanase (Proc. 
Nat. Acad. Sci. US, 49 (1963)), and the general 
case has been conjectured by Wigner, Yanase, 
and Dyson. It leads to the joint concavity of 
the relative entropy S(p,a)=tr p(log p —logo) 
(defined to be +œ if ov =0 and py £0 for 
some vector y) in p20 and o >0 as well as its 
monotonicity S(a(p), a(c) x S(p, c) for any 
trace-preserving expectation mapping «. (For 
entropy, see also [5].) 

The above results have generalizations in 
the context of von Neumann algebras (Araki, 
Publ. Res. Inst. Math. Sci., 11 (1975); 13 (1977); 
Uhlmann, Comm. Math. Phys., 54 (1977); [6]). 


C. Operator Monotone and Operator Convex 
Functions 


A real-valued function f(x) defined on an 
interval J (finite or infinite; open, half-open, or 
closed) is called matrix monotone increasing 
(decreasing) of order m if f(A) 2 f(B) whenever 
m x m Hermitian matrices A and B with their 
eigenvalues contained in J satisfy A> B (A < B) 
and is called operator monotone if it is matrix 
monotone of order n for all positive integers 
n. f(x) is called matrix convex of order m if 
f[(1 —)4 tB] «(1—£0)/(A) - tf'(B) for0xt« 
1 and all m x m Hermitian matrices A and B 
with their eigenvalues in J and is called opera- 
tor convex if it is matrix convex of order n 

for any positive integer n. The functions x? 
(0x ox 1), logx, and —(x--a) ! (az 0) are 

all operator monotone increasing in the half- 
line interval x >0. The functions (x +a) ! 

and xlogx are operator convex in the same 
interval. 

A function f(x) is operator monotone in- 
creasing in an open interval (a, b) if and only if 
it is analytic in (a, b) and has an analytic con- 
tinuation to the whole upper half-plane where 
the function has a nonnegative imaginary part 
[7]. Another necessary and sufficient condition 
is 


Y SEEM) c rk 1! 20 
i,k=0 


for all x e (a, b), for all real £, and for all posi- 
tive integers N. An operator monotone func- 
tion f(x) on an open interval ( — R, R) has the 
integral representation 


foe fro | xa —tx)du(t), 
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where u is a probability measure with its sup- 
port contained in[ —R , R ! ] and, if f is 
not a constant, is uniquely determined by f. 
The set of all extremal points of the set of all 
operator monotone increasing functions on 
(—1, 1) satisfying f(0) — 0 and f (0) — 1 is exact- 
ly the set of functions x/(1 —tx), |t| < 1, which 
appear as integrands in the above formula. An 
operator monotone function on R must be 
linear. 

An operator convex function in an open in- 
terval (— R, R) has the integral representation 


Til f(0) + f'()x - C/7(0)/2) ET —tx)du(t), 


where u is a probability measure with its sup- 
port contained in[ —R , R ! ] and, if f is 
not linear, is uniquely determined by f. An 
operator convex function in R must be at most 
a quadratic polynomial satisfying f "(0) z 0. 
For a continuous real function on an inter- 
val [0, x) (0 € x < oo), the following four condi- 
tions are mutually equivalent: (i) f is operator 
convex and f(0) <0, (it) f(a* Aa) € a* f(A)a 
for any self-adjoint operator A with its spec- 
trum in [0, x) and any operator a with its 
norm not exceeding 1, (iti) the preceding condi- 
tion with a limited to projections, (iv) x ! f(x) 
is operator monotone increasing in an open 
interval (0, æ). (See [8-11] and Hansen and 
Pedersen, Jensen's inequality for operators and 
Lówner's theorem, Math. Ann., 258 (1982), 
229—241. 
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A. General Remarks 


The mathematical theory of information trans- 
mission in communication systems, first devel- 
oped by C. E. Shannon [1] and now called 
information theory, is one of the most impor- 
tant fields of mathematical science. It consists 
of two main themes, channel coding theory and 
source coding theory. The purpose of channel 
coding theory is to ascertain the existence of 
schemes for transmitting information or data 
reliably over a noisy channel at a fixed trans- 
mission rate. The purpose of source coding 
theory is to show the existence of schemes for 
compressing data emitted from an information 
source and reproducing them within tolerable 
limits of distortion. Both theories are based 
profoundly on the theory of probability, statis- 
tics, and the theory of stochastic processes. 


B. Entropy 


In information theory it is customary to call 
a finite set A = {a,,...,%,} an alphabet and to 
call its elements the letters of the alphabet. The 
simplest model of information sources consist- 
ing of an alphabet A and a tprobability dis- 
tribution p over A is denoted by X =[4A, p], 
which can be regarded as a ‘finite probability 
space, where the probability of outcome of a 
letter ae A from the source is denoted by p; — 
p(a;). A real-valued function defined by I(a;) — 
— log p(a;) is called the self-information of 

the event «,¢ A. The *mean value of the self- 
information, i.e., 


H(X)= 2. — p(a)log pla), 
= 
is called the entropy of the source. This can 

be interpreted as a measure of the average a 
priori uncertainty as to which letter will ema- 
nate from the source, or a measure of the 
average amount of information one obtains 
upon receiving a single letter from the source. 
The unit of information or entropy for base e 
logarithms is called a nat, while that for base 2 
logarithms is called a bit. 
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Let A={a,,...,0,} and B={f,,...,B,,} be 
two alphabets. Let r(a;, H) be a *joint proba- 
bility distribution defined on the product AB, 
and denote the *probability space by X Y= 
[AB,r]. Then the joint distribution gives rise 
to tmarginal distributions p(«,;)= X74, r(a;, f;) 
and q(fj) = Èi- r(%;, fj) and to *conditional 
distributions p(a;| 5j) — r(;, B)/q(B) and q(B;| «;) 
— r(o;, B;)/p(a,). Probability spaces X =[A, p] 
and Y=[B,q] are subspaces of the probability 
space X Y=[AB,r]. The entropy of XY = 
[AB,r] is defined by 


num Y —r(x;, Blog r(a;, B) 


i=1 j= 
as well. A real-valued function /(a;| B) — 
— log p(a;| fj is called the conditional self- 
information. The average of I(a;| fj) over o; 
defined by 


n 


H(X|p)— 2, PI M(o;| Bj) 


is called the conditional entropy of X for given 
Deh The average of H(X |) over fj; defined 
by 
HX IY)= Y. quf) HUC) 

is called the conditional entropy of X for given 
Y. The conditional entropy of Y for given X, 


denoted by H(Y| X), is defined similarly. Then 
it can be shown that 


H(XY)- H(Y)-- H(X| Y) 2 HX) - H(Y| X), 


H(XY)« H(X)4- H(Y), 
H(X|Y)« H(X), 
H(Y| X)« H(Y), 


where equalities in the last three expressions 
hold if and only if r(«;, fj) = p(;)g(f) for all 
x, € A and fe B. 


C. Information Sources 


Given a finite alphabet A, we consider the 
finite product set A= A x ... x A and the 
doubly infinite product A? Il... A,, where 
A,=A,k=0, +1, +2,.... Let Z be the o- 
algebra generated by all fcylinder sets in A7. 
Given a tprobability measure P over ¥,, an 
information source is defined as a probability 
space [ A47, P] or as a trandom process X = 

eX 4 Xo X, X, .... When P is invariant under 
the tshift transformation T on A7. i.e., P(E)= 
P(TE) for any Ee £,, then [ AZ, P] is said to 
be a stationary source. In particular, if P(E) — 
0 or 1 whenever TE = Ee F,, then the infor- 
mation source is said to be ergodic. If X is an 
"independently and identically distributed 
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random process, the source X ^ [ A7, P] is said 
to be memoryless. 

For a given X, denote the subsequence 
X, ... Xy of X by X". Then a probability mea- 
sure P on A" and a finite probability space 
X^ - [AP, P] are naturally induced. The en- 
tropy of the stationary information source 
X-[A7, P] is defined as H(X)=limy ,,, H(X*)/ 
N or as H(X)—limy ,,, H(Xy| X" TL because 
both limits exist and are identical. If X is 
memoryless, the entropy of X is SE 
to that of X, — ISP ie, H(X) 2 X, — 
p;log p;, where p, — p! (X, ad and H(X*)= 
NH(X!)- NH(X). 


D. Source Coding Theorem 


Let x =x; ... xy be a sequence of N consecu- 
tive letters from the source [A7, P]. Suppose 
that we wish to encode such sequences into 
fixed-length code words u^ =u, ... Hr consist- 
ing of L letters from a code alphabet U of 
size v. The number R — (log vP/N is called the 
coding rate per source letter. A mapping 9: 
AP S U' is called an encoder and 9: U> A" a 
decoder. The set U^ with specified encoder- 
decorder pair [ọ, y] is called a code with rate 
R=(Llogv)/N. The error probability of the 
code is defined by 


Plo v] PD (o(X7) X]. 


The fixed-length source coding theorem 
[1,2] states: Let X2 (A7, P] be a stationary 
ergodic source. Then for any 0 0, if R > H(X) 
+ò, there exists a code [ 9, w] with rate R such 
that the error probability P,[q, y] can be 
made arbitrarily small by making N suffi- 
ciently large. Conversely, if R x H(X)—6 then 
for any code with rate R, P,[«, y] must be- 
come arbitrarily close to 1 as Noo. 

This theorem is trivial if R>logn. For mem- 
oryless sources, the theorem follows immedi- 
ately from the tweak law of large numbers, 
which implies that for arbitrary e — 0 and 6>0 
there exists an integer Nie, ô) such that for all 


N >N, (e, éi 
ke 


N N 
d —log P(X”) 
The validity of this property for stationary 
ergodic sources was proved by B. McMillan 
[2]. In case of memoryless sources, the exact 
asymptotic form of the error probability for 
an optimal code with rate R was given by 
F. Jelinek [3], I. Csiszár and G. Longo [4], 
Blahut [5], and Longo and A. Sgarro [6] as 





H(X)| > 





1 
lim ——logP,[o,y]- min D f 
lim —47log «Lo. V] in, (qlip) 


where p is the source distribution, q denotes a 
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probability distribution on A, and 
qi 


D(q|p)— > «log. 
i-] D 


which is called Kullback's discrimination in- 
formation [7] or the divergence. 

Source sequences x" can be encoded into 
variable-length code words consisting of letters 
from alphabet U of size v. A set of n" code 
words is called a prefix condition code if there 
is no code word which is equivalent to the 
prefix of any other code word. Denote the 
` length of the code word corresponding to a 
source sequence x" by L(x^) and the average 
length of the code words per source letter by 


1 I l Ni N N 
N 2 (x )L(x^). 
The variable length source coding theorem 
states: Given a memoryless source X with 
entropy H(X), there is a prefix condition code 
such that the average length of the code words 
per source letter satisfies 


> 1 
H(X)< Llogv<H(X)+ 5 logy. 


This theorem is valid for stationary ergodic 
sources if (log v)/N in the last term is replaced 
by e(N), where £(N)0 as N> oo. 

These two theorems are referred to as noise- 
less source coding theorems. 


E. Source Coding Theorem with Fidelity 
Criterion 


The noiseless source coding theorem implies 
that the average number of code letters per 
source letter can be reduced to the source 
entropy H(X) under the requirement that the 
source sequence be exactly reproduced from 
the encoded sequence. If an approximate rc- 
production of the source sequence to within a 
given fidelity criterion is required, the coding 
rate per source letter must be reduced further 
to a certain value below the source entropy. 

Suppose that a distortion measure do, o) is 
defined for a. o;€ A, where it is assumed that 
da, x) 20 and dia, ;) 2 O0. For blocks x* = 
Xp: Xy and y — y; ... yy, define 


1 N 
dy(x", yy D de Vx). 
k=1 


Any set € — (yf, ..., yar}, YA € A^, of reproduc- 
ing words is called a source code of block 
length N. Each source sequence x" of the 
source X =[A%, P] is mapped into whichever 
code word y eg minimizes d, (x, y"), Le. 


dy(x", Ym) = min dy (x, y^, 
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and the suffix m is transmitted. Hence the 
coding rate per source letter is R — (log MYN, 
and the average distortion is 
dy(@)= Y, P"(x")mindy(x", y"). 

xNe AN did 
The problem is how far we can reduce the rate 
under the condition that the average distortion 
keeps satisfying a given fidelity criterion, which 
is specified as a maximum tolerable value d for 
the average distortion. 

The source coding theorem with a fidelity 
criterion states: Let X — [A7, P] be a memory- 
less source. For any specified d 20, any e» 0, 
and à —0, there exists a source code € with 
rate R > R(d) 4- A and with sufficiently large 
block length N for which the average distor- 
tion satisfies 


dy() « d tt, 


where R(d) is the rate distortion function de- 
fined by 


R(d)= min, I(p; W), 
n n WwW cl: 
I(p;W)= 25 2; Pi Wülilog— HI 
SE py p,WGli) 


TE 


vi-tw. È nWulides sie) 
Kafe 


where W(j|i) denotes a conditional proba- 
bility distribution referred to as the test chan- 
nel, and where F(p; W) is called the mutual in- 
formation. It should be noted that R(0)= H(X). 
The rate distortion function R(d), which was 
first defined by Shannon [1,8], is closely re- 
lated to the -entropy introduced by A. N. 
Kolmogorov [9]. R(d) is a monotonically de- 
creasing and convex function. 

The theorem was first proved by Shannon 
[8], and was extended by R. G. Gallager [10] 
to the case of stationary ergodic sources with 
discrete alphabets and by T. Berger [11] to 
stationary ergodic sources with abstract al- 
phabets. More recently, R. G. Gray and L. D. 
Davisson [12] have proved source coding 
theorems without the ergodic assumption 
for stationary sources subject to a fidelity 
criterion. The proof was based on Rokhlin's 
ergodic decomposition theorem [13]. Other 
important source coding theorems have been 
obtained by F. Jelinek [14] for tree codes, A. J. 
Viterbi and J. K. Omura [15] for trellis codes, 
and Gray, D. L. Neuhoff, and D. S. Ornstein 
[16] for sliding block codes. 

The rate distortion function for memoryless 
Gaussian sources subject to squared error 
distortion was given by Shannon [8], and that 
for autoregressive Gaussian sources was deter- 
mined by Kolmogorov [9], M. S. Pinsker 
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[17], Berger [11], Gray [18], and T. Hashi- 
moto and S. Arimoto [19]. 


F. Channel Coding Theory 


A mathematical model for a channel over 
which information is transmitted is specified in 
terms of the set of possible inputs, the set of 
outputs, and a probability measure on the 
output events conditional on each input. The 
simplest channels are the noiseless ones, for 
which there is a one-to-one correspondence 
between input and output and no loss of in- 
formation in transmission through the chan- 
nel. The second simplest channels are discrete 
memoryless channels (DMCs) which are de- 
fined as follows: The input and the output are 
sequences of letters from finite alphabets (say, 
«;€ A and f,e B), and the output letter at a 
given time depends statistically only on the 
corresponding input letter. That is, à DMC is 
characterized by a fixed conditional proba- 
bility distribution W(j|i)= W(f;| x) because 
the probability measure on the input and 
output sequences satisfies 


N 
WSQISEDSSTT W(ylx),  x,€A, y,eB. 
=1 


By U — (1, ..., M} we denote the set of integers 
each of which is assigned to each correspond- 
ing possible message from the source. A map- 
ping 9: U — A" induces a collection @ = (x, 
..., XM], called the block code with rate R = 
(log M)/N; each element is called a code word. 
Only code words are transmitted over the 
channel. A mapping y: DN — U is called the 
decoding. Thus, given an encoding and decod- 
ing pair [ọ, y], the error probability is defined 
by 


1 M 
P [oy] E A7 W(y"|o(m)), 


where the summation X* is taken with respect 
to all y" such that y(y") z m. The capacity for 
a DMC is defined by 


C —max I(p; W) 
p 


T dé W(j|i) 
-— max Mi log. 
Dé PMU Doss WTO 

The fundamental theorem of channel coding 
theory states: Given a DMC with capacity C > 
0, there exist a block code with rate R below 
capacity C, a pair [g, Y], and a function E(R) 
—-0forOxR-«C such that the error proba- 
bility satisfies 


P,[o. v] «exp( — NE(R)]. 


This was first discovered by Shannon [1], 
and its precise proof was first given by A. 
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Feinstein [20]. The precise expression of E(R) 
was subsequently given by P. Elias [21], R. M. 
Fano [22], and Gallager [23]. The best ex- 
pression of E(R), due to Gallager [23], is 


E(R) 2 max (E,(R), EAR + O(N 101, 


where 


E,(R)= max max| -oR 


O<p<l P 
ltp 
eet {Zawya} | 
j i 


E,,(R) - supmax | —pR 


Lin 
= logy DP E W(j|i)W(j| d | 


The converse of the fundamental thcorem 
states: Given a DMC with capacity C, for any 
block code with rate R above C and any pair 
Ly, Y], the error probability satisfies 


P9, V] 1—exp{—NE(R)}, 


where E(R) is a function positive for R » C. 
This was first proved by J. Wolfowitz [24], 
and the precise expression, found by S. Ari- 
moto [25], is described by 


-1€px0 p 


E(R)= max min | —pR 


1tp 
XL 


The fundamental theorem of coding theory 
and its converse imply that the capacity is a 
critical rate; above the capacity, information 
cannot be transmitted reliably through the 
channel. Unfortunately, there is, in general, no 
direct method for computing the capacity, and 
therefore an iterative method was proposed by 
Arimoto [26]. Another iterative method for 
computing the rate distortion function was 
given by R. E. Blahut [27]. 

A discrete channel with memory is, in gen- 
eral, defined by a list of probability measure 
(W,,xe A*) on a measurable space (B7, Fp} 
such that for each Fe Fg, W,(F) is a *measur- 
able function of x. A channel W is called sta- 
tionary if W,,(TF)= ME) for all xe A? and 
all Fe Fp. Given an input source [ A7, P] and a 
channel W, connecting the input to the chan- 
nel induces a joint process of input and out- 
put, denoted by [47 x B^, PW] where PW isa 
measure on (A7 x B”, ¥,, el, If a joint process 
[A^ x B^, PW] is stationary the mutual in- 
formation between input and output is defined 
by 


I(X; Y) = H(X)+ H(Y)— H(XY). 


A channel W is called ergodic if for every 
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ergodic input [ A47, P] the induced joint process 
[A7 x B^, PW] is ergodic. The channel capac- 
ity for a discrete stationary channel is defined 
in various ways. The important ones are the 
ergodic capacity defined by C, —sup. of I(X; Y) 
with respect to all stationary ergodic sources 
X=[A%, P], and the stationary capacity de- 
fined by C, = sup. of I(X; Y) w.r.t. all station- 
ary sources. K. R. Parthasarathy [28] proved 
C, — C, for all discrete stationary channels. 

In another way J. Nedoma [29] introduced 
the operational source/channel block coding 
capacity C,.,, which is defined as the supre- 
mum of the entropies of all admissible sta- 
tionary ergodic sources in the sense that there 
exist source/channel block codes such that 
the error probability P,—0 as block length 

N 5 oo. Nedoma [29] also pointed out an 
example of a stationary channel where C, > 
Ca Hence block coding theorems have been 
proved for various channels: for finite memory 
channels by A. I. Khinchin [30], K. Takano 
[31], Nedoma [29], and Feinstein [32], for 
d-continuous channels by Gray and Ornstein 
[33], and for almost finite memory channels 
by D. L. Neuhoff and P. C. Shields [34]. 
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A. General Remarks 


Insurance is a system in which a large number 
of people contribute a small precalculated 
amount of money (called a premium) to fill the 
economic need that arises when a person 
meets adversity. The amount of economic need 
filled by this system is called the amount of 
insurance (or amount insured). The insurer is 
the one who implements the system. Actuarial 
mathematics is the branch of applied mathe- 
matics that studies the mathematical basis of 
insurance, one of the first cases in which math- 
ematics was successfully applied to a social 
question. Actuarial mathematics can be di- 
vided into two branches according to its ap- 
plication. The first includes the calculation of 
various values of each individual policy, such 
as premiums or reserves. The second is mainly 
connected with management of an insurance 
business and includes the study of reinsurance 
systems, of the maximum amount of insurance, 
of the contingency fund, or the analysis of 
profits. There is only one basic principle in 
actuarial mathematics, called the principle of 
equivalence. It determines the premium and 
reserve in each year so that the present value 
of future premium income of the insurer is 
equal to the present value of future benefits for 
each policy. 

The basic factors of actuarial calculations 
are (1) probabilities of contingencies, (2) an 
expected rate of interest in the future (often 
referred to as thc assumed rate of interest), 
and (3) cost of administration of the system. 
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Premiums are calculated using these factors 
and the principle of equivalence. The follow- 
ing is an example of the classical method 

of calculation for a life insurance policy 

with the use of “commutation symbols,” 
which is an old device for the convenience 
of calculations. 

We write P for the net premium (in which 
the cost of administration is disregarded), P’ 
for the gross premium, 7, for the amount of 
death benefits payable in the tth year after the 
policy is issued, E, for the amount of survival 
benefits payable at the beginning of the tth 
year, n for the period for which the insurance 
is effective, and m for the period for which pre- 
miums are to be paid. Let o, fj, and y stand for 
three positive constants determining the initial 
expenses —a(T, or E,), the premium collection 
expenses = DÉI. and the general expenses for 
maintenance = »(T, or EL The factor that 
comes into consideration next is a model of 
human death and survival (measurement). 
Assume that /, is the number of lives attauung 
age x, and write q, for the probability that a 
life of x years will end within one year. Then 
d,, the number of lives ending within one year 
out of /,, is l qx, and [,.,, the number of lives 
remaining after one year at age x + 1, is [, — 
d, —l,(1 —q,). The commutation symbols 
commonly employed are defined as follows: 
Write v=1/(1+i), where i is the assumed rate 
of interest; then 


For a policy issued at an insured person's age 
x, the present value of the insurer’s future 
income can be expressed as P'(N, — N,,,)/D,, 
and the present value of his future payments 
can be expressed as 


1 n nti 
zd TCs -1 + > E,D. 4-4 
x \t=1 t=1 


+a(T, or E,)D.+y Y (T; or EAD, 
Cen 


+ BP(N,— Al 


By assuming that the present value of the 
future income is equal to the present value of 
the future payments, the value of the gross 
premium P' is obtained. (The P' obtained from 
the assumption «= fj — y 20 is denoted by P 
and is called the net premium. The difference 
P' —P is called the loading.) For a policy in 
which benefits are payable on disability or 
contingencies other than death, we have only 
to obtain a model of contingencies and apply a 
similar calculation. 
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B. Liability Reserve 


During the term of an insurance contract, it 
often happens that the present value of the 
future income is less than the present value of 
the future payments. If this is the case, the 
difference is to be held by the insurer as the 
liability reserve. The source of this fund is the 
past premium income plus interest. The net 
premium reserve, which disregards expenses, is 
calculated as 





1 n nti 
y- ( Y LC. »» ED, 


D, r-tt1 r=t+1 


m P(N,., m Neon) . 


Between the net premium P and the net 
premium reserve V, we have the relation 


P=0V— Va) tT Wart Er 


The first term of the right-hand side of this 
formula is called the savings premium, since it 
is the amount left out of the premium income 
of the tth year and added to the reserve. The 
second term is called the cost of insurance or 
risk premium and is applied to cover the dif- 
ference between the amount of insurance and 
that of the existing reserve in case the contin- 
gency of death arises. The third term is applied 
to the payment of the survival benefits (or 
annuities in case of an annuity contract). If 7, — 
V,, the amount of risk insured by the insurer, 
is positive for all values of t during the period 
of insurance, the policy is called death insur- 
ance. On the other hand, if the value of T, — V, 
is negative for all values of t, the policy is 
called survival insurance. If the value of 7T; — V, 
varies between positive and negative according 
to the different values of t, the policy is called 
mixed insurance. If 7; is always equal to V,, the 
policy constitutes mere savings. Most of the 
insurance policies issued today are one or 
another type of death insurance, while life 
annuity policies are a type of survival insur- 
ance. For a long time, studies have been made 
of the effect on premiums and reserves of 
changes in the three basic factors (1), (2), and 
(3) in Section A. 


C. Risk Theory 


Risk theory occupies a special position in the 
field of actuarial mathematics. Actuarial math- 
ematics was first born from the theory of prob- 
ability. Since the modern theory of proba- 
bility based on measure theory was developed 
by A. N. Kolmogorov and other mathema- 
ticians (— 342 Probability Theory), new ap- 
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proaches have inevitably been made to ac- 
tuarial mathematics. An outstanding example 
is risk theory. 

Risk theory can be divided into two 
branches. One is called classical risk theory 
(or individual risk theory), in which the profit 
or loss that may result during a certain term 
of an insurance contract is regarded as a tran- 
dom variable. Since the insurer's profit equals 
the sum of these random variables over all the 
individual contracts, various probability func- 
tions can be obtained by applying the theory 
of probability. The second, called collective 
risk theory, pays no attention to each indiv- 
idual contract but studies changes in the 
insurer's balance as a whole with the lapse of 
time. The basis of collective risk theory was 
given by F. Lundberg, H. Cramér, and other 
mathematicians. 

We explain the collective risk theory follow- 
ing Cramér [5]. For simplicity we consider an 
insurer who issues no policies other than death 
insurance and makes no expenditures except 
the policy claims. Suppose that during the 
time interval (t, t + At) contingency occurs with 
probability 2At + o(At) independently of the 
past. We denote by F(x) the 'distribution 
function of the amount to be paid by the in- 
surer when a contingency occurs. Then the 
number of contingencies in (0, t), N(t), is a 
*Poisson process with parameter 4, and the 
total expenditure of the insurer during the 
time interval (0, t), X(t), is a fcompound Pois- 
son process (— 5 Additive Processes) such that 


E(e'**9) 2 exp (af (e — naro); 

0 
If p is the premium income per unit time, then 
we have p=A($ xdF(x), because E(X (t)) - pt 
holds by the principle of equivalence. If u 
denotes the initial fund of the insurer, then the 
fund reserved at time t, Y(t), equals u + pt — 
X (t). Y(t) is a *Lévy process (— 5 Additive 
Processes) such that 


E(ei?Y()) 
=exp (ia +At | (e**—1— voll 
0 


The probability that Y(t) «0 for some time 

t « oo is called the ruin probability, which de- 
pends on the initial fund u, denoted by y(u). 
This satisfies the following tintegral equation 
of Volterra type: 


P yw) K Q(x) dx-+ | Vo)Q(u— x)dx 


(Q(x)=1—F(x)), 


from which one can derive an asymptotic rela- 
tion p(u)~ Ce *" as uo, where R and C 
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are positive constants depending only on A 
and F. See [6] for recent developments in risk 
theory. 
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A. General Remarks 


Integer programming, in its broadest sense, 
addresses itself to either minimization or maxi- 
mization of a functional f over some discrete 
set S in R”, but it is usually understood as 
dealing with questions related to linear pro- 
gramming problems (— 255 Linear Program- 
ming) with additional integrality conditions 

on the variables, namely, the problem Po: 
Minimize {c’x| Ax =b, x 2 0, x; an integer, j= 
1,...,n,(<n)}, where AcR"*", beR", ceR" 
are given data and x —(x,, ..., x,)'eR"isa 
vector. P, is called a pure integer programming 
problem or an all-integer programming prob- 
lem if n=n,, and a mixed integer program- 
ming problem if n, « n. In particular, P, is called 
a 0-1 integer programming problem if all the 
integer variables are restricted to be equal to 
either 0 or 1. We write 


X? — (xeR"| Ax 2b, x, 2 0,j 1, ..., n], 
X! = X? ixeR"|x; is an integer, j— 1, ..., 74], 


and assume for simplicity that (i) X7 # Ø, (ii) 
X? is bounded, and (iii) all the components of 
A and b are integers. P, arises not only asa 
mathematical model for an optimization prob- 
lem where some of the decision variables 

have indivisible minimum units but also as one 
for many optimization problems with some 
logical and/or combinatorial constraints [1]. 
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Algorithms for solving P, can be classified into 
algebraic methods and enumerative methods. 
Both are based upon linear programming and 
relaxation techniques [2] in which some of 

the constraints of P) are temporarily relaxed 
(— 264 Mathematical Programming). 


B. Cutting Plane Methods 


The origin of this class of algorithms is the 
fractional cutting plane algorithm proposed 

in 1958 by R. E. Gomory [3], the outline of 
which is described as: 

(1) Let X =x”. 

(2) Solve a linear programming problem: Mini- 
mize (e'x|xe Xj, and let x be its toptimal 
solution. If the x; are integer for all j & n,, then 
stop (X is optimal to P5). Otherwise, go to (3). 
(3) Generate a half-space H = (xeR"|n'x 2 no} 
(re R", xy €& R!), where H (i) contains X! and (ii) 
does not contain x. Return to (2) by replacing 
X by XANH. 

Here, a linear programming problem ob- 
tained by relaxing integrality conditions is 
solved, and as long as X¢ X’, an inequality 
satisfying the two conditions (i), (ii) of (3) ts 
introduced. Such an inequality xx 2 no or 
equality m’x = z is called a cut or a cutting 
plane. Gomory devised a Gomory cut using 
(relaxed) integrality conditions on the vari- 
ables, and showed that the algorithm above 
produces a point of X” in finitely many steps. 
Some of the other algorithms using cutting 
planes are the all-integer algorithm (Gomory, 
1963) and the primal all-integer algorithm 
(R. D. Young, Operations Res., 16 (1968)). 
These algorithms, however, are generally slow 
and behave erratically, so that it is believed 
that they cannot in practice serve as general- 
purpose algorithms. 


C. Other Algebraic Methods 


Gomory [4], again in 1965, proposed a group- 
theoretic approach to Pj. This method is based 
upon the following observation: Let x, be the 
vector of tbasic variables (— 255 Linear Pro- 
gramming) associated with a tdual feasible 
basis of a linear programming problem P: 
Minimize (e'x|xe X°}, and let X! be the set 
generated from X” by relaxing the nonnegativ- 
ity constraints on xg. Then X! can be shown 
to have a ‘cyclic group structure, so that a 
group minimization problem P,,: minimize 
íe'x|xe X!| can be solved as a tshortest path 
problem on a directed graph with a special 
structure (— 186 Graph Theory). If the opti- 
mal solution X of P, satisfies X, >0, then it is 
optimal for P). If, on the other hand, x, Z 0, 
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then a branch and bound algorithm described 
later can be applied for a systematic search 
of integer points near X. This algorithm is 
reported to produce good results when the 
size of the associated graph ts not excessively 
large (G. A. Gorry et al., Management Sci., 17 
(1971)). Some of the other important results 
in this area are: (1) the theory of subadditive 
cuts (R. Gomory et al., Math. Prog., 3 (1972)) 
and disjunctive cuts (E. Balas, in Nonlinear 
Programming 2, O. Mangasarian et al. (eds.), 
Academic Press, 1975), (11) research on facial 
structures of the integer polyhedron co X' 
(convex hull of X!) for some of the more im- 
portant integer programming problems, such 
as that of the corner polyhedron co X' [5], 
knapsack polytopes (E. Balas, Math. Prog., 8 
(1975)), and traveling-salesman polytopes 
(M. Grótschel et al., Math. Prog., 16 (1979)). It 
should be pointed out, however, that more 
difficulties of Pj have been revealed rather 
than resolved through the intensive research 
in this area. Incidentally, the 0-1 integer 
programming problem is known to be tNP- 


complete (— 71 Complexity of Computations). 


D. Enumerative Methods 


Another large class of algorithms for solving 
P, consists of the branch and bound methods, 
first proposed by Land and Doig [6] in 1960. 
An outline of the improved version (R. J. 
Dakin, Computer J., 8 (1965)) is as follows. 
(1) Let  — (P5), z* = oo, x* = undefined. 
(2) If A= Ø, then stop (x* is optimal to Po). 
Otherwise, choose from P the problem P: 
Minimize (e'x|xe X]]. 
(3) Solve P;: Minimize (c'x|xe Xj), in which 
the integrality condition is relaxed from the 
constraints of Pj. If P, has an optimal solution 
X, then go to (4). Otherwise, return to (2). 
(4) If X e X] and e'x «z*, then let XOx*, CNA 
z*, and return to (2). If x e X] and e’x 2 z*, 
then return to (2). Otherwise, go to (5). 
(5) Choose j <n,, for which X; is not integral 
and generate the two subproblems P,* : mini- 
mize (e'x|xe X}, x 2 [3;] - 1}, and RB : mini- 
mize (e'x|xe X], x; € [3;]], in both of which 
[*;] represents the largest integer not exceeding 
Xj. Let 2UÍP; , P, } 2 and return to (2). 
The best point of X' identified during the 
preceding steps is denoted by x* and called 
an incumbent. In summary, the branch and 
bound method chooses one subproblem P, 
from tbe problem list 2 and estimates the 
lower bound of its optimal objective func- 
tional value. If the lower bound is worse than 
the current incumbent, then Pj is discarded, 
whereas P, is separated into two subproblems 
if no conclusion can be reached. This process 
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is continued until the problem list P becomes 
empty, thereby implicitly checking all points of 
X!. The above method is called an LP-based 
branch and bound method because linear 
programming techniques are employed to 
obtain a lower bound. The branch and bound 
method tends to require a large amount of 
storage, but many engineering improvements 
on the method of choosing (1) a subproblem P, 
and (ii) a branching variable x;, and several 
improvements of the bounding techniques, in 
addition to the substantial progress made in 
linear programming codes, enable us to solve 
a problem of size n z:100. In particular, an 
improved version of the implicit enumera- 
tion method, proposed by E. Balas [7] for 0-1 
integer programming problems which uses 
logical conditions for obtaining lower bounds, 
is known to be able to solve rather large 0-1 
integer programming problems (A. Geoffrion, 
Operations Res., 17 (1969)). 


E. Other Topics 


The partitioning algorithm [8], in which in- 
teger variables are varied parametrically, is 
reported to work well for 0-1 mixed integer 
problems with relatively few integer variables. 
As people begin to realize the intrinsic dif- 
ficulty of P), they pay more attention to heu- 
ristic algorithms or approximate algorithms 

to obtain a good but not necessarily optimal 
solution. Among heuristic methods, the inte- 
rior path method [9], which elaborates simple 
ideas such as rounding of the optimal solution 
of P5, has been reported to work well for prob- 
lems in which X? has a nonempty interior. 
Also, more emphasis is being placed on special 
purpose algorithms for solving practical prob- 
lems, such as tset partitioning and the ttravel- 
ing salesman problem, etc. [1]. Po is a typical 
nonconvex programming problem, and no 
practically useful *duality theorem is avail- 
able. Hence it is difficult to perform sensitivity 
and/or post-optimality analysis. Some research 
in this area has emerged recently (e.g., C. J. 
Piper et al., Management Sci., 22 (1976)), but 

it looks as if it will be several years before a 
reasonably good procedure becomes available. 
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A. The Riemann Integral 


Let f(x) be a bounded real-valued function 
defined on an interval [a, 5]. We shall divide 
this interval J — [a, b] into subintervals 7; — 
[x;i-1, Xj] (@=1,...,n) by a finite number of 
points x; (a2 xg «€ x, <... < X, = b). This divi- 
sion into subintervals is uniquely determined 
by the set D= {x,}, called the partition of I. We 
set M; — sup, ,, f(x), m; —inf,., f(x), and put 
o(D)= EI, Mi(x;—x,-1), e(D) = Xi, mix;— 
X;-1). Considering all possible partitions D 
of I, we set |? f(x) dx =inf, s(D), f? f(x)dx = 
supp c(D), which are called the Riemann upper 
integral and Riemann lower integral of f, re- 
spectively. If they coincide, then the common 
value is called the Riemann integral of f on 
[a,b] and is denoted by fg f(x)dx. In this case, 
we say that f is Riemann integrable (or simply 
integrable) on [a,b] and call f the integrand; a 
and b are called the lower limit and the upper 
limit, respectively. In this case, by integrating f 
from a to b we mean the process of obtaining 
the value f? f(x) dx. 

Darboux's theorem: For each c 0 there 
exists a positive ò such that the inequalities 








«t 
«E 


an- | F(x) dx 
b 
c(D)— | f(x)dx 
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hold for any partition D with ô(D) 2 max;(x; 
—x;_,)<6. In other words, we have 


lim @(D)= | f(x) dx, 


é(D)0 å 


b 
„üm eg(D)- f f(x) dx. 
From Darboux's theorem it follows that a 
necessary and sufficient condition for f(x) to 
be integrable on [a,b] is that for each positive 
e there exist a positive ô such that ô(D) <ô im- 
plies o(D)—a(D) = Mi-,(M;—mj)(x; — x;-1) « e. 
We call M; —m; the oscillation of f on I; and 
a(D) and a(D) the Darboux sums. Obviously, if 
f is integrable on [a, b], then for each positive 
é there exists a positive 6 such that the follow- 
ing inequality holds for every partition D — 
{x;} with 6(D) <6 and for every set of points 
Gel; G=1,...,n): 
<E. 


n b 
$ Ea- | fle) dx 





The sum ZC, f(£) (x; — xj. ,) is often called a 
Riemann sum (or sum of products). A function 
that is continuous on [a,b], or bounded and 
continuous except for a finite number of points 
in the interval, is integrable. Furthermore, a 
bounded function that is continuous on [a, b] 
except for an infinite number of points x, is 
integrable if for an arbitrary positive number € 
there exist a finite number of intervals I; of 
which the total length is less than e and if the 
set {x,} of exceptional points is contained in 
U I;. Generally, a necessary and sufficient 
condition for a bounded function defined on 
[a,b] to be integrable is that the set of points 
where the function is not continuous be of 
tmeasure 0 (in the sense of Lebesgue). A func- 
tion that is either *monotonic on [a, b] (and 
consequently bounded) or of tbounded vari- 
ation is integrable. A function that is inte- 
grable on [a,b] is integrable on any sub- 
interval of [a, b], the integrand being the 
restriction of the given function to this 
subinterval. 


B. Basic Properties of Integrals 


Let I be the set of all functions integrable 

on [a,b]. If f, gel, then for any numbers 

a, f, we have af + fjgcl, fgel, min( f; gj €L 
max{ f,g}el, and f/g eI provided that there 
exists a positive constant A such that the 
inequality |g|> A holds. Furthermore, if fel, 
then | f| eL; and if f el (n=1,2,...) and f, 
converges ‘uniformly to f, then fel. Cor- 
responding to these properties, the following 
formulas hold. 
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(1) Linearity: 


b 
| (af (x) + Bg(x)) dx 


-a| fods | f(x)dx, 


where a, f are constants. 
(2) Monotonicity: If f(x) 20, then 


b 
| f(x)dx z 0. 


If, further, f is continuous at a point xoe[a, b] 
and f(x )>0, then f? f(x) dx » 0. 

(3) Additivity with respect to intervals: If a, 
b, and c are points belonging to an interval on 
which f is integrable and a « c « b, then 


| fesdx- ae) f(x)dx. 


Adopting the conventions that (2 f(x)dx =0 
and ff f(x)dx — (? f(x)dx, the additivity 
formula holds independently of the order of a, 
b, and c. 

It follows from (2) that |f} f(x)dx| < 
f21fG9|dx if a « b. Further, if f, (x) converges 
to f(x) uniformly on [a,b], then 


b b 
lim | f(x)dx -| f(x) dx. 
Replacing f,(x) by partial sums of a series, 

we obtain the following theorem: Let È a,(x) 
be a series in which each term a, (x) is inte- 
grable on an interval [a,b]. If the series con- 
verges uniformly on [a,b], then the sum s(x) is 
integrable on [a, b], and the series is termwise 
integrable, that is, 


| s(x)dx = y 


Also, the series Dy), [7 a,(t) dt converges uni- 
formly on [a,b] to the integral {x s(t)dt. As- 
sume that *a,(x) is convergent but not uni- 
formly convergent. If all a,(x), together with 
s(x) - X a,(x), are integrable and there is a 
constant M independent of n such that |s,(x)| 
<M (xe[a, b]) for all n, where s, (x) are par- 
tial sums, then the series is termwise integrable 
(C. Arzelà). 

The first mean value theorem: If f(x) is con- 
tinuous on [a,b] and (x) is integrable and of 
constant sign on [a,b], then there exists 0 (0 < 
0 « 1) such that 


b 
a,(x)dx. 


a 


b b 
| fist fia 00a) | Q(x)dx. 
When o(x) — 1, we have 


| fenis fas 0(b — ay (b — a). 


a 
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The second mean value theorem: If , (x) is a 
positive, monotone decreasing function de- 
fined on [a,b] and q(x) is an integrable func- 
tion, then there exists y (a<n x b) such that 


b n 
| f(x)o(x)dx = f(a-- of o(x)dx. 


In the hypothesis of the second mean value 
theorem, if f(x) is assumed to be monotonic 
but not necessarily positive, then there exists 
n (aq « b) such that 


b 
| F(x)p(x)dx 


a 


" b 
— f(a «o Q(x)dx mal Q(x) dx. 

a n 
H. Okamura (1947) proved that the condition 
a<n <b can be replaced by a <q «b. 

In the case f(x) z0 on [a,b], we consider the 
figure F bounded by the graph of f(x), the x- 
axis, and the lines x 2a and x =b. Then a(D) 
and c(D) are areas of polygons of which one 
encloses F and the other is enclosed by F, as 
shown in Fig. 1. Hence it can be shown that 
the integrability of f(x) in the sense of Rie- 
mann is equivalent to the measurability of F 
in the sense of Jordan. The Riemann integral 
f? f(x)dx is the area of F with respect to its 
*Jordan measure. 





Fig. 1 


C. Relation between Differentiation and 
Integration 


Suppose that f(x) is integrable on an interval 
I. We fix a point a of J and consider the in- 
tegration F(x)— (7 f(t)dt, where x varies in I. 
The function F(x) is called the indefinite in- 
tegral of f(x). In contrast with this, the integral 
on a fixed interval, as considered in the previ- 
ous sections, 1s often called the definite in- 
tegral. The indefinite integral F(x) is continu- 
ous on the interval J and of bounded varia- 
tion. If f(x) is continuous at a point x, in J, 
then F(x) is differentiable at x, and F'(x9) — 

f (xo). In general, if a function G(x) satisfies 
G'(x)— f(x) everywhere in J, then G(x) is called 
a primitive function of f(x). If f(x) ts continu- 
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ous, the indefinite integral of f(x) is one of 

the primitive functions of f(x). Furthermore, if 
a function G(x) is a primitive function of f(x), 
then any other primitive function can be writ- 
ten in the form G(x)+ C, where C is a con- 
stant, called an integral constant. For a con- 
tinuous function f(x) on [a, b] and any one 

of its primitive functions G(x), we have 


| f(x)dx = G(b) — G(a) 2 [G(x) Je 


(fundamental theorem of calculus) (—> Ap- 
pendix A, Table 9). From the differentiation 
formulas we obtain the following integration 
formulas: 

Integration by parts: If f(x) and g(x) have 
continuous derivatives on [a, b], then 


b b 
| F(x)g'(x)dx - Lf vest) I'(x)g(x)dx. 


More generally, if f(x) and g(x) are integrable 
on [a,b], then 


[aof 7a): 
b b b x 
d fnis | vun! rex] soda 


Change of variables: If f(x) is integrable on 
[a,b] and x= p(t) and o'(t) are continuous on 


Io, DL where a— q(a), b — op) (a p(t) <b), 
then 


b B 
| fenis | f(ott)) o'(t) dt. 


D. Improper Integrals 


The concept of the integral can be generalized 
to the case where the integrand or the interval 
on which integration is accomplished is not 
bounded. Assume that f(x) is not bounded on 
[a, b) but is bounded and integrable on any 
interval [a, b —£] (c [a, b)). HL *f(x)dx has a 
finite limit for £20, the limit is denoted by 
fé f(x)dx and is called the improper Riemann 
integral (or simply improper integral) of f(x) on 
[a, b). For example, if f(x) is continuous on 
[a, b) and f(x) - O((b— xy) for some « (O>a> 
—]), where O is the *Landau symbol, then the 
improper integral f? f(x)dx exists. On the 
other hand, if f is integrable on [a+ e, b] for 
each £2 0 but not bounded in any neighbor- 
hood of a, we can define the integral on (a, b] 
in the same way. If f is not bounded in any 
neighborhood of a or b and if there exists a 
point c (a « c « b) for which the improper in- 
tegrals f; f(x) dx and f? f(x)dx exist, then we 
define fa f(x)dx = f; f(x)dx + f? f(x) dx, which is 
independent of the choice of the point c. Fur- 
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thermore, assume that f(x) is not bounded in 
any neighborhood of each point c; (j — 1, ..., n) 
(a «c, «€... «c,«b). Then we define 


| fisyax= [^ rende |" reads a 


2 faxa | f(x)dx, 


provided that all improper integrals 


| fidis, BER 


exist. Suppose that f(x) is defined on [a, b] 
and bounded outside any neighborhood of 

c e (a, b) but not bounded in either [c — e, c] or 
[c.c +e] for any £20. It may well happen 
that, although neither lim, „o f4 * f(x)dx nor 
lim, o (2... f(x) dx exists (accordingly, the 
improper integral f? f(x)dx does not exist), if 
we put tz, the limit 


Cp b 
im( | ner! feux) 


does exist. This limit is called Cauchy's prin- 
cipal value and is denoted by p.v. f? f(x) dx 
(v.p. in French). For example, p.v. (^ ,(dx/x) — 
lim, a -1(1/x) dx + (1(1/x) dx) «0. 


E. Integrals on Infinite Intervals 


Suppose that we are given a function f(x) 
defined on an infinite interval [a, oo) and 
integrable on any finite interval [a, b]. If 
Im, J? f(x) dx exists and is finite, then this 
limit is called the improper integral of f on 

[a, oo) and is denoted by (7 f(x)dx. We define 
similarly [^ ,, f(x)dx ss Dm... (? f(x) dx, where 
f is defined on (—o0, b] and integrable on any 
interval [a, b]. Furthermore, [? , f(x)dx is, by 
definition, f^, f(x) dx + f? f(x) dx, which is 
independent of the choice of c. Suppose that 
f(x) is integrable on [a, b] for a fixed a and an 
arbitrary b larger than a. If f(x) 2 O(x?) for 
some x < —1, then [7 f(x)dx exists. Generally, 
for x, B such that —oo xa < oo and —oo < 

P< oo, if the improper integral (7 f(x)dx exists, 
we say that the integral is convergent; other- 
wise, it is divergent. Improper integrals also 
satisfy the three basic properties of integrals 
(1), (2), and (3) (— Section B). However, the 
existence of an improper integral of a func- 
tion f on an interval J does not imply the 
existence of the improper integral of | f| on 
the same interval J. For example, let f be 

a function determined by f(0) 20, f(x) — 
(1/x)sin(1/x) for 0 « x & x. Then Lë f(x) dx 
exists, but (5| f(x)] dx does not. On the other 
hand, if the improper integral of | f(x)| exists, 
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then the improper integral of f(x) exists, and 
we have 


B 
| f(x)dx 


where —oo € a « fl € +o. In this case, we say 
that f is absolutely integrable on the interval 
[x, P]. Assume now that f(x) is defined on 
(—00, œ) and integrable on any finite interval. 
If um, |. f(x)dx exists, then it is called 
Cauchy’s principal value of the integral of f in 
(— 00, 00). 

If f(x) is a monotone decreasing, positive, 
and continuous function defined on TE. oo) 
(where k is an integer), then according as 
Zo f(v) converges or diverges, so does 
fe f(x) dx. 

Suppose that a series B f, (x), where all 
the functions f,(x) (n— 1,2,...) are defined and 
nonnegative on an infinite interval [a, oo), 
satisfies (MDP  f,6))dx ^ D2, ffi) dx 
for arbitrary b >a. Then according as 
E fe f(x) dx converges or diverges, so does 
fe (Cam, f(x) dx. When they converge, the 
following equality holds: (7 E f,(x)dx 
= Era |? f(x) dx. In this theorem, if 
f£ Ez o9ldx or Ez E fol dx con- 
verges, then the same conclusion as above will 
follow even when the f,(x) are not necessarily 
positive (C Appendix A, Table 9). 








B 
«| | f(x)| dx, 


F. Multiple Integrals 


Suppose that f(x, y) is a function defined and 
bounded on an interval I = {(x, y)|a<x<b, 
c<y<d} in the xy-plane. Partitions {x,} and 
{y,} of [a,b] and [c,d] with a2 xg «x, <... 
«x,—b and c2 yg € y, <... € y, d determine 
a "partition," denoted by D, of I into subinter- 
vals of the form I, = (65, y)| xj. SX S Xj yy 1 S 
y<y,} 4L ...,m k-1,...,n). Writing 

M= sup f(x, y), 


(x, y)e Tj 


Mj = inf f(x, y), 


Le, len 


we set 


o(D)= Y Mg(xj— xj-1) Vie — Ve-1)> 


Sen 
ll 
- 
z 
ll 
Co 


a(D)= X, Y mga(x;— X;-1) Ve — Vu-1)- 
j*ikz1l 

Then we obtain infp E(D) 2 suppo(D). If 
inf, o(D)=suppa(D), then f(x, y) is called inte- 
grable on J, and the common value is called 
the double integral of f on J and is denoted by 
JJ, f(x, y) dx dy. Analogously, we can define n- 
tuple integrals (or multiple integrals) and the 
integrability of functions of n variables. 

Let K be a bounded set in the xy-plane and 
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I an interval containing K. Let g(x, y) be the 
characteristic function of K defined on 1, that 
is, 9 is determined by 


g(x,yJ=1 for (x,y)eK, 
(x, y)=0 for (xy)el—K. 


Replacing f(x, y) by this @(x, y), we consider 
infyc(D) (supp o(D)). These values can be 
shown to be independent of the choice of such 
an interval J and are called the outer area 
(inner area) of K, respectively. When these two 
values coincide, K is said to be of definite area, 
and the common value is called the area of K. 
A necessary and sufficient condition for K to 
be of definite area is that the outer area of 

the *boundary of K be zero. Now consider a 
bounded function defined on a set K of de- 
finite area. Then, taking an interval I contain- 
ing K, define an extension oí(x, y) of f(x, y) as 
follows: 


(x, y)2 fo y) for (x, y)eK, 
9(x,y)-0 for (x,yjeI—K. 


If (x, y) is integrable on I, then f(x, y) is called 
integrable on K, and the integral of f on K is 
defined by f fk f(x, y)dx dy =f fi (x. y) dx dy, 
which is independent of the special choice of I. 
The set K is called the domain of integration. 
Since K is of definite area, the set of boundary 
points of K at which (x, y) is not continuous 
can be contained in a union of intervals whose 
total area can be made smaller than any preas- 
signed positive number. Consequently, a func- 
tion bounded on K and continuous at each 
"interior point of K is integrable on K. Like 
integrals of functions of a single variable, 
multiple integrals satisfy the three basic prop- 
erties of integrals (— Section B). 


G. Multiple Integrals and Iterated Integrals 


Suppose that we are given a function f(x, y) 
that is continuous on an interval I = {(x, y)| 
axx&b,c«yxdyj. Then, for a fixed y in 
[c,d], the function f(x, y), regarded as a func- 
tion of x, can be integrated with respect to 

x on the interval [a, b], and the integral thus 
obtained is a continuous function of y. The 
integral of the function defined on [c,d], 
namely, UO: f(x, y) dx) dy, is called the iterated 
integral (or repeated integral) of f(x, y) and is 
often written as [2dy (? f(x, y) dx. The following 
formula gives a representation of a double 
integral by iterated ones: 


d b . 
{| fes yiaxdy= | a| f(x, y)dx 
b d 
-| nl f(x, y) dy. 
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More generally, o, (x) and p(x) being con- 
tinuous on [a, 5] and o, (x) € o; (x), consider 
the following subset K = {(x, y) a&x <b, 

0 (x)<y <¢@,(x)! of the xy-plane. Suppose 
further that f(x, y) is continuous on K. Then 
the following equality holds: 


b P(x) 
{| I(x, y)dx b- | dx f(x, y) dy. 


a p(x) 


In the case of unbounded integrands or 
unbounded domains of integration, we can 
still define integrals under suitable restrictions. 
For instance, assume the following two prop- 
erties: (1) There exists a sequence {K,,} of sets, 
each of which is of definite area, satisfying 
K,c Kc... and K |J. K,. (2) f(x, y) is 
bounded and integrable on each K, (n= 
1,2, ...). If a finite limit mm, Ile f(x, y) dx dy 
exists and is independent of the choice of {K,}, 
then f(x, y) is called integrable on K. This limit 
is called the integral of f(x, y) on K and is 
denoted by f fx f(x, y)dx dy: 


lim i fs ndsay- | | f(x, y)dx dy. 
TS K, K 


When the integral thus defined exists, we say 
that the integral is convergent. If a finite limit 
lim, Ile Lf (x, y) dx dy exists for some se- 
quence {K,,} with property (1), then f is inte- 
grable on K. Let f(x, y) be continuous and 
nonnegative on K 2 ((x, y)| a «x « f, y « y «àj, 
where —oo €«a «fix oo, —oo €y «ó x oo. Fur- 
thermore, let f(x, y) be integrable on K, and 
assume that the improper integral F(x)— 

P f(x, y) dy - lim, ats 12 f(x, y) dy exists and 
converges uniformly with respect to x as c], 
d1ó. Then [7 F(x)dx is well defined, and we 
have 


B d 
i flssypaxdy= | nl f(x, y) dy. 
K a y 


In particular, if «=a, y — b, f 2ó — oo, then we 
have 


| | fes sd | af f(x, y)dy. 
a b a b 


H. Interchanging the Order of Differentiation 
and Integration 


If both f(x, y) and Of(x, y)/Oy are continuous 
on an interval ((x, y)]|a€x «b, yo—n<y< 
yo +4}, then we can interchange the order of 
differentiation and integration as follows: 


d b b Q : 
—]| fix,y)dx= fy) dx for y=yo. 
dy Ja a y 





Assume further that this equality holds for 
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every b( a), the improper integral 


Es b 
| fessi | f(x, y)dx 
converges, and the improper integral 


E? , b A 
Of (x, y) actin Of (x, y) dx 
a Qy b- o0 Qy 








a 


converges as b— oo uniformly for y with |y — 
yol € 4. Then 


qp "d 
— f(x, y)dx- fO y. for y=Yo. 
dy Ja a Op 


Several other similar theorems are known. 
Though the previous theorems are written in 
terms of two variables, analogous theorems 
hold for n variables. 


I. Change of Variables in Multiple Integrals 


Let G be a bounded domain of definite area in 
an n-dimensional Euclidean space R"(x). As- 
sume that a mapping x y(x) — (y4(X4, ..., Xn} 
Halt, x) is of class C! from an open 
set containing the closure G of G into an n- 
dimensional Euclidean space R"( y). We de- 
note the image of G under this mapping by B. 
If f(y,, ..., Yn) is continuous on B, then the 
following formula on change of variables 
holds: 


kel SOY s Feld... dYa = 


Dip, Bal 
sq dl ese Xs) 
G 


Dix, X,) 

where g(x;, ...,x,) = f1 (X15 -5 Xah s Yn 
..., X4) and D(y,, ..., y,/D(x,, ..., Xp) is the 
tJacobian determinant of the mapping y(x). 
This formula is usually utilized in the case 
where y,, ..., y, are functionally independent, 
though otherwise both sides vanish and the 
formula still holds. For improper integrals, 
a similar formula will hold under suitable 
restrictions, for example, if the integrals con- 
verge absolutely. 

For related topics — 94 Curvilinear In- 
tegrals and Surface Integrals, 221 Integration 
Theory, and 270 Measure Theory. 


dx, ...dx,, 
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A. General Remarks 


Equations including the integrals of unknown 
functions are called integral equations. The 
most studied ones are the linear integral equa- 
tions, i.e., linear in unknown functions. 

Let D be a domain of n-dimensional Eu- 
clidean space and f(x) and K(x, y) be func- 
tions defined for x Z(x,, x5, ..., x,)eD, y= 
(Yi. Ma, Yn)E D. Integral equations of Fred- 
holm type (or Fredholm integral equations) [1] 
are those of the forms 


| K(x, y)o(y)dy = f(x), (1) 
q(x)— | K(x, y)o(y) dy 7 f(x), (2) 
D 
A(x) @(x)— | K(x, y)o(y)dy = f(x), (3) 
D 


where (x) is an unknown function and Jody 
means the n-fold integral |... (5dy, ... dy,. 
Equations of the forms (1), (2), and (3) are 
called equations of the first, second, and third 
kind, respectively. Equations of the second 
kind have been investigated in great detail. 
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Equations of the third kind, in many cases, can 
be reduced formally to those of the second 
kind. The function K(x, y) is called a kernel (or 
integral kernel) of the integral equation. 
Integral equations of Volterra type (or Vol- 
terra integral equations) are those of the forms 


[Kes nowdr= rin (1’) 
ots) | K(x, y)o(y)dy — f(x), (2’) 
A(x) p(x) — | K(x, y)o(y)dy = f(x), (3) 


where g(x) is an unknown function. Equations 
of the forms (1^), (2), and (3’) are also called 
equations of the first, second, and third kind, 
respectively. Integral equations of Volterra 
type can be regarded as integral equations of 
Fredholm type having kernels equal to 0 for 

x « y, but these two types of equations are 
usually treated separately, since they have 
considerably different characters. 

The kernels in equations (1)—(3) and (1’)— 
(3) are frequently written in the form AK(x, y) 
with a parameter A, in particular when the 
equations are related to eigenvalue problems, 
which is explained in Section F. 

The theory of integral equations was orig- 
inated in 1823 by N. H. Abel, who investigated 
the relationship between time and the path of 
a falling body in the field of gravitation. Let 
p(t) be a quantity varying with time, which is 
connected by some law with its values in some 
time interval of the past or the future. Then 
the law of variation of o(t) can be described 
mathematically by an integral equation. The 
situation ts the same even when the variable t 
is not time but a coordinate of the space. In 
this way, various problems in physics can be 
reduced to solutions of integral equations. 


B. Relation to Differential Equations 


Many problems in differential equations can 
be reduced to problems related to integral 
equations. Such reduction often makes the 
problems easier to handle and clarifies the 
nature of the solutions. For example, consider 
the problem of finding a solution of the ordi- 
nary second-order linear differential equation 
d? y/dx? + Ay «0 with the boundary condition 
y(0) = y(1) 0 [4]. Let d? y/dx? 2 u(x). If we 
integrate the equation twice, change the order 
of integration, and make use of the boundary 
condition, then we have 


cl (x Chacd -f x(1— ¢)u(d) dé, 
0 0 
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from which we see that the given differential 
equation can be written in the form 


1 x 
u -af x(1— ends 2 | (x — Jul) dě. 
Uu 0 

Decomposing the first integral on the right- 
hand side into the sum of an integral over 

(0, x) and one over (x, 1), and combining the 
integral over (0, x) with the second integral on 
the right-hand side, we obtain a Fredholm 
integral equation of the first kind as follows: 


os | G(x, )u(£) d£, 


Clearly, the solution of this integral equation 
is equivalent to that of the original differential 
equation. The function G is called *Green’s 
function for the boundary value condition 
y(0) = y(1) 20 in the theory of boundary value 
problems. Differential equations of higher 
orders can be treated analogously (— 315 
Ordinary Differential Equations (Boundary 
Value Problems)). tInitial value problems of 
linear ordinary differential equations can be 
reduced to the solution of Volterra integral 
equations in a similar way. 

As another example [5, 6], consider the 
tDirichlet problem on a plane, i.e., the problem 
of finding a function u satisfying the conditions 
(i) u is tharmonic in the interior of the region D 
bounded by a closed curve C(ë = q(s), n — (s), 
Oxs«l) (ii) u(x, y) F(s) uniformly with re- 
spect to (xo, yo) as (x, y) approaches (xo, yo) 
from the inside of D, where Ga, yo) is an arbi- 
trary point on C, F(s) is a continuous func- 
tion given on C, and s is the arc length along 
C. Put f(s)= Fiss and 


(V(s—v()o(0) —(o(s) — EMH) 
nill- e(t)? +H()- WO) ` 


Then it is known that a solution u of the prob- 
lem can be given in the form 


K(s;t)— 





! ô 1l 
u(x, y)= | u(s)z-log-ds, 
o On r 
where r? —(o(s) — x)? 4- (V(s) — y)’, n is the inner 
normal of C, and p(s) is a continuous solution 
of the following Fredholm integral equation of 
the second kind: 


i 
van K (s; t)u(t) dt. 

0 
We can treat the * Neumann problem similarly, 
i.e., the problem in which condition (ii) is re- 
placed by (ii) (Qu/On)(x, y)» F (s) uniformly 
with respect to (xo, yo) as (x, y) approaches 
(Xo, yo) from the inside of D. In the Neumann 
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problem, put f(s) — F(s)/n and 


(9) — dies — (o) — eM) HS) 
n((o(s)— PO) --(()—V(0)) ` 


Then we have a solution u in the form 


Liste 


i 1 
u(x, y)= — | u(s)log—ds+c, 
e r 
where ji(s) is a solution of the following Fred- 
holm integral equation of the second kind: ` 


I 
u(s)=f(s)— | Lis t)u(t) dt. 

0 
A solution of this integral equation, however, 
exists when and only when fh F(s)ds=0. In the 
expression of a solution u(x, y) of the Neu- 
mann problem, c is an arbitrary additive con- 
stant, up to which a solution of the problem is 
determined uniquely. We can also treat par- 
tial differential equations of elliptic type in 
an analogous way. 


C. Integral Equations with Continuous Kernel 


We describe some results for integral equa- 
tions with m-dimensional independent vari- 
ables, i.e., equations in which D is an m- 
dimensional closed domain. We assume that 
K(x, y) and f(x) are continuous in Sections 
D-H. 


D. The Method of Successive Iteration 


Among methods of solving Fredholm integral 
equations of the second kind, the simplest is 
the method of successive iteration, sometimes 
called the method of successive approximation 
[7]. In the method of successive iteration, we 
rewrite (2) in the form 


e(x) 9 fo) | K(x, y)o(y)dy 


D 


and replace the function ¢(y) on the right- 
hand side by the function 


f) | K(y, z) o (z) dz. 
D 


If we repeat the process successively, then we 
have 


p(x) = f(x) + J | Kx, y f(y) dy 
t= D 


d Kisii, y)e(y)dy, 
D 


where 


K(x, y) - K(x, y), 


Kí(x, »-| Kj.,(x, s)K(s, y) ds. 
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The functions K (x, y) are called the iterated 
kernels. Assume that ZZ, K,(x, y) converges 
uniformly. Then, putting 


R(x, y) » K (x, y), (4) 


we obtain a solution of (2) in the form 


p= | R(x, y)f(y)dy. (5) 


The series (4) is called a Neumann series. 
For a given kernel K(x, y), a function R(x, y) 
satisfying 


K(x, uns K(x, s)R(s, y)ds =0 


and 


K(x, y) — R(x, ul R(x, aikis, y)ds =0 
D 


is called a resolvent of K (x, y) (in some cases 
— R(x, y) is taken as the resolvent). If a re- 
solvent of K(x, y) exists, the solution of (2) 
can be given uniquely by (5). If a Neumann 
series converges uniformly, (4) gives a resolv- 
ent of K(x, y). 

If we apply a similar process to Volterra 
integral equations of the second kind, then we 
have the iterated kernels defined by 


Ki KG, s)K(s y)ds (i—1,2,...). 
y 

For these iterated kernels, a Neumann series 
defined by (4) always converges uniformly. 


E. Fredholm’s Method 


Let D be a bounded closed domain and K(x, y) 
a continuous kernel. A Neumann series (4) 
converges uniformly and gives a resolvent if 

IK (x, y)| or the region D is sufficiently small, 
but otherwise it does not necessarily converge. 
E. I. Fredholm [1,7] gave a method of con- 
structing a resolvent for the more general case. 
Write a kernel in the form AK (x, y), and put 





ps K(xi, yi) K(xi,y,) 
K = ; 
du M YET K (Xp, Yn) 
Define D(A) and D(x, y; A) by 
Di 
c (— A)" Gab 
145! dÉ «(^ "nu 
aci n jp D Sia ciega 
and 


D(x, y; 4) — K(x, y) 


o (—4A4y SCIRE Se 
AE | SI «(C 5 Jasi dns 
n=1 n! D D ETH 
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The series in these two equations both con- 
verge uniformly and hence define tentire func- 
tions of A. The functions D(A) and D(x, y; A) are 
called Fredholm's determinant and Fredholm's 
first minor of the kernel K(x, y), respectively. 
For small |4|, we have 





Si , ^n-1 
DO) VI | is s)ds, 


where the K,,(x, y) are iterated kernels corre- 
sponding to K(x, y). Now if D(A) 40, a resolv- 
ent AR(x, y; A) of the kernel AK (x, y) can be 
given by 
D(x, y; A 


If D(4) 0, some extension of the method in 
this section is needed. Fredholm introduced 
for this purpose Fredholm's rth minor 


pm 
Yis Yr 
defined by 
Xis... X, 
d 
Yis- Vp 


lh y (—Ay 


EG Ve den A! 
X15. ns isen A 
D D yis Yrs Siste Sn 


F. Eigenvalue Problems and Fredholm’s 
Alternative Theorem 





Consider a homogeneous integral equation of 
the second kind 


TE K(x, y)og(y) dy —0, (6) 


where D is a bounded closed domain and 

K(x, y) is continuous in D x D. When (6) has 

a nontrivial solution g(x) for some 4, then A 

is called an eigenvalue corresponding to the 
kernel K(x, y), and the corresponding non- 
trivial solution g(x) is called an eigenfunction 
corresponding to the kernel K(x, y). If D(A) #0, 
then (6) has no nontrivial solution, from which 
it follows that eigenvalues must be zero points 
of the entire function D(A). For an arbitrary 
eigenvalue 4, there is a set of linearly inde- 
pendent eigenfunctions corresponding to À 
such that any etgenfunction corresponding to 
A can be written as a linear combination of 
the eigenfunctions belonging to the set under 
consideration. Such a set of linearly indepen- 
dent eigenfunctions corresponding to an eigen- 
value 4 is called a fundamental system corre- 
sponding to the eigenvalue 4. The number of 
elements of the fundamental system is called 
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the index of the eigenvalue 4. The index of an 
eigenvalue is always finite. The homogeneous 
integral equation of the form 


(oi K(x, y (x)dx —0 (6) 
D 


is called an associated (or transposed) integral 
equation of (6). The associated equation has 
the same eigenvalues as the original equation; 
moreover, the index of a common eigenvalue is 
the same for both equations. For any eigen- 
value 4, the order of the zero point 4 of the 
entire function D(A) is called the multiplicity of 
the eigenvalue 4. If an eigenvalue 4 is a pole of 
R(x, y; 4), then we have p+ 1 2 r 4 q, where r is 
the order of the pole, p is the multiplicity of A, 
and q is the index of 4. In particular, if A is a 
simple pole of R(x, y, 4), we have p — q, namely, 
the multiplicity is equal to the index. An exam- 
ple with this particular property is the integral 
equation with a symmetric kernel to be dis- 
cussed in Section G. For the set of eigenvalues 
there is no finite accumulation point even if 
there are infinitely many eigenvalues. 

If 4 is not an eigenvalue, the inhomogeneous 
equation 


(x) EI K(x, y)p(y)dy=f(x) (7) 
D 


can be solved uniquely for any continuous 
function f(x). In this case we have D(A) # 

0, and the resolvent R(x, y; 4) of the kernel 

AK (x, y) exists. If A is an eigenvalue, we have 
D(A) =0, and equation (7) has a solution if and 
only if 


| V(x)f(x)dx =0 


for all solutions (y) of (6) (linearly indepen- 
dent solutions y( y) are finite in number). The 
last statement is called Fredholm's alterna- 
tive theorem (— 68 Compact and Nuclear 
Operators). 

A kernel of the type 


K(x,y)- 2 Xjo) Hin 


is called a separated kernel, degenerate kernel, 
or Pincherle-Goursat kernel. For such a kernel, 
we have D(4) detién, — 4 (^ X,(t) Y(t) dt), and 
hence we can easily obtain eigenvalues and 
eigenfunctions. À nondegenerate kernel can be 
studied using the results obtained for sepa- 
rated kernels, since we can regard a kernel of 
the general form as the limit of a sequence of 
separated kernels. 


G. Symmetric Kernels 


A kernel K(x, y) is called a symmetric kernel 
if it is real and K(x, y) 2 K(y, x). Let D bea 
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bounded closed domain and K(x, y) be a con- 
tinuous symmetric kernel. In this case the as- 
sociated equation (6^) clearly coincides with the 
original equation (6) [5, 6, 8]. 

Corresponding to any nontrivial symmetric 
kernel K(x, y), there exist at least one eigen- 
value and one eigenfunction. The eigenvalues 
are all real, and the eigenfunctions correspond- 
ing to distinct eigenvalues are mutually ortho- 
gonal. If we orthonormalize the eigenfunctions 
belonging to all fundamental systems and 
number them according to the order of in- 
creasing absolute values of the corresponding 
eigenvalues, then we have an orthonormal 
system {@,(x)}, called a complete orthonormal 
system of fundamental functions or simply a 
complete orthogonal (or orthonormal) system. 
If we number the eigenvalues taking their 
multiplicities into account and according to 
the order of increasing absolute values, then 
we have the equality 


o] 
X z-| | K(x, y)dx dy. 
i DJD 


i=1 4 





Corresponding to an iterated kernel 
K m(x, y), we have the eigenvalues {4;"}, and we 
can choose the corresponding orthonormal 
system so that it coincides with the one corre- 
sponding to K(x, y). Eigenvalues and eigen- 
functions corresponding to an iterated kernel 
can be obtained in the following way: Put 
[p K,(s, s)ds=u,; then the following limit 
exists: 


lim u5,/u5,,.5 — 47 < +0, 
n-*oo 


which gives an eigenvalue of the iterated ker- 
nel K,(x, y). A function g(x, y) defined by 


lim 2^ K(x, tel 


(uniformly convergent) gives the correspond- 
ing eigenfunction (x, c) for any constant c 
satisfying o(x, c) z 0. 

Let 4" be an eigenvalue corresponding to an 
iterated kernel K,(x, y) and o(x) be a corre- 
sponding eigenfunction. Consider the func- 
tions w,(x) (j=0, 1, ...,n— 1) defined by 


n-1 
od ec) 3 en | K,(x, y)o(y)dy 


D 
(j=90,1,2,...,n—1), 


where e is one of the nth primitive roots of 1. 
Thea for at least one value of j, £4 is an eigen- 
value corresponding to the kernel K(x, y), and 
y; is a corresponding eigenfunction. This rela- 
tionship between eigenvalues and eigenfunc- 
tions corresponding to an iterated kernel and 
those corresponding to the original kernel is 
valid even for kernels that are not necessarily 
symmetric [2]. 


217 H 
Integral Equations 


Let a kernel K(x, y) be defined by 


K(x, y) e Ks y) € 28790. 
i=1 i 

where the /; (i= 1,2, ..., n) are the eigenvalues 
corresponding to the kernel K(x, y) and the 
(x) (i — 1, ..., n) are the corresponding ortho- 
normalized eigenfunctions. Then eigenvalues 
and eigenfunctions corresponding to K(x, y) 
are those corresponding to K (x, y), with the 
exception of 4,,..., 4, and @,(x), ..., o, (x). 

Let o(x) be any function that satisfies 
Jo(y(x))? dx = 1. Then the integral 


=| | K(x, yp(x) p(y) dx dy 
DJD 


assumes the maximum value when ọ(x) is an 
eigenfunction corresponding to K(x, y) with 
the smallest eigenvalue 42. Let the eigenvalues 
4, Of K(x, y) be numbered in order of increas- 
ing absolute values, so that |4,| € |4,,,|. Let 
q(x) be any function satisfying 


| Wi(x)p(x)dx=0 (i=1,2,...,n), 


| (p(x)? dx=1 


for given functions w(x) (i=1,...,n). Then the 
maximum value of the integral J above is the 
least when the set of all linear combinations of 
(br, (x), ...,W,(x)} coincides with the set of all 
linear combinations of Joy (x), ...,@,(x)}, and 
in this case the maximum of J is attained by 
some eigenfunction (x) corresponding to 
K,(x, y) with the eigenvalue 4, ,. The results 
in this paragraph show that we can obtain 
eigenvalues by solving a variational problem 
concerning the integral J. 


H. Expansion Theorems 
Let K(x, y) be a continuous symmetric kernel 
and h(x) be a function square integrable on a 


bounded closed domain D. Then a function 
f(x) such that 


fe9-— i K(x, y)h(y)dy 

can be expanded in the form 
WEEK 

where {@,(x)} is a complete orthonormal sys- 


tem of fundamental functions corresponding 
to K(x, y) and 


«-| fG)e,(x)dx |. (n21,2,...) 
D 
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The series in the expansion of f(x) converges 
uniformly. These facts are the content of the 
Hilbert-Schmidt expansion theorem [5,6,8]. By 
using this theorem, for a 4 that is not an eigen- 
value, we can obtain a solution (x) of the 
Fredholm integral equation (7) with a sym- 
metric kernel in the form 


alte Y d 





| I(x) @j(x) dx. 
D 


For m Z2, iterated kernels can be expanded in 
the form 


(uniformly convergent). If AR(x, y; A) is a re- 

solvent of a symmetric kernel AK (x, y), then 

R(x, y; 4) can be expanded as 

. & Gx) Ply) 

R(x, y;A)= K(x, y) ——— 4 

(SAT KG ye T 
If a symmetric kernel K (x, y) satisfies the 

inequality 


| | K(x, y)p(x)e(y) dx dy z 0 
DJD 


for all o(x), it is called a positive (semidefinite) 
kernel. If in this inequality the equality holds 
only for o(x) 20, K(x, y) is called a positive 
definite kernel. For a positive definite kernel, 
eigenvalues are all positive, and the kernel can 
be expanded in the form 
K(x, y)= y "ro 
i=1 i 

(uniformly convergent). This result is called 
Mercer’s theorem. 

When a real continuous kernel K (x, y) is not 
symmetric, we consider two positive kernels 
Eis, y) and K"(x, y) defined by 


| K(x, s)K(y, s)ds 2 K'(x, y) 
D 


and 


| K(s, x) K(s, y ds = EIS, y). 
D 


The eigenvalues corresponding to these ker- 
nels are the same, and they are all positive. Let 
A; (i 1,2, ...) be these eigenvalues and {@,(x)} 
and {w,(x)} be the corresponding complete 
orthonormal systems corresponding to K’ and 
R”, respectively. Then we have 


4| K(y, x)oi(y) dy = W(x), 
D 


Ài | K(x, y)Wi(y) dy = p(x). 
D 
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Let f(x) be an arbitrary function such that 


fe- | K(x, y)h(y)dy, 


where h(x) is a function square integrable on 
D. The function f(x) can then be expanded in 
the form 


S= È co. (8) 


where 


=| f(x)e(x)dx  (i=1,2,...). 
D 


The series in the expansion (8) converges 
uniformly. 

The Fredholm integral equation (1) of the 
first kind with a general kernel (i.e., a kernel 
that is not necessarily symmetric) has a square 
integrable solution (x) if and only if f(x) has 
a uniformly convergent expansion (8) and 

2 (cA) < +00. When this condition is 
satisfied, equation (1) has a solution given by 
X4 C, A4 Q(X) that converges in the sense of 
*mean convergence. 

It should be noted that the theory concern- 
ing symmetric kernels can be extended to 
complex-valued Hermitian kernels, i.e., kernels 
such that K(x, y)= K(y, x). Also, we can obtain 
the theory in Section G and this section, con- 
cerning Fredholm integral equations with 
continuous kernels, by using the methods of 
functional analysis that treat fp K(x, y)o(y)dy 
as a ‘compact operator in the space of con- 
tinuous functions (— 68 Compact and Nuclear 
Operators). 


I. Kernels of Hilbert-Schmidt Type 


Kernels of Hilbert-Schmidt type are kernels 
which are square integrable in the sense of 
Lebesgue over D x D, where D is an arbitrary 
domain. Most of the results mentioned in the 
previous section concerning integral equations 
with kernels continuous on bounded domains 
are valid also for equations with kernels of 
Hilbert-Schmidt type, because every operator 
mentioned in the previous section is also a 
compact operator in the space concerned, i.e., 
the space L,(D) [6] (— 68 Compact and 
Nuclear Operators). 


J. Singular Kernels 


For general kernels that are not necessarily 
continuous, the theory described in the previ- 
ous sections does not apply properly, but 
when an iterated kernel K,,,(x, y) has a resolv- 
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ent R,(x, y), we can find a resolvent R(x, y) of 
K(x, y) in the form 


R(x, y) = R,(x, y) + Halt, y) 


|| R,, (x, 8) H,,(s, y)ds, 
D 


where H,,(x, y) 2 37*,! K'(x, y). When a kernel 
is of the form AK(x, ». the relationship be- 
tween eigenvalues and eigenfunctions corre- 
sponding to an iterated kernel and those cor- 
responding to the original kernel, which was 
stated in Sections G and H, is still valid for 
general kernels. If a kernel K(x, y) is continu- 
ous for x #y and has a singularity of the form 
Ix —y| * (0 «a « I) on x — y, the iterated ker- 
nels K,,(x, y) are continuous provided that 

(1 —a)m>1. tGreen’s functions of partial dif- 
ferential equations of elliptic type have this 
property. 

A kernel that is not square integrable is 
called a singular kernel. An integral equation 
whose domain of definition is unbounded or 
whose kernel is singular is called a singular 
integral equation [9]. Singular integral equa- 
tions have some particular properties that are 
not seen in ordinary integral equations, i.e., 
integral equations with kernels continuous in a 
bounded closed domain. For example [10], 
consider the identity 


= D 
sin xy 4 d 
ale "ich 
= ZE 
fk a? + x? 


where « is an arbitrary real number. This 
equality shows that for the continuous kernel 
/2/rsin xy, 1 is an eigenvalue and Jn2 e 
+x/(a* + x?) is a corresponding eigenfunction. 
Since a is arbitrary, the index of the eigenvalue 
1 is evidently infinity. As another example, 
observe the equality 


ao 2 
"lege "IR dy = ~iax 
e e = ev 
e qx 





where « is an arbitrary real number. From this 
equality, we see that for the continuous kernel 
e ^ defined on (—oo, oo), and number A= 
(1 2- 42)/2 greater than or equal to 1/2 is an 
eigenvalue. In this example, the spectrum, i.e., 
the set of eigenvalues, is a continuum. Such a 
spectrum is called a continuous spectrum. 

In applications, an important role is played 
by integral equations with kernels of Carleman 
type: 


K(x, y) 2 G(x, yJC y — x), 


where G(x, y) is a bounded function. In inte- 
gral equations with such kernels, the integral 
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is taken in the sense of the tCauchy principal 
value [5,9, 11, 12]. For example, the Riemann- 
Hilbert problem is the following: We are given 
a simple closed and smooth curve L in the 
complex plane and real-valued smooth func- 
tions a, b, and c defined on L, a+ib never 
vanishing. The problem is to find a function 
(z) which is holomorphic in the exterior of L, 
at most of polynomial growth at infinity, and 
continuous up to L with boundary value o" 
such that Re[(a-- ibp* ] - c on L. This prob- 
lem is reduced to that of finding a function Au 
defined on L satisfying an integral equation 


G(z, 
ua- | EE er) 
L z—c 


where G ts a smooth kernel determined by 

a t ib, f is a known function depending on c, 
and the integral is taken in the sense of the 
Cauchy principal value. The integer x defined 
by k — (1/n) f, d(arg(a — ib)) is called the in- 
dex of this problem. The full solution of the 
Riemann-Hilbert problem was given by I. N. 
Vekua [11]. 

A multidimensional analog of the Cauchy 
principal value is the singular integral of A P. 
Calderón and A. Zygmund (— 251 Linear 
Operators). À smooth function k(x) defined 
in R” except at x =0 is called a kernel of 
Calderón-Z ygmund type if k(x) is positively 
homogeneous of order —n and if its integral 
mean on the unit sphere is zero. Then the 
operator K defined by 


Kozim | | k(yf(x— y)dy 
ze 

is called a Calderón-Zygmund singular integral 
operator [13]. K is a bounded linear operator 
in L?(R") if 1 «p « oo. If nz 1 and k(x)« 
l/(zix), K is nothing but the *Hilbert trans- 
formation. The pseudodifferential operator (— 
345 Pseudodifferential Operators) is an exten- 
sion in some sense of the singular integral 
operator. 


K. Systems of Integral Equations 
A system of Fredholm integral equations of 
the second kind can always be reduced to a 


single equation. In fact, as is seen easily, a 
system of integral equations 


1 
air] Ki(x, y)e;(y)dy 9 f(x) 
j Jo 


can be reduced to a single equation 


$(x)— 4 r K(x, y)®(y)dy= F(x) (O<x<n), 
0 
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where 


O(x)=o(x—i+1), F(x)=f(x—i+ 1), 


K(x, yy=K,(x—i+ 1,y—j+1) 
(i—1l<x,j-l<y<jij=1,2,...,n). 


A system of Volterra integral equations of the 
second kind can be reduced to a single equa- 
tion by eliminating the unknown functions 
successively. 


L. Integral Equations of Volterra Type 


Consider a Volterra integral equation of the 
first kind 


| K(x, y)o(y dy = f(x) 
such that K(x, x) Z0 and K (x, y) and f'(x) are 
continuous. If we differentiate both sides of the 
equation, then we have a Volterra integral 
equation of the second kind: 


* Kx, y) Zo 
oc: K(x,x) e(ydy- KG xy 








a 


Abel’s integral equation of general form is 





*G 
| CoN) a= f (0 «a « 1). (9) 
o (x—y) 


If G, G,, and f’ are continuous and G(x, x) z 0, 
equation (9) can be reduced to the equation 


| iu sonar | Tina dx, 
0 0 
where 

" — G(x, y)dx 


Hee] qua) "(GP 





. Since H(u, u) ^ (r/sinax)G(u, u) Z0, it follows 


that 





“H ; 
TH End (9a) 


where 


g(u)  H(u,u) ' - [roe =x) ae 
du Jo 
- H(u,u) ' Le mm 


d var) 
0 

Clearly (9a) is a Volterra integral equation of 
the second kind. Abel's problem (— Section A) 
was to find the path of a falling body for a 
given time of descent. The problem then can 

be reduced to the solution of equation (9) with 
G(x, y) 81 and «— 1/2. When G{x, y)=1, we 
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can solve equation (9) explicitly to get 


sin a7 


p(x) 





("ro | te ECHT i) 


T 0 


[5]. 


M. Nonlinear Integral Equations 


When a nonlinear integral equation includes a 
parameter 4, it may happen that the parameter 
has a bifurcation point, i.e., a value 4, such that 
the number of real solutions is changed when 
À varies through 4, taking real values. For 
example, consider the equation 


1 
BEEN 9^ (y) dy - 1. 
This equation has real solutions ọ(x)=(1 + 
/ 1 —44)/24 for 4x 1/4 but no real solution 
for A> 1/4. Hence A, = 1/4 is a bifurcation 
point (— 286 Nonlinear Functional Analysis). 
Among nonlinear integral equations, Ham- 
merstein's integral equation has been studied 
in detail [5, 14]. It is an equation of the form 


o | K(x, y) f(y, e(y)) dy =0. (10) 
D 


If K(x, y) and f(y, 0) are square integrable and 
f(y. u) satisfies a tLipschitz condition in u with 
a sufficiently small coefficient, then the inte- 
gral equation (10) can be solved by successive 
approximations. If K(x, y) is a square inte- 
grable positive kernel and [5|K(x, y)|? dy is 
bounded, then we can prove the existence and 
uniqueness of a solution of (10) under a con- 
dition on f(y, u) weaker than a Lipschitz con- 
dition. We can prove similar results for equa- 
tion (10) with a nonsymmetric kernel when 
K(x, y) is continuous in the mean, that is, 


lim | [K(x' y) - K(x, y)? dy =0, 
xx D 


im | IK(x, y) — K(x, y)? dx =0. 
Y^» Jp 


À nonlinear Volterra integral equation of the 
form 


ETH F(x, y; (yp dy 
can be solved by successive approximations 

if F(x, y; u) and f(x) are square integrable, 

F(x, y; u) satisfies a Lipschitz condition 

| F(x, y; u') — F(x, y; u^)| < k(x, y)|u' — "| with 
some square integrable function k(x, y), and 

Jz F(x, y; f(y)) dy is majorized by some square 
integrable function of x. When F(x, y; u) and 
f(x) are continuous, we can obtain theorems 
on the existence and uniqueness of continuous 
solutions and tcomparison theorems similar to 
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those for initial value problems in ordinary 
differential equations [16]. 


N. Numerical Solution 


For the numerical solution of integral equa- 
tions, we assume throughout that the func- 
tions appearing are all continuous and the 
solution of every equation is unique. Methods 
of numerical solution can be divided roughly 
into two classes. Methods of one class try 

to evaluate numerically the analytical solu- 
tion described in the preceding sections, and 
methods of the other try to obtain a solution 
by transforming the problem to one that is 
numerically solvable. 


(1) A Method Based on Numerical Quadra- 
ture. Consider the integral equation 


b 
| F(x, y, p(x), o( y) dy =0. 
Let a=x,<x,<...<x,=b be points on the 
interval To b] and o, @,...,@, be the values 
of o (x) at x,, x5, ..., x,. By the use of numer- 
ical quadrature, we then have the following 
system of equations in q;: 


2. agF(x,x,9,9)-0  (i21,2,...,n). 
fa 


The method corresponds to that of solving 
ordinary differential equations by their dif- 
ference equation analogs. Hence the errors 
involved in the solutions obtained by this 
method can be analyzed similarly to those in 
the case of numerical solution of ordinary 
differential equations (— 303 Numerical Solu- 
tion of Ordinary Differential Equations). If 
the given integral equation is a Fredholm 
equation of the second kind, then we have a 
system of linear equations in q;. If we apply 
quadrature formulas to the integral appearing 
in the integral equation by using the values of 
Q; obtained, then we have a formula by which 
the solution can be evaluated directly, that is, 
without using an interpolation formula. 


(2) A Method Utilizing Recurrence Formulas. 
Let d, and d,(x, y) be the respective coefficients 
of A" in the expansions of Fredholm’s determi- 
nant D(A) and Fredholm's first minor D(x, y; A). 
They satisfy the recurrence formulas 


b 


d,(X, y)=d, K(x, | K(x, sid, — (s, y)ds, 


a 


1 b 
SES i dis, s) ds, 


n+1 7 


dọ=1,  do(x, y)=K(x, y). 


By the use of these formulas, we can compute 
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d, and d,(x, y) successively and hence evalu- 
ate D(A) and D(x, y; 4) approximately, and by 
means of these recurrence formulas we can 
readily obtain a solution of a Fredholm equa- 
tion of the second kind. 


(3) A Method Utilizing Approximate Kernels. 
If we replace a kernel by an approximate one 
in a Fredholm integral equation of the second 
kind, then we have an integral equation that 
has a solution approximately equal to the 
solution of the original equation. Hence if we 
can find an approximate kernel for which 

an integral equation can be solved numeri- 
cally or analytically, then we can find an ap- 
proximation to the desired solution by solving 
the modified equation. For such solutions, a 
method of error estimation was given by F. G. 
Tricomi [17]. 


(4) An Iterative Method. Consider the integral 
equation 


b 
(p(x) = | F(x, y, p(x), oy) dy. 


Let (x) be an adequate function, and define 
P(x) successively by 


b 
Pari (x)= | F(x, y, Pn(X)s q,Cy)) dy. 
If the sequence {¢,(x)} converges, then the 
limit Im, Qn(X) = @(x) is a solution of the 
given equation, and hence we can obtain an 
approximation to a solution by calculating 
Q,(x) for some finite n. This method can be 
used effectively for Fredholm integral equa- 
tions of the second kind with a parameter 
A, provided that the absolute value of 4 is 
smaller than the least absolute value of the 
eigenvalues. 


(5) Variational Method. If some conditions are 
fulfilled, an integral equation of the form 


b 


G(x, p(x))+ | F(x, y, p(x), p(y))dy=0 


a 


can be regarded as an fEuler-Lagrange equa- 
tion for a variational problem 


b (^b 
s=] | E(x, y, u(x), u(y))dx dy 


+ | H(x,u(x)) dx =extremal. (11) 


In this case, we can find a solution of the given 
integral equation numerically by solving the 
variational problem (11) numerically [18]. 


(6) Enskog's Method. Suppose that {¢,,(x)} is 
a complete orthonormal system for the Fred- 
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holm integral equation (7). If we put V,(x) — 
n(x) — å fa KG, x) e, (Y) dy, then from (7) we 
have 


b b 
| potid- | f(x)e, ix) dx, (12) 
and furthermore we see that {w,(x)} can be 
orthonormalized to yield a complete ortho- 
normal system (y,(x)]. The equality (12) then 
shows that the Fourier coefficients of a solu- 
tion ~(x) with respect to the system {y,(x)} 

can be obtained readily from the Fourier 
coefficients of f(x) with respect to the system 
1e, (x)). This method of obtaining a solution is 
called Enskog's method. 


For Volterra integral equations, methods (1) 
and (4) can be used effectively. We usually 
transform equations of the first kind into 
equations of the second kind by differentiation 
and then apply the above numerical methods. 
This is done for the sake of securing stability 
of the numerical methods. 
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A. General Remarks 


Integral geometry, in the broad sense, is the 
branch of geometry concerned with integrals 
on manifolds, but the problems considered in 
integral geometry are usually of a more limited 
nature. If a *Lie group G acts on a "different. 
able manifold M as a tLie transformation 
group, G also acts on various figures on M, 

by which we mean geometric objects such as 
tsubmanifolds of M, ttangent r-frame bundles 
on M, etc. Let F be a set of such figures on M 
invariant under G (ie, gFe F forgeG, FEF). 
Consider the following problems: (1) to know 
whether any G-invariant *measure u on F 
exists, and how to determine uo if it exists; (ii) to 
find the integral [ p(F)dy(F) of functions o on 
F with respect to the measure p. 

The term integral geometry was introduced 
by W. Blaschke, who considered the special 
case of problem (ti) tn which (F) is a function 
representing geometric properties of F and the 
integral is to be evaluated by means of the 
geometric invariants concerning Z [1]. Prob- 
lems of so-called geometric probability (such 
as the problem of Buffon's needle) belong to 
this category. The measure yp is called the 
kinetic measure (or kinetic density), and du(F) 
is also denoted by dF. If has the structure of 
an n-dimensional differentiable manifold and 
the measure u is given by a tvolume element w 
De, a positive tdifferential form of degree n), 
we denote o also by dF. Problem (i) is simple: 
If G acts 'transitively on F, then F — G/H, 
where H is the tisotropy subgroup of G. In 
this case F has the structure of a differentiable 
manifold, and if a G-invariant t(Radon) mea- 
sure exists, it is unique up to a multiplicative 
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constant. A condition for the existence of a G- 
invariant measure 4 can be given by means of 
*Haar measures of G and H (— 225 Invariant 
Measures). We now consider some examples. 


B. Crofton's Formula 


Let G(p, 0) be a straight line defined by the 
equation x, cos6+x,sin@=p with respect to 
orthogonal coordinates in a Euclidean plane. 
Let n(p, 0) be the number of intersections of 
G(p, 0) with a curve C of length L. Then we 
have Crofton's formula, 


ILE 0)dpd0 — 2L, (1) 


where dp d is the exterior product of the 
differential forms dp, d0 of degree 1, and the 
integral is extended over pe(—oo, oo) and 
0€ [O, 2x]. 


C. Poincaré's Formula and the Principal 
Formula of Integral Geometry 


The kinetic density dF of a figure F congruent 
(with the same orientation) to a fixed figure in 
a Euclidean plane is defined as follows: Let R 
be an orthogonal frame attached to F, (x,, x;) 
be the coordinates of the origin of R with 
respect to a fixed orthogonal frame R,, and 

0 be the angle between the first axis of R and 
the first axis of Ro. If we put dF — dx, dx; d0 
(exterior product), dF has the following in- 
variance properties: (1) dF is not changed by 
displacements of F; (ii) dF is not changed if 
instead of R we take another orthogonal 
frame R' attached to F. 

Let two plane curves C,, C, of length L,, 
L,, respectively, be given, and suppose that C, 
is fixed and C, is mobile. If the number of 
intersections of C, in an arbitrary position 
with C, is finite and equal to n, then the in- 
tegral of n extended over all possible positions 
of C, is given by 


| ndC,=4L,L, (2) 


(Poincaré’s formula). L. A. Santaló applied this 
result to give a solution of the fisoperimetric 
problem (1936). 

Let C,, C; be two plane ‘Jordan curves of 
length L,, La, respectively, and let S,, S, be 
the areas of the domains bounded by C,, C3, 
respectively. Suppose that C, is fixed and C; 
mobile, and let y be the number of connected 
domains common to the domains bounded by 
Ci, C5 for C, in an arbitrary position. Then 
the integral of y extended over all possible 
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positions of C, intersecting C, is given by 


[ues m Lian Sa (3) 


(Blaschke [1]). This is the principal formula of 
integral geometry. Many formulas can be 
derived from it as special cases or limiting 
cases. 


D. Generalization to Dimension n 


The kinetic density of subspaces of dimension 
k in a Euclidean space and in a spherical space 
of dimension n was given by Blaschke, while 
the generalization of the principal formula 
(3) to a Euclidean space of dimension n was 
given by S. S. Chern, applying the methods of 
E. Cartan. 

Let (e, ...,e,) bea positively oriented ortho- 
normal frame with vertex A. The infinitesimal 
relative displacements are then given by 


dA—- we, de,= > oe, 
i=l j=1 
where œ;=(dA, ej), w= (de;, e)= — w; are 


differential forms of degree 1 in the orthogonal 
coordinates of A and the n(n — 1)/2 variables 
that determine e,, ...,e,. For various posi- 
tions of a figure, we take an orthogonal frame 
(A, €, , e,) fixed to this figure and form the 
texterior product 
dK — ^ o, A o 

i i<j 
of all the c; and cj; (i<j). This has the invar- 
iance properties (i) and (ii) of Section C and 
is, by definition, the kinetic density of the fig- 
ure in an n-dimensional Euclidean space. 
Moreover, the kinetic density of dE of k- 
dimensional subspaces E can be obtained by 
considering the orthogonal frames such that 
the vertex A and e, ...,e, lie on E and by 
forming the exterior product of the corre- 
sponding o,, oi, (#=1,...,k;2=k+1,...,n), 


dE = Ao,A0,;. 


Let 2 be a compact orientable hypersurface 
of class C? in an n-dimensional Euclidean 
space, and let k, (x — 1,...,n— 1) be the fprin- 
cipal curvatures at a point on X. Denote by S; 
the "elementar symmetric form of degree i in 
k, (i— 1, ...,n— 1), and put S, — 1. Then con- 
sider the integrals over 2: 


m-f sas/ (^77) i=0,1,...,n—1, (4) 
5 l 


where dS denotes the surface element of X. Let 
D,, D, be the domains bounded by two com- 
pact orientable hypersurfaces X,, X, of class 
C? with volume V,, V,, respectively, and let 
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Mj", M)?! be the integrals M; defined by (4) for 
2, 25. If X, ts fixed, X, is mobile, and the 
'Euler-Poincaré characteristic y of D, N D, is 
finite, then the generalization of (3) has the 
form 


fraz: =] DEE Li (wes V, T MU, V, 


n-2 
HE MOM 4) o 
Nk=0 
(Chern's formula), where dX, is the kinetic 
density of X, and I, (k=1,...,n—1) is the area 
of the unit sphere in a Euclidean space of 
dimension k+ 1, with the integral extended 
over all positions of X, intersecting 2,. 

Let dE be the kinetic density of the sub- 
spaces E of dimension k intersecting a com- 
pact orientable hypersurface X of class C?, and 
let y be the Euler characteristic of the intersec- 
tion of E with the domain bounded by X. The 
integral f y dE extended over all hyperplanes of 
dimension k intersecting X is proportional to 
M, relative to the hypersurface X defined by 
(4). This fact generalizes (1). Further generali- 
zations were obtained by Chern (1966). 


E. Other Generalizations 


For 2-dimensional spaces of constant curva- 
ture, Santaló derived formulas analogous to 
those in a Euclidean plane (1942-1943) and 
thus solved the isoperimetric problem in these 
spaces. In 1952, he derived a formula corre- 
sponding to (5) in n-dimensional spaces of ' 
constant curvature, following Chern's method. 
He investigated further integral geometry in 
affine, projective, and Hermitian spaces. 

Chern and others obtained the results of the 
previous sections by Cartan's method of gen- 
eral moving frames and studied integral geom- 
etry in the setting of the geometry of Lie 
transformation groups in the sense of F. Klein 
(— 137 Erlangen Program). 

Chern, P. Griffiths, and others studied the 
value distribution theory of holomorphic 
mappings in several complex variables from 
the point of view of integral geometry (1961) 
(— 21 Analytic Functions of Several Complex 
Variables, 124 Distribution of Values of Func- 
tions of a Complex Variable). 


F. Radon Transforms 


Another important topic of integral geome- 
try is the theory of Radon transforms. Let ¥ 
be the set of hyperplanes £(c, p) 2 (x e R"|(x, c») 
=p} in the Euclidean space R”, where œ = 

(^, ..., Ån) iS a unit vector, (x, œ)= X x;4;, and 


p is real. For a function f defined in R", define 
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f(& 9 fo, p) on F by 
f= | Ff (x)dex, (6) 
$ 


where d;x is the tvolume element on the 
hyperplane Z such that d.x ^ È A4; dx, — dx, with 
dx the volume element of R". Then f is called 
the Radon transform of f. For example, if f is 
the tcharacteristic function of a bounded 
domain V, the value fi (£) of the Radon trans- 
form f of f at Ee F is the volume of the sec- 
tion of V by č. Now the group G of *motions 
of R” (the tconnected component of the iden- 
tity of the group of isometries) acts on F tran- 
sitively. For every xe R", the tisotropy sub- 
group G, of G with respect to x acts transi- 
tively on the set x= (£e F |xe č}. Since G, is 
compact, there exists a unique normalized G,- 
invariant measure u on X such that p(x)=1. 
For a function g on F, the conjugate Radon 
transform g can now be defined by 


g(x) -| g(o)du(c) (7) 
xeó 


as a function on R". 

The determination of j belongs to problem 
(ii) of integral geometry mentioned in Section 
A. In particular, it is important to determine 
g=(f)” for g=f and to find the relation be- 
tween ( f ) and f. These problems were solved 
by J. Radon for n=2, 3 and by F. John in the 
general case. The results can be formulated as 
follows: 

In the case of odd n, let Y be the space of 
trapidly decreasing C*-functions (— 168 
Function Spaces). Let S*(F) be the set of 
g(o, p)e "(7 ) such that f? g(o, p)p* dp =0 for 
every natural number k and every w. For every 
f € F(R") and every ge ¥*(F), we then have 


f cA HI, 
where A is the Laplacian in R” and Lg(o, p) 
— d? g(o, pJ/dp?, c V (n/2)! (2xi)! "n". 

In the case of even n, for every fe A(R"), 
ge (7), 
faa AQ y, 


where 


g-cL" D), 


FESTER EI 


um Tix — yl?" dy, 
R 


J;(g)(a» p) = |i gio, p) p — q| "dq. 
These integrals are in general divergent, and 
they must be interpreted as regularizations 
defined by analytic continuation with respect 
to the powers of |x — y| or |p — q| [9]. 

John applied the Radon transform on R" to 
the study of partial differential equations in R" 
with constant coefficients [8]. 
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A formula corresponding to tPlancherel’s 
theorem and an analog of the tPaley-Wiener 
theorem for the Fourier transform are valid 
for the Radon transform (I. M. Gel’fand et al. 


[6]). 


G. Horospheres 


The theory of the Radon transform is also 
important in noncompact tsymmetric Rie- 
mannian spaces M. The connected component 
G of the identity in the group of isometries of 
M is isomorphic to the fadjoint group Ad G 
and can be considered as a linear group. Maxi- 


. mal *unipotent subgroups of G are conjugate 


to each other. If N is such a subgroup, we call 
the torbits on M of gNg '! horospheres on M 
for ge G. These correspond to the hyperplanes 
in R”. If M is the complex upper half-plane 
with the thyperbolic non-Euclidean metric, 
the horospheres are precisely the circles tan- 
gent to the real axis and the straight lines 
parallel to the real axis. 

The group G acts transitively on the set 
F of horospheres on M, and we have F = 
G/M, N. Here G= KAN is an tIwasawa de- 
composition of G, and M, is the *centralizer 
of A in K. For a horosphere Ze F, let d.x be 
the volume element on Z with respect to the 
*Riemannian metric on € induced by the Rie- 
mannian metric on M, and define the Radon 
transform f ofa function f on M by (6) asa 
function on F. For every xe M, there exists a 
unique normalized measure on X= (£e F | 
xeč} invariant under the (compact) isotropy 
subgroup G, of G at x (u(X) 2 1). The conju- 
gate Radon transform g of a function g on F 
is defined by formula (7) by means of this 
measure u. Then there exists an integrodif- 
ferential operator A such that if A* is the 
adjoint operator, we have the inversion for- 
mula f Z(AA* f )' and Plancherel’s theorem: 


| fœ] dx= | AÑO ae, 


where dx, dë are G-invariant measures on M, 
F , respectively, and f is an arbitrary C^- 
function with compact support. If the *Cartan 
subgroups of G are conjugate to each other, A 
is a differential operator; the inversion formula 
can then be written in the form fe L((f)") 
with some differential operator L on M [9]. 

S. Helgason applied the Radon transform 
on noncompact symmetric Riemannian spaces 
to solve differential equations on these spaces 
(1973). 

Horospheres and Radon transforms can be 
defined not only for symmetric Riemannian 
spaces G/K, but also for various thomoge- 
neous spaces G/H of noncompact semisimple 
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Lie groups G. The Radon transform f >f 
maps a function f on G/H into a function on 
the space of horospheres on G/H. If a tunitary 
representation U of G is realized in a function 
space over G/H, the Radon transform f>f 
helps to clarify the properties of U by going 
over to the function space on ¥. In several 
examples, Gel'fand, Helgason, and others have 
by this method decomposed U explicitly into 
direct integrals of irreducible representations 
[6, 7,9, 10]. 

Further work on the Radon transform on 
‘symmetric Riemannian spaces (e.g., compact 
symmetric Riemannian spaces of rank one and 
TGrassmann manifolds) has been done by 
Helgason and others (1965). A generalization 
of Radon transforms to differential forms has 
also been given by Gel'fand and others (1969). 


H. Another Generalization 


Integral geometry can also be investigated 

in spaces admitting no displacement. Let 
F(X4, X5, Y1, Y2) be positive and homoge- 
neous of degree 1 with respect to y,, y;, and 
consider a ‘stationary curve of the integral 
LG, x5, dx,/dt, dx;/dt)dt. Let p; - OF/Oy; 

( y; 7 dxj/dt) along this curve. Poincaré found 
that for a 2-parameter set of stationary curves, 
dx, dp, - dx, dp; is not changed by any dis- 
placement of the line element (x,, p;) along a 
stationary curve. Blaschke took this as the 
kinetic density of the stationary curve and 
proved a formula containing formula (1) as a 
special case. On the other hand, Santaló intro- 
duced kinetic density for sets of geodesics on 
2-dimensional surfaces and proved a generali- 
zation of formula (2). The study has been fur- 
ther extended to tfoliated manifolds. Some 
uniqueness theorems for various integral geo- 
metric problems with applications to the study 
of the earth’s internal structure from seismo- 
logical data have been given by V. G. Roma- 
nov [11]. 


References 


[1] W. Blaschke, Vorlesungen über Integral- 
geometrie, Teubner, I, 1936; IL 1937. 

[2] S. S. Chern, On integral geometry in Klein 
spaces, Ann. Math., (2) 43 (1942), 178—189. 
[3] S. S. Chern, On the kinematic formula in 
the Euclidean space of n dimensions, Amer. J. 
Math., 74 (1952), 227-236. 

[4] L. A. Santaló, Introduction to integral 
geometry, Hermann, 1953. 

[5] L. A. Santaló, Integral geometry and geo- 
metric probability, Addison-Wesley, 1976. 
[6] I. M. Gel’fand, M. I. Graev, and N. Ya. 
Vilenkin, Generalized functions V, Integral 


836 


geometry and representation theory, Academic 
Press, 1966. (Original in Russian, 1962.) 

[7] I. M. Gel'fand and M. I. Graev, Geometry 
of homogeneous spaces, representations of 
groups in homogeneous spaces and related 
questions of integral geometry, Amer. Math. 
Soc. Transl., (2) 37 (1964), 351—429. (Original 
in Russian, 1959.) 

[8] F. John, Plane waves and spherical means 
applied to partial differential equations, Inter- 
science, 1955. 

[9] S. Helgason, A duality in integral geome- 
try; some generalizations of the Radon trans- 
form, Bull. Amer. Math. Soc., 70 (1964), 435— 
446. 

[10] S. Helgason, A duality for symmetric 
spaces with applications to group representa- 
tions, Advances in Math., 5 (1970), 1—154. 
[11] V. G. Romanov, Integral geometry and 
inverse problems for hyperbolic equations, 
Springer, 1974. (Original in Russian, 1969.) 
[12] R. Deltheil, Probabilités géométriques, 
Gauthier-Villars, 1926. 

[13] D. G. Kendall and P. A. Moran, Geo- 
metrical probability, Griffin, 1963. 

[14] M. I. Stoka, Géométrie integrale, 
Gauthier-Villars, 1968. 


219 (XIII.14) 
Integral Invariants 


A. Poincaré’s Integral Invariants 


Let us view a system of differential equations 
dxj/dt = Xj(x, ...,x,,t) (i2 1,2, ..., n) as defin- 
ing the motion of a point whose coordinates 
are (x;, ..., x,) in the n-dimensional space R” at 
time t. Let K be a p-dimensional manifold 

(1 &pxn) in R”, and let K, be the set occupied 
at the instant t by the points which occupy K 
for t— 0. If the integral 


| F(x, ...,X,, t)dw, (1) 
K, 


where dw is the tvolume element of K,, does 
not depend on t for any p-dimensional surface 
K, then | F(xi, ...,x,, t) dw is called an integral 
invariant of degree p of the original system of 
differential equations. (The ‘differential form 
F dw is also called an integral invariant.) In 
particular, a necessary and sufficient condi- 
tion for an integral f M(x, t)dx, ... dx, to be 
an integral invariant of degree n is 0OM/ét+ 
1-1 (M Xj/Ox, 20. Furthermore, a neces- 
sary and sufficient condition for an integral 
[2 Mats, t) dx; to be an integral invariant 
of degree 1 is 0M,/ét + X- ((0M;/Oxj) X; + 
(€ X;/0xj) Mj) — 0. If the integral (1) does not 
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depend on t for any closed p-dimensional 
surface K, then [F(x,, ..,X4,t)dw is called a 
relative integral invariant of degree p (1 xp« 
n — 1). Corresponding to this terminology, an 
integral invariant is sometimes called an ab- 
solute integral invariant. 

If a differential form 0 is a relative integral 
invariant of degree p, then its exterior differ- 
ential d0 is an absolute integral invariant of 
degree p+ 1. 

For a Hamiltonian system 


dp,/dt= —0H/0q;, | dq;/dt - OH/Opj (2) 

(i=1,....n,H=H(p,q,t), P=(P1; s ps. q— 

(di> ..., Gell, the 1-form 

w=) p,dqi (3) 
i=l 


is a relative integral invariant. The integral of 
o on a closed curve $ LX 7, p; dq; plays a role in 
classical quantum theory. 

The absolute integral invariant Q=dw= 
ZE, dp; ^ dq; has, as a differential 2-form on 
Bim. the properties: 


Q is a closed form, that is, dQ — 0. (4) 


Let € be a tangent vector (at any 
point). If Q(é, 5) 20 for any tangent 
vector y (at the same point), then č —0. (5) 


Also, for a Hamiltonian system (2), Tos 
fdp; ... dp, dq, ...dq,, is an integral invariant. 
In other words, a 2m-dimensional figure in 
(pi... Poo dio dl Space may change its 
form according to the motion of points, but its 
volume remains unaltered (Liouville's theo- 
rem). This fact is of importance in applica- 
tions to statistical mechanics. 

Poincaré developed the theory of integral 
invariants and applied the theory to the tthree- 
body problem and the problem of tstability 


[1]. 


B. Cartan's Extension 


Poincaré treated the position (x,,..., x,) and 
the time t separately. E. Cartan extended 
Poincare’s theory by unifying the treatment of 
position and time. 

We may consider the solution curves of a 
Hamiltonian system (2) through all points of a 
closed curve C in (2m + 1)-dimensional space 
(p, q.t) together with the tube consisting of 
these solution curves. For any closed curve 
C, lying on and enclosing the tube, we have 
fc X pida; — H dt - c, 2i-, p;dq; — H dt, and 
X7 pidq;— H dt is called Cartan's relative 
integral invariant. If the curve C lies on t= 
constant, then fo is a relative integral invar- 
iant, in Poincaré's terminology. 











219 Ref. 
Integral Invariants 


The exterior differential of this form d(@— 
H dt) - Q— dH adr has the property: 


Let (p(t), q(t), t) be a solution curve for 

(2). For the velocity vector £, =(p(t), 

q(t), 1) and any vector ¢, in (p,q, t) 

space, we have (Q— dH adt)(f,,0,)=0. (*) 


Therefore the integral of the 2-form over a 
tube consisting of solution curves equals 0. 
Applying this to the tube enclosed by C and 
C,, we have the former equality. 

(*) also implies that a solution of (2) is de- 
rived from the fvariational problem of finding 
the extremal of the functional 


| È pjlt)4;(t) — H( p(t), q(t), ch 


for the curves (p(t), q(t), t) (to <t € t,) satisfying 
q(t)) di (i—0, 1) for given to, t,, q? and q! 
(without any condition for p — p(t)). 

If H is given by the *Legendre transforma- 
tion H X, pd; - (T — U), where T = T(q, à) 
is a kinetic energy, U — U(q) is a potential, and 
p; = 0T/0q;, this variational problem considers 
a wider class of curves than tHamilton’s prin- 
ciple f(T- U)dt =0 (because there are no 
restrictions on p(t)). However both variational 
problems give the same extremal [3]. 

The vector č —(p(t), q(t)) is reconstructed by 
means of Q and dH as follows. Put ¢, —(£, 1) 
and C, — (5,0) (Di is a vector in (p, q) space) in 
(*), then 


Q(6,n) 2 —dH -n for any yn. (6) 


Such a Z is uniquely determined by virtue of 


(5). 


C. Symplectic Structure 


Let M be a 2m-dimensional differentiable 
manifold. A differential form © of degree 2 on 
M is called a symplectic structure if it satisfies 
(4) and (5). And then (M, Q) is called a symplec- 
tic manifold. 

For H: MR, we define the vector field č 
over M by (6) (at each point xe M and ne 
T, M). H is called a Hamiltonian function 
(independent of t) and € is called a Hamil- 
tonian vector field. 

In this case, we also have an argument 
similar to that for the case of the usual Hamil- 
tonian systems in Euclidian space. For exam- 
ple, H is invariant along the flow generated 
by the vector field £, and Q is an integral 
invariant. 


References 


[1] H. Poincaré, Les méthodes nouvelles de la 
mécanique céleste III, Gauthier-Villars, 1899. 


220 A 
Integral Transforms 


[2] E. Cartan, Leçons sur les invariants in- 
tégraux, Hermann, 1922. 

[3] V. I. Arnold, Mathematical methods of 
classical mechanics, Springer, 1978. 


220 (X.27) 
Integral Transforms 


A. General Remarks 


Given (real- or complex-valued) functions f(y) 
and K(x, y) such that their product is tinte- 
grable as a function of y in the interval [a,b], 
we set 


«- | K(x, y) f(y) dy. (1) 


This transformation of f to g is called the inte- 
gral transform with the kernel K(x, y). Now fix 
the kernel K(x, y). When the correspondence 
f—g given by (1) from a set of functions f 

to a set of functions g is bijective, we can con- 
sider the inverse transform o f. The for- 

mula that describes the inverse transform g f 
in terms of an integral transform is called the 
inversion formula. The kernel K (x, y) can often 
be written as k(x — y), k(xy), etc., where k(t) 

is an integrable function. Table 1 contains 
integral transforms that are important in 
applications. 


Table 1 

Kernel Interval Name 

e? (—00.o0) Fourier transform 

COS Xy (0, œ) Fourier cosine 
transform 

sin xy (0, oc) Fourier sine 
transform 

e v (0, oc) Laplace transform 

Vxy J,(xy) | (0,00) Hankel transform 

1/(x —y) (—œ,%) Hilbert transform 

get (0, o0) Mellin transform 

(x+y)? (0, oo) Sueltjes transform 

eT yy (—0c,co) Gauss transform 





In the Hankel transform, J, is the tBessel 
function. In the Hilbert transform, the tprin- 
cipal value is to be taken in the integral. In the 
Stieltjes transform, p is assumed positive. 

Since the explanations for the Fourier trans- 
form and Laplace transform are given in the 
corresponding articles, we deal here only with 
generalized Fourier, Hilbert, Mellin, and 
Stieltjes transforms. 
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B. Generalized Fourier Transform 


Suppose that the kernels of the transform (1) 
and its inverse transform are both of the form 
k(xy). Hence 


b 
I w=| k(xy)g(y)dy. (2) 


In this case, we call the integral transform (1) 
the generalized Fourier transform of symmetric 
type or the Watson transform, and k(t) the 
Fourier kernel of (2). The functions äis COS t, 
Bei sint, and JJ) defined in the interval 
(0, oo) are examples of such Fourier kernels (— 
160 Fourier Transform). The last kernel gives 
rise to the Hankel transform. 

Let k(x) and /(x) be Fourier kernels. Then 
the integral transform m( y) of I(x) with respect 
to the kernel k(xy) is called the resultant of k 
and /. The resultant of two Fourier kernels is 
also a Fourier kernel. 

Suppose that a function K(1/2 + it) satisfies 


K(1/2+it)K(1/2—it)=1, |K(1/24- i))| — 1. 


Then the function k(t) 2 K(1/2 + it)/(1/2 — it) 
belongs to L;(—o0o, oo). We set 
T 


k,(x) - Lim. | k(t)x 2 qt 

2n Too -T 
(the limit is taken in the *mean convergence of 
order 2). Then for a function f(x)e L,(0, oo), 


d |°” dt 
g(x) -<| k,(xt)f(t)— 
x Jo t 
exists almost everywhere, and g(x)e L, (0, oo). 
In this case, we have the inversion formula 


d CS d 
Jie c i a (xt)9()-— 


and the Parseval identity 
| LO dt — | (g(x))? dx. 
0 0 


If a function f(x) is invariant under a gen- 
eralized Fourier transform, then f(x) is called 
a self-reciprocal function. Such a function f(x) 
is a solution of the homogeneous integral 
equation 


b 
=f k(xy) f(x)dx. 


The function x? is an example of a func- 


tion that is self-reciprocal with respect to the 
Fourier cosine transform. Using a function 
that is self-reciprocal with respect to the Han- 
kel transform, we can derive the lattice-point 
formula of number theory: Let r(n) be the 
number of possible ways in which a nonnega- 
tive integer n can be represented as the sum of 
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two square numbers. Set 


P(x)= Y" r(nm—nx, 

O<n<x 
where >’ means that if x is an integer, we 
take r(n)/2 instead of r(n). Then f(x) 2 x ??- 
(P(x?/2n) —1) is self-reciprocal with respect to - 
the Hankel transform with v —2. Utilizing 
this, G. H. Hardy proved (1925) 


= /x S siet 


A. Z. Walfisz (1926) and A. Oppenheim (1927) 
generalized this formula and obtained a for- 
mula concerning the number of ways in which 
n can be represented as the sum of p square 
numbers (— 242 Lattice-Point Problems). 


C. Mellin Transform 


If f(x)x* ! eL,(0 


, 00), then 
ra- [ fG)x* dx, s=k+it, 
0 


is called the Mellin transform of f. If f(x) is of 
tbounded variation in a neighborhood of x, 
then the inversion formula 


F(x+O)+f(x—-9) 1. | 





————— lim 


> >i im F(s)x "de 


k—iT 

holds. If f(x)x* "e L,(0, oo), the integral 

fia fo)x*^! dx (se k- it) tconverges in the 
mean of order 2 to a function F(s) for a fixed k 
and —oo « t < oo, and the Parseval identity 


| |f) x?! ix. | F(k + it)|? dt 
0 2n — 00 
holds. F(s) is also called the Mellin transform 
of f(x). If f(x)x* 2, g(x)x!?^* e L.(0, oo), and 
e G(s) are the Mellin transforms of f(x), 
x), respectively, then 


Cu fs) 


The theory of the Mellin transform in the 
function space L, is analogous to the theory of 
the Fourier transform [1, ch. 4]. 


Í ktioo 
ads | F(s)G(1 — aide, 
Ze J i-is 


D. Stieltjes Transform 


For a function a(t) of bounded variation, 


dai 
f= M ape PO 


is called the Stieltjes transform of a(t). Usually 
we assume that p — 1. If the Laplace transform 
is applied twice in succession to a(t), we obtain 
formally the Stieltjes transform of a(t). The 

Stieltjes transform has been studied systemati- 
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cally in connection with the theory of the 
Laplace transform by D. V. Widder, R. P. 
Boas, and others. 

Assume that p=1. Let D be the domain 
obtained from the complex plane by removing 
the negative part of the real axis. If the Stieltjes 
transform converges at a point s = sọ €D, then 
it converges uniformly on any compact set in 
D. The inversion formula is 


t 


i ee + indo 


n> *02zi 
= (a(t + 0) + a(t — 0) —(a( + 0) + ( — 0)))/2, 
t>0. 


If x(t) fo p(u)du and g(t +0) exist, then 


ha —f(—t+in)) 


JO 


—(o(t--0)4-o(t—0))2, | t»0. 


E. Hilbert Transform 


Let o(z) U(x, y) - iV(x, y) (z2 x 4 iy) be 
holomorphic in the upper half-plane and f(x) 
= U(x,0), g(x) = — V(x, 0) be the respective 
boundary values on the real axis. Then if f, 
ge L,(—o0, oo), 


d í 
g(x) 2—p.v. | 
D — a 





f et t) d 





Els Zen) E (4) 


t 


Here p.v. means Cauchy’s principal value, 
that is, 


ZI F(t)dt 


(| b roas f" Finds) 


We call g the Hilbert transform of f. If fe 
Lj(—00, oo), the inversion formula and the 
Parseval identity hold. More precisely, for any 
f € L;( —oo, oo), relations (3) and (4) above hold 
almost everywhere, ge L,(—oo, oo), and the 
L;-norms of f and g are identical (— 160 
Fourier Transform). The importance of Hilbert 
transforms lies in the fact that they establish 
relations between the real and imaginary parts 
of an analytic function (= 132 Elementary 
Particles C). 


= lim 


A>, £> 
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A. General Remarks 


The class of Riemann integrable functions is 
not closed under limits and infinite sums. For 
example, the function equal to 1 for rationals 
and to O for irrationals, the so-called Dirichlet 
function, is not Riemann integrable, though it 
is expressed by Dm... lim, œ (cos(m! nx))?". 
Hence the concept of Riemann integral is too 
narrow to be used effectively in modern anal- 
ysis. To cope with this difficulty, H. Lebesgue 
[1] introduced a general integral which is now 
called the Lebesgue integral. It is not only 
defined for all useful functions appearing in 
analysis but also has nice properties, such as 
exchangeability with limits and infinite sums 
under some simple conditions which can be 

. checked easily. The definition of the Lebesgue 
integral is based on the concept of Lebesgue 
measure (— 270 Measure Theory F), which is 
a generalization of length, area, or volume. In 
modern analysis one discusses integrals in an 
abstract space endowed with a measure and 
defines the Lebesgue integral as a special case. 
Integration theory plays a basic role in mod- 
ern mathematics, in particular in analysis, 
functional analysis, and probability theory. 

Let f(x) be a bounded function defined on a 

bounded interval [a, b]. f(x) is *Riemann inte- 
grable if and only if it is continuous talmost 
everywhere with respect to the Lebesgue mea- 
sure. If f(x) is Riemann integrable on [a, b], 
then f(x) is Lebesgue integrable on [a, b] and 
the integrals coincide with each other. How- 
ever, the improper Riemann integral is not 
included in the Lebesgue integral; for example, 
(sin x)/x is not Lebesgue integrable on (0, oo), 
but lim, ,, [6(sin x)/x dx = 7/2. The theory of 
Denjoy integral gives insight into this situation 
(— 100 Denjoy Integrals). 
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B. Definition of Integrals 


Let X be an arbitrary set. If a completely 
additive class 8 of subsets of X and a tmea- 
sure u defined on 38 are given, then we say 
that a tmeasure space (X, 9B, u) is defined. For 
example, the Euclidean space R”, the class W, 
of all *Lebesgue measurable sets in R", and 
the *Lebesgue measure m, on M, form the 
measure space (R", Wi, m,) (— 270 Measure 
Theory). We consider only 8-measurable sets 
and %-measurable functions in this article, so 
a *B-measurable set will be called simply a 

set and a 38-measurable function simply a 
function. We admit +œ for the values of a 
function. 

The integral f; f(x)du(x) on a set Ec X ofa 
real-valued function f(x) (we write simply 
fc fdu or Le f) can be defined in steps as fol- 
lows. (1) Let f(x) Z0 be a simple function, that 
is, a function whose range is a finite set (aj] 
(i— 1,2, ..., n). If f(x) 2 a; for xe Ej, where E= 
(jar Ej, EN E, = Ø (jh), then we put f; f= 
La x0 ër (Ej. (The value of the integral is a 
real number or +00. For operations concern- 
ing +00, — 270 Measure Theory D.) (2) For 
an arbitrary f(x) 20, we define fz f as the 
tsupremum of Leg. where the supremum is 
taken for all simple functions g such that 
0x g <f. (3) In general, letting f(x) — f * (x) — 
f (x) where f *(x)=max{ f(x),0}, f (x)= 
max { — f(x),0}, we define fe f= feft Ji 
if at least one of [; f* and J, f~ is finite and 
say that f has an integral (or a y-integral) on E 
In particular, if fẹ f is finite, then we say f is 
integrable (or -integrable) on E. 

If the given measure space is (R", 9)i, m,,), 
the integral defined in this section is called the 
Lebesgue integral (or simply L-integral), and 
the function that is integrable in this case is 
said to be Lebesgue integrable. The integral is 
often written as |, f(x)dx, and if E is the inter- 
val [a, b], as f f(x) dx. 


C. Properties of Integrals 


(1) The set of all functions integrable on E 
forms a "linear space over R, that is, if f and 
g are integrable on E, then for any real a, fi, 
af+ Bg is also, and 


| oF + Bodas | il gdp. 


(2) The integrability of f, of both f* and f7, 
and of |f| are mutually equivalent. (3) If g « f 
on E, then fpg € f; f. In particular, if m < 

f(x) &l on E, then mu(E) « f; fdu « lp(E). (4) 
If f(x) is integrable on E (has an integral on 
E), then it ts integrable (has an integral) on 
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every subset of E. (5) If E= ( jJi En EN E,— 
Ø (j z: k), and fp fdu exists, then the series 
Y, Je, fdu converges and is equal to fg fdu 
(complete additivity of the integral). (6) If f(x) z 
0 and f; fdu=0, then f(x) 20 talmost every- 
where on E. (7) Modification of the values 

of f(x) on a ‘null set influences neither the 
integrability nor the value of the integral. 
Consequently, if the function f is not defined 
on a null set, we can define the value of f on 
this set arbitrarily so that the integral has 
meaning. (8) If f(x) is integrable on E, E > Ep, 
and u(E„)—>0, then Im, Je fdu — 0. (9) If 
SAX) € f. (x) on E, and there exists a p(x) 
such that f,(x) 2 (x) and fro —oo (for 
example, if f,(x) 2 0), then 


im | f= | (im4). 


(10) If lim, ,,, f, (x) exists almost everywhere on 
E, and there exists a (x) such that | f,(x)| < 
p(x) and fg « +00 (for example, if (E) « oo 
and | f,(x)| « M), then 


[ben 


(Lebesgue's convergence theorem). (11) If there 
exists a p(x) such that f,(x) 2 (x) on E and 
Luz —oo, then 


lim inf | > | (in inf D 
KEE E EX H" 


(Fatou’s theorem). (12) If f,(x) 2 0 on E, then 
fe Erci fo Xa ety. M f,(x) is real on E, then 
this equality holds for | f,| in place of f,. If the 
common value is finite, then the equality 
holds also for the original f,. (13) Let @ be 
a mapping from (X, 8, u) to (X’, B’, p’) such 
that B'e% implies o ^! (B)e98 and p'(B’)= 
ulo !(B' If f(x’) is B’-measurable on Ee, 
then fe f(x) du'(x') 2 fp- f (o(x)) du(x), where 
the Sally means that if one of these inte- 
grals exists, then so does the other, and they 
have the same value. 


D. Indefinite Integrals 


If we put F(e) — f, f for each measurable subset 
e of E, with f(x) integrable on E, F(e) is a tu- 
absolutely continuous completely additive tset 
function (properties of integrals (4), (5), (8)). We 
call F(e) the indefinite integral of f(x). In the 
special case where X is the set R of real num- 
bers and f(x) is integrable on the interval 

[a, b], the function F(x) = [7 f(t)dt defined for 
x€[a, b] is also called the indefinite integral of 
f(x). The F(x) so defined is an tabsolutely 
continuous function. Conversely, if a function 
F(x) is absolutely continuous on [a,b], then it 
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is differentiable almost oe on [a,b], 
and we have F(x) — F(a) = [7 F'(t)dt. (For the 
relationship between differentiation and in- 
tegration in the case of R", or more generally 
in the case of an arbitrary measure space, — 
380 Set Functions.) 


E. Fubini’s Theorem 


Let (X, B, u,) and (Y, B,, u2) be two o-finite 
measure spaces and (X x Y, 98, u) be their 
*direct product measure space. Assume that 
f(x, y) is 8-measurable and integrable (has an 
integral) on X x Y. Then for almost all fixed 
ye Y, f(x, y) considered as a function of x is 
B -measurable and integrable (has an inte- 
gral), and fy f(x, y) du, (x) is a B -measurable 
function of y. Moreover, in this case we have 


| f(x, y)du(x, y) 


- | | fix, OE 
Y X 


(Fubini's theorem). This fact also holds with x 
and y exchanged. The integral on the left-hand 
side of this equation is called a multiple inte- 
gral, while that on the right-hand side is called 
an iterated (or repeated) integral. 

Even if an iterated integral exists, the corre- 
sponding multiple integral need not always 
exist. For example, let f(x, y) be defined as 
(x? — y?)/(x? + y?Y? on (0, 1) x (0, 1), and other- 
wise 0. Then fro f" =|p2f~ — 00, so that [gf 
does not exist, but 


[ojo 
CU sche) 


By Fubini's theorem, if f is a nonnegative 
function defined on a B-measurable subset 
of a o-finite measure space (X, B, u), the B- 
measurability of f is equivalent to the measur- 
ability of the ordinate set E, = {(x, y)JO& y S 
f(x), xe E} considered as a subset of the mea- 
sure space (X x R, 8’, u^), which is the direct 
product of (X, 8, u) and (R, M, m). In this 
case we have |, fdu — u'(E,), which can serve as 
a definition of the Lebesgue integral. When 
(X,88, u) coincides with (R, 9Wt,, m,), the *Jor- 
dan measure (area) of the ordinate set coin- 
cides with the Riemann integral. 
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Integrodifferential Equations 


A. General Remarks 


A 'functional equation involving an operator 
T of the form 


f(t, x (t), x(t), (Tx) (0)) =9, 
X(to) — Xo, (2) 


reduces to a tdifferential equation, tdifferential- 
difference equation, tintegral equation, inte- 
grodifferential equation, or other functional 
equation when the operator T is given a spe- 
cial form [1,6]. In particular, by letting 


fo St&t,, (1) 


f(t, x, y, z) x —g(t. y) — z, (3) 


et) 
cro [ K (t, s, x(s)) ds, (4) 

to 
we obtain an integrodifferential equation 

e(t) 

x'(t) — g(t, x(t)) | K (t, s, x(s)) ds. (5) 
If o (t) constant or o(t) 2t, (5) is said to be an 
integrodifferential equation of Fredholm type 


or of Volterra type, respectively. 


B. The Initial Value Problem 


Let I be an interval tp <t<t,, Io the interval 
to «t&t,, C(I) a set of continuous functions 
on I, T an operator such that TxeC(1,) if xe 
C(I), & a family of all such Te. a subset of 
d consisting of all. T in & for which Tx < Ty 
holds at t=s whenever x, y are functions in 
C(I) satisfying x(t) « y(t) for ty <t <s for some 
sel, and Z a set of continuous functions 
which are differentiable on Jọ. Let M be the 
class of all functions f(t, x, y, z) defined for te 
: Te; ixl, Iyb |z| « oo and satisfying f(t, x1, y, zi) 2 
F(t, X2, Y, Z2) (X1 2 X2, Z1 &z3). 

Suppose that in (1) f(t, x, y, z) is defined for 
telo, |xl, Iyl, |z| < oo, and Te, Suppose fur- 
ther that for some y >0 and two solutions x, 
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and x; of (1) with (2) belonging to Z, there 
always exist a function we M and an operator 
Qc, such that the inequality 


w(t, x, —x,,x;—x,,Q(x, —x,)«0 


holds for every te lo such that x,(t)—x,(t)=y 
and x;(t) — x,(t) « y for ty « t «t. Furthermore, 
suppose that there exists a function oe satis- 
fying the following inequalities: 


Ox<p<y, telo; 


c(t, p’,p,Qp)>0, telo; 


Go +0) > x5(tg 4-0) — x; (to + 0). 


Then equation (1) with (2) has at most one 
solution xe Z. This result can be established 
by obtaining an estimate for the difference 
|x, (t) 7 x5(t)], where x;(t) is a solution of (1) 
with the condition x(tọ)= 9; (i= 1,2). 

For the particular case of integrodifferential 
equations of Fredholm type, suppose that the 
following conditions are satisfied: 


g(t.y3)—9(6y3 «L(t)(y; —y3) telg; 


| (K (t, s, w,(s)) — K(t, s, w5(s))) ds 
«wo | M(s)(w,(s)—w,(s))ds, tel; 


t 
| sM(s)ds«oo, telo; 
to 


N(t)-tL()€1--t^M(t)) rel 


where y; > y, Le C(I9), wu, w2, M, NE C(I), 
w, >w, M 20, and N 20. Then it is possible 
to obtain more practical expressions for c, Q 
and p: 


3 


w(t, x, y, z) -x—L(t)y —N(t)z, telo, 


ao- | M (s)w(s)ds, 


t 


p(t) — Bt(14- vex | s(1+s)M(s) ds, 


to 


where f) 7 0 is sufficiently small. 


C. Another Problem 


A problem analogous,to the tboundary value 
or 'eigenvalue problems of linear ordinary 
differential equations is to find a solution of 
the linear integrodifferential equation 


(pu — qu saput | k(x, y)u(y) iy) =0 
G 


with the boundary value u=0. This equation 
can be derived from the problem of minimiz- 
ing the functional D[«] under the condition 
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that H[«] is constant, where 


pto- | po a+ | qq? dx, 
G G 
itel- | po? dx 

G 


+ | | k(x, y) e(x)o( y) dx dy. 
GJG 


The orthogonality condition for this boundary 
value problem is given by 


| pu,{x)u,(x) dx 


fh isi 
I | Kæ shuts) eed die izj 


[2]. 

Integrodifferential equations are closely 
related to problems of mathematical physics 
and engineering, and there are many investi- 
gations of such equations in the study of the 
equilibrium of rotating fluids [3], Prandtl’s 
integrodifferential equation for aircraft wings 
in 3-dimensional space [4], the dynamics of 
reactors, and so on. In the second example, the 
circulation of the airflow I (y) with constant 
velocity V around the profile is determined by 
the equation 


ry) " 1 | PP oqp(y) 
Pe = NA ; 
nk(y4(yV nV —bj2 y-y 





(y)= 


called Prandtl’s integrodifferential equation, 
where b is the wingspan, y and y' are variables 
whose range is [ —b/2, b/2] (y is assumed 
fixed), t the length of the chord, x an angle of 
incidence from the point with buoyancy 0, 27k 
the slope of the curve defined from buoyancy 
by the angle of incidence, and p.v. tCauchy’s 
principal value. 

As a problem having applications in the 
theory of tstochastic processes, the existence of 
solutions that have finite limits as t— oo has 
been investigated for the Wiener-Hopf integro- 
differential equation 


[] (D4 23/69—4, ...4, | fix Mt 
k=1 0 


x20, D-d/dx. (6) 


For this problem, by means of the method of 
tsemigroups of operators, equation (6) can be 
extended to 


Heese, ia " (Tf) G)dH(D, 
k=1 [U 


xel, 


where A is the tinfinitesimal generator of a 
semigroup of operators {T;}, and it has been 
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shown that analogous results to those for (6) 
can be obtained for this more general equation 


[5]. 
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A. Lagrange Interpolation 


Assume that the values of a function f(x) with 
some regularity property (e.g., differentiability 
up to a certain order) are given at each of n+ 1 
distinct values x, x4, ..., x,. The method of 
finding the values f(x) at x Z x; rd using the 
values fo — f(xo), fi = f G1). -- = f(x) is 
called interpolation. A function 2n ) that ap- 
proximates f(x) and coincides with f(x) at x — 
Xo, X4, -.., X, is called an interpolation function 
or interpolation formula. 

We usually use a polynomial of degree n as 
L(x). Such a polynomial is called a Lagrange 
interpolation polynomial, and the method of 
using such a polynomial is called Lagrange 
interpolation. If we let 


I(x) 
(x— x) I(x)’ 


the Lagrange interpolation polynomial can 
then be expressed in the form 


10) = pd- De: 


Lix)= Y Wat, 
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I(x) is a polynomial of degree n and is called 
the Lagrange interpolation coefficient. Some- 
times, by interpolation we mean the method of 
finding f(x) for x lying between the maximum 
and the minimum of x;. When x is located 
elsewhere, such a method is called extrapola- 
tion. The Lagrange interpolation polynomial 
is uniquely determined, and if f(x) is (n+ 1)- 
times differentiable, the deviation of L(x) from 
f(x) is given by f "* (E) T(x)/(n - 1)!, where é 
lies between the maximum and the minimum 
of x, Xo, X, Au Conversely, the method of 
finding an approximate value of x satisfying 
f(x) 2 f for a given value of f using L(x) is 
called inverse interpolation. 

Suppose that we restrict the interval of 
interpolation to [ —1, 1] and require that the 
points of interpolation x; be spaced equally, 
x; —2i/n — 1, 1i— 0,1, ..., n. Then, if the number 
of points n 4- 1 is increased, the Lagrange inter- 
polation polynomial L(x) may not converge to 
f(x) even if f(x) is an analytic function of x on 
{ —1,1]. For example, this nonconvergence 
phenomenon is observed when f(x) 2 1/(25 + 
x?) is interpolated; this is called the Runge 
phenomenon. On the other hand, if we choose 
the zeros of the tChebyshev polynomial of 
degree n+ 1 as x,, i.e., if we take x;=cos{ (2i + 
1)n/2(n 4- 1)), i=0,1,...,n, which is called 
Chebyshev interpolation, then L(x) converges 
uniformly to f(x) as n+ 1 tends to infinity, 
provided that f(x) has a bounded derivative 
on [ —1,1]. 


B. Iterated Interpolation 


We can compute the value of the interpolation 
polynomial for given x by generating a se- 
quence of interpolants each of which involves 
one more point than the previous one. The 
following method is called the Aitken interpo- 
lation scheme. First we compute 


1 fo Xo—X 
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and then successively 
1 Jon A Xk—X 
bras cc : , 
s loy..k-ii Xi—X 
i=k+1,...,n. 


If we continue successive evaluation of Ip,, 
1515, ... until the successive values coincide 
within the desired accuracy, then we can ac- 
cept the converged value LL , as an approxi- 
mate value of the Lagrange interpolation 
polynomial L(x) of degree n at x. In this case 
the x; are not necessarily assumed to be ar- 
ranged in monotonic order. It is better to ar- 
range them in order of their distance from x 
rather than in ascending or descending order 
of magnitude. 


C. Interpolation by Finite Differences 


When the interpolation points are equally 
spaced, the values of the interpolation poly- 
nomial can be expressed in terms of finite 
differences. Suppose that for x;= x, 4- ih, where 
his the grid size, the values fo, f}, ..., f, of the 
function f(x) are known. The differences Af; = 
fis: fi are then called (finite) differences of 
first order. Furthermore, we define the dif- 
ferences of order k+ 1 inductively by A**! f, = 
A* f ., — A fi. If we use the shift operator E 
defined by Ef; — f;,,, the difference operator is 
represented as A= E— 1. Sometimes the back- 
ward difference V = 1 — E '! is used. In contrast 
to V, the operator A is called the forward dif- 
ference. The central difference ô = E'? — E 1? 
which is defined by Ai, = f; — fi- is also 
used. Table 1 shows the relations between 

the finite differences and the differentiation 
operator D defined by Df(x) — df(x)/dx. 

Table 2, in which each entry after the second 
column is the difference of the two entries 
lying immediately to its left is called the dif- 
ference table. From the relation A=VE or A= 











i= : ,  i=1,2,...,n, ôE"? we can express each entry of table 2 in 
Toart e terms of V or ô. For example, in the second 
Table 1 
E A à V hD 
E E 1+A ise aos ph? 
2 l-V 
x 8? V AD 
8 EE A — Yo; ^^ 2smh(hD/2) 
(1+A) (1- V) 
MET A _ ô? s bt. 
V 1-E E EK V ENT 
hD logE log(1 + A) 2arcsinh(8/2) — log(1— V) hD 
EV? E-2 1+A/2 l= V/2 
u EE EE oe. (12-82/4) 7? punted cosh(hD /2) 
(1+ A) (1— V) 
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Table 2. Difference Table 


Af; 
fa A’ fr 
Af- Af- 
Jo ATI, A* f. 
Afo Afa 
fi A* fo 
Af, 


column, Af, is equal to Vf,,, and óf,,,. If 
f(x) is a polynomial of degree k, then Af(x) is 
a polynomial of degree k — 1, A* f(x) is a con- 
stant, and A**! f(x) is zero. Therefore, looking 
at the difference table, we can find the degree 
of an interpolation polynomial that can satis- 
factorily approximate f(x). It should also be 
noted that, if the computation of each entry of 
the table is carried out with a finite number 
of significant figures, the error in the values 
fo. fi; -<--> 18 multiplied by binomial coeffi- 
cients corresponding to the location in the 
difference table. 

The following are interpolation formulas 
for which the difference table is used. Suppose 
that we want to interpolate the value f, at 
X — xg t ph. Then from f,= E” fo —(1-- A" fo, 
we have Newton's forward interpolation 
formula: 


p(p — 1) 


fp foc pAfo + EI 


A* fo 





Pp Dp—2) 


3 
31 A" fo .... 


Similarly, from f,— E” fo —(1— V) Pio, we have 
Newton's backward interpolation formula: 


p(p- 1) 
2! 





fp fot Piot Vh 
P(p+ 1)\(p+2) 


3 
31 Vë EE .... 


We get these formulas by starting at fọ and 
proceeding downward to the right or upward 
to the right. On the other hand, by proceeding 
in a zigzag manner, downward to the right, 
then upward to the right, then again down- 
ward to the right, etc., we get another formula, 
called Gauss's forward interpolation formula: 


p(p—1) 
2! 





fp fo pAfo + A’ f 


e 


Similarly, we have Gauss's backward interpola- 
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(pc 1)p 
2! 





fp Jo tPA- + Afa 
(p+1)p(p—!) 


3 
z A3 deus 


In addition to these formulas, there are several 
by Everett, Bessel, Stirling, and others, which 
are essentially equivalent to the Lagrange 
interpolation polynomial that uses the same 
tabular points, although the representations 
are different. 


D. Interpolation by Divided Differences 


For points located at unequal intervals, the 
values of the interpolation polynomials can 
also be expressed in terms of divided differ- 
ences, defined successively as 

£-f i- 
dunt s. 


DEE 





The divided difference of order k defined as 
above can be expressed as 


Jan. A Ds / 


Amt: Oe lea 


The Lagrange interpolation polynomial of 

degree n is 

L(x) = fo +(x — xoMfo1-- (x — Xo) (x — x1) fora 
+... F(X = Xo) (x — xi)... (X Xn-1)fo1...n> 


with the error given by 


I(x) L(x) = (x — xg) (x — x1) ...(x — XW) fo12..2 


where f,,;. ,, means that the divided dif- 
ference of first order is calculated from x and 
f(x) instead of from x; and f. 


E. Hermite Interpolation 


The polynomial H(x) of degree 2n 4- 1 that 
satisfies not only H(x)= f(x) but also H'(x) — 
f (x) at x 2 xg, x4, ..., x, 1s called the Hermite 
interpolation polynomial. The Hermite inter- 
polation polynomial is 


H= Y. (1— 20696 x) } POC) 


+ Y (=a), 


where I(x) is the Lagrange interpolation coef- 
ficient. The deviation of H(x) from f(x) is given 
by f(x) - H(x) = f ?(6) IP (x)/(2n)!, where € lies 
between the maximum and the minimum of 

Xo OD qu eese 
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F. Spline Interpolation 


Let the interval (—0o, oo) be divided into n+ 1 
subintervals not necessarily of equal length by 
n knots xy = —o0 € x, € X4 € ... «X, « Kat = 
oo. A function S(x) which coincides with a 
polynomial of degree at most m on each sub- 
interval (xj, x;,,), i= 1,2, ..., n, and has con- 
tinuous derivatives up to order m — 1 at each 
knot is called a spline, i.e., a spline S(x) is a 
CH"! function. If S(x;) = f(x), i=0, 1, 2,...,n, 
then S(x) is an interpolation function. Inter- 
polation by means of a spline is called spline 
interpolation. The term “spline” derives from 
the name of an instrument with which drafts- 
men fair a curve through points. 

If a spline coincides with a polynomial of 
degree 2k — 1 on each subinterval and with a 
polynomial of degree k — 1 on (—00, x, ] and 
[x,, 00), it is called a natural spline. If the data 
y; is given at each knot x,, i— 1,2, ..., n, and 
also an integer k not larger than n is given, 
then the spline S(x) of degree 2k — 1 that satis- 
fies S(x) =y; i— 1,2, ..., n, is uniquely deter- 
mined. The spline that is most frequently used 
in practical problems is the natural cubic 
spline (k — 2). 

Let f(x) be an arbitrary function of C* (k « 
n) satisfying f(x;)— y; at each knot x,, i— 1,2, 
...,4. Then in any interval [a,b] which in- 
cludes x,, x5, ...,x 


sn 


b b 
| [S®(x)]* dx < | Ex] dx 
holds, where S(x) is the natural spline of degree 
2k — 1 that satisfies S(xj)) — yj, i= 1,2, ..., n. This 
inequality is called the minimum norm prop- 
erty, and, in particular, the minimum curva- 
ture property when k —2, of the natural spline. 


G. Polynomial Interpolation on Triangles 


Polynomial interpolation on a triangular 
region is used in the finite element method. The 
complete polynomial of degree m, 


Py) YL ayy xy”, 

: utv=0 
can be used as an interpolation polynomial on 
a triangular region. The number of the coef- 
ficients a,, is (m+ 1)(m + 2)/2. On the other 
hand, if we divide each side of the triangle 
into m equal parts and join the points of sub- 
division by lines parallel to the sides of the 
triangle, then we have m? congruent small 
triangles the number of whose vertices is (m+ 
1)(m 4- 2)/2. Therefore if we choose these ver- 
tices as the points of interpolation, we can 
determine the coefficients a,, uniquely from 
the data given at these points. This is the La- 
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grange interpolation polynomial of degree m 
on a triangle. 

The interpolation polynomial P, (x, y) of 
degree m= 1 that is uniquely determined from 
the data u,, uz, u3 given at the vertices (x4, y1) 
(X5. Y2), (x3, Y3), respectively, can be expressed 
as 


P,(x, y) =U ën (x, y) + u5 $566 y) + u3 $366 y), 


where 
11 1 Ji 1 1 
E(x, yJ=—5|x x; x| S-zix, x, xj. 
Y We Xk Au Y2 3 


(i,j,k) is any cyclic permutation of (1, 2, 3), and 
the absolute value of S is equal to the area of 
the triangle. č;(x, y) is a polynomial of degree 1 
satisfying 


l; j= i, 
ei. yj = e Ja 
and is called the shape function of degree 1 on 
the triangle. 

If we choose the vertices together with the 
midpoints of the sides as the points of inter- 
polation, we can determine the six coefficients 
a,, of the interpolation polynomial P, (x, y) of 
degree 2 uniquely, and the polynomial can be 
written as 


6 
Pis y) - wéien 


where £9, i=1,2,...,6, are the shape func- 
tions of degree 2 on the triangle; these can be 
expressed in terms of the shape functions of 
degree 1 as follows: 


Hx y)- 4505-1,  i-1,2,3, 
EN x, y) 7 4616. 6» = 46, &,, = 48, 6,. 


The Lagrange interpolation polynomial 
P„(x, y) given above is defined locally on each 
triangle in the finite element method, and the 
function over the whole triangular network is 
constructed by connecting these Lagrange 
interpolation polynomials. This piecewise 
polynomial function of degree m over the 
whole network is evidently of class C°, and 
has no continuity of derivatives of higher order 
in general. A C'-function, for example, can 
be obtained from the complete polynomial 
PG, y) by determining the 21 coefficients 
from the values and the derivatives up to 
order 2 at the vertices and the normal deriva- 
tives at the midpoints of the sides. Alterna- 
tively, we can impose three conditions so that 
the normal derivative reduces to a cubic 
along each side instead of imposing that it 
coincide with specified data at the midpoint; 
then we obtain another C'-function. A variety 
of interpolation functions are known for 
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other elementary regions, such as rectangles 
or tetrahedra. 
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A. General Remarks 


In 1926 M. Riesz proved that if T is a bounded 
linear operator L, (0) L, (Q) with norm No 
and L, (0) L, (Q') with norm N,, 1 € po, 

Pi» do, qı & ©, simultaneously, then it is a 
bounded linear operator L,(Q) L,(Q") with 
norm < NA ?N? whenever 


1/p —(1 — O/po tip, (1) 
(ig — (1 — Oger 0/q,, (2) 


for 0 « 0 « 1, provided that qz p. In 1939 G. O. 
Thorin removed the restriction q > p by devis- 
ing a proof based on function theory (Riesz- 
Thorin theorem). Meanwhile, J. Marcinkie- 
wicz (1939) announced that the boundedness 
of T: L,(Q)> L,(Q’) holds for a quasilinear 
operator T under weak type assumptions (— 
Section E (2)). 

From 1959 to 1964, J.-L. Lions, A. P. Cal- 
derón, J. Peetre, and others extended these 
results to linear operators from a couple of 
Banach spaces to another couple. The inter- 
polation methods provide a powerful and 
often essential tool in various fields of mathe- 
matical analysis where estimates of operators 
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play a central role, such as harmonic analysis 
[4, 8, 9], numerical analysis, approximation 
theory [7], and the theory of partial differen- 
tial equations [2]. 

A compatible couple is a pair ( Xo, X,} of 
Banach spaces (or more general topological 
linear spaces) that are continuously embedded 
in a Hausdorff topological linear space. Let 
( Yo, Yi] be another compatible couple. A 
linear operator T': Xy 4- X, >Y, + Y, is said to 
be continuous 1 Ko, X,) —^| Yo, Yı} with norm 
(Ns, Ni} if for each [—0, 1, T: X, Y, is con- 
tinuous linear with norm J). 

An interpolation method is a ‘functor which 
assigns to each compatible couple {Xo, X,} of 
Banach spaces a Banach space X with Kal) 
X,c X c Kat X, such that every continuous 
linear operator T: ( X;, X,] —(Y,, Yı} induces 
a continuous linear operator T: X > Y. X is 
called an interpolation space of { Xo, X,}. There 
are two important types of interpolation 
methods, the complex method (due to Cal- 
derón [5], S. G. Krein, Lions) and the real 
method (due to E. Gagliardo, Lions, Lions and 
Peetre [6], Peetre). These methods generalize 
the classical results of Riesz and Thorin and of 
Marcinkiewicz, respectively. 


B. The Complex Interpolation Method 


In this section ( Xs, X,} is assumed to be a 
compatible couple of complex Banach spaces. 
Let F(X,, X;) be the Banach space of all 
bounded continuous functions f(D) ¢ = € + in, 
on the strip Ox £x 1 with values in Xo+ X, 
holomorphic in 0 « č « I, and such that for 
each /— 0, 1, f(I -- ip) is a continuous and 
bounded X,-valued function. | f lous, x, = 
max (sup [| frr] x, sup [i f(1 + in)|lx,} is 

the norm. The complex interpolation space 

[ Xo, X, ]5, 0 « 0 « 1, is defined to be the Banach 
space of values f(0) of fe F(X,, X,) with the 
norm "elt, x A z infi f lp, xl x = f(0)}. 
Xo N X, is dense in [ Xs, Els, 

Let ( X, X,} and { Yo, Y,} be two compatible 
couples, and let T: X9, X;}{Yo, Yı} bea 
bounded linear operator with norm {No, N;}. 
Then T':[ X,, X,]Je—>LYo, Y; ]a is bounded linear 
with norm < Nd ?N? (interpolation theorem). 

If a holomorphic family | 7), Oc Rec « t, of 
linear operators, acting in Xo + X, into Y, + Y,, 
fulfills a certain boundedness requirement, 
then T, induces a bounded linear operator 
[ Xo. X, ]5 [ Yo, Vide (E. Stein [9]). 


C. The Real Interpolation Method 


The method of this section applies also to 
compatible couples of tquasinormed spaces (P. 
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Krée (1967)). If | Xo, X,} is a compatible couple 
of Banach spaces (resp. quasinormed spaces), 
then Xo N X, and X, + X, are Banach spaces 
(resp. quasinormed spaces) under the norms 
J(t,x)=max{||X|ly,, Tele 1 and K(t, x)= 

inf [xol y, + tllx; ly, [X 9 xo +x,}, respectively, 
for any t0. Denote by L*, 0 « px oo, the L, 
space on (0, oo) relative to the measure dt/t. 
Then the real interpolation space (Xo, X,), p, 
0«0«1,1xpxoo (resp. Oc px oo), is defined 
to be the Banach space (resp. quasinormed 
space) of all x e X; + X, such that t ?K(t, x)e 
L5 with the norm |xli, x, — |t ^K (t, x)lz 
(Peetre's K-method). X, X, is dense in 

(Xo, X1)o,p if p< oo. The continuous embed- 
ding (Xo, X;), p c (Xo, X1)o,q holds for p< 
qoo. 

If T: (X$, X] 5 (Ys, Yı} is a bounded 
linear operator with norm ( No, N,], then 
T:(Xo, X4), , 2(Yo, Y)o,,, 00 « l, px oo, is 
bounded linear with norm < Nd *N? (inter- 
polation theorem). 

The linearity or boundedness requirements 
of T can be weakened considerably (Krée, T. 
Holmstedt, H. Komatsu (1981)). 

There are several equivalent definitions of 
the real interpolation spaces for compatible 
couples ( X,, X,} of Banach spaces. The J- 
method gives (Xo, X, Jj, p 00 « 1, 1 « p& oo, 
which is the space of means x= fẹ u(t)dt/t with 
the norm [xl xog, = inf, [t *J( ut). 
where u(t) are strongly measurable functions 
on (0, oo) with values in X; N X, such that 
t *J(t,u(t))e Lt. (Xo, Xy), p — (Xo, X, Je, holds 
with equivalent norms. A discrete version of 
the J-method applies also to couples { Xo, X,] 
of fquasi-Banach spaces. 


D. The Reiteration Theorem 


Let 1 X, X,) be a compatible couple of (quasi-) 
Banach spaces. A (quasi-) Banach space X 
is said to be of class Ky(Xp, X,), 0<@<1, 

if Kall X, c X c Xo +X, with continuous 
embeddings and if there are constants Cx 
and C, such that Kit, x) < Cyt?||xl||y, xe X, 
and t*|x||, < C,J(t, x), xe Xo X,. The class 
Kaka X,) contains (Xo, X, Jp ,, 0 c 0« 1, 

O0 « p € oo, and, for Banach spaces Ka, X}, 

[ Xo, X, Jo. If Y, is of class K,(Xo, X,) and Y, 
of class KX, X,) and ifO x a « f <1, then 
(Y, Ypa. p — (Xo. Xiha. p 0€42« 1, 0« p oo, 
0(4) — (1 —A)a+ AP, with equivalent norms 
(reiteration theorem). 

Set Y, Z[ X;, X, ], and Y; — [ Xs, X,], for 
Banach spaces X, and X,. If XN X, is 
dense in Xy, X, and KI) Ke, then LY,, Y]; — 
[ Xo. Xi laa O € A& 1, with equal norms (re- 
iteration theorem). 
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E. Examples and Applications 


(1) Let L = L,(Q, ji) be the L,-space on a ø- 
finite measure space (Q, ul, Then [Los Lp lo 
Lp, holds with equal norms for 1/p, — (1 — 0)/po 
*0/p,, 0 «0 «1, 1 € po, p, & oc. Hence follows 
the Riesz-Thorin theorem. In particular, the 
Fourier transform maps L,(R"), 1 <p<2, 
continuously into L,.(R"), p' = p/(p — 1) (the 
Hausdorff-Young inequality). The convo- 
lution operator Kx, Ke L,(R"), 1 &r « oo, 

is bounded from L,(R") into L,(R") with norm 
SIIK}, if p> 1, 1/g—1/p-- 1/gr — 12:0 (Young's 
inequality). 

For the *Hardy space H, (R"), [L,(R"), 

H, (R^) ], — L,(R"), 1/4 —(1 —0)/p-- 0, 1 « px oo, 
0 « 0 « 1 (Stein and C. Fefferman). 

(2) Recall that the "Lorentz space La 
L4, u), 0 € p, q € oo, is the tquasi-Banach 
space of all measurable functions f with 
I File. = Wf Stil 00, where f *(t) is the 
trearrangement of f(c), o € Q. Lac Lp and 
Lat Lon for der H l<p<o,1l<q< 
o6, then Lip, coincides with the Banach 
space (Lo L,),;,,, under the equivalent norm 
cP? fé f*(s) del: More generally, if 0< po, 
Pı <%, PoF Pi, 0<0< l, 1/pe=(1 —0)/Po + 
0/p,,0« 49, q,, q & oo, then (Loa L 
= Lipa With equivalent norms. 

Let T be an operator which maps a space of 
measurable functions on (Q, ul into another 
on (€, 1’). The inequality || Tf la, SNIS, 
holds if and only if w {w EQ |I Tf(w')|>s}< 
(NIT, Aen, $2» 0. Then T is said to be of weak 
type (p, q). 

Suppose that T is quasilinear, i.e., that T(f 
+g) is uniquely determined whenever Tf and 
Tg are defined and that | T( f A g)(o»)| € 
K1|Tf(o)| 4- | Tg(o»)|) ae. holds with a con- 
stant K independent of f and g. The interpola- 
tion theorem then holds. Therefore, if a quasi- 
linear operator T is of weak type (po, qo) and 
(P1. 41). do #41, then T is of type (p, q), i.e., 
T:L,(Q0) L, (€?) is bounded for p, q satisfying 
(1). (2), and q > p (Marcinkiewicz's theorem). 
When po £ p,, the same conclusion is obtained 
if || Tf la, o € Nf Ilo, e [—0, 1, for some r, 
(R. A. Hunt). 

The tHilbert transform and the tCalderon- 
Zygmund singular integral operators are of 
weak type (1, 1) and of type (2, 2). Hence it 
follows from these facts together with dual- 
ity that they are of type (p, p) for 1 « p « oo 
[4, 8, 9]. 

The convolution operator K «, K € Le, ,,(R"), 
| «r « oo, is of type (p,q) if p» 1, 1/g=1/p4+1/r 
— 1» 0 (R. O'Neil) In particular, the convolu- 
tion with |x|", 0 «a «n, on R” is of type (p, al 
for 1 «p«n/z and 1/q=1/p—«/n (the Hardy- 
Littlewood-Sobolev inequality). 

For the "Hardy spaces H, = H,(R") and the 


(p,.4,0. 
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*John-Nirenberg space BMO = BMO(R"), 
(Lies Hp,Jo.g=(BMO, Hy Jo.q=H, and 
(Hp, Hp Jo, p, = Hy, if 0<0<1,0<po, p, « oo, 
1/q — 0/p, and 1/pg - (1 —0)/po  0/p, (S. Igari, 
N. Riviére, Y. Sagher, Fefferman, R. Hanks). 

A 'Besov space B$ ,(Q) is understood most 
naturally as the real interpolation space 
(LQ), W3 (Q))sm,4 of the Lebesgue space L,(Q) 
and the 'Sobolev space W,"(Q) [2, 3]. 

(3) Let A be a closed linear operator in a 
Banach space X such that the resolvent (t 4- 
A) ! exists for t>0 and satisfies ||t(t + 4) !| < 
M. If D(A") denotes the domain of the inte- 
gral mth power A" of A equipped with the 
graph norm, then D^(A) - (X, D(A")),,, ,. 
0«c «m, 1 <p< oo, coincides with the space 
of xe X such that |t"(A(t- A) "xlyeL? 
and is independent of m. If — A generates an 
tequicontinuous semigroup T, of class (C?) 
(resp. a tholomorphic semigroup T; bounded 
on a sector), then D7(A) consists also of all 
x€ X such that || *(1 — T)"x||y € LF (resp. 
|t **" A"Tx| «e L*). There are similar charac- 
terizations of elements in (X, N D(A7)), p for 
a commutative family {A,,...,A,} of such 
operators (Lions and Peetre, P. Grisvard, 

P. L. Butzer and H. Berens, Komatsu, T. 
Muramatu). 

When A = —A or A;=6/6x; in R" or ina 
suitable domain Q in R”, these results give 
various equivalent characterizations of the 
functions in Besov spaces B5 ,(Q). The Sobolev 
embedding theorem for Besov spaces can also 
be proved from this point of view (Grisvard, 
Peetre, A. Yoshikawa, Komatsu). 

The space D7(A) is closely connected with 
the domains D(A’) of *fractional powers A* 
of A. Di(A) c D(A*) c DS (A), o - Rea O. If 
0 c Refi «c, then D;(4) — (xe D(A^)| A^ xe 
D; R*5(4)). If the pure imaginary powers A" 
are locally uniformly bounded, then D(A?) 

[ X, D(A”) reams 0 Rea «m. 


F. Duality 


Suppose that { Xo, X,} is a compatible couple 
of Banach spaces such that XN X, is dense 
both in X, and in X,. Denote the dual of X by 
X’. If one of X, and X, is reflexive, then so are 
[ Xo, Xi ]o and (X9, X),,, 00 cl, 1& p 

oo. Furthermore, [ Xo, X, ]; = [ Xs, Xi] and 
(Xo, XJ, m Ure, X1 Jo, p. P DIS — 1), with 
equivalent norms (Calderón, Lions and Peetre, 
H. Morimoto). 
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A. Introduction 


The tLebesgue measure in the Euclidean 
space R" is invariant under Euclidean motion 
(translation and rotation), and the measure 
du(A) — d4/4 on the half-line of positive num- 
bers is invariant under multiplication. If we 
define usual tspherical coordinates (0, ol on 
the unit sphere S? in R?, the measure du(w) 
= sin 6d0 dq (c — (0, ¢)) on S? is invariant 
under rotation. More generally, if we define 
the spherical coordinates (0,,...,0, ol on 
the unit sphere S” in R"*!, the measure 


du(c) — sin"! 0, sin" ? 0, ...sin0, , 40, d0, ... 
d, do 


on S" is invariant under rotation, where the 
spherical coordinates are related to the rectan- 
gular coordinates (x,, x5, ..., Ka x,,,) in RX) 
as follows: 


x, — sin, sin 0, ...sin0,_, cos o, 
X5 —Ssin ĝ; sin 0, ... sin, ,sin q, 


X4 =sin 0; ...sin0, ,cosÓ,. ,, 


X,-1 —SinO, sin 0, cos 03, 
x, =sin 0, cos 05, 
X444 — C080, 


(0€ 0, m, v—1,...,n— 1; 0x o x2n). 
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The notion of measures invariant under cer- 
tain transformations is generalized to that of 
invariant measures, defined in the following 

section. 


B. Definitions 


Let u be a tmeasure defined on a fo-algebra B 
of subsets of a space X, and G be a transfor- 
mation group acting on X in such a way that 
sde for any AEB and seG. For every seG, 
define a measure y(s)u on B by (y(s)u)(sA) = 
u(A). If y(s)u = u for all se G, then u is called 
an invariant measure with respect to G (or G- 
invariant measure). 

We consider the case where X is a locally 
compact Hausdorff space and G is a *topo- 
logical transformation group of X. We further 
suppose that $ is the smallest o-algebra con- 
taining the family © of all compact subsets 
of X and that u(K) « oo for all Ke& (— 270 
Measure Theory). Let Co( X) be the space 
of all real-valued continuous functions with 
compact tsupport defined on X. For example, 
if X is an foriented tRiemannian manifold 
and o is the tvolume element associated with 
the Riemannian metric on X, there exists a 
unique measure u on X such that 


| fodad = | fo (1) 


for every f(x)e Co(X). This measure p is invar- 
iant under the group G of isometries of X. In 
the case of a nonorientable X, a G-invariant 
measure can also be defined from the tRie- 
mannian metric. 

In the following, we consider G-invariant 
measures on thomogeneous spaces X ofa 
locally compact Hausdorff topological group 
(abbreviated to locally compact group) G. 


C. Haar Measures 


Most fundamental is the case in which G is 
locally compact and X =G, with sx (resp. xs~*) 
defined by the group multiplication law. In 
this case, a nonzero G-invariant measure on 
G is called a left- (resp. right-) invariant Haar 
measure on G. On every locally compact 
group, there exists a left- (right-) invariant 
Haar measure, which is unique up to a posi- 
tive multiplicative constant (Haar's theorem). 
For example, Haar measures on the additive 
group R of real numbers and the additive 
group R” are the usual tLebesgue measures. A 
Haar measure u on the multiplicative group 
R* of positive real numbers is given by 


| TOU= | Jods 


0 D 
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For an n-dimensional fLie group G, a left- 
invariant Haar measure p is defined by for- 
mula (1) with a left-invariant *differential form 
w of degree n. 

A Haar measure 4 on a locally compact 
group G is tregular in the following sense. If © 
is the set of all open subsets of G, then for 
every AEX, we have 


p(A)=sup{u(K)|Ke@, K c A} 
=inf{(U)|UEBND, ACU}, 


For VeBND (U z Ø), we have p(U)>0, and 
H(A) « oo for compact A. The measure p(s) of 
one point s is >0 if and only if G is tdiscrete. 
The total measure u(G) of G is finite if and 
only if G 1s compact. 


D. Modular Functions 


Let ube a left-invariant Haar measure on a 
locally compact group G, and define the mea- 
sure ó(s)u by (Ó(s)u)(A) = u(As) for every se 

G. Since ó(s)u is also a left-invariant Haar 
measure, there exists a positive real number 
A(s) such that 6(s) = A(s)u, by virtue of the 
uniqueness of the left-invariant Haar measure. 
The function A= Ae on G is called the modular 
function of G. For an 'integrable function on 

G with respect to u, we have 


| f(xs)du(x) — A(s) " | F(x) du(x). 
G G 


| f(x "Wuel du(x)= | fo9duoy. 
G G 


If v is a right-invariant Haar measure on G, we 
have the formulas 


| f exdito-ac | I(x) dv(x), 
G G 


| Jamhad = | f(x) dv(x). 
G G 


Moreover, AT! u is a right-invariant Haar 
measure, while Av is a left-invariant Haar 
measure. 

The modular function A of G is a continu- 
ous homomorphism of G into the multipli- 
cative group R* of positive real numbers. If 
the modular function A of G is equal to the 
constant 1, i.e., if a left-invariant Haar measure 
is also right-invariant, G is said to be uni- 
modular. G is unimodular if G is compact, 
commutative, or discrete. If G is a *Lie group, 
we have A(s)=|det Ad(s) ! , where s Ad(s) is 
the *adjoint representation. In particular, G is 
unimodular if G is a tsemisimple Lie group, 

a connected tnilpotent Lie group, or a Lie 
group for which Ad G is compact. However, 
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the group 7(n;R) of right triangular matrices 
(n2 1) is not unimodular. 


E. Product Measures 


Let {G,},<4 be a family of locally compact 
groups, and let p, be a left-invariant Haar 
measure on G, for every «e A. Suppose that 
there exists a finite subset B of A such that G, 
is compact and y,(G,)=1 for xe A— B. The 
product measure u= [T,. 4 44 is then a left- 
invariant Haar measure on the *Cartesian 
product G= [L,. 4 G,, which is also a locally 
compact group. Moreover, if A, is the modular 
function of G,, then Ac(x) 2 TL. 4 A,(x,) for 


x= (Xe A s 


F. Product Formula 


Let H, L be two closed subgroups of a locally 
compact group G, and suppose that Q=HL 
contains a neighborhood V of e in G. This 
means that Q is an open subset of G. If we put 
D={(s,s)|se HN L}, then the mapping (s, t) 
st ^ of H x L into Q induces a one-to-one 
continuous mapping q of the quotient space 
H x L/D onto Q. Suppose that o is a homeo- 
morphism. This is the case, for example, if G 
is *paracompact. Furthermore, if H (1L is 
compact, we have the product formula 


| f(o)du(o) 


=a | | FAD AGODA L(Y dity(h) du, (l), 
HxL 


where 4, ug, Hg denote left-invariant Haar 
measures on G, H, L, respectively, and a>0 is 
a constant independent of f. 


G. Weil Measures 


If A is a measurable subset with respect to a 
left-invariant Haar measure u and u(A) 0, 
then A~'A={s~'t|s, te A] is a neighborhood 
of the identity element of G, and such subsets 
form a base for the neighborhood system of 
the identity. This shows that the topology of a 
locally compact group is determined by its 
Haar measure. Conversely, we shall consider 
the definition of a topology in an abstract 
group G with a measure u. 

Let u be a ta-finite measure defined on a 
*c-additive family 88 in G, such that sde for 
Ae and seG. n is called a Weil measure if it 
satisfies the following two conditions: (W1) 
u(sA)= u(A); (W2) if f(x) is B-measurable, 
then f(y ! x) is B x 8-measurable. 


225H 
Invariant Measures 


If a Weil measure „#0 exists in a group 
G, then (A! A|u(A)» 0] forms a base for 
the neighborhood system of the identity ele- 
ment of a topology, which makes G a locally 
totally bounded topological group. If for 
every setz there exists an Ae such that 
L(A MSA) « u(A) « oo, u(4) 0, then G isa 
Hausdorff space. In this case the tcompletion 
G of G is a locally compact group, and for a 
suitable left-invariant Haar measure jt on G, 
we have A= ANGE and u(4)— ii(A) for 
every Ae% (the smallest o-additive family con- 
taining the family of all compact subsets of G). 


H. Relatively Invariant Measures 


Let G be a transformation group acting on a 
set X. A measure p on X is said to be a rela- 
tively invariant measure with respect to G if for 
every Setz, y(s) is proportional ou. i.e., 
Y()u- x() Tv (x(s)e Rt). If uO, x(s) is 
uniquely determined by s, and s y(s) is a 
continuous homomorphism from G into the 
multiplicative group R* of positive real 
numbers. We call y the multiplicator of the 
relative invariant measure. 

We now consider relatively invariant mea- 
sures with respect to a locally compact group 
G on the *quotient space G/H of G by a closed 
subgroup H Let u, f be left-invariant Haar 
measures on G, H, respectively, and let x — x* 
be the canonical mapping of G onto G/H. For 
any measure 4 on G/H, there exists a unique 
measure 4” on G satisfying the condition 


| (| $ unner: | fGo942* (x) 
G/H H G 


for every continuous function f with com- 
pact support on G. For every he H, we have 
ôlh)à* =A,,(h)A*. Conversely, for a measure 

v on G such that 6(h)v=A,,(h)v for every heH, 
there exists a unique measure 4 on G/H such 
that 4* =v. This measure 4 is called the quo- 
tient measure of v by f and is denoted by A= 
v/B. For a continuous homomorphism y of G 
into the multiplicative group R* of positivc 
real numbers, a necessary and sufficient condi- 
tion for the existence of a not identically zero, 
relatively invariant measure on G/H with the 
multiplicator y is that y(h)=A,(h)/A,(h) for 
every he H. If this condition is satisfied, the 
relatively invariant measure on G/H with 
multiplicator y is unique up to a multiplicative 
constant and is given by the quotient measure 


. v—(xu)/B of yu by p. In particular, for the exis- 


tence of a G-invariant measure on G/H, it is 
necessary and sufficient that the modular func- 
tions Ag and Aj coincide on H. Hence, if H is 
compact or if G and H are unimodular, there 
exists a G-invariant measure on G/H. 
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I. Weyl’s Integral Formula 


Let G be a compact connected *semisimple 

Lie group and H a tCartan subgroup (maxi- 
mal torus) of G. Then a Haar measure u on G 
can be expressed by means of a Haar measure 
P on H and a G-invariant measure 4 on G/H. 
If u, p, A are all normalized to be of total mea- 
sure 1, we have the following formula for every 
continuous function f on G: 


| f(g)du(g) 
G 
1 


-1f fighg !)J(h)d4(g*) dph) 
W JH JGH 

(Weyl’s integral formula), where w is the order 
of the tWeyl group of G and J (h) is given by 


2 


I] (ez? —e 0/2) : 
aeP 


J(exp X)= 





with P the set of all tpositive roots « of G with 
respect to H and X an arbitrary element of the 
"Lie algebra of H For an element h of H, the 
element X with h=exp X is not unique, but 
the function J is a singlc-valued function. A 
similar formula is valid on a symmetric Rie- 
mannian manifold. Weyl’s integral formula 
can also be generalized to the case of noncom- 
pact semisimple Lie groups. However, it is 
then necessary to replace the right-hand side 
by a sum extended over a system of repre- 
sentatives of mutually nonconjugate Cartan 
subgroups. 


J. Quasi-Invariant Measure 


suppose that a group G acts on a space X in 
such a way that 54€$8 for any Ae and seG, 
where 38 is a o-additive family of subsets of X. 
A measure yp defined on 38 is called a quasi- 
invariant measure with respect to G if the mea- 
sures u and y(s)u (Section A) are equivalent 
for every se G. Here, two measures 4 and u 
defined on 38 are equivalent if A= gy (this 
formula means A(E) — f; 9x) du(x) for any 
Fe) for some measurable function o(x) 
which is >0 almost everywhere with respect 
to u and p-integrable on every Ae% such that 
p(A) « oo. 

We now consider quasi-invariant measures 
with respect to a locally compact group G on 
a quotient space G/H of G by a closed sub- 
group H. There are always quasi-invariant 
measures on G/H with respect to G, and they 
are all mutually equivalent. They can be con- 
structed as follows. There exists a positive 
continuous function p on G such that p(gh)= 
Ayh) Achy p(g) for ge G, he H. Then the 
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quotient measure A=(py)/f is a nonzero quasi- 
invariant measure on G/H with respect to G if 
l, B are Haar measures on G, H, respectively, 
and it holds that dA(sx)— p(sg)p(g) ! dA(x) for 
xegHeG/H and set If G is a Lie group, we 
can take a function p of class C^. If X is an 
infinite-dimensional !locally convex topo- 
logical vector space over R, there exists no 
tø-finite Borel measure on X that is quasi- 
invariant with respect to translations by the 
elements of X [6]. 
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A. The General Case 


Let R be a Àcommutative ring. We say that a 
group G acts on R if (i) each element o of G 
defines an fautomorphism f —o f of R, and (ii) 
c (1f) 2 (ox) f for any e, «eG, f e R. In this case, 
an element f of R is said to be G-invariant (or 
simply invariant) if of = f for any ce G. An 
element f is called (G-)semi-invariant if for 
each o in G there is an invariant a(a) such that 
of =a(o)f, that is, if f is invariant up to an 
invariant multiplier depending on c. A semi- 
invariant may also be called a relative invar- 
iant, and an invariant may be called an ab- 
solute invariant. The correspondence o —a(o) 
mod(0: f) ((0: £F) - (xe R| xf =0}) is a trepre- 
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sentation of degree 1 of G, and a(o) ts called 
the multiplier of the semi-invariant f, or the 
character defined by f. 


B. Invariants of Matrix Groups 


Let K be a commutative ring with a unity 
element. Then we can consider a matrix group 
(or matric group) G over K, i.e., a subgroup of 
the group of n x n invertible matrices over K; 
this latter group is called the general linear 
group of degree n over K and is denoted by 
GL(n, K) (— 60 Classical Groups). Assume 
that R is a commutative ring generated by 

Xj, -.., X, over K and an action of the group G 
on R is defined such that ((ox,,...,0x,)— 
6'(x,, ..., Xn) (‘means the *transpose of a 
matrix). In this case, we say that G acts on R 
as a matrix group. If K is a field, then the 
smallest *algebraic group G (in GL(n, K )) 
containing G acts on R as a matrix group, and 
an element f of R is G-invariant if and only if 
f is G-invariant, and similarly for semi-in- 
variants. These results can be generalized to 
the case where K is not a field. 

A thomomorphism p of a matrix group G 
(c GL(n, K )) into GL(m, K) is called a rational 
representation of G if there exist rational func- 
tions Px (1 <k, L& m) in n? variables x;; (1 <i, 
j&n) with coefficients in K such that p((o;;)) 
=(@,,(6;;)) for all (o;;)e G. Assume that p is a 
rational representation of a matrix group G 
and p(G) acts on a ring R as a matrix group. 
Then we have an action of G on R defined by 
of =(po)f (c eG, f e R), called the rational 
action defined by p. If the following condi- 
tion Is satisfied, then the action is called semi- 
reductive (or geometrically reductive): If N — 

f, K t... f, K is a G-tadmissible module 

(fi, -JE R) and if fy mod N (fo e R) is G- 
invariant, then there is a *homogeneous form h 
in fo, ..., f, of positive degree with coefficients 
in K such that h is monic in fj and is G- 
invariant. This action is called reductive (or 
linearly reductive) if h can always be chosen to 
be a linear form. 

(1) Rational actions of a matrix group G in 
each of the following three cases are all reduc- 
tive. (1) K 1s either the real number field or the 
complex number field, and G is a dense subset 
of a tLie group (c GL(n, K)) which is either 
'semisimple or tcompact. (ii) K is a field of 
characteristic 0, and letting G be the smallest 
algebraic group containing G, the tradical of G 
is a ttorus group (— 13 Algebraic Groups). (iii) 
K is a field of characteristic p40, and G (as in 
(11)) contains a torus group T of finite tindex 
that is relatively prime to p. 

(2) Any action of a finite group is a semire- 
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ductive action. If we omit the condition that 
the characteristic of K is O in (it), then rational 
actions of G are semireductive. This is known 
as Mumford's conjecture and was proved by 
W. J. Haboush. 

(3) Assume that @ is a G-tadmissible homo- 
morphism of the ring R onto another ring R'. 
We denote the sets of G-invariants in R and R' 
by IG (R) and [,(R’), respectively. 

If the actions of G are reductive, then 
(i) e(5(R)) = Lei Rtl h;€ I5 (R) implies 
(X;h; RYO IG (R) 9 X h; I; (R); and (iit) if K is 
*Noetherian, then /,(R) is finitely generated 
over K. 

If rational actions of G are semireductive, 
then (i) for each element a of J,(R’), there is a 
natural number t such that a'e p(/,(R)), and 
hence /,,(R’) is tintegral over o(Ig;(R)); (ii) if 
h;e I (R) and f e(E; h; R)'MG(R), then a suit- 
able power f' of f is in X; h;I; (Ry; and (iii) if K 
is a tpseudogeometric ring (in particular, if K 
is a field), then I; (R) is finitely generated over 
K [3]. 

When /,(R) is generated by f,,..., f, over 
K, then fi, ..., f; are called basic invariants. 


C. Polynomial Rings 


Let p,,.... p, be matrix representations of a 
group G over a commutative ring K of re- 
spective degrees n,,...,n,. Let x) (1 <i<u, 
1<j<n,) be Xn; talgebraically independent 
elements over K. Then we define an action of 
G on the tpolynomial ring K[x{, ..., x“ ] by 
(ox), ..., 0x) = p,(a) (xf? ..., x). In this case, 
a (relative) invariant is the sum of (relative) 
invariants that are homogeneous in each 
(x), ..., x9). (Because of this fact, in some of the 
literature, a (relative) invariant means a (rela- 
tive) invariant that is homogeneous in each of 
(x, ..., x®).) On the existence of basic invar- 
iants, the following theorem is known (be- 
sides the one on (semi)reductive actions): As- 
sume that K is a field of characteristic 0, G 
is dense under the fZariski topology in an 
algebraic linear group G such that the *uni- 
potent part (G), of the radical of G is at most 
1-dimensional (these conditions hold if G is 
a I-dimensional Lie group), and that all the 
p; are rational representations; then basic 
invariants exist (R. Weitzenbóck). 
Furthermore, if G is a matrix group and 
each p; is either the tidentity map or the con- 
tragredient map A—'A ^, then the invariants 
are called vector invariants. If K is a field of 
characteristic zero, the basic invariants and a 
basis for the ideal of algebraic relations of the 
basic invariants are explicitly given in several 
cases [1,2]. 
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D. Classical Terminology 


The classical theory of invariants considers 
the following objects. Let K be a field of char- 
acteristic zero (e.g., the real number field or 
the complex number field), and let G be 

GL(n, K ). Consider a homogeneous form F 
of degree d in n variables £,, ..., £, with coeffi- 
cients in K:F—Xc, im, in (Xi,-d m, A 
— (d'/TI Dén... £i»). For each o€G, we 
define c£; by (c£,,...,06,) —(£,,..., £,))o | and 
then (ech, by F—- Ziech, ; (om; ;). Then 
the transformation 


Lol, "Leon o ef eegal 


^((6c)uo... o; +++ (OC); ig? ëch. aal 


is a "linear transformation of a ,,,. , C, ,- 
dimensional faffine space. Let us denote the 
matrix of the linear transformation by the 
same symbol [6],: 


(s (00, us Dol C... 0i, ee) 


n 


Then 9,:6 [o], is a rational representation 
of G. 

Now fix an F such that the coefficients 
Ci, ...i, are independent variables, and consider 
the action of G on the polynomial ring R — 
K[ GE ... | defined by the rational 
representation q,. If g is a relative invariant, 
then og — a(c)g with a rational representation 
c a(a). Hence there is an integer w such that 
a(c) — (det a)". Then g is called an invariant of 
weight w (note that g is an SL(n, K )-invariant); 
g is an absolute invariant if and only if w= 
0. The group G acts naturally on the ring 
R[6,, ...,£,] also. Then relative invariants in 
this case are called covariants. The weight of a 
covariant and the absolute covariant are de- 
fined as in the case of invariants. When we 
want to refer to n and d, we call the invariants 
(covariants) invariants (covariants) of n-ary 
forms of degree d: binary for n=2; ternary for 
n —3; linear forms for d= 1; quadratic forms 
for d —2, etc. 

For example, (1) if n — 2, d — 2, then the 'dis- 
criminant D —c95c59 — c?, is an invariant of 
weight 2. 

(2) Assume that d= 2 (n arbitrary), and let uj; 
be the coefficient of ¢;¢; in F, or, more pre- 
cisely, uj; uj —c, an where (i) if i=j, then 
2; —2 and the other a, are zero and (ii) if iz j, 
then a;— «;— 1 and the other a, are zero. In 
this case, D=det(u;;) is an invariant of weight 
2. 

(3) If n=2 and d=4, then g5—c469€94 — 
4€13€31 F 3025, J3 C94€22C40 — Co4C31 — 
C49€15- 2€,3€25€31 — C35 are invariants of 
respective weights 4, 6. The discriminant ts 
expressed as 2?(g3 — 2792) and is an invariant 
of weight 12. 
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(4) For d — 1, if we denote the coefficient of č; 
in F by c;, then X;c,£; is an absolute covariant 
and is also a vector invariant. 

(5) For arbitrary n and d, det(0^ FOE, GC is 
a convariant of weight 2 and is termed the 
Hessian. 

Instead of one form F, we may take a finite 
number of homogeneous forms F,, ..., F, of 
degree d,, ..., d, such that the coefficients 
cl. are algebraically independent. Then we 
consider an action of GL(n, K) on the poly- 
nomial ring K[..., c2). , , En, 6,] given 
by 9,,:0 — [o], on the coefficients of F, and 
by o—'c Toni, as in the case of one form. 
Invariants in this case are called covariants, 
and covariants containing no č; are called 
invariants. Weights, absolute covariants, and 
absolute invariants are defined similarly. The 
forms F,, ..., F, are called ground forms, and 
the covariant is called a covariant with ground 
forms F,, ..., F,. 

(6) If r— n, then the ‘Jacobian det(0F;/0£;) is 
a covariant of weight 1. 


E. Multiple Covariants 


Consider the situation described in Section C 
for the case of polynomials, and assume that K 
is a field of characteristic zero, G= GL(n, K), 
and each p; is either a 9, (d arbitrary) or the 
contragredient map x. Then these invariants 
are called multiple covariants, and weights and 
absolute multiple covariants are defined as 
before. Let s be the number of p; equal to x. 
Then the invariants and covariants of the 
preceding section correspond to the cases 
s=0 and s— 1, respectively. Now assume that 
p; K if and only if i— l, ..., s. Gram's theorem 
states: for each «= 1, ...,s, let H, be a poly- 
nomial in xf? (i7 s) homogeneous in x®, ... , Al 
for each i, and assume that the set V of com- 
mon zeros of H,, ..., H, (in the affine space of 
dimension 2;.,n;) is G-stable . Then there 
exists a finite number of absolute multiple co- 
variants c,, ...,c, such that V is the set of 

(..., a, ...) (i> s) satisfying the condition 

that (..., a2, ..., a9, ...,) isa zero point of 
Cu, C for any a with a <s. 

Since every rational action of GL(n, K) is 
reductive, the set J of absolute multiple covar- 
iants (in a fixed polynomial ring over K)is a 
finitely generated ring over K. Furthermore, 
the set of multiple covariants of a given weight 
is a finitely generated [-module. 

If we omit the assumption that K is of char- 
acteristic zero, it is difficult to define @,; this 
difficulty can be avoided by considering 
transformations of coefficients a; |. Of F = 
Xa, ići Či". Although we can give similar 
definitions in that case, the theory does not 
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proceed similarly because, for example, ra- 
tional actions of GL(n, K ) are not reductive. 


F. Invariants of Lie Groups 


Let G be a Lie group (hence K must be either 
the real number field or the complex number 
field) and p be a 'differentiable representation 
of G such that p(G) c GL(n, K), and assume 
that an action of G on K [x,, .... x,] is defined 
by p. Then, by means of an infinitesimal trans- 
formation X, corresponding to G, an invariant 
(or semi-invariant) is characterized as an ele- 
ment f satisfying the condition X, f 20 (VX,) 
(or X,f —a,f,x,c K (VX,)). For instance, if G 
— GL(Q, K), then f is an invariant if X, f —0 
for only two infinitesimal transformations that 


du 1 ¢ d 1 0 
correspon n - 
p o 0 1 a e 3 , respec 


tively. Similarly, for each Lie group G, there 
exists a finite number of X, such that X,f=0 
for these X, characterizes f as an invariant. 


G. History 


In connection with geometry, the theory of 
invariants, especially that of binary forms, was 
first studied by A. Cayley (J. Reine Angew. 
Math., 30 (1846)). The theory was further 
developed by J. J. Sylvester, R. F. A. Clebsch, 
P. Gordan, and others. Since the theory was 
originated with applications to projective 
spaces in mind, homogeneous semi-invariants 
were important; this is why semi-invariants 
were called invariants in the classical theory. 
On the other hand, in the theory of binary 
quadratic forms, invariants of discontinuous 
groups were studied from the viewpoint of the 
theory of numbers. It was D. Hilbert who 
introduced clearly the notion of invariants for 
general groups. He proved the existence of 
basic invariants in the classical case, making 
use of the tHilbert basis theorem. Hilbert's 
14th problem (— 196 Hilbert) is related to this 
result, but its answer is negative; that is, even 
in the case of a polynomial ring over the real 
number field or the complex number field, 
there are groups acting on the ring without 
basic invariants (M. Nagata). Though the 
theory of invariants has not been studied 
effectively for a long time, it is again under 
active study because of its importance in alge- 
braic geometry. 
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Although inventory control is discussed main- 
ly in connection with production control and 
operations research, inventory models have 
applications in many other fields. For exam- 
ple, an inventory (or queuing) model can be 
used to plan the optimal operation of a dam, 
and may thereby influence the engineering 
design of the dam. 

A mathematical theory of inventory control 
must be based upon an inventory structure in 
which the following items are considered: (i) 
costs and revenue, (ii) demand, and (iii) deliv- 
eries. Item (1) involves (1) order and produc- 
tion costs, (2) storage costs, (3) discount rates, 
(4) penalties, (5) revenues (under the assump- 
tion that price and demand are not controlled 
by the firm), (6) costs of changing the produc- 
tion rate, and so on. For item (it), there are 
several different situations due to the com- 
bination of predictability and stability condi- 
tions of demand for the commodity. Current 
theories are concerned with two particular 
situations. In the first, the fprobability distri- 
bution of demand is known to the firm. In the 
second, the date of occurrence of orders is a 
trandom variable with a known probability 
distribution, while the amount of demand is a 
known constant. A ‘minimax principle is 
applied when some of these probability distri- 
bution functions are unknown to the firm. 

Problems in (iii) are due to delays (lead time 
periods) after an order for inventory is placed 
or a decision is made to produce for an un- 
certain demand. 

The following inventory system has been 
studied mathematically in much of the litera- 
ture. Let x, be the initial storage level for the 
nth period, and £, and z, be the demand and 
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the amount of order, respectively, for the nth 
period. Then y, — x, 4- z, is the storage level 
after the arrival of ordered items, and x,., = 
max(0, y, — ¢,) is the initial storage at the (n+ 
1)th period. Furthermore, in this period, the 
stock shortage 7, — £, — y, is incurred if y, < čą. 
When the demand £, is known, the inventory 
model is a simple deterministic one. When £, is 
a random variable, let its density function be 
oC For constructing the model, the following 
three cost functions are introduced: The order- 
ing cost c(z) of order z, the holding cost A( y) of 
storage y for one period, and the penalty cost 
p(y) for a shortage y. Then the total cost for 
the period of storage level y after reorder is 


I(y)= | neues! p yo(de. 
y 


0 


Let x be the initial storage for the first period, 
and let f,(x) represent the minimal total cost 
of an n-period inventory model; then Lisi 
satisfies the following fundamental func- 
tional equation with a fixed discount rate « 

(0 ca « 1): 


fo- nin min Jeo 410) 
e| ur oat]. 
0 


Ies] f. e-8o | 


In this equation, the first term of the right- 
hand side gives the amount of order at the first 
period as y,(x)— x, where y,(x) is a minimizing 
value of y, and the second term gives the cost if 
no order is put in. Policies of the (s, S) type are 
defined as follows: Order S — x if x « s, and do 
not order if x 2 s. For the special case when 
the ordering function is given by c(z) 2 C + 

cz (z>0); =0 (z=0), some sufficient conditions 
for the optimal policy to be of (s, S) type have 
been studied. 

For various inventory models where the 
demand and lead time are random variables, 
the optimal policy can be derived by using 
queuing theory. We consider an inventory 
model with maximum inventory M. The queu- 
ing situation corresponding to this model 
is an M-channel queuing system. "Empty 
shelf space due to demand" corresponds to the 
arrival of a customer, the ordering instance 
corresponds to the commencement of service, 
and the arrival of an ordered item corresponds 
to the completion of service. Thus the station- 
ary probability of storage level n, P. can be 
obtained by solving the equilibrium equation 
of the queuing system; then mean inventory, 
mean sales, and mean ordering number can be 
derived. This above ordering system is called 
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"reorder for each item sold"; more general 
systems have been treated in a similar manner. 

Another example of the queuing system 
approach may be illustrated by way of the 
planning problem for dams mentioned at the 
beginning of this article: The probability distri- 
bution of the amount of water stored in the 
dam is derived by analyzing the functional 
equation of a queuing system. Let the capacity 
of the dam be M, the storage of the dam at the 
nth period be x,, and the outflow after inflow 
z, be €,4,; then the storage at each period is 
given by 


X,=x, O<x<M, 


Xn+1 =[M — č, t[x,*z,—M] ]'. 


where a* = max(0,a) and a’ = min(0, a). 

The equations for queuing models will be 
the same as those given above for the inven- 
tory model if we let x, be the waiting time of 
the nth customer, z, the service time, ¢,,, the 
interarrival time of customers n and the n+ 1, 
and M the service time. 
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A. The Classical Isoperimetric Problem 


Two curves are called isoperimetric if their 
perimeters are equal. The term curve is used 
here to mean a !Jordan curve. The classical 
isoperimetric problem is to find, among all 
curves J with a given perimeter L, the curve 
enclosing the maximum area. This problem is 
also called the special isoperimetric problem or 
Dido's problem. Its solution is a circle. The 
analogous problem in 3-dimensional space has 


857 


a sphere as its solution; that is, among all 
closed surfaces with a given surface area, the 
sphere encloses the maximum volume. The 
following variational problem can be regarded 
as a generalization of the classical isoperi- 
metric problem: To find the curve C: y= f(x) 
that gives the maximum value of the func- 
tional fc F(x, y, y)dx under the subsidiary 
condition (c G(x, y, y) dx — constant. This is 
sometimes called the generalized isoperimetric 
problem. The classical isoperimetric problem 
can be solved by variational methods. It can 
also be solved by using inequalities involving 
quantities associated with the figure in ques- 
tion. For example, the inequality 


L?^—4nF 20 (1) 


between the area F and the perimeter L ofa 
Jordan curve J solves the problem, since the 
equality holds only for a circle. For refinements 
of (1) there are further inequalities due to T. 
Bonnesen (1921): 


L? -AnF z ( L—2nry,, 
L? —4nF z QxR — Ly, 
L? -4nF 2 i? (R —ry,, 


where r is the radius of the largest circle in- 
scribed in the curve J and R the radius of 
the smallest circumscribed circle. These in- 
equalities can also be used to solve the iso- 
perimetric problem. Moreover, we have the 
following inequality for curves on the sphere 
of radius a: 


R-r 


L?—4nF + Fa ? > 8ra? sin ————— 
" "eere PETER 


(F. Bernstein, 1905). For curves on the surface 
of negative constant curvature —1/a?, we have 


L? —4rF — F? a`? 


Sr ida +Fa`?) (ann — tanh ai 

4 2a 2a 
(L. A. Santaló [2]). From these inequalities, we 
see that the circle remains the solution to the 
isoperimetric problem in each of the non- 
Euclidean planes. 

The corresponding problem in the 3-dimen- 
sional case is more difficult. Without going 
into detail, we cite the following example: For 
an *ovaloid with surface area S and volume V, 


S? — 36nV? x 0, 


where the equality holds only for the sphere. 


B. Isoperimetric Inequalities on Eigenvalues 


In recent years the concept of isoperimetric 
inequality has been extended to include all 
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inequalities connecting two or more geometric 
or physical quantities depending on the shape 
and size of a figure. For example, it includes 
inequalities on teigenvalues of partial dif- 
ferential equations under given boundary 
conditions. 

Lord Rayleigh conjectured in 1877 that, for 
the equation Au LAN —0 for a vibrating mem- 
brane on a region D of fixed area F (with u 20 
on the boundary), the first eigenvalue 4, is 
least when D is a circle. This conjecture is true. 
In fact, in 1923 G. Faber and E. Krahn proved 
independently that 


A, Z (n/F)j? Q) 


and that the equality in (2) holds if and only if 
the domain D is a circle, where j = 2.4048 ... is 
the first positive zero of the t Bessel function 
Jo(x). For the second eigenvalue 4, of the 
same problem, the circle does not give the 
minimum value. I. Hong (1954) gave the 
inequality 


Ai 2 Qn] F)j 


and showed that 4, approaches its greatest 
lower bound, (27/F)j*, as the shape of the 
domain approaches a figure consisting of two 
equal mutually tangential circles, each having 
area F/2 [4]. 

Many other results were found with regard 
to isoperimetric inequalities on eigenvalues of 
partial differential equations, for example, 
results related to eigenvalues for a membrane 
under other boundary conditions, such as 
Ou/On — 0, and for other types of equations, 
such as AAu — Aus 

A method devised by J. Steiner and called 
symmetrization is a powerful means of dis- 
covering isoperimetric inequalities. Steiner's 
symmetrization with respect to the line / 
changes the plane domain P into another 
plane domain Q characterized as follows: 

Q is symmetric with respect to /, any straight 
line g perpendicular to / that intersects one of 
the domains P or Q also intersects the other, 
both intersections have the same length, and 
the intersection with Q is a segment bisected 
by |. This operation can be extended to a 
space of higher dimension by replacing / with 
a hyperplane. Steiner's symmetrization pre- 
serves area (or volume) and diminishes perim- 
eter (or surface area). Steiner first used these 
properties to solve the classical isoperimetric 
problem in 1838. In 1945, G. Polya and G. 
Szegó found that the electrostatic capacity 

of a solid is diminished by Steiner's sym- 
metrization. The concepts developed in their 
papers made possible a systematic treat- 
ment of many isoperimetric inequalities and 
estimations of mathematical and physical 
quantities. 
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Carl Gustav Jacob Jacobi (December 10, 
1804—February 18, 1851) was born into a 
wealthy banking family in Potsdam, Germany. 
He was well educated at home and was highly 
cultured in many areas. He entered the Uni- 
versity of Berlin, studied mathematics largely 
on his own through Euler's texts, and obtained 
the doctorate in 1825. The following year he 
became a private lecturer at the University of 
Kónigsberg, and in 1831 a professor. For the 
next seventeen years he worked vigorously in 
KGnigsberg, where his influence was consider- 
able. Toward the end of his life, his health 
failed; moreover, he lost his property and met 
with general misfortune because of the polit- 
ical situation of the time. He made no further 
contributions after 1843; he died of smallpox 
at 47. 

Because he had an intense personality, there 
were times when he invited the animosity of 
people; however, he did have early contact 
with tAbel, and in his later years he enjoyed 
the friendship of tDirichlet. Jacobr's mathe- 
matical works lacked formal completeness 
but were very original and contributed to 
many fields. The *Hamilton-Jacobi equation in 
dynamics and the tJacobian determinant of a 
differentiable mapping are well-known prod- 
ucts of his ideas, but even more noteworthy 
are his contributions to the theory of elliptic 
functions and algebraic functions, particularly 
the introduction of theta functions and the 
treatment of the inverse problem of hyper- 
elliptic integrals [2]. 
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Japanese Mathematics 
(Wasan) 


Before the introduction of Western mathe- 
matics, indigenous Japanese mathematics de- 
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veloped along its own characteristic lines. In 
Japanese, this form of mathematics is called 


. wasan. The first development took place in the 


8th century A.D. during the Nara era under the 
influence of the Tang dynasty in China. After a 
period of decline came another period of de- 
velopment from the 13th to the 17th century. 
During this second wave, Chinese mathemat- 
ical books such as Suanhsueh chimeng and 
Suanfa tangtsung were imported, along with 
the abacus, or soroban as it is called in Japa- 
nese, and calculating rods, sangi in Japanese, 
with which Chinese mathematicians performed 
algebraic operations. Japanese mathematicians 
absorbed these methods and invented and 
developed their own written algebra, called 
endan-jutsu or tenzan. 

The fundamental concepts of wasan are 
attributed to Seki Takakazu, or Seki Kowa 
(Seki is the surname; likewise for other Japa- 
nese names in this article), Takebe Katahiro, 
and Kurushima Yoshihiro. Its main develop- 
ments stem from Ajima Naonobu and Wada 
Yasushi (or Wada Nei), among others. Wasan 
scholars obtained interesting and significant 
results, but they pursued mathematics as an 
art in the Japanese manner rather than as a 
science in the Western sense. Wasan had no 
philosophical background as did the Greek 
tradition, nor did it have an intimate relation 
with the natural sciences. Thus it lacked the 
character of systematized science and dissolved 
after the introduction of Western mathematics 
into the school system by the Meiji govern- 
ment (1867—1912). 

Among wasan works of the earlier period, 
Jinkóki by Yoshida Mitsuyoshi (1598—1672), 
the first edition of which appeared in 1627, 
contributed much to popularize the soroban 
and to arouse general interest in mathematics. 

Seki Takakazu (16422-1708) was born in 
Fujioka in the Gunma prefecture. Some his- 
torians say he learned mathematics from 
Takahara Yoshitane, while others say he was 
completely self-taught. His achievements in- 
clude the following: (1) the invention of endan- 
jutsu, or written algebra; (2) the discovery of 
determinants; (3) the solution of numerical 
equations by a method similar to *Horner's; (4) 
the invention of an iteration method to solve 
equations, similar to Newton's; (5) the intro- 
duction of derivatives and of discriminants of 
polynomials; (6) the discovery of conditions for 
the existence of positive and of negative roots 
of polynomials; (7) a method of finding max- 
ima and minima; (8) the transformation theory 
of algebraic equations; (9) continued fractions; 
(10) the solution of some Diophantine equa- 
tions; (11) the introduction of !'Bernoulli num- 
bers; (12) the study of regular polygons; (13) 
the calculation of x and the volume of the 
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sphere; (14) *Newton's interpolation formula; 
(15) some properties of ellipses; (16) the study 
of the tspirals of Archimedes; (17) the discov- 
ery of the Pappus-Guldin theorem; (18) the 
study of magic squares; and (19) the theoretical 
study of some questions arising out of math- 
ematical recreations (called mamako-date, 
metsuke-ji, etc.). 

Takebe Takahiro (1664—1739) was a disciple 
of Seki. He is the author of the book Enri 
tetsujutsu. (Enri, or circle theory, was one of 
the favorite subjects of wasan scholars.) It 
contains the formula 


1 2 «x^ 2? >x* 02?.47.X8 
Lac: = 7 + + 





4 6 


He also obtained other formulas of trigono- 
metry and some approximation formulas, by 
means of which he compiled trigonometric 
tables to 11 decimal places. In collaboration 
with Seki he wrote the 20 volumes of Taisei 
sankei and Fuk yti tetsujutsu, the latter elucidat- 
ing the methodology of the Seki school. It 
contains a value of z to 42 decimal places. 

Kurushima Yoshihiro (?- 1757) was an 
original scholar influenced by Nakane Genkei 
(1662-1733), a disciple of Takebe. He gen- 
eralized an approximation formula for sines 
obtained by Takebe, treated problems of max- 
ima and minima involving trigonometric func- 
tions, improved the theory of determinants 
and the theory of equations, obtained a for- 
mula for $,— 1? +... -- n? without using Ber- 
noulli numbers, and found a relation among a, 
b, c, n by eliminating x from 


Xt (xe)... (xc (n—1)e)2a, 
x* (x - eK... +(x+(n— Lo} =b. 


Furthermore, he studied *Euler's function o(n) 
before Euler and obtained the ! Laplace expan- 
sion theorem for determinants before Laplace. 
He is said to have contributed to Hóen san- 
kei, an important work on enri written by 
Matsunaga Yoshisuke (c. 1694—1744) in which 
a value of z is given up to the 50th decimal 
place. The Seki school, continued under 
Takebe, Nakane, Kurushima, and Matsunaga, 
became a center of wasan. Scholars of this 
school lived mainly in Edo (the ancient name 
for Tokyo). Yamaji Nushizumi (or Shuji) 
(1704-1772) studied wasan with Nakane, 
Kurushima, and Matsunaga. Arima Yoriyuki 
(1712-83), one of his disciples, for the first 
time made public the teachings of this school 
in the book Shiki sampó. The practice of dedi- 
cating to Shinto shrines or Buddhist temples 
tablets engraved with solved mathematical 
problems became popular during this period. 
Ajima Naonobu (1739—98) was a disciple 
of Yamaji. He improved enri, simplified its 
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theory, and amplified its applications. He 
treated problems of finding volumes involving 
double integrals, discovered the binomial 
theorem for exponent 1/n, compiled a table of 
logarithms to 14 decimal places, and treated 
Diophantine problems. No trace of demon- 
strative geometry ts found in wasan, but Ajima 
and his school did treat geometric problems, 
such as *Malfattí's problem, dealing with 
several circles tangent to each other. 

Wada Yasushi (or Wada Nei, 1787-1840) 
studied wasan with Kusaka Makoto (1764— 
1839), a disciple of Ajima. He made tables 
containing more than 100 definite integrals, 
including, for example, 


1 1 
| x?(1 — x! dx, | x?(1 — x?! dx. 
0 0 

However, there is no evidence that wasan 
scholars, even in this period, knew of the fun- 
damental theorem of infinitesimal calculus. 

Apart from the Seki school, there were 
Tanaka Yoshizane (1651—1719), a contempor- 
ary of Seki, and his disciple Iseki Tomotoki 
(c. 1690). Tanaka is said to have ranked with 
Seki in his work on determinants and magic 
squares, but most of his writings have been 
lost. Iseki wrote a text on determinants called 
Sanpó hakki (1690), the first of its kind in the 
history of mathematics. A little later, Takuma- 
ryü (the Takuma school) was formed in Osaka. 
Inó Tadataka (1745-1821), famous for making 
the first precise map of Japan, had studied 
wasan with Takahashi Yoshitoki (1764—1804), 
who belonged to this school. Aida Yasuaki 
(1747—1817), a contemporary of Ajima, 
founded Saijó-ryü, or the “superlative school,” 
and rivaled Fujita Sadasuke (1734—1807) of 
the Seki school. 

Toward the end of the Tokugawa era, the 
study of geometric problems became popular 
among wasan scholars, such as Hasegawa Kan 
(1782-1383), Uchida Gokan (1805-82), and 
his disciple H6d6j1 Zen (1820-68), who used 
the method of tinversion. Hasegawa wrote 
Sanpó sinsho, a popular work containing an 
explanation of the methods of enri. 

The influence of Western mathematics is 
hardly recognizable outside astronomy, cal- 
endar making, and the compilation of loga- 
rithmic tables. However, some wasan scholars 
took a more positive attitude and began study- 
ing Western mathematics toward the close of 
the Tokugawa period, thus helping to lay the 
foundations for the development of mathe- 
matics in Japan in the new era. 

Following the Meiji restoration, Kikuchi 
Dairoku (1855-1917), Hayashi Tsuruichi 
(1873-1935), Fujiwara Matsusaburo (1861— 
1933), and recent scholars contributed much 
to preserve wasan literature and to clarify its 
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content, but their undertaking has not yet 
been completed. 
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A. Definitions 


Let A be a *linear space over a field K. If there 
is given a *bilinear mapping (multiplication) 

A x A> A, the space A is called a distributive 
algebra (or nonassociative algebra) over K. In 
particular, if this multiplication satisfies the 
associative law, A is called an associative 
algebra over K (— 29 Associative Algebras). 
We assume that A is a distributive algebra 
over K of finite tdimension over K. Denote 

by E(A) the associative algebra of all K- 
*endomorphisms of the K-linear space A. With 
every ae A we associate R,€ E(A), L,e E(A) by 
R,(x) 2 xa, L,(x) 2 ax, where x belongs to A. 
The 'subalgebra of E(A) generated by the R,, 
L, (ae A) and the tidentity mapping of A is 
called the enveloping algebra of A. The left, 
right, and two-sided *ideals of A are defined as 
in the case of associative algebras. An element 
c of A is said to be in the center of A if (i) ac = 
ca and (ii) a(bc) — (ab)c, a(cb) =(ac)b, and 

c(ab) =(ca)b for every a, b in A. We denote the 
product of two elements a, bin A by a: b in 
order to distinguish it from multiplication in 
the case of associative algebras. Denote a: a by 
a? and put A? —(a,:a5|a,,a5€ A}. Define A” 
successively by A = A, AV = AT... ARTI 
(A®)?. Then A is called a solvable algebra if 
A'? — 0 for some n and a nilalgebra if every 
element of A is *nilpotent. A distributive alge- 
bra A is called a Jordan algebra if the follow- 
ing two conditions are satisfied for every 

a, uin A: (i) a:u—u-a and (ii) a?-(u:a)— 

(a? -u): a. We omit discussion of noncom- 
mutative Jordan algebras in this article. A 
distributive algebra A is called an alternative 
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algebra if the following two conditions are 
satisfied for every a, u in A: (i) a: u? -(a:u):u 
and (ii) u? a—u- (ua). An alternative algebra 
A is called an alternative field if L,(A)= A= 
R,(A) for every a in A such that a #0. Gen- 
eralizing these algebras we obtain the notion 
of a power associative algebra. A distributive 
algebra A is a power associative algebra if 
every element of A generates an associative 
subalgebra. Jordan algebras and alternative 
algebras are power associative. 

We now consider an algebra A over a field 
K. We assume that the !characteristic of K is 
not 2 and that A is of finite dimension over K. 
If A and B are associative algebras, a linear 
mapping c: A->B is called a Jordan homomor- 
phism if (i) (a?)? = (a^)? for every a in A and (ii) 
(abay — a^b*a* for every a, b in A (where we 
denote by a’ the image of ae A by the map- 
ping ol If B does not contain tzero divisors, 
then (ab) — a?b? or (ab)? — bor, 

Let A be an associative algebra. Define a 
new multiplication : in A by a: b —(ab + ba)y/2. 
We then have a Jordan algebra A" A sub- 
algebra of the Jordan algebra A" is called a 
special Jordan algebra. Let K[x;,...,x,] be 
the noncommutative free ring in the indetermi- 
nates x,, ..., x, (that is, K[x,,...,x,] is the 
associative algebra over K that has as its K- 
bases the free semigroup with identity element 
1 over the free generators x,, ..., x,). The sub- 
algebra K[x,,...,x,]* generated by 1 and 
the x; is called the free special Jordan algebra 
of n generators and is denoted by Jf?. A Jor- 
dan algebra A is special if and only if there 
is an isomorphism of A into B^, where B is 
some associative algebra. A Jordan algebra 
that is not special is called exceptional. All 
homomorphic images of JỌ? are special. De- 
note by & the ideal of JẸ’ generated by x ?— 
y? (note that Jj? G K[x, y, z]). Then Ji? /2? 
is exceptional. A Jordan algebra is special if it 
contains the unity element and is generated by 
two elements. If A is an alternative algebra, the 
associated Jordan algebra A* is special. 

Condition (ii) for a Jordan algebra A is 
equivalent to [R,, R,-2]=0. In A, we have 
[Ra Ry] + [ Rs, Real + IR. Rar] =0; and 
TIR. Ra], Re] = Rp, 4. Here we put [S; T] = 
ST— TS, [a,b,c] 2 (a: b): c—a-(b: c). Such 
an equation in A is called an identity in a 
Jordan algebra. 


B. Structure of Jordan Algebras 


A Jordan algebra A has a unique largest solv- 
able ideal N, which contains all nilpotent 
ideals of A and is called the radical of A. If N 
=0, A is called semisimple. The quotient A/N 
is always semisimple. A semisimple Jordan 
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algebra A contains the unity element and 

can be decomposed into a direct sum A— 

A, Q ... ® A, of minimal ideals A;. Each A; 

is a tsimple algebra. In particular, if K is of 
characteristic O, there is a semisimple subalge- 
bra S of A such that A— S @ N. Let e be an 
idempotent element of A, and let 4e K. Put 
AG) = ix|xeA, e: x 2 Ax]. Then we have 


A — A«(1) 6 A,(1/2) © A,(0). 


This decomposition of A is called the Peirce 
decomposition of A relative to e. Suppose that 
the unity element 1 is expressed as a sum of 
the mutually orthogonal idempotents e;. Then, 
putting A;;— A, (1), A; j= A«(1/2) A, (1/2) 
we have A = uer A;,;. The A, j are called 
Peirce spaces. Furthermore, suppose that for 
every i, A; ; is of the form A; ;=K-e;+N,, with 
N; a nilpotent ideal of A, ;. In this case, A is 
called a reduced algebra and the number of the 
e; is called the degree of A. 

Let D be an alternative algebra with the 
unity element 1 and with an tinvolution `~. 
Furthermore, suppose that there is a tquadra- 
tic form Q(X) such that x: x 2x: x 2Q(x):1 
(xe D) and that f(X, Y)-(Q(X + Y) Q(X )— 
Q(Y))/2 is a tnondegenerate bilinear form. 
Then D is called a composition algebra. 

Reduced simple Jordan algebras A are 
classified into three types: 

(D A- K-1. 

(2) There exist idempotents e,, e, such that 
A-K-1Q K-:(e,—e;) A, 2, where A, 2 £ 
0; and furthermore, the multiplication of 
A is determined as follows: Let x =a(e, — 
€o)+ 41,2, y — fí(e4 — e5) - b, 2 bein A with 
a, PEK, a, 5,b, 4€ A, ,. Then x: y (auf + 
f(a,, 5, b,,2)): 1, where f is a nondegenerate 
symmetric bilinear form. 

(3) Let D be a composition algebra with 
the involution `, and let D, be the ftotal 
matrix algebra over D of degree n. Let r= 
diag{r,,...,1,} (riz 0) be given in D,. Define 
J:D,2D,bya!-r:a:r ^ !. Then A is of the 
form A- (xeD,|x 2 x?). 

Let A be a special, reduced simple Jordan 
algebra such that A; ; Ke;. If A is of degree 
2, then A is a *Clifford algebra. If A is of de- 
gree 73, then A is classified into five types. 

Let A be a simple Jordan algebra over a 
field K. Then there is an extension field P of K 
of finite degree such that Ap(=A ® xP) is 
isomorphic to one of the following five types: 
(i) P, where P, is the total matrix algebra of 
degree n over P; (ii) the subalgebra of P 
consisting of all symmetric matrices in P; ; (iii) 
{xe P3, | x? =x}, where 


with 'x the transpose of x and I, the unit ma- 
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trix of degree m; (iv) an algebra generated by 
the generators so, 5,, ..., s, together with the 
defining relations so: s;— S; s? = So, s; s;— D 
(ij); (v) HF, where 2, is the algebra of all 
3 x 3 Hermitian matrices over a tCayley 


algebra. 


C. Representations of Jordan Algebras 


A representation S of a Jordan algebra A on a 
K-linear space M is a K-linear mapping a—S, 
from A into the associative algebra E(M) of all 
K-endomorphisms of M such that (i) [S,, $,..] + 
Is, Sea] - [$,, Sap] 2 0 and (ii) SSpS. + 

Se Sp Sa + $5... = Sa Spc + Sp Sac + S.S, (for all 
a,b, c in A). A K-linear space M is called a 
Jordan module of A if there are given bilinear 
mappings M x A-»M (denoted by (x, a) x- a), 
A x M M (denoted by (a, x) 5a: x) such 

that for every xe M and every a, b, ce A, (i) 
x:a-a:x;(i)(x-a) (b:c)3-(x: b): (ac) 9 

(x: c) (a: b) (x: (b: c) at (x:(ac c); b+ 

(x- (a: b)): c; and (iii) x: a- bc x:c:b- a4 
acc b: xs(x:ce)y (ac b) (x:a): (b: c)»-(x- b)- 
(a: c). As usual, there is a natural bijection 
between the representations of A and the 
Jordan modules of A. A special representation 
of a Jordan algebra A is a homomorphism 
A-+E*, where E is an associative algebra. 
Among the special representations of A, there 
exists a unique universal one in the following 
sense. There exists a special representation 
8:A—U* with the following property: For 
every special representation c: A E* there 
exists a unique homomorphism gy: UE such 
that o =S. The pair (U, S) is uniquely deter- 
mined. Furthermore, if A is n-dimensional over 
K, U is of dimension x 2"— 1 over K. The pair 
(U, S) is called the special universal enveloping 
algebra of A. A is special if and only if S: 4 
U * is injective. 

A Jordan algebra A has only a finite number 
of inequivalent *irreducible Jordan modules. 
Suppose that the base field K is of character- 
istic 0. Let S be a representation of A. If N is 
the radical of A, S(N) is contained in the rad- 
ical of the associative algebra S(A)* generated 
by S(A). If A is semisimple, so is $(A)*. In 
general, the radical R of S(A)* is an ideal of 
S(A)* generated by S(N). Furthermore, for 
every semisimple subalgebra T of A such that 
A-—TQG N, we have S(A)* — S(T)* ®R. 
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A. Definitions 


Let X be a tcomplex manifold of complex 
dimension n. Let J denote the ttensor field of 
type (1, 1) of *almost complex structure in- 
duced by the complex structure of X (— 72 
Complex Manifolds A). Considering J as a 
linear transformation of vector fields on X, we 
have J? = —1. A *Riemannian metric g of class 
C? on X is called a Hermitian metric on X if 
g(x, y) 2 g(Jx, Jy) for any two vector fields x, y 
on X. On a tparacompact complex manifold 
X there always exists a Hermitian metric. If we 
put Q(x, y) 2 g(Jx, y) for a Hermitian metric g, 
then Q is an 'alternating covariant tensor field 
of order 2, and hence a ‘differential form of 
degree 2. We call Q the fundamental form 
associated with the Hermitian metric g. If 

the texterior derivative of Q vanishes, i.e., 
dQ=0, the given Hermitian metric g on X 

is called a Kahler metric on X, and X is called 
a Kahler manifold. With a tholomorphic local 
coordinate system (z!, ...,z") on a coordi- 
nate neighborhood U in X, we can write g — 
Èz. g=1 Jap dz" dz? in U, where the matrix (sl 
is an n x n tpositive definite Hermitian matrix. 
Then the fundamental form can be expressed 
as Q=(// —1/2)X gs dz" ^ dz". 

If a complex manifold X is a Káhler mani- 
fold with a Kahler metric g, then X has the 
following properties: (1) For every point p of X 
there exists a real-valued function w of class 
C* on a suitable coordinate neighborhood of 
p such that g,5 = 0^ w/0z*0z". (2) For every 
point p there exists a holomorphic local co- 
ordinate system at p whose real and imaginary 
parts form a tgeodesic coordinate system at p 
in the weak sense. (We say a coordinate system 
(X4, ..., x) is a geodesic coordinate system at p 
in the weak sense (IN. (0/0x7)], —0, i, j= 
1, ...,n (— 417 Tensor Calculus).) Each of 
these properties characterizes a Kahler metric. 
The Kahler metric was introduced by E. Káh- 
ler with property (1) as the definition. W. V. D. 
Hodge applied it to the theory of harmonic 
integrals [7]. 


B. Harmonic Forms on Compact Kahler 
Manifolds (— 194 Harmonic Integrals) 


We now consider complex-valued differential 
forms on a compact complex manifold X of 
complex dimension n endowed with a Her- 
mitian metric. The operators d and a on real- 
valued differential forms can be uniquely 
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extended to complex linear operators, and the 
inner product of real differential forms can be 
extended to a Hermitian inner product of the 
form (p, V) — [x 9 ^ «y. Then the *adjoint 
operator 6 of d is also complex linear, and 

the decomposition d — d' +d” gives rise to d= 
6'+ 6", where A and ô” are the adjoint opera- 
tors of d' and d", respectively. We also have 

0’ =(—1)*d" * and ô” z(—1)«d'*. Define the 
operator L by Lp=Q ^ o, and denote by A the 
adjoint operator of L. Then for p-forms, we 
have A-(—1)?« L«. We say that q is primitive 
when Aq =0. Some important properties of 

L and A follow: (3) For a p-form o Ao? =0 if 
and only if L’ =0 (q = max(n — p + 1,0)). (4) 

If Ao —0 and q is of type (r,s), then for all 
O<q<n—p, 


*L'o- ((—1pe*»? 


x (/ —1) ?qU(n—p—-q) L"? o 


(where p =r + s). (5) A p-form ọ can be 
uniquely decomposed in the form o = o + 
Lo, t... L'o, with r=[p/2] and o; primi- 
tive (Hodge [7], Weil [13]). Properties (3)-(5) 
are shared by all Hermitian metrics. When 
the metric is Kahlerian, we further have 


Ld—dL-0, Ad’'—d'A=,/-1 ô”, 


Ad" —d"A=—./-1 ó. 
From these we also obtain 


AL-LA, AA=AA, 


d'ó"-Fó'q'—0,  d"ó'-pó'd" —0, 


A — 2(d'ó' + ó'd') z 2(d" 0" + ó" d"), 


where A is the *Laplace-Beltrami operator A= 
dé + ôd. *Green's operator G commutes with 
d', d'A and ô”. Thus when the metric is Káh- 
lerian, we have: (6) Let L ,(X)=È,+s-p Lr, (X) 
be the decomposttion of the space of p-forms 
into the spaces L, ,(X) of forms of type (r,s). 
Then, correspondingly, the space H,(X) of 
harmonic p-forms is the direct sum H,(X)— 

2 ++s=pH,,.(X) of the spaces H, ,(X) of har- 
monic forms of type (r, s). (7) If we let A— 

d"ó" --ó" d", then Ap =0 is equivalent to Ag= 
0. Denote the projection to the space of har- 
monic forms by H. Then using the formula 1 — 
H+AG (— 194 Harmonic Integrals A), we 
can infer that not only in the tcochain com- 
plex ($£, L (X), d) but also in the cochain com- 
plexes (X, L, (X), d") and (È, L, ,(X), d’), H is 
homotopic to the identity mapping and the 
respective cohomology groups of degree s and 
degree r of the last two complexes are both 
isomorphic to H. ,(X). (8) For a harmonic p- 
form q, the forms o. in the decomposition (5) 
are harmonic. (9) The exterior powers (UN (r = 
0, 1, ..., n) of the fundamental form Q are non- 
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zero and harmonic. (10) Holomorphic dif- 
ferential forms are harmonic. 


C. Further Properties of Compact Káhler 
Manifolds 


The p-dimensional tde Rham cohomology 
group with complex coefficients of a compact 
Kahler manifold X is canonically isomorphic 
to the direct sum of the tDolbeault coho- 
mology groups of type (r,s), where r+s=p 

(^ 72 Complex Manifolds D). Let b, and br, 
respectively, denote the pth Betti number of X 
and dime H*(X,Q"). Then it follows that b, = 
Ly+s=ph"*. If 9 is harmonic, then @ is also 
harmonic, so we have h"5— h*", Therefore, if p 
is odd, b, is even. This is a generalization of 
the fact that the first Betti number of a closed 
Riemann surface is even. From (9) we have 
b, = 1 if p is even. Furthermore, a linear com- 
bination of irreducible analytic subsets of X of 
(complex) dimension r with positive real coeffi- 
cients is a cycle of degree 2r that is never hom- 
ologous to zero on X, since the integral of Q” 
on the cycle is never zero. 

For a compact Kahler manifold X, we can 
define a tcomplex torus 21 and a holomorphic 
mapping 4: X >% such that (1) Y is gener- 
ated by A(X) as a group; (ii) any holomorphic 
mapping u:X — T can be decomposed as p= 
XO A-F c, where T is another complex torus, 

x is a complex analytic homomorphism from 
9I to T, and c is a point of T. Wis called the 
Albanese variety of X. The set *B of complex 
analytic isomorphism classes of tcomplex line 
bundles that are trivial as topological bundles 
has a natural complex structure with a canoni- 
cally associated structure of a family of vector 
bundles (— 72 Complex Manifolds G). With 
this structure $ ts, in fact, a complex torus 
and is called the Picard variety of X. Then $ 
and d are constructed using H'(X, R) and 
H?" (X, RL respectively, and are dual com- 
plex tori. If X is a Hodge manifold (— Sec- 
tion D), P and 9I are Abelian varieties (— 3 
Abelian Varieties). 

A small deformation of a compact Kahler 
manifold is also Káhlerian (a Kahler metric 
can be taken to be of class C” with respect to 
parameters) [9]. But a limit (in the sense of 
deformation) of a Káhler manifold is not 
always Káhlerian. An example was given by 
Hironaka [6]. 

Generalizing the notion of compact Kahler 
manifold, A. Fujiki [2,3] introduced the cate- 
gory € by X e€ if and only if (i) X is a com- 
pact complex reduced space and (ii) there exist 
a compact Kahler manifold Y and a surjec- 
tive holomorphic mapping: Y X. Fujtki 
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proves among other things that when X €@ is 
a manifold, (1) every holomorphic p-form is 
closed, (2) H"(X, C) z B, H'(X,O* ?) for any 

n Z0, and (3) H?(X,04%)= H*(X,Q5) as a vector 
space, where the Q% denotes the sheaves of 
holomorphic p-forms on X. 

Let X be a compact Káhler manifold 
with Kahler metric g = X g,5 dz? dZ’. Then 
R(g) - X R,g(g)dz* dz" (where Raa = 
— 0?/0z* Oz" (log(det(g,5)))) is the associ- 
ated Ricci tensor. According to Chern, 

G/ Jett Rag(g) dz* ^ dz? is a closed (1, 1) 
form representing the first Chern class of X. 
Conversely, for a compact Kahler manifold X, 
E. Calabi conjectured that: 

(a) Given a real closed (1, 1) form 

G-A /21)X. R,zdz* ^ dz? which represents the 
first Chern class of X, there exists a unique 
Kahler metric g on X such that its corre- 
sponding Ricci tensor R(g) coincides with 

È Ra dz? dz? and that g determines the same 
cohomology class as the original Kahler 
metric. 

(b) If X has either negative or zero first Chern 
class, then X admits an Einstein Kahler metric 
g (where "Einstein" means R(g) — (const): g). 

As a partial answer to (b), T. Aubin proved 
the existence of an Einstein Kahler metric on 
compact complex manifolds with negative first 
Chern class. Later, S. T. Yau gave a complete 
affirmative answer for both (a) and (b), devel- 
oping the method (solving complex Monge- 
Ampère equations) of Calabi and Aubin. 
Among several consequences of Yau's work, 
the following by A. Todorov is one of the most 
interesting: Every point of SO(3, 19)/SO(2) x 
SO(1, 19) corresponds to at least one marked 
Kahler surface via period mapping. 

Yau also showed that Aubin's previous 
work solved affirmatively the following con- 
jecture of F. Severi: Every compact com- 
plex surface that is homotopic to P?(C) is 
biholomorphic to P?(C). Yau's proof used a 
delicate differential geometric analysis of the 
inequality c (XY [X] x 3c(X) LX ], proved by 
Y. Miyaoka for compact complex surfaces X 
of general type. The key observation is that if 
X is a compact complex surface with negative 
first Chern class and c,(X)? [X] 23c,(X) LX], 
then the universal covering space of X is an 
open ball ((z,.z5)ec?; |z;]? - |z;]? <1}. For 
recent developments concerning Calabi's con- 
jecture — [1,15]. 


D. Hodge Manifolds 
One of the most important examples of Kah- 


ler manifolds is a projective nonsingular vari- 
ety over the field of complex numbers. If we 
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let (Co, ...,€y) be a homogeneous coordinate 
system in the N-dimensional projective space 
P"(C), then the subset (, #0 has its holomor- 
phic coordinate system (z!, ..., z7), where z! = 
ORES m afe, STT = Cass... 
C"/C*. If we let y, — (1/z)log(1 t Xjlz7l) and 
Gag = 0^ V[0z*0z^, then ds? = E g,5 dz* dz" is 
independent of k and determines a standard 
Kahler metric of P'(C) that coincides with 

the so-called Fubini-Study metric on P*(C) up 
to a constant multiplier. In this case, we have 
the following results: (11) Let $ be a hyper- 
plane of P". Then the integral {,Q of the fun- 
damental form Q on a 2-cycle Z is equal to 

the *Kronecker index KI(Z, $) of the tintersec- 
tion of Z and §. Therefore (12) an integral (a 
period) of O on a 2-cycle with integral coeffi- 
cients is an integer. In other words, Q corre- 
sponds to the cohomology class in H?(X, R) of 
a cocycle with integral coefficients. Property 
(12) holds for the induced Kahler metric on an 
analytic submanifold X of P'(C) (which is 
algebraic by tChow’s theorem). Property (11) 
also holds if we replace § by the intersection Y 
of X and $. Property (8) (— Section B) can be 
thought of as an expression (in terms of har- 
monic forms) of Lefchetz's theorem on the 
topology of projective algebraic varieties. 

Generally, if a compact complex manifold X 
admits a Kahler metric with property (12), we 
say that the metric is a Hodge metric and X is 
a Hodge manifold. Kodaira's theorem asserts 
that a Hodge manifold has a biholomorphic 
embedding into a projective space [8]. Coho- 
mology vanishing theorems (— Section D) and 
the properties of tmonoidal transformations of 
complex manifolds are used to prove this 
theorem. 

On a closed Riemann surface Hie a 1- 
dimensional compact complex manifold, any 
Hermitian metric is Kahlerian. Moreover, a 
metric with total volume 1 is a Hodge metric. 
This proves that 9R is isomorphic to an alge- 
braic curve in a projective space. This is a 
proof (using Kodaira's theorem) of the exis- 
tence of nonconstant meromorphic functions 
on R. The condition on the *Riemann matrix 
for a complex torus T — C"/D (D is a discrete 
subgroup of rank 2n) to be a projective alge- 
braic variety is that T admit a Hodge metric. 

Let L be the sheaf associated with a positive 
line bundle (i.e., its first Chern class is repre- 
sented by a Hodge metric) on a compact com- 
plex manifold X. Then the Kodaira vanishing 
theorem states that H'(X, Ky & L)=0 for any 
i70, where Ky is the canonical sheaf, i.e., 
the sheaf of holomorphic n-forms, n ^ dim X. 
Several generalizations are known. For in- 
stance, let X be a compact complex variety 
with a(X) — dim X and F be a sheaf associated 
with a positive vector bundle on X. Take a 
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nonsingular projective variety Y and a bira- 
tional holomorphic mapping z: Y X and 
define K , to be the direct image z,(K y). Then 
H'(X, Ky ® F)=0 for any i» 0(S. Nakano 
[10], H. Grauert and O. Riemenschneider 
[5]). Ramanujam’s vanishing theorem is con- 
cerned with an effective divisor D on a non- 
singular projective surface S. D is said to be 
numerically connected if the intersection num- 
ber D, -D, >0 for every decomposition D = 
D, +D, with D,>0. If D?>0 and D is numer- 
ically connected, then H'(S,.%,)=0, where 

Jp is the sheaf of ideals defining D [12]. How- 
ever, Kodaira's vanishing theorem fails for 
algebraic manifolds over fields of positive 
characteristic. 


E. Examples and Other Properties 


Concerning differential geometry on compact 
Kahler manifolds, the analytic transformation 
group and the 'isometric transformation group 
are also studied. For example, let X be a com- 
pact Káhler manifold of complex dimension n. 
Then the Lie group of isometric transforma- 
tions on X is of dimension « n? 4- 2n, and the 
equality holds if and only if X = P"(C). 

A nonalgebraic complex torus is the most 
important example of a compact Kühler mani- 
fold that is not algebraic. A complex torus is 
not an algebraic variety if it is not a Hodge 
manifold. 

An example of a noncompact K4hler mani- 
fold is a bounded domain in C" with the Káh- 
ler metric 
ds? =Y (6? log K (z, Z)/0z" Oz) dz" dz, 

a, f 
where K (z, C) is *«Bergmàn's kernel function. 
This metric is significant because of its invar- 
iance under the analytic automorphisms of 
the domain. More generally, a *Stein manifold 
admits a complete Káhler metric [4]. 
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Felix Klein (May 25, 1849—June 22, 1925) was 
one of the leading mathematicians in Germany 
in the latter half of the 19th century. Born in 
Düsseldorf and graduated from the University 
of Bonn, Klein went to study in Paris. In 1872 
he became a professor at the University of 
Erlangen, and in 1886 attained a chair at the 
University of Góttingen, where he was em- 
ployed until his death. His accomplishments 
cover all aspects of mathematics, but his main 
field was geometry. In his inaugural lecture at 
the University of Erlangen, he presented a 
bird's-eye view of all the then known fields of 
geometry from the standpoint of group theory, 
which is referred to as the tErlangen program 
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(— 137 Erlangen Program). In it he stated that 
both Euclidean and non-Euclidean geometry 
are included in fprojective geometry. Klein 
said in his last lectures [4] that he spent the 
greatest part of his energies in the field of 
*automorphic functions. These last lectures are 
important as historical material on the mathe- 
matics of the 19th century. Klein was a leader 
of reforms of mathematical education in Ger- 
many [3]. 
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A. Definitions 


Let H? be the upper half-space of the 3- 
dimensional Euclidean space. The boundary 
plane of H? is regarded as the complex plane 
C. The one-point compactification H°? U C 

of H? is denoted by H?, where C is the ex- 
tended complex plane. Let Mób(H?) be the 
group of all the Móbius transformations in 
I? which are represented by an even number 
of compositions of reflections with respect to 
hemispheres in H* orthogonal to C. Here, the 
plane in H? orthogonal to C is regarded as a 
hemisphere orthogonal to C. The hyperbolic 
structure can be defined naturally in H?, and 
this structure is preserved by any element in 
Mób(H?). A hemisphere in A? orthogonal to 
C is a (hyperbolic) plane in this structure. The 
restriction of any element in Mób(H?) to C isa 
linear fractional transformation on C; con- 
versely, for any given linear fractional trans- 
formation y on C, there is a unique Móbius 
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transformation in Mób(H?) whose restriction 
to C coincides with y. 

A Kleinian group I is defined as a discrete 
subgroup of Mób(H?) and acts discontinu- 
ously on H?. If there exist a point pe H? and a 
sequence |, of distinct elements of F such 
that the sequence {y,(p)} converges to a point 
(cC in the usual topology of Ĥ?, then the 
point ¢ is called a limit point of I, and the set 
A(T) of all the limit points of T is called the 
limit set of F, which is a closed subset of C. 
When A(T) 2 C, the Kleinian group I is of the 
first kind. Otherwise, I is of the second kind, 
and the restriction of T to C acts discontinu- 
ously on the region of discontinuity Q(T) = 
C — A(T) for T. Usually, “a Kleinian group" 
means one of the second kind. Thus the Klein- 
ian group I acts discontinuously on H? U 
Q(T), the quotient space H?/T is a 3-manifold 
with a hyperbolic structure induced by that of 
H?, and the union of some Riemann surfaces 
Q(T)/T is the boundary of this 3-manifold. A 
study of Kleinian groups in connection with 
the theory of 3-manifolds has been done re- 
cently by W. P. Thurston. 

If A(T) consists of at most two points, then 
the Kleinian group I is called elementary; if 
otherwise, I is nonelementary and A(T) is 
perfect and nowhere dense in C, and Q(T) 
allows the Poincaré metric and has a hyper- 
bolic structure invariant under I. A Kleinian 
group whose region of discontinuity has a 
nonempty connected component invariant 
under T is often called a function group. 


B. Examples 


A Fuchsian group (— 122 Discontinuous 
Groups) is a Kleinian group which leaves a 
circular disk in C invariant. Another important 
Kleinian group is the so-called Schottky group. 
Let (C5, CH, ben (22) pairs of Jordan 
curves on C, every one of which lies in the 
exterior of each of the other 2n — 1 curves. 
Assume that there are n loxodromic (or hyper- 
bolic) linear fractional transformations y,, 
where y, maps the interior of C; onto the ex- 
terior of C; conformally. The group generated 
by these {y,} is a Schottky group whose limit 
set is totally disconnected and has its nonem- 
pty subset in the interior of every C; and Cj. A 
Kleinian group is a Schottky group if and only 
if it consists of only loxodromic or hyperbolic 
elements, is finitely generated, and is free. A 
Schottky group is not always Fuchsian, and 
there are many kinds of Kleinian groups 
which are not Fuchsian. Quasiconformal de- 
formation of a Fuchsian group G of the first 
kind yields the *Tetchmiiller space T(G) with 


870 


the center G, and every point of T(G) is a 
quasi-Fuchsian group whose limit set is a 
quasicircle, an image of a circle on C under a 
quasiconformal mapping of C onto itself. To 
every point of the boundary of T(G), there 
corresponds a Kleinian group called a bound- 
ary group for G. Among them, there is a Klein- 
ian group whose region of discontinuity is 
connected and simply connected. Such a 
Kleinian group is called totally degenerate. A 
web group is a Kleinian group whose region of 
discontinuity consists only of Jordan domains. 


C. Fundamental Sets 


Let T be a Kleinian group acting on H?. There 
is a "fundamental set for I in H? which is 
bounded by a countable number of hyperbolic 
planes in H? and is convex in the sense of the 
hyperbolic structure. The closure of this funda- 
mental set in H? is called a convex funda- 
mental polyhedron for T. For a point poe H? 
not fixed by elements of I except for the 
identity, the set {pe H? | d(p, po) € d(y(p), Po) for 
each y eI j, where d denotes the hyperbolic 
distance, yields a convex fundamental poly- 
hedron for T and is called the Dirichlet region 
with the center p, for T. If T has a convex 
fundamental polyhedron with a finite number 
of sides or faces, then T is called geometri- 
cally finite. Similarly, a Dirichlet region for a 
Fuchsian group in the upper half complex 
plane H can be defined, and it is often called a 
normal polygon for the group. For a Kleinian 
group T, instead of fundamental sets for I in 
Q(T), the concept of fundamental domains is 
sometimes useful. A fundamental domain for a 
Kleinian group F in Q(T) (40) is an open 
subset D of Q(T) such that I'-images of any 
point in D, except the point itself, are not in D 
and such that some of the I'-images of any 
point in Q(T) lie on the closure of D in Q(T). 
The Ford fundamental region is also useful. 
The action of yeT on C is given by a linear 
fractional transformation z->(az+ b)/(cz +d), 
ad — bc — 1. The circle (ze C ||ez - d| 2 1] is 
called the isometric circle of y with c #0. If 

oo € O(T), then the set, every point of which is 
in the exterior of isometric circles of all ye I 
with c #0, is a Ford fundamental region for I 


[7]. 


D. Finitely Generated Kleinian Groups 


Much work has been done on finitely gen- 
erated Kleinian groups [2, 5,9, 10, 12, 13], 
while very little has been done on nonfinitely 
generated Kleinian groups. If a Kleinian group 
I (of the second kind) is finitely generated, 
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then the quotient space O(I)/T is a disjoint 
union of a finite number of compact Riemann 
surfaces with at most a finite number of punc- 
tures (Ahlfors's finiteness theorem). Further- 
more, if the above I' is nonelementary and is 
generated by N generators, then the (hyper- 
bolic) area of Q(T)/T is not greater than 4x(N 
— 1) (Bers's area theorem). These two theo- 
rems played important roles in the study of 
Kleinian groups after 1960. If a Fuchsian 
group is finitely generated, then every normal 
polygon for the group has a finite number of 
sides. In contrast with this fact, the following is 
suggestive: If a finitely generated Kleinian 
group I is totally degenerate, then T is not 
geometrically finite (L. Greenberg). 

There is a method to obtain a Kleinian 
group from two simple groups. If I; (j= 1, 2) is 
a finitely generated Kleinian group and if the 
interior of a connected fundamental domain D; 
for I; (j= 1, 2) contains the boundary and the 
exterior of D; (ij), then the group generated 
by T, and I is again a Kleinian group with a 
fundamental domain D, N D, (Klein's combina- 
tion theorem). Generalizations of this theorem 
are discussed by Maskit [14]. 

Let F be a nonelementary Kleinian group, 
and let A (#0) be an invariant union of com- 
ponents of Q(T) under T. Denote by 4,(A,T) 
the Banach space of holomorphic auto- 
morphic forms @(z)dz? in A for T satisfying 
{far eG *|o(z)| dx dy « oo, where A/T is a 
measurable fundamental set for F in A, z=x 
t iy, and p(z)|dz| is the Poincaré metric on A. 
The Banach space B,(A, I) is the totality of 
holomorphic automorphic forms y (z)dz* for I 
in A with sup,., p(z) “|W(z)| « oo. If T is finitely 
generated, then A,(A, D) — B,(A, T), and it is the 
space of all the cusp forms for T in A. If T is 
not finitely generated, then even the inclusion 
A, (Q(T), D) c B,(O(T), D) does not hold in 
general. 


E. Limit Sets of Kleinian Groups 


Let T be a Kleinian group, and let {Q;} be 
the collection of all the components of Q(T). 
Although it was stated in [8] that A(T) con- 
sists of boundaries ĉQ; of Q,, it was proved by 
Abikoff in [3] that the statement is incorrect; 
that is, the residual limit set A;(D) — A(I)— 
|J; ðQ; of T is not always empty. In fact, there 
is a web group whose region of discontinuity 
consists of an infinite number of components, 
and such a group has a nonempty residual 
limit set [1]. 

It was conjectured by Ahlfors that the 2- 
dimensional Lebesgue measure of A(T} equals 
zero for every finitely generated Kleinian group 
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F of the second kind. This conjecture is one 
of the sources of recent developments of the 
theory of Kleinian groups, and it is still open. 
Concerning this conjecture, the following fact 
was proved by Sullivan in [13] from the stand- 
point of the ergodic theory: The Beltrami 
coefficient for I with support on A(T) is equal 
to zero. It is also known that, if I is geometri- 
cally finite, then the conjecture is affirmative. 
The study of Kleinian groups from the view- 
point of ergodic theory has also been devel- 
oped [11, 15]. 

Let M,(I) be the *t-dimensional Hausdorff 
measure of A(T). The Hausdorff dimension 
d(T) of A(T) is defined by d(T) — infí[t 0| 
M,(I) 20]. There are some studies on the 
Hausdorff dimension of A(T) for several kinds 
of Kleinian groups I [3, 4, 6]. 
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A. General Remarks 


The knot problem is a special case of the 
placement problem, stated as follows: Given 
two homeomorphic topological spaces X, 
and X, and their respective homeomorphic 
subsets A, and A,, is there a homeomor- 
phism f: X, >X, such that f(A,)=A,?A 
simple closed curve in a Euclidean 3-space 
R? (or in a 3-sphere $? obtained from R? by 
one-point compactification, sometimes used 
in preference to R?) is called a knot. If two 
knots K, and K, can be mapped from one 
to the other by a homeomorphism of R? 

(or of S?), they are called equivalent. All 
knots are classified into knot types by this 
equivalence. 

Let (x, y, z) be Cartesian coordinates in R?. 
Knots equivalent to x? + y? — 1, z «0 and their 
knot type are called trivial or unknotted; knots 
of other types are called knotted. Knots equiv- 
alent to those given by polygonal curves in R? 
and their knot types are called tame; other 
knots are called wild. We discuss only tame 
knots in this article except in Section H. Given 
a knot K in R°, there exists a plane such that 
the orthogonal projection z on it has the fol- 
lowing two properties: (1) The image z(K) has 
no multiple points other than a finite number 
of double points. (2) The projections of the 
vertices of K are not double points of z(K). 
Then z(K) is called a regular knot projection 
of K. Let z 20 be the plane in question. Of 
the two points of K corresponding to a dou- 
ble point of z(K), the one with the greater z- 
coordinate is called the overcrossing point; 
the one with the smaller z-coordinate is 
called the undercrossing point. Suppose that 
overcrossing points and undercrossing 
points appear alternately when we move 
along K in a fixed direction; then K is called 
alternating. 

Knots K, and K, in R? are said to be of 
the same isotopy type if there is an fisotopy 
{h} (0 t & 1) of R? such that h,: R? OR? are 
homeomorphisms with the identity hj and 
such that h,(K,) 2 K,. Knots K, and K, of R° 
are of the same isotopy type if and only if K, is 
mapped onto K, by an orientation-preserving 
homeomorphism of R?. Thus knots of the 
same isotopy type are equivalent. The con- 
verse, however, is false. A knot that can be 
mapped onto itself by an orientation-reversing 
homeomorphism of R? is called amphicheiral. 
A knot K is called invertible if K is map- 
ped onto itself by an orientation-preserving 
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homeomorphism h such that h| K reverses the 
orientation of K. Knots 4,, 63, 83, Ba, 812, 817 
8,, in Appendix A, Table 7 are amphicheiral, 
and 8,; is noninvertible. 

Given a knot K, there is an orientable sur- 
face F of fgenus h, say, having K as its bound- 
ary, called a Seifert surface of K [1]. The 
minimum of the genera of Seifert surfaces 
of K is called the genus of K, and there is 
an algorithm with which one can determine 
the genus of K (W. Haken, Acta Math., 105 
(1961)). Using tDehn’s lemma and the ‘sphere 
theorem, C. D. Papakyriakopoulos showed 
that for a tame knot, 2,(S* — K)=0 (i> 2), and 
further, K is unknotted if and only if z,($? — 
K)zxZ[2]. 

The first attempt to list all knots systemati- 
cally was made by P. G. Tait and C. N. Little 
in the 19th century; an attempt was also made 
by J. H. Conway in 1970. At present, the classi- 
fication has been completed for all knot types 
with at most ten double points by using the 
invariants of knots introduced in Sections B-E 
[3,4]. 

Given two knots K, and K, separated by a 
2-sphere S?, we can tie them together with a 
narrow strip to obtain a new knot K (Fig. 1), 
called the product or composition of K, and 
K,. If the strip is chosen so that the orienta- 
tions of K,, K,, and K are mutually compat- 
ible, then the isotopy type of K is uniquely 
determined by those of K, and K,. The set of 
isotopy types forms a commutative ‘semigroup 
with respect to this product, in which the 
trivial knot type is an identity. Any knot is 
decomposed uniquely (up to isotopy) into 
finite products of the prime knots, which can- 
not be decomposed into products of knotted 
knots (H. Schubert and S.-B. Heidelberger, 
Akad. Wiss., 3 (1949)). 


Until about 1930 the theory of knots had 
been studied chiefly by J. W. Alexander in the 
United States and by K. Reidemeister, H. 
Seifert, and others in Germany. Little progress 
was made, however, until a new development 
of the theory was introduced by R. H. Fox [5] 
and his school in the United States. In Japan, 
significant contributions have been made by T. 
Homma, S. Kinoshita, K. Murasugi, and 
others [6]. 
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B. Knot Groups 


If K is a knot, the fundamental group G(K) 
=n, (R? — K) of R? - K is called the knot group 
of K. Any group G is written as a tquotient 
group F/N with a free group F and its tnor- 
mal subgroup N. By specifying the generating 
set (x4, X5,...] of F and a set of elements 
frt: } in F such that N is the minimum 
normal subgroup of F containing r,,r;, ..., 
we obtain a system (x, X5, ... i r4, 5, ...), 
called a presentation of G, written G 2 (x,, 

X5, F3, 73, ...). For the knot group G(K), 
there is a standard presentation called a Wir- 
tinger presentation, which is obtained as fol- 
lows. First, for the sake of convenience, we 
give an orientation to K. (The orientation is 
irrelevant, however, since the reverse orienta- 
tion gives an isomorphic group.) Consider a 
regular knot projection z(K). If zx(K) has n 
double points z(d;), then K is divided into n 
arcs z; by n undercrossing points. We con- 
sider a free group F generated by n letters 
X1,...,X,- To each double point z(d;) we asso- 
ciate a word r; in the following manner. The 
two cases near z(d;) are illustrated in Fig. 2. 
For the first case, we define r,— x; 1, x,x;x, !; 
and for the second, r,— x; 1, x; ! x x,. Then the 
presentation (x,, X5, ... , Xy: 4,73, Fal lS 
called a Wirtinger presentation of G(K). Since 
any one of these r; is a consequence of the rest, 
G(K) may be written as (x4, X5, ei En F3, 


vix asd). 


ET -1 
Pats = ZE Xa Xa Xy 
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-1 .-1 
riti = kickt XaX, 


Fig. 2 


If G' is the commutator subgroup of G(K), 
then G(K)/G' is an infinite cyclic group Z. A 
knot group is an invariant of a knot type, but 
two knots with isomorphic knot groups are 
not necessarily equivalent. However, it is not 
known whether or not two prime knots with 
isomorphic knot groups are equivalent. The 
affirmative statement is called the general knot 
conjecture and is one of the fundamental prob- 
lems in knot theory. The knot complement 
conjecture is another important conjecture and 
states that two prime knots with isomorphic 
knot groups have homeomorphic comple- 
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ments [11]. Nevertheless, the knot group plays 
a prime role in knot theory. To be more pre- 
cise, given a knot K in S?, consider a ttubu- 
lar neighborhood V of K. V is a solid torus. 
Choose two simple closed curves m and / on 
OV in such a way that (1) m and l intersect at 
the base point; (2) m bounds a disk in V, but m 
is not homologous to 0 on 6V; (3) | bounds an 
orientable surface in S? — V, but l is not homo- 
logous to 0 on OV. A pair {m, 1} is called a 
peripheral system for K, and m is a meridian 
and l a longitude of K. The meridian m and the 
longitude Las elements of G(K), generate a 
free Abelian subgroup p(K) of G(K), called the 
peripheral subgroup of G(K). Two knots K, 
and K, are equivalent if and only if there is an 
isomorphism q from G(K,) to G(K,) which 
sends {m,,/,} onto a conjugate of (mz, I!) in 
G(K,) (F. Waldhausen, Ann. Math., (2) 87 
(1968)). Therefore the group system {G(K), 
p(K)} is an important invariant of the knot 
type of K. 

G(K) has been studied quite extensively since 
1960. For example, the commutator subgroup 
G' of G(K) is tfinitely generated if and only if 
G' is free, and then the rank of G' is twice the 
genus of K [7]. S? — V is a fiber bundle over S! 
with the fiber an orientable surface if and only 
if G' is finitely generated (J. Stallings, 1962). 
Such a knot is called a fibered knot. G(K) is 
residually finite for any knot K (the result 
requires Thurston's hyperbolization theorem), 
and thus the *word problem for G(K) is solv- 
able. G(K) has no element of finite order [2]. 
The knot type whose group has a nontrivial 
tcenter is completely characterized. Also, many 
problems on 3-manifolds can be formulated in 
terms of knot groups. For example, the follow- 
ing conjecture, related to *Poincaré's conjec- 
ture, still remains unsolved. Property P conjec- 
ture: For a knotted knot K in S°, let N, be the 
smallest normal subgroup of G(K) containing 
ml’. If G(K)/N, is a trivial group, then q =0. 








C. Alexander Invariants and the Seifert Matrix 


Let (x4, ..., Xy; 4, ..., r4 1) be a presentation of 
the knot group G of a knot K, F be the free 
group generated by x,, x;,..., Xp, and op F^ 
G, y:G—H —G/G' be canonical homomor- 
phisms. Then o and y can be extended to 
homomorphisms between tgroup algebras 
with integral coefficients o: Z[F]-Z[G] and 
V:Z[G]Z[H)]. For any word r in F, we have 
the free derivative àr/Ox; for i— 1, ..., n satisfy- 
ing the following: 


Ox;/0x; =ô ôx; '/éx;,= —x; !, 
O(rs)/Ox; = 6r/Ox; +r: (0s/Ox;) 


ijs 
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(Fox [6]). Now we have the Alexander matrix 
A — (aj) (i—1,...,n— l;j L, ..., n) of the knot 
K defined by aj; — V o o(8r;/0x;.. The ideal E, of 
Z[H] generated by all (n — k) x (n — k) minors 
of A is called the kth elementary ideal of A. 
OCE,CE,C...CE,=Z[H], and in particular, 
E, is a principal ideal, called the Alexander 
ideal of K. Since H z Z, any element of Z[H] 
is a finite sum of elements of the form mt" 

(m, ne Z). The generator A(t) of the Alexander 
(principal) ideal is called the Alexander poly- 
nomial of the knot K. The elementary ideals of 
A, including A(t), are invariants of the knot 
type of K. A(t) is uniquely determined by K up 
to the factor +t* and has the following prop- 
erties: (i) A(1)2 +1; (ii) A(1/t) 2 t^A(t). Con- 
versely, any polynomial with integral coeffi- 
cients satisfying (1) and (ii) is the Alexander 
polynomial of some knot. For example, the 
knot group G of the trefoil knot (or clover- 
leaf knot) K (Fig. 3) is presented by G —(a, b; 
aba(bab) 11 The Alexander matrix of K is 
(1—t-- £2, —1+t— t’), and the Alexander 
polynomial of K is A(t) 2 1 — t - t?. 


bab 


Fig. 3 


For a trivial knot, the Alexander polynomial 
is 1, but the converse is false. The first example 
of a nontrivial knot with A(r) 2 1 was given by 
Seifert (Fig. 4) [1]. The Kinoshita-Terasaka 
knot (Fig. 5) is another knot with this property 
(Osaka J. Math. 9 (1957)). 


K, K: 


Fig. 4 


ox 
es 


The degree of the Alexander polynomial 
never exceeds twice the genus of the knot, and 
the equality holds for any alternating knots 
and for fibered knots. For fibered knots, A(0) 
= +1 and this is also sufficient for alternating 
knots to be fibered knots. 

The Seifert surface F of K is used to define 
an invariant of K as follows. Since H,(F; Z) is 
a free Abelian group of rank 2h, we can choose 
2h oriented simple closed curves a,,4;, ..., 45 
on F, whose homology classes form a free 


874 


basis for HUE Z). Consider F x [—1,1] in $°, 
and let v; denote the "linking number v(a; x 
{—1},a; x {1}). Then the 2h x 2h integral matrix 
V — (v; is called the Seifert matrix of K. It has 
been shown that the determinant of the matrix 
'V—tV is the Alexander polynomial A(t) of K, 
and the 'signature of the symmetric matrix ‘V 
+ V is a useful knot invariant o(K), called the 
signature of K (H. F. Trotter, 1962; [8]). Fora 
trefoil knot K (Fig. 3), c(K)— —2. Since o(K) 
=0 for an amphicheiral knot, this shows that 
a trefoil knot is not amphicheiral. 


D. Link Theory 


A link L in R? is a disjoint union of knots K; 
in R? and is denoted by L=K,UK,U...U 
K,. Two links L= K(UK,U...UK, and L'— 
Ki U K5U ... UK; are said to be equivalent 
if r=s and each K; is mapped onto Kj, i= 
1,2,...,r, by a homeomorphism of R?. Further, 
we can define the isotopy type of a link, as we 
did in Section A. In link theory, however, we 
mostly consider oriented links, i.e., each com- 
ponent is given an orientation. Therefore, in 
the following, we consider only oriented links, 
and the link type means an isotopy type which 
preserves the orientation of each component. 
Let L=K,UK,U...UK, bea link in R?c 
S?. The link group G(L) of L is defined as 
1, (S? — L). Using a regular projection, a Wir- 
tinger presentation of G(L) —(x,, X3, E 
,, 4) can be obtained in exactly the 
same manner as was described in Section B. 
The Alexander matrix A of L is also defined 
by means of the free derivative from a presen- 
tation of G(L). Since G/G' is a free Abelian 
group of rank r, the Alexander matrix A isa 
matrix over an integral Laurent polynomial 
ring with r variables t,,t,,...,t,, and hence, 
the first elementary ideal E,(A) may not be 
principal. A generator of the smallest prin- 
cipal ideal containing the generators of E,(A) 
is called the Alexander polynomial or the link 
polynomial of L. It is an integral polynomial 
A(t;, t5, ..., t,) which is uniquely determined 
up to the factor -- t?1122 ... t^. The link poly- 
nomial is an invariant of the link type but is 
not an invariant of the link group. There are 
some necessary conditions for a polynomial to 
be a link polynomial, but they are not suffi- 
cient to characterize a link polynomial. By 
putting t, —t5— ...—t,, we obtain a poly- 
nomial of one variable A(t). (1 — t)À(t) is called 
the reduced link polynomial. Air) is divisible by 
(1 — t? and V(t) 2 A(D/(1 — t? is called a 
Hosokawa polynomial of L. V(t) is symmetric. 
1, (S? — L) 0 if and only if L splits, i.e., 
there is a 2-sphere S* in S? which is disjoint 
from L, but each component of S? — S? con- 


Fis Fas. 
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tains at least one component of L. Given two 
links, we can define (not uniquely) a product 
link in the same manner as was described in 
Section A, and any link type is decomposed 
uniquely into a finite number of prime links. 
The complement of a prime link cannot be 
determined from its group. 

Given a link L of r components, we can find 
an orientable surface F of genus h, say, whose 
boundary is L, and thus we can define a Seifert 
matrix V for L. (— Section C. The only dif- 
ference is that H,(F; Z) is free Abelian of rank 
2h-- r— 1.) The determinant of 'V —tV is, then, 
the reduced link polynomial. The signature of 
‘V+ V is called the signature of L [8]. 

For links in $?, we can define a weaker 
equivalence, called the homotopy type. Two 
links are said to be homotopic (or of the same 
homotopy type) if one link is deformed con- 
tinuously onto another under the condition 
that during the deformation each component 
of the link is allowed to cross itself, but no two 
components are allowed to intersect. A typical 
homotopy invariant of a link L is the linking 
number 4j; — Lk(K;, K;). The absolute value of 
A, is determined by G/G;, where G; i7 1,2, ..., 
denotes the ower central series of G(L). Using 
the group G/G,, k 2 1, J. Milnor defines a 
numerical link invariant Gil: AL 1 €j1.Ja. 

.. Jy <r, called Milnor's invariant (Ann. Math., 
(2) 59 (1954)). In particular, u(ij) — 4j, and j 
completely determines the homotopy type of 
a link with at most three components. 


E. Representations and Covering Spaces 


Let K be a knot (or a link) in S?. A ftransitive 
homomorphism q from the group G(K) into 
S,, the *symmetric group of degree n, is called. 
a representation of G(K) of degree n. Two rep- 
resentations @, and @, are said to be equiva- 
lent if there is an tinner automorphism p of S, 
such that p, — pq. 

Since G/G' = Z, q(G)/o(G) must be cyclic. 
Conversely, any finite group I with I/I" cyclic 
is a homomorphic image of some knot in S? 
(F. González-Acüna, 1975). If T =S, or D;,., 
(the tdihedral group of order 2(2n + 1)), T/T” is 
finite cyclic and some conditions are known 
for the knot group to be mapped onto I [5]. 
Given a representation o of G(K) of degree n, 
we obtain the n-fold tcovering space C, of S? 
— K. Equivalent representations correspond to 
homeomorphic covering spaces. C, can be 
completed to a covering space X, of S? with 
branch points on K. X, is an orientable closed 
3-manifold and is called the n-fold branched 
covering space of S? along K. Let A, be X,,— 
C,. Then A, is a link in £,. Any topological 
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invariant of (£, A,) is an invariant of K. For 
example, the homology groups of X, are im- 
portant invariants of K, and by using these 
invariants we can show that the two knots K, 
(Fig. 4) and K, (Fig. 5) are distinct (R. Riley, 
1971). If H, (2; Z) is a finite group, we can 
compute the linking number between two 
components of A, (R. Hartley and Murasugi, 
Canad. J. Math., 29 (1977)). The set of these 
rational numbers is called the covering link- 
age invariants and is used to classify all knot 
types with ten double points (K. Perko, Jr. 
(1974)). 

Since G/G' =Z, every knot group has a 
unique representation on the cyclic group of 
order g. Let X, — A, be the corresponding g- 
fold covering space of S? — K, called the cyclic 
covering space. Then the 1-dimensional ho- 
mology group H,(Z,— A,) is isomorphic to the 
direct sum of H,(2,) and Z, and the order 0 of 
H, (2j) is given by 0 — IA Atoll, where œ 
denotes the primitive gth root of unity. If 
H, (2) is infinite, then the tBetti number of 2, 
is the number of roots of A(t) 20, properly 
counted, that are gth roots of unity. 

Since every orientable closed 3-manifold is 
a branched covering space of S? along a knot 
or link (Alexander (1928)), some attempt has 
been made to identify *simply connected 3- 
manifolds as branched covering spaces of S?. 
However, all simply connected 3-manifolds 
obtained as coverings thus far are known to 
be Si, 

In 1979 W. Thurston proved that any g-fold 
(g 22) branched cyclic covering space of S? (or 
of a homotopy 3-sphere) along a nontrivial 
knot is never simply connected. This result set- 
tles affirmatively the Smith conjecture: The fixed 
point set of any periodic self-diffeomorphism 
of S? is unknotted if it is a circle. 

The proof of the conjecture is based on the 
study of hyperbolic structures on 3-manifolds 
initiated by W. Thurston (about 1977) and on 
the equivariant loop theorem of Meeks and 
Yau. A hyperbolic manifold is a Riemannian 
manifold of constant negative curvature which 
is complete and of finite volume. A knot (or 
a link) K whose complement S? — K admits 
a hyperbolic-manifold structure is called a 
hyperbolic knot (or link). Two hyperbolic knots 
with isomorphic knot groups have homeo- 
morphic complementary domains. A faithful 
representation y of a knot group G(K) into 
*tPSL(2, C) is called an excellent parabolic rep- 
resentation if (1) triyj(m) 2 +2 for a meridian 
m and (2) w(G(K)) is a non-Abelian discrete 
subgroup of PSL(2, C). According to Thur- 
ston, a knot K is hyperbolic if and only if G(K) 
has an excellent parabolic representation. 

The theory of hyperbolic knots has had a 
most profound influence on knot theory. 
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F. Braids 


The theory of braids has been used as a tool to 
investigate the theory of knots. An illustration 
of a braid of fifth order is given in Fig. 6. In 
general, consider a cube D? written as D? x 
[0, 1], where D? is a disk {(x, y)JO€x, y« 1j 


n+1°2 
define 2n points A; — P; x {1} and B;— P; x {0}, 
i— 1,2, ...,n. Then join A; with B,,, i—1,2, 
...,n, Where (k,, k2, ..., kp) is a permutation 
of (1,2, ..., n), by means of mutually disjoint 
polygonal curves l; in D? in such a way that 
(1) nop? = A;U B,, and (2) for each t, 0<t< 
1, a disk D,= D? x {t} intersects l, at exactly 
one point. Such a configuration is called a 
braid of nth order. 

Two braids 3, and 3, are said to be isotopic, 
31 © 32, if there is a homeomorphism of D? 
onto itself mapping 3, onto 3, such that it is 
an identity on 0D?. The isotopy relation is an 
equivalence relation among braids. 

Suppose that we are given a braid such as 
Fig. 6. If we connect A; and B, by polygonal 
curves l; on the boundary of D? (Fig. 7), we 
obtain a closed braid. In general, a closed braid 
is a link. Conversely, any link is equivalent to 
a closed braid. 


1 
in R?. Let n- 5) be points on D? and 





"A vA rA 3A 4 “As 
"LOT a 
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Fig. 6 


Fig. 7 


The product 3,3; of two braids 3, and 3, is 
defined as the braid obtained by connecting 
3, and 3, as shown in Fig. 8. If 3, «3; and 
32 935, then 3,3; %3132. Hence we can define 
the product [3,] [32] = [3132] of equivalence 
classes of braids of nth order. The totality of 
[3] forms a group 3, called the braid group of 
order n. 3, 1s generated by the equivalence 
classes o; (i— 1,2, ...,n— 1) of braids as shown 
in Fig. 9(a); note o; ! shown in Fig. 9(b). The 
"fundamental relations between (6j are S,,=1, 
where Sj — 0; ! o; | 8,6; or o; ! 0; | 0j | 0;66; 
according as |j — i| z 2 or |i —j|— f. 

Suppose that we are given braids 3,, 3; 
represented as products of o, Then 3, and 3, 
are equivalent if and only if the two products 
represent the same element in 3,. The problem 
of deciding whether 3, and 3; are equivalent 
reduces to the ‘word problem in 3,,. On the 
other hand, the problem of deciding whether 
or not two closed braids are equivalent reduces 





876 


aL 


In 


Fig. 8 
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i i+] 
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Fig. 9 


to the ttransformation problem in 3, (— 161 
Free Groups). 

The theory of braids was initiated and de- 
veloped by E. Artin, who also gave a solution 
to the word problem in 3, (Abh. Math. Sem. 
Univ. Hamburg (1926)). The transformation 
problem was solved by F. A. Garside (Quart. J. 
Math., 78 (1969)). 

By taking an arbitrary surface F instead of a 
disk D?, we have a braid group over F, de- 
noted by B,(F). The structure of the group 
B,(F) has been studied since 1962. A presenta- 
tion of B,(F) for any 2-manifold is known. 

In particular, B,(F) has no elements of finite 
order except when F is a 2-sphere or a projec- 
tive plane. Following the discovery of a deep 
connection between B,(F) and the mapping 
class group, the study of B,(F) has become 
quite important [9]. 


G. Higher-Dimensional Knots 


The problem of knots, that is, the problem of 
placement of simple closed curves in R?, is 
extended to the problem of placement of q- 
dimensional spheres in p-space R or in a p- 
dimensional sphere S?. 

In this section, the explanation is restricted 
to the case of tcombinatorial manifolds and 
"PI homeomorphisms between them. A simi- 
lar theory can be developed for other cate- 
gories (— 114 Differential Topology). 

Let D" be the n-dimensional disk ((x,, X3, 
eeo Xp ER” | |x; & Ll, i— 1,2, ..., n]. We identify 
D" with the subspace D" x {0} of Di If S4 
is a subcomplex of S”, then (S?, S°) (p q) is 
called a (p, q) sphere pair or (p, q)-knot. If a q- 
dimensional cell B? is a subcomplex of B" and 
if B4) B" = B*, the boundary of B’, then (B?, B9) 
is called a (p, q) ball pair or (p, q)-ball knot. 
Two pairs X =(X?, X?) and Y Z(Y7?, Y*) are 
said to be homeomorphic if there is a homeo- 
morphism h: X? Y? such that A(X?) - Y*. 
We classify (p, q) ball pairs and (p, q) sphere 
pairs into equivalence classes via homeo- 
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morphisms. A (p,q) ball pair (B?, B?) is said 

to be unknotted (or flat) if it is homeomorphic 
to the standard pair I^? —(D?, D"). A (p,q) 
sphere pair (SP, S°) is said to be unknotted if it 
is homeomorphic to (0D**!, QD?*!), E. C. 
Zeeman showed that if p —42 3, then the (p,q) 
ball pair and the (p, q) sphere pair are both 
unknotted (Ann. Math., (2) 78 (1963)). Similar 
results have been obtained by Stallings (Ann. 
Math., (2) 77 (1963)). Because of these results, 
the only interesting case is p —q 4- 2. We as- 
sume further that S? is locally flat in S**?. Let 
Ki be a q-knot, i.e., a g-dimensional sphere, 

in $?*?, and let G(K?) — n,(84*? — K%), the 
group of Ki Since G(K9)/G' = Z, we can define 
the Alexander matrix, the Alexander ideal, 
and the Alexander polynomial A(t) of K^, 

as we did in Section C. An alternative descrip- 
tion of these invariants expressed as invar- 
iants of homology groups of the infinite cyclic 
covering space of S**? — K* can be found 

in J. Levine, Ann. Math., (2) 84 (1966). There 
are some discrepancies between 1-knot theory 
and q-knot theory, q 2:2: (1) It is not known 
whether z,(S^ — K?) z Z implies that (S4, K?) is 
unknotted (unknotting conjecture). (2) z,(S*^? 
— K^) can have an element of finite order. 

(3) A(1)2 £1, but A(t) may not be symmetric. 
(4) 1,(S* — K?) may not be trivial. 

Although the characterization problem of 
the 1-knot group z,(S*— K) has not yet been 
solved, the same problem for the q-knot group 
has been completely settled by M. A. Kervaire 
(1963) as follows. Let G be a finitely presented 
group. Then G is the group of some knot ki 
in $42, 4 >3, if and only if (i) G/G' =Z, (ii) 
H,(G; Z) «0, and (iii) there is an element x in 
G whose set of conjugates generates G. These 
conditions are satisfied by any q-knot group, 
qÈ 1, but they are not sufficient for 1-knot 
groups. For q—2, conditions (1)—(iil) are suffi- 
cient for G to be a 2-knot group in a thomo- 
topy 4-sphere. 

A q-knot K‘ in $?*? (2 0D**?) is called null- 
cobordant if K? is the boundary of a locally flat 
embedded disk in D**?. The concept of knot 
cobordism was introduced by Fox and Milnor 
in 1957 (Bull. Amer. Math. Soc., 63 (1957)) for 
1-knots and was readily extended to q-knots 
in $4*?. Knot cobordism is a weaker equiv- 
alence than isotopy, but the set of cobordism 
classes forms an Abelian group C, under the 
connected sum (joining the knotted spheres 
by a tube) or the product for !-knots. Any 
1-knot K that represents zero element in C, 
is called a slice knot. If K is a slice knot, then 
A(t) must be of the form f(t)f(t !) for some 
integer polynomial (Fox and Milnor (1957)) 
and c(K)—0 [8]. For all qz 1, Cj, , is not 
finitely generated, while C;, —0 (Kervaire 
(1964)). 
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H. Miscellaneous Results 


Results on wild knots can be found scattered 
throughout the mathematical literature since 
1945. Many strange things can happen with 
wild knots or wild arcs. For example, the 
simple closed curve K (Fig. 10) bounds a disk, 
but x, (S? — K} is not Abelian. 

The placement problem of graphs in R? has 
also been treated, for instance by Kinoshita 
(Pacific J. Math., 47 (1975)). 


Fig. 10 
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Leopold Kronecker (December 7, 1823- 
December 29, 1891) was born in Liegnitz near 
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Breslau in Germany (now Legnica in Poland). 
He entered the University of Berlin in 1841 
but studied at various universities throughout 
the country, finally studying under E. E. Kum- 
mer at Breslau. In 1845, he received his doctor- 
ate with a thesis on units of falgebraic num- 
ber fields. Then he succeeded to his uncle's 
business in the management of banks and 
farms, which kept him away from publishing 
mathematical papers for eight years. In 1853, 
he published a paper on algebraic solution 

of equations, containing the assertions that 
Abelian extensions of the rational number field 
are contained in cyclotomic fields and that 
Abelian extensions of imaginary quadratic 
fields can be obtained using fcomplex multi- 
plication. The latter is called *Kronecker's 
dream in his youth." It remained a conjecture 
until it was solved by means of !class field 
theory. He gave lectures at the University of 
Berlin, first in his capacity of academician, 
then as a professor, succeeding his teacher 
Kummer in 1883. His statement that mathe- 
matics as a whole should be based solely 

on the intuition of natural numbers (— 156 
Foundations of Mathematics) often brought 
on disputes with his colleague K. tWeierstrass. 
His rejection of the bold reasoning of tset 
theory produced anxieties for G. Cantor. His 
famous statement, “Natural numbers were 
made by God; the rest is the work of man,” 
can be put in contrast with the liberal state- 
ments of Cantor and R. Dedekind. Kronec- 
ker's works and lectures ranged widely over 
the theory of numbers, algebra, and analysis. 
His contribution to the theory of elliptic func- 
tions and his ‘limit formula for zeta functions 
are well known. He also did pioneering work 
in topology. 
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A. General Remarks 


K-theory was introduced by M. F. Atiyah and 
F. Hirzebruch after the original idea was sug- 

gested by A. Grothendieck. The *Bott period- 
icity theorem is essential for the development 

of the theory. There are important applica- 


878 


tions of K-theory to tdifferential and alge- 
braic topology, such as the Riemann-Roch 
theorems for differentiable manifolds (Atiyah 
and Hirzebruch [6, 7]), the solution of the 
vector field problem on spheres (J. F. Adams 
[1]), applications to immersion and embed- 
ding problems (— 114 Differential Topology), 
a simple proof of the solution of the Hopf in- 
variant problem, and the determination of J- 
images in stable homotopy groups of spheres 
(— 202 Homotopy Theory). A review of alge- 
braic K-theory is also included in this article. 


B. Construction of K (X) 


Let the basic field A be the real number field 
R, the complex number field C, or the quater- 
nion field H Let A’ be the tcenter of A, and X 
be a compact Hausdorff space. Then &,(X) 
denotes the set of all isomorphism classes of A- 
tvector bundles over X and is a commutative 
semigroup under the *Whitney sum é @ y. Let 
FA(X) be the free Abelian group generated by 
the set £ (X), and let Q,(X) be the subgroup 
of FA(X) generated by the elements of the form 
č ®ny—č-—n. Then the Abelian group KA(X) 
is defined as the quotient group K,(X)= 
FA(X)/QACX). KACX) ts called the Grothen- 
dieck group or K-group, and such a construc- 
tion is called the Grothendieck construction. 
From this we obtain the canonical mapping 
0:6 (X) c FA(X)o K ACX), which is a homo- 
morphism of the semigroups. Moreover, the 
pair (KA(X), 0) is universal in the following 
sense: Given an Abelian group G and a semi- 
group homomorphism g: &,(X)->G, there 
exists a unique group homomorphism h: 
K4(X)—9G such that g —- ho0. We call h the 
extension of g. 

Let f: X >Y bea continuous mapping from 
X into another compact Hausdorff space Y. 
Then for ne &4(Y), the *induced bundle f *(g)e 
&,(Y) is defined. Since the mapping f*:&,(Y) 
—&,(X) is a semigroup homomorphism, it 
induces a homomorphism K,(f):K,(Y)> 
K(X), which is the extension of 00 f*, so 
that K,(f)o@=@0o0f*. Usually, K,( f) is also 
denoted by f*. Thus K, is a tcontravariant 
functor. According to whether A — R, C, or H, 
the notations KO, K, or KSP are often used 
for Ky. 

If X 2 1xoj, the semigroup homomorphism 
é (Xo ə £5 dim, £e Z induces an isomorphism 
K(x = Z. If X is a finite CW complex with 
base point xo, then the reduced group K,(X) 
is defined to be the kernel of i*, where i: xo X 
is the inclusion. Then we have the canonical 
splitting KA(X)z Z 6 K(X), and K, isa 
functor defined on the category of pointed 
finite CW complexes. 
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Two isomorphism classes ¢ and ug of é (X) 
are said to be stably equivalent if there exist 
trivial bundles 0, and 6, such that é @ 0, = 
n ® 0,. An equivalence class with respect to 
this relation is called a stable vector bundle. If 
X is connected, then the set of all stable A- 
vector bundles can be identified with K,(X). 

When A =R or C, the ttensor product of 
vector bundles induces a ring structure on 
K(X), and f * becomes a ring homomorphism. 

The complexification of real vector bundles 
E+E Op C=i(E) is a semigroup homomor- 
phism é4(X)— é6c(X) and induces a ring 
homomorphism i: KO(X) K(X) such that 
io0 —0oi. If če u(x), then č can be viewed 
as a complex vector bundle under the scalar 
restriction of basic field, which we shall denote 
by p(¢)e &(X). The mapping p:é4(X)> 
&c(X) induces a homomorphism p: KSP(X)—> 
K(X). Similarly, the scalar restriction from C 
to R induces a homomorphism r: K(X)—> 
KO(X). All these are tnatural transformations. 

Let € be a complex vector bundle. We can 
formally write the Chern class (— 56 Charac- 
teristic Classes) c(£) of £ as 


BEIEN 


Then the Chern character ch(é)e H*(X; Q) is 
defined by 


ch(c) 3 expx;, 


where Q is the field of rational numbers. The 
mapping ch:éc(X)— H*(X;Q) is extended to a 
ring homomorphism ch: K(X) H*(X; Q). We 
denote by the same notation ch the ring homo- 
morphisms choi: KO(X) H*(X; Q) and 
chop: KSP(X)5 H*(X;Q). These are natural 
transformations from the functor K, to the 
functor H*( ;Q). 


C. Cohomology Theory 


O,(n) denotes O(n), U (n), or Sp(n) according as 
the basic field A is R, C, or H. Let O, be the 
tinductive limit group with respect to the usual 
inclusion O,(n) c O,(n+ 1). Provided with the 
weak topology, the group O4 becomes a tCW 
complex. The set of all equivalence classes of 
stable A-vector bundles corresponds bijec- 
tively to the set of principal O,-bundles. Let 
B, be the ‘classifying space for the group O,, 
X, Y be finite CW complexes with base points, 
[ X, Y] be the set of ali *homotopy classes of 
mappings from X to Y, and [ X, Y ], be the set 
of all thomotopy classes in the *category of 
pointed topological spaces. Then by the classi- 
fication theorem of fiber bundles (— 147 

Fiber Bundles), we have K,(X)=[X,Z x B,] 
and K,(X)=[X, Z x Bilo. The space B, has 
the structure of a weak "H space, so that the 
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induced group structure of the homotopy set 
LX, Z x B,] coincides with that of K,(X). If f 
is a continuous mapping, then the induced 
homomorphism f* of the homotopy set [ X, 
Z x B4] coincides with that of K A(X). 

For a finite CW pair (X, A), we put 
Kx"(X, A) - R (S"(X/A), n—0,1,2, ..., where 
X/A is the space obtained from X by collaps- 
ing A to a point that becomes the base point 
of X/A, and S" denotes the tn-fold reduced 
suspension. This gives rise to a cohomology 
theory (indexed by nonpositive integers) (— 
201 Homology Theory, 202 Homotopy 
Theory). 

The tensor product of vector bundles in- 
duces the following pairing, called the cross 
product: 


K4"(X, A) & KA" (Y, B) 
>K; (X x Y, X x BUAx Y). 
When A=R or C, the cup product 
Kx"(X)@ K,"(X)9 K(X), 
K(X) @ KANN, A) >K” (X, A) 


is defined as the composite of the cross prod- 
uct and the induced homomorphism A*, 
where A: X 2 X x X is the diagonal map- 
ping. The complexification i: KO "(X, A)> 
K "(X, A) preserves the cup product. The 
composite of 


ch: Kj" (X, A) H*(S"(X/A); Q) 
and the ‘suspension isomorphism 
H*(S'(X/Ay Q)> H*(X, A, Q) 

is denoted by 

ch: Ky" (X, A) > H*(X, A; Q). 


The homomorphism ch preserves the cup 
product when A=R or C. 


D. Bott Periodicity 
Let £4 be the tcanonical A-line bundle over 
the A-projective line. The elements 
gc 0(£c) - 1e R(5?) - K ?(x), 

mA 
Iu = 0(£g) — 1e KSP(S^) = KSP *(x), 
and 
dp 7 gp X ggí(cross product) 

eKO(S*)- KO *(x) 


are called Bott generators. 

The Bott periodicity theorem [10] is as 
follows: (1) K(S?), KSP(S*), and KO(S?) are 
infinite cyclic groups generated by gc, gy, and 
gr, respectively. Moreover, ch(gc) 7 07, ch(gy) 
— c^, and ch(gg) — 05, where c"e H"(S"; Z) is a 
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generator. (2) The cross products 

KN. A)G K ?(x) OK "*?(X, A), 

KSP "(X, A) G KSP *(x) BKO '"*^(X, A), 
KOU, A)@ KO 8(x) KO "*9(X, A) 


are isomorphisms. 
The isomorphisms 


K "(X, Ask "'?XX, A), 
KSP "(X, Az KO "**(X, A), 
and 

KO "(X, A)= KO "'9(X, A), 


defined by aa x g,, are called Bott isomor- 
phisms. They commute with f*, the tcobound- 
ary operator 0, and ch. Identifying K " with 
K^ "*9 KSP^" with KO "**, and KO™ with 
KO~“™*®) we get periodic cohomology theories 
K* —Y,,7, K and KO* = ae, KO", which 
are multiplicative cohomology theories, and ch 
is a multiplication-preserving homomorphism 
into H*( ; Q). The cohomology of a point 
Ka"(x) = RA(S") = n, (B4) = 7, (04) is given 
by Bott [11] (— Appendix A, Table 6.IV). 


E. Operations 


We assume that A is R or C. The exterior 
powers 4^ are basic operations in K,(X). For 
čeg, (X), the pth texterior power of č, A"(£)e 
&,(X), has the following properties: 49(£) — 1, 
4 (2) E and A°(E 61) 2 3,,,-, EO 4'(£). 
Let G be the multiplicative group consisting 
of the formal power series eK ,(X) {t} whose 
constant term is 1. The assignment £5 A,(£) — 
A. A*(£)t* gives rise to the homomorphism 

E (X)>G. Let 4,: KA(X) G be its extension. 
The operation 44: KA( X) K4(X) is defined 
by A(x) = Lo «4 4" (x)t*. 

An important series of operations w*, called 
the Adams operations, is derived from the 
exterior powers. Put 

dA(x) ! 
di TO)" KA(X) tj, 
and define y^: K (X)> KA(X) by dl 
Eoc Y (x)t*. When A=C, define d 1 as the 
extension of ££, where £ is the tconjugate 
complex vector bundle of £. The operation y* 
is a ring homomorphism preserving 1, and the 
relation y*ow'=y" holds. If £e £,(X) is a line 
bundle, then w*(0(6)) (BEI If xe KA(X) and 
ch(x) 2 X, ch,(x), where ch,(x)e H?"(X; Q), then 


ch(y*(x)) 3 k"ch, (x). 





Vx) —1 


The operations 4* and y* commute with the 
complexification i: KO(X) K(X). If Be: K(X) 
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—+ K(S? X) and By: KO(X)- KO(S? X) are Bott 
isomorphisms, then y*o fi. =kßco y" and 


Vo fg — K* Bro y* [1]. 


F. Thom-Gysin Isomorphisms 


Let é be a real oriented vector bundle of di- 
mension n over a finite CW complex X. A 
reduction of the structure group SO(n) of 

the bundle Z to its universal covering group 
Spin(n) is called a spin structure of €. The bun- 
dle č admits a spin structure if and only if 
w,(€)=0, where w;(£) denotes the second 
'Stiefel- Whitney class of é. The bundle č 
equipped with a spin structure is called a spin 
bundle. The bundle £ is called a c,-bundle 

or Spin^ bundle if there is given a cohomol- 
ogy class c,(£)e H?(X; Z) such that w,(£) 
c,(€) mod2. Let X* be the Thom complex 
of the vector bundle &. The group K*(X°) 
has the structure of a KX(X)-module. If we 
assume that 4 is a c,-bundle when A=C and 
a spin bundle when A=R, then there exists 

a canonical KX(X)-module isomorphism 

o KA(X)9 KA dim5(X5) such that cho(1) 

= (Cof (Z)exp(e, (2/2) *) when A 2 € and 
cho(1) 9 o'Co(£) ') when A=R [6]. Here, 

o H*(X;Q)— A*(X¢; Q) is the usual 'Thom- 
Gysin isomorphism, and .«/(&) is the f- 
characteristic class of the bundle ë defined as 
follows: Write the *Pontryagin class p(£) of € 
formally as p(£) 2 TI +x?); then the class 
AE) is given by 


AE) = [ [ 65/2) inh (x;/2)). 


If č is a complex vector bundle, then its first 
*Chern class c,(&) gives a c,-bundle structure 
to r(¢)€ 6g(X). In this case, the class 7 (£5) — 
«(£)exp(c,/2) is the Todd class of the com- 
plex vector bundle £. 


G. Riemann-Roch Theorems for Differentiable 
Manifolds 


Let M and N be connected closed differenti- 
able manifolds. A continuous mapping f: 
M-—N is called a spin mapping (spin map) if 
w1(M)- f *w,(N) and w;(M)- f *w;(N). If 
W1(M) — f *w,(N) and there is given a class 
c, € H?(M; Z) such that w,(M)— f *w,(N) 
c, (mod 2), f is called a c,-mapping. If we as- 
sume that f is a c,-mapping when A =C and 
a spin mapping when A=R, then there is a 
canonical homomorphism 


f: K^ (M) Ki dm "dm MN) 
such that 
MI: y) = x: fi) 
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and 
ch AYAN) = fi(ch(y) AM )exp(c,/2)) 


for ye KX(M) and xe KX(N). This is the 
Riemann-Roch theorem for differentiable mani- 
folds [6] (— 366 Riemann-Roch Theorems). 

In the second formula, the homomorphism 

f: H*(M; Q)> H*(N;Q)on the right-hand side 
is the usual tGysin homomorphism, and if 

A=R we set c; =0. The homomorphism f, de- 
pends only on the homotopy class of f and has 
the usual functorial properties 1,=1 and 


(fog)-fog. 


H. The Atiyah-Singer Index Theorem 


Let X be an n-dimensional compact differenti- 
able manifold of class C? without boundary. 
As we shall see later, any elliptic differential 
operator (or, more generally, any elliptic com- 
plex) d on X has analytic index ind,(d) and 
topological index ind,(d), the latter of which is 
deeply related to K-theory. The Atiyah-Singer 
index theorem asserts that ind, (d) — ind,(d) [8]. 
We shall describe the details of the definitions 
and the theorem. 

Let E and F be complex vector bundles of 
class C” over X with dim E =s and dim F =t 
(— 147 Fiber Bundles). Let F(E) and I (F) be 
the linear spaces over C consisting of C*-cross 
sections of E and F, respectively. A linear 
mapping d from T(E) to I (F) is called a dif- 
ferential operator of the kth order if d is locally 
expressed by some differential operator of the 
kth order. This means that if we choose a local 
coordinate neighborhood U of X and triviali- 
zations of E and F on U such that EJU =U x 
C! and F|U =U x C, then d is a differential 
operator of the kth order from C^(U, C5) to 
C^(U, C"). Thus d is locally expressed by the 
matrix form (X. , at? (x)D*) (i21, ...,5j— 
l,...,5), each component of which is a differen- 
tial operator (— 112 Differential Operators). 
Using this expression we define the symbol 
c (d) of d as follows: Let T*(X) be the cotan- 
gent bundle of X. Given any n.e T,*(X), put 
o(d)(,) = get, where n stands 
for gt ...9% for any multi-index a —(a,, ...,a,) 
and for a local coordinate expression 5, — 
(n1, ..., 2). We call o(d) the principal symbol of 
d. Now a differential operator d is called ellip- 
tic if for each n, 40 the linear mapping o(d)(r,) 
gives an isomorphism from E, onto F,. For 
the elliptic differential operator d, we have 
dim Kerd « oo and dim Cokerd « oo [5]. The 
analytic index ind, (d) is defined to be the inte- 
ger dim Kerd — dim Cokerd, and it has the 
characteristic property that ind,(d) is invariant 
under deformation of d. 

More generally, an elliptic complex 4 on X 
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and the analytic index ind,(&) can be defined 
as follows: Given a finite number of smooth 
complex vector bundles EI , on X and 
differential operators d;: (Ej) ^ U'(E;.,), we 
call £ — (Ej, dl , an elliptic complex on X 
if the following two conditions are satisfied: 

(i) di., 0d; =O; (ii) for any n,€ T*(X), n, £9, 


1 H o(dj)(ny 
the (principal) symbol sequence ^ E cy 


Ej., ,2 is exact. For an elliptic complex &, we 
have dim H'(&) 2 dim(Ker d;/Im d; ,) « oo [5]. 
The integer X(—1)' dim H'(4) is called the 
analytic index of &. An elliptic complex with 
the form 02 T(E)-5T(F)-50 is an elliptic 
operator. An important example of an ellip- 
tic complex arises from de Rham theory (— 
105 Differentiable Manifolds Q): Take E;— 
A'T*(X) and the exterior differentiations as 
differential operators. The elliptic complex 
thus obtained is the de Rham complex. 

The topological index ind,(&) of the elliptic 
complex & is introduced in the following way: 
By virtue of the exactness for 5, 40, the sym- 


aldi) (nx) ; 
bol sequence — E, — SEQ determines 


a definite element [a(d)] of K(X?), where A" 
is the *Thom complex associated with the 
cotangent bundle of X. Embed X in some 
Euclidean space R“. Then the mapping j: 
T*(X)—> T*(R") x R?" canonically induces the 
homomorphism j: K(X*) 2 K((R^y)z K(S2%) = 
Z, and j, is obtained as in Section G from j by 
using a canonical complex vector bundle struc- 
ture of T*(X) in T*(R"). We set ind,(&4) — 
jiLo(€)] and call this the topological index of 
6. We have 


ind,(&) — ch([e(d)]) 7 (X) [X*], 


where ch([a(d)]) (c H*(X*; Q)) is the Chern 
character of o(d), 7 (X) (c H*(X; Q)) is the 
Todd class of T(X) & C, and [ X*] is the funda- 
mental cycle of X* [8]. 

The Atiyah-Singer index theorem, in general 
form, asserts that ind,(4) — ind,(4). For the de 
Rham complex E, it follows from the definition 
that ind,(E) is equal to the Euler characteristic 
of X. Let X be a compact complex manifold 
and W be a complex analytic vector bundle on 
X. Applying the theorem to the Dolbeault 


complex with value in W... A9 (W)-5 

A911 (W)5.... (2C 72 Complex Manifolds), we 
can conclude that Hirzebruch's formulation of 
Riemann-Roch theorem (— 366 Riemann- 
Roch Theorems B) holds not only for projec- 
tive algebraic manifolds but also for compact 
complex manifolds. Moreover, from the index 
theorem we can deduce the Hirzebruch index 
theorem (— 56 Characteristic Classes G) and 
various integrability theorems [8]. Any char- 
acteristic number which takes integral values 
on all oriented manifolds or weakly almost 
complex manifolds can be derived from the 
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Atiyah-Singer index theorem (R. Stong and A. 
Hattori). 

The Hirzebruch index theorem and the 
Atiyah-Singer index theorem can be extended 
to compact manifolds with boundary in the 
framework of Riemannian geometry, giving a 
formula analogous to the Gauss-Bonnet 
theorem for manifolds with boundary [9]. 

Another generalization of the theorem is 
its equivariant version. Let X be a compact 
Hausdorff space on which a compact Lie 
group G acts, i.e., a !*G-space. A real or com- 
plex vector bundle z: E X is called a G-vector 
bundle if E is a G-space, z is an tequivariant 
mapping and G-actions are fiberwise linear. 
The set of isomorphism classes of G-vector 
bundles over X is an additive semigroup with 
respect to Whitney sums, and the Grothen- 
dieck construction on this semigroup gives an 
Abelian group, which is denoted by K,(X) or 
KO,(X), depending on the scalar field of bun- 
des, called equivariant K-group of X. K;(X) 
and KO,(X) have commutative ring structures 
given by the tensor products. Now consider 
the case where X is a one-point space pt. 

Then a G-vector bundle over pt is a finite- 
dimensional complex or real linear represent- 
ation of G. Therefore K (pt) or KO,(pt) is the 
Grothendieck group of isomorphism classes of 
linear representations. This group also has a 
ring structure. It is called the representation 
ring of G and is denoted by R(G) or RO(G), 
respectively. When X = G/H, a homogeneous 
space of G by a closed subgroup H, the iso- 
morphism K ,(G/H) x R(H) (or KO,(G/H) x 
RO(H) holds true. 

Let X be a compact smooth G-manifold and 
E and F smooth complex G-vector bundles 
over X, Le, G acts smoothly on X, E, and F. A 
differential operator d: '(E)-T(F) is called 
equivariant when d commutes with induced G- 
actions on I (E) and T (F), respectively. Sup- 
pose that d: F (E)>T(F) is an equivariant 
elliptic differential operator. As T(E) and T(F) 
are G-modules, Kerd and Coker d are finite- 
dimensional G-modules, and the analytic index 
of d is defined by 


ind,(d) 2 Kerd — Coker de R(G). 


An argument parallel to that for the inequivar- 
iant case defines the topological index of d as 


ind,(d)e Kq(V) = Kg(pt)* R(G), 


where V is a suitable finite-dimensional com- 
plex G-module. Both analytic and topological 
indices are also generalized for equivariant 
elliptic G-complexes, and the equivariant 
Atiyah-Singer index theorem asserts that 


ind,(6) 2 ind,(&)e R(G) 
for any elliptic G-complex & [8]. 
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The equivariant Attyah-Singer index theo- 
rem has a close relation with the Lefschetz 
fixed-point theorem (of Atiyah-Bott [5]) and 
generalizes the Hirzebruch index theorem (also 
called the Hirzebruch signature theorem) to its 
equivariant form, the so-called G-signature 
theorem (— 153 Fixed-Point Theorems). 


I. J-Groups and the Adams Conjecture 


Let č and 5 be real vector bundles over a finite 
CW complex X. č and 5 are called fiber homo- 
topy equivalent if there exist fiber-preserving 
mappings f:S(£) S(n), g:S(y)  S(C) between 
associated sphere bundles, and fiber-preserving 
homotopies go f «1, fog 1. č and y are 
called stably fiber homotopy equivalent if there 
exist trivial bundles n and m such that č n 
and n Q m are fiber homotopy equivalent 

[24]. Stable fiber homotopy equivalence is an 
equivalence relation, and the set of all stable 
fiber homotopy equivalence classes of real 
vector bundles over X is denoted by J(X) and 
is called the J-group of X. J(X) is an Abelian 
group with addition induced by Whitney sums 
of vector bundles, and we can express J(X)= 
KO(X)/T(X) as a quotient group. The natural 
projection 


J:KO(X)2J(X)2 KO(X)/T(X) 


is called the J-homomorphism. When X — 

S*, the k-sphere, the J-homomorphism can 

be identified with the classical stable J- 
homomorphism J : x, .,(0)— n$, of Hopf and 
Whitehead (— 202 Homotopy Theory). 

Adams [2] proposed to compute J(X) by 
introducing two factor groups J'(X) and J"(X) 
of KO(X) in such a way that these are comput- 
able and that the epimorphisms J"(X)J(X)^ 
J'(X) hold whenever the following conjecture 
is true: 

Adams conjecture. Let k be any integer and 
ye KO(X). There exists a nonnegative integer 
e — e(k, y) such that J(k*(J^ — 1) y) - 0. 

Adams [2] proved this conjecture for line 
and plane bundles. In 1970, D. G. Quillen [25] 
proved this conjecture in its full generality. By 
intensive use of the Brauer induction theo- 
rem, Quillen reduced the problem to the case 
of bundles with finite structure groups and 
then to the case of line or plane bundles 
where Adams' proof [2] applies. Since then, 
many different proofs of the conjecture have 
appeared. 

Adams' theory on J(X) and Quillen's theo- 
rem (Adams conjecture) are utilized in deter- 
mining completely the J-images in stable 
homotopy groups of spheres (— 202 Homo- 
topy Theory). 
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J. Algebraic K-Theory 


Algebraic K-theory is a branch of algebra 
concerned mainly with a series of Abelian 
group valued functors K, of rings (and, more 
generally, of certain categories), which have 
certain features of generalized homology 
theory. It originated in the K-group construc- 
tion used in Grothendieck's work on the 
Riemann-Roch theorem. The theory was ini- 
tiated in the early sixties by H. Bass, who in- 
troduced K, and extensively studied Ky and 
K, in collaboration with other researchers 
[15-18]. Then K, was introduced by J. Milnor 
[26], and higher K-theories were constructed 
by Quillen and others from various view- 
points [22,1]. There is also a K-theory with 
respect to Hermitian structure [22, IIT]. 
Algebraic K-theory is intimately related to 
various other branches of mathematics, such 
as topology, algebraic geometry, and number 
theory. 

The Grothendieck group K,(A) of a ring A 
is the Abelian group generated by the set of 
isomorphism classes [P] of finitely generated 
tprojective A-modules subject to the relation 
[P P'J=[P]+[(P’] for every pair of projec- 
tive modules P, P’. If A ts finitely generated as 
a Z-algebra, then K,(A) is a finitely generated 
group. The assignment n n[A ] defines a 
homomorphism Z 5 K$(A) whose cokernel 
is the tprojective class group of A. If A is 
commutative, a similar construction for the 
category of rank 1 projective A-modules with 
respect to the tensor product ® leads to the 
Picard group Pic(A). We then have an epimor- 
phism Ko{A)— Pic(A) defined by PAP, 
where P is of rank r [16]. If X is a compact 
Hausdorff space, the topological K(X) is iso- 
morphic to the algebraic Ko(A), where A is 
the ring of complex-valued continuous func- 
tions on X. 

The Whitehead group K,(A) is defined as 
follows. Let GL(A) be the direct limit of the 
sequence 5 GL) GL, 4 (A)9.... , where 


X 0 
«D dh Let E,(A) be the subgroup of 


GL,(A) generated by all elementary matrices 
1 - ae; (ij, a€ A), where the ej are matrix 
units. The limit E(A) of E,(A) coincides with 
the commutator subgroup of GL(A). Now 
define K (4)= GL(A)/E(A) [15.18]. For A= 
Zn, the integral group algebra of a group z, 
the cokernel of the natural homomorphism 
+n—K,(A) is denoted by Wh(n). The tor- 
sion invariant of J. H. C. Whitehead is de- 
fined in Wh(z) [20]. If A is commutative, 

we put SK,(A)=SL(A)/E(A), where SL(A)= 
lim SL,(A). By the determinant homomor- 
phism, we have K,(A)=SK,(A)  U(A), where 
U(A) is the group of units of A. SK,(A)=0 
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when A is a field or a local ring A. A deeper 
result states that SK,(A)=0 for the ring of 
integers of an algebraic number field (Bass, 
Milnor, and Serre [17]). This and related re- 
sults have been applied to investigate SK, (Zz) 
for a finite group 7. 

We define K,(A)=H,(E(A); Z) (the Schur 
multiplier of E(A)) [21]. This yields a universal 
central extension of E(A), defined by 0 K,(A) 
—St(A) E(A)0, where St(A)=lim St, (A) 
and St,(A) (n> 3) is the Steinberg group gen- 
erated by x,(a) (e 4; j 1, ..., n, ij) subject 
to the relations (i) x;(a)x;(b)= x;(a +b), and 
(ii) the commutator (x;(a), x,,(b)) equals x;(ab) 
for j=k, iAl; and equals ! forjzk, iz. 

Let F be a field. A bimultiplicative mapping 
s: F* x FF5C (C an Abelian group) satisfying 
s(x, 1— x)= 1 (x Z0, 1) is called a (Steinberg) 
symbol on F. There exists a universal symbol 
F* x F* 5 K, (F) which, followed by homomor- 
phisms K;(F)— C, yields all C-valued symbols 
on F. Since certain Steinberg symbols, such as 
the *Hilbert symbol, are important in number 
theory, the group K,(F) of a global or local 
field F is intimately related to the arithmetic of 
F. If F is an algebraic number field and R its 
ring of integers, we have an exact sequence 0 
> K;(R)9 K;(F) 1L, (R/p)* +0 (the next-to- 
last term being the direct sum over all prime 
ideals p of R), and K,(R) is a finite group. 

Quillen [26, 27] defined higher algebraic K- 
theory based on the following topological 
construction. Let X be a CW complex given 
with a perfect normal subgroup N of z,(X). 
Attaching 2- and 3-cells suitably to X, he con- 
structed a complex X * and a mapping f: X^ 
X* so that z,(f) is epimorphic, Kerz,(f)= 
N, and fy: HX, f *L) = H,(X ~, L) for any 
local coefficients L over X *. (X *, f) is univer- 
sal for pairs (Y, g), g: X > Y, satisfying zx ONT 
=0, i.e., there exists a mapping g^ : X * Y 
unique up to homotopy such that g* o f ^ g. 

Quillen applied the above construction to 
X = BGL(A) and N = E(A), and defined K,(A) 
—n,(BGL(A)*) for nz 1 (the first definition). 
The universality implies that the inclusion 
E(A) c GL(A) induces a mapping BE(A)* > 
BGL(A)* which is the same as the universal 
covering mapping up to homotopy, which 
implies that K,(A) z H,(E(A); Z), i.e., Quillen's 
K, coincides with Milnor's. It is also known 
that K,(A)~ H4(St(Ay; Z). 

Quillen [22, I] defined higher algebraic K- 
theory for certain additive categories endowed 
with a class of short exact sequences, called 
exact categories. For an exact category M, he 
defined another category QM having the same 
objects as M but with morphisms changed. 
Making use of Segal's classifying spaces B of 
categories, he defined K,(M)=7;,,(BQM,0), 
where 0 is a zero object of M (the second de- 
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finition. When M = P}, the exact category 

of finitely generated projective A-modules, 
Kj(A) x K,(P,) [23], whereby Quillen intro- 
duced the notions of symmetric monoidal 
categories S and their localizations S! S and 
showed the homotopy equivalences BS ! S, 
K,(A) x BGL(A)' and QBQP, ~ BS !S,, 
where S,=Iso P, is the subcategory of P, 
whose morphisms are all isomorphisms of P,. 
The second definition is used to generalize 
many classical results in algebraic K-theory to 
higher K-theory [22, I]. 
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Joseph Louis Lagrange (January 25, 1736— 
April 10, 1813) was born in Turin, Italy, and 
became an instructor at the military school 
there in 1753. In 1766, he was invited to Prus- 
sia by King Frederick the Great (1712—1786) 
and moved to Berlin, where he filled the post 
(formerly occupied by L. Euler) of chairman of 
the mathematics department in the graduate 
division of the University of Berlin. In 1787, 
he moved to Paris, where he became a pro- 
fessor at the recently founded Ecole Normale 
Supérieure; he remained in France for the rest 
of his life. Lagrange was chiefly responsible 
for the establishment of the metric system. 

In 1795, he became the first president of the 
newly established Ecole Polytechnique. Dur- 
ing the later stages of the Napoleonic Era, he 
was made a count. 

Mathematically, his position is between 
Euler and Laplace, and he is considered one of 
the major mathematicians of the late 18th and 
early 19th centuries. His notable achievements 
in analysis—the initiation of the tcalculus of 
variations resulting from his research in the 
tisoperimetric problem, the founding of tana- 
lytical dynamics with the introduction of gen- 
eralized coordinates, and the solving of the 
equations now known as tLagrange’s equa- 
tions of motion—all have a strong algebraic 
flavor. Lagrange attempted to base calculus on 
*formal power series. He also conducted re- 
search on the solution of algebraic equations, 
and his work on the tpermutation group of the 
roots of algebraic equations can be regarded 
as a forerunner of the achievements of Abel 
and Galois. 
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Laplace, Pierre Simon 


Pierre Simon Laplace (March 23, 1749-March 
5, 1827) was born into a farming family in 
Beaumont en Auge in Normandy, France. His 
genius was recognized carly, and in 1767 he 
moved to Paris, where he enjoyed the favor of 
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J. d'Alembert. He became a professor at the 
Ecole Normale and the Ecole Polytechnique. 
During the Napoleonic Era, he was nominated 
for the post of Minister of the Interior; he 
later became a count. Following the fall of 
Napoleon, he became a marquis under Louis 
XVIII. He is often said to have lacked pro- 
fessional integrity, particularly in the matter of 
claiming priority, but on the other hand he 
sometimes showed independence of character 
and was generous to his pupils toward the 
close of his life. His achievements in mathe- 
matics, physics, and astronomy were so well 
recognized that he reached the highest social 
position. 

Laplace's achievements reached a peak in 
the field of analysis, which had been initiated 
in the 17th century and developed in the 18th 
century by Euler and the mathematicians of 
the Bernoulli family. He applied the methods 
of analysis to celestial mechanics, tpotential 
theory, and tprobability theory, obtaining 
remarkable results. 

Without the use of formulas and in a flow- 
ing and elegant literary style [3,5], he eluci- 
dated his various results. Concerning the 
origin of the solar system, in 1796 he published 
the nebular hypothesis—the so-called Kant- 
Laplace nebular hypothesis— which is famous 
as the predecessor of the theory of the evo- 
lution of the universe. 
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A. General Remarks 


The notion of the Laplace transform can be 
regarded as a generalization of the notion of 
Dirichlet series. L. Euler applied the Laplace 
transform to solve certain differential equa- 
tions (1737); later, independently, P. S. Laplace 
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applied it to solve differential and difference 
equations in his famous book Théorie analy- 
tique des probabilités, vol. 1 (1812). In this cen- 
tury, the Laplace transform has been used to 
justify Heaviside's *operational calculus, and 
the notion has become an important tool in 
applied mathematics. 

Let x(t) be a function of tbounded variation 
in the interval 0 € t € R for every positive R. If 


L(s)= L(a(t))(s)= |, e * da(t) 
0 


R 
= lim | e * dalt) 
0 


converges for some complex number sọ, then it 
converges for all s satisfying Re s> Re ze, We 
call Lisi the Laplace-Stieltjes transform of «(t). 
If a(t) — fo Q(u)du (where (u) is Lebesgue 
integrable in the interval 0 & t € R for every R), 
then we call 


oo 


L(s)= L(@)(s)= | e *e(t)dt 


[U 
the Laplace transform of o(t) (^ Appendix A, 
Table 12.1). 


B. Regions of Convergence 


Given a Laplace transform L(s) of x(t), there 
exists a real number (or +00) o, such that 
the maximal region of convergence of Lí(s) is 
the set of all s such that Res>o,. In extreme 
cases, when the integral never converges, we 
write c, — +œ, and when it converges every- 
where, we write c, — —oo. The number g, is 
called the abscissa of convergence of L(s), and 
the line Res— o, the axis of convergence of 
L(s). A formula to determine the abscissa of 
convergence in terms of a(t) is known. If k= 
lim sup,_,, (log }a(t)|)/t #0, then a=k; if k «0 
and «(t) does not converge as t tends to infin- 
ity, then c, — 0. If L(s) has a nonnegative ab- 
scissa o,, then c, — limsup,.,, (log |a(t)|)/t, and if 
c, « 0, then c, 2 limsup, (log |a(o0) — «(t)|)/t 
(E. Landau, S. Pincherle). Generally, 


arp 


log |a(t) — 
limsup 


tra t 





where [ ] is the Gauss symbol (K. Kurosu, 
T. Kojima, T. Ugaeri, K. Knopp). 

If fọ e~*|da(t)| < oo, then the Laplace- 
Stieltjes integral L(s) — fọ e~“ da(t) is said to be 
absolutely convergent. There exists a real num- 
ber o, such that L(s) converges absolutely for 
Res o, and does not converge absolutely for 
Res « o,. We call c, the abscissa of absolute 
convergence of L(s). There exists a real number 
c, such that L(s) converges uniformly for 
Resz o, -- € (for every £7 0) and fails to do so 
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for Resz o, —*. We call o, the abscissa of 
uniform convergence of (s). It is clear that 

6, € 0, € o,. Formulas determining o, and o, 
are analogous to the Dirichlet series formulas 
given by Kojima and M. Kunieda (— 121 
Dirichlet Series B) [1]. 


C. Regularity 


In the region of convergence Res o,, Lisi 

= fọ e * da(t) is tholomorphic, and we have 
L'?(s)— f e *(— t) da(t) in Res o,. If a(t) is 
monotonic, then the real point s =c, on the 
axis of convergence is a singular point of L(s). 
However, there may not be any singular point 
on the axis of convergence in general. The 
abscissa of regularity is the infimum of all o 
such that L(s) is holomorphic in Res» c. Also, 
L(o+it)=O(|t|) uniformly in o, +ô & o « oo 
for every positive ô as |t| oo. Any analytic 
function that is holomorphic at oo can be rep- 
resented by a Laplace transform. To be pre- 
cise, let f(s) = f(oo) + EZ o(a,n!)/s"*! ([s| c). 
Then the function g(t)= 227.9 a,t" is entire, 
f(s) — f(oo) - fF e " o(t)dt, and o, c. 


D. Inversion Formulas 


We say that a function «(t) (t z 0) that is of 
*bounded variation on any interval is normal- 
ized if a (0) = «(+ 0) 20 and a(t) (ott - 0) - 
a(t — 0))/2. A normalized function a(t) (t 0) is 
uniquely determined by its Laplace transform. 
Moreover, the inversion formula determining 

a(t) in terms of L(s) is known. That is, if L(s) 
= |g e * da(t), then for c > max(o,, 0), 


a(t), t>0, 
. 1 ctiT L(s) 
lim — e“ds=<o(+0)/2, t=0, 
Toe 2ni jJ. $ 

0, t<0. 


The integral on the left-hand side is often 
called the Bromwich integral. Suppose that «(t) 
= fọ p(u)du, L(s) converges absolutely on Res 
=c, and ¢(u) is of bounded variation in a 
neighborhood of u=t (t 2 0). Then we have 


c+iT 
lim xil L(s)e* ds 


(o(t--0)--o(t—0)/2, | t»0, 
Q(4- 0)/2, t=0, 
0, t<0. 


Il 


There is another form of the inversion for- 
mula by E. L. Post and D. V. Widder. Namely, 
set 


L,LfG)] =(— 05 9 (k/t) (k/k! 


for a C?-function f(x) (t 0, k ts a positive 
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integer). Then 


lim | dod ton] dust a0) 


0 


If a(t) => p(u) du, then for almost all t (70), 
lim L,[L(x)] 7 9) 


(— Appendix A, Table 12.I). 


E. Representation Theorem 


If f(x) is of class C” on (a, b) and (—1)*f(x) > 
0 for k «1, 2, ..., then f(x) is called completely 
monotonic in (a, b). Moreover, if f(x) is con- 
tinuous on [a,b], then f(x) is called corapletely 
monotonic on [a,b]. A necessary and suffi- 
cient condition for a function f(x) to be com- 
pletely monotonic in 0 € x « oo is that f(x) — 
Le "datt, where a(t) is bounded and in- 
creasing and the integral converges for 0< 

x « oo (Bernshtein’s theorem). A necessary and 
sufficient condition for f(x) to be represent- 
able in the form [y e * p(t)dt, where o(t)e 

L (0, oo) (p 1), is that (i) f(x) have derivatives 
of all orders in 0 <x < oo, (ii) f(x) vanish at 
infinity, and (iii) there exist a constant M such 
that IP |L, [f(X]? dt € M for k— 1, 2, 3, .... 
In the representation f(x) LZ e^" o(t)dt, a 
necessary and sufficient condition for ot) 

to be bounded in 0 « t « oo is that f(x) is 

of class C? in 0 « x « oo and there exists a 
constant M such that |L, ,[ f(x)]| « M and 
[xf(x)| <M for 0 « x « oo. In order that o(t)e 
L, (0x t « oo), it is necessary and sufficient that 
(i) f(x) be of class C” in 0 « x « oo, (ii) f(x) 
vanish at infinity, and (iii) (5 | L, Lf) ]] dt < 
oo and 


im | | IL, Ef 09] — Li Lf(x)]] dt 20 


ko 0 


(Widder). 


F. Operations on Laplace Transforms 


Let the Laplace transform of f(x) be 2 f(t))(s) 
= [o e" "f(t)dt. It is important in operational 
calculus to know the formula for the Laplace 
transform of of, where ¢ is an operation. We 
mention here some important formulas: 


1 b 
L(f(at —b))(s)= orl 2 Jf (0)/a) 


a 


(f(at —b) 20 if at «b, a>0, b z0y 


SI | fod Joy rto 


(Res max(0, o,)); 
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and 
L( f(t) (s) 5 s-Z(f )(s) — f (0), 


provided that .Z/( f (t))(s) converges at s (2 0) 
and f(t) f(0) as t +0. Generally speaking, if 
f( 9) ..., £ * (4-0) exist and the Laplace 
transforms 2 f *(t))(s) converge at s (2 0), 
then 


PASADA =E LON- f(-0)s* ! 
= f'C-0)5* 2 — ... — f*- P (-- 0). 


Furthermore, given functions f, and f}, if 
L f(s) and 7 f;)(s) are both convergent 
and if one of them converges absolutely or 
L fi * f;)(s) converges, then 


Lf, * f) Gf fr). 


G. Asymptotic Properties of the Laplace 
Transform 


If L(s)=[¢ e~“ da(t) (s>0), then for any c>0 
and any constant A, we have 


lim sup |s‘ L(s) — Al 


s-tÜ 


xlimsup|a«(t)t *T(c-- 1)— A|, 


too 


limsup [s^ L(s) Al 
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«limsup|a(t)t *T(c-- 1)— Al. 
t^ T0 

In particular, if we set c 20 and assume that 
a(t)— A as t— oo, then f(s) A as s> +0; and if 
we assume a(t)~ At^/T(c-- 1) as t2 oo (or t 
+0), then f(s)~ As * as s— +0 (or s— oc). 
These results are called Abelian theorems, 
because if we choose «(t) appropriately and 
change variables, we get *Abel's continuity 
theorem on power series: If 3a, converges to 
s, then Xa, x" tends to s as x tends to 1—0. 
More generally, we have the following theo- 
rem: If ($e *'da(t) — L(s)2 A as s +0, then 
Im, a (t) — A if and only if $(t)= fọ uda(u)= 
o(t) (t — oc). 


H. Bilateral Laplace Transform 


If x(t) is of bounded variation in every finite 
interval and if for some s 


R 0 
lim | e “da(t), lim | e “da(t) 
Ro 0 R'>æ J pg 

exist, we set L(s)= [?,, e “da(t). 

If L(s) converges at s, —6, +it,, 5 — 05 + it;, 
then L(s) converges in the vertical strip o, < 
Res<o). If L(s) converges in the strip o; < 
Res « o; and diverges for Res o; and Res< 
o, then each of the numbers o; and o; is called 
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an abscissa of convergence of L(s). If 





: log |a(t 
limsup E ( | #0, 
to 
1 t 
limin EO 1 


with k «1, then k and / are the abscissas of 
convergence. If x(t) is a normalized function of 
bounded variation in every finite interval and 
L(s) converges in the strip k < Res <l, then for 
all t 


1 ag 
lim xl EO) das 


c-iT S 


c>0, k<c<l, 


k<c<l. 


M ME a( —oo) 


a(t) — a(l), c «OQ, 


Suppose that a(t) — fọ o(u) du, L(s) converges 
absolutely on the line Res=c, and o(t) is of 
bounded variation in some neighborhood of 
t—tg. Then 

ctiT 
lim — L(s)e*'o ds 
T>% 2ni c-iT 


=(@(lo + 0)+ o(to —0))/2. 


There are other formulas analogous to those 
for the ordinary Laplace transform. 


I. Application to the Theory of Semigroups of 
Operators 


Applying a Laplace transform to a one- 
parameter semigroup of bounded operators 
(UO, a on a Banach space yields a natural 
correspondence between the infinitesimal gen- 
erator A of {U(t)} and its resolvent (s— A)’. 
Namely, given a continuous one-parameter 
semigroup {U(t)},,9 on a Banach space X 
(— 378 Semigroups of Operators and Evolu- 
tion Equations), the infinitesimal generator A 
is defined by Ax=lim,..,9471(U(h)—J)x, and 
A is a densely defined closed operator on X 
for which there exist some positive numbers 
M and f such that 


l(s—A)"l|M(Res—-f)"  (n21,2,...) 


is valid, provided that Res fl. The Laplace 
transform of {U (t)},>ọ is defined by 


Ke 


JZ(U (t))(s)x d e “U(t)xdt 


0 


(xe X, Res» fj), 


and the following identity is valid: 
L(U)(s)=(s— A) !. 


Furthermore, the inversion formula 
ctiT 
e*(s— A) ! xds 


c-iT 


U(t)x lim — 


Tra 27i 


(c> f, t>0) 
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holds for every x in the domain of A, and the 
convergence is compact-uniform in t>0. If A 
is bounded, then U(t) 2 X. t* A*/k! (norm 
convergent), and we can write U(t)=e'4. If A is 
not bounded, U (t) is still regarded as an ex- 
ponential function of tA, because we have U(t) 
=lim,..,.(1+tA/n)" or U(t) -lim,,,, e Ta, A, 
= A(1-- tA/n) ! being bounded (both strongly 
convergent). Hence the correspondence be- 
tween {U(t)} and (s— A)‘ is considered a 
generalization of the formula 


J'(e"(s)2(s—a)!' (Res f) 


In order for us to get the inversion formula 
for every xe X, (U(t)) must satisfy some ad- 
ditional conditions. Namely, if (U(t)) is a 
holomorphic semigroup, the inversion formula 
is valid for every xe X, t>0. 


J. Laplace Transform of Distributions 


The Laplace transform in Ri is defined by 


n 
x 


SU rm | e *G* f(x) dx =F (e **f )(n) 
when e" *f(x)eL, (—160 Fourier Transform). 
For n— 1, this definition is equivalent to the 
bilateral Laplace transform. For a given func- 
tion f(x), the set A of E for which e **f be- 
longs to L, is convex in R}. g) - 9€ +in)= 
(f (£c ig) is holomorphic in A iR; pro- 
vided that A is not empty. Differentiation 
under the integral sign gives 


Da: | e (— xyfo) dx, 


and the integral converges uniformly on K + 
iR}, K being a compact subset of A. Especi- 
ally, if e "*f(x) belongs to “(R”) for some é, 
then Y(f)(¢ + in) is defined on A, 1 iR7, where 
Ap= {Ele “f(x)e Y(R%)}, which is also con- 
vex, Z(f (é+ ig) belongs to Y(R") for every 
fixed Zei, Hence the definition of Laplace 
transform can be extended to some class of 
distributions. Namely, for every distribution 
Te 2'(R}), the set 


A,—(EeR'|e * Te SR!) 


is convex. Thus the Laplace transform of T for 
which A; is not empty is defined by 


Z(T)( + in) =F (eT) (1). 


This definition can be rewritten by the use of 
test functions as follows. For any o € (RI, 
(T), 95,7 C£ (€ T), 95,7 Ce "T, (9), 
=<T, [pre *6*o(ndn»,. Moreover, if Ar 
has an interior point, (T) has an explicit 
expression: take a č in Ay, and let O0<e< 
dist(£, @A,); then e *? e A(R"), e Te 

SF (RY, and L(T)(E+ inh=¢T,e 7) = 
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(e? T, e ri" X my where (xy = 
Jl |x|?. 

In the following, assume that Te Z'(R7) and 
Q= A, is not empty. Then h(f)= Z(T)(£ + in) 
is (i) holomorphic in Q + iR} and (ii) slowly 
increasing in y. More precisely, for any com- 
pact subset K of Q, there exist a positive num- 
ber Cy and an integer m, such that for every č 
tine K +iR the following inequality holds: 


IZ +in)| Col v Imp". 


Conversely, for a nonempty open convex set Q 
and a function h(¢ + in) defined on Q+iR? 
satisfying conditions (i) and (ii) above, ihe: cor- 
responding inverse Laplace transform of h is 


(qne) (CEQ), 


the right-hand side being independent of é in 
Q. T= V ~'(h) is a distribution and A, >Q. 
Moreover, Z(T)-— hin Q+ iR}. The following 
identities hold as in the case of functions: 


L =) 7] me 
(2 (Dc 


d a 
oT =(-5) T). 


e", (h(x) 


ZG). 


Roughly speaking, differentiability of T is 
reflected in the decreasing order of ¥(T) as 
In] co. 

Let I’ be a cone in R? and B, be the ball 
with center O and radius r in Ri The dual 
cone of T is denoted by I". Assume that the 
support of T is contained in B, - I", and Q 
=A, is not empty. Then (i) Q--I'c Q, i.e, Q 
extends in the direction of T, and (ii) for any 
& eQ and any e» 0, there exist some C and m 
such that for any č + ige ^ +T +iR}, we have 


| L(T)(E + in| & Ce" * P + |y". 


Conversely, if there exist a cone T, a domain 
Q, and a holomorphic function A(¢) on Q+ iR7 
satisfying (i) and (it) with A(C) replacing (T), 
then the support of 7"! (h) is contained in 
B, +T”. 

The convolution f » T of a function f in 
SR!) and a distribution T in Zei (R5) belong 
to F(R), and if A PAP Ø, then Ar SÁ,n 
A, and 


Lf *T)=L(f)L(T) in Å DÄ +iR*. 


More generally, convolution of two distri- 
butions T and S for which A NA,s=Q4 Ø 
holds is defined by 


T* S= PPT) ZS). 


The formula Z(T * S)= HT) L 
virtue of the definition. 
If O-- I c 0 for some cone TL, we have 


(S) is valid by 


supp(T'* S)csupp T c suppS +T”. 
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K. Moment Problem 


Given a real sequence (14,15, the problem of 
finding a function «(t) of bounded variation 
normalized on the interval [0, 1] satisfying 
I, = fot" da(t) (n —0, 1, ...), is called Haus- 
dorff's moment problem. Such a function, if it 
exists, is unique. This problem is a discrete 
analog of the Laplace inversion formula, 

for if we put t Ze " and n=s, we have u,— 
Le "d[—a(e^")]. 

Let Amn =(P" A"7" u, (m, n0, L, ...), 
where Ai, = Zait =I Hnr- (k =0, 1, ...). 
Then the problem has a solution «(t) if and 
only if there exists a positive number L such 
that 27.6|4, ,| € L (m—0, 1, ...). Moreover, 
a(t) is nondecreasing if and only if Ana 2:0. In 
this case { u„} is called completely monotonic. 
When da(t)= o(t)dt, ot) belongs to L?(0, 1) 
(p? UU and only if (m+ 1)?! 7 |An nP SL 
(m — 0, 1, ...), and g(t) is bounded if and only if 
(m+ 1)|À, ,| & L (m, n —0, 1, ...). 

Given a real sequence { 1,}3, the problem of 
finding a nondecreasing function x(t) on R 
satisfying 1, — f2 t" da(t) (n —0, 1, ...) is called 
Hamburger's moment problem, and the simi- 
lar problem obtained by replacing the condi- 
tion by u,— [5 t" da(t) is called Stieltjes's mo- 
ment problem. In these problems uniqueness 
is not valid. The following is a counterexam- 
ple for the Stieltjes problem given by Stieltjes 
himself: o, (t) — fg exp( —u'^)du and «,(t)= 
foexp(—u!*)[1—sin(u'^)] du (t 0) are non- 
decreasing in (0, oo), and both correspond 
to the sequence qi, — 4(4n + 3)! in Stieltjes's 
problem. Carleman showed that the condition 
È u517" — œ is sufficient for uniqueness in 
Hamburger's problem. Hamburger's problem 
has a solution if and only if the quadratic 
forms 27 i-o 1. ;6;6; (n — 0, 1,...) are all non- 
negative definite. Stieltjes's problem has a 
solution if and only if the quadratic forms 
Èi, j=0 Hi+jŠićj and È; j=0 Hie joi Sie; (n=0,1,...) 
are non-negative definite. If, in Hamburger's 
or Stieltjes's problem, we require only that the 
solution is a function of bounded variation 
instead of a nondecreasing one, then every 
sequence | 11,)5 has a solution (R. P. Boas, Jr.). 
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A. Definition and Classification 


A latin square over a set A = (a,,a5, ... , Ap}, or 
of order n, is an arrangement of elements of A 
in a square of side n such that each symbol in 
A occurs exactly once in each row and in each 
column. It is in standard form, or reduced, if 
the first row and the first column consist of the 
natural permutation. There are three different 
interpretations of latin squares. (1) If we iden- 
tify the set of indices with A and write z=xoy 
to indicate that the symbol at the row x and 
the column y is z, then we have a quasigroup, 
denoted by (4,0) (— Section C). (2) The set of 
the n? triplets xyz in the above relation is an 
terror-detecting code over the alphabet A, of 
word length 3 with the minimum Hamming 
distance 2. (3) The set of the n permutations P, 
moving the natural permutation to the ith row 
satisfies the condition that P; ' P, is a discord- 
ant permutation if iz j. 

Two latin squares L and L' are isotopic if 
under convention (2) there are three permu- 
tations p, q, r of A such that L—(xyz], L' = 
ix 'yz'), x' = p(x), y =q(y), z =r(z). An iso- 
topy class can contain more than one reduced 
latin square. The number of isotopy classes is 
denoted by L*. If we admit another equiva- 
lence due to rearrangement of the three com- 
ponents of the words in the code (2), then we 
have main classes; the number of main classes 
is denoted by L**. The total number of latin 
squares of order n is given by n!(n— 1)! L, for 
the number L, of reduced latin squares. 

It is known that L, =L,=L,=1, L4,—4, 

L, =56, Le =9408, L- = 16,942,080, Lg = 
535,281,401,856, Lo = 377, 597,570,964,258,816; 
{=L} =L} =1, L = L! =2, L* =22, L* = 
564, L = 1,676,257; LT = LF" = L¥* = 1, Li 
—L$*-2,Lf*—]12,L**—]147. 
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B. Orthogonality 


Suppose that we superimpose two latin . 
squares L =(A, o) and L' —(A, e), and if there 
is no coincidence for the pairs of the symbols, 
Le, ifxoy-x' oy and xey-x'ey' implies 

x —X', y x y, then L and L’ are mutually ortho- 
gonal, and the resulting square is an Euler 
square. Actually there exists a pair of mutually 
orthogonal latin squares of order n, provided 
that n#2 or 6; this was established by Bose, 
Shrikhande, and Parker only in 1959, contrary 
to a long-standing conjecture of Euler himself 
that such a set does not exist if nz 2 (mod 4). 
A set of mutually orthogonal latin squares 

of order n cannot contain more than n— 1 
squares, and a set of n — 1 squares is a com- 
plete set of mutually orthogonal latin squares. 
This is equivalent to an error-correcting code 
of n? words of word length n+ 1 with mini- 
mum distance n. Again this is equivalent to a 
finite projective plane of order n, i.e., each line 
containing exactly n+ 1 points. If n is a power 
of a prime, then there exists a projective plane 
of order n. In general, if p" is the minimum of 
the prime-power components of n, then there 
is a set of p" — 1 mutually orthogonal latin 
squares of order n. 


C. Quasigroups 


A quasigroup (A, o) is a set A bestowed with a 
binary operation o, satisfying both cancella- 
tion laws; xo y=xoz implies y=z; and yox= 
zox implies y=z. A finite quasigroup is a 
group if and only if it satisfies the associative 
law (xo y)Ooz=x0(yoz). A quasigroup is 
isotopic to a group if and only if it satisfies 
Brandt’s law: xo y=zow, xo y’'=zow, x’oy 
—z' ow implies x'o y' =z’ ow’. Two groups are 
isotopic if and only if they are isomorphic. 

A biunique mapping f of a finite group G is 
called a complete mapping if x 5x ! f(x) is also 
a biunique mapping. In this case if we denote 
by G' the quasigroup defined by xo y — xf(y), 
then the two latin squares corresponding to G 
and G' are mutually orthogonal. A group of 
odd order has a complete mapping, while a 
group of even order with a cyclic 2-Sylow 
subgroup does not have a complete mapping. 
A solvable group of even order with a non- 
cyclic 2-Sylow subgroup also has a complete 
mapping, and it is conjectured that the solv- 
ability condition here is redundant. 


D. Room Squares 


A quasigroup (A, 0) is idempotent if xox =x 
for any x and is commutative if xo y — yox for 
any x and y. Now a Room square of order 2n 
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is a distribution of the n(2n — 1) unordered 
pairs of elements from A = (a9,a,, ...,a5,.,] 
among the (2n — 1Y? cells of a square of side 
length 2n — 1, such that each element of A 
appears exactly once in each row and in each 
column. The remaining (2n — 1)(n — 1) cells 
should be empty. According to Bruck this is 
equivalent to a pair of two idempotent com- 
mutative quasigroups (4; o) and (A; e) defined 
over A — (a, ..., 45, ,], satisfying an ortho- 
gonality condition similar to that of latin 
squares, namely, xoy-x'oy, xey-x' ey 
implies x= x', yz y' or x 2 y, y=x’. A Room 
square of order 2n exists if nz 4. 


E. Number of Latin Squares 


Little is known about the total number of 
latin squares in general. The first k rows of a 
latin square of order n isa k x n latin rect- 
angle. The number of k x n latin rectangles is 
asymptotic to (n!)*exp( — k(k— 1)/2) for k < 
n!’ (Erdós, Kaplansky, and Yamamoto) and 
to fl fent —() —/(m — 1) — (9/05!) for k< 
n°? They suggest an analogous asymp- 
totic relation for the number of latin squares. 
The following congruences are known for 
the number L, of reduced latin squares; L,= 
1 (mod p), L, =0 (mod p) for n> 2p if pis a 
prime. 
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A. General Remarks 


Suppose that we are given in a Euclidean 
plane a closed *Jordan curve C of length L 
that bounds a 'region of area F. We denote by 
A the number of "lattice points on the curve C 
or in the region bounded by C. In many cases 
it can be verified that A — F + O(L) (O is the 
*Landau symbol). Specifically, if we take a 
circle whose center is the origin and whose 
radius is 4 / x, then A(x) 2 nx 4 O( /x ). Next, 
consider the closed region defined by uv € x, 
u>l, and v>1 on the uv-plane. Let D(x) be 
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the number of lattice points lying in this closed 
region. Then D(x)=xlogx +(2C —1)x+ 
O(. /x), where C is the *Euler constant. To 
observe another aspect of the problem of 
estimating the number of lattice points in a 
given region, consider the series 

+a 

È (mW? +n’) = fis), 


m,n-— co 
(m,n) 7 (0,0) 


D 2 
( D n » =g(s). 


Then we have f(s) — È; r(n)n ^, g(s)— 

Lr, dinin ^, where r(n) is the number of 
integral solutions (u, v) of the equation u? + v? 
=n and d(n) is the number of positive integral 
solutions (u, v) of the equation uv =n. Thus the 
problem of estimating A(x) is identical to that 
of estimating H(x) — X, . . r(n); this is called 
Gauss's circle problem. We also have D(x)= 
Ln<x4(n), and the problem of estimating the 
latter is called Dirichlet's divisor problem. 

We set P(x) - A(x) ^ xx and A(x) 2 D(x) 
—(xlog x +(2C — 1)x). W. Sierpinski (1906) 
showed that P(x) 2 O(x!?), and G. Voronoi 
(1903) showed that A(x) 2 O(x!? log x). There 
are further investigations concerning the esti- 
mations of P(x) and A(x). J. G. van der Cor- 
put and E. C. Titchmarsh devised methods to 
estimate more general ‘trigonometric sums. 
For instance, let f(x) be a real-valued func- 
tion of tclass C* (k 23). If 0c A« f 9 (x) « hA or 
0<A< —f P(x) «hà in the intervalaxxb 
(with b—az 1), then 

Y. expQzif(x)) 2 O(h?^ "(b — äh 21" 


acn«b 
t(b—a) LA, 


As of 1968, we have an estimate slightly better 
than O(x!?^?**) for P(x) and A(x) obtained by 
L. K. Hua (1942), C. J. Cheng (1963), and W. 
L. Yin (1959). It is conjectured that P(x) and 
A(x) are O(x!^**), where c is an arbitrary 
positive number. On the other hand, M. Tsuji 
(1953) proved that (7 P(y)/ydy —O(1). G. H. 
Hardy (1916) and A. E. Ingham (1941) showed 
that 





! P(x) D P(x) 

e OO, Hm int sog x < 0, 
A(x) , , A(x) 

KEE DE E 


H. Cramér (1926) showed that 


1 x 
d |P(y)ldy 2 0(x**), 
X Ji 


i x 
| IA(y)I dy = O(x'^). 


X 


G. Voronoi (1904) proved that if x is positive, 
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then 


20 we a ET Qn /nx), 


Y' d(n)=xlogx +(2C—1)x+1/4 


pyx X "neie 


where X’ means that when x is an integer m, 
the mth term d(m) is replaced by (1/2)d(m). 
Here J, (x) is the *Bessel function of the first 
kind, and F(x)=(2/1){>° cos(xu)sin(x/u) du. 
There are many proofs for these expansion 
formulas. E. Landau’s proof (1920) of the 
estimation of X, .r(n) is interesting from the 
geometric point of view, and W. Rogosinski's 
proof (1922) of the estimation of 27,. din) uses 
real analytic methods in an ingenious man- 
ner. A. Oppenheim (1926) generalized these 
problems. 


B. Other Extensions 


Let a,, be rational, a,, — a,,, and Q(u,, ...,u,) 
= SE d,,U,U, be a positive definite qua- 
dratic form with *discriminant D. As an ex- 
tension of Gauss's circle problem, it is nat- 
ural to consider the number of lattice points 
(m,,...,m,) satisfying Q(m,, ...,m,) x. In 
connection with the *Epstein zeta function, 
this problem was extended to that of estimat- 
ing the sum 


Fx) 


Qm, ,...,mp)x 


exp(2ni(a,m, +... + a,m,)). 


Namely, the weight exp(2zi(a,m, +... + a,m,)) 
is placed at each lattice point. We now define ó 
such that ó — 1 if o,,...,«, are all integers and 
ó —0 otherwise. Landau (1915) obtained three 
exquisite proofs of 


ni2 


ur eia MERE 
JD V(n/24- 1) 


I. M. Vinogradov (1960) obtained a deeper 
result for the special case n—3: 3,:,,,,:..1 
— (4/3)nx?? + Q(x1928*9), 

Four times the tDedekind zeta function 
of the Gaussian field OC 9T )is equal to 
XV r(nn *. Hence Gauss's circle problem can 
be extended to that of estimating H(x) for the 
Dedekind zeta function. The generalized divi- 
sor problem, including Gauss's circle problem 
and Dirichlet's divisor problem, was the prin- 
cipal theme of Landau's research after 1912. 
We now consider the case where the Dirichlet 
series ZZ, F(n)n ? is a finite product of Dede- 
kind zeta functions. With a slight modification 
the following result is valid for the product of 
"Hecke L-functions: Let k; (1 <j<t) be an 


x"? Æ O(x""- 1)/2(n+1 ?. 
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algebraic number field of degree n,, ¢;(s) be the 
*Dedekind zeta function of k;, and p; be the 
residue at the pole s= 1 of ¢;(s). Further, we 
assume that 


ni n3... =N, 


C1(s)¢>(s) BEE = 
H(x)= Y, F(n) 
Then 


H(x)=x(a, log! x+... +a, logx+a,) 
+ Q(x" TDN D og! x) 


pt — DI, 


and the remainder O-term of the right-hand 
side cannot be replaced by O(x*) for 0< 1/2 
—1/2N. There are some algebraic results (Z. 
Suetuna (1929), H. Hasse and Suetuna (1931)) 
concerning the estimation of H(x). In particu- 
lar, if in the definition of F(n), the ¢;(s) are all 
equal to the Riemann zeta function, then we 
obtain C(s)* = £2; d,(n)n ^, where d, (n) is the 
number of ways of expressing n as a product of 
k factors. In this special case the remainder 
term can be replaced by O(x^'5), where c= 
max(1/2, (k — 1)/(k + 2)) (k 2 3) (G. H Hardy 
and J. E. Littlewood, 1922). An appropriate 
application of the *Artin L-function was ob- 
tained by Suetuna (J. Fac. Sci. Univ. Tokyo, 
1925), who extended to algebraic number fields 
of finite degree the result obtained by Landau 
(1912)— which states that the number of posi- 
tive integers not larger than x that can be 
expressed as the sum of two squares is ap- 
proximately equal to ax/, /log x, where a is 

a positive constant. 


à, —ipa 
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A. Definitions 


When x and y are elements of an *ordered set 
L, the ‘supremum and *infimum of (x, y}, 
whenever they exist, are called the join and 
meet of x, y and denoted by x Uy and xN y, 
respectively. L is called a lattice (or lattice- 
ordered set) when every pair of its elements has 
a join and a meet. The following three laws 
hold in any lattice L: (i) x Uy yUx, xN y= 
yx (commutative law); (11) x U(yUz) (xU 
y)JUz, xQ (yz) 2 (x y)fàz (associative law); 
and (iii) x U(yf1x) 2 (xU y)N x 2 x (absorption 
law). Conversely, if in a set L two operations 
U, N are given that satisfy (i)- (11i), then the 
conditions xU y = y and xf1y 2 x are equiv- 
alent and define an *ordering x< y in L with 
respect to which L becomes a lattice. The 
supremum and infimum of {x, yj in this lattice 
coincide with the elements xU y and xN y, res- 
pectively. Accordingly, a lattice can also be de- 
fined as an 'algebraic system with operations 
U, N satisfying laws (i)-(iii). The idempotent 
law x Ux =xfx=x holds in any lattice. 

An ordered set L is called an upper semilat- 
tice if each pair of elements x, y always has a 
join (supremum) x U y, and a lower semilattice 
if each pair of elements x, y always has a meet 
(infimum) x 1 y. 


B. Examples 


The set ‘B(S) of all subsets of a given set S is a 
complete and distributive lattice with respect 
to the inclusion relation (— Sections D, E). 
The set of all tnormal subgroups of a given 
"group is a complete and modular lattice (— 
Section F) with respect to the inclusion re- 
lation. This remains true if the normal sub- 
groups are replaced by the tadmissible sub- 
groups with respect to a given *operator 
domain. This applies in particular to the 
case of the set of all *ideals in a given com- 
mutative tring. The set of all subspaces of a 
given fprojective space is a modular lattice. 
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C. Further Definitions 


A mapping f of a lattice L into a lattice L' that 
satisfies the conditions f(xU y) 2 f(x)U f(y) 
and f(xfy)- f(x) f(y) is called a lattice 
homomorphism (or simply homomorphism). A 
tbijective lattice homomorphism f is called a 
lattice isomorphism (or simply isomorphism); 
its inverse mapping is also a lattice isomor- 
phism. When such an f exists, the lattices L 
and L' are said to be isomorphic. More gener- 
ally, a mapping f between ordered sets is said 
to be order-preserving when it satisfies the 
condition: x € y implies f(x) € f(y). Any lat- 
tice homomorphism is order-preserving, but 
the converse is not always true; however, an 
order-preserving bijection is an isomorphism. 

If the ordering in a lattice L is replaced by 
the tdual ordering, then the join and the meet 
are interchanged, and a new lattice L’ is ob- 
tained. This new lattice is called the dual lattice 
for L. 

A mapping f of a lattice L into a lattice L' 
satisfying the conditions f(x y)=f(x)U f(y) 
and f(xU y) 2 f(x) f(y) is called a dual homo- 
morphism (or antihomomorphism). Moreover, 
when f is a bijection, f is called a dual iso- 
morphism (or anti-isomorphism), and we say 
that L and L’ are dually isomorphic (or anti- 
isomorphic) to each other. 

When a lattice L' is a subset of a lattice L 
and the canonical injection L'  L is a lattice 
homomorphism, L’ is called a sublattice of L. 
If a subset L' of a lattice L satisfies the con- 
dition that x, ye L’ implies xU y, xN ye L', 
then two operations U, N can be induced tn L’ 
so that L' becomes a sublattice. For example, 
when a, b are given elements of a lattice L, the 
set of elements x satisfying a x <b is a sub- 
lattice, denoted by [a,b] and called an interval 
of L. When the quotient set L/R of a lattice L 
by an equivalence relation R in L is also a 
lattice and the canonical surjection L—> L/R is 
a homomorphism, then L/R is called a quo- 
tient lattice of L. If an equivalence relation R 
in a lattice L satisfies the condition that x= x’, 
yz y' (mod R) implies xUyzx'Uy, xNy= 
x'f1 y' (mod R), then two operations U, N can 
be induced in L/R so that L/R becomes a 
quotient lattice. The Cartesian product L= 
IT;., L; of a family (L;j;., of lattices becomes 
a lattice if the operations U, f! are defined by 
(x) U(y) 9 GU yg, (x) (y) 2 (; 1 y). This 
lattice is called the direct product of lattices 
{Lijer 


D. Complete Lattices 


An ordered set L is called a complete lattice if 
every nonempty subset of L has a supremum 
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and an infimum in L, and a c-complete lattice 
if every nonempty countable subset has a 
supremum and an infimum. Naturally, such 
ordered sets are lattices. And a sort of con- 
verse holds: Any lattice is a sublattice of some 
complete lattice. An ordered set L is said to be 
conditionally complete when every subset 
*bounded from above (below) has a supremum 
(infimum) in L, and conditionally c-complete 
when every countable subset bounded from 
above (below) has a supremum (infimum). For 
any ordered set L there exist a complete lattice 
L and an order-preserving injection f: LoL 
satisfying the condition that each element Ze L 
is the supremum and infimum of the images 
f(X) and f(Y), respectively, for some sets X, 

Y cL. This condition is equivalent to the con- 
dition that for any complete lattice L' and 
order-preserving injection f": L5 L', there 
exists an order-preserving injection q: L2 L' 
for which oo f — f". Hence (L, f) is unique up 
to lattice isomorphisms. L is called the com- 
pletion of the ordered set L. For example, the 
set of real numbers supplemented by +00 and 
—oo is the completion of the set of rational 
numbers. 


E. Distributive Lattices 


A lattice L is said to be distributive when it 
satisfies the following equivalent conditions 
(distributive laws) for x, y, ze L: (i) x U(yNz) 
=(xU y)N(x Uz); (ii) xQn(yUz) » (xn y)U 
(xz); and (iit) (x Uy) N(yUz)N(zUx)= 
(xN y)U(yNz)U(zN x). The dual lattices, 
sublattices, quotient lattices, and direct prod- 
ucts of distributive lattices are distributive. 
The set P(S) of subsets of a given set S isa 
distributive lattice, and each of its sublattices is 
called a lattice of sets in S. A distributive lat- 
tice is isomorphic to a certain lattice of sets. A 
homomorphism from a distributive lattice L 
into P(S) is called a representation of L in S. 

A lattice L is said to be complemented when 
a greatest element J and a least element 0 exist 
in L and for every element x, there exists an 
element x’ satisfying xUx' 2 I, xMx'=0. 
Such an x' is called a complement of x. A 
lattice that is distributive and complemented is 
called a Boolean lattice (or *Boolean algebra). 
In a Boolean lattice, every element has a 
unique complement. The lattice 3B(S) of all 
subsets of a given set S is a Boolean lattice, in 
which S is the greatest element and @ the 
least element. A sublattice of P(S) that con- 
tains the complement of each of its elements is 
also a Boolean lattice and is called a Boolean 
lattice of sets. Any Boolean lattice can be rep- 
resented isomorphically by some Boolean 
lattice of sets (— 42 Boolean Algebra). 
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An element a of a lattice is called a neutral 
element if for any pair of elements x and y, the 
sublattice generated by a, x, y is distributive. 
When a is neutral and has a complement, a is 
called a central element. The set of all central 
elements of a lattice L is called the center of L. 


F. Modular Lattices 


A lattice L is said to be modular if the follow- 
ing condition (modular law) is satisfied: x< z 
implies x U( yMz)=(xUy)Nz. A distributive 
lattice is always modular. The dual lattices, 
sublattices, quotient lattices, and direct prod- 
uct of modular lattices are also modular. 

The tJordan-Holder theorem and the refine- 
ment theorem of O. Schreier on normal sub- 
groups (— 190 Groups) are generalized as 
follows to the case of any modular lattice: À 
pair of elements x, y in a lattice L satisfying 
x> y is called a quotient and is denoted by 
x/ y. In particular, when x> y and there exists 
no element z such that x >z> y, then x/y is 
called a prime quotient, x is said to be prime 
over y, and y is said to be prime under x. A 
sequence C: xo, x, ..., x, of elements of L 
satisfying the conditions x; > x; (1 &i«k) is 
called a descending chain, and k is called its 
length. Each A, ,/x; is called a quotient deter- 
mined by C. When each of these quotients is 
prime, C is called a composition series. A de- 
scending chain D: yo, Y1, ..., y; is called a re- 
finement of a descending chain C: x», x,,..., 
x, when x — yo, x, = y; and each x; is equal 
to some y;. We now define an equivalence 
relation between descending chains. First, a 
relation x/y « x'/y' between quotients x/y and 
x'/y' is defined to mean that either the con- 
dition x 2 x'U y, y =x N y or the condition 
x =xUy, yzxfy' holds. Then x/y and 
x'/y’ are called equivalent if there exists a finite 
number of quotients qo, 4,, ..., q, satisfying the 
conditions x/y 7 qo, x'/y' = q, and q;_, x qj 
(1 <i<r). Descending chains C and C’ are 
said to be equivalent if they have the same 
length and the set of quotients determined by 
C is mapped bijectively to the set of quotients 
determined by C' so that the quotient and its 
image are equivalent. Now let L be a modular 
lattice. If two quotients x/y and x'/y' in L are 
equivalent, the intervals [ y, x] and [ y, x], 
considered as lattices, are isomorphic (Dede- 
kind's principle). If two descending chains 
C:xg, X4, ..., x, and C’: yo, Yis ..., y; have the 
same ends x, = yo and x, — y; then there exist 
a refinement of C and a refinement of C' which 
are equivalent. In particular, any two compo- 
sition series connecting the same elements are 
equivalent (— 85 Continuous Geometry). 

In a modular lattice L with a least element 
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0, if there exists a composition series connect- 
ing 0 and a given element a, then all such 
composition series have a common length k, 
denoted by d(a) and called the height of the 
element a. If no such composition series exists, 
the height is defined to be d(a) 2 oo. If two 
elements a, b of L satisfy d(aU b) « oo, then 
d(aU b) + d(aN b) 2 d(a) 4- d(b). This fact is 
called the dimension theorem of modular lat- 
tices. If L has a greatest element J, d(I) is 
called the height of the lattice L. 

In a complemented modular lattice L, an 
element which is prime over the least element 
0 is called an atomic element. A complemented 
modular lattice L is said to be irreducible if 
any two atomic elements have a common 
complement. 


G. Lattice-Ordered Groups 


An ordered set G in which a group operation 
is defined is called an ordered group when 

x< y implies xz € yz and zx & zy for all x, y, z 
in G. Moreover, if it is a ftotally ordered set, 
the ordered group G is called a totally ordered 
group. If G is a lattice, the condition for an 
ordered group is equivalent to the condition 
that (xU y)z = xzU yz, (xN y)z = xz yz, z(xU 
y)=zx Uzy, and z(xf y) 2 zxfzy. In this 

case, G is called a lattice-ordered group. A 
lattice-ordered group is a distributive lattice 
and has neither a greatest nor a least element. 
If {x;} has a supremum in a lattice-ordered 
group, then we have (sup; xj) y ^ sup;(x;f1 y) 
and its dual (complete distributive law). 

The lattice-theoretic structure of a lattice- 
ordered group was clarified by P. Lorentzen, 
A. H. Clifford, and T. Nakayama. In particular, 
a lattice-ordered commutative group is isomor- 
phic (as a lattice-ordered group) to some sub- 
group of a direct product of totally ordered 
groups. A lattice-ordered group has no ele- 
ment of finite order other than the identity 
element. Conversely, a commutative group 
which has no element of finite order other than 
the identity element can be made a lattice- 
ordered group with respect to some total 
ordering. Any free group can also be made a 
lattice-ordered group with respect to some 
total ordering. K. Iwasawa (1948) and others 
have done further research on totally ordered 
groups. 

An element x (e) of a lattice-ordered 
group G is called positive (negative) when x ze 
(x € e), where e is the identity. G 1s called an 
Archimedean lattice-ordered group when the 
following condition is satisfied: if, for some y, 
x" « y for all natural numbers n, then x € e. 

An Archimedean lattice-ordered group is 
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isomorphic to some subgroup of a complete 
lattice-ordered group. Conversely, a complete 
lattice-ordered group is Archimedean; more- 
over, it is commutative and isomorphic to 
the direct product of some tlattice-ordered 
linear spaces and copies of the lattice-ordered 
group of rational integers (Iwasawa, 1943). In 
particular, any totally ordered Archimedean 
lattice-ordered group is isomorphic to a sub- 
group of the lattice-ordered group of all real 
numbers. 

If the *minimal condition holds for the set of 
all positive elements in a lattice-ordered group, 
then the group is commutative, and each of its 
elements can be decomposed uniquely into a 
product of powers of elements that are prime 
over the identity element. The set of all tfrac- 
tional ideals of an algebraic number field is a 
typical example of such a lattice-ordered 
group. For further reference — 310 Ordered 
Linear Spaces, 85 Continuous Geometry. 
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Lebesgue, Henri Léon 


Henri Lebesgue (June 28, 1875—July 26, 1941) 
was born in Beauvais (Oise), studied at the 
Ecole Normale Supérieure, and received in 
1902 the doctoral degree for his thesis con- 
cerning integration [1]. After teaching in 
Rennes and Poitiers, he came to Paris where 
he was nominated for a professorship at the 
Faculty of Science in 1920 and then at the 
Collége de France in 1921. He was one of the 
most influential French analysts of this cen- 
tury and is known as the inventor of the fLe- 
besgue integral. With deep insight based on 
intuitive geometric conceptions, he was able to 
initiate a new cra in analysis by creating the 
theory of this integral. Not only was this 
theory the start of modern integration, it was 
also a turning point in the theory of Fourier 
series and *potential theory. The notions of 
*Lebesgue dimension of topological spaces and 
of tLebesgue number of compact sets are due 
to him. Lebesgue also made significant contri- 
butions to the tDirichlet problem. 
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Leibniz, Gottfried Wilhelm 


Baron Gottfried Wilhelm von Leibniz (July 1, 
1646- November 4, 1716) was born the son of 
a professor and grew up to be a genius with 
encyclopedic knowledge. He took part in 
politics and touched all scholarly fields, con- 
tributing creatively to the development of 
technology as well. His posthumously pub- 
lished works cover theology, philosophy, 
mathematics, the natural sciences, history, and 
technology, and are classified into 41 fields. A 
complete edition of his works has yet to be 
published. Ars combinatoria, written upon his 
graduation from Altdorf in 1666, was a scheme 
to systematize the various fields using mathe- 
matics as a model. During his stay in Paris 
(1672-1676), when not involved in politics, he 
studied the works of tDescartes and *Pascal, as 
suggested by C. Huygens. He discovered the 
tfundamental theorem of differential and in- 
tegral calculus and set up a basis for calculus 
with the introduction of an ingenious system 
of notation. After 1676, he worked on histor- 
ical compilations under the Duke of Hanover. 
Leibniz worked not only on the synthesis of 
mechanistic philosophy and medieval theolog- 
ical philosophy but also on the reconciliation 
of Protestantism and Catholicism. With his 
monadism he attempted to unify the old and 
new philosophies. In addition he worked on 
plans for a world academy for the develop- 
ment of learning and on the unification of 
all knowledge. This was to be accomplished 
using, for example, universal symbolism and 
universal linguistics. Under his influence, the 
Berlin Academy was established in 1700. After 
his death, his conceptions of *symbolic logic 
and *computers were realized. 
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A. Length of a Curve 


A continuous mapping C sending each point u 
of an interval I— (u|a&u € b] to a point p= 
p(u) — (x4(u), ..., x,(u)) of the k-dimensional 
Euclidean space R* (k > 2) is called a tcontin- 
uous arc or continuous curve, denoted some- 
times by C:p = p(u). The supremum of the 
length of a polygonal curve inscribed in C is 
called the length of C (in the sense of Jordan) 
and is denoted by /(C). Namely, it is equal 

to sups Xr; | p(u;) — p(u;_,)|, where ô is a par- 
tition of I:a-ug <u; ... «u, —b. Let C:p= 
p(u), and C,:p=p,(u), n=1, 2, ... (ue I) be 
continuous arcs. If p,(u)— p(u) on I, then 
I(C) x liminf, IC This property ts called the 
lower semicontinuity of length. Given two 
continuous arcs C: p= p(u) (ue I) and Cp 

= q(v) (ve L), if for any £2 0, there exists a 
homeomorphism u=h,(v) of I, onto J such 
that | p(h,(v)) — q(v)| « e on 4, then C and C, 
are called equivalent in the sense of Fréchet. 
Equivalent continuous arcs have the same 
length. À continuous image of an open interval 
is called an *open arc. The length of an open 
arc is defined to be the supremum of the length 
of a continuous arc contained in the open arc, 
and the notion of the equivalence of open arcs 
is defined in the same way as for continuous 
arcs. An equivalence class of continuous arcs 
(or open arcs) is called a Fréchet curve, and its 
length is defined uniquely. 

Suppose that a continuous arc C ts ex- 
pressed by (x,(u), ..., x,(u)), ue I. Then the 
length /(C) is finite if and only if every x;(u) is 
of fbounded variation. When (C) is finite, C is 
called rectifiable. In this case each Ox;/Ou exists 
talmost everywhere on I, and the inequality 


b k Or: 2\ 1/2 
eye w 


holds. The equality holds if and only if each 
x,(u) is absolutely continuous. Among the 
continuous arcs equivalent to C, there exists a 
unique continuous arc C, such that C,:q — q(s) 
(0x s x I(C)) and the length of every subarc q = 
q(s) (0 € s &s'( € I(C))) is equal to s’. C, is called 
the representation in terms of arc length of C. 
For C,, the equality holds in (1). A similar 
argument is valid for any open arc. When 
every subarc of C is rectifiable, C is called 
locally rectifiable. If A, is the 1-dimensional 
*Hausdorff measure in R* and n(p) is the 
number of points on / corresponding to 

pe R*, then (C) 2 fn(p)dA (p) (M. Ohtsuka, 
1951). 
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B. Surface Area 


In this section, we deal with area of tsurfaces 
in R?, using [1] as the main reference. In con- 
trast to the situation for curves, the area of a 
polyhedral surface P inscribed in a given sur- 
face does not necessarily tend to a fixed value 
as P approximates the surface. In a letter 

of 1880, H. A. Schwarz gave the following 
example: Approximate a circular cylinder with 
height h and radius r by a sequence (P, of 
inscribed polyhedral surfaces, each P, consist- 
ing of similar triangles of height b, and base 
length a,. If b,/a? is suitably chosen, then the 
surface area of P, tends to an arbitrary value 
not smaller than 2zrh (Fig. 1). 
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C. Lebesgue Area 


Suppose that we are given a plane domain A 
and a continuous mapping T of A into R?. 
The pair (T, A) is called a surface. Let d(T, 
T’, B) sup,.gl T(w) — T'(w)| for surfaces 
(T, A), (T', A’) and a set Bc AN A’. Let (T, A), 
(T,, A1), (T2, A5), ... be given. If 4,14 and 
d(T, Ta, A,)-50, then (T,, A,) (or simply T,) is 
said to converge to (T, A) (or T), and the con- 
vergence is expressed by 7, T. In particular, 
if A consists of a finite number of triangles and 
T is linear on each triangle, i.e., the image of 
A under T consists of triangles, then the nota- 
tion (P, F) is used for (T, A), and the area of 
T(A) is denoted by a(P, F). Given a surface 
(T, A), denote the totality of sequences 
{(P,, F,)} converging to (T, A) by ®, and call 
inf lim inf, a(P,, F,) the Lebesgue area of (T, A). 
This area is denoted by L(T, A). By virtue of the 
definition there exists a sequence {(P,, EI) con- 
verging to (T, A) such that a(P,, F,,) > L(T, A). 
Like length, Lebesgue area has the lower semi- 
continuity property. Namely, T, T implies 
L(T, A) <lim inf, L(T,, A,). When A is a Jordan 
domain, the same value L(T, A) is obtained if 
® is replaced by the set ®* of all sequences 
{(P,, F,)} such that FITA and P,(w) T(w). 

Let f(x, y) be a continuous function defined 
on0<x<1,0<y<1. Regard f(x, y) as a func- 
tion of y (resp. x) for a fixed x (y), and denote 
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it by f. (y)C/,(x)) and its "total variation by 
V69(V, When fò V(x)dx + f$ Vi (y) dy « oo, 
f(x, y) is said to be of bounded variation in the 
sense of Tonelli. Furthermore, if f.( y) and f(x) 
are fabsolutely continuous for almost every x 
and y, respectively, then f(x, y) is said to be 
absolutely continuous in the sense of Tonelli. 
Similar definitions are also given when f is 
defined in a general domain. Suppose that a 
surface (T, A) is expressed by a set of three 
functions x = x(u, v), y= y(u, v), z = z(u, v), all of 
which are absolutely continuous in the sense 
of Tonelli, and that the partial derivatives x,,, 
X», ..., Zy are square integrable. Then L(T, A4) — 
f fa J dudv < oo (as was shown by C. B. Mor- 
rey), where J =(J? + J? -J2)? with tfunctional 
determinants J,, Ja, J, of the transformations 
(u, v) Cy, z), (z, x), (x, y). 


D. The Geócze Problem 


The Geócze problem is the problem of deter- 
mining whether L(T, A) coincides with the area 
obtained by using (instead of ®) the set of all 
sequences {(P,, F,)} such that (P,, F,) converges 
to (T, A) and each (P,, F,) is inscribed in (T, A). 
The answer is affirmative when A is a Jordan 
domain with L(T, A) « oo. Let T be a surface 
expressed by a function z= F(x, y) (Ox x « 1, 
Ox y « 1) that is absolutely continuous in the 
sense of Tonelli, and let ((P,, F,)} be as before. 
If the ratio of the length of the largest side and 
the smallest height of each triangle in (P,, F,) 

is uniformly bounded, then a(P,, F,) tends to 
L(T, A) [1, p. 74]. 


E. Geócze Area 


Consider a surface (T, A). Let E,, E,, E, be 

coordinate planes in R?, and denote by T; 

(i= 1,2,3) the composition of the mapping T 

and the projection of R? onto E Let ën be the 

positively oriented boundary of a polygonal 

domain z in A and C; be the oriented image 

of ôn by T;. Then the forder O(z; C,) of z with 

respect to C; is a measurable function of z. Set 

v(T; n)  v;i— le |O (z; C)| dx dy (z ^ x + iy) and 

v(T; nr) =v — (vi - v2 + v3)". The quantity 

V(T; A) - sup 3 v(T; n) (2) 
S eeh 

is called the Geócze area of (T, A), where S is a 

finite collection of polygonal domains in A 

such that no two of them overlap. If V(T, A) 

< oo, u; is defined by f fz, O(z; C) dx dy, and 

U(T, A) is defined as in (2) by means of uj, then 

U(T, A) - HOT, A). The inequalities V(T;, A) € 

V(T, A) « V(T;, A) - V(T,, A) V(T,, A) hold 

trivially. 
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F. Peano Area 


Consider (T, A) and z as in the previous sec- 
tion. Let t be the projection of R? onto a plane 
E, and denote by C' the image of the bound- 
ary of z under the composite mapping to T. 
Set v(T, x, E) - f f |O(z; C')| de, where do is 
the surface element on E, and set Y(T, lz 
sup, v(T, z, E). Define 
P(T, A) sup Y. W(T, n) 

S meS 
as in (2). This is called the Peano area of (T, A). 
H. Okamura defined area by integrating the 
tmapping degree instead of |O(z; C")| [5]. 
L, V, P all coincide, and hence L and V are 
invariant under any orthogonal transforma- 
tion of R?. 


G. Other Definitions of Area 


As in the definition of Peano area, consider z, 
t, E, and denote the Lebesgue measure of to 
T(x) by m(n, E). If we set u(x) 2 supe m(z, E), 
then we can define the area of (T, A) by 

supe Ères u(n). If v(x) — (m? (n, E.) + m? (n, E;) 

+ m? (x, E4))!? is used instead of u(x), then 

the Banach area of (T, A) is obtained, and if 
[f10(z; C)| dx dy is used instead of m(z, E;), 
then the Geócze area is obtained. Let us define 
various kinds of area for an arbitary *Borel set 
X in R?. Divide R? into meshes M,, M,, ... 
which are half-open cubes with diameter of 
equal length d, and denote by m? (i= 1, 2, 3) 
the tLebesgue measure of the projection of 
M,N X onto the ith coordinate plane. The limit 
of 2: (m$)? + o? + (my?) as d50 is 
called the Janzen area of X. Denote by m; the 
supremum with respect to the set of planes E 
in R? of the Lebesgue measure of the projec- 
tion of M,N X to E. Then lim2;m; as d>0 is 
called the Gross area of X. C. Carathéodory 
covered X by a countable number of convex 
sets K,, K,,... each of whose diameters is less 
than 6>0, denoted by m; the supremum of the 
Lebesgue area of the projection on K; into E, 
and adopted lim 2m; as 6-0 as his defini- 
tion of area of X. If the K; are restricted to be 
spheres, then Carathéodory's area divided by 
7/4 is identical with the tHausdorff measure 
A,(X). (For other definitions and mutual 
relations — [4].) 

We give a measure-theoretic definition of an 
area for a surface (T, A) as follows. Denote by 
n(p) the number of points in A corresponding 
to a point p in R?, and call n(p) the multiplic- 
ity function of the mapping T. The integral 
A(T, A) — (1/4) [n(p) dA; (p) can be taken as a 
definition of the area. However, a different 
definition of the multiplicity function is needed 
for the integral to be equal to L(T, A) [3,6]. 
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In fact, if x 2 o(u), y 2 (u) (O &u& 1) represent 
a tPeano curve filling the square 0x x « l, 
Ox yx, then the Lebesgue area of the sur- 
face (T, A) defined by A={0<u<1,0<v<1} 
and T:x 2 o(u), y 2 v(u), z=0 is zero, but 
A(T, A) z 1. 


H. Mappings of Bounded Variation 


Let T be a mapping of a domain A in the 
w-plane into the z-plane, and define z, O — 
O(z; C), and S as in Section E. Set 


0*(z; C) «(0| + 0)/2, 
0 (z; C)-(10|— 0)/2, 


v(T, =| lioe C)| dx dy, 


v*(T, n= | (o*t C)dxdy 


(the same signs correspond to each other), 
V(T, A)  Ssups ? «es t(T, n), 

V (T, A) 2 sups dines? (T, m), 

N(z; T, A) - sups? .slO(z; CHL 

and 

N*(z; T, A) - sups? zes O*(z; C). 


Then N, N* are lower semicontinuous in the 
z-plane. The integrals W(T, A) 2 f f N dx dy, 
W*(T, A)=JJN* dxdy, and W (T, A)— 

fI N^ dxdy are called the total variation, 
positive variation, and negative variation of T, 
respectively, and the equalities W= W+ + W7, 
V=W, and V* 2W* hold. When W(T, A) « 
co, T is said to be of bounded variation. A 
related notion is defined as follows: T is ab- 
solutely continuous if the following two con- 
ditions hold. (1) For any given ¢>0 there 
exists a ô >Q such that ¥,.,0(T, x) « e when- 
ever the sum of the areas of nes is <ô. (2) 
For any polygonal domain zy such that zo U 
Ong c A and any polygonal subdivision S of Zo, 
V(T, n9) 2 Lares V(T, n). If the area of A is finite 
and T is absolutely continuous, then T is of 
bounded variation. 

Let T be a continuous mapping of bounded 
variation of a domain A in the w-plane into 
the z-plane. The derivatives V'(w), V; (w), V' (w) 
of the set functions HOT, A), V* (T, A, V (T, A) 
exist falmost everywhere (a.e.) in A and are 
finite. The difference J(w)= V; (w) — V! (w) is 
called the generalized Jacobian, and the rela- 
tion J(w) 2 V'(w) holds ae If x(u, v), y(u, v) are 
differentiable a.e., then J(w) coincides with the 
ordinary ‘functional determinant a.e. Next, let 
T be a continuous mapping of A into R? with 
V(T, A) « oo, and denote by J;(w) the gen- 
eralized Jacobian of (T;, A). Then J(w) =(J?(w) 
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+ J?(w) + J2(w))'? is called the generalized 
Jacobian of (T, A). The relation J(w)= + V'(w) 
holds a.e. in A. Therefore 


V(T, as[| J(w)du dr (3) 


is valid. The equality holds if and only if each 
(T,, A) is absolutely continuous. 


I. Fréchet Distance 


Let (T,, A,) and (T,, A2) be surfaces, and as- 
sume that the set H of homeomorphisms be- 
tween A, and A, is nonempty. Define the 
Fréchet distance between two surfaces by 
| T3, T] = inf sup | Ti (w)— 7, (A(w))]. 

heH weA, 
It satisfies the three axioms of distance (— 273 
Metric Spaces). When ||T,, T|| =0, T, and T; 
are called equivalent (in the sense of Fréchet). 
A set of all equivalent surfaces is called a 
Fréchet surface. Equivalent surfaces have 
equal Lebesgue areas; hence Lebesgue area is 
well defined for any Fréchet surface. Given a 
surface (T, A) with L(T, A) « oo, there exists a 
pair (T,, A,), equivalent to (T, A) such that the 
functional determinants J;(w) exist for (T; , A,) 


and 
Y(T, all J(w)dudv, 
A, 


where J?(w) = ÈE}; J2(w), as before. The prob- 
lem of finding such a pair (T,, A,) is called the 
representation problem. Moreover, we can 
choose (Tj, A,) to be a generalized conformal 
mapping in the following sense: Express 

(T,, A1) by x 2 x(u, v), y = y(u, v), z= z(u, v). 
Then x,, x,, ...,z, exist a.e. in A, and are 
square integrable, x2 + y2 - z2- x2 + y2 4 z2, 
and x,x, - y,y, H z,z, —Ü ae. in A,. 


J. Higher-Dimensional Case 


In higher-dimensional spaces, many results are 
also known, such as area and coarea formulas 
[11] and a generalization of Morrey's result 
on Lebesgue area (C. Goffman and W. P. 
Ziemer, 1970). 


K. Hausdorff Dimension 


A. S. Besikovich [9] has shown that for every 
set S in a Euclidean space, there exists a real 
value D such that the td-dimensional Haus- 
dorff measure is infinite for d < D and van- 
ishes for d> D. This D is called the Hausdorff 
dimension. 

F. Hausdorff had already considered this 
measure for tCantor sets and Koch curves. 
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A fractal [ 10] is defined as a set for which ` 
the Hausdorff dimension strictly exceeds the 
*dimension of the set, which is topologically 
defined in a Euclidean space. 

We show only one example, the coastline of 
a triadic Koch island constructed in the fol- 
lowing way. Consider an equilateral triangle 
with sides of unit length. Remove the middle 
third of each side, and attach in its place a V- 
shaped peninsula bounded by two sides of an 
equilateral triangle with side length 1/3. We 
thus get a Star of David. Repeat the same 
process of formation of peninsulas for each 
segment of the star’s sides. If we continue this 
process indefinitely, then we get a complicated 
coastline whose Hausdorff dimension is 
1.2618. Of course the tlength of this coastline 
is o0. 

B. Mandelbrot mentions many interesting 
examples of fractals having fractional dimen- 
sions in his book [10]. 
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Marius Sophus Lie (December 17, 1842— 
February 18, 1899), a Norwegian mathema- 
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tician, is famous as the founder of the theory 
of *Lie groups. From 1869 to 1870 he colla- 
borated with F. Klein on tsphere geometry, 
which led Lie to develop the concept of con- 
tinuous groups. This discovery was the step- 
ping stone that allowed Klein to complete his 
ideas for the "Erlangen program. In 1872, Lie 
became a professor at the University of Chris- 
tiania (now Oslo). In 1886, Lie succeeded Klein 
in the chair of mathematics at Leipzig, where 
he remained until 1898. Then he returned to 
Christiania, where a post was created for him, 
but he died one year later. 

The continuous groups that Lie dealt with 
are today called the "Lie transformation group 
germs. With the free use of geometric concepts 
and analytic methods (especially the theory of 
*differential equations) he was able to develop 
his theory and apply it to the theory of dif- 
ferential equations. The significance of his 
work was not recognized until after his death. 
Early in the 20th century, E. *Cartan and H. 
*Weyl were able to complete the theory of Lie 
groups, and by the middle of the century, the 
characteristics of the Lie group as a *topolog- 
ical group were clarified. 
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A. Basic Concepts 


Let K be a fcommutative ring with unity. A set 
g is called a Lie algebra over K if the following 
four conditions are satisfied: (i) g is a "left K- 
module, where we assume that the unity of K 
acts on g as the identity operator. (ii) There is 
given a K-bilinear mapping (called the bracket 
product) (X, Y)>[X, Y] from g x g into g: 


b tX is YB Y, | set a f; X;, Yi] 
for all a, B; in K and X;, Y; in g. (iii) LX, X]=0 
for every X in g. (Hence [X, Y]=—[Y,X] 


for every X, Y in g (alternating law).) (iv) 
[X, LY,Z]]+LY, (Z, X]] - LZ. LX, Y ]] 20 for 








248 B 
Lie Algebras 


every X, Y, Z in g (Jacobi identity). In partic- 
ular, if K =C (the complex number field) or 
K =R (the real number field), g is called a 
complex Lie algebra or a real Lie algebra, 
respectively. 

For example, let d be an *associative alge- 
bra over K. Putting[ X, Y] - XY — YX, we 
can supply 9I with the structure of a Lie alge- 
bra over K, which is called the Lie algebra 
associated with 9I. In particular, if 9I is the 
*total matrix algebra K, of degree n over K, 
then the Lie algebra associated with K, is 
called the general linear Lie algebra of degree 
n over K and is denoted by al(n, K). 

Let q be a Lie algebra over K and a, b be 
*K-submodules of g. The subset of g consisting 
of elements of the form X[A, B] (finite sum) 
with Aca, Beb is denoted by [a,b], which is a 
K-submodule of g. A K-submodule a of g is 
called a Lie subalgebra of g if [a, a] c a. A sub- 
algebra a of g is called an ideal of g if [a, g] c 
a (this condition is equivalent to [9,a] c a). 

If a is a subalgebra of g, the restriction of the 
bracket product of g on a makes a a Lie alge- 
bra over K. If a is an ideal of g, the tquotient 
K-module g/a is a Lie algebra over K relative 
to the bracket product [X +a, Y - a] 2 [X, Y] 
+a. This Lie algebra g/a is called the quotient 
Lie algebra of g modulo a. 

Let o, 9; be Lie algebras over K. A map- 
ping f:8,—9; is called a homomorphism of 
gı into a; if f is K-linear and f([X, Y ])— 
Lf/GX), f(Y)] for every X, Y in g,. A bijective 
homomorphism is called an isomorphism. 
Then g, ts said to be isomorphic to g, if there 
exists an isomorphism from g; onto g,, and 
we write go Sg: If f:9, 9, is a homomor- 
phism, then f(g,) is a subalgebra of g,, and the 
kernel a= f! (0) of f is an ideal of g,. Fur- 
thermore, the homomorphism f induces an 
isomorphism f: g,/a—f(g,) (homomorphism 
theorem). 

The direct suni o, + g, of two Lie algebras 
g1, 8; Over K is defined as in the case of as- 
sociative algebras. Then g,, 9; are ideals of 
9i +92. 

The set A(q) of all automorphisms of a Lie 
algebra g is a subgroup of the general linear 
group GL(q). A(g) is called the (full) automor- 
phism group of o. 


B. Representations 


Let g be a Lie algebra over K, and let V bea 
K-module. Denote by £(V) the associative 
algebra consisting of all K-linear mappings 
from V into V. Denote by gl(V) the Lie algebra 
associated with £(V). (Note that if V has a 
basis consisting of m elements over K, then 
al(V)z al(m, K).) A homomorphism p:98-gl(V) 
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is called a representation (more precisely, a 
linear representation) of g over V, and V is 
called the representation space of p. If V is a 
*free K-module of rank m, then m is called the 
degree of the representation p. We also use 

(p, V) instead of p to mention the representa- 
tion space explicitly. The concepts concerning 
the representations such as 'equivalence, tirre- 
ducibility, or complete reducibility are similar 
to the corresponding concepts found in the 
representation theory of associative algebras 
(— 362 Representations). In particular, by 
taking V=g and putting p(X) Y=LX, Y] 

(X, Yeg), we obtain a representation of g, 
called the adjoint representation of g, and p(X) 
is denoted by ad(X). Then ad(g) — (ad(X)| 

X €g} is a subalgebra of al(a) and is called the 
adjoint Lie algebra of g. 

Let (p, V) be a representation of g. Then 
there is associated with this representation a 
*symmetric bilinear form B,:g x g— K given 
by B,(X, Y)—- trp(X)p(Y), where B, satisfies 
the invariance property B,([X,Z], Y)= 
B,(X,[Z, Y ]). In particular, if p — ad, then 
we write B instead of B p» and B is called the 
Killing form of a. 

Let g be a Lie algebra over R of dimension 
n, and let ad: g— al(g) be the adjoint represen- 
tation of g. Put det(t] —ad(X)) 2 3.9 t! P(X) 
for every element X eg. Then the P(X) are 
polynomial functions on g, and P,— 1. Let | be 
the least integer such that P, #0. Then | « Trank 
g. An element X of g is called regular (singular) 
if (X) x0 (P,(X)=0). The subset og of g con- 
sisting of all regular elements of g is open and 
dense in g. The subset o a' of g consisting of 
all singular elements of g is of measure zero 
with respect to the tLebesgue measure of g, 
which is obtained uniquely up to positive 
scalar multiples by means of a linear isomor- 
phism of o onto R" using any basis of g over R. 

Now suppose that g is reductive (— Section 
G). Then an element X eg is regular if and 
only if the centralizer 3y = (Yeg|ad(X)Y 20] 
of X is a Cartan subalgebra (— Section I) of g. 
Furthermore, if X €g is regular, then ad(X) is a 
'semisimple linear endomorphism of g. 


C. Structure of Lie Algebras 


Suppose that a, b are ideals of a Lie algebra g. 
Then [a,b] is also an ideal of g. In particular, 
g has the following ideals: g’ =[g,q], g” = 
[959], ..., 9€" 2 [a9, g®], .... Furthermore, 
we have g>q/>q">.... This series is called 
the derived series of g, and oa is called the 
derived algebra of o The Lie algebra g is said 
to be Abelian if g =0 and solvable if g =0 for 
some k. Now put a! 29, g9? =[9, 91], 9? = 


Ls — 


[9:07], 0 =[g, 9f], .... Then gt, sss 
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are all ideals of g, and we have al >97 >g° 

2 .... This series is called the descending cen- 
tral series of a, and g is said to be nilpotent if 
g*=0 for some k. An ideal a of g is called 
Abelian (solvable, nilpotent) if the subalgebra a 
is Abelian (solvable, nilpotent). 

Put 3={Aeg|[X, A]=0 for every X in g}. 
Then 3 is an Abelian ideal of g, called the 
center of g, and is the kernel of the adjoint 
representation of g. Define the ideals 3,, 35, ... 
of g as follows: 3, is the center of g, 3,/3, is the 
center of g/3,,...,3;41/3; is the center of g/3,,.... 
Then we have 0<3, c3; c .... This series is 
called the ascending central series of g, and g is 
nilpotent if and only if 3, =g for some k. 

We assume that K is a field of characteristic 
0 and Lie algebras over K are of finite dimen- 
sion. Let X,,..., X, bea basis of g over K. 
Then the n? elements c$ in K defined by 
[X;, X;] 2 X c; X, are called the structural 
constants of g relative to the basis (X;). 


D. Radicals and Largest Nilpotent Ideals 


The union r of all solvable ideals of q is also a 
solvable ideal of g, called the radical of g. The 
union n of all nilpotent ideals of g is also a 
nilpotent ideal of g, called the largest nilpotent 
ideal of g. The ideal s=[r, g] is called the 
nilpotent radical of g. We have go>r>n>2s. 


E. Semisimplicity 


A Lie algebra is called semisimple if its radical 
is 0. A semisimple Lie algebra g over K is 
called simple if g has no ideals other than g 
and 0. If r is the radical of a Lie algebra g, 
then g/r is semisimple. Every semisimple Lie 
algebra is a direct sum of simple Lie algebras. 
For example, put t(n, K) = (A =(a,)egl(n, K)| 
aj 0 for every i<j} and n(n, K)={A —(aj)e 
t(n, K)|a,; — 55 — ...—a,,—0]. Note that 
t(n, K) is the set of all lower triangular ma- 
trices and n(n, K) is the set of nilpotent lower 
triangular matrices. Then t(n, K) is a solvable 
subalgebra of gl(n, K), and n(n, K) is a nilpo- 
tent subalgebra of gl(n, K). Put sI(n, K) - 
{Aegl(n, K)|tr A —0]. Then sl(n, K) is an ideal 
of al(n, K). For nz 2, sl(n, K) is a simple Lie 
algebra. 


F. Theorems 


The following theorems are fundamental in 
the theory of Lie algebras: 

(1) Engel's theorem (valid even if K is of 
positive characteristic): Let V be a finite- 
dimensional vector space over a field K such 
that V z (0). Let g be a subalgebra of gl(V) 
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consisting of nilpotent elements. Then there is 
a nonzero element v in V such that Xv=0 for 
every X in g. (Thus by choosing a suitable 
basis of V and identifying gI(V) with gl(n, K), 
we have qc n(n, K), where n — dim V.) 

(2) Lie's theorem: Let (p, V) be an irreduc- 
ible representation of a solvable Lie algebra 
g. Then p(a) is Abelian. In particular, if K ts 
algebraically closed, then dim V = 1. (Thus for 
every representation (p, V) of a solvable Lie 
algebra g over an talgebraically closed field K, 
we have p(g) c t(n, K) by choosing a suitable 
basis of V.) 

(3) Cartan’s criterion of solvability: Let g 
be a subalgebra of al(n, K). Then a is solvable 
if and only if tr X Y - 0 for every X eg and 
Teo, g]. 

(4) Cartan's criterion of semisimplicity: A 
Lie algebra g is semisimple if and only if the 
Killing form B of g is tnondegenerate (i.e., 
B(X,g)20, X eg implies X =0). 

(5) Weyl's theorem: Every representation (of 
finite degree) of a semisimple Lie algebra is 
completely reducible. 

(6) Levi decomposition: Let r be the radical 
of a Lie algebra g. Then there is a semisimple 
subalgebra s of g such that g=r+s, rMs=0. 
Furthermore, such a subalgebra s is unique up 
to automorphisms of g (A. I. Mal'tsev). 

(7) Ado's theorem (orginally proved only for 
the case of characteristic 0 for K; the case of 
positive characteristic was proved by K. Iwa- 
sawa): Let g be a finite-dimensional Lie alge- 
bra over a field K. Then there exists a repre- 
sentation (p, V) of g of finite degree such that 


az p(g). 


G. Reductive Lie Algebras 


A Lie algebra g is called reductive if the radical 
t of g coincides with the center 3 of g. The 
following four conditions for a Lie algebra q 
are mutually equivalent: (i) g is reductive; (ii) 
the nilpotent radical s of g is 0; (iii) the adjoint 
representation of g is completely reducible; 
and (iv) the derived algebra [g, g] of g is semi- 
simple and g =3 + [9, a] (direct sum), where 3 is 
the center of g. 

A representation (p, V) of a reductive Lie 
algebra g is completely reducible if and only if 
p(X) is diagonalizable for every X in 3. For 
example, the Lie algebra gl(n, K) is reductive. 


H. Derivations 


A linear mapping ô:g—>g is called a derivation 
of the Lie algebra g if ([X, Y])=[6(X), Y] + 
[ X,0(Y)] for every X, Y in g. The set Dia 

of all derivations of g is a subalgebra of gl(a), 

and D(q) is called the Lie algebra of deriva- 
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tions of g. The adjoint Lie algebra ad(q) is an 
ideal of D(q), and elements of ad(a) are called 
inner derivations of g. If g is semisimple, then 
S(g) - ad(g)2 a. 

Now suppose that K =R (K =C). Then the 
group A(g) of automorphisms of g is a Lie 
group (complex Lie group), and the Lie alge- 
bra of A(g) is given by 2 (a). Lie ôe (qa) 

(c gl(g)). Then expó (e GL(g)) is in A(g). The 
connected subgroup I(g) of A(g) generated by 
(expó|óead(g)j is a *Lie subgroup of A(g). 
Furthermore, I(g) is a normal subgroup of 
A(g), called the group of inner automorphisms 
of g or the adjoint group of g. Thus ad(g) is the 
Lie algebra associated with I(q). The quotient 
group A(g)/I(g) is called the group of outer 
automorphisms of a. If q is semisimple, then 
I(g) coincides with the identity component of 
A(9). 


I. Cartan Subalgebras 


A subalgebra b of a Lie algebra g over K is 
called a Cartan subalgebra of g if (1) b is nilpo- 
tent and (ii) the normalizer n of b in g (Le., n 
—(Xegl|[X,5]c 5]) coincides with b. If K is 
algebraically closed, then for every two Cartan 
subalgebras bh, b, of g there exists an auto- 
morphism c of g such that c(b;)— b;. Fur- 
thermore, for such a o we can take an auto- 
morphism of the form o =exp(ad(A,))... 
exp(ad(A,)) (A,, ..., A, &g), where all the ad(A,) 
are nilpotent. l 


J. Universal Enveloping Algebras 


Let g be a Lie algebra over a field K. Regard- 
ing g as a vector space over K, let T(g) be the 
ttensor algebra over g. Let J be the ttwo-sided 
ideal of T(q) generated by all elements of the 
form X ® Y—Y @ X —-[X, Y] (X, Yea). The 
quotient associative algebra U(g)= T(g)/J is 
called the universal enveloping algebra of g. 
The composite of the natural mappings g> 
T(g)— U(g) is an injection a— U (g), and we 
identify g with a linear subspace of U (g) by 
this mapping. Then we have LX, Y]=X Y— YX 
(X, Yeg) in U(g). The algebra U(g) has no 
tzero divisors. In particular, if g is the Lie alge- 
bra of a connected Lie group G, then U (g) is 
isomorphic to the associative algebra of all 
Heft-invariant differential operators on G. For 
every subalgebra b of g, the universal envelop- 
ing algebra U(b) of b is isomorphic to the sub- 
algebra of the associative algebra U(g) gener- 
ated by 1 and b. If g is the direct sum of two 
Lie algebras g,, 82, then U(g) is isomorphic to 
the ttensor product U(g,) & x U(g,). Let a be 
an ideal of g, and let 9I be the two-sided ideal 
of U (g) generated by a. Then we have U(q)/2l= 
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U(g/a). Now put U, = K : 1. Define a linear sub- 
space U; of U(g) by 


U,=K-1+q+g‘g+...+ g 


Then we have Uo C U, c ..., De, (Ji Ui 
= U (g). Thus {U;} defines a filtration of U (g). 
Denote by G2 G? - G! - G? +... (G°  U,, 
G'=U,/U,_,) the tgraded ring associated 
with this filtration. Then we have g=G' c G. 
Let X,,..., X, be a basis of g, and let S = 
KLY;,,..., ¥,] bea tpolynomial ring on K in 
n indeterminates Y,,..., Y,. Then there exists a 
unique algebra homomorphism o 5 G such 
that o(1)— 1, w(¥) =X; (i— 1, ..., n). Further- 
more, c is bijective, and the ith homogeneous 
component S' is mapped by œw onto G*. Thus 
the set of monomials (X X}... X?) (i, 20, 
..,1,2 0) forms a basis of U (g) over K (the 
Poincaré-Birkhoff-Witt theorem). 

Every representation (p, V) of q over K can 
be extended to a unique representation (o, V) 
of U(g). Furthermore, p is irreducible (com- 
pletely reducible) if and only if p' is irreducible 
(completely reducible). Given two representa- 
tions p1, p; of g, p, is equivalent to p, if and 
only if pj is equivalent to p3. 

Now suppose that g is semisimple, and let 
X;, ..., X, be a basis of g. Using the Killing 
form B of g, put gj; — B(X;, Xj). Denote the 
inverse matrix of (gj) by (g"). Define ce U (g) 
by c— X. g? X; X;. The element c, called the 
Casimir element of the Lie algebra, is inde- 
pendent of the choice of the basis (X;), is a 
well-defined element of U (g), and belongs to 
the center of U(g). For every absolutely irredu- 
cible representation p of U(g), p(c) is a scalar 
operator, and tr p(c) is a positive rational 
number. 


K. Complex Semisimple Lie Algebras 


We assume that K — C, although there is no 
essential change 1f we assume that K is an 
algebraically closed field of characteristic 0. 

A subalgebra b of a complex semisimple Lie 
algebra g is a Cartan subalgebra of g if and 
only if b is a maximal Abelian subalgebra of g 
such that ad(H) is diagonalizable for every H 
in b. We fix a Cartan subalgebra b; dim b is 
called the rank of g, and we denote the linear 
space consisting of all C-valued forms on b by 
b*. For every « in b*, let 


8,— [X €g|ad(H)X —a(H)X for all H in b]. 


Then g, is a linear subspace of g, and gu =b. 
Define a subset A of b* by 


A={aeh*|a40,q, 4 {0}}. 


Then A is a finite set. Elements of A are called 
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roots of q relative to b, and A is called the root 
system of g relative to b. For every root o, g, is 
of dimension one, and g is decomposed into a 
direct sum of linear subspaces: 
g-b- > s.. 

aca 
For each root 4, a, is called the root subspace 
corresponding to o. 

The restriction B, of the Killing form B of g 
on b is nondegenerate. Hence for every 4 in b* 
there exists a unique element H. in b such that 
A(H)= B(H,, H) for all H in b. Thus we get a 
linear bijection b* b defined by 45 H;. Via 
this bijection, B, gives rise to a symmetric 
bilinear form (A, u) =(H,, H,) (4, ue *) on b*. 
Denote by bg the real linear subspace of h* 
spanned by A. Then the inner product (A, uu) 
defined on b* is positive definite on bi. Hence 
with respect to this inner product, bi is an l- 
dimensional Euclidean space, where / — dim b. 
The root system A is a finite subset of the 
Euclidean space bi. 


L. Properties of Root Systems 


(i) xe A implies —«€A. Furthermore, among 
the scalar multiples of x, only +a belong to A. 
(ii) Let à, Be A. Then 2(x, Ba, x) is a rational 
integer. (iii) Let a, BEA and fj ¥ +a. Then there 
exist unique nonnegative integers j, i such that 
(iB va|veZ] QA — (B —jo, B—(j—1)a, ...,B— 
x, B, B +a, .... B - ix]. Furthermore, j— i= 

2(a, B)/(a, x), i-- j x 3. The set {B+ Za] A is 
called the a-string of £. 

Now let ge A. Denote by w, the treflection 
mapping of bg with respect to the hyperplane 
P,—(xebg (o, x)=0}, which is orthogonal to 
a. Then we have w,(f)= B —(a*, B)ae ^ for 
every f in A, where «* = 2x/(x, x). Thus we have 
w,(A)=A for every « in A. (iv) Let x, BEA and 
Data Then the angle 0 between a and £ is 
one of the following: 30°, 45°, 60°, 90°, 120°, 
135°, 150°. Suppose, moreover, that 0 x 0 «: 90^ 
and (a, x) « (f, p). Then we have the following 
criteria: 0 = 30° <> 3(o, a) - (B, B); 0-45? <=> 
2(a, x) =(P, B); 0 — 60° <>(x, x) (B, B). (v) Let a. 
p. x+ Bed. Then [Sa gel = Dap: 

Conversely, suppose that a finite subset A of 
a finite-dimensional Euclidean space E satisfies 
conditions (1) and (i1) together with a part of 
(iii): w,(A) — A for every « in A. Then A is a 
root system of some complex semisimple Lie 
algebra. 


M. Lexicographic Linear Ordering in bi 


Let 4,,...,4, be a basis of bg over R. Define a 
linear ordering 42> on bj as follows: If 4 = 
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and only if there exists an index s (1 <s <1) 
such that é,=y; fori=1,...,s—1 and €,>y,. 
This linear ordering is called the lexicographic 
linear ordering of bi associated with the basis 
(4). Relative to this linear ordering, a root a is 
called a positive (negative) root if x >0 (a « 0). 
We denote the subset of A consisting of all 
positive (negative) roots by A* (A ). A subset 
S of A coincides with A* for some lexico- 
graphic linear ordering of bg if and only if the 
following conditions are satisfied: A - SU( — Sy; 
SO(—S)=@; a, eS, a - eA imply «+ fie S. 
A positive root oeh" is called a simple root if 
x cannot be expressed as the sum of two posi- 
tive roots. 


N. Fundamental Root Systems 


Let J be a subset of A consisting of I roots 
a,,---,%,. Then 77 is called a fundamental root 
system of A if (i) every element « of A is ex- 
pressed uniquely as an integral linear combi- 
nation of the «; («= mia), and (ii) in this ex- 
pression, m,,...,m, are either all 20 or all <0. 
For any lexicographic linear ordering of bi, 
the set of all simple roots forms a fundamental 
root system of A. Moreover, every funda- 
mental root system of A is obtained in this 
manner. Let H — a, ...,2;] be a fundamental 
root system of A. Then Za, + 2 g-., generates 
9. The Lie algebra g is not simple if and only if 
I] admits an orthogonal partition, i.e., 77 = 

I UIL, 1,4, 1,42, I O = Qj, and 
(a, [) - 0 for every «e M, and Belt, The ? 
integers aj; — —2(a;, %,)/(a;, à) (1 <i, j& I) are 
called the Cartan integers of o relative to the 
fundamental root system /7. Then we have aj; 
= 2,ajz0forizj. 


O. Borel Subalgebras and Parabolic 
Subalgebras 


Let b-54-X,.,9,. Then b is a maximal solv- 
able subalgebra of o The group /(g) acts 
transitively on the set of all maximal solvable 
subalgebras of o A maximal solvable subalge- 
bra of g is called a Borel subalgebra of g. A 

. subalgebra of g is called a parabolic subalgebra 
if it contains a Borel subalgebra of g. Now let 
(p be any subset of a given fundamental root 
system /7={a,,...,%,}. Denote by A (0) the 
set of all negative roots «= È n;o; such that 

n; — 0 for all ge in ®. Then pg — b + c4 (3 Ga 

is a parabolic subalgebra. Thus we get 2! para- 
bolic subalgebras (ps | c 77]. Every parabolic 
subalgebra is conjugate under /(a) to one and 
only one of the parabolic subalgebras [pg | € 
TT}. 
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P. Weyl’s Canonical Basis 


Let H,,..., H; be a basis of b, and let E, be 

a basis of g, for each root «. Then we have 

a basis (H;, E,} of g. Such a basis is called 
Weyl’s canonical basis if the following three 
conditions are satisfied: (i) «(H)e R (j — 1, ...,1) 
for every «€ A; (ii) the Killing form B of g 
satisfies B(E,, E_,)= —1 for every «e A; and 
(iii) if x, B, a+ BeA and [E,, Ep] - N, e, 

(N, eet then N, isin Rand N,,—N , -p 
The Lie algebra o always has Weyl's canonical 
basis. For such a basis ( H;, E,}, the linear 
space 


ac Y. R/ —1 H+} R(E,+E_,) 
Y RG/-1(E,—E.)) 


is a semisimple Lie algebra over R. The Killing 
form of q, is negative definite. Every connected 
Lie group whose Lie algebra is o, is always 
compact. Furthermore, g— a, - J/ —1 9u 

g, N J^ g, — 0. Thus g is isomorphic to the 
Lie algebra gF =C Gg o, over C obtained from 
g, by extending the basic field R to C, and g, is 
called the unitary restriction of g relative to 
Weyl’s canonical basis ( H;, E,}. 

A Lie algebra a over R is called a real form 
of g if af — C ga is isomorphic to g. When 
this is the case, g is called a complex form (or 
the complexification) of a. Note that a real 
form a of g can be regarded as a real subalge- 
bra of g such that g=a+,/—-1 a, aN, / —1 a= 
0. A real Lie algebra a is called a compact 
real Lie algebra if its Killing form is negative 
definite. A real Lie algebra is compact if and 
only if it is semisimple and is the Lie algebra of 
some compact Lie group. The Lie algebra a of 
a compact Lie group A is the direct sum of its 
center 3 and some compact Lie algebra; hence 
a is reductive. 

A compact real form g, of a complex semi- 
simple Lie algebra g is called a compact form 
of g. The group [(g) of all inner automor- 
phisms of g acts transitively on the set of all 
compact real forms of g (regarding the real 
forms of g as real subalgebras of g). 


Q. Chevalley's Canonical Basis 


A complex semisimple Lie algebra g always 
has a basis ( H;, E,} (consisting of a basis 
H,,..., H, of b together with a basis E, of g, 
for each root x) such that (i) «(H;)e Z for every 
xeA^and i=1,...,/; (ii) B(E,,E ,) -2/(a, al 

for every «e A; and (iii) if x, B, x+ fe A and 
[E,, Ej] =N, 5E,.5 (Na 5€ C), then N, pE Z and 
N, g= — N-a, . y. Such a basis {H,, E,} is called 
Chevalley's canonical basis. When we take this 
basis, the structural constants of g relative to 
{H,, E,j are all integers. Thus gz = 2 ZH; + 
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Y ZE, is a Lie algebra over Z. Furthermore, 
gg - ZRH;4 È RE, is a real form of g with the 
property that there exists a Cartan subalgebra 
ba Of gp such that for each element H in bp, all 
the eigenvalues of ad(H) on gp are contained 
in R. (In fact, we may take £ RH; as bg.) Such 
a real form of g is called a normal real form. 
The group Fg) acts transitively on the set of 
all normal real forms of g. 


R. Weyl Groups 


The reflections w, (xe A) of the Euclidean space 
bi generate a subgroup W of the group of all 
tcongruent transformations of hg. W is called 
the Weyl group of g relative to þh and is repre- 
sented faithfully as a *permutation group over 
the finite set A. Hence W is a finite group. Let 
Il — (o,,...,04] be a fundamental root system 
of A. Then W is generated by w, , ...,w,,. The 
root system A coincides with the set (w(a)| 

we W,aeIlj. Let X be the set of all fundamen- 
tal root systems of A. Then W acts on d and 
is *simply transitive on iy. If g is simple, two 
roots a, f are conjugate under W if and only if 
(a, «) - (B, P). Now let I be the complement in 
bi of the union of all the hyperplanes P, (xe A) 
orthogonal to «. Then I is a W-stable open 
subset of hg. A connected component of T is 
called a Weyl chamber. W acts on the set o 
of all Weyl chambers and is simply transitive 
on o. Let I7 — (o, ...,0j be a fundamental 
root system. Then the set {x Es | (x, x;)>0 for 
i=1,...,1} isa Weyl chamber, called the posi- 
tive Weyl chamber associated with /7. Now fix 
any lexicographic linear ordering of bg that 
has /7 as the set of simple roots. For we W, 
put A. 2 (xeA* |w(a)eA }. Denote the car- 
dinality of AZ by n(w). Then n(w) -0«&-w- 1. 
Furthermore, w can be expressed as a product 
of n(w) factors w= w, ... Wes where each factor 
is taken from {w,,,...,W,,} admitting repeti- 
tions. In fact, n(w) is the minimum length of 
the expression of w as a product w =w, ... Wy, 
(i,j=1,...,/). With respect to the generators 
Wa, ++» W4, W has the following system of 
tdefining relations: 


W2-1, 1<i<l, 
(Wa — 1, 1 Ki«j« l, 


where mj; is the order of w,,w,,. Thus if 0j; is 
the angle between a; and oj, we have m;;— 
n/(x —6;). 

Denote by T the set of all linear transfor- 
mations o of bi such that c(A) — A. Then T is 
also the subgroup of all congruent transfor- 
mations of bit; furthermore, T is a finite group, 
and W is a normal subgroup of T. Let /7 bea 
fundamental root system of A and put P= 
{ae T|c(11) = II]. Then P is a subgroup of T, 
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and we have a semidirect product T= P- W. 
Elements of P are called particular transfor- 
mations relative to /7. The group P= T/W is 
isomorphic to the group A(a)/I(a) of outer 
automorphisms of o. 


S. Classification of Complex Simple Lie 
Algebras 


Let b be a Cartan subalgebra of a complex 
semisimple Lie algebra g, and let M= (o, 
...,0] be a fundamental root system relative 
to 5b. We associate with 77 the diagram (t1- 
dimensional complex) indicated in Fig. 1. This 
diagram is called the Dynkin diagram of g 
(also called the Schláfli diagram or Coxeter 
diagram). It is constructed as follows: with 
each a; there is associated a vertex (denoted by 
a small open circle). These / vertices are con- 
nected by several segments as follows. Let 6,; 
be the angle between a; and oj. (i) If 6 150°, 
a; and a; are connected by three oriented seg- 
ments as in (1) of Fig. 1, where the orienta- 
tion means (œ; x;) (25, x;). (ti) If 8; — 135^, oi 
and a; are connected by two oriented segments 
as in (2) of Fig. 1. (iii) If 6; = 120^, o; and a; are 
connected by a nonoriented single segment as 
in (3) of Fig. 1. (iv) If 06 — 90^, a; and a; are not 
connected. 


Fig. 1 
A Dynkin diagram. 


The Dynkin diagram of g is independent of 
the choice of b, M. Furthermore, two complex 
semisimple Lie algebras are isomorphic if and 
only if they have the same Dynkin diagram. A 
complex semisimple Lie algebra is simple if 
and only if its Dynkin diagram is connected. 

Fig. 2 gives all possible Dynkin diagrams 


Qj su CE a.) Qj 
B, (1=2,3, 7) o—o——9—- 

gi es rmi Ort dj 
C, (1=3,4,; =) 06—0——0— ---- —Ó«—90 


a Q, Ou OG @ QQ Ga a a Q a 
l 2 3 5 ü 7 D t 2 3 4 
Ex E CHR F, 9—o-—9——0 
a a. 
a l 3 
4 Go O—=0 
Fig. 2 


Dynkin diagrams of simple Lie algebras. Note that 
dim g is 4,: 2 + 21; B.:2P +1; C2 +1; D:2P — E; 
E,:78; Ez: 133; Eg: 248; F4:52; G,:14. 
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associated with complex semisimple simple Lie 
algebras. There are seven categories. (The 
index l in A, means the rank of g.) Among 
these A, (I2 1), B, (l2 2), C, (I2 3), D, (12:4) are 
called classical complex simple Lie algebras. 
E, (1— 6, 7, 8), F4, and G, are called exceptional 
complex simple algebras. Note that A, z B, = 
C,, B, Cj, AZ D3, Dj =A, + A,. A, (resp. 
B;, Cj, Dj) is the Lie algebra of the complex 

Lie group SL(i+ 1, C) (SO(214- 1, C), Sp(l, C), 
SO(2I, CH. 


T. Classification of Real Simple Lie Algebras 


We refer the reader to [3] and the references at 
the end of [3] for the classification of simple 
Lie algebras over a general field k; in partic- 
ular, for k=R (— Appendix A, Tables 5.I, 

5.IT), the algebras are closely related to the 
classification of irreducible symmetric Rieman- 
nian manifolds (— 412 Symmetric Riemannian 
Spaces and Real Forms). 

In particular, since compact semisimple real 
Lie algebras g are in one-to-one correspon- 
dence (up to isomorphism) with complex semi- 
simple Lie algebras g© obtained as the com- 
plexification of a, the classification of compact 
real simple Lie algebras reduces to the classifi- 
cation of complex simple Lie algebras. Hence 
they are also represented by the same Dynkin 
diagrams. Compact real simple Lie algebras of 
the types A,, B.C and D, are called classical 
compact real simple Lie algebras. They are the 
Lie algebras of the compact Lie groups SU (1^ 
1), SO(21-- 1), Sp(I), and SO(21), respectively. 
Compact real simple Lie algebras of the type 
E, (1-6, 7,8), F4, and G, are called exceptional 
compact real simple Lie algebras. 


U. Satake Diagrams of Real Semisimple Lie 
Algebras 


Let g be a real semisimple Lie algebra, f be the 
subalgebra associated with a *naximal com- 
pact subgroup of the tadjoint group of g, and 
p be the orthogonal complement of f in q 
relative to the Killing form of o Let a bea 
maximal Abelian subalgebra contained in p, 
and let h be a Cartan subalgebra of g contain- 
ing a. Denote by a^, b€ the complexifications 
of g, b, respectively. Let o be the *semilinear 
tautomorphism of g^ defined by o(X + / —1 
Y)- X —/ —1 Y (X, Yeg). Then we have o(h°) 
=b°. Thus ø acts on the root system A of a 
relative to b€ as follows: (ca)(H) — (o H) 
(xe A). Thus c acts on (9%. There is a lexico- 
graphic linear ordering of (9% such that 
x€A* and ca z —a imply exeA*. Fix such an 
ordering, and let 77 be the set of simple roots 
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relative to the ordering. Put o = pw, where p is 
a particular transformation relative to 77 and 
w is an element in the Weyl group W. Then p 
induces a permutation of order 2 on the set 
{ae H |oa x —a}. Suppose that a vertex be- 
longing to the Dynkin diagram of 77 corre- 
sponds to a simple root « such that ga = —a. 
Then replace the vertex by a small filled circle. 
Also, if two vertices are mapped to each other 
by p, then connect the two vertices by an arc 
with two arrows on the end. The diagram thus 
obtained is called the Satake diagram of g. The 
Satake diagram of g is independent of the 
choice of f, a, b and of the ordering of (bO)&. 
Two real semisimple Lie algebras are isomor- 
phic if and only if they have the same Satake 
diagram, and g is simple if and only if its Sa- 
take diagram is connected. Thus real simple 
Lie algebras are classified by their Satake 
diagrams [16]. 


V. Iwasawa Decomposition of Real Semisimple 
Lie Algebras 


Let g be a real semisimple Lie algebra. Take f, 
a, b and the ordering of (DO) as in the con- 
struction of Satake diagrams (— Section U). 
Let n be the intersection of g with the sub- 
space X (g€),, where the sum is taken over 
x€A* such that ca x —«. Then n is a nilpotent 
subalgebra of g, and we have a decomposition 
of g into the direct sum of linear spaces: g — tf 
+a+n. This decomposition is called an Iwa- 
sawa decomposition of g. Iwasawa decom- 
positions are unique in the following sense: Let 
9 —T' +a +w be another Iwasawa decompo- 
sition; then there exists an inner automor- 
phism A of g such that Af —T', Aa=a’, An w. 
For the cohomology theory of Lie algebras 
and Lie algebras over fields of characteristic p 
>0, in particular the theory of restricted Lie 
algebras — [3]. For the relationship between 
Lie algebras and the theory of finite groups 
(e.g., Chevalley's simple groups, *Burnside 
problems) — [7] and the references therein. 


W. Representations 


Let g be a complex semisimple Lie algebra and 
b be a Cartan subalgebra of g. We fix h and a 
lexicographic linear ordering on bf. Let 77 — 
{ais ..., 04] be the set of simple roots. Since 
every representation of g is completely reduc- 
ible, we restrict ourselves to the explanation 
of irreducible representations. Let (p, V) be a 
representation of g. For each 4€b*, put V, 
—iveV|p(H)v— A(H)v for all Heb}. Then V, 
is a linear subspace of V, and 4 is called a 
weight of the representation p (relative to b) if 


248 X 
Lie Algebras 


V, z (0); then dim V, is called the multiplicity 
of the weight 4. The set of all weights of p is a 
finite W-stable subset of hä Denote this set 

by (44, ..., 4,]. Then V is decomposed into a 
direct sum: V=V, +...+ V, . The maximum 
element among (4,, ..., 4,] with respect to the 
given ordering of bj is called the highest weight 
of p. 

The following two theorems are basic in 
determining irreducible representations. 

(1) Cartan's theorem: Let A,, A, be the 
highest weights of irreducible representations 
Pi» P2 Of g, respectively. Then p, is equivalent 
to p, if and only if A, 2 A,. 

(2) Cartan-Weyl theorem: Let 4&b£. Then 
there exists an irreducible representation p of g 
which has 4 as its highest weight if and only if 
(i) 2(A, a)/(a, a) e Z for every root «eA (such an 
element 4e bg is called an integral form on bt 
and (i1) w(4) x for every we W (such an ele- 
ment A€bg is called dominant). 

These theorems lead to the concept of the 
system of fundamental representations. Put «* 
— Allge, &;), and let A,, ..., A; be the basis of 
bg dual to of, ..., af ((^;, 07) ô;). Then the 
"free Abelian group > ZA; coincides with the 
module P of all integral forms. An element 
È mA; (m,¢Z) is dominant if and only if m, > 
0, ...,m, Z0. Denote by P" the semigroup 
in P consisting of all dominant elements in P. 
For j— 1, ...,L let (pj, V) be the irreducible 
representation of g which has A, as its highest 
weight. The system (p, ..., pj) is called the 
fundamental system of irreducible representa- 
tions associated with 77. The irreducible rep- 
resentation that has A= X m;A;e P* as its 
highest weight is constructed as follows: Put 
V —V;G ... 6) V; (mth tensor power of Vj and 
V=V™@®...@ Vi". Then V can be regarded 
as a representation space of g in a natural 
manner. Let V be the smallest g-stable sub- 
space of H containing P, =(V,)n: & ... (Vg. 
Then V gives an irreducible representation of g 
with highest weight A. Thus by decomposing 
Vir ®©... © Vr", we get all irreducible rep- 
resentations of g. This is why (p, ..., pj) is 
called the fundamental system of irreducible 
representations. 


X. Relation with Representations of Compact 
Lie Groups 


Let G be a compact, connected, semisimple Lie 
group. Then every Cartan subalgebra b of the 
Lie algebra g of G is Abelian. We call dim} the 
rank of g or of G. Let H be the connected Lie 
subgroup of G associated with b. Then H isa 
maximal torus (toroidal subgroup) of G. Fur- 
thermore, every maximal torus of G is conju- 
gate to H in G. Also, every element of G is 
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conjugate to an element of H in G. Let N — 
N(H) be the *normalizer of H in G. Then 

every o € N induces an automorphism Ad(o) of 
g, which induces an automorphism (denoted 


. by the same symbol Ad(o)) of the complexifi- 


cation g of g. We have Ad(c)(bC) 2 bC, and 
Ad(o) preserves the root system of h©. Fur- 
thermore, the restriction of Ad(c) on (ÉIS is 
an element w, of the Weyl group W of a 
relative to b. Via this mapping o w,, we 
have N/H = W. Thus we identify N/H with W. 

Since (BO =./ —1 b, we can define a lexico- 
graphic linear ordering using a basis of b. Fix 
such an ordering. Now every representation p 
of G over a complex vector space V induces a 
representation dp of g over V, where dp is the 
differential of p (— 249 Lie Groups). Then we 
have a representation dp of g€ over V. We also 
call a weight A relative to © of the representa- 
tion dp of g© a weight of the representation p 
of G relative to H. Representations p,, p, of G 
are equivalent if and only if representations 
dp,, dp; of g© are equivalent. Denote by A,, 

A, the highest weights of dp,, dp,, respective- 
ly. Then p, is equivalent to p, if and only if 
A, =A. 

The module P of all integral forms in (HCH 
coincides with the set of all elements in (h°)# 
that are weights (relative to bÏ) of some repre- 
sentation of a^. Denote by P, the subset of P 
consisting of all elements in P that are weights 
(relative to H) of some representation of G. 
Then Pg is a submodule of P such that [P: P5] 
« oo. Put P§ = PN P*. Then the mapping 
(representation p) (the highest weight A of p) 
induces a bijective mapping from the set of all 
classes of irreducible representations of G onto 
B 

The exponential mapping exp:b H isa 
surjective homomorphism from b to H. The 
kernel I5 of this homomorphism is a 'lattice 
group in b of rank / (=dimb); a basis H,,..., H, 
of Tọ over Z is also a basis of b over R. Now 
define linear forms 4,,...,4, on b by (4, H)— 
ój. Then we have 


P= 2n / —1 Y Zi. 

In other words, an element Ae(bC)£ is in Py if 
and only if A(H ein, / —1 Z for every element 
H in Tg. This characterization of P, also char- 
acterizes Pg =P, P*. In particular, G is 
simply connected <> P= P eIl; = 2n / —1 : 
XZaf, where «,,...,%, are simple roots and 

af =2a;/(a;,%;) for i— 1,...,1. We also have 
G= I(g) «the group of inner automorphisms 
of ge P; = X Za; «I5 —2n4/ —1 3 Ze; where 
64, ...,& are elements of (aO) defined by (x; cj) 
= ó;. In general, we have P 5 P; 2 Za, 

2n / —1 X Za? c Fg c 2n / —1 Ze,. Fur- 
thermore, the *fundamental group z,(G) of 

G is given by P/P z I;/(2x4/ —1 X Za). Also, 
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the kernel of the adjoint representation 
G—1(G)=I(qg) is given by BX Za; z (214 / —1- 
X Zell. 


Y. Invariant Measure on G 


Keeping the definitions in this section the 
same as in the previous section, we see that 
every root « defines a representation h y,(h) 
of the group H of dimension 1 over (a), as 
follows: Let E, be a basis of (al, Then 
Ad(A)E, 5 y (h) E,. We have y,(h) ^ e*? if h 
—exp X (X €b), i.e., y, oexp— e*. We let e” 
stand for y,:e*(h) — y,(h), he H. Now let dg, dh, 
dm be the *invariant measures on G, H, M 

— G/H, respectively, normalized by 


| at | ai | inet 
G H M 


Then for every continuous function f on G, we 
have 


I 
| rou f (| fiv dm Jedi, (1) 
G W JH AJM 


where w is the order of the Weyl group W and 
f (m, h) is a function on M x H defined by 

Tim, h) 2 f(ghg ^), n2 gH. Note that f(m, h) is 
well defined. Finally, O(h) is a function on H 
defined by 


O(h) = I] (et? — e 28) 

acA 
for h=exp X (X eb), and Q(h) is called the 
density on H. Denote by D(h) the same prod- 
uct as Q(h), letting « range over A". Then 
we have Q(h) = D(h) D(h) «|D(h)|? z 0. In 
particular, if f is a tclass function on G (i.e., 
f(xyx *)= f(y) for every x, ye G), f(m, h)= 
f(h). Hence (1) is simplified into the following 
integral formula for class functions: 


1 
| none) F(h)Q(h) dh. (2) 


Z. Weyl's Character Formula 


Keeping the definitions in this section the 
same as those in the previous section, we let 
(p, V) be an irreducible representation of G 
and y, be the character of p(y,(g) tr p(g)). Let 
^ be the highest weight of p. Then since A 
determines p up to equivalence, y, must be 
determined by A. In fact, y, is given via A by 
Weyl’s character formula (3). (Note that G 

=| JgHg ^. Hence x, is determined by its 
restriction on H.) Now for he H, we have 


GA can) 
hj ; 3 
Zell) D (3) 
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where ¢, (4€ P) is the alternating sum 


£0) Y, det(w)e'"?»90,— 5 exp X. (4) 


weW 


Finally, 6 in (3) is given by 


zeAt 


that is, ô is the half-sum of the positive roots. 
In particular, we have £,(h) = D(h). Denote by 
m,(A) the multiplicity dim V, of a weight A of p. 
Then we have y,(h) 2 X, m4(A)e^(h). Further- 
more, we have m,(A)=1, m,(A)=m,(w(A)) 

(we W), and m,(A) is given by Kostant's 
formula: 


my(A)= Y, det(w)P(w(A + à) — (4 4- à)), (5) 


where for each ue P, P(y) is the number of 
ways u can be expressed as a sum of positive 
roots. Thus P(x) is the number of nonnegative 
integral solutions {k,} of u= X4. KA, 

Now suppose that (p,, Vi), (p;, V2) are 
irreducible representations of G. Their tensor 
product (p, © p2, V, © V2) is decomposed into 
a direct sum of irreducible constituents: p, © 
Das à m(u)p,, where p, is the irreducible rep- 
resentation of G which has u as its highest 
weight and mu) is the multiplicity of p, in 
p, ® p2. Then m(y) is given by Steinberg’s 
formula: 


m(g) 
— Y Y det(ww)P (w(A, +ô) 
we W w'eW 


+w(A,+6)—(u+26)), (6) 


where p, — p... p? — pa,. The partition function 
P(n) that appears in (5) and (6) satisfies the 
following recursive formula (B. Kostant): 


Pilz — 5, det(w)P(u—(ó—w(0)). (7) 


weWwzl 


References 


[1] Sém. "Sophus Lie," Ecole Norm. Sup., 
1954-1955. 

[2] N. Bourbaki, Eléments de mathématique, 
Groupe et algèbres de Lie., Actualités Sci. Ind., 
Hermann, ch. 1—3, 1960; ch. 4—6, 1968. 

[3] N. Jacobson, Lie algebras, Interscience, 
1962. 

[4] S. Helgason, Differential geometry, Lie 
groups, and symmetric spaces, Academic 
Press, 1978. 

[5] E. Cartan, Oeuvres complètes pt. I, 
Gauthier-Villars, 1952. 

[6] C. Chevalley, Theory of Lie groups I, 
Princeton Univ. Press, 1946; Théorie des 
groupes de Lie, Actualités Sci. Ind., Hermann, 
H, 1951; IM, 1954. 


249 A 
Lie Groups 


[7] I. Kaplansky, Lie algebras, Lectures on 
modern mathematics 1, T. L. Saaty (ed.), 
Wiley, 1963, 115—132. 

[8] H. Weyl, Theorie der Darstellung kon- 
tinuierlicher halb-einfacher Gruppen durch 
linearen Transformationen I-III, Math. Z., 23 
(1925), 271—309; 24 (1926), 328—376, 371—395 
(Gesammelte Abhandlungen II, Springer, 1968, 
543—647). 

[9] H. Weyl, The classical groups, Princeton 
Univ. Press, revised edition, 1946. 

[10] I. D. Ado, Über die Darstellung von 
Lieschen Gruppen durch linearen Substitu- 
tionen, Bull. Soc. Physico-Math. Kazan, (3) 7 
(1934-1935), 3—43. 

[11] F. R. Gantmacher, On the classification 
of real simple Lie groups, Mat. Sb., 5 (47) 
(1939), 217—250. 

[12] E. B. Dynkin, The structure of semisimple 
Lie algebras, Amer. Math. Soc. Transl., 17 
(1950). (Original in Russian, 1947.) 

[13] R. Bott, Homogeneous vector bundles, 
Ann. Math., (2) 66 (1957), 203—248. 

[14] I. Satake, On representations and com- 
pactifications of symmetric Riemannian 
spaces, Ann. Math., (2) 71 (1960), 77—110. 
[15] H. Freudenthal, Oktaven, Ausnahme- 
gruppen und Oktavengeometrie, Mathemat- 
isch Instituut der Rijksuniversiteitte, Utrecht, 
mimeographed note, 1951. 

[16] S. Araki, On root systems and an in- 
finitesimal classification of irreducible sym- 
metric spaces, J. Math. Osaka City Univ., 13 
(1962), 1—34. 

[17] J.-P. Serre, Lie algebras and Lie groups, 
Benjamin, 1965. 

[18] J.-P. Serre, Algébres de Lie semi-simples 
complexes, Benjamin, 1966. 

[19] J. E. Humphreys, Introduction to Lie 
algebras and representation theory, Springer, 
1972. 

Also — references in [3,4]. 


249 (IV.9) 
Lie Groups 


A. Definitions 


A set G is called a Lie group if there is given on 
G a structure satisfying the following three 
axioms: (i) G is a group; (it) G is a tparacom- 
pact, treal analytic manifold (G need not be 
connected); and (iii) the mapping G x G>G 
defined by (x, y) 5xy ^! is treal analytic (— 105 
Differentiable Manifolds). 

For simplicity, real analyticity is denoted by 
C^: for example, C?-functions, C?-mappings. 
If we replace real analyticity by tcomplex 
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analyticity in axioms (ii) and (iii), then we have 
the axioms (i), (ii), and (iii) of a complex Lie 
group. We consider the real analytic case, since 
the complex analytic case can be dealt with 
similarly. 

Every element c of G defines a mapping G 
>G given by xox (x xo), denoted by L, 
(R,) and called the left (right) translation of G 
by o. L,, R, are automorphisms of G as a C?- 
manifold. Therefore, given a tvector field X on 
G, a tdifferential form œ on G, or, in general, a 
*tensor field T on G, we can apply L, and R, 
in a natural manner to the given tensor field 
and obtain a tensor field L, T, R,T on G. A 
tensor field T on G is called left (right) invar- 
iant if L, T= T (R, T— T) for every eG. A 
tensor field on G is of class C? if it is left or 
right invariant. 


B. Lie Algebras of Lie Groups 


Let G be a Lie group. Then the set X(G) of all 
C?-vector fields on G has the structure of a 
vector space over the real number field R. 
Furthermore, with respect to the bracket oper- 
ation LX, Y]=X Y— YX (X, YeX(G)), X(G) 
forms a tLie algebra over R. Denote by g the 
subset of X(G) consisting of all left invariant 
vector fields on G. Then g is a tsubalgebra 

of the Lie algebra X(G). Thus g is also a Lie 
algebra over R. This Lie algebra g is called the 
Lie algebra of the Lie group G. The linear 
mapping from g into the ttangent space T,(G) 
of G at the identity element e given by X X, 
is bijective. Hence dim g = dim G. The Lie 
algebra g is often identified with T,(G) via this 
bijection. 


C. Simply Connected Covering Lie Groups 


For any finite-dimensional Lie algebra g over 
R, there exists a connected Lie group G that 
has g as its Lie algebra. Such Lie groups are 
all tlocally isomorphic. Among these groups, 
there exists a tsimply connected one that is 
unique up to isomorphism. This group is 
called the simply connected covering Lie group 
of the Lie algebra g. 


D. Lie Subgroups 


A subgroup H of a Lie group G is called a Lie 
subgroup if (i) H has the structure of a Lie 
group and (ii) the inclusion mapping o HG 
is a C?-mapping, and the ‘differential dq is 
injective at every point of H De, as a manifold, 
H is a 'submanifold of G). Moreover, if H is a 
connected manifold, it is called a connected Lie 
subgroup. For a subgroup H of a Lie group 
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G, there exists at most one structure of a Lie 
group on H that makes H a connected Lie 
subgroup. (If H is not assumed to be con- 
nected, then this uniqueness statement is not 
generally true.) A subgroup H of a Lie group 
G is a connected Lie subgroup if and only if 
H is arc-wise connected (M. Kuranishi and 
H. Yamabe). 

Let H be a Lie subgroup of a Lie group G. 
Then the tangent space 7,(H) is a subspace of 
T.(G). Under the isomorphism T,(G) z g men- 
tioned in Section B, the subspace T,(H) corre- 
sponds to b, which is a tsubalgebra of the Lie 
algebra g. We call þh the subalgebra of g asso- 
ciated with the Lie subgroup H, and b can 
be identified with the Lie algebra of H in a 
natural manner. By this mapping Hh, we 
have a bijection from the set of all connected 
Lie subgroups of G onto the set of all subalge- 
bras of g. For example, the connected Lie 
subgroup G’ of G corresponding to the tde- 
rived algebra o of g is the Àcommutator sub- 
group of G. The Lie algebra of a normal sub- 
group of G is an fideal of g. Conversely, if the 
Lie algebra b of a connected Lie subgroup H 
of a connected Lie group G is an ideal of g, 
then H is a normal subgroup of G. 

A connected Lie group G is called semi- 
simple, simple, solvable, nilpotent, or Abelian 
(or commutative), respectively, whenever the 
Lie algebra g is (— 248 Lie Algebras). This 
definition of solvability, nilpotency, and com- 
mutativity agrees with the corresponding 
definition in which G is regarded as an ab- 
stract group. 


E. Closed Subgroups 


The topology of a Lie subgroup H (which we 
call the inner topology of H) as a submanifold 
of a Lie group G need not coincide with the 
relative topology of H regarded as a subspace 
of a topological space G. The inner topology 
of H coincides with the relative topology of H 
if and only if H is closed in G. Conversely, for 
every closed subgroup H of G, H has the struc- 
ture of a Lie subgroup such that the inner 
topology and relative topology coincide (E. 
Cartan). Moreover, such a structure of a Lie 
subgroup on H is unique. We always regard 
closed subgroups of a Lie group as Lie sub- 
groups in this sense. Also, we denote the Lie 
algebras of Lie groups G, H,... by the corre- 
sponding lower-case German letters g, b, ... 


F. Homogeneous Spaces 


Let G be a Lie group, and let H be a closed 
subgroup of G. Then the tquotient topological 
space (coset space) M — G/H of the topological 
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group G modulo H has the structure of a C?- 
manifold such that the canonical mapping 
1:G— M and the action of Gon M:G x M^ 
M (defined by (g, xH)gxH) are both C?- 
mappings. Furthermore, such a C?-manifold 
structure on M is unique. The C?-manifold M 
thus obtained is called the homogeneous space 
of G over H (— 199 Homogeneous Spaces). 
Put z(e) — p. Then b is the kernel of the dif- 
ferential mapping T,(G) ^ g> T,(M). Thus we 
can identify the tangent space T,(M) of M 

— G/H at p with the quotient linear space g/h 
via this differential mapping. 


G. Quotient Lie Groups 


Suppose that H is a closed normal subgroup 
of a Lie group G. Then G/H has the structure 
of a quotient group together with the structure 
of a manifold as a homogeneous space. Now 
G/H 1s a Lie group with respect to these two 
structures. The Lie group G/H is called the 
quotient Lie group of G over H. In this case, b 
is an 'ideal of g, and the quotient Lie algebra 
g/b is isomorphic to the Lie algebra of the Lie 
group G/H. 


H. Direct Product of Lie Groups 


Let G,, G, be Lie groups. Then the direct 
product G, x G, satisfies axioms (i), (ii), and 
(iii) of a Lie group (— Section A) in a natural 
manner. The Lie group G, x G, thus obtained 
is called the direct product of the Lie groups 
G,, G,. The Lie algebra of G, x G, can be 
identified with the direct sum of a,, 82- 


I. Cartan Subgroups 


A subgroup H of a group G is called a Cartan 
subgroup of G if H is a maximal nilpotent 
subgroup of G and moreover, for every sub- 
group H, of H of finite index in H, [N(H,): 
H,]< œ, where N(H,)={geG|gH,g"'=H,} 
is the tnormalizer of H, in G. A connected Lie 
group G always has a Cartan subgroup. Fur- 
thermore, every Cartan subgroup H of G is 
closed; hence H is a Lie subgroup. The Lie 
algebra hof H is a tCartan subalgebra of 

the Lie algebra g. This mapping H >} is a 
bijection from the set of all Cartan subgroups 
of G onto the set of all Cartan subalgebras of 


a [7]. 


J. Borel Subgroups 


Let G be a complex connected semisimple Lie 
group. A maximal connected solvable Lie 
subgroup of G is called a Borel subgroup of G. 


249 K 
Lie Groups 


Any two Borel subgroups are conjugate in G. 
A subgroup P of G is called a parabolic sub- 
group if P contains a Borel subgroup of G. 
Every parabolic subgroup is a connected 
closed Lie subgroup of G. 


K. Simple Examples 


Let V be a finite-dimensional vector space 
over R. Let (V) be the tassociative algebra 

of linear endomorphisms of V. The 'general 
linear group GL(V)— (xe 9(V)|detx 40} over 
V is a Lie group. The Lie algebra of GL(V) is 
denoted by alt HL We can identify aI(V) with 
L(V), and we have [X, Y] 2 X Y — YX for all 
X, Yegl(V). If dim V=n, GL(V) may be identi- 
fied with the group GL(n, R) of all real nonsin- 
gular n x n matrices. Then gl(V) may be identi- 
fied with the Lie algebra gl(n, R) of all n x n 
real matrices. 

The following are examples of (closed) Lie 
subgroups of the general linear group: the 
'special linear group SL(n, R), whose Lie alge- 
bra is {X egl(n, R)|tr X 20]; the tunitary 
group U(n), whose Lie algebra is {X egl(n, CH 
'X +X =0}; the torthogonal group O(n), whose 
Lie algebra is {X egl(n, R)|'X +X =0}; and the 
'symplectic group Sp(n), whose Lie algebra is 
{X egl(2n, C)|J X +'XJ=0,'X + X =0}, where 


FO 


and J is the n x n unit matrix. 

Examples of complex Lie groups are: the 
complex ‘general linear group GL(n, C), whose 
Lie algebra is gl(n, C); the complex tortho- 
gonal group O(n, C), whose Lie algebra is 
{X egl(n, C)|'X + X 20]; and the complex 
'sympletic group Sp(n, C), whose Lie algebra is 
{X egl(2n, C)| JX +'X J =0}, where J is the 
matrix given above. 


L. Compact Simple Lie Groups 


If a connected Lie group G has a tcompact 
real Lie algebra, then G is compact. Thus for 
each compact real simple Lie algebra g, the 
simply connected covering Lie group G is 
compact, and the center of G is a finite group. 
The connected compact Lie groups SU (I^ 1), 
SO(2l+ 1), Sp(l), SO(2I) have, respectively, the 
compact real simple Lie algebras A, (/> 1), Bj 
(12 2), C (l> 3), D, (L2 4) as their Lie algebras; 
these groups are called classical compact 
simple Lie groups. Sp(n) (n z 2) and SU (n) 

(nz 2) are simply connected, but SO(n) (n=3 
or nz 5) is not, and has the fundamental 
group of order 2. The universal covering group 
of SO(n) De, the simply connected, connected 
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Lie group that has the same Lie algebra as 
SO (n)) is Spin(n). The connected compact Lie 
groups that have E, (1—6, 7,8), F,, or G, as 
their Lie algebras are called exceptional com- 
pact simple Lie groups. The simply connected 
covering Lie groups of Eg, F4, G, have the 
identity group {e} as their center. Hence they 
coincide with their tadjoint groups. The simply 
connected covering Lie groups E,, E- have as 
their centers the groups of orders 3 and 2, 
respectively. Hence their adjoint groups are 
not simply connected. 


M. Complex Simple Lie Groups 


For a complex Lie group G, the definition of 
the Lie algebra g is similar to that for real Lie 
groups. Then g is a Lie algebra over the com- 
plex number field C and is called the complex 
Lie algebra of the complex Lie group G. The 
complex connected Lie groups SL(/+ 1, C), 
SO(2l 4 1, C), Sp(l, C), SO(2I, C), which have the 
*classical complex simple Lie algebras A,, Bj, 
C, D, as their Lie algebras, respectively, are 
called classical complex simple Lie groups. 
Complex connected Lie groups that have 
*exceptional complex simple Lie algebras E, 
(1— 6, 7,8), F4, G, as their Lie algebras are 
called exceptional complex simple Lie groups. 


N. Homomorphisms 


A mapping o:6G,—G; from a Lie group G, 
into a Lie group G, is called an analytic homo- 
morphism (C°-homomorphism) if (i) o is a 
homomorphism between groups and (ii) o is a 
C?-mapping between manifolds. Moreover, if 
ọ is bijective and ~} is also a C?-mapping, 
then ¢ is called an analytic isomorphism (C°- 
isomorphism). Two Lie groups G,, G, are said 
to be isomorphic as Lie groups if there exists a 
C?-isomorphism from G, onto G,; we denote 
this situation by G, = G;. Now let 9:6, G; 
be a C?-homomorphism. Then the *differential 
(do), : Te (G,) Te (G;) induces a Lie algebra 
homomorphism du o, >g2. The kernel of dq 
is the Lie algebra of the kernel of o. Suppose 
that G, is connected and simply connected. 
Then for every Lie group G, and every Lie 
algebra homomorphism y : 9, 9;, there exists 
a unique C?-homomorphism 9: G, — G, such 
that y —dq. We can replace condition (ii) in 
the definition of C?-homomorphism by the 
following weaker one: (ii') o is a continuous 
mapping. In particular, for a given topological 
group G, there exists at most one structure of a 
Lie group on G preserving the group structure 
and the topology. A topological group G 
admits a structure of a Lie group (preserving 
the group structure and the topology) if and 
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only if G is a tlocally Euclidean topological 
group (— 423 Topological Groups N). 


O. Representations 


Let G be a Lie group, and let V be a finite- 
dimensional vector space over C(R). Then a 
continuous (hence C?-) homomorphism p:G 
—GL(V) is called a complex (real) represen- 
tation of G. The linear space V is called the 
representation space of p, and dim V is called 
the degree of p. To be more precise, a represen- 
tation is denoted by (p, V) instead of p. The 
function y,: GC defined by y,(g) —tr p(g) 
(gc G) is called the character of p (— 362 Rep- 
resentations). The representation (p, V) of G 
gives rise to a representation (dp, V) of g. Sup- 
pose G is connected. Then two representations 
(p V) (i2 1,2) of G are tequivalent if and only 
if the representations dp, and dp, of g are 
equivalent. For the !direct sum of representa- 
tions and the contragredient representation, 
we have d(p, + p;) 2 dp, + dp; dp !)— —'(dp). 
For the *tensor product of representations, we 
have (d(p, & p5))(X) 2 (dp1)(X) © ly, + ly, e) 
(dp,)(X). (For representations of compact Lie 
groups — 248 Lie Algebras W.) 


P. Adjoint Representations 


An element c of a Lie group G defines an 
analytic automorphism $,:x—0oxo ! of G. 
The ‘differential Ad(o) of o, ts an automor- 
phism of the Lie algebra g. Since Ad(o)e GL(g), 
we have a representation og Ad(a) of G whose 
representation space is g. This is called the 
adjoint representation of G, and the image 
Ad(G) of G is called the adjoint group of G. 
Ad(G) is a Lie subgroup of GL(q). The ‘center 
Z of G is a closed subgroup of G, and G/Z x 
Ad(G). The analytic homomorphism o — 
Ad(o) from G into GL(q) gives rise to a Lie 
algebra homomorphism g- gl(g) by taking the 
differential of o — Ad(oc). Denote this Lie alge- 
bra homomorphism by X —ad(X) eau) 
Then ad(X) Y 2 [ X, Y] for every X, Y in g. 
Thus ad: g—gl(g) coincides with the fadjoint 
representation of the Lie algebra g. If G is 
connected, then the Lie subalgebra ad(g) (the 
tadjoint Lie algebra of g) of al(a) is associated 
with the connected Lie subgroup Ad(G) of 
GL(g). Thus the adjoint group Ad(G) coincides 
with the group /(q) of tinner automorphisms 
of g (the tadjoint group of g). 

If G is connected and semisimple, g is also 
semisimple. Furthermore, we have D(g)= 
ad(g). Hence the adjoint group Ad(G)= fia 
of G coincides with the connected component 
of the tautomorphism group A(g) of g contain- 
ing the identity element. If G is compact or 
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semisimple, then every representation of G is 
completely reducible. 

Let G be a connected Lie group of dimen- 
sion n, g be the *Lie algebra of G, and Ad CG 
—GL(a) be the adjoint representation of G. 
Put det((t + 1)! — Ad(x)) = 27-9 t/D,(x) for 
every xeG. Then each D; is an analytic func- 
tion on G, and D,,=1. Let I be the least integer 
such that D, #0. Then / 2 rank G— rank g. An 
element x of G is called regular (singular) if 
D,(x) 50 (Dj(x) - 0). The subset G* of G con- 
sisting of all regular elements of G is open and 
dense in G. The subset G — G* of G consisting 
of all singular elements of G is of measure zero 
with respect to the left-invariant *Haar mea- 
sure of G. 

Now suppose that g is treductive. Then an 
element xeG is regular if and only if the cen- 
tralizer 3, — (X ea| Ad(x) X =0} is a tCartan 
subalgebra of g. Furthermore, if x € G is regu- 
lar, then Ad(x) is a semisimple linear endomor- 
phism of g. 


Q. Canonical Coordinates 


With each element X of the Lie algebra g of 
the Lie group G there ts associated a one- 
parameter subgroup of G, i.e., a continuous 
homomorphism t g(t) from the additive 
group R of real numbers into G, such that 
do(X9)— X, where X,— d/dt is the basis of 

the Lie algebra of R. Furthermore, the con- 
tinuous homomorphism q is unique. Putting 
Q(1)— exp X, we define the exponential map- 
ping exp: 9 G. Then we have o(t)  exptX 
for every t€ R. In particular, for the case G = 
GL(n, C), g — gl(n, C), we have exp X 2 3 X "/m! 
for every X eg. Thus exp X coincides with the 
usual exponential mapping of matrices (— 269 
Matrices). The mapping exp:g3G is a C®- 
mapping whose differential at X —O is a bijec- 
tion from Ty(g) 2 g onto T,(G). Thus for each 
basis X,, ... X, of g, there exists a positive 

real number e with the following property: 
{exp(X x, Xj) ||xi| «e (i L ...,n)j is an open 
neighborhood of the identity element e in G on 
which e 2 exp(X x; Xj) (x,, ..., x) (Ix;| « & 
i=1,...,n) is a !lecal coordinate system. These 
local coordinates are called the canonical 
coordinates of the first kind associated with 
the basis (X;) of g. Similarly, we have a local 
coordinate system t=(expx, X,)...(expx,X,) 
—(Xx,,---,X,) in a neighborhood of e; these 

Xj, ..., X, are called the canonical coordinates 
of the second kind associated with the basis 
(X) of g. 

Let $:G,—G,; be a continuous homomor- 
phism from a Lie group G, into a Lie group 
G5. Then p(exp X) — exp(do(X)) for every 
X €g,. The Lie subalgebra b associated with a 
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connected Lie subgroup H of G is character- 
ized by using the exponential mapping as 
follows: b = (X eg|exptX EH for all teR}. 


R. Multiplication Functions 


Fix a basis X,, ..., X, of the Lie algebra g of a 
Lie group G. Let (u,, ..., u,) be the canonical 
coordinates of the first kind associated with 
the basis (X;). With respect to this coordinate 
system, let (x,), (y;), (z;) be the coordinates of 
the elements o, t, o1, respectively. Then each z; 
is a real analytic function in (x,, ..., x,; 

yp ss Yn). The * Taylor expansion of z; at 

X, =X; =... = y, =y,=...=0 is given as 
follows: Put Sc x, X, T— X y; X;. Then 


oo 


1 
i5 (S T'u; 
PT Jo 


Z 


= x; + y; - (STuj)g + (terms of degree > 3). 


Furthermore, let (w;) be the coordinates of 
t lo ‘to. Then we have 


w; -([T, Ale + (terms of degree > 3). 


Let g* be the set of all R-valued linear forms 
On g, that is, g* is the dual vector space of g. 
We can identify the elements of g* with the 
left-invariant tdifferential forms of degree 1 on 
G. These forms are called Maurer-Cartan 
differential forms. Let c, ..., ©, be a basis of 
of which is dual to the basis X,,..., X, of g: 
O(Xj) — à; (1 <i,j <n). The exterior derivative 
dc of c, is a left-invariant differential form 
of degree 2 on G and is expressed as a linear 
combination of the texterior products œ; ^ a; 
as follows: 


do, = Ebenen (1) 
where (cf) are the "structural constants of g 
with respect to the basis (X;), 1e., [X;, X;] 
es ZAC E, 

Using the canonical coordinates of the first 
kind (uj) associated with the basis (Xj), put c, 
— ax(u, du) =X, A,(u)du;. Then the matrix 9I 
=(A,,(u)) is given by 


E x 
WE a re (2) 
where X =(c;,(u)), ci (u) = Ly c5, u,, ie, —X is the 
matrix of ad(X u, X,) relative to the basis (X;). 
Thus, in particular, if G is a nilpotent group, X 
is a fnilpotent matrix. 

Once the functions A,,(u) are known, then 
the functions z; 2 f(X], ..., X4: yi. Yn) de- 
scribing the multiplication in G (note that 
e (xj), t (yj, ot e (z;)) are obtained as 
follows. By the left-invariance of the w;, we 
have the following Maurer-Cartan system 
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of differential equations: 
w,(z, dz) = @;(y, dy), 1 SES? (3) 
Regarding x,, ..., x, as parameters, put z; 
— Qi y 4, ..., Yp) Then (3) is equivalent to 

CQ: 
YA) = Agi), Leiken (3) 
j OV, 


Now (3’) is tcompletely integrable, by (1). By 
solving the system (3^) of differential equations 
under the initial conditions 


Q;(0, ..., 0) — xj, Leien, (4) 


we obtain the multiplication functions z; 
-fxi. eX Ya avy Yn) {1}. 

By the exponential mapping exp:g—G from 
the Lie algebra g of a Lie group G, a suffi- 
ciently small neighborhood U of the zero 
element of g is mapped bijectively and bi- 
real-analytically onto a neighborhood V of the 
identity element of G (— Section Q above). 
Now by taking U sufficiently small, the prod- 
uct (exp X) (exp Y) for X, Y in U can be 
explicitly given in the form exp Z by the 
Campbell-Hausdorff formula. Namely, Zeg 
is given by 


Z=c,(X, Y)+c,(X, Y)+..., 
where 
cy (X, Y)=X+Y 


and c,, ¢3,... are defined recursively by 
1 
(n+ De,.4 (X, Y)-4LX — Y, c,d 


de 


pzl,2pxn 


Ky), [ens b= Leck, X + YI] 


where the second summation is taken over 

all 2p-tuples (k,, ..., k;,) of positive integers 
k,,...,k2, satisfying kı +... -- k;,— n. Here the 
K,, K4, K,,... are rational numbers defined 
by the Taylor expansion 


Z 





1 oo 
—~z=1+ Y K3”. 
l-e? 2 p i 


For example, 


1 
co=5L%, Y], 


1 
C3 = UA Y], Y] 


1 
"ll3, Y),X], 


1 
C4 = bz LX, [X,Y 11] 
1 
-gA LY, LX, Y]]]. 


etc. [23]. 
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S. Maximal Compact Subgroups 


Any compact subgroup of a connected Lie 
group G is contained in a maximal compact 
subgroup of G. Any two maximal compact 
subgroups of G are conjugate in G. Let K bea 
maximal compact subgroup of G. Then K is 
connected and G is homeomorphic to the 
direct product of K with a Euclidean space R" 
(Cartan-Mal'tsev-Iwasawa theorem). 


T. Iwasawa Decomposition 


Let G be a connected semisimple Lie group. 
Suppose that the center of G is a finite group. 
Let g=f+a+n be an tlwasawa decompo- 
sition of the Lie algebra g of G (— 248 Lie 
Algebras). Then the connected Lie subgroups 
K, A, N associated with f, a, n, respectively, are 
all closed subgroups of G. Furthermore, K isa 
maximal compact subgroup of G, A is isomor- 
phic to the additive group R? for a suitable s, 
and N is a simply connected nilpotent Lie 
group. The mapping K x A x NG given by 
(k, a, n) ^ kan is bijective and is an isomorphism 
of analytic manifolds. The decomposition G 

— KAN is called an Iwasawa decomposition of 
G. An Iwasawa decomposition is unique in the 
following sense: Let G= K’A’N’ be another 
Iwasawa decomposition. Then there exists an 
element g in G such that gKg ! - K', gAg ! 
—A', gNg ! « N' [9,3]. 


U. Complexification of Compact Lie Groups 


Let G be a compact Lie group. Denote by 
C(G) the commutative associative algebra of 
all C-valued continuous functions defined on 
G relative to the usual multiplication in C(G). 
Then for each eet, L,(R,) acts on C(G) by 
L,f=foL, (R,f=foR,). Now put o(G)— 
Lf eC(G)| dim Eseg CL, f « oo). Then o(G) is 

a subalgebra of C(G), called the representative 
ring of G. Elements of 0(G) are called represen- 
tative functions of G because for an element 

f €C(G), feo(G) if and only if there exists a 
continuous matrix representation o —(d;/(a)) of 
G such that f is a C-linear combination of the 
d;. For each e eG, L, and R, preserve o(G). 
With respect to the uniform norm |||, = 
max,.gl/f(o)| of C(G), o(G) is everywhere 

dense in C(G) (Peter-WeyI theory) (— 69 Com- 
pact Groups B). This implies the existence of a 
faithful ( — injective) representation p: G9 
GL(n, C) of G. Furthermore, o(G) is a finitely 
generated algebra over C. Thus there exists a 
representation o (d;i(c)) such that o(G) is 
generated over C by the d;;. Such a representa- 
tion is called a generating representation. 
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Denote the group of all automorphisms of 
the algebra o(G) by A. Let G be the centralizer 
in A of the subgroup (L,|o € G] of A. Then we 
have a bijective mapping x—>«' defined by 
a'(f)=(a(f))(e) (f eo(G)) from the group G 
onto the set Homo(o(G), C), which is an affine 
variety. Thus every generating representation 
p:0>(4,(6)); <ij<n defines a faithful matrix 
representation p of G by Pa) — (e (di): <ijcn 
by means of the bijection «>a’ from G onto 
Hom (o(G), C). Furthermore, the image oC) 
is an talgebraic subgroup of GL (n, C) given 
by (f(di)i <ij<n| BE Hom,(0(G), C)}. Thus G 
has the structure of a ‘linear algebraic group, 
which is actually independent of the choice of 
a generating representation. (Hence G also has 
the structure of a complex Lie group.) Now a 
—R, defines an injective continuous homo- 
morphism from G into G, and we can regard 
G as a subgroup of G. Then for «eG, a is in G 
if and only if «( f) 2 a( f) for every f in o(G) 
(Tannaka duality theorem) (— 69 Compact 
Groups D). G is a maximal compact subgroup 
of G, and every maximal compact subgroup of 
G is conjugate to G in G. Then complex Lie 
algebra § of G is isomorphic to the complexifi- 
cation g€— € Gg 9 of the Lie algebra g of G. 

G is homeomorphic to the direct product of G 
with a Euclidean space RY, where N — dim G. 
For a complex analytic function o defined on 
G, o =0< Q |, — 0. In particular, G is the clo- 
sure of G relative to the tZariski topology of 
G. The group G is called the Chevalley com- 
plexification of G, which we denote by GC. Let 
G,, G, be compact Lie groups, and let o:G, 

— G; be a continuous homomorphism. Then o 
can be extended uniquely to a rational homo- 
morphism o*:Gf 5 G. In particular, every 
complex representation (p, V) can be extended 
uniquely to a frational representation (p^, V) 
of G*. Every complex analytic representation 
p of G* is a rational representation, and f 

— p, where p= p |;. Thus f is completely 
reducible. Also, we have a bijection from the 
classes of irreducible continuous representa- 
tions of G onto the classes of irreducible 
complex analytic representations of G^. For a 
closed subgroup H of G, the complexification 
H€ of H coincides with the closure of H in GC 
relative to the Zariski topology of GC. 

If (p, V) is a generating representation of 
G, then GE = pg* (GO) c GL(V). Furthermore, 
pf (G9) is an algebraic subgroup of GL(V) that 
is completely reducible on V. Conversely, let 
F be an algebraic subgroup of GL(V) that is 
completely reducible on V. Then there exists 
a compact Lie group G such that GC =F (as 
algebraic groups). If an algebraic subgroup F 
of GL(n, C) satisfies 'F = F, then F = GC, where 
G — FOU (n). Now let F be a connected semi- 
simple complex Lie group. Then F = G“ for 
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every maximal compact subgroup G of F. In 
particular, F has a faithful representation. 

For example, let G = U (n), O(n), SO(n), Sp(n), 
respectively. Then GE = GL(n, C), O(n, C), 
SO(n, C), Sp(n, C), respectively [1]. 


V. History 


S. Lie was the first to consider Lie groups, in 
the late 19th century; at that time they were 
called continuous groups. His motivation was 
to treat the various geometries from a group- 
theoretic point of view (— 137 Erlangen Pro- 
gram) and to investigate the relationship be- 
tween differential equations and the group of 
transformations preserving their solutions. Lie 
groups were studied locally, and the notion of 
Lie algebras was introduced. It was observed 
that the properties of Lie groups reflect the 
properties of Lie algebras to a large degree. 
Even in this early stage, the notions of solvable 
and semisimple Lie algebras were introduced 
together with proofs of basic properties. The 
Galois theory of linear differential equations 
(by E. Vessiot and others) is contained in Lie's 
theory. Early in the 20th century, the theory of 
infinite-dimensional Lie groups was studied by 
E. Cartan. After him, however, there was a 
long lull until it was taken up again by M. 
Kuranishi and others in the 1950s. We re- 
strict ourselves to the consideration of finite- 
dimensional Lie groups. 

From 1900 to 1930, E. Cartan and H. Weyl 
obtained a complete classification of the semi- 
simple Lie algebras and determined their 
representations and characters. They also 
devised useful methods for investigating the 
structure of these algebras, and did pioneering 
work on global structures of the underlying 
manifolds of Lie groups. After them (1930— 
1950), these results were systematized and 
refined by C. Chevalley, Harish-Chandra, and 
others. In the same period, K. Iwasawa [9] 
clarified Cartan's idea, showing that the only 
Lie groups that are topologically important 
are compact Lie groups. He also obtained the 
Iwasawa decomposition, which has become 
a basic tool in the study of semisimple Lie 
groups. At the same time, Iwasawa contri- 
buted to Hilbert's fifth problem, which seeks to 
characterize Lie groups among topological 
groups. This problem was solved by A. M. 
Gleason, D. Montgomery, L. Zippin, and H. 
Yamabe in 1952. Since 1950, the topological 
properties of Lie groups have attracted con- 
siderable attention. The methods of Cartan, 
who postulated de Rham's theory, and H. 
Hopf, who used the properties of groups ex- 
tensively, were succeeded by systematic appli- 
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cations of the general theory of algebraic top- 
ology. In particular, the topological theory of 
‘fiber bundles was applied to homogeneous 
spaces G/H by A. Borel and others. Thus 
homology groups of Lie groups were com- 
pletely determined, and homotopy groups of 
Lie groups were determined to a considerable 
extent. At the present time, the following fields, 
rather than Lie groups themselves, are objects 
of extensive study: structure and analysis 

of homogeneous spaces, algebraic groups, 
infinite-dimensional unitary representations, 
and finite or discrete subgroups (— 13 Alge- 
braic Groups, 109 Differential Geometry, 

427 Topology of Lie Groups and Homogen- 
eous Spaces, 437 Unitary Representations). 
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250 (XVII.3) 
Limit Theorems in 
Probability Theory 


A. General Remarks 


Let S, be the number of successes in n *Ber- 
noulli trials with probability p for success. 
Then the classical de Moivre-Laplace limit 
theorem implies that 


$,— 1 s 3 
E zd ex)- | e VI dy 
np(1-— p) JJ 27 J - o 


(n oo), 





for all x. It is also known that for any e>0 


P([n ! S, pls et (n oo), 


which is a special case of the law of large num- 
bers. General forms of such convergences of 
sequences of frandom variables are explained 
in this article. 

In fprobability theory, theorems concerning 
*convergence in distribution, ‘convergence in 
probability, and *almost sure convergence 
of sequences of "random variables are sub- 
sumed under the generic term of limit theo- 
rems. When the fprobability distributions of a 
sequence of random variables converge to the 
distribution F (— 341 Probability Measures 
F), then F is called the limit distribution. 
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B. Convergence in Distribution of Sums of 
Independent Random Variables 


A sequence of random variables {X,,} (1 «k« 
k,,n 2 1) (k,— 00) is said to be infinitesimal if 
max, ut, DU Kal = £)-50 (n> oo) for every 
£7 0. When X,,,..., Aa, are tindependent 

for every n and {X,,} (1€ k <k, nz 1) is infi- 
nitesimal, the set of all limit distributions for 
the sums S,=X,,, +... + X,,, — A, (the A, are 
suitable constants) coincides with the class 

of infinitely divisible distributions (— 341 
Probability Measures G). Suppose that the 
tcharacteristic function f of an infinitely divis- 
ible distribution F is given by *Lévy's canon- 
ical form 





and the distribution function of X,, is Fal 
Then a necessary and sufficient condition for 
the distribution of S$,— y, to converge to F for 
some sequence {y,} is that (i) for every tcon- 
tinuity point x of M(x) and N(x), 


x 


` FA) M(x) (x<0), 


k 


li 
- 


kn 
x (F,(x)-1)2N(x) (x20) 
kn 
lim limsup 3 (| x? dF,(x) 
Ix|<e 


£290 noo k=l 
2 
«f dun) 
Ix| «e 


kn 
=limliminf 3. (| x? dF, (x) 
xi «e 


£20 no pry 
2 
«(f sar ) 
[xi <e 


Applying these results to special distribution 
functions, we obtain various kinds of limit 
distributions. 


Weer” 


Lg? 


(1) The Central Limit Theorem. In the cen- 
tral limit theorem, the limit distribution is a 
*normal distribution. À necessary and suffi- 
cient condition for the distributions of the 
sums $,— B, ! (X, - ...—- X,) — A, (B, oo) ofa 
sequence of independent random variables 
{Xn} to converge to the tstandard normal 
distribution N(0, 1) and for (B,! X,) (1<k<n, 
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nz 1) to become infinitesimal is that 


n 


lim Y | dF,(x)-0 and 
EE 


Dam Kei 
lim B;? Y (| CUR 
nie k=1 |x|<eB, 


-(| sano) )- l, 
|x| «B, 


where F,(x) is the distribution function of X,. 
When the variance of each X, is finite, we put 
n 


B; = x V(X;), 


k-1 


A, =B," > E(X,), 

k=1 
where V(X), E(X) stand for the variance and 
mean of X, respectively. Then the necessary 


and sufficient condition is replaced by the 
Lindeberg condition: 


lim B, ? y 


Heron k=1 Jix|>eB, 


x? dF, (x + E(X,))=0 


for every £2 0. 


In particular, if the X, have the same distri- 
bution with finite variance, the corresponding 
Lindeberg condition is satisfied, and the cen- 
tral limit theorem holds. When the variables 
give outcomes of independent ! Bernoulli trials, 
the proposition is reduced to the classical 
theorem of de Moivre and Laplace. Moreover, 
if X, has the finite tabsolute moment m$? ,— 
E(| X,|?*?) of order 2-- ó and E(X,) —0, then 
the Lindeberg condition is implied by the 
Lyapunov condition 


n 
B,O*9 Y mPa >O (n=). 
k=1 


(2) The Law of Small Numbers. In order that 
the distributions of the sums S, = X,,, +...+ 

X „k, Of infinitesimal independent random 
variables converge to the fPoisson distribution 
P(A) with mean 4, it is necessary and sufficient 
that for every ce 1), 


R,-íx||x-1|2&  |x|2e}; 


n 


lim Y | dF, (x) — å; 
ix-1|«& 


xdF,,(x) 20; and 


2 
«(| m) Jes 
Ix|«& 


From this proposition follows the classical law 
of small numbers: If the probability of success 
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p, for the kth outcome of independent trials 
satisfies kp, — ^ with constant 4 independent of 
k, then the total number S, of success up to the 
nth trial converges in distribution to the Pois- 
son distribution with mean 2. 


(3) The Law of Large Numbers. In the law of 
large numbers the limit distribution is the *unit 
distribution. Given a sequence of independent 
random variables X, with distribution func- 
tion F,(x) with mean a,, a necessary and suffi- 
cient condition for n ! X2.,(X, — a,) to con- 
verge to 0 in probability is that 


n 


lim 3 | dF,(x + a,)=0, 
|x| 7n 


no pry 


— Lä 
lim — ad xdF,(x +a,)=0, and 
Ix| n 


eat EA) 


1 n 
lim — 3 (| x? dF,(x + aj) 
|x|<n 


(Í siia) Jes 
xin 


In particular, this is the case when (1) X, has 
the finite variance V(X,) and n 7 Et, V(X,) 
—0; or (it) the X,, nz 1, obey the same distri- 
bution with finite mean. 


(4) Convergence to Quasistable Distributions. 
The set of limit distributions for the sums S, = 
B, (X, +... +X,)— A, of identically distri- 
buted independent random variables | X,j 
(with suitable constants A,, B,) coincides with 
the set of quasistable distributions (— 341 
Probability Measures G). If G(x) is the distri- 
bution function of the X;, in order for the limit 
distribution to be normal it is necessary and 
sufficient that 


lim K| dis f x? dG(x)=0. 
Kore |xi- K Ix|<K 


In order for the limit distribution to be quasi- 
stable with index « (0 « a « 2) it is necessary 
and sufficient that 

G( T x) Cy 


lim =—, 
x-x 1—G(x) c, 





c,+c,>0; and 


~ 1—G(x)+G(—x) `, 
lim =a" for every a>0. 
xox | —G(ax)+ G( — ax) 





In this case the characteristic function of the 
limit distribution is given by Lévy’s canonical 
form with M(x)-c,Ix| *, N(x)2 —c,|x|7%, 

e —0. 


(5) Refinement of Central Limit Theorem. Let 
{Xa} be a sequence of identically distributed 
independent random variables with E(X;)— 0, 
o? =V(X,), E(| X;|?) « oc. Let ®,(x) be the 
distribution function of S, (X, +... + X,)/ 
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c4/ n and ®(x) the distribution function of the 
normal distribution N(0, 1). Then we have 


®,,(x) — B(x) 


2 

CEDE GO + R(x)) +o (=) 

Zen Vn 
uniformly in x, where Q(x) 2 (1 — x?) E(X)/6e?; 
R(x) is identically zero when X; does not have 
a lattice distribution; but R(x)=do™!R,((x+ 
ao /n d), R (x) - [x] —x4- 1/2, and a, = 
~ n ac ! when X; has the lattice distribu- 
tion with maximal span d: P(X;e{a+kd|k= 
0, +1,...})=1. More accurate asymptotic 
expansions for ®,(x) are derived when the X; 
have absolute moments of higher orders [1]. 
Asymptotic expressions are also obtained 
when the X; are not identically distributed [2] 
or the limit distribution is tstable [3]. 


(6) The Local Limit Theorem. The local limit 
theorem is concerned with the density function 
of the limit distribution. Let {X,,} be a se- 
quence of identically distributed lattice vari- 
ables with finite variances as in (5). Let 


Sieg "E EGG). 
$,.;— (o /n) ! d(j—nE(X,)). 


Then uniformly in j, 
o /n P(S,—5,)— (2n) "^ exp( 52/2)50 
(n— oo). 


The classical de Moivre-Laplace theorem for 
Bernoulli trials ts a special case of this theo- 
rem. When X; has a density function, a cor- 
responding limit theorem holds. Local limit 
theorems are derived also when (i) higher 
moments exist, (ii) the limit distribution is 
stable, or (iii) the component variables are not 
identically distributed [2, 3]. 


(7) Large Deviations. Let ( X,) be a sequence 
of identically distributed independent random 
variables with E(X;) 20, 0 c6? — V(X;) « oo. 
Let 6, (x) be the distribution function of S, — 
(X, +...4+.X,)/o./n and (x) the distribu- 
tion function of the standard normal distribu- 
tion N(0, 1). In many problems in the field of 
applications of probability theory, the asymp- 
totic behavior of 1 — $,(x,) as x,—oo with 

n is of interest. This is called the problem of 
large deviation. Assume that E(exp(a| X;|)) « oo 
for some a 0. Then for x, >1, x,— o( /n) and 
n->oo the following assertions hold: 


rcv, mU C) C2) 
scu mU C) 02) 
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where A(z) is a power series involving the 
'semi-invariants of X; and is convergent for 
sufficiently small |z| [4, 5]. This result has been 
generalized for the case of random variables 
not identically distributed [5,6]. Bounds for 
1—@,,(x,) have also been derived by several 
authors (— [7]). 


C. The Strong Law of Large Numbers and Its 
Refinements 


A sequence of random variables | X,] is said 

to obey the strong law of large numbers if 

n! X54 (X,— a) tends to zero with proba- 
bility 1 as n— oo, when constants a, are pro- 
perly chosen. When the component variables 
of the sequence are independent, a useful suffi- 
cient condition for validity of the strong law of 
large numbers is 

1 


2 


Ms 


E(X,— E(X,))? < oo. 


Dat 


According to Birkhoff's individual ergodic 
theorem (— 136 Ergodic Theory B), it is neces- 
sary and sufficient that the mean of a compo- 
nent variable exist for a sequence of identically 
distributed independent variables in order for 
it to obey the strong law of large numbers. 
Suppose that ( X,] is a sequence of inde- 
pendent random variables with E(X,) — 0, 
0; = E(X2) « oo, b? 202 - ...-c o2 — oo, and 
P(|X,| « 4,b,, n—1,2,...)— 1 for a decreasing 
sequence 4, tending to 0 as n— oo. If 4, — 
O(1/9? (b2)) for a certain increasing continu- 
ous function 9, the probability that X, +... + 
X, > b,o(b;) for infinitely many n equals 0 or 


oo 


1 2 
l according as the integral | ewe? (0/2 dr 


1 
converges or diverges [8]. In particular, when 


p(t) 2 logo, t 3loga, t + 2log,4yt 

+... 2logg 4t (24 8)loga, t)", 
where log,,,t=loglogt, etc., the relevant prob- 
ability is equal to 0 or 1 according as e is 
positive or negative. If we take o(t)  21og,, t, 


we are led to Khinchin's law of the iterated 
logarithm: If 


IX, = 0(B,/ y log; by ) 


then 
X,+...+X 
p(imsup A jJ- 1. 


D. Functionals of Sums of Independent 
Variables 


(1) Consider a recurrent event which occurs at 
successive time periods z,, t, + T2, ..., where 
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{t,} is a sequence of nonnegative independent 
random variables with the same distribution. 
Let F be the distribution function of t,, m= 
E(1,), a? =V(t,)< oc, and N, be the number 
of occurrences to time n. Then 


lim E (voz SE x)= al 


When c? 2 co and P(t, < o0) — 1, a necessary 
and sufficient condition for suitably norma- 
lized N, to converge in distribution is that for 
every a0 the relation (1 — F(x))/(1 — F(ax)) 
aü*(x005), 0 €a «2, hold. When 0 a«« Il, 
P(N,(1 — F(n)) « x) O V, (x). When 1 <a <2, 
P((N,—nm !)m!* ^b. ! — x)5 Y, (x) for b, 
such that 1 — F(b)n !, where P(x) = 1— 

e *(x»0) V,(x) 21— 6,(x 1) (x >0) for 0< 
a«1, P, (x) 21—O,(—x)for 1 «a«2, and 
o, is the quasistable distribution function 
whose characteristic function is 


e VD du. 





exp(—|t|*(cos2 ' xa—isgntsin2 ' na) (1 — «)). 


In particular, we have 


V al 2/m x) eJ 2/m E exp( —u?/2)du 


Moreover, the strong law of large numbers 
and the law of the iterated logarithm con- 
nected with N, have been obtained [9]. 

(2) Let {X,} be a sequence of independent 
identically distributed random variables taking 
integer values with the same distribution, and 
put $, — X, +... - X,. If the maximum span of 
the X; is d and 0< E(X;) € +, then 


SR taS) JUE (n o9), 
20 (n —ooy; 
a0) tows 1/E(X;) cl 


where y, is the tindicator function of the set E, 
and 1/E(X;) is defined as zero when E(X) = oo. 
If the limit distribution of the normalized sums 
S, of {Xp} is stable with index £ (1 fl <2), X 
is symmetric for f — 1, and E(X;) —0 for B> 1, 
then the event that S, returns to O is a recur- 
rent event. For these cases, setting x — 1 — fj !, 
the conditions in (1) imposed on the distri- 
bution of the recurrence time q} are satisfied, 
and the limit distribution for N,— %;o;(S1) + --. 
+ Xioy(S$,) is determined. In this connection, we 
can show that a similar result is valid when 
Zo 1s replaced by a member of a considerably 
wider class of functions, and that the limit 
distribution is determined as well for M, which 
is the number of occurrences of the event 
($,20,5,,, <0), 1 <k <n. Similar theorems are 
established for the case when X; has nonlattice 
distribution [10-12]. 


920 


(3) Let {X,,} be a sequence of independent 
random variables with the same distribution, 
and put S — X, ...-- X,, L,— Xo, (S1) - ... 
+ Xo, XS). If n! E(L,) (0 € o x 1), then 
P(L, nx) G,(x), where G, is a distribution 
function on the unit interval [0, 1] such that 


Gy(0--) — G,(0—)- 1, 


x 


G,(x)=n ‘sin «| u^ (1—u) *du 


9 


for 0<a<1, and G,(1+0)—G,(1—0)=1. 


When a= 1/2, the corresponding limit distri- 
bution is given by G,.(x)=227' arc sin /x 
(the arcsine law). These L, and N,, M, in (2) 
are considered as tadditive functionals of the 
*Markov process S,. When E(X;)=0 and 
E(X?)=1, we have the limiting relation 


lim P( m max Sex) 


1Skn 


(x <0), 


Dal 7 (x > 0), 
X Jo 2 


lim P| max |S,| «x. /n 
noo La 1 Vn 


4 (—1)" ( Au 
=— exp| ————-— 
a 4-92m4- 1 8x? . 
Furthermore, the limit distributions of n ^! (S2 
+... S2) and n. 7? (|S, |+... - |S,|) are also 

determined explicitly [14]. When X, is non- 
negative, let v, be the number of S, (n— 1,2, ...) 
not exceeding x, and write Y, 2$, ,, —x, Y; = 
x — S, . Then a fundamental theorem of tre- 
newal theory guarantees the existence of the 
limit distributions of Y,, Y, as x oo [15]. 
Extensive studies have also been devoted 
to limit theorems for sums of dependent ran- 
dom variables, especially for Markov chains 
[16-20]. 





(x >0). 


E. Convergence in Distribution for Stochastic 
Processes 


(1) General Theory. Let T be a finite or infi- 
nite time interval, and let (Q, B, P) and S be a 
probability space and topological state space, 
respectively. Given an S-valued stochastic 
process X =(X(t,w), te T, c €Q), we denote by 
$8(S") the c-algebra generated by the tcylinder 
sets and denote by Py and Py ^^, respectively, 
the probability measures over B(ST) induced 
by the processes X and (X(t,,@),..., X(t,,@)). 
Suppose that there is a sequence of stochastic 
processes X, (n— 1,2,...) whose tsample paths 
are contained in a subset E of ST. We can 
introduce a topology t on E so that if B, (n= 
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1,2, ...) is the Py -completion of B(S"), the 
topological c-algebra B; on E becomes a 
subfamily of B,N E (n— 1,2, ...). If there exists 
a probability measure P, over (E, Bi) such 
that 


im | soar =f f(y) dPo 
E E 


for every bounded t-continuous real function f 
on E, then the probability distributions of IX. 
are said to converge to Po. 

Let (E, p) be a tcomplete metric separable 
space and P' the Lebesgue measure on Q'— 
[0, 1]. If a sequence of E-valued random vari- 
ables | X,(c), 1 € n « oo] has the probability 
distribution P, over the c-algebra 932 of Borel 
sets of (E, p) as its limit distribution, then it 
can be shown that there is a sequence of E- 
valued random variables {€,(@’)} (v el Des 
n « oo) such that the probability distribution 
of €,(w’) coincides with that of X,(w) (1 n« 
oC), €9{w’) has the probability distribution P,, 
and P'(¢,(w')>o(')) = 1 [21]. 

Consider a sequence of real-valued sto- 
chastic processes | X,(t,@)}. If (i) there is a 
probability measure P, on *B(R?) such that 
DN converges to Pj" for every choice of 
(t5, .... tj) and (ii) 
limlimsup sup P(|X,(s)— X,(t)] s) 0 
Ho ne |s-t| xh 
for every £0, then there exists a sequence of 
real-valued processes TZ, w); (m'eQ';0<n< 
oo) such that for every te T, €,(t) converges 
in probability to €o(t), £, and X, (1«n« oo) 
induce the same probability measure over 
38(R?), and & has the probability distribution 
P, and is continuous in probability. 


(2) Convergence in Distribution of Stochastic 
Processes Whose Sample Functions Have tDis- 
continuities of Only the First Kind. Yu. V. Pro- 
khorov and A. V. Skorokhod carried out a sys- 
tematic study of convergence in distribution 
for those processes whose sample functions are 
(S, p)-valued and have discontinuities of only 
the first kind, where (S, p) is a complete metric 
separable space. Let D be the set of functions 
x(t) from T=[0, 1] to S having discontinuities 
of only the first kind and right continuous for 
O<t<1 with x(1 —0)— x(1), and introduce a 
metric pp by pp(x, y) - inf; (sup,.; |t — A(t)] + 
Sup;er p(x(t), y(A(t)))}, where the infimum 

is taken over all homeomorphisms 4 on T. 
Then (i) in the notation of (1), 8?» =B(S")N 
D, (ii) (D, pp) is fseparable but not neces- 

sarily tcomplete, and (iii) there is a complete 
separable metric pp in D which is equiva- 

lent to py (i.€., pp and pp induce the same 
topology in D). When almost all sample func- 
tions of X are elements of D, we write X € 
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D. Given X,€ D (nz 0), a necessary and suf- 
ficient condition for Py to converge to Py, 
over B is that (i) PN" converge to PA" 
for any (t4, ..., t,) c N, where N is a dense 
subset of [0, 1] containing 0, 1; and that (ii) 
lim, o lim sup, ,, Py (Ap(h, Xp) > 0) — 0 for 
every 6>0, where A,(h, x)  sup(min(p(x(t,), 
x(t)), p(x(t5), x(t))t -h«t, «t«t; «t- hj. 

If C is a subspace of D consisting of all con- 
tinuous functions, pp is equivalent to pc(x, y) — 
max { p(x(t), y(t)) JO t€ 1}, and Ac(h, x)= 
max { p(x(s), x(t))||s—t|<h} corresponds to 
Ap. For a sequence of processes X,€C, a 
necessary and sufficient condition for Py to 
converge to Py, over Bé is that (i) PA con- 
verge to Py" for any (t,,..., t) c N (N is 
a dense set as in the case of D); and that (ii) 
lim, limsup, ,, Py (Ac(h, X,) > 6) =0 for 
every à 0. 

If Py converges to Py, over Bey (over BE), 
then there exists a sequence €,(t, c )e D (eC) 
(te T, o e) for 0€ n « oo such that P, = 
Py for0<n<co and P'(py(£,, &) 0) = 1 
LP gel Go) 20) = 1). 

The following theorem, due to Prokhorov, is 
useful for practical applications. Let {X,(t)| 
Oxt«1,x€ Aj be a family of real-valued pro- 
cesses that satisfy 


E(IX,()— X) clt-s| "^, aed, 


where a, b, c are positive constants inde- 
pendent of x. If the family of distributions of 
1X,(0)] a € A} is tight (C 341 Probability 
Measures F), then X,(t)e C (xe A), and (Py } 
is a tight family of probability distributions 
over C. 


(3) Convergence in Distribution of Markov 
Processes. Consider a Markov process ( X (t), 
O<t<1} with complete metric separable 
space (S, p) as its state space and with topo- 
logical c-algebra B,. Suppose that its tran- 
sition probabilities P(s, x;t, A, Ox s«t« I, 
Ac, are measurable in xeS. Define V*(x)— 
iy loo y) 2 e], and introduce two kinds of 
conditions: (D) for every £» 0 and O x t « 1, 
lim, o SUPxes.t, t, P(t4 x; t5, V*(x)) =0, where fa, 
t, are subject to the conditions <t; «t; «t 4- 
h,t—h<t,<t,<t,1—h<t,<t,<1;(C) for 
every £20, Sup, eren P(S, X; t, V(x)) bb (h), 
where VR (h) is such that 'P,(h)[O as h|O for 
every £2 0. According as {X (t, 0 « r« 1] satis- 
fies conditions (D) or (C), there is a process 
X'(t) which is equivalent (— 407 Stochastic 
Processes) to X (t) belonging respectively to (D) 
or (C). Convergence in distribution of Markov 
processes is based on this fact. Suppose that 

a sequence of Markov processes | X,(t)|0« 

tx 1,,0xn« oo, satisfies (i) condition (D) uni- 
formly in n (condition (C) with Y, indepen- 
dent of n), (ii) with N as in (2), PB con- 
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verges to Py for every (t;, ..., t) c N. Then 
Py, converges to Py, over Be? (over B2). Con- 
vergence in distribution of Markov processes 
is also implied by the convergence of the !gen- 
erators of the Markov processes [24, 25]. 


(4) Donsker's Invariance Principle. Donsker's 
invariance principle is a kind of central limit 
theorem for stochastic processes. Let {&,} bea 
sequence of identically distributed independent 
(real-valued) random variables with E(£;) —0 
and 0 «c? 2 V(£) « oo, and let S,=¢,+...+€,, 
nz 1, S)=0. Define random elements X,eC = 
C[0, 1], nz 1, by 


X (t, @) 


EE 


(0x t « 1), 


1 
on 


when [nt] denotes the integer part of the real 
number nt. Thus X,(i/n, c) — Sie, /n, and X, 
is linear on the interval TO — 1)/n, i/n] for i= 
1,2, ..., n. The C-valued random variables 
X, induce their probability distributions Py, 
over Bee, nz 1. Let X(t), 0<t<1, be a (real- 
valued) "Wiener process (— 45 Brownian 
Motion A) with X (0) —0 and Py its probability 
distribution over Dër P, is called a Wiener 
measure. Then Donsker's invariance principle 
[26] asserts that Py converges to Py. 

If we define X;,€D, nz 1, by 


1 
X(t, @) zo rn 
and if we consider P, as a probability measure 
on the space (D, 8%?) which is concentrated on 
Cie, P,(C) 2 1), then the above convergence 
theorem holds also for the distributions Py, of 
Xj, nZ l1, over 87» [20]. 

The arcsine law (— Section D (3)) is a nice 
example of an application of the invariance 
principle. Define a function f on C by f(x)= 
Lina, (X(t) dt, xe C. Then it is shown that 
f is measurable and continuous except on a set 
of Wiener measure 0. Hence the invariance 
principle implies that the probability distri- 
butions of { f(X,)} converge to the distribution 
of f(X). It is known that f(x) obeys the arcsine 
law P(f(X)«a)-2n ! arcsin./a,0<a<1(— 
45 Brownian Motion E). In the random walk 
case, where DU 1)= P(£; 2 —1)=1/2, nf(X,) 
is the number of i, 1 «ix n, for which S,_, and 
S; are both nonnegative. 


(0x t « 1), 


(5) Strassen's Invariance Principle. An in- 
variance principle for the law of the iterated 
logarithm is explained here. Let {€,}, {S,}, and 
{X,,(t)} be the same as in subsection (4) above. 
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Define 
T, (t, 0) 2 (2loglogn) !? X, (t, w) 


Strassen's invariance principle [28] asserts that 
with probability 1 the set {T,,n> 3} is trela- 
tively compact in C = C[0, 1], and the set of its 
limit points coincides with the set K of ab- 
solutely continuous functions x on the inter- 
val [0, 1] such that x(0)=0 and fò x'(t)? dt « 1. 
This principle is based on the following fact. 
Let X (t), Ox t € oo, be a tWiener process with 
X (0) « 0, and 


Z,(t)— (2nloglogn) !? X (nt), 

(O<t<1, n>3). 
Then with probability 1 the set {Z,,n>3} is 
relatively compact in C with its limit set equal 
to K. As applications of the invariance prin- 
ciple we obtain, e.g., (i) the ordinary law of the 
iterated logarithm 


noc 


Pim sup S$,/(2nloglog n)? = d eT, 
and (ii) if a — 1l, for any az 1, 


r(im sup(2nloglogn) "un Y Lab 
i=l 


nooo 


1 dt —a 
äu äzre cl ) )-. 
o ./1—1* 


in particular, 





"in sup(2nloglogn) "n~! Y |S,| 
i=1 


sinn 


F. Convergence of Empirical Distributions 


Let {X,(@)|1<k <n}, {Ywl & j &m] be 
independent random variables with the same 
distribution function F. Let N,(x, œ) be the 
number of k (k— 1, ..., n) such that X, x x. 
Then F,(x,@)=n TN, c) is a distribution 
function in x called an empirical distribution 
function. According to the Glivenko-Cantelli 
theorem, H =sup,| F,(x, 9) — F(x)| 0 (n9 oo) 
with probability 1. Let Gelz, œ) be the empir- 
ical distribution function constructed from the 
Y, and put 

H,—sup(F,(x, «)— F(x), 


x 


H*(n, m)=sup | F,(x, Q)— G, (x, w)|, 


H (n, m)=sup (F,(x, v) — G,,(x, c»). 


x 


If F is continuous, then the following results 
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hold. According to Kolmogorov's theorem, as 
noo we have 


P(./n Hj? «x)5 K(x), 
where 


K(x)= y (—l*exp(—2k?x?) (x>0); 


k=- o0 
-0 (x<0). 


On the other hand, Smirnov's theorem asserts 
that (1) 


P( /n H,<x)=0 
EL REY e(t 


k=r+1 


x (k —x,/n)K(n— k+ nny? 
(0<x<./n, r= [xn] 


(x <0); 


=1 (xn) 


and, as n— oo, 


P( /n H,« x)» Lx) 


L(x)=1—exp(—2x*)  (x»0) 


=0 (x<0); 


1 


(ii) when n> œ, mn ! >r, r a constant, 


P((nm)'? (n 4- m) ! ^H * (n, m) « x) K(x), 


P((nm)' 7 (n-- m) "^ H(n, m) < x)> L(x). 


In terms of convergence in distribution for 
Markov processes, these results are interpreted 
as follows: F(X,) is uniformly distributed over 
[0, 1]. Let v,(t) be the number of k (k = 1,2, 
...,) such that F(X,) «t, and put X,(t) — 
dain ^! wt) —t) (0<t <1), X,(0)=0. Then 
E(X,(t)) - 0, E(X,(t) X, (t)) = min(s, t) — st, 

H; —sup,|X,(t)|, and H,(t) - sup, X,(t). Let 
iX(0|Oxt«1) bea *Gaussian process with 
E(X (t)) 20, E(X (s) X (t)) 2 min(s, t) — st, and put 
H* —sup,|X (t), H —sup, X (t). Then Py, con- 
verges to Py over BZP. Therefore 


P(/n H} «x)jo P(H* <x) 
Pn H, « x) P(H <x) 


(n co), 
(n oo). 


Similarly, if we denote the number of j such 
that F(Y) &t by i, (t), then H ^ (n, m)= 
sup, |n ! v,(t) — m~! u,,(t)|, and therefore 


P((nm)'?(n 3 m) 'AH* (n, m) <x) 


—P(Q +r) I^ sup|X()— Vr Y(0)] x) 


when n oo, nm. ! 5r, where {Y(t)|O<t<1} is 
independent of {X (tO t & 1) and distributed 
according to the same law as for the latter. 
The exact as well as asymptotic expressions for 
the distributions H, , H * (n, m) are also ob- 
tained [33, 34]. Thus the limit distributions 
can be calculated by using the distributions of 
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the processes {X (t)} and { ¥(t)}. The results 
above are obtained by applying Donsker's 
invariance principle. The Gaussian process 
{X(t)|O<t<1} introduced above is called a 
Brownian bridge because it is obtained from 
a Brownian motion B(t) by conditioning 
B(0) 2 B(1)=0. 
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A. General Remarks 


In *functional analysis, when we talk about an 
operator or a mapping T from one space to 
another, it is important to specify not only the 
domain D(T) on which T is defined and the 
range R(T) onto which T maps D(T) but also 
the spaces X and Y of which D(T) and R(T), 
respectively, are subsets. Thus a mapping T 
from a subset D(T) of a (real or complex) 
linear space X to a linear space Y is called 

an operator from X to Y. The image of xe X 
under T is customarily denoted by Tx. An 
operator T from X to Y is said to be a linear 
operator (linear transformation or additive 
operator) if (i) D(T) is a tlinear subspace of X 
and (ii) T(a, x, 4-à5x,) 2 a, Tx, +4, Tx, for all 
X4, X5€ D(T) and all scalars «,, «,. The set of 
all continuous linear operators T from X to 

Y with D(T)=X is denoted by L(X, Y). We 
denote L(X, X) by L(X). 

For simplicity we suppose throughout this 
article that X and Y are tBanach spaces. In 
this case L(X, Y) consists of all bounded 
linear operators from X to Y. Hence L(X, Y) 
and L(X) are denoted by B(X, Y) and B(X), 
respectively. Some of the statements given 
remain true in more general situations (— 424 
Topological Linear Spaces). More information 
about operators in B(X, Y) can be found else- 
where (— 37 Banach Spaces; 68 Compact and 
Nuclear Operators). Examples are grouped 
together in Section O. 


B. Operations on Operators 


When T, and T, are operators from X to Y, 
the sum T, + T, is the operator defined by 
D(T, + T) - D(T,)M D(T;) and (T, + T;)x = 
Ty x - Tx, xe D(T, + T,). When T, is from 

X to Y and T; is from Y to Z, the product 

T; T, is the operator defined by D(T, T,)= 
(ve D(T))| T; xeD(T;)) and T; T, x = P(T, x), 
x€D(T, Tj). The product «T of a scalar « and 
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an operator T is defined in a similar and obvi- 
ous way. These operators are linear whenever 
T, and T, are linear. For an operator T from 
X to Y the subset I(T) - {(x, Tx) xe D(T)] of 
the product space X x Y is said to be the graph 
of T. If (Tj) c T (15), the operator T, is said 
to be an extension of T, (we write T, c T;). If 
a linear operator T from X to Y satisfies the 
condition that x 40 implies Tx 40, then T 
has the inverse operator T~!, which is a linear 
operator satisfying D(T !) — R(T) and R(T !) 
= D(T). 


C. Convergence of Operators 


The following three topologies are most fre- 
quently used in the linear space B(X, Y), which 
becomes a locally convex topological linear 
space under each of them: (1) the uniform 
operator topology, (2) the strong operator topol- 
ogy, and (3) the weak operator topology. These 
topologies are determined by the respective 
fundamental systems of neighborhoods of 0 
consisting of all sets of type (1) (T | | T | <e}, 
(2) (T ||| Tx|| <£, xeX}, and (3) (T || f(Tx)| <e, 
xeX, fe Y’}, where e varies over all positive 

: numbers, X over all finite subsets of X, and Y' 
over all finite subsets of Y’, the *dual space of 
Y. The uniform operator topology is thus the 
metric topology determined by the tnorm in 
B(X, Y) (— 37 Banach Spaces C). Convergence 
of T, to T with respect to one of these topol- 
ogies is referred to as (1) uniform convergence, 
(2) strong convergence, or (3) weak convergence. 
We have such convergence if and only if (1) 
|T, — Tl 0, (2) I(T, — T)x|| 0 for every xe 
X, or (3) | (T, x — Tx)| 50 for every xe X and 
feY. 


D. Closed Operators 


Closed operators play an important role when 
we deal, as is frequent in applications, with 
operators that are not necessarily continuous. 
An operator T from X to Y is said to be a 
closed operator if the graph of T is closed in 

X x Y, or equivalently, if x, € D(T), x, x, and 
Tx, y imply xe D(T) and Tx — y. If T is con- 
tinuous and D(T) is closed, then T is closed. 
Conversely, if a linear operator T is closed and 
D(T) is closed, then T is necessarily continu- 
ous (the closed graph theorem). An operator 

T from X to Y is said to be a closable operator 
if T has a closed extension. A linear operator 
is closable if and only if x, e D(T), x,—0, and 
Tx, y imply y=0. The closure of the graph 
of a closable operator T is the graph of the 
smallest closed extension T of T. Thus T is 
also called the closure of 7. When T is a linear 
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operator from X to Y with D(T) dense in X, 
the dual operator (or conjugate operator or 
adjoint operator) T” of T is defined to be the 
operator from Y' to X' determined by the 
relations D(T")— | fe Y'|there exists ge X’ 
such that g(x) — f(Tx) for every xe D(T)} and 
T'f =g, f eD(T?) (2 37 Banach Spaces). T’ is 
always closed. If T is a closable linear operator 
with dense domain, then its smallest closed 
extension is obtained as the bidual T" re- 
stricted to D(T)- (xe Xn D(T")| T'xeY]. 

For a closed linear operator T we define 
its nullity nul T to be equal to dim N(T), 
where N(T)— (xeD(T)| T, —0] is the null 
space (Le, the tkernel) of T, and its deficiency 
def T to be equal to dim Y/R(T). When at 
least one of nul T and def T is finite, we define 
the index of T by ind T — nul T— def T. A 
closed linear operator T is said to be a Fred- 
holm operator if R(T) is closed, nul(T) « oc, 
and def(T) « oo (— 68 Compact and Nuclear 
Operators). 

The domain D(T) of a closed linear operator 
T from a Banach space X to another Y turns 
out to be a Banach space under the graph 
norm |x| y | Tx||y, and T is looked upon as 
a bounded linear operator from this Banach 
space D(T) to Y. 


E. Operators between Hilbert Spaces 


Throughout this section we suppose that X 
and Y are complex !Hilbert spaces. Let T bea 
densely defined linear operator from X to Y. 
Instead of the dual T’, it is sometimes conve- 
nient to use the operator T* from Y to X 
determined by the relation (x, T*y) - (Tx, y), 
x€D(T). The operator T* is called the adjoint 
operator (or Hilbert space adjoint) of T. By 
means of the antilinear isomorphism z, from 
X onto X’ given by (zy x)(u) =(u, x) (tRiesz’s 
theorem), T* is related to T' by T*—z,! T'n,. 
The correspondence T T' is linear, while the 
correspondence T T* is antilinear. D(T*) is 
dense in Y if and only if T is closable, and in 
this case the smallest closed extension T of T 
coincides with T** —(T*)*. 

If a densely defined linear operator T in X 
De, T is from X to X) satisfies Tc T*, then 
T is said to be a symmetric operator (or Her- 
mitian operator). If T= T*, then T is said to be 
a self-adjoint operator. A symmetric operator 
is always closable. A symmetric operator T is 
said to be essentially self-adjoint if the closure 
of T is self-adjoint. A self-adjoint operator T 
is said to be nonnegative or positive or positive 
semidefinite if (Tx, x) 20 for every xe D(T). 
An operator Te B(X, Y) is said to be partially 
isometric if there exists a closed linear sub- 
space M of X such that T is isometric on M 
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(Ge, || Tx|| = |xll, xe M) and zero on the *ortho- 
gonal complement of M. The closed linear 
subspace M (T M) is called the initial (final) 
set of T. An operator TeB(X, Y) is partially 
isometric if and only if T*T and TT* are 
(orthogonal) *projections. The ranges of these 
projections are the initial and final sets of T, 
respectively. In particular, if M =X, then T 

is said to be an isometric operator or an iso- 
metry. If X = Y M =TM or, more generally, 
if X 2 M and Y- TM, then T is said to bea 
unitary operator. Te B(X) is unitary if and 
only if T* — T +. A closed linear operator T 
with dense domain is said to be normal if 
T*T- TT*. The set of self-adjoint (or unitary) 
operators in X forms a subclass of the class 

of all normal operators. The structure of nor- 
mal operators, especially that of self-adjoint 
operators, has been studied in detail by means 
of spectral analysis (— 390 Spectral Analysis of 
Operators). Let T be a densely defined closed 
linear operator from X to Y. Then there exist a 
nonnegative self-adjoint operator P in X and a 
partially isometric operator W with initial set 
R(P) - R(T*) such that T= WP. The operators 
P and W are determined uniquely by these 
requirements. This is called the canonical de- 
composition or the polar decomposition of T. 
For linear operators T in a complex Hilbert 
space X, the notion of numerical range W(1)— 
(Tx, x)| xe D(T), ||x|| =1} is also impor- 

tant. W(T) is a convex set such that | T |]/2« 
sup|W(T)|=sup{|A|| 4e W(T)] «(T |. If T is 
normal, then the closure of W(T) coincides 
with the *closed convex hull of the *spectrum 
c(T), and we have sup|W(T)|- | T |. A nor- 
mal operator T is a nonnegative self-adjoint 
operator if and only if its numerical range is in 
the positive ray 42:0. 





F. Resolvents and Spectra 


Let T be a closed linear operator in a complex 
Banach space X, and I be the identity in X. 
The set p(T) of all complex numbers å such 
that AI — T has an inverse in B(X) is called the 
resolvent set of T, and the complement o{T) of 
p(T) is called the spectrum of T. For 4e p(T) 
the operator R(4; T) (4I — T) ! eB(X) (or 
sometimes — Ria T)) is called the resolvent of 
T. If 45e (T), then {4|]2— 20l € [R(4o; T)|| ^! 
c p(T). Hence p(T) is an open, and c(T) a 
closed, set. If T is bounded, then o(T) is a 
nonempty compact set. However, p(T) may be 
empty (example (2) in Section O) or the whole 
plane C in general. The operator R(A; T) is an 
*analytic operator function of 4 in p(T) and 
satisfies the (first) resolvent equation 


RU, T)-RGS: T) (4; — 4) RA; TRL T) 
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for every 4,, 45€ p(T). For every TeB(X) the 
limit r(T) lm [| T"|| U^ € | T ||, n oo, exists and 
coincides with the "spectral radius sup|o(T)| of 
T. An operator Te B(X) with r(T) —0 is called 
quasinilpotent or generalized nilpotent. Con- 
cerning the dual operator, we have p(T") — 
p(T) and R(4; T') 2 R(A; TY. f X isa Hil- 

bert space, then p(T*) 2 p(T) and RA T*)= 
BL: T)*, where ` stands for the complex 
conjugate. 


G. Operational Calculus 


In operator theory, the term operational cal- 
culus generally indicates a way of defining 
"functions" f(T) of an operator T so that a 
kind of algebraic homomorphism is estab- 
lished between a set of complex-valued func- 
tions f and the corresponding set of operators 
f(T). The functions and operators that must 
be taken into consideration depend on the 
nature of the problems to be solved, and ac- 
cordingly there are several versions of oper- 
ational calculus. We describe two typical ones. 

(1) Let T be a self-adjoint operator in a com- 
plex Hilbert space X with the tspectral resolu- 
tion T — (*, 4 dE(4), and let f be a complex- 
valued Borel measurable function on R. Then 
the operator f(T) is uniquely determined by 
the following relations (— 390 Spectral Analy- 
sis of Operators): 


putri 





1 f(A? 4(EG)x. x) « ch 


oo 


((T)x, y)= | SAAE), y), 


xeD(f(T), yeX. 


Then f(T) is normal and the correspondence 
f f(T) satisfies the following relations: (1) (xf + 
fl (T) 2 af( T) + BolT): (ii) (f) (T) > (Dg CT 
and (in) f(T)* = f(T). M g is a bounded func- 
tion, the extensions in (1) and (ii) can be re- 
placed by equalities. 

(2) Let X be a complex Banach space, Te 
B(X), and (T) the set of all functions holo- 
morphic in a neighborhood of c(T). We define 
an operator f(T)e B(X), f e F(T), by 


1 
fms A()R(t; T) dt, (*) 
Ti Jc 


where C is a closed curve consisting of a finite 
number of rectifiable Jordan arcs encircling 

a domain that contains o(T) in its interior 
and lies with its boundary completely in the 
domain in which f is holomorphic. The inte- 
gral does not depend on the curve. In this 
case relations (1) and (ii) hold with equality in 
place of extension. Instead of (iii) we have (iv) 
A(T)- £(T')and f(T')- f(TYy. The integral 
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appearing in (*) is sometimes called the Dun- 
ford integral. In both situations described in (1) 
and (2) the spectral mapping theorem ot f(T))— 
f(c(T)) holds. When these two ways of defin- 
ing f(T) are possible, the resulting operators 
coincide. 

Another kind of operational calculus can be 
constructed, for example, when T ts the gen- 
erator of a certain semigroup of operators 
[1,2,4]. 


H. Isolated Singularities of the Resolvent 


Let T be a densely defined closed linear oper- 
ator in a complex Banach space X and 4, 

an isolated point of o(T). Take a sufficiently 
small circle C around 4, and put 


1 
sl R(4; T)d4, 
21i. 


which is a projection in X. Then the Laurent 
expansion around 4, of the resolvent is given 
by 


RUST)= Y A4 Aaf 

with 4, —(—1)*! (44I — T) **P E forn «0. 
When the dimension v of the range of E is 
finite, A, is a *pole of R(4; T) with order not 
exceeding v, and 4, is an teigenvalue of T with 
*multiplicity not exceeding v. Furthermore, E 
is then a projection onto the "root subspace 
belonging to the eigenvalue As. 


I. Extension of Symmetric Operators 


In applications we frequently encounter the 
problem of finding self-adjoint extensions of a 
given symmetric operator. Let T be a closed 
symmetric operator in a complex Hilbert space 
X. Then Til is one-to-one, and its range R, 
is a closed linear subspace of X. The operator 
Vj—(T—il)(T-- il) ! from R, onto R_ is 
isometric, and (I — V,)R, is dense in X. We 
call V; the Cayley transform of T. Conversely, 
let V be an isometric operator from a closed 
linear subspace M of X onto another one N 
such that (I — V)M is dense in X. Then the 
operator T— i(1 4- V)(I V) ! is a closed sym- 
metric operator satisfying V, = V. Thus the cor- 
respondence T V, is one-to-one onto; Tc S 
if and only if V; c Vy, and T is self-adjoint if 
and only if V} is unitary. The dimension n, of 
the subspaces X ON, —(x| T*x— +ix} are 
called the deficiency indices of T. Denoting the 
"residual spectrum of T by o,(T) and putting 
IT, —14|1m 420], we have the following 
propositions: (1) According as n, >0 or n, =0, 
II, C o, (T) or II, c p(T) (similarly for n_ in 
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place of n, ); (tt) T has a self-adjoint extension 
if and only if n, =n_; and (iii) T is self-adjoint 
if and only if n, =n_ =0, or, equivalently, if 
and only if /7, c p(T). When T is symmetric 
but not closed, the previous arguments can be 
applied to the closure T of T. The deficiency 
indices of T are also called the deficiency 
indices of T. These arguments can be per- 
formed similarly with å and / (Im 4 #0) in 
place of i and — i. 

We now describe two more concrete criteria 
for T to have self-adjoint extensions. (1) Semi- 
bounded operators: If there exists a real num- 
ber y such that (Tx, x) 2 y||xll? for every xe 
D(T), then T has a self-adjoint extension 
satisfying a similar inequality with the same 
constant y. The structure of all self-adjoint 
extensions of T with the same lower bound 
y was studied in detail by M. G. Krein [8]. 
Among such extensions the Friedrichs exten- 
sion is distinguished as the one having the 
smallest form domain. (2) Real operators: If T 
commutes with a conjugation in X, namely, if 
there exists an antilinear mapping J from X 
onto X such that (Jx, Jy) 2 (y, x), J? « I, and 
JT= TJ, then T has a self-adjoint extension 
(— Section O (2)). 


J. Dissipative Operators 


A linear operator T in a Hilbert space X 

is said to be dissipative (resp. accretive) if 
Re(Tx, x) «0 (resp. 20) for every xe D(T). To 
extend the definition to operators in a Banach 
space X, we define, for each xe X, Fx to be 
the set of all x’ in the dual space X’ such that 
(x, x? = lxi? = UNI. Fx is not empty by 
virtue of the Hahn-Banach extension theorem 
(— 37 Banach Spaces). The multivalued map- 
ping F:x—x' is called the duality mapping. 

A linear operator T in X is called dissipative 
(resp. accretive) if for each xe D(T) there exists 
an x'e Fx such that Re< Tx, x’) <0 (resp. >0), 
or equivalently if [x — ATx|| > ||x|| for every 
x€ D(T) and A>0 (resp. 4 <0). A dissipative 
operator T is called m-dissipative if R(J — 2T) 
= X for a (and all) 4 0. Then the half-plane 
ReA0 is in the resolvent set p(T), and we 
have |(4I — T) !| <1/ReA, Re» 0. In particu- 
lar, T is closed, and if X is freflexive, then the 
domain D(T) is dense. A dissipative operator 
is called a maximal dissipative operator if it has 
no strict extension that is dissipative. An m- 
dissipative operator is a maximal dissipative 
operator. If X is a Hilbert space, then con- 
versely a maximal dissipative operator with 
dense domain is m-dissipative. Every dissipa- 
tive operator with dense domain in a Hilbert 
space can be extended to an m-dissipative 
operator. A linear operator T in a Banach 
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space is the tinfinitesimal generator of a con- 
traction semigroup of class (C°), i.e., a tsemi- 
group (1;|t 2 0] of class (C°) such that || T;| <1 
for all tz 0, if and only if T is an m-dissipative 
operator with dense domain. 
The notion of dissipative and accretive 

operators is also defined for nonlinear opera- 
tors (— 286 Nonlinear Functional Analysis). 


K. Subnormal Operators and Hyponormal 
Operators 


A bounded linear operator T in a complex 
Hilbert space X is said to be subnormal if there 
exists a bounded normal operator N in a 
complex Hilbert space Y, containing X as a 
closed linear subspace such that Tx = Nx for 
all xe X. The operator N is called a normal 
extension of T. T is subnormal if and only if 
Lim.n( 1 *" T" Xm Xn) 20 for every finite sequence 
{Xa} of X. Then its normal extension N is 
determined uniquely up to *unitary equiva- 
lence under the minimality requirement that 
there is no treducing subspace (— Section L) 
for N between X and Y. Every normal or 
isometric operator is subnormal. The spectrum 
of a subnormal operator T is obtained from 
the spectrum of its minimal normal extension 
N by adding some bounded components of its 
complement. An operator Te B(X) is said to 
be hyponormal if the self-commutator [T*, T] 
= T*T—TT* is nonnegative. Every sub- 
normal operator is hyponormal. If a hypo- 
normal operator T has a cyclic element x, i.e., 
the "linear span of ( T"x |n —0, 1,...} is dense 
in X, then the self-commutator is a *trace 

class operator (C. Berger and B. Shaw). The 
planar Lebesgue measure of the spectrum of 
any hyponormal operator T is not less than 
Is T || (Putnam's theorem) [10]. 


L. Invariant Subspaces 


A closed linear subspace M of a Banach space 
X is said to be invariant under an operator 
TeB(X)1f T maps M into M. If M is invariant 
under all operators in B(X) that commute with 
T, then M is said to be hyperinvariant under 

T. When X is a Hilbert space and M is invari- 
ant under both T and T*, then M is said to 
reduce T. That M reduces T is characterized by 
the commutativity TP — PT, where P is the 
orthogonal projection to M. The question of 
whether an arbitrary nonzero bounded linear 
operator in a separable infinite-dimensional 
Hilbert space has a nontrivial invariant sub- 
space still remains open but work has pro- 
gressed significantly in recent years. (1) Every 
nonzero *compact operator has a nontrivial 
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hyperinvariant subspace (V. I. Lomonosov). (2) 
An operator T has a nontrivial invariant sub- 
space if it is subnormal or if || T || x 1 and the 
spectrum o(T) covers the whole unit disk of 
the complex plane (S. Brown, 1979). (3) If T is 
hyponormal, a certain power T" has a non- 
trivial invariant subspace (Berger, 1979). 

The complete description of all invariant 
subspaces is usually difficult even for an oper- 
ator of quite simple type. For the tshift oper- 
ator (— Section O (6)) on the tHardy space 
H, on the unit circle this was done by A. Beurl- 
ing [11,12]. For the tintegral operator on the 
space L,(0, 1) with *kernel k(t, s) 2 max(t — s, 0) 
the set of all invariant subspaces is *totally 
ordered with respect to inclusion [13]. 


M. Dilations 


Let X be a Hilbert space and Te B(X). A 
bounded linear operator U in a Hilbert space 
Y, containing X as a closed linear subspace, 

is said to be a dilation (also strong or power 
dilation) of T if T"x 2 PU"x for all xe X, n— 1, 
2, ..., where P is the orthogonal projection 
from Y to X. If U is unitary, it is called a uni- 
tary dilation of T. Every contraction Te B(X), 
i.e., an operator T with || T | <1, has a unitary 
dilation U in a suitably constructed Hilbert 
space Y (the dilation theorem). Such a unitary 
dilation U is uniquely determined up to *uni- 
tary equivalence under the minimality require- 
ment that there ts no reducing subspace for 

U between X and Y. A corollary is the von 
Neumann inequality: For any contraction T 
and any complex polynomial p(A), the in- 
equality | p(T)|| <sup,,)<; |p(A)| holds. Further- 
more, if an operator Se B(X) commutes with 
a contraction T, then there exists a dilation V 
of S in B(Y) such that ||V|| = Isi and V com- 
mutes with the unitary dilation U of T (the 
lifting theorem). If a contraction T is com- 
pletely nonunitary in the sense that the restric- 
tion of T to any nontrivial reducing subspace 
is not unitary, then the minimal unitary dila- 
tion has tabsolutely continuous spectrum, 
and the operational calculus p p(T) is ex- 
tended to all functions in the *Hardy space 
H [15]. 


N. Functional Models for Contractions 


A canonical model for an operator is a "natu- 
ral" representation of the operator in terms 

of simpler operators and in a context in which 
more structure is present. There are several 
canonical models for bounded linear operators 
[13-16]. Here, we follow the approach of 

B. Sz.-Nagy and C. Foiag that was developed 
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in connection with unitary dilations of con- 
tractions [15,16]. 

An tanalytic operator function (2) defined 
on the open unit disk with values in B(X, Y), 
where X and Y are separable complex Hilbert 
spaces, is denoted by {X, Y, ©}. An analytic 
operator function | X, Y, 9] is said to be con- 
tractive if ||@(A)|| <1 for all å. If, in addition, 
|| ©(0)x|| < |x|} for all nonzero xe X, then it is 
said to be purely contractive. Each contractive 
(X, Y, O) is uniquely decomposed into the 
direct sum 0(4) 2 © (A) O V, where { Xo, Yo, Oo} 
is a purely contractive analytic operator func- 
tion with closed linear subspaces X, € X and 
Y; & Y, and V is an isometric operator from 
X © Xy into YO Y. The operator function ©% 
is called the purely contractive part of O. 

Let ( X, Y, O} be purely contractive. Then 
the boundary value O() — s-lim, ,, O(rc) exists 
for almost all ¢ with respect to the Lebesgue 
measure on the unit circle. Let A(Z) 2 (1 — 
OG*(D00O(0)'? for |C| 2 1. Then the operator 
functions O(C) and A(C) defined on the unit 
circle induce, respectively, the operators Oe 
B(LX, LY) and Ae B(L3) defined by (Of)(D)— 
OCSE) and (Af XE) AHC) for |5| = 1 and 
f € Li, where LY is the tHilbert space of X- 
valued square integrable functions on the 
unit circle. Consider the Hilbert space K — 

LY @ R(A) and the unitary operator U de- 
fined by US € )() =O D Lal). Con- 
sider the linear subspace H « [H! @ R(A)] © 
(Of & Af |f € Hy} of K, where HX is the X- 
valued *Hardy space that is a subspace of LX. 
Then the operator T = PU restricted to H, 
where Py is the orthogonal projection from K 
to H, is a completely nonunitary contraction 
with U as its minimal unitary dilation. T is 
called the contraction generated by (X, Y, 9]. 
Two purely contractive analytic operator 
functions {X,, Y, ©,} and ( X;, Y,,@,} can 
generate contractions which are unitarily 
equivalent if and only if there are isometries 
V, from Y, onto Y; and V, from X, onto X, 
such that V, O,(4) 2 O;(4) V, for all 4 in the 
unit disk. In this case, ©, and ©, are said to 
coincide. 

Now let a completely nonunitary contrac- 
tion T be given. Consider the operator Dy = 
(I — T* T)'? and the closed linear subspace $4 
= R(D,), and define similarly Dj. and Da by 
using T* instead of T. Then the function O4(4) 
= —T-ADz,«(I—AT*) ! D, restricted to Dy 
becomes a purely contractive analytic opera- 
tor function with values in B(D,, $,.) and is 
called the characteristic operator function of T. 
The contraction generated by the character- 
istic operator function is unitarily equivalent 
to T and is called the functional model (or the 
Sz.-Nagy-Foias model) for T. The spectrum 
of T coincides with the set consisting of the 4; 
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in the open unit disk at which ©,(4,) fails to 
have bounded inverse, together with the Co on 
the unit circle at which @,(-) fails to have an 
analytic extension to a neighborhood Q of c, 
which is unitary on the intersection of Q and 
the unit circle. There is a one-to-one corre- 
spondence between the invariant subspaces M 
for T and the regular factorizations of ©, as a 
product O4(4) 2 O,(4): ©, (4) of two contractive 
analytic operator functions { Y, Dy», ©,} and 
{D,, Y, O,) with a suitable Hilbert space Y. 
Here the regularity of the factorization means 
that for almost all ¢ in the unit circle the range 
of (I — O*(0)O ,(())? and the range of (J — 
O,(D)O9*(2)'7 are linearly independent. More- 
over, the characteristic operator function 

of the restriction of T to M coincides with 

the purely contractive part of @,, while the 
characteristic operator function of P,,1 T 
restricted to M^ = X © M coincides with the 
purely contractive part of O,, where Py 

is orthogonal projection to M+. 


O. Examples of Linear Operators 


(1) Integral Operators. Let E;, j — 1, 2, be linear 
spaces consisting of measurable functions 
defined on measure spaces Q, with measures 
H;. Let k(t, s) be a measurable function on Q, x 
€,, and define D(R) to be the set of all x € E, 
such that (Kx)(t) — fo, k(t, s)x(s) d, (s) belongs 
to E,, where the integral is assumed to be 
absolutely convergent almost everywhere. The 
mapping that assigns Kx to each xe D(R) 
determines a linear operator K from E, to E, 
with domain D(K). K is called an integral 
operator, and k(t, s) the kernel (or integral 
kernel) (of K). As an example, let E, L (Qj). 

1 <p< oo, and suppose there exists an M>0 
such that 


| [k(t, s)| du, (s) « M, 
9, 


| [k(t s) dus(t) < M. 
Q, 


Then K e B(E,, E;) with || K || < M (— 68 Com- 
pact and Nuclear Operators C; [17]). An 
integral operator is said to be Hermitian if 
the kernel satisfies k(t, s) 2 k(s, t). A bounded 
Hermitian integral operator is self-adjoint in 
L(Q). 


(2) Differential Operators. For X — L?(0, 1), 
let Dy be the set of all xe C?(0, 1) with com- 
pact support in (0, 1) and D, the set of all 
x€C (0, 1) such that x'(r) is absolutely continu- 
ous in (0, 1) with x"e X. Then the operators 
T;, j —0, 1, determined by (7}x)(t)= —x"(t), 
xéD,, are linear in X. Ty = Tj, so that Tọ is a 
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symmetric operator. Furthermore, T) is a real 


operator with respect to the conjugation x x. 


Since two linearly independent solutions of 
(T, — 4I)x «0 both belong to X, the deficiency 
indices n4 of T, are 2. (Note that p(T))— Ø.) 
Self-adjoint extensions of T, are obtained by 
restricting the domain of T, by boundary 
conditions (— 112 Differential Operators; 


[1,4]). 


(3) Fourier Transforms. For every xe L,(R"), 
I <p<2, 


(Ux)(t) lim ex exp( — its)x(s)ds 


It m 


converges in the norm of L(R", p ! «q^! — 


1. The operator U thus defined belongs to 
B(L,, La. When p=q=2, U is a unitary oper- 
ator in L? (— 160 Fourier Transform). 


(4) Singular Integral Operators. Let Q(t) be a 
bounded measurable function on R" homoge- 
neous of degree 0. Suppose that the integral 
of Q over the unit sphere vanishes and that Q 
satisfies the condition 


1 
| 6! ( sup mu al « o. 
We 

Then for every x(t)e L,(R"), 1 « p « oc, 
O(t — s) 


t—s|>e |t—s|" 





(Tx)(t) -lim | x(s)ds 
| 


el 
converges in the norm of L,(R"), and the oper- 
ator T so defined is a bounded linear opera- 
tor in L,(R"). If n—1 and O(t) 2n 't/|t|, then 
T is the "Hilbert transform. If n» 1 and Q(t)= 
Tun tr 1)/2)n "*9?t/r ja 1,..., n then T is 
called the Riesz transform. 

In general, integral operators on L,(R") 
defined by kernels k(t, s) satisfying the esti- 
mate |k(t, s) x C|t —s| " are called Calderón- 
Zygmund singular integral operators. These 
have been widely investigated [18, 19]. 


(5) Pseudodifferential Operators. Let a(t, x) be 
a function defined on R” x R”. Then the oper- 
ator T defined by 


(Tx)(t) (Dal "2 Jena 1)Å(t)dt 


is called a pseudodifferential operator, and 
a(t, t) is called the symbol of T, where X de- 
notes the Fourier transform of x. This is a 
generalization of the Calderón-Zygmund 
singular integral operator. The following is 
a sufficient condition for T to be a bounded 
linear operator in L,(R") for every 1 «p — 
oo: There exists a constant C such that the 
inequality 


Lait, DICA +r)" 
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holds for every *multi-index « with |a| x n +2, 
and there exists a function œ(ô) such that 
fà o0) 0^! dé < oo and that 

sup Jor, x) — Gza(s, t)| & (0) (1 + |r|) 
It-s| <6 
for every multi-index « with [x| x n - 2 (T. 
Muramatu and M. Nagase, Proc. Japan Acad. 
(1979); [19]). Pseudodifferential operators play 
a decisive role in the modern theory of partial 
differential equations (— 345 Pseudodifferen- 
tial Operators). 


(6) Toeplitz Operators. Let L, be the L;-space 
on the unit circle in the complex plane with 
respect to the linear Lebesgue measure, and 
let H, be the "Hardy space. Each bounded 
measurable function q on the unit circle gives 
rise to the Toeplitz operator T, in H, defined 
by T,f = P(ọf ) for f € H,, where P is the ortho- 
gonal projection from L, to H, The matrix 
(2,,,) of the Toeplitz operator T, with respect 
to the complete orthonormal basis | y, |0 € n « 
oo}, where y,()=¢" for |C| « 1, is given by 
Dan = Q, , Where @, is the kth tFourter coeffi- 
cient of o. If ọ is in the *Hardy space H. then 
the Toeplitz operator T, is subnormal. When 
Q (C) — the Toeplitz operator T, is called the 
shift operator (or shift). A Toeplitz operator T, 
cannot be compact except when o —0. When o 
is a continuous function, the Toeplitz operator 
T, becomes a Fredholm operator if and only 
if o does not vanish on the unit circle, and in 
this case the index of T, is equal to minus the 
winding number of the curve traced out by 

o with respect to the origin. Toeplitz opera- 
tors play an important role in approximation 
theory and prediction theory [20]. 
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252 (XIII.6) 
Linear Ordinary Differential 
Equations 


A. General Remarks 


Let p,(x), .... p,(x), g(x) be known functions of 
a real (or complex) variable x. An ordinary 
differential equation 


y?  pLG)y" +... + Palx) y= q(x) (1) 
containing an unknown function y and its 
derivatives y', y^, ..., y of order up to n is 


called a linear ordinary differential equation of 
the nth order. In particular, a linear differential 
equation 


y+ p (x) y"? +... +p, (x)y=0 (1) 
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with q(x) z is said to be homogeneous. If 
q(x) Z0, (1) is said to be inhomogeneous. The 
existence and uniqueness theorems for solu- 
tions to the initial value problem are valid 

for (1) (— 316 Ordinary Differential Equations 
(Initial Value Problems)). Moreover, the fol- 
lowing theorem holds: Let D denote an inter- 
val in the real line or a domain in the complex 
plane. If the coefficients p,(x), q(x) are continu- 
ous in an interval D, then every solution of 

(1) has D as its interval of definition. If p, (x), 
q(x) are holomorphic in a domain D, then 
every solution of (1) is continued analytically 
along any path in D. Combining this theorem 
and the existence and uniqueness theorems 
we have the following theorem: Let p,(x), 

q(x) be continuous in an interval D. Then for 
every point xo in D and every n-tuple of num- 
bers y, a, ..., n" JL there exists one and only 
one solution y(x) of (1) satisfying the initial 
conditions 


y(xo) =y, y (Xo) = H. tes i es qr) (2) 


and such that y(x), y(x), ..., y (x) are all con- 
tinuous in D. If p,(x), q(x) are holomorphic in 
D, then there exists one and only one solution 
y(x) of (1) satisfying (2) and such that y(x) is 
*complex analytic (not necessarily single- 
valued) in D. 

It follows that every point of discontinuity 
(or singular point) of a solution of (1) is a 
point of discontinutty (or singular point) for at 
least one of the functions p,(x), g(x) (— 254 
Linear Ordinary Differential Equations (Local 
Theory)). 


B. Fundamental Systems of Solutions 


The totality of solutions of a homogeneous 
linear ordinary differential equation forms a 
linear space (over the real or complex field). 
That is, any linear combination y(x)= 

X4 Cy,(x) of the solutions y,, y;, ..., Ym of 
(I), where the C; are arbitrary constants, is 
also a solution of (1’). This is called the prin- 
ciple of superposition. More than n+ 1 solu- 
tions of (1) are always linearly dependent, 
that is, if mz n4- 1, we can find m constants C}, 
C5, ..., C,, not all equal to zero, such that 
ZE, Ciyi(x) 2 0. Equation (1^) has n linearly 
independent solutions. For instance, the n 
solutions y,, y», ..., y, defined by the initial 
conditions 


yiGo)2l, — yi(xo)=0, S VT (xo) 20, 
ya(xo) — 0, V(X) = 1, EK ee EE 
Yn(Xo) =Q, Yn(Xo)=0, KK 39 Tale 1 (3) 


are linearly independent. Such a system of n 
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linearly independent solutions y,,..., y, of 
(1) is called a fundamental system of solutions 
of (1^). In terms of a fundamental system of 
solutions y, ..., y,, each solution y of (1’) 

is represented uniquely in the form y(x)— 
Mia Cy) 


C. Liouville's Formula 


In order for n solutions y4, y>,..., y, of (I) to 
be linearly independent, it is necessary and 
sufficient that the *Wronskian determinant 
W(y;, ya, --- Yn #0 in D. Furthermore, the 
coefficients p,(x) can be represented in terms of 
an arbitrary fundamental system of solutions 
yis fa, Yn Since the coefficient of y"? in the 
expansion of 





C-D'W(y, y1), ¥2(%), «+ Val) (4) 
W(y; (x), ya(x), Kä Vn()) 

is identically equal to p,(x) in D. Using this 
equality for p,(x), we obtain Liouville’s 
formula: 
W(¥1(X), .... Yn()) 
= W(yi(xo), sde! — | nod). 

(5) 


D. Lagrange's Method of Variation of 
Constants 


The difference of two solutions of the inhomo- 
geneous equation (1) is a solution of the homo- 
geneous equation (1^). Consequently, the tgen- 
eral solution of (1) can be represented as the 
sum of a *particular solution of (1) and the 
*general solution of (UL Since a particular 
solution of (1) can be obtained from an arbi- 
trary fundamental system of solutions y,, 
yz. Yn Of (19, (1) can be solved if a funda- 
mental system of solutions for (1°) is known. In 
fact, if we consider C,, C,, ..., C, in the repre- 
sentation y= Èf- C; y;(x), not as constants, 
but as functions of x, and determine them by 
the conditions 


Y169 C109  y209 C2 (x) +--+ y, (x) Cp (x) = 0, 


V(X) Ci (x) + y209 a(x) +... + YO) CLO) = 0, 


YP "()CiGo) rap POCA) +... 
tx Leite = G(x), (6) 


then y(x) 2 222-, Ci(x)y;(x) is a solution of (1). 
This is always possible because from (6) we 
obtain 


4C, ams 
dx 7 W(y, (x), SEHR 
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where W,(x) is the tcofactor of y" (x) in the 
determinant W( y, (x), ..., y,(x)). This method is 
called Lagrange's method of variation of con- 
stants (or variation of parameters). 


E. Linear Ordinary Differential Equations with 
Constant Coefficients 


A linear ordinary differential equation 
y? E a, y" D... q, y +a,y=0 (8) 


with constant coefficients a; has y=exprx asa 
solution if r is a root of the algebraic equation 


f(r)2r" car"! usa, ar a,—0, (9) 


called the characteristic equation of (8). Let r,, 
r?, ..., Fe be the distinct roots of (9), and sup- 
pose that the root r; has multiplicity u; (i= 
1,2, ..., m). Then the set of functions 


pt xam o oxtun lghpes s 
ein", eis... x!m lerm (10) 


is a fundamental system of solutions of (8). 


F. D'Alembert's Method of Reduction of 
Order 


Let y, (x) be a solution, not identically equal 
to zero, of the homogeneous equation (1’). 
By substituting y= y,z into (1^), we see that 
z’ satisfies a linear differential equation of 
order n — 1. This method is called d'Alembert's 
method of reduction of order. Since linear 
ordinary differential equations of the first 
order can be integrated by quadrature (— 
Appendix A, Table 14.I), a homogeneous 
linear ordinary differential equation of the 
second order can be integrated completely if 
one solution of the equation that does not 
identically vanish is known. 

So far we have outlined a general theory of 
solutions of (1) in the domain where solutions 
are continuous or complex analytic, but in 
order to have thorough knowledge of all the 
solutions, we have to examine their behavior 
also in the neighborhood of a singular point 
(which ts a discontinuity point or a singular 
point for at least one of the coefficients) (— 
253 Linear Ordinary Differential Equations 
(Global Theory), 254 Linear Ordinary Dif- 
ferential Equations (Local Theory)). Also, 
boundary value problems are important as 
well as initial value problems described before, 
especially for second-order equations in con- 
nection with mathematical physics [7]. For 
these — 315 Ordinary Differential Equations 
(Boundary Value Problems), 390 Spectral 
Analysis of Operators. 
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G. Systems of Linear Ordinary Differential 
Equations of the First Order 


Let the f; (x) be known functions. A system of 
linear differential equations of the first order 


dy/dx = fy yi tr... fi Cy» +g (xX) 
dy/dx = fz1(X)y1 +--+ fan(X)Yn 9269, 


dy,/dx = fy (X)Y1 + <- + fa X) Yn + Gal) (11) 


with n unknowns y4, y;, ..., y, contains (1) as a 
special case, since (1) is transformed into (11) 
by setting y 2 y,, y =Y2, ..., »" Tas y,. A SYS- 
tem (11) with g,(x)=0, Le, a system 


dy, /dx — fyi(x)y, kr. t fi Gy, 


dy,/dx = f, (x)ys F -+ + fa) Y, (11) 


is said to be homogeneous, while (11) is said to 
be inhomogeneous. 

Suppose that the f;(x), g(x) are continuous 
in an interval D. Then the following theorem 
holds: To every point x, in D and every initial 
condition 


yi(xo)- bi, ya(xo) - b5, s ValXo) =D, 


there corresponds one and only one solution 
that is continuous in D. If the LL g,(x) are 
holomorphic in a domain D, there corresponds 
one and only one solution that is complex 
analytic (not necessarily single-valued) in D. 

It follows that every point of discontinuity 
(singular point) of a solution of (11) is a dis- 
continuity (singular) point for at least one of 
the coefficients f,,(x), g;(x). 


H. Fundamental Systems of Solutions 


If mz n-- 1, m solutions (Yii, Van, Yai) Uz 
1,2, ..., m) of (11^) are linearly dependent, i.e., 
we can find constants C,, C3, ..., Cm, not all 
equal to zero, such that ZE, C;y,;(x) 20 (k= 
1,2, ..., n). System (11°) has n linearly inde- 
pendent solutions. To see this, we have only to 
choose the initial conditions so that 
ya(xog)2l, i=k, 


=0, i#k. 


Such a system of n linearly independent solu- 
tions (Yii Y2is --- Yni) (i= 1,2, ..., n) is called a 
fundamental system of solutions of (11^). In 
terms of this fundamental system, any solution 
(ui... Yn) of (11) is represented uniquely in 
the form y,(x)= 225-4 Cyy,j(x) (k= 1,2, ..., n). 
The linear independence of n solutions 
(Yap ss Vinb s Quo s Van) Is equivalent to 
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the condition that the determinant 


yu). yia) Vin(X) 
A(x)= yaiC9 V22(x) Yon(X) 
Yat (x) Yn2 (x) Ynn(X) 


does not vanish in D. Corresponding to 
Liouville’s formula (5), we have 


EE | al 


I. Method of Variation of Constants 


The general solution (Y;, Y, ..., ¥,) of the 

inhomogeneous equation (11) is given as the 

sum of the general solution (y,, y5, ..., Yn) of 

(11°) and one particular solution (Y, ,, Yo, 
.., Yio) of (11), i.e., in the form 


(yi Yi. Y2 + Y2o, +++» Yn + Yno). 


To obtain the particular solution (Y, o, Y20, 
.. , Ypo), we take any fundamental system of 
solutions 


y17 9 (X). V2 = P2x(X), o Vn = 98x), 
K1,:2: 5 


of (11) and consider the constants u, in the 
linear combination 


NSE Qu(x)u,, i=l; 2 usd (12) 


as functions of x. Substituting (12) into (11), 
we obtain a system of differential equations 


H Pix X) Uy, (X) = gi(x), i= 5 2, eH, (13) 


with unknowns u,. Since the y; form a funda- 
mental system, the determinant of the matrix 
with elements Q(x) does not vanish. Hence 
(13) can be solved in the form 


u(x)-G(x)  k-1,2,...,n, 


and the u,(x) can be obtained by quadrature. 
Consequently, it follows that one particular 
solution can be given, in terms of the u,(x), in 
the form (12). This method is also called the 
method of variation of constants. 


J. Systems of Linear Ordinary Differential 
Equations with Constant Coefficients 


Suppose that the coefficients f;; in (11) are 

all constants. Then n columns of the matrix 
exp(Fx) form a system of fundamental solu- 
tions of (11), where F denotes the matrix [ f;;]. 
Therefore the general solution has the form 


ym à. Pi(xye***, ]SL2.— 
=1 
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Here 4,, 42,...,4,, are the distinct roots of the 
characteristic equation 


fiat fua e Ju 
fa Jah e fe La 
dá p Jun 


and if A, has multiplicity e, (27-, e;— n), P(x) 
is a polynomial of degree at most e, — 1 which 
contains e, arbitrary constants. 

Suppose that the coefficients fj (x) in (117) 
are all periodic functions having the same 
period w. Then there exists a linear trans- 
formation y; — 294 (x)z, in which the q; are 
periodic with period c, such that the original 
equation is reduced to dz;/dx = Lcy,z,, where 
the co are constants (Floquet's theorem). 
Hence if we can find such a linear transfor- 
mation, we can integrate the original equation. 


K. Adjoint Differential Equations 


Consider a linear homogeneous ordinary 
differential equation 


F(y)=poy + p, y^^?  ... E p,y0. 
Integration by parts of [zF(y)dx gives 
zF(y)— yG(z) - d[R(y, z) /dx, (14) 
where 


G(z) - (1 ((poz)? —(p,zy" ? +... 


+(—1)"p,„z), 


R(y, z)= Y 


k-l1h-O 


( —1)^y* 7^ D(p, SH. 


The equation G(z)=0 is called the adjoint 
differential equation of F( y) 2 0, and the iden- 
tity (14) is called the Lagrange identity. By 
integrating (14), we have Green's formula 


| (£F() GG) dx = Ry, z] (4) 


—RLy,2](%o). 


The adjoint differential equation of the adjoint 
equation of F(y) 20 coincides with F(y)=0. If 
y is a solution of F(y)=0, then the solution 

z of the adjoint differential equation satis- 

fies the (n — 1)st-order differential equation 
R(y, z) 2 constant. When G(y)— F(y), F(y) 20 
is called a self-adjoint differential equation. In 
the case of second-order equations with real 
coefficients, its general form is 


F(y) - d(pdy/dx)/dx +qy=0. (15) 


For systems of differential equations, the 
adjoint system of (11^) is defined by 


URL E ee 


j=1,2,...,n. (16) 
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Conversely, (11’) is the adjoint system to (16). 
If (Y1, Y2; Wak (21, Z2, ---, z,) are solutions of 
(11^), (16), respectively, then we have ZIL, y;z; 
— constant. The system (11^) is called a self- 
adjoint system of differential equations if (11^) 
coincides with (16), i.e., if f, (x) = — f(x). 


L. Laplace and Euler Transforms 


When the coefficients p;(x) (i— 1,2, ..., n) in (1^) 
are rational functions, it often happens that we 
can find a solution in the form 


b 
y(x) -j v(t)e"' dt. (17) 


Namely, we can often find a suitable function 
v(t) so that the *Laplace transform (17) of 
vít) is a solution of (1^). Similarly, it is often 
possible to find a solution of (1') as the Euler 
transform 


A b 
=f v(t) (1 — xy dt (18) 
of some suitable v(t). These transforms are 
used for the integral representation of special 
functions. 


M. Linear Ordinary Differential Equations 
and Special Functions 


A number of transcendental functions, such as 
*hypergeometric functions, *Bessel functions, 
*Legendre functions, etc., and Hermite poly- 
nomials, Laguerre polynomials, Jacobi poly- 
nomials, etc., are defined by linear ordinary 
differential equations of the second order (— 
389 Special Functions). 
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253 (XIII.8) 
Linear Ordinary Differential 
Equations (Global Theory) 


A. General Remarks 


Let there be given a linear differential equation 
of the nth order 


y? +a () y"? ... + a (x)y 0, (1) 
or a system of linear differential equations 
y =A(x)y, Q) 


which is the vector-matrix expression of 


j= 2. 8j xls. Jel osa (2) 


where the coefficients a,(x), a, (x) are complex 
analytic functions of x in a certain complex 
domain D. The solutions of (1) or (2) are 
known to be holomorphic when the coeffi- 
cients are all holomorphic. However, at a 
singular point of at least one of the coefficients, 
a ‘branch point of the solution usually ap- 
pears. Thus a solution of (1) or (2) is, in gen- 
eral, a *multiple-valued analytic function of x. 
The object of global theory is the function- 
theoretic study of this function—that is, deter- 
mination of its *Riemann surface and investi- 
gation of its behavior on the Riemann surface. 

At a tregular singular point of (1) or (2), any 
solution can be expressed explicitly by the 
combination of elementary functions and 
power series convergent within a circle around 
the singular point. In the presence of an tirre- 
gular singular point, instead of a convergent 
expression, we can construct an asymptotic 
expansion valid within a certain sector whose 
vertex is situated at the singular point (— 254 
Linear Ordinary Differential Equations (Local 
Theory)). Once such expressions have been 
obtained, the remaining task is to find the 
relations connecting those locally valid ex- 
pressions. Therein lies the main and most dif- 
ficult part of global theory. The problem of 
determining the relations, called the connection 
formulas, is the connection problem. 

Equation (1) or the system (2) is said to be of 
Fuchsian type if all singular points of (1) or (2) 
are regular singular points. If (1) is an equation 
of Fuchsian type defined on the Riemann 
sphere and having singularities at a,, a3, 

<-> Ams Ag+, 00, then we have the Fuchs 
relation 


> EEN )n(n — 1)/2, 


where pj, ..., pj, are the *exponents at a; of (1). 
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B. Monodromy Groups 


Suppose that the coefficients of equation (2) 
are defined on a certain Riemann surface vk 
with singular points a,, a5, .... By deleting a,, 
45, ... from ğ, another Riemann surface Wi is 
obtained. Choose a point x on di, and let Y(x) 
be any fixed branch of a tfundamental system 
of solutions of (2). Also, let F be a circuit on e 
starting from x. By an analytic continuation 
along F, another branch of Y(x) is obtained, 
which we denote by Y(xI). It is known that in 
this case these two branches are connected by 
the relation Y(xI) 2 Y(x)Cy, where Cp is an 

n x n constant matrix, and also that if T} and 
I’, are *homotopic circuits, Cr, — Cr,. So the 
branch Y(xT) and the matrix Cy are deter- 
mined by the *homotopy class y to which I 
belongs. Thus we can write Y(xy) or C, instead 
of Y(xT) or Cr- 

Now let G be the "fundamental group of X. 
Since ¥(xy271) = Y(xy2)C,, = Y(x)C,,C,, for any 
Yı» 72 € G, the correspondence y C, defines a 
'representation of G. The group g={C,|yeG}, 
which is naturally homomorphic to G, is called 
the monodromy group of equation (2). 

For equation (1), we can also define the 
monodromy group of the equation by g — 
{C,|yeG}, where C, is a matrix such that 
(Vi Genk Vax) m (109... Kale, where 
y(x), ..., y, (x) are linearly independent solu- 
tions of (1). 

If the equation is of Fuchsian type, the 
global problem can be regarded as solved 
when the monodromy group of the equation 
has been completely determined. 

If is a complex sphere and the equation 
is of Fuchsian type, the number of singular 
points of the coefficients is of course finite. Let 
a,,...,4, be those singular points, and y, be 
the homotopy class of %’ determined by a 
closed curve T, that encloses only one sin- 
gular point a,. Then the monodromy group g 
is generated by the matrices C, , ..., C, . 
Obviously C, > Cyn are not necessarily in- 
dependent. At least one relation Cyes Cpa =L 
(a unit matrix) always holds. Fn this case, 
‘Jordan canonical forms of C, are all de- 
termined from the convergent expression for 
the fundamental system of solutions valid 
around a, constructed by the famous fFro- 
benius method. However, the calculation of C,, 
itself is generally impossible. 

If n — 2, m=3, and the coefficients are all 
rational functions of x, equation (1) or (2) is 
completely determined if we fix the roots of 
tindicial equations at every a,, as long as the 
equation is of Fuchsian type. Therefore the 
monodromy group is determined by the values 
of the roots of indicial equations. Since these 
roots are calculated purely algebraically, the 
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monodromy group is determined by algebraic 
procedure in this case [5]. 

Let n=2, m=3 in equation (1). Denote by a, 
b, c the three singular points and by 4, 4^; u, u’; 
y, Y' the roots of indicial equations at a, b, c, 
respectively. As was mentioned in the pre- 
vious paragraph, the equation is determined 
uniquely by these nine quantities. Hence they 
also determine a family of functions consisting 
of all the solutions of the equation. This family 
is usually written as 


a b c 
P«A qu wv x 
A! vu v 


and is called the P-function of Riemann [2]. A 
simple transformation of variables reduces it 
to the totality of solutions of Gauss’s thyper- 
geometric differential equation 


x(L—x)y"+(y—-@+ B+ I)x)y’—aBy=0. (3) 


Some solutions of (3) are expressed by a hyper- 
geometric integral 


| t* (t — 1y 797! (r— x) dt, 
C 


where C is a suitably chosen path of integra- 
tion [2] (— 206 Hypergeometric Functions). 

Calculation of the monodromy group of the 
equation (1) or (2) is still an unsolved problem 
except for the case n —2, m — 3 and a few other 
particular cases. 


C. Equations with an Irregular Singular Point 


In the presence of an irregular singular point, 
complete knowledge of the monodromy group 
is still insufficient for the solution of the global 
problem. It is only the structure of the Rie- 
mann surface that is known from the mono- 
dromy group, and the behavior of the solution 
on the Riemann surface still remains to be 
studied. At an irregular singular point, the 
solution can be expressed only by an tasymp- 
totic series valid within a certain sector, and 
the same solution possesses completely differ- 
ent expressions in different sectors. This is 
called tStokes’s phenomenon (— 254 Linear 
Ordinary Differential Equations (Local The- 
ory)). Thus, to complete the global theory, 
connection formulas between different asymp- 
totic expressions must be established. 

For a second-order linear equation with two 
singular points, one of which is regular and the 
other irregular of the first rank, the problem is 
completely solved. In this case, the equation 
can be reduced to a tconfluent hypergeometric 
differential equation [2] (— 167 Functions of 
Confluent Type). The problem is also partly 
solved for a linear equation of higher order 
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with two singular points, one of which is regu- 
lar and the other irregular (K. Okubo, J. 
Math. Soc. Japan, 1963). 

If two singular points are both irregular, 
even the monodromy group cannot be cal- 
culated in general. For such a case, G. D. 
Birkhoff proposed a method of reducing one 
of the singular points to a regular one. He 
showed that this procedure is possible under 
certain assumptions on the monodromy ma- 
trix (Math. Ann., 1913). 


D. Riemann's Problem 


As a noteworthy result for equation (1) of 
Fuchsian type with algebraic coefficients, 
Poincaré's theory deserves special mention. 
According to his theory, a solution of (1) can 
be uniformized in the form y= f(z), x ^ g(z), 
where f and g are single-valued analytic func- 
tions of z. Although it is known generally that 
any analytic function admits such uniformiza- 
tion (— 367 Riemann Surfaces), Poincaré's 
theory gives a more explicit and efficient uni- 
formizing construction. As uniformizing para- 
meter z, we may take a ratio of two indepen- 
dent solutions of a certain linear differential 
equation of the second order that is deter- 
mined from (1), and f and g are, in general, 
*Fuchsian functions, i.e., tautomorphic func- 
tions for a certain *Fuchsian group, save for a 
few exceptional cases in which they are ra- 
tional or elliptic functions. 

Brief mention should be made of Riemann's 
problem as a problem closely related to the 
global theory of linear differential equations. 
This problem was taken up by "Hubert in his 
famous Paris lecture as the 21st problem, and 
hence is often called the Riemann-Hilbert 
problem. The problem can be stated as fol- 
lows: Suppose that we are given a Riemann 
surface %, points a,, 45, ... on iy, and a group 
g of nx n matrices homomorphic to the funda- 
mental group of && — (a,,a;, ...]. Then find an 
equation of the form (2) such that (i) the coeffi- 
cient A(x) is single-valued and meromorphic 
on %; (ii) the singular points are all regular and 
situated at a,, a5, ...; and (iii) the monodromy 
group of the equation coincides with g if a 
fundamental system of solutions is suitably 
chosen. Extensive research was done by many 
mathematicians, and finally H. Róhrl suc- 
ceeded in solving the problem (Math. Ann., 
1957). 


E. Isomonodromic Deformations 


M. E. R. Fuchs considered the equation 
d^ y/dx* = p(x)y, (4) 


937 


where p(x) is given by 
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and he proposed the following problem: Ob- 
tain conditions among the parameters a, b, c, 
d, t, 4, a, B such that the monodromy group of 
(4) is kept invariant when these parameters 
vary, under the hypothesis that a fundamental 
system of solutions around x = 4 of (4) does 
not contain logarithmic terms. It is clear that 
a, b, c, d remain constant. Fuchs obtained a 
necessary and sufficient condition which is 
stated as follows: 4, «, f are considered as 
functions of t and 4 satisfies the tPainlevé 
equation (VI) and a, f are rational functions of 
A and dA/dt (Math. Ann., 63 (1907)). 

The result of Fuchs was extended by R. 
Garnier into two directions: Considering equa- 
tions of the form (4) with irregular singular 
points, Garnier derived the Painlevé equations 
(D-(V). Then from equation (4) with 











a b N g d 
leg the- xx-D) 
N 3 a 
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ay h; 


he obtained a system of partial differential 
equations called the Garnier system (Ann. Sci. 
Ecole Norm. Sup., 1912). 

For equations with irregular singular points, 
the problem must be modified as follows: 
Deform equations so that not only the mono- 
dromy group but also the system of tStokes 
multipliers are kept invariant. This modi- 
fied problem is called the isomonodromic 
deformation. 

L. Schlesinger studied the isomonodromic 
deformation of the system 


dy N Aj 
ae SA 


and he derived a tPfaffian system, called the 
Schlesinger equations (Crelles J., 1912). 

The research of the theory of isomono- 
dromic deformation has recently become 
active. K. Okamoto has investigated the iso- 
monodromic deformation of Painlevé equa- 
tions, Garnier systems, and linear ordinary 
differential equations in detail, and T. Miwa 
and M. Jimbo have extended the Schlesinger 
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equations. Also — 288 Nonlinear Ordinary 
Differential Equations (Global Theory). 
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254 (XIII.7) 
Linear Ordinary Differential 
Equations (Local Theory) 


A. Singular Points 


Consider a system of n linear ordinary dif- 
ferential equations 

aa A(x)y, (1) 
dx 

where the independent variable x belongs to a 
domain in the *Riemann sphere, y is a complex 
n-dimensional column vector (y,(x), y2(x), ..., 
y, (X)), and the n x n matrix A(x) has tcomplex 
analytic functions as elements. A singular 
point x =a (#00) of A(x) is called a singular 
point of the system (1). Let 


dy/dt = B(t)y (2) 


be the system transformed from (1) by the 
change of variable t 2 1/x. The point x= oo is 

a singular point of the system (1) if t—0 is a 
singular point of (2). By definition a point x =a 
or x=% is a singular point of (1) ift 20 isa 
singular point of the transformed equation (2) 
by the change of variable t2 x —a or t=1/x. It 
follows that by the use of tlocal coordinates 
the notion of singular points of the system can 
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be extended to the case when A(x) is complex 
analytic on a *Riemann surface. 

Consider a single nth-order differential 
equation defined on a Riemann surface 


y? La, (x) y" P+... +a,(x)y=0. (3) 


The above definition of singular points may 
easily be adapted for use in equation (3), 
because (3) can be converted into a system (1) 
by a simple transformation. 


B. Classification of Singular Points 


Suppose that A(x) is holomorphic in a deleted 
neighborhood 0 « |x| « R of x «0. Then any 
solution of (1) can be continued analytically in 
0 « |x| « R and is not necessarily single-valued 
(— 252 Linear Ordinary Differential Equa- 
tions). For simplicity, we denote by xe?"' the 
terminal point of a closed path starting and 
ending at x and surrounding x —0 once in the 
positive sense. Let y,(x), ..., y,(x) be a funda- 
mental system of solutions of (1). Then the 
fundamental matrix solution Y(x) —(y,(x), 

.., y. (X)) undergoes a linear transformation 
Y(xe?") = Y(x)M when x is transformed to 
xe?" where M is a nonsingular constant ma- 
trix. The matrix M is the monodromy matrix 
(or circuit matrix) of Y(x) at x 20. If we take a 
matrix S such that M — e?"55, then there exists 
a matrix P(x) whose elements are holomor- 
phic functions in 0 « |x| « R and such that 
Y(x) has the form Y(x)= P(x)x?, where x? is 
defined by x? 2 exp(Slog x). 

If for a solution y(x) of (1) and an arbitrary 
sector X there is a positive number r such that 
lim, .o|x|"y(x)=0, x =0 is a regular singular 
point of the solution y(x); and if there is no 
such number, x =0 is an irregular singular 
point of y(x). If x 20 is a regular singular point 
of all the solutions of (1), it is a regular singular 
point of the system (1). If some of the solutions 
have x =0 as an irregular singular point, then 
it is an irregular singular point of the system 
(1). A necessary and sufficient condition for 
x=0 to be a regular singular point of the sys- 
tem is that an arbitrary fundamental matrix 
solution Y(x) of (1) in the form Y(x)= P(x)x? 
as described above have x =0 as a pole of P(x). 
In the same way, we can give the definition of 
regular singular points and irregular singular 
points for equation (3). 


C. Regular Singular Points 


Consider equation (3). A necessary and suffi- 
cient condition for x 20 to be a regular sin- 
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gular point of (3) is that every a,(x) have a pole 
of order at most k at x =0. Consequently, we 
can write (3) in the form 


x" y - A,(x)x" 1 7E... ALX)y-0, (4) 


where each A,(x) is holomorphic at x —0. 
Equation (4) has a fundamental system of 
solutions of the form y = x^* P,(x,log x), where 
p, are n roots of the indicial equation at x =0 of 


(4): 
p(p—1)...(p—n-4 1) - A,(0p(p— 1)... 
(p—nc2)4 ...- A,-,(0)p + A,(0) «0, 


and P,(x, L) is a polynomial in L of degree 

at most equal to the number of roots of the 
indicial equation that are congruent to py 
(modulo integers) with coefficients holomor- 
phic functions of x. The quantities p, are 
called the exponents at x — 0 of (3). If the real 
part of p, is largest among the real parts of the 
roots p; that are congruent to p, modulo in- 
tegers, there exists a solution with the expo- 
nent p, and not containing the logarithmic 
term. (A number a is congruent to b modulo 
integers when a — b is an integer.) In particular, 
no solution contains the logarithmic terms 
when no pair of roots of the indicial equation 
is congruent modulo integers. Frobenius's 
method is convenient for finding these solu- 
tions (+ Appendix A, Table 14). 

We return to system (1). The following 
theorem gives a simple sufficient condition for 
x —0 to be a regular singular point, but there 
is no simple necessary condition: x 20 is a 
regular singular point of (1) if x 20 is a simple 
pole of A(x). Then (1) is written as 


x(dy/dx) = C(x)y, (5) 


where C(x) is holomorphic at x —0. Equation 
(5) has a fundamental system of solutions of 
the form y, =x?*p,({x, log x), k= 1,...,n, where 
the exponents p, are n roots of the indicial 
equation at x =0 of (5): det(C(0)— pl!) 20, and 
the p,(x, L) are vector functions whose compo- 
nents are polynomials in L with coefficients 
holomorphic at x =0. If none of the exponent- 
differences p;— p, (J# k) is equal to an integer, 
then there is a fundamental system of solutions 
of the form y, — x^*p, (x). The above result is 
derived from the following theorem: There 
exists a transformation y = P(x)z that takes (5) 
into a system 


x(dz/dx) = Dz, 


where P(x) is a matrix holomorphic at x 2 0 
and D is a constant matrix. 

If all solutions of (1) or (3) are meromorphic 
at x=0, then x =0 is called an apparent sin- 
gular point. 
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D. Irregular Singular Points 


If at least one of the coefficients a,(x) of equa- 
tion (3) has a pole of order more than j, then 
x =0 is an irregular singular point of (3). Let m, 
be the order of the pole of a,(x) at x —0, with 
the convention that m;= oo when a;(x) — 0; and 
assume that mo =0 for simplicity. Let A, (v= 
0, 1, ..., n) be the points with coordinates 
(v, m,) in the Euclidean plane and 77 be the 
upper half of the boundary polygon of the 
‘convex hull of the set | 46, A1, -.-, Aal, This 
polygon is called the Newton diagram of equa- 
tion (3). Let (v, r,) be the intersection of the 
straight line x 2v with 77, and consider the 
nonincreasing sequence of numbers {0;}, 0, = 
r,— ry 4 (ry — 0). We assume u to be an integer 
such that o, 205 2 ...20,7120,44 EA 
To the first u sections of the diagram 77 with 
slopes o; 1 there correspond u formal solu- 
tions of the following form that are formally 
linearly independent: 


yc exp(A(x)) x^ P, (x, log x), 


where the 4,(x) are polynomials in fractional 
powers of x ! and the P,(x, L) are polynomials 
in L with coefficients given by formal series in 
fractional powers of x. Associated with the 
sections of /7 corresponding to those n— u 
numbers o, 1, there are n — linearly inde- 
pendent formal solutions of similar form but 
without the exponential term. 

Consider a system 


x'*' (dy/dx)— A(x)y, (6) 


where r is a positive integer and A(x) is holo- 
morphic at x 20. The system (6) has n formally 
linearly independent solutions: 


y, = exp(A,(x))x^*p,(x, log x), 


where the A,(x) are polynomials in fractional 
powers of x! and the p,(x, L) are n-vectors 
whose components are polynomials in L with 
coefficients given by formal power series in 
fractional powers of x. This result is obtained 
from the following theorein: There exists a 
transformation y = P(x)z that changes (6) 
formally into the system 


x'*! (dz/dx) 5 (A(x) 4- Jx")z, 


where P(x) is a matrix with elements given by 
formal power series in fractional powers of x, 
A(x) is a diagonal matrix with diagonal ele- 
ments x”*! /; (x), and J is a constant matrix in 
Jordan canonical form. A necessary and suffi- 
cient condition for x =0 to be a regular singu- 
lar point of (6) is that A(x) vanish identically. 
Unfortunately, formal power series appear- 
ing in the formal solutions of (3) and (6) are in 
general divergent series. H. Poincaré, by intro- 
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ducing the notion of tasymptotic expansions, 
first proved under a very restrictive hypothesis 
that these formal solutions represent asymp- 
totically actual solutions in any small sector. 
Contributions in this direction had been made, 
notably by W. J. Trjitzinski and J. Malm- 
quist, but a decisive result was obtained by M. 
Hukuhara, whose method is also applicable to 
the study of regular singular points. 

Let y1(x), ..., y (x) (6! (x)) be a fundamental 
system of solutions of (3) (fundamental matrix 
solution of (6)) that are expressed asymptoti- 
cally by n formal solutions in a sector D, and 
yi(x), ..., y2(x) (P? (x)) be another fundamental 
system of solutions of (3) (fundamental ma- 
trix solution of (6)) of the same nature ex- 
pressed by the same formal solutions but in a 
different sector D,. Then the two systems 
(two matrix solutions) are, in general, not the 
same system (matrix solution), but they are 
connected by a constant matrix C: 


(y169. «Vn OD) = (V7), «Va ONC 
(bh (x)= @7(x)C). 


This ts the so-called Stokes phenomenon, and 
the elements of the matrix C are called Stokes 
multipliers. The problem of determining these 
multipliers ts a kind of tconnection problem, 
and the above equality is a ‘connection for- 
mula (— 253 Linear Ordinary Differential 
Equations (Global Theory)). 

For an equation of the form (3) (system of 
the form (6)) with an irregular singular point 
at x —0, the polynomials 2, (x), ...,4,(x), the 
quantities p, ..., p,, and a set of Stokes multi- 


‘pliers form a complete system of invariants 


under linear transformations of the form 


z ^ qi(x)y  qa(x)y' +... oq (x)y j 


(z= QC9y). 


where the q,(x) (the elements of Q(x)) are mero- 
morphic at x —0. 

Even if equation (3) has x =0 as an irregular 
singular point, it may happen that (3) admits 
solutions holomorphic at x =0. This was first 
studied by O. Perron and then by F. Letten- 
meyer, M. Hukuhara, and Iwano, and by H. 
Komatsu. G. D. Birkhoff proved that when the 
monodromy matrix at zero of the system (6) 
can be diagonalized, there ts a nonsingular 
matrix P(x) (det P(0) 40) such that the linear 
transformation y — P(x)z transforms (6) into 


E. Singularities with Respect to a Parameter 


An analogous theory has been obtained for a 
system of first-order linear differential equa- 
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tions with a small complex parameter e: 


d 
p A(x, €)y, 
dx 


A(x, e) = p, A,(x)e*, (7) 


where h is a positive integer, the n x n matrices 
A, (x) (k=0, 1, ...) are single-valued holomor- 
phic functions of x in a neighborhood D of 
the origin, and the tasymptotic expansion is 
valid when eO in a sector X. When all eigen- 
values of the matrix A,(0) are distinct, there 

is a matrix of formal solutions of the form 
P(x, £)e9-?, where P(x, £) is a formal series of 
the form A(x, £), Q(x, £) is a diagonal matrix 
with polynomials in Lie of degree h as diagonal 
elements, and all the coefficients are single- 
valued holomorphic functions of x on D* 

(c D). In particular, the coefficient of z^" in 
the jth diagonal element of Q(x, c) is (5 u(t) dt, 
where u;(x) is an eigenvalue of the matrix 
Ao(x). If we take a certain subsector 2* of X, 
the matrix P(x, c)e9*-? represents an actual 
matrix solution in Z*. When there is no fturn- 
ing point (— Section F), it is always possible, 
even if there are multiple eigenvalues in A,(0), 
to construct formal solutions that are asymp- 
totic representations of some solutions in 
some sector. Similar theories were developed 
for cases where more than two parameters 
appear. 


F. Turning Points 


Consider the point x =0 in the system (7), and 
0 1 

set n2, h=1, and WEI 3 Then 
X 


A(x) has a multiple root when x =0 and dis- 
tinct roots when x 40. As in this example, 
when the Jordan canonical form of the leading 
matrix A,(x) has different structure for x =0 
and for x #0, the coefficients of the formal 
power series in c are not single-valued holo- 
morphic functions of x, and have worse sin- 
gularities as the order becomes higher. Con- 
sequently, in the neighborhood of x 20, we 
cannot construct actual solutions that can 

be represented asymptotically by these formal 
solutions. Such a point is called a turning point 
(or transition point) of the system (7). 

If there is a nonsingular formal transforma- 
tion y= T(x, e)z; T(x, e) - X T,(x)e* (det Ty(0) # 
0) having similar analytic properties to those 
of A(x, £), and if the transformed system has 
a well-known form, it is possible to give ana- 
lytic meaning to the formal transformation 
T(x, ¢). Then the transformed system is called 
a related differential equation of (7); in the 
above example, e(dz/dx) = Ag(x)z is a related 
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differential equation that has well-known solu- 
tions expressed by *Bessel functions. What is 
meant by well-known here is that the behavior 
of all solutions is known in the entire complex 
plane for a fixed c. However, it is not easy to 
find a suitable related differential equation 

for an arbitrarily given system. 
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A. Problems 


A linear programming problem is a special type 
of mathematical programming (— 264 Mathe- 
matical Programming) in which all the func- 
tions involved, i.e., the objective function and 
the constraints, are linear in the variables. 

The simplest typical case is formulated as 
follows. 

Problem I: Maximize z=c’x, under the 
condition 


(C) Ax=b and xz0, 


where xc R" is the vector to be determined, c is 
a constant n—real vector, b is a constant m- 
vector, and 4 is an m x n real matrix. 

Without any loss of generality we can as- 
sume that the rank of A is equal to m, because 
otherwise the condition is either redundant or 
inconsistent. This also implies that m x n. 

Any vector which satisfies condition (C) is 
called a feasible solution, and that which maxi- 
mizes z among all feasible solutions is called 
an optimal solution. Let A, be an m x m non- 
singular submatrix of A, i.e., a matrix consist- 
ing of m linearly independent columns of the 
matrix A. Denote by A, the m x (n — m) matrix 
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formed by the remaining columns of A. Let x, 
be the m-vector consisting of the components 
of x corresponding to A, and x, be the vector 
of the remaining components. Then the equa- 
tion Ax =b can be written as A, x; + A;x,— 
b, and one of its solutions is given by x, — 
A, ! b and x, —0, which is called a basic solu- 
tion. Furthermore, if x, 2 0, the basic solution 
is called a basic feasible solution, and if it is 
also optimal, it is a basic optimal solution. 
Then the following theorem can be proved. 

Theorem: If there exists a feasible solution, 
there is also a basic feasible solution, and if 
there exists an optimal solution, there is also a 
basic optimal solution. 

Corresponding to a basic solution, we can 
derive the expression x, + Ay! A,x,=A, | b; 
z —(€5; A; 1 Aj)x, «0. Such an expression 
is called a basic form of the problem, and the 
components of x, are called the basic vari- 
ables. The basic solution x, = A, ! b, x, — 0, 
is an optimal solution if and only if 


(Q) Aj'bz0 and q-c,— 4, A( "c, «0, 


which is called the optimality criterion. 

Since there are at most ,C,, basic solutions, 
we can always find an optimal solution, if 
there is one, in a finite number of steps. 

Another form of the linear programming 
problem is as follows. 

Problem II: Maximize z —c'x, under the 
condition 


(C) Ax<b and xz 0. 


The two formulations are equivalent be- 
cause Problem II can be transformed into 
Problem I by introducing a new nonnegative 
m-vector s and writing the equation as Ax + 
s=b. Such a vector s is called the vector of 
slack variables. Conversely, Problem I can be 
formulated as Problem II by imposing the 
conditions Ax x b and Ax z b instead of the 
equality Ax — b. 

Sometimes it happens that for some of the 
variables nonnegativity conditions are not 
assumed. Then for the vector x? of such vari- 
ables we can define x* —x^ =x°, x * 20, and 
x 20, and we can assume that all the vari- 
ables are nonnegative. 


B. Duality 


Problem II has the following dual problem. 
Problem III: Minimize w — b'y, under the 
condition 


(C) A'y2c and yz 0. 


For Problem I, if we restate the equality 
condition as Ax xb and — Ax < — b, then the 
dual problem can be written as: Minimize 
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w-by*-b'y under the condition A’y* — 
A'y zcy'20,y^ 20. Then by putting y= 
y —y ,wecan formulate the dual problem 
as follows. 

Problem IV: Minimize w — b'y, under the 
condition 


(C") A'yzec, with no restriction on the sign 
of y. 


In contrast to the dual problem, the original 
problem is called the primary problem. The 
following theorem is basic to the theory of 
linear programming. 

Duality theorem: If either one of the primary 
or dual problems has an optimal solution, 
then the other also has an optimal solution, 
and it holds that max z = min w. Moreover, if 
x* and y* are optimal solutions of the two 
problems, we have c'x* =b’y* = y*' Ax*. 

Conversely, if x* and y* are feasible solu- 
tions of the primary and dual problems and 
if we have c'x* = b'y* = y*' Ax*, then x* and 
y* are optimal solutions of the respective 
problems. 

Let xf = A, ! b, x$ =0 be a basic optimal 
solution of Problem I. Then the condition in 
its dual problem is written as A; y 2c, and 
A5y Z €5. Put y* = A4 "c4; then from the opti- 
mality condition it is shown that 45y* >c, 
and also Aj y* —c,; hence y — y* is a feasible 
solution of the dual problem. It is obvious 
that e'x* 2c; xf =c, A; ! bz b'y* y* A, xt = 
y" Ax*, and for any feasible solution y of the 
dual problem, we have b'y = x* A; y 2 x*c— 
b'y*, which establishes that y* is an optimal 
solution. 

The duality theorem can be formulated in 
a more general way: Let V and W be closed 
convex cones in R" and R", respectively. Then 
we state the following problems. 

Primary problem: Maximize z —c'x, xe R", 
under the condition b — Axe V and xe W. 

Dual problem: Minimize w= b'y, y eR", 
under the condition A'y - ce W* and yeV*, 
where W* and V* are the dual cones of W and 
V, respectively. 

We can also write maxz — oo if the value of 
z is not bounded, and max z= —oo if there is 
no x which satisfies the condition; similarly, 
min w — —oo if w is not bounded from below, 
and min w = oo if there is no y satisfying the 
condition. 

Theorem: If either max z z —oo or min w Æ 
oo, we have max z = min w. Moreover, if —oo 
<maxz=minw=c’x* — b'y* < oo, we have 
y" Ax* =c'x* =b’y*. 

Define the expression @(x, y) 2 e'x - b'y — 
y Ax — ex t y (b— Ax) — b'y — x'(A'y —c) for 
xc W and yeV*. o can be regarded as a 
Lagrangian form for both the primary and 
dual problems. And if (x*, y*) is a pair of op- 
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timal solutions, we have o(x*, y) > o(x*, y*) > 
(x, y*) for all xe W and ye V*, which means 
that (x*, y*) is a saddle point of the function o. 
Now suppose that in the primary problem the 
constraint vector b can change, and consider 
maxz to be a function of b, denoted by z(b). 
Then under a small change of b at least some 
of the optimal solutions of the dual problem 
will remain optimal; hence for any vector a we 
have 


inf a'y* «lim P sib + ta)—z(b))< sup a/y*, 
Viet? t ytey* 
where Y* is the set of the optimal solutions 
of the dual problem. Because of this property 
the solution of the dual problem is called the 
vector of shadow prices or of the imputed costs 
of the constraints. Symmetrically, the solution 
of the primary problem describes the rate of 
change in the value of the objective function 
under a small change in the constraint vector 
in the dual problem. l 
The duality theorem is closely related to 

some theorems on systems of linear inequal- 
ities and convex cones in a finite-dimensional 
Euclidean space; especially, the following are 
equivalent to or easily derivable from the 
duality theorem. Minkowski-Farkas theo- 
rem: Given an equation Ax — b, where b is an 
element of R", a necessary and sufficient condi- 
tion for a solution x 20 to exist is that u'b > 
0 hold for any vector u such that w A >0. 
Stiemke theorem: For a matrix A one of the 
following two alternatives holds: (i) Ax «0, x > 
0 have a solution; (it) uA z 0 has a solution. 
Tucker's theorem on complementary slackness: 
For any matrix A, the two systems of linear 
inequalities (i) Ax 20, x 20, and (ii) u Az 0 
have solutions x, u satisfying A'u +x >0. The 
minimax theorem for tzero-sum two-person 
games (— 173 Games Theory C) with finite 
number of strategies for both players is also 
shown to be equivalent to the duality theorem. 
It is formulated as 
min max y Mx = max min y Mx, 

yeS, xes] xeS, yeS, 

where M is the payoff matrix and S, and S, 
are tsimplexes of mixed strategies. 


C. Algorithms 


The most commonly used method for solving 
linear programming problems numerically is 
the simplex method introduced by van Dantzig 
[1] and its variations. It gives a procedure 
starting from one basic feasible solution to 
reach an optimal basic solution in a finite 
number of steps by improving the value of the 
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objective function at each step. Let 


xit Y dyxj=g;, iel, 


jeJ 


z+)" fxj;=v 

jeJ 
be a basic form for Problem I, where J denotes 
the set of the basic variables and J the set of 
nonbasic variables. Then the feasibility implies 
that d; 20 for all ie I. Furthermore, if f,>0 for 
all jeJ, the basic solution x;=g; for i in J and 
x; —0 for j in J is optimal. If fj. « 0 for some j* 
in J, define r; — g;/d;;« for i in I and for which 
d, >0. Let rẹ 2 minr;; then we can delete i* 
from the set 7 and add j* to it and get a new 
basis, and by simple algebraic calculation we 
get a new basic form corresponding to the 
new basic feasible solution, for which the value 
of z is increased by — fir... If rx is always 
positive, we can get an optimal basic solution, 
since the above procedure cannot continue in- 
definitely; and even when r; becomes zero (the 
degenerate case), we can avoid infinite circular 
repetition by using a method proposed by 
R. G. Bland [16]. And fj. «0 and d; <0 for all 
i implies that the value of z is unbounded. The 
array of the coefficients of the basic form is 
called the simplex tableau, and the method is 
called the simplex method. In order to obtain 
a basic feasible solution, the two-phase simplex 
method is used. Assume that in Problem I the 
vector b is nonnegative. Then we introduce a 
new m-vector u called the vector of artificial 
variables, and formulate an additional prob- 
lem as follows. 

Problem Ia: Maximize t= —l'u under the 
condition Ax -u- b and x 20, uz 0. 

This problem can be solved by the simplex 
method starting from the basic solution u=b, 
X — 0, and if we get to an optimal solution 
with t=0, we have a feasible solution for the 
original problem, from whence we can proceed 
with the original problem. 

If there some inequalities in the condition 
we transform them to equalities by introduc- 
ing slack variables, and then apply the simplex 
procedurc. Once an optimal basic solution is 
obtained, a solution of the dual problem is 
easily obtained from the relation y* = Aj! b. 
The dual simplex method utilizes the primary- 
dual relationship. 

Recently, L. G. Khachiyan [7] proposed a 
new linear-programming method that gives an 
optimal solution within predetermined ac- 
curacy of approximation in a number of steps 
bounded by a polynomial in the numbers of 
the variables and constraints, a property the 
simplex algorithm fails to have. His method is 
basically an iterative procedure to find a solu- 
tion x satisfying a system of strict inequalities 
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a;x bh. i=1,...,m, where the b; and the com- 
ponents of the a; are all integers. Let x?" be 
any vector and BY = KI, where K is a posi- 
tive constant defined in terms of the a; and 
the b,, and the sequence of vectors x“ and 
matrices B™ are defined by 





(a. 
x «0 i d 
3 
n+l da Bea, 
2 (ia ar BY 
pm, CU (ges 2 B'"ajajB 
n^—1 n+1 a;Ba, 


when the vector x violates the inequality a;x 
« bj, and continue until a solution is obtained 
or the number of steps reaches some constant. 
In the latter case it can be proved that the 
system of inequalities does not have any feasi- 
ble solution. This algorithm can be applied to 
solve Problem II in the following way: Due to 
the duality theorem, the optimal solution of 
Problem II can be characterized as vectors 
satisfying b'y x ex, Ax <b, — Ay x —e,x 20, 
y 20. Then the system is approximated by 
another system with integer coefficients, and 
then by a system of strict inequalities, and the 
above algorithm can be applied to this ap- 
proximated system. This method is based on a 
principle entirely different from the simplex 
method of computing successively the centers 
of ellipsoids of indefinitely decreasing size and 
containing a subset of feasible solutions. 

Although Khachiyan’s method has a mathe- 
matically appealing property, in practical 
applications the simplex method and its vari- 
ations still seem to be the most efficient gen- 
eral method for the numerical solution of 
linear programming problems. 

Some special types of linear programming 
problems allow specific algorithm for solution, 
e.g., the transportation problem has the struc- 
ture: Minimize 2; X;cjx; under the condition 
Yjxj2 a; Li Xy<b;, x;z:0. This can be solved 
by any one of several simple intuitive methods. 
Various types of linear programming problems 
are discussed as problems of maximizing flows 
on networks (— 281 Network Flow Problems). 
Also, problems with the further condition that 
some or all of the variables are integers can 
profitably be discussed separately (— 215 
Integer Programming). 


D. Generalizations and Applications 


Linear programming in the sequence space 
()={X — (xj | Z2, |x| < 00} was treated by P. 
C. Rosenbloom [127]. In this case, the require- 
ments for variables X are given in terms of 
‘linear functionals 4;€(1)* — (m) (i= 1,2, ... , k) 
for example, as (1) equalities 4;(X) — c; or (i^ 
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inequalities A;(X) € c;, and (ii) x; >0 (Vj). The 
space (/) is supplied with the *weak topol- 

ogy as the conjugate space of (cj) - {X = 

(xch | lim; o x; — 0J. Let & be the set of all X 
satisfying (i) (or (i')) and (ii). Let A(X) bea 
*weakly upper semicontinuous functional, 
and suppose that the A;(x) are tweakly lower 
semicontinuous, y #@, and A(X) is upper 
bounded. Then the maximal value of A(X) is 
attained at an extreme point of %, and further- 
more, if the space is completely regular, then 
the solution is unique. If % is bounded, it is 
the convex hull of its extreme points, i.e., the 
smallest closed convex set containing its ex- 
treme points. By applying the theory in this 
section to the family of functions that can be 
expressed as f(x) — ZZ: a;o(x) in terms of a 
given system of functions (x) (j=1,2,...) on 
R, S. N. Bernshtein’s approximation theory of 
function systems, the theory of absolutely 
monotonic functions, and several inequalities 
in the theory of functions of a complex vari- 
able can be treated in a unified fashion. If 

the theory is further extended to the case of 
'finitely additive measures defined by means of 
linear functionals on the Banach space of 
bounded functions, we may treat the extremal 
problems of linear functionals on the func- 
tion space of all f(x) that can be expressed as 
f) 9 fs K(x, s) du(s), and apply it to obtain the 
interpolation formula of nonnegative thar- 
monic functions, results of Carathéodory and 
Fejér on its Fourier coefficients, an analog of 
Harnack's theorem for the *heat conduction 
equation, and so on. 

The extension of linear programming theory 
to linear topological spaces is due to L. Hur- 
wicz [5]. Let Z^ be a linear space, Y, be 
linear topological spaces, Py, P; the nonnega- 
tivity cones of Y, Z, respectively, which are 
closed convex cones containing inner points, 
and D a convex set in 4. We assume that F 
and A are linear mappings from D into 3 and 
Z, —R, Z is "locally convex, and Py, P; are 
closed convex cones in X and &, respectively, 
and we consider the following. 

Problem L: Maximize F(X)= X*(X) under 
the condition G(X)= A(X)— Bz0, X >0, 
Be®. 

Put ®(X, Z*) = X*(X)+ Z*(A(X)— B). If we 
define a linear mapping T from W=2 xR 
into Y^ =Z x W by T((X,p)) -(A(X)—pB, 
(X, p)), where pe R, then under the condition 
that the image under T* of the nonnegativity 
cone of Y^* is a fregularly convex set in *Y^*, 
Xo is a solution of problem L if and only if 
there exists a Z$ e Z* such that (Xo, Z3) is a 
nonnegative saddle point of ®. 

Under conditions expressed in simpler 
terms, Isii [6] proved the coincidence of the 
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supremum of the objective function and 

inf. supy, p of the *Lagrangian form, gave 
conditions for the supremum and the infimum 
to be attained, and developed a theory that 
generalizes the *Chebyshev inequality and the 
*Neyman-Pearson fundamental lemma for 
testing statistical hypotheses. 


E. History 


The theory of linear programming is closely 
related to the theory of convex cones, convex 
polyhedra, and systems of linear inequalities. 
Concerning *polyhedral convex cones, fconvex 
polyhedra in R", and the algebraic theory of 
systems of linear inequalities, we have classical 
results due to P. Gordon (1873), J. Farkas [2], 
E. Stiemke [13], H. Weyl [15], etc., and a later 
refinement due to A. W. Tucker [10, paper 1]. 

The application of linear systems to eco- 
nomics was made possible through the works 
of J. von Neumann, especially his tgame the- 
ory and balanced linear growth model [14]. 
These and the interindustrial input-output 
analysis of W. Leontief [11] led to the works 
assembled in 1951 by T. C. Koopmans and 
others [9]. Concerning practical computa- 
tion and applications to industry, there are 
isolated and long-neglected works by L. V. 
Kantorovich [8]; however, the main part 
of the method was developed in the United 
States, especially after the discovery of the 
simplex method by G. B. Dantzig [1] and his 
followers. 

Linear programming has become one of the 
most important techniques in operations re- 
search, and, following upon the development 
of computers, has found wide application to 
practical problems. Most contemporary large- 
scale computers are equipped with programs 
to solve linear systems problems. 
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A. Definition 


Suppose that we are given a set Land a ‘field 
K satisfying the following two requirements: (1) 
Given an arbitrary pair (a, b) of elements in L, 
there exists a unique element a + b (called the 
sum of a, b) in L; (11) given an arbitrary element 
ain K and an arbitrary element a in L, there 
exists a unique element «a (called the scalar 
multiple of a by x) in L. The set Lis called a 
linear space over K (or vector space over K) if 
the following eight conditions are satisfied: (1) 
(a 4- b) - c — a 4- (b +c); (11) there exists an ele- 
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ment Oe L, called the zero element of L, such 
that a+O0=0+a=a for all ae L; (itt) For any 
ae L, there exists an element x = —ae L satisfy- 
ing a+x=x+a=0; (ivy) a+b=b+a;(v) «(at 
b)=aa+ ab; (vi) (aB)a = a(fla); (vii) (a+ f)a = 

xa 4- Ba; (viti) 1a —a (where 1 is the tunity 
element of K). An element of K is called a 
scalar, and an element of L is called a vector. 
K is called the field of scalars (basic field or 
ground field) of the linear space L. 

In the definition of linear spaces, K can be 
noncommutative. L is also called a left linear 
space over K since the scalars act on L from 
the left (aa, a€ L, a. € K). A right linear space 
is similarly defined. Actually, a left (right) 
linear space is a unitary left (right) K-module. 
If K is commutative, it is not necessary to 
specify left or right, since we can identify «a 
and aa. In this article we consider only linear 
spaces over commutative fields. A similar 
theory can be established for linear spaces 
over noncommutative fields (— 277 Modules). 

If K is the field of real numbers R or the 
field of complex numbers C, a linear space 
over K is called a real linear space or complex 
linear space, respectively. In the following 
discussion, we fix a field K, and by a linear 
space we mean a linear space over K. 


Examples. (1) Geometric vectors: In a tEu- 
clidean space or, more generally, an taffine 
space, the set of vectors PO associated with 
points P, Q in the space forms a linear space. 

(2) n-tuples in K: K" denotes the set of all 
sequences (2,, ...,x,) of n elements in a field 
K. Defining two operations by (2,, ...,o,) 4 
(Bs... B) m (a + Bry sss t BA Alta «++ y= 
(A8, ..., A4x,)(4 € K), the set K” forms a linear 
space over K. An element of K" is called an n- 
tuple in K, and o; is called the ith component of 
(24, ..., Dal In general, the inner product of a= 
(a,,...,%,), b — (f, ..., Ba) is defined by (a, b)= 
Xi-19;f;. However, when K =C (complex 
number field), we usually define (a, b) to be 
Ma uf. 

(3) Sequences in K: All infinite sequences in 
a field K form a linear space over K under the 
operations defined in example (2). 

(4) K-valued functions: Given a nonempty 
set J and a field K, let KI be the set of all K- 
valued functions defined on I. Defining two 
operations by (f+ g)(x)=f(x)+ g(x), AS)(x) 
=A f(x) (xel,A€K), the set K’ forms a linear 
space over K. In particular, if we put 1 = 
{1,...,n}, then the space K' can be identified 
with the space of n-tuples given in example (2), 
and if we put / 2 N (natural numbers), then we 
obtain the space given in example (3). Let K be 
the field R of real numbers and / an interval in 
R. The set C(I) of all continuous functions on 
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I, the set D(I) of all differentiable functions on 
1, and the set A(7) of all treal analytic func- 
tions on / are all linear spaces contained in the 
space RI. 

(5) Polynomials in K: K[X,, ..., X,] denotes 
the set of all polynomials of n variables with 
coefficients in a field K. This forms a linear 
space under the usual operations. 


B. Linear Mappings 


Let L, M be linear spaces over a field K. A 
mapping ¢ from L to M is called a linear map- 
ping or linear operator if ọ satisfies the follow- 
ing two conditions; (i) (a + b) 7 o(a) + o(b); 
and (ii) o(4a) = Ao(a) (a, be L, że K). Namely, 

a linear mapping is a K-homomorphism be- 
tween K-modules (— 277 Modules). Regard- 
ing K as a linear space, a linear mapping 
LK is called a linear form. A linear mapping 
from L to L is called a linear transformation 
of L. The identity mapping of L is a linear 
transformation. Given linear spaces L, M, N, 
and linear mappings o: LM and y: MN, 
the composite y o o: LN is also a linear 
mapping. If a linear mapping 9: LM is 
*bijective, then the inverse mapping o !: Mo 
L is also a linear mapping. Such a mapping 

ọ is called an isomorphism, and we write 

L=M if there exists an isomorphism LA A 
linear transformation L  L is called regular 
(or nonsingular) if it is an isomorphism. 


Examples. (1) Let L be the linear space formed 
by all geometric vectors in a Euclidean space 
(affine space) E. Then a tmotion (taffine trans- 
formation) of E induces a linear transforma- 
tion LoL. 

(2) Let (x) be an m x n tmatrix in K. As- 
signing (n, ...,5,) to (£,,..., En), where Hz 
27-1056; (1 &i € m), we have a linear map- 
ping K"> K”. 

(3) Assigning the derivative f" to a real- 
valued differentiable function f on an interval 
I, we have a linear mapping D(I) R/. 


C. Linear Combinations 


Let L be a linear space over a field K. An 
element of L of the form oa, +... + oa, 

(x;€ K, a;€ L) is called a linear combination of 
a,,...,d,. A sequence a,, ...,a, of elements in L 
is called linearly dependent if there exists a 
sequence a,, ..., o, of elements in K such that 
not all the o; are equal to 0 and ja; +... + 
Xna, —0. A sequence of elements in L is called 
linearly independent if it is not linearly depen- 
dent. Suppose that there exists a linearly in- 
dependent sequence of n elements in a linear 


256 D 
Linear Spaces 


space L, and no sequence of n+1 elements in 
L is linearly independent. Then n is called the 
dimension of L and is denoted by dim L. If 
there exists such a number n, L is said to be 
finite-dimensional. Otherwise, L is said to be 
infinite-dimensional. In an infinite-dimensional 
linear space, there exist linearly independent 
sequences of elements having arbitrary length. 
The linear space K" of n-tuples in K is of di- 
mension 7. 

A sequence (a,, ...,a,) of elements in a linear 
space L ts called a basis if every element a of L 
is uniquely written in the form a=a,a,+...+ 
An, (xE K, i—1,..., n). This means that the 
linear mapping K"— L assigning «,a,+...+ 
Xna, E Lto (a,,...,%,)€K" is bijective and 
hence an isomorphism. The condition that 
(4,, ...,a,) is a basis of L is equivalent to any 
two of the following three conditions: (i) 

(d,, ...,a,) is linearly independent; (ii) every ele- 
ment of L is a linear combination of a,, ...,a,; 
(iii) L is of dimension n. It follows that the 
length n of a basis (a,,...,a,) is equal to the 
dimension and hence is independent of the 
choice of basis. In the expression a= È o;a;, %; 
is called the ith component (or ith coordinate) of 
the element a relative to the basis (a,, ...,a,). 


D. Spaces of Linear Mappings (Finite- 
Dimensional Case) 


Let L, M, and N be finite-dimensional linear 
spaces over a field K. The set Hom,(L, M) of 
all linear mappings L M is a linear space 
under the operations defined by (9+ 9^)(a) = 
Q(a) + o (a), (Ap)(a) = 29(a) (ae L, Ae K). 

Let (a,, ..., aj), (b4, ..., Del be bases of L, M, 
respectively. Then any linear mapping o:L -» 
M can be represented by an m x | matrix 
(x) determined by o(aj) 7 X; bia; (1 «j«l). 
This assignment o —>(x;) gives an isomorphism 
from the linear space Hom,(L, M) to the 
linear space of all m x l matrices (= 269 Ma- 
trices). In addition, let (c,, ...,c,) be a basis of 
N, and let the n x m matrix (f,;) represent a 
linear mapping y: M >N. Then the composite 
mapping y o q: LN is represented by the 
product (f) (x) of the matrices (f,;) and (al 
The set of all linear transformations of a linear 
space N of dimension n forms an 'associative 
algebra over K which is isomorphic to the 
*total matrix algebra M,(K) of degree n under 
the correspondence q (a). Its tinvertible 
elements are regular linear transformations, 
and they form a group which is denoted by 
GL(N) and called the *general linear group on 
N. This corresponds to the group GL(n, K) 
formed by all n x n *invertible matrices under 
the isomorphism ọ —(a;;). 
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E. Infinite-Dimensional Linear Spaces 


In this section, we consider only the algebraic 
aspects of infinite-dimensional linear spaces 
(for the topological aspects — 422 Topological 
Abelian Groups L; 424 Topological Linear 
Spaces). Let {a,},., be a family of elements in 
a linear space L. A linear combination of the 
family is an element of L in the form €,.4«;a; 
(x E€ K, where «, =0 except for a finite number 
of å). The family (a;];.4 is called linearly in- 
dependent if no linear combination Sa Da: 
is equal to 0 unless all the coefficients x, are 
equal to 0. The family {a,},-, is called a basis 
of L if every element of L is uniquely written in 
the form $,.42;a;. These notions are gen- 
eralizations of those defined for finite A. In 
general, A is an infinite set. Any linear space 

L has a basis (— 34 Axiom of Choice and 
Equivalents C). The cardinality of a basis is 
determined by L. Two linear spaces are iso- 
morphic if and only if their bases have the 
same cardinality. 


F. Subspaces and Quotient Spaces 


Let L be a linear space over a field K. A non- 
empty subset N of L forms a linear space over 
K under the induced operations if the follow- 
ing two conditions hold: (i) a, be N implies 

ac beN;and (ii) 4e K, ae N imply 4ae N. In 
this case, the subset N is called a linear sub- 
space of L (or simply subspace of L). The 
canonical mapping o: NL defined by io) 
=a (ae N) is an injective linear mapping. 

Let S be a nonempty subset of L. The set of 
all linear combinations of elements in S forms 
the smallest subspace of L containing S; this 
space is called the subspace generated (or 
spanned) by S. For subspaces N, N' of L, the 
intersection NM N' and the sum N + N'— {a+ 
a' |ae N,a'e N') are both subspaces. Similar 
propositions hold for an arbitrary number of 
subspaces. If N, N' are of finite dimension, the 
equality dim N + dim N' 2 dim(N N N') 4 
dim(N + N’) holds. We say that L is decom- 
posed into the direct sum of N, N' if every 
element of L can be uniquely written in the 
form a -- a' (ae N, a'e N’). This is the case if 
and only if L is generated by N and N' and 
NIIN 10]. In this case, N’ is called a com- 
plementary subspace of N. Any subspace has a 
complementary subspace. For direct products 
and sums of linear spaces — 277 Modules F. 

An equivalence relation R in a linear space 
L is said to be compatible with the operations 
in L if the following two conditions hold: (i) 
Rio, a’) and R(b, b’) imply R(a+ b, a' +b’); 

(ii) R(a, a’) implies R(Aa, Aa’) (4€ K). Then 
the *quotient set L/R, namely, the set of all 
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equivalence classes, forms a linear space over 
K under the induced operations; this is called 
the quotient linear space (or simply quotient 
space) of L with respect to R. The canonical 
mapping o9: L9 L/R, given by ae (a) (ae L), is 
a surjective linear mapping. The equivalence 
class N containing 0 forms a subspace of L, 
and the equivalence class containing ac L is 
the coset a+N={a+b|beN}. We have 

Rio, a’) if and only if a —a' e N. Conversely, for 
any subspace N, there exists an equivalence 
relation R compatible with the operations 
determined by Rio, a") if and only if a—a'e N. 
The quotient linear space L/R thus obtained is 
denoted by L/N and called the quotient (linear) 
space of L by N. If L/N is finite-dimensional, 
its dimension is called the codimension of N 
relative to L and is denoted by codim N. 

Let o LM bea linear mapping of linear 
spaces. Its image (L) is a subspace of M, and 
the kernel N = (ae L| p(a)=0} is a subspace of 
L. The mapping o induces an isomorphism 
Q:L/N —o(L) (— 277 Modules E). If L is finite- 
dimensional, dim L — dim N 2 dim q(L). The 
dimension of the image of o is called the rank 
of ~, and the dimension of the kernel of o is 
called the nullity of o. The rank and nullity of 
an m x n matrix (2) (— 269 Matrices) are the 
respective rank and nullity of the linear map- 
ping K"— K" represented by the matrix (a 
(— Section B, example (2)). 


G. Dual Spaces 


Let L be a linear space over a field K. The set 
Hom,(L, K) of all linear forms on L is a linear 
space, denoted by L* and called the dual 
(linear) space of L. The space L* is the *dual 
module of L as a K-module (— 277 Modules). 
For elements a of L and a* of L*, we denote 
the element a*(a) by (a, a*» and call it the 
inner product of a and a*. For a linear map- 
ping o: L- M, we define a linear mapping 

tp: M* 5 L* by ('9)(b*) - b*oq (b*e M*). The 
mapping ‘g is called the dual mapping (trans- 
posed mapping or transpose) of o, and is deter- 
mined by the relation (a,'o(b*)» = (o(a), b*» 
(ae L, b*e M*). We have (o; 4-95) 2'9, +'@>, 
Woolz look, 1, le, If o is tsurjective, 
then 'ọ is *injective, and if q is injective, then 
um is surjective. If ọ is bijective, then 'ọ is also 
bijective. The rank of ‘ coincides with the 
rank of q if the rank of q is finite. For an 
isomorphism qo: L— M, the inverse mapping 
(p '=@:L* >M* of'o is called the contragre- 
dient of o. We have (V o 9) — y o. 

Given a subspace N of a linear space L, the 
subspace {a* € L*|<a,a*) 20 (ae N)] of L* is 
denoted by N + and is called the subspace (of 
L*) orthogonal to N. Then we have the canon- 
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ical isomorphisms (L/N)* x NT, N* = L*/NT., 
Similarly, given a subspace N' of L*, we obtain 
the subspace N’* of L orthogonal to N’: N’+ 

= {ae L|<a,a*> =0 (a*e N')). Thus we 

have a one-to-one correspondence between the 
finite-codimensional subspaces of L and the 
finite-dimensional subspaces of L* by assign- 
ing N+ to N and N’+ to N'. The codimension 
of N is equal to the dimension of N+. If L is 
finite-dimensional, we have a canonical iso- 
morphism L ~(L*)* and a one-to-one corre- 
spondence N 5 N' 2 N7 between the set {N} of 
all subspaces of L and the set {N’} of all sub- 
spaces of L*. These properties of correspon- 
dence between the subspaces of L and L* are 
called duality properties of the linear space. 

Let (e,, ....e,) be a basis of a linear space L. 
Then the system of elements (ef, ..., e*) in L* 
defined by the relation Ze, e*5 =0 (ij), 
<e; ef» — I forms a basis of L*, called the dual 
basis of (e, ...,e,). Thus a finite-dimensional 
linear space L can be identified with its dual 
space utilizing the isomorphism given by as- 
signing each element of the dual basis to the 
corresponding element of the basis in a natural 
manner. 


H. Multilinear Mappings 


Let L, M, N be linear spaces over a field K 
and f be a mapping from the Cartesian prod- 
uct M x N to L. Suppose that for any fixed 

be N, the mapping M Lassigning f(x, biet, 
to xe M is linear, and for any fixed ae M, the 
mapping N >L assigning f(a, y)e L to ye N is 
also linear. Then f is called a bilinear mapping 
from M x N to L. The set of all bilinear map- 
pings from M x N to L forms a linear space 
under the operations (f - g)(x, y) 2 f(x, y) 
g(x, y), Af) y) — A fG y) (iE K); this space 

is denoted by (M, N; L). In general, for linear 
spaces M,,...,M,,a mapping f: M, x ... x M, 
> Lis called a multilinear mapping if it is linear 
in each variable. The set of all multilinear 
mappings from M, x... x M, to Lforms a 
linear space, denoted by .Z(Mi,, ..., Mp; L). If 
L=K, a bilinear mapping and a multilinear 
mapping are called a bilinear form and multi- 
linear form, respectively. 


Suppose, in particular, that M, =... = M, = 
M. A multilinear mapping f: M, x... x M, 
L is called symmetric if f(x,,,, ...,X,4,) = 


fG x) (x;EM) for any permutation o 

of (1, ..., n]. Also, f is called alternating if 
F(X 5 E Ei Xq) =O for x, =x;, ij. In 
this case, f(x, ..., Xom) sgno- f(x,, .... Xp) 
for any permutation c (sgno is +1 if ø is teven 
and —1 if c is todd). On the other hand, f is 
called skew-symmetric (or antisymmetric) if it 
satisfies this equality. Therefore, if the charac- 
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teristic of the field K is different from 2, a 
skew-symmetric mapping is alternating. 

Let M, N be linear spaces over a field K and 
® be a bilinear form on M x N. The mappings 
dg: N -» M*, sg: M>N* defined by B(x, y) 
(do(y))(X) = (s9())()) (x € M, ye N) are linear 
mappings. If M, N are finite-dimensional, 
then d4 and sg have the same rank, called 
the rank of ®. Let (x,, ..., Xm) (y4, ..., Yn) be 
bases of M, N and (xf, ..., x*), (y, ..., y*) 
be their dual bases. Then we have del d 
MO XP O(X;, yj. So(x) = 25-1 P(x; yi) yp 
The matrix (®(x;, y,)) is called the matrix of 
a bilinear form 6 relative to the given bases, 
and its rank coincides with the rank of ®. If 
dg, Sg are both injective, they are also bijec- 
tive, and in this case ® is said to be nondegen- 
erate. Then each of dg and sg can be regarded 
as the transpose of the other, and M, N can 
be identified with N*, M*, respectively. In 
particular, if is a nondegenerate bilinear 
form on M x M, we have an isomorphism 
from M to its dual space M*; identifying M 
with M* by this isomorphism, M is said to be 
self-dual. 

Let M bea linear space over a field K. A 
mapping Q: MK is called a quadratic form 
on M if the following two conditions hold: (1) 
Q(ax) 2 a?Q(x) (xe K, xe My; and (ii) the map- 
ping 6: M x MK defined by ®(x, y) 2 Q(x 4- y) 
— Q(x) — Q(y) (x, ye M) is a bilinear form on 
M x M. In this case, ® ts called the bilinear 
form associated with the quadratic form Q, and 
it can be shown to be symmetric. We have 
(x, x) - 2Q(x) (xe M) and O(x)=(1/2)®(x, x) if 
the characteristic of K #2. In general, for any 
bilinear form f: M x M >K, the mapping Q: M 
>K defined by O(x)= f(x, x) is a quadratic 
form. If (x,, ..., x,) is a basis of M, a quadratic 
form Q is expressed as follows: Q(327., €;x;) 
= X; p 015,5; (the sum over all unordered 
pairs {i,j}), where aj — Q(x;), oj; Do, xj) 
(ixj) (— 348 Quadratic Forms). A metric 
vector space is a linear space M supplied with 
a nondegenerate quadratic form Q on M, and 
is denoted by (M, Q). The bilinear form ® 
associated with Q gives an inner product D(x, y) 
(x, ye M). 


I. Tensor Products 


Let M, N be linear spaces over a field K. The 
tensor product M © N of M, N is defined as 
follows and can be used to “linearize” bilinear 
mappings from M x N to any linear space. Let 
F be the linear space generated by M x N and 
R be the subspace of F generated by all ele- 
ments of the forms (x +x’, y) ^ (x. y) ^ (X , y), 

(x, y y) — (x, y) — (x, y’), (ax, y) — xo y), (x, xy) 
— a(x, y)(x,x'eM, y, y eN, «e K). Then the 
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quotient space F/R is denoted by M GN, 
and the canonical projection F M @ N is de- 
noted by y. Given an element (x, y)e M x N, 
we denote the image w((x, vil by x & y. The 
bilinear mapping M x NM Q N assigning 
x & y to (x, y) is called the canonical bilinear 
mapping. We have, by definition, (x +x’) G y 
—xG TE Gyx&í(yty)2xQytxQy, 
(ax) 69 y 2 a(x € y) 2 x @ (ay) (we K ). To em- 
phasize the basic field K, we sometimes write 
M Gy N instead of M GN. 

The tensor product can be characterized by 
the property that for any linear space L and 
any bilinear mapping f: M x N >L, there 
exists a unique linear mapping 9: M @N-L 
satisfying f(x, y) 2 «(x ® y). Thus assigning 
the bilinear mapping f: M x N — L defined 
by f(x, y) 2 (x & y) to a linear mapping 
9:M @ NL, we obtain an isomorphism 
Hom(M ® N,L)z Z(M, N; L). Every element 
of M &) N can be expressed as a finite sum of 
elements of the form x & y (xe M, ye N). If 
{Xi}iers {Yj} jes are bases of M, N, respectively, 
then the family (x; ® ehn jes forms a basis of 
M GN Hence if M and N are of finite dimen- 
sion, dim(M & N )- dim M dim N. 

Let M,, M,,... be linear spaces over a field 
K. We have a unique isomorphism M; ® M, 
— M; ® M, that assigns x, ® x, to x, @x, 
(x; € Mj). We also have a unique isomorphism 
(M, & M) & M,—5 M, ®© (M, ® My) that as- 
signs x, ® (Xz & x3) to (x, @ x5) @ x4 (x€ Mj); 
hence we can identify (M, © M;) © M, and 
M, & (M; © Mj), and we denote them sim- 
ply by M, © M, © M,. In general, assigning 
X1 69... 9 x, to (x4, ..., x,), we obtain the 
canonical multilinear mapping M, x... X M, 
>M, Q GM. As before, given any linear 
space L, we have the natural isomorphism 
Hom(M, e. GM. Liz Z(M,, ..., M, L). 
Conversely, given linear spaces M,,..., MA, the 
space M, ® ... & M, can be characterized as a 
linear space N with a given multilinear map- 
ping v: M, x ... x M,—N such that (i) N is 
generated by the image V (M, x ... x Ma); and 
(ii) for any multilinear mapping f: M, x... x 
M, 9 L, there exists a unique linear mapping 
f':NoLsatisfying f =f' ow. The tensor prod- 
uct M, & ... © M, is sometimes written as 
G9 7-1 M; and an element x, ©... © x, is writ- 
ten as (9;-, Xj. 

Given linear mappings f;: M; M; (1 <i<n), 
there exists a unique linear mapping f : M, 
®...®M,>M', ©... & M, satisfying f(x, 
Q... Q x) — fix) ©...  f,(x,); we denote 
the mapping f by f; & ... & f, or Gét: f; and 
call it the tensor product of the f, (1 «ix n). 
The assignment (fi, ...,f,) fi D ... Of, 
gives an isomorphism Gi: Hom(M,, Mj) 
Hom(697-, Mi Gi, Mj) if the M, are finite- 
dimensional. If in particular M =... = M,= 
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K, we have an isomorphism @?_, MF» 
(697-1 M;)* under the identification Get, M; = 
K given by the assignment x, © ...@ x,» 
x’... Xa Explicitly, the isomorphism f: 

-1 M#>(@ iL, Mj)* is determined by 
SBi xPO Xi) = Tar eu xf? (xf EMF, 
x;€ M). 


J. Tensors 


Let EO? (1 <4 <k) be linear spaces over a 
field K. If EO =... = E® = E, then Qf- EM 
is written @*E and called the tensor space 
of degree k of E (G)? E denotes K). Also, 
(GO^E) & (G9*E*) is written T7(E), where E* 
is the dual space of E. We have TP (E) = (^E, 
TA(E)-— Q *E*, and TQ(E)— K. T/(E) is called 
the tensor space of type ( p, q) of E, and each of 
its elements is called a tensor of type (p, q). In 
particular, a tensor of type (p, 0) is called a 
contravariant tensor of degree p, and a tensor 
of type (0, q) is called a covariant tensor of 
degree q. A tensor of type (0,0) is a scalar. An 
element of T} (E) — E is called a contravariant 
vector, and an element of TP (E) — E* is called 
a covariant vector. If p #0, q #0, a tensor of 
type (p,q) is called a mixed tensor. 

Let (e,, ...,e,) be a basis of E and (f £, 

., f") be the basis of E* dual to (e,, ...,e,). 
Then the tensors e; ®@ ... e; OQ f^  ... & f^ 
(i4, ju, m, ...,m A=1,...,p; u— 1, ..., q) form 
a basis of T7(E). Therefore any tensor of 
type (p, q) can be written uniquely in the form 


EECHER 


Also, GET is called the component of t relative 
to the basis (e,, ...,e,), the index i, is called a 
contravariant index, and the index j, is called a 
covariant index. 

Let (e,, ...,e,) be another basis of E and 
(f, ..., f") be its dual basis. Suppose that we 
have 


Then we have 


2 Bias =; 


k=1 


and the transformation formula 


dE eg L. E Ki Kp at. e. d af : 
where the des e are the components of t rela- 
tive to (e, ..., e) and the E ? are the compo- 
nents of t relative to (e, , ney 

In the tensor calculus, an index appearing 
after the symbol 2; is called a dummy index if 
it appears in both the upper and the lower 
positions. For example, in the expression 
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27, & y , the index i is a dummy. As a conven- 
tion, we sometimes omit the symbol 7 , for a 
dummy index i; for example, by č; y! we mean 
the sum X’; £j yi. This convention is called 
Einstein's convention. Using it, we write the 
previous transformation formula as 


Sg m Be Big agin, 
We have a nondegenerate bilinear form ® 
on TP (E) x T(E) determined by 


d 


p 4 
=|] <x x>] y 
i=1 j=l 


Thus the space T/'(E) can be identified with 
the dual space of T(E), and vice versa (— 
Section H). In this identification, the basis 

(e, & ... Ge, & f^ & ... & f^) of T/(E) and 
the basis (e; e. Ge, AS 8... QS") 

of T(E) are dual to each other. In addition, 
combining the natural isomorphism T4(E)* = 
-P(IT'E, IT^E*; K) with the duality 7,3(E)* = 
TP (E), we have a natural isomorphism T/(E) 
> LTTE, I]"E*; K). Explicitly, identifying an 
element te TP(E) with the multilinear form 
[T'E x II^E* ->K corresponding to it under 
the natural isomorphism, we have 


xis Yl sy) Gt t. Ehn.. -Nip 


where (£71 is the component of x, € E = T} (E), 
[nij is the component of y* e E* = TI (E), and 
Gr) is the component of te T7(E). By the 
natural isomorphisms TP (E) = Z(T[^E*, K), 
T? (E)= V(TIE, K), a contravariant tensor of 
degree p and a covariant tensor of degree p 
can be identified with a multilinear p-form on 
E* and on E, respectively. 


K. Tensor Algebras 


There exists a unique bilinear mapping T/'(E) 
x T(E) T, (E) which assigns the element 
x10)... x,09 y4 9 ... Oy, CO xt @ -OFO 
yi Q ... yf to the pair (x, @ ... G x, A xf @ 
-- OXF, V1 8... y, G yt G ... @ yF); we 
denote the element assigned to the latter by 
t&u, where t=x,@®... Ox Oxf @...@ xk, 
u—y, 6... y,  yT @... @ y*, and call it 
the product of t and u. If the components of t, 
u,t®u are 
(uch Iech Quan) 
then we have 
Gt 
Let T 2 be the direct sum of T7(E) (p,q 

—0,1,2, ...). Then T(E) is an associative 
algebra over K whose product is a natural 
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extension of the product &. We call T(E) the 
tensor algebra on E. The direct sum of T(E) 
(p=0, 1,2, ...) forms a subalgebra of T(E), also 
called the (contravariant) tensor algebra. 


L. Contractions 


The contraction of T,?(E) relative to the kth 
contravariant index and the /th covariant 
index is by definition the linear mapping 

Cf: TP(E)> TE (E) determined by assigning 
(xo AT PX, DI... QO Xp) DÉI X41 ©... OX, @ AT 
® ... Oxf, @ xf... @ xf tox, @...@xX,@ 
xf @ ... @ xz, where (x,, xf» is the inner 
product of x,, xf. For a tensor t of type (p,q), 
the tensor Cf(t) of type (p — 1, q— 1) is called 
the contracted tensor of t. If the components 
of t are BU the components of C7(t) are 
given by 


n 
ii-i RETE eo 
4 TERR d heikai" 
SS 


M. Tensor Representations 


For a linear mapping f: EF, the tensor 
product f & ...& f: TJ(E) — G9^E G9"F 

= TI(F)is denoted by (P The f? (p= 

0,1,2, ...) give an algebra homomorphism 
Èp- Id (E) 9 Èpo T9 (F). Next, let f be an 
isomorphism and f —!f ! be its contragre- 
dient. Then f, denotes the tensor product 
fe... f: TE = GE GF*— TA) 
and ff the tensor product f? & f,: TP (E) 

> T?(F). The mapping f? is an isomorphism, 
and the system { fP} (p,q—0, 1,2, ...) gives rise 
to an algebra isomorphism T(E) T(F). In 
particular, if f is a nonsingular linear trans- 
formation of E, then f? is a nonsingular linear 
transformation of the linear space T/(E), and 
the assignment f 2 f? gives a group homo- 
morphism GL(E)>GL(T,?(E)); this homomor- 
phism is called a tensor representation of the 
group GL(E). 


N. Symmetric and Alternating Tensors 


A contravariant tensor of degree p is called 
symmetric (alternating) if the corresponding 
multilinear p-form, under the natural isomor- 
phism TF (E)= A(T]’E*, K), is symmetric 
(alternating). A covariant tensor is also called 
symmetric (alternating) if the corresponding 
multilinear form is symmetric (alternating). A 
skew-symmetric (or antisymmetric) tensor is 
defined similarly. We reformulate these defi- 
nitions under the assumption that the field K 
is not of characteristic 2. Let &, be the group 
of all permutations of 1,..., p (the tsymmetric 
group of degree p). For any oe G,, we have 
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a unique linear transformation TE (E) TJ (E) 
assigning X,-1(1)@ ... ® x,-1 to x, G9... 

& x,. This transformation is nonsingular 

and is also denoted by oc. Similarly, we have 

a unique nonsingular linear transformation 
of T? (E), also denoted by c. An element te 

Td (E) (or e T? (E)) is symmetric if and only 

if ot —t for all ce &,, while t is alternating if 
and only if ot — (sgno)t for all ce &,. Let the 
čini» (or the E, be the components of t. 
Then t is symmetric (alternating) if and only if 
the components are symmetric (alternating) 
relative to permutations of the indices i,, ..., i 
The linear transformation $,— Ze, of 

TJ (E) or T?(E) is called the symmetrizer, and 
A, — Zone, (Sgn glo is called the alternizer. For 
any t, S,f is a symmetric tensor, and A,t is an 
alternating tensor. 

The subspace of T(E) consisting of all 
symmetric (or alternating) tensors is invar- 
iant under the transformation R: GL(E)— 
GL(T?(E)), the tensor representation where 
(t)= R(o)(t) for pe GL(E) and te T?(E). 


p 


O. Exterior Product 


For simplicity, we assume that the basic field 
K is of characteristic 0. We denote by N, the 
kernel of the alternizer A,: T(E) TE (E), 
namely, the subspace consisting of all t satisfy- 
ing A,t=0, and by APE the quotient space 
Tj (EY N,. The image of x, @ ... @ x, (x;€ E) 
under the natural mapping TZ(E)— APE is 
denoted by x, ^... ^x, and is called the ex- 
terior product of x,, ..., x,. The linear space 
APE is called the p-fold exterior power of E. 
We have 


Xp Avr AHR )AL. AX, 


E A AC P XGA AEN AG NX 


=OA(X,A...AX 
and for every qe. 
ei ^ A Een 7 (SE 0)X, A... ^ Xp. 


A, induces a natural isomorphism APE = 
Ar, where 9I? consists of all contravariant 
alternating tensors of degree p. Thus an ele- 
ment of APE can be identified with a con- 
travariant alternating tensor of degree p. 
Then we have A,(x, G9 ...  x,)z X4 ^... ^X 
Similarly, A"E* is identified with the linear 
space consisting of all covariant alternating 
tensors of degree p. An element of APE is 
sometimes called a p-vector, and an element 
of APE* is called a p-covector (— 90 Coordi- 
nates B). If (e,,...,e,) is a basis of E, then the 
ei, A. AG, (iy «i5 <... <i,) form a basis of 
NE, and any element of AE is written in 


p 





95] 


the form t=), <.<% e; ^... ^e; OF t 

-(/p) i, "T Quote, NS get In the latter 
form, «':^» is alternating relative to permuta- 
tions of i4, ..., i, and it is the component of t. 


n 
The dimension of APE is equal to ( ) and 
p 


APE=Oif p» n. If (f, ..., f") is the dual basis 
of (e,, ..., e), the inner product of an element t 
m, eei A t6) AA e; € NE and an ele- 
ment s= ij, is Hh dco ^A foe APE* 
is defined by 
(t= Y cef 

ij <...<ip 
Then we have (x, A...AXp, nh A Yp? 
— det((x;, yj»), where <x;, yj? is the inner 
product of x;e E and ve E*. By this inner 
product, we can identify A?E* with the dual 
space of APE. 

The tensor product ("E x (9'E G9?" *E 
induces naturally a mapping ©: @?E/N, x 
G)*E/N,— GO" "" EJN, .,. Using this bilinear 
mapping D: APE x A*E—A^*E, we define the 
exterior product t ^s of an element te A"E and 
an element se A4E by t^s (t, s). Then t^s is 
an element of A?**E, and we have tas= 
(—1)P4S At, (xq ^... A Xp) Asi ^ A Xp 4) = 
NAT ANLE 

We denote by AE the direct sum of APE (p 
— 0, 1,2, ..., n), and define the product of two 
elements x= 375.9 x^, Y= X5-oy" (x^, ye APE) 
by XA y laan tf ^ y*. Then the product ^ 
satisfies the associative law. We call AE the 
exterior algebra (or Grassmann algebra) of the 
linear space E. If E is of dimension n, AE is 
of dimension 2”. If (e, ...,e,) is a basis of E 
relative to K, AE is sometimes written as 
elen, €n). The exterior algebra AE* of the 
dual space E* is similarly defined and can be 
considered as the dual space of AE. 


P. Semilinear Mappings 


Let L be a linear space over a field K and L' 
be a linear space over a field K’. A pair (9, p) 
consisting of a mapping o: L L' and a map- 
ping p: Ko K' is called a semilinear mapping if 
the following four conditions hold (for conve- 
nience here we write the scalars to the right of 
the vectors): (1) o(a + b) 2 e(a) 4- o(b); (ii) o(aA) 
 q(a)p(4); (iii) p(x + B) — p(a) + p(B); and (iv) 
plab) = p(x) p(B) (a, be L; A, o, Be K). In this 
case, o is sometimes said to be semilinear 
relative to p (— 277 Modules L). Conditions 
(iii) and (iv) mean that p is a field homomor- 
phism. If K = K' and p is the identity, then the 
semilinear mapping o: L>L’ ts a linear map- 
ping. If L=L’, K 2 K' (and p is an automor- 
phism), ¢ is called a semilinear transformation 
relative to p. 
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For semilinear mappings (9, p): LO L', K » 
K' and (o, p): L' 2L", K' S K", where L” 
is a linear space over K", the composite 
(o' oo, p' o p) is also a semilinear mapping. If a 
basis (e,, ...,e,) for L over K and a basis 
(ei, .... ej.) for L’ over K' are given, a semi- 
linear mapping (e, p): L5 L', K 5 K' deter- 
mines a matrix (a,) by the relation o(e) — 

XL ejas (1 <j <n). Conversely, a homomor- 
phism p and an n' x n matrix A —(o,j) deter- 
mine a semilinear mapping ¢ by this relation. 
Hence for fixed bases a semilinear mapping is 
represented by a pair (A, p), where A is an n' 

x n matrix. If a semilinear mapping o relative 
to p' is represented by (A’, p^), the composite 
oi 0, p' o p) is represented by (A'A^,, p' o p), 
where A? is the matrix (p’(«,;)). 

Let o: LL be a semilinear transformation 
relative to an automorphism p: K >K. Sup- 
pose that q is represented by (A, p) relative to 
a basis (e,, ..., e,) for Land by (B, p) relative to 
another basis ( f,, ..., Ja). If we define a matrix 
P —(pj) by the relation f; 2 Zi-,e;p;j (1 <j <n), 
we have B= P^! AP?. Two pairs (A, p), (B, p) 
having the relation B= P '! AP? are said to be 
similar. 


Q. Sesquilinear Forms 


Let K be a field (not necessarily commutative) 
and J be its tantiautomorphism. For left linear 
spaces M, N over K, a mapping D: M x NOK 
Is called a (right) sesquilinear form relative to J 
if the following four conditions are satisfied: (i) 
is +x’, y) 2 46x, y) + (x , y) (i) B(x, y + y) 

— De, y) + 4 (x, y’); (iii) (ux, y) - obt, yj; and 
(iv) D(x, xy) 2 (x, y)a? (x, xeM; y, y e N; 

x€ K). If J is the identity automorphism, 

then K is necessarily commutative and is a 
bilinear form (— Section H). As an example of 
K and J, we may take K as the field of com- 
plex numbers and J as complex conjugation. 
In general, for a left linear space E over K, we 
denote by E" the right linear space with the 
scalar multiplication x4—4" ‘x (xe E, Ae K). 
Then condition (iv) becomes (iv’) B(x, ya) 

— (x, y)o; and if K is commutative, D is a 
bilinear form on M x N”. For a sesquilinear 
form ® on M x N, we have the linear map- 
pings dg: N' MS. s: MI — N* defined by 
the relation ®(x, y)= (x, dg(y)» = <y, sg)?" 
(xe M, ye N). If M, N are finite-dimensional, 
dg, Sp have the same rank, which is called the 
rank of o. We assume that all linear spaces are 
finite-dimensional. 

Let (x4, ..., Xm) (än, Yn) be bases of M, N 
and (xt, ..., x5), (y, ..., y*) be their dual bases. 
Then we have del yj) = A: xf D(x; yj), so(x;) 
=D", yF O(x;, yj) '. The matrix (@(x;, y;)) 
is called the matrix of the sesquilinear form 
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® relative to the given bases; its rank is equal 
to the rank of ®. If dg, sg are both injective 
(therefore bijective), is said to be nondegen- 
erate. Let D': M' x N'SK be another sesqui- 
linear form relative to J. Then for any linear 
mapping u: M — M', there exists a unique 
linear mapping u*: NN such that ®’(u(x), y’) 
= D(x, u*(y’)) (xe M, y' e Ny; this is called the 
left-adjoint linear mapping of u. Similarly, 
for any linear mapping v: NN’, there 
exists a unique linear mapping v*: Mi AM 
such that d'G, v( y)) 2 S(v*(x), y) (x e M', 
ye Ny; this is called the right-adjoint linear 
mapping of v. We have u* =dg! o'uo da, v* 
=S OVO Sa. In particular, let u, v be isomor- 
phisms. Then we have ®(x, y) 2 ®'(u(x), v( y)) 
(xeM, yeN)if and only if u`! 2v*, v ! =u*. 
A sesquilinear form on M x M is called 
simply a sesquilinear form on M. Let J be an 
*involution (namely J =J~'), and write 47 = 
4 (àe K). If condition (v) (x, y)=®(y, x) 
(x, ye M) holds, ® is called a Hermitian form 
on M. On the other hand, if the condition (v) 
(x, y) — dX y, x) holds, is called an anti- 
Hermitian form (or skew-Hermitian form) on 
M. In particular, if J = 1x, then a Hermitian 
form (anti-Hermitian form) is a symmetric 
bilinear form (antisymmetric bilinear form). A 
linear space M supplied with a nondegenerate 
Hermitian form 6 is called a Hermitian linear 
space, and (x, y) is called the Hermitian inner 
product (or simply inner product) of x, ye M. 
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A. General Remarks 


A field k that is tcomplete with respect to a 
*discrete valuation is called a local field if its 
field of residue classes is finite. (Real and com- 
plex number fields are sometimes also called 
local fields; these, however, are not considered 
in this article.) A local field k is isomorphic 
either to the tcompletion with respect to a tp- 
adic valuation determined by a prime ideal p 
of a number field of finite degree or to the field 
of ‘formal power series of one variable over a 
finite field. In the former case, k is called a p- 
adic number field. We let o stand for the tvalu- 
ation ring of k, p stand for the tvaluation ideal 
of k, p stand for the tcharacteristic of the field 
0/p of residue classes, and N(p) stand for the 
number of the elements of o/p. An tadditive 
valuation of k whose set of values coincides 
with the set of all rational integers, is denoted 
by ord x (x ek); here we understand ord0= oo. 
The *normal (multiplicative) valuation of k is 
defined by || 2 (N(p)) ?'9* (— 439 Valuations). 


B. Construction of Local Fields 


À p-adic number field k is an extension of 
finite degree of the p-adic field Q,. If n= 
[k:Q,]=ef, where e is the *ramification in- 
dex of k/Q, and f is the trelative degree of 
k/Q, (— Section D), then there exists one and 
only one field F such that k> F2Q,, [k:F] — 
e, [F:Q,] — f. and F/Q, is tunramified. The 
field x of residue classes of F is isomorphic to 
GF(p/^), and F is uniquely determined by « by 
means of Witt vectors (— 449 Witt Vectors). 
Every residue class (0) of oz/p zx contains 
one and only one mth root of unity (m is a 
divisor of p/ — 1), and F is obtained by adjoin- 
ing to Q, a primitive (p^ — 1)th root. Then k is 
a totally ramified extension of F and is ob- 
tained by adjoining to F a root of an tEisen- 
stein polynomial (— 337 Polynomials F). 


C. The Topology of k 


Taking p" (m —0,1,2,...) as a thase for a 
neighborhood system of 0, k becomes a tlo- 
cally compact *totally disconnected *topolog- 
ical field, and p" (m —0, 1,2, ...) are compact 
subgroups of the additive group k. The multi- 
plicative group k* of nonzero elements of k is 
a locally compact tAbelian group, and the u™ 
= (xeo|xz1(modp")) (n—0,1,2,...) forma 
base for the neighborhood system of 1. The 
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tcharacter group in the sense of Pontryagin of 
the additive group k is isomorphic to k. This 
isomorphism is obtained by the following 
natural correspondence: For a p-adic field k, 
denote by o the composition of the natural 
mapping of Q, onto Q,/Z, (x Z[1/p]/Zc 
Q/Z) and the ftrace Tr from k to Q,, and 

put x y) =exp(2n,/—1 p(xy)) (yek); for the 
field k of the formal power series over a finite 
field x, put y,(y) — (xy) (yek) with (a) — 
exp / —1: Tr(Resa)/p) (xek), where Resc 
is the residue of xek and Tr is the trace from x 
to Z/pZ. Then in both cases y, is a character 
of k, and x 5 y, gives an isomorphism between 
k and the character group. 


D. Ramification Theory 


A valuation v of k has a unique tprolongation 
to an extension K of finite degree over k (we 
denote the prolongation also by v). K is com- 
plete under the valuation and is therefore a 
local field. Denoting by f and x the field of 
residue classes of K and k, respectively, we call 
[f: x] — f the relative degree of K/k, and e = 
[v(K *):v(k")] the ramification index of K/k. 
Then we have the equality [K:k]=ef. Ife=1, 
we call K/k an unramified extension. 

An unramified extension K/k is tnormal, 
and its Galois group is a cyclic group gen- 
erated by the Frobenius automorphism, i.e., 
the element c of the Galois group of K/k such 
that a? z x"? (mod p) for any element « in the 
valuation ring of K. For a given natural num- 
ber f, there exists one and only one unramified 
extension of degree f over k in an algebraic 
closure of k. 

Let K/k be a normal extension of finite 
degree with Galois group G, let /7 be a tprime 
element of K, that is, a generator of the tvalu- 
ation ideal P of K, and put V? — (ceG|II^ = 
II (mod $'*")}. Then V is independent of the 
choice of 77. We call V the inertia group and 
V9? the ith ramification group. Then V“ is 
normal in G, [G: V®] =f, [V9 : 1] =e, and 
V™ is the p-Sylow subgroup of V. Further- 
more, G/V® and V/V are cyclic, and 
y0/yG6*U (i2 1,2,...) is an Abelian group of 
type (p. p, ..., p). Ramification theory for Abel- 
ian extensions is described in Section F. 

For xek, the series exp(x)= 2272.9 x"/n! (resp. 
log(1-- x) 2 32,(—1)' ! x"/n) converges for 
ord x > e/(p — 1), where e is the ramification 
index of k/Q, (resp. ord x » 0). The additive 
group p" and the multiplicative group um 
(m e/(p — 1)) are isomorphic as topological 
groups under the mappings x > y —exp(x), y 
x =log(y). If we fix an element x ek with 
ord z — 1, then an arbitrary element xek with 
ord x «r is uniquely expressed in the form x 


257 E 
Local Fields 


=n'Ca, CP 1 —1, xeu(?, The group u is a 
multiplicative group on which Z, operates, 
and the structure of u” as a Z,-group can be 
determined explicitly (— [2, ch. II]). 


E. Cohomology 


For a normal extension K/k with Galois group 
G, we may consider the ‘cohomology groups 
H'(G, K") (r2 1,2,...) of G operating on the 
multiplicative group K *. In particular, the 2- 
cohomology group H?(G, K *) is important in 
local class field theory and theory of algebras 
over k. 

If C is the separable algebraic closure of k, 
i.e., the *maximal separable field over k in the 
talgebraic closure of k, and I is the Galois 
group of C/k, we can consider the 2-cohom- 
ology group H?(TI, C*). Here we take as 
cocycles only those mappings f(o,1) of Lx I 
into C* that are continuous with respect to 
the *Krull topology of T and the discrete top- 
ology of C". A fundamental theorem about 
the structure of H? (T, C*) states that the 
cocycles of H?(T, C*) that split in an extension 
K/k of degree n are exactly those cocycles that 
split in the unramified extension of degree n 
over k. Here a cocycle is said to split in K if it 
belongs to the *kernel of the homomorphism 
H^(T, CHILD, C"), where H is the sub- 
group of T corresponding to K, and res is the 
mapping obtained by restricting o, t of f(o, 7) 
to the elements of H. 

Let K/k be a normal extension of degree 
n with the Galois group G, and let H be the 
subgroup of T corresponding to K. Then the 
fundamental theorem combined with the exact 
sequence 


0H*(GK")H*(I,C*)5H(H,C*) (1) 


known in the theory of Galois cohomology 
(— 59 Class Field Theory H. 200 Homological 
Algebra I), yields H?(G, K ")z Z/nZ (cyclic 
group of order n), and H^(I, C") z Q/Z, where 
Q is the additive group of rational numbers 
and Z is the additive group of rational in- 
tegers. These isomorphisms can be given in 
canonical form as follows: Denote the unrami- 
fied extension of degree n by K,/k, and the 
Frobenius automorphism of K,/k by c. Then 
an element c of H?(G,, K7') (where G, is the 
Galois group of K,,/k) is represented by the 
cocycle 


f (s, oJ) af Dim -lim-Ul i jez, 
with aek*, and conversely, every ack” deter- 
mines an element c of H?(G,, K;') in this 
manner. Under these conditions, the corre- 
spondence between c and a gives rise to an 
isomorphism H?(G,, Kj) =k” IN 4(K; ). 
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Next define an element invc of Q/Z by invc= 
(ord a)/n (mod Z). Then since ce H?(T, C") 
splits in an unramified extension of degree n by 
the fundamental theorem, the exact sequence 
(1) determines in a natural way an element c’ 
of H?(G,, K,") corresponding to c. Putting inve 
— invc', we can show that H?(T, C*)sc—invc 
gives an isomorphism H?(L, C*)z Q/Z. We 
call invc the invariant of ce H?(L, C*). The 
invariant of an element of the cohomology 
group H?(G, K *) is defined to be the invariant 
of the corresponding element of H?(I, C *), 
which is determined by the exact sequence (1). 
Mapping an element of H?(G, K *) to its invar- 
iant, we obtain an isomorphism H?(G, K *)= 
Z/nZ. 


F. Local Class Field Theory 


Let K/k be a normal extension of degree n 
with the Galois group G, let f(a, t) be a cocycle 
representing the element of H*(G, K *) with the 
invariant Lin, and put 


-yeo 


0 reG 


K/k 
Then (E) gives an isomorphism be- 
o 


tween G/G' (where G' is the commutator sub- 
group of G) and k*/Nx,(K "). It follows from 
this that [k" : N£,(E")] & [E:k] for any exten- 
sion E/k of finite degree, and the equality holds 
if and only if E/k is Abelian. The inverse map- 


: K/k 
ping of Gao | — Jek*/Nx,(K ")for an 
o 


Abelian extension K/k is written as k* 3x 
(a, K/k)e G, and (x, K/k) is called the norm- 
residue symbol. If L/k is Abelian and K/k is a 
subfield of L, then the restriction of (x, L/k) to 
K coincides with (x, K/k). Let k, be the max- 
imal Abelian extension of k, 1.e., the union of 
all Abelian extensions of finite degree over k. 
Then for any «¢k*, an element (o, k) of the 
Galois group G(k,/k) of k,/k is uniquely deter- 
mined by (a, k)(y)=(a, k(y)/K)O), » e ka. The 
mapping k * 3a(a, k)e G(k,/k) is a one-to-one 
continuous homomorphism, and the image is 
tdense in G(k,/k). It has also been proved that 
there exists one and only one Abelian exten- 
sion K/k with Ny,(K”“)=A for any given 
closed subgroup A of finite index of k *. There- 
fore closed subgroups A of finite index of k* 
are in one-to-one correspondence with finite 
Abelian extensions K of k through the relation 
A — Ny 4 (K *), and in this case A is called the 
subgroup of k* corresponding to K/k. 

Let K/k be an Abelian extension of finite 
degree, A be its corresponding subgroup of 
kr. and V9? (i=0, 1,2,...) be ramification 
groups of K/k. Furthermore, define constants 
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ye ymo. = Vyts., 
— Kid S.S ie me o ye) 
D tl) 

Vis 


denote the order of V**P by n; (i2 1,2, ...,r), 
and put u, — vo + (no/no)(v, — vo) + --- + 
(n,-1/No)(V,—v,-1), P= 1,2, ..., r (here we 
understand vo = —1, ny 2 [V9 : 1]). Then 
U,,...,u, are rational integers, and we have 


Au =... = Au^ Auen le 
— Au 2 2 ,., 2 Aut A 


(H. Hasse). If m is the smallest integer with A 
=u), then p" is called the conductor of K/k. 
The above results of Hasse show that m— 

u, +1. On the other hand, it is known that the 
correspondence between Au?) and Viel (p= 
1,2, ...,r)is given by the norm-residue symbol 
(a, K/k) (- 59 Class Field Theory). 


G. Theory of Algebras 


By the general theory of *crossed products of 
algebras, the structure of the "Brauer group 
formed by the classes of *normal simple alge- 
bras over a local field k is obtained directly 
from results concerning cohomology (— Sec- 
tion F). Namely, a *normal simple algebra 
over k splits over a separable extension of 
degree n if and only if it splits over the unrami- 
fied extension of degree n, and the Brauer 
group of k is isomorphic to Q/Z. Furthermore, 
the *exponent (the order as an element of the 
Brauer group) of a normal simple algebra dt 
over k coincides with the *Schur index, and if 
[3: k] « n?, then N is expressed as a crossed 
product with respect to any normal extension 
of degree n over k. The invariant of the factor 
set (2-cocycle) that appears in this crossed 
product expression is called the invariant of 9I 
(— 29 Associative Algebras; for the properties 
of d as a topological ring and as a topological 
group of the group of invertible elements of 9I 
— 6 Adeles and Ideles). 


H. Explicit Formulas 


Let K/k be an Abelian extension of finite de- 
gree. When we have an explicit formula for the 
norm-residue symbol (x, K/k), we say that we 
have an explicit reciprocity law. 

Let k be a p-adic number field containing a 
primitive mth root £, of unity, and let D be an 
element in k*. Let p be a prime number con- 
tained in p. Since the Kummer extension K — 
k(X/ fl) over k is Abelian, the Hilbert norm- 
residue symbol (a, f),, is defined by 
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(a, K/k) QV B) =(ct, B), B, where (x, f),, is an 
mth root of unity. In this case, the problem of 
obtaining an explicit reciprocity law is solved 
if we can express the symbol (o, DL, in terms of 
a, f and suitable parameters depending on the 
ground field k. 

In particular, if p 22, m—2, we have the 
simple formula given by Hasse, (x, f); = 
(—1)7*7007179. where o, f are two units in k 
satisfying xz fi = 1 (mod 2) and Tr is the trace 
from k to Q,. Similar formulas for the comple- 
mentary laws are also known [10]. 

On the other hand, if m — p is an odd prime 
and k=Q,(¢,), we have the following formulas 
for a prime element 4, — 1 — £, and two units 
a, B satisfying x «1 (mod p°), B=1 (mod p): 


(a, f), op nra ia, Q) 
(EL), meo O A, (3) 
E p) Se E EE (4) 


where Dlog fi — (1/8) 377, ib; ! , while the b; 
are determined by the 4,-expansion f = 
Eob; bie Z, [5]. Furthermore, we have 
an explicit Kummer-Hilbert formula deduced 
from (2) in terms of Kummer's logarithmic 
differential quotients [8, 10]. Concerning the 
complementary laws (3), (4), the following 
Artin-Hasse formulas are known for k 


— Q,(£,.) and f. «1 (mod p): 
(C ze B) yn = EE NEE (3) 
Le B) on E o P PEERS UMORE EN 


where åpn = 1 — Can [9]. Utilizing these for- 
mulas, K. Iwasawa obtained a formula for 
(a, B),, that is a natural generalization of (2) 
[13]. A. Wiles has shown that a generalization 
of the Artin-Hasse-Iwasawa formulas can be 
obtained in terms of the Lubin-Tate formal 
groups (Ann. Math., (2) 107 (1978)). 
Concerning (x, f),,, we have Shafarevich's 
reciprocity law [11]. To explain it, let ky be the 
inertia field in k, i.e., the maximal subfield in k 
that is unramified over Q,. For an arbitrary 
integer a in ky, we consider the Artin-Hasse 
function E(a, x) 2 exp( — L(a, x)), where L(a, x) 
— YE (a/p')x” with the *Frobenius auto- 
morphism c of kal, We choose a prime 
element z in ko such that C, — E(1, 7t) and an 
integer à in a suitable unramified extension 
field of ky such that a^ — a =a for a given 
integer a in ky. Given any p"-primary element 
x in k (i.e., K(x') is unramified over k), there 
exists an integer a in ky such that x = E(a) 
= E(p"a, ï), where x zy (x, yek*) means x 
=y- u for an element u in k*?", Furthermore, 
if 6 is an element of k*, we have the canonical 
decomposition formula 
dan Ei [| E(dj n^ 


Ix j«eop 
Linz 
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where z is a prime element in k, d* €Z, d, d; are 
integers in ky, and eg — e/(p — 1) with the ramifi- 
cation index e of k. The explicit formula due to 
Shafarevich and Hasse is expressed as 


(2, Baar 


where Tr, is the trace from k, to Q, and a, a* 
(resp. b, b*) are determined by the canonical 
decompositions of « (resp. f) as stated above. 
The element c in ky is determined by 

[]  Eüab,z -—-ymE(o) [| Ele; x4) 

l<ij<eop 1<j<eop 

(Gp) (4 py-1 (j,p)=1 
for odd p, and by 
(IN I] E(ia,b, nit) 


ij 
I1 <ij<2e, 
(,2)-(,2)-1 


x [[ E2 + ja" ag" b? n” +") 
u v=1 
=yxE(c) [| Pie, zi 
ISj<2eo 
27-1 
for p — 2. Several formulas for the case where k 
is a function field are also known [1, 5]. 

Let k be a general local field. When K is an 
extension field of k obtained by adjoining z"- 
division points {A} of the Lubin-Tate formal 
group F defined over the integer ring of k, the 
extension K/k is totally ramified and Abelian, 
and we have an explicit formula: (u !, K/k)A 
=[u]-(A), where u is a unit in k and [u], is an 
endomorphism of F corresponding to the unit 
u. In particular, this formula implies the cyclo- 
tomic reciprocity law [12]. 

The problem of obtaining an explicit re- 
ciprocity law arises in the problem of obtain- 
ing the reciprocity law for power residues from 
the law of reciprocity in global class field 
theory. The problem is closely connected with 
T. Kubota’s recent results (e.g. [14]), which 
clarify the analytic meaning of the reciprocity 
law in algebraic number fields. 
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A. Lorentz Group 


(1) Minkowski space. A 4-dimensional real 
vector space M with an indefinite inner prod- 
uct between two vectors x and y defined by 


x:yexG'y o x? yo — x! y! — x2 y? ici, 


1 0 0 0 
Que. e^ do 
Hin nu 0]7(& 


0 0 0 —1 


is called a Minkowski space. A vector xe M is 
classified according to the signs of x: x and x? 
as follows. 


x:x»0,x?»0: future timelike 

timelike, 
x:x»0, vlt past timelike 
x:x-0,x?»0: future lightlike 
x:x=0,x°=0: origin lightlike, 
x:x—0, x? «0: past lightlike 


x:x<0: spacelike. 
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The set of all future (or past) timelike vectors is 
called the future (or past) cone and is denoted 
by V, (or V_). The set of lightbke vectors is 
called the light cone. The set of spacelike vec- 
tors is called the side cone. 

Minkowski space is used in the tspecial 
theory of relativity (with units such that the 
velocity of light is 1), where ((x — y): (x — y)? 
is called the proper time between two mutually 
timelike events x and y. 

(2) Inhomogeneous Lorentz group. The 
group of all one-to-one mappings of the Min- 
kowski space onto itself preserving the proper 
time between any timelike pair of points is 
called the full inhomogeneous Lorentz group or 
Poincaré group and is denoted by 2. An ele- 
ment g of F is a linear transformation 


3 
(gx) =}, Ax" a^ (u209,,,2,3), 
v=0 


where ae M and A — (A7) is any real matrix 
satisfying AG(‘A)=G. We write g —(a, A) and 
gx - Ax -- a. Then 


(a, ^1) (a5, ^5) 2 (a, - A1a5, A, Ay). 


The normal subgroup of 2 consisting of 
(a, 1), ae M, is called the translation group on 
M. The subgroup of 22 consisting of (0, A) is 
called the full homogeneous Lorentz group and 
is denoted by Z or (1,3). It consists of the 
following four connected components: 


£l ={AeL|detA=1, AV, =V,}, 
£i (Ae |detA- Ll, AV, SV. 
Li (Ae |detA-2 —L AV, =V;}, 
L! -(Ae.Z|detA- —L AV, =V}. 


The identity mapping, the space-time inversion 
x — x, the space inversion x?— x?, x'2 — x' 
(i — 1,2,3), and the time reversal x° — x°, 
x! x! are their typical elements, respectively. 
If det A= 1, Ae Z is called proper. If AV, = 
V,, AEL is called orthochronous. The identity 
component f is called the restricted homoge- 
neous Lorentz group. (This usage in mathe- 
matical physics may differ from the general 
terminology for O(m, n), in which the identity 
component is called proper.) 

The same terminology is used for 2, which 
also consists of four components P}, AL, 9, 
and 2!. 

(3) Universal covering group. For xe M, let 
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where 6 —(0,,05,03) is called the Pauli spin 
matrix. For any 2 x 2 matrix A with determi- 
nant 1 (Le., AeSL(2, C)), 


AXA*=%,, — x4—A(A)x, 


defines A(A)e.Z , and A2 A(A) is a two-to- 
one mapping from SL(2, C) onto Z1. By this 
covering mapping, SL(2, C) becomes the uni- 
versal covering group for Z1. 

If A is unitary (i.e, AeSU(2)), A(A) leaves x? 
invariant, and SU(2) is the universal covering 
group of OO). (the 3-dimensional proper 
orthogonal group or proper rotation group). 

For a complex vector ze M +iM, 


A2'B-—Z£,p  z4g-A(A,B)z 


defines a complex Lorentz transformation 
A(A, B), and (A, B) — A(A, B) gives a covering 
mapping from SL(2, C) x SL(2, C) onto the 
proper complex Lorentz group (C), , consist- 
ing of complex 4 x 4 matrices A satisfying 
AG(‘A)=G and det A — I. 


B. Finite-Dimensional Representations 


(1) Z1. Any continuous representation of 
SL(2, C) on a finite-dimensional complex 
vector space is a direct sum of irreducible 
representations. 

Let E =C? be the 2-dimensional complex 
vector space on which SL(2, C) is acting. Let a 
representation ny , of SL(2, C) on the (k +n)- 
fold tensor product E®**” be 


n, n( A) = A9* G) (A)®”, 


where A is the complex conjugate of A. A 
vector č in this representation space is called a 
mixed spinor of rank (k, n) and its components 
are written as £^r-^/-^ with undotted indices 
A,,...,A, and dotted indices ji, ... 0. all taking 
values 1 or 2. If n=0 (or k—0), it is called an 
undotted (or dotted) spinor of rank k (or n). A 
spinor with upper indices, called a contra- 
variant spinor, can be converted to a spinor 
with lower indices (partially or totally), called 
a covariant spinor, by 


0 1 
rail 1 3 


For example, 

Ay fy) YW EAS 

Š S ass D6 ` "` Ek Egi, 
KN 


The restriction of n, „ to the subspace of 
spinors which are invariant under permuta- 
tions of undotted indices and under those of 
dotted indices is an irreducible representation 
of dimension (k+ 1) (n + 1), which is denoted by 
[k,n]. The set of [k,n], k=0,1,...,n=0,1,..., 
exhausts all finite-dimensional irreducible 
representations of SL(2, C). 
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The tensor product of two irreducible repre- 
sentations is decomposed as follows: 


[k,,n,] @[k2,n2] -y9 [k,n], 


where [k,n] appears with multiplicity 1 for k, n 
satisfying k; +k,>k>|k,—k,|,n,+n,2n> 
In, —n,|. The complex conjugate representa- 
tion of [k,n] is [n, k]. 

For xe M, x is a mixed spinor of rank (1, 1). 

(2) SU(2). The restriction of [k, 0] (k= 
0,1,...,) to the subgroup SU(2) is an irreduc- 
ible representation (called the spinor represen- 
tation of rank k) and exhausts all irreducible 
representations of SU(2). [0,n] is equivalent to 
[n, 0] for SU(2) due to A =eAe! for Ae SU(2). 
[k,n] for SU(2) is equivalent to 


k*n 
[4,0] & [1,0] = D [r, 0]. 

r-|k-n 
An explicit coefficient in this decomposition is 
called a *Clebsch-Gordan coefficient. 

Any representation of OO). is a representa- 
tion of SU(2). The irreducible representation 
of SU (2) with rank r is an irreducible repre- 
sentation of (3), if r is even. It is an irreduc- 
ible double-valued representation of (3), if r 
is odd. Likewise, the irreducible representa- 
tion of SL(2, C) with rank (k,n) is an irreduc- 
ible representation of Z! if k 4- n is even and 
double-valued if k -- n is odd. 

(3) ! Z1 U Z! . The space inversion P 
induces an automorphism of 27 by A— PAP, 
that of SL(2, C) by A>eAe 1 and the asso- 
ciated mapping U(4) V(A) — U(eAe“!) of 
irreducible representations [Kk, n] [n, k]. Thus 
an irreducible representation of .7! is obtained 
on [k,n] @ [n, Kk] if nz k, and two inequivalent 
irreducible representations on [n, n] corre- 
sponding to two choices (+1) for the operator 
representing P. A vector in [k,n] ® [n, k] is 
called a bispinor of rank (k,n). The wave func- 
tion of the Dirac equation is a bispinor of rank 
(1, 0). 


C. Unitary Representations 


(1) Invariance in quantum mechanics. In the 
special theory of relativity, each ge P} repre- 
sents a symmetry of the system, namely, a 
transformation of states y of the system to 
states gy such that “physical relations” be- 
tween two states y, and y, should be the same 
as those between gy, and gy,. In quantum 
mechanics, a (pure) state is represented by 

a unit ray De, a set of vectors e'^P, 0 0 — 
2r for a unit vector V) in a Hilbert space, 
and |(V,, V,)|? is supposed to be an obser- 
vable quantity, called the transition prob- 
ability. Thus the special theory of relativity in 
quantum-mechanical situation implies a con- 
tinuous representation of ZI by bijective map- 
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pings of unit rays in a Hilbert space preserv- 
ing the transition probability. By the Wigner 
theorem, such a mapping is implemented 

by either a unitary or antiunitary operator 
U(g) as a mapping Jeff) (e ?U (gw) (— 
e.g., [12]). In order for this to be a group 

of transformations of unit rays, U should 

be a projective representation: U(g,)U(g;) — 
u(g,. 93) U(g193), where u(g,. g;) is a 2- 
cocycle with a complex value of modulus 1. 
In the case of A], each U(g) has to be uni- 
tary (true for any connected Lie group), and 
there is a choice of U (a, A) from the unitary 
ray {e®U (a, ^(4))|O € 0 « 2z] so as to make 
U(a, A) (ae M, AeSL(2, C)) a continuous uni- 
tary representation of the universal covering 
group A! — ((a, AJ]ae M, AeSLQ, C)] of A 
[2]. 

(2) A]. Any continuous unitary represen- 
tation of Ž on a separable Hilbert space is a 
direct integral of continuous irreducible uni- 
tary representations of 28. (called irreducible 
representations in the following). Any con- 
tinuous unitary representation of the trans- 
lation group {(a, 1)| ae Mj, the group being 
commutative, is of the form 


(U (a, 1)¥)(p)=e" rn, ve | Hina 


The parameter pe M is called the energy- 
momentum 4-vector or 4-momentum. In an 
irreducible representation of 2| , the measure 
dy is equivalent to an invariant measure on 
an orbit of the conjugacy action g(a, 1)g ! = 
(A(A)a, 1), g — (b, A)e 2, i.e., on one of the 
following orbits. 


m,:p'mp:p-m^ (m>0), +p°>0, 
O,:p?=0, +p°>0, 

0: p=0, 

im: p?>=—m?  (m»0) 


The parameter (p?)'? is called the mass. 

For each orbit the subgroup of all Ae 
SL(2, C) that do not move a predetermined 
point q, on the orbit å, is called the little group 
(of 4 or of g,). Up to isomorphism, the little 
group contains the following. 
m ,: SU(2), consisting of all unitary Ae 
SL(2, C), which leaves Im, =(£m, 9, 0, 0) 
invartant. 
0,: The 2-fold covering group of the 2- 
dimensional Euclidean group, consisting of all 

e? Tið, 


ane E | zeC, eR 
e 


(qo , =(+1,0,0, +1)), with the product 


A(0, 2) A(05, 25) — A(0, + 92,2, c e? z3). 


958 


0: SLQ, C). 
im: SL(2, C) consisting of all real 4e SL(2, C), 
which leaves d — (0, 0, m, 0) invariant. 

Any irreducible representation of 2 is 
uniquely determined (up to unitary equiva- 
lence) by an orbit A and an irreducible repre- 
sentation D of the little group G, c SL(2, C) on 
a Hilbert space K as follows (as an *induced 
representation): 


T = [winaumen = | Hindu 


[U(a, A)'V](p) =e "D(R(A, p) W(A(A) ! p), 
R(A, p) - L(p) ' AL(A(A "int 


where H(p)— K, du(p) is a measure (unique up 
to a multiplicative constant) on 4, invariant 
under the action of “| on A(p2Ap, Ae Z1), 
L(p) is a fixed element of SL(2, C) for each pe å 
such that A(L(p))q; =p, and R(A, p) is called 
the Wigner rotation. 

(3) The orbit m, . The irreducible represen- 
tation with the orbit m, and the irreducible 
representation of SU(2) with rank 2j (j— 

0, 1/2, 1, ...) is denoted by Im, ,j], where j is 
called the spin. It is given by 


[U(a, A) ](p) — e'"* AP? P(A(A) Tei 


(Y,D)= | (‘¥(p), (m/+ p)®7/(p)) 
x (x: 2p9)! dp! dp? dp’, 


where Wine C?! for each p and p is restricted 
to the orbit m,. 

(4) The orbit 0. Irreducible representations 
of the little group can be classified by the orbit 
in the spectrum of the normal Abelian sub- 
group consisting of A(0, z), ze C. Since the 
spectrum is a plane and A(0, z) acts as a rota- 
tion by an angle 26, an orbit is a circle of 
radius p. The case p=0 is further classified 
completely by the spin j 20, +1/2, +1,... 
(used for the description of a massless particle 
of spin | j| and helicity o =sign j). It is denoted 
by [0,, j] and is given by 


[U (a, AV] (p) =e *(A*)9 ? W(A(A) ! p), 


(P, d)- [o (x: p7)°*@(p)) 
X (x 2p9) ! dp! dp? dp’, 


where A" — A or A and p*=e'pe or p de- 
pending on c — 1 or —1, and ‘P(p) belongs to 
the quotient of C" and the subspace consisting 
of all y with (y, (p7)9?7y) 2 0 (the quotient is of 
one dimension). 

The case p £0 (called continuous spin) is 
completely classified by p and U(0, —1)— +1. 
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The representation D of the little group for 
p #0 is given by 
[D(A(z, 0) '] (9) 
—expi(pRee '"?z 4- k0) P (o — 20), 
where k=0 or 1 (accordingly U(0, —1)—1 or 
—1), peR mod2z and V e L;([0, 27], dq). 
(5) The orbit 0. Irreducible representations 
of the little group SL(2, C) are as follows: The 


principal series with two parameters —oo « 
p « o and m —0, +1,... are given by 


[D(A)f ](z) — (bz + d)"|bz + d|? " ?f('Az), 
A= : 
c d 


where f is an L,-function of z 2 x - iy with 
respect to dx dy. The supplementary series with 
a parameter 0 « p « 1 is given by 


[D(A)f ](z) » |bz - d|*^"* f Az), 


"Az =(az+c)/(bz +4), 


usta [5e 
x [z, —z2| ^" ?? dx, dx, dy, dy. 


Together with the identity representation D(A) 
= ], they exhaust all possibilities up to unitary 
equivalence. 

(6) The orbit im. Irreducible representations 
of the little group SL(2, R) are as follows: The 
principal continuous series with parameters 
o=0, 1, p20 for «=O and p»0foro-1 are 
given by 


[D(A)f] (x)= f((Ax)|bx + dl? (sign(bx + d))’, 


where f € L;(( —oo, oo), dx). The supplementary 
series with a parameter 0<p <1 are given by 


[D(4)/]C9 — |bx - d| ! ^f('Ax), 
ufa | [risate xp tse dx. 


The principal discrete series with a parameter 
n —1,2,3,... are given by 


[D(A)f ](z) - (bz +4) "f( Az), 


ids | Ve SEAN) 


—90 


usta [ 


where f is holomorphic in either the upper or 
lower half-plane, (so that the y-integration is 
over (0, 00) or (—00,0) accordingly), z = x + 

iy, and the two choices give rise to two in- 
equivalent representations for each n. Together 
with the identity representation D(A) = 1, 

they exhaust all possibilities up to unitary 
equivalence. 
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D. Infinitesimal Generators 


In any continuous unitary representation U of 
A, 

( "mr *(U(())— 1) 

(defined on the dense set of all those vectors on 
which the limit exist) for any one-parameter 
subgroup y(t) of A! is a self-adjoint operator 
called the infinitesimal generator for y(t). If 
y(t) =(ta, 1) (ae M), the generator is P a, where 
the components P^, P', P?. P? of P mutually 
commute and are called energy-momentum 
operators (or 4-momentum operators). If y(t)= 
(0, exp(it o,/2)), k — 1, 2, 3 (the rotation by an 
angle —t around the kth axis), the generator is 
written as J* — M?' = — M" ((jIk) is a cyclic 
permutation of (123)) and is called the angular 
momentum operator. If y(t) 2 (0, exp(to,/2)), 

k — 1, 2, 3 (the pure Lorentz transformation 
along the kth axis with a velocity tanh t), it is 
written as N* 2 M° = — M*9, and MIT (u, v= 
0, 1,2, 3) is called the angular momentum 
tensor. The value of the scalar P-P is the 
square of the mass and, when P. P «0, the 
alternatives of P? being positive or negative 
definite or zero give the invariants of irreduc- 
ible representations, distinguishing different 
orbits. The angular momentum in the center of 
mass coordinates, 


w*z2 yg" "p, My, 


gives another invariant w- w, called the Pauli- 
Lubanski vector, where the quantity with lower 
indices are obtained from those with upper 
indices by contraction with the Minkowski 
metric g, e.g., P, =$, P"g,,; € is totally anti- 
symmetric in its indices and ¢°!?3 — 1. If P- 

P >0, then w: w- —j(j- 1)P- P defines the 
spin j. In the representation with orbit 0, and 
p —0, w"=ojP* defines the spin j 2 0 and the 
helicity o = +1. 
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Magnetohydrodynamics 


Magnetohydrodynamics (also called hydromag- 
netics or magnetofluid dynamics) is concerned 
with the motion of an electrically conductive 
fluid in the presence of a magnetic field. An 
electromotive force (e.m.f.), induced by the 
motion of the fluid in the magnetic field, in 
turn induces an electric current that perturbs 
the original magnetic field. On the other hand, 
an electromagnetic force due to the magnetic 
field deforms the original motion. Many im- 
portant and interesting phenomena result from 
such interactions of the magnetic field and the 
motion of the fluid. 

In ordinary magnetohydrodynamics we 
assume that (i) the fluid is continuous, (ii) 
electric conductivity c is not negligible, and 
(iii) fluid velocity is small compared with the 
velocity of light, i.e., max(L?/(c^ T?), U?/c?) «« 
min(1, R,,), where L is a representative length, 
T a representative time, U a representative 
velocity, and R,, — cuU L (u is the magnetic 
permeability) is a nondimensional number 
called the magnetic Reynolds number. Jn this 
case we can neglect in the 'Maxwell equations 
the displacement and convective currents in 
comparison with the conductive current J, 
and write 


divB=0, rotH=J, rotE- —9oB/ét, (1) 


pe=diveE, (2) 


where e is the dielectric constant, and 
B=yH, J=o(E+v~xB). (3) 


Here the latter equation, Ohm's law for a 
moving medium, is valid when the effects of 
temperature gradient, the Hall effect, etc., are 
small. The motion of fluids (^ 205 Hydro- 
dynamics) is governed by the *equation of 
continuity 


Op/0t + div pv — 0, (4) 
the fequation of motion 
6(py)/dt = —div(py à v —P — T) (5) 


(P is the mechanical stress tensor, T is the 
Maxwell stress tensor T= u(H;H; —4H*0;)) or 


pDv/Dt=divP+K, K=JxB, (5') 


the tequation of state, and the *energy equa- 
tion. (From assumption (tii), the force p, E on 
the electric charge p, can be neglected com- 
pared with the force J x B on the electric cur- 
rent, and (2) can be separated from the other 
equations in order to determine p,.) © 

When u and o are uniform, we can eliminate 
E and J from (1) and Ohm's law to obtain the 








962 


induction equation 
0B/0t = rot(v x B)+ 2AB, (6) 


where A = grad div — rotrot, 4 = 1/(uc). This 

is of the same form as the equation éw/ét = 
rot(v x Q)-- v Ao (v is the kinematic viscosity) 
for the *vorticity œ of an ordinary fincom- 
pressible viscous fluid. We call 4— 1/(uo) the 
magnetic viscosity, and the ratio of the first 
term (the convection term) to the second one 
(the diffusion term) in the right-hand side of (6) 
is the magnetic Reynolds number R„ =U L/} = 
uc UL. R,,= oo corresponds to the tperfect- 
fluid case as o oo or L— oo. In this case, the 
magnetic flux moves with the fluid as if both 
were frozen together, as in the *Helmholtz 
theorem about vorticity. The existence of 

a transverse wave of velocity a =,/ uH?/p 
along magnetic lines of force in the fluid, 
owing to the tension JH? (T; except for the 
magnetic pressure —3uH?ó;) of the magnetic 
flux frozen to the fluid, was noted for the first 
time by H. Alfvén (1943), and this wave is 
called the Alfvén wave. In a compressible 
perfect fluid (LR, = oo, P; = — pó;, where p is 
the pressure), (1)-(5) reduce to a system of 
thyperbolic partial differential equations, 
yielding as tcharacteristic surfaces in addition 
to the pure Alfvén wave two magnetosonic 
waves of phase velocities 


1/2 
x (e +a? 4 (a? +8?) — 4a? a? cos? i) 


(0 is the angle between the magnetic field and 
the wave normal, a is the velocity of sound 
interfering with the Alfvén wave). We call a, 
anda. the fast wave and the slow wave, re- 
spectively. Hydromagnetic dynamo theory 
explains the generation and maintenance 

of the magnetic field inside the earth on the 
basis of (6). Applications are also made to 
cosmic problems and MHD generation of 
electricity. Mercury, liquid sodium, etc., can 
be used to verify the theoretical results. Extra- 
polation may be made to the plasma used in 
thermonuclear fusion to the extent that the 
hydrodynamic treatment is valid as a first 
approximation. 
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A. General Remarks 


We consider a random process ( X,] (tz 0 or 

t —0, 1,2, ...) that is governed by some prob- 
ability law. One of the most important is the 
process whose probability law of X, under the 
condition X, —a,,..., X, —a,(s, «s5 «...« 
S, <t) coincides with that under the condition 
X, —a,. This is called the Markov property, 
and a process with this property is called a 
Markov process (— 261 Markov Processes). If 
the process takes place in a finite or countable 
set S, it is called a Markov chain. 

A Markov process is specified by its ttran- 
sition probability (e, the system of the prob- 
ability laws of X, given X,— x for s« t) and 
initial distribution (= 261 Markov Processes). 
The transition probability of a Markov chain 
is denoted by p, ,(x, y), while that of a general 
Markov process is denoted by P(s, x, t, A). 
Before proceeding to a sophisticated definition 
of Markov chains, we give some examples. (a) 
Suppose that ¢,, ¢,,... are mutually tindepen- 
dent real trandom variables on a probability 
space (Q, B, P) and fi, f,,... are Borel measur- 
able functions from R? to R. The process {X,, 
which is defined by the recurrence relation 
Xo = 6o, Xn = f. (6,, X, ,) is a Markov process 
with its transition probability given by P(n— 
1, x, n, A) « P1 f, (5,, x)e A}. In particular, if 
the possible values of each £, are at most 
countable, | X,) becomes a Markov chain. It is 
*temporally homogeneous, i.e., p, ,.,(x, y) is 
independent of n, if the {€,' are tidentically 
distributed and f, = f, — .... Moreover, if each 
Cy Is integer-valued and f(x, y) =x - y, {X,} is 
a 1-dimensional random walk. (b) Ehrenfest 
model of diffusion. Suppose that N molecules 
are distributed in two containers A and B. At 
each trial a molecule is chosen at random and 
moved from its container to the other. Let 
X, be the number of molecules in A after the 
nth trial. Then {X,} is a temporally homo- 
geneous Markov chain with p, ,,,(i,i—1)— 
i/N, Panyi (i+ 1) 2(N - D/N for0O i x N 
Ip, 0 otherwise). (c) Population model. 
Consider a population in which there is no 
interaction among individuals. Suppose that 
during a time interval (t, t+ At] each individ- 
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ual gives birth to a new one with probability 
AAt + o(At) and dies with probability 4At + 
o{At). Let X, be the population size at time t. 
Then the process {X,} becomes a birth and 
death process, the most typical of Markov 
chains with continuous time parameter (— 
Section G). Numerous Markov chains with 
special structure are extensively studied in 
various fields of applications of probability 
theory such as queuing theory (— Section H), 
the theory of tbranching processes, the sto- 
chastic theory of *population genetics, and 
others [1, 3]. In this article, however, we are 
mainly concerned with the theoretical aspects 
of Markov chains. 

Hereafter, we consider only ttemporally 
homogeneous Markov chains, which are re- 
formulated as a system of tstochastic pro- 
cesses in the following way. Let S be a finite 
or countable set (called the tstate space of 
the motion) and T be {0,1,2,...} or [0, oo) 
(called the time parameter space). A family 
(Xp Poher, «es 1s called a Markov chain if P. the 
probability law of the process X, starting at x, 
is subjected to the condition 


PAX, uu my, eX S t7 Yml 
Aen Xp = x) 
= PL (Xs, e Y1 X = Ym) (1) 


for every t, ET (j=1,...,n;k=1,...,m) such 
that t, <t,<...<t,,0<s,<s,...<s,,. Then the 
function defined by 


PAx,y)=P(X,=y) teT, x, yeS 

satisfies 

Ox p,(x, y)<1, Q) 
2. p(x, y) 5 1, 3) 
Pis y) — 2 PAX, z) ps. y) (4) 


because of (1) and the general properties of 
probability laws. We call p,(x, y) (te T, x, ye S) 
the transition probability (or transition func- 
tion) of the Markov chain, and relation (4) is 
called the Chapman-Kolmogorov equation. 
Furthermore, the matrix p,=(p,(x, y)) is called 
the transition matrix. 

Conversely, for a given p,(x, y) with prop- 
erties (2), (3), and (4), we can construct a Mar- 
kov chain that satisfies 


P(X, 9 x,..., X, = x) 
= p, (x, x1)5,, -1 (X1 X2) oe Dg gs AN S Xn) 


by using Kolmogorov’s extension theorem. 
Such a Markov chain is essentially unique. If 
we are given a stochastic matrix p=(p, ,), i.e., 
a matrix satisfying O0<p, , & 1 and Ees p, ,— 1, 
the components p,(x, y) of the iterated matrix 
p" satisfy (2)-(4), and hence there exists a 
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Markov chain with discrete parameter having 
p, (x, y) as its transition probability. 

Let u, = P(X =x) be the initial distribution 
over S at time 0. Then the distribution p= 
P(X, — y) at time t is obtained from p9 = 
X ees Ux pix, y). In particular, if uf? is indepen- 
dent of t, i.e., 

Hy= Y. up. y). (5) 


xeS 


then 4 is called the invariant distribution of the 
Markov chain. An arbitrary real nonnegative 
solution of (5) is called an invariant measure. 
When S is the set of all d-dimensional lattice 
points and p, ,=7,_,, the associated Markov 
chain is called a (general) random walk. In 
particular, if 1, —2 " for every neighboring 
point x of the origin and =0 otherwise, it 
is called a standard random walk (or simply 
random walk). 
Now, when the equal sign in (3) is replaced 
by «, we consider the space S* obtained 
by adjoining  ('death point) to S, and define 
Pr(X YI=P(% y) x yeS 


p*(x, 0) 1— Y. p,(x, y). 


yes 


xeS; 


pr(0,0)—1;  pf(0,x)—0. 


Then we can associate a Markov chain (X,, P*) 
on ST with {p¥*}. If X, equals 3, the particle 
of the process is regarded as extinct at the 
random time 6 =inf{t| X, — 0j, called the life- 
time (or killing time). In this case, the process 
X, restricted to t smaller than £ is also called 
a Markov chain on S with the transition 
probabilities | p,(x, y)). Then the conditions 
Lyes Dlz, y) 2 1 and Biz œ)=1 are equiva- 
lent, and the chain is called conservative if 

P (t= 00) «1 for every x. 

In this section we have restricted ourselves 
to the temporally homogeneous case. In the 
temporally inhomogeneous case, we have to 
consider the probability laws P, , of the path 
starting from xe$ at time t, instead of P. 
Equation (1) becomes 


P, (X, =y, ave 


=P, (Xs, 1 say Asn = yn), 


[eo py cS 


E ys X, Xs e Xp Xx, 


He 


For the rest of this article, we consider only 
the homogeneous case. 


B. Markov Chains with Discrete Parameter 


Hereafter, the one-step and the n-step tran- 
sition probability of a Markov chain with 
discrete parameter will be denoted by P(x, y) 
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and P,(x, y) instead of p,(x, y) and p,(x, y), 
respectively. 

Let X -(X,, P.) be a conservative Markov 
chain. For a subset A of the state space S, c, = 
min(nz 1| X,e A] (min Ø= +00) is called 
the hitting time for the set A. If P,(c, « o0)» 0 
(=0), we write xy (x+y). Then x y is 
equivalent to the existence of n> 1 with P,(x, y) 
7 0. When x y and yx, we write x e y. The 
set of all x for which there exists y # x with x 
y and y+x is denoted by F and is called the 
dissipative part of S. For elements of $ — F, the 
relation is an equivalence relation. Each 
equivalence class E, is called an ergodic class. 
When F = Ø and S consists of a single class, 
the chain X is called irreducible. (or ergodic). 
For an ergodic class E, the greatest common 
divisor d of {n> 1| f(x, x) 0] for xe E does 
not depend on the choice of xe E and is called 
the period of the class E. A class with period 1 
is called aperiodic. Set G,— {ye E | P,,,,(x, y) > 
0 for some k z 0) (n— 1,2, ..., d) for a fixed 
x€E. Then we have a decomposition of E: E = 
UG,, GaN Gm — Ø (nz m), and eg, Pip, 2) 
=1(veG,). Each G, is called a cyclic part. The 
decomposition may depend on the choice of 
x but is unique up to ordering. 

The point x is called recurrent or nonrecur- 
rent (transient) according as P,(o, « oo) — 1 or 
<1. A necessary and sufficient condition for x 
to be recurrent is Eo P,(x, x) = oo. The prob- 
ability P, (X, — x occurs infinitely often) is 1 or 
0 according as x is recurrent or nonrecurrent. 
A chain is called recurrent or nonrecurrent ac- 
cording as every point in S is recurrent or 
nonrecurrent. À recurrent point x is called 
positive recurrent or null recurrent according as 
m, = E,(o,) is finite or infinite. Let E be an 
ergodic class. If there exists a positive recur- 
rent element in E, then all elements in E are 
also positive recurrent, and the class E is called 
positive recurrent. We can define null recur- 
rence or nonrecurrence of an ergodic class 
similarly. For a finite state space, every Mar- 
kov chain has at least one ergodic class, and 
all ergodic classes are positive recurrent. 


C. Limit Theorems for Markov Chains, and 
Recurrent Events 


The properties of a recurrent chain are re- 
duced to those of an irreducible recurrent 
chain, since a recurrent chain is decomposed 
into ergodic classes. We assume that X — 
(X,, P.) is an irreducible recurrent chain. 
Concerning the transition function we have 
the following limit theorems: Let d be the 
period of X and S=\_J¢_, G, be the decom- 
position into cyclic parts. If x € G}, then 
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lim, P,4,,(x, y) dm; ! (yeG), =0 (ye). 
These results are sometimes referred to as the 
basic limit theorem of P,(x, y). For every x, 
yeS,lim,,, N TE P,(x, y) 2m; !. Further- 
more, r, y= limy- (XX Py, )/ Xa Pj, x) 
exists and is finite. Then r, , is an invariant 
measure as a function of y, and every invariant 
measure is a constant multiple of it. The chain 
is positive recurrent if and only if Xr, , « oo, 
and then we have r, , — m,m, !. Consequently 
there exists a real invariant measure u, (real 
solution of (5)) such that 0< X... |u,| « oo if 
and only if the chain is positive recurrent, and 
in this case u, is a constant multiple of m; 1. 

Let u be an invariant measure, and set I,(f) 
=$ ses Hx f(x). For f, g such that 1,([f]) « oo, 
O« I, ([gl) « oo, and 1,(g) £0, we have 


P, (im ( 2. fa] » uni = LY) =1. 


The ‘law of large numbers, the *central limit 
theorem, and the "aw of the iterated logarithm 
hold for the asymptotic behavior of the sum 
Eazi f&n) as No oc. 

Let 6={E,, E,,...} bea sequence of events 
on a probability space (Q, 2, P), and let u,— 
P(E,). The time intervals {t,} between the 
successive occurrences of & are called the 
recurrence time of &. More precisely, the {z,} 
are successively defined by t, —inf(nz 1|E, 
occurs}, z, zinfínz t, +... +T- - 1 |E, 
occurs} — (t, +... -F t, 4) (k> 2). To avoid 
complications we assume here that all the c, 
are finite *almost surely. The sequence & is 
called a (persistent) recurrent event if the q, 

(k 2 1) are mutually tindependent trandom 
variables with a common tdistribution | f,]. 
More generally, if the distribution of t, is 
allowed to be different from ( f,', & is called a 
delayed recurrent event. The basic relation on 
a recurrent event is the renewal equation 


us — b, Uy fa- ufui. Up- fis 


where {b,} is the distribution of t,. Suppose 
that the greatest common divisor of (n 21 | f, 
>0} is d. Then we obtain 


lim Unair Up lb.  r=1,2,...d, (6) 
where = X nf, is the mean recurrence time 
and f, = sn, This fact is known as the 
renewal theorem. Let X —(X,, P,) be an irreduc- 
ible recurrent Markov chain. For fixed x and 
y, consider the events E,— ( X, — yj under the 
measure P Then, £ — (E,, E,,...} is a delayed 
recurrent event with u, = P,(x, y), f, — P,(o, — n), 
and b, = P,(a, — n). Applying (6), we obtain the 
basic limit theorem of P,(x, y) stated at the 
beginning of this section. We next consider the 
number N, of occurrences of a recurrent event 
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d up to time n. The limiting behavior of N, 
has been extensively studied (— 250 Limit 
Theorems in Probability Theory D). 


D. Potential Theory for Markov Chains 


For a given Markov chain, 


8 


G(x, y)- a, y) 


is well defined, admitting possibly the value oc. 
If G(x, y) is not identically oo, we can define a 
(generalized) *potential with kernel G(x, y) (^ 
45 Brownian Motion; 261 Markov Processes; 
338 Potential Theory). For a real function o 
over S, the function Go(x) — X,.s G(x, y) p(y) is 
called the potential with charge q if the sum 
exists. Even if Go does not exist, the infinite 
sum 27, P,o may exist. In this case this sum 
is also called the potential with charge o. 

A real function f (—o0 < f € +00) over S 

is defined to be superharmonic (or super- 
regular) if Pf « f. Here P is the operator asso- 
ciated with the kernel P(x, y), that is, Pf(x) — 
Lyes P(x, y) f(y). Furthermore, if f>0 and 

f Z Pf, f is called excessive, and if —oo < f 

< +œ and f= Pf, f is called harmonic. If 

Pf « f at a point x, f is called superharmonic at 
x, etc. A potential with charge o >0 is always 
excessive. For an irreducible recurrent chain, 
every nonnegative superharmonic function is 
constant. 

(1) For a nonrecurrent chain we can consider 
the potential f = Go for every function @ with 
finite support. G satisfies (P — I)G = — I and 
bm, P,G —0, where I is the unit matrix. 
Consequently, the operator P — 1 corresponds 
to the tLaplacian A of *Newtonian potential 
theory, and the equation (P — I) f =0 corre- 
sponds to the Laplace equation Af =0. If the 
limit w— lim, ,, P, f exists for a function f on 
S, f can be expressed uniquely as the sum of 
the potential Go to f — Pf) and a harmonic 
function w. This decomposition is called the 
Riesz decomposition, following the terminology 
used in Newtonian potential theory (— 338 
Potential Theory). Let E be a finite subset of S 
and of — min(nz 0| X,e E). Then f(x)— P,(o£ 
< œ) is a (unique) potential which is harmonic 
at xe E° and takes the value 1 on E. This is 
called the equilibrium potential of the set E, 
and its total charge C(E) 7 È esl f(x) — Pf(x)) 
is called the capacity of E. The maximum 
principle and the tbalayage principle (— 338 
Potential Theory) are valid in this potential 
theory. 

(2) For a recurrent chain, we cannot define 
the potential kernel as in (1), since G(x, x)= oo. 
However, we can define a kernel analogous 
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to the case of the tlogarithmic potential. We 
assume that the chain is irreducible. When the 
limit 


A(x, y) - lim (G(X, x) uus! — GG y) 


Go.) Y. Pe y. 


exists for an invariant measure p, the chain ts 
called (right) normal. A chain is normal if and 
only if 


Zeil le lim 3. P(x, z) P(oge-) 
NO reg 


exists. If 4,(-) exists, it is independent of x. Every 
positive recurrent chain is normal. For a nor- 
mal chain we obtain the following results. Let 
ọ be a function with finite support. The poten- 
tial f of ọ exists if and only if X, cu, o(x) 

—0, and f= — Aq in this case. Aq satisfies 

the conditions (P — 1)4o = and lim,,,, P, Ag 
=Q. A function f is a bounded potential of a 
function with finite support E if and only if f is 
bounded and harmonic in E° and satisfies 
D.Ap(x) f(x) 20. An irreducible recurrent chain 
is not necessarily normal. For every such 
chain, however, there exists a kernel W(x, y) 
such that (P—1I)Wo = — q for any function 

Q with finite support and null charge (e, 

X ves Hx P(X) 20). Such a kernel W is called a 
weak potential kernel. 


E. Random Walks 


Consider a random walk defined on the set S 
of all lattice points in a d-dimensional Eucli- 
dean space. Let ST ={x|0>x}, $2 (z|z2x— 
y. X, ye S* 1. F. Spitzer obtained the follow- 
ing results for the random walk with S — S. 
The random walk is recurrent if the following 
conditions are satisfied: (1) d 2 1, X|x|P(0, x) « 
oo (|x| is the distance between 0 and x), and 
m= >xP(0,x)=0; (ii) d=2, m=0, and o° = 

E |x|? P(0, x) « co. When d 23, the random 
walk is always nonrecurrent. The measure 
i, = 1 is invariant whether the random walk is 
recurrent or not. 

Every recurrent random walk is right 
normal, and the potential kernel A satisfies 
(P —I)A— I. Several interesting results are 
known on the uniqueness of the kernel A 
satisfying (P — 1) A — I [7]. 

For the case m=0, there are a number of 
results similar to those for ‘Brownian motion, 
including: (i) When d=1 and 0 «o « oo, 

lim Py(maxo «4 «,IX,| & o /n x)= 1— Fis "1 


noo 


x0, 
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where 





4 » (-1)* n? 
F(x)=1—— ——(2k+1)*x |; 
e) ER pue 
(ii) The are sine law: Let T, be the number of 
k «n for which X, becomes >0. When d=1 
and 0 « o « oo, 


lim P,(T nx) arcsin v Oxxxl; 
(iii) The Wiener test: The set E is called re- 
current if P (op « o0) — 1 holds for every x. 
When d —3 and o < œ, a set E is recurrent if 
and only if £72, C(E,)2 "= oo, where E, = 
Elte six 2n 


F. Markov Chains with Continuous Time 
Parameter 


Suppose that the transition probability 
p,(x, y) is measurable in t. Then p,(x, y) is uni- 
formly continuous in a complement of any 
neighborhood of t=0, and there exists m, , — 
lim, pls, y) for which m, , = Lees m, DS, y) 
= Dees Dx, z)m, y. If p(x, y)=0 for all xeS and 
t0, y is called a fictitious state. Let F be the 
set of all fictitious states of S. Then the restric- 
tion of p,(x, y) to S — F gives a transition prob- 
ability on S — F. If F = @ and the family p= 
(p, C. y)] t» 0, yeS} separates points of S, the 
transition probability p,(x, y) is called stan- 
dard. Then p,(x, y) is standard if and only if 
lim, o p,(x, y) 2 ôx, When F = @ and p does 
not separate, points of S can be reduced to 
the standard case by a suitable identification 
of states. We assume that p,(x, y) is standard. 
Then Dm, af !(pi(x, y) 2, ,) = x,y exists and 
satisfies 0 x q, , < oo if x z y. The matrix Q = 
(4x,y) over S is called the Q-matrix of the tran- 
sition matrix p,=(p,(x, y)), and we write Q = 
Do. Set q,— —4,,, (20). We call x a stable 
state if q, < oo and an instantaneous state if 
q,— oo. If q, < oo, p,(x, y) (the derivative with 
respect to t) exists and is continuous in t» 0. 
When every point of S is stable, z(x, y) de- 
fined by 


-1 if 
n(x, ym rex} x*y 
0 if x=y 
satisfies 


Oxz(x yl, mn(xx)-0, 


Y z(x,y) <1. (7) 
yeS 

From the tKolmogorov-Chapman equation 
we can formally derive Kolmogorov's backward 
equation 

pio Y= —q.po5 y) qx 2, nie, z)p (zy) — (8) 


zeS 
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and its dual, Kolomogorov's forward equation 


p(x, Y= —q,pGc y) + 2, PAX z)q,n(z.y. (9) 
Strictly speaking, these equations hold 

only under suitable conditions: For in- 

stance, a conservative transition probability 
p,(x, y) satisfies (8) if and only if Zeie, y) 
=1.If there exists č, >0 (xeS) such that 
Än, y) < č, (Vt > 0), then p,(x, y) satisfies 
(9) if and only if Ze £,q,z(y, x)= £,. Con- 
versely, given 0 € q, < oo and z satisfying (7), 
there exist in general many solutions of (8) and 
(9) with the initial condition lim, o p,(x, y) = 
A... Among them, the minimal solution 

p? (x, y) exists. The chain with p?(x, y) as its 
transition probability is called the minimal 
chain associated with (q, 7}. 

For the path of a Markov chain with stan- 
dard transition probability, there exists a 
separable measurable *modification X,, but in 
general there exists neither a right continuous 
modification nor a modification having the 
tstrong Markov property for all fstopping 
times. (For detailed properties of paths — 
[8,9]) Set t) 20, c, =inf{t>7,_,|X,#X,,_,} 
(n2 1), and x, =lim,..,, Tn. If each point x of S 
is stable, t, is subject to the exponential distri- 
bution Biz, 7 t) e **, so that E,(1,) 24, 
PX, = y) — n(x, y), and t, and X, are indepen- 
dent with respect to the P,-measure. Further- 
more p°(x, y) e P,(X,— y, Ta >t) is the minimal 
solution. For the minimal chain there exists a 
right continuous modification with left-hand 
limits, which has the strong Markov property. 

For a finite Markov chain with a standard 
transition probability, all states are stable and 
both (8) and (9) are fulfilled. Furthermore, the 
transition probability satisying (8) and (9) is 
the unique minimal solution. 

D. Williams [10] obtained the following 
remarkable result concerning Markov chains 
having only instantaneous states. Let Q be a 
matrix on S with —q, ,—oo(Vx), O € q, , < oo 
(Vx, y; x x y). Then Q is the Q-matrix of a tran- 
sition matrix if and only if the following two 
conditions are satisfied: (i) 3... , min(q, del 
« oo (Vx, y; x € y), (ii) for some infinite subset A 
of S, Zeg ii 4x,y < © (VX). Williams [11] also 
studied a counterpart of Kolmogorov's back- 
ward equation for the instantaneous case. 


G. Birth and Death Processes 


A Markov chain X with state space S= (0, 1, 
2, ... } is called a birth and death process if 

its Q-matrix Q =(q,,,,) satisfies the following 
conditions: 0<4qg= qo. 1 < ©, 0€ qu q, ,-1 + 
dunn 00 (nZ 1) and q, m=0 (mz no 1,n— 
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1), where q, — —4q,,, as before. Usually we 
write 4, for q, ,,, and p, for q, ,.,. The para- 
meters 4, and ji, are called the infinitesimal 
birth and death rates, respectively. In partic- 
ular, the chain X is called a birth process if 44, = 
0 and a death process if 4, — 0. The birth and 
death process satisfying 4, 0 and 4, 0 for 
hA Z 1 has a number of properties similar to 
those of a 1-dimensional ‘diffusion process. 
For the rest of this section we assume that 
Ay =0 (i.e., O is a *trap) and 4,0, i, 0 (nz 1). 
The sequence x,, called the natural scale, is 
defined as follows: x, = u, !, x9 24 ! - 4, 5, 
X =u) HATI... as y MAS Ane) 
(n> 3), x,, =lim,..,, x,. The measure m with the 
mass m, — 4, ... A, 4(H3 ...14) ' (nz2) and = 
1 (n— 1) at the point x, is called the canon- 
ical measure. Then p;(x;, x,) - DI, Kim, ! is a 
transition probability on E— (x,,x;, ...] 
and f(x; t) 2 p,(x;, x,) satisfies a differential- 
difference equation 


oft - Df *, (10) 


which is equivalent to (8). Here, f *(x,) 
(f) 7 fo) X41 — x,) and D,g(x,)— 
(g(x) gt, -,))m, 1. The operator Ta f * 
is similar to Feller's expression for the infini- 
tesimal generators of 1-dimensional diffusion 
processes. 

Every birth and death process ts obtained 
from *Brownian motion by ttime change. 
Furthermore, x,, is regarded as a boundary 
point and is classified as a natural, exit, en- 
trance, or regular boundary point. Every birth 
and death process is determined by (10) and 
the boundary condition at x„ (— 115 Diffu- 
sion Processes). 


H. Markov Chains in Queuing Theory 


A queue is formed when customers arrive at 
random times at some facility and request 
service of some kind. Such phenomenon often 
arises in service systems. The queuing model 
generally embodies the following mathemat- 
ical structure. Suppose that the nth customer 
arrives at time z,, waits for a time interval w, 
until the beginning of his service, and departs 
after a service time v,. Usually the models we 
consider are specified by the following assump- 
tions. The interarrival times u,— fait — t, 

(nz 1) are mutually *independent "random 
variables distributed according to a common 
*distribution function F(x) with tmean 4 '!. 
Similarly, {v,} is a sequence of independent 
random variables with a common distribution 
function G(x) with mean „`+. The sequences 
{u,} and {v,} are mutually independent. The 
queuing discipline is "first come, first served." 
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In case of many servers, this means that the 
first customer waiting in the queue is served as 
soon as any one of the servers becomes free. 
Besides this standard queuing model, various 
other queuing models have been studied in 
different fields of application. There is a huge 
literature on queuing theory, i.e., the mathe- 
matical analysis of queuing models. A com- 
pendium of results obtained by around the 
1960s, together with an extensive bibliography 
on this theory, is contained in [15]. Some 
recent aspects of queuing theory are developed 
in [17]. Here, we are concerned only with 
some simple queuing models and their asso- 
ciated Markov chains. 

There is a standardized notation, introduced 
by D. G. Kendall [14], for identifying stan- 
dard queuing models. In Kendall’s notation 
A/B/s, s represents the number of servers, 
while A and B indicate the types of distri- 
butions F(x) and G(x), respectively. Distri- 
butions frequently used in the first two places 
of Kendall’s notation are the following: M —an 
*exponential distribution, D —a *unit distri- 
bution, E, =a tgamma distribution of order 
k (called a k-Erlang distribution in queuing 
theory), G(or GI) —a general distribution, and 
so on. The model M/-/: is often said to have 
Poisson input, for the number A, of arrivals 
during a time interval (0, t] forms a tPoisson 
process. 

One of the important problems in queuing 
theory is to analyze the number Q, of cus- 
tomers waiting or being served at time t. The 
process Q, is called the queue length. If s= oc, 
then Q, indicates the number of busy servers 
at time t. The simplest and most extensively 
studied queuing model is M/M/s. In this case, 
the process Q, becomes a birth and death 
process with 4;— 4, uj—ju (1 <j <s), and =sp 
(j7 s). Kolmogorov's forward equation is 
given by 


Pi) = Ap.(j — 1) — (4 tip 


BIL j) = Ap,(, j — 1) — (A + sup, (i,j) + sup,(i, j + 1), 


1xjss—1, 


jes. 


This equation can be solved explicitly by the 
method of tgenerating functions. The limit 
distribution lim, p,(i, j) = pj, independent of i, 
exists if and only if su >å. In particular, 
(1—p)p if s=1, 

ap Ee if s=, 
where p — Ai. (In the single-server case, p is 
called the traffic intensity.) Unless the queuing 
model is M/M/s, Q, is no longer a Markov 
chain. In some special cases, such as E,/M/1, 


968 


the properties of Q, can be reduced to those of 
some birth and death process. 

The analysis of Q, itself is difficult in general. 
However, if either F(x) or G(x) is of exponen- 
tial type, the method of embedded ( — imbedded) 
Markov chains is useful. For example, in the 
system M/G/1 we examine the queue length 
only at times fy =0, t,, t5, ..., where t, is the 
departure time of the nth customer. This em- 
bedded process X, — Q,, becomes a Markov 
chain on S — (0, 1,2, ... ). For practical pur- 
poses we could content ourselves with results 
concerning {X,} instead of the original process 
Q,. The transition probability P(i,j) of ( X,) is 
given by 


: P(0,j) - k;, P(i,j)=kj-i+1 izl, (11) 
where 
o0 H j 
| e 9 ago) if jz0, 
kj= 0 j! 


This chain is irreducible and aperiodic. More- 
over, it is transient, null recurrent, or positive 
recurrent according as the traffic intensity 
p=A/u>1, =1, or «1. In the last case, the 
generating function P(s) of the limit distri- 
bution (Pj is given by 

(1 — p)(1 —s)K(s) 


EO e 





where K(s) is the generating function of {k;}. 
Similarly, if the system is GI/M/1, the embed- 
ded process X,=Q, .. is the Markov chain 
with transition probability 


P(i,0)=a,, P(i,j)=li-j+1s j21, 


where 


90 Jj 
ec. abes) it i0, 
b= 0 j! 


J 
l=0 l if j«0, 

and 2; — ail, The limiting behavior of this 

chain is analogous to that of the preceed- 

ing chain (11). In the positive recurrent case 

(p « 1), the limit distribution is given by p;— 

(1 — OC, where ¢ is the solution of L(s)=s in 

(0, 1) for the generating function List of (1j). 

These results are extended to the cases with 

many servers. 

For the general single-server queuing model 
GI/G/1, when we consider the waiting times 
{w,} instead of Q,, we obtain the recurrence 
relation w,,, = max(0, w, 4- v, Mal, This im- 
plies that {w,} is a Markov process with dis- 
crete parameter, over S=[0, oo) an uncount- 
able state space, having the structure of a 
Markov chain with transition probability (11). 
Several methods are exploited for the analysis 
of {w,} [3, 16, 17]. 
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I. Boundary Value Problems for Markov 
Chains 


To discuss the behavior of the path of a Mar- 
kov chain beyond the time t„, we have to 
introduce a suitable boundary of the state 
space S. Among several conceivable bound- 
aries, the Martin boundary, which is most 
frequently utilized, is explained later in this 
section. The name comes from its similarity to 
the 'Martin boundary in the theory of har- 
monic functions. Most results stated in this 
section are also valid for the discrete time 
parameter case. 

Let X =(X,, P,) be the minimal, nonrecur- 
rent chain associated with (4, zl for which Ta 
equals the lifetime &. Harmonic functions, etc., 
are defined as in the discrete time parameter 
case (with p replaced by zl Let y=y(x) bea 
measure such that 0 « yG(y) z X, y(x)G(x, y) 
< oo, where G(x, y) — [$ p,(x, y) dt. Let p, be the 
metric in the one-point compactification of the 
state space S equipped with the discrete topol- 
ogy. Set K(x, y) 2 G(x, y/yG(y), and define 


Nd (ia, y) Kita Y)l 
PAI et KG, KG, y. 


{xn} -$. 


The set 0S of all points adjoined to S by the 
completion of S relative to p= p, + p is called 
the Martin boundary of S. M =SU ôS is a 
compact separable space. By the definition of 
p, K(x, £) 2 lim,.., K(x, y) exists, is continuous 
in č, and is superharmonic in x. A nonnegative 
superharmonic function u is called minimal if 
every superharmonic function v such that 
Oxvxuis a constant multiple of u. The set 0$, 
— (£| K(^, é) is minimal harmonic}, called the 
essential part of OS, is an tF,-set. Then every y- 
integrable nonnegative superharmonic func- 
tion u is represented by u(x)— f K(x, €)u(dé) by 
means of a unique measure u on M, — SUOS;, 
called the canonical measure of u. In particular, 
if u is harmonic, 4 is concentrated on 0S. A 
number of representation problems in analysis, 
such as the Hausdorff fmoment problem, can 
be considered to be representation prob- 

lems of suitable Markov chains. Let u be a y- 
integrable nonnegative superharmonic func- 
tion and (X,, P^) the Markov chain (called the 
u-chain) having p; (x, y) 2 u(x) ' u(y)p.Gs, y) 

(0/0 2 0) as its transition probability. Then 

X;- — lim, X, exists and 


PX eB =u | K(x, Clod), 


where u is the canonical measure of u. 

A measure v on § is called a superharmonic 
measure (harmonic measure) if vg(1 — I) <0 
(=0), that is, 3,.5v,3,(1,, —0,,) <0 (=0). Fix a 
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function g 20 such that 0 « Gg < oo (— Section 
D), and set K*(x, y) - G(x, y)Gg(x) !. Define 
the metric of similar to p,, using the function 
family (K*(', y)] (yeS). The set adjoined to S 
by the completion relative to p* =p, + p is 
called the dual Martin boundary. Extend K* to 
SUGS* and denote by OS? the set of all ye oS* 
such that K *(i,:) is a minimal superharmonic 
measure. Then every superharmonic measure v 
with f v(dx)g(x) « oo is represented uniquely as 
v — f u(dn)K*(n,-) in terms of a measure Uu on 
SUGST. 

Let če M,, and denote the K(-, €)-chain by 
(X, P**). Then Pit < o0) 20 or =1. We call ë 
a passive boundary point in the first case and 
an exit boundary point in the second. On the 
other hand, denote by (X,, P^") the chain 
having P*"(x, y) Kim, x) "Kin. y)p,(y, x) as 
its transition probability. Then Bän < 00) =0 
or =1. We call y a dual passive boundary point 
in the first case and an entrance boundary point 
in the second. 

Fix {q, z}, and let ¥=(X,, P.) be the minimal 
chain associated with (q, n}. Let (@S),, and 
(0S*)., be the sets of exit and entrance bound- 
ary points, respectively. If EK. e(0S),,) — 0, 
(8) has no solution other than the minimal 
solution. But if P,(X,_ €(@S),,)>0 for some 
xES, (8) has infinitely many solutions. Further- 
more, if (0S*)., #@, we can obtain infinitely 
many solutions satisfying (8) and (9). There 
are many open problems in this connection. 


J. Concluding Remarks 


(1) Literature and some historical remarks. 
Besides the monographs on countable Markov 
chains, such as [6, 8], many of the standard 
textbooks on stochastic processes contain 
chapters on Markov chains (e.g., [1-4]). 
Among others, W. Feller [1] gave an elemen- 
tary and elegant treatment of the basic theory 
of Markov chains with discrete parameter, 
which the bulk of the recent textbooks follow. 
The terminology for the classification of states 
we use here follows the first edition of [1]. 
Since the second edition, Feller has used the 
terms "persistent" and "ergodic" instead of 
"recurrent" and “positive recurrent," respec- 
tively. S. Karlin's book [3] contains some 
excellent chapters on applications of Markov 
chains. l 

The potential theory of nonrecurrent Mar- 
kov chains is essentially contained in.that 
of general nonrecurrent Markov processes 
due to G. A. Hunt [5]. The potential theory 
of recurrent chains was developed by J. G. 
Kemeny and J. L. Snell (J. Math. Anal. Appl., 3 
(1961)). A greater part of Kemeny, Snell, and 
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A. W. Knapp [6] is devoted to the potential- 
theoretic aspects of Markov chains, including 
boundary theory. F. Spitzer's book [7] is a 
definitive work on the theory and application 
of random walks. The proof of normality of 
recurrent random walk is the highlight of the 
book. P. Lévy's article [9] is a pioneering 
work on Markov chains with continuous 
parameter involving instantaneous states. 
Feller and H. P. McKean first gave an exam- 
ple of Markov chains having only instantane- 
ous states (Proc. Nat. Acad. Sci. US, 42 (1956)). 
K. L. Chung's monograph [$8] is the only 
book devoted to the foundation of the theory 
of Markov chains with continuous parameter. 
The exposition of birth and death processes 
in this article follows Feller [12]. (For details 
of birth and death processes — [3].) The first 
systematic treatment of queuing theory from 
the point of view of stochastic processes is 
due to D. G. Kendall [13, 14], and this has 
greatly influenced subsequent work in this 
field. As the standard textbooks of queuing 
theory we mention N. U. Prabhu [16] and 
ch. 14 of Karlin [3]. The Martin boundary 
theory of nonrecurrent Markov chains was 
developed by J. L. Doob [18], Hunt [19], and 
T. Watanabe [20]. There are many papers on 
the extension of the boundary theory to more 
general Markov processes (e.g., H. Kunita 
and T. Watanabe, Illinois J. Math., 9 (1965)). 
The exposition in this article was taken from 
Kunita and Watanabe (Sügaku, 13 (1961); 14 
(1962)). Feller (Trans. Amer. Math. Soc., 83 
(1956)) was the first to introduce the notion 
of an tideal boundary of Markov chains. 

He also studied the tboundary conditions for 
Kolmogorov's equations (Ann. Math., (2) 65 
(1957)). 

(2) General Markov chains. A Markov pro- 
cess with discrete parameter is often called a 
Markov chain whether the state space is count- 
able or not. We here call such a process a 
general Markov chain. A considerable amount 
of the results on Markov chains—on potential 
theory, limit theorems, random walks, and so 
on-—are extended to general Markov chains. 
Classical results are found in Doob [2]. Later, 
Sparre E Andersen (Math. Scand., 1 (1953); 2 
(1954)) initiated the fluctuation theory of 1- 
dimensional nonlattice random walks. T. E. 
Harris carried out an important study on the 
existence of invariant measures of recurrent 
general Markov chains (Proc. 3rd Berkeley 
Symp. on Math. Stat. Prob., vol. 2, 1956). S. 
Orey (Pacific J. Math., 9 (1959)) extended the 
limit theorems of Markov chains to a class 
of recurrent general Markov chains (called 
Harris chains). A systematic treatment of gen- 
eral Markov chains ts given in D. Revuz [21]. 
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A. General Remarks 


Let ( X,],, 7 be a tstochastic process defined on 
the ‘probability space (Q, 8, P). The 'state 
space S of X, is the set of real numbers R or N- 
dimensional Euclidian space RI In general, S 
may be a flocally compact Hausdorff space 
satisfying the second countability axiom. T is 
an interval [0, oo) or a set (0, 1,2, ... y. (T may 
also be any interval in the real line or a set 
f..., —2,1,0,1,2,...}.) We call this {X,},.7 a 
Markov process if, for any choice of points 
$,€$5 «€ ... «s,«t in T, the tconditional prob- 
ability distribution of X, relative to X, , X, , 
X, is equal to the conditional probability 
distribution of X, relative to X, . Namely, for 
AeB(S) and x,,..., X ES, 


(1) P(X EAX, —x,,..., X, x) 
—P(X,e A| X, — x,), 


where B(S) is the least *o-algebra that contains 
all open sets of S. 

For a Markov process {X,}er, the distri- 
bution of X, is called the initial distribution of 
{X, hier The conditional probability distri- 
bution P(X,¢ A| X, — x) is denoted by p(s, x, 

t, A) and is called the transition probability of 
[Xie y. This is a function of s, te T (s« t), 
eh, and Ace %(S) that has the following 
properties: 


(2) For fixed s and t, P(s, x, t, A) is a tprobabil- 
ity measure in A and is B(S)-measurable 
in x. 


(3) P(s,x,s, A)=1 
=0 if x¢A. 


if xeA, 


(4) The Chapman-Kolmogorov equality, 


P(s, x,u, a-| P(s, x, t, dy)P(t, vu. A) 
S 


(s<t<u). 


In view of (1), the tfintte-dimensional distri- 
bution of a Markov process ts completely 
determined by its initial distribution and its 
transition probability. 

Moreover, for a given function P(s, x, t, A) 
satisfying (2), (3), and (4), and for a given prob- 
ability distribution o on (S), there exists a 





261 A 
Markov Processes 


Markov process {X,},-7 with transition proba- 
bility P(s, x, t, A) and initial distribution u. In 
this sense, the transition probability is a char- 
acteristic quantity of the Markov process. 

If the transition probability depends only on 
the difference between s and t, that is, if there 
exists a function P(t, x, A) of t, x, and A such 
that P(s, x, t, A) - P(t—s, x, A), then the Mar- 
kov process is called temporally homogeneous. 
If S is an additive group and there exists a 
function P(s, t, A) of s, t, and A such that 
P(s, x, t, A)» P(s,t, A— x), where A—x={y— 
x|yeA}, then the Markov process is called 
spatially homogeneous. When S — R^, a spa- 
tially homogeneous Markov process is an tad- 
ditive process. A Markov process whose state 
space S is countable is called a * Markov chain 
(-* 260 Markov Chains). A Markov process 
whose sample path (— Section B) is continu- 
ous with probability 1 (— 407 Stochastic Pro- 
cesses) is called a 'diffusion process (— 115 
Diffusion Processes). Consider the case T= 
[0, oc) and S - R", and assume that the transi- 
tion probability has a tdensity p(s, x, t, y) with 
respect to Lebesgue measure and satisfies 
certain analytic conditions that ensure the 
continuity of the sample path, etc. Then A. N. 
Kolmogorov proved that p(s, x, t, y) satisfies 
the tFokker-Planck partial differential equa- 
tion. Conversely, he raised the problem of 
finding conditions for the existence and 
uniqueness of the transition probability satis- 
fying the given Fokker-Planck equation [1] 
(— 115 Diffusion Processes). W. Feller ex- 
tended this equation to an 'integrodifferen- 
tial equation of a certain type and solved 
the problem partially by classical analytic 
methods [2]. On the other hand, S. N. Bern- 
stein [3] and P. Lévy [4] made probabilistic 
approaches to the problem, and K. Itó con- 
structed Markov processes directly by solv- 
ing the corresponding stochastic differential 
equations [5] (— 406 Stochastic Differential 
Equations). 

A profound study of the structure of Brown- 
ian motion and the additive process by P. 
Levy [4,6], the theory of tmartingales initiated 
by J. L. Doob [7], and the fstochastic calculus 
due to K. Itó [5] have served as useful appara- 
tus in analyzing the structure of Markov pro- 
cesses probabilistically. The theory of Markov 
processes has also retained a close relationship 
to various aspects of mathematical analysis— 
especially to functional analysis and tpotential 
theory. The link lies in an obvious fact that, by 
virtue of the formula 


(5) Tfi e- | DU, x, dy)f(y). 


S 


the transition probability p(t, x, A) of a (tem- 
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porarily homogeneous) Markov process in- 
duces a semigroup (1,12 0] of linear positive 
operators on a function space, e.g., on the 
space B(S) of all bounded measurable func- 
tions on S. 

The study of diffusion processes has played 
important roles in the theory of Markov pro- 
cesses. When S- RI. the structure of diffu- 
sion processes was completely clarified by W. 
Feller [8], E. B. Dynkin [9], K. Itó and H. P. 
McKean [10], and others (— 115 Diffusion 
Processes). In succession, G. A. Hunt [11] and 
E. B. Dynkin [9] isolated fairly general and 
practically useful concepts in Markov pro- 
cesses, including the right continuity of sample 
paths and the strong Markov property. Based 
on these properties, probabilistic potential 
theory and the theory of additive functionals 
were developed. 

We now present a formulation for the tem- 
porally homogeneous Markov process with 
continuous time parameter [9,11—14]. 


B. Fundamental Notions 


Adjoin a point ô to S as a point at infinity 
when S is noncompact and as an isolated 
point when S is compact, set $ 2 SU {ð}, and 
let B(S) be the c-algebra that consists of all 
the Borel sets in S. Let W be the set of all right 
continuous functions w(t) whose tdiscontinu- 
ities are at most of the first kind and such that 
w(t)=6 for tz s if w(s) 2 0. By convention, we 
set w(oo) — €. Let (w) be the minimum of the 
t-values such that w(t) €. For we W, w and 
w,' in W are defined as w, (s) = w(min(t, s)) and 
w, (s) 2 w(s +t), respectively. Let W be a subset 
of W that is closed under the operation w— 
w, and w—w, , and let 8 2 8(W) be the o- 
algebra generated by the sets {we W|w(t)e A} 
(A €98(S)). We often write X,(w) for w(t). The 
subclass of B that consists of sets represented 
by {weW|w, € B) (Be 8) is denoted by 38,. 
Suppose that the family of probability mea- 
sures {P.}(xeS) on (W, B) satisfies the follow- 
ing conditions: 


(6) For a fixed B in $8, P,(B) is B(S)- 
measurable in x. 


(7) P.(X,(w)=x)=1 for xeS. 


(8) The Markov property P,(w; €B|88,) = 
P, «w)(B) holds with P,-measure 1 for BeB. 


Then the triple 9t ^ (X,, W, P, | xeS) is called 
a Markov process. This is a mathematical 
model for the random motion of a particle 
moving in 5 whose tprobability law is inde- 
pendent of its past history once the present 
position of the particle is known (— 260 Mar- 
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kov Chains). We call S the state space of W, W 
the path space, and an element w in W a path, 
respectively. P, represents the probability law 
of a particle that starts from x. In view of the 
interpretation that the particle vanishes if it 
reaches ô, we call £(w) the lifetime (or terminal 
time) and à the terminal point. 

Now, for any s, «s; € ... « s, and t, 


PAX, 6 ALK, EE e 
- PAX, E€ A| X,)— Py, (X,e A) 


holds with P,-measure 1 for any x. This equal- 
ity means that {X,},>o is a Markov process 
on S in the sense mentioned before with initial 
distribution 6, (a unit measure at x) and the 
transition probability given by P(t, x, A)= 
P.(X,e A). The restriction of this transition 
probability to (t, x, A)e [0, oo) x S x B(S) satis- 
fies the following properties: 


(2) Fora fixed tz 0, P(t, x, A) is B(S)- 
measurable in x and is a measure in Ac 
B(S) with P(t, x, S) « 1. 


(3) P(0,x, A)=64,(A), xeS, Ae B(S). 


(4) P(s+t,x, A)= | P(s, x, dy) P(t, y, A). 
s 


(9) lim,o T, f(x) — f(x), xeS, holds for any 
bounded continuous function f on S, 
where T, f is defined by formula (5). 


{T,} is called the semigroup of the Markov 
process M because it enjoys the property 
T,.,=T,T, on B(S). By convention, any 
function f € B(S) is extended to a function on S 
by setting f(0) — 0. (9) then follows from the 
expression T, f(x) 2 E,( f(X,)), where E,( ) 
represents the integral with respect to P. 

A Markov process W is called conservative if 
P(t, x, S) 2 1 holds for every x in S and every t. 
A point x in S is called a recurrent point if for 
every neighborhood U of x and every t>0, the 
path starting from x returns to U after time t 
with probability 1. Under some conditions, x 
is recurrent if and only if fẹ p(t, x, U)dt = oo for 
any neighborhood U of x. Mt is called recur- 
rent if all points in S are recurrent. Otherwise, 
it is called transient. W is of course conserva- 
tive if it is recurrent. There are other instances 
in which {X,} is called recurrent, if a particle 
starting from any point in S reaches any neigh- 
borhood of any other point in S in finite time 
with probability 1. In this case, under suitable 
conditions for regularity, the *mixing property 
holds, whence follows the fergodic property 
(— 136 Ergodic Theory). If the transition 
probability of ( X,) has an tinvariant mea- 
sure with total mass 1, tBirkhoff’s individual 
ergodic theorem holds [7]. 

Let {%,} be an increasing family of ø- 
subalgebras of 8(W) such that X, is %,- 
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measurable for each t 2 0. A (0, oo)-valued 
function c on W is said to be an ({%,}-)Markov 
time (or stopping time) if (o|o(c) €t] &$,, t20. 
The c-algebra iy, is then defined by 


Ye ={BeB| BN {o<t}e§%, for any tz 0]. 


If there exists a family {%,} as above and if for 
every {¥,}-Markov time c and for any xe$, 


52-0, and deis 
(10) P(X545€ Al Bo) = Py (XE A) 


holds with P,-measure 1, then it is said that MN 
has the strong Markov property with respect to 
(8,]. and such an M is called a strong Markov 
process. Since a constant time is also a Mar- 
kov time, the strong Markov property imposes 
stronger conditions than the Markov prop- 
erty. A sufficient condition for a Markov 
process Mt to have the strong Markov prop- 
erty is that the space C(S) of all bounded 
continuous functions on S. be invariant under 
the semigroup of W. In this case, we can take 
= (ese 2. 

It is convenient to enlarge the algebra 8, as 
follows. Let u be a measure on (5, B(S)), and 
write B(S), for the completion of B(S) by yi. 
Write B(S) for the intersection of all (B(S)),, 
where u runs over all probability measures on 
B(S). Let P, be the measure on (W, 8) defined 
by P,(A)= fs u(dx) P, (A); and let 8, = (7, (88), 
and 38 — (',(8),, where (8), is the completion 
of B by P, and (%,), is the o-algebra obtained 
by adjoining to 88, all P,-null sets in (8),. A 
Markov process Mi is called a Hunt process if 
88, — (..,88, for any t>0, M has the strong 
Markov property with respect to {B,}, and W 
has left quasicontinuity in the following sense: 
If (6,j is any increasing sequence of {B,}- 
Markov times and c —limo,, then for o < 
oo, lim X, =X, holds except for a set of P,- 
measure 0 for any ven, For a set A of S, let 


o,=inf{t|t>0, x,¢A} if such a t exists, 
= oo if x,€ A for all t» 0. 


Then c, is called the hitting time for A. (Some- 
times the condition t0 in the definition of c, 
is replaced by tz: 0.) If M is a Hunt process, 
then c, is a {B,}-Markov time for any Borel 
set A [14, 15]. For a subset A, t4 —6,. is called 
an exit time from A. If W is a strong Markov 
process, the exit time t, from a point a is sub- 
ject to the exponential distribution 


Bir, >t) =e,  0xA(a)« oo. 


In particular, a is called an instantaneous state 
if ^(a) — oo and a trap if A(a)=0. For a Hunt 
process M, P.(B) —0 or 1 if B is in Bg. This is 
called Blumenthal's zero-one law. 

A function P(t, x, A) oft z0, ven, and Ae 
B(S) is said to be a transition function on S 
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if it satisfies (27), (3), and (4^). Denote by C, (S) 
the set of those functions in C(S) vanishing at 
infinity. A transition function defines by (5) a 
linear operator on B(S) which satisfies the 
semigroup property 7; T, = T, ,,. If T; leaves the 
space C, (S) invariant for every t 2 0 and if (9) 
holds for any f € C, (S), then the transition 
function (resp. semigroup T) is called a Feller 
transition function (resp. Feller semigroup). 
Any Feller transition function on S admits a 
Hunt process; namely, for a given Feller tran- 
sition function P(t, x, A), there exists a Hunt 
process IN (on S) whose transition probability 
coincides with P(t, x, A) for tz 0, xe S, and 

A e :8(S). Such a process IM is often called a 
Feller process. For instance, any spatially 
homogeneous Markov process on RN is a 
Feller process. If a Feller transition function 
satisfies an additional condition that, for any 
compact set K and neighborhood G of K, 


t 'P(t,x,S—G)>0, t0, 


uniformly on K, then the associated Hunt 
process W is a diffusion, namely, P, (X, is 
continuous in t<{)=1, xeS [2,7]. 


C. Generators of Markov Processes 


For a given Markov process M, the generator 
of Mis defined by 


(11) f(x) -limt "(tt fo) xe, 


where {T;} is the semigroup of M. The choice 
of the domain D(G) of G depends on the 
situation. When {7;} is a Feller semigroup, it is 
a strongly continuous semigroup on C,,(S). Let 
© be its infinitesimal generator in the sense 

of Hille and Yosida (— 378 Semigroups of 
Operators and Evolution Equations). Then 
D(G) consists of those functions fe C, (S) for 
which the convergence in the right-hand side 
of (11) takes place uniformly on S. A necessary 
and sufficient condition for a linear operator 
on C,,(S) to be the infinitesimal generator of a 
Feller semigroup is known [8]. 

For a general strong Markov process W, 
one way of defining D(G) is as follows [16, 17]. 
Denote by C(S) the set of all functions fe B(S) 
which are finely continuous in the sense that 


(12) P,Cf(X,) 1s right continuous in tz 0)— 1, 
xeS. 


D(G) is then defined as the set of those func- 
tions fe C(S) for which the convergence in the 
right-hand side of (11) takes place boundedly 
in x with limit functions belonging to C(S). © 
is called the generator of the strong Markov 
process WM. The following three conditions are 
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equivalent: 


(13 feD(G), Gf=g. 


(14) f,geC(S) and T/f(x)— f(x) 


-| T,g(x)ds, t>0. 
0 


t 
(15) f. ge C(S) and nn 2 | g(X ds 
0 
is a *martingale on (W,8,, P.) for each 


eh, 


Furthermore, let c be a Markov time such that 
E,(c) « oo. Then we have Dynkin’s formula: for 


JEDO), 
J=- sl | i f(x at] A EG) 
If 6f is continuous at x, then 
ent. tim ELS X= fle 


Ux} E,(ty) 
=0 if x is a trap, 


if x is not a trap, 


where U is an open neighborhood of x. 

This representation of © suggests that, when 
S is a smooth manifold, Gf for smooth func- 
tions f often becomes an elliptic partial dif- 
ferential operator or a mixture of it and an 
integral operator satisfying a certain maximum 
principle. In many practical cases, the data 
given to us are the coefficients of such a linear 
operator L carrying a certain linear space 
D(L)c B(S) into B(S). Thus a problem arises 
as to the existence and uniqueness of a strong 
Markov process M whose generator © is 
an extension of L. In connection with (15), 
Stroock and Varadhan [18] formulated this 
problem as the martingale problem concerning 
the existence and uniqueness of a probability 
measure P, on W such that P,(X)=x)=1 and 
f(X) — f(X) — fo Lf(X,) ds is a martingale on 
(W, %,, P.) for every f € D(L). Since this formu- 
lation refers directly to probability measures 
on W, it is useful in the study of stochastic dif- 
ferential equations and of the convergence of 
Markov processes as well. 

Let m be a positive Radon measure on S$ 
which is strictly positive on each nonempty 
open set. A Markov process W is said to be m- 
symmetric if 


| Tf (x)g(x)m(dx) 
S 


-| fG)ng(x)m(dx) (< +) 
S 


holds for any t>0 and nonnegative mea- 
surable functions f, g. Then {T,} is realized 
uniquely as a strongly continuous semigroup 
of symmetric operators on the real L?-space 
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L?(S; m). Let © be its infinitesimal generator. 
(5 is then a nonpositive definite self-adjoint 
operator on L?(S; m), and the following defi- 
nition of the symmetric form & on L?(S; m) 
makes sense: D[&]= D(./ — 6), 6(f£,.g)— 
(/-G f, / —6 g) f; ge S[6], where( , ) 
denotes the inner product in L?(S; m). & is 
called the Dirichlet form of an m-symmetric 
Markov process M. A Dirichlet form (on 
L?(S; m)) is by definition a closed symmetric 
form & on L?(S; m) such that ue D[4] implies 
v — min(max(u, 0), (e DLE] and &(v, v) € E(u, u). 
A Dirichlet form is called regular if the space 
DLETA C$(S) is dense in D[é] and in C,(S), 
where C,(S) is the space of continuous func- 
tions with compact support. For a given regu- 
lar Dirichlet form &, there exists uniquely in a 
certain sense an m-symmetric Hunt process 
whose Dirichlet form is the given & [19]. In 
many cases of symmetric Markov processes, 
the form é(f, f) rather than the generator Gf 
admits an explicit expression for smooth func- 
tions f. 

When W is the N-dimensional Brownian 
motion, the generator of its semigroup on 
L? (RY) is given by Gu —iAu, D(G)= {ue 
L*(R^)| Aue L^(R")), and its Dirichlet form 
is given by 


1 
E(u, v)= 2 | gradu: grad vdx, 
RN 


7" eL) 


la d'EM - 
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D. Markov Processes and Potential Theory 


Analytic notions and relations in classical 
potential theory can be interpreted in terms of 
Brownian motion (— 45 Brownian Motion). 
The relevant probabilistic notions and re- 
lations have been formulated not only for 
Brownian motion but also for a general Hunt 
process under the name of probabilistic 
potential theory [11, 14, 20]. 

Let M be a Hunt process. A set ACS is 
called nearly Borel measurable if for each 
probability measure u on S there exist A,, A, 
€B(S) such that A, c Ac A, and P,(X,e 
A5 — A, for some t >0)=0. 8"(S) denotes the 
family of all nearly Borel subsets of S. Then 
B(S) B"(S) < B(S). The hitting time c, for a 
nearly Borel set A is still a {8,}-Markov time, 
and the probability Bio, 0) is either zero or 
one according to Blumenthal’s zero-one law. x 
is said to be a regular point of A in the former 
case and an irregular point of A in the latter 
case. Let A’ denote the totality of regular 
points of A. 

A set ACS is called finely open if for any 
x€ A there exists a set C= C(x)e B” containing 
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S— A such that x¢ C”. Any open set is finely 
open by virtue of the right continuity of the 
paths. The fine topology defined by the fine 
open subsets of S is therefore finer than the 
original topology. A nearly Borel measurable 
function f on S is finely continuous if and 
only if it satisfies (12). A nonnegative *B(S)- 
measurable function f is called x-excessive 
(x20) ife" Tf <f for any t and e "T,f(x)o 
f(x) (t-+0). A 0-excessive function is called 
simply excessive. Any «-excessive function is 
nearly Borel measurable and finely continuous. 
The function f(x) 2 G,g(x) — fo e ^ T;g(x)dt 
(G, is called the resolvent of I) is -excessive 
for any nonnegative 8(S)-measurable func- 
tion g. If f is «-excessive and ACS is nearly 
Borel measurable, then the function p^ f(x) = 
Efe 774f(X,,); G4 < 00) is a-excessive again. 
When W is the Brownian motion on R” (N > 
3), the class of excessive functions coincides 
with the class of nonnegative tsuperharmonic 
functions. Also, the fine topology ts identical 
with the ‘Cartan fine topology. 

A set ACS is called polar if A is contained 
in a set DeB” such that P,(¢p < 00)=0 for any 
xes. Ais called thin if A is contained in a set 
Den such that D” is empty. A set contained 
in a countable union of thin sets is called 
semipolar. If A is semipolar, then X,c A occurs 
for at most countably many values of t with 
P -measure 1, xeS. The set A — A’ is semipolar 
for any Ae B”. Any polar set is semipolar. The 
converse is also true when Wt is Brownian 
motion but not true when Mit is the uniform 
motion to the right on R!. 

Under certain conditions of duality or sym- 
metry for the Hunt process, the notion of 
capacity can be introduced by generalizing the 
classical *Newtontian capacity. Then a set of 
zero capacity is identified with a polar set or 
its weaker version [11, 19]. The stochastic 
solution of the classical tDirichlet problem can 
also be formulated for a wide class of diffu- 
sions. Thus, for a domain DCS, the solution of 
the equation Gu(x)=0, xe D, with boundary 
function f is expressed as u(x) = plp f(x), xe D, 
and the boundary behavior of u is studied 
probabilistically [9]. 

We have so far discussed mainly potential 
theory for transient Markov processes. How- 
ever, we can also establish potential theory for 
recurrent Markov processes following the 
model of the tlogarithmic potential in the case 
of 2-dimensional Brownian motion. 





E. Additive Functionals 


The notion of additive functionals was first 
introduced in relation to the study of time 
change of Markov processes, and in particular, 
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to the study of the "local time of Brownian 
motion. Later it was studied in relation to 
potential theory and *martingale theory. Addi- 
tive functionals play an important role in the 
study of Markov processes. For a Markov 
process W, a function g=¢,(w) of t and w is 
called a (right continuous and homogeneous) 
additive functional if the following conditions 
are satisfied: (1) —oo < p(w) € oo; (2) e, (w) is 
right continuous in t (t C), op exists, and q, 
= qx- for tz 6G (3) p(w) is 8,-measurable in w 
for fixed t; and (4) for any t and s, 


(16) 9,.,(w)=@,(w)+ @,(w,) holds with P,- 
measure 1 for any xes. 


We call o a perfect additive functional, if it 
satisfies (1), (2), (3) and satisfies (16) for all w. 
Two additive functionals are equivalent if they 
are equal except on a set of P,-measure 0 for 
any t and x, and q is called nonnegative if there 
exists a nonnegative additive functional which 
is equivalent to o. The concept of continuous 
additive functionals can be defined similarly. 
The function «=a,(w) of t and w is called a 
(right continuous and homogeneous) multi- 
plicative functional if it can be expressed by an 
additive functional o as 


(17) a,(w)=exp(—¢,(w)). 


An additive functional q is said to be natural if 
p, and the path X, of M have no jumps in 
common with P,-measure 1 for any xes. 

For a nonnegative additive functional o. 


(18) EE | “edo 
Q 


is a-excessive. Let WM be a Hunt process. A 
finite -excessive function u, can be expressed 
in the form (18) by a unique nonnegative 
natural additive functional ¢ if for every 
increasing sequence (o,) of Markov times 
such that of, E,(e *°"u,(X,,)) 20 holds. 

It is possible to choose a continuous q cor- 
responding to u, if and only if for every in- 
creasing sequence Jo, of Markov times, 

E (e *"u,(X,,)) > E,(e "^u,(X,)) holds, where 
o=limo,,. In particular, u, may correspond 

to a continuous q if the following two con- 
ditions are satisfied: (1) u, is bounded and 

e "Tu,(x)20 (t2 0), (2) e " Tru (x) > u,(x) 
(t0) uniformly in x [4]. In the case of 
Brownian motion, any nonnegative continu- 
ous additive functional o can be characterized 
by a certain o-finite measure u on S charging 
no polar set, which is called a smooth measure 
[9,21]. Such a characterization can be ex- 
tended to more general Hunt processes satis- 
fying certain conditions of duality or sym- 
metry with respect to a basic measure m, and 
the correspondence between ¢ and y is speci- 
fied by the following formula of Revuz [22]: 
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For any function f 20, 


1 t 
(19) im; es | "uge F(x) (dx). 


An additive functional o of a Hunt process 
IM is called a martingale additive functional if 
E,{92)< and E,(g,)=0 for any t>0 and 
xes. In fact, o, is then a square integrable 
tmartingale on (W, {B,},50, P,) for each xes. 
Let M be the set of all martingale additive 
functionals. The study of the class .@ consti- 
tutes a special aspect in the general theory of 
*semimartingales [15, 16, 23-26]. In particular, 
the quadratic variation <ò of pe M can be 
realized as a nonnegative continuous additive 
functional that satisfies E,(<g>,) = Elo?) for 
all xeS and t» 0. The additive functional 
(9, V» is similarly defined for o, Y € A. Con- 
sider for oe M a measurable function f on S 
such that E,(fo /(X,)? d(@),) is finite. Then the 
stochastic integral f: can be defined as an 
element of Æ satisfying 


t 


Q0) EAS: owo-E(| 


o 


feodo tz) 


wel. 


(f-o) is often denoted by (5 f(X.) do,. The 
space of all continuous functionals in æ is 
denoted by .Z,. Its orthogonal complement in 
the sense of < , > 1s denoted by 4. The struc- 
ture of æ, is known, and each functional in 
Mg; can be represented by means of the so- 
called Lévy system, which is a pair consisting 
of a certain kernel on S and a certain nonnega- 
tive continuous additive functional [24, 25]. If 
M is an N-dimensional Brownian motion, 
then M =M., and any oe Zë takes the form 


i=1 JO 


ou ec Lanuag 


where the integral appearing on the right-hand 
side is defined by (20) for the ith coordinate 
process X; — Xį¿e M and for a measurable 
function b; on R” with E,([5 RON ds) « oc, 
xeR" [23,28]. 

Replacing (16) by the "associative law” 


p(w) + palw) = Del, 


we can also define temporally inhomogeneous 
additive functionals. 


F. Transformation of Markov Processes 


There are several methods by which a given 
Markov process can be transformed to a 
new one. Here we mention some important 
transformations. 


Transformation by a Multiplicative Functional. 
For a Markov process IN, let « be a multiplica- 
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tive functional such that F,{a,)<1 and P,(a)= 
1)=1. Set 


P*(t, x, T)= E.(xr( X1) 
P*(t, x, (0)) 2 1 — P*(t, x, S). 


(Te B(S)), 


Then P*(t, x, E) is a transition probability on 
S and corresponds to a Markov process 
M* (S, W*, P. xeS). We call Mt a transfor- 
mation of W by a multiplicative functional «x. It 
is possible to choose W*= W. If M is a strong 
Markov process, so is W”. Conversely, let M 
and W be two Markov processes with the 
same path space W on the same state space S. 
If the probability law P; of W is absolutely 
continuous relative to P, of IR, then W is a 
transformation of M by a certain multiplica- 
tive functional of M. This transformation 
includes killing, transformation by drift, and 
superharmonic transformation as special cases. 
(1) Killing. Transformation by a multiplica- 
tive functional a is called killing if O x o, € 1 
holds. In fact, 97 can be constructed as fol- 
lows: A particle going along a path w of Wè is 
"killed" (jumped to 0) in such a way that its 
surviving probability up to time t is a,(w). For 
a nonnegative bounded continuous function 
c(x) on S, let 


a,(w)— ol — 1 c(X,(w)) as) : 
0 


Then « satisfies the condition given previously. 
If the semigroup {T,} of W is a Feller semi- 
group and has the generator ©, then the semi- 
group {7;} of WY is also a Feller semigroup, its 
generator ©” has the same domain as ©, and 
(5*— 6 —c [29]. 

(2) Transformation by drift. For pe M, let o, 
—expíq, — (9»,/2). Then a, is a multiplicative 
functional. The transformation determined by 
a is called a transformation by drift. Let M be 
the N-dimensional Brownian motion and 
oeM be the functional expressed as in (21) 
with bounded b,, ..., by; then 


tN 
Gei | 2 bX ds, 


and the above formula for « gives a transfor- 
mation by drift. Moreover, if b,,..., by are in 
Ca (S), then the semigroup of W” is a Feller 
semigroup, and for a bounded function f with 
bounded continuous derivatives up to the 
second order, f is in the domain of 6? and 
Gf =(1/2)Af + LZ, b(f/0x) 9, 16]. 

(3) Superharmonic transformation. Let u be 
an excessive function of W and A= {x|0< 
u(x) < oo}. Set 


a,(w) =u(X,(w))/u(Xo(w)) if Xo(w)eA 
=0 if X,(w)¢ A. 


Then a, is a multiplicative functional. The 
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transformation defined by «, first introduced 
by Doob, is called a superharmonic transfor- 
mation. The transition probability P(t, x, I) of 
WM? is equal to u(x)" fr P(t, x, dy)u(y) if xe A, 0 
if xA and t» 0, and 6,(T) if x A and t=0, 
for T in B(S). In particular, if Mt is a Feller 
process and u is a continuous function such 
that 0<c<u<k< oo, then W? is also a Feller 
process and G*f =u ! G(uf), where the do- 
main of 6* is the set of f for which uf is in the 
domain of (5. 


Time Change. We understand the term time 
change in a broad sense, including the follow- 
ing two important special cases. 

(4) Time change by an additive functional. 
Let W be a Hunt process and ¢ be a nonnega- 
tive continuous additive functional such that 
P(o —0) —1. Set S* = (x| P(o,(w) > 0 for 
every £20) — 1), and assume that S* is locally 
compact. Let W* be the set of all right con- 
tinuous functions on S*. They have discon- 
tinuities of at most the first kind, and B* is 
the c-algebra on W* generated by all tBorel 
cylinder sets. Set P*(B) = PX. i, (w)e B) 
(Be38*), where o ! is a right continuous in- 
verse function of g,. Then M* — (S*, W*, P*, 
xe§*} is a Hunt process on S*, and we say 
that M* is obtained by time change from M 
by o Roughly speaking, M* can be con- 
sidered to be a Markov process with paths 
Xf*(w) = X,-1,)(w). The resolvent of M* is 
given by E,({ge ^^f(X))dq). Suppose that 
a(x) is a continuous function on S such 
that 0 «c «a(x) € k « oo, and set 9,(w)= 
foa(X,(w))ds. Then S* =S, W* = W, and 
B* =B, and M* has the same fine topology as 
M. The domain of the generator G* of M* 
coincides with that of © of Mand G*f= 
a ‘Gf. Let M and W be Markov processes 
with the same state space and the same path 
space W. If they have the same hitting proba- 
bilities {H,(x,-)=P,(X,,€°)}, then each one 
can be obtained from the other by time change 
by a strictly increasing additive functional. The 
converse is also true [14]. 

(5) Subordination. This concept, introduced 
by Bochner, was then extended as follows: 

Let e" mä be the tLaplace transform of an 
tinfinitely divisible distribution with support 
[0, oo), and let F,(-) be the distribution with 
Laplace transform e '*?, Let {T;} be a Hille- 
Yosida semigroup on a certain "Banach space, 
and set T,^ = ( T, (ds) (Bochner integral). 
Then (11 is also a semigroup and is called 
the subordination of ( 7;) by y. If G is a gen- 
erator of {7,}, then —V(— ©) is a generator of 
(T9). 

In particular, we can assume {T,} to be a 
nonnegative semigroup on C(S)(C,, (S)) such 
that 7,1 — 1, and {X,} to be a Markov process 
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corresponding to the semigroup T;. Let {‘P(t)} 
be an additive process that is independent of 
(X) and satisfies E(e ^*^) Ze "*. Set Y. (o5) 
= Xy slk then {Y,} is a Markov process 
corresponding to the semigroup { T,”}. The 
operation by which we obtain ( Y;} from | X,j 
by using {¥(t)} is also called subordination. In 
particular, if {¥(t)} is a one-sided stable pro- 
cess of the ath order (— 5 Additive Processes), 
this operation is called the subordination of the 
ath order. If {X,} is an additive process, then 
the process obtained from it by subordination 
is also an additive process. The subordination 
of the ath order of Brownian motion gives a 
*symmetric stable process of the 2ath order. 
Let (y, (t)) and (v;(t)] be independent of {X,}. 
Then the superposition of two subordinations 
of {X,' by w,(t) and datt) coincides with the 
subordination by ('P, (V; (t, c), œ)} [30]. 

(6) Reversed processes. Let ( X,), be a 
Markov process on (Q, 8, P) and X* = X , for 
te T* - (t| -te T). Then (X*],.,. is a Markov 
process and is called a reversed process of 
(X). If the state space of S is countable, then 
the transition probability P(s, x, t, y) of ( X,) 
and P*(s, x, t, y) of ( X*) satisfy the following 
condition: P*(s, x, t, y 2 Q(—t, y) P(— t, y, 
—$,x)Q( —5, x) +, where Q(t, x)= P(X,(o) — x) 
and we assume Q(t, x) 40. 

For a given temporally homogeneous Mar- 
kov process {X,} with state space S, let ( ;! be 
the semigroup corresponding to {X,}. Then a 
ta-finite measure m on (S, B(S)) is called a 
subinvariant measure (or excessive measure) if 
the inequality fs T; f(x)m(dx) « fs f(x)m(dx) 
holds for every nonnegative function f. The 
measure m is called an invariant measure if 
the equality holds instead. Let ( X*) be an- 
other Markov process whose state space is 
S and whose semigroup is {7;*}. If for some 
o-finite measure m and for every nonnega- 
tive f and g the equality fs T; f(x)g(x)m(dx)— 
fs f(x) T,* g(x)m(dx) holds, then (Kai is 
called the dual process of ( X,). m is then a 
subinvariant measure of ( X,) and {X#}. 

The concept of reversed process is related to 
that of the dual process. For example, if m is 
an invariant probability measure of {X,} and 
if the distribution of X, is m, then the distri- 
bution of X, is also m for every t, and the 
reversed process of ( X,) coincides with the 
dual process [23]. 
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A. General Remarks 


Let (Q, B, P) be a fprobability space and T a 
time parameter set. For each te T, let %, be a 
*g-algebra such that e c %, c B (s « t). With- 
out loss of generality we can assume that the 
probability space (Q, B, P) is fcomplete and 
each %, contains all measurable subsets of Q 
with P-measure zero. A real-valued tstochastic 
process {X,},.7 on (Q, 38, P) (which is also 
denoted by (X,, te T)) is called a martingale 
with respect to %, provided that (i) X, is $,- 
measurable and E(| X,|) « oo; and (ii) if s « t, 
then 


E(X,|W)—X, (as) (1) 


where (a.s.) means talmost surely, which will be 
omitted when there is no room for confusion. 
In this case, we also say that (X,,$y,, te T)isa 
martingale. If the equality in (1) is replaced by 
the inequality < (>), {X,}er is called a super- 
martingale (submartingale). For the case of 
martingales the values of X, may be complex 
numbers. We write martingale, submartingale, 
and supermartingale as (M), (SbM), and (SpM), 
respectively, for short. If (X,, %,,f¢ T) is an 
(M) then (X,, 8, te T) is also an (M), where 
$8,— B(X, ue T, u<t). When the family of o- 
algebras involved in the definition of an (SbM) 
(Xie is not explicitly mentioned, we under- 
stand that {X,},.7 is an (M) with respect to B,. 
This convention is used for (SbM) and (SpM) 
also. The term martingale is due to J. Ville. P. 
Lévy had already made use of the concept in 
his work, but it was J. L. Doob who estab- 
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lished a systematic theory of martingales [2]. 
These concepts now provide us with not only 
basic tools but also fundamental principles in 
the theory of stochastic processes. 

The terminology “super-” and “submartin- 
gales" comes from the analogy to super- and 
subharmonic functions. As a mathematical 
model of games, however, they correspond on 
the contrary to subfair and superfair games, 
respectively. One of the fundamental mathe- 
matical principles in games was formulated 
by Doob as the following optional sampling 
theorem for martingales. To state it, we sup- 
pose for the moment that T=Z* = {0,1,2,...}; 
a similar result also holds if T=R* and X, is 
right-continuous. Let c and x be bounded ($y,)- 
tstopping times (c is bounded if me Z* exists 
such that o <m a.s.). If (X, neZ") is an (M) 
((SbM), (SpM)) with respect to (%,), then 


X,Q—E(XLS) Le, 2). 


In particular, E(X,,,.)= E(X.) (<, =). Con- 
versely, these properties characterize martin- 
gales: If X, is %,-measurable, E(| X,]) « oo and 
E(X,, Je E(X,) (<, =) for any bounded stop- 
ping times o and rz, then (X,, Y, neZ *) is an 
(M) ((SbM), (SpM)). 

The following are some basic properties of 
martingales: (1) For any given (SbM) X,, m(t) 
— E(X)) is an increasing function of t, and X, is 
an (M) if and only if m(t) 2 constant. (2) Let X, 
and Y, be (SbM). Then aX,+ bY, (a,b z 0) and 
sup(X,, Y) are (SbM). (3) Let X, be (SbM) and 
f(x) an increasing *convex function defined in 
(—90, 00). For any given toe T; if E(| f(X, )]) « 
oo, then ( f(X), te(—9o0o, t9] T) is an (SbM) 
and furthermore, when X, is an (M), (f(X,), 
te(—0o0,tg] T) is an (SbM) even if f(x) is 
not increasing. In particular, X," = sup(X,, 0) 
is an (SbM) if X, is an (SbM), and | X,| is an 
(SbM) if X, is an (M). (4) Let X, be an (SbM). 
If a, b, teT and a«t «b, then E(|X,|)< 
2E(|X,|) — E(X,). (5) If X, is an (SbM) and 
X,z 0 (te T) and te T, then the family of 
trandom variables (X,|te(—o0o, t] T] is 
uniformly integrable (see (6)). (6) If X, is an 
(SbM), t,e T and t,], then ( X, J is uniformly 
integrable if and only if lim, ,,, E(X,,) > —oo. 
Here a family of random variables {X;,},<7 is 
said to be uniformly integrable if we have 


n-o0 


lim sup | | X, (co)| dP(c») — 0, (2) 


where A, ,— (|| Nell » n). 
Example 1. For any sequence of random 
variables Y, , Y,, ..., if the relations 


RO ar LE wes Y) Z0, m-12,.; (3) 
hold, then 
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is an (SbM). If the inequality sign in (3) is 
replaced by the equality sign, then X, is an 
(M). In particular, if { Y,) is a sequence of 
independent random variables such that 
E(Y,)=0, then 


B. Martingale Inequalities and Convergence 
Theorems 


The following inequalities, which are conse- 
quences of the above optional sampling theo- 
rem, are due to Doob. Let (X;, 1 <j <n) be 

a nonnegative (SbM) and X7 —max, «;«, X;. 
Then 

AP(XF-A)&E(X, XF—-A)&E(X,) (4-0) 


From this we have that if (Xj, 1 <j<n) is an 
(M) such that E(| X, |?) « oo, then 


P(|X|t-4)&€A PE(|X, |") (p20) 
and 

P P 
eaxe<( zf: E(X,P)  (p»1) 


where |X |¥ — max, sel El If U(I) is the up- 
crossing number of an interval I= [a,b] by a 
sample sequence of an (SbM) (X;, 1 & j & n) (Le., 
the number of pairs (i,j), 1 <i <j <n, such that 
X;xa, Xjzb and a « X, «b for i<k<j), then 


l 
E(U(D) s, Ea) ] — BI, —-a)*p. 


Using these inequalities, we have the fol- 
lowing convergence theorems: (i) Let (X,, 1 < 
n< oo) be an (SbM). (a) If sup, E(X,") « oo, 
then lim, ,,, X, — X exists with probability 
1 and E(| X,,|) « oo. In particular, every non- 
positive (SbM) and nonnegative (SpM) con- 
verge to integrable random variables a.s. 

(b) Furthermore, if ( X,|1 Xn« œ} is uni- 
formly integrable, then bm, X, = X, exists 
with probability 1 by (a), and (X,]1 «nx oo) 
is also an (SbM). (c) If X, is an (SbM) such 
that (E(| X,|)? is bounded, then Im, X, 
exists, and if (X,, 1 X n < oo) is an (SbM), then 
lim, ,4, E(X,) € E(X,,), where the equality holds 
if and only if | X,| 1 «n « o0] is uniformly 
integrable. (ii) If (X,, —oo «nx —1) is an 
(SbM), then lim, ,_,, X, — X. ,, exists and 

—00 € X. « oo. Furthermore, if E(X ls 
—oo,then —oo < X ,, « oo and (X,, —oo Sn S 
—1) is a uniformly integrable (SbM). (iit) Let 
(Xi, X,, ..., Z) be an (SbM). (a) lim, ,,, X, — 
X, exists and lim, , o E(X,) < E(X,,) < E(Z). 
(b) Dm, E(X,) = E(X,,) if and only if ( X,| 

| <n< oo) is uniformly integrable, and in 
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this case (X,, X5, ..., Xv Z) is an (SbM). 

(iv) Let (&,) (700 «n« o» and ,c Eat 

B) be a sequence of c-algebras on (Q, $8, P). 
Put & a = (\a Yn and a — V, 6, (the smallest 
c-algebra containing all %,,). If Z is a random 
variable with E(|Z]) < oo, then lim, , , E(Z| 
S) = E(Z | +o) (a.s.). We mention some 
applications of these convergence theorems. 

— Example 2. Let (Q, 8, P) be a probability 
space and (z,) (n2 1,2,...) a sequence of par- 
titions of Q into B-measurable sets with posi- 
tive P-measure such that for each n, z,,, is 
finer than z,. Let z, be {M{”, M$”, ...}, denote 
the smallest c-algebra containing (M/?3,., 9, 
by %, for each n, and set a = MAR, Fora 
given tcompletely additive set function @ 

on (Q, ğa), if we define X,(w) by X,(o) — 
o(Mj?/P(Mf?) for we Mf", j — 1,2, ..., then 
(Xn n 1 &n« oo) is an (M), and lim, ,,, X, 

= X,, exists. If P is the restriction of P to §,,, 
then ¢ is tabsolutely continuous with respect 
to P if and only if {X,|1<n< oo] is uniformly 
integrable, and in this case, X,, —dqo/dP with 
P-measure 1. If ¢ is tsingular with respect to 
P, then X, 20 with P-measure 1. 

Example 3. Let X,, X,,... be any sequence 
of random variables and Z an integrable ran- 
dom variable that is measurable with respect 
to 8(X,, X,,...). Then lim, ,,, E(Z| X,, X, 

.., X,)=Z (a.s.). In particular, if X,, X,,... 
are independent and Z ts 8(X,, X,,,,...)- 
measurable for every n, then Z is equal to a 
constant a.s. This is the so-called tKolmo- 
gorov zero-one law. 

Recently, much work has been done to 
develop an analog of the classical theory of 
H?-spaces of harmonic functions in the frame- 
work of martingale theory. Let (Q, %, P) and 
Lai nez* be fixed, and let X =(X,) be a uni- 
formly integrable (M) with respect to {%,}. 
Then X, — £(X,,|%,), where X,, zs Dm... Xn 
and we can identify X =(X,,) with X,,. Set 
X* —sup,|X,| and [X, X ] = Erzo(X, Kal 
(X_, =0). By Doob’s inequality above, 


p-1 
e [X*ll XS sIX*l,  1<p<oo, 


where | |, is the usual L?(Q, P-norm, 1 < 
p<. Burkholder and Gundy, and Davis 
obtained the following inequality: There exist 
positive constants c, and C, depending only 
on p such that 


c lX, X1? Il; & ll e CX, STI, 
l<p<o. 

If we set 

H” ={X | |X |= (LX, X]'^ly«o0), pel, 


then H? is a Banach space which can be identi- 
fied (by the identification of X and X.) with 
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L? for p» 1. H' Ẹ L', however, and if we set 


BMO={X| Xl suo sup EUR, 
n 


E aail |Full « o0), 


then BMO is a Banach space which can be 
identified with HI". the dual space of HI In 
particular, Fefferman's inequality holds: 


|E(X, YIS /21XlgilYlguo, Ae), 
Ye BMO. 


This is an analog of the classical Fefferman's 
theorem, and X e BMO is an analog of a func- 
tion of bounded mean oscillation, the notion 
of which is due to John and Nirenberg. For 
details — [1,3]; for an approach using con- 
formal martingales in continuous time — [4]. 


C. Sample Functions 


Let (X,,$,, tE T) be an (SbM). Here the para- 
meter set T may be an arbitrary subset of 

(— 00, oo). However, we can always find an 
interval 7 >T and for each teI a o-algebra $5, 
and a random variable X, such that (X,, $$,, 
tel) is an (SbM) and P(X,— X) 1, S, =, 
for every te T. Therefore we can assume with- 
out loss of generality that the parameter set 
T 1s an interval. Furthermore, we assume 
that the stochastic process {X,}er is tsepa- 
rable. Using inequalities and convergence 
theorems for the sample sequences of (SbM)’s 
with discrete parameters, we obtain the follow- 
ing properties of the *sample functions of 
(SbM)’s with continuous parameters: (i) The 
sample function of an (SbM) {X,} is bounded 
on every finite interval [a, b] c T with proba- 
bility 1. (ii) Let T, be the interior of T. Then 
P(X,+o and X, , exist for all te Tj) — 1, and for 
each te To, lim, E(X) < E(X, o) S E(X) < 
E(X, als lim, E(X,). (iii) Let D be the set of 
fixed discontinuity points of {X,} (t is called a 
‘fixed discontinuity point of {X,} if P(X, az 
X,— X,,9) X 1). Then D is an at most count- 
able set. 

We assume for simplicity that the parameter 
set is R* 2 [0, 00) and the sample functions of 
{X,} are right continuous with probability 1. 
In this case, if (X,, Y, te R *) is an (SbM), 
then (X,, S, , ,teR ^) is an (SbM), where §,. = 
(,., s. Therefore we can assume that §,= 
J,- for all teR ^. Let A be an interval and 
(ol a a family of stopping times such that 
t, & ty < © whenever «< f. Put X7 =X, and 
Yx = G. Then (X7, V7, ae A) is called the 
stochastic process obtained by an optional 
sampling (or a time change) from (X,, 5, te T). 
By the optional sampling theorem of Section 
A, we can conclude that (X7, wb «e A) is also 
an (SbM) if at least one of the following con- 
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ditions is satisfied: (1) X, «O (a.s.) for all teR *; 
(2) {X,,teR*} is uniformly integrable; (3) for 
each «€ A, v, is bounded with probability 1. 


D. Decompositions of Submartingales 


[1,6] 


If an (SbM) (X,, &,, teR *) is uniformly inte- 
grable, lim, X, — X,, exists and (X,, à, tE 
IO. oo ]) is an (SbM) for which 5. = V, &,. In 
addition, when the family {X,|teT} is uni- 
formly integrable, it is said to belong to class 
(D). Here X denotes the collection of all stop- 
ping times with respect to (%,). If for each a 
(0« a « oo) the family {X,|teT and t<a} 
is uniformly integrable, the family is said to 
belong locally to class (D). 

If an (SbM) X, is uniformly integrable, X, is 
decomposed as 


X= PUE, |) - CE(X |) — X1). (4) 


and if we take appropriate tversions of con- 
ditional expectations, E(X, | %,) becomes a 
right continuous (M). In the decomposition (4), 
M, — E(X,, | &;) is an (M) and Z,= E(X.,|,) — 
X, is a potential, i.e., a nonnegative right 
continuous (SpM) with Im, Z,=0 (a.s.). The 
decomposition (4) is called the Riesz decompo- 
sition of the (SbM) X,; the names “potential” 
and “Riesz decomposition” come from tpoten- 
tial theory in view of the obvious similarity. 

A stochastic process (A, teR ^) on (Q, 8, P) 
is called a (right continuous) increasing process 
provided that (i) A, is §,-measurable, E(A,) < 
oo for each t, and (ii) with probability 1, the 
sample function is a right continuous and 
increasing function with Ao =0. If E(A,) « oo, 
where A. — lim,» A,, the stochastic process is 
said to be integrable. We have the following 
Doob-Meyer decomposition theorem: (i) A 
potential X, is decomposed as 


X,— E(A,$)— A, (5) 


by a suitably chosen integrable increasing 
process if and only if X, belongs to class (D). 
(ii) An (SbM) X, is decomposed into X,— X; + 
A,, where X; is an (M) and A, is an increas- 
ing process if and only if X, belongs locally to 
class (D). (iti) In (i) and (ii), A, can be chosen to 
be *predictable; under this condition, these 
decompositions are unique. Furthermore, A, 
can be chosen to be continuous if and only if 
X, 1s regular in the sense that for any sequence 
nET such that 7,77, Wm, E(X, na) = E(X,,,) 
for every a 0. 


E. Semimartingales [1,6,7] 

*Lévy processes and *Itó processes are natu- 
rally generalized to a class of stochastic pro- 
cesses called semimartingales. This class of 
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processes is important since it appears to be 
the most general class for which a systematic 
theory of stochastic calculus can be developed 
(— 406 Stochastic Differential Equations). 
Furthermore, we can define some quantities 
called local characteristics of semimartingales 
which are, in a sense, a generalization of Lévy- 
Khinchin characteristics for Lévy processes. In 
many interesting cases, the law of a semimar- 
tingale can be determined if we specify its local 
characteristics to be given functionals of the 
sample paths. This way of characterizing sto- 
chastic processes is known as a martingale 
problem, a concept introduced by Stroock and 
Varadhan. 

A stochastic process X,, teR ', on (Q, 8, P) 
and (%,) is called a semimartingale if it can be 
represented as 


X,=Xo+M,+V,, 


where X, is %o-measurable, M, (M, =0) is a 
right continuous local martingale with respect 
to (%,), i.e., there exists c, € X such that a, oo 
and Mix M,,, is a uniformly integrable (M) 
with respect to (&,) for each n, and V, (Vo — 0) 
is a right continuous *(%,)-adapted process 
such that te[0, T'] V, is of bounded vari- 
ation a.s. for every T 7 0. The class of semimar- 
tingales is known to be invariant under C?- 
transformations (Itó's formula), time changes, 
and absolutely continuous changes of the basic 
probability P (Girsanov's theorem). 

Example 4. Let X, —(X], X2,..., X") bea 
system of semimartingales such that all X7, 

Xi Xj — ó?t, i, jc 1,2, ..., n, are continuous 
(local) martingales with respect to (%,). Then 
X, is an n-dimensional Wiener process such 
that 8(X, — X, u,v >t) and the $$, are inde- 
pendent for every t. For this reason such an X, 
is often called a Wiener martingale. 

Example 5. Let X, be.a semimartingale 
whose sample paths are increasing step func- 
tions with jumps of size 1 a.s. If X,— ct (c >0) is 
an (M) with respect to (%,) then X, is a tPois- 
son process with the same independence prop- 
erty as Example 4. For a similar characteri- 
zation of Lévy processes in the martingale 
framework — [5]. 

The notion of semimartingales can also be 
defined for processes taking values in a dif- 
ferentiable manifold [8]. 
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A. History 


In certain quantitative studies in biology, 
particularly in epidemiology, population ecol- 
ogy, and developmental biology, there arise 
mathematically describable models. Such 
mathematical descriptions of biological phe- 
nomena have sometimes been proposed by 
mathematicians and sometimes by biologists. 
We call these descriptions mathematical mod- 
els in biology. 

The first mathematical model was proposed 
by R. Ross [1] in 1911. He undertook a theo- 
retical investigation of the propagation of 
malaria. His object was to give an analysis of 
the propagation of malaria in a certain locality 
under somewhat simplified conditions. His 
assumption was that both emigration and 
immigration were negligible and that there 
was no increase of population. In such a local- 
ity, the propagation of malaria is considered 
to be determined in general by two factors 
which evolve continuously and simultane- 
ously. On the one hand, the number of new in- 
fections depends upon the number and infec- 
tivity of the mosquitoes; on the other hand, at 
the same time the infectivity of mosquitoes is 
determined by the number of people in the 
given locality and the frequency of infection 
among them. Ross expressed the uninterrupted 
and simultaneous dependence of the first com- 
ponent on the second and that of the second 
on the first by means of a system of first-order 
differential equations. 

As a much simpler case, following P. F. 
Verhulst [2], R. Pearl and L. J. Reed [3] pro- 
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posed in 1920 a population model describing 
how a population of animals living in a fixed 
region initially increases, but after some period 
approaches a saturation point. Their model 

is the simple differential equation 

EU. 1 
d —ku)u, (1) 
where u is the population at time t and A and 
k are positive constants. The solution of this 
equation with positive initial data u is ex- 
pressed by a monotone-increasing curve that 
tends to the saturation value A/k as t tends to 
infinity. We call this equation a logistic equa- 
tion. The solution fits the growth pattern of 
some real populations of insects; but we also 
have another model, which was exploited by 
experimental ecologists who were engaged in 
the study of a kind of bean weevil. That model 
is expressed by the difference equation [4] 


WER = f(u,), Q) 


where u, is a population of the insects in the 
nth generation and f(u) is called the reproduc- 
tion function; the latter is often a simple func- 
tion, but is not necessarily monotonic. 


B. Population Model with Two Species 


V. Volterra [5] proposed the following model 
for the prey-predator relation between two 
species. We denote the prey population by 

u, and the predator population by v. These 
populations satisfy the system of equations 


Hed k 
Ce 
dr 


where A, B, k, and h are positive constants. 
The orbit of the solution of this system passing 
through (uo, vo) in the first quadrant is closed, 
enclosing the point (B/h, A/K) and staying in 
the first quadrant. An integral for this system 
is given by 


oy Apto (4) 


where C is a constant of integration. Conse- 
quently, the solution starting at a point in 
the first quadrant is periodic, with the period 
depending on the initial data. The average 
populations over one period, 


i| ET 


` ux I "aas, 9 


T 
do not depend on the initial data. 

Many mathematical models in biology are 
expressed in terms of ordinary differential 
equations [6,7]. 
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C. Fundamental Equations 


Some models are used to describe the spatial 
distributions, that is, the spatial patterns of 
populations. For example, we can consider 
equations for the pattern of a population with 
migration. 

We denote by p(t, x) the population den- 
sity at time t and position x in some region 
V in R?. v(t, x) is the velocity of the migra- 
tion. f(t, x) is a source (or supply) term for the 
population. Then we get 


d 
a oe -[ ponds+ | f dx, (6) 
dt av v 


where n is the outer unit normal to the bound- 
ary CH. Consequently, we get a partial dif- 
ferential equation 


^ 


op 

—+V-(py=f. 

a (pv) =f. 

Here we assume, for example, that v is a func- 
tion of x, p; then we get a thyperbolic equation 


ô 
ELE 
t 
Also, if we assume v 2 — d(x, p)Vp/p and f= 
f(p), then we get 


Op 


ar^ Oo pIVBHEJAB)- on. JF, (7) 


which is an equation of tparabolic type. We 
impose some boundary condition at the 
boundary OV; for example, the homogeneous 
*Neumann condition 


dp 
— —0 on av. (8) 
ôn 


In this case, if V is a convex region and d(x, p) 
=constant, then H. Matano [8] showed that 
only a constant solution of the stationary 
problem 


dAp + f(p)=0 (9) 


and (8) can be stable as a limit of the solution 
of the nonstationary problem (7) and (8) as t 
tends to oo. (He assumed that f(p) is a smooth 
function of p.) 


D. A Diffusive Prey and Predator Population 
Model 


R. May [4] and M. Mimura [10] indepen- 
dently proposed mathematical models to ex- 
plain some aspects of the population pat- 
terns of plankton in water. Their work can be 
considered to be the completion of earlier 
attempts by the ecologist J. H. Steele [9]; 
Mimura and Nishida proved that Steele's 
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original model does not yield a solution de- 
scribing the observed population patterns, and 
Mimura and May introduced corrections 
arising from their ecological viewpoint. The 
model is the system 





Op , Op apq 

—=d,—5+kp—pp?-——_., 

ôt T SE 1+bp 
Co pd 

icio dp E EE 

> ? 0x2 1+b 


on [0,1] x (0,00), (10) 


where p(t, x) and q(t, x) are the prey and pre- 
dator populations, respectively, and d, and d, 
are diffusion coefficients. k, 14 y, a, and b are 
positive constants. If b, d,, da, and u are zero, 
we get Volterra's prey and predator equation. 
Mimura considered the Cauchy problem with 
boundary conditions 


Op _oP 8 
ER = (t, 0) T. 1)=0, 


ĉq ĉq 

actus. Hrsg (11) 
In this case, under some assumptions on d,, d, 
and on k, u, y, a, Mimura [11] succeeded in 
proving some stationary patterns of p and q 
which are stable but not constant for the vari- 
able x. He used bifurcation theory to prove 
existence of a stationary solution near the 
equilibrium point. For the case in which d, is 
sufficiently small, he used the singular pertur- 
bation theory introduced by Fife [12]. 

The same procedure can be applied to the 
following system of partial differential equa- 
tions, which was proposed by Gierer and 
Meinhardt in developmental biology [14]: 


ĉa 7a cpa? 

a ai Po T ST re 

oh 07h 

47D atcp ‘a? — yh, (12) 


where a(t, x) is the concentration of a short- 
range activator and h(t, x) is that of a long- 
range inhibitor. D}, D,, po, p, c, c', D and v are 
all positive constants. Gierer and Meinhardt 
showed numerically that a system such as (12) 
exhibits some interesting spatial patterns. 
Under the boundary conditions (11), Mimura 
obtained a rigorous proof of the numerical 
results [13]. 

These models may not be realistic from the 
biological viewpoint, but they do afford some 
insight, so that biologists may now begin to 
devise more realistic ones. 

For example, the model given by (10) and 
(11) is one by means of which we can explain 
pattern formation in spatially homogenous 
environments. (We call the existence of such 
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patterns "patchiness.") The model shows that 
without any local environmental inhomogene- 
ity, we can still find some patchiness in the 
population pattern of the plankton. 

There are many models that explain bio- 
logical phenomena only metaphorically. An 
example is the use of catastrophe theory by 
R. Thom [15] to model morphogenesis. 


E. Population Genetics 


Consider the development of a genetic struc- 
ture in a population consisting of N individ- 
uals. We assume two kinds, a and A, of genes; 
and an individual has one of the genotypes aa, 
AA, aA. We assume that every generation is 
of a constant size N, and we consider the 
number of genes, not genotypes. The num- 
ber X, of a genes in the nth generation is the 
result of a stochastic process. In the simpest 
model it is a *Markov chain with the *tran- 
sition probability p;; from X,—ito X,,,—j 
being expressed by a tbinomial distribution: 


i : 


2NN . 
= i —p yy SES, 
Pij K Jet p) ub 2N 


Models that take mutation, natural selection, 
and migration into account can be given by 
appropriate changes of p;. Since the works of 
R. A. Fisher and S. Wright, natural selection 
and migration models have been the object of 
much research. A powerful method in the 
analysis of these Markov chains is the diffu- 
sion approximation [16-18]. Consider the 
time t and the ratio x — i/(2N) of a genes as 
continuous variables. Then the partial differen- 
tial equation 
2 

Zuel —y)P} -$, (enn) (13) 
is obtained for the transition probability den- 
sity p(t, x, y) of the ratio of a genes. Here, a( y) 
is a polynomial with degree <2. M. Kimura 
found a solution of (13) in terms of special 
functions and solved many problems in ab- 
sorption probability, limit distribution, speed 
of convergence, etc. Equation (13) is !Kolmo- 
gorov's forward equation in the theory of 
tdiffusion processes, but the coefficients are 
degenerate at the boundaries (0 and 1). S. 
Karlin and J. McGregor [19] found that the 
foregoing discrete models can be taken as 
tbranching processes under the condition that 
the number of individuals in each generation 
is 2N. The relation between discrete and con- 
tinuous models has been studied in general 
dimensions as a problem in the convergence of 
stochastic processes [20, 21]. 

For related topics — 40 Biometrics. 
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264 (XIX.1) 
Mathematical Programming 


A. Introduction 


Mathematical programming is a method 
useful in operations research, industrial en- 
gineering, systems engineering, etc., and it is 
concerned with optimization in general. If we 
want to minimize cost or loss or to maximize 
some effect or profit under various circum- 
stances, the corresponding mathematical prob- 
lem is usually formulated in the form of a 
mathematical-programming problem. Com- 
putational methods in mathematical program- 
ming have seen great development hand in 
hand with remarkable progress in computer 
technology, and we are now able to deal with 
large-scale problems practically. 


B. General Definitions 


A mathematical-programming problem in the 
broadest sense of the word is one of finding a 
maximum or minimum of a given function 
f: X —R (where X is a set and R is an ordered 
set). However, in a narrower sense, it usually 
means a problem where X is a closed subset of 
the n-demensional Euclidean space R" (or, 
more generally, a Banach space) and f is a 
real-valued continuous function. Often, X is 
defined as the set of points in R" that satisfy a 
system of equalities and inequalities of the 
form g,(x) x0, 20, or =0 for some given real- 
valued functions g; (i= 1, ..., m) defined on R”. 
In mathematical programming, special termi- 
nology is sometimes used; e.g., X is conven- 
tionally called the feasible region, a point of X 
a feasible solution, the solution of the problem 
itself the optimal solution, f the objective func- 
tion, and the equalities and inequalities defined 
in terms of the g; are the constraints. 
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C. Types of Mathematical-Programming 
Problems 


Mathematical-programming problems can 

be classified from various viewpoints, from 
which the problems get their various names. 
(i) Linear programming (— 255 Linear Pro- 
gramming): The objective function f and all 
the functions g; of constraints are linear. Non- 
linear programming: Some of f and the g; are 
nonlinear. Convex programming: f and all the 
g; are convex with inequalities of the form 
g,(X) <0 (and, consequently, X is a convex set); 
the problem is to minimize f (~ 88 Convex 
Analysis, 255 Linear Programming, 292 Non- 
linear Programming). (ii) Disjunctive pro- 
gramming: X is not connected. Integer pro- 
gramming: X is a subset of the lattice points of 
integer coordinates in R” (— 215 Integer Pro- 
gramming). (iti) Parametric programming: f 
and/or the g; contain parameters, and the 
problem is to analyze the behavior of the 
optimal solution and/or the feasible region 
when the parameters vary. (iv) Stochastic 
programming: The parameters in (ili) are ran- 
dom variables. (v) Multiobjective programming: 
A vector-valued function f: R"5 R* (k > 2) is 
taken as the objective function, where a certain 
order relation is defined in R* (such as the 
Cartesian product of the order in R or the 
lexicographic order based on the order in R). 
(vi) when the constraints and/or the objective 
function are endowed with special mathemat- 
ical structures, special names are accordingly 
given. For example, tnetwork flow problems, 
whose f and g; are defined with reference to a 
graph (— 186 Graph Theory), are called net- 
work programming; if f and the g; have an 
iterative or repetitive structure, the name 
multistage programming is used; dynamic 
programming (— 127 Dynamic Program- 
ming) can be regarded as a kind of multistage 
programming. 


D. Mathematical-Programming Problems of 
Special Type 


In this encyclopedia, independent articles 
appear for those types of mathematical- 
programming problems that have been math- 
ematically well investigated and are most 
frequently used in practice (— 127 Dynamic 
Programming, 215 Integer Programming, 255 
Linear Programming, 292 Nonlinear Pro- 
gramming, 349 Quadratic Programming). The 
following are some other typical problems that 
have been systematically studied. 

(i) Fractional programming: The objective 
function to be minimized within the set X 
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takes the form f(x) = C(x)/D(x) (C, D: R" o R), 
where D(x) is assumed to be positive in X. If X 
is convex, C is convex and of positive value, 
and D is concave, the function of a real para- 
meter A defined as h(4) z min, y, [C(x) — AD(x)] 
is monotone decreasing and convex in 4. The 
problem of determining h(A) for each value of 4 
is a convex-programming problem, and the 
solution of the convex-programming problem 
for A= An such that h(A,)=0 is the optimal 
solution of the original problem. In particular, 
if C and D are linear, i.e., if C(x) 2 € x +c, 
and D(x)—d'- x 4- d, and, moreover, if X = 
ixeR"| Ax x&b,.x 2 0], where AeR""", beR" 
and the inequalities are to be read component- 
wise (linear fractional programming), then by 
introducing new variables y 2 x/D(x) and yọ = 
1/D(x), we can reduce the original problem to 
the linear-programming problem min(c’y + 
CoYo| Ay — byo <0, d'- y c doyo — 1,y 20, yo 20}. 

(ii) Geometric programming: The objective 
function f(x)— go(x) and the feasible region 
X - IixeR"|gí(x) «0,i— 1,...,m x 20] are 
defined in terms of functions g,(x) of the so- 
called polynomial type: g,(x)= 2, cx; (i= 
0,1, ...,m). The dual (— 292 Nonlinear Pro- 
gramming) of a geometric-programming prob- 
lem is easier to treat than the original, because 
it is a problem of finding the minimum of a 
convex function under constraints of linear 
equalities and inequalities. 

(iti) Nonconvex quadratic programming and 
bilinear programming: The former is the prob- 
lem of minimizing an objective function of 
the form f(x) c': x - 3x'Qx (where Q isa 
symmetric matrix not necessarily nonnegative 
definite) under linear constraints (equalities 
and inequalities), and the latter is to minimize 
f(x,,x,) X; - 657 x4 - X4 Ox, over the 
region X = {(x,,x,)ER" x R"|A,x, <b,, 
A,X, € b;, x, 20, x 20j, where QER™ "ns, 
A,;€ R":*"i and b; e R" (i= 1,2). These two 
types of problems are related to each other, 
and a number of computational techniques, 
such as specially elaborated versions of the 
cutting-plane method (— 215 Integer Pro- 
gramming B), have been developed for them. 
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265 (XXI.9) 
Mathematics in the 17th 
Century 


The 17th century abounds in remarkable 
events in the history of science: the work on 
mechanics by Galileo (1564-1642); the discov- 
ery of analytic geometry by R. tDescartes 
(1596-1650); the early research in the theory 
of probability by P. de *Fermat (1601—1665) 
and B. * Pascal (1623—1662); the discovery of 
tmathematical induction by Pascal; and the 
discovery of the infinitesimal calculus (i.e., 
'differentjal calculus and "integral calculus) 
by I. "Newton (1642-1727) and G. W. tLeib- 
niz (1646-1716). Compared with these events, 
the results of mathematical research from 

the Middle Ages to the 16th century seem 
minute. These results nonetheless exist, and 
historians of mathematics are now reevaluat- 
ing them, particularly those of the 15th and 
16th centuries. 

Before Galileo, Tycho Brahe (1546-1601) 
kept precise records of astronomical observa- 
tions. J. Kepler (1571—1630), motivated by a 
mystic faith in the “harmony of the universe,” 
studied Brahe's records and discovered three 
laws on the motion of planets. He also treated 
a question of cubature in his paper on the 
form and volume of the wine barrel (1615). His 
contemporaries J. Napier (1550—1617) and 
J. Bürgi (1552-1632) discovered logarithms, 
which helped astronomers tremendously in 
their calculations. Napier used the concept of 
velocity in his introduction of logarithms; thus 
analysis began to germinate. Galileo founded 
the modern approach to the concepts of veloc- ` 
ity and acceleration in his Dialogue on two new 
sciences (1638). Using a telescope he built, he 
discovered four of Jupiter's moons and ob- 
served sunspots. His espousal of the heliocen- 
tric theory of Copernicus (1473-1543) led to 
his denunciation before the Inquisition, which 
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ordered him to refrain from holding or de- 
fending the theory. This is the most famous 
episode of his life, but his most significant 
contribution to science lies in his foundations 
of theoretical mechanics, which he freed from 
the Aristotelian tradition, thereby opening the 
way for Newton. F. Cavalieri (1598-1647), a 
disciple of Galileo, applied the notion of indivi- 
sibilis (originating in scholastic philosophy) to 
questions of quadrature in his Geometry of the 
indivisible (1635). This idea influenced Pascal 
and J. Wallis (1616—1703). 

Descartes established the method of analytic 
geometry in his Geometry (1637), published as 
an appendix to his Discourse on method. The 
use of tcoordinates can be traced back to 
Apollonius of Perga; Fermat used them occa- 
sionally, but Descartes made the first clear 
formulation of the method of representing 
general figures by means of equations, an 
essential step beyond Greek geometry. He also 
surpassed F. tViéte (1540—1603) by abolishing 
the restriction that quantities represented by 
letters should be of one dimension. 

Contemporary to Descartes, Fermat made 
remarkable contributions to ‘number the- 
ory and Pascal to tprojective geometry, and 
through their correspondence there began the 
theory of probability; both also made precur- 
sory contributions to analysis. Fermat treated 
questions on maxima and minima of functions 
and tangents of curves; Pascal answered some 
questions on tangents, centers of gravity, 
quadrature, and cubature concerning tcy- 
cloids. Pascal also contributed to hydrostatics, 
made positive use of the idea of the point of 
infinity in projective geometry, and clearly 
formulated the principle of mathematical 
induction in his theory of arithmetic triangles, 
the so-called *Pascal's triangles. (Freudenthal 
[4] established that the first discovery of the 
principle of mathematical induction is due to 
Pascal; the exact date of the discovery was 
studied by Hara [5].) i 

In England, Wallis and I. Barrow (1630— 
1677) preceded Newton. Wallis solved ques- 
tions concerning quadrature and cubature 
(by bold use of the methods established by 
Cavalieri), infinite series, and interpolation. 
Barrow was Newton's teacher. He came close 
to the fundamental theorem of calculus, and 
Newton certainly owed some ideas which led 
to his discovery to Barrow's suggestions. New- 
ton completed his method of fluxions, corre- 
sponding to our differential calculus, toward 
1669—1671, but his paper on this method was 
published only after his death (1736). In his 
main work, Principia mathematica philosophiae 
naturalis (1687), he used this method and its 
converse, without naming them, to solve the 
ttwo-body problem. The work begins with 





265 Ref. 
Mathematics in the 17th Century 


three laws of mechanics and covers the motion 
of the moon and hydromechanics. Leibniz 
discovered infinitesimal calculus slightly later 
than Newton but independently. He invented 
convenient new symbols that gave great im- 
petus to the development of calculus: the sym- 
bols dx and f are due to him. Leibniz was in 
Paris in 1672-1677, where he made the ac- 
quaintance of Father Arnauld (1612-1694) of 
Port Royal (the monastery to which Pascal 
belonged) and the Dutch physicist C. Huygens 
(1629—1695). Through their suggestions, he 
studied the work of Descartes and Pascal. 
Leibniz's first papers on calculus were pub- 
lished in 1684 in the scientific journal Acta 
Eruditorum, which he also edited. The methods 
of calculus he initiated were transmitted to 
mathematicians of the fBernoulli family and 
then to L. *Euler, who developed them into the 
wide field of analysis. 

Thus the mathematics of the 17th century 
went clearly beyond Greek mathematics. 
The importance of numbers over diagrams 
was recognized, and mathematicians were no 
longer hesitant to use infinity. Moreover, 
people became aware of the importance of 
experimental methods in science. The position 
of mathematics as an important method of 
natural science was established; mathematics 
became a rational basis of scientific research. 

It was also in this century that a peculiar 
kind of mathematics was developed in Japan 
by T. Seki (1642?—1708). However, it lacked 
the Greek tradition of viewing logical founda- 
tions as being important, and furthermore 
had no connection with natural sciences; con- 
sequently, it did not see subsequent develop- 
ment comparable to that of Western mathe- 
matics (— 230 Japanese Mathematics 
(Wasan)). 
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266 (XXI.10) 
Mathematics in the 18th 
Century 


During the Age of Enlightenment, mathe- 
matical analysis developed steadily after its 
initiation in the preceding century. It found 
numerous applications in theoretical physics 
and contributed to the growth of rationalistic 
thought. 

The central figures in mathematics during 
the late 17th and early 18th centuries were I. 
*Newton (1642-1727) and G. W. tLeibniz 
(1646-1716). C. Maclaurin (1698—1746) of 
Scotland followed Newton, but no mathema- 
tician of Newton's stature appeared in Great 
Britain. An unfortunate dispute over the prior- 
ity of discovery of infinitesimal calculus arose 
between Newton and Leibniz, after which their 
followers came into conflict. This prevented 
the members of the English school from giving 
up their inconvenient notation system, which 
hindered their progress in calculus. 

On the Continent, Leibniz was succeeded by 
the mathematicians of the * Bernoulli family 
and by L. tEuler (1707-1783), who brought 
about brilliant developments in calculus and 
its applications. They solved various kinds of 
*differential equations and invented the tcal- 
culus of variations. F. tViéte used the term 
analysis in the sense of algebra as a heuristic 
method; the same term meant algebraic treat- 
ment of infinite series in Newton's usage. It 
was during this century that analysis secured a 
position as a branch of mathematics indepen- 
dent of algebra and geometry. *Analytical 
dynamics, initiated by Euler, was further de- 
veloped by J. L. tLagrange (1736-1813) and 
P. S. de tLaplace (1749-1827). Laplace, in 
systematizing tcelestial mechanics and the 
theory of probability, showed what a power- 
ful instrument analysis was. A. M. Legendre 
(1752-1833) investigated felliptic integrals and 
opened the way for C. F. Gauss and other 
mathematicians of the next century. 

The growth of the Ecole Polytechnique, 
established during the time of the French 
Revolution, contributed to the brilliant pro- 
gress of French mathematics. Lagrange, La- 
place, and Legendre were all active in Paris 
during this period, as were S. D. Poisson 
(1781—1840) and J. B. J. *Fourier (1768—1830), 
both of whom made major contributions to 
analysis, and G. Monge (1746-1818), L. Car- 
not (1753-1823), and J. V. Poncelet (1788— 
1867). A problem proposed by Fourier in 
his theory of heat propagation gave rise to 
an important question of analysis, one that 
later formed the basis of *harmonic analysis. 
Fourier and Poisson aimed at clarifying the 
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laws of nature, while Monge, Carnot, and 
Poncelet developed tprojective geometry and 
descriptive geometry for their purely geo- 
metric interest. Monge also did precursory 
work on tdifferential geometry. 

The mathematics of this century left many 
remarkable results in geometry and analysis 
and their applications; however, it inherited 
its methods from the preceding century and 
lacked critical spirit. Mathematicians were 
more interested in obtaining new results than 
in reflecting upon the rigor of their methods. 
Reexamination and reestablishment of the 
foundations of mathematics were left to the 
next century. 
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Century 


The 19th century was a critical period in the 
history of mathematics. When the century 
began, the memory of the French Revolution 
was still fresh, and World War I followed 
closely upon the turn of the 20th century. 
During this time mathematics made enormous 
progress and left a tremendous legacy to the 
present century. Increased personal liberty 
released people from traditions and allowed 
culture to spread to wider classes of society, 
producing a greater reservoir of talent. Re- 
search activities were intensified in the univer- 
sities, and many specialists collaborated or 
competed with each other. The century can be 
divided into three periods: the first 20 years, 
during which many new fields of mathematics 
arose; the next 30 years, a period of further 
development; and the latter half of the century, 
when these fields attained maturity. 

In 1801, Disquisitiones arithmeticae by the 
young C. F. *Gauss (1777—1855) appeared. 
It contained a systematized theory of num- 
bers, ushering in a new era of mathematics. 
In France, many mathematicians studied at 
the Ecole Polytechnique, established during 
the French Revolution. Among them, A. L. 
*Cauchy (1789-1857) was one of the most 
prominent. He gave the exact definitions of 
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limit and convergence, thus giving solid foun- 
dations to calculus some 150 years after its 
discovery. N. H. *Abel (1802-1829) and C. G. 
J. tJacobi (1804—1851) studied felliptic func- 
tions during the same time, producing results 
sensational to their contemporaries. Gauss 
gave a rigorous proof of the existence of roots 
of algebraic equations in the field of complex 
numbers; Abel proved the algebraic nonsolva- 
bility of algebraic equations of degree > 5; and 
E. *Galois (1811—1832) created his theory of 
algebraic equations, which began a new phase 
in algebra. J. V. Poncelet (1788—1867), another 
graduate of the Ecole Polytechnique, devel- 
oped tprojective geometry along the lines 
pursued by G. Monge (1746-1818). His re- 
search was continued in Germany by A. F. 
Mobius (1790—1868), J. Steiner (1796—1863), 
and J. Plücker (1801—1868). Steiner investi- 
gated, in particular, talgebraic curves and 
surfaces by synthetic methods; Plücker intro- 
duced projective coordinates, thus enlarging 
the usage of analytic methods in geometry. 
These brilliant results in the first period of 
the 19th century were all achieved by young 
mathematicians, most of them still in their 
twenties. 

The new geometry was developed in the 
period 1830—1840 by K. G. C. von Staudt 
(1798—1867) in Germany and M. Chasles 
(1793—1880) in France. In the 1840s, the the- 
ory of tinvariants was taken up in connection 
with geometry; outstanding in this domain 
were the English mathematicians A. Cayley 
(1821—1895) and J. J. Sylvester (1814—1897). 
P. G. L. *Dirichlet (1805-1859) endeavored to 
simplify Gauss's number theory and intro- 
duced the Dirichlet series in his computation 
of tclass numbers of binary tquadratic forms. 
He also initiated the theory of *trigonometric 
series by giving a rigorous proof of a theorem 
on expansion in tFourier series, introduced 
by J. B. J. *Fourier (1768—1830) in his theory 
of heat propagation. Another notable event 
was the independent and almost simultaneous 
discovery of tnon-Euclidean geometry by J. 
Bolyai (1802—1860) and N. I. Lobachevskii 
(1793-1856), which aroused philosophical 
interest since it changed the character of 
axioms. The invention of fquaternions by 
WR Hamilton (1805-1865), publication of 
Ausdehungslehre (theory of extensions) by H. 
G. Grassmann (1809-1877), and development 
of the algebra of logic by G. Boole (1815— 
1864) also occurred during this period, but 
these notions received neither full comprehen- 
sion nor sympathy until later. 

During the latter half of the century, G. 

F. B. * Riemann (1826-1866) and K. T. W. 
*Weierstrass (1815-1897) were prominent. 
Both have had great influence on the mathe- 
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matics of the 20th century, the former by his 
brilliant and abundant production, the latter 
by his mature and critical spirit. Riemann lived 
only 40 years, and published, in rapid suc- 
cession, his epoch-making ideas on the theory 
of functions of a complex variable, *Abelian 
functions, trigonometric series, ffoundations of 
geometry, ‘distribution of primes, and ‘zeta 
functions. Weierstrass was already 49 when, 
after teaching in a country gymnasium (second- 
ary or college preparatory school), he became 
a professor at the University of Berlin. The 
theory of the functions of a complex variable, 
initiated by Cauchy in the 1820s, had to wait 
for the contribution of these men to attain 
completion in the form of the theory of *ellip- 
tic functions. Riemann's influence is also con- 
siderable in algebraic geometry and the theory 
of ‘differential equations. Weierstrass reformed 
the tcalculus of variations. His critical ap- 
proach uncovered pathological functions, such 
as continuous nowhere differentiable functions 
(— 106 Differential Calculus) and Peano space- 
filling curves (— 93 Curves J), and to real 
analysis, based largely on the set theory of G. 
*Cantor (1848-1918). 

Concerning the foundations of mathemat- 
ics, Cantor, M. C. Méray, and J. W. R. Dede- 
kind (1831—1916) established the theory of 
irrational numbers. Dedekind and G. Peano 
(1858—1932) developed the theory of natural 
numbers; their results brought about the 
“arithmetization” of mathematics and led to 
the research in the foundations of mathematics 
of the present century. 

Cayley and F. fKlein (1849-1925) inter- 
preted non-Euclidean geometry by means of 
metrics introduced in projective geometry. 
Toward the end of the century, D. "Hilbert 
(18621943) examined the roles of axioms of 
congruence, continuity, and parallelism in 
Euclidean geometry, thus initiating the study 
of taxiom systems in general. 

The theory of tgroups, in particular finite 
groups, was developed around 1870 by C. 
Jordan (1838—1922), G. Frobenius (1849— 
1917), and W. S. Burnside (1852-1927). M. S. 
*Lie (1842-1899) applied infinitesimal trans- 
formations to differential equations, and Klein 
applied the groups of linear transformations 
to geometry. The discovery of tautomorphic 
functions by Klein and H. *Poincare (1854— 
1912) was another brilliant application of the 
theory of groups. In the algebraic theory of 
numbers, originated by Gauss, E. E. 'Kummer 
(1810—1893) developed the idea of *ideal num- 
bers" (— 14 Algebraic Number Fields); Dede- 
kind then established the theory of *ideals. L. 
*Kronecker (1823—1891), an admirer of Abel’s 
work, studied algebraic equations and dis- 
covered that every *Abelian extension of the 
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rational number field is contained in a tcy- 
clotomic field; he believed that relations of a 
similar kind would hold between *modular 
equations of elliptic functions with fcomplex 
multiplications and Abelian extensions of 
"imaginary quadratic fields, and enunciated a 
famous conjecture known as his “dream in his 
youth.” 

Finally, we note the appearance of another 
important mathematician, S. V. Kovalevskaya 
(1850—1891). After studying with Weierstrass, 
in 1884 she was invited by G. M. Mittag- 
Leffler (1846-1927) to teach at the University 
of Stockholm, where she remained until her 
death. 

Toward the end of the 19th century, the 
subjects of mathematical research became 
highly diversified. Branches were further rami- 
fied into more specialized branches, while 
unexpected relations were found between 
previously unconnected fields. The situation 
became so complicated that it was difficult to 
view mathematics as a whole. It was under 
these circumstances that in 1898, at the sugges- 
tion of F. Meyer and under the sponsorship 
of the Academies of Gottingen, Berlin, and 
Vienna, a project was initiated to compile an 
encyclopedia of the mathematical sciences. 
The Enzyklopádie der mathematischen Wissen- 
schaften was completed in 20 years; it provides 
a useful overview of the mathematics of the 
19th century. 

Toward the end of the century, the Interna- 
tional Congress of Mathematicians (ICM) 
was established to foster communication 
among mathematicians from all parts of 
the world. Before World War I broke out, 
the ICM met in Zürich (1896), Paris (1900), 
Heidelberg (1904), Rome (1908), and Cam- 
bridge, Mass. (1912). During this period, mathe- 
matical societies were formed in many coun- 
tries, e.g., the London Mathematical Society 
(1865), the Société Mathématique de France 
(1872), the American Mathematical Society 
(1888), the Deutsche Mathematiker Vereini- 
gung (1907), and the Mathematical Society of 
Tokyo (1877), which later became the Physico- 
Mathematical Society of Japan and then was 
split in two in 1946. The present Mathematical 
Society of Japan evolved from this division. 

Five years after the 1872 reform of the Japa- 
nese educational system, the University of 
Tokyo was established, and D. Kikuchi (1855— 
1917) and R. Fujisawa (1861—1933) taught at 
the Department of Mathematics during its 
early ears. Under their influence, Japanese 
research in European-style mathematics (based 
on Greek traditions) began. The Universities 
of Kyoto and Tóhoku were established in 1897 
and 1911, respectively. From the beginning of 
the 20th century, original results were ob- 
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tained and published in the Proceedings of the 
Physico-Mathematical Society of Japan and in 
the journals of the faculties of science of these 
universities. In 1911, the Tóhoku Mathematical 
Journal was founded by T. Hayashi (1873— 
1935). In 1920, a paper on *class field theory 

by T. tTakagi (1875—1960) was published in the 
Journal of the College of Science of the Univer- 
sity of Tokyo. Thus the position of Japanese 
mathematics gradually came to be established. 
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Mathieu Functions 


A. Mathieu's Differential Equation 


The 2-dimensional tHelmholtz equation 
(A+k?)¥ =0 (A x 0?/0x? + 0?/0y?), sepa- 
rated in elliptic coordinates £, y given by x= 
ccoshécosy, y=csinh Zsin 5, has a solution 
of the form Y = X (C) Y (y) whose factors X (č), 
Vin) satisfy 


d?u/dz* -- (a — 24 cos2z)u — 0, (1) 
d^u/dz? — (a — 2q cosh 2z)u =0, (2) 


respectively, where a is an arbitrary constant 
and q = k?c?/4. By the substitution z + iz, (1) 
becomes (2). (1) and its solutions are known 

as Mathieu’s differential equation and the 
Mathieu functions, and (2) and its solutions are 
called the modified Mathieu differential equa- 
tion and the modified Mathieu functions. 


B. Hill's Differential Equation 


Hill's differential equation is a linear ordinary 
differential equation of the second order: 


d?u/dx? + F(x)u ^ 0, (3) 


with F(x -- 21)— F(x). It is named after G. W. 
Hill, who investigated it in his study of lunar 
motion. This equation includes Mathieu's 

differential equation and tLamé's differential 
equation as particular cases, and by suitable 
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transformations, tLegendre’s differential equa- 
tion and the fconfluent hypergeometric dif- 
ferential equations as well. 

While F(x) is periodic, solutions of (3) are 
not necessarily so. There always exists, how- 
ever, a particular solution that is quasiperiodic 
in the sense that 


u(x4272)-—ou(x) o=constant (4) 


(Floquet's theorem). That is, the differential 
equation (3) has a solution of the form 


u(x) =e" p(x), (5) 


where o(x + 27)= o(x) and u (defined by o = 
e?"^) is called the characteristic exponent. 

Being a particular case of Hill's equation, (1) 
has a general solution of the form 


u(z) — Ae" p(z) + Be "^ (— z), (6) 


where q(z 4- x) 2 o(z). For those values of a 
(called eigenvalues) that make the character- 
istic exponent u equal to 0 or i, u(z) has period 
1 or 2x and is called a Mathieu function of the 
first kind, also called an elliptic cylinder func- 
tion when employed in problems of diffraction 
by an elliptic cylinder. Sometimes it is called 
simply the Mathieu function, and other solu- 
tions of (1) are referred to as general Mathieu 
functions. The formula 


F(x)= > a,e"* (7) 
suggests, in conjunction with Floquet's theo- 
rem, a solution of (3) in the form 

u-e" Y be" (8) 
Substituting (7) and (8) into (3) and comparing 
coefficients of e“*”*, we have infinitely many 
linear equations 


(u+in’b,+ Y a,b, ,,—0, 


n=...,—2, -1,0,1,2,.... (9) 


By eliminating the b, in (9), we also obtain an 
infinite determinantal equation called Hill’s 
determinantal equation, 


A(u)=|B,s| = 9, (10) 


where the elements B,, of Hill’s determinant 
A(g) are such that 


1 if r=s, 
B,,= d 


7 =o a ; 
(urita " "7° 


Here an infinite determinant D =| B,,,,|(m,n= 
—00, ..., 00) is defined as the limit, if it exists, 
of D,, = det(B;,) (i j= —m, ..., m) as moo. The 
formula (10) can be reduced to a simpler form 


sin? riu = A(0)sin? x, / ag. (11) 
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This determines a characteristic exponent p, 
which in turn determines the b, in (9) and a 
solution (8). This procedure is called Hill's 
method of solution. 

Applying Hill's method of solution to (1), we 
have an equation for the characteristic expo- 
nent UH 


sin?(n/2)iu = A(0)sin? (1/2). /a, (12) 


where the infinite determinant A(0)=|B,,,,| has 
elements such that 





1 if m-n 
= z if n=m+1 
B mn = a— 4m 
0 otherwise 
(m,n=..., —1,0,1,...). 
When 40, we have 





A(0)=1 +t mt n/a 4 O(q*). 


(13) 


Thus if 4 —0, we have A(0) — 1, and a dn, 
(2n + 1)? correspond to us. i in (12). 


C. Mathieu Functions of the First Kind 


Mathieu functions of the first kind are further 
classified into the following four types: 


Ces, q) 3. AP? cos2rz, (azn), (14.1) 
Se3,41(2, 4) A. BEP sin(2r + 1)z, 

(bo, 4), (14.2) 
Cean (2,4) A. AST cos(2r + 1)z, 

(024.1). (14.3) 
36344302, 4) = A. BE? sinQr + 2)z, 

(b...) (14.4) 


(n, r «0, 1,2,...), where the appended terms in 
parentheses are eigenvalues, ordered by a,, « 
Dont1 <Gon+1 € ban+2 for a given q, and increas- 
ing with n. Each of these series converges ab- 
solutely and uniformly for all finite z and has 
n zeros in 0 <z « z/2. In addition, orthonormal- 
ity relations 


2n 
| ce, (x)se,(x)dx — 0, 


0 


2n 2n 
| cece dn | se, (x)se,(x) dx 


0 0 


— nó 


mn 


hold. When q>0, ceo(z)1/4/2, ce, (z) ^ 
cos mz, and se, (z)sin mz. 

For small q we assume that the quantities 
involved have power series expansions in q, 
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e.g., 
a=m +q +B ^ ..., 
ce,(z) = cosmz + gF,(z)+q7F,(z)+..., 


and we substitute them in (1) to determine 
successively a, $, ..., F(z), F(z), ... (Mathieu’s 
method). For larger q, (14) is substituted in (1) 
to give recurrence formulas for the coefficients, 
e.g., for ce,,(z), 


29 AQ" --(4 a) A?" + qA" —0, 
EI ET EE 
(15) 


After we eliminate the A9", the formulas lead 
to an equation for the eigenvalues a;,: 


a — 0 0 0 0 
—2q a—4 —q 0 0 
0 —q a-16 -q 0 cu 9 


or, equivalently, to a tcontinued fraction 


2? up q? 


a= SH 
à—4—a—16—a—36— 





(17) 


Given q, we can find the a,, from (17) and 
determine the AY” for each a,, from (15) 
(Ince-Goldstein method). 


D. Mathieu Functions of the Second Kind and 
Modified Mathieu Functions 


There exists only one (half-)periodic solution of 
(1) corresponding to cach (half-)periodic eigen- 
value (— Section E). Therefore other solu- 
tions corresponding to the same a,, or b,, and 
independent of ce,,(z, q) or se, (z, q) are non- 
periodic. They are called the Mathieu functions 
of the second kind and are denoted by fe,,(z, q) 
or ge,(z, q). 

By the substitution z iz in (14) we obtain 
formulas for the modified Mathieu functions of 
the first kind, 


Ce; q) — ce, (iz, q), 
Se, (z, q)= — ise, (iz, q). (18) 


When q—0, then Ceo(z)2 1/,/2, Ce, (z)9 
cosh mz, Se, (z) sinh mz. Similarly, by the 
substitution z-»iz we obtain modified Mathieu 
functions of the second kind from Mathieu 
functions of the second kind. In addition, we 
introduce modified Mathieu functions of the 
third kind as those linear combinations of 
modified Mathieu functions of the first and 
second kinds that have the asymptotic form 

e iv !? (y2q!" e^) as zo oo. In addition to 
the Fourier expansion (7), expansion of the 
Mathieu functions in terms of tBessel functions 
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Ce5,(z, q) A. A;, cosh 2rz (19.1) 
— (Ao)! ce;,(n/2, q) 
xY(—-lyA;,J,(Qhcoshz) (19.2) 
—(Ao) ' ce;,(0, q) 
x Y, A5,J;, 2 hsinh 2) (19.3) 
—(A9) *cea,(0, q)ce;,(n/2, q) 
x Y (—1) A;,J, (he *)J,(he*). 
(19.4) 


These series converge absolutely and uni- 
formly for all finite z. Replacing the J on the 
right-hand sides of (19) by N,,(2hcosh z), 
N,,(2hsinh z), J,(he 7?) N,(he*?), respectively, in 
an obvious way we obtain infinite series for a 
function that again satisfies (2) and is denoted 
by Fey,,(z,q). In a similar manner other modi- 
fied Mathieu functions of the second kind 
Geyasa (2, d), FVons1(Z, 4); G2V2n+2(Z, 4) can be 
obtained. These are more convenient for prac- 
tical applications than fe,,(iz,q) and ge,,(iz, q) 
since they converge more rapidly. 

The equations 


d?u/dz* +(a+ 24 cos2z)u — 0, (20) 
d^u/dz? — (a 4- 2qcosh2z)u 20 (21) 


obtained from (1) and (2) by the substitution 
q— — q are the results of separating (A — k?) 
=Q. In general, if f(z, q) is a solution of (1), 
then f(n/2 — z, q) is a solution of (20). Thus the 
formulas 


ces, (z, —q) =(—1)" ces, (11/2 — z, q), (22.1) 
CO 4, —d) -(-D"sesu(n/2—2,4, — 222) 
S€an+1(2, —d) -(—D'ces,41(0/2—2,4, (22.3) 
Se34 «52, —q)— (7D"sez,,5(0/2—2,4) — 24) 


can be adopted as definitions of ce,,(z, q), 
se, (z, q) for q « 0 (Ince's definition). Accord- 
ingly, an expansion of Ce holds in terms of 
modified Bessel functions 1, in place of the J,, 
in (19), which in turn becomes a solution of 
(21) if we replace I,, by (—1)"K, /n: 


Fek,,(z, — q)- (71 (nAo) ! ces,(n/2, al 
x Y A,K>,(2hsinh z). (23) 


In a similar manner, Fek,,,,(z, — q), 
Gek;,.4 (2, — 4), Gek;,,5(z, — q) can be defined. 
They decrease exponentially as z o oo and 
hence are precisely the modified Mathieu 
functions of the third kind. 


E. Stability 


Let u, (x), u;(x) be a fundamental system of 
solutions of (3) such that u,(0)= 1, wu (0) - 0; 
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u5(0) «0, u5(0) « 1; then o, u in (4), (5) are given 
by 


o=", 2nu=arccosh A, 


2A =u, (2n) - uS (2x). (24) 


There are two values of u satisfying (24); let 
them be u, and 4, (H3 — — 41). If F(x) is a real 
function, then u,(x), u;(x) are also real func- 
tions and A is a real number. It can be seen 
from (24) that A < —1,1<A<0,0<A<1l, 

1 < A correspond to 0-274 — arccosh |A|4- zi, 
i(arccos|A|+ 72), i arccos A, arccosh A, respec- 
tively. Consequently, when |A| « 1, the general 
solution of (3) neither diverges nor vanishes as 
x tends to infinity. Such solutions are called 
stable solutions of Hill's equation, or some- 
times Hill's functions. When | A|> 1, either e^:* 
or ez? tends to infinity with x. Such solutions 
are called unstable solutions. If A — 1 (—1), 

we have u=0 (i/2) and a solution of the form 
of u= p(x) (e? o(x)) with the period 27 (42), 
called a periodic (half-periodic) solution. 

When we apply the Mathieu functions to 
physical and engineering sciences, such as the 
theory of oscillation and quantum mechanics, 
it is convenient to modify (3) in the form 


d^u/dx? +(A+y@(x))u=0 (25) 


involving parameters 4, y. Then with y fixed, 
there exist countably many values of 4 (called 
eigenvalues) corresponding to periodic or half- 
periodic solutions of (25). Let them be A) € 
Je Or 44 <A, € ..., respectively; then we 
have 


o <å | Ae SAY RAG ose Aa o S 9; 
< AE XA <5: 


The values of 4 in the open intervals (A,,_5, 
Aal and (A5,, dal correspond to stable 
solutions, while the 7's in other intervals 
correspond to unstable solutions (Haupt's 
theorems). 

When both 4 and y vary, the Ay-plane is 
divided into regions corresponding to stable or 
unstable solutions according as the character- 
istic exponent y is purely imaginary or not. 
For example, when (x) 22cosx in equation 
(25), we obtain the Mathieu equation. In this 
case the Ay-plane is divided as shown in Fig. 1, 
where shaded (unshaded) regions correspond 
to stable (unstable) solutions and boundary 
curves give eigenvalues corresponding to peri- 
odic or half-periodic solutions. 
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A. General Remarks 


Let K be a tring or a tfield. As examples of 
such K, we may take the real number field R 
and the complex number field C. By a matrix 
with elements in K, we mean an array of mn 
elements aj, (i— 1, ...,m; k- 1, ..., n) of K ar- 
ranged in a rectangular form: 


Aii Qiz 7 Qin 
a21 22 `“ 05, 
, Ami Am2 `U Am 


The element aj, is called the (i, k)-element (entry 
or component). (Sometimes, instead of using 
parentheses, we use | | or[ J) More pre- 
cisely, this matrix is said to be an m by n ma- 
trix (m x n matrix or matrix of (m, n)-type). In 
particular, an n x n matrix is called a square 
matrix of degree (or order) n, while a matrix in 
general is sometimes called a rectangular ma- 
trix. Each horizontal n-tuple in an m x n ma- 
trix is called a row of the matrix, and each 
vertical m-tuple is called a column of the 
matrix. We often abbreviate the notation for 
the matrix given previously by writing (aip) or 
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simply A. A square matrix is called a diagonal 
matrix if all its components are zero except 
possibly for diagonal components a; that is, 
if aj, — 0 for iz k. If the components aj; of a 
diagonal matrix are all equal, it is called a 
scalar matrix. An n x n matrix whose (i, k)- 
element is equal to 6,, is called the unit matrix 
(or identity matrix) of degree n, where ô; is the 
Kronecker delta, which is defined by A, — 1 and 
ój —0 for i#j. We denote this matrix by I,, or 
simply by Z if there is no need to specify n. 

A 1 xn matrix (a,,a;, ...,a,) is called a 
row vector of dimension n, and an m x 1 matrix 


Lët, 
is called a column vector of dimension m. The 
m rows and n columns of an m x n matrix A 
are called row vectors and column vectors of 
A. 


B. Operations on Matrices 


Two matrices A —(a;,) and B — (b) are said to 
be equal if and only if they are of the same 
type and a, — bj (i—1,...,m; k- 1, ..., n). If 
both A and B are m x n matrices, we define the 
sum of A and B by A+ B — (aj, + bal, The prod- 
uct of two matrices A and B is defined by AB 
= (ca), Co = Aaron, provided that the number 
of columns of A is equal to the number of rows 
of B. We further define the (left and right) 
multiplication of a matrix A by an element a of 
K by aA —(aaj,) and Aa=(a;,a). The set of all 
matrices over K of the same type forms a tK- 
module. The multiplication of matrices satis- 
fies the associative law and the left and right 
distributive laws with respect to addition. 
Thus the set of all n x n matrices over K forms 
a ring, which is called the total matrix algebra 
(or full matrix algebra) of degree n over K; it is 
usually denoted by M,(K) or K,. If K has the 
tunity element 1, then J, is the unity element of 
M,(K). The matrix whose components are all 0 
is called the zero matrix and is denoted by the 
same symbol 0. Suppose that K has the unity 
element 1. Let Ej, be the matrix whose (i, k)- 
element is 1 and whose other elements are all 
0. Then every matrix A =(a;,) in M,(K) can be 
expressed uniquely as A= X aj Ej, a linear 
combination of E. The matrix E; is called a 
matrix unit. We have E,,E,,=0 if j Z k, Ey, Ey, 
= Ej, and aE, — Eja for all ae K. 

Let A be a square matrix in K. If there exists 
a matrix A ! such that 44 ! 24 ! A—I, then 
A ! is called the inverse matrix of A, and A is 
called a regular matrix (nonsingular matrix or 
invertible matrix). The inverse A 1 is unique if 
it exists. When K is commutative, A is regular 
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if and only if its *determinant |A| is a tregular 
element of K; in particular, when K is a field, 
A is regular if and only if | 4| 40. A square 
matrix that is not regular is called a singular 
matrix. 

Let A — (aj) be an m x n matrix. Then the 
n x m matrix (by) such that b — a,; for all i and 
k 1s called the transposed matrix of A, and is 
usually denoted by '4. Transposing a matrix 
amounts to changing rows into columns and 
vice versa. When K is commutative, AB =C 
implies IC ='B'4. A square matrix such that 
‘A =A is called a symmetric matrix, and one 
such that 'A = —A is called an alternating 
(skew-symmetric or antisymmetric) matrix. 
A square matrix A =(a,,) is called an upper 
(lower) triangular matrix if a;, =0 for ik 


(i « k). 


C. The Kronecker Product of Matrices 


We assume that the ring K is commutative. 
Let A bean m xn matrix (a4), let B be an r x s 
matrix (bj) in K, and write c; „= aj b; by 
means of indexes å — (i, j) and u - (k, I). The 
Kronecker product of A and B, usually denoted 
by A ® B, is defined as the mr x ns matrix C = 
(c,,,). By an appropriate choice of A and p, it 
can be expressed as 


oun H aB a1,B 

gouf aB gef 

am B dy; B Sak am B 
or 

b,,A b,,A ACE b,,A 

ba, A b,,A Se: b,,A 

b, A b, A RAT b,,A 


We have the formulas 
A®(B,+B,)=A®B,+A@B,, 

(A, -A 0 B=A, @B+A,@B, 

c(A & B)=(cA) @ B=A @ (cB), 

(4, & B,)(4; & By) - (4, Az) © (B, B2), 


provided that the sums and products can be 
defined. 

Matrices correspond to tlinear mappings of 
free K-modules (— Section L). The Kronecker 
product corresponds to the ttensor product of 
the corresponding linear mappings. 


D. The Rank of a Matrix 


Let A bean m x n matrix (a) in a field K. If 
there exists a nonzero tminor of A of degree r, 
and if all minors of degree >r+ 1 are equal to 
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0, then the number r is called the rank of A. 
Denoting the rank of a matrix A by p(A), we 
have p(PAQ)< p(A) for any matrices P, Q for 
which the product PAQ exists; the equality 
holds if P and Q are regular (square) matrices. 
The rank r of A is equal to the maximum 
number of tlinearly independent row vectors of 
A, and it is also equal to the maximum num- 
ber of linearly independent column vectors of 
A. The number n — p(A) is called the (column) 
nullity of the matrix A. It is equal to the di- 
mension of the linear space consisting of solu- 
tions of homogeneous linear equations Ax — 
0, that is, the number of fundamental solu- 
tions of these equations. (The row nullity m — 
p(A) of A is equal to the dimension of the 
linear space consisting of solutions of x4 =0.) 
Even if K is not commutative, we can define 
the rank of a matrix A as the maximum num- 
ber of left (right) linearly independent row 
(column) vectors of A (— 256 Linear Spaces F). 


E. Elementary Divisors 


Let o be a *principal ideal domain (e.g., the 
ring Z of rational integers, or the *polynomial 
ring K[x] over a field K), and let A be a ma- 
trix with elements in o. Then, multiplying ap- 
propriate invertible matrices from the left and 
right, we can transform A into a diagonal 
matrix of the form 


ey 0 
ez 


0 0 
where r denotes the rank of A, each e; 0, and 
each e;,, is divisible by e;. These e,, e,, ...,e, 
are called the elementary divisors of A and are 
uniquely determined up to unit factors. This 
fact can be generalized to some extent for the 
case where o is noncommutative (— 256 
Linear Spaces P). Using the concept of deter- 
minant, the elementary divisors can also be 
defined as follows. Let d, denote the greatest 
common divisor of all minors of degree k of A. 
The elements e; — d;/d;.., (i—1,...,r; dg = 1) of o 
are the elementary divisors of A. The numbers 
d; (1 «i &r) are called the determinant factors 
of A. 


F. Characteristic Polynomials and Eigenvalues 
Let A —(aj,) be a square matrix of degree n in 


a field K. The determinant F(x)= |x1 — A] is 
a polynomial in x with coefficients in K, 
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called the characteristic polynomial of A. The 
algebraic equation F(x)=0 is called the 
characteristic equation of A, and its roots 4,, 
Aa 4, are called the eigenvalues (proper 
values or characteristic roots) of A. The tdeter- 
minant of A, det A, is equal to []?_, 4; the 
product of the eigenvalues. The sum of eigen- 
values, ZH, 4;= Zug, is called the trace 
(Ger., Spur) or diagonal sum of A, and is de- 
noted by tr A or Sp(A). In case K is talge- 
braically closed, there exists a nonzero vector x 
that satisfies the equations Ax = Ax if and only 
if 4 is an eigenvalue of A. This solution x is 
called an eigenvector (proper vector or charac- 
teristic vector) belonging to the eigenvalue 4. 
In particular, if all the a; are real and A is 
symmetric, then all eigenvalues of A are real, 
and the characteristic equation F(x)=0 of A is 
called a secular equation. Every square matrix 
A satisfies its characteristic equation, Le., 
F(A)=0. This is called the Hamilton-Cayley 
theorem, and it is useful in numerical calcula- 
tion of inverse matrices. 

Let A be a square matrix. As is clear from 
above, there exist monic polynomials f(x) (40) 
in K[x] such that f(A)=0. Let @(x) be such a 
polynomial of the least degree. Then every f(x) 
is divisible by g(x). This (x) is called a mini- 
mal polynomial of A. Let e, (x), ..., e, (x) be the 
elementary divisors of the matrix xf — A. Then 
e,(x) 2 (x). Now if F(x)— x", A is said to bea 
nilpotent matrix, and if F(x) (x — 1)", A is said 
to be a unipotent matrix. 


G. Jordan Normal Form 


Two square matrices A and B are said to be 
similar to each other if B=P~'AP with a 
regular matrix P. The matrices A and B are 
similar if and only if x1 — A and xI — B have 
the same elementary divisors. Now suppose 
that A has elements in a field K and that every 
eigenvalue of A is in K. Then there is a regular 
matrix P with elements in K such that P! AP 
is of the form 


A, 0 
phApe| ^ 
0 E 
where 
A; l l 0 
A ^is ` 
d A 


(A, 1s a square matrix of a certain degree, say 
m,; if m;=1 then A;=(A,).) This matrix P^! AP 
is called the Jordan normal form of A. It isa 
diagonal matrix if and only if the minimal 
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polynomial of A has no multiple roots. If this 
is the case, A is said to be semisimple. 


H. The Exponential Function of a Matrix 


Let A, — (aj?) (v —0, 1, 2, ...) be square matrices 
with elements in the complex number field C. 
The series Ag+ A,+...+A,+... is said to be 
convergent if the series of components af?" + 
allt... E aff +... is convergent for each 

(i,j). For every square matrix A, the following 
series is convergent: 


1 1 
TEA AA es pA Ec 
2! y! 


We denote this series by exp A. Then exp(‘A) 
—'(exp A); exp(—A)=(exp A) !; and if AB 

= BA, exp(A + B) - exp Aexp B. Moreover, 
det(exp A) — exp(tr A). Furthermore, if we set 
F(t) 2 exp(tA), then dF(t)/dt = AF(t) (— 431 
Transformation Groups). 


I. Normal Matrices, Unitary Matrices, and 
Hermitian Matrices 


Let A — (a;,) be a square matrix with elements 
in the complex number field C. Then the ad- 
joint matrix A* of A is the transposed conju- 
gate 'A =(4,,), where a is the complex conju- 
gate of a. If AA* = A*A, then A is said to bea 
normal matrix. A matrix U such that U*U =I, 
that is, U ! = U*, is normal. Following G. 
Frobenius, we call it a unitary matrix (in the 
following discussion, we shall call it simply a 
u-matrix). A matrix H such that H* — H is 
also normal. It is called a Hermitian matrix 
(which we shall refer to as an h-matrix). An h- 
matrix P such that P?— P is called a projec- 
tion matrix. 

The set of all u-matrices forms a tgroup with 
respect to matrix multiplication. If A is a 
normal matrix and U is a u-matrix, U*AU isa 
normal matrix. If H is an h-matrix, Q*HQ is 
also an h-matrix for any matrix Q. Every 
normal matrix A can be transformed into a 
diagonal matrix by a suitable u-matrix (i.e., 
there exists a u-matrix U such that U*AU 
—U !AU is a diagonal matrix), and con- 
versely, every matrix with this property is a 
normal matrix. Thus, in particular, any h- 
matrix or u-matrix can be transformed into 
a diagonal matrix by a suitable u-matrix. 
Moreover, every real symmetric matrix, which 
is naturally an h-matrix, can be transformed 
into a diagonal matrix by an orthogonal ma- 
trix; this will be defined later. If A,,...,A,, are 
mutually commutative normal matrices, we 
can transform them into diagonal matrices by 
the same u-matrix; that is, there exists a u- 
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matrix U such that U ! A;U is of diagonal 
form for all i. All eigenvalues of an h-matrix 
are real, and all eigenvalues of a u-matrix have 
absolute value 1. 

If all eigenvalues of an h-matrix H are posi- 
tive (positive or zero), then H is said to be 
positive definite (positive semidefinite). For an 
h-matrix H, exp H is a positive definite h- 
matrix, and conversely, every positive definite 
h-matrix can be expressed as exp H by a 
unique h-matrix H. Furthermore, every regular 
matrix A can be uniquely expressed as A= UH 
(or A= H'U") by a u-matrix U (or U’) anda 
positive definite h-matrix H (or H’), and A is 
normal if and only if UH — HU. 

A square matrix A such that A* = —A is 
called a skew-Hermitian matrix (simply skew 
h-matrix or anti-Hermitian matrix). All eigen- 
values of a skew h-matrix are purely imaginary 
numbers. If A is a skew h-matrix, exp A is a u- 
matrix. Conversely, if a u-matrix U lies in a 
sufficiently small neighborhood of the identity 
matrix I (i.e., if all elements of U — I have 
sufficiently small absolute values), U can be 
uniquely expressed in the form exp A with a 
skew h-matrix A. 


J. Orthogonal Matrices 


An h-matrix whose components are all real is 
necessarily symmetric. A u-matrix whose com- 
ponents are all real, that is, a real matrix R 
such that R~! —'R, is called an orthogonal 
matrix. The totality of orthogonal matrices 
forms a group with respect to matrix multi- 
plication. If S is a real symmetric matrix, 

there exists an orthogonal matrix T for which 
T 'ST is a diagonal matrix; that is, S can be 
transformed into a diagonal form by T. Every 
orthogonal matrix R can be transformed by an 
orthogonal matrix T into the diagonal form 


TRT 


J 


( cos 0; SE ee, Ste 
= or j=l,...,t. 


—sin 0; cos 6; 


The determinant |R| of an orthogonal ma- 
trix R is either 1 or —1. If [| R| 2 1, then R is 
called a proper orthogonal matrix. If A is a 
real alternating matrix, then exp A is a pro- 
per orthogonal matrix; conversely, any pro- 
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per orthogonal matrix in a sufficiently small 
neighborhood of the identity matrix J can 

be expressed uniquely in this form. Moreover, 
there exists a one-to-one correspondence be- 
tween real alternating matrices A and proper 
orthogonal matrices R without eigenvalues 
equal to —1, given by R=(J—A)({I+ A) ! 
and A «(I— R)( + R) !. This is called a 
Cayley transformation. 

A (complex) square matrix T with the 
property 'T=T ^! is called a complex ortho- 
gonal matrix. This matrix can be uniquely 
expressed as T= R exp(iA), where R is an 
orthogonal matrix, A is a real alternating 
matrix, and i? 2 — 1. 


K. Infinite Matrices 


By an infinite matrix with elements in a ring 
K, we mean an array of elements of K with 
infinite numbers of rows and columns as 
follows: 


where oc, t are indices denoting the row and 
column for the element a,, and each index 
ranges over an infinite set I. Equality, ad- 
dition, and multiplication by an element of K 
of infinite matrices are defined in the same 
manner as for ordinary matrices. Generally, 
however, multiplication of infinite matrices 
cannot be defined. If the elements of each row 
(column) of an infinite matrix are zero except 
for a finite number of them, then it is called a 
row (column) finite matrix. For row (column) 
finite matrices A =(a,,) and B=(b,,), the prod- 
uct AB=(c,,) is defined by c,,= X, a,,b,, for 
all c, rel. By this definition the totality of 
such infinite matrices forms a ring. 

Now let K be the complex number field and 
T the set of natural numbers, and consider an 
infinite matrix (a;,). This is called a bounded 
matrix if the following inequality holds for 
arbitrary x; and y,: 








2 AikXiYk 


m 1/2 / n 1/2 
<m( 5 s) (3 pal ; 


where M is a constant. The set of all bounded 
matrices forms a ring. Ho, 9, ... is a com- 
plete orthonormal system of a "Hilbert space 
$, then for any continuous linear operator A 
we have Aq, = X424 9;a4,, and A corresponds 
to a bounded matrix (a;,). By this correspon- 
dence we have a ring isomorphism between 
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the ring of continuous linear operators of $ 
and the ring of bounded matrices (— 197 
Hilbert Spaces). 


L. Linear Mappings 


Let L, L’ be free right K-modules with bases 
44, ..., Am and bi, ..., bp, respectively. Then, to a 
tlinear mapping f of L into L’, there corre- 
sponds an n x m matrix A such that 


f: aic; (Gy, ..., as) € (b... b)AÍc 
Cw. Cm, 


Conversely, for any n x m matrix A with ele- 
ments in K, there is a unique linear mapping f 
as above (with respect to these bases). In this 
sense, if a linear mapping g of L' (into some 
free K-module) corresponds to a matrix B, 
then the product go f corresponds to the prod- 
uct BA. (For left modules, we can observe 
transpositions.) If a5, ...,a,, and bj, ..., b7 

are another pair of bases, then there are regu- 
lar matrices P, Q such that (a4, ...,a,) = 

(di, ...,a,) P and (b, ..., b) (b, ..., b,)Q. 
With respect to these new bases, the corre- 
sponding matrix to f is Q ! AP. In the partic- 
ular case where L= L', a; — b; (for all i), we see 
that (1) under a fixed basis of L, a linear trans- 
formation of L is nothing but a matrix of 
degree m and (2) a change of basis corresponds 
to the transformation A P~! AP by the base- 
change matrix P. 

Thus, notions applicable to matrices can be 
applied to linear mappings. For instance, an 
eigenvector (characteristic vector or proper 
vector) of a linear transformation q of L is 
an element a( 40) of L such that q(a) - aa 
(a € K). If K is a field, then the characteristic 
polynomial y(X), and therefore also the eigen- 
values (proper values or characteristic roots), 
are invariant under base change. We add a 
little more on the case where K is a field. 

For an eigenvalue « of o, the subspace N, = 
{ae L|o(a) - «aj of L is called the eigenspace 
belonging to «. Furthermore, the space Ni, = 
(ae L|(o —«)* (a)=0 for some k>0} is a 
subspace of L containing N, and is sometimes 
called an eigenspace in a weaker sense. If all 
roots of y(X)=0 are in K, then L is decom- 
posed into the direct sum of N; ,..., Nj , 
where «,,...,0, are the distinct roots of the 
equation y(X)=0. The dimension of N; is 
equal to the multiplicity of the root o; in the 
equation y(X) —O0. This fact is the basis of the 
Jordan normal form. Minimal polynomials of 
linear mappings are well defined. 

A linear transformation q of L is called 
semisimple if L has the structure of a *semi- 
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simple K [ X ]-module determined by Xa = 
o(a). Hence o is semisimple if and only if the 
minimal polynomial p(X) of o has no square 
factor different from constants in K[X]. In 
particular, the condition that all roots of u(X) 
— O0 are in K and simple is sufficient for semi- 
simplicity of o. This condition is equivalent 
to the condition that ¢ is represented by a 
*diagonal matrix relative to some basis of L. 
Then L is decomposed into the direct sum 

of the eigenspaces N,, and q is said to be 
diagonalizable. 

If K is a tperfect field, any linear transfor- 
mation ¢ of L is represented as the sum of a 
semisimple linear transformation q, and a 
nilpotent linear transformation 9,: 9 = P, + On 
(Jordan decomposition). Also, o, and 9, com- 
mute with each other, and they are uniquely 
determined by qo. We call o, and 9, the semi- 
simple and nilpotent component of o, respec- 
tively. Furthermore, p, and o, can be repre- 
sented as polynomials of o without a constant 
term. The transformation q is nonsingular if 
and only if 9, is nonsingular. A nonsingular 
linear transformation ¢ is called unipotent if 
o, is equal to the identity transformation 1,. 
Any nonsingular linear transformation o is 
uniquely represented as a product of a semi- 
simple linear transformation and a unipotent 
linear transformation, which are commutative: 
P = ,¢, (multiplicative Jordan decomposition). 
Here o, is the semisimple component and p, 
=1,+9, ! 9, is unipotent (p, is called the 
unipotent component of o; — 13 Algebraic 
Groups). 


M. Linear Equations 


Let K be a ffield, f,— a;, X, +... + a, X, (i= 

1, ...,m) be m ‘linear forms: K" K, and 
b,,..., b, be m given elements of K. Then a set 
of n elements x,, ..., x, of K, or an n-tuple 

x —(x,, ..., x,)& K", satisfying the system of 

m linear equations 


aj Xi DEE aux, = by, i-]1,...,m, (1) 


is called a solution of equations (1). In partic- 
ular, a system with b, =...=b,,=0, Le., 


Qj Xi +.. du X, —0, 


i=1,...,m, (2) 


is called a system of linear homogeneous equa- 
tions. In the theory of linear equations, the 
field K need not be commutative. In those 
cases where K is noncommutative, we have to 
distinguish between right multiplication and 
left multiplication. We make this distinction 
by adding the word “right” or “left” in paren- 
theses whenever it is necessary to do so. 

If x,, ..., x, are solutions of the system (2), 
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then any of their (right) linear combinations 
£; xc; (c;e K) is also a solution of (2), so that 
the solutions of (2) form a f(right) linear space 
L over K. L is the kernel of the (left) linear 
mapping K"— K" given by x( f(x), ..., 
Lil, Let s2 dim L. If s=0, the system (2) 
has only x 20 as a solution, called the trivial 
solution. If s 0, L has a basis {x,,...,x,}. 
Then x, ..., x, are (right) linearly independent 
solutions of (2), and every solution of (2) is a 
(right) linear combination of them. We say 
that x,, ..., x, form a system of fundamental 
solutions of (2). Let r denote the maximum 
number of (left) linearly independent forms in 
the set { f1,f,, ..., Jm}. Then we have s+r =n, 
so that the system (2) has a nontrivial solu- 
tion if and only if r « n, and the number of 
linearly independent fundamental solutions is 
n— r. Accordingly, if the number of equations 
is less than the number of unknown quantities, 
there always exists a nontrivial solution. 
Suppose now that the system (1) has a solu- 
tion xo. Then every solution x of (1) can be 
written xo +y for a solution y of (2); and con- 
versely, for any solution y of (2), x+y isa 
solution of (1). Furthermore, in order for (1) to 
have a solution, it is necessary and sufficient 
that 3;c;b; 2 0 whenever > ;c; f; - 0 (c;€ K), Le, 
that the following two matrices have the same 
trank: 


ayy din 
A= ; 

Amı Amn 

gun c0 dj, b 
A= 

Bids amn b 


mn m 


In particular, equation (1) has a unique solu- 
tion if and only if A and A have the same 
rank n. If m- n (Le., A is a square matrix), (1) 
has a unique solution if and only if A has the 
inverse A ^!, and then the solution is given by 
'x A 7 !b, where b=(b,, bz, ..., bp). 

We have also the following result. Let K' be 
an 'extension field of K. If a system of linear 
equations in K has a solution in K', it hasa 
solution contained in K. In particular, if (2) in 
K has a nontrivial solution in K', it has a 
nontrivial solution already in K, and any 
system of fundamental solutions in K is itself a 
system of fundamental solutions in K’. 

Suppose now that K is commutative. Then 
we have an explicit formula for solving equa- 
tion (1) by means of *determinants. 

Consider first the case m =n. Let A=|Al], the 
determinant of the matrix A. If A #0, then (1) 
has a unique solution, given by x, — A,/A 
(k- 1, ..., n), where A, is the determinant ob- 
tained from A by replacing its kth column, 
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Aik ak, Any DY Du, b2, ... Dp. This ts called 
Cramer's rule. To consider the general case, let 
r be the common rank of A and A. We may 
assume that |aj,| x O (i k — 1, ..., r), appropri- 
ately changing the order of the equations and 
unknowns. Then by Cramer's rule we can 
solve the first r equations for x}, ..., x,, assign- 
ing arbitrary values to x,.,, ..., x,. In order for 
equation (2) to have a nontrivial solution, it is 
necessary and sufficient that the rank of A be 
less than n, and hence that | 4| 20 if A is a 
square matrix. For geometric applications of 
the theory of linear equations — 7 Affine 
Geometry, and for numerical solution — 302 
Numerical Solution of Linear Equations. 


N. Positive Matrices 


A positive (or nonnegative) matrix is a matrix 
whose elements are all positive (or nonnega- 
tive) real numbers. A nonnegative square ma- 
trix A is said to be reducible (or, more pre- 
cisely, reducible by permutations) if by certain 
permutations of rows and columns A is re- 
duced to a form 


0 A, 
with square matrices A,, A45; otherwise A is 
called irreducible. 


Theorem (Perron). For each positive matrix A, 
there is a positive real number r such that (1) r 
is a simple root of the characteristic equation 
for A and (2) every other eigenvalue of A has 
absolute value less than r. Furthermore, (3) if v 
is an eigenvector corresponding to r, then the 


components of v are all positive or all negative. 


Theorem (Frobenius). If A is an irreducible 
nonnegative matrix, then there is a positive 
number r satisfying (1) above and such that 
every eigenvalue of A has absolute value at 
most r. Furthermore, (3) above also holds in 
this case. 


Theorem. If A is a nonnegative matrix, then 
there is a nonnegative real number r that is 

an eigenvalue and such that absolute values of 
other eigenvalues are at most r. If v is an 
eigenvector corresponding to this r, then the 
components of v are all nonnegative or all 
nonpositive. 


References 


See references to 103 Determinants. 
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A. General Remarks 


Roughly speaking, a measure is a nonnegative 
function of subsets of a space completely addi- 
tive in the sense that the measure of the union 
of a sequence of mutually disjoint sets is the 
sum of the measures of the sets. This concept 
was introduced as a mathematical abstraction 
of length, area, volume, mass distribution, 
probability distribution, etc. Measure theory is 
the basis of integration theory, and these two 
theories play a fundamental role in modern 
mathematics, particularly in analysis, func- 
tional analysis, and probability theory. 


B. Important Classes of Sets 


Let X be an abstract space. The power set of 
X, the class of all subsets of X, is denoted by 
27. Let A be a nonempty subclass of 2*. If N 
is closed under intersections, i.e., if A, BEA 
implies AN BeA, then A is called a multipli- 
cative class on X. If 2 is closed under finite 
unions and complements, then 9I is called an 
algebra (or finitely additive class or field) on X. 
If 9I is closed under countable unions and 
complements, 9I is called a c-algebra (count- 
ably additive class, completely additive class, or 
Borel field). If XN is closed under montone 
limits, i.e., if for every sequence A EW, n— 1, 

2, ..., monotone increasing or monotone de- 
creasing, the union or intersection of the se- 
quence belongs to 9I, then QI is called a mono- 
tone class on X. If Xe% and if 9I is closed 
under countable disjoint unions and proper 
differences (A.B for A >B), then 9I is called 
the Dynkin class on X. For any € c 2* the 
least o-algebra that includes € is called the ø- 
algebra generated by &; this is written as c [&]. 
Similarly, we denote by M[C](D[C]) the 
monotone class (Dynkin class) generated by ©. 
The following theorems are often useful. 

The monotone class theorem: If € is an algebra, 
then M[C] — c [€]. 

The Dynkin class theorem: If & is a multiplica- 
tive class, then D[C] —o[&]. 


C. Measurable Spaces (Borel Spaces) 


A space X endowed with a c-algebra B on X 
is called a measurable space (or Borel space) 
and is denoted by (X, B). A set belonging to B 
is called a B-measurable set (or Borel set) in 
(X, 8). It is obvious that the empty set @ and 
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the whole space X are B-measurable. B- 

measurability is preserved by complements, 
namely, if A is 8-measurable, then so is A‘. 
Similarly B-measurability is preserved by 

countable set operations such as countable 
unions, countable intersections, differences, 
etc., but not always by uncountable unions. 

For a sequence {A,} (ne N) of subsets of 
X, the superior limit (or limit superior) and 
the inferior limit (or limit inferior) are de- 
fined by limsup,..., An = (5-1 JL An) and 
lim inf, |, A, — (Jai Ve An), respectively. 
The symbols lim and lim may be used in place 
of limsup and lim inf, respectively. The inferior 
limit is always contained in the superior limit, 
and when they coincide, they define the limit 
of {A,}, which is denoted by Dm... A,. If 
{A,} is monotone increasing (or decreasing), 
lim, A, is (J, A, (or (55, A,) (^ 87 Con- 
vergence L). 38-measurability is preserved by 
these limits. 

Let Y be a subset of a measurable space 
(X, B). Then the class BN Y={BN Y| BeB} is 
a c-algebra on Y. Y is regarded as a measur- 
able space endowed with BN Y unless stated 
otherwise. Suppose that we are given a family 
of measurable spaces (X,,%,), Ae A, where A is 
an arbitrary index set. Let X be the *Cartesian 
product of X,, 4€ A, and z, the projection 
from X to X, for each Ae A. Then X is re- 
garded as a measurable space endowed with the 
c-algebra generated by the class (z;!(B,)| Ae A, 
B,€8,}, denoted by IT,-, 8,, unless stated 
otherwise. 

Let T be a topological space. The o-algebra 
on T generated by the open subsets of T is 
called the topological c-algebra (Borel field) on 
T, denoted by B(T). Some authors define B(T) 
to be the c-algebra generated by compact 
subsets of T, which is equivalent to the defi- 
nition mentioned above if T is locally compact 
and o-compact. T is regarded as a measurable 
space endowed with B(T) unless stated other- 
wise. Hence R” is a measurable space (R", B”), 
where $8" = S3(R"). A 8(T)-measurable subset 
of T is called a Borel subset of T. If the tchar- 
acteristic function of a set is a tBaire function, 
then the set is a Borel set. Such a set is called a 
Borel set in the strict sense (or a Baire set). The 
union (intersection) of at most a countable 
number of closed (open) sets is called an F, set 
(G; set). Both F, sets and G; sets are Borel sets. 
Denote by ga the class of all closed subsets of 
X and by 6, the class of all open subsets, and 
define classes d - (6.) for an ordinal number 
é to be the sets expressible as a countable 
union (intersection) of the sets belonging to 
nce Balanec 9,). Then by the construction, 
every set belonging to either §, or (5, is a 
Borel set. If œ, < 6 then = Üo,» ©,= 6,, . 
(For the definition of w, — 312 Ordinal Num- 
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bers.) In particular, in a tcompletely normal 
topological space (for example, a metric space), 
a c Gy, 6, c By if <P, and both | Jeco, Be 
and UL 6. coincide with the Borel field. In 
this case, an arbitrary Borel set B is a Baire 
set, and thus can be classified according to the 
tBaire class of the characteristic function of 

B. It can also be classified according to the 
smallest ordinal number č with respect to 
which Be. (or BeG,). 

If S is a subset of a topological space T, 
then S is regarded as a topological space with 
the relative topology and so is regarded as a 
measurable space (S, 8(S)). Hence every subset 
of R" is regarded as a measurable space. S is 
also a measurable space (S, B(T)N S), as men- 
tioned above. Since 8(S)=8(T)NS, no con- 
tradiction arises. Let T}, 4€ A be a family of 
topological spaces. Then the topological prod- 
uct T of this family is regarded as a measur- 
able space (T, sB(T)) or (T, TL. B(T,)). It 
should be noted that these two c-algebras 
may be different. They are the same if T has 
a countable open base. 

Let (X;, 8,), i— 1, 2, be measurable spaces. A 
bijective mapping f from X, to X, is called a 
Borel isomorphic mapping if f(8,)={ f(B,)| 
B eB} =%8,. If there exists such an f, then 
(X5, 8,) is said to be Borel isomorphic to 
(X,,%,). Borel isomorphism is an equivalence 
relation. A measurable space is called a stan- 
dard measurable space (or a standard Borel 
space) if it is Borel isomorphic to a Borel sub- 
set of R! (viewed as a measurable space as 
explained above). A striking fact is that a 
standard measurable space is Borel isomor- 
phic to one of the following spaces: {1,2,...,n} 
(n— 1,2, ...), N, [0,1]. A measurable space 
which is Borel isomorphic to an tanalytic sub- 
set of R! is called an analytic measurable 
space. Every complete separable metric space, 
viewed as a measurable space, is standard. 
More generally, every ‘Luzin space is a stan- 
dard measurable space. Hence the spaces of 
tdistributions, Z' and 9", are standard mea- 
surable spaces, being Luzin spaces. Every 
'Suslin space, viewed as a measurable space, 
is analytic. 


D. Measure 


A real-valued set function m defined on a 
finitely additive class W on a space X is called 
a finitely additive measure (or Jordan measure) 
if it satisfies the following two conditions: (I) 
Ox m(E)& oo, m(@)=0; (Io) A, BEM, AN 
B= Ø imply m(AU B) 2 m(A) 4- m(B). 

A set function u defined on a completely 
additive class 38 in a space X is called a mea- 
sure (completely additive measure or c-additive 
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measure) on $ (or on X) if it satisfies the fol- 
lowing two conditions: (I) 0 < (E) € oo, vo 
=0; (II) E,e8 (n— 1,2, ...), EYE, — (j#k) 
imply wl. Ej) = E2, n (E,) (complete addi- 
tivity or c-additivity or countable additivity). 
The triple (X, 8, 1i) or the pair (X, uk where X 
is a space, B a completely additive class of 
subsets of X, and ua measure, is called a 
measure space. We call u or (X, B, u) finite or 
bounded if (X) « oo and o-finite if there exists 
a sequence {X,,} satisfying u(X,„) < oo and 

mo, X,=X. A set deis called atomic if 
(i) u(A)>0 and (ii) Be% and Bc A imply 
that u(B) 20 or u(B) 2 (A). A measure space 
(X, B, u) is called nonatomic if no element of B 
is atomic. The simplest bounded measure is 
given by defining m(A)=1 if ae A and m(A)=0 
if a¢ A, where a is a given point in X. Such a 
measure is called the Dirac ó-measure. A mea- 
sure m with m(X)— 1 is called a *probability 
measure (— 342 Probability Theory). 

Equalities appearing in (II) and (II) include 
the possibility that both sides equal oo. In 
measure theory and integration theory, ad- 
dition and multiplication involving +00 are 
carried out as follows (a denotes a real num- 
ber): (400) ca ad (-o0)— +0;(+0)-a= 
+o, a—(+00)= +00; (+00)+(+00)= +0; 
(+00)—(Foo)= +0; a-(+00)=(+0)- a= 
+œ ifa>0; a-(400)=(+00):a= Fo ifa< 
0; (4-00): (4-00) — +00. (Sometimes it is further 
agreed to put 0-(+00)=(+00):0=0.) 

A $8-measurable set in a measure space 
(X, B, u) is also called u-measurable (or sim- 
ply measurable). For a sequence {A,} of u- 
measurable sets the following conditions 
hold: (1) u(lim inf, ,, A,,) x lim inf, u(A,,); 
(ii) if ut JE An) < +00 for some no, then 
“(lim sup, An) 2 limsup,.,,, H(A,); and (iti) 
if Im, A, exists and the hypothesis of (ii) 
holds, then (lim-o A,)=lim,.,, 4(A,). For a 
finitely additive measure u on a completely 
additive class 8 to be a completely additive 
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space (X, 88, u). If the set of all points for which 
a property P fails to hold is a null set, then we 
say that P holds almost everywhere (a.e. or 

at almost all points). Sometimes we use the 
expression "almost P" to describe the same 
situation. 


E. Construction of Measures 


A set function *(A) defined for every subset of 
X is called a Carathéodory outer measure (or 
outer measure) on X if it has the following 
three properties: (1) 0< u*(A) < oo, u* (f) —0; 
(ii) AC B = u*(4)  n*(By; (iii) w*(U An) < 

2 1H4*(A,). Because of (iii), the inequality 
u*(B) € u*(B' A) 4 u*(BN A‘) always holds; 
if the equality holds for every B, then A is 
called measurable with respect to up. It follows 
from this definition that a set A satisfying 
p*(A)=0 is always measurable with respect to 
u*. The class $ of all sets measurable with 
respect to uh is a completely additive class, 
and if we define u(4)= u*(A) for A belonging 
to B, then p(A) gives a complete measure on 
B. This measure is called the Carathéodory 
measure induced by uf or the generalized 
Lebesgue measure. 

A finitely additive measure m defined on a 
finitely additive class W is called completely 
additive when A, EM, A; 4, — Ø (j E Kk), 

2, A,eM imply mE, 4) E mA). M 
by means of such an m we define u*(A) for an 
arbitrary Ac X to be the infimum of all pos- 
sible values EZ, m(A,), where Ac ( J™, A, 

(A, EM), then uf gives a Carathéodory outer 
measure. From this u* a measure p on a com- 
pletely additive class 98 is induced as described 
above, and we have $8 >M and (A) 2 m(A) for 
Jet Therefore m can be extended to a mea- 
sure jt on of Wit] (E. Hopf's extension theorem). 
In particular, if X is a countable union of sets 
of finite measure, then this extension is unique. 
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x, &b,, k—1,2,...,n] (—0 <a, «b, « oo)is 
defined by m(I) = II (b; — a,). Let Mo be the 
collection of all sets that can be represented as 
the finite union of disjoint left-open intervals, 
and for such expression A =| J*.., J, in My 
define m(A)= 27, m(I). Let  -90,U( i] 
and m(@)=0. Then m gives a finitely additive 
measure on Wè that is completely additive. 
Therefore m determines an outer measure p*, 
which in turn determines a measure u. This ut 
is called the Lebesgue outer measure (or simply 
outer measure). Sets that are measurable with 
respect to uf are said to be Lebesgue measur- 
able (or simply measurable), and the measure 
H is called the (n-dimensional) Lebesgue mea- 
sure (or simply measure). Every interval is 
measurable, and its measure coincides with its 
volume. Open sets, closed sets, and Borel sets 
are all measurable. More generally, suppose 
that an outer measure u* defined on a metric 
space X with a metric d satisfies the condi- 
tion: If d(A, B) 0, then u*(A U B) pu*(A) 
H*(B). Then every closed subset of X is u*- 
measurable, and therefore so is every Borel 
subset. Here d(A, B) - inf(d(a, b) ae A, be B] 
denotes the distance between the two sets A 
and B. A measure u defined on the class of all 
Borel subsets of a topological space X is called 
a Borel measure. The cardinality of the set of 
all Lebesgue measurable subsets of R" is 2°, 
while the cardinality of the class of Borel 

sets is c (here c is the cardinal number of the 
‘continuum, i.e., the cardinal number of R). 
Therefore there exists a Lebesgue measur- 
able set that is not a Borel set. It follows 
immediately from the definition that the 
Lebesgue measure is K-regular, so for every 
Lebesgue measurable set A we can find an tF,- 
set B and a tG;-set C such that BC Ac C and 
u(C — B)=0. 

Historically, for a bounded subset A of R", 
C. Jordan defined m(A) to be the infimum of 
all possible values m(B), where BEIM and B> 
A, and m(A) to be supremum of all possible 
values m(B), where BEIM and Bc A. He called 
m(A) the outer volume of A and m(A) the inner 
volume of A (in the case of R?, the outer area 
and inner area, respectively). When m(A) — 
m(A), A is called Jordan measurable, and this 
common value is defined to be the Jordan mea- 
sure (Jordan content) of A. Jordan measure is 
only finitely additive, and was found to be un- 
satisfactory in many respects. It was Lebesgue 
who modified this notion and introduced 
completely additive measures. Jordan measur- 
able sets are always Lebesgue measurable. 

Using the faxiom of choice, a set that is not 
Lebesgue measurable can be constructed (G. 
Vitali). For example, a set obtained by choos- 
ing exactly one element from each coset of 
the additive group of all rationals in the addi- 
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tive group of the reals is not measurable [3, 
pp. 67-70]. 


H. Product Measure 


When two o-finite complete measures py and 
fly are defined on completely additive classes 
By and By on X and Y, respectively, an ele- 
ment C belonging to the smallest finitely addi- 
tive class & in the Cartesian product that 
contains {A x B| Ae8y, Be By} can be repre- 
sented as a finite disjoint union C — ( J^..,(A; 

x Bj (4j€ By, Bye By). If we define v(C) = 

Xa ux( A) (Bj) (here we agree to put 0: oo = 
0), then this value is independent of the way 
the set C is represented, and v defines a com- 
pletely additive measure on &. By extending v 
by means of Hopf's extension theorem, we 
obtain a (complete) measure space, called the 
(complete) product measure space obtained 
from the measure spaces (X, By, wy) and (Y, 
By, uy). The measure obtained in this way 

on the space X x Y is called the product mea- 
sure of 4i and py and is denoted by uy x py. If 
we denote by M, the class of all Lebesgue 
measurable subsets of the p-dimensional Eu- 
clidean space R? and by m, the p-dimensional 
Lebesgue measure, then the (complete) product 
measure space of (RT, 9, m,) and (R9, 9t, , m,) 
is (R*9,99t,, ,, mal The product measure 
space of any finite number of measure spaces 
(Xj, B; uj) (i= 1, ..., n) is defined similarly. 

Let X, (4€ A) be spaces with an index set of 
arbitrary cardinality. For the product space X — 
IL.4X;. an n-cylinder set, or simply a cylinder 
set, is a set of the form A x IIg, 2 X; 
(Ac X; x ... x X,). If a finitely additive class 
9L, is given for each X,, the class of all subsets 
that can be represented as the union of a finite 
number of cylinder sets of the form A, x A, x 
XA XI, Xa (Ave Uy, J = 1,2, ..., n) 
is a finitely additive class in the space X. 
When each 9I, is completely additive, the 
completely additive class 2 generated by this 
finitely additive class is called the product of 
the completely additive classes 9I, and is de- 
noted by [T,., 2,. When a measure space (X,, 
B,.u,) with 4,(X,)= 1 is given for 4e A, a mea- 
sure u can be defined in the following way on 
the completely additive class B = [T,.4 B, in 
the product space X: To begin with, for a cyl- 
inder set of the form A, x ... x A, x X' (here 
A,€88, and X'= [ieta Au Xa), we define 
H(A, X... X AX X)8 py (41)... Ha, (A,). If we 
extend u to the finitely additive class € con- 
sisting of all sets that can be represented as 
the finite union of such cylinder sets, then this 
extension gives a completely additive measure 
on H. and therefore, by Hopf's extension theo- 
rem, there exists a unique extension to a mea- 
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sure u on $8, and yp satisfies u(X)=1. We 
denote this u by u= [zeh Ha. 


I. Radon Measure 


Let X be a tlocally compact Hausdorff space, 
B the topological o-algebra on X, and Co(X) 
the real linear space of all real-valued con- 
tinuous functions f on X having tcompact 
support (i.e., the closure of the set {x| f(x) #0} 
is compact). A (real) linear functional @ de- 
fined on C,(X) is called a positive Radon mea- 
sure if 9(/) Z0 whenever f 20. For such a 
functional o there corresponds a Borel mea- 
sure u on X for which ot f) — fx f du holds for 
every f e Co(X). Lebesgue measure m, is re- 
garded as a positive Radon measure on R”. If 
X is o-compact De, X can be represented as 
the countable union of compact sets), then the 
property o( f)— fx f du for all f e Co(X) defines 
the measure u uniquely on the class of Borel 
sets of X. A linear functional on C,(X) that 
can be written as the difference of two positive 
Radon measures is called a Radon measure. 
For a linear functional o defined on CX) 

to be a Radon measure, it is necessary and 
sufficient that for an arbitrary f € C,(X), the set 
te(a)|lgl «Ifl ge Co(X)) be bounded. Equiva- 
lently, it is necessary and sufficient that the 
restriction of o to the subspace of all functions 
in Co(X) having their support in a fixed com- 
pact subset of X must be continuous with 
respect to the ttopology of uniform conver- 
gence. Therefore, if X is compact, an arbitrary 
continuous linear functional on C(X) is a 
Radon measure. L. Schwartz investigated 
Radon measures on spaces that are not locally 
compact [6]. 


J. Measurable Functions 


When a completely additive class 8 on a 
space X is given, a function f defined on a $- 
measurable set E and taking real (and possibly 
+œ) values is called a 8-measurable function 
on E if for an arbitrary real number a, the set 
Ix f(x) al is B-measurable. The condtion 

f >a may be replaced by f za, f «o, f «a. 

A function f can also be defined to be 8- 
measurable if the tinverse image under f of 
any Borel set is 8-measurable. When f and 

g are B-measurable, so are af + bg (a,b con- 
stants), f: g, f/g, max(f, g), min(f, g), |f|? (p a 
constant), whenever they are well defined. The 
superior and inferior limits of a sequence of 8- 
measurable functions are also 8-measurable. 
In a complete product measure space of two c- 
finite measure spaces, a function f(x, y) may 
fail to be measurable as a function of two 
variables even if it is measurable with respect 
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to each of the variables x and y separately. For 
example, let Ẹ be the class of all closed subsets 
F of R? satisfying m,(F)>0, F c [0,1] x [0,1]. 
Using the fact that the cardinality of § does 
not exceed the cardinal number of the con- 
tinuum, we can prove by *transfinite induction 
that the elements of % can be indexed as F;, 
E< c, E being the cardinality of £, and that for 
every Fe we can pick two points z, —(x;, 
yj. Z= (Xz, Yz) in such a way that if F; 4 F,, 
then Xz, Xz, x, x, are all distinct and yz, yz, Yy» 
y, are all distinct. Furthermore, we can prove 
that the set E consisting of all such z; is not 
measurable. Therefore, denoting the charac- 
teristic function of the set E by f(x, y), f(x, y) is 
not measurable; but if we fix x (resp. y), then as 
a function of y (resp. x), f(x, y) is measurable, 
since f(x,-) (resp. f(-, y)) is always 0 except 
possibly at one point. If 8 is the class of all 
Lebesgue measurable sets or the class of all 
Borel sets, then a B-measurable function is 
called a Lebesgue measurable function (or 
simply measurable function) or a Borel mea- 
surable function, respectively. 

In Euclidean space, the class of all Borel 
measurable functions coincides with the class 
of all Baire functions, and an arbitrary Le- 
besgue measurable function is equal almost 
everywhere to a Baire function of at most the 
second class. For a function f that is finite 
almost everywhere on a Lebesgue measur- 
able set E to be Lebesgue measurable, it is 
necessary and sufficient that for an arbitrary 
£2» 0 we can find a closed subset F such that 
m(E — F) « e and f is continuous on F (Luzin's 
theorem). 

If a sequence { fp} of B-measurable func- 
tions on a measure space (X, B, ul converges 
almost everywhere to f on a set E with u(E) « 
oo, then for an arbitrary £0 we can find a 
set F (F CE, Fe$88) such that u(E— F) « e and 
f, converges uniformly on F. If X =R” and B 
is either the class of Borel sets or the class 
of Lebesgue measurable sets, then the set F 
can be chosen to be a closed set (Egorov's 
theorem). 

For a finite measurable function f(x) defined 
on the real line, there exists a sequence {h,} 
such that lim, ,, f(x + h,) = f(x) almost every- 
where (H. Auerbach). 

The functional equation f(x + y) 2 f(x) 4- 
f(y) has infinitely many nonmeasurable solu- 
tions (G. Hamel; — 388 Special Functional 
Equations). 

Let W, i— 1, 2, be c-algebras on X;, i— 1, 2, 
respectively. A mapping f: X, X, is said to 
be measurable 9I, /9L, if f^! (4;)e9I, for every 
A c9L,. When W, generates 9L,, f: X, X, is 
measurable YW, /9L, if f^! (A5) 9I, for every 
Ae. For example, a mapping f: R! OR! 
is Borel (Lebesgue) measurable if and only if 
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f is measurable $8! /98! (M, /$8!), where B! 

and Jt, denote the Borel subsets and the Le- 
besgue measurable subsets of RI Measurabil- 
ity of mappings is preserved by compositions, 
namely, if f: X, >X, and g: X; — X, are mea- 
surable 91, /9L, and 9L, /9I,, respectively, then 
gof: X, X, is measurable d, /9L,. If the pro- 
jection z,: ITi., X;^ X, is measurable, then 
IT;., 9I;/90,,. Since 2; ! (4j) (ie I, A;e90;) generate 
IL 9l;, a mapping f: X 2 II;.; X; is measur- 
able 9L/TT;., 9l; if 2,0 f: X — X; is measurable 
W/W for every ie I, where 9I is a c-algebra 

on X. 


K. Image Measures 


Let u be a regular measure on a measurable 
space (X, B), i.e., the completion of a measure 
on By. Then every mapping f: X > Y measur- 
able $8,/38, induces a c-algebra on Y: 


B={BcY|f~'(B) is u-measurable], 
and a measure on 8: 
v(B)— uCf "UD, 


The measure v is called the image measure of 
p under the mapping f; this is denoted by 

uf ` or f: u. Obviously, v(Y)- p(X), and v is 
complete. Although $8 includes $8, by virtue 
of the measurability of f, v is not regular in 
general. However, v is regular if u is a bounded 
measure and if both (X, 8,) and (Y, 8,) are 
analytic measurable spaces (— 22 Analytic 
Sets I). Therefore the image measure of a 
regular probability measure under a measur- 
able mapping is also a regular probability 
measure in practically all useful cases. 


L. Related Topics 


(i) Integration. For a nonnegative measurable 
function f(x) on (X, 8, m), we can define the . 
integral of f(x) on a set Ee%, denoted by 
Sef u(dx), fef) dul), [ef dp. or [s f. For a 
real measurable function f(x), the integral is 
defined to be equal to fg f* du — fef ^ dy, if at 
least one of these integrals is finite, where f * 
and f^ are the positive and negative parts of f 
(— 221 Integration Theory B). 

(1i) Fubini's theorem for measures. Let (X, 
3B, u) be the complete product measure space 
of (X;, B; uj) (i= 1,2), where both u, and u, 
are o-finite and complete. For Ee% we define 
the sections E(x,) and E(x,) by 


E(xi) 9 txsl(xi, x3)e E}, 
E(x;) {x1 (x1, X2)E E}. 


Then we have E(x,)eB, for almost all (u,)x, € 
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X,, E(x,)€B, for almost all (u;)x; € X,, and 


u(E) | Ho (E(X1)) 44 (dx) 
X, 


-| Hi (E(x2))u2(dx2) 
X; 
(— 221 Integration Theory E). 

(iii) The Radon-Nikodym theorem and the 
Lebesgue decomposition theorem for measures. 
Let u and v be o-finite measures on (X, %8). 

If u(E) =0 implies v(E) —0, then v is said to 

be absolutely continuous with respect to u, 
denoted by v<y. v<y if and only if v is ex- 
pressible in the form v(E)— fz f du, where 

f is a nonnegative 8-measurable function 

(the Radon-Nikodym theorem). Such an f is 
uniquely determined up to j4-measure 0 and is 
called the Radon-Nikodym derivative of v with 
respect to u, denoted by dv/du. The concept 
opposite to absolute continuity is singularity. v 
is said to be singular with respect to u if there 
exists an Ee% such that v(E?) - (E) - 0. If u 
and v are two arbitrary o-finite measures, then 
v is expressible uniquely as the sum of an 
absolutely continuous measure v, and a sin- 
gular measure v; with respect to u (Lebesgue 
decomposition theorem) (— 221 Integration 
Theory D, 380 Set Functions C). 

(iv) Invariant Measures. Let (X, 8) be a 
measurable space and G a group of Borel 
isomorphic mappings from (X, 58) to itself. A 
measure u on (X, $8) is said to be invariant 
under G if (E) — u(g !(E)) for every Ee% and 
every ge G. For example, the Lebesgue mea- 
sure m on (R", B”) is invariant under the group 
of congruent transformations (— 225 Invariant 
Measures). 

(v) The Lebesgue-Stieltjes measure. Let f bc 
a right continuous monotone increasing func- 
tion on R. Then there exists a unique measure 
u on (R, 8!) such that u((a, b]) = f(b) — f(a) for 
a « b. This measure is called the Lebesgue- 
Stieltjes measure induced by f (— 166 Func- 
tions of Bounded Variation B). 

(vi) Baire measurable functions and univer- 
sally measurable functions. Let f(x) be a real- 
valued function defined on a topological space 
X. If for every open set O the inverse image 
f !(O) has the *Baire property (resp. is mea- 
surable with respect to the completion of any 
o-infinite Borel measure on X), then f is said 
to be Baire measurable (resp. universally mea- 
surable or absolutely measurable). Universally 
measurable functions can be defined similarly 
on a measurable space. 

(vii) Disintegration. Let (S, €) and (T, X) be 
standard measurable spaces, v a o-finite mea- 
sure on (S, S), and p: SO T a v-measurable 
mapping. If the image measure u — vf ! is o- 
finite, there exists a unique family of measures 
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(v, te T} on (S, S) such that (1) t v (E) is 
universally measurable for every Ee GS, (2) v, is 
concentrated on p '! (t) for almost every t with 
respect to u, and (3) the following equality 
holds: 


v(E) -Í u(dt)v (E), Ee S. 
T 


This expression is called the disintegration of v 
with respect to u. Every v-measurable function 
f on S is v,-measurable for almost every t with 
respect to u, and the integral of f is expressible 
in the form of iterated integrals as follows: 


| f(s)v(ds)= | (dt) | S(s)v¢(ds). 


This corresponds to Fubini’s theorem on 
product measures. When v is a probability 
measure on (S, S), p(s) is regarded as a (T, X)- 
valued random variable on (S, G, v), and v, is 
the conditional probability measure under the 
condition p(s) — t. (— 342 Probability Theory 
E. For disintegration of measures on a topo- 
logical space — [3, ch. 9].) 
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271 (XX.4) 
Mechanics 


A. Newton's Three Laws of Motion 


The study of laws governing the motion of 
bodies began with the laws of falling bodies, of 
which the first exact formulation was made by 
Galileo. But the general relationship between 
force and acceleration was first described by 
*Newton, who established Newton's three laws 
of motion; the mechanics based on them is 
Newtonian mechanics. Newton expounded 
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these laws in his famous book Principia mathe- 
matica philosophiae naturalis (1686— 1687), 
where the law of gravitation and its applica- 
tion, problems of fluid motion, motions of the 
planets in the solar system, etc., were systemat- 
ically treated. 

Newton's first law. A body continues its 
state of rest or uniform motion in a straight 
line unless it is compelled to change that state 
by external action (i.e., force). This is also 
called the law of inertia. 

Newton's second law. The rate of change of 
momentum is proportional to force and is in 
the direction in which the force acts. Here, 
momentum is defined as the product mv of the 
mass m and the velocity v. This law can be 
expressed as d(mv)/dt 2 F, where F is the force 
expressed in an appropriately chosen system of 
units. Since dv/dt is the acceleration a, the law 
takes the form ma — F, when the mass is con- 
stant. These equations are called equations of 
motion. The second law is often simply called 
the law of motion. 

Newton's third law. When two bodies 1 and 
2 in the same system interact, the force exerted 
on body 1 by body 2 is equal and opposite in 
direction to that exerted on body 2 by body 1. 
This law is called the law of action and reac- 
tion, or simply the law of reaction. 

Various attempts at a rigorous axiomati- 
zation of Newtonian mechanics have been 
made, beginning with E. Mach's work. At the 
beginning of the 20th century, it was found 
that Newtonian mechanics requires modifica- 
tion when bodies travel at speeds approach- 
ing the speed of light or when it is applied to 
physical systems of molecular size or smaller. 
These modifications led to the establishment of 
the theory of frelativity and tquantum mechan- 
ics. In contrast to these later theories, New- 
tonian mechanics is called classical mechanics. 


B. Newton's Law of Gravitation 


Kepler discovered the following three laws for 
the motion of planets around the sun (valid 
within the accuracy in observation available at 
the time): 

Kepler's first law. The orbit of a planet is an 
ellipse with the sun at one of its foci. 

Kepler's second law. The area swept per unit 
time by the straight line segment joining the 
planet and the sun is independent of the posi- 
tion of the planet in its orbit. 

Kepler's third law. The square of the period 
(the time needed for the planet to go around 
the orbit once) is proportional to the cube of 
the major axis of the orbit. 

From these empirical laws, Newton deduced 
his law of universal gravitation: Between any 
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pair of point particles, with masses m, and 

m, and at a distance r, there arises an attrac- 
tive force along the line joining the two points; 
the magnitude of this force is given by F — 
Gm,m;r ?, where G is a universal constant 
(approximately 6.670 x 107? dyne cm? g`?) 
called the gravitational constant. 


C. Kinetic and Potential Energies 


If the force F on a particle is a function of the 
position x of the particle and is the gradient 
— VU of a time-independent (scalar) function 
U(x), called the potential, then 


1 d 
E= mv’ + U(x) (=) 


is a constant of the motion of the particle. 

E is called the total energy, while the first 
and second terms on the right-hand side are 
called the kinetic energy and potential energy, 
respectively. 

For a system of n particles at points x", ..., 
x”) with masses m,,...,m,, suppose that the 
force acting on the particle at x? is — VOU 
(the gradient of U relative to x) for a com- 





mon potential function U Gill... x®). Then 
n | ax) 
EA -m;v? 4 U(x®), ..., x? v= 
2. 2 Jd ( ) J dt 


is the constant total energy of the motion. For 
example, Newton’s gravitational force acting 
among a number of particles can be described 
by the Newtonian potential 
U=)) —Gmm,|x® —x9 |), 

i<j 
A potential U which is a sum of functions 
depending only on a pair of coordinates as 
in the above example is called a two-body 
interaction. 


D. Apparent Force 


The coordinate system in which Newton’s 
three laws of motion hold is called an iner- 

tial system. In some cases (e.g., on a rotating 
sphere such as the earth) it is more convenient 
to use a moving coordinate system, in which 
the equation of motion derived from Newton's 
second law by a coordinate transformation 
(from an inertial system to the moving coordi- 
nate system) takes a form similar to the second 
law except for an additional apparent force to 
be added to the force in the original equation 
of motion. For a coordinate system rotating at 
a constant angular velocity o (relative to an 
inertial system), the apparent force consists of 
the centrifugal force, of magnitude mo? p (p 
being the distance to the axis of rotation), 





1006 


which pushes the particle away from the axis 
of rotation (along the perpendicular), and the 
Coriolis force 2mv x œ ( x denotes the tvector 
product), which bends the motion of the par- 
ticle in a direction perpendicular to both the 
axis of rotation and the velocity of the particle. 


E. Dynamics of Rigid Bodies 


A rigid body is defined as a system of particles 
whose mutual distances are permanently fixed. 
Like a point particle, it is an ideal concept 
introduced into mechanics to simplify the 
theoretical treatment. Actual solid bodies can 
in most cases be regarded as rigid under the 
action of forces of ordinary magnitudes. Since 
a rigid body can be imagined to be made up of 
an infinite number of particles, the equations 
of motion for a system of particles can also be 
applied to it. Thus the motion of a rigid body 
can be completely determined by the theorems 
of momentum and angular momentum. 

The momentum of a rigid body is defined by 

dr 

SE di e 
where dm is the mass of the volume element at 
a point r of the rigid body K and dr/dt is its 
velocity. If the external forces acting on K are 
denoted by F, (i=1,2,...), we have 


4Q/dt - Y; F, 


which expresses the theorem of momentum. 
If the velocity and acceleration of the cen- 
ter of gravity (center of mass or barycenter) 
[rdm/ dm of the rigid body are denoted by Vg 
and Ag, respectively, and the mass {dm by m, 
we have Q 2 me, and the theorem of momen- 
tum becomes md V; /dt 2 mA; — 2 F;. 

The angular momentum of a rigid body K 
about an arbitrary point r, is defined by 


dr 


H= — : 
k (r —rg) x T dm 


If P, is the vector from ra to the point at which 
F; acts, we have 


dH/dt = (P, x F)=G. 


This is called the theorem of angular 
momentum. 

For the case of a rigid body with one point 
ro fixed, the angular momentum H and the 
angular velocity œ are related by 


H. = Ao, — Fo, — Eo, 
H,— — Fw, + Bo,— Do, 
H, = — Ea, — Dow, + Co,, 


where H,, H,, H.; c, My, o), are the compo- 
nents of H and o in the xyz-coordinate system 
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fixed in space with its origin at the fixed point 
ro, and 


au rei B= |@e), 


SUE p= [ssim 


SIE 


with the integrals taken over the whole rigid 
body. We call A, B, and C the moments of 
inertia about the x-, y-, and z-axes, respec- 
tively, and D, E, F the corresponding products 
of inertia. The rotational motion of a rigid 
body with one axis fixed is completely deter- 
mined by the theorem of angular momentum. 
However, for rotation about a fixed point, the 
theorem is not very convenient to use, because 
A, B, C, D, E, and F are generally unknown 
functions of time. 

The *quadric 


F= | xydm 


Ax? + By? +Cz?+2Dyz+2Ezx+2Fxy=!1 


represents an 'ellipsoid with its center at the 
origin, called the ellipsoid of inertia. If the 
principal axes £, y, and ¢ are taken as coordi- 
nate axes, the equation of the ellipsoid of 
inertia becomes AC? + By? +r? =1, where A, 
B, Y are the moments of inertia about the £-, 
n-, &-axes and are called the principal moments 
of inertia, while the £-, 7-, -axes themselves 
are called the principal axes of inertia. If the 
components of the angular momentum H and 
angular velocity in the direction of the prin- 
cipal axes of inertia are denoted by (H,, H}, H3) 
and (o, oz, 04), respectively, then H, = Ao, 
H, = Bo», Hy =I w. Furthermore, if the £-, y-, 
C-components of the resultant moment of the 
external forces G = X (P, x F;) are denoted by 
(G,, G,, G4), then dH/dt = G becomes 


Ado, /dt 2 G, -(B—I)0504, 
Bdw,/dt=G,+(T—A)w3@,, 
T do/dt ^ G4--(A — B), w3. 


These are called Euler's differential equations. 

The study of the motion of a rigid body can 
be reduced mostly to the study of the motion 
of its ellipsoid of inertia, since the latter is 
attached to the rigid body. The method of de- 
scribing the motion of a rigid body by means 
of its ellipsoid of inertia is known as Poinsot's 
representation. The motions of two bodies 
having equal ellipsoids of inertia are the same 
if the external forces acting on them have 
equal resultant moments, even if their geo- 
metric forms are different. 

When a rigid body moves under no con- 
straint, the motion of its center of gravity G 
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can be determined by the use of the theorem of 
momentum. Also, the rotational motion about 
the center of gravity can be found from dH'/dt 
= G’, which is a modification of the theorem of 
angular momentum. Here H and G’ are re- 
spectively the angular momentum and mo- 
ment of external forces about the center of 
gravity. In this case also, simplification can be 
achieved by considering the equation in the 
reference system that coincides with the prin- 
cipal axes of the ellipsoid of inertia with its 
center at the center of gravity. 


F. Analytical Dynamics 


Mechanics as originally formulated by New- 
ton was geometric in nature, but later L. Euler, 
J. L. Lagrange, and others developed the ana- 
lytical method of treating mechanics that is 
now called analytical dynamics. Lagrange 
introduced generalized coordinates q, (j= 

1,2, ..., f, where f is the number of degrees 

of freedom of the system considered), which 
uniquely represent the configuration of the 
dynamical system, and derived Lagrange's 
equations of motion: 


a (08) 28 E 
dr 04; UMS j= $7». 


where 4;— dq;/dt, and 2= T— U(T — kinetic 
energy, U — potential energy) is a function of q; 
and 4; called the Lagrangian function. Later, 
W. R. Hamilton introduced 


pj —- 0T/04;, 
H=} pjdj- $— Hlp, ee 


and transformed the equations to Hamilton's 
canonical equations: 
dq; OH dp; oH 


LE a seen, j=1,2,...,f. 
dt Op; dt 0qj g f 


prs dis del 








Here p, is the generalized momentum conjugate 
to q;, and q;, p; are called canonical variables. 
If the functions representing the configuration 
of the dynamical system in terms of q; do not 
explicitly contain the time t, the Hamiltonian 
function (or Hamiltonian) H coincides with the 
total energy of the system T+ U. 

The transformation (p, g)>(P, Q) under 
which canonical equations preserve their form 
is called a canonical transformation. It is given 
by 

oW Š oW kat oW 
U^ Ug TENERE 
where W- W(qi,...,q5;:Q,,...,Qp) and K is 
the Hamiltonian of the transformed system. 
The set of canonical transformations forms a 
group, called a group of canonical transforma- 


Pj 
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tions. An infinitesimal transformation is given 
by 


where e is an infinitesimal constant. Here S 
is an arbitrary function and is said to be the 
generating function of the infinitesimal trans- 
formation. Canonical equations can be inter- 
preted to mean that the variations of p and q 
during the time interval c — dt are the infini- 
tesimal canonical transformations whose gen- 
erating function is Hip, q, t). 

The variation of an arbitrary function 
F(p, q) under an infinitesimal transformation 
is given by 





dF —&(F, S), 
where 
Ou Ov ĝu Ov O(u, v) 
&a-x( ee La 
j 04; Op; Op; 0q; j aldi pj) 


is Poisson’s bracket. Therefore the time rate of 
change of a dynamical quantity F(p, q) can be 
written as 


dF /dt =(F, H). 


Thus the function F(p,q) that satisfies (F, H)=0 
is an tintegral of the canonical equations. 

If a canonical transformation (p, q) 5(P, Q) 
such that P; — o, Q;— D; are constant is found, 
the motion of the system can be determined by 


OW ` , OW 


nr?" ran 


where W is the *Àcomplete solution of the 
Hamilton-Jacobi differential equation: 


oW (= oW 
H 


— + H| —,...,—; DEET: AE 
Or 0q, Ce 4 in) 


(— 82 Contact Transformations). 


G. Theory of Elasticity 


(1) General remarks. Suppose that a solid 
body is deformed elastically by the action of 
external forces. We may inquire about the 
magnitudes of deformation, stress, and strain 
caused by the external forces at each point 
of the body. The theory of elasticity studies 
this problem, assuming the body to be a con- 
tinuum and utilizing classical mechanics as a 
basis, and “endeavours to obtain results which 
shall be practically important in applications 
to architecture, engineering and all other use- 
ful arts in which the material of construction is 
solid” [1]. 

Cartesian coordinates (x, y, z) are employed 
for defining the 3-dimensional space contain- 
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ing the body. We confine ourselves to the 
small-displacement theory of elasticity, in 
which the components of the displacement u= 
(u, v, w) of an arbitrary point P(x, y, z) of the 
body are assumed to be small enough to jus- 
tify the linearization of the governing differen- 
tial equations and boundary conditions. 

(2) Stress. Consider an infinitesimal rectan- 
gular parallelepiped enclosed by the following 
six surfaces: 


x — const, y=const, z — const, 


x+dx=const, y+dy=const, z+dz=const, 


in the body. The stress at the point P(x, y, z) is 
defined as those internal forces per unit area 
acting on the six surfaces of the parallelepiped. 
It has nine components, which are usually 
represented by 


Ox Tyx Tzx 
Txy d y T. y |> 
Le Tyz 0; 


where the c and 7 are called normal and shear- 
ing (or tangential) stresses, respectively. These 
nine quantities form a tensor called the stress 
tensor. Since it can be shown that t,,=1,,, Tyz 
=T,y, Tex =Txz, the stress tensor is a symmetric 
tensor. By considering equilibrium conditions 
of the parallelepiped, the equations of equilib- 
rium are found to be 


Ga, Ot, Ot. 











jx y 2 oes) o toss. 
where X,... are body forces per unit volume. 
(3) Strain. The infinitesimal rectangular 
paralielepiped fixed to the body at the point P 
before deformation ts transformed, after de- 
formation, into an infinitesimal parallelepiped 
which is no longer rectangular. The strain at 
the point P is defined as the changes caused by 
the deformation of the parallelepiped: exten- 
sions of the three sides and changes from right 
angles of the three angles formed by the three 
sides of the parallelepiped. Thus the strain has 
six components, usually represented by 


(Ex, Eys Ez» Yyz» Yzx> Vxy)s 


where e and y are called elongation and shear- 
ing strains, respectively. These six quantities, 
with slight modification, form a symmetric 
tensor called the strain tensor. 

(4) Strain-displacement relations. In small- 
displacement theory, the strain-displacement 
relations are given in linear form by 


ðu dv 
Es Bue, e. yym—-—-. 
E Yee ay Ox 


(5) Stress-strain relations. In the theory of 
elasticity, stress and strain are assumed to 
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obey a linear relationship called Hooke's law: 


{a} =[A] {e}, 
where 


{o}? = [o.. Oy, Oz, Tyzs KT 1], 


(e = IL: £y; £z; P yz» Yzx» yu 


and [A] is a symmetric positive definite ma- 
trix. For an isotropic body, the relations are 


Ee, = 0x — v(o, n oz), ttt Gs, =T 


y? 


where E, v, and G = E/2(1 +v) are elastic con- 
stants called the modulus of elasticity in tension 
or Young's modulus, Poisson's ratio, and the 
modulus of elasticity in shear or the modulus of 
rigidity, respectively. 

(6) Boundary conditions. The surface of the 
body can be divided into two parts with re- 
gard to boundary conditions: the part $, over 
which the boundary conditions are prescribed 
in terms of external forces and the part S, over 
which the boundary conditions are prescribed 
in terms of displacements. Obviously ôV = 
S, 4 $,, where OV is the whole surface of the 
body. 

(7) Small-displacement theory of elastic- 
ity. We have seen that the equations which 
govern the problem are 3 equations of equilib- 
rium, 6 strain-displacement relations, and 6 
stress-strain relations in terms of 15 unknowns, 
namely, 6 stress components, 6 strain compo- 
nents, and 3 displacement components. Thus 
our problem is reduced to a boundary value 
problem in which these 15 field equations are 
to be solved under the specified boundary 
conditions. Since all the field equations and 
boundary conditions are linear with respect 
to the unknowns under the assumption that 
the displacements are small, we obtain linear 
relationships between the external load and 
resulting deformation of the body; this is the 
small-displacement theory of elasticity. It 
should be remembered, however, that the 
assumption of small displacement sets limits 
to the application of the theory to practical 
problems. 

(8) Variational principles. Several variational 
principles have been formulated in the small- 


displacement theory of elasticity. These include ` 


the principle of minimum potential energy 
[u], the generalized principle Te, ¢, uj, the 
Hellinger-Reissner principle [6, u], the prin- 
ciple of minimum complementary energy 
[c], and so forth, where the symbols in the 
brackets represent independent functions 
subject to variation. In connection with the 
aforementioned variational principles, vari- 
ational principles with relaxed continuity re- 
quirements have also been formulated by re- 
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laxing the continuity requirements imposed 
on admissible functions. 

One of the practical advantages of these 
variational principles is that they often provide 
the problem with approximate formulations 
and approximate methods of solution, among 
which the Rayleigh-Ritz method is well known. 
Theories of beams, plates, shells, and multi- 
component structures are typical examples 
of such approximate formulations. Recently, 
these variational principles have been found to 
provide effective bases for the formulations of 
the finite element method. 

(9) Notation. Various symbols are used for 
stress and strain. For example, c, t and e, y are 
commonly used in the engineering literature. 
However, in Love's treatise [6], X,, Y,, Z, and 
exx» €, €,, are used in place of o. T,,, Zen and 
£x; ) xy» 7x, respectively. Also, various nota- 
tions are used for elastic constants. E is widely 
used, while G and v are less common. Love 
uses 4 and o for G and v, respectively. In the 
engineering literature the reciprocal of Pois- 
son's ratio m (= 1/v) is used and is called the 
Poisson number. 

(10) Finite-displacement theory of elastic- 
ity. When the displacement of the body is 
no longer small (infinitesimal) but is finite, 
we should abandon small-displacement the- 
ory and instead employ finite-displacement 
theory, in which stress and strain are defined 
in a manner different from that in small- 
displacement theory, keeping in mind the 
difference between spatial and material vari- 
ables. Thus equations of equilibrium, strain- 
displacement relations, and the boundary 
conditions on $, become nonlinear equations 
with stress and displacement components as 
unknowns, although the stress-strain relations 
remain linear. Thus the problem is reduced to 
solving a nonlinear boundary value problem, 
sometimes called a nonlinear elasticity prob- 
lem. Variational principles have been formu- 
lated for finite-displacement theory and are 
frequently used in the formulation of approxi- 
mate methods of solution. 

When the stress becomes large enough to 
exceed the so-called elastic limit, where the 
linear stress-strain relationships cease to hold, 
the theory of elasticity is no longer valid and 
should be replaced by the theory of plasticity 
[9, 10]. 
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A. General Remarks 


A single-valued 'analytic function in a domain 
D in the complex plane C is called meromor- 
phic in D if it has no singularities other than 
tpoles. A function that is meromorphic in 

the whole complex plane including the point 
at infinity is a rational function (Liouville’s 
theorem). Specifically, if a function is mero- 
morphic in the domain C, then the function is 
called simply a meromorphic function, and a 
meromorphic function that is not a rational 
function is called a transcendental meromor- 
phic function. A meromorphic function f(z) 
can be represented as a quotient of two tentire 
functions. Let {z,} (k= 1,2, ...) be poles of f(z), 
and let f,(z) — af? /(z — z)" +... -- a? (z—z,) 
denote the ‘singular parts of f(z) at z; (k= 
1,2,...). Then f(z) can also be written in the 
form 


fl2)=9(2) £x (f n2). 


where g(z) is an entire function and the p,(z) 
(k — 1,2, ...) are rational entire functions 
(Weierstrass's theorem). Assume that a se- 
quence {z,} (k2 1,2, ...) converges only to the 
point at infinity and that f,(1/(z—z,)) (k= 

1,2, ...) are rational entire functions of 1/(z— 
z,) which have no constant terms. Then 
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there exists a meromorphic function of z with 
f,A/(z —2z,)) as its tsingular part at z, (Mittag- 
Leffler's theorem). 


B. Nevanlinna Theory 


The theory of meromorphic functions can be 
considered an extension of the theory of entire 
functions. In particular, value distribution 
theory, originating in Picard's theorem, was 
studied by many people, and in 1925 R. Nevan- 
linna published a systematic theory unifying 
the results obtained until then. This is called 
Nevanlinna theory. 

We let f(z) denote a meromorphic function 
in |z| € R< +œ, and when we say that f(z) 
takes on a value, the value may be oc. For a 
value a, n(r, x) denotes the number of a-points 
of f(z), i.e., points z with f(z)=a, in |z| &r « R, 
where each a-point is counted with its multi- 
plicity. We set 


N(r, apm [ 85217089 a a)logr, 


0 


Eq 1 
niam | log* |__| 40 








if a 2 oo, and 





Ni, o | RE =) dt 4 n(0, 00) logr, 


0 


2n 
mr, SE log * |f(re'?)| de 
2x Jo 
if x= oo, where log* a= max(loga, 0) for a» 0. 
The functions N and m are called the counting 
function and proximity function of f(z), respec- 
tively, and T(r)= T(r, f) mir, oo) - N (r, oo) is 
the order function (or characteristic function) of 
f(z). T(r) is an increasing function of r and a 
*convex function of logr and is useful for ex- 
pressing f(z) as an infinite product, etc. 

The following relation holds among T(r), 
mir, x), and N (r, «) for any a: 


T(r) 2 mir, a) t- N(r, a) + O(1), (1) 


where O(1) is *Landau's symbol (Nevanlinna's 
first fundamental theorem). By this theorem, if 
a bounded remainder is disregarded, then 
mir, a) - N(r, a) is equal to T(r) for all æ. This 
equality thus demonstrates a beautifully bal- 
anced distribution of «-points. 

We see that N(r, a) is in a sense the mean 
value of the number of a-points in |z| & r, and 
mir, x) is the mean proximity to « of the value 
f(z) on |z|=r. If the term log* in the definition 
of the proximity function is replaced by the 
logarithm of the reciprocal of the chordal 
distance between Treff) and x on the com- 
plex sphere, then the remainder term in (1) is 


1011 


eliminated. Hence the definition of the proxim- 
ity function is sometimes given in this form. 


C. The Order of Meromorphic Functions 


For an entire function f(z), the equality 


. log T(r) .. loglog M (r) 
limsup -limsup—— — —— 

y bod ogr DË logr 
holds, where M(r)= Mir, f) 2 max... |f (2). 
Since the right-hand side is the order of f(z) 
(— 429 Transcendental Entire Functions), 
we define the order (lower order) p of a mero- 
morphic function f(z) by 


log T(r) . . ,log T(r) 
lim inf i 
r r> 


oo ogr 











p=limsup , 
The order of a meromorphic function in |z| « R 
is also defined by 

log T(r) 


= lim sup ————_-—_. 
P= R log /R—r)) 


D. Meromorphic Functions on a Disk 


The order function T(r) is bounded if and 
only if f(z) can be represented as the quo- 
tient of two bounded holomorphic functions 
h,(z), h;(z) in |z| « R (Nevanlinna). If T(r) is 
bounded, Im, f (re'?) exists and is finite 

for every 0, 0 « 0 « 2n, except possibly for a 
set with tlinear measure zero (P. Fatou and 
Nevanlinna). Among functions f(z) such that 
lim, ,4 T(r) = oo, those satisfying 


T(r) 


lim sup ———_——— = oo 

ror log(l/(R—r)) 
have properties similar to those of transcen- 
dental meromorphic functions. 


E. Meromorphic Functions in the Whole Finite 
Plane 


Any meromorphic function such that 


T(r) 
lim sup-——< K 
n logr 

is a rational function. If f(z) is a meromorphic 
function of order p and {r(«)}, ría) <r; (2) 
(j 21,2, ...) is the set of absolute values of «- 
points, then 2/7*,(1/rí(a))" '* converges for any 
a. Furthermore, 


fle)=zter 
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where p is the smallest integer satisfying p+ 1 
> p, the a, and b, are the zeros and poles of 
f(z), respectively, k is an integer, and P(z) is a 
polynomial of degree at most p (Hadamard’s 
theorem). 

Let o, Ge (q > 3) be distinct values. Then 
for any meromorphic function in |z| « R € oo, 


(a-2)T()« Y. N(r x) - Malz DO), 


O<r<R. 


Nn» f OTO den, Olor 


n, (r) is the number of *multiple points in |z| <r 
(a multiple point of order k is counted k— 1 
times), and D(r) is the remainder such that if R 
= oo, then D(r) « K(log T(r) - logr) for some K 
except possibly for values of r belonging to the 
union of a countable number of intervals with 
finite total length, and if R « oo, then D(r) « 
K (log T(r) + log(1/(R — r))) except possibly for 
the union of a countable number of intervals 
{L} with X; fr, d(1AR —r)) « oo (Nevanlinna's 
second fundamental theorem). 
Several theorems on value distribution of 
meromorphic functions can be obtained di- 
rectly from this theorem. For instance, if f(z) is 
a transcendental meromorphic function, the 
equation f(z)=« has an infinite number of 
roots for every value « except for at most 
two values called Picard's exceptional values 
(Picard's theorem). For a meromorphic func- 
tion of order p, Wm, 2, .,(n(0)) ^ (4 « p) 
diverges for every value a except for at most 
two values (Borel’s theorem). A value « for 
which the series converges is called a Borel 
exceptional value. We call ó(x) — ó(a, f) - 1 
—limsup,.,,, N(r, «)/T(r) the defect of f. It 
always satisfies 0 x ó(x) x 1, and the values 
with d(«)>0 are called Nevanlinna's excep- 
tional values. The number of values « (may be 
œ) with ó(x)- 0 is at most countable for any 
meromorphic function f(z), and EZ, ó(a;) <2. 
There are many studies concerning the values 
a with ó(x) 20 


F. Julia Directions 


Among functions that have an essential sin- 
gularity at the point at infinity and are mero- 
morphic in the whole plane, there are some 
that possess no tJulia directions. These func- 
tions, called Julia exceptional functions, are 
of order 0. A necessary and sufficient condi- 
tion for f(z) to be a Julia exceptional func- 
tion is that f(z) can be written in the form 

z" [L,(1 —z/a,)/TI,(1 — z/b,) (A. Ostrowski, 
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1925). Concerning zeros a, and poles b, of f(z), 
the following three properties are obtained 
using the theory of *normal families due to P. 
Montel: (1) There are constants K,, K,, and 
K, independent of r such that |n(r, oo) — n(r, 0)| 
« K,, nQr, oo) - n(r, o0) « K,, and n(2r,0) — 
n(r, 0) « K3. (2) There are constants K, and 

K; such that for any p and q, 














a a 
p P 
lay" eg | I] b. «Ki, 
lal «lal a, Ibi «lagi v 
b b 
77 TE el D ejes. 
dad | By |] layl<lbgl | Ey 














(3) There exists an ¢>0 satisfying |a,/b,—1|> 
é>0 for any p and q. 

G. Valiron gave a precise form of Julia 
directions that corresponds to Borel’s theorem 
(Acta Math., 52 (1928), [3]). Namely, if the 
order p of a meromorphic function f(z) in |z] « 
oo is positive and finite, then there exists a 
direction J defined by argz=« such that the 
zeros z,(a, A) of f(z) —a in any angular domain 
A:|argz—a|<6 containing J have the prop- 
erty X,|z,(a, A)| "^? = œ for any e>0 except 
for at most two values of a. The direction J is 
called a Borel direction. 


G. Relations between Two or More 
Meromorphic Functions 


Borel’s unicity theorem can be stated as fol- 
lows: Let f; (j — 1, ..., n) be nonvanishing entire 
functions satisfying 27.., f, — 1; then for some 
(Cis ---,¢,) (0, ...,0), 25.1 6;f;— 0. This is 
contained in the following theorem: Let f; 
(j=1,...,n) be transcendental entire functions 
such that 25. j= 1; then ZE, 6(0, f) &«n— I. 
If two meromorphic functions f| (z), f;(z) have 
the same «points for five distinct values o; (j 
=1,...,5) (where multiplicity is not taken into 
account), then they coincide everywhere. If the 
*Riemann surface of the talgebraic function 
w(z) defined by a polynomial P(z, w)=0 of z, w 
is of tgenus > 1, it is impossible to find mero- 
morphic functions z = f (¢), w = g(C) that satisfy 
PIC g(€)) =0 (tuniformization by meromor- 
phic functions). 


H. Asymptotic Values 


If a meromorphic function f(z)—>& as z ^ oo 
along a curve C, the value « and the curve 

C are called an asymptotic value and asymp- 
totic path, respectively. Each (Picard's) excep- 
tional value of f(z) is an asymptotic value. For 
meromorphic functions, no simple relation is 
known between their order and the number of 
their asymptotic values. There exists a merom- 
orphic function of order 0 with an infinite 
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number of asymptotic finite values. Some 
results analogous to those for entire functions 
are obtained for meromorphic functions by 
applying the theory of normal families. F. 
Marty established a systematic theory of 
normal families of meromorphic functions by 
using spherical distance. 


I. Inverse Functions 


Generally, the inverse function of a meromor- 
phic function w= f(z) is infinitely multiple- 
valued. Let P(w, wo) be a function element of 
the inverse function with center at wo, and let 
C be an arbitrary curve starting at wọ and with 
o its terminal point. For any domain S con- 
taining C, P(w, wọ) can be continued analyti- 
cally in S up to a point arbitrarily near o 
(Iversen's theorem). Continue the function 
element analytically along each half-line start- 
ing at its center. Then the set of arguments of 
half-lines along which the analytic continu- 
ation meets a singularity at a finite point is of 
zero linear measure (Gross's theorem). 

By considering the inverse image of the 
suitably cut Riemann surface of the inverse 
function, the z-plane can be divided into fun- 
damental domains such that each domain is 
the inverse image of the whole w-plane (with 
suitable slits removed) and has a boundary 
each point of which is taccessible from the 
inside of the domain, and the boundary curves 
of fundamental domains cluster nowhere in 
the plane. 

For a meromorphic function f(z), the set 
of functions z' = mt) defined by f(z’) = f(z) 
De, transformations between points that give 
f(z) the same value) has the property of a 
thypergroup. If @(z) is single-valued, then it 
is a linear entire function, and if it is finitely 
multiple-valued, then it is an algebraic func- 
tion. The fcluster set of the inverse function at 
a transcendental singularity consists of only 
one point, oo, that is, it is an fordinary sin- 
gularity. To an analytic continuation along a 
curve that determines a transcendental sin- 
gularity of the inverse function there corre- 
sponds a curve in the z-plane terminating at 
oo. This curve is an asymptotic path of f(z). 
Namely, the value f(z) tends to the coordinate 
of the transcendental singularity as z — oo 
along this path. Each asymptotic value of a 
transcendental meromorphic function w= f(z) 
corresponds to a transcendental singularity of 
its inverse function z = (w), and if we consider 
two asymptotic paths to be the same if they 
correspond to the same singularity, then there 
exists a one-to-one correspondence between 
the set of asymptotic paths and the set of 
transcendental singularities of the inverse 
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function. The inverse function of any mero- 
morphic function of order p has at most 2p 
‘direct transcendental singularities if p > 1/2 
and at most 1 such singularity if p < 1/2 (L. V. 
Ahlfors). 


J. Theory of Covering Surfaces 


Ahlfors established the theory of covering 
surfaces by a metricotopological method and 
in applying it, obtained Nevanlinna theory 
and many other results on meromorphic 
functions. 

Let F, denote the covering surface of the 
Riemann sphere F with radius 1/2; F, is the 
image of |z|<r under a meromorphic function 
w= f(z). The area of F, divided by z, where z is 
the area of Fo, is given by 


LC 
vu) la meom ^ 404" 


z 5 pef? 


and is called the mean number of sheets of F,. 
The length of the boundary of F, is given by 


If'(2)| 
|z|=r 1+ IAEN? 


The relation 


L(r)= Wel 


To= | Paceon 


holds (T. Shimizu, Ahlfors). 

Consider the Riemann surface of the inverse 
function of a meromorphic function w= f(z) in 
|z| « R € +. It has a countable number of 
components Q, over a domain on the w-plane. 
Let A, denote the inverse image of Q, on the z- 
plane. If A, together with its boundary is con- 
tained in |z| < R, the component Q, is called an 
island, and otherwise, a peninsula. 

Let D be a simply connected domain of the 
w-plane, n(r, D) be the sum of the sheet num- 
bers of the islands of E over D, and mir, D) be 
the sum of the areas of the peninsulas of F, 
over D divided by the area of D. Then 
mir, D) - n(r, D) 2 A(r) - O(L(r)). 

Let Dj (j— 1, ..., q) (q È 3) be disjoint simply 
connected domains on the w-plane. Then 


b n(r, D) — y 


ni(r, D) 7 (q — 2) A(r) - O(L(r)), 
where n, (D) is the sum of the orders of branch 
points in all islands of F, over D. 

For a meromorphic function f(z), L(r) 
A(r) ^ *, where r satisfies 0 & r < oo except 
forre Ui for some intervals J,. Hence in the 
case where D; is a point a, this inequality 
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yields 


»(q—2)A(r) - 0(A(Q)'**) 


with some exceptional intervals of values of r. 
This latter important inequality corresponds 
to Nevanlinna's second fundamental theorem. 
Let D; (j— 1, ...,q) (q = 3) be disjoint simply 
connected domains on the w-plane. If every 
simply connected island over D; has at least ju 
sheets, then 277 ., (1 — (1/1) <2 (disk theorem). 
It follows from this theorem that given three 
disjoint disks D; on the Riemann sphere, there 
is at least one D; that has an infinite number 
of islands over it, and also that given five D 
there exists at least one D; that has a 1-sheeted 
island over it (Ahlfors's five-disk theorem). This 
theorem corresponds to tBloch’s theorem for 
entire functions. These theorems can also be 
obtained for meromorphic functions on a disk. 
Ahlfors established a more important theory 
by introducing a differential metric. 


K. Recent Development 


The Nevanlinna brothers raised several im- 
portant problems, which gave strong motiva- 
tion for later investigations. The first major 
breakthrough after World War II was given by 
A. Gol'dberg in 1956. He gave an example 
which has infinitely many deficient values 
(Nevanlinna's exceptional values). W. Hayman 
proved that > d(a, f converges for «> 1/3, 
and there are examples of meromorphic func- 
tions for which the series diverges for a < 1/3. 
Finally, A. Weitsman showed that the series 
converges for «= 1/3. The second major break- 
through was given by A. Edrei and W. Fuchs 
in 1959, whose works concern the following 
Nevanlinna theorem: Let K( f) be 


; N(r,0)+ N(r, oo) 
limsup Tf) 


and x(p)=inf K(f), where inf is taken over all 
meromorphic function f of order p. Then x(p) 
> 0 if p is neither a positive integer nor oo. 
Furthermore, Nevanlinna gave a conjecture 
for an exact value of x(p). This conjecture is 
still open, although several estimates have 
appeared. In the case of entire functions hav- 
ing only negative zeros the conjecture was 
positively solved by S. Hellerstein and J. Wil- 
liamson. Edrei and Fuchs proved that x(p)— 1 
forO<p<1/2 and k(p)=sinzp for 1/2«p« 1. 
They also proved the ellipse theorem: Let f(z) 
be a transcendental meromorphic function of 
order p (O<p< 1). Put u21—ó(a, f), v=1— 
ó(b, f). Then u, ve[0, 1] and u? —2uvcos xp + 
v? 2 sin? xp. If further u « cos zp, then v— I. 
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For the sum of deficiencies Edrei proved 
that 


Case) 


unless the number of deficient values is one. It 
was conjectured that È ó(a, f) 22 implies that 
the order p of f is a half integer, the number v 
of deficient values is at most 2p, and the value 
of ó(a, f) is a multiple of 1/p. For the case of 
entire functions this conjecture is true (A. 
Pfluger). Weitsman proved that v € 2p. Other 
cases remain open. All the above results still 
hold even if the order is replaced by the lower 
order. The inverse problem was completely 
solved by D. Drasin. Many of the above re- 
sults depend on the concept of Pólya peaks. 

N. V. Govorov and V. Petrenko proved 
independently that 


. , Jog Mfr, f) 
lim inf — —— —— 


pues SEU 
for every entire function of order p. This was a 
conjecture made by R. Paley. For p < 1/2, an 
exact upper bound zp/sin zp was given by 
Valiron. Furthermore, the following result was 
proved by Edrei and Fuchs: 


N(r, 0) sin 7p 
ogM(r.f) np 


1—coszp (O<p<1/2) 
(1/2 «px 1) 


2—sinzp 


«np for p>1/2 


limsup 


roo d 





(0x p « 1). 


L. History 


The value distribution theory of meromorphic 
functions had its inception with the classical 
Picard theorem. It first appeared as the value 
distribution theory of entire functions and was 
developed into a well-organized field by way 
of the Nevanlinna theory and the Ahlfors 
theory of covering surfaces. In recent years, 
emphasis has also been placed on the study of 
meromorphic functions on open Riemann 
surfaces (— 367 Riemann Surfaces). The value 
distribution of a set of several meromorphic 
functions was studied first by A. Bloch and 
developed into the study of meromorphic 
curves by Ahlfors, and H. and J. Weyl [2]. The 
behavior of meromorphic functions in neigh- 
borhoods of general singularities has also been 
studied. An example of results in that field is 
the theory of fcluster sets. 
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A. General Remarks 


The distance between two points x =(x,,..., 
x,) and y Z(y,, ..., y,) in the n-dimensional 
TEuclidean space R” is defined by p(x, y) — 


JO1—x1) +... (y, A, The function 
p(x, y) is nonnegative for every pair (x, y) and 
has the following properties: (i) p(x, y)=0 if 
and only if x = y; (ii) p(x, y) 2 p(y, x); and 

(iii) p(x, z) € p(x, y)+ p(y, z) for any three 
points x, y, z. Property (iii) is called the 
triangle inequality. 





B. Definition of Metric Spaces 


Abstracting the notion of distance from Eu- 
clidean spaces, M. Fréchet defined metric 
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spaces [1] (1906). A metric on a set X isa 
nonnegative function p on X x X that satisfies 
(i), (11), and (iii) of Section A, and a metric space 
(X, p), or simply X, is a set X provided with a 
metric p. The members of X are called points, 
p is called the distance function, and p(x, y) is 
called the distance from x to y. The distance 
function is sometimes denoted by d(x, y) or 
dis(x, y). If (i) is replaced by its weaker form (1’) 
p(x, x) «0, the function p is called a pseudo- 
metric (or pseudodistance function), and X is 
called a pseudometric space. 

Examples of metric spaces: 

(1) The n-dimensional Euclidean space R", in 
particular the real number system R with 
po Gc y) 21x — y|. (2) The *function space 
L ,(Q). (3) The tfunction space C(O). (4) The 
tsequence space s, i.e., the space R of all se- 
quences of real numbers with metric p(x, y) — 
22 "px,— Yall Ix, — Yal), where x = 
(x4, X2, ...) and y z(y,, yz, ...). (5) The tse- 
quence space m, Le, the space of all bounded 
sequences of real numbers with metric 
p(x, y) 2 sup,|x, —y,| for x 2 (x4, X2, ...) and 
Vi, ya. --.). (6) A Baire zero-dimensional 
space (QN, p), where Q is a set and the dis- 
tance p(x, y) between x 2(x,, x5, ...) and y= 
(Y1; Y2, -..) is equal to the reciprocal of the 
minimum 7 such that x, zz y,. When the car- 
dinal number t of Q is specified, the Baire 
space is denoted by B(t). (7) For any set X, 
define p by setting p(x, x)=0 and p(x, y)=1 
when x z y. Then (X, p) is a metric space, 
called a discrete metric space. (8) For any set 
X, define p by setting p(x, y) 20 for any mem- 
bers x and y. Then p is a pseudometric, and 
the resulting space X is called an indiscrete 
pseudometric space. 

For a subset M of a metric space X, 
sup { p(x, y)| x. ye M} is called the diameter 
of M (denoted by d(M)), and M is said to 
be bounded if its diameter is finite (includ- 
ing M = Ø). For two subsets A, B of X, 
inf p(x, y)| xe A, ye B} is called the distance 
between A and B, denoted by p(A, B). We 
have p(A, B)= p(B, A). When a family W= 
(M;|A4€ Aj of subsets of X is a covering of 
X, ie, X =|], M,, the supremum of the dia- 
meters d(M,) of M; in WM, sup{d(M,)|4e Aj, 
called the mesh of the covering WM. For a posi- 
tive number e, a covering whose mesh is less 
than e is called an e-covering. A metric space 
X 1s called totally bounded (or precompact) 
(F. Hausdorff, 1927) if for each positive num- 
ber e there exists a finite e-covering of X. 

A subset X, of a metric space X becomes a 
metric space if we define its metric p, by set- 
ting p,(x, y) 2 p(x, y) for x, ye X,, where p is 
the metric of X. The space (X,, p) is called a 
metric subspace of (X, p). A subset of X is 
called totally bounded (or precompact) if it is 
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totally bounded as a metric subspace. Any 
totally bounded subset is bounded. Converse- 
ly, in the Euclidean space R" any bounded 
subset is totally bounded. 

A bijection f from a metric space (X,, pi) 
onto a metric space (X,, p2) is called an iso- 
metric mapping if f preserves the metric, Le, 
paCf (x), JO) = py (x y) for any points x, ye X,; 
and X, and X, are called isometric if there 1s 
an isometric mapping from X, onto X;. 

Let X be a metric space with metric p, and 
left f be an injection from a set Y into X. 
Then the function p'(y1, y;) 2 o(f(y1). f(v2)) 
(iv, y5€ Y) is a distance function on Y, and 
with this metric the set Y becomes a metric 
space called the metric space induced by /; 

f is an isometric mapping from (Y, p’) onto 
(CY), p). 
For a finite number of metric spaces (X,, p4) 
<-> (Xn Pn), we can define a metric p on their 
Cartesian product X = X, x... x X, by 
setting 


p(x, y)2 pas y, + Lt Dax. Vn) j 


for two points x 2(x,,..., X, y (y, +--+ Yn) 

of X. Thus we obtain a metric space (X, p), 
called the product metric space of (X,,/,), ..., 
(X,, Pn). The n-dimensional Euclidean space R” 
is the product metric space of n copies of the 
real line (R, po). 





C. Topology for Metric Spaces 


For a point x of a metric space (X, p) and 
any positive number e, the set U,(x) of all 
points y such that p(x, y) «e is called the g- 
neighborhood (or ¢-sphere) of x. We can intro- 
duce a topology for X by taking the family of 
all e-neighborhoods as a tbase for the neigh- 
borhood system (— 425 Topological Spaces). 
Then the following five propositions hold, any 
one of which can be used to define the same 
topology: (1) A subset O is *open if and only if 
for any point x in O there is a positive number 
e such that the e-neighborhood of x is con- 
tained in O. (ii) A subset F is *closed if and 
only if any point whose every e-neighborhood 
contains at least one point of F is contained 
in F. (ut) A subset U is a neighborhood of a 
point x if and only if U contains some e- 
neighborhood of x. (iv) A point x is an finte- 
rior point of a subset A if and only if some £- 
neighborhood of x is contained in A; the inte- 
rior A’ of A is the set of all such points. (v) A 
point x is adherent to a subset A if every e- 
neighborhood of x contains at least one point 
of A; the closure A of A is the set of all such 
points, and xe A if and only if p(x, A)=0. 
Every metric space X satisfies the 'first 
countability axiom. A metric space X isa 
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tHausdorff space and, more specifically, a tper- 
fectly normal space; it is also tparacompact. 

In the same way, we can define a topology 
for each pseudometric space that satisfies the 
first countability axiom, but a pseudometric 
space is not necessarily Hausdorff. 


D. Convergence of Sequences 


A sequence {x,} of points in a metric space is 
said to converge to a point x (written Im... x, 
=x) if p(x,, x) tends to zero as n> oo. The 
point x is called the limit of (x,j. This conver- 
gence is equivalent to convergence with re- 
spect to the topology defined in Section C (— 
87 Convergence). As the first countability 
axiom is satisfied, we may define the topology 
by means of convergent sequences of points: 
the closure A of a subset A is the set of all 
limits of sequences of points in A. 


E. Separable Metric Spaces 


For a metric space X the following three con- 
ditions are equivalent: (1) There exists a coun- 
table family D, of open sets of X such that 
each open set of X is the union of members of 
Do (tsecond countability axiom). (ii) X is tsep- 
arable, that is, X has a countable subset that is 
*dense in X. (iii) Every open covering of X has 
a countable subcovering (t Lindelóf space). A 
metric space with any of these properties is 
called a separable metric space. The sequence 
space s is separable. Any separable metric 
space can be isometrically embedded in the 
sequence space m, i.e., is isometric to a sub- 
space of m (— 168 Function Spaces B). 


F. Compact Metric Spaces 


For a metric space X, the following five con- 
ditions are equivalent: (i) X is tcompact, that 
is, every open covering of X has a finite sub- 
covering. (ii) X is *countably compact, that is, 
every countable open covering of X has a 
finite subcovering. (iil) X is tsequentially com- 
pact, that is, any sequence of points in X has a 
convergent subsequence. (iv) Every nested 
family F, > F, >... of nonempty closed sets 

of X has a nonempty intersection. (v) Every 
infinite subset M of X has an accumulation 
point x, i.e., xe M — {x}. A metric space satis- 
fying any of these conditions is called a com- 
pact metric space (M. Fréchet [1]). Every real- 
valued continuous function defined on a com- 
pact metric space has a maximum and a min- 
imum. A metric space is compact if and only 
if it is totally bounded and complete (— Sec- 
tion J). Every totally bounded metric space is 
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separable. In particular, every compact metric 
space is separable. 

Let U={U,} be an open covering of a com- 
pact metric space X. There exists a positive 
number ô such that every set with d(A) « ó is 
contained in some U,. The number ô is called 
the Lebesgue number of the open covering UI. 

A subset A of a metric space is said to be 
compact if it is compact as a metric subspace, 
and A is said to be relatively compact if its 
closure is compact. Bounded closed sets in R", 
in particular closed intervals of real numbers, 
are compact. For these sets, conditions (i), (iv), 
and (v) are called the Heine-Borel theorem (or 
Borel-Lebesgue theorem), Cantor's intersection 
theorem, and the Bolzano-Weierstrass theorem, 
respectively. 


G. Product Spaces of Metric Spaces 


Let (X,, Pi), ---, (Xn, p,) be metric spaces. Then 
the Cartesian product X = X, x... x X, has 
distance functions 


PAX, y) iG Y? +- + yx. NET, 


pzi 
and 
Pa (x, y) max (pi (Xp, Y1), «s ub Ys 
where x «(x,,...,x,) and v—(y,,..., y,). The 


topology of X induced by each one of these 
metrics coincides with the product topology. 
In particular, for the n-dimensional Euclidean 
space R”, the metrics p, (p > 1) and p, define 
the same topology. 

Let (X,, P1) .... (X, Pn), ... be a countable 
number of metric spaces. If we define a metric 
p on the Cartesian product X = [I7-, X, by 
e 1 Pan Yn) 


X, = H 
2 e à 2" 1 F5, Yn) 





where x =(X,,X2,...) and y z(y,, ys, ...) then 
the topology defined by p is identical with the 
product topology. For the Cartesian product 
of an uncountable number of metric spaces, we 
cannot construct a metric p such that the 
topology induced by p agrees with the product 
topology in general. 


H. Uniformity of Metric Spaces 


Every metric space X is a uniform space, for 
which we may take a countable number of 
subsets {(x, y) p(x, y) «2 ", n=1, 2,..., of 
X x X as a base of tuniformity (— 436 Uni- 
form Spaces). 
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I. Uniform Continuity 


A mapping f from a metric space (X, p) into a 
metric space (Y, ol is tcontinuous if for any 
point x in X and any positive number e there 
is a positive number 6 such that f(U,(x)) c 

V.( f(x)), where U(x) is a ó-neighborhood for 
p and V,(y) is an e-neighborhood for o; that 
is, p(x, x') « ó implies o( f(x), f(x')) « e. In this 
case, we must generally choose ó depending 
on x and e In the special case where we can 
choose 6 depending only on e, independently 
of x, we call f uniformly continuous in X. (The 
notion of uniform continuity may be general- 
ized to uniform spaces.) Not every continuous 
mapping is necessarily uniformly continuous, 
but every continuous mapping from a com- 
pact metric space into a metric space is uni- 
formly continuous. 


J. Complete Metric Spaces 


A sequence (x, of points in a metric space 

(X, p) is called a fundamental sequence (or 
Cauchy sequence) if p(x, x,,) ^0 as n, m oc. 
Every convergent sequence is a fundamental 
sequence, but the converse is not always true. 
A metric space is called complete if every fun- 
damental sequence in the space converges to 
some point of the space (M. Fréchet [1]). A 
topological space that is homeomorphic with a 
complete separable metric space is sometimes 
called a Polish space (— 22 Analytic Sets I). 
The metric spaces introduced in examples (1) 
through (5) of Section B are complete. (In 
example (3) we must assume that the space Q 
is a compact Hausdorff space.) A metric space 
is compact if and only if it is complete and 
totally bounded. A flocally compact metric 
space is homeomorphic to a complete metric 
space. 

For a metric space X, we can construct a 
complete metric space Y such that there is an 
isometric mapping q from X onto a dense 
subspace X, of Y (F. Hausdorff, 1914). Such a 
pair (Y, o) is called a completion of X. If X has 
two completions (Y,, g,) and (Y,, @,), then 
there is an isometric mapping f from Y, onto 
Y, with o; = f o ọ,. In this sense the comple- 
tion of X is unique. By identifying X with 
Q(X) when (Y, o) is the completion of X, any 
metric space can be regarded as a dense sub- 
space of a complete metric space. For example 
the completion of the rational number system 
Q is the real number system R. A metric space 
is totally bounded (— precompact) if and only 
if its completion is compact. 

Baire-Hausdorff theorem: In a complete 
metric space every set of the ffirst category is a 
*boundary set. That is, every set that can be 


3 
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expressed as the union of a countable number 
of sets whose closures have no interior point 
has no interior point. In other words, if the 
union | J7., F, of closed sets F,, F;,... of X has 
an interior point, then at least one of the F, 
must have an interior point. 


K. The Metrization Problem 


A topological space X 1s called metrizable if 
we can introduce a suitable metric for X which 
induces a topology identical to the original 
one. A !Tj-space satisfying the second counta- 
bility axiom is metrizable if and only if it is 
tregular (Uryson-Tikhonov theorem; P. S. 
Uryson, Math. Ann., 94 (1925), A. Tikhonov, 
Math. Ann., 95 (1925)). However, a metric 
space does not necessarily satisfy the second 
countability axiom. Therefore, the Uryson- 
Tikhonov theorem does not provide a neces- 
sary and sufficient condition for metrizability. 
The following are some necessary and sufficient 
conditions for a topological space X to be 
metrizable: 

(1) There exists a nonnegative real-valued 
function d on X x X satisfying the first two 
axioms given in Section A and the following 
condition: There exists a real-valued function 
(w) that converges to zero as w—0O such that, 
for any three points x, y, z and any positive 
number e, d(x, y) « (e) and d(y, z) « (£) imply 
d(x, z) « e (E. W. Chittenden, Trans. Amer. 
Math. Soc., 18 (1917)). 

(2) X is a T,-space that has a countable 
number of open coverings 9t, , 99t,, ... satisfy- 
ing the following two conditions: (i) If U,, 

U eM, have a common point, there is a set 
Uewm, with U >U, U U}; (ii) for any point x 
in X, if U, is any member of M, containing x, 
the family {U,},,-1,2,... is a base for the neigh- 
borhood system of x (P. S. Aleksandrov and 
Uryson, C. R. Acad. Sci., Paris, 177 (1923), and 
N. Aronszajn). When X is a uniform space, 
this amounts to saying that X has a metric 
compatible with the uniform structure if and 
only if X is a T,-space and has a countable 
base of uniformity. 

(3) X is a T,-space that admits a countable 
number of open coverings M, Wi,, ... such 
that (S(S(x, M;), 9t)]i,j — 1,2, ...} is a base for 
the neighborhood system of x at each point of 
X, where S(A,9X) is the ‘star of A relative to 
M (R. L. Moore, Fund. Math., 25 (1935); K. 
Morita, Proc. Japan Acad., 27 (1951); A. H. 
Stone, Pacific J. Math., 10 (1960); A. V. Arkh- 
angel’skii, Dokl. Akad. Nauk SSSR, 2 (1961)). 

(4) X is regular and has a to-locally finite 
open base (J. Nagata, J. Inst. Polytech. Osaka 
City Univ., 1 (1950); Yu. M. Smirnov, Uspekhi 
Mat. Nauk, 6 (1951)). 
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(5) X is regular and has a to-discrete open 
base (R. H Bing, Canad. J. Math., 3 (1951)). 

(6) X is a tcollectionwise normal Moore 
space (— below; Bing, ibid.). 

(7) X is a *perfect image of a subspace of a 
Baire's zero-dimensional space (Morita, Sci. 
Rep. Tokyo Kyoiku Daigaku, sec. A, 5 (1955)). 

(8) X is a Hausdorff *M-space such that the 
diagonal is a 'G,-set in the direct product 
X x X (A. Okuyama, Proc. Japan Acad. 40 
(1964); C. J. R. Borges, Pacific J. Math., 17 
(1966); J. Chaber, Fund. Math., 94 (1977)). 

A regular space is said to be a Moore space 
if it has a countable number of open coverings 
M; such that (S(x, 991) is a base for the neigh- 
borhood system of x for any point x. A Moore 
space is not necessarily metrizable. F. B. Jones 
(Bull. Amer. Math. Soc., 43 (1937)) proved 
under the assumption 25° < 2": that every 
separable normal Moore space is metrizable 
and asked whether or not every normal Moore 
space is metrizable. (3) and (6) are partial 
answers to the question. A normal Moore 
space is metrizable if it is locally compact and 
"locally connected (G. Reed and P. Zenor). The 
existence of a nonmetrizable separable normal 
Moore space is consistent with and indepen- 
dent of the axioms of the usual ZFC set theory, 
the tZermelo-Fraenkel set theory with the 
taxiom of choice (F. D. Tall). W. G. Fleissner 
(Trans. Amer. Math. Soc., 273 (1982)) con- 
structed a normal nonmetrizable Moore space 
assuming an axiom weaker than the contin- 
uum hypothesis, while P. J. Nyikos (1980) has 
proved that every normal space with the first 
countability axiom is collectionwise normal 
from the strong axiom of set theory. 

In connection with (7) the following result is 
known. Let f be a tclosed continuous mapping 
from a metric space X onto a topological 
space Y. Then the following conditions are 
equivalent: (1) Y is metrizable; (2) For each 
y € Y the tboundary OT" (y) of the inverse 
image is compact; (3) Y satisfies the first coun- 
tability axiom (Morita and S. Hanai, Proc. 
Japan Acad., 32 (1956); Stone, Proc. Amer. 
Math. Soc., 7 (1956); I. A. Vainstein, Dokl. 
Akad. Nauk SSSR, 57 (1947)). In particular, 
perfect images of metric spaces are metrizable. 

For quotient topological spaces of metric 
spaces — 425 Topological Spaces CC. 
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A. General Remarks 


Let X be an open set of R”. Then X x (R"\0) 
can be identified with T* X 0, the tcotangent 
bundle of X minus the zero-section. To every 
locally integrable function (or distribution 

or hyperfunction (— 125 Distributions and 
Hyperfunctions)) f(x) defined on X, one can 
assign closed subsets of X x (R"-.0) called the 
wave front set of f and the singularity spec- 
trum (or analytic wave front set or essential 
support) of f. The wave front set (resp. the 
singularity spectrum) of f describes in detail 
the singularity of f modulo the infinitely dif- 
ferentiable functions (resp. the real analytic 
functions). One can further associate with f 
more refined objects defined on X x (R" 0), 
such as microfunctions. In many cases one can 
recover knowledge of the structure of f by 
analyzing these objects defined on X x (R"-.0). 
Such an analysis on the cotangent bundle of X 
is called microlocal analysis. Microlocal analy- 
sis is particularly successful if f is a solution of 
a system of linear (pseudo-)differential equa- 
tions, because in that case one can use vari- 
ous linear transformations, such as differential 
operators, pseudodifferential operators (— 345 
Pseudodifferential Operators), or microdiffer- 
ential operators and Fourier integral opera- 
tors, or quantized contact transformations. 


B. Microlocal Analysis for Distributions 


Let u be a distribution defined in an open 
subset X of R". The wave front set WF(u) of u 
is defined as the complement in X x (R"\0) of 
the collection of all (x9, čo) in X x (R" 0) such 
that for some neighborhood U of xs, V of čo 
we have for each pe C$ (U) and each N » 0, 


<u, pexp(—itx: Els O(x ^") 


as t— oo, uniformly in če V (L. Hórmander; 
[1]). WF(u) is considered to be a subset of 
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T*X (0j. If WF(u)= Ø, then u is a C*- 
function. Let 7: X x (R" 0) X be the natural 
projection. If x(WF(u)) contains x, then u is 
not a C?-function in any small neighborhood 
of xy. Thus WF(u) is the obstruction for u to 
be infinitely differentiable. Let p(x, D) bea 
"linear partial differential operator of order m. 
Assume that p(x, D)u = f. Then 


WF(f)c WF(u) c WFCf/)U p, (0), 


where p,,:(x, £) 5 p, (x, č) is the "principal sym- 
bol of p(x, D). Such a technique of localizing 
the problem on the cotangent bundle has 
been used in the form of the estimation of the 
Fourier transform of f since the advent of the 
tsingular integral operators of A. P. Calderón 
and A. Zygmund [3,4] (see also S. Mizohata 
[5,6]). The formula above contains as a special 
case the classical result that u is a C?-function 
if p(x, D) is telliptic and if f is a C*-function. 
In obtaining useful results of microlocal 
analysis for distributions one often uses Fou- 
rier integral operators and pseudodifferential 
operators (or singular integral operators) 
(— 345 Pseudodifferential Operators). 


C. Fourier Integral Operators [1,2,7-9] 


A Fourier integral operator B: Cj (R") Z'(R") 
is a locally finite sum of linear operators of the 
type 


asp) a(x, 0, y) 


x exp(io(x, 0, y)) f( y) dy dð. 


Here a(x, 0, y) is a C?-function satisfying the 
inequality 


| D: Dj D?.a(x, 0, y| < C(14- |0|yn- P * c -P alld 


for some fixed m and p, 1 >p > 1/2, and any 
triple of *multi-indices o, fl, y, and @(x, 0, y) is 
a real-valued C?-function which is homoge- 
neous of degree 1 in 0 for |0| 1. The function 
q is called the phase function and a the ampli- 
tude function. 

Let C,— (6x, 0, y)| dao (x. 0, y) 0, 0 20] and 
W = {(x, y)eR" x R"|30 40 such that (x, 0, y)e 
Co}. If d, 4,9 (x, 0, y) #0 for 0:0, then the 
kernel distribution k(x, y) of A is of class C* 
outside W. A phase function q is called non- 
degenerate if the d, 4 ,(Oo(x, 0, y)/00,), j= 1,2, 

., N, are linearly independent at every point 
of C,. In this case, C, is a smooth manifold 
in R^***" and the mapping 0: C, 3(x, 0, y)> 
(x, y, & n). $£—d,9(x, 0, y), n =d p(x, 0, y), is 
an immersion of C, to T*(R" x R")--0, the 
cotangent bundle of R" x R" minus its zero- 
section. The image 6C, = A, is a conic La- 
grangian manifold, i.e., the canonical 2-form o 
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= Äddi dx;— 1 dn; Ady; vanishes on A, and 
the multiplicative group of positive numbers 
acts on A,. Let 44, 45, ..., 45, be a system of 
local coordinates in A,. These, together with 
09/00,, 09/085, ..., 0p/ 00s, constitute a sys- 
tem of local coordinate functions of R"*N*" 

in a neighborhood of C,. Let J denote the 
Jacobian determinant 


6p Co 
IG ai 
1 N 
D(x, 0, y) 





The function dAn Eë de o ' exp(xMi/4) is 
called the symbol of A. Here alc, is the restric- 
tion of a to C, and M is an integer called the 
Keller-Maslov index [10,11]. The conic La- 
grangian manifold A,=A,(A) and the symbol 
ay dall essentially determine the singular- 
ity of the 'kernel distribution k(x, y) of the 
Fourier integral operator A. Conversely, given 
a conic Lagrangian manifold A in T*(R" x 
R”)~0 and a function a, on it, one can con- 
struct locally a Fourier integral operator A 
such that A,(A)=A and ay (A) - a4. For 
global construction of such a Fourier integral 
operator one requires detailed consideration of 
the Keller-Maslov index. A globally defined 
Fourier integral operator A with A,(A)=A 
and a, (A)=a, exists if and only if a, is not a 
function on A but a section of the complex 
line bundle Q, @ L, where Q,, is the bundle 
of square roots of the volume elements of A 
and Lis a Z, bundle over A called the Maslov 
bundle. The factor dd exp(z Mi/4) in the de- 
finition of a, above appears as the effect of 
trivialization of the bundle Q, & L. Those 
Fourier integral operators whose associated 
conic Lagrangian manifolds are the graphs 
of thomogeneous canonical transformations 
of T*(R") are most frequently used in the 
theory of linear partial differential equations. 
Let A be a Fourier integral operator such that 
A,(A) is the graph of a homogeneous canon- 
ical transformation y. Then the adjoint of A 
is a Fourier integral operator such that the 
associated conic Lagrangian manifold is the 
graph of the inverse transformation y '!. Let 
A, be another such operator; if A,(A,) is the 
graph of y,, then the composed operator 
A, A is also a Fourier integral operator and 
A,(A, A) is the graph of the composed homo- 
geneous canonical transformation y, y. 
Consider the kernel distribution k(x, y) of A. 
If the phase function q of A is nondegenerate, 
then WF(k) is contained in A,(A). Moreover, 
if the symbol a, (A) does not vanish, then 
WF(k) — A,(A). Let u be a distribution and A 
be a Fourier integral operator such that A,(A) 
is the graph of a homogeneous canonical 
transformation y. Then WF(Au)c y(WFt(u)). 
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A pseudodifferential operator of class 
$7.1 - ,(R") is a particular type of Fourier inte- 
gral operator (— 345 Pseudodifferential Oper- 
ators). In fact, a Fourier integral operator 
A is a pseudodifferential operator of class 
85.1 (R^) if and only if A,(A) is the graph of 
the identity mapping of T*(R"). Hence for any 
Fourier integral operator A, A*A and AA* are 
pseudodifferential operators. 

The following theorem is due to Yu. V. 
Egorov [12]: Let P(x, D) be a pseudodifferen- 
tial operator of class S7 ,_,(R") with the sym- 
bol p(x, č), and let A be a Fourier integral oper- 
ator such that the associated conic Lagrangian 
manifold A,(A) is the graph of a homogeneous 
canonical transformation y of T*(R"). Then 
there exists a pseudodifferential operator 
Q(x, D) with the symbol q(x, Ces", 87 
such that P(x, DA = AQ(x, D) and q(x, £)— 
P(x(x, $))e S 207 (R^). Note that m— 2p + 
] «m. 

Assume that m— 1, p — 1, and that p,(x, ë) 
is a real-valued C^-function, homogeneous 
of degree 1 in £ for |£|» 1, such that p(x, €) 

— Pix, E)ES? AR" and dp, (x9, £9) 40 at 

(x9, £9), where p, (x°, £9) 20. Then one can find 
a Fourier integral operator A such that the 
function q(x, č) of Egorov's theorem satisfies 
the relation q(x, ë) — £, e S? a(R”). 

The boundedness of Fourier integral oper- 
ators in the space L;(R") (or the spaces H*(R")) 
has also been studied in several cases. Some 
sufficient conditions for boundedness can be 
found in [7,8, 14—16]. 

The theory of Fourier integral operators has 
its origin in the asymptotic representation 
of solutions of the wave equation, (see, e.g., 
[17, 18]). Fourier integral operators were first 
used by G. I. Eskin [7]. 


D. Essential Support or Analytic Wave Front 
Set of a Distribution 


Inspired by the physical idea introduced by 
C. Chandler and H. P. Stapp, J. Bros and D. 
Iagolnitzer introduced the notion of the essen- 
tial support of a distribution, which is a closed 
subset of X x (R”~0) [19]. Let u be a distri- 
bution defined on an open set X of R" and 

x be a C*-function with compact support 
around x€ X which is locally analytic and 
different from 0 at xo. Let Ju) be the subset 
of R" «0 of which the complement is defined 
as follows. A point 5 is in the complement of 
X (u) if there exist a conic neighborhood U 

of y, constants o, 797 0, and Cy such that 


Ku xexpi pe ls — xol} > 
SE 
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for all £e U, 0 « y <7, and all positive integers 
N. 

The essential support X, (u) of u at xo, is 
the limit of Lu) when the width |supp zl of the 
support of y around x, tends to 0. The es- 
sential support X(u) of u is the closed subset 
sex {x} x Eu of X x R"«0. Eil is the ob- 
struction for u to be real analytic. L. Hórman- 
der [20] also defined the analytic wave front 
set of a distribution, which is also the obstruc- 
tion for a distribution to be real analytic. The 
definition of an analytic wave front set is quite 
different from that of essential support. How- 
ever they coincide with each other [21]. More- 
over, both of them coincide with the singular- 
ity spectrum of u if the distribution u is re- 
garded as a hyperfunction. 


E. Microlocal Analysis for Hyperfunctions 


[22] 


(1) Microfunctions. Let N be a real analytic 
manifold of dimension n+d and M its sub- 
manifold of codimension d. In what follows, 
Ty N and T% N denote the normal bundle of 
M and the conormal bundle supported by M, 
respectively. Here the normal bundle T, N is 
defined to be the quotient bundle TN | ,,/TM 
of the tangent bundle and the conormal bundle 
Ty N to be the subbundle of the cotangent 
bundle T* N|,, that annihilates TM. Identify- 
ing N with ((x,v)e TN|v=0} or {(x, £e 
1*N|¢=0}, we define the tangential sphere 
bundle SN and cotangential sphere bundle S* N 
by (TN  N)/RZ (= UA (TN S{0})/R*) and 
(T*N NR (— LL TAN ~ {0})/R%), re- 
spectively. The normal sphere bundle S, N = 
(T, N~M)/R% and conormal sphere bundle 
SiN —- (TEN M)/RX are defined in the same 
manner. In parallel with the algebraic geom- 
etry (— 16 Algebraic Varieties) we define the 
real monoidal transform of N with center M to 
be the manifold (N ~ M)US,N with boundary, 
in which the center M is blown up to Sy N by 
the polar coordinates. We denote it by NN. 
We mainly use this notion when N is a tcom- 
plexification X of M, regarding X as a 2n- 
dimensional real manifold. In this case we can 
canonically identify T,, X with ft TM, and 
hence S,,X with e SM. We denote by x+ 
Jf ot vO the point in S,,X that corresponds to 
(x, ,/—1v) in J/ —1 SM by this identification. 
An open subset W of X ~M is called a conoidal 
neighborhood of a subset U of \/—1 SM if 


WU./—1 SM is a neighborhood of U in ^X. 
Let e, & and denote respectively the canon- 
ical embedding mappings from X ~M to X, 


from “MX ~./—1SM to "X and from ./—1 SM 
~ — A 
to MX. We then define the ‘sheaves @ and o 
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by &c ! 0, and Ó| / is (— €^ 1,). Here Oy 
denotes the sheaf of germs of holomorphic 
functions on X. We also define the sheaf 2 on 
/-1 SM by #1/_ism(t ! Oy), where x is the 
canonical projection from MY to X and HE 
denotes the pth tderived functor of the functor 
I5 of taking the sections with support in S. 
Then 2 is isomorphic to së Ir Lag for the 
sheaf og of real analytic functions on M. The 
sheaf ai is, so to speak, the sheaf of “boundary 
values" of holomorphic functions. Actually 
there exists a canonical mapping b from d to 
t^ By, where Zy denotes the sheaf of thyper- 
functions on M (— 125 Distributions and 
Hyperfunctions). Thus we see that 2 describes 
the singularities of the boundary value of a 
holomorphic function. These sheaves «Z and 
2 are easy to understand intuitively. However, 
they are defined on fst SM, while J- 
S*M is more important in analysis. Our final 
goal, namely, the sheaf of microfunctions, is 
constructed on V EN S*M through cohomo- 
logical machinery starting from 2. In order to 
do this, we introduce the disk bundle DM by 


{(x+,/—100, (x, / —1 €00)) 


e / —1 SM xy./—1 S*M| <v, £) <0}. 


Here ./—1 SM Su, —1 S*M denotes the 


"fiber product of ,/ —1 SM and ./—1 S*M 


over M and the symbol čo is used to em- 
phasize that ë designates the codirection, 
which is dual to the infinitesimally small 
quantity v0. The canonical projections from 
DM to Ael S*M and from TE SM to 

M are both denoted by t. Similarly, the pro- 
jections from DM to A SM and from 
gat S*M to M are denoted by z. We denote 
by a the antipodal mapping on fol S*M, 


namely, a(x, V/ 1 £oo) = (x, aalto). For 
a sheaf F on NE S* M, we also denote 
a,(—a TZ) by F". In the following, R/t,, 
etc., denotes the jth tright-derived functor of 
the functor t, of taking the direct image of 
sheaves, etc. (— 383 Sheaves). Now the sheaf 
€, of microfunctions is defined on SEH S*M 
by (Rtr n 2^ Q lay, where wy de- 
notes the *orientation sheaf of M. Note that 
Rit n’ 2—0 holds for jzn— 1. 

Remark: Here we have defined the sheaf €, 
of microfunctions on TIS S* M. However, it 
is sometimes more convenient to define the 
sheaf on Be T*M (= T% X) by the follow- 
ing convention: € e / 1 = EM bef Li) if 
$70, and Z4, x if €=0. Several authors (e.g., 
[23]) call this sheaf €,, the sheaf of micro- 
functions and denote it by Cy. 


(2) Basic Properties of Microfunctions. The 
sheaf €y defined above has the following 
properties: (i) The sheaf €y is a tflabby sheaf 


274 E 
Microlocal Analysis 


on 4/ —1 S*M. (ii) For each (x, J/ — |o) in 


Jl S*M, there exists a surjective mapping 
from 2, x to Cm, Zizo This mapping is 
denoted by sp. The mapping from A, to 
lu is also denoted by sp. (iii) We have the 
following exact sequence: 0 A y 2,» 
T, €, 0. (iv) Rn €, —0 holds for k #0. 
The exact sequence (iii) shows that the singu- 
larities of hyperfunctions are dispersed over 
/ —1 S*M and that the dispersed object 
is described by the sheaf €,, of microfunc- 
tions. For a hyperfunction f € Z,,(M), we call 
f) (EEI —1 S*M)) the spectrum of f. We 
denote supp sp f) by S.S. f and call it the 
singularity spectrum of f or the singular spec- 
trum. It is known [21] that this coincides 
with the analytic wave front set of f and with 
the essential support of f if f is a distribution. 
(v) The following sequence is an exact se- 
quence on Kë SM: 0S. bv Junt 
>Q. In the following, a subset A of the (n — 1)- 
dimensional sphere S" ! is said to be convex 
if o ! (A)U {0} is convex, where w is the ca- 
nonical projection from R”~ {0} to S" !, and 
if c! (A)U {0} is convex and includes no 
straight line, A is said to be properly convex. 
A subset Z of /—1 SM (resp., /-1 S*M) is 
also said to be (properly) convex if z ! (x)'1Z 
(resp., x ! (x) Z) is so for each x in M. For 
a subset Z of ft SM its polar set Z^ is, by 


definition, { (x, / 1 ë oO) jeg 1. S*M | 
dë E> > D iis for each x-- J/ —1 v, O in 
Z}. The polar set Z^ of a subset Z of ef hs 
S*M is defined in the same way. (vi) Let U 

be an open subset of c SM such that 

t !(x)'1U is a nonvoid connected set for 

each x in M. Let V denote U°°. Then we 

have (a) The restriction mapping p:I'(V;.7 )> 
(IL: lisa bijection. Here I'(V; «7 ), etc. de- 
notes the space of global sections of ai over V, 
etc. (b) 0—.4(U)— Z(M)S «(, / —1 S*M — UÀ’) 
is an exact sequence. (vii) Let f(x) be a hyper- 
function on M. Then the following two state- 
ments are equivalent: (a) SP(P ys. Aizan) =Q. 
(b) There exist a finite family of open subsets 
U, of Be SM whose polar set U; does not 
contain (xo, / —1 £500) and o; in T(U, , ) 
such that f — Ze bie), Then we say that fis 


micro-analytic at (xo, ,/ —1 & 00). 


(3) Operations on Microfunctions. Let M and 
N be real analytic manifolds, and let f be a 
real analytic mapping from N to M. We de- 
note by T£ M the kernel of the natural map- 
ping from N x, T*M to T*N. It is also called 
a conormal bundle supported by N. The as- 
sociated sphere bundle is denoted by SM. 
Denote by p and o the natural mappings from 


N Xy —1 S*M-4/ —1 SHM to,/—1 S*N 
and from N x 44/ —1 S*M~./—1 SEM to 


./ —1 S*M, respectively. (i) Let Zy denote the 


ui 
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sheaf {ue By|S.S. uN / —1 SEM = Ø}. Then 
we have the following two canonical homo- 
morphisms: f *: By > By and f *:pjo ! €, 
€x. Here and in what follows, for a continuous 
mapping ọ from N to M and a sheaf F on N, 
~,F denotes the sheaf on M defined by assign- 
ing [seI'(o ^ (Ux F)|Pleupps?Supps> U is 
*proper; to each open subset U of M. These 
two homomorphisms are consistent. We call 
each of them a substitution (homomorphism) 
and denote it by (f *u)( y) 2 u( f( y)). (ii) Let 

Du denote the sheaf of tdensities. Then there 
exist the following two canonical homo- 
morphisms: f, ` Le Qu, vy) Bu D, Du and 
Sy: tp "(Gy Gy vy) > Cu Quy vy. These two 
homomorphisms are consistent. We call each 
an integration along a fiber and denote it by 
(Jau) (x)= fru. (iii) Using the result in (i), 
we can define the product u, u, of two hyper- 
functions u, and uw), if S.S. u, N (S.S. u) = 

Ø. Furthermore, the singularity spectrum 


of u, u, is contained in ((x, J/ —1(0£, + (1 — 


0)83)o0)|(x, / —1 č; 00)e S.S. uy, (x, / ~1 6,0) 
€&S.u,, 00x 11 US S. u; USSS.u,. 


F. Microdifferential Operators [22] 


(1) Microlocal Operators. Let M be a real 
analytic manifold, and define the sheaf 

Ly on / —1 S&(M x M)=,./—1 S*M by 
ANTT x Mf xm Q Vy). A section K(x, y)dy 
of #y naturally determines an integral oper- 
ator J£ :u(y) [ K(x, y)u(y)dy. An operator 
thus obtained is called a microlocal operator, 
because it acts on the sheaf €, of microfunc- 
tions as a sheaf homomorphism. Usually we 
identify an operator ¥ and a kernel function 
K(x, y)dy. (i) Ly is a sheaf of rings by the 
natural composition. The unit element of 

Ly 1s Ó(x — y)dy. It acts on Gy as an identity 
operator. (ii) Let K(x, y)dy be the kernel func- 
tion of a microlocal operator X defined near 
(Xo; J-i &o 00). Then its adjoint operator 2" 
is, by definition, the microlocal operator de- 
fined near (Ne, eye čo 00) with the kernel 
function K(y, x)dy. The operation * is a sheaf 
isomorphism between ^, and V, where a 
denotes the antipodal mapping (— Section E). 


(2) Microdifferential Operators. A micro- 
differential operator is an analog of a micro- 
local operator in the complex domain. By a 
procedure similar to that used to define the 
sheaf of microfunctions we first define the 
sheaf vie of holomorphic microfunctions for 
a submanifold Y of X (— [22, definition 1.1.7 
on p. 319], where it is denoted by yx). It 
follows from the definition that ec, is sup- 
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ported by P*¥(Y x X), which is identified with 

Y xx P* X. Here and in what follows P*X, 

etc., denotes the cotangential projective bundle 
of X, etc. Then the sheaf & of microdifferen- 
tial operators (of infinite order) is, by defini- 
tion, die x Gc, Qx, where Qy denotes the 
sheaf of holomorphic dim X forms. 

Remark. Several notations are used to de- 
note éy in the literature. For example, [22] 
uses the symbol Z, and calls it the sheaf of 
pseudodifferential operators. As in the case of 
microfunctions, some authors use the symbol 
óy to denote a sheaf on T'* X. In this case 
6x |x is, by definition, 27, the sheaf of linear 
differential operators (of infinite order). One 
should be careful in these notational confu- 
sions in referring to papers using microdif- 
ferential operators. We note also that the sym- 
bols 2 and £ have nothing to do with the 
symbols in distribution theory. 

We now list the basic properties of micro- 
differential operators. 

(i) When X is a complexification of a real 
analytic manifold M, &y | /—js«y is a subring 
of Zu. 

(ii) Let Q be an open subset of P* X. Using 
a local coordinate system (x) on X, we define 
Ô by {(x, eC" x (C"— (01) (x, £o0)e Q}. Let 
{ p(x, )}iez be a sequence of holomorphic 
functions on Q satisfying the following condi- 
tions: (1) p,(x, č) is homogeneous of degree j 
in ¢. (2) For each e» 0 and each compact sub- 
set K of Q, there exists a constant C, y such 
that supy pa, £)| < C, e?/j! (j 2 0) holds. 

(3) For each compact subset K of Q, there 
exists a constant Ry such that supx|p,(x, £)| < 
Rx‘(—j)!(j <0) holds. Then there is a one-to- 
one correspondence between the space of such 
sequences and the space of sections of & over 
Q. 

(iii) A sequence satisfying the conditions 
in (ii) is called a symbol sequence, and the 
corresponding section of & is denoted by 
Ljez DAS, D,). If we define a subsheaf &,(m) of 
by by {P= Lip, DJe&g p(x, &=0 (j> 
m+ 1)}, it is independent of the choice of the 
local coordinate systems. A microdifferential 
operator belonging to &,(m) is said to be of 
order (at most) m. We denote | J,, &,(m) by 
dr and call a section of &, a microdifferential 
operator of finite order. 

(iv) Let ®,(z) denote I'(4)/( — z)^, where 


its branch is chosen so that ®,(—1)=T (4). 


When A=0, —1, —2, ..., we consider its *finite 
part. Let Q be a complex neighborhood of 
(xo, 4// —1 &yoo)e / —1 S*R'a R^ x. / —1 S571. 
Using a symbol sequence {p,(z, €)} on Q, we 
define a multivalued holomorphic function 
K(z, w, ¢) by 2; pj(z, 0, ;((z — w, 0») and 
consider its boundary value from the domain 
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Re«z —w,C» «0. We denote the resulting 
microfunction by 


Y pj, / 109,4 / -1( 9 
+,/—10)). 


Then 


K(x, y) SEH pj / — 15), (c y. 
Be EE 


is a well-defined microfunction in a neighbor- 


hood of (Xo, Xgi / Mo. — £9)oo) whose sup- 
port is contained in the antidiagonal set A^ = 
{œ y; —1(5m)o0)e / —1 S*(M x M)Ix o y, 
E+n=0}. Here w(€) is the volume element 

of the (n — 1)-dimensional sphere S" `. Hence 
K(x, y) dy defines a microlocal operator. 

The mapping which associates K(x, y)dy with 
{ p,(x, €)} is compatible with the inclusion 
mapping stated in (1). 

(v) By using the tplane wave decomposition 
of the ó-function due to F. John (— 125 Distri- 
butions and Hyperfunctions CC), we find that 
microdifferential operators are a natural gen- 
eralization of tlinear differential operators: A 
linear differential operator corresponds to a 
symbol sequence {p,(x, Elo o, where p; is a 
polynomial with respect to c. 

(vi) (a) Let P 2 Xp;(x, D,) and Q = 
È; d(x, D,) be microdifferential operators. 
Then their composition R= P oQ is a micro- 
differential operator with the symbol sequence 
(rs given by 


n= Y Dip, Digs. 
I7 jtk- |a| €: 

Here D? = 0Pl/0£8: ... 082» and a! =a,!...a,! for 
the *multi-index a — (a, ...,2,). (b) Let P 
= eps, D,) be a microdifferential operator. 
Let ó(x — y) denote the residue class [1] of the 
left Ex x x Module £y x x (2i Ex x xx — Yi) 
T X4 Ox x x(0/0x, + 0/0y,)). “Module” means 
sheaf of modules.) Then there exists a unique 
microdifferential operator R = X, r( y, D,) such 
that P(x, D,)ó(x — y) - R(y, D,)ó(x — y). Fur- 
thermore, r(x, č) is given by 

(—1) 


KA 





Dz Di px, €). 


1-j- lo] 


R is called the adjoint operator of P and is 
denoted by P*. When X is a complexification 
of the real manifold M, it coincides with the 
adjoint operator P* e Ly. 

(vii) For a microdifferential operator P in 
&,(m), we define its principal symbol o,,(P) by 
Pm(X, é). The principal symbol o,,(P) is in- 
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dependent of the choice of local coordinate 
system. It gives an isomorphism between 
&y(m)/& (m — 1) and €,. vim, the sheaf of 
holomorphic functions on T * X which are 
homogeneous of degree m with respect to č. 
(viii) (a) Let P be in &y (m). au Assume that 
a, (P)(xo, č) #0. Then its inverse P~? (i.e., 
PP '=P™'P=1)exists in 6y(—m),,,_ «y. (b) 
Let P and G be méin, «5, Exo. e, re- 
spectively. Suppose that H7 (c, (o,(P)) (xo, So) = 
0(j=0,...,p—1) and that H7 (go, (P)) (xo. čo) 
#0. (Here H,(g) is, by definition, the *Poisson 
bracket { f, g] of f and g (— 82 Contact Trans- 
formations)). Then for each S in &y Ae, Sai we 
can find Q and Rm, Gut so that S=QP + 
p 


R with (ad G} R£S[G, [G, ..., [G, R], ...] «0 
holds. This result is usually referred to as the 
Spáth-type division theorem (for microdifferen- 
tial operators). In particular, when G — x, 

and (xo, £j) 2 (0; 1,0, ..., 0), R has the form 
SEL R(x, D')D*. Here RP (x, D') 2 R(x, Dy, 
5 D, 4). As a corollary to this expression 

we find the following (Weierstrass-type) pre- 
paration theorem (for microdifferential oper- 
ators): Let P be as above, and let G = x,. Then 
we can find Q and W in 6, (o, o, o Such 
that P=QW with invertible Q and W=D?+ 
SEL W(x, D’))D*, where W'? belongs to 
&(p—k) and e, ,(W9)(0; 1,0, ...,0)=0. 

(ix) Quantized contact transformation. (a) Let 
X be an n-dimensional complex manifold and 
Q an open subset of P*X. Let P; (j— 1,2, ...,n) 
be in &(1) (Q) and Q; (j— 1, ..., n) in &(0)(Q). 
Assume that [P,, P,]=[0;,0,]=0 and [P, Q,] 
= 6, hold (1 &j, k <n). Let @ be the contact 
transformation from Q to P*C" defined by 
Pr (Go(Qi)(P), ...,o9(Q,) (p), e.(P) (p). ---> 
o, (P,)(p)). Then there exists a unique C-algebra 
homomorphism 6:9 Të 6,|g such that 
(xj) 2 Q; and 6(D)-— Pj (j—1, ...,n). Further- 
more, ® is an isomorphism, Dé¢.(m) = y (m) 
holds, and o, ((R)) 2 o, (R) o @ holds for R in 
&c«(m). We call the pair (o, 9) a quantized con- 
tact transformation. In the above situation, 
the &, , »- Module 


nell È Ex x cr(X; — Q) 
j= 


j=1 


dE EN 


is a simple holonomic system (— Section H) 
whose support is the graph of o°. Let u be the 
canonical generator of Æ, i.e., the residue class 
of 1 in M. Then R*u=@(R)u holds for R € c. 
(b) Conversely, let o be a contact transfor- 
mation from a neighborhood of p in P*X to 
P*C", and let u be a generator of a simple 
holonomic system whose support is the graph 
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of o" Then a C-algebra isomorphism o: 

o écr >é is defined in a neighborhood of p 
through R*u=@(R)u {R € éc), and (p, ®) 
becomes a quantized contact transformation. 
(c) In particular, let p be a point in icd S*M 
and Q its complex neighborhood. Let (o, ©) be 
a real quantized contact transformation de- 
fined on Q Ge, @ maps gat S*M to VL 
S*R"). Then, in a neighborhood of (p, o(py)e 
E] S*(M x R”), there exists a microfunction 
solution K(y, x) 40 of the equations x,K(y, x)= 
O;K(y,x), —G/0x) Ks x) BEI, x) (j= 
L,..., n) (xeR", ye M). Such a microfunction is 
unique up to constant yero The integral 
operator X "rtl | K(y, x)v(x) dx gives rise 
to a sheaf isomorphism between o '@,. and 
@y in a neighborhood of p. Furthermore, we 
have A (Rv) 2 G(R) (/£t) for rege, and Re. 
This 2X" is the counterpart of the Fourier inte- 
gral operator (— Section C). 

(x) Algebraic properties of é% and &,. (a) éy 
is 'Àcoherent as a left &,-Module, and its stalk is 
a "left Noetherian ring. (b) & is *faithfully flat 
over &y. (c) &, is tflat over &,(0). (d) &, is flat 
over z ^! Dy. 


G. Microdifferential Equations [22,24] 

(1) Background. A system A of microdifferen- 
tial equations (of finite order) is by definition a 
‘coherent left (or right) 6,-Module, i.e., there 
exists locally an exact sequence of the form 

k 6X f ^50. For a coherent 6-Module 
M, the support Supp æ of M is an impor- 
tant geometric object associated with æ. It is 
called the characteristic variety of Æ. For 

a coherent Z-Module .Z, its characteristic 
variety is by definition Supp(é e. M). It is 
often denoted by S.S.. Æ. Since a microdifferen- 
tial operator is a microlocal operator, the 
result in (viii) (a) of Section F (3) asserts that 
&at}(M, Cy) is supported by the characteristic 
variety of M intersected with "Eri S* M. 
Now, it is known that Has Supp Æ is tinvolu- 
tory (=involutive, in involution) in P*X, 
namely, f|, —g|, —0 entails { f, g} |y —0 [22, 
theorem 5.3.2 on p. 453]. One of the most 
important problems in microlocal analysis is 
to study how much information V can give 
concerning the structure of M itself, and hence 
that of &x/L(.//, €). The epoch-making dis- 
covery of [22] is that V determines the struc- 
ture of £” Qy M at generic points of V as 
follows. 


(2) Structure Theorems. The fundamental 
result of [22, theorem 5.3.7 on p. 455] is the 
following theorem for coherent &-Modules: 
Structure theorem 1. Let .@ be a coherent 
&y-Module satisfying the following conditions: 
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(1) ath, (M, &)=0 for jd. (2) VS Supp 
is regular at pe V in the sense that V is non- 
singular near p and that w|,(p)40 for the 
‘canonical 1-form w. Then, through a quan- 
tized contact transformation (q, D), £” Q M 
is isomorphic to a direct summand of a direct 
sum of finite copies of partial de Rham sys- 
tem az NU 66«0/0z; with e(p)— 
(0:0, ...,0, D)e P*C". 

By studying the canonical form of V under 
real contact transformations, [22] further 
gives the following structure theorem in a real 
domain, Le, in SH S*M for a real analytic 
manifold M. 

Structure theorem 2. Let X be a complexifi- 
cation of a real analytic manifold M, and let M 
be as in structure theorem 1. Let p be a point 
in VN ecd S*M. Suppose that the following 
three conditions are satisfied: (3) VN V is regu- 
lar at p. (4) (VNA T, (V) - (VN V) holds for 
each q in V V. Here V denotes the complex 
conjugate of V (with respect to gt S*M) 
and T,(V), etc., denotes the tangent space of V, 
etc., at q. (5) The generalized Levi form of V is 
of constant fsignature (a, b) near p, where the 
generalized Levi form of V is, by definition, the 


Hermitian form 
x, / LEI, 


for the p; such that V= (p; ! (0). Then £” & 
M is isomorphic to a direct summand of 

a direct sum of finite copies of the system 

6 * Qg M considered in a neighborhood of 


(x, / —1&) (0, / —1(0,...,0, 1) e / —1 S*R', 


where WV is given by 


L(v)— DHA Px} ( 








^ ^ 
Ó o 
(. Es 1 MAE UE 
OXr+2s+1 Xn 


d Q 
ew cba zm 
ÜXp äent 


Here, r 2 2codim V — codim(Vn V) and s= 
codim(Vf V) —codim V — (a+ b). 

The first (resp. second) type of equation in 
the above are called (partial) de Rham equa- 
tions (resp. (partial) Cauchy-Riemann equa- 
tions). The third and the fourth are called 
Lewy-Mizohata equations (of type (a, b)) after 
these authors' pioneering works [25, 26]. Thus 
any system is seen to be microlocally isomor- 
phic to a mixture of these three types of equa- 
tions, generically speaking. As a corollary to 








a+1,... a+b). 
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this, structure theorem 2 clarifies the structure 
of microfunction solutions of .@ as follows: 
Structure theorem 3. Let M, X, M, V, 
and p be as in structure theorem 2. Then 
Ext, (EY O M, Gy) 0 (j#a) holds, and the 
remaining cohomology group F = ózfss(6x © 
M, €,,) has the following structure in a neigh- 
borhood U of p: There exists an s-dimen- 
sional complex manifold Y, a real analytic 
manifold N, and a tsmooth mapping @ from 
VOUN/=1 S*M to Yx4/—1 S*N such 
that ¥ — 'G for a sheaf 4 on Yx./-1: 
S*N that is a direct summand of J£", with J£ 
being the solution sheaf of the partial Cauchy- 
Riemann equations associated with Y. 


H. Holonomic Systems 


A coherent (left) d Module .@ is called holo- 
nomic if Supp Æ is *Lagrangian. A coherent 
(left) -Module Æ is called holonomic if 

E Q pM is so. Even though the term “holo- 
nomic” is currently used, another term, “maxi- 
mally overdetermined,” is used to describe the 
same object in some of the literature, including 
[22]. The importance of such a system lies in 
the fact that the space of its microfunction 
solutions is finite-dimensional [27, 29]. In this 
sense it resembles an ordinary differential 
equation. À holonomic system, however, does 
not satisfy condition (2) of structure theorem 1 
of Section G, and its structure ts rather com- 
plicated. A result which corresponds to struc- 
ture theorem 1 in Section G is the following: 
Let V be an involutory submanifold of T * X, 
and let &, be the subring of 6, generated by 
{Pe&,(1)|o,(P)], - 0). Then a coherent £y- 
Module .ZJ defined on an open subset Q of 
T*X is said to have regular singularities along 
V if for any point p of Q, there exist a neigh- 
borhood U of p and an 6,-sub-Module Mo 

of M defined on U which is coherent over 
6(0), and which generates A as an éy-Module. 
A holonomic system is said to have R.S., 
which is an abbreviation for regular singular- 
ities, if it has RS along its support. Then for 
an arbitrary holonomic &-Module M we can 
find a holonomic &-Module A... with R.S. 
such that &* Qg Mreg x 67 G9, M holds. See 
[28] for the proof of this striking result and 
related topics on holonomic systems with reg- 
ular singularities. 

An elementary class of holonomic systems is 
that of simple holonomic systems. A holonomic 
&-Module M is called simple if there exists a 
left Ideal .£ such that .“=6/¥ and that the 
symbol Ideal {o(P)|Pe4} coincides with the 
defining Ideal of Supp. A. Let u denote the 
generator 1 mod.7 of a simple holonomic 
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system &/.Z. Suppose that A = Supp. M is 
nonsingular. Then the principal symbol o,(u) is 
defined as follows: For P= %jp,(x, D)e (m), 
let L, denote the first-order linear differential 
operator 


ke i 
L= Hna Pamit T] 


24 6x6 


Here, H, denotes the Hamiltonian vector field 
defined by {Pm j. Let Q, and Qy denote the 
sheaf of n-forms of A and X, respectively, and 
let O9'? (resp. Q812 @ Q9? 17) denote a line 
bundle L such that L9? is isomorphic to Q4 
(resp. Q4 & Q71). Since the line bundles QE"? 
and Q9'? @ OF ? do not exist globally in 
general, all the equations among their sections 
should be understood up to a constant multi- 
plicative factor. When A is a purely imaginary 
Lagrangian submanifold of NET S*M for 

a real analytic manifold M, these line bun- 
dles can be constructed globally by using the 
Keller-Maslov index (— Section C). Let v 

and y denote respectively the first term and 
the second term in the right-hand side of the 
above definition of Lp. Then L,:02!2 5 

Q9 is given by Lp(s) - (1/2s) L, (s?) + Ws for 
seO9!?. where L, (s?) denotes the "Lie deriva- 
tive of s? along v. One can then prove that the 
system of equations Lps =0 (Pe.7) admits 
locally one and exactly one nonzero solution 
s in Q?!? up to a constant factor. Then the 
principal symbol c4 (u) of u equals s ® V dx e 
O91? Q (9712 by definition. The principal 
symbol o,(u) is homogeneous with respect to 
£, and its homogeneous degree is called the 
order of u and is denoted by ord,(u). The 
microlocal structure of a simple holonomic 
&-Module with a nonsingular characteristic 
variety is determined by the order of its gen- 
erator as follows: (a) Let &u and ër denote 
simple holonomic &-Modules with the same 
characteristic variety A. Then &u is isomorphic 
to &v if and only if ord, (u) — ord, (v) is an 
integer. (b) Let &u be a simple holonomic 
system, and let x denote the order of u. Then 
through a suitable quantized contact trans- 
formation, &u is isomorphic to dee, where w 
satisfies 


Calar. 0 
DS GER uon 


Ow 





—-0 (j=2,...,n). 
02; 

Thus the microlocal structure of a simple 
holonomic system M = &u is fairly simple at 
nonsingular points of its characteristic variety. 
Moreover a Hartogs-type theorem for micro- 
differential equations [28, ch. I, 82] entails 
that if Supp. A has the form A, UA, with 
Lagrangian manifolds A, and A; such that 
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codim, (A, 145) 22; then æ has the form 

M, € M, with supp. A; — A; (j — 1, 2). Hence 
the case where codim, (A, (1A5)— 1 is impor- 
tant. Suppose A, and A, are nonsingular in this 
case and T, A, £ T, A; at a point xeA, NA. 
Then, through a quantized contact transfor- 
mation (wy, ®), the system A is isomorphic to 
cW, with w satisfying the following equations 
defined near (0; dz, œ): 


ôw 

——— =3,. 

o (j=3,...,7), 

where a= ord, (u) (j — 1,2) with y(A;)— 

(2; 20, č, 2... 2£,-0] and w(A,)={z,= 
z,=0,€,=...=€,=0}. For more general cases 


and an application — [30]. 

The theory of holonomic systems and its 
applications are being studied most inten- 
sively, raising the hope of establishing a uni- 
fied theory of special functions of several 
variables; — [23] and references cited therein. 


I. History 


Microlocal analysis means local analysis on 
the cotangent bundle. It emphasizes the im- 
portance of localization in cotangent bun- 
dles in analysis, which was pointed out by S. 
Mizohata [5,6] immediately after the advent 
of singular integral operators in the works of 
A. P. Calderón and A. Zygmund [3,4]. Since 
then, localization in the cotangent bundle has 
been used frequently in the theory of linear 
partial differential equations. R. T. Seeley [31] 
proved that the symbol of a singular integral 
operator is well defined on the cotangent 
bundle. Works by J. J. Kohn and L. Nirenberg 
[32] and L. Hórmander (Comm. Pure Appl. 
Math., 18 (1965)) strengthened the trend of 
localizing the problem on the cotangent bun- 
dle. Although it seems that the term “micro- 
local analysis" first appeared in the literature 
in 1973 (T. Kawai, Astérisque, 2 and 3 (1973)), 
the basic part of the theory had been con- 
structed during the period from 1969 to 1972 
by M. Sato (Proc. Intern. Conf. Functional 
Anal. and Related Topics, 1969), Yu. V. Egorov 
[12], Hórmander [8, 20], J. J. Duistermaat 
and Hórmander (Acta Math., 128 (1972)), 

M. Kashiwara and Kawai (Proc. Japan Acad., 
46 (1970)), and Sato, Kawai, and Kashiwara 
[22]. Apparently the work of V. P. Maslov 
[11] had an important influence on the work 
of Egorov. The most important part of Sato's 
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contribution was the fact that, through the 
construction of the sheaf of microfunctions he 
found that the singularities of hyperfunctions 
can be canonically dispersed over the cotan- 
gent bundle (— Section E) and that a hyper- 
function solution u of a linear differential 
equation Pu —0 is concentrated on the charac- 
teristic variety when its singularities are thus 
dispersed (— Section G). The last-stated fact 
was also formulated by Hórmander [8, 20] in 
the framework of distribution theory. The 
most important part of the contribution of 
Egorov [12] was the discovery that one can 
use an integral transformation introduced by 
V. I. Eskin [7] to find a transformation of 
pseudodifferential operators compatible with a 
homogeneous canonical transformation, i.e., a 
contact transformation, so that the commut- 
ation relations and the orders of the operators 
can be preserved. Hórmander (Acta Math., 

121 (1969)) independently introduced integral 
transformations of the same type, calling them 
Fourier integral operators. Egorov [13] and L. 
Nirenberg and F. Treves [33] successfully used 
the transformation of operators to study the 
regularity and existence of solutions. Sub- 
sequently, Hórmander [8] elaborated the 
theory of Fourier integral operators. Kashi- 
wara and Kawai (Proc. Japan Acad., 46 (1970)) 
observed that a pseudodifferential operator in 
their sense (now called a microdifferential 
operator; — Section F) gives rise to a sheaf 
homomorphism on the sheaf of microfunctions 
and that the structure of the microfunction 
solutions of pseudodifferential equations is 
determined by the principal symbol of the 
operator in question If it has simple character- 
istics. Then Sato, Kawai, and Kashiwara [22] 
succeeded in amalgamating these two theories, 
namely, the theory of microfunctions and the 
theory of the transformation of operators. 
(They called the transformation a quantized 
contact transformation in [22]). Such an amal- 
gamation was also done independently by 
Hórmander (8, 20], who introduced the notion 
of the wave front set for distributions as a 
substitute for the support of microfunctions. 
Incidentally, it is noteworthy that C. Chandler 
and H. P. Stapp (J. Math. Phys., 10 (1969)) and 
D. Iagolnitzer and Stapp (Comm. Math. Phys., 
14 (1969)) obtained a notion similar to the 
singularity spectrum in a physical context. 
Their results were later elaborated (around 
1971-1973) by J. Bros and Iagolnitzer [19]. 
With the aid of the above-mentioned amal- 
gamation of the theories, the works of Sato, 
Kawai, and Kashiwara [22], Hórmander [20], 
Duistermaat and Hórmander (Acta Math., 121 
(1969)), Kawai (Publ. Res. Inst. Math. Sci., 7 
(1971—1972)) and K. G. Andersson (Trans. 
Amer. Math. Soc., 177 (1973)) have clarified the 
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importance of the bicharacteristic strip, which 
is a submanifold of a cotangent bundle, not a 
base manifold, as a carrier of singularities of 
solutions of pseudodifferential equations (— 
Section G); and the works of Sato, Kawai, and 
Kashiwara [22], Sato (Actes Congr. Internat. 
Math. Nice, 1971) and Kawai (Publ. Res. Inst. 
Math. Sci., 7 (1971—1972)) revealed the hidden 
mechanism of the celebrated counterexample 
of H. Lewy [25]. Among these, the contri- 
bution of Sato, Kawai, and Kashiwara [22] 
was most decisive and fundamental in that it 
first clarified the structure of a general system 
of pseudodifferential equations at generic 
points of the characteristic variety and then 
derived from it the above-quoted results on 
the structure of the solutions (— Section G). 

Microlocal analysis has now become one of 
the most important and basic concepts in the 
theory of linear partial differential equations 
and theoretical physics. For recent develop- 
ments — Hormander [34] and V. Guillemin, 
Kashiwara, and Kawai [23] and references 
cited therein. 
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A. General Remarks 


An immersion f of an m-dimensional manifold 
M with boundary 06M (possibly empty) into a 
Riemannian manifold N is called minimal if 
the *mean curvature vector field H of M with 
respect to the induced Riemannian metric 
vanishes identically. Then M is called a mini- 
mal submanifold of N. This definition comes 
from the following variational problem: By a 
smooth tvariation of f is meant a smooth map- 
ping F:I x MN, where I —(—1, 1), such that 
each f,=F(t,-): M—N is an immersion, fy f, 
and f,| OM — f | 0M for all te I. Let dV, be the 
volume element of the metric induced by f,, 
and set V(t) — fu dV, the volume of M at time 
t. Then the first variation of the volume is 


expressed as 
d 
t=0 


0 
-— —nmnm HP ne 
1=0 M ot 


where 8/ôt denotes the canonical vector field 
along the J factor in J x M. Thus the mean 
curvature vector field H of f vanishes identi- 
cally if and only if dV/dt |, —O for all vari- 
ations of f. Therefore a minimal submanifold 
gives an extremal of the volume integral, 
though neither necessarily minimal nor of the 
least volume. 

In the case N =R”, an immersion x: M >R” 
is viewed as a vector-valued function, and the 
mean curvature vector field H is expressed as 
H — Ax/m, where A denotes the tLaplace- 
Beltrami operator — give, Thus x is minimal 
if and only if each component function of x is 
harmonic. In particular, there is no compact 
minimal submanifold without boundary in R”. 


dV 
dt 
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The history of the theory of minimal sub- 
manifolds goes back to J. L. Lagrange, who 
studied minimal surfaces in the 3-dimensional 
Euclidean space R?. In 1762 he developed 
his algorithm for the *calculus of variations, 
which can be applied to higher-dimensional 
problems and is now known as the tEuler- 
Lagrange equation. For instance, let D bea 
domain in the plane R? and z= f(x, y), (x, y)e 
D, the equation of a surface in R?. As a neces- 
sary condition for the surface to have the least 
area among the surfaces with fixed boundary, 
Lagrange obtained a *quasilinear elliptic par- 
tial differential equation of the second order, 
called the minimal surface equation: 


(1 T z2)zy. — 22,2,2,, FU T2125 =0. 


Before this, in 1744, L. Euler had found that a 
*catenoid is a minimal surface. In 1766, J. B. 
M. C. Meusnier showed that a right thelicoid 
is a minimal surface. Besides catenoids and 
helicoids, in 1834, H. F. Scherk found that the 
surface defined by z=log(cos y) —log(cos x) is 
a minimal surface, which ts called Scherk’s 
surface. 

In the latter half of the 19th century, *Pla- 
teau’s problem (— Section C; 334 Plateau's 
Problem) was studied extensively by O. Bon- 
net, B. Riemann, K. Weierstrass, A. Enneper, 
G. Darboux, and others. The problem is stated 
as follows: Given a !Jordan curve F in R? (or 
in R”), find a surface of least area having I 
as its boundary. On the other hand, in 1866, 
Weierstrass gave a general formula, called 
the Weierstrass-Enneper formula (— Section B 
(5)), to express a simply connected minimal 
surface in terms of a complex analytic function 
and a meromorphic function with certain 
properties. The formula allows one to con- 
struct a great variety of minimal surfaces by 
choosing those functions. 

The existence of a minimal surface of disk 
type having a prescribed boundary curve was 
first obtained in 1930 by J. Douglas and T. 
Radó independently as a solution to Plateau's 
problem, admitting singularities. The result 
was improved by R. Courant for the case of 
finitely many boundary curves by the method 
of !Dirichlet integrals. The method was carried 
out further by C. B. Morrey for the gener- 
alized Plateau's problem in a Riemannian 
manifold (— Section C (5)). Indeed, the Euclid- 
ean space was replaced by any complete Rie- 
mannian manifold which is metrically well 
behaved at infinity. For example, any compact 
or any *homogeneous Riemannian manifold is 
in this class. 

The existence proof of minimal surfaces 
cannot in general be applied directly to the 
case of higher-dimensional minimal submani- 
folds. The notion of varifolds (— Section G (2)) 
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and the corresponding generalization of a 
minimal submanifold, called a minimal variety 
(— Section G (2)), were then introduced by F. 
Almgren, who proved the existence of a mini- 
mal variety in a compact Riemannian mani- 
fold. It was also proved that a minimal variety 
is approximated by regular minimal submani- 
folds. This work has been developed as geo- 
metric measure theory by E. R. Reifenberg, 
W. H. Fleming, H. Federer, F. Almgren, and 
others (— Section G). 

The study of minimal surfaces given as the 
graph of a real-valued function of two vari- 
ables leads to that of solutions of the minimal 
surface equation. In 1915, S. Bernshtein proved 
that a minimal surface z — f(x, y) defined on 
the entire plane R? must be a plane. Sub- 
sequently, this was generalized to the following 
problem: Is a minimal hypersurface x,,, — 
f(x, ..., x) in R"*! defined on the entire space 
R" a hyperplane? The answer turns out to be 
affirmative for n x 7 and negative for nz 8 
(— Section F (1)). 

In general, when a bounded domain D in R" 
and a continuous function o on its boundary 
OD are given, the problem of finding a minimal 
hypersurface M defined by the graph of a real- 
valued function f on D with f|0D = q gives 
rise to a typical fDirichlet problem. The basic 
questions are those of existence, uniqueness, 
and regularity of solutions. These were first 
studied by Rado for n — 2 and later by L. Bers, 
R. Finn, H. Jenkins, J. Serrin, R. Osserman, 
and others (— Section D (1)). 

Minimal submanifolds of a sphere have 
interesting properties; some are analogous to 
those in Euclidean space but some are not. 
Among them, J. Simons, in 1968, gave a dif- 
ferential equation that the norm of the second 
fundamental form of a minimal submanifold 
in a sphere should satisfy. He then showed 
that a totally geodesic submanifold is isolated 
among the compact mintmal submanifolds in 
the sphere. S. S. Chern, M. P. do Carmo, S. 
Kobayashi, N. En, T. Ito, T. Otsuki, N. Wal- 
lach, and others have made further contri- 
butions to this subject (— Section F (2)). 


B. Minimal Surfaces 


(1) Branched minimal surfaces. A minimal 
surface M in R" is an immersed surface with 
vanishing mean curvature vector field. M 
equipped with an atlas of tisothermal coor- 
dinates is viewed as a *Riemann surface. A 
branch point of a tharmonic mapping f: M 
R" is a point at which the differential df, is 
zero. A harmonic mapping f: MR" ofa 
Riemann surface M is called a branched (or 
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generalized) minimal immersion if it is con- 
formal except at the branch points, and the 
image f(M) is then called a branched minimal 
surface. The solution to Plateau's problem 
given by Douglas and Radó is a branched 
minimal surface. 

(2) Maximum principle. When n —3, the 
following maximum principle for minimal 
surfaces holds: If M, and M; are two con- 
nected branched minimal surfaces in R? such 
that for a point pe M, N M5, the surface M, 
locally lies on one side of M, near p, then M, 
and M, coincide near p. 

(3) Convex hull property. In general, every 
branched minimal surface with boundary in R" 
lies in the convex hull of its boundary curve, 
the smallest closed convex set containing the 
boundary. 

(4) Reflection principle. If the boundary 
curve of a branched minimal surface contains 
a straight line y, then the surface can be ana- 
lytically continued as a branched minimal 
surface by reflection across y. Based on this 
principle, the following holds: Let I' be an 
analytic Jordan curve in R” and f: MOR"a 
branched minimal immersion with boundary 
F. Then f is analytic up to the boundary, i.e., 
f(M) is contained in the interior of a larger 
branched minimal surface (H. Lewy). The 
smooth version of this theorem was given by 
S. Hildebrandt: If f: M —R" is a branched 
minimal immersion with smooth boundary 
curve, then f is smooth up to the boundary. 

(5) Weierstrass-Enneper formula. Every 
simply connected minimal surface in R? is 
represented in the form 


x)= Re | E o, k=1,2,3, 
0 


where d = f(1 —9^)2, $; — / —1 f(1 +g°)/2, 
$3 = fg, and c, is a constant. Here, g(f) is a 
meromorphic function on a domain D in the 
complex ¢-plane, and f(¢) is an analytic func- 
tion on D with the property that at each point 
C, where g(C) has a pole of order m, f(¢) has a 
zero of order 2m, D being either the unit disk 
or the entire plane. This formula is quite use- 
ful for constructing various minimal surfaces. 
For instance, on setting f=1 and g(C) — C, 
Enneper's surface is obtained: 


3 3 


u v 
(6, X2 X3) 7 (u= +u’, MC vu?, u? — v?), 


(u, v)e R?. 


Another feature of the formula is that general 
theorems about minimal surfaces can be ob- 
tained by translating statements about analy- 
tic functions into the corresponding minimal- 
surface ones. For example, a surface f: M >R? 
is minimal if and only if the *Gauss mapping 
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G:M 5S is antiholomorphic, i.e., the complex 
conjugate of the mapping is holomorphic, 
when S? is viewed as the Riemann sphere with 
complex structure by the stereographic projec- 
tion (from the south pole) onto the complex 
plane. As a consequence, a complete minimal 
surface in R? is a plane or else the normals to 
the surface are everywhere dense in S? (R. 
Osserman). 

(6) Stability. A minimal submanifold M is 
called stable if for every compact region on M 
all the second variations of the volume are 
positive. For instance, if M 5 R" is a minimal 
hypersurface and fv is a variation vector field, 
v being the unit normal vector field of M and f 
a smooth function with compact support on 
M, then the second variation formula for the 
volume V(t) is given by 


d'H 
dt? 





-| (Vf? —1AP f?) dos, 
t=0 M 


where | A| denotes the norm of the second 
fundamental form of M. In particular, when 
n= 3, then —|A|?/2=K, the Gauss curvature 
of M. Therefore a minimal surface M in R? is 
stable if and only if 


| (IVf +2Kf*)dV>0 


for any smooth function f with compact sup- 
port on M. It follows that the stability of mini- 
mal surface M in R? is related to the bound- 
ary value problem of a linear elliptic operator 
L=A—2K, A being the Laplacian on M. 
Namely, a minimal surface M in R? is stable if 
and only if the first eigenvalue 4,(D) of L on 
any bounded domain D in M is nonnegative. 

In connection with the tGauss mapping, a 
sufficient condition for a domain D in a mini- 
mal surface to be stable was obtained by H. 
Schwarz in 1885: If a minimal surface M in R? 
has one-to-one Gauss mapping G: MoS?, 
then a domain D c M such that G(D) is con- 
tained in a hemisphere of S? is stable. This was 
generalized by J. L. Barbosa and M. do Carmo 
as follows: If the area A(G(D)) of the spherical 
image G(D) is less than 2z, then D is stable. 
This result is sharp in the sense that for any 
&>0 there exists an unstable domain D with 
A(G(D)) 2 21 +e. As an analog to Bernshtein’s 
theorem, the following holds: A complete and 
stable minimal surface in R? is a plane. H. 
Mori has also obtained results in this regard. 

As for the existence of unstable minimal 
surfaces, it is known that if there are two dis- 
tinct stable minimal surfaces with the same 
smooth boundary curve T in R^, then there 
exists an unstable branched minimal surface 
with T as its boundary (M. Morse and C. 
Tompkins, M. Schiffman). 
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C. Plateau's Problem (— 334 Plateau's 
Problem) 


Let T be a trectifiable ‘Jordan curve in R” and 
D={(x, y eR?|x? - y? « 1). Then there exists 
a continuous mapping f: DR" such that (a) 
f{|éeD maps homeomorphically onto T; (b) 
f|D is harmonic and almost conformal, i.e., 
Cf. f) =0 and | f| - [jl in D with |df|>0 
except at isolated branch points; (c) the in- 
duced area of f is the least among the class 

of piecewise smooth surfaces bounding I with 
(a). This mapping f is called the classical solu- 
tion or the Douglas-Radó solution to Plateau's 
problem for T, which may have singularities 
called tbranch points. The resulting surface S 
is a branched minimal disk. This theorem 
establishes the existence of a surface of least 
area among all surfaces homeomorphic to a 
disk. It has been generalized by R. Courant 
for T consisting of finitely many rectifiable 
Jordan curves. 

For a branched minimal disk S bounded by 
smooth F in R?, there is a relation, called the 
Gauss-Bonnet-Sasaki-Nitsche formula, among 
the "total curvature x(I) of T, the total curva- 
ture of S, and the orders of branch points: 


1 1 
1 — M, +— dA x — K(T), 
+¥(m,-)+¥ rts | IKI zD 


where K denotes the Gauss curvature of S, 

m, — 1 the orders of the interior branch points, 
and 2M, the orders of the boundary branch 
points, which must be even. 

(1) Regularity. A minimal disk of least area 
in R? has no boundary branch points when I 
is real analytic (R. Gulliver and F. Leslie) or 
when T is.smooth and the total curvature «(I) 
is less than 4z (J. C. C. Nitsche). In general, a 
classical solution for smooth T in R? cannot 
have infinitely many branch points. A re- 
markable fact is that every classical solution 
to Plateau's problem in R? is free of branch 
points in its interior, i.e., is a regular immer- 
sion (R. Osserman). 

(2) Embeddedness. A classical solution is 
not necessarily an embedding; it may have 
self-intersections. For instance, if I is knot- 
ted in R?, then every solution must have self- 
intersections. It is known, however, that im- 
mersed minimal disks of least area in R? which 
can self-intersect only in their interiors are 
embeddings. In particular, if I is an extremal 
Jordan curve, i.e., if IT lies on the boundary of 
its *Àconvex hull, then the classical solution for 
T is an embedding (F. Tomi and A. J. Tromba; 
W. H. Meeks and S. T. Yau). If the topological 
type is not specified, then there always exists 
an embedded minimal surface. That is, if T is 
the union of any finite collection of disjoint 
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smooth Jordan curves in R?, then there exists 
a compact embedded minimal surface with 
boundary F which is smooth up to the bound- 
ary (R. Hardt and L. Simon). However, there 
exists an unknotted Jordan curve that never 
bounds an embedded minimal disk. 

(3) Uniqueness. In general, the classical 
solutions to Plateau's problem do not have the 
uniqueness property. Radó was the first who 
gave a condition on a boundary curve to 
guarantee the uniqueness of the minimal disk. 
Namely, if a Jordan curve T in R” admits a 
one-to-one orthogonal projection onto a con- 
vex curve in a plane R? in R", then the classical 
solution to Plateau's problem for T is free of 
branch points and can be expressed as the 
graph over this plane. When n—3, the solution 
is unique. Another geometric condition on I 
has been given by Nitsche: An analytic Jordan 
curve I in R? with total curvature k(T)< 
4r (or a smooth T with k«(I) «4z) bounds a 
unique immersed minimal disk. Moreover, 
the generic uniqueness holds: In the space . 
of all smooth Jordan curves in R" with suitable 
topology, there exists an open and dense sub- 
set Z such that for any Um & there exists 
a unique area-minimizing minimal disk (F. 
Morgan and A J. Tromba). 

(4) Finiteness. As for the finiteness of the 
classical solutions to Plateau's problem, 
several conditions on boundary curves are 
known. An analytic Jordan curve in R? 
bounds only finitely many minimal disks of 
least area (F. Tomi). An analytic fextremal 
Jordan curve in R? bounds only finitely many 
minimal disks with relative minima of area (H. 
Beeson). A smooth Jordan curve I with total 
curvature x(I) € 6x bounds only finitely many 
minimal disks (Nitsche). Moreover, generically, 
i.e., for an open and dense subset of boundary 
curves, there are at most finitely many minimal 
surfaces with given boundary, relative minima 
or not (R. Bóhme and Tromba). 

(5) Generalized Plateau's problem. C. B. 
Morrey's setting of the generalized Plateau's 
problem is as follows: A homotopically trivial 
rectifiable Jordan curve T is given in an n- 
dimensional Riemannian manifold N. Let D 
denote a disk in Ri Find a mapping f: Do N 
such that (a) f (0D maps homeomorphically 
onto T, (b) the induced area of f is the least 
among the class of piecewise smooth surfaces 
in N bounded by T with (a). Obviously when 
N =R", this is the classical Plateau’s problem. 
A solution was given by Morrey under the 
assumption that N is homogeneously regular; 
i.e., that there exist 0 c k < K such that for 
any point ye N there is a local coordinate 
system (V, o) around y for which ®(V)= 
[xeR"|lxi <1} and the Riemannian metric 








275 D 
Minimal Submanifolds 


È g,, dx* dx satisfies 
KY (v*Y «5, go v*v" « K » (o 


for any x and (v,, ..., v,)e R”. Any compact or 
any homogeneous Riemannian manifold is 
homogeneously regular. Morrey's solution is 
as follows: If N is a homogeneously regular 
Riemannian manifold and if I is a homotopi- 
cally trivial rectifiable Jordan curve in N, then 
there exists a branched minimal immersion /: 
DAN with least area bounded by I with (a). 
The regularity of Morrey’s solution is simi- 
lar to the classical one. If I is smooth, then 
so is the solution f up to the boundary. If, 
furthermore, dim N = 3, then the solution f 
is an immersion in its interior. If N and T are 
real analytic and dim N =3, then the solution f 
is an immersion up to the boundary. 


D. Existence Problems of Minimal 
Submanifolds 


(1) The Dirichlet problem for the minimal 
surface equation. When the graph of a (vector- 
valued) function is a minimal submanifold 

in some Euclidean space, the function must 
satisfy the so-called *minimal surface equation. 
To be precise, let D be a (bounded) domain in 
R”, f: DO R' a (vector-valued) function, and 
put F: D2R"^*: F(x) « (x, f(x)) for xe D. Let M 
be the graph of f: M — F(D). Then M is mini- 
mal if and only if f (or F) satisfies one of the 
following equations: 


ifn=2 and k=1, 
Ui AE redeo, 0 (1) 
if n is arbitrary and k=1, 
n 0/1 of 

—|—— J= h WH /1I+|Vf/?; 
Nis) 0 where TVfl; 


(2) 
if n=2 and k is arbitrary, 
(L6) S ex 26 fs, tA FL Gy = OI 


and if n and k are arbitrary, 


"0 Qf OF aF 
y r -=0 h = ST 
2, M dete QE Ox! Ax?” 


i,j=1 





(4) 


and (g") is the inverse matrix of (g;;). 


The basic problem for this class of equations 
is the *Dirichlet problem. For n=2 and k=1, 
the following theorem of Rado and Finn is 
fundamental: There exists a solution f of the 
Dirichlet problem corresponding to an arbi- 
trary continuous function qo on the boundary 
OD of D if and only if D is convex. Since the 
difference of any two solutions of eq. (1) satis- 
fies the maximum principle, there can be at 
most one solution f of the Dirichlet problem 
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corresponding to a given continuous function 
g on OD. As for the removability of isolated 
singularities, the following is known: Every 
solution of eq. (1) in 0 « x? + y? <1 extends 
continuously to the origin. The extended func- 
tion ts smooth at the origin and satisfies eq. (1) 
in the full disk x? 4- y? « 1 (Finn). 

For arbitrary n and k — 1, namely, for the 
case of minimal hypersurfaces, the following is 
known: Let D be a bounded domain in R" with 
smooth boundary. The Dirichlet problem for 
eq. (2) has a solution for any continuous func- 
tion o on OD if and only if the mean curvature 
of OD with respect to the inner normal is non- 
negative at every point. As for the uniqueness 
and regularity, the same results as above hold 
(Jenkins and Serrin). 

For n —2 and arbitrary k, namely, for the 
case of minimal surfaces in R***, exactly the 
same result of existence as in the classical case 
k=1 holds. A solution exists for an arbitrary 
continuous vector-valued function on ôD if 
and only if D is convex (Osserman). Though 
the removability of singularities holds under 
suitable restrictions, the uniqueness fails in this 
case. 

For the last case, n 2 and k> 1, there are 
essentially no results on either existence or 
uniqueness for the Dirichlet problem. 

(2) Existence of minimal surfaces and top- 
ology. For a compact Riemannian manifold 
N, there are results on the existence of minimal 
surfaces related to the homotopy groups of 
N. Let f bea continuous mapping from a 
*Riemann surface Z, of tgenus g into N. If the 
induced mapping f, :2,(2,) 7,(N) of f on 
the *fundamental groups is injective, then there 
exists a branched minimal immersion h: Z, — N 
such that h, =f, on 2,(2,) and the induced 
area of h is the least among all mappings with 
the same action on z,(Z,). If furthermore 
m>(N)=0, then A can be deformed from f 
continuously (R. Schoen and S. T. Yau). If 
1. 4(N) #0, then there exists a generating set for 
T(N) consisting of branched minimal immer- 
sions of spheres that minimize energy and area 
in their homotopy classes (J. Sacks and K. 
Uhlenbeck). If, furthermore, dim N —3, then 
the above minimal immersion h in the homo- 
topy class is an embedding or a two-to-one 
covering mapping. In the second case, its 
image is an embedded real projective plane. If 
h' 1s another such least-area mapping, then 
either h’ is equal to h up to a conformal re- 
parametrization or the images of h and h’ are 
disjoint (Meeks and Yau). 


E. Gauss Mapping of Minimal Surfaces 


On a connected, orientable surface M there 
exist local isothermal coordinates (x, y) at each 
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point. On putting z=x+ at y, the metric 
has the form ds? = 2F|dz|?, and M is viewed as 
a Riemann surface. A conformal immersion 
f: M SR" is minimal if and only if Af =0, or 
equivalently 0,6, f —0. In the case n=3, the 
*Gauss mapping G: M S? of a surface f: 

M >R? is defined by assigning to each point 
pe M the unit normal vector translated paral- 
lel to the origin of R?. A surface f: M 9 R? is 
minimal if and only if the Gauss mapping is 
antiholomorphic. In the general case, the 
Gauss mapping G is defined to be a mapping 
assigning to each point pe M the oriented 
tangent space f, T,(M) c R". Thus G is a map- 
ping from M into the tGrassmann manifold 
M, ,(R) =SO(n)/SO(2) x SO(n —2) of oriented 
planes in R”, which is naturally diffeomorphic 
to the tcomplex quadric Q,. , in the complex 
projective space CP"^!, An immersion f: 
MR" is minimal if and only if its Gauss 
mapping is antiholomorphic. 

Let f: M >R” be a complete orientable 
minimal surface. Let y be the Euler character- 
istic and —zC la KdA, the total curvature of 
J(M). Assume that the total curvature is finite. 
Then the following results of Chern and Osser- 
man are fundamental: (a) M is conformally a 
compact Riemann surface M with finite num- 
ber, say r, of points deleted; (b) C is an even 
integer and satisfies C 2 2(r — y) 2 4g + 4r — 4, 
where g is the genus of M (= genus of M); (c) if 
f(M) does not lie in any proper affine subspace 
of R”, then C 24g -r-n—324g 4 n—22n—2 
(F. Gackstatter); (d) if f(M) is simply connected 
and nondegenerate, i.e., its image under the 
Gauss mapping does not lie in a hyperplane 
of CP" !, then C 2 2n —2 and this inequality 
is sharp; (e) when n —3, C is a multiple of 4, 
with the minimum value 4 attained only for 
Enneper's surface and the catenoid; (f) the 
Gauss mapping G of M extends to a mapping 
of M whose image G(M) is an algebraic curve 
in CP"! lying in Q,  ;; the total curvature of 
f(M)is equal in absolute value to the area of 
G(M), counting multiplicity; (g) G(M) inter- 
sects a fixed number of times, say m (counting 
multiplicity), every hyperplane in CP" ! except 
for those hyperplanes containing any of the 
finite number of points of G(M — M); the total 
curvature of f(M) equals — 2am. 

In particular, if a complete minimal surface 
in R" has k tends, then the total curvature 
never exceeds 2z(y — k). Enneper's surface and 
the catenoid are the only two complete mini- 
mal surfaces in R? whose Gauss mapping is 
one-to-one. 

If the Gauss mapping of a complete minimal 
surface of finite total curvature in R? omits 
more than 3 points of 5^, then it is a plane 
(Osserman). F. Xavier proved that the Gauss 
mapping of any complete nonflat minimal 
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surface in R? can omit at most 6 points of S?. 
It is an open question whether this can be 

improved to 4 points. The Gauss mapping of 
Scherk's surface omits exactly 4 points of S?. 


F. Minimal Submanifolds 


(1) Bernshtein’s problem. As was mentioned in 
Section A, a minimal surface in R? represented 
as the graph of a function on the entire plane 
R? is a plane. This is known as Bernshtein’s 
theorem (1915). This is, however, not true in 
R4, namely, there is a minimal surface defined 
as the graph of a vector-valued function on the 
entire plane, which is not a plane. The gen- 
eralized Bernshtein problem is stated as follows: 
Let f: R"R be a function satisfying the mini- 
mal hypersurface equation 


^ ATI ër 
cl cs ]=0, 
à vis) 
where W 2 ,/1--|Vf P. Is the graph of f an 


affine hyperplane? The answer is affirma- 

tive for n x 7 (E. De Giorgi, F. J. Almgren, J. 
Simons). For nz 8, it is negative (E. Bombieri, 
De Giorgi, E. Giusti). Though the problem 
turns out to be negative for n 2 8, no concrete 
counterexample is known, even for n — 8. 

(2) Minimal submanifolds in the sphere. 
Minimal submanifolds in the unit sphere S" 
have some special properties. For example, 
there is no compact minimal submanifold of 
S" contained in an open hemisphere. On the 
other hand, there is no stable minimal closed 
submanifold in S” (B. Lawson, Simons). 

More generally, any closed minimal hypersur- 
face in a Riemannian manifold with positive 
Ricci curvature is unstable (Simons). As for the 
existence of minimal surfaces in a sphere, the 
following theorem of Lawson is general: Any 
closed surface of any genus, except the real 
projective plane, can be minimally immersed 
into the unit sphere $?. As an analog to Bern- 
shtein’s theorem, a sphere $°, which is mini- 
mally immersed in S°, is necessarily totally 
geodesic. On the other hand, by making use 
of higher-order second fundamental forms and 
the relations among them, a complete descrip- 
tion of the minimal immersions of R? into $" 
has been given by K. Kenmotsu. 

As a general theorem which gives a neces- 
sary and sufficient condition for a Riemann- 
ian manifold to be minimally immersed in the 
unit sphere S”, the following is fundamental: 
An isometric immersion x of an m-dimensional 
Riemannian manifold M into S”, viewed as a 
vector-valued function into R"*!, is minimal if 
and only if Ax 2 mx (T. Takahashi). As its 
immediate application, any m-dimensional 
compact homogeneous space whose linear 
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isotropy group is irreducible can be minimally 
immersed into the n-sphere of curvature 4/m, 
corresponding to any eigenvalue ( #0) of the 
Laplacian, where n+ 1 is the dimension of the 
eigenspace corresponding to 4 (W. Y. Hsiang, 
Takahashi). 

Since for the *symmetric spaces of frank 1 
the eigenvalues and the corresponding eigen- 
spaces of the Laplacian are known, the mini- 
mal immersions of such spaces into the unit 
sphere have been determined as well as the 
rigidity of such minimal immersions (N. Wal- 
lach). For example, if the m-sphere of constant 
curvature c is minimally immersed into the 
unit n-sphere, but not contained in any great 
hypersphere, then for each nonnegative integer 
k, c2 m/k(k--m—1) and n- 1 & 2k -m— 1): 
[(k -- m — 2)!/k!(m — 1)!]. The immersion is rigid 
if and only if m=2 or k <3. Similar results 
have been known for the projective spaces 
over real numbers, complexes, quaternions, or 
Cayley numbers. 

Simons showed that the scalar curvature 
p of a compact minimal submanifold M of 
dimension m in the unit (m + p)-sphere is not 
greater than mim — 1). Furthermore, if p> 
mim — 1) — np/(2p — 1), then either p = mim 1) 
or po m(m— 1) — mp/(2p — 1). In the former 
case, M is totally geodesic and therefore is the 
unit sphere S™. In the latter case, M is either 
a hypersurface of the unit (m + 1)-sphere 
which is isometric to the product S*(./k/m) x 


S"-*(. /(m— k)/m) of spheres of radius ,/ k/m 


and 4/(m — k)/m, respectively, called the gen- 
eralized Clifford torus or the Veronese surface 
in S*. Here the Clifford torus is the torus 

sS! ( /1/2) x S! (/1/2)c s? c R*, and the Ver- 
onese surface is defined as follows: Let (x, y, z) 
be the natural coordinate system in R? and 
(ol. u2,u?,u*, u^) that in R^. The mapping 
defined by 


u! — yz/ /3, u? =zx/,/3, w= xy/ /3, 
u^ — (x? —y)/2,/3, už =(x? + y? —227)/6 


gives an isometric immersion of $2(,/3) into 
S^. Two points (x, y, z and (— x, — y, — Z) of 

S 2(. /3) are mapped into the same point of S^, 
and thus the mapping defines an embedding of 
the real projective plane RP? into S^. This 
embedded real projective plane in S* is called 
the Veronese surface. 

T. Otsuki proved the following: Let M be a 
complete minimal hypersurface immersed in 
the unit (n 4- 1)-sphere with two principal cur- 
vatures. If their multiplicities are m and n— 
mz 2, then M is congruent to the generalized 
Clifford torus S"(. /m/n) x S" DL /(n— m)/n) c 
S" c R"*?. If one of their multiplicities is 1, 
then M is a hypersurface of $"*! in R"'?— 

R" x R? whose orthogonal projection into R? 
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is a curve of which the tsupport function x(t) 
is a solution of the following nonlinear differ- 
ential equation of the second order: 


nx(1 — x?) (d? x/dt?) + (dx/dt)? -- (1 — x?) (nx? — 1) 
=Q. 


Furthermore, there are countably many com- 
pact minimal hypersurfaces immersed but not 
embedded in S"*!. Only S"! (./(n— 1)/n) x 

S! (. /1/n) is minimally embedded in $"*". 
This just corresponds to the trivial solution 
x(t)z1 / fn of the above equation. 

(3) Minimal submanifolds in Riemannian 
manifolds. In general, it is difficult to deter- 
mine all the complete stable minimal sub- 
manifolds in a given Riemannian manifold. If, 
however, some curvature conditions are given 
to the Riemannian manifold, then a classifica- 
tion has been given: Let N be a complete 3- 
dimensional Riemannian manifold with non- 
negative scalar curvature p, and let M bea 
complete, stable, and orientable minimal sur- 
face in N. 

(a) If M is compact, then either M is confor- 
mally equivalent to the *Riemann sphere S? 
or else it is a totally geodesic flat torus. Fur- 
thermore, if p >Q, then the latter case never 
occurs. 

(b) If M is not compact, then M is confor- 
mally equivalent to the complex plane C or a 
cylinder (D. Fischer Colbrie; R. Schoen). 

As a generalization of the procedure for 
generating periodic minimal surfaces in R? 
with octahedral or tetrahedral symmetry 
(Schwarz, 1867), T. Nagano and B. Smyth gave 
a construction procedure to generate periodic 
minimal surfaces in R" or n-tori with sym-. 
metry corresponding to any *Weyl group of 
rank n. 


G. Minimal Varieties 


Recent progress in the study of Plateau's prob- 
lem in higher dimensions is closely related to 
the point of view of geometric measure theory 
and tthe calculus of variations. 

(1) Integral currents. Let T be a tcurrent of 
degree k, or simply a k-current, defined on an 
open set U of R”, and let ôT be the boundary 
of T, the exterior derivative of T. For simplic- 
ity, in the following only currents with com- 
pact support are considered. 

The mass M(T) of a k-current T is the dual 
norm M(T)-—sup(T(q)| M(o)x 1j of the 
comass M(q) on k-forms, which is defined by 
M(o)=sup{l o)l | xe U) with |e(9I = 
sup { (P(X), 0, ^... ^0,» |t,, ..., v, are ortho- 
normal vectors at xe U}. It follows that M(T) 
coincides with k-dimensional volume when T 
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is a k-current defined by integration on a k- 
dimensional simplex. We say that T is normal 
if M(T) and M(?T) are finite. Normal cur- 
rents form a !*Banach space with norm N(T)= 
M(T)+ M(0T). 

À current T is called rectifiable if it can be 
approximated in the mass norm by currents of 
type f, y, where y is a finite polyhedral chain 
with integer coefficients, f:suppy—>U isa 
*Lipshitz mapping of the support of y into U, 
and f,» is the current defined by means of 
fav (9) 9 »Cf *o). If both T and CT are recti- 
fiable, T is called an integral current. Inte- 
gral currents give the appropriate notion of 
generalized manifolds with boundary in the 
study of higher-dimensional Plateau problems. 

One of the fundamental properties of in- 
tegral currents is the compactness, stated as 
follows: Given a compact subset K c U and a 
number c 7 0, the set of integral currents T 
such that supp T c K and N(T)<c is sequen- 
tially compact in the tweak topology. Since 
the mass M(T) is flower semicontinuous in 
the weak topology, it follows that Plateau's 
problem can always be solved in the space of 
integral currents (— Section C; 334 Plateau's 
Problem). The question arises: To what extent 
is this a satisfactory solution to the problem? 
For codimension 1, it is known from the work 
of Federer and others that an integral (n — 1)- 
current of least mass in R" is nonsingular in 
codimension «7. In particular, in R”, n<7, 
every integral (n— 1)-current of least mass is an 
analytic manifold. In general codimensions, it 
is known that the set of regular points is dense 
(Reifenberg, Almgren, and others). 

The definition of rectifiable and integral 
currents carries over to those on a Riemann- 
ian manifold M in a natural way. Then the 
space of integral currents J,(M) on M with the 
boundary operator 0, for which 6? 20, form 
a chain complex. It is then a fundamental 
theorem, due to Federer and Fleming, in 
homological integration theory that there is a 
natural isomorphism H,(L(M)) = H,(M; Z) of 
the homology of the complex of integral cur- 
rents with the integral singular homology 
groups of M. From this it follows that if M isa 
compact Riemannian manifold M, then each 
class xe H,(1,(M)) x H,(M; Z) contains an 
integral current of least mass among all in- 
tegral currents in a. 

For a homology class of codimension 1, it 
has also been proved by Almgren that if M is a 
compact Riemannian manifold of dimension 
x 7, then for each «e H, ,(M; Z) there exists a 
finite collection of mutually disjoint, compact 
oriented minimal hypersurfaces $,, ..., Ar em- 
bedded in M and integers n,, ..., n; such that 
the integral current Xii n;5; represents «. On 
the other hand, it has been recently proved 


1035 


that every compact Riemannian manifold of 
dimension <6 contains a nonempty closed 
embedded minimal hypersurface (J. T. Pitts). 

(2) Varifolds. The theory of integral currents 
developed by Federer, Fleming, and others 
yields reasonable spaces for purposes of the 
calculus of variations. However, it is not en- 
tirely feasible to use these in the study of the 
actual soap films that occur in physical ex- 
periments. It turns out that we should work 
in a more set-theoretic fashion and give up 
the notion of orientability and the boundary 
operator. A convenient theory for describing 
these physical experiments is the theory of 
varifolds developed by Almgren. 

A k-dimensional varifold on a Riemann- 
ian manifold M is a *Radon measure on the 
bundle G,(M) of unoriented tangent k-planes 
of M. For simplicity, only varifolds with com- 
pact support are considered. These are re- 
garded as continuous linear functionals on the 
space C(G,(M)) of continuous functions on 
G,(M). Thus a k-dimensional submanifold S of 
finite volume embedded in M determines a 
varifold V, by integration: 


«n- | STS AHX), —feC(G,(M)), 
S 


where #* denotes the k-dimensional tHaus- 
dorff measure on M. More generally, to 

any rectifiable current there corresponds an 
underlying varifold obtained by neglecting 
orientations. 

Let V,(M) denote the space of k-dimensional 
varifolds on M. Given a varifold Ve V,(M), the 
mass M(V) is defined to be the V-measure of 
the total space, i.e., M(V)- HO), Let f:M—M 
be a C'-mapping. Then f induces naturally a 
mapping f, of V,(M) into itself. In particular, if 
X is a smooth vector field on M with asso- 
ciated flow f,, then there is a smooth function 
M (t)— M(f,, V). A varifold Ve V,(M) is then 
called a k-dimensional minimal variety in 
M if the first variation (d/dt) Mil, zs D for 
all smooth vector fields on M. For example, 
minimal submanifolds, complex analytic sub- 
varieties of Káhler manifolds and integral 
currents of least mass are minimal varieties. 

An appropriate notion of rectifiable and 
integral varifolds 1s defined, and an analog of 
the compactness theorem can be obtained. As 
a consequence, it was proved by Almgren 
using "Morse theory methods that if M is an n- 
dimensional compact Riemannian manifold, 
then for each p, 1 € p n— 1, there exists at 
least one minimal variety of dimension p. As 
for the regularity of minimal varieties, it is 
known that if V is a k-dimensional variety, 
then in the support of V there is a relatively 
open dense subset that is a regular minimal 
submanifold of dimension k (W. K. Allard). 
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H. Maximal Hypersurfaces in Minkowski 
Space 


Let L"*! be a Minkowski space, Le, L"*1 = 
(ten, x t)GG,  Xx)eR teRj with the 
tLorentzian metric 27. , (dx? — (dt. Let M 
be a *spacelike hypersurface of L"*! so that 
the induced metric on M is Riemannian. If the 
mean curvature vector field H of M, defined 
in the same way as in the Riemannian case, 
vanishes identically, then M is called maximal. 
In contrast to the Riemannian case, by the 
variation in the normal direction of H, the 
volume increases, provided H #0. However, 
the equation describing a maximal hyper- 
surface is similar to that for the Riemannian 
case (— Sections D (1), F (1)). Indeed, let D be 
a domain in R” and f: D5R a smooth func- 
tion. Then the graph of f in L"*! is a maximal 
hypersurface if and only if f satisfies the fol- 
lowing quasilinear partial differential equation 
of the second order: 

n A2 n 
u emeng a Y 
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This is *elliptic for |Vf |< 1, since M is space- 
like. E. Calabi proposed a problem similar to 
Bernshtein’s in 1968. Contrary to Bernshtein’s 
case, the answer is affirmative for any n. 
Namely, a maximal hypersurface in L"*! de- 
fined as the graph of a function on the entire 
space R" is a hyperplane in R"*!. More gener- 
ally, a maximal hypersurface, which is a closed 
set in L"*!, is a hyperplane (S. Y. Cheng and 
S. T. Yau). It is also known that any maximal 
hypersurface in L"*! is stable. 
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A. Language 


Every mathematical theory has an appro- 
priate language. To determine a language for a 
theory means to determine a language for the 
related mathematical system. Such a language 
consists of the following symbols (the symbols 
given here are examples of only one notational 
system). 

(1) Symbols that express logical concepts 
(logical symbols): V, 3, 1, ^, v, >; 

(2) free variables: ao, a4, a5, ...; 

(3) *bound variables: xo, x4, X5, ...; 

(4) symbols that denote individual objects 
(individual constants): Co, C1, €2, ....€,, -3 

(5) function symbols: fo, fis fa, 

(6) tpredicate symbols: Po, Pi, P,, .... P,, .... 

The *cardinalities of the sets of symbols in 
(4), (5), and (6) are arbitrary, except that there 
must be at least one predicate symbol. It is 
assumed that each set of symbols is "well 
ordered. Also, it is understood that to each f; 
in (5) there corresponds a positive integer i; 
while to each P, there corresponds a nonnega- 
tive integer (these integers are called the num- 
ber of arguments of f; and P, respectively). 

In practice, other kinds of languages are 
also dealt with. One example is a system with 
infinitely long expressions, which permits 
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*transfinite ordinals for the numbers of argu- 
ments of f, and P;, and which has the extended 
concepts /\,<p, Va<p 3x; ... 3x, ... for a < fi, 
and Vx, Vx, ... Vx, ... fora < fi, where x and f 
are transfinite ordinals. Another language 
includes variables of higher *type as well as V 
and 3 over those variables. Free and bound 
variables may not be distinguished typograph- 
ically. In that case a variable not bound by V 
or 3 in a fformula ts called free. (The notion of 
a formula is defined later.) To simplify this 
discussion, however, we restrict ourselves to 
the tfirst-order predicate language with a 
typographic distinction between free and 
bound variables. We also assume that there 
are only a countable number of variables, and 
hence we use natural numbers as subscripts. 

Set L, = (logical symbols}, L; = (a9,4;, 
dise Dig E Matec T ey pee as 
C3, $s Ls = {Jos fis fas] Le {Po Pi, 
Poa ba Ly, bee Labais Ee TO 
determine a language L ts to specify such a list 
L. Since V, 3, 1, ^, v, 5, are normally used 
for L1, do, 41, a5, ... for L;, and xe, x,, 

X5, ... for L4, we may assume that these are 
fixed, and hence to determine a language is to 
determine (L4, Ls, L9». We take (L,, La, L4» 
just described and assume an arbitrary but 
fixed L= (L4, L5, Lg». First we define the 
notions term of L (or L-term) and formula of 
L (or L-formula). 

Definition of the terms of L (L-terms): (1) 
Each free variable a; is an L-term. (2) Each 
individual constant c; of L is an L-term. (3) If f; 
is a function symbol of L, i; is the number of 
arguments of fj, and each of t,, ..., d is an L- 
term, then f,(¢,,..., ti) is also an L-term. (4) 
The L-terms are only those constructed by (1)— 
(3). 

A term that does not contain a free variable 
is called a closed term. 

Definition of the formulas of L (or L- 
formulas}: (1) Let P, be a predicate symbol of L 
and i; be the corresponding natural number. If 
each of t,, esti, is an L-term, then P/(t,,..., ti) 
is an L-formula. This type of formula is called 
a prime formula (or atomic formula). (2) If A 
and B are L-formulas, then each of (A), 

(A) A(B), (A) v (B), and (4)-(B) ts an L- 
formula. (3) Let F be an L-formula and x; be a 
bound variable that does not occur in F. Then 
an expression obtained by putting {  ) around 
F, replacing some occurrences in F of a free 
variable, say a,, by x;, and prefixing Vx; or 3x; 
is an L-formula. (4) The L-formulas are only 
those constructed by (1)-(3). 

A formula that has no occurrence of a free 
variable is called a closed formula. The paren- 
theses used in the formation of a formula may 
be omitted if no ambiguity arises thereby. 
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B. Structures 


Let L be a specific language as described in the 
previous section. Then Jt 2 [ M :p;6o;1] defined 
by (1)-(4) below is called a structure for L (or 
L-structure). 

(1) M is a nonempty set. (M is called the 
universe of Ji.) 

(2) p is a mapping from L, into M. 

(3) Let L5 — | f;|the number of arguments 
of f, is i}. Then L; - LEUL2U...U LSU... 
provides a partition of L;. Let %, be the set of 
all mappings from M' = Mx... x M (i times) 
into M and o; be a mapping from L5 into §;. 
We define o for an arbitrary f of L; by a(f) 
=o, f), where i is the number of arguments of 
f. Then c is obviously a mapping from L, into 

Zu Yi 

(4) Decompose Lg into LLU LL...U LSU... 
as in (3). Let P, be the set of all subsets of M' 
and r be a mapping from Li, into P,, where P, 
is the set (M, Ø} (Ø is the empty set). Then « 
is defined for every i and for an arbitrary P of 
L by t(P)=1,(P). 

If we denote p(c) by c, ot f) by f, and «(P) by 
P, then we may understand that p is repre- 
sented by ĉo, €,, ..., 0 is represented by fo, 
fi,- and t is represented by P5, P,, .... 
Therefore Mt is normally expressed as 


M=[M Sess oca fos Jis; BP d 


C. Satisfiability 


We fix not only a language L but also a struc- 
ture M for L. Then the property that an L- 
formula is satisfiable is defined by the follow- 
ing procedure: 

Let m, n, ... stand for !'sequences of the 
elements of M, say (mg, m,, ...), (no, My, ...),..., 
called 9ü-sequences. We write m =n to indi- 
cate that each entry of m except the ith one is 
equal to the corresponding entry of n. Using 
these concepts, the value of an L-term at an 
Wi-sequence m, denoted by t[m], is defined as 
follows: 

(1) If t is a free variable a,, then t[m] = m;. 

(2) If t is an individual constant c;, then 
t[m] 7c; 

(3) If t is of the form f(t,,...,t,), then t[m] = 
fitim], .... t [m]). If £ is an L-term, then 
evidently t [m] is an element of M. 

Based on this definition of t [m], the relation 
A is satisfiable by m in Mi, denoted by M, m = 
A, is defined for an arbitrary L-formula A 
and an arbitrary 3Ji-sequence m as follows: 

(1) M, m E Pt, ... t) C Em], ... t Ent]? 
eP, 

(2)M@, mE IBM, m E- B is false. 
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(3) M, mE BAC 3, mB and M, 
mec. 

(4) Mt, mE Bv CM, m E B or W, 
mC. 

(5) M, mE B5 C M, mB implies W, 
mEC. 

(6) Mi, m E Vx; F(x) «» WM, n F(a,) for an 
arbitrary n that satisfies m =n, where a, has 
the least index among the free variables that 
do not occur in F(x;). 

(7) Mt, m E: 3x; F(x;)<> there exists an n such 
that n 5 m and M, n j F(a,), where a; satisfies 
the same condition as in (6). 

Following are some consequences of this 
definition. 

(1) For an arbitrary L-formula A and an 
arbitrary M-sequence m, exactly one of W, 
mA and M, mE 14A holds. 

(2) Let a; , ..., a; include all free variables 
that occur in A, and let m and n be WM- 
sequences for which m; =n;,, ...,mj,=Nj.. 
Then W, m E A and WM, n F A are equivalent. 
So we may write M} A “iy d | instead 
of M, m E A, for any formula A whose free 
variables are among ajs Die and any WM- 
sequence m. 

(3) If A is a closed formula, then for an 
arbitrary pair of sequences m and n, M, mE A 
and Mi, n [- A are equivalent. Therefore, for a 
closed formula A, we may express the state- 
ment “for some (or, equivalently, for all) W- 
sequence m, Jt, mE A holds" by ME A. 

(4) Let a; be an arbitrary variable that does 
not occur in VxF(x) or 3xF(x). Then W, 

m E- Vx F(x) is equivalent to W, n E F(a;) for 
an arbitrary n such that n=m. Likewise, W, 
m E: 3xF(x) is equivalent to the statement 
that there is an n such that n =m and M, 


n E F(a). 


D. Models 


Here again we fix a language L. Let A bea 
closed L-formula and W an L-structure. If 

M H A, then M is called a model of A. 
Furthermore, if I'— (A6, A4,, ....] is an arbi- 
trary set of closed formulas and Wt A; for all 
A, in T, then the structure W is called a model 
of I. 

(1) Consistency. Consider a logical system 
whose language is L. If there is a model of the 
set of all provable closed formulas of the sys- 
tem, then the system is *consistent. In partic- 
ular, the *first-ordered predicate calculus is 
consistent. 

(2) Completeness. A logical system is said to 
be complete if every closed formula that is 
satisfied in every structure is provable in the 
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system. In particular, the first-order predicate 
calculus is complete. 

K. Gódel proved (2). Later L. Henkin gave 
an alternative proof whose essential idea con- 
tributed to proving the following proposi- 
tion: If a set I' of closed L-formulas is con- 
sistent, then there is a model of F. Henkin 
also introduced a (nonstandard) second-order 
semantics, relative to which the *second- 
order predicate calculus is complete. This 
can be shown by extending Henkin's tech- 
nique (for the first order) to the second-order 
language. 

(3) Here we extend the language slightly by 
adding the second-order free predicate vari- 
ables af, 65, ...,07, ... (n— 1,2, ...) and the 
second-order bound predicate variables oi. 
$5, ..., Qr, ... (a2 1,2, ...), where n indicates 
the number of arguments of a variable. Other- 
wise the definition of the language is the same 
as for the case of the first-order predicate 
calculus. For simplicity, however, we assume 
that there are no individual constants, function 
symbols, or predicate symbols. 

The structure is defined as follows: Put Dt= 
[M:$,, $5, ..., $,,... ]; where M is a nonempty 
set and S, is a set of subsets of M x ... x M (n 
times). An 9Jt-sequence m is defined as before, 
and s, denotes (57,55, ..., S7, ...), where each s? 
is a member of S,. The concept of satisfiability 
is defined as follows: 

M, (m, 5,, ...,5,, ...) Poi xi) 
proces. 

M, (m, $1, ...,5,, ...) EVEk Alp) for 

an arbitrary s, for which s, Ż Sp M, (M, s,,..., 
Sp --- ) = A(aj), where aj has the smallest index 
among the free predicate variables that do not 
occur in A(@j). 

M, (m, a, Sa lb Igy A (Qj) > there 
exists an e, such that a, 5s, and M, 

(m, $,, ...,5,, ...) = Aa, where oj satisfies 
the same condition as in the previous clause. 

Satisfiability for other cases is defined as for 
first-order predicate language. A structure W 
is called normal if all axioms of the second- 
order predicate calculus are true in W. 

Completeness of the second-order predicate 
calculus: Every closed formula that is satisfi- 
able in all normal structures is provable in the 
second-order predicate calculus. 

(4) Let the cardinality of L, be t, and F be 
an arbitrary set of closed L-formulas. If I 
has a model, then I' has a model of cardinal- 
ity max (t, Xo). This follows from Henkin's 
method. Historically, however, it was first 
proved by Th. Skolem and L. Lówenheim for 
a special case, and was later generalized by A. 
I. Mal'tsev and A. Robinson. 

(5) The following results are all due to A. 
Tarski and R. L. Vaught. 
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Definition 1. Two L-structures W and 9t are 
said to be elementarily (arithmetically) equiva- 
lent if for an arbitrary closed L-formula A, 
ME ANE A. 


Definition 2. Let 


3 -[M:4o. di Gor 01:5 Qo. Qis] 


and 
9t - [N :ro, Fia hos ps oss] Ro, Rs] 


be two structures. Wt is an elementary exten- 
sion of 9i if the following two conditions are 
satisfied: (i) M >N; q;2r;(j 20,1, ...); the 
restriction of g; to N is identical to h; (j= 

0, 1, ... ; the restriction of Q, to N is identi- 

cal to R; (j —0, 1, ...). (If this condition holds, 
then I is said to be an extension of 91.) (ii) For 
an arbitrary L-formula A and an arbitrary 9t- 
sequence n, if R, nm A then W, nH A. 


Theorem 1. Let M be an extension of 9t. A 
necessary and sufficient condition for Mt to be 
an elementary extension of 9t is that for an 
arbitrary L-formula of the form 3xF(x) and 
an arbitrary 9i-sequence n, if Mi, n HE 3x F(x), 
then there is some element n of N such that 
for the Nt-sequence m for which m +n and 
m; —n, W, m E- F(aj), where a; is an arbitrary 
free variable that does not occur in F(x). 


Theorem 2. Here we place a condition on L 
that each set of symbols be at most countable 
and arranged in the w-type (— 312 Ordinal 
Numbers). Let the cardinality of the universe 
M of M be an infinite cardinal a, M’ bea 
subset of M of cardinality c, and b be an in- 
finite cardinal that satisfies c <b <a. Then 
there exists an L-structure 9t whose universe 
N has cardinality b and such that M'c N and 
Wt is an elementary extension of 9t. 


Theorem 3. Suppose that L satisfies the same 
condition as in Theorem 2. Let the cardinality 
of the universe M of M be a (a is an infinite 
cardinal) and b be a cardinal for which a <b. 
Then there exists an L-structure 9t that is a 
proper elementary extension of I and whose 
universe has cardinality b. 


E. Ultraproducts 


Assume that for a set of L-structures X and a 
set of indices J, there is a mapping 0 from I 
onto 2. If « is a member of J, W is a member 
of L and 0(a) 2 9Ui, then Nt may be denoted 
by WM”. It should be noted that there may be 
more than one x corresponding to the same 
structure. If D is a *maximal filter of I and 0" 
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1s expressed as 
gt -[M*:e?, "Te. PY... J, 
then TL. M* is defined by 


[[M*-ío|o is a mapping from 


ael 


I into | ] M*, where q(x)e M^. 


acl 


For any two elements o and v of [I e; M*, 
p 2 y is defined by 


eS ye (a1 pla) diti € D. 


Then o3 V is an equivalence relation between 
the elements of [T,., M". Furthermore, the set 
IL., M* partitioned by 2 is expressed by 
IL,M*/D, and each element m of TI e, M"/D 
is expressed by m — [q], where ¢ is a repre- 
senting element of m. 

Next we define an operator that produces a 
new structure from 2. Put M — IT, M?/D. 
For an individual constant c of L, let c — [4], 
where o(x) — c* for every a. For an n-ary 
function f of L and arbitrary elements m, — 
[9,]. ....m,—[9,] of M, define f(m,, ..., 
m,) = [V], where (x) = f "(ex (a), ... 9, (2)) for 
every x. For an n-ary predicate P of L, define 
(m,,...,m,»eP by 


nm, m, EP 


< {a| Co, (a), 15 Q,(%)>€P*} ED. 


According to these definitions, put 

9 —[M:e, ..., f,- P,...] 

and denote it by TL. St*/D, called the ultra- 
product of {WM} <; (with respect to D). M is an 
L-structure. 


Fundamental Theorem of Ultraproducts. Let 
39 = TL. 9U/D be the ultraproduct of 
(9901,.,, m 2 (m,,m;, ...) be an M-sequence, 
q; be a representing element of m;, and A 

be an arbitrary formula. Then It, m = A <> 
{al MR, (9, (9), pala), ...) A] € D. 

By using this fundamental theorem we have 
the following result: Suppose that I is a set of 
closed formulas in L such that every finite 
subset of I has a model. Let I be the set of all 
the finite subsets of I. For each «e7 and each 
AET, let 9% be a model of « and A the set of 
all the finite sets in / which contain A as a 
member. Let F={A|AeT}. Since F has the 
finite intersection property, there is a maximal 
filter D such that D 2 F. Let M be the ultra- 
product TL. 9V/D. Since A is a subset of the 
set {a| M" H A} and A belongs to D, the set 
{a| M" H A} belongs to D, for each Aer. 
Hence we have that M is a model of T, by 
the foregoing fundamental theorem of ultra- 
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products. Therefore, we have the following 
theorem. 


Compactness Theorem. A set F of closed for- 
mulas has a model if and only if every finite 
subset of it has a model. 

In the case where all the structures IN 
coincide with the single structure Jt, the ultra- 
product of (900*1,.., (with respect to D) may be 
written WI (D and called the ultrapower of N 
(with respect to D). 

Let 


3 —[M; go; Qi. Jos gie Qo. Diss] 


and 
N=[N; Tg, Pisces figs ss Ro, R,, Sa 


be two structures. W and R are said to be 
isomorphic if there is a bijection f from M to 
N such that the following three conditions 
hold: (i) f(qo) 2 rg, f(q1) —r,, .... (ii) The se- 
quences go, g,, ... and ho, h,, ... are of the 
same type and f(g;(a,, -a =h; flai), 
f(a,)) holds for every n-tuple a,, ...,a, in M. 
(ii) The sequences Qo, Q,,... and Ro, R,,... 
are of the same type and R; ={< f(a,), .... 
fía)» |. ....4, € Qi}. 

Let j be the function from N to N!/D de- 
fined by j(a) 2 [«,] for each ae N, where 9, is 
the constant function from J to N such that 
Q,(x) =a for each «eI. Let M be the substruc- 
ture of 9t!/D whose universe is the range of j. 
Then j is an isomorphism of 9 to Mt. In the 
following we identify a and j(a) for each ae N. 
Then K is an elementary substructure of 9t'/D 
by the fundamental theorem of ultraproducts. 

If Mt and N are isomorphic, then M and N 
are elementarily equivalent. By using this fact 
and the fundamental theorem of ultraproducts, 
we have the following result. Let W and R be 
two structures. If there is a nonempty set J and 
a maximal filter D on I such that 9t"/D and 
9t'/D are isomorphic, then Mi and N are 
elementarily equivalent. H. J. Keisler proved 
the converse of this proposition by using the 
G.C.H. (generalized continuum hypothesis), 
and later S. Shelah proved it without the 
G.C.H. Keisler-Shelah isomorphism theorem: 
Let M and N be two structures. Then WM and 
N are elementarily equivalent if and only if 
there is a nonempty set J and a maximal filter 
D on I such that WM'/D and N'/D are 
isomorphic. 

The ultraproduct operation has various 
applications in number theory, algebraic geo- 
metry, and analysis. Here we give an example 
due to J. Ax and S. Kochen. Let P be the set of 
prime numbers. Let Q, and Z,((t)) be the field 
of p-adic numbers and the field of formal 
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power series over Z,— (0,1, ..., p— 1} for each 
p in P, respectively. Ax-Kochen isomorphism 
theorem: Suppose that D is a nonprincipal 
maximal filter on P. Then JJ,,-pQ,/D and 
II,» Z,(()/D are isomorphic. 

As an immediate consequence of this theo- 
rem, we have the following partial solution of 
Artin's conjecture on Diophantine equations. 
Theorem: For each positive integer d, there 
exists a finite set Y of primes such that every 
homogeneous polynomial f(t,, ...,t,) of degree 
d over Q,, with n d?, has a nontrivial zero 
in Q, for every pé Y (— 118 Diophantine 
Equations). 

We give another example in nonstandard 
analysis. A. Robinson developed the general 
theory of nonstandard analysis in [10]. Here 
we explain a theorem due to A. R. Bernstein. 
Let X be a nonempty set and U(X) be the 
smallest transitive set (i.e., aeb and be U(X) 
imply ae U(X)) which has X as a member and 
is closed under the following operations: pair- 
ing, union, power set, and subset operation 
(De, ae U(X) and bc a imply beU(X)). Let L 
be the first-order predicate logic with equality 
whose set of nonlogical constants consists of a 
binary predicate symbol e and individual 
constant symbols c, for ae U(X) (— 411 Sym- 
bolic Logic F). Then the first-order structure 
9t Z[U(X)y a(aeU(X)y E] is an L-structure, 
where E is the e relation on the set U(X). Let 
9t - [U(X)'/D:a(ae U(X)); E'/D] be the 
ultrapower 9t//D of N with respect to a non- 
principal ultrafilter D on a set J. For each 
ae U(X), let a* be the set of all elements [9] in 
U(X)'/D such that [ieI|o(i)ea] e D. Then a 
is a proper subset of a* if a is infinite. Since N 
and M are elementarily equivalent, these two 
sets a and a* have common first-order prop- 
erties in the following sense: for each formula 
(ap) in L, 


(Vbea)(3i E- b[7»]) (vb Ea*) (ME [7 ]). 


From this it follows that if r is a relation on a 
set a in 9t, then r* is a relation on the set a*; 
and if f is a mapping from a to b in 9i, then f* 
is a mapping from a* to b*. Hence a* isa 
mathematical object which greatly resembles 
a. By using this type of resemblance between a 
and a* we have the following result. 

Let H be a Hilbert space over the complex 
number field C such that dim(H)— c and let T 
be a bounded linear operator on H. Let X — 
HUC and consider the first-order structure 
M as above. Since R (the set of all real num- 
bers) and N (the set of all natural numbers) are 
infinite sets which belong to U(X), R* and N* 
have elements which do not belong to R and 
N, respectively. Such elements are called non- 
standard real numbers and nonstandard natural 
numbers, respectively. By the fundamental 
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theorem of ultraproducts we can conclude that 
there are many nonstandard real numbers « 
such that 0 « *a < *a in R* for any ae R. Such 
a nonstandard real number a is called an 
infinitesimal real number. Since the norm 
operator | || is a mapping from H to R, 

|| l*1isa mapping from H* to R*. If || x||* 
(xe H*) is infinitesimal, then x is said to be 
infinitesimal in H*. Let S be the set of all linear 
subspaces of H. For a linear subspace K of H* 
that is contained in S* let K^ be the set of 
elements xe H such that x — x, is infinitesimal 
for some x, in K. Then K^ is a closed linear 
subspace of H. Let e= {e;};en be an ortho- 
normal basis of the Hilbert space H; e can be 
considered as a mapping from N to H, and 
hence e* = Leen, is a mapping from N* to 
H*. For each je N*, let H, be the linear sub- 
space of H* spanned by {e,|k <j}. For a given 
bounded linear operator T on H, T* isa 
linear operator on H*. We define T, - P,T* P, 
where P, is the projection from H* to H;. Since 
dim(Hj) —j is a (nonstandard) natural number, 
there exists a tower Jo; C Jie... GJ H; of 
closed, T-invariant linear subspaces of H; such 
that dim(J,)) — k(k <j). Then J^ is a closed, 
T-invariant linear subspace of H. If there is a 
polynomial p(x) such that p(T) is a compact 
operator, then we get a nonstandard natural 
number j such that de^ is a proper subspace 
of H for some k <j. This gives the follow- 

ing result, which is an affirmative solution of 
a problem of K. Smith and P. R. Halmos. 
Theorem (Bernstein [4]): Let T be a bounded 
linear operator on an infinite-dimensional 
Hilbert space H over the complex numbers 
and let p(x) # 0 be a polynomial with complex 
coefficients such that p(T) is compact. Then T 
leaves invariant at least one closed linear 
subspace of H other than H or {0}. 


F. Categoricity in Powers 


Let T be a set of closed formulas in a first- 
order language L which has a designated 
binary predicate symbol P, In the following, 
we assume that the interpretation P, of P, by 
Mi is the equality relation on the universe of 
Mi for every L-structure Mt. T is said to 

be categorical if all the models of F are iso- 
morphic. By Theorem 3 in Section D, any I 
having a model of infinite cardinality is not 
categorical. Hence, there exists no interesting 
I which is categorical. Therefore, we consider 
the weaker notion of categoricity in powers. 
Let x be an infinite cardinal and nl, x) be the 
number of nonisomorphic models of T of 
cardinality x. Then T is said to be categorical 
in x if n(T, x)= 1, i.e., if all the models of T of 
cardinality x are isomorphic. There exist many 
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interesting (e which are categorical in «x for 
some x. For example, the set of axioms of 
algebraically closed fields of characteristic 0 is 
categorical in N,, and the set of axioms of 
dense linear orderings without endpoints is 
categorical in Ra, With respect to this notion, 
J. Los conjectured that if I ts categorical in x 
for some «> L (the cardinality of L), then T is 
categorical in x for all «> L. This conjecture 
was solved affirmatively by M. Morley in the 
case L —- X,, and later by S. Shelah in the 
general case. Theorem (1): Let I^ be a set of 
closed formulas in L. Then T is categorical in 
k for some x> L if and only if T is categorical 
in x for all x > L. We also have the following 
interesting theorem, due to J. T. Baldwin and 
A. H. Lachlan. Theorem (2): Let I^ be a set of 
closed formulas in L such that L 2 No. If T is 
categorical in N,, then nl, N9)- or No. 

As for n(T, x) there are two famous conjec- 
tures due to R. Vaught and others. A set F of 
closed formulas in L is called complete in L if 
it has a model and, for any closed formula A 
in L, either del or "del, 


Conjecture 1. There is no complete set F of 
closed formulas in L such that No « n(T, Xo) < 
250, 


Conjecture 2. There is no finite set I of closed 
formulas in L such that nl, Salz No and 
n(T, N,)— 1. 


G. Omitting Type Theorem 


By do-formulas (or a-formulas), we mean for- 
mulas which have no free variables except ao. 
Suppose that W is an L-structure and X is a 
set of a-formulas in L. Then, W realizes X if 
there is an element m of the universe of WM such 
that M H A [f] for all A in X, and M omits £ 
if M does not realize X. For example, if L is a 
first-order language such that L4 = (c9, c]. 
Ls={ fo, fi]; Le — (Po, Pi}, where fo, fi, Po. 

P, are all binary, and 9t is the L-structure 
[350, 1; +,x; =, <], where 9t is the set of 
natural numbers, and X = {P,(n,a)|n=0, |, 
2,...}, where 0=cy, 1=f,(0,c,),...,n+1 = 
Jfa, c), ..., then clearly 9t omits X. Also, if WM 
is an L-structure, m is an element of the uni- 
verse of WM, and Li Al A is an a-formula 
such that M H A[7,] ), then clearly W realizes 
Xm, Where Z, is called the type of m in WM. 
Suppose that T is a set of closed formulas in L. 
Then, by the completeness theorem of L, we 
can easily see that F has a model realizing Z if 
and only if I" X is consistent. On the other 
hand, it is rather difficult to obtain a necessary 
and sufficient condition for I to have a model 
omitting X. The following is a sufficient con- 
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dition: T is said to locally omit £ if there is no 
a-formula A such that I^ (Aj is consistent 
and, for any formula B in X, I"(A, 1B] is 
not consistent. 


Theorem. Suppose that T is a consistent set of 
closed formulas in a countable language L, 
and X is a set of ag-formulas in L. If T locally 
omits 2, then I' has a countable model which 
omits 2. Also, if I' has a model of power 
greater than 1; omitting 2 for each č «o, 
then I has a model omitting Z in each infinite 
power, where 2, is defined by Ip = Ba, 2,,, = 
27, 1, 2 sup, .., Q, if o is a limit ordinal. 


References 


[1] J. Ax and S. Kochen, Diophantine prob- 
lems over local fields I, Amer. J. Math., 87 
(1965), 605—630. 

[2] J. Ax and S. Kochen, Diophantine prob- 
lems over local fields II, Amer. J. Math., 87 
(1965), 630—648. 

[3] J. Ax and S. Kochen, Diophantine prob- 
lems over local fields HI, Ann. Math., (2) 83 
(1966), 437—456. 

[4] A. R. Bernstein, Non-standard analysis, 
Studies in Model Theory 8, M. D. Morley 
(ed.), Mathematical Association of America, 
1973, 35-58. 

[5] C. C. Chang and H. J. Keisler, Model 
theory, North-Holland, 1973. 

[6] S. C. Kleene, Introduction to metamath- 
ematics, Van Nostrand, 1952. 

[7] M. Machover and J. Hirschfelt, Lectures 
on non-standard analysis, Lecture notes in 
math. 94, Springer, 1969. 

[5] M. Morley, Categoricity in powers, Trans. 
Am. Math. Soc., 114 (1965), 514—538. 

[9] M. Morley, Omitting types of elements, 
The Theory of Models, J. W. Addison et al. 
(eds.), North-Holland, 1965, 265—273. 

[10] A. Robinson, Non-standard analysis, 
North-Holland, 1966. 

[11] G. E. Sacks, Saturated model theory, 
Benjamin, 1972. 

[12] S. Shelah, Classification theory, North- 
Holland, 1978. 


277 (11.23) 
Modules 


A. General Remarks 


In this article, we consider mainly modules 
with operator domain (Section C), in partic- 
ular modules over a fring. Modules over a 
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field are linear spaces (— 256 Linear Spaces). 
Modules over a commutative ring are impor- 
tant in algebraic geometry (— 16 Algebraic 
Varieties, 67 Commutative Rings, 284 Noe- 
therian Rings). The theory of modules over a 
‘group ring can be identified with the theory 
of linear representations of a group (— 362 
Representations). Modules without operator 
domain may be regarded as modules over the 
ring Z of rational integers, and the theory of 
finitely generated Abelian groups can be gen- 
eralized to the theory of modules over a tprin- 
cipal ideal domain (— 52 Categories and 
Functors, 200 Homological Algebra). 


B. Modules 


A module (without operator domain) is a tcom- 
mutative group M whose law of composition 
is written additively: a +b =b +a (a, be My; the 
identity element is denoted by 0, and the 
inverse element of a by —a. Every subgroup H 
of M is a normal subgroup. For any ae M, the 
left and right cosets of H containing the ele- 
ment a are identical: H+a=a+H (— 190 
Groups A). 

In the set N of all mappings of a set M to 
a module N, we define an addition by the sums 
of values: (f - g)(x) f(x) - g(x). Then NM 
forms a module. The set Hom(M, N) of all 
homomorphisms of a module M to a module 
N forms a subgroup of the module NN. called 
the module of homomorphisms of M to N. The 
composite of homomorphisms is a homomor- 
phism. Hence the set Hom(M, M)=&(M) of all 
endomorphisms of M forms a fring with re- 
spect to the addition and the multiplication 
defined by composition; this is called the endo- 
morphism ring of M. The *unity element of 
&(M) is the identity mapping of M, and the 
tinvertible elements of &(M) are the automor- 
phisms of M. 

Let (x;);.4 be a family of elements in a 
module M. The sum $e x; is well defined if 
x4, =0 (AEA) except for a finite number of 2. 
For any family (N;1;.4 of subsets of M, Dae, 
N, denotes the set of all elements of the form 
Daca X, (xE N,), where x; 20 except for a 
finite number of 4. If all the N, are subgroups 
of M, then N =$ ea N, is also a subgroup, 
called the sum of ( N;);.,. If every element of 
N can be written uniquely in the form Änt 
(x, € Nj), N is called the direct sum of ( N,);.,. 
When the N, are subgroups, this is equivalent 
to the condition that N,N Za aen N, — {0} 

(4€ A). 


C. Modules with Operator Domain 


Suppose that we are given a set A and a 
module M. If with each pair of elements ae A 
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and xe M there is associated a unique element 
axe M satisfying the condition (1) a(x + y) - 
ax +ay (ae A; x, ye M), we say that A is an 
operator domain of M and M is a module with 
operator domain A (module over A or A- 
module) (— 190 Groups E). The mapping 

Ax MM given by (a, x) 5ax is called the 
*operation of A on M. Any ae A induces an 
endomorphism ay:x-ax of M as a module 
(not as an A-module). To give the structure of 
an A-module to a module M amounts to 
giving a mapping A-6(M) (aay). 

If N is a subgroup of an A-module M such 
that axe N for any ae A and xe N, then N 
forms an A-module, called an A-submodule (or 
allowed submodule) of M. If ( N;);., is a family 
of A-submodules of an A-module M, then the 
intersection ( ;.4 N, and the sum X. N, are 
both A-submodules of M. 

Let R be an *equivalence relation in an A- 
module M such that if ae A and xRy, then 
axRay. Then R is said to be compatible with 
the operation of A. In this case, an operation 
of A 1s induced on the quotient set M/R. 
Moreover, if R is compatible with the addition, 
namely, xRx' and yRy' imply (x + y) R(x' + y’), 
then M/R forms an A-module, called a factor 
A-module of M. The equivalence class N con- 
taining 0 is an A-submodule of M, and M/R 
coincides with M/N. 


D. Modules over a Group or a Ring 


If a (multiplicative) group structure is given to 
the operator domain A of a module M, we 
always assume (in addition to condition (1) 

in Section C) that the following two condi- 
tions are satisfied: (2) (ab) x = a(bx); (3) 1x ^x 
(a, be A, xe M). 

If a ring structure is given to A, we always 
assume (besides conditions (1) and (2)) that the 
following condition holds: (4) (a+ b)x 2 ax - bx 
(a, be A, x € M). This means that the mapping 
A—é6(M)(a—ay)is a tring homomorphism. If 
the ring A has unity element 1, and 1,, 2 iden- 
tity mapping (namely, condition (3) holds), 
then the A-module M is called unitary. We 
consider only unitary A-modules. Any module 
M can be regarded as a Z-module or as an 
&(M)-module. 

When M is a module over a ring A, an 
element of A is called a scalar, A itself is called 
the ring of scalars (basic ring or ground ring), 
and the operation A x M — M is called the 
scalar multiplication. The elements ax (ae A) 
are called scalar multiples of x, and the totality 
of these elements is denoted by Ax. Let (x;);.4 
be a family of elements in M. An element of 
the form 3,.4a;x;, where the a, are elements 
of A and equal to 0 except for a finite number 
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of A, is called a linear combination of (x;);.4. 
The set N of linear combinations of {x,},., is 
the smallest A-submodule of M containing all 
the x, (AeA) and is equal to the sum ©... AX. 
The A-module N is said to be generated by 
(xl and {x,},., is called a system of gen- 
erators of N. A module having a finite number 
of generators is said to be finitely generated (of 
finite type or simply finite). The module Ax 
generated by a single element x is called mono- 
mial. If A is a *field (which may be noncom- 
mutative), an A-module is a linear space over 
A (— 256 Linear Spaces). 

Let a be an element of an A-module M. If 
there exists a nonzero divisor 4 of A such that 
4Aà —0, then a is called a torsion element of M. 
We say M isa torsion A-module if every ele- 
ment of M is a torsion element, and M is 
torsion free if M has no torsion element other 
than 0. An element a of M is called divisible if 
for any nonzero divisor 4€ A there exists an 
element be M such that a — Ab. M is called a 
divisible A-module if every element of M is 
divisible. 

Strictly speaking, the A-modules we have 
considered so far are called left 4-modules. If 
we define the operation of 4 on M by xa 
(ae A, xe M) instead of ax and modify con- 
ditions (1)-(4) appropriately (in particular, 
condition (2) becomes x(ab) —(xa)b), then M is 
called a right A-module. If A? is a group or 
ring anti-isomorphic to A, then a left A- 
module can be naturally identified with a right 
A?-module. If A is a commutative group or 
ring, we can disregard the distinction between 
left and right A-modules. 

Let A and B be groups or rings. Sometimes 
we consider an A-module structure and a B- 
module structure simultaneously on the same 
module M. If the operations of A and B com- 
mute with each other, namely, a(bx) = b(ax) 
(ae A, be B, xe M), it is convenient to put one 
of the operations to the right. If M has a left 
A-module structure and a right B-module 
structure, satisfying condition (5) (ax)b — a(xb), 
then M is called an A-B-bimodule. If G is a 
group and K is a commutative ring, the G-K- 
bimodule structure is equivalent to the left 
K [G]-module structure, where K [G] is the 
tgroup ring. 


E. Operator Homomorphisms 


A homomorphism f: M — N of A-modules M 
and N such that f(ax)=af(x) (ae A, xe M) is 
called an A-homomorphism (operator homo- 
morphism or allowed homomorphism). If A is a 
ring, f is also called an A-linear mapping. 
Regarding A as an A-module, an A-linear 
mapping M — A is called a linear form on M. 
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The composite of A-homomorphisms is an A- 
homomorphism. 

Let f: M—L be an A-homomorphism of A- 
modules. The A-submodule Im f = f(M) of L is 
called the image of f, and the A-submodule 
Ker f « Íx|xe M, f(x) - 0) of M is called the 
kernel of f. Coim f — M/Ker f is called the co- 
image of f, and Coker f = L/Im f the cokernel 
of f. The binary relation xRy on M defined by 
f(x) 2 f(y) (x, ye M) coincides with the equiva- 
lence relation defined by x — ye N 2 Ker f 
(x, ye M), and the mapping f induces an A- 
isomorphism f: M/N 5 f(M). 

A sequence of A-homomorphisms of A- 
modules M, (ne Z) 

oM, aye Ma. 
is called an exact sequence if Im ts Ker f, 
for all n. The A-module {0} is denoted by 0. 
Exactness of 0— N 5 M or M $ L0 means 
that the mapping f: N — M is injective or the 
mapping g: M >L is surjective, respectively. In 
an tinductive (tprojective) system {M,, Jan} of 
A-modules, where every MM. Zei 
(47 n) is an A-homomorphism, the limit M = 
lim M; (lim M,) is also an A-module. If 0— 
L,—^ M, N,0 is exact for every 4 and 


L,7M,N, 


I. E. d 
L,2M,N, 


is a Àcommutative diagram, then 0-lim L,— 
lim M,—lim N,-0 is also exact. For the 
projective limit, however, we can only state the 
exactness of 0—lim L >lim M,—>lim N,. 

The set of all A- {-homomorphisms of an A- 
module M to an A-module N, denoted by 
Hom,(M, N), is a subgroup of the module 
Hom(M, N), and is called the module of A- 
homomorphisms. The set Hom,(M, M)- &,(M) 
of all A-endomorphisms of an A-module M 
forms a ‘subring of the ring &(M) and coin- 
cides with the set of all elements commuting 
with any ay, (ae A). We denote by GL(M) the 
group of all tinvertible elements in &,(M). If 
A is a commutative ring, Hom,(M, N) can be 
regarded as an A-module by defining (af )(x) - 
af (x), namely, af — ay o f. In particular, &,(M) 
is an 'associative algebra over A. If M is an 
A-B-bimodule, Hom,(M, N) forms a left B- 
module by (bf )(x) = f(xb). If N is an A-B- 
bimodule, Hom,(M, N) forms a right B- 
module by ( fb) (x) — f(x)b. 


F. Direct Products and Direct Sums 


In the Cartesian product P — [L;.4 M; ofa 
family {M,},-, of A-modules, we define ad- 
dition and an A-operation as follows: (x;] + 
{y,}={x,+y,}, a{x,} ={ax,}. Then P forms 


277 G 
Modules 


an A-module. We call IT;.4 M, the direct 
product of modules {M,},.,. The canonical 
projection assigning x; to {x,} is denoted by 
p,:P—>M,. Suppose that an A-module M and 
A-homomorphisms fı: M M, (Ae A) are 
given. Then there exists a unique A-homo- 
morphism f: M >P such that p; o f =f, (AeA); 
f is given by f(x) - (f (9)]. 

In the direct product TT;.4 M3, the set S of 
all elements whose components x; are equal to 
0 except for a finite number of 4 is called the 
direct sum of modules (M; ;., and is denoted 
by Daca M; (or H zea M; or ® iea M;). The 
canonical injection assigning | ...,0,x,,0, ...] 
ES to x, € M, is denoted by j;: M; S. If an 
A-module M and A-homomorphisms f;: M, 
M are given, then there exists a unique A- 
homomorphism f':S— M such that f oj; — f, 
(AeA), defined by f({x,})= 2.4 f;(x;). When 
M is an A-module and {N,},., is a family of 
A-submodules of M, the A-homomorphism 
S:Ziea N;2 M defined by f(1x;j) = Zen x, is 
an A-isomorphism if and only if M is the 
direct sum of {N,}. 

If M; 2 M for all 4€ A, Ilea M; and 
Daca M; are denoted by M^ and M“, respec- 
tively. M^ can be regarded as the set of all 
mappings of A to M. The direct product M, 
x... X M, and direct sum MO... OM. ofa 
finite number of A-modules M,,..., M,, can 
be identified with each other and, if M; - M 
(1 <i<n), we simply denote it by M". 


G. Free Modules 


Let A be a ring. A family [x;);.4 of elements 
in an A-module M is called linearly indepen- 
dent if 27,.,4a;x; —0 (a; e A) implies a, =0 for 
all AEA. This is equivalent to saying that the 
mapping A M that assigns X eaa; x;e€ M 
to {a,} is injective. A linearly independent 
family {x,},-, generating M is called a basis of 
M. A family [x;],.4 is a basis if and only if 
every element of M can be written uniquely in 
the form X,.4a;x, (a,€ A). 

An A-module that has a basis is called a free 
module over A. If A is a field (which may be 
noncommutative), every A-module is a free 
module (— 256 Linear Spaces). The fcardi- 
nality of a basis of a free module M over A 
depends only on M if A is a field (which may 
be noncommutative) or a commutative ring; 
this number is called the rank (or dimension) of 
M. Any submodule of a free module over a 
principal ideal domain is a free module. 


H. Simple Modules and Semisimple Modules 


An A-module M is called simple if M #0 and 
M has no A-submodules except M and 0. 
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If M and N are simple A-modules, any A- 
homomorphism of M to N is an isomor- 
phism or the zero homomorphism (i.e., one 
which sends every element of M to 0) (Schur's 
lemma). If an A-module M is the sum of a 
family {M,},-, of simple submodules, M is the 
direct sum of a suitable subfamily IAM, In 
(A' c A). In this case, M is called semisimple (or 
completely reducible). 

If an A-module M can be decomposed into 
the direct sum of A-submodules N and N', 
then N' is called a complementary submodule 
of N. An A-module M is semisimple if and 
only if every A-submodule of M has a comple- 
mentary submodule. Let A be a ring. Then the 
A-module A is semisimple if and only if every 
A-module is semisimple. In this case A is 
called a *semisimple ring (— 368 Rings G). 
Every simple module over a semisimple ring A 
is A-isomorphic to a *minimal left ideal of A. 


I. Chain Conditions 


The set of all A-submodules of an A-module 
M forms an *ordered set under the inclusion 
relation. An A-module is called a Noetherian 
module if the ordered set satisfies the tmaximal 
condition and an Artinian module if it satisfies 
the tminimal condition (— 311 Ordering C). 

Let N be an A-submodule of an A-module 
M. Then M is Noetherian (Artinian) if and 
only if N and M/N are both Noetherian (Ar- 
tinian). A ring A is called a tleft Noetherian 
ring (‘left Artinian ring) if A is Noetherian 
(Artinian) as a left A-module, and similarly for 
right Noetherian and Artinian rings. Every 
finitely generated module over a Noetherian 
(Artinian) ring is Noetherian (Artinian). Over 
an arbitrary ring A, a module M is Noetherian 
if and only if every A-submodule of M is fi- 
nitely generated. 

A finite sequence {M;}o<i<, of A-submodules 
of an A-module M is called a tJordan-Hélder 
sequence if M = Mo, M;> M,,,, M, — (0), and 
the M//M;,, (0 €i«r) are simple. If such a 
sequence exists, M is said to be of finite length. 
The number r, called the length of M, depends 
only on M. The quotient modules M;/M,,, 
(0x i « r) are uniquely determined by M up 
to A-isomorphism and permutation of the 
indices (C. Jordan and O. Hólder). An A- 
module M is of finite length if and only if M is 
Noetherian and Artinian. A semisimple A- 
module is of finite length if and only if it is 
finitely generated. 

An A-module M is called indecomposable if 
M cannot be decomposed into the direct sum 
of two A-submodules different from M and 
{0}. Any A-module of finite length can be 
decomposed into the direct sum of a finite 
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sequence N,, ..., N, of indecomposable A- 
submodules different from {0}. The direct 
summands N; (1 «i n) are unique up to A- 
isomorphism and permutation of the indices 
(W. Krull, R. Remak, and O. Schmidt). 


J. Tensor Products 


Let A be a ring. Given a right A-module M 
and a left A-module N, we construct a module 
M GN (called the tensor product of M and N) 
and a canonical mapping o: M x NOM Q, N 
as follows. Let F be a free Z-module (free 
Abelian additive group) generated by M x 

N, and R be the subgroup generated by the 
elements of the forms (x + x’, y) ^ (x, y) 

(x^, y), (x, y+ y) - (x, y) — (x, y’), (xa, y) — (x, ay) 
(x, x*eM, y, y eN, ae A). We define MG, N 

= F/R, and call the natural projection qo. If we 
denote (x, y) by x & y, then we have (x, + 
x) yox, @V+X, @y, xQ (y1 +y) = 

x® yı +X & yz, and (xa) y 2x @ (ay). Any 
element of M &,N is written in the form 
Xx; Q y; (x;e M, y;e N). 

The tensor product Me, N of M and N 
and the canonical mapping 9: M x N > 
M &,N can be characterized as follows: 

For a module L, a mapping f: M x NL is 
called biadditive if the conditions f(x 4- x', y) 

— fs yy f(x^ y), fos y - y) 9 fos y) - foo y) 
hold. A biadditive mapping f satisfying the 
condition f(xa, y) ^ f(x, ay) is called an A- 
balanced mapping. Then we have (i) the canon- 
ical mapping o9: M x NOM G,N is A- 
balanced; and (ii) for any module L and any A- 
balanced mapping f: M x N-L, there exists a 
unique homomorphism f,:M Ge, N>L such 
that f(x, y)= f, (x & y) (xe M, ye N). 

A right (left) A-module can be regarded as a 
left (right) A°-module, where A? is the ring 
anti-isomorphic to A. In this sense, we have 
MQOINSN QM. 

Let A be a commutative ring. For A- 
modules M, N, and L, a mapping f:M x N> 
L is called a bilinear mapping if f is biaddi- 
tive and satisfies f(ax, y) 2 f(x, ay) ^ af(x, y) 
(ae A, xeM, ye N). The set (M, N; L) of 
all bilinear mappings M x N >L forms A- 
submodule of the A-module LM *". A bilinear 
mapping M x N >A is called a bilinear form 
on M x N. The tensor product MG, N be- 
comes an A-module if we define a(x & y) — 
(ax) & y (=x @ (ay)), and the canonical map- 
ping Mx NOM ©, N is bilinear. For any A- 
module L and bilinear mapping f: M x N> 
L, there exists a unique A-linear mapping 
f,:M @,N-L satisfying f(x, y) — f(x G y). 
By this correspondence fe» f,, we get an A- 
isomorphism £(M, N; L)z Hom,(M &,N,L). 
If A is a field, M ©, N coincides with the ten- 
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sor product M @ N as a linear space (— 256 
Linear Spaces H, I). 

In general, let M be a B-A-bimodule and N 
be a left A-module. Then M @, N becomes a 
left B-module if we define b(x & y)=(bx) G9 y. 
Let N be an A-B-bimodule and M be a right 
A-module. Then M @, N becomes a right B- 
module if we define (x & y)b =x & (yb). In 
particular, we have A@,N2=N,M@,A=M. 

Let M, M' be right A-modules and N, N' be 
left A-modules. For A-homomorphisms f: 
M—M'and g: NN’, there exists a unique 
homomorphism h: M &, NM, N' satisfy- 
ing h(x & y)= f(x) & g(y); h is called the tensor 
product of f, g and is denoted by f Q g. We 
give here some simple examples (also — Sec- 
tion L). 

Examples. (1) Let M, N be free modules 
(linear spaces, for example) over a commuta- 
tive ring A. If {x,};., and {y;}jej are bases 
of M and N, respectively, M ©, N is also a 
free module with a basis {x;@ Ylisen, jes. 

If the dimensions dim M, dim N are finite, 
dim M &, N «dim M dim N. 

(2) For an tideal a of a commutative ring A, 
the tfactor ring M — A/a can be regarded as an 
A-module, and we have M &,N z N/aN. For 
instance, (Z/mZ) ® z(Z/nZ) = Z/(m, n) Z, where 
(m, n) denotes the greatest common divisor of 
m and n. 


K. Hom and © 


We continue to consider modules over a ring 
A. Concerning the direct sum and product, we 
have 


Hom, (5 MIT.) = HI Hom,(M,, N,) 
D Wi 


and 


E m) Oa (5 x, = Y (M,@,N,). 


Au 


Concerning projective and inductive limits we 
have 


Hom, (um M,, lim x, = lim Hom,(M,, N,) 
and 


(in m) e, (im x, -lim(M; &, N,). 


An A-homomorphism f: M 9 M' induces a 
homomorphism Hom,(M', N)-5Hom,(M, N) 
by the assignment g go f. An exact sequence 
M'—5 MM" >Q gives rise to the exact 
sequence 


05 Hom,(M", N) 5 Hom,(M, N) 
5Hom,(M', N). (1) 
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An A-homomorphism f: N  N' induces a 
homomorphism Hom,(M, N)—5 Hom,(M, N’) 
by the assignments gf og, and an exact 
sequence 0 N' 5 N — N” gives rise to the exact 
sequence 


05 Homj4(M, N')-5Hom,(M, N) 
5 Hom,(M, N”). (2) 


Let M bea right A-module and N', N, N” | 
be left A-modules. An A-homomorphism f: 
NN’ induces the homomorphism ly @ f: 

M GQ,N-M G,N', and an exact sequence 
N'N-N"—0O gives rise to the exact sequence 


MQN’: >MQ, N>M Q, N">0. (3) 


Exchanging left and right, we obtain similar 
results (— 52 Categories and Functors B; 200 
Homological Algebra). 

Let Q be an A-module. If for any exact 
sequence of A-modules 


02M'5SMoM"0, (4) 
the induced sequence 
05 Hom (MT Q) ^K Hom,(M, Q) 

>Hom,(M’, Q)30 (5) 


is exact, then Q is called an injective A-module. 
This is equivalent to the condition that if M' is 
an A-submodule of an A-module M, then any 
A-homomorphism M'— Q can be extended to 
an A-homomorphism M >Q. Any A-module is 
an A-submodule of some injective A-module, 
and any injective A-module is a divisible A- 
module. If A is a Dedekind domain, any 
divisible A-module is an injective A-module. 

Let P be an A-module. If for any exact 
sequence (4), the induced sequence 


05 Hom,(P, M’)> Hom, (P, M) 
25 Hom,(P, M")50 (6) 


is exact, then P is called a projective A-module. 
This is equivalent to the condition that for any 
surjective A-homomorphism g: M M" and 
any A-homomorphism f: P M", there exists 
an A-homomorphism A: P M satisfying goh 
— f. Any A-module is a factor A-module of 
some projective A-module. A projective A- 
module has no torsion element. A free A- 
module is a projective A-module. In general, 
an A-module is a projective A-module if and 
only if it is a direct summand of a free A- 
module. 

Let R be a right A-module, If for any exact 
sequence (4), the induced sequence 


0—BRGOG,M'—-SRGOG,MoRG,M"-0 (7) 


is exact, then R is called a flat A-module. Any 
projective A-module is a flat A-module. A flat 
A-module R is called faithfully flat if Roi, M 
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= {0} implies M = (0j. A flat right A-module R 
is faithfully flat if and only if R# RY for any 
left ideal d (+ A) of A. Let A be a *principal 
ideal domain. Then an A-module R is flat if 
and only if R has no torsion element, and R is 
faithfully flat if and only if R has no torsion 
element and R z Rp for any ‘prime element 

p of A. We have the following important 
examples: 

(1) For a commutative ring A and its multi- 
plicatively closed subset S, the tring of quo- 
tients A, is flat as an A-module. However, As 
is not faithfully flat. For instance, the field of 
rational numbers Q is not faithfully flat as a 
Z-module. 

(2) Let A be a fsemilocal ring and A be its 
completion. Then A is faithfully flat as an A- 
module (— 284 Noetherian Rings; also [1,7]). 

In the exact sequence (4), if Im o = Ker y isa 
direct summand of the A-module M, we say 
that (4) splits. Then (5), (6), and (7) are exact for 
any A-modules Q, P, R. The exact sequence (4) 
splits if M' is injective or M" is projective. 

By ,M, M,, and Ms, we mean that M isa 
left A-module, a right A-module, and an A-B- 
bimodule, respectively. As already stated, ¿Mpg 
and ,N imply ,(Hom,(M, N)), and ,M and 
aNg imply (Hom,(M, Nils, Similarly pM, and 
N, imply (Hom,(M, N)),, and M, and eh, 
imply ,(Hom,(M, N)). Furthermore, „M, and 
aN imply (M @,N), and M, and Ne imply 
(M G4 N). 

With 4L 4, 4M, and 4N, we have 


Hom,(M,Hom,(L, N))=Hom,(L @, M,N). 
(8) 


Similarly, for Ma, La, and Na, we have 


Hom, (L, Hom,(M, N)) z Homs(L @, M, N). 
i (8) 


If B is a commutative ring, (8) and (8) are B- 
isomorphisms. Furthermore, with L4, Me, 
and N, we have 


(LG,M)G&INZxLOG (M gN). (9) 


We denote by M* the set Hom,(M, A) of all 
linear forms on an A-module M. Then A 
implies M*, and M, implies ,M*; the A- 
module M* is called the dual module of M. A 
as a left A-module is dual to A as a right A- 
module, and vice versa. For a family of A- 
modules {M,},-,, we have a canonical corre- 
spondence ($ ea M;)* x IL;.A M1. From this, 
we have a canonical isomorphism (M*)* = M 
for any finitely generated projective A-module 
M. Many facts concerning this *duality are 
similar to those valid for linear spaces (— 256 
Linear Spaces G). 

Let A be a commutative ring. Letting A= 
B=N in (8) and (8), we have the canonical A- 
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isomorphisms 
Hom,(M, L*)  Homj4(L, M*) 
z(LG,M)* OU. M; A), 


namely, any bilinear form on L x M is repre- 
sented by a linear mapping M >L* or L> M*. 


L. Extension and Restriction of a Basic Ring 


Fix a ring homomorphism p: A B. We regard 
B as a B- A-bimodule by defining a right oper- 
ation of A on B by b-a=bp(a) (ae A, be B). 
This bimodule is denoted by B,. 

For every left 4-module M, we construct the 
left B-module p*(M)= B, &, M, which is 
called the scalar extension of M by p. Every 
A-homomorphism of A-modules f: M 
M'induces the B-homomorphism p*( f) — 

15 f:p*(M)» p*(M?). 

For every left B-module N, we construct the 
left A-module p, (N) - Home, N), which is 
called the scalar restriction (or scalar change) 
of N by p. By the assignment (1), we have 
a module isomorphism Hom,(B,, N) — N. We 
identify p, (N) with N under this isomorphism. 
The operation of A on N is then given as a: y 
= p(a)y (ae A, ye N). If A is a subring of B and 
p is the canonical injection, then the opera- 
tion of A on p, (N) is the restriction of the 
operation of B on N. Every B-homomor- 
phism of B-modules f: N — N' induces the A- 
homomorphism p,( f ): p, (N) P(N’). For 
any left 4-module M and left B-module N, an 
A-linear mapping f: M p,(N)— N is called a 
semilinear mapping with respect to p. This 
means that f is an additive homomorphism 
satisfying f(ax)— p(a)f(x) (ae A, x e M). 

The extension and the restriction of a basic 
ring are related by the canonical isomorphism 
Hom,(M, p, (N)) z Homgs(p*(M), N) for an A- 
module M and a B-module N (— equation 
(GU. An element « of the left-hand side and an 
element f of the right-hand side are associated 
by the relation a(x)— B(1G x) (x«eM)(— 52 
Categories and Functors). 

Let A and B be commutative rings. Then for 
A-modules M and M', we have the canonical 
B-linear mapping p*: Bei, Hom,(M, M’) 
5Homj4(B 6&9, M, Bo, M^), which is a B- 
isomorphism 1f M is a finitely generated 
free (or more generally, projective) A- 
module. Using the notation p*, we have 
p*(Hom,(M, M’))= Hom,(p*(M), p*(M’)). 

We now give some examples where the basic 
rings are noncommutative. Let G be a group 
and H its subgroup. Let p denote the homo- 
morphism of group rings K[H]> K[G] in- 
duced by the canonical injection H — G, where 
K is a commutative ring. For any K[H]- 
module M, p*(M) is called the induced module 
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of M. The representation of G associated with 
p*(M) is the tinduced representation of the 
representation of H associated with M. Next, 
we fix a group G and consider a homomor- 
phism p: K[G]— K [G] induced by a homo- 
morphism of commutative rings o: KK. If 
K =K and o is an automorphism, then the 
representation associated with the "scalar ex- 
tension" p*(M) of a K[G]-module M is the 
*conjugate representation to the representa- 
tion associated with M. If K = K/N (Ql is an 
ideal of K) and o is the canonical projection, 
then the representation over K associated 
with the scalar extension p*(M) of a K[G]- 
module M is the reduction modulo Y of the 
representation over K associated with M, and 
p*(M) is canonically isomorphic to M/9IM. 
Furthermore, the ‘localization and the tcom- 
pletion can also be treated under the formula- 
tion of scalar extension (— 67 Commutative 
Rings G, 284 Noetherian Rings B). 
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278 (XIII.23) 
Monge-Ampere Equations 


A. Monge-Ampére Equations 


A Monge-Ampeére differential equation is a 
second-order partial differential equation of 
the form 


Hr+2Ks+Lt+M + N(rt—s?)=0, (1) 
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where H, K, L, M, and N are functions of x, y, 
z, p, and q, and r, s, t, p, and q represent the 
partial derivatives 


Oz Qz oz 

= = s= ——, ——2,. 

Ox? ESCH ay? 
| oz _ Oz 
Pex’? ay 


The characteristic manifolds are integrals of 
a system of differential equations defined as 
follows: 

Case (1) N 40. 


Ndp--Ldx- 4,dy-0, 
Ndqt+4,dx+Hdy=0, 
dz—pdx—qdy=0, (2) 
Ndp+Ldx+4,dy=0, 
Ndq+A,dx+Hdy=0, 
dz—pdx—qdy=0, (3) 


where 4, and 4, are the two roots of the 
equation 4? --2KA44- HL— MN —Q. 
Case (ii) N 20, H #0. 


dy—À,dx, HdpctHA,dq4 Mdx-0, 
dz —pdx —qdy —0, (4) 
dy-À,dx, Hdp+Hi,dqg+Mdx=0, 
dz -pdx —qdy—0, (5) 


where 4, and å, are the two roots of the 
equation H4? -2KA- L0. 
Case (iii) N «0, H=0, Lat 


dx=0, Mdy+2Kdp+4+Ldq=0, 
dz—pdx—qdy=0, (6) 
2K dy—Ldx=0, Mdy-cLdq-0, 

dz— pdx —qdy=0. (7) 


Case (iv) N =0, H =0, L=0. 


dx=0, 2Kdp+Mdy=0, 
dz —pdx —qdy- 0, (8) 
dy=0, 2Kdq+Mdx=0, 
dz—pdx—qdy=0. (9) 


A manifold x(A), y(A), z(4), p(A), q(A) that 
satisfies the system (2), (3) of differential equa- 
tions for case (i), (4), (5) for case (ii), (6), (7) for 
case (iii), or (8), (9) for case (iv) is a character- 
istic manifold of equation (1). 

The following result is known concerning 
Monge-Ampere equations: The union of sur- 
face elements of an integral surface of (1) is 
generated in two ways by characteristic mani- 
folds depending on one parameter. Conversely, 
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if a manifold, each element of which is com- 
posed of a point of a surface S and the tangent 
plane at that point, is generated by a family of 
characteristic manifolds depending on one 
parameter, then the surface S is an integral 
surface of (1). 


B. Intermediate Integrals 


If a relation dV(x, y, z, p, q) - 0 is a consequence 
of a system of differential equations of charac- 

teristic manifolds, for example, in case (1) when 
V satisfies 


N(QV/Ox + poV/óz) — LOV/Op — A,0V/0q —0, 
N(6V/dy + q0V[02) — 4,0V/0p — HOV/0q —0, 


V(x, y, z, p, q) - c (c an arbitrary constant) is 
called an integral of the system of differential 
equations. (i) If V — c is an integral of a system 
of differential equations of characteristic mani- 
folds, the solution z(x, y) of V=c considered 
anew as a partial differential equation of the 
first order is a solution of (1). Conversely, if 
every solution of V 2c (excepting tsingular 
ones) satisfies equation (1), V=c is an integral 
of a system of differential equations of char- 
acteristic manifolds. (11) If u(x, y, z, p, q) and 
v(x, y, z, p,q) are two integrals of a system of 
differential equations of characteristic mani- 
folds, then for any arbitrary function o, (u, v) 
is also an integral. Thus, as a consequence of 
(i), every solution z(x, y) of o(u, v) 20 is also 

a solution of (1). The converse is also true, 
namely, for every solution z of (1) we can find 
a function o such that z(x, y) is also a solution 
of o(u, v) 2 O0. The relation o(u, v) 20 is called 
an intermediate integral of (1). Sometimes an 
integral of a system of differential equations of 
characteristic manifolds is also called an inter- 
mediate integral. If each of the two systems of 
differential equations defining the character- 
istic manifolds has an intermediate integral, 
then the two intermediate integrals @(u, v) 20 
and y/(u, v) «0 form a tcomplete system of 
partial differential equations of the first order. 
Integrating this complete system, the tgeneral 
solution of equation (1) is obtained. 
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Morse Theory 


A. General Remarks 


We are interested in smooth functions on 

a smooth manifold of dimension m that 

have only the simplest critical points. Here, 
“smooth” means differentiable of class C*. 
Such a function f: M —R enables us to investi- 
gate the topology of M. The decomposition of 
M into level sets of f contains a lot of topo- 
logical information on M. For instance, it is 
shown that M is thomotopy equivalent to a 
*CW complex that is determined by critical 
points of f; furthermore the *Euler character- 
istic of M can be computed by means of f, 
because the critical points are related to the 
*homology groups of M. These types of inves- 
tigations, called Morse theory, were originated 
by H. Poincaré [1] and G. D. Birkhoff [2] and 
then developed into the form we see today by 
M. Morse [3]. An excellent exposition of this 
theory has been given by J. Milnor [4]. R. S. 
Palais [5] has extended the theory to Hilbert 
manifolds. Morse theory has been fruitfully 
applied to differential topology and differential 
geometry. 


B. Critical Points of Functions on Manifolds 


For a point pe M, M, is by definition the 
tangent space to M at p. Let f be a smooth 
real-valued function on M. A point pe M is 
called a critical point of f if the induced map- 
ping f: Mp >R y, is zero at p. For every local 
chart (x,,...,X,,) around a critical point p of f 
with p —(0, ...,0), 

of of 


2m )=...=5—-()=0. 





m 


The value f(p) is then called a critical value of 
f If the matrix (0? (ës, Gell is invertible, 
then the point is called a nondegenerate critical 
point, and if the matrix is not invertible, then 
the point is said to be degenerate. These no- 
tions are independent of the choice of local 
charts around p. The above matrix is called 
the Hessian of f at p. The nullity and index of 
the Hessian of f at p are called the nullity and 
index of f at p, respectively. The function f has 
a local minimum at a nondegenerate critical 
point of index 0, and a local maximum at a 
nondegenerate critical point of index n. 

A smooth function f on M is called a Morse 
function if it satisfies: (A1) Every critical point 
of f is nondegenerate. The Morse lemma states 
that if p is a nondegenerate critical point of f 
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and if the index of f at p is å, then there exists 
a local chart (y,, ..., Ym) around p with p= 
(0, ..., O) such that f is expressed as 


fi) 9 fp - Gi — ...— Qi + Vans)? 4... 


+ (Vm). 


C. The Existence of Morse Functions 


Let M be a compact smooth m-dimensional 
manifold without boundary. The existence of 
Morse functions on M is shown by tembed- 
ding M into R" for a sufficiently large n. For 
convenience, M will be here identified with its 
embedded image. At each point pe M the set 
vp of all unit normals forms an (n —m — 1)- 
dimensional unit sphere, and v(M) — | Jj. v, 
is a smooth (n — 1)-dimensional compact 
manifold. Let S be the unit hypersphere in R" 
around the origin. The Gauss normal mapping 
Q9: v(M)-S sends each point (p,v(p)) e v(M) to 
veS via a canonical parallel translation of R”. 
Since o is smooth, the tSard theorem implies 
that the set E of all critical values of o has 
measure zero on S. Thus o, at every (p,v(p))e 
o (vr) has maximal rank if v is taken in S — E. 
For every fixed ve S — E let h,: M >R be de- 
fined by h,(x) 2 (o, x», xe M, where < , > de- 
notes the canonical inner product of R". Then 
h, is a Morse function. To see this let dX, dv, 
and dM be volume elements of S, v, and M, re- 
spectively. Then dvAdM is a volume element of 
v(M), and the function G: v(M)9R is obtained 
by the relation p*d = G(x,v(x))dvAdM. 

G(x, v(x)) is called the Lipschitz-Killing curva- 
ture at v(x)e v,. If A(v(x)) denotes the 'second 
fundamental form of M with respect to v(x), 
then G(x, v(x)) =(—1)"det(A(v(x))) (Chern and 
Lashof [6]). Every critical point pe M of h, 
for veS — E has a normal v(p) to M at p, and 
the Hessian of h, is given by A(v(p)). Thus 
G(p, v(p)) #0 ensures the nondegeneracy at p 
[6]. 

Another approach to the construction of 
Morse functions is to find for a given embed- 
ding of M into R” an open dense set Uc R” 
such that the Euclidean distance d, from any 
fixed point x on U to points on M has no 
degenerate critical points on M. In this case 
d, ((—9o0,a]) is compact for all ae R. It is also 
possible to construct Morse functions on 
compact manifolds with boundary as well as 
on noncompact manifolds. But a striking 
result of A. Phillips states that there exists a 
Morse function on every noncompact mani- 
fold which has no critical points [7]. 

For a pair of smooth manifolds M, N let 
C? (M, N) be the set of all smooth mappings 
from M to N. The weak (or compact-open) C^ 
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topology on C^(M, N) is generated by the sets 
defined as follows. Let feC*(M, N), and let 
(U, o) and (V, Y) be charts of M and N, respec- 
tively. Let K c U be a compact set such that 
f(K)c V. For re, œ), a weak !subbasic 
neighborhood N^(f:;(U, p), (V, V), K, £) of f 

is defined to be the set of all smooth map- 
pings ge C^ (M, N) such that (1) g(K)c V, Q) 
ID*(pofoy ') G) -D*(eogov "ll «s. 
xeo(K), k — 1,2, ..., where D* denotes the Ath 
derivative with respect to the coordinates. 
The weak C* topology is thus defined on 
C*(M, N), and the space with this topology is 
denoted by CZ (M, N). It follows from the 
construction of the embedding in the Whitney 
embedding theorem that the set of all smooth 
embeddings of compact M into R" is dense in 
C? (M, R") if n7 2dim M. Moreover, the set of 
all Morse functions on a compact M forms 

an open dense set in C7 (M, R) (Nagano [8], 
Hirsh [9], Auslander and MacKenzie [10]). 


D. Decomposition of M by a Morse Function 


To decompose M into levels of a Morse func- 
tion we now consider the process of tattaching 
a handle. Let M be a compact manifold with 
*boundary ôM. Let Dr be the s-disk, and let 

g :(0D*) x D™ "aM be an tembedding. Then a 
tmanifold X(M;g;s) with a handle attached by 
g is defined as the quotient set obtained from 
the disjoint union M U(D? x D" *) by identify- 
ing points in QD x Dr "and their images 
under g and equipped with a natural differenti- 
able structure. Similarly, if g;: (0D?) x D “*—> 
OM (i— L, ..., K) are embeddings with disjoint 
images, we can define the thandlebody X (M; 
Prion Ups Leva ELL, 

For an f eC*(M, R) and for a, be f(M) with 
a<b let M@={xeM|f(x)<a} and M; 
{xeM|a<f(x)<b}. M? is called a level set 
of f. For a Morse function f on a compact 
manifold M without boundary, the following 
fundamental facts are known [4]: (1) The first 
fundamental theorem. If M? contains no crit- 
ical points, then M? is diffeomorphic to M7 x 
[a, b]; (2) The second fundamental theorem. 
Let c€(a, b) be a unique critical value in [a, b], 
and let p,, ..., p € M: be critical points of f 
with p; having index A. Then M" is diffeo- 
morphic to X(M5; fi, BE fgs Aqui c AL FOT 
suitable embeddings f1, ..., Jp: It follows from 
the existence of Morse functions and from (1) 
and (2) that every compact manifold can be 
obtained by successively attaching handles to 
a disk (— 114 Differential Topology). Fur- 
thermore, if M admits a Morse function with 
only two critical points, then M is homeo- 
morphic to a sphere (Reeb [12]). 

Concerning the homotopy types of M and 
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M" we have the following: (3) If ce f(M) is a 
critical value of f with points p4, ..., p, with p; 
having index 4; and if M77 is compact for £2 0 
and contains no critical points other than 

pi; s yo then M‘** has the same homotopy 
type as M^ Uet U... Ue^, where e» (j= 
1,...,k) are 4;-cells. (4) Assume that a Morse 
function f on a (not necessarily compact) M 
satisfies: (A2) Mi is compact for all ach, Then 
by choosing a sequence d, «4; <... of regular 
values of f and applying (3) to each MT, we 
see that M has the same homotopy type as 
that of a CW complex C, where the number of 
À-cells belonging to ( is the same as that of 
critical points of index 4 of f. 

Moreover, we have the following Morse 
inequalities: (5) Let f be a Morse function on a 
compact M. If M, is the number of critical 
points of f on M of index 4, and if Riisa å- 
dimensional *Betti number of M, then 


M,—MoZ R,- Ro, 
M,—M;. +... +(—1) Mo 


zb, Rot +(—1 Ro 1<4<n-l1, 


M,—M,-; +... +(—1)"Mo 
cR eR cu qq Ra 


In particular, we have M, > R, for all k. By 
using these facts S. Smale obtained an affirma- 
tive solution of the tPoincaré conjecture in 
high dimensions [11, 13]. 

The concept of critical manifolds in the 
sense of Bott [14] is stated as follows: Let M 
be a compact manifold embedded into an 
open set U c R*. Let f: UR be a smooth 
function. M is called a nondegenerate critical 
manifold of f on U if (1) all points of M are 
critical points of f and (2) the nullity of all 
x€ M is equal to dim M. When M is such a 
nondegenerate critical manifold of f on U, f is 
constant on M, and the index A of f at x is 
well defined and is the same at all points on 
M. The *Poincaré polynomial is expressed as 
P(M;t) - X.t*dim H*(M). The Morse poly- 
nomial is defined by 9( f; t) - X t^" P(N; t), 
where N runs over all critical manifolds of f 
and Ay is the index of N. Under certain con- 
ditions for orientability along each nondegen- 
erate critical manifold of f, we have again the 
Morse inequality 


M( fst) > P(M; 0). 


E. Morse Theory on Hilbert Manifolds 


Let M be a "Hilbert manifold, and let f: MR 
be a smooth function. The Hessian €? f, of f at 
a critical point of f is a symmetric bilinear 


1051 


form on M, given by 
0? fu, v) d*(foo ')(do,(u), do,(v)), 
u, vE Mp, 


where ¢ is a coordinate mapping of a local 
chart around p. The index (coindex) of f at a 
critical point pe M of f is defined to be the 
supremum of dimensions of subspaces of 
M, on which 0? f, is negative (positive) defi- 
nite. The self-adjoint bounded operator A 
which represents 6? f, is given by 2? f, (uv) = 
<u, A(v)>,. If A is invertible, then f is said to 
be nondegenerate at p. These notions do not 
depend on the choice of local charts around p. 
The Morse lemma has been generalized to 
a Riemannian Hilbert manifold M by R. 
Palais and S. Smale [15] as follows. Let f be a 
smooth function defined on M. If xe M isa 
nondegenerate critical point of f then there 
exist a chart o around x and a projection P 
such that for y near x 


ft) fe) Hetz)? — I — Poy). 


The above result has been extended to critical 
manifolds [16]. A connected submanifold N of 
a Hilbert manifold M is called a nondegenerate 
critical manifold of f: M >R if (1) every point 
peM is a critical point of f and (2) for each 
p€ M there exists a closed subspace E, of M, 
such that M, = N, ® E, and the Hessian form 
restricted to E, is nondegenerate. Let v — 

(x, E, N) be a smooth Hilbert-space bundle 
over a compact connected manifold N, and let 
( , > be a Riemannian structure for v. Assume 
that a smooth function f: ER has the zero 
section of v as a nondegenerate critical mani- 
fold. Then there exist a tubular neighborhood 
v, tee E| ||v|| «ej of the zero section in v and 
a fiber-preserving diffeomorphism y: v, y(v,) 
and an orthogonal bundle projection P such 
that for Gen, 


fow(v)=f(N)+ | Poll? — ll — Pei", 


Let us assume that (B) M is a complete 
Riemannian manifold, and (C) (Palais-Smale 
condition) if f is bounded on a set S c M and if 
the norm ||Vf || of the tgradient vector of f 
has infimum 0 on S, then there is a critical 
point of f on the closure S of S. Since M is not 
locally compact, condition (C) is required. In 
order to prove the first and second funda- 
mental theorems of Morse theory it is neces- 
sary that the integral curves of Vf exist, and 
(C) ensures their existence. Namely, under 
conditions (A1), (B), and (C), one of the follow- 
ing two facts follows for any be f(M) and for 
any regular point pe M”: (1) The integral curve 
c: [0, r] 5 M* of Vf with c(0) — p exists, and 
foc(r)=b holds for some re[0, oo); (2) the 
integral curve c: IO. oo) — M? of Vf exists and 
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lim, „œ c(t) exists such that the limit point c(oc) 
— lim, „œ c(t)e M? is a critical point of f. These 
facts imply that the first fundamental theorem 
of Morse theory holds if M? does not contain 
any critical point of f, and also the second 
fundamental theorem holds if M? has only 
nondegenerate critical points on M; for some 
c €(a, b). Furthermore, assume that f: M >R 
and M satisfies (A1), (B), and (C), and let a, 

be f(M) be regular values with a « b. If for 
each nonnegative integer å, R, is the Ath Betti 
number of M?, and if M, is the number of 
critical points of index 4 of f in MP, then the 
Morse inequality holds: 


Mo Ro, 
M, —M,ZR,-—R,, 


m 


È (DM, > È (CIR, 


GET EE 


In particular, M, 2 R, holds for all / If f is 
bounded below, then the Ath Betti number R* 
of M^ and the number A? of all critical points 
of f with index 4 in M? satisfy M* > RI for 

all 2. 


F. Morse Theory of Path Spaces 


Morse theory on Hilbert manifolds applies to 
the energy functions on Riemannian Hilbert 
manifolds which consist of all H,-curves on a 
compact Riemannian manifold M, and the 
theory is useful for proving the existence of 
closed geodesics on M. 

Let M be a smooth manifold, and let I = 
[0,1]. Let Hu. M) be the set of all continuous 
curves og 1 M such that for each local chart 
(V, o) of M, goa is absolutely continuous and 
(te oof 1s locally square integrable. In par- 
ticular, if M =R”, then H, (I, R”) is a Hilbert 
space with the inner product 


(e, p), :— (o(0), p(0)) + | (c(t), p(t)) dt, 


I 
c, p€ H,(I, R”). 


For each oe H (I, M), set HU, M) ={X € 
HU. TM)| X(t)e Mam, te I], where TM is by 
definition the *tangent bundle over M. For any 
fixed pair of points p, qe M, let Q(M; p, q)= 
{oe H,U, M)|o (0) 2 p.o(1)— qj, and for each 

c €O(M; p,q), let O(M; p, q), - (X € H, (I, M), | 
X(0)=0EM,, X(1) 20e M,j. Then H,(I, M), 
forms a vector space, and O(M; p, q), 1s a sub- 
space of H,(I, M),. M can be embedded into 
R" for sufficiently large n by the Whitney 
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embedding theorem [17]. Then we have 
the following facts [17]: (1) HU, M)= foe 
A, (1, R")|o(1)< Mj, (2) H, (I, M) is a closed 
submanifold of H, (I, Rn (3) Q(M;p,q) isa 
closed submanifold of H, (I, M), (4) the tangent 
space to H,(I, M) at each point oe H,(I, M) is 
HO M),, (5) the tangent space to O(M; p,q) at 
each point c € OQ(M; p, q) is O(M; pat, Here 
M is identified with the embedded image in 
R”. Thus H,(I, M) and O(M; p, q) carry the 
structure of a Hilbert manifold, and they are 
independent of the choice of embeddings [17]. 
The energy functions on HU. M) and 
Q(M; p,q) play important roles in the develop- 
ment of Morse theory. M is now assumed to 
be a complete Riemannian manifold. It follows 
from the Nash isometric embedding theorem 
(— 365 Riemannian Submanifolds B) that 
M is isometrically embedded into R" for a 
sufficiently large n’. H,(I, R") carries a com- 
plete Riemannian metric in a natural way, 
and H,(I, M) becomes a complete Riemann- 
ian manifold with the metric induced from 
H,(I, RN). Similarly O(M; p, q) admits a com- 
plete Riemannian metric. Then the energy 
function E: H,(I, M)R is defined to be 


E(o):— 2 (c, 0*5 dt, 


where < , > is the inner product induced from 
the Riemannian structure of M. Then the 
Palais-Smale condition (C) is satisfied for E 
on H,(I, M). That is, if a sequence {0,} on 
H, (I, M) satisfies: (1) {E(o,)} is bounded above; 
and (2) VE(o,) 50 as k— oo, then there is a 
subsequence 1o, of {op} such that Io con- 
verges to a critical point a € H,(I, M) of E. 
Also, condition (C) is fulfilled for the energy 
function on Q(M; p,q). 

For a fixed pair of points p, qe M, let Q = 
(M, p,q). Let c€Q be a critical point of E. 
A tangent vector W to Q at o: 1— M with c(0) 
=p, o(1)=q is a tpiecewise differentiable vec- 
tor field along o such that W(0)=OeM, and 
W(1)=OeM,,. A proper variation &:(— e, £) x I 
>M along o which is associated with W satis- 
fies the following: (1) at. t) 2 a(t), te I; (2) there 
exists a finite partition 02 to «t, «...«t,-1 
of I such that «|(— 6, €) x [t;.,, t;] is differenti- 
able for all i= 1, ...,x; (3) «(u,0) 2 p and a(u, 1) 
— q hold for all ue(— &, et and (4) éa(0, t)/ðu 
= W(t), tel. It follows from the first variation 
formula (— 178 Geodesics A) that ceQ is a 
critical point of E if and only if ø is a geodesic 
on M. Let W,, W,€Q,, and let U be an open 
set in R? around the origin. Then a proper 
variation x: U x I— M along ø that is asso- 
ciated with W, and W, satisfies the follow- 
ing: (1) «(0,0, 0 — c(t), te I; (2) there exists a 
finite partition 0— t «t, « ... «t, = 1 of I such 
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that «| U x [t;_,,t,] is differentiable for all i= 
1, ...,k; (3) a(u,,u,, 0 — p, a(u,,u,, 1) q for 
all (u,,u5)e U; and (4) datt, 0, t)/0u, = W, (t), 
6a(0, 0, t)/0u, = W,(t), te I. Then the Hessian 
E,, of Eat o is given by 

1 e^ E(x(u,, u;)) 


kx MA. Wl: 


E 
2 ĝu ĝu, 


5 


(0,0) 





where x(u,,u5)e€ is by definition the curve 
&(u,,u5)(t) - a(u,,u5, t). The second variation 
formula (— 178 Geodesics A) then gives 


E, (0, Wl -Y O40, Ai) 
- | w WEAR oec 


where A, W/(t) = W (t + 0) — HU —0). We have 
the Morse index theorem, which states: (1) The 
tnull space of E at a critical point ø is the 
linear space spanned by all Jacobi fields (— 
178 Geodesics A) along c that vanish at 0 

and 1; (2) if (o(s,),o(s5), ....0(s)] (0s, «s; 

« ... «s, « 1) is the set of all points conjugate 
to c(0) along c and if 4; is the multiplicity of 
the conjugate point o(s;), then the index of E at 
g is equal to 4, +... +å. It follows from the 
Sard theorem together with the differentia- 
bility of the exponential mapping on M that 
except for a set of measure zero in M x M, p, q 
can be chosen so that p, q is not a conjugate 
pair along any geodesic in Q. Then all critical 
points of E are nondegenerate, and for any 
c» 0, = {weEQ| E(w)=c} contains at most 
finitely many nondegenerate critical points 
with finite indices. 


G. Existence of Closed Geodesics 


Let M be a compact Riemannian manifold. By 
replacing I with the circle S!, we consider a 
Hilbert manifold A(M)= H,(S!, M). A(M) 
carries the structure of a complete Riemannian 
manifold. Every point pe M is naturally em- 
bedded in A(M) as a point curve, and M is a 
totally geodesic submanifold of A(M). A point 
ge A(M) is a critical point of the energy func- 
tion E: A(M)5R if and only if ø is a closed 
geodesic of M. It is known that E satisfies the 
Palais-Smale condition (C). The index and 
nullity of E at a critical point o is finite. Since 
M is compact, the fundamental length Je of 
M is positive and A^— (ce A(M)|E(a)<c} isa 
tdeformation retract of M c A(M) by means of 
the deformation along the integral curves of 
—VE. 

If M is not simply connected, then each 
nontrivial element of the "fundamental group 
of M represents a class of homotopic curves in 
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which there is a closed geodesic whose length 
realizes the infimum of all these curves. If 
M is simply connected, then there is a mini- 
mum integer k for which z,(M)-z 0. Clearly, 
7, 1(A(M)) x n,(M) 40. Suppose that E has no 
positive critical value. Then A(M) is a defor- 
mation retract of M, a contradiction. Therefore 
there exists at least one closed geodesic on 
every compact Riemannian manifold (Lyuster- 
nik and Fet [18]). 

Proof of the existence of many closed geo- 
desics on M is difficult due to the follow- 
ing: (1) There is a continuous O(2) action on 
A(M) that assigns c € A(M) and we O(2) to the 
curve t—o(t 4- a), te S3; (2) for each integer 
k and for each critical point o € A(M), the 
curve t>o(kt) is a critical point with energy 
k^ E(o). 

À remarkable result has been obtained 
by Lyusternik and Shnirel'man [19], which 
states that there are at least three closed geo- 
desics on every simply connected compact 
manifold of dimension 2. Fet then proved that 
there exist at least two closed geodesics on a 
compact manifold if all critical points of E on 
A(M) are nondegenerate [20]. By developing 
a precise argument concerning the Morse 
lemma around an isolated degenerate critical 
point c € A(M) of E, Gromoll and Meyer have 
proved that there exist infinitely many closed 
geodesics if the sequence of Betti numbers 
{b{A(M))} with respect to any field is un- 
bounded [21]. If 4: MM is a certain isome- 
try, then there are also infinitely many A- 
invariant closed geodesics if the Betti numbers 
of the space of A-invariant H,-curves are not 
bounded [22, 23]. By investigating the Z,- 
cohomology of A(M) of compact symmetric 
spaces, Ziller has proved that if M has the 
same homotopy type as that of a symmetric 
space of rank >2, then M has infinitely many 
closed geodesics [24]. l 

Many attempts have been made by W. 
Klingenberg and others to prove the existence 
of infinitely many geometrically distinct closed 
geodesics on every compact Riemannian mani- 
fold [25]. 
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Multivariate Analysis 


A. General Remarks 


Multivariate analysis consists of methods of 
statistical analysis of multivariate observations 
represented by a collection of points in a finite- 
dimensional Euclidean space R”. With the 
development of powerful computers, multi- 
variate techniques are beginning to be utilized 
in many fields of science and technology. 


B. The Multivariate Linear Model 


The multivariate linear model is an immediate 
extension of the univariate linear model. Sup- 
pose that X 2 (X'! ... X?) denotes the p x n 
matrix of n observations of p-dimensional 
data. Suppose that it can be expressed as 


X=BZ+U, (1) 


where B is a p x m matrix of unknown para- 
meters, Z is a known m x n matrix of inde- 
pendent variables, and U is a p x n matrix of 
errors. We assume that (i) the *expectations of 
the elements of U are zero, that is, the px n 
matrix E(U)— O0, and call the relation (1) a 
multivariate linear model. We usually assume 
further that (ii) the column vectors U®, i= 

I, ..., n, of U are independent and identically 
distributed, and that (iii) U is distributed 
according to a multivariate "normal distri- 
bution with ‘covariance matrix X. Analogous 
to the univariate case, the *least squares es- 
timator B of B is defined to be the p x m ma- 
trix that minimizes 


tr(X — BZ)(X — BZy 
and is given explicitly by 


B-XZ(ZZ)' when |ZZ'z0. 


Here the symbol ' means the transpose of a 
matrix. Also, an unbiased estimator of X is 
given by 


5 -Q./(n— m), 


B is an unbiased estimator of B under the 
assumption (i) above and is the "best linear 
unbiased estimator, under (i) and (ii), while £ is 
unbiased when (i) and (ii) are assumed. Under 
the assumptions (i)-(iii), B and Ê form a set of 
complete ‘sufficient statistics; hence they are 
tuniformly minimum variance unbiased es- 
timators. Also under (i)- (iii), elements of B are 
normally distributed, and their covariance can 
be expressed by 


Q,-XX' - BZx'. 


AM (Q denotes the 'Kronecker product), 
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where M —(ZZ') !. Applying *Cochran's 
theorem for the multivariate case, Q, is shown 
to be distributed according to a *Wishart 
distribution with n — m degrees of freedom. 

To test the hypothesis B= By under (i)—(iii), 
we put Q,=(B—B,)ZZ'(B— B,) and have (X 
— BoZY(X — Bo ZY Us + Q., where Oe and Q. 
are independently distributed. The distribution 
of Q, is a Wishart distribution with m degrees 
of freedom when the hypothesis is true, and a 
*noncentral Wishart distribution when B z B,. 
Based on this fact, several procedures have 
been proposed. If we require the invariance of 
procedures with respect to linear transforma- 
tions of the coordinates of p-dimensional 
vectors, the roots 4,,...,4, of the tcharacter- 
istic equation |Q; — 4Q,| 20 form a tmaximal 
invariant statistic; hence the testing proce- 
dures should be defined in terms of these roots 
(— 396 Statistic I). Also, the consideration of 
*power leads to procedures that reject the 
hypothesis when these roots are large. Com- 
monly used test statistics are (1) the flikelihood 
ratio test W—[Q,[//Q, + Q,| 2 IL(1 - 4) ' 

(S. S. Wilks); (2) trQ: ! Q4, 2 3:4; (D. N. Lawley 
and H. Hotelling); (3) max 4; (S. N. Roy); (4) 
trQ4(Q;-Q) ' 2 XA(1 4)! (K. C. S. 
Pillai). Pillai’s trace test (4) is locally the most 
powerful invariant; Wilks's likelihood ratio 
test (1) has the maximum Bahadur efficiency. 
The power functions of the tests (1)-(3) have 
the monotonicity property, namely, they are 
monotonically nondecreasing with respect to 
each eigenvalue of Q= X^! (B— B))ZZ'(B— 
Boy, the matrix of noncentrality parameters 
for Qx. The monotonicity for Pillai's test (4) is 
known to hold only for restricted cases where 
the critical value c for the acceptance region 

tr Q(B +Q) ! «c should satisfy Oc <1. All 
the tests (1)-(4) are unbiased. With respect to 
0—1 loss, the tests (1) and (4) are *admissible 
Bayes and the tests (2) and (3) are ‘admissible. 
Tests based on min 4; are inadmissible. 

Small-sample distributions of these statis- 
tics are complicated but when n— oo, nlog W, 
ntrQ,Q,', and ntrQ,(Q,+Q,) ! are as- 
ymptotically distributed according to a chi- 
square distribution with pm degrees of free- 
dom under the null hypothesis. Even under 
the alternative hypothesis that the matrix 
of noncentrality parameters Q = o(1) for large 
n, the asymptotic distributions remain the 
same. When Q= O(1), they are noncentral chi- 
square distributions of pm degrees of freedom 
and noncentrality parameter ó — tr. If Q = 
O (n), they are normal distributions, namely, 

— V/n(log W+log|I + 0|), /n(trQ,Q;! — tr) 
and J/n(trQ4(Q, +Q)! —tró( -- 0)7!) 

for 0 — lim QO/n have asymptotically normal 
distributions of zero means and variances 
given by 2tr(I — (1-0) ?), 2tr(20-- 0?), and 
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2tr((1--0) ? —(I 4-0) ^), respectively. As a 
special case, if m= 1 there exists only one non- 
zero A, and the procedures in this paragraph 
all coincide and are equivalent to one based 
on T? AM ! (B— By Z ? (B — Bj). It is known 
that under the hypothesis, (n — p) T?/(n — 1)p is 
distributed according to an * F-distribution 
with (p, n — p) degrees of freedom. When p— 

2 (resp. m= 2), (n —m— (1 — / W/(m /W) 
(resp. (a— p — 1)(1— JW Ve, / W)) is distributed 
according to an F-distribution with degrees of 
freedom (2m, 2(n — m — 1)) (resp. (2p, 2(n — p 
—1))). Simultaneous ‘confidence regions of B 
can be derived from the testing procedures in 
this paragraph, that is, 


trQ, '(B—B)ZZ'(B—B) «c. 


Moreover, when the matrix B is decom- 
posed as B —(B, : B), where B, and B, area p 
x q matrix and a p x (m — q) matrix, respec- 
tively, and the hypothesis to be tested is of the 
form B, =0, the test procedures can be ob- 
tained as follows: Decompose Z as 


C. 
ze 
Z, 


where Z, isa q x n matrix and Z, is an (m — q) 
x n matrix, and put 


B3=XZ}(Z,Z;)"', 


Q* = XX’ — BSZ,X’, Q;, -Q*-Q. 


Then Q,, and Q, are independent, Qg, is 
distributed according to a Wishart distribution 
with q degrees of freedom when the hypothesis 
is true, and we can apply the procedures in 

the previous paragraph, simply replacing Q, 
by Qs, 

Such a procedure is called multivariate 
analysis of variance (or MANOVA, for short). 
Various standard situations can be treated in 
this way (after some linear transformation of 
variables, if necessary). Some examples are (1) 
X «(X9 ... X), where the X (i= 1, ...,n) 
are distributed independently according to a 
p-dimensional normal distribution N(g, X). 
We can express X as X =al’ + U, and the 
estimators are given by #=X=X1/n, $= 
(X - XI )(X — XI) /(n — 1). In this case, we 
obtain a test for the hypothesis z= 4 based 
on Hotelling's T? statistic, i.e., the test with a 
'critical region of the form 


T? =n(X — ug £^ (X — ug) >. 


(2) Suppose that p x n; matrices X;, i— 1, ..., k, 
are samples of size n, from p-dimensional 
normal distributions N(#;, 2) with common 
covariance matrix X. The tests for the hypoth- 
esis 4t; =... = 4t, are obtained from the follow- 
ing observation: Let Q, = X,(X, — X,1)(X;— 

X; 1y, where X;=X,1/n,, Q, 2 X n(X;—X) 
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(X,- Xy, X= Xi n;X;/X nj. Then X(X; XI) 

(X, — X1) 2 Q, +Q.. We call Q, the matrix of 
the sum of squares within classes, and Q, the 
matrix of the sum of squares between classes. 
The latter is distributed according to a Wish- 
art distribution when the hypothesis is true. 

(3) Suppose that X,; are p-dimensional vectors, 
and that 

Xy 9; fl; +U;;, i=1,...,m, 


where y, a;, fj; are p-dimensional constant 
vectors such that $a; — and X fj; — 0, and the 
U; are independently distributed according to 
a p-dimensional normal distribution N (0, 2). 
We set 


Q,— ny. (X, —-X) (X; —Xy, 
Q, 7m (X -XXX ,—Xy, 


X-YY Raim, 

Then we have © X(X;—X)(X;—X) =Q, + 
Q; +Q., and Q,, Q,, Q. are distributed inde- 
pendently according to (noncentral) Wishart 
distributions with degrees of freedom m — 1, 

n — 1, and (n— (im — 1), respectively. The tests 
for the hypothesis «;=0 (i= t, ..., m) or ff, 0 
(j=1,...,n) are obtained from these matrices. 


C. Tests for Covariance Matrices 


Let X;(p x 1), j=1,...,N;, bea random sample 
from p-variate normal distribution N (gt;, 27) 
for i2 1,..., k. For testing the hypothesis Ho: 
X,=...=2;, with unknown mean vectors gt 
against all alternatives, the likelihood ratio 
statistic is given by 


NN? lH JS, "2 
BESA DS? 


where S= X, (X; — X)(X, Xy for X;= 
Y Xj/N;. If we replace the sample size N; 
by the degrees of freedom n; 2 N; — 1 and N 
by n= N —k in (2), the modified likelihood 
ratio test is unbiased for general p and k [16]. 
For k— 1, the hypothesis specifies Hy: X, = 

Xo (a given positive definite matrix) and 

the likelihood ratio statistic is given by 
[S| "^ etr( — X5! S/2) (etr(x) 2 exp(tra)). Again 
replacing N by n2 N —1 yields an unbiased 
test. Moreover the power function of this 
modified likelihood ratio test depends only 
on the eigenvalues of X, Xj! and is increasing 
with respect to the absolute deviation of each 





N=N,+...+N,, (2) 
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eigenvalue from 1, that is, Ich, A, X; ! — 1j. For 
k — 2, it is conjectured that the modified likeli- 
hood ratio test has a power function monoton- 
ically increasing with respect to |ch;2, X; ! — 1|. 
This conjecture has not yet been proved; we 
know only that the power is increasing if the 
maximum eigenvalue ch, E, £7! increases 
from 1 or the minimum eigenvalue ch, Z, 2;! 
decreases from 1. 


D. Estimation of Mean Vector and Covariance 
Matrix 


Let X(p x 1) have p-variate normal distri- 
bution N (4, I). For estimating the unknown 
mean, take the sum of squared errors A asa 
loss Lin, d) — ||u — d||?. Since each component 
X, of X is distributed independently according 
to N(uj, 1) for uz (yu, ..., iy, it is natural to 
suppose that X is a good estimator. In fact X 
is a minimax estimator for u. However, Stein 
showed that X is admissible for p — 1 and 2 but 
inadmissible for p > 3. James and Stein showed 
that the estimator 


p—2 
1—-—— IX 
( "3 a 


dominates X when p > 3. This estimator is 
further dominated by Stein’s positive part 
estimator (1 — (p —2)/||X |7) X, where (a), 
means a if a is nonnegative and zero if a is 
negative. The class of estimators X + Vlog f(X), 
where V —(0/0x,, ..., 0/0x,) for Kl —(x,,..., xy) 
are all minimax for o dominating X, if f(X) 

2:0 and ./f(X) is superharmonic (V?, / f(X) 
<0), satisfying E[||Vlog f(X)||?] < oo and 
E[X|0? f(X)/0x? /f(X)] « oc. Putting f(X)— 
IXI ^7? yields the James-Stein estimator 

(3). For this problem a class of monotone 
estimators is essentially complete, where an 
estimator d(X) is called monotone if d(X) « 
d(Y) whenever X < Y (defined componentwise). 
Stein's positive-part estimator is not mono- 
tone and is still inadmissible [8]. 

Let X,(p x 1), i=1,...,n, bea random sam- 
ple taken from the p-variate normal distri- 
bution N(0, X). Then the maximum likeli- 
hood estimator for X is given by S/n, where 
S= Xi- X;X;. S has Wishart distribution 
W,(n, X). To study the estimation problem of 
X, take two loss functions L,(Z,d)-trdX ^! 
—log|dX !|— p and L,(Z,d) - (1/2)tr(dX ! 
— Iy*. Under L,-loss, the best scalar multiple 
of S is given by S/n, and under L,-loss it is 
given by S/(n - p 4- 1). The James-Stein mini- 
max estimator for L,-loss is given by d,(S) 

— KAK', where K is the lower triangular ma- 
trix with positive diagonal element for which 
S—KK' and A—diag(A,, ..., A,) with A;= 
Lin - p - 1 — 2i). A minimax estimator for 
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L,-loss is given by d,(S)= KAK' with the same 
lower triangular matrix K, but A is the solu- 
tion of the linear equation AA — b, where A is 
given by 


(n+ p— 1)(n- p 4 1) n4 p—3 


A n+p+3 (n+ p—3)(n+ p— 1) 
n—p+1 n—p+1 
n—p+l 
n—p+l1 


... (n—pt1)(n—p+t+3) 
and b'=(n+p—1,n+p—3,...,n—p+l1). The 
minimax estimators d,(S) and d,(S) dominate 
the best scalar multiples of S. However, they 
are inadmissible. Let P be a permutation ma- 
trix, then the estimator £, P’d,(PSP’)P/p! dom- 
inates d;(S) because of the convexity of the loss 
function L; for i2 1 and 2. The estimator h,(S) 
—(S--ub(u)C)/n, where u=1/tr CS“! and C is 
a fixed positive-definite matrix, dominates S/n 
under L,-loss if 0 € b(u) x 2(p — 1)/n and b(-) is 
nonincreasing. Under L,-loss (S + bC/tr CS !)/ 
(n+ p - 1) dominates S/(n+p+1), if0<b< 
2(p — l)y/(n — p - 3) [7]. The Haff estimator 
h,(S) is dominated by the James-Stein esti- 
mator d,(S) if p 2 6. The estimators (S + 
b(tr CS ! tr CS ?)C)/n for0 x b «2(p— 1)/n 
under L,-loss also dominate S/n and are not 
dominated by d, (S). Similar results hold under 
L,-loss. 


E. Correlation among Variables 


In order to represent the interrelationships 
among the p variates, the population corre- 
lation matrix P= {p,;} is used. Also, we can 
calculate the population *multiple correlation 
coefficient of the ith component X; and all the 
rest of the variables by 


Dei mn LBE, 


and the tpartial correlation coefficient of X; 
and X, given all others, by 


Piji...G)...Q..p7 — Pal, PiPj, 


where P; are the cofactors of P (— 397 Statis- 
tical Data Analysis). 

The sample correlation matrix is calculated 
from the data, and the sample multiple correla- 
tion and the sample partial correlation coeffi- 
cients are calculated from the sample correla- 
tion matrix in exactly the same way as the 
population coefficients are calculated from 
the population correlation matrix. When X = 
(X... X) is a sample of size n from a 
multivariate normal population, the sampling 
distributions of R,,. vn and R; are 


2s d)esp 1.) Gap 


known (— 374 Sampling Distributions). 
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The determinant of the covariance matrix 
|X| (or |S]), called the (sample) generalized var- 
iance, is a measure of the dispersion of a p- 
dimensional distribution. The distance of two 
distributions with mean vectors gt, and #3, 
respectively, and with common variance X is 
often expressed by 


KEE —puyZ (qu, — ft), 


which is called the Mahalanobis generalized 
distance. 
When the data consists of (p + q)- 


X 
dimensional vectors (*) with q <p, the inter- 


relation of X and Y as a whole can be ex- 
pressed in the following way: Let the covar- 
lance matrix be partitioned as 


ER Es SE 

Zon Za 
and the nonzero roots of the equation |pZyy — 
ZyxZxx Zxy| =0 be p,, ..., p,. Then pi^, ..., 
pł”, called the canonical correlation coeffi- 
cients, are the maximal invariant parameters 
with respect to linear transformation of X 
and Y. Also, if we denote the eigenvector cor- 
responding to a root p; by 9, i.e., 


(9; Zulte (Lyx Xxx Lyy)Nis 


the linear function o Y and y; Zyy xd X are 
called the canonical variates. 


F. Principal Components 


An important problem in multivariate analysis 
is to express the variations of many variables 
by a small number of indices. Principal compo- 
nent analysis is a technique of dealing with this 
problem. 

Let X be a p xn matrix with n column vec- 
tors of p-dimensional data. A linear transform 
of X, T= AX (A anr x p matrix, r <p, T an 
r x n matrix) is called the principal component 
if A is chosen so as to maximize the sum of 
the squares of the sample multiple correlation 
coefficients of each of the row vectors of X to 
those of T, namely, if A is an r x p matrix 
formed by the r eigenvectors of the sample 
correlation matrix of X corresponding to the r 
largest eigenvalues or any nonsingular linear 
transformation of them. This is a characteriza- 
tion of principal components in terms of cor- 
relation optimality. 

The principal component T — AX is also 
characterized by the information-loss op- 
timality in that all eigenvalues of (X — CY — 
bl, (X — CYbL) are simultaneously minimized 
subject to the condition: C is a p x r matrix, 
bis a px 1 vector and Y is an r x n matrix. 

The solution is given by CY + b1, = A'T 4- x1,, 
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where X= X1,/n. The variation optimality of 
the principal component is given by maximiz- 
ing simultaneously all the eigenvalues of the 
matrix C(X — XL): (X — X; C, subject to 

the condition that C is a p x r matrix such 
that C'C — I,. The solution is given by C — A', 
namely, C'X — T [15]. If all the correlations 
between the components of X are positive, the 
largest eigenvalue of the covariance matrix of 
X is simple and positive. All the coefficients of 
the first principal component (components of 
the eigenvector) can be taken to be positive 
(Perron-Frobenius theorem). 

When we assume normality, the eigenvalues 
of the sample correlation matrix R are the 
'maximum likelihood estimators of the eigen- 
values of the population correlation matrix, 
and their sampling distributions can be ob- 
tained. A hypothesis relevant to principal 
component analysis is, for example, that the 
smallest p —r eigenvalues of the correlation 
matrix are equal, which can be tested by the 
statistic 


Rp- 
=|RIMA, Ap — A — ..— AMG—r) 7), 
where 4,, ..., 4, are the r largest eigenvalues of 


R. Under the hypothesis, — clogR,., (c a con- 
stant) is asymptotically distributed according 
to a chi-square distribution when n oc. 

Variations of principal component analysis 
can be obtained by taking the eigenvectors of 
the covariance matrix of the raw data or of a 
multiple of it by some weight matrix. 


G. Factor Analysis 


Factor analysis is closely related to principal 
component analysis. We assume a model 


X=BF+U, 


where B and F are unknown pxrandrxn 
matrices of constants (p>r) and Uisapxn 
matrix of independent errors. F is called the 
matrix of factor scores and B the matrix of 
factor loadings. We assume FF' —nI. If E(UU’) 
=n® is known, then by applying the least 
squares principle, we can determine B and F 
so as to minimize the trace of (X BET IO 
— BF). Then B is obtained by taking the r 
eigenvectors of ®~' XX' corresponding to the r 
largest eigenvalues. When ® is diagonal but 
unknown, we can solve the simultaneous equa- 
tion for B and ®, whose solutions are the 
matrix B with columns equal to eigenvectors 
of 6 ^! XX’ and the diagonal matrix Ê with 
elements equal to the diagonal elements of 
XX'/n — BB’. 

If we assume that U is normal, the proce- 
dure in the preceding paragraph for the case 
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when ® is known is equivalent to the maxi- 
mum likelihood method. When 6 is unknown, 
we can assume further that the columns in F 
are also multivariate normal vectors distri- 
buted independently of U, which implies that 
the columns of X are also normal vectors with 
the covariance matrix X = BB' +0. B and 6 are 
estimated from the sample covariance matrix, 
and the solutions of the simultaneous equation 
for B and gives the maximum likelihood 
estimators. However, there is the so-called 
identification problem of determining whether 
for given Z and r the decomposition X = BB’ + 
® is unique. This problem has not yet been 
completely settled. If the solution Ê is posi- 
tive definite and BB’ is positive semidefinite, it 
is called proper. When 9 is not positive defi- 
nite, it is called a Haywood case, and when 
BB’ is not positive semidefinite it is called a 
complex case. Sometimes iterative procedures 
lead to improper solutions. 


H. Canonical Correlation Analysis 


Canonical correlation analysis can also be 
used for descriptive purposes. Sample canon- 
ical variates have various descriptive implica- 
tions. Suppose that y; Y and £j X are the first 
canonical variates corresponding to the largest 
canonical correlation p]. Then p^ is the 
largest possible correlation between a linear 
function of X and a linear function of Y and is 
actually equal to the correlation of y, Y and 
či X. Similarly, the second canonical correla- 
tion is equal to the largest possible correla- 
tion between linear functions in X and in Y 
which are orthogonal to £i X and y, Y, respec- 
tively, and so forth. 

As a second interpretation of the canonical 
variates, we consider the linear regression 
model 


Y=BX+U, 


where Y, B, are q xn, qx p, q x n matrices, 
respectively, and rank B —r <q. Then there 
exist an r x p matrix A anda q xr matrix C 
satisfying CA ! C— I such that B=CA, where 
E(UU’)=n2. Putting T= AX, we get a re- 
gression model Y 2 CT + U, regarding T asa 
matrix of regressor variables. If X is known, 
least squares considerations lead to minimiz- 
ing tr(Y — BX)X ! (Y — BX) with the condition 
rank B —r. The resulting row vectors of A 
consist of the r eigenvectors corresponding 

to the r largest eigenvalues of the matrix 

$4 S,2 !S,. and column vectors of C consist 
of the r eigenvectors corresponding to the r 
largest eigenvalues of the matrix $,,S,7 S,,7'. 
If X is replaced by S,,, T- AX and ZZ C'S,/ Y 
are equal to the matrices of canonical variates. 
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If we assume that U is normal, this proce- 
dure is equivalent to the maximum likelihood 
method. It should be remarked that although 
the model here is not symmetric in X and Y, 
the results are symmetric in X and Y, and 
therefore they will be the same if the roles of X 
and Y are interchanged in this model. 


I. Linear Discriminants and Problems of 
Classification 


Let p x n; matrices X; (i— 1,..., k) be the set of 
observations for k distinct populations with a 
common covariance matrix. We determine a 
vector a such that T; — a'X; reveals the dif- 
ferences of the k populations as much as pos- 
sible, or, more precisely, so that the ratio of 
the sum of squares between classes of T to the 
sum of squares within classes is maximized. If 
the matrices of the sums of squares between 
and within classes are Q, and Q,,, respectively, 
the ratio is equal to 


1—a'Q,a/a'Q..a, 


which is maximized when a is equal to the 
eigenvector of Q,,'Q, corresponding to the 
largest eigenvalue. The linear function t — 
a'X is called the linear discriminant function. 
When k=2, a is given by a-Q;, (X, — Xj), 
where X, and X, are sample mean vectors. 
When k> 2, we let A be the matrix formed 
by the r eigenvectors corresponding to the 
first r largest eigenvalues of Q;,'Q,, and set 
T; — AX;. From this we can construct the r- 
dimensional discriminant function. These 
functions can be used to locate the k popu- 
lations in r-dimensional space, and also to 
decide to which population a new observa- 
tion belongs. For the latter problem we can 
also construct k quadratic functions s; — 

(X — XQ, (X — X), where X; is the sample 
mean vector of the ith population, and X a 
new observation. Then we can decide whether 
X belongs to the population corresponding to 
the minimum s;. Such a method is called a 
classification procedure. 


J. Discrete Multivariate Analysis 


Let X,,, be an observed frequency on three 
characteristics, each belonging to ijkth class 
fori «ixl,1xj«J and 1<k<K. Assume 
that X; is a sample from multinomial distri- 
bution having p; as a probability of occur- 
ence for an (i,j,k) cell, where p... —3 pj, — 1. 
The multinomial observation X, with proba- 
bility pa is called an I x J x K contingency 
table. If we further assume that 


log pij, —pn-r a; B; "yp Bu TYyuc Orn 
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with the restrictions on parameters 2o, = 
2,2, 7 Lutz D. similarly on f’s and y’s and 
20, = 22,04 = Zu Go =O, we say that a 
saturated log-linear model is given. Here the 
number of observations and the number of 
parameters are equal, and no errors can be 
estimated. The parameter Au, is called the 
three-factor effect, or equivalently, the second- 
order interaction. Similarly, au, f, Yri are 
called the two-factor effects or the first-order 
interactions. Finally o;, fj; 7, are called the 
main effects. A simple model is obtained when 
all the first- and second-order interactions 
vanish. This is equivalent to the independence 
model: p; = p;.. p.;.p., and the maximum like- 
lihood estimators for u, a, Pj, y, are obtained 
from Pj — X,. X. X.,/m^, wheren- X... A 
nontrivial model is obtained by putting all the 
second-order interactions equal to zero. The 
likelihood equations for this model are given 
by 


npj.—X nPir Aa, MP. j= Xj: (4) 


ij? 
Bartlett first described a solution of (4) when I 
=J = K =2. For a 2 x 2 x 2 table, the solution 
can be expressed by Bu — (xij, + 0)/n because 
of the constraints for X;,., X;,, and Ka, Putting 
011, —0 for fi, yields a cubic equation for 0: 


(Xiii FOX p29 -0)(X55, -0)(X5,; +0) 
—-(Xi5 —0)(X,,5—0)(X,,; —0)(X,,, —0). 


For a general I x J x K table, the equations (4) 
have a unique solution within the no-three- 
factor effect model if there exists du >Q satisfy- 
ing (4). The unique solution maximizes the 
likelihood. To solve the likelihood equa- 

tions (4), standard iterative procedures, such 

as the Newton-Raphson method, can be ap- 
plied. However, the following iterative scaling 
method of Deming and Stephan is more useful: 





np3m*1 — y3m Au 
Pijk = Pijx (3m)? 
Uu 
np(m*2) — p(3m*1) Xa 
Dm ` 5 Pijk Ga: 
Pik 
X. 
(3m+3)__ ,(3m+2) “Jk 
NP ijx = Pij (5) 


ijk pen ; 
The first iteration adjusts pi”? by fitting pro- 
portionally with respect to k so that np(?"*! is 
equal to X;., and similarly for the second and 
third iterations. Starting with any initial values 
satisfying the first-order interaction model, the 
iterative scaling method (5) converges to the 


unique solution of (4).as m> oo. 


K. Other Problems 


The sampling distributions associated with the 
procedures discussed here are usually very 
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complicated, and often only asymptotic prop- 
erties are known (— 374 Sampling Distri- 
butions). Nonparametric rank analogs of 
many multivariate techniques for normal 
distribution are found in [18]. Robustness of 
the distributions of test statistics or of latent 
roots is investigated in [3, 10, 14]. Multivariate 
data analysis is found in [5]. 
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281 (XIX.5) 
Network Flow Problems 


A. Introduction 


The network flow problem is a special kind of 
mathematical-programming problem (— 255 
Linear Programming, 264 Mathematical Pro- 
gramming, 292 Nonlinear Programming), 
where the variables of the objective function 
and the constraints are all defined in terms of a 
graph (— 186 Graph Theory). Owing to its 
special structure, the mathematical properties 
of the network flow problem as well as the 
solution algorithms have been investigated in 
detail. Network flow problems have a variety 
of useful applications in fields such as trans- 
portation, scheduling, and resource alloca- 
tion, and in operations research in general, so 
that they now constitute an important class 
of mathematical programming. By the term 
"network" (Netzwerk in German, réseau in 
French, set' in Russian) we usually mean a 
graph with some physical attributes attached 
to its edges and vertices. 


B. Basic Form of the Problem 


Let G—(V, E,0*,0 )bea graph with vertex 
set V, edge set E, and incidence relations à *, 
OC :E—V, and let R be the field of real num- 
bers. (Most of the statements in the following 
are valid if we take for R an ordered field or 
ordered additive group more general than the 
field of real numbers.) Furthermore, we regard 
the collection £ of all functions £: E2R asa 
vector space of dimension |E], denoting £(e,) 
by é* if E={e,,...,e,$. Similarly, the collec- 
tion Q of all functions o: V9 R is a vector 
space of dimension |V|. If we define the map- 
ping 6*:V>2* by ó*vz (e|0*e- v] (— 186 
Graph Theory), a linear mapping 0:2-Q is 
naturally introduced through the relation 
IO Wine Äere £(6) — Becs- éle). A vector 65 
is called a flow on G if 06 20. The dual space 
E* of = and the dual Q* of Q are identified 
with the collection of functions y: E R and 
that of C: V9R, respectively, under the obvi- 
ous correspondence. Then the linear mapping 
6:Q* 2 E*, which is contragredient to 0, is 
defined by means of the relation (6£)(e) = 
C(0* e) —C(O el A vector je E*, which is the 
image of ¢ under 4, i.e., which satisfies 7 = 0C, 
is called a tension on G, and ¢ is called the 
potential on G corresponding to the tension 
n. (Sometimes, Ziel is called a flow in edge e, 
giel a tension across e, and £(v) a potential at 
vertex v.) 

A continuous curve C (cR x R) on the 
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Euclidean plane R x R is said to be mono- 
tone if (x, — xj)(y; — y;) 20 for any (x1, yi), 
(X5, y)e C. A monotone curve C is called a 
characteristic curve if its projection to each 
coordinate axis, Le, C, = (x|(x, y)e C] and 
C,siyl(x. y)e C}, is a closed interval. For a 
characteristic curve C, two convex functions, 
g(x) — Ar — yo) dx and y(y)= ls — xo) dy. 
are defined, where (x, Vo) is a point fixed on 
C and the integrals are taken along (x, y)e C. 
(It is understood that o(x)— oo if xC, and 
V(y)— oo if yéC,.) These two functions defined 
for a fixed C are conjugate to each other in 
Fenchel’s sense, and they satisfy o(x) - (y) 2 
(x — X9)(y — yo) for any (x, y)eR x R, where the 
inequality reduced to an equality if and only if 
(x, y)e € (— 88 Convex Analysis, 292 Non- 
linear Programming). 

A network N in network flow theory is a 
graph G to each edge e, whose edge set E = 
le, 6,4) is given a characteristic curve C*. 
On a network N, the following three problems, 
PI, PII, and PIII, are defined. PI: Find a flow 
€ that minimizes D(Z) z >" -| o, (£*) under 
the constraints €* = £(e,)e CT (k=1,...,n). 

PII: Find a tension y which minimizes V (i) 
3-109" (n) under the constraints n, =n(e,)€ 
Cy (& — 1, ..., n). PII: Find a pair (£, 5) of a 
flow Z and a tension y such that (£*,5,)e C* for 
all x — 1, ..., n. (With respect to PI or PII, a 
flow or a tension satisfying the “constraints” 
is ordinarily called a feasible flow or tension, 
respectively.) Then, as a special case of the 
(Karush-)Kuhn-Tucker theorem and the dual- 
ity theorem in nonlinear programming, we 
have the following theorems (A) and (B). 

(A) For a given network N, one and only 
one of the following four alternatives is the 
case: 

(1) There is a feasible flow in PI, and at the 
same time, there is a feasible tension in PII. In 
this case, both PI and PII have a solution, 
and, for any solution Z of PI and any solution 
f) of PII, the pair (ê, 4) is a solution of PIII; 
and, conversely, for any solution (ê, 4) of PHI, 
the flow £ is a solution of PI and the tension D 
is a solution of PII. 

(ii) There is a feasible tension in PII, whereas 
there is no feasible flow in PI. In this case, 

c (c) of PII is not bounded below for feasible 
tensions č. 

(iii) There is a feasible tension in PII, whereas 
there is no feasible flow in PI. In this case, 

Y (4) of PH is not bounded below for feasible 
tensions q. 

(iv) There is neither a feasible flow in PI nor a 
feasible tension in PII. 

(B) Let CS =[b*, c*] (b* i c*; b" can be —oo, 
and c^, oo) and Cj — [d,, Jk] (d, can be —oo, 
and f,, oo). Then a necessary and sufficient 
condition for PI to have a feasible flow is that 
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E, c“— $b“ 20 for every cutset (cocycle or 
cocircuit) w of G, where the summation X, 
(resp. È) is taken over all the edges e, (€ E) 
that lie in o in the positive (resp. negative) 
direction. Similarly, a necessary and sufficient 
condition for PII to have a feasible tension is 
that ©, f, Ład, 20 for every tieset (cycle or 
circuit) 0 of G, where X, (resp. X) is taken 
over all the edges e, that lie in 0 in the posi- 
tive (resp. negative) direction. 


C. Shortest Paths, Maximum Flows, and 
Minimum-Cost Flows 


We choose one of the edges of G, say e,, as the 
reference edge and assign it the parametric 
characteristic curve C! (a) = ((x, y)| y=x+a}. 
Then if there is a feasible flow on the network 
obtained from G by contracting De, short- 
circuiting) e,, and if at the same time there is a 
feasible tension on the network obtained from 
G by deleting (i.e., open-circuiting) e,, then 
problem PHI has a solution (êca), <a>) for 
every real a, and é! (a and fj, <a) are uniquely 
determined for each a. The problem of deter- 
mining these parametric solutions is the two- 
terminal problem for the two-terminal network 
N, (which is obtained from G by deleting e,) 
with the vertex O e, as the entrance (or source) 
and the vertex 0 * e, as the exit (or sink). The 
curve C={(E!a), 4, <x>)|aeR} enjoys the 
properties of a characteristic curve, and is 
called the two-terminal characteristic of N, 
with respect to the entrance 0” e, and exit 

O^ e,. A two-terminal network for which only 
the projections to the x-axis Ck — [b*,c*] of 
the edge characteristics are specified (x — 2, 

... n) is called a capacitated network, and the 
maximum-flow problem for a capacitated 
network N, can be mathematically formulated 
as the problem of determining the projection 
to the x-axis C, —[b,c] of the two-terminal 
characteristics of N,. For the maximum-flow 
problem, the relation c 2 minier — Ehr) 
holds, where the minimum is taken over all 
the cutsets that contain e, in the negative di- 
rection and where X, (resp. 35) denotes the 
summation over all the edges, except e,, lying 
in a cutset in the positive (resp. negative) direc- 
tion. This relation is called the maximum-flow 
minimum-cut theorem. (A similar relation 
holds also for b.) 

Similarly, or dually, the problem of deter- 
mining the y-projection C, — [d, f ] of the two- 
terminal characteristic of a two-terminal net- 
work N, for which the y-projections C; — 

[4,, fa] of the characteristics are specified to 
the nonreference edges e, (x —2,...,n)isa 
network flow formulation of the shortest-path 
problem. For the shortest-path problem, the 
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relation f 2 min [X74 f, — 225 d,), called the 
maximum-separation minimum-distance theo- 
rem, holds, where the minimum is taken over 
all the tiesets that contain e, in the positive 
direction and where $; (resp. 25) denotes 
the summation over all the edges, except e,, 
lying in a tieset in the positive (resp. nega- 
tive) direction. 

The minimum-cost flow problem is to deter- 
mine the two-terminal characteristic of N, 
when all the C* (x z 1) are of staircase form. 

A number of algorithms exist for time com- 
plexity (— 71 Complexity of Computations) 
O(|V|3) for the shortest-path problem [4]; the 
algorithm proposed by E. W. Dijkstra [7] for 
the case d, « 0 « f, for all «(4 1) is of complex- 
ity O(|V]?). The algorithm of A. V. Karzanov 
[8,9] for the maximum-flow problem is of 
complexity O(|V|?). The minimum-cost flow 
problem can be solved by alternately solving 
the subproblems of the shortest-path type and 
those of the maximum-flow type, but no al- 
gorithm of time complexity polynomial in |E| 
and |V| has been found. 


D. Transportation and Scheduling 


Let us make the edges of a graph G —(V, E, 
0*,0 ) correspond to the transportation 
routes and the flow to the stream of a com- 
modity; impose a capacity constraint of the 
form 0 x £* «c* on the flow £* in each edge e, 
and assume a cost function of the form ¢,(é") 
= f, &* for each edge. (c* is a constant called 
the capacity of edge e,, and f, is a constant 
called the unit cost of edge e,.) Furthermore, 
let us specify a subset V, (c V) of vertices as 
the set of entrances and another subset V, 
(c V)as the set of exits, where V, V, =Ø, 
and prescribe the amount of inflow q(v) to 
each entrance ve V, and the amount of outflow 
q(v) from each exit ve V,, where we must 
have Liev, q(v) Äer, qv). Planning a trans- 
portatton plan that satisfies all the above- 
prescribed conditions and that minimizes the 
total cost Eer 9, (£^) can be reduced to finding 
a minimum-cost flow on the extended network 
G -(F, E,6*,6-), defined as follows, such that 
the flow in the reference edge is to be maxi- 
mum: V=VU {st U{t} (s,t¢ V), E=EUE, U 
E-U feo} (E,NE=E,NE=@, e,¢ EVE, UE); 
eo is the reference edge, de, =t, 0^ eg =s; 
E, ={e|éte=s, 0^ e—v, ve Vj, C(e) - ((, y) 
y «0jJU (x, 0)0€x«q(0 ei U {(4(v), y)] 
0x yj, where C e=veV,; E;-le|0* e—v, 
0 e-t,ve V], Cte) - (0, y)|y<O}U {(x,0)]0< 
x <q(v)} U{(q(v), y) lO <y}, where 0* e- ve V5), 
§* | p=0%, Ce) - (0,0 y fU (G5 fol 
O<x<c*}U{(c*, y yz fe fore,eE. 

For the project-scheduling problem with an 
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acyclic graph G=(V, E,O*,0 )asthe arrow 
diagram for which the start node se V and the 
completion node are specified, we make an 
edge e(e E) correspond to a job (or activity), 
and a vertex correspond to the event that 

all the jobs corresponding oda v have been 
finished, whereas those corresponding to ó*v 
are ready to commence. Then we interpret 
the negative of the tension of an edge as the 
time (e, duration) spent on the correspond- 
ing job and the potential at a vertex as the 
time De, instant) of the corresponding event 
taking place. Furthermore, we assume that 

to each edge e, (or the corresponding job) are 
given the normal duration — d,( > 0), the crash 
duration — f,(7 0, < —d,), and the unit cost 
c*(>0) we have to pay in order to decrease 
the job time by one unit, where the job time 
necessarily lies between the normal and the 
crash duration. Under the above-listed speci- 
fications, we consider the relation between the 
total project time (e, the duration from the 
event corresponding to the start node to that 
corresponding to the completion node) and 
the extra cost to be paid for decreasing the 
project time below the normal one. This prob- 
lem can be reduced to the two-terminal char- 
acteristic problem for the following network 
G: G is obtained from G by adding the refer- 
ence edge e, (0* eo =t, dey = s); C* = ((0, y) 
yxd,JU((x,d)|O«x«c*)U((c* yd. «y« 
S U fale" <x}. 


E. Applications to Combinatorial Optimization 


Many problems in combinatorial optimization 
can be reduced to network flow problems. The 
problem of finding a maximum matching on a 
bipartite graph G (— 186 Graph Theory) is 
reduced to the maximum-flow problem for the 
graph representing the transportation prob- 
lem on G with one of the two vertex sets as 
the entrance set and the other as the exit set, 
where all the edges have a unit capacity and 
the amount of inflow/outflow to/from each 
entrance/exit vertex is equal to unity (the cost 
being irrelevant). (The existence of an integer 
solution to this kind of maximum-flow prob- 
lem is proved constructively on the basis of 
the solution algorithm.) The maximum-flow 
minimum-cut theorem in this particular case 
can be stated as follows: The maximum car- 
dinality of matchings on a bipartite graph 

is equal to the minimum cardinality of ver- 
tex subsets which cover all the edges (this is 
known as the König-Egerváry theorem). The 
Dilworth theorem for a partially ordered set, 
the criterion for the existence of a graph with 
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prescribed vertex degrees, etc., are known to 
be reducible to network flow problems [2]. 


F. Generalizations 


The network flow problem may be general- 
ized in various directions. Replacing a graph 
by a matroid (— 66 Combinatorics) or con- 
sidering stronger conditions on the feasibility 
of flows and tensions are natural extensions 
[4, 6, 10, 11]. Another extension is to con- 
sider several kinds of flow, instead of a single 
kind, that simultaneously affect the capacities 
of edges. This latter problem is called the 
multicommodity flow problem in contrast to 
the single-commodity flow problem that was 
treated above [5]. It.can be said that any ex- 
tension of the network flow problem aims at 
a mathematical model that has wider appli- 
cation without losing the advantage of having 
simple effective solution algorithms. 
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282 (XX.17) 
Networks 


A. Linear Graphs (— 186 Graph Theory) 


A linear graph (or simply graph) ts an object 
composed of (i) a finite set {B,} (« — 1, ...,n) 
of elements called branches (or edges), (ii) a 
finite set {N,} (a=1,...,m) of elements called 
nodes (or vertices), and (iti) an incidence rela- 
tion between branches and nodes represented 
by a function [B,: N,] from (B,] x {N,} to 
(0, 1, —1} such that for every x, there exist 
exactly one N, with [B,: N,] = 1 and exactly 
one N, with [B,: N,] = —1, with all the other 
[B,: N,] equal to 0. (Intuitively, a branch B, 
starts from the node N, with [B,: N,] = 1 and 
ends at the node N, with [B,: N,] = —1.) In 
terms of topology, a (linear) graph is a 1- 
dimensional finite tsimplicial complex (— 70 
Complexes, 186 Graph Theory). A network 
in the wide sense is a linear graph whose 
branches and nodes are endowed with some 
physical properties. 


B. Networks 


A contact network, one of the simplest kinds of 
networks, is an abstraction of a circuit whose 
branches correspond to contact points of 
relays and switches that are allowed to take a 
finite number of physical states, e.g., the two 
states “on” and "off." The theory of contact 
networks is developed by means of Boolean 
algebra and is applied to switching networks, 
such as telephone exchange networks and the 
logical networks of digital computers. 

In most cases, to the branches B, of a net- 
work two kinds of real quantities i, and E, are 
assigned (which may be variables or functions 
of time) satisfying the conditions: 


© Y IRAN  (a=1,...,m) 
k-íi 


(ii) there exist E, such that 


Ms 


[B,:N,JE,=E, (k=1...,n). 


ll 
m 


a 


In the case of electric networks, where i,, E,, 
and E, are the current in branch B,, the vol- 
tage across branch B,, and the potential at 
node N,, respectively, these conditions are 
known as Kirchhoff's laws. 

The network flow problem, which has a 
number of practically important applications 
in toperations research (e.g., transportation 
problems, project-scheduling problems) and is 
a special case of *mathematical programming, 
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can be formulated as the problem of minimiz- 
ing >”, LG) under condition (i) (or minimiz- 
ing SI, f(E) under condition (ii)), where for 
each branch B,, f, is a given fconvex function 
defined on a given interval [a,, b, ]. 


C. Electric Networks 


Since there has been a great deal of research 
on electric networks, “network” often means 
“electric network.” We call a branch in which 
the current is a given function of time a cur- 
rent source, and one across which the voltage 
is a given function of time a voltage source. 
A branch that is either a current source or a 
voltage source is called a source branch. A 
network with M source branches is called an 
M-port network. If the currents i, in and the 
voltages E, across the non-source branches 
(n' in number) are related by 


where the z,, or y, , are linear tintegrodifferen- 
tial operators, the network is said to be linear. 
If z;, —z,; OF Yea = yj. it is said to be recipro- 
cal or bilateral; if the z,; or y,; are invariant 
under the change of the origin of time, it is 
said to be time-invariant; and if a linear time- 
invariant network satisfies the condition 


t 
| Y, üí()E(0dr <0 
-œ ByeS 
for every t and for every choice of functions 

of time for i, or E, associated with the source 
branches B, in S, provided that the current- 
voltage relations for non-source branches are 
satisfied, then the network is said to be passive. 
Under certain nonsingularity conditions, for 
the currents in and the voltages across the 
source branches (denoted by 7, and e,) of a 
linear M-port network, we have the relations 


e, — X Zelo h= Y Yea (B,eS) 
Reg Reg 


where the matrices Z,, and Y,, of linear inte- 
grodifferential operators are called the port- 
impedance matrix and the port-admittance 
matrix of the network, respectively. Analysis 
determines Z,;, Y,, from a given linear graph 
and given z,,, Yxa, while synthesis finds a net- 
work Oe, Z,; or y,;, as well as a linear graph) 
when part of Z,;, Y,;, or some relations to be 
satisfied by them are given. (In synthesis, the 
Z,, Of y,, are usually confined to some special 
class.) In analysis as well as synthesis we usu- 


282 Ref. 
Networks 


ally deal with the *Laplace transforms z, ;(s), 
Y«4(5), ZealS), Yea(s) instead of Zeis Yeas Zi. Yes 
themselves, where the characteristics Z, ; (s), 
Y,,(s) of a network are determined by the 
topological properties of its linear graph and 
the properties of Z,,(s), ¥,,(s) as functions of 
the complex variable s. (If œ is the angular 
frequency, s = ia.) 

The following fundamental facts are known 
in regard to analysis and synthesis: 

(1) If Z,.,(s), Y, (s) are rational functions of s 
holomorphic on the open right half-plane, a 
necessary and sufficient condition for a net- 
work to be passive is that for arbitrary real 
numbers £,, 

X. Ee, is) or X Ox ed Y. (s) 
B,,B;es B,,B;eS 
is a positive real function of s, i.e., a function 
whose value is real when s is real and lies on 
the right half-plane when s lies on the right 
half-plane. 

(2) Every passive one-port network can 
be synthesized by using a finite number of 
three kinds of branches, Le. positive resistors 
(E, = R,i,, R, 7 0), positive capacitors (i, = 
C, dE, /dt, C, > 0), and positive inductors 
(E, — L,. di, /dt, L, 7 0). 

(3) Every passive M-port network (M > 2) 
can be synthesized by using, in addition to the 
three kinds of branches mentioned in (2), ideal 
transformers (an ideal transformer is a pair of 
branches (B,, B;) such that i, =ni}, E, —nE,, 
and n — real number) and deal gyrators (an 
ideal gyrator is a pair of branches (B,, B;) such 
that i, = E}, i; = — E,)y; ideal gyrators are not 
needed, however, to synthesize a reciprocal 
network. 

However, very little is known about the 
synthesis of passive M-port networks without 
using ideal transformers and gyrators. Topo- 
logical methods are expected to be powerful 
for such synthesis problems. 

The linear graph structure of a network 
loses its significance if ideal transformers are 
admitted. 
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283 (XX1.37) 
Newton, Isaac 


Sir Isaac Newton (December 25, 1642- March 
20, 1727), the English mathematician and 
physicist, was born into a family of farmers in 
Woolsthorpe, Lincolnshire. In 1661, he entered 
Cambridge University, where he was greatly 
influenced by the professor who was teaching 
geometry, I. Barrow, and where he began 
research in Kepler's optics and Descartes's 
geometry. 

In 1665, he discovered the tbinomial theo- 
rem, and in the same year, during a stay at 
his birthplace to escape the plague, he began 
work on his three great discoveries— the spec- 
tral decomposition of light, the universal law 
of gravity, and differential and integral cal- 
culus. He returned to Cambridge University in 
1667, and in the following year invented the 
reflecting telescope and proposed his theory 
of light particles. During this period, he suc- 
ceeded Barrow as professor and lectured on 
Optics. At the same time, he probed deeper 
into the calculus. Guided by Barrow's insight 
that differentiation and integration were in- 
verse operations and also by his own research 
on infinite series, Newton obtained the tfunda- 
mental theorem of calculus. Leibniz obtained 
the same theorem a little later, and a struggle 
resulted between the two over priority. The 
two discoveries were independent, but because 
Leibniz's notation was superior, the later 
development of calculus owes more to him. 
The dynamic elucidation of the heliocentric 
theory was accomplished in Newton's main 
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work, Principia mathematica philosophiae 
naturalis (1686-1687), in which Kepler's law 
on the movement of the planets, Galileo's 
theory of movement, and Huygens's theory 

of oscillation were unified into the three laws 
of Newtonian dynamics. These natural laws, 
which deal with all dynamic phenomena in the 
universe, are the most superlative realization 
of Descartes's concept of exploring the mathe- 
matical structure of nature; they had an es- 
sential influence on the later development of 
the natural sciences. The style of writing is 
similar to that in Euclid's Stoicheia. In the 
Principia, Newton also sets forth his philo- 
sophical position. 

In 1695, Newton moved to London and 
became engrossed in theology. He was ap- 
pointed Master of the Mint and was presi- 
dent of the Royal Society from 1703 until his 
death. While he is sometimes said to have 
divorced himself from science, many of his 
notes on geometry date from this time. 
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284 (111.12) 
Noetherian Rings 


A. General Remarks 


In this article, we mean by ring a commutative 
ring with unity element. Thus a Noetherian 
ring is a commutative ring with unity element 
that satisfies the tmaximum condition for its 
ideals; if it is also an tintegral domain, then it 
is called a Noetherian integral domain or 
Noetherian domain (for right and left Noe- 
therian rings — 368 Rings F). 

A ring is Noetherian if and only if every 
prime ideal of the ring has a *finite basis 
(Cohen's theorem). A ring is Noetherian if it is 
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generated by a finite number of elements over 
à Noetherian ring (Hilbert's basis theorem). 
The following three conditions for a ring R 
are equivalent: (i) R is an Artinian ring, that 
is, it satisfies the *minimum condition for its 
ideals (for right and left Artinian rings — 368 
Rings F). (ii) R is a Noetherian ring and every 
prime ideal of R is a maximal ideal. (iii) There 
exist a finite number of Noetherian rings 
R,(i 5 1, ...,n) whose fmaximal ideals are 
tnilpotent such that R is the direct sum of 

the R,(i — L, ..., n). We say that the restricted 
minimum condition holds in a ring R if R/a is 
an Artinian ring for every nonzero ideal a of 
R; the latter condition is satisfied if and only if 
R is either an Artinian ring or a Noetherian 
domain of tKrull dimension 1. Every ideal of 
a Noetherian ring R can be expressed as the 
intersection of a finite number of tprimary 
ideals. Given a ring R and an R-module M,a 
submodule P of M is said to be a primary 
submodule of M if every element a of R that is 
a zero divisor with respect to M/P (i.e., there 
exists an me M/P such that mz 0 and am — OI 
is nilpotent with respect to M/P ie, there 
exists a natural number n such that a"(M/P) 
=). 

The property of ideals in Noetherian rings 
stated above can be generalized to the case of 
Noetherian modules: If an R-module M is a 
*Noetherian module, then every submodule of 
M can be expressed as the intersection of a 
finite number of primary submodules. 

Let R be a Noetherian ring, a an ideal of R, 
M a finite R-module, and N and N' submod- 
ules of M. Then we have (i) the Artin-Rees 
lemma: There exists a natural number r such 
that for all n2 r, NA N' =a" "(a NAN’). 
(ii) Krull’s intersection theorem: { \°., a"M— 
{me M |Jdaea such that (1 —a)m —0] (hence, 
in particular, if m is the Jacobson radical or 
R, then (55, m"M = {0}. (iii) Krull’s altitude 
theorem: If a is generated by s elements and 
p is a ‘minimal prime divisor of a, then the 
height of p € s. 


B. Topology Defined by an Ideal 


Let R be a ring, a an ideal of R, and M an R- 
module. Then the a-adic topology of M is 
defined to be the topology on M such that 
(a" M |n=1,2,...} is a *base for the neighbor- 
hood system of zero. In particular, let R be 
Noetherian, M a finite R-module, and N a 
submodule of M. Then by the Artin-Rees 
lemma, the a-adic topology of N coincides 
with the topology on N as a subspace of M 
with the a-adic topology. Returning to the 
general case, M is a tT)-space (under the a- 
adic topology) if and only if (\2, a"M = (0j, 
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or, in other words, if and only if M is a *metric 
space, where the *distance d(a, b) between 
points a, b in M is defined to be inf(2^"|a— 
bea"M}. Moreover, if N is a submodule of 

M, then M/N is a T,-space if and only if N 

is a closed subset of M, that is, if and only 

if (e (N +a"M)=N. A sequence (a,] = 
(a,,4,...,4,,..-) is called a Cauchy sequence 
(under the a-adic topology) if Vn IN Vr Vs 

(with each of them a natural number) ay,, 

— ay ,,€ a" M; for this it is sufficient that 

Vn dN Vr ày,,,, — ay,,€a" M. If this sequence 
converges to zero (i.e., Vn IN Vr ay,,ca"M), 

it is called a null sequence. The set W of all 
Cauchy sequences in M becomes an R-module 
if we define their sum and multiplication by an 
element of R by {a,}+ {b,} ={a,+,}, cay} = 
{ca,}. Then the set N of all null sequences is 

a submodule of M. An element m of M is 
identified with the sequence (m, m, ...,m, ...), 
and we regard M as a submodule of Wt. Then 
the R-module M =Wt/M is the tcompletion of 
MIT, a"M) (as a metric space under the a- 
adic topology). M is called the a-adic com- 
pletion of M. If a has a finite basis, then the 
topology of M (as the completion) coincides 
with its a-adic topology. If M =R, we define a 
multiplication in W by {a,} {b,} ={a,b,}. In 
this case, Vt is a ring in which 98 is an ideal, 
and hence the completion R= M is a ring. If 

a= Xi-,aR, then considering the tring of 
formal power series R = R[[x;, ..., x,]] and its 
ideal ñ= (7. (E, (a; — x) R + a"R), we have 
Rz Rn. 


C. Zariski Rings 


For a Noetherian ring R and an ideal a of R, 
every element b of R such that 1 — bea has an 
inverse in R if and only if every ideal of R is a 
closed subset of R under the a-adic topology. 
When this condition is satisfied, the ring R 
with the a-adic topology is called a Zariski 
ring; we often express this by saying that (R, a) 
is a Zariski ring. A Zariski ring is called com- 
plete if it is a complete topological space. The 
completion R of the Zariski ring (R, a) has the 
aR-adic topology, and (R, aR) is a Zariski 
ring. Furthermore, (i) R is a tfaithfully flat R- 
module; and (ii) when N is a submodule of a 
finite R-module M, then N is a closed sub- 
space of M (under their a-adic topologies), and 
their completions are identified with N ® kR, 
M QRR. 


D. Local Rings 


Suppose that R is a Noetherian ring having 
only a finite number of maximal ideals and J is 
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the Jacobson radical of R. Then the Zariski 
ring (R, J) is called a semilocal ring. Further- 
more, if R has only one maximal ideal, then 
(R, J) is called a local ring. A ring that has only 
a finite number of maximal ideals is called a 
quasisemilocal ring; if it has only one maximal 
ideal, it is called a quasilocal ring. (In some 
literature, the terms local ring and semilocal 
ring are used under weaker conditions; in the 
weakest case, quasisemilocal rings and quasi- 
local rings are simply called semilocal rings 
and local rings, respectively, and local rings 
and semilocal rings in our sense are called 
Noetherian local rings and Noetherian semi- 
local rings, respectively.) 

Assume that R is a semilocal ring with max- 
imal ideals m}, ..., m, and the Jacobson rad- 
ical J — m, 1... Tim, For every finite R- 
module M, we introduce the J-adic topology 
as its natural topology. The completion R of R 
is a semilocal ring with maximal ideals m, R, 
...,m,R and is naturally isomorphic to the 
direct sum of the completions of the local rings 
Rm; (i= 1, ..., n). Since R is a Zariski ring, (1) R 
is faithfully flat; (2) submodules of M are 
closed subsets of M; and (3) the completion of 
M is identified with M & gR. If (R, m).is a 
complete local ring (e. a local ring and a 
complete Zariski ring at the same time), then 
R contains a subring J with the following 
properties: (1) / is a complete local ring, and 
Jm fI) 2 R/m; and (ii) for the tcharacteristic p 
of R/m (p is either zero or a prime number), 
mí1/ — pI. Therefore, if m is generated by n 
elements, then R is a homomorphic image of 
the ring of formal power series in n variables 
over I. This theorem is called the structure 
theorem of complete local rings, and 7 is called 
a coefficient ring of R. If R contains a field, 
then 7 is a field, called a coefficient field of R. 
A complete local ring is a *Hensel ring. 

When (R, m) is a local ring and 27;-, x;R 
is m-primary, then we have an inequality 
r Z (*Krull dimension of R); if the equality 
holds, then we say that x,, ..., x, form a sys- 
tem of parameters of R. Furthermore, if m= 
Di-1x;R, then we say that x,,..., x, forma 
regular system of parameters of R. A local ring 
that has a regular system of parameters is 
called a regular local ring (cf. *Jacobian crite- 
rion). A regular local ring is a tunique factori- 
zation domain. Let d be the Krull dimension 
of a local ring (R, m). Then R is a regular local 
ring if and only if one of the following holds: 
(1) every R-module has finite thomological 
dimension, (2) every R-module has homolog- 
ical dimension of at most d, or (3) the homo- 
logical dimension of R/m (as an R-module) is 
finite (and actually coincides with d). A Noe- 
therian ring R' is called a regular ring if R 
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a regular local ring for every prime ideal p’. A 
regular local ring is a regular ring. 

Consider a local ring (R, m) and an m- 
primary ideal q. Then the length /(n) of R/q" 
(as an R-module) is a function of n. For a 
sufficiently large n, the length /(n) can be ex- 
pressed as a polynomial f(n) in n with rational 
coefficients. The degree of f(n) coincides with 
the Krull dimension d of R. The multiplicity of 
q is (coefficient of n^ in f(n)) x (d!). If x,, ..., x, 
form a system of parameters, then (multi- 
plicity of X7 , x; R) « length of R/È x; Ry; if 
the equality holds, then we call x,,...,x,a 
distinct system of parameters. A local ring 
that has a distinct system of parameters is 
called a Macaulay local ring. À local ring is a 
Macaulay local ring if and only if one of the 
following holds: (1) every system of parameters 
is a distinct system of parameters, or (2) if an 
ideal a of height s is generated by s elements, 
then every fprime divisor of a is of height s. A 
regular local ring is a Macaulay local ring. 

The notion of multiplicity can be also de- 
fined in general Noetherian rings [4]. Let R be 
a Noetherian ring. If R,, is a Macaulay local 
ring for every maximal ideal m, then R is 
called a locally Macaulay ring. Furthermore, if 
height m= Krull dim R for every maximal 
ideal m, then R is called a Macaulay ring. If R 
is a locally Macaulay ring, then the poly- 
nomial ring tn a finite number of variables 
over R is also a locally Macaulay ring. In 
general, an ideal a is called an unmixed ideal 
(or pure ideal) if the height of every prime 
divisor of a coincides with height a; otherwise, 
a is called a mixed ideal. Thus if R is a locally 
Macaulay ring, an ideal a of the polynomial 
ring R[X,,...,x,] over R is generated by r 
elements, and height a =r, then a is unmixed 
(unmixedness theorem). 

If the completion of a local ring R is a tnor- 
mal ring, then we say that R is analytically 
normal. If the completion of a semilocal ring R 
has no nilpotent element except zero, then we 
say that R is analytically unramified. A semi- 
local integral domain R which is a ring of 
quotients of a finitely generated ring over a 
field is analytically unramified. If R is a normal 
local ring, then R is analytically normal (O. 
Zariski). 

Let (R, m) be a local ring, and let q be an m- 
primary ideal. Set F,=q'/qit! (i=0,1,2,..., 

q? = R). Let a à' (mod qi*!)eF, and b— P! 
(modq/*!)e F. We put ab — a'b' (modq'"7*!)e 
F;,;. Then the direct sum of modules F = 

Lic F; becomes a graded ring generated by 

F, over Fy, in which F; is the module of homo- 
geneous elements of degree i. F, called the 
form ring (or associated graded ring) of R with 
respect to q, plays an important role in the 
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theory of local rings, particularly in the theory 
of multiplicity. 


E. Chains of Prime Ideals 


Let R be a Noetherian ring with prime ideals 
p. q such that p c q. Consider the length n of a 
chain of prime ideals p 2 po Ep, E...£p,— 

q which cannot be refined any more. It is 

not true in general that n is uniquely deter- 
mined by p and q (M. Nagata). However, n is 
uniquely determined for a rather large class of 
Noetherian rings, for instance the rings that 
are homomorphic images of locally Macaulay 
rings and, in particular, finitely generated rings 
over a *Dedekind domain. 


F. Integral Closures 


Let R be a Noetherian integral domain with 
the field of quotients k, let K be a finite al- 
gebraic extension of k, and let R be the tin- 
tegral closure of R in K. Then (i) If R is of 
Krull dimension 1, then for an arbitrary ring 
R’ such that Rc R' c K and for every nonzero 
ideal a’ of R’, the quotient R’/a’ is a finite 
R/(a' N R)-module. In particular, R’ is a Noe- 
therian domain satisfying the restricted mini- 
mum condition. (ii) If R is of Krull dimension 
2, then R is Noetherian. (iii) In general, R is a 
‘Krull ring, and for an arbitrary prime ideal p 
of R there are only a finite number of prime 
ideals p of R such that p 2 BY R. For any 

of such $, the field of quotients of R/p is a 
finite algebraic extension of that of R/p. Result 
(i) is called the Krull-Akizuki theorem. We say 
that R satisfies the finiteness condition for 
integral extensions if R is a finite R-module for 
any choice of K. A Noetherian ring R is called 
a pseudogeometric ring, or a universally Japa- 
nese ring, if R/p satisfies the finiteness con- 
dition for integral extensions for every prime 
ideal p. A ring is pseudogeometric if it is gen- 
erated by a finite number of elements over a 
pseudogeometric ring. 


G. History 


J. W. R. Dedekind first introduced the concept 
of ideals in the theory of integers. The main 
objects studied in ring theory were subrings of 
number fields or function fields until M. Sono 
(Mem. Coll. Sci. Univ. Kyoto, 2 (1917), 3 (1918— 
1919)) originated an abstract study of Dede- 
kind domains, which was followed by E. 
Noether (Math. Ann., 83 (1921), 96 (1926)), 
who originated the theory of Noetherian rings. 
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W. Krull made further important contri- 
butions to the development of the theory of 
Noetherian rings and general commutative 
rings [1]. Many other authors, including E. 
Artin, Y. Akizuki, and S. Mori, also contri- 
buted to the theory. The theory of local rings 
was originated by Krull (J. Reine Angew. 
Math., 179 (1938)) and developed by C. Che- 
valley (Ann. Math., 44 (1943)), I. S. Cohen 
(Trans. Amer. Math. Soc., 59 (1946)), and Zari- 
ski (Ann. Inst. Fourier, 2 (1950)), and later by 
many authors, including P. Samuel, Nagata, 
M. Auslander, D. A. Buchsbaum, and J.-P. 
Serre [4]. The theory of Noetherian rings is 
applied to algebraic geometry. 
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A. History 


The validity of the fifth postulate of Euclid's 
Elements, the *axiom of parallels, has been a 
subject of argument ever since it was for- 
mulated (— 139 Euclidean Geometry). At the 
beginning of the 18th century, G. Saccheri 
tried to prove the postulate by assuming the 
validity of other axioms. Under the hypothesis 
that the axiom does not hold, he deduced 
various extraordinary results. Although he 














1070 


was mistaken in thinking that he had ob- 
tained a contradiction, his work is regarded 
as a forerunner to the study of non-Euclidean 
geometry. 

At the beginning of the 19th century, N. I. 
Lobachevskii and J. Bolyai opened up the 
impasse by establishing a geometry based on 
postulates that contradict the fifth postulate. 
This geometry is called hyperbolic geometry or 
Lobachevskii’s non-Euclidean geometry. Actu- 
ally, a similar idea had been conceived by C. F. 
Gauss, but he refrained from publishing it 
because of likely misunderstanding by a public 
still strongly influenced by I. Kant’s philoso- 
phy. On the other hand, B. Riemann con- 
structed so-called elliptic geometry (or Rie- 
mann's non-Euclidean geometry), which is 
different from both Euclidean and hyperbolic 
geometry. Euclidean geometry (including the 
theory of similarity) is sometimes called para- 
bolic geometry. In general, a space satisfying 
axioms that contradict Euclid's postulates is 
called a non-Euclidean space. 

Around the turn of the 20th century, A. 
Cayley, F. Klein, and H. Poincaré constructed 
models of non-Euclidean spaces that are sub- 
sets of Euclidean spaces, and E. Beltrami con- 
structed a differential geometric model. By 
means of these models, it was established that 
non-Euclidean geometries are consistent as 
long as there is no inconsistency in the under- 
lying Euclidean geometries. On the other 
hand, D. Hilbert established a complete system 
of axioms for Euclidean geometry and showed, 
by constructing non-Euclidean models, that 
the axiom of parallels is independent of the 
other axioms (— 155 Foundations of Geom- 
etry). The logical foundation of non-Euclidean 
geometries was thus clarified. Moreover, A. 
Einstein showed in his ‘theory of relativity 
that actual space-time does not satisfy Eu- 
clidean axioms. Together with Euclidean 
spaces, non-Euclidean spaces are often used 
as fundamental models both in the problem of 
tspace forms and in the theory of tsymmetric 
spaces. 


B. Axiomatic Considerations 


Hilbert’s system of axioms of plane Euclidean 
geometry consists of axioms of incidence, 
order, congruence, parallels, and continuity (— 
155 Foundations of Geometry). Specifically, 
the axiom of parallels is stated as follows: 
Suppose we are given a straight line and a 
point in a plane. If the straight line does not 
contain the point, then there exists only one 
line through the point that does not intersect 
the line. 
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The system of axioms of hyperbolic geome- 
try is obtained by replacing the axiom of par- 
allels by the following: Suppose we are given a 
straight line and a point in a plane. If the line 
does not contain the point, then there exist at 
least two lines that pass through the point 
without intersecting the line. (The other four 
groups of axioms are unaltered.) In this case, if 
lis a given line that does not contain a given 
point C, then there exist exactly two lines 
parallel to / that pass through C. We denote 
them by X Yand X'Y', and they are character- 
ized as follows (Fig. 1): Any line that passes 
through C and lies in / X'CY necessarily 
intersects /; by contrast, neither the two lines 
X Y, X'Y' nor any line in L XCX' intersects l. 
Euclidean geometry can be considered as a 
"limit" of this geometry where the lines X Y 
and X'Y' coincide. 


Xx’ Y 
l 
Fig. 1 


In elliptic geometry, the axiom of parallels is 
replaced by the following: Suppose we are 
given a straight line and a point in a plane. If 
the line does not contain the point, then any 
line passing through the point intersects the 
line. In this case, the lines are closed curves, 
and the axioms of order must be modified. 
Specifically, in Euclidean geometry, the axioms 
of order are based on the notion of a point A 
lying between points B and C, where A, B, C 
are distinct points on a line. In elliptic geome- 
try, however, to define order we utilize the 
notion of a pair A, C of points separating 
another pair B, D (and vice versa), where A, B, 
C, D are distinct points on a line. The axioms 
of order are modified accordingly. 

The sum of inner angles of a triangle is 
smaller or greater than two right angles ac- 
cording as we use the axioms of hyperbolic or 
elliptic geometry. 


C. The Projective-Geometric Point of View 


We take tprojective coordinates in an n- 
dimensional, real tprojective space P" and 
consider a quadric hypersurface defined by 
Q:axp+xi+...4+x2=0, a0 (— 90 Coordi- 
nates; 343 Projective Geometry). We denote by 
G the group consisting of the totality of pro- 
jective transformations of P" that leave Q 
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invariant. We call Q the absolute and call G 
the group of congruent transformations. When 
a<0, then this Q is a real quadric hypersur- 
face. In this case, there exists a domain H" (the 
totality of points inside Q) whose boundary 
coincides with Q, and the group G acts ttransi- 
tively on H". The pair (G, H") provides a 
model of hyperbolic geometry, and the n- 
dimensional hyperbolic space H" is homeo- 
morphic to an n-dimensional open cell. Points 
of H", points on Q, and points outside Q are 
called ordinary points, points at infinity, and 
ultrainfinite points (or ideal points), respec- 
tively. Two lines on H" arc said to be parallel if 
they intersect on the absolute Q. Next, when 
a0, the absolute Q is an imaginary quadric 
hypersurface, and the group G acts transitively 
on P". The pair (G, P") provides a model of 
elliptic geometry. The n-dimensional elliptic 
space P" is homeomorphic to n-dimensional 
real projective space; hence it is compact. In 
elliptic geometry, any two distinct lines in a 
plane necessarily intersect at a point. The 
above models {G, P"! of non-Euclidean geo- 
metries are called Klein's models. 

Let (G, H"| be a Klein's model, A, B distinct 
points in P", | the line containing A, B, and I, J 
be two points where the line / meets Q (Fig. 

2). If we denote by (A, B, I, J) the tanhar- 
monic ratio of these four points, then the non- 
Euclidean distance p between the points A and 
B is given by p— alog(A, B, I, J), a constant. 
Next, let J, g be lines in H” intersecting at a 
point D. In the plane determined by / and g, 
we draw two imaginary tangents u, v to the 
absolute through D. Denoting by (l, g, u, v) the 
anharmonic ratio of these four lines, the non- 
Euclidean angle 0 between the lines / and g is 
given by 0 —(1/2i)log(l g, u, v), i= BE? Gen- 
erally, let O be a point in the projective space 
P", and denote by p the tpolar of O with re- 
spect to the absolute. By counting the polar 

p doubly, it can be regarded as a quadric 
hypersurface, which will be denoted by Sy. A 
quadric hypersurface S of H" is called a non- 
Euclidean hypersphere if it belongs to the *pen- 
cil of quadric hypersurfaces determined by Q 
and S,. S is called a proper hypersphere, a 
limiting hypersphere, or an equidistant hyper- 
surface according as the center O is an ordi- 
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nary point, a point at infinity, or an ultrain- 
finite point, respectively. In the case of ellip- 
tic geometry 1G, P"}, the distance p between 
two points A and B is given by p —(a/i): 
log(A, B, I, J) as I, J are imaginary points. 
Moreover, we may get parabolic geometry as 
the "limit" (a> oo) of the geometry given by 
Klein's models. 


D. The Conformal-Geometric Point of View 


Let S" be an n-dimensional tconformal space. 
If we take suitable (n + 2)-hyperspherical co- 
ordinates in S", the space Si can be realized as 
a quadric hypersurface x7 4- x2 ...--x2— 
2xgx,, =0 in (n+ 1)-dimensional real projec- 
tive space P"*!. A point in P"*! represents a 
hypersphere in $" (— 76 Conformal Geometry; 
90 Coordinates). We denote by G the group 
consisting of the totality of *Móbius trans- 
formations leaving invariant a (real, point, or 
imaginary) hypersphere Q. When Q is a real 
hypersphere, the space $" is divided into Q and 
the two open cells H", Hy. If we denote by G 
the totality of transformations of the group G 
that do not interchange H" and H;, then G isa 
subgroup of index 2 of G. In this case, each of 
the pairs (G, H"} and {G, Hr) provides a model 
of hyperbolic geometry. When Q is a point 
hypersphere, the space E" obtained from a" by 
omitting the point Q is homeomorphic to an 
open cell, and the pair {G, E") provides a 
model of parabolic geometry. On the other 


hand, when Q is an imaginary hypersphere, G ` 


is isomorphic to the forthogonal group O(n+ 
1). We call the pair {G, S") a spherical geom- 
etry and S" an n-dimensional spherical space. 
In this case two points x and x’ are called 
equivalent if x’ is the image of x by symmetry 
with respect to Q. The space P" obtained 
from S" by identifying equivalent points is 
homeomorphic to the n-dimensional real 
projective space. If we denote by G the group 
obtained from G by making its actions effec- 
tive on P", then G is the factor group of G by a 
cyclic group Z, = Z/2Z. The pair (G, Pri pro- 
vides a model of elliptic geometry. These 
models are called Poincaré's models. They 
were introduced as a result of research on 
tautomorphic functions in the case n — 2. 

In Poincaré's model, every straight line is 
represented either by a circle orthogonal to Q 
or by a circle passing through Q according as 
Q is a (real or imaginary) hypersphere or a 
point hypersphere. In spherical geometry, 
however, straight lines are usually called great 
circles, and two distinct great circles lying on a 
2-dimensional sphere necessarily intersect at 
two points that are symmetric with respect to 
Q. Also, in Poincaré's model the distance 
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between two points A and B is defined as 
before, by making use of the anharmonic ratio 
of four points A, B, I, J on a circle (Fig. 3) (— 
74 Complex Numbers G). 





E. The Differential-Geometric Point of View 


An n-dimensional space M of tconstant curva- 
ture is by definition a Riemannian manifold 
whose line element ds is given by 


dx; dx... dx? 


ds? = 
K 2 2 2 
BET (x1-- x59... X4) 





with respect to appropriate local coordinates, 
where K is a constant called the tsectional 
curvature (— 364 Riemannian Manifolds). 
According as K is positive, zero, or negative, 
M can be considered locally as an elliptic 
space, Euclidean space, or hyperbolic space, 
respectively. In this case, lines are tgeodesics 
of M, and the non-Euclidean distance and 
non-Euclidean angle are those defined in the 
Riemannian manifold. When n — 2, a tsim- 

ply connected and *complete space of posi- 
tive constant curvature is tembedded in 3- 
dimensional Euclidean space as a sphere, and a 
space of negative constant curvature is locally 
isometric to a pseudosphere (Fig. 4), which is a 
surface of revolution obtained by rotating a 
ttractrix around its asymptote. tComplete n- 
dimensional spaces of constant curvature 
(n> 2) are called space forms. A simply con- 
nected space form is necessarily one of spher- 
ical space, Euclidean space, or hyperbolic 
space. Each of these is a tuniversal covering 
manifold of a general connected space form 
with a curvature of the same sign, and the 
group of fcovering transformations is isomor- 


Fig. 4 
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phic to a tdiscontinuous subgroup of the 
group of congruent transformations. 

Each of the spaces, Euclidean, non- 
Euclidean, and spherical, is a thomogeneous 
space on which the corresponding group of 
congruent transformations acts transitively. 
Actually, each of these spaces has the structure 
of a tsymmetric Riemannian homogeneous 
space of rank 1. 
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286 (XII.21) 
Nonlinear Functional 
Analysis 


A. General Remarks 


At present, the theory of nonlinear problems is 
still not unified, and many individual results 
obtained for specific classes of problems are 
stated in the languages of the corresponding 
fields of study. However, there are some funda- 
mental facts and methods of a general nature 
concerning nonlinear problems, which may be 
referred to as the subject matter of nonlinear 
functional analysis. 


B. Iterative Methods 


Let X be a ‘Banach space. Consider a non- 
linear mapping G: X — X and the equation 
Gx-0 (xex). (1) 
Set F =] — G. Then (1) can be written as 

x= Fx; (2) 


F satisfies the Lipschitz condition if there exists 
a constant « such that 


| Fx — Feil &aix— yl (3) 
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for all x, yin X. In particular, the mapping F 
is said to be nonexpansive if 0<a<1.If0<a< 
1, then F is called a contraction. (Sometimes 

a nonexpansive mapping is also called a con- 
traction.) A contraction F satisfies the contrac- 
tion principle: F has a unique fixed point xo, 
and the iteration 


(n=1,2,...) 


Xn+1 = Fx, 


with an arbitrary initial element x, always 
converges to x, [1-3]. Similar results hold for 
a contraction that is defined only on a "ball 
and leaves the ball invariant. This leads to 
*Newton's iterative process and the *implicit 
function theorem. 


C. Methods of Monotonicity 


By definition, a nonlinear mapping G from a 
tHilbert space H to H is a monotone or accre- 
tive operator if 


Re(Gx—Gy,x—y)z0  (x,yeH) 


G. Minty [4] proved that if G: H>H is mono- 
tone and continuous, then AI + G is a mapping 
onto H for any 470, and its inverse (41 + G) ! 
is nonexpansive. He has also shown that in the 
hypothesis of the theorem, we can replace the 
continuity requirement for G by maximality of 
G within the class of accretive operators that 
are possibly multivalued. Various develop- 
ments of Minty's ideas, including generaliza- 
tion of his results to Banach spaces and appli- 
cations to partial differential equations, have 
been obtained by F. Browder (Amer. Math. 
Soc. Proc. Symposia in Appl. Math., 17 (1965)), 
J. Leray and J. L. Lions (Bull. Soc. Math. 
France, 93 (1965)), J. L. Lions [5], W. Strauss, 
H. Brézis (Amer. Math. Soc. Proc. Symposia in 
Pure Math., 18 (1970)), and others. 

A mapping A is said to be dissipative if — A 
is accretive. Dissipative mappings play a cen- 
tral role in the theory of nonexpansive semi- 
groups (— Section X). 


D. Topological Methods 


In the geometric study of ordinary differen- 
tial equations [6] some familiar theorems of 
topology and tdifferential topology have been 
strong tools, e.g., tBrouwer’s fixed-point theo- 
rem is utilized to establish the existence of 
periodic solutions. However, in order to deal 
with nonlinear partial differential equations we 
have to generalize these theorems to infinite- 
dimensional cases. For example, a fixed-point 
theorem in an infinite-dimensional space was 
first obtained by G. D. Birkhoff and O. D. 
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Kellogg (Trans. Amer. Math. Soc., 23 (1922), 
95-115). The theory of the degree of mappings 
was generalized to the case of Banach spaces 
by J. Leray and J. Schauder [33] for the class 
of mappings of the form J — F, where F isa 
compact continuous mapping, i.e., the image 
by F of any bounded set is relatively compact. 
Let D be an open bounded set in a Banach 
space X and OD be its boundary. Let F:D5 X 
be a continuous compact mapping and ® 
denote I — F. If a point p in X does not be- 
long to (0D), then we can define the Leray- 
Schauder degree d(®, p, D) of relative to p 
[1-3]. The Leray-Schauder degree dii, p, D) 
is an integer with the following properties: (i) 
d(I, p, D) - 1 if peD. If pé (D), then d(Q, p, D) 
=Q. (ii) (Homotopy invariance) di. p, D) de- 
pends only on the compact homotopy class of 
Q:6D— X ~{p}. More precisely, let K :[0, 1] x 
OD X be a continuous compact mapping 
such that x+ K(t, x) Z p for any te[0,1] and 
any zech Let ®)(x)=x+ K(0, x) and ®, (x)= 
x+ K(1, x). Then d(6,, p, D) 2 d(9,, p, D). (iii) 
If p and p' are in the same component of 

X  Ó(0D), then d(®, p, D) = d(®, p’, D). (iv) (Con- 
tinuity) d(®, p, D) is a continuous locally con- 
stant function of ® (with respect to uniform 
convergence) and of pe X 40D). (v) (Domain 
decomposition) If D is the union of finite num- 
ber of open disjoint sets D; (j — 1,2, ..., N) 

with 0D; c 0D and ®(x)#p on | }®, Op. then 
d(®, p, D) E}, d(®, p, DÄ (vi) (Excision) If A 
is a closed subset of D on which (x) z p, then 
d(®, p, D) = d(®, p, D — A). (vii) (Cartesian prod- 
uct formula) If X = X, X, with D;c X, B= 
(®,,®,) with ®,;:D,> X, (i21,2, D=D, x Dy, 
and p —(p;, p;), then d(®, p, D) - d(,, p,, D,) x 
d(®,, p2, D,), provided that the right-hand 
side is well defined. 

The degree of mapping of ® is also defined 
for some proper Fredholm mapping ® (— 
Section E; K. D. Elworthy and A. J. Tromba 
[38]; [32]. 

Brouwer's fixed-point theorem (— 153 
Fixed-Point Theorems) is generalized to 
Schauder's fixed-point theorem: A compact 
mapping F of a closed bounded convex set K 
in a Banach space X into itself has a fixed 
point. Using the Leray-Schauder degree 
theory, one has the Leray-Schauder fixed-point 
theorem: Let D be a bounded open set of a 
Banach space X containing the origin O. Let 
F(x,t): Dx[0, 1] X be a compact mapping 
such that F(x, 0) 20. Suppose that F(x, t) Zx 
for any xeóD and te[0,1]. Then the compact 


mapping F(x, 1) has a fixed point x in D [1-3]. 


Other homotopy invariants, such as *homo- 
topy groups and *cohomotopy groups, are 
also used in nonlinear functional analysis 
[1,3]. For instance we have the following 
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.theorem of Shvarts (1964) [3]: Let X and Y be 


two Banach spaces. Let D= (xeX | xl <1} be 
the unit ball and ôD — (xe X | [x| 21] be the 
unit sphere of X. Suppose L is a fixed continu- 
ous linear !'Fredholm operator from X to Y 
of index p 20. Let PL be the set of compact 
perturbation of L mapping 0D into Y 0, i.e., 
PL={®=L+K|K isa continuous compact 
mapping of 6D to Y such that ®(x)=Lx+ 
K(x) #0 for xeeDj. Two mappings ®, and 

d, in PL are said to belong to the same com- 
pact homotopy class on 6D if there exists a 
continuous compact mapping h:[0, 1] x 0D» 
Y such that Lx + A(t, x) ZO for x in 0D, Bo(x) = 
Lx + h(0, x), and ®,(x)= Lx 4- h(l x). 

Shvarts's theorem: Let L be a fixed continu- 
ous linear Fredholm mapping from X to Y 
of index p Z0. Then the compact homotopy 
classes on OD of PL are in one-to-one corre- 
spondence with the elements of the pth stable 
homotopy group z,,,($") (nz p4- 1) (— 202 
Homotopy Theory H). 

Warning: The topological structure of an 
infinite-dimensional Hilbert or Banach space is 
quite different from that of a finite-dimensional 
Euclidean space. For instance, let X be a Hil- 
bert space of infinite dimension, and let D — 
(xe X | ||x|| <1} be its unit ball and 0D = 
(xe X ||x|| 1] be the unit sphere of X. S. 
Kakutani (Proc. Imp. Acad. Tokyo, 19 (1943)) 
gave a fixed-point free continuous mapping 
of D into itself if X is separable [1-3]. Thus 
naive generalization of the Brouwer fixed- 
point theorem is no longer true in infinite- 
dimensional spaces. V. Klee and C. Bessaga 
[17] proved that the unit sphere ôD is C*- 
diffeomorphic to X for an arbitrary Hilbert 
space X of infinite dimension. N. H. Kuiper 
proved that the group of invertible continuous 
linear operators on X is contractible if X is 
separable [3]. All this is in striking contrast 
to the well-known facts for finite-dimensional 
spaces (— 202 Homotopy Theory, 427 Topol- 
ogy of Lie Groups and Homogeneous Spaces). 
This is the reason why compactness assump- 
tions are made in the theorems mentioned 
above. 


E. Calculus in Banach (or Locally Convex) 
Spaces 


When one considers a nonlinear operator, 

it often happens that the domain and the 
range are neither linear spaces nor their open 
subsets. The domain might be a space of all 
smooth mappings of a compact manifold into 
another, and so might be the range. Such 
spaces have no linear structure, and hence 
linearity or semilinearity do not make sense in 
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general. The concept of infinite-dimensional 
manifolds is therefore introduced of necessity 
in nonlinear functional analysis. 

Definition of differentiable mappings. Let E 
and F be real Banach spaces and let L(E, F) 
(= L(E) if E= F) be the Banach space of all 
bounded linear operators with uniform oper- 
ator norm. Let U be an open subset of E and 
x a point of U. A mapping ( — nonlinear oper- 
ator) f of U into F is called Gáteaux differ- 
entiable at x if lim, „ot ! (f(x + ty) - f(x))— 
df(x, y) exists for any ye E. df(x, y) is called the 
Gâteaux derivative of f at x. f is called Fréchet 
differentiable at x if there exists a linear oper- 
ator Ae L(E, F) such that lim,_,9 || f(x + y) 
f(x) — Ayl|/|| yl =0. A is called the Fréchet 
derivative of f at x and is denoted by df(x) or 
f (x). f is Fréchet differentiable in U if and 
only if it is Gateaux differentiable, df(x, y) 
is linear in y, and sup, o ||df(x, lie is 
bounded [10]. 

Let U be an open subset of E. À mapping 
f of U into F is said to be of class C? if it 
is continuous and to be of class C' if it is 
Fréchet-differentiable at each point xe U and 
the differential df(x)e L(E, F) is continuous as 
a mapping of U into L(E, F). The differential 
df (x) is also called the linearized operator. If 
the mapping df: U > L(E, F) is of class Cl, 
then f is said to be of class C". d(d" ! f)(x) is 
written as d" f(x), and called the rth differential 
at x. d'f(x) is an r-linear, bounded, symmetric 
operator of Ex... x E (r times) into F. f is said 
to be of class C* if f is of class C” for every r. 
For an open subset V of F, f: UV is called a 
C" diffeomorphism if f is a bijection and both f 
and f ^! are of class C”. 

A C” mapping f of U into F is called a Fred- 
holm mapping (Fredholm map) if df(x)e L(E, F) 
is a linear *Fredholm operator for every xeU. 
Since Ind df(x) is constant if U is connected, 
that integer Ind df(x) is called the index of f. 


F. Taylor's Theorem and Its Converse 


Let f: U—F be of class C’ (rz 1). A general- 
ized Taylor theorem claims that f can be ap- 
proximated by a polynomial mapping: Let 

x€U and ye E be sufficiently close to 0 so that 
x+tyeU for 0<t<1. Then 


=t 1 
GDE 
x (d fx ty) d f), 5 y) dt. 


Let Lat, F) be the Banach space of all k- 
linear, bounded, symmetric operators of E x 
..X E into F with the uniform topology. If for 
every k, Ox kr, there exists a continuous 


fx y)- Eaton] € mmi: o 
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mapping p: um F) such that 


STEE BAO 9) FOUN 


at every ve and y sufficiently close to 0, then 
f: U SF is of class C", and d* f(x) =@, (x). 


G. The Implicit Function Theorem 


Using the notation above, let f: U —V be of 
class C”, rz 1, and assume that Oe U, Oe V, and 
f(0) 2 0, where 0 is the origin of E or F. Sup- 
pose there is an AeL(F, E), called the right 
inverse of df (0), such that df(0) A = 1, (the iden- 
tity). Then the following assertions hold: (1) 
The image of F under A, AF, is a closed sub- 
space of E, and E = Kerdf(0) ® AF. (it) There 
are neighborhoods U,, U,, V' of the zeros 

of Kerdf(0), AF, F, respectively, such that 

U, Q U, C U and such that the mapping g: 
U, Q U, 5 U, Q V' defined by g(u, v) =(u, f(u, CH 
is a C'-diffeomorphism. Therefore, denoting the 
inverse of g by h=(h,,h,), we have h, (u, w)=u 
and f(u, hlu, w)) & w. The latter means that the 
nonlinear equation f(u, v) 2 w can be solved 
with respect to v. 


H. Existence and Uniqueness of Integral 
Curves 


Using the notation above, let f be a C” map- 
ping (r z 1) of U into E. Since U x E is the *tan- 
gent bundle of U, (x, f(x)) can be regarded as a 
C tangent vector field on U. The equation of 
"integral curves is (d/dt)x(t) = f(x(t)). A local 
existence and uniqueness of solutions is stated 
as follows: For an arbitrarily fixed x eU, there 
are £z» 0 and an open neighborhood W of x 
such that there exists uniquely a C’ mapping h 
of W x (—€,£) into U satisfying (d/dt) h(w, t) = 
f(h(w, t)) and h(w, 0) 

Using this fact, one can prove the Frobenius 
theorem: Let E' be a closed linear subspace of 
E with a direct summand E", and let f: U ^ 
L(E', E") be a C” mapping (r > 1) such that 
f(0) 2 0. To each xe U one associates a closed 
linear subspace D, = {(v, f(x)v)| ve E’}. The 
disjoint union D —( } yey D, can be regarded as 
a subbundle of the tangent bundle U x E. A 
mapping ŭ of U into E is called a cross sec- 
tion of D if à(x)e D, for every xe U. D is called 
involutive if for any two C! cross sections #, Ò 
of D, the Lie bracket product To č] defined by 
[3,9] (x) = dá(x)(6(x)) — d?(x)(G(x)) is again a 
cross section of D. Now suppose D ts an in- 
volutive subbundle of U x E. Then for an arbi- 
trarily fixed x eU, there are a neighborhood W 
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of x and a C' diffeomorphism h of W onto a 
neighborhood of 0 of E such that dh(x)(D,) = 
E'. This fact shows that an involutive sub- 
bundle can be trivialized by a suitable change 
of local coordinate systems. 


I. Local Theories on Locally Convex Spaces 


All local theories mentioned in Sections E-H 
are constructed on Banach spaces. However, it 
is important in concrete applications to con- 
struct these theories on a wider class of locally 
convex topological linear spaces. 

Let E, F be tlocally convex topological 


linear spaces, and let U be an open subset of E. 


A mapping f: U-F is said to be of class C? if 
it is continuous. f is said to be of class C" if f 
is of class C"! and the following is fulfilled: 
For every xe U, there is an r-linear continuous 
symmetric mapping d'f(x) of Ex... x E into F 


such that d'f:U x Ex ... x EF is continuous. 


If we put 
F(y)— f(x * y) — f(x) — 4f()(y) — ... 


1 
SAPs d) 
r: 


for every y sufficiently close to Oc E, then the 
mapping G defined by 


Ru: 


teR 
0, ey MEM 


is continuous at (0, 0)e R x E. The definitions 
of C? mappings and C’ diffeomorphisms are 
given as in Section E. 


J. Implicit Function Theorems in Locally 
Convex Spaces 


The implicit function theorem does not hold 
in general locally convex spaces. However, 
since it is useful for nonlinear problems, sev- 
eral sufficient conditions are presently being 
studied. The following are some of them. We 
assume that f: U—F is a C” mapping (rz 1) 
such that f(0) 2 0 and that df(0): E F has a 
continuous right inverse. 

(1) If dim F « oo, the implicit function theo- 
rem holds just as in Section G. 

(ii) An implicit function theorem that can 
be applied to nonlinear elliptic differential 
operators can be restated as follows. Suppose 
E, F are tprojective limit of families of Banach 
spaces (E^ kz d), {F*,k>d} and US EQ U4, 
where U’ is an open subset of E’. If f: U SF 
can be extended to a C” mapping (r z 2) of 
E*N U? into F* for every k z d, if f satisfies the 
following inequalities, called a linear estimate 


1076 


with respect to | |,: 
ldf(x+ Vo) <C(lylelolat lylalel) 

rbl, Ach, k>d, 
Id? f(x + y)(u, ls CC yldulalola Leldlullrl 


* Iylalulalol) + PCy lk lok, kd, 
(4) 


where | |, is the norm in ER or F*, C is a posi- 
tive constant independent of k, and P, is a 
polynomial with positive coefficients, and if a 
right inverse A of df(0) satisfies the inequality 
of Gárding type, 


(äu Cul, Djul-,, — kd, (5) 


where C’>0 independent of k and D, >0, then 
the implicit function theorem holds just as in 
Section G, and the obtained mapping h satis- 
fies the same inequalities as (4) [11]. 

(iii) Nash-Moser implicit function theorem. 
Though linear estimates such as (4) hold for 
many differential operators, the second in- 
equality (5) is sometimes out of order, espe- 
cially if f is a nonlinear hyperbolic operator. 
However, one can often obtain instead of (5) 
a weaker inequality: 


lAu|, & C'lulr+s + Dilulk+s-13 sz 


J. Nash [37] and J. Moser [12] approximated 
such an operator A by some smoothing oper- 
ators and proved an implicit function theo- 
rem under a certain additional condition. 

The Nash-Moser implicit function theorem 
was successfully applied to many difficult 
problems, e.g., the embedding problem of 
Riemannian manifolds [37], the small divi- 
sor problem of celestial mechanics [36], free 
boundary problems (e.g., L. Hórmander Arch. 
Rational Mech. Anal., 62 (1976), 1-52), and 
other problems (e.g., S. Klainerman, Comm. 
Pure Appl. Math., 33 (1980), 43-101; M. Kura- 
nishi, Amer. Math. Soc. Proc. Symposia in Pure 
Math., 30 (1977), 97—105). 

(iv) Analytic implicit function theorem. In 
cases (ii) and (iii), the spaces E, F were given 
as projective limits of Banach spaces. On the 
contrary, H. Jacobowitz considered the case 
where E, F are tinductive limits of Banach 
spaces. For instance, the space E of the 
smooth functions can be approximated by a 
family of Banach spaces {E,|¢> 0] of all real 
analytic functions with e as the radius of con- 
vergence. Under this circumstance, certain 
conditions for f and the right inverse of df(0) 
yield an implicit function theorem [13]. 

(v) Mather's implicit function theorem [14]. 
The difficulty of implicit function theorems in 
Fréchet spaces is concentrated in the following 
fact: Even if df(0) has a right inverse and even 
if x is sufficiently close to 0, df(x) may not have 
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a right inverse. If the implicit function theorem 
holds, such a phenomenon should not happen. 
In cases (ii)- (iv), several functional-analytic 
conditions exclude this pathological phenom- 
enon. In some special cases, these conditions 
can be replaced by algebraic ones. J. Mather, 
using his division theorem, proved an im- 
plicit function theorem that was applied to the 
theory of singularities. 


K. Infinite-Dimensional Manifolds 


A Hausdorff space M is called a C" Banach 
manifold modeled on a Banach space E if the 
following conditions are satisfied: (a) M is 
covered by a family of open subsets (U,],. ,. 
For each U, there are an open subset V, of E 
and a homeomorphism y, of U, onto V,. Such 
a pair is called a local coordinate system or a 
local chart of M. (b) If U, IU, z Ø, V; ! 

is a C'-diffeomorphism of Y(U, U,) onto 

V, (U, N Up). (c) The index set A is maximal 
among those that satisfy (a) and (b). 

If M (resp. M^) is a C' Banach manifold 
modeled on E (resp. E"), then so is M x M' 
modeled on E @ E’. A mapping f: M—M' is 
said to be of class C” if f expressed through 
local coordinate systems is of class C”. 

Suppose M is covered by a family of local 
charts {(U,, Ya) taca- On the disjoint union 
eA U, x E, define an equivalence rela- 
tion ~ as follows: For (x, u)e U, x E, (y,v)e 
U; x E, (x, u) - (y, v) if and only if x= y and 
d(U jg ')(x)v — u. The set of equivalence classes 
is called the tangent bundle of M, which will 
be denoted by Ty. There is another definition 
of the tangent bundle which uses the ring of 
C" functions on M. However, since E is not 
reflexive in general, the latter gives us a differ- 
ent vector bundle. Ty is a C" ! Banach mani- 
fold modeled on E @ E. The correspondence 
which sends (x, u)e | Jea U, x E to x induces a 
C"! mapping of Ty onto M, called the pro- 
jection of 7,,, denoted by z. 

A topological group G is called a Banach- 
Lie group if G is a C” Banach manifold and 
the group operation (g, 4) 5g 'hisa C” 
mapping of G x G into G. If E is a Banach 
space, then GL(E), the group of all invertible 
bounded linear operators, is a Banach-Lie 
group under the uniform topology. If E is a 
Hilbert space, then the group of all unitary 
operators is also a Banach-Lie group under 
the same topology. 

The concepts of manifolds and Lie groups 
are similarly defined when the model space 
is a Hilbert space or a Fréchet space. These 
are called a Hilbert manifold and a Fréchet 
manifold, respectively. For some differential 


286 N 
Nonlinear Functional Analysis 


topologies on separable Hilbert manifolds — 
279 Morse Theory E. 


L. Structures on Infinite-Dimensional 
Manifolds 


Suppose M is a C'*! Hilbert manifold 
modeled on E. At each xe M, the tangent 
space T, M =n! (x) is a Hilbert space, linear- 
homeomorphic to E. M is called a C" Riemann- 
ian manifold if there is defined an inner prod- 
uct <u, v», on each 7, M such that <-, +X, is of 
class C" with respect to x. Existence of such a 
structure is ensured by using a partition of 
unity if M is paracompact. 

Let M be a C' Banach manifold (rz 1) 
modeled on a Banach space E. Each tangent 
space T, M is linear homeomorphic to E. M is 
called a C" Finsler manifold if there is a norm 
|u|,. defined on each T, M such that | |, is 
continuous with respect to x. A paracompact 
C! Banach manifold can have a C! Finsler 
structure. 

Let M be a C+? Fréchet manifold (r z 0) 
and C'*! (M), I'*! (T) the spaces of all CC" 
functions and of all C’*! vector fields on M, 
respectively. A bilinear mapping V of T" (Ty) 
x I'*! (T4) into I(T) is called an affine 
connection on M if V satisfies V að = f Mat, 

V, f — (üf i+ f Vað for every d, £e U^! (T), 
f €C'* (M). For an affine connection V, T(i, ò) 
= V,0—V,;u—[t, 0] is called the torsion tensor, 
and Ru. č) « V,V; — VV; — V; sis called the 
curvature tensor of V. If M is a Riemannian 
manifold, then there exists a unique affine 
connection without torsion which leaves the 
Riemannian inner product parallel. 


M. Local Linearization Theorems 


Let M bea C'*! Banach manifold (r 2 1) 
modeled on E. Let & be a C" vector field on M 
such that ü(x) 40 at xe M. Then there are a 
neighborhood U, of x and a C’ diffeomor- 
phism y of U, onto an open subset V of E 
such that dWa(w "Ge &(y, v), ve E, for every 
y€V, where v does not depend on y. 


N. Morse Lemma 


Let M be a C'*? Hilbert manifold and f be an 
R-valued C’*? function. Suppose x is a critical 
point of f, i.e., df(x) 20. x is called a nondegen- 
erate critical point if d? f(x) is a nondegenerate 
bilinear form. For such x there are a neighbor- 
hood U, of x and a C’ diffeomorphism y of U, 
onto an open neighborhood of 0 of the model 
space E such that w(x)=0, and f(y !(y))— 
|Py — i(t — P)y[2, where P is an orthogonal 


286 O 
Nonlinear Functional Analysis 


projection in E. i, - dim(1 — P) E (0 € i, & oo) is 
called the index of the critical point x of f. 


O. Submanifolds 


A subset N of a C' Banach manifold M mod- 
eled on E is called a C" submanifold if at each 
point xe N there are a neighborhood U, of 

x and a C’-diffeomorphism y of U, onto an 
open neighborhood V of 0 of E such that w(x) 
=0 and y(U,O) N) 3 VNF, where F is a closed 
linear subspace of E. There are some other 
definitions of submanifolds. One of them re- 
quires in addition that F be a direct summand 
of E, and another uses instead of ULN N its 
connected component containing x. In the 
latter definition, a submanifold is not neces- 
sarily locally closed. 


P. Sard-Smale Theorem 


Although it is not easy to define nontrivial 
measures on infinite-dimensional manifolds, 
the concept "almost everywhere" can some- 
times be replaced by that of residual sets. 

A subset of a topological space is called a 
tresidual set if it contains an intersection of 
countably many open dense subsets. A re- 
sidual set in a complete metric space or in 

a tBaire space is dense. S. Smale [15] ex- 
tended tSard’s theorem to inftnite-dimensional 
manifolds as follows: Let M, N be C" Ba- 
nach manifolds and f: MN be a C” Fred- 
holm mapping. If M is separable and r> 
max (0, Ind(df(x))} for each xe M, then R, = 
N — f(C) is a residual subset of N, where C 
is the set of critical points of f, i.e., a point x 
where df(x) is not surjective. 


Q. Calculus of Variations and Infinite-. 
Dimensional Manifolds 


Many problems in the calculus of variations 
can be understood as problems seeking crit- 
ical points of functions defined on infinite- 
dimensional manifolds. R. Palais and S. Smale 
set up the following Condition C and fixed a 
category of functions where the critical points 
can be chased through gradient-like vector 
fields [6]. 

Palais-Smale Condition C. Let f be a C! 
function on a C! Finsler manifold M. If S is 
any subset of M on which f is bounded but 
on which |df(x)| is not bounded away from 0, 
then there is a critical point of f adherent to S. 

In general, it is not easy to examine Con- 
dition C for a concrete f. However, many 
concrete problems, where the Euler equations 
are nonlinear elliptic, satisfy Condition C. 
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Morse theory. Let M be a C* complete 
Riemannian manifold and f a C* function 
bounded below satisfying Condition C and 
having only nondegenerate critical points. 
Then, using the Morse lemma, one can make a 
*handlebody decomposition of M by the same 
method as in the case of finite-dimensional 
manifolds (— 279 Morse Theory). 

Lyusternik-Shnirel’man theory. This theory, 
constructed on finite-dimensional manifolds, 
can be extended naturally to Finsler mani- 
folds. Let M be a complete C? Finsler mani- 
fold and f a C? function satisfying Condition 
C and bounded below. Then f has at least 
cat(M) critical points, where cat(M) =m means 
that M can be covered by m closed contrac- 
tible subsets of M but not by m— 1 ones. If 
there is no such integer, then we set cat(M)= 
oO. 

Both Morse theory and Lyusternik- 
Shnirel’man theory have been successfully 
employed in the global theory of the calculus 
of variations. 


R. Bifurcation Theory 


Bifurcation theory concerns itself with the 
structure of the zeros of the functional equa- 
tion of w with a parameter 4: 


G(A, w) — 0. (6) 


In general, the state w satisfying (6) represents 
the equilibrium (time-independent or station- 
ary) solution of the evolution equation 


w,—G(A,w) and w(0)-w,. (7) 


Here the evolution equation itself stems from a 
mathematical model describing natural phe- 
nomena, w — w(t) stands for the state at time t, 
and / is the set of parameters representing the 
physical environment. For example, in the 
*Navier-Stokes equation appearing in fluid 
dynamics, w(t) represents the unknown veloc- 
ity field at time t and / is the *Reynolds num- 
ber. It is important to study bifurcation phe- 
nomena because they typically accompany 
the transition to instability of the state when 
some characteristic parameter passes through 
a certain value, called a critical value. 

Let X, Y, and A be real Banach spaces, and 
let G{A, w) be a mapping from A x X to Y. 
Suppose that there exists a mapping w(A): A 
X satisfying G(A, W(4)) 2 0. One calls (A, w(A)) 
a trivial solution of (6). 

(Ao. W(A9)) is called a bifurcation point of 
G(A, w) (with respect to the trivial solution) if 
in any neighborhood of (Ao, w(A9)) there exists 
a nontrivial solution of (6). (In general, there 
may appear another type of solution that is 
not connected with the trivial solution [20]). 
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Assume that G(A, w) is of class C! in some 
neighborhood of (åo, w(A45)) in A x X. Then it 
follows from the timplicit function theorem 
that (45, w(A9)) is not a bifurcation point if 
G,,(A9, W(A9)), the *Fréchet derivative of G with 
respect to w, is nonsingular. 


S. The Principle of Linearized Stability 


Closely tied to the phenomenon of bifurcation 
is the property of stability. Suppose that the 
dynamics of a physical system are governed 
by (7). Let w(t; w9) denote the solution of (7). 
An equilibrium solution w= w(A) is called 
stable if for any £>0 there exists a 07 0 such 
that ||w(t; wo) — w|| <e for all t2 0 whenever 

| wo — w|| <ô. Furthermore, w is said to be 
asymptotically stable if, in addition, w(t; wy) 
Was t oo. 

By the principle of linearized stability we 
mean that the stability of an equilibrium solu- 
tion w is determined formally by the 'spec- 
trum of the linearized operator G,(A, w(4)). As- 
sume that G, (4, w(4)) has only a *point spec- 
trum. Then (i) if w(A) is stable, the spectrum of 
G,,(A, W(A)) is contained in (ze C| Rez <0}, and 
(ii) if the spectrum of G, (4, W(A)) is contained 
in {zeC|Rez<0}, w is asymptotically stable. 

Suppose that as À crosses a certain value Ap, 
one or more eigenvalues of G, (^, w(4)) cross 
the imaginary axis from the left to the right 
half-plane, where w(A) is the known equilib- 
rium solution. This is precisely the situation 
when wW(A) becomes unstable. For notational 
simplicity, we put u2w— w(4) and F(/,u)= 
G(A, ut w(A)). 


T. Bifurcation from Simple Eigenvalues 


Take A=R. Let FeC?(R x X, Y) be such that 
F(À,0) —0 for any real å. Set Ly = F,(49, 0), 
Lı =F; än, 0), and suppose that (i) Ker(Lo) 
is spanned by ug £0; (it) codim R(Ly) = 1; 
and (iii) L uo € R(Lo), where RL al denotes the 
trange of Ly. Then there exists a C'-curve 
(A, W):(— 6,6) —- R x X defined on some inter- 
val (— ô, ô) such that 4(0) 2 45, (0) « 0, and 
F(A(s), s(ug + V (s))) 30 for any se(— ô, ô). 
Moreover, in a neighborhood of (45,0) any 
zero of F either lies on this curve or is a trivial 
solution (M. G. Crandall and P. H. Rabino- 
witz, J. Functional Anal., 8 (1971)). 

In the above case, there exist only three 
possible situations of the curves (A, o) and 
(A, 0), called subcritical, supercritical, and 
transcritical bifurcations. In the third case, 
there occurs the so-called exchange of stability 
[19-21]. 
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U. Bifurcation of Periodic Solutions (Hopf 
Bifurcation Theorem) 


If u(å) loses stability by virtue of a pair of 
complex conjugate eigenvalues crossing the 
imaginary axis, then under suitable conditions 
one can prove the existence of bifurcating time- 
periodic solutions of (7). Rewrite equation (7) 
as 


u, + Lou 4 g(4, u) - O. (8) 


Suppose (i) Lo: DU alc X > X is a densely de- 
fined linear operator on X such that — Lo 
generates a strongly continuous semigroup on 
X, which is holomorphic on X (=the com- 
plexification of X). Lọ has compact resolvent; 
iis a simple eigenvalue of Ly, and niéo(L,), 
the spectrum of Lo, for n 20,2, 3,.... Asa con- 
sequence of (i), if r> — Re4 for any 4eo(L;), 
then the fractional power (rI + Lo) for «>0 is 
well defined. Because their domains are in- 
dependent of r, one can set X, = D((rI + Loy), 
which are Banach spaces under the norm 

Iul, = llr + Loull. Suppose (ii) there exist 

an «e[0, 1) and a neighborhood @ of (0,0) in 
R x X such that ge C?(@, X,), where Ch, X,) 
denotes the space of all X,-valued C* functions 
defined on ©. Moreover g(4,0) 20 if (4,0) e 0, 
and g,(0,0)=0. (iii) Let f = B(A) be a continu- 
ously differentiable function defined in a neigh- 
borhood of 0 such that B(A)eo(Ly + g,(4, 0)) 
and f(0) 2 i. Suppose that Re f’(0) 40. If as- 
sumptions (1), (i1) and (tii) are satisfied, then 
there exist a positive 6 and continuously dif- 
ferentiable functions (p, 4, u):(— ô, 6) >R? x 
C°(R, X,) such that (a) for 0<|s}<6, u(s) is a 
2np(s)-periodic solution of period 2zp(s) of (8) 
corresponding to 4— A(s); (b) p(0) = 1, 4(0) 20, 
u(0) 2 0, and u(310 if s #0; and (c) any 2zp- 
periodic solution of (8) in C$, (R, X,) (=the 
space of 2zp-periodic continuous functions 
with value in X,) with |p — 1|, |A| and ||ulj suffi- 
ciently small is of the above form for some 

(el <ô up to a translation of the real line. 
Moreover, if ge C**! (0, KL then the functions 
p, 4, u are of class Ch, 


V. The Lyapunov-Schmidt Procedure 


Suppose that L(A)= F,(4,0) has at A=0 an n- 
fold eigenvalue at the origin, i.e., dim Ker L(0) 
=n, and assume that X c Y. Let Ker L(0) be 
spanned by 9,,...,@,, and let P be the projec- 
tion onto this linear subspace that commutes 
with L(0). Then P must take the form Pu= 
275-1 <u, 97)? 9j, where the o* e Y* c X* are 
null functions of the adjoint operator L(0)* 
and <o; of» =ò. P is a linear operator from 
Y to Y; hence it can be regarded as a mapping 
from X to X as well, and Q—1— P is a projec- 
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tion onto the range of L(0) in Y. Using P and 
Q, the equation F(A, u) 20 can be decomposed 
into the system of equations 


OF(A,v+W)=0 and PF(Av-cwy)-0, (9) 


where v= Pu and y — Qu. One solves the first 
equation for y —wV(A, v) by the implicit function 
theorem, and then, substituting this into the 
second, one has the bifurcation equation 


F(A, v)=PF(A,v+ (4, v)) 20. (10) 


Solutions of the bifurcation equation are in 
one-to-one correspondence with solutions of 
the original system sufficiently close to the 
bifurcation point. 

There exists another method of reducing 
an infinite-dimensional problem to a finite- 
dimensional one, called the center manifold 
theorem [21, 39]. 


W. A Global Result 


The following global result is due to P. H. 
Rabinowitz (J. Functional Anal., 7 (1971)). 
Assume that F(A, u)=u—ALu+ H(A, u), where 
L is a compact linear operator and H:R x X > 
X is a compact mapping with H(4,u) — o( ||ul|) 
at 0 uniformly on bounded 4-intervals. Then, 
if u^! is an eigenvalue of L of odd multiplic- 
ity, (5, 0) is a bifurcation point for F with 
respect to the trivial solution. Moreover, the 
closure of the set of nontrivial zeros of F con- 
tains a component that meets (u, 0) and either 
is unbounded in R x X or meets (f, 0), where 
iz: à and fi“ is an eigenvalue of L. 

The beginning of bifurcation theory seems 
to be in the celebrated work of H. Poincaré 
(Acta Math., 7 (1885)). 


X. Abstract Cauchy Problems 


Suppose that we are given an abstract Cauchy 
problem 


ie Au (t 0), (11) 
u(+0)=a (12) 


in a Banach space X, where ae X and the non- 
linear operator A is assumed, for simplicity, to 
be independent of t. If the domain D(A) of A 
coincides with X and A is *Lipschitz continu- 
ous, we can reduce the abstract Cauchy prob- 
lem to the tintegral equation of Volterra type 


t 
u(t) 2 a4 | Au(s) ds, 

0 
and by applying the iteration procedure we 
can easily show that the abstract Cauchy 
problem has a uniquely determined solution. 
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A similar treatment works for a singular but 
mildly nonlinear A of the form A — L +N if the 
linear operator L is the generator of a fstrong- 
ly continuous semigroup el: in the sense of 
Hille-Yosida theory (— 378 Semigroups of 
Operators and Evolution Equations) and the 
nonlinear mapping N is Lipschitz continuous. 
In this case, we reduce the problem to the 
integral equation 


t 
u(t) - e'*a4- | e« 9" Nu(s) ds. 
0 
Here, if we merely have to assure ourselves of 
the local existence of the solution, then N can 
merely be locally Lipschitz continuous; and 
in this case, N can even be singular to some 
extent if e'^ is tholomorphic in t [22, 23]. A 
typical application of this procedure was made 
by P. Sobolevskii, T. Kato, and H. Fujita to 
the Navier-Stokes equation (— 204 Hydro- 
dynamical Equations; 205 Hydrodynamics) 
to construct regular solutions [22]. 
*Galerkin's method (— 304 Numerical Solu- 
tion of Partial Differential Equations) is some- 
times quite convenient for obtaining (tweak) 
solutions of (11) and (12) [5]. Again, typical 
applications were made to the Navier-Stokes 
equations by E. Hopf and others [5, 24, 25]. 
To prove the convergence of approximate solu- 
tions constructed by Galerkin’s method and 
subject to so-called energy estimates, we often 
make use of Aubin’s compactness theorem 
concerning vector-valued functions [5]. 
Remarkable developments have taken place 
since 1964 for the case where A is dissipative, 
i.e., — A is accretive. F. Browder [26] proved 
that if X is a Hilbert space and A is dissipative, 
then the mere continuity of A is sufficient for 
the twell-posedness of (11) and (12). Then Y. 
Komura [27] brought about a crucial advance 
by showing that if A is a (possibly multivalued 
and) maximal dissipative operator with D(A) 
dense in the Hilbert space X, then (11) and 
(12) are uniquely solvable for any ae D(A). 
Furthermore, he founded the theory of non- 
linear semigroups of operators by establishing 
a *nonlinear version of the Hille-Yosida the- 
ory for semigroups of inonexpansive opera- 
tors in Hilbert spaces [28]. Subsequent de- 
velopments and applications were made in 
various directions by T. Kato (J. Math. Soc. 
Japan, 19 (1967)), M. Crandall and A. Pazy 
(J. Functional Anal., 3 (1969)), H. Brézis and 
A. Pazy (J. Functional Anal., 6 (1970)), S. 
Oharu (J. Math. Soc. Japan, 22 (1970)), M. 
Crandall and T. Liggett [29], Y. Konishi (Proc. 
Japan Acad., 47 (1971)), and others. While 
Komura’s original proof was based on the 
Yosida approximation AU — 24) (c0) for 
A, it is proved in [29] that in any Banach 
space a dissipative operator A that is maxi- 
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mal in a certain sense generates a nonlinear 
semigroup T, by the exponential formula 


noo 


t —n 
T,X = lim (1-4) xX. (13) 
n 


The scope of applications of the generating 
theorem (13) can be seen, e.g., in B. K. Quinn 
(Comm. Pure Appl. Math., 24 (1971)), M. G. 
Crandall (Israel J. Math., 12 (1972)), Y. Koni- 
shi (Proc. Japan Acad., 48 (1972) and J. Math. 
Soc. Japan, 25 (1973)), and S. Aizawa (Hiro- 
shima Math. J., 6 (1976)). 


Y. Nonlinear Semigroups in Banach Lattices 


Let X be a Banach lattice (— 310 Ordered 
Linear Spaces). An operator A: X > D(A)>X 
is said to be dispersive if for all x, ye D(A), 


Ixy) * | <||(&-y—A(Ax—Ay))* || C420) 


When a dispersive operator A satisfies the 
range condition RU — 44) D(A) for any 4» 0, 
it generates an order-preserving semigroup T, = 
e^ on D(A): (Tx —5y)* ll & (x —)»)* | fort» 
0. We have therefore the preservation of order: 
x € y implies T;x < T; y. We can prove in par- 
ticular that the order of initial data is inherited 
by the solutions of a nonlinear heat equation 
(Y. Konishi). 

Remark. Various pathological phenomena 
arise when we do not restrict the form of A 
in the abstract Cauchy problem (11), (12). We 
cite merely the “blowing up” of solutions of 
Cauchy problems for *nonlinear heat equa- 
tions (— 291 Nonlinear Problems) and the 
nonlinear wave propagations described by 
nonlinear Schrodinger equations (J. B. Bail- 
lon, T. Cazenave, and M. Figueira). 


Z. Abstract Cauchy-Kovaleyskaya Theorem in 
a Scale of Banach Spaces 


T. Yamanaka (Comment. Math. Univ. St. Paul, 
9 (1960)), L. V. Obsyannikov (Soviet Math. 
Doklady, 6 (1965) and 12 (1971)), F. Treves 
(Trans. Amer. Math. Soc., 150 (1970)), L. Niren- 
berg (J. Differential Geometry, 6 (1972)), and T. 
Nishida [30] discussed abstract treatments 

of classical Cauchy-K ovalevskaya theorem for 
partial differential equations: Let S= {B,},>0 
be a collection of Banach spaces depending on 
the real parameter p>0. Let lui, denote the 
norm of an element uc B,. The collection S is 
called a scale of Banach spaces if, for any p and 
p' «p, B,c B, and |ul||, < lul, for any u in 
B,. Consider in S the initial value problem of 
the form 


eg (u(t), t), 


dt 


and u(0)=0. (14) 
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Assume the following conditions on F: (1) For 
some numbers R>0, n >0, pọ >Q and every 
pair of numbers p, o such that Deg «p « 

Po. (u, t) 9 F(u, t) is a continuous mapping of 
(ue B,| ull, <R} x (t|It| «5j into B,.. (ii) For 
any o <p < po and all u, ve B, with (ul, 

R, ||v||, « R, and for any t, |t| <y, F satisfies 

| F(u, t) — F(v, tll Clu —vll,/(p — p"), where C 
is a constant independent of t, u, v, p, or p'. (iii) 
F(0, t) is a continuous function of t, |t| <n, with 


values in B, for every p < py and satisfies, with 


a fixed constant K, || F(0, t), € K/(pg — 9), 0x 
p « po- 

Abstract Cauchy-Kovalevskaya theorem. 
Under the preceding hypotheses there is a 
positive constant M such that there exists a 
unique function u(t) which for every positive 
p < pg and |t| « M(pg — p) is a continuously 
differentiable function of t with values in B,, 
|u(t)]], « R, and satisfies (14) (— also [31 ]). 

These results cover a theorem of M. Na- 
gumo (Japan. J. Math., 18 (1948)), which gen- 
eralizes the classical Cauchy-K ovalevskaya 
theorem. 
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A. Lattice Dynamics 


In order to elucidate certain characteristic 
features of nonlinear waves, one-dimensional 
lattice models have been studied. Around 
1953, E. Fermi et al. performed computer 
experiments on nonlinear lattices to verify a 
generally accepted belief that nonlinear cou- 
pling between the tnormal modes of harmonic 
oscillators would lead to complete energy 
sharing between these modes. To their sur- 
prise, their nonlinear lattices yielded very little 
energy sharing at all; on the contrary, the 
interactions resulted in the recurrence of the 
initial state. These results were later inter- 
preted in terms of solitons (— 387 Solitons), 
i.e., nonlinear waves that preserve identity 
despite mutual interaction. 

The equations of motion for a uniform 1- 
dimensional chain of particles of mass m with 
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nearest-neighbor interaction can be written as 


so, 


m 
dt? 





—9(Q,— Q,-1) t PQr ~ Qn) 


mS Dale 


It was shown that a lattice with a nonlinear 
interaction of the form 


olr)=e "+r+const. 


admits solutions in closed form. Later, it was 
shown that this lattice (the exponential lattice 
or the Toda lattice) is a completely integrable 
system. 

It is convenient to introduce s,, which is the 
generalized momentum canonically conjugate 
to the mutual displacement r, — Q,,, — Q,. For 
a lattice with exponential interaction, 


e "^ —1 2 ds, /dt, 


and if we introduce 


S, -| s, dt, 


then the equations of motion can be written as 
log(1-- d?S,/dt?) Ban + $,., — 2S,, 
and the displacements are given by 
Q,—$,— 5,4. 
We have a solution 
S, —logí1 + e2t* Fray 


where « and ô are arbitrary constants and f = 
+sinha. The associated wave 


e '"—]1 = fl? sech (an+ Bt +ô) 


represents a solitary wave or soliton. 
The multisoliton (N-soliton) solution is 
given by 


S, log det ¥,, 


where VP, is an N x N matrix whose elements 
are 


(ZZ) 
(Pu = Orn + ciep — et DAN 
J J dJ 1 — ZZ, 
with 
Ze Te" 


Asymptotically the wave reduces as t-> Foo to 
an assembly of solitons 


N 
e vn 1 = KS B? sech?(a;n + Bt + 6;*). 
j=1 


B. Conserved Quantities 


The equations of motion for a periodic ex- 
ponential lattice of N particles can be written 
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in a Lax representation dL/dt = BL — LB, 
where L and B are N x N matrices with the 
elements 


Lian = Dy, 


Aen = Lam d, 
Liam Ly, 1— y, 


B, nsi = — Bati n= — ds, 


By y= — By iy 


(the other elements of L and B are all zero), 
with 


d, =te One "Hd, 


b,-5P, (P,—aQ,/dt). 


The eigenvalues 4 of L can be shown to be 
independent of time, and so the motion of the 
lattice is a spectrum-preserving deformation. 
Now, if we define {J;} by 


det(41 — L) 2 AN -- AV! J, +... - Aly 4 9 Hy, 


the n{1,} are polynomials of a, and b,. These 
are constants of motion that were discovered 
independently by M. Hénon and H. Flaschka. 
Thus the lattice has N conserved quantities; 
I, is related to the total momentum and J, to 
the total energy, but higher-index conserved 
quantities have no physical interpretation. 


C. Method of Integration 


Let L and B be the infinite matrices obtained 
from the foregoing ones in the limit N >œ. 
The eigenvalues 4 of the equation 


Loọo=}p 


are independent of time, and the time evolu- 
tion of ọ is given by the equation 


dọ/dt= Bg. 


If the motion in the lattice is restricted to a 
finite region, we can clearly speak of the scat- 
tering of the wave o due to the deformation in 
the lattice. For a given initial motion Q,(0) and 
P,(0), or L(0), we calculate the initial scattering 
data of asymptotic form q ~z” (n oo). The 
scattering data consist of the reflection coeffi- 
cient R(z), the bound state eigenvalue 4;— 
—(zj--zj ')/2 (4j« —1 or 4; 1), and the co- 
efficient c; of the normalized bound state eigen- 
function of asymptotic form c;z; for n> oo. 

From the initial data and the equations of 
motion for n— +00, we get the scattering data 
at a later time t. In effect, we construct the 
kernel 


1 -i _ 
Fem y Ree ml dz 
2ni 


FEG Oe e up 
j 
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of the discrete integral equation (Gel'fand- 
Levitan-Morchenko equation) 


Kin, m) - F(n+m)+ y Kin, n) F(n' 4- m) 20, 


n'-n4l 


mzn-4i. 


After solving this equation for x(n, m), we 
calculate K (n, n), given by 


1 Es 
—————-—1-4F(Qn)4 (n, n') F (n' +n). 
kugi tE vg en) 
Then the initial value problem is solved in the 
form 


e Q^ Q4-0— | Knn ` i 
K(n—1,n—1) | 


The solution can be given as dQ„/dt =S, — Sn+1> 
with 


$,— k(n — 1, n). 


The simplest case R(z)=0 yields the multi- 
soliton solution. ) 

For the periodic case also, eigenvalues of 
the equations Lo = ue and do/dt = Bo, under 
suitable boundary conditions and for certain 
initial data, give sufficient information to 
construct a solution to the initial value prob- 
lem. Such a method of obtaining a general 
solution for the periodic lattice was developed 
by E. Date and S. Tanaka, and independently 
by M. Kac and P. Van Moerbeke. Following 
Date and Tanaka, the solution can be written 
in terms of the multivariable theta function, or 
the Riemann theta function 3, as 

d ant Bt 4- à) 


Z= t. 
WEEN E R 





where a, fj, and 6 are certain vector constants. 
There has been much activity recently to- 

ward interpreting the integrability of the Toda 

lattice in terms of Lie algebras (B. Kostant 


[9]). 
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288 (XIil.10) 

Nonlinear Ordinary 
Differential Equations 
(Global Theory) 


A. General Remarks 


Many well-known functions (with the notable 
exception of the 'T-function), such as the ex- 
ponential, trigonometric, *elliptic, and tauto- 
morphic functions, satisfy ordinary differential 
equations of simple forms. For the purpose 

of finding new transcendental functions, P. 
Painlevé initiated the systematic study of the 
equation 


,))20 (1) 


in the complex domain. To investigate the 
solution in its whole domain of definition, he 
assumed that F is a polynomial in y, y^, ... , y 
whose coefficients are analytic in x. Such an 
equation is called an algebraic differential 
equation. If F is linear in y, then (1) is written 
as 


F(x, y Ves 


SM 
e EE 


where P and Q are polynomials in y, y^, ..., 
y"! with coefficients that are analytic func- 
tions of x. Equation (2) is called a rational 
differential equation. 

If F is linear in y, y, ..., y, i.e., (1) is a linear 
differential equation, and if the coefficient of 
y? is 1, then singular points of solutions are 
situated at the singular points of the coeffi- 
cients (— 254 Linear Ordinary Differential 
Equations (Local Theory)). If F is not linear, 
then singular points of solutions of (1) are 
divided into two categories, one consisting of 
those points whose positions are determined 
by the equation itself and are independent of 
individual solutions, and the other consisting 
of those points whose positions depend on the 


is P(x, y, y, o 
Q(X, y, y. vo 





(2) 
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choice of particular solutions. In other words, 
the singularities of the first category appear 
independently of the choice of arbitrary con- 
stants involved in the general solution, while 
those of the second category depend on the 
choice of the arbitrary constants. The former 
are called fixed singularities and the latter 
movable singularities. The linear differential 
equation (1) has fixed singularities only, which 
are situated at the singularities of the coeffi- 
cients. In the same way, *branch points of 
solutions can be classified into two kinds, fixed 
branch points and movable branch points. 


B. Algebraic Differential Equations of the First 
Order 


Consider the equation 


y Pei 
Oto 


where P and Q are relatively prime polyno- 
mials in x, y. The fixed singular points of (3) 
are defined to be points č, č with the following 
properties: (i) Q(5, y) «0. (ii) Q(Z^, y) #0, P(C', y) 
— Q(&, y) 20 have a root y=’. (iii) If, in (ii), 
we substitute 1/z for y and if the same relation 
(ii) holds for x= € and z=0, we count such 
a value €, as a fixed singular point. (iv) We 
transform equation (3) by setting x = 1/t. If 
the value t — 0 satisfies (i) or (11) for this trans- 
formed equation, we count oo as a singular 
point € or E accordingly. The points £, E are, 
in general, ‘transcendental singularities of solu- 
tions, and the points ¢' cannot be tessential 
singularities of solutions but may be *ordinary 
transcendental singularities. A singular point 
of a solution different from č and £' is an falge- 
braic singular point, and for any point dis- 
tinct from Z and Z', equation (3) admits a 
solution with an algebraic singularity at this 
point. A necessary and sufficient condition 
that (3) has no movable branch point is that 
(3) be a *Riccati equation. 

Consider the algebraic differential equation 
of the first order 


F(x, y, y') - 0. (4) 


After defining the fixed singular points £ and 

& of (4), where the algebraic function of x, y 
defined by (4) has bad singularities, Painlevé 
proved that movable singularities of solutions 
are algebraic. Before Painlevé's work, L. Fuchs 
gave a necessary and sufficient condition that 
(4) has no movable branch points, and then H. 
Poincaré showed that if this condition is satis- 
fied, then (4) is either reducible to the Riccati 
equation if g=0, integrable by the use of ellip- 
tic functions if g — 1, or algebraically integrable 
if g> 1, where, with x the independent vari- 





(3) 
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able, g denotes the tgenus of the algebraic 
curve defined by (4). Painlevé found that there 
were gaps in the proofs of Fuchs and Poincaré 
and completed these by proving his theorem 
and the following one: Let @(x, yo, xo) be the 
solution of (4) satisfying the initial condition 
y(x9) = yo. Let x, Xo be points different from ¢ 
and £, and let L be a curve connecting x, to 
X and not passing through any č or č’. If we 
denote by q, (X, Yo, Xo) the value at x =X of the 
branch obtained by continuing (x, Vo, Xo) 
analytically in a neighborhood of L and re- 
gard o, (X, yo, Xo) as a function of yo, then 

(0, (X, yo. Xo) coincides, in a neighborhood of 
every point yg — b, with several branches of an 
talgebroid function of yọ. Painlevé studied the 
case when the general solution is finitely many- 
valued and gave a condition for o(x, yo, xo) to 
be an algebraic function of ys. 

When equation (4) does not contain x ex- 
plicitly, no movable branch points appear if 
and only if all solutions are single-valued, and 
then the solutions are expressible in terms of 
rational, exponential, and elliptic functions. 
Such an equation is called a Briot-Bouquet 
differential equation. 

J. Malmquist proved, by using P. Bou- 
troux's method of studying the behavior of 
solutions in the neighborhood of a fixed sin- 
gularity, that if equation (4) admits at least one 
solution that has an essential singularity and is 
finitely many-valued and free from movable 
branch points around this singularity, then (4) 
is an equation without movable branch points. 
If (4) admits a solution that is a finitely many- 
valued transcendental function, then an alge- 
braic transformation may be applied to (4) 
so that it will become an equation without 
movable branch points. It is an immediate 
consequence of the first assertion that if (3) 
admits a solution that has an essential sin- 
gularity and is finitely many-valued and free 
from movable branch points around the sin- 
gularity, then (3) is a Riccati equation. 

Later, equations (3) and (4) were studied by 
M. Hukuhara, K. Yosida, T. Sato, T. Kimura, 
and T. Matuda. The following results are due 
to Kimura. If a solution @(x) of (3) has an 
essential singularity at x =, then, in an arbi- 
trary neighborhood of č, o(x) assumes every 
value with the exception of the roots of P(é, y) 
=Q. If (3) is not a Riccati equation, it is deter- 
mined by a finite number of algebraic pro- 
cesses whether or not (3) admits a solution 
that has an essential singularity at x 2 € and 
has no movable branch point around €. If (3) 
admits such a solution, then the singularity is 
a ‘logarithmic branch point. For an essential 
singularity of a solution there exists, in gen- 
eral, a direction similar to a *Julia's direction, 
which was investigated by Hukuhara and 


288 C 
Nonlinear ODEs (Global Theory) 


Kimura. Matuda studied in detail the behavior 
of solutions as x tends to č along a half-line 
and concluded that, except for some special 
cases, any solution tends to a certain value 

as x tends to č along a half-line. To obtain 
algebraic solutions C. Briot and J. C. Bou- 
quet devised a method similar to *Puiseux 
expansion in the theory of algebraic functions. 
Hukuhara improved their method and suc- 
ceeded in reducing (3) to several differential 
equations of standard forms in a neighbor- 
hood of x = č. This enables us to apply the 
local theory to the global study. Hukuhara's 
method was used by Kimura and Matuda to 
obtain the results discussed in this paragraph. 


C. Algebraic Differential Equations of the 
Second Order 


For second-order algebraic differential equa- 
tions we can pose the same problems as for 
first-order equations: When do these equations 
have single-valued or finitely many-valued 
general solutions? What new transcendental 
functions are needed to integrate such equa- 
tions? These problems, studied by E. Picard 
and Painlevé, are difficult because of the exis- 
tence of movable transcendental singularities. 
However, Painlevé succeeded in determining 
rational differential equations of the second 
order without movable branch points. Such 
equations, with the exception of those that 
are integrated by the use of solutions of the 
first-order and linear differential equations, 
can be transformed by rational transforma- 
tions into one of the following six differential 
equations: 


(I) y"=6y?+x, 
(ID. y’=2y?+xy+a, 


(I) y'2—-——4——— Hy 4- 
y x x 
,;2 3 3 

(IV) y=} ke + 2(x? E 
2y 2 y 





l 1 y 
V "y + 
(V) y"=y (s zu) 5 


-17 yy ôy(y+l 
EECH (s) 2. y(y+ 1) 
y x yel 





H 








(VI) y= 


(C 1 1 ) 
—y'| -+ + + 
x x—1 y-x 
Bx y(x—1) dx(x—1) 
` t y (i ^ (yx? | 


where o, fj, y, and 6 are constants. These equa- 


y(y—1)(y—x) 
x?(x — 1? 
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tions and their solutions are called Painlevé 
equations and Painlevé transcendental func- 
tions, respectively. Equation (VI) was dis- 
covered by B. Gambier, who found an omis- 
sion in Painlevé's calculations. All solutions of 
(I) are single-valued, and their properties were 
investigated by Boutroux. The solutions of 
(VI) have, in general, logarithmic branch 
points at x =0, 1, oo and were studied by R. 
Garnier. 

The case when the equation is of degree 2 
with respect to y" was studied by Malmquist 
and F. Tricomi. 

The following facts are known concerning 
movable transcendental singularities of the 
rational equation y" = P(x, y, y Q(x, y, y'), 
where P and Q are relatively prime poly- 
nomials (Kimura). Let p, q be the degrees of P 
and Q with respect to y’. If p>q +2, then any 
solution g(x) admits no movable essential 
singularities, but its derivative ~'(x) may admit 
such singularities. If p q-- 2 and Q is not 
decomposable as Q(x, y) O;(x, y, y), then 
neither o nor g’ admits movable essential 
singularities. If p <q +2, then both o and o 
may have movable essential singularities. If 
p<q+2 and Q is not decomposable as above, 
then oi has no movable essential singularities. 
If, for a solution (x), x =a is a movable essen- 
tial singularity of o(x) or g’(x), the (x) or 
Q'(x) assumes all values other than a finite 
number of exceptional values in an arbitrary 
neighborhood of x =a. If p>q+2, then every 
solution possesses tIversen’s property, and 
hence the set of movable singularities is not a 
continuum. 


D. Higher-Order Equations and Other 
Equations 


Painlevé's method of obtaining the second- 
order equations without movable branch 
points is applicable to higher-order equations. 
The determination of third-order equations 
without movable branch points was attempted 
by Painlevé, J. Chazy, and Garnier by the use 
of this method, but is not yet complete. Chazy 
studied in detail an equation of the form 


Int (1—1/n)y? , "n + 
y Aa EPOD te(y)y? 


and showed that when n — —2 and b(y)— 
0, *Fuchsian and *Kletnian functions are 
obtained as solutions (— 32 Automorphic 
Functions). 

R. Fuchs, a son of L. Fuchs, derived equa- 
tion (VI), at almost the same time as Gam- 
bier, from the study of *nonodromy groups. 
He showed that the monodromy group of the 
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1d?z Co P y 
——— = | —T————--4———— 
z dt? Li (t — xy 
ô 3 a 





PUD S EEN 


ay 
(t 1) — y) 


remains invariant as the singularity x varies if 
and only if o, f, y, and 6 remain constant; the 
singularity y, considered as a function of x, 
satisfies equation (VI); and a and b are rational 
functions of x and y and y’. Investigating a 
second-order linear differential equation of 
Fuchsian type with tregular singularities 0, 1, 
00, X155 Xp Y p, Yn Where Mi Yp are ap- 
parent fixed singularities, Garnier was led, 
under the hypothesis that the monodromy 
group of this equation remains invariant as x,, 
<- Xp Vary, to a tcompletely integrable system 
of partial differential equations and showed 
that a symmetric function of y,, ..., Ya, CON- 
sidered as a function of any one of x,, ..., Xp, 
satisfies an equation without movable branch 
points (— 253 Linear Ordinary Differential 
Equations (Global Theory)). 

For nonalgebraic equations, movable 
branch points appear even in the first-order 
case. Kimura obtained a sufficient condition 
that for an equation F(x, y, y) 20, where F isa 
polynomial of y' with meromorphic coeffi- 
cients in x, y, every solution has Iversen's 
property in a domain of the complex plane. 

It was shown by O. Holder that the T- 
function satisfies no algebraic differential 
equation. Also, if a function meromorphic in 
the unit circle is of *order oo, then it satisfies 
no algebraic differential equation. 
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289 (XIII.9) 

Nonlinear Ordinary 
Differential Equations 
(Local Theory) 


A. General Remarks 


Consider a system of n differential equations 


j=1,...,n, (1) 


where the f, are analytic functions of x, y4, 
<- Yn: TO simplify the notation, we use vector 
notation y instead of (y,,..., y,). If all the 

f; are holomorphic at a point (x, y) — (a, b), 
there exists one and only one solution y(x) for 
(1) such that y 5 b as xa and y(x) is holo- 
morphic at x —a. We say that a point (a, b) is 
a singular point of the system (1) if it is a sin- 
gular point of f; for some j. The well-known 
*Cauchy's existence theorem can no longer be 
applied to the case when (a, b) is a singular 
point of system (1). In this case, the following 
three problems arise naturally: (i) to determine 
whether solutions y(x) such that y(x)-»b as 
xa exist, and if they exist, to determine the 
number of independent solutions; (ii) to con- 
struct analytic expressions for solutions y(x) 
such that y(x) b as xa or, in a slightly more 
general way, analytic expressions for bounded 
solutions y(x) such that the values of (x, y(x)) 
stay in a neighborhood of the singular point 
(a, b); (iii) to investigate the properties of these 
solutions. These three problems are called 
local problems, since only those solutions in 

a neighborhood of the singular point (a, b) 

are considered. However, even when n- 1, . 
the study of local problems is very difficult 
except for the case of singular points of par- 
ticular types at which the functions f; are 
meromorphic. 

When n> 1, the problem becomes even 
harder; research on this case lags that for n=1. 
In the subsequent discussion we assume with- 
out loss of generality that a —0, b — 0. 


dy;/dx = f; yi, .... ys), 


B. The Case of a Single Equation 


Consider the equation 
dy/dx = Y (x. y X (x. y), Q) 


where X and Y are holomorphic functions of 
(x, y) at (0,0) x, ye C. When X(0,0) 20 and 
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Y(0,0) 40, we can rewrite equation (2) in the 
form dx/dy = X (x, y)/Y (x, y), and we see that (i) 
if X (0, y) £0, equation (2) has one and only 
one solution y(x) which is algebraic at x «0 
and tends to 0 as x 26; (ii) if X(0, y) =0, there 
is no solution of equation (2) such that y0 as 
x0. 

The case where X(0,0)=0 and Y(0,0)=0 
was first studied by C. A. A. Briot and J. C. 
Bouquet. In order to obtain algebraic solu- 
tions they introduced a method similar to 
*Puiseux expansion in the theory of algebraic 
functions. A. R. Forsyth and J. Malmquist 
studied a problem of reduction for equation 
(2) by using the Briot-Bouquet method. The 
theory of reduction was completed by M. 
Hukuhara, who divided a neighborhood of 
(0, 0) into a finite number of subdomains in 
such a way that (i) the union of these sub- 
domains covers the given neighborhood of 
(0, 0) completely; (ii) in each of these sub- 
domains equation (2) takes one of eight canon- 
ical forms. Hukuhara investigated the prop- 
erties of the solutions for these canonical 
forms. Among them, the following two are 
well known as the Briot-Bouquet differential 
equations: 


xdy/dx= f(x,y) ^ f(0,0)—0, (3) 
TTT dy/dx = f(x, y), 
f(0,0)=0, o21 is an integer. (4) 


C. Systems of Differential Equations 

When y is a vector with components (yj), equa- 

tions (3) and (4) can be written as 

sch, (^ (5) 
j=l,...,n. (6) 


xdy;/dx = fx, yi, ..., yo). 
x"! dy/dx = f(x, pi Vas 


After Briot and Bouquet, the singular points of 
these types were studied by many authors, 
including Poincaré, E. Picard, H. Dulac, 
Malmquist, W. J. Trjitzinski, and Hukuhara. A 
method of constructing solutions of these 
equations consists of two parts: (i) formally 
transforming the given equations into simpli- 
fied or reduced equations of the simplest pos- 
sible form by applying a formal transforma- 
tion of the type 


y=} pax z (7) 
or 


= ko ok k 
Yj 7 3 Pa, X SNE 


j=1,2,...,n; (8) 


(ii) verifying convergence or the validity of 
*asymptotic expansions for the formal solu- 
tions of the given equations, which are ob- 
tained by substituting bounded solutions of 
the equations satisfied by z or z, into (7) or (8). 
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By studying these analytic expressions, the 
properties of the solutions can be clarified. 


D. Properties of Solutions of Briot-Bouquet 
Differential Equations 


For equation (3), the character of the simpli- 
fied equation depends on the value of å = 
f,(0, 0). We have the following four cases: (i) 
4 is neither 0 nor negative. A suitable formal 
transformation (7) changes (3) to 


xdz/dx = Az 4 bx?. 


In particular, if 4 is not equal to a positive 
integer, then b=0. The double power series (7) 
is uniformly convergent. There exists a func- 
tion @(x, z) of (x, z) holomorphic at (0,0) such 
that y= o(x, x^(blogx- C)) is a general solu- 
tion of (3) with an integration constant C. (ii) 
4=0. The equation satisfied by z takes the 
form 


xdz/dx 2z"*!(b--b'z", mæl. 


If b=0, b’ necessarily vanishes and x 20 is a 
holomorphic point of (3). A general solution is 
given by 


Z(x)2(C—mblogx) "" ban bast 
and 
b (mb? 1 um 
Z(x)= {| —a{ ——log—+C , bb’ #0. 
b b' x 


Here, 6 — a(t) is the branch (of the inverse 
function of č — logi =t) such that a(t) — tlogt 
—t—0 as t— oo. Then there exists a holo- 
morphic function ot, z) of (x, z) for |x| « ó, 
Imargz - argb| « 3x/2 — e, |z| « ^ such that 
mix, Z(x)) is a general solution of (3). (iii) A is a 
negative rational number — uv The equation 
satisfied by z is written as 


vx dz/dx = 2z(— u+ b(x"z"y" + b'(x"z")?m). 
A general solution has the form 


Z(x)=x*(C—mblogx) Um, — p'—0, 


and 
b p? 1 —1/mv 
Z(x)- x^| —a ne log—+C ; 
b b x 


In this case, there exists a holomorphic func- 
tion oi, z) of (x, z) for |mpargx+mvargz+ 
argb t z/2| «1 —&, |x|<6, |z| « A such that 
(x, Z(x)) is a general solution of (3). M. 
Iwano expressed this solution in the form 

V ((x" Z(x)")", x, Z(x)), where j(w, x, z) is 
holomorphic for |argw E 5| « x —&, 0 « |w| « ó, 
Ix| « A, |z| « A. In particular, if b=b’ 20, o(x,z) 
is holomorphic at (0, 0). (iv) 4 is a negative 





bb’ #0. 
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irrational number. The equation in z has the 
form 


xdz/dx = Az. 


The formal transformation (7) may either 
diverge or converge. Dulac proved that if (7) 
diverges and if there exists a solution y(x) such 
that y(x) 0 as x0 along a suitable path L, 
then |x*y(x)? arg x| oo and |x* y(x)? arg y(x)| 
oo as x0, xe L for any « and fj. However, 
the existence of such a solution is not yet 
verified. C. L. Siegel proved that if A satisfies 
certain inequalities, the formal transformation 
(7) is divergent. An example such that (7) is 
divergent was first given by Dulac. A very 
simple example for such a case was given by 
Y. Sibuya. 

In the case f(0,0)=0 and A= f,(0,0) 40 for 
(4), the most complete result was obtained by 
Hukuhara, namely: (1) A suitable formal trans- 
formation (7) reduces equation (4) to 


x°*) dz/dx =2(Gyot%X+4+...+4 9X7), äus A 


A general solution is given by Z(x)=C: 
eeh, where A(x) is a polynomial in 1/x of 
degree oc. (ii) A general solution of (4) is ex- 
pressed by a uniformly convergent power 
series of the form È q,(x) Z(x), where the q, (x) 
are holomorphic functions of x for a certain 
sectorial neighborhood of x =0 and have 
*asymptotic expansions X Dar" as x 50. 
Assume f;(0, 0, ...,0) - 0 for equations (5). 
Put An = 0f;/0y, (0,0, ...,0). Denote by 4,, ..., 
Àn the eigenvalues of an n x n matrix with 
elements (45. Then (i) if an angle w can be 
chosen so that for some m <n, all of lol, 
larg 4, — o) ...,|]argZ,, —«] are less than z/2, 
equations (5) possess solutions that are 
expressed as uniformly convergent (m + 1)- 
tuple power series of x, Z,(x), ..., Z, (x): 


ko Fk k 
NS ky wk X Sat pap. 
0^1 m 


Here the Z,(x) are general solutions of the 
simplified equations and have the expression 


Z,(x) 2 x^ (C, +a polynomial of 
Cs C o log x), 


(ii) Moreover, Iwano extended the result of (i) 
as follows: When there exists one and only one 
zero among the other n— m eigenvalues, equa- 
tions (5) have solutions, depending on m+ 1 
arbitrary constants, that are expressed as (m+ 
1)-tuple uniformly convergent power series 


x Pj, s (Zm OXZ Q^ ... Zelle, 


Here the coefficients py. x, A (z,,,,) are holo- 
morphic functions of z,,,, in a sectorial neigh- 
borhood of z,,,,=0 and admit asymptotic 
expansions in powers of z,,, as z,,, 70. In 
this case, the functions Z,(x), ..., Zm+1(x) can- 


k=1,2,...,m. 
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not generally be integrated by quadratures, 
but as Z,,.;(x)0, the power series expansions 
coincide with the expressions obtained in (i). 
(iii) In the case when the Jacobian matrix (45) 
is the zero matrix, Iwano constructed, under 
additional assumptions, a convergent analytic 
expression of a general solution for equations 
(5). 

Let the right-hand side of (6) be holomor- 
phic at (0,0, ...,0). In 1939 Trjitzinski proved 
the existence of solutions that admit asymp- 
totic expansions in powers of n arbitrary con- 
stants. In 1940 and 1941, Malmquist proved, 
under strong conditions, the existence of 
solutions that are expressed as uniformly 
convergent power series of Z,(x), ..., Z,(x): 

X py, us (X) ZxY^ ... Zy(x)'^. Here the Z,(x) are 
polynomials of x and log x of the form 


Z, (x) e e^? x^: (C, +a polynomial of 


nen C, log x), k= 0p 03 8, 


where the coefficients admit asymptotic expan- 
sions in powers of x. Trjitzinski’s result is 
contained in Malmquist’s result as a special 
case. On the other hand, under much weaker 
assumptions than Malmquist’s, Hukuhara 
solved a problem on the formal simplification 
of equations (6) and formal solutions. Iwano 
improved Hukuhara’s result on formal solu- 
tions and discussed the convergence of formal 
solutions under weaker conditions than Malm- 
quist’s (Ann. Mat. Pura Appl., 1957, 1959). 
The pj, ..,, (x) are holomorphic functions of 
x in a sectorial neighborhood of x 20 and 
admit asymptotic expansions in powers of x as 
x90. The angle of the sector in which the 
asymptotic expansions are valid is largest for 
Iwano's method. 

If 2:0; « n, equations of the form 


x"idyjdx—fjG5, yi,.... yh — jol....n 


possess at least n — 3 c; solutions that are 
holomorphic for |x| « ó (R. W. Bass, Amer. J. 
Math., 77 (1955)). This result is analogous to 
those obtained by O. Perron, F. Lettenmyer, 
and Hukuhara and Iwano in the linear case 
(— 254 Linear Ordinary Differential Equa- 
tions (Local Theory)). 


E. Singular Perturbations 


The terms on the right-hand side of the non- 
linear differential equations 


e%Gdy;/dx = f(x, y gees , Yn» E) 


(9) 


are holomorphic functions of (x, y, c) for |x| « a, 
lyi <b, 0 « [e| «c, |arge| <d, and admit uni- 
formly convergent expansions in powers of y 
with coefficients asymptotically developable in 
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powers of c. The o; are nonnegative integers. 
W. R. Wasow, W. A. Harris, Sibuya, and 
Iwano and T. Saito discussed problems on 
constructing asymptotic or convergent expan- 
sions for bounded solutions that are depen- 
dent on several arbitrary constants. 

In equations (9), the f; and 6f;/Cy, are con- 
tinuous functions of (x, y, e) for —oo «x « +o, 
lvl <b, |e| <c, and periodic functions of period 
T with respect to x. Moreover, assume that a 
system of degenerate algebraic equations 


0= fi(x. y1 s Ym 0), 


has a periodic solution y;— pj(x) of period T 
for —oo « x « +o. Then if equations (9) have 
periodic solutions y,= p,{x, £) of period T such 
that y,>p,(x) as £0, the p,(x, £) are called 
singular perturbations of p,(x) for equations (9). 
Concerning this problem, see I. M. Volk (Prikl. 
Mat. Mekh. SSSR, 10 (1946)). The work of 
Wasow (1950) on a single equation 


jede (10) 


020, n»mz0, 


(11) 


gy = f(x, y, y’, y mg), 
is remarkable. 
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A. General Remarks 


By nonlinear oscillation we usually mean oscil- 
lation described by periodic or talmost peri- 
odic solutions of nonlinear ordinary differential 
equations. The theory of nonlinear oscillation 
is sometimes called nonlinear mechanics. In 
connection with oscillations in tdynamical 
systems and electrical circuits, the theory of 
nonlinear oscillation has been studied inten- 
sively in the Soviet Union under the direction 
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of N. M. Krylov and N. N. Bogolyubov since 
around 1930, and after World War II research 
in this field became active in Western countries 
also. 

Written as first-order systems, the differen- 
tial equations in this theory take one of the 
forms 


dx/dt = X(x) l (1) 
Or 
dx/dt = X(x, t), (2) 


where x is a vector and t is a scalar. A dif- 
ferential equation of the form (1) ts said to 

be autonomous. In a differential equation of 
the form (2), X(x, t) is usually assumed to be 
periodic or almost periodic in t. In the former 
case, the differential equation (2) is said to be 
períodic, and in the latter case, almost periodic. 
Oscillations in physical systems are described 
mostly by periodic or almost periodic dif- 
ferential equations; therefore it is a principal 
problem in the theory of nonlinear oscillation 
to find a periodic or almost periodic solution 
of these differential equations. However, an 
oscillation described by a solution of a dif- 
ferential equation can be actually realized only 
when the solution is tstable (— 394 Stability) 
under a small variation of the initial value. 
Therefore it is important to investigate the 
stability of periodic or almost periodic solu- 
tions. In view of the fact that an actual phe- 
nomenon may be only approximately de- 
scribed by mathematical equations, sometimes 
it is necessary to require a certain stability 

of the system so that solutions of a peri- 

odic or almost periodic equation stay stable 
under a small variation of the equation itself. 
Such stability is called structural stability, 

the investigation of which is also important. 

It may happen that a differential equation 
possesses neither a periodic solution nor an 
almost periodic solution, but that it has an 
almost periodic tintegral manifold (e, a mani- 
fold x =f(t, 0) in tx-space, where 0 is a para- 
meter, such that f(t, 0) is periodic or almost 
periodic in t and periodic in 0) containing the 
‘trajectory of the differential equation passing 
through an arbitrary point of the manifold. In 
this case, we can consider that a solution 
corresponding to a trajectory lying on the 
manifold describes an oscillation. Therefore 
it is important to find a periodic or almost 
periodic integral manifold of a differential 
equation and to investigate the stability of 
such a manifold. 

The methods used most frequently in re- 
search on nonlinear oscillation are: (1) geo- 
metric methods, (ii) analytic methods, and (iii) 
numerical methods. 
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B. Linear Oscillations 


Let S, and Sf be the spaces of all c-periodic 
solutions of the w-periodic linear system 


dx/dt=A(t)x, A(t+@)=A(2), (3) 


and its adjoint system, respectively. Then the 
space P of continuous w-periodic functions 
RR" has direct sum decompositions P. = 

Si +S, and P,,— Sf -- $7 so that (x, y)=0 for 
every xE S], yES, or xeSf, ye S¥, where (x, y) 
—(1/0) [8 'x(s)y(s) ds. Let (£!, ..., £") and 
{y',...,9™\ (m may be 0) be bases of S| and S7 
orthonormal with respect to (-,-). Then for 

pc P, there is a unique c-periodic solution x 
of 


dx/dt = A(t)x + p(t) (4) 


belonging to S, under the condition (uk, p)=0 
(k — 1, ..., m), which is represented in the form 
x =G[p] by a bounded linear operator CG P,,— 
PR Also, if (3) and hence its adjoint system 
have no nontrivial c-periodic solution, then 
(4) always has a unique w-periodic solution 
given by x 2 G[p], and this is true even if the 
c-periodicity is replaced by almost periodicity. 
The almost periodic system (3) is said to be 
regular if (4) has almost periodic solution for 
an arbitrary almost periodic function p(t). A 
necessary and sufficient condition for (3) to be 
regular is that (3) induce an exponential di- 
chotomy, that is, the solution space S of (3) 
have a direct sum decomposition $2 S. +S, 
such that |x(t)| x Me ?'^* |x(s)| holds for —oo 
«sxt«oo if xe$, and for ~œ «txs« oo if 
xeS_, where M and y are positive constants. 
An autonomous (resp. periodic) system (3) is 
regular if and only if no *characteristic roots 
(resp. characteristic exponents) have zero real 
parts. 


C. Geometric Methods 


Geometric methods are used frequently for 
finding a periodic solution in an autonomous 
or periodic case. In the autonomous case (1), a 
periodic solution describes a closed orbit (t is 
a parameter of a curve) in x-space, which is 
usually called a phase space. The geometric 
method is used to show the existence and the 
stability of a closed orbit by investigating 
geometrically the behavior of orbits in the 
phase space. In such an approach, the prop- 
erties of tcritical points and tlimit sets (— 126 
Dynamical Systems) are utilized frequently. 
This method is effective especially for 2- 
dimensional cases on the basis of the tPoincaré- 
Bendixson theorem, and various results are 
given for a generalized Liénard’s (or Duffing’s) 
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differential equation X + f(x)x + g(x) 20, which 
includes the van der Pol differential equation 
X—A(1 —x?)x 4 x «0 [1,2]. 

In the periodic case (2), let ot 0) be a 
period of X(x, t) with respect to t and x — 
ot, x) be a solution of (2) such that (0, x)= 
a. Then a periodic solution of (2) is given by 
X= P(t, %o), where Geo satisfies (o, x9) = ap. 
Thus the existence of a periodic solution can 
be shown by investigating geometrically the 
existence of a ‘fixed point of the mapping x5 
x’ = o(o, x) in the phase space. In such an 
approach, *Brouwer's fixed-point theorem is 
utilized frequently. In the mapping xx, it 
may happen that there is no fixed point but 
that there exists an tinvariant manifold. In this 
case, we get a periodic integral manifold. 

Geometric methods give information on 
qualitative properties, but usually not on 
quantitative properties like the shape of an 
oscillation. Thus these methods are in general 
not sufficient for the analysis of the phenom- 
ena met in practice. 


D. Analytic Methods 


At present analytic methods are used most 
frequently in the study of nonlinear oscilla- 
tions because, in comparison with geometric 
methods, they enable us to get many quantita- 
tive results in addition to the qualitative ones. 
However, these methods are usually efficient 
only for weakly nonlinear differential equations, 
that is, differential equations differing only 
slightly from linear differential equations (gen- 
eral nonlinear differential equations are called 
sometimes strongly nonlinear differential equa- 
tions). In this sense, analytic methods are all 
fperturbation methods in the wider sense [3], 
and the variety of the methods lies in the form 
of perturbation and the method of calculation. 
To make use of analytic methods, we always 
reduce the given differential equation to a 
differential equation of the form 


x= Ax 4 £X(X, t, £). (5) 


Here £ is a parameter with small absolute 
value, A is a matrix of the form A= 
diag(O,, B), where O, is a p x p zero matrix 
and B is a matrix whose eigenvalues have all 
nonzero real parts, and X(x, t, c) is periodic or 
almost periodic in t. 

(i) When X(x, t, £) is periodic in t, Poincaré's 
perturbation method is used frequently. 

(ii) In practical problems, we frequently meet 
the case A —0, that is, the case where (5) is of 
the form 


X = £X(x, t, 8). (6) 
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In this case, the average X,(x, c) — 

lim; ,o(1/T) fd X(x, t, c) dt exists. If dx/dt = 
Aal, 0) has a periodic solution &(t) and the 
related variational linear system is regular, 
then (6) has an (almost) periodic integral mani- 
fold x — f, (t, 0) such that f,(¢, 0) £(80) uniformly 
as £20. The method of averaging based on 

this fact was devised by Bogolyubov and 
Mitropol’skii [4] and is used frequently. The 
equation 


X-Fo? x — ef(x, X, t,£) (7) 


is one of the examples that can be reduced to 
an equation of the form (6). Equation (7) ap- 
pears frequently in practical problems, and 
hence various convenient techniques are de- 
vised, such as the method of linearization, the 
asymptotic method, the method of harmonic 
balance [4], etc., through which we can apply 
the method of averaging directly to the given 
equation (7). 

(ii) Consider an w-periodic system 


dx/dt = A(t)x + f(x, t). (8) 


For any x(t)e P,,, dx/dt = A(t)x + f(x(t), t) is of 
the form (4) and it has an c-periodic solution 
Mi a6 + G[Nx], or 


m 


(LEM ača] nx- Y ih. gaer (9) 
k=1 k=1 
under the condition (n*, Nx)=0 (k=1,...,m) as 
in Section B, where N: P „>P, is defined by 
Nx=f(x(-),-), and the o-periodic solutions of 
(8) correspond to the fixed points of the trans- 
formation TR P,, induced by (9), where the 
a, in (9) are given by a, — (£^, x), as is required 
when x is a fixed point. This is the alternative 
or bifurcation method [6—9], which is extend- 
able to the case of almost periodic systems 
under the regularity of (3) [10]. In order to 
obtain a fixed point of T, various kinds of 
fixed point theorems can be utilized. 

(iv) In Section C and (iii) above, the search 
for a fixed point of an appropriate mapping is 
a principal device. However, the choice of a 
suitable domain for the mapping is a crucial 
problem, and usually an a priori bound for 
solutions is looked for. The concept of stability 
and hence Lyapunov's second method are 
effective in such situations [11]. 


E. Numerical Methods 


Numerical methods are used for obtaining 
explicit forms of the oscillations. They are 
convenient in practical applications, since they 
can be used efficiently whether or not the 
nonlinearity of the system is weak. For an 
autonomous or periodic system, one can uti- 
lize the following methods as efficient means 
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of computing periodic solutions: Newton's 
iterative method, the finite-element method, the 
Lindstedt-Poincaré method, the method of 
multiple scales, the method of harmonic bal- 
ance, the Galerkin method, etc. [12-15]. 

For weakly nonlinear cases, analytic meth- 
ods (that is, perturbation methods) are very 
efficient. However, when a parameter is fixed 
beforehand, it is not easy to know whether 
the conclusion obtained by the perturba- 
tion method is valid for the given value. For 
strongly nonlinear cases, it is very difficult 
to analyze the problems by analytic methods, 
and at present hardly any efficient methods 
exist. Numerical methods will therefore be- 
come exceedingly important in research on 
nonlinear oscillations. These are now of toler- 
able efficiency and reliability, due to progress 
in high-speed machine computation. 


F. Nonstationary Oscillations 


Physically speaking, an oscillation is a station- 
ary state. However, when a system contains a 
parameter varying slowly with time, the oscil- 
lation also varies slowly with time (for exam- 
ple, when the length of a pendulum varies 
slowly, the amplitude of the pendulum also 
varies slowly). Such variation of oscillations in 
the course of time is represented by dx/dt — 
ef(x, t, et, c), where f(x, t, s, c) is (almost) peri- 
odic in t, s. Such cases provide important 
problems in the theory of nonlinear oscilla- 
tions; one such problem has been posed by 
Mitropol’skii [16] under the name nonstation- 
ary oscillations and has been investigated by 
means of the method of multiple scales [15]. 
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A. General Remarks 


Nonlinear problems deal with nonlinear map- 
pings or operators and the related equations. 
Until recently it was customary to consider 
nonlinear problems as belonging to applied 
mathematics and the physical sciences. How- 
ever, nonlinear problems now belong to mod- 
ern mathematics. Many phenomena in math- 
ematical physics are essentially described 

by nonlinear equations, e.g., the motions of 
several particles or of viscous or compressible 
fluids (— 420 Three-Body Problem, 204 Hy- 
drodynamical Equations). Some of these equa- 
tions are approximated by linear equations 
only when the variables appearing in the equa- 
tions are restricted to very small domains; 
they are treated by perturbation methods 
when the variables stay in comparatively small 
domains. If these requirements cannot be met 
and we have to deal with equations in which 
the variation of the variables are not negli- 
gible, nonlinear problems certainly arise. 

The methods of solution of nonlinear prob- 
lems are not as powerful or general as those 
for linear differential equations. For instance, 
the tprinciple of superposition of solutions 
does not hold for nonlinear problems, and 
therefore Fourier methods are no longer ap- 
plicable. Indeed, some nonlinear problems can 
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be dealt with only by means of very particular 
or ad hoc devices. 


B. Methods Used in Nonlinear Problems 


Consider a nonlinear equation 
G(x)=0, (1) 


where G is a nonlinear mapping or operator of 
a subset S of a linear space X into itself. If we 
put G=I-—F (I is the identity), equation (1) 
becomes 


x = F(x). (2) 


Then a solution of (2) is a fixed point of F. 
Therefore fixed-point theorems of various kinds 
are useful for solving (2) (— 286 Nonlinear 
Functional Analysis). 

Let x” eS, and suppose that we can define 
x, k=1,2,..., by 


x= Fiat In  k-1,2,.... 


If F is continuous and the sequence xf? con- 
verges to a point xe$, then x is a fixed point 
of (2). Such a method of constructing an ap- 
proximate sequence by iteration is called the 
iterative method. *Newton's iterative process, 
given by 


x) - x*-D t [G'(x*^! »] -1 G(x*^ D), 


is one such method, where G' denotes the 
*Fréchet derivative of G. 

If X is an infinite-dimensional space, many 
concepts and methods of the theory of func- 
tional analysis can be used for a number of 
nonlinear problems (— 286 Nonlinear Func- 
tional Analysis). 

We note that there exist nonlinear trans- 
formations that change nonlinear equations 
into linear ones. For example, the thodograph 
method, which is often applied in hydrody- 
namics, consists of reducing a system of fquasi- 
linear partial differential equations of the form 


A;(u, v)u, + Blu, v)u, + C,(u, v)v, + Du, v)v, =0 


to linear differential equations 
Aiy, — B;x, — Ciy, D;x, se (i— 1,2) 


by means of the *hodograph transformation, 
which changes the independent variables from 
x, y to u, v (2 205 Hydrodynamics). 


C. Nonlinear Algebraic and Transcendental 
Equations 
Consider a system of equations 


Dis, deet (i=1,...,n). (3) 
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Newton's iterative process can be applied as 
follows. Starting with a point x — (x9, ... , 
x) lying near the desired solution, we define 
xP =(x®,..., x), k— 1, 2, ..., by solving the 
system of equations 


of; 
1 Ox; 


D 





E ee =x% D) + f(x 1) =O 


IM 


[i 


Under some conditions the iteration {x} 
converges to the solution (— 301 Numerical 
Solution of Algebraic Equations). 

If the system (3) is a real one, (3) is equiva- 
lent to X f? —0. It is clear that a method of 
obtaining a minimum of a function f(x,, ..., 
x,) is applicable to solving È f? =0. Taking a 
point x, we define 


x5 —x* DA QVf(x* TA 


where Vf — (0f/Ox,, .. 
satisfy 


FP — à, VAP) fex — AVP) 
(420). 


kem uo 
. , Of/Ox,), and let AA , 


Under suitable assumptions, a subsequence 
x) converges to x such that Vf(x) 2 0 and 
f(x*?) decreases monotonically to f(x) [1]. 

Let f be a continuous mapping from a 
domain D of R" or C" into itself. Then for any 
x? c D, the iteration x? can always be defined 
by x® = f(x* 1. The problem of the behavior 
of x is an interesting and important one in 
pure mathematics; recent work in the physical 
sciences is yielding many new concepts related 
to this problem (— 126 Dynamical Systems, 
433 Turbulence and Chaos). 


D. Nonlinear Differential Equations 


Consider a nonlinear system of ordinary dif- 
ferential equations 


dx/dt —f(t,x) | (xeR") 


The initial value problem with initial condi- 
tion x(1) = € is equivalent to the problem of 
solving the nonlinear integral equation 


t 
x(t)=č+ | fs, x(s)) ds. (4) 
The solution of (4) is a fixed point of the 
operator T: (t) ë+ fi f(s, o(s)) ds defined 
for a suitable function space. Therefore fixed- 
point theorems are applicable to T, and the 
iterative method for T is called the "method of 
successive approximation. Also the method of 
the tCauchy polygon is useful (= 316 Ordi- 
nary Differential Equations (Initial Value 
Problems)). These ideas are extended to an 
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abstract Cauchy problem, 


du/dt- A(tu (t>0),  u(+0)=a, 


in a Banach space X, where ae X and A(t) isa 
nonlinear operator (— 286 Nonlinear Func- 
tional Analysis). 

For extensive studies of nonlinear ordinary 
and partial differential equations — 314 Ordi- 
nary Differential Equations (Asymptotic 
Behavior of Solutions), 290 Nonlinear Oscil- 
lation, 394 Stability, 321 Partial Differential 
Equations (Initial Value Problems), 323 Par- 
tial Differential Equations of Elliptic Type, 
325 Partial Differential Equations of Hyper- 
bolic Type. 

Nonlinear differential equations of special 
types appear in many fields of pure and ap- 
plied mathematics, e.g., the Monge-Ampére 
equation and the equation for minimal sur- 
faces in differential geometry (— 183 Global 
Analysis, 275 Minimal Submanifolds), and 
the Toda lattice equation and the Korteweg- 
de Vries equation in mathematical physics 
(— 287 Nonlinear Lattice Dynamics, 387 
Solitons). 


E. Nonlinear Problems of Control Systems 


The basic equation for a tcontrol system in 
which the state of the controlled object can be 
represented by an n-vector x is given by the 
following system of differential equations [3]: 


Ziel,  c-—cx—y6 (5) 


where x and Ë stand for dx/dt and dé/dt, re- 
spectively, č is a scalar function representing 
the control, b, c are constant n-vectors, y is a 
constant number, c' is the transpose of c, and 
A is a constant n x n matrix whose character- 
istic roots have negative real parts. Further- 
more, we assume that when the control has 
no effect upon the system, x is determined by 
X — Ax. The quantities under consideration 
are all real. Finally, the function o = (ø) is a 
scalar function characteristic of the control 
mechanism. Generally, o is nonlinear in c, and 
hence equation (5) is nonlinear. Normally we 
assume that ọ has the following properties: (i) 
Q(c) is a real-valued continuous function on 
(—00, œ) with g(0)=0 and eo(c) — 0 for c z 
0; (ii) [F ” e(a)da = +00. We say that (5) is 
absolutely stable if 


£(t)20 as 


X— Ax — q(a)b, 


x(t)-50, t— +00 


for any choice of o subject to (1) and (ii) and 
for every solution of (5). In the study of control 
systems an important problem is to obtain a 
necessary and sufficient condition for the 
system to be absolutely stable. In this connec- 
tion, we have the following result, due to M. V. 
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Popov: (5) is absolutely stable if there exists a 
nonnegative q such that 


Re((1-r ioq)(c'(ioI — A) ! b)] -qy 20 (6) 
for any real o, where I is the n x n identity 
matrix. Conversely, if the absolute stability of 
(5) is given by means of the tLyapunov func- 
tion (— 394 Stability) 


V(x, o)=x'Bx -- ao? - af'x s| q(o) do, 
D 


where B is a constant matrix and f is a con- 
stant vector, then there exists a nonnegative 
q for which (6) is satisfied for all real c. 


F. Nonlinear Equations in Applied 
Mathematics 


Some examples of nonlinear problems are 
given here (— also 205 Hydrodynamics; 318 
Oscillations). 

(1) The nonlinear tdifferential-difference 
equation 


du(t)/dt 2 (a — u(t — 1))u(t) 


is called the Cherwell-Wright differential equa- 
tion [4]. Given the initial condition u(t) — g(t) 
(0x t € 1, g(t) is a given continuous function), 
its solution is uniquely determined for 0x t « 
oo. If a «0 and g(1)» 0, then u(t) 30 as t5 

oo; if a=0 and g(1) « 0, then u(t) —oo as t5 
oo. For a>0, u(t) —oo as too if g(1) «0, 
while u(t) either approaches a monotonically 
(0 «a x 1/e) or oscillates (boundedly) around 
a (a 1/e) if g(1)>0. In particular, we have 
damping oscillations for a x 3/2, while oscilla- 
tory solutions without damping appear for 

a n/2. 

(2) If we regard a star as a gas sphere and 
assume the polytropic relation p= Kp’ (K and 
y are constants) between the pressure p and the 
density p at each point inside the star, then 
we have a differential equation of the second 
order that determines the density distribu- 
tion. This equation constitutes the basis of the 
classical theory of the internal structure of the 
stars and is called Emden's differential equa- 
tion or the polytropic differential equation. It 
reads: 


(1/8*)4 (7? d0/dč)/dč = — 0", 


where y — 1+ 1/n, p — 40^, r a£, a — ((n4- 
1)K4""?/(4nG))'7, with r the distance from 
the center, G the universal gravitational con- 
stant, and 4 an arbitrary constant. The solu- 
tion that satisfies the conditions 0 — 1 and 
d0/dé —1 at €=0 is called the Lane-Emden 
function of index n. Emden's equation is in- 
variant under the transformation £— AC, 

0— A ?/"790 (with A an arbitrary constant). 
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Although Emden's equation can be reduced to 
an equation of the first order, it has not been 
possible to solve it analytically except for the 
cases n —0, 1, and 5. For certain values of n 
between 0.5 and 6, Emden gave numerical 
solutions, which were refined later by Green, 
D. H. Sadler, and D. C. Miller [5]. 

(3) Caianiello's differential equations, which 
describe the state of a network of neurons [6], 
are 


x,(t -- t) — d x aj x(t —rt) — J ' 

jr 

where the function x;(t) represents the state of 
the ith neuron at the time t and takes only the 
values 0 and 1. The state of the system is to be 
considered at the discrete times t — 0, c, 21, .... 
The (real) coefficient af? represents the weight 
of the effect of hysteresis in the relay process 
from the cell j to the cell i. The nonnegative 
integer 0; is the threshold value of the cell i. 
Y[x] is the unit step function that is equal to 1 
for x >0 and vanishes for x « 0. 

(4) The following Hodgkin-Huxley differen- 
tial equation arises in the study of conduction 
and excitation in nerve systems [7]: 

OV 2rg ( CH 


P Co, t gm h(V — V) 


tgjn*(V— Vj) gi(V— 4) 


™ = —(0,(V) + B,(V))m+0z,(V), 


ôt 


^ 


h 
A — (x, (V) 3- B5(V))h  «;(V), 


oe — (a3(V) + B3(V))n+ «3 (V), 

where a;, fj; (1 ix: 3) are given functions of V, 
and ro, Ro, Co, gi, V, (1 <i< 3) are constants. 
The unknown function V is sought in the 
domain 0 « x « o0, 0 « t « oo, while the initial 
values of V, m, h, n, and the boundary value of 
V are given at t=0 and x=0, respectively. 
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A. Problems 


A nonlinear programming problem is a type of 
mathematical programming problem where it 
is required to minimize or maximize a non- 
linear function 0(x) of n-vector variable x 
defined in a closed connected set X? with a set 
of linear or nonlinear constraints. Minimiza- 
tion or maximization of a continuously dif- 
ferentiable function 0(x) under the equality 
condition expressed by a set of continuously 
differentiable functions has been traditionally 
dealt with by the method of Lagrange multi- 
pliers. Hence a typical nonlinear programming 
problem is usually formulated as follows. 

(NLP) Minimize 0(x) under the condition 
xe X? CR" and g(x) «0 for i2 1,2, ...,m. 

Or, equivalently, determine the set of all 
x such that 0(x) 2 min, c 0(x), where C= 
{x|xeX° and g(x) <0}. 

Here we need only consider minimization, 
since maximization problems can be converted 
to minimization problems by virtue of the 
obvious relation max 0 = —min(— 0). 

The set C is known as the feasible region or 
the constraint set, and x is called an optimal 
solution or simply a solution. In many non- 
linear programming problems X? is R”. If X? 
— R" and 0 and g are linear functions on R", 
then the problem becomes a linear program- 
ming problem (— 255 Linear Programming). 
The problem of minimizing a quadratic func- 
tion subject to linear constraints is called the 
quadratic programming problem (— 349 Qua- 
dratic Programming). If X? is a convex set, 0 
is convex (or concave), and the g; are convex 
on X°, then the minimization (or maximiza- 
tion) problem is known as a convex (or con- 
cave) programming problem. Convex and con- 
cave functions are important in nonlinear 
programming, because they admit reason- 
ably straightforward sufficient conditions for 
optimality and also because they constitute the 
only important class of functions for which 
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necessary optimality conditions can be given 
without the differentiability condition. Subject 
to suitable modifications, the method of La- 
grange multipliers can also be applied to the 
solution of nonlinear programming problems. 
The Lagrangian function y associated with 
the minimization problem (NLP) is defined by 


W(x, u) = 0(x) + u'g(x), 


where u —(u,, ...,u,) and w denotes the vector 
of Lagrange multipliers. 

A pair (x, i) is called a saddle point of (x, u), 
provided xe X°, uc R”, uz 0, and y(x,u)« 

V (x, u) € v (x, u) for all xe X? and all ue R” 
such that u>0. It follows easily that: 

(1) (H. Uzawa [18]) If (x, u) is a saddle point 
of v (x, u), then x is an optimal solution. 

(2) (Kuhn and Tucker [11]) Assume that X? 
is open and convex, and that 0 and g are dif- 
ferentiable and convex. If there exists a pair 
(X, ii) such that 





VO(X)-u'Vg(X)-0, xeC, üug(x)-0, üz0 


(where V0(x) and Vg(x) denote, respectively, 
the gradient vector of 0 at x and the Jacobian 
matrix of g at x), then X is an optimal solution. 


B. Necessary Conditions for Optimality 


In Section A sufficient conditions for opti- 
mality were given. Some necessary conditions 
are also known. 

(3) When (x) is continuously differentiable 
and all the g; are linear, that is, when the con- 
dition is expressed as Ax <b and x 2 0, where 
A is an m x n matrix and b an m-vector, X is 
an optimal solution only if x is an optimal 
solution of the following linear programming 
problem. 

(LP?) Minimize v'x under the condition that 
Ax € b, x 20, where v 2 VO(x). 

By applying the duality theorem of linear 
programming, it can be proved that there 
exists a vector i z 0 such that 


Vo(x, u)z V0(x)d- u'A—0. 


(4) (F. John [9]) Assume that X? is open, 
and that 0 and g are differentiable. Then, if x is 
a solution of the problem (NLP), there exist 
ug Z 0 and ue R" such that 


ig VO(X) + i Vg(x) =0, 


g(X)&0, u'g(X)-0, m20. 


Now we consider the following: The vector- 
valued function g is said to satisfy Guignard's 
constraint qualification at an inner point x of 
X? if any vector y satisfying the linear inequal- 
ities Vg;(x): y <0 for ie I= {i|g,(x) -0j is in 

the convex hull spanned by the vectors tangent 
to the set C at x. 
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(5) (Guignard [7]) Let X? be an open set, let 
X be an optimal solution of (NLP), let 0 and g 
be differentiable at x, and assume that g satis- 
fies the Guignard constraint qualification. 
Then there exists me R” such that V@(x)+ 
u'Vg(x)=0, g(x) «0, and u'g(x) 20, uz 0. 

Guignard's constraint qualification is satis- 
fied if neither of the following conditions 
hold: (L) Vectors g,(x) for ie/ are linearly 
independent. 

(S) (Slater's constraint qualification) The g; 
are convex and X? is a convex set; and there 
exists a vector x such that g,(x) « 0 for all i. 
With convexity of 0 and g,, differentiability is 
not required: 

(6) (Kuhn and Tucker [11]) Let X? bea 
convex set, let 0 and g be convex on X?, and 
assume that g satisfies Slater's constraint 
qualification. If x is an optimal solution, then 
there exists ue R”, uz 0, such that v'g(x) 20 
and (x, i) is a saddle point of w(x, v) = 0(x) - 
u'g(x). 


C. Sensitivity Analysis 


Now consider the following class of problems. 

(Pc) Minimize the function 0(x) under the 
condition that g(x)<c, where c is a real m- 
vector. We denote by x, the set of the solutions 
of the problem (Pc) and denote 0* = OG Le 
X.. Suppose that Guignard's condition is 
satisfied for each c and that the set of La- 
grange multipliers A, is nonempty. Then for 
any real vector a, we have 


€^ 


infa'u, < lim RI ,70*)€supa'u, u,eA,. 
Therefore, if the Lagrange multiplier is unique- 
ly determined for some c, then u, represents 
the vector of the rates of increments of the 
objective function to the small increments of 
the components of the constraints vector. 
Hence the components of the Lagrange- 
multiplier vector are called the imputed 

prices or shadow costs of the constraints; these 
have important economic !mpilcations, espe- 
cially when the objective function is expressed 
in terms of money or profits. 

More generally, we can consider the follow- 
ing class of problems. 

(GPc): Minimize the function @(x, c) under 
the condition that g(x, c) x 0, where c is a real 
parameter. 

Denote by x(c) and u(c) the solution and the 
Lagrange multiplier of (GPc) corresponding to 
the parameter c, respectively, and let 0*(c) = 
0(x(c)). Then under a set of regularity condi- 
tions we have 


V6*(c) — V.O(x(c), c) + u(c) V.g(x(c), c), 


292 D 


Nonlinear Programming 


where V, denotes the gradient of 0 or g with 
respect to the parameter. 


D. Duality 


A duality theorem in mathematical program- 
ming is the statement of a certain relationship 
between two problems. This relationship has 
the following two aspects: (1) one problem is a 
constrained minimization problem and the 
other is a constrained maximization problem; 
(ti) the existence of a solution to one of these 
problems ensures the existence of a solution 
to the other, and in this case their respective 
values are equal. 

Let w(x, u) be the Lagrangian form of 
(NLP), and define c(u) —inf,, ve W(x, u). Then 
we can state the following two problems. 

(P) (primary problem) Miminize 6(x) under 
the condition that xe X? and g(x) «0 

(D) (dual problem) Maximize w(u) under the 
condition u >Q. 

If (x,u) gives a saddle point of w(x, u), then 


infü(x) = 0(X) 2 sup w(u) = w(u) = W(X, u). 


The dual problem can be formulated alterna- 
tively as follows. 

(D) Maximize (x, u) 2 D) - u'g(x) sub- 
ject to x, u)e Y= (G5 u)|xe X?, ue R”, uz 0, 
V.aj(x, u) 20] (where Vy (x, u) denotes the 
vector whose components are the partial de- 
rivatives Oy (x, u)/Ox; for i— 1, ...,n). 

There are a number of duality theorems 
related to problems (P) and (D); two such 
theorems are as follows: 

(1) (P. Wolfe [20]) Suppose that X? is open 
and convex, that 0 and g are differentiable and 
convex, and that g satisfies the Kuhn-Tucker 
constraint qualification. Then, if x is a solution 
of (P), there exists a me R” such that (x, u) is a 
solution of (D) and 0(x) = (Xx, ij). 

(2) (O. L. Mangasarian and J. Ponstein [13]) 
Suppose that X? is open and convex, that 0 
and g are differentiable and convex, and that 
(%, à) is a solution of (D). If W(x, OU is strictly 
convex in some neighborhood of X, then X is a 
solution of (P) and 0(X) 2 v (X, á). 

The above two problems (P) and (D) are not 
symmetric. The notion of symmetric duality 
was introduced by G. B. Dantzig, E. Eisen- 
berg, and R. W. Cottle: 


Primary: Minimize 

F(x, u)= K(x,u) —v'V, K(x, u), 
subject to the constraints 
V,K(x,u)&0, x >0, and uz 0; 
Maximize 

G(x,u)& K(x,u) — x V. K (x, u), 
subject to the constraints 
V.K(x,u) 20, x z0, and uz 0, 


Dual: 
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where K is continuously differentiable in 
(x, u)e R” x R”. 

(3) Dantzig et al. proved [6] the existence of 
a common optimal solution (x, i) to both the 
primary and dual problems, provided (i) an 
optimal solution (x, u) to the primary problem 
exists, (ii) K is convex in x for each u and 
concave in u for each x, and (itt) K is twice 
differentiable and the matrix of second partials 
(0^ K/0u'Qu?) is negative definite at (x, u). 

Rockafellar [15] gave another expression of 
the duality relation: Define F(x, y)= 0(x) if 
g(x) € y and = oo otherwise, and denote ov 
— inf, yo F(x, y). Then 0(x) 2 ¢(0). For any 
nonlinear function q( y), the conjugate q*() is 
defined by 


Q* (n) —sup (n'y — oy). 


(4) (Rockafellar [15]) sup w(u) = 9**(0) — 
clco (0), where clco o( y) denotes the closed 
convex hull or maximum convex minorant 
of o(y) defined by clco g(y) ^ sup, l'y('z < 
«(z) for all z}. It follows from the above that 
inf0(x) 2 sup w(u) if and only if ol — clco (0), 
which holds true if (y) is convex. 

Further forms of the duality theorem hold 
for linear or quadratic programming problems 
(— 255 Linear Programming, 349 Quadratic 
Programming). 


E. Algorithms 


In a limited class of problems, i.e., when the 
objective function is quadratic and the con- 
straints are linear (— 349 Quadratic Program- 
ming) the optimal solution can be obtained by 
solving a system of linear equations by the 
simplex method or other algorithms; but in 
most nonlinear programming the solution is 
calculated by some kind of iterative procedure. 
Note that even when the constraints are given 
as equalities and all the functions involved are 
continuously differentiable, so that the optimal 
solution is explicitly given as a solution of a set 
of simultaneous equations, we usually require 
some iteration procedure, such as the Newton- 
Raphson algorithm, to obtain numerically the 
solution with preassigned accuracy; and the 
iteration is not always easy if the functions are 
sufficiently complex. 

Several iterative procedures for solving non- 
linear programming problems have been pro- 
posed. Since the simplex method is a powerful 
tool in linear programming, one type of ap- 
proach is to obtain an approximately optimal 
solution by approximating the objective and 
the constraint functions by piecewise linear 
functions and then applying linear program- 
ming techniques to get the approximately 
optimal solution within each region. Another 
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is to change the constrained problem to a 
nonconstrained one by introducing a suffi- 
ciently large number M, called the penalty, 
and then maximizing 


0, (x) = O(x) + M) max(g,(x), 0) 


without the constraint. This is called the pen- 
alty method. 

The third and most generally applicable 
technique is the gradient method, of which 
several variations are known: 

(1) Arrow-Hurwicz-Uzawa gradient method 
[4]. Concave or convex programming prob- 
lems can be solved by finding a saddle point 
of the Lagrangian function w(x, u). Let (x, u) 
be strictly concave and of class C? in n-vector 
x >0 and convex and of class C? in m-vector 
u>0 and possess a saddle point (x, i). To 
approach a saddle point of (x, u) it is natural 
to devise a gradient process of the form 





J 
= $ = S 1 
dt 0x; dt Ou; SH 
To keep the variables in the positive orthant, 


we need to modify (1), and we consider the 
following system of differential equations: 


d 
d if x,-0 and <0 


dx, — 
dt lô 

E otherwise 

Óx; 

(i=1,...,n), 
; 6p 

dn 0 ifu;=0 and m 
dt Con l 

———- otherwise 

Ouj 


Under certain regularity hypotheses, there 
exists a unique solution (x(t), u(t)) of the sys- 
tem with any initial point (x9, u°), and the x 
component x(t) of the solution converges to x 
as t> oo. 

Applying the above results to the Lagran- 
gian function w(x, ul we can solve the concave 
or convex programming problem. 

(ii) Rosen's gradient projection method [16]. 
If a point x? of the feasible region does not 
give a solution for the minimization problem 
(P), then we look for a feasible point with a 
lower function value by proceeding from x? in 
the direction of the gradient of the function 
— (x). The method fails if x? is a boundary 
point and if the gradient vector points toward 
the exterior of the feasible region. Rosen's 
method [16] is to project the gradient onto the 
boundary of the feasible region and then pro- 
ceed in the direction of this projection. In this 
manner, we remain on the boundary of the 
feasible region. 
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(i) Methods of feasible directions. These 
were first described by G. Zoutendijk [21 ]. 

Consider the problem of minimizing 0(x) 
subject to the constraint xeS c R", where S is 
a closed, connected set satisfying certain regu- 
larity conditions and @ is a continuously dif- 
ferentiable function of the n-vector x, such 
that, for some o, the set (xeS|0(x) aj is 
bounded and nonempty. 

A method of feasible directions is any recipe 
for solving this problem by proceeding along 
the following lines: (1) Start with some x° eS 
such that 0(x9) « sup a. (2) Pass from the kth 
iteration point x* to x**! by first determining 
a direction s* in x* such that the ray x* + As* 
lies in S for all sufficiently small 4 0. (3) Then 
determine the step length 4,, thus obtaining 
the (k+ 1)st iterate x**! = x* + 2,5s*. (4) Repeat 
this procedure until some prescribed stopping 
condition is satisfied. 

There are many methods of determining the 
s*, and in most cases the 4, are then deter- 
mined by solving a one-dimensional mini- 
mum problem along the direction so obtained. 
Zoutendijk has unified the various possible 
methods of feasible directions and the relevant 
normalization rules that yield the optimal 
directions s*, and has investigated this subject 
in detail from the viewpoint of computational 
technique [21, 22]. 


F. Generalizations 


The extension of the Kuhn-Tucker theory to 
linear topological spaces is due to L. Hurwicz 
[8]. Let X be a linear space, Y, Z be linear 
topological spaces, P,, Pz the nonnegativity 
cones of Y, Z, respectively, which are closed 
convex cones containing inner points, D a 
convex set in X, and F, G concave mappings 
(— 88 Convex Analysis A) from D into Y, Z, 
respectively, such that G(D) contains an inner 
point of PR If F(X) attains its maximal point 
when X = X, and X, satisfies G(X) 20, X,€ D, 
then there exist Y; 20, Zë 20 such that 

D(X, Z*)= Y*(F(X)) 3- ZE(G(X)) has a saddle 
point at (Xo, Z9). The condition that P, and 

P, have inner points can be weakened to cover 
the cases of (l), (Lp), (s), (S) (Hurwicz and 
Uzawa). P. P. Varaiya [19] considered the 
following nonlinear programming problem in 
Banach space. 

(B) Maximize f(x) subject to xe A, g(x)e Ay, 
where X, Y are real Banach spaces, xe X, g: 
X Y is a Fréchet differentiable mapping, f is 
a real-valued differentiable function, A is a 
subset of X, and A, is a convex set in Y. 

The main results are similar to the Kuhn- 
Tucker necessary conditions. Varaiya also 
exhibited a saddle value problem related to 
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(B), when A, is a closed convex cone. L. W. 
Neustadt [14] investigated nonlinear pro- 
gramming problems in linear vector spaces 
and gave an application to the theory of 
optimal control. The main results are: (i) 
Kuhn-Tucker type conditions which are both 
necessary and sufficient for optimality, (ii) a 
duality theory for obtaining multipliers in the 
generalized Kuhn-Tucker conditions, and (iii) 
an application to optimal control theory. 
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A. General Remarks 


Nonstandard analysis is a new field of research 
that has branched off from model theory and 
that provides a powerful method applicable in 
almost all fields of mathematical science. 

About 1960, A. Robinson successfully used a 
nonstandard model of the real number field R 
to justify Leibniz-type infinitesimal calculus. 
After this, using higher-order logic, he devel- 
oped a stronger theory of nonstandard analy- 
sis, and applied it to other mathematical fields 
[1]. 

In this article, we adopt a first-order logic 
over a universe. In the final section we present 
a theory called nonstandard set theory; this is 
a conservative extension of Zermelo-Fraenkel 
set theory with the axiom of choice. 


B. Axioms for Nonstandard Analysis 

A nonempty set U is called a universe if the 
following four conditions are satisfied: 

(a) xeU, yex — yeU; 

(b) x, ye U = (x, y)eU; 

(c) xeU =| JxeU; 

(d) xe U = P(x)eU. 


In what follows, U will be a universe contain- 
ing the field R of real numbers. 

We construct a language Z describing 
mathematics in U. The alphabet consists of the 
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following symbols: (1) countably many vari- 
ables. (2) constants corresponding to all ele- 
ments in U. (3) predicate symbols — and e. (4) 
logical symbols 4, ^, v, >, 3, V. (5) auxiliary 
symbols [ , ]. The last two symbols will be 
omitted where there is no danger of confusion. 

Definition of formulas. (1) If t and s are 
terms (variables or constants), then t =s and 
tes are formulas (atomic formulas). (2) If a 
and y are formulas, then 14, daw, $ vy, 
$— y are formulas. (3) If ¢ is a formula and x 
is a variable, then 3x[4$] and Vx[4] are for- 
mulas. (4) The formulas are those that can be 
constructed by the above procedure. 

A formulas ¢ containing at most n free 
variables is called an n-ary formula. In this 
case, we often write ó(x,, ..., x,) for ¢. A sen- 
tence is a 0-ary formula. We write F $ if a 
sentence ¢ is true under the usual inter- 
pretation. 

Consider a quadruple (U, *U, *e, «) where 
*U is a set, *e is a binary relation on *U, and 
x iS a mapping: a *a from U to *U. An 
element x of *U is called standard if x = *a for 
some ac U, and nonstandard if not. Adjoin to 
¥ constants representing all nonstandard 
elements of *U. Then we have a language * Z 
for *U. A sentence in ¥ is a sentence in * Z. 
Taking *U as the scope of quantifiers, we can 
interpret every *.2-sentence in *U. We write 
*E à if an *#-sentence ¢ is true under this 
interpretation. 

Axiom 1 (transfer): For every sentence d in 
L, we have 


Fe "té 


Definition: Let (x, y) be a binary formula in 
£ (resp. in *.7). We say that (x, y) is concur- 
rent in U (resp. in *U), if, for every finite num- 
ber of elements a,,...,a@, in U (resp. in *U), 
there exists an element b in U (resp. in *U) 
such that - $(a;, b) (resp. *E- ¢(a;, b)) for 
Ixixn. 

Axiom 2 (enlargement): If a binary formula 
#(x, y) in Z is concurrent in U, there exists an 
element f in *U such that * (a, f) for all a in 
U. 

A quadruple (U, *U, *e, *) or simply *U 
is called an enlargement of U if it satisfies 
Axioms 1 and 2. 

These axioms are strong enough to develop 
the basic theory of nonstandard analysis, but 
sometimes we require a stronger axiom. Let x 
be an infinite cardinal. 

Axiom 3 (x-saturation): Let ó(x, y) bea 
binary *Z -formula concurrent in *U. Then, 
for every subset A of *U with cardinality at 
most x, there exists an element f of *U such 
that *H Ais, P) for all xe A. 

*U is called a x-saturated model if it satisfies 
Axioms | and 3, and a x-saturated enlarge- 
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ment if it satisfies Axioms 1, 2, and 3. If x is 
not less than the cardinality of U, then a xK- 
saturated model is necessarily an enlargement. 

In the remainder of this article, *U will be 
an enlargement of U, if the contrary is not 
explicitly mentioned. 

For «in *U, 4 is the set of €€*U such that 
é*ea:d— (£e*U|£*ea]. For simplicity, we 
write € for *e and identify å with «. Under this 
identification, a subset A of *U is called inter- 
nal if it belongs to *U and external if not. 

Axiom 1 and 2 imply the following results. 

(1) There exist infinite hypernatural num- 
bers, namely, elements of *N bigger than all 
*n, ne N. 

(2) There are positive infinitesimal hyperreal 
numbers. 

(3) For every set A in U, there exists a 
hyperfinite internal subset of *A containing all 
*a, ac A. Here, I is called hyperfinite if we 
have *E AL where ¢(x) is a formula that says 
"x 1s a finite set.” 

(4) Let A be a set in U. Then *A has non- 
standard elements if and only if A is an infinite 
set. 

(5) If a family ¥ of sets in U has the finite 
intersection property, then the family {*A| Ae 
F} in *U has a nonempty intersection. 

(6) Let c: *N 5 *R be an internal hyper- 
sequence. If «(*n) is infinitesimal for every 
n€ N, then there exists an infinite hypernatural 
number A such that «(v) is infinitesimal for 
every hypernatural number v less than 4. 

If we moreover assume Axiom 3, we have 
the following results. 

(7) The cardinality of an internal set is either 
finite or more than x. 

(8) If a family F of internal sets in *U has 
the finite intersection property and cardinality 
k at most, F has a nonempty intersection. 

(9) Introduce the order topology in *R. 
Then every subset of *R with cardinality at 
most vis bounded and discrete. 

(10) Let a, f be internal sets and C a subset 
of oa with cardinality at most x. Then every 
mapping from C to fi can be prolonged to an 
internal mapping from « to f. 

(10°) In particular, every external sequence 
Nf can be prolonged to an internal se- 
quence *N— f. This follows from the assump- 
tion of countable saturation. This fact plays an 
essential role in nonstandard probability 
theory, which is now in the process of rapid 
development (— Section D (3)). 


C. Construction of Ultrapower Models 


Let I be an infinite set. A mapping « from I to 


U is a family of elements in U with indices in J. 


So we write (a(i) Xie; or simply (a(i)» for a. 
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Let F be a nonprincipal ultrafilter on J and 
define an equivalence relation ~, on the set 
U' of all mappings from I to U: 


(x)? ~s LPO {iel ali) = BO} e. 


Denote by *U the quotient set of U under the 
relation ~,. The class of (a(i)»;., will be 
denoted by [a(i) ];., or simply [a(i)]. 

Define a binary relation *e on *U: 


[a(i] ze ëss tie Ha(i)e BG); e. 


Let « be the diagonal mapping from U to *U 
and consider the quadruple (U, *U, *e, «). 

Theorem (Los): The foregoing quadruple 
satisfies Axiom 1. Moreover, *U is a countably 
saturated model. 

In particular, let J be the set of all finite 
subsets of U. For iel, put u(i) - ( jel |icj}. 
Then the family of sets Z = { u(i)ļie I} has the 
finite intersection property, and therefore there 
exists an ultrafilter F on I including 2 (use 
*Zorn’s lemma). If we construct *U from the 
ultrafilter F, then *U is a countably saturated 
enlargement of U [1]. 

It is not easy to construct a x-saturated 
enlargement for an arbitrary cardinal x. We 
have two ways: to use a x-good ultrafilter (the 
proof of its existence is difficult) [2, 3] or to use 
an ultralimit (iteration of ultrapowers) [4, 16]. 


D. Applications 


(1) Infinitesimal Calculus. A hyperreal number 
x (element of *R) is called infinitesimal if |«| is 
smaller than every positive real number. If 

a — p is infinitesimal, a and D are said to be 
infinitely close to each other; this is written 

a f. If 1/o is not infinitesimal, æ is called 
finite. Every finite hyperreal number « is in- 
finitely close to a real number a (completeness 
of R). We call a the standard part of a and 
write st(x). Every finite hyperinteger is an 
integer. 

Let f be a real-valued function on a real 
interval J. Then *f is a hyperreal-valued func- 
tion on the hyperreal interval */. The function 
f is continuous if and only if f(x) z *f() for 
every xel, ne*I with x xy. The function f is 
uniformly continuous if *f(Z) = *f(y) for every 
é,ne*l with £x. 

Let 6x be a variable ranging over nonzero 
infinitesimals. The function Af ^ *f(a -- óx) — 

f (a) of ôx will be called the infinitesimal 
increment of f at a. f is differentiable at a if 
and only if the quotient óf/óx is of infinitesimal 
variation. The common standard part of f/x 
is the derivative f'(a). 

The higher-order differential ó"f is also 
justified as a higher-order infinitesimal dif- 
ference, with the standard part of 6"f/6x" being 
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f(a). We can define the Riemann integral as 
the standard part of a Riemann hypersum with 
respect to a hyperfinite partition of infini- 
tesimal width. 

This type of reformulation permits us to 
rewrite the whole of calculus, and many 
teachers are trying to adapt this theory to 
elementary calculus [5,6]. 


(2) Topological Spaces. Let X be a topological 
space in U and let a be a point of X. The 
intersection of *A, A varying over the neigh- 
borhoods of a is called the monad of a and is 
denoted by Mon(a). There exist hyperneigh- 
borhoods of a contained in Mon(a) (infini- 
tesimal neighborhoods). The topology is deter- 
mined by the system of monads (Moní(a)»,. y. 
We can thus rewrite the theory of topological 
spaces. For example, X is Hausdorff if and 
only if Mon(a) () Mon(b)= Ø for every a#b 
in X. X is compact if and only if every point of 
* X belongs to the monad of some element in 
X. This characterization is very useful; using it, 
Robinson and Bernstein were able to solve the 
invariant subspace problem for a special class 
of operators on a Hilbert space [7]. We can 
also construct a Haar measure very naturally 
and simply [2]. 


(3) Measures and Probability Theory. Let 

(X, B, m) be a measure space. Then there exist 
a hyperfinite subset I' of * X and a positive 
hyperreal-valued internal function o such that 
we have 


| f dm *f($)e(£) 

X Zei" 

for every integrable function f [8]. The right- 

hand side is a hyperfinite sum. It follows that 

the measure m can be extended to a finitely 

additive measure defined over all subsets of X. 
If in particular every measurable finite set is 

of measure 0, then there exist a hyperfinite 

subset I of * X including X and a hyper- 

natural number p such that we have 


| fdms Y SE 

X P žer 

for every integrable function f [8]. If m(X)- 1l, 
then p can be taken as the hypercardinality of 
T. Here, the right-hand side is nothing but the 
mean value of *f on a hyperfinite set. This idea 
leads us to a simple description of probability 
theory. 

The above method is rather formal; but P. 
Loeb [9] has pioneered a new approach in 
probability theory by constructing an external 
measure space from a hyperfinitely additive 
internal measure space (X, sZ, v) in *U. In the 
following, *U ts supposed to be countably 
saturated. 





1102 


Let c(«7) be the external countably additive 
algebra of subsets of X generated by £. Then 
the mapping .o/ +> st(v(.c/)) from æ to R can 
be extended to a measure on c(.s7). The com- 
pletion of this measure space is denoted 
(X, L(V), L(v)). In cases where X is hyper- 
finite, æ is the totality of internal subsets of X, 
and v(X)- 1, then the space (X, L(.o/), L(v)) is 
called a Loeb space and L(v) a Loeb measure. 

Every Radon probability space can be re- 
presented as the image of a Loeb space by a 
measure-preserving mapping. Therefore the 
probability theory on a Radon probability 
space can be reduced to that on a Loeb space. 

For example, for *Lebesgue measure on the 
interval [0,1], take an infinite hypernatural 
number 4 and put X 2 (u/A| ue *N,Ox ux 
4 — 1). If we assign 1/4 to every point of X, we 
have an internal probability measure v on X. 
Then the mapping st: x st(x) from X to [0,1] 
serves as a measure-preserving mapping from 
the Loeb space (X, L(.</), L(v)) onto the Lebes- 
gue measure space on [0,1]. 

The notion of lifting plays a key role in 
probability theory on Loeb spaces. Let f be a 
real-valued function on X, and F an internal 
hyperreal-valued function on X. F is called a 
lifting of f if we have f(x) 2 st(F(x)) almost 
everywhere with respect to L(v). Hence f is 
measurable if and only if it has a lifting. 

Shuttling between an internal probability 
space and its Loeb space by lifting and 
standard-part mapping, we can develop, sim- 
ply and partially hyperfinitely, probability 
theory on the Loeb space. Among others, 
Anderson [10, 11] and Keisler [12] applied 
this method quite successfully to Brownian 
motions, Itó integrals, stochastic differential 
equations, etc. [17]. 


E. Nonstandard Set Theory 


In our formulation in B, we must construct *U 
from a fixed universe U. Nelson [13], Hrbacek 
[14] and Cuda [15] independently invented 
theories that nonstandardize the whole of set 
theory. In these theories, there is only one real 
number field R, and R already contains in- 
finitesimal numbers. Compare this to 
Robinson-type infinitesimal analysis, where 
infinitesimal numbers are introduced as an ad 
hoc tool. The new theories may demand a 
reflection on mathematical description in the 
natural sciences. 

In the following, we outline (intuitively) 
Hrbacek's theory, as strengthened and im- 
proved by Kawai [16]. 

We start from ZFC, the Zermelo-Fraenkel] 
set theory plus the axiom of choice. The lan- 
guage of Kawai’s theory NST is that of ZFC 
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plus two constants S and 7. We understand S 
as the totality of standard sets and / as the 
totality of internal sets. 

Let ¢ be a formula of ZFC, that is, a for- 
mula of NST without S and I. We write *¢ 
(resp. Ié, the formula of NST obtained by 
restricting the scope of variables in ¢ to S 
(resp. I). 

The mathematical axioms and axiom 
schemes of NST are substantially as follows. 
(1) If à is an axiom of ZFC, then °¢ is an 
axiom of NST. In other words, all the axioms 
of ZFC are valid in the universe of standard 

sets. 

(2) In the universe of all sets, all the axioms 
of ZFC are valid except the regularity axiom. 

(3) Every standard set is internal. Every 
element of an internal set is internal. 

(4) (transfer) For every n-ary formula 
$(x,, ..., x,) of ZFC, we have 


Vx,eS ... Vx,e S[^o(x,, ..., Xp) 


eld ix). 


(5) (saturation) For every set of size at most 
that of S, the scheme corresponding to Axiom 
3 (x-saturation) in Section B holds. Therefore 
the scheme corresponding to Axiom 2 (enlarge- 
ment) holds also. 

(6) (standardization) For every set a in- 
cluded in a standard set, there exists a stan- 
dard set b such that we have VxeS[xea «1 
xeb]. 

This completes the description of NST. 

Theorem (Nelson-Hrbacek-Kawai): NST is 
a conservative extension of ZFC. That is, let ø 
be a sentence of ZFC. If *9 can be proved in 
NST, then ¢ can be proved in ZFC. 

Corollary: If ZFC is consistent, so is NST. 
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A. General Remarks 


From counting, a primitive mental activity, 
came the natural numbers (N) (— Section B), 
which serve to denote the number of items or 
the order in which these items are arranged. 
We can extend this concept to define, step by 
step, the tintegers (Z), trational numbers (Q), 
treal numbers (R), and tcomplex numbers (C) 
(— 355 Real Numbers; 74 Complex Numbers). 
The extensions up to the rationals are 
carried out to attain a domain within which 
the operations of addition, subtraction, multi- 
plication, and division, namely, the four arith- 
metic operations (or rational operations) can 
be performed indefinitely, with division by 
0 the only exception. To develop the theory of 
natural numbers there are two well-known 
methods, one being G. Peano's system of 
axioms [4], which will be stated below, and 
the other being R. Dedekind's set-theoretic 
treatment [5]. The domain of rational num- 
bers can be extended, taking continuity into 
consideration, to that of real numbers by 
several methods of which the best known are 
those of the Dedekind cut (1872) [8], which 
will be described below, and of G. Cantor's 
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fundamental sequences (1872) [9]. There is 
also a way based on infinite series, given by 
K. Weierstrass in his lectures (1859—1860). 
Though in the domain of real numbers (1) 

the four arithmetic operations can be per- 
formed indefinitely and (2) an order relation for 
magnitude is defined, the equation x? + 1=0 
has no root. Introducing numbers expressible 
as a +ib (i= Sr it is possible to solve 
every equation of the second order. Such 
numbers, once called imaginary, have been 
used since the days of G. Cardano [6] in the 
16th century. L. Euler also made good use of 
complex numbers as convenient tools in many 
calculations and obtained among other things 
his formula exp iÜ — cos 8 + isin 0. Indeed, the 
notation i —4/ —1 was used for the first time 
by Euler (1777). Furthermore, C. F. Gauss, 
giving imaginary numbers the name complex 
numbers, showed that any algebraic equa- 
Don with numerical coefficients always has 
roots in the domain of complex numbers. The 
discovery of the geometric representation of 
complex numbers by several mathematicians 
in the late 18th and early 19th centuries and 
their use in many applications have made 


complex numbers indispensable in mathematics. 


Though there are extensions of complex 
numbers, such as 'Hamilton's quaternions or 
*Cayley numbers, it is generally accepted that 
when we speak of a number we usually mean a 
complex number. 


B. Natural Numbers 


Peano, basing his system on a specific natural 
number, 1, and a function such that to each 
natural number x corresponds a natural num- 
ber x 4- 1 (hereafter denoted by x' and called 
the successor of x), formulated the funda- 
mental properties of the set N of natural num- 
bers in the following five axioms, called the 
Peano postulates: (1) 1 € N; (2) if x e N, then 
x' eN; (3) if xeN, then x’ #1; (4) if x =y 
(x, ye N), then x= y; and (5) if a set M satisfies 
the two conditions: ! € M, and xe M implies 
x'e M, then N c M. Since these postulates 
determine N uniquely up to isomorphism, they 
can be regarded as a definition of N. Elements 
of N are called natural numbers. 

Owing to Peano's fifth postulate, regarding 
a certain property P(n) for natural numbers n 
we can deduce that P(n) is true for every n if 
we prove both of the following conditions: (i) 
P(1) is true; and (ii) for any natural number k, 
if P(k) is true, then P(k + 1) is true. Such rea- 
soning is called mathematical induction (or 
complete induction). Accordingly, Peano's fifth 
postulate is called the axiom of mathematical 
induction. Double mathematical induction, a 
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generalization of mathematical induction, is as 
follows: to show that the property P(m, n) is 
true for every pair of natural numbers m and n, 
we have only to show that (iii) P(m, 1) and 
P(1,n) are true for every m and every n; and 
(iv) for every pair of natural numbers k and 1, if 
P(k -- 1, ID) and P(k,/+ 1) are true, then P(k+ 1, 
{+ 1) is true. This axiom can be formulated in 
several other ways and can be generalized 
further to n-tuple mathematical inductions 

(n — 2, 3,4, ...), generically called multiple 
mathematical inductions. 

Assume for a set M that a mapping f from 
the *Cartesian product N x M into M is given. 
Then a mapping ọ from N into M such that 
(V) p(1)=a; and (vi) e(x') 7 f(x, p(x) (xe N) 
exists and is unique. Defining ¢ by (v) and (vi) 
is called the definition of o by mathematical 
induction. 

In particular, given a natural number a, the 
mapping o N-N defined by (vii) o(1) — a'; 
and (viii) o(x)  o(x) is called addition by a. 
We shall write o(b) 2 a +b, whence x'2x- 1. 
Addition thus defined obeys the following 
laws: a+ b =b + a (commutative law); (a + b) 
+c=a+(b+c) (associative law). Peano’s 
postulates are thus equivalent to the following: 
(1) 1€ N; (2) for each pair a, be N, a+ beN is 
defined so that addition obeys the commuta- 
tive and associative laws; (3’) for each pair of 
natural numbers a and b, one and only one of 
the following three relations holds: a — b-- c 
(ceNy; a=b; a+c=b (cEN); (4) the same as 
(5) (mathematical induction). From (1*)- (4?) 
follows the cancellation law: a+ c — b -- c 
a=b. Define az b if and only if a=b or a= 
b+c (a, b, ce N). Then, from (3), N becomes a 
ttotally ordered set and az be» ac cz bc. 

For each aeN, the mapping o" NN 
defined by o(1) o and ¢(x’)= g(x) a is 
called multiplication by a, and we write o(b) 

— ab (or a: b). Multiplication obeys the follow- 
ing laws: ab = ba (commutative law); (ab)c = 
a(bc) (associative law); a(b + c) - ab + ac, 

(a+ b)c — ac + bc (distributive laws); and 

ac = bc <> a= b (cancellation law). The state- 
ment a‘! =1-a=a also holds. 

Natural numbers, which have been intro- 
duced thus far as *ordinal numbers, also have 
the properties of *cardinal numbers. Denoting 
11,2, ...,n] 2 M,, we have M, Maes mn, 
M, + M,- M, Mn x M, = Moy (— 49 


mtn? 


Cardinal Numbers). 





C. Integers 


Introducing new numbers which are not in 
N=/1,2,...}, represented by the notations 
0, —1, —2,..., —n,..., we write Z={..., 
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—n,..., —2, — 1,0, 1, 2, ..., n, ...J. An element 

of Z is called an integer (or rational integer). 
Algebraically we can construct Z from N as 

follows: Let the set of all tordered pairs (k, l) 

of natural numbers k, | be M =N x N, and 

define in M an tequivalence relation (k, I) ~ 

(m, n) by k+n=m-+l. Let the equivalence class 

of (k, 1) be K(k, I), and construct the *quotient 

space, M/~ = M*. Then the mapping o: 

Z— M* defined by o(n) 2 K(k4 n, k), o(0) — 

K(k, k), q( — n) 2 K(k, k +n) is bijective. Setting 

K (k, 1) - K (m, n) 2 K(k -- m, 4- n), addition 

can be defined in M* (and accordingly in Z) 

which is an extension of that in N. Since 

K (k, D) — K (m, n) 2 K (k +n,1+ m), subtrac- 

tion can be defined in Z. This makes Z an 

*Abelian group with respect to addition. An 

order relation in Z is defined by K(k, D 

K (m, n) & k-n Z2 m - l, which makes Za 

totally ordered set. This order relation is an 

extension of that in N. In particular, N — 

(ae Z|a- 0]. Furthermore, setting K(k, I) x 

K (m,n)— K (km + In, kn + lm), multiplication 

in Z can be defined. It is an extension of 

that in N and obeys commutative, associative, 

and distributive laws. Also, for each a, be Z, 

we have ab=0<(a=0 or b=0). Thus, Z 

becomes an "integral domain. 


D. Rational Numbers 


Let P be the set of all ordered pairs (a, b) of 
integers a, b with b #0, and define in P an 
equivalence relation (a, b) — (c, d) by ad — bc. 
Each equivalence class determined by this 
relation is called a rational number. Denoting 
by L(a, b) the equivalence class to which (a, b) 
belongs, we can define the sum x + y, the 
difference x — y, the product xy, and the quo- 
tient x/y of rational numbers x = L(a, b), y= 
L(c, d), as in the cases of addition and multi- 
plication of integers, in the following way: 
x4 y L(ad + bc, bd), x — y= L(ad — bc, bd), 
xy = L(ac, bd), x/y = L(ad, bc), where the quo- 
tient is defined only when c #0. Thus the set 
Q of all rational numbers becomes a field. 

For the same reason that we have identified 
integers of a special type with natural num- 
bers, we now identify a rational number ex- 
pressible as L(a, 1) with an integer a. Hence- 
forth, any rational number L(a, b) (b #0) can 
be expressed in the form of a quotient a/b 
(b £0) of integers a and b. 

From L(a, b) 2 L(ca, cb) (c £9), it is always 
possible to assume b 0 in the representation 
L(a, b) of a rational number x. For any two 
rational numbers x = L(a, b), y= Lie, d) with 
b» 0, d» 0, define an ordering in Q by xz y 
<> ad > bc, which is an extension of the order- 
ing of integers. Thus (i) Q becomes totally 
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ordered, and we have (ii) x> y-» x tz 2 yz 
and (iii) x z y and z 2 0 = xz z yz. The ration- 
al x is called positive if x — 0 and negative if 
x<0. 


E. Real Numbers 


Two typical methods of constructing real 
numbers from rational numbers are those of 
Dedekind and of Cantor. 


Dedekind’s Theory of Real Numbers. We call a 
pair (A,, A,) of subsets A,, A, of the set Q of 
all rational numbers a cut of Q if they satisfy 
the following conditions: (i) A; #@, A; x Ø; 
(ti) Q = A, UA); (ili) a € 4,,a5€ Ay a, « a5. 
Then the following three cases are distin- 
guished: there is (i) a maximum in A, with no 
minimum in A,; (ii) no maximum in A, with a 
minimum in A,; or (iii) no maximum in A, 
with no minimum in A,. A cut with either 
condition (i) or (iti) is called a real number (in 
the sense of Dedekind); condition (ii) can be 
converted to (i). The set of all real numbers is 
denoted hereafter by R and each real number 
by a or f or .... A real number with property 
(1) is called a rational real number, and a real 
number with property (iii) an irrational real 
number. Any rational real number ts uniquely 
determined by the maximum a* of A,, and the 
mapping: a>a* =(A,, A5) from the set Q of 
rational numbers onto the set Q* of rational 
real numbers is bijective. 

I. For real numbers x —(A,, A;) and B= 
(B,, B,) we define a <£ if and only if A, c B,. 
By this ordering <, R becomes a totally 
ordered set. 

IL For real numbers « —(A,, A;) and B= 
(B,, Bj), put C; 2 fa- bae A;, be B} and 
C, -R—C,;then (Cj, Cj) - » is a real num- 
ber. Define the sum a+ $ by setting «+ B=. 
Addition thus defined obeys commutative 
and associative laws, and R becomes an 
*Abelian group with Of as its zero element. 
Furthermore, for real numbers « —(A,, A;) 
and B —(B,, B,) with 0* <a, 0* x f, put D = 
{ab|ae A, be B,}, D, =R—D,; then (D,, Dj) - 
ô becomes a real number. Define the product «f 
by setting af =ô. According as 0* a, 0* <p; 
0* <a, 0* > f; or 0* >a, 0* > f, define af = 
—((—2)B); xB = —(a(—f)); and xp =(—a)(— p), 
respectively. Multiplication thus defined obeys 
the commutative, associative, and distributive 
laws, and R becomes a field with 1* as its 
unity element. 

III. For ordering and arithmetic operations, 
we have (1) x> fl  &-- yz ß +y; and (2) 
a> p, y >0* => ay > fy. 

By letting each rational number a corre- 
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spond to a rationally real number a*, we can 
set up a bijection between the set Q of all 
rational numbers and Q*. Furthermore, in this 
correspondence, the sum, product, 0, and 1 of 
Q are mapped to the sum, product, zero ele- 
ment, and unity element of Q*, respectively; in 
addition, the ordering is preserved. Thus Q 
and Q* are isomorphic with respect to both 
arithmetic operations and ordering. We shall 
hereafter identify the element a* with a. Ac- 
cordingly we call rational real numbers simply 
rational numbers and similarly irrational real 
numbers simply irrational numbers. 

IV. Similarly as for Q, we can define a cut 
(A,, A5) of the set of all real numbers R (pre- 
cisely, by the conditions A, x Ø, A; x Ø; 
R=4 UA; d €A, d€ A; — d, «d,) For 
each cut (A,, A5) of R, either there is a max- 
imum in A, and no minimum in A, or no 
maximum in A, and a minimum in A, (Dede- 
kind). This property is called the connectedness 
(or continuity) of real numbers. 

By using definitions (1)- (IV), any real num- 
ber « can be represented (i) as the tleast upper 
bound of some set A of rational numbers: «= 
sup A; and also (ii) as the limit of a sequence 
{a,} of rational numbers: «=lima,. 


Cantor’s Theory of Real Numbers. A sequence 
{a,} of rational numbers is called a funda- 
mental sequence (or Cauchy sequence) if it 
satisfies the following condition: for each posi- 
tive rational number e, there exists a natural 
number n, such that —e<a,—a,,<é for 
every a,, and a, with m ng and n» ny. For 
two Cauchy sequences {a,} and {b,}, we can 
write {a,}~{b,} if (a,, b, a5, b, ... ap b, ...] 
is again a Cauchy sequence. The relation ~ is 
an equivalence relation. Let R' be the set of all 
equivalence classes obtained by classifying, 
with respect to —, the set of all Cauchy se- 
quences, and call an element of R' (an equiva- 
lence class) a real number (in the sense of Can- 
tor). We shall write [{a,}] hereafter for the 
equivalence class of {a„}. In particular, {a,} 
with a, —a (n= 1,2, ...) being a Cauchy se- 
quence, we denote a real number [{a,}] by 
a**. An element of R’ of this type is called a 
rational real number (in the sense of Cantor), 
while one not of this type is called an irra- 
tional real number (in the sense of Cantor). 
For each pair of real numbers a = [1a,] ] 
and fj — [ 5, ], their sum and product are 
uniquely defined by «+ B — [Ía,-- b,] ] and 
«f — [ (a, b,] ], where both {a,+b,} and (a,b, 
are shown to be Cauchy sequences. Further, 
for « and fi, define «< fi if a, « b, hold for all 
n larger than some number ny. As regards 
arithmetic operations and ordering, R' has the 
properties (I)- (IV) of Dedekind's theory. Fur- 
ther, any Cauchy sequence of which the terms 
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all are real numbers has a limit in R' (com- 
pleteness of real numbers). 

The sets of real numbers obtained by the 
above two methods are isomorphic with re- 
spect to arithmetic operations and ordering 
(— 355 Real Numbers). 


F. Complex Numbers 


To construct complex numbers from real ones, 
several methods have been devised, among 
which the one stated below is due to W. R. 
Hamilton. 

An ordered pair (a, b) of real numbers a and 
b will be called a complex number. Arithmetic 
operations among complex numbers are de- 
fined as follows: (a, b) 4- (c, d) 2 (a - c, b +d), 
(a, b) — (c, d) 2 (a— c, b — d), (a, b): (c, d) 2 (ac — 
bd, bc + ad), and 





(a, b) SE bd be — ad 


= ; ; ,d) #(0, 0). 
(c,d) \c?+d? m (c, d) 7 (0,0) 


According to these definitions, the set C of all 
complex numbers becomes a "feld with (0, 0) 
and (1,0) as its tzero element and tunity ele- 
ment, respectively. R* = ((a,0)| ae R] isa 
*subfield of C, and the mapping o: Ro R* 
defined by (a) — (a, 0) proves to be a field 
‘isomorphism. Thus, identifying the element 
(a, 0) of R* with the element a of R, C can be 
regarded as an !overfield of R (R c C). Hence, 
the zero element of C is the real number 0 and 
the unity element is the real number 1. The 
complex number (0, 1) is called the imaginary 
unit and denoted by i. Thus we can write 

a+ bi as usual for a complex number (a, b) 
(— 74 Complex Numbers). 
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A. Recurrent Sequences 


A (complex-valued) function that has the 

set of nonnegative integers (or the set N of 
natural numbers) as its tdomain is called a 
number-theoretic (or arithmetic) function. 
Thus it can be regarded as a tsequence of 
numbers. We first consider recurrent se- 
quences. Let f(xo, x,, ..., x, ,) be a complex- 
valued function of r variables. Put up — ag, 
,Uu,_,=a,_,, and successively de- 
fine ve, = (Uj, Hen, Hoas 4) (050, 1, 2,...). 
The sequence {u,} thus defined is called a re- 
current sequence of order r determined by the 
initial values ao, 44, ...,a,., and the function 
f. In particular, when the defining function f 
is given by EC Abee, {u,} is called a linear re- 
current sequence. The Fibonacci sequence is a 
special linear recurrent sequence with initial 
values a9, a, and the defining function x, + x,. 
Let a —(1--/5)/2, B=(1—,/5)/2 be two 
roots of 1 -- x ^ x?, and let c, and c; be deter- 
mined by c, +c, ag and c a+c fj —a,. Then 
the Fibonacci sequence with the initial values 
do, 4, is given by putting u,—c,0" Les f" (n= 
0,1,2,...). If we put ay—1,a,— 1, then we 
obtain Binet’s formula: 


CÓ) CY} 


B. Multiplicative Functions 


u,-—4;,... 








A number-theoretic function f with domain N 
is said to be multiplicative if f(1) — 1, f(mn) 

= f(m)f(n) for (m, n) 2 1, and to be completely 
multiplicative if /(1) — 1, f(mn) — f(m)f(n) for 
any m and n (EN). Similarly, f is said to be 
additive if f(1)=0, f(mn) = f(m) 4- f (n) for (m, n) 
= ], and to be completely additive if /(1) — 0 
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and f(mn) — f(m) 4- f (n) for any m and n (EN). If 
f is multiplicative and £X, f(n) is absolutely 
convergent, then X7. f(p") is absolutely con- 
vergent for any prime p. Moreover, we have 


» ai: Il ( fo). 


where the infinite product on the right-hand 
side is also absolutely convergent. Further- 
more, if f(n) is completely multiplicative, then 
we have 


È sT -ftpy?. 


In particular, for f(n) — n ?, which is com- 
pletely multiplicative (with s=o+it a complex 
variable), we get the *Euler product formula 
for the tRiemann zeta function: 


SEKR 


p 


for o » 1. 


C. Convolutions 


If f and g are number-theoretic functions, the 
convolution f +g is defined by 


(«90 2. SA) glad), 


with the summation carried over all divisors d 
of n. For any number-theoretic functions f, g, 
h, we have f*g—g* f and (f*g)*h= f *(gxh). 
If f and g are multiplicative number-theoretic 
functions, f*g is again a multiplicative 
number-theoretic function. 

The Möbius function u(n) is defined as fol- 
lows: u(1) — 1, u(n) -O if n is divisible by the 
square of a prime, and u(n) —-(— 1) if n is the 
product of r distinct primes. It can easily be 
proved that yz is multiplicative and that 


1 (n=1), 
zudi (n>1). 
Let e and p be functions defined by e(1)=1, 
e(n)=0 (n> 1), and p(n)=1 for every n. Then e 
is the identity element for the convolution 
x, and p*xp=p*p=e. It follows that f «p—g 
is equivalent to f =g + y, that is, 


g(n)=) f(d) 


d|n 


implies that 
fin) => u(d)g(n/d), 


and vice versa. We call the latter the Móbius 
inversion formula. Similarly, for complex- 
valued functions F, G defined on [1, +00), 


G(x)= 3. F(x/n) 


nsx 


295 D 
Number-Theoretic Functions 


is equivalent to 


F(x)= Y, pln)G(x/n). 
HS 
Let oi) be the number of integers m not 
greater than n and such that (m, n) 2 1. The 
function ¢(n), called the Euler function, is 
multiplicative, and we have 


aal gi =n 24 


pir d\n 


and ~(p")=p"—p" ' for every prime p. Let v 
be the function defined by v(n) — n for every n. 
Then we have u*v— and v2 p* o, and hence 
Lain P(d) =n. 

The generalized divisor function d (nl is the 
number of ways of expressing n as a product of 
k factors. Thus we have 
d,(n) = X l =È d (m) 

niha. ny n m|n 
and d, — p * ...* p (k factors). Therefore, d, is 
multiplicative. For simplicity, we write d(n) 
instead of d (n). Then d(n) is the number of 
divisors of n, and we call it the divisor function. 
For example, d(12) 2 6. We denote by o,(n) 
the sum of ath powers of divisors of n. If n= 
II, p^, then we have 


pur 4 


* 


c, (n) 3 d*=[ | 


din pn p*—1 


and c, is also multiplicative. We write o, (n) — 
a(n). A number n satisfying o (n) — 2n is said 

to be a tperfect number. Let n= [T,,, p”. The 
functions o(n) — È, 1 (the number of distinct 
prime factors of n) and O(n) = È pnl, (the num- 
ber of prime factors of n) are often used, the 
former being additive and the latter com- 
pletely additive. 


D. Residue Characters and Gaussian Sums 


Let k be a positive integer and y be a com- 
pletely multiplicative function such that y(n) 

— 0 for (n,k)>1 and z(n,) — x(n;) for n, =n, 
(mod k). The function y is called a Dirichlet 
character (or residue character) with modulus k 
(or modulo k). There exist (k) distinct charac- 
ters modulo k for a given k. The character 
satisfying y(n)= 1 for every n coprime to k is 
called the principal character modulo k, and 
we denote it by yo. It ts easily proved that 


Kl jet). G x9. 
Zar) 


0 (t 7 Xo). 
_ folk) (n=1(modk)), 
2 m=] 0  (n#1(modk)). 


Suppose that k=k,k,...k, ((k; k) — 1, ij). 
Then there exist r characters y; modulo k; 
permitting the unique decomposition y — 
XiX2 Xr 
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Let y be a character modulo k and p= 
exp(2zi/k). The Gaussian sum modulo k is 
defined by 
G(a,z)- 5, x(mp". 

n(mod k} 
where n runs over a complete system modulo 
m (— 297 Number Theory, Elementary, G). 
Hence if a z b (mod k), then G(a, y) = G(b, x). 
Suppose that k=k,k,...k, ((k; k)— 1, iz j) 
and a; za (mod Kj). Then we have 


G(a, y) » €G(a, , 41) G(a5. X2) ... G(a,, y,). 
e= y Vk, )y s (k/ Ks)... X (K/k,), 


where y 2 4x5... X, is the decomposition of y 
mentioned above. 

Let k be a divisor of k. If y(n) 2 1 whenever 
(n, k) (and nz 1 (mod kọ), then we say that 7 
is defined mod ky. The least positive integer 


f= f(y) modulo to which y is defined ts called 


the conductor of y. If the conductor of y is k 
itself, then 7 is said to be a primitive character 
modulo k. When x has the decomposition y = 
71 f, the conductor of y is also decom- 
posed as f(y) — fGr)f x2) --- f(x). If D isa 
square-free integer, then the tdiscriminant d of 
the quadratic number field QD) (where Q is 
the rational number field) is either D (D=1 
(mod 4)) or 4D (D =2, 3 (mod 4)). The integers 
d so represented are called the fundamental 
discriminants. The *Kronecker symbol (d/n) 
(— 347 Quadratic Fields) is defined only for 
such fundamental discriminants. Z. Suetuna 
and A. Z. Walfisz (1936) proved that if y (n) is 
a real primitive character modulo k, then we 
necessarily have one of the following cases: 

(i) Kk 2 pipa ... Py; Gi) k—4p, p; ...p,i or (iii) k = 
8p, p; ... p, (with the p; distinct odd primes). In 
case (1), 


2 S(k/n) (kz 1 (mod 4)), 
E E EE 


in case (ii), 


(—k/n) (k/4=1 (mod 4)), 
m=} T" ; 
(k/n) (k/4= —1 (mod 4)); 


and in case (iti), y(n) is either (k/n) or ( — k/n). 
If y is a primitive character modulo k, then 

G(a, y) 7 x(a)G(1, y), GO, WGC, x) — k. In par- 

ticular, if y is a real primitive character, then 


Sp ach (C7 0 1) 
U uk GC 0--D. 


Sometimes S(a, k) 2 X, (moan 9 " is called the 
Gaussian sum, where p — exp(2zi/Kk). If (a, k;) 
=] (i=1,2) and (k,,k,)=1, then S(a, k, k2) 
= S(ak,, k,)S(ak,, k;). If (a, 2) 2 1, then 


0 (r=1), 
AË i?)27? 


Qriala Y Dirt 1)j2 


S(a, 27) 2 (r even), 


(r odd > 1). 
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Let p be an odd prime and (a, p) 2 1. Then 
we have 


(a/p)S(1,p) — (r—1), 
S(a, p') -4 p'? (r even), 
p* "?S(a,p) (rodd» 1), 


where (a/p) is the *Legendre symbol (— 297 
Number Theory, Elementary, H). The well- 
known Gauss formula is stated as follows: 


Vp (psl(mod4), 
S(.ps a 
i /p (p=3 (mod 4)). 
Various proofs of this formula have been given 


by many authors [5]. 
Let 


G(a,zo)- / exp(2niah/k), 


h(mod k) 


called the Ramanujan sum, be denoted by c, (a). 


The sum 24,4, means that h runs through a 
reduced residue system modulo m (— 297 
Number Theory, Elementary, G). It follows 
that c, (1) = p(k) and 


k 
Gë A ab 
dýka) \d 


E. Analytic Methods 


Let f be a number-theoretic function. One of 
the problems in analytic number theory is to 
estimate È< f(n) or to expand it into series. 
We assume that f(x) and g(x) are real-valued 
functions defined for x 2 1 and g(x) is of class 
C!. Then we have 


3 Dua Fexata- | F(t)g' (t) dt, 
nex 1 


where F(x)= Ze, f(n). If we take f(x) « 1, g(x) 
=log x or 1/x in this formula, then we obtain 


3 logn 2 xlogx— x -- O(log x) 


nix 


or 


y n!zlogx-C-0(x !) 

nsx 

(where C is the tEuler constant), respectively. 
We now construct from the function f the 

Dirichlet series 


or the power series 


Ke 


P(x)= 3. f(nx". 

There are called the generating functions of f. 
The consideration of these functions makes 
possible the application of function-theoretic 
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methods to the problem in this section. Some- 
times more complicated functions are used. 

We mainly consider the generating function 
represented by the Dirichlet series. The gen- 
erating functions of p, u, d, y (— Sections C, D) 
are ((s)= Zen ^, Broi u(n)n ^, Ze d(n)n S, 
L(s, x) 2 Łza x(nn ^, respectively. The tab- 
scissa of absolute convergence of each of these 
Dirichlet series is 1. The function Lis, y) is 
called the *Dirichlet L-function (— 123 Distri- 
bution of Prime Numbers; 450 Zeta Func- 
tions). When k=1 and y = yo, L(s, y) is pre- 
cisely Ciel, 

Let F(s), G(s), and H(s) be the generating 
functions of f, g, and h= f *g. Let f and g 
be multiplicative, so that h is also multiplica- 
tive. Moreover, if F(s) and G(s) are absolutely 
convergent for o > o9, then H(s) is also ab- 
solutely convergent for o 2 6g, and F(s)G(s)= 
H(s) for c > og. It follows from this that 
Xa unn Test (s, Ere w(n)y(n)n ? 
=L(s, AT, Lye du(n)n "es C(s)*, and 
Si, d*(nn^-C(syCQs) The last result 
was given by S. Ramanujan. More generally, 
Lz, d'(n)n "e CH (s)o(s), where o(s) is an 
analytic function for oz 1/2. By utilizing ana- 
lytic methods, we can deduce from this that 
Enc d'(n sien Dog ti ^! +... +c) (B. M. 
Wilson, 1923) (^ 123 Distribution of Prime 
Numbers; 242 Lattice-Point Problems). 

There are many formulas known as the 
Euler summation formula. Among them the 
following one is convenient to use (E. Landau, 
L. J. Mordell, H. Davenport): Let f; (x) —-x 
— [x] — 1/2 (when x is not an integer), =0 
(when x is an integer). We successively con- 
struct continuous functions f,(x), f;(x), ... of 
period 1 such that f; (x) 2 f,- (x) (rz 1) if x is 
not an integer, and {9 f,(x)dx — 0. If F(x) is of 
class Ch on [a,b], then using these auxiliary 
functions, we have 


b 
Y F(ma| F(x)dx 
agm<b a 


h-1 


+) y Ga) roa) 


r=0 


— f(b) F(b)) 
«cur Sas) FP(x)dx, 


where >’ means that the term corresponding 
to m=a or m=b in the sum is to be replaced 
by F(a)/2 or F(b)/2 whenever a or b is an in- 
teger. Since the tFourier series of fọ(x) is 


* sin(2znx) 
n=1 nn 


(which is convergent), f,(x) can be expressed by 


Li 


2 exp(2minx) 


ascia riny” 
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where >’ means that the term with n=0 is 
omitted and this sum actually means 


i ARE 7 


(2xiny t! (—2niny*? 


Hence if h — 1, we have 


b 
y ra= | F(x)dx 


agngb 
1 1f? e 
—— Y -| F'Q9jsinnnx) dx. 
For instance, if we put h21,a— 1, bz N, 


F(x)=x ? and let N tend to oo, then we have 
the formula 





1 1 
s Solx 9 dx for o»1. 
pert 


Utilizing the following integration by parts, 
the integral on the right-hand side can be 
extended so as to become holomorphic in s in 
the whole complex plane: 


| 55h x= funtion | A file 2 d 
1x 





Probabilistic considerations are also used 
for the study of various number-theoretic 
functions. If f(n) is œ(n) or Q(n), then we have 
Xs fn) xloglogx-Fcx-t o(x). Therefore 
the average order of c(n) or Q(n) is estimated 
as loglogn. Let A(x; a, B) be the number of n 


satisfying n <x and loglogn 4- x, /loglogn < 


f(n) «loglogn- f./loglogn. Then 
, A(x; a, p) -u2 

l “$12 du. 
ee ee 


For f(n) — on), the result was proved by P. 
Erdós and M. Kac (1940) by using the tcentral 
limit theorem and V. Brun's !sieve method. 
Further general formulas were obtained by M. 
Tanaka (1955). Further development and the 
present state of probabilistic number theory 
can be seen in [7,8]. 

Finally, we mention the well-known estima- 
tion formulas. Let e be an arbitrary positive 
number and n be a positive integer. Then d(n) 
= O(n*), where O (*Landau’s symbol) depends 
on e. We know that limsup,,,,, (log d(n))- 
(loglogn)/logn=log 2 (S. Wigert, 1907), and 
lim inf, „~ o(n): (loglogn)/n =e €. The result 
on) — O(log n/log log n) is often used. Let y be 
a primitive character modulo k, and let A, = 


E”, x(n). We can prove that |S,| « J/k logk for 


all m (I. Schur, 1918) and S, — Ok log log k) 
(R. Paley, 1932). This formula, with the symbol 
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Q, means that there exist infinitely many k 
satisfying |S,,| >c /k loglogk by taking m 
and y suitably, with c any positive constant. 
Let y be a character modulo k, and y? be a 
primitive character associated with y. Then 
we have Lis, y) — L(s, x?) p1 — x° (p)p. ?). 
It follows that 


Y n= Y. add) Y xi) 
d;d\k,d f 


3 
il 
ka 


me nid 


if the conductor of y? is f. A. G. Postnikov 
investigated (1956) the sum of characters. The 
least integer that is not a quadratic residue 
modulo p does not exceed DI? (log p)?, where 
p is a sufficiently large prime (I. M. Vinogra- 
dov, 1926). The least integer that is a *primi- 
tive root modulo p does not exceed 2"*! | /p, 
where m is the number of distinct prime divi- 
sors of p— 1, with p a prime (L. K. Hua, 1942). 
D. A. Burgess (1962) deals with the latter re- 
sults. We conclude with Artin's conjecture: If w 
is a given square-free integer, then there exist 
infinitely many primes p such that w is a primi- 
tive root modulo p. In 1967, C. Hooley proved 
this conjecture subject to the assumption that 
the general *Riemann hypothesis holds for 
*Dedekind zeta functions over certain *Kum- 
mer extensions of Q. He also obtained the 
asymptotic formula for the number of such 
primes not exceeding x. Recently the tsieve 
method has been widely applied to the various 
investigations in the theory of number theore- 
tic functions (— 123 Distribution of Prime 
Numbers E). 
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296 (V.1) 
Number Theory 


A. History 


Simple and curious relations among integers 
were discovered and admired from antiquity. 
For example, we have the relation 3? 4- 4? =5?, 
which has geometric meaning concerning right 
triangles. The Pythagoreans (— 187 Greek 
Mathematics) sought similar relations. They 
were also interested in tperfect numbers (num- 
bers equal to the sum of their divisors, such 

as 28=1+2+4+7+ 14). Modern arithmetic 
inherits from the Greeks the proof of the 
existence of an infinite number of primes, the 
*Euclidean algorithm to obtain the greatest 
common divisor of two integers (both given in 
Euclid's Elements), and Eratosthenes’ tsieve for 
finding primes. In the 3rd century Ap. Dio- 
phantus of Alexandria discovered a method of 
solving indeterminate equations of degrees 1 
and 2; this marked the origin of Diophantine 
analysis. The ancient Chinese also knew how 
to solve equations of the first degree in some 
special cases. Arithmetic also developed in 
India from an early period; its relation to 
Greek mathematics is not yet entirely clear. In 
the 12th century, the Indian mathematician 
Bháskara knew how to solve tPell equations 
using a method much like Lagrange's. 

Interest in integers was reborn in Europe in 
the 17th century. During this period, Bachet 
de Méziriac (1581—1638) rediscovered the 
solution of the tDiophantine equation of the 
first degree and published it in his famous 
book on mathematical recreations [1]. Primes 
of the form 2? — 1, closely related to perfect 
numbers and called tMersenne numbers, at- 
tracted considerable interest. *Fermat, some- 
times called the father of number theory, 
announced numerous results without giving 
proofs; the most famous among them is the so- 
called Fermat’s last theorem (— 145 Fermat's 
Problem). Another famous conjecture of his is 
that every integer is expressible as the sum of 
at most n n-gonal numbers, i.e., numbers of the 
form k(k — 1)n/2 — k(k — 2), ke N. This was 
proved by Gauss (for the case n= 3), Jacobi 
(n — 4), and Cauchy (for the general case). 

In the 18th century, remarkable progress 
was made by Euler and Lagrange. The second 
part of Euler's Algebra [2] contains rich re- 
sults of miscellaneous sorts in the field. La- 
grange developed the theory of tcontinued 
fractions and applied it to arithmetic. Toward 
the end of the 18th century, Legendre com- 
piled his comprehensive book [3], from whose 
title originates the term number theory. 
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Gauss's Disquisitiones [4] appeared at about 
the same time as Legendre's book. The theo- 
retical arithmetic of today originates from this 
work of Gauss. The book includes the theories 
of *quadratic residues, tquadratic forms, and 
cyclotomy (De. arithmetic theory of the roots 
of unity in the field of complex numbers), all 
of which appeared as well-developed theories 
of a remarkably high standard. The work was 
received with more respect than comprehen- 
sion. Dirichlet made a lifelong effort to pop- 
ularize the Disquisitiones; he also applied 
analytical methods to compute the tclass 
number of quadratic forms, thus giving num- 
ber theory a new direction, the analytic theory 
of numbers. Gauss treated only binary qua- 
dratic forms; Eisenstein, Minkowski, and 
Siegel generalized the theory to the case of n 
variables. The algebraic theory of numbers has 
its origin in Gauss's paper on biquadratic 
residues. (— 14 Algebraic Number Fields; 59 
Class Field Theory; 118 Diophantine Equa- 
tions; 182 Geometry of Numbers; 297 Num- 
ber Theory, Elementary; 347 Quadratic Fields; 
430 Transcendental Numbers.) 


B. Analytic Methods 


Analytic methods are sometimes used to solve 
arithmetic problems. For example, Legendre 
conjectured that any arithmetic progression of 
integers a, a 4- d, a 4- 2d, ... contains an infinite 
number of primes if a and d are relatively 
prime. The conjecture was first proved by 
Dirichlet in 1837; in the proof he used an tL- 
function. Recently, proofs that do not use L- 
functions have been obtained, although these 
proofs are still not purely arithmetical. Analy- 
sis is indispensable in the formulation of some 
arithmetic problems. For example, let xeR 
and z(x) be the number of primes not exceed- 
ing x. Euclid's Elements already give the re- 
sult z(x)-»oo as x— oo, but to describe the 
behavior of z(x) as x— oo, notions of analysis 
are needed. Gauss conjectured that 

_ R(x) 

lim zd 
x X/logx 





This *prime number theorem was first proved 
in 1896, using the results of the theory of func- 
tions of a complex variable; more elementary 
proofs have been obtained in recent years. 

Analysis is sometimes needed to solve or 
simplify certain problems in number theory. 
The branch of mathematics treating such 
problems is called analytic number theory. The 
question of the extent to which analysis is 
really needed in dealing with such problems is 
itself an interesting one. 
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In the 20th century, analytic number theory 
has made rapid progress. The problem of 
distribution of primes has been generalized to 
the case of algebraic number fields. tAdditive 
number theory dealing with fWaring’s prob- 
lem, tGoldbach’s problem, and other prob- 
lems has developed and formed a new field. 
We also have geometric number theory, which 
deals with tlattice-point problems. (— 4 Addi- 
tive Number Theory; 123 Distribution of 
Prime Numbers; 242 Lattice-Point Problems; 
295 Number-Theoretic Functions; 328 Par- 
titions of Numbers; 450 Zeta Functions.) 
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A. The Euclidean Algorithm 


We denote the set of natural numbers 1,2, 

3, ... by N and the set of rational integers 

0, +1, £2, ... by Z. Evidently Z is an ordered 
tcommutative ring and also an fintegral do- 
main with respect to ordinary addition and 
multiplication. For any ae Z and beN, there 
exists a unique pair of integers q and r such 
that a=qb +r (x r « b) (division algorithm), g 
is called the quotient and r the remainder of the 
division of a by b. When the remainder r is 
zero, we say that a is a multiple of b, b is a 
divisor of a, and a is divisible by b. We denote 
this relation by bla, After a finite number of 
divisions a qub t r,, b-q;r, t r4, ru — dar; 
r3, ... (bo ri r3... 0), we reach an equa- 
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tion of the form r, = q,, ab, The remainder 

r — r,4,, uniquely determined in this manner by 
a and b, is the greatest common divisor (G. C. 
D.) of a and b. It can be expressed as r— ax 
by with integers x and y. This method of 
obtaining the G. C. D. is called the Euclidean 
algorithm. The greatest common divisor of 

a and b is denoted by (a, b). If an integer c 
divides both a and b, then c|(a, b). When 

(a, b) 2 1, we say that a and b are relatively 
prime, and there are integral solutions x, y of 
the equation ax + by 2 1. Given a pair of posi- 
tive integers a and b, there exists a unique 
positive integer c that divides any common 
multiple of a and b, called the least common 
multiple (L. C. M.) of a and b. If d and ! denote 
the G. C. D. and L. C. M. of a and b, respec- 
tively, we have ab — dl. 


B. Prime Numbers 


An integer p is called a prime number if p is 
larger than 1 and has no positive divisors 
other than 1 and itself. A positive integer is 
called a composite number if it has positive 
divisors other than 1 and itself. The follow- 
ing method of selecting the prime numbers 
from the sequence 2, 3,4, 5, ..., known to the 
Greeks, is called Eratosthenes’ sieve: First we 
discard multiples of two, thus reaching the 
second prime number, 3. Then we discard 
multiples of three and reach the next prime, 5. 
Continuing this process, we find the sequence 
of primes 2, 3,5, 7, 11, ... by sifting out all 
multiples. 


C. Decomposition into Primes Factors 


Every integer n> 1 can be uniquely expressed 
as the product of primes; the resulting decom- 
position can be written as n — p*q?r? ... with 
distinct prime factors p, q,r, ... and corre- 
sponding exponents o, $, y, ... uniquely de- 
termined by n. This theorem is called the 
fundamental theorem of elementary number 
theory. 


D. Perfect Numbers 


We denote by c(n) the sum of positive divisors 
of n (including 1 and n itself). According as 
o(n) is greater than, equal to, or less than 2n, 
we call n an abundant number, perfect number, 
or deficient number. An even number is perfect 
if and only if it can be represented as n —2*^! - 
(2* — 1) with prime 2* — 1 (L. Euler). The exis- 
tence of an odd perfect number still constitutes 
an open question. It is well known that, if n 

is odd and perfect, then n must be of the 
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form n — ppi! ..-pft, where pyzagzl 

(mod 4), and a; is even for i 0. Recently it has 
been proved that t must be 7 7 (P. H. Hagis, 
Jr., Math. Comp., 35 (1980)). Many necessary 
conditions for an odd number to be perfect 
have been given and repeatedly improved. It 
has been recently proved that there exists no 
odd perfect number less than 10°° (Hagis, 
Math. Comp., 27 (1973)). 

Two numbers m and n are said to form a 
pair of amicable numbers if c(n) — n =m and 
o(m)—m=n (e.g., m— 220 and n= 284). Euler 
listed 61 such pairs. The following two num- 
bers m, n, both having 152 digits, constitute the 
largest amicable pair currently known: 


m -—3*-5-11- 5281!?-29- 89(2-1291- 5281!? 
— 1), 
n=3*-5-11-528119(23- 33-52-1291 - 528119 
—1) 


(H. J. J. te Riele, Math. Comp., 28 (1974)). 

E. Lionnet considered the numbers n such 
that the product TL, d is equal to n? and 
called such numbers perfect numbers of the 
second kind, e.g., n 2 p?, n= pp', where p and p' 
are unequal prime numbers. It is also known 
that there are numbers n such that o(n)/n is an 
integer. For example, 2? - 3-7, 215. 35. 52. 72. 
11:13:17-:19-: 31:43: 257 have this property. 


E. Mersenne Numbers 


A number of the form 29 — 1, where e is a 
prime, is called a Mersenne number. For the 
number 2* — 1 to be prime, it is necessary but 
not sufficient that e be prime. If a Mersenne 
number is a prime, it is called a Mersenne 
prime. It is not known whether there are in- 
finitely many Mersenne primes; it has been 
verified, however, that for e « 44500, there are 
27 cases of e such that 2^— 1 is a Mersenne 
prime; e= 2, 3, 5, 7, 13, 17, 19, 31, 61, 89, 107, 
127, 521, 607, 1279, 2203, 2281, 3217, 4253, 
4423, 9689, 9941, 11213, 21701, 23209, and 
44497. Until the 18th century, the verifications 
for ex 31 were done by direct calculation. For 
61 xex127, the “Lucas test” was utilized to 
execute the computation. The remaining cases 
were calculated by means of electronic com- 
puters. The number 244497 — |, which has 
13395 digits, is at present the largest known 
prime. 


F. Fermat Numbers 


Numbers of the form 27 4-1 are called Fer- 
mat numbers. For a number p —-2*4- 1 to bea 
prime, it is necessary that e be a power of 2. 
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Fermat conjectured that numbers of the form 
27?" - 1 are all primes. In fact, for v — 0, 1, 2, 3, 
4, the corresponding p — 3, 5, 17, 257, 65537 
are primes; however, 277 4- 1 is divisible by 641. 
It is not yet known whether there exist Fermat 
primes other than these five primes. For fur- 
ther details — [10]. Fermat numbers are 
closely connected with the problem of tgeo- 
metric construction of regular polygons. 


G. Congruence 


Let m be a positive integer. Two integers a and 
b are said to be congruent modulo m if their 
difference is divisible by m; we denote this 
relation by the congruence a =b (mod m) (or 
simply by az b (m)) and call m the modulus of 
this congruence. Congruence modulo m is an 
tequivalence relation, which is compatible 
with the ring operations and classifies Z into 
m classes. We thus obtain the tresidue class 
ring Z/mZ with m elements. If p is a prime, 
then Z/pZ is a field which is isomorphic to the 
'prime field with tcharacteristic p. A complete 
system of representatives of the quotient set 
Z/mZ is called a complete residue system 
modulo m. On the other hand, a set of (m) 
elements n; (o is "Euler's function) such that 
n € n; (m) (ij) and (n;, m) — 1 is called a re- 
duced residue system modulo m. The set of all 
residue classes represented by a reduced re- 
sidue system modulo m forms a multiplicative 
Abelian group of order o(m); we denote this 
group by (Z/mZ)*. If (a, m) 2 1, then a?” z 1 
(mod m). If p is a prime and (a, p)= 1, then 
a?! =1 (mod p) (Fermat's theorem), since 
~(p)=p—1. When m is 2, 4, p* or 2p* (p2; 
k=1, 2, ...), (Z/mZ)* is a tcyclic group, whose 
tgenerator g is called a primitive root modulo 
m (— Appendix B, Table 1). For any a prime 
to m and generator g of (Z/mZ)*, there exists 
a unique number u (1 <u < o(m)) such that 
a=g" (mod m). We call u the index of a with 
respect to the basis g, and write u= Ind, a (— 
Appendix B, Table 2). The group (Z/2* Z)* 

(k > 3) is Abelian of type (2, 2^ ?), a basis of 
which is formed by the residue classes repre- 
sented by —1 and 5. From this follows (p — 1)! 
= —1 (mod p) (Wilson's theorem). Generally, 
if m= [[;-; m; (m;, mj) —1 for ij, then we 
have Z/mZ = Zím, Z  ... - Z/m,Z (direct sum) 
and (Z/mZ)* =(Z/m, Z)* x ... x (Z/m,Z)* (di- 
rect product). 

The congruence ax =b (mod m) with (a, m) 
=d is solvable if and only if d|b, and when it is 
solvable, the solution is unique modulo mid 
The simultaneous congruences x =a; (mod m;) 
(i— 1,2, ..., k) are solvable if and only if a; a; 
(mod (mi, m;)) (i, j — 1, ..., EL and when they are 
solvable, the solution is unique modulo the 
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greatest common multiple of m,,...,m,. In 
particular, when (m;, mj) — 1 (ij), the solution 
is unique modulo m; m, ... m, (Chinese remain- 
der theorem). If m — pi: ... pg* is the factoriza- 
tion of m, solving the congruence f(x)=0 
(mod m), where f(x) is a polynomial with in- 
tegral coefficients, can be reduced to solving 
f(x) 0 (mod pf) (i— 1, ..., EL Also, solving a 
quadratic congruence can be reduced to solv- 
ing a linear congruence and a congruence of 
the form x? «a (mod m). 


H. Quadratic Residues 


When the congruence x? =a (mod m), where 
(a, m) 2 1, is solvable, a is said to be a quadratic 
residue modulo m; otherwise, a is said to be a 
quadratic nonresidue modulo m. The following 
two conditions are necessary and sufficient for 
à to be a quadratic residue modulo m: (i) a is a 
quadratic residue with respect to each of the 
prime factors p (#2); and (ii) a= 1 (mod 4) or 
az 1 (mod 8) according as 4|m or 8|m. 

Given a prime number p and integer a 
prime to p, the Legendre symbol (a/p) is by 
definition 1 or —1 according as a is a qua- 
dratic residue modulo p or not. The value of 
this symbol is determined by a (mod p), and 
we have (ab/p) = (a/p) (b/p). Hence the symbol 
determines a tcharacter of (Z/pZ)* of order 2. 
Furthermore, the congruence (a/p) = op 17 
(mod p) holds (Euler's criterion). 


I. Reciprocity Law 


For odd primes p and q (p z q), the formulas 
(p/a)(alp) - (— 1)(71)2)471)2) 
(-1p)-(-1y9-92, 2/p)=(— 9" 


are called the law of quadratic reciprocity of 
the Legendre symbol, the first complementary 
law, and the second complementary law, re- 
spectively. These laws were conjectured by 
Euler and first proved by Gauss, who gave 
seven different proofs. We now have more 
than fifty different proofs of these laws. P. 
Bachmann (Niedere Zahlentheorie I (1902)) 
lists the 47 different proofs of the laws known 
at the time. Gauss's first proof was elementary; 
his second proof used quadratic forms. The 
latter has been reformulated utilizing the 
theory of quadratic fields [4]. The fourth proof 
used tGaussian sums, and the sixth proof used 
algebraic congruences with integral coeffi- 
cients. His seventh proof, contained in his 
posthumous works, used congruences of higher 
degree (4]. His fourth, sixth, and seventh 
proofs are related to the arithmetic of tcyclo- 
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tomic fields [4]. The third and fifth proofs 
are the most elementary and simple. They are 
based on Gauss's lemma: Let r,,r,, ..., Da Aus 
be the residues of divisions of la, 2a, ...,(p— 
1)a/2 by an odd prime p, and let n be the 
number of these residues that are greater than 
p/2. Then we have (a/p)=(—1)". T. Takagi gave 
a simplified exposition of the third proof using 
geometric figures (1904). The same method 
was rediscovered by G. Frobenius (1914). 

When m is an odd integer such that m— 
X Tun, (m, n) — 1, we call (n/m) - TL(n/p;Y': 
Jacobi's symbol. If m has no square factor, it is 
a character of (Z/mZ)*. If we put sgnm + I 
or — 1 according as m>0 or m «0, we have 
the following law of quadratic reciprocity of 
Jacobi's symbol and its complementary laws: 

If m and n are relatively prime odd numbers, 
then 


(m/n)(n/m) 


= ( aes [ym Diener )2)* (gn m—1y2)(sgnn—1y2) 
(-1/n) 5 (— 1)" De tee 12. 

(2/n) UD. 

where 

n* 5 (— 1)" Dg, 


Furthermore, tKronecker’s symbol, another 
generalization of the Legendre symbol, is used 
in the theory of quadratic number fields (— 
347 Quadratic Fields). 


References 


[1] C. F. Gauss, Werke 2, Konigliche Gesell- 
schaft der Wissenschaften, Góttingen, 1863. 
[2] C. F. Gauss, Sechs Beweise des Funda- 
mentaltheorems über quadratische Reste, 
Ostwald's Klassiker Nr. 122, Wilhelm Engel- 
mann, 1901. 

[3] L. E. Dickson, History of the theory of 
numbers I, Carnegie Institution of Washing- 
ton, 1919 (Chelsea, 1952). 

[4] P. G. L. Dirichlet, Vorlesungen über Zah- 
lentheorie, Herausgegeben und mit Zusátzen 
versehen von R. Dedekind, Vieweg, fourth 
edition, 1894 (Chelsea, 1969). 

[5] H. Hasse, Vorlesungen über Zahlentheorie, 
Springer, 1950. 

[6] E. G. H. Landau, Vorlesungen über Zahlen- 
theorie I, Hirzel, 1927 (Chelsea, 1946). 

[7] W. J. LeVeque, Topics in number theory, 
Addison-Wesley, 1956. 

[8] H. Rademacher, Lectures on elementary 
number theory, Blaisdell, 1964. 

[9] Z. I. Borevich and I. R. Shafarevich, Num- 
ber theory, Academic Press, 1966. (Original in 
Russian, 1964.) 


1115 


[10] W. Sierpinski, Elementary theory of 
numbers, Hafner, 1964. 


298 (XV.6) 
Numerical Computation of 
Eigenvalues 


A. General Remarks 


Numerical computation of feigenvalues and 
teigenvectors of a matrix provides a basic 
technique for the numerical solution of various 
eigenvalue problems. Roughly speaking, there 
are two kinds of method. In methods of the 
first kind, one determines the tcharacteristic 
polynomial p(4) 2 det(41 — A) (where I is the 
unit matrix) of A (or to give an algorithm to 
calculate the value of p(A) for an arbitrary å), 
then solves the algebraic equation p(4) 20 
numerically to obtain the eigenvalues 4, (u= 
1,2, ...) (^ 301 Numerical Solution of Alge- 
braic Equations), and finally to one deter- 
mines the eigenvectors x, by means of the 
equations (4,7 — A)x, =0. In methods of the 
second kind, one obtains eigenvalues and 
eigenvectors directly without resorting to the 
solution of an algebraic equation, relying in- 
stead upon repeated application of similarity 
transformations. (The power method does not 
fit into this classification, being a different 
approach altogether; — Section C.) In partic- 
ular, a real symmetric or complex Hermitian 
matrix is reduced approximately to a diagonal 
matrix. In this article we denote a given square 
matrix of order n by A —(aj)) (ij — 1, ..., n). 


B. The Jacobi Method 


The Jacobi method is an iterative technique 
for determining all the eigenvalues and eigen- 
vectors of a real symmetric matrix (7]. Before 
the advent of high-speed computers it was 

not considered practical, but at present it is 
one of the most compact and elegant methods. 
The following algorithm can be extended 

to Hermitian matrices by replacing tortho- 
gonal transformations by suitable tunitary 
transformations. 

Roughly, the method transforms a given 
matrix A —(a;j) (aj; — aj; j= 1, ...,n) into a 
diagonal one by repeated application of 2- 
dimensional rotations of the reference axes. 
We first put A = A, U =], and compute 
AXO, A0), ,,, , UO), UO), ... successively as 
follows: 

(1) Select an off-diagonal element of A? = 
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(a?) with the maximum absolute value and 
denote it by aO). 
(2) Compute 


2a) sgn(a®), — at 
at — a 4- / (at) — 


ao +4(a St 
(where sgnx — 1, 0, or —1 according as x ^ 0, 


tand= 








-—0, or <0), 
cos0 —(1--tan?0) !?, and 
sin Ó =tan0-cos 6; 
and form T” — (t)), where t9, — 1 —cos 0, t9 = 
1 for iz p, q, — t9 =10 —sin6, and t? =0 for 
all other (i, j). 

(3) Determine ATD and UCHD py ACY = 
T" A? T? (^ denotes the transpose) and U**? 
— UT”. In this process, T'? represents an 
orthogonal transformation (rotation) in the 
plane spanned by the pth and qth coordinate 
axes such that e o. ? is nullified. If we 
put N(B) 2 X, jb and M(B)= Y, ,; bj, then 
N(B) is Eeer Geen an orthogonal trans- 
formation, so that N(A®)= N(A). Further- 
more, since d" — af (iz p,q) and (a0; Y + 
(at? = (a0)? + (a)? + 2(a)^, we have 
M (A"*!) = M(49) — 2(a0y". Since at) has the 
maximum absolute value among all the off- 
diagonal elements, we have (a)? > M(A9)/ 
(n? — n). Therefore 


M(A“™”) <(1—2/(n* —n))M(A®) 


<(1—2/(n’ —n))'*'M(A) 


< M(Ajexp( — 2( + 1)/(n? —n)). 


It has been proved a fortiori [13] that, after 
M (A9) comes down to below a certain thresh- 
old value, the convergence of the iteration 
process becomes quadratic, i.e., there is a num- 
ber c determined by the order n of A and the 
arrangement of the eigenvalues of A such that 
M(AU*"t^-D/2)) — c( M (A9))?, Since the set of 
eigenvalues of A? coincides with that of A 
and the eigenvalues of an arbitrary symmetric 
matrix B can be made to correspond in a one- 
to-one way to its diagonal elements in such a 
way that the difference between an eigenvalue 
and the corresponding diagonal element is not 
greater than M(B)'?, a tends to 4; (i— 1, 

,n) as 1 tends to infinity. Moreover, as : 
tends to infinity, each column vector of U® = 
(u?) tends to the EE eigenvector, 
in the sense that Eg- auf) — Aju? 5.0. 

The number of arithmetic computations 
required to obtain A“*”) and UCHD from A? 
and U” is at most proportional to n, so that 
for a given e>0, the arithmetic required to 
reduce max; af? — 4;| below £(M(A))'? is at 
most proportional to n? (because z is at most 
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proportional to n?). On the other hand, the 
search for an off-diagonal element of A? with 
the maximum absolute value, if it is done by 
simply comparing all the elements, will require 
effort proportional to nê, so that the amount 
of work required by the searching process is 
proportional to n*. To bypass this searching 
process, the cyclic Jacobi method and the 
threshold Jacobi method are often used. The 
former method adopts as af) the off-diagonal 
element for which q p and ;2 (p — 1)(n— p/2) 
t (q — p). that is, aq, at, etta ah at, a, 
... are adopted in this sequence. The latter 
method adopts as a") off-diagonal elements 

in a sequence similar to the one above as long 
as they exceed a given threshold value; but if 
an element is less than that threshold value, 
then the next element in the sequence is a can- 
didate for adoption as a‘), where the thresh- 
old value is made to decrease gradually as 

the iteration process proceeds. However, the 
search for an element with the maximum 
absolute value can be done more effectively by 
taking account of the fact that only the matrix 
elements lying in rows p and q and in columns 
p and q change their values when we transform 
A? into A'*' In fact, we can record for each 
row the value as well as the position of the 
(off-diagonal) element with the maximum 
absolute value in that row. By so doing, the 
effort of searching for an off-diagonal element 
with the maximum absolute value can be 
reduced to something proportional to n on the 
average. 


C. The Power Method 


The power method is suitable for obtaining 
only the eigenvalue of maximum absolute 
value [6]. Let us assume that the eigenvalues 
A,,+.-,4, Of A are arranged so that |, | 

A5] Z|44| 2... 2 |A,], with 2, real, and de- 
note by y, the left eigenvector corresponding 
to 4, (which means y,(4,1 — A) - 0). Starting 
from an arbitrary (real) vector x such that 
(y1, x9) and 7{°=1 for a prescribed ig, 
we compute 09, 9%, ... and x™, x)... by 
Ax = 69x") (1—0, 1,2, ...; x8) = 1). Then 
we have lim, ,,, 0? — 4, and Wm, x? — x, 
(the eigenvector corresponding to the eigen- 
value 4,). The rate of convergence depends on 
|A,/A,| if the telementary divisor of A cor- 
responding to 4, is linear, but in the case of a 
nonlinear elementary divisor, the convergence 
is too slow for practical purposes. If 4, #42 
and |4,|=[4,|>|43|2...2|4,|, then the se- 
quences of 0 and x? computed by the for- 
mulas above do not converge but in general 
oscillate. However, from 0, 9&4), x0. xe*D. 
x"*?) for a sufficiently large 1, we can obtain 


i> 0 
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approximate eigenvalues 4, and 4, as the two 
roots of the quadratic equation in 4: 


y? x? pgoget» 

yit? xn 004 =0 
+2 +2 2 

jp e 


(i and j arbitrary, i Æj). 


The corresponding eigenvectors are given by 
KSA Ma and x, =A xt xeu 
Complex conjugate pairs of eigenvalues can be 
dealt with in this manner. This is useful also 
when |/,|=|A,|. The extension to the case of 
more than two eigenvalues with the same 
maximum absolute value is obvious. 

In order to determine the remaining eigen- 
values by the power method we have to com- 
bine it with the deflation or transformation of 
matrices, as discussed later in this section and 
in the following section. The amount of com- 
putation depends on the arrangement of the 
eigenvalues of A and the required accuracy. 
We note that the multiplication of a matrix by 
a vector requires an amount of computation 
proportional to n°. 

Improvement in the approximations can be 
incorporated into the power method. If v=x, 
+ O(c) is an approximation to the eigenvector 
x, corresponding to an eigenvalue 4, of a real 
symmetric matrix A, then the Rayleigh quo- 
tient Ap — (v, Av)/(v, v) affords a good approxi- 
mation to 4,. In fact, we have |4, — Ag| — O(e?). 

If A; ((— 1, ..., n) is an eigenvalue of A and 
x, the corresponding eigenvector, then P(4,) 
(i=1,...,n) is an eigenvalue of P(A) and x; 
the corresponding eigenvector, where P(¢) is 
a polynomial in č and & !. This fact can be 
utilized to transform the magnitudes of eigen- 
values to accelerate the convergence of the 
power method, to separate eigenvalues with 
the same absolute value, or to obtain inter- 
mediate eigenvalues. The choice P(A)=A—c 
or (4— c)! is particularly useful, where c is 
an appropriate constant. In fact, an efficient 
algorithm known as inverse iteration exists 
for computing an approximate eigenvector 
when a good approximation to an eigenvalue 
is known. Given a trial eigenvector x cor- 
responding to a computed eigenvalue c, one 
computes an improved approximate eigen- 
vector y by solving (A —cl)y=x or y=(A— 
e) *x. 

Aitken’s ó?-method is also efficient in accel- 
erating the convergence of the power method. 

When an eigenvalue-eigenvector pair is 
known, another eigenvalue-eigenvector pair 
can be computed using the process known as 
deflation. If an eigenvalue 4, and the corre- 
sponding eigenvector x, (and also the corre- 
sponding left eigenvector 'y, if necessary) of A 
are known, it is possible to "subtract" them 
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from A to get a problem containing only the 
remaining eigenvalues. Such deflations are 
often used in combination with the power 
method. The following are two examples of 
deflation methods. 

(1) Assuming that x, and y, are normalized 
in such a way that (y,, x,) - 1, form B= A— 

A Eu Then B has the same set of eigen- 
values and eigenvectors as A except for 4,. The 
eigenvalue and the eigenvector of B corre- 
sponding to 4, of A are 0 and x,, respectively. 
This kind of deflation process can be gen- 
eralized to the case of nonlinear elementary 
divisors, but that becomes somewhat more 
complicated. 

(2) After normalizing x, so that its nth com- 
ponent x,, is equal to 1, form B=(b;):b; =a; 
— Xyidy (lj — 1, ..., n — 1). Then B has the same 
set of eigenvalues as A except for A, If w, is 
the eigenvector corresponding to the eigen- 
value 4, of B, then the corresponding eigen- 
vector x, of A is given by x, — Wark dux, (i= 
L...,n— 1), xy, — d,, where d, is determined 
from 37-1 niw, — (4 — 4,)d, if 4, #A,. If A, =A, 
and r= SH, a,;w,; —0, then we can put d, =0. 
If 4, — 4, and r ZO, A has a nonlinear elemen- 
tary divisor for 4, — Au and the x, defined by 
Xy = W,;/r and Xy, =Q is a generalized eigen- 
vector of A in the sense that Ax, — 4, x, + Xp- 


D. Transformation of Matrices 


There are a number of methods of transform- 
ing a given matrix A by means of a suitable 
similarity transformation A B—S !4AS into 
another matrix B for which it is easier to solve 
the eigenvalue problem. The Givens method 
[9] transforms a symmetric matrix A into a 
tridiagonal matrix B (i.c., a matrix such that bj; 
=0 for |i —j| 22) by means of a matrix S which 
is the product of 2-dimensional rotation ma- 
trices. The Householder method [10] also trans- 
forms a symmetric A into a tridiagonal B by 
means of an orthogonal matrix S of special 
type, i.e., a reflection matrix [-2uu* (u*u=1, 

u* =conjugate transpose of u), and the Lanc- 
zos method [8] transforms a general A into a 
tridiagonal B. To general matrices the follow- 
ing methods are also applicable: (1) The Dani- 
levskii method [1] transforms A into its com- 
panion matrix B by repeated application of 
elimination operations. Here, row interchanges 
can be combined to increase numerical sta- 
bility [14]. (2) The Hessenberg method [3] 
transforms A into a B such that b; — 0 for 
i—jz2 with a triangular S. (3) The Givens 
method [9] transforms A into a B of the same 
form as in (2) by repeated application of 2- 
dimensional rotations. This method now tends 
to give way to the Householder method or to 
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the elimination method with row interchange. 
All these methods require an amount of com- 
putation proportional to n°. In general, special 
treatment is necessary for the case of multiple 
eigenvalues. As an example, we explain the 
Givens method for general matrices. Let N — 
(n— 1)(n— 2/2. For :=0, 1,..., N—1, choose 
(p, q) = (3, 2), (4, 2), ....(n, 2); (4, 3), (5, 3), ..., 

(n, 3); ...; (n—1,n—2), (n,n —2); (nn — 1) in this 
order. Using T" of the same form as in the 
Jacobi method, and setting tan 0 =a"), dal, , 
in this case, calculate A) = A, UY — J, AY = 
Th AOTO, UD = UOT® ((|—0, 1, ..., N — 1), 
and put B= A“. Then we have b,,=0 for i— 
jZ 2. We can solve the eigenvalue problem for 
this simplified B and then retransform the 
eigenvectors of B thus obtained into those of 
A by means of U™. It should be noted that 
the method of bisection based on "Sturm e 
theorem is effectively used to solve the char- 
acteristic equation of a real tridiagonal ma- 
trix [14]. This method is remarkably stable 
numerically. It is used when the number of 
eigenvalues to be computed is small relative to 
the order of the given matrix. If all eigenvalues 
are desired, an alternative method such as the 
QR method (— Section F) is recommended. 


E. The Lanczos Method 


A more detailed exposition of the Lanczos 
method is now given. Let A be a given real 
matrix of order n. Pick two vectors c, and 6,. 
Determine recursively the vectors c;,, and 
Gun, 1— 1, ..., n, that satisfy the following con- 
ditions: (i) Y;+1 Ci+1 = Aci — HC; ~ Bici zb, 
Jii Čia 7 A6; — 8,6; — Biči- m bii i= 2, ...,0; 
(ii) c;,, is orthogonal to é;_, and 6; (iii) &j,, is 
orthogonal to c;_, and c;, where «;, Bi, i, and 
B, are scalars, and where y; and 7; are nor- 
malizing scalars. Actually, x; = €; Ac;/C;c; — à, 
Beef ug, Bim Metal uf 1, where 
C;€;, i— 1, ..., n, are assumed nonzero. It can 
be shown that c;,, is orthogonal to every 6j, 
| <j <i, and that ¢;,, is orthogonal to every c;, 
1<j<i(c,4,=€,4, —0). The conclusion is that 
the given matrix A is similar to the tridiagonal 
matrix H — (h;), where hi; — a; (i— 1, ..., n) hi ii, 
= Bias Baan uim Bean (—2,..., n) In fact, if C 
denotes the matrix whose jth column equals c; 
(j=1,...,n), one obtains C ! AC =H. Thus the 
eigenvalues of A are identical to those of H. 

In principle, the Lanczos method transforms 
a given matrix to a tridiagonal matrix similar 
to it if every c;&; is nonzero (i= 1, ..., n). If cé 
does vanish for a certain i, one selects c, and 
€, again and restarts. On the other hand, if 
b;,, =0 one chooses an arbitrary vector c;,, 
orthogonal to ¢,,...,é, and if 5,,, =0 one 
selects an arbitrary vector ¢,,, orthogonal to 
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C,,-.-,¢;. In the actual numerical computation, 
the distinction between zero vectors and non- 
zero vectors is usually blurred by rounding 
errors. 

In the application of the Lanczos method 
one often observes the loss of orthogonality 
c;C; — 0 (i#j). This usually results from cancel- 
lation errors in the computation of b;,, and 
b;.,. If the orthogonality is lost, CC") AC may 
significantly deviate from a tridiagonal matrix. 
As a practical means for preserving the in- 
dicated orthogonality one can reorthogonalize 
the vectors c; and &;. Indeed, one can add 
to the computed c;,, a linear combination 
of c,, ...,c; so that the sum is orthogonal to 
Či» ..., €, then take the sum as c;,, after pro- 
perly normalizing. A similar process applies to 
Gen, The Lanczos method is often applied 
in double precision. It has been suggested 
that one first reduces the given matrix to an 
upper Hessenberg form by using the Hessen- 
berg method with row interchange before 
applying the Lanczos method [15]. 

À further remark on the Lanczos method is 
in order. Recall that y; is determined from 
Aci, Cj, %-1, and B;_,; B, from cj, €;, Ci-15 rn 
and ¥,; and o; from Ac,, c;, and ¢;. This shows 
that the Lanczos method applied to a sparse 
matrix (a matrix whose elements are almost all 
zero) requires only a memory proportional to 
n. The Householder method, on the other 
hand, requires memory proportional to n? 
even for a sparse matrix. 

For a real symmetric matrix one can modify 
the Lanczos method for a general matrix so 
that C is orthogonal and H is real, symmetric, 
and tridiagonal (the details are omitted). 


F. The QR Method 


The QR method was discovered independently 
by J. G. F. Francis and by V. N. Kublanov- 
skaya in 1961 [15]. The method has been 
improved and extended considerably since 
then. In the usual application to matrix eigen- 
value problems, the QR method provides the 
most efficient iterative process for finding all 
eigenvalues of a given matrix. The matrix is 
reduced by means of a similarity transforma- 
tion to Hessenberg form or to tridiagonal form 
before application of the QR method; the re- 
duction process may be effected by the House- 
holder method or by the elimination method 
with row interchange. (If the given matrix 

is a complex matrix, the latter is preferable.) 
The reason is that one step of the QR process 
applied to a full matrix is prohibitively expen- 
sive, requiring a number of operations propor- 
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tional to n?, while one step of the QR process 
applied to a Hessenberg matrix requires a 
number of operations proportional to n?. 

We now describe one of the most useful 
versions of the QR method. Let A= A, bea 
given matrix, and define a sequence of matrices 
A, A5, ... obtained from A, as follows. At the 
ith (i=0, 1, ...) step, choose an appropriate 
constant s;, called an origin shift, according to 
the process described below, and decompose 
Aj — 8,1 into the product of a unitary matrix 
Q; and an upper Hessenberg matrix R;:A;— 
si:l =Q;R;. The matrix A,,, is then defined by 
Aj4, =R,Q;4+ 8,1 = QF A,Q;. Hence A;,, is simi- 
lar to A, The QR method is closely related to 
the power method and to the inverse power 
method. We describe this relationship in order 
to obtain an insight into the nature of the QR 
method. To this end, we first state a well- 
known theorem: Let A be diagonalizable, and 
let its eigenvalues 4; (i— 1, ..., n) have distinct 
moduli, say |4,|»|45| » ...»|A,]. Let X ! AX 
— diag(4,, 45, ..., 4,], and let X have an LU- 
decomposition X = LU, where L and U are, 
respectively, lower triangular matrix and an 
upper triangular matrix. Then the QR al- 
gorithm without an origin shift (s;=0, i= 
0, 1,2, ...) behaves as follows: a) —0 (p >q); 
at —À,; a) oscillates (p <q), p, q— 1,2, ..., n, 
i— oo, where a(? denotes the (p, q)th element 
of A;. In other words, A; approaches an upper 
triangular matrix as i— oo so that the diagonal 
elements of A; converge to the eigenvalues of 
A. 

The relationship between the QR algorithm 
and the power method is given by (QoQ, --.Q) 
(RR, Ro) -(A—soI)(A—s,D)...(A— s;I). 
If so=s, =... =s;=0, then the right-hand side 
reduces to A‘*?, and since the product R;... Ro 
is upper triangular, the first column of 0,;= 
QoQ, ...Q; equals a scalar multiple of Ale, 
where e, =(1,0,0,...,0)’. Therefore, by the 
power method, the first column of Q; converges 
to an eigenvector corresponding to the eigen- 
value 4, of A that has the largest modulus, 
under a certain fairly mild condition. Since 
Ai+ı = QF AQ, Ai 061 = OF Alen den for 
large i. In other words, the first column of A; 
converges to the vector (4,,0,0, ...,0) as ioo. 

The relationship of the QR method to the 
inverse power method and to the Rayleigh 
quotient is now explained. From the definition 
of the QR algorithm with an origin shift, we 
have Q; — [(A—s;I) ! ]J*R£. Since R* is a lower 
triangular matrix, the nth column of Q; equals 
Q;e, — [(4;— 5I) ' Rte, —(A— |I) ! -(a 
scalar multiple of e,), where e, — (0,0, ...,0, LY. 
This last process of obtaining the last column 
of Q; represents a process known as the inverse 
power method. If s; is close to an eigenvalue of 
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A, (and hence to an eigenvalue of A), the last 
column of A; gives a good approximate eigen- 
vector of A corresponding to the eigenvalue 
under consideration. Now, if x is a given col- 
umn vector such that x*x — 1, the value of 4 
which minimizes (Ax — Ax)*(Ax — Ax) is given 
by the Rayleigh quotient x* Ax. If x(x*x — 1) 
happens to be an eigenvector of A, the Ray- 
leigh quotient equals an eigenvalue of A. If 
we take x — e,, the corresponding Rayleigh 
quotient equals a,,. Therefore if we take the 
(n, n)th element of A; as the origin shift s;, then 
s; can be regarded as the best approximation 
to an eigenvalue of A; (hence of A) when e, is 
taken as an approximate eigenvector of A,, in 
the sense that 4— s; minimizes the functional 
(Aje, — 4e,)*(A;e, — Aen). 

Under the same condition as in the preceed- 
ing theorem, the rate of convergence of the 
QR method with an origin shift is given as fol- 
lows: a (n> p> q2 1) is asymptotically pro- 
portional to (4,/A,)' for s; — 0 (i=0, 1,...) (no 
origin shift); and with the origin shift s,, the 
behavior of a‘) at the ith step is determined by 
(A, — S))/(À4 — si). If 5; =A, (=the eigenvalue of A 
with the least modulus), each element in the 
nth row of A;,, except a@*” exhibits rapid 
decrease in modulus. A natural and practical 
choice of the origin shift s; is given as follows: 
(1) for a real tridiagonal matrix, s; is taken to 
be that eigenvalue of the 2 x 2 matrix situated 
at the lower right corner of A; that is closer 
to a? [14]; (ii) for a real upper Hessenberg 
matrix, the two eigenvalues of the 2 x 2 matrix 
situated at the lower right corner of A; as 
s; and s;,, [14]; (iii) for a complex upper 
Hessenberg matrix, s; is chosen as in (1) [17, 
COMQR]. 

The QR method with origin shift would 
eventually make each element in the nth row, 
except a, smaller in modulus than a pre- 
scribed positive number. At this stage of itera- 
tion, a? is taken as an approximate eigenvalue 
of A. The QR method is then applied anew to 
the (n— 1) x (n— 1) matrix obtained from A; by 
deleting the nth row and the nth column of A;, 
and another approximate eigenvalue of A is 
obtained. The method proceeds similarly until 
all the eigenvalues of 4 are computed. For 
maximum accuracy the eigenvalues of the 
given matrix should be computed in the order 
of increasing modulus. A word of caution is in 
order. When the given matrix A has elements 
of greatly varing modulus, rearrangement of 
elements of A may be necessary before apply- 
ing the QR method with an explicit origin 
shift, where A; — s;I (1—0, 1, ...) is explicitly 
computed. 

The reader is referred to [14—16] for details 
of the QR method. 
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G. Generalized Eigenvalue Problem Ax = ABx 


An eigenvalue problem of the type Ax = ABx ts 
called a generalized eigenvalue problem and is 
often encountered in applied mathematics. A 
necessary and sufficient condition for A to be 
an eigenvalue is det(A —AB)=0. If B^! exists, 
then Ax = ABx is equivalent to B! Ax 2 Ax and 
hence has n eigenvalues, where n is the order of 
A. If B^! does not exist, the eigenvalue prob- 
lem Ax — ABx has at most n — 1 eigenvalues. 
We restrict ourselves to one of the most im- 
portant cases: that where A is real and sym- 
metric and B is real, symmetric, and positive 
definite. In this case one could solve the prob- 
lem by reformulating it as B. ! Ax — Ax, where 
B !Ais explicitly computed. However, B^! A 
is not in general symmetric. Moreover, when 
B has eigenvalues of widely different moduli, 
the elements of B^! may have widely different 
moduli as well, which would in turn make the 
computation of those eigenvalues of B ! A of 
smaller moduli difficult. An efficient and nu- 
merically stable method is known which obvi- 
ates the aforementioned difficulty by exploit- 
ing the symmetry of A and B. An outline of 
this procedure is now given. Since B is positive 
definite, a lower triangular matrix L exists 
such that LL’ = B. This is called the Cholesky 
decomposition of B. The elements of L can be 
computed by equating the corresponding ele- 
ments in LL' = B. The eigenvalue problem Ax 
= ABx is now equivalent to L' ! A(L) t y = 
Ay with y= L'x, where the matrix L ! A(L') ! 
— P is computed in two stages by solving 
LX =A and PL' — X. In this last equation it is 
enough to compute the upper right half of P, 
because P is symmetric while X is not gener- 
ally symmetric. The upper right half of P can 
be computed by equating the corresponding 
elements in PL’ = X. The eigenvalues / of 
Ax — ABx are given by the eigenvalues of the 
matrix P. Other types of eigenvalue problems, 
such as y AB=Ay’, BAy - Ay, and x' BA AN, 
often appear in practice, where A is real and 
symmetric and B is real, symmetric, and posi- 
tive definite. By using the Cholesky decomposi- 
tion of B, one can reduce any one of these 
eigenvalue problems to the ordinary eigen- 
value problem for a real symmetric matrix [4]. 
If A and B are general matrices in Ax = ABx, 
the following method is known to be effective 
[18]. First, reduce B to an upper triangular 
matrix by applying n — 1 Householder trans- 
formations from the left. This reduces the 
eigenvalue problem Ax = ABx to the case 
where B is upper triangular. Next, apply to A 
a sequence of plane rotations of Givens type 
from the left, thereby reducing the eigenvalue 
problem Ax — ABx to the case where A is an 
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upper Hessenberg matrix and B is an upper 
triangular matrix. Then apply the QR method 
to B" Ax —Ax without explicitly computing 
B !A to reduce the eigenvalue problem Ax 

= ABx to the case where A and B are both 
approximately upper triangular. The eigen- 
values of Ax = ABx are then easily computed 
as ratios of corresponding diagonal elements. 
This method is called the QZ method [18]. 

A collection of about 50 excellent FOR- 
TRAN subroutines for various types of ma- 
trix eigenvalue problems is contained in 
[18]. These subroutines are in most part trans- 
lations from ALGOL procedures given in 


[14]. 
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A. Interpolatory Integration Formulas 


Numerical integration is a method of finding 
an approximate numerical value of a definite 
integral of a given function f(x). Usually the 
integral f? f(x)dx or (2 f(x)w(x)dx (w(x) is the 
weight function) is approximated by a linear 
combination X7, W,f(x,) of the values of the 
integrand at the points x1, X3, ..., x,. Integra- 
tion formulas are divided into two groups, 
the interpolatory formulas and the formulas 
based on variable transformation. 

In order to obtain an interpolatory formula, 
we interpolate over the integrand f(x) at n 
points x4, X», ..., x, by means of the fLagrange 
interpolation polynomial of degree n — 1, and 
then integrate the polynomial over [a, b]. 
Depending on the selection of the points x; 
and the weights W,, we have several kinds of 
formulas. 


(1) Newton-Cotes Formulas. We assume w(x) 
— constant and x; 2 xg 4- ih (i—0, 1,..., n). The 
weights W, are so determined that the value of 
the integral can be calculated accurately if the 
integrand f(x) is a polynomial whose degree 
does not exceed n. There formulas are called 
the Newton-Cotes formulas: (7: f(x)dx =(fo + 
f) h/2 for n= 1 (trapezoidal rule), (7: f(x) dx = 
(fo - Af, + f3)h/3 for n 22 (Simpson's 1/3 rule), 
and [25 f(x)dx — (fo - 3f. + 3f; + f3)3h/8 for 
n — 3 (Simpson's 3/8 rule). The truncation 
errors of these three formulas are given by 
h? f (212, hf (2/90, 3h5 f (2/80, respec- 
tively, where Z is a number in the interval of 
integration and f ? denotes the ith derivative 
of f (differentiability is assumed). For an even 
n, the polynomial of degree n+ 1 can also be 
integrated accurately by these formulas. 

In Table 1 the coefficients A and B; of the 
Newton-Cotes formulas hA 377.9 Bif (x), h= 
(x, — xo)/n, and the coefficients C of the error 
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Table 1 

n A B, B, B, B, B, B, B, B. B € 

1 12 1 1 —1/12 

2 17 1 4 1 — 1/90 

3 3/8 1 3 3 1 — 3/80 

4 4/45 7 32 Ir 29 7 — 8/945 

5 5/288 19 75 50 ap 75 19 — 275/12096 
6 1/140 41 216 27^ 37 27 216 41 —9/1400 

7 7/17280 751 3577 1323 2989 2989 1323 3577 751 — 8183/518400 
8 4/14175 989 5888 —928 10496 —4540 10496 —928 5888 989 —2368/467775 


term Ch?*! f PXE), where p=n+2 if n is even 
and p=n+1 if n is odd, are given. 

When the interval [a, b] of integration is 
large, we usually divide it into small subinter- 
vals and apply formulas for small n for each 
part rather than formulas for large n for the 
whole interval. For example, if the interval is 
divided into m equal subintervals, we get the 
following formula by applying the trapezoidal 
rule for each subinterval: 


| f(x) dx — h((fo T2 F6 Ff eua) 


where x —a, X,, —b, h- (b — a)/m, with trun- 
cation error (b — a? f ?/(12m?). (Here, as in the 
rest of the article, f stands for f ?(£), with 
differentiability assumed as before.) By apply- 
ing Simpson's 1/3 rule we obtain the formula 


| fenes 


2 
SAUER KEE + EEN 


+h +fa+ Wee + fan-2)) 


where xy — a, x44,7 b, h=(b—a)/2m, with 





the correction term h( f, — f..,)/24 + h( f, — 


Jm+1)/24=h(Afo + Af -,)/24 RAL, + Mfn)/24, 


where Af; means f;,, — fj. 


(2) Chebyshev Formulas. The Chebyshev for- 
mulas are a family of integration formulas in 
which all the weights W, oft, W,f(x,) are 
equal, while the abscissas x; are chosen so that 
the integral can be evaluated exactly when f(x) 
is an arbitrary polynomial whose degree does 
not exceed n. When f4, f(x) dx 3 WX. f(x), it 
is easy to see that W — 2/n since the right-hand 
side must be equal to the left-hand side when 
f(x)=1. It is known that the abscissas x; for 

n x7 and n—9 are real, while for n=8 and 

n= 10 at least one of the abscissas becomes 
complex. It is easy to see that Chebyshev for- 
mulas are interpolatory. 


(3) Gauss Formulas. In the Gauss formulas 
both the weights W, and the abscissas x; are 
chosen so that we obtain the accurate value of 
the integral when the integrand is any poly- 
nomial whose degree does not exceed 2n— 1. If 
we put /7/(x)  [DI7-1(x —x,), an arbitrary poly- 
nomial of degree 2n — 1 can be expressed in the 
form 


truncation error (b — a)? f '9/180(2m)*. n Dis M 
; di : " x)= A — —— — —— f, + H(x a,x", 
For the integral of a periodic analytic func I(x) 2 (x —x)I (xj) h ( )2. k 


tion over a single period the trapezoidal rule 
with equally spaced points gives the best result 
asymptotically, as the number of points tends 
to infinity. 

Newton-Cotes formulas can also be ob- 
tained by integrating over the interval [a, b] 
the interpolation polynomial for equally 
spaced points. The formulas mentioned so 
far are called closed formulas since they use 
values at the two endpoints of the interval. 
We can also use open formulas, which do 
not use values at the ends. For example, we 
have fž4 f(x) dx =4h(2f, — f; + 2f3)/3 with trun- 
cation error 14h5f 9/45. Open formulas are 
useful for numerical solution of differential 
equations. There are also formulas that in- 
clude values outside the interval, for example, 
(— f+ 13fot 13f, — f)h/24 for (7 f(x)dx. 
Applying these formulas to the m subintervals 
of [a, b], we obtain a trapezoidal formula with 





where f(x,) — f, and the first term is the *La- 
grange interpolation polynomial. By the as- 
sumption that the integral of f(x) with weight 
w(x) equals 27, Wi we obtain W, = 

[2 (w(x) T(x) — x, T (x,)) dx and the relations 
k=0,1,... 


b 
| w(x)IT(x)x* dx =0, n=l. 


Accordingly, the abscissas x; are determined as 
the roots of the polynomial /7(x) of degree n 
that is orthogonal to x* (k «0, 1, ...,n— 1) with 
respect to the weight function w(x). 

The following are typical examples of inte- 
gration formulas of Gaussian type. 

(1) Gauss integration formulas (in the 
narrow sense). For w(x) — 1 and the interval 
[—1,1], M(x) is the *Legendre polynomial 
P,(x)=(1/2"n!)d"(x? — 1"/dx". The error is 
(ml 2?"*! £2” On + Dal, 
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(ii) Gauss-Laguerre formulas. For the 
weight w(x) — exp(— x) and the interval [0, oc), 
Il(x) is the tLaguerre polynomial L,(x) — 

(exp x)d"(x" exp( — x))/dx". 

(iii) Gauss-Hermite formulas. For the weight 
function w(x) 2 exp( — x?) and the interval 
( —oo, oo), /7(x) is the fHermite polynomial 
H,(x) 2 ( —1l'expx?- d"exp( — x?)/dx". 

(iv) Gauss-Chebyshev formulas. For the 
weight function w(x)=(1 — x?) '? and the 
interval [ —1, 1], we use the Chebyshev poly- 
nomial T,(x) — 2" P cos(narccos x). In this 
case, the W, are all equal to z/n. 

From the definition of W, we see that the 
Gauss formulas are interpolatory. 


(4) Clenshaw-Curtis Formulas. Although the 
Gauss formula is in general more accurate 
than the Newton-Cotes formula with the same 
number of points of interpolation, the points 
of interpolation for a Gauss formula of any 
order are distinct from those of any other 
order except the point zero, which appears in 
all formulas of odd order. The Clenshaw-Curtis 
formulas are interpolatory, with the points 
chosen so that the distribution of the points is 
similar to that of the Gauss formula and such 
that, in proceeding from a computation of 
order n to that of order 2n, all the function 
values evaluated in the former computation 
be used in the latter. For w(x)= 1, the inter- 
val [ —1,1], and even n, the points x, and the 
weights W, are given by 








T 
As CO, k=0,1,...,n; 
n 
mM = W, ; 
o n Wn’ 
4n2 | 2njs 
W= Wes z COS— —, 
nj2ol—4j n 
Ee 
$—1,2,...,. 
2 


>” means that the first and the last terms in 
the sum are to be halved. There are some 
different types of Clenshaw-Curtis formulas 
depending on the selection of the points x, [1]. 


B. Integration Formulas Based on Variable 
Transformation 


If the integrand has some singularity at the 
endpoint, any interpolatory formula based on 
interpolation with a polynomial does not give 
a good result. In such cases integration for- 
mulas based on variable transformation are 
quite effective. 
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(1) IMT Formula. The tEuler-Maclaurin 
formula is given by 


b 1 n-1 
| Min WEN 


a 


m ps , E 


SE TR, 








kim" 


1 n-1 
= (2m) 
Rn Gm! 1 B,,,(t) f5 Ise EZE vi dt, 


where the B,(t) are tBernoulli polynomials 

of degree n and the B, are !' Bernoulli num- 
bers (By = 1, B, = —1/2, B, = 1/6, B, =0, B = 
—1/30,...). This formula suggests that if the 
higher derivatives of the integrand vanish at 
the both endpoints, the error of the trape- 
zoidal rule with equally spaced points be- 
comes very small. The IMT formula is based 
on the idea of transforming the variable x of 
f3 f(x)dx in such a way that all the deriva- 
tives of the new integrand vanish at both end- 
points by taking x= p(t), p(t)=K ! fo y(t)dr, 
K =[§y(a)de, W(t) c exp( — c7! —(1—1)"1). 
Then the trapezoidal rule with h=1/n is 
applied to the transformed integral to obtain 
(1/Kn)2 ji V G/n)f(oG/n)) [1]. The asymptotic 
expression of the error for the IMT formula is 
proportional to exp(— C /n) with a positive 
constant C. 


(2) Double Exponential Formula. The trape- 
zoidal rule with equally spaced points ap- 
plied to the integral of an analytic function 
over (—oo, oo) gives in general a result of 

high accuracy. The double exponential for- 
mula is based on the idea of transforming 

Fa f(x)dx to (2, f(o(t))o'(t)dt with x= 
q(t) — tanh(izsinh t) and applying the trape- 
zoidal rule with a mesh size h, which results in 
hE f(o(nh))o'(nh) [4]. The name of the 
double exponential formula is attributed to the 
decay of q'(t) at t> coo, which is approxi- 
mately proportional to exp( — C exp|t|) with a 
positive constant C. The transformations x — 
exp(zsinh t) and x =sinh(}zsinht) give the 
double exponential formulas for the infinite 
integrals f f(x)dx and IS. f(x) dx, respec- 
tively. In the actual computation the infinite 
summation is truncated at appropriate upper 
and lower bounds. The asymptotic expres- 
sion of the error for the double exponential 
formula in terms of the number N of the sam- 
pling points actually used is proportional to 
exp( — CN/log N) with a positive constant C. 
The IMT formula and the double exponential 
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formula are robust against the singularities at 
the endpoints. 


C. Automatic Integration 


By an automatic integration scheme we mean a 
computer program for numerical integration 
of f? f(x)dx in which the user gives the limits 
of integration a and b, a subroutine for com- 
puting f(x), and an error tolerance e. Then the 
program gives a value of the integral which is 
expected to be correct within the tolerance e. 
Usually in an automatic integration scheme 
the mesh size is halved until the desired ac- 
curacy is attained. Automatic integration 
schemes are classified into two groups, non- 
adaptive schemes and adaptive schemes. In a 
nonadaptive scheme, the sequence of integra- 
tion points is chosen according to some fixed 
rule independent of the shape of the inte- 
grand. Newton-Cotes formulas, Clenshaw- 
Curtis formulas, IMT formulas, and double 
exponential formulas can be used as base 
formulas for nonadaptive schemes. 

From the historical point of view, Romberg 
integration should be mentioned; it is a kind of 
nonadaptive automatic integrator. Consider 
an integral I — f} f(x) dx. Divide the interval 
[a, b] of integration into 2* subintervals and 
apply the trapezoidal rule with the mesh size 
h — (b — a)/2*, which we denote by Tf". Then, 
starting from the values obtained for T®, 

k —0,1,..., we compute the sequence 

pem TET Ls 
4"—1| 

From the *Euler-Maclaurin formula the as- 

ymptotic error for the trapezoidal rule T/? can 

be expressed as I — Tf =c,h*+c,h*+...+ 

c, h?" + ..., where c,,=const x (f "7 (b) — 

f?" Xa)) does not depend on h. If we com- 

pute T® 2 (4T(* * P — T®)/3 using the values 

Tf) and T9, then we see that the asymp- 

totic error expression of T® becomes I — T{ = 

c h* -- ch +... cR?" + .... Romberg inte- 

gration is based on the 1dea of eliminating 

the term with h?" in the expression of the error 

by successive application of T9. This is an 

application of Richardson's extrapolation 

procedure. 

In the adaptive scheme, the points are 
chosen in a way that depends on the shape of 
the integrand. The Newton-Cotes formula of 
order 8, for example, is used as the base for- 
mula for the adaptive scheme. 


D. Approximate Multiple Integration 


If a region is a product region, such as a rect- 
angular parallelepiped, a product rule ob- 
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tained by forming the product of 1-dimen- 
sional formulas is useful. For integrals over a 
cube or a sphere there are monomial rules 
which are exact for a certain family of mono- 
mials. These rules can be used for integrals of 
dimension lower than 5 or 6. For integrals of 
higher dimension only methods based on 
sampling make sense. 


E. Numerical Differentiation 


In order to find the numerical value of the 
derivative f; = f '(x,) at a point x, from the 
tabulated values f, — f(x,), we usually use the 
derivative of the tLagrange interpolation 
formula. This gives 

WAS 1 
f E e (x, = xT)? i D Xp— "i 
where M(x) = II (x — x,). 

When we compute the derivative of a func- 

tion which can be evaluated at any point in a 
given interval, the approximation 


F(x +h)— fix-h) 


f'e9- 7 


is useful; similarly, we can use 


JGx-r h)— hg) 


f'e)- De 





It must be noted that, as h tends to zero, the 
difference f(x -- h) — f(x — h) comes to contain 
fewer significant digits, so that it is meaning- 
less to carry out { f(x + h) — f(x — h) )/2h be- 
yond a reasonable value of A. 
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300 (XV.1) 
Numerical Methods 


In the earlier history of mathematics, the de- 
velopment of methods of numerical calcula- 
tion was one of the main purposes of research. 
Until the beginning of this century, logarith- 
mic calculation played a central role in numer- 
ical calculation, and the main topic of this field 
was to make tables of values of functions. The 
digital electronic computer (— 75 Computers), 
which made its first appearance in the 1940s 
and has been developing at an exponential 
rate, has caused drastic changes in numerical 
technique. Problem solving by numerical 
methods has now become one of the fundamen- 
tal means of research in the physical sciences 
and engineering, and also in the social sciences 
and humanities. In this article we give exam- 
ples of the changes in numerical methods 
brought about by the availability of digital 
computers and portable calculators. 

Computers may be effectively utilized for 
calculating individual values of functions. This 
has led to the reexamination and, in some 
cases, modification of approximate formulas 
for evaluating functions (— 142 Evaluation of 
Functions). For familiar functions, such as 
tlogarithmic, *exponential, and ttrigonometric 
functions, tables have been almost completely 
replaced by function keys on electronic cal- 
culators. Microprogramming algorithms for 
obtaining values of these functions have also 
been devised [1]. A complex function-theoretic 
error-estimation method for use with numer- 
ical integration formulas is given in [2]. In 
this method, graphical outputs of the com- 
puter are utilized. For problems in which the 
existence and uniqueness of solutions have 
been established, as for algebraic equations 
and ordinary differential equations, fairly good 
numerical calculation methods and their error 
estimates have been established (— 301 Nu- 
merical Solution of Algebraic Equations; 

303 Numerical Solution of Ordinary Differen- 
tial Equations; [3,4]). A method for estimating 
arithmetic *accumulation errors for operations 
involving finite numbers of digits has been 
systematized (— 138 Error Analysis; [5]). 

It is not unusual nowadays for linear equa- 
tions with 10,000 or more unknowns to be 
solved. As new computing systems, such as the 
virtual memory system and the vector oper- 
ation system, come into existence, new al- 
gorithms are examined; a numerical method 
that is optimal for today's technology may 
well be suboptimal for the next generation 
of computers. General-purpose program 
packages of linear problems, including eigen- 
value problems, have been developed (— 298 
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Numerical Computation of Eigenvalues; 
[6]). 

Partial differential equations seem to have 
become more familiar because of the visualiza- 
tion of their numerical solutions in graphical 
computer outputs. In [7], which appeared 
much earlier than computers, partial difference 
equations for the fundamental linear prob- 
lems in mathematical physics were discussed 
with a suggested variational treatment (— 304 
Numerical Solution of Partial Differential 
Equations; [8]). The ffinite element method, 
which started as a calculation technique in 
structural mechanics and is based on the tcal- 
culus of variations, is widely accepted as an 
efficient approximation method for partial 
differential equations [9, 10]. The term tsimu- 
lation is used often to describe procedures in 
which partial differential equations describing 
time-dependent phenomena are discretized; 
the resulting difference system can be solved 
for long periods of time [11]. 

Numerical analysis has heretofore long been 
considered to be the only numerical method 
(for error analysis in particular), and has been 
carried out mainly by means of the discretiza- 
tion of equations. Nowadays, however, mathe- 
matical modeling, taking into account both the 
phenomena to be described and the capabi- 
lities of the computers to be used, has become 
an important numerical method. 
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A. General Remarks 


Methods for finding roots of an equation 
f(x) 20 (where f(x) is not necessarily a poly- 
nomial, but is assumed to be a function with 
some regularity) by numerical calculation can 
in general be divided into the following two 
types: (1) The first has as its goal the finding 
of good approximate values of the roots; 
examples are the Bernoulli method (Section J) 
and the Graeffe method (Section N). (ii) The 
second improves the accuracy of estimates of 
the roots; an example is the Newton-Raphson 
method (Section D). The methods belonging to 
(1) and (ii) can be used separately. However, in 
(i) convergence of the approximations may be 
excessively slow when a pair of nearly equal 
roots is present, and the size of numbers in- 
volved in the calculation may grow exponen- 
tially as we proceed through the iterations. In 
(ii) convergence of the approximate values is 
not assured unless the initial approximate 
value is suitably chosen. Accordingly, when an 
electronic computer is utilized, it is advisable 
to combine both types of method. Because of 
the development of computers, solutions with 
global convergence have become important 
[1215.17 21]. 


B. Successive Substitutions 


When an equation f(x) —0 is transformed into 
x = F(x) and the roots are obtained by iterative 
calculation of x;,, = F(x;) (1-0, 1,2, ...), suffi- 
cient conditions for its convergence are as 
follows: Let one of the roots of the equation be 
a. Then x;,, —«— F(x;) — a, and therefore (xX;+; 
— x)/(x, —a) = F'(Z) (x, <ë «a or xj» £» a). 
Accordingly, x converges monotonically if 0 < 
F'(£) « 1, while it converges with oscillation 

if 0> F'(&)> —1. If |F(5)|»1and F (x) de- 
notes the inverse function of F(x), the iteration 
Xj 4 =F !(x) converges. To define the speed of 
convergence by iterative processes the follow- 
ing notion of order is utilized: When lim, ,, x; 
=a, the speed of convergence of the sequence 
{x,} is said to be of the kth order if Im, (x;,, 
— a)/(x; a) =c #0. Necessary and sufficient 
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conditions for the speed of convergence of 
{x,;} to be of the kth order are a= F(a), F'(x) — 
F"(a)=...= F* ag) =0, F(a) 40. 

The main iterative processes used in numer- 
ical calculation are described in the following 
sections. 


C. Regula Falsi 


Regula falsi (or the method of false position) is 
a process of obtaining the real root « of an 
equation f(x) 20 by approaching the root 
from both sides. The calculation procedure is 
as follows: Assume that f(x,) 0 and f(x) «0, 
where x, and x, are approximate values of a 
such that « lies between them. A new approxi- 
mate value x is then obtained from x — (x, f(x,) 
=x pf ODMS G3) — f(x). E Diels. then x^ 
xy, and if f(x) «0, then x x,, and the pro- 
cedure is repeated (the symbol — means re- 
placement). The conditions and speed of con- 
vergence of regula falsi are as follows: Let F(x) 
— Go) — Xf (XQ) F(x) —f(xq)); then F'(a) = 
(f(x_)+(a—x,)f'(«))/f(x,). Y£ f' and f" exist 
and are continuous near a, then f(x) — f(a) 
(x,— 0) f (x) - (1/2) (x, — 2)? f "(£), where «> 

$7 X, Or a < « x,. Accordingly, assume 

that F'(x) - (1/2)(x, —a)? f "(£)/f(x,) x0. Then 

| F'(x)| € 1 can be satisfied if x and x, are near 
a. Therefore, if the initial value is appropriate, 
the convergence is of the first order. In regula 
falsi, only calculation of f(x) is necessary, while 
that of f'(x) is unnecessary. Furthermore, if 
f(x) -« 0 has nearby real roots, i.e., o, f, ... close 
to each other and f'(x) small near the roots, 
there can be no mistake such as the neighbor- 
ing root fj being obtained in the process of 
finding the root « by this method. This method 
belongs to the inverse linear interpolation 
method. This kind of inverse interpolation 
includes Muller's method [3], which uses 
*Lagrange's interpolation formula, the Torii- 
Miyakoda method [4], which uses tHer- 
mite's interpolation formula, and Whittaker's 
method [5], which uses 'Stirling's interpola- 
tion formula. These methods converge more 
rapidly but have more complicated formulas 
than inverse linear interpolation. The Sturm 
method (in which the interval where the roots 
exist is narrowed by tSturm’s theorem) and the 
Horner method (which obtains the decimal 
part digit by digit), both used to obtain the 
real roots of a high-order algebraic equation 
f(x) - 0, are also of this type. 


D. The Newton-Raphson Method 


In obtaining the real roots of an equation f(x) 
=0, the Newton-Raphson method (or the New- 
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ton iterative process), which converges rapidly, 
is used when f (x) Z0 is computable. Let x; be 
a sufficiently close ith approximation of the 
root o; then x;,, —x;— f(xj)/f (xj) is closer than 
x, to the true solution. The process is repeated 
until |x;,, — x;| is small enough. Conditions 
and the speed of convergence are as follows 
[6,7]: Let F(x) 2 x — f(xy/f (x); then F'(x) = 
Tel GMT GI and F'(x) «0. Accordingly, 
F"(o) #0 if f(x) 40 and f"(a) 40, and the 
convergence is of the second order if it is pos- 
sible to determine an appropriate approxi- 
mate value x; that is close to « and satisfies 

| F'(x;)| « 1. In particular, if f(xo)f (x9) 40, ho = 
— f(xoMf (xo). L/"()| & M, and | all > 
2|ho|M, then every Newton approximation 

x; starting with x, is contained in the interval 
I -[x, —|hgl, x, 4 |hg| ], the equation has only 
one root « in I, and x;—a. Besides, 


la—xiíl&MlIxi-xi-í 2f (x); i=1,2,.... 


With regard to convergence and evaluation of 
the error, including roundoff error in practical 
computations, M. Urabe's studies should be 
consulted [8,9]. In general, the convergence 
is of the third order in the Newton-Raphson 
method, which uses not only f (x) but also 
f(x) [10]. 

When f'(x) 20, we must assume f"(x) 40 
and use the value f"(x). With regard to this 
case, the study by S. Hitotumatu should be 
consulted [11]. In addition, W. Kizner (STAM 
J. Appl. Math., 12 (1964)) reported an itera- 
tive process in which the convergence is of the 
fifth order without using any derived function 
higher in order than f "(x). The essential part 
of his method lies in a numerical integration of 
the integral part by the *Runge-Kutta formula, 
based on the fact that if x, is the first approxi- 
mate value of a root x of f(x) —0, then 


9 dx 
x= —df +x. 
M l 


There may be several iterative processes for 
solving f(x)=0, even if the order k of the con- 
vergence is fixed. For example, in obtaining 
the positive root of f(x)=x? —a=0 (a>0), i.e., 
the square root x — a^, both iterative pro- 
cesses A = (x; a/x)/2 and x;y 22x? /(3x? — 
a) give convergence of the second order. In 
obtaining the real root of f(x) 2x? —a=0, i.e., 
the cube root x=a", x,,, =x; - (a/x2 — xj)/3 
converges of the second order, while x;,, = 
x;/2 - (a+ a/2)/(2x? + a/xj) converges of the 
third order. 

The Newton-Raphson method is applicable 
also to holomorphic functions of complex 
variables. 

Take simultaneous equations of two vari- 
ables, f(x, y) 20 and g(x, y) 2 0. If it is possible 
to transform the equations into x = ~(x, y) and 
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y —V(x, y), and if |óo/0x| + |OV/Ox| <1, |O«/Oy| 
t |Ov//Oy| « 1, then the iterative processes x;., 
= Q(x;, yj) and y;,, =W(x;, yj) are applicable. In 
the Newton-Raphson method, let the correc- 
tions to the ith approximate values x; and y; 
be Ax; and Ay;, and solve 


Ax yy Ax; + HX, y)Ayi— — f (xi, yi 


dubio yg Ax; G(X, y) Ay; = —g(X;, yi). 


Then we can take x;,, =x;+Ax; and yi; = 

y; + Ay; as the next approximate values. This 
process is repeated until both |Ax,| and |Ay,| 
are sufficiently small. More generally, the 
Newton-Raphson method for solving a system 


of n nonlinear equations f;í(x,, ..., x,) -0, i= 
1,2, ...,n, is defined by 
xD = x0. J(x*)-7!F(x9)  Kk-—0,12,..., 


where x — (xf9, ..., x9), F(x) e (fi(x,, ..., X4) 
tx fix s. sie ES J(x) =(6f,(x,, Kä ,XyJ/0x;) 
(the ‘Jacobian matrix of EL and where x is 
chosen appropriately [23]. 


E. The Bairstow Method 


Corresponding to a pair of complex roots «+ 
if of an algebraic equation f(x) -agx" --a,x"- 
+...+a,=0 with real coefficients, there is a 
real quadratic factor x? + p*x +q*. The Bair- 
stow method (or Hitchcock method) obtains 
the coefficients p* and q* of this quadratic 
factor. 

To do this, first choose an appropriate qua- 
dratic factor x? + px +q as a candidate. Then 
through synthetic division by the quadratic 
factor, b; and c; are computed by means of 
the formulas 


1 


b; — aj— pb, — qbj.?, j=0,1, H. 
and 
cj7bj—pej,—46j-3, j=9,1,...,n-1, 


where 
b.2b.,-6.,26.,-—0. 
By solving the simultaneous equations 
C -2Àp +¢,-3Aq=b,-1, 
Cr-1 Ap +c,-2Aqg=b,, 


where 


Cn-1 = C4-1 TB 


the quantities Ap and Aq are obtained. Then 
taking p=p+ Ap, d —q- Aq, we have x? + 
px + d as a new candidate. This operation is 
repeated until Ap and Aq are sufficiently small. 
Since this method corresponds to the Newton- 
Raphson method in the case of two variables, 
and, accordingly, the convergence is of the 
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second order, a choice of suitable initial values 
of p and q leads to rapid convergence. The key 
to this method lies in choosing p and q so that 
Rip. q) - 0 and R;(p, q) 20, where R, and R, 
are such that R,x+R, is the remainder of f(x) 
divided by the trial quadratic factor x? + px + 
q. This method was generalized by A. A. Grau 
(SIAM J. Appl. Math., 11 (1963)). Namely, 
when f(x) 2 (x? + px + g)g(x) 4 r(x) with r(x)— 
r,x**! 4 r5x*, the functions r; and r, can be 
used instead of R, and R,. The process cor- 
responds to the Bairstow method when k — 0, 
and to the McAuley method (SIAM J. Appl. 
Math., 10 (1962)) when k=n—2. 


F. The Durand-Kerner Method 


The Durand-Kerner (DK) method [12] for 
solving an algebraic equation f(z) 2z" - a,z" ! 
+... 4,70 (a, 40) with complex coefficients 
is an iterative method defined by 





(k) 
il z 09. - f(zi d d 
ID an (2 ?—zj ) 
Plies k=0,1,2,.... (1) 


A feature of this method is that it can deter- 
mine simultaneously all the roots of f(z) 20. 
Let $,(z,, ...,z,), m— 1,2, ..., n, be the mth 

elementary symmetric functions with respect 


tO Z,,...,Z,: 


E MER NE nm 


B K S m 
HQ E15 € ... Xl 


Set f, (z) - (—1)" o,(z,,...,z,) —a,. Then «= 
(x,,...,%,)', a set of the roots of f(z)=0, is a 
solution of a system of the n equations f,(z) — 
0, m— 1,2, ...,n. I. O. Kerner [12] showed 
that the DK method is the Newton-Raphson 
method applied to the system of nonlinear 
equations f, (2) — 0, m 1,2, ..., n. Therefore 
the speed of convergence of (1) is of the second 
order, provided that it converges. The initial 
values 2%, ..., z(9 for the DK method are 
usually chosen as follows: Let g(z) = f(z — a,/n) 
=2z"+¢,2" ? +... +c, and h(z) «z"— |e,|z"? 
—...—|c,|. If (65, ...,c,) (0, ...,0) (nz: 2), then 
it can be shown that h(z)=0 has exactly one 
positive root r, and all the roots of f(z) —0 lie 
in the disk [z - a,/n| x r. Now, with a positive 
ro Zr and 0 — n/(2n), put 


2(i—1 
P= Zuel C "eoi | 
n n 


jd axo (3) 





Such a choice was proposed by O. Aberth 
[13]. Hence the process (1) together with (2) 
can be called the Durand-Kerner-A berth 
(DKA) method. It is shown for the DKA 
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method that, if ry is large enough, then 


a (le a 
ol (2+5), i=1,2,...,n, 
n n H 


hold nearly for the first several steps. Thus 
the DKA method has a certain kind of global 
convergence property that renders it one of the 
most powerful methods for solving algebraic 
equations. 

A variant of the Durand-Kerner method is 
the Ehrlich-Aberth method [13, 14]; it is de- 
fined for i2 1,2, ...,n, k=0, 1, 2, ... by 


P ED 


m3 fict 
TF PYF) Ea jes Me — 2)" 


The speed of convergence of this method 

is of the third order. Further variants of 

the Durand-Kerner and the Ehrlich-Aberth 
methods have been proposed by M. Iri et al. 
[15] and A. W. M. Nourein [16]. 





G. The Dejon-Nickel Method 


Let f(z) be a polynomial of degree n with 
complex coefficients, and take zy such that 
J (Zo) € 0. Then we can write 


f(zo +h) 2 f(zo) 14+ bh! +b hi +... +b,h™ 
(b; #9) 


= f(zg) (1 - bh (1--0), | 0—o(h). 


Now, choose h sufficiently small so that |b,h'| 
<1, [0| « 1, and arg(b;h") 2 x. Then |1 + bjh'| — 
1 —|b;h'| and |b;h'0| < |b;h'| so that | f(zg +h)| 
«|f oM + bh] AIR <| f(zo)]. Hence, 
if zy is chosen so that min| f(z)| 2| f(z), 

then we must have f(z;) —0. This is the out- 
line of Cauchy's existence proof for the roots 
of algebraic equations. The Dejon-Nickel 
method [17] is a method in which h is chosen 
as follows: 


h=( —1/b,)""*, 
where 


min (|1/b]'7 |b; Z0, ij &n] —|t/b,|"* 
(=r, Say). 


If several such k exist, take the smallest one. 
The branch of the multivalued function 

( — 1/b,)!* is chosen such that z! is positive for 
z positive. If | f(zg - h)| <(1 — e)| f(z )| with a 
preassigned constant e such that 0 «e « 1, then 
put Z; —zo-- h. If the inequality does not hold, 
then for each integer mz 1 choose the small- 
est integer l= l(m) such that max ( |bj|(r/2") | 
i<j<n\=|b|(r/2"), and put Z, = Zo + (r/2") 

( ^ |bj/b). Find the smallest integer mz 1 such 
that | f(zo)| «(1 —2 ! (r/2"/|b)] tel, and put 
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2; —Z,. By continuing this process, a sequence 
{z;} is constructed such that | f(z)|>|f(z,)| > 
| f(z)|> .... It converges toward some root 

of the equation f(z)=0. S. Hirano has pro- 
posed a similar method. 


H. Methods for Finding Good First Estimates 
of Roots 


Some of the principal methods for obtaining 
good first approximate values of the roots are 
given in the following sections LN. 


I. Matrix Methods 


The problem of solving an algebraic equation 
f(z) oz" c a,z" Tt a, 0 with complex 
coefficients is equivalent to that of finding the 
eigenvalues of the companion matrix 


—0, ~a sa — apn- 70g 
| 0 0 0 0 
A=| 0 1 0 0 0 
0 0 0 1 0 


Therefore numerical methods for solving the 
eigenvalue problems for nonsymmetric ma- 
trices are applicable (— 298 Numerical Com- 
putation of Eigenvalues). However, it should 
be remarked that this might be inefficient 
because most of the elements of the matrix A 
are zero [18]. 


J. The Bernoulli Method 


In the Bernoulli method the iterative formulas 
S1 = —a,, S,— —(4,S,-, c a58,- 5... 8,8, 
+ka,) (k=2,3,...,n), S= —a, 8$, ,—a58$,.,— 
-e Sy, (k— n T n 2,...) are repeatedly 
applied to an algebraic equation f(x) — x" t 
a,x” 3 ax" ?+...+a,=0. When the roots 
of f(x) - are à,,25, ..., a, (la,|l 21x 5| 2 ...2 
lanl) $,/8,-, 2, If a, is a real and simple 
root and |o, | » |a; |. When «,, x; are complex 
roots, we put a, = Re”, a, = Re. If |a3|<R, 
then we have 


S — Spat Ska 


2 3R, 
Si-i— 5$ 3,7 


NET TEE 


S —2R cos 0, 
Sk-1 — 9.2 


and hence the two roots of x? — (2R cos 0)x + 
R? =0 are «, and a. S, is the sum of the kth 
powers of the n roots of the equation, «,, >, 
<., æ. C. Lanczos used xf (x)/f(x) 2 n-- S,/x + 
S,/x?+...4+8,/x*+ ... to compute S, [10, 

pp. 26-30]. Let the eigenvalues of the com- 
panion matrix A be 44,45, .... An ([A4,|] 2|45| Z 
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... 2 |4,]). Then the Bernoulli method is the 
same as the tpower method for obtaining the 
maximum characteristic value 4, (— 298 Nu- 
merical Computation of Eigenvalues C). 


K. The Lehmer Method 


The Lehmer method [9] is a general method 
for finding the n roots of an algebraic equation 
with complex coefficients 


f(z)-ag- a,z 4 a4z? +...4+4,2"=0 


in the complex plane by repeating the follow- 
ing procedure: First, draw a circle whose cen- 
ter is at the origin and whose radius is R — 

r2 5, where r is an arbitrary given number 
and @ an arbitrary given integer. Then, utiliz- 
ing thc Lehmer theorem, observe whether a 
root x of f(z)=0 lies inside the circle. If there is 
no such root, replace R with 2R, whereas if 
such a root exists, replace R with R/2. Con- 
tinue this process until an R for which a root 
x exists within the annulus R <|z|<2R is 
obtained. 

Second, draw circles with the centers fj, — 
(SR/3)exp(i2nk/8) (k =0, 1,..., 7 and com- 
mon radius p=(5R/3)/2 and cover the annulus 
obtained by the first step. Then find out which 
of the circles has the root o in its interior. If 
a is in the interior of the circle for k=j, the 
origin is shifted to f; and the operation is 
started again from the first step. If R, satisfy- 
ing R, « |z — f| « 2R, is obtained by the first 
step, we have R, x (5/12) R. Therefore, when 
the first step is repeated N times, the root « 
is confined in a small circle whose radius is 
smaller than 2(5/12)"R. Then the center f) of 
this small circle gives a good approximate 
value of a. 


L. The Downhill Method 


The downhill method is a method for obtain- 
ing the extreme values of a function of many 
variables. It is also applicable in obtaining 
approximate solutions of a system of equa- 
tions. Let us consider the downhill method in 
the case of two variables. The problem of 
obtaining the real roots of simultaneous equa- 
tions f(x, y) 30 and g(x, y)=0 can be reduced 
to the problem of obtaining the coordinates 
(a, B) which give the extreme value 0 of D(x, y), 
where (x, y) f? - g?. The values of ®(x, y) 
are calculated at 3? points obtained by the 
combination of x 2x;, x, +h; y- y,, yh, 
where (x,, yı) are arbitrary approximate values 
of (a, fl) constituting the centers of those sets of 
points, and h is the given step size. Utilizing 
the values of the function at the 3? points, 
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(x, y) is approximated by a quadratic surface 
bot b x by b, x?—2) 
+ b,,(03y? —2)4- b, 5 xy. 


The values of bp, bi, b2, by, 622, b, 5 are cal- 
culated by the ‘method of least squares, and 
the center (x*, y*) of the approximate quadra- 
tic surface is obtained. Then the first approxi- 
mations x, and y, are replaced by the sec- 
ond approximations x, +x¥ and y, + yf. This 
process is repeated while the step size h is 
made appropriately smaller. This method is 
an improvement on the method of successive 
experimental planning used by G. E. P. Box 
and K. B. Wilson (1954) to obtain optimum 
conditions in the exploration of response 
curved surfaces. 

By using another minimization technique, 
J. A. Grant and G. D. Hitchins [21] gave an 
ever-convergent algorithm for determining 
initial approximations to the roots of algebraic 
equations with real coefficients. The prac- 
tical implementation of this method is given in 
J. A. Grant and G. D. Hitchins (Comput. J., 
18 (1975)). 


M. Continuation Methods 


Let F(x) 2 (f169, ..., fi 60) 2 0 (x m (xs, -.-, X,))! 
be a system of n equations. Suppose that no 
reasonable approximation for a solution exists. 
Then, take arbitrary x and define a one- 
parameter family of equations 


H(x.t)sF(x)-(0-)F(x9)20, Oxt«l. 


(3) 


Suppose that for each t the equation has a 
solution x(t) which depends continuously on 

t. Observe then that x(0)=x and x(1) is a 
solution for the equation F(x)=0. Partition 
the interval [0,1] by the points O=t)<t, <... 
<t =L. First, solve H(x,t,)=0 by some itera- 
tive method using x as a first approximation. 
Let x be a solution thus obtained. Next, 
solve H(x, t,)=0 by some iterative me Uu) 
using x" as a first approximation, and so on. 
Finally, solve H(x, ty) 20 by some iterative 
method which uses x" ^" as a first approxima- 
tion. Then xf", a solution thus obtained, can 
be used as a first approximation in an itera- 
tive method applied to the equation F(x) — 0. 
This method is called a continuation method 
[22-24]. 

As another approach, differentiate (3) with 
respect to t. Then J(x(t))x'(t) + F(x) «0, 
where J(x(t)) is the Jacobian matrix of F, 
evaluated at x= x(t). Hence x(t) is the solution 
of the system of ordinary differential equations 


il —J(x()) TF), O<t<], 
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subject to the initial condition x(0)= x, pro- 
vided that J(x(t)) is nonsingular. Therefore 
numerical solution at t= 1 gives a good ap- 
proximation for a solution of F(x)=0. This is 
called Davidenko's method of differentiation 
with respect to a parameter. These methods 
can be used to find initial approximations of 
the Durand-Kerner and the Ehrlich-Aberth 
methods. They are also applicable to a single 
algebraic equation f(x) —0, which is the special 
case n — 1. 


N. Other Methods 


To obtain the first approximate values of the 
roots of an algebraic equation, the Graeffe 
method has been used, in which the roots of 
the equation are separated by successively 
forming an equation whose roots are the 
squares of the roots of the preceding equation. 
In computer calculations, however, the other 
methods described in previous sections are 
more convenient than the Graeffe method. 

The Lanczos method [10] is a method in 
which g(x), an approximate function of f(x), 
is obtained by the process of approximating 
functions, and the roots of g(x)=0 are taken 
as approximate values of the roots of f(x) — 0. 
The Garside-Jarratt-Mack method [25] is a 
modification of the Lanczos method and ap- 
proximates f(x)/f (x) by a rational function 
g(x) =(x — ay(b + ex). 


O. Error Bounds for Computed Solutions 


Let z,,..., z, be computed solutions of an 
algebraic equation f(z) 2 0 which were ob- 
tained by some method. If z,, ...,z, are dis- 
tinct, then the following result due to B. T. 
Smith [26] is quite useful for estimating the 
errors of z;: Let 


[z-z|&y- n| f(z] 


II, ;«ilzi— zl 


Then the union of the disks T; contains all 

the roots o, ...,, of f(z) 2 0. Any connected 
component of | IS, I, which consists of just 
m disks T;, contains exactly m roots of f(z) — 
0. Hence if I;'1I;— Ø for every j i, then |x; — 
zj| € y. Smith obtained a more general result 
for the case where z,,...,z, are not necessarily 
distinct. 
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302 (XV.4) 
Numerical Solution of Linear 
Equations 


A. Condition of Linear Systems 


The solution of the system of linear algebraic 
equations 


b, real, (1) 


n 
Y ayx,=b; i=1,...,n, aij, 
ji 


which may be written in the matrix form 


Ax — b; A — (aij), x — (Xj), b — (bj), (1) 


is expressed as quotients of determinants by 
*Cramer's rule. In practice, however, this form 
of the solution is of little value for numerical 
computation, because the direct evaluation of 
the determinants involves (n+ 1)! (n — 1) multi- 
plications which, even for a moderate-sized 
system, amount to a prohibitive number of 
arithmetic operations to be executed, even 
with modern high-speed computers; besides, 
it requires high-precision calculation. There 
are a variety of practical methods of solving 
efficiently the system (1) with finite-precision 
calculation. These numerical methods are 
generally divided into two classes: direct meth- 
ods and iterative methods. 
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Whatever method is used, inherent diffi- 
culties are encountered when the solution of 
the system is unstable. The instability is usu- 
ally measured by the condition number of the 
coefficient matrix, which is defined by 


condA=o,/a, (=|All||A7'I 21), (2) 


where the 6; (o, 2605 2 ... 20, 20) are the non- 
negative square roots of the 'eigenvalues of 
A'A (called the singular values of A), || A|| = 
max {|| Ax||/l|x||:x #0}, and lix ie, This 
definition is valid also for an m x n matrix A 
(mz n). The condition number cond A satisfies 


oxi , — condA4 (ea A) 
Ixl ôA | Al — qp 7? 
————— CO 
li Al 








nd A 


where Ax —b, (A+ 6A)(x + óx) 2 b + ôb, and 
|S Ajj iA ! || <1. As the condition number 
increases, solution processes become more 
susceptible to errors. If cond A is large, the 
system is called ill-conditioned. 


B. Direct Methods 


A direct method is one that yields an exact 
solution in a finite number of arithmetic oper- 
ations if they are performed without roundoff 
error. Among the existing direct methods, the 
one known as Gaussian elimination with pivot- 
ing, which is based on systematic elimination 
of unknowns of the equation (1), is found to be 
the best with respect to time or accuracy. Its 
dependability was re-established by means of 
backward error analysis (— 138 Error Analy- 
sis; [1-3]). The method generates successive 
vectors NN! —(b*) and matrices A? — (aj), typi- 
cally of the form 


AP = 


CO CO CC zs 
CO CC * x 
* * ox Xo X 

* X X X 
* * X X OX 


* 


when n — 5 and k=2. At the kth stage, a pivo- 
tal element aj; ! 40 (i, j> k) is chosen in one 
way or another. Then the ith and kth rows 
and the jth and kth columns of AT are 
interchanged so that aj ! becomes aj,. The 
ith and kth elements of b*~" are also inter- 
changed. The pivot aj, is then used to elimi- 
nate all the nonzero entries in its column 
below the diagonal as follows: 

k k-1 


j^74i > 


Db t i=1,...,k; 


a i-1,..., k, j2L..., n 


= (ai, | / aj.) ass, 
i=k+1,...,n, j=k,... 1; 


MH iaa, i=k+1,...n. 
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Starting with A® = A and b® =b, a system 
of linear equations with an upper-triangular 
coefficient matrix, 


=p bel 
ACID = pino 


is produced at the stage k—n— 1, where x* is 
a permutation of x, caused by interchanging 
columns. This part of the process is called 
forward elimination. If a5! =0 for all i, j>k at 
some stage, then A is a singular matrix of 
trank k — 1 and the system (1) admits infinitely 
many solutions if b*^! =0 for i=k,...,n, and 
no solution otherwise. If this is not the case, A 
is a nonsingular matrix with det A —a7,'a5;! 
...a"™ +, and the solution of (1) is given by 


n 
k n-1 be WE? n-1 
x# =| bo — Y aik Xk II dij 
k=i+1 


for i=n,n—1,...,1, which is called back sub- 
stitution. Taking the pivot to be an element 

of the largest absolute value among column 
elements a$ ! (iz k) at each stage is called the 
partial pivoting strategy. In complete pivoting, 
the pivot is taken to be an element of the 
largest absolute value among aj; ! (i, j> k). 
These pivotings are introduced to prevent loss 
of accuracy due to rapid growth of elements of 
successive A. Although a smaller bound for 
the growth factor is obtained for complete 
pivoting, in practice partial pivoting appears 
to be entirely adequate. 

If rows or columns are not interchanged in 
the process, forward elimination effectively 
produces a factorization of A into the product 
of a lower triangular matrix L and an upper 
triangular matrix U, i.e., 


A-LU, (3) 


where L has unit diagonal elements and U — 
A“), This factorization is computed directly 
by the Doolittle method without calculating 
the intermediate A“. The Crout method also 
produces a similar factorization (3) in which 
U has unit diagonal elements. The Cholesky 
method determines a similar factorization (3) 
of a tpositive definite matrix A, in which U = 
L'. Once the triangular factorizations (3) are 
formed, the solution of the system (1) is deter- 
mined by solving the two triangular systems 
Ly — b and Ux = y successively. 

The number of multiplicative operations 
required for these factorizations are about n?/3 
for forward elimination, Dootittle’s method, 
and Crout's method, and about n?/6 for Cho- 
lesky's method. The solution of each triangu- 
lar system requires about n?/2 multiplicative 
operations. Special properties of A, such as a 
banded structure, can be exploited to reduce 
the number of operations and memory re- 
quirements considerably [4]. Gauss-Jordan 
elimination is similar to Gaussian elimina- 
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tion, but the elements above the diagonal 

are also eliminated to dispense with back sub- 
stitution. However, it requires about n?/2 
multiplications. Modern computers can easily 
handle problems of size n — 100 by these direct 
methods. 


C. Iterative Methods 


An iterative method is a dynamical process 
that generates a sequence of approximations 
(x*! converging to the exact solution. At each 
step of the iteration, an improved approxima- 
tion is obtained from the previous ones. The 
accuracy of the solution depends on the num- 
ber of iterations performed. Most iterative 
methods retain the coefficient matrix in its 
original form throughout the process and 
hence have the advantage of requiring minimal 
memory. They are suitable for solving large 
sparse systems arising in finite-dimensional 
approximations to ‘partial differential equa- 
tions, where A is sparse if most of its elements 
are zero. 

Linear stationary iterative processes are the 
most frequently used iterative methods. A 
method in this class is written in the form 


x*ax*! Bib Ae k= 1,2,..., (4) 


where R is chosen to approximate the inverse 
of A. Usually, R and b — Ax*~! are not ex- 
pressed explicitly in the actual algorithms. If 
the tspectral radius of the iteration matrix | — 
RA is less than one, the method converges 

to the solution for an arbitrary initial ap- 
proximation x° [5]. The key matrix R can be 
chosen quite freely as long as thts condition 
is satisfied. In the Richardson method, R = 
aA‘, 0a «2/] A]?. R=(A+E) is the usual 
choice, in which E is a perturbing matrix that 
makes A + E easily invertible. In the Gauss- 
Seidel method and the Jacobi method, R is 
chosen to be the inverse of the lower triangular 
and the diagonal submatrices of A, respec- 
tively, ie, R-(L-- D) ! and R=D ', where 
L —(a;) (i7 j) and D —(aj). 

Direct methods can be combined with the 
iterative method (4) to obtain an approxi- 
mate inverse R. In the course of factorization 
by a direct method, an artificial perturbation 
E is introduced to produce an incomplete 
factorization, 


LÜ —A- E, (5) 


so that £ and U are of low tcomplexity of 
computation. A combined method is then 
constructed by putting R= Ü ! L^. Even if 
E is not added intentionally, each of the di- 
rect methods actually produces an incomplete 
factorization (5), in which E is a matrix of 
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small elements accounting for the effect of 
roundoff errors [1—4]. In this case, the com- 
bined method is called the iterative improve- 
ment of the direct method. If applied to a first 
solution x? = 0 ~'E~'b, it produces more accu- 
rate solution in a few steps with only a mod- 
est increase in computation time, when the 
system ts not too ill-conditioned [4]. It is 
essential, however, that the residual b — Ax? be 
computed with higher precision. 

Various convergence criteria have been 
established for a variety of methods [5,6]. For 
example, the Gauss-Seidel method ts conver- 
gent for a symmetric positive definite matrix A. 
The smaller the spectral radius, the faster the 
convergence of the method. In general, a larger 
amount of computation is required in each 
iteration to get faster convergence. If the spec- 
tral radius ol — RA) is close to one, the con- 
vergence is slow, and an acceleration of the 
process is needed. SOR (successive over- 
relaxation) is an accelerated version of the 
Gauss-Seidel method, in which R,,=(L+ 
w ! D)! and the optimum acceleration 
parameter c (0 <œ < 2) is chosen to minimize 
of — R, A) (^ 304 Numerical Solution of 
Partial Differential Equations; [6]). There is 
an adaptive acceleration method [7], which, 
if applied to a scalar sequence, reduces to 
Aitken's ó?-method. 


D. The Conjugate Gradient Method 


The conjugate gradient (CG) method is a non- 
linear stationary iterative method for solving 
a system with a symmetric positive definite 
coefficient matrix. The method generates (x*J, 
{r*}, and {p*} by means of the formulas 


o, = (ry. PLP, Ap"), 

xt ext ap", 

rtt po Ax" =r*— g, Ap*, (6) 
f, = (7, re (rk, r”, 

p a 1 nk, 


where x? is arbitrary and p? =r° =b — Ax?, 
(r, s) 5 s'r. 

The CG method shares a feature with the 
direct method. In theory, {x*} converges to 
the solution in less than n steps, and the pi 
are mutually conjugate, i.e., (p/)'Ap'=0 (iz j). 
When Hestenes and Stiefel [9] proposed the 
method in 1952, they created a great sensation 
because of the method's theoretical elegance. 
The CG method turned out, however, to be 
highly sensitive to roundoff errors. In practice, 
nice theoretical properties, such as finite ter- 
mination, do not hold in the presence of error. 
Recently, the CG method has regained its 
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popularity as an iterative method for solving 
large sparse systems. The iteration (6) is usu- 
ally restarted periodically to accelerate the 
convergence. It may converge in a small num- 
ber of steps, as it takes advantage of the distri- 
bution of eigenvalues of A adaptively. The CG 
method is most effective when it is used to 
accelerate linear stationary iterative methods 
or when it is applied to the preconditioned 
system 


EAE 'yy-L b, x=(L +}y, (7) 


where L is computed by the incomplete Cho- 
lesky factorization LE! A + E. The matrix 

L^ A(L y is not to be formed explicitly in the 
CG method. 


E. Linear Least Squares Problem 


Let Ax — b be an overdetermined system of 
linear equations, where A is an m x n matrix 
(m z n). The linear least squares problem is 

to find the x that minimizes the Euclidean 
norm ||b — Ax||. We assume here that A is of 
rank n. The most straightforward method for 
solving the problem is to apply the Cholesky 
method to the normal equation 


(A'A)x = A'b. (8) 


This may result in ill-conditioning of the sys- 
tem, since cond A'A — (cond A)”. Ill-conditioning 
can sometimes be avoided by forming the 
factorization A = LU by Gaussian elimination, 
where L is an m x n lower trapezoidal matrix 
and U is an upper triangular matrix. The 

least squares solution is then obtained by 
solving successively the systems L'Ly = L'b 
and Ux= y. 

Another approach to avoiding ill- 
conditioning is based on orthogonalization. 
The modified Gram-Schmidt orthogonaliza- 
tion method produces in an effective manner 
the factorization 


A —QU, (9) 


where Q is an m x n matrix whose columns 
are a set of forthonormal vectors and U is an 
upper triangular matrix [10, 11]. The least 
squares solution is obtained by solving Ux — 
Q'b. A sequence of Householder transforma- 
tions, P, = I — 2w,wt/||w,||?, can produce the 
factorization 


U 
RP. PiP A=), (10) 


where U is an n x n upper triangular matrix 
[12]. The least squares solution is obtained by 
solving Ux — b, where b is formed by the first 
n elements of the vector P,P, , ... P P,b. A 
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similar factorization can also be produced by 
using a sequence of Givens transformations, 
which have the advantage of exploiting the 
sparseness of A. The matrices U in (9) and (10) 
are unique and coincide with the transpose L' 
of the lower triangular matrix L of the Cho- 
lesky factorization, A'A = LU. of A'A. The 
multiplicative operations required for the 
factorizations (9) and (10) are about mn? and 
mn? — n?/3, respectively. 

When the rank of the matrix is unknown, 
we can determine an "effective rank" p of A, 
based on the singular value decomposition 
(SVD): 


A-UDY', 


where U and V are torthogonal matrices of 
dimensions m and n, respectively, and D is an 
m x n diagonal matrix whose diagonal ele- 
ments d are the singular values o; of A [13]. If 
singular values smaller than o, can be ignored, 
the approximate least squares solution is given 
by 2? = VD; U'b, where the ith element d? of 
the diagonal matrix D; is o; | ifi<p and 0 
otherwise. 

These methods certainly solve the system (1) 
but generally require more computational 
work than those based on the triangular fac- 
torization. The least squares solution is com- 
puted also by applying the CG method (6) to 
the normal equation (8), in which A'A is not to 
be formed explicitly [9]. Iterative methods for 
solving linear systems with singular and rect- 
angular coefficient matrices are characterized 
in terms of the trange of RA and the null 
space of AR [15]. 

Besides the methods discussed here, numer- 
ous other methods have been proposed for 
solving the system (1) [16—18]. Recently, vari- 
ous sparse techniques have been developed for 
direct methods to control the growth of non- 
zero entries (fill-in) in thc process of matrix 
factorization [19]. In general, direct methods 
have an advantage over iterative methods; 
however, their relative computational effi- 
ciency varies according to the scale, sparsity, 
and type of the coefficient matrix and to the 
available computational devices. 
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Numerical Solution of 
Ordinary Differential 
Equations 


A. General Remarks 


Applications of the numerical solution of 
tordinary differential equations include *initial 
value problems, *boundary value problems, 
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and 'eigenvalue problems. Although solution 
by an fanalog computer is among the numer- 
ical methods in the wide sense, we usually 
mean by "numerical solution" a solution ob- 
tained by approximating a problem of infinite 
degrees of freedom and a continuous variable 
by one of finite degrees of freedom. The ap- 
proximation of an arbitrary function by a 
linear combination of a given finite system of 
functions is an example (— Section I). Succes- 
sive substitution and power series expansion 
are also used but have no particular advan- 
tages over other methods. Among various 
methods of approximation, the so-called dif- 
ference methods (or discrete variable methods) 
are the most flexible and have the largest field 
of application. A difference method reduces a 
given problem to an approximate problem 

in which we deal only with a systematically 
chosen set of discrete values of the variable 
and with values of the unknown functions 

at the chosen points. A difference method is 
usually carried out by a simple iterative com- 
putation, so that it is suitable for 'digital com- 
puters. We confine ourselves almost exclu- 
sively to difference methods [1-7]. 


B. Initial Value Problems 


To solve numerically general initial value 
problems, it suffices to consider the problem 
of determining numerically the values on an 
interval [a,b] of m functions y'(x) (i— 1, ...,m) 
that satisfy a system of ordinary differential 
equations of the form y"(x) — f (x, y! (x), ..., 

y" (x)) and the initial conditions y'(a) 2 Hl (i= 
1, ...,m), where the f? are given functions with 
appropriate smoothness, and a, b, and ai are 
given constants. We define the mesh points 

x, =a+nh (n—0, 1,...), call h the step size, and 
determine numerically the values y? approxi- 
mating y'(x,). 

Assuming that the yl were computed with 
infinite accuracy, i.e., without troundoff error, 
we call ei = yi — y'(x,) the global truncation 
error (or global discretization error), while 
we call ria ji — yi the global roundoff error, 
where the ĵi are the values we actually have 
by means of finite-accuracy computation. On 
the other hand, if the assumption holds locally 
that the solutions at the previous steps are 
exact, we call the el and rj the local truncation 
error (or local discretization error) and the 
local roundoff error, respectively. To avoid 
confusion, we denote the local truncation error 
by ti. The numerical solution must have the 
property that, for every xe[a, b], lim, ap, 
— 0, which is called the condition of conver- 
gence. In particular, if ti = O(h?*!) (x, 2 x, h» 
0, where O(h?*') is Landau's symbol), then p 
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is called the order of the solution process. 
We shall show the typical methods of numer- 
ical solution, using the abbreviation LU for 


TG y). 


C. Overall Approximation 


If we rewrite the differential equations on 
the interval [x,, Xn+p] as y'(x,,,) — y'(x,) = 
Fn f'(x, yi(x)) dx, q— 1, ..., P, and substitute 
suitable ‘numerical integration formulas for 
the integrals on the right-hand side, then we 
have the overall approximation formulas: 


P 
Vira Yam h 9, Co ins q=1,...,P, 
r=0 


from which we can obtain yi,,,..., yl. p when 
the yi are given. Since f/, , contains yj ,,, these 
equations are nonlinear in y;,,, so that we 
have to use, for example, the iterative proce- 
dure of starting from suitable Oth approxima- 
tions and adopting as the /th approximations 
of the y;,, values computed from the overall 
approximation formulas by substituting the 
(I— 1)th approximations for the y/,, in the fi, , 
on the right-hand sides. These formulas are 
often used to determine a set of starting values 
for a multistep method. The truncation error 
of the formulas depends on that of the numer- 
ical integration formula substituted for the 
integral. A few examples: P= 1, (c,9,c,,)— 
(1/2, 1/2) (error h? yiP(x,)/12 + O(h*); P= 

2, (€19. C11. C12) 7 (5/12, 8/12, —1/12) (error 
—h* y" (x) OI (659, Can, 055) = (1/3, 4/3, 
1/3) (error hë iz) + O(h9); P—3, (C10; C11 
€12,13) 7 (9/24, 19/24, — 5/24, 1/24), (650, €21, 
C22, C53) 7 (1/3, 4/3, 1/3,0), (639,651, 035, 033) = 
(3/8, 9/8, 9/8, 3/8). 


D. Runge-Kutta Methods 


By a general Runge-Kutta method we mean a 
method of determining y — 5, yi, y5,... SUC- 
cessively by means of a formula yi,, — yi = 
h®'(x,, y]; h), where the functions d are de- 
fined in terms of parameters «, and f,, as di 
(x, y^; h) e XL so, ki, kà = f'(x, y?) ki= f(x 
Boh, y! + h(Bjo — XT f) kh ha. Bk) (r= 
L,..., P. A Runge-Kutta method is called ex- 
plicit if 5,, —O for r <s, and implicit otherwise. 
In the latter case, if f,,—O0 for r <s, the method 
is called semi-explicit (or semi-implicit). Unless 
otherwise stated, the Runge-Kutta methods 
considered below are assumed to be explicit. 
In order to proceed one step from yi to yi, 
we have to compute each function f' (P 4 1) 
times. Hence this is called the (P + 1)-stage 
method. If we denote by z'(t) the solutions 


303 D 
Numerical Solution of ODEs 


of z"(t) « f (t, z/(t)) with the initial condi- 
tions z'(x) Vi and put hA'(x, y^; h) - z(x +h) — 
z'(x), then we have ®'(x, vi h) — A'(x, y^; 1) = 
h? o'(x, y+ O(h?*!), where the (x, y} are 
expressible in terms of the f‘(x, y) and their 
(partial) derivatives. Various Runge-Kutta 
methods have been devised by searching for 
values of the o, and £,, that make p as large 

as possible for a given p (p is called the order 
of the method). When searching for these 
values of o, and f,,, we usually impose at least 
one of the following conditions: (1) «, and H. 
are simple; (2) the truncation error is small; (3) 
the region of absolute stability (— Section G) 
is large; (4) a, and H. give smaller roundoff 
errors; (5) only a small computer memory is 
required. The ordered pairs (P, p), where p is 
the highest order that can be attained by the 
(P + 1)-stage method, are (P <3, P+ 1), (4, 4), 
(5, 5), (6, 6), (7, 6), (8, 7), (9-10, 8), (p-- 2 < (P + 
1) « (1/2) (p? - 2p +4), p 2 9). The following 
formulas are frequently used since they are 
accurate and have simple parameters: for P — 
1 and p=2, the formulas with a) —0, «, — 1, 
Bio =1/2 (modified Euler method), with œ = 
xı — 1/2, fio = 1 (improved Euler method) and 
with xo = 1/4, «, — 3/4, f, — 2/3 (Ralston's 
second-order method); for P=2 and p=3, 

the formulas with a) — 2/9, x, = 1/3, x> — 4/9, 
Bio =1/2, Bro =3/4, fl, , — 3/4 (Ralston's third- 
order method), and with xy = 1/4, x, =0, à, = 
3/4, Bio =1/3, Bro = 2/3, B; , — 2/3 (Heun's 
third-order method). For P =3, p=4, the for- 
mula with au =a; = 1/6, 7, =% = 1/3, Bio = Dao 
= Bus UU, B30 = B4, 1, B3,=0 is well known 
and is frequently referred to as "the fourth- 
order Runge-Kutta method" or "the Runge- 
Kutta method." It has various desirable fea- 
tures [8]. Gill's modification of the classical 
Runge-Kutta method (sometimes called the 
Runge-Kutta-Gill method) has some advantage 
in regard to roundoff errors and to the neces- 
sary computer memory size, while Ralston's 
second-, third-, and fourth-order methods have 
minimum error bounds in Lotkin's sense for 
the local truncation error [7]. There exist 
"substantially fifth-order" methods for P—4 
[9]. Today we frequently prefer to use higher- 
order methods, especially fifth-order methods, 
instead of lower-order methods, because the 
former require less computation time for solu- 
tions of given accuracy [10]. 

For the estimation of the local truncation 
error, the following two practical techniques 
are well known: (i) one-step—two-half-steps 
error estimate. If we let y/2, and gn denote 
the approximations that are computed with 
an mth-order method by taking two half-steps 
and one full step (=h,), respectively, then 
an estimate of the local truncation error per 


half-step associated with y, is given by the 
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formula 
thar X Valh D" (y, — yl /2(1 — 2), 


where we assume that y$, does not vary much 
over the interval [x,, x,., ]. (ii) The formula 
of embedding form. The following formula of 
embedding form is more effective than the 
traditional method above. By use of the pth- 
order method y;,, =y} - X. an, ki and the 
(p+ 1)th-order method yi*, 2 yi +EP oa Ki, 
we obtain an estimate of the local truncation 
error in Ma as ti, 2 yl, — yit, [11-13]. At 
present, formulas of this type are known for p 
— 2-8, where the method for p — 5 is said to be 
the most efficient. Recently, a similar method 
for the estimation of the global truncation 
error has been investigated [14]. In addition, 
multistep generalizations of explicit Runge- 
Kutta methods, which are known as pseudo— 
Runge-Kutta methods, have also been inves- 
tigated [15]. 


E. Multistep Methods 


A linear multistep method approximates given 
differential equations by difference equations 
p(E) y; — ho(E) fj, where E is the operator of 
increasing n by 1, p() =at" + apC! + 
cb kä, o(0) m BC + Prk A 

Pi C Bo, % #0, and |a| - | Bo] #0. (If p and 

o are of degree k in C, we speak of a linear k- 
step method.) By means of these difference 
equations we can determine yj, from vi. 
Viek-29 +++» Y. Since the y! ,, are determined 
explicitly if f, — 0 and implicitly if fj, #0, the 
difference equations are called explicit and 
implicit, respectively. In the implicit case, if |A| 
<|(a,/B,)L|, where L is a tLipschitz constant 
for f'(x, y), then the y!,, can be calculated by 
successive substitutions. In order to obtain the 
y, successively by means of a k-step method 
with k 20, it is necessary, to begin with, to give 
the k — 1 sets of m values y!, ..., yi. , (called the 
starting values) in addition to the initial values 
yo — 1. To determine the starting values, 
Runge-Kutta methods, overall approximation 
formulas, etc., are ordinarily used. The number 
of times we have to compute the f' to pro- 
ceed one step from yi to yi}; is only 1 for an 
explicit case, while for an implicit case it is 
equal to the number of iterations required to 
secure the convergence of the successive substi- 
tutions (ordinarily, the step size h as well as the 
Oth approximation for vi, are chosen so that 
the convergence is attained after a few itera- 
tions). We have lim, a, -,e; =0 (xe[a, b]) 

for any f' and gi and any starting values such 
that lim, at, =n" (u—0, L, ..., k— D); if and 
only if the polynomials p and c satisfy the 
following two conditions: (i) consistency: p(1)— 
0, p'(1)— o(1); (ii) zero stability: Every root 
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of p(f)=0 lies on or inside the unit circle, and 
every root lying on the unit circle is simple. 
We call t; =(p(E)y'(x,) — ho(E) y"(x,))/a(1) the 
local truncation error. For a consistent and 
stable method determined by (p, o) there exist 
a real constant C,,, (#0) and an integer p 
(> 1) such that t; - h?*! Cpa HIRT (x yo(1)4- 
O(h?*?): p is called the order of the method 
and C zC,,,/ce(1) the error constant. For a 
given p satisfying (ii), ø is chosen to make the 
order as small as possible and then to make 
the error constant as small as possible. How- 
ever, p cannot exceed k + 1 if k is odd or k - 2 
if k is even; moreover, p can be equal to k -2 
only if all the roots of p(£)=0 lie on the unit 
circle. 

The following are examples of multistep 
methods. 


(1) Explicit Methods. 
(a) Adams-Bashforth methods. 


k=1, p()=C-1, o(-l pi 
C=1/2 (Euler method); 

k-2,  pQ=CC-1), ce(0-(35— 12, 
p=2, C=5/12; 

k=3,  p(Q=C(C—1), 
o(6)=(2307-166+5)/12, pz3, 
C=3/8; 

etc. 

(b) Midpoint rule. 

k=2, p()-C-M o(f)=26, 
p=2, C=1/6. 


(c) Milne’s predictor. 


k=4, p(Q=C4-1, 
a(t) - (8P — 4C + 8¢)/3, 
p=4, C=7/90. 


(2) Implicit Methods. 
(a) Adams-Moulton methods. 


k=1, ois, o(f=(C+1)/2, 

p=2, C=-1/12 (trapezoidal rule); 
k=2, p(Q=C(C—-1), 

(0) = GC + 86 — 1)/12, 

p=3, C--A/24; 
etc. 


(b) Milne's corrector (or the Milne-Simpson 
formula). 


k=2, p= —1, eiis +4041)/3, 
p=4, C--1/90. 


1137 


When an implicit formula (p, c) is used to 
obtain yi,,, the Oth approximation y?*, for 
yi, , is usually determined by an explicit for- 
mula (p*,6*) of the same order as (p, o) (where 
(p*,o*) itself may not necessarily be stable). 
This kind of combination of implicit and ex- 
plicit methods is called a predictor-corrector 
method (or PC method), where the formula 
(p*,a*) is called the predictor and (p, ol the 
corrector. Typical combinations are the mid- 
point rule and the trapezoidal rule, Milne’s 
predictor and Milne’s corrector (this combina- 
tion is called Milne’s method), and an Adams- 
Bashforth method and an Adams-Moulton 
method of the same order. A predictor- 
corrector method has the advantage that the 
local truncation error (of the corrector) can be 
estimated in the course of calculation without 
extra computation. In fact, if the orders of the 
predictor (p*, of) and the corrector (p, a) are 
equal to p and their error constants to C* and 
C, respectively, then the local truncation error 
ti can be estimated by ti = K Di + O(h?*?) (n= 
0,1,...), where Di = yi*, — yi,, is the differ- 
ence between the value yi*, at x, ,, obtained 
by the predictor and the vi. obtained by the 
corrector, and K «af C/[(C —C*)o*(1)] [16]. 

The term multistep methods originates from 
the fact that these use the values of the depen- 
dent variables at more than two different mesh 
points in order to proceed one step. These are 
also called multivalue methods since they use 
more than one value of the dependent vari- 
able. The multivalue method is, however, 

a more general concept than the multistep 
method. 

Linear multistep methods are not only ex- 
amples of the PC method; they are also exam- 
ples of the variable-step variable-order algo- 
rithms (VSVO algorithms), where the order of 
the formula as well as the step size are auto- 
matically chosen according to the behavior of 
the solution. In practical VSVO algorithms, 
the Adams-Bashforth-Moulton family of PC 
pairs of order | to 13 are usually used, and 
for solving stiff systems (— Section G) those 
correctors with large regions of absolute sta- 
bility are used. In these algorithms we use the 
multivalue method, which saves the informa- 
tion at different steps in a form convenient 
for the change of order and of step size. The 
multivalue method also facilitates error esti- 
mation and stability, and results in an efficient 
use of memory and reduction of the computa- 
tional cost. For example, in Gear's algorithm 
the information required for computing y,,, 
is saved in the following form: y, — (y, hy,, ..., 
(h*/k!) 9), where y'(x) is the polynomial P, ,(x) 
interpolating fa, fa-1s -<3 fa - 44, Where y® = 
d* ! P, (x)/dx'^! | x-x, hold. All the local trun- 
cation error estimators are based essentially 
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on Milne's device. In the VSVO algorithms 
heuristics play an important role [2, 17, 18]. 


F. Extrapolation Methods 


Let us assume that the numerical solution with 
the step size h at some fixed point x has the 
form 


y(x, h)=y(x)+ 5 zh" + OQ (hr), 
i=1 


where r is a positive integer. Then we can 
approximate y(x, h) by a function R,,(x, h) with 
(m+ 1) unknowns determined by the require- 
ment that R,,(x, hj) = y(x, hj, j=0, 1,2, ...,m, 

h; 7 h; (i<j) in order to approximate y(x) by 
R,, (x, 0). If R, (x, h) is a polynomial of degree 
mr in h, we call the foregoing method a poly- 
nomial extrapolation method, where R„(x, h) 

= y(x)+ o0(h""*) as h tends to 0. Defining 

Ri = y(x, h) and R,,(x,0)= Ri, for j=i,i+1, 
...,i4- m, we can calculate the Ra by the re- 
cursion relation Ri, = RE, A (Ri, — RP 4)/ 
((hj/h;, y — 1), mz 1. It is well known that, for 
the following method (Gragg's method), there 
exists an asymptotic expansion with r 22 [19]: 
x, — nh, n(0, h)= y(0), n(x, h) 2 yo + hf(yo. 0), 
Us, , h) m n0, a. A) + 2hf (n(x, h), x,), n= 1,2, 
...,N — 1, where xy = x, y(x, h) 2 1/2 [n(xy .,, h) 
+y(xy, h) - hf (y(xy, h), xy) ]. If R,,(x, h) is a 
rational function 


R,(x, h) =(Po + ph - p, I? + aut puit) 
Jo - q4 h-- q5h? ^ ... -- q,h?), 


where u — [m/2], v2 m — u 2 [(m-- 1/2], j— i, 
i 4-1, ...,i4- m, we call this method a rational 
extrapolation method. For r —2, the following 
formulas were derived by Bulirsch and Stoer 
[20]: 


Bes RE ENS — Rin- MIR: wl 


x[1 (RL, — Ri, DAR — RL =1), 





mzl,R',20, R$-y(uüh) 


G. Stability 


Consider the application of the general k-step 
method at the nth point (which is consistent 
and 0-stable) 


k k 
2, OYn =h Y. Bu 
j=0 j=0 
to y= f(x, y), y(xo) = yo. Let y, be the numer- 
ical solution at x= x,. Let e, — y(x,) — H, be the 


global error, and let o, be the total error at the 
nth application. Then we find 


k k of " 
p Xn =h > ay mt STOT + nic 
j=0 j=0 0Y 
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where 6,.; lies in the open interval whose 
endpoints are y,,; and y(x,.;). If we make two 
assumptions, Of/Oy =A (const), o, = « (const), 
the above equation reduces to 3.9 (xj; — 
h4[j)&, 9, whose general solution is given 


by 


where the d, are arbitrary constants and the r, 
are the roots, assumed distinct, of the poly- 
nomial equation 


(a,—hAB)r’ =0. 
j=0 


J 


We call the linear k-step method absolutely 
stable for a given ha if |r| « 1, s=1,2,...,k. 
On the other hand, we call the linear k-step 
method relatively stable for a given hå if |r| < 
Iri] (or |r,] « e^*, 5 —2,3, ..., k, where r, is the 
root corresponding to the theoretical solu- 
tion. We call the region S,— (hA | |r| « 1, s= 
1,2, ..., k (or S, e (hA Ir] « e?" (or |r,l s= 
2,3, ..., kj) in the complex plane the region 
of absolute stability (or the region of relative 
stability) of the linear k-step method. Also we 
call S, R (or S.N R) the interval of absolute 
stability (or the interval of relative stability) of 
the linear multistep method, where R is the 
real line. The explicit methods and the PC 
methods have a finite interval of absolute 
stability. The implicit methods usually have 
larger intervals of absolute stability than the 
corresponding explicit methods. The higher- 
order PC methods have smaller intervals, 
while Runge-Kutta methods do not. The 
PECE methods usually have larger intervals 
of stability than the corresponding PEC or 
P(EC)? methods, where P indicates an appli- 
cation of predictor, C an application of cor- 
rector, and E an evaluation of f [21,22]. 

Let us consider the linear system y' = Ay + 
g(x), where A is an m x m constant matrix 
and y(x), y' (x), e(x) e R". If A possesses m dis- 
tinct eigenvalues 4, = u, + iv, t— 1,2, ...,m, the 
theoretical solution of this system is given by 


3 


l 


10) = K,exp((u, + iv)x)C, + w(x), 

where K, and C,, =1,2,...,m, are, respec- 
tively, arbitrary constants and the eigenvectors 
corresponding to 4, and y(x) is a particu- 

lar solution. We call the linear system stiff 

if (1) 4, «0, t=1,2,...,m, (i) s 1, where s = 
màX «ce 1 |/ min, wl il, We call the non- 
linear system y'= f(x, y), y(x), y" (x), f(x, y)e R" 
stiff in an interval / of x if, for every xe I, the 
eigenvalues /,(x) of the ‘Jacobian matrix of f 
satisfy (i) and (ii). The ratio s is called the stiff- 
ness ratio. To solve stiff systems effectively 

a variety of methods with infinite stability 
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regions have been proposed. Some of them 

are: 

(1) A-stability (Dahlquist): 5,2 [hA] Re(h4) <0} 

(2) Stiff-stability (Gear): $, >S,US,, Sı = 

(hA| Re(hA) « —a <0}, $2 (h4| —a < Re(h2) < 

b, -c&Im(hA) c, bz 0,c» 0j 

(3) A(a)-stability (Widlund): S, 2 {hå | -a < n — 

arg(hA) « a, a e(0, n/2)) 

(4) Ao-stability (Cryer): $, 2 (h4| Re(hA) « 

0, Im(h2) 20]. 

The order p and the step number k of linear 

multistep methods are restricted by the follow- 

ing stability requirements: 

(1) A-stability: implicit, p x 2; trapezoidal rule is 

the most accurate method. 

(11) Stiff-stability: implicit, p < k; backward 

differentiation formulas Zi 9 a;y,.; Ma are 

stiffly stable for p —k— 1,2, ...,6, O-unstable 

for k — 6. 

(i1) A(a)-stability: implicit; there exist high- 

order A(0)-stable linear multistep methods. 

(A method is said to be A(0)-stable if it is A(a)- 

stable for some sufficiently small «€(0, 2/2).) 
For the Runge-Kutta (P, p) methods, we can 

write 


à, = +h +(hd)?/2! +4... (hA p! 


Pi 


T X T O, Qui, 
q-ptl 

where 0f/0y = å (const) and y, are functions of 
the coefficients of the method in use. We call a 
region R — (hà ||1-- hA - (h2)/21 4- ... -- (hAY/p! 
FELLaGQAY| <1} a region of absolute 
stability of the Runge-Kutta (P, p) method. 

The implicit Runge-Kutta methods have a 
larger region of absolute stability than the cor- 
responding explicit Runge-Kutta methods. 
Therefore the implicit Runge-Kutta methods 
are suitable for stiff equations, and the explicit 
Runge-Kutta methods are suitable for nonstiff 
or mildly stiff equations. 

Recently, Yamaguti and others have pointed 
out that the instabilities occurring in the nu- 
merical solution of ordinary differential equa- 
tions are closely connected to the phenomena 
of chaos studied by Li, Yorke, and others 
[23, 24]. 


H. Boundary Value Problems 


A boundary value problem is generally for- 
mulated as the problem of obtaining functions 
y'(x) that satisfy the differential equations 
y"(x) 9 f(x, y! (x), ..., y"(x)) and boundary 
conditions B,(y! (x,.), ..., V Xn) V Xa) 
Y"Qx, oT Qu), os VG) HOG Ko 1, ..., 
m), where the f' and B, are given functions. 

If the f and B, are linear in the ri, we can 
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first calculate the solutions y5(x) of the differ- 
ential equations under an appropriate initial 
condition, as well as a set of m independent 
solutions yj(x) (I= 1, ...,m) of the homogene- 
ous differential equations (say, under the ini- 
tial conditions y;(a)=6; at some point x =a), 
and then substitute the expression for the 
desired solution y'(x) 2 yi6) + D™, aj yi(x) in 
the boundary conditions to obtain simulta- 
neous linear equations for the unknowns o; 
(— 302 Numerical Solution of Linear Equa- 
tions). No universally powerful method is 
known for the case where the f' or the B, or 
both are nonlinear. Ordinarily, we resort to 
the trial-and-error method of solving the 
differential equations iteratively under differ- 
ent initial conditions until the solution fully 
satisfies the boundary conditions, or to ap- 
proximating the differential operators by 
suitable difference operators to obtain a set of 
(generally nonlinear) simultaneous equations 
approximating at the same time the differential 
equations and the boundary conditions. For 
related topics — 298 Numerical Computa- 
tion of Eigenvalues; 315 Ordinary Differen- 
tial Equations (Boundary Value Problems); 
[25-27]. 


I. Methods Other than Difference Methods 


Besides difference methods there are frequently 
used methods for finding in a given finite- 
dimensional function space a function that 
best satisfies the differential equations as well 
as the initial or boundary conditions. Denote 
the equations by L[ y(x)] 20 and assume the 
conditions to be linear. We choose a function 
yo(x) that satisfies the conditions and that is 
considered to approximate the exact solution, 
and also functions y,(x) (/=1,...,g) that are 
considered to represent the typical deviations 
of yo(x) from the exact solution and each of 
which satisfies the homogeneous conditions. 
We then determine the x, in such a way that 
y(x) = yo(x) + Zi: xı yi (x) best satisfies the 
differential equations. Corresponding to differ- 
ent interpretations of the words "best satisfy" 
there are different methods. The collocation 
method determines the «, so as to nullify the 
values of L[ y(x)] at some prescribed points 
x;; the method of least squares minimizes the 
integral of | L[ y(x)]? over the considered 
region; the Galerkin method makes L[ y(x)] 
orthogonal to every y(x) (l= 1, ..., at and the 
Ritz method considers the variational problem 
ôJ [ y(x)] 40 whose *Euler equation is L[ y(x)] 
=0 (— 46 Calculus of Variations F), if such a 
problem exists, and determines the a, so as to 
have J [ y(x)] take an extremum value with 
y(x) = yo(x) + Xf- y(x). 
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304 (XV.9) 
Numerical Solution of Partial 
Differential Equations 


A. General Remarks 


The numerical solution of partial differen- 

tial equations became practical in the 1950s 
with the advent of automatic digital compu- 
ters. Nowadays, by means of modern high- 
performance computers, the numerical solu- 
tion of partial differential equations is carried 
out extensively and often on a very large scale 
for problems in physics, engineering, and 
other fields of applied analysis, in order to 
obtain approximate solutions of rigorous 
equations or to simulate real phenomena by 
means of numerical experiments. Various 
numerical-solution schemes have been pro- 
posed, applied, and studied, and programming 
techniques required to implement numerical 
solutions are being invented regularly; some 
are of a general nature, and others are de- 
vised for particular problems. The variational 
method, which includes the Ritz-Galerkin 
method and the finite element method, and 
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the difference method deserve special mention 
here because of their generality and scope of 
application. 


B. The Ritz-Galerkin Method 


Having been applied mainly to boundary 
value problems for elliptic partial differen- 

tial equations, the Ritz method is a classical 
method, which has been used since the old era 
of mechanical calculators, and is mathemati- 
cally nothing other than the *direct method in 
the calculus of variations (— 46 Calculus of 
Variations F) applied to tvariational problems 
that are equivalent to the original boundary 
value problems. In order to exemplify the 
method, let Q be a bounded domain in R“ 
with piecewise smooth boundary S, and con- 
sider the Dirichlet boundary value problem 
comprised of the Poisson equation 


Au- —f (1) 
and the boundary condition 
ul ,=B. Q) 


Here f and $ are given functions on Q and S, 
respectively. (In the following, given functions 
are assumed to be sufficiently smooth.) The 
boundary value problem (1), (2) is equivalent 
to the variational problem of minimizing the 
functional 


1 
Jlu]=5a u) — (wu, f) G) 


within the set D(J) of admissible functions u 
subject to (2). In (3) the following notation is 
used: 


a(u, v) 2 (Vu, Vo), (9 — | Vu: Vvdx 
Q 
and 


(u, v) — (u, duc uvdx, 


Q 


while 
D(J)= {ue L,(Q)|VueL,(Q), | u|,— fj. 


This variational problem is further reduced to 
the condition: 


ue D(J), 
a(u, p)=(f, 9) 


where V stands for the tSobolev space H1(Q) 
(— 168 Function Spaces B), that is, 


(Vpe V), (4) 


V= Hi (Q)= {ue L,(Q)| VueL;(Q), u|, =0}. 


Note that V is equal to D(J) with 6=0. The 
condition (4) is often called a weak form of the 
boundary value problem. In applying the Ritz 
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method to this problem, we introduce un- 
known parameters 2,25, ...,2, and set the 
approximate solution u, in the form 


u,— F(x; 0,,%,...,4,)€D(J) 


and minimize J [u,] as a function of the n vari- 
ables «,,%,,...,%,- A standard way to form 

u, is to choose first a be D(J) subject to the 
inhomogeneous boundary condition and ¢,, 
95, ...,9,€ V subject to homogeneous bound- 
ary condition, and then to set 


u,—b-4- 049, EE DE DEE 0,9. (5) 


The functions o; (j — 1,2, ..., n) are called basis 
functions or coordinate functions in the Ritz 
method. Denoting by D, the set of all possible 
functions appearing on the right-hand side of 
(5) and by V, the linear space generated by 

Q1. 5, ..., 9,, We can state the condition to 
determine the approximate solution 4, as: 


u,€ D, 


alun, 9) - Cf, 9) 


Thus the Ritz method is a kind of projective 
approximation method which approximates the 
weak equation (4) by its projection on a finite- 
dimensional space V,. The condition (6) is 
equivalent to the equations 


(j21,2, ...,n). (7) 


(Vee V,). (6) 


a(u,, p;) = (f, od 


Particularly when BzsümOL D, coincides 
with V,, and the equations (7) that deter- 
mine the coefficients o; are reduced to a linear 
equation 


Ka=y (8) 


for a= äu, 25, ...,2,), where the matrix K = 
(Kj) and the vector y='(y,, Y2, ..., y,) are given 
respectively by K;=a(p; oj) and y;—( f, oj). 
Solving (8) numerically by the *Gauss elimina- 
tion method or by fiteration, one eventually 
obtains the approximate solution u, by the 
Ritz method. The Ritz method is also appli- 
cable, for instance, to the eigenvalue problem 
Au = du, u|, —O, since this eigenvalue problem 
is equivalent to the variational problem of 
finding a stationary Rayleigh's quotient R[u] = 
a(u, u)/(u, u) with V as the set of admissible 
functions. In the approximation to restrict the 
admissible functions to V,, the approximate 
eigenvalue 1 is determined through the ma- 
trix eigenvalue problem 


Ka — A Ma, (9) 


where the matrix K is as above and the matrix 
M —(Mijj is given by Mio, p). Sometimes, 
optimization of J[u,] or R[u,] is carried out 
more directly, e.g., by the tgradient method, 
particularly when u, contains free parameters 
a: in some nonlinear way. 
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Many studies have been made of conver- 
gence and error estimation for the Ritz method 
in general (— e.g., [1-4]). For instance, it is 
known that, if B=0, fe L?(Q), and {¢,, @, 

.-,Q,,--.} Spans a linear subspace dense in 
V =H, (Q), then the approximate solution u, 
obtained through (6) (or through (12) below) 
converges to the rigorous solution in the H!- 
topology (and hence in the L?-topology). 
Nevertheless, success in using the method in 
practical applications can be gained only by 
a clever choice of trial functions. A typical 
but systematic example of such a choice is 
made by the finite element method (— Sec- 
tion C). 

We proceed to the Galerkin method [1]. 
Suppose that we are to solve the equation 


L[u]- Au- (b: V)u- — f, (10) 


which is the equation in (1) perturbed by a 
lower-order term (the convection term). Here b 
=(b,(x), ..., b,(x)) and (b: V)u- Xj b(x)(0u/Ox;). 
For simplicity, the boundary condition (2) 

is assumed to be homogeneous (i.e., 6 =0). 
Although the boundary value problem (10), (2) 
cannot be reduced to a variational problem 
since (10) is not symmetric, it is equivalent to 
its weaker form 


((b* V)u, 9) 2 (f, €) 


provided that ue V. Note that (11) is ob- 
tained from (L[u]+ f, 9),, —0 by transforming 
(L[u], 9) through integration by parts. Ac- 
tually, this boundary value problem has a 
unique solution u for any fe L,(Q) if |div b| is 
sufficiently small. According to the Galerkin 
method as applied to the present problem, one 
replaces V by V, in (11) and determines the 
approximate solution u,=«,9,+...+4,9,€V, 
by the condition 


(Uh: Vin, 9) 7 Cf. 9) 


which is equivalent to the equations 


(f. oi (j=1,2,...,n). 
(13) 


a(u, p)— (och — (10) 


alun 9)— (ocv) (2) 


alun, Pj) — (b: V)u,, 9j) = 


Sometimes it is convenient to project (L[u] + 
£4 9) «0 onto the finite-dimensional space and 
to deal directly with (L[u,]+ fg) - (j= 
1,2, ..., n). When the coefficients of the equa- 
tion and the given function f are periodic in 
space variables and when sine or cosine func- 
tions are chosen as basis functions, the Galer- 
kin method turns out to be the same as the 
so-called Fourier approximation method and 
can be efficiently implemented with the aid of 
a fast Fourier transform (FFT) to yield the 
required Fourier coefficients numerically [5]. 
The Galerkin method can be applied to 
the numerical solution of evolution equa- 
tions, namely, in order to approximate time- 


304 C 
Numerical Solution of PDEs 


dependent solutions. This is exemplified 
through the following initial-boundary value 
problem for the diffusion equation (heat 
equation): 

= = Au (t20, xeQ) (14) 


with the homogeneous boundary condition 
u|,=0 (15) 
and the initial condition 

ul,-o—a(x) = (x EQ). (16) 


We first set the required approximate solution 
u, — u,(t, x) in terms of the basis functions 


91; 92; +++ 9, OF V, as 
Uy = os(t) 9, t 5 (t) 9o +... (t) Pn, (17) 


and then determine the coefficients o, (t), oc; (t), 
...,%,(t) from the conditions 


Zi, gl —atu,, 9) (zt, Voc V) (18) 


dt 

and 

(u,(0), 9) — (a, p) 
Since (18) is equivalent to the equations 


d 


ji Ue p)= — a(u,, Qj) (j=1, 2; ida A), : 


(och (19) 


the vector function a(t) — (a, (t), a, (t), ... , «,(t)) 
is obtained by solving the ordinary differential 
equation 


M um = — Ka, (20) 

dt 
where the matrices M and K are those in (8) 
and (9). The initial value «(0) of « should be 
given in accordance with (19). Sometimes the 
procedure above is called a semidiscrete ap- 
proximation based on the Galerkin method. 
Obviously, this method can be applied to more 
general evolution equations, for instance, to 
the diffusion-convection equation 
Ou 
g L15 Au +ib: V)u, (21) 
which governs time-dependent states corre- 
sponding to stationary states subject to (10). 
As a concrete example, we can refer to the 
finite element approximation for evolution 
equations described in Section D. If the geom- 
etry of Q is simple enough, e.g., for 1-dimen- 
sional Q, one can adopt eigenfunctions of L[ ] 
as the basis functions; the resulting version of 
the semidiscrete Galerkin method is called a 
spectral method. 

In order to obtain u,(t) one has to carry out 
numerical integration for (20), discretizing 
the time variable. The Galerkin approxima- 
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tion with discretization of the space and time 
variables is called a full discrete approximation. 
Finally, an advantage of the Ritz-Galerkin 
method is that if the boundary condition 
imposed on rigorous solutions is a natural 
boundary condition, then the admissible func- 
tions and particularly the basis functions need 
not satisfy it [4]. 


C. Finite Element Methods for Boundary 
Value Problems 


In recent years the method of numerical solu- 
tion of partial differential equations most 
extensively employed in structural mechanics 
and many other fields of engineering has been 
the finite element method. In its standard form, 
the finite element method can be regarded, 

at least mathematically, as a type of Ritz- 
Galerkin method that adopts as its basis func- 
tions piecewise polynomials of low degree 
and with narrow supports. Although the idea 
leading to the finite element method can be 
traced back to a paper by R. Courant in 1943 
(Bull. Amer. Math. Soc.), the method acquired 
its popularity in the late 1950s when it was 
rediscovered by engineers on the basis of me- 
chanical considerations [6]. Here we apply 
the method to the boundary value problem 
(1), (2), assuming that f =0 and Q a polygonal 
domain in R? [7-9]. First, Q— QUS is divided 
into small triangles of which the length of any 
side does not exceed h >0. Each triangle T 
appearing in this decomposition of Q is called 
a triangular element, or simply an element 
(following the terminology in structural me- 
chanics), and we denote the set of all triangular 
elements by 7,, h being equal to man, (the 
longest side of T}. The set of all nodal points, 
i.e., the vertices of the triangular elements, is 
denoted by N, and we put 


N;- (PeN |PeQ) - (P,, P,, ..., bi (22) 


Then a standard choice of admissible functions 
is to adopt as V, in (6) the following V, cV= 
H (Q): 


V, = (v,€ C(Q)|v, is linear on each element 
and v,|,=0}, (23) 


namely, V, is the set of all elementwise linear 
continuous functions satisfying the boundary 
condition. Now the approximate solution 


u, € V, is determined by the condition 
alUn, q,) —(f.9,) — (Vp, € V). (24) 


A basis of V, is formed by pyramidal functions 
g; (j — 1,2, ..., n)e V, such that 


e(P)-! and o(Q)-0 (QeN-(Pj) (25) 
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In order to obtain u,, one has to solve (8) 
numerically for a. Since the support of o; is 
confined to elements adjacent to Pj, the matrix 
K turns out to be a sparse matrix of multi- 
diagonal type, and hence particular elimination 
procedures can be applied with high efficiency. 
In the finite element method, the matrix K is 
often called the stiffness matrix. The conver- 
gence of u, to u as h—0 is guaranteed if 7, 

(0 « h x hj) satisfies a certain regularity con- 
dition. For instance, if any angle of a triangle 
T is not smaller than 6, 0, then u, converges 
in the H! (O)-topology. Moreover, when Q is 
convex, it holds that 


[u — unl: < Chl f lp. 
|| ulle, S Ch? Sl, (26) 


[7,9]. Uniform convergence of u, to u and L,- 
estimation of the error have also been estab- 
lished (e.g., [7, chap. 3]). u, is subject to the 
maximum principle, provided that the trian- 
gulation Z, is of acute type. Here, Z, is said to 
be of acute type (or of strongly acute type) if 
any angle 0 of Te Z, is in 0 « 0 € (n/2) (0 « 

0 « 0, « (1/2). To acquire higher accuracy, 
more sophisticated admissible functions that 
are piecewise polynomials of higher degree are 
used. To take account of curved boundaries, 
several devices, including the isoparametric 
method, have been proposed. 

In dealing with biharmonic equations or 
other equations of higher order, one can con- 
veniently apply a modified version of the finite 
element method of mixed type or of noncon- 
forming type; in the latter the approximate 
solution is sought within a class of functions 
which are of less regularity at the interface 
of elements and hence do not belong to the 
domain of the original variational problem. i.e., 
which are not admissible in the classical sense 
[10, 11]. 


D. Finite Element Methods for Initial Value 
Problems 


The most basic type of finite element method 
applicable to evolution problems is described 
here, and concerns the initial boundary value 
problem (14)-(16), assuming that Q is the same 
as in Section B [8,9]. In the semidiscrete finite 
element approximation one uses the functions 
g; of the preceding section as the basis for the 
approximate solution, now denoted by u,, and 
then determines u, through (17)-(19) with V, 
replaced by V,. Then u, converges to u as h0, 
provided that the triangulation 7, is regular. 
Furthermore, under the same condition that 
makes (26) hold, the error is given by the fol- 
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lowing estimate [12]: 


lu— url, € Ch^|ally;/t  (t>0), (27) 


which reflects the smoothing property of the 
diffusion equation. In the terminology of the 
finite element method, the matrix M in (20) is 
called the mass matrix. In order to gain a fully 
discrete finite element approximation u, , for 
u, one discretizes the time variable with the 
*mesh length t and solves the difference analog 
of (20), i.e., either 


M (a(t 4-1) — a (t): = — Kalt) (28) 
or 
M(a(t - 1) — a(t))/t = — Ka(t+7), (29) 


where t is restricted to t 2 kx (k=0, 1,2, ...). 
The difference schemes (28) and (29) are of 
forward type and of backward type, respec- 
tively. From (29) follows ||a(kt)|| < ||x(0)], and 
consequently Tu, aH ll, <Clla||,,. In the gener- 
ally accepted terminology, the approximation 
is stable if for each T'— 0 there exists a positive 
constant C4 depending on T such that 


lu ()I«Crlal (<t=kt<T). (30) 


Thus the approximation through (29) is un- 
conditionally stable, while the one through 
(28) is stable only under certain conditions on 
the triangulation 7, and on the time mesh zt. 
For instance, the forward scheme (28) is stable 
if 7, is regular and satisfies the inverse 
assumption 

h 


su max ———————————— < +00, 
eg zez, (longest side of T) 


and if the ratio t/h? is sufficiently small 
[13, 14]. 

Sometimes, the mass matrix M in (28) and 
(29) is modified by the following procedure, 
called mass lumping: For each P. one joins 
alternately the center of gravity of the ele- 
ments with P; as one of their vertices and the 
middle point of the sides with P; as one of their 
endpoints, thus forming a closed broken line 
I; surrounding P We denote by o the char- 
acteristic function of the polygonal domain 
B; bounded by I;. When mass lumping is ap- 
plied, the matrix M is replaced by the matrix 
M =((@;, 9))). Generally, mass lumping re- 
laxes the conditions for the stability and the 
maximum principle to hold, although it may 
lower the accuracy to some extent (H. Fujii, 
Proc. US-Japan Seminar, Univ. Tokyo Press, 
1973). Usually, schemes without mass lumping 
are called consistent mass schemes. 

In principle, there is no difficulty in applying 
the finite element method to the diffusion- 
convection equation (21). However, when || || 
is large, the stability condition becomes strin- 
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gent. To meet these difficulties, M. Tabata, 

T. Ikeda [15], and others have devised some 
schemes that enjoy better stability, the maxi- 
mum principle, and even the law of conserva- 
tion of mass by improving mass lumping and 
introducing the idea of an upstream approxi- 
mation or an artificial viscosity for discretizing 
the convection term. Moreover, finite element 
methods are currently being applied to wave 
equations, the Navier-Stokes equations, and 
various other evolution equations. 


E. Difference Methods for Boundary Value 
Problems 


In difference methods for solving partial dif- 
ferential equations we reduce the original 
equation to its difference analog, replacing 
tdifferential quotients of the unknown function 
by the corresponding difference quotients. The 
difference analog, usually a system of algebraic 
equations, is then solved numerically, yielding 
the desired approximate solution. Since the 
last part of the method, i.e., the numerical 
solution of the difference analog, requires 
a large amount of computation, difference 
methods were not feasible until the develop- 
ment of automatic computers. 

In terms of the variable x, difference ap- 
proximations for the derivative df/dx are the 
forward difference 


(Def) (x) =(f(x + h)— f)/h, 

the backward difference 

(Df (x) 2 GG) — f(x — bh, 

and the central difference (symmetric difference) 
(SES x) e (f(x + h) — f(x — h)/2h. 


In order to exemplify the difference method 
applied to boundary value problems, we re- 
turn to equations (1) and (2), supposing that Q 
is a 2-dimensional square Q = ((x, y)|0« x « 1, 
0cy«1] [1,16-18]1. Then we cover QUS by 
a square net (lattice, grid) with mesh length 
h=1/N, for N a large integer. The set of net 
points P, , — (mh, nh) lying in Q is denoted by 
Q,, and S, is the set of net points on S. We 
want to have a net function u, on R,=Q,US, 
that approximates the solution u of (1) and 
(2), where u, is determined by the difference 
equation 


Anda, Y= —fGo y) (C y)€Q,), (31) 


and the boundary condition 


((x, y) e Sy. (32) 


Here A, is a difference operator, the 5-point 


u(x, y)= B(x, y) 
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difference analog of A, which is defined by 


A, p = DEDi gp + DÌ D; o 


=(p(x +h, y)+ o(x — h, y) + p(x, y +h) 
+ (x, y — h) - 4o(x, y))/h?. 


In view of the maximum principle for net 
functions on R,, it is seen that (31) and (32) 
admit only u,=0 as a solution if f — f —0, 
which implies that the solution u, of (31) and 
(32) exists uniquely for all f and f. When we 
are concerned with the convergence of u, to 

u as h-0, we have to consider (31) and (32) 
for all small A7 0. Such a family of difference 
equations is called a difference scheme. Actu- 
ally, u, obtained through (31) and (32) con- 
verges to u as h —0, and we can say that the 
difference scheme (31) and (32) is convergent. 
In general, the order of the error u — u, de- 
pends on the smoothness of u. For instance, if 
uc C^(Q), then |u—u,| < Ch? holds uniformly 
on R,. When the asymptotic behavior of the 
error is known to satisfy u — u, = wh? + o(h?) 
for some w= w(x, y), it is possible to attain a 
higher accuracy by eliminating the h?-order 
term if we compute u, for two different values 
of h and form an appropriate linear combi- 
nation of the u, thus obtained (Richardson's 
extrapolation; — e.g., [17]). The convergence 
of difference schemes applied to boundary 
value problems is not affected if rectangular 
nets are used instead of square nets. Many 
techniques have been devised to treat curved 
boundaries. There have been attempts recently 
to map Q into a domain Q of simpler geom- 
etry, say, a rectangular one, and then discretize 
the arising partial differential equations with 
variable coefficients in Q (e.g., J. F. Thompson 
et al., J. Comput. Phys., 1982.) Moreover, 
when information regarding the location and 
nature of singularities of solutions is available, 
one can refine the net near these points. 

The difference analog of elliptic equations 
described above is a system of linear equa- 
tions with the values of the u, as its unknowns. 
Since the coefficient matrix of this system is of 
multidiagonal type, direct elimination methods 
can be efficiently applied to solve it. Partic- 
ularly, some elimination algorithms employ- 
ing vector computations have been invented 
[19]. Iteration methods of the Gauss-Seidel 
type can also be applied, where acceleration 
of convergence by SOR is effective (— 302 
Numerical Solution of Linear Equations C). 


F. Difference Methods for Initial Value 
Problems 


To demonstrate characteristics of the dif- 
ference method applied to evolution equa- 
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tions [17, 20, 21], we first consider the initial 
boundary value problem (14)-(16) for the dif- 
fusion equation supposing that Q — (0, 1), a 
1-dimensional interval. Then we cover Q = 
[0, 00) x Q by a rectangular net t — nt (n= 
0,1,2, ...) and x =jh (j 20, 1,...) with mesh 
lengths At 2 : and Ax =h. The value of the 
approximate function u, , at the net point 
(nt, jh) is denoted by U?. Then the simplest 
difference scheme of forward type for (14) is 
written as 





Ur" — U; UA + Uf, —2U? 


T h? 


(33) 


Namely, we adopt the difference operator 
L. aC ]- D,— D*D* as the difference analog of 
(0/0t) — (0?/0x?). Equation (33) is consistent in 
the sense that L, ,[u] — "the residual of u^ ^0 
(x, h—0) for any smooth solution u of (14). This 
is the case also with the difference scheme of 
backward type 
Ur = Ur Ur + Ur -2uy;' 
T x h? l 





(34) 


To compute the approximate solution U” = 
(UP [Ox j <N}, (33) or (34) must be combined 
with the approximate initial condition cor- 
responding to (16), say U? =a(jh), and with 
the boundary condition Uj} = Us =0. Then one 
can proceed from U? to U!, from U! to U?, 
and so on. Actually, (33) is rewritten in the 
form 


Ur" = AUP, +AUP E -22)0f, vn) 


with 4 =1/h?, which gives U"*! explicitly in 
terms of U". Hence (33) is an explicit scheme. 
On the other hand, in order to acquire U"*! 
from U" according to (34), it is necessary to 
solve a system of linear equation with U"*! as 
its unknown. Hence (34) is an implicit scheme. 
Since the coefficient matrix of the system of 
linear equations to determine U"*! according 
to (34) is tridiagonal, one can employ a partic- 
ularly efficient elimination algorithm. 

We introduce the maximum norm Volz 
max, |o(jh)| for net functions o, on R, —O,U 
$,— [ix —jh|0 xj x Nj. Then, as is obvious from 
(35), the approximate solution U"=u, ,(nt) 
obtained through (33) satisfies || U"||, « lU, 
(n=0, 1, ...), provided that 


roue I 
O<4= TSS (36) 


Generally, a difference scheme approximating 
an initial value problem is said to be stable if 
the approximate solution u, , for the initial 


value a, — u, ,(0) satisfies, for small t and A, 
lus ala Maia, (O<t=nt<T) (37) 


for any T0, M; being a constant depending 
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on T. Lax’s equivalence theorem asserts that 

if the original initial value problem ts "ewell. 
posed and if the approximating difference 
scheme is consistent, then the stability of the 
difference scheme is necessary and sufficient 
for the convergence of the approximate solu- 
tion u,,, to the rigorous solution u as t, h0. 
For instance the explicit difference scheme (33) 
for the 1-dimensional diffusion equation is 
convergent under the mesh condition (36). On 
the other hand, the implicit scheme (34) is 
unconditionally stable and hence is conver- 
gent. Many difference schemes more sophisti- 
cated than (33) and (34), of higher accuracy or 
with other favorable properties, have been 
proposed and studied. All these difference 
schemes can be generalized to the case of 
many-dimensional Q. For instance, if Qc R?, 
then by means of the difference operator A, in 
(31), the forward scheme and the backward 
scheme are given by D,U" = A,U" and D,U"— 
A, U"*!, respectively. The former is stable if 
0<A=t/h? <1/4, while the latter is uncon- 
ditionally stable. Moreover, in a method called 
the ADI method (alternating direction implicit 
method) one introduces U"*!? on fractional 
steps t — nt + 1/2 and discretizes (14) as 


Uni" — Un 


= DADA U" + Dh ph Une 
1/2 ter n^ 


Un ym 
——— — — 2 piptu"*t! phphu"*'?, (38) 
2/2 pe A 
Then, starting from U?, the computation goes 
as US UT? Dia Un S Un*12., by 
solving at each step alternatively a difference 
equation implicit with respect to x and y, for 
which a particular elimination method can be 
used with good efficiency. Furthermore, the 
ADI method is unconditionally stable. Various 
types of fractional-step method that generalize 
the ADI method have been proposed [12, 17]. 

We now proceed to hyperbolic equations. 
First consider the Cauchy problem where the 
spatial domain is the whole R!. Noting that 
the wave equation v, — v,,, for instance, can be 
rewritten as 


(9. 1\ĉu 
«Al 0Jéx 


by putting u — (v, v,), we here deal with a 
hyperbolic system of the form 


Ll S 


Qu Qu 
— = A— (39) 


where the unknown u is an m-vector u= 
'(u,,U>,...,U,,), and A is a constant m x m 
matrix that is supposed to be real and sym- 
metric. For hyperbolic equations, explicit 
schemes are generally preferred, and as a typ- 
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ical one we mention the Friedrichs scheme. 
In the Friedrichs scheme applied to (39), the 





approximate solution U —'(U,, U,, ..., Um) is 
determined by 
U (t - 7, x) -5(U (t, x +h) + U(t, x — h)) 
t 
= Ad*U(t, x). (40) 


Here t is restricted to t — n: (n=0, 1,...), while 
x is regarded as a variable ranging over R!. 
Introducing translation operators T, and T. , 
by 


T,:o(x)Oo(x-Fh) and T ,:9(x) o(x — h), 


we obtain from (40) 
U(t+t,-)=S,U(t,), - (41) 


where, with the mesh ratio 4 — z/h, S, is given 
by 


s-[la 44A mo (1-44 42 
dei ER? ht 25 -h (42) 


Generally, if a difference scheme is reduced to 
the form of (41), then the operator S,, which 
yields evolution of the solution of the dif- 
ference equation for one time step, is called 
the amplification operator of the scheme. The 
stability of the difference scheme is implied 
by the uniform boundedness of the operator 
norm [|Sz|| of S" acting in the Hilbert space 
(L4(R!))" for 0x n « T/t. Consequently, the 
scheme is stable if |S,]| < 1 + Cz, C being a 
constant. The tsymbol S,(£) of S, with respect 
to the Fourier transform is called the amplifi- 
cation matrix of the scheme. The L stability 
of the scheme mentioned above is equivalent 
to the uniform boundedness of the matrix 
norm ||$?()|| for Ont « T and éceR!. There- 
fore, denoting by r,(4) the spectral radius of 
S, (£), one can give a necessary condition for 
the stability by |r,(£)| € 1 + Cz, which is called 
the von Neumann condition. Concerning gen- ` 
eral critera for the uniform boundedness of 
powers of §,(£) when S,(£) is not necessarily 
normal, a fundamental theorem was given by 
H. O. Kreiss in 1962 [21]. 

If the largest-modulus eigenvalue of A is 
denoted by uo, then the Friedrichs scheme is 
stable and hence is convergent under the con- 
dition on the mesh ratio 4 — z/h given by 


A<1/|tol- (43) 


In particular, (43) implies that for the stable 
Friedrichs scheme 


propagation speed of the difference scheme 


> propagation speed of the original equation. 
(44) 


Condition (44) is a necessary condition for 
general difference schemes approximating 
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hyperbolic equations to be stable, and is 
known as the CFL condition (Courant- 
Friedrichs-Lewy condition) [22]. For a scheme 
more accurate than the Friedrichs scheme we 
refer to the Lax-Wendroff scheme, which pro- 
ceeds by way of the following amplification 
operator $: 


2 
T 
Aldin APE 


2 


À À 
=1+5A(T,+ T4) 7 (5,7214 T y). 


The Lax-Wendroff scheme is stable under (43). 
There are many other difference schemes 
to approximate (39), most of which are appli- 
cable to higher-dimensional spaces. Some of 
these schemes can be conveniently employed 
to solve nonlinear hyperbolic equations, for 
instance, those arising in the gas dynamics of 
compressible fluids, for which the Friedrichs 
scheme and the Lax- Wendroff scheme were 
originally intended [23]. Concerning difference 
schemes with variable coefficients which ap- 
proximate a tregularly hyperbolic system with 
an x-dependent principal part, criteria for the 
stability of the scheme in terms of the symbol 
Sr, é) of S, have been obtained, making use of 
the theory of tpseudodifferential operators, by 
P. D. Lax and L. Nirenberg (Comm. Pure Appl. 
Math., 1966), M. Yamaguti and T. Nogi (Publ. 
Res. Inst. Math. Sci., 1967), R. Vaillancourt 
(Math. Comput., 1970), Z. Koshiba (J. Math. 
Anal. Appl., 1981), and others. 


References 


[1] L. V. Kantorovich and V. I. Krylov, Ap- 
proximate methods of higher analysis, Inter- 
science and Noordhoff, 1958. (Original in Rus- 
sian, 1950.) 

[2] J. Necas, Les méthodes directes en théorie 
des équations elliptiques, Masson, 1967. 

[3] R. Temam, Numerical analysis, Reidel, 
1973. l 
[4] R. Courant and D. Hilbert, Methods of 
mathematical physics, Interscience, I, 1953; TI, 
1962. 

[5] J. W. Cooley and J. W. Tukey, An algo- 
rithm for the machine calculation of complex 
Fourier Series, Math. Comp., 19 (1964), 297— 
301. 

[6] M. J. Turner, R. W. Clough, H C. Martin, 
and L. J. Topp, Stiffness and deflection anal- 
ysis of complex structures, J. Aero. Sci., 23 
(1956), 805—823. 

[7] P. G. Ciarlet, The finite element method 
for elliptic problems, North-Holland, 1978. 
[8] A. R. Mitchell and R. Wait, The finite 
element method in partial differential equa- 
tions, Wiley, 1977. 


1147 


[9] G. Strang and G. Fix, An analysis of the 
finite element method, Prentice-Hall, 1973. 
[10] F. Kikuchi and Y. Ando, On the conver- 
gence of a mixed finite scheme for plate bend- 
ing, Nuclear Eng. Design, 24 (1973), 357—373. 
[11] T. Miyoshi, A mixed finite element 
method for the solution of the von Karman 
equations, Numer. Math., 26 (1976), 255—269. 
[12] H. Fujita and A. Mizutani, On the finite 
element method for parabolic equations I, 
Approximation of holomorphic semi-groups, 
J. Math. Soc. Japan, 28 (1976), 749—771. 

[13] T. Kato, Perturbation theory for linear 
operators, Springer, second edition, 1976. 
[14] T. Ushijima, Approximation theory for 
semi-groups of linear operators and its appli- 
cation to approximation of the wave equa- 
tion, Japan. J. Math., 1 (1975), 185-224. 

[15] T. Ikeda, Maximum principle in finite 
element models for convection-diffusion phe- 
nomena, Kinokuniya and North-Holland, 
1983. 

[16] G. E. Forsythe and W. R. Wasow, Finite- 
difference methods for partial differential equa- 
tions, Wiley, 1960. 

[17] G. I. Marchuk, Methods of numerical 
mathematics, Springer, second edition, 1982. 
[18] G. D. Smith, Numerical solution of par- 
tial differential equations: Finite difference 
methods, Clarendon Press, second edition, 
1978. 

[19] D. L. Book (ed.), Finite-difference tech- 
niques for vectorized fluid dynamics calcula- 
tions, Springer, 1981. 

[20] F. John, Lectures on advanced numerical 
analysis, Gordon & Breach, 1967. 

[21] R. D. Richtmyer and K. W. Morton, 
Difference method for initial value problems, 
Interscience, 1957. 

[22] R. Courant, K. Friedrichs, and H. Lewy, 
Über die partiellen Differenzengleichungen der 
mathematischen Physik, Math. Ann., 100 
(1928), 32—74. 

[23] R. Peyret and T. D. Taylor, Computa- 
tional methods for fluid flow, Springer, 1983. 
[24] B. Carnahan, H. A. Luther, and J. O. 
Wilkes, Applied numerical methods, Wiley, 
1969. 








304 Ref. 
Numerical Solution of PDEs 





305A 
Obstructions 


305 (1X.11) 
Obstructions 


A. History 


The theory of obstructions aims at measuring 
the extensibility of mappings by means of 
algebraic tools. Such classical results as the 
*Brouwer mapping theorem and Hopf's exten- 
sion and tclassification theorems in homotopy 
theory might be regarded as the origins of this 
theory. A systematic study of the theory was 
initiated by S. Eilenberg [1] in connection 
with the notions of thomotopy and tcoho- 
mology groups, which were introduced at the 
same time. A. Komatu and P. Olum [2] ex- 
tended the theory to mappings into spaces not 
necessarily tn-simple. For mappings of poly- 
hedra into certain special spaces, the thomo- 
topy classification problem, closely related to 
the theory of obstructions, was solved in the 
following cases (K" denotes an m-dimensional 
polyhedron); K"*' 5 S" (N. Steenrod [5]), K"*? 
98$" (J. Adem), K"**— Y, where z;(Y)-0 for 
i«nand n<i<n-+k (M. Nakaoka). There 

are similar results by L. S. Pontryagin, M. 
Postnikov, and S. Eilenberg and S. MacLane. 
Except for the special cases already noted, it is 
extremely difficult to discuss higher obstruc- 
tions in general since they involve many com- 
plexities. Nevertheless, it is significant that the 
idea of obstructions has given rise to various 
important notions in modern algebraic topol- 
ogy, including cohomology operations (— 64 
Cohomology Operations) and characteristic 
classes (— 56 Characteristic Classes). 

The notion of obstruction is also very useful 
in the treatment of cross sections of fiber bun- 
dies (— 147 Fiber Bundles), 'diffeomorphisms 
of differentiable manifolds, etc. 


B. General Theory for an n-Simple Space Y 


The question of whether two (continuous) 
mappings of a topological space X into an- 
other space Y are thomotopic to each other 
can be reduced to the extensibility of the given 
mapping: (X x {0})U(X x (11) Y to a map- 
ping of the product X x I of X and the unit 
interval Z — (0, 1] into Y. Therefore the prob- 
lem of classifying mappings can be treated 

in the same way as that of the extension of 
mappings. 

Let K be a *polyhedron, L a subpolyhedron 
of K, and Enz LU K” the union of L and the 
*n-skeleton K" of K. Let Y be an farcwise 
connected n-simple space, and f' be a mapping 
of L into Y. Denote by dn f^) the set of map- 
pings of R" into Y that are extensions of f", 
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and by ©"( f^) the set of thomotopy classes 
of mappings in ®"( f^) relative to L. The set 
d f") consists of a single element because of 
the arcwise connectedness of Y, $!( f") is non- 
empty, and dit f^) (nz 2) may be empty. Let f" 
be an element of ©"( f^). If we consider the 
restriction of f" to the boundary ó"*! of an 
oriented (n+ 1)-cell o"*! of K, then f":ó"*! 
Y determines an element c( f", o"*') of the 
thomotopy group z,(Y) (- 202 Homotopy 
Theory). This element gives a measure of ob- 
struction for extending f” to the interior of 
oer". We obtain an (n+ 1)-fcocycle c"*! ( f") of 
the tsimplicial pair (K, L) with coefficients in 
n, (Y), called the obstruction cocycle of f", by 
assigning c( f", o"*!) to each (n+ 1)-cell o”*'. 
This obstruction cocycle c"*!( f") is the mea- 
sure of obstruction for extending f" to Ei A 
necessary and sufficient condition for the 
extensibility is given by c"'!( f") -- 0. Clearly, 
c"*! ( f") is uniquely determined for each ele- 
ment f of dn f^). The set of all c"*! ( f") with 
f"e®"(f’) forms a subset o"*! ( f") of the group 
of cocycles Z"*!(K, L; r,(Y)). P" +f’) is non- 
empty if and only if o"*! ( f") contains the zero 
element 0. 

Let KH - K x I, LB «(K xO)U(L x DU 
(K x 1). Given two mappings jn, fi: K5 Y 
satisfying fo| L= f,| L, we can define a natural 
mapping F': LU Y such that an element 
F" of ®"(F’) corresponds to a thomotopy 
h" ! relative to L connecting LES 1 with 
f; | K". Given an element F"e”(F'), we 
have the element c"! (F") of Z"' (KC, LU; 
n, (Y)), which we identify with Z"(K, L; x,(Y)) 
through the natural isomorphism of chain 
groups of the pair (KO, LH) to those of the 
pair (K, L). Thus we can regard c"*! (F") as an 
element of Z"(K, L; t, (Y)), which is denoted 
by d"( fo, h"^!, fi), and call it the separation (or 
difference) cocycle. If f| R"! = f, | E"! , we 
have the canonical mapping F": LH U(KH)" 
~ Y, and the separation cocycle is denoted 
simply by d"( fo, fi). The set of separation 
cocycles corresponding to elements of ®"(F’) is 
considered to be a subset of Z"(K, L; x,(Y)) 
and is denoted by o"( fo, f1). A necessary and 
sufficient condition for br ^! to be extensible 
to a homotopy on Er is d"( fo, h"^!, f; ) - 0. 
Therefore a necessary and sufficient condition 
for fo| R" « f, | K" (rel L) (ie., relative to L) is 
0€ o"( fo, fi). Given fë, fT: K" > Y with fèl L= 
ff L, then d”( fë, h"™!, fP) (eo (d. ft) is 
an element of Z"(K", L; 1,(Y)), which is also 
considered to be a cochain of the pair (K, L). 
In this sense, we call d"( fg, h" !, fT) the sepa- 
ration (or deformation) cochain over (K, L). 
The coboundary of the separation cochain 
d"( fë, h"™!, f?) coincides (except possibly for 
sign) with c" (£z) — e"*! (fp). 

For a fixed en f^), any n-cochain d" 


1151 


of the pair (K, L) with coefficients in z,(Y) 

is expressible as a separation cochain d" — 
a"( fo, f?) where (en f) is a suitable map- 
ping such that f?| R"! = f?| R"! (existence 
theorem). 

Therefore if we take an element f"! of 
©" ^ !( f") whose obstruction cocycle c"( f" !) 
is zero, the set of all obstruction cocycles 
c" (f?) of all such f"e4"( f") that are exten- 
sions of f"! forms a subset of o"*! ( f") and 
coincides with a coset of Z"*'(K, L; ,(Y)) 
factored by B"*! (K, L; ,(Y)). Thus a coho- 
mology class c"*! ( f" ))e H"*!(K, L; 2,(Y)) 
corresponds to an f" ! ed"^!(f") such that 
c"(f"71) 20, and c"*! ( f" D is a necessary 
and sufficient condition for f"! to be exten- 
sible to K"*! (first extension theorem). 

For the separation cocycle, d"( fo, h" ?, fie 
H"(K, L;x,(Y)) corresponds to each homotopy 
h"^? on R"? such that d""1(f,,h" 2, f,)=0, 
and d”( fo, h"^?, f,)=0 is a necessary and 
. sufficient condition for h" ? to be extensible to 
a homotopy on E" (first homotopy theorem). 

The subset of H"*! (K, L;z,(Y)) correspond- 
ing to o"*!(f’) is denoted by O"*!( f^) and is 
called the obstruction to an (n + 1)-dimensional 
extension of f". Similarly, the subset O"( fo, f) 
of H"(K, L; 1,(Y)) corresponding to o"( fo, fi) is 
called the obstruction to an n-dimensional 
homotopy connecting f, with f}. Clearly, a 
condition for f’ to be extensible to EY"! is 
given by 0€ O"*!( f^, and a necessary and 
sufficient condition for f, |K” = f, | K” (rel L) is 
given by 0€ O"( f,, fa). 

A continuous mapping q:(K', L')(K, L) 
induces homomorphisms of cohomology 
groups oi H*" (K, L; x, (Y) H^" (K',L'; 

n, (Y), H'(K, L; z,(Y))  H'(K', L'; r,(Y)). Then 
for f':L— Y, O"*!(f'og)29*O""! (f^, and 
for fo, f, : K 5 Y such that f;| L— fil L, Ofo 
P, f, o 9) 2 0*O"( fo, fı). Therefore we also find 
that the obstruction to an extension and the 
obstruction to a homotopy are independent 

of the choice of subdivisions of K, L, and con- 
sequently are topological invariants. 

Let fo, fi, and f; be mappings K — Y such 
that f| L— f, | L= f;| L. Given homotopies 
hb: fo| R" = f, | R7 (rel L), hig": f lR"! = 
f;| K^! (rel L), then for the composite h33! = 
h35 oh, we have 


d” fo, hor , fi) - d" (f, hia" Sa) 
— d"( fo, ho; sha) 


and for the inverse homotopy hig! : fi |R" ~ 
fol R"! of 55! , clearly 


d”( fis hios fo) = —d"( fo, hoi’, fi) 


Therefore O"( fo, Jo) forms a subgroup of 
H"(K, L; n,(Y)) that is determined by the 
homotopy class of GIE) relative to L. In 
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general, if O"( fo, f1) is nonempty, it is a coset 
of H"(K, L; n,(Y)) factored by the subgroup 
O"( fo» fo). Combined with the existence theo- 
rem on separation cochains, this can be uti- 
lized to show the following theorem. 

Assume that On f^) is nonempty. The set 
of all elements ®"( f^ that are extensions of 
an element of $" ^! ( f?) is put in one-to-one 
correspondence with the quotient group of 
H"(K", L; 1, (Y)) modulo O"( f7, D) by pairing 
the obstruction O"( fj, f") with each f" for a 
fixed fo. Among such elements of ©"( f^ the 
set of f" that are extensible to "+! is in one- 
to-one correspondence with the quotient 
group of H"(&"**, L; z,(Y)) 2 H'(K, L; n (Y )) 
modulo the subgroup O"( f^, f?*!), assuming 
that fo is extended to f; ^! (first classification 
theorem). 


C. Primary Obstructions 


Assume that H'*! (K, L;7,(Y))= H(K,L;n;(Y)) 
— 0, where 0 « i « p (e.g, z(Y) 20, 0 «i « p). In 
this case, by consecutive use of the first exten- 
sion theorem and the first homotopy theorem, 
we can show that each ®'( f") (i< p) consists of 
a single element and O?*!( f^) also consists of a 
single element c?*! (f) e H?*! (K, L; x, (Y)). The 
element c?*!( f^ called the primary obstruc- 
tion of f’, vanishes if and only if f' can be 
extended to K?*! (second extension theorem). 
When H'" (K, L; t (Y)) «0 for i» p (for exam- 
ple, when z;(Y)=0 for p «i«dim(K — L)), 

f' is extendable to K if and only if the first ob- 
struction of f' vanishes (third extension 
theorem). 

Correspondingly, if H'(K, L;2,(Y))= 
H? (K, L:1,(Y))=0 (0<i<p), then for any 
two mappings fo, fi: K Y, fol L=f,lL, 

O"( fo, f1) consists of a single element d "( f,, fi) 
€ H'(K, L; n,(Y)), which we call the primary 
difference of f, and f,. This element vanishes 
if and only if fo | R? = f, | K? (rel L) (second 
homotopy theorem). Moreover, when H'(K, L; 
n;(Y)) 2 0 (i p), the primary difference is zero 
if and only if fọ ~ f, (rel L) (third homotopy 
theorem). 

Assume that the hypotheses of the second 
extension theorem and second homotopy 
theorem are satisfied. If we assign to each 
element f” of ^(f") the primary difference of 
f? and the fixed element LP. then pn f^) is in 
one-to-one correspondence with H?(K?, L; 
7,(Y)) by the first classification theorem (sec- 
ond classification theorem). Similarly, assume 
that the hypotheses of the third extension 
theorem and third homotopy theorem are 
satisfied. If fp: K 2 Y, f= fọ] L, then homotopy 
classes relative to L of extensions f of f’ are 
put in one-to-one correspondence with the 
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elements of H?(K, L; n,(Y)) by pairing d"( f, fo) 
with f (third classification theorem). 


D. Secondary Obstructions 


For simplicity, assume that z;(Y)—O0 (i< p and 
p«i«q). If the primary obstruction c?*!(f)e 
H?*(K, L;n,(Y)) of f': LY vanishes, we can 
define O**! ( f^ c HK, Lin, (Y)), which we 
call the secondary obstruction of f’. When Y= 
SP, q=p+ 1, p» 2, the secondary obstruction 
O?**( f^) coincides with a coset of Hr ZK L; 
Z) modulo the subgroup Sq?(H"(K, L; Z)), 
where Sq’ denotes the *Steenrod square 
operation [5]. In this case, if L= K?, then 
O?*?(f") reduces to a cohomology class, 
Sq? (i*) ^ f"*(o) with i: L2 K, where c is a gen- 
erator of H(S?, Z) (in this case (i*) !f'*(a) # 
@ is equivalent to c?*!( f") - 0) [5]. Moreover, 
if Sq?f'* (o) «0, then there exists a suitable 
extension f/?*?: &?*? Y =S” of f’. The set of 
obstruction cocycles of all such f?*? defines 
the tertiary obstruction O?*?( f^), which coin- 
cides with a coset of H?**(K, L; Z,) modulo the 
subgroup Sq?(H?*'(K, L; Z,)). By using the 
tsecondary cohomology operation € of J. 
Adem, it can be expressed as d((i*) ! f'*(a)) 
(— 64 Cohomology Operations). 

All the propositions in this article remain 
true if we take (CW complexes instead of 
polyhedra K. 
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Operational Calculus 


A. General Remarks 


The term “operational calculus” in the usual 
sense means a method for solving tlinear dif- 
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ferential equations by reducing the operations 
of differentiation and integration into alge- 
braic ones in a symbolic manner. The idea 
was initiated by P. S. Laplace in his Théorie 
analytique des probabilités (1812), but the 
method has acquired popularity since O. 
Heaviside used it systematically in the late 
19th century to solve electric-circuit problems. 
The method is therefore also callec. Heaviside 
calculus, but Heaviside gave only a formal 
method of calculus without bothering with 
rigorous arguments. The mathematical foun- 
dations were given in later years, first in terms 
of ‘Laplace transforms, then by apolying the 
theory of *distributions. One of the motiva- 
tions behind L. Schwartz's creation of this 
latter theory in the 1940s was to give a sound 
foundation for the formal method, but the 
theory obtained has had a much larger range 
of applications. Schwartz's theory was based 
on the newly developed theory of *topological 
linear spaces. On the other hand, J. Mikusin- 
ski gave another foundation, based only on 
elementary algebraic notions and on Titch- 
marsh's theorem, whose proof has recently 
been much simplified. 

In this article, we first explain the simple 
theory established by Mikusinski [2] and later 
discuss its relation to the classical Laplace 
transform method. 


B. The Operational Calculus of Mikusinski 


The set € of all continuous complex-valued 
functions a= (a(t)] defined on t>0 is a "linear 
space with the usual addition and scalar multi- 
plication. @ is a tcommutative algebra with 
multiplication a: b defined by the *convolution 
(Lait — s)b(s)ds]. The ring € has no tzero 
divisors (Titchmarsh's theorem). (There have 
been several interesting proofs of Titchmarsh's 
theorem since the first demonstration given 

by Titchmarsh himself [3]. For example, a 
simple proof has been published by C. Ryll- 
Nardzewski (1952).) Hence we can construct 
the *quotient field 2 of the ring €. An element 
of 2 is called a Mikusifiski operator, or simply 
an operator. If we define a(t)=0 for t <0 for 
the elements {a(t)} in €, then € is a subalgebra 
of 4/, which is the set of all locally integrable 
(locally L,) functions in (—00, o0) whose tsup- 
port is bounded below. Here we identify two 
functions that coincide almost everywhere. 
The algebra % has no zero divisor, and its 
quotient field is also 2. 

The unity element for multiplication tn 2, 
denoted by ô= b/b (b #0), plays the role of the 
*Dirac ó-function. It is sometimes called the 
impulse function. The operator /= {1} €@ is the 
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function that takes the values 0 and 1 accord- 
ing as t «O0 or t>0. This operator is Heavi- 
side's function and is sometimes called the 

unit function. Usually it is denoted by Lu 

or simply 1. The value 1(0) may be arbitrary, 
but usually it is defined as 1/2, the mean of the 
limit values from both sides. The operator l is 
an integral operator, because, as an operator 
carrying a into l-a, it yields 


t 
l-a= d wash =the integral of a over [0, t]. 
0 
More generally, the operator (t^ !/T(4)] (Bei 
7 0) gives the Ath-order integral. The operator 
$—Ó/l, which is the inverse operator of I, is a 
differential operator. If ae@ is of tclass CT. 
then we have 


s‘a=a'+a(+0)d=a' + {a(+0)}/{1}. (1) 
Similarly, if ae is of class C", we have 
s": a—a'? c a^ (0) rain ?(0)s 

T... a(0)s" 71, (2) 


The operator a5: a can be applied to func- 
tions a that are not differentiable in the ordi- 
nary sense, and considering the application of 
s to be the operation of differentiation, we can 
treat the differential operator algebraically in 
the field 2. In particular, we have s: 1— 6, and 
this relation is frequently represented by the 
formula 


d1(t)/dt = à(t). (3) 


A rational function of s whose numerator is 
of lower degree than its denominator is an 
telementary function of t. For example, we 
have the relations 


1/(s — a)" — (t" ! e**/(n — 1)!), 
Lite" + B?) - (B sin Bt}, 
s/(s? + B?) — (cos fit]. (4) 


The solution of an ordinary linear differ- 
ential equation with constant coefficients 
$7.90, o (t) = f(t) under the tinitial condition 
Q9 (0) 2 y; (0 € i x n— 1) is thus reduced to an 
equation in s by using formulas (1) and (2), and 
is computed by decomposing the following 
operator into partial fractions: 

n—i 
(o() - P Pot (5) 
XS TF. HAS HAO 

where fj, — 0, .1 Yo - 04271 + E AnYn-r-1> 
Oxrzn-— 1. The general solution is repre- 
sented by (5) if we consider the constants yo, 
<...> Yn-1 OF Da, f,-, as arbitrary parameters. 
If the rational function in the right-hand side 
of (5) is M(s)/D(s) and the degree of the numer- 
ator is less than that of the denominator, then 
the right-hand side of (5) is explicitly repre- 
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sented by 

Ed. qr d” MO) id 

rzoll—r—1)!r!| dA ND(/(A— 2o) i 
c M(A;) At 

tD S 


where we assume that 45, 4, ..., Åm exhaust the 
roots of the equation D(A)=0, A, is a multiple 
root of degree L and all other roots are simple 
(m n— I). The above formula is called the 
expansion theorem. 


C. Limits of Operators 


A sequence a, of operators is said to converge 
to the limit a= b/q if there exists an operator 
q( #0) such that a: a,€ € and the sequence 
of functions q'a, converges *uniformly to b 
on compact sets. The limit a is determined 
uniquely without depending on the operator q. 
Based on this notion of limits of operators, we 
can construct the theory of series of operators 
and differential and integral calculus of func- 
tions of an independent variable A whose 
values are operators. They are completely 
parallel to the usual theories of elementary 
calculus (— 106 Differential Calculus; 216 
Integral Calculus; 379 Series). A linear partial 
differential equation in the function ¢(x, t) of 
two variables, in particular its initial value 
problem, reduces to a linear ordinary differen- 
tial equation of an operational-valued func- 
tion of an independent variable x. 

For a given operator w, the solution (if 
it exists) of the differential equation (4) — 
w: Q(A) with the initial condition @(0)=6 is 
unique, is called the exponential function of an 
operator w, and is denoted by q(4) — e^". If the 
power series 


8 


A" w"/n! (6) 
0 


Il 


n 


converges, the limit 1s identical to the exponen- 
tial function e^". However, there are several 
cases in which e^" exists even when the series 
(6) of operators does not converge. 

For example, for w= — Js. , we have 


es = {(1/2,/nt)exp(—1?/41)}, O 
and for w= —s, we have 
e ^-—h^-s- H(t), (8) 


where the function H,(t) takes the values 0 and 
1 according as t « 4 or t» 4. H;(t) belongs to 
the ring X and is called the jump function at 4. 
For f(t)e 4, we have 


h^ (foy Uf - 2). 


and hence we call (8) the translation operator 
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(or shift operator). For w= —s, the series (6) 
does not converge, but if we apply the formal 
relation e ^ =E% 9(—As)"/n! to f(t), we have a 
formal Taylor expansion 


ft—a= Tu — AY mt. 


The solution of linear tdifference equations 
are represented by rational functions of bi. 
The power series 3 a, h" of operators always 
converges. This fact gives an explicit example 
of a representation by formal power series. 
Note that the operators e~% and e "Zu" play 
an essential role in the solution of the *wave 
equation 


09 a0 
dx? Ot? 





and the theat equation 








0o ade 
Ox? D" 


The operator (7) converges to ô for 10, and 
this gives a tregularization of the Dirac 6- 
function. 


D. Laplace Transform 


For every function { f(t)} €@, the limit 


B oo 
lim | nai | e*f(Aydd (9) 
Ere Jo 0 
always exists (in the sense of the limit of oper- 
ators), and as an operator coincides with the 
original function { f(t)}. Therefore, if the usual 
Laplace transform (— 240 Laplace Trans- 
form) of the function f(A) exists and (9) is a 
function g(s), then as a function of the differen- 
tial operator s, g(s) is the operator that is 
given by the inverse Laplace transform f(t) of 
g(s). Formulas (4) and (7) are indeed typical 
examples of this relation, where the left-hand 
side is the usual Laplace transform of the 
right-hand side. In the practical computation 
of (5), we can compute the Laplace inverse 
transform of the right-hand side. However, if 
we took the Laplace transform as the founda- 
tion of the theory, it would not only be com- 
plicated but also be subject to the artificial 
restriction caused by the convergence condi- 
tion on Laplace transforms. 

In the theory of operational calculus, the 
transform 


HEN e "f(t)dt (10) 
0 

is sometimes used instead of the Laplace trans- 
form itself. But the difference is not essen- 

tial; we obtain the latter transform merely by 
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changing the variable from s to p and multi- 
plying the former transform by p. 


E. Relation to Distributions 


For Tee, an operator of the form h^: s*- f is 
identified with a distribution of L. Schwartz 
with support bounded from below. We can 
identify with a Schwartz distribution the limit 
of a sequence f, (or a suitable equivalence class 
of sequences) of operators of the form h^: s*- f 
such that f,, f,41,--- are identical in the inter- 
val (—n,n). The notions of Schwartz distri- 
butions and of Mikusinski operators do not 
include each other, but both are generaliza- 
tions of the notions of functions and their 
derivatives. For formulas and examples — 
Appendix A, Table 12. 
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Operations Research 


A. General Remarks 


Operations research in the most general sense 
can be characterized as the application of 
scientific methods, techniques, and tools to the 
operations of systems so as to provide those in 
control with optimum solutions to problems. 
This definition is due to Churchman, Ackoff, 
and Arnoff [1]. Operational research began 

in a military context in the United Kingdom 
during World War II, and it was quickly taken 
up under the name operations research (OR) 
in the United States. After the war it evolved 
in connection with industrial organization, 
and its many techniques have found expand- 
ing areas of application in the United States, 
the United Kingdom, and in other industrial 
countries. Nowadays OR is used widely in 
industry for solving practical problems, such 
as planning, scheduling, inventory, transpor- 
tation, and marketing. It also has various im- 
portant applications in the fields of agricul- 
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ture, commerce, economics, education, public 
service, etc., and some techniques developed in 
OR have influenced other fields of science and 
technology. 


B. Applications 


Applications of OR to practical problems are 
often carried out by project teams because 
knowledge of disparate aspects of the prob- 
lems are required, and interdisciplinary co- 
operation is indispensable. The following are 
the major phases of an OR project: (i) formu- 
lating the problem, (ii) constructing a mathe- 
matical model to represent the system under 
study, (iii) deriving a solution from the model, 
(iv) testing the model and the solution derived 
from it, (v) establishing controls over the solu- 
tion and putting it to work (implementation). 

When the mathematical model that has 
been constructed in phase (ii) is complicated 
and/or the amount of data to be handled is 
large, a digital tcomputer is often utilized in 
phases (iii) and (iv). 


C. Mathematical Models [2] 


Typical mathematical models and tools that 
appear frequently in OR are: 

(1) Optimization model (— 264 Mathematical 
Programming). This model is characterized by 
one or more real-valued functions, which are 
called objective functions, to be minimized (or 
maximized) under some constraints. According 
to the number of objective functions, the types 
of objective functions, and the types of con- 
straints, this model is classified roughly as 
follows: (i) Single-objective model, which in- 
cludes linear, quadratic, convex, nonlinear and 
integer programming models (— 215 Integer 
Programming, 255 Linear Programming, 292 
Nonlinear Programming, 349 Quadratic Pro- 
gramming); (ii) multi-objective model; (tii) sto- 
chastic programming model (— 408 Stochastic 
Programming); (iv) dynamic programming 
model (— 127 Dynamic Programmingy (v) 
network flow model (— 281 Network Flow 
Problems). 

(2) Game-theoretic model (— 173 Game 
Theory). Game theory is a powerful tool for 
deriving a solution to practical problems in 
which more than one person is involved, with 
each player having different objectives. 

(3) Inventory model (— 227 Inventory Con- 
trol). It is necessary for most firms to control 
stocks of resources, products, etc., in order to 
carry out their activities smoothly; various 
inventory models have been developed for 
such problems. Mathematically, optimization 
techniques (— 264 Mathematical Program- 
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ming), Markovian decision processes (— 127 
Dynamic Programming, 261 Markov Pro- 
cesses), and basic tprobability theory are used 
to construct models for these problems. 

(4) Queuing model (— 260 Markov Chains 
H). In a telephone system, calls made when all 
the lines of the system are busy are lost. The 
problem of computing the probability of loss 
involved was first solved in the pioneering 
article on tqueuing theory by A. K. Erlang in 
1917. For systems in which calls can be put on 
hold when all lines are busy, one deals with the 
twaiting time distribution instead of the proba- 
bility of loss. In the 1930s, F. Pollaczeck and 
A. Ya. Khinchin gave explicit formulas for the 
characteristic function of the waiting time 
distribution. In many situations, such as in 
telephone systems, air and surface traffic, 
production lines, and computer systems, vari- 
ous congestion phenomena are often observed, 
and many kinds of queuing models are utilized 
to analyze the congestion. Mathematically, 
almost all such models are formulated by 
using Markov processes. For practical uses, 
approximation and computational methods 
are important as well as theoretical results. 

(5) Scheduling model (— 376 Scheduling and 
Production Planning). Network scheduling is 
used to schedule complicated projects (for 
example, construction of buildings) that 
consist of a large number of jobs related to 
each other in some natural order. PERT (pro- 
gram evaluation and review technique) and 
CPM (critical path method) are popular com- 
putational methods for this model (— 281 
Network Flow Problems). Job shop scheduling 
is used when we have m jobs and n machines 
and each job requires some of the machines in 
a given order. The model allows us to find an 
optimal order (in some certain sense) of jobs to 
be implemented on each machine. 

(6) Replacement model. There are two typical 
cases. One is the preventive maintenance model, 
which is suitable when replacements are done 
under a routine policy because a replacement 
or a repair before a failure is more effective 
than after a failure. Probabilistic treatments 
are mainly used, and this model resembles 
those for queues and tMarkov processes. The 
other is a model for deciding whether to re- 
place a piece of equipment in use. In this case, 
we need to compare costs of both used and 
new equipment, and evaluations of various 
types of present cost are important. 

(7) ‘Simulation. This is a numerical experi- 
ment in a simulated model of a phenomenon 
which we want to analyze. Simulation is one of 
the most popular techniques in OR. 

(8) Other models. Besides the models listed 
above, many problems are formulated by way 
of various other models in OR. In modeling, 
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*probability theory, and mathematical tstatis- 
tics, especially, Markov chain, *multivariate 
analysis, !design of experiments, tregression 
analysis, ttime series analysis, etc. often play 
important roles. 
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Operator Algebras 


A. Preliminaries 


Let $ be a *Hilbert space. The set of *bounded 
linear operators on $ is denoted by Z($)— 2. 
It contains the identity operator J. The notions 
of operator sum A + B, operator product AB, 
and *adjoint A* are defined on it. A subalge- 
bra of Z($) is called an operator algebra. In 
this article we consider mainly von Neumann 
algebras. For C*-algebras — 36 Banach Alge- 
bras G- K. 

Any *Hermitian operator A (i.e., an operator 
such that A = A*) has the property that (Ax, x) 
is always real for any xe 9. If (Ax, x) 20 for 
any x, A is called positive, and we write A0. 
When Hermitian operators A and B satisfy A 
— B20, we write A> B. Thus we introduce an 
ordering A > B between Hermitian operators. 
A set {A,} of positive Hermitian operators is 
said to be an increasing directed set if any two 
of them A,, A, always have a common major- 
ant A,, that is, 4, < 4, and Aș < A,. If a Her- 
mitian operator A satisfies (Ax, x) — sup(A,x, x) 
for such a set, it is called the supremum and is 
denoted by sup A,. The supremum sup A, 
exists if and only if the sup(A, x, x) is finite for 
any x, and then A, converges to A with respect 
to the weak and strong operator topologies. 


B. Topologies in 


Various topologies are introduced in #= 
BH): the *uniform operator topology, the 
*strong operator topology, and the tweak 
operator topology (— 251 Linear Operators). 
These topologies are listed above in order of 
decreasing fineness. The operation in # of 
taking the adjoint, A— A*, is continuous with 
respect to the uniform operator topology and 
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weak operator topology, but not with respect 
to the strong operator topology. The opera- 
tion from Z x Z to B of taking the product, 
(A, B) AB, is continuous with respect to the 
uniform operator topology, is continuous with 
respect to the strong operator topology when 
the first factor is restricted to a set bounded in 
the operator norm, but is not continuous on 
Bx B. It is continuous with respect to the 
weak operator topology when one of the fac- 
tors is fixed (i.e., it is separately continuous). 
The set 2 is a Banach space with respect to 
the operator norm, or, more precisely, a *C*- 
algebra. It is a ‘locally convex topological 
linear space with respect to the strong or weak 
operator topology. 

The Banach space 2 is the *dual of the 
Banach space 27, of all *nuclear operators in 
H (— 68 Compact and Nuclear Operators I). 
The weak* topology in 2 as the dual of 2, is 
called the c-weak topology. 


C. Von Neumann Algebras 


A subset .// of 4 is called a *-subalgebra if it 
is a subalgebra (i.e., A, Be M implies ZA + uB, 
AB € Af) and contains the adjoint A* of any 
Ae A The commutant .—' of a subset .o of B 
is the set of operators that commute with both 
A and A* for Ae.s7. The commutant is a *- 
subalgebra, and o —.«". 

A von Neumann algebra .@ is a «-subalgebra 
of 4 that is defined by one of the follow- 
ing four equivalent conditions: (1) Æ is a *- 
subalgebra of # containing I, closed under 
the weak operator topology; (ii) JJ is a *- 
subalgebra of # containing I, closed under 
the strong operator topology; (iii) .@ is the 
commutant of a subset of Z (or, equivalently, 
M = M"), (iv) M is a *-subalgebra of # con- 
taining J, closed under the uniform operator 
topology, and, as a Banach space, coinciding 
with the conjugate space of some Banach 
space. Note that a »-subalgebra of 2, closed 
under the uniform operator topology, is a 
C*-algebra. Von Neumann algebras are also 
called rings of operators or W*-algebras. The 
latter term is usually used for a C*-algebra 
*-isomorphic to a von Neumann algebra in 
contrast to a concrete von Neumann algebra. 

The study of these algebras was started by J. 
von Neumann in 1929. He showed the equiva- 
lence of conditions (i)-(ii1) (von Neumann's 
density theorem), and established a foundation 
for the theory named after him [1]. The notion 
of von Neumann algebras can be regarded as 
a natural extension of the notion of matrix 
algebras in a finite-dimensional space, and 
therein lies the importance of the theory. The 
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fourth condition of the definition was given by 
S. Sakai (Pacific J. Math., 1956). 

The following theorem is of use in the the- 
ory of von Neumann algebras: Given a *- 
subalgebra . of B containing I, its closure æ 
with respect to the weak or strong operator 
topology is von Neumann algebra; and when 
we denote the set of elements of operator norm 
<1 in £ (resp. M) by A, (resp. M), M is 
likewise the closure of »/, with respect to the 
weak or strong operator topology (Kaplan- 
sky's density theorem (Ann. Math., 1951). 

If E is a projection operator in a von Neu- 
mann algebra M, then EWE={EAE|Ae.4} 
is a x-subalgebra of 4 closed with respect to 
the weak operator topology. It is not a von 
Neumann algebra because it does not contain 
I, but since its elements operate exclusively in 
the closed subspace EH, we can regard it as 
an algebra of operators on ES. In this sense, 
EME can be regarded as a von Neumann 
algebra on E$, which we call the reduced von 
Neumann algebra of Æ on ES and write Mg. 
If E is a projection operator in M’, EME = 
EM restricted to the subspace E$ is called 
the induced von Neumann algebra of A on 
ES and is denoted also by Mg. In the latter 
case, the mapping AE M 2 EAce f, is a +- 
homomorphism and is called the induction of 
M onto JA. 

The tensor product H, © H, of two Hilbert 
spaces §; (i= 1,2) is the completion of their 
*tensor product as a complex linear space 
equipped with the unique tinner product satis- 
fying (fı & f2.91 @ 92) 2(f1,911(/2. 92); for 
all fj, g;j€ $;. For any A,€ A(H;), there exists 
a unique operator in A(H, © H,) denoted 
by A, @ A, satisfying (A; & A2)(f, & f;) — 

A, f, & A; f, for all fe $;. For von Neumann 
algebras M, c &(H,), the von Neumann alge- 
bra generated by A, & A, with A;E€ M, is de- 
noted by Mı ® Mh, and is called the tensor 
product of æ, and æ. The *-isomorphism Ae 
At — A G) €. G) L, where I is the trivial von 
Neumann algebra consisting solely of complex 
multiples of the identity operator, is called an 
amplification. 

For two von Neumann algebras M, c A(H,) 
(i— 1,2), a *-isomorphism z from M, into M, 
is called spatial if there exists a unitary De, a 
bijective isometric linear) mapping U from $; 
to H, such that UAU* — z4A for all Ae M, 
and a «-homomorphism z is called normal if 
sup, tA, — n(sup, A,) whenever A, is a bounded 
increasing net in Æ. Any normal »-homomor- 
phism is continuous in the strong and weak 
operator topologies and is a product of an 
amplification, a spatial *-isomorphism, and an 
induction. Its kernel is of the form EM, where 
E 1s a projection operator belonging to the 
center Z = 41M’ of M. This gives a com- 
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plete description of all possible normal repre- 
sentations (e. a normal »-homomorphism 
into some 4) of a von Neumann algebra. 


D. States, Weights, and Traces 


A state o of a C*-algebra «X isa complex- 
valued function on . that is (1) complex 
linear: o(A + B)= 9(A) - ọ(B), e(cA) - co(A) 
for A, Be «f, ce C, (2) positive: 9(A* A) 20 for 
Aes, and (3) normalized: Tel 21 (equiva- 
lent to o(I)— 1 if Ies). For any positive 
linear functional o on £, there exists a triplet 
(Dy. To» Čo) (unique up to the unitary equiva- 
lence) of a Hilbert space §,, a representa- 
tion 7, (i.e., a X-homomorphism into A(H,)) 
of £, and a vector č, in §, such that g(A)= 
(1,(4)6,, Čo) and $, is the closure of 2,(.7)€,. 
The space §, is constructed by defining the 
inner product (4(A), 4(B)) = o BS A) in the 
quotient of æ by its left ideal {A| @(A*A)= 
0), where y is the quotient mapping, and by 
completion. Then z, is defined by z,(A)n(B)= 
n(AB). This is called the GNS construction 
after its originators I. M. Gel’fand, M. A. 
Naimark, and I. E. Siegel. 

A weight o on a von Neumann algebra æM is 
a function defined on the positive elements of 
Aff, with positive real or infinite values, which 
is additive and homogeneous (g(A + B)= 
(A) + ọ(B) and Q(4A) 2 A9(A) for all A, Be 
M and A>0 with the convention 0: oo =0 and 
œ -- a — oo). It is said to be faithful if it does 
not vanish except for ¢(0)=0, normal if @(A) 
=sup q(A,) whenever A, is an increasing net 
of positive elements of .@ and A —sup A,, and 
semifinite if the left ideal N,, consisting of all 
elements Ae M for which @(A* A) is finite, has 
the property that the linear span M, of NEN, 
is dense in Æ. The restriction of ọ to positive 
elements of M, has a unique extension to a 
linear functional on W,, which we denote by 
the same letter o. Canonically associated with 
a normal semifinite weight o, there exists a 
Hilbert space $,, a normal »-representation T, 
of M, and a complex linear mapping y from 
N, into a dense subset of H, such that (4(B’), 
n(B))= ¢(B* B), and z,(A)n(B)=n(AB), where 
B, BeN, and Ace M. If o is finite (i.e., 9t, = 
M), then its extension to M is a positive linear 
functional for which the triplet ($,, 2. (1) 
is given by the GNS construction. 

The linear span of all normal states of a von 
Neumann algebra æ is a norm-closed sub- 
space of its dual A5. called its predual and 
denoted by My, because M turns out to be the 
dual of My. 

A trace t on a von Neumann algebra M is a 
weight satisfying t((UAU*) — t(A) for U unitary 
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in Æ and for all positive A in Æ (equivalently, 
t(A* A) - t(AA*) for all AE M). 


E. The von Neumann Classification 


A von Neumann algebra for which a semi- 
finite normal trace does not exist is called a 
purely infinite von Neumann algebra or von 
Neumann algebra of type HI. In contrast to 
this, a von Neumann algebra M is called 
semifinite (resp. finite) if for each positive Her- 
mitian operator A (40) in .@ there is a semi- 
finite (resp. finite) normal trace t such that 
t(A) 0. Every Abelian von Neumann algebra 
is finite. If there are no central projection oper- 
ators E #0 such that Mg is finite, Æ is called 
properly infinite. Purely infinite Æ and Z(H) 
for infinite $, for example, are properly in- 
finite. A nonzero projection operator E in æ is 
called Abelian when p is Abelian. We call æ 
a von Neumann algebra of type I (or discrete) 
when it contains an Abelian projection E for 
which J is the only central projection P cover- 
ing E (Le., Ex P). A von Neumann algebra is 
of type II if it is semifinite and contains no 
Abelian projection. À von Neumann algebra 
of type II is called of type II, if it is finite and 
of type II,, if it is properly infinite. A finite von 
Neumann algebra is also characterized as a 
von Neumann algebra in which the operation 
of taking the adjoint is continuous with re- 
spect to the strong operator topology on 
bounded spheres (Sakai, Proc. Japan Acad., 
1957). A properly infinite von Neumann alge- 
bra M is characterized by the property that 
M and M © &() for any separable $ is *- 
isomorphic. 

Given a von Neumann algebra .Z, there 
exist mutually orthogonal projections E;, Ey, 
Ey in the center Z of æ such that Ej 4- Ej 4- 
Ey — I, and Mgp Mep MEn are von Neumann 
algebras of type I, type II, type III, respec- 
tively. This decomposition is unique. There 
also exist unique central projections E, and 
E; such that My, is finite, Æg, is properly in- 
finite, and E,-- E;— 1. The two decompositions 
can be combined. (If some of the projections E 
are 0, the condition on the corresponding Me 
is to be waived.) 


F. Factors 


A von Neumann algebra whose center consists 
exclusively of scalar multiples of the identity 
operator is called a factor. Von Neumann's 
reduction theory (— Section G) reduces the 
study of arbitrary von Neumann algebras on 
a separable Hilbert space more or less to the 
study of factors. Factors are classified into 
types I, (n= oo, 1,2, ...), IL, (i.e., of type II and 
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finite), H. De, of type II and not finite), and 
III. A factor of type I, is isomorphic to the 
algebra A(H) of all bounded operators on an 
n-dimensional Hilbert space §. Since the dis- 
covery of two nonisomorphic examples of 
factors of type II, by F. J. Murray and von 
Neumann (1943), classification of factors has 
been a central problem in the theory of von 
Neumann algebras. After 1967, great progress 
was made in the investigation of isomorphism 
classes of factors, and we have uncountably 
many nonisomorphic examples of factors of 
types II,, II,,, and III. After the discovery of 
the third to ninth nonisomorphic examples of 
factors of type II, by J. Schwartz (1963) (the 
third example, 1963), W. Ching (the fourth), 
Sakai (the fifth), J. Dixmier and E. C. Lance 
(the sixth and seventh), and G. Zeller-Meier 
(the eighth and ninth), D. McDuff showed that 
there exist countably many nonisomorphic 
examples of factors of type II,, and finally 
McDuff (Ann. Math., 1969) and Sakai (J. Func- 
tional Anal., 1970) showed the existence of 
uncountably many nonisomorphic examples 
of factors of type IL, For type III factors — 
Section I. 


G. The Integral Direct Sum and 
Decomposition into Factors 


The Hilbert spaces considered in this section 
are all tseparable. Let (W, &, u) be a *measure 
space; with each eM we associate a Hilbert 
space §(¢). We consider functions vU) on W 
whose values are in $(C) for each C. Let K be 
a set of these functions having the following 
properties: (i) ||x({)|| is measurable for x(C)e K; 
(ii) if for a function y(¢), the numerical func- 
tion (x), y(C)) is measurable any x(C)e K, 
then y(C) € K; (iii) there is a countable family 
{x (4), x2(0), ... of functions in K such that 
for each fixed eW, the set {x,(0), x4 (C)... } is 
dense in $(£). Then K is a linear space. We call 
each function in K a measurable vector func- 
tion. We introduce in the set of measurable 
vector functions x({¢) with 


| Olano < oo 


an equivalence relation by defining x(¢) and 
y(C) as equivalent when 


| Ix) yO? du) — 0. 
Thus we obtain a space of equivalence classes 


which we denote by $. $ is a Hilbert space 
with the inner product 


(x, y)= EEE 
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which is called the integral direct sum (or direct 
integral) of $(C). An operator function A) 
that associates with each Ce Mt a bounded 
linear operator A(C) on $Y(C) is called measur- 
able if for any measurable x(¢), A(D)x(C) is also 
measurable. If, moreover, || A(C)|| is bounded, 
A(C) transforms a function in § to a function 
in $ and thus defines a bounded linear oper- 
ator on §. This operator is called the integral 
direct sum (or direct integral) of A(¢), and an 
operator on § that can be reduced to this 
form is called decomposable. 

Generally, let $ be a Hilbert space, and 
consider an Abelian von Neumann alge- 
bra o on $. Then we construct a measure 
space (Mt, €, ul and represent .»/ as the set of 
bounded measurable functions on IW. (This is 
possible in different ways. The Gel'fand repre- 
sentation is an example.) Then a Hilbert space 
$Y(C) can be constructed so that § is repre- 
sented as the integral direct sum of §(¢). Oper- 
ators in & are all decomposable and are called 
diagonalizable. A von Neumann algebra æ on 
$ whose elements are all decomposable is 
characterized by ca The A(Q) yielded 
by the decomposition of operators A in M 
generate a von Neumann algebra .ZJ(C) on 
S (C). If we take as of the center Z of M, then 
almost all the (Ç) are factors (von Neumann's 
reduction theory), and if we take as a a maxi- 
mal Abelian von Neumann algebra contained 
in æ’, then almost all the .4() are type I 
factors (F. I. Mautner, Ann. Math., 1950). 


H. Tomita-Takesaki Theory 


Motivated by the problem of proving the 
commutant theorem for tensor products (ie., 
(4, Q MY = M, Q M), which remained 
unsolved for algebras of type III up until that 
time, Tomita succeeded in 1967, after years of 
effort, in generalizing the theory of Hilbert 
algebras, previously developed'only for semi- 
finite von Neumann algebras. The most im- 
portant ingredient of this theory lies in certain 
one-parameter groups of »-automorphisms of 
a von Neumann algebra, called modular auto- 
morphisms (see below), each one-parameter 
group of modular automorphisms being in- 
trinsically associated with a faithful semifinite 
normal weight of the algebra. Tomita's theory 
was perfected by Takesaki [13], who also 
showed that modular automorphisms satisfy 
(and are characterized by) a condition origi- 
nally introduced in statistical mechanics by 
the physicists R. Kubo, P. C. Martin, and J. 
Schwinger and accordingly known as the 
KMS condition. In the mathematical founda- 
tions of statistical mechanics, this condition 
characterizes equilibrium states of a physical 
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system for a given one-parameter group of 
automorphisms (of a C*-algebra) describing 
the time-development of the system. It was a 
fortunate coincidence that this condition was 
formulated in a so-called C*-algebra approach 
to statistical mechanics by R. Haag, N. M. 
Hugenholtz, and M. Winnink [14] at about 
the same time that Tomita's work appeared 
in 1967. The original proofs of the Tomita- 
Takesaki theory have been simplified con- 
siderably by the work of M. Rieffel [16] and 
A. Van Daele [15]. Deeper insight into the 
significance of modular automorphisms is also 
provided by the work of A. Connes [191, 
showing that the group of modular automor- 
phisms (up to inner automorphisms) is intrin- 
sic to the von Neumann algebra (i.e., inde- 
pendent of the weight) and belongs to the 
center of Out Æ (the group Aut M of all +- 
automorphisms of the von Neumann algebra 
M modulo the subgroup Int.@ of all inner 
*-automorphisms). 

Some of the basic definitions and results of 
the Tomita-Takesaki theory are as follows. If @ 
is a normal semifinite faithful weight on M, 
the antilinear operator S,, defined on a dense 
subset n (9t, NNE) of H, (— Section D) by the 
relation $,5(A) — n(A*), is tclosable and the 
polar decomposition S, — J,AJ^ of its closure 
defines a positive self-adjoint operator A,, 
called a modular operator, and an antiunitary 
involution J,. The principal results of the 
Tomita-Takesaki theory are (1) if xe æ, then 
af (A)e Ai AA," e Af for all real t, and this 
defines a continuous one-parameter group of 
*-automorphisms o? of æ, called modular 
automorphisms, and (2) if Ae, then j,(A)= 
J, AJ, € M', and j, is a conjugate-linear iso- 
morphism of JJ onto M’. A weight o on M 
is said to satisfy the KMS condition at f (a real 
number) relative to a one-parameter group of 
*-automorphisms oc, of A if, for every pair 
A, BEN, NN, there exists a bounded con- 
tinuous function F(z) (depending on A, B), on 
OxImz « f holomorphic in 0 « Imz « f and 
such that F(t) 2 g(Ao,(B)) and F(t-- if)— 
Q(o,(B) A). A given one-parameter group o, 
coincides with a group of modular automor- 
phisms o? if and only if @ satisfies the KMS 
condition at f = —1 relative too, (In statistical 
mechanics, f —(kT) !, where k is the Boltz- 
mann constant and T the absolute tempera- 
ture of the system.) 


I. Structure and Classification of Factors of 
Type III 


At the Baton Rouge Conference in 1967, R. 
T. Powers reported his results [17] on non- 
isomorphism of the one-parameter family 
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of factors of type IH (now called Powers's 
factors), which had been constructed by von 
Neumann in 1938 in terms of an infinite ten- 
sor product of factors of type I (abbreviated 
as ITPFI) Prior to Powers's work only three 
different kinds of factors of type III, along 
with the same number of factors of type II}, 
had been distinguished. A systematic classifica- 
tion of ITPFIs was subsequently given by 

H. Araki and E. J. Woods [18] in terms of two 
invariants, i.e., the asymptotic ratio set r Lë 
and the p-set p(s) of the von Neumann 
algebra 2 Using the Tomita-Takesaki the- 
ory, Connes [19] introduced two invariants, 
namely the S-set $(.//) (the intersection of the 
spectra of all modular operators) and the T-set 
T(@) (the set of all real t for which the modu- 
lar automorphism o? is inner), and, when æ is 
an ITPFI, proved the equality S(./7) 2 r, (M) 
and the relation T(./7) 2 2x|logp(./7)| !. The 
S-set S(.7/) of a factor of type III on a separa- 
ble Hilbert space ts either the set of all non- 
negative reals (type HI), the set of all integral 
powers of 4 (where 0 « 4 « 1) and 0 (type III ;), 
or the set (0, 1} (type III). The work of Araki 
and Woods shows that there exists only one 
ITPFI of type III, for each Zem, 1) (the exam- 
ples of Powers) as well as for 4— 1, while there 
exist continuously many ITPFIs of type III. 
Woods [20] has shown that the classification 
of ITPFIs of type III, is not smooth. 

A structural analysis of factors of type III, 
given independently by Connes [19], Takesaki 
(Acta Math., 1973), and Araki (Publ. Res. Inst. 
Math. Sci., 1973) expressed independently a 
certain class of factors of type III as a kind of 
crossed product of semifinite von Neumann 
algebras with their injective endomorphisms 
(automorphism in the case of Connes). These 
analyses led Takesaki [21] to the discovery 
of a duality theorem for crossed products of 
von Neumann algebras with locally compact 
groups of theit «-automorphisms (— Section J) 
and its application to the following structure 
theorem for von Neumann algebras of type III. 
The crossed product of a von Neumann alge- 
bra æ with the group of modular automor- 
phisms gf is a von Neumann algebra A of 
type IL, , with a canonical action 6, of the 
dual group as a one-parameter group of «- 
automorphisms which is trace-scaling, i.e., 
tof, ze '1 for some faithful normal trace «. 

If æ is properly infinite, the crossed product 
of A" with 0, is isomorphic to the original von 
Neumann algebra .@. In particular, any von 
Neumann algebra A of type III can be writ- 
ten as the crossed product of a von Neumann 
algebra A of type H, with a one-parameter 
group of trace-scaling *-automorphisms 6,. 
The isomorphism class of A is determined by 
the isomorphism class of A together with the 
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conjugacy class of 6, modulo inner automor- 
phisms. The restriction 6, of 0, to the center Z 
of A is of special importance. M is a factor if 
and only if 6, is ergodic. In that case, æ is of 
type HI, if A is a factor, 4 is of type Ill, 
0« 4 « 1, if Ó is periodic with period —log 4, 
and M is of type III, if ie aperiodic and not 
isomorphic to the one-parameter group of 
*-automorphisms of L „(R) induced by the 
translations of the real line R. (The excluded 
case does not occur for æ of type III.) 

A von Neumann algebra on a separable 
Hilbert space is called approximately finite- 
dimensional (or approximately finite or hyper- 
finite) if it is generated by an increasing se- 
quence of finite-dimensional *-subalgebras. 
This class of von Neumann algebras includes 
many important examples, such as ITPFI and 
the von Neumann algebra generated by any 
representation of canonical commutation (or 
anticommutation) relations on a separable 
Hilbert space. The classification of approxi- 
mately finite-dimensional factors is almost 
complete. In fact the uniqueness of an approxi- 
mately finite-dimensional factor of type II, has 
been known since the work of von Neumann. 
It is called the hyperfinite factor. The unique- 
ness of approximately finite-dimensional fac- 
tors of type H „ (which is then the tensor prod- 
uct of the hyperfinite factor with 2(5))) and of 
type III,, 0 € 4 « 1, (which are then Powers's 
factors) has been demonstrated by Connes 
[22]. Approximately finite-dimensional fac- 
tors of type III, are classified exactly by the 
isomorphism classes of the ergodic groups 6, 
of »-automorphisms of commutative von Neu- 
mann algebras 2. Any such factor is a Krie- 
ger's factor, i.c., a crossed product of a com- 
mutative von Neumann algebra with a single 
*-automorphism. Examples of such factors 
have been extensively studied by Krieger, who 
has also shown [23] that isomorphism of a 
Krieger's factor is equivalent to weak equiva- 
lence of the associated nonsingular transfor- 
mation of the standard measure space. 

A von Neumann algebra on a separa- 
ble Hilbert space is approximately finite- 


Banach Algebras H). 


J. Crossed Products 


The crossed product .Z/ ©, G of a von Neu- 
mann algebra .ZJ (acting on a Hilbert space 
$) and a locally compact Abelian group G 
relative to a continuous action a of G on M 
(by *-automorphisms «,, ge G) is the von 
Neumann algebra A generated by the opera- 
tors n(A), Ae M and A(h), he G, defined on the 
Hilbert space L(G, $) of all $-valued L,- 
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functions on G (relative to the Haar measure) 
by 


[x(4)£](g) =a, ! (A)&(g). 

[AChE] lg) = (hg), 

where če L (G, $). The canonical action ĉ 
of the dual G on A" is defined by &,(B)= 


u(p)Bu(p)* for Be A" and pe G, where p(p) 


is defined by [ n(p)$](g) 2 Ca. p? ciao) The 
duality theorem of Takesaki [21] asserts 

that [./ QG] Q; G is isomorphic to M @ 
Z(L.,(G)), where the second factor @(L,(G)) 
is the algebra of all bounded linear operators 
on L(G). 





K. Natural Positive Cone 


The closure V* of the set of vectors A% n(A) 
for all positive A in 9t, 1907 reflects certain 
properties of the von Neumann algebra M 
for O«a« 1/2 [24, 25]. In particular, V "^ is 
called the natural positive cone. It is a self-dual 
closed convex cone, and is intrinsic to the von 
Neumann algebra M (i.e., independent of 

the weight ol Every normal positive linear 
functional y on æ has a unique representa- 
tive &(V) in this cone De, V(A4) — (1, CA)£ (V), 
£(V))), and the mapping £ is a concave, mono- 
tone, bijective homeomorphism, homo- 
geneous of degree 1/2. The group of all +- 
automorphisms of æ has a natural unitary 
representation U(g), ge Aut A. satisfying the 
relations U(g) AU(g)* 2 g(4), U(g)é(o) — 
&(o0g ). 


L. C*-Algebras and von Neumann Algebras 


Let a C*-algebra . be given. A *-represen- 
tation x T, gives rise to a von Neumann 
algebra M, generated by 7;, ve The type of 
this representation is defined according to the 
type of A A C*-algebra is called a C*-algebra 
of type I if its «representations are always of 
type I. It is known that this class is exactly the 
class of GCR algebras (— 36 Banach Algebras 
H). It is also known that a separable non-type 
I C*-algebra has a representation of type II 
and a general non-type I C*-algebra has a 
representation of type III (J. Glimm, Ann. 
Math., 1961; Sakai [8]). 

For a C*-algebra £, all its representations 
` generate tinjective von Neumann algebras if 
and only if æ is tnuclear [26, 27]. 


M. Topological Groups and von Neumann 
Algebras 


Consider a unitary representation g — U, of a 
locally compact Hausdorff group G (— 423 
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Topological Groups). If this representation is 
a tfactor representation, the type of this repre- 
sentation is defined according to the type of 
the von Neumann algebra æ generated by 
LU. gEG. A tgroup of type I is a group whose 
factor. representations are all of type I. For 
example, connected semisimple Lie groups and 
connected nilpotent Lie groups are of type I 
(Harish-Chandra, Trans. Amer. Math. Soc., 
1953; J. M. G. Fell, Proc. Amer. Math. Soc., 
1962). Examples of groups that are not of type 
I are known (— 437 Unitary Representation 
E). 


References 


[1] J. von Neumann, Collected works II, III, 
Pergamon, 1961. 

[2] W. Arveson, An invitation to C*-algebras, 
Springer, 1976. 

[3] O. Bratteli and D. W. Robinson, Operator 
algebras and quantum statistical mechanics, 
Springer, 1979. 

[4] J. Dixmier, Von Neumann algebras, 
North-Holland, 1981. 

[5] J. Dixmier, Les C*-algébres et leurs repré- 
sentations, Gauthier-Villars, 1964. 

[6] I. Kaplansky, Rings of operators, Ben- 
jamin, 1968. 

[7] G. K. Pedersen, C*-algebras and their 
automorphism groups, Academic Press, 1979. 
[8] S. Sakai, C*-algebras and W*-algebras, 
Springer, 1971. 

[9] J. T. Schwartz, W*-algebras, Gordon & 
Breach, 1967. ) 

[10] S. Stratila and L. Zsido, Lectures on von 
Neumann algebras, Editura Academiei/Abacus 
Press, 1975. 

[11] M. Takesaki, Theory of operator algebras 
I, Springer, 1979. 

[12] Y. Nakagami and M. Takesaki, Duality 
for crossed products of von Neumann alge- 
bras, Lecture notes in math. 731, Springer, 
1979. 

[13] M. Takesaki, Tomita's theory of modular 
Hilbert algebras and its applications, Lecture 
notes in math. 128, Springer, 1970. 

[14] R. Haag, N. M. Hugenholtz, and M. 
Winnink, On the equilibrium states in quan- 
tum statistical mechanics, Comm. Math. Phys., 
5 (1967), 215-236. 

[15] A. Van Daele, A new approach to the 
Tomita-Takesaki theory of generalized Hilbert 
algebras, J. Functional Anal., 15 (1974), 378— 
393. 

[16] M. Rieffel and A. van Daele, A bounded 
approach to Tomita-Takesaki theory, Pacific 
J. Math., 69 (1977), 187-221. 

[17] R. T. Powers, Representations of uni- 
formly hyperfinite algebras and their asso- 





200 A 
Orbit Determination 


ciated von Neumann rings, Ann. Math., (2) 

86 (1967), 138—171. 

[18] H. Araki and E. J. Woods, A classifica- 
tion of factors, Publ. Res. Inst. Math. Sci., (A) 
4 (1968), 51—130. 

[19] A. Connes, Une classification des facteurs 
de type III, Ann. Sci. Ecole Norm. Sup., (4) 6 
(1973), 133—252. 
[20] E. J. Woods, The classification of factors 
is not smooth, Canad. J. Math., 25 (1973), 96- 
102. 

[21] M. Takesaki, Duality for crossed prod- 
ucts and the structure of von Neumann alge- 
bras of type III, Acta Math., 131 (1973), 
249-310. 

[22] A. Connes, Classification of injective 
factors, Ann. Math., 104 (1976), 73-115. 

[23] W. Krieger, On ergodic flows and the 
isomorphism of factors, Math. Ann., 223 
(1976), 19-70. 

[24] H. Araki, Some properties of modular 
conjugation operators of von Neumann alge- 
bras and a non-commutative Radon-Nikodym 
theorem with a chain rule. Pacific J. Math., 50 
(1974), 309—354. 

[25] A. Connes, Caractérisation des espaces 
vectoriels ordonnés sous-jacents aux algébres 
de von Neumann, Ann. Inst. Fourier, 24, Fasc. 
4 (1974), 127-156. 

[26] M. Choi and E. Effros, Separable nuclear 
C*-algebras and injectivity, Duke Math. J., 43 
(1976), 309—322. 

[27] M. Choi and E. Effros, Nuclear C*- 
algebras and injectivity: The general case, 
Indiana Univ. Math. J., 26 (1977), 443—446. 
[28] R. V. Kadison and J. R. Ringrose, Funda- 
mentals of the theory of operators I, Academic 
Press, 1983. 

[29] S. Stratila, Modular theory in operator 
algebras, Editura Academiei/Abacus Press, 
1981. 


309 (XX.7) 
Orbit Determination 


A. General Remarks 


The purposes of the theory of orbit determi- 
nation are (1) to study properties of orbits of 
celestial bodies, (2) to determine orbital ele- 
ments from observed positions of the celestial 
bodies, and (3) to compute their predicted 
positions utilizing the orbital elements. Celes- 
tial bodies to which the theory is applied are 
mainly planets, asteroids, comets, satellites, 
and artificial satellites in the solar system, 
although orbits of meteors and visual, photo- 


1162 


metric, and eclipsing binaries can be deter- 
mined by similar methods. 


B. Kepler's Orbital Elements 


Consider, for example, an asteroid moving on 
an ellipse with one focus at the sum. The ellip- 
tic orbit is fixed by the initial conditions of the 
motion or the integration constants of the 
tHamilton-Jacobi equation (— 55 Celestial 
Mechanics) and is described by Kepler’s orbital 
elements a, e, œ, i, Q, and t (Fig. 1). 


north 





Fig. 1 
Orbital elements 


The size and shape of the ellipse are deter- 
mined by the semimajor axis (half the tmajor 
axis) a and the 'eccentricity e, while the argu- 
ment c of perihelion, measured from the as- 
cending node to the perihelion, shows the 
direction of the major axis. (Sometimes, we 
adopt as one of the main parameters the peri- 
helion distance q = a(1 — e) instead of the semi- 
major axis a.) The position of the orbital plane 
is determined by the inclination angle i to the 
ecliptic and the longitude Q of the ascending 
node, and then the position of the asteroid 

on the orbit is determined by the time t, of 
the perihelion passage. The period T of one 
revolution, or mean motion n — 2z/T, which 

is the mean angular velocity, is coraputed by 
Kepler’s third law n?a? = u, with u a constant 
depending on the mass of the asteroid. The 
mean motion is a fundamental frequency in 
the solution of the Hamilton-Jacobi equation 
and is obtained by differentiating the energy 
constant — u/2a with respect to an action 
variable pa. 

To express the position of the asteroid on 
the ellipse as a function of time, we use the 
true anomaly v, which is the angular distance 
of the asteroid from the perihelion, the eccen- 
tric anomaly E, and the mean anomaly M = 
n(t — to). Of these three anomalies the mean 
anomaly can be derived directly from Kepler's 
elements, although it must be transformed to 
the true anomaly or to the eccentric anomaly 
when we compute the coordinates of the aster- 
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oid. Kepler's equation 
E—esinE- M (1) 


holds between E and M. Solving this equation, 
we obtain an expression for E as a function of 
M: 


v 2 
E=M t Y, -J,(ne)sinnM, (2) 
ni H 


where J, is the tBessel function of order n. 
However, in practical computations, we often 
solve equation (1) directly by numerical 
methods or by using tables. 


C. Orbit Determination 


An astrometric observation of a celestial body 
usually consists of measurements of two co- 
ordinates (right ascension and declination) on 
the celestial sphere. Therefore, to derive six 
orbital elements, three sets of observations 
should be made at three moments separated 
by appropriate time intervals. If the topocen- 
tric distance of the celestial body is known, 
the orbital elements can be computed directly 
from observations. However, since the distance 
is not usually known, special methods have 
had to be developed. A method for orbit deter- 
mination was worked out by C. F. tGauss at 
‘the beginning of the 19th century to find the 
orbit of Ceres, the first asteroid to be dis- 
covered. Although the topocentric distances 
are not known, we know that orbits of aster- 
oids are planar, and Kepler's second law, the 
law of conservation of areal velocity, holds 
approximately. Therefore we can assume that 
the area of the triangle made by the sun and 
the two positions of the asteroid observed at 

. different moments is proportional to the cor- 
responding time interval. Using this property 
of the orbit we can derive the topocentric 
distance and then the orbital elements. This 
method is called the indirect method, and 
similar methods can be developed for para- 
bolic and hyperbolic orbits. 


D. Osculating Elements and Orbit 
Improvement 


For the ttwo-body problem the orbit is a fixed 
and invariable ellipse, and therefore Kepler's 
orbital elements are constants. On the other 
hand, when gravitational interactions from 
other bodies cannot be disregarded, the orbital 
elements are found to be variable by comput- 
ing the tperturbations by the tmethod of vari- 
ation of constants. The perturbations are 
expressed as sums of periodic, tsecular, and 
long-periodic terms. 
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Because of the perturbations, the orbit 
deviates from the fixed ellipse, although at 
every moment the instantaneous velocity and 
position of the asteroid determines an ellipse. 
The orbital elements of the ellipse thus defined 
at each moment, called osculating elements, are 
variable with time. To compute perturbations 
that cause this change of osculating elements, 
it is necessary to observe the initial conditions 
of motion, i.e., the osculating elements at the 
initial moment. During a time interval shorter 
than the period of one revolution, the vari- 
ations of the osculating elements are usually 
very small. Therefore, by three sets of observa- 
tions made at three moments at short inter- 
vals, it is possible to determine the orbital 
elements that can-be identified with the oscu- 
lating elements observed at the mean moment. 
However, if the intervals are very short, errors 
in the determined values often are very large, 
and it becomes necessary to carry out obser- 
vations at distant moments also. When such 
additional observations are made, those data 
are compared with the respective values that 
follow from the initial observations, and the 
perturbations computed from them; then the 
method of least squares is used to improve the 
estimation of the orbital elements. 


E. Artificial Satellites 


Since the periods of revolution of asteroids 
are of the order of a few years, the osculating 
elements change very little in a few weeks. On 
the other hand, for artificial satellites moving 
around the earth, the periodic as well as secu- 
lar perturbations become very large after a 
few hours because the period of revolution 
may be as short as two hours. Therefore, to 
determine orbital elements for artificial satel- 
lites, observed positions should be corrected 
by subtracting the effects of periodic pertur- 
bations computed from approximate orbital 
elements already known. By using the obser- 
vations thus corrected, mean orbital elements 
are derived by the method of orbit improve- 
ment. The approximate orbital elements can 
be computed if the launching conditions of the 
satellites are known. In this manner, mean’ 
orbital elements can be derived every day, and 
variations of the mean orbital elements, or 
amounts of secular perturbations, for a certain 
period (say, for 100 days) are found. From 
them, information on atmospheric density and 
the gravitational potential of the earth are 
derived. It should be remarked that for arti- 
ficial satellites distance measurements have 
been made by radar, and velocity determi- 
nations have been made by measuring the 
Doppler effect. 
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For satellites of other planets, measurements 
of two coordinates with respect to the centers 
of the planets are made. Masses of planets can 
be computed by Kepler's third law when the 
orbital elements of satellites are known, and 
gravitational potentials of the planets can be 
determined from their secular perturbations. 


F. Binaries 


In the study of visual binaries, methods similar 
to those for satellites can be applied, although 
the exact estimation of the distances to bina- 
ries is often impossible. For photometric 
binaries radial components of velocities are 
derived by measuring the Doppler effect; and 
for eclipsing binaries important information, 
such as their masses, densities, and sizes, as 
well as data regarding their internal consti- 
tutions, can be derived from the observed 
orbital elements. 
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A. History 


Many spaces used in functional analysis, such 
as "Hilbert spaces, *Banach spaces, and *topo- 
logical linear spaces, are generalizations of 
Euclidean spaces, where the leading idea has 
been to generalize the distance in Euclidean 
spaces in various ways. On the other hand, 
generalizing the order concept for real num- 
bers has led to spaces of another kind: ordered 
linear spaces and vector lattices. The theory of 
vector lattices was presented in a lecture by F. 
Riesz at the International Congress of Mathe- 
maticians in 1928 [1] and has been developed 
by many authors. Among them we cite H. 
Freudenthal, L. V. Kantorovitch (Mat. Sb., 2 
(44) (1937)), Riesz (Ann. Math., (2) 41 (1940)), 
S. Kakutani, F. Bohnenblust, G. Birkhoff [2], 
H. Nakano (3], B. C. Vulikh [5], and H. H. 
Schaefer [8]. Vector lattices have been used in 
lattice-theoretic treatments of integration (— 
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Section I), *spectral resolution, and *ergodic 
theory. The central notion is Banach lattices 
(— Section F), but the theory has been ex- 
tended to the case where E is a ‘locally convex 
topological linear space with the structure of 
a vector lattice [6-8]. 


B. Definitions 


A real tlinear space EF is said to be an ordered 
linear space if E is supplied with an torder 
relation > with the following two properties: 
(i)x2>y>xt+z>y+z; (i) x>y and 420 (Ais 
a real number) => Ax > Ay. 

If, in addition, E forms a flattice under this 
order >, we call E a vector lattice (Riesz space 
or lattice ordered linear space). 

For Sections B through E, we assume E to 
be a vector lattice. For any x, ye E, the tjoin 
and *meet of x, y are denoted by x v y and 
X ^ y respectively. The following relations are 
obvious: 


(x+z)v(y+z)=(xvy)+z, 
(x+z)A(y+z)=(xAy)+z, 


AXVAycA(Xvy) AXAAY=A(xAy) (ASO), 


AXVAY=A(XAy), Ax^Ay-A(xvy) (A<0) 


and 
(XV VJAZ=(XAZ)V(VAzZ), 
(X ^y)vz-(xv z)^(yv z). 


The last relation means that E is a ‘distribu- 
tive lattice. 

For xe E, the elements x v0, (—x) v0, and 
x v(—x) are called respectively the positive 
part, negative part, and absolute value of the 
element x, and are denoted respectively by x * 
x and |x|. The following identities hold: x = 
x*' —x' (Jordan decomposition), | x| ^ x * + 
Xx ,X'Ax =O, xvyt+txay=x+y, |Ax|= 
[A||x|, and |x —y| 3x vy —x ^y. 

For a, be E with a <b, the set {xja<x<b} 
is called an interval and is denoted by [a, 5]. 
A subset of E 1s called (order) bounded if it is 
included in an interval. An element e of E is 
said to be a unit or an Archimedean unit if 
for any xe E there exists a natural number n 
such that x <ne. A linear subspace I of E is 
called an ideal (or order ideal) of E if x eI and 
|y| &|x] imply yel. 


d 


C. Order Limits 


Given a subset {x,} of E, if an element x of 
E is an upper bound of {x,} and any upper 
bound y of {x,} satisfies the relation y z x, 

then it is called the least upper bound (or su- 
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premum) of {x,} and is denoted by sup, x, or 
V, X,. The greatest lower bound (or infimum) 
of {x,}, denoted by inf, x, or A, x,, is defined 
dually. 

A sequence {x,} (x, €E) is said to be order 
convergent to x if there exists a nonincreasing 
sequence {u,} (u„€ E) such that /\,,u,=0 and 
|x, —x| € u,. In this case x is called the order 
limit of {x,} and is denoted by x =o-limx,. For 
order convergent sequences (x,) and {y,}, we 
can show the following relations: o-lim(Ax, + 
Hills A(o-lim x,) + u(0-lim y,) and o-lim(x, v y,) 
— (o-lim x,) y (o-lim y,). We say that E is Archi- 
medean if the relations 0 € nx € y (n2 1,2,3,...) 
imply x=0. If E is Archimedean, then the 
relations x 2 o-lim x, and 4 — lim 2, imply Ax = 
o-lim 4,x,. We say that E is complete (c- 
complete) if any (countable) subset of E that 1s 
bounded above has a least upper bound. A 
o-complete vector lattice is always Archime- 
dean. If E is c-complete, for any sequence {x,} 
bounded above, o-lim sup x, is defined to be 
AA, VnznXm; We define o-liminf x, similarly. 
With these definitions, x — o-lim x, is equiva- 
lent to x 2 o-limsup x, — o-liminf x,. Any 
Archimedean vector lattice can be extended to 
a complete vector lattice in the same way as 
the real numbers are constructed from the 
rational numbers by Dedekind cuts (— 294 
Numbers). 


D. Examples of Vector Lattices 


'Sequence spaces, such as c, m, and /,, and 
‘function spaces, such as C, M, and L,, form 
vector lattices under pointwise ordering (— 
168 Function Spaces). Among these spaces c 
and C are not o-complete, but the others are 
complete. We give two examples. First, let X 
be a c-algebra of subsets of a space Q, and let 
A(Q, X) be the set of all finite o-additive tset 
functions defined on X. Then A(Q, X) is a 
complete vector lattice if we define Uu, zu: to 
mean 44,(S)z u;(S) for any Se X. The second 
example is an ordered space consisting of all 
tbounded symmetric operators T on a Hil- 
bert space H, where we define T, > T, to mean 
(Ti x, x) 2(T; x, x) for any x e H. In general, this 
space is not a vector lattice. However, if A is 

a commutative tW*-algebra of operators on 
H and § is the set of *symmetric operators be- 
longing to A, then $ is a complete vector lattice 
under the ordering just defined. We can re- 
place the conditions of finiteness in A(Q, 2) 
and boundedness in S with weaker ones and 
still obtain the same situation. The tRadon- 
Nikodym theorem in A(Q, X) and the spectral 
resolution theorem of symmetric operators in 
S can be extended to theorems of tspectral 
representations in general vector lattices. 





310 F 
Ordered Linear Spaces 


Let E, be a linear space of functions defined 
on a set Q ordered pointwise. If there exists a 
bijective mapping defined on a vector lattice E 
onto E, that is linear and order isomorphic, 
we call E, a representation of E. If E has an 
Archimedean unit and is simple (which means 
that E and {0} are the only ideals of E), then E 
can be represented as the set of real numbers 
such that the Archimedean unit of E is repre- 
sented by the number 1 (H. Freudenthal, Proc. 
Akad. Amsterdam, 39 (1936)). 


E. Dual Spaces 


Let &(E, F) be the set of order bounded linear 
mappings of a vector lattice E into a vector 
lattice F, where order boundedness means that 
any bounded (in the sense of the order) subset 
of E is mapped into a bounded set of F. For 
any ox, Q; € &(E, F), define o, 2 9, to mean 
$1602 o;(x)(x 20, xe E). If F is complete, then 
2(E, F) is a complete vector lattice. An element 
o € $(E, F) is called a positive operator if 2 0. 
If F is the set of real numbers R, then Q(E, F) 
is the set of all (order) bounded "linear func- 
tionals on E. This space, called the dual lattice 
of the vector lattice E and denoted by ER, is a 
complete vector lattice. For fe E" and x 2 0, 
x€ E, we have 


f'(Q)- sup f(y), f o=- inf. 219 


O<y<x 


|f\(x)= sup f(y). 


GEZ? 


F. Banach Lattices 


A linear space E is called a normed vector 
lattice if E is a vector lattice having the struc- 
ture of a *normed space satisfying |x| € | y| 
ixl < || yl]. Furthermore if a normed vector 
lattice E is complete relative to the norm, we 
call E a Banach lattice. The examples in Sec- 
tion D are Banach lattices (for ue A(Q, X) we 
define Vull =|4|(Q)). 

In Banach lattices, ||x, — xjj ^0 and ly, — ul 
0 imply Ia? y, CH Hl 50. Among relations 
between order convergence and norm conver- 
gence in Banach lattices, the following is one 
of the most fundamental: In a Banach lattice 
E, norm convergence of a sequence {x,} to x is 
equivalent to relative uniform star convergence 
of (x, to x, i.e., for any subsequence te) 
of {x,}, there exists a subsequence em) of 
{Xm} and an element y of E satisfying the 
relations |X m) — x| € y/l (L9 1, 2,...). 

Any set bounded relative to the order is 
bounded relative to the norm, but the con- 
verse does not hold in general. For a linear 
functional, however, these two concepts of 
boundedness coincide, and the order dual of 
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E is the same as the norm dual of E. More- 
over, the dual (in any sense) of a Banach lattice 
is also a Banach lattice. 


G. Abstract M Spaces and Abstract L Spaces 


For a Banach lattice E, we consider the follow- 
ing three conditions: (M) x, y 20 [x v y| = 
max(||x||, || yl]. (L) x, yz 0 [x + yl] = llxll + 

Il yl. (L,) x ^ ya 0-9 lx - yl? — xl? + (wtf 

(1 « p « oo). If E satisfies one of the conditions 
(M), (L), or (L,), we say that E is an abstract M 
space, and abstract L space, or an abstract L, 
space, written AM, AL, and AL, , respectively. 
If the unit ball of an AM space has a *greatest 
element, it is called the Kakutani unit of E. 
The duals of AM spaces, AL spaces, and AL, 
spaces are AL spaces, AM spaces with the 
Kakutani units, and AL, spaces (1/p + 1/q — 1), 
respectively. An AM space with a Kakutani 
unit is represented by C(Q), i.e., the set of all 
real-valued continuous functions defined on a 
compact Hausdorff space Q. The AL spaces 
and AL, spaces are represented by L, and L,, 
respectively, on a *measure space. Here the 
representation of a Banach lattice means a 
representation of a vector lattice preserving 
the norm (Kakutani, Ann. Math., (2) 42 (1941); 
Bohnenblust, Duke Math. J., 6 (1940)). 


H. Spectral Properties of Positive Operators 


The n-dimensional real vector space E, is a 
vector lattice under pointwise order (— Sec- 
tion D). An element x in E, is called strictly 
positive if x; 0 for all i. A square matrix A= 
(a, d of order n is called positive if a; ; 2:0 for 
all i and j. It corresponds to a positive opera- 
tor in E, (— Section E). A is called irreducible 
if there exists no permutation matrix P such 


A, A 

that rar) : "Laien A, and A, 
0 A 
3 


are square matrices of order n; (1 € n; « n). We 
denote by o(A) the *spectrum of A and by r(A) 
the tspectral radius of A, i.e., supí|a|| xeo(A)]. 
The spectral circle of A is the circle of radius 
r(A) having the origin as its center. O. Perron 
(Math. Ann., 64 (1907)) and G. Frobenius (S. B. 
Preuss. Akad. Wiss., 1908 and 1912) established 
the following remarkable result on the spec- 
tral properties of positive matrices. 

Theorem (Perron-Frobenius). Let A be a 
positive square matrix. Then its spectral radius 
r(A) belongs to c(A), and for this spectrum 
r(A) there exists an eigenvector x >0. Assume 
further that A is irreducible and the order of A 
is greater than 1. Then r(A) 0 and the eigen- 
space of A for r(A) is a 1-dimentional subspace 
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spanned by a strictly positive element. In this 
case the eigenvalues of A on the spectral circle 
are the kth roots of unity for some k multiplied 
by r(A), each of which is a simple root of the 
eigenequation of A. 

Since a positive matrix of order n corre- 
sponds to a positive operator in E,, exten- 
sions of this theorem to positive operators in 
ordered linear spaces have been studied by 
many mathematicians. For these extensions, see 
the following articles: M. G. Krein and M. A. 
Rutman (Amer. Math. Soc. Transl., 26 (1950); 
original in Russian, 1948), F. F. Bonsall (J. 
London Math. Soc., 30 (1955)), S. Karlin (J. 
Math. Mech., 8 (1959)), T. Ando (J. Fac. Sci. 
Hokkaido Univ., ser. 1, 13 (1957)), H. H. Schae- 
fer [8], H. P. Lotz (Math. Z., 108 (1968)), F. 
Nitro and I. Sawashima (Sci. Pap. Coll. Gen. 
Educ., Univ. Tokyo, 1966), I. Sawashima and F. 
Niiro (Nat. Rep. Ochanomizu Univ., 30 (1979)), 
and S. Miyajima (J. Fac. Sci. Univ. Tokyo, 27 
(1980)). 


I. Integrals Based on Ordering 


As applications of ordered linear space theory, 
we state the integrals of Daniel-Stone and 
of Banach. Let us begin with a set ‘€ of real- 
valued functions defined on an abstract space 
S and assume that € is a vector lattice with 
respect to the usual order relation, addition, 
and scalar multiplication. Assume further that 
a functional E( f) defined on € satisfies the 
following conditions: (i) additivity, i.e., E(f +g) 
= E( f ) - E(g); Gi) positivity, i.e., f 7:0 implies 
E(f) 20; (ii) f, f,e€ (n=1,2,...) and |f|« 
Daci [fal imply E(f) &277 : El fal) ) where | f| 
means f v(— f). A functional on € satisfying 
both conditions (1) and (ii) is called a positive 
linear functional. A positive linear functional 
on € satisfies M. H. Stone's condition (iti) if 
and only if it satisfies P. J. Daniell’s condition 
(itt): fi z f; 2... and lim, ,, f, =0 imply 
bm, E( f,) =0. Next, we define, for every 
function o on S admitting +œ as values, a 
functional N(q) as follows: 





Ni) i-i n (nD mee lols x D 
Here we put N(q)— +00 when for a function o 
there are no functions ( fa} such that |o| « 
EL Ifl. A function @ is, by definizion, a null 
function if N(@)=0 holds, and a set A is a 

null set if its tcharacteristic function is a null 
function. Since each function of ,— (9| N(q) 
< +00} takes finite values except on a null set, 
we can define addition and the scalar multipli- 
cation for such functions except on a null set. 
Let % be the set of equivalence classes of o 
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with respect to the relation o ^y defined as 
N(o—wW)=0. Then % is a Banach lattice with 
the norm N, and € is included in $y (by iden- 
tifying f and g of Œ when E(| f —g|) - 0). Let us 
denote now by £ the closure of € in §. Then 
any function o belonging to € is said to be 
Daniell-Stone integrable, and Lie) — N(o*)— 
N(o )(o*' =3(lel+ e) e^ =3(lel—¢)) is 
called the Daniell-Stone integral of o. The 
integral L thus defined is, as a functional on &, 
an extension of the functional E on ©. For this 
integral, Lebesgue's convergence theorem is 
easily proved, and a result corresponding to 
Fubini's theorem has been obtained [9]. Fur- 
thermore, the concepts of measurable func- 
tions, measurable sets, and measure can be 
defined by using L and £. Also, the relation 
between L and the Lebesgue integral with 
respect to this measure is known [9]. Since 

€ is an tabstract L-space, the Daniell-Stone 
integral L(g) is represented by the Lebesgue 
integral of o on a certain measure space. The 
Daniell-Stone integral introduced above is due 
to M. H. Stone [9]. Daniell (Ann. Math., (2) 

19 (1917—1918)) originally defined the upper 
integral I(@) by using E(f) on € satisfying 
conditions (i), (ii), and (iii^). Also, he defined 
the set £ of Daniell-Stone integrable func- 
tions by L-(o|I(o)2 —I(—@)}. S. Banach 
defined an integral by using methods similar 
to Daniel's, replacing condition (iii?) for a posi- 
tive linear functional E( f) on € by condi- 

tion (iii^): lim, ,,, f, —0,|f,| &g, and f,, ge€ 
imply lim, ,,, E( Ja) 20 [10]. Furthermore, N. 
Bourbaki [11] and E. J. McShane (Proc. Nat. 
Acad. Sci. US, 1946) have defined a more gen- 
eral integral than the Daniell-Stone with a 
condition analogous to (iii), replacing the se- 
quence in it by a tdirected family of functions 
LO 

Specifically if, in the Daniell-Stone integral, S 
is a locally compact Hausdorff space and € is 
the set of continuous functions with compact 
supports, then a functional E(f) on € satisfy- 
ing conditions (1) and (ii) is proved to satisfy 
the condition (iii), and the Daniell-Stone 
integral L(q) can be constructed from E(f) 
[11]. 

Banach also defined another integral for all 
real-valued bounded functions on TO. 1) by 
using the *Hahn-Banach extension theorem 
[12]. His definition is as follows: Let Ẹ be the 
set of all real-valued bounded functions on 
[0, 1) and 9I be the family of all finite sets of 
real numbers a — (a,,25, ...,«,). Furthermore, 
we define, for x(s)e 5 and «e A, 


1 n 
M(x,a)= sup — > x(st+o), 
—o €«s« +% Fl kz1 
where x(s) is considered as the periodic exten- 
sion to (—oo, +00) of the function defined 
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originally on TO, 1) and 
p(x)= inf M(x, a). 


Then, by the Hahn-Banach extension theorem, 
there exists a linear functional F on § satisfy- 
ing F(x) pt) If we write f x(s)ds for F(x), 
then we can prove immediately that | x(s)ds 
has the following properties: (1) [(ax(s) - 
by(x)} ds a | x(s)ds +b f y(s)ds, where a 

and b are real constants. (2) x(s) 2 0 implies 
[x(s)ds z 0. (3) f x(s+ so) ds = f x(s)ds, where sy 
is an arbitrary real number. (4) (1ds— 1. If 
necessary, we can add the property (5) fx(1 — 
s)ds — [ x(s)ds by defining 


x0 ds -i {F(x(s)) + F(x(1 —s))]. 


Then 


F(x)= [xo ds 


or 


5 (Fees) + F(x(1—s))}= La 


is called the Banach integral of x(s). 

The construction of the Daniell-Stone inte- 
gral and the Banach integral opened avenues 
to several other abstract integrals based on the 
order relation, such as an integral for more 
general functions with values in a vector lat- 
tice, or an integral considered as a mapping 
from a vector lattice into another vector lattice 
(or from an ordered set into another ordered 
set). Indeed, if the function takes values in a 
complete vector lattice, then almost all results 
in this section (e.g., the Hahn-Banach exten- 
sion theorem) hold trivially. For discussions of 
these and other abstract integrals — [13-15]. 
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A. Ordering 


The concept of ordering is abstracted from 
various relations, such as the inequality rela- 
tion between real numbers and the inclusion 
relation between sets. Suppose that we are 
given a set X ={x, y, z, ... }; the relation be- 
tween the elements of X, denoted by < or 
other symbols, is called an ordering (partial 
ordering. semiordering, order relation, or sim- 
ply order) if the following three laws hold: (i) 
the reflexive law, x < x; (ti) the antisymmetric 
law, x € y and y € x imply x — y; and (iii) the 
transitive law, x € y and y x z imply x & z. 

A set X with an ordering between its ele- 
ments is called an ordered set (partially ordered 
set or semiordered set). A subset of an ordered 
set X 1s also an ordered set with respect to the 
same ordering as in X. If for an arbitrary pair 
of elements x, y of an ordered set A either 
x<y or y<x must hold, then the ordering < 
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is called a total (or linear) ordering, and A is 
called a totally ordered (or linearly ordered) set. 

We sometimes write x € y as yz x. The 
binary relation > is called the dual ordering of 
<; it is also an ordering. More generally, the 
duals of concepts, conditions, and propositions 
concerning an ordering are defined by replac- 
ing the ordering with its dual. For example, 
x< y means that x € y and x Z y, while x> y 
means that x 2 y and x # y; and > is the dual 
of «. If a universal proposition concerning the 
ordering is true, then its dual is also true; this 
principle is called the duality principle for 
ordering. Incidentally, x < y is equivalent to 
the statement x < y or x = y, according to the 
definition of «. 


B. Definitions 


A subset of an ordered set X of the form 

ix[a «x «bj is denoted by (a, b), and a set of 
the form (a, b), {x|x <a}, or (x | x» aj is called 
an interval. In particular, S(c) 2 (x| x « c] is 
called the segment of X determined by c. A 
pair of elements a, b satisfying a <b is called a 
quotient of X and is denoted by b/a. 

When a<c<b or b<c<a, c is said to lie 
between a and b. A totally ordered set A is said 
to be dense if for any pair of distinct elements a 
and b in A there exists a third element c lying 
between a and b. When a <b and there is no 
element lying between a and b, then a is called 
a predecessor of b, and b a successor of a. In 
this manner, most of the terminology as- 
sociated with the inequality of numbers is 
carried over to general ordering. 

In an ordered set A, an element a is called 
an upper bound of a subset X if x <a for 
every element x of X. When an upper bound 
exists, X 1s said to be bounded from above (or 
bounded above). The dual concept of an upper 
bound is a lower bound of the subset; and if the 
subset has a lower bound, it is said to be 
bounded from below (or bounded below). A set 
bounded both from above and from below is 
simply said to be bounded. When a is an upper 
bound of X and ae X, then a is called the 
greatest element (or maximum element) of X. 
Such an element a (if it exists) is unique and is 
denoted by max X; its dual is the least element 
(or minimum element) and is denoted by 
min X. If there is a least element in the set of 
upper bounds of X, it is called the least upper 
bound (or supremum) of X and is denoted by 
Lu.b. X or sup X. Its dual is the greatest lower 
bound (or infimum) and is denoted by g.Lb. X 
or inf X. 

If the ordered set X is the image q(A) of a 
set A under a *mapping g, where A is of the 
form {4|C(A)}, then sup X is also written 
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supca) P(A) and is called the supremum of o(4) 
for all A that satisfy C(A). When there is no 
danger of misunderstanding, it may be written 
as sup, 9 (A) or sup 9 (A) and called simply the 
supremum of (A); similar conventions hold 
for inf, max, min, etc. 

An element a of a set X is called a maximal 
element if a « x never holds for any element x 
of X; its dual is a minimal element. If the 
greatest (least) element exists, it is the only 
maximal (minimal) element. But in general, a 
maximal (minimal) element is not necessarily 
unique. 


C. Chain Conditions 


An ordered set X is said to satisfy the minimal 
condition if every nonempty subset of X hasa 
minimal element. The dual condition is called 
the maximal condition. An infinite sequence 
(44,45, ..., an, ... } of elements of an ordered 
set X such that a, «a; « ... «a, « ... is 

called an ascending chain, and the condition 
that X has no ascending chain is called the 
ascending chain condition. The notions dual to 
those of ascending chain and ascending chain 
condition are descending chain and descending 
chain condition, respectively. By the chain 
condition, we mean either the ascending or the 
descending chain condition. Under the taxiom 
of choice, the maximal condition is equivalent 
to the ascending chain condition, and the 
minimal condition to the descending chain 
condition. 

If a totally ordered set X satisfies the mini- 
mal condition or, equivalently, if every non- 
empty subset of X has a least element, then the 
set X is called a well-ordered set, and its order- 
ing is called a well-ordering. 

The following theorem is called the principle 
of transfinite induction: Let P(x) be a propo- 
sition concerning an element x of a well- 
ordered set X such that (i) P(x) is true for the 
least element x, of X, and (ii) P(x) ts true if 
P(y) is true for all y satisfying y « x. Then P(x) 
is true for all x in X. 'Mathematical induction 
is a special case of this principle, where X is 
the set of all natural numbers. To define a 
mapping F from a well-ordered set X into a 
set Y, we may use the following principle: 
Suppose that F(x ) is defined for the least 
element x, of X, and for each element x of X 
there is given a method to associate an ele- 
ment G( f) of Y uniquely with each mapping 
f:S(x) Y with domain S(x), where S(x) is the 
segment of X determined by x. Then there 
exists a unique mapping F: X — Y satisfying 
F(x)=G(F | S(x)) for all x. The definition of 
the mapping F by this principle is called a 
definition by transfinite induction. The prin- 
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ciples of induction are often used for proving 
propositions or giving definitions concerning 
ordinal numbers (— 312 Ordinal Numbers). 


D. Directed Sets 


An ordered set (or in general a preordered 

set (— Section H)) in which every finite subset 
is bounded from above is called a directed 

set. Let B be a subset of a directed set A. If 
(b|bzaj)1Bz Ø for every element a of A, 
then B is said to be cofinal in A; such a subset 
B is itself a directed set. If (b|b za) c B for 
some element a of A, then B is said to be re- 
sidual in A; such a subset B is also cofinal in A. 
The condition that B is cofinal in A is equiva- 
lent to the condition that A — B 1s not residual 
in A. 


E. Order-Preserving Mappings 


A mapping o: AA’ of an ordered set A into 
an ordered set A' is called an order-preserving 
mapping (monotone mapping or order homo- 
morphism) if a < b always implies g(a) < (b). 
Moreover, if ọ is bijective and g~! is also an 
order-preserving mapping from A’ onto A, 
then ọ is called an order isomorphism. A’ is 
said to be order homomorphic (order isomor- 
phic) to A when there exists an order homo- 
morphism (order isomorphism) @ such that A’ 
= Q(A). If a mapping o: A-» A' gives an order 
isomorphism of A to the dual of A’, ọ is called 
a dual isomorphism (or anti-isomorphism). 


F. Direct Sum and Direct Product 


Let S be a set that is the tdisjoint union of a 
family {A,},-, of its subsets, and suppose that 
each A, is an ordered set. For a, be S, define 
axb to mean that a, be A, for some 4e A and 
a<b with respect to the ordering in A,. The 
ordered set S obtained in this way is called the 
direct sum (or cardinal sum) of the family 
{A,},ea Of ordered sets. When (a;);.4 and 
(bJa are elements of the Cartesian product 
P —[I.4A, of a family {A,},-, of ordered 
sets, we define (a;);.4 € (b;),.4 to mean that 

à; € b, holds for all Ac A. The ordered set P 
obtained in this way is called the direct product 
(or cardinal product) of the family TA. of 
ordered sets. 


G. Ordinal Sum and Ordinal Product 


Suppose that W= (A, B,...} is a family of 
mutually disjoint ordered sets and is itself an 
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ordered set. Then an ordering < can be de- 
fined in the disjoint union $ —( ) X (X e%) as 
follows: x € y in S means that either (1) there 
exists an A satisfying x, ye AeA and x< y 
holds with respect to the ordering in A; or (ii) 
for A and B satisfying xe AeA, ye BEAU, we 
have A « B. The ordered set S obtained in this 
way is called the ordinal sum obtained from d 
and is denoted by X, X. In particular, if UA = 
(A, B) and A <B, the ordinal sum is denoted 
by A4 B. 

Suppose that X is a subset of the Cartesian 
product TT, X, of a family of ordered sets 
indexed by an ordered set A, and the subset 
(A|x4 x ya} of A has a least element whenever 
x—(x;),,4, and y —(y;);.4 are two distinct 
elements of X. The ordering in X defined by 
setting x < y when x, < y, for the least ele- 
ment o of (4|x; # y;) is called the lexico- 
graphic ordering in X. It can be applied to X — 
II, X; if A is well-ordered; X is then called 
the ordinal product. When A, B, ... are ordered 
sets, AB... is often used to denote the ordinal 
product obtained from X, = A, X, — B,... 
with the ordering 1 «2 « ... of indices; the 
ordering in this ordinal product is called the 
lexicographic ordering in the Cartesian product 
AxBx.... 


H. Preordering 


A relation R between elements of a set X is 
called a preordering (or pseudoordering) if it 
satisfies the reflexive law and the transitive 
law, but not necessarily the antisymmetric law. 
By defining (x,, y,) R (x2, y;) & x, € x2, for 
example, a preordering of pairs (x, y) of real 
numbers is obtained. From a preordering R an 
equivalence relation ~ can be defined in X by 
xc y e(xRy and yRx). Let [X]=X/~ be 

the *quotient set of set X by this equivalence 
relation, and let [x], [ y] be the equivalence 
classes determined by x, ye X; then an order- 
ing € can be defined in [X] by [x] [y] & 
xRy. (For further topics — 52 Categories and 
Functors; 409 Structures.) 
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A. General Remarks 


Let AX B mean that two tordered sets A, B 
are torder isomorphic; then the relation ~ is 
an *equivalence relation. An equivalence class 
under this relation is called an order type, and 
the class to which an ordered set A belongs is 
called the order type of A. Historically, an 
ordinal number was first defined as the order 
type of a *well-ordered set (Cantor [2]). How- 
ever, it was found that a contradiction occurs 
if order type defined in this way are considered 
to form a set. Hence, another definition was 
given by J. von Neumann [3], which is stated 
in Section B. A similar situation was found 
concerning the definition of tcardinal numbers, 
which led to a new definition of cardinal num- 
bers using ordinal numbers, which is given in 
Section D. 


B. Definitions 


A set « is called an ordinal number if it satisfies 
the following two conditions: (i) « is a well- 
ordered set with the tbinary relation e as its 
ordering; and (ii) Bea implies B c o. Accord- 
ing to this definition, the empty set is an 
ordinal number, which is denoted by 0. Also, 
1= {0}, 2= {0,1}, 3— (0, 1,2), ... are ordinal 
numbers. These ordinal numbers, which are 
finite sets, are called finite ordinal numbers. 
The finite ordinal numbers are identified with 
the natural numbers (including 0). The set w 

— (0, 1,2, ... ) of all natural numbers is also an 
ordinal number. An ordinal number that is an 
infinite set, like œ, is called a transfinite ordinal 
number. 

For every well-ordered set A, there exists 
one and only one ordinal number order iso- 
morphic to A. This ordinal number is called 
the ordinal number of A. (Throughout this 
article, lower-case Greek letters denote ordinal 
numbers.) We also write «e f as a « fj, which 
defines an *ordering of the ordinal numbers. 
The least ordinal number is 0, and the order- 
ing of the finite ordinal numbers coincides 
with the usual ordering of the natural num- 
bers. The least transfinite ordinal number ts c. 
The ordering <, introduced by defining «<P 
to mean either « < f or «=f, is a "linear order- 
ing and, in fact, a twell-ordering of the ordinal 
numbers. Therefore ftransfinite induction can 
be applied to ordinal numbers. 

For any ordinal number a, the set 3 = 
£€|&<a} is also an ordinal number, and is 
the tsuccessor of «. There exists at most one 
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ordinal number that is the tpredecessor of «. A 
transfinite ordinal number without a prede- 
cessor is called a limit ordinal number, and ali 
the other ordinal numbers are called isolated 
ordinal numbers. The first limit ordinal num- 
ber is c. For any set A of ordinal numbers, 
(£|3n (č «ne A)] is an ordinal number and is 
sup A, the tsupremum of A. 


C. Sum, Product, and Power 


The sum «+ f, the product a f (or xf), and 
the power af of ordinal numbers o, f! are de- 
fined by transfinite induction on f and have 
the following properties: 


a+O0=a, a+f’=(a+ fy, 
a+y=sup{a+é|E<y}; 
«:0=0, a f’=a-Bta, 


a-y=sup{a-c|e<y}; 


a9 —1, ah =oF-a a’=supfat|e<yt. 


Here y is a limit ordinal number, and for the 
power we assume that «>0. The sum and 
product thus defined satisfy the associative 
laws (a+ B) - y za (B y), (a: B y= 
a-(B-y) and the left distributive law æ: (B +y) 
së + «y; the power satisfies the laws 
a9 ** of, a, af? = (of, If x and f are the 
ordinal numbers of the well-ordered sets A 
and B, respectively, then «+ f is the ordinal 
number of the tordinal sum A+ B, and a: f is 
the ordinal number of the tordinal product 
BA. 

When z> 1, any ordinal number g can be 
written uniquely in the form 


ao nPiy, CH EE uuo E CH 
Bi B, ... B,» 0, 


which is called the z-adic normal form for o 
when z — o, it is called Cantor's normal form. 
Let f be an ordinal number-valued function 
of ordinal numbers. We say that f is strictly 
monotone when o « f implies f(«) « f(f). If f is 
strictly monotone, then a < f(x). We say that f 
is continuous when f(y) 2 sup( f(£)| £ <y} for 
each limit ordinal number y. A strictly mono- 
tone continuous function is called a normal 
function. If f is a normal function, then for any 
a there exists a £ that satisfies f(fj) — f >a. In 
fact, it suffices to define Bin < c) by Bo — f(a - 
1), B, «, =S (Ba) and put B —sup( £ |n « oj. 
Since f(x) — c* is a normal function, there 
exists an e that satisfies of =e. Such an ordinal 
number e is called an e-number. We say that f 
is cofinal to « when there exists a monotone 
function f that satisfies a — sup f(£) |é < f]. 
The first ordinal number that is cofinal to « is 


0 cy «n, 1<i<n, 
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called the cofinality of x and is denoted by 
cf(a). 


D. Cardinal Numbers 


Let M — N mean that a one-to-one corre- 
spondence exists between the two sets M and 
N. An ordinal number « with the property that 
oC implies a < é is called an initial number 
or a cardinal number. 

With the taxiom of choice, it can be shown 
that for each set M there exists one and only 
one cardinal number a satisfying M ~ «. This 
unique « is called the cardinality (or cardinal 
number) of the set M and is denoted by M. 

All finite ordinal numbers are cardinal num- 
bers, and w is the least transfinite cardinal 
number. There exists one and only one mono- 
tone function that maps the class of ordinal 
numbers onto the class of transfinite cardinal 
numbers, and it is a normal function. The 
value of this function corresponding to « is 
denoted by X, (aleph alpha) or @,. In partic- 
ular, No — o, and X; is both the smallest un- 
countable cardinal number and the smallest 
uncountable ordinal number. A finite ordinal 
number is called an ordinal number of the first 
number class, and an ordinal number a satisfy- 
ing No a « N, is called an ordinal number of 
the second number class. The concept of ordinal 
number of the third (or higher) number class is 
defined similarly. 


E. Inaccessible Ordinal Numbers 


The cofinality cf(x) of « always satisfies cf(a) < 
a. An ordinal number is said to be regular 
when cf(x) — x and singular when cf(a) « x. For 
any ordinal number o, cf(a) is a regular car- 
dinal number; therefore any regular ordinal 
number is a cardinal number. When «= c, is 
regular and f is a limit ordinal number, « is 
said to be weakly inaccessible. Let R be the 
set-valued function of ordinal numbers, de- 
fined by R(0) Ø and R(a) - | {PREIE <a} 
(by ftransfinite induction), where 38(M) de- 
notes the tpower set of M. A regular ordinal 
number « is said to be strongly inaccessible 
when a o and the following condition is 
satisfied: If x, y are a pair of sets such that 

xe R(a), y c R(a), and there exists a mapping 
of x onto y, then ye R(a). If a regular ordinal 
number « is strongly inaccessible, it is weakly 
inaccessible. À strongly inaccessible ordinal 
number is usually defined as a regular number 
&- 0 such that f <a implies 3B(f) <a. Under 
the axiom of choice, this definition is equi- 
valent to the one given here. Moreover, under 
the tgeneralized continuum hypothesis, strong 
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inaccessibility and weak inaccessibility are 
equivalent. 
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Equations 


A. General Remarks 


Let x be a real (complex) variable and y a real 
(complex) function of x. Assume that y — F(x) 
is a differentiable function of class C" if x, y are 
real, and a holomorphic function if x, y are 
complex. We write y', y", ..., y? for the first 

n derivatives of y. A relation among x, y, y’, 


n 
uj 


,))—0 (1) 


(which holds identically with respect to x), is 
called an ordinary differential equation for the 
function y= F(x). Here we assume that the 
function f in the left-hand side of (1) is a real 
(complex) function of the n+ 2 variables x, y, 
y, ..., Y and is defined in a given domain of 
R"*? (C"*?), Usually we assume further that f 
has a certain regularity, such as being of class 
C" (r 20,1, ..., 0), treal analytic, or tcomplex 
analytic. A function y= F(x) that satisfies (1) is 
called a solution of (1). To find a solution of (1) 
is to solve or integrate it. Ordinary differential 
equations may be contrasted to partial dif- 
ferential equations, which are equations similar 
to (1) but in which y is a function of two or 
more variables x,, x5, ... and which contain 
the partial derivatives Oy/Ox,, 0y/Ox>,... (— 320 
Partial Differential Equations). Ordinarily, the 
term differential equation refers to an ordinary 
or partial differential equation. In this article, 


f(X, y, Meis 
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since we are concerned only with ordinary 
differential equations, we omit the word “ordi- 
nary.” If the left-hand side f of (1) contains 

y explicitly or 0f/0y'? 40, then we say that 
the order of (1) is n, and if further f is a poly- 
nomial in y, y^, ..., y? that is of degree m with 
respect to y™, we say that the degree of (1) is 
m. In particular, if f is a linear form in y, y’, 

.., Y", then (1) is said to be linear. A differ- 
ential equation that is not linear is said to be 
nonlinear (— 252 Linear Ordinary Differ- 
ential Equations; 291 Nonlinear Problems). 

Let (x, y, c,, ...,c,) be a function of the 
n -- 2 variables x, y, c,, ...,c, of class C" in 
a domain D, and let (xo, yo, c2, ...,c2)e D, 
lo, Yo» Ch, 900) 0, and 9, (Xo, Y». 6$, éch 
c9) zx 0. Then the equation (x, y, c2, ...,c2) 20. 
defines an fimplicit function y(x) of class C" 
satisfying the condition y(xo) 2 yo. Consider 
Cis ..., €, to be constants in @(x, y, c,, ...,c,) D 
and differentiate @ n times with respect to x. 
Then we obtain a system of n equations in the 
variables x, y, y', ...,y/,c,, ..., c,. If we can 
eliminate c,, ...,c, from these n equations and 
Q —0, then we obtain an nth-order differential 
equation of the form (1). Conversely, a solu- 
tion of an nth-order differential equation can 
usually be written in the form 


P(X, y, Cis ..., Cn) - 0, (2) 


which contains n arbitrary constants c,, ...,c 
(sometimes called integration constants). A 
solution containing n arbitrary constants of 
the form (2) of an nth-order differential equa- 
tion is called a general solution, and a solu- 
tion g(x, y, cQ, ..., c2) D obtained from a 
general solution 9 — 0 by giving particular 
values c9, ...,c9 to the arbitrary coastants is 
called a particular solution. Some equations 
admit solutions that are not particular solu- 
tions. They are called singular solutions (for 
example, 'Clairaut differential equations; — 
Appendix A, Table 14.1). 


n 


B. Systems of Differential Equations 


A set of n differential equations containing n 
unknown functions y,, ..., y, of a variable x is 
called a system of ordinary differential equa- 
tions. Here each equation of the system has a 
form similar to (1), but each left-hand side 
contains y,, ..., y, and their derivatives. A set 
of n functions y,, ..., y, of x is called a solution 
if the functions satisfy the given system of dif- 
ferential equations. The highest order of de- 
rivatives in the left-hand sides is called the 
order of the system of differential equations. 

We consider most frequently a first-order 
system of the form 


Y= SAX Yis Yn)  i2152,...,n. (3) 
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If we put y — y,, y' 2 Y2, ..., y" ? = y, and solve 
(1) with respect to y? to get y® = f, (x, y,, 

-<-> Yn) then (1) is equivalent to a system of 
equations of the form (3), where f, =y2, f;— 
Ja» f, — f. In an analogous way, a general 
system of equations can be transformed to a 
system of the form (3). Therefore (3) is called 
the normal form of differential equations. 


C. The Geometric Interpretation 


When x, y,, ..., y, are real, (3) can be inter- 
preted as follows: Let J — (a, b) be an open 
interval and D a domain of R". Let 


yi OR, €,, .... Cal i=1,2,...,n, (4) 


be functions of class C! defined for (x, c,, 
...,€4)€1 x D, and let D(x9) be the image in 
the y,, ..., y,-space of D under the mapping 
y; 7 Q(X, €1, -.., Cn) (= 1,2, ..., n) for a fixed 
xo€1. We assume that for each fixed Zeg! we 
have lPi, ..., 9,)/0(c,, ..., c,) #0 in D. Then 
for every x 2 xoel, c,, ...,c, are considered 
to be functions of (y,, ..., Yn) defined in a 
neighborhood of every point (y9, ..., y?) of 
D(x), and we have y; 2 el, €,, ...,c,)— 
fx Wu, Yn) (= 1,2, ..., n, ie. Yis ---3Yn 
satisfy a system of differential equations of the 
form (3). On the other hand, (4) represents a 
family of curves of class C! in the x, y,, ..., y,- 
space R"*! containing n parameters c,, ...,c,, 
for which (y1, ..., yn) is the tangent vector (in 
the terminology of physics, ( y;, ... , Yn) gives the 
speed and the direction of a stationary flow in 
R"*! at each point). By solving (3) we find the 
family of curves of class CT in R"*! (in the 
terminology of physics, we find a stationary 
flow of which the speed and the direction are 
given at each point). A solution containing n 
parameters analogous to (4) is called a general 
solution of (3), and a solution obtained from a 
general solution by giving particular values to 
the n parameters is called a particular solution. 
As may be imagined by the interpretation in 
this section, there exists in general one and 
only one particular solution passing through 
the point (xo, yf, ...,y°) for xoeL (y, .... ye 
D(x). The problem of finding this solution, 
i.e., the solution of (3) for which y;(xo) — y? for 
X= xg, is called the initial value problem (— 
316 Ordinary Differential Equations (Initial 
Value Problems)). 


D. Methods of Integration 


We have different methods of solving differen- 
tial equations. To solve differential equations 
by a finite number of integrations is called the 
method of quadrature. This method is useful 
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for some special types of differential equations 
(— Appendix A, Table 14.1). S. Lie gave theo- 
retical foundations for this method by using 
Lie transformation groups (— 431 Transfor- 
mation Groups; Appendix A, Table 14.111). 
There are many other methods, for exam- 

ple, power series methods (assuming that the 
solution can be expanded in a power series 
La,(x—a)’, substituting the series for y in (1), 
and finding its coefficients); methods of suc- 
cessive approximation; methods using La- 
place transforms or ! Fourier transforms; tper- 
turbation methods; numerical methods; etc. 
(— 303 Numerical Solution of Ordinary Dif- 
ferential Equations). 

Historically, finding explicit solutions of 
various kinds of differential equations has 
been the main object of the theory. Recently, 
however, the importance of qualitative studies, 
in particular theorems on the existence and 
uniqueness of solutions, has been recognized. 
For example, if a solution with a property A is 
given, and if the uniqueness of the solution 
having the property A and the existence of 
solutions having the properties A and B can be 
shown, then the given solution necessarily has 
the property B. In this way, topological and 
analytic studies of differential equations are 
applied to find their solutions (— 314 Or- 
dinary Differential Equations (Asymptotic 
Behavior of Solutions); 315 (Boundary Value 
Problems); 316 (Initial Value Problems); 126 
Dynamical Systems). 
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Ordinary Differential 
Equations (Asymptotic 
Behavior of Solutions) 


A. Linear Differential Equations 


A system of linear ordinary differential equa- 
tions can be written as 


x’ = A(t)x, (1) 
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where t is a real independent variable, x = 
(x4, ..., x,) Is an n-dimensional complex vec- 
tor function of t, and A(t) is an n x n matrix 
whose elements are complex-valued functions 
of t. If A(t) is a continuous function of t de- 
fined on an open interval I, any solution of (1) 
is continuously differentiable for te J. The 
question naturally arises as to how the solu- 
tions behave as t approaches either one of the 
endpoints of I; that is, the question of the 
asymptotic properties of the solutions. The 
interval J can always be taken to beO xt « oo, 
by applying a suitable transformation of the 
independent variable if necessary. 

The study of the fasymptotic expansions of 
solutions when the coefficient A(t) is an ana- 
lytic function of t was initiated by H. Poincaré 
in 1880. This work was continued by J. Horn, 
J. C. C. A. Kneser, and others in the direction 
of removing assumptions on the structure of 
A(t) and extending the domain where the ex- 
pansions are valid. The theory has been almost 
completed by W. J. Trjitzinsky, J. Malmquist, 
and M. Hukuhara (— 254 Linear Ordinary 
Differential Equations (Local Theory)). On the 
other hand, O. Perron initiated a new direc- 
tion of research by weakening the regularity 
conditions on the coefficients. His work was 
continued by F. Lettenmeyer, R. A. Spath, 
Hukuhara, and others. The methods used in 
these two lines of investigation were originally 
distinct, but Hukuhara established a unified 
method of treating the problems arising in 
these two different types of investigations. 
Furthermore, he succeeded in sharpening 
those results previously obtained. 

Here we assume that A(t) need not be ana- 
lytic. The following asymptotic properties 
of a solution x(t) as t— oo are considered: 

(i) boundedness of limsupt~!log|x(t)|; (ii) 
boundedness of solution: limsup|x(t)] < oo; 
(iii) convergence of solution: lim x(t); (iv) inte- 
grability: (?|x(s)|" ds < oo, etc. We call x(x) — 
limsupt ' log|x(t)| the type number (or Lya- 
punov characteristic number) of the solu- 

tion x(t). The fact that all solutions of (1) are 
bounded is equivalent to the tstability of the 
solution x — 0, and the fact that all solutions of 
(1) tend to zero as t oo is equivalent to the 
*asymptotic stability of the solution x =0. 


B. Constant Coefficients and Periodic 
Coefficients 


We begin with the particular case of (1), where 
A(t) is a constant matrix: 


x' = Ax. (2) 


To study the asymptotic properties of the 
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solutions of (2), it suffices to transform the 
matrix A into a tJordan canonical form, since 
the structure of the solution space of (2) is 
completely determined by the Jordan canon- 
ical form of A. Thus all solutions of (2) are 
bounded if and only if every eigenvalue of A 
has a real part not greater than zero, and those 
with zero real parts are of simple type, that 
is, the corresponding blocks in the Jordan 
canonical form are all 1 x 1 matrices; all solu- 
tions of (2) tend to zero as t— oo 1f and only 
if every eigenvalue of A has negative real part. 
Consider the linear system 


x = Ao(t)x, (3) 


where Ao(t) is a periodic matrix function of 
period w. According to tFloquet’s theorem, (3) 
is transformed into a system with constant 
coefficients by means of a suitable transfor- 
mation x= P(t)y, where P(t) is a nonsingular 
periodic matrix of period œw. Thus, at least 
theoretically, the information on the asymp- 
totic behavior of the solutions of the periodic 
system (3) can be derived from the correspond- 
ing theory for the system with constant coeffi- 
cients (2). 


C. Asymptotic Integration 


Suppose that A(t) is bounded. Then the type 
number y(x) is finite for any nontrivial solu- 
tion x(t) of (1), and the number of distinct 
type numbers does not exceed n. 

Consider the linear system 


x’=[A + B(t)]x, (4) 


where A is a constant matrix and A(t) is a 
matrix function such that ME || B(s)|| ds—0 as 
t— oo. For any nontrivial solution x(t) of (4), 
the limit u— limt '!log|x(t)| exists and is equal 
to the real part of one of the eigenvalues of A. 
Conversely, if at least one eigenvalue of A has 
real part u, then there exists a nontrivial solu- 
tion x(t) of (4) satisfying limt !log|x(t)| = u. 
Suppose in addition that B(t) 0 as t> oc. Let 
Hı SH € ... <p, be the real parts of the eigen- 
values of A. Then there exists a 'fundamental 
system of solutions of (4), {x; (t), ..., x,(t)j. 
such that for any c;, c, #0, 


loglc, x, (t) eae +c, X;,(t)| =p,t+o(t). 


A sharp estimate of the term o(t) was given by 
Hukuhara. 
Next consider the linear system 


x’=[A(t)+ B()]x. (5) 


where the matrices A(t) and B(t) satisfy 
f° || A(s)|l ds < oo and [^||B(s)l| ds < oo. 
Let A,(t), ..., A, (t) and 24, ..., Ags Aus lim A, (0), 
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be the eigenvalues of A(t) and A —lim A(t), 
respectively. N. Levinson proved the follow- 
ing theorem: Assume that 4,,..., 4, are mutu- 
ally distinct and M; (t) — Re fo [Aj(s) — A,(s)] ds 
satisfy either Mj (t)— oo as t— oo and for each 
pair (j,k), M&(t;) — Mi (t) 2 — K for t, <t,, or 
Mj(t)—> —oo as t2 oo and Mj(t;) My(t,)<K 
for t, «t;, or |Mi&(t;) — M&(t,)| x K for all t,, 
t5, where K is a positive constant. Then (5) has 
a fundamental system of solutions (x, (t), ..., 
x,(t)) such that 


t 

xj(t) - exp (| unn! [+o], j=1,...,n, 
0 

where č; is an eigenvector of A corresponding 

to 4;. 


D. Boundedness and Convergence of Solutions 


Consider again the linear system (5) satisfying 
J” | A(s)l| ds < oo and (*||B(s)|| ds < oo. Suppose 
that all eigenvalues of A(t) have nonpositive 
real parts and that the eigenvalues of A= 

lim A(t) whose real parts vanish are simple. 
Then all solutions of (5) are bounded. This 
result is a generalization, due to L. Cesari, of 
the so-called Dini-Hukuhara theorem. 

In the case of general A(t), it is known that 
not all solutions of (5) are bounded even if all 
solutions of (1) tend to zero as t— oo and if the 
matrix B(t) is such that (^||B(s)|| ds < oo and 
B(t) 50 as t ^ oo. However, if A(t) is periodic 
or satisfies lim inf Re [' tr A(s)ds > —oo, then 
under the assumption that f” | B(s)| ds < oo, 
the boundedness of all solutions of (1) implies 
the boundedness of all solutions of (5). 

The following inequalities often provide 
useful information about the asymptotic 
behavior of solutions of (1): 


Ix(0)| exp ( d Hl A(5)] as) «Ix(t)] 
0 


t 
<|x(0)| Si | u[A(s)] is) t>0, 
0 

where ol Ale Dm, [IZ -- hA(t)]] — 1]/h. 
(u[A(t)] was introduced by Lozinskii.) If 
lim sup f' u[A(s)] ds < 00, then all solutions of 
(1) are bounded; if lim vlt A(s)] ds exists, then 
for every solution x(t) of (1), |x(t)| tends to a 
finite limit as t oo; and if lim f u[A4(s)] ds = 
—oo, then all solutions of (1) tend to zero as 
t— oo. It can be shown that every solution of 
(1) tends to a finite limit as t— oo, provided 
that (^| A(s)|| ds < oo. 

If all solutions of (1) are bounded, then 
lim sup Re f‘ tr A(s) ds < oo. If liminfRe ('tr A(s) 
ds —oo, then (1) has a solution x(t) with the 
property that lim sup|x(t)| — 0. When lim|x(t)| 
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exists for every solution x(t) of (1), if there 
exists a nontrivial solution x(t) of (1) such 
that lim x(t)=0, then lim Re f tr A(s)ds= 
— oo, but if there is no such solution, then 
Re f' tr A(s)ds is bounded. 


E. Nonlinear Differential Equations 


Consider a system of nonlinear differential 
equations of the form 


x' — Ax + f(t, x), (6) 


where A is an n x n constant matrix and f(t, x) 
is an n-vector function that is continuous for 
tz 0, x| «A, and that satisfies f(t, 0) — 0. Sup- 
pose that f(t, x)/|x| 0 as |x| 50 and t> œ. 
Then for every eventually nontrivial solution 
x(t) of (6) that tends to zero as t2 œ, p= 
lim t ^! log|x(t)| exists and equals the real part 
of one of the eigenvalues of A. Conversely, 
if at least one eigenvalue of A has real part 
<0, then there exists a solution x(t) of (6) 
such that lim t~/ log|x(t)| =. Suppose that 
f(t, x)/x| 20 as |x|>0 uniformly with respect 
to t. Then if all eigenvalues of A have negative 
real parts, the zero solution x(t)=0 of (6) is 
asymptotically stable, and if A has an eigen- 
value whose real part is positive, then the zero 
solution of (6) is tunstable. Suppose that f(t, x) 
= (f(t, x)/0x,)-0 as |x| 20 uniformly with 
respect to t. In this case, if A is a matrix such 
that its k eigenvalues have negative real parts 
and the other n — k eigenvalues have positive 
real parts, then there exists a k-dimensional 
manifold S containing the origin with the 
following property: For t, sufficiently large, 
any solution x(t) of (6) tends to zero as t— oo, 
provided that x(t5)e S, and if x(t9)£ S, x(t) 
cannot remain in the vicinity of the origin no 
matter how close x(to) is to the origin. 

In the nonlinear system 


x’ — F(t, x), © (n 


suppose that F(t, x) is of period w with respect 
to t and has continuous partial derivatives 
with respect to x. Suppose, moreover, that (7) 
has a solution p(t) of period c. If all the tchar- 
acteristic exponents of the fvariational system 
of (7) with respect to p(t), y' = F,(t, p(t))y with 
F,(t, x) =(6F,(t, x)/0x;), have negative real parts, 
then the periodic solution p(t) is asymptoti- 
cally stable. If an autonomous system x' — F(x) 
has a periodic solution p(t) and the corre- 
sponding variational system y' — F,(p(t))y has 
n — 1 characteristic exponents with negative 
eigenvalues, then there exists an £7 0 such that 
for any solution x(t) satisfying |x(t,) — p(t9)| « € 
for some ty and t,, we have |x(t) — p(t - c)| 20 
as t oo for a suitable choice of c (asymptotic 
phase). 
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F. Scalar Differential Equations 


The aforementioned results can be specialized 
to the case of higher-order scalar (or single) 
ordinary differential equations. Much sharper 
results can often be derived through direct 
analysis of scalar equations themselves. In 
particular, detailed and deep results have been 
obtained for second-order linear differential 
equations of the form 


x" + q(t)x =0, (8) 


e.g., ' Mathieu's equation. 

If (* s|q(s)| ds < oo, then (8) has a funda- 
mental system of solutions (x, (t), x2(t)} satisfy- 
ing, for f oo, 
uff) ail — xy) (T1 +0(1)], 
x4(t)=1+0(1); 


if (^|q(s) - 1|ds< oo, then (8) has a funda- 
mental system of solutions satisfying 


x,(t) -e'[1--o(1)], x,(t)=e'[1+o(1)], 
xi(O=e'[1+o0)],  x5(t)5 —e ‘[1+o(1)]; 


and if f*la(s) — 1|ds « oo, then (8) has a funda- 
mental system of solutions 


x,()=e"[1 + 0(1)], 
x (t)=ie"[1 +0(1)], 


xit) "ol 


x,(t)=e "[1--0o(1)], 
x; (t) —ie TT) +0(1)]. 


Suppose that q(t)+c>0 as t2 oo and (^ |q'(s)] 
ds « oo. Then x(t) and x'(t) are bounded for 
every solution x(t) of (8). The same is true if 
q(t) is a positive periodic function of period 

c such that œ fọ q(s) ds <4. If q(t) is nega- 

tive, then (8) always has both bounded and 
unbounded monotone solutions. 

The number of linearly independent solu- 
tions x(t) of (8) satisfying (?|x(s)|? ds < oo 
plays an important role in feigenvalue prob- 
lems. It is known that the ordinary differen- 
tial operator /[x] 2 x" + q(t)x is of "limit point 
type at infinity if there exist a positive func- 
tion M(t) and positive constants k,, k such 
that q(t)<k, M(t), |M(t)M ??(t)| « kj, and 
[^ M "?(s)ds— oo, and that I[x] is of "limit 
circle type at infinity if q(t)>0, [^q ""(s)ds— 
oo, and |*|Eq~*?(s)q(s)] -(1/4)q ?"(s)a"*(s) 
ds « oo. 

Finally consider the nonlinear equation 


x" + q(t)|x|'sgnx —0, (9) 


where y is a positive constant and q(t) is a 
positive function. If q'(t) 2 0, then all solutions 
of (9) are bounded; if either g(t) 2 0 and lim q(t) 
« oo or q'(t) «0 and lim q(t) » 0, then all solu- 
tions x(t) of (9) are bounded together with 
their derivatives x(t); and if q'(t) 2 0, lim q(t) — 
oo and either g”(t)>0 or q'" (t) € 0, then all 
solutions of (9) converge to zero as t — oc. 
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Equation (9) is said to be oscillatory if every 
solution of (9) that is continuable to t= oo has 
arbitrarily large zeros. If (9) is oscillatory and if 
q,(t)>q(t), then the equation x" 4- g,(t)|x|’sgnx 
—0 is also oscillatory. When y= 1, (9) is oscil- 
latory if q(t) > (1 + £)/4t? for some e>0, and is 
not oscillatory if q(t) « 1/4t?. A necessary and 
sufficient condition for equation (9) with y #1 
to be oscillatory is as follows: fr sq(s)ds = oo 
if y>1; (^s?*q(s)ds- oo if0 «7 « 1. 
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315 (Xill.4) 

Ordinary Differential 
Equations (Boundary 
Value Problems) 


A. General Remarks 


Consider the differential equation in the real 
variable x 


, y) 0. (1) 


. , a, be points in an interval 7 c R and 


Ten, 


Let a,, .. 
consider several relations between nk values 
yla), y (aj, ..., y" P (aj), i 1, ..., k. The prob- 
lem of finding solutions of (1) satisfying these 
relations is called a boundary value problem of 
(1), and the relations considered are called 
boundary conditions. When k —2 and a,, a, are 
the endpoints of J, the problem, called a two- 
point boundary value problem, has been a main 
subject of study. We can consider boundary 
value problems in the same way for systems of 
differential equations. 
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B. Linear Differential Equations 


Consider a linear ordinary differential opera- 
tor L defined by 


L[y] - poG9)y? + p,6) y" +... + p. (X) y, 


. Where p,(x) is a complex-valued function of 
Class C"^* defined on a compact interval 
a<x<b and po(x) X0 for any xe[a, b]. We 
define a system of linear boundary operators 
U,,...,U,, by 


ULy]= J Mjy9 (a) 4- > ëtt? 
j= j=1 


Given a function f(x) and complex constants 
Yis ++- Ym the linear boundary value problem 
defined by 


Lly]J=f(),  U:Ly]=Y; 


is a two-point boundary value problem. When 
f(x)=0, y, 20, i=1,...,n, the problem is called 
homogeneous; otherwise it is called inhomoge- 
neous. Let L*[ y] be a formally tadjoint dif- 
ferential operator of L[ y]. A set of m* linear 
boundary conditions U*[y]=0, i- 1, ..., m*, 
is said to be an adjoint boundary condition of 
Uf y] 20, i— 1, ..., m, if for any function y of 
class C" satisfying U;[ y] 20, i— 1, ..., m, and 
any function y* of class C" satisfying U*[y*] 
=0,i=1,...,m*, we have [PL[y]y* dx= 

fe yL*[y*]dx. The boundary value problem 


L*[y]-0, Uf[y]-0, 


i=1,...,n, (2) 


i=1,...,m*, (3) 


is said to be an adjoint boundary value problem 
of 


L[y]=0, 


We say that the problem (4) is self-adjoint if 
L[y]2 L*[y] and the conditions U;[ y] —0, 
i— 1, ...,m, are equivalent to the conditions 
U*[y]20,i-1, ...,m*. 

The boundary value problem containing a 
parameter A 


LLy]=Ay, 


U[yl-0, i=1,...,m. (4) 


U[y]-20, i=1,...,n, (5) 


admits nontrivial solutions only for special 
values of 4. Such values of 4 are called the 
eigenvalues (or proper values) of (5), and the 
corresponding solutions 3&0 are called the 
eigenfunctions (or proper functions) of (5). For 
any value of 4 that is not an eigenvalue, there 
exists a unique function G(x, £, 2) such that the 
conditions L[ y] 2 Ay + f, U;Ly] - 0 are equiva- 
lent to y= f? G(x, č, A) f(é) d£. The function 

G(x, £, A) is called the Green's function of (5). If 
A=0 is not an eigenvalue, then (5) is equivalent 
to 


b 
y(x)- 4 | G(x, £)y(5) 45. 
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where Gio £a GO COL Bor! 


"|. function G(x, £, A) of (5) and th 


tion G*(x, €, A) of L*[ y] 2 Ay, 
have the relation G(x, č, Alz G 
the assumption that (5) is self- 
the following four proposition 
has only real eigenvalues whic 
or countably infinite discrete s 
functions corresponding to tw 
values are orthogonal to each 
is an forthonormal set of eiger 
that no eigenfunction is linear: 
of {Pn} then the system (9,] i 
orthonormal set in the Hilbert 
consisting of functions that arı 
grable on (a, b), and hence for 
expansion f —a,09,--a595 4 .. 
L(a, b) the tParseval equality 
if f is a function of class C" sat 
0, then the Fourier expansion 
uniformly to f on [a,b]. 

The boundary value problet 
order equation 


(p(x) y) t (a(x) + Ar(x)) y —0, 
ay(a)+ By(a)-0, — yy(b)- oy 


is called a Sturm-Liouville prol 
r are real-valued functions defi 
and o, fi, y, are real constants. ` 
q, r are continuous and p(x)»! 
[a, b]. Then (i) the eigenvalues 
ing sequence tending to +00; ( 
tion Q,(x) associated with 4, hi 
zeros in a « x « b, and there ex: 
adjacent zeros of @,(x) a zero « 
(iii) the set of eigenfunctions is 
set on [a,b] with weight functi 


b 
| r(x)e,(x)p,(x)dx—0, m: 

When the coefficients pe, .... 
defined in an open interval —o 
b< oo and p, is of class CHL 
natural way operators in the E 
consisting of functions that are 
grable in a « x « b, and the gen 
based on operator theory in H 
(— 390 Spectral Analysis of OH 


C. Nonlinear Differential Equ: 


Boundary value problems for 1 
ferential equations are very dif 
sults are obtained only for equ 
form. 

Consider, for example, the s 
equation 


y'—f(GQ y, y) 


315 Ref. 
ODEs (Boundary Value Problems) 


and boundary conditions y(a) 2 A, y(b) 
B. The following theorem has been proved: 
Suppose that f(x, y, y) is continuous fora x 
<b, o(x) & y & (x), —oo < y < +00, and 
(rte, y. VIS M (1 +y?) e" (x) > f(x, ea(x), etc 
and o" < f(x, @(x), o'(x)) fora xx b; and 
o(a) « A x o(a) and w(b)< B € o(b). Then (6) 
admits a solution y(x) such that y(a) — A, y(b) 
= B, and o(x) € y(x) & o(x) for a < x <b. If 
in addition f is an increasing function with 
respect to y, the solution is unique. More- 
over, under suitable conditions, the solution 
is obtainable by the method of successive 
approximations. 

The boundary value problem 


y" + 2yy" -2A(K? — y?) « 0, 
yO) = y'(0) 2 0, 


where 4 and k are constants, appears in the 
theory of fluid dynamics. It is known that if 
A> 0, the problem has a solution, and that if 
OxAx1, the solution is unique. 

Consider the system of differential equations 


y'(x)>k (x 00), 


uc ld, yis Ynh jz2l...n. 


The problem of finding a solution such that 
yj(aj) — bj, j — 1, ..., n, called Hukuhara's prob- 
lem, reduces to the initial value problem when 
the a; coincide. The problem of solving 


ge), 


yla) = b;, f= hen 


y" = f(x, y, y^,... 


is reduced to Hukuhara's problem by a suit- 
able change of variables. The following result 
is a generalization of tPerron’s theorem: Let 
w(x), a(x), j — 1, ..., n, be continuous and right 
and left differentiable functions and c(x) € 
(x) for a<x <b. Suppose that the f(x, y,, 

<-s V.) are continuous for x < x € f and c (x) « 
yy SOx), k — 1, ...,n; satisfy (x —a)(D*a(x) — 
Sx, Vises y) 20 for y= @,(x) and w(x) < 
yy <@,(X), k Aj; satisfy (x —a) (D*+ a(x) — fix, 
yis ,y,) <0 for y; oj(x) and ex(x) € y, 
@,(x), k #j; and satisfy c(aj) < b; € (aj. Then 
there exists a solution y(x) such that y;(aj) — b; 
and o(x) € y(x) € o(x). This theory was ap- 
plied by M. Hukuhara to the study of singular 
points of ordinary differential equations. 
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316 (XIII.3) 

Ordinary Differential 
Equations (Initial 
Value Problems) 


A. General Remarks 


Consider a system of ordinary differential 
equations 


dyi/dx Lis, Mu, Dal  i=1,...,n. (1) 


A. L. Cauchy first gave a rigorous proof for 
the existence and uniqueness of solutions: 

If fj, i— 1, ..., n, and their derivatives 0f;/Oy, 
are continuous in a neighborhood of a point 
(a, b,, ..., b,), then there exists a unique solu- 
tion of (1) satisfying the conditions y,(a)= b;, 
i=1,...,n. These conditions are called initial 
conditions, and the values a, b,, ..., b, initial 
values. The problem of finding solutions that 
satisfy initial conditions is called an initial 
value problem (or Cauchy problem). If we con- 
sider (x, y,, ..., Yn) as the coordinates of a point 
in the (n + 1)-dimensional space R"*!, then a 
solution of (1) represents a curve in this space 
called a solution curve (or integral curve). The 
statement that a solution satisfies initial con- 
ditions y;(a) 2 b;, i— 1, ..., n, means that the 
integral curve represented by it passes through 
the point (a, b, ..., b,). 

Since, in general, we can transform a dif- 
ferential equation of higher order into a sys- 
tem of differential equations of the form (1) by 
introducing new dependent variables, all defi- 
nitions and theorems concerning the system (1) 
can be interpreted as applying to a higher- 
order equation. For example, for the equation 
y? = f(x, y, y', ..., y" ?) the conditions y(a) — b, 
y'(a) & b', ..., y" P (a) DI" UU constitute initial 
conditions, and the values a, bb... b are 
initial values. If f and its derivatives 0f/0 y? are 
continuous, then there exists a unique solution 
satisfying given initial conditions. 

Suppose that the f; are continucus. Then a 
system of functions (y,(x), ..., y,(x)) is a solu- 
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tion of (1) if and only if 


yix) =r f Si y169) -+> Yul) dx, 


i=1,...,n. 


When the f; are not continuous, we define 
(GDL... y,(x)) to be a solution of (1) for 
the initial value problem y,(a)= b; if (y,(x), 
<<.» Ya(x)) satisfied the integral equation just 
given. 

We use the vectorial notation: y —(y,, ... 
ys) f=(f,,-.-»f,) together with [yll^ — yi + 
... + 2. The equations (1) are then written 
as the single equation 


y —f(xy). 


B. Equations in the Real and Complex 
Domains 


We state main theorems for differential equa- 
tions in the real domain in Sections C-F and 
in the complex domain in Section G. 


C. Existence Theorems 


Suppose that f(x, y) is continuous for |x—a|<r 
and |y — b|| <p, and that |\f(x, y)|| < M there. 
Then equation (1) admits a solution satisfying 
y(a)- b and defined in an interval |x —a| € 
min(r, p/M) (existence theorem). There are 

two methods of proving this theorem, one 
using Cauchy polygons and one using fixed- 
point theorems for function spaces. From this 
theorem we deduce that if f(x, y) is continuous 
in a domain D of R"*!, then there exists a 
solution curve passing through any point of D. 
Let y=9,(x) and y ^ g;(x) be solutions of (1) 
defined in the intervals I, and L, respectively. 
If I, <I, and g,(x) 2 e; (x) for xe1,, we say 
that p, is a prolongation or extension of g,. 
Given a solution of (1), there exists a nonex- 
tendable solution that is an extension of the 
solution. The solution curve of a nonexten- 
dable solution tends to the boundary of D as x 
tends to any one of the ends of its interval of 
definition. 


D. Uniqueness Theorems 


Continuity does not imply uniqueness of the 
solution. If (1) admits at most one solution 
satisfying a given condition, we call this con- 
dition a uniqueness condition. Various kinds of 
uniqueness theorems, which state uniqueness 
conditions, are known. 

The Lipschitz condition: 


I f(x, y) f(x z)|| « Llly—zl, L0 constant, 
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is one of the simplest. When f is continu- 
ous and satisfies the Lipschitz condition, the 
method of successive approximation, initiated 
by C. E. Picard, is often used to prove the 
existence of solutions. This method is as fol- 
lows: We choose a suitable function, for ex- 
ample yo(x) z b, and then define y,(x), k — 1, 
2, ..., recursively by y,(x)=b + fä (x, y,_1(x))dx. 
Then {y,(x)} is uniformly convergent, and its 
limit is a solution of (1) satisfying y(a) — b. 
Assuming the continuity of f, H. Okamura 
gave a necessary and sufficient condition for 
uniqueness: Suppose that f is continuous in D. 
Then a necessary and sufficient condition for 
there to exist a unique solution curve of (1) 
going from any point of D to the right is that 
there exist a C'-function @(x, y, z) defined 
for (x, y, z) such that (x, y) and (x, z)e D and 
satisfying the conditions oi, y, z) - 0 for y =z, 
Q(x, y, 2) » 0 for y Zz, and 


09 09 do 
Ce e y= filx, 2) <0. 
2: ty fils y)+ DR det 


E. Perron's Theorem 


Consider the scalar equation y' — f(x, y). We 
have Perron’s theorem: Let w(x) and @(x) be 
continuous functions that are right differenti- 
able in «<x « f and satisfy w(x) <@(x), and let 
f be a continuous function defined on D:a< 
x« B, o(x)< y<@(x). Suppose that D* w(x)< 
f(x, a(x) and D* @(x)> f(x, @(x)). (D * c de- 
notes the tright derivative of c.) Then for any 
(a, b)e D there exists a solution defined on 

ax x « f and satisfying y(a) — b. The fact that 
the interval of definition is a € x < f can be 
expressed by saying that if we denote the set 
oo «x « B, |y| « oo by Q, then D is closed in Q 
and there exists, among solution curves going 
from a point in D to the right, a curve that 
reaches the boundary of 2. 

Perron's theorem was generalized by M. 
Hukuhara and M. Nagumo. Let Q be an open 
set in R"*!, D a closed set in Q, and f a con- 
tinuous function in D. A necessary and suffi- 
cient condition for (1) to admit a solution 
curve going from any point (a, b) in D to the 
right is that there exist a sequence of points in 
D, {(a,,b,)}, such that apļa and (b, — b)/(a, — a) 
—f(a, b). Moreover, every solution curve is 
prolonged to the right to the boundary of Q. 
Let S(y) be a continuous tsubadditive and 
positively homogeneous function and a(x) a 
function continuous and right differentiable 
on a € x « fl. A sufficient condition for D'os 
x« B, S(y) & w(x) to possess the property in the 
statement of Perron's theorem is given by 
D*o(x)Z S(f(x, y)) for |yl =o(x). 

A continuous function w(x) is said to be a 
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right majorizing function of (1) with respect to 
S(y) if for any solution g(x), S(g(a)) < o(a) 
implies S(g(x)) < w(x) for x za if both S(g(x)) 
and (x) are defined. In order for w(x) to be 
a right majorizing function, it suffices that 
D* w(x) > S(f(x, y)) for |y|| 2 ox). A function 
satisfying this inequality is called a right su- 
perior function of (1) with respect to S(y). If 
F(x, S(y)) > S(f(x, y)), then any solution of 

y = F(x, y) is a right superior function of (1). 
Theorems stating such facts are called com- 
parison theorems. 

If (1) has a unique solution, the condition 
D* w(x) 2 S(f(x, y)) for S(y)=c(x) implies that 
w(x) is a right majorizing function of (1). Con- 
versely, we can derive from comparison theo- 
rems general uniqueness theorems, one of 
which we state. Suppose that G(x, y) is continu- 
ous fora «x H and 0< y « r(x); G(x,0)20; 

a solution of y'= G(x, y) such that y 2 o(r(x)) as 
xa +0 vanishes identically; and finally that 
S(f(x, y ) — f(x. y;)) < G(x, S(y, —y2)). Then for 
two solutions g,, 9; of (1) such that Ste, — 9;) 
— o(r(x)) as x «-- 0, we have p; 2 9,. Assum- 
ing that f is continuous at (a, b) and taking 
y/(x — a) as G, we obtain Nagumo's condition 
(x — a)S(f(x, y.) — f(x, y2)) < S1 — Y2). 

G. Peano proved the following theorem: 
With the same notation and assumption as in 
Perron's theorem, there exist a maximum 
solution @ and a minimum solution ¢ of y' = 
f(x, y) such that y(x) 2 b for w(a)<b< (a), 
and such that there exists a solution curve 
passing through any point in «<x « f, o(x)« 
y&qQ(x). This theorem was extended by Huku- 
hara as follows. Suppose that f(x, y) is con- 
tinuous and bounded in D:a x x « fi, [|y || « oo. 
Let C be a tcontinuum in D, and let %(C) de- 
note the set of solutions intersecting C. Then 
#&(C) is a continuum of the tfunction space 
C([a, P]). From this theorem we can deduce 
the Kneser-Nagumo theorem, which says that 
the intersection of the set of points belonging 
to the members of (C) and a hyperplane x = € 
is a continuum. It was proved by Hukuhara 
that if C is in the hyperplane x =«, then (1) 
admits a solution connecting the two hyper- 
planes x =a and x= f and passing through the 
boundary of the set of points belonging to the 
members of (C). 


F. Equations Containing Parameters 


We assume uniqueness of the solution of 
y =f(x,y, å) Asti, Ag) (2) 


where f is a continuous function of (x, y, 4). Let 
€ (x, a, b, 4) denote the solution of (2) satisfying 
y(a) 2 b. Then g(x, a, b, 4) is continuous with 
respect to (x, a, b, 4) in its region of definition. 
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If the derivatives OU Om, are also continuous, 
then g(x, a, b, 4) is a continuously differentiable 
function of (x, b); zy, =0@;/0b,, j— 1, ...,n, 
satisfy the system of linear ordinarv differential 
equations and the initial condition 


d ” (of; 
PS = 2 LG Zins Z(G) = dn: 


and z;=0@,/0a, j — 1, ..., n, satisfy the same 
system with the initial condition z;(a) — 

— Lia, b, A), where (0f;/€ yj) means (0/;/0y;) 

(x, g(x, a, b, A), A). If f further admits continu- 
ous derivatives 0f/0/,, then g(x, a, b, A) is con- 
tinuously differentiable with respect to A, and 
moreover, wj — 6@;/64,, j — 1, ..., n, satisfy 

the system 


d n dvo of; l 
EC LG "e w;la)=0. 


These differential systems are called the varia- 
tional equations of (1). 

C. Carathéodory proved the existence of 
solutions of (1) under the less restrictive as- 
sumption that f is continuous with respect to y 
for any fixed x and measurable with respect to 
x for any fixed y. 

Suppose that f is continuous and satisfies a 
Lipschitz condition. Let z(x) be a function 
such that z(a)=b and ||z(x)— f(x, z(x))|| < (x), 
and let y(x) be a solution of (1) such that y(a) = 
b. Then we obtain 


lz) y(x) ser 


> 





| e(xje "* ^ dx 


a 





which gives approximate solutions of (1). 


G. Equations in the Complex Domain. 


We assume that the variables x, y,. ..., y, all 
have complex values. We have the following 
theorem: If f is holomorphic at (a, b), then (1) 
has a unique solution that is holomorphic 

at x —a and takes the value b at x -: a. This 
theorem can be proved by utilizing the method 
of successive approximations and fixed-point 
theorems. Cauchy proved the theorem by 
using majorant series. This method, called the 
method of majorants, proceeds for the scalar 
equations as follows: Let f(x, y) - È a(x — 
ay(y — b and y= Xc,(x — ay". Substituting 
the latter series into both members, we can 
successively determine the coefficients c, by the 
method of undetermined coefficients. Assum- 
ing that | f| « M for |x —a| <r and |y—b| <p, 
consider the solution Y = X C,(x — ay of 


dY | M 
dx (1—(x—a)/r) (Y — byp) 





satisfying Y(a) — b. We have C, |c,| for any n, 
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which shows that 3 C,(x —a)' is a majorant 
series of 3 c,(x — a). 

We have the following uniqueness theorem: 
Suppose that f(x, y) is holomorphic at (a, b). 
Let C be a curve having the point a as one of 
its ends and g(x) be a solution with the follow- 
ing properties: 9 is holomorphic on C except 
possibly at x =a, and there exists a sequence of 
points on C, Io, such that a,—a and g(a,)> 
b. Then g(x) is holomorphic at a. By a ttheo- 
rem of identity, the analytic continuation 
of a solution continues to be a solution if it 
does not encounter any singularity of f. Ifa 
solution g(x) is holomorphic on a smooth 
curve x= y(t), with O&t tg and y(0)— a, and 
9(a) — b, then y 2 e(x(t)) is a solution for 0< 
t &tg of 


y =x (x)f(x(0, y) (3) 


satisfying y(0) — b. Conversely, if (3) has a solu- 
tion y= y(t) defined in Ox t € ty and satisfy- 
ing y(0) 2 b, and if f(x, y) is holomorphic at 
(x(t), w(t)) for O € t € ty, then (1) has a solution 
g(x) holomorphic on C and w(t)= e(x(t)) for 
Oxt&t,. 

Suppose that f= f, /f,, where f, and f, are 
holomorphic at (a, b). If f, (a, b) £0, f,(a, y) 0, 
and f,(a, b) «0, then the equation y' = f(x, y) 
admits a unique solution such that yb as 
xa, and this solution can be expanded into 
a tPuiseux series: 


y= Y, c,(x—a)"”. 
n=0 


If f is holomorphic at (a, b), then the solu- 
tion y 2 9(x, xo, Yo) of (1) satisfying y(x9) — yo 
is holomorphic with respect to (x, xo, Yo) at 
(a, a, b). If f(x, y, 4) is holomorphic at (a, b, A»), 
then the solution of (2), y=¢(x, xo, Yo, 4), satis- 
fying y(xo) 2 yo is holomorphic at (a, a, b, Ao). 
Suppose that x is a real variable and y is a 
complex vector. If f is continuous with respect 
to (x, y) and holomorphic with respect to y, 
then the solution g(x, xo, yo) is holomorphic 
with respect to yo. If f(x, y, 4) is continuous 
with respect to (x, y, 4) and holomorphic with 
respect to (y, A), then g(x, x, yo, 4) is holomor- 
phic with respect to (yo, A). 


References 


[1] E. Kamke, Differentialgleichungen reeller 
Funktionen, Akademische Verlag, 1930. 

[2] E. A. Coddington and N. Levinson, 
Theory of ordinary differential equations, 
McGraw-Hill, 1955. 

[3] P. Hartman, Ordinary differential equa- 
tions, Wiley, 1964. 

[4] H. Okamura, Condition nécessaire et 
suffisante remplie par les équations différen- 


317A 
Orthogonal Functions 


tielles ordinaires sans points de Peano, Mem. 
Coll. Sci. Univ. Kyoto, (A) 24 (1941). 

[5] O. Perron, Ein neuer Existenz Beweis für 
die Integrale der Differentialgleichung y'= 
f(x,y), Math. Ann., 15 (1945). 

[6] M. Hukuhara, Sur la théorie des équations 
différentielles ordinaires, J. Fac. Sci. Univ. 
Tokyo, sec. I, vol. VII, pt. 5, 1958. 

[7] M. Nagumo, Über die Lage der Integral- 
kurven gewóhnlicher Differentialgleichungen, 
Proc. Phys.-Math. Soc. Japan, 24 (1943). 

[8] E. Hille, Ordinary differential equations in 
the complex domain, Wiley, 1976. 

[9] R. Abraham and J. E. Marsden, Founda- 
tions of mechanics, Benjamin-Cummings, 
second edition, 1978. 


317 (X.21) 
Orthogonal Functions 


A. Orthogonal Systems 


Let (X, ui) be a tmeasure space. For complex- 
valued functions f, g on X belonging to the 
'function space L,(X), we define the inner 
product ( f, g) - fx f(x)g(x) du(x) and the norm 
If | =F)". If (f, g) «0, then we say that f 
and g are orthogonal on X with respect to the 
measure u. If X is a subset of a Euclidean 
space and p is the tLebesgue measure m, then 
we simply say that they are orthogonal. If the 
measure has a fdensity function g(x) with 
respect to the Lebesgue measure and (ta 

= (x f(x)g(x) p(x) dm(x) 20, we say that they 
are orthogonal with respect to the weight 
function Q(x). If || f ||? — 1, then f is said to be 
normalized. A set of functions { f,(x)} (n = 

1,2, ...) is said to be an orthogonal system 

(or orthogonal set), and we write { f} € O(X), if 
any pair of functions in the set are orthogonal. 
The orthogonal set { f,(x)} is said to be ortho- 
normal, and we write ( f.) eON(X) if each f, is 
normalized. 

Let { f,} be a set of linearly independent 
functions in L;(X), and let R be a subset of 
L,(X). If we can approximate any function 
f €R arbitrarily closely by a finite linear com- 
bination of the f(x) with respect to the norm 
in L;(X), we say that ( f,} is total in R. Let 
Uf] €0O(X). If (o, f.) 20 for all n implies o(x) 
=0 almost everywhere (a.e.), then 1 f,} is said 
to be complete in L,(X). An orthogonal system 
{ f,} is complete in L,(X) if and only if the 
system is total in L,(X). 

If { f.) € O(X), then the series 25, c, f, (x) is 
called an orthogonal series. If the series con- 
verges to g(x) tin the mean of order 2, then c, 
=(¢, f.y/Ilf.ll?. We call the c, (n 1,2, ...) the 
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expansion coefficients or Fourier coefficients of 
(x) with respect to { fa}. 

If {ga} < L;(X) are linearly independent, we 
can construct an orthonormal system { fa} 
by forming suitable linear combinations of 
the gn; { fa} spans the same subspace as {gẹ}. 
For this purpose we set f(x) — g,(x)/llg; ll, 
f(x) gell, where e,(x) 7 g.(x) — 
EtL ga ff, (x), n 22. This procedure is 
called Schmidt orthogonalization or Gram- 
Schmidt orthogonalization. 

If the c, are Fourier coefficients of o(x)e 
L,(X) with respect to { f,(x)} eON(X), then 
we have the *Bessel inequality 22 , |c,|^ < 
ol? Equality in the Bessel inequality for all 
Q9 € L;(X) (the tParseval identity) is equivalent 
to completeness of | f,} in L;(X). In this case 
LL ca falx) is called the orthogonal expansion 
of ọ with respect to { f,}, and conversely, for 
any sequence {c,} such that È |c,|? < oo, there 
is a function o € L,(X) that has the c, as its 
Fourier coefficients, and 


» le siet, 
n=1 


ots)=Lian( Y 21] 


m=i 


This is called the tRiesz-Fischer theorem. 


B. Orthogonal Systems on the Real Line 


We assume that X is a finite interval (a, b) and 
that functions on X are real-valued. We write 
O(a, b) or ON (a, b) instead of O( X), ON (X). 
(1) If ( f} EON (a, b), | fa(x)| < M (const.), and 
È c, f.(x) converges fa.e., then c, 0 as n> œ. 
(2) We can construct a complete orthonormal 
system { f,(x)} and a function g(x)e L, (a, b) 
such that its orthogonal expansion 2c, f,(x) 
diverges everywhere. (3) If { f,(x)}€ON(a, b) 
and Xc?log?n« œ, then X c, f, (x) converges 
a.e. The factor log? n cannot be replaced by 
any other monotone increasing factor w(n) 
satisfying 0 < o(n) — o(log? n) (Rademacher- 
Men'shov theorem). K. Tandori proved that if 
c,[0 and È c E, converges a.e. for any ortho- 
normal system {@,}, then È |c, log? n « oo. 
(4) If the orthogonal expansion of a function 
oe L;(a, b) is ttummable by Abel's method on 
a set E, then it is '(C, 1)-summable ae on E. 
(C, 1)-summability a.e. of the orthogonal ex- 
pansion of a function of o € L,(a, b) is equiva- 
lent to convergence a.e. of the partial sums 
Sj.(x) (n — 1,2, ...) of its expansion. (5) Sup- 
pose that { f(x)? EON (a, b), | f, (x)| < M. Then: 
(i) If the a, are the Fourier coefficients of (x) 
with respect to { f,(x)}, then 


oo ip’ b l/p 
(š a") «uec wor ; 
n=1 a . 
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where 1 «p x 2, 1/p + 1/p' — 1. Conversely, if 
(da, |?)"? < oo (1 « p €2), there exists a func- 
tion @(x) which has the a, as its Fourier coeffi- 
cients and such that 


b : l/p' oo 1/p 
(| Lo ax) <mome( 2 kr) 
a n=1 


(F. Riesz’s theorem). When the orthonormal 
system is the trigonometric system, this is 
called the Hausdorff-Young theorem. (ii) Let 
{až} be the decreasing rearrangement of {|a,|}; 
then 


b 
owzenl leG)l dx (1<p<2. 


If q22 and X a*^n* ? < oo, then there exists a 
function (x) which has the a, as its Fourier 
coefficients and such that 


b oo 
| lọ(x)| dx « A, » axtnt ? 


(Paley's theorem). When the system is trigono- 
metric, this is called the Hardy-Littlewood 
theorem. (6) If for some positive e we have 

Llc, |? *« oo, then Dc, f,(x) converges a.e. 

(7) If we set s*(x)=sup, | 35, c, f, (x)|, then 
(ele A (È c v* 7)"4 (q > 2), where {c*} is 
the decreasing rearrangement of {|c,,|}. 


C. Examples of Orthogonal Systems 


(1) {cosnx} e O(0, n), {sin nx} eO(0, n). (2) 

(1, cos nx, sinnx} €O(0, 2x) (— 159 Fourier 
Series). (3) Suppose that A(x) ts positive and 
continuous, and let y,(x) be solutions of y"(x) 
T A, A(x)y(x) 20 satisfying the condition y,(a) 
= y,(b) 20, where 4, is any teigenvalue. Then 

{V A(x) y,(x)} € O(a, b) (for orthogonality of 
eigenfunctions — 315 Ordinary Differential 
Equations (Boundary Value Problems) B). (4) 
Set r,(x) - —1 or 1 according as the nth digit of 
the binary expansion of x (0 « x € 1) is 1 or 0, 
and r,(x) 20 if x is expandable in two ways. 
Then (r,(x)] € O(0, 1). This is called Rade- 
macher's system of orthogonal functions. The 
system is not complete, but it is interpreted 

as a tsample space of coin tossing. Rade- 
macher's system is useful for constructing 
various counter-examples. (5) Let the binary 
expansion of n be n=2 +2": +... +2% (v, «€ v; 
<... € vj), and set w,(x) =r, a la (x) 
Filz Then {w,(x)} is a complete or- 
thonormal system called Walsh's system of 
orthogonal functions. This system is interpreted 
as a system of characters of the group of binary 
numbers, and there are many theorems for this 
system analogous to those for the trigono- 
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metric system. (6) In the interval [0, 1], set 
hum a^, xe((k—1/2", (k—1/2)/2") 
=—,/2", xe((k—1/2)2", k/2") 
=0, xe((l— 1/2", 1/2”), 


lzk, 1 «I«2". 


The orthonormal system x(x) (1 <k <2", 

1 &m) is called Haar's system of orthogonal 
functions. The Haar expansion of the continu- 
ous function f(x) converges to f(x) uniformly. 


D. Orthogonal Polynomials (— Appendix A, 
Table 20) 


Suppose that we are given a weight function 

9 (x) 20 (ọ(x)>0 a.e.) defined on (a, b) and that 
the inner product of functions f, g on (a, b) is 
defined by (f, 9) f$ f o)g(x) e(x) dx. We ortho- 
gonalize {x"} by Schmidt orthogonalization 
and obtain polynomials p,(x) of degree n. Here 
the sign of p,(x) can be determined so that the 
sign of the coefficient of the highest power of 

x is positive. We call ( p,(x)) the system of 
orthogonal polynomials belonging to the 
weight function (x). This system is complete 


-in LY (a, b), which is defined to be the space 


of functions f such that (^|f(x)l^ e(x) dx < oo. 
In other words, the system {,/@(x)p,(x)} is a 
complete orthonormal system in the ordinary 
L (a, b) space. Concerning the convergence 
problem of the orthogonal expansion by 
{p,(x)}, the Christoffel-Darboux formula 


pi suus s Bie Ch 


plays an important role. 

_ Several important special functions in clas- 
sical mathematical physics are given by ortho- 
gonal polynomials: 

(1) Setting o(x) e x} (1 +x¥ (a> —1,87 
—1) in [—1, 1], we get the Jacobi polynomials, 
although they are sometimes defined in [0,1] 
with respect to q(x) 9 x*(1 — x)? (— Appendix 
A, Table 20.V). If we set a= in the Jacobi 
polynomials, we get the ultraspherical poly- 
nomials (or Gegenbauer polynomials) (— 
Appendix A, Table 20.1). Furthermore, if a = f 
—0, then we get the tLegendre polynomials, 
and if a= $= —1/2, we get the Chebyshev 
polynomials T,(x) — cos(narccos x). The T,(x) 
also appear in the best approximation prob- 
lem (— Appendix A, Table 20.IT, 336 Poly- 
nomial Approximation). 

(2) If we set o(x)=x"e " in (0, 00), we get the 
Sonine polynomials (or associated Laguerre 
polynomials) with appropriate constant factors. 
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If « is a positive integer m, we get 





WE? d" ntm,-x 
S&™(x)=x~™e Ly e ) 


The particular case m=0 gives the Laguerre 
polynomials. In this case, however, it is cus- 
tomary to normalize them as L,(x)= 

(e*/n!) (d"/dx")(x"e^*) (— Appendix A, Table 
20.V1). Laguerre polynomials are used in 
*numerical integrations of a Gaussian type in 
(0, oo). Furthermore, associated Laguerre 
polynomials appear in the solutions of the 
Schródinger equation for the behavior of 
hydrogen atoms. This system of orthogonal 
polynomials is useful in the expansions of 
approximate eigenfunctions of atoms analo- 
gous to hydrogen, velocity distribution func- 
tions of molecules in gas theory, and so on. 

(3) Setting o(x) e "or e ^?) in (—oo, 
oo), we get Hermite polynomials H,(x)— 

( 1e? (d"e^**/dx"), modulo constant fac- 
tors (— Appendix A, Table 20.VI). Hermite 
polynomials are special cases of parabolic 
cylinder functions (— 167 Functions of Con- 
fluent Type). These polynomials appear as 
eigenfunctions of the Schródinger equation for 
harmonic oscillators. They are also connected 
with probability integrals and are used in 
mathematical statistics. 

(4) Replacing the integral by a finite sum 
I? -o f m)g(m) in the definition of inner pro- 
duct, we get so-called orthogonality for a finite 
sum. (Regarding orthogonal polynomials with 
respect to a finite sum (— Appendix A, Table 
20. VII) and their application to the mean 
square approximation — 19 Analog Compu- 
tation F.) Since orthogonal polynomials with 
respect to a finite sum are often called simply 
orthogonal polynomials by engineers, one 
must be careful not to confuse these with the 
ordinary ones. 
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A. General Remarks 


A vibration or oscillation is a phenomenon 
that repeats periodically, either exactly or ap- 
proximately. Exactly periodic oscillations are 
studied in the theory of tperiodic solutions of 
differential equations. The *period of a solu- 
tion f(t) is called the period of the oscillation, 
and its reciprocal the frequency. The difference 
between the greatest and least values of f(t) 
(globally or in an interval) is the amplitude. 
The theory of vibrations has its origin in the 
study of mechanical vibrations, but its nomen- 
clature has been used also for electric circuits. 
As examples of practical applications of the 
theory of oscillations, we mention, in engineer- 
ing, the prevention of vibrations and the gen- 
eration of stable sustained oscillations, and in 
geophysics, investigations concerning the free 
oscillation of the earth, the existence of which 
has recently been confirmed. 


B. Linear Oscillation 


Periodic solutions of "linear differential equa- 
tions have been studied in detail for a long 
time. Perhaps the simplest case of such an 
oscillation 1s represented by the differential 
equation 


d? x/dt? - n? x «0, (1) 


where the restitutive force is proportional to 
the displacement from the equilibrium posi- 
tion. Typical examples are the free vibration 
of a simple pendulum with small amplitude 
and an electric circuit composed of a self- 
inductance and capacity (without resistance). 
The solution is given by x = Acos(nt + «). This 
is called harmonic oscillation or simple har- 
monic motion. Here the amplitude is A, the 
period is 2z/n, n is the circular frequency, and x 
is the initial phase. 

A system of m tdegrees of freedom (x,, ..., 
Xm) is said to be in free harmonic oscillation if 
the coordinates can be expressed as 


m 


Xi = Y Ax, cos(n,t + 0), 


i=1,2,...,m. 
Each of these simple harmonic oscillations is 
called a normal vibration. As a limiting case, 
where the number of degrees of freedom is 
infinite, we have the vibration of a string: 


O0u(x,t) ê u(x,t) 
3 





ôt? Ox 


u(0, t) 2 u(1, t) =0. 
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The solution is given by a series 


Y. A, sin(knx/l) cos(kznt/l), 
k 


which is just the superposition of the funda- 
mental vibration (corresponding to k — 1) and 
simply harmonic motions of frequencies equal 
to multiples of the fundamental frequency. 

If a resisting force proportional to the veloc- 
ity is acting, the equation becomes 


d?x/dt? -2£dx/dt-- n? x 0, n>e, (2) 
whose solution 
x=Ae“cos(at+a), o=,/n?—e? (3) 


is not periodic. However, x becomes zero at 
a fixed interval z/c, and the extremal values 
in the intervals decrease to zero in a geo- 
metrical progression with the common ratio 
v —exp(— z£/o). This phenomenon is called 
damped oscillation with damping ratio v and 
logarithmic decrement log v= — zeé/v. In this 
case, too, 2z/e is called the period. 

When a driving force term ¢(t) is present in 
the right-hand side of (2), the solution takes on 
the additional term 


-gt 


e 
oe (sin ot | oes cos otdt 
o 


— cosot LS singt a) ; 


which represents the forced oscillation due to 
e(t). 

If <0 in (2) (negative resistance), the solu- 
tion (3) increases in amplitude, so that a small 
disturbance is amplified, resulting ia an auto- 
matic generation of oscillation. This phenom- 
enon is called self-excited vibration. Besides 
being caused by some special kinds of circuit 
elements (e.g., tunnel diodes), such a situation 
often occurs when the vibrating system has 
time delay characteristics (— 163 Functional- 
Differential Equations). 

Among sustained vibrations, other than 
forced oscillations and self-excited vibrations, 
are the parametrically sustained vibrations 
caused by periodic variation of a parameter of 
the vibrating system. Electric wires and panto- 
graphs for use in high-speed electric railways 
must be designed to prevent unwanted para- 
metrically sustained vibrations. On the other 
hand, a parametron is an electric element 
utilizing parametrically sustained vibration. 


C. Nonlinear Oscillation 


Actual vibrating systems contain more or less 
nonlinear elements, which give rise to various 
kinds of oscillations different from those de- 
scribed by the linear theory (— 290 Non- 
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linear Oscillation). For example, d? x/dt? — 

e(1 — x?) dx/dt +x =0 (e>0) represents a stable 
sustained oscillation such that for large values 
of e, two nearly stationary states occur alter- 
nately, the transition from one to the other 
taking place abruptly. This is called relaxation 
oscillation. 
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A. General Remarks 


A statement that is apparently absurd but not 
easily disproved is called a paradox. A con- 
tradiction between a proposition and its neg- 
ation is called an antinomy if both statements 
can be supported by logically equivalent rea- 
soning. In practical use, however, “paradox” 
and “antinomy” often mean the same thing. 


B. Paradoxes in Set Theory 


(1) The Russell Paradox (1903). We classify 
sets into two kinds as follows: Any set that 
does not contain itself as an element ts called a 
set of the first kind, and any set that contains 
itself as an element is called a set of the second 
kind. Every set is either a set of the first kind 
or of the second kind. Denote the set of all sets 
of the first kind by M. If M is a set of the first 
kind, M cannot be an element of M. But if M 
is of the first kind, then M must be an element 
of M, by definition. This is contradictory. On 
the other hand, if M is a set of the second kind, 
M must be an element of M; but since M is an 
element of M, M is a set of the first kind, so M 
cannot be an element of M, by definition. This 
is contradictory. 

Since the kind of reasoning employed in this 
paradox is very simple and is often utilized in 
mathematics, it became popular in set theory. 
To remove this paradox from set theory, Rus- 
sell suggested tramified type theory. If we 
adopt this theory, however, it becomes very 
hard to develop even an ordinary theory of 
real numbers (— 156 Foundations of Math- 
ematics). On the other hand, this paradox, 
together with the Burali-Forti paradox, indi- 
cates that the definition of a set should be 
restrictive. This realization led to the develop- 
ment of *axiomatic set theory. 


(2) The Burali-Forti Paradox (1897). Let W — 
10, 1,2,...,@,...} be the *well-ordered set 

(— 312 Ordinal Numbers A) of all *ordinal 
numbers. Let O be the ordinal number of W. 
Then every ordinal number, being an element 
of W, is less than Q. But Q is an ordinal num- 
ber. Hence, Q <Q. This is contradictory. 


(3) The Richard Paradox (1905). The expres- 
sions in the English language can be enum- 
erated by the device that is applied to the 
usual enumeration of the algebraic equations 
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with integral coefficients. From the specified 
enumeration of all expressions in the English 
language, by striking out those which do not 
define a real number in the interval (0, 1 ], 

we obtain an enumeration of those which 

do. Consider the following expression: “The 
greatest real number represented bv a proper 
nonterminating decimal fraction whose nth 
digit, for any natural number n, is not equal to 
the nth digit of the nonterminating decimal 
fraction representing the real number defined 
by the nth expression in the last-described 
enumeration." Then we have before us a defi- 
nition of a real number in the interval (0, 1] by 
means of an expression in the English lan- 
guage. This real number, by its definition, 
must differ from every real number definable 
by an expression in the English language. This 
is contradictory. 

The following paradox was given by Berry 
(1906): “The least natural number not name- 
able in fewer than twenty-two syllables" is 
actually named by this expression, which has 
twenty-one syllables. The Epimenides paradox 
is a traditional ancient Greek paradox of this 
kind: Epimenides (a Cretan) said, “Cretans are 
always liars. ..." 


C. Paradoxes of the Continuum 


The problem of the continuum is important in 
both mathematics and philosophy. There are 
several paradoxes of Zeno concerning the 
continuum, among which the following two 
are best known: 

(1) Assume that Achilles and a tortoise start 
simultaneously from the points A and B, re- 
spectively, Achilles running after the tortoise. 
When Achilles reaches the point B, the tortoise 
advances to a point B,. When Achilles reaches 
the point B,, the tortoise advances further to a 
point B,. Thus Achilles can never overtake 
the tortoise. 

(2) A flying arrow occupies a certain point at 
each moment. In other words, at each moment 
the arrow stands still. Therefore the arrow can 
never move. 
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A. General Remarks 


A partial differential equation is a functional 
equation 

d Oz. 
F| xq, Xe Za, ]20 
( ; ax, Ox?’ ôx, Ox, ) 


that involves a function z of independent vari- 
ables x,, X2,...,X,, its tpartial derivatives, 
and the independent variables x,, ..., x,. The 
definition of a system of partial differential 
equations is similar to that of a system of 
tordinary differential equations. (The partial 
differential equation becomes an ordinary 
differential equation if the number of inde- 
pendent variables is one.) 

The torder of the highest derivative appear- 
ing in a partial differential equation is called 
the order of the partial differential equation. 

Usually we write p; for 0z/Ox;, x for zx, and 
y for x, when n=2, and p=0z/0x, q=0z/éy, 
r= 07z/0x?, s 0? z/OxOy, t 2 0? z/Oy? when z 
is a function of x and y. 

A partial differential equation is called linear 
if it is a linear relation with respect to z and its 
partial derivatives. For example, the equation 


A(x, y)r + B(x, y)s + C(x, yt + D(x, y)p 
+ E(x, y)q + F(x, y)z= G(x, y) 


is a linear partial differential equation. A par- 
tial differential equation is quasilinear if it is 
a linear relation with respect to the highest- 
order partial derivatives. A partial differential 
equation is called nonlinear if it is not linear 
(— 291 Nonlinear Problems). A function z= 
Q(x,, X5, ..., Xn) that satisfies the given partial 
differential equation is called a solution of 
the partial differential equation. Obtaining 
such a solution for a given partial differen- 
tial equation is called solving this equation, 
and by analogy to the case n —2, the integral 
hypersurface of the equation is 


z— Q(x,, X2, ..., x,) =O. 


For a system of partial differential equations, 
we define solutions in the same manner. 
Example 1. X,, X2, ..., X, are functions of 
n independent variables x,, x5, ..., Xn. Then 
solving the partial differential equation 


oz 0z Oz 
—-+X,—+4+...4+X,—= 
Óx, 


0 (1) 
is equivalent to solving the system of ordinary 
differential equations 


dx, dx; dx, 
X, E x. D 





Q) 
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In other words, if f,, f2,...,f,-1 are n— 1 in- 
dependent tintegrals of (2), then for an arbi- 
trary function ®, z 2 d f,,..., f, .,) is a general 
solution of (1) (— Section C). 

Example 2. If P,, P}, ..., P,, R are functions 
of independent variables x,,..., x, and the 
dependent variable z, and if the quasilinear 
partial differential equation (Lagrange's dif- 
ferential equation) 


P,—+P,—4+...+P,—-=R 3 
“ax, 70x, "Ox, 6) 
has an integral hypersurface V(z, x,, ..., x,) — 0, 
then we have 

P OV +P. SE +P, XR (4) 
Lax ^ X2 "77" "ex êz "7 


which is an equation of type (1). From this we 
can obtain a general solution by the method of 
example 1. The same procedure is applicable 
to solving other systems of partial differential 
equations. 


B. Characteristic Manifolds 


We consider a partial differential equation of 
the nth order of two independent variables 
xX, y: 


F(x, Y, Z, Pio» Pors +--> Pnos «++» Pon) = 9, 


oitkz 
where p ==. (5 
With this equation, we associate a manifold 
defined by a real parameter 4, 


x-x(A, y=y(A), Pik 7 Pj (A), 


j, k-0, hoo nb jrk&n-1, (6) 


and consider the following problem: Find a 
solution ot, y) of (5) that satisfies 


ai p(x, y) 
gigy: Pa. 


j, k=0, 1, 2,...,n—1; jrk&n-—1 


on the curve x= x(A), y — y(4). We call this 
problem the Cauchy problem for equation (5). 

If F vanishes for.a system of values x?, y?, 
p9 (j,k—0,1,2,..., m poo z9;j +k <n) and is 
tholomorphic in a neighborhood of this sys- 
tem of values, if x, y, py, are holomorphic 
functions of 4 in a neighborhood of 4-0 and 
take the respective values x°, y?, pj, at 4 —0, 
and if 


Po dy" — P,., ,dxdy" | + DE 
+(—1)"Po,dx"#0 (7) 
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for (x°, y9, py), then we have a unique holo- 
morphic solution of this Cauchy problem in 

a neighborhood of (x°, 9, p9). (Here we use 
the notation Pj, — OF /dp,,.) This is Cauchy's 
existence theorem. We cannot apply this theo- 
rem when the left-hand side of (7) vanishes 

at (x°, y°, ph). At such a point, uniqueness of 
the solution fails, and there may be several 
solutions through the point (x, y°, pà). There 
are n curves on which the left-hand side of (7) 
vanishes on the integral surface z = (x, y). 
These curves are called characteristic curves of 
(5). We associate the values 


(9-2 9) 
BIS Oxioy* 


H 


x=x(A), y=y(A) 


j,k=0,1,...,m; jtk<n, 


with each point (x, y) on these curves. The 
manifold (x(4), y(A), p(A)} (P) — (py (4)]) of 
the parameter 4 is called a characteristic mani- 
fold of equation (5). Cauchy's existence theo- 
rem cannot be applied on the characteristic 
manifold. 

The foregoing considerations can be ex- 
tended, to some extent, to the space of higher 
dimensions R” or C”. Let P be a linear partial 
differential operator of order m: 


P(x,D)= Y, a,(x)D*, 


la| <m 


E E EE |x| — 
= i) os ae a) =a, +... + an. 


The coefficients are assumed smooth and real 
in R” or holomorphic in C”. Its homogeneous 
part of order m, denoted by P.,(x, D), is called 
the principal part of P. Let S be a hypersurface 
defined by o(x) 20 with grad o(x) z O0. S is 
called a characteristic surface if P, (x, grad o(x)) 
— 0 holds for x in a neighborhood of S or 
merely for x eS, and (x) is called a phase 
function. A real vector €°( 40), or a complex 
vector El #0 is called a characteristic direction 
at the point xo if P, (xo, £9) D. The zeros (x, č) 
(č 40) of P, (x, é) are called the characteristic 
set. Furthermore (xo, č?) is called simple if 
grad, P, (xo, £9) #0. Suppose that (xo, £9) is 
simple. The integral curve (x(t), £(t)) that 
satisfies 





X — grad. P,(x, č), 
é= — grad, P(x, £) 


is called the bicharacteristic strip of P Evi- 
dently P, (x, £) is constant along the bicharac- 
teristic strip. In particular, if this constant is 
zero, it is said to be null-bicharacteristic. The 
phase function g(x) can be obtained at least 
locally by using the bicharacteristic strip. 
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C. Classification of Solutions 


First, we consider a partial differential equa- 
tion of the first order of two independent 
variables: 


F(x, y, Z, p, q) - O0. (8) 


A solution of (8) that contains two arbitrary 
constants is called a complete solution. If we 
get one complete solution of (8), then we can 
obtain all the solutions by differentiations and 
eliminations. Let (9) be a complete solution: 


V(x, y, z, a, b) - 0. (9) 
Differentiating this, we get 


oV | OV oV Oh 
ar?» è arce (10) 
Eliminating a and b from (9) and (10), we get 
the original equation (8). Furthermore, solving 
equation (8) is equivalent to getting three func- 
tions a, b, z of x, y from (8), (9), and (10). If we 
regard a, b as functions of x, y, in (9), we get 


ôV õa OVóOb | ôV ôa OV ab _ 
0aóx óObóx  '  0a0y dbéey — 





0. Ou 


Therefore we can replace (9) and (10) by (9) 
and (11). Hence we have the following three 
cases: (1) When a, b are constants, we get a 
complete solution. (ii) When V —0, 2V/0a —0, 
and 0V/db=0, we get a solution that does not 
contain arbitrary constants, because z, a, b are 
all functions of x and y. We call this solution a 
singular solution of (8). (iii) When 0V/0a, 0V/Ob 
do not vanish simultaneously, the Jacobian 
D(a, b)/D(x, y) vanishes because of (11). This 
means that there exists a functional relation 
between a and b. If there are two such rela- 
tions, a and b are constants and the solution 

z becomes a complete solution. Therefore we 
assume that there is only one such relation 
between a and b, whose form is assumed to be 
b= (a). Then we get 


OH OV 
V(x, y,z,a, p(a))=0,  —-———e(a)-0. (12) 

da Ob 
If we solve (12) for the unknowns a and z, we 
get a solution z of (8) that contains an arbi- 
trary function q instead of arbitrary constants. 
Such a solution is called a general solution of 
the partial differential equation (8) of the first 
order. By specializing this function @, we ob- 
tain a particular solution of (8). Thus (1)- (iii) 
exhaust all the cases, and by obtaining a com- 
plete solution of (8) we can get all the solu- 
tions of (8). The number of complete solutions 
may be more than 1, or it may be infinite. 
These complete solutions can be transformed 
into each other by ‘contact transformations. 
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Moreover, they are contained in the general 
solutions. 

Now we consider the case where the number 
of independent variables is n. Take an equa- 
tion that contains n —r + 1 arbitrary constants 





Ai, 25 siut 
V(X 15 X25 0269 Xn: Gy, Bz, ..., 04,41) =O. (13) 
Differentiate this equation assuming that 
du dau take fixed values. Then we get 
OH OH Oz 
-— + ;—=0, i= D 1-12: 
Ox; Oz j Ox; : 
(14) 


If we eliminate a,, a5, ...,a,. ,,, from (13) and 
(14), we obtain r partial differential equations 


of the first order: 
Fj(xi, X5, ... p) ^0, 


j=1,2,...,r. (15) 


s» Aas Zs P1 Dä 


We call (13) a complete solution of (15). In this 
case, as in the case when n=2, we get all the 
solutions of (15) from a complete solution (13) 
of (15). We have the same classification as in 
the case n —2: (i) When we regard a,,...,a, ,.44 
as constants, then we have a complete solution 
of (15). (i) When we can eliminate the con- 


stants di... Oe ,,, from equations 
OV. oV 

V=0, AT =0,. z——— =0, 
0a, Br uen 


we get a solution that does not contain an 
arbitrary constant. Such a solution is called a 
singular solution of (15). (iii) If not all of the 
OV/0a; vanish, there exists at least one func- 
tional relation among a,, 25, ...,8, ,,,. We 
assume that there exist exactly k (<n—r) 


relations among d. a2, ..., 0, 144: 
fjí(81,02,...,4, 2,41) 90, j=1,...,k. (16) 
Then there exist numbers 24, 45, ..., A, such 
that 
oV d 
CN 

l l=1,...,.n—r+1. (17) 
Hence, by eliminating a,, 0... anne 445 


Az, ---, Ay from (13), (16), and (17), we generally 
obtain exactly one relation between x4, ..., x, 
and z. This is a solution of (15) that contains 
exactly k arbitrary functions fi, f,,..., bh, Such 
a solution is called a general solution of (15). In 
particular, if k —n—r + 1, then it is a complete 
solution. We might think that there are n—r 
general solutions corresponding to k- 1, ...,n 
—r. But these general solutions are not essen- 
tially different. For the partial equation of the 
second order F(x, y, Z, p, q, r, s, t)=0, this defi- 
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nition of a general solution is not applicable 
since we cannot successfully define a general 
solution by using the number of arbitrary 
functions contained in a solution. 

Instead, we now use the following definition, 
due to J. G. Darboux: A solution of a general 
partial differential equation is called a general 
solution if by specializing its arbitrary func- 
tions and constants appropriately we obtain 
a solution whose existence is established by 
Cauchy’s existence theorem. A solution z= 
Q(x, y) of a general partial differential equa- 
tion is called a singular solution if Cauchy's 
existence theorem cannot be applied on any 
curves on the manifold formed by z= (x, y), 


p—609/0x, q=00/0y. 


D. Cauchy's Method 


We can regard equation (8) as a relation be- 
tween the point (x, y, z) on the integral surface 
S and the direction cosines of a tangent plane 
at that point. Therefore the tangent planes at 
all points of the surface form a one-parameter 
family. They envelop a cone (T) whose vertex 
is (x, y, z) on S. The tangent plane at a point M 
on the integral surface S is tangent to this cone 
(T) along one generating line G of (T). 

A curve on S whose tangents are all generat- 
ing lines of (T) is a characteristic curve. If we 
write 


———, P =—, =—; 
Oz Op Q ôq 

then the characteristic curve is given by the 

system of ordinary differential equations: 

dx _4y dz —dp —dp 


P Q Pp+Qq “X+pZ Y+qZ 





(18) 


We call this system the characteristic differen- 
tial equation or Charpit subsidiary (auxiliary) 
equation of the partial differential equation (8) 
of the first order. System (18) determines not 
only x, y, z but also p and q. The set of these 
tsurface elements (x, y, z, p, q) is the character- 
istic manifold. This characteristic manifold is 
considered as a part of the integral surface 
with infinitesimal width, and in this case we 
call it the characteristic strip. The character- 
istic strip is represented by the equations x = 
x(4), y= y(4), z 2 z(4), p= p(4), a — 4(4) con- 
taining a parameter. On the integral surface 

z — z(x, y), we have 


dz Oz dx EC 
dà Ze di *oydà 


and 


dz=pdx+qdy. | (19) 
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Equation (19) is called the strip condition. The 
equations (18) evidently satisfy this condition. 
The solution u(t, x) 2 u(t, X1, ..., x,) of 


uf oe Xps Uga ie 


u(0, x)= p(x) 


„ux, )=0, 


is obtained (at least locally) as follows. Let the 
solution of the differential equations 


dx; 

dt = f, E E 

d 
dt Felt E EE Ixixn, 


issuing from (xo, č?) at t=0 be (x(t; xo, £9), 
E(t; xo, °)). Then specializing č? = e, (xo) 

(1 €j € n), the solution u(t, x) is obtained by 
quadrature along these curves (characteristic 
strips) from the relation 


du ` 


dT -ftx D Y Efex 0) 


In particular, when f(t, x, č) is homogeneous 
of degree 1 in £, by Euler's identity the right- 
hand side is identically zero. This means that u 
is constant along the characteristic strip. 


E. Homogeneous Partial Differential 
Equations 


Assume that f(€,,€3,...,€,) is a Thomogene- 
ous polynomial of m independent variables 
Zu, Gan, We denote the differential oper- 
ator 6/éx, by D,. Then consider a homogene- 
ous partial differential equation 


f(D,, D>, ..., D,)u 0. (20) 


We can obtain a homogeneous equation from 
an inhomogeneous partial differential equation 
by transformation of the dependent variable. 
For example, D2w = D,w becomes the homoge- 
neous partial differential equation (D? — D, D4)u 
=Q by the transformation of the dependent 
variable u — e**w. The equation (D? — D, Du = 
0 corresponds to the homogeneous polynomial 


TU 63) 2 61 — 6265. Generally, for equa- 
tion (20), we consider the solution 

u= F(0,,02,..., 0), (21) 
where 0,, 02, ...,0, are functions of x,, ..., Xm 


and F is an arbitrary function of 6j. Such a 
solution is called a primary solution of (20). 
For example, for the equation (D? — D2)u — 0, 
there are two primary solutions, u = F(x, 4- x5) 
and u= F(x; — x2). For tLaplace’s equation 
" ^u tu ^u » Ou 
u-——— —;-—0, 
ôx? ôy? és? 
we have a primary solution u = F(z + ixcos a + 
iysina), where a is a parameter. 
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Second, for the twave equation 
o*u u eM USCH (22) 
Qx* Oy? og top 
we have a solution 
1 r z—r 
u — yf(a, P), Y= X-—lL—-, f= Se 
r [a X+1y 
(23) 


where r? 2 x? 4- y?+2?, f is an arbitrary func- 
tion, and y is a particular solution of (22). 
Furthermore, v= f(a, f) satisfies the equation 
of a characteristic curve of (22): 


Or Së ov Se Ov pea óv V? 
Óx Or 02] cWN0t] ` 


Such a solution, which is the product of par- 
ticular solutions and some function that con- 
tains an arbitrary function, is called a primitive 
solution of the original equation. Equation (22) 
has another primitive solution of the type 


1 ( r n9 
u-—-g br, a , 
r € X-ly 


where g is an arbitrary function. 
Laplace's equation has a primitive solution 


"e z—r 
u=— f 
r` \x+iy 


A basic equation is an equation, such as La- 
place’s equation, that has a primary solution 
and a primitive solution. A solution of an 
equation that has the same characteristic 
curves as a basic equation can be obtained 
from a particular solution of the basic equa- 
tion by integrations and additions. For exam- 
ple, if we choose a particular solution u-r ! 
of Laplace's equation, which is a specialization 
of the primitive solution u —r ! f((z —r)/(x + iy)), 
then 


S =| {fre =f yay Rel 


x F(¢,n, Ode dy de 














is a solution of 
Au t 4n F(x, y, z)=0 


in the interior of the domain of integration. 


F. Determined Systems 


The general form of a system of partial dif- 
ferential equations in two independent vari- 
ables is 


ue), yom) 


1 2 1 1 
F(x, yu, e uP, uh, uy”, 


Wise, i=1,2,...,h, (24) 


i.e., a system of h equations for m functions 
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u D, vil... vil of the independent variables x 
and y. The system is called a determined sys- 
tem if h =m, an overdetermined system if ^ > 
m, and an underdetermined system if h <m. 

An example of a determined system is the 
tCauchy-Riemann equation 


u,—7,—0, u,+v,=0, 


for u(x, y), v(x, y), which can be further reduced 
to two determined equations Au =0 and Av = 
0. 

A simple example of an overdetermined 
system is 


u= f(x,y)  u,= f(x,y). 


A necessary and sufficient condition for the 
existence of a solution of this system is f, = fx- 
The Cauchy-Riemann differential equations for 
a holomorphic function f(z,,z;)—-u- iv of two 
complex variables z; =x, 4-iy,,z;— x54 iy; 
are 


Ws; Dy, Hx, — Dy, Uy, = — Ux,» 
Ha = Dan 

which can be reduced to 

Uy x, T My y, = 0, Uy x, T Hy y, = 0, 


Rea + Uy oy. = 0, Hy; ES Hy, = 0, 


which is also an overdetermined system. 

An example of an underdetermined system 
is 
O(u,v) ` 
O(x, y) 
If this equation holds, there exists a functional 


relation w(u, v) 20 that can be regarded as a 
solution of this underdetermined system. 


=0. 





U Vy — UU, 


G. General Theory of Differential Operators 


In recent developments of the theory of par- 
tial differential equations, there is a trend to 
construct a general theory for tdifferential 
operators regardless of the classical types 

of differential equations (— 112 Differential 
Operators). For example, we take a property 
that is satisfied by some equation of classical 
type (e.g., an telliptic differential equation) and 
proceed to characterize all equations that have 
this property. (For example, *hypoellipticity is 
a property of classical tparabolic and elliptic 
equations.) There are several basic problems in 
this general theory: the existence of a funda- 
mental solution, the existence of a local solu- 
tion, unique continuation of solutions, the 
differentiability and analyticity of solutions, 
and the propagation of smoothness. We ex- 
plain here only two of them: the fundamental 
solution and the local existence of solutions. 
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H. Fundamental Solutions 


L is assumed to be a linear partial differential 
operator with constant coefficients. If a tdistri- 
bution E satisfies the equation 


LE(x)= ó(x), 


where ó(x) is the tDirac 6-function, then we 
call E(x) a fundamental solution (or elementary 
solution) of L. Also, if L is a linear differential 
operator and E satisfies the equation 


L,E(x, y) - ó(x — y) 


then we call this distribution E(x, y) a funda- 
mental kernel (or elementary kernel) of L. 

Let L be a differential operator with con- 
stant coefficients and E(x) be a fundamental 
solution of L. Then E(x, y) is a fundamental 
kernel of L. Sometimes E(x, y) itself is called a 
fundamental solution. L. Ehrenpreis and B. 
Malgrange proved that any linear differential 
operator with constant coefficients has a fun- 
damental solution [4]. 

If we take a fundamental solution (funda- 
mental kernel) E and add to it an arbitrary 
solution of the equation Lu —0, then we get 
another fundamental solution (fundamental 
kernel) of L. This freedom of the fundamental 
solution (fundamental kernel) can be used to 


‘construct *Green's functions of the boundary 


value problem of elliptic equations and of the 
mixed initial-boundary value problem for 
parabolic equations. A Green's function is a 
fundamental solution (fundamental kernel) 
that satisfies given boundary conditions (— 
188 Green's Functions; 189 Green's Operator). 
The fundamental solutions (fundamental ker- 
nels) relative to the Cauchy problem are also 
defined as in this section. For example, con- 
sider a fundamental solution of the Cauchy 
problem concerning the future behavior of 

a differential operator L —0/0t — P(0/0x), 
namely, a distribution E(t, x) that satisfies LE — 
0 (t2 0) and E(t, x)|, o — 9(x). If we put E(t, x)= 
E(t, x) (tz 0) and E(t, x) - 0 (t <0), then E(t, x) 
is a fundamental solution (or kernel) of the 
differential operator L, that is, LE =d(t, x). 
Sometimes a fundamental solution of the 
Cauchy problem for a parabolic equation is 
called a Green's function. On the other hand, a 
fundamental solution (or kernel) of the Cauchy 
problem for a hyperbolic equation is called a 
tRiemann function. A Riemann function actu- 
ally is not always a function; in general it is a 
distribution. 

Example 1. A fundamental solution of the 3- 
dimensional Laplacian 


Ge Oe OF 


ôx? ôx? ax? 


is E(x)= —1/4nr, where r=,/ x? - x - xà. 


A- 
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Example 2. A fundamental solution of 
the Cauchy problem for the future of the 3- 
dimensional wave operator 


e 3 e 
or AUD 


ie., a distribution E(t, x) (t 2 0) that satisfies 
LE —0 (t 0), E(0, x) 20, and (0/0t) E(0, x)= 
ó(x), is given by E(t, x) —(1/4nt)ó(r — t) (t > 0), 
r= SE t x24 x3. A fundamental solution 
for L is given by E(t, x)= E(t, x) (t20); = 

0 (t <0) (2 Appendix A, Table 15.V). 


I. Existence of Local Solutions 


Given a linear differential operator L and the 
equation Lu = f, we have the problem of deter- 
mining whether this equation always has a 
solution in some neighborhood of a given 
point. If the coefficients of L and f are holo- 
morphic in a neighborhood of this point and 
if the homogeneous part of highest order 
does not vanish, then there exists a solution 
that is holomorphic in a neighborhood of the 
given point (*Cauchy-K ovalevskaya existence 
theorem). 

If L is a linear differential operator with 
constant coefficients, E is a fundamental solu- 
tion of L, and f is a function (or distribution) 
that ts zero outside of a compact set, then we 
have a solution u that is the tconvolution of E 
and f:u= E = f. On the other hand, H. Lewy 
proposed the following example [3]: 

Qu |. Qu . . . Ou 
Lu=—-+i———+2i(x, +ix,)— f(x) 

ôx; xX, Ox; 
where f is a real function of x,. He showed 
that if this equation has a solution that is of 
class C', then f must be real analytic. There- 
fore, if f is of class C? but not real analytic, 
then this equation has no Cl solution. Actu- 
ally, no solution exists even in the distribution 
sense. (Note that, since the coefficients of L are 
now complex-valued, the results mentioned at 
the beginning of this section are no longer 
applicable.) 

For linear differential operators L, a study 
by L. Hórmander gives some necessary con- 
ditions and also some sufficient conditions for 
the local existence of a solution of the equa- 
tion Lu — f for sufficiently general f [4]. This 
result has been developed and completed by 
L. Nirenberg and F. Treves [18] and by R. 
Beals and C. Fefferman [19]. The operator 
considered by S. Mizohata (J. Math. Kyoto 
Univ., 1 (1962)), 
L= A 

zov 


serves as a standard model in this problem. In 
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the neighborhood of the origin, if k is even, L 
is locally solvable, and if k is odd, it is not 
locally solvable. However, for linear partial 
differential operators with multiple character- 


‘istics, the problem of local solvability becomes 


extremely difficult. 
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321 (XIII.21) 
Partial Differential Equations 
(Initial Value Problems) 


A. General Remarks 


First, we give two examples of initial value 
problems for tpartial differential equations. 

(I) Consider the partial differential equation 
uy — uy — 0 of two independent variables x and 
y. If the function (y) is of class C+, then u = 
Q(x 4- y) is a solution of this equation that 
satisfies u(0, y) = p(y). 

(II) We denote a point of R"*! (or C"*!) by 
(t, x), x 2 (x4, ..., x,). Let L be a linear partial 
differential operator of order m: 

e" IM 
L=—,+ ¥ dos luës, Ox 


ot" Iv £m 


|v| 2 vo v4 t... v. Vo «m, 


where the coefficients a, , lt, x) are func- 
tions of class C? (i.e., treal analytic functions 
or holomorphic functions) in a neighborhood 
of (t, x)= (0, 0). If the functions v(t, x) and w,(x) 
(0x k « m — 1) are of class C? in a neighbor- 
hood of (t, x) — (0,0), then there exists a unique 
solution u(t, x) of class C? in a neighborhood 
of (t, x)= (0, 0) that satisfies 


L [u] = v(t, x), 
tu 
zr 0:9 = wal), O0<k<m-l. (1) 


This is called the Cauchy-Kovalevskaya exis- 
tence theorem (for linear partial differential 
equations). 

As in (II) we choose one of the independent 
variables as the principal variable and regard 
the others as parameters. When we assign a 
value a to the principal variable t, the values of 
the dependent variables (unknown functions) 
and their derivatives are called initial values, 
initial data, or Cauchy data. Conditions to 
determine initial values are called initial con- 
ditions. The problem of finding a solution of 
(1) under given initial conditions is called an 
initial value problem or Cauchy problem. We 
may consider initial value problems not only 
for initial conditions on a hyperplane t=a, but 
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also for initial conditions on a hypersurface, 
called an initial surface (— Section C). 

Let a(x, D) be a linear partial differential 
operator of order m: 


el 
a(x, D) a,{x)D*, | D'z—————, 
pon) PX 09 Oxf... On 


jal =æ +... + an 


where the coefficients a,(x) are of class C? in a 
neighborhood of x=0. Its characteristic poly- 
nomial is 

h(x, J » ax), Ct = ert. Ex 

Let S:s(x)=0 be a regular surface Oe, s, = 
(0s/0x,, ...,0s/0x,)#(0)) of codimension 1. 

We suppose that S is a tnoncharacteristic 
surface, that is, h(x,s,) 40 on S. Let v(x) and 
w(x) (0<k € m — 1) be the functions of class C? 
in a neighborhood of x —0 and on S, respec- 
tively. We consider the Cauchy problem 


k 


a(x, D)u(x) = v(x), Sos 


w(x) on S, 


O<k<m-1, (1) 


where n is the outward normal direction of S. 
S is thus the initial surface. Then there exists a 
unique solution u(x) of class C? in a neighbor- 
hood of x =0. In fact, by the change of vari- 
ables X, =s(x), X; 2x; 2«ixn) if 0s/0x, £ 

0 on S, this problem can be reduced to the 
Cauchy problem (1) by taking account of the 
fact that h(x, s,) ZO on S. 

The Cauchy-Kovalevskaya theorem asserts 
the local existence of solution when the initial 
values are of class C?. Indeed, J. Hadamard 
noted that if the initial values are not of class 
C^, the initial value problem does not always 
have a solution. For example, consider the 
initial value problem 


LH LN aS 
ôx? oy Gei: 


with the initial values 
Qu 


u(0, y, z) — w(y, z), odo." 


If the function w(y, z) is not of class C? in any 
neighborhood of y 2z —0, the solution of this 
problem can never exist in (or even on one side 
x 20 of) any neighborhood of x 2 y=z=0. 


B. The Cauchy-Kovalevskaya Existence 
Theorem for a System of Partial Differential 
Equations in the Normal Form 


The Cauchy-Kovalevskaya existence theorem 
(1) is extended for more general systems of 
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partial differential equations in the normal 
form studied by Kovalevskaya. Consider 


^ 


Cru, 


Sp SE [scio 





Alvi 
O uj 


——————,eed 
Ot' exp... Ox" ) 


where 1 <i, j&m, |v|= vo +v; +... +Y € pj Y< 
pp and x =(x,,...,X,). We assume that F, (1 < 
i <m) are functions of class C? with respect to 
arguments t, x, u,, ... Me. 0 uf Ot" oOx]... 
Ox," ...,in a neighborhood of (0,0, ...,0). If 
the functions wa(x) (I <i<m,0<k<p;—1) 

are of class C? in a neighborhood of x «0, 
then there exists a unique solution u; (t, x), ..., 
us (t, x) of class C? in a neighborhood of (t, x)= 
(0, 0) that satisfies the equations and the initial 
values 

Ak 


NM u; . 
39 ger l<i<m, O<k<p,—1 
C 


i 


[2.4]. 


C. Single Equations of the First Order 


For a single partial differential equation given 
in the normal form 


Cu P Ou Ou 
Self been i (2) 
, Cy, OV, 


we assume that F(x, Yi, .... Yk U, Q4, ..., dj) is a 
real-valued function of class C? in a neighbor- 
hood of x «a, y; b, u=c, q;— d;, and that 

Q Y. -+ Yg) is also a function of class C? such 
that o — c, 09/0 y; — d; at y; — b. Then there 

is a solution u of (2) in a neighborhood of 
x—a and y;— b, that satisfies u(a, y, ..., y) — 
vlt, pl and is of class C?. 

For the uniqueness of solutions of this 
Cauchy problem, A. Haar (Atti del Congresso 
Internazionale dei Matematici, 1928, Bologna, 
vol. 3) showed that if F satisfies the tLipschitz 
condition 


| F(x, yu, q') - F(x, y. u, q)] 
k 

<AY lo qi Blu —ul, 
i=1 


then the solution of the initial value prob- 
lem for (2) is unique. To obtain this result he 
studied the partial differential inequality 


k 
«AY 
i=} 


Next, we consider more general equations of 
the first order 


Cu Qu 
la ss ln (4) 


^ 


Cu 


^ 





ou 
EE E o (3) 
OCH, 


^ 


OX 
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Suppose that F(x,,...,X,,U,P1,-.-.p,) is a real- 
valued function of class C? in a neighborhood 
of x, aj u=b, pj es @(%1, ...,X,), $09, an X) 
are functions of class C? in a neighborhood 

of x;— a, that satisfy b — o(a,,...,a4), c;— 
(09/0x)),—4, S(a,, ...,a,) 2 0; and 


k OF oS 
—— #0. 5 
bs Op, OX, | 


Then there exists a solution u of (4) of class 
C? in a neighborhood of x =a that satisfies u= 
(x) on the hypersurface S(x) ^ 0. 
Furthermore, if F, o, S are of class C! and 
satisfy (5), then there is at most one solution u 
of (4) of class C! in a neighborhood of a that 
satisfies u = o (x) on the hypersurface S(x) — 0. 
These facts can be proved in the following 
way. By choosing S, (x), ..., $, ,(x) and then 
S(x) so that the tJacobian 0(S, $,,..., $, ,)/ 
O(x,,..., xj) does not vanish and by changing 
variables from x to S, $,,..., $, ,, we obtain 
a normal form solved for éu/éS by condition 
(5) (this condition means that the hypersurface 
S(x) 20 is not tangent to the tcharacteristic 
curves). 


D. Quasilinear Equations of the Second Order 


Consider the equation 


32 


k ^u 
Kë a;(x, u, SESCH 


J 





+ b(x, u, p) - 0, 
ij 
where x 2(x,, ..., x), P — (pi, ..., Pi), and p;= 
Ou/Ox;. We assume that the initial conditions 
are u= (x) on S(x) 20 and 





on the same hypersurface. Taking the other 
functions S, (x), ..., Sj ,(x) and then S(x) so 
that the Jacobian 0(S, $,, ..., $, .,)/0(x,, ..., x) 
does not vanish, we change the variables x to 








$9,841, +++) Sy, (S0 =S, S; = Sj). Then we get 
k-1 "u Si E 
S Sy EE —0, 
HI S en ( ) 
where 
k 08,08. 
S LÉI SE FI t, 
QU Sam 2 A Bx, 
E. 0S, CN 
PU A Ox; OS, 
E E: 0?8, Qu 
b*=b+ ü Au 
à. UU Ox Ox, 0s, 


When s, =0, the initial conditions are trans- 
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formed to 


u- Q*(s,, ...,$, 1) 9(x(0, gu, Sk-1)), 


Ou 
a cM Is sory Sp—1) 
So 


k-1 (AS M] 
3b (=) | DEE es Sk—1)) 





Suppose that Q(S, S) #0, and set du/ds;= q;. 
Then equation (6) added to the initial con- 
ditions u— Q*(s,, ...,s, 1) do — V" (s, ..., $4.4) 
at So =0 is equivalent to the system in the 
normal form of partial differential equations of 
the first order 





Ou 0q, 04 

—- n —1,...,k—1, 
Ze" so as, 

Ódo _ 1 bib 

és, — Q(S,S) | = 2, OS,, $) 


-20(8, 


ER 


do — V*(s,, ..., Sy 1). 


with initial conditions 


u — Q*(s,, ..., 8,1), 
-09*/0s, 


when sọ =0. Thus, if the coefficients are of 
class C®, the preceding theory applies to this 
equation. 


E. Continuous Dependence of Solutions on the 
Initial Values 


First, we consider the following simple exam- 
ple of a linear equation. The wave equation 


for v(x, t) is the simplest thyperbolic equation. 
Its solution satisfying the initial conditions 
v(x, 0) — f(x), (0v/0t)(x, 0) — g(x) is given by 


x+ct 


v(x, Sura äs tz) g(A)dA4. 


x-—ct 

It is obvious from this expression that if we 
regard f(x) and g(x) as elements of the tfunc- 
tion space C(R) of continuous functions of 
xeR with the topology of tuniform conver- 
gence on compact sets, then v(x, t) is deter- 
mined as the value of a linear operator from 
C(R?) to C(R?) on xt-space. If such continuous 
dependence on the initial values is established, 
or more precisely, if there is a unique solution 
for sufficiently smooth initial values that de- 
pends on the initial values continuously in a 
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suitable sense, we say that the initial value 
problem is well posed (properly posed or cor- 
rectly posed). 

Systematic research on the well-posedness 
of Cauchy problems was initiated by I. G. 
Petrovskii, who considered the following sys- 
tem of partial differential equations, which is 
more general than differential equations of the 
normal form. (The coefficients are all assumed 
to be functions of t only.) 


Ou, N Qo vits tvy, (x, t) 
TE E e Sech xs 


k=1 |v| &m 





3. Bd, Sal, fei... .N. 


Iv] = vo tyi +... + Yp Vo Hr, (7) 


This has a normal form if n, 2 m (k=1,..., N). 
Now, taking the derivatives 


0 yr! QW 

(5) uj(x, t), (=) uj(x, t),..., 
T 

EA ek 


as new unknowns, we get another system: 


y, t.t v. 


e d 
Gë pas Oto, A OF Bape 0D 


+C(x,t), j=1,...,N’. (8) 


We take as the space of initial values the 
ttopological linear space composed of all func- 
tions whose derivatives up to a sufficiently 
large order are bounded on the whole space R" 
and equipped with the topology determined by 
the tseminorms that are the maximums of 
derivatives up to a given order on the whole 
space, and we take as the range space a simi- 
lar space on the xt-space, where xeR" and 
0<t<T. Then we can formulate a necessary 
and sufficient condition for the well-posedness 
of the initial value problem for the future (the 
problem is regarded as specifying a mapping 
that assigns to the initial values on t=0 the 
values u(x, t) for t>0). To give such a con- 
dition we consider the following system of 
ordinary differential equations, which are 
given by a *Fourier transformation on the 
x-space of system (8): 





zo d due, UI 
x Qrié;)^ ... Qnic,)" f. (S, t) + Cj, t), 


j=1,...,N’. (9) 


If the "fundamental system of solutions of (9) 
is v 9(5,t) (i2 1, ..., N', jo 1, ..., N’), then the 
condition is that these functions satisfy the 
inequalities 


le als CH ay, O<t<T, (10) 
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where C and L are constants independent of č. 
This is the condition obtained by Petrovskii. 
If system (7) 1s of normal form and is well 

posed for the future, it is also well posed for 
the past. In this case, equation (7) is said to be 
of hyperbolic type (— 325 Partial Differential 
Equations of Hyperbolic Type). An example 
that is not of a normal form and is well posed 
is a parabolic equation (— 327 Partial Dif- 
ferential Equations of Parabolic Type). In 
Petrovskifs theory it is sufficient to assume 
that the coefficients in (7) are continuous and 
bounded, and we can take T arbitrarily large 
provided that (10) is satisfied. Hence Petrov- 
skif's theory guarantees global existence of 
solutions. 


F. Uniqueness of Solutions 


If a linear partial differential equation of the 
first order of normal form has coefficients of 
class C*, then the solution of class C! satisfy- 
ing the prescribed initial conditions is unique 
(Holmgren's uniqueness theorem, 1901). Every 
system of partial differential equations of 
higher order of normal form can be reduced to 
a system of the first order of normal form. 
Therefore, if the coefficients are of class C?, 
there is only one solution for the original 
initial value problem with continuous partial 
derivatives up to the order of the equation. 
Moreover, if an analytic manifold S of dimen- 
sion n — 1 (n is the number of independent 
variables) is not tcharacteristic for the given 
equation of order m, there is at most one solu- 
tion whose derivatives of order up to m— 1 
coincide with given functions on the manifold 
S. The proof of this fact relies on the Cauchy- 
Kovalevskaya existence theorem. 

The uniqueness problem for the initial value 
problem is in general very difficult even for 
linear equations when the coefficients are not 
of class C®. 

In particular, if the number of independent 
variables is 2 and the coefficients are all real, 
then we have a result of T. Carleman (1939) 
about the system: 

m 
one 2, aulx, ys PD (x, y)u,, (11) 
where the a,, are of class C? and the b,, are 
continuous. He proved that if the eigenvalues 
of the matrix (a,,) are all distinct, then even 
when some of the eigenvalues are complex, 
there is at most one solution of class C! for the 
initial value problem. If we omit the assump- 
tion about the eigenvalues, however, the theo- 
rem does not hold in general, because we have 
a counterexample due to A. Pli$ (1954) where 


all coefficients are of class C”, m=2, b,, 20 
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A. P. Calderón showed that Carleman's 
result can be extended to the case n 2 (Amer. 
J. Math., 80 (1958)). Consider the following 
linear partial differential equation of the kth 
order: 


0 \ 0*3 
m P or; SE Zkad- 0, (12) 


where P(x, y,¢) is a homogeneous polynomial ` 
of degree j of £ —(£,, ..., Ča) with real coeffi- 
cients and B is a differential operation in (x, y) 
of order at most k — 1. We assume that the 
coefficients of P(x, y, č) are functions of (x, y) of 
class C!, their derivatives are Hólder continu- 
ous, and the coefficients of B are bounded and 
continuous. If the characteristic equation of 
(12), 


k 
P4Y P(x, y, EUR =0, 
j=1 


has only distinct roots for ë #0, then the C*- 
solution of the Cauchy problem is unique in a 
neighborhood of x =a. Calderón proved this 
except for the cases k 2 4, n — 2, where a cer- 
tain topological difficulty arises. S. Mizohata 
(J. Math. Soc. Japan, 11 (1959)) succeeded in 
obtaining the proof for the exceptional cases. 

This result can be extended to systems of 
equations under similar assumptions. See L. 
Hormander [4] for an extension to the com- 
plex coefficient case. S. Mizohata, T. Shirota, 
and H. Kumanogo discuss the uniqueness 
theorem for equations of double character- 
istics or of parabolic type. 

For nonlinear equations there are, in gen- 
eral, very few results about the global existence 
of solutions. For example, if the function F in 
equation (2) in the normal form satisfies the 
Lipschitz condition with constants 4 and B 
that are independent of x, then we get a global 
existence theorem. The method of proof of 
this theorem is as follows: First, we prove the 
existence for |x| € e, and sufficiently small e, 
by Picard's tsuccessive iteration method. Then 
we regard x —e, as the hyperplane on which 
the initial values are assigned and proved the 
existence of a solution on e, €|x;| € £;, and so 
on. The same method can be applied to non- 
linear systems of the first order if they are 
special types of quasilinear systems. 


G. Construction of Solutions by Asymptotic 
Expansion 


Let a(x, D) Zem a, (x) D* be a linear par- 
tial differential operator of order m, with 
coefficients of class C^. We write a(x, č)= 
Eaim alx) = h(x, E) + h'(x, £) - ... with h 
and h' homogeneous in € of degree m and m — 
1, respectively. Let K : p(x) «0 be a regular 
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surface of codimension 1. We assume that K 
is a simple characteristic, i.e., h(x, 9.) - 0 

and (dh/0¢;(x, p,)) #(0) on K. Let f(t) (j= 

0, 1, ...) be a sequence of functions satisfying 
df,/dt(t)= f;., (t), j 2 1, 2, .... Then the equation 
a(x, D)u(x)=0 has a formal solution in the 
form 


u(x)- £ flout). 


The coefficients u,(x) are obtained by solving 
successively the equations 


L, [uj] = 2. L, [uj]. (13) 
where 
" ch 12 8h 
L = SE e Dit- zra, LE x YTxx; 
1 PETA x) 2 Hs 080, Px) Px x, 
t h'(x, Px) 


and L, (2 « k & m) are differential operators of 
order k depending only on a and o. By this 
method of asymptotic expansion of solution, 
fundamental solutions and singularities of 
solutions for hyperbolic equations have been 
studied (Hadamard [5, 6]; R. Courant and P. 
Lax, Proc. Nat. Acad. Sci. US, 42 (1956); Lax, 
Duke Math. J., 24 (1957); D. Ludwig, Comm. 
Pure Appl. Math., 22 (1960); S. Mizohata, J. 
Math. Kyoto Univ., 1 (1962); — 325 Partial 
Differential Equation of Hyperbolic Type). 

If a(x, D) is an operator with analytic coeffi- 
cients, this formal solution is convergent. By 
using this fact, Mizohata constructed fnull 
solutions. In fact, put fí(t) - t"* "/T(p--m4 
D (pz0),/20,1,.... Define u(x) by u(x) 20 
for p(x) «0 and u(x) oa fi(o(x))uj(x) for 
9 (x) Z 0 obtained by the preceding process. 
Then u(x) is a null solution of a(x, D)u(x) 20 
(Mizohata, J. Math. K yoto Univ., 1 (1962)). 

The Cauchy problem in the case when the 
initial surface has characteristic points had 
been studied by J. Leray and by L. Gárding, T. 
Kotake, and Leray. Let a(x, D) be a differential 
operator with holomorphic coefficients in a 
complex domain. Let S:s(x)=0 be a regular 
surface and T be a subvariety of codimension 
1 of S. Suppose that S is noncharacteristic 
on S — T, but characteristic on T. Consider 
the Cauchy problem (1) of Section A. Then 
the solution u(x) is ramified around a char- 
acteristic surface K that is tangent to S on T, 
and it can be uniformized (Leray, Bull. Soc. 
Math. France, 85 (1957); Gárding, Kotake, and 
Leray, Bull. Soc. Math. France, 92 (1964)). 

Next we consider the Cauchy problem (1) 
when the initial surface S (x, =0) is nonchar- 
acteristic, but the initial values w,(x) (O & k < 
m — 1) have singularities on a regular sub- 
variety T (x, — x4 —0) of S. We assume that 
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h(0, €,, 1,0, ...,0)=0 has m distinct roots. Then 
there exist m characteristic surfaces K;: q;(x) — 
0 (1x ix m) originating from T. @,(x) is ob- 
tained by solving the tHamilton-Jacobi equa- 
tion h(x, 9.) 20, o(0, xz, ..., x,) 2 x3. Now, if 
w,(x) (0 € k x m— 1) has a pole along T, the 
Cauchy problem (1) has a unique solution in 
the form 


ej Fx 
E n G(9log e| HO) 
where F;(x), G,(x) (1 & i & m) and H(x) are holo- 
morphic functions in a neighborhood of x — 

0 and p, (1 <i<m) are integers >0. In order 

to obtain this solution, we set u(x) in the form 
u(x) - Ef Eo ffe oo)u, del, where folt) is 

a function with a pole or a logarithmic sin- 
gularity at t=0 chosen so that u(x) satisfies 
the initial conditions. Thus we can determine 
the coefficients u; ,(x) by solving successively 
the equations (13) on each K; (1 <i<m) (Y. 
Hamada, Publ. Res. Inst. Math. Sci., 5 (1969); 
C. Wagschal, J. Math. Pures Appl., 51 (1972)). 
When the multiplicity of characteristic roots is 
more than 1, the situation is not the same. For 
example, consider the Cauchy problem 


1 
u(0 x) —. 


Diu—D;u-0, 


D,u(0, x,)=0. 


The solution is 
u(x)= 3. (—1yn! x?"/Qn)! x5*! 
k=0 


with essential singularities along x, =0. This 
situation occurs quite generally. We factor 
h(x, €)=h, (x, Ey: ... h(x, £s, where h; (1 <i<s) 
are irreducible polynomials of degree m, in ¿ 
with holomorphic coefficients. We assume that 
the equation [J}_, 4,(0, £,,1,0,...,0) 2-0 has p 
distinct roots (p=m,+...+m,). Then there 
exist p characteristic surfaces K; (1x i & p) 
originating from T. We suppose more gener- 
ally that the initial values w,(x) are multi- 
valued functions ramified around T. Then the 
Cauchy problem (1) has a unique holomorphic 
solution on the universal covering space of V 
— UR, K;, where V is a neighborhood of x=0. 
In fact, this is solved by transforming this 
problem into tintegrodifferential equations. 
Such a method of solution is closely related 

to the method discussed in this section. See 
Hamada, Leray, and Wagschal (J. Math. Pures 
Appl., 55 (1976)). In this case, even if the initial 
values have only poles, the solution in general 
may have essential singularities along | )?_, K;, 
but if a(x, D) satisfies Levi’s condition (a(x, D) 
is well decomposable), the solution does not 
yield essential singularities along Uf K;. See 
J. De Paris (J. Math. Pures Appl., 51 (1972)). 





321 Ref. 
PDEs (Initial Value Problems) 


References 


[1] R. Courant and D. Hilbert, Methods of 
mathematical physics II, Interscience, 1962. 
[2] I. G. Petrovskii, Lectures on partial dif- 
ferential equations, Interscience, 1954. (Orig- 
inal in Russian, second edition, 1953.) 

[3] L. Hórmander, Linear partial differential 
operators, Springer, 1963. 

[4] E. Goursat, Cours d'analyse mathéma- 
tique, Gauthier-Villars, IT, second edition, 
1911; III, fourth edition, 1927. 

[5] J. Hadamard, Leçons sur la propagation 
des ones et les équations de l’hydrodynamique, 
Hermann, 1903 (Chelsea, 1949). 

[6] J. Hadamard, Le probléme de Cauchy et 
les équations aux dérivées partielles linéaires 
hyperboliques, Hermann, 1932. 

[7] C. Carathéodory, Calculus of variations 
and partial differential equations of first order, 
Holden-Day, 1965. (Original in German, 
1935.) 

[8] S. Mizohata, Theory of partial differential 
equations, Cambridge University Press, 1973. 
(Original in Japanese, 1965.) 

[9] F. Treves, Linear partial differential equa- 
tions with constant coefficients, Gordon & 
Breach, 1966. 

[10] F. John, Partial differential equations, 
Springer, 1971. 

[11] J. Dieudonné, Eléments d'analyse, 4, 7, 8, 
Gauthier-Villars, 1971—1978. 


322 (XIII.20) 
Partial Differential Equations 
(Methods of Integration) 


A. General Remarks 


The methods of integrating partial differential 
equations are not as simple as those for tordi- 
nary differential equations. For ordinary dif- 
ferential equations, we often obtain the desired 
solution by first finding the general solution 
containing several arbitrary constants and 
then specializing those constants to satisfy 
prescribed additional conditions. The situa- 
tion is more complicated, however, for partial 
differential equations. In the formal general 
solution of a partial differential equation we 
have arbitrary functions instead of arbitrary 
constants, and there are cases where it is im- 
possible, or very difficult, to find a suitable spe- 
cialization of these functions so that the given 
additional conditions are fulfilled. For this 
reason, methods of solution are rather specific 
and are classified according to the types of 
additional conditions. In many cases, we as- 
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sume that a problem is mathematically well 
posed, guess the possible solutions, and verify 
it directly. 

A problem is said to be mathematically well 
posed (properly posed or correctly posed) if, 
under assigned additional conditions, the 
solution (i) exists, (ii) is uniquely determined, 
and (iii) depends continuously on the assigned 
data. By carefully examining problems in 
physics and engineering, we usually obtain 
many well-posed and important problems. In 
these problems, usually the data are suffi- 
ciently smooth functions. In these cases, part 
(iii) of the above definition (the continuous 
dependence of the solutions on the data) fol- 


lows often from assumptions (1) and (ii). For 


example, telliptic equations like tlaplace’s 
equation u,, t Hen — 0 for u(x, y) describe laws 
of static or stationary phenomena such as the 
field of universal gravitation, the electrostatic 
field, the magnetostatic field, the steady flow 
of incompressible fluids without vortices, and 
the steady flow of electricity or heat. For this 
equation, "boundary value problems are well 
posed, but finitial value problems are not (— 
323 Partial Differential Equations of Elliptic 
Type). By contrast, for thyperbolic equations 
like u,, — u,, =0 for u(x, t) and tparabolic equa- 
tions like u, — u,, —0 for u(x, t) which control 
the change (in reference to time) of the various 
stationary phenomena, initial value problems 
or mixed problems with both boundary con- 
ditions and initial conditions are well posed 
(— 325 Partial Differential Equations of Para- 
bolic Type). 

For the rest of this article we explain funda- 
mental and typical methods of integration (— 
Appendix A, Table 15). 


B. The Lagrange-Charpit Method 


For the partial differential equation of the first 
order 


F(x, y, u, p, q) - 0, LSS 


we consider a system of ordinary differential 
equations called tcharacteristic differential 
equations: 

dx dy du —dp —dq 


SE = ^ (2) 
F, F, pr, +h, Feb, +48, 





If we obtain at least one tintegral of this sys- 
tem containing p, q, and an-arbitrary constant 
a in the form 


G(x, y, u, p, q) ^ a, (3) 


and if we find p and q from (1) and (3), then du 
= pdx+qdy is an texact differential form, and 
by integrating it we get a solution (x, y, u, a, b) 
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=0 of (1). A solution of (1) containing two 
arbitrary constants is called a tcomplete solu- 
tion. Here, setting b = g(a) and eliminating a 
from the two equations ®(x, y, u, a, g(a)) -0 
and 6, -- 0,9'(a) 20, we obtain a solution 
involving an arbitrary function. Such a solu- 
tion is called a general solution of (1). For 
example, consider pq — 1. Since the character- 
istic differential equations are 

dx dy du  —dp — dq 


q pop 0 0" 

we have p=a (constant), and hence from the 
original equation we get g—a !. Then du= 
adx+a™ dy is an exact differential form, 
and by integrating it a complete solution u — 
ax +a~1y+b is obtained. A general solution 
is found by eliminating a from u - ax -a ! y 
g(a) and 02 x —a ^? y - g'(a). 

Likewise, when we have n independent vari- 

ables x,, ..., x, in the equation 


Ou ; 
F(X 450-55 Res Us p1, Pr =O, Pia? (1) 
Xi 
we can use the tcharacteristic differential equa- 
tions to find a complete solution and a gen- 
eral solution (the Lagrange-Charpit method; — 
82 Contact Transformations). 


C. Separation of Variables and the Principle of 
Superposition 


The simplest and most useful method is the 
separation of variables. Concerning the equa- 
tion u2 -- u2 2 1 in u(x, y), for example, by set- 
ting u= (x) - V(y) we obtain g'(x} c (y = 
1 or ott - 1 — y'(yy". Since the right-hand 
side is independent of x and the left-hand side 
is independent of y, both sides must be equal 
to the same constant o7. From this we get a 
(complete) solution involving two arbitrary 
constants a, p: 


u=ax+t+./1—a? y f. 


For "near equations, it is often effective to 
write the solution as a product u = o(x)v/( y). 
From the theat equation u, = u,,, we obtain by 
this process a relation V'(y)/v(y) = e" (x)/e(x). 
from which we get a particular solution u= 
ae ^" sinv(x — a) containing a parameter v. 

Next, when the equation is linear and 
homogeneous, by forming a linear combina- 
tion of particular solutions that correspond 
to various values of a parameter v, we obtain 
a new solution (the principle of superposi- 
tion). For example, by integrating the solution 
e "cos vx with respect to the parameter v 
between the limits —oo and oo (namely, by a 
superposition consisting of a linear combina- 
tion and a limiting process), we obtain a new 
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solution 


oo 2 
zii emgeet 7) 
-o y 4y 


(y20, (4 


which is the tfundamental solution of the heat 
equation. This name refers to the fact that the 
function (4) can be used to obtain solutions of 
the heat equation under some initial condi- 
tions. More exactly, a solution of u, —u,, —0 
that is a function of class C? in y>0, is con- 
tinuous in y 20, and coincides with a bounded 
continuous function g(x) on y 20 can be ob- 
tained by a superposition of the solution (4) 
such as 








Ed 2 

us.) — | aen P 2 js 
2 /ny J -« 4y 

The method of separation of variables ap- 
plied after a suitable transfòrmation of vari- 
ables is often successful. In particular, by using 
orthogonal coordinates, tpolar coordinates, or 
*cylindrical coordinates according to the form 
of boundary, we often obtain satisfactory 
results. For example, concerning the boundary 


. value problem for Laplace's equation Au — u,, 


+u,,=0, which is smooth in the circle r? = 
x? + y? <1 and takes the value of a given con- 
tinuous function g(0) on the circumference 

r — 1, we can use polar coordinates to rewrite 
the equation in the form 


u, 1 
Au Uu, Lt Mag = D 
rr 


and apply the method of separation of vari- 
ables to obtain particular solutions r"cos n0, 

r" sin nO. Hence it is reasonable to suspect that 
by a superposition of these particular solutions 
we can obtain the desired solution: 


u(x, y) => + x (a, cos n0 + b, sinn6)r". 


In fact, this series is a desired solution if the 
coefficients a,, b, can be chosen so that the 
series converges uniformly for 0<r< 1 and can 
be differentiated twice term by term for O & r « 
1, and if we have 


a oo 
g(0) SCH + Y. (a,cos n0 + b, sin n6). 
n=1 
By virtue of the uniqueness of the solution of 
an elliptic equation, this is the unique desired 


solution. The boundary value problem in the 
preceeding paragraph is well posed. 


D. Mixed Problems 


For linear homogeneous equations of hyper- 
bolic or parabolic type, mixed problems fre- 
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quently appear, i.e., problems in which both 
initial and boundary conditions are assigned. 
These problems are, furthermore, classified 
into two types. 

For the first type, homogeneous boundary 
conditions are assigned. For example, in vibra- 
tion problems of a nonhomogeneous string 
between 0<x <I, we must find the solution of 
the equation 


(T(x)u,), = pt, 


satisfying an initial condition u= f(x), u, 2 g(x) 
for t —0, under homogeneous boundary con- 
ditions: (i) u— 0 for x 20 and x=/ in the case 
of two fixed ends; (ii) u, —0 for x 20 and x=! 
in the case of two free ends; (iii) —u,--ogu— 
0 for x =0, and u,+o,u=0 for xz ës 7 0, 
0, 70), where the two ends are tied elastically 
to the fixed points; (iv) T(0)u(0) = T(I)u(l), 
T(0)u'(0) = T(I)u'(l) (periodicity condition); 
(v)u, w are finite at x 20 and x=! (regularity 
condition). 

The method of separation of variables is 
applicable to problems of this type also. For 
example, suppose that we have two fixed ends. 
Disregarding the initial condition for a while, 
we find a particular solution fulfilling only 
the boundary condition by setting u(x, t) - 
y(x)expivt. Here, y(x) must satisfy 


(TG)y) -v!p(x)y-0, y(0)=y()=0. (5) 


Except when the solution is trivial (1.e., u(x, t) 
— 0), y(x)#0 must hold. But in the special 
case T(x)z 1, p(x) 2 1, l= m, the values of v for 
which functions of this kind exist are only 
1,2, 3,... (the corresponding y(x) is sin vx). 
Also, in more general cases, nontrivial solu- 
tions y(x) exist only for some discrete values 
of v. These v are called teigenvalues of (5), and 
the corresponding solutions y(x) are called 
teigenfunctions for the eigenvalues v. That is, 
the desired particular solution can be obtained 
by solving the teigenvalue problem (5). Again, 
when T(x)=p(x)=1, /=1, particular solutions 
are sin nxexpint, sinnxexp( — int) (n— 1,2, ...). 
We consider 


oo 
u(x, t)= 3. (a, cosnt + b, sinnt)sin nx, (6) 
n=1 


which is obtained by a superposition of these 
particular solutions. If we can determine the 

coefficients a,, b, so that this series converges 
uniformly and is twice differentiable term by 

term, and 


oo 


f(x)2 Y a,sinnx, g(x)= Y, nb,sinnx, 

n-1 n=1 
then the series (6) is a solution of the mixed 
problem in question. If the uniqueness of the 
solution of the mixed problem is proved, it is 
not necessary to look for any solution other 
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than the one obtained by combining the 
method of separation of variables and the 
principle of superposition. 

Furthermore, the method described in this 
section is applicable to solving the following 
nonhomogeneous equation, which. charac- 
terizes the motion of a string under the in- 
fluence of an external force f(x, t): 


Un — ux, = f(x, t) (7) 


under the boundary condition for the first type 
of mixed problem and the initial condition u= 
u, —0 for t=0. In this case, we expand the 
unknown u and the function f(x, t) in terms of 
the system of eigenfunctions {sinnx}, and by 
substituting 


u= } a,(tsinnx, f(x,t)= A A,(t)sinnx 
n=1 n=1 
into (7), we reduce the problem to determining 
a,(t) (n= 1, 2,...). When the external force 
varies with a harmonic oscillation over time as 
in f(x, t)= — e(x)exp( — iot), a similar method 
can be applied by setting u — v(x)exp( — ict). 
For mixed problems of the second type, the 
homogeneous initial condition u — 0, u,=0 for 
t — 0 is assigned, but the boundary condition 
is nonhomogeneous. For example, when an 
oscillating string is at rest until t=:0, and for t 
>0 its right end is fixed and its left end moves 
subject to an assigned rule, the behavior of the 
string is described by the solution of u,,—u,, = 
0 under the boundary condition u(0, t)= f(t), 
u(l, t)=0 (t>0). This is called a transient prob- 
lem. If we now choose an arbitrary function 
B(x, t) that fulfills all the boundarv and initial 
conditions, and if we set u— B(x, t) = v(x, t) and 
B,.— B, = f(x, t), then v satisfies 


Ui — Uxx = f(x, t) 


with v=v,=0 for t=0; v=0 for x:=0 and x=]. 
Then v(x, t) describes the oscillation of a string 
that is at rest until t =0 and moves under the 
effect of an external force represented by f(x, t) 
for t>0. Problems of this kind often appear in 
electrical engineering. 

Such problems can be reduced to problems 
of the first type in the manner described in the 
previous paragraph, but there are some direct 
methods that are more effective, the first being 
Duhamel's method. Consider the case where 
f(t) is the unit impulse function: 


fsh t>0, 


for t>0 


0, t<0, 


and let U(x, t) be a solution corresponding to 
this case and vanishing for t <0, x 2 0. Then a 
solution corresponding to the general case is 
given by 


Ce 
zii aU sr 9f (oat, 


0 
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The second method is based on application 
of the tLaplace transformation. Denoting the 
Laplace transform of a solution u(x, t) by 


[ ue" dt DD ; 
0 p 


we multiply both sides of u,,—u,,.=0 by e7” 
and integrate the result with respect to t from 
0 to oo. Then, taking account of the initial 
condition, we have 


v,,—p^v-0 


with the boundary condition 
v(0, »=»| f()e "dt, ol p)=0. 
0 


If for p=a+iß (a>) we can find a solution 
t(x, p) of this boundary value problem for the 
ordinary differential equation, then the desired 
solution is given by 


T GH v(x, p) 


e" d 
an ni p P, 


where L is a path in «> a, parallel to the 
imaginary axis. This is called the Bromwich 
integral. 


E. Green's Formula and Application of 
Fundamental Solutions 


Given a linear partial differential operator 
L[u]=))a,(x)D?u, Prin, Bal 
p 


D? zQh* + PafAxPs .. dxPs, 
we call the operator 


L*[v] 25. ( D^: *- * ^D'(a,(x)v) 
p 


the adjoint operator of L, and the operator 


Lol E (71): * * D^(ay()0) 


the transposed operator of L. Sometimes ‘L is 
also called the adjoint operator of L. In the 
complex Hilbert space, the adjoint operators 
are more appropriate than the transposed 
operators. In this case, the operator L* defined 
above is usually called the formal adjoint oper- 
ator to distinguish it from the one defined by 


(L[u], v)=(u, L*[v]) for all ue D(L), 


D(L) being the domain of the definition of 
L (— 251 Linear Operators). These trans- 
posed or adjoint operators are often used to 
represent (at least locally) the solutions u of 
L(u]— f. 

We explain in more detail the specific case 
where the number of independent variables is 
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2. For the linear partial differential operator 





u ðu Qu 
L(u)— SE 
(u) A(x, y) 5 t 2B(x, y) * Oo Waa S 
+ D(x, yee —+ Ely, De Ps yu 


and its adjoint partial differential operator 
3 (Av) .0?(Bv) O?(Cv 

( : ) " (Bv) 4 ( 2 ) 

Ox Ox dy oy 


O(Dv) (Ev) 
Ox — Qy 


M(v)= 








+ Fo, 
we have tGreen’s formula: 


| (vL(u) — uM (v)) dx dy 
D 


B Óx Qy 
= -| (roa. (8) 


P=v{Au, + Bu,} —u{(Av),+(Bv),} + Duv, 
= v{Bu,+Cu,} —u{(Bv),+(Cv),}+ Euv, (9) 


and OD denotes the boundary curve of the 
domain D, n the internal normal of 0D at a 
point of 0D, and s the arc length. 

We can apply this formula for solving a 
nonhomogeneous equation L(u)= f as follows. 
Assume that there exists a solution of L(u) =f 
satisfying the assigned additional condition, 
and choose a fundamental solution of M(v)= 
0 having an adequate singularity at a point 
(Xo, yo) of D and fulfilling a suitable boundary 
condition. Then, if we substitute these solu- 
tions u and v into (8), we obtain an explicit 
representation of u(xo, yo). If we can verify that 
the function u(x, y) thus obtained is a solution 
of L(u) — f fulfilling the assigned additional 
conditions, then we see, under the assumption 
of uniqueness of solutions, that this and only 
this function u(x, y) is the desired solution. For 
example, consider a boundary problem for 

2 2u 2 2 

L(u a s z» for which M()- + : E e 
The problem is, for a circle of radius r with 
center at the origin, to find a function u(x, y) 
that is continuous in the interior and on the 
circumference C, of the circle, and that satis- 
fies L(u) — f in the interior of the circle ( f is 
bounded and continuous in the interior of the 
circle) and is equal to a given continuous 
function g on the circumference C,. In this 
case, we consider the circumference K, of 
sufficiently small radius € with center (xg, yo) 
contained in the interior of the first circle, and 
set 


v(x, y) - (1/21)log 1/p + h(x, y), 
p=((x— xo FU, (10) 
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We apply formula (8) to the domain D, en- 
closed by the circumferences K, and C,. If, 

in (10), h satisfies M(v) 20 in the interior of the 
circle enclosed by C, and vanishes on C,, then 
we get 


Il faxdy=-| (ou, wry ds+ | gv ds. 
D, K, e 


By letting £ tend to zero, the first term on the 
right-hand side yields 


2" | 0logp 
-utso | 2a 0 od — 
o 4n Op 





= u(Xo, yo). 


by the logarithmic singularity of v at the point 
(Xo, yo). Therefore we have an explicit repre- 
sentation of u, 


i va vfdsdy, — (11) 
C, x2-y?xr2 


and it is easily verified that this is the desired 
solution. 

As stated in the previous paragraph, to 
apply Green’s formula it is necessary to find a 
solution v of M(v)=0 possessing a fundamen- 
tal singularity as the logarithmic singularity, 
i.e., a so-called fundamental solution. As fun- 
damental singularities for 

ôv Q?v Ov Gin 
EN dU ay? Ox? 
of parabolic type and of hyperbolic type, we 
must take those given respectively by 


v(x, y) 
1 (x — xg 
= el H SE 
0, Y<Yo; 
and 
en? En 
, dx—xol 2 y — yo 


(— 323 Partial Differential Equations of Ellip- 
tic Type; 325 Partial Differential Equations of 
Hyperbolic Type; 327 Partial Differential 
Equations of Parabolic Type). 
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323 (X1II.24) 
Partial Differential Equations 
of Elliptic Type 


A. General Remarks 


Suppose that we are given a ‘linear second- 
order partial differential equation 


2 


L[u]z X, M iE PL 


= f(x), (1) 


where x =(X,,...,X,), aj; — aj. If the quadratic 
form È aj£;6; in € is positive definite at every 
point x of a domain G, this equation (or the 
operator L) is said to be elliptic (or of elliptic 
type) in G. For n 22, a,,(x)a54(x) - (a,,(x)? > 
0 is the condition of ellipticity. In this case, 

by a change of independent variables, the 
equation is transformed locally into the canon- 
ical form 


Qu Ou 


ax EE 


Saber vi es yu 


= f(x, y). 


The operator X? 0?/0x2, denoted by A, is 
called Laplace's operator (or the Laplacian). 
The simplest examples of elliptic equations are 
Au =0 (Laplace's differential equation) and 

Au = f(x) (Poisson's differential equation) (— 
Appendix A, Table 15). 


B. Fundamental Solutions 


Let K be the n-dimensional ball with radius R, 
center xo, boundary Q (an (n — 1)-dimensional 
sphere), and area S,, and let r be the distance 
from x, to x. Then for any function u(x) of 
class C? we have 


Geb uas 
EOS M UE ey 3 


x | (r^ "— Ri "Aude 
K 





for n» 2, 


1 1 R 
u(x) =>— | udS—— | log—Audx 
27 Jo 2mjg F 


for n=2. 


A 
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Thus, if u is a solution of Poisson's equation 

= f(x), we have a representation of u(xo) by 
replacing Au by f in the integrals just given. 
Next, concerning the solutions of the equation 
Au + cu =0 (c 0, constant), the following 
relation holds: 


zT ads E02 A(R /c) 
a (R,/c/2)" 


where T is the tgamma function, n =(n—2)/2, 
and J, is the tBessel function of order v. 
Now, if we put 


1—-n/2 
(È 4) =p?" n>3, 
et 


2 —1/2 1 
ER œ) =log-, n=2, 
i=1 r 





u(xo), 


ip^ 


V(x, = 


then the function u(x) defined by 


1 
ui V(x, 9f(5)46, 
nJ4G 





represents a tparticular solution of Au = f(x), 
where V(x, £) as a function of the variables x 
satisfies AV(x, 5) - 0 except at x= €. 

Consider now the more general case (1). À 
function E(x, £) is called a fundamental solution 
(or elementary solution) of (1) or of L if 


TM | E(x, f(@dé 
G 


provides a solution of (1) for any f(x)e C? 
with compact support. À fundamental solution 
E(x, £) is a solution of the equation L[E]=0 
having a singularity at x= č of the form 

— o; /a( r*7" (n23) or o; aé) logr 

(n —2), where a(£) = det(a?(£)) and r= 

(Za! (5) (x; — £) (x — EI and (a) is the 
inverse matrix of (a;;). 

Roughly, there are three different methods 
of constructing fundamental solutions. The 
first and most general is to use pseudodif- 
ferential operators (— 345 Pseudodifferential 
Operators). The second is that of J. Hadamard 
[1], which uses the geodesic distance between 
two points x and £ with respect to the Rie- 
mannian metric È a?(x) dx;dx;. This is impor- 
tant in applications of elliptic equations to 
geometry. The third method is due to E. E. 
Levi [2] and is as follows: Let 


SR n>3, 


Weih a logr, n=2. 


To obtain a solution of L[u] = f(x) we set 


u= | V(x, jol) al) dé. 
G 
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Writing L[V(x, €)]= x(x, č), we obtain the 
following *integral equation of Fredholm type 
in ot 


1 1 

p(x) -+| x(x, čjo) a(č) dě — — f(x). 
On Jo Dy 

If we denote the resolvent of this Squation by 

x(x, č) and put 


(x, č)= V(x, č) -o| V(x, &)xG", 5). a(8) de’, 
G 
then E(x, £) - — y(x, £)./ a(£)/o, 1s seen to 


be a fundamental solution of L. Thus Levi's 
method enables us to construct a fundamental 
solution locally by successive approximation, 
because the integral equation in o as above 
has a unique solution expressible by Neumann 
series if the domain G is small enough (— 189 
Green's Operator). 


C. The Dirichlet Problem 


Let G be a bounded domain with boundary I. 
We call the problem of finding a solution u 

of the given elliptic equation in G that is con- 
tinuous on GUT and takes the assigned con- 
tinuous boundary values on T the first bound- 
ary value problem (or Dirichlet problem). In 
particular, the Dirichlet problem for Au=0 
has been studied in detail (— 120 Dirichlet 
Problem). 

If c(x) «0 and f(x) «0 or if c(x) and 
f(x) «0 in (1), a solution u does not attain its 
local negative minimum in G. This is called 
the strong maximum principle (— [3,4] for 
Hopf’s maximum principle and Giraud’s theo- 
rem). The maximum principle is one of the 
most powerful tools available for the treat- 
ment of elliptic equations of the second order 
with real coefficients. From this it follows that 
the solution of the Dirichlet problem for (1) is 
unique if c(x) « 0. Furthermore, concerning the 
uniqueness of the solution, we have the follow- 
ing criterion: If there exists a function ex) 
0 of class C? in G and continuous on GUT 
such that L[o(x)] «0, then the solution of the 
Dirichlet problem is unique. 

Let G be a tregular domain in the plane. If 
we denote fGreen’s function of A relative to 
the Dirichlet problem in G by K(x, y; 6, a, then 
the solution u of Au = f(x, y) vanishing on T is 
given by 


1 
u(x, y)= H f | K(x, y; č, mf (5, mdé dn, 
G 


where f(x, y) satisfies a "Holder condition. 
The Dirichlet problem for 


L{u]=Au+a(x, »- + b(x, De tet yu 


= f(x, y) 
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with prescribed boundary values reduces to 
the problem in the previous paragraph. In fact, 
let h(x, y) be a function that coincides with the 
prescribed boundary value on I. Then if we 
put u = h(x, y) - v, the problem is reduced to 
finding a solution v satisfying an equation 
similar to the one for L[u] (replacing f by 

f —L[h]) and vanishing on T. Now suppose 
that T consists of a finite number of tJor- 

dan curves whose tcurvatures vary continu- 
ously, a and b are continuously differentiable, 
and c and f satisfy Hólder conditions. Let 
K(x, y; €,4) be Green's function for the Diri- 
chlet problem relative to A. To find a solution 
of L[u] — f vanishing on I we set 


1 
T Ile 
Then p is a solution of the integral equation 
p(x, y)-- {| H(x, y, č, n)p(č, n)dč dn = f(x, y), 
G 


where H(x, y, č, n)=(—1/27) {a(x, y)K (x, y; 

En) b(x, y) KyG y; & n) + elx, y) KG y; 6 m]. 
Therefore we have the following alternatives: 
Either L[u] = f has a unique solution for any 
f and any given boundary values on I, or 
L[u]-0 has nontrivial solutions vanishing on 
T (in this case the number of linearly indepen- 
dent solutions is finite). 

More general equations of type (1) have 
been studied by J. Schauder and others. 
Schauder proved, first, that when the b,(x) and 
c(x) are zero, a;(x) and f(x) satisfy Holder 
conditions, and the boundary T of G is of class 
C^, then there exists a unique solution u(x) 
vanishing on T. Next, when the b,(x) and c(x) 
are continuous and aj; and f satisfy Holder 
conditions, he showed the following alterna- 
tives: Either L[u] =f admits a unique solution 
vanishing on I for every f, or L[u]=0 has 
nontrivial solutions vanishing on I (in this 
case, the number of linearly independent solu- 
tions is finite). 

In (1), suppose that a,,, b;, and c are Holder 
continuous of exponent a (0<a< 1) uniformly 
on G and that I is of class C?**. Then we have 
the following inequality (Schauder's estimate) 
for any ue C?**(G): 


[ula a.c S kl, et War) 
 Kjllullo.a. Q) 


where ||: ||, s stands for the norm in the func- 
tion space C*(S) (— 168 Function Spaces). K, 
and K, are positive constants depending on L, 
G, and « but independent of u. More precisely, 
K, depends only on the ellipticity constant A 
of L defined as the smallest number > 1 such 
that 


ANE? «Y a(x) EiE)< Ale]? (3) 
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for any (x, £)e G x R”. The inequality (2) is one 
of the most important a priori estimates in the 
theory of elliptic equations [5,6] (— inequality 
(9) in Section H). 


D. Quasilinear Partial Differential Equations 


Consider the second-order partial differential 
equation in u(x,, ..., xy: 


F(X 5-565 XM Dis Be Pip o Pip so Pan) 70, 


where p;— Ou/Óx;, p= 0^ u/Óx;Ox;. If, for a 
solution u(x), È; j=, (OF/0p;j) 6,4; is a positive 
definite form, we say that the equation is ellip- 
tic at u(x). Moreover, if for any values of u, p,, 
p;; this quadratic form is positive definite, we 
say simply that the equation is of elliptic type. 
The equation is called quasilinear if F is linear 
in pj. For example, the equation of tminimal 
surfaces (— 334 Plateau's Problem) 


(1 - p)p11 —2p1Pz P153 (1 + pD)p2; =0 


is a quasilinear elliptic equation. Furthermore, 
Auc f(x, y, U, uy, Uy) 


is also elliptic. E. Picard solved this equation 
by the method of successive approximation [7]. 
Specifically, let h(x, y) be the tharmonic func- 
tion taking the assigned boundary values on T. 
Starting from uo(x, y)= h(x, y), we define the 
functions u,(x, y) successively as the solutions 
of 


Au, = f(x, Y, Un-15 Ou, .,/Óx, Qu, .,/0y), 


coinciding with h on T. Let K(x, y; 6, n) be 
Green’s function in G and in the region de- 
fined by |u—h(x, y)| <A, |p—h,|< B, |q— 
h,| <B, (x, y)e G, and let the supremum of 

| f(x, y, u, p, q)| be C. Assume now that 


C C 
31) K dé dn « A, sil |K,|dé dn <B, 
2n Ile 2n IIe 


C 
> K dE dn <B, 
sil ,Idé dn 


and that f satisfies a Hólder condition in (x, y). 
Moreover, let 


| f(x, y. wu, p, q)— f(x, y, u, p. q)| 
<Llu—u'|+L'(|p'—p|+lq'—4)). 


Assume finally that 


{| (LK +L(|K,|+|K,|))dédn<y<t. 
G 


Under these assumptions, (u,(x, y)} is uni- 
formly convergent, and the limit u(x, y) coin- 
ciding with h(x, y) on I satisfies 


Au = f(x, y, u, Ou/Ox, Qu/O y). 
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Furthermore, it is known that the solution is 
unique within the region mentioned above. 
This method can be applied when G ts small 
and the values of h, and h, are limited. 

When f does not contain p and q, the fol- 
lowing method is known. Let ox, y) and 
@(x, y) both be continuous on GUT and of 
class C? in G. Suppose that f(x, y, u) satisfies a 
Holder condition in ox, y) <u € (x, y), and 
that 


AO < f(x, ys lx, y)). 


Then; given a continuous function g on I' such 
that o x 9 <ð, there exists a unique solution u 
of Au = f(x, y, u) such that 


Ao > f(x, y, a(x, y)), 


w(x, y) & u(x, y) & a(x, y) in G, 
u(x, y) - o on T. 
Finally, consider the equation 
LEN DR 
ax? Oxdy dy?” 


where A, B, and C are functions of x, y, u, p, q 
and AC — B? » 0. Under the following condi- 
tions, there exists a solution of the Dirichlet 
problem: A, B, C are of class C?, and their 
derivatives of order 2 always satisfy Hólder 
conditions; G is tconvex; and the boundary 
value g along I considered as a curve in xyu- 
space represented by the parameter of arc 
length is of class C?** (a » 0). Moreover, any 
plane having 3 common points with this curve 
has slope less than a fixed number A. The 
proof of this theorem is carried out in the 
following way: For any function u satisfying 


ju(x, y|&max|lel — IulsA [u |<A, 


replacing u, p, q by u(x, y), u(x, y), uy(x, y), 
respectively, in A, B, and C, we have the linear 
equation in v: 


0?v 


Ox Oy 





2 2 

AD 5 2B[u] + CUS =0. 
We can obtain the solution v taking the 
boundary value o on T. Thus we have a 
mapping u—v. Applying the tfixed-point 
theorem tn function space to this mapping, we 
have the desired solution v(x, y)=u(x, y). 

Concerning the Dirichlet problem for the 
second-order semilinear elliptic partial dif- 
ferential equation 


n ^u ou ĝu 
pr wé lan äi, x) 


a work by M. Nagumo (Osaka Math. J., 6 
(1954)) establishes a general existence theorem. 
For the general nonlinear equation 


F(x,, Xs HS P1, s P»Dii «Pijs «+.» Dui) 7 0, 
if £(0F/0p;)£;6,» 0, F, «0, the solution of the 
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Dirichlet problem for this equation is unique. 
Even when F, <0 does not hold, the conclu- 
sion remains the same if we can reduce the 
equation to this case by a suitable change of 
variables. 

Quasilinear equations in divergence form 





Qu Qu 
= ne |, 4 


or more generally, any quasilinear elliptic 
equation 


E afande, OH) n 
fr NC Ox, 0x, 8x;0x; 


sen Ou Ou (5) 
SR H 'Qx, x, H 


and even quasilinear elliptic systems have been 
treated in detail in several recent works [8,9]. 
J. Serrin [10] treated the Dirichlet problem 
and established the existence and the unique- 
ness of solutions for some classes of equations 
of type (5) containing the minimal surface 
equation. His method is to estimate the maxi- 
mum norms of u and of its first derivatives, to 
apply a result of O. A. Ladyzhenskaya and N. 
N. Ural’tseva [9] and the Schauder estimate 
(2), and finally to use the Leray-Schauder 
fixed-point theorem [11]. 





E. Relation to the Calculus of Variations 


Consider the bilinear form 


Qu Ou 


J= E (z au) ES 
*2) ho) us c(x)u? + 2f cu) dx, 


where we assume X a;(x)¢;¢;>0. Under the 
boundary conditions imposed on v, if there 
exists a function u that makes J minimum, 
then assuming some differentiability condi- 
tion on a;/(x), b,(x), c(x), we have the tEuler- 
Lagrange equation 


d ĝu 
Glanz 


-(«- Xd he uf) = 
which is a linear second-order self-adjoint 
elliptic equation. 

B. Riemann treated the simplest case, where 
a,(x) = 6/, b((x) — c(x) - i.e., the case Au =0. 
He proved, assuming the existence of the mini- 
mum of J, the existence of the solution of Au = 
0 with assigned boundary values. This result, 
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called Dirichlet’s principle, was used by D 
Hilbert, R. Courant, H. Weyl, O. Nikodym, 
and others to show the existence of solutions 
for linear self-adjoint elliptic equations. 

If F(x,,..., Xn u, P1, ..., Pn) satisfies 
X (0^ F/0p,0pj 6,5; 0 (and F has some regu- 
larity), then the function that minimizes the 
integral 


CH ĝu 
J=] Fl Xa, Xp Man a ]dx1,..-, dx, 
G Ox, Óx, 


with the given boundary condition satisfies the 
Euler-Lagrange equation (of type (4)) 
Ou n n 


2 F, AIT X ha x Fp, 


PiP; 
Wel RE Ox; i=1 OX; i21 


^ 


F,-0 


(p; — 0u/Ox;) and the boundary condition as 
well. This is also an elliptic partial differential 
equation in u. The case where F is a function 
of p alone (in particular, the case of tmini- 
mal surfaces) has been studied by A. Haar, T. 
Rado, J. Serrin, and others, particularly for 

F —(1-- p2- ... - p2)? in the case of the mini- 
mal surface equation. 


F. The Second and Third Boundary Value 
Problems 


Let G be a domain in R" with a smooth 
boundary consisting of a finite number of 
hypersurfaces. Also, let B[u] be the boundary 
operator defined by 


Ou 
B[u]—-az-- Bu, (6) 
ON 


where a — (3 ajjc0s(vo xj))^) "^, vo is the 
outer normal of unit length at the point xe S, 
and v is the conormal defined by 


cos(vxj - Y agcos(vyxj/a, | i-1,...,n. 

jal 
The problem of finding the solution u(x) of the 
equation L[u]= f continuous on the closed 
domain G and satisfying B[u] — o on the 
boundary S of G is called the second boundary 
value problem (or Neumann problem) when 
f =0, and the third boundary value problem 
(or Robin problem) when f £0. In general, in 
boundary value problems, the condition that 
the solutions must satisfy at the boundary is 
called the boundary condition. We assume that 
the boundary S of G is expressed locally by a 
function with *Hólder continuous first deriva- 
tives (G is then called a domain of class C!:^). 
Assume that G is such a domain, c <0, f z: 0, 
and at least one of c and f! is not identically 0. 
Then the second and third boundary value 
problems admit one and only one solution. 
When c=0 and fi z0, the solutions of the 
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second boundary value problem are deter- 
mined uniquely up to additive constants. Fur- 
thermore, let M be the tadjoint operator of L, 
and let 


B'[v] = a24 (D —b)v, 


where 
n 0d;; 
b= 5 cos(vox; — |. 
p (vox |è x d 


Then if the boundary S of G is of class C! and 
f. * are continuous, in order that there exist 
at least one solution u of the second or third 
boundary value problem relative to L[u] = f, it 
is necessary and sufficient that 


| fas- | gvdS —0, 
G S 


where v is any solution of M[v]=0 with the 
boundary condition B'[v] —0. Here the neces- 
sity is easily derived from Green's formula. G. 
Giraud used the notion of fundamental solu- 
tion to reduce the second and third boundary 
value problems relative to L[u]— f to a prob- 
lem of integral equations, under the assump- 
tions that G is a domain of class C!, the co- 
efficients of L and f satisfy Hólder conditions, 
and ọ and f are continuous [3; also 12]. 


G. Method of Orthogonal Projection 


The theory of "Hilbert spaces is applicable to 
the boundary value problems in Section F. In 
general, let H"(G) be the space of functions in 
L(G) whose partial derivatives in the sense of 
distributions up to order m belong to L(G). 
For elements f and g in H"(G), we define the 
following inner product: 


(fd. A. | D' f(x) D' g(x) dx, 
G 


ja] € m 
where 


la] 
DT EE laj=a,+...4a, 
(-* 168 Function Spaces). With respect to this 
inner product, H"(G) is a Hilbert space. When 
u satisfies L[u] = f (f € L;(G)) and ¢(x) is an 
arbitrary element of H'(G), Green's formula 
yields 


- X (uz Ou Eat +$ (noz. o) 


Q 
+ (c(x)u, oi + | ac. 94S -(f. 9). 
s Ou 


where b;i(x) 2 b(x)2 — 2,(0a;;/0x,). Taking 


account of the boundary condition on u: 
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adu/dv + Bu — 0, we get 


du 09 Qu 
trus a m 
Yu =) d ! x, e) (cu. o) 


d puodS — —(f, o). 
S 


Thus the problem is reduced to finding u(x)e 


H(G) satisfying this equation for all oe H!(G). 


This equation can be regarded as an equation 
in H!(G). If necessary, by replacing c(x) by 
c(x) — t for a large t, we can show that for any 
f(x)eL,(G), there exists a unique solution 
u(x)e H!(G) [12,13]. Now, for a solution of 
this functional equation, if we take o(x)e 
BG), we have (u, L*[o]) -(f, 9), where L* is 
the tadjoint operator of L. This means that 
u(x) is a solution of L[u] =f in the sense of 
distributions, and we call such a u(x) a weak 
solution. Such a treatment may be called the 
method of orthogonal projection, following 
Weyl. In this case, it can be shown that if we 
assume smoothness of the coefficients, the 
boundary S, and f, then the solution u(x)e 
H'(G) belongs to H**?(G) when f(x)e H*(G) 
(s —0, 1, ...). Thus, if we apply Green's for- 
mula, we can see that u satisfies the boundary 
condition a Qu/Ov + Bu — 0. In particular, when 
s n/2, we see that u(x)e C?(G) by tSobolev’s 
theorem [12]. In other words, u(x) is a gen- 
uine solution. 

Next, we introduce the complex parameter 
A and consider the boundary value problem 
(L4 AD[u]^ f, f e L(G), adu/dv + Bu =0. If t 
is large, (L — tI) is a one-to-one mapping from 
the domain 2(L) = (ue H?(G)|a0u/0v + Bu —0j 
onto L,(G). Thus, denoting its inverse, which 
acts on the equation from the left, by G,, we 
have 


(4 06G)[u] - Gf. 


Conversely, since the solution u(x) contained 
in L(G) (hence also contained in Z(L)) satis- 
fies the equation and the boundary condition, 
the problem is reduced to the displayed equa- 
tion in L(G) considered above. Now, since G 
is bounded and G, is a continuous mapping 
from L(G) into H?(G), Rellich’s theorem yields 
that G, is a compact operator when it is re- 
garded as an operator in L(G). So we can 
apply the tRiesz-Schauder theorem (— 189 
Green's Operator). ` 


H. Elliptic Equations of Higher Order 


The differential operator of order m: 


gll 
L= y aD ps 
SH del Oxi... Oxi 


Ja|=a,+...+a,, 
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is called an elliptic operator if 2,.,,a,(x)£^ #0 
(€ #0). In particular, if 


Re Y a,(x)&*>clé|", c»0, 
ja| 7m 


L is called a strongly elliptic operator. In this 
case, m is even. L. Gárding studied the Diri- 
chlet problem for strongly elliptic operators 
[14]. If we put m — 2b, the boundary value 
condition is stated as @/u/dv4 = f(x) (j —0, 1, 
...,b — 1), where v is the normal of unit length 
at the boundary. Using the notion of function 
space, this boundary condition means that 
the solutions belong to the closure H°(G) of 
ZG) in H'(G) (— 168 Function Spaces). In 
this treatment, Gárding's inequality 


(—1)’Re(L[u], u) 2 ó||ullg —clull?, 
ueH(G), (7) 


where 6 and c are positive constants, plays an 
important role. 

In general, for an elliptic operator L defined 
in an open set G, if u(x) satisfies L[u] = f(x) 
and f(x) belongs to H? on any compact set in 
G, then u(x) belongs to H"** on every compact 
set in G (Friedrich’s theorem [15]). 

General boundary value problems for elliptic 
equations of higher order have been consid- 
ered by S. Agmon, A. Douglis, and L. Niren- 
berg [16], M. Schechter [17], and others. 
These problems are formulated as follows: 


L[w]-f(x) ` Be, D)u(x) = (x), 
xe$, j=1,2,...,b(=m/2), (8) 


where the B,(x, D) are differential operators 
at the boundary and f and (9j) are given 
functions. Under certain algebraic conditions 
(Shapiro-Lopatinskii conditions) on (L, {B;}), 
the problems are treated also in H"(G); hence 
the L? a priori estimates play a fundamental 
role: If ue H"(G), 


b 
Il < K (|| Lu] + 2. 1Bjull a - m;- 72,5 + lul) 
(9) 


where K is a constant determined by (L, B;, G), 
Il“ ll, s is the norm in H*(S), and m; are the 
orders of B; (compare (9) with (2)). Under these 
estimates the boundary value problem is said 
to be coercive. In applications, the theory of 
interpolation of function spaces are also used 
[18] (— 168 Function Spaces). Variational 
general boundary value problems have been 
treated by D. Fujiwara and N. Shimakura (J. 
Math. Pures Appl., 49 (1970)) and others. For 
systems of such equations, there are works by 
F. E. Browder (Ann. math. studies 33, Prince- 
ton Univ. Press, 1954, 15-51) and others. 
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I. Analyticity of Solutions 


In a linear elliptic equation Lu — f, suppose 
that all the-coefficients and f are of class C? 
(resp. real analytic) in an open set G and that u 
is a distribution solution in G. Then u is also of 
class C? (resp. real analytic) in G [19]. Hence 
the linear elliptic operators are hypoelliptic 
(resp. analytically hypoelliptic) (— 112 Dif- 
ferential Operators). In particular, "harmonic 
functions (i.e., solutions of Au — 0) are (real) 
analytic in the domain of existence, whatever 
the boundary values may be. 

Hilbert conjectured that when F(x, y, u, p, 
qr, St) (P=P1,4= P2 =Pi1,8=P12,t=P22) 
is analytic in the arguments, then any solu- 
tion u of the elliptic equation F =0 is analytic 
on the domain of existence (1900, Hilbert's 
19th problem; — 196 Hilbert). This conjec- 
ture was proved by S. Bernshtein, Radó, and 
others, and then H. Lewy proposed a method 
of extending this equation to a complex do- 
main so that it can be regarded as a thyper- 
bolic equation (Math. Ann., 101 (1929)). This 
result was further extended by I. G. Petrovskii 
to a general system of nonlinear differential 
equations of elliptic type (Mat. Sb., 5 (47), 
(1939)). 


J. The Unique Continuation Theorem 


Since all the solutions of Laplace's equa- 

tion Au=0 are analytic, it follows that if u(x) 
vanishes on an open set in a domain, then 
u(x) vanishes identically in this domain. This 
unique continuation theorem can be extended to 
linear elliptic partial differential equations with 
analytic coefficients in view of the analyticity 
of solutions. This fact is also proved by apply- 
ing *Holmgren's uniqueness theorem (— 321 
Partial Differential Equations (Initial Value 
Problems)). The unique continuation theorem, 
first established by T. Carleman for second- 
order elliptic partial differential equations 
L[u] -0 with C!-coefficients in the case of 
two independent variables, was extended to 
second-order linear elliptic equations with C?- 
coefficients in the case of any number of inde- 
pendent variables by C. Müller, E. Heinz, 

and finally by N. Aronszajn [20]. This re- 
search was extended by A. P. Calderón [21] 
and others in the direction of establishing the 
uniqueness of the Cauchy problem. However, 
it is to be remarked that even if we assume 
that the coefficients are of class C”, we cannot 
affirm the unique continuation property for 
general elliptic equations. A counterexample 
was given by A. Plis (Comm. Pure Appl. Math., 
14 (1961)). See also the work of K. Watanabe 
(Tohoku Math J., 23 (1971)). 
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K. Elliptic Pseudodifferential Operators and 
the Index 


A pseudodifferential operator P(x, D) with 
symbol p(x, Ces, (— 345 Pseudodifferential 
Operators) is said to be elliptic, provided there 
exists a positive constant c such that | p(x, &)| > 
c(1 4- |£|)" for all xe R” and |£|z c ''. The no- 
tion of ellipticity can be extended to oper- 
ators on a manifold. The theory of elliptic 
pseudodifferential operators has been widely 
applied to the study of elliptic differential 
equations, and is particularly useful in the 
calculation of the *index of elliptic operators. 
B. R. Vainberg and V. V. Grushin [22] cal- 
culated the index i of the tcoercive boundary 
value problem for an elliptic operator by 
showing that i 1s equal to the index of some 
elliptic pseudodifferential operator on the 
boundary. 

Example [23]: Given a real vector field 
(v,, v2) on the unit circle x? + x2— 1, suppose 
the vector (v, (x), v, (x) rotates / times around 
the origin as the point x =(x,,x,) moves once 
around the unit circle in the positive direc- 
tion. Then the index of the boundary value 
problem 


8? g 
Uhu = f(x), 


Ou Ou 
V1 G)—- vx) z— = g(x). 
OX, OX, 


Kee <1, 


xs D 


is equal to 2 — 2l. 

M. F. Atiyah and I. M. Singer determined 
the index of a general elliptic operator on a 
manifold in terms of certain topological invar- 
iants of the manifold (— 237 K-Theory H). 
The index of noncoercive boundarv value 
problems has also been studied by Vainberg 
and Grushin, R. Seeley (Topics in pseudo- 
differential operators, C.I.M.E. 1968, 335- 
375), and others. 


L. The Giorgi-Nash-Moser Result 


Let us state the following result (J. Moser 
[24]): Let L be of the form 


where aj;— a; are real-valued, of class L”(G), 
and such that the ellipticity condition (3) holds 
at almost everywhere in G with some 42 1. 
Also, let G' be any subdomain of G whose 
distance from 0G is not smaller than 6>0. 
Then, for any weak solution ue H'(G) of the 
equation Lu=0 and for any two points x and 
y in G’, we have the inequality 


Iu(x) — wt ve A|x — yl ull aen, 
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where A and a (470,0 «a < 1) depend only on 
(n, 4,6) and are independent of the particular 
choice of (L, G, G', u). Moser proved that the 
above inequality is a corollary to a Harnack- 
type inequality (— 327 Partial Differential 
Equations of Parabolic Type G). 


M. Asymptotic Distribution of Eigenvalues 


Let L be an elliptic operator on G of order m 
with smooth coefficients realized as a self- 
adjoint operator in L?(G) under a nice bound- 
ary condition, where G is a bounded domain 
in R” with smooth boundary. Let N(T)(T » 0) 
be the number of eigenvalues of L smaller than 
T. Then, it holds that 


N(T)— CT" +an error term, as T> +00, 


cl af a(x, £) "^ dS, (10) 
G s" 


where a(x, č) is the tprincipal symbol of L and 
S"^! is the unit sphere on R”. C is independent 
of the boundary condition and, if L is of con- 
stant coefficients, of the shape of G. 

Formula (10) was at first established by 
H. Weyl [25] for the case of L — Laplacian, 
and hence it is often called Weyl’s formula. 
Weyl’s method is based on the minimax prin- 
ciple [26]. T. Carleman (Ber. Math.-Phys. 
Klasse der Sachs. Akad. Wiss. Leipzig, 88 
(1936)) studied the behavior of the trace of 
the Green's function of zI-L as |z|— oo in the 
complex plane (— [27]). S. Minakshisun- 
daram (Canad. J. Math., 1 (1947)) discussed 
this formula in connection with the heat equa- 
tion; see also S. Mizohata and R. Arima (J. 
Math. Kyoto Univ., 4 (1964)). L. Hórmander 
(Acta Math., 121 (1968)) treated the case of 
compact manifolds without boundary and 
obtained the best possible error estimate. 
H. P. McKean and I. M. Singer (J. Differential 
Geometry, 1 (1967)) treated the case of mani- 
folds and discussed the geometric meaning of 
this formula. In general, N(T) is no more than 
O(T"") if L is of degenerate elliptic type (C. 
Nordin, Ark. Mat., 10 (1972)). 


N. Equations of Degenerate Elliptic Type 


An operator L of the form (1) is said to be 
degenerate at x? eG in the direction £e R" if E 
is a null vector of the matrix (a,(x°)). L is said 
to be of degenerate elliptic type if (a,,(x)) is 
nonnegative definite at any xeG and if L 
is degenerate at some point of G in some 
direction. 

Suppose that the coefficients of L and the 
boundary I of G are smooth enough. At xer, 
denote by v(x) 2 (v, (x), ..., v,(x)) the unit outer 
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normal vector to G. Let X, be the set of xe F 
at which L is not degenerate in the normal 
direction. Also, let Z1, 25, and Le be the sets 
of xeI'N 2, at which b(x) 0, «0, and =0, 
respectively, where b(x) is defined by 


n n Q 
b= Baal. ZEIL an 


Then the Dirichlet problem for equation (1) is 
to find a function u(x) defined on GU X, U X, 
satisfying 


L{uJ=f in G, (1) 
u=g on 2,U23, (12) 
where f and g are given functions. 


Let 1<p<oo. We set q— p/(p — 1). We also 
put 


Ob ED "aal 


c*(x)— c(x)— $ ax, AE Óx;Ox; 








(13) 


We have the following existence theorem [28]: 
If (i) either c «0 on G or c* <0 on G and if 

(ii) pc + qc* <0 on G, the Dirichlet problem 

(1) and (12) (with g 20) has a weak solution 

ue L'(G) for any f e L'(G). The regularity of 
solutions is also discussed in [28]. The value of 
b(x) is closely related to the regularity near the 
point xer if L is degenerate at x in the nor- 
mal direction (— also M. S. Baouendi and C. 
Goulaouic, Arch. Rational Mech. Anal., 34 
(1969)). 

Degenerate elliptic equations of type (1) 
have also been investigated from the proba- 
bilistic viewpoint (— 115 Diffusion Processes). 
The general boundary value problems for 
degenerate elliptic equations of higher order 
have been treated by M. I. Vishik and V. V. 
Grushin [29], N. Shimakura (J. Math. Kyoto 
Univ., 9 (1969)), P. Bolley and J. Camus (Ann. 
Scuola Norm. Sup. Pisa, IV-1 (1974)), and 
others. 
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A. Quasilinear Partial Differential Equations 
and Their Characteristic Curves 


Suppose that we are given a tquasilinear 
partial differential equation 


È hu n-o, üh- sn sach (1) 


A curve defined by a solution x; — x;(t), u— u(t) 
of the system of ordinary differential equations 


dx; 


dr =P(x,u) izl,..,m 


du 

dt Fee Q(x, u) 

is called a characteristic curve of (1) (— 320 
Partial Differential Equations; 322 Partial 
Differential Equations (Methods of Integra- 
tion)). A necessary and sufficient condition for 
u — u(x) to be a solution of (1) is that the char- 
acteristic curve passing through any point 

on the hypersurface u — u(x) (in the (n+ 1)- 
dimensional xu-space) always be contained in 
this hypersurface. For example, the character- 
istic curve of 37, x;Ou/Ox; — ku is x; AH el, 
u=u°e™ (a solution of x; 2 x;, u = ku). There- 
fore the solution u — u(x) is a function such 
that u(Ax,, ..., Ax,)  A*u(x,, ..., Xn) (A e' zs 0), 
i.e., a homogeneous function of degree k. 


B. Nonlinear Partial Differential Equations 
and Their Characteristic Strips 


We denote the value of Ou/Ox; by p; and define 
the surface element (or hypersurface element) 
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by the (2n + 1)-dimensional vector (x, u, p) = 
Le, Xns U, Pis +-+» Pn). Consider the partial 
differential equation 


F(x,,... 


,Pa)=0, = Qu/óx;. 


(2) 


» Xn. US Pise 


A set (x(t), u(t), p(t)) of surface elements de- 
pending on a parameter t and satisfying the 
system of ordinary differential equations 


dx; du on dp; 
=F,, ER i — [= —(F + p;F, 
dt Pi apr nf RE dt ( ,* Pi 





is called a characteristic strip of equation (2), 
and the curve x = x(t), u=u(t) is called a char- 
acteristic curve of (2). For quasilinear equa- 
tions, this definition of characteristic curve 
coincides with the one mentioned in Section A. 
Furthermore, an r-dimensional tdifferentiable 
manifold consisting of surface elements satisfy- 
ing the relation 


du = y p;dx; =0 
i-1 


is called an r-dimensional union of surface 
elements. A solution of the partial differential 
equation (2) is, in general, formed by the set 
of all characteristic strips possessing, as ini- 
tial values, surface elements belonging to an 
(n — 1)-dimensional union of surface elements 
satisfying F(x, u, p) - 0. 

An example of a nonlinear partial differen- 
tial equation of first order is pg —z =0 (where 
Xı =X, X2 =Y, u—Z,p, =p, p, =4). The equa- 
tions of the characteristic strip are x' =q, y' =p, 
Z =2pq, p' =p, q' — q, and therefore the char- 
acteristic strip is given by y= yg + (po/qo)x, 

z— zo + 2pox t (Po/9)X^, p = Po ^ (Po/q0) x, 

q— qo + X (if we take x as an independent vari- 
able and impose an initial condition y= yo, 
2-29, P= Po, 4= qo at x=0). Putting Zu = 
W(yg) (an arbitrary function) for x =0, we 
have, furthermore, y = yo + W(yo)(W ‘(yo))?, z= 
W(yo)- 2W(yg)/W'(yo) + W(yo)x?/(W'(yg)?. 
The elimination of y from these expressions 
yields a general solution z — z(x, y). In this 
case, a tcomplete solution is 4az — (x + ay + b)? 
(where a, b are constants), and a tsingular 
solution is z ^0. 


C. Complete Systems of Linear Partial 
Differential Equations 


For functions P(x) (i21, ..., x 2(x,, ..., x,)) of 
class C”, define a differential operator X by 


d ; 


We call k differential operators X;— | 
Ei- Pi(x)3/ðx, (i9 1, ... k) mutually inde- 
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pendent when the rank of the matrix (Pj) is 
equal to k. If a system of k independent linear 
partial differential equations involving one 
unknown function f(x), 


X,f=0,...,X,f=0, (3) 


has the maximum number n —k of independent 
"integrals, then the system (3) is called a com- 
plete system. A necessary and sufficient condi- 
tion for the system (3) to be a complete system 
is that there exist k? functions 47; (x) of class 
C? such that 


[X, Xj] e X,X, — X, X, ax. 





that is, 
S EE OF} k 
Ay P}. 
à (n "ës, sl, à de 
Here, [ X;, X;] is a differential operator of first 


order, called the commutator of the differential 
operators X;, X; or the Poisson bracket. 


D. Involutory Systems 


For two functions F(x, u, p), G(x, u, p) of x, u, p 
of class C”, we define the Lagrange bracket 


[F, G] by 
0G 0G 
Hep ZU SE i: x) 
ôG (| OF OF 
“ap, \ax, "Brëll 


If F, G do not contain u and are homogeneous 
linear forms with respect to p, then F and G 
are differential operators F = X,u and G= X,u 
with respect to u (for p, = du/dx,), and we see 
that [F, G] 2 [ X,, X,],. This bracket has the 
following properties: 


[F, G]- —[G, F], 





[F, o(G,, ..., G)]= dae gif G;], 


CLF, a H], F]«LLH. F], G] 


“Ee, m "rm na St G]. 

When F, G are functions of x and p only, we 

usually use the notation (F, G) and call it also 

the Poisson bracket. In this case, the right- 

hand side of the third relation vanishes. 
Consider k partial differential equations 

involving one unknown function u(x,, ..., x,), 


Qu 
Ox, 





F(xwp)-0, i=1,...,k; D (4) 


If a common solution u(x) of these equations 
exists, it is also a solution of [F, F;] 20 (i, j= 
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1,...,k). Therefore, from the equations thus 
obtained, we take independent equations and 
add them to the original system. If we then 
have more than n+ 1 equations, the original 
system has no solution. Otherwise, we obtain 
a system F, 0 (j=1,...,r) for which the F, are 
independent (i.e., the rank of (OF;/ép,) is equal 
to r), and all [F;, F;] =0 can be derived from 
F,=0. A system (4) such that [F,, F;] «0 for all 
i, j is called an involutive (or involutory) system. 
We always treat a system by extending it to an 
involutory system. When k = 1, we regard the 
equation itself as an involutory system. 

When the equations (4) are mutually inde- 
pendent, a necessary and sufficient condition 
for them to have in common a solution with 
n — k degrees of freedom (a solution that coin- 
cides with an arbitrary function on an ade- 
quate manifold of dimension n — k) is that the 
system (4) be a system of equations involving 
unknowns p and equivalent to an involutory 
system. 

An involutory system (4) can be extended 
to an involutory system consisting of n inde- 
pendent equations by adding n — k suitable 
equations 


PM, P) = Oy 41, ..., F(X, U, p)—a,. (4) 


That is, we can find successively f =F, (=k + 
1, ...,n) such that F, satisfies the system of 
equations 


[F,f]-20, i-L.. 


which is a complete system of linear partial 
differential equations for f, and the F; (i— 

1, ...,n) are mutually independent. Then, if we 
find that p, as functions of (x, u, a) (a ss (a,4,, 
..., 0,)) from (4) and (4), the system of total 
differential equations 


Ou 


— = /,(x, u, a), 
zm [A ) 


,L—1, 


v=1,...,N, 


is tcompletely integrable, and we can find u as 
a solution containing essentially n — k + 1 
parameters c, à,.,, ...,a,, that is, a complete 
solution of (4). Moreover, if we have an in- 
volutory system of n+ 1 independent equa- 
tions F, 20,...,F, Z0, F,4,2a,.4,..., Fu 
a,+,, then we find a complete solution by 
eliminating p,, ..., p, between the equations. 
This method of integrating an involutory 
system is called Jacobi's second method of 
integration. 


E. Relation to the Calculus of Variations 


Consider a partial differential equation of first 
order F(x,, ..., Xa, U, Pis ..., p,) =O. If a solution 
u(x) of this equation is given as an implicit 
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function by o(x,, ..., x,,u) 20, we have 


Op/OÓx, EISE? 
óp/0u ` ` Gopiëu 








SE 


This gives formally a partial differential equa- 
tion with independent variables u, x,, ...,x, 
and a dependent variable o. It does not con- 
tain o explicitly. That is, this equation has the 
form 


F(x,, ...,X444,009/0x,, -..,00/0X,4,)=0. 


Then, by finding a partial derivative, say 
din, as a function of the remaining ones 
from the displayed equation, we get a partial 
differential equation of the form 


0o/0t + H(t, x, 0p/0x) — 0, 


which is called the normal form of the partial 
differential equation of first order. Setting p; — 
0o/0x;, the equations of the characteristic 
curve of this equation are 

dx 


dp, 
di p, (b x, p), Se H, (t, x. p), 


which are called Hamilton's differential 
equations. 

Now, consider the tEuler-Lagrange dif- 
ferential equations 


dF, /dt—F,,=0, i=1,...,n, 


for the integral 


S dx 
J=) Fi(t,x,x’)dt, '=—, 
| (t, x, x') x di 


Under the assumption that det(F,;..) #0, we 
put F,,— pi and solve these relations with 
respect to x; in the form, say, x;=«,(t, x, p). 
Furthermore, if we put 


n 
(È T = H(t, x, p), 
v=1 x'=ọ(t,x, p) 

then the Euler-Lagrange equations are equiva- 
lent to Hamilton’s differential equations 


dx;/dt=H,, dp,/dt=—H,, i=1,...,n, 


since F(t, x, x')= Lj. p;H,, — H. 

A curve represented by a solution of the 
Euler-Lagrange equations is called a station- 
ary curve. Now consider a family of stationary 
curves in a domain G of the (n+ 1)-dimensional 
tx-space such that passing through every point 
of G there is one and only one curve in this 
family, and suppose that the family is ttrans- 
versal to an r-dimensional manifold 9I (r x n) 
(that is, Fót — 2, F,,6x;=0 for the differentials 
ôt, dx; along A; in particular, if 9I consists of 
only one point (r 20), a stationary curve pass- 
ing through this point is transversal to 9I). In 
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this case, if we denote by V(t, x) the value of 
the integral J along the stationary curve from 
9I to any point (t, x) of G, then the equation 


0V/0t H(t, x, 0V/0x) 20 


holds. This equation is called the Hamilton- 
Jacobi differential equation or the canonical or 
eikonal equation. Conversely, a solution of this 
equation ts equal to the value of the integral J 
for a family of stationary curves transversal to 
an adequate 9I. 


F. The Monge Differential Equation 


Consider the partial differential equation (2). 
By eliminating p and t between 


—-F, a 5 p» and F(x,u,p)=0 


(for example, by eliminating p between dx;/dx, 
= F,,/F,, and F=0 when F, #0), we obtain 


M(x,, ..., Xps U, 0x5/0x,, ..., 0x, /0x,) —O. 


This equation is called the Monge differential 
equation, and the curve represented by its solu- 
tion is called an integral curve of the equation. 
In the (n + 1)-dimensional tx-space, a curve 
that is an envelope of a 1-parameter family of 
characteristic curves of the partial differential 
equation (2) is a solution of the Monge equa- 
tion. À characteristic curve is also an integral 
curve. When n —2, an integral curve that is not 
a characteristic curve is a tline of regression of 
the surface generated by the family of charac- 
teristic curves tangent to the integral curve 
under consideration (which is an integral sur- 
face of F(x, u, p) 20). If F is linear mp, i.e., 
quasilinear, all integral curves coincide with 
characteristic curves. 
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325 (XIII.25) 
Partial Differential Equations 
of Hyperbolic Type 


A. Second-Order Linear Hyperbolic Equations 


A thnear partial differential equation in n+ 1 
variables t, x —(x,, ..., x,) of the second order, 


ie Qu 


-$a “aa 


Qu 


D ETT 0xjÓx; 


ij 


L[u]- 


Ou A Qu 
Ee: (1) 


with coefficients ao;, ..., a that are functions in 
(t, x) is said to be hyperbolic (or of hyperbolic 
type) (with respect to the t-direction) in tx- 
space if the characteristic equation of equation 
(1) considered at each point of tx-space, 


H(t, x; À, ¢)= -Y doiSi DE Y duër Š; =0, 


i,j=1 

(2) 
has two distinct real roots A=, (t, x, é), 
A,(t, x, €) for any n-tuple of real numbers ¢ = 
(i.e Ča) (0, ..., 0). In particular, (1) is called 
regularly hyperbolic if these two roots are 
separated uniformly, that is, 

lim |A,(t, x, 6) — Ax(t, X, é)|=c>0. 

(t, x),]£| 71 
A typical example of hyperbolic equations is 
the wave equation 


Ou u Qu 


Eu a Ta 


=Q. (3) 
Equation (3) is also called the equation of a 
vibrating string, the equation of a vibrating 
membrane, or the equation of sound propaga- 
tion according as n— 1, 2, or 3. Another exam- 
ple is 


Zu ðu ` Ou 





which describes the propagation of electric 
current in a conducting wire with leakage and 
is called the telegraph equation (— Appendix 
A, Table 15). 

A hyperplane A(t — t9) - ... + €,(x,— x9) =0 
passing through a point p? — (t9, x?) in tx-space 
and having normal direction (A, £) is called a 
characteristic hyperplane of (1) at p? if the 
direction (A, €) satisfies the characteristic equa- 
tion at p?: H(t9, x°; 4, £) - 0. A hypersurface S: 
s(t, x) 20 in tx-space is a characteristic hyper- 
surface of (1) if at each point of S the tangent 
hyperplane of $ is a characteristic hyperplane 
of (1), that is, H(t, x; s,, s,) 20 everywhere on 
S. According to the theory of first-order par- 
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tial differential equations, a characteristic 
hypersurface of S is generated by so-called 
bicharacteristic curves, i.e., solution curves t = 
t(t), x(t) of a system of ordinary differential 
equations 





du dx, 0 

de © O? d © 
da dic 

dr to 4 «879. dt ae Xp? 
H(t, x; 4, 6) - 0. 


Now if (1) is hyperbolic, the set of all charac- 
teristic hyperplanes at p? {A(t—1°)+... + &,(x, 
— x0) «0| H (t9, x9; 4, £) 2 0] has as its envelope 
a cone C(p?) with the vertex p°. Moreover, 
since the intersection of any hyperplane t = 
constant and the cone C(p?) is an (n— 1)- 
dimensional ellipsoid or two points for n= 1, 

a conical body D, (p?) (D (p?)) is determined, 
whose boundary consists of the part of C(p°) 
with tÈ ty (t & t9) and the interior of the ellip- 
soid on the hyperplane t — constant. A tsmooth 
curve y in tx-space is called timelike if the 
tangent vector of y at each point p on y be- 
longs to D, (p) or D (p). Consider the set of 
points that can be connected with the point p? 
by a timelike curve. We call its closure an 
emission, and a subset Z, (p?) (2. (pf of the 
closure for which t >t) (t X to) a forward (back- 
ward) emission. An emission is a conical body 
surrounded by characteristic hyperplanes in 
some neighborhood of the vertex p°. If the co- 
efficients of (1) are bounded functions, the 
emissions Z , (p?) are contained in a conical 
body 


K = fe x) | A34. (t — 1°) 2 x (xi SH 
i-1 


independently of the situation of p?, where 
€ = maxX,. x), et (144 (t, X, CH, | As (t, x, CHL 


B. The Cauchy Problem 


Important for the hyperbolic equation (1) is 
the *Cauchy problem, i.e., the problem of 
finding a function u — u(t, x) that satisfies (1) in 
t>0 and the initial conditions 


u(0,x)—ug(x) ` Ou/Ot(0, x) 2 u,(x), (4) 


where the functions uj(x) and u, (x) are given 
on the initial hyperplane t=0. 

Suppose that (1) is regularly hyperbolic and 
the coefficients are bounded and sufficiently 
smooth De, of class C" with v sufficiently 
large). Then for the Cauchy problem the fol- 
lowing theorem holds. Theorem (C): There 
exists a positive integer l {= [n/2] +3), depend- 
ing on the dimension n+ 1 of the tx-space, 
such that if the functions u(x) and u; (x) in (4) 
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are of class CT. then there exists a unique solu- 
tion u=u(t, x) of class C? in the domain 0x t « 
o0, —oo <x;< oo (1 «ix n). Morecver, this 
correspondence (uo(x), u; (x)] ^ u(t, x) is con- 
tinuous in the following sense: If a sequence of 
initial functions (ug, (x), u4,(x)} (k 1,2, ...) 
and their derivatives up to the /th order tend 
to 0 uniformly on every compact set in the 
hyperplane t —0, then the sequence of corre- 
sponding solutions u,(t, x) also tends to 0 
uniformly on every compact set in each hyper- 
plane t — constant. In other words. the Cauchy 
problem for regularly hyperbolic equations is 
twell posed in the sense of Hadamard [2]. 

For dependence of the solution on initial 
data, the following proposition is valid: The 
values of the solution u at a point p? — (t9, x?) 
depend only on the initial data on a domain 
Go (domain of dependence) of the initial hyper- 
plane, which is determined as the intersection 
of the backward emission Z. (p?) and the 
initial hyperplane. We have the following dual 
proposition: A change in the initial conditions 
in a neighborhood of a point Q, of the initial 
hyperplane induces a change of values of the 
solution only in some neighborhood of the 
forward emission Z, (Q5) (domain of influence). 
If the coefficients of the equation are bounded, 
the intersection of emissions Z , (p?) and the 
hyperplane t 2 constant is always compact. 

It follows that the domain of dependence and 
the domain of influence are bounded. In some 
special cases, there exists a proper subdomain 
of G, such that the solution depends only on 
the initial data on the subdomain. For exam- 
ple, for the wave equation (3) with n=3, the 
solution for the Cauchy problem at a point 

p? — (t9, x9) (— Section D) is determined, as 
can be seen from the solution formula (12), 

by the initial data in a neighborhood of the 
cone with vertex p9:(t — 19)? = Y? (x, HH, 
namely, in a neighborhood of the intersection 
of bicharacteristic curves (lines, in this case) 
passing through p? and the initial hyperplane. 
If the solution of the Cauchy problem has 
such a property, it is said that Huygens's prin- 
ciple is valid, or that diffusion of waves does 
not occur. For the wave equation (3), Huy- 
gens's principle is valid only for odd n» 1. 


C. The Energy Inequality 


The energy conservation law for the wave 
equation (3), 


E(t) 

id © (fou? 2 féu\? 
JE E 
— constant, 
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is generalized to the so-called energy inequal- 
ity for hyperbolic equations, which plays an 
essential role in deducing the well-posedness of 
the Cauchy problem and the properties of the 
domain of dependence of the solution. Let the 
coefficients of a hyperbolic equation (1) be 
bounded functions, and let G(1) be the inter- 
section of the conical body K = ((t, x)| À2,,(t — 
OP >>", (x,—x})*} with the hyperplane t= 
t(t<t'). The k(> 1)th-order energy integral 
of the solution u(t, x) of (1) on G(t) is defined 
by 


E? (u, G(1)) 


m agt... ta, &k 


Then the following inequality holds: 
E(u, G(1)) < CEQ (u, G(t?)), 


EI 2 
` dx. 


ore. ox O 








t xctl, (5) 


where the constant C is independent of u. We 
call (5) the energy inequality (J. Schauder [5]). 
For the wave equation (3), the hypothesis / = 
[n/2]+3 in theorem (C) can be replaced by a 
weaker condition l= [n/2] + 2, but if we take 
[— [n/2] +1, there is an example for which no 
global solution of class C! exists. In general, 
even though the initial functions are of class 
C', the solution in the Cauchy problem for 
hyperbolic equations may not be of class C!, 
while if the energy of the initial functions is 
bounded, the energy of the solution is also 
bounded. 


D. Representation Formulas for Solutions of 
the Cauchy Problem 


We consider solution formulas that represent 
solutions of the Cauchy problem explicitly as 
functionals of the initial functions. The prob- 
lem of solving (1) under the condition (4), or 
mere generally, the problem of solving an 
equation L[u] = f(t, x) under (4), can be re- 
duced, by transforming the unknown function 
u and applying fDuhamel’s method, to the 
Cauchy problem with initial conditions on the 
hyperplane t — c: 


u(t,x)=0, du/Ot(t,x)=@(x) (4) 


for arbitrary t. We define a function ,(s) of a 
real variable s by 


La(s)=|s|*/42niy*q! 
= —s'log|s|/(2ni)" q! 


(n odd), 
(n even), (6) 


where n is the dimension of the x-space and q 
is a positive integer such that q +n is even. 
Then for a function g(x) of class C" (with v 
sufficiently large) with compact support, the 
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following equality holds [7]: 


xax = y) w) do, 
(7) 


where A, is the *Laplacian with respect to the 
variables y -(y,, ..., Yn) and do is the surface 
element of the unit sphere |œ|= 1 in x-space. 
Now, since the tprinciple of superposition is 
valid because (1) is linear, we can infer from 
formula (7) that the Cauchy problem (1) with 
initial condition (4) can be reduced to the 
one for initial conditions with parameters 

y, ox 


o= f AGD o(y)dy f 


— o0 lo|71 


u(t,x)=0, Oujét(t, x)= X(x — y)o). EW 


In fact, since y,(s) is (q — 1)-times differentiable 
by definition, the Cauchy problem (1) with 
initial condition (4") has a unique solution 
R(t, x; t, y;@) for q chosen large enough so 
that theorem (C) can be applied to (1) with 
initial condition (4"). Moreover, R,(t, x; t, y; œ) 
is a function of (t, x, t, y, œ) of class C", and v 
increases with q. Now, let g(x) be a function 
of class C" with sufficiently large v and with 
compact support. Then, by (7) and the defini- 
tion of R,, the integral 


| roo f R,(t, xit, y;@)d@ (8) 
|o|=1 


"OO 


is a solution of the Cauchy problem (1) with 
initial condition (4). Therefore, when R, is 
found explicitly, (8) yields a solution formula 
of the Cauchy problem (1) with initial condi- 
tion (4’) as a functional of the initial func- 
tions. Since in (8), the integral foli R,do is 
not necessarily of class C"** as a function of 
(t, x; v, y), AP*9? lu Rado is in general not a 
function in the ordinary sense. But we denote 
it by R(t, x; t, y) formally, and understand that 
a linear operator 


u(t, x)= [ne x; v, y)o(y)dy (9) 


is defined by (8). The kernel R(t, x; t, y) in this 
sense is called a fundamental solution or Rie- 
mann function of the Cauchy problem. If we 
extend the function f,,-, R,(t, x; v, y; o) do de- 
fined for t2 1 to t « t, assigning it the value 0 
there, then R(t, x; c, y) = AC*92 (1 BAL x; 
t, y; 0) dc can be considered a Tdistrt button 
on (t, x; t, y)-space, and the equality L(t, x, 
0/0t, 0/Ox) R(t, x; v, y) — Lëtz, y, 0/01, 0/O y R(t, x; 
t, y) 2 ó(t —t)6(x — y) is valid, where L* is the 
tadjoint operator of L and ô is tDirac’s ó- 
function. In other words, R(t, x; t, y) is a fun- 
damental solution of L in the sense of distri- 
bution theory. 

The fundamental solution R(t, x; t, y) can be 
analyzed using the asymptotic expansion with 
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respect to the sequence of functions {y,(s)}, 
and we have the following important result: 
If the coefficients of (1) are of class C” (resp. 
real analytic), then the fundamental solu- 
tion R(t, x; t, y) is of class C” (real analytic) 
in (t, x) except for points that are on bichar- 
acteristic curves of (1) passing through the 
point (t, y). In the language of the Cauchy 
problem, the smoothness of the solution u at 
a point p— (t, x) depends only on the smooth- 
ness of the initial conditions on a neighbor- 
hood of the intersection of the initial hyper- 
plane and all bicharacteristic curves pass- 
ing through p. This fact is called Huygens's 
principle in the wider sense. Behavior of the 
fundamental solution R(t, x; t, y) near dis- 
continuous points has also been investi- 
gated [2]. 

For the wave equation (3), the fundamental 
solution can be constructed, and therefore we 
can write the solution formula explicitly. The 
solution formula for (3) with initial condition 
(4) for nz 3 is 


u(t, x) 


1 Q^? t 
F sai (t? — 7)" 9? cQ(x, x) dr, 
—2J N 





Q(x, d= f o (x 4- o) do, 
|o|=1 


WI 


where w, =2,/n"/T (n/2) is the surface area 
of the unit sphere of n-dimensional space. 
Solutions of the Cauchy problem (3) with 
initial condition (4) for n=1, 2, and 3 are, 
respectively, 

uo(x -t)--ug(x—1) 1 C 


t, x)= = 
RE 2 +5 





u, ()dč 
(10) 


(d’Alembert’s solution), 


al uo(6,, 65) d6, dé, 
on Ot Je, t? —(x1 — WEE 


1 u (či, Cold dé, 

















+ (11) 
2n caf t — (x, —&y -(x, —6y 
(Poisson's solution), and 
1 6 | Mot, 65,63) 
EE 0 x 3 do, 
I us (51, 62. 63) 
RP E do, (12) 


(Kirchhoff's solution), where C, is a disk in the 
$165-plane with center (x,, x5) and radius t, D, 
is a sphere in the €, č, £,-space with center 
(X1, X5, X3) and radius t, and dw, is the surface 
element of D. 
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E. Second-Order Nonlinear Hyperbolic 
Equations 


A second-order nonlinear differential equation 
^u a Gu du Ou  Q?u 
SACO. T XU, usa 
ot? Ot Ox, tx, Ox,0x; 
Ixi "en (13) 


is called hyperbolic in a neighborhood of a 
function U (t, x) if the linear equation of the 
form (1) obtained from (13) is hyperbolic, 
where dot, x) and a;,(t, x) are determined 

by substituting u by U(t, x) in the partial de- 
rivatives of A with respect to O^ u/Ctóx; and 
67 u/Ox;Ox;, respectively. If the functions A 

in (13) and U = xọ(x)+ tu, (x) determined by 
(4) are sufficiently smooth with respect to 

t, X, u, ..., O^ u/Ox;Ox;, the Cauchy problem for 
(13) with initial condition (4) has a unique 
solution in some neighborhood of the initial 
hyperplane under the condition that equation 
(13) is hyperbolic in a neighborhoed of U. In 
general, initial value problems for nonlinear 
equations have only local solutions. 


F. Higher-Order Hyperbolic Equations 


An Nth-order linear differential equation in 
n+ 1 variables t, x Z(x,, ..., x,) with constant 
coefficients 


ô ô Q Xt der y 
L ES Ge Boa) 4. AUT 
(5 x) itat i (5) Es) i 
=0, (14) 


where a —(2,, ...,0,), |x| =a, +...+4,, and 


ER oe 
(=) EZE 


is hyperbolic in the sense of Gárding if the 
following two conditions are satisfied: (i) the 
partial derivative 0"/0t" appears in L; (ii) the 
real parts of the roots 4 — A,(£), ..., Ay() of the 
characteristic equation L(/, i£) 2 0 are bounded 
functions of real variables =(é,,...,€,). 
When L is a homogeneous equation of the 
Nth order, condition (ii) is equivalent to the 
following condition: (ii) 4, (£), ..., A4(£) are 
purely imaginary for all real Z —(£,,..., Č) Æ 
(0, ..., 0). The principal part (consisting of the 
highest-order terms) of a hyperbolic equation 
is also hyperbolic. If (14) is hyperbolic, a 
theorem analogous to theorem (C) holds for 
the Cauchy problem for (14) with initial 
conditions 





O*ufot*(0, x)= u(x) O<k<n—1; (15) 


that is, the Cauchy problem for (14) with initial 
condition (15) is well posed in the sense of 
Hadamard. Conversely, if the Cauchy problem 
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for (14) with initial condition (15) is well posed, 
then (14) must satisfy the hyperbolicity con- 
ditions (i) and (ii) in the sense of Gárding. In 
other words, well-posedness of the Cauchy 
problem is equivalent to hyperbolicity in the 
sense of L. Gárding [14]. Gárding's conditions 
for hyperbolicity cannot be generalized to the 
case of variable coefficients, since the influence 
of the lower-order terms in the equation is 
taken into account in the definition of hyper- 
bolicity. However, in the case of constant 
coefficients, an N th-order homogeneous equa- 
tion remains hyperbolic for any addition of 
lower-order terms if and only if the character- 
istic equation has N distinct purely imaginary 
roots for any real £ —(£,, ..., Č} x (0, ...,0). In 
this case the equation is called hyperbolic in 
the strict sense. Thus a linear equation with 
variable coefficients 


ôu 6 \*/ dv 
L[u]- — £x] (> 
Tre Y aste(z) (2) a 
oN 
=0 | (16) 


is called hyperbolic in the sense of Petrovskii if 
the characteristic equation 
A+ Y daalt, x)A%(iG)" —0 

aotjal=N 
has N distinct purely imaginary roots (called 
characteristic roots) 4, (t, x, č), ..., An(t, x, £) for 
each point p — (t, x) and each č 40. Moreover, 
if the characteristic roots 4,, ..., Ay are sepa- 
rated uniformly, i.e., the inequality 
a" cp |Aj(t, x, 6) — A(t, x, 0| 9 c» 0 
holds, (16) is said to be regularly hyperbolic. In 
the second-order case, this definition is equiva- 
lent to the previous one. Theorem (C) holds 
for the Cauchy problem for a regularly hyper- 
bolic equation (16) with initial conditions 
(15). For the domain of dependence of the 
solution, a result analogous to the case of the 
second-order equation can be obtained using 
an energy inequality [9, 10]. If the coefficients 
are of class C”, Huygens's principle in the 
wider sense is valid, that is, discontinuity of the 
solution is carried over only along bicharacter- 
istic curves. 


G. Systems of Hyperbolic Equations 

For systems of equations 

y Lj[u]-0, 1<i<l, 

ge 

where the L;; are higher-order linear differen- 
tial operators of the form (16), several types 


of hyperbolicity are formulated in connection 
with the well-posedness of the Cauchy prob- 
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lem. We take up two important types, hyper- 
bolicity in the sense of Petrovskii and sym- 
metric hyperbolicity due to Friedrichs. 

We call a system of linear differential 
equations 


Y Y» at jeans 
Ei sehen "7 ge ôt Óx Ka 


1<i<l, (17) 


a system of hyperbolic differential equations (in 
the sense of Petrovskii) if the determinant 


" ô Vof Q Y 
dan, m 


calculated formally using the matrix of dif- 
ferential operators in the system, is hyperbolic 
in the sense of Petrovskii as a single equation 
of Ni Eu n)th order. Petrovskii showed 
that the Cauchy problem for a system that is 
hyperbolic in this sense is well posed [10]. 
There were some imperfections in his argu- 
ment, which have been corrected by others 
(^ S. Mizohata [12]). In the case of constant 
coefficients, the Cauchy problem for (17) is 
well posed if and only if (18) is hyperbolic in 
the sense of Gárding. 

K. O. Friedrichs, observing that the energy 
inequality played an essential role in Petrov- 
skiřs research, studied symmetric hyper- 
bolic systems of equations, since for them 
the energy inequalities are valid most natu- 
rally. A system of first-order linear differential 
equations 


Ou A ĝu 
Aalt, IA A;(t, Do rs (19) 


is called symmetric hyperbolic (in the sense of 
Friedrichs) if the matrices A,(t, x) (0 € i X n) are 
symmetric and Ao(t, x) is positive definite. A 
typical example is provided by Maxwell's 
equations. For this system it has been shown 
that the Cauchy problem is well posed and 
the domain of dependence of the solution is 
bounded [13]. 


H. Weakly Hyperbolic Operators 


We adopt the following definition of hyper- 
bolicity: a linear differential operator of Nth 
order 


p 0 Vofí QV 
L=— t,x)(—) (— 
ot" nue A) (=) (2) 


is called hyperbolic if the Cauchy problem for 
L[u]=0 with initial condition (15) is well 
posed in Hadamard's sense. A hyperbolic 
operator L is called strongly hyperbolic if L 
remains hyperbolic for any addition of lower- 
order terms, and weakly hyperbolic otherwise. 
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À necessary condition for hyperbolicity is 
that all characteristic roots of Ly(t, x, 4, £) 20 
be real for any (t, x, €) (P. D. Lax [15], Mizo- 
hata [16]). In the case of constant coefficients, 
strongly hyperbolic operators have been char- 
acterized by K. Kasahara and M. Yamaguti 
(Mem. Coll. Sci. Kyoto Univ. (1960)). 

As for operators with variable coefficients, it 
is known that not only regularly hyperbolic 
operators but also some special classes of not 
regularly hyperbolic operators are strongly 
hyperbolic (V. Ya. Ivrii, Moscow Math. Soc., 
1976). Let us consider the hyperbolicity of an 
operator L which is not regularly hyperbolic 
under the assumption that the multiplicities of 
the characteristic roots are constant and at 
most 2; namely, the characteristic polynomial 
Lat, Ass Ait Y a, alt, X) A 67 


agtlal=N 
to XN-1 


is decomposed in the following way: 
Lait, X, A, č) 
s N-s 
PE lI (4 Ke A(t, X, oy II (4 — Aj(t, X, SI 
j=l 


j=s+1 
lim |Aj(t, x, č)— A(t, x, £)| - c» 0. 
e 


Assume that the Aj(t, x, č) j=1,2,...,N—S, 
are real. Then L is hyperbolic if and only if it 
satisfies E. E. Levi’s condition, i.e., 


1 / O^ Ly A, 
ix do | us «x 02? a 


2 
^ OLy Od, 
=0 
* A aiat, SH , 











for all (t,x, č) 552,55 


where Ly. , denotes the homogeneous part 
of (N — 1)th order among lower-order terms 
of L (Mizohata and Y. Ohya [17]). Thus, 

for a weakly hyperbolic operator with vari- 
able coefficients, even the principal part is 
not necessarily hyperbolic. J. Chazarain [18] 
has studied weakly hyperbolic operators 
with characteristic roots of arbitrary constant 
multiplicity. O. A. Oleínik [19] studied the 
Cauchy problem with nonconstant multiplic- 
ities for second-order equations. For higher- 
order equations, if the multiplicity of charac- 
teristic roots at (f, X) is at most p, or more 
precisely, if there exist positive rational num- 
bers q and r (q Èr) such that 


QV, 

(5) Ly(t, x 0, Cl z0 
OA 

and 


ô o ' Lylé, 2,0, €)=0 
01) Mot) Vx). Vox) Ext %OO= 


(“EER"\0) 
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for xo +|a|+ gy +r|B|<p, then it is necessary 
for the well posedness of the Cauchy problem 
that 


Q Vo / 0 V f/ 0 MH f/ ð M t. E 
ONE, wun 


(“EER"\O) 





for ao + |a| J- as +r|B|+s(1+q)<p are satis- 
fied, where Ly_, (1 «sx N) are the homoge- 
neous lower-order terms of order N —s of L 
(Ivrii and V. M. Petkov [20]). 

On the other hand, the sufficient condition 
in the case of multiplicity 2 is given by some 
conditions related to the subprincipal symbol 

1 (== EE ) 
N-1773 "ausa D 

2\ dACt ZO Gs, 





which corresponds to Levi's condition in 
the case of constant multiplicity, and to the 
*Poisson brackets (A. Menikoff, Amer. J. 
Math., 1976; Ohya, Ann. Scuola Norm. Sup. 
Pisa, 1977; L. Hórmander, J. Anal. Math., 
1977). 

The Cauchy problem for a weakly hyper- 
bolic system of equations is more complicated, 
because of the essential difficulty that the 
matrix structure 


( 2, ad xe) 
a9 t la| n; 


associated with (17) is not clear in general (— 
references in [20]). 


I. Gevrey Classes 


Classically, the functions of class s (sz 1) 

of Gevrey (— 58 C?-Functions and Quasi- 
Analytic Functions G; 168 Function Spaces B 
(14) were introduced into the studies of the 
fundamental solution for the heat equation: 


Je. (R^) = (o(x)e C*(R")|for any compact 
subset K of R" and any rnulti-indices 
a,there exist constants Cx and A 
such that sup, |(6/éx)* o(x)| « 
Cx A" |a|t5). 


This class of functions was used efficiently in 
the studies of the Cauchy problem for tweakly 
hyperbolic partial differential equations: 


Tt oS E 
c u(t, x)= f( , X) 


in [O, T] x R"ZQ, 
e MV 
(=) u(0,x)2u(x) j=0,1,...,N—1. 


We assume that the multiplicities of the char- 
acteristic roots are constant, i.e., 


k 
Ly(t, X, A, ¢) = I] (A X At, x, e", 
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where v, is constant for any (t, x, eO x R”, 
A,(t, x, £) is real and distinct, and E$- v; — N. 
Let max, xs p. If we suppose that L(t, x, 
6/at, 0/0x) — E(t, x, 0/0t, 0/Ox) is a (tpseudo) 
differential operator of order at most q, where 


p ; ð d 
i=1 mi E at’ dx 


GD 


is a (pseudo)differential operator with a,(t, x, 
0/6t, 6/6x) being strictly hyperbolic (pseudo) 
differential operators associated with Ly, 
then for any s such that 1 € s « p/q, the 
Cauchy problem is well posed in y® (Q), 
provided that all a, a(t, x) of L and f(t, x) 
belong to y(2 (Q), and that {u,(x)}5<jcn—1 are 
given in y9. (R") (Ohya [21], J. Leray and 
Ohya [22]). This result was proved even for 
the case of arbitrary nonconstant multiplicity 
of characteristic roots by M. D. Bronshtein 


[23]. 


J. Lacunas for Hyperbolic Operators 


The theory of lacunas of fundamental solutions 
of hyperbolic operators, initiated by Petrovskiï 
[24], has been developed further in a paper 
[25] by M. F. Atiyah, R. Bott, and L. Gårding. 
Let L(£) 2 Ly(č)+ M(£) be a hyperbolic 
polynomial with respect to the vector 9e R" —0, 
where L,(€) is the principal part of L; this 
means that Ly(0) 4 0 and L(¢+t@)40 when 
|Imt| is sufficiently large. Then L has a funda- 
mental solution E = E(L, 0, x) in the form 


E(L, 6, END 


where c is sufficiently large and the integral 

is taken in the sense of tdistribution. The con- 
vex hull of the support of E, denoted by K — 
K(L,0) — K(A, 0), is a cone depending only 

on the real part Re A of the complex hyper- 
surface A: L(£) 20, and contained in the union 
of the origin and the half-space x: 00. Let A; 
be the ttangent conoid of A at £, transported 
to the origin, and define the wavefront surface 
W(A, 0) by the union of all K(A,,0) for č 40. 
Then it can be shown that the singular sup- 
ports of E(L, 0) and all the E(L4, 0) are con- 
tained in W(A, 0) and, moreover, that they are 
locally holomorphic outside W. In [25], the 
Herglotz-Petrovskii-Leray formulas are gen- 
eralized to any nonstrict L(¢). Thus we have 


DPE(L*, 0, x) 
= const | (x: EE LA) "oč, g>0, (20) 
DPE(LK, 0, x) 


= const | (x- EEF LE(E) *o(£), oe 
tx: 02* 


Q0) ` 
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when xe K(A, 6) \ W(A, 0). Here, g=mk —|B|—n 
is the degree of homogeneity of the left-hand 
side, and o — at INTE ^... Aden 

... Adé,. The integrands are closed trational 
(n — 1)-forms on (n — 1)-dimensiónal complex 
tprojective space and are integrated over cer- 
tain thomology classes of = a(A, 0, x)* and t, 
Oo? These formulas provide means of obtain- 
ing topological criteria for lacunas. Let Z c 
K be a maximal connected open set, where 
E(L, 0, x) is holomorphic. .Z is said to be a 
weak (strong) lacuna of L if E(L, 0, x) is the 
restriction of an entire function to .Z(E(L, 0, x) 
=0 in SL In [25] it is shown that x belongs 
to a weak lacuna for all E(L*,, 6,-) if and only 
if Ga? =0. The sufficiency directly follows from 
(20^) and the necessity follows from a theo- 
rem of A. Grothendieck (Publ. Math. Inst. 
HES, 1966) which implies that the rational 
forms which appear in (20) span all the tco- 
homology classes in question. 


K. Mixed Initial-Boundary Value Problems 


Let Q be a domain in R" with a sufficiently 
smooth boundary I, let L(t, x, 0/0t, 0/0x) be a 
linear hyperbolic operator of Nth order de- 
fined in [0, oo) x Q — ((t, x)| te[0, oo), xeQ}, 
and let B,(t, x, 0/0t, 0/0x), j — 1, 2, ..., b, be linear 
differential operators of N;th order defined in a 
neighborhood of [0, oo) x I. 

The problem of finding a function u(t, x) 
satisfying the conditions 


L[w]-0 in (0,0)xQ, 
Bj[w]-0 on (0,o5))xT, j=1,2,...,b, 
O*u/Ot*(0, x) 2u,(x) O<k<N-1, (21) 


is called a mixed initial-boundary value prob- 
lem. A typical example of such a mixed prob- 
lem is provided by the case L= (] (n=2) and 
B[u]- (t, x), which describes the vibration of 
membranes with a fixed boundary. 

The mixed problem (21) is said to be well 
posed if for any initial data u,(x)e C*(Q), 
OxkxN —1, which are compatible with the 
boundary conditions, there exists a unique 
solution u(t, x)e C*([0, oo) x Q). 

Mixed problems for second-order hyper- 
bolic equations are considered in [6]. In regard 
to mixed problems for hyperbolic equations 
of higher order, we make the following four 
assumptions: (i) Q=R", = {(x', x,)| x' eR" `, 
Xa > 0]; (ii) T = {x|x,=0} is not characteristic 
for L or B; (iii) L is regularly hyperbolic; 

(iv) Nj; & N — 1 and N;z N, if j  k. 

We denote the principal parts of L and B, 
by Ly(t, x’, x,, 0/0t, 0/0x', 0/0x,) and Belt, x’, 
6/ét, 0/0x', 0/0x,), respectively. By the hyper- 
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bolicity of L, an equation in x 


Ly(t,x’,0,4,¢,«)=0 for ImA«0, éeR™! 


has u roots x; with Imxj' >0, and N — u roots 
k; with Imx; «0, and the number u is inde- 
pendent of (t, x’) and (A, EL A necessary con- 
dition for the well-posedness of the mixed 
problem (21) is that the number of boundary 
conditions coincide with this integer u. The 
function R defined by 





R(t, x’, AC) 
Blix ER) 
al jolt > : ) ax ; 
c L (t, x’, 4, a |j. [71.2.4 
where L* Z [I («— kj (5x, 4, €)) and C isa 


contour enclosing all x , is called a Lopatinski 
determinant. 

We say that L and B; satisfy the uniform 
Lopatinski condition if 

inf — |R(,x', 4, 2)| 2c» 0. 
(t, x'),.ImA«0 
ie ela 

When the uniform Lopatinski condition is 
satisfied, the mixed problem (21) is well posed, 
and (21) represents a phenomenon with a finite 
propagation speed, which is the same as that 
of the Cauchy problem for L[u] =0 (T. Bala- 
ban [26], H. O. Kreiss [27], and R. Sakamoto 
[28]). An analogous result holds in the case 
of a domain Q with a compact boundary T, 
provided that L and B; satisfy the uniform 
Lopatinski condition at every point of I. 

In the treatment of mixed problems for L 
and B, not satisfying the uniform Lopatin- 
ski condition, the well-posed problems have 
been characterized for operators with con- 
stant coefficients when QO =R”, (Sakamoto 
[29]). For general domains, however, the well- 
posedness of mixed problems depends not 
only on the properties of the Lopatinski deter- 
minant but also on the shape of the domain 
(M. Ikawa [30]). 


L. Asymptotic Solutions 


In order to explain some properties of phe- 
nomenon governed by hyperbolic equations, 
asymptotic solutions play an important role. 
Consider, for example, the acoustic problem 


Qu=0 in (0, 00) x Q, 
u=0 on (0, oo) x I. 
Let w(t, x; k) be a function defined in (0, o0) x 


Q with parameter k 21 of the form 


N 
w(t, x; k) = ele 2 v(t, SIE) (22) 
= 
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where ¢ is a smooth function satisfying the 
eikonal equation |Vo|^ = 1. If v, j 0,1, ..., N, 
satisfy the transport equations 


Ov; 
2-14 2V: Vue Age On, 9.70, 
O 


we have 
[]w - O(k ML 


Then w of (22) is an approximate solution of 
[]u —0 for large k, and it represents a wave 
propagating in the direction Vo. 

When supp wN (0, oo) x Iz o, if w hits the 
boundary I transversally, we can construct 


N 
w*(t, x; k) e eo (79 V v(t, ak) 
j=0 


such that 
[Vo*|-21 inQ, ọt=ọ, and 


Co 
—->0 on F, 
OV 


and vj' satisfy the transport equations and v; 
= —v; on (0, oo) x T, where v is the unit inner 
normal of I. Then w+ w* is an approximate 
solution, and w* represents a reflected wave 
propagating in the direction 


Vo* Vo —2(Vo:v)v. 


These asymptotic solutions show that the 
high-frequency waves propagate approxi- 
mately according to the laws of tgeometric 
optics. 

If asymptotic solution (22) has a caustic, (Le, 
{{x+/Ve(x)|leR}|xesupp w} has an envel- 
ope, w of the form (22) cannot be an asympto- 
tic solution near the caustic. The asymptotic 
behavior of high-frequency solutions near the 
caustic was first considered by G. B. Airy 
(Trans. Cambridge Philos. Soc., 1838). Under 
the condition that the principal curvatures of 
the caustic are positive, w in the form (22) can 
be prolonged to a domain containing the 
caustic satisfying the asymptotic solution 


w(t, x; k) — e'490079 t Ai( — K?? p(x))go(t, x; k) 
ik A k?’ 9))gy (t, x; k)], 
(23) 
where Ai is the Airy function 


Ip. 
Ne. | git? mgr 
2n 


—00 


and 
q(t, x; k) SS gi (ts x)k!e7 
J 


(D. Ludwig [31]). 
Concerning the reflection of grazing rays by 
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strictly convex obstacles, the reflected wave 
can be constructed by the superposition of 
asymptotic solutions of the type (23). 

The methods of construction of asymptotic 
solutions of the forms (22) and (23) are also 
applicable to Maxwell equations or more 
general hyperbolic systems (R. K. Luneberg 
[32]; Ludwig and Granoff, J. Math. Anal. 
Appl., 1968; Guillemin and Sternberg, Amer. 
Math. Soc. Math. Surveys, 14 (1977)). 


M. Propagation of Singularities 


Let L be a hyperbolic operator with C^ coeffi- 
cients and consider the Cauchy problem L[u] 
=0 with initial condition (15). When the initial 
data have singularities, the solution also has 
singularities for t>0, which is a property of 
hyperbolic equations quite different from the 
properties of parabolic ones. It should be 
noted that the tpropagation of singularities 
cannot be derived from the Huygens prin- 
ciple in the wider sense, i.e., even for regu- 
larly hyperbolic operators of second order 

we cannot determine the location and the 

type of singularities of the solutions for ini- 
tial data with singularities directly from the 
singularities of the fundamental solution 
R(t, x; t, y). 

Suppose that the multiplicities of character- 
istic roots of L are constant. Assume that u,, k 
—0,1,..., N — 1, have, on either side of a suffi- 
ciently smooth (n — 1)-dimensional manifold T, 
continuous derivatives of sufficiently high 
order to suffer jump discontinuities across I. 
Then the solution u has continuous partial 
derivatives of sufficiently high order every- 
where except on the characteristic surfaces of 
L issuing from I’, and across these the partial 
derivatives of u have jump discontinuities 
(Courant and Hilbert [1]). 

For more general singularities of initial 
data, it is known that the twavefront propa- 
gates along the tbicharacteristic strips satisfy- 
ing p, —A,(t, x, Vp) 20, 1— 1,2, ..., N — s, that is, 
WFu(:,t) is contained in ((x(t), £(t) e T*(R”)| 
(dxj/dt)(s) — (02/06) (s, x, €), (d6;/dt)(s) 
—(QÀ,/0x) (s, x, E), (x(0), E)E Ya WEI)? 

(J. Chazarain [18]). 

The propagation of singularities is more 
complicated in mixed problems because of the 
reflections of singularities at the boundary. 
For the *acoustic problem, R. B. Melrose [33] 
showed the following: Suppose 6 — CO c 
(x|Ix| Ri for some R 70, and all the broken 
rays according to the geometric optics starting 
from Q4 2 Of (|x| « R] go out of Nz in a fixed 
time. Then for initial data with singularities in 
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Qe, the solution becomes smooth in Q, for 
sufficiently large t. 
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A. General Remarks 


Let A[u(x)] 20 be a tquasilinear second-order 
partial differential equation. The type (felliptic, 
thyperbolic, or tparabolic) of the equation 
depends on the location of the point x. If the 
type varies as the point x moves, the equation 
is said to be of mixed type. An example is the 





equation 
2 e? 2 e 2 e 
o Pe 
c* j ôx c^ éxéy c* ] dy 


(1) 


of 2-dimensional stationary flow without 
rotation of a compressible fluid without vis- 
cosity, where ¢ is the velocity potential, u = 
0g/éx and v— 09/0y are the velocity compo- 
nents, and c is the local speed of sound, which 
is a known function of the speed q =: (u? + v?)'? 
of the flow. Equation (1) is of elliptic type if q4 < 
c, i.e., the flow is subsonic, and of hyperbolic 
type if q >c, i.e., the flow is supersonic. If there 
exist points where the flow is subsonic as well 
as points where it is supersonic, (1) is of mixed 
type. The study of equations of mixed type has 
become important with the development of 
high-speed jet planes. 


B. Chaplygin's Differential Equation 


It is difficult to solve equation (1) directly since 
it is nonlinear. However, we can linearize it by 
taking q and 0 — arctan(v/u) as independent 
variables (the hodograph transformation). The 
linearized equation takes the form 

2 2 
sa Eisch xK(x)z 0, (2) 
which is called Chaplygin's differential equa- 
tion. Equation (2) is hyperbolic for x >0 and 
elliptic for x <0. The study of general equa- 
tions of mixed type, even when they are linear, 
is much more difficult and less developed 
than the study of equations of nonmixed type. 
Almost all research so far has been on equa- 
tion (2) or slight modifications of it. 


C. Tricomi's Differential Equation 


The simplest equation of the form (2) is 


?^z ez 
gu ES "n 
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which is called Tricomi's differential equation. 
F. G. Tricomi considered the following bound- 
ary value problem for (3): In Fig. 1, AC and 
BC are two tcharacteristic curves of (3) and o 
is a Jordan curve connecting A and B. We seek 
a solution of (3) in the domain D bounded by 
AC, BC, and o that takes given values on o and 
on one of the two characteristic curves, say on 
AC. This boundary value problem is called the 
Tricomi problem. Tricomi proved the existence 
and uniqueness of the solution of his problem 
under some conditions on the shape of o and 
the smoothness of the boundary values. After 
Tricomi, much research has been done on his 
and similar problems for equations of form (2) 
[2]. We can also consider problems such as 
finding a solution of (3) (or of (2)) satisfying 
the initial conditions 


z(0,y)—zi(y, — (0z/0x)(0, y) - z; (y) 


on the common boundary x —0 of the elliptic 
domain and the hyperbolic domain of the 
equation. This is called the singular initial 
value problem. S. Bergman [3] obtained an 
integral formula for the solution under the 


condition that z,(y) and z;(y) are real analytic. 





Fig. 1 


D. Friedrichs's Theory 


For the study of equations of mixed type it 
would of course be most convenient if there 
existed a general theory of boundary value 
problems independent of the type of the equa- 
tion. However, constructing such a general 
theory is considered very difficult, because the 
twell-posedness of boundary conditions as 
well as the analytic properties of solutions are 
quite different according to the type. The first 
contributor to the solution of this difficult 
problem was K. O. Friedrichs [4], who noticed 
that although the methods of solving the 
tCauchy problem and the fDirichlet problem 
are quite different, both methods utilize energy 
integrals in the proof of the uniqueness of 
solutions. Using this observation, he succeeded 
in constructing a unified theory that enables 
us to treat various types (including the mixed 
type) of linear equations in a single scheme— 
an admissible boundary value problem for a 
symmetric positive system of first-order linear 
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partial differential equations. There is, how- 
ever, a difficulty in Friedrichs’s theory since it 
does not give a unified procedure for reducing 
a given boundary value problem for a given 
equation to an admissible boundary value 
problem for a symmetric positive system of 
partial differential equations. The study of 
equations of mixed type that are of more 
general form than (2) by means of Friedrichs’s 
theory is an open problem. 


E. Further Studies 


Work on equations of more general type than 
(2) or (3) has appeared (not all depending on 
Friedrichs's theory). For example, the fol- 
lowing equations are treated in [5, 6, 7], 
respectively: 





n SCH t ôz 
De "et Ot ^ 
Q 8? 
eti 
Ox? |y| dy? 

ĝ?z P a 
Gy) * itai —K(y)z=h, 


where z=(z,,...,Z,) and G(y) and K(y) are 
symmetric matrices. 
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327 (XIl1.26) 
Partial Differential Equations 
of Parabolic Type 


A. General Remarks 


Consider a second-order linear partial dif- 
ferential equation 


n ^u 
E [usn dox 


EE =| 


xj 


+c(x, t)u tas (1) 
for an unknown function u of (n+ 1) indepen- 
dent variables (x, t) - (xi, ..., X,, t), where aj;— 
aj. This equation is said to be parabolic (or 
of parabolic type) if and only if the quadratic 
form X aj6,6; in č is positive definite at each 
point (x, f of the region under consideration. x 
and t are sometimes called the variables of 
space and of time, respectively. 

The most widely studied of the parabolic 
equations is the equation of heat conduction (or 
the heat equation): 


D biz An DN =0 2 
[u]= e , (2) 


where A= D"_, 67/6x? is the Laplacian taken 
over the space variables. 


B. The Equation of Heat Conduction 


The 1-dimensional case of the heat equation is 


u Qu 


L{uj=— -—= 
LS an e 


(3) 
for the temperature u(x, t) in a rod, considered 
as a function of the distance x measured along 
the rod and the time t. Equation (3) was one of 
the first treated in the theory of partial differ- 
ential equations. Consider a finite rod with 
constant temperature 0 at its ends x =a and x 
— b. Thermodynamics suggests that the initial 
temperature ~(x) (o(a) = (b) — 0) prescribed at 
t=0 is sufficient to determine the distribution 
of heat u(x, t) in the rod at all later times t >0. 
On such physical grounds, we can expect that 
a solution to the following problem exists: 
Find a continuous function u(x, t) that satisfies 
equation (3) for a x € b, t >0, and the bound- 
ary conditions 


u(a, t) = u(b, t) - 0, 


lim u(x,t)2o(x, | a&xxb. (4) 
t 


According to J. Fourier the answer to this 
problem is expressed in a series Au c,u, CON- 
structed by superposition of the particular 
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solutions u, — sin |/ A4, (x — a)exp( — A,t). Here 
the 4, are the roots of sin /4, (b — a) 0, and 
the c, are chosen so that Eg c,u,(x, 0) = o (x). 
In fact, if g(x) is continuously differentiable, 
then X7, c,u,(x, t) is the required solution. 
Thus we are led to the problem of expanding 
a given function (x3 in a tFourier series. 

The temperature distribution in an infinite 
rod is given by a continuous function u(x, t) 
that satisfies equation (3) for t>0 and that, for 
t =0, takes values given by o(x), where 


lim u(x, t) 2 o (x). (5) 
t{o 
If g(x) is bounded, then it can also be repre- 


sented by superposition of particular solutions 
e I'm of (3) as 


1 2 
p (a) e "ISS /4t da, 
24 / nt i 


t>0, 


oi, o (6 





u(x, t) = 


Partial differential equations of parabolic 
type are important because of their connec- 
tion with various phenomena in the physical 
world; they include not only equations that 
govern the flow of heat but also those that 
describe diffusion processes (— 115 Diffusion 
Processes). 


C. Partial Differential Equations of Parabolic 
Type in Two Variables 


We are concerned mainly with the partial 
differential equation of parabolic :ype in two 
variables: 


Zu Qu 2 


a(x, d + 2b(x, NES = +e, des 


0 ô 

+ d(x, t) — +.e(x, + f(x, thu=g, (7) 
Óx ôt 

with ac — b?. In the region where |a| + |c| » 0, 


equation (7) can be reduced to the form 


Go Oo dom  , end 
ap td nz tele Oz t Eog (7) 
by an appropriate change of variables č = 
U(x,t), t= V(x, t). If e' <0 in this region, we 
can assume without loss of generality that our 
equation takes the canonical form 


e 


u Ou 
A zd 5060 


dis De Óx 


ois, — (8) 


with a 0, from the outset. It has the single 
family of *characteristics 


t 2 constant. (9) 


There are four typical problems to be posed 
with regard to equation (8). 
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The first consists of determining, in some 
neighborhood of a given curve C nowhere 
tangent to a characteristic, a solution u that 
possesses prescribed values of u and du/dn, 
or of a linear combination of u and du/én, 
along C. For instance, the problem of finding 
a solution u(x, t) such that u(xo, t) - g(t) and 
u,(Xo, t) — h(t) for given functions g(t) and 
h(t) is a problem of this type. Consider the 
equation 


Ou Ou 
AGI (qo 


in the region a «t « b, xo € x. According to E. 
Holmgren, a solution u(x, t) satisfying the 
conditions 


lim u(x, t) — g(t), lim u,(x, t) — h(t), (11) 
x|xo x}xo 


where g'(t) is bounded and continuous, exists if 
and only if 


wn | 542 


Und e 


is of class C? and satisfies 


IW Min! Vi? (13) 





d: —k(t) (12) 


for positive constants M and r. 
In the second type of problem we are re- 
quired to find in a region of the form 


qi (t &x« p(t), (1 «t «t, (14) 


a solution of (8) that takes prescribed values 
on part of the boundary of that region. Here 
we impose the hypothesis on the curves x — 
p(t) and x= o, (t) that they are nowhere tan- 
gent to a characteristic (9). M. Gevrey [3] 
showed that if such a solution does exist, the 
functions q,(t) and (t) must satisfy the *Hól- 
der condition with exponent «> 1/2: 


Io(t--h) et) &c|hl", c=constant, — (15) 


for sufficiently small h. The problem of heat 
conduction mentioned in Section B corre- 
sponds to the particular case o (t) 2 constant, 
Q»(t) — constant, for which condition (15) is 
automatically fulfilled. 

The third type of problem is to find in a 
region of the form 


axxxb,  t»0, (14) 


a solution of (8) that satisfies the conditions 


lim u(x, t)= p(x); 


ô 
SE E, 


h=constant >0; 
Ox 


for x-a, 
Qu 


+Hu=0 for x=b, 
Ox 


H=constant >0. 


(16) 





37D 
PDEs of Parabolic Type 


This problem, posed for equation (10), is also a 
mathematical formulation of the problem of 
heat conduction in a rod [4]. If @(x) is of class 
CT, then the solution to this problem can be 


expanded as 


u(x, t)= Y Gë ^" o (xX), 


b 
Cor | P(x) @,(x) dx, (17) 


a 


where ¢,(x) is a normalized function (f? o2(x)dx 


= 1) that satisfies the boundary conditions 
on(a)—he,(a)=0, — o, (b)-- He, (b)=0 (18) 


and the equation 





= — An Pn(X). (19) 


The fourth type of problem is to find a 
function u(x, t) that satisfies (8) for t>0 and 
the initial condition lim, 9 u(x, t)= (x). It 
corresponds to the problem of heat conduc- 
tion in an infinite rod. 


D. Green’s Formula 


The tadjoint of the differential operator L[u] 
in (3) is given by 


Qv On 


(20) 
Integration by parts yields the identity 


Il (VL[o]— eM [v ])dx dt 
G 


sl nice: EA (21) 
E c\ ôx Óx 


where G is the region bounded by the closed 
curve C, and the line integral on the right is. 
evaluated in the counterclockwise direction 
over C. We call (21) Green's formula for the 
partial differential equation of parabolic type 
(3). As in the case of partial differential equa- 
tions of elliptic type (— 323 Partial Differen- 
tial Equations of Elliptic Type), this formula 
is used to establish the uniqueness of the solu- 
tion of (3) and to derive an integral repre- 
sentation for it. 

For example, the uniqueness of the solution 
is established in the following way: Let the 
curve AD and BE in Fig. 1 be such that no 
characteristic meets either of them in more 
than one point. If u(x, t) is continuous in the 
closed region (A BED), vanishes on AD, BE, 
and the segment AB, and satisfies equation (3) 
in (ABED) except on AB, then it vanishes 
identically. Green’s formula (21), applied to the 
region G=(ABQP) and the functions y z 1, 
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9 zw, yields 


ô 2 
d ES dxdt | uU dx esl). 


In a similar way, by extending Green's for- 
mula suitably, we are able to prove unique- 
ness theorems for the four problems stated in 
Section C for more general linear parabolic 
equations. 


D E 
P Q 
A B 
Fig. 1 


To obtain a representation for solutions we 
proceed as follows: Let u(x, t) be a solution of 
(3) and Let 


U(x, f x,t)— emt ` 05) 


1 
J/4n( B — t) 
be a particular solution. Applying Green's for- 
mula to the region (PABQM P) and the func- 
tions o =u(x, t), Y = U (Xo, to +h, x, t), where h 
is a positive number and M is a point with 
coordinates (Xo, to), we obtain 


dx 
u(x, ler o9 Ah. 
ji: JJ Anh 


-f u(x, t)U (Xo, to +h, x, t)dx 


Since the integral in the left-hand side of this 
equality approaches u(xo, to) as h|O, we can 
establish the basic representation formula 


EE u(x, t)U (xg, to, x, t) dx 


PABQ 


eu ju (23) 
ôx Ox 


for solutions u of (3). Formula (23) shows that 
u(Xo, to) is determined in terms of the partic- 
ular solution (22) if we know the values of u 
and du/éx on the part PABQ of the boundary 
of the region (ABED). The function (22) is 
called the fundamental solution of (3) because 
it plays the same role as the fundamental 
solution logr (r=((a—x)? +(B—y)?)"”) of 
Laplace’s equation 0?u/Ox? + 0?u/0y? =0. 

Similarly, the following function E (called 
the Gauss kernel) is a fundamental solution of 
equation (2): 








= ad 
E(a, fi x, eni y "epi -EE 


v 
ax2 = Av(x, A) = 
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where (x, x) e R" x R” and t « f. For equation 
(3), the following maximum principle holds: 

In the region (ABED) of Fig. 1, suppose that 
L[u] 20 and that u takes its maximum value 
K at an interior point M. Then u is identically 
equal to K on the segment QP and in the 
region (ABQP). More generally, various ver- 
sions of the maximum principle are known for 
equation (1) [5]. 


E. The Laplace Transform Method 
Let u(x, t) be a solution of (3) for t >0 and 


v(x, a= | e "u(x,t)dt, A>0, (25) 
0 

be its tLaplace transform with respect to t. 

Utilizing integration by parts, we have 


| e "u(x,t)dt-[e “u(x, tiie 


0 
val e “u(x, t)dt 
0 


= — (x)+Av(x, A), (26) 


provided that lim, ,,, e ^'u(x, t)=( and 
lim, u(x, t) = p(x). We find in view of (3): 


(x). (26) 


Once the solution of (26) has been found, the 
desired solution of (3) can be derived by invert- 
ing the Laplace transform (25). This idea can 
also be applied to the solution of parabolic 
equations with constant coefficients in (n+ 1) 
variables, such as (1). 


F. General Second-Order Equations of 
Parabolic Type 


Consider the equation (1) with f -: 0, which can 
be written as 


ĝu 


à = A(t)u, (27) 


where A(t) is a second-order 'elliptic operator 
with parameter t. Let D be a region (bounded 
or unbounded) of points x whose boundary 
is a smooth hypersurface S. We pose the fol- 
lowing initial boundary value problem for 
(27): Find a function u(x, t) that satisfies in 

D x (0, oo) equation (27) together with the 
conditions 


lim u(x,t)-e(x) xeD, 
t 


Qu(x,t)/On-- h(x,t)u(x,t) 2 f(x,t), xeS, (16) 


where G/én is the directional derivative in the 
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outward tconormal direction at (x, t) (x e S), 
and h(x, t) z 0. 

The Laplace transform is not suitable for 
solving problems (27) and (16^). Instead, the 
theory of 1-parameter semigroups of linear 
operators (— 378 Semigroups of Operators 
and Evolution Equations) can be applied to 
establish similar fundamental results. Let m be 
a large positive integer. For t>0, put t, =kô 
for k=0,1,...,m—1 with 6=t/m. By the La- 
place transform method as described above, 

. we can associate with y a unique solution v of 
A(t,)v 2 Av — y with 4—1/ó. We put R,V = Av. 
By iterating this procedure m times, we have 
a function u, (x, t) - R,, ,R,, ;... Roo start- 
ing from the initial value o at t=0. Then we 
obtain a solution u(x, t) of (27) and (16^) as 
the limit of u,,(x, t) as m— oo. The following 
results are known [6, 7]: (i) There exists a u= 
U(é, t, x,t) (x, £eD,t» 120) that, as a func- 
tion of x and t, satisfies equation (27) and the 
homogeneous boundary conditions (16^) with 
o —0, f=0. (ii) The function u(x, t) defined by 


u(x, =| p(g)U(E, 0, x; t)d¢ i 


+ | a | f, (ui 1, X,) 
0 S 


QU(£, x, x, t) ; 
Glen (23) 


is a solution of (27) and satisfies (16^). Thus 
U(é,t, x, t) is a generalization of the function 
(24), called the fundamental solution of the 
linear parabolic equation (27) with boundary 
conditions (16). Besides the properties (i) 
and (ii), the fundamental solution satisfies 
U(E,t, x,t) 20, Jp U(E, t, z, )U(z, w, x, t)dz = 
U(é, t,x, t) (c « w « t), and further fp U(£, <, 
x, t)dx = 1 under some additional assump- 
tions. Therefore this theory is of consider- 
able significance from the point of view of 
the theory of probability (— 115 Diffusion 
Processes). 

It can be shown that a weak solution of the 
parabolic equation (27) is a genuine solution. 
That is, if u(x, t) is locally summable and 


E | u(x, t) (US D + A(0* (x, d dt=0 
o JD ot 





for any function ¢(x, t) of class C? in D x (0, oo) 
and vanishing outside a compact subset of D 
(A(t)* is the adjoint of the partial differential 
operator A(t) and dx — dx, ... dx, ,), then 

u(x, t) satisfies (27) in D x (0, oo) in the usual 
sense. In particular, when the coefficients of 
A(t) are infinitely differentiable, any solu- 

tion u(x, t) of (27) in the distribution sense is a 
genuine solution (— 125 Distributions and 
Hyperfunctions). 
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If the function h in the boundary conditions 
(16’) and the coefficients of A are independent 
of t, then the fundamental solution UL. t, x, t) 
depends only on £, x, and t — x and is written 
as U(£, x; t — 1) (t7 x). Furthermore, if A is 
tself-adjoint, then there exist a sequence of 
'eigenfunctions {Y (x; 4)| p — 1,2, ...) (Ay + 
Aw, — 0) and a sequence ( p,(A)) of measures 
on the real line for which the following hold: 
(1) The fundamental solution U(£, x; t) is ex- 
pressed in the form 


Dietz Y. ih ei THE 


(2) The solution u(x, t) of (27) satisfying (16’) 
with f(x, t)=0 is expanded as 


ux) Y | epl 2o 0) dp, (0), 


where 


9,(4) 7 | W(x; 4)e (x) dx 


(v (x) is the function given in (16°). 


G. Nash's Results. 


Let us consider a parabolic equation 


Ou "^ @ Ou 
Se —— 4 ds ere 28 


where a;,—a5 are real-valued functions of class 
C? and equal to constants outside a fixed 
compact set of R” for all t> t, (this regularity 
assumption can be relaxed). Suppose that 
there exists a constant Az 1 such that A^! |£|? 
« aale, t) 6« IER for all (t, x, £)e(to, 00) x 
R” x R”. Then, for any bounded solution u(x, t) 
of (28) in (tp, oo) x R” and for any (x, y, t, s) such 
that xcR", yeR", and tọ « t& 5, the inequality 


|u(x, t) — u(y, s)| 


sl eo e 
t — fg t—to 

holds, where B=sup{|u(x, t)] | tz t9, x eR") and 
p —a/(2a 4- 2). In this inequality, the constants 

a and A are positive, depending on (n, À) but 
independent of the particular choice of (och, to 
and of u [9]. 

As a corollary to this theorem, J. Nash 
proved that, if the a; do not contain t and if 
v(x) is a bounded solution in R" of the elliptic 
equation obtained by replacing Ou/Ot by 0 in 
(28), the inequality 


|v(x) —v(y)| & A'B'|x — yl (30) 








holds for any (x, y) eR" x R", where o = a/(a 4- 1) 
with « in (29) and B’=sup|v(x)|. The constant 
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A' depends only on (n, A) (— 323 Partial Dif- 
ferential Equations of Elliptic Type L). 


H. Partial Differential Equations of p- 
Parabolic Type 


Let p and m be given positive integers. Let us 
consider an equation for an unknown function 
u of (n+ 1) independent variables (x, t) of the 
type 

m a Aj 
GE (31) 
where o —(a,, ...,0,) and 0*/0x* 2 (0/Ox,y^ ... 
(0/0x,. We write also |x| o, +... +a. In 
(31), X, , is the summation taken over the 
(a,j) such that pj 4-]a| x pm and 0 € j « m. Let 
us denote by (4,(x, t, £) z-, the roots 4 of the 
equation 


ACE a, (x, t) (i£)* 4 =0, (32) 


where 57, ; is the summation over the (o, j)'s 
such that pj + |x| - pm and that 0 j « m. We 
say that the equation (31) is p-parabolic (or of 
p-parabolic type) in the sense of I. G. Petrov- 
skii if and only if there exists a positive num- 
ber 6 such that 


Re A(x, t, č) < —ô|č, 1<k<m, (33) 


for any (x, t) in the region under consideration 
and for any če R”. The integer p is then seen to 
be even. Equation (27) is p-parabolic if — A(t) 
is strongly elliptic of order p. The heat equa- 
tion (2) is 2-parabolic in this sense. Similarly, 
we can define the p-parabolic systems of equa- 
tions [10]. 

p-parabolic equations are known to be 
thypoelliptic if the coefficients are of class 
C” [11]. S. D. Eidel’man obtained precise 
estimates of the fundamental solutions and 
of their derivatives for p-parabolic equa- 
tions [10]. The mixed initial boundary value 
problems are investigated in detail also by 
Eidel'man [10] and by R. Arima (J. Math. 
Kyoto Univ., 4 (1964)). 
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A partition of a positive integer n is an ex- 
pression of n as the sum of positive integers. 
The number of partitions of n, where the order 
of the summands is ignored and repetition is 
permitted, is denoted by p(n) and is called the 
number of partitions of n. For example p(5) 2 7 
since S=44+1=3+2=3414+1=24+241= 
2+14+141=14+1+1+4+141. Therefore, p(n) 
equals the number of fconjugate classes of the 
tsymmetric group of order n and is closely 
related to the trepresentation theory of this 
group. 

The generating function of p(n) is 
—-1 


Dal" ae: (a-x9) 


The unit circle |x|= 1 is the tnatural boundary 
of f(x), which is holomorphic in |x| « 1. The 
Dedekind eta function, which is closely related 
to f(x), is defined by the following formula for 
the complex variable t taking values in the 
upper half-plane: 


n(1) - exp(zic/12) Il (1 —exp(2zint)). 
n=] 
Hence n(: + 1)=exp(ni/12)n(z). L. Euler (1748) 


obtained the following formula (called the 
pentagonal number theorem because n(3n — 1)/2 
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is a *pentagonal number): 


Una 


n=1 


8 


zs 1 E y (—1y(x"97702 + x"Gn*1y2) 
n=1 


This follows easily from the Jacobi-Biehler 
equality 


Din emer 727) 


—-1-Y4"G"-z7") (Iql<1, z#0). 

n=1 
By using the ttransformation formula for A. 
functions, we can infer from the pentagonal 


number theorem that n(—1/t)=./7/i y(t). 
Hence, if a, b, c, d are integers satisfying ad — 
bc 1 and c» 0, then 


at+b " ct d (0). 
n ct+d CS ZS 


where e is a 24th root of unity. It is known 
that (x) is a tcusp form of weight — 1/2 with 
respect to the full tmodular group I'(1) [9, 11]. 

C. L. Siegel (1955) gave a simple proof of the 
formula n(—1/t)= Bei n(t). Later S. Iseki 
(1957) gave another proof by using a new 
method, known as the x— f formula [12]. 

The size of p(n) increases rapidly with n; for 

instance p(10) — 42 and p(100) = 190,569,292. 
By making use of a remarkable identity, G. H. 
Hardy and S. Ramanujan (1918) proved the 
following inequalities, where A and B are 
suitable constants: 


(A/n)e? ^ < p(n) « (B/n)e?? v^. 








Subsequently they obtained 


n) ~ (1/4 /3 n)exp(n,/2n/3). 


After, P. Erdós (1942) and D. J. Newman 
(1951), A. G. Postnikov [8] succeeded in 
proving 


jus exp(z, / 21/3 meaa, Ka 
4 /3n n" 


by means of an elementary function-theoretic 
method. Multiplying both sides of Euler's 
formula by the generating function of p(n) and 
comparing the coefficients, we obtain 

Y (-D*'pn—o)-0, 
Oso, <n 
where oj, —k(3k — 1)/2 (k 20, +1, +2,...)isa 
pentagonal number. From this formula we 
can calculate p(n) successively; in fact P. A. 
MacMahon obtained in this way the values of 
p(n) for n up to 200. 

Hardy and Ramanujan proved the following 
transformation formula for the generating 
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function f(x) of p(n): 


(OC) 


e T AZ 
S Wa zen ( -m) 





(hk)21, hh'z —1(modk) 
where W, , is defined by 
W, , = exp(nis(h, k)) 


and the value of s(h, k) was given by Radema- 
cher in the form 


eX) 


Here the symbol ((t)) in the sum denotes the 
function that is 0 for integral t and t — [t] — 1/2 
otherwise ([ ]is the Gauss symbol). With 
regard to the Dedekind sum s(h, k), we have the 
reciprocity law for Dedekind sums: 


TA E 
4 12\k h hk 


If we make substitutions a— h', b —(hh' + 1)/k, 
c=k, and d= —h in the Hardy-Ramanujan 
transformation formula, then the e appearing 
in the transformation formula of y(t) is seen to 
be equal to exp( — zis(c, d) - ni(a 4- d)/12c). A 
direct proof of the transformation formula and 
the reciprocity law was given by K. Iseki 
(1952). 

According to Cauchy's integral formula, p(n) 
can be represented as an integral: 


s(h, k)+ s(k, h) - 





2ni JX 


where the contour F is taken inside the unit 
circle around the origin. The generating func- 
tion f(x) varies greatly: namely, letting r51—0 
in x 2rexp(2zip/q), where p and q are fixed 
integers, it follows from the transformation 
formula that f(x) ~~ exp(x?/69? (1 — r)). Never- 
theless, we can deal with the integral by the 
circle method, introduced by Hardy and 
Ramanujan, which threw light on recent addi- 
tive number theory. Hardy and Ramanujan 
thus obtained 





(CA, 
Meca o *) otf 
where 
À= /n— 1/24, C=2,/2/3, D>C/2. 


The theory was improved by Rademacher 
(1937, 1943), who expanded p(n) into the 
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series 
1 2 d m 
zo V A Wk —| ————— |, 
p(n) n /2 A Al ) 4 Àn 
where 
A,(n)= X W, , exp( — 2nihn/k). 


h(modk), (h, k) -1 


Rademacher (1954) proved further that 


g \3?2 
p(n)=2n (5) 


where 


oO n 


Liz) 2. ire ane: 

Rademacher (1943) had developed an inge- 
nious proof by taking “Ford’s circle” as the 
contour T. 

Ramanujan observed that p(Sm+4)=0 
(mod 5), p(7m + 5)=0 (mod 7), and p(lim+ 
6) 20 (mod 11). Rademacher (1942) and New- 
man studied these cases by using y(t). More 
generally, A. O. L. Atkin proved that if 24n = 1 
(mod 577° 11°) then p(n) 0 (mod 5^7! *"11 1*) 
(Glasgow Math. J., 8 (1967)). At present, this is 
the best result. 

Let n=n,+n,+...+n, bea partition of n. 
Many problems arise when we put additional 
conditions on the n;. For instance, we may 
require that the n; satisfy certain congruence 
relations (L. K. Hua (1942), S. Iseki (1959)) or 
are powers of integers (E. M. J. Wright (1934), 
L. Schoenfeld (1944), S. Iseki (1959)) or are 
powers of primes (T. Mitsui (1957)). The par- 
tition problem can also be extended to the 
case of an algebraic number field of finite 
degree (Rademacher, G. Meinardus (1953), 
Mitsui (1978)). 
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329 (XX1.38) 
Pascal, Blaise 


Blaise Pascal (June 19, 1623—August 19, 1662) 
was born in Clermont-Ferrand in southern 
France. He lost his mother when still an infant 
and was brought up by his father, Etienne 
Pascal (discoverer of the curve called Pascal's 
tlimacon). As a youth, he demonstrated a 
remarkable ability for mathematics. In 1640, 
under the influence of Desargues, he dis- 
covered *Pascal’s theorem on conic sections, 
and in 1642 invented an adding machine. After 
hearing of Toricelli’s experiments in 1646, he 
became interested in the theory of fluids and 
on his own began to conduct experiments; this 
research put to rest the prevailing opinion that 
nature abhors a vacuum and that, therefore, a 
vacuum cannot exist. Pascal formulated the 
principle stating that pressure, when applied at 
any point within a contained liquid, is trans- 
mitted throughout the fluid. By means of this 
principle, he explained various phenomena 
concerning fluids such as the atmosphere and 
laid the foundations for hydrostatics. 

Between 1652 and 1654, Pascal was pre- 
occupied with social affairs, but subsequently 
he began to devote himself to religion. He 
entered the Abbey Port-Royal of the Jansenist 
sect, where he remained until his death. Im- 
mediately before his entry, however, he and 
Fermat exchanged correspondence about 
games of chance, and these letters proved to 
be the beginning of the theory of tprobability. 
Concerning games of chance, Pascal had con- 
ducted research on fPascal’s triangle, and in 
this study he formulated and used tmathemat- 
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ical induction. He also indicated a way to 
obtain the sum of the mth powers of the con- 
secutive terms of an arithmetic progression, 
and with an intuitive idea of limits obtained 
the formula (8 x" dx —a"*! /(m 4- 1). While in 
Port-Royal, he published Lettres provinciales 
(1657), in which he carried on a dispute with 
the Jesuits. His book Pensées shows his deep 
involvement with religion; however, he did not 
abandon mathematics. In 1658, he determined 
the area enclosed by a tcycloid and its base, 
the barycenter and area of the figure enclosed 
by a cycloid and straight lines parallel to its 
base, and the volume of the figure obtained by 
rotating it around these lines. The study of the 
methods used by Pascal to obtain these re- 
sults, which were forerunners of differential 
and integral calculus, led tLeibniz to discover 
the fundamental theorem of calculus. Pascal 
also formulated clear ideas about axioms. 
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330 (11.8) 
Permutations and 
Combinations 


Let there be given a set Q of n elements. If 

we choose k distinct elements of Q and ar- 
range them in a row, we have a k-array or k- 
permutation of elements of Q. The number of 
such arrays is (n), — n(n — 1) ...(n—k - 1). 

The polynomial (x), 2 x(x — 1)... (x — k - 1) 

in x of degree k is called the Jordan factorial of 
degree k. In particular, (n), =n!, n factorial, is 
the number of permutations of Q. A subset of 
Q is called a k-subset if it contains exactly k 
elements. The number of k-subsets (or k- 


combinations) of Q is (; )- (n),/k!. The 
binomial coefficients (*) are defined by the 


: x 
generating function (1+ z) 2 275.6 ( ) z”. For 
n 


any complex number x, the series is conver- 
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: x 
gent for |z| « 1, and it is verified that ( )- 
n 


(x),/n! in terms of the Jordan factorial (x),. 
The same results hold in a tcomplete field with 
tvaluation, in particular in a tp-adic num- 

ber field. In any case, we have the recursive 
relation 


ee ees) (i21) 


and in general 


ECC?) 


which leads to many identities involving 
binomial coefficients. The recursive relation 


allows us to compute the values of o easily 


for small integers n, k, as was noticed by 
Pascal. The arrangement of these values in a 
triangular form: 


1 3 3 1 
1464 1 


is called Pascal’s triangle. For integral values 
x, (1 +2) are polynomials, and we have (a+ 


by XL LU a" *b* (binomial theorem). As 
a generalization, we have 


n! 


(at... ay 2 Y ;ab .. agn 


pi! Del 
(multinomial theorem), where the sum is ex- 
tended over all nonnegative p; with È p,— n. 

The number of ways of choosing k elements, 
allowing repetition, from a set of n elements is 

—n n+k— 
GOU J- 
k k 

number of nonnegative integral solutions of 


34x; = k. As an example of binomial coeffi- 
cients with noninteger arguments, we have 


C n 2 eg 
H H 
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331 (XII.13) 
Perturbation of Linear 
Operators 


A. General Remarks 


Historically, the perturbation method was 
developed as an approximation device in 
classical and quantum mechanics. In the per- 
turbation theory of eigenvalues and eigenfunc- 
tions, created by L. Rayleigh and E. Schródin- 
ger, the main concern was to find solutions as 
power series in a parameter x that could be 
regarded as small. In the perturbation theory 
for linear operators, however, we are con- 
cerned more generally with the behavior of 
spectral properties of linear operators when 
the operators undergo small change. The 
foundation of the mathematical theory, includ- 
ing a complete convergence proof of perturba- 
tion series, was laid down by F. Rellich [1] 
and T. Kato [2, 3]. Another major topic in 

the perturbation theory for linear operators 

is the perturbation of continuous spectra, 
which was initiated by K. O. Friedrichs (Math. 
Ann., 115 (1938); [4]). It is closely related to 
scattering theory and is discussed more fully 
in 375 Scattering Theory. A standard reference 
in this field is [5] (also — [6,7]). Most of 

the material presented in this article is taken 
from [5]. 

For problems in Hilbert spaces there are 
two general frameworks in which to formulate 
perturbation situations: the operator formula- 
tion and the form formulation. In the former 
we deal with a family of operators T(x) direct- 
ly, while in the latter, we deal with associated 
semibounded Hermitian (or, more generally, 
sectorial) forms t(x). The latter is applicable 
only when there is semiboundedness (or a 
sectorial property) inherent in the problem, 
but is usually more general than the former 
in such problems, since the latter (resp. the 
former) requires roughly the constancy of 
the domain of the "square root" of T(x) (resp. 
the domain of T(x)). In this article we discuss 
problems in the operator formulation. For 
the form formulation — [5] and [7]. 

In this article X, Y, ..., are complex Banach 
spaces and T, A, ... are linear operators unless 
other specifications are made. The following 
notations defined in 251 Linear Operators are 
used without further explanation: D(T), R(T), 
B(X, Y), B(X), I (T) (the graph of T), c(T) (the 
spectrum of T), p(T) (the resolvent set of T), 
and R(¢; T) (the resolvent of T). We also use 
C(X, Y) (resp. A(X, Y)) to denote the set of all 
*closed linear operators (resp. all linear 
operators) from X to Y and C(X)- C(X, X). 


1234 


B. Stability of Basic Properties 


(1) Let Te C(X, Y). Important notions for 
characterizing the smallness of Ae A(X, Z) 
relative to T are the following. (i) A is said to 
be relatively bounded with respect to T (or 
simply T-bounded) if D(A) 5 D(T) and there 
exist a, bz O0 such that 


(x)  lAulzxaluly-- bl Tully for all ue D(T). 


The infimum, denoted by ||A||,, of b for which 
(x) holds with some a is called the T-bound of 
A. (ii) A is said to be relatively compact with 
respect to T (or T-compact) if D(A)> D(T) 
and A is compact from D(T) with the graph 
norm of T to Z (— 68 Compact and Nuclear 
operators F). T-compactness of A implies T- 
boundedness (and in Hilbert spaces || A|| — 0). 
(2) Let Te C(X, Y), and let Ae A(X, Y) be T- 
bounded. (i) If | Ally « 1 (or if A is 7-compact), 
then T+ Ae C(X, Y). (ii) If, in addition, X = Y 
is a Hilbert space, T is tself-adjoint, and A is 
'symmetric, then T+ A is self-adjoint (Rellich- 
Kato theorem) [1,8]. Gii) Suppose that T is a 
*Fredholm operator. If either A is T-compact 
or the inequality («) holds with constants a, b 
satisfying bp +a « p for a certain positive 
number p determined by T, then T 4- A is 
a Fredholm operator and ind(T + 4) —- ind T 
(for ind T, nul T, and def T — 251 Linear 
Operators). In the latter case where bp +a « p, 
we also have nul(7+ A) x nul T and def(T + 
A) « def T. 


C. Continuity and Analyticity of Families of 
Closed Operators 


In order to handle unbounded operators, 
which are important in applications, it 1s neces- 
sary to introduce generalized notions of con- 
vergence and analyticity of families of closed 
operators. 

(1) C(X, Y) becomes a *metric space by a 
distance function d(S, T) having the property 
that ó(F(S), r(T)«d4(s, T) «2é(T (S), T(T)), 
where for closed subspaces M and N we put 


ó(M, N) 2 max[ó(M, N), ô(N, M)], 


ó(M,N)— sup dist(u, N); ô(0, N)=0. 


ueM, lul 71 l 
ô(M, N) is called the gap between M and N [5]. 
When T,—>T in this metric, T, is said to con- 
verge to T in the generalized sense. This gen- 
eralized convergence coincides with the norm 
convergence if T,, Te B(X, Y). If X = Y and 
p(T) z Ø, then T, T in the generalized sense 
if and only if for some (or equivalently all) 
Cep(T) we have Cep(T,;) for sufficiently large 

n and IRC T;) — R(; T)]| 20, n5 co. This is 
called norm resolvent convergence. 
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(2) When X = Y, there is also the notion of 
strong convergence in the generalized sense 
[5], which is roughly the strong convergence 
of resolvents. In particular, when T, and T are 
self-adjoint operators in a Hilbert space, 7, —^ T 
strongly in the generalized sense if R(C; T,)> 
R(C; T) strongly for some (or equivalently all) 
C with Im¢ #0. This is called strong resolvent 
convergence. 

(3) Let Dc € be a domain. The notion of 
analyticity (holomorphic property) of a family 
T(k)e B(X, Y), xe D, of bounded operators 
is well known (— 37 Banach Spaces K). 

This notion is generalized to a family T(x)e 
C(X, Y), keD, of closed operators [1, 5]. 
Namely, T(x) is said to be holomorphic in D if 
at each xc D there exist a Banach space Z 
and U(x)e B(Z, X), V(x)e B(Z, Y), defined near 
Ko, such that (i) U(x) and V(x) are holomor- 
phic at x, as families of bounded operators; 
(ii) U(x) is one to one and onto D(T(k)); (iii) 
T(x)U (x) = V(x). Let us mention several spe- 
cial cases. (I) if X = Y and if (ep(T(x)) for all 
KED, then T(x) is holomorphic in D if and 
only if R(¢; T(k)) is holomorphic in D. (II) If 
D(T(x)) is independent of x and if T(x)u is 
holomorphic in D for every ue D(T(x)) then 
T(x) is holomorphic in D (holomorphic family 
of type (A) [5]). (III) Let Te C(X, Y), and let 
T? e A(X, Y) such that D(T™)> D(T) and 

| Tul c"! (a|ul + bl| Tull), ue D(T), where a, 
b, cz 0. Then T(k)u= Tu - kT!u 4 ... -- k^ T? 
ct ..., uE D(T) defines a holomorphic family of 
type (A) in D= (x||x| «(b4- c) !). (IV) If X=Y 
is a Hilbert space and if T(x) is self-adjoint for 
real x, T(x) is said to be a self-adjoint family. 
In particular, the family discussed in (III) is a 
self-adjoint holomorphic family if T is self- 
adjoint and T is symmetric. 


D. Perturbation of Isolated Eigenvalues 


(1) Separation of the spectrum. Let TeC(X). 
Suppose that a bounded subset A of o(T) is 
separated from the rest of o(T) by a simple 
closed contour I Ge I c p(T) and A(a(T)\ A) 
lies inside (outside) of I). Then the operator 


sl R(E; T)d¢, 
2ni Jr 


which is independent of T, is a projection (i.e., 
PeB(X) and P? = P). The closed subspaces 

X, 2 PX and X, z (I — P)X treduce T and give 
rise to the decomposition T= T|y,  T|y, = 
T, & T,. In particular, o(T,) 2 o(T)f) (inside 
of T} and o(T,)=0(T)N {outside of T}. 

(2) Let T(x) be holomorphic in D. We as- 
sume that 0€ D and regard T = T(0) as the 
unperturbed operator. Suppose that A and I 
are as in (1) with T replaced by T®. Then 
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there exists ó >0 such that |x|<6 implies I € p 
(T(Kk)). This follows from the upper continu- 
ity of compact components of the spectrum 
with respect to the metric d of C(X) [5]. Thus 
the separation of the spectrum discussed in (1) 
is applicable to T(x). In particular, corre- 
sponding to the projection 


1 
Pozz] R(GT(k))al, |x| <6, 


T(x) is decomposed as T(k) = T, (x) ® T; (x); 
and the problem of determining the spectrum 
of T(x) inside T is reduced to the problem of 
determining the spectrum of T, (x) (|x| « à). 
Suppose now that A= {29} is an isolated 
eigenvalue of T® and that mz dim P(0) X < oc. 
Then dim P(x)X =m, |k| « ó. Moreover, a base 
(19. .... @m(K)} of P(k)X can be constructed 
in such a way that the ọ;(x) are holomorphic 
in (|| «ó' <ô} [3, 5]. Thus the problem for 

T, (x) in this case is just the finite-dimensional 
eigenvalue problem det (4ó;, —(T(x)¢,(x), 
ix)? — 0. The totality (A;(x)) of solutions 

of this equation, i.e., the totality of eigen- 
values of T(x) near Jp, is expressed by one or 
several power series of x!/? with a suitable 
integer pO. If T(x) is a self-adjoint family, 

we can take p — 1 so that the eigenvalues are 
holomorphic near Ao. If H(x) - H? - k H€? 
+... is a self-adjoint holomorphic family 
described in example (IV) in (3) of Section C 
and if m— 1, the power series A(x) = 224? can 
be explicitly computed as 4, =(H" uo, uo), 22 
=(H ug, ug) + (SH uy, Huy), ..., where 
Hug = ouo with |lug| =1 and where S 
=lim,,,, RU H) — P(0)) is the reduced 
resolvent. This series is known as the Rayleigh- 
Schródinger series. The power series for the 
associated eigenvectors u(k) — 2, xu, can also 
be computed. For details, including the case of 
a degenerate 2, (m 1), in which the situation 
becomes more complicated due to the splitting 
of eigenvalues, — [5]. The perturbation theory 
discussed in this subsection is called analytic 
(or regular) perturbation theory. 

(3) Even when a problem cannot be handled 
by means of analytic perturbation, it may 
happen that the coefficients 4; and u; of formal 
power series can be computed up to a cer- 
tain j. In many such cases it can be shown 
under general assumptions that an asymptotic 
expansion such as A(Kk) — Ao 4- 4, K 4- o(Kk) is 
valid as long as the coefficients involved can be 
computed legitimately [2, 5]. Estimates for 
o(«) can also be given. This provides a rigor- 
ous foundation for the perturbation method in 
many important practical problems. The case 
of degenerate 4, can be treated similarly. The 
strong convergence in the generalized sense 
mentioned in (2) of Section C is used here. This 
theory is called asymptotic perturbation theory. 
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E. Perturbation of Continuous Spectra 


For continuous spectra, studying the mode 

of change under perturbation is not usually a 
tractable problem. Rather, certain parts of 
continuous spectra tend to be stable under 
perturbation; and the study of this stability has 
been a major topic in perturbation theory 
(also — 375 Scattering Theory). In this section 
we discuss only self-adjoint operators and let 

. H - (AdE(A), Ho, ..., be self-adjoint opera- 
tors in a Hilbert space X. For B, and || jj, to 
be used below — 68 Compact and Nuclear 
Operators. 

(1) The essential spectrum (— 390 Spectral 
Analysis of Operators E) is stable under com- 
pact perturbation. Namely, if H = Hj + K with 
compact K, then o,(H)=0,(Ho) (H. Weyl, 
Rend. Circ. Mat. Palermo, 27 (1909)). More 
generally, it suffices to assume that R(¢; H) 

— R(G; Ho) is compact for some (or equivalently 
all) Ce p(H)/N p(y). Conversely, if X is sepa- 
rable and if c,(H) — o,(Hy), then there exist a 
unitary operator U and a compact operator K 
such that H - UHQU ^! -- (J. von Neumann, 
Actualités Sci. Ind., 229 (1935)). Moreover, any 
self-adjoint operator H in a separable Hilbert 
space can be changed into H+ K with a pure 
point spectrum by adding a K eB, with ||K||,, 
<e for any p» 1 and £>0 (S. T. Kuroda, Proc. 
Japan Acad., 34 (1958)). I. D. Berg (1971), W. 
Sikonia (1971), J. Voigt (1977), and D. Voicu- 
lescu (1979) have extended these results to 
normal operators and m-tuples of commuta- 
tive self-adjoint operators. Also — 390 Spec- 
tral Analysis of Operators I, J. 

(2) The absolutely continuous spectrum (— 
390 Spectral Analysis of Operators E) is stable 
under perturbation by the *trace class. Name- 
ly, if H=H)+K, with K eB, then the abso- 
lutely continuous parts of H. and H are tuni- 
tarily equivalent, and in particular o,.(H)= 
o, (H3) (M. Rosenblum, Pacific J. Math., 7 
(1957); T. Kato, Proc. Japan Acad., 33 (1957)). 
Among generalizations we mention the follow- 
ing two. (i) If R(¢; H) — R(G; Họ)e B, for some 
£€p(H)N p(Ho), then the absolutely continuous 
parts of (Hy) and q(H) are unitarily equiva- 
lent for any smooth strictly increasing real 
function o (M. Sh. Birman, Izv. Akad. Nauk 
SSSR, ser. mat., 27, (1963); T. Kato, Pacific J. 
Math., 15 (1965)). (ii) If Hp and H act in different 
Hilbert spaces X; and X, respectively, and if 
there exists J e B(X}, X) such that JD(H;)c 
D(H) and such that the closure of HJ — JH, 
belongs to B,(X,, X), then the same conclusion 
as in (i) holds (D. Pearson, J. Functional Anal., 
28 (1978)). (i) can be derived from (ii). Perturba- 
tion theory for absolutely continuous spectra 
is closely related to the study of generalized 
wave operators in scattering theory. In fact, 
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the existence and the completeness of the latter 
implies the unitary equivalence of absolutely 
continuous parts. All the results mentioned 
above are proved by scattering-theoretic 
methods, either by the wave operator ap- 
proach or by the abstract stationary approach 
(— 375 Scattering Theory, esp. B, C). 


F. Some Other Topics 


(1) For the perturbation theory for semigroups 
of operators and evolution equations, not 
discussed in this article, — [5, 7,9]. 

(2) The detailed structure of continuous 
spectra is hard to analyze. An eigenvalue 4, of 
Ho which is embedded in the continuous spec- 
trum may diffuse into the continuous spectrum 
in the presence of a perturbation. Ia such a 
case, H (Kk), k £0, has no eigenvalues near A, 
but may have a continuous spectrum highly 
concentrated around Ae, This phenomenon 
of spectral concentration is studied, especially 
for some concrete problems, in relation to 
resonance poles (or poles of the holomorphic 
continuation of the resolvent or the scattering 
matrix). In some problems, it is proved that 
the first few terms of the perturbation series for 
A(x) that are still computable are related to the 
real part of the resonance. Some problems of 
resonance can be treated by the technique of 
dilation analyticity, a technique which is also 
effective in other problems of spectral analysis 
(J. Aguilar and J. M. Combes, Comm. Math. 
Phys., 22 (1971)). 

(3) A vast quantity of results in tne spectral 
theory of the Schrodinger operators appear- 
ing in the fSchrédinger equation in quantum 
mechanics can be obtained by perturbation 
methods. 

For the topics mentioned in (2) and (3) — 


[7]. 
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332 (VI.7) 
Pi (x) 


The ratio of the circumference of a circle to its 
diameter in a Euclidean plane is denoted by 7, 
the initial letter of zepiuecpoc (perimeter). 
Thus z can be defined as 


1 
d dx/ , /1—x?. 
o ; 


The symbol x has been used since W. Jones 
(1675-1749) and L. Euler. The fact that this 
ratio is a constant is stated in Euclid's Ele- 
ments; however, Euclid gave no statement 
about the numerical value of z. As an approxi- 
mate value of z, 3 has been used from antiq- 
uity. According to the Rhind Papyrus, (4/3)* 
was used in ancient Egypt. Let L,(I,) be the 
perimeter of a regular n-gon circumscribed 
about (inscribed in) a circle of radius 1. Then 
the relations 


2 .1 1 
Ewech, =—+— 


Lo qu 
Ln = v LL; 


hold. Archimedes obtained 319 <n <34 by 
calculating Log and lge. In 3rd-century China 
Liu Hui used x «3.14. In Sth-century China, 
Tsu Chung-Chih mentioned 22/7 as an inaccu- 
rate approximate value and 355/113 as an 
accurate approximate value of z. These values 
were obtained by methods similar to those of 
Archimedes. In 5th-century India, Aryabhatta 
obtained z= 3.1416, and in 16th-century 
Europe, Adriaen van Roomen obtained 
m= 355/113. 

F. Viète represented 2/z in the following 
infinite product: 
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Using this formula, L. van Ceulen (1540-1610) 
calculated z to 35 decimals. In the 17th and 
18th centuries, the Japanese mathematicians T. 
Seki, K. Takebe, and Y. Matunaga computed 
x to 50 decimals. Since the 17th century, many 
formulas that represent x as a sum of infinite 
series or as a limit have been used to obtain 
more accurate approximate values. The fol- 
lowing are representations of z known in those 
days: 


n 2:2:4:4-6:6... 
2 1:3:3:5:5.7... 





(J. Wallis) 


E DE Ee us 
4 1+2+2+2+.. 
the notation — 83 Continued Fractions) 


(W. Brouncker; for 


=1—1/⁄3+1/5—1/7+... 
(J. Gregory, G. W. F. Leibniz) 


=4 Arctan 1/5— Arctan 1/239 (J. Machin). 


A formula combining Machin’s representation 
of z and the power series Arctan x —x— 
(1/3)? 4- (1/5)x5 — ... is called Machin's for- 
mula and was often used for calculating an 
approximate value of z. By utilizing this for- 
mula, in 1873 W. Shanks obtained an approxi- 
mate value of z up to 707 decimals. No im- 
provement of his approximation was obtained 
until 1946 when D. F. Ferguson calculated 710 
digits of z and found that Shanks's value was 
correct only up to the 527th digit. The com- 
putatton of an accurate approximate value of 
x has been made easier by the recent develop- 
ment of computing machines, and an approxi- 
mate value up to 1,000,000 decimals has been 
obtained. P. Beckmann [2] gives a detailed and 
humorous historical account of the calculation 
of x from ancient times up to the present com- 
puter age. Various numerical results obtained 
by electronic computers are not formally pub- 
lished, some being deposited in the UMT 
repository of the editorial office of the journal 
Mathematics of computation. Choong et al. [3], 
using information in [1], obtained the first 
21,230 partial denominators of the regular 
continued fraction representation of x and 
described how their numerical evidence tallies 
with theoretical results, obtained by the met- 
rical theory of continued fractions, which is 
valid for almost all irrational numbers (e.g. 

^ [4)). 

In 1761, J. H. Lambert used Brouncker's 
expression of z in a continued fraction to 
prove that z is irrational. In 1882, C. L. F. 
Lindemann proved that z is a *transcendental 
number using Euler's formula el = —1. The 
approximate value of z up to 50 decimals is 
3.141592653589793238462643383279502884197 
16939937510... (— Appendix B, Table 6). 
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Plane Domains 


A. Domains in the Complex Plane 


A *domain (e, a *connected open set) in the 
*complex plane or on the fcomplex sphere is 
called a plane domain. The tclosure of such a 
domain is called a closed plane domain. In this 
article, we consider only subsets of the com- 
plex plane (or sphere), and a plain domain is 
called simply a domain. The tinterior of a 
‘Jordan curve J in the complex plane is a 
domain called a Jordan domain. In a domain 
D, a Jordan arc whose two endpoints lie on 
the boundary of D ts called a cross cut of D. 

For a domain D, each of the following three 
conditions is equivalent to the condition that 
D is tsimply connected: (1) For every cross cut 
Q of D, D—Q has exactly two *connected 
components. (2) Every Jordan curve in D is 
*homotopic to one point, that is, it can always 
be continuously deformed to a point. (3) The 
*monodromy theorem holds in D. 

If D is a domain on a complex sphere, each 
of the following three conditions is equivalent 
to the condition that D is simply connected: (4) 
The boundary of D consists of a single tcon- 
tinuum or a single point. (5) For every Jordan 
curve C in D, either the interior or the exterior 
of C is contained in D. (6) The complement 
of D with respect to the complex sphere is 
a connected (not necessarily arcwise con- 
nected) closed set. Jordan domains are simply 
connected. 

Let n 22 be an integer and D a plane 
domain. The thomology group H,(D, Z) is 
identical to Z" ! if and only if the complement 
of D in the complex sphere has n connected 
components. Then D is said to be n-ply con- 
nected or multiply connected without speci- 
fying n. If D is an n-ply connected domain, 
there exist n — 1 suitable mutually disjoint 
cross cuts Q,,..., Q,., such that D— 
(Q,U...UQ, ) is simply connected. 

Some typical examples of domains are as 
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follows: (1) Circular domain: |z — c| « r. (2) 
Half-plane: Rez 7 0, or Imz » 0. (3) Angu- 

lar domain: « < arg(z — c) « f. (4) Annular 
domain: r « |z — c| « R. (5) Slit domain: a 
domain obtained by excluding a Jordan arc 
I from a domain D, where all points on T 
(except an endpoint lying on the boundary of 
D) are contained in D. In this case, the Jordan 
arc I is called the slit of the domain. 


B. Boundary Elements 


A boundary point P of a domain D is called 
accessible if there exists a sequence of points 
P, tending to P such that the line segments 

P, P,,... lie completely in D. For example, for 
the domain obtained by removing x = 1/(n+ 
1, 0<y<1/2 (n=1,2,...) from the square 
0<x<1,0<y<1, the boundary points with 
x-0,0xy« 1/2 are all inaccessible (Fig. 1). 





Fig. 1 


Let the domain D be bounded by a smooth 
Jordan curve, and let P be a boundary point of 
D. Take an angular domain D' with vertex at 
the point P and the initial parts of the two 
sides of D' lying in D. A curve in D converging 
to the point P from the interior of the angular 
domain D' is called a Stolz's path or a nontan- 
gential path ending at the point P. 

Let D be a simply connected domain. A 
sequence {q,} of cross cuts mutually disjoint 
except for their endpoints is called a funda- 
mental sequence of cross cuts if it satisfies the 
following two conditions (Fig. 2): (1) Every q, 
separates q,., and q,,, on D. (2) For v» oo, 
the sequence q, tends to a point on the bound- 
ary. Let (q,] be a fundamental sequence of 
cross cuts, and denote by D, the subdomain of 
D separated by q, that contains q,,,. The inter- 
section (\ D, consists only of the boundary 
points of D. Two fundamental sequences {q,}, 
Lol of cross cuts are equivalent if every D, 
contains all q; except for a finite number of v, 
and every D, contains all q, except for a finite 
number of v. Here D,, D, are the subdomains 
constructed from q, and qj, as above. This 
condition determines an equivalence relation, 
under which the equivalence class of funda- 
mental sequences of cross cuts is called a 
boundary element. This notion is due to C. 
Carathéodory [2]. The boundary element of a 
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multiply connected domain is defined similarly 
for each isolated component of the boundary. 
For example, each point of a slit domain, 
except for the endpoint of T lying on the 
boundary of the domain, determines two 
distinct boundary elements on each side. A 
closed domain is usually considered to be the 
union of a domain and the set of all its bound- 
ary elements. Various notions of deal bound- 
ary come from considering suitable boundary 
elements for various purposes (— 207 Ideal 
Boundaries). 


C. Domain Kernels 


Let {G,} be a sequence of domains containing 
the origin 0. If a suitable neighborhood of the 
origin is contained in G, for all v, there exists a 
domain G such that every closed domain 
containing the origin and contained in G is 
contained in G, except for a finite number of v. 
The union K of such domains G is called the 
domain kernel of the sequence {G,} (Carathéo- 
dory). If there is no neighborhood of the origin 
contained in G, for all v, we put K = {0}. 

If every infinite subsequence of {G,} has the 
same domain kernel K, then we say that the 
sequence {G,} converges to K. The notion of 
domain kernel is important in considering the 
limits of a sequence of conformal mappings 
(— 77 Conformal Mappings). 
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Plateau's Problem 


A. Origin 


Because of surface tension, a soap membrane 
bounded by a given closed space curve takes 
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the shape of a minimal surface, i.e., a surface of 
the least area. This experiment was performed 
by the Belgian scientist J. A. Plateau (1873) 

to realize minimal surfaces; hence Plateau’s 
problem is that of determining the minimal 
surfaces bounded by given closed space curves. 
It is a problem of the tcalculus of variations. 


B. Formulation 


Let T be a tsimple closed curve in xyz-space 
such that its projection C on the xy-plane is 
also a simple closed curve. Let D be the finite 
domain bounded by C. We consider surfaces 
z=2(x, y) having common boundary I. Then 
under suitable assumptions on the smooth- 
ness of z(x, y), the problem is to minimize the 
functional 


ml V/A p? 4 q? dx dy; 
D 


Oz Oz 
p= ax’ = 
with the condition that z — z(x, y) has T as its 
boundary. The *Euler-Lagrange differential 
equation for the functional J[z] is 
ó epe iex 
or (1 +.q7)r—2pqs + (1 4- p?)t «0, r = ?z/0x?, 

S OQ? z/OxOy, t 0? z/0y?, which is a second- 
order tquasilinear partial differential equation 
of elliptic type and whose geometric interpre- 
tation had already been given by M. C. Meus- 
nier (1776). 

To formulate the problem more generally, 
let a surface be expressed in vector form x= 
z(u, v) by means of parameters u, v. Let its 
tfirst fundamental form be dx? = E du? + 
2F dudv + G dv? and its tsecond fundamental 
form be —dxdn = Ldu? -- 2M dudv+ N dv’, 
with n —n(u, v) the unit normal vector. By 
equating to zero the tfirst variation of the 
areal functional 


{| EG — F? dudv 


based on infinitesimal displacement in the 
normal direction, we obtain the Euler- 
Lagrange equation in the form 


2H =(NE—2MF + LG)(EG — F?) 20, 


where H — (R4! + R;3)/2 is the tmean curva- 
ture of the surface and R, and R, are the 
tradi of principal curvature. Since *Beltrami’s 
second differential form satisfies A, x = Hn, the 
condition for a minimal surface becomes Ax = 
0 (with A the tLaplace operator) provided 
that isothermal parameters u, v satisfying E = 
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G, F=0 are chosen, i.e., the vector z(u, v) rep- 
resenting a minimal surface is harmonic (the 
components of this vector are tharmonic func- 
tions of u, v). Let y(u, v) be a harmonic vector 
conjugate to x(u, v). Then isothermality is ex- 
pressed by the condition that the analytic 
vector Ẹ(w)= x(u, v) + in(u, v) (w =u + iv,i— 
Be ) satisfies (w)? — 0 (Weierstrass). In 
general, a minimal surface is defined as a sur- 
face with everywhere vanishing mean curva- 
ture, and Plateau's problem is to determine the 
minimal surface with a preassigned boundary. 
In this formulation, the problem can be easily 
generalized to an n-dimensional Euclidean 
space R" (— 275 Minimal Submanifolds). 


C. Existence of a Solution 


The existence of a solution of Plateau's prob- 
lem was discussed by S. N. Bernshtein (1910) 
from the viewpoint of a "boundary value prob- 
lem of the first kind for the elliptic partial 
differential equation in the previous section. A. 
Haar (1927) dealt with the minimal problem 
for the functional J[z] by a direct method 

in the calculus of variations. Previously, 
Riemann, Weierstrass, Schwarz, and others dis- 
cussed the case where the given space curve 

I is a polygon, in connection with the *mono- 
dromy group concerning a second-order ltnear 
ordinary differential equation. Subsequently, 
R. Garnier (1928) investigated the existence of 
a solution by the limit process when F is a 
simple closed curve with bounded curvature. 
However, when T is assumed merely to be 
trectifiable, the existence of a solution was first 
shown by the limit process by T. Radó (1930). 
He further discussed the general case where I 
can bound a surface with finite area. On the 
other hand, by introducing a new functional 
depending on boundary values instead of the 
areal functional, J. Douglas (1931) succeeded 
in giving a satisfactory result for the existence 
problem. R. Courant (1937) gave another 
existence proof by reducing Plateau's problem 
to the *Dirichlet principle [3]. 

At present, the methods of discussing the 
existence of solutions of Plateau's problem can 
be classified into the following three sorts 
(represented, respectively, by Rado, Courant, 
and Douglas): 

(1) The first method is to minimize directly 
the areal functional f ( / EG — F?” du dv. The 
variational equation of the areal functional 
becomes H =0. 

(2) Dirichlet’s functional for a scalar function 
f(u, v) is defined by D[ f] =f (i? +f?) du dv, 
and for an n-dimensional vector function f(u, v) 
with components fi(u,v) (j — 1, ..., n) by D[T] = 

=1 DI fj]. The existence of a solution of 
Plateau's problem can be discussed by starting 
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from the variational problem of minimizing 
D[T]. The variational equation of D[T] is Af= 
0. 

(3) An analytic vector (w) is representable 
in terms of the boundary values of its real part. 
For instance, if the domain of w is the unit 
disk |w| < 1, then Poisson's integral formula 


1 2n id 
at- [^ bOO) a d0 ilm B(w) 


2n Jo 
with the boundary E b(8) =: Re Steff) 
can be used. On the other hand, the vector 
function that minimizes the Dirichlet integral 
among functions with fixed boundary values is 
harmonic. Based on these facts, Douglas trans- 
formed Dirichlet's functional with harmonic 
argument functions into a functional whose 
arguments are boundary functions. Specifi- 
cally, by starting from the problem of minimiz- 
ing Douglas's functional 


** (ul bai" 
Alb]= al [x AR AsinX(9— oy2 4 49. 


we can prove the existence of solution of 
Plateau's problem satisfactorily. 


D. The Generalized Case 


Up to now we have been concerned with 
Plateau's problem in the case of a single simple 
closed curve. Douglas, Courant, and others 
treated the generalized case of a finite number 
of boundary curves, where fgenus and orienta- 
bility are assigned as the topological structure 
of the surface to be found. The existence of a 
solution has been shown in this case also. The 
problem is further generalized from the case of 
fixed boundary to the case where the bound- 
ary is merely restricted to lie on a given mani- 
fold [3]. On the other hand, C. B. Morrey 
(1948) generalized the problem by replacing 
the ambient space R" by an n-dimensional 
tRiemannian manifold and gave the existence 
proof in considerable generality [6]. 


E. Relation to Conformal Mappings 


There is a notable relation between Plateau's 
problem and conformal mapping when the 
dimension of the space is 2. Namely, the exis- 
tence proof of the solution of the former for a 
Jordan domain implies tRiemann’s mapping 
theorem together with W. F. Osgood and C. 
Carathéodory's result on boundary corre- 
spondence (— 77 Conformal Mappings). 


F. New Developments 


Among recent contributions to the study of 
Plateau's problem, the following remarkable 
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results have emerged. One of them is con- 
nected with the final result of Douglas (1939) 
on the existence of solution surfaces. The 
mapping of a 2-dimensional manifold with 
boundary into R" defining Douglas's solution 
of the Plateau problem for a finite number of 
simple closed curves is a tminimal immersion 
with the possible exception of isolated points 
where it fails to be an immersion. These points 
are called branch points. It was then proved 
by R. Osserman (1970) and R. D. Gulliver 
(1973) that for n2 3 the mapping of Douglas's 
theorem, which is a surface of least area, is 
free of branch points, i.e., is an immersion. 
Osserman also gave examples of generalized 
minimal surfaces in R” (n> 3) with true branch 
points. In this connection, Gulliver also dealt 
with an analogous problem for surfaces of 
prescribed mean curvature. 

Next, we mention the question of boundary 
regularity. H. Lewy (1951) proved that if the 
boundary of a minimal surface is analytic, 
then the surface is analytic up to the bound- 
ary. Subsequently, S. Hildebrandt (1969) and 
others proved that if the boundary is of class 
C™*, m z |, the surface is also of class C"^* up 
to the boundary. There are also some recent 
results on the number of solutions of Plateau's 
problem. For instance, J. C. C. Nitsche (1973) 
proved the uniqueness of solutions for analytic 
boundaries of ‘total curvature at most 4r. 

Further developments in connection with 
Plateau's problem have emerged in the work 
of E. R. Reifenberg (1960) and others, who 
sought to minimize the tHausdorff measure 
among general classes of geometric objects, 
not as parametrized manifolds, but as subsets 
of R". The existence and regularity of solutions 
of Plateau’s problem from this point of view 
have been discussed by H. Federer (1969), W. 
H. Fleming, F. J. Almgren, and others [10]. 
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Henri Poincaré (April 29, 1854—July 17, 1912) 
was born in Nancy, France. After graduating 
from the Ecole Polytechnique, he taught at the 
University of Caen in 1879, then at the Uni- 
versity of Paris in 1881. He was made a mem- 
ber of the Académie des Sciences in 1887 and 
of the Académie Frangaise in 1908. He died in 
Paris. 

His achievements center on analysis and 
applications to theoretical physics and astron- 
omy. However, his work covered many fields 
of mathematics, including arithmetic, algebraic 
geometry, spectral theory, and topology. His 
tuniformization of analytic functions by means 
of the theory of tautomorphic functions in 
1880 is especially notable. His paper on the 


. tthree-body problem won the prize offered by 


the king of Sweden in 1889. 

The methods he developed in his three- 
volume Mécanique céleste (1892—1899) began 
a new epoch in celestial mechanics. In addi- 
tion, Poincaré opened the road to talgebraic 
topology and made suggestive contributions to 
the ttheory of relativity and tquantum theory. 
He asserted that science is for science's own 
sake [4], and his popular philosophical works 
concerning the foundations of natural science 
and mathematics exhibit a lucid style. 
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A. General Remarks 


On the existence of polynomial approxima- 
tions, we have Weierstrass's approximation 
theorem, which is formulated in the following 
two forms: (i) If f(x) is a function that is con- 
tinuous in the finite interval [a, b], then for 
every ¢>0 there exists a polynomial P,(x) of 
degree n — n(c) such that the inequality | f(x) 
— P (x)| <£ holds throughout the interval 

[a, b]. (ii) If f(0) is a continuous function of 
period 27, then corresponding to every posi- 
tive number e there exists a trigonometric 
polynomial of degree n = n(c), 


P,(0)— ag + y (a, cos k0 + b, sin k0), (1) 
k=1 


such that the inequality | f(0)— P,(0)| <€ holds 
for all values of 0. The second form of Weier- 
strass’s theorem follows from the first, and 
conversely. M. H. Stone obtained a theorem 
that generalizes Weierstrass’s theorem to the 
case of functions of several variables. Of the 
many direct proofs now available for Weier- 
strass’s theorem, we mention two simple ones. 
To prove version (1) of the theorem, we can 
assume that the given function f(x) is defined 
in the segment TO. 1]. Consider the Bernshtein 
polynomial 


: k n k n— 
B= CC (1—x)**. 


Then B,(x) converges to f(x) tuniformly. To 
prove (ii), we can apply !Fejér's theorem on 
*Fourier series. We have the following gen- 
eralization of (i): Let p,, p>,... be a sequence of 
positive numbers such that lim p, = oo. Then 
linear combinations of x? 2 1, x?', x”’,... can 
uniformly approximate each continuous func- 
tion on TO. 1] with arbitrary precision if and 
only if £ p, ! = —oo (Müntz's theorem). 


B. Best Approximations 


Let @o{x), qo, (x), ... be a sequence of linearly 
independent continuous functions on a 
bounded closed domain A in R”. For any 
given continuous function f(x), a function 
Bisi X1- oc, o, (x) attains 


€9 
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is called the best approximation of f(x) by a 
linear combination of (9,(x)]. For any given 
n there is a best approximation of j/'(x) by a 
linear combination of @o(x), ..., PiX), but such 
an approximation is not always unique. For 
such an approximation to be unique it is neces- 
sary and sufficient that the determinant of the 
matrix (o,(x;)) (k, i—0, 1,2, ..., n) is not zero, 
where Xo, x,, ..., x, are n+ 1 arbitrary distinct 
points of A (Haar's condition) (Math. Ann., 78 
(1918)). If (o, (x)) satisfies this condition, the 
system of functions Job a... is called a 
Chebyshev system (or unisolvent system). The 
sets (1,x, x^, ..., x") on [a,b], (1, cos x, ..., 
cosnx} on [0, x] and {sinx,...,sinnx} on [0,7] 
are Chebyshev systems. For a Chebyshev sys- 
tem {@,(x)} on [a,b], let P,(x) be a linear com- 
bination of (x), ..., o, (x) that is not identical 
to the function f eC[a,b]. Then P,(x) is the 
best approximation for f(x) if and only if there 
are at least n + 2 distinct points x, €... < X45 
of [a, b], where | f(x) — P,(x)| attains its maxi- 
mum (these points are called deviation points) 
and (f(x) — P,(x)) (P41) — (xi...) «0 (i= 
1,... n+ 1) (Chebyshev's theorem). 

For example, consider the polynomial P,(x) 
AH TE, a X- ag with real coeffi- 
cients such that 


n n-i 
max Ix"—a,.,x 


Sie 
—-1<x<l1 ol 


takes its smallest value. Then x" — P,(x)— 
2- "^U T (x), where T,(x) 2 cos(narccos x) is 
the Chebyshev polynomial of degree n. 

Since the best approximation is desired for 
numerical computation, several methods have 
been developed to find it (C 300 Numerical 
Methods). However, when the set 4 c R” 

(n 2 2) contains three nonintersecting arcs 
emanating from a common point, A admits no 
Chebyshev system. Thus we do not always 
have a unique best-approximatior. polynomial 


[16]. 


C. Degrees of Approximation and Moduli of 
Continuity 


For a continuous function f(x) defined on 
[a, b], the modulus of continuity of kth order is 








defined by 
E k 
ol Uz sup Zur Ia 
lal <t v=0 v 
Dees 


for t € (b — a)/k. In particular, c, is the ordi- 
nary modulus of continuity. Put Ež(f)= 

inf, DO x col f(x) — PpO), where 
f is a continuous periodic function of period 
22 and P,(x) is a trigonometric polynomial of 
the form (1). Then E? ( f) &c,o( f ; Y/(n 4- 1)) 
(Jackson's theorem [1]), where c, is indepen- 
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dent of f. The best possible coefficient c, has 
been determined by J. Favard [2]. Further 
investigations on the relation between E*( f) 
and w,(f;t) have been carried out by S. N. 
Bernshtein [3] and A. Zygmund [4]. S. B. 
Stechkin obtained the following results: 


1 n 
ofa) Ä (»-- I Ef) 


[5,6]. For the approximation of feC'([ —1,1]) 
by polynomials, there exists a polynomial 
P,(x) of degree at most n such that for any 
xe[-1,1], 


ISO) — P CAS M, (G9) o C£ 5 £0), 


where M, is a constant not depending on f, x, 
and n, f(x) is the rth derivative of f(x), and 
t(x) =(1/n)(,/1—x? +(|x|/n)). We also have 
theorems evaluating c,( f”; t) in terms of | f(x) 
— P,(x)|. For the proof of these theorems, 
estimation of the magnitude of the derivative 
of the polynomial of degree n plays an essen- 
tial role. For example, we have the Bernshtein 
inequality max, | T; (x)| &nmax,T,(x)| for any 
trigonometric polynomial T;(x) of degree n 
and the Markov inequality 


[Peo «min A1 —x,n] max. 1P,69] 
-1iSxS 


for xe[ —1, 1] and any polynomial P,(x) of 
degree n. 


D. Approximation by Fourier Expansions 


If (o, (x) is an torthonormal system of func- 
tions in L,(a,b) and f is any function in 

L,(a, b), then among all linear combinations of 
$o(x), ---, 9, (x) the one that gives the best 
mean square approximation to f (i.e., the one 
for which the integral 


| (0-5 nell dx 


a 


attains its minimum) is the Fourier polynomial 
Yi -od,9,(x), where a, = fa f(x) e, (x) dx. Con- 
sequently, the least square approximation (or 
best approximation with respect to the L,- 
norm) by trigonometric polynomials is given 
by the partial sum s,(x) of the Fourier series of 
f(x). For L, (1 « p « oo), s(x) also gives the 
best approximation up to a constant factor, 
but in the case of uniform approximation we 
have | f(x)—s,(x)|< A(logn)o,( f; n !), and this 
result cannot be improved in general. There is 
no linear operation that gives the best trig- 
onometric approximation. In approximation 
with a linear combination of g(x), ..., 9, (x), 
the saturation phenomenon of approximation 
often appears. For example, observe the arith- 
metic means of s,(x) (i.e., *Fejér means o,(x)). If 
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feLipa (0<a<1)(— 84 Continuous Func- 
tions A), then | f(x) — o,(x)| - O(n ^). However, 
If(x) — o,(x)| = O(n“) if and only if the tconju- 
gate function f(x)e Lip 1; |f(x) — o,(x)| 2 o(n !) 
if and only if f(x) is constant (M. Zamansky 
[7], G. Sunouchi and C. Watari [8]). 


E. Trigonometric Interpolation Polynomials 


Since the trigonometric system is a Chebyshev 
system, given 2n + 1 distinct points Xo, x,, ..., 
x5, and arbitrary numbers cs, c,, ..., C5, 
there is always a unique trigonometric poly- 
nomial of degree n with prescribed values c, at 
the points x,. Given any continuous function 
f(x) with period 2z, the trigonometric poly- 
nomial that coincides with f(x) at the points 
x, is called the trigonometric interpolation 
polynomial with nodes at x,. If x, —2nzk/(2n + 1) 
(k — 0, 1, ..., 2n), then the interpolating trigo- 
nometric polynomial is given by 

a 2 sin((n + 1/2) (x — xj)) 
A= ri 25 09 ane -xy2) 

=x [ot 1/2)(x — t)) 

X Jo sin((x — t)/2) 





do,(t), 


where ¢,(t) is a step function that has the 
value 2nj/(2n + 1) in [2zj/(2n + 1), 2x(j+ 1)/ 

(2n 4- 1)]. U,( f, x) resembles the partial sum 
s,(x) of the Fourier series of f(x). If f(x) is con- 
tinuous and of fbounded variation, then 

U,( f, x) converges uniformly to f(x) (D. Jack- 
son [1]). Although the partial sum s,(x) of the 
Fourier series of a continuous function f(x) 
converges almost everywhere to f(x), there is a 
continuous function for which U,( f, x) diverges 
everywhere (J. Marcinkiewicz [9]). Moreover, 
there exists a continuous function for which 
(1/n) (Xk-1 U,( f, x)) diverges everywhere (P. 
Erdós [10], G. Grünwald [11]). Restating 
these facts for the algebraic polynomial case, 
we can conclude that there is a continuous 
function defined in [ —1, 1] for which the *La- 
grange interpolation polynomial and its arith- 
metic mean are both divergent everywhere if 
we take as nodes ihe roots of the Chebyshev 
polynomial of degree n. 


F. The Case of a Complex Domain 


If a given function f(z) is holomorphic in a 
bounded tsimply connected domain E in the 
complex plane and continuous in £, then f(z) 
is approximated uniformly by polynomials on 
any compact set in E (Runge's theorem). This 
theorem was first studied by C. Runge, and his 
results were developed by J. L. Walsh and M. 
V. Keldysh (e.g., [12]). When E contains no 
interior point, the polynomial approxima- 





336 G 


Polynomial Approximation 


tion of a continuous function defined in E 
was given by M. A. Lavrent'ev. Unifying these 
two results, S. N. Mergelyan obtained the 
following theorem [13]: A necessary and suffi- 
cient condition for an arbitrary function con- 
tinuous on a compact set E and holomorphic 
inside E to be approximated on E uniformly 
by polynomials is that the set E does not 
divide the complex plane. 

On the degree of approximation of poly- 
nomials to f(z) on a simply connected domain, 
there are the following results: Let D be a 
closed bounded set whose complement K is 
connected and regular in the sense that K 
possesses a tGreen’s function G(x, y) with a 
pole at infinity. Let Dg be the locus G(x, y) 
log R>0. When f(z) is holomorphic on D, 
there exists the largest number p with the 
following property: f(z) is single-valued and 
holomorphic at every interior point of D,. If 
R <p, there exist polynomials P,(z) of degree n 
(n— 1,2, ...) such that |f/(z) — Pils M/R" for 
z€D, where M is a constant independent of n 
and z. On the other hand, there exist no such 
polynomials P,(z) on D for R p (Bernshtein 
and Walsh [12]). 


G. Lagrange's Interpolation Formula 


For each n (n—0, 1, ...), let z^, Ji... 20, be 
a given set of real or complex numbers, and let 
f(z) be an arbitrary function. Then there is a 
unique polynomial of degree n that coincides 
with f(z) at each point z® (k=1,...,n+1). 
This is called Lagrange’s interpolation poly- 
nomial and is given by 


P= Y fip) o 
& 


EN 
— dort) 
az) (z — zt)...(z— 20). 


The sequence P,(z) does not always converge 
to f(z). For example, if we take f(z) 2 1/z and 
the (n+ 1)st roots of 1 as z”, then P,(z) 5 z" 
and P,(z) converges to f(z) only at the point 1. 
For real variables also, there are examples 

of divergent P,(z). However, if f(z) is holo- 
morphic in [z| € p (p 1), then P,(z) with the 
(n+ l)st roots of 1 as nodes converges to f(z) 
uniformly in |z| <1. 

When z" is independent of the choice of n, 
P,(z) coincides with the sum of the first n terms 
of *Newton's interpolation formula. In this 
case, P,(z) is called Newton's interpolation 
polynomial and is given by 


P.(z)=a 9+ a,(z—z,)+a,(z—2,)(z—Z2)+... 
+ d,(Z—2,)...(2—-Z,), 


where ag = (21); a, =(f (22) —f(Z1))AZ2 —z1) 
(z,#z,), a4 = f (z;)(z,=2,); and so on. Suc- 
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cessive coefficients of the polynomial P,(z) can 
be calculated by ‘finite differences. Conver- 
gence of Newton’s interpolation polynomial ts 
closely connected to convergence of ‘Dirichlet 
series. 


H. Chebyshev Approximation 


Let D be a bounded closed subset of the com- 
plex plane, and f(z) a continuous function on 
D. Then there exists a polynomial z,(z) of de- 
gree n such that max,.p|f(z)—7,(z)| attains 
the infimum E,( f). The polynomial z,(z) is 
unique and is called the best approximation 
polynomial (in the sense of Chebyshev). If D is 
simply connected and f(z) is single-valued and 
holomorphic on D, then z,(z) converges to f(z) 
uniformly on D. Moreover, in this case there 
exist a number M that does not depend on n 
and a number R » 1 such that |f(zi— z,(z)| < 
M/R". Assuming that f(z) satisfies certain ad- 
ditional conditions, W. E. Sewell [14] proved 
the existence of a constant r such that |f(z) — 
n,(z)| € M/n' R". Furthermore, by approximat- 
ing f(z) 2 z" by polynomials of degree n— 1, 
we can show that there exists a polynomial 
T,(z) of degree n such that 


min len |z" az" '+... alt =T 
zeD 


T,(z) is called a Chebyshev polynomial of de- 
gree n with respect to the domain D. Similar 
statements are valid for functions of a real 
variable. In particular, when D —[ —1,1], we 
have 


T,(x) =cos(narccos x)/2" !, 


which is the ordinary (real) Chebyshev poly- 
nomial. Generally, the limit 


Lin 
lim (no | To) =p(D) 


exists, and p(D) coincides with the tcapacity 
and ttransfinite diameter of D [15]. For new 
results and applications of Chebyshev poly- 
nomials — [17]. 

If the method of evaluating the degree of 
approximation using the absolute value |f(z) — 
n,(z)| is replaced by methods using a *curvi- 
linear integral or ‘surface integral, as explained 
below, we still obtain similar results. Let D be 
a closed domain in the complex plane with a 
boundary C that is a rectifiable Jordan curve. 
If f(z) is single-valued and holomorphic on D, 
then there exists a polynomial z,(z) of degree 
n that minimizes the integral {-u(z)| f(z) — 

n, (z)|?|dz| (p 0), where u(z) is a given posi- 
tive continuous function on C. Moreover, 
|f(z) — n, (z)) < M/R" for some R > 1 (actually 
{x,(z)} is *overconvergent). If D is a closed 
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Jordan domain and if f(z) is single-valued and 
holomorphic in D, then there exists a poly- 
nomial z,(z) of degree n that minimizes the 
integral f f5u(z) f(z) — n, (z)|^ dS, where u(z) is 
a given positive continuous function on D. 
Moreover, | f(z) — z,(z)| < M/R” for some R>1 
on D. 


I. Approximation by Orthogonal Polynomials 
on a Curve 


Let C be a rectifiable Jordan curve in the 
complex plane, and let p,(z) e *L;(C). If 
fcp,(z)p,(z)|dz|=4,,, then {p,(z)} is called 

an orthonormal system on C. Given a holo- 
morphic function f(z) on D, we set a, = 
{-f(z)p,(z)|dz| and consider the formal series 
dio 4%, D, (z). If we denote the nth partial sum 
of this series by s,(z), then s,(z) is the least 
square approximation by a linear combination 
of po(z), ..., p,(z). This and other results, such 
as. tBessel’s inequality, the *Riesz-Fischer 
theorem, etc., are all valid here as in the theory 
of general torthogonal systems. In particular, if 
we take |z| - R as C, then {1,z,z?,...} is an 
orthogonal system. Since in this case 


1 E 1 f(z) 
ames Lite SE |dz| 


and s,(z)=a)+a,z+...+a,2", the tTaylor 
expansion of f(z) coincides with the ortho- 
gonal expansion of f(z). 

Given a compact domain D and a holomor- 
phic function on D, if there exist orthogonal 
polynomials p,(z) such that the orthogonal 
expansion of f(z) with respect to p,(z) con- 
verges to f(z) uniformly on D, we say that 
{p,(z)} belongs to the domain D. The problem 
of existence and determination of such poly- 
nomials for any given domain was proposed 
and first solved by G. Faber. Generalizations 
were given by G. Szegó, T. Carleman, and 
Walsh. Roughly speaking, p,(z) is given by the 
orthogonalization of the system (1,2, z7, ...] 
with respect to the curvilinear integral on C= 
OD or the surface integral on D. 





J. Numerical Approximation of Functions 


The accuracy of the approximation of a given 
function f(z) by the partial sums of its Taylor 
expansion X a,(x — xg)" decreases rapidly as 
the distance |x — xo| increases. The accuracy of 
the approximation of f(x) defined on a com- 
pact interval [ A, B] by a (polynomial) function 
(x) can be evaluated by means of the least 
square approximation, the best approximation 
with respect to the uniform norm, and so on. 
The second method is best suited to numer- 
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ical calculation of functions. To get the best 
approximating polynomial @(x)= P,(x) — 
23i -oCi94(X) of f(x) (— Section B), we must 
determine coefficients c, that satisfy the con- 
ditions of Chebyshev's theorem. The first step 
in this process is the orthogonal development 
of f(x) by Chebyshev polynomials {T,(x)}: 
Pn(X) = Li=o a, T,(u), u - (x —(A + B/2)((B— 
A)/2). The error | f(x) — o, (x)| is estimated by 
a constant multiple of Tp (u): f(x) — o,(x)| € 
K|T,,,(u)|. This Chebyshev interpolation is 
actually given by ay = N ! XN, f(x), a= 
2N ! Pa f(x) Tu) (k— 1, ..., n), where N = 
n+ 1 and the u;=(x;—(A + B/2)((B— Ay2) 
(i— 1, ..., N) are the roots of T,(u). Let M 

be the extremum of the error |f(x) — q,(x)| 

of such an approximation, and set f(x;) — 
P(x) = + M; (i 1,2, ..., N). Consider a func- 
tion „(x)= 2a, T, (u) satisfying f(x) — 9(x)— 
+ M. Then solve the linear equation $,(x;) — 
Q,(X;) 3x (M — Mj) with respect to Aa, =a, — a, 
and M. Repeat this process until Aa, becomes 
sufficiently small. 

À computer can perform the division very 
quickly, and the rational approximation of a 
function, for example by its tcontinued frac- 
tion expansion, is often useful. 
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A. Polynomials in One Variable 


Let R be a commutative fring and ao, di... a, 
elements of R. An expression f (X) of the form 


f(X)-ag*a, X +... +4, X" (1) 


is called a polynomial in a variable X over R; if 
a, #0, the number n is called the degree of the 
polynomial f(X) and is denoted by deg f. If a, 
= ], the polynomial (1) is called a monic poly- 
nomial. The totality of polynomials in X over 
R forms a commutative ring with respect to 
ordinary addition and multiplication (whose 
definition will be given later). It is called the 
ring of polynomials (or the polynomial ring) of 
X over R and is denoted by R[ X ]. We say 
that we adjoin X to R to obtain R[X ]. 


B. Polynomials in Several Variables 


Let R[ X, Y] denote the ring R[ X ][ Y ], name- 
ly, the ring obtained by adjoining Y to R[X ]. 
An element of R[ X, Y ] can then be expressed 
as Za, X"Y". This expression is called a 
polynomial in X and Y over R. Generally, 
RX, ..., Xm] = R[X,,..., Kal X,,] is called 
the polynomial ring in m variables (on m inde- 


1246 


terminates) X,, ..., X,, over R, and its element 


FS X eX aus AX X 


(2) 


(X denotes a finite sum for nonnegative in- 
tegral v; beginning with v, =v, =... = v, = 0) is 
called a polynomial in m variables X,, ..., X,, 
over R. We call each summand a term of the 
polynomial, esca the coefficient and v, + v; 
+... v, the degree of this term. The greatest 
degree of terms is called the degree of the 
polynomial F. The term a. o of degree 0 

is called the constant term of F. If a polyno- 
mial F in X,,..., Xm is composed of terms of 
the same degree n, then F is called a homoge- 
neous polynomial (or form) of degree n; a 
polynomial consisting of a single term, such 
as aX}! X» ... X", is called a monomial. 

Now let 24,25, ...,0,, be elements of R (ora 
commutative ring S containing R), and let 
F(a, 0% ,...,%,) denote the element of R (or 
S) obtained by substitution of «,,2,, .... Xm 
for Xi, X5, ..., X, in F(X,, Xz, ..., Xm). It is 
also called a polynomial in 2,,25, ...,0,,. If 
F(&,805,...,0,) 0, then (a, 25, ..., Bel 18 
called a zero point (in S) of the polynomial 
F(X,, X5, ..., Xm) (or a solution of the alge- 
braic equation F(X,,..., X,,)=0). In the case 
of one variable, it is called a root of F(X,) (or 
of F(X,)=0). 


C. Polynomial Rings 


Addition and multiplication in RLX ] are 
defined by 


(Za; X) + (X bX?) Elat b) X, 


i (x aX’ (È b X?) = El a;b;) X*. 


A polynomial f(X )e RLX ] can be regarded 
as a function of a commutative ring R' con- 
taining R into itself such that c> f (c). In this 
sense, f(X)-- g(X ) and f(X)g(X) are the func- 
tions such that c++ f(c) - g(c) and c f(c)g(c), 
respectively. 

It holds that deg( f(X)+g(X))< 
max (deg f (X ), deg g(X)}, deg f(X )g(X)< 
deg f(X )+ degg(X ). If R is an tintegral do- 
main, then the latter inequality is an equality, 
and therefore RLX ] is an integral domain. 
For these inequalities and for convenience 
elsewhere, we define the degree of ( to be 
indefinite. 

Assume that R is a field. For given f,geR[X] 
(degg 2 1), we can find unique q, re R[X ] such 
that f —gq-- r and degr « degg or r — 0 (divi- 
sion algorithm). This q is called the integral 
quotient of f by g, and r is called the remainder 
of f divided by g. The same fact remains true 
in the general RIX ] if g(X ) is monic. (— 369 
Rings of Polynomials). 
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D. Factorization into Primes 


Let k be an integral domain. Since k[ X ] and 
hence k[ X, ..., Xn] are integral domains, we 
can define the concepts concerning divisibility 
(such as a divisor, a multiple, etc.) (2 67 Com- 
mutative Rings). If k is a *unique factorization 
domain, then so are k[ X ] and k[X,, ..., Xm]. 
A polynomial over k is said to be primitive if 
the greatest common divisor of all the coeffi- 
cients is equal to 1. Every polynomial over k 
can be uniquely expressed as a product of 
some primitive polynomials and an element of 
k; a product of primitive polynomials is primi- 
tive (Gauss's theorem). 

If k is a field, then KLX ] and k[ X,, ..., Xn] 
are unique factorization domains. Further- 
more, to find the greatest common divisor 
(f.g) of f gek[X ], we can use the Euclidean 
algorithm, that 1s, apply the division algorithm 
repeatedly to obtain 


f=94ith1, g—ridor3, Fi =F243 ra, 


degg » degr, —degr, >..., 


so that after a finite number of steps we attain 
r,-4 — T, dy ari 70). Then r, - (f, g). Accord- 
ingly, k[ X ] is a *principal ideal domain. This 
algorithm is applied to Euclid rings (— 67 
Commutative Rings L). 


E. The Remainder Theorem 


Let k be an integral domain, f(X)eKk[X ], and 
let g(X )= X —a(aek). Then using the division 
algorithm, we get 


f(X)-(X —a)q(X) +r, 
q(X)ek[X], rek. 


Therefore, f (x) 2r; that is, the remainder of 
f (X) divided by X —« is equal to f(a). This is 
called the remainder theorem. If f (x) 0, then 
f(X) is divisible by X —« in k[X ]. 


F. Irreducible Polynomials 


Let k be a field. A polynomial TO ek X ] of 
degree n is said to be reducible over k if f is 
divisible by a polynomial of degree v «n in 
k[ X ] (v 40); otherwise, it is said to be irreduc- 
ible over k. Any polynomial of degree 1 is 
irreducible. A polynomial f is a *prime element 
of k[ X ] if and only if f is irreducible over k. 
Let I be a unique factorization domain. If 
f(X)is a polynomial (1) in I[ X ] such that 

for a prime element p in I, a, x 0 (mod p), 
4,.,954, 52... 2agz0 (mod p) but ap #0 
(mod p?), then f(X) is irreducible over the 
field of quotients of J (Eisenstein's theorem). 

If a polynomial (2) in m variables over an 
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talgebraic number field k is irreducible, we 
can obtain an irreducible polynomial in 

X, ..., X, (0< u <m) from the polynomial 
F(X,, ..., Xm) by assigning appropriate values 
in k to X,,,, ..., Xm (Hilbert's irreducibility 
theorem, J. Reine Angew. Math., 110 (1892)). 
These two theorems have been generalized in 
many ways and given precise formulations. In 
Hilbert's irreducibility theorem, the algebraic 
number field may be replaced, for example, 
by any infinite field that is *finitely generated 
over its tprime field (K. Dórge, W. Franz, E. 
Inaba). 


G. Derivatives 


Given a polynomial 
TUE, Kik: 9. aps he. eo oe due 


over a field k, we define the (formal) derivative 
of f with respect to X; as fi(X,,..., X,)— 
Xvid, wm XY Xfi? ... Xs and denote 

it by 0f/0X;. The map fr 0f/0X; is called 

the (formal) derivative with respect to X;. In 
particular, if m — 1, then 0f/OX is denoted by 
df/d X. The usual rules of tderivatives also 
hold for the formal derivative. If df/d X =0 for 
an irreducible polynomial f(X ) in k[ X ], then 
f (X ) ts said to be inseparable; otherwise, f (X) 
is separable. If the *characteristic of the field k 
is 0, then every irreducible polynomial f(X) 
(#0) is separable. When k is of characteristic 
p #0, an irreducible polynomial f(X ) is insep- 
arable if and only if we can write f(X )- g(X?). 


H. Rational Expressions 


The ffield of quotients of the polynomial ring 
k[ X,, ..., X,] over a field k is denoted by 

k(X,, ..., X,) and is called the field of ra- 

tional expressions (or field of rational func- 
tions) in variables X,,..., X, over k. Its element 
is called a rational expression in X,, ..., X,. 
It can be written as a quotient of one poly- 
nomial f(X,, ..., X,) by another polyno- 
mial g(X,, ..., X,) #0. Also, an expression 

S (015 ...,o,)/g(0,, ..., Gel obtained by replac- 
ing X,, ..., X, with elements «,,...,a, of k 
in the above expression is called a rational 
expression in 0, ..., X, (provided that 


g(a, -s Oy) #0). 


I. Symmetric Polynomials and Alternating 
Polynomials 


Let f(X,,..., X,) be a polynomial in vari- 
ables X,, ..., X, over an integral domain J. If 
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f(X,, ..., Xn) is invariant under every per- 
mutation of X,,..., X,, it is called a sym- 
metric polynomial (or symmetric function) of 
X,,...,X,. If f(X,,..., X,) is transformed 
into —f(X,,..., X,) by every todd permuta- 
tion of X,,..., X„ it ts called an alternating 
polynomial (or alternating function). Also, 
an expression f{(x,,...,%,) obtained from 
f(X,, ..., Xn) by replacing X,,..., X, with 
elements 2,, ...,«, of J is called a sym- 
metric (alternating) function of a,, ..., o, if 
f (X,, ..., Xn) is symmetric (alternating). 

Let the coefficient of X"^* in the expansion 
of (X — X,)...(X — X,) be denoted by ( — 1)*o,. 
Then we have c, =) X;,2 X, t ...-X,,0,— 
KEE E CEET X t.t Xni Xn y= 
X, X3, ..., Xn. Obviously, these are sym- 
metric polynomials of X,, ..., X,. Moreover, 
for every element o of the polynomial ring 
ILY,, Y, ..., Y], @(0,, 0, ...,0,) is a Sym- 
metric polynomial of X,, ..., X,. Conversely, 
every symmetric polynomial of X,,..., X, 
can be uniquely expressed as a polynomial 
(04,05, ...,0,). Thus the totality of symmetric 
polynomials of X,,..., X, is identical with the 
,0,]. This is called the funda- 
mental theorem on symmetric polynomials, and 
01,05, ...,0, are called elementary symmetric 
polynomials (or elementary symmetric func- 
tions). For example, for s, X: X, (v2 1,2,. .), 
we have s, —0,,5, —02 —26,,54—01 — 30,04 + 
30,,5,2 01 — 4010; + 202 + 40,04 — 40,. 
Concerning the elementary symmetric poly- 
nomials and the s,, we have the relations 
$,—01$,-., 055,5 — TE D) la, 5, + 
(—1)'vo, 20(v-1,2, ...), and s,—9,5,., 
+...4(—1)"6,5,-,=0 (yo nt 1,n+2,...) 
(Newton's formulas). 

Let p(X,, ..., X,) (X, — X, X4, — X)... 
(X,— X,YXX4 — X5)...(X,— X, ,) be the product 
of n(n — 1)/2 differences between X,,..., X,. 
Then the polynomial p is invariant under even 
permutations of X,,...,X,, and p becomes — p 
under odd permutations. Hence p is an alter- 
nating polynomial of X,, ..., X,. It is called the 
simplest alternating polynomial of these vari- 
ables. Because of its particular expression, p is 
also called the difference product of X,, ..., X,. 
If the characteristic of I is different from 2, an 
alternating polynomial f is divisible by the 
simplest alternating polynomial p; it can be 
written as f — ps, where s stands for a sym- 
metric polynomial. 


ring E 


J. Discriminants 


The square D(X}, ..., X,) - p(X}, ..., X,) of 
the simplest alternating polynomial p is a 

symmetric polynomial, and it is therefore a 
polynomial in o,,...,0,. D(a, %,...,%,) 20 
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gives a criterion for the condition that some of 
2,, ...,5X, are equal. If x,,...,%, are the roots of 
an ‘algebraic equation ag X" +a, X" '+...+ 
a, =0 of degree n, then D(x,,...,%,) is called 
the discriminant of the equation. It can be 
expressed in terms of coefficients de, a,, ...,a, 
of the equation. For instance, if n= 2, we 

have a2D =a? —4aga;; if n —3, we have aD = 
aĵa? -18a59a,a5a4, — 4aga3 —4a1a, - 27a2a2. 
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A. Newtonian Potential 


In dynamics, a potential means a function u 
of n variables x,,...,x, such that — grad u = 
— (Ou/Ox,, ..., Ou/Ox,) gives a field of force in 
the n-dimensional (n > 2) Euclidean space R”. 
Given a point P in R” and a measure p, the 
functions u(P) given by the integrals 


u(P) = — | logPQdu(Q) n=2, 


«ny- | POrduld, nz3, 

are typical examples of potential fuactions. 
They are called the logarithmic potential and 
Newtonian potential, respectively. However, 
some authors mean by Newtonian potential 
the function u(P)= f PQ  du(Q) in R?. Usu- 
ally, the measure u is taken to be a nonnega- 
tive "Radon measure with compact 'support. 
These potentials are *superharmonic in R" and 
harmonic outside the support of u. Conversely, 
any harmonic function defined on a domain in 
R" can be expressed as the sum of a potential 
of a single layer and a potential of a double 
layer (defined 1n the next paragraph). Because 
of this close relation between potentials and 
harmonic functions, sometimes potential 
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theory means the study of harmonic functions 
(— 193 Harmonic Functions and Subhar- 
monic Functions). (For the representation by 
potentials of superharmonic functions — 193 
Harmonic Functions and Subharmonic Func- 
tions S.) 

Suppose that the measure u of R? satisfies 
du — p dx with sufficiently smooth density p 
and volume element dt. Then the Newtonian 
potential u of u satisfies Poisson’s equation 
Au = —47p. If the support of p is contained in 
a surface S and du = p do with density p and 
surface element doc, then the potential u of p is 
called the potential of a single layer (or simple 
distribution). If p is continuous on S, then u is 
continuous in the whole space, and the 'direc- 
tional derivative of u at a point P in the direc- 
tion of the normal line to S at P, tends to 


do(Q) 


P=Py 





| ô 1 
— 2np(Pg) 4- L——- 
s Np PQ 
as P approaches P, along the normal line. 
Therefore, as P moves on the line, the direc- 
tional derivative jumps by — 4zp(P,) at Po. If p 
satisfies the tHdlder condition at P, € S, then 
the derivative at P in the direction of any fixed 
tangent line at P, has a finite limit as P tends 
to Py along the normal line. The integral 


PQ 


is called the potential of a double layer (or 
double distribution). If p is continuous on S, 
then the limits at P, of u from the two direc- 
tions along the normal line at P, exist and are 
2np(P3) 4- (P) and — 2xp(P;) - u(P,). If, fur- 
ther, p is of class C? on S, then each partial 
derivative of u has a finite limit as P tends to a 
point of S. 


ð 1 
«n- | Pa po ^" (9) 
s ONg 


B. Generalized Potential 


The classical notion of potentials is generalized 
as follows: Let O be a space supplied with a 
measure u (2:0) and ®(P, Q) a real-valued 
function on the product space O x O. When 
the integral ( O(P, Q)du(Q) is well defined at 
each point Petit, it is called the potential of u 
with kernel ® and is denoted by o (P, ul or 
ÖP). The function dP. Q) 2 4(Q, P) is called 
the adjoint kernel of . When 


(u, v)— [foe Q)du(Q)dv(P) 


= foe, u)dv(P) 


exists for measures u, v>0, the value is called 
the mutual energy of u and v. In particular, 
(u, u) is called the energy of u. The definition of 
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potential given above may be too general, and 
some restrictions are called for. We assume 
that Q is a flocally compact Hausdorff space; 
Q is a flower semicontinuous function on Q x 
Q satisfying —oo « 9 x oo; and u, v, and 4 are 
nonnegative Radon measures with compact 
support in Q. In particular, when Q =R”, the 
potential with the kernel ®(P, Q) - PO" 

(0 « a « n) is called a potential of order x (some- 
times of order n — «) or a Riesz potential. 


C. The Maximum Principle and the Continuity 
Principle 


Let (2 — R", and let P. ui be the kernel of 
the Newtonian potential. Then ®(P, ul satis- 
fies the following principles: (1) Frostman's 
maximum principle (first maximum principle): 
Suppeo O(P, (ls SUPpes, P(P, p) for any 4, 
where S, is the support of u. (2) Ugaheri’s 
maximum principle (dilated maximum prin- 
ciple): There is a constant c>0 such that 
SUPpeg P(P, H) S c suppes, P(P, u) for any p. (3) 
À variation of Ugaheri's maximum principle: 
Given any compact set K c Q, there exists a 
constant c which may depend on K such that 
Supper O(P, i) < csuppes, (P, p) for any A 
with S, c K. (4) Upper boundedness principle: 
If O(P, u) is bounded from above on S,, then 
it is bounded on Q also. (5) For any compact 
set K c Q and any y with S, c K, if (P, u) is 
bounded above on S,, then it is bounded on K 
also. (6) Continuity principle: If ®(P, 1) is con- 
tinuous as a function on S,, then it is also con- 
tinuous in Q. Generally, the relations shown 
in Fig. 1 hold among principles (1)-(6), where 
(a) (b) means that (a) implies (b), and (c)-»(d) 
means the negation of (c) (d). 


Z (3) 


(1) = (2) (5) = (6) 


Sa 
Sa Z 
SQ (4) ix 
Fig. 1 


If the continuity principle holds for a gen- 
eral kernel ®(p, ul of a potential and if for any 
H there is a sequence {P,} of points in Q—S, 
that has an accumulation point on S, and 
along which ®(P,, ij) supo. s, P(P, ol, then (1) 
holds also. The second condition is valid, for 
instance, when Q= R”, P. u) is 'subharmonic 
in R" — S,, and limsup (P, u) « supgcgs P(Q, u) 
as P tends to the point at infinity. T. Ugaheri, 
G. Choquet, and N. Ninomiya studied (2) and 
(6). Ugaheri showed that for any nonnegative 
decreasing function g(r) defined in [0, oc) and 
satisfying eil — oo, the kernel ®(P, Q) - o(PQ) 
satisfies (2) in R". M. Ohtsuka established 
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that (6) (5) and (5)-»(6) in general and proved 
(5)—(6) in the special case where 6 is contin- 
uous on Q x Q in the wider sense (i.e., ® may 
have oo as its value) and © is finite outside the 
diagonal set of Q x O. The examples in Sec- 
tions F, I, and J show that the potentials with 
kernels satisfying a weak condition such as (6) 
possess a number of important properties. We 
note that (6) does not necessarily hold in gen- 
eral. (For literature on (1)-(6) and other re- 
lated principles — [18].) 


D. The Energy Principle 


Denote by E the class of all measures of finite 
energy. A symmetric kernel is called positive 
definite (or of positive type) if (yj —», u »)= 

(u, i) t- (v, v) —- 2(1, v) 20 for any u, ve E. If 

the equality (u — v, u — v) 20 always implies p= 
v, then the kernel is said to satisfy the energy 
principle. Some characterizations for a kernel 
to be of positive type or to satisfy the energy 
principle were given by Ninomiya. Using 
them, he showed that a symmetric kernel that 
satisfies Frostman's maximum principle or the 
domination principle (— Section L) and a 
certain additional condition is of positive type. 
Choquet and Ohtsuka generalized these defi- 
nitions and results [18]. 


E. Topologies on Classes of Measures 


Let C, be the space of continuous functions 
with compact support in Q and AM. be the 
class of measures in Q. The tseminorms pp — 
v5|f f du — f dv| (f € Co) define the vague 
topology on Mo . The class of unit distribu- 
tions can be topologized by the vague topol- 
ogy, which induces a topology on Q itself. This 
topology coincides with the original topology 
in Q. A subclass M of Mj is relatively com- 
pact with respect to the vague topology if on 
every compact set in Q, the values of the mea- 
sures of M are bounded. Denote by L the class 
of measures 4 such that (2, u) is finite for all 
pe Mg , and define the fine topology on Mọ by 
the seminorms u — v |(4, u) — (A, v)| (4€ L). This 
topology was introduced by H. Cartan [4] for 
the Newtonian kernel. It is the weakest topol- 
ogy that makes each ®(P, A), Jet, continu- 
ous. Further, when the kernel is of positive 
type, the weak topology is defined on E by the 
seminorms u — v-](4, u) — (A, v)| (4e E). The 
strong topology is defined on E by the semi- 
norm J/(u — v, u — v). 

For the Newtonian kernel, Cartan [5] 
showed that vague «fine « weak «strong on E, 
where vague fine, for instance, means that 
the fine topology is stronger than the vague 
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topology. He proved that the fine, weak, and 
strong convergences are equivalent for any 
sequence {j} of measures with bounded 
energies. B. Fuglede [16] called a kernel of 
positive type consistent when any *Cauchy 
net with respect to the strong topology that 
converges vaguely to a measure converges 
strongly to the same measure. This notion is 
used to give conditions for E to be tcomplete 
with respect to the strong topology [11,18]. 
Moreover, Fuglede called a consistent kernel 
satisfying the energy principle perfect, and 
studied the cases where tconvolution kernels 
on a locally compact topological group are 
consistent or perfect [11]. For instance, PO~? 
(0 «a « n) in R” and the Bessel kernel, which 
was studied by N. Aronszajn and K.. T. Smith, 
are perfect. 


F. Convergence of Sequences of Potentials 


We are concerned with determining when a 
family of potentials corresponding to a class 
of measures {4a} with indices œ in a directed 
set converges. If all S, are contained in a 
fixed compact set and H. converges. vaguely 
to Ho, then liminf (P, uo) > (P, Ho) in Q. If, 
moreover, ® is continuous in the wider sense 
and both ® and @ satisfy the continuity prin- 
ciple, then equality holds quasi-everywhere 
(q.e.) in Q in this inequality [18]; we now de- 
fine the notion q.e. First, for a nonempty com- 
pact set K in Q, define W(K) to be :nf(j, u), 
where S, c K and u(K)— Il, and W(@) to be 
co for the empty set Ø. Next, set W(X) = 
inf, e W(K) for an arbitrary set X in Q and 
W,(X) —supy. 5 WG), where G is an open set 
in Q. When a property holds except on a set X 
such that W,(X) = oo (resp. W(X) = oo), we say 
that the property holds quasi-everywhere (q.c.) 
(resp. nearly everywhere (n.e.)). The terms q.e. 
and n.e. are also used in the theory of *capac- 
ity, although their meaning is not tae same as 
here. (For results on the convergence of se- 
quences of potentials — [18].) 


G. Thin Sets 


A set X <Q is called thin at Pj when either P, 
is an isolated point of the set X U {P} with 
respect to the original topology of Q or there 
exists a measure u such that lim inf (P, u) > 
Q(P,, 4) as Pe X — (P5) tends to Py. If Po is 
an isolated point of X U {Po} with respect to 
the topology weakest among those stronger 
than both the original topology and the fine 
topology, then X is thin with respect to the 
adjoint kernel ®. The converse is true in a 
special case [5]. The notion of thinness was 
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introduced in 1940 by M. Brelot and inves- 
tigated in detail in [3]. Let g(r) be a positive 
decreasing function. Suppose there exist posi- 
tive numbers ra, 6, a such that o(r) <ag((1+ 
6)r) in 0 € r « rg. Assume that Q is a metric 
space with distance p, and take g(p(P, Q)). 

as the kernel ®(P, Q). Then a necessary and 
sufficient condition for X to be thin at P, is 
XR S/W,(X)) «oo (s 1) [19], where X;= 
(PeX|s! € o(p(P, Po))<s/*"}. This criterion 
was obtained by N. Wiener in 1924 and uti- 
lized to give a condition for a boundary point 
P, of a domain D in R? to be tregular with 
respect to the tDirichlet problem for D. In this 
situation, P, is regular if and only 1f the com- 
plement of D is thin at Py. When every com- 
pact subset of X is thin at Py, X is called inter- 
nally thin at P). A necessary and sufficient 
condition for X to be internally thin at P, is 
XS/W(Xj«oo. 


H. Polar Sets 


Brelot (1941) called a set A polar when there 

is a measure p for which (P, ju) 2 oo on A. 
Consider U(Q, K) (= V(K, Q)) as defined 

in 48 Capacity C, and define Ü,(Q, X) by 
sup, yinfg e U(Q, K) for an arbitrary set X, 
where G is an open set. Then for any p, X = 
(P | (P, u) 2 oo] is a G; set for which the value 
of U,(Q, X) is infinite. Conversely, given a G; 
set A of tNewtonian outer capacity zero in R” 
(n> 3), there is a measure u such that the set of 
points where the Newtonian potential of u is 
equal to oo coincides with A and u(R" — A)=0 
(Choquet [7]). This result is called Evans's 
theorem (or the Evans-Selberg theorem) (— 

48 Capacity) in the special case where A is 
compact. 


I. Quasicontinuity 


A function f in Q is called quasicontinuous if 
there is an open subset G in Q of arbitrarily 
small capacity such that the restriction of f to 
Q — G is continuous. Naturally, quasicontinu- 
ity depends on the definition of capacity. Sup- 
pose that whenever the potential of a measure 
u with kernel © is continuous as a function on 
S, it is quasicontinuous in Q; d is then said to 
satisfy the quasicontinuity principle. Assuming 
in addition that ® is positive symmetric and 
taking 1/U (Q, G) as the capacity of G, we find 
that every potential is quasicontinuous in Q 
(M. Kishi). A similar result is valid for a non- 
symmetric kernel if the continuity principle is 
assumed (Choquet [6 ]). 


338L 
Potential Theory 


J. The Gauss Variational Problem 


Given a compact set K and a function f on K, 
the problem of minimizing the Gauss integral 
(u, u) 2 | f du for a measure u such that $, C K 
is called the Gauss variational problem. When 
an additional condition is imposed on yp, the 
problem is called conditional. Among many 
results obtained for this problem [18], the 
following is typical: If is symmetric, K sup- 
ports a nonzero measure of finite energy, and f 
is finite upper semicontinuous on K, then there 
exists a u such that f(P) x (P, u) n.e. on K 
and f(P)>®(P, ul on S,. When 6 is not sym- 
metric, the same relations hold for some pz 

if is positive and © satisfies the continuity 
principle (Kishi [15]), although the method 
using Gauss variation is not applicable. When 
® is symmetric and of positive type, the Gauss 
integral with f(P) — (P, v) (ve E) is equal to 
\|4—v||?—||v||?, and the minimizing problem 

is equivalent to finding the projection of v to 
(ue EIS, c K}. In some cases, this projection is 
equal to the measure obtained by the balayage 
of v to K (— Section L). 


K. Equilibrium Mass Distributions 


A unit measure u supported by a compact set 
K is called an equilibrium mass distribution on 
K if (P, u) is equal to a constant a n.e. on K 
and ®(P, p) «a in Q. The kernel is said to 
satisfy the equilibrium principle if there exists 
an equilibrium mass distribution on every 
compact set. If a 0, 1/a can be regarded as a 
kind of capacity, and p/a is called a capacitary 
mass distribution. Corresponding to tinner 
and outer capacities, inner and outer capaci- 
tary mass distributions and their coincidence 


can be discussed [11]. When ® is symmetric, 


Frostman's maximum principle is equivalent 
to the equilibrium principle. 


L. The Sweeping-Out Principle 


A kernel is said to satisfy the sweeping-out 
principle (or balayage principle) if for any com- 
pact set K and measure p there exists a mea- 
sure v supported by K such that ®(P, v) - 
(UP. u) n.e. on K and P. v) x O(P, u) in Q. 
When we find such a v, we say that we sweep 
out u to K, and finding v is called a sweeping- 
out process (or balayage). This terminology 
originated in the classical process for the New- 
tonian potential in which the exterior of K is 
covered by a countable number of balls and 
the masses inside the balls are repeatedly 
swept out onto the spherical surfaces. For any 
general kernel, the balayage principle implies 
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the domination principle (also called Cartan’s 
maximum principle), which asserts that if the 
inequality ®(P, ul ®(P, v) is valid on S, for 
HEE and any v, then the same inequality holds 
in Q. The converse is true if is positive, sym- 
metric, continuous in the wider sense, and 
finite outside the diagonal set. In contrast to 
the domination principle, ® is said to satisfy 
the inverse domination principle if the inequal- 
ity D(P, u) « (P, v) on S, for ue E and any v 
implies the same inequality in Q. In a special 
case, the domination principle implies Frost- 
man's maximum principle [17]. For the equi- 
librium and domination principles for non- 
symmetric kernels — [14]. 

Corresponding to inner and outer capaci- 
tary mass distributions, we can examine inner 
and outer balayage mass distributions and 
inner and outer Gauss variational problems 
and their coincidences [5, 18]. With respect to 
the Newtonian potential, a point P is called an 
internally (externally) irregular point of X if the 
inner (outer) balayage mass distribution to X 
of the unit measure e, at P is different from ep. 
X is thin (internally thin) at P if and only if P 
is an externally (internally) irregular point of X 


(Cartan [5]). 


M. Other Principles 


A kernel 6 is said to satisfy the uniqueness 
principle if u =v follows from the equality 
(P, u) - 9(P, v), which is valid ne in Q. Nino- 
miya and Kishi studied this principle. A ker- 
nel ® is said to satisfy the lower envelope prin- 
ciple if given u and v, there is a å such that 
®(P, 4) 2 min(O(P, u), (P, v)). If ® satisfies the 
domination principle and Č satisfies the con- 
tinuity principle, then ® satisfies the lower 
envelope principle on every compact set con- 
sidered as a space. Conversely, if ® satisfies 
the lower envelope principle on every compact 
set considered as a space, then ® satisfies the 
domination principle or the inverse domi- 
nation principle under some additional con- 
ditions (Kishi). A kernel is said to satisfy the 
complete maximum principle if the inequality 
O(P, u) x &(P, v) x a on S, implies the same 
inequality in Q for we E, any v, and a z 0 (Car- 
tan and J. Deny, 1950). This principle implies 
both Frostman's maximum principle and the 
domination principle. Potentials of order a 
(n—2<a<n) in R” and the Yukawa potential 
with kernel aPQ~' exp(—APQ) in R? satisfy 
the complete maximum principle. Relations 
between this principle and some other prin- 
ciples were studied by Kishi [14]. 

While all principles discussed so far are 
global, Choquet and Ohtsuka made a local 
study. 
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N. Diffusion Kernels 


By means of the bilinear form f f du (f eC, 

ue Mo), we now introduce weak topologies in 
the space C of continuous functions in Q and 
in the class M, of Radon measures of general 
sign with compact support. Similarly, we in- 
troduce weak topologies in Cy and in the class 
M of measures of general sign with not neces- 
sarily compact support. A positive linear map- 
ping G of M, into M that is continuous with 
respect to these two weak topologies is called 
a diffusion kernel. The linear mapping G* of 
C, into C that is determined by f fdGu= 

f G*f dy is called a transposed mapping. We can 
define the balayage principle for G and the 
domination principle for G* as in the case 
where kernels are functions. Then G satisfies 
the balayage principle if and only if G* satis- 
fies the domination principle (Choquet and 
Deny). The complete maximum principle has 
been defined and studied for G* (Deny). G. A. 
Hunt obtained a relation between this prin- 
ciple and the representation of G*f in the 
form LC P, f dt with a *semigroup P. His result 
is important in the theory of *stochastic pro- 
cesses (— 261 Markov Processes). 


O. Convolution Kernels 


A diffusion kernel G on a locally compact 
Abelian group induces a convolution kernel x 
if G is translation-invariant. It is called a Hunt 
kernel when there exists a vaguely continuous 
semigroup (al, a such that x= f8 c dt and a, 
— £g, the Dirac measure at the origin. A Hunt 
kernel satisfies the domination principle and 
the balayage principle to all open sets, and it 
satisfies the complete maximum principle if 
and only (Ia is sub-Markovian, (da, <1. 
The Fourier transform of such a sernigroup 
has a closed connection with a negative- 
definite function [1]. 

For a convolution kernel x satisfying the 
domination principle, or, equivalently, the 
balayage principle, the inequality 


Klon )K(@2) € K(w, — @,)K(@, +0) 


is valid for all relatively compact open sets oi 
and w, [8]. It has a unique decomposition 

K — 9 (Kg +K), where ¢ is a continuous ex- 
ponential function, ko is equal to 0 or a Hunt 
kernel satisfying the complete maximum prin- 
ciple, and x, is a singular kernel satisfying the 
domination principle such that x, «e, —x, for 
every xeS, [12]. 


P. Potentials with Distribution Kernels 


A function f in R" is called slowly increasing 
in the sense of Deny if there exists a positive 
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integer q such that f f(P)(1+ OP?) *dx(P) < 
oo. A tdistribution K is called a distribution 
kernel if the fFourier transform FK of K isa 
function k 2 0 and both k and 1/k are slowly 
increasing in the sense of Deny. Given such 

a distribution K, the family W of distributions 
T for which FT are functions and | T||? = 
[Kk(FTY dx < oo (this is called the energy of 

T) is a tHilbert space with inner product 

(T,, T)) - [KF T, FT, dt. However, the family of 
Newtonian potentials of measures with finite 
energy is not a Hilbert space (Cartan). The 
family of functions of class C? with compact 
support is *dense in W. For every Te W the 
function FK x FT is slowly increasing in the 
sense of Deny. There exists a distribution U 
=U" that satisfies FU = FK x FT, called the 
K-potential of T. Since W is complete, the 
method of projection is applicable, and prob- 
lems of equilibrium, balayage, and capacity 
can be examined. For instance, if tDirac’s 
distribution 6 is taken as a distribution kernel, 
then the corresponding capacity is the Le- 
besgue measure. In the case of the Newtonian 
kernel, 0U7/0x; — f; is defined a.e. in R” for any 
T e W. These f; are square integrable, T = 
—c,25-, 0fj/0x;, and || TI? = cp 37. [GI dv, 
where 1/c, 2 2(n — 2)n"?/T (n/2). Furthermore, 


UT =(n—2) DIE 


where x;, y; are components of P, Q, respec- 
tively. Every ordinary potential of a double 
layer is a special case of UT. Conversely, let f 
be a function in R" that is absolutely continu- 
ous along almost every line parallel to each 
coordinate axis and whose partial derivatives 
are square integrable. Then f is equal to the 
potential of some Te W with Newtonian ker- 
nel up to an additive constant. These results 
are due to Deny [9]. 


Q. Dirichlet Spaces 


In this section, functions are assumed to be 
complex-valued. Let Q be a locally compact 
Hausdorff space, £z 0 a Radon measure in Q, 
and C, the space of continuous functions with 
compact support. A Hilbert space D consisting 
of locally £-integrable functions is called a 
Dirichlet space if C, N D is dense in both C, 
and D, the relations |v(P) —v(Q)| < |u(P) — u(Q)| 
and |v(P)| x |u(P)| for ue D and a function v 
always imply ve D and lvl < ||u||, and for any 
compact set K c Q, there exists a constant 
A(K) such that fx lu dÉ < A(K)lu|| for every 

u€ D. The notion of Dirichlet space was in- 
troduced by A. Beurling. A function ue D is 
called a potential if there exists a Radon mea- 
sure u such that (u, 9) —- ( @dy holds for every 
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q€ C41 D. If in addition 5 z 0, then u is called 
a pure potential. For any pure potential, the 
lower envelope principle, the equilibrium 
principle, the balayage principle, and the com- 
plete maximum principle hold [2]. Suppose 
that € is a locally compact Abelian group and 
D is a Dirichlet space such that U,u(x) 2 u(x — 
y)eD and ||U,u|| = Jul for every ue D and 
y€€Q. Then we call D special and characterize 
it in terms of a real-valued continuous func- 
tion on Q [2]. (For axiomatic potential theory 
— 193 Harmonic Functions and Subharmonic 
Functions.) 
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A. General Remarks 


Let a and co, c,, c5, ... be elements of a "feld 
K and z be a variable. A series of the form P= 
Xzoc,(z—ay is called a power series (in one 
variable). We assume that K is the field of 
complex numbers. For a given power series P, 
we can determine a unique real number R 
(0 x R< oo) such that P converges if [z —a| « R 
and diverges if R « |z — a|. We call R the radius 
of convergence and the circle |z — a| = R (some- 
times |z —a| < R) the circle of convergence of P. 
The value of R is given by R= 1/lim sup t/c] 
(Cauchy-Hadamard formula) with the conven- 
tions 0— 1/oo, oo 2 1/0. Also R=lim|c,/c,44\|, 
provided that the limit on the right-hand side 
exists. 

A power series *converges absolutely and 
uniformly on every compact subset inside 
its circle of convergence and defines there a 
single-valued complex function. Since the 
series is ttermwise differentiable, the function is 
actually a holomorphic function of a complex 
variable. Conversely, any function f(z) holo- 
morphic in a domain can be represented by a 
power series in a neighborhood of each point 
a of the domain. Such a representation is 
called the Taylor expansion of f(z) at a (or in 
the neighborhood of a). A power series that 
represents a holomorphic function is called a 
holomorphic function element. K. Weierstrass 
defined an analytic function as the set of all 
elements that can be obtained by tanalytic 
continuations starting from a given function 
element (— 198 Holomorphic Functions). 

Besides the series £o c,(z — a)", a series of 
the form Q = >". 9c,z ” is called a power series 
with center at the point at infinity, and its value 
at oo is defined to be cy. By putting z—a =t 
when its center a is a finite point and z ! =t 
when its center is oo, every power series can be 
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written in the form 27, c,t", and such a t is 
called a local canonical parameter. 

When t is a local canonical parameter, a 
series of the form X... c,t" is called a Lau- 
rent series and a series 2... c,t"/* (k a fixed 
natural number) is called a Puiseux series, after 
the French mathematicians A. Laurent and V. 
A. Puiseux. Power series are sometimes called 
Taylor series. 

If we perform tanalytic continuations of a 
power series from its center along radii of its 
circle of convergence, we encounter a tsingular- 
ity on the circumference for at least one radius. 
For a power series with the circle of conver- 
gence |z|= R, the argument a of the singularity 
on |z| - R nearest z =R is given in the follow- 
ing way. Suppose, for simplicity, that the 
radius of convergence R of È c,z" equals 1, 
and put 


p.h) = ("Jes 
v=0 y 
P(h)=lim sup Aal 


Then « is obtained from 


cosa = PL (0)= lim (P(h)— Lh 


(S. Mandelbrojt, 1937). In particular, if all the 
c, are real and nonnegative, z= R is a singular- 
ity (Vivanti's theorem). 


B. Abel’s Continuity Theorem 


As a property of the power series on the circle 
of convergence, we have Abel's continuity 
theorem: If the radius of convergence of f(z) — 

1-9 0,2" is equal to 1 and X7, a, coverges 
(or is t(C, k)-summable (k> —1)) to A, then 
f(z)-5 A when z approaches 1 in any sector 
{z||z| <1, Jarg(1 — 2) «(n/2) —8), 07.0 
(*Stolz’s path). 

The converse of this theorem is not always 
true. The existence of lim,_, f(z) does not 
necessarily lead to the convergence of 3. 4,. 
Even Cesaro summability of £oa, does not 
always follow from the existence of lim,.., f(z). 
If a, — o(1/n) and f(z)— A when z approaches 1 
along a curve ending at z — 1, then 2a, con- 
verges to A (Tauber's theorem, 1897). The 
theorems concerning additional sufficient 
conditions for the validity of the converse of 
Abel’s theorem are called theorems of Tauber- 
ian type (or Tauberian theorems). In Tauber’s 
theorem, the hypothesis on the a, may be 
replaced by a, — O(1/n) or nRea,, nIma, may 
be bounded from above but not necessarily 
from below (G. H. Hardy and J. E. Little- 
wood). Here, the condition a, — O(1/1) cannot 
be weakened (Littlewood). Sufficient condi- 
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tions for Xa, to be tsummable for various 
summation methods are also known. 


C. Lambert Series 


A series of the form 


n 


oo Z 
È t 


is called a Lambert series. If Za, is convergent, 
(1) converges for any z with |z| #1, and more- 
over, it converges uniformly on any compact 
set contained in |z| « 1 or |z| 1. If Xa, is 
divergent, (1) and the power series Y a, z" 
converge or diverge simultaneously for |z| #1. 

There is a detailed study of Lambert series 
by K. Knopp (1913). If R is the radius of con- 
vergence of 2a,z" and Xanda = A, is the sum 
extending over all divisors of n, then we have 
the reciprocity relation 





Iz| 1, (1) 





which holds for |z| « min(R, 1). As special cases 
of this relation, we have 


n 


zz X Hn); z> 
n=1 = 





ü doa -Y 9(n) a 
where yp and ¢ are the 'Móbius function and 
"Euler function, respectively. 

If the na, are real and bounded from below, 


"eh 
Amo » (1 -2na. a o es 
implies Za, —s. Hardy and Littlewood (1921) 
showed that this theorem of Tauberian type is 
equivalent to the tprime number theorem. 


D. Singularities of Power Series 


Given a power series P — Y a,z", if the tbranch 
in |z|] « R* of the analytic function f(z) deter- 
mined by P is single-valued meromorphic but 
the branch in |z| « R' with R’ > R* has sin- 
gularities other than poles, then R* is called 
the radius of meromorplty of P and |z|= 
(sometimes |z| « R*) is called the circle of 
meromorphy of P. R* can be computed in the 
following way. Put 


1, —limsup|4/ D&?|, 


n- oo 


a, Os 1 CH Og kp 


(p)... nsi n2 Ont pei 
DP = : 


a 


n+p Os p41 Qn+2p 
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Then the sequence of numbers |,. , /l, is in- 
creasing, and ml, Aas R*. If R,, R2,... are 
different valued of LA, then f(z) is holo- 
morphic at points not lying on |z| - R, (Hada- 
mard's theorem, 1892). 

When a point a and a set A of points in the 
complex plane are given, the set of points that 
can be joined with a by a segment disjoint 
from A is called the star region determined 
by a and A. Take any half-line starting at 
a; the point of A lying on it and nearest to 
a is called a vertex. For a power series 2: c,z", 
the set of centers of the function elements 
obtained by analytic continuations along 
half-lines starting at the origin is called the 
star region of X c,z" with respect to the 
origin. Let {a} and (fj) be the set of vertices 
of the star regions with respect to the origin 
of X a,z" and È b,z", respectively. Then the. 
star region determined by the origin and 
the set {a} is contained in the star region 
of 2.a,b,z" (Hadamard's multiplication theo- 
rem, 1892). 

The following are some results concerning 
conditions for the coincidence of the circle of 
convergence of a power series and its tnatural 
boundary. Let the a, be positive numbers 
and b a natural number greater than 1. If the 
radius of convergence of 3? , a,z" is equal to 
1, then |z| — 1 is its natural boundary (Weier- 
strass, E. I. Fredholm). If the radius of con- 
vergence of LZ age zs (with (4,) an increas- 
ing sequence of natural numbers) is 1 and 
lim inf, (4,4, —An)/An > 0, then |z|=1 is the 
natural boundary (Hadamard’s gap theo- 
rem). The latter condition was weakened to 
lim inf, |, (4,4, — 4)// 4,» 0 by E. Borel 
(1896). E. Fabry (1896) showed that with the 
radius of convergence of 2:3. 9 a, z" being 1, if 
there exist a suitable sequence of natural num- 
bers m, «m; « ... and a number 0 (0 0 — 1) 
such that lim s;/m; — 0, where the s; are the 
number of nonzero a, contained in the inter- 
val (m;(1 — 0), m;(1 + 9)), then [z[ — 1 is the 
natural boundary of 2a,z". By applying 
this theorem to X a,z^ with radius of con- 
vergence 1, it can be shown that if lim, ,, 4, /n 
= 00, then |z|=1 is its natural boundary. 
These theorems are called gap theorems 
because they concern power series with 
gaps in their exponents. A generalization 
of Fabry's theorem was obtained by G. Pólya. 
It is known that Fabry's last condition 
above is in a sense the best possible (Pólya, 
1942). 

Regarding the natural boundary of a power 
series, we also have the following result: When 
the radius of convergence of X a,z" is 1, by a 
suitable choice of the sequence {e,} (&, +1), 
the series 3 6,a,z" has |z| — 1 as its natural 
boundary (A. Hurwitz, P. Fatou, Polya). 
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E. Overconvergence 


If the radius of convergence of f(z) 2 X a,z" is 
1, the sequence of partial sums S(1, z), S(2, z), 
..., Where S(n, z) 2 355.9 a,z", is naturally di- 
vergent for |z|>1, but a suitable subsequence 
S(n,, z) (k=1, 2, ...) may still be convergent. 
A. Ostrowski [7,8] called this phenomenon 
overconvergence and proved the following re- 
sult: By definition, f(z) 2 X.5a,z^' has a 
lacunary structure if the sequence (4,) has a 
subsequence {/,,} such that 4, ., > A, (1-- 0) 
(0 — 0). If this situation occurs, then in a suffi- 
ciently small neighborhood of a point on |z| — 
1 where f(z) is holomorphic, At, z) (k = 

1,2, ...) converges uniformly. This result in- 
cludes Hadamard's gap theorem as a special 
case. Conversely, any power series for which 
overconvergence takes place can be repre- 
sented as the sum of a power series having a 
lacunary structure and a power series whose 
radius of convergence is greater than 1. G. 
Bourion [9] gave a unified theory of these 
results using *superharmonic functions. 

R. Jentsch [10] showed that all singularities 
of a power series on its circle of convergence 
are accumulation points of the zeros of the 
partial sums. On the other hand, if the zeros 
of a subsequence S(n,, z) (Kk — 1,2, ...) has no 
accumulation point on |z| 2 1, then the power 
series has a lacunary structure and overcon- 
vergence takes place for S(n,, z) (k — 1,2, ...). 
If logn,.., — O(n,) and S(n,, z) (k=1,2,...) is 
overconvergent, then all boundary points of 
the domain of overconvergence are accumula- 
tion points of the zeros of S(n,, z) (Ostrowski 
[8]). 

A power series is completely determined by 
its coefficients, but little is known about the 
relations between the arithmetical properties 
of its coefficients and the function-theoretic 
properties of the function represented by the 
series. A known result is that if the power 
series $ c,z" with rational coefficients repre- 
sents a branch of an talgebraic function, then 
we can find an integer y such that the c," 

(n 2 1) are all integers (Eisenstein’s theorem, 
1852). 

For power series of several variables — 21 
Analytic Functions of Several Complex Vari- 
ables; for formal power series — 370 Rings of 
Power Series. For power series expansions — 
Appendix A, Table 10.IV. 
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Probabilistic Methods in 
Statistical Mechanics 


A. Introduction 


Probabilistic methods are often very useful in 
the rigorous treatment of the mathematical 
foundations of statistical mechanics and also 
in some other problems related to statistical 
mechanics. As examples of such methods, we 
explain here (1) Ising models, (2) Markov 
statistical mechanics, (3) percolation processes, 
(4) random Schródinger equations, and (5) the 
*Boltzmann equation. 


B. Ising Models 


The Ising model was proposed by E. Ising (1] 
to explain the phenomena of phase transitions 
of a ferromagnet, in which either a + or — 
spin is put on each site of a crystal lattice, and 
interaction between nearest neighboring sites 
is taken into consideration. 

Let V be a cube in the d-dimensional integer 
lattice space Z4 and X, — ( +1, —1]" be the 
totality of spin configurations in V. Each ele- 
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ment of X, is denoted by o = Lehr (o; — +1 
or —1). We suppose that a spin configuration 
c has a potential of the type 
U,(o)- — Y, coh). e, 

(,j cV ieV 
where <i, jy means that (i,j) is a nearest neigh- 
boring pair of sites and h stands for the para- 
meter of an external field. A fprobability mea- 
sure on X, is called a state on V. For each 
state u, the free energy is defined by 


1 
Fu: 3. mtoalvielt-z 3. ulo)logulo), 
gei p aeXy 
where B= 1/kT (k is the Boltzmann constant, 
T is the tabsolute temperature). Then there 
exists a unique state 


exp( — BUy(o)) 
2 ex, exp( = BUyQ)) J 


on V which minimizes the free energy Fy (vari- 
ational principle). g^ is called a Gibbs state on 
V of the Ising model with parameter (£, h). 
Physically, a Gibbs state is an equilibrium 
state, in which various physical quantities are 
calculated. 

Since for each Z?^-homogeneous (i.e., trans- 
lationally invariant with respect to Z^) state u 
the mean free energy 


gf (e)— 0€ Xy, 


fà lim Fol 

is well defined, a (limiting) Gibbs state can also 
be defined for the infinite domain V= Z? by 
the above mentioned variational principle. 
However, at present the probabilistic defi- 
nition of Gibbs states given by Dobrushin [2] 
and Lanford and Ruelle [3] is prevalent. 

It is known that if an external field is present 
(i.e., h #0) there is only one Gibbs state for any 
B. On the other hand when an external field is 
absent (h 2 0) and d 22, there are at least two 
Gibbs states, i.e., a phase transition occurs, for 
a sufficiently low temperature. 

Finally, we mention some known facts in 
this field. In the following we assume h=0. (1) 
In the 1-dimensional case, the phase transition 
never occurs. (2) For d 22, there exists a crit- 
ical value B (dl such that the phase transition 
does not occur for any £ < fi (d) but it occurs 
for every B > f, (d). The calculation of BO) has 
been carried out by Onsager [4]. (3) In the 2- 
dimensional case, every Gibbs state is Z?- 
homogeneous [5,6]. (4) For d 23, there is a 
Z/-inhomogeneous Gibbs state for sufficiently 
large f [7]. 


C. Markov Statistical Mechanics 


Stochastic Ising models, infinite interacting 
particle systems, and many models occurring 
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in physics, biology, and sociology are formu- 
lated as a class of inftnite-dimensional Markov 
processes. The field of investigation of station- 
ary states and statistical or ergodic properties 
of these processes is called Markov statistical 
mechanics, which has made rapid progress 
during the last decade. We explain this field by 
looking at a typical class of processes. 

Let Z’ be the d-dimensional integer lattice 
space. Putting + or — spin on each site on Z^, 
let us consider a random motion of spins which 
evolves while interacting with neighboring 
spins. Let X — (--1, —1}” be the totality of 
spin configurations and an element of X be 
denoted by y = {n(i)} ez: (n(i)}= +1 or —1). The 


. process is described in terms of a collection of 


nonnegative functions c;(n) defined for ie Z° 
and ge X. For the configuration y, at time t, 
y, (i) changes to —15,(i) in the time interval 

[t t - At] with probability c,(y,)At + o(At). 
This process on X is called a spin-flip model. 
For an initial distribution o we denote by o, 
the distribution at time t. If 4, — for all t>0, 
pis called a stationary state. 

Example 1. Stochastic Ising models. À sto- 
chastic Ising model was proposed by Glauber 
[8] to describe the random motion in a fer- 
romagnet upon contact with a heat bath. Then 
the flip rate (c,()] is defined by the potential 
of the Ising model. It is known that any Gibbs 
state of the Ising model is a reversible station- 
ary state of the stochastic Ising model, and the 
converse is also valid. Free energy plays an 
important role in the study of the ergodic 
properties of these models. In particular, the 
mean free energy is a nondecreasing functional 
along the distributions 4, 0 € t « oc. 

Example 2. Contact processes. A contact 
process was introduced by Harris [11] to 
investigate the spread of infection. The flip rate 
of the contact process is given by 


j- 1 if «(i--1, 
coll kd 


if (ij=—1 and 

*Ullj-il2-L n= +1} =k. 
Here +1 denotes an infected individual and 
—1 denotes a healthy one. Denote by —1 the 
configuration at which all sites are healthy and 
by 6_, the unit point mass at —1; then 6_, is 
a stationary state. The most important result 
is the following: There exists a critical value 4; 
(0 « 4, « oo) such that if 4< 4, 6_, is a unique 
stationary state, and if A>, there is another 
stationary state u satisfying 


u[ne X; q(i) 2 +1 for infinitely many i]=1. 
D. Percolation Processes 


A percolation process is a mathematical model 
which describes the random spread of a fluid 
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through a medium. It can be used to describe 
phenomena such as the penetration through a 
porous solid by a liquid or the spread of an 
infectious disease [14]. Usually, the process is 
identified as a site percolation process or a 
bond percolation process. Here we describe the 
latter only. 

Let L= (S, B} be a countable connected 
*graph with a set of sites (tvertices) S and a set 
of bonds (tedges) B. Each bond b is open with 
probability p and closed with probability 1 — p 
independently of all other bonds. Suppose that 
a fixed point o is the source of a fluid which 
flows from o along open bonds only. 

Let Op) be the probability that the fluid 
spreads infinitely far, and define the critical 
percolation probability p, — inf( p | 0(p) 0]. 
Then it is known that (1) p, = 1/2 for the 
square lattice [16], (2) p, =2sin(z/18) for the 
triangular lattice, and (3) p, = 1 —2sin(z/18) 
for the honeycomb lattice. 


E. Random Schródinger Equations 


Random Schrödinger equations are !'Schródin- 
ger equations in R" with random potentials 
U(x, 0); therefore the corresponding operators 
are of the form 


A(o)2 —A-U(:,o), 


where w denotes a random parameter in a 
probability space (Q, %, P) and A denotes the 
*Laplacian in R”. It is assumed that this sys- 
tem of potentials forms a spatially homoge- 
neous random field with the tergodic property. 
This system of equations is considered to be a 
model describing the motion of quantum- 
mechanical particles in a random medium. 
Mathematically, the problem is to investigate 
various spectral properties of the self-adjoint 
operators A(c). Since A(o) and its shifted 
operator A(@(- + x)) are tunitarily equiva- 
lent, the above assumption on the potentials 
U(-,c) implies that the spectral structures are 
independent of each sample o a.s. if their 
structures of A(c) are measurable with respect 
to (Q, à). 

Several rigorous results have been obtained. 
In the (-dimensional case, if the potentials 
U(-,@) are functionals of a strongly ergodic 
*Markov process, then it is proved that A(w) 
has only a pure point spectrum [17], and each 
eigenfunction decays exponentially fast [18]. 
In multidimensional cases, asymptotic behav- 
ior at the left edge of the mean of the resolu- 
tions of the identity for a certain A(w) have 
been investigated [19]. It is assumed that the 
random potential for this A(c) takes the form 


U(x, ol -| p(x — y)n(dy, o»), 
R” 
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where (z(dy, œ)} is a *Poisson random mea- 
sure with mean measure dy and íx) is a non- 
negative measurable function satisfying @(x)= 
o(|x| ^7?) as |x| 5 oc. Let E,(x, y, ©) be the 
continuous kernel for the resolution of the 
identity for A(c), and denote by N(A) the mean 
of E,(0,0, œw). Then N(A) is a nondecreasing 
function vanishing on (—00,0) and has the 
following asymptotic form at 4 — C: 


A"? log N(A)>—yh? as 4-90, 


where y; is the first eigenvalue of —A witha 
Dirichlet boundary condition on the ball in R" 
with unit volume. The quantity N (4) can be 
identified with a limiting state density of A(co), 
namely, the limit function of 


N,(4, e) = # (jl Aj (0) AN 


as V tends to R” regularly, where ; Aj (c) is 
the set of eigenvalues of A(w) in a smooth 
bounded domain V in R" with a Dirichlet 
boundary condition and |V| is the volume of 
V. To obtain the above asymptotic behavior of 
N(A), the theory of "large deviation for Mar- 
kov processes plays a crucial role [20]. 


F. Boltzmann Equation 


In the kinetic theory of gases the tBoltzmann 
equation is derived from the fLiouville equa- 
tion by considering the BBGKY hierarchy of 
particle distribution functions for N particles 
and then by taking the limit N —co under 
certain conditions (— 402 Statistical Me- 
chanics). Mathematically rigorous discussions 
were given by O. E. Lanford [21] for a gas of 
hard spheres; he showed that solutions of the 
BBGKY hierarchy converge to those of the 
Boltzmann hierarchy for small time under the 
Boltzmann-Grad limit (N> 0, Nd?>1,d= 
the diameter of the hard spheres). 

An approach to the Boltzmann equation in 
the spatially homogeneous case can also be 
based on a tmaster equation. M. Kac [22] 
considered a Poisson-like process describing 
the random time evolution of the n-tuple of 
the velocities of n particles. For a gas of hard 
spheres this is determined by the master 
equation 


HU, Xis e a X 


Ot 


» 
Ni <i<j<n 


| fult, X ase Xis oes E Xn) 
S2 


— u(t, Ais St I(x; — xj, I) di, 


ERT, (1) 


[xU LX 


where S? is the 2-dimensional unit sphere, dl is 
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the uniform distribution on S? and 


Asset all,  x;2x;—-(x;— xl. 


Let o be a positive constant and S( /nc) 
denote the (3n — 1)-dimensional sphere with 
center 0 and radius Vno. Given a symmetric 
probability density u, on S( /ne) for each 
n>1, a sequence {u,} is said to have Boltz- 
mann’s property or to be chaotic (or u-chaotic 
to stress u), if there exists a probability density 
u on R? such that 
lim P1(%1) «++ P(X) un, +++» Xa) dx, 


n- o0 


| 
i= P,(x)u(x)dx 


for each mz 1 and q, € Co(R°), 1 xcu: Kac’s 
assertion is that the Boltzmann equation is to 
be derived from the master equation via the 
propagation of chaos; more precisely, if (u,) is 
a u-chaotic sequence, then {u,(t)} is also u(t)- 
chaotic, where u,(t) is the solution of the mas- 
ter equation (1) with u,(0)=u, and u(t) is the 
solution of the following Boltzmann equation 
with u(0) — u: 


ô 
Pu x)= | {u(t, x’Ju(t, y) — ult, x)u(t, y)} 
t S? x R? 


x |(x—y, Dldldy, 


xX=x+(y—x, Dl, y=y—(y—x, IL 


The propagation of chaos was verified by 
Kac [22], H. P. McKean [24], and others 
for a considerably wide class of nonlinear 
equations of Boltzmann type (with cutoff). The 
propagation of chaos is the stage correspond- 
ing to the tlaw of large numbers. The next 
stage is the tcentral limit theorem or fluctu- 
ation theory, which was also discussed by Kac 
[23] and McKean [25]. Moreover, based on 
Kac’s work [22], McKean [26] introduced a 
class of tMarkov processes associated with 
certain nonlinear evolution equations includ- 
ing the Boltzmann equation; a process of this 
type describes the time evolution of the veloc- 
ity of a particle interacting with other similar 
particles. In the case of the spatially homo- 
geneous Boltzmann equation of Maxwellian 
molecules without cutoff, such a Markov 
process was constructed by solving a certain 
tstochastic differential equation [27]. This 
implies the existence of probability measure— 
valued solutions of the equation. 
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A. General Remarks 


A probability measure ® on a *measurable 
space (S, €) is defined to be a ‘measure on 

(S, S) with (S) — 1 (— 270 Measure Theory). 
In probability theory, probability measure 
appears usually as the *probabihity distribution 
of a trandom variable (— 342 Probability 
Theory). Unless stated otherwise, we regard a 
*topological space T as a measurable space 
endowed with the topological c-algebra B(T) 
on T, i.e., the to-algebra generated by the 
*open subsets of T. Hence the distribution of 
an R"-valued random variable is a probability 
measure on (R", B” = 8(R")). From this proba- 
bilistic background we often call a probability 
measure on (R", 8") an n-dimensional (proba- 
bility) distribution. For probability measures 
on topological spaces — 270 Measure Theory. 
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B. Quantities Characterizing Probability 
Distributions 


Several different quantities characterize the 
properties of probability distributions in one 
dimension: the mean (or mathematical expecta- 
tion) m = |2 xd®(x), the variance o? = |2, |x — 
m|? d®(x), the standard deviation c, the kth 
moment a, = (7. x" d(x), the kth absolute 
moment fj, = [ER |x|* dé(x), the kth moment 
about the mean y, = [7 „(x —m)‘ d@(x), etc. 

A one-to-one correspondence exists be- 
tween a 1-dimensional distribution ® and its 
(cumulative) distribution function F defined by 
F(x)2 $((—oo, x]). A distribution function is 
characterized by the following properties: (1) It 
is monotone nondecreasing; (2) it is right con- 
tinuous; (3) lim, , .., F(x) 20 and lim, ,, ,, F(x) 
= |. Similar statements hold for the multi- 
dimensional case. 

Let X (w) be a real random variable on a 
*probability space (Q, %, P). Then the distri- 
bution of X is given by ®(E)= Pol X(o)e 
E), Ee 8, and the characteristic quantities 
of ® defined above are given in terms of X (w) 
as follows: m= E(X), o? = E(X —my, F(x)= 
P((o| X(o)x xj), etc. The moments and the 
moments about the mean are connected by 


r 
the relation u, — SM a, ,(— m) (r= 1,2, 


...). When ® is an n-dimensional distribution, 
the following quantities are frequently used: 
the mean vector, which is an n-dimensional 
vector whose ith component is given by m; = 
f x,;d@(x); the covariance matrix, which is 

an n xn matrix whose (i, j)-element is o; — 
f(x, — mj)(x; — mj) d9(x); the moment matrix, 
which is an n x n matrix whose (i, j)-element is 
mj; = | x,xjd (x). (The covariance matrix is 
also called the variance matrix or the variance- 
covariance matrix.) The covariance matrix 
and the moment matrix are "positive definite 
and symmetric. The quantities listed above 
are defined only under some integrability 
conditions. 


C. Characteristic Functions 


Consider a probability measure d defined on a 
measurable space (R", B”), where 8" is the o- 
algebra of all *Borel sets in R". The character- 
istic function of ® is the *Fourier transform o 
defined by 


ea | = d@(x),  zeR', (1) 
R” 


where (z, x) denotes the tscalar product of z 
and x (z, xeR”). Let X be an n-dirnensional 
random variable with probability distribution 
® defined on a *probability space (Q, B, P). 
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Then the Fourier transform of ® is also called 
the characteristic function of X, which can also 
be written as E(e'*: 9?) (— 342 Probability 
Theory). 

The following properties play a fundamental 
role in the study of the relationship between 
probability distributions and characteristic 
functions: (1) the correspondence defined by (1) 
between the n-dimensional probability distri- 
bution Ó and its characteristic function q is 
one-to-one. (ii) For any a,, b,eR, a, « b, (p= 
1,2, ...,n), we have 


1\" (¢ C n pg tbpZy __ p—idyZy 
emt n (RE 
eo 2n] Je -e p= —iz, 
z,)dz,,...,dz,, (2) 


where f(t; a, b) denotes the modified indicator 
function of [a, b] defined by 


x 9(z,, ..., 


1, (a,b) 
ECH t —a or b, 
0, té[a, b], 


and x 2(x,, ..., x,)e R”. If an n-dimensional 
interval I 2 [a,, ...,a,; b,, ..., b,] defined by 
à; & X; € b; (i— 1,2, ...,n) is an interval of con- 


tinuity for the probability distribution 6, i.e., 
Q (0I) 20, where OI denotes the boundary of I, 
then the left-hand side of (2) is equal to ®(J). 
Equation (2) is called the inversion formula for 
the characteristic function o. 

The characteristic function o of an n- 
dimensional probability distribution has the 
following properties: (i) For any points z”, 
..., ZP of the n-dimensional space R” and any 
complex numbers a,,...,a,, we have 


o(z? — z9)a;a, >0. 


Ok 


(ii) p(z) converges to q(0) as z™ —0. (iii) (0) 
— 1. A complex-valued function o of ze R" is 
called tpositive definite if it satisfies the in- 
equality in (1). Any continuous positive defi- 
nite function g on R” such that ol — 1 is the 
characteristic function of an n-dimensional 
probability distribution (!*Bochner's theorem) 
(— 192 Harmonic Analysis). A counterpart to 
Bochner’s theorem holds for any positive 
definite sequence as well (Herglotz's theorem). 

The characteristic function is often useful for 
giving probability distributions explicitly. (For 
characteristic functions of typical probability 
distributions — Appendix A, Table 22. For 
general information about criteria that can be 
used to decide whether a given function is a 
characteristic function — [8].) 

ne moment generating function defined by 

)= fexp(—(z, x))d@(x) (ze R”) does not 
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necessarily exist for all n-dimensional distri- 
butions but does exist for a number of useful 
probability distributions ®, and then f(z) 
uniquely determines di. 

Given a 1-dimensional distribution ® with 
Pk < +œ, we denote by y, the coefficient of 
(iz)‘/k! in the *Maclaurin expansion of log Q(z). 
We call y, the (Ath order) semi-invariant of ®. 
The moments and semi-invariants are con- 
nected by the relations y, =&1, Y2 =&2 — %? = 
0°, Y3 = X3 — 3a ët +203, y4 =a, — 302 — 4a %3 
+ 12070, —6a1, ... 


D. Specific Distributions 


Given an n-dimensional distribution ®, a 
point a with ®({a})>0 is called a disconti- 
nuity point of ®. The set D of all discontinuity 
points of ® is at most countable. When ®(D) 
=1, ® is called a purely discontinuous distribu- 
tion. In particular, if D is a lattice, ® is called a 
lattice distribution. If the distribution function 
of ® is a continuous function, ® is called a 
continuous distribution. By virtue of the 
tLebesgue decomposition theorem, every 
probability distribution can be expressed in 
the form 


=a, È; +a,®, KT EL ER 


d4,,05,0420, a,+a,+a,=1, 

where 6, is purely discontinuous, 6, is tab- 
solutely continuous with respect to tLebesgue 
measure, and ®, is continuous and 'singular. 
Let be an absolutely continuous distri- 
bution. Then there exists a unique (up to Le- 
besgue measure zero) measurable nonnega- 
tive function f(x) (x eR") such that (E) = 

Le f(x) dx. This function f(x) is called the 
probability density of ©. 

We now list some frequently used 1- 
dimensional lattice distributions (for explicit 
data — Appendix A, Table 22): the unit distri- 
bution with ©({0}) = 1; the binomial distribu- 
tion Bin(n, p) with parameters n and p; the 
Poisson distribution D) with parameter 4; 
the geometric distribution G(p) with parameter 
p; the hypergeometric distribution H(N, n, p) 
with parameters N, n, and p; and the negative 
binomial distribution NB(m, q) with parameters 
m and q. The following k-dimensional lattice 
distributions are used frequently: the multi- 
nomial distribution M (n, p) with parameters n 
and p; the multiple hypergeometric distribution; 
the negative multinomial distribution; etc. 

The following 1-dimensional distributions 
are absolutely continuous: the normal distri- 
bution (or Gaussian distribution) N (1, o?) with 
mean y and variance g? (sometimes N (0, 1) is 
called the standard normal distribution); the 
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Cauchy distribution C(u, o) with parameters 
u (median) and c; the uniform distribution 

U (o, fl) on an interval Ta f]; the exponen- 

tial distribution e(c) with parameter o; the 
gamma distribution T (p, c); the ty? distribu- 
tion y°(n); the beta distribution B(p, q); the F- 
distribution F(m, n); the Z-distribution Z(m, n); 
the t-distribution t(n); etc. Furthermore, there 
are several k-dimensional absolutely continu- 
ous distributions, such as the k-dimensional 
normal distribution N (u, 2) with mean vector 
Bot. H2, ---, Hy) and covariance matrix L = 
(c;j the Dirichlet distribution, ctc. 


E. Convolution 


Given any two n-dimensional distributions ®,, 
d. the n-dimensional distribution (E) = 
frer %e(x  y)dÓ, (x) d 5(y) is called the com- 
position (or convolution) of ®, and ®, and is 
denoted by ®, a 5,, where y, is the indicator 
function of the set E. Let X, and X, be tinde- 
pendent random variables with distributions 
o, and ®,. Then the distribution of X, + X, is 
P, ad, When F; is the distribution function of 
®; (i= 1,2), the distribution function F, * F, of 
P, * D, is expressed in the form F, * F,(x)= 
Jae Fi(x — y)dF,(y). If b, has a density f, (x), 
then ®, +, has a density f(x) le, fi(x— 
y)dF,(y). If o(z) is the characteristic function 
of the convolution of two probability distri- 
butions ®, and ®, with characteristic func- 
tions q, and @,, then @ ts the product of o, 
and $,:9(z) 2 q,(z): o, (z). Therefore, for every 
k, the kth order semi-invariant of the convo- 
lution of two distributions is equal to the sum 
of their kth order semi-invariants. Suppose 
that we are given a family of distributions 
P= { P(x, f, ...)) indexed with parameters 
a, f, .... If for (x1, f, ...) and (a2, f;, ...) there 
exists (x5, f4, ...) such that D(a,, f,,...)* 
D(x, B2, ...) 2 (x5, B4, ...), then we say that ® 
has a reproducing property. Some of the distri- 
butions listed above have the reproducing 
property: P(4,)* P(45) P(A, 4- 45). Bin(n,, p) 
Bin(n,, p) - Bin(n, +n, p), NB(m,,q)* 
NB(mz,q)= NB(m, + mz, q), Na, 97) * 
N (12.03) =N (ty + la, 07 +03), T (pi, 0)» 
V(p2,6)=T(p, + Pz, 9), C(ji,01)* C(u5, 05) = 
C(u; + Ho, 0, +02), etc. 

Given a 1-dimensional distribution function 
F(x). 


OAD= max (F(x--D)—-F(x—-1), 1»0, 


is called the maximal concentration function of 
F (P. Lévy [6]). Since it satisfies the relation 
Or «r (D € Qr, (D) (i= 1,2), we can use it to 
study the properties of sums of independent 
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random variables. The mean concentration 
function defined by l 


cut-x | (F(x +1)— F(x —1))* dx 
is also useful for similar purposes. 

Let Nim, v) be the 1-dimensional normal 
distribution with mean m and variance v, and 
let PL. a) be the distribution obtained through 
translation by a of the Poisson distribution 
with parameter J. If 1-dimensional distri- 
butions ®,, P, (k — 1,2) exist such that N (m, v) 
=, *0,, P(4, a) V, * VT,, we have 0, = 
N (m,, v,), V, — P(/,,a,) (k= 1, 2) for some 
My, Uys Ay, dy (k = 1,2). These are known, respec- 
tively, as Cramér's theorem and Raikov's 
theorem. Yu. V. Linnik proved a similar fact 
(the decomposition theorem) for a more gen- 
eral family with reproducing property by using 
the theory of analytic functions [€]. 


F. Convergence of Probability Distributions 


The concept of convergence of distributions 
plays an important role in limit theorems and 
other fields of probability theory. When Q is a 
topological space, we consider convergence of 
probability measures on Q with respect to the 
tweak topology introduced in the space of 
measures on Q (— 37 Banach Spaces). Such 
convergence is called weak convergence in 
probability theory. For a sequence of n- 
dimensional distributions ®, (k — 1,2,...) to 
converge to ® weakly, each of the following 
conditions is necessary and sufficient. (1) For 
every continuous function with compact sup- 
port, lim, , fg» f(x) db, (x) = fr f(x) d P(x). (2) 
At every continuity point of the distribution 
function F(x,, ..., x,) of 6, lim, ,, F,(x,, .... Xn) 
= F(x,,...,X,) (F, is the distribution func- 
tion of ®,). (3) For every continuity set E of 

® (namely, a set such that (E — 1:9?) — 0), 

lim, ,,, P(E) = (E). (4) For all open Gc R", 
lim inf, ,(G) > (G). (5) For all closed Fc 
R”, limsup, „x ®,(F)< D(F). (6) lin,- e(®,, P) 
=0, where p is a metric defined in the follow- 
ing way: Given any n-dimensional distribu- 
tions ®,, ®,, we put ey =inf{e] P (F) « (F^ + 
e for every closed F} (F* is the -neighborhood 
of F) and define p(Ó,, ,) mate, £21). The 
metric p, called the Lévy distance. was intro- 
duced by Lévy [6] in one dimension and by 
Yu. V. Prokhorov in metric spaces [10]. Each 
of these conditions except (2) is still necessary 
and sufficient for ©, to converge weakly to ® 
when 6, and 6 are probability measures on a 
tcomplete separable metric space. It should 
also be noted that the probability measures 
on a complete separable metric space consti- 
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tute a complete separable metric space with 
respect to the Lévy distance. 

A family ®,(ae A) of probability measures 
on a complete separable metric space is said to 
be tight if for every ¢>0 there exists a tcom- 
pact set K = K (c) such that ,(K*) « e for all 
oeh A family ®, (xe A) is tight if and only if it 
is *totally bounded with respect to the topol- 
ogy induced by the Lévy distance. Hence a 
tight family ®, (x€ A) has a weakly convergent 
subsequence. 

We can give a criterion for the convergence 
of probability measures in terms of their char- 
acteristic functions. Suppose that ®, and ® are 
n-dimensional probability measures with char- 
acteristic functions o, and @. Then 6, con- 
verges weakly to ® if and only if for every z, 
lim, ole (z). Let o, be the characteristic 
function of an n-dimensional probability dis- 
tribution ®,. If the sequence {p,} converges 
pointwise to a limit function o and the conver- 
gence of o, is uniform in some neighborhood 
of the origin, then ¢ is also the characteristic 
function of an n-dimensional probability dis- 
tribution ® and the sequence {®,} converges 
weakly to ® [7] (Lévy's continuity theorem). 

For any probability distribution concen- 
trated on [0, oo), the use of tLaplace trans- 
forms as a substitute for Fourier transforms 
provides a powerful tool. The method of 
probability generating functions is available 
for the study of arbitrary probability distri- 
bution concentrated on the nonnegative inte- 
gers [14]. The method of moment-generating 
functions is also useful. There are many results 
on the relation between these functions, proba- 
bility distributions, and their convergence [14- 
16]. 

Let ®,, ®,,... and ® be 1-dimensional 
distributions. If all absolute moments exist, 

21 By — oo for fj IS Le dt) oo, 
and Dm, 2, x/d®,(x) =J&,, xid®(x) (j= 
0, 1,2,...), then ®, weakly converges to È. 
This condition is sufficient but not necessary. 


G. Infinitely Divisible Distributions 


An n-dimensional probability distribution ® 
is called infinitely divisible if for every posi- 
tive interger k, there exists a probability dis- 
tribution ®, such that © =, « 5, « ... « dii, 

(2 9^). Both normal distributions and Pois- 
son distributions are infinitely divisible. If an 
n-dimensional distribution ® satisfies the con- 
dition fais (dx) « &, we say that Dev(c). Then 
$ is an infinitely divisible distribution if and 
only if for every e> 0 we can find 6,, ,, ..., 
OQ, ev(s) such that D=@, «5, « ... «0. 
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Let X,;, i=1, 2, ..., n(k), be independent 
random variables for every k, and assume that 
the distribution of X,; belongs to v(¢,), i= 
1,2, ..., n(k), where e, 0 as k— oo. If the 
probability distributions of the sums X, — 
379 X; converge to a probability distribu- 
tion as k— oo, then the limit distribution is 
infinitely divisible. 

The characteristic function of a 1-dimen- 
sional infinitely divisible distribution can be 
written in the form 


q(z) 2 exp (i -57 


o0 2 
x | A(u, 2) 7 460) (3) 





where y is a constant, v is a nonnegative con- 
stant, G(u) is a nondecreasing bounded func- 
tion with G(—00)—0, A(u, z)=exp(iuz)—1— 
izu/(1 -- u?), and the value of A(u, z)(1 + u?)/u? 
at u=0 is defined to be —z?/2. Formula (3) is 
called Khinchin's canonical form. For the char- 
acteristic function of an infinitely divisible n- 
dimensional distribution, the canonical form is 
as follows: 


n 


Q(z) - exp (im z2- 5 Á CpaZpŽ4 
Por 


T A i(z, x) 
Jo j prj) 
SZ)eR', (4) 


z=(2;,.. 





where meR", (c,,) is a positive semidefinite 
matrix, and n(dx) is a measure on R" such that 
n((0]) 20 and 





2 

| Ix n(dx) « oo. 
wel-clx? 

Formula (4) is called Lévy’s canonical form. If 

a 1-dimensional infinitely divisible distribution 

® satisfies Jp: x? d(x) « oo, then its character- 

istic function is given by 


q(z) - exp (m: SR 


"OO 


+f" (e (rien) (3) 


where m is a real constant, v is a nonnega- 
tive constant, and K(u) is a nondecreasing 
bounded function such that K(—0oo)=0. It 
is called Kolmogorov's canonical form. (For 
infinitely divisible distributions on a homo- 
geneous space — 5 Additive Processes.) 

Let ® and ¥ be n-dimensional distributions. 
If for some A>0, Y(E) - (AE) (AE — (4£|6e 
E}) for every set E, we say that and VV are 
equivalent. Let and VV be probability distri- 
butions with distribution functions F and G 
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and characteristic functions o and y. Then 
the following three statements are equivalent: 
(1) 9 and V are equivalent; (2) G(x) = F(Ax) for 
every x; and (3) v(Az)— ọ(z) for every z. We 
call a stable distribution if for every pair of 
distributions ®,, ®, equivalent to ®, the con- 
volution 6, +, is equivalent to ®. If ® is 
stable, every distribution equivalent to ® is 
also stable. We can characterize stable distri- 
butions in terms of their characteristic func- 
tions Q(z) as follows: For every pair An, 45 7» O, 
there exists a A=A(A,,2,)>0 such that (Az) — 
q(A1z)o(45z). We can restate this characteri- 
zation as follows: ® is stable if and only if 

for every pair of independent random vari- 
ables X, and X, with identical distribution 
and for any positive numbers 4, and 4,, there 
exists a positive number 4 such that (A, X, + 
45 X5)/4 has the distribution ®. By the defi- 
nition we see that all stable distributions are 
infinitely divisible. 

In the 1-dimensional case, putting o(z) — 
expw(z), we have y(Az) 2 (A4 z) + V(4.z), 
which implies v(z) 2 ( — co + i(z/|z|)ci)|zl*, 
where c; 20, —oo «c, «o0, 0«a «x 2. The 
parameter « is called the exponent (or index) 
of the stable distribution. The stable distribu- 
tions with exponent x —2 are the normal dis- 
tributions, and the stable distributions with 
exponent «=1 are the Cauchy distributions. 
We have y(z) 2 —c,|z|* for a symmetric stable 
distribution. (For the stable distribution with 
exponent 1/2 — Appendix A, Table 22). 

Generalizing stable distributions, we can 
define quasistable distributions, which B. V. 
Gnedenko and A. N. Kolmogorov [17] called 
stable distributions also. Let F be the distri- 
bution function of a 1-dimensional distri- 
bution ®. © is said to be quasistable if to 
every b, >0, b, >0 and real 4,, 4; there corre- 
spond a positive number b and a real number 
A such that we have the relation F((x —4,)/b,)* 
F((x— A3)/b3) = F((x — å)/b). 

Let ( Xj] be a sequence of independent ran- 
dom variables with identical distribution. If for 
suitably chosen constants A, and B, the distri- 
butions of the sums B, ! (£7, Xj) — A, converge 
to a distribution, the limit distribution is a 
quasistable distribution (Lévy). A necessary 
and sufficient condition for a distribution to 
be quasistable is that its characteristic func- 
tion q(z) satisfy the relation o(b,z)o(b,z) — 
Q(bz)e?? (y  4— 4, — A3). The characteristic 
function of a quasistable distribution has the 
canonical representation 


Q(z) - exp v (z), 
y (z) — imz — c|z*(1 + iB(z/|z|)eo(z, ai), 


where m is a real number, c 20,0 «2x2, 
IBI & 1, and w(z, x)= tan(zxa/2) (xÆ 1), w(z, x)= 
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(2/n)log |z| (x 2 1). The parameter x is called 
the exponent of the quasistable distribution. A 
quasistable distribution with «41 is obtained 
from a stable distribution by translation, but 
quasistable distributions with «= 1 are not. 

Semistable distributions are ancther gen- 
eralization of stable distributions. A distri- 
bution is called semistable if its characteristic 
function q(z) satisfies the relation y(qz)= 
q* wy (z) for a positive number q ( 1), where 
¢(z)=exp((z)). Also in this case, the general 
form was obtained by Lévy [6]. 

A I-dimensional probability distribution ® 
is called an L-distribution if the distribution 
function F of is the convolutior. of F(x/a) 
and some other distribution function F,(x) for 
every 0a « 1. ® is an L-distribution if and 
only if there exists a sequence of independent 
random variables {X,} such that for suitably 
chosen constants B, 0 and A, the distri- 
butions of the sums B, ! (X; ., X,) — A, con- 
verge to ® and sup, <,<, P(| X,/ B,| > £) 50 as 
n oo for every e» 0. Quasistable distribu- 
tions are L-distributions. 


H. The Shape of Distributions 


Let F(x) be a 1-dimensional distribution func- 
tion. The quantity C, such that F(¢,—0)<p< 
F(¢,) (0 p « 1) is called the quantile of order 
p of F. In particular, the quantity C, is called 
a median. If F satisfies the relation 1 — F(m- x) 
= F(m— x), it is called symmetric. In any 1- 
dimensional symmetric distribution, every 
moment of odd order about the mean (if it 
exists) is equal to zero. 

The ratio y, = 44/6? is used as a measure of 
departure from symmetry of a distribution and 
is called the coefficient of skewness. Further- 
more, the ratio y, = ;/0* —3 is called the 
coefficient of excess. For the norraal distri- 
bution, we have y; =y =0. If y #9, y3 ex- 
presses the degree of deviation from the nor- 
mal distribution. 

A distribution function F(x) is called uni- 
modal if there exists one value x -- a such that 
F(x) is convex for x «a and concave for x a. 
All L-distributions (and hence quasistable 
distributions) are unimodal [18]. 


I. Kolmogorov's Extension Theorem 


Let Q — R7, where T is an arbitrary index set. 
We associate with RI the c-algebra $8" gen- 
erated by the cylinder sets, i.e., (o €O|z, (ce 
E,,...,%,(@)€E,}, where ziel denotes the rth 
coordinate of œ, E,¢B(R'), 1 k&n, t, «t4 « 
..<t,, and n=1,2,.... Given a probability 
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measure ® on (R’, $87), we can define a finite- 
dimensional tmarginal distribution ®, for any 
finite subset S of T by &4(E) 2 (x; !(E)), Ee 
BS, where z is the natural tprojection 7: 
RI — R?. The measures {®,} satisfy the follow- 
ing consistency condition: If S, c S, (c T) are 
finite and if Ee®*:, then 


Ds (E) 7 6 (5,5, (E)), (6) 


where zs, s, : R*: ^ R?: is the natural projection. 

Conversely, if we are given a family of finite- 
dimensional probability measures {®,} which 
satisfies the consistency condition (6), then 
Kolmogorov's extension theorem [1] asserts 
that there exists a unique probability measure 
® on (R7, 87) such that GEI 2 (z, ! (E)), 
Ee? for any finite Sc T. 

This theorem is useful in constructing 'sto- 
chastic processes. For example, let d, ,, 
be the tproduct measure of k copies of a given 
probability measure on R!. Then the family 
{Onnan Ni Hä AgE Z, ke N} satisfies the 
consistency condition and hence, by Kolmo- 
gorov’s extension theorem, determines a prob- 
ability measure on RŽ, which is denoted by 4. 
Thus X,(w)=1,(@), neZ, (c e (R7, $87, %)), 
are independent identically ®-distributed ran- 
dom variables. 

Kolmogorov's extension theorem is gen- 
eralized to the case where the component 
spaces are tstandard measurable spaces (— 
270 Measure Theory) instead of R!, and also 
to the case where product spaces are replaced 
by tprojective systems [19]. 


J. Characteristic Functionals on Infinite- 
Dimensional Spaces 


Contrary to the finite-dimensional case, 
Bochner's theorem does not necessarily hold 
in infinite-dimensional spaces. For example, let 
(T, ||-||) be an infinite-dimensional tHilbert 
space and (t)=exp(—||t||”). @ is continuous 
and positive definite, and (0) — 1. But it is 
known that there is no probability measure on 
T = T* (topological dual of T) which corre- 
sponds to o. Bochner's theorem is generalized 
to infinite-dimensional spaces as follows. 

Let T be a real tvector space endowed with 
the topology t defined by a system of tHilbert- 
ian seminorms (||: ||, ae A}. Define a new 
topology I(t) of T by all Hilbertian semi- 
norms ||: || each of which is HS-dominated by 
some || lla xe A, Le, sup((2.;lleill?) 2; (e) :a- 
orthonormal} < oo. If I(1) 7 x, then (T, t) is 
called a *nuclear space. 

Let T;* be the topological dual of (T, T) (i.e., 
the set of all t-continuous real valued linear 
functionals on T). Define a Borel structure 
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DUT of T* as the c-algebra generated by 
the system of half-spaces (xe T,*; x(t)<a}, te 
T, aeR!. For a probability measure on 
(T5, B(T*)), define 


on=| exp(i¢x,t>) d(x), teT, 
mm ` 


which is called the characteristic functional of 
®. A functional o on T is the characteristic 
functional of a probability measure ® on 

(T,;*, B(T,*)) such that ët L T,*)=1 for some 
sequence (2,) c A, where ®* is the *outer 
measure (— 270 Measure Theory) and T;* is 
the topological dual of (T, ||-||,), if and only if 
is positive definite, (0) — 1, and continuous 
with respect to the topology I(t) [23]. 

As special cases of the foregoing theorem, 
we have the following. If (T, x) is a nuclear 
space, then every positive definite t-continuous 
functional g with q(0) — 1 is the characteristic 
functional of a probability measure on T;* 
(Minlos [24]). Schwartz’s spaces /(R") and 
Z(R") are nuclear. Let (T; t= ||: ||) be a Hilbert 
space. A tHilbert-Schmidt operator U is, by 
definition, a tbounded linear operator on T 
such that E; |} UVe;||?< oo, by any tcomplete 
orthonormal system {e;} (this quantity does 
not depend on the choice of {e;}). Define a 
seminorm ||: liy by |[tlly = UO, te T; fora 
Hilbert-Schmidt operator U. Then the topol- 
ogy I(t) coincides with the topology induced 
by the system of seminorms ||: ly, where U are 
Hilbert-Schmidt operators, which is called 
the Sazonov topology. Thus every functional 
o on T, which is positive definite, 9(0) — 1, 
and continuous with respect to the Sazonov 
topology, is the characteristic functional of a 
probability measure on T;* (Sazonov [25]). 

The probability measure on 5^ with the 
characteristic functional exp(— f 2. | f(s)? ds), 
f€4,is the probability measure of a Gaussian 
twhite noise on S”. 
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A. History 


The origin of the theory of probability goes 
back to the mathematical problems connected 
with dice throwing that were discussed in 
letters exchanged by B. Pascal and P. de Fer- 
mat in the 17th century. These problems were 
concerned primarily with concepts such as 
*permutations, tcombinations, and *binomial 
coefficients, whose theory was established at 
about the same time [1]. This elementary 
theory of probability was later enriched by the 
work of scholars such as Jakob Bernoulli [2], 
A. de Moivre [3], T. Bayes, L. de Buffon, 
Daniel Bernoulli, A. M. Legendre, and J. L. 
Lagrange. Finally, P. S. Laplace completed the 
classical theory of probability in his book 
Théorie analytique des probabilités (1812). In 
this work, Laplace not only systematized but 
also greatly extended previous important 
results by introducing new methods, such as 
the use of *difference equations and 'generat- 
ing functions. Since the 19th century, the 
theory of probability has been extensively 
applied to the natural sciences and even to the 
social sciences. 

The definition of a priori probability due to 
Laplace provoked a great deal of argument 
when it was applied. For example, R. von 
Mises advocated an empirical theory of proba- 
bility based on the notion of Kollektiv (col- 
lective), which is a mathematical model of 
mass phenomena [5]. However, these argu- 
ments are concerned with philosophical rather 
than mathematical aspects. Nowadays, the 
main concern of mathematicians lies not in the 
intuitive or practical meaning of probability 
but in the logical setup governing probability. 
From this viewpoint the mathematical model 
of a random phenomenon is given by a proba- 
bility measure space (Q, B, P), where Q is the 
set of all possible outcomes of the phenom- 
enon, P(E) represents the probability that an 
outcome belonging to E be realized, and $ is 
a c-algebra consisting of all sets E for which 
P(E) is defined. All probabilistic concepts, 
such as random variables, independence, etc., 
are defined on (Q, $5, P) in terms of measure 
theory. Such a measure-theoretic basis of 
probability theory is due to A. Kolmogorov 
[6]. though similar considerations had been 
made before him for special problems, for 
example, in the work of E. Borel concerning 
the strong law of large numbers [7] and in the 
rigorous definition of Brownian motion by 
N. Wiener [8]. 

Ever since probability theory was given 
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solid foundations by Kolmogorov, it has made 
tremendous progress. The most important 


concept in today's probability theory is that of ` 


'stochastic processes, which correspond to 
functions in analysis. In applications a sto- 
chastic process is used as the mathematical 
model of a random phenomenon varying with 
time. The following types of stochastic pro- 
cesses have been investigated extensively: 
fadditive processes, 'Markov processes and 
tMarkov chains, tmartingales, tstationary 
processes, and tGaussian processes. tBrownian 
motion and tbranching processes are impor- 
tant special stochastic processes. In the same 
way as functions are often defined by differen- 
tial equations, there are stochastic processes 
which can be defined by tstochastic differen- 
tial equations. The theory of stochastic pro- 
cesses and stochastic differential equations 
can be applied to tstochastic control, fstochas- 
tic filtering, and statistical mechanics. The 
tergodic theory that originated in statistical 
mechanics is now regarded as an important 
branch of probability theory closely related to 
the theory of stationary processes. 


B. Probability Spaces 


Let Q be an tabstract space and % be a tø- 
algebra of subsets of ©. A probability measure 
(or probability distribution) over Q(B) is a set 
function P(E) defined for Ee% and satisfying 
the following conditions: (P1) P(E) z 0; (P2) for 
every sequence (E,) (nz 1,2, ...) of pairwise 
disjoint sets in 38, 


P (u s.) =) P(E,); 


(P3) P(Q)=1. The triple (Q, B, P) is called a 
probability space. The space Q (resp. each 
element c of Q) is called the basic space, space 
of elementary events, or sample space (resp. 
sample point or elementary event). We say that 
a condition (o) involving a generic sample 
point c is an event; in particular, it is called a 
measurable event or random event if the set E 
of all sample points satisfying e(w) belongs 

to B. We assume that an event is always a 
measurable one, since we encounter only 
measurable events in the theory of probability. 
Because of the obvious one-to-one corre- 
spondence between measurable events and B- 
measurable sets (i.e., the correspondence of 
each event e with the set E of all sample points 
o satisfying £), a B-measurable set itself is 
frequently called an event. If ¢ is an event and 
E is the 8-measurable set corresponding to e, 
we call P(E) or Pr(s) the probability that the 
event c occurs, Le, the probability of the event 
E. The complementary event (resp. impossible 
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event, sure event) is the complementary set E* 
(empty set (Z, whole space Q). For a finite or 
infinite family (E;) (AeA), the sum event (resp. 
intersection or product event) of E, is the set 
Ua E; (Q Ej. If EN F = Ø, then we say that E 
and F are mutually exclusive or that they are 
exclusive events. 

By the definition of P, we have 0 « P(E)< 
1 for any event E, P(@)=0, and P(Q)- 1. 
Moreover, if (E,) (n— 1,2, ...) is a sequence of 
pairwise exclusive events and E is the sum 
event of E,, we have 


P(E)- » P(E,). 


This property is called the additivity of proba- 
bility. If P(E) — 1, the event E is said to occur 
almost certainly (almost surely (abbrev. a.s.), 
for almost all œ, or with probability 1). 

Given a finite sequence (E,) (n— 1,2, ..., N) 
of events, we say that the events E, (n— 1,2, 
..., N) are mutually independent or that the 
sequence (E,) (n— 1,2, ..., N) is independent 
if every subsequence satisfies 


k 
P(E;, n E, n "P NE= I] P(E;). 
j=1 


Given an infinite family {E,} (4€ A) of events, 
we say that the events E, (AeA) are mutually 
independent or that the family {E,} (AeA) is 
independent if every finite subfamily is inde- 
pendent. The concept of independence of 
events can be generalized to a family {8,} 
(AeA) of o-subalgebras of B as follows. A 
family (8;) (AeA) of o-subalgebras of events is 
said to be independent if for every choice of 
E EB, the family {E,} (AeA) of events is 
independent. 

For a sequence {E,} (n— 1,2, ...) of events, 
the sets limsup, E, and liminf, E, are called the 
superior limit event and inferior limit event, 
respectively. The superior limit event (inferior 
limit event) is the set of all c for which in- 
finitely many events among E, (all events 
except finitely many E,) occur. Therefore 
P(liminf, E,) is the probability that infi- 
nitely many events among E, occur, and 
P(limsup, E,) is the probability that the events 
E, occur for all n after some number ny, where 
ng depends on o in general. The Borel-Cantelli 
lemma, which is concerned with the evaluation 
of P(lim sup, E,), reads as follows: Given a 
sequence (E,) (n—1,2,...) of events, we have 
(i) whether the events E, (n=1,2,...) are mutu- 
ally independent or not, X, P(E,) « oo implies 
that P(limsup, E,) — 0; and (ii) if the events 
E, (n=1,2,...) are mutually independent, 
X,P(E,)- oo imples that P(limsup, E,) — 1. 
Frequently, applications of part (ii) are greatly 
hampered by the requirement of independence; 
a number of sufficient conditions for depen- 
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dent events to have the same conclusion as 
(ii) have been discovered. The Chung-Erdós 
theorem [9] 1s quite useful in this connection. 


C. Random Variables 


Let (Q, B, P) be a probability space. A random 
variable is a real-valued function X defined on 
Q that is B-measurable ie, for every real 
number a, the set {w|X(w)<a} is in 38). If 

Xi X5, ..., X, are random variables, the map- 
ping X =(X,, X,, ..., X,) from Q into R” is said 
to be an n-dimensional (or R"-valued) random 
variable. More generally, a mapping X from 
(Q, B) into another *measurable space (S, Œ) is 
called an (S, €)-valued random variable if it is 
measurable, that is, for every set A of &, the set 
{w| X(w)e A} belongs to B. 

Let B! be the c-algebra of all *Borel subsets 
of the real line R. Then each random variable 
X induces a probability measure ® on (R, 8!) 
such that 


$(A)- P((o|X(o)eA)), ` Jempi, 


The measure € is called the (1-dimensional) 
probability distribution of the random variable 
X or simply the distribution of X. The point 
function F defined by 


F(a)=P({@| X(@)<a}), oeh, 


is a monotone nondecreasing and right con- 
tinuous function such that Im... F(a)=1, 
lim,- F(a) «0. The function F is called the 
cumulative distribution function (or simply the 
distribution function) of the random variable X. 
Similarly, an n-dimensional random variable 
X =(X,,..., Xn) induces its n-dimensional prob- 
ability distribution (or simply n-dimensional 
distribution) and its n-dimensional distribution 
function F(a,,...,a,)=P({@|X,(w)<a,,..., 
X,(@)<a,}). If the X, (n2 1,2, ..., N) are k,- 
dimensional random variables (n— 1,2, ..., N) 
we say that the Je X. , k,)-dimensional ran- 
dom variable X =(X,, X2, ..., Xy) is the joint 
random variable of X, (n — 1,2, ..., N) and that 
the (/-dimensional) distribution ® of X is the 
joint distribution (or simultaneous distribution) 
of X, (n2 1,2, ..., N). On the other hand, the 
k,-dimensional distribution 6, of X, is called 
the marginal distribution of the /-dimensional 
distribution ®. 

Given a finite sequence ( X,) (n 1,2, ..., N) 
of random variables, if the relation 


, 


P({o|X,(@)E A, (n=1,2,...,N)}) 

N 
=|] Dien Sege AA) (1) 
holds for every choice of 1-dimensional *Borel 


sets A, (n— 1,2, ..., N), we say that the random 
variables X, (n— 1,2, ..., N) are mutually inde- 
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pendent or that the sequence (X,] (n= 1,2, 
..., N) is independent. Given an infinite family 
{X,} (AEA) of random variables, we say that 
the random variables are mutually independent 
or that the family is independent if every 
finite subfamily is independent. The latter 
definition of independence of random vari- 
ables is compatible with the previous defini- 
tion of independence of o-subaigebras of $8; 
i.e., if BLX,] denotes the c-subalgebras of 

B generated by the sets {w| X,(@)eA,}, with 
A, an arbitrary 1-dimensional Borel set, the 
independence of the family {X,} (AeA) in the 
latter sense is equivalent to the independence 
of the family {8[X,]} AeA) in the previous 
sense. If the X, (n2 1,2, ..., NF) (X, (4€ A)) are 
k,- (k;-) dimensional random variables, then 
the independence of the family f X,) (n— 1,2, 
..., N) (1X4) (46 A)) is defined similarly; it is 
enough to take k,- (k;-) dimensional Borel 
sets A, (A,) for 1-dimensional Borel sets in 
equation (1). 

Given a family {X,} (4€ A) of random vari- 
ables, the smallest c-algebra with respect to 
which every X, is measurable is called the øc- 
algebra generated by {X,} (AeA) and is de- 
noted by S[X,|A4€A]. Each element of this 
class is said to be measurable with respect to 
the family {X,} (4€ A) of random variables. 

Since a random variable X is a 8- 
measurable function, we can speak of the 
"integral of X relative to the measure P on $8. 
If X is integrable relative to P, the integral of 
X over A is denoted by E(X; A). E(X;Q), usu- 
ally denoted by E(X), is called the mean, ex- 
pectation, or expected value of X, denoted also 
by M(X) or my. If (X — E(X)Y is integrable, 


V(X)- E(X — E(X))’) 


is called the variance of X, denoted by o?(X). 
The standard deviation of X is the nonnegative 
square root a(X) of the variance. If X and Y 
are two random variables for which E((X — 
E(X))(Y — E(Y))) exists, the value E((X — 
E(X))(Y — E(Y))) is called the covariance of 

X and Y. When X and Y have finite variances, 
the correlation coefficient of X and Y 1s defined 
by 


E((X — E(X)(Y — E(Y))) 
(E((X — E(X)?) E((Y - E(Y))*) 77 


It follows that E(aX + bY)=aE(X)+bE(Y), 
V(aX + b) - a?V(X) for any real numbers a, 

b, and that, in particular, E(X Y)-- E(X)E(Y), 
V(X + Y)=V(X)+ V(Y) for mutually indepen- 
dent random variables X and Y. It also fol- 
lows from the definition that —1< p(X, Y)<1 
in all cases. The independence of X and Y 
implies that p(X, Y) —0, but the converse is 
false in general. The variance is iraportant 
because of the well-known Chebyshev inequal- 
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ity: If X is a random variable with finite vari- 
ance ol, 


P(X — E(X)2ox«o?/c* 


for every positive number c. 


D. Convergence of Random Variables 


If P(lim, ,,, X, = X,,)=1, the sequence {X,,} is 
said to converge almost everywhere (almost 
certainly, almost surely (a.s.), or with proba- 
bility 1) to X,,. If lim,.,,, P(|X,—X.|>«)=0 
for every positive number e, the sequence {X,} 
is said to converge in probability to X,,. For a 
given positive number p the sequence {X,} is 
said to converge in the mean of order p to X,, if 
im,» E(| X, — X, |^) =9. Finally, if the ran- 
dom variables X, (n — 1,2, ..., oo) have distri- 
butions 6, (n — 1,2, ..., 00), respectively, and if 


lim | ` "dëst | ` fodol) 


noo 
—o0 


for every continuous function f with compact 
support, the sequence (X, is said to converge 
in distribution to A. Note that the sequence 
of random variables converging in distribution 
may not converge in any ordinary sense. For 
example, random variables converging in 
distribution may even be defined on different 
probability spaces. On one hand, almost sure 
convergence does not in general imply conver- 
gence in the mean. On the other hand, either 
almost sure convergence or convergence in the 
mean implies convergence in probability, and 
convergence in probability implies conver- 
gence in distribution. However, P. Lévy [10] 
proved that if the X, (n— 1,2,...) are mutually 
independent, the sequence 


is convergent almost everywhere if and only if 
it is convergent in distribution (or in proba- 
bility). The famous three-series theorem of 
Khinchin and Kolmogorov [11] claims that 
the series X, X, with X,, X,, ... independent is 
convergent almost surely if and only if there 
exists a sequence of independent random 
variables X1, X5, ... such that each of the three 
series 


3 PO, SA EGG. LV) 


is convergent. 


E. Conditional Probability and Conditional 
Expectation 


Let (Q, B, P) be a probability space and $y a c- 
subalgebra of B. If X is a random variable 
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with finite mean, the function 
un- | X(oc)dP, Ees, 
E 


defines on (Q, Y) a completely additive set 
function which is *absolutely continuous 
with respect to P. Therefore, by the *Radon- 
Nikodym theorem, there is an -measurable 
function f such that 


uB- | f(o)dP | forevery Ee. 
E 


This function is unique up to a set of P- 
measure zero and is called the conditional 
expectation (or conditional mean) of X relative 
to %, denoted by E(X | y). When $$ is gen- 
erated by a random variable Y, we also write 
E(X | Y) for E(X | Ẹ) and call it the conditional 
expectation of X relative to Y. In this case, 
there is a 'Borel measurable function f such 
that E(X | Y) - f(Y(o)), and we write E(X | Y= 
y) for f(y). The same fact holds when Y is a 
multidimensional random variable. It follows 
from the definition that the conditional expec- 
tation has the following properties, up to a set 
of P-measure zero: (i) if X 20, then E(X | §)> 
0; (ii) E(aX --bY|W)- aE(X |W) À- bE(Y | S); üii) 
E(E(X |W)) = E(X); (iv) if X and ware mutu- 
ally independent, i.e., S[ X ] and ğ are mutu- 
ally indepent, then E(X |) — E(X); (v) if X is 
t-measurable, then E(X |y) 2 X and E(XY |) 
= XE(Y|$y (vi) if Dm, X, = X, with | X,| € 
Y and Y is an integrable random variable, 
then lim, ,,, E(X, D = E(X,, | 8); (vii) if © is 

a o-subalgebra of %, then E(E(X | %)|6)= 
E(X | G); (viii) if X? is integrable and Y is any 
-measurable random variable, then E(( X — 
E(X Is E(X — XL 

When X is the indicator function (i.e., the 
characteristic function) ze of a set E in 3B, 
E(yg|) is called the conditional probability 
of E relative to (Y and is denoted by P(E |). 
In particular, if § ={F, F°, @,Q} with 1 > 
P(F)>0, P(E|®) is the simple function which 
takes the values P(EN F)/P(F) on F and 
P(EN F)/P(F^) on F°. These values are de- 
noted respectively by P(E | F) and P(E| F°). 
The definition of P(E| Y) or P(E| Y= y) is also 
the same as in the case of the conditional 
expectation. 

Let % be a o-subalgebra of B and Y a real 
random variable. According to the foregoing 
definition, P(Ye E|%) or P(Y !(E)| 8) is the 
conditional probability of the occurrence of 
the event Ye E under %. Since P(Ye E|W) is 
determined except on a P-null set depending 
on E, an arbitrary version of P(Ye E|), 
viewed as a function of E, does not always 
satisfy the conditions of a probability measure. 
However, we can prove that there exists a 
nice version of P( Ye E|) which is a proba- 
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bility measure in Ee for every weEQ and 
that such a version is unique almost surely. 
This version is called a regular conditional 
probability of Ye E under $Y or the conditional 
probability distribution of Y under %; this is 
written as P,(E| x). P(Ye E| X), (Ye E| X =x), 
P (E| X), and P,(E| X =x) are interpreted 
similarly. The conditional probability distri- 
bution can be defined not only for real ran- 
dom variables but also for every random 
variable which takes values in an tanalytic 
measurable space. 


F. Bayes's Formula 


Let E, E;, ..., E, be pairwise exclusive events, 
and assume that one of them must occur. If E 
is another random event, we have 


P(E)P(E| E) 


POSTE) FE (ELE, +. + P(E) PELE) 


where P(E,) ts the probability of the event E; 
and P(£|£j) is the conditional probability of E 
under the assumption that the event E; has 
occurred. This is called Bayes's formula. In 
practical applications E,,..., E, usually repre- 
sent n unknown hypotheses. Suppose that 

the probabilities on the right-hand side of the 
formula are given. We then apply Bayes's 
formula to reevaluate the probability of each 
hypothesis E; knowing that some event E has 
occurred as the result of a trial. This is why 
P(E) (P(E;| E)) ts called the a priori (a pos- 
teriori) probability. However, the determina- 
tion of the values of a priori probabilities is 
sometimes difficult, and we often set P(E) = 
1/n in practical applications, although this 
has caused a great deal of criticism. 

When X is a random variable subject to the 
distribution with continuous probability den- 
sity f(x), Bayes's formula is extended to the 
following form: 


J (Xo) P(E| X — xo) 


Tel El ga SEI — x) fG)dx' 


where f(x|E) is the conditional probability 
density of the random variable X under the 
assumption that the event E has occurred, and 
P(E| X = xo) is the conditional probability of E 
relative to X. 


G. Zero-One Laws 


In probability theory there are many theorems 
claiming that an event with certain properties 
has probability 0 or 1. Such theorems are 
called zero-one laws. Here, we mention two 
famous examples, Kolmogorov’s zero-one law 
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[6] and the Hewitt-Savage zero-one law. Let 
a=a(X,,X,,...) be an event concerning a se- 
quence of random variables ( X,]. « is called a 
tail event concerning {X,} if for every n, occur- 
rence or nonoccurrence of « depends only on 
{Xn Aaen, } For example, (lim, ,, X, —0] is 
a tail event. a is called a symmetric event con- 
cerning (X, if occurrence or nonoccurrence of 
a is invariant under every finite permutation of 
X,, X, .... For example, the event that X7. X, 
>0 for infinitely many n’s is a symmetric 
event. Kolmogorov's zero-one law: Every tail 
event concerning a sequence of indepen- 

dent random variables has probability 0 or 1. 
Hewitt-Savage zero-one law: Every symmetric 
event concerning a sequence of independent 
and identically distributed random variables 
has probability O or 1. 

Kolmogorov's zero-one law can be extended 
as follows: Let %,,n=1,2,..., be a sequence of 
independent o-subalgebras of B. Then the ø- 
algebra T= (^, J,., Bn» called the tail o- 
algebra of {Ẹ,}, is trivial, i.e., P(A)=0 or 1 for 
every det Kolmogorov's zero-one law is a 
special case where $y, is the c-algebra gen- 
erated by X, for every n. 
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Projective Geometry 


A. Introduction 


Projective geometry is the most fundamental 
of classical geometries and one of the first 
examples of axiomatized mathematics. 


B. Construction of Projective Geometry 


We construct projective geometry axiomati- 
cally [4]. Given two sets P, Q and a trelation 
r cP x Q, consider the triple $= (P, Q, T]. We 
call each element of P a point and each ele- 
ment of Q a line. If (p, I) eT holds for a point p 
and a line L then we say that the line / contains 
the point p. When two lines /, and l, contain a 
point p, we say that they intersect at p. When 
several points are contained in the same line, 
these points are said to be collinear, and when 
several lines contain the same point, these lines 
are said to be concurrent. For d we impose the 
following axioms: 

(1) There exists one and only one line that 
contains two given distinct points. 

(II) Suppose that we are given noncollinear 
points po, Pi, and p;, and distinct points q,, 
q2- Now suppose that (po, Pi, di) and (po, 
P2,42} are collinear triples. Then the line con- 
taining p4, p; and the line containing q,, q; 
necessarily intersect (Fig. 1). 


Po 


Pi P» 


Fig. 1 


(IIT) Every line contains at least three dis- 
tinct points. 

The Ẹ that satisfy axioms (I) and (II) and 
axioms (I), (II), and (IIT) are called general 
projective geometry and projective geometry, 
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respectively. The set of all points that are 
contained in a line is called the point range 
with the line as its base. In projective geome- 
try, there exists a one-to-one correspondence 
between the set of lines and the set of point 
ranges, so we can identify every line with a 
point range. In this case a line (ed is repre- 
sented as a subset of P, and the relation (p, l)e 
T means that the point p belongs to the set l. 

Let S be a subset of P and p,, p, be any two 
distinct points of S. If the line that contains 
pı and p; is always contained in S, then S is 
called a subspace. Points and lines are sub- 
spaces. Now we impose the following axiom: 

(IV) There exist a finite number of points 
such that any subspace that contains all of 
them contains P. | 

We call a projective geometry satisfying 
axiom (IV) a finite-dimensional projective 
geometry, which from this point on will be 
the sole object of our consideration. We call 
P a projective space. Consider sequences of 
subspaces of the type PZP ,2... 2P,2 
P) + Ø, where is the empty set. The num- 
ber n of the longest sequence is called the 
dimension of P. If P is of dimension n, we 
write P" instead of P. We call P! a projec- 
tive line and P? a projective plane. Each sub- 
space S of P, together with the set of lines 
of P contained in S, gives a finite-dimensional 
projective geometry, and so S is a projective 
space. Lines and points are projective spaces 
of dimensions 1 and 0, respectively. By con- 
vention, the empty set is a (—1)-dimensional 
projective space. We call each 2-dimensional 
subspace a plane and each (n — 1)-dimensional 
subspace in P" a hyperplane. 

Let M, N be subspaces of P, and for a pair 
of points pe M, qe N consider the set pUq 
of all points on the line that contains p and 
q. The set (pUq|pe M,qe N] is denoted by 
MUN, and we call it the set spanned by M 
and N. By convention, we put @ UM=M 
and pU p=p. Then P’U P" is the projective 
space of the lowest dimension which contains 
P' and P*. On the other hand, if we denote 
the intersection of P” and P? by P”AN P5, then 
it is the projective space of highest dimension 
that is contained in both of them. We call 
P'U P? and P'fP5 the join and the intersec- 
tion of P' and P^, respectively. When the 
dimension of the space spanned bart) 
points is r, we say that these points are inde- 
pendent; otherwise they are dependent. If any 
r+1 points of a given subset M of P" are 
independent for each r <n, we say that points 
of M lie in a general position. The space P" 
necessarily contains r 4- 1 independent points, 
and there necessarily exists a P” that contains 
r+1 arbitrary given points in a projective 
space; it is unique if the points are indepen- 
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dent. If D'U P*= P' and P'(1P*— P", then 
r+s=t+u. We call the latter the dimension 
theorem (or intersection theorem) of projective 
geometry. 

The set 2, of all hyperplanes that contain a 
P5 ? in P" is called a pencil of hyperplanes, 
and the PS * common to them is called the 
center of 2,. If a pencil of hyperplanes contains 
two distinct hyperplanes of P", then the pencil 
is determined uniquely by these two. When 
n —2 and 3, it is called a pencil of lines and a 
pencil of planes, respectively. Each pencil of 
hyperplanes of P", or more generally, each 
pencil of hyperplanes of a subspace of an arbi- 
trary dimension in P", is called a linear funda- 
mental figure of P" or simply a fundamental 
figure. In P", the set X, of all P"^', P"^'*!.,,,, 
P"^! that contain the same P5 ' ! is called 
the star with center P5 " '. Each set that con- 
sists of the totality of subspaces of an arbitrary 
demension in the same P" or a subset of it is 
called a P’-figure. 

Under the assumption that P” and P* do not 
have points in common, the operation of con- 
structing P'U P* from P” and P" is called pro- 
jecting P^ from P”, Assuming that P” and P 
have points in common the operation of con- 
structing P'1 P5 from P” and P" is called cut- 
ting P? by P”. Suppose that we are given spaces 
P, P. and P,, and a fundamental figure 2 in 
the space P. By projecting 2 from P, and then 
cutting it by P), we can construct a funda- 
mental figure X' on P,. This operation is called 
projection of X from P, onto P,, and we call P) 
the center of projection (Fig. 2). In this case, we 
say that X and A are in perspective and denote 
the relation by Z x 2”. If for two fundamental 
figures X and Z” there exist a finite number of 
fundamental figures F; (1 <i</) such that ZX 
Fik...KF,K’, then we say that X and X are 
projectively related to each other and denote 
this by X 4 X' (Fig. 3). Now for arbitrary sub- 
spaces Pr, P5 (O rn), we take P5" ! that 


R, 
3j 
P 
ES P, 
Fig. 2 
x 
P 
F 
Fig. 3 
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have no points in common with them and 
project each point of P onto P5 from br, 
The one-to-one correspondence P > P5 thus 
obtained is called a perspective mapping. If a 
one-to-one correspondence D — P; is repre- 
sented as the composite of a finite number of 
perspective mappings, then we cal! it a projec- 
tive mapping. These mappings are extended to 
those of fundamental figures, too. 

Suppose that in a proposition or a figure in 
P", we interchange P” and P"! (0rzn) 
and also interchange contains and is contained 
(and related terms). The proposition or the 
figure thus obtained is said to be dual to the 
original one. In projective geometry, if a pro- 
position is true, then its dual proposition is 
also true (duality principle). This is assured 
because propositions dual to axioms (UL (IV) 
hold; and P” and 2, are dual to each other. 
The projective space Pý obtained by the prin- 
ciple of duality, by regarding the hyperplanes 
of P" as points of Pi is called the dual space of 
P". 


C. Projective Coordinates 


Here we introduce projective coordinates in P”. 
Consider Desargues’s theorem: Suppose that 
Pi» Po, P3 and 41, da, d4, are two sets of points 
in P”, each of which is independent and satis- 
fies p;#q; (i= 1,2, 3). If the three lines p; U q; 
(i — 1,2, 3) are concurrent, then the three 
points (p; U p3)f1(q5 Uq3), (p3 Up) (43 U qi). 
(p, U p;)1(q, U q5) are collinear. The converse 
is also true. This theorem holds for n z 3 gen- 
erally. However, when n —2, there exist projec- 
tive geometries for which it does not hold; we 
call these non-Desarguesian geometries. In 
such cases tt is impossible to introduce coordi- 
nates, so we assume Desargues's theorem for 
n2. 

When four points p; (1 <i<4) in P" lie on 
the same plane and in general position, we call 
the figure that consists of these four points and 
the six straight lines o — p;Up; (1 xix j«4)a 
complete quadrangle p, p,p3p4; each p, is called 
a vertex, and each gj; is called a side. If six 
points q; (1 <i<6) on a line l are points of 
intersection of six sides du, 913. 014: 034» 024: 
da of a complete quadrangle with /, we call 


Fig. 4 
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them a quadrangular set of six points (Fig. 4). 
By Desargues's theorem, we can show that if 
there are given three fixed distinct points on a 
line J, then any pair of distinct points on / 
determines uniquely a point on / such that the 
six points thus obtained constitute a quadran- 
gular set. The quadrangular property is invari- 
ant under projective mappings. On a line / 

we fix three mutually distinct points po, Di, Po- 
For any two points p,, p, different from p,, on 
I, we take the point s such that po, Px» Po» Ds. 
Py» $ constitute a quadrangular set of six points 
and call s the sum of p, and p, with respect 

to [Po; Po; P4] (Fig. 5). On the other hand, 

the point t such that po, Px, Di, Pao» Py, t con- 
stitute a quadrangular set of six points is 
called the product of p, and p, with respect to 
[ Po: P. P1] (Fig. 6). When we are given a fixed 
triple [ po. Po; p1] on a line l, as before, the set 
of points on ! not equal to p, is called a point 
range of the number system, provided that we 
exclude p,, from the point range. We call the 
set of three points [ po, Po» p, ] a frame (or 
projective frame) of /, and we call p, the origin, 
pı the unit point, and p,, the supporting point. 


A point range of the number system consti- 
tutes a tfield (which may be noncommutative) 
with respect to the previously defined sum and 
product. We call the field a Staudt algebra, 
and an abstract algebra isomorphic to it is 
called a coefficient field of P". We denote by 
K(po. Po» P1) the Staudt algebra that is deter- 
mined by a frame [ po, Po» P1]. A projective 
mapping of l onto itself that leaves invariant 
each of three distinct points po, Po» p, on! is 
necessarily an tinner automorphism of the field 
K(po; Po» P1). Denoting by (po, Po: P1] a frame 
on a line l of P" with coefficient field K, we call 
each isomorphism 0: K(po, p, p,) K a co- 
ordinate system of Í. For each point p on | we 
call the element č — 0(p) of K the inhomoge- : 
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neous coordinate of p with respect to this 
frame. Also, we call the pair (x°, x!) such that 
x9, x3 eK and x! (x?) ! = homogeneous co- 
ordinates of p. Since the supporting point p,, is 
excluded from K (po, Po» P1), we fix (0, x!) such 
that x! 40 as the homogeneous coordinates 
of P. In order for (x°, x!) and (y°, y!) to be 
homogeneous coordinates of the same point, it 
is necessary and sufficient that there exist an 
element 10 of K such that y* — x^4 (a=0, 1). 

In conformity with these results, we now 
introduce coordinates in P". A set à —[ao, 

4,, ..., âp, Hl of ordered n+ 2 points in a gen- 
eral position is called a frame (or projective 
frame) of P"; each of a, (O0 a <n) is called a 
fundamental point, and u is called a unit point. 
For A-a,U...Ua, (0&ag « ... «a, <n), we 
denote by A* the space spanned by the re- 
maining fundamental points. For any point 

p that is not contained in A*, we put p, — 
AN(pU A*) and call it the component of p 

on A. Then iy, — [a,,, ..., a, Hal is a frame 

of A. Hereafter, we shall omit u, for brevity. 
Suppose that isomorphisms 0,,: K(a,, aj)  K 
are assigned for each pair a, B (0 &a « B x n). 
Under a certain condition, the system (0,5) is 
determined by one of the H. In this case we 
denote {6,,} by 0 and call (15, 0) a projective 
coordinate system of P". For any point p of P" 
not contained in Ao — a, U ...Ua,, we denote 
by p; the component of p on a, Ua; (1 <i<n), 
and we put €'=6(p,). The elements of the 
ordered set (£!, £?, ..., č") are called the in- 
homogeneous coordinates of p with respect to 
J, and those of the set (x°, xt, ..., x") such that 
x'(x9) ! = &! are called the homogeneous co- 
ordinates of p. When p is contained in Ay, we 
define (0, x’, ..., x") as homogeneous coordi- 
nates of p with respect to %, provided that 
(x!, ..., x") are homogeneous coordinates of p 
with respect to y, . 

Now we represent the point whose coordi- 
nates are GH, x!, ..., x") simply by x. In P", 
when coordinates are introduced, a necessary 
and sufficient condition for points z to be on 
the line that passes through two distinct points 
x and y is that z^ — x*A-F y*u (0a x n), when 
A, pE K are parameters. More generally, a 
point z is contained in the space spanned by 
r+1 independent points x, (0x f <r) in P" if 
and only if z*= 35 5x54? (enen, A? e K). In 
particular, the equation of a hyperplane is 
represented in the form 27.9 X,z* 20 (X,€K) 
with respect to variable coordinates z^. There- 
fore each hyperplane is uniquely determined 
by the ratio of Xo, X,,..., X,. We call Xo, 

X,, ..., X, hyperplane coordinates of the hyper- 
plane. If n —2, they are called coordinates of a 
line, and if n— 3, plane coordinates of a plane. 
(For coordinates of P" in P" — 90 Coordinates 
B.) 
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D. Projective Transformations 


A one-to-one correspondence o between the 
point sets of two projective spaces P" and P" is 
called a collineation in the wider sense if for any 
three points p,, P2, p that are collinear, o(p; 
(i= 1,2, 3) are also collinear and vice versa. If 
P"— P}, we call ọ a correlation; if P" = P", we 
cal! o a collineation. If we denote a correlation 
by to, any other correlation is obtained as a 
composite of t, and a collineation. If t is a 
correlation, it naturally induces a mapping 

Pi P", which we also denote by t. Then tot 
is a collineation. If tot is an identity, we call t 
an involutive correlation. Suppose that o: P"— 
P" is a collineation in the wider sense and 
O<r<n-—1. Then @ induces a one-to-one cor- 
respondence between the set of r-dimensional 
subspaces of P" and the set of r-dimensional 
subspaces of P"; and if P' > P* in P", then 

oP > oP*. 

Next, suppose that we are given two projec- 
tive spaces P" and P" that are subspaces of a 
space P" (n « N). (When Desargues's theorem 
holds, any two projective spaces of the same 
dimension can be identified with subspaces of 
a projective space of higher dimension.) In this 
case, when a collineation in the wider sense 
vm: P" P" is a projective mapping, we call it a 
projective collineation in the wider sense. A 
projective collineation is also called a projective 
transformation. The totality of collineations of 
P" constitutes a ftransformation group and is 
called the group of collineations of P"; we de- 
note it by €(P"). The totality of projective 
transformations of P" constitutes a tnormal 
subgroup of €(P"); we denote it by 6(P") and 
call it the group of projective transformations. 
The totality of projective transformations that 
leave invariant a frame & of P" constitutes a 
subgroup %*!(%). It is isomorphic to the 
group of tinner automorphisms 3(K) of the 
coefficient field K of P". A collineation is not 
necessarily a projective transformation. The 
former is obtained as a composite of a projec- 
tive transformation and an automorphism of 
the coefficient field. Specifically, if we denote 
the group of tautomorphisms of K by AWK), 
then &(P^)/(6(P") = W(K)/3(K). Hence in order 
for all collineations to be projective transfor- 
mations, it is necessary and sufficient that all 
automorphisms of the coefficient field be inner 
automorphisms. If the coefficient field is the 
real number field, then collineations are always 
projective transformations. For the complex 
number field, however, this is not necessarily 
true. 

Now, we consider the following three pro- 
positions: (1) The coefficient field of P" is com- 
mutative. (2) Given frames and zk of P", 
there exists a unique projective transformation 
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sending $Y onto ak. (3) Given two distinct lines 
l, and }, contained in a plane in P" and two 
sets of three distinct points p; (i= 1,2, 3) and q; 
(i— 1, 2, 3) that lie ont, and l, respectively, 
then the three points (p,Uq3)N(p3 Jq,), 

(p3 Uq1) (p, Uq3), and (p, Uq;5)P (p; Uq,) are 
collinear. These three propositions are mutu- 
ally equivalent. We call proposition (2) the 
fundamental theorem of projective geometry 
and proposition (3) the theorem of Pappus. If 
the coefficient field is the real (complex) num- 
ber field, we call the projective space a real 
(complex) projective space. In classical geom- 
etry, only these cases were studied. 

Suppose that the coefficient field is com- 
mutative. Then, if we assign an isomorphism 
05: K(Po, p, p.) -^ K for the Staudt algebra 
K(po. Po, p1) on a line in a space, then the 
isomorphism 0 of the Staudt algebra K(qo, 
d dl on an arbitrary line onto K can be 
uniquely determined so that 0^! o6, is a pro- 
jective mapping. Utilizing such isomorphisms, 
we can determine homogeneous coordinates 
in an arbitrary subspace of P" by a frame on it. 

Suppose that the coefficient field is a com- 
mutative field whose characteristic is not 2. 
For four collinear points p; (1 <i <4) in P", 
where p,, p2, p3 are distinct and p, Z p,, we 
consider a frame such that p,, p2, and p, are, 
respectively, the supporting point, the origin, 
and the unit point. The inhomogeneous co- 
ordinate À of p, with respect to this frame is 
called the anharmonic ratio (cross ratio or 
double ratio) of these four points and is de- 
noted by [p1, P2; P3, p4]. If we dencte the 
inhomogeneous coordinates of p; with respect 
to a general frame by (x2, x]) (i= 1, 2, 3, 4), then 
2 can be expressed as 
A— [pi P3 P3; P4] 

(x1x3 — x$xi) a xà — xàx3) 


(ell xxix} xS) 





Moreover, if we interchange the order of the 
four points, then we have 


A P2; P1; Pas P3] 
=[P3; P4; P1. P2] 
= [P4 P3; P2; Di 


1 
[P1 P2; Pa, P3] 77 


[P1,P3;P2,Pal=1—A, 


1 
[Pi P3; Pa P2] Sig 
A—1 
LPi Da; P2. P3] E. 
A 
[Pis Pa; P3 Dale 
ÀA-—1 
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In general, these six values are different; how- 
ever, there are the following two exceptions: 
when A= —1, 1/2, and 2; and when 4 is a root 
of 4?— 44-10. When A= —1, these four 
points are called a harmonic range of points, 
and the points p,, p, are called harmonic 
conjugates with respect top, P2; or p,, p; and 
P3, p4 are said to be harmonically separated 
from each other. When 4? — 4-- 1 —0, these 
four points are said to be an equianharmonic 
range of points. For the dual of these, we can 
consider the anharmonic ratio of four hyper- 
planes of a pencil of hyperplanes. The concept 
of the anharmonic ratio can be extended fur- 
ther to the case of four elements of funda- 
mental figures in general. The anharmonic 
ratio is a quantity that is invariant under 
projective transformations. 

Each projective transformation xXx is 
expressed with respect to homogeneous co- 
ordinates x^ (0 x à « n) of P" as 


n 
pxte y tix, p, tge K, 
B=0 


p#0,  det(t2)40. (1) 


Conversely, if T — (t5) is a tregular matrix 
(tg € K), then (1) determines a projective trans- 
formation. So there is a one-to-one corre- 
spondence between projective transformations 
and tequivalence classes of the regular ma- 
trices T — (t5) with the tequivalence relation 
T~AT (Ae K - (0]). Therefore, when K is com- 
mutative, the group of projective transforma- 
tions G(P") of P" is isomorphic to the factor 
group PGL(n- 1, K) of the "general linear 
group GL(n -- 1, K) with the coefficient field K 
by its center (pI| pe K ~{0}}; that is, O(P") = 
PGL(n ^ 1, K). 

Extending the definition of projective trans- 
formations, we call the transformation repre- 
sented by (1), with an arbitrary square matrix 
that is not necessarily regular, a projective 
transformation. When T is regular, it is called 
a regular projective transformation, and when 
T is not regular, it is called a singular projec- 
tive transformation. In particular, if the frank 
of T is n+ 1 —h, then we say that the projective 
transformation is singular of the hth species. If 
(1) is singular of the hth species, n+ 1 hyper- 
planes 275-9 tgx* =0 (0a « n) have a space 
PN) in common. We call PRT the singular 
subspace of this transformation. À projective 
transformation is not defined on its singular 
subspace. A singular projective transformation 
of the hth species is the composite of the pro- 
jection of P" onto some P"~* with the singular 
subspace as its center and a regular projective 
transformation of P"~*. 

If the coordinates of a point are denoted 
by (x^) and hyperplane coordinates with 
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respect to some frame by (X,), then the linear 
transformation 


n 
Ge DÄ Y Dal, p USE K, (2) 
f-0 


is a correlation. (Here also, we extend the 
definition of correlation and include the case 
where T, — (175) is not regular.) The condition 
that t, is an involutive correlation is given by 
T, = +'T,. When T, = —'T,, the involutive 
correlation tą is called a null system. The 
correlation Tą is a null system if and only if 
any point x of P" is contained in the hyper- 
plane t,(x). When T, —'T,, we call the involu- 
tive correlation v, a polar system. For a polar 
system t,,, the set of points x that are con- 
tained in hyperplanes c, (x) constitutes a qua- 
dric hypersurface (or hyperquadric). 


E. Quadric Hypersurfaces 


Let t, be a polar system, and let O3 ! be the 
totality of points x contained in c, (x). Then 
the equation of the quadric hypersurface Q^"! 
is given by 


TMs 


t5 x* xf — 0. (3) 


For such a correlation t, we call a relation 
between the set of points x of P” and the set of 
hyperplanes t,(x) a polarity with respect to 

^ |. We call t,(x) the polar of x with respect 
to OST, and x the pole of t, (x) with respect 
to OT If the points of intersection of a line 
passing through a point x with Q5! and 1, (x) 
are denoted by z,, z,; y, then x, y; z,,z5 isa 
harmonic range of points. When a point x lies 
on the polar of a point y, we say that x and y 
are mutually conjugate. Each point on Q^ ' is 
conjugate with itself, and the converse is also 
true. We call the polar of a point on Q7"! the 
tangent hyperplane of Q^ ! at that point. 

If t, is regular or singular of the hth species, 
we call the corresponding quadric hypersur- 
face regular or singular of the hth species. If 7, 
is singular of the hth species, its singular sub- 
space is contained in Q5 !. We call points on 
this singular subspace singular points of Q5 `. 
OS !, which is singular of the first species De, 
Q5 ! with just one singular point), is called a 
cone. 

We call a subspace contained in Q5 "a 
generating space. If it is a line we call it a 
generating line. We put q —(n— 2)/2 or (n— 1)/2 
according as n is even or odd. Then, if the 
coefficient field is an talgebraically closed 
field, for each regular Q5 ` there necessarily 
exist q-dimensional generating spaces. Also, 
Q3 is a truled surface covered by two families 
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of generating lines, and Q?* is covered by two 
families of k-dimensional generating spaces. 

If p; (1 <i<5) are five points in a general 
position in a plane, then there exists one and 
only one Q1 passing through these points; we 
call O a conic. In order that six points p; 
(1«i«6) in a plane lie on Ql, it is necessary 
and sufficient that the three points (p, U pa) N 
(p4U ps), (p; U p3)(p5 U po), and (p3 U pa) 
(pg U pı) be collinear (Pascal's theorem). The 
dual of the last theorem is called Brianchon's 
theorem. 

Given two hypersurfaces Q5 !, Q5 ! in P", 
we consider another Q5 ! such that the polar 
of an arbitrary point x with respect to Q5! 
belongs to the pencil of hyperplanes deter- 
mined by polars of x with respect to Q7! 
and OT The set of all such 03"! is called a 
pencil of quadric hypersurfaces. It is the set of 
all Q^"! that pass through the intersection of 
OCT and Q* 1. In the cases n—2 and 3, we 
call it a pencil of conics and a pencil of quad- 
rics, respectively. 

Denoting by /? and I? (0€ ix j € 3) the 
tPlücker coordinates of two straight lines / 
and / in P?, we put 


(l D= [01723 EM [92713 + [pop 
4 JO! [23 — 702 13 4 793 AE) (4) 


Then (1,1) 2 0 holds. If we regard these /? as 
homogeneous coordinates of P3, then there 
exists a one-to-one correspondence between 
the points on the regular quadric hypersurface 
Q^ defined by (1,1) 2 0 and the lines in P?. We 
say that each point of Q5 is the image of the 
line corresponding to it in P?. Two lines (and 
T intersect if and only if (J, 1) D. Geometrically, 
this means that the images of l and Í are con- 
jugate with respect to Q4. Therefore the line 
passing through the images of | and T is a 
generating line of Q2. The image of a pencil of 
lines in P? is a generating line of Q5. Quadric 
hypersurfaces and sets of lines in P? are im- 
portant objects of study in both projective and 
algebraic geometry. In particular, linear line 
congruences (linear line complexes) that are 
families of lines dependent upon two (three) 
parameters are of great interest. In these 
theories, quadric hypersurfaces play a fun- 
damental role. When the coefficient field is 
noncommutative, the above theory has to be 
greatly modified. 


F. Projective Geometry and the Erlangen 
Program 


From the standpoint of the tErlangen pro- 
gram of F. Klein, the aim of projective geom- 
etry is to study properties that are invariant 
under the group of projective transformations. 
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Utilizing various subgroups of this group, we 
can reconstruct various classical geometries. 
For example, consider the projective space P" 
whose coefficient field is the real number field, 
and fix a hyperplane /7,,. Let S(P”) be the 
subgroup of projective transformations formed 
by all projective transformations that leave JI. 
invariant. Then the geometry that belongs to 
this group is taffine geometry. Similarly, fix 

an imaginary regular quadric hypersurface 
Q^ ? in DH. and consider the geometry that 
belongs to the subgroup of S(P") leaving this 
Q^ ? invariant. We thus obtain Euclidean 
geometry. Moreover, if we assign some regular 
quadric hypersurface Q5 !, then the geometry 
belonging to the subgroup of 6(P") that leaves 
the Q5 ! invariant is a tnon-Euclidean or 
tconformal geometry according as the trans- 
formation space is the set of inner points of 
Q57? or the whole OST. 


G. Projective Geometry and Modular Lattices 


tLattices (lattice-ordered sets) and projective 
geometry are intimately related. The totality of 
subspaces of each dimension in general projec- 
tive geometry $ constitutes a tcomplete 
tmodular lattice LOD) with respect to the in- 
clusion relation. If $B is a finite-dimensional 
projective geometry, then it is an tirreducible 
complemented modular lattice of finite theight. 
Conversely, suppose that L is a modular lat- 
tice with *tminimum element ®, and denote by 
P the totality of elements p tprime over 6 (i.e., 
tatomic elements) and by Q the totality of 
elements / prime over atomic elements. Then, if 
p «land (p, l)er, SS(L) 2 (P, Q, T] is a general 
projective geometry. If L is an irreducible 
complemented modular lattice of finite height, 
then P(L) is a finite-dimensional projective 
geometry; in this case we have P = 3B(L(9B)) 
and Lz L((L)). So we may consider projec- 
tive geometry and irreducible complemented 
modular lattices as having the same mathe- 
matical structure. If a lattice L is an n- 
dimensional projective geometry, its *dual lat- 
tice is also an n-dimensional projective geom- 
etry, and this is the principle of duality. 


H. Analytic Representations of Projective 
Geometry 


Let K be an arbitrary field, commutative or 
noncommutative. For an arbitrary natural 
number n, we consider an (n+ 1)-dimensional 
(for the noncommutative case, right or left) 
linear space V"*! (K) over K. The totality of 
linear subspaces in it constitutes an irreducible 
complemented modular lattice P"(K) with 
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respect to the inclusion relation, and P"(K) 
gives rise to an n-dimensional projective geom- 
etry. We call it a right or left projective space. 
Points of P"(K) correspond to (right or left) 
1-dimensional linear subspaces. Conversely, 

it can be shown that an n-dimensional projec- 
tive geometry over K is isomorphic to P"(K). 
Therefore projective geometries can be com- 
pletely classified by means of the natural 
number n and the field K except when n=2 
and the geometry is non-Desarguesian. We 
may restate this fact as follows: We consider a 
space P= V"*! (K)— {0}. If we fix a basis of 
V"*! (K), then we can represent P — (x =(x°, 
xi, ...,x"|x*eK, Oa xn], x (0,0, ...,0). If 
there exists a nonzero element 4 of K such that 
y x4, then the elements x and y are called 
equivalent; we write x~ y. We denote by P"(K) 
the factor set of P under the foregoing equiva- 
lence relation, and by [x] the equivalence class 
that contains x. We put l([x], [y])= {[z]|z*= 
x*à+ y*m, VÀ, ne K) and Q= {Ex} [YI Ex], 
[y] e P"'(K)). We call each element of P"(K) a 
point and each element of Q a line. Then these 
'points and lines and the natural inclusion 
relation satisfy axioms (I)-(IV) and give an n- 
dimensional projective geometry. When K is a 
*topological field (e.g., the real number field, 
the complex number field, or the fquaternion 
field), we may define the topology of P"(K) as 
the factor space P"(K)= P/~. In particular, if 
K is the real number field R, then P"(R) is 
homeomorphic to the factor space obtained 
from the n-dimensional hypersphere S":(x°)? + 
... +(x")? = 1 in the (n+ 1)-dimensional Eucli- 
dean space E"*! by identifying the end points 
of each diameter. Hence P"(R) is compact. 
Similar facts hold for the cases of the complex 
and quaternion number fields. Since the group 
of projective transformations G(P"(K)) acts 
ttransitively on P"(K), if K is a topological 
field we can regard P"(K) as a thomogeneous 
space of the topological group G(P"(K)). 
Moreover, the totality of r-dimensional sub- 
spaces in P"(K) constitutes a tGrassmann 
manifold. In algebraic geometry the !direct 
product of two projective spaces is important; 
we call it a biprojective space. 


I. Tits's Theory of Buildings (Generalization of 
Projective Geometry) l 


In a situation when a triple (G, B, N) consisting 
of a group G and its subgroups B, N satisfies 
the axioms of a BN-pair or Tits system (— 13 
Algebraic Groups R), a new geometric object, 
called a “building,” was introduced by J. Tits 
[9]. His theory contains projective geometry 
as a particular case. The theory of buildings 
has deep connection with algebraic groups. 
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The Tits system corresponds to a projective 
geometry in the following case. Let k be any 
commutative field, and let G be the general 
linear group of degree n over k, i.e., G consists 
of all nonsingular square matrices of degree n 
with entries in k. Let B be the subgroup of G 
consisting of all upper triangular matrices (i.e., 
matrices whose entries below the principal 
diagonal are all zero). Let N be the subgroup 
of G consisting of all monomial matrices (i.e., 
matrices such that each column and each row 
contain just one nonzero entry). Then (G, B, N) 
forms a Tits system called type (A, ,). The 
corresponding theory of buildings of the type 
above is nothing but the projective geometry. 
Thus by means of Tits's theory of buildings the 
relationships among projective geometry and 
other geometries have been clarified [9]. 
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A. Definitions 


Let A and A' be subsets (with relative topol- 
ogy) of tcomplex manifolds X and X' of dimen- 
sion n, respectively. A homeomorphism f of 

A onto A’ is called a pseudoconformal trans- 
formation if there exists a tbiholomorphic 
mapping f of an open neighborhood of A in X 
onto an open neighborhood of A' in X' such 
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that f(x) — f(x) for xe A. If there exists such a 
mapping f, A is said to be pseudoconformally 
equivalent to A'. Pseudoconformal geometry is 
a geometry that studies geometric properties 
invariant under the pseudoconformal equiva- 
lence. However, most studies in pseudocon- 
formal geometry so far have concentrated 
mainly on the investigation of smooth hyper- 
surfaces in a complex manifold — more specifi- 
cally, the smooth (or real analytic) boundaries 
of bounded domains in C". In fact, to pseudo- 
conformal geometry on hypersurfaces we can 
apply the methods of differential geometry as 
well as those of the theory of functions of 
several complex variables. 

H. Poincaré [1] studied perturbations of 
the boundary of the unit ball in C? that are 
pseudoconformally equivalent. E. Cartan [2] 
studied the equivalence problem of hyper- 
surfaces in C? and gave the complete list of 
all simply connected hypersurfaces on which 
the group of pseudoconformal automorphisms 
acts transitively. Such a hypersurface is called 
homogeneous. 

Let M be a smooth hypersurface in a com- 
plex manifold X with the *almost complex 
structure tensor J, i.e., J,: T, X > T, X is an 
involutive linear automorphism of the tangent 
space T, X of X at x induced by the complex 
structure of X. Put H.M TE MIT AM for 
xe M. The union of all H,M is called the 
bundle of holomorphic tangent vectors of M 
and is denoted by H(M). H(M) is also called 
the CR (Cauchy-Riemann) structure of M. Let 
M' be a smooth hypersurface in a complex 
manifold X’. A diffeomorphism f: M — M' is 
called a CR-equivalence if the *differential 
mapping Tf: TM — TM' of f preserves the 
CR-structures, where TM denotes the ttangent 
bundle of M. If f: M M' is a pseudocon- 
formal transformation, then f is clearly a CR- 
equivalence. Let E, be the annihilator of H,M 
in T*(M). Then the union of E, (xe M) defines 
a tline bundle E over M. The Levi form L, at 
x€ M, defined only up to a multiplier, is the 
quadratic form on H. AM defined by L,(u, v) = 
dO(u, v) for u, ve H, M, where @ is a nonvanish- 
ing section of E in a neighborhood of x. If the 
Levi form is nondegenerate at every point of 
M, M is called a nondegenerate hypersurface. 
In particular, if the Levi form is definite, then 
M is called strictly pseudoconvex. 


B. Equivalence Problem 


Cartan studied the equivalence problem for 
the case n — 2, and obtained a criterion for two 
hypersurfaces in C? to be pseudoconformally 
equivalent. N. Tanaka (1965) generalized the 
method of Cartan for the case n > 3 and ob- 
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tained a criterion in terms of *Cartan connec- 
tions in some fiber bundle over the hypersur- 
faces. However, he did not publish the proof of 
his result until S. S. Chern and J. Moser [4], in- 
dependently of Tanaka [3], obtained a similar 
result and gave the first proof of this result. 
Let M be a real analytic hypersurface in C"! 
(n > 1) whose Levi form has p positive and 

q negative eigenvalues (p 4- q — n). Let H be 

the subgroup of SU(p-- 1, q 4 1) leaving the 
point (1,0, ...,0)e C"*? fixed. According to the 
Cartan-Tanaka-Chern-Moser result, we can 
construct functorially a principal fiber bundle 
Y over M with structure group H and a Car- 
tan connection w on Y with values in the Lie 
algebra of SU(p+1,q+ 1) such that if M and 
M' are pseudoconformally equivalent, then 
there is a bundle isomorphism q of Y to Y’ 
preserving the Cartan connections: g*a’ — c, 
where Y’ is the corresponding principal fiber 
bundle over M’ and o is the Cartan connec- 
tion on Y’. Conversely, if there is a bundle 
isomorphism q of Y to Y’ such that o*o' — c, 
then M and M' are pseudoconformally equiva- 
lent. By using this solution of the equivalence 
problem, we can prove that the group A(M) 
of all pseudoconformal automorphisms of a 
nondegenerate real analytic hypersurface M in 
a complex manifold X of dimension n is a Lie 
transformation group of dimension not exceed- 
ing n? 4- 2n. H. Jacobowitz [5] constructed 

a similar bundle B over M and a Cartan con- 
nection on B in a different way from that of 
Chern and Moser. We do not know whether 
B and Y actually coincide. 


C. Classification 


Cartan (1932) classified all simply connected 
homogeneous hypersurfaces in C?. In partic- 
ular, he proved that if M is a compact homo- 
geneous strictly pseudoconvex hypersurface 
with dim M —3, then M is pseudoconformally 
equivalent to either (1) $? or its quotient by 
the action of a root of unity or (2) the hyper- 
surface given in the 2-dimensional projective 
space by the equation in homogeneous coordi- 
nates: (z,Z, - z22; +2323) = m?|zi zzi? 
(m> 1) or the double covering of such a sur- 
face. A. Morimoto and T. Nagano [6] and 
later H. Rossi [7] tried to generalize this 
result and obtained a partial classification of 
simply connected compact homogeneous 
hypersurfaces with dimension > 5. D. Burns 
and S. Shnider [8] classified all simply 
connected compact homogeneous strictly 
pseudoconvex hypersurfaces M with dim M — 
2n - 1 2 5. They proved that M is pseudo- 
conformally equivalent to S?"*! or the tangent 
sphere bundle of a rank one tsymmetric space 
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or the unit circle bundle of a homogeneous 
negative line bundle over a homogeneous 
algebraic manifold. 

On the other hand, a real hypersurface M in 
a complex manifold X of complex dimension 
n+1 is called spherical if at every point pe M, 
there is a neighborhood of p in X such that 
UNM is pseudoconformally equivalent to an 
open submanifold of $?"*!, The hyperquad- 
ric Q"*! = 0U,,, is spherical, where U,,, = 
(6... Zare" Imz) >z? +... + 
LGL If M is spherical, then the universal 
covering space M of M is also spherical. If M 
is a homogeneous spherical hypersurface, then 
there is a covering into mapping f: M 5 S?"*!, 
and f(M) is a homogeneous domain in Sin"). 
We know that the only compact simply con- 
nected spherical M is S?"*!, Burns and Shnider 
classified all homogeneous domains M in 
$?^*!: M is pseudoconformally equivalent 
to (I) or (II) of the following: (Ia) S?"*! — 
V 1 $?"*!, where V is a complex vector sub- 
space of C"*! with O dime V «n. (Ib) Q"*! — 
L,:0,0xmzn, where L,, is a certain sub- 
group of SU (n + 1, 1)/(center). (IT) $?"*1 — 
S?"*1 (1R"*1, At present, it seems difficult to 
extend Cartan's classification of all simply 
connected homogeneous hypersurfaces to 
higher dimensions. K. Yamaguchi (1976) 
treated a hypersurface M in a complex mani- 
fold of dimension n with a large automorphism 
group A(M). He showed that if dim A(M)= 
n? + 2n, then M is a real hyperquadric in the 
n-dimensional complex projective space PBC 
(— Section B). He then showed that the sec- 
ond largest dimension for A(M) is equal to 
n? +1 except when n=3 and the index r=1, 
for which dim A(M)=11(=n? + 1). Under the 
additional assumption that M is homoge- 
neous, he showed that if dim A(M) — n? + 1, 
then M is the affine part of a real hyperquadric 
in P,C (except when n= 5 and r=2). He also 
obtained a similar result in the nonhomoge- 
neous case. 


D. Relations to Other Equivalences 


Let D, and D, be bounded domains in C" with 
smooth boundary 0D; = M, (i 1,2) for which 
we denote by H(M;) the CR structures. We 
consider the following propositions (A)-(D) for 
these domains: (A) D, is biholomorphically 
equivalent to D. (B) M, is CR equivalent to 
M. (C) M, is pseudoconformally equivalent to 
M,. (D) There is a diffeomorphism f: D >D, 
such that f|» : D, — D, is biholomorphic. 

It is clear that (C) implies (D) and that (D) 
implies (A). On the other hand, we can prove 
that (B) is equivalent to (D). When does (A) 
imply (B) and when does (B) imply (C)? 
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C. Fefferman [10] proved that (A) implies 
(D) when D, and D, are strictly pseudoconvex. 
S. Bell generalized the result of Fefferman in 
the case when one of D, and D, is strictly 
pseudoconvex. Bell and E. Ligocka [11] 
proved that if M, and M, are real analytic and 
if D, and D, are pseudoconvex, then (A) im- 
plies (D). When D, and D, are not strictly 
pseudoconvex and M, is not real analytic, we 
do not know whether (A) implies (D) or not. 
As remarked by Burns, Shnider, and Wells 
(1978), by using the theorem of Fefferman, 
we can prove that (A) implies (C) when M, 
and M, are real analytic and if D, and D, 
are strictly pseudoconvex. I. Naruki [12] 
obtained the same result. We do not know 
whether (A) implies (C) when M, and M, are 
real analytic and D, and D, are pseudoconvex, 
though we know that (A) implies (B). We do 
not know whether (B) implies (C) in general. 

S. I. Pinchuk [13] proved the following: 

Let D, D' be strictly pseudoconvex domains in 
C" with simply connected real analytic bound- 
aries 6D, OD Let f: UC" be a nonconstant 
holomorphic mapping from a connected neigh- 
borhood U of a point pedD in C" into C" such 
that f(U 10D) c OD Then we can find a holo- 
morphic mapping 7: D D' such that f(x)= 
f(x) for xe DN U. Combining this theorem 
with Fefferman's result we see that for two 
domains as above, D is biholomorphically 
equivalent to D' if and only if 0D is locally 
pseudoconformally equivalent to ôD’, i.e, there 
are neighborhoods U and V of a point pech 
and qecD’, respectively, such that U (10D and 
VND’ are pseudoconformally equivalent. 

Concerning the *proper holomorphic map- 
pings rather than diffeomorphisms, Burns and 
Shnider (1979) proved the following theorem: 
Let M;= 0D; (i= 1,2) be strictly pseudoconvex, 
and let f: D, >D, be a proper holomorphic 
mapping. (a) If D, = D,, then f extends smoothly 
up to the boundary D,. (b) If 6D, is real analy- 
tic for i— 1, 2, then f extends holomorphically 
past the boundary. 


E. Deformations of Domains 


Let M be a compact connected strictly pseudo- 
convex real hypersurface in a complex mani- 
fold X of dimension n+ 1. Let o be a smooth 
strictly tplurisubharmonic function defined 

on a neighborhood V of M such that M — 
(xeV|o(x) 20] and de #0 on M. Let U = 
(ixeV| —e« o(x) « v) for small ¢>0 such that 
U is compact and OU is smooth. Let A(U) 

be the open set in C*(U) of strictly plurisub- 
harmonic functions y with dy ^ dy ^ (dd 40 
on U. Let Bc R* be a small open ball around 
0. We denote by A(U x B) the set of y e C^(U 
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x B) such that y (x, t) 2 ,(x)e AV) for all te B. 
For peAU x B) we set M, ,={xeU|W,(x)= 
ô}. After introducing these notations, Burns, 
Shnider, and Wells (1978) proved the following 
theorem. There exists an open dense set ¥ c 
AU x B) with qe Y^ and a set of tsecond 
category 4? c Y^ such that for every v e 4, 
t;€ B and Ae small enough, (i) M,, is CR- 
equivalent to M,, 5, if and only if t, 215,0, = 
65. (it) The group of CR-automorphisms of 

M, „ə, reduces to the identity only. For we 
YU x B), taking te B, óc R small enough, M, ; 
is a compact connected strictly pseudoconvex 
hypersurface in X. If M bounds the relatively 
compact region D in X then M, ; also bounds 
a relatively compact region D, ;. In particular, 
there exist smooth families of deformations 

of the unit ball in C"*! of arbitrary high di- 
mension. There are arbitrary small perturba- 
tions of the unit sphere in C"*! that admit no 
pseudoconformal transformations other than 
the identity. 


F. Topics Related to Pseudoconformal 
Geometry 


(1) Pinchuk (1975) proved the following: Let 
D,, D, be strictly pseudoconvex domains in C" 
with C? boundary 0D,, 0D,. Let U be a neigh- 
borhood of a point pech, in C". If there is a 
C'-mapping f:U N D, such that f is 
holomorphic on UND, and f(UNéD,)céD,, 
then there is a holomorphic mapping f: U’> 
C" of a neighborhood U’ of U N ôD, into C" 
such that f(x)— f(x) for xe UND, NU’. This 
result is related to the implication (B) ^ (C) in 
Section D. 

(2) H. Alexander [14] proved the follow- 
ing: Let U be a connected neighborhood of a 
point pe S?"! in C" and f: UC" a holomor- 
phic mapping such that f(U 15?" !) c $?"^!. 
Then either f is a constant mapping or there is 
a biholomorphic automorphism f: B, B, of 
the unit open ball B, such that f(x) = f(x) for 
xeU N B,. He also proved that every proper 
holomorphic mapping f: B, B, is necessarily 
an automorphism of B, if n» 1. G. M. Henkin 
(1973) proved that every proper holomorphic 
mapping f: D, D, of a strictly pseudoconvex 
domain D, into a strictly pseudoconvex D, can 
be extended continuously to a function f: D, 
D. More precisely, there is a constant c>0 
such that |f(z,) — f(z;)| &c|z, —z,|'” for every 
Zu, Z2€D,. 

(3) Let M be a real hypersurface in C"*! 
with H(M) the bundle of holomorphic tan- 
gent vectors to M. We take a real nonvanish- 
ing 1-form Ó that annihilates H(M). S. M. 
Webster (1978) called the pair (M, 0) a pseudo- 
Hermitian manifold. He considered the equiv- 
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alence problem of pseudo-Hermitian mani- 
folds by applying Cartan's method of equiv- 
alence. He proves, among other things, that 
the group of all pseudo-Hermitian transforma- 
tions of the nondegenerate pseudo-Hermitian 
manifold (M, 0) of dimension 2n-- 1 is a Lie 
transformation group of dimension not ex- 
ceeding (n+ 1)?. Webster considered the rela- 
tion between pseudo-Hermitian manifolds 

and pseudoconformal geometry and proved 
that for n>2 the ellipsoid E given by the equa- 
tion A, x7 + B, yp t+... + aan Een + Bass Yati = 
1, where z, =x, +iy, (k=1,...,n+1) is pseudo- 
conformally equivalent to the hypersphere 
S?"*! if and only if A, = B, (k=1,...,n+ 1). 
This result gives, by virtue of Fefferman’s 
theorem, a necessary and sufficient condition 
for an ellipsoidal domain to be biholomorphi- 
cally equivalent to the unit ball. 
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345 (XIII.33) 
Pseudodifferential Operators 


A. Pseudodifferential Operators 


Pseudodifferential operators are a natural 
extension of linear partial differential opera- 
tors. The theory of pseudodifferential opera- 
tors grew out of the study of singular integral 
operators, and developed rapidly after 1965 
with the systematic studies by J. J. Kohn and 
L. Nirenberg [1], L. Hórmander [2], and 
others. The term “pseudodifferential operator" 
first appeared in Kohn and Nirenberg [1]. 

Let P be a tlinear partial differential opera- 
tor of the form 
P=p(x,D,)= Y, a,(x)Dz, (1) 

jal <m 

and let u(x) be a function of class Co (Q) 
(c Cg (R")). Then by means of the tFourier 
inversion formula, Pu(x) can be written in the 
form 


pos) exp(ix:¢)p(x,¢)a(c)de, (2) 
~ 


where á(£) denotes the tFourier transform of 
u(x) (— 160 Fourier Transform H). But this 
representation of Pu(x) has a meaning even if 
p(x, č) is not a polynomial in č. Thus, for a 
general function p(x, č), the pseudodifferential 
operator P = p(x, D,) with the symbol p(x, €) is 
defined by (2). A symbol class is determined in 
accordance with various purposes, but it is 
always required that the corresponding opera- 
tors have essential properties in common 

with partial differential operators. Hórman- 
der [3] defined a symbol class A7 ,(€2) for real 
numbers m, p, and ô with p 20 and 620 in the 
following way: Let p(x, £) be a C?-function 
defined in Q x R”. If for any pair of multi- 
indices «, f and any compact set K c R", there 
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exists a constant C, , x such that 
|D2DPp(x, £)| < C, g, (1 + |)" 9t og, 


xeK, Zen 


then p(x, č) is said to be of class S7 (Q). The 
operator P defined by (2) is called a pseudo- 
differential operator (of order m) of class 

SD (Q) and is often denoted by P=p(x, Die 
87. 4(Q). When Q=R" and constants C, , «= 
C, , are independent of K, we denote $7 ,(R") 
simply by S7',, and set 


= N w(-. () 2) 


Ze U S 
—oo«m«oo 
Differential operators (1) with coefficients of 

class 4 (— 168 Function Spaces B(13)) belong 
to ST o. The complex power (AFD of 1— 
A-1—27.,0?/0x? is defined as a pseudo- 
differential operator of class ST*; by the sym- 
bol (1 --|£|?)*?. Operators of class S7; are 
continuous mappings of F into S. Therefore, 
for any real s, the operator (1 — A? can be 
uniquely extended to be a mapping of S’ into 
ST by the relation 


(1 — AY? u, v) = <u, (1 — A? v5, 


Ve, vef. 


For any 1 <r < oo and real s, the 'Sobolev 
space H*' is defined by | 


H*' (ue '|(1— Ay? ue L,(R"], 
which is a Banach space provided with the 
norm Tu, — IO — AY" ull, . In particular, 


H*— H*? is a Hilbert space with the norm 
lulls = Wull, Set 


H ?'— U HS”, H” =H *? 
H?” = N Hr, Ha ÉH*?. 
—o«s«oo 
Then 
J"5H "56, H *5L,(R")25H*?(c4). 


Choosing the Hórmander class Sz in the 
case 0<6<p<1 and ó «1 as a model class, 
we here list the main results of the theory of 
pseudodifferential operators: 

(i) Pseudolocal property. The operator P of 
class $7, in general does not have the local 
property ue F’ = supp Duc suppu, but if p> 
0, then P has the pseudolocal property ue S’ > 
singsupp Pu c singsuppu [3]. 

(ii) Algebra of pseudodifferential operators. 
Let P= p(x, Dies, and P;= pí(x, D,)e Sps, 
j=1, 2. Then there exist P* = p*(x, D,)e S7, 
and Q =q(x, D,)e S7: such that (Pu, v) = 
(u, P*v) for u, ve Z, i.e., P* is the formal ad- 
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joint of P, and Q = P, P. Furthermore, if 
we set p¥(x, £) = Di(iD,"p(x, £) and q,(x, £)-— 
(iD.)*p (x, €) Dp;(x, č), then for any integer 
N we have 





1 DE 
p*(x, 9- Y gt eyes” n (3) 
and 

1 X 
q(x, 9- Part pes turum (4) 


Hence the operator class 57, is an algebra in 
the sense 


PES s, Pe S55 q= E 
=> P*c$5, P,tP,eS55, Pen, 


where m, =max(m,,m,). In particular, if 0 < 
ó«px], we have m—(p —ó)N — —oo and 
m,+m,—(p—06)N-» —oo as No oo. Then, we 
say that p*(x, č) and q(x, č) have asymptotic ex- 
pansions in the sense of (3) and (4), respectively: 


1 
p*(x, £) ~EP (x, č) 
and 
1 
q(x, č) ~È 465 é) 


[3.4]. 
(iii) H*-boundedness. For P eS% ; and any 
real s there exists a constant C, such that 


Pull, CG Hu, er 


[4]. 

(iv) A sharp form of Garding’s inequality. 
Let p(x, &) - (pg Gx, č); j, k— 1, ..., 1) bea Her- 
mitian symmetric and nonnegative matrix of 
DA, £)e S7 ,. Then for P = p(x, D,) there exists 
a constant C such that 


ue H**" (5) 


Re(Pu, u) > — C ulli - to - o2 (6) 


where u — (u,, ...,uj) with uje H"^, j— 1, ...,l, 
and lullZ;s ES Xii EI 

(v) Invariance under coordinate transfor- 
mations. Assume that O«1—pxóxpxl. Let 
x(y)=(x,(y), .... x,(y)) be a C*-coordinate 
transformation from Rj onto R} such that 
Ox,(y)/€y;e 2, j, k-1,...,n,andC Te 
|det(0, x(y))| € C for a constant C>0, where 
det(é,x(y)) denotes the determinant of the 
Jacobian matrix (0,x(y)) ^ (Ox,( y)/0y;). Then 
for any P— p(x, D,)e S7 ; in R}, there exists 
an operator Q — q(x, D,)eS7 in R} such that 
(Qw)(y)  (Pu)(x(y)) for w(y) 2 u(x(y))e ^. This 
fact enables us to define pseudodifferential 
operators on C^-manifolds [3, 4]. 

(vi) Parametrix. For a given PeS7,, an 
operator Een, is called a left (resp. right) 
parametrix of P if EP — 1I (resp. PE — I) is of 
class S *. If E is a left and right parametrix of 
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P, we call it a parametrix of P. For a differen- 
tial operator P, the existence of a left (resp. 
right) parametrix is a sufficient condition for P 
to be thypoelliptic if p 0, (resp. the equation 
Pu- f € 2' is locally solvable). 

The estimate (5), in particular, when m=s= 
0, has been obtained by Hórmander [3], V. V. 
Grushin (Functional Anal. Appl., 4 (1970)), 

H. Kumano-go (J. Fac. Sci. Univ. Tokyo, 17 
(1970)) when 0€ ó « p <1, and A. P. Calderon 
and R. Vaillancourt [5], H. O. Cordes (J. 
Functional Anal. 18 (1975)), T. Kato (Osaka 

J. Math., 13 (1976)), Kumano-go [4], and 
others when Ox óx px and ó « I. A sharp 
form of Gárding's inequality has been proved 
by Hórmander [6], P. D. Lax and L. Niren- 
berg [7], and sharpened by A. Melin (Ark. 
Mat., 9 (1971)), C. Fefferman and D. H. Phong 
(Proc. Nat. Acad. Sci. US, 76 (1979)), and 
Hórmander [8]. A general sufficient condition 
for the existence of a parametrix for an opera- 
tor of class S7; was obtained by Hórmander 
[3]. Let P=p(x, D,) belong to S7; with 0< 

ó « p & 1l. Assume that the symbol p(x, €) satis- 
fies the following conditions: (1) for some Cp > 
0, real m ( x m), and R>0, we have |p(x, €)| > 
Colé" (E| > Ry (ii) for any o, f there exists a 
constant Co, e such that 


| DE DE p(x, £)/p(x, EH < Cal del, 


{E|>R. 


Then there exists a parametrix Q — q(x, D,) of 
P in the class $, 7". 

By means of operators of class S? 9(Q) we 
can define the wave front set of ue Z'(Q), 
which enables us to resolve singsuppu on 
T*(Q) 0, the cotangent bundle of Q minus its 
zero section. An operator P = p(x, D)eSt at 
is said to be microlocally elliptic at (GH. £9) e 
T*(Q) 0 if lim, ,, |p(x9, «£9)|/|c£9|" > 0. For 
a distribution ue Z'(€2), we say that a point 
(x9, £9) of T*(Q)~0 does not belong to the 
wave front set (or the singular spectrum) of u, 
denoted by WEG, if there exist a(x), b(x)e 
Ce (Q), a(x°) 40, b(x?) #0, and PeS? (9), 
which is microlocally elliptic at (x^, £9), such 
that aPbue Co (Q). Then we easily see that 
WF(u)is a closed conic subset of T*(Q)~0. An 
important fact is that the relation sngsuppu = 
Proj, WF (u) (the projection of WF(u) on Q) 
holds, from which we can perform a so-called 
microlocal analysis, the analysis on T *(Q) 0, 
of sing suppu. As the sharp form of the pseudo- 
local property of an operator P eS% s, if 
0xó«pxl, P has the micro-pseudolocal prop- 
erty: ue F’ > WF(Pu)c WF(u). 

Pseudodifferential operators of multiple 
symbol have been defined by K. O. Friedrichs 
(Courant Inst., 1968) and Kumano-go [4]. 
More refined and useful classes of pseudo- 
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differential operators have been defined by R. 
Beals (Duke Math. J., 42 (1975)) Hórmander 
[8], and others. 

The theory of pseudodifferential operators 
has found many fields of application, such 
as M. F. Atiyah and R. Bott (Ann. Math., 86 
(1967)) on the tLefschetz fixed-point formula; 
Friedrichs and P. D. Lax (Comm. Pure Appl., 
18 (1965)) on symmetrizable systems; Hórman- 
der [6], Yu. V. Egorov (Russian Math. Surveys, 
30 (1975)) on subelliptic operators; Kumano- 
go (Comm. Pure Appl. Math., 22 (1969)), F. 
Treves (Amer. J. Math., 94 (1972)), S. J. Alin- 
hac and M. S. Bouendi (Amer. J. Math., 102 
(1980)) on uniqueness of the Cauchy problem; 
S. Mizohata and Y. Ohya (Publ. Res. Inst. 
Math. Sci., 4 (1968)), Hórmander (J. Analyse 
Math., 32 (1977)) on *weakly hyperbolic equa- 
tions; C. Morawetz, J. V. Ralston, and W. A. 
Strauss (Comm. Pure Appl. Math., 30 (1977)), 
M. Ikawa (Pub. Res. Inst. Math. Sci., 14 (1978)) 
on the exponential decay of solutions; and 
Nirenberg and Treves [16], Beals and Feffer- 
man [17] on local solvability theory. 

For recent developments in the theory of 
pseudodifferential operators and its applica- 
tions — Kumano-go [4], M. Taylor (Prince- 
ton Univ. Press, 1981), Treves (Plenum, 1981), 
and others. 


B. Fourier Integral Operators 


A Fourier integral operator A: C5 (R^) Z'(R") 
is a locally finite sum of linear operators of the 
type 


Af(x) (anyon | 


exp(io(x, 0, y)) 
RN*^ 


xa(x, 6, y)f(y)dyd0. (T) 


Here a(x, 0, y) is a C?-function satisfying the 
inequality 


(DS Dj Di a(x, 0, y| « C(14- ||)" 8i *a "eidel 1D 


for some fixed m and p, 1/2x p « 1, and any 
triple of multi-indices o, f, y, and for @(x, 0, y) 
a real-valued function of class C? for 0 0 and 
homogeneous of degree 1 in 0 there. The func- 
tion q is called the phase function and a the 
amplitude function. 

Let C, — ((x, 0, y)| de (x, 0, y) 20, 840} and 
W = {(x, yje R” x R"|30 #0 such that (x, 0, vie 
Cp}. If d, , (x, 0, y) £0 for 040, then the 
kernel distribution k(x, y) of A is of class C” 
outside W. There have been detailed studies 
of the case where the d, 4 ,(Op(x, 0, y)/00j), 
j=1,2,...,N, are linearly independent at 
every point of C,. In this case, C, is a smooth 
manifold in R” x (R"-.0) x R”, and the mapping 
0:C,5(x, 0, y)o (x, y, n), $£—d,0(x,6, y n= 
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d, p(x, 0, y), is an immersion of C, to T*(R" x 
R") 0, the cotangent bundle of R” x R” minus 
its zero section. The image OC, — A, is a conic 
Lagrange manifold, i.e., the canonical 2-form 
o = de; dx;—Ljdn; dy; vanishes on A, 
and the multiplicative group of positive num- 
bers acts on A,. Let A,,...,42, be a system of 
local coordinates in A,. These, together with 
00/00;,...,0@/08,, constitute a system of 
local coordinate functions of R” x (R^ \.0) x R” 
in a conic neighborhood of C,. Let J denote 
the Jacobian determinant 


ô a 
glatt Ei 


00, 00, 
D(x, 0, y) 


The function a m alc, ' is called the 
local symbol of A. Here alc, is the restriction of 
a to C,. The conic Lagrange manifold A, = 
A,(A) and the symbol a, =a, (A) essentially 
determine the singularity of the kernel distri- 
bution k(x, y) of the Fourier integral operator 
A. Conversely, given a conic Lagrange mani- 
fold A in T*(R" x R")\0 and a function a, 

on it, one can construct a Fourier integral 
operator A such that A,(A)=A and ay (A)= 
a,. Those Fourier integral operators whose 
associated conic Lagrange manifolds are the 
graphs of homogeneous fcanonical transfor- 
mations of T*(R")-.0 are most frequently used 
in the theory of linear partial differential equa- 
tions. Let A be a Fourier integral operator 
such that A,(A) is the graph of a homoge- 
neous canonical transformation x. Then the 
adjoint of A is a Fourier integral operator 
such that the associated conic Lagrange mani- 
fold is the graph of the inverse transformation 
X. Let A, be another such operator; if the 
associated conic Lagrange manifold is the 
graph of y,, then the composed operator A, A 
is also a Fourier integral operator and the 
associated conic Lagrange manifold is the 
graph of the composed homogeneous canon- 
ical transformation y, y. 

A pseudodifferential operator of class 
ST, -,(R") is a particular type of Fourier inte- 
gral operator. In fact, a Fourier integral oper- 
ator A is a pseudodifferential operator of class 
AS", -(R") if and only if A,(A) is the graph of 
the identity mapping of T*(R")--0. Hence for 
any Fourier integral operator A, A*A and 
AA* are pseudodifferential operators. 

The following theorem is due to Egorov 
[11]: Let P and Q be pseudodifferential opera- 
tors of class $7, (R") with the symbols 
p(x, č) and q(x, EL respectively, and let A be a 
Fourier integral operator such that the asso- 
ciated conic Lagrange manifold A,(A) is the 
graph of a homogeneous canonical transfor- 
mation y of T*(R")~0. If the equality PA=AQ 
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holds, then g(x, 5) — p(x(x, €)) belongs to the 
class $711," ^(R"). 

Assume that m — 1, p — 1, and that p, (x, č) 
is a real-valued C?-function, homogeneous 
of degree 1 in č for |€]> 1, such that p(x, £) 

— pi lx, Ce S o(R") and dp (x9, £9) 40 at 

(x9, £9), |£9| » 1, where p, (x°, £9) 20. Then one 
can find a Fourier integral operator A such 
that the function q(x, €) of Egorov’s theorem 
satisfies the relation q(x, č)— č, € S? 9 (R"). 

The theory of Fourier integral operators has 
its origin in the asymptotic representation of 
solutions of the wave equation (— 325 Partial 
Differential Equations of Hyperbolic Type 
L; also, e.g., (12, 13, 14]). G. I. Eskin (Math. 
USSR-Sb., 3 (1976)) used a type of Fourier 
integral operator in deriving the energy esti- 
mates and constructing the fundamental solu- 
tions for strict hyperbolic operators. Hór- 
mander (Acta Math., 121 (1968)) introduced 
the term "Fourier integral operators," and 
applied these operators to the derivation of 
highly accurate asymptotic formulas for spec- 
tral functions of elliptic operators. Egorov 
(Math. USSR-Sb., 11 (1970)) applied his theo- 
rem and the corollary stated above to the 
study of hypoellipticity and local solvability 
for pseudodifferential operators of principal 
type. Using Egorov's theorem and the same 
corollary, Nirenberg and Treves [16] obtained 
decisive results concerning local solvability for 
linear partial differential operators of principal 
type; these results were completed by Beals 
and Fefferman [17]. Hórmander and J. J. 
Duistermaat [9, 15] constructed a general 
global theory of Fourier integral operators 
making use of Maslov's theory [14], which 
was originally published in 1965. By virtue 
of this research, the Fourier integral oper- 
ator has come to be recognized as a powerful 
tool in the theory of linear partial differential 
operators. An interesting application of the 
global theory of Fourier integral operators 
appeared in J. Chazarain (Inventiones Math., 
24 (1974)). The boundedness of Fourier inte- 
gral operators in the spaces L?(R") (or the 
space H?) has been studied in several cases. 
Some sufficient conditions for boundedness 
have been obtained by Eskin (Math. USSR- 
Sb., 3 (1967)), Hórmander [9], D. Fujiwara 
[18], Kumano-go (Comm. Partial Diff. Eq., 1 
(1976)), K. Asada and Fujiwara (Japan. J. 
Math., 4 (1978)), and others. A calculus of 
Fourier integral operators in R" was given in 
Kumano-go [4]. 

The propagation of wave front sets by 
means of a Fourier integral operator is de- 
scribed as follows. Let us consider a phase 
function of the form (x, £, y)=S(x, č)— y- č in 
R} x R? x R}, and let a(x, č) be an amplitude 
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function independent of y of class S 9. Then 
by (7) the Fourier integral operator A = Ae is 
defined by 


Agu(x) ^ (2x) sel 


R 


.exp(iS(x, Sal, $)8() dc. 
(8) 


Let T be the canonical transformation with 
the tgenerating function S(x, č), i.e., T is de- 
fined by y=V-S(x,n), £— V,S(x, n). Then for 
the Fourier integral operator A, we have 


WF(Asu)c {(x, lz T(y. IG. e WF(u)j, 
ve, (9) 


Next consider a hyperbolic operator L= D, + 
p(t, x, D,) for a real-valued symbol p(t, x, £)e 
48? ([0, Ty]; al with some T,>0. For a small 
O« Tx T, the solution S(t, x, č) of the eikonal 
equation 0,S + p(t, x, V.S) 20 on [0, T] with 
the initial condition S|,-9 =x: exists in 4! 
([0, T]; St o). Consider the Cauchy problem 
Lu=0 on [0, T], ol as uo. Then there exists 
an amplitude function e(t, x, £)e 2! ([0, T]; 

S? o) such that the solution u(t) is found in 
the form u(t) — E,(t)uy. On the other hand, 

let (x, &) 2 (X (t, y, n), E(t, y, n)) be the bichar- 
acteristic strip defined by *Hamilton's ca- 
nonical equation dx/dt = V.;p(t, x, č), dč/dt = 

—V, p(t, x, č) with (x, &)y-o — (y, n). Then 

(X (t, y, n), E(t, y, n)) can be solved by means 
of the relations y=V,S(t, X, n), E — V.S(t, X, n), 
as a family of canonical transformations with 
a parameter te [0, T']. Thus by means of (9) 
we have 


WF(u(t)) c {(x, Ess QX (t, v. n), E(t, y, n) 
((y.r)eWF(ug)j, (10) 


which is the fundamental result in the study of 
the propagation of wave front sets as solutions 
of general hyperbolic equations (-— 325 Partial 
Differential Equations of Hyperbolic Type M). 

The works of Egorov, Nirenberg and 
Treves, and Hórmander motivated the theory 
of hyperfunctions developed by M. Sato and 
gave rise to the concept of tquantized contact 
transformations, which correspond to Fourier 
integral operators in the theory of distribu- 
tions. The above-stated transformation theo- 
rem of Egorov has been studied in detail with 
reference to systems of pseudodifferential 
equations with analytic coefficients [19] (— 
274 Microlocal Analysis). 
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A. General Remarks 


Psychometrics is a collection of methods for 
drawing statistical conclusions from vari- 

ous psychological phenomena which are ex- 
pressed numerically or quantitatively. It con- 
sists chiefly of statistical methods to deal with 
psychological measurements and of theories 
dealing with mathematical models concerning 
learning processes, social attitudes, and mental 
abilities. 


B. Sensory Tests 


A measurement wherein human senses are 
taken as the gauge is called a sensory test. The 
panel of judges must be composed appropri- 
ately, and the examining circumstances must 
be controlled. Various methods of psycholog- 
ical measurements are applied. In the following 
sections we describe the basic statistical proce- 
dures used in sensory testing. 


C. Paired Comparison 


When there are t objects (treatments or stimuli 
in some cases) O,, O,, ..., 0,, the method of 
comparing them two at a time in every pos- 
sible way is called paired comparison. The 
following are typical mathematical models of 
this method. 


(1) Thurstone-Mosteller Model. Suppose that 
the probability that O; is preferred to O; for a 
pair (0;, Oj) is pj. Of the n judges who compare 
this pair, the number who prefer O; is n,;, and 
the number who prefer O; is nj; =n—n,;. In this 
comparison it is assumed that the strengths of 
the stimuli O;, O; to the senses are random 
variables X;, X;, and O; is preferred when X; 
X;. Furthermore, it is assumed that the joint 
probability distribution of X; and X; is the 2- 
dimensional tnormal distribution with jj; and 
6? as mean and variance of X,, and p as corre- 
lation coefficient of X; and X;. There is no loss 
of generality in assuming that 2c?(1— p)— 1 
and Xi., #;=0. Let B(x) be the standardized 
normal distribution function and pj; — II: — 


H 


DJ. Using p;;=n;/n as estimates of the true 
py, we can obtain the estimates f;. Using pj; 
= Q( fj; — i) and pj; we can test the hypothesis 


that p; = 45 — ...— He 


(2) The Bradley-Terry Model. The experi- 
mental method in the Bradley-Terry model 
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is the same as in the Thurstone- Mosteller 
model. It is postulated that, associated with 
O,, O,, ..., O, there exist parameters 7; for O; 
(1; Z0, Xi- 2;= 1) such that pj — z/(n; + 7). 
Obtaining the *maximum likelihood estimator 
of z;, we can test the appropriateness of the 
models. 


(3) Scheffé’s Model. Each pair O;, O; is pre- 
sented to 2n judges; n of them examine OQ, first 
and O; next, and the remaining n examine the 
pair in the opposite order. A judgment is re- 
corded on a 7-point (or 9-, 5-, or 3-point) scale. 
In the 7-point scaling system a judge pre- 
sented with the ordered pair (0j, O) marks one 
of the seven points 3, 2, 1,0, —1, —2, —3, 
meaning, respectively, that O; is strongly pre- 
ferable to O;, O; is moderately preferable to O;, 
O; is slightly preferable to O;, no preference, O; 
is slightly preferable to O;, etc. The mark given 
by the kth judge on his preference of O; to O; is 
denoted by XA, which can be regarded as the 
sum of a main effect, deviation of subtractivity, 
order effect, and error. Significance tests for 
these effects and estimates of various para- 
meters are given by using statistical tlinear 
models. *BIBD, tPBIBD, etc., can also be 
applied to paired comparisons. 


D. The Pair Test, Triangle Test, and Duo-Trio 
Test 


The pair test, triangle test, and duo-trio test 
are sensory difference tests. The methods are 
as follows. Pair test: À judge is requested to 
designate a preference between the paired 
samples A and B. Triangle test: A judge is 
requested to select two samples of the same 
kind out of A, A, B. Duo-trio test: A judge is 
first acquainted with a sample A and then is 
requested to choose from A and B the one he 
has seen in the previous step. In all the above 
cases, the hypothesis that A and B are different 
and that the judge has no ability to determine 
the difference between them is tested by using 
the *binomial distribution. 


E. Scaling 


(1) One-Dimensional Case. Psychometric 
scaling methods are procedures for construct- 
ing scales for psychological phenomena. Some 
of them require judgments concerning a par- 
ticular attitude that is considered unidimen- 
sional. Under the assumption that a psycho- 
logical phenomenon is a random variable 
with some distribution law and the parameters 
of the distribution law determine psycholog- 
ical scales, a psychological scaling is given by 
estimating the parameters. The Thurstone- 
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Mosteller model is a method for scaling a set 
of stimuli by means of observable proportions. 


(2) Multidimensional Scaling (MDS). Multidi- 
mensional scaling is a collection of methods to 
deal with data consisting of many measure- 
ments on many objects and to characterize the 
mutual distance (dissimilarity), or closeness 
(affinity), by representing those objects by a 
small number of indices or by points in a 
small-dimensional Euclidean space. It has seen 
useful applications in the analysis of people's 
attitude and perception and their characteri- 
zations by means of a few numbers or points 
in a space of low dimension. 

Historically, MDS was first developed 
by Torgerson (1958) and refined further by 
Shepard (1962) and Kruskal (1964). The 
method developed by Torgerson and also the 
INDSCAL method by Carrol and Chang 
(1970) are called metric multidimensional scal- 
ing, while the method by Shepard and Kruskal 
is called the nonmetric MDS. The former is 
applied when the data are represented in con- 
tinuous scales and the latter when the data are 
in discrete nominal or ordinal scales. Tech- 
niques of multidimensional scaling are closely 
related and sometimes actually equivalent to 
various methods of multivariate analysis, 
especially principal component analysis, ca- 
nonical correlation analysis, and discriminant 
analysis (— 280 Multivariate Analysis). 


F. Factor Analysis 


Though factor analysis can be considered to 
be a method to deal with multivariate data in 
general (— 280 Multivariate Analysis), it has 
had close connections with psychometric 
studies, in both theoretical developments and 
applications. Historically, it was initiated by 
Spearman (1927) and developed further by 
Thurston (1945) in order to measure human 
abilities from test scores. Mathematically, the 
model of factor analysis is formulated as fol- 
lows: Let za be the standardized score of the 
jth test achieved by the kth subject, j= 1, ..., p; 
k=1,...,N; then it is assumed that it can be 
represented as a linear combination of r com- 
mon factors and one specific factor as 


Zo — aj fit jo fat ss aj fad Hr, (1) 


where fi, i= 1,...,r, k— 1, ..., N, represents the 
magnitude of the ith common factor (ability) 
in the kth subject and a; is the size of contri- 
bution of the ith factor to the score of the jth 
test. 
Usually it is assumed that (i) V(z,) = 1, (11) 

V(f)=1 and Cov(f,, f.) =0 for iz i', (i1) V(v;) 
= I and Cov(y;, v;)=0, j 4)’, (iv) Cov(f;, vj) =0. 
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Then it follows that 

fo = Cor(z;, zy) = Ki y + KI KI + eee + jrr» 
and 

1—aj +a} + vols Lotus hä Ju, 


where h? is called the communality of the jth 
variable z, and u? is called the specificity. It 
follows easily from (1) that any orthogonal 
transformation of the scores does not affect the 
model. 

Problems of factor analysis are classified 
into three types: 
(1) Estimation of communality: There are 
several methods of determining communality 
or initial estimates of it when some iterative 
procedure is used. 
(2) Determination of factor loadings, which is 
the estimation of the aj: A number of methods 
have been proposed, among which those often 
used are the MINRES by Harman (1967), the 
varimax by Kaiser (1958), and the tmaximum 
likelihood based on the normal model by 
Lawley and Maxwell. 
(3) Estimation of factor scores f;,: Usually fac- 
tor scores are estimated after factor loadings 
have been determined. Thurstone proposed 
F—ZR'!A and Harman F —ZA(A'A) !, 
where F, Z, A are the matrices of factor scores, 
test scores, factor loadings, respectively, and R 
is the correlation matrix of the z's. 


G. Learning Theory 


(1) General Description. Assume that a se- 
quence of trials is done in order to study some 
given behavior and that on each trial 
particular events occur (stimuli, responses, 
reinforcements, etc.) that influence the en- 
suing behavior. Then the behavior itself is 
modified by such a sequence of trials. Learning 
models refer to such processes of behavior 
modification, and they are frequently repre- 
sented by recursive formulas for response 
probabilities. 

Assume that two mutually exclusive re- 
sponse alternatives A, and A, occur on the nth 
trial (n— 1,2, ...) with respective probabilities 
P, and 1 — P,, and that an event E; occurs on 
the nth trial with Pr(é, = E)— m; (i— 1,2, ...,t; 
Èi- 72; 1). Then the recursive formula for P, 
is of the form P,,, = f(B,:6,,6,.,, ..., 64). If 
the formula can be written as f= f(P,; én, 
dn En-a), then the response probability is 
called d-trial path dependent. In the special 
case d =0, it is called path independent. For 
simplicity we write f(P,; 6, = Ej, 6, , = Ej, ..., 
6, Ej)- fij (P) (Lj, -k= 1,2, ..., t). If the 
response probability is path independent, 
then An (P) — fif; ... MP where f,(P,)= 
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f(P,; &, = E,) (v=i,j, ..., k). When the recursive 
formula can be expressed as f = f(P,; &,, n), 
the response probability is said to be quasi- 
independent of path [11]. In the recursive for- 
mula f, two events E; and E; (i#j) are said to 
be commutative if f. ;.., (Pest: (Pn) If 
any two events are commutative, the condi- 
tion of event commutativity is satisfied. By 
making f explicit with respect to n, we write 
B,— F(n; &, ,,..., 6,; Pi). Under the condition 
of event commutativity, the explicit formula 
can be written P, — F(N,, Nz, ..., Ni; P,), where 
N; is the frequency of occurrence of E; in the 
first (n — 1) trials (2;., N;2n— 1). If both event 
commutativity and path independence of re- 
sponse probability are satisfied, the explicit 
formula P. f: f?: ... f'(P,) can be obtained. 


(2) Linear Models. In a linear model, the re- 
cursive formula is written as a linear function 
of P. 

Example (1). Bush-Mosteller model [6]. The 
Bush-Mosteller model assumes the response 
probability to be path independent. The re- 
cursive formula is expressed as f;(P,) = a; P, + 
(1 —2;)4;, 6, — E;. Here a; (0 € o; € 1) represents 
the degree of ineffectiveness of E; for learning 
and A; (0€ 4, € 1) is the tfixed point of f;. A 
necessary and sufficient condition for E; and E; 
(ij) to be commutative is that either f; or f; 
be an fidentity operator or 4;— Aj. 

Example (2). Estes's stimulus-sampling 
model [7]. We can consider the stimulus as a 
set composed of m elements, each of which 
corresponds to either response A, or A,; the 
manner of their correspondences depends on 
each trial. If J, elements correspond to A, 
on the nth trial, then we have P, = J,/m. Sup- 
pose that on the nth trial s ( € m) elements are 
sampled, among which X, elements corre- 
spond to A,, and the remaining X; (—s— X,) 
elements correspond to A,. As a result of 
the nth trial, if A, is reinforced, we set Y, = 1; 
otherwise, we set Y, 20. Furthermore, assume 
that J,,, —J, + X, Y, — X,(1— KL Hence, we 
obtain the recursive formula P,,, = P, + (X, Y, 
— X; (1 — Y,)}/m. In this model, the response 
probability is path independent and 6, = 
(X,, Y,). Other linear models have been pro- 
posed in which the response probability is 
either quasi-independent of path [8] or path 
dependent [10]. 


(3) Nonlinear Models. In nonlinear models the 
recursive formula cannot be written as a linear 
function of P 

Example (3). Luce's f-model [9]. Let the 
response strengths of A, and A, on the nth 
trial be v, and v,, respectively (both positive), 
and assume that P,, the response probability 
of A,, is expressed as v,/(v, 4- v;). The response 
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strengths v, and v, depend on each trial. Under 
the assumption that the response strength is 
path independent and that v„+; changes in- 
dependently from v}, the recursive formula of 
t, Is written as Dn = P:n) 6, = E;. Here, if we 
assume ¢,(v)>0 for v>0 and g;(cv)= co;(v) for 
v7 0 and c» 0, then 9,{v,) Bt with fj; 0. In 
a similar way, the recursive formula for v, can 
be expressed as q;(v,) — f; v, (B; > 0). Therefore 
we have ff = P,/AP, rb Ge P,)} (b; = Bi /B;), 
6, = E;. This model is nonlinear, and the re- 
sponse probability is path independent. By 
making the recursive formula explicit, we 
obtain P, = P, TP, - (1 — P,exp(Zi-1 N;logb;)}. 
Hence it is clear that the events are commuta- 
tive. Other nonlinear models in which the re- 
sponse probability is either quasi-independent 
of path [5] or path dependent [9] have also 
been proposed. 

Here we have taken up only the case in 
which the number of response alternatives is 
2, but we can generalize to the case of more 
than two alternatives. For fitting a model and 
experimental data, expected response prob- 
abilities and various other statistics deduced 
from the model (total error, trial number of 
first success or last error, and sequential sta- 
tistics such as length of response trun or tauto- 
correlation between responses) are used. Es- 
timation methods have also been devised for 
the parameters involved. 
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A. General Remarks 


Any "extension field of the rational number 
field Q of degree 2 is called a quadratic field. 
Any quadratic field k is obtained from Q by 
adjoining a square root of a square-free integer 
Ge, an integer #0, 1 with no square factor) 
m:k- Q(./m). If m is positive (negative), k is 
called a real (imaginary or complex) quadratic 
field (— 14 Algebraic Number Fields). Let 


(un) for m= 1 (mod4), 
E m for m= 2, 3 (mod 4). 


Then (1, œw) is a ‘minimal basis of k. That is, 
any talgebraic integer x of k has the unique 
expression x—a-- bo with a, be Z. The *dis- 
criminant d of k is given by de min case m=1 
(mod 4) and d= 4m in case mz 2,3 (mod 4). The 
conjugate element of an element x — a + bm 
(a, be Q) of k over Q is given by a 2a — b /m. 
The mapping o:x—42' is an tautomorphism of 
the field k. 


B. Units 


Let k be an imaginary quadratic field. The 
tunits of k are +1, iin case m= — 1; +1, 
too, tolo =l t —3)2) in case m= — 3; 
and +1 in all other cases. 

Let k be a real quadratic field. There exists 
a unit e, that is the smallest one among the 
units (> 1) of k. Any unit £ of k can be uniquely 
expressed in the form £= +e% (ne Z). That is, 


+et! is a "fundamental unit of k. The fun- 


damental unit £9 =(x+y./d)/2 (> 1) can be 
calculated by finding a minimal positive inte- 
gral solution (x, y) of "Bells equation x? — 
dy? = +4 by using continued fractions (— 83 
Continued Fractions; for a table of the fun- 
damental unit of k for m< 100 — [1]). 


C. Prime Ideals 


The decomposition of a prime number p in the 
tprincipal order o of k is given as follows: (i) 
Let p|d, where d is the discriminant of k. Then 
p is decomposed in o in the form (p)= p?. (ii) 
Let p #2 and (p,m)=1. If (m/p)=1, then (p)= 
pp’ in o (pp) and N(p)= N(p’)=p. If (m/p)— 
— 1, then (p}=p in o and N(p)=p’. Here p, p' 
are prime ideals of o, N means the fnorm, and 
(m/p) is the *Legendre symbol. (iii) Let 24d, 
that is, mz 1 (mod 4). If mz 1 (mod 8), then 
(2)— pp'(pz v) and N(p)—- N(p)22. If mz 5 
(mod 8), then (2)=p and N(p)— 4. 
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D. The Kronecker Symbol 


Let k = QC/m), and let d be the discriminant 
of k. We define the symbol y for k as follows: 
(i) x(p) — 1 if (p) e p»' (pp) in o; (i) x)= —1 
if (p) » in o; (iii) x(p) 0 if (p)— ET in o; and 
(iv) x(n) = M x(Gp; Y? for n= IT; př > 0. In partic- 
ular, we define y(1) — 1. If (n, d) — L, the sym- 
bol y can also be defined using the tJacobi 
symbol as follows: If m= 1 (mod 4), then y(n) = 
(n/|/m| and if m=2, 3 (mod 4), then y(n) = 
¢i(n)(n/\m'|) for d= 2°m', where (i) 73 (n)= 

( — 1"7!? for e=2, m =3 (mod 4); (ii) x3(m = 
(— Y 795 for e=3, m = 1 (mod 4); and (iti) 
k(n) =(— 1 08* 0777 for e=3, m'=3 
(mod 4). If (n, d) #1, then y(n) C. For a nega- 
tive integer —n, we define x(—ni=(sgnd) x(n). 
The symbol y(n) for neZ is called the Kro- 
necker symbol for k. 

The Kronecker symbol for k has the follow- 
ing four properties: (1) x(n) —O0 In, d) # 1, and 
a(n) = +1 if (n,d)=1; (2) x(m)= x(n) if mzn 
(mod d); (3) x(mn) = (m) x(n); (4) x(n) = 1 if and 
only if nz N (a) (modd) for some integral ideal 
a of k such that (a, (d)) = 1. (Property (4) shows 
that a quadratic field provides a class field; — 
59 Class Field Theory.) 


E. Ideal Classes 


The ‘class number h of k was calculated by 
P. G. L. tDirichlet (1840) by analytical meth- 
ods as follows: 


[ed . FK 
hlogeg= S 2 x(r)logsin-7 for m» 0, 


zi 
Y. x(r for m<0, 


Tag A 


where d is the discriminant of k, £ is the 
positive fundamental unit (> 1) of k, w is the 
number of roots of unity in k, and x is the 
Kronecker symbol. 

Denote by h(d) the class number of the 
imaginary quadratic field wit discriminant d. 
It was conjectured from the time of C. F. 
*Gauss that h(d)— oo as [d| oo. This conjec- 
ture was proved by H. Heilbronn (1934). More 
precisely, C. L. Siegel (Acta Arith., 1 (1935) 
proved 


Jim (logh(d Weg Ni) 


h(d)- 1 holds for |d| — 3,4, 7, 8, 11, 19, 43, 67, 
163. In 1934, Heilbronn and E. H. Linfoot 
proved that there can be at most one more 
such d, and finally A. Baker and H. M. Stark 
independently proved that these nine numbers 
are the only ones for which h(d)= 1 (Baker, 
Mathematica, 13 (1966); Stack, Michigan Math 
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J., 14 (1967)). Also, Baker and Stark proved 
independently (Ann. Math., (2) 94 (1971); Math. 
Comp., 29 (1975)) that h(d) —2 holds only for 
|d| — 15, 20, 24, 35, 40, 51, 52, 88, 91, 115, 123, 
148, 187, 232, 235, 267, 403, and 427. 

For real quadratic fields, we have 


Jim (log(h(d)loge,))/log /4) — 1, 


where e, is the fundamental unit of k= Q(./4) 
(& » 1). However, it is not yet determined 
whether there exist infinitely many d with h(d) 
=1(— Appendix B, Table 4). 


F. Genera 


Let G be the group of all (fractional) ideals of 
k, and let H be the group of all tprincipal 
ideals (x) of k such that N(a)>0. Each coset of 
G modulo H is called an ideal class in the 
narrow sense. (This notion is a special case of 
the notion of ideal classes in the narrow sense 
of algebraic number fields; — 14 Algebraic 
Number Fields G.) In the cases (i) m «0 and 
(11) m» O0, N(s9) 2 — 1, the usual classification 
of ideals and classification of ideals in the 
narrow sense are identical. When m0, N (eo) 
= 1, each ideal class is divided into two ideal 
classes in the narrow sense. We call an ideal 
class in the narrow sense simply an ideal class. 
Let p,,...,p, be the set of all prime numbers 
dividing d. For ne Z with (n,d)=1, we define 
Xi (n), ..., x, (n) as follows: For p;#2, we define 
x(n) =(n/p,); for p, 22, we identify y; with y* in 
the definition of the Kronecker symbol. In 
order that y,(n) — ...—y,(n) 2 1 for ne Z with 
(n, d) — 1, it is necessary and sufficient that 
nz N(a) (mod d) for an integer « of k (where 
N (a) z aa). Since y, y; ... y, is equal to the 
Kronecker symbol, it follows that nz N(a) 
(mod d) for an integral ideal a of k is a neces- 
sary and sufficient condition for y, (n)... 
x,(n)= 1 to hold for ne Z with (n, d) — 1. Put 
&; — yi(N(a)) (i2 L, ..., t), where a is an integral 
ideal with (a, (d)) — 1. Then (e,, ..., £j) is uni- 


quely determined for the ideal class C contain- - 


ing a and does not depend on the choice of a. 
The set § of all ideal classes of k such that 

(£1, ...,&) — (1, ..., 1) is called the principal 
genus of k, and $ is a subgroup of the ideal 
class group € of k. Each coset of © modulo $ 
is called a genus of k. For each genus, the 
values e; = y;,(N(a)) (i9 1, ..., t) are uniquely 
determined. We call (,, ...,&) the character 
system of this genus, and each genus is uni- 
quely determined by its character system. A 
necessary and sufficient condition for (£4, ..., £) 
to be a character system for some genus is that 
£&, — +1 and £,£, ... e, — 1. Hence there are 2⁄7! 
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genera of k, and ©/§ is an Abelian group of 
type (2,2, ...,2). 

In order that an ideal class C belong to the 
principal genus, it is necessary and sufficient 
that C = C1 ^ = C? hold for some ideal class 
C,. From this it follows that there are t—1 
tinvariants of the ideal class group € of k 
that are powers of 2. Each ideal class C such 
that C" =C is called an ambig class of k. There 
are 2* ! ambig classes, and they form an 
Abelian group of type (2,2, ..., 2). Each ideal a 
of k with a^ =a is called an ambig ideal of k. 
Let (p) =p? (i1, ..., t). Then each ambig 
ideal is uniquely expressed in the form a— 
pi! ... p;t(a) by some oe and v; —0, 1. Each 
ambig class contains exactly two ambig ideals 
of the form pj’... pt. For example, for k= 
Q(./ —65) we have d= — 2°: 5-13, t=3, h= 
8, € — (E, A, A?, A3, B, AB, A? B, A?B), where 
A* — E, B? =E, A" = A?, and B^ — B; the prin- 
cipal genus is $= (E, A*}, and the ambig 
classes are {E, A”, B, A?B]. 


G. Norm Residues 


A quadratic field k is the tclass field over Q for 
an ideal group H. The tconductor of H is said 
to be the conductor of k/Q or simply the con- 
ductor of k. The conductor f = II,f, of k= 

OL dml is given by f =d for m>0 and f= 

dp, for m «O0. That is, the p-conductor f, =p 
for p|d, p x 2; and f, =2° for 2|d, d=2°m' 

((2, m’)= 1). By means of the tHilbert norm- 
residue symbol, the Kronecker symbol y is 
expressed by 





Ka for (a,d)=1, 
x(a)— < pid \ P 

0 for (a, d) #1. 
H. History 


The arithmetic of quadratic fields was origi- 
nally developed in terms of the theory of binary 
quadratic forms with rational integral coeffi- 
cients by Gauss and Dirichlet [2]. The theory 
was then translated into the terms of ideal 
theory by J. W. R. Dedekind [2] (— 348 Qua- 
dratic Forms M). For example, the theory of 
genera for quadratic fields explained in Section 
F was first developed by Gauss in terms of 
binary quadratic forms, and the class number 
formula was obtained by Dirichlet as a formula 
for binary quadratic forms. Hilbert [4, ch. 2] 
developed the arithmetic of quadratic fields 
systematically by introducing the Hilbert 
norm-residue symbol (— [1,5,6]). Later the 
arithmetic of quadratic fields assumed the 
aspect of a simple example of class field theory 
(— 59 Class Field Theory). 
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A. General Remarks 


A quadratic form Q is a quadratic homoge- 
neous polynomial with coefficients in a *field 
K, written 


Q(x,, ..., X) Ci Xi X. 


If the coefficients cj, belong to the field of real 
(complex) numbers, we call Q a real (complex) 
quadratic form. Let V be an n-dimensional 
vector space over K. For a vector x in V 
whose coordinates are x,,..., x,, we put Q(x) 
— Q(x,, ..., x,). This gives rise to a mapping 
x2 Q(x) of V into K. Such a mapping satisfies 
the following two conditions: (i) Q(ax) 2 a?Q(x) 
(ae K); and (ii) Q(x + y) - Q(x)  Q(y) = B(x, y) 
is a 'symmetric bilinear form on V. Conversely, 
if a mapping Q: VK satisfies these two con- 
ditions, then Q must come from a quadratic 
form (— 256 Linear Spaces). B(x, y) is called 
the symmetric bilinear form associated with Q. 
We assume that the *characteristic of K is 
not 2. Putting aj, =a,;=C,/2 (i< k), a; — cj 
(i=1,...,n), we have Q(x) 2 È; -1 XiX. The 
matrix A —(a;,) is the matrix of the quadratic 
form Q, and the determinant |A] is the dis- 
criminant of Q, denoted by A(Q). (Sometimes, 
instead of | A, we call (—1)""^!/72"| A], the 
discriminant of Q.) The rank of the matrix A is 
called the rank of Q. Using the notation for the 
"inner product of vectors, we can write Q(x) 
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—(x, Ax)='xAx; 2 ! B(x, y) LN, Ay) «'xAy. 
We say that Q is nondegenerate if B(x, y) is 
nondegenerate De, if |A| 40). 

Consider a linear substitution x; = ZP, Px: 
(Le., x= Px’, with an n x m matrix P). Then we 


get a new quadratic form OG) with the ma- 


trix 'PAP. If each p; belongs to the field K (to 


a subring R of K that contains the unit ele- 
ment of K), we say that Q represents Q' over 
K (resp. R). A basic problem in the theory of 
quadratic forms is to determine the exact 
conditions under which a given quadratic form 
Q represents another quadratic form Q’. The 
problem of representing numbers by a quadra- 
tic form (representation problem) is the par- 
ticular case corresponding to m= |. Any qua- 
dratic form Q represents 0 by taking the zero 
matrix as P. Hence, by the expression “Q 
represents 0 over K,” we usually mean the 
nontrivial representation of 0 over K by Q, i.e., 
Q(x)=0 for some nonzero vector x. If Q is 
nondegenerate and represents 0, then it repre- 
sents any element of K*. Given ar element u 
in K*, we consider the quadratic form Q' de- 
fined by Q'(x,, ...,x,,,) = Q(x) — ux2,,. Then 

Q represents u if and only if Q’ represents 0. 

Another important special case is that of 
n —m. Then the discriminant of Q’ is given by 
| P|?| AL In particular, if | P| #0 (|P| is an inver- 
tible element of R), we say that Q is equiva- 
lent to Q' over K (resp. R). This gives rise to 
an equivalence relation. Equivalent forms 
have the same rank. On the other hand, if the 
rank of Q is r, then Q is equivalen: to a form 
E- a;x? over K (a; Z0, i— 1, ...,r). Generally, 
for elements a and b in K* = K — {0}, we write 
ac bif a:b" e(K*)*. Then if Q is equivalent to 
Q', we have A(Q)~ A(Q’). 

When we specify a field K, we assume that 
the coefficients of the quadratic forms and the 
coordinates of linear transformations are all 
contained in the field K. In particular, the 
equivalence of the forms is equivalence over K. 


B. Complex Quadratic Forms 


If K is the field of complex numbers C, then 
a form of rank r is equivalent to the form 
371. x2; hence over € two forms of the same 
dimension are equivalent if and only if they 
are of the same rank. 


C. Real Quadratic Forms 


Now let K be the field of real numbers R. If Q 
is of rank r, then it is equivalent to the form 
Er- x? — LG. x24; (p+q=r). Here, p and q 
are uniquely determined by Q (Sylvester's law 
of inertia). We call ( p, q) the signature of Q. 
Two quadratic forms of the same dimension 
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are equivalent if and only if they have the 
same signature. A quadratic form with the 
signature (n, 0) (resp. (0, n)) is called a positive 
(negative) definite quadratic form. Q is called a 
definite quadratic form if it is either positive or 
negative definite; otherwise it is an indefinite 
quadratic form. Each of the following con- 
ditions is necessary and sufficient for a form Q 
to be positive definite: (1) for any nonzero real 
vector x we have Q(x) 0; (ii) all the tprincipal 
minors of the matrix of Q are positive. Q is 
negative definite if and only if —Q is positive 
definite. A form with n variables is called a 
positive (negative) semidefinite quadratic form 
if its signature is (r, 0) (resp. (0, 7)), where 
l<r<n. 

A linear transformation x'—x = Px’ that. 
leaves invariant the unit form 277, x? is an 
orthogonal transformation. Then P is an tor- . 
thogonal matrix. Any quadratic form can be 
transformed into a diagonal form 17. , a,x? via 
an orthogonal transformation. Here a,,...,a, 
are the teigenvalues of the matrix of the form. 
Two forms Q and Q' are equivalent with re- 
spect to an orthogonal transformation if and 
only if the corresponding matrices have the 
same eigenvalues. 


D. Quadratic Forms over Finite Fields and 
p-adic Number Fields 


Let Q and Q' be nondegenerate quadratic 
forms over the finite field F,. They are equiva- 
lent if and only if they have the same rank and 
A(Q)~ A(Q’). Moreover, if the rank of Q is not 
less than 3, then Q represents 0. 

Next, suppose that Q and Q' are nondegen- 
erate quadratic forms over the tp-adic number 
. field K. They are equivalent if and only if they 
have the same rank, A(Q) ~ A(Q’), and they 
have the same Minkowski-Hasse character y, 
where y is defined as follows: Let C(Q) be the 
tClifford algebra of Q, and let C*(Q) denote 
C(Q) if n is even and C *(Q) if n is odd. Then 
X(Q) 21 or —1 according as C*(Q) z M,(K) or 
M,(K) & D(K). (Here M, is the total matrix 
algebra of degree t over K and D(K) is the 
unique tquaternion algebra over K.) Also, if Q 
has rank not less than 5, then Q represents 0 in 
K. 


E. Quadratic Forms over a General Field K 


The following facts are valid on any field K 
whose characteristic is not 2. Given a quadra- 
tic form Q, with variables x,, ..., x, and an- 
other form Q, with variables x,.,,, ..., Xn+m 
we get a new quadratic form Q, ® Q, or 


Q, +Q, defined by Q, $ Q;(x,,.. 


og Xn+m) = 
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Q,(x1, X) Q3; evt s e Q: e Q; 
is called the direct sum of Q, and Q,. The 
matrix of Q, ® Q, is the direct sum of the 
matrices of Q, and Q,. If Q, and Q are 
equivalent and Q, ® Q, and Q; ® Q^ are also 
equivalent, then Q, and Q^ are equivalent 
(Witt's theorem). 

The quadratic form x4 x3 +x3x4 +... + 
X3, -1 X2, is called the kernel form and is de- 
noted by N,. Any nondegenerate quadratic 


form Q(x,, ..., x,) is equivalent to the direct 
sum of a kernel form N,(x,, ..., X2,) and a form 
Qo(X2,541. -.., X), Where if Qo Z0, i.e., n» 2r, we 


have Qo(x5,,,, ..., x,) 20 only if x5,,, =... = x, 
=0. N, and Q, are uniquely determined by Q 
up to equivalence. The decomposition N, ® Qo 
is called the Witt decomposition of Q (E. Witt 
[5]). The number r is called the index of Q. An 
element x in V is said to be singular with re- 
spect to Q if Q(x) 20. A subspace W of V is 
said to be totally singular if all the elements 

in W are singular. Let B be the symmetric 
bilinear form associated with Q. Then x is 
singular with respect to Q if and only if B(x, x) 
=Q (characteristic of K #2). We say that x is 
isotropic if B(x, x) 20. Thus a subspace W is 
totally singular if and only if it is totally iso- 
tropic De, B(x, y)=0 for all x, ye W). The 
index r of Q is the dimension of a maximal 
totally singular subspace of V. In particular, if 
K =R and (p,q) is the signature of Q, then the 
index r= min(p, q). Here we must be careful, 
since some authors call the number p — q or p 
or q the index of Q. To make the distinction 
clear, we also call our r the index of total 
isotropy, and the number p — q the index of 
inertia. 

Necessary and sufficient conditions for a 
nondegenerate Q to be a kernel form are: 

n —the rank of Q =0 (mod2) when K =C; 
n=0 (mod2) and p—q=0 when K=R; n= 
0 (mod 2) and A(Q)~ 1 when K =F; n=0 
(mod 2), A(Q)~ 1, and x(Q) 2 1 when K is a 
p-adic number field. 

Let N, ® Qo, N; ® Qo be Witt decompo- 
sitions of Q and Q', respectively. We say that Q 
and Q' belong to the same type if Qo and Q% 
are equivalent and denote the set of types of 
nondegenerate quadratic forms over K by W. 
We define the sum of the types of Q and Q' as 
the type of Q OO. and this gives W the struc- 
ture of a commutative group. The type of a 
kernel form is the identity element of this 
group W, called the Witt group. The structure 
of the Witt group depends on K. If K =C, then 
W = Z/2Z,; if K =R, then WZ; if K isa 
tlocal field with a tnon-Archimedean valua- 
tion, then W is a finite group; if K — F,, then 
W =(Z/2Z) + (Z/2Z) if q21 (mod4), W = Z/4Z 
if 4 23 (mod4), and W = Z/2Z if q is a power 
of 2. 
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F. Hermitian Forms 


An expression H(x) — 37 ,., aj,X;x, is called a 
Hermitian form if a;, € C, a, = a,;. (Here Ou, x; 
are the complex conjugates of aj, x;, respec- 
tively.) The value H(x) is a real number. As for 
quadratic forms, we define the notions of the 
matrix of H and the discriminant, rank, and 
*sesquilinear form associated with H. The 
matrix A of H is a *Hermitian matrix whose 
principal minors are real numbers. If we 
apply a linear transformation P(x') — x, we 
obtain a Hermitian form with respect to x' 
whose matrix is given by 'PAP. Any Her- 
mitian form is equivalent to a form ZE, X;x; 
— Èt- Xj. ;Xp«g (p, q) is called the signature of 
H. We define the notions of positive definite, 
negative definite, and indefinite Hermitian 
forms as we did for quadratic forms over the 
field of real numbers. Each of the following 
conditions is necessary and sufficient for H to 
be positive definite: (i) H(x) 0 for any non- 
zero complex vector x; (11) all the principal 
minors of the matrix of H are positive. The 
definition of a semidefinite Hermitian form is 
given in the same manner as for a quadratic 
form. 

À linear transformation that leaves the 
Hermitian form 277 , X;x; invariant is called a 
unitary transformation, and its matrix is a 
tunitary matrix. Any Hermitian form can be 
transformed via a unitary transformation into 
a diagonal form 37. a;x;x;, where a,, ....a, 
are the eigenvalues of the matrix of the Her- 
mitian form. 

The notion of Hermitian forms can be gen- 
eralized as follows: Suppose that K is a fdivi- 
sion ring with an involution aa (ac K) (i.e., 
a—d is a linear mapping of K onto itself such 
that à =a, ab —- ba). Then a Hermitian form H 
over K is defined by 


n 
H(x)= 5 X jin Xy, 
i,k=1 
where x;€ K, a;,€K, a, =4,;. In particular, if 
we have, for any given vector x whose coordi- 
nates belong to K, an element ain K such that 
H(x)=a +a, then we have a Witt decompo- 
sition for H. Two examples of such K having 
involutions that differ from the identity map- 
ping are a separable quadratic extension K of 
a field L and a *quaternion algebra K over a 
field L. 


G. Quadratic Forms over Algebraic Number 
Fields 


Let K be an algebraic number field of fi- 
nite degree, p be an *Archimedean or non- 
Archimedean ‘prime divisor of K, K, be the 
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p-completion of K, and Q and Q' be nonde- 
generate quadratic forms over K. Then Q rep- 
resents Q' over K if and only if Q represents 

Q' over K, for all p, and Q represents 0 in K 
(e, there exists a nonzero vector x whose co- 
ordinates belong to K such that Q(x) — 0) if 
and only if it represents O in K, for all p (3,4 ]. 
In particular, Q and Q' are equivalent over K 
if and only if they are equivalent over K, for 
all p. Hence the invariants with respect to 
equivalence over K of a nondegenerate qua- 
dratic form Q over K are n— the rank of Q, A 
=the discriminant of Q, the Minkowski-Hasse 
character y, for *prime divisors p of K, and the 
index of inertia j; of Q over Koss for each 
treal infinite prime divisor p, , (4 — L, ...,r,) of 
K. Here the following properties hold: (1) y, — 1 
for all but finitely many p; (i1) IT, — 1 (this is 
equivalent to the tproduct formula of norm- 
residue symbols); (iii) A~~(—1) +12 in Kasa 
and (iv) Xpo. ae if j, =0, 1, 2, 7 (mod 8), = —1 
if j, =3, 4, 5, 6 (mod 8) [3,4]. Conversely, if 

the system (n, Xp» x... .j;. A} satisfies condi- 
tions (1)- (1v), then it is the set of invariants of a 
quadratic form over K (Minkowski-Hasse 
theorem). In general, if a property concerning 
K holds if and only if it holds for all K, we 
say that the Hasse principle holds for the 


property. 


H. Class and Genus of a Quadratic Form 


Let K be an algebraic number field of finite 
degree. Quadratic forms Q and Q' over K are 
said to be of the same class if they are equiva- 
lent over the tprincipal order o in K. On the 
other hand, Q and Q' are said to be of the 
same genus if (1) they are equivalent over 

the principle order o, in K, for all non- 
Archimedean prime divisors p of K and (ii) 
they are equivalent over K, for all the Archi- 
medean prime divisors p of K. A genus is 
decomposed into a finite number of classes. 
For example, if K is the field of rational num- 
bers, the number of classes in the genus of 
D7, x? is 1 for m<8, while it is 22 for m» 8. 


I. Reduction of Real Quadratic Forms 


Let A be an m x m matrix and X anmxn 
matrix. We put A[X ]='XAX. Then we can 
write O(x)=S[x]='xSx, where S is the matrix 
of the quadratic form Q. In this section we put 
K =R and define two forms to belong to the 
same class if they are equivalent over the ring 
of rational integers. We identify the form Q 
with its matrix S —(s;j). Let S be a positive 
definite form in m variables. Then S is said to 
be a reduced quadratic form if S[9,] È gu and 
Snap 201 «kam, 1 xlxm-— 1), where g, is an 
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arbitrary vector whose coordinates g,, ...,g,, 
are integers such that (g,, ...,g,,) — 1. Any class 
of positive definite quadratic forms contains at 
least one (and generally only one) reduced 
form. For a reduced form R =(r,,), the follow- 
ing inequalities hold: O € r,, € r;5 € ... & ry; 
Try &ry, k<; rí,7255... r4 c(m)|R|, where 
c(m) depends only on m. The set of all sym- 
metric matrices of degree m forms a linear 
space of dimension mim + 1)/2 in which the 
subset P formed by the positive definite sym- 
metric matrices is a convex open subset. More- 
over, the subset R formed by all reduced 
positive definite symmetric matrices is a con- 
vex cone whose boundary consists of finitely 
many hypersurfaces and whose vertex is the 
origin. Let S be an indefinite quadratic form 
whose signature is (n, m — n). The set of positive 
definite quadratic forms H such that S! [H] 
=S forms a variety of dimension n(m — n), 
which is denoted by H(S). We say that S is 
reduced if H(S)N R z Ø. Given a natural 
number D, there are only a finite number of 
definite or indefinite reduced quadratic forms 
with rational integral coefficients whose dis- 
criminant is +D. Hence the number of classes 
of quadratic forms with rational integral coef- 
ficients and discriminant +D is also finite. 


J. Units 


Let S be a symmetric matrix of degree m with 
rational coordinates. Let O(S) be the set of all 
real m x m matrices W for which SEW] — S, 
and I'(S) be the subset of O(S) consisting of 
the integral matrices. Àn element of I'(S) is 
called a unit of S. T(S) is a finite group if S is 
definite, but otherwise it is infinite (except for 
the case m=2, —|S|— 7?, with rational r). O(S) 
is a tLie group, and I (S) is a discrete subgroup 
with a finite number of generators. The thomo- 
geneous space O(S)/I (S) is of finite measure 
with respect to a fHaar measure defined on the 
space. 


K. Minkowski-Siegel-Tamagawa Theory 


Let S and T be rational integral positive de- 
finite symmetric matrices of degree m and n, 
respectively (mz n). Let A(S, T) be the number 
of rational integral solutions for the equation 
S[X]- T, and E(S) be the order of the group 
of units T(S). We put 


A(S,, T) | A(S2, T) 





M(S, T)= 





E(S;) E($) 
Mesa ee 
E(S,) E(S;) 
pro _M(S,T) 


M(S) ' 
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where S,, S,,... is a complete system of re- 
presentatives of the classes in the genus of 

S. M(S) is called the measure of genus of S. 

On the other hand, for a natural number q, 

we denote by A,(S, T) the number of the solu- 
tions of the congruence equation S[ X]: T 
(mod 9). If q is a prime power p^, then the ratio 
Em nq DO RT 4 (S, T) takes a constant value 
X, (S, T) for sufficiently large a (where e, „= 1/2 
if m—nz2;-—1 otherwise). Furthermore, let us 
consider a domain B in the Euclidean space of 
dimension n(n + 1)/2 formed by the set of nx n 
symmetric matrices containing T, and let B, be 
the domain formed by the matrices X such 
that S[.X] e B. Then B, is a domain in the 
space of dimension mn formed by all m x n 
matrices. Let «,,(S, T) be the limit of the ratio 
v(B,)/v(B) of the volumes of B and B, as the 
domain B shrinks toward the point T. Then 
Siegel’s theorem states: 3. (S, T) IT, a, (S, T) - 
€A,(S, T), where ¢=2 ifm=n+1 or m=n>2 
and £= 1 otherwise. The infinite product of the 
left-hand side of this equation does not con- 
verge absolutely if either m=n=2 or m=n+2, 
and in those cases the order of the product TT, 
is considered to be the natural order of the 
primes p. 

A special case of Siegel’s theorem was 
proved by H. Minkowski, but it was C. L. 
Siegel [6] who proved it in its general form. 
Except for a finite number of p, the numbers 
x, (S, T) have been calculated. The explicit 
form of x, (S, T) is also known. In particular, if 
we take the identity matrix E™ of degree m as 
S, then the formula in Siegel's theorem is re- 
lated to the problem of expressing natural 
numbers as sums of m squares. For m— 

2,3, ...,8, the genus of E™ contains only 

one class. Hence, putting n—1, T=t(=a 
natural number), we obtain from Siegel's 
theorem the number of ways in which we can 
express t as the sum of m squares [6, pt. I]. 
Siegel's result was generalized by Siegel himself 
to the case where the form S is indefinite [6, pt. 
IT] and where the coefficients of the forms are 
elements of an algebraic number field of finite 
degree [6, pt. III]. Also, regarding the number 
of possible ways to express a natural number t 
as a sum of m squares, the following formula 
was obtained by C. G. J. Jacobi for the case 
where m=4, n— 1: 


A(EQ, »-s(Ya- X 1) 
d|t 4d|t 

For the case m= 3, n=1, it is known that if t is 
odd and A(E®), t) 0, then t#7 (mod 8) (for 
details — P. T. Batemann, Trans. Amer. Math. 
Soc., 71 (1951)). 

T. Tamagawa used the theory of tadelized 
algebraic groups and proved that the tTama- 
gawa number z(SO(n, S)) of the special ortho- 
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gonal group is 2. He also showed that from 
this fact, Siegel’s theory in this section can be 
deduced (— 13 Algebraic Groups P) [10]. 


L. Theta Series 


Let Q(x,, ..., Xm) be a positive definite form 
with integral coefficients. For a complex num- 
ber z, we put 


exp(2ziQ(x,, ..., x,)z). 
where x,, ..., Xm run over all the integers. If 
Im z 7 0, the series converges and represents 
an tentire function of z. These series are 
called theta series. If we denote by A(n) the 
number of integral solutions of the equation 
Q(x,, ..., x,) =H, we have 


F(z,Q)= yy A(n)e?""*, 

n=0 
Moreover, if m= 2k, we have the following 
transformation formula: 


F (=. o) =e(d)(cz +d)‘ F(z, Q), 
cz+d 
where a, b, c, d are integers such that ad — bc 
= 1, c=0 (mod N), N is a natural number 
determined by Q, and e is a character mod N. 
In other words, F(z, Q) is a *nodular form 
with respect to the tcongruence subgroup of 
level N. Using the theory of modular forms, E. 
Hecks showed that A(n) — Ao(n) 4- O(n"?), 
where A,(n) is a number-theoretic function of 
n determined by the genus of Q. 


M. Binary Quadratic Forms with Integral 
Coefficients 


Now put m —2. Given a form Q(x, y) ax? + 
ber * cy?, we put D(Q) - b? — 4ac and call it 
the discriminant of Q (i.e., D(Q) = —4A(Q)). Q is 
said to be primitive if (a, b, c) - 1. When D(Q) 
is not a square, the theory concerning Q is 
closely related to the arithmetic theory of the 
*quadratic field Q(,/D) — k. Let d be the tdis- 
criminant of k and put D=df?. When f — 1, 
there is a one-to-one correspondence between 
the "deal classes of k and the classes of qua- 
dratic forms with discriminant d (when D « 0, 
we consider the classes of positive definite 
forms). The correspondence is given in the 
following manner: If a is an ideal in k with a 
basis x,, %2, then the corresponding form is 
given by Q(x, y) 2 N(a) ! N(a, x - a; y), where 
N is the absolute *norm. If f » 1, we must 
replace the ring of integers o by the *order of 
the ‘conductor f. That is, if we consider the 
ring formed by the elements x + f yo (x, y are 
rational integers; the meaning of œ is ex- 


1296 


plained shortly), we again have a one-to-one 
correspondence between the classes of ideals of 
this order and the classes of quadratic forms 
[3]. We let c =\/d/2 if d=0 (mod 4; — (1-- 
/4)/2 if d= 1 (mod 4). When D>, we can 
introduce the notion of proper equivalence as 
follows: Q and Q' are properly equivalent if the 
matrix of Q is transformed to the matrix of Q' 
by a linear transformation P whose determi- 
nant is 1. Then, in the correspondence for the 
case f=1, if we take x, =r >0, x, =s +t@, t>0 
(r.s, te Q), then we get a relation between the 
classification of the forms and the classification 
in the finer sense of the ideals in k. 

Suppose that D>0 is not a square. Let (t, u) 
be an integral solution of *Pell's equation t? — 
Du? = +4. Then the units of the form Q(x, y) 
—ax? +bxy 4 cy? with discriminant D are 
given by 


x (t — bu)/2 —cu 

U au (t + bu/2/ 

Let (t5, ug) be the smallest positive integral 
solution of t? — Du? —4, put £p ss (t5 + uo / Dy2, 
and let hp be the class number in the finer 


sense of the forms of discriminant D. Then the 
following formula holds (Dirichlet): 


1 o 1/D 
T EAS E 
po 258) 


where (D/n) is the *Kronecker symbol (here D 
= f "d we put (D/n)=0 if ( . n) ¥1, and (D/n) 
— (d/n) if ( f, n) - 1). For D <0, the order wp of 
the units is known: it is 6 if D = —3; 4 if D = 
—4; and 2 otherwise. We also have 





With respect to the numbers hp and ep, little 
else is known. 
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A. Problems 


À quadratic programming problem is a special 
type of mathematical programming (— 264 
Mathematical Programming) where the objec- 
tive function is quadratic while the constraints 
are linear. A typical formulation of the prob- 
lem is as follows. 
(Q) Maximize z=c’x —1x'Dx under the 

condition Ax <b and x 20, xeR". 

Let the Lagrangian form for this problem be 


Q(x, X) 2 ex —1x'Dx + X'(b — Ax). 


Then, from the general properties of Lagran- 
gian forms, the following theorem can be 
proved. 

Theorem: If x — x* is an optimal solution of 
the problem (Q), there exists a vector A* satis- 
fying the conditions 
—Dx*-c-c«À*, A*>0; 

HAS =(c — Dx*)x*. 

Moreover, if the matrix D is nonnegative de- 
finite, the above conditions are also sufficient 
for x =x* to be optimal. The second condition 
can be shown to be equivalent to 


A*'(b— Ax*) 20 and x*'(4'A* + Dx* —c)— 0. 


By introducing the slack vectors uz 0 and 
v Z0, the conditions can be expressed as 


x20, y20, uz0, v20; 


(CO Ax+tu=b, A’y+Dx—v=c; 


yu=0, x’v=0. 
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When D is nonnegative definite, any feasible 
solution of the above system of equalities gives 
an optimal solution of the primary problem 
(Q), and when D is positive definite, the solu- 
tion is unique. When D is not nonnegative 
definite, the optimal solution of (Q), if it exists, 
is one of the feasible solutions of (C). The last 
line of (C) implies that the solution must be a 
basic solution of the linear system of equalities, 
and it also restricts the possible combinations 
of the basic variables. Since there exist only a 
finite number of possible combinations of the 
basic variables, the quadratic programming 
problem can be solved in a finite number of 
steps, if it has an optimal solution. 


B. Duality 


The dual problem of (Q) is the following. 

(QD) Minimize w — b'y - 1x'Dx under the 
condition A'y + Dx zc and x z0, y z 0. 

If D is nonnegative definite, the following 
theorem holds. 

Theorem: If the primary problem (Q) has a 
solution x — x*, then the dual problem has a 
solution x 2 x* and y 2 y*, and max z = min w. 

À more general form of the quadratic pro- 
gramming problem can be given as follows. 

(Q) Maximize z=c’x — 1x'Dx under the 
condition xe V and b — Axe W, where V and 
W are closed convex cones in R" and R", 
respectively. 

Then the dual problem is expressed as 
follows. 

(QD) Minimize w — b'y —ix'Dx under the 
condition xe V, ye W* and A’y+ Dx —ceV*, 
where V* and W* are the dual cones of V and 
W. 

The above theorem holds for both (Q) and 


(QD). 


C. Algorithms 


Various algorithms have been proposed for 
quadratic programming [1, 2, 4], most of 
which are based on condition (C). Wolfe [4] 
proposed a method based on the simplex 
method for linear programming. If we intro- 
duce the artificial vectors € and n, we can find 
a feasible solution of (C) by solving the follow- 
ing linear programming problem. 

(LQ) Maximize z= —1'&8— 1'i under the 
condition that 
Ax+u—€=b, A’y+Dx—v4+y=¢; 
x20, y>0, u>0, v>0, $20, 20; 
yu=0, x’v=0. 

(LQ) can be solved by applying the simplex 
algorithm with the only modification being 
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that the last line of the condition restricts the 
possible changes in the basic variables. When 
D is positive definite, we can always obtain 

a solution if there is a feasible solution of 

the original problem, and Wolfe proposed a 
modification of the foregoing algorithm for the 
case when D is nonnegative definite that tells 
whether or not it has an optimal solution, and 
gives it if it has one. Some other algorithms are 
also effective when D is positive or nonnega- 
tive definite, but when D is not nonnegative 
definite, no simple effective method has been 
found to reach the optimal solution even when 
its existence has been established. 
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A. Introduction 


A subset F of a 3-dimensional Euclidean space 
E? is called a quadric surface (surface of the 
second order or simply quadric) if F is the set of 
zeros of a quadratic equation G(x, y, z)=0, 
where the coefficients of G are real numbers. 
The equation G(x, y,z)=0 is written as 


ax? + by? c cx? +d + 2fyz + 2gzx + 2hxy + 2f'x 
+2g'y+2h'z=0. (1) 


In general, a straight line intersects a quadric 
surface at two points. If it intersects the sur- 
face at more than two points, then the whole 
straight line lies on the surface. Suppose that 
we are given a quadric surface and a point O. 
Suppose further that we are given a straight 
line passing through the point O and intersect- 
ing the quadric surface at two points A and A’. 
If AO =OA ' for all such straight lines, then the 
point O is called the center of the quadric 
surface. 
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B. Classification 


The subset defined by equation (1) may be 
empty; for example, x? + y? +z? + 1—0. In this 
article, we consider only quadric surfaces that 
are not the empty set. When a quadric surface 
F without tsingular points has a center or 
centers, we say that F is central. 

If we choose a suitable rectangular coordi- 
nate system, the equation of a central quadric 
surface is written in one of the following forms: 


ter Q) 
pta ^ 
vel (7) 


When the equation takes the form (2), (3), (4), 
(5), or (6), we call the quadric surface an ellip- 
soid, hyperboloid of one sheet, hyperboloid of 
two sheets, elliptic cylinder (or elliptic cylin- 
drical surface), or hyperbolic cylinder (hyper- 
bolic cylindrical surface), respectively. When 
the equation takes the form (7), the surface 
coincides with a pair of parallel planes. If a= 
b in (2), (3), (4), or (5), the surface is a surface 
of revolution with the z-axis as the axis of revo- 
lution. In this case, we call the surface an 
ellipsoid of revolution, hyperboloid of revolution 
of one sheet, hyperboloid of revolution of two 
sheets, or circular cylinder (or circular cylin- 
drical surface), respectively. [f a— b —c for an 
ellipsoid of revolution, then tne surface is a 
sphere with radius a. 

If we choose a suitable rectangular coordi- 
nate system, the equation of a noncentral 
surface of the second order is written in one of 
the following forms: 


x2 y? 
EE 8 
z atp (8) 
x? y? 
DRM MR 9 
e ne (9) 
2 
2:7. (10) 


a 


When the equation takes the form (8), (9), or 
(10), we call the surface an elliptic paraboloid, 
hyperbolic paraboloid, or parabolic cylinder (or 
parabolic cylindrical surface), respectively. If a 
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= b in (8), the surface is called an elliptic para- 
boloid of revolution. 

Among these, (2), (3), (4), (8), and (9) are 
sometimes called proper quadric surfaces, and 
the others degenerate quadric surfaces. 

Equations (2)-(10) are called the canonical 
forms of the equations of these surfaces (a, b, c 
in canonical forms should not be confused with 
a, b, c in (1)). The planes x 20, y 20, and z=0 
in surfaces (2), (3), and (4) and the planes x =0 
and y=0 in surfaces (8) and (9) are called 
principal planes of the respective surfaces; and 
lines of intersection of principal planes are 
called principal axes. For a surface of revo- 
lution, positions of principal planes and prin- 
cipal axes are indeterminate. We call a, b, c in 
equations of canonical form the lengths of the 
principal axes, or simply the principal axes. If 
F is a hyperboloid of one sheet or a hyperbolic 
paraboloid, there are two systems of straight 
lines lying on F; two straight lines belonging 
to the same system never meet (and are not 
parallel), and two straight lines belonging to 
different systems always meet (or are parallel). 
If F satisfies (3), these systems of straight lines 
are given by 


b b 

1 y x z 1 y 
=-{ 1+5 —+-=-{1—-]. 
" d 4) Tu al z) 


If F satisfies (9), then two such systems are 
given by 





x y x y 
Dur Ee 
a b ao i 
n x y 2z 
a b à (a b yu 


(A and u are parameters). We call these straight 
lines generating lines of the respective surfaces. 
A hyperboloid of one sheet and a hyperbolic 
paraboloid are truled surfaces described by 
these generating lines. 

When a quadric surface has singular points, 
they are double points. The set of double 
points of a quadric surface F is either a single 
point O, a straight line L or a plane z. In the 
second case, F consists of two planes passing 
through ! or l itself, and in the third case, F 
coincides with z. In the first case, we say that 
F is a quadric conical surface (or quadric cone) 
with vertex O. Its equation is written in the 
form Ax? + By? + Cz? «0 (ABC x0). When A, 
B, C are of the same sign, F consists of only 
one point O. Otherwise, we can assume that A, 
B>0O, C= —1. In this case, if A= B, F is called 
a right circular cone, and if A z B, F is called 
an oblique circular cone. 

Given hyperboloids (3) and (4), we call the 
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quadric cones 


x? y? z? 

Se 3) 
a’ b^ c 
and 

x) yp z ) 
"e gta v 


asymptotic cones of (3) and (4), respectively. 


C. Poles and Polar Planes 


Suppose that we are given a straight line A 
passing through a fixed point P not contained 
in a quadric surface F, and S intersects the 
surface at two points X, Y. The locus of the 
point Q that is the harmonic conjugate of P 
with respect to X and Y is a plane. We call this 
plane z the polar plane of P with respect to the 
quadric surface F, and P the pole of the plane 
n. If the polar plane of a point P contains a 
point Q, then the polar plane of Q contains P. 
In this case, we say that the two points P and 
Q are conjugate to each other with respect to 
the quadric surface. When the point P is on 
the quadric surface, the tangent plane at P is 
regarded as the polar plane of P. If the polar 
plane (with respect to a quadric surface) of 
each vertex of a tetrahedron is the face corre- 
sponding to that vertex, we call this tetra- 
hedron a self-polar tetrahedron. If the polar 
planes (with respect to a quadric surface) of 
four vertices of a tetrahedron A are four faces 
of a tetrahedron B, the same property holds 
when we interchange A and B. We say that 
such tetrahedrons are polar tetrahedrons with 
respect to the quadric surface. Suppose that we 
are given a quadric surface and two planes. If 
the pole (with respect to the quadric surface) of 
one plane is on the other plane, these two 
planes are said to be conjugate with respect to 
the surface. 

When we are given two fpencils of planes in 
tprojective correspondence, the locus of lines 
of intersection of two corresponding planes is 
generally a hyperboloid of one sheet or a 
hyperbolic paraboloid. In particular, if the 
axes of these pencils of planes intersect, the 
locus is a quadric conical surface, and if the 
axes are parallel, the locus is a quadric cylin- 
drical surface (i.e., an elliptic or hyperbolic 
cylinder). When there exists a projective corre- 
spondence between two straight lines not on a 
plane, the locus of lines joining corresponding 
points is a quadric surface (M. Chasles). 


D. Surfaces of the Second Class 


A surface F in E? is called a surface of the 
second class if it admits two tangent planes 
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passing through an arbitrary straight line L 
provided that FN L= @. This surface can be 
represented as the set of zeros of a homoge- 
neous equation of the second order in *plane 
coordinates u,, uz, Ha, u4. It is possible that a 
surface of the second class degenerates into a 
conic or two points. In general, quadrics are 
surfaces of the second class, and vice versa. 

As in the case of quadrics, we can define 
poles, polar planes, and polar tetrahedrons 
with reference to surfaces of the second class. 
Four straight lines joining corresponding 
vertices of two tetrahedrons polar with respect 
to a surface of the second class are on the same 
quadric. We say that two such tetrahedrons 
are in hyperboloid position. 


E. Confocal Quadrics 


A family of central quadrics represented by the 
following equations is called a family of con- 
focal quadrics: 


x? y* z? 


Lx bey Ee 











1, a>b>c>0, (11) 


where k is a parameter. For a quadric belong- 
ing to this family, any point on the ellipse 
x?/(a— c) 4- y?/(b — c) « 1, z «0 or the hyperbola 
x?/(a — b) — z2/(b — c) -.1, y=0 is called a focus. 
This ellipse and hyperbola are called focal 
conics of the quadric. 

Given an ellipsoid F and a point X (x, y, z) 
not contained in the principal plane, we can 
draw three quadrics F', F", F" passing through 
X and confocal with F. These surfaces F', F", 
F" intersect each other and are mutually per- 
pendicular. One of them is an ellipsoid, an- 
other one a hyperboloid of one sheet, and the 
third a hyperboloid of two sheets. Let k,, k,, 
k4 be the values of the parameter k in (11) 
corresponding to these three surfaces. Then 
the coordinates x, y, z of the point X are given 
by 

a fetta rae cha 
EE ` 


d 
7 (a—b)(c—b) ` 


[eee v) 
(a-c(b—c) ` 








We call k,, kz, k; the elliptic coordinates of the 
point X. 

Two points (x, y, z), (x’, y’, 2’) are called corre- 
sponding points if they belong to confocal 
quadrics of the same kind, 

x? 2 z 
A ptus us 1, 
a b c 

n2 n2 ^2 
GP OP Gy 
at+k b+k cck 











> 
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and satisfy 











If P,, P5; Q1, Q3 are corresponding points, 
then P, Q, — P,Q, (J. Ivory). 


F. Circular Sections 


When the intersection of a plane and a quadric 
is a circle, the intersection is called a circular 
section. In general, circular sections are cut off 
by two systems of parallel planes through a 
quadric. The point of contact on the tangent 
plane parallel to these is an *umbilical point of 
the quadric. 


G. Quadric Hypersurfaces 


A subset F of an n-dimensional Euclidean 
space E" is called a quadric hypersurface (or 
simply hyperquadric) if it is the set of points 
(x4, ..., x) satisfying the following equation of 
the second degree: 


i 


n n 
Y, ay Xj;X,+2 Y, bix;-- c0, (12) 
,k-1 i=1 

where au, b;, c are all real numbers. We can 
assume without loss of generality that the 
matrix A —(aj,) is symmetric. Assume that A is 
not a zero matrix. In the case n=2, F is a 
conic, and in the case n= 3, it is a quadric 
surface. The theory of classification of quad- 
ric surfaces can be generalized to the n- 
dimensional case as follows: Let r(A*) — r* be 
the rank of the (n+ 1) x (n+ 1) matrix 


Ay, e Qin b, 
* 

S Ani Ann b, 
b, b, c 

by 

S A 
= b, ; 

b, ba c 


and put r(A)=r. Then we have the following 
three cases: (I) r=r*; (II) r+ 1=r*; and (III) r+ 
2 —r*. Corresponding to each case, equation 
(12) can be simplified (by a coordinate trans- 
formation in E") to the following canonical 
forms, respectively: 


(I) Kä dë =0, 
i=1 


(ID ¥ Ax? +1=0, 


i=] 


(II) Y A,x? +2x,4,=0, 
i-1 
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where (4,, ..., 4,,0, ..., 0) (with n—r zeros) is 
proportional to the teigenvalues of the matrix 
A. In general, we have 1 &r«n. In the cases 
where r=n in forms (I) and (II) and r+ 1 =n in 
(III), the hypersurface is called a properly (n — 
1)-dimensional quadric hypersurface, and in 
other cases, a quadric cylindrical hypersurface. 
In cases (I) and (II), the quadric cylindrical 
hypersurface is the locus of (n — r)-dimensional 
subspaces passing through each point of a ` 
properly (r — 1)-dimensional quadric hypersur- 
face and parallel to a fixed (n — r)-dimensional 
subspace. In case (IIF), the quadric cylindrical 
hypersurface is the locus of (n — r — 1)-dimen- 
sional subspaces passing through each point 
of a properly r-dimensional quadric hypersur- 
face and being parallel to a fixed (n — r — 1)- 
dimensional subspace. For form (1) with 4; 0 
(i— 1, ..., n), a properly (n — 1)-dimensional 
quadric hypersurface reduces to a point in E"; 
for form (II) with 4; 0 (i— 1, ..., n) it becomes 
the empty set. Suppose that we are given a 
quadric hypersurface F that is neither a point 
nor empty. Then the system (4,, ..., A,} as- 
sociated with F in its canonical equation is 
unique up to order (and signature in form 
(III)). Suppose that we are given a quadric 
surface F and a point P on F. Suppose further 
that if a point X other than P is on F, then the 
whole straight line PX lies on F. In this case, 
the hypersurface is called a quadric conical 
hypersurface (or simply quadric cone). For 
example, for case (I), we can take P — O (the 
origin), and the hypersurface is a quadric cone. 
In cases (I) and (H), the hypersurface is sym- 
metric with respect to the origin. In these 
cases, a hypersurface is called a central quadric 
hypersurface, and in case (III), it is called a 
noncentral quadric hypersurface or parabolic 
quadric hypersurface. When we cut a parabolic 
quadric hypersurface by a (2-dimensional) 
plane containing the x, , ,-axis, the section is a 
parabola. If 4; « 0 (i— 1, ..., r) in form (ID), then 
the surface is called an elliptic quadric hyper- 
surface, and if there are both positive and 
negative numbers among the /,, the surface is 
called a hyperbolic quadric hypersurface. The 
section of an elliptic quadric hypersurface by a 
plane is always an ellipse. The section of a 
hyperbolic quadric hypersurface by a plane is 
an ellipse, a hyperbola, or two straight lines. In 
general, the section of a quadric hypersurface 
by a subspace is a quadric hypersurface on 
that subspace. | 


H. Quadric Hypersurfaces in an Affine Space 


In Section G we considered a quadric hyper- 
surface defined by (12) in an n-dimensional 
Euclidean space E" and transformed the equa- 
tion to canonical form by an orthogonal 
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transformation of coordinates in E". If we 
regard E" as an n-dimensional taffine space 
over the real number field and reduce (12) to 
the simplest form by a coordinate transforma- 
tion in the affine space, we have the following 
canonical forms corresponding to cases (I), (IT), 
and (IIT) discussed in Section G: 


(D (0: Y x2- y xj —0, 
Gi 


j=s+1 


r 


(ID. (s,t):} x?- Y x?+1=0, 
i-1 


j=s+1 


(H) (50: 3 x? Y, x?42x,44120, 

i-1 j=s+i 
where 0<s<r and r—s=t. The terms properly 
(n — 1)-dimensional, cylindrical, conical, para- 
bolic, elliptic, and hyperbolic can be defined in 
terms of this affine classification. For example, 
a cone is of type (I), a parabolic hypersurface is 
of type (IIT), an elliptic hypersurface is of type 
(IT) (0, r), a hyperbolic hypersurface is of type 
(II) (s, t) (s, t > 0), and type (IT) (s, 0) represents 
the empty set. A necessary and sufficient con- 
dition for a (nonempty) hypersurface to be 
represented by two canonical forms N(s, t), 
Nie, t) is that (i) N = N' and (ii) if N —(I) or 
N —(HII), then s— 5, t=tť or s-t', t—s', and if 
N —(lHI), then s—5', t — t'. 


I. Quadric Hypersurfaces in a Projective Space 


Suppose that we are given a field K of charac- 
teristic not equal to 2 and an n-dimensional 
tprojective space P" over K. A subset F of P" is 
called a quadric hypersurface (or simply hyper- 
quadric) if F is represented by a homogeneous 
equation of the second degree E? p-o aj, x;x, 
=0, where (xo, x,, ..., Xn) are homogeneous 
coordinates in P" and aj,€ K; A—(a4) isa 
nonzero symmetric matrix. The problem of 
classifying such surfaces is reduced to that of 
tquadratic forms or, equivalently, to that of 
symmetrix matrices in K. Two symmetric 
matrices A and B are equivalent if there exists 
a regular matrix T such that B—'TAT (— 348 
Quadratic Forms). In particular, when K is an 
talgebraically closed field or a treal closed 
field, a simple result is obtained. If K is an 
algebraically closed field, then the equation of 
the quadric hypersurface is reduced to the 
canonical form 37. x2 —0, where r 2 r(A) - the 
rank of A. When K is a real closed field, then 
the canonical form is Lo x? ann x7 =0. 
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A. Historical Remarks 


*Newtonian mechanics (classical mechanics) 
successfully explained the motion of mechan- 
ical objects, both celestial and terrestrial, on 
a macroscopic scale. It failed, however, to 
explain blackbody radiation, which was dis- 
covered in the last decade of the 19th century. 
M. Planck introduced a hypothesis of discrete 
energy quanta, each of which contains an 
amount of energy E equal to the frequency of 
the radiation v multiplied by a universal con- 
stant h (called Planck’s constant). He applied 
this hypothesis to derive a new formula for 
radiation that gives predictions in good 
agreement with observations. A. Einstein 
proposed the hypothesis of the photon as a 
particlelike discrete unit of light rays. Assum- 
ing that many physical quantities, including 
energy, have only discrete values, N. H. Bohr 
explained the stability of electronic states in 
atoms. As illustrated in these examples, quan- 
tum mechanics is applied to study the motion 
of microscopic objects, including molecules, 
atoms, nuclei, and elementary particles. 


B. Quantum-Mechanical Measurement 


Fundamental differences between the new 
mechanics and classical mechanics are due 
to the facts that many physical quantities, 
for example, energy, can take only discrete 
values in the microscopic world, and that 
states of microscopic objects are disturbed 
by observation. 

A (pure) state at a certain time is expressed 
by a unit vector y in a tHilbert space %, and 
observables, or physical quantities, are ex- 
pressed by tself-adjoint operators in such a 
space. 

Let a, (n— 1,2, ...) be teigenvalues of an 
observable A, and let P, be a *projection opera- 
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tor onto the eigenspace spanned by teigen- 
vectors belonging to the eigenvalue a,. Sup- 
pose that A = 2a, P,. Then the hypothesis 
on measurement in quantum mechanics is 
given as follows. 

When an observable A is observed in a state 
w, one of the eigenvalues a, is found with 
probability proportional to (y, P,w)?. When an 
eigenvalue a, is once observed, a state jumps 
from y to an eigenstate P,w which belongs to 
the eigenvalue a,. Quantum mechanics pre- 
dicts only a probability p, with which a certain 
value a, is found when an observable A is 
observed. This probability, given by (v, P, v), 
is not changed even if y is replaced by ey, 
0x 0 « 2x. Therefore e'?^y represents the same 
state as y. The set of ew, 0x 0 «2n, fora 
fixed y (viel = 1) is called a unit ray. 

If P, is 1-dimensional and P,9-- q, |g|| — 1, 
then (v, P,V) — (V, o)? is called the transition 
probability between the two states. 

The expectation value (or expectation) of an 
operator A in a state yj, usually normalized to 
(V, V)- 1l, is defined to be <A> — (iy, Ap) = 
CR 

A general self-adjoint operator A can be 
written as A= ( 4dP(4). When A is observed in 
a state y, the probability for a value to be 
found between 4, and A, (45 44; 4; included 
and 4, excluded if P(A) is right continuous) is 
(V(P(A5) — P(A,)), V) (— 390 Spectral Analysis 
of Operators). 

The quantity (o, Ay) is called the matrix 
element of A between the two states g and y. 
A state y can be viewed as a functional (A) = 
<A) on the set of all observables A (its value 
being the expectation), which is linear in A, 
positive in the sense det AT A) 20 for any opera- 
tor A, and normalized: y(1)— 1. fO Ax 1 and 
V(A) — Aw, CA) - (1 — Av; (A) for all obser- 
vables A, then the state V is called a mixture of 
states i, and y, with weights å and (1— 4). If 
a state is not a nontrivial De, 40, 1, V, z y) 
mixture, it is called pure. The state <A> = 
(V, Aw) on the set of all self-adjoint operators 
A given by a vector y is pure in this sense. If 
sup, V(A,) 2 V(A) whenever A, is an increasing 
net of positive operators with A as its limit, 
then y is called normal. Any norrnal positive 
linear functional on the set of all self-adjoint 
operators can be described by a trace-class 
positive operator p, called the density matrix, 
as <A> — tr(Ap). If (y, is a complete ortho- 
normal set, where each y, is an eigenvector of 
p belonging to the eigenvalue 4,, then (45 = 
Dn Ass, Ars). 


C. Canonical Commutation Relations 


In quantum mechanics, canonical variables 
are represented by the self-adjoint operators 
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Q, (coordinates) and P, (momenta), k — 1, ..., 
N, which satisfy the canonical commutation 
relations 


KOT P] = thd, 1, 


where 1 is the identity operator, [A, B] de- 
notes the commutator AB — BA, h=h/(2n), and 
the relation is supposed to hold on a certain 
dense domain of vectors. Self-adjoint opera- 
tors Q, and P, satisfying the above relations 
are unique up to quasi-equivalence under a 
suitable domain assumption, e.g., if X», Q2 + 
Y, P2 is essentially self-adjoint on a dense 
domain invariant under multiplication of the 
Q's and P's and on which the above relations 
are satisfied. Under such an assumption, Q, 
and P, are unitarily equivalent to a direct 

sum of the Schródinger representation on 

L, (R”, dx, ...dxy), where Q, is multiplication 
by the kth coordinate x, and P, is the differen- 
tiation — ih(0/Ox,) (Rellich-Dixmier theorem). 

The above Schródinger representation 
is called the position representation (or q- 
representation). The formulation using the 
function space L, of real variables p,, k= 
1, 2, ..., N, on which the operators P, act as 
multiplications by p,, is called the momentum 
representation (or p-representation). 

If Hermitian operators A and B satisfy the 
canonical commutation relations in the form 
(Av, By) — (By, Aw) — ih(y, y), then the follow- 
ing Heisenberg uncertainty relation holds for 
the expectation: 


h? 
(A- <A>}? 48-0» 2. 


This gives the uncertainty in observations, 
which means that two observables A and B 
cannot simultaneously be observed with ac- 
curacy. This is another important property of 
microscopic motion that cannot be found in 
macroscopic motion. 

In a direct sum of the Schródinger repre- 
sentation of the canonical commutation re- 
lations, the unitary operators 


U(a)=expi) 4.01, dio Z R; 


with real parameters a, and b,, k=1,..., N, 
satisfy the following Weyl form of the canon- 
. ical commutation relations: 


U(aJU(a)-U(a--a) ` V(b)V(b)- V(b--b), 


U (a)V(b) — V(b) Gen —i) a, n.) : 
k 


Conversely, any pair of families of unitary 
operators U(a) and V(b) satisfying these rela- 
tions and depending continuously on para- 
meters a and b are unitarily equivalent to 
those obtained as above (von Neumann unique- 
ness theorem). 
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D. Time Evolution and the Schródinger 
Equation 


The time t of an observation is fixed in the 
foregoing discussion. A state changes, how- 
ever, as the time t changes, in such a way that 
the transition probability between states 1s 
preserved. By Wigner's theorem (— Section 
HL this time evolution of states can be im- 
plemented by unitary operators U, defined by 
the transformation of vectors y ^ U,j = y. 
Furthermore, under some continuity assump- 
tion, such as that of (o, U,w), U, can be made a 
continuous one-parameter group. By Stone's 
theorem, U,=e~#*" for a self-adjoint operator 
H. This operator is called the Hamiltonian 
operator (or simply Hamiltonian) determined 
by the structure of a system. An infinitesimal 
change in y corresponding to an infinitesimal 
change in t can be generated by this operator 
H as follows: 
ô 
ih A: = Hy. 
This equation is called the time-dependent 
Schródinger equation. 

A state y changes but observables do not 
change with time in the Schródinger pic- 
ture above. The other picture, known as the 
Heisenberg picture, is equally possible. In 
this picture, the state is expressed by a time- 
independent vector, while operators A vary 
with time as follows: A> U,* AU, = A(t). Rates 
of change of operators A(t) can be calculated 
by means of the equation 


dA(t) i 
— — —-|H, A(t)], 

el, A] 
which is called the Heisenberg equation of 
motion. When time t changes, the expectation 
value of an operator A in a state y changes in 
both pictures according to 


dC Ay/dt =i<[H, A])/h. 


According to classical analytical dynamics, a 
change of a dynamical quantity that is a func- 
tion of canonical variables q; (positions) and 
p; (momenta) is given by 


dA/dt = —(H, A), 


where H is a tHamiltonian function and the 
parentheses ( , ) denote the tPoisson bracket. 
A replacement of the Poisson bracket (A, B) by 
[ A, B]/ih transforms this classical equation 
into the quantum-mechanical equation above. 
It should be noticed that the mathematical 
structure of the Poisson bracket is similar to 
that of commutator. In this transition from 
classical to quantum mechanics the corre- 
spondence principle can be used. This requires 
that the laws of quantum mechanics must lead 
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to the equations of classical mechanics in the 
classical situation, where many quanta are 
involved and h can be regarded as infinites- 
imally small in the commutation relation. 

The correspondence principle suggests that 
Hamiltonian operators in quantum mechanics 
can be obtained from Hamiltonian functions 
Hin. q,) of canonical variables p, and q, in 
classical mechanics after replacing p, and q, by 
the operators P, and Q, in the Schródinger 
representation (up to uncertainty of about the 
order of operators). This process of moving 
from canonical variables and the Hamiltonian 
function in classical mechanics to canonical 
operators and the Hamiltonian in quantum 
mechanics is called quantization. Taking a 
system of s particles and letting x,, y,, and zę 
be the Cartesian coordinates of the kth parti- 
cle, we usually write the equation of motion as 


+ f) 


which is a second-order partial differential 
equation. Here m, is the mass of the kth par- 
ticle; A, is the *Laplacian of x,, y,, and z,; and 
V is a real function called the potential energy. 
This equation is the time-dependent Schródin- 
ger equation. The partial differential operator 
on the right-hand side is called the (s-body) 
Schrödinger operator and V(x,.yi.z,, .... 
X, Ys» Zs t) is called a wave function. The prob- 
ability of finding a particle in the volume 
dx, dy, dz, bounded by Xp, x, -F dx,, Yrs Ve kd 
and z,, Zut dz, is proportional to |(x,, Eu, Zu, 
, Xo Yo Zy DP. Usually |y}? is normalized 
so that its integral over the whole space is 1. 
We sometimes call y the probability ampli- 
tude. When y is given by e "P" ^o(x,, ... z), 
the expectation value of an operator A in a 
state y, CA» — [W* Ay dx, ...dz,, does not de- 
pend on time. When this is the case, v is 
called a stationary state. 

A real value E and a function @(x,,...,2,) 
are found by solving an teigenvalue prob- 
lem Ho = Eq. This equation is the time- 
independent Schródinger equation, which is a 
second-order partial differential equation. 
Since the Hamiltonian H stands for the energy 
of this system, the eigenvalues E are the energy 
values that this system can take. 

When a potential function V is given, it is 
a nontrivial matter to prove that the (s-body) 
Schródinger operator with the given V is es- 
sentially self-adjoint on the set of, for example, 
all C?-functions with compact supports so 
that its closure H defines mathematically the 
continuous one-parameter group of unitaries 
U, — e H^ for the time evolution of the quan- 
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tum system of s particles with the given inter- 
action potential V. If V satisfies an estimate 
| Vis « 41 Rath? + uiv] (called the Kato per- 
turbation on Ho) for some nonnegative A « 1 
and uz: 0 and for all y in a dense domain on 
which H, is essentially self-adjoint, where Ho 
denotes the Schrodinger operator with V — 0 
(called the free Hamiltonian or the kinetic 
energy term), then Ho + V is essentially self- 
adjoint on the same domain. For the case 
where V consists of Coulomb interactions 
between electrons and Coulomb potentials 
on electrons by fixed nuclei, for example, 
such an estimate and hence the essential self- 
adjointness of Hamiltonians for atoms and 
molecules were established first by T. Kato 
(Trans. Amer. Math. Soc., 70 (1951). 

For a 1-body Schródinger operator (or 2- 
body Schródinger operator after the center 
of mass motion has been separated out), the 
point spectrum is that of the particle trapped 
by the potential, and the state represented by 
its eigenvector is called a bound state. The 
eigenvalue is nonpositive for a reasonable class 
of potentials V (for example, if V(x) (xe R?) is 
continuous and ((|x}~'~*) as |x|-» oo for some 
e>0), and its absolute value is called the bind- 
ing energy. The eigensolutions of the Schró- 
dinger equation are what have been called 
stationary states above. 

There are also stationary solutions that 
do not correspond to the point spectrum and 
hence are not square integrable. They are used 
in the stationary methods of scattering theory 
(— 375 Scattering Theory). 


E. Some Exact Solutions for the 1-Body 
Schródinger Equation 


(1) Harmonic oscillator. First consider the case 
in which the space is of 1 dimension, so that 
the Laplacian A is (d/dx)*. Let m be the mass 
of the particle and V(x)= mo? x?/2 for a posi- 
tive constant c (called the angular frequency). 
The Hamiltonian 


H = — (h?/2m)(d/dxy) + mo? x?/2 
has simple eigenvalues 
E,—ho(n--(1/2), | n20,1,2,..., 


with a complete orthonormal set of eigen- 
functions 


V. (x) lie t}, 


where H,(q) is a tHermite polynomial and c, is 
the normalization constant: 


q —(moo[h)'? x, 


(n/2] 
H,(q)= (Ui ntQay "i (n — 2k)tk!], 


c, = 2?" (n? nh/(mo)) ^. 
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When the space is of r dimensions, n in E, is 
replaced by n, +... +n, with nonnegative 
integers n,,...,n,, and the corresponding eigen- 
function is TTj-, v, (xj 

(2) One-dimensional square-well potential. 
Let V(x) V for |x| €x a/2 and V(x) 20 for |x| > 
a/2 (x eR). If V z 0, there are no point spectra. 
If V «0 and 


N —1«a( —2mV)2 (nh) « N, 


then there are N eigenvalues (N —1,2,...) 
obtained as the roots E «O0 of one of the fol- 
lowing equations: 


((V/E) — 1)? = tan (a( - 2mE)! 2 /(2h)), 
((V/E) — 1)? = —cot{a(—2mE)!2/(2h)}. 


(3) Separation of angular dependence for 
central potential. If V(x) (x eR?) depends only 
on r -|x| 2 (22. (xj)? (called a central poten- 
tial), then all eigenvalues E and a basis for 
eigenfunctions y (x) can be obtained in terms 
of the polar coordinate r, 0, o (x, 2rsin0cos o, 
X,=rsin 0 sin gy, x4 —rcos0) as 


W(x) = Yin(8, o)r "utr 
—(h?/(2m))u"(r) + {h (1+ 1)/(2mr) 
+ V(r)— E u(r) «0, 


oo 

4 -| |u(r)|? dr < oo, 
0 

where the angular function Y,, is an eigenfunc- 

tion of the square L? of the orbital angular 

momentum L — — ix x V: 


Yim = ci, Pr (cos 6)e'"*, 
Cj, 7 (—1)" (21 + 1) (L— m)tI/(4n(1 + m)!) 92, 


m-l,i—1,..,—l*1,—1l  Iz0,12.... 


Here Pr'(x) is an tassociated Legendre 
polynomial: 


Pr(x) 2 (1 — x2) P (q!*"(x? — 1)/dx!*™)/(2'1!). 


The above equation for u(r) is called the radial 
equation. The nonnegative integer / is the 
azimuthal quantum number, and the integer m 
is the orbital magnetic quantum number. The 
wave function w(x) with the angular depen- 
dence Y,„(0, ol is called the S-wave, P-wave, D- 
wave, ... according as /=0, 1,2, .... 

(4) Hydrogen-type atom. Let V(r)= — Ze?/r 
(Z 7 0). For each l and m, there are eigenvalues 
— e Z? (2an?) with eigenfunctions Wyim = 
r ur) Y, (0, p), where n=1+4+ 1, 1+2,... is 
the principal quantum number, 
unl) =u Lii (ss le *?,  s—2Zr/(na), 
Gaz —{(n—l— D'E2Z/a(n +1) ]?/29))??, 


and L4(x) is the uth derivative of the tLaguerre 
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polynomial 


Ly(x)= » (N!)?(—x)"/{(N —v)07]. 


The eigenvalue is determined by n, and its 
multiplicity is n*, corresponding to the differ- 
ent possible values of | and m. 


F. Path Integrals 


R. P. Feynman (Rev. Mod. Phys., 20 (1948)) 
has given the solution of the Schródinger 
equation as an integral of el over all possible 
paths q(t), where L= L(q, å) (d — (d/dt)q(t)) is 
the classical Lagrangian for the Hamiltonian 
system. This integral is called the Feynman 
path integral. Mathematical reformulation of 
the formula in terms of the Wiener measure 
has been given by M. Kac (Proc. 2nd Berkeley 
Symp. Math. Statist. Probability, 1950; Proba- 
bility and related topics in the physical sciences, 
Wiley, 1959). 

Consider the 1-body Schródinger operator 
H = Ho + V (form sum), where V is the sum of a 
locally integrable function bounded below and 
a Kato perturbation on Ho. Let b(t) (t >0) be 
the Wiener process and q(t) - hb(t)/(2m)!?. For 
any L, functions f, 


(ef) (x) 
=E (re + q(t))exp d — | V(x+ voten) 


0 

for almost all x, where E denotes the expecta- 
tion for the Wiener process. If V is a sum of 
L, and L,, functions (for spatial dimension 
x 3), then the right-hand side is continuous in 
x for t» 0. This is called the Feynman-Kac 
formula. 

Let Lo be the Hamiltonian for a 1-dimen- 
sional harmonic oscillator with m =œ = 
h — 1 and y, be the eigenfunction (x)= 
n !^exp(— x?/2). Consider H = Lo + V (form 
sum), where V is a sum of a locally integrable 
function bounded below and a Kato pertur- 
bation on Lo. Let q(t) (teR) be Gaussian ran- 
dom variables with mean 0 and covariance 
E(q(t)q(s)) 22 'exp(— |t —s1) (called the oscil- 
lator process). For any f(x) in L,(R, và dx) 
(j=1,...,n) and to €t... € t, &t,4, 


(Jo, e Hf, eI, Le en HH) 


-(TTLga eno] - | 5 Vlas) as} 


The above path integral formulas are closely 
related to the Trotter product formula 


e —t(A-- B) = lim (e Aire tmn (t > 0), 


noco 


where A and B are self-adjoint operators 
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bounded below and A + B is essentially self- 
adjoint. (The same formula holds without the 
boundedness assumption when teiR.) 


G. The Dirac Equation 


The Schródinger equation is not relativistically 
invariant. The Klein-Gordon equation 


( yy =0 Sg 12 mc 
C)-*)v-0, 0a gag EE 


is obtained by replacing p, by (h/i)0/0x;, 
(k — 1,2,3) and E by ihdé/ét in the relativistic 
identity 


E? —m?^c* + pic, 


where c is the speed of light. Wave functions of 
free particles are believed to satisfy this equa- 
tion. P. A. M. Dirac assumed that the y of a 
free electron is expressed in terms of a tspinor 
with four components satisfying a linear dif- 
ferential equation that automatically implies 
the Klein-Gordon equation. Relativity re- 
quires the equal handling of space and time. 
The Dirac equation 

3 d 


Y EE (8) 


4-0 





(x? — ct) satisfies these requirements. The 
coefficients y" can be so determined that 
every component of y also satisfies the Klein- 
Gordon equation. Thus the y“ are found to be 
4 x 4 matrices satisfying the commutation 
relations ^y" - y "y" —2g"" (jg, v —0, 1, 2, 3), 
where g^" —0 for uix v and g? = —g™=1 (k= 
1,2, 3). Sixteen linearly independent matrices 
are obtained by repeated multiplication of five 
matrices, which include the four matrices y, 
zi. y^, y? and the identity matrix. Any 4 x 4 
matrix can be expressed as a linear combina- 
tion of these sixteen matrices. 

The Dirac equation has tplane wave 
solutions 


dis, t) - uexp((i/h)p: x — (i/f) Et), 
where the energy eigenvalues E are 
x / mi cs rp. 


There are four independent eigensolutions u™?, 
vi, u®, and uf*, because u has four compo- 
nents. Two of them are of positive energy and 
the other two are of negative energy. Although 
the negative energy case is physically undesir- 
able, it has to be taken into account in order 
to obtain a mathematically complete set. To 
solve this difficulty, Dirac proposed the hypoth- 
esis (Dirac’s hole theory) that all the negative 
energy states are filled up by an infinite num- 
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ber of electrons in the normal state of the 
vacuum. The absence from the vacuum of a 
negatively charged electron in a negative 
energy state could then be expected to mani- 
fest itself as a positively charged particle (posi- 
tron) with positive mass and kinetic energy. If 
gamma rays are absorbed to excite an electron 
from a negative energy state into a positive 
energy state, an electron-position pair must be 
created. Y. Nishina and O. Klein calculated 
the cross section of Compton scattering (the 
Klein-Nishina formula) by use of the Dirac 
equation and found good agreement with 
observations, thus providing evidence that the 
Dirac equation is correct. The existence of 
negative energy states, however, forces us to 
give up considering the Dirac equation as 

an equation of one electron. The positron 
theory is introduced, and the Dirac equation 
is considered as the classical field of electron 
waves and is second-quantized (— 377 Second 
Quantization). 

The Klein-Gordon equation can also be 
considered to be the classical wave equation of 
matter and can be second-quantized. Motions 
of particles with zero spin, pi mesons (n) for 
example, obey this equation. 

We can rewrite the Dirac equation as 
ihoy/0t =H, H — ca: p4- mc? f, where y* = 
Ba, (k=1, 2,3), and y? — fl. Since H cannot 
commute with the orbital angular momentum 
of an electron L=r x p= — ihr x V, L is not 
conserved. However, the total angular momen- 
tum J =L + (h/2)e can be conserved when o is 
a vector whose components can be given as 


a, 0 1 0 0 o 
h = = 
¢ ` Leier? E WÉI ( ol 


and the o,, called Pauli spin matrices, are given 


E 0 1 0 -i 1 0 

E $ 6, = 5 0z = . 
Porc gr Re o eed 
(The y" are called Dirae’s y matrices.) The 


e h . A 
quantity S 259 is the intrinsic angular momen- 


tum of the electron, also called the spin. Many 
particles besides the electron, the neutron for 
example have spin. The matrix S? = 52 + S? + 
S2 is diagonal and is equal to s(s+ 1)? I (I is 
the identity matrix). For the electron s= 1/2, 
and h/2 is called the absolute value of the spin. 
Therefore we say that electrons are particles of 
spin h/2. This was predicted in the theory of 
light spectra. 

When the speed of an electron is very small, 
so that (v/c)? can be neglected, states of the 
electron can be expressed in terms of two- 
component wave functions. This approxi- 
mation is called the Pauli approximation. If 
the spin-orbit term that appears in the Pauli 
approximation is also neglected, these two 
components become independent of each 
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other and individually satisfy the Schródinger 
equation. 


H. Application of Representation Theory of Lie 
Groups 


A symmetry (with active interpretation) is 

a bijective mapping of pure states (repre- 
sented by unit rays in a Hilbert space) preserv- 
ing transition probabilities between them. 
Wigner's theorem says that any symmetry can 
be implemented by either a unitary or an 
antiunitary mapping of the underlying Hilbert 
space as a mapping of unit rays onto them- 
selves. Furthermore a connected Lie group 

of symmetries is implemented by unitaries, 
which form a projective representation. Eigen- 
states y of the Schródinger equation are func- 
tions of the coordinates of each particle. Let 
these coordinates be denoted together as x. 

Suppose that an operator T operates on the 
x, as, for example, a rotation of the coordinate 
system or a permutation of the labels of the 
particles. If T commutes with H or H is invar- 
iant under the transformation x x' = Tx, 
then Ty (x)= y'(x) 2 V(T ^! x) satisfies the same 
Schrödinger equation as y, where the trans- 
formation of the function is defined by y’(x’)= 
V (x). The set of transformations xx’ = Tx 
forms a tgroup {T}, and the corresponding 
transformations y — TV give a (generally 
infinite-dimensional) representation of this 
group, which should be tunitary on L;-space 
relative to the Lebesgue measure dx if T leaves 
the measure invariant. There are degeneracies 
of the energy eigenvalues, each of which is 
equal to the dimension of the corresponding 
representation of the group {T}. If the repre- 
sentation for each eigenvalue is decomposed 
into irreducible ones, then the decomposed 
stationary state can be labeled by an firreduc- 
ible representation. 

When H is spherically symmetric, i.e., H is 
invariant under the 3-dimensional rotation 
group, states are classified by the irreducible 
representations D, of the trotation group (— 
258 Lorentz Group). The square of the sum L 
of all orbital angular momenta has eigenvalues 
L(L+ 1)h?, where L must be 0 or a positive 
integer. There are 2L + 1 degenerate states, 
each of which belongs to a different M, the 
z-component of L, where M ranges from L to 
— L by unit steps. Even when there is an inter- 
action between the orbital angular momentum 
and the spin angular momentum, states are 
labeled by the irreducible representations D, 
of the rotation group, where J is the sum of 
the orbital angular momentum L and the spin 
angular momentum S (J =L +S) and eigen- 
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values of J? are given by J (J +1)h?. Each J 
must be zero, a positive integer, or a half- 
integer. Adding inversions to the pure rota- 
tions, we obtain the 3-dimensional orthogonal 
group (— 60 Classical Groups I). Irreducible 
representations of this group are written as 
Di. where + corresponds to the characters 
of the inversion relative to the origin. States 
with 4- are called even states and those with 
— , odd states. For example, energy levels of 
atoms and nuclei can be classified by Dj. 

To obtain matrix elements of observables 
between two stationary states, group repre- 
sentation theory is useful. The transformation 
of every observable obeys a certain rule under 
the transformation of coordinates. The scalar 
is transformed according to Dj , the vector 
according to D, , the pseudovector according 
to D. and the traceless tensor according to 
Dj . If the transformation of an observable 
is given by D,, then a matrix element of this 
observable between the states belonging to D, 
and D,. vanishes unless the tensor product 
representation D, ® D, contains as a factor a 
representation equivalent to D,.. In electro- 
magnetic transitions in atoms or nuclei, D, ® 
Dy, =Dy4,+Dy4+Dy,_, (J' 21) if the electric 
dipole transition dominates (J = 1). This im- 
plies the selection rule J' J’ +1, J’, J' — 1. 
When J' —0, only the transition 01 is pos- 
sible. More general selection rules can be 
obtained in the same way for general multi- 
pole transitions. Representation theory is 
useful in determining general formulas of 
transition strengths. 

There is a class of particles, many of which 
can occupy the same state, called bosons. 
There is another class of particles, of which 
only one can occupy a given state, called fer- 
mions. For example, the electron, neutron, and 
proton are fermions, while the photon and pi 
meson are bosons. Two identical particles, 
both of which are either fermions of the same 
kind or bosons of the same kind, cannot be 
distinguished. Therefore the Hamiltonian 
should be invariant under permutations be- 
tween identical fermions, or between identical 
bosons. A system consisting of N identical 
particles can be classified by the irreducible 
representations of the fsymmetric group Sy of 
N elements. When the particles are fermions, 
two of them cannot occupy the same state (this 
law is called the Pauli principle), so that only 
totally antisymmetric states are permissible 
for fermions. When a system consists only of 
fermions of the same kind with spin h/2 and a 
Hamiltonian of this system does not include 
terms depending on spins, then the wave func- 
tions are just products of spin and orbital 
parts. In order to make wave functions totally 
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antisymmetric, orbital wave functions with a 
total spin v/2 should be limited to those corre- 
sponding to the *' Young diagram [2% °, 1°]= 
T(2,2,...,1,1). 


I. Polyatomic Molecules 


The group generated by the 2-dimensional 
rotations about the axis connecting two atoms 
and the reflections with respect to the planes 
containing this axis is used to classify states 

of diatomic molecules. Stationary states can 
be classified by the absolute value A of the 
angular momentum of the diatomic system 
around this axis, which can be zero or a posi- 
tive integer. When A 2 1, the corresponding 
state has twofold degeneracy, whereas when A 
— 0, two states (labeled by +) arise, depending 
on the character of the reflections. If these two 
atoms are identical, the molecular states can 
be classified further as even and odd according 
to the character of reflections with respect to 
the plane containing the center of mass and 
perpendicular to the axis. The classification of 
the spectral terms of a polyatomic molecule is 
related to its symmetry, described by the set of 
all transformations that interchange identical 
atoms. For example, stationary states of meth- 
ane molecules CH, are classified by the irre- 
ducible representations of the group T, (which 
is generated by adjoining reflection symmetry 
to the *tetrahedral group T). Level structures 
of a crystal are classified by the irreducible rep- 
resentations of its symmetry groups. 

In the approximation of many-body prob- 
lems by means of independent particles, the 
wave function of the total system is con- 
structed by multiplying the wave functions of 
the individual particles. To construct such a 
wave function, it is useful to consider the re- 
duction to irreducible parts of the *tensor 
products of representations of the groups 
attached to the individual particles. For exam- 
ple, an atom with two electrons carrying the 
angular momenta J, and J, has 2J' +1 differ- 
ent angular momentum states, where J' = 
min(J,, J2) corresponding to the decompo- 
sition D, & D, =D; +3, Dj +3,1 ++ 
D, . The right-hand side of this equation 
gives all possible states of the atom. 


J. Charge Symmetry 


The proton and the neutron can be considered 
to be different states of the same particle, 
called the nucleon, because these two particles 
have very similar natures except for their 
charges, masses, etc. As an approximation, the 
Hamiltonian of a system consisting of protons 
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and neutrons may be taken to be invariant 
under the interchange of protons and neu- 
trons. This invariance is called charge sym- 
metry. Analogous to ordinary spin, isospin can 
be introduced to describe the two states of nu- 
cleons. The up state of the isospin corresponds 
to the proton and the down state corresponds 
to the neutron. Consider transformations 
belonging to the tspecial unitary group SU(2) 
in the 2-dimensional space spanned by the 
proton state and the neutron state. If a Hamil- 
tonian of N nucleons is invariant under any 
transformation belonging to SU(2), then the 
eigenstate of these N nucleons is classified 

by its irreducible representations Dy, where 

T stands for the total isospin of each state. 
This invariance, called isospin invariance, 
holds in the nucleus and elementary particles 
if electromagnetic and weak interactions, and 
possibly the interaction responsible for the 
proton-neutron mass difference, are neglected. 
When a state of N nucleons has an isospin 

T —v/2, the orbital-spin wave function of this 
state must correspond to the Young diagram 
[2^ ", 1"], since the isospin wave function 
multiplied by the orbital-spin wave function is 
a totally antisymmetric wave function. If this 
Hamiltonian is also independent of spin, it is 
invariant under unitary transformations in the 
4-dimensional space spanned by the four inter- 
nal states of the nucleon: up and down spins, 
up and down isospins. Therefore the states of 
N nucleons can be classified by the irreducible 
representations of the group U(4) (Wigner's 
supermultiplet theory). 


K. The C*-Algebra Approach 


The uniqueness of operators satisfying the 
canonical commutation relations (represen- 
tations of CCR’s) up to quasi-equivalence (— 
Section C) no longer holds if the number of 
canonical variables become infinite (a so-called 
system of infinitely many degrees of freedom), 
a point first emphasized by K. O. Friedrichs 
(Mathematical aspects of the quantum theory 
of fields, Interscience, 1953), and physical 
examples illustrating this point were given 
by L. van Hove (Physica, 18 (1952)) and R. 
Haag (Mat. Fys. Medd. Danske Vid. Selsk., 29 
(1955)). The use of C*-algebras in physics was 
first advocated by I. E. Segal (Ann. Math., 
48 (1947)), and the physical relationship 
among all inequivalent representations of a 
C*-algebra was first discussed by R. Haag and 
D. Kastler (J. Math. Phys., 5 (1964)). 

In C*-algebra approach, a physical obser- 
vable is an element of a C*-algebra A and a 
state is a functional o on % (its value (A) is 
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the expectation value of the observable AeA 
when measured in that state) that is linear, 
positive in the sense p(A*A)>0 for any Ac?t, 
and normalized, i.e., ||| 2 or equivalently 
Q(1)— 1 if Legd The *GNS construction associ- 
ates with every state o a Hilbert space H,, a 
representation z,(A), AeA, of 91 by bounded 
linear operators on H,, and a unit cyclic vec- 
tor Q, in H, such that o(A4) —(O,,, 2,(A)Q,). 
Two states o and y (or rather z, and z,) are 
called disjoint if there is no nonzero mapping 
T from H, to H, such that Tr, (A) — n,(A4)T 
for all Ae WU. Abundant disjoint states occur 
for a system of infinitely many degrees of 
freedom, e.g., equilibrium states of a infinitely 
extended system with different temperatures 
(— 402 Statistical Mechanics), superselection 
sectors explained below (— 150 Field Theory) 
and equilibrium or ground states with broken 
symmetry. 

Because actual measurement can be per- 
formed only on a finite number of observables 
(though chosen at will from an infinite number 
of possibilities) and only with nonzero experi- 
mental errors, information on any state o can 
be obtained by measurements only up to a 
neighborhood in the weak topology: |g(A;)— 
aj| € & i— 1, ..., n. A set K, of states can 
describe measured information on states at 
least equally well as another set K, (K, physi- 
cally contains K,) if the closure of K, in the 
weak topology contains K,. From another 
viewpoint, all states of K, are weak limits of 
states in K, and are physically relevant if 
states in K, are physically relevant. The set of 
all mixtures of vector states (V, z(A)w) for any 
fixed faithful representation z of W is weakly 
dense in the set of all states of 9L, a point em- 
phasized by Haag and Kastler as a foundation 
of the algebraic viewpoint in the formulation 
of quantum theory. 

Under 360? rotation a vector representing 
a state of a particle with spin h/2 acquires a 
factor —1 (— 258 Lorentz Group), while the 
vacuum vector would be unchanged. A non- 
trivial linear combination (superposition) of 
these two would then be changed to a vector 
in a different ray. If the 360? rotation is not to 
produce a physically observable effect, then we 
should either forbid nontrivial superpositions 
of states of the two classes or, equivalently, 
restrict observables to those leaving the sub- 
space spanned by vectors in each class invar- 
iant so that the relevant linear combinations 
of vectors, when considered as states on the 
algebra of observables in the form of expecta- 
tion functionals (y, Ay), are actually mixtures 
(rather than superpositions) of states in two 
classes and are invariant under the 360° rota- 
tion. This is called the univalence superselec- 
tion rule and has been pointed out by A. S. 
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Wightman, G. C. Wick, and E. P. Wigner 
(Phys. Rev., 88 (1952)). 

In quantum field theory, the vacuum state 
can be taken to be pure (by central decompo- 
sition if necessary) and in the associated GNS 
representation (called the vacuum sector) all 
vectors can be assumed to be physically rele- 
vant pure states. In principle, all physically 
relevant information is in the vacuum repre- 
sentation; for example, a particle with spin h/2 
can also be discussed in the vacuum sector if 
we consider a state of this particle in the pre- 
sence of its antiparticle at a far distance, such 
as behind the moon (the behind-the-moon 
argument). However, it is mathematically more 
convenient to consider the states of the par- 
ticle without any compensating object (in the 
same way that an infinitely extended gas is 
more convenient for some purposes than a 
finitely extended gas surrounded by walls), 
which can be obtained as weak limits of states 
in the vacuum sector by removing the com- 
pensating particle to spatial infinity and which 
produce inequivalent representations called 
superselection sectors. 


L. Foundation of Quantum Mechanics 


Hilbert spaces and the underlying field of 
complex numbers, which constitute a mathe- 
matical background for quantum mechanics, 
are not immediately discernible from physical 
observations, and hence there are various 
attempts to find axioms for quantum mechan- 
ics that imply the usual mathematical struc- 
ture and at the same time allow direct physical 
interpretation. 

One approach of this kind focuses attention 
on the set of all observables that have only 
two possible measured values 1 (yes) and 0 
(no), called questions, together with their order 
structure (logical implications) and associated 
lattice structure (join, meet, and orthocomple- 
mentation as logical sum, product, and nega- 
tion). This ts called quantum logic in contra- 
distinction to the situation in classical physics, 
where it would form a Boolean lattice. The 
lattice of all orthogonal projections (corre- 
sponding to all closed subspaces of a Hilbert 
space) in quantum mechanics is a tcomplete, 
torthocomplemented, weakly modular (also 
called orthomodular) *atomic lattice satisfy- 
ing the covering law, where weak modularity 
means c ^(c' v b) - b and b v(b' Ac) 2 c when- 
ever b «c, and the covering law means that 
every b #0 possesses an atom p under it (p <b) 
and that if an atom q satisfies q^ b —0, then 
any c between qv b and b(qvbz cz b)isb 
or qv b. Conversely any such lattice is a di- 
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rect sum of irreducible ones, each of which, 

if of dimension (length of longest chain) 3, 
can be obtained as the lattice of subspaces V 
satisfying (V^). = V in a vector space over a 
(generally noncommutative) field with an anti- 
automorphic involution *, equipped with a 
nondegenerate Hermitian form. In this ap- 
proach, an additional requirement is needed 
to restrict the underlying field and its * to be 
more familiar ones, such as real, complex, or 
quaternion fields and their usual conjuga- 
tions +. If that is done, then the set of all prob- 
ability measures on the lattice De, assign- 
ment of expectation values 0 < u(a) € 1 for all 
elements a in the lattice such that p(\/;a;)= 

X; u(aj) if a; La; for all pairs ij, u(a) z 0, 

L(1) = 1) is exactly the restriction to questions 
of states p(A) — tr(p.A) given by the density 
matrices p (Gleason's theorem). 

It is also possible to characterize the set of 
all states equipped with the convex structure 
(mixtures) geometrically. The set of all states 
(without the normalization condition) of a 
finite-dimensional, formally real, irreducible 
tJordan algebra over the field of reals (the 
positivity of a state ọ is defined by (a?) 20) 
has been characterized as a transitively homo- 
geneous self-dual cone in a finite-dimensional 
real vector space (a cone V is transitively 
homogeneous if the group of all nonsingular 
linear transformations leaving V invariant is 
transitive on the topological interior of V) by 
E. B. Vinberg (Trans. Moscow Math. Soc., 12 
(1963); 13 (1965)), where the relevant Jordan 
algebras were completely classified earlier by 
P. Jordan, J. von Neumann, and E. P. Wigner 
(Ann. Math., 36 (1934)) as direct sums of the 
following irreducible ones: the Jordan algebra 
(with the product Ao B — (AB + BA)/2) of all 
Hermitian n x n matrices over the real, com- 
plex, or quaternion field, all 3 x 3 Hermitian 
matrices over octanions, or the so-called spin 
balls (the set of all normalized states being 
a ball) linearly generated by the identity and 
y; G — L, ..., n) satisfying "om —0 if j £k and 
Wah 

In infinite-dimensional cases, this type of 
characterization extends to the “natural” posi- 
tive cones of vectors (A. Connes, Ann. Inst. 
Fourier, 24 (1974); J. Bellissard and B. Iochum, 
Ann. Inst. Fourier, 28 (1978)); while the convex 
cone of all states (without normalization) of 
Jordan algebras and C*-algebras have been 
characterized in terms of a certain class of 
projections associated with faces of the cone, 
called P-projections, by E. M. Alfsen, F. W. 
Shultz, and others (Acta Math., 140 (1978); 144 
(1980)). In finite-dimensional cases, Araki 
(Commun. Math. Phys., 75 (1980)) has given a 
characterization allowing direct physical inter- 
pretation by replacing P-projection with a 
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notion of filtering corresponding to quantum- 
mechanical measurement. 

Due to some features of quantum- 
mechanical measurement not in conformity 
with common sense, there have arisen hidden 
variable theories that are deterministic and 
reproduce the quantum-mechanical prediction. 
For a situation where a pair of (correlated) 
particles in states a and b are created and 
their spins (1 or —1, i.e., up or down spin) 
measured at positions distant from each other, 
the expectation value E(a, b) for the product 
would be given in a hidden variables theory 


by E(a, b) — |n dp(A) for a probability 


measure p and the functions A, and B, of 
hidden variables 2, representing spins and 
hence satisfying | 4,| € 1 and |B,| x 1. Then the 
following Bell's inequality holds: 


| E(a, b) — E(a, b') + E(a’, b) + E(a', b’)| <2. 


This contradicts both quantum-mechanical 
predictions and experimental results, so that 
hidden variable theories of this type have been 
rejected. 
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Quasiconformal Mappings 


A. History 


H. Grótzsch (1928) introduced quasiconformal 
mappings as a generalization of conformal 
mappings. Let f(z) be a continuously different- 
iable homeomorphism with positive Jacobian 
between plane domains. The image of an in- 
finitesimal circle |dz|= constant is an infini- 
tesimal ellipse with major axis of length (| f;| 

+ |f|)|dz| and minor axis of length (| f,]— 

| f:|)|dz|. When the ratio K(z)=(|f,|+ 
IEDAIF1—l&|) is bounded, f is called quasi- 
conformal. If K z 1, then f is conformal. 
Grótzsch noticed that Picard's theorem still 
holds under the weaker condition; he deter- 
mined the quasiconformal mappings between 
two given domains, which are not conformally 
equivalent to each other, providing the smallest 
sup K, that is, those closest to conformality 
[1]. 

We cannot speak of the history of quasi- 
conformal mappings without mentioning the 
discovery of extremal length by A. Beurling 
and L. V. Ahlfors (— 143 Extremal Length), 
which has led to the precise definition for 
` quasiconformality itself. 

Quasiconformal mappings have less rigidity 
than conformal mappings, and for this reason 
they have been utilized for the type problem or 
the classification of open Riemann surfaces 
(Ahlfors, S. Kakutani, O. Teichmiiller, K. I. 
Virtanen, Y. Tóki; — 367 Riemann Surfaces). 
Quasiconformal mappings have important 
applications in other fields of mathematics, 
e.g., in the theory of !partial differential equa- 
tions of elliptic type (M. A Lavrent'ev [2]) 
and especially in the problem of moduli of 
Riemann surfaces, including the theory of 
Teichmüller spaces (— 416 Teichmüller 
Spaces). These applications are explained in 
Sections C and D. 


B. Definitions 


The current definitions of quasiconformality, 
which dispense with continuous differentia- 
bility, are due to Ahlfors [3], A. Mori [4], and 
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L. Bers [5] (^ C. B. Morrey [6]). Consider an 
orientation-preserving topological mapping f 
of a domain D on the z( =x + iy)-plane. The 
quasiconformality of f is defined as follows. (1) 
(the geometric defintion) Let Q be a curvilinear 
quadrilateral, i.e., a closed Jordan domain with 
four specified points on the boundary, and 

let the interior of Q be mapped conformally 
onto a rectangular domain I. The ratio (> 1) 
of the sides of I, called the modulus of Q and 
denoted by mod Q, is uniquely determined. If 
mod f(Q) x K mod Q for any curvilinear 
quadrilateral Q in D, then f is called a K- 
quasiconformal mapping of D. This is equiva- 
lent to: (2) (the analytic definition) f is ab- 
solutely continuous on almost every line seg- 
ment parallel to the coordinate axes contained 
in D (this condition is often referred to as ACL 


K-1 
in D) and satisfies the inequality || Ski f. 


almost everywhere in D with some constant 

K z 1. When the value of K is irrelevant to the 
problem considered, K-quasiconformal map- 
pings are simply said to be quasiconformal. 

The K-quasiconformal mapping f satisfies 
the so-called Beltrami differential equation 
fz- uf. 
almost everywhere in D with the measurable 
coefficient u. The maximal dilatation (1 + 
Iu] Mt — llul!) does not exceed K. Some- 
times f is called, for short a u-conformal map- 
ping. These notions are also defined for map- 
pings between tRiemann surfaces, where the 
(—1, 1)-form udz dz! is independent of the 
choice of the local parameter z. 

If in the above statements f is not neces- 
sarily topological but merely a continuous 
function satisfying the same requirements, 
we call it a -conformal function. (If in addi- 
tion lulle «(K — D/K + 1), we call it a K- 
quasiregular function or K-pseudoanalytic 
function.) A -conformal function is repre- 
sented as the composite g oh of an analytic 
function g with a u-conformal mapping h. 


C. Principal Properties and Results 


The inverse mapping of a K-quasiconformal 
mapping is also K-quasiconformal. The com- 
posite mapping f; o f; of a K,-quasiconformal 
mapping f, with a K,-quasiconformal 
mapping fz, if it can be defined, is K, K,- 
quasiconformal. A 1-quasiconformal mapping 
is conformal. Every quasiconformal mapping 
is ttotally differentiable a.e. (almost every- 
where), its Jacobian is positive a.e., and (|f,| 4- 
LEE 12D SK ae. 

Let f be a K-quasiconformal mapping of |z| 
« 1 onto |w| « 1. Then f extends to a homeo- 
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morphism of |z| € 1 onto |w|< I. If, further- 
more, f(0)=0, then the Hölder condition 





Iz; —z;l V. c UK 
6 | SUteu-fGals lelz, —23] 
holds for |z,| € 1, [z;| € 1, and 16 is the best 
coefficient obtainable independently of K 
(Mori). This shows that any family of K- 
quasiconformal mappings of |z| « 1 onto |w| 
« 1 is *normal. For further properties and 
bibliography — O. Lehto and Virtanen [7] 
and Ahlfors [8]. 


(a) Boundary Correspondences and Extensions. 
Ahlfors and Beurling characterized the corre- 
spondence between |z|=1 and |w| 2 1 induced 
by f [9]. What amounts to the same thing, the 
following theorem holds: Let p(x) be a real- 
valued monotone increasing continuous func- 
tion on R such that Im... “(x)= +00. Then 
there exists a quasiconformal mapping of the 
upper half-plane y>0 onto itself with bound- 
ary correspondence x> p(x) if and only if 

1 metal 


7&7 up 

p u(x)—u(x-t) 

for some constant p 2 1 and for all x, te R. 
Theorem of quasiconformal reflection (Ahl- 

fors [10]). Let L denote a curve which passes 

through oo and divides CU {00} into two 


domains 2, Q* such that QU LUO* — CU {a}. 


Then there exists an orientation-reversing 
quasiconformal mapping of €) onto Q* which 
keeps every point of L fixed if and only if some 
constant C exists satisfying |65 —¢,|/|€, — 
G[sC for any three points ¢,, Go, Ge on L 

such that ed és, 


(b) Mapping Problem. Given a measurable 
function u in a simply connected domain D 
with "ul. <1, there exists a u-conformal 
mapping of D onto a plane domain A which is 
unique up to conformal mappings of A [8]. 
When z is real analytic and the derivatives of 
functions are defined in the usual manner, a 
classical result concerning the tconformal 
mapping of surfaces asserts the existence of a 
solution of Beltrami's differential equation 
fz- uf. 

Concerning the dependence of u-conformal 
mapping on 4, Ahlfors and Bers [11] obtained 
the following important result: Denote by f^ a 
u-conformal mapping of the whole finite plane 
onto itself that preserves 0 and 1. The space of 
functions u has the structure of a Banach 
space with L,,-norm, and the space of map- 
pings f^ also has the structure of a Banach 
space with respect to a suitable norm. If Lou) 
= u(z; t) is a family of u depending on the 
parameter t with "ul. & k « 1 and y(t) is 
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continuous (resp. continuously differentiable, 
real analytic, complex analytic) in t, then f ^? 
is also continuous (continuously differentiable, 
real analytic, complex analytic). For the proofs 
of these important results, which have opened 
up a new way to study *Teichmiiller space, the 
extension and reflection of quasicoaformal 
mappings are made essential use of. 


(c) Extremal Quasiconformal Mappings. Let 
K(f) denote the maximal dilatation of a quasi- 
conformal mapping f. Suppose that a family 
F = { f } of quasiconformal mappings is given. 
If some fọe F exists such that K( f;) attains 
the infimum of K(f) for all fE F, then fo is 
called an extremal quasiconformal mapping in 
F 
Let R={(x, y)|0<x<a,0<y<b}, R= 
L(x, y)|O<x'<a’,0<y' <b’ be a pair of 
rectangular domains. Let ¥ be the family 
of all quasiconformal mappings of R onto 
R' which map each vertex to a vertex with 
(0, 0) — (0, 0). Then the unique extremal quasi- 
conformal mapping for F is the affine map- 
ping x’ =(a'/a)x, y' —(b'/b) y (Grotzsch [1]). 
Next suppose that we are given two homeo- 
morphic closed Riemann surfaces R, S and a 
thomotopy class ¥ of orientation-preserving 
homeomorphisms of R onto S. Then ¥ con- 
tains a unique extremal quasiconfermal map- 
ping. More precisely, either R and S are con- 
formally equivalent to each other or else R 
admits an essentially unique analytic (2, 0)- 
form ® such that the respective local co- 
ordinates z, w of R, S satisfy the differential 
equation 


(0w/0z)0w/0z) -(K — DAK - 0]5/| — (D 


with some constant K » 1 everywhere on R, 
at which ® 40 (Teichmüller [12], Ahlfors 
[3]). This turns out to be a generalization of 
Grótzsch's extremal affine mapping. The ex- 
tremal mapping f satisfying equation (1) is 
sometimes referred to as the Teichmiiller 
mapping. 

Consider again a -conformal mapping g 
of the unit disk D:|z| « 1 onto itself which 
induces a topological automorphism of the 
boundary [z| 2 1. If we define ¥ as the family 
of all quasiconformal automorphisms f of D 
satisfying f(e'?) « g(e'*?), then the extremal 
quasiconformal mapping in ¥ exists but is not 
always determined uniquely (K. Strebel [13 ]). 
As to the Teichmüller mapping, the unique- 
ness theorem is as follows: If the norm ||®|| = 
f {p|®(z)| dx dy of in (1) is finite, the Teich- 
müller mapping is the unique extremal quasi- 
conformal mapping in F. Otherwise, the 
uniqueness does not hold in general (Strebel 
[13]). On the other hand, a necessary and 
sufficient condition ts proved for the Beltrami 
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coefficient o of a quasiconformal mapping of 
F to be extremal (R. S. Hamilton [14], E. 
Reich and Strebel in [15]). Moreover, this 
last result can be extended to the extremal 
quasiconformal mapping between arbitrary 
Riemann surfaces. 


D. Applications 


In the earlier stage of development of this 
theory, quasiconformal mappings were applied 
only to the ttype problem of simply connected 
Riemann surfaces and to the classification of 
Riemann surfaces of infinite genus (— 367 
Riemann Surfaces). This application is based 
on the fact that it is often possible to find a 
quasiconformal mapping with the prescribed 
boundary correspondence even when no 
equivalent conformal mapping exists and 

the fact that the classes Og and Oy» (— 367 
Riemann Surfaces) of Riemann surfaces are 
invariant under quasiconformal mappings, as 
they are under conformal mappings. 

It is worth remarking that the investigation 
of quasiconformal mappings is intimately 
connected with the recent development of the 
theory of 'Kleinian groups via Teichmüller 
spaces. 

The theory of quasiconformal mappings was 
also applied by Lavrent'ev [16] and Bers [2] 
to partial differential equations, particularly 
to those concerning the behavior of fluids. 
They utilized the fact that if the density and its 
reciprocal are bounded in a steady flow of a 
2-dimensional tcompressible fluid, then the 
mapptng of the physical plane to the potential 
plane (the plane on which the values of the 
tvelocity potential and the fstream function 
are taken as coordinates) is quasiconformal, 
and that if in addition the supremum of the 
*Mach number is smaller than 1, then the 
mapping from the physical plane to the thodo- 
graph plane is pseudoanalytic. 


E. Similar Notions 


The term quasiconformal was used differently 
by Lavrent'ev, as follows: A topological map- 
ping f —u- iv is called quasiconformal with 
respect to a certain system of linear partial 
differential equations when u and v satisfy 
the system. This is a generalized definition 
because the system may not be equivalent 

to a Beltrami equation. However, it is reduced 
to a quasiconformal mapping if the system is 
uniformly elliptic. Bers used the term pseudo- 
analytic to describe a certain function related 
to linear partial differential equations of ellip- 
tic type. This function is pseudoanalytic in 
the sense of Section B on every relatively 
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compact subset and has properties similar to 
those of analytic functions. 

Analytic transformations in the theory 
of functions of several variables are called 
pseudoconformal by some mathematicians, 
and there is a similar term quasi-analytic. The 
latter is an entirely different notion from the 
one discussed in this article. 


F. Generalization to Higher Dimensions 


Let f be a continuous ACL-mapping of a 
subdomain G of R" into R" whose Jacobian 
matrix is denoted by f'(x). Furthermore, the 
operator norm and the determinant of f' are 
denoted by || f’|| and det f’, respectively. Then 
f is said to be quasiregular if all the partial 
derivatives of f are locally of class L" on G 
and if there exists a constant K 2 1 such that 
(II f^ lox)" € K - det f'(x) almost everywhere in 
G. The smallest K > 1 for which this inequality 
is true is called the outer dilatation of f and 

is denoted by K,( f). If f is quasiregular, 

then the smallest K > 1 for which the inequal- 
ity det /'(x) « K : [miny,-., | f'(x + y)|"] holds 
almost everywhere in G is called the inner 
dilatation of f and is denoted by K (f). If 
max(K ,( f), Ko( f )) « K', then f is said to be 
K'-quasiregular. An orientation-preserving 
mapping is called K-quasiconformal (J. Vài- 
sala [17]) if it is a K-quasiregular homeomor- 
phism. When n —2, these definitions agree 
with those given in Section B. 

For nz3 the following properties also still 
hold: A quasiregular mapping is discrete, 
open, totally differentiable a.e. and is abso- 
lutely continuous (O. Martio, S. Rickman, 
and Vaisala [18]). Quasiconformal extension 
of higher-dimensional half-spaces have been 
studied by Ahlfors and L. Carleson [15]. 
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A. General Remarks 


Racah algebra is a systematic method of cal- 
culating the tmatrix element (V, Ay’) in fquan- 
tum mechanics, where A is a dynamical quan- 
tity and y and w’ are irreducible components 
of the state obtained by combining n tangular 
momenta. The angular momentum j has x-, 
y-, z-components jy, jy, Jz, respectively. Each 
component is i times the infinitesimal rotation 
around the respective axis and is the generator 
of the infinitesimal rotation for every irreduc- 
ible component y. The addition of two angu- 
lar momenta leads to a *tensor representation 
D(j,) & D(j;) of two tirreducible representa- 
tions of the 3-dimensional rotation group. 
The problem is to decompose this tensor rep- 
resentation into the direct sum of irreducible 
representations. 


B. Irreducible Representations of the Three- 
Dimensional Rotation Group 


Irreducible representations of the group SO(3) 
of 3-dimensional rotations can be obtained 
from irreducible representations D(j) (j= 
0, 1,2, ...) of its *universal covering group 
SU(2) of 2 x 2 matrices with determinant 1, 
through the 2-fold covering isomorphism 
SO(3)& SU (2)/1 +I} (— 60 Classical Groups I). 
The representation D(j) (j=0, 1/2, 1, 3/2, ...) of 
SU (2) is the 2j-fold tensor product A... Gi A 
of AeSU(2) restricted to the totally symmetric 
part of the 2j-fold tensor product space. Let 
u — (1) and v — (9) be a basis for the complex 2- 
dimensional space on which SU (2) operates. 
The symmetrized tensor product of (j + m)-fold 
u and (j —m)-fold v multiplied by a positive 
normalization constant (m —j,j — l, ..., —j) 
defines an orthonormal basis of the represen- 
tation space of D(j), which we shall denote by 
v Gm). 

Decomposition of the tensor product of two 
irreducible representations D(j,) and D( j;) 
into irreducible components leads to 


D(jj)  D(j;)7 -YD(j) 
Jh tjs 
For the basis we can write 
y Gm) 
= 2. WF, mw Gzm;)G,m; jamal Jj; jm), 


and the coefficients are called the Clebsch- 
Gordan coefficients or Wigner coefficients. The 
vectors Ņ (jm) in each irreducible represen- 


Ji tji-1l...lA —j2l- 
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tation space are determined only up to an 
overall phase factor (a complex number of 
modulus 1). By a suitable choice of the result- 
ing arbitrary phase (which may depend on 
ji» ja, TL the coefficients are given by 

Ji t2jzl tz 
(j m, jam; ji jo jm) 2 (m, - m5, m) 


(9+ DU To DIG =j)! t — Ji)! 


(jı +j2+j+1)! 
GIE 
(jo—mjt(--m)!(j—m)! 
m;- v)! 








EE eg Cl Deh H TE Fee AU 








VU 0m DIG, 7 m2 +m)! 
(5t m, —v)G—ji*m;rv)!G—ji— 


They satisfy torthogonality relations. Another 
concrete expression for the same coefficients, 
but of a different appearance, was obtained 
earlier by Wigner. Wigner introduced the 3j- 
symbol, given by 


( JiJ2J3 )- crai +1)7? 


m,m»;m; 
x(jum,jomiljijaja —ms) 


for m, +m,+m,=0 and zero otherwise. This 
is invariant under cyclic permutations of 1, 2, 3 
and is multiplied by ( —1)': 7:*/» under trans- 
positions of indices as well as under the simul- 
taneous sign change of all the m's. The 3j- 
symbol multiplied by ( —1)2*^:77: is the V- 
coefficient of Racah. 

There are two ways, (D( jj) & D(j;)) ® D(j4) 
and D(j,) & (D(j;) € D(j4)), to reduce the 
tensor product of three irreducible representa- 
tions, and two corresponding sets of basis 
vectors. The transformation coefficient for the 
two ways of reduction is written in the form 


CT, full lf, JaJaGas PD 
SS TEE DQjas +1) Wir od JaiJi2J23)- 


Here W(abcd; ef), called the Racah coefficient, 
can be written as the sum of products of four 
Wigner coefficients. W has the following sym- 
metry properties: 


W(abcd; ef ) - W(badc; ef) 
= W(cdab; ef ) 
= W(acbd; fe) 
ct ITT 77 * W(ebcf; ad) 
=( —1) +- W (aefd; bc) 


and satisfies an orthogonality relation. The 6j- 


abe 


symbol (255; is related to the Racah coefficient 
by 


. —( o 1yatbtctd abe 
W(abdc; ef ) 2 ( —1) ES 
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C. Irreducible Tensors 


A dynamical quantity T} (q=k,k—1,..., —k) 
that transforms in the same way as the basis of 
D(k) under rotation of coordinates is called an 
irreducible tensor of rank k. That is, it satisfies 


Citi, TF]  J/ (kx q (kx: 4-1) Thi, 

Lies Dia 

Here [a,b] — ab — ba. The matrix element of 
this quantity between two irreducible compo- 
nents can be written in the form 

il lef 

—(1//2j - 1 Mail T? |o Ti m'kq | j'kjm), 
where « is a parameter to distinguish multiple 
components with the same j, and components 
of different « are assumed to be orthogonal. 
In this formula the Clebsch-Gordan coeffi- 
cients are determined from group theory, while 
(ajll T? ||a/j) depends on the dynamics of the 
system. 

When T® and U“ operate only on the state 
vectors in the subspaces H, and H,, respec- 
tively, of the total space (tHilbert space) H = 
H, x H,, their scalar product (T9, U)= 
&,(—1) T} U% has the matrix element 
(01% jy j; jm|(T 9, Olai, oj £ jm) 
=(—1) IWO jajaja jk) 

x (o jill T 9o j1) (o j| Uoc, J). 

For an irreducible component of the tensor 
product of two irreducible tensors, 


TT Q pes 
ET 2 T2 Uz2(kq1k5q5 | ky k5kq), 


qı t4254 
the matrix can be written as 
(oj, jail ETE? & U'rslllen, 727) 


= Okt DOE +1 +1) 


SÄI (ai, | T? |a 71) (321 US leja) 


itu 


The last factor, the 9j-symbol, is defined as 
the matrix element between basis vectors of 


[DG) x DG3]xEDG3 x D(j4)] and [D(j,) x 
D(j3)] x [DG2) x DG: 
Chid2Viadisiasa)iml bh J5(U13)72J4 24) jm) 


=V (jia + DQja4 D) O15 + 1)(2j24+ 1) 


h jh jv 
X4 j3 Ja jaa 
jia Joa J 
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The 9j-symbol can be written as a weighted 
sum of the products of the three W's. 

See [6] and [7] for explicit formulas of 
Clebsch-Gordan coefficients and [8] for Racah 
coefficients. 


References 


[1] E. P. Wigner, Group theory and its appli- 
cation to the quantum mechanics of atomic 
spectra, Academic Press, 1959. 

[2] M. E. Rose, Elementary theory of angular 
momentum, Wiley, 1957. 

[3] U. Fano and G. Racah, Irreducible ten- 
sorial sets, Academic Press, 1959. 

[4] A. R. Edmonds, Angular momentum in 
quantum mechanics, Princeton Univ. Press, 
1959. 

[5] F. Bloch, S. G. Cohen, A. De-Shalit, S. 
Sambutsky, and I. Talmi, Spectroscopic and 
group-theoretical methods in physics (Racah 
memorial volume), North-Holland, 1968. 

[6] E. U. Condon and G. H. Shortley, Theory 
of atomic spectra, Cambridge Univ. Press, 
1935 (reprinted with corrections, 1951). 

[7] M. Morita, R. Morita, T. Tsukada, and M. 
Yamada, Clebsch-Gordan coefficients for j, = 
5/2, 3, and 7/2, Prog. Theoret. Phys., Suppl., 26 
(1963), 64—74. 

[8] L. C. Biedenharn, J. M. Blatt, and M. E. 
Rose, Some properties of the Racah and as- 
sociated coefficients, Rev. Mod. Phys., 24 
(1952), 249—257. 


354 (XVI.5) 
Random Numbers 


A. General Remarks 


A sequence of numbers that can be regarded 
as realizations of independent and identi- 
cally distributed trandom variables is called 

a sequence or table of random numbers. It is 

a basic tool for the "Monte Carlo method, 
tsimulation of stochastic phenomena in nature 
or in society, and tsampling or trandomiza- 
tion techniques in statistics: Random numbers 
used in practice are pseudorandom numbers 
(— Section B); theoretically, the definition 

of random numbers leads to an algorithmic 
approach to the foundations of probability 
[1,2]. 


B. Pseudorandom Numbers 


Tables of numbers generated by random 
mechanisms have been statistically tested and 
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published. To generate random numbers on a 
large scale, electronic devices based on sto- 
chastic physical phenomena, such as thermo- 
electron noise or radioactivity, can be used. 
For digital computers, however, numbers 
generated by certain simple algorithms can be 
viewed practically as a sequence of random 
numbers; this is called a sequence of pseudo- 
random numbers. 

Distribution of random numbers that are 
easily generated and suitable for general use is 
the continuous uniform distribution on the 
interval (0, 1), which is approximated by the 
discrete distribution on (0, 1, ... 
Random numbers with distribution function 
F(-) are obtained by transforming uniform 
distributions by F !(-). For typical distribu- 
tions, computation tricks avoiding the direct 
computation of E !(*) have been devised. 
Among them the use of *order statistics and 
acceptance-rejection techniques have wide 
applicability. 

For the generation of uniform pseudoran- 
dom numbers on (0,1,..., N— 1}, N=n*(n=a 
computer word length), the following algo- 
rithms are used. Each of them is written in 
terms of simple computer instructions. (1) The 
middle-square method was proposed by von 
Neumann. We square an integer of s digits of 
radix (or base) n and take out the middle s 
digits as the next term. We repeat this process 
and obtain a sequence of pseudorandom num- 
bers. The sequence thus generated might be 
cyclic with a short period, possibly after many 
repetitions. The lengths from initial values to 
the terminal cycles are empirically checked. 
(2) The tFibonacci sequence {u,} defined by 
Uy 41 =U,-, +u, (mod m) is apparently regular, 
but it is uniformly distributed. (3) The con- 
gruence method [3]: Define a sequence by 
Uy 44 & au, T c (mod m) or (mod r^ + 1). If c=0, 
the procedure is called the multiplicative con- 
gruence method, otherwise the mixed con- 
gruence method. The cycle, that is, the mini- 
mum k such that u, — He, and the constants 
a, c, and uo that make the cycle maximum 
for given n and s are determined by number 
theory. The points (uj, uj. ,, ... Hun K= 
0, 1, 2, ..., lie on a small number of parallel 
hyperplanes in the /-dimensional cube. Good 
choice of the constant a makes the sequence 
quite satisfactory. (4) H. Weyl considered 
sequences f(k) 2 ka (mod 1), where « is an 
irrational number and Kk — 1, 2,..., whose 
values are uniformly distributed on the inter- 
val (0, 1). They are not independent, though 
they can be used for some special purposes. A 
modified sequence x, = k?^« (mod 1) is known 
to be random for any irrational a in the sense 
that the tserial correlation N ! X , x,x,,,— 
1/4 converges to 0 uniformly in l as Noo. 


,N—1} (N>1). 
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C. Statistical Tests 


To check uniform random numbers on (0, 1) 
the following tests are used: (1) Divide (0, 1) 
into subintervals; then the frequency of ran- 
dom numbers falling into these is a multi- 
nomial sample. tGoodness of fit can be tested 
by the ‘chi-square test; independence can be 
tested by observing the frequency of tran- 
sitions of subintervals in which a pair of con- 
secutive numbers falls, as well as by observing 
the overall properties, such as uniformity of 
the frequency of patterns of subintervals in 
which a set of random numbers falls. (2) For a 
set of random numbers, the distance of the 
empirical distribution function from that of the 
theoretical one is tested by the 'Kolmogorov- 
Smirnov test. (3) Observe the rank orders of a 
set of random numbers, and test the random- 
ness of their permutations (test the number of 
runs up and down). 


D. Kolmogorov-Chaitin Complexity and Finite 
Random Sequences 


As Shannon’s entropy is a quantity for mea- 
suring the randomness of random variables, 
the Kolmogorov-Chaitin complexity [4, 5] 
is that of individual objects based on logic 
instead of probability. For constructive ob- 
jects x e X, ye Y and a partial recursive func- 
tion A: Y x {1,2,...} +X, define 


ee n)- x), 


oo (if A(y, n) 2 x for no n). 
The function A is said to be asymptotically 
optimal if for any B there exists a constant 
C such that K ,(x| y) € Kg(x| y) - C for any 
xéX and ye Y. For an asymptotically op- 
timal A, which is known to exist, K ,(x| y) is 
simply denoted by K (x | y) and is called the 
Kolmogorov-Chaitin complexity of x given y. 
P. Martin-Lóf [6] discussed a relation be- 
tween complexity and randomness. Consider 
any statistical test for the randomness on the 
set of (say) finite decimal sequences which is 
effective in the sense that it has a finite algo- 
rithm. Then there exists a constant C indepen- 
dent of L and M such that 


K(£&,,..., £j |L) 2 Llog; 10— M 


implies the acceptance of the decimal sequence 
Zen, by the test at the level 1 —2^7^^€. 
This condition on the complexity is satisfied 
by at least (1 —27*)10* sequences among the 
decimal sequences of length L. 


E. Collective and Infinite Random Sequences 


For finite sequences, the notion of randomness 
is obscure by nature. For infinite sequences, 
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however, clearer definition is possible. Based 
on the notion of collectives by R. von Mises, a 
definition of infinite random sequences has 
been given by A. Church [7]. A selection func- 
tion is a (0, 1}-valued function on the set of 
(say) finite decimal sequences such that ín, 
9(51, 65, ..., 6, ,) = 1} is an infinite set for any 
infinite sequence £,, £,, .... For a selection 
function o and an infinite sequence £,, £,, ..., 
the o-subsequence is defined as £, , Čr,» ..., 
where (n, «n; €...) (n, Plis ..., 6, ,) - 1). 
For a class y of selection functions, an infinite 
decimal sequence is called a -collective if each 
of the numbers 0, 1,...,9 appears in it with a 
limiting relative frequency of 1/10, and the 
same thing holds for any o-subsequence with 
oeh, By definition, a random sequence is a y- 
collective for the class y of recursive selection 
functions. Almost all real numbers are random 
in their decimal expansions. 


F. Normal Numbers 


Let x — [x] 2» È x,r " be the r-adic expansion 
of the fractional part of a real number x. For 
any ordered set B, —(b,, ..., b,) of numbers 

0, 1, ...,r— 1, let N,(x, B,) be the number of 
occurrences of the block B, in the sequence 
Xis Eu If N (x, B)/n9r * as noo for every 
k and every B,, then x is said to be normal to 
base r. Almost all real numbers are normal to 
any r. D. G. Champernowne [8] constructed a 
normal number given by the decimal expan- 
sion 0.1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, .... 
No one has so far been able to prove or dis- 
prove the normality of such irrational num- 
bers as 7, e, ,/2, ,/3,.... W. Schmidt [9] 
proved that the normality to base r implies the 
normality to base p if and only if logr/log p is 
rational. A real number whose decimal expan- 
sion is random in the above sense is normal to 
base 10. For the converse, a necessary and 
sufficient condition for a selection function o 
(for which 9(£,, ..., £j) depends only on L) to 
have the property that the normality implies 
the (9]-collectiveness has been obtained in 


[10]. 
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A. Axioms for the Real Numbers 


The set R of all real numbers has the following 
properties: 

(1) Arithmetical properties: (1) For each pair 
of numbers x, yeR, there exists one and only 
one number weR, called their sum and de- 
noted by x+y, for which x+ y= y+ x (com- 
mutative law) and (x+ y) -z2 x - (y 4-z)' 
(associative law) hold. Furthermore, there 
exists a unique number 0 (zero) such that 
x+0=x for every xeR (existence of tzero 
element). Also, for each x, there exists one and 
only one number —xeR for which x +(—x) 

— 0. DU For each pair of numbers x, yc R, 
there exists one and only one number weR, 
called their product and denoted by xy, for 
which xy = yx (commutative law), (xy)z = 
x(yz) (associative law), and (x + y)z = xz + 

yz (distributive law) hold. Furthermore, 

there exists a unique number 1 (unity) eR 
such that 1x 2 x for every xeR (existence of 
tunity element). Also, for each x #0 (xeR) 
there exists one and only one number x ! eR 
for which xx! — 1. Owing to properties (i) 
and (ii), all four arithmetic operations obey 
the usual laws (with the single exception of 
division by zero); in other words, R is a field. 

(2) Order properties: (i) For each x, y c R, 
one and only one of the following three rela- 
tions holds: x < y, x 2 y, or x> y. With x «y 
meaning x « y or x — y, the relation « obeys 
the transitive law: x x y and y xz imply x <z, 
which makes R ttotally ordered. (ii) Order and 
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arithmetical properties are related by: x € y 
implies x +z<y+z for any zeR, and x y 
and 0x z imply xz < yz; in other words, R is 
an *ordered field. l 

In particular, x eR with x>0 is called a 
positive number, and x eR with x « 0 a negative 
number. We write |x| 2 x if x >0 and |x|= 
— x If x < 0, and call |x| the absolute value 
of x. 

(3) Continuity property: If nonempty subsets 
A and B of R, with a <b for each pair ae A 
and be B, satisfy R= AU B and AC B- Ø 
(empty set), then the pair (A, B) of sets is called 
a cut of R. For each cut (A, B) of R, there exists 
a number xeR (necessarily unique) such that 
for every ae A, a < x, and for every be B, 

b zx (Le, x=sup A =inf B). This property of 
R is called Dedekind's axiom of continuity (— 
294 Numbers). 

The set R of all real numbers is determined 
uniquely up to an isomorphism, with respect 
to arithmetic operations and ordering, by 
properties (1)-(3). R forms an additive Abelian 
group; its subgroup (0, +1, +2,..., +n,...} 
generated by 1 can be identified with the 
group Z of integers. The subset of all positive 
integers (1,2, ..., n, ...) may be identified with 
the set N of all natural numbers. The subset 
im/n|m,ne Z, n0; of R forms the subfield of 
R generated by 1. It can be identified with the 
field Q of all rational numbers. À real number 
that is not rational is called an irrational 
number. 


B. Properties of Real Numbers 


(1) For each pair of positive numbers a and 
b» a, there exists a natural number n with 
a « nb (Archimedes' axiom). 

(2) For each pair of positive real numbers a 
and b with a « b, there exists a rational num- 
ber x such that a « x « b (denseness of rational 
numbers). 

(3) For any subset A in R *bounded from 
above (below), the *least upper bound of A: a 
—sup A ('greatest lower bound of A:b — inf A) 
exists. 

Let {a,} be a tsequence of real numbers. 
Assume that for each arbitrary positive num- 
ber g there exists a number n, such that |a, — b| 
<e for all n» ng. Then we write lim, ,, à, = 
b (or a,b) and call b the limit of {a„}. We 
also say that {a,} is a convergent sequence 
or that a, converges to b. 

(4) If for two sequences {a,}, {b,}, we have 
aj Sasi Ka, S... <b, <S... <b, <b, and 
lim(b, — a,) — 0, then there exists one and 
only one number ceR with lima, =limb, =c 
(principle of nested intervals). 
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(5) Let {a,} be a sequence of real numbers. 
If for any arbitrary positive ¢ there exists a 
natural number n, satisfying |a, — a,| « € for 
all m, n7 ng, then {a,} is called a fundamental 
sequence or Cauchy sequence. Any funda- 
mental sequence of real numbers is convergent 
(completeness of real numbers). 

For a set with properties 1 and 2 of Section 
A, it can be proved that property 3 of Section 
A is equivalent to property 3, or properties 1 
and 4, or properties 1 and 5 of this section. 


C. Intervals 


For two numbers a, beR with a « b, we write 
(a,b) 2 ix|a« x « bj, 
(a,b]={x|a<x<b}, 
[a,b)={x|a<x<b}, 
[a, bJ={xla<x<b}, 


and call them (finite) intervals, of which a and 
b are their left and right endpoints, respectively. 
Specifically, (a, b) is called an open interval and 
[a,b] a closed interval. The symbols oc and 
—oo are introduced as satisfying oc >x, x> 
—00, 0 > — c0 for all xeR. Writing +00 for 
oc, we call +00 and —oo positive infinity and 
negative infinity, respectively. To extend the 
concept of intervals, we define (—oo, b)= 
{x|x<b,xeER}, (—00,b] ={x|x <b, xeR}, 

(a, o)={x]la<x,xeER}, [a, o)={xla<x, 

xe Rj, and (—oo, oo) - R, and call them infinite 
intervals. 

Let {a,} be a sequence of real numbers. If 
for each infinite interval (a, oo) (( —o0,)) there 
exists a number n, such that a, e(a. oo) (a,€ 
(—oo, a)) for all n n, then we write a, -4-oo 
(a, — —0oo) and call oo ( —oo) the limit of a,, 
denoted as before by lim a,. 


D. Topology of R 


With the collection of all its open intervals 

(a, b) as an topen base, R is a ttopological 
space (order topology) that satisfies the tsep- 
aration axioms T,, T,, T4. In R every (finite or 
infinite) interval (including R itself) 1s tconnec- 
ted, and the set Q of rational numbers is dense. 
A necessary and sufficient condition for a 
subset F of R to be tcompact is that F be 
bounded and closed (Weierstrass’s theorem). 
In particular, any finite closed interval is com- 
pact. R is a locally compact space satisfying 
the second fcountability axiom. Further, any 
(finite or infinite) open interval is homeomor- 
phic to R. The topology of R may also be 
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defined by the notion of convergence (— 87 
Convergence). 

Arithmetic operations in R are all con- 
tinuous: If a,—a and b,—b, then a, - b, 5. a +b, 
a, — b,—a-— b, a,b, ab, and a,/b,—ajb (where 
b z:0, b, 40). Hence R is a ttopological field 
(regarding the characterization of R as a topo- 
logical group or a topological field — 422 
Topological Abelian Groups). 

R as a topological Abelian group (with 
respect to addition) is isomorphic to the top- 
ological Abelian group R * of all positive real 
numbers with respect to multiplication. To be 
precise, there exist topological mappings f: R 
>R* with f(x+ y) » f(x)f(y) and g:R* >R 
with g(xy)=g(x)+ g(y). If f(1)—a, g(b) - 1, 
then f, g are uniquely determined and are 
written f(x) —- a*, g(x) 2 log, x. 

Regarding R as a topological Abelian group 
(with respect to addition), any proper closed 
subgroup [ of R is discrete and isomorphic to 
the additive group Z of integers. That is, for 
some £70 we have F = {ne|ne Z}. In parti- 
cular, the quotient group R/Z as a topological 
group is isomorphic to the rotation group of a 
circle (1-dimensional ftorus group). Elements 
of R/Z are called real numbers mod 1. 


E. The Real Line 


Let | be a Euclidean straight line considered to 
lie horizontally, say from left to right. Let po, 
p, be two distinct points on L with p, situated 
to the left of p,. Then there exists one and only 
one bijection q from the set L of all points of I 
to R satisfying (i) o(po) 20, o(pi) — 1; (ii) if p 
lies to the left of q, then o(p) « @(q); and (itt) 
for two line segments pq and p'4' (where p and 
p' are to the left of q and q', respectively), 

pq 2 p'd' (pq and pa are tcongruent) <> o(q) — 
9(p)- o(q') — e(p'). Then (p) is called the 
coordinate of the point p, and (po, pı) the frame 
of the line /. A Euclidean straight line with a 
fixed frame is called a real line (identified with 
R by the mapping ¢) and is usually denoted by 
the same notation R or R!. 
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A. General Remarks 


A function whose tdomain and trange are both 
the set of natural numbers (0, 1,2, ... } is called 
a number-theoretic function. In this article, the 
term natural number is used to mean a non- 
negative integer. Hilbert (1926) and K. Gódel 
[1] considered certain number-theoretic func- 
tions, called recursive functions by them and 
now called primitive recursive functions after S. 
C. Kleene [2] (the definition is given in Section 
B). Gódel introduced an efficient method of 
arithmetizing metamathematics based on 
representing certain finitary procedures in 
metamathematics by primitive recursive func- 
tions. Then the following problem naturally 
arises: How shall we define a finitary method? 
In other words, how shall we characterize a 
number-theoretic function that is effectively 
computable, or provided with an algorithm of 
computation? Gódel defined the notion of 
general recursive function by introducing a 
formal system for the elementary calculation of 
functions, following the suggestion given by J. 
Herbrand. Kleene later improved Gódel's 
definition and developed the theory of gen- 
eral recursive functions [2]. Furthermore, A. 
Church and Kleene defined 4-calculable func- 
tions using the A-notation (Church [4]), and E. 
L. Post and A. M. Turing defined the notion of 
computable functions by introducing the 
concept of Turing machines. These notions, 
introduced independently and almost simulta- 
neously, were found to be equivalent. Hence 
such functions are now simply called recursive 
functions. Here, instead of giving the definition 
of recursive functions in the original style (the 
Herbrand-Gódel-Kleene definition), we give it 
by utilizing the idea of introducing schemata, a 
natural extension of the notion of primitive 
recursive functions. We employ the letters x, y, 
Z, X4, X5, ... for variables ranging over the 
natural numbers. 
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B. Primitive Recursive Functions 


Consider the following five definition 
schemata: 


D e(x)—x (=x+I), 


(ID) ` q(x,,....X,)—4 
(q a given natural number), 
(HD @(x,,..-,x,)=x; (eisen 
(IV). (xi, sus XÐ =Y Deere Xa 
Xm X1. e Sall 
(V) ` eil Xa, x) Y(X- XS) 
PUY Xas Eeler XCV PLY; X2; Sal 
E heh 


where dai )isa constant natural number if 
n — l. A function is called primitive recursive if 
it is definable by a finite series of applications 
of the operations (IV) and (V) (V, y. yi. .... Xm 
are already-introduced functions) starting from 
functions each of which is given by (I), (II), or 
(IIT). Given the functions y, ..., Y we define 
the relativization (with respect to y, ..., V) of 
the definition of primitive recursive functions 
as follows: A function is called primitive recur- 
sive in V, ..., if it is definable by a finite 
series of applications of (IV) and (V) starting 
from V4, ..., V, and from functions each of 
which is given by (I), (ID), or (IIT). 

We say that a function o(x,, ..., x,) is 
the representing function of a tpredicate 
P(x,,...,X,) if o takes only 0 and 1! as values 
and satisfies 


PCs 2X4) 50: 


Xp) > 9(x,, wale 


Then we call P a primitive recursive predicate if 
its representing function @ is primitive recur- 
sive. The following functions and predicates 
are examples of primitive recursive ones: a+ b, 
a:b, a’, a!, min(a, b), max(a, b), |a — b|, a=b, 

a « b, a|b (a divides b), Pr(a) (a is a prime 
number), p, (the (i+ 1)st prime number, p, = 2, 
Pp, =3,...), (a); (the exponent of p; of the unique 
factorization of a into prime numbers if a #0; 
otherwise, 0). 

Whenever we are given a concept or a 
theorem, we always transform it by replacing 
the predicates contained in it (if any) by corre- 
sponding representing functions. Then an 
operation Q is called primitive recursive if 
the function or the predicate Q(vi,,.... V, 
Q,,...,Q,,) that results from the application 
of 2 to functions y, ..., V; and predicates 
Qi,...,Qu (L mz0,I-- m» 0) is primitive 
recursive In V4, ..., Qm. Put q(x,,..., Xp Z) 
Än UI, x, y). Then o is primitive 
recursive in y, and the finite sum È, <, is a 
primitive recursive operation in this sense. 


1322 


Similarly, the following operations are primi- 
tive recursive: the finite product IT, <+, the 
logical connectives 1, v, A,  (— 411 
Symbolic Logic), definitions by cases, the 
bounded quantifiers Jy, .,, YY, <z, and the 
bounded u-operator py, ., defined as follows: 
Hy, « R(x, y) is the least y such that y <z and 
R(x, y) holds, if there exists such a number y; 
otherwise, it equals z. The following operation 
is also primitive recursive: 


Q(y, X5, E n] 

=7(Y, Cy: X3, HE Xn) X3, dives); 
where 

ÖY; Xz x) [] p? 9799. 


i<y 


A function q is said to be primitive recursive 
uniformly in y, ..., V; when q is definable by 
applying a primitive recursive operation to 
Vio e V 

Almost all results mentioned in this section 
were given by Gödel [1]. There are further 
investigations on primitive recursive functions 
by R. Péter (1934), R. Robinson (1947), and 
others. Note that a function definable by a 
*double recursion is not necessarily primitive 
recursive. Péter (1935, 1936) investigated in 
detail functions that are definable, in general, 
by k-fold recursions for every positive interger 
k [8]. 


C. General Recursive Functions 


The following u-operator is used to define 
general recursive functions by extending primi- 
tive recursive functions. For a predicate R( y) 
on the natural numbers, uyR( y) is the least y 
such that R( y), if 3yR( y); otherwise, uyR( y) is 
undefined. Generally, uy(v(x,, ..., x,, y) - 0) 

is not necessarily defined for each n-tuple 

(x4, ..., x,) of natural numbers. Now, a func- 
tion is called a general recursive function (or 
simply recursive function) if it is definable by a 
series of applications of schemata including a 
new schema 


(VI) qo(x,,... 


for the definition of o from any function V 
that satisfies 


Xy) 7 HUE, -es XQ», y)20) 


Vx, s Wx (Eas -s Xn Y)=0), 


in addition to those used to define the primi- 
tive recursive functions. Thus, by definition, 

a primitive recursive function is general recur- 
sive. A general recursive predicate is a predicate 
such that its representing function. is general 
recursive. The facts, including the ones con- 
cerning relativization, that are valid for pri- 


* 
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mitive recursive functions are also valid for 
general recursive functions. 


Kleene's Normal Form Theorem. For each n, 
we can construct a primitive recursive predi- 
cate T,(z, x,, ..., Xn y) and a primitive recursive 
function U(y) such that given any general 
recursive function q(x,, ..., Xa), a natural 
number e can be found such that 


Yxi ven VX ay Tale, X45 Xp b (1) 
ten, Xn) = U(uy T, (2, xi, ..., Xn y). (2) 


Any natural number e for which (1) and (2) 
hold is said to define ¢ recursively or to be a 
Gödel number of a recursive function o. Let 
V, --., V; (abbreviated V) be any given func- 
tions. We can relativize Kleene's normal form 
theorem with respect to them as follows: 

For each n, we can construct a predicate 

T, (z, X1, ..., X,, y) that is primitive recursive in 
Y such that given any function o that is gen- 
eral recursive in V, a natural number e can be 
found such that 


Voss Vx sy Ts (e Xs e Xn Y) (3) 
voten, Sales U luy Tr (e, x, ..., Xn HI (4) 


where U( y) is the primitive recursive function 
mentioned in Kleene's normal form theorem. . 
A natural number e for which (3) and (4) hold 
is said to define o recursively in ¥ or to be a 
Gödel number of o from Y. In particular, a 
Gödel number e of o from Y can be found 
independently of Y (except for | and the re- 
spective numbers of variables of V, ..., Vj) 
when o is general recursive uniformly in Y. 

Now let S be a fformal system containing 
ordinary number theory. À number-theoretic 
predicate P(x,, ..., x,) is said to be decidable 
within S if there is a formula P(a,,...,a,) (with 
no free variables other than the distinct vari- 
ables a, ...,a,) of S such that for each n-tuple 
(xi, ..., Xn) of natural numbers (the symbol |- 
means provable in S), 


() FPX., 
and 
(i) P(x,,...,x,) &l- P(x,, .... X), 


where x,, ..., X, designate the numerals corre- 
sponding to x4, ..., x, in S. If S is a consistent 
system such that primitive recursive predicates 
are decidable within S and the predicates Pf, 
(for any formula A, Pf,(x,, ..., Xn, y) means 
that y is the Gódel number of a proof of 
A(x,, ..., X,)) are primitive recursive, then a 
necessary and sufficient condition for P to be 
decidable within S is that P is a general recur- 
sive predicate (A. Mostowski, 1947). 

Church (1936) proposed the following state- 


x) or | “1P(x,,...,X,) 
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ment: Every effectively calculable function is a 
general recursive function. The converse of this 
is evidently true by the definition of recursive- 
ness. So Church’s thesis and its converse pro- 
vide the exact definition of the notion of effec- 
tively computable functions. Though this 
notion is somewhat vague and intuitive, the 
definition seems to be satisfactory, as men- 
tioned at the beginning of this article. There- 
fore, any function with a computation proce- 
dure or algorithm can be assumed to be gen- 
eral recursive. Utilizing this, various decision 
problems have been negatively solved (— 97 
Decision Problem). Furthermore, traditional 
descriptive set theory can be reinvestigated 
from this point of view, and the concept of 
effectiveness used in tsemi-intuitionism is 
clarified using general recursive functions 

(^ 22 Analytic Sets). 


D. Recursive Enumerability 


A set {¢(0), o(1), p(2), ... } enumerated by a 
general recursive function o (allowing repe- 
titions) is called a recursively enumerable set. 
The empty set is also considered recursively 
enumerable. It is known that in this definition 
“general recursive” can be replaced by “primi- 
tive recursive” (J. B. Rosser, 1936). A set E of 
natural numbers is recursively enumerable if 
and only if there is a primitive recursive pre- 
dicate R(x, y) such that xe E<>4dyR(x, y) 
(Kleene [2]). 

Generally, a predicate E(x,,...,X,,) is called 
a recursively enumerable predicate if there is a 
general recursive predicate R(x,, ..., Xm 
yp +-+, Yn) Such that E(x,,...,x,,)<> dy, ... 

Jy, R(x,, .... Xm Mise Yn). (Here “general 
recursive" can be replaced by "primitive 
recursive.") 

We call a set E a recursive set if the predi- 
cate xe E is general recursive. The set C= 
{x|dyT, (x, x, y)} is an example of a set that 
is recursively enumerable but not recursive, 
and it has the following remarkable property: 
For every recursively enumerable set E, there 
is a primitive recursive function such that 
xe E < q(x)e C. In this sense, the set C is said 
to be complete for the class of recursively 
enumerable sets. Post's problem, which asked 
whether the sets that are recursively enumer- 
able but not recursive have the same tdegree of 
(recursive) unsolvability as that of C, was 
negatively solved simultaneously by R. M. 
Friedberg (1957) and A. A. Muchnik (1956- 
1958). A recursively enumerable set E is gen- 
eral recursive if and only if there ts a gen- 
eral recursive predicate R(x, y) such that 
x£E« 3yR(x, y) (Kleene [5], Post [6]). 
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E. Partial Recursive Functions 


A function ~(x,,...,X,) is called a partial func- 
tion if it is not necessarily defined for all n- 
tuples (x,, ..., x,) of natural numbers. For two 
partial functions w(x,,...,x,) and 7(x,,...,X,), 
V(x,. ... X m ln, Xp) means that if either 
V (x, ...,x,) OF z(X,, ..., Xp) 1s defined for 

X4, ..., X4, SO ts the other, and the values are 
the same. For any given natural number e, 
P(X 15026 x.) m U(uyT (e, xi, ... x4, y) (or 

~ U(uyT, (e, X15 ..., Xp, y) is a partial func- 
tion, in general. We say that such a function is 
partial recursive (partial recursive (uniformly) in 
Y) and that a natural number e defines o 
recursively ((uniformly) in V) or is a Gódel 
number of a partial recursive function q (from 
Y). When a natural number e is a Gödel num- 
ber of o(x,,..., x,) (a Gödel number of o 

trom V), o(x,,..., X„) is sometimes written as 
lel Gau x)(el Di e x,)). Tf a predi- 

cate R(x,,...,X,, y) is general recursive, then 
uy R(x,, ..., Xp, y) is partial recursive. There- 
fore, {z} (x4, ..., x,) is a partial recursive func- 
tion of the variables of z and of x,,..., Xp- 

On the partial recursive functions, the fol- 
lowing two theorems, given by Kleene [3], are 
most useful. (1) For natural numbers m, n, a 
primitive recursive function ATI, Vis .... Ym) 
can be found such that, for any natural num- 
ber e, de y nose E oe (e, Vee 
Vm) } (Xi... X4). (2) For any partial recursive 
function y(z, x;, ..., x, , a natural number e 
can be found such that (ej (x4. ..., x,)— 

WE, x4... X4). 

The notion of partial recursive functions 
appeared first in the theory of ‘constructive 
ordinal numbers of Church and Kleene (1963). 
Partial recursive functions can be defined in 
the Herbrand-Gódel-Kleene style as a natural 
extension of general recursive functions, and 
they are also definable by a finite series of 
applications of the schemata (IV), (V), and (VI) 
(in each schema, = used for the definition of o 
should be replaced by ~) starting from func- 
tions given by (I), (II), and (HI). 


F. Extension of Recursive Functions to 
Number-Theoretic Functionals 


Letz... Ge be number-theoretic functions of 
one variable. If o(x,, ..., x,) is (partial) recur- 
sive uniformly in ,,...,%,, then a Gödel 
number e of q is found independently of 

Aise Am ANd (Xj, a) x,) is expressed as 

U (yT 7*"(e, X,, ..., X4, y)). Now suppose 
that 2,,...,,, range over the set N of all 
number-theoretic functions of one variable, 


1324 


and put 


QE Am hag vey Xn) 


= U(uy "Zeile, zx, Se y). 


We call such a functional oí(o,,...,o,,, 
Au, X,) (partial) recursive, and with it we 
can develop a theory of recursive functions of 
variables of two types. 

Extending the notion of recursive func- 
tionals, Kleene introduced and investigated 
the recursive functionals of variables of arbit- 
rary (finite) types [10, 11]. The natural num- 
bers are the objects of type 0, and the one- 
place functions from type-j objects to natural 
numbers are objects of type j+ 1. Denote 
variables ranging over the type-j objects by ai. 
B^, y), ..., or ad, od, o, ..., etc. Consider a 
functional (simply called a function) of a given 
finite number of such variables of types taking 
natural numbers as values. A function ¢ is 
called a primitive recursive function if it is 
definable by a finite series of applications of 
the following schemata (I)-(VIID, where a isa 
variable of type 0, b is any list (possibly empty) 
of variables that are mutually distinct and 
different from the other variables of the 
schema, and y, y are given functicns of the 
indicated variables. (I) (a, b)=a’; (ID o (b) — 
q (q is a natural number); (HI) o(a, b) =a; 

(IV) o(b) 2 V(x(b), b); (V) o, b) — v(b), pa’, b) 
= y(a, (a, b), by; (VI) o(a) — (a) (a; is a 

list of variables from which a is obtained by 
changing the order of two variables of the 
same type); (VII) ọ(«', a, b) - «! (a); (VIII) 

(a, b) « a) (4a ^y (a, al ?, b)) (417? desig- 
nates that y is a function of the variables o? ?). 
We assign to each function g(a) a natural 
number called an index (which plays the same 

role as a Gódel number) in such a way that 

it reflects the manner of application of the 
schemata used to introduce g(a). Now, we 
write g(a) with an index e as (ej (a). We call 

a function (a) partial recursive if it is defin- 
able by a finite series of applications of the 
schemata (I)-(VIIT) (~ is employed instead of 
— in (IV)-(VI) and (VIII)), and (IX) o(a, b, c) = 
(a) (b. c) (c is a finite list of variables of arbitrary 
types). In particular, g(a) is called a general re- 
cursive function if it is defined for all values of the 
argument a. These notions can be relativized 
also with respect to any given functions. Note 
that for the case of types <1, priraitive recurs- 
ive functions, partial recursive functions, and 
also general recursive functions in the present 
sense are equivalent to the corresponding 
notions (introduced via relativization with 
uniformity) in the ordinary sense already de- 
scribed. The following theorem is important: 
Let r be the maximum type of a. Then there 
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are two primitive recursive predicates M, N 
such that 


te (a) ~ 
we» V2" an"? M (e, a w, 6719"), 

r>2, 
we 3E" Wn" 2N(e, a, w, E" !,n' 2), 

r2. 


Every function definable using (IX") (a) — 
ux(V (a, x) 20) instead of (IX) is partial recur- 
sive. However, not all the partial recursive 
functions of variables of types >2 can be 
obtained by applying schemata (I)-(VIIT) and 
(IX). 

Further developments have been pursued by 
J. E. Fenstad, J. Moldestad, and others in 
abstract computation theory [20—23]. 


G. Recursive Functions of Ordinal Numbers 


G. Takeuti introduced a notion of primitive 
recursiveness for functions from a segment of 
the ordinal numbers to ordinal numbers. 
Using this, he constructed a model of set 
theory in ordinal number theory. In connec- 
tion with recursive functions of ordinal num- 
bers, there are also investigations by A. Lévy, 
M. Machover, Takeuti and A. Kino, T. Tugué, 
S. Kripke, and others. 

Early treatments of recursive functions of 
ordinal numbers dealt only with functions on 
infinite cardinals. For example, Takeuti con- 
sidered functions with a fixed infinite cardinal 
« as a domain and a range, and defined x- 
recursive functions using schemata similar to 
the abovementioned (I)- (VI). Subsequently 
Kripke observed that the assumption that x is 
a cardinal is not necessary, and introduced 
the notion of admissible ordinals. An admis- 
sible ordinal x has the closure properties 
required for the construction of the calculus, 
and whenever o, p <x and f = f(x) is com- 
putable, then f — f(x) is computable in fewer 
than x stages. Given an admissible ordinal x, 
K-recursiveness can be defined, as in the case of 
general recursiveness, by various equivalent 
methods, e.g., schemata, the equation calculus, 
and definability in both quantifier forms. Most 
of the elementary properties of general recur- 
sive functions (e.g., the normal form theo- 
rem, parametrization theorem, enumeration 
theorem, etc.) are also valid for x-recursive 
functions. The notions of degrees of unsolva- 
bility and recursive enumerability can also be 
generalized, yielding the notions of x-degrees 
and x-recursive enumerability, respectively. 
The fine structures of these properties are 
currently the objects of intensive research. 
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Every infinite cardinal is admissible. The 
least admissible ordinal is w, and the next is 
the ordinal o, of Church and Kleene, i.e., the 
first nonconstructive ordinal. In fact, for every 
nz 1, the first ordinal not expressible as the 
order type of a A} predicate is admissible (— 
Section H). For each infinite cardinal x there 
are x^ admissible ordinals of power x. Platek 
investigated recursion theory in a still wider 
setting. He dealt with functions defined, not on 
a segment of ordinal numbers, but on a set, 
and introduced the notion of admissible sets, 
i.e., sets on which a well-behaved recursion 
theory can be developed. An admissible set is a 
transitive e-model of a certain weak set theory, 
and an ordinal x is admissible if and only if 
there exists an admissible set A such that 
A0, —x, where O, is the class of all ordinal 
numbers. 

Recent developments have shown that 
generalized recursion theory, set theory, and 
infinitary logic are closely related. In addition 
to the abovementioned, there are some investi- 
gations by Y. N. Moschovakis and others 
[14-27]. 


H. Hierarchies 


Utilizing the theory of trecursive functions, S. 
C. Kleene succeeded in establishing a theory 
of hierarchies that essentially contains class- 
ical descriptive set theory as an extreme case 
[5, 10, 31, 32]. Although research following a 
similar line had also been done by M. Davis, 
A. Mostowskii, and others, it was Kleene who 
succeeded in bringing the theory to its present 
form. 

Sets or functions are described by tpredi- 
cates, which we classify as follows. Let a, b, ..., 
0,,05, X, D, be variables ranging over 
the set N of natural numbers, and o, f, ..., 
04,005, ..., 6,1]... be variables ranging over 
the set N“ of all tnumber-theoretic functions 
with one argument. Let y, ..., Yil > 0) be 
number-theoretic functions. A predicate 
P(a,,...,0,,0,,...,a,) (mpnz0,m--n» 0) 
with variables of two ftypes is called analytic 
in V4, ..., V4 (120) if it is expressible syntacti- 
cally by applying a finite number of logical 
symbols: >, v, ^, 1, dx, Vx, 3C, VC, to tgen- 
eral recursive predicates in y,,..., v. In partic- 
ular, when P is expressible without function 
quantifiers 3£, V£, it is called arithmetical in 
de, V 020) When 1-0, they are called 
simply analytic and arithmetical, respectively. 

For brevity, consider the case | = 0, 
and denote by a a finite list of variables 
(04, ..., Ams A15 ..., à). Every arithmetical pre- 
dicate P(a) is expressible in a form contained 
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in the following table (a): 


(a) R(a); 
3xR(a,x) ` VxdyR(a, x, y), ..., 
VxR(a,x) | 3xVyR(a, x, y), ..., 


where each R is tgeneral recursive. In order to 
obtain such an expression we first transform 
the given predicate into its *prenex normal 
form and then contract successive quantifiers 
of the same kind by the formula 


d. see) 


<> 3xA(Q)o, Dh CD) 


and its *dual form." Each form in (a) (or the 
class of all predicates with that form) is de- 
noted by X? or HÜ. where the suffix k refers 
to the number of quantifiers prefixed, and 2 
or M shows that the outermost quantifier is 
existential or universal, respectively. A predi- 
cate that is expressible in both forms 2? and 
II? (or the class of such predicates) is denoted 
by AN. A predicate belongs to A? if and only if 
it is general recursive (an analog of *Suslin’s 
theorem). 

For k> 1, there exists in X$ (or JIR) an 
enumerating predicate that specifies every 
predicate in X? (I72). For example, for H9 
and m=n=1, there is a *primitive recursive 
predicate Sta, z, a, x, y) such that, given a 
general recursive predicate Ria, a, x, y), we 
have a natural number e such that 


Vx dyR(a, a, x, y) ee Vx 3yS(o, e, a, x, y) 


(enumeration theorem). In this theorem, we 
can take T7(z,a, x, y) (-» Section F) as 
Sta, z, a, x, y). For each k >0, there exists a 
X9, UD, predicate that is not expressible in 
its dual form /7?,, (22,1) (hence, of course, in 
neither X? not /7°) (hierarchy theorem). There- 
fore, table (a) gives the classification of the 
arithmetical predicates in a hierarchy. This 
hierarchy is called the arithmetical hierarchy. 
For each k > 1, there exists a complete predi- 
cate with respect to X2 (172), that is, a XP (112) 
predicate with only one variable such that 
any 2? (I1) predicate is expressible by sub- 
stituting a suitable general (or more strictly, 
primitive) recursive function for its variable 
(theorem on complete form). When m — 0, all 
the general recursive predicates in X? exhaust 
A?,, (Post's theorem). 

Concerning the function quantifiers, we 
have 


Jaj -.. 305 40:55 Xm) 


e FWA(At(a(t))os «5 Aa) B 
J3xA(x) Ja. A (a (0)), (3) 
Vx3a.A(x, x) <> 3a Vx A(x, Ata(2* - 3)), (4) 
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and their dual forms. For any general recur- 
sive predicate R, there is a primitive recursive 
predicate S such that 


Ja R(a, a) & Ja 3xS(a(x), a) &» JyS( y, a) (5) 


and its dual hold. Using these facts, we can 
classify the forms of all analytic predicates by 
the table (b): 


(b) A(a); 


VadxR(a,a,x) |daVp3xR(a,a, fl. x), ..., 


Zo Veto, az | Va3BVxR(a,o, D, x), ..., 


where A is arithmetical and each R is general 
recursive. Similarly, denote by Z}, 77] each 
form of predicate in (b) (or the class of all 
predicates reducible to that form), where is 
the number of function quantifiers prefixed; 
also, denote by Aj the (class of) predicates 
expressible in both forms Z} and JT}. For Xj, 
IT} (k z 1), we have the enumeration theorem, 
the hierarchy theorem, and the theorem on 
complete form. The hierarchy given by table 
(b) is called the analytic hierarchy. 

For 1|» 0 (namely, when predicates are 
arithmetical or analytic in y, ..., W), we can 
*uniformly trelativize the above results with 
respect to y,, ..., V. Now let (Xp "i", 
Ij 791), (r=0, 1) be the corresponding 
hierarchy relative to VA, ..., Yı. Given a set 
C (c N") of functions with one argument, 
we can consider hierarchies of predicates 
which are arithmetical or analytic in a finite 
number of functions in C. Such a hierarchy is 
called a C-arithmetical or C-analytic hierarchy 
and denoted by (X?[C], WP[C]}, or (22 [C]. 
Hj [C], respectively. That is, when we re- 
gard X;[C] as a class of predicates (or sets) 
P, it is the family {P] PeXp ^r, E, Zug 
C,!=0,1,2,...}. These notations have been 
given by J. W. Addison [28, 29]. The N*- 
arithmetical hierarchy and the N'-analytic 
hierarchy for sets correspond respectively to 
the finite Borel hierarchy and the orojective 
hierarchy in the tspace of irrational numbers. 
Addison called the theory of those hierarchies 
classical descriptive set theory, and in contrast 
to this, the theory of arithmetical and analytic 
hierarchies for sets (C = @) effective descrip- 
tive set theory [28]. 

We now restrict our consideration to pre- 
dicates for natural numbers (Le., to the case 
m — 0). Define the predicates L, by Lo(a) 

à - à, L, 4 (a) e 3x Ti(a, a, x). For each k20, 
L,+ı(a) is a XQ,, predicate which is of the 
highest *degree of recursive unsolvability 
among the 2?,, predicates, and its degree is 
properly higher than that of L,(a). Thus LA 
k —0, 1, 2, ..., determine the arithmetical 
hierarchy of degrees of recursive unsolvability. 
Kleene has extended the series of L, by using 
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the system S, (— 81 Constructive Ordinal 
Numbers) of notations for the constructive or- 
dinal numbers as follows [6]: H,(a)<>a=a; 
for ve, H5,(a)«» 3x Zeta, a, x); for 3: 5"€O, 
H3.5y(a)<> H, (Qo) where y, = {y} (no). This 
H, is defined for each y € O, and it is of a prop- 
erly higher degree than that of H, when 

z «oy. If |y]=|z| (|y| is the ordinal number 
represented by y), then H, and H, are of the 
same degree (C. Spector [34]). Thus a hierar- 
chy of degrees is uniquely determined by con- 
structive ordinal numbers. This hierarchy is 
called the hyperarithmetical hierarchy of de- 
grees of recursive unsolvability. A function or 
predicate is said to be hyperarithmetical if it is 
recursive in H, for some yeO. These concepts 
and the results mentioned below can be rela- ` 
tivized to any given functions or predicates. 

A necessary (Kleene [31]) and sufficient 
(Kleene [32]) condition for a predicate to be 
hyperarithmetical is that it be expressible in 
both one-function quantifier forms A} (an 
effective version of Suslin's theorem). Denote 
by Hyp the set (C N") of all hyperarithmetical 
functions a. For an arithmetical predicate 
A (a, a), 3a, cup A(%, a) is always a IT} predicate 
(Kleene [33]). Conversely, for any 17] predi- 
cate P, there is a general recursive predicate R 
such that P(a) & 3a. uy, VxR(a, a, x) (Spector 
[35]). As to tuniformization, for a J} predi- 
cate Pio b), we have Vx 3yP(x, y) > Joscryp 
VxP(x, a(x)) (G. Kreisel, 1962). Let E be an ob- 
ject of type 2 defined by: E(x) 2 0 if 3x(a(x) — 0), 
otherwise E(x) — 1. A function q(a,, ..., a,) is 
hyperarithmetical if and only if it is general re- 
cursive in E (Kleene [10]). A predicate that is 
hyperarithmetical relative to J7} predicates 
(k 2:0) is of A, (Kleene [32]), but the con- 
verse does not hold in general (Addison and 
Kleene, 1957). 

Kleene extended his theory of hierarchy 
to the case of predicates of variables of any 
type by utilizing the theory of general recur- 
sive functions with variables of finite types 
0, 1,2, ... [10]. Let a' be a list of variables of 
types «t. We say a predicate P(a‘) is of order r 
in completely defined functions y,, ..., y 
(Iz: 0) (for brevity, denote them by V) if P is 
syntactically expressible in terms of variables 
of finite types, predicates that are general 
recursive in V, and symbols of the *predicate 
calculus with quantification consisting only of 
variables of types <r. The predicates of order 
0 in V are exactly the general recursive ones in 
Y. When tè 1, and VV are functions of vari- 
ables of type 0, a predicate P(a') is of order 1 
(of order 2) in V if and only if P is arithmetical 
(analytic). 

We have theorems similar to (2)-(4) and 
the following theorem and its dual for r 2 2: 
For any given general recursive predicate 
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Pia, 0", &'?), there is a primitive recursive 
predicate Bio, ar". £^?) such that 


Jo' VE"? P(a', ot, eto) 
«dg "ET ^R(ag 5,677). (6) 


Using these equivalences, each predicate P(a‘) 
of order r+ 1 (rz 0) is expressible in one of 
the following forms: 


(c) B(a); 
Var dé" Ra, a”, £' 1), 

Ja” VB"3E"R(a,a", BYE), ..., 
da’ Vé' ! R(a, at, E74), 

Var IB" VE" R(a, a", BYE), ..., 


where B is of order r and each R is general 
recursive. When t=r + 1, table (c) gives the 
classification of the predicates of order r + 1 
into the hierarchy. In fact, for the predicates 
P(a'*!) in each form, we have the enumera- 
tion theorem, the hierarchy theorem, and the 
theorem on complete form (Kleene [10]). 

D. A. Clarke [30] has published a detailed 
review of the general theory of hierarchies. 
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357 (VI.6) 
Regular Polyhedra 


A. Regular Polygons 


A *polygon in a Euclidean plane bounding a 
tconvex cell whose sides and interior angles 
are all respectively congruent is called a regu- 
lar polygon. When the number of vertices 
(which equals the number of sides) is n, it is 
called a regular n-gon. There exist a circle 
(circumscribed circle) passing through all the 
vertices of a regular n-gon and a concentric 
circle (inscribed circle) tangent to all the sides. 
We call the center of these circles the center of 
the regular n-gon. The n vertices of a regular 
n-gon are obtained by dividing a circle into 

n equal parts. (When a polygon in a Eucli- 
dean plane bounds a tconvex cell, this 2-cell is 
sometimes called a convex polygon. Thus 
regular polygon sometimes means the convex 
cell bounded by a regular polygon as de- 
scribed above.) A necessary and sufficient 
condition for a regular n-gon to be geometri- 
cally constructible is that n be decompo- 
sable into the product of prime numbers n= 
2"p, ... py (m2 0), where the p; (i2 1,2, ...) 

are different tFermat numbers (— 179 Geo- 
metric Construction). 


B. Regular Polyhedra 


Consider a regular polygon on a plane, and 
take a point on the line perpendicular to the 
plane at the center of the polygon. The set of 
points on all half-lines joining this point and 
points on the polygon (considered as a convex 
cell) is called a regular polyhedral angle having 
this point as vertex (Fig. 1). 


Fig. I 


When a ‘convex polyhedron $ in E? satis- 
fies the following two conditions, we call it a 
regular polyhedron: (1) Each face of %, which is 
a 2-dimensional cell, is a regular polygon, and 
all faces of ware congruent to each other. (2) 
Its vertices are all surrounded alike. That is, by 
the projection of % from each vertex of %, we 
obtain a regular polyhedral angle; these regu- 
lar polyhedral angles are all congruent to each 
other. From (2) we see that the number of 
edges emanating from each vertex of % is 
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independent of the vertex. It has been known 
since Plato's time that there are only five kinds 
of regular polyhedra: tetrahedrons (Fig. 2), 
octahedrons (Fig. 3), icosahedrons (Fig. 4), cubes 
or hexahedrons (Fig. 5), and dodecahedrons 
(Fig. 6) (see also see Table 1). 


Fig. 2 
Regular tetrahedron. 


Fig. 3 
Regular octahedron. 


Fig. 4 Fig. 5 
Regular icosakedron. Regular hexahedron 
(or cube). 


Sus 
Ba 


Regular dodecahedron. 


From a given regular polyhedron, we can 
obtain another one by taking as vertices the 
centers of all the faces of the given polyhedron 
(Fig. 7). We say that the given regular poly- 
hedron and the one obtained in this way are 
dual to each other. The octahedron and hexa- 
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hedron are dual to each other, as are the 
icosahedron and dodecahedron. The tetra- 
hedron is dual to itself. For a regular poly- 
hedron %, there exist concentric circumscribed 
and inscribed spheres whose center is the 
center of symmetry of % and is called the 
center of %. Drawing tangent planes to the 
circumscribed sphere at each vertex, we can 
obtain a regular polyhedron dual to the given 
one (Fig. 8). 





Fig. 7 


Fig. 8 


In a regular polyhedron, let a be the length 
of an edge, 0 the magnitude of the dihedral 
angle at each edge, and R and r the radii of 
circumscribed and inscribed spheres, respec- 
tively. Then the following relations hold (we 
assume that each face is a regular p-gon and q 
faces meet at each vertex): 


. 0 3 Rn 
sin-=cos—/ sin-, 
2 gi p 
R —Csin- | sin=co d (1) 
= <sin—/ sin—cos-, 
2 q p 2 


a m 0 R n mn 
r—--cot-tan-, -—=tan—tan— 

2 p 2 r p 
(see Table 2). Corresponding to these poly- 
hedra, we have finite subgroups of OO) called 
regular polyhedral groups (— 151 Finite 


Groups). 


C. Higher-Dimensional Cases 


It is possible to generalize these considerations 
to higher dimensions to define inductively 


Table 1. Regular Polyhedra in 3-Dimensional Euclidean Space E? 


Number of 
Figure Face Vertices 
Regular Equilateral 
tetrahedron triangle 4 
Regular Equilateral 
octahedron triangle 6 
Regular Equilateral 
icosahedron triangle 12 
Regular : 
hexahedron Square 8 
Regular Regular 
dodecahedron pentagon 20 


Number of 

Number of Number of Faces around 
Edges Faces a Vertex 

6 4 3 
12 l 8 4 
30 20 5 
12 6 3 
30 12 3 
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Table 2. Numerical Values for Eqs. (1) 


Number of 
Faces sinĝ 
4 2V2/3 
6 l 
8 2V2 /3 
12 2/ V5 
20 2/3 


0 


70°31'43.6” 
90° 
109°28'16.4” 


116?33'54.2" 


138?11'22.8" 





Table 3. Regular Polyhedra in 4-Dimensional Euclidean Space E" 


Figure 


Regular 5-hedron 
Regular 8-hedron 
Regular 16-hedron 
Regular 24-hedron 
Regular 120-hedron 
Regular 600-hedron 


3-Dimensional Regular Polyhedra 


Kind 
Tetrahedron 
Cube 
Tetrahedron 
Octahedron 
Dodecahedron 
Tetrahedron 


a: dual to itself; b: dual to each other 


Table 4. Regular Polyhedra in n-Dimensional Euclidean Space (n > 5) 


Figure 

Regular (n + 1)-hedron 
Regular 2n-hedron 
Regular 2"-hedron 


Regular Polyhedron in R^ ^! 


Kind 


Regular n-hedron 


Regular (2n — 2)-hedron 


Regular n-hedron 


a: dual to itself; b: dual to each other 


regular polyhedra in E", n z 4. When n=4 we 
have 6 kinds of regular polyhedra (Table 3). 
For nz 5 we have only 3 kinds (Table 4) (— 70 


Complexes). 
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R/a r/a 
V6 /4 V 6/12 
v3 /2 1/2 
1/V2 1/ V6 
V3 (V5 +1) 254 11V5 
4 2V10 
(5 + V5 ) 34+ V5 
2V2 4V3 
Number of 
Number Vertices Duality 
5 5 a 
8 16 l b 
16 8 
24 24 a 
120 600 | b 
600 120 
Number of 
Number Vertices Duality 
ntl n+l a 
2n 2 l b 
2 2n 
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A. General Remarks 


In its wider sense the term relation means tn- 
ary relation (n= 1,2,3, ...) (2 411 Symbolic 
Logic G), but in this article we restrict our- 
selves to its most ordinary meaning, Le, to the 
case n 2 2. Let X, Y be two sets and x, y be 
two variables taking their values in X, Y, 
respectively. A proposition R(x, y) containing 
X, y Is called a relation or a binary relation if it 
can be determined whether R(a, b) is true or 
false for each pair (a, b) in the tCartesian prod- 
uct X x Y. For example, if both X and Y are 
the set of rational integers, then the following 
propositions are relations: x € y, x — y is even, 
x divides y. A relation R(x, y) is sometimes 
written as x Ry. 

For a given relation R, we define its inverse 
relation R ! by yR ! x«xRy. Then R is the 


1331 


inverse relation of R ^!. In the example above, 
the inverse relation of x < y is yz x, and the 
inverse relation of x is a divisor of y is y is a 
multiple of x. A relation R is called reflexive if 
xRx holds. R is called symmetric if x Ry +> yRx 
(namely, if R and R~! are identical). R is called 
transitive if xRy and yRz imply xRz. R is 
called antisymmetric if xRy and yRx imply 
X — y. A reflexive, symmetric, and transi- 
tive relation is called an equivalence relation 
(— 135 Equivalence Relations). A reflexive and 
transitive relation is called a tpreordering. A 
reflexive, transitive, and antisymmetric relation 
is called an tordering (— 311 Ordering). 
Suppose that we are given a relation xRy 
(xe X, ye Y). Then the set G — ((x, y)|xRy}, 
which consists of elements (x, y) of the Car- 
tesian product X x Y satisfying x Ry, is called 
the graph of the relation R. Conversely, for any 
subset G of X x Y, there exists a unique rela- 
tion R with the graph G given by xRy« 
(x, y)eG. 


B. Correspondences 


For a subset G of the Cartesian product X x 
Y, the triple I =(G, X, Y) is called a corre- 
spondence from X to Y. The set X is called the 
initial set of the correspondence T, and Y the 
final set of T. A relation xRy (xe X,ye Y) 
determines a correspondence I 2 (G, X, Y) by 
its graph G, and conversely, a correspondence 
I determines a relation R. Given a corre- 
spondence I —(G, X, Y), the sets A=pry G 
and B — pr, G, where pr, X x Y X and 
Div X x Y Y are the fcanonical projections, 
are called the domain and range of the corre- 
spondence T, respectively. For xe X, the set 
(ye Yl(x, y)e G] is denoted by G(x) or T(x), 
and we say that any element y of G(x) corre- 
sponds to x by I. 

For a subset G of X x Y, we define a subset 
G`! of Y x X by (x, y)eG«(y,x)eG ^ !. Given 
a correspondence I = (G, X, Y), the corre- 
spondence (G '!, Y, X) is denoted by F~! and is 
called the inverse correspondence of T. If G is 
the graph of a relation R, then G~ is the 
graph of the inverse relation R+. The domain 
of the correspondence I is the range of TT, 
and vice versa. We have(T !) ! «T. 

Suppose that we are given correspondences 
T, =(G,,X, Y) and I, =(G,, Y, Z). We define 
a subset G of X x Z by: (x, z)e G + there exists 
ye Y satisfying (x, y)e G, and (y, z)e G;. Then 
the correspondence T =(G, X, Z) is denoted by 
I; oT, and is called the composite of F, and 
I5. We have the associative law (I5015)oT, = 
D30(D; 0T3) and the law (I, oI) ! » T1? 0 
T5*. 

Let I' be a correspondence from X to Y, and 
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assume that to any x belonging to the domain 
A of T there corresponds one and only one 

ye Y, namely, l' (x) - ( y] for any xe A. Then I 
is called a univalent correspondence. If T and 
I~ are both univalent correspondences, I is 
called a one-to-one correspondence. For given 
sets A and B, a univalent correspondence with 
domain A and range B is called a tmapping (or 
*function) with domain A and range B (— 381 
Sets C). 


References 


See references to 381 Sets. 
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A. History 


The theory of relativity is a system of theoret- 
ical physics established by A. Einstein and is 
composed of special relativity and general 
relativity. Toward the end of the 19th century, 
it was believed that electromagnetic waves 
propagate through the ether, a hypothetical 
medium. À number of experimenters tried to 
find the motion of the earth relative to the 
ether, but all these attempts were unsuccessful 
(A. A. Michelson, E. W. Morley). Studying 
these results, in 1905 Einstein proposed the 
special theory of relativity, which extended 
Galileo's relativity principle of tNewtonian 
mechanics to telectromagnetism and radically 
revised the concepts of space and time. Almost 
all the conclusions of special relativity theory 
are now confirmed by experiments, and this 
theory has even become a guiding principle for 
developing new theories in physics. By extend- 
ing special relativity, Einstein established 
(1915) the general theory of relativity. Its prin- 
cipal part is a new theory of gravitation con- 
taining Newton's theory as a special case. Its 
conclusions about the solar system are com- 
patible with observed facts that are regarded 
as experimental support for the theory. Effects 
due to general relativity other than those just 
described have been studied to a considerable 
extent, but it is hard at present to test theoret- 
ical results experimentally, and there are some 
doubts about the limit of its applicability. 


B. Special Relativity 


In Newtonian mechanics, natural phenomena 
are described in a 3-dimensional Euclidean 
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space considered independent of time. In spe- 
cial relativity, however, it is postulated that 
space and time cannot be separated but are 
unified into a 4-dimensional pseudo-Euclidean 
space with the tfundamental form 


ds? =F gay dx* dx? =c? dt? — dx? — dy? — dz’, 
a.b 


where a, b=0, 1, 2, 3 and 
(xV, x! x? 9 (cfi x, y, Z). 


Here (x, y, z) are spatial Cartesian coordinates, 
t is time, and c is the speed of light. This space 
was introduced by H. Minkowski and is called 
Minkowski space-time. By means of it Min- 
kowski gave an ingenious geometric interpre- 
tation to special relativity. 

A nonzero vector V is called timelike, null 
(or lightlike), or spacelike according as V? » 0, 
—0, or <0, where V^ 3,,9,,V^V*. 

The group of motions in Minkowski space- 
time is called the inhomogeneous Lorentz 
group. Its elements can be written as 


Bol iva „i ab | 
X =) cix Tc, > Gar? Cj = Jij 
a a,b 


where cj and c' are constants. The transfor- 
mations with c'=0 are usually called Lorentz 
transformations, and the group G composed of 
these transformations is called the homoge- 
neous Lorentz group or simply the Lorentz 
group. These are important concepts in special 
relativity. If Gy denotes the *Àconnected compo- 
nent of the identity element of G, the factor 
group G/G, is an Abelian group of type (2, 2) 
and of order 4. We call Go the proper Lorentz 
group. A frequently used element of Go is 


x—vt , t—(v/c?)x 
E = t Ss 
1 -v?/c? ./1—v?/c? 

ysy Zë Jose (1) 


Such transformations form a 1-dimensional 
subgroup of Go with v as a parameter, and the 
composition law of the subgroup is given by 


uctv 


LLL. -—————. 
worse Cm 1 4- uv/c? 


Elements of G not belonging to Gg are 


, VO! 0 RC e. 
Tix o roe i=1,2,3, 


Six" Hl x" —xb5  i21,2,3. 


Both T (time reversal) and S (space reflection 
or parity transformation) have aroused much 
interest among physicists. 

Historically, the transformation formula (1) 
was first obtained by H. A. Lorentz, under the 
assumption of contraction of a rod in the 
direction of its movement in order to over- 
come the difficulties of the ether hypothesis, 
but his theoretical grounds were not satisfac- 
tory. On the other hand, Einstein started with 
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the following two postulates: (1) Special prin- 
ciple of relativity: A physical law should be 
expressed in the same form in all inertial sys- 
tems, namely, in all coordinate systems that 
move relative to each other with uniform 
velocity. (it) Principle of invariance of the speed 
of light: The speed of light in a vacuum is the 
same in all inertial systems and in all direc- 
tions, irrespective of the motion of the light 
source. From these assumptions Einstein 
derived (1) as the transformation formula 
between inertial systems x — (ct, x, y, z) and 

x' — (ct', X', y', z^) that move relative to each 
other with uniform velocity v along the com- 
mon x-axis. This was the first step 1n special 
relativity, and along this line of thought, Ein- 
stein solved successively the problems of the 
Lorentz-Fitzgerald contraction, the dilation of 
time as measured by moving clocks, the aber- 
ration of light, the Doppler effect, and Fresnel's 
dragging coefficient. 


C. Relativity and Electromagnetism 


In special relativity, a physical quantity is 
represented by a *tensor (or a scalar or a vec- 
tor) in Minkowski space-time, and physical 
laws are written in tensor form and are invar- 
iant under Lorentz transformations of coordi- 
nates. This is the mathematical expression of 
the special principle of relativity. Since the 
transformation (1) tends to a Galileo transfor- 
mation in Newtonian mechanics as c oo, the 
special principle of relativity is a generalization 
of the Newton-Galileo principle of relativity. 
To summarize mathematically, it may safely 
be said that special relativity is a theory of 
invariants with respect to the Lorentz group. 
To illustrate this conclusion we consider elec- 
tromagnetic theory. 

The electric field E is usually represented by 
a "polar vector" and the magnetic field H by 
an “axial vector" in a 3-dimensional Euclidean 
space. Even if the magnetic field does not exist 
in one inertial system, the field can arise in 
another system that moves uniformly rela- 
tive to the original system. In view of this, 
electric and magnetic fields are considered in 
relativity to form one physical quantity with 
components 


0 -E, ES -E, 


E, 0 H, —H, 


Ei T 
Wi E, —H, 0 1H, 
E, H, —H, 0 


This quantity transforms as an ‘alternating 
tensor of degree 2 under Lorentz transfor- 
mations. In like manner, the electric charge 
density p and the electric current density J 
are unified into a fcontravariant vector with 
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respect to Lorentz transformations: 
s — (s9, el, s?, s?) 
=(p,J,/c, J,/c, J,/c). 


Such a vector s in Minkowski space-time 

is sometimes called a four-vector as distin- 
guished from an ordinary vector such as J. If 
the electromagnetic field F;; and the current 
four-vector s are thus defined, the "Maxwell 
equations, the basic equations of electro- 
magnetism, can be written in tensor form: 


QS P OR OR 
= axe ^ — 0x! Ox Gi ^ 
where 


FÜ-Yy gg" Fp, 3.9" g, 7 9j. 

a,b a 
In the same way, the equation of motion for a 
charged particle in an electromagnetic field 
can be expressed as 
a x^ 


E “SAS 


“mes 





Fa) "Fa. 


where e and m are the charge and mass of 
the particle, respectively, and s is the arc 
length along the particle trajectory (— 130 
Electromagnetism). 

Though special relativity originated in 
studies of electromagnetic phenomena, it has 
gradually become clear that the theory is valid 
also for other phenomena. One interesting 
result is that the energy of a particle moving 
with uniform velocity v is given by 


E=mce?/./1—v?/c?, 


and accordingly even a particle at rest has 
energy mc? (rest energy). This shows the equiv- 
alence of mass and energy, with the conversion 
formula given by E — mc?. This conclusion has 
been verified experimentally by studies of 
nuclear reactions and has become the basis 

of the development of nuclear power. The 
special principle of relativity also showed its 
validity in the electron theory of P. A. M. 
Dirac (1928) and the quantum electrodynamics 
of S. Tomonaga (1943) and others. It has, 
however, been shown that the invariance for 
space reflection (namely for the coset SG, of 
the Lorentz group G) is violated in the decay 
of elementary particles (T. D. Lee and C. N. 
Yang, 1956; C. S. Wu et al., 1957). Similar 
results have been obtained for time reversal 

(J. H. Christenson, J. W. Cromin, V. L. Fitch, 
and R. Turlay, 1964). 


D. General Relativity 


Special relativity has its origin in studies of 
electromagnetic phenomena, while the central 





359 D 
Relativity 


part of general relativity is a theory of gravita- 
tion founded on the general principle of rela- 
tivity and the principle of equivalence. The first 
principle is an extension of the special prin- 
ciple of relativity to accelerated systems in 
general. It requires that a physical law should 
be independent of the choice of local coordi- 
nates in a 4-dimensional tdifferentiable mani- 
fold representing space and time (space-time 
manifold). Since a physical quantity is repre- 
sented by a tensor on the space-time manifold, 
physical laws are expressed in tensor form, in 
agreement with the first principle. The second 
principle claims that gravitational and inertial 
mass are equal, and accordingly fictitious 
forces due to acceleration (such as centrifugal 
force) cannot be distinguished from gravita- 
tional force. This had been shown with high 
accuracy by the experiments of R. von Eótvós 
(1890) and others. 

Starting from these two principles, Einstein 
was led to the following conclusion. If a gravi- 
tational field is produced by matter, the cor- 
responding space-time structure is altered; 
namely, flat Minkowski space-time is changed 
into a curved 4-dimensional manifold with 
*pseudo-Riemannian metric of tsignature (1, 3). 
The "fundamental tensor g;; of this manifold 
represents the gravitational potential, and the 
gravitational equation satisfied by g;; can be 
expressed as a geometric law of the manifold. 
Gravitational phenomena are thus reduced to 
properties of the geometric structure of the 
space-time manifold. This idea, which was not 
seen in the older physics, became the motif in 
the development of tunified field theories. 

Now the gravitational law proposed by 
Einstein is an analog of the.* Poisson equation 
in Newtonian mechanics. Let R; and R be the 
*Ricci tensor and the tscalar curvature formed 
from a, Then outside the source of a gravita- 
tional field, g,; must satisfy 


Gj;2Rj—g;R/2-0, thatis, Rj—0, Q) 


and inside the source, 


Gj=kT;, (3) 


where x is the gravitational constant. Here the 
energy-momentum tensor 7; is a symmetric 
tensor representing the dynamical state of 
matter (energy, momentum, and stress). Usu- 
ally (2) and (3) are called the exterior and 
interior field equations, respectively. 

Next, the equation of motion of a particle in 
a gravitational field is given by 

2 t 
"Zu Yos x “- +1, (4) 





if the particle mass is so small that its effect on 
the field is negligible. Here ó/ós stands for 
*covariant differentiation with respect to the 
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arc length s along the particle trajectory. In 
other words, a particle in a gravitational field 
moves along a timelike *geodesic in the space- 
time manifold. Similarly, the path of light is 
represented by a null geodesic, whose equation 
is formally obtained from (4) by replacing the 
right-hand side of the second equation by zero. 

Experimental verification of the theory of 
general relativity has been obtained by detec- 
tion of the following effects: the shift of spec- 
tral lines due to the gravity of the earth and of 
white dwarfs, the deflection of light or radio 
waves passing near the sun, the time delay of 
radar echo signals passing near the sun, and 
the advance of the perihelion of Mercury. All 
the observational data are compatible with the 
theoretical results. Time delay and the advance 
of the perihelion have been observed in a 
binary system of neutron stars. It is generally 
accepted that these results are experimental 
verifications of general relativity. 

It should be noted that (2) has wave solu- 
tions, which have no counterpart in Newton's 
gravitational theory. This fact implies that 
gravitational effects propagate with the veloc- 
ity of light. The gravitational waves transport 
energy and momentum, and the gravitational 
mass is decreased by the emission of the waves. 
Experiments to detect gravitational waves 
generated in the universe have been planned, 
and the decrease of the orbital period of a 
binary star system due to the emission of 
gravitational waves has been observed. 

The concept of gravitational waves suggests 
the existence of a quantum of the gravitational 
field (graviton); however, the detection of the 
graviton is far from feasible. 

In the interior equation (3), the matter pro- 
ducing a gravitational field is represented by a 
tensor T; of class C°. But there is also a way of 
representing it by singularities of a solution of 
the exterior equation (2). From this point of 
view, the equation of motion of a material 
particle (1.e., singularity) is not assumed a 
priori as in (4), but is derived as a result of (2) 
(A. Einstein, L. Infeld, and B. Hoffman, 1938). 

In the static and weak field limits, the funda- 
mental form is approximately given by New- 
ton's gravitational potential @ as 


ds? =c?(1—/c*) dt? 
—(1+@/c?)(dx? + dy? + dz’), 


and (2) and (3) reduce in this limit to Laplace’s 
and Poisson’s equations, respectively. New- 
ton’s theory of gravity is valid in the limit 
o/c? «« 1. 

Stimulated by the discoveries of neutron 
stars and black holes and by the big-bang 
theory of the universe in the 1960s, numerous 
studies of general relativity have been carried 
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out on such problems as the gravitational field 
of a spinning mass, the dynamical process of 
gravitational collapse, the space-time structure 
of black holes, the generation of gravitational 
waves, the global structure and dynamics of 
the universe, and so on. A comparison of the 
theoretical predictions and the observations is 
generally favorable, but the phenomena in the 
universe are so complex that the effects of 
general relativity cannot always be isolated. 


E. Solutions of Einstein's Equations 


The isometric symmetry of space-time is de- 
scribed by tKilling vectors. The stationary 
metric is characterized by a timelike Killing 
vector, in which case equation (2) reduces to 
an elliptic partial differential equation on a 
3-dimensional manifold. If the space-time is 
axially symmetric as well as stationary, (2) 
reduces to the Ernst equation: 


(e+ £*)V^e — 2Ve- Ve, (5) 


where V represents divergence in a flat space. 
The metric tensors are derived from the com- 
plex potential e The solutions of (5) can be 
obtained using techniques developed for the 
soliton problem. 

One example of stationary and axially sym- 
metric solutions is the Kerr metric, which is 
written as 


2 
ds? =c? d? — (asin? 0d — cdt}? 
p 


2 dr? 2 S 2 | oaitein? 2 
—p WELL — (r^ - a^)sin* Ode", 
(6) 


with p? —r? +a? cos? 0 and A=r? — 2mr + 

a?. This metric solution represents a gravita- 
tional field around a spinning mass with mass 
M=mc?/G and angular momentum J = 

Mac. When a—0, this metric reduces to the 
Schwarzschild metric. 

Applying a Bácklund transformation to the 
Kerr metric, an infinite series of stationary and 
axially symmetric solutions can be derived. All 
these solutions belong to the space-time metric 
with, in general, two Killing vectors. 

The dynamical evolution of space-time 
structure has been studied by means of the 
*Cauchy problem of general relativity. Choos- 
ing appropriate dynamical variables, equation 
(2) or (3) is divided into constraint equations 
in terms of the initial data and evolution 
equations in terms of the dynamical variables. 
The latter thyperbolic equations may also be 
written in Hamiltonian form. 

A typical example of such a problem is 
the dynamics of a spatially homogeneous 3- 
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dimensional manifold; this has been studied as 
a cosmological model. The space-time with 

a constant scalar curvature R is called de Sitter 
space, and reduces to the Minkowski space if 
R=0. If the 3-dimensional space is isotropic 
as well as homogeneous, the metric takes the 
form 


ds? =c? dt? — a(t)? (dx? + f(x)? 
x (d0? -- sin? 0dq?)], 


where f(x) —sin x, x, or sinh x. These are called 
Robertson-Walker metrics and are considered 
to describe a realistic expanding universe. 


F. Global Structure of Space-Time 


Following the advances of modern differential 
geometry, manifestly coordinate-independent 
techniques to analyze space-time properties 
have been applied to general relativity. The 
mathematical model of space-time is a con- 
nected 4-dimensional tHausdorff C?-manifold 
endowed with a metric of signature (1,3). The 
metric allows the physical description of local 
causality and of local conservation of energy 
and momentum. The metric functions obey the 
Einstein field equation (2) or (3). 

In order to clarify the global structure of 
the solutions of Einstein's equations, maxi- 
mally analytic extension of the solutions has 
been studied. The maximal extension of the 
Schwarzschild metric is given as 


—— et (dy? = du’) 


= r (de +sin? Ad’), 


using the Kruskal coordinates, which are re- 
lated to the coordinates of (6) by 


tanht/4m=u/v or v/u. 


To study the global structure at infinity, a 
tconformal mapping of the metric is used. For 
example, the Minkowski metric is written in 
the form ds? = Q? ds?, where 


ds? = dp dq — isin?(p— q)(d0? + sin? 0de?) 


and Q —secpsecq, tanp=t+r, tanq —t—r. 
By means of this mapping, all points, includ- 
ing infinity, are assigned finite p, q coordinate 
values in —z/2xqxpxn/2. 

Singularities in space-time are one of the 
major problems concerning the global struc- 
ture of the manifold. For some Cauchy prob- 
lems relevant to cosmology and gravitational 
collapse, the inevitable occurrence of a sin- 
gularity has been proved (singularity theorem). 
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A sufficient condition for occurrence of a 
singularity is that there be some point p such 
that all the null geodesics starting from p 
converge to p again. In addition to this con- 
dition, for the proof of the singularity theorem 
it is presumed that the space-time is free of 
closed nonspacelike curves, that a Cauchy 
surface exists, and that the energy-momentum 
tensor satisfies the condition 


(tyra) rino 
for any timelike vector V’. The singularity 
whose existence is implied by this theorem 
means that the space-time manifold is geodesi- 
cally incomplete (the space-time is complete if 
every geodesic can be extended to arbitrary 
values of its affine parameter). 

The causal structure of space-time is also 
related to the global structure of the manifold. 
In this regard, black holes have been intro- 
duced as the final state of gravitational col- 
lapse. In the black-hole structure of space- 
time, there exists a closed surface called an 
event horizon in an asymptotically flat space. 
The event horizon is the boundary (the set 
of points) in space-time from which one can 
escape to infinity, or the boundary of the set 
of points that one can see from the infinite 
future. Then the black hole is a region from 
which no signal can escape to the exterior of 
the event horizon. 

If we assume that singularities do not exist 
in the exterior of the event horizon, a station- 
ary black-hole structure is uniquely described 
by the Kerr metric [6]. In the case of spheri- 
cally symmetric collapse, this assumption 
is verified and the final metric is given by 
the Schwarzschild metric. However, it is not 


known whether this assumption is true in 


more general gravitational collapse. 
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360 (XX1.8) 
Renaissance Mathematics 


Toward the middle of the 13th century, scho- 
lastic theology and philosophy were at their 
height with the Summa theologiae of Thomas 
Aquinas (1225?— 1274); but in the latter half 
of the century, the English philosopher Roger 
Bacon (1214-1294) attacked Aquinian philoso- 
phy in his Opus majus, insisted on the impor- 
tance of experimental methods in science, and 
strongly urged the study of mathematics. The 
Renaissance flourished first in Italy, then in 
other European countries in the {Sth and 16th 
centuries, in the domains of the arts and liter- 
ature. Newer ideas in mathematics and the 
natural sciences dominated the 17th century. 
However, it was the invention of printing in 
the 15th century, the translation of the Greek 
texts of Euclid and Archimedes into European 
languages, and the importation of Arabian 
science into Europe during the Renaissance 
that prepared for this development. 

In the 15th century, the German priest 
Nicolaus Cusanus (1401-1464) discussed 
infinity, the convergence of infinite series, and 
some problems of quadrature. During the 
same period, the German scholar Regiomon- 
tanus (1436-1476) wrote the first systematic 
treatise on trigonometry independent of as- 
tronomy. Leonardo da Vinci (1452-1519), 
the all-encompassing genius born in the same 
century, left manuscripts in which he wrote 
about mechanics, geometric optics, and per- 
spective. Da Vinci's contemporary, the Ger- 
man painter A. Dürer (1471—1528), wrote a 
textbook on perspective. In 1494, L. Pacioli 
(1445?—1514) published Summa de arithmetica, 
one of the first printed books on mathematics. 
Its content, influenced by Arabian mathemat- 
ics, includes practical arithmetic and double- 
entry bookkeeping. The book enjoyed wide 
popularity. 

The best known result of 16th-century 
mathematics is the solution of algebraic equa- 
tions of degrees 3 and 4 by the Italian mathe- 
maticians Scipione del Ferro, N. Tartaglia 
(1506-1557), G. Cardano (1501-1576), and L. 
Ferrari (1522-1565). Cardano published the 
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solution of equations of the third degree in his 
book Ars magna (1545). The solution was due 
to Tartaglia, to whom acknowledgment was 
made, although publication of the method 
was against his will. This constitutes a famous 
episode in the history of mathematics, but 
what is historically more important is the fact 
that essential progress beyond Greek mathe- 
matics was made by mathematicians of this 
period, since the Greeks were able to solve 
equations only of degrees 1 and 2. Algebra 
was subsequently systematized by the French 
mathematician F. Viéte (1540—1603). 

By the end of the 15th century, practical 
mathematics (influenced by the Arabians) had 
become popular in Europe, and more ad- 
vanced mathematics began to be studied in 
European universities, especially in Italy. In 
1543, N. Copernicus (1473—1543) published 
his heliocentric theory (1543); G. Galilei (called 
Galileo) (1564-1642), the indomitable propo- 
nent of this theory, was also born in the 16th 
century. Copernicus studied at the Univer- 
sities of Bologna, Padua, and Ferrara; Galileo 
studied at the University of Pisa and taught at 
the Universities of Pisa, Padua, and Florence. 
A system of numeration was imported from 
Arabia to Europe in the 13th century; by the 
time of S. Stevin (1548?—1620?) it took the 
definite form of a decimal system, and with the 
development and acceptance of printing, the 
forms of the numerals became fixed. 
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A. Introduction 


The concept of renormalization was intro- 
duced by S. Tomonaga, J. S. Schwinger, M. 
Gell-Mann, and F. E. Low in order to over- 
come the difficulty of divergence in field theory. 
If the upper bound of the momentum is limited 
to a finite cutoff value A, then physical quan- 
tities, for example, the mass m of an electron, 
can be obtained as finite quantities by letting 
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A go to infinity after summing all divergent 
terms. This is called the renormalization method 
using subtraction. Since the cutoff A is arbitrary 
insofar as it is finite, the Green's functions are 
indefinite because they depend on A. This 
dependence on the cutoff A corresponds to 
the response for the scale transformation of 
length, and this transformation is a certain 
(semi) group, called a renormalization group. 
Several kinds of renormalization group have 
been used in field theory, as well as in the 
statistical mechanics of phase transition. 


B. Renormalization Group in Field Theory 


[1-3] 


A typical method to resolve the ultraviolet 
divergence is to add subtraction terms in the 
Lagrangian so that they cancel the divergence. 
This cancellation is usually performed in each 
order of the perturbation expansion. When 
the addition of a finite number of subtraction 
terms cancels the divergence, the relevant 
Hamiltonian is said to be renormalizable. 
Otherwise it is called unrenormalizable. The 
Lagrangian density 


1 1 1 

Po 750" 990,090 "mt "re, 
for example, can be renormalized by the trans- 
formation po = Z1? 9, Jo =Z,Z; 7g, and m= 
m? + óm?. All the divergences are taken into 
the renormalization constants Z3, Z,, and 
óm?, so that the renormalized quantities o. 
g, and m are finite. These renormalization 
constants can be calculated by means of a 
perturbation method, but the requirement of 
circumventing the divergences alone is not 
sufficient to determine them explicitly. This 
indeterminacy is usually expressed as Z3(4) 
and Zu i.e., in terms of a parameter u, 
called the renormalization point. The u- 
dependence of these functions can be deter- 
mined by means of the following renormaliza- 
tion conditions: 

lim (p? -- m^) G?X(p, p)= 1; 


p^tu220 


u? 
I (pi, un) 1, pip;- 4 Ul — 46,)). 


Since the renormalization constants depend on 
the continuous parameter u, the renormalized 
Green's function and coupling constant g 

are also functions of jj, and consequently the 
quantity d@"({p;}, vu, g(u)) defined by 


d" p), H. g(u)) 


2N 
= h (p2 + vil Gite, p, 9(H)) 
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satisfies the renormalization equation 
p+ 85 —2Ny 4o (p), a, g)--0 
Ou ôg 1J)5 rs H 


where $= udg/dy and y 23udlog Z;(u)/du. If 
the coefficients f and y are calculated pertur- 
bationally up to a certain order, the renormal- 
ized Green's function is obtained up to the 
same order by solving the foregoing renor- 
malization equation. This is the first kind of 
renormalization group. A second kind ex- 
presses the response of the renormalized 
Green's function to the change in the mass and 
coupling constant, and is expressed by the 
Callan-Symanzik equation [4, 5] 


ô d 
TEEN GEN — AGE", 


where Bio) - Zmo0g/0my and y,(g) -3Zmg: 
0log Z;/0mo, and where Z is determined by 
Zmgóm/àm, =m. The inhomogeneous term is 
defined by 


AGE" Zn, A Gg? 
mo 


0 
+ Zmo UN log Z4Gg. 
0 


The irreducible Green's function IK) satisfies 


d ô 
m——+ B(g)=-—2Ny,(g) [TEY =iATẸ™. 
om 0g 


Since the inhomogeneous term can be neg- 
lected in the high-energy region, the foregoing 
equation becomes homogeneous, and con- 
sequently its solution is 


m 
Cal ps. 


g(a) 
= ev] — anf Y(g)B "ie ag | 


g 
x T" (p,, m, g(4)). 


Here g(A) is the solution of the equation 

fg B^ (g')dg' 2 logA. Since dimensional analy- 
sis yields IP (Ap; m, g)= A* "fe (Pi, m/2, 9), 
the foregoing solution shows that high-energy 
phenomena can be described by the low- 
energy phenomena whose coupling constant is 
given by g,, z g(oo). In particular, when g,, — 0, 
high-energy phenomena are described by the 
asymptotically free field. This circumstance is 
called asymptotic freedom. 


C. Renormalization Group Theory in 
Statistical Physics 


The renormalization group technique has 
proved to be powerful in statistical physics, 
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particularly in studies of phase transitions and 
critical phenomena [6-10]. The correlation 
length č diverges like £(T) -(T— T.) " near 
the critical point T., where v is called the crit- 
ical exponent of €. Similarly, the correlation 
function C(R) for the distance R behaves like 
C(R)-R V* ?*? x exp( KR), k-£ !, where 
d is the dimensionality of the system and y is 
the exponent describing the deviation of the 
singular behavior of C(R) from classical the- 
ory. Renormalization is useful in evaluating 
these critical exponents systematically. The 
fundamental idea ts to eliminate some degrees 
of freedom, to find recursion formulas for 
interaction parameters, and then to evaluate 
critical exponents from their asymptotic be- 
havior near the fixed point. There are many 
different ways of carrying out this idea ex- 
plicitly. Roughly classifying these into two 
groups, we have (i) momentum-space renor- 
malization group theories [6—10] and (ii) 
real-space renormalization group theories 
[10,11]. 

The common fundamental structure of these 
renormalization group techniques is explained 
as follows. First the momentum space or real 
space is divided into celis and rapidly fluctuat- 
ing parts, namely, small-momentum parts 
inside each cell are integrated or eliminated, 
and consequently the remaining slowly fluc- 
tuating parts, namely, long-wave parts, are 
renormalized. The original Hamiltonian % is 
transformed into J£, by means of this elimi- 
nation process and by some scale transforma- 
tion that preserves the phase space volume. 
This renormalization operation is written as 
R,: ie., #, - RyJf,. Similarly we have #, = 
R, 4%, — R2 Mo, ..., 6,  RyAP, RI .... 
This transformation R, has the (semi) group 
property R,,, = R,R,.. A generator G is defined 
by G—lim, ,, lb, — 1)/(b—1). That is, R, = 
exp(IG), e! 2 b. The transformation of J is 
expressed as d /dl 2 G[. ]. The fixed point 
H*=R,H* is the solution of G[.£*] 20. In 
order to find critical exponents from the as- 
ymptotic behavior of G near #*, we consider 
a Hamiltonian of the form # = #*+wQ and 
expand G[J] as GLH * --wQ] -wKQ + 
O(w?). If the operator K thus defined has a 
negative eigenvalue 4;, the corresponding 
physical quantity Q; becomes irrelevant after 
repeating the renormalization procedure, and 
the physical quantity Q; corresponding to a 
positive eigenvalue 4;7 0 becomes relevant. 
Thus, by introducing a field h; conjugate to 
the relevant operator Q;, we study the Hamil- 
tonian J£ — J£* -- Y, h;Q;. The free energy per 
unit volume f [2€ ]  f(h,,h;, ...) is found to 
have the scaling property 


fy, ss hy s) b 4 f(b?ih,s ..., bh, A 
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By taking Q, as the energy operator, we have 
h, — T— T; st. By the normalization bèh, — 1, 
we obtain the scaling law 


PU cg E Jt Ag a ESL). 


The critical exponent of the specific heat de- 
fined by C~t~* is given by the formula «= 
2 — d/À,. Other scaling exponents (09; can be 
obtained via the formula q;— 4;/4, from the 
eigenvalues of K. The simplest example of 
R, is the case where a single interaction para- 
meter K is transformed into a new parameter 
K' by K'=f,(K). The fixed point K* is given 
by the solution of K* = f,(K*). The correla- 
tion exponent v defined by €~(K — K*) " is 
given by the Wilson formula v —logb/log A, 
A — (df, /d K)y e, In most cases, R, is con- 
structed perturbationally, and critical expo- 
nents are usually calculated in power series of 
£&d,—d,as Q— 9o + 1E + 95€? +..., where 
d, denotes the critical dimension. This is called 
the c-expansion. The first few terms are cal- 
culated explicitly for specific models, such as 
the ~*-model. By applying the Borel sum 
method to these c-expansions, one can esti- 
mate critical exponents [10,11]. 

The renormalization group method can be 
applied to other many-body problems, such as 
the Kondo effect [12]. 
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A. General Remarks 


For a mathematical system A, a mapping from 
A to a similar (but in general "more concrete") 
system preserving the structure of A is called a 
representation of A. In this article, we consider 
the representations of tgroups and fassociative 
algebras. For representations of other alge- 
braic systems — 42 Boolean Algebras; 231 
Jordan Algebras; 248 Lie Algebras. For topo- 
logical, analytic, and algebraic groups — 13 
Algebraic Groups; 69 Compact Groups; 249 
Lie Groups; 422 Topological Abelian Groups; 
423 Topological Groups; 437 Unitary Repre- 
sentations. For specific groups — 60 Classical 
Groups; 61 Clifford Algebras. 


B. Permutation Representations of Groups 


We denote by Gu the group of all tpermuta- 
tions of a set M (— 190 Groups B). À permuta- 
tion representation of a group G in M isa 
homomorphism CG G,,. We denote by ay the 
permutation of M corresponding to ae G and 
write ay(x)— ax (xe M). Then we have a con- 
dition (ab)x = a(bx), 1x 2 x (a, beG, 1 is the 
identity element, xe M). In general, if the prod- 
uct axe M of aeG and xe M is defined and 
satisfies this condition, then G is said to oper- 
ate on M from the left, and M is called a left 
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G-set. Giving a permutation representation of 
G in M is equivalent to giving the structure of 
a left G-set to M. A reciprocal permutation 
representation of G in M is an tantihomomor- 
phism G>G,,, which becomes a homomor- 
phism if we define the multiplication in Gu by 
the right notation x( fg) ^ (xf le If the product 
xae M of ae G and xe M is defined and satis- 
fies the conditions x(ab) =(xa)b and x1 =x, 
then, as before, G is said to operate on M from 
the right, and M is called a right G-set. Giving 
a reciprocal permutation representation of G 
in M is equivalent to giving the structure of a 
right G-set to M. 

A (reciprocal) permutation representation 1s 
said to be faithful if it is injective; the corre- 
sponding G-set is also said to be faithful. In 
particular, we can take G itself as M and de- 
fine the left (right) operation by the multipli- 
cation from the left (right). Then we have a 
faithful permutation representation (reciprocal 
permutation representation), which is called 
the left (right) regular representation of G. For 
ac G, the induced permutation ag: x-ax (xa) 
is called the left (right) translation by a. 

We call a left G-set simply a G-set. If a sub- 
set N of a G-set M satisfies the condition that 
aeG, xeN implies axe N, then N forms a G- 
set, which is called a G-subset of M. If a G-set 
M has no proper G-subset (Ge, one different 
from M itself and the empty subset), then for 
any two elements x, ye M there exists an ele- 
ment acG satisfying ax = y. In this case, the 
operation of G on M is said to be transitive, 
and the corresponding permutation represen- 
tation is also said to be transitive. If an equiva- 
lence relation R in a G-set M is compatible 
with the operation of G (i.e., R satisfies the 
condition that ae G, R(x, y) implies R (ax, ay)), 
then the quotient set M/R forms a G-set in the 
natural way, called the quotient G-set of M by 
R. If a G-set M has no nontrivial quotient G- 
set, i.e., if the only equivalence relations com- 
patible with the operation are 


R(x x) forany x,x eM 


and 


R(x,x) ifandonlyif x=x, ` 

then the operation of G on M and the corre- 
sponding permutation representation are said 
to be primitive. 

A mapping f: M — M' of G-sets is called a 
G-mapping (G-map) if the condition f(ax) — 
af(x) (ae G, xe M) is satisfied. G-injection, G- 
surjection, and G-bijection are defined natur- 
ally. The inverse mapping of a G-bijection is 
also a G-bijection. Two permutation represen- 
tations are said to be similar if there exists a 
G-bijection between the corresponding G-sets. 
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Let M be a transitive G-set, and fix any 
element xe M. If we view G as a G-set, the 
mapping f: G— M defined by f(a)=ax is a G- 
surjection and induces a G-bijection f: G/R 
2 M. Here an equivalence class of R is pre- 
cisely a left coset of the stabilizer (stability 
group or isotropy group) H. = (aeG|ax— x]. 
Hence we have a G-bijection G/H, — M. Con- 
versely, for any subgroup H of G, G/H isa 
transitive left G-set. A transitive G-set is called 
a homogeneous space of G. 

For a family {M,},-, of G-sets, the Car- 
tesian product IT, M, and the tdirect sum 
Drea M; become G-sets in the natural way; 
they are called the direct product of G-sets and 
the direct sum (i.e., disjoint union) of G-sets, 
respectively. Every G-set M is the direct sum of 
a family {M,} of transitive G-subsets, and each 
M, is called an orbit (or system of transitivity). 
For a G-set M, the direct product G-set M* — 
M x ... x M (k times) contains a G-subset 
M'9 — ((x,, ...,x,)| ij implies x; xj. If M® 
is transitive, M is said to be k-ply transitive. If 
M is transitive and the stabilizer of each point 
of M consists of the identity element alone, M 
is said to be simply transitive. 

If M has n elements, a permutation repre- 
sentation of a group G in M is said to be of 
degree n. When G is a group of permutations 
of M, the canonical injection Go © y is a 
faithful permutation representation; this case 
has been studied in detail (— 151 Finite 
Groups G). 


C. Linear Representations of Groups and 
Associative Algebras 


Let K be a tcommutative ring with unity ele- 
ment and M be a K-module. Though we shall 
mainly treat the case where K is a field and M 
is a finite-dimensional "linear space over K, the 
case where K is an tintegral domain and M is 
a tfree module over K of finite rank is also 
important. Since K is commutative, we can 
write Ax =x/ (Ae K, xe M). Let &,(M) be the 
associative algebra over K consisting of all K- 
endomorphisms of M, and let GL(M) be the 
group of all tinvertible elements in &(M), 
where we assume M # {0}. Let A be an as- 
sociative algebra over K. A linear represen- 
tation of the algebra A in M is an algebra 
homomorphism A &,(M). We always assume 
that A has a unity element and the homomor- 
phisms are unitary. For convenience, we can 
also consider a linear representation in the 
trivial space M = {0}, which is called the zero 
representation. A reciprocal linear represen- 
tation is an antihomomorphism A76,(M). A 
linear representation of a group G in M isa 
group homomorphism G — GL(M). This can 
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be extended uniquely to a linear representa- 
tion of the tgroup ring K[G] in M, and con- 
versely, the restriction of a linear representa- 
tion of K[G] in M to G is a linear representa- 
tion of G; and similarly for reciprocal linear 
representations. Thus the study of (reciprocal) 
linear representations of a group G in M can 
be reduced to the study of (reciprocal) linear 
representations of the group ring K [G] in M. 

We now consider the linear representation 
of associative algebras, which we call simply 
"algebras." (Note that a group ring has a 
canonical basis—the group itself —and allows 
a more detailed investigation; — Sections G, 
I) 

Given a commutative ring K with unity and 
a linear representation p of a K-algebra A in a 
K-module M, we introduce the structure of a 
left A-module into M by defining ax = p(a)x 
(ae A, xe Mj; the structure of a K-module in M 
obtained by the canonical homomorphism K 
— A coincides with the original one. This A- 
module is called the representation module of 
p. Conversely, for any left A-module M we can 
define a linear representation p of A in M 
(with M viewed as a K-module via K — A) by 
putting p(a)x — ax; the representation module 
of p coincides with the original one. This rep- 
resentation p is called the linear representa- 
tion associated with M. A reciprocal linear 
representation of A corresponds to a right A- 
module. Thus the study of (reciprocal) linear 
representations of A is equivalent to the study 
of left (right) A-modules. For instance, if the 
operation of a group G on M is trivial: ox =x 
(c € G, x e M), the corresponding representation 
of G in M assigns the identity mapping I, to 
every a € G. Furthermore, if M — K, this repre- 
sentation is called the unit representation of G 
(over K). 

Let p, p' be linear representations of A in 
K-modules M, M', respectively. Then an 
A-homomorphism M  M' is precisely a K- 
homomorphism f': M — MI satisfying the con- 
dition f o p(a) — p'(a)o f (ae A); this is some- 
times called a homomorphism from p to p'. 

In particular, an A-isomorphism is a K- 
isomorphism f: M — M' satisfying the con- 
dition fo p(a)o f ~' = p'(a) (ae A); in this case 
we say that p and p' arc similar (isomorphic or 
equivalent) and write p= p’. 

Let M be the representation module of a 
linear representation p of an algebra A. If p is 
injective, p and the corresponding M are said 
to be faithful. For example, the linear represen- 
tation associated with the left A-module A is 
faithful; this is called the (left) regular represen- 
tation of A. If M is tsimple as an A-module, 

p is said to be irreducible (or simple). A homo- 
morphism from an irreducible representation p 
to p must be an isomorphism or the zero 
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homomorphism (Schur's lemma). In particular, 
if K is an talgebraically closed field and M is 
finite-dimensional, then such a homomor- 
phism is a scalar multiplication. A linear rep- 
resentation is said to be reducible if it is not 
irreducible. If M is tsemisimple as an A- 
module, p is said to be completely reducible (or 
semisimple). If A is a semisimple ring, any 
linear representation of A is completely reduc- 
ible. The converse also holds (— 368 Rings G). 

The linear representations associated with a 
submodule and a quotient module of M as an 
A-module are called a subrepresentation and a 
quotient representation, respectively. The linear 
representation associated with the direct sum 
M,+...+M, of the representation modules 
Mı, ..., M, of linear representations p,, ..., p, is 
written p, +... + p, and called the direct sum of 
representations. If p is never similar to the 
direct sum of two nonzero linear representa- 
tions, then p is said to be indecomposable; this 
means that M is tindecomposable as an A- 
"module. 

For linear representations p, p' of a group G 
in M, M', we define the linear representation 
p & p' in M & M' by (p & p')(g)— o(g) € p'(g) 
(ge Gy; this is called the tensor product of repre- 
sentations p and o. 


D. Matrix Representations 


Let K" be the K-module consisting of all n- 
tuples (£;) of elements in a commutative ring 
K. &,(K") is identified with the K-algebra 
M,(K) of all n x n matrices Lal over K:(45)(£j) 
cu 4j. Thus a linear representation of 
Ain K”, i.e., a homomorphism A> M,(K), is 
called a matrix representation of A over K, and 
n is called its degree. A matrix representation 
of a group G over K of degree n is a homo- 
morphism G — GL(n, K), where GL(n, K) is 
the group of all n x n invertible matrices. If 
er, ...,e,) is a tbasis of a K-module M, then by 
the K-isomorphism K"— M given by the as- 
signment (£j) È}; e;č; we have a bijective 
correspondence between the matrix represen- 
tations of A of degree n and the linear repre- 
sentations of A in M, and the corresponding 
representations are similar. Explicitly, the 
linear representation p corresponding to a 
matrix representation a—(A,,(a)) is given by 


n 


plaje;= Y eA,(a), acA. 

i-1 
Hence giving the finite-dimensional linear 
representations over a field K is equivalent to 
giving the matrix representations over K. Let 
T, T' be matrix representations of degree n, w. 
Then a homomorphism from T to T’ is an n' 
x n matrix P satisfying PT(a) — T'(a)P (ae A). 
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Therefore T and T’ are similar if and only if 
n — n' and PT(a)P ! = T'(a) (ae A) for some 
nxn invertible matrix P. For a representation 
of a group G, it suffices that this equation is 
satisfied by all ae G. 

We always assume that K is a field. Then a: 
K-module is a linear space over K. A linear 
representation p over K of a K-algebra A is 
said to be of degree n if its representation 
module M is of dimension n over K. Suppose 
that a sequence (0) 2 Moc M, c ...cC M,- M 
of A-submodules of M is given. We take a 
basis (e,, ...,e,) of M over K such that (e,, 

---, €m) forms a basis of M; over K (1 <i<r). 
Then the matrix representation corresponding 
to p relative to the basis (e,,...,e,) has the 


form 
Ty (a) Ti, (a) Tı, (a) 
Ty (a) Ta (a) 
a—T {a= ; 
0 T, (a) 


where, if we put n; — dim Mj/M, , —m;—m;.,, 
Ti(a) is an n; x n; matrix and T;(a)=0 for i» j. 
The residue classes of en, e, forma 
basis of the quotient space M;/M,. , over K, 
and the matrix representation corresponding 
to the linear representation p; associated with 
M,/M;,_, relative to this basis is given by Ty. 
The sequence {M,} is a tcomposition series if 
and only if each p; (hence Ty) is irreducible. In 
this case, p, ..., p, are uniquely determined by 
p up to their order and similarity (Jordan- 
Holder theorem). An irreducible representation 
p' similar to some p, is called an irreducible 
component of p. The number p >Q of p; similar 
to p' is called the multiplicity of o as an irre- 
ducible component of p. We also say that o 
contains p' p times or p' appears p times as an 
irreducible component of p. The representa- 
tion p is completely reducible if and only if it 
is similar to the direct sum of its irreducible 
components (admitting repetition). In this case, 
ois similar to the matrix representation 


Ty (a) 0 


0 T,, (a) 


E. Coefficients and Characters of Linear 
Representations 


We consider the linear representations of an 
algebra over a field K. A right (left) A-module 
M is regarded as a linear space over K. In its 
dual space M*, we introduce the structure of a 
left (right) A-module using the inner product 

€ , > as follows: (x, ax*> = <xa, x*5 (<x, TO? 
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= (ax, x*5), where ae A, xe M, x*e M*. If p is 
the representation associated with M, the rep- 
resentation associated with M* is called the 
transposed representation (dual representation 
or adjoint representation) of p, and is denoted 
by 'p. The linear mapping ‘p(a) is the ttrans- 
posed mapping of p(a). If M is finite- 
dimensional over K, we have (M*)* = M as an 
A-module. For a linear representation p of a 
group G, the mapping g'p(g) ! (geG) is 
called the contragredient representation of p. 
The reciprocal linear representation associated 
with the right A-module A is called the right 
regular representation of A, and its transposed 
representation De, the representation as- 
sociated with the left A-module A*) is called 
the coregular representation of A. For any 
finite-dimensional semisimple algebra and 
group ring of a finite group, the regular repre- 
sentation and the coregular representation are 
similar (— 29 Associative Algebras H). 

Let p be a linear representation of A over K 
and M be its representation module. For any 
xe M, x* e M*, we define a ‘linear form p, 6 
A* on A by p, (a) = (ax, x*5 (ae A). This is 
called the coefficient of p relative to x, x* and 
is determined by its values at generators of 
A as a linear space. In particular, a coefficient 
of a linear representation p of a group G can 
be regarded as a function on G taking values 
in K. For a fixed x* e M* the assignment x^ 
Px.x gives an A-homomorphism M > A*, 
where A* is considered as a left A-module. 
Therefore any nonzero coefficient p, ,. of an 
irreducible representation p generates an A- 
submodule of A* isomorphic to M. In other 
words, any irreducible representation of A is 
similar to some subrepresentation of the co- 
regular representation of A. In particular, 
any irreducible representation of a finite- 
dimensional semisimple algebra or a finite 
group is an irreducible component of the 
regular representation. Let A* be the subspace 
of A* generated by all coefficients p, „+ (xe M, 
x* e M*) for a given linear representation p. 
Then p zo implies Až = Až.. If p,, ..., p, are 
irreducible representations of A such that p; 
and p; are not similar unless i=j, the sum 
Aj t... A5, in A* is direct. In particular, for 
a semisimple algebra A, let the p; (1 &« i & r) be 
the irreducible representations associated with 
the minimal left ideals of the *simple compo- 
nents A; of A. Then any irreducible represen- 
tation is similar to one and only one of p,, ..., 
p,, and A* can be decomposed into the direct 
sum of As ..., Aj. In addition, each A% is 
canonically identified with A7. 

We shall treat finite-dimensional representa- 
tions exclusively. Let (e,,...,e,) be a basis of 
the representation module M of p over K, and 
let a T(a) — (4;(a)) be the matrix representa- 
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tion that corresponds to p with respect to 

this basis. Then = pe er (1 Sij &n), where 
(et, ..., ef) is the dual basis. If K is algebrai- 
cally closed and p is irreducible (or more gen- 
erally, absolutely irreducible), then {4,;} is 
linearly independent; therefore we have dim A7 
=n? (G. Frobenius and I. Schur). We take a 
matrix representation T corresponding to p 
and put y,(a)=tr T(a) (ae A). Then y, is a 
function on A that is uniquely determined by p 
and belongs to 47; y, is called the character of 
p. For a linear representation p of a group G, 
the character of p can be regarded as a func- 
tion on G. Moreover, it can be viewed as a 
function on the set of all *Àconjugate classes of 
G. The character of p is equal to the sum of the 
characters of the irreducible components of p 
taken with their multiplicities. The character 
of an irreducible representation is called an 
irreducible character (or simple character). If K 
is of characteristic 0, then p=’ is equivalent 
to *, — Xp and the different irreducible charac- 
ters are linearly independent. The character 

of an absolutely irreducible representation 

(— Section F) is called an absolutely irreducible 
character. If we consider absolutely irreducible 
characters only, the statement holds irrespec- 
tive of the characteristic of K. 

The sum of all absolutely trreducible charac- 
ters of A is called the reduced character (or 
reduced trace) of A. The direct sum of all ab- 
solutely irreducible representations of A is 
called the reduced representation of A, and its 
character is equal to the reduced character. 
The determinant of the reduced representation 
is called the reduced norm of A. 


F. Scalar Extension of Linear Representations 


Let K, L be commutative rings with unity 
element, and fix a homomorphism o: K oL. 
We denote by M? the scalar extension o*(M) 
=M Gy L of a K-module M relative to 
OiXA BS u—xGA"u(xeM;4^eK,ueL)( 277 
Modules L). For an algebra A over K, the 
scalar extension A? of the K-module A has 
the natural structure of an algebra over 
L:(a® áb & pw) =ab @ Au (a, be A; A, ue L). 
For a group G, we can regard (K[G])° = 
L[G]:g 99 4294 (geG, 4€ L). If M is a left 
A-module, then M” has the natural structure 
of a left A^-module; (a Q A)(x @ u) 2 ax 69 Au 
(ae A, x«M; A, ue L). For the linear represen- 
tation p associated with M, the linear repre- 
sentation p? over L associated with M7 is 
called the scalar extension of p relative to 
c:p^(aG 1)=p(a) @ 1,. Let (ej, ...,e,) bea 
basis of M over K. If the matrix representation 
a~»(A,(a)) corresponds to M relative to this 
basis, then the matrix representation corre- 
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sponding to M^ relative to the basis (e, © 1, 
...,€, 0) 1) over L is given by a® 1>(A,(a)’). 
A linear representation over L is said to be 
realizable in K if it is similar to the scalar 
extension p^ of some linear representation p 
over K. 

In particular, if o: K —^L is an isomorphism, 
p? is called the conjugate representation of p 
relative to c. The conjugate representation 
relative to the automorphism o:4—4 (complex 
conjugation) of the complex number field is 
called the complex conjugate representation. 1f 
m is an ideal of K and o: K ^ K/m (tresidue 
class ring) is the canonical homomorphism, 
then the construction of p" from p is called the 
reduction modulo m (— Section J). If p isa 
prime ideal of K and c: K O K, (‘local ring) is 
the canonical homomorphism, then the con- 
struction of p? from p is called the localization 
relative to p. If K is an integral domain and p 
=K — (0j, then K, is the "feld of quotients of 
K. We can also consider the “completion of 
representation” with respect to p. 

Let K be a field, L a field extension, and 
o:K—L the canonical injection. Then for a 
linear representation p of A and its represen- 
tation module M, the scalar extensions p°, M^ 
are written pl, M^, respectively. In view of M 
<M", Ac A}, &,(M) c 6, (M^) by the natural 
injections, we can regard p^ as an extension 
of the mapping p. We shall consider finite- 
dimensional representations exclusively. For 
linear representations p,, p; over K, pı = p, is 
equivalent to pF c pL. An irreducible repre- 
sentation p over K is said to be absolutely 
irreducible if its scalar extension p* to any field 
extension L is irreducible; an equivalent con- 
dition is that the scalar extension of to the 
talgebraic closure K is irreducible. Another 
equivalent condition is that every endomor- 
phism of the representation module M of p 
must be a scalar multiplication. If every irre- 
ducible representation of A over K is absolutely 
irreducible, K is called a splitting field for 
A. For a group G, if the field K is a splitting 
field for the group ring K[G], then K 
is called a splitting field for G. Let A be finite- 
dimensional over K. If K is a splitting field for 
A, any irreducible representation of A¥ is 
realizable in K for any field extension L of K. 
For an arbitrary field K, the scalar extension 
p* of an irreducible representation p to a 
tseparable algebraic extension L of K is com- 
pletely reducible. For simplicity, we assume 
that K is tperfect and L= K. Then the multi- 
plicities of all irreducible components of p* are 
the same; this multiplicity is called the Schur 
index of p. 

The set S(K) of *algebra classes over K, each 
of which is represented by a (central) simple 
component of the group algebra K[G] of 
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some finite group G, is a subgroup of the 
tBrauer group B(K) of K, known as the Schur 
subgroup of B(K). Recent research has clarified 
considerably the structure of this group [19]. 


G. Linear Representations of Finite Groups 


Let G be a finite group of order g. The linear 
representation of G over K is equivalent to the 
linear representation of the group ring K[G], 
concerning which we have already stated the 
general facts. If K is the ring Z of rational 
integers, a linear representation over K is 
sometimes called an integral representation. 
We assume that K is a field. If the character- 
istic of K is zero or more generally not a divi- 
sor of g, every linear representation of G over 
K is completely reducible (H. Maschke). Such 
a representation is called an ordinary represen- 
tation. If g is divisible by the characteristic of 
K, we have a modular representation (— Sec- 
tion I). 

The exponent of G is the smallest positive 
integer n satisfying a" — 1 for every element 
aéG. A field containing all the nth roots of 
unity is a splitting field for G (R. Brauer, 1945). 
Consequently, for such a field K, any scalar 
extension of an irreducible representation over 
K is irreducible, and any irreducible represen- 
tation over any field extension of K 1s realizable 
in K. We fix a splitting field K for G and as- 
sume that K is of characteristic 0; for example, 
we can assume K — C. 

The number of nonsimilar irreducible repre- 
sentations of G is equal to the number of 
conjugate classes in G. Each irreducible repre- 
sentation appears as an irreducible component 
of the regular representation with multiplicity 
equal to the degree. In addition, each degree is 
a divisor of the order g of G. Let p be a linear 
representation of a subgroup H of G and M 
be its representation module. Then the linear 
representation of G associated with the K [G]- 
module K [G] Gi; M is called the induced 
representation and is denoted by pê. If the 
matrix representation T corresponds to p, then 
using the partition of G into the cosets G — 

a, HU ...Ua,H we can write the matrix 
representation corresponding to of as 


T (a, aai) T (a, 'aa,) 
a> 


3 


T (a; 'aa,) T (a, 'aa,) 


where we define 7(b)=0 for bé H. The induced 
representation from a representation of degree 
1 of a subgroup is called a monomial represen- 
tation. To such a representation corresponds 

a matrix representation T such that T(a) has 
exactly one nonzero entry in each row and 
column for every ae G. For the trivial sub- 
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group H — (ej, we obtain the regular represen- 
tation of G. In general, for an irreducible 
representation o of G and an irreducible repre- 
sentation p of a subgroup H, the multiplicity 
of e in pf coincides with that of p in the res- 
triction gu of o to H (the Frobenius theorem). 
The following orthogonality relations hold for 
irreducible characters y and y of G: 


nn )% x7, 
2, xtawa ) ay. 


A 


a §9/9a Cla) C(D. 
2 xla)x(b "Lh C(a) #C(b). 


In the second formula, y ranges over all the 
irreducible characters of G, C(a) denotes the 
conjugate class of G containing a, and g, is the 
number of elements in C(a). 


H. Linear Representations of Symmetric 
Groups 


All irreducible representations of a symmetric 
group C, over the field Q of rational numbers 
are absolutely irreducible. Hence the represen- 
tation theory of S, over a field of character- 
istic zero reduces to that over Q. Since the 
group algebra A = Q[&,] is semisimple, to 
obtain an irreducible representation of S, it is 
sufficient to find a tprimitive idempotent (i.e., 
an idempotent that is not the sum of two 
orthogonal nonzero idempotents) of A. Such 
an idempotent can be obtained in the follow- 
ing way. As in Fig. 1, we draw a diagram T 
consisting of n squares arranged in rows of 
decreasing lengths, the left ends of which are 
arranged in a single column. Such a diagram T 
is called a Young diagram; if it has k rows of 
lengths f; 2 f;2...2/,-0, f, f; t... fin 
then it is written T= T(f,,..., f,). We put the 
numerals 1 to n in any order into the n squares 
of T— T( fi. f;, ..., f, as in Fig. 1, for example. 
We then denote by o any permutation of S, 
that preserves each row and construct an 
element of A:s= Yo. Similarly, we denote by 

t any permutation of C, that preserves each 
column and set t= X (sgn t)r. If we set u=t-s 
= +t-o, then u is a primitive idempotent of 
A except for a numerical factor. This implies 
that u yields an irreducible representation of 
S, The element u of A is called the Young 
symmetrizer associated with T. 


[1123 4|5|6 |7]8] 
[9 ron pizs a] 






Fig. 1 
Young diagram. n — 26; f, =8, f; —6, f3=6, , =4, f; 
=2. 
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If we put the numerals 1 to n into the n 
squares of T in a different order, we obtain 
another symmetrizer u' associated with T. 
However, these two irreducible representations 
associated with u and w’ are similar. Hence 
there corresponds to T a fixed class of irre- 
ducible representations of G,, i.e., a fixed irre- 
ducible character of S,. Moreover, any two 
different Young diagrams yield different 
irreducible characters, and any irreducible 
character is obtained by a suitable Young 
diagram. Thus there exists a one-to-one corre- 
spondence between the Young diagrams and 
the irreducible characters of GA 

The method of determining the character 
associated with a given diagram was found by 
Schur and H. Weyl (— 60 Classical Groups). 


I. Modular Representations of F inite Groups 


Let G be a finite group of order g, and let K be 
a splitting field of G of characteristic p40. If p 
is a divisor of g, we have the case of modular 
representation, in which the situation is quite 
different from the case of ordinary representa- 
tion. The theory of modular representations 
of a finite group was developed mainly by 
Brauer after 1935. 

The elements of G whose orders are prime 
to p are called p-regular. Let k be the number 
of p-regular classes of G, Le, conjugate classes 
of G containing the p-regular elements. Then 
there exist exactly k nonsimilar absolutely 
irreducible modular representations F,, F,, 
...,,. The number of nonsimilar indecom- 
posable components of the regular representa- 
tion R of G is also equal to k, and we denote 
these representations by U,, U,, ..., Up. We 
can number them in such a way that F, ap- 
pears in U, as both its top and bottom compo- 
nent. If the degree of F, is f, and that of U, is 
u,, then U, appears f, times in R and F, ap- 
pears u, times in R. The multiplicities c,, of F, 
in U, are called the Cartan invariants of G. 

Take an algebraic number field Q that is a 
splitting field of G. Let p be a prime ideal in 
Q dividing p, and let o be the domain of tp- 
integers of Q. Then the residue class field o/p is 
a finite field of characteristic p and a splitting 
field of G. Hence we can assume that o/p — K, 
where K is the field considered at the begin- 
ning of this section. Let Z,, Z2, ..., Z, be the 
nonsimilar irreducible representations of G in 
Q. We can assume that all the coefficients of 
Z; are contained in o. Replacing every coeffi- 
cient in Z; by its residue class mod p, we obtain 
a modular representation Z;. The modular 
representations Z,,...,Z, thus obtained may 
be reducible. The multiplicities d,, of F, in Z; 
are called the decomposition numbers of G. 
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They are related to the Cartan invariants by 
the fundamental relations 


n 
Cae X did;;. 
i=1 


The determinant |c,;| of degree k is a power 

of p. We set g — p*g', (p,g) - 1. Then we may 
assume that Q contains a primitive g'th root of 
unity ó(c o). Since (p, g’)= 1, the residue class 
ó(c K) of 6 is a primitive g'th root of unity. 

Let M be a modular representation of G. The 
characteristic roots of M (a) for a p-regular 
element a are powers 6” of 5. We replace each 
à" by ô” and obtain an element €(a) of Q as the 
sum of these 6”. In this manner we define a 
complex-valued function č on the set of p- 
regular elements of G. We call £ the modular 
character (or Brauer character) of M. Two 
modular representations have the same irre- 
ducible components if and only if their modu- 
lar characters coincide. Denoting by p, the 
modular character of F, and by y, that of U,, 
we have the following orthogonality relations 
for the modular characters: 


g, K=A, 
0, KA, 


3 olan a C(a) — C(b), 


» ona)» | 


0 C(a)#C(b). 


In the first sum, a ranges over all p-regular 
elements of G. 

We say that F, and F, belong to the same 
block if there exists a sequence of indices x, o, 
B,...,y, A such that Cka #0, Cag #0, ...,c,, 0. 
This is obviously an equivalence relation, and 
F,, F,, ..., F, are classified into a finite number, 
say s, of blocks B,, B,, ..., B,. If F, belongs to 
a block B,, we say by a stretch of language 
that the corresponding U, also belongs to B,. 
All the irreducible components of Z, belong 
to the same block since c, ; 0 if d #0 and 
d;, #0. If the irreducible components of Z; 
belong to B,, we say that Z; belongs to B,. Let 
x, be the number of Z; belonging to B, and y, 
the number of F, belonging to P Then x, > y,. 
If y; is the ordinary character of Z,, then y; 
can be considered as the modular character 
of Z,. If we denote the degree of Z; by z;, then 
g, X; (a)/7; for ae G is an algebraic integer and 
hence belongs to o. Now Z; and Z; belong to 
the same block if and only if g,y;(a)/z;= 
9aX;(a)/z; (mod p) for all p-regular elements 
a of G. 

If p* is the highest power of p that divides all 
the degrees z; of Z; belonging to B,, then it is 
also the highest power of p dividing all the 
degrees f, of F, belonging to B,. We call d— 
e—a the defect of B,; obviously 0€ d <e. If Z; 
belongs to a block of defect d, then the power 
of p dividing z; is p* ^*^ (h;>0). A block of 
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defect 0 contains exactly one ordinary repre- 
sentation Z;, hence also exactly one modular 
representation F, (x, — y, — 1). Moreover, we 
have Z;= F, = U,. It follows that all the de- 
grees z; of Z; belonging to a block of defect 1 
are exactly divisible by p*^!; the converse is 
also true. Z; belongs to a block of defect 0 if 
and only if y,(a) 20 for any element a of G 
whose order is divisible by p. 

Let D be any p-Sylow subgroup of the tcen- 
tralizer C,(a) of an element a of G, and let 
(D:1)=p*. Then d is called the defect of the 
class C(a), and D is called a defect group of 
C(a). The number of blocks of defect e is equal 
to the number of p-regular classes of defect e. 
Let B, be a block of defect d. Then there exists 
a p-regular class of defect d containing an 
element a such that g,7,(a)/z;#0 (mod p) for 
any Z, in B,. The defect group D of C(a) is 
called the defect group of B,, and D is uniquely 
determined up to conjugacy in G. The number 
of blocks of G with defect group D is equal to 
the number of blocks of the tnormalizer N,(D) 
with defect group D. 

An arbitrary element x of G can be written 
uniquely as a product x — sr —rs, where s, 
called the p-factor of x, is an element whose 
order is a power of p, and r is a p-regular 
element. We say that two elements of G belong 
to the same section if and only if their p-factors 
are conjugate in G. This is an equivalence 
relation. Obviously, each section is the union 
of conjugate classes of G. If the p-factor of x is 
not conjugate to any element of the defect 
group D of B,, then y,(x)=0 for all Z; in B,. 
Let 91, 95, -> 9, be the absolutely irre- 
ducible modular characters of C, (s), and let y; 
be the absolutely irreducible ordinary charac- 
ters of C,(s). Since 


xi sr) 2 &xi(r) - e damit, reCg(s), 


we have 


x(sr) = H ra (sr) — Y di, 9; (r). 


The dj, are called the generalized decompo- 
sition numbers of G. If the order of s is pl then 
the d;, are algebraic integers of the field of the 
p'th roots of unity. Let s be conjugate to an 
element of D. There corresponds to B, a union 
B, of blocks of Co(s), and if o p, then B, and 
B, contain no irreducible modular represen- 
tations in common. We have d =0 for any 

Z, in B, (ie., o4 B.). Brauer's original proof 
of this result was considerably complicated; 
simpler proofs were given independently by K. 
Iizuka and H Nagao. From these relations we 
get the following refinement of the orthogon- 
ality relations for group characters. If Z; and 
Z; belong to different blocks of G, then 

Laes Xi(4) x(a) 20, where a ranges over all 
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the elements belonging to a fixed section S of 
G. If elements a and b of G belong to different 
sections, then X, y,(a)y,(b !) 20, where y; 
ranges over all the characters of G belonging 
to a fixed block B,. 


J. Projective Representations of Finite Groups 


Let V be a finite-dimensional linear space over 
a field K, and let P(V) be the ?projective space 
associated with V (— 343 Projective Geom- 
etry). The set of all projective transforma- 
tions of P(V) forms the group PGL(V), which 
can be identified with the quotient group 
GL(V)/K*1,. Here K* = K — {0} and K*1, is 
the set of all scalar multiples of the identity 
transformation 1, of V and is the center of 
GL(V). A homomorphism G>PGL(V) is 
called a projective representation of G in V or 
simply a projective representation of G over 
K. Two projective representations (p, V) and 
(o. V’) of G are said to be similar if there exists 
an isomorphism o: PGL(V)— PGL(V’) induced 
by a suitable isomorphism V-— V' such that 
qop(a)oqo '=p'(a) (ae G). Let V; {0} bea 
subspace of V. We can assume that P(V,)c 
P(V). If (p, V) is a projective representation 

of G such that each p(a) (ae G) leaves P(V,) 
invariant, we get a projective representation 
(o, Vi) by restricting the p(a) to P(V,). In this 
case (p,, Vi) is called a subrepresentation of p. 
A projective representation is said to be irre- 
ducible if there exists no proper subrepresen- 
tation of p. 

A mapping e: G— GL(V) is called a section 
for (p, V) if x(a(a)) = p(a) for each ae G, where 
z is the natural projection of GL(V) onto 
PGL(V). Any section c defines a mapping 
f:Gx Go K* satisfying o(a)a(b) = f(a, b)o(ab) 
(a, be G). The set { f(a, b), veg is called the 
factor set of p with respect to c. The mapping 
f is a *2-cocycle of G with values in K*. The 2- 
cohomology class c, € H?(G, K*) of f is deter- 
mined by p and is independent of the choice of 
sections for p. A projective representation p 
has a section o which is a linear representation 
of G in V if and only if c, — 1. If G is a finite 
group, for any ce H?(G, K*) there exists an 
irreducible projective representation p of G 
over K which belongs to c, i.e., c, — c. If p and 
p’ are similar, then c, —c,.. The tensor product 
p € p' of two projective representations p and 
p' can be defined as in the case of linear repre- 
sentations, and we have c gp =€,’ cy. IE K is 
algebraically closed, then H?(G, K*) is deter- 
mined by the characteristic of K. When K is 
the complex field C, the group H?(G, C*) 
M(G) is called the multiplier of G. If Mt(G)=1, 
then G is called a closed group, and any pro- 
jective representation of G is induced by a 
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linear representation of G. In general, if p is a 
projective representation of G over C, then the 
order of c, is a divisor of the degree of p (di- 
mension of V). Moreover, if p is irreducible, 
then both the degree of p and the square of the 
order of c, are divisors of the order of G. K. 
Yamazaki, among others, studied the projec- 
tive representations of finite groups in detail. 


K. Integral Representations 


Every complex matrix representation of G is 
equivalent to a matrix representation in the 
ring of algebraic integers. If an algebraic num- 
ber field K is specified, every K[G ]-module V 
contains G-invariant R-'lattices (briefly, G- 
lattices), where R is the ring of integers in K. 
A G-lattice L is characterized as an R[G]- 
module, which ts finitely generated and *tor- 
sion free (hence *projective) as an R-module. It 
provides an integral representation of G as an 
automorphism group of the R-projective 
module L. 

R[G ]-modules L and M need not be iso- 
morphic even when the K[G]-modules K ® L 
and K & M are isomorphic. The set of G- 
lattices in a fixed K[G]-module V is divided 
into a finite number of R[G]-isomorphism 
classes (Jordan-Zassenhaus theorem). Let p be 
a prime ideal of R and R, be the localization 
of R at p. The study of R,-representations is 
intimately related with modular representation 
theory. For any R[G]-module L there is an 
associated family of R,[G ]-modules L, = 
R, © L, where p ranges over all primes of R. 
G-lattices L and M in a K[G]-module V are 
said to be of the same genus if L, = M, for 
every p. The number of genera of G-lattices in 
V is given by IT, ^, (g = order G), where h, 
denotes the number of R,[G]-equivalence 
classes of R,[G]-lattices in V. When V is ab- 
solutely irreducible, the number of R[G]- 
equivalence classes in a genus equals the (1deal) 
*class number of K (J. M. Maranda and S. 
Takahashi). 

The *Krull-Schmidt theorem, asserting the 
uniqueness of a direct sum decomposition into 
indecomposable R[G]-modules, holds if R is a 
complete discrete valuation ring or if Risa 
discrete valuation ring and K is a splitting field 
of G. The condition for the finiteness of the 
number of nonisomorphic indecomposable G- 
lattices is known. In particular, for R=Z it 
reduces to the requirement that the Sylow p- 
subgroup of G be cyclic of order p or p? for 
every p|g. Regarding projective Z[G ]-modules 
— 200 Homological Algebra G. 

The isomorphism problem, i.e., the question 
of whether the isomorphism Z[G]z Z[H] of 
integral group algebras implies the isomor- 
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phism G =H of groups, has been answered 
affirmatively for certain special cases such 
as tmeta-Abelian groups. 

R[G] is an R-torder in K [G], and in this 
context, a considerable portion of the integral 
representation theory has been extended to 
more general orders in separable algebras [14— 
16]. 
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363 (XX1.40) 
Riemann, Georg Friedrich 
Bernhard 


Georg Friedrich Bernhard Riemann (Septem- 
ber 17, 1826-July 20, 1866) was born the son 
of a minister in Breselenz, Hanover, Germany. 
He attended the universities of Góttingen and 
Berlin. In 1851 he received his doctorate at the 
University of Góttingen and in 1854 became 
a lecturer there. In 1857 he rose to assistant 
professor, and in 1859 succeeded P. G. L. 
tDirichlet as full professor. In 1862 he con- 
tracted tuberculosis, and he died at age 40. 
Despite his short life, his contributions en- 
compassed all aspects of mathematics. 

His doctoral thesis (1851) stated the basic 
theorem on tconformal mapping and became 
the foundation for the geometric theory of 
functions. In his paper presented for the posi- 
tion of lecturer (1854), he defined the tRie- 
mann integral and gave the conditions for 
convergence of trigonometric series. In his 
inaugural lecture in the same year, he dis- 
cussed the foundations of geometry, intro- 
duced n-dimensional manifolds, formulated the 
concept of fRiemannian manifolds, and defined 
their curvature. In his paper of 1857 on tAbel- 
ian functions, he systematized the theory of 
tAbelian integrals and Abelian functions. In 
his paper of 1858 on the distribution of prime 
numbers, he considered the Riemann zeta 
function as a function of a complex variable 
and stated fRiemann's hypothesis concerning 
the distribution of its zeros. It remains for 
modern mathematics to investigate whether 
this hypothesis is correct. In his later years, 
influenced by W. Weber, Riemann became 
interested in theoretical physics. He gave lec- 
tures on the uses of partial differential equa- 
tions in physics that were edited and published 
by H. Weber. 
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364 (VII.3) 
Riemannian Manifolds 


A. Riemannian Metrics 


Let M be a tdifferentiable manifold of class C" 
(1 &r Xo), and g be a *Riemannian metric of 
class CT on M. Then (M, g) or simply M is 
called a Riemannian manifold (or Riemannian 
space) of class C” (— 105 Differentiable Mani- 
folds). The metric g is a fcovariant tensor field 
of order 2 and of class C" !; it is called the 
fundamental tensor of M. Using the value g, of 
g at each point pe M, a positive definite inner 
product g,(X, Y), X, Ye T,, is introduced on 
the ttangent vector space T, to M at p, and 
hence T, can be considered as a fvector space 
over R with inner product that can be identi- 
fied with the Euclidean space E" of dimension 
n — dim M. Utilizing the properties of the space 
E", we can introduce various notions on T, 
and M. (For example, given a tangent vector 
Le T,, we define the length || L| =||L\|,,, of L 
to be the quantity g,(L, LIT. A normal vector 
at a point p of a submanifold N of M is well 
defined as an element of the orthogonal com- 
plement of the subspace T,(N) of T,(M) with 
respect to g,; a differential form of degree 1 is 
identified with a tangent vector field.) A neces- 
sary and sufficient condition for a differenti- 
able manifold M of class C" to have a Rie- 
mannian metric is that M be tparacompact. A 
Euclidean space E" has a Riemannian metric 
expressed by ET, dx' & dx in terms of an 
orthogonal coordinate system (x). 

We assume that M is connected and of class 
C”. A curve x:(a, b] M is called piecewise 
smooth or of class D® if x is continuous and 
there exists a partition of To, b] into finite 
subintervals [t;_,,t;] such that the restrictions 
x|[(t;-1,t,] are timmersions of class C”. The 
length ||x|| of such a curve x is defined to be 
(2 Lt dt, where x'(t) is the tangent vector of 
x defined for almost all values of t. As in a 
Euclidean space, the length (xl is independent 
of the choice of parameter t, and the concepts 
of tcanonical parameter and orientation of x 
can be defined (— 111 Differential Geometry 
of Curves and Surfaces). A function d: M x M 
—> (0, oo) is defined so that the value d(p, q), p, 
qe M, is the infimum of the lengths of curves of 
class D^? joining p and q. The function d is a 
distance function on M, and the topology of 
M defined by d coincides with the original 
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topology of M. There exists an essentially 
unique structure of a Riemannian manifold on 
(real or complex) elliptic or thyperbolic space 
(— 285 Non-Euclidean Geometry), and d is 
the distance function of these spaces. 

If there exists an immersion @ of a differenti- 
able manifold N in a Riemannian manifold 
(M, ai then a Riemannian metric Q*g is de- 
fined on N by the tpullback process (ILL 
lido (L)1,). (For example, a submanifold and 
a 'covering manifold of M have Riemannian 
manifold structures induced by the natural 
mappings (— 365 Riemannian Submanifolds).) 
If M — E? and N is a 2-dimensional submani- 
fold of M, then o*g is the tfirst fundamental 
form of N. Assume further that ¢ is a diffeo- 
morphism and N has a Riemannian metric 
h. If 9*g =h, then (N, h) is said to be isometric 
to (M, g), and ¢ is called an isometry. The set 
I(M) of all isometries (isometric transforma- 
tions) of M onto M is a group. À necessary 
and sufficient condition for a mapping y: 
NM to be an isometry is that d,(p,g)= 
dr (p), v(q)), p. qe N. In particular, I(E") is 
the *congruent transformation group. 

If a differentiable manifold M is the product 
manifold of Riemannian manifolds (M,, g,) 
and (AM, g5), then (M, zT g, - 13g;) is called 
the Riemannian product of M, and M,, where 
Ta t — 1, 2, are projections from M to M,. 

Let F be the ftangent n-frame bundle over 
M and B= B,(M) be the subset of F consisting 
of all orthonormal frames with respect to g. 
Then B is an O(n)-subbundle of F of class C”, 
called the tangent orthogonal n-frame bundle 
(or orthogonal frame bundle). In this way we 
get a one-to-one correspondence between the 
set of all O(n)-subbundles of F and the set of 
all Riemannian metrics of M. 


B. Riemannian Connections 


There exists a unique taffine connection in the 
orthogonal frame bundle B whose ftorsion 
tensor is zero. This connection is called the 
Riemannian connection (or Levi-Civita connec- 
tion; — 80 Connections K). Let V denote the 
*covariant differential operator defined by this 
connection (— 80 Connections, 417 Tensor 
Calculus). (For a vector field X, the covariant 
differential operator V, acts on any tensor field 
T defined on a submanifold having X as a 
tangent vector field.) The covariant differen- 
tial Vg of the fundamental tensor g vanishes 
identically. The tconnection form of the Rie- 
mannian connection is expressed by n? differ- 
ential {-forms (0j), un on B, and we have 

wj + €] - 0. Let (w) <i<n be the canonical 
1-forms on B. Then (o), en together with 
(o) give rise to an absolute parallelism on 

B (that is, they are linearly independent at 
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every point). Let (0°) and (6/) be the corre- 
sponding set of differential 1-forms on the 
orthogonal frame bundle B, over another 
Riemannian manifold N with dim N — dim M. 
If there exists an isometry y: MN, then the 
differential dy is a diffeomorphism from B= 
By to By, and we get (dy/)*(0*) = o, (dy/)*(07) = 
o. Conversely, if there exists a diffeomorphism 
Y: By — By satisfying V*(0*) = c, V*(07) = 

œj and M is torientable, then there exists an 
isometry y: MN such that dy 2 V holds on 
a connected component B, of B. Moreover, y 
is uniquely determined if we choose one Bo. In 
this way the problem of the existence of an 
isometry from M may be reduced to one of the 
existence of a diffeomorphism from B preserv- 
ing absolute parallelism (as well as the order of 
the basis (c, w/)). 

According to the general theory of affine 
connections, the Riemannian connection on M 
determines a tCartan connection uniquely 
with E" — I(E")/O(n) as tfiber, which is called 
the Euclidean connection. As a consequence, 
every tangent vector space T, (M) is regarded 
as a Euclidean space E}, and for a given curve 
x:[a, b] ^ M of class D? and for te[a, b] there 
exists an isometry I, ,: E: Ee Satisfying the 
following three conditions (we denote I, , by 
Ij): (1) If x is a composite of two curves y and 
z, then I, — 1, I. (2) Differentiability: If x is of 
class C? at to, then tT, , is of class C? at to. 
(3) I, depends on the orientation of x but not 
on the choice of its parameter. The develop- 
ment x of x is the curve in Eža defined by x(t) 
= I, ,(x(t)), and we get It = [|x|]. (1, is some- 
times called the development along x.) Utilizing 
the concept of development, the theory of 
curves in E" can be used to study curves on M 
(^ 111 Differential Geometry of Curves and 
Surfaces). For example, if x is a segment, then 
x or x([a, b]) is called the geodesic arc (— 80 
Connections Lj; the tFrenet formula is auto- 
matically formulated and proved. The rotation 
part IË of I, (the composite of I, and the paral- 
lel displacement of E, translating I,(x(b)) to 
x(a)) is regarded as an isomorphism of the 
inner product space Tyo) to Tota): IF is ex- 
tended to an isomorphism of the ttensor alge- 
bra F (1,4) to 7 (Taa), which is denoted by 
the same symbol Z? and called the parallel 
displacement or parallel translation along x. 
Given a tensor field K on M, we have Vya K 
= [dI£.(K (x(t)))/dt],-,,. In particular, a neces- 
sary and sufficient condition for VK =0 is that 
IF(K (x(b))) = K (x(a)) for any x, in which case 
K is said to be parallel. 


C. Exponential Mapping (— 178 Geodesics) 


À curve x on M or the image of x is called a 
geodesic if any subarc x |[a, b] of x is a geo- 
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desic arc. Let N(S) be the normal bundle of a 
submanifold S of M, that is, the differentiable 
vector bundle over S consisting of all normal 
vectors at all points of S. Then S is contained 
in N(S) as the set of zero vectors at all points 
of S. There exist a neighborhood U of S in 
N(S) and a mapping Exp,: U— M of class C? 
with the following property: There exists a 
geodesic arc x with the initial tangent vec- 

tor LeU, length ||x|| = ||L||, and final point 
Exps(L). Let Uş be the largest U with this 
property. Then Exp,: U;— M is determined 
uniquely by S. The mapping Exp; is called 
the exponential mapping on S. If the rank of 
the Jacobian matrix of Exp; is less than n at 
Let, then L or Exps(L) is called the focal 
point of S on the geodesic s Exps(sL) (0x s, 
SL € Us). If S is compact, then S has an open 
neighborhood V; in N(S) satisfying the follow- 
ing three conditions: (i) V; c Us; (ii) ||L|| = 
d(Exps(L), S) for Le Vs, where the right-hand 
member expresses the infimum of the distance 
between the point Exp,(L) and points of S; (iii) 
the restriction Exps| V; is an embedding. The 
image Exps(V,) is the tubular neighborhood of 
S. In the special case where S consists of only 
one point p, N(( pj) coincides with the tangent 
vector space T,(M), and the focal point of p is 
called the conjugate point of p, given as the 
zero point of the fJacobi field (— 178 Geo- 
desics, 279 Morse Theory). In this case, V, is 
denoted by V,. If T, is identified with R” (or 
E") by means of an orthonormal basis of T,, 
then (Exp,) ! defined on Exp, (V,) is a coordi- 
nate mapping, called the normal coordinate 
mapping. Furthermore, Exp,(V,) contains a 
neighborhood W, of p such that there exists a 
unique geodesic arc x joining any two points q 
and r of W, with ||x|| =d(q,r) and contained in 
W,. W, is called a convex neighborhood of p. 


D. Curvature 


The set of differential 1-forms (wf, œj), by 
means of which absolute parallelism is given in 
the orthogonal frame bundle B of M, satisfies 
the tstructure equation do! — — 2oj ^ w/, 

doj = — ag ^ o +, and (O$) is called the 
curvature form of the Riemannian connection 
of M. This form ts expressed by a tensor field 


. R(— 80 Connections; 417 Tensor Calculus ) of 


type (1, 3) on M, called the curvature tensor; if 
Ri, are the components of R with respect to 
an orthonormal frame be B of the tangent 
vector space T, of M, then Q§=(1/2)X Ripo" ^ 
c)! at b. Let (X, Y) be an orthonormal basis of 
a 2-dimensional subspace P of T,. Then the 
inner product K,(P) of X and R(X, Y)Y is 
determined by P independently of the choice 
of the basis (X, Y), where the i-component of 
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R(X, Y)Z with respect to the basis b of T, is 
given by X Ry, Z/X*Y'. K,(P) is the "Gauss: 
ian curvature of the surface Exp,(V, P) and 
is called the sectional curvature (or Riemannian 
curvature) of P. The curvature tensor R is 
uniquely determined by the function K,(P) 

of p and P. If dim M 2 3 and if at every point p 
of M, K,(P) has a constant value M, indepen- 
dent of the choice of P, then M, is a constant 
independent of the choice of p (F. Schur). If 

K (P) is constant, then M is called a space of 
constant curvature. If VR —0, then M is called 
a locally symmetric space (— 412 Symmetric 
Riemannian Spaces and Real Forms; 413 
Symmetric Spaces). In a local sense, Riemann- 
ian metrics of these spaces are uniquely deter- 
mined by the curvature tensor R up to a con- 
stant factor. If M is of constant curvature K, 
complete, and simply connected, then M is 
isometric to E", the sphere (which is the uni- 
versal covering Riemannian manifold of a real 
telliptic space), or a real thyperbolic space 
according as K is 0, positive, or negative. The 
compact spaces of positive constant curvature, 
that is, the Riemannian manifolds having the 
sphere as the universal covering Riemannian 
manifold, were completely classfied by J. A. 
Wolf [1]. A complete, simply connected, 

and locally symmetric space is a *symmetric 
Riemannian space. The Ricci tensor (R;;) is 
defined by R;;2 — X, Ri. Let Q be the qua- 
dratic form on T, given by (R,,). Then the 
value Q(L) for a unit vector Le T, is the mean 
of K (P) for all sections P (2-dimensional 
subspaces of T,) containing L and is called the 
Ricci curvature (or mean curvature) of the 
direction L at p. The mean R of Q(L) for all 
the unit vectors L at p is called the scalar 
curvature at p (— 417 Tensor Calculus). Q(L) 
and R are expressed by Q(L) — Xg,(R(X;, L)L, 
Xj and R 2 X; Q(Xj), up to positive constant 
factors, in terms of an orthonormal basis (X;) 
of T,. If the Ricci tensor of M is a scalar mul- 
tiple of the fundamental tensor, then M is 
called an Einstein space. (When dim M 23, this 
scalar is constant.) If M is a *Kühler manifold 
and P is restricted to a complex plane (in- 
variant under the almost complex structure), 
then K(P) is called the holomorphic sectional 
curvature. A Kahler manifold M of constant 
holomorphic sectional curvature is locally 
isometric to a complex Euclidean space, ellip- 
tic space, or hyperbolic space. 

The properties of the sectional curvature 
and the Ricci curvature are closely related to 
the behavior of geodesics of Riemannian mani- 
folds, and these properties reflect those of the 
topological structures of the manifolds (— 178 
Geodesics). The compact simply connected 
homogeneous Riemannian manifolds of strict- 
ly positive sectional curvature have been 
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classified [2-4]. Related to algebraic geome- 
try, as the solution of the Frankel conjecture, 
the following holds: If a compact Káhler mani- 
fold has strictly positive sectional curvature, 
then it is biholomorphic to the complex pro- 
jective space [5,6] (— 232 Kahler Manifolds). 
Furthermore, curvature tensors are related to 
*characteristic classes. For example, we have 
the Gauss-Bonnet formula: If M is an even- 
dimensional compact and oriented Riemann- 
ian manifold, the integral of a, K,,c on M is 
equal to the *Euler-Poincaré characteristic, 
where 


a, =n!/(2" x"? (n/2)!), 


0 is the volume element of M, and Kn is 
defined as follows: For a positive even num- 
ber s, Kq, is a real-valued function of the s- 
dimensional subspaces P of the tangent vector 
spaces T, of M, which is given by 


Kqg(P)— bie Ap AC Baba KOR KO 


cR OE 


íi XX, o X5 
in terms of an orthonormal basis (X,, ..., X;) of 
P, where b, —( —1)/? (23? s!), © is summation 
over all pairs of s-tuples satisfying {i,,...,i,', 
(haelt, A n) 6. i is the sign of 
(iisi), 3 is the inner product in T, with 
respect to g,, R, is the value of the tensor R 

at p, and R,(X;, X;) X, is as already defined 

at the beginning of this section. In particular, 
Kack If K of a compact and orientable 

M is constant for a certain s, then the kth 
*Pontryagin class of M (with real coefficients) 
vanishes for all k > s/2. 


E. Holonomy Groups 


Let p be a fixed point of M, and let Q, be the 
set of all closed oriented curves of class D^ 
with initial and final points p and with para- 
meters neglected. The set H = (1,|xeQ,j, 
called the holonomy group of M, is a subgroup 
of I(T,) (T, is identified with E") independent of 
the choice of p (— 80 Connections), and x], 
is a homomorphism from Q, to H. The restric- 
tion Hy of this homomorphism to all closed 
curves homotopic to zero is called the re- 
stricted holonomy group. The rotation part h of 
H, called the homogeneous holonomy group, is 
a subgroup of the orthogonal group O(n) of 
T,. The rotation part hy of Ho, called the re- 
stricted homogeneous holonomy group, is a 
connected component of h and a tcompact Lie 
group. The ?Lie algebra of hy is spanned by 
MARX, Y))Ix:[a, b] is of class D”, 
x(a)— p, and X, Ye Tomp where R,4(X, Y) is 
the endomorphism of the linear space TA 
defined in Section D. 
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If M — E", then H — (ej, where e is the iden- 
tity element. If h= (e) (ho — (e]), then M is 
called flat (locally flat) (C 80 Connections E). 
Local flatness is equivalent to M being locally 
isometric to E". If M is complete and H (re- 
garded as a transformation group of E") has a 
fixed point, then M is isometric to E". Any 
finite rotation group h is the homogeneous 
holonomy group of some locally flat and com- 
pact Riemannian manifold. 

With respect to the linear group h of T,, 
we get a unique decomposition T,= Vi ® 
Va, 9 ... ® Vn of mutually orthogonal sub- 
spaces, where Vo (dim Vo 2 0) consists of all 
h-invariant vectors and Vj, i— 1, ...,r, are 
irreducible h-invariant subspaces. If h or ho is 
irreducible (reducible) on T,, then M is called 
irreducible (reducible). If M is complete and 
simply connected (hence h= ho), then M is the 
Riemannian product of closed submanifolds 
Ma, 4 —0,1, ..., r, satisfying Va = T(M,). 
This decomposition M = TT M,, is determined 
uniquely by M and called the de Rham de- 
composition of M [7]. In this case h is the 
direct product of closed subgroups hi, where 
every ba acts on Vig), 8 Aa as the identity, 
and can be regarded as the homogeneous 
holonomy group of M,,). If ho is irreducible 
and M is not locally symmetric, then h, acts 
ttransitively on the unit sphere of T,. The 
classification of possible candidates for such 
ho has been made [8,9]. For example, if n 
is even and hy is the tunitary group U (n/2), 
then h acts transitively on the unit sphere. A 
necessary and sufficient condition for h to be 
contained in U(n/2) is that M have a tcom- 
plex structure and the structure of a Kahler 
manifold. 

The group h acts naturally on the ttensor 
algebra 7 (1,) of T,. If a tensor field A on M is 
parallel, then A, is invariant under h. Con- 
versely, if Ao € 7 (T,) is invariant under h, there 
exists a unique parallel tensor field A satisfying 
A,= Ag. The orthogonal frame bundle B is 
treducible to the h-bundle. 


F. Transformation Groups 


The group I(M) consisting of all isometries of 
M with the tcompact-open topology is a tLie 
transformation group. The isotropy subgroup 
at any point is compact. In particular, if M is 
compact, so is [(M). The differential dq of 
@eI(M) is a transformation of the orthogonal 
frame bundle B. If b, is a fixed point of B, then 
the mapping f defined by o—dqo(b;) embeds 
I(M) as a closed submanifold of B, and the 
differentiable structure of I(M) is thus deter- 
mined. If f is surjective, it follows from the 
structure equation that M is of constant curva- 
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ture and equals E", a real *hyperbolic space, or 
a real felliptic space (or a sphere). A necessary 
and sufficient condition for the image of fi to 
be a subbundle of B is that I(M) be transitive. 
If the image of fj contains the h-bundle, then 
M is a symmetric space. If M is compact and 
I(M) is transitive, then the image of f is con- 
tained in the h-bundle (— 191 G-Structures). If 
I(M) is transitive on M, then M is complete 
and is the thomogeneous space of I(M). Con- 
versely, a homogeneous space M — G/K of 

a Lie group G by a compact subgroup K has 
a Riemannian metric invariant under G. In 
general, an element of (M) preserves quan- 
tities uniquely determined by the Riemannian 
metric g, such as the Riemannian connection, 
its curvature, the set of all geodesics, etc. Fur- 
thermore, any element of I(M) commutes with 
V and the tLaplace-Beltrami operator. If M is 
compact and oriented, then the connected 
component I,(M) of I(M) preserves any thar- 
monic differential form. If M is complete and 
simply connected, then J,(M) is clearly decom- 
posed into a direct product by the de Rham 
decomposition of M. An element of the Lie 
algebra of I(M) is regarded as a vector field X 
on M, called the infinitesimal motion, which 
satisfies the equation Lyg —0; that is, V;č; + 
V,¢;=0, where Ly denotes tLie derivation 

and the č; are tcovariant components of X 
with respect to a natural frame (0/0X;), i= 
1,...,n(— 417 Tensor Calculus). This equa- 
tion is called Killing’s differential equation, 
and a solution X of this equation is called a 
Killing vector field. The set of all Killing vec- 
tor fields is a Lie algebra of finite dimension 
(<dim B). If M is complete, then this Lie alge- 
bra coincides with that of I(M). If M is com- 
pact and the Ricci tensor is negative definite, 
then I(M) is discrete. If, furthermore, the sec- 
tional curvature is nonpositive, then an iso- 
metry of M homotopic to the identity trans- 
formation is the identity transformation itself. 

It is known that dim I(M)<n(n+ 1)/2 if 
dim M =n, and the maximum dimension is 
attained only when M is a space of constant 
curvature. For Riemannian manifolds with 
large [(M), extensive work on the structures of 
M and I(M) has been done by I. P. Egorov, S. 
Ishihara, N. H. Kuiper, L. N. Mann, Y. Muto, 
T. Nagano, M. Obata, H. Wakakuwa, K. 
Yano, and others [10, 11]. 

The fixed point set of a family of isometries 
has interesting differential geometric properties 
[10]. For example, let G be any subset of I(M) 
and F the set of points of M which are left 
fixed by all the elements of G. Then each con- 
nected component of F is a closed ttotally 
geodesic submanifold of M. If M is compact 
and f is an isometry of M, then A, — x(F), 
where A, denotes the tLefschetz number and 
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y(F) the tEuler characteristic of the fixed point 
set F of f. As for the existence of fixed points 
of an isometry, the following are known: Let f 
be an isometry of a compact, orientable Rie- 
mannian manifold M with positive sectional 
curvature. If dim M is even and f is orientation 
preserving, or if dim M is odd and f is orienta- 
tion reversing, then f has a fixed point. In the 
case of nonpositive curvature, the following is 
basic: Every compact group of isometries of a 
complete, simply connected Riemannian mani- 
fold with nonpositive sectional curvature has a 
fixed point (E. Cartan). If a compact, orien- 
table Riemannian manifold admits a fixed- 
point-free 1-parameter group of isometries, 
then its *Pontryagin numbers vanish. 

On a Riemannian manifold M, a transform- 
ation of M which preserves the Riemannian 
connection, or equivalently which commutes 
with covariant differentiation V is called an 
affine transformation. Let A(M) denote the 
group of all affine transformations of M. A 
transformation preserving the set of all geo- 
desics is called a projective transformation. Let 
P(M) denote the group of all projective trans- 
formations of M. A transformation preserv- 
ing the angle between tangent vectors is called 
a conformal transformation. Let C(M) de- 
note the group of all conformal transforma- 
tions. They are Lie transformation groups 
with respect to suitable topologies. Clearly, 
I(M)c A(M)c P(M), I(M) c C(M) (— 191 G- 
Structures). 

Aal ML the connected component of A(M), is 
decomposed into a direct product according to 
the de Rham decomposition of M when M is 
complete and simply connected (J. Hano). If 
M is complete and irreducible, then A(M)= 
I(M) except when M is a 1-dimensional Euclid- 
ean space. If M is complete and its restricted 
homogeneous holonomy group ho leaves no 
nonzero vectors, then A,(M)= b(M). If M is 
compact, then A9(M) — I,(M) always. 

If M is complete and has a parallel Ricci 
tensor, then the connected component P4(M) 
= Aal ML, unless M is a space of positive con- 
stant sectional curvature (n > 2) (Nagano, N. 
Tanaka, Y. Tashiro). If M is compact, simply 
connected, and has constant scalar curvature, 
then P,(M) — I,(M), unless M is a sphere (n> 2) 
(K. Yamauchi). 

Similarly to the case of P(M), it is known 
that if M is complete and has a parallel Ricci 
tensor, then the connected component C,(M) 
= (M), unless M is a sphere (n> 2) (Nagano). 
A conformal transformation remains con- 
formal if the Riemannian metric g is changed 
conformally, namely, to e/g, f being any 
smooth function on M. A subset of C(M, g) is 
called essential if it cannot be reduced to a 
subset of I(M,g) for any metric g conformal to 
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g. When M is compact, C(M) or Co(M) is 
essential if and only if it is not compact. If 
Co(M) is essential, then M is conformally 
diffeomorphic to a sphere or a Euclidean space 
(n7 2) [12-15]. When M is compact and has 
constant scalar curvature and Co (M) z (M), 
sufficient conditions for M to be isometric to a 
sphere have been obtained by S. I. Goldberg 
and S. Kobayashi, C. C. Hsiung, S. Ishihara, A. 
Lichnerowicz, Obata, S. Tanno, Tashiro, K. 
Yano, and others. For example, if C,(M) is 
essential, then M is a sphere [14]. In general, 
however, there are compact Riemannian mani- 
folds with constant scalar curvature for which 
Co(M) z I5 (M) (N. Ejiri). 


G. Spheres as Riemannian Manifolds 


A Euclidean n-sphere S" (n z 2) has the prop- 
erties of a Riemannian manifold. It is a space 
of positive constant sectional curvature 1/r? 

(r = radius) with respect to the natural Riemann- 
ian metric as a hypersurface of the Euclidean 
(n+ 1)-space E"*!. A sphere is characterized by 
the existence of solutions of certain differential 
equations on a Riemannian manifold. On a 
unit sphere S” in E"*!, the eigenvalues of the 
*Laplace-Beltrami operator A on smooth 
functions are given by 0« 4, <... « A, « ..., A, 
— k(n-- k — 1). It is known that eigenfunctions 
f corresponding to å}, Af — 4, f, are the restric- 
tions to S" of harmonic homogeneous poly- 
nomial functions F of degree k on EST. On a 
compact Riemannian manifold M, if the Ricci 
curvature of M is not less than that of S", then 
the first eigenvalue 4, of A on M satisfies 4, > 
A, =n [16]. Conversely, under the same as- 
sumption on the Ricci curvature, if 4, =n, 
then M is a sphere (Obata). On the other 
hand, if g is the standard metric on S”, then 
Af — nf is equivalent to the system of differen- 
tial equations 


VVS + fgj; — 0. (E) 


A complete Riemannian manifold M (nz 2) 
admits a nontrivial solution of (E,) if and only 
if M is a sphere (Obata, Tashiro). In general, 
the restriction f to S" of a harmonic homogene- 
ous polynomial of degree k satisfies Af — 

k(n -- k — 1)f as well as a certain system (E,) 

of differential equations of degree k + 1 involv- 
ing the Riemannian metric. For example, 


V,VjV;f 29; V.f + ga Vf + gy; V; f =9. (E2) 


If a complete Riemannian manifold M admits 
a nontrivial solution of (E,) for some integer 
k 2:2, then M is locally isometric to a sphere 
(Obata, Tanno, S. Gallot [17]). The gradient of 
a solution of (E,) is an infinitesimal conformal 
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transformation and that of (E,) is an infini- 
tesimal projective transformation. 

As the Kahler or quaternion Kahler version 
of (E2), there is a system of differential equa- 
tions characterizing the complex projective 
space or the quaternion projective space as a 
Káhler manifold (Obata, Tanno, D. E. Blair, 
Y. Maeda). 

On a sphere, a Riemannian metric which is 
conformal to the standard metric and has the 
same scalar curvature as the standard one is 
always standard, namely, it has a positive 
constant sectional curvature [14]. 


H. Scalar Curvature 


On a 2-dimensional Riemannian manifold M, 
the sectional curvature, the Ricci curvature, 
and the scalar curvature ail coincide with the 
tGaussian curvature, which is a function on 
M. If M is compact, by the tGauss-Bonnet 
theorem the Gaussian curvature K of M must 
satisfy the following sign condition in terms of 
the fEuler characteristic y(M): 

if y(M)>0, then K is positive somewhere; 

if y(M)=0, then K changes sign unless it is 
identically zero; 

if Y(M) <0, then K is negative somewhere. 
This sign condition is also sufficient for a given 
function K to be the Gaussian curvature of 
some metric on M. More precisely, starting 
with a Riemannian metric with constant Gaus- 
sian curvature, one can say that a smooth 
function K is the Gaussian curvature of some 
metric conformally equivalent to the original 
metric if and only if K satisfies the foregoing 
sign condition [18]. 

H. Yamabe [19] announced that on every 
compact Riemannian manifold (M, g) of di- 
mension n > 3, there exists a strictly positive 
function u such that the Riemannian metric 
gd—u^*" ?g has constant scalar curvature. N. 
S. Trudinger, however, pointed out that his 
original proof contains a gap in some cases. 
The problem reduces to the following non- 
linear partial differential equation on a com- 
pact manifold M: 





Rue 200-72 4" 


Au+ Ru, 
aE a u 


where R is the scalar curvature of g and R a 
constant which should be the scalar curva- 
ture of g —u* "^ 26 (— 183 Global Analysis). 
Nevertheless, Yamabe's original proof can be 
pushed to cover a large class of metrics with 
Íu RdM <0. Furthermore, it has since been 
solved for a wider class: namely, if M is not 
conformally flat and n 2 6, or if it is conform- 
ally flat and its fundamental group is finite, 
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then the problem has been solved affirmatively 
[20]. 

On the other hand, any smooth function on 
a compact manifold M of dimension n z 3 that 
is negative somewhere 1s the scalar curvature 
of some metric on M. In particular, on a com- 
pact manifold (n> 3) there always exists a 
Riemannian metric with constant negative 
scalar curvature [18]. Any smooth function 
can be the scalar curvature if and only if M 
admits a metric of constant positive scalar 
curvature. The foregoing results show that 
there is no topological obstruction to the 
existence of metrics with negative scalar curva- 
ture of a compact manifold of dimension n 3. 

For positive scalar curvature, there is a 
topological obstruction. A compact tspin 
structure (spin manifold) having nonvanishing 
+4-genus cannot carry a Riemannian metric of 
positive scalar curvature. The existence of such 
a manifold has been shown. If a compact, 
simply connected manifold M of dimension 
nz 5 is not a spin manifold, then there exists a 
Riemannian metric of positive scalar curva- 
ture. Furthermore, if M is a spin manifold and 
spin tcobordant to M’ with positive scalar 
curvature, then M carries a Riemannian metric 
of positive scalar curvature [22]. À torus T" 
cannot carry a metric of positive scalar curva- 
ture. In fact, any metric of nonnegative scalar 
curvature on T” must be flat [22]. 

Let K, and R, denote the sectional curva- 
ture and the scalar curvature, respectively, of a 
Riemannian metric g. Then the following are 
known for a compact manifold M of dimen- 
sion >3: If M carries a metric g with K,« 

0, then it carries no metric with R >0. If M 
carries a metric g with K, <0, then it carries 
no metric with R 20. If M carries metrics g4, 
g2 with K, <0 and R, 20, then both metrics 
are flat [22]. 

If the assignment of the scalar curvature to a 
Riemannian metric is viewed as a mapping of 
a space of Riemannian metrics into a space of 
functions on a manifold M, then locally it is 
almost always surjective when M is compact 
(A. E. Fischer and J. E. Marsden, O. Koba- 
yashi, J. Lafontaine). 


I. Ricci Curvature and Einstein Metrics 


In this paragraph the manifolds under con- 
sideration are assumed to be of dimension 

n> 3. The Ricci tensor (R,,) is a symmetric 
tensor field of type (0, 2) on a Riemannian 
manifold. The problem of finding a Riemann- 
ian metric g which realizes a given Ricci 
tensor reduces to the one of solving a system 
of nonlinear second-order partial differential 
equations for g. The Bianchi identity (— 417 
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gv, R, R0 

must be satisfied. There is a symmetric (0, 2)- 
tensor on R" which cannot be the Ricci tensor 
for any Riemannian metric in a neighborhood 
of Oc R". However, if a C* (or C?) symmetric 
tensor field (R;,) of type (0, 2) is invertible at a 
point p, then in a neighborhood of p there 
exists a C” (or C?) Riemannian metric g such 
that (R;;) is the Ricci tensor of g [24]. 

The positivity of the Ricci curvature on a 
Riemannian manifold puts rather strong re- 
strictions on the topology of the manifold 
(— 178 Geodesics). However, nonnegative 
Ricci curvature and positive Ricci curvature 
are not too far from each other. If, on a com- 
plete Riemannian manifold M with nonnega- 
tive Ricci curvature, there is a point at which 
the Ricci curvature is positive, then there 
exists a complete metric on M with positive 
Ricci curvature [25-27]. 

If a Riemannian manifold (M, g) is an Ein- 
stein space, then g is called an Einstein metric 
on the manifold M. Let v, denote the volume 
element determined by g. When M is compact, 
M denotes the space of Riemannian metrics 
on M with total volume 1. The integral of the 
scalar curvature 4(g) =|, R,v, is a functional 
on M. The critical points of 4 are Einstein 
metrics (D. Hilbert). Let Æ, (€= M) denote the 
space of metrics with constant scalar curva- 
ture. Then if 4 is restricted to æ, then the 
*nullity and *coindex at the critical point are 
finite [28, 29]. 

An Einstein metric is always real analytic in 
some coordinate system. In particular, if two 
simply connected Einstein spaces have neigh- 
borhoods on which metrics are isometric, then 
they are isometric [30]. Though S” with stan- 
dard Riemannian metric is a typical example 
of an Einstein space, S^**? (k > 1) carries an 
Einstein metric that is not standard [31]. 
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A. Introduction 


If an timmersion (or an fembedding) f ofa 
tRiemannian manifold (M, g) into a Riemann- 
ian manifold (M, g) satisfies the condition f *g 
=g, then f is called an isometric immersion 
(or embedding) and M is called a Riemannian 
submanifold of M. In this article, /(M) will be 
identified with M except where there is danger 
of confusion. Suppose dim M — and dim M — 
n+p. Then the tbundle F(M) of orthonormal 
tangent frames of M, the bundle F,(M) of 
orthonormal normal frames of M, and their 
*Whitney sum F(M)@ F,(M) are tprincipal 
fiber bundles over M with tstructure groups 
O(n), O(p), and O(n) x O(p), respectively. These 
are subbundles of the restriction to M of the 
bundle F(M) of orthonormal frames of M. The 
vector bundles associated with F(M), F,(M), 
and F(M) @ F,(M) are, respectively, the ttan- 
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gent bundle T(M), the tnormal bundle v(M), 
and their Whitney sum T(M) @ v(M). 


B. General Results for Immersibility 


An n-dimensional real analytic Riemannian 
manifold can be locally isometrically embed- 
ded into any real analytic Riemannian mani- 
fold of dimension n(n + 1)/2 (M. Janet (1926), 
E. Cartan (1927)). The generalization to the 
C^ case is an open question even when the 
ambient space is Euclidean. 

An n-dimensional compact C’ Riemannian 
manifold (3 <r < oo) can be isometrically em- 
bedded into an (n(3n + 11)/2)-dimensional 
Euclidean space (J. F. Nash (1956)). An n- 
dimensional noncompact C” Riemannian 
manifold (3 <r € oo) can be isometrically em- 
bedded into a 2(2n + 1) (3n + 7)-dimensional 
Euclidean space (Nash (1956), R. E. Greene 
(1970)). 

Let M be an n-dimensional Riemannian 
manifold with tsectional curvature Ky and M 
an (n+ p)-dimensional Riemannian manifold 
with sectional curvature K g. Then M cannot 
be isometrically immersed into M in the fol- 
lowing cases: 

(1) p<n—2 and K,, < K g (T. Otsuki (1954)); 
(2) p<n—1, Ky x Kg <0, M is compact, 

and M is complete and simply connected 

(C. Tompkins (1939), S. S. Chern and N. H. 
Kuiper (1952), B. O'Neill (1960)); 

(3 p<n—1, Ky «0, Kg is constant (<0), M is 
compact, and M is complete and simply con- 
nected [2]. 


C. Fundamental Equations 


Let TOM, g) (M, d) be an isometric immer- 
sion. Let V and V denote the tcovariant differ- 
entiations with respect to the *Riemannian 
connections of M and M, respectively. For 
vector fields X and Y on M, the tangential 
component of V, Y is equal to V, Y. Put 


o(X, Y) V, Y — V, Y. (1) 


Then o is a v(M)-valued symmetric (0, 2) tensor 
field on M, which is called the second funda- 
mental form of M (or of f). For a normal 
vector à at xe M, put g(A-X, Y)— ó(o(X, Y), ¢). 
Then A, defines a symmetric linear transfor- 
mation on T,(M), which is called the second 
fundamental form in the direction of Z The 
eigenvalues of A, are called the principal curva- 
tures in the direction of &. The connection on 
v(M) induced from the Riemannian connec- 
tion of M is called the normal connection of 

M (or of f). Let V+ denote the covariant differ- 
entiation with respect to the normal con- 
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nection. For a tangent vector field X and a 
normal vector field č on M, the tangential 
(resp. normal) component of V, £ is equal to 
—A,X (resp. V; €), that is to say, the relation 


VyE= — AX + Vy (2) 


holds. (1) is called the Gauss formula, and (2) is 
called the Weingarten formula. 

Let R, R, and R+ be the tcurvature tensors 
of V, V. and V-, respectively. Then the tinte- 
grability condition for (1) and (2) implies 


R(X, Y)Z=R(X, Y)Z  Acx a Y — Aan z X 
*(Vxa)( Y, Z)—(Vyo)(X, Z) (3) 


for vector fields X, Y, Z tangent to M, where 
V' denotes covariant differentiation with re- 
spect to the connection in T(M) Q v(M). (3) is 
called the equation of Gauss and Codazzi. 
More precisely, the tangential component of 
(3) is given by the equation of Gauss and the 
normal component of (3) is given by the equa- 
tion of Codazzi. Similarly, for vector fields é 
and y normal to M, the relation 


g(R(X, Y)é, n - (R^ (X, Y)En) 
—g([Az,A,]X, Y) (4) 


holds, which is called the equation of Ricci. 
Formulas (1)-(4) are the fundamental equa- 
tions for the isometric immersion f: M M. 

As a particular case, suppose M is a tspace 
form of constant curvature c. Then the equa- 
tions of Gauss, Codazzi, and Ricci reduce 
respectively to 


R(X, Y)Z —c(g(Y, Z)X — g(X, Z) Y) - Aan. z X 


— Ácix.z) Y, (3, 
(Vxo)( Y, Z)—(Vyo)(X, Z)=0, (394 
g(R^ (X, Y)¢, n =g LA: A,] X, Y). (4) 


Conversely, let (M, g) be an n-dimensional 
simply connected Riemannian manifold, and 
suppose there is given a p-dimensional *Rie- 
mannian vector bundle v(M) over M with 
curvature tensor R+ and a v(M)-valued sym- 
metric (0, 2) tensor field c on M. For a tcross 
section ¢ of v(M), define Az by g(4, X, Y)— 
(c(X, Y), £», where < , > is the fiber metric 
of v(M). If they satisfy (3.),, (3,),, and (4,), then 
M can be immersed isometrically into an (n + 
p)-dimensional complete and simply connected 
space form M"*?(c) of curvature c in such a 
way that v(M) is the normal bundle and o is 
the second fundamental form. Moreover, such 
an immersion is unique up to an tisometry of 
M" (c). 

Let (e 4), <4<n+p be a local cross section of 
F(M) such that its restriction to M gives a 
local cross section of F(M)@ F,(M), and let 
(c^) be its dual. Then f*w*=0 forn+1<a< 
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n+p. Let (2$) <4.B<n+p and (D$), «4 pcs, 

be the *connection form and the 'curva- 

ture form of M with respect to (e,), and put 
cg = f *àg. Then (oj), <i j<n is the con- 
nection form of M with respect to (ej), <j<n- 
(207) 1 <icn<acntp gives the second fundamental 
form, that-ts, 


ole, e) —). e (eje,. (1) 
Put of =È bon), Then (hj) is the matrix repre- 
senting the symmetric linear transformation 
Ae, with respect to (ej), that is, 

A,,e; — ) hie). (2^) 


Uy 


Moreover, (@g)n+1 <a.p<n+p 1$ the connec- 
tion form of the normal connection with re- 
spect to (€,),41 <a<ntp: Let (05, <i,j<n and 
(D5 )n+1<a,p<n+p be the curvature forms of (cj) 
and (o5), respectively. Then the equations of 
Gauss, Codazzi, and Ricci are given respec- 
tively by 


ntp 


ftd x. anos (1<ij<n), (3’), 
ntp 

f*4r—dof- Y onol (1<i<n<a<n+p), 

B DI 

f*O4=O% + 2 onog  (ntl<a,p<n+p). 

k=1 (4) 


D. Basic Notions 


Let M be a Riemannian submanifold of M. A 
point x € M is called a geodesic point if c —0 at 
x. If every point of M is a geodesic point, then 
M is called a totally geodesic submanifold of 
M. M isa totally geodesic submanifold of M if 
and only if every geodesic of M is a geodesic of 
bi | 

A mapping b: M v(M) defined by x^ 
7 2214 0(6;, €) is independent of the choice of 
an orthonormal basis (el b is called the mean 
curvature vector and ||b || is called the mean 
curvature. M is called a minimal submanifold 
of M if }=0(— 275 Minimal Submanifolds). 

A point xe M is called an umbilical point if 
0 —gGbat x. xeM is an umbilical point if 
and only if A; is proportional to the identity 
transformation for all če v,(M). If every point 
of M is an umbilical point, then M is called a 
totally umbilical submanifold of M. 

A point xe M is called an isotropic point if 
lo(X, X)l/I.X |? does not depend on X e 
T.(M). If every point of M is an isotropic 
point, then M is called an isotropic submani- 
fold of M. It is clear that an umbilical point is 
an isotropic point. 

p(x) 2 dim (^, (ker A, is called the index 
of relative nullity at x e M. 
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E. Rigidity 


An isometric immersion f: M — M is said to be 
rigid if it is unique up to an isometry of M, 
that is, if /': M — M is another isometric im- 
mersion, then there exists an isometry q of M 
such that f'—9o f. If f: M M is rigid, then 
every isometry of M can be extended to an 
isometry of M. 

An isometric immersion f: M — M"*!(c) of 
an n-dimensional Riemannian manifold into 
an (n+ 1)-dimensional complete and simply 
connected space form is rigid in each of the 
following cases: 

(1) n=2, c=0, and M is compact and of posi- 
tive curvature (S. Cohn-Vossen (1929)). 

(2) The index of relative nullity is x n —3 at 
each point (R. Beez (1876); — [8]). 

(3) n2 5, c» 0, M is complete, and the index of 
relative nullity is X n— 2 at each point (D. 
Ferus (1970)). 

(4) n24, c 0, and the tscalar curvature of M 
is constant ( z n(n — 1)c) (C. Harle (1971)). 

À generalization of (2) for the case of higher 
codimension was obtained by C. Allendoerfer 
(1939). Various rigidity conditions have been 
studied by S. Dolbeault-Lemoine, R. Sack- 
steder, E. Kaneda and N. Tanaka, and others. 


F. Totally Geodesic and Totally Umbilical 
Submanifolds 


A totally geodesic submanifold of a space form 
is also a space form of the same curvature. 
Totally geodesic submanifolds of compact 
tsymmetric spaces of rank 1 were completely 
classified by J. A. Wolf (1963), and totally 
geodesic submanifolds of symmetric spaces of 
rank z2 were studied by Wolf and B. Y. Chen 
and T. Nagano [4]. 

Let f:M—>M"*?(é) be a totally umbilical 
immersion of an n-dimensional Riemannian 
manifold M into an (n+ p)-dimensional space 
form. Then M is a space form M"(c) with cz 6, 
and f(M) is contained in a certain (n 4- 1)- 
dimensional totally geodesic submanifold 
M"*?(é) of M"*?(é). If 60, then f(M) is local- 
ly a hypersphere; if č=0, then f(M) is locally 
a hyperplane or a hypersphere; if ¢<0, then 
f(M)is locally a geodesic sphere, a horo- 
sphere, or a parallel hypersurface of a totally 
geodesic hypersurface [2]. 


CG Minimal Submanifolds 


For general properties of minimal submani- 
folds — 275 Minimal Submanifolds. 

There is no compact minimal submanifold 
in a simply connected Riemannian manifold 
with nonpositive sectional curvature (O'Neill 
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(1960)). On the contrary, a sphere has plenty 
of compact minimal submanifolds. 
For each positive integer s, an n-dimensional 


can be minimal- 


(s-- n — 2)! | 
sni f 
dimensional unit sphere and the immersion 

is rigid if n=2 or s« 3 (E. Calabi (1967), M. 

do Carmo and N. Wallach (1971)). 

Among all n-dimensional compact minimal 
submanifolds of an (n+ p)-dimensional unit 
sphere, the totally geodesic submanifold is 
isolated in the sense that it is characterized by 
each of the following conditions: 


sphere of curvature 


n 
s(s+n—1) 


ly immersed into a fos *n-l) 


(1) sectional curvature > (T. Itoh 


n 
2(n+1) 
(1978)), 

(2) Ricci curvature >n—2 (N. Ejiri (1979)), 


n 
2—1/p e 





(3) scalar curvature 7 n(n— 1)— 


Simons (1968)). 


H. Submanifolds of Constant Mean Curvature 


A manifold of constant mean curvature is a 
solution to a variational problem. In partic- 
ular, with respect to any volume-preserving 
variation of a domain D in a Euclidean space, 
the mean curvature of M — OD is constant if 
and only if the volume of M is critical. 

The interesting question "If the mean curva- 
ture of an isometric immersion f: M M"*!(c) 
of an n-dimensional compact Riemannian 
manifold M into an (n 4- 1)-dimensional space 
form M"*! (c) is constant, is M a sphere?" has 
not yet been completely solved, where M"*! (c) 
denotes a Euclidean space, a hyperbolic space, 
or an open hemisphere according as c —0, <0, 
or >0. The answer is affirmative in the follow- 
ing cases: (1) dim M —2, and the tgenus of M is 
zero (H. Hopf (1951), Chern (1955)). (2) f is an 
embedding (A. D. Alexandrov (1958); — [8]). 

These results remain true even if the as- 
sumption “the mean curvature is constant" is 
replaced by the weaker condition “the prin- 
cipal curvatures k; 2 ... > k, satisfy a relation 
Q(k,, ... ka) 20 such that &qo/0k; 0." 

Unlike an open hemisphere, a sphere S 
admits many compact hypersurfaces of con- 
stant mean curvature; among which totally 
umbilical hypersurfaces and the product of 
two spheres are the only ones with nonnega- 
tive sectional curvature (B. Smyth and K. 
Nomizu (1969)). 

A nonnegatively or nonpositively curved 
complete surface of nonzero constant mean 
curvature in a 3-dimensional space form M°(c) 
is either a sphere or a *Clifford torus if c> 
0 and is either a sphere or a right circular 


nt) 
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cylinder if c <0 (T. Klotz and R. Osserman 
(1966—1967), D. Hoffman (1973)). 


I. Isoparametric Hypersurfaces 


A hypersurface M of M is said to be isopara- 
metric if M is locally defined as the tlevel set 
of a function f on (an open set of) M with 
property 


dí^d|df|?—0 and dfad(Af)=0. 


A hypersurface M of a complete and simply 
connected space form M"*' (c) is isoparametric 
if and only if M has locally constant principal 
curvatures (Cartan). If c <0, M has at most 
two distinct principal curvatures (Cartan). If 
c0, the number of distinct principal cur- 
vatures of M is 1, 2, 3, 4, or 6 (H. Münzner 
(1980)). If c=0, then M is locally S* x E"'*, 
and if c « 0, then M is locally E" or S* x Hr 
(Cartan). Isoparametric hypersurfaces of S"*! 
having at most three distinct principal curva- 
tures were completely classified by Cartan. R. 
Takagi, T. Takahashi, and H. Ozeki and M. 
Takeuchi obtained several results for isopara- 
metric hypersurfaces of S"*! with four or six 
distinct principal curvatures [7]. 

If a subgroup of the isometry group of 
M"* (c) acts transitively on M, then M is 
isoparametric. The converse is true if c<0, or 
if c 0 and M has at most three distinct prin- 
cipal curvatures (Cartan), but not true in gen- 
eral (Ozeki and Takeuchi [7]). 


J. Isometric Immersions between Space Forms 


Let f: M'"(c)-» M"*"(C) be an isometric immer- 
sion of an n-dimensional space form into an 
(n+ p)-dimensional space form. 

(1) Ifn=2, p-1, c>0, cz 6€, M7(c) is complete, 
and M?(?) is complete and simply connected, 
then f is totally umbilical (H. Liebmann 
(1901); > [2]). 

(2) If nz2, p=1, c<0,c<é, M?(c) is complete, 
and M?(č) is complete and simply connected, 
then f does not exist (D. Hilbert (1901); — 
[2]. 

(3) If nz2, p21, c20 «6, M?(0) is complete, 
and M°(é) is complete and simply connected, 
then there exist infinitely many f (L. Bianchi 
(1896); — [2]). 

(4) f n22, p=1, c0» 6, M?(0) is complete, 
and M?(?) is complete and simply connected, 
then f(M"?(0)) is either a horosphere or a set of 
points at a fixed distance from a geodesic (J. 
Volkovand S. Vladimirova, S. Sasaki; — [2]). 
(5) If nz3, p=1, and c», then f is totally 
umbilical. 

(6) If p=1, c=č=0, M"(0), is complete, and 
M"*! (0) is complete and simply connected, 
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then f is cylindrical (A. Pogorelov (1956), P. 
Hartman and L. Nirenberg (1959), and others). 
(7) If p<n—1,c=é€>0, and both M"(c) and 
M"*?(é) are complete, then f is totally geodesic 
(D. Ferus (1975)). 

(8) If p<n—1, €>c>0, M"(c) is complete, and 
M"**(6) is complete and simply connected, 
then f does not exist (J. Moore (1972)). 


K. Homogeneous Hypersurfaces 


Let M be an n-dimensional thomogeneous 
Riemannian manifold which is isometrically 
immersed into an (n+ 1)-dimensional complete 
and simply connected space form M"*'(c). 

(1) If c=0, then M is isometric to S* x E" * (S. 
Kobayashi (1958), Nagano (1960), Takahashi 
[9]). 

(2) If c>0, then M is isometric to E? or else is 
given as an orbit of a subgroup of the isometry 
group of M"*'(c) (W. Y. Hsiang, H. B. Lawson, 
Takagi; — [7]). 

(3) If c «0, then M is isometric to E", S* x H"'*, 
or a 3-dimensional group manifold 


e 0 x 
B= [o e` (un) 
0 0 1 


with the metric ds? =e ?'dx? + e?! dy? 4+ dt? 
(Takahashi (1971)). Each of the hypersurfaces 
above except E? in (2) and B in (3) is given as 
an orbit of a certain subgroup of the isometry 
group of M"* (c). 


L. Káhler Submanifolds 


A tcomplex submanifold of a *Káhler manifold 
is a Kahler manifold with respect to the in- 
duced Riemannian metric. A complex analytic 
and isometric immersion of a Kahler manifold 
(M, J, g) into a Kahler manifold (M, J, g) is 
called a Kahler immersion, and M is called a 
Kahler submanifold of M. A Kahler submani- 
fold is a minimal submanifold. A compact 
Kahler submanifold M of a Kahler manifold 
M can never be homologous to 0, that is, there 
exists no submanifold M’ of M such that M — 
OM’. If [M] e H,(M, Z) denotes the thomology 
class represented by a Kahler submanifold M 
of M, then vol(M) x vol(M") holds for any 
submanifold M'e[M] with equality if and 
only if M' is a Kahler submanifold (W. Wir- 
tinger (1936)). 

A Kahler manifold of constant fholomor- 
phic sectional curvature is called a complex 
space form. An n-dimensional complete and 
simply connected complex space form is either 
P,(C), C", or D,. Every Kahler submanifold of 
a complex space form is rigid (Calabi [10]). 

Kahler immersions of complex space forms 
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into complex space forms were completely 
determined by Calabi [10] and by H. Naka- 
gawa and K. Ogiue (1976). 

C" (resp. D,) is the only fHermitian sym- 
metric space that can be immersed in C" (resp. 
D,) as a Kahler submanifold (Nakagawa and 
Takagi [11]), and Káhler tmmersions of Her- 
mitian symmetric spaces into P,,(C) were 
precisely studied by Nakagawa, Y. Sakane, 
Takagi, Takeuchi, and others. More generally, 
Kahler immersions of homogeneous Káhler 
manifolds into P,,(C) were determined by 
Takeuchi (1978). 

0,= (Ezi]e P... (O)] xz =0} in P,,,(C) is 
the only Einstein-Káhler hypersurface of a 
complex space form that is not totally geodesic 
(B. Smyth (1967), S. S. Chern (1967)). The 
result remains true even if “Einstein” is re- 
placed by "parallel Ricci tensor" (Takahashi 
(1967)). Besides linear subspaces, Q, is the only 
Einstein-Káhler submanifold of P,,(C) that is a 
complete intersection (J. Hano (1975)). 

Integral theorems and pinching problems 
with respect to various curvatures for com- 
pact Kahler submanifolds of P,,(C) have been 
studied by K. Ogiue, S. Tanno (1973), S. T. 
Yau (1975), and others [12]. For example, if 
the holomorphic sectional curvature of P, AC) 
is 1, then each of the following is sufficient for 
an n-dimensional compact Kahler submani- 
fold to be totally geodesic: 

(1) holomorphic sectional curvature > 1/2 (A. 
Ros (1985)), 

(2) sectional curvature > 1/8 (A. Ros and L. 
Verstraelen (1984)), 

(3) Ricci curvature >n/2 [12], 

(4) embedded and scalar curvature >n? (J. H. 
Cheng (1981)). 

The index of relative nullity u(x) of an n- 
dimensional complete Káhler submanifold M 
of P,(C) satisfies Min, u(x) =0 or 2n (K. Abe 
(1973)). 


M. Totally Real Submanifolds 


An isometric immersion of a Riemannian 
manifold (M, g) into a Káhler manifold 

(M, J, j) satisfying JT,(M) c v,(M) at each 
point xe M is called a totally real immersion, 
and M is called a totally real submanifold of 
M. A totally geodesic submanifold P,(R) in 
P,(C), S! x S! in P,(C) and an immersion P,(C) 
— Bus zl Cl defined by [2;] [ziz;] give typical 
examples of totally real submanifolds. 


N. Submanifolds with Planar Geodesics 


A surface in E? whose geodesics are all plane 
curves is (a part of) a plane or sphere. More 
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generally, let f: M — M"(c) be an isometric 
immersion of M into a complete and simply 
connected space form M"(c). If the image 

of each geodesic of M is contained in a 2- 
dimensional totally geodesic submanifold of 
M"'(c), then f is either a totally geodesic im- 
mersion, a totally umbilical immersion or a 
minimal immersion of a compact symmetric 
space of rank 1 by harmonic functions of de- 
gree 2; the last case occurs only when c» 0 
(S. L. Hong (1973), J. Little (1976), K. Saka- 
moto [14]). 

A Kahler submanifold of a complete and 
simply connected complex space form with 
the same property as above is either a totally 
geodesic submanifold or a Veronese sub- 
manifold (a Kahler immersion of P,(C) into 
Ba al CH (K. Nomizu (1976)). 

Submanifolds with the above property are 
closely related to isotropic submanifolds with 
V'g =0. Submanifolds with V’c =0 in sym- 
metric spaces have been studied by Ferus, H. 
Naito, Takeuchi, K. Tsukada, and others. 


O. Total Curvature 


Let f: M > E" be an isometric immersion of an 
n-dimensional compact Riemannian manifold 
M into a Euclidean space. Let v, (M) be the 
unit normal bundle, S"! the unit sphere cen- 
tered at Oc E", and let f, :vV,(M) S"^! be the 
parallel translation. Let o and Q be the tvol- 
ume elements of v, (M) and S"~', respectively. 
Then for each Ze, (M), f*O— (det A;)o holds. 
As a generalization of the ttotal curvature for 
a space curve, the total curvature of the im- 
mersion f is defined as 


1 
ee ee *Oo 
vol(S"7!) M SH 


1 
~ vol(S"-!) Lan 


uf)= 


|det A.|a. 


If B(M) is the least number of critical points of 
a tMorse function on M, then 


infc(f)= 8(M)22 


holds (Chern and R. Lashof (1957, 1958), 
Kuiper (1958)). t( f) 22 if and only if f is an 
embedding and f(M) is a convex hypersurface 
of some E"*! in E" (Chern and Lashof (1958)). 
If «( f) « 3, then M is homeomorphic to $” 
(Chern and Lashof (1958)). These results gen- 
eralize theorems for space curves by W. Fen- 
chel (1929), I. Fary (1949), J. Milnor (1950), 
and others. 

An isometric immersion f which attains 
inf( f) is called a minimum immersion or a 
tight immersion. tExotic spheres do not have 
minimum immersions (Kuiper (1958)). tR- 





365 Ref. 
Riemannian Submanifolds 


spaces have minimum immersions and, in 
particular, a minimum immersion of a sym- 
metric R-space is a *minimal immersion into a 
hypersphere (Kobayashi and Takeuchi (1968)). 

The total mean curvature of an isometric 
immersion f: M >E of an n-dimensional 
compact Riemannian manifold into a Euclid- 
ean space is, by definition, 


| IA 
M 


It satisfies 


| (bis 1 > vol(S”), 
M 


where S” is the n-dimensional unit sphere. The 
equality holds if and only if f is totally umbil- 
ical (T. J. Willmore (1968), Chen [3]). 
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Riemann-Roch Theorems 


A. General Remarks 


The fRiemann-Roch theorem (abbreviation: 

R. R. theorem) is one of the most significant 
results in the classical theory of talgebraic 
functions of one variable. Let X be a compact 
*Riemann surface of tgenus g, and let D = 
xm, P, be a tdivisor on X. We denote by 

deg D the degree of D, which is defined to be 
Èm. The divisor D is said to be positive if 

D 0 and m; Z0 for all i. A nonzero *mero- 
morphic function f on X determines a divisor 
(f)—-Xa;Q; — X bRj(a;, b; 0), where the Q; are 
the zeros of order a; and the R; are the poles of 
order b;. The set of meromorphic functions f 
such that ( f) 4- D is positive, together with the 
constant f —0, forms a finite-dimensional 
linear space L(D) over C. The R. R. theorem 
asserts that dim L(D) 2 degD —g + 1+r(D), 
where r(D) is a nonnegative integer determined 
by D. If K is the *canonical divisor of X, then 
r(D)- dim L(K — D) (2 9 Algebraic Curves C; 
11 Algebraic Functions D). (For the R. R. 
theorem for algebraic surfaces — 15 Algebraic 
Surfaces D.) 

Generalizations of this important theorem 
to the case of higher-dimensional compact 
*complex manifolds were obtained by K. 
Kodaira, F. Hirzebruch, A. Grothendieck, M. 
F. Atiyah and I. M. Singer, and others. Let X 
be a compact complex manifold, B be a tcom- 
plex line bundle over X, and (B) be the *sheaf 
of germs of holomorphic cross sections of B. 
When B is determined by a divisor D of X, we 
have H?(X, ((B)) z L(D). Hence a desirable 
generalization of the R. R. theorem will pro- 
vide a description of dime H°(X, @(B)) in terms 
of quantities relating to the properties of X 
and B. Following this idea, various theorems 
of Riemann-Roch type have been obtained. 


B. Hirzebruch's Theorem of R. R. Type 


Keeping the notation given in Section A, we 
put 7(X, &(B)) 2 3,( —1)* dim H*(X, O(B)). 
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Generally, if 1s an arbitrary tcoherent ana- 
lytic sheaf on X, we can define y(X, F) using 
the same formula (replacing O(B) by F). The 
quantity "LK, F) has simple properties in 
various respects. For example, if the sequence 
O> F’ >F —"-—40O0 is exact, we have y(X, F) 
—JX(X, F')+ x(X, F”). If an tanalytic vector 
bundle F depends continuously on auxiliary 
parameters, then y(X, O(F)) remains constant. 
Let X be a projective algebraic manifold of 
complex dimension n. We consider the tChern 
class c=1+c,+...+c, of X and express it 
formally as the product IT7-, (1 4 y;). Thus the 
ith Chern class c; is expressed as the ith ele- 
mentary symmetric function of y4, ..., Yp. COn- 
sider the formal expression T(X) — IT, y;/ 

(1 —e ?). T(X) can be expanded as a formal 
power series in the y;, and each homogeneous 
term, being symmetric in the y;, can be ex- 
pressed as a polynomial in c,, ...,c,, and thus 
determines a cohomology class of X. Similarly, 
we consider the formal expression of the Chern 
class of the vector bundle F as 1+d,+...+4, 
— [If-, (1 - 9j, where q is the dimension of the 
fiber of F. We put ch(F)= D7, ei The formal 
series ch(F) is also an element of the coho- 
mology ring of X whose (v+ 1)st term consists 
of a 2v-dimensional cohomology class. We call 
ch(F) the tChern character of F (— 237 K- 
Theory B), and define T(X, F) to be the value 
of ch(F) T(X) at the tffundamental cycle X. 
(The multiplication ch(F)- T(X) is formal. 
T(X, F) is determined by the term of dimen- 
sion 2n alone.) T(X, F) is called the Todd char- 
acteristic with respect to F. Hirzebruch's theo- 
rem of R. R. type asserts that y(X, O(F))= 
T(X, F). In particular, when n— 1, F-[D] 
(the line bundle determined by the divisor D), 
Hirzebruch's formula yields the classical R. R. 
theorem. If F satisfies the conditions for the 
vanishing theorem of cohomologies, the for- 
mula gives an estimate for dim H9(X, O(F)) 
[11]. 

In 1963, Atiyah and Singer developed a 
theory on indices of elliptic differential 
operators on a compact orientable differenti- 
able manifold and obtained a general result 
that includes the proof of Hirzebruch's 
theorem for an arbitrary compact complex 
manifold [4, 5] (— 237 K-Theory H). 


C. R. R. Theorem for Surfaces 


If X is a compact complex surface, i.e., a com- 
pact complex manifold of dimension 2, then 
for complex line bundles F, and F,, the inter- 
section number (F, F;) is defined to be c,(F,)U 
c4 (F5) X]. The R. R. theorem for a line 
bundle F on X is stated as follows: x(X, O(F)) 
=(F?)/2—(KF)/2+((K?)+c,(X))/12, where 
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K is the canonical line bundle of X and c,(X) 
denotes the value at X of the 2nd Chern class 
of X, that is, the Euler number of X. 

The Noether formula y, =((K?)+c,(X))/12 
follows from the above identity. The R. R. 
theorem for surfaces is a powerful tool for the 
study of compact complex surfaces. 


D. Grothendieck's Theorem of R. R. Type 


Grothendieck took an entirely new point of 
view in generalizing Hirzebruch's theorem. 
The following is a description of his idea as 
reformulated by A. Borel and J.-P. Serre [8]. 
We consider a nonsingular quasiprojective 
algebraic variety X (— 16 Algebraic Varieties) 
over a ground field of arbitrary characteristic. 
Namely, X is a closed subvariety of an open 
set in a projective space (over an algebraically 
closed ground field). Consider the group K(X), 
which is the quotient of the free Abelian group 
generated by the equivalence classes of alge- 
braic vector bundles over X modulo the sub- 
group generated by the elements of the form 
F—F'—F", where F, F', F" are classes of 
bundles such that there exists an exact se- 
quence 02 F'2 F5 F"—0. A similar construc- 
tion for the (equivalence classes of the) tcoher- 
ent algebraic sheaves instead of the vector 
bundles yields another Abelian group K'(X). 
It can be shown that K(X) is isomorphic to 
K'(X) by the correspondence F — ((F) (=the 
sheaf of germs of regular cross sections of F). 
Addition in K(X) is induced by the Whitney 
sum of the bundles, and K(X) has the struc- 
ture of a ring with multiplication induced by 
the tensor product. For a vector bundle F, its 
Chern class c(F)=1+¢,(F)+...+c¢,(F) (q= 
the dimension of the fiber) is defined as an 
element of the tChow ring A(X) with appro- 
priate properties. (A(X) is the ring of the 
rational equivalence classes of algebraic cycles 
on X, and c,(F) is the class of a cycles of co- 
dimension i.) We define ch(F) as before. It 

can be shown that c(F) and ch(F) are deter- 
mined by the image of F in K(X), and we have 
ce +n) =c(S)e(n), ch(č +) — ch($) + ch(n), 
ch(én) =ch(é)ch(n) (£, qe K(X)). If we have a 
tproper morphism f: Y X between quasi- 
projective algebraic varieties Y and X, we 
have homomorphisms f': K(X) K(Y) and 

fi: K(Y)— K(X). The former is defined by 
taking the induced vector bundle and the 
latter by the correspondence 


$»(-fs, 
where ¥ is a coherent algebraic sheaf on Y 
and (2*f)F is the gth tdirect image of F 


under f. (Since f is proper, (#4f)F is coher- 
ent.) Between Chow rings we have homomor- 
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phisms f *: A(X)5 A(Y) and f,: A(Y)— A(X), 
defined by taking inverse and direct images of 
cycles. With this notation, the theorem asserts 
that if X and Y are quasiprojective and f: Y^ 
X is a proper morphism, then f,(ch(2) T(Y)) — 
cht f(£)) T(X). This is called Grothendieck's 
theorem of R. R. type. If X consists of a single 
point, the theorem gives Hirzebruch's theorem 
for algebraic bundles. Since algebraic and 
analytic theories of coherent sheaves on a 
complex projective space are isomorphic, this 
result covers Hirzebruch's theorem (— 237 K- 
Theory). 

The subgroup R(Y) of A(Y) given by R(Y)— 
{ch(é)- T(Y)| € K(Y)] is called the Riemann- 
Roch group of Y. Thus, using the notions 
developed by Grothendieck, the R. R. theorem 
can be expressed as follows: R(Y) is mapped 
into R(X) by a proper morphism f: Y X. 
Generalizations to *almost complex manifolds 
and ‘differentiable manifolds were made by 
Atiyah and Hirzebruch in this latter form [1]. 
One of the remarkable results is that an ele- 
ment of R(Y) takes an integral value at the 
fundamental cycle. This theorem ts obtained 
by taking X to be a single point. 


E. R. R. Theorem for Singular Varieties 


Let X be a projective variety over C and let 
HX) (resp. H (X)) denote the singular ho- 
mology (resp. cohomology) group with rational 
coefficients. Note that K(.X) may not agree 
with K'(X) when X is singular. The R. R. 
theorem for X formulated by P. Baum, W. 
Fulton, and R. MacPherson [6] says that 
there exists a unique natural transformation 
t: K'(X) H(X) such that (1) if £e K(X) and 
ne K'(X), then € @ ge K'(X) and «($69 5) — 
ch(£) (t(y)); (2) whenever X is nonsingular, 
t(@y)= T(X)(X). Note that the naturality of t 
means that for any f: X > Y and any ne K'(X), 


Ft) — fun). 
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A. General Remarks 


Riemann considered certain surfaces, now 
named after him, obtained by modifying in a 
suitable manner the domains of definition of 
multiple-valued *analytic functions on the 
complex plane in order to obtain single-valued 
functions defined on the surfaces. For example, 
consider the function z= f(w) 2 w?, where w 
varies in the complex plane, and its inverse 
function w=g(z)= dge Then g(0)=0 and 

g(oo) — œ, whereas if z #0, oo, there exist two 
values of w satisfying g(z) ^ w. By setting z 

— re? (r» 0,0« 0 2n), the corresponding 

two values of w are w, — Vr e and w, = 
yreur Now consider how we should 
modify the complex z-plane so that we can 
obtain a single-valued function on the modi- 
fied surface representing the same relationship 
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between z and w. Let z, and z, be two copies 
of the complex plane. Delete the nonnegative 
real axes from z, and z, and patch them 
crosswise along the slits (Fig. 1). The surface R 
thus obtained is locally homeomorphic to the 
complex plane except for the origin and the 
point at infinity, and situations at the origin 
and the point at infinity are as indicated in 
Fig. 1. For z ZO, oo, there are two points z, 
and z, in z; and z,, respectively, with the 
same coordinate z. Let w, and w, correspond 
to z, and z,, respectively. Then the function 
w=./z becomes single-valued on R, and w, 
and w, are holomorphic functions of z, and 
Z2, respectively. The surface R is called the 
Riemann surface determined by w= /z. . 





Fig. 1 


Working from the idea illustrated by this 
example, H. Weyl and T. Radó gave rigorous 
definitions of Riemann surfaces. The usual de- 
finition nowadays is as follows: Let d be a set 
of pairs (U, wy) of open sets U in a tconnected 
tHausdorff space R and topological mappings 
Wy of U onto plane regions satisfying the 
following two conditions: (i) R =|] u, yea U; 
(ii) for each (U, e: L (U2, Yu EA with V= 
U, YU, 7 Ø, by, o Vu] gives an (orientation- 
preserving) tconformal mapping of each con- 
nected component of yy (V) onto a corre- 
sponding one of yy (V). Two such sets W, 
and W, are equivalent, by definition, if W, U 
UA, also satisfies conditions (i) and (ii). The 
equivalence class (91) of such 9I is called a 
conformal structure (analytic structure or com- 
plex structure) on R (— 72 Complex Mani- 
folds). A pair (R, (GI consisting of a connected 
Hausdorff space R and a conformal structure 
(91) is called a Riemann surface, with R its 
base space and (91) its conformal structure. (A 
Riemann surface in this sense is sometimes 
called an abstract Riemann surface.) It is a 
complex manifold of tcomplex dimension 1. 
For (U, Wy) in We(QW), (U, wy) (or sometimes U 
itself) is called an analytic neighborhood, and 
Wy is called a local uniformizing parameter (or 
simply a local parameter). In the remainder of 
this article we call R itself a Riemann surface. 

From condition (i) it follows that a Riemann 
surface R is a real 2-dimensional ttopological 
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manifold. Moreover, by condition (ii) it can be 
deduced that R satisfies the *second counta- 
bility axiom and consequently is a tsurface 
and admits a tsimplicial decomposition (T. 
Rado, 1925). It is also orientable (— 410 Sur- 
faces). Therefore R is a tlocally compact metric 
space. It is not possible to define curve lengths 
on R compatible with the conformal structure, 
but since angles can be defined, R is consid- 
ered to be a real 2-dimensional space with a 
*conformal connection. It is customary in the 
theory of functions to call R closed if it is 
compact and open otherwise. A plane region D 
is considered an open Riemann surface with 
the conformal structure d =(D, 1:D D). A 
Riemann sphere is also considered to be a 
closed Riemann surface whose analytic neigh- 
borhoods are given by (U,, ei and {U,, 1/o}, 
where U,(U,) is the domain corresponding to 
{|z|<2}({|z|> 1/2} U {co}) under the stereo- 
graphic projection o. 

A function f on a Riemann surface is said to 
be meromorphic, holomorphic, or harmonic 
on R if f oV! is tmeromorphic, tholomor- 
phic, or tharmonic in the usual sense on w,(U) 
for every analytic neighborhood (U, Wy). More 
generally, suppose that for mappings between 
plane regions we are given a property 8 that 
is invariant under conformal mappings. A 
mapping T of a Riemann surface R, onto 
another Riemann surface R, is said to have 
the property $f if the mapping Vy, o Toy! 
of Wy (U,) into wy (U;) has the property $ 
for every pair of analytic neighborhoods 
(U1, Vy.) nd (U2, v, ) in R, and R,, respec- 
tively. Thus such a mapping T may be con- 
formal, tanalytic, *quasiconformal, harmonic, 
etc. If there exists a one-to-one conformal 
mapping of a Riemann surface R, onto an- 
other Riemann surface R,, then R, and R, are 
said to be conformally equivalent. Two such 
Riemann surfaces are sometimes identified 
with each other. 


B. Covering Surfaces 


One of the main themes of the theory of func- 
tions is the study of analytic mappings of a 
Riemann surface R into another Riemann 
surface Ro, i.e., the theory of covering surfaces 
of Riemann surfaces. 

Suppose, in general, that there are two 
surfaces R and Ry and a continuous open 
mapping T of R into Re such that the inverse 
image of a point in Ry under T is an tisolated 
set in R. Then T is called an inner transfor- 
mation in the sense of Stoilow and (R, Ry; T) a 
covering surface with Rọ its basic surface and 
T its projection. Often R is called simply a 
covering surface of Ry. A point pg with po — 
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T(p) is called the projection of p, and p is said 
to lie above pp. In this case, there exist sur- 
face coordinates (U, y) and (Up, Wo) at p and 
Do, respectively, such that y(U)= (z ||z| « 1], 
V(p) —0, (Uo) — (w | hw] « 1), Vo (po) —0, and 
w=(,0 Tow ')(z)—z", with the positive 
integer n independent of the choice of coordi- 
nate neighborhoods. If n» 1, then p ts called a 
branch point, n the multiplicity, and n — 1 the 
degree of ramification. The set of all branch 
points forms an at most countably infinite 

set of isolated points. If there is no branch 
point, then the covering surface (R, Rọ; T) and 
the projection T are said to be unramified. For 
a given curve C, in Ry and a point p in R lying 
above the initial point of Co, a curve C in R 
with p its initial point satisfying T(C) - C, is 
called the prolongation along C, (or the lift of 
Co) starting from p. If any proper subarc of 
C, sharing the initial point with C, admits a 
prolongation along itself starting from p but 
C, does not admit a prolongation along itself 
starting from p, then R is said to have a rela- 
tive boundary above the endpoint of Cy. A 
covering surface without a relative boundary is 
called unbounded. A tsimply connected surface 
R® that is an unramified unbounded covering 
surface of Ry is said to be a universal covering 
surface of Ry. The universal covering surface 
exists for every R. 

Suppose that R is an unbounded covering 
surface of Ry. Then the number of points on 
R that lie above each point of Ry is always 
constant, say n (< +00), where the branch 
points of R are counted with their multiplic- 
ities. n is called the number of sheets of R over 
Ro, or R is said to be n-sheeted over Ry. If R 
and R, are compact surfaces with tEuler char- 
acteristic y and yo, respectively, and if R is an 
n-sheeted covering surface of Ro, then we have 
the Riemann-Hurwitz relation: y 2 ny, — V, V 
being the sum of the degrees of ramification. A 
topological mapping S of an unramified un- 
bounded covering surface R of Ry onto itself 
such that ToS — T is called a covering trans- 
formation. The group of all covering trans- 
formations of a universal covering surface of 
Ro is isomorphic to the "fundamental group 
De, the 1-dimensional homotopy group) of 
Ro. 

In a covering surface (R, Ry; T) whose basic 
surface Ry is a Riemann surface, T can be 
regarded as an analytic mapping of R onto Ro 
by giving R a conformal structure in a natural 
manner. In particular, if Ro is the sphere, then 
its covering surface is a Riemann surface. 
Conversely, any Riemann surface can be re- 
garded as a covering surface of the sphere. This 
fact had long been known for closed Riemann 
surfaces; for open Riemann surfaces, it can be 
deduced from the existence theorem of an- 
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alytic functions proved by H. Behnke and K. 
Stein, which states that an open Riemann 
surface is a tStein manifold. Historically, by a 
Riemann surface mathematicians meant either 
the abstract Riemann surface or a covering 
surface of the sphere, until the two notions 
were proved identical. 

Suppose that R is a covering surface of the 
z-sphere Ry with the projection T: R> Ro, and 
denote by R, the region that lies above 0 « |z — 
a| € rg. If there exists a topological mapping 
y of R, onto ((r,0)|O«r «ry, —oc «0« oo] 
such that a re? = T(y ^! (r, 0)), then R is said 
to have a logarithmic branch point above a; in 
contrast, a branch point of multiplicity n of the 
type defined previously is sometimes called an 
algebraic branch point. 

Ahlfors's theory of covering surfaces, which 
treats covering surfaces (R, Ro; T) not only 
from the topological viewpoint but also from 
the metrical one, is particularly important. 

Let R and R, be either compact surfaces with 
simplicial decompositions or their closed 
subregions with boundaries consisting of 1- 
simplexes and vertices such that T' preserves 
simplicial decompositions. Here we call the 
part of the boundary of R whose projection is 
in the interior of Rọ the relative boundary. 
With respect to a suitable metric on Ro, let S 
be the ratio of areas of R and Ro, L the length 
of the relative boundary, and — p and — p, the 
*Euler characteristics of R and R,, respec- 
tively. Then Ahlfors's principal theorem asserts 
that max(0, p) 2 p S — hL, where h 01s a 
constant determined only by Ry. This has been 
applied widely in various branches of mathe- 
matics, including the theory of distribution of 
values of analytic mappings between Riemann 
surfaces. 


C. Uniformization 


Suppose that we are given a correspondence 
between the z-plane and w-plane determined 
by a ffunction element po —(z$, w$) (GZ = Po(t), 
wš = Qo(t)). This correspondence generally 
gives rise to a multiple-valued analytic func- 
tion w= f(z). We show how to construct a 
Riemann surface so that the function w= f(z) 
can be considered a single-valued function on 
it. We use f again to mean the connected 
component of the set of function elements 
p=(z*, w*) (z* = P(t), w* = Q(t)) in the wider 
sense containing po, where the analytic neigh- 
borhood of each point p is defined to be the set 
of elements that are direct analytic continua- 
tions of p. Then f is a Riemann surface. For a 
point p — (z*, w*) (z* = P(t), w* = Q(t)) in f, set 
z=F(p)=P(0), w=G(p)=Q(0). Then two 
meromorphic functions z= F(p) and w= G(p) 
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are defined on f, and f can be considered as a 
covering surface of the z-sphere and the w- 
sphere. We call f an tanalytic function in the 
wider sense. Thus we obtain a single-valued 
function w= G(p) on the Riemann surface f 
that can be regarded as a modification of the 
original function w= f(z). Suppose that there 
exist two meromorphic functions z= o (0) and 
w — (C) on a region D in the C-plane, and let 
z= DU — 659) and wess Q(C —6,) be tLaurent 
expansions of o and y at each point £, in D. If 
the function element p, — (z*, w*) (z* = P(t), wi 
— Q(t)) belongs to the Riemann surface f, then 
the correspondence w= f(z) determined by the 
function element p; is said to be locally unifor- 
mized on D by z= ¢(¢) and w= y (C). In par- 
ticular, if (p; | Ce D) — f, then f is said to be 
uniformized by z= 91) and w = y (C). If an 
analytic function f in the wider sense, consid- 
ered as a Riemann surface, is conformally 
equivalent to a region D in the ¢-plane, then f 
can be uniformized by z = F(p) and w=G(p). 
In general, f is not conformally equivalent to a 
plane region, but if an unramified unbounded 
covering surface ( f, f; T) of f is conformally 
equivalent to a region D in the ¢-plane, then f 
is uniformized by z= Fo T(¢) and w=Go T(C) 
(Schottky's uniformization). In particular, since 
the universal covering surface ( f ^, f ; T) of f is 
simply connected, it is conformally equivalent 
to the sphere |£| € oo, the finite plane || « oo, 
or the unit disk || < 1. Consequently, f is 
uniformized by z= Fo T(¢) and w=Go T(C). 
Therefore analytic functions in the wider sense 
are always uniformizable. 

For example, an talgebraic function f consid- 
ered as a Riemann surface is always closed. If 
its *genus g =0, then f is the sphere and is thus 
uniformized by rational functions z = F(¢) and 
w=G(Q). If g>0, then (f^, f; T) is conformally 
equivalent to |f|<1 or |f|< oo, and hence f is 
uniformized by z= Fo T(c) and w=Go T(Q). 
When |¢|< 1, z and w are tautomorphic func- 
tions with respect to the group of linear trans- 
formations preserving || « 1, while if |£| < oo, 
they are telliptic functions. 


D. The Type Problem 


A simply connected Riemann surface R is 
conformally equivalent to the sphere, the finite 
plane, or the unit disk. Then R is said to be 
elliptic, parabolic, or hyperbolic, respectively. 
The problem of determining the types of sim- 
ply connected covering surfaces of the sphere 
by their structures, such as the distributions of 
their branch points, is called the type problem 
for Riemann surfaces. For example, if a simply 
connected covering surface does not cover 
three points on the sphere, it must be hyper- 
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bolic (Picard's theorem). The Nevanlinna 
theory of meromorphic functions stimulated 
this type problem. However, it is difficult to 
measure the ramifications of covering surfaces, 
and many detailed reuslts of the type problem 
obtained in the 1930s are limited mainly to 
the case where all branch points lie above a 
finite number of points on the sphere. A suffi- 
cient condition for R to be of parabolic type, 
given by Z. Kobayashi (using the so-called 
Kobayashi net, and a sufficient condition for 
R to be of hyperbolic type, given by S. Kaku- 
tani (using quasiconformal mappings), are 
significant results on the type problem. The 
type problem had by then been extensively 
generalized to the following classification 
theory. 


E. Classification Theory of Riemann Surfaces 


Riemann surfaces are, as pointed out by Weyl, 
*not merely devices for visualizing the many- 
valuedness of analytic functions, but rather an 
essential component of the theory ... the only 
soil in which the functions grow and thrive." 
So the problem naturally arises of how to 
extend various results in the theory of analytic 
functions of a complex variable to the theory 
of analytic mappings between Riemann sur- 
faces. In general, open Riemann surfaces can 
have infinite genus and are quite complicated. 
So to obtain fruitful results and systematic 
development, one usually sets certain restric- 
tions on the properties of the Riemann sur- 
faces. In connection with this, R. Nevanlinna, 
L. Sario, and others initiated the classification 
theory of Riemann surfaces, which classifies 
Riemann surfaces by the existence (or non- 
existence) of functions with certain properties. 
Denote by X(R) the totality of functions on 
a Riemann surface R with a certain property 
X. The set of all Riemann surfaces R for which 
X(R) does not contain any function other than 
constants is denoted by Ox. The family of 
analytic functions and the family of harmonic 
functions are denoted by A(R) and H(R), re- 
spectively. The family of positive functions, 
that of bounded functions, and that of func- 
tions with finite Dirichlet integrals are denoted 
by P(R), B(R), and D(R), respectively. From 
these families, various new families are created, 
e.g, ABD(R)= A(R)N B(R)N D(R). Usually 
Ong, Onn» Ongo» Oas» Oan» O4gp; and also Og, 
the family of Riemann surfaces on which there 
are no Green's functions, are considered. P. J. 
Myrberg found an example of a Riemann 
surface of infinite genus which has a large 
boundary but belongs to O,,. The idea behind 
Myberg's example is often used to construct 
examples in classification theory. From works 
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of Y. Toki, L. Sario, K. I. Virtanen, H. L. 
Royden, M. Parreau, M. Sakai, and others, it 
can be seen that there are inclusion relations, 
as indicated in Fig. 2, among the classes just 
mentioned. There ts no inclusion relation 
between O,, and Opp. For Riemann surfaces 
of finite genus, Og = Og. Closed Riemann 
surfaces are all in Og. 

Open Riemann surfaces in Oç are also said 
to be parabolic (or of null boundary), and those 
not in Og, hyperbolic (or of positive boundary). 
Several characterizations for parabolic 
Riemann surfaces are known. 


c Om = npn c. 
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Fig. 2 


From a similar point of view, the classifica- 
tion problem for subregions was studied in 
detail by Parreau, A. Mori, T. Kuroda, and 
others. We call a noncompact region Q which 
is the complement of a compact subset of a 
Riemann surface a Heins’s end. M. Heins 
called the minimal number ( € oo) of gen- 
erators of the semigroup of the additive class 
of HP-functions that vanish continuously at 
the relative boundary of a Heins’s end Q the 
harmonic dimension of Q. Its properties were 
investigated by Z. Kuramochi, M. Ozawa, and 
others. Generally, a function f is said to be X- 
minimal if f is positive and contained in X(R) 
and there exists a constant C, for every g in 
X(R) with f>g>0 such that g = C,f. The 
family of Riemann surfaces R not belonging to 
Og and admitting X-minimal functions on R is 
denoted by Ux. C. Constantinescu and A. 
Cornea and others studied Riemann surfaces 
in Uyg and Uyp where HD is the class of posi- 
tive functions in HD or limits of monotone 
decreasing sequences of such functions. There 
are inclusion relations Uyg Z Oyg — Oc, Unp 
2 Oyp — Os, and Uy, & Uys. One of the inter- 
esting results in classification theory is the fact, 
discovered by Kuramochi, that Uyg U Oc 5 O45 
and Uyn UOS ẸṢ 0,,. 

Classification theory has a very deep con- 
nection with the theory of *ideal boundaries. 
A. Pfluger and Royden showed that the classes 
Og and Oy, are invariant under quasicon- 
formal mappings. However, it is still an open 
problem whether Og, is invariant in this sense. 


F. Prolongations of Riemann Surfaces 


As classification theory shows, pathological 
phenomena occur for Riemann surfaces from 
the viewpoint of function theory in the plane. 
These are caused by the complexity of ideal 
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boundaries of Riemann surfaces, and in par- 
ticular by the complexity of the set of ideal 
boundary points at which handles of Riemann 
surfaces De, parts with cycles not homologous 
to 0) accumulate. Hence it is desirable to find 
larger Riemann surfaces so that ideal bound- 
ary points of original surfaces that are not 
accumulating points of handles become inte- 
rior points. Suppose that a Riemann surface 

R is conformally equivalent to a proper sub- 
region R' of another Riemann surface R,. 
Then R, is said to be a prolongation of R, and 
R is prolongable. A nonprolongable Riemann 
surface is said to be maximal. Closed Riemann 
surfaces are always maximal, but there also 
exist maximal open Riemann surfaces (Rado). 
However, every open Riemann surface is 
homeomorphic to a prolongable Riemann 
surface (S. Bochner). Characterizations of 
prolongable Riemann surfaces and relation- 
ships between the various null classes men- 
tioned in Section E and prolongations were 
investigated from several viewpoints by R. de 
Possel, J. Tamura, and others. 


G. Analytic Mappings of Riemann Surfaces 


Apart from the development of the classifica- 
tion theory of Riemann surfaces, efforts have 
been made to extend various results in the 
theory of analytic functions of a complex 
variable to the case of analytic mappings be- 
tween Riemann surfaces. L. Sario studied the 
method of normal operators, which is utilized 
to construct harmonic functions on Riemann 
surfaces with given singularities at their ideal 
boundaries; and he extended the main theo- 


rems of Nevanlinna to analytic mappings 


between arbitrary Riemann surfaces (— 124 
Distributions of Values of Functions of a 
Complex Variable). M. Heins introduced the 
notions of Lindelóf type, Blaschke type, and 
others which are special classes of analytic 
mappings. Utilizing these notions, Con- 
stantinescu and Cornea, Kuramochi, and 
others extended various results in the theory 
of cluster sets (— 62 Cluster Sets) by studying 
the behavior of analytic mappings at ideal 
boundaries. The theory of *capacities on ideal 
boundaries has also been developed. 

On every open Riemann surface R there 
exists a nonconstant holomorphic function 
(Behnke and Stein). Furthermore, Gunning 
and Narasimhan proved that there exists a 
holomorphic function on R whose derivative 
never vanishes. In other words, R is conform- 
ally equivalent to an unramified covering 
surface of the sphere. Such a locally homeo- 
morphic analytic mapping is called the im- 
mersion of R. The proof is based on the fol- 
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lowing deep result (S. Mergelyan’s theorem): . 
Suppose that K is a compact set on R such 
that R—K has no relatively compact con- 
nected component. Then every continuous 
function which is holomorphic on the interior 
of K can be approximated uniformly on K by 
a holomorphic function on R [22]. 

A surface of genus 0 is said to be planar (or 
of planar character or schlichtartig). A simply 
connected surface is planar. Using the Dirich- 
let principle, P. Koebe proved the following 
general uniformization theorem: Every planar 
Riemann surface R can be mapped conform- 
ally to the canonical slit regions on the ex- 
tended complex plane C. More precisely, given 
a point p on R, there exists the extremal hori- 
zontal (vertical) slit mapping F,(F,) such that 
(i) F,(F,) maps R conformally to a region on C 
whose boundary consists of horizontal (ver- 
tical) slits and possibly points; (ii) F, and F, 
have a simple pole at p with residue 1; (iii) the 
total area of the slits and points is zero. Sup- 
pose that F, — 1/z t- a;-- c;z 4 ... (i21,2) in 
terms of local parameter z at p. Then s=(c, — 
C,)/2 is called the span of R. It is known that 
s (= |d(F, — F3)/21?) Z0, where the equality 
holds if and only if R belongs to O,p. In the 
case of finite genus g, there also exist the con- 
formal mappings of R onto the parallel slit 
regions on the (g + 1)-sheeted covering surface 
of € (Z. Nehari, Y. Kusunoki, and others). 

L. Ahlfors proved that a Riemann surface 
of genus g bounded by m contours can be 
mapped conformally to an at most (2g + m)- 
sheeted unbounded covering surface of the 
unit disk. 

The structures of closed Riemann surfaces 
are determined by the algebraic structures of 
meromorphic function fields on them. H. Iss'sa 
obtained a noteworthy result which estab- 
lished that open Riemann surfaces are also 
determined by their meromorphic function 
fields [24]. It is known too that open Rie- 
mann surfaces are determined by their rings 
of holomorphic functions. 


H. Differential Forms on Riemann Surfaces 


Since Riemann surfaces are considered real 2- 
dimensional differentiable manifolds of class 
C”, differential 1-forms o —udx -- vdy and 
differential 2-forms Q=cdx ^ dy are defined on 
them, and operations such as the exterior 
derivative dw — (0v/OÓx — 0u/Oy) dx ^ dy and 
exterior product can be defined (— 105 Dif- 
ferentiable Manifolds). Since coordinate trans- 
formations satisfy the Cauchy-Riemann dif- 
ferential equation, the conjugate differential 

* = —vdx+udy of œw can be defined. A dif- 
ferential form c satisfying do =d * w =0 is 
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called a harmonic differential, and one with * œ 
= —i is said to be pure. A pure differential is 
expressed as w= f dz. Here, if f is a holomor- 
phic function, then c is called a holomorphic 
(or analytic) differential, and if f is a meromor- 
phic function, then c is called a meromor- 
phic (or Abelian) differential. The differential 
form c is a holomorphic differential if and 
only if it is closed (i.e., dco — 0) and pure. The 
differential c is called exact (or total) if c is 
written as dF = F,dx + F,dy with a globally 
single-valued function F. 

Next, the set of all measurable differentials 
o with |o? =|{p@A*@< oo forms a tHil- 
bert space with respect to the norm Te, The 
method of orthogonal decomposition in the 
theory of Hilbert spaces is the main device to 
study this space and also its suitable subspaces 
and to obtain the existence theorem of har- 
monic and holomorphic differentials with 
various properties (— 194 Harmonic Inte- 
grals). However, in contrast to differentiable 
manifolds, it should be noted that finer or- 
thogonal decompositions into subspaces with 
specific properties hold for open Riemann 
surfaces. For instance, let TO be the Hilbert 
space of analytic (harmonic) differentials with 
finite norm, and set I7, — (c€T, |o is exact], 
I4, — (o eI, |o is tsemiexact}; then we have 
the orthogonal decompositions 


= LES + Is. 
T, =r WK t Ls = Dus + "Thm etc., 


where *T, stands for the space {w|*weTI,} and 
I’, and Tpm are known as the space of analytic 
Schottky differentials and the space of dif- 
ferentials of harmonic measures, respectively. 
Both spaces I, and I,m have remarkable 
properties [9]. 


I. Abelian Integrals on Open Riemann Surfaces 


The systematic effort to extend the theory of 
Abelian integrals on closed Riemann surfaces 
to open Riemann surfaces was initiated by 
Nevanlinna in 1940. At the first stage of the 
development, only those Riemann surfaces 
with small boundaries (i.e., ones in Og or Oyp) 
were treated; later a more general treatment 
was made possible by the discovery of the 
notion of semiexact differentials (K. Virtanen, 
Ahlfors). 

Let R be an arbitrary open Riemann sur- 
face. A 1-dimensional cycle C is called a divid- 
ing cycle if for any compact set in R there 
exists a cycle outside the compact set homol- 
ogous to C. A differential is said to be semi- 
exact if its period along every dividing cycle 
vanishes. 
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Ahlfors defined the distinguished (com- 
plex harmonic) differentials with polar sin- 
gularities and obtained in terms of them a 
generalization of Abel’s theorem. Indepen- 
dently, Y. Kusunoki defined.the semiexact 
canonical (meromorphic) differentials and 
gave in terms of them a formulation of Abel's 
theorem and the Riemann-Roch theorem on R 
[27]. It was proved that a meromorphic dif- 
ferential df — du + idv is semiexact canonical if 
and only if du is (real) distinguished, and then 
u is almost constant on every ideal boundary 
component of R (in appropriate fcompactifi- 
cation of R). Hence every meromorphic func- 
tion f for which df is distinguished is almost 
constant on every ideal boundary component, 
and therefore f reduces to a constant by the 
boundary theorem of Riesz type if R has a 
large boundary. Whereas by the Riemann- 
Roch theorem above a nonconstant meromor- 
phic function f such that df is (exact) canon- 
ical exists on any open Riemann surface R 
with finite genus, and f gives a canonical par- 
allel slit mapping of R (— Section G). H. L. 
Royden [28] and B. Rodin also gave gener- 
alizations of the Riemann-Roch theorem. 

M. Yoshida, H. Mizumoto, M. Shiba, and 
others further generalized the Kusunoki type 
theorems. The Riemann-Roch theorem for a 
closed Riemann surface can be deduced from 
that for open Riemann surfaces by considering 
an open Riemann surface obtained from a 
closed surface by deleting a point. 

Riemann's period relation on R has been 
studied for various classes of differentials, but 
for the case of infinitely many nonvanishing 
periods, no definitive result has been obtained. 
The same is true for the theory of Abelian 
differentials with infinitely many singularities. 
On the other hand, the analogy to the classical 
theory is lost completely if no restriction is 
posed on the differentials on R. In this con- 
text the following results due to Behnke and 
Stein [26] are outstanding: (1) There exists an 
Abelian differential of the first kind on R with 
infinitely many given periods. (2) For two 
discrete sequences 1 p,) and {q,} of points in 
R, there exists a single-valued meromorphic 
function with zeros at p, and poles at qm. It is 
further proved that there exists an Abelian 
differential with prescribed divisor and peri- 
ods (Kusunoki and Sainouchi). This gener- 
alizes the results above and the Gunning- 
Narasimhan theorem. 
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A. Definition 


A nonempty set A is called a ring if the follow- 
ing conditions are satisfied. 

(1) Two toperations, called addition and 
multiplication (the ring operations), are defined, 
which send an arbitrary pair of elements a, b 
of A to elements a+b and ab of A. 

(2) For arbitrary elements a, b, c of A, these 
operations satisfy the following four laws: (i) 
a+b=b+a (commutative law of addition); (ii) 
(a+ b)+c=a+(b+c), (ab)c — a(bc) (associative 
laws); (iii) a(b + c) 2 ab 4- ac, (a + b)c — ac + bc 
(distributive laws); and (iv) for every pair a, b of 
elements of A, there exists a unique element c 
of A such that a-- c — b. Thus a ring A is an 
*Abelian group under addition. Each element 
a of a ring A determines operations L, and R, 
defined by L,(x) 2 ax, R,(x)=xa (xe A). Thus a 
ring A has the structure of a tleft A-module 
and a tright A-module. Since the operations 
L, and R, commute for every pair a, b of ele- 
ments of A, the ring A is also an A-A-bimodule 
(— 277 Modules). 

The identity element of A under addition is 
called the zero element and is denoted by 0. It 
satisfies the equation a0=0a=0 (ae A). An 
element ec A is called a unity element (identity 
element or unit element) of A if it satisfies ae — 
ea — a (ae A). If A has such a unity element, it 
is unique and is often denoted by 1. A ring with 
unity element is called a unitary ring. Most of 
the important examples of rings are unitary. 
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Hence we often call a unitary ring simply a 
ring. If a ring has only one member (namely, 
0), then 0 is the unity element of the ring. Such 
a ring is called a zero ring. However, if a ring 
has more than one element, the unity element 
is distinct from the zero element. A ring is 
called a commutative ring if it satisfies (v) ab 

= ba (a, be A) (commutative law for multipli- 
cation) (^ 67 Commutative Rings). 

In this article we shall discuss associative 
rings. Certain nonassociative rings are impor- 
tant; an example is tLie algebra. (An algebra is 
a ring having a tground ring.) 


B. Further Definitions 


An element a40 of a ring A is called a zero 
divisor if there exists an element b #0 such that 
ab —0 or ba=0. A commutative unitary ring 
having more than one element is called an 
integral domain if it has no zero divisors (— 67 
Commutative Rings). Elements a and b of a 
ring are said to be orthogonal if ab = ba =0. An 
element a satisfying a" —0 for some positive 
integer n is called a nilpotent element, and a 
nonzero element a satisfying a? — a is called an 
idempotent element. An idempotent element is 
said to be primitive if it cannot be represented 
as the sum of two orthogonal idempotent 
elements. For any subsets S and T of a ring A, 
let S+ T(ST) denote the set of elements s+ 
t(st) (seS, te T). In particular, SS is denoted 
by S? (similarly for S°, S^, etc.), and further- 
more, (aj + S((a) S) is denoted by a+ S(aS). If 
ST= TS — (0j, then subsets S and T are said 
to be orthogonal. A subset S of a ring is said to 
be nilpotent if S" 20 for some positive integer 
n, and idempotent if $? — S. 

For an element a of a unitary ring A, an 
element a’ such that a’a=1 (aa' = 1) is called a 
left (right) inverse element of a. There exists a 
left (right) inverse element of a if and only if A 
is generated by a as a left (right) A-module. If 
there exist both a right inverse and a left in- 
verse of a, then they coincide and are uniquely 
determined by a. This element is called the 
inverse element of a and is denoted by a !. An 
element that has an inverse element is called 
an invertible element (regular element or unit). 
The set of all invertible elements of a unitary 
ring forms a group under multiplication. A 
nonzero unitary ring is called a ‘skew field (or 
tdivision ring) if every nonzero element is inver- 
tible. Furthermore, a skew field that satisfies 
the commutative law is called a commutative 
field or simply a field (— 149 Fields). In a 
general ring A, if we define a new operation 
(a, b)»aob by setting aob=a+b—ab, then 
A is a tsemigroup with the identity element 
0 with respect to this operation. The inverse 
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element under this operation is called the 
quasi-inverse element, and an element that has 
a quasi-inverse element is called a quasi- 
invertible element (or quasiregular element). An 
element a of a unitary ring 1s quasi-invertible if 
and only if the element 1 — a is invertible. 


C. Examples 


(1) Rings of numbers. The ring Z of rational 
integers, the rational number field Q, the real 
number field R, and the complex number field 
C are familiar examples (— 14 Algebraic 
Number Fields, 257 Local Fields). 

(2) Rings of functions. The set K’ of func- 
tions defined on a set J and taking values in a 
ring K forms a commutative ring under point- 
wise addition and multiplication. In particular, 
let K =R, and let J be an interval of R. Then 
the set C?(I) of continuous functions, the set 
COU) of functions that are r-times continuously 
differentiable, and the set C°(J) of analytic 
functions on J are subrings (— Section E) of 
RI 

(3) Rings of expressions. The set K[X,,..., 
X,,] of polynomials and the set K[[X,, ..., 
X,] of tformal power series in indeterminates 
X;,, ..., X, with coefficients in a commutative 
ring K are commutative rings (— 369 Rings of 
Polynomials, 370 Rings of Power Series). 

(4) Endomorphism rings of modules. The set 
6&(M) of tendomorphisms of a *module M 
over a ring K is in general a noncommutative 
ring. In particular, if M is a finite-dimensional 
"linear space over a field K, then &,(M) can 
be identified with a tfull matrix ring (— 256 
Linear Spaces, 277 Modules). 

(5) For other examples — 29 Associative 
Algebras, 36 Banach Algebras, 67 Commuta- 
tive Rings, 284 Noetherian Rings, and 439 
Valuations. 


D. Homomorphisms 


A mapping f: A B of a ring A into a ring B 
satisfying conditions (i) f(a 4- b) ^ f(a) 4- f(b) 
and (ii) f(ab) = f(a)f(b) (a, be A) is called a 
homomorphism. If à homomorphism f is 
*bijective, then the inverse mapping f~': B.A 
is also a homomorphism, and in this case f 

is called an isomorphism. More precisely, a 
homomorphism (isomorphism) of rings is 
often called a ring homomorphism (ring isomor- 
phism). There exists only one homomorphism 
of any ring onto the zero ring. For unitary 
rings A and B, a homomorphism f: AB is 
said to be unitary if it maps the unity element 
of A to the unity element of B. By a homomor- 
phism, a unitary homomorphism is usually 
meant. In this sense there exists a unique hom- 
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omorphism of the ring Z of rational integers 
into an arbitrary unitary ring. A composite of 
homomorphisms is also a homomorphism. The 
identity mapping 1, of a ring A is an isomor- 
phism. A homomorphism of a ring A into itself 
is called an endomorphism, and an isomor- 
phism of A onto itself is called an automor- 
phism of A. If a is an invertible element of a 
unitary ring A, then the mapping xaxa ! 
(xe A) is an automorphism of A, called an 
inner automorphism. 

When condition (it) for a homomorphism is 
replaced by (ii) f(ab) = f(b) f(a) (a, be A), a 
mapping satisfying (i) and (ii) is called an 
antihomomorphism. In particular, if an anti- 
homomorphism f is bijective, then the inverse 
mapping f~t is also an antihomomorphism, 
and f is called an anti-isomorphism. Antiendo- 
morphisms and antiautomorphisms are defined 
similarly. 


E. Subrings, Factor Rings, and Direct Products 


A subset S of a ring A is called a subring of A if 
a ring structure is given on S and the canon- 
ical injection $ A is a ring homomorphism. 
Thus the ring operations of S are the restric- 
tions of those of A. If we deal only with uni- 
tary rings and unitary homomorphisms, then a 
subring S necessarily contains the unity ele- 
ment of A. The smallest subring containing a 
subset T of a ring A is called the subring gen- 
erated by T. The set of elements that commute 
with every element of T forms a subring and is 
called the commutor (or centralizer) of T. In 
particular, the commutor of A itself is called 
the center of A. 

A tquotient set A/R of a ring A by an equiv- 
alence relation R is called a factor ring (quo- 
tient ring) of A if a ring structure 1s given on 
A/R and the canonical surjection A— A/R 
is a ring homomorphism. This is the case if 
and only if the equivalence relation R is com- 
patible with the ring operations (i.e., aRa' and 
bRb' imply (a+ b) R(a' + b') and (ab) Rio bi 
Let « and f be elements of the factor ring A/R, 
represented by a and b, respectively. Then the 
definition of factor ring implies that «+ f(xf)) 
is the equivalence class represented by a+ 
b(ab). Every ring A has two trivial factor 
rings, namely, A itself and the zero ring O. If A 
has no nontrivial factor rings, then A is called 
a quasisimple ring (— Section G). If f: AB is 
a ring homomorphism, then the image f(A) is 
a subring of B, and the equivalence relation R 
in A defined by f(aRb« f(a) 2 f(b)) is com- 
patible with the ring operations of A. Thus f 
induces an isomorphism A/R f(A) (— Sec- 
tion F). 

If {A;},-, is a family of rings, the Cartesian 
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product A — IT;.,; A; forms a ring under the 
componentwise operations (a;)+ (b) 2 (a; + b;) 
and (a;) (bj) — (a;b;). This ring is called the direct 
product of the family of rings {4;} ;. The 
mapping p;: A A, that assigns to each (aj) its 
ith component a, is called a canonical homo- 
morphism. For any set of homomorphisms 

fi: B9 A, (ie I), there exists a unique homomor- 
phism f: B— A such that f, — p;o f for each i. 


F. Ideals 


A subset of a ring A is called a left (right) ideal 
of A if it is a submodule of the left (right) A- 
module of A (— 277 Modules). In other words, 
a left (right) ideal J of A is an additive sub- 
group of A such that AJ c J (JA c J). Under 
the operations induced from A, J is a ring 
(however, J is not necessarily unitary). A sub- 
set of A is called a two-sided ideal or simply 

an ideal of A if it is a left and right ideal. 

For an ideal J of a ring A, we define a rela- 
tion R in A by aRb«a—beJ. Then R is an 
equivalence relation that is compatible with 
the operations of A. Each equivalence class is 
called a residue class modulo J, and the quo- 
tient ring A/R is denoted by A/J and called the 
residue (class) ring (or factor ring) modulo J. If 
it is a field, it is called a residue (class) field. 
Conversely, given an equivalence relation R 
that is compatible with the operations of A, 
the equivalence class of 0 forms an ideal J of 
A, and the equivalence relation defined by J 
coincides with R. 

If f: A>B is a ring homomorphism, then 
the tkernel of f as a homomorphism of addi- 
tive groups forms an ideal J of A, and f in- 
duces an isomorphism A/J-> f(A). It Sis a 
subring and J is an ideal of a ring A, then S + 
J is a subring of A and SN J is an ideal of S. 
Furthermore, the natural homomorphism S^ 
(S -- J)/J induces an isomorphism S/SJ 
(S 4- J)/J (isomorphism theorem). 

A left (right) ideal of a ring A is said to be 
maximal if it is not equal to A and is properly 
contained in no left (right) ideal of A other 
than A. Similarly, a left (right) ideal of A is said 
to be minimal if it is nonzero and properly 
contains no nonzero left (right) ideal of A. 

If e is an idempotent element of a unitary 
ring A, then 1 — e and e are orthogonal idem- 
potent elements, and A= Ae + A(1 — e) is the 
direct sum of left ideals. This is called Peirce's 
left decomposition. Peirce's right decomposition 
is defined similarly. A left ideal J of A can be 
expressed as J — Ae for some idempotent 
element e if and only if there exists a left ideal 
J’ such that A —J 4- J' is a direct sum decom- 
position. More generally, if e,, ..., e, are ortho- 
gonal idempotent elements whose sum is equal 
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to 1, then A= Ae, + ... + Ae, is the direct sum 
of left ideals. Conversely, if A=J,+...+J, is 
the direct sum of left ideals and 1 =e; +... + 
e, (e;€ Jj) is the corresponding decomposition 
of the unity element, then e}, ...,e, are ortho- 
gonal idempotent elements. In particular, if 
J4, ..., J, are two-sided ideals, then each J; is a 
ring with unity element e;, and by a natural 
correspondence, the ring A is isomorphic to 
the direct product TI, J;. In this case, A is 
called the direct sum of ideals J,, ..., J, and is 
denoted by A= O}_, Jp or A= ZB, Jj. 

A ring A is called a left (right) Artinian ring 
if it is tArtinian as a left (right) A-module De 
if A satisfies the tminimal condition for left 
(right) ideals of A). A ring A is called a left 
(right) Noetherian ring if it is Noetherian as a 
left (right) A-module De. if A satisfies the 
*maximal condition for left (right) ideals of A). 
If A is commutative, left and right are omitted 
in these definitions. The property of being 
Artinian or Noetherian is inherited by quo- 
tient rings and the direct product of a finite 
family of rings, but not necessarily by subrings. 
For general rings, the maximal and minimal 
conditions for left (right) ideals are indepen- 
dent, but for unitary rings, left (right) Artinian 
rings are necessarily left (right) Noetherian (Y. 
Akizuki, C. Hopkins). 


G. Semisimple Rings 


The statement that a unitary ring A is tsemi- 
simple as a left A-module is equivalent to the 
statement that A is semisimple as a right A- 
module; in this case A is called a semisimple 
ring (— Section H). Every module over a semi- 
simple ring is also semisimple. A semisimple 
ring is left (right) Artinian and Noetherian. A 
semisimple ring is called a simple ring if it is 
nonzero and has no proper ideals except {0}, 
that is, if A is a quasisimple ring. Thus A is a 
simple ring if and only if A is a nonzero, uni- 
tary, quasisimple, left (right) Artinian ring. If A 
is a semisimple ring, then it has only a finite 
number of minimal ideals A,,...,A,, and A is 
expressible as the direct sum A= 4A, +... + Ap 
where each A; is a simple ring, called a simple 
component of A. Any ideal of A is the direct 
sum of a finite number of simple components 
of A. Quotient rings of a semisimple ring and 
the direct product of a finite number of semi- 
simple rings are also semisimple. 

Any left (right) ideal of a semisimple ring A 
is expressible as Ae (eA) for some idempotent 
element e, and Ae (eA) is minimal if and only if 
e is primitive. In particular, a minimal left 
(right) ideal is a simple left (right) A-module 
that is contained in a certain simple compo- 
nent. Two minimal left (right) ideals are iso- 
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morphic as A-modules if and only if they are 
contained in the same simple component. If A; 
I«ixn, are the simple components of A, then 
for each simple left A-module M there exists 
a unique i such that 4; M #10}, and M is 
isomorphic to a minimal left ideal contained 
in A,. 

If M is a finite-dimensional linear space over 
a (skew) field D, then the endomorphism ring 
A=6)(M) of M over D is a simple ring. Con- 
versely, for any ring A, the endomorphism ring 
D=6,(M) of a simple A-module M is a (skew) 
field (Schur’s lemma). If A is a simple ring, then 
any simple A-module M can be considered as 
a finite-dimensional linear space over D= 
&,(M), and A is isomorphic to &5(M) (Wed- 
derburn's theorem). Furthermore, if r is the 
dimension of M over D, then &5(M) is isomor- 
phic to the full matrix ring M,(D°) of degree 
r over the field D^, which is anti-isomorphic 
to D. The dimension r is also equal to the 
‘length of A as an A-module. The center of 
A — 65(M) is isomorphic to the center of D, 
which is a commutative field. Thus a simple 
ring is an associative algebra over a commuta- 
tive field (— 29 Associative Algebras). 


3 


H. Radicals 


Let A be a ring. Then among ideals consisting 
only of quasi-invertible elements of A, there 
exists a largest one, which is called the radical 
of A and denoted by R(A). The radical of the 
residue ring A/*R(A) is {0}. A ring A is called a 
semiprimitive ring if R(A) is {0}. On the other 
hand, A is called a left (right) primitive ring if it 
has a ‘faithful simple left (right) A-module. The 
radical 9R(A) is equal to the intersection of all 
ideals J such that A/J is a left (right) primitive 
ring. In a unitary ring A, (A) coincides with 
the intersection of all maximal left (right) 
ideals of A. Furthermore, in a left (right) Ar- 
tinian ring A, R(A) is the largest nilpotent 
ideal of A, and the condition R(A) = {0} is 
equivalent to the condition that A is a semi- 
simple ring. 

Among ideals consisting only of nilpotent 
elements of A, there exists a largest one, which 
is called the nilradical (or simply the radical) 
and denoted by 9(A4) (2 67 Commutative 
Rings). The nilradical of A/9t(A) is {0}. In 
general, 9t(A) is contained in *R(A), and if A is 
left (right) Artinian, we have 9t(A) 2 R(A). A 
ring A is called a semiprimary ring if A/9t(A) is 
left (right) Artinian and therefore semisimple. 
Furthermore, a ring A is called a primary ring 
(completely primary ring) if A/9(A) is a simple 
ring (skew field). A primary ring is isomorphic 
to a full matrix ring over a completely primary 
ring. 
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A. General Remarks 


In this article, we mean by a ring a tcommuta- 
tive ring with tunity element. Let R be a ring, 
and let X,,..., X, be variables (letters, indeter- 
minates, or symbols). Then the set of *poly- 
nomials in X,, ..., X, with coefficients in R is 
called the ring of polynomials (or polynomial 
ring) in n variables X,,..., X, over R and is 
denoted by R[X,,..., X,] (2 67 Commuta- 
tive Rings; 284 Noetherian Rings; 337 Poly- 
nomials; 368 Rings). On the other hand, when 
R and R' are rings with common unity element 
and R c R^, then for a subset S of R’ we denote 
the subring of R’, generated by S over R, by 
R[S]. When S (x,, ..., x,), then there is a 
homomorphism q of the polynomial ring 
R[X;,..., X,] onto R[S] defined by 


e(Xa;, La AE Xy) Za, aV XT a Eé 


(a ER). 


Bach 
If o is an isomorphism, then x,,..., x, are said 
to be algebraically independent over R; and 
otherwise, algebraically dependent over R. 
Thus the ring of polynomials in n variables 
over R may be regarded as a ring R[x,, ...,x,] 
generated by a finite system of algebraically 
independent elements x,, ..., x, over R. 
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B. Ideals, Homogeneous Rings, and Graded 
Rings 


Consider the polynomial ring R[X]=R[X,, 
..., X,] in n variables over a ring R. A poly- 
nomial f e R[ X] is a tzero divisor if and only if 
there is a nonzero member a of R for which af 
=Q. If a is an fideal of R, then RIX ]/aR[X]= 
(R/a) LX,, ..., X,]. Therefore, if p is a tprime 
ideal of R, then pR[ X] is a prime ideal of 
R[X ]. If R is a tunique factorization domain 
(u.f.d.), then R[ X] is also a u.f.d. If R is a nor- 
mal ring, then so is R[ X). By the tHilbert 
basis theorem, if R is tNoetherian, then R[ X] 
is also Noetherian. If m is the tKrull dimen- 
sion of R, then Krull dim R[X]2 n 4- m; the 
equality holds if R is Noetherian. If R is a 
field, then R[ X] is not only a u.f.d. but also a 
*Macaulay ring. 

A homogeneous ideal of R[ X] is an ideal 
generated by a set of *homogeneous poly- 
nomials f; (the degree of f, may depend on 4). 
When a is a homogeneous ideal, an element in 
RIX ]/a is defined to be a homogeneous ele- 
ment of degree d if it is the class of a homoge- 
neous polynomial of degree d modulo a, and 
the quotient ring R[X ]/a is called a homoge- 
neous ring. More generally, assume that a ring 
R is, as a tmodule, the direct sum Ło R; 
of its submodules R; (i — 0, 1,2, ...) and that 
R;Rjc Rj,; for every pair (i,j). Then we call R 
a graded ring, and an element in R, a homo- 
geneous element of degree d. (In some literature 
the term graded ring is used in a wider sense; 
see below) In a graded ring R, if an ideal is 
generated by homogeneous elements, then the 
ideal is called a homogeneous ideal (or graded 
ideal). In a graded ring R= D7, R, if the ideal 
È; R; has a finite basis, then R is generated 
(as a ring) by a finite number of elements over 
its subring Ry. Therefore, the graded ring 
R=>R; is Noetherian if and only if Ro is 
Noetherian and R is generated by a finite 
number of elements over Rọ. In this case, every 
homogeneous ideal is the intersection of a 
finite number of homogeneous tprimary ideals, 
and every prime divisor of a homogeneous 
ideal is a homogeneous prime ideal. 

The notion of a graded ring is generalized 
further as follows: A ring R is graded by an 
additive semigroup / (containing 0) if R is 
Lier R; (direct sum) and if R;R; c Riq; 


C. Zero Points 


(1) Zero Points in an Affine Space. We consider 
the polynomial ring K[X]— K[X., ..., X,] 

in n variables over a field K and a field O 
containing K. A point (a,, ...,a,) of an n- 
dimensional taffine space Q” = f (a,, ...,a,)la;€ 
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QJ is called a zero point of a subset S of K[X] 
if f(a,,...,a,)=0 for every f(X,,..., X,)eS. 

A point (a,, ...,a,) is called algebraic over K 
(K-rational) if every a; is algebraic over K (is 
an element of K). In this way we define alge- 
braic zero points and rational zero points. 
Zero points of S are zero points of the tideals 
generated by S. Therefore, in order to investi- 
gate the set of zero points of S, we may restrict 
ourselves to the case where S is an ideal. De- 
note by V(S) the set of zero points of S. If a,, 
a, are ideals of K[X], then (i) V(a, Na,)= 
V(a,a5)— V(a,)U V(a;); (ii) V(a; +a2)=V(a,)N 
V(a5); and (iii) if a, and a, have a common 
tradical, then V(a,) — V(a,). 


(2) Zero Points in a Projective Space. A point 
(åa, ..., 4a,) of an (n — 1)-dimensional tprojec- 
tive space over Q (with a;EQ, some a; #0, Je, 
A4 #0) is called a zero point of a polynomial 
f(X,, ..., Xn) If, f being expressed as È f; with 
homogeneous polynomials f; of degree i, 
Silai, ...,a,) 20 for every i (this condition 
holds if and only if f(Aa,,...,4a,)=0 for any 
element A in Q, provided that Q contains in- 
finitely many elements). Therefore, zero points 
of a subset S of K[X] are zero points of the 
smallest homogeneous ideal containing S. 
Thus, in order to study the sets of zero points, 
it 1s sufficient to consider sets of zero points of 
homogeneous ideals, and propositions similar 
to (i), (ii), and (iii) of part 1 of this section hold 
for homogeneous ideals a,, az. 


D. The Normalization Theorem 


Let a be an ideal of theight h in the poly- 
nomial ring K[X]=K[X,,...,X,,] in n vari- 
ables over a field K. Then there exist elements 
Yis- Y, of K[X] such that (i) K[X] is tin- 
tegral over K[Y]2 KL[Y,, ..., Y,] and (ii) 

Y,, ..., Y, generate aN K [Y] (normalization 
theorem for polynomial rings). 

Using this theorem, we obtain the following 
important theorems on finitely generated 
rings. 

(1) Normalization theorem for finitely gen- 
erated rings. If a ring R is finitely generated 
over an integral domain J, then there exist an 
element a (40) of J and algebraically indepen- 
dent elements z,, ...,z, of R over J such that 
the tring of quotients Rs (where S={a"|n= 
1,2,...}) is integral over I[a !,z,, ...,z,]. 

(2) If p is a prime ideal of an integral domain 
R that is finitely generated over a field K, then 
(height of p)+ (*depth of p)=(ftranscendence 
degree of R over K), and the depth of p coin- 
cides with the transcendence degree of R/p 
over K. In particular, if m is a maximal ideal 
of R, then R/m is algebraic over K. 
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(3) Hilbert's zero point theorem (Hilbert 
Nullstellensatz). Let a be an ideal of the poly- 
nomial ring K[X]=K[X,,..., X,] over the 
field K, and assume that the field O containing 
K is *algebraically closed. If fe K[X] satisfies 
the condition that every algebraic zero point 
(— Section C) of a is a zero point of f, then 
some power of f is contained in a. 


E. Elimination Theory 


Let fi,..., fy be elements of the polynomial 
ring R- I[X,,..., Xm Yj,..., Y, ] in m+n 
variables over an integral domain I. For each 
maximal ideal m of I, let o, be the canonical 
homomorphism with modulus m, and let Q,, 
be an algebraically closed field containing Tim. 
Let W, be the set of points (a,, ..., a,) of the n- 


dimensional affine space € over Q, such that 


m 


the system of equations o, ( f) (X,, .... Xm 
ai, ...,0,) 20 (i= 1,2, ..., N) has a solution in 
Q”, To eliminate X,,..., X,, from fi,..., fy is 


to obtain g(Y;, ..., Y)e I[ Y, ..., Kl such that 
every point of W, is a zero point of ¢,,(g) for 
every m; such a g (or an equation g — 0) is 
called a resultant of f,, ..., fy. The set a of 
resultants forms an ideal of I[ Y, ..., Y,], and 
Io, gui] is called a system of resultants if 
the radical of the ideal generated by it coin- 
cides with a. If J is finitely generated over a 
field, then, denoting by b the radical of the 
ideal generated by f,,..., fy, we have a=bN 
I[Y,,..., Y,]. In particular, let I be a field. It is 
obvious that Wa is contained in the set V of 
zero points of a. However, it is not necessarily 
true that V= Wo. If every f; is homogeneous 
in X,,..., X, and also in Y,,..., Yp, then we 
have V= Wo). 

If we wish to write a system of resultants 
explicitly, we can proceed as follows: Regard 
the f; as polynomials in X, with coefficients in 
I[X5,..., X,, Y, ..., Y,], and obtain resultants 
R(f;, f) by eliminating X, from the pairs f;, f;. 
Then eliminate X, from these resultants, 
and so forth. To obtain R(j;, fj, we may use 
Sylvester's elimination method. Namely, let f 
and g be polynomials in x with coefficients 
in I:f2agx" +a x"! +... + am g—bgx" + 
b,x" +... +b. Let D(f, g) be the following 
determinant of degree m + n: 


Ao d, °° An 0 E 0 

D Ag c 044 Om Q -.-. 0 
n eae 

0 -- O do d Am-1 lm 

Bs by se bib owes. 0 
" d 

Q nh b CES 
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Then D( f, g)=0 if and only if either f and g 
have a common root or a, = by = 0. Therefore, 
if I is a u.£.d. and ay and by have no common 
factor, then D( f, g) is the required resultant 
R(f,g). (For other methods of elimination see 
B. L. van der Waerden, Algebra, vol. II. For 
criteria on whether a finitely generated ring 
over a field is tregular — 370 Rings of Power 
Series B.) 


F. Syzygy Theory 


(1) Classical Case. The notion of syzygy was 
introduced by Sylvester (Phil. Trans., 143 
(1853)), then generalized and clarified by Hil- 
bert [3], whose definition can be formulated 
as follows: Let R=k[X,,...,X,] be a poly- 
nomial ring of n variables over a field k. R 
has the natural gradation (i.e., R is a graded 
ring in which each X; (1 <i<n) is of degree 1 
and elements of k are of degree 0). Let M 
be a finitely generated graded R-module. If 
fi s fm form a minimal basis of M over 
R consisting of homogeneous elements, we 
introduce m indeterminates u,,...,u,, and 
put F= tem Ru; the free R-module gen- 
erated by u,, ...,u,. Set deg(u) = deg( f) (1 < 
j&m) and supply F with the structure of a 
graded R-module. Let q be the graded R-. 
homomorphism of F onto M defined by o(uj) 
= f; Then N 2 Ker(o) is a graded R-module 
uniquely determined by M up to isomorphism 
(of graded R-modules); N is called the first 
syzygy of M. For a positive integer r, the rth 
syzygy of M is inductively defined as the first 
syzygy of the (r — 1)st syzygy of M. The Hilbert 
syzygy theorem states that for any finitely 
generated graded R-module M, the nth syzygy 
of M is free. In other words, M admits a free 
resolution of length <n, i.e., an exact sequence 
of the form 


02 F>... S FI S FO. M0, 


where v <n and each F (0x i « v) is a finitely 
generated free graded R-module. It follows 
that if M, denotes the homogeneous part of 
degree d in M, there exists a polynomial P(X) 
of degree <n—1 such that dim,(M,) — P(d) for 
sufficiently large d; P(X) is called the Hilbert 
polynomial (or characteristic function) of the 
graded R-module M. 


(2) Generalization by Serre. The syzygy theory 
was generalized by J.-P. Serre [2] as follows: 
Let R be a Noetherian ring and M a finitely 
generated R-module. Then we can find a fi- 
nitely generated free R-module F and an R- 
homomorphism q of F onto M. The kernel of 
om. called a first syzygy of M, is not uniquely 
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determined by M. However, if N, and N, are 
first syzygies of M, then there exist finitely 
generated tprojective R-modules P, and P, 
such that N, ® P, = N, @ P, (— 277 Modules 
K). For a positive integer r, an rth syzygy is 
defined inductively as in (1) of this section. An 
important result of Serre is that R is a fregular 
ring of tKrull dimension at most n if and only 
if an nth syzygy of every finitely generated R- 
module is *projective. 


(3) Serre Conjecture. D. Quillen (Invertiones 
Math., 36 (1976)) and A. Suslin (Dokl. Akad. 
Nauk SSSR (26 Feb. 1976)) solved the Serre 
conjecture by proving that every projective 
module over a ring of polynomials over a field 
is free. 


(4) Special Cases. In the following special 
cases, we can define the first syzygy of M 
uniquely up to isomorphism: (1) R is a Noe- 
therian tlocal ring and M is a finitely gen- 
erated R-module; and (ii) R is a graded Noe- 
therian ring $4,0 Ra, where Ro is a field and 
M isa finitely generated graded R-module [4]. 
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A. Rings of Formal Power Series (— 67 Com- 
mutative Rings; 284 Noetherian Rings; 368 
Rings) 


Let R be a commutative ring with unity ele- 
ment 1. Let E, be the module of thomogeneous 
polynomials of degree d in X,,..., X, with 
coefficients in R. A formal infinite sum ZZ at 
zo +a, +... a, ...-- of elements a,€ F; is 
called a formal power series or simply power 
series in n variables X,,..., X, with coefficients 
in R, and a, is called the homogeneous part of 
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degree d of the power series. The homoge- 
neous part de of degree zero is called the 
constant term. Addition and multiplication 
are defined by (£ a4) + (EZ bj) = X(azt by), 

(È aa) (X by) = La(Li+j;-44;5)). By these oper- 
ations, the set of power series forms a com- 
mutative ring, which is called the ring of 
(formal) power series (or (formal) power series 
ring) in X,,..., X, over R and is denoted by 
R[[X;. .... X,]] or R(X,, ..., Xn}. If there is a 
natural number N such that a,=0 for every d 
>N, then the power series È a, is identified 
with the polynomial ag +4, +... 4- ay. Thus 
R[X;, ..., X,] c R[[X,, ..., X,]]. Set X- 

X X;R[EX;,.... X,]). Then R[[X;..... X,]] 
is tcomplete under the X-adic topology 

(— 284 Noetherian Rings B). 

Assume that R' is a commutative ring con- 
taining R and having a unity element in com- 
mon with R, a' is an ideal of R' such that R' is 
complete under the a'-adic topology, and 
X4, ...,X, are elements of a’. Then an infinite 
sum Xc;, aU XI! ... Xin (each i; ranges over 
nonnegative rational integers and oC R) 
has a well-defined meaning in R' (namely, if S, 
is a finite sum of these terms such that 2i; « d, 
then the infinite sum is defined to be lim, ,,, S,). 
This element Ze xy ... A is also called a 
power series in x,, ..., x, with coefficients in 
R. The set of such power series in x,, ..., x, 
is a subring of R', called the power series 
ring in x,, ..., x, over R and denoted by 
R[[x;, ..., x,]] or R{x,,...,x,}. Defining o by 
EC, NN. XI = Eie a xb oxi we 
obtain a ring homomorphism ø: R[[X;, ..., 
X,]] R[[x,, ..., x,]]. If ọ is an isomorphism, 
then we say that x,, ..., x, are analytically 
independent over R. 

If m is a *maximal ideal of the formal power 
series ring R[[X;, ..., X,] ], then m=MNR is 
a maximal ideal of R and m is generated by m 
and X,,...,X,. An element f of the power 
series ring is tinvertible if and only if its con- 
stant term f, is an invertible element of R, and 
in this case f !=EZo fo  -(fo—fY. HR 
is one of the following, then R[[X;,..., X,]] is 
also of the same kind: (i) Noetherian ring, (ii) 
tlocal ring, (iti) tsemilocal ring, (iv) tintegral 
domain, (v) tregular local ring, (vi) Noetherian 
tnormal ring. But even if R is a tunique fac- 
torization domain (u.f.d.), RE[X,,...,X,J] 
need not be a u.f.d. (If R is a field, or more 
generally, if R is a regular semilocal ring, then 
R[[X;, ..., X,]] is a u.£.d.). In particular, a 


formal power series ring k[[ X]] in one vari- 


able X over a field k is an integral domain 
whose field of quotients is called the field of 
(formal) power series (or (formal) power series 
field) in one variable X over k and is denoted 
by k((X)); an element of k((X)) is expressed 
uniquely in the form XZ, a, X" (a,ek,reZ). 
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B. Rings of Convergent Power Series 


Let K be a field with multiplicative *valuation 
v (for instance, K = C, the complex number 
field, and v(x) 2 |a] for xe C). A formal power 
series f(X,,..., X,) X c.i X... Xin is 

said to be a convergent power series if there 

are positive numbers r;, ...,r,, M such that 
rien, ii np <M for every (i, -s in). 

In this case, if a;e K and v(aj) « r,, then 

Ze, aj... ap has its sum in the *completion 
of K. The set of convergént power series is a 
subring of K[[X,, ..., X,]]. It is called the ring 
of convergent power series (or convergent power 
series ring) in n variables over K and is de- 
noted by K(X,,..., X,»» or K{X,,..., Xn} 

It is a regular local ring of fKrull dimension 

n. Hence it is a u.f.d. and its completion is 
K[LX,,..., X,]]. F v is a *trivial valuation, 
then K(X,,..., X,» = K[[X;, ..., X,]]. 


Weierstrass's Preparation Theorem. For an 
element f - Ze, , Xp. Xipe KXX,,..., 
X,,>>, assume that co. 9;—O for i=0, 1, ..., 
r — | and ce 9, 0. Then for an arbitrary 
element g of K ((X,,..., Xa, there exists a 
unique qe K «Xi, ..., X,5» such that g — 
qf ce Xl XiKQX,,..., X, 4». In partic- 
ular (considering the case where g — X;), there 
is an invertible element u of K ((X,,..., X,5» 
such that fu -fo- fy X,4- ... f XL 
X; (ie K&X,,..., an Ai l 

By this theorem, we see easily that if a is an 
ideal of theight h of K ((X,,..., X,55, then 
K«X,,...,X,>>/a ts isomorphic to a ring that 
is a finite module over K ((X,, ..., X, 4,5». 

If p is a tprime ideal of K ((X,, ..., X,55, 
then pK[[X,,..., X,]] is a prime ideal. 


The Jacobian Criterion. Let K be a field, and 
let R be the ring of polynomials K ( X,, ..., X,], 
the ring of formal power series K[[ X,, ..., 
X,]], or the ring of convergent power series 
KX,,..., X,»» in n variables X,,..., X, 

over K. tPartial derivatives 0/0 X; are well 
defined in R. For f,,..., f€ R, a Jacobian 
matrix J( f,,..., f) is defined to be the t xn 
matrix whose (i, j)-entry is €f;/0X;. Let p bea 
tprime divisor of the ideal X; f, R, and let q be 
a prime ideal containing p. If the *rank of 
(J(fi. ..., f) modulo q) is equal to the height 
of p, then the ring R,/X f; R, is a regular local 
ring. The converse is also true if K is a perfect 
field (if K is not a perfect field, then, modifying 
J(fi, ..., f), we can have a similar criterion 


[1]). 


C. Hensel Rings 


A Hensel ring (or Henselian ring) is a com- 
mutative ring with unity element satisfying the 
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following two conditions: (i) R has only one 
maximal ideal m (Le., R is a *quasilocal ring); 
and (ii) if f, go, hy are monic polynomials in 
one variable x (here, a polynomial in x is 
called monic if the coefficient of the term of the 
highest degree is 1) such that f — goh; € mR[x], 
go R[x] - ho R[x] + mR[x] 2 R[x], then there 
are monic polynomials g, he R[x] such that 
f=ghand ggg, hz hy modulo m. 

Important examples of Hensel rings are 
complete local rings, rings of convergent 
power series, and tcomplete valuation rings. 
When R is a Hensel ring, a commutative ring 
R' with unity element such that R' is a finite R- 
module is the direct sum of a finite number of 
Hensel rings. For any quasilocal ring Q, there 
exists a Hensel ring OQ, called the Henseliza- 
tion of Q (for details — [1]), for which the 
following statements hold: (i) Ó is a *faithfully 
flat Q-module; (ii) if m is the maximal ideal of 
Q, then the maximal ideal of 0 is m0, and 
Õ/mÕ = Q/mQ; (iii) if R is a Hensel ring that 
contains Q and has a maximal ideal n, and 
nimm. then there ts one and only one Q- 
homomorphism q of Ó into R; (iv) if Q isa 
*normal ring, then the Q-homomorphism ø is 
an injection; (v) if Q is a local ring, then Ó is 
also a local ring, and Q is dense in Q. 
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371 (XVIII.10) 
Robust and Nonparametric 
Methods 


A. General Remarks 


Robust and nonparametric or distribution-free 
methods are statistical procedures specifically 
devised to deal with broad families of proba- 
bility distributions. 

In the theory of statistical inference it is 
usual to assume that the probability distri- 
bution of the population from which the ob- 
served values are chosen at random is specified 
exactly except for a small number of unknown 
parameters (— 401 Statistical Inference). In 
practical applications, however, it often hap- 
pens that the assumptions made for the model, 
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especially those about the shape of the distri- 
bution, may not hold for the actual data. In 
such cases robust and/or nonparametric pro- 
cedures that do not require exact knowledge of 
the shape of the distribution and yet prove to 
be relatively efficient or valid are required. The 
term nonparametric or distribution-free is used 
for problems of testing hypotheses, and the 
term robust is mainly used for problems of 
point estimation (— 396 Statistic; 399 Statis- 
tical Estimation; 400 Statistical Hypothesis 
Testing). 

Although the idea of the sign test appears in 
the work of J. Arbuthnot (1710), the theoret- 
ical foundation for nonparametric tests was 
first given in the proposals for the permutation 
test by R. A. Fisher (1935), the rank test by F. 
Wilcoxon (1945), and the test based on U- 
statistics by H. B. Mann and D. R. Whitney 
(1947). In the years that followed two impor- 
tant ideas appeared: the concept of asymptotic 
relative efficiency by E. J. G. Pitman (1948) 
and the development of the theory of U- 
statistics by W. Hoeffding (1948). H. Chernoff 
and I. R. Savage (1958) showed, in studying 
the asymptotic distribution of a class of rank 
statistics, that the asymptotic efficiencies of 
nonparametric tests are incredibly high. These 
findings accelerated the studies of nonpara- 
metric tests; recent progress is summarized in 
the books by J. Hájek and Z. Sidák [1], M. L. 
Puri and P. K. Sen [2], R. H. Randles and 
D. A. Wolfe [3], and P. J. Huber [6]. 

On the other hand, G. E. P. Box (1953) first 
coined the term robustness in his sensitivity 
studies, in which he investigated how the 
standard statistical procedures obtained under 
certain assumptions are influenced when such 
assumptions are violated. Two papers by J. W. 
Tukey (1960, 1962) provided the initial founda- 
tion for robust estimation. J. L. Hodges and 
E. L. Lehmann (1963) noticed that estimators 
of location could be derived from nonpara- 
metric tests and that these estimators have 
sometimes much higher efficiency than the 
sample mean. À similar study for scale was 
made by S. Kakeshita and T. Yanagawa 
(1967). Huber (1964) proposed an estimator of 
location by generalizing the method of least 
squares. Along with the idea of the influence 
curve introduced by F. R. Hampel (1974) the 
estimator proposed by Huber has become a 
core of subsequent studies of robust estimation. 
K. Takeuchi (1971) proposed an adaptive 
estimate that is asymptotically fully efficient 
for a wide class of underlying distribution 
functions. The developments of the theory of 
robust estimation are reviewed by Huber 
[4-6] and R. V. Hogg [7]. Various proposed 
estimators are compared in the book by D. F. 
Andrews et al. [8]. 
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B. The One-Sample Problem 


Let F(x) be a tdistribution function of a tran- 
dom variable X, (X,, ..., X,) be an indepen- 
dent trandom sample of size n from F(x), and 
(x4, ..., Xa) be an observed sample value. The 
100p percentile of F is denoted by £,, i.e., 
F(£,) — p. For testing the thypothesis H: č, < 
&? against the falternative hypothesis A: 

ip £9, the following procedure is proposed. 
Let i(x,, ..., x,) be the number of x, that are 
greater than ET. A test procedure by the fol- 
lowing Ttest function ¢ is tuniformly most 
powerful in some neighborhood of €,=€° for 
the double exponential distribution, where 


Q(x,, ..., x,) is defined by the equations 
1 when i(x,,...,X,)>¢, 
Q(x,,...,x,) 2 4a when i(x,,...,x,)-c, 
O when i(x,,...,x,) «c 


(0<a<1t,0<c<n). This procedure is called 
the sign test. 

Suppose that F(x) is symmetric about x — 
Zus, Let Rj’ be the rank of |X;— ¿°| among 
1X, — £9, ...,|X, — EL and let V (t) — 1, 0 ac- 
cording as t>0, <0. Set 


$05, ..., Ken Y a (Rz P(X; — £9) 
ici 


for some weights a,(1), ..., a, (n). The following 
procedure o, called the signed rank test, is 
also used for testing the hypothesis H :£,,; < &? 
against the alternative hypothesis A: č;  £?: 


1 when S,(x,,...,x,)» 6, 


a when $,(x,,...,x,)—-c, 


O when S,(x,,...,x,) «c. 


The procedure with a,(i)=i is frequently used 
and is called the Wilcoxon signed rank test, 
which is the uniformly most powerful rank 
test in a neighborhood of £j, — Z? for F(x) - 
1/(1+e7*), the logistic distribution. 


C. The Two-Sample Problem 


Let F and G be continuous distribution func- 
tions of random variables X and Y, respec- 
tively, (X,, ..., Xm) and (Y,, ..., ¥,) be the corre- 
sponding random samples, and (x,, ..., Xm) and 
(D, Yn) be the respective sample values. 
Consider the problem of testing the hypothesis 
H:F(x)z G(x) against the alternative hypoth- 
esis A, F(x) E G(x) or A;:F(x) 2 G(x) for all x 
and F(x)# G(x). When the alternative hypoth- 
esis A, is true, we say that the random vari- 
able Y is stochastically larger than X and write 
F >G. A frequently used example of such an 
alternative hypothesis A, is G(x) 2 F(x — 0), 0 
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>0. Let # be the family of all strictly increas- 
ing continuous functions. Then the hypothesis 
H and the alternative hypothesis A, are in- 
variant under the group of transformations 

of the form x;=h(x,), y; h(y) (i 1, ....m; j 
=1,...,n,he ). The tmaximal invariant 
statistic in this case is the rank (R,, ..., Rm) 

of (X,,..., Xm) or the rank (S,, ..., $) of (Y;, 
..., Y) when the combined sample (Xj, ..., 
Xm; Yi, s.s, Y,) is ordered in an ascending order. 
If a test function q(x,, ..., X4; Y1 ---, Yn) satis- 
fies P,(F, F) &x and P,(F, G) za for any F>G, 
then q is considered a desirable test, where 


PAF, G)= |... [ot nis 


x [] dF (x) T] 4G). 


Lehmann’s Theorem. If ¢ satisfies the con- 
ditions stating that yf > y; (j= L, ..., n) yield 
gx, Xu Yt S VE) Z P(X, Kai Yrs Yn) 
then P,(F, G) 2 P,(F, F). If in addition o is a 
tsimilar test, then o is unbiased (— 400 Statis- 
tical Hypothesis Testing C). 

The Wilcoxon test (or the Mann-Whitney U- 
test) is described by a test function o — 1 when 
U zc and 9 =0 when U «c, where U is a tU- 
statistic defined by 


q m n 


x , Vero y) 


mn j=] j= 


U= 


with w(x, y) 2 1 when x € y and w(x, y)=0 

when x> y. This test is similar and unbiased. 
Testing the hypothesis H: F =G against 

the alternative hypothesis A: F #G = F(x/o), 

o> 1, is another two-sample problem, for 

which the following test was proposed by T. 

Tamura. The test function is given by ọ = 

1 for U >c and g=0 for U «c, where U = 

( "Gr"? Ze Mey V(x;, Xy; yj Vj) with y(u, u'; 

v,U)-1 whenv«ucv,v«u <v orv <u< 

v, v <u «v and y(u, u’; v, v) =0 otherwise. 
The following statistic Ty is used frequently 

in nonparametric problems. Let x,, ..., X, 

Yis ---, Ya be arranged in order of magni- 

tude. Set z, — +1 or 0 when the kth value (k= 

1,2, ...,n-- m) in the arrangement is an x; or 

y; respectively. For a given set of N —n m 

reals {e,}, Ty is defined by Ty 2m ! Dp e,2,. 

Set Hy(x)=Ay FA (x) + (1 — Ay) G,(x), where 

F, (x) and G,(x) are the tempirical distribution 

functions based on (x,, ..., Xm} and (y,, ..., Va) 

respectively, and 0< 4; € Ay 2 m/N €1—4, «1. 

Then Ty is represented by the integral 


Luna 


with e, =Jy(k/N). Chernoff and Savage [9] 
proved that under some regularity conditions 
the asymptotic distribution of Ty is normal 
and that the asymptotic mean u and the vari- 
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ance o? of Ty are given by 


u= IL (H(x)) dF (x), 


No? =2(1—A) dE —G(y)) 


x<y 


x J'(Ho9)J'(GI(y) dF (x) dF (y) 


= 
FELT | IER — F(y)) 


x<y 


x (Gor onde). 


where J(H) — Im, Jy(H), H(x)  AF(x) - 

(1— 4)G(x), and 4— lim åy. When e, — k/N, the 
statistic Ty is equivalent to the *U-statistic in 
the Wilcoxon test. When e, is the mean E(Z,) 
of the kth order statistic Z, in an independent 
sample of size N from N(0, 1), then the test by 
Ty 1s called the Fisher-Yates-Terry normal 
score test. When J is the inverse function of the 
distribution function of N(O, 1), then the test is 
called the van der Waerden test. 


D. The k-Sample Problem 


Let (Xj, J= 1, ..., n) bea random sample of 
size n; from the population with a distribu- 
tion function F;(x) for each i— 1, ..., k. The k- 
sample problem is concerned with testing the 
hypothesis H: F,(x)— ...— F,(x) against an 
alternative hypothesis A,: not all the F;(x) 

are equal, A,: F,(x)= F(x — 0) with 0,20, or 
A3: F(x) = F(x/oj) with o; o. Several tests have 
been proposed for this problem, using quadra- 
tic forms of the vector-valued U-statistic U — 
(Ul, ..., U*) whose coordinates U! are defined 
by means of a function 


Voies X pio epe ets Mage do deese 


When N =È n; œ with n; — p; N, 0 « p; « 1, and 
> p;= 1, then 


V -(/ N(U! — E(U!), ...,./N(U*— E(U*))) 


has asymptotically a *multivariate normal 
distribution N(0, 2). Let B be the projection 
matrix corresponding to the eigenspace for the 
zero eigenvalues of the matrix 2, and let A be 
a matrix such that AB=0, XA =I — B. The 
statistic VA'V has asymptotically a tnoncen- 
tral chi-square distribution with degrees of 
freedom — rank 2. Several kinds of test repre- 
sented by a critical region of the form VAV’ > 
c are proposed, among which the Kruskal- 
Wallis test is a particular one having 


: 1 
W(x 5.5%) =) ——óo(xu X) t=1,...,k, 
a Mina 
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as basic functions, where ó(x, y) - 1 when x « y 
and ó(x, y) - 0 otherwise. 


E. Asymptotic Relative Efficiency of Tests 


If there is more than one test procedure for a 
given testing problem, then one may wish to 
compare these procedures. Let (9,) and {Yn} 
be two sequences of level a tests, where o, and 
y, are test functions based on a sample of 

size n. The tpower functions of o, and y, are 
denoted by £(@|¢@,) and f(0|w,), respectively. 
Let 0 be a real parameter and {6,} be such that 
0, 0, as i oo. Consider a hypothesis 0 = 6; 
and a sequence (0, of alternative hypotheses. 
If, for any increasing sequences (n;) and {n*} 
of positive integers satisfying o «lim f(6,| Pn) = 
_ lim (8;| Wr) < 1, limn?/n; (—e((6,], {Wn}, say) 
exists and is independent of « and lim Ge, ), 
then e is called Pitman's asymptotic relative 
efficiency of {¢,} against (y,). Suppose further 
that the tests To, and (V,) are based on sta- 
tistics T, 2 t,(X) and T;* — t*(X), respectively, 

in the following manner: 


1 when t,(x)^ c, 
Q,(X)—4a when t,(x)=c, 
O when t,(x) « c, 


1 when tz(x)»c*, 


W,(x)= 4b when t*(x)=c*, 
0 when tf(x)«c*, 
where X=(X,,..., X,) and x 2(x,, ..., x,). Put 


0, =0 and H = k/ /n (k 2 constant) for simplic- 
ity. If T, and T,* are asymptotically normal, 
then under some conditions e is given by the 
formula 


(dE, (T,)/40 al o6( Tr") 


e=lim : 
(dE,(T,*)/d0 | s.) 95(T,) 


As an example, consider a two-sample prob- 
lem on a flocation parameter. If the popula- 
tion distribution is normal and the Wilcoxon 
test is used to test the hypothesis of equality of 
means, then its asymptotic relative efficiency 
against Student's test is 3/z. For the same 
problem, the asymptotic relative efficiencies of 
the Fisher-Yates-Terry normal score test and 
the van der Waerden test against Student's test 
are both unity. For the hypothesis of equality 
of means in the k-sample problem, the asymp- 
totic relative efficiency of the Kruskal-Wallis 
test against the F-test is 3/z, provided that 

the sample is distributed normally. 


F. Kolmogorov-Smirnov Tests 


Let F,(x) be the empirical distribution function 
based on a random sample of size n from 
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Fo(x). Set 

d, = sup|F,(x)— Fo(x)|, 
D, —sup(F,(x) — Fo(x)), 
F(X) — Fo(x) 








5,= Sup , 
a<Fo(x)|  Fo(x) 

s- sup BOPO 
ax Fo(x) Fatz) 

Then 


lim Pd, <z/x/n) - LG) 


= Y (—1)fe Hei, 


keen 


lim P(D, <z/,/n)=K(z)=1—e72”, 


J 


KI ke) ;Nj-71 
UE (o+4) À 
n n 


lim B(s alt È (17 


[n(1-DY / 
B(D«D)-1-D Y |. 
J=0 


x (2k + Her (2E eae” 


: 3 z./a/(1—a) ` 
lim P(S, « z/ /n)— f | e"? dt. 
no T Jo 


The statistics d,, D,, s,, S, are frequently used 
to test the hypothesis F(x)= F(x). (This prob- 
lem is called testing goodness of fit.) 

In a two-sample problem, let F,,(x) and 
G,(x) be two empirical distribution functions 
based on samples of sizes m and n from F(x) 
and G(x), respectively. Set 


dis =sup | F,(x) = G,(x)|, 
Din = sup(F,(x) GR G,(x)). 


If the hypothesis F = G is true, we have 
lim P,(dm n « z/ / N) - L(z), 


lim POD, , « z// N) - K(z), 


provided that m and n tend to oo so that N — 
mn/(m 4- n) — oo and m/n is constant. Taking 
account of these facts, d,, , and D,, , are used 
to test the hypothesis F =G. The tests using 
the statistics dp, Dy, das D,, , are called 
Kolmogorov-Smirnov tests. 


G. Interval Estimation 


Let (X1, ..., X,) be an independent random 
sample from the population with a distri- 
bution function F(x — 0), where 0 is an un- 
known location parameter, and (x,, ..., x,) 
be its observed value. Suppose that F(x) is 
continuous and symmetric about the origin. 
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Using the statistic $, = S(x,, ..., x,) for the 
one-sample nonparametric test for testing the 
hypothesis H :0 —0, a tconfidence interval of 0 
is constructed as follows. For an appropriately 
given y (0 « y « 1), select constants d, and d, in 
the range of S(x,,..., x,) that satisfy 


P (d, «S(X,, ..., X,) «d,] 21—y, 


where P, means the probability under the 
hypothesis H : 0 — 0. If there exist statistics 
L.(x,...,x,) and U,(x,, ..., x,) such that 
L(x,, ..., x,) &0 « U,(x,, ..., Xp) if and only if 
d, «S(x, —0,..., x,—0) « d, for all 0, then the 
confidence interval of 0 with 100(1 — 7)? con- 
fidence coefficient is given by (L,(x,,...,X,), 


U.(x,, ..., x,)). This interval is distribution- 

free, i.e., it holds that 

PÍL,(X,, ..., X,) «0«U,X,,..., X) 21—y 
for all F. 


When $, is the statistic for the Wilcoxon 
signed rank test, L, and U, are given by L,— 
Wi -4,, and Us = Wa -4,, where M=n(n+ 
1)/2 and Wues... < Way are ordered values 
for M averages (x; + xj/2 (i € j — 1,2, ...,n). 


H. Point Estimation 


Let (X,, ..., X,) be an independent random 
sample from the population with a distri- 
bution function F(x —0), where 0 is an un- 
known location parameter, and let (x,,..., Xp) 
be its observed value. 

There are four methods of constructing 
robust estimators of 0. Let X, < ...« Xim be 
ordered values of X,,..., X,. The first method is 
to use T; —a, Xu - ...- a, X, for some given 
constants d,, ...,a, such that 37.,a;— 1l. T, is 
called the L-estimator. Àn example is 


T, (a) =(PX qan+1) t X ((an}+2) T 
+ DÄ teen EX 2a), 


where p= 1 4- [xn] — xn. This estimator is called 
the «-trimmed mean. Let J be a real-valued 
function such that f% J(t)dt=1, and set a, — 
(i is J(t) dt; then as n oo, T, converges to 
T(F)— f4 J(t)F ~ (t)dt in probability. Suppose 
that F is a distribution function having an 
*absolutely continuous density function f. 
Denote the derivative of f by f’, and let I(F) 
be the tFisher information on 0. Set y(t)= 
—f'(/f(t) and J(t) - V'(CF OIF). Then 
Chernoff, J. L. Gastwirth, and M. V. Johns 
[10] proved that under some regularity con- 
ditions T, is an *asymptotically efficient es- 
timator of 0 for F. 
Let p be a real-valued (usually convex) 

function of a real parameter with derivative V 
— p'. The second method is to estimate 0 by T, 
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by minimizing ZE, p(x; — T,) or by satisfying 


T, is called the M-estimator. When p(t) — t?, it 
agrees with the least squares estimator. Let 
di) be the distribution function of the stan- 
dard normal distribution, H (x) be an arbi- 
trary continuous distribution function which 
is symmetric about the origin, F be a class 
of distribution functions of the form F(x)= 

(1 —2e)®(x)+ eH (x) for a given e (0 « e « 1), 
and V(p, F) be the asymptotic variance of 

T,. Huber [11] proved that p minimizing 
supr, V(p, F) is given by py(t) - 2/2, K|t|— 
K?/2 defined for |t| < K, >K, respectively, for 
some constant K. Under quite general condi- 
tions, the M-estimator converges as n— oo 

to T(F) in probability, which is defined by 
(u(x—T(F)) dF (x) =0. If (t) - —f'(/f() is 
chosen for w(x), then T, is the *naximum likeli- 
hood estimator of 0 for F and is asymptoti- 
cally efficient under some regularity condi- 
tions. Generally, the M-estimator defined 
above is not scale invariant. A scale invariant 
version of the M-estimator is obtained by 
replacing the defining equation by 


e LX Ta 
Zoll - Jes a) 


n 





where S, is any robust estimate of scale, e.g., 
the median of {|x;— M|/0.6745],., ,....,, where 
M is the sample median, or by solving the 
simultaneous equations (1) and 


with respect to T, and S,. In the context of the 
maximum likelihood estimation, y is chosen to 
be y(t) 2 ty(t) — 1 (— 399 Statistical Estimation 
P). 

The third method employs nonparametric 
tests for testing the hypothesis H:0 —0 against 
the alternative hypothesis 4:07 0. Let J bea 
real-valued and nondecreasing function such 
that f4 J(t)dt=0 and R, (6) be the rank of |X, 

— 0| among | X, — 0|, ...,|X,— 0|. Set (t) -1,0 
according as t0, <0 and S(X, —0, ..., X, — 0) 
—YL4J(R (0) - nQn-4 1) ¥(X, —0). Let 


0* =sup{6; S(X, —0,..., X, —0) ul, 
0** —inf(6; S(X, —0,..., X, —0) <p}, 


where u ts the expected value of S(X,, ..., X,) 
under the hypothesis H:0 —0. Then an es- 
timator of 0 is defined by T, —(0* + 0**)/2. 
Hodges and Lehmann [12] first proposed this 
technique, and this estimator is called the R- 
estimator. When F(x) is symmetric about the 
origin and J(t) 2 t —1, S tends to be the statis- 


tic for the Wilcoxon signed rank test, and the 
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R-estimator reduces to the median of n(n + 1)/2 
averages (X; + X)/2 (1 <i<j<n). As n oo, the 
R-estimator converges in probability to T(F), 
defined by 


NEE 


For symmetric F, the R-estimator defined by 
the statistic S with J(t)  —f'(F !()/f(F ' (t)) 
is asymptotically efficient under some regular- 
ity conditions. 

Although the above three methods provide 
robust estimators, which are seldom affected 
by outlying observations or contamination by 
gross errors, their behavior still depends on F. 
The last method of constructing robust es- 
timators consists of estimating 0 adaptively 
by utilizing information on the shape of F. 
Among these, a striking one is the asymptoti- 
cally fully efficient estimator for a wide class 
of F proposed by Takeuchi [13]. The es- 
timator is constructed by using subsamples 
of size K (K « n) drawn from the original 
sample, estimating the elements of the tco- 
variance matrix of the order statistics by U- 
statistics, and selecting the best weights of the 
L-estimator. L. A. Jaeckel [14] made an a- 
trimmed mean adaptive estimator by selecting 
an « that minimizes the estimated asymptotic 
variance. 


I. The Influence Curve 


Let T(F) be a functional of a distribution 
function F, and let an estimator T, of 0 cal- 
culated from an empirical distribution function 
F, converge to T(F) in probability as n oo. A 
real-valued function IC(x; F, T) defined by 


T(1— 3 
Ire F, T)=lim PO 10) etree) TUS 
EH E 


for all x 


is called the influence curve, where 6, is the 
distribution function of a point mass | at x. 
The curve was first introduced by Hampel 
[15] to study the stability aspect of estima- 
tors against a small change of F. As an exam- 
ple, when F is symmetric about the origin, 
the influence curve for the «-trimmed mean 
IC(x; F, T) is given by 


F~ (o)/(1 — 22) 
x/(1 —2a) 


when x«F (a), 
when F7!(a)<x 

«F !(1—2), 
FH at Ae when x» F^!(1—9). 


By substituting the empirical distribution 
function F, for F in T(F), we can represent the 
robust estimators discussed in Section H, at 
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least approximately, by T, — T(F,). Under some 
conditions, it can be proved that as n— oo, 


n^ (T(F,) — nn » IC(X;; F, T))50 


in probability. Thus it follows that n!^(T, — 
T(F)) is asymptotically normally distributed 
with asymptotic variance f (IC(x; F, T)? dF(x). 


J. The Regression Problem 


Consider the linear regression problem (— 403 
Statistical Models D) 


P 
X;— Y, bajt En i-1,2,...,n, 
feat 


where the X; are observable variables, the 6; 
are regression coefficients to be estimated, the 
aj are given constants, and £4, £5, ...,£, are 
identically and independently distributed 
random errors whose distribution function is 
given by F(x). The idea of the M-estimator is a 
direct generalization of the method of least 
squares; namely, to adopt (0,, ..., 0) as an 
estimator for (6,, ...,0,) that minimizes 

ZE, P(X; — Baal for some function p such as 
the one described above. 

R-estimators of the regression coefficients 
are obtained by minimizing 27. a,(R)A,, 
where A, — X;— 2,6;a;, R; is the rank of A; 
among A,,...,A,, and a,(-) is some monotone 
function satisfying 17, a,(i) 2 0. It has been 
proved that minimizing 277. ., a,(Rj)A, is asymp- 
totically equivalent to minimizing 


P 
2 
j=i 


the properties of which were first studied by J. 
Jurečková [16]. 


n 


Y a,(Rj)a; 


i=1 


5 








K. Dependence 


Let (X,, Y,), ..., (X,, Yp) be random samples 
from a population with a bivariate distri- 
bution function, R; be the rank of X; among 
Xio ..., X, when they are rearranged in an 
ascending order, and 5, be the rank of Y, 
among Y,,..., Y, defined similarly as R;. Vari- 
ous quantities are devised to measure the 
degree of dependence between X and Y. 


(1) Spearman's Rank Correlation. Set d; — R; 
—§,. Then r,= 1 — 6 £; d? /(n? —n) is called 
Spearman’s rank correlation. If there is no 
dependence between X and Y, ie., if the X; and 
Y; are independent random variables, then 
E(r,) 20 and V(r)) 2 (n— 1) !. 


(2) Kendall’s Rank Correlation. Take pairs 
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9;;— l; otherwise, p= —1. The statistic r, = 


2 
tion, where X runs over all possible pairs 
chosen from (R,,8,), ..., (R,, Sj). 
If there is no dependence between X and Y, 
then E(r,) 20 and V(r,) 2 2(2n + S)/(9n(n — 1)). 


=f 
R 2:09; is called Kendall’s rank correla- 


(3) Rank Correlation. R; and S,, i— 1, ...,n, 
are replaced by the corresponding normal 
scores, i.e., the means of order statistics in an 
independent sample of size n from N (0, 1); then 
the usual tsample correlation coefficient is cal- 
culated from these scores. This correlation co- 
efficient re is called rankit correlation; and if 
there is no dependence between X and Y, then 
E(z) - 0, V(z) 2 (n—3) !, asymptotically, where 


T fg : S : 
(tFisher’s z-transformation). 
rR 
These statistics are used to test the hypoth- 
esis of independence. 





ND 
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372 (XXI.3) 
Roman and Medieval 
Mathematics 


The Romans were interested in mathematics 
for everyday use; their arithmetic consisted of 
computation (by means of the abacus), weights 
and measures, and money. For their mone- 
tary system, they developed a computational 
method using duodecimal fractions. Julius 
Caesar (102?—44 B.c.), known for his calendar 
reform in 46 B.c., also undertook to measure 
his territory, which aroused a demand for 
land surveying techniques. Books on prac- 
tical geometry which provided this knowledge 
were called gromatics (a “groma” was a land 
surveying instrument). Toward the end of the 
Western Roman Empire (476), Greek mathe- 
matics was studied; during this period Boe- 
thius (c. 480—524) wrote his two books on 
arithmetic and geometry. The former was a 
summarized translation of Nichomachus' 
book, and the latter included propositions 
from the first three books of Euclid's Elements 
(without proof) and practical geometry. 
Music, astronomy, geometry, and arith- 
metic, which constituted the *mathemata" of 
Plato's Academy (closed in 529), were treated 
as the "quadrivium" (the four major subjects) 
in the Encyclopedia of Martianus Capella, 
Cassiodorus, Isidorus, Hispalensis, and others. 
After the establishment of the Roman Church 
in the 5th century, the quadrivium was to be 
studied for the glory of God. Books on mathe- 
matics from this period laid emphasis on the 
computation of an ecclesiastical calendar and 
mystical interpretations of integers, as seen 
in books by Bede Venerabilis, Alcuin, and 
Maurus from the 7th through 9th centuries. 
Arabian science was imported first through 
Spain— under Moorish influence beginning in 
711, the year of the fall of the Visigoths—and 
then through the Crusades (1096-1270). Com- 
putation with figures, originating in India, 
replaced the abacus in the 12th century, when 
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Arabian books on arithmetic and algebra, 
along with Greek books on geometry and 
astronomy (such as books by Euclid and 
Ptolemy), were translated into Latin. Italian 
merchants, whose occupation necessitated 
computation, rapidly adopted the new system. 
Representative books of this period are Liber 
abaci (1202) and Practica geometrica (1220) by 
Leonardo da Pisa (also known as Fibonacci, 
c. 1170-1250). The former includes the four 
arithmetic operations, showing Indian in- 
fluence, commercial arithmetic, and algebra. 
The new methods were not limited to mer- 
chants. The French bishop Nicole Oresme 

(c. 1323-1382), who influenced Leonardo da 
Vinci (1452-1519), introduced fractional expo- 
nents and conceived the graphic representa- 
tion of temperature, a precursor of coordi- 
nates and functions. 

From the 11th century, universities de- 
veloped from theological seminaries, first in 
Italian cities such as Bologna and Palermo, 
and later in Paris, Oxford, and Cambridge. 
Mathematics was taught in these universities, 
although no remarkable creative contributions 
were made. However, theologians such as 
Albertus Magnus (c. 1193-1280) and Thomas 
Aquinas (12252-1274) discussed infinity in a 
way that went beyond Greek thought and thus 
helped to lay a basis for the modern philo- 
sophy of mathematics. 
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A. General Remarks 


The sample survey is a means of getting sta- 
tistical information about a certain aggregate 
from the observation of some but not all of 
it. The aggregate concerned is usually called 
the finite population, and the observed part 

is called the sample. Introducing a random 
mechanism, J. Neyman established a method 
of ascertaining objectively the reliability of 
such information. This method is mainly ap- 
plied to demographic statistical surveys and 
opinion polls, but it is also applicable to ran- 
dom samples of physical materials. We briefly 
sketch the mathematical structure of this 
method without going into detail about tech- 
nical problems that arise in the practical sur- 
vey situation. 

Suppose that the population consists of N 
units, where N is called its size. Each unit has 
some characteristic «, which is an element of 
some set Q. The set of all characteristics of the 
units in the population is designated by 0— 
{%1,---,@y}, which we regard as a parameter. 
The set of all possible 0 is denoted by O c ON. 

Suppose that one unit is chosen and ob- 
served according to some procedure, the index 
number of the unit in the population is J, and 
the observed characteristic is X. It is assumed 
that the observation is without error, hence 
X =q}. 

Denote the whole sample by (J, X) —-(J,, ..., 
Jas X4, ..., Xn) where n is the sample size and 
X;=a,. The sample size n may be a random 
variable, and among J,, ..., J,, duplication 
may be allowed. The probabilistic scheme 
for J is called the sampling procedure, and if 
it satisfies the condition 


(c) Pr(J; 5j is independent of a, and of X;,,, 
-.., X, (but may depend on X,,..., X; 4), 


it is called a random sampling procedure. More- 
over, if the *joint distribution of J is indepen- 
dent of 0, it is called regular. Specifically, if n is 
constant and Pr{J} is symmetric in J, it is 
called uniform. 

The two main mathematical problems of 
sample surveys are to determine a random 
sampling procedure and to provide methods 
whereby statistical inferences can be made 
concerning 0 (— 401 Statistical Inference). 


B. The Problem of Inference 


Condition (c) is assumed. The probability of 
(J, X) is given by 
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Pri X), ud Xs BEE en 
=Pr{J, =j1} %o(X1) Pr(J; =j2| J1, X1} Xo(X2) 
ags Pr{J =j Xi, TE (EE Xn-1 7 Xol Xn) 


where y4(X;) is defined as 1 if X;=a z, and 0 
otherwise. This formula can be shortened to 
the form 


Pr{(J, X)} PU. X)y«(X, J), (1) 
where 

(X J= 1 if X;=a,,i=1,...,n, 
naim 0 otherwise. 


Expression (1) is the fundamental model for 
the sample survey problem. Note that P(J, X) 
is independent of the parameter 0. Therefore 
if we let I-(1,, ...,L,) (Jj <... < I,) be the set 
of numbers in (J,, ..., J,) after deleting dupli- 
cations, and let Y 2 (Y,, ..., Y,) be the corre- 
sponding X values, then the joint distribution 
of I and Y can also be expressed as 


Pr{(L, Y)} PO. Y¥)x0(Y, D, 
where 


1 if Y=% ja 1, ...,m, 


1 (Y, D= ‘0 otherwise. 

Since 76(Y, I) = y9(X, J) for all 6, the tcondi- 
tional probability distribution of (J, X) for 
given (I, Y) is independent of y,(X, J), and 
hence (J, X) is a tsufficient statistic. According 
to the general theory of sufficient statistics, we 
can restrict ourselves to the class of procedures 
depending only on (I, Y). 


C. Estimation 


Suppose that g(0) — g(a,, ..., xy) is a real para- 
meter whose unbiased estimators are under 
discussion. 

Theorem. There exists an unbiased estimator 
of g(0) if and only if there exists a decomposition 


g(0)— 5 ËCH trey GI 
Pr{1=(j (v), ii) > 9, 


If the sampling procedure is regular, the 
second condition can be replaced by Pr(I^ 
(1v), ....5,0))) 2 0, v2 1, 2, .... Hence, for 
example, if o; is real and the sampling proce- 
dure is regular, X 2 $ o;/N is testimable if and 
only if Pr{Isi}>0 for all i, and o2 = X (a; — 
TAN — 1) Z X (a; — a)? /N(N — 1) is esti- 
mable if and only if Pr{I3i,j}>0 for all i and 
j. Also, IT, a; is not estimable unless Pr(I— 
(1, ..., N)} 2 0. The decomposition (2) is not 
unique, and corresponding to different decom- 
positions, different unbiased estimators are 
derived. Also, for the case of regular sampling 


v=1,2,.... Q) 
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procedures, for any 0 — 0, it is always possible 
to construct an unbiased estimator g(0) such 
that Pr(9(0) 2 g(05)|0 — 05) — 1 if g(0) is esti- 
mable. Hence the variance of the locally best 
unbiased estimator is always 0, and the tuni- 
formly minimum variance unbiased estimators 
exist only in the trival case. 

If some kind ef symmetry exists among the 
population units as well as the sampling pro- 
cedure and the parameter, it would be natural 
to require the same kind of symmetry for the 
estimators. Let G be a group of permutations 
among N numbers. Assume that for any 0e 8 
and yeG, we have y6€ 9 and 9g(50) — g(0). If 
Pr{yJ}=Pr{J} for all y, then the sampling 
procedure is said to be invariant with respect 
to G. An estimator is also called invariant if its 
value does not change under any permutation 
yeG of the numbers of sample units. Thus if G 
is the set of all permutations De, the tsym- 
metric group), then the invariant estimator is a 
function of Y (or X) only. Moreover, if the 
dimension m of Y is constant, Y is complete 
(under some mild conditions); hence there 
exists a unique minimum variance invariant 
unbiased estimator. 

When there is some additional information, 
it can be represented by auxiliary variables 
B,;---» By, Which are known and assumed to 
have some relation to o, ..., «y. Assume that 
the «; are real numbers and that the parameter 
to be estimated is 0—àX —(Xa;)/N. If we can 
assume that g; and fj; are approximately pro- 
portional, we can estimate the unknown popu- 
lation mean & by a* =(X/Z) x B where X is 
the sample mean of the «’s and Z the sample 
mean of the f’s. Although &* is not unbiased, 
we may expect that it has small error if the 
relation between two variables is close. 27 is 
usually called the ratio estimator. 

In practical research, as an estimator of the 
population total A — 2,a;, we usually use A= 
Y X;/B,, where P;= Pr{J3i}. Its variance is 
V(A) EIER Pj) (a,/P-a;/P" and is 
estimated by v(Â)= ZZ (P, — BAPAE 
X;/P)”, where P= Pr{J>(i,j)}. When N is 
unknown, it can be estimated by the same 
procedure as A (say N), and the population 
mean à can be estimated by & — A/N, which is 
called a ratio estimator. & is biased except 
when N is known. 


D. Asymptotic Confidence Intervals 


It is usually impossible to obtain any meaning- ` 


ful tconfidence interval based on exact small- 
sample theory. But when the o; are real and 
the sampling procedure is uniform and with- 
out replacement, the sample mean X is asymp- 
totically normal with mean x and variance 
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1 1 
“(i -x) 02, where o2 crea as 
N and n> œ, and limsupn/N « 1, provided 
that 


2 
max (4,—« 
max (a, ) 


Also, the sample variance converges to c7 as 
n oo. From these results we can construct 
asymptotic confidence intervals for a. 


E. The Problem of Sampling Procedures 


In determining the sampling procedures, both 
the technical aspects of sampling and the 
accuracy of the estimators should be consid- 
ered. The most commonly used methods are 
multistage sampling and stratified sampling, or 
some combination of the two. For example, 
the population is partitioned into several 
clusters. First we select some of them accord- 
ing to a probability scheme and then choose 
units from the selected clusters. This procedure 
is called two-stage sampling. The probabilities 
for the selection of clusters may be uniform or 
proportional to the size of the clusters. Strati- 
fied sampling is the method of dividing the 
population into several subpopulations, called 
strata, and selecting the sample units within 
each stratum. If the size of the ith stratum is 
N,, the size of the sample chosen from this 
stratum is n;, and the probability is uniform 
within each stratum, then the most common 
estimator for the population mean g is given 
by 


a=) N,X,/N, 


where X, is the mean of the sample values in 
the ith stratum. The variance of 2 is given by 


2 
(ete 


where o? is the population variance within the 
ith stratum. 

If the cost of drawing one sample unit in the 
ith stratum is equal to c;, then for fixed cost, 
the variance of the estimator is minimized 
when 


n,/N,co;/,/¢;, 


which is called the condition of optimum 
allocation. 


F. Replicated Sampling Plan 


W. E. Deming proposed an effective method in 
practical sample surveys, called a replicated 
sampling plan, following J. W. Tukey’s hint. It 
enables us to easily evaluate variances of esti- 
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mates for any estimator and any sampling 
procedure. Let the sample be composed of k 
subordinate samples which are selected by the 
same random sampling procedure from the 
same population, and let ĝ;(J;, X;) be the esti- 
mate from the ith subordinate sample by the 
same estimator and Î be the estimate from the 
whole sample by the same estimator. Then, 
provided Ê= X 0,/k approximately, the var- 
iance of 6 can be estimated by v(0) — X:(6,— 
Ô)? /k(k — 1). If the sample is selected by the 
simple random sampling procedure and is of 
large scale, 0, and Ó are approximately normal 
variates, and v(0) is evaluated by using the 
sample range of the 0,. In large-scale sample 
surveys, even when the random sampling 
procedure is not simple, the theory related to 
the normal distribution can be applied to the 
6, and Ô. It has been shown that v(0) evaluated 
by the above method includes not only the 
sampling error but also the random part of the 
nonsampling errors. 


G. Conceptual Problems 


Although it has been established that the 
sample survey method is useful in large-scale 
social or economic surveys, there are difficult 
conceptual problems about the foundations of 
the method (especially when auxiliary informa- 
tion exists) that are still far from being settled. 
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374 (XVIII.4) 
Sampling Distributions 


A. General Remarks 


To perform statistical inference, it is necessary 
to find the tprobability distribution of a 'sta- 
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tistic involved in the situation (— 396 Statistic, 
401 Statistical Inference). In general, the proba- 
bility distribution of a statistic is called the 
sampling distribution. A set {X,, X3, ..., X,] 

of *random variables that are independently 
and identically distributed according to a 
distribution F is called a random sample 

from F. A common sampling distribution de- 
scribed in this article is that of the statistic Y — 
f(X,, ..., X,), where the set {X,,..., X,} isa 
random sample from a fnormal distribution. 
Examples of such a statistic Y of dimension 1 
are the tsample mean, tsample variance, linear 
or quadratic forms of ( X,, ..., X„}, their ratios, 
and *order statistic, while examples of Y of 
higher dimensions are the sample mean vec- 
tor and the sample covariance matrix and 

its eigenvalues. The tnormal distribution 

with mean u and *variance c? is denoted by 
N(u, 07), while the tp-dimensional (p-variate) 
normal distribution with mean vector u and 
covariance matrix X is denoted by N (a, X) 

(— Appendix A, Table 22). 


B. Samples from Univariate Normal 
Distributions 


If random variables X,,..., X, are distributed 
independently according to N(u,,01), ..., 
NI, 02), then a linear form Za X; has the 
distribution NU. gie, X.;a26?). In particu- 
lar, if {X,,..., Xa} isa random sample from 
N(u, 07), then the sample mean X =>, X;/n 
has the distribution N (u, 7/n). 

Let {X,,...,X,} bea random sample from 
the distribution N(0, 1). The sampling distri- 
bution of the statistic Y = X; X? is called the 
chi-square distribution with n degrees of free- 
dom and is denoted by y?(n). It has the tproba- 
bility density 


Lye: (r (3)) y Weg ye 


for 0 « y « oo, f, (y) 20 elsewhere, where T is 

the tgamma function. The distribution of Y= 
" 1(X; - uj? depends only on n and A= 

>; #7, and is called the noncentral chi-square 

distribution with n degrees of freedom and 

noncentrality 4 and denoted by y? (n, 2). It has 

the probability density 


o CAM) 
Sn A= Xe a (3) gma0) 


for 0< y « oo, where f,,,, and f, are the prob- 
ability densities of chi-square distributions 
and ab is an extended hypergeometric func- 
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tion. Noncentral chi-square distributions 
have the following treproducing property: If 
Y,, ..., Y, are distributed independently ac- 
cording to y?(n,, 44), ..., X? (n4, Aj), then Y, Y, 
has the distribution vn, X; 4). Also, we 
have Cochran's theorem (Proc. Cambridge 
Philos. Soc., 30 (1934)): If X,, ..., X, are dis- 
tributed independently according to N(pg,, 1), 
..., N(u,, 1) and if for quadratic forms Q,,= 
Y,X;af? X, X; for m— 1, ..., k the matrices 

A,, — (aff?) with Trank r,, satisfy the condition 
A4 t ...-- A,— I (unit matrix), then a necessary 
and sufficient condition for Q,, ..., Q, to have 
independent noncentral chi-square distri- 
butions with r,,...,7, degrees of freedom, 
respectively, is that r; 4- ... -- r, — n. In partic- 
ular, when ji; — 0 for all i, they have (central) 
chi-square distributions, and the theorem 
implies their reproducing property. 

Let X and Y be independent random vari- 
ables having distributions N(6, 1) and y?(n), 
respectively. Then the sampling distribution 
of T-X / / Yin is called the noncentral t- 
distribution with n degrees of freedom and 
noncentrality ô and is denoted by t(n, ô). Its 
probability density is given by 


DEE -ap äi? Tn ke 02) 
is anT(n/2) 


k=0 k! 
(t/ fn 


x (1 + t2 /nyo***02 


for —oo «t < oo. In particular, when 6=0, the 
distribution is called the t-distribution with n 
degrees of freedom and is denoted by t(n). Its 
probability density is simplified to 


Dis" 1/2 (: Es 
en T (n/2) 


n 
for —oo «t < oo. 

Let {X,,...,X,} be a random sample from 
N(u, 07). Exact sampling distributions of the 
tsample variance S? = X(X;— X)?/(n — 1) and of 
the t-statistic T — nX — MAA where u= 
Ho is a given number, were essentially obtained 
by Student [5]: (n — 1)52/c? and T are dis- 
tributed according to y?(n— 1) and t(n— 1), 
respectively. His proof was made rigorous by 
R. A. Fisher (Metron, 5 (1925)), who proved in 
particular that X and S? are independent. If 
H£ Uo, then T follows the distribution t(n— 

1, V n(u — uo)/o). 

Let X and Y be distributed independently 
according to y?^(m, A) and x? (n), respectively. 
The distribution of Z 2 (X/m)/( Y/n) is called 
the noncentral F-distribution with m and n 
degrees of freedom and noncentrality A4. In 
the special case when A=0 it is called the F- 
distribution with m and n degrees of freedom 
and is denoted by F(m, n). The probability 


—(nt1)2 


f= 
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densities fm 3 and fm n of these distributions 
are given by 


(0 (miny? 
I^ Bin, zeen 


o AW 1 (m/nym»** 
= Lr a E 
Ins) 2. € (2 k! B((m/2) 4- k, n/2) 
g(mi2)*k-1 


x (1 + mz/ny(v *02)*k 


zim/2) -1 





m+n m A mz/n 
L2 42 F xs E 
EE ( 2 E 


for 0 « z « oo, where B and , F, are the theta 
function and the confluent hypergeometric 
function (— 167 Functions of Confluent Type), 
respectively. 

Let X be a random variable having the 
distribution F(m, n). The distribution of Z — 
4log X is called the z-distribution with m and n 
degrees of freedom and is denoted by z(m, n). 
Its probability density is given by 


2(m/ny"? em 
B(m/2, n/2) (1 + me?*/ny"* »? 


for —oo «z« oo. If S2 - X*,(X;— Xy//(m— 1) 
and $2 = £} (Y, — YY?/(n— 1) are sample var- 
iances based on independent samples of sizes m 
and n taken from N (u, o?) and N(v, t”), respec- 
tively, then the statistic z=4log(S?/S?), which 
was introduced by Fisher (Proc. Int. Math. 
Congress, 1924), is distributed according to 
zim — 1,n — 1) under the hypothesis o? = 1?. 
Fisher [6] tabulated percent points of z(m, n). 


C. Samples from Multivariate Normal 
Distributions 


Let X be a p-dimensional random vector, 
namely, a vector having real random variables 
as its components. X has the p-variate normal 
distribution N (a, 2) if and only if for any real 
vector a=(d,,...,a,)’, the random variable a'X 
has the normal distribution N (a'u, a' Za). If 
X,, ..., X, are independent and have p-variate 
normal distributions N(g,, 24), ..., N (Hn Zh) 
respectively, and if 4,,..., A, arem x p real 
matrices, then the random vector A,X; +... + 
A,X, has the m-variate normal distribu- 
tion Ni, 2), where u= 25-, Aja; and Y= 
14A). 

Suppose that (X,,..., X,] is a random 
sample from the p-variate normal distribution 
NI. 2), and let X=(X,,...,X,) be a px n ma- 
trix. Then the probability distribution of W — 
XX' is called the Wishart distribution with 
scale matrix £ and n degrees of freedom and 
is denoted by W,(2,n) or simply W(Z, n). If 
n2 p— 1, the joint probability density function 
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of p(p + 1)/2 arguments of W=(W,,) is 
Sz (W)-(,Q/2)2z]?) ' 
xetr(—X ! W/2)| W "7772 


for W>0, where W>0 means that W is tposi- 
tive definite, etr(A)=exp(tr A), and T „, a multi- 
dimensional gamma function, is defined as 


L,(a)- o pip Da I r Län: D) 

for a>(p— 1)/2. When n p— 1 the distri- 
bution is singular and has no probability 
density. 

Suppose that X,, ..., X, are independent 
and obey normal distributions N (gi, 2), 

., N(u,, X), and let X 2(X,,...,X,), M = 
(His ..., Hna) Then the distribution of W= XX’ 
is called the p-dimensional noncentral Wis- 
hart distribution with scale matrix 2, n de- 
grees of freedom, and noncentrality matrix 
Q=% MM' and is denoted by W(X, n, Q). If 
n2 p — 1, the probability density function is 


et( —O/2)gFi(n/2; QE! W/A) fs. .(W) 


for W>0. ,Fj is a hypergeometric function 
with matrix argument, which is defined by 
s Ee eod Diu DS) LER Gps nena 
b,,..., bg; S, I), where 


a FP iussa doi Das s Dg S, T) 


C,(S)C.(T) 
KIC) ` 





K — (k,, ..., kp) is an ordered set of integers such 
that kı +... +k, =k and k; >... 2 k, 20, and 
where C,(S) is a zonal polynomial (— [8]) of a 
symmetric matrix S. The multivariate hyper- 
geometric coefficient (a), 1s given by 


p 1 
(a). [I DEEN D) ; 
ki 


i=1 


(a), =aļla+ 1)...(a-- k — 1). 


The noncentral Wishart distribution is sin- 
gular when n € p — 1. Similarly to the noncen- 
tral chi-square distribution, the noncentral 
Wishart distribution has the reproducing 
property with respect to both the number of 
degrees of freedom and the noncentrality 
matrix. Also, Cochran's theorem can be ex- 
tended to the multivariate case: Let X,,..., X, 
be p-variate random vectors independently 
distributed according to N(#,,2),...,N(#,, 2), 
respectively, and let A m= (ayy), m E SE ..., k, be 
p x p real matrices of trank r„ and such that 
A, t Àj 9 ... - A, — 1 (unit matrix). A necessary 
and sufficient condition for random matrices 
Q,, — X, ja X;X;, m— 1, ..., to be indepen- 
dently distributed according to noncentral 
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Wishart distributions with r,, ..., r, degrees of 
freedom, respectively, is that r; +... +r, — n. If, 
in particular, u; =... —4,-—0 and if r,, > p, then 
Q,, is distributed according to W(X, r,,). 

If W has the distribution W(Z,n) with n> 
p — 1, then the eigenvalues 4,,...,4, (4, 2... > 
4,70) of W have the joint probability density 
function C,,|Z| "^ 4FPP(— 21/2, Alain 
IT, (4; Ah where A is a diagonal matrix 
with diagonal elements 4,,...,4, and C, ,= 
n2 PP p (p/2)T (n/2)) *. If X=], then the 
joint probability density function becomes 
C, ,etr( — A/2)]A|" » 92 TT(4;— 

i«j 

Suppose that S, and S, have independent 
Wishart distributions W(2,n,) and W(X, n3), 
respectively. The random matrix B=(S, + 
S;) !?S,(S, - S;) '? is called the beta matrix, 
and its distribution is denoted by B(n,/2,n;/2). 
Its probability density function is 


n, n n 
DI e *yr (2r, (2) jase p-1)/2 


x Hu = B\"27 su? 





for Oc B« I. 

Suppose that S has the distribution W(X, n) 
and B has Bin, /2,n,/2); then, for any non- 
singular symmetric matrix Q, 


(bd Te p*l 
p{S<0}=4 220° | LZ 2 y 


n+p+i nn+p+! 1 
UJ -;—- XQ 
(Epee Seel 


and 








H, m ptin,t+tpti 
F(=, -2+ ;— ;Q J. 
SS Kam 2 P RE ) 


If (X,,..., Xp} is a random sample from 
Ni X), then the sample mean X == D"_, X,/n 
and the sample covariance matrix S=}, (X, 
—X)(X, — X)'/(n — 1) are distributed indepen- 
dently according to the respective distributions 
Nim, Eist and W(E,n—1).If n» p—1, T?— 
n(X — uo} S~! (X — uo) is called the noncentral 
Hotelling T? statistic with n— 1 degrees of 
freedom and noncentrality 4— n(yu — uo) X^! (u 
— uto). (n— p) T?/p(n — 1) has a tnoncentral F- 
distribution with p and n— p degrees of free- 
dom and noncentrality 2. 

Let X -(X,, ..., X, and Y-(Y,,..., H, 
p €q denote two random vectors, X, , and 
£, their respective covariance matrices, and 
2,2 the p x q matrix of covariances between 
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the components of X and Y. Each of the 
nonnegative roots p;, ..., p, of the equation 
12,,25125,— p? Z,,| 20 is called the canonical 
correlation coefficient. Let (X,, Y ,), ..., (Xn, Yn) 
be a random sample from the (p + q)-variate 
normal distribution NI, X), and let S,, = 
Yi X Xin, S12 2 $5, = Loar Xa Kan, S22 = 
255-1 Y, Y;/n, and S=(S,); j-,,;. The sample 
canonical correlation coefficients are the non- 
negative roots r;, ...,r; of [S,,8;785, — 7S, | 
—0 and for n>p+q athe probability density 
function of rf, ...,77 is 


Call P? |"? |R?|47 p- 1217 — Ri" q—p-1)/2 


np 
x[It (r; Aus) sai » RAP i 


i<j 


mr Cs) 9n) 


and R? and P? are diagonal matrices with 
elements r? and pj, respectively. If, in partic- 
ular, p — 1, then p, and r, are, respectively, 
the population and the sample multiple cor- 
relation coefficients, and (n— q)r2/q(1 —r2) 
follows the distribution F(q, n — q) whenever 
p1 70. 

Let ((X,, Y1), ...,(X,, ¥,)} bea random 
sample from the 2-dimensional normal distri-- 
bution with tcorrelation coefficient p. Then the 
sample correlation coefficient 


|. E(X;- XY(X,- Y) 
= (2,(X,— X)?Z,(¥,— Y)? 
has probability density 


SAT p) 
=(2"-3/n(n—3)!)(L— p?) »2(1— 19? 


SE SEI i-o? 
k=0 


for —1<r<1. For the special case p =0, the 
probability density becomes 


Gen 

alt: ) 
va "r(e: (n— 2) 
which implies that T=./n—2R/,/1—R? 


has the tt-distribution with n—2 degrees of 
freedom. 

Given a random sample from a p-variate 
normal distribution, the probability density of 
the tsample partial correlation coefficient 
R,5.5..., between the first and the second 
components with the remaining components 
fixed is given by f,-+2(3 P12-3...p) Where f is 


f= 


“(1- pysMe 
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the density of R mentioned in the previous 
paragraph and os e is the population par- 
tial correlation. 


D. Large-Sample Theory 


So far we have dealt with random samples 
{X,,...,X,} composed of finitely many ran- 
dom variables (or vectors). The theory dealing 
with such finite cases is called small-sample 
theory, which is not always suitable for numer- 
ical applications. In comparison with this, in 
large-sample theory, where the sample size is 
assumed to be sufficiently large, an approxima- 
tion of the sampling distribution can often be 
obtained easily by means of the fcentral limit 
theorem. 

If for three sequences X,, Ha, and o,, n= 
1, 2, ..., of random variables, real numbers, 
and positive numbers, respectively, the se- 
quence (X, — Hell, tconverges in distribution 
to N(0, 1) as n— oo, then the sequence X, is 
said to be asymptotically distributed according 
to N(u,, 02). The definition can be extended 
to higher dimensions. We write X, =0,(r,) for 
a sequence r, of positive numbers if and only 
if X, /r, tconverges in probability to zero as 
n oo. The following theorem is useful: If 
X,—a ^ oy(r,), where a is a constant and 
r,— o(1), and if a real-valued function f(x) is 
of class C“ in a neighborhood of x =a, then 
fO) Y. cf C - af +09). 
If X, is asymptotically distributed according 
to N(u, c?/n) and f(x) is differentiable at x = 
u with the derivative f'(u) 40, then f(X,) is 
asymptotically distributed according to N( f(u), 
Cf'G)Y e? /n). In higher-dimensional cases, if 
X, is asymptotically distributed according to 
N (4, Z/n) and f(x) is continuously differenti- ` 
able in a neighborhood of x = with nonzero 
vector € —(0f/Ox,, ... ,Of/Ox,),-,,, then f(X,) 
is asymptotically distributed according to 
N(f(u), cZe'/n). 

Let (X,,..., Xn} be a random sample from 
a univariate distribution with finite fmoments 
v, = E(X?) for i— 1, ..., k, and let a; X, Xi/n 
be its ith sample moment. Then the random 
vector (a,, ..., a,) asymptotically follows the k- 
variate normal distribution as n— oo with 
mean vector (v,, ..., v) i covariance matrix 
n^! (cj), where o,;= vj.;—v;vj. Let Mi- Z,(X.— 
Kin and yu; ^ E(X — v, Se i=2,...,k be the 
sample central moment of order i i ‘and popula- 
tion central moment of order i, respectively. 
Then the random vector (X, M,, ..., M,) obeys 
the k-variate normal distribution asymptoti- 
cally as n oo with mean vector (v4, H2, ..., My) 
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and covariance matrix Ge ij. where oz 
Ho, 01,7 ity — lla Hiis Oj = Hie; — lica Han 
JB Bia Sun mE. for i, jz 2. 

A random variable y? that has a tchi-square 
distribution with n degrees of freedom obeys 
the distribution N (n, 2n) asymptotically as 
n oo. Also, RE / 2n — 1 obeys N(0, 1) 
asymptotically. The latter distribution approx- 
imates y? indirectly better than N (n, 2n) ap- 
proximates y2 directly. The t-distribution with 
n degrees of freedom obeys N(0, 1) asymptoti- 
cally as noo. If X, obeys an F-distribution 
with m and n degrees of freedom, then mX, 
obeys asymptotically the distribution y?(m) as 
n oo. If X, obeys a *binomial distribution 
Bin(n, p), then X, obeys asymptotically the dis- 
tribution N (np, np(1 — p)), and Arcsin,/ X, /n 
obeys asymptotically N(Arcsin, / p, 1/4n) as 
n oo. This transformation is called the arcsin 
(or angular) transformation. If (X,, X,, ..., X) 
obeys the multinomial distribution Mu(n; p,, 
Da. Pi), then it is asymptotically distributed 
according to the normal distribution N (u, L.A 
where gt, — (np;, ..., AP), Za = (op), oP = 

np,(1 — pj), and of) — —np;p;, (ij), and the 
NE variable THNX, — npj /np;, where 
X,,;,2n—(X,-...- X) and p,,—1—(p, + 

.. + py), obeys asyiptotically the distribution 
20) [11]. 

If X, has the *Poisson distribution with 
mean 4,. where 4, oo as n oo, then X, and 
JJ X, obey the respective distributions N (ån, An) 
and NIAE 1/4) asymptotically. If R is the 
sample correlation coefficient based on a ran- 
dom sample of size n from a 2-dimensional 
(bivariate) normal distribution with popula- 
tion correlation coefficient p, then R is asymp- 
totically distributed according to N(p,(1— 

de as n oo, and therefore z 2 1log((1-- 
R)/(1 — R)) obeys asymptotically the distri- 
bution Se log((1 + p)/(1 — p)), 1/n) asymptoti- 
cally. This transformation is called Fisher’s z- 
transformation. The distribution 


1 1 1 
N| -log Te 4 ; 
2 "1—-p 2(n—1) n—3 


gives a better approximation. 





E. Empirical Distribution Function 


Let {X,,...,X,} bea random sample from a 
distribution F. The random function 


F(x) = | {number of X’s that are <x} 


is called the empirical distribution function. For 
any collection of fixed x's( os xg « x, <... 
« x, « oo), the random vector (nF,(x,), n(F,(x;) 
= F,,(%1)), ++ MF A(X) — Fj —1))) obeys the 
*multinomial distribution Mu(n; p,,..., DÄ 
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where p;— F(xj) — F(x; ,), j= Ll, .... k, provided 
that the p's are positive. In particular. the 
vector is asymptotically distributed according 
to the k-variate normal. The result is substan- 
tially strengthened as follows: the Glivenko- 
Cantelli theorem states that sup, | F(x) — F(x)| 
converges to zero with probability 1 as n tends 
to infinity. If F(x) is continuous, then the ran- 
dom function Jn(F,) — F(t)) converges in 
distribution to a tGaussian process X(t) such 
that E(X(t)) 20 and E(X (s) X (t)) = F(s)(1— 
F(t)) for s<t. A Gaussian process X(t), 0< 

t 1, with this moment condition is called a 
Brownian bridge if F(t)— t, for O x t « 1. If F(x) 
is continuous, then the distributions of the ran- 
dom variables C, — ./ nsup,(F,(x) — F(x)) and 
D, — /nsup|F,(x) — F(x)| do not depend on F. 
Asymptotically, they have identical distribu- 
tions with sup, B(t) and sup, | B(t)|, respectively, 
where B(t) is a Brownian bridge. We have 


P(sup, B(t)<x)=1—e72”, 


P(sup,|B(t)|< 


(ie -2k?x? 
T 


9=1+2 (71 


x20. 


Let {X,,...,X,,} and {Y,,..., ¥,} be random 
samples from continuous distributions F and 
G, respectively, and let F,,(x) and G,(x) be their 
empirical distribution functions. Under the 
hypothesis H, F =G, the distribution of the 
Kolmogorov-Smirnov test statistic 


Dyn = SUP | bal G9) 

does not depend on F (or G), and asymptoti- 
cally, as moo and m/n>/ <1, the random 
function Vm(F,t)— G,(t)) converges in distri- 
bution to a Gaussian process X(t) such that 
E(X (0) —0 and E(X(s)X(t))=(1 + A)F(s)(1— 
F(t)),s<t. 


F. Edgeworth and Cornish-Fisher Expansions 


Let {X,,X,,...,X,} be a sample from a distri- 
bution with mean p and variance o?. The 
random variable (X, + X)+...+X,—np)/,/no 
is called the normalized sum of the sample. 
The distribution function F,(x) of the normal- 
ized sum of a sample from an absolutely 
continuous distribution F with higher-order 
moments admits the Edgeworth expansion 


[15]: 


F(x) 


y-2 1 k ] v1 
TUE R,(x) Ka sose) , 


where ® and ¢ are the fcumulative distribution 
and the *probability density functions, respec- 
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tively, of N(0, 1), and B is a quantity bounded 
by a constant depending on F and v. R,(x) is 
the polynomial given by 


R (x)= — XH,«31-1(x) IL (es 
where H,(x) is the *Hermite polynomial of 
degree k, y, is the *cumulant (tsemi-invariant) 
of order k of the distribution of (X, — ue, the 
summation extends over all nonnegative m's 
such that m, + 2m, + ... + km, —k, and I2 m, 
+m,+...+m,. In particular, we have R,(x)= 
— ys — 1)/6 and R2(x)= —y4(x° — 3x)/24 
— y2(x5 — 10x? + 15x)/72. For a tlattice distri- 
bution F concentrated on 0, +1, +2,... but 
not on 0, +p, +2p, ... for any p> 1, the fol- 
lowing expansion is valid for x «0, +1, 

T2 out 


x—nu 
pic 
( ne 
v-i1 


STEE oul. Jana , 


where z=(x —nut+ LN, due and the Q's are 
suitable polynomials; Q,(z)= R,(z) and Q,(z) 
=R,(z)+2/240?. l 

The Edgeworth expansion makes it possible 
to derive asymptotic formulas for the relation 
between those u and v such that F,(v) = 4(u). If 
F is an absolutely continuous distribution with 
moments of order v (> 3), then we have the 
Cornish-Fisher expansions [16]: 


v-2 1 k 
— =~. —(v-1)/2 
u Vp A, (v) (5) +O(n ) 


and 


y-2 


z dy -(v-1)/2 
v ir A aal —) - O(n 


where the A's and B’s are polynomials derived 
from the R's of the Edgeworth expansion; 
A,(v)= —y3(v? — 1)/6, A; (0) 2 — ya? dos 
+ y2(4? — 7v)/36, B, (u)= y4(u? — 1)/6, Ba (u 

ya(u? —3u)/24 — y3 (2u? — 5u)/36. 

The expansions imply, in particular, that the 
random variable v + 32-2 A,(r)n "? with v= 
(X, X... X,— nuy (ne? is asymptoti- 
cally distributed according to N(0, 1) and that 
the 100x% point v, of F, is approximated by u, 
XL B,(u,)n "?,where u, is the 100x% point 
of N(0, 1). These approximations can be im- 
proved further in some cases by a suitable 
transformation of the sum X, + X,+...+X,. 
Thus, for example, if X is distributed accord- 
ing to y?(n), then the Cornish-Fisher expan- 
sions with v —3 are 


en Äer mell 
2Nn n 
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1 2 1 
psu; Pee »«o(*) 
AN n n 


where v=(X — ny, / 2n. However, the distri- 
bution of the random variable 


On (/ XN? ju 
2 n 9n 


is much better approximated by N(0, 1), and 


1 2 n 2 3 
a 9n T 


gives a more accurate approximation to the 
100%% point of the distribution y? (n). These 
are called the Wilson-Hilferty approximations 
(Proc. Nat. Acad. Sci. US, 17 (1931)). 

The Edgeworth expansion was shown to be 
valid in more general situations by R. N. Bhat- 
tacharya and J. K. Ghosh [17]. In particular, 
they obtained the following: Let (X,, X;, 

..., X,] be a random sample from a p-variate 
distribution with a nonzero 'absolutely con- 
tinuous component w.r.t. Lebesgue measure 
on R”. Let fo (=D, fi, ..., f, be linearly in- 
dependent, real-valued, and continuously dif- 
ferentiable functions. For i=1,...,n, put Z;= 
(fi (X), fo (X5), ..., f, (X), and assume that 

the distribution of Z, has moments up to the 
order v (23). Let H be a real-valued function 
on R* such that the vth order derivatives are 
continuous in a tneighborhood of u= E(Z,). 
Let V —(vjj), i, j — 1, ..., k, be the covariance 
matrix of the random vector Z,, and put o? = 
È v;;lil;, where 1; = OH (z)/0z; lz- „m and z= 

(Zis ..., Zk). Then 





Pr{ /n(H(Z) — H(u)e Bj -| Wy, n(x) dx 
B 


sup 
B 





= O(n = Wane). 


where Z — X", Z,/n, the supremum is taken 
over all Borel measurable sets B, 


y-2 1 k d 
V^, n(X) -(1 + à v P, (-£)) (x), 


¢,(x) is the probability density function of the 
normal distribution N(0, o°), and the P's are 
polynomials whose coefficients are indepen- 
dent of n. 


G. Order Statistics 


Let ( X,,..., X,] bea random sample from a 
univariate distribution with continuous prob- 
ability density f(x) and distribution function 
F(x), and let X, € ... € Xm be torder statistics. 
The joint probability density of Y, = Ka, Y= 
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Am, Yp-1 = X9, and Y,= X, is given by 


n! 
(x —1)(8 —a— 10 ...(q—2— Dtn- al 
x (F DEHE) FUN *... 





x(F(y)- F(y,-0) * (1 Ey)". 


x Teil... Tel 


for ~ «y, €... e y, «oo, where a< B <... < 
€ «n. If for given constants 0€ 4, «€... dn « 
1, each of the subscripts o, ..., n tends to infin- 
ity as n oo under the conditions r; =n}; + 

o( / n), where a —r,, ...,1]—r,, then the ran- 
dom vector (Y,,..., ¥,) asymptotically obeys 
the p-dimensional normal distribution with 
mean vector (¢,,...,¢,) and covariance matrix 
n "iech where o; =a; — All — Af (EDS (6) for 
i<j and £, is the 4,-quantile of the population, 
defined by 4;= F(£;). 

Suppose that there exist two sequences a, 
and b, of real and positive numbers, respec- 
tively, such that as n oo the sequence (X m — 
a,)/b, converges in distribution to a nonde- 
generate distribution G. The underlying dis- 
tribution F is said to belong to the domain 
of attraction of the limiting distribution G 
(DA(G), for short). Except for the change of 
location and scale, only the following three 
distributions have nonempty domains of 
attraction: 


Writing F(x) 2 1— F(x), «(F)=inf{x| F(x)» 0] 
and w(F)=sup{x| F(x) « 1}, we have the fol- 
lowing theorem (B. V. Gnedenko, Ann. Math., 
44 (1943)): 

FeDA(G,) iff o(F)— oo and there exists an a 
such that 

lim F(a--ux)/F(a--u)- x^? for all x; 


FeDA(G,) iff a& o(F) « co and 


lim F (a—ux)/F(a—u)=x’ for all x0; 


FeDA(G,) iff there exists a positive function 
R(t) such that 

lim, F(t-- xR(t)/F(t)2e"* for all x. 
to 
If F(x) is twice differentiable, f(x) = F'(x) 
is positive for sufficiently large x, and 
lim, ocr) 4 { (1 — F(x))/f()j/dx =0, then 
FeDA(G;). Noticing the relation X,,— 

max{—X,, —X,..., — X,j, we can also 

derive the possible limiting distributions for 
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the sequence (X, — a,)/b, and their domains of 
attraction. The statistics R, = Xm — Xa and 
M, —(X,* Xq)/2 are called the range and the 
midrange of the sample, respectively. If, for 
some a,, à,, and b,, both sequences (X, — 
a,)/b, and (X,,, — a;)/b, converge to nondegene- 
rate distributions G and H, respectively, then 
they are asymptotically independent, and we 
have 


lim Pr((R,—a, - ab, & xj 
xi (1 — H(y—x)dG(y), 
lim Pr{(2M, — a, — a)/b, <x} 


-| H(x — y)dG(y). 


H. Characterization of the Distribution by 
means of a Property of the Sampling 
Distribution 


A distribution or a family of distributions can 
be characterized by a property of the sam- 
pling distribution of a suitable statistic. Let 
(X4, X, ..., Xn} be a random sample from a 
nondegenerate distribution F, and let X, € 

... <X;,,, be the torder statistics. The tsample 
mean X is independent of the tsample var- 
iance $? = UK: X)?/(n— 1) iff F is normal 
N(u, 0°) (Kawata and Sakamoto, J. Math. Soc. 
Japan, 1 (1949)). Let aj, i, j=1,...,n, be real 
numbers such that La,,=0 and X a; #0. If F 
has a finite second moment, then the condition 
E(X;X,X;| X) - const. implies that F is nor- 
mal. Two linear statistics L; =a, X, +... + 
a, X, and L,=b,X,+...+b,X,, are inde- 
pendent only if F is normal, provided that 

a,b; #0 for some j. In fact, the X's need not be 
identically distributed: If L, and L, are inde- 
pendent and a,b, #0, then the distribution of 
X; is normal (Skitovich-Darmois theorem). Yu. 
V. Linnik [23] gave a necessary and sufficient 
condition for the normality of F to be equiv- 
alent to the identity of the distributions of L, 
and L,. The condition is stated in terms of the 
zeros of the entire function o(z)=|a,\7?+...+ 
la, —|b,l —... —|6, |. The result contains as 
a special case the following characterization 
theorem for the normal distribution: If È a? = 
1 and L, has a distribution identical to that of 
X,, then F is normal N (4,0?) with u(2X:a;— 1) 
— 0. R. Shimizu gave a complete description of 
the characteristic function of the distribution 
for which L, has the same distribution as X,. 
In particular, it was proved that if logla, |/ 
log|a;| is an irrational number, « is the posi- 
tive number given by È |a,|*=1, and if L, has 
a distribution identical to that of X,, then F is 
the *stable distribution with *characteristic 
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exponent «. The result was extended in [24] 
to the cases where the a's are random variables 
independent of the X's. If E(X;) 2 0 and E(X 

| X, — X, X, — X, ..., X, — X)-0, then F is 
normal [26]. Let 44, H2, ..., Hn be a set of real 
numbers. If the sampling distribution of the 
statistic X(X;-- i)? depends on the j/s only 
through X y?, then F is normal. If X; is posi- 
tive and has finite mean, then the condition 
E(X | X,/X,, X4/X,, ..., X,/ X,) - const. implies 
that F is the gamma distribution. If the distri- 
bution F is not concentrated on a lattice 0, 

p, 2p, ..., then E(Xq,,, — X | Xu) = const. 

for some k implies that F is the exponential 
distribution: F(x)=1—e7*, x 3 0. Xk+) — Xw 
has the identical distribution for some k with 
min(X;, Xz, ..., X,.,] iff F is exponential. If 
du, 45, ...,Q, are positive numbers such that 
a;+a,+...+a,=1 and such that loga, /loga, 
is irrational, then the sampling distribution of 
min( X, /a,, X;/a;, ..., X,/a,] is the same as 
that of X, iff F is exponential. X,,) is indepen- 
dent of X,,, — X iff F is exponential. 

Suppose that the distribution F has a 
bounded density function and that the integral 
IS. e* dF(x) is finite on a neighborhood of t= 
0. Let (X,, X2, ..., X,) bea random sample 
from a distribution of the family 2 = (F((x— 
u)/e)| —oo <u < 00,00). For nz 9, the sam- 
pling distribution of the statistics 


xx (X,—X1) (X«— X4). (X5 — X4) 
(X, El Us: El (X5— Xz)’ (X-X) 


sgn(X; — X,), sgn(X; — X), 


sgn(Xs — X4), senz, — xə} 


uniquely determines the family 2. If F is 
symmetric, then for nz 6 the distribution of 
UGG — X3/X; — X). (Xs — X3/( — Xj 
uniquely determines 2 [25]. 


I. U-Statistics 


Let (X, ..., X,) bea random sample from a 
certain distribution, and let o(x,, ..., Xm) bea 
real-valued function that is symmetric with 
respect to the arguments x4, ..., x. The 
statistic 


u=(") Y o(X,,, e Xa) 


where the summation extends over all combi- 
nations (%1, --., Gel taken from (1,2, ..., n), is 
called a U-statistic. Let 0 — E(q(X;, ..., Zell 
and assume that E((X,, ..., X,,)”) is finite. 
Then the mean and variance of U are given by 


71 m n 
E(U) -6, GEN 3 (7) =) b 
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where ¢; is the covariance between o(X,, 
,— Xm) and 9(X,,..., X; Xi 4, ..., Xm) with 
Anen X, an additional independent 
random sample of size m — i from the same 
distribution. If , #0, then U obeys the distri- 
bution N(8, mit, /n) asymptotically as n oo 
(W. Hoeffding, Ann. Math. Statist., 19 (1948)). 
These results can be generalized to the case 
of several populations and samples. Let 
Aan, Xin ees ene Xen, be c independent 
random samples, each drawn from one of c 
populations. Assume that a real-valued func- 
tion (X11, +++) Eis Xe1s +++ Xem) IS SYM- 
metric with respect to the arguments x;,, ..., 
Xim for each i— 1, ..., c. Then 


c nj -1 
v- DU 2e OX sanas X atii 
i-1 i 


X, 


ca(cl1)» ***? X eatem) 


where the summation extends over all combi- 
nations (a(i1), ..., «(im,)) of m; numbers taken 
from (1,2, ..., nj) for each i— 1, ...,c, is called a 
U-statistic. The mean and variance of U can 
be obtained as before, while U is asymptoti- 
cally normally distributed as the sample sizes 
n,,...,n, tend to infinity in fixed proportion. 
Furthermore, if there are given several U- 
statistics, their tjoint distribution is asymptoti- 
cally normal. 


J. Distributions Having Monotone Likelihood 
Ratio, and Pólya-Type Distributions 


Let (X, 8B) be a sample space and 
2 —ip,0)I0e0j 


be a family of probability densities with re- 
spect to a fixed to-finite measure. The function 
P(x) regarded as a function of 0 with a fixed 
observed value of x is called the likelihood 
function, and its value at a particular point @ is 
called the likelihood of that point. The family 
P with Oc R is said to have monotone likeli- 
hood ratio with respect to a real-valued function 
T(x) if and only if for any 0 «6' such that 0 
and 0' belong to Q the ratio pẹ (x)/pẹ(x) is a 
nondecreasing function of T(x). Under the 
assumption that X cR and 67 log p4(x)/0x00 
exists, a necessary and sufficient condition for 
P to have monotone likelihood ratio with 
respect to T(x)=x is that 0? log pg(x)/0x00 20 
for any x and 0. If ( X;,..., X,) is a random 
sample from a distribution that has a mono- 
tone likelihood ratio and if a real-valued func- 
tion y(x,, ..., x,) is nondecreasing in each of 
its arguments, then the expectation E,(W(X,, 
...,X,)) is a nondecreasing function of 0. 

The family 2 is said to be of Pólya type n 
if and only if for any m— 1, 2, ...,n and any 
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real numbers x, <... < x, and 0, « ... < 6, 
the determinant of the matrix ( po (xj). i, j= 
1,...,m, is nonnegative, and 2 is said to be 
strictly of Pólya type n if the determinant is 
positive. Being of Pólya type 2 is equivalent to 
having a monotone likelihood ratio. If F is 
(strictly) of Polya type n for any positive in- 
teger n, then it is said to be (strictly) of Pélya 
type. An ‘exponential family of distributions 
with probability density p(x)=exp(@x + a(0) 
+s(x)) for xeX cR and EQ cR is strictly 
of Pólya type. Each of the noncentral chi- 
square distribution, noncentral t-distribution, 
and noncentral F-distribution is strictly of 
Pólya type with respect to the noncentrality 
parameter. 


References 


[1] H. Cramér, Mathematical methods of 
statistics, Princeton Univ. Press, 1946. 

[2] S. S. Wilks, Mathematical statistics, Wiley, 
1962. 

[3] M. G. Kendall and A. Stuart, The ad- 
vanced theory of statistics I, Griffin, fifth edi- 
tion, 1952. 

[4] N. L. Johnson and S. Kotz, Distributions 
in statistics, 4 vols., Wiley, 1969-1972. 

[5] Student (W. G. Gosset), The probable 
error of a mean, Biometrika, 6 (1908), 1—25. 
[6] R. A. Fisher, Statistical methods for re- 
search workers, Oliver & Boyd, 1925. 

[7] T. W. Anderson, An introduction to multi- 
variate statistical analysis, Wiley, 1958. 

[8] A. T. James, Distributions of matrix vari- 
ates and latent roots derived from normal 
samples, Ann. Math. Statist., 35 (1964), 475— 
501. 

[9] A. G. Constantine, Some non-central 
distribution problems in multivariate analysis, 
Ann. Math. Statist., 34 (1963), 1270-1285. 
[10] H. Chernoff, Large-sample theory: 
Parametric case, Ann. Math. Statist., 27 (1956), 
1-22. 

[11] K. Pearson, On the criterion that a given 
system of deviations from the probable in the 
case of a correlated system of variables is such 
that it can be reasonably supposed to have 
arisen from random sampling, Philos. Mag., 
(5) 50 (1900). Reproduced in K. Pearson, Early 
statistical papers, Cambridge Univ. Press, 
1948. 

[12] K. R. Parthasarathy, Probability mea- 
sures on metric spaces, Academic Press, 1967. 
[13] P. Billingsley, Convergence of probability 
measures, Wiley, 1968. 

[14] R. H. Randles and D. A. Wolfe, Introduc- 
tion to the theory of nonparametric statistics, 
Wiley, 1979. 


1396 


[15] F. Y. Edgeworth, The law of errors, 
Trans. Cambridge Philos. Soc., 20 (1904). 

[16] E. A. Cornish and R. A. Fisher, Moments 
and cumulants in the specification of distri- 
butions, Rev. Int. Statist. Inst., 5 (1937). Repro- 
duced in R. A Fisher, Contributions to mathe- 
matical statistics, Wiley, 1950. 

[17] R. N. Bhattacharya and J. K. Ghosh, On 
the validity of the formal Edgeworth expan- 
sion, Ann. Statist., 6 (1978), 434—451. 

[18] R. N. Bhattacharya and R. Ranga Rao, 
Normal approximation and asymptotic expan- 
sions, Wiley, 1976. 

[19] E. J. Gumbel, Statistics of extremes, 
Columbia Univ. Press, 1958. 

[20] A. E. Sarhan and B. G. Greenberg (eds.), 
Contributions to order statistics, Wiley, 1962. 
[21] H. A. David, Order statistics, Wiley, 
second edition, 1981. 

[22] J. Galambos, The asymptotic theory of 
extreme order statistics, Wiley, 1978. 

[23] Yu. V. Linnik, Linear forms and statis- 
tical criteria, Selected Transl. Math. Statist. 
Prob., 3 (1962), 1-90. (Original in Russian, 
1953.) 

[24] R. Shimizu and L. Davies, General char- 
acterization theorems for the Weibull and the 
stable distributions, Sankhya, ser. A, 43 (1981), 
282-310. 

[25] Yu. V. Prokhorov, On a characterization 
of a class of probability distributions by distri- 
butions of some statistics, Theory of Prob. 
Appl., 10 (1965), 438—445. (Original in Russian, 
1965.) 

[26] A. M. Kagan, Yu. V. Linnik, and C. R. 
Rao, Characterization problems in mathemat- 
ical statistics, Wiley, 1973. 

[27] J. Galambos and S. Kotz, Characteriza- 
tions of probability distributions, Springer, 
1978. 

[28] G. P. Patil, S. Kotz, and J. K. Ord (eds.), 
Statistical distributions in scientific work III, 
Characterizations and applications, Reidel, 
1975. 

[29] S. Karlin, Decision theory for Pólya type 
distributions, Case of two actions I, Proc. 3rd 
Berkeley Symp. Math. Stat. Prob. I, Univ. of 
California Press, 1956, 115—128. 


375 (XX.26) 
Scattering Theory 


A. General Remarks 


The path of a moving (incident) particle is 
distorted when it interacts with another (tar- 
get) particle, such as an atom or a molecule. 
Phenomena of this sort are generally called 
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scattering. Scattering is called elastic when the 
internal properties of the incident particle and 
the target remain unchanged after the colli- 
sion, and inelastic when the internal properties 
change, other particles are emitted, or the two 
particles form a bound state. 

The extent of scattering depends on the sizes 
of the incident and target particles. The scat- 
tering cross section is defined as the probability 
that the incident beam will be scattered per 
unit time (normalized to one particle per unit 
time crossing unit area perpendicular to the 
direction of incidence). In fclassical mechanics 
the scattering cross section is equal to the 
cross section of the target perpendicular to the 
incoming beam, hence the term "cross sec- 
tion." The probability of scattering into a unit 
solid angle in a particular direction is called 
the differential cross section. The probability 
that the incoming particle is absorbed by the 
target, called the absorption cross section, is 
intimately connected with the scattering cross 
section. Analyses of scattering give infor- 
mation on the structure and interactions of 
atoms, molecules, and elementary particles. 
One can also study the scattering of acoustic 
and electromagnetic waves by inhomogeneous 
media and obstacles, by considering notions 
similar to the above. 

Scattering theory may be dated back to 
Lord Rayleigh. Since the advent of quantum 
mechanics in the mid-1920s, scattering prob- 
lems, mainly for central (spherically sym- 
metric) potentials, have been investigated 
strenuously by physicists. It may be said, how- 
ever, that a scattering theory having mathe- 
matically rigorous foundations began around 
the 1950s, when the pioneering work of K. 
Friedrichs (Comm. Pure Appl. Math., 1 (1948)), 
A. Ya. Povzner (Mat. Sb., 32 (1953)), T. Kato 
(J. Math. Soc. Japan, 9 (1957)), J. M. Cook 
(J. Math. Phys., 36 (1957)), and J. M. Jauch 
(Helv. Phys. Acta, 31 (1958)), among others, 
appeared, and scattering theory has now grown 
into a branch of mathematical physics. 

General references for mathematical scatter- 
ing theory are, e.g, [1-5]. 


B. Wave and Scattering Operators 


In tquantum mechanics the dynamics of 

an interacting system is given by a *one- 
parameter group of unitary operators e DH. 
where t denotes the time and H, called the 
Hamiltonian of the system, is a tself-adjoint 
operator acting in a tHilbert space #. Ele- 
ments of J£ represent (pure) states of the sys- 
tem. Let H, be the "free" Hamiltonian of the 
corresponding “noninteracting” system. (There 
are at present no generally accepted definite 
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criteria for “free” and “noninteracting.”) Ho is 
assumed to be tabsolutely continuous, which 
is the case in most practical situations. Then 
the outgoing and incoming wave operators 

W, zW,(H, Ho) are defined, if they exist, by 


W, =s-lim e" e "Ho 


EE t> to 


(s-lim = fstrong limit). 


This means that given any free motion e "oj 
there is an initial (t =0) state u, (= W, u) such 
that e "P, and e Dou are asymptotically 
equal at t= +0. W, are tisometric, intertwine 
the two dynamics: e "H W, = W}, e "ie, and 
map # (which is nothing but the *abso- 
lutely continuous subspace (Ho) for Ho) 
onto a closed subspace of 2. (HL The scatter- 
ing operator S is defined as S = W* W_ (A* is 
the Hilbert-space tadjoint of A). S commutes 
with Ho and maps states in the remote past 
into states in the distant future. One of the 
most important problems in scattering theory 
is to prove the tunitarity of S, or equivalently, 
Ran W, =Ran W_ (Ran=trange = timage). W, 
is called complete if Ran W, = #,.(H). The 
completeness of W, implies that S is unitary. 
As a typical example we consider the 1-body 
problem. Note that the 2-body problem re- 
duces to the 1-body problem by separating out 
the center-of-mass motion, which is free. Then 
H, = — ^ (the negative tLaplacian in R°), H = 
— A-- V, the operator V being multiplication 
by a real-valued function V(x), called the po- 
tential, and # = L,(R°). If V is short range, i.e., 
if, roughly speaking, V(x) = O(|x| * 5) (e>0) at 
oo, the wave operators are known to exist and 
to be complete (S. Agmon, Ann. Scuola Norm. 
Sup. Pisa, (4) 2 (1975)). If the potential V(x) 
is long range, i.e., if, roughly speaking, V(x)= 
O(Ix| ^) (e> 0) at oo, then the foregoing defini- 
tion of the wave operators has to be modified. 
For the Coulomb potential V(x)=c/x, for 
instance, one can adopt the following defini- 
tion of modified wave operators: 
Hr. =s-lim e" exp(—itHy — i(c/2)H; + logt). 


T tot 


It can be shown that the W, exist (which 
implies that the ordinary wave operators do 
not exist) and are complete: Ran W, = #,.(H) 
(J. D. Dollard, in [6]). The same result obtains 
for more general long-range potentials (H. 
Kitada, J. Math. Soc. Japan, 30 (1978); T. 
Ikebe and H. Isozaki, Integral Equations and 
Operator Theory, 5 (1982)). 

If the wave operators exist and are com- 
plete, they give tunitary equivalence between 
the fabsolutely continuous parts of H, and H 
(^ T. Kato [7]; 331 Perturbation of Linear 
Operators). 

In the foregoing discussion it was tacitly 
assumed that in dealing with scattering phe- 
nomena we adhere to states in 2£,.(Ho) and 
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H,(H). A more physically intuitive definition 
in the case of potential scattering (Hg = — A, 

H = Ho + V) of scattering states Z , (H) is the 
following: fe X (H) if and only if for any r>0, 


if? 
lim A le Dr dr D 


T5 to T jg 


(|, || =L,-norm), 


where F, is the (projection) operator of multi- 
plication by the characteristic function of 
[xeR? ||x| « r]. In general no inclusion rela- 
tions between X , (H) and #,.(H) are known. 

But for a wide class of potentials it is known 
that J£,.(H) as well as the tcontinuous sub- 
space of H coincides with Z(H) (in this case 
there is no tsingular continuous spectrum) (W. 
O. Amrein and V. Georgescu, Helv. Phys. Acta, 
46 (1973); C. Wilcox, J. Functional Anal., 12 
(1973); Amrein, in [8]). 

A purely abstract result in scattering theory 
may be noted. Let Ho and H be self-adjoint 
operators in an abstract Hilbert space # such 
that V= H — H is a *trace-class (*nuclear) 
operator. Then the generalized wave operators 
W. (H, Ho) — s-lim, , + o exp(itH)exp( — it Ho) 

P (Ho) exist, where P,.(H,) is the *projection 
onto J€,. (Ho). Since this statement is sym- 
metric in Hy and H, the "inverse" generalized 
wave operators W, (Ho, H) also exist, from 
which one can conclude that W, (H, Hp) are 
complete. Moreover, the invariance principle 
holds, which means roughly the following: 

If ¢ is a strictly increasing function on R, 
then W,(¢(H), #(Ho)) exists and is equal to 
W,(H, Ho). This result can be applied to poten- 
tial scattering when V(x)e L,(R?)'1 L,(R?) 
(Kato [7]; D. B. Pearson, J. Functional Anal., 
28 (1978)). 


C. Stationary (Time-Independent) Approach 


We again consider the 1-body problem as in 
Section B. V(x) is assumed to verify certain 
appropriate decay conditions at oo as the 

case may be. Consider the tresolvents Ro(z) — 
(Hy —z) ! and R(z)=(H —z) ! forzeC—R, 
which are well-defined bounded integral opera- 
tors on # = L,(R?). Here we note the follow- 
ing: [0, oo) is the tcontinuous spectrum of 

H, and H, (—o0,0) is contained in the tresol- 
vent set of Ha, and H has possibly 'discrete 
eigenvalues in [ —a,0) with (—oo, —a) con- 
tained in the resolvent set, where a is a posi- 
tive number. When z approaches a positive 
real value, Ro(z) and R(z) do not have limits as 
*bounded operators on #. But if we regard 
them as operators from L, , to Ly ., (L5 a= 
{u|(1+|x|)*u(x)eL2(R*)}), y> 1/2, they can be 
shown to have boundary values Ro(A + i0) and 
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RU x i0), 4» 0 (limiting absorption principle; — 
Agmon, loc. cit., for short-range potentials; for 
long-range potentials — R. Lavine, J. Func- 
tional Anal., 12 (1973); T. Ikebe and Y. Saito, 
J. Math. Kyoto Univ., 12 (1972); and Saito, 
Publ. Res. Inst. Math. Sci., 9 (1974). With these 
boundary values we can define “stationary” 
wave operators whose range is easily proved 
to coincide with 3€,.(H), and show their equal- 
ity to the time-dependent wave operators 
discussed in Section B, thus obtaining the 
completeness of the latter. 

H, is known to have generalized (improper) 
eigenfunctions oo(x, £) el" ` with generalized 
(improper) eigenvalues |é|*. The associated 
eigenfunction expansion is nothing but the 
Fourier integral expansion. An eigenfunction 
expansion, similar to the Fourier expansion, 
that diagonalizes H can be obtained by using 
generalized eigenfunctions 


9 +(x, č) 
= Po(x, £) - (R( £I? +10) Raat: 2) (x), 


which are the solutions to the Lippmann- 
Schwinger equation 


p(x, C) = Polx, č) 





| f etüdix-» 
— | Vives, Ody. 
4r Jg: Ix — yl 


A rough statement of this is the following: 
Let à ,(£) - 2n) ?? [ o, (x, Su(x)dx. Then 

lä = ||ull, (Hu) 5) = I$ 8($), and u(x) — 
(2n) ?? (o, (x, ü(£)d£ (— e.g. [4, XL6] fora 
more precise statement). 

In view of the fact that S commutes with Ho, 
we can show that S admits the following repre- 
sentation: Let à(£) - (2x) ?? f @o(x, £)u(x) dx be 
the tFourier transform of u. Then 


(Su) LÉI & ($0)(5) 
-aai | ; [EIT (6, ler ole ider, 


where 
TG, €) - Qn) ? | Pox, 6) Vx) ex, 6’) dx 


is the kernel of the so-called T-operator, which 
is a tcompact operator on L;(S?) under suit- 
able conditions on V(x). T(é, Ets related to 
the experimentally measurable total cross 
section (for incident momentum Cl o(£): 


suas Läb on du (o EI, 
TU: om, ole -2z?T(Ào, Ao) (A>0). 


The quantity f(A; c», o») is called the scattering 
amplitude and appears in the asymptotic 
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expansion of o , (x, č) as 


+ilšllxl 





94x, E — lEI: Â, 


Ix| 
where á—a/|a| for ae R?. 

An abstract version of the limiting absorp- 
tion principle and eigenfunction expansion 
is known as the Kato-Kuroda theory, for 
which the reader is referred to Kato and S. T. 
Kuroda in [6] and in Functional analysis and 
related fields, Springer, 1970, and to Kuroda 
(J. Math. Soc. Japan, 25 (1973)). 


D. Time-Dependent Approach 


Consider the same situation as in Section C. 
Since scattering is a time-dependent phenom- 
enon, it seems natural to develop scattering 
theory in a time-dependent fashion. Indeed, 
there is an approach to the completeness of 
wave operators that does not resort to any 
eigenfunction expansion results, but instead 
follows the temporal development of the wave 
packet e^ "^ y. The completeness of W, will be 
established if one can show that any ue %,,(H) 
orthogonal to Ran W, is 0. A crucial step to 
prove this is to find a clever decomposition of 
a wave packet into an outgoing and an incom- 
ing one, or to find projections P, such that 

P, +P_=I and P ,e "Ho goes to 0 as t> Foo. 
Some compactness arguments are also needed. 

To construct such a decomposition or pro- 
jections one looks at the scalar product x: é, 

x and ¢ being the position and momentum 
(operators), respectively. The main idea is that 
if this is positive, the particle will be outgoing 
(to infinity), and if negative, incoming (from in- 
finity). But since we work in the framework of 
quantum mechanics, this classical-mechanical 
intuition should be properly modified. 

Besides the completeness of wave operators 
it can also be shown through the above ap- 
proach that the singular continuous spectrum 
of H is absent. For details — [4, XI.17] and 
V. Enss, Comm. Math. Phys., 61 (1978). 


E. Partial Wave Expansion 


In this section we assume that the potential 
V(x) is central, i.e., V(x) is a function of |x| 
alone. Then the scattering operator S turns out 
to commute not only with H, but also with the 
tangular momentum operator L-2xxi ! V 
(vector product). The eigenvalues of L? =L- L 
(scalar product) are /(1 4- 1) (120, 1,2, ...), and 
those of L,, the third component of L, are m— 
—l, —(I—1),...,1— 1, Lif L? has eigenvalue 
I(I-- 1), while simultaneous eigenfunctions are 
given by suitably normalized tspherical har- 
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monics Y,,(o), o€S*. Let E, be the projection 
onto the subspace spanned by functions of the 
form XL. —)Uim(T) Yim(@) (r= |x| >0,@=x/r). H 
becomes an orthogonal sum of 3€, = E,3£. The 
aforementioned commutation property claims 
that E,S=SE,=S,, and the operator $; reduces 
to multiplication by a scalar function e??? in 
Af, (— Section C). 5,(A) is called the phase shift. 
Defining the partial wave scattering amplitude 
f(A) =(2iA) t (e?) — 1), one obtains the 
partial wave expansion of the scattering 
amplitude: 


f(so,of)- E QI -- Df()Picos6) 
i-o 


where @ is the angle between w and w’, and P, 
is a (Legendre polynomial. The total cross 
section is o(4) - 4n4 ? X5? 9 20 - II f (4)? 
[1-5,9]. 


F. Many-Body Problem (Multichannel 
Scattering) 


We consider only the 3-body case, which is 
complicated enough compared with the 2- 
(essentially 1-) body case. The complications 
are both kinematical and dynamical. The 
configuration of a 3-body system is given by 
a point in R?. Once we choose the center-of- 
mass coordinates, there is no kinematically 
natural way to choose the remaining six co- 
ordinates. In the 2-body case a freely moving 
particle in the remote past will be freely mov- 
ing in the distant future. But in the 3-body 
case there come into play various other dy- 
namical processes, such as capture, breakup, 
rearrangement, and excitation. 

The 3-body Hamiltonian is a self-adjoint 
operator in L,(R?) of the form 





1 

Fi 2m, Aid » Vi(x, — x5), 

where A, is the 3-dimensional Laplacian as- 
sociated with particle i, m; is the mass of par- 
ticle i, and each Vj(x) (= V;,(x)= Vj( —x)) is 

a real-valued function decaying at oo in R? 
(not in R?). If we remove the center-of-mass 
motion, H can be written in the form 


H-H,GI-IGH (Q =ttensor product), 


where Hp is the center-of-mass Hamiltonian in 
L (R°) representing the uniform free motion 
of the center of mass, and H is the Hamiltonian 
of relative motion in L,(R°). One should note 
as mentioned above that there is no unique 
natural way of choosing coordinates in R? and 
representing H, but there are many equivalent 
representations. Suppose, for instance, that 
particles 1 and 2 and particles 1 and 3 form 
tbound states (12) and (13) and that there are 
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no bound states between 2 and 3. We partition 
the whole system into clusters: (1) (2) (3), (12) 
(3), and (13) (2) (( ) represents a cluster and 
figures in ( ) are the particles forming the 
cluster). A channel is a partition into clusters 
together with a specified bound-state eigen- 
function. Take, for instance, channel (12) (3), 
and suppose y e L,(R?) is the eigenfunction 

in question. If we take x 2x; —x, and y 2x4— 
(m; m5) ! (m,x4 4 m5x;), then 








mix mx 
eva -e ^( : ^). 
mm m, +m, 
where m ! =m;!+(m, +m) n ! mj! 
m;!. Let us neglect the interactions between 
(12) and (3), i.e., set V;,4— V, =0 to define the 

cluster decomposition Hamiltonian 

1 1 
A203) = 7345 5 Oat Vi x(x). 
Let #4 2)(3)= L;(R?) (called the channel Hil- 
bert space consisting of functions of y), and 
define a mapping 1: 4%, ag> 3€ = LORD 
(functions of x and y) by (tf X(x, y) 2 v (x)ft y). 
The channel wave operators W, 2,334 are de- 
fined by 


Wi zua: -slim efe fano 
as isometries from J£, zua into #. Their 
ranges are not expected to coincide as in the 
2-body case. Waya and Wi. are 
similarly defined. Note that Hiyoo =”. 

If x and f are distinct channels, we have 

Ran W,, L RanW,, and Ran W,- L RanW,_, 
but no such relations exist between Ran W,, 
and Ran W,_ or Ran W,- and Ran W,,. De- 
fine, for channels « and fj, $,,: Hg > J£, by Sag 
=(W,..)*W,_. Now the scattering operator 

S for the 3-body system is defined as the 
‘direct sum of S,, acting in the Hilbert space 
E,D H: S= Eag D S,,. Naturally the ques- 
tion arises: Is S unitary? The first affirmative 
answer was made by L. D. Faddeev (Israel 
Program for Scientific Translations, 1965 (in 
English; original in Russian, 1963)), and later 
the work of J. Ginibre and M. Moulin (Ann. 
Inst. H Poincaré, A21 (1974)) and L. Thomas 
(Ann. Phys., 90 (1975)) came out. The method 
of these authors is stationary. There have also 
been some attempts using time-dependent 
methods. 


G. Inverse Problem 


The inverse problem in potential scattering 
may be formulated at least mathematically as 
follows: Given the scattering operator or scat- 
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tering amplitude, determine the potential(s) 
giving rise to the operator or amplitude. We 
consider here only the 2-body case (as to the 
many-body case, almost nothing is known). 
The central-potential case can be reduced to 
1-dimensional problems on (0, oo). In the 1- 
dimensional case the celebrated Gel'fand- 
Levitan theory (— 112 Differential Operators 
O) has long been known and has been success- 
fully applied even to nonlinear problems such 
as the "Korteweg- de Vries equation. In the 
3-dimensional case, however, the problem 
becomes difficult; so far there has not been 
any satisfactory theory comparable to that for 
the 1-dimensional case. The potential V(x) is a 
function R?—R. The scattering amplitude 

f (4; c, €) is a function f: R x S? x S? C. Let 
M be the mapping that takes V into f. The 
inverse problem deals with M ~'. Several ques- 
tions may be posed (in order of increasing 
difficulty): (1) Is M one-to-one? (2) When it is 
known that M is one-to-one and f is in the 
image of M, how does one (re-)construct the 
V that yields f? (3) What conditions charac- 
terize the image of M? Question (1) has been 
rather satisfactorily answered insofar as short- 
range potentials are concerned. Concerning 
questions (2) and (3), attempts have been and 
are being made to generalize the Gel'fand- 
Levitan theory, but it may be said that we 

are still at the beginning stage. References are 
[2,9,10] and R. G. Newton (J. Math. Phys., 
20 (1980); 21 (1981); 22 (1982)). 


H. Scattering for the Wave Equation 


Consider the twave equation u,,—-Au=0 in R°. 


The solution u(t)= u(t, x) is uniquely deter- 
mined by the initial data { f, g} = {u(0), u,(0)}, 
and U,(t){ f. gj = {u(t), u,(t)} defines the solu- 
tion operator U(t). The set of data { f. g] with 
finite energy: f(IVf |? - |g|?) dx < oo forms a 
Hilbert space J£,. Uj(t) is a unitary group 

on #. A similar description is possible for 
solutions of the wave equation in an exte- 
rior domain Q outside an obstacle with zero 
boundary condition. Denote the resulting 
Hilbert space and solution operator by # 
and U(t), respectively. Let J: Hg- be the 
identification operator defined by (J f. g])(x) 
—if.g] (x), xeQ. The wave operators are de- 
fined by W, =s-lim,_, ,,, U(—t)J Uo(t). The 
existence of W, is shown rather easily by using 
tHuygens's principle. As in Section B, we 
define the scattering operator S = W*W and 
say that W, is complete if Ran W, = J£,.(H), 
where H is the self-adjoint tinfinitesimal gen- 
erator of U(t): U(t) 2e DH The completeness 
of W, and the unitarity of S are proved on the 
basis of the abstract translation representation 
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theorem: Let U (t) be a unitary group on a 
Hilbert space #. Suppose there exist sub- 
spaces D, and D. , called outgoing and incom- 
ing subspaces, such that U(t)D , C D, for 
*tz0, (),,4U()D., = (0), and Le U(t)D 
is dense in J£. Then we have two unitary 
operators 8. : W — L,(R; N), where N is an 
auxiliary Hilbert space, such that 2, U (t): 
4 ,' is right translation by t, and D, (D_) is 
mapped onto L,(0, oo; N) (L;(—00,0; N)) by 
B, LR ). 

Turning to the concrete situation, one can 
study the detailed properties of S. The unique- 
ness theorem in the inverse problem is also 
obtained, to the effect that S determines the 
obstacle uniquely. The foregoing treatment of 
scattering is known as the Lax-Phillips theory 
(P. D. Lax and R. S. Phillips, in [6,8, 11]. 
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376 (XIX.15) 
Scheduling and Production 
Planning 


Production planning emerges in many situ- 
ations. Models of economic planning can be 
classified as (1) fiscal policy-oriented, (2) final 
demand-oriented, (3) structure-oriented, (4) 
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expenditure-oriented, and (5) industrialization- 
oriented. Types (2), (3), and (5) belong to pro- 
duction planning in a broad sense, as emphasis 
is placed on production in these models. 

A typical production planning theory of 
primary importance is activity analysis, which 
has made remarkable progress since its initi- 
ation by T. C. Koopmans [1]. Its principal 
theoretical content consists of tlinear pro- 
gramming. Most applications of linear pro- 
gramming are more or less concerned with 
production activities. Because of the additivity 
and divisibility of production as well as the 
limitation of production intensities, problems 
of production planning can be formulated as 


_ problems of linear programming. The meth- 


ods of flinear algebra are used to obtain an 
optimal production plan and are very impor- 
tant in modern economic analysis, because 
these methods not only provide practical 
algorithms but also clarify the role of price, 
especially in *dual linear programming 
problems. 

The originators of general equilibrium 
theory (— 128 Econometrics) failed to give an 
analytical demonstration of the existence of 
solutions of certain systems of equations of 
economic relevance. The existence of a deter- 
minate equilibrium was established first by 
A. Wald for a system of equations of the 
Walras-Cassel type. On the other hand, J. von 
Neumann [3] proved the existence of non- 
negative solutions o, ß, x;, y; for a system of 
inequalities 


m m 
a} gue Y) byx;, — j21,2,...,n, 
Kal = 


i=1 


BY. do Mr Z biy;, 112... 5 

j=l 
by reducing the problem to the proof of the 
existence of a fsaddle point (— 292 Nonlinear 
Programming A) of the function 


LK. Y)= 2. 2. byxiyl 2. p» ajXiyj 


by means of Brouwer's fixed-point theorem. In 
this result an equilibrium is defined in the 
broad sense that demand for goods does not 
exceed their supply, rather than requiring 
exact balance. 

A second important kind of production 
planning is related to both tinventory control 
(— 227 Inventory Control) and sales planning. 
An example is the minimization of 


R (z(t) + B max(dz/dt, 0)) dt 
0 


subject to the condition z(t) 2 r(t), where z(t) 
and r(t) are the output and demand, respec- 
tively, at time t. Stabilization of employment 
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and production can also be classified as a 
production planning problem of this kind, in 
which inventory holding is considered as a 
means for lessening the change of employment 
level. This is related to the problem of smooth- 
ing production by inventory control. Dynamic 
programming (— 127 Dynamic Programming) 
is very useful in dealing with problems of 
smoothing production. 

Production planning as a production man- 
agement tool is often embodied in scheduling. 
Consider a project that consists of n indivi- 
sible tasks (jobs or activities) J;, i— 1,2, ..., n, 
each requiring p; units of time for processing, 
where p; is given either deterministically or 
probabilistically. A precedence constraint 
(generally a *partial ordering) partially specify- 
ing the order in which these tasks are to be 
processed is also imposed by technical con- 
siderations: One attempts to find a schedule 
(i.e., a specification of the time to process the 
tasks Jj) consistent with the given precedence 
constraint. 

Well-known techniques developed for this 
purpose are PERT (program evaluation and 
review technique) [4] and CPM (critical path 
method) [5], in which the precedence con- 
straint is represented by an acyclic tdirected 
graph, called an arrow diagram, a project 
network, or a PERT network, such that each 
task J; corresponds to an arc of length equal 
to p;. An arrow diagram is illustrated in Fig. 1. 
The longest path from the start node to the 
end node in the network is called the critical 
path, and gives the minimum time necessary to 
complete the project. Following computation 
of the critical path by means of dynamic pro- 
gramming, computations are also made for the 
earliest (latest) start time, the earliest (latest) 
finish time, and the floats (e, the allowances 
for such start and finish times) of each task to 
be satisfied in order to complete the project 
within the indicated minimum time. These are 
used to review and control the progress of the 
project. In PERT the processing time of each 
task is probabilistically treated on the basis of 
three estimates: most likely, optimistic, and 
pessimistic. From these data other parameters, 





Fig. 1 

A, B, ..., M denote tasks, while the associated in- 
tegers are their processing times. Bold arrows indi- 
cate the critical path. The start node is on the left, 
the end node is on the right. 
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such as the probability of completing the 
project before the specified due date, are com- 
puted. In CPM, on the other hand, a mini- 
mum cost schedule to attain the given due 
date is obtained by utilizing tnetwork flow 
algorithms (J. E. Kelley [6], D. R. Fulkerson 
[7]) in which the processing time of a task is 
determined by linear interpolation between the 
normal time (achieved with low cost) and the 
crash time (high cost). 

PERT and CPM are used in various areas 
of application, e.g., civil engineering and the 
construction industry, shipbuilding, produc- 
tion of automobiles, machines, and electric 
apparatus, and management of research and 
development programs. PERT was originally 
developed by the US Navy to monitor and 
control the development of the Polaris fleet 
ballistic missile program, while CPM was 
developed by the RAND Corporation and the 
Du Pont Corporation, both in the late 1950s. 
Computers have been essential from the begin- 
ning, to handle the large amount of associ- 
ated data. A number of application program 
packages, each with some additional features, 
are currently available, e.g., PERT/TIME, 
PERT/COST, CPM, and RAMPS. 

The machine sequencing (scheduling) prob- 
lem arises when the resources, instruments, 
workers, and so forth, required to process a 
task are abstractly formulated as machines 
and if the restriction on the number of avail- 
able machines is taken into consideration 
(1.e., the conflict between tasks competing for 
the same machine at the same time must be 
resolved). Usually one machine is assigned 
to each task. Such a machine is either (a) 
uniquely determined for each task or (b) chosen 
from a given set of machines; in the latter case, 
there might be (1) parallel machines with the 
same capability or (ii) machines with different 
capabilities. The precedence constraints are 
also ramified into independent (i.e., no con- 
straint), in-tree, out-tree, series-parallel, and 
general partial ordering constraints. Each task 
has a ready time (release time) r; such that J; 
cannot be processed before it, and a due time 
d;. One is asked to find a schedule satisfying 
the above machine constraints, precedence 
constraints, and ready time constraints, while 
considering an optimality criterion that is a 
function of the completion time C; of J; (i= 
1,2, ..., n). Typical criteria for minimization 
are: (1) maximum completion time (makespan) 
Cmax = max; C; (2) flowtime (total completion 
time) F = 3:C;; weighted flowtime X w;C;, 
where w; 20 are weights representing the rela- 
tive importance of J;; (3) maximum lateness 
Lmax = Max; L;, L;- C;— d;; (4) total tardiness 
T= T, T; 2max(0, Lj), and weighted total 
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tardiness > w; T;; (5) number of tardy tasks 
U = U;, U;- 1 (if C; dj), 0 (otherwise), and 
weighted number of tardy tasks 3 w;U;. 

Numerous problems can be defined by 
combining the above conditions. Typical ones 
might be: the job-shop scheduling problem, in 
which n tasks are scheduled on m machines of 
type (a), and where the maximum completion 
time is minimized; the flow-shop scheduling 
problem, which is the same as the job-shop 
scheduling problem except that n — n'm tasks 
are divided into n' groups of m tasks processed 
on machine 1, machine 2, ..., machine m, re- 
spectively, in this order; the multiprocessor 
scheduling problem, in which the maximum 
completion time of n independent tasks on 
m parallel machines is minimized; the one- 
machine sequencing problem, assuming only 
one machine (with various types of precedence 
constraints and optimality criteria), and others 
[8,9]. 

These machine sequencing problems are 
examples of the combinatorial optimization 
problem (— 281 Network Flow Problems E), 
as the processing time p; is usually considered 
to be a given constant. Their computational 
complexity (— 71 Complexity of Computa- 
tions) has been extensively studied with an 
emphasis on the classification between those 
problems solvable in polynomial time and 
those that are *NP-complete, as summarized 
in [10]. Table 1 lists representative results for 
one-machine sequencing problems with r; — 0 
(i— 1,2, ..., n). The improvement of the algo- 
rithm efficiency is pursued for both polynomi- 
ally solvable problems and NP-complete prob- 
lems. tBranch and bound (— 215 Integral 
Programming D) is a common approach used 
to solve NP-complete problems such as the 
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job-shop and flow-shop scheduling problems 
[8,9]. Many approximation algorithms to 
obtain good suboptimal schedules in reason- 
able computation time are also known, and 
their worst-case and average accuracies have 
been analyzed [10], as these are important in 
practical applications. 

In more realistic scheduling situations, 
other factors, such as the set-up cost, balanc- 
ing of production lines, frequent modifications 
and updatings of project data, capacity of 
factories, manpower planning including the 
possibility of overtime and part-time employ- 
ment, should be taken into account. Both 
deterministic and probabilistic models have 
been proposed for these cases. Mathematical 
tools used to compute adequate schedules 
include *mathematical programming, *queu- 
ing theory, and fsimulation techniques. 
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Table 1. One-Machine Sequencing Problems with r;=0 


Optimality Precedence 
Criterion Constraint 
Cae Partial order 
2C Partial order 
Z wt Series-parallel 
Partial order 
Loss Partial order 
ET Independent 
Partial order 
Xw Independent 
È U; Independent 
In-tree, out-tree 
X wt Independent 
Independent 


Other 

Constraints Complexity 
None O(n?) 

None NP-complete 
None O(nlogn) 

p= 1 (i=1,2,...,n) NP-complete 
None O(n?) 

None Not known? 
None NP-complete 
None NP-complete 
None O(nlogn) 

p; 71 (i 1,2, ...,n) NP-complete 
Di Dr ZW O(nlogn) 


None 


NP-complete?” 





a. An algorithm with O(n? Pmax) running time is known, where pmax = MaX; p; 


b. An algorithm with O(n X p,) running time is known. 
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A. Fock Space 


For a complex Hilbert space K with dim K > 1, 
K*" denotes the n-fold tensor product of K 
with itself (where the vectors f, ©... & f, with 
f;€ K are total). Let EP be the projection 
operators on totally symmetric and antisym- 
metric parts of K®": 


EAO -O fa) 
=(n!) ! Ye, (P) fpa, 8 Wi © Spin» 


where the sum is over all permutations P, 

£, (P) 1 and e. (P) is the signature of P (+1 
for even permutations and —1 for odd permu- 
tations). The following orthogonal direct sum 
is called a Fock space (symmetric for E,, and 
antisymmetric for E ): 


FA(K)= y 9 E? Ke» 

n=0 
Here the term for n=0 is the 1-dimensional 
space C, and a vector €) represented by 1eC is 
called the vacuum vector in ¥,(K). The sub- 
space 7, (K), EU K®" is called the n-particle 
subspace. The operator N= X97 n, which 
takes the value n on 7, (K),, is called the num- 
ber operator. 

On the algebraic sum 


the creation operator a (f) for f € K is defined 
as the unique linear operator with domain D, 
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satisfying 
a'(f)E9 f, GH 
= (n+ ESDS O f 6... 9 fy 


For f € K, f denotes the element of the dual 
K* satisfying f(g)=(g, f) for ge K (the inner 
product is linear in the first entry). The annihi- 
lation operator a( f) is defined by 


a(f) EPf, & ... Of, 
=n Y e^ (f, f) ET" f, O j 9 fe 


where the tensor product of f,, k xj, appears in 
the jth term and £= +1 depending on which 
of + is taken in Z, (K). For n=0, a(f)Q is 
defined to be 0. The adjoint of a*(f) coincides 
with a( f) on D,. 

The creation and annihilation operators 
map D, into itself and satisfy the following 
commutation relations on D, : 


[a (f. a ()]« =[a( fi) al fl, =0, 
[a(fi) a * CF] =z fi). 


where [A, B]; — AB BA and F is used de- 
pending on the choice of + in ¥,(K). These 
relations are often called canonical commuta- 
tion relations for [ , ].. (CCRs) and canonical 
anticommutation relations for[ , |, (CARs). 

On ZZ (K), a*(f) and a(f) are bounded 
with la (NI 2 laCÉ)I UL On F,(K), both 
a^ (f) and a( f) are not bounded, though 
a'(f)N '? and a(f)N !? are bounded. 

On F,(K), 2"? (a*(f)-- a( f) is essentially 
self-adjoint. Let /( f) be its closure. The oper- 
ator W(f)=e' is unitary and satisfies the 
identity 


WO, IM W(fi + fi)exp( - iEm( fi, f2)/2). 


It also satisfies (W(f)O, Q) - exp( —4 ! | f |?). 

Let K, be a real subspace of K such that the 
inner product ( f, g) in K is real for any f and g 
in K, and K=K, +iK,. (K, is then a real 
Hilbert space.) The unitary operators U(f)— 
W(f) and V(f)— W(if) for f e K, satisfy the 
following Weyl form of the CCRs: 


U(f,)U( f= U(f, tf 
V(fy)V(fz)— HO, + fr), 
U(fi)V (fa) = VDU (fi exp(— if, f2)). 


The infinitesimal generators of the continu- 
ous one-parameter groups of unitaries U(tf) 
and V(tg) (te R) are denoted by oi f) and z(g) 
and satisfy the following CCRs: 

[0Cfi). 9(f2)] Y — [n(g1) n(g2)] V — 0, 


[o Cf) n(g) Y — iC, gY, 
where [A, B] - AB— BA and VcD,. 
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If Q is a linear operator on K, I'(Q) denotes 
the linear operator on 4, (K) defined as the 
closure of 2° Q9"| D, . It is bounded if 
IQI «1. F(Q)T(Q;) 2T (0; 07) on D,. If H is 
a self-adjoint operator on K, then T (e*”)= 
expit dT (H) defines a self-adjoint operator 
dT (H) on £, (KL usually called a bilinear 
Hamiltonian and denoted (a*, Ha). More 
explicitly, 


dT (H)EVf, & ... & f, 
=F EPS, oi... Hie Of,. 
j=l 


If U is a unitary operator on K, then 


T(UW(f)T(U) !  W(Uf). 


B. Second Quantization 


À single (scalar) particle in quantum mechan- 
ics is described by a wave function WGL, xe 
R?, considered as a unit vector in a Hilbert 
space K = L,(R?). The system consisting of n 
such identical particles is described by a totally 
symmetric function (x, ..., x,), xe R?, con- 
sidered as a unit vector in the totally sym- 
metric part E® KS” of the n-fold tensor prod- 
uct of the one-particle Hilbert space K, where 
the restriction to totally symmetric wave func- 
tions is referred to as Bose statistics. In a non- 
relativistic system, the Hamiltonian operator 
on a 1-particle space is 


T= —h*Qm) ! A,, 


called the kinetic energy (A, denotes the La- 
placian); on an n-particle space it is typically 
given by 
H,= —h?(2m)* Y A,» V(x- x), 
j=1 i<j 

where V is a 2-body potential. 

The totality of multiparticle spaces EP K 9" 
can be described in terms of the Fock space 
4 (K), the vacuum vector Q (no-particle 
state), and the annihilation and creation oper- 
ators, denoted by 


ainsi: 


etn | tenens 


Since the CCRs (for operator-valued 
distributions) 


DP (3), P(y)J- 2PP*0), V *()]- =0, 
[P (3), Y *(y)]- 2ó*(x— y) 


are a continuous generalization of CCRs for 
canonical variables in quantum mechanics and 
since ‘¥(x) comes from the wave function by 
way of quantization, the above formalism is 
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called second quantization. The Hamiltonians 
H, for all n can now be combined into the 
expression 


H= | ¥ wen 


1 
«5 [eov een 


x V(x)W(y d?xd?y, 


where the first term is dT (T). 

For particles such as electrons the system of 
n identical particles is described by a totally 
antisymmetric wave function, the total anti- 
symmetry being referred to as Fermi statistics. 
Then the antisymmetric Fock space 4 (KI 
can be used in exactly the same manner as 
F,(K) in the preceding case. 

The method of second quantization was 
introduced by P. Dirac [2] for the case of 
bosons and extended by P. Jordan and E. 
Wigner [3] to fermions. Electromagnetic 
waves, when quantized in this way, represent 
a system of photons, and the quantization of 
electron waves leads to the particle picture of 
the electron. The method of second quantiza- 
tion is intimately connected with the notion 
of fields, as shown below for free fields, and is 
the basis of the perturbation approach in field 
theory (— 150 Field Theory). 


C. Free Fields 


Let o; (j= 1, 2, 3) be tPauli spin matrices and 
6o=(6 1). Let p= D3_99,p" for a 4-vector p= 
(p°, p) with p — (p!, p?, p?) and p? =(m? 4- p?) 2. 
Let uj(a, A) be the irreducible unitary repre- 
sentation [m,,j] of A! on a Hilbert space 
K;— L,(R?, C"), (2p?) ? (m/p)9?!d* p) (> 258 
Lorentz Group C (3)). 

Consider first the symmetric Fock space 
4, (Kg). For any complex-valued rapidly 
decreasing C?-function f (fe (R^), let 


HEET 
(p=(p°,p), p°=(p? +m?)*?), 
A(fy=a*(fy+a(f) (- Luna: 


where p: x ^ p?x? — È} p/x and the bar 
denotes the complex conjugate. Then A(x) as 
an operator-valued distribution satisfies the 
*Wightman axiom and is called the free scalar 
field of mass m. It satisfies the Klein-Gordon 
equation 


(D, +m?)A(x)¥ «0 


(n.- eio - Y eyes). 
Zei 
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it has the 4-dimensional scalar commutator 
(Ac, A(y)] - Y = A,(x — y), 

and it has the two-point function 

i(A(x) A(y)Q, Q) = An (x — y). 


Here VH is any vector in D,; for example, Q is 
the vacuum vector, and the invariant distri- 
butions A,, and A; are defined by 


witzer" fee ap 


Am(x) = (2n) ? Jong du * d? p. 


If we define U(a, A(A))=T (uola, A)), then 
U (a, A) A(x) U(a, A)* = A(Ax +a). 


If ge F(R?) and he Y(R°), g(p) and h(p) 
obtained by substituting g(x)ó(x?) and 
— h(x)ó'(x?) into f in the defining equation 
of f above are in Ky. We then define 


o(g)-a*(g)-- a(g) (- | vs) 


n(h)=a*(h) + a(h) (=| xmas). 


The operator-valued distributions g(x) and 
n(x) are the canonical field and its conjugate 
field at time 0 for the free scalar field and 
satisfy the following canonical commutation 
relations: 


[o(x), o(y)] - V =[x(x), z(y) ]_¥ =9, 
[o(x), ay)] - V =i5°(x—y)¥. 

If we set T(t) & U (teo, 1) for ey =(1, 0, 0,0) 
and (a ® g)(x) 2 a(x9)g(x) for xe F(R) and 
ge F(R), then for YeD, 


T(t)o(g) T(t)? Pa(t) dt, 


— 00 


4ceae- | 


oC 


— A(x’ @ g) = | T(t)n(g) TA Falt) dt, 


or, equivalently, 

A(x) = T(x?)e(x) T(x^)*, 

0A(x)/Ox? = T(x9)n(x) T(x9)*. 

If A(x) is a classical field, then w(x) and z(x) 
are the value of A(x) and its time derivative at 


x? — 0, and they serve as initial data for the 
Klein-Gordon equation 


(L1, m?) A(x) «0. 


Consider next the antisymmetric Fock space 
F (Ky, @ Kip). For f, e (R^, C^) (C?-valued 
rapidly decreasing C^-functions), write f — 
Us f fe ms fas f sti Ja), define f. 
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as before, and let 
Vf) 2 m'? (a((f..-- (p/m)f.) 0) 
+a*(0@ (orf. —(B/m)osf .)] 


where o; =(? Als — ie. Then w,(x) as an 
operator-valued distribution satisfies the 
*Wightman axiom and is called the free Dirac 
field of mass m. In the present formulation, 
(V4, Y2} is a contravariant spinor of rank (1,0) 
and (V4, Y4} is a covariant spinor of rank 

(0, 1). This field satisfies the Dirac equation 


(z y" (0/0x") im) V (x) =0, 
u 
as well as the relations 


LW.(x), vy) ] . Y — 0, 
alx), Vs y)* ] P 


= d y"(0/0x")— n) oi ele" 
(ele ILL. Q) 


= d (EE n) oi A (x— y). 
H af 


Here Wei, A. and A; are as described 
above, and the y's are Dirac matrices in the 
following form (somewhat different from but 
equivalent to the usual form; — 351 Quantum 
Mechanics): 


SAL y-[ "äi 
1 0 o 0 


and the o’s are *Pauli spin matrices. 


D. Coherent Vectors and Exponential Hilbert 
Space 


In the symmetric Fock space ¥,(K), a vector 
of the form 


expf/z QOO Ce (n!) !^f98"e z (KI 


for f € K is called a coherent vector. The set of 
exp f is linearly independent (in the algebraic 
sense) and total. The inner product is given by 


(exp fi, exp f) - exp( fi. fa). 


Conversely, we can define 7, (K) abstractly by 
introducing this inner product into the formal 
linear combinations of expf, f € K, and by 
completion. In this sense, 7, (K) is also de- 
noted as exp K and is called an exponential 
Hilbert space [5]. Then 


exp)? K,= @ expK;,, 
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where exp 22? f is identified with & exp f;. 

If the number of indices is infinite, the right- 
hand side is the incomplete infinite tensor 
product containing the product of the vacuum 
vector Q. 

If K= fz K,du(å) and the measure p is 
nonatomic, then for any measurable set S in 
E, there corresponds a decomposition exp K = 
(exp K (S)) @ (exp K (S*)), where K(S)— 

[s K,du(A) and S° is the complement of S in 

E, and an associated von Neumann algebra 
R(S)= B(K(S)) & 1, where B(K(S)) is the set of 
all bounded linear operators on K(S). The 
system {R(S)} forms a complete Boolean lat- 
tice of type I factors on exp K. Coherent vec- 
tors are characterized by the property of being 
a product vector for ( R(S)) in the sense that 
for any S, Ae R(S) and A'e R(S*), the vector 

Y —exp f satisfies 


(AA, III" — (A*P, P) (APP, PP). 


In this sense, we can interpret exp K as a con- 
tinuous tensor product of exp K , and also, if 
Y= (V, du(A), exp V as a continuous tensor 
product of exp P, [5]. 
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378 (XII.14) 
Semigroups of Operators 
and Evolution Equations 


A. Introduction 


The analytical theory of semigroups was 
inaugurated around 1948 in order to define 
exponential functions in infinite-dimensional 
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function spaces. Then it was generalized to the 
theory of evolution equations as ordinary 
differential equations in infinite-dimensional 
linear spaces. 


B. The Hille-Yosida Theorem 


Let X be a flocally convex topological linear ` 
space, and denote by L(X) the totality of con- 
tinuous linear operators defined on X with 
values in X. A family (T;|t 2 0] of operators 
T,¢ L(X) is called a (one-parameter) semigroup 
of class (C°) or a strongly continuous semigroup 
if it satisfies the following two conditions: (i) 

T, T; = T,,, (the semigroup property), To=] (the 
identity operator); and (ii) Dm, T;x — T, x 
(Vxe X, Vt, 20). When X is a Banach space, 
(ii) is implied by w-lim, |, T;x =x (Vx e X), as 
proved by N. Dunford in 1938. In this case 
there exist constants M » 0 and B>0 such that 
(iii’) || ; || € Me” (Vt z 0). Hence, considering 

e *'T in place of T,, we can assume the equi- 
continuity: (iti) For any continuous seminorm 
p on X, there exists a continuous seminorm 

q on X such that p(T;»x) « q(x) (Vxe X, Vt 20). 
Such semigroups are called equicontinuous 
semigroups of class (C?) (abbreviated e.c.s.g. 
(C9). 


Example 1. X = L, (0, oo) with oo» pz 1. 
(T; x)(s) — x(t —- s). 


Example 2. X = L,(—00, œ) with co» pz 1. 





(T3) (s) -Qnt) 1? | es (: z- x(u)du, t»0, 


—00 


—x(s) t=0. 


Example 3. X = BC(— œ, oo). 


Ge k 
T) =e Y ar xls — qd t>0. 


Here 4 and u are positive constants. (For these 
examples, we have || T,|| x 1; hence (iti) is satis- 
fied.) For L, and BC — 168 Function Spaces. 
We assume in the remainder of the article 
that X is sequentially complete, that is, if a 


sequence {x,} of X satisfies lim, ,, — p(x, — Xm) 


=0 for every continuous seminorm p on X, 
then there exists a unique xe X such that 
lim, a P(x — x,) =0. 

The infinitesimal generator A of an e.c.s.g. 
(C°) {T,|t>0} is defined by 


Ax =lim t ! (T; — Dia, (1) 
tlO 


(This is also called the generator of T;.) Then 
we have the following results. 





378 C 
Semigroups of Operators, Evolution Equations 


(I) Differentiability theorem. For every 
complex number 4 with ReA- 0, the resolvent 
(4I — A) ! eL(X) exists and 


oo 


a—aytx=| e "Txdt  (VxeX), (2) 


0 
where the integration is Riemannian. Hence 
the domain D(A) of A is dense in X and coin- 
cides with the range R((A4I — A) !), and A isa 
closed linear operator such that the family 


{AAI A) y |470,n20,1,2,...] (3) 


is equicontinuous. 

(II) Representation theorem. Let J, = 
(I—n~' A) ! and consider the approximations 
to T: 


T x—e7" S (mt) !(ntJ,)"x, 
m=0 


Tx =(J,,-1)"X. 
Then 


T.x= lim Tx = lim Tx (5) 
uniformly on every compact set of t. 

(III) Converse theorem. Let a linear opera- 
tor A with both dense domain D(A) and range 
R(A) in X satisfy the condition (n1 — A) ! e 
L(X) for n2 1,2,.... Then a necessary and 
sufficient condition for A to be the infinitesi- 
mal generator of an e.c.s.g. (C9) is that the 
family of operators 


(I —n7 A) "In L2, ...;mz0,1,...] (3) 


be equicontinuous. Since such a semigroup 
{T,|t>0} is uniquely determined by A, we can 
write 7; 2 exp(tA). 

These three theorems together are called the 
Hille-Yosida theorem or the Hille-Yosida- 
Feller-Phillips-Miyadera theorem. 


Examples of Infinitesimal Generators. A = 
d/ds for example 1 above, A—2 !d?/ds? for 
example 2, and (Ax)(s) 2 A(x(s— u) — x(s)) for 
example 3. 


C. Groups 


An operator A in a Hilbert space X gener- 
ates a group (T;| —oo «t < œ} of tunitary 
operators of class (C?) satisfying T, T, = T;,, 
for —oo «t, s« oo if and only if A is equal 
to iH for some 'self-adjoint operator H (M. 
H. Stone's theorem, 1932). In a locally con- 
vex space, a necessary and sufficient condi- 
tion for a given e.c.s.g. (C?) {T,|t>0} to be 
extended to an equicontinuous group of class 
(C9) (T;| —oo «t < oo] is that the family (3’) 
be equicontinuous also for n= +1, 4-2, .... 
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D. Holomorphic Semigroups 


For an e.c.s.g. (C°) (1,1 2:0], the following 
three conditions are equivalent (K. Yosida, 
1963; the equivalence between (ii) and (iii) for 
Banach spaces was proved earlier by E. Hille, 
1948): (i) When t 0, 


T/x -lim h^ (T,,,— T)» 


exists for all xe X and ((Ct 17)" |n 1,2, ... 
and 0 «t « 1j is equicontinuous for a certain 
constant C » 0. (ii) When t>0, T; admits a 
convergent expansion T, given locally by T;x 
=P old 00 T? x/n!. The extension exists 
for |argA| € arctan(Ce ^ !), and the family of 
operators [e ^T; is equicontinuous in 4 for 
larg 4| «arctan(2 * Ce !) with some positive 
constant k. (iii) For the infinitesimal generator 
A of T,, there exists a positive constant C, such 
that ((C,A(AI — A) !)"] is equicontinuous in 
n=1,2,... and in 2 with Re(A4)21 e, £70. 
An e.c.s.g. (C?) {T} satisfying the above condi- 
tions is called a holomorphic semigroup. 

For example, introduce 


atic 
Lupi" qut. 
A>0, t>0, o>0, O«ca«l1, 
=0, A<0, 6) 


where the branch of z* is taken so that Rez" > 
0 for Rez» 0. Following S. Bochner (1949), 
we define 


a 


Lafe fs Txds t>0, 
0 
=x, t=0, (7) 


from a given e.c.s.g. (C?) {T,|t>0}. Then 
{T, ,|t>0} is a holomorphic semigroup (Yo- 
sida, T. Kato, and A. V. Balakrishnan, 1960). 
Its infinitesimal generator A, can be consid- 
ered as the fractional power ( — A} of — A, 
multiplied by —1. 

Fractional powers ( — AY, «eC, of operators 
have also been defined for operators A satisfy- 
ing the weaker condition than (3^) that (4(4— 
A) ! |4» 0] is equicontinuous (Balakrish- 
nan, H. Komatsu). If A is such an operator, 

— / —A generates a holomorphic semigroup 
and the unique uniformly bounded solution of 
the "elliptic" equation 


xy,——Ax, t0, lim X, Xo (8) 
to 


is the solution of 


d 


Kaes —J —AAx, t»0, limx,—xe, 
t20 


and therefore x, 2 exp( —t,/ —A)x, (Balakrish- 
nan). Equation (8) has also been discussed by 
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M. Sova and H. O. Fattorini from a different 
point of view. 


E. Convergence of Semigroups 


Let a sequence {exp(tA,)|n=1,2,...} of e.c.s.g. 
(C?) be equicontinuous as a family of operators 
eL(X). Then a necessary and sufficient con- 
dition for there to exist an e.c.s.g. (C?) exp(tA) 
such that lim, ,,(exp(tA,))x =(exp(tA))x uni- 
formly on every compact interval of t is that 
lim, Lol — A,) ! x 5 J, x exist (for some 4o 
with Re A, >0 and for all xe X) and be such 
that R(J; ) is dense in X (H. F. Trotter, Kato). 


F. Miscellaneous Semigroups 


(i) Distribution semigroups. The semigroup 

of translations (T; x)(s) 2 x(t--s) in X = 
L4, ( —00, oo) is not continuous and hence not 
measurable in t. However, T;x is an X-valued 
distribution. For semigroups (7; such that T,x 
is an X-valued distribution for xe X, an ana- 
log of the Hille-Yosida theorem is known 
(J.-L. Lions, 1960). It has been generalized to 
ultradistribution semigroups by J. Chazarain 
and to hyperfunction semigroups by S. Ouchi. 

(ii) Dual semigroups. The above semigroup 
IT of translations in L,(—oo, oo) is obtained 
as T, = S* from the e.c.s.g. (C9) {S,} defined by 
(S,x)(s) 2 x(s — t) in L,(—0oo, oo). Let B=d/ds be 
the infinitesimal generator of (S). The restric- 
tion of {S*} to the space of uniformly con- 
tinuous functions, which is the closure of the 
domain D(B*) of the dual B* in L,,( —oo, oo), 
is an e.c.s.g. (CHL This fact holds for the semi- 
group {S*} of an e.c.s.g. (C9) (Sj) in a Banach 
space X in general (R. Phillips, 1955) and also 
in a locally convex space. 

(iii) Locally equicontinuous semigroups. The 
infinitesimal generator A of the semigroup of 
translations (T,x)(s) 2 x(t +s) in X 2 C(—oo, oo) 
is d/ds. A has no resolvent since all complex 
numbers are eigenvalues of A. {T,} is not 
equicontinuous but locally equicontinuous, 
i.e., (,|0xtxt,) is equicontinuous for any 
t,» 0. For locally equicontinuous (C?) semi- 
groups an analog of the Hille-Yosida theorem 
is obtained by using the notion of generalized 
resolvents (T. Kómura, 1968; Ouchi, 1973). 

(iv) Differentiable semigroups. The notion of 
the holomorphy of semigroups in Section D is 
weakened to the differentiability. A character- 
ization of a semigroup {T,} such that T,x is 
infinitely differentiable in t>0 is given by 
using the resolvent of the infinitesimal gen- 
erator (A. Pazy, 1968). 

(v) Nonlinear semigroups. For a (C°) semi- 
group (1;) of contractions Oe, | Tx — T; y|| < 
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|x — yl| for x, ye X) in a Hilbert space X, an 
analog of the Hille-Yosida theorem 1s known 
(Y. Kómura, 1969). This result has been par- 
tially extended to Banach spaces (— 286 Non- 
linear Functional Analysis X). 


G. The Evolution Equation 


Let T,=exp(tA) be an e.c.s.g. (C9). Then for 
xe D(A), 


T'x = AT,x(= T, Ax). (9) 


Considered in suitable function spaces, the 
tequation of heat conduction (A= A =the 
tLaplacian), the tSchrédinger equation (A= 
yol (A— V(x))), and the twave equation 
given in matrix form 


JC) ol) 


are all of the form (9). For a linear operator A, 
in X depending on t, the ordinary differential 
equation in X 


Ae Ax f(t), ¢20, (10) 


is called the evolution equation. A family of 
operators {V(r,s)|r>s 20} in X which gives 
general solutions to the homogeneous evolu- 
tion equation 


X — AX, (11) 


(Le., for any 520, ae D(A,), x, 2 V(t, s)a is a 
solution to (11) for x,— a) is called the evolu- 
tion operator associated with the generators 
(Aj). An evolution operator Hir, s)) satisfies 
(i) V(r,r) 2 1, (11) V(r, s)V (s, t) - V(r, t). The 
solution to (10) is formally expressed by 


x, V(t, ER Vit, s)f(s)ds. (12) 
0 


H. Integration of the Evolution Equation 


For equation (11) we have the following result 
(Kato, 1953; Yosida, 1966). Assume the follow- 
ing four conditions: (i) D(A,) is independent of 
t and dense in the Banach space X, and for all 
a>0, (I—«A,) ! eL(X) with the estimate |(J— 
xA) ! | <1; Gi) B,,—( — A) — Aj) * is uni- 
formly bounded in the norm for 0<s, t< ġ; 

(ii) Eri IR, cc Bulle N, where N is inde- 
pendent of the partition (02 t5 <t; <... «t,— 
ly; (iv) B, o is weakly differentiable with respect 
to t such that the differentiated operator 

OB, o/ôt is strongly continuous in t. Under 
these assumptions, we can prove that for xo € 
D(Ay), the limit V(t, 0)xo —s-lim, ,,, V(t, O)xo, 
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with 


Sch [kt] [kKt]NV V! 
DOE 
1 [kt] - 1 V! 
Dp) 
1 /[ks]+1\\7! 
(rr 7 ) 
qp RE 
k k 


(0x sx tl), exists and gives the unique solu- 
tion of (11). If f(t) ts continuously differenti- 
able, the right-hand side of (12) exists and 
gives a unique solution to the inhomogeneous 
equation (10). 














I. The Evolution Equation of Parabolic Type 


Equation (10), for which every A, is the in- 
finitesimal generator of a holomorphic semi- 
group, is said to be of parabolic type by ana- 
logy to parabolic partial differential equations. 
Under weaker conditions, especially without 
the condition that D(A,) is independent of t, 
the existence of solutions of an equation of this 
type is obtained. Moreover, differentiability or 
analyticity of solutions follow from some 
natural assumptions. 

(i) Existence of weak solutions. Let X bea 
Hilbert space. For te(0,/], let V, be a subspace 
and at the same time a Hilbert space with 
respect to a norm |||: |||, stronger than ||: |. 
Since the form (A,x, y) is tsesquilinear (linear 
in x and antilinear in y), we get a sesquilinear 


functional a(t,-,-) on V, x V, such that 
a(t, x, y= —(A,x, y), x, ye D(Aj), 


if D(A,) is dense in V, with respect to |||: ||, and 
if 


A,X, YS CUM lll. 


V, a(t,-,-) should be measurable in a certain 
sense. A solution x, of the equation (10) in 
[0,7] satisfies 


I I 
| a(t, EN (x, v;) dt 
0 0 


! 
-| (f(t), v) dt + (Xo, Vo) (13) 
0 


X, ye D(A,). 


for any differentiable X-valued function v, such 
that v,e V, v, =0, {6 lv, |? dt < co, and le Ivi? dt 
« oc. A solution x, of (13) is called a weak 
solution of equation (10), though it does not 


necessarily satisfy (10). If the relation 
alt, x, x) All Sala, seh, 


holds for some 4, az 0, a weak solution of (10) 
in the sense of (13) exists for a given xy e X 


1410 


(Lions, 1961). In order to obtain the unique- 
ness or the differentiability of weak solutions, 
we need some additional conditions. 

(ii) Some properties of strong solutions. Let 
X be a Banach space. Let every semigroup 
(Tf?! generated by A, be holomorphic in a 
complex sector |arg 4| x 0, 0 7 0, independent 
of t. Suppose one of the following conditions 
holds: (1) For some x, 0 «a « 1, D(A?) is in- 
dependent of t and for 1 —« « p <1, 
\(AZ—ADAG*|<C'lt—sP, — Gse[0,1] 

(P. E. Sobolevskti, 1958-1961; Kato, 1961); 
(2) A, + is differentiable in t, 

ld A, !/dt —dA; !/ds| <C’|t—s|? 

for some C'-0,0«f <1, and 


^ 


e s 
x (A.— 4) ! 
ct 


N 
ES a 
[4| 

















for every 4:|arg4| 2 x/2— 0 for some N, 0< 
& « 1 (Kato and H. Tanabe, 1962). Then a 
differentiable evolution operator {V (t, all as- 
sociated with (11) exists. 

The most interesting property of evolution 
equations of parabolic type is the analyticity 
of solutions. If A, is holomorphic in t in a cer- 
tain sense, then the solutions are holomorphic 
(Tanabe, 1967; first noted by Komatsu, 1961). 
Furthermore, a characterization of evolution 
operators {V(t, s)} holomorphic in some com- 
plex neighborhood of [0,7] (called holomorphic 
evolution operators) is obtained by using the 
resolvent of A, (Kato and Tanabe, 1967; K. 
Masuda, 1972; — [8]). 


J. Application to Semilinear Evolution 
Equations 


The evolution equation with a nonlinear addi- 
tive term f(t, x,):x;=A,x,+ f(t, x,) can be writ- 
ten as an inhomogeneous integral equation x, 
=V(t,0)xo + fó V(t, s) f(s, x,) ds in the Banach 
space X, by means of the evolution operator 
{V(t, s); introduced in Section G. The exis- 
tence, differentiability (Kato, H. Fujita, and 
Sobolevskii, 1963-1966), and analyticity 
(Masuda, 1967) of solutions of the Navier- 
Stokes equation has been obtained by reduc- 
ing it to an integral equation of this type. 

Concerning quasilinear equations in which 
A, depends on x,, the existence, differentia- 
bility, and analyticity of their solutions have 
been discussed. 
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379 (X.18) 
Series 


A. Convergence and Divergence of Infinite 
Series 


Let {a,} (n—1,2,3,...) be a sequence of real or 
complex numbers. Then the formal infinite 
sum d, +a, +... is called an infinite series (or 
series) and is denoted by 2, a, or La,. The 
number a, is the nth term of the series Xa,, 
and s,— a, +4,+ ... +a, is the nth partial sum 
of Za, Also, for a finite sequence a,, a ,...,a,, 
the sum a, t à5 4- ... t a, is called a series. To 
distinguish these two series, the latter is called 
a finite series. In this article, series means an 
infinite series. If the sequence of partial sums 
{Sa} tconverges to s, we say that the series Za, 
converges or is convergent to the sum s and 
write 357 ,a,—s or Da,=s. If the sequence 
{s,} is not convergent, we say that the series 
diverges or is divergent. In particular, if {s,} is 
divergent to +00 (—oo) or *oscillating, we say 
that the series is properly divergent to +00 
(—00) or oscillating, respectively. 

The notation X a, is customarily used for 
both the sum s of the convergent series and the 
formal series, which may or may not be con- 
vergent. When the series is convergent, the 
sum is sometimes called the Cauchy sum in 
contrast to the “summations” of series, which 
are not necessarily convergent (— Sections K 
ff.. 
Applying the tCauchy criterion for the 
convergence ofa sequence, we see that a neces- 
sary and sufficient condition for Xa, to be 
convergent is that for any given e£» 0, we can 
take N sufficiently large so that 


\Sm— Bal =|@n41 + cau <E 


for all m, n such that m» n» N. Hence if Da, 
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converges, the a, —0 as n— oo, but the converse 
is not always true. 

Elementary properties of the convergence of 
series are: (1) If a, and X b, converge to a, b, 
respectively, then X (a, + b,) converges to a+b. 
(2) If Xa, converges to a, then È ca, converges 
to ca for any constant c. (3) Suppose that {bm} 
is a subsequence of (a,) obtained by deleting a 
finite number of terms a, from {a,}. Then È bm 
is convergent if and only if Xa, is convergent. 
(4) When a series 2 a, converges to a and {b,,} 
is a sequence such that b, =a; +43 +... +4,» 
b2=G,. 41 Feat, Fan ba, 414-004 
a,,,---, then 2b, also converges to a. The 
converse, however, is not always true. For 
example, 1 — 1 -- 1 — 1 7 ... is oscillating, but 
(1—1)-(1— 1) 4... zs 


B. Series of Positive Terms 


Suppose that Za, is a series of positive (or 
nonnegative) terms. Since its partial sums s, 
form a tmonotonically increasing sequence, 
the series is convergent if and only if {s,} is 
bounded. For example, the series Sg; n`? 
(p» 0) converges if p» 1 because s, « 2?! / 
(2?! — 1), whereas it diverges if p < 1 because 
$5» > 1 - (m + 1)/2. The geometric series 

Y 1a"! (a0) converges for a «1 because 
s, — (1 — a")/(1 — a), and diverges for az 1 be- 
cause s, >n. 

Some criteria for the convergence of a series 
La, of nonnegative terms are: (1) If {a,} is 
monotone decreasing, then the series 3 a, and 
2:2'a,. have the same convergence behavior 
(Cauchy's condensation test). (2) Suppose that 
f(x) is a positive monotone decreasing func- 
tion defined for x 2 1 such that f(n) — a, (n= 
1,2,...). Then the series Za, and the inte- 
gral (? f(x) dx have the same convergence 
behavior (Cauchy's integral test), for example, 
Xn? (p>0) and f? x "dx. (3) If for any posi- 
tive constant k we have a, < kb, except for a 
finite number of n, then the convergence of 
È b, implies the convergence of £ a, If kb, < 
a, and È b, diverges, then Za, also diverges 
(comparison test). (4) Let a, — 0 and b, 0. If 
à, 1/0, € b, /b, except for a finite number of 
values of n and È b, converges, then $ a, also 
converges; if a,, ,/a, Z b, /b, and È b, di- 
verges, then >) a, also diverges (— Appendix A, 
Table 10). 


C. Absolute Convergence and Conditional 
Convergence 


A series > a, (with real or complex terms a,) is 
called absolutely convergent if the series È le, 
is convergent. If a convergent series is not 
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absolutely convergent, then it is called con- 
ditionally convergent. An absolutely convergent 
series is convergent. A real series } a, whose 
terms have alternating signs is called an alter- 
nating series. An alternating series > a, is 
convergent if the absolute values of terms lo, 
form a monotone decreasing sequence which 
converges to zero (Leibniz’s test). An abso- 
lutely convergent series remains absolutely 
convergent under every rearrangement of 
terms and retains its sum under the rearrange- 
ment (Dirichlet’s theorem). If a series with real 
terms is conditionally convergent, then it is 
possible to rearrange its terms so that the 
rearranged series converges to any given num- 
ber, diverges to +00 (or —oo), or is oscillating 
(Riemann’s theorem). A convergent series 
whose convergence behavior is unaffected by 
rearrangement and whose sum remains un- 
changed is called unconditionally convergent 
(or commutatively convergent). A real or com- 
plex series is unconditionally convergent if and 
only if it is absolutely convergent. The notion 
of infinite series can be extended to any com- 
plete tnormed linear space, and absolute con- 
vergence can be defined by replacing the ab- 
solute values of the terms by the norm of the 
terms. However, in general, unconditional 
convergence is not always equivalent to ab- 
solute convergence. 


D. Abel's Partial Summation 


Let (a9,a,,45, ... and (bo, b,, ba... be 
arbitrary sequences, and put 4,— ao +4; + 
... +a, for nz 0. Then the following formula 
of Abel's partial summation holds: 


ntk ntk 
3 a,b,— Kä A,(b, Blau 
vent] vent] 


+ Anti 


for any n>0 and any k 21; this formula also 
holds for n= —1 if we put A_, =0. 

Abel’s partial summation enables us to 
deduce a number of tests of convergence for 
series of the form X:a,b,. In particular, the 
following criteria are easy to apply: 
(1) Xa,b, is convergent if Za, is convergent 
and if the sequence {b,} is monotone and 
bounded (Abel's test). 
(2) Xa,b, is convergent if the sequence {s,} of 
partial sums of X a, is bounded and if {b,} is 
monotone and converges to zero (Dirichlet's 
test). 
(3) Xa,b, is convergent if X (b, — b,,,) is ab- 
solutely convergent and if Za, is (at least 
conditionally) convergent (test of du Bois- 
Reymond and Dedekind). 

For example, criterion (2) implies that if (5,j 
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is monotone and converges to zero, then the 
power series È b,z" of a complex variable z is 
convergent on the unit circle |z| — 1 except at 
most for z=1; the case z= —1 gives Leibniz's 
test for alternating series (— Section C). 


E. Double Series 


A sequence with two indices, i.e., a mapping 
from the Cartesian product N x N of two 
copies of the set of natural numbers N to a 
subset of the real or complex numbers, is 
called a double sequence and is denoted by 
{Amn} OF {Amn}. If there exists a number / such 
that for any positive e there is a natural num- 
ber N (c) satisfying |a,,, —!|<e for all m> N (c) 
and n> N (c), then we say that the sequence 
(a,,] has a limit (and write lim, n> amn = 1. 
This limit should not be confused with re- 
peated limits such as lim, , (iMm amn). If 
lim, ,,, Amn = O uniformly in n and lim, ,,, «, 
=], then Dm. u.a a, =l. For a given 
double sequence {amn}, the formal series 

Dm n=1 mn 18 called a double series and is some- 
times denoted by X amn. In contrast with 
double series, the ordinary series discussed 
previously is called a simple series. 

Given a double series È dmn, when the double 
sequence of partial sums s,,,— Än Än Ou 
is convergent to s, then X amn is said to be 
convergent to the sum s. On the other hand, 
if San is not convergent, > amn is said to be 
divergent. If 27? , dmn converges to b, for 
each m, then £z, b, = 35-1 (X5. dmn) is called 
the repeated (or iterated) series by rows. If 
X 5-, d, Converges to c, for each n, then 
Ate Done (um =1 Amn) is called the re- 
peated (or iterated) series by columns. Even if 
two repeated series by rows and columns are 
convergent, the two sums are not always iden- 
tical, and È Gen is not always convergent. 
However, if the double series $ amn is conver- 
gent and E. amn is convergent for each m, then 
the repeated series by rows is convergent to 
the same sum. A similar statement is valid for 
the repeated series by columns. 

Suppose that we are given a double series 
È amn of nonnegative terms. If any one of 
Dm. n amn Zum Zum Amn aNd 22, Lim Amn IS cOnver- 
gent, the other two converge to the same sum. 
If the diagonal partial sum Spm = 22/—; 271-1 Ga 
converges to a, then the double series È amn 
also converges to a. 

If X |a,,,| converges, the double series £ a,,,, 
is called absolutely convergent, whereas if > amn 
converges but not absolutely, then $ amn is 
called conditionally convergent. If > an is 
absolutely convergent, then any series ob- 
tained from È on by arranging the terms in an 
arbitrary order is convergent to the same sum. 
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F. Multiplication of Series 


The series ZZ, c,, where c, =a; bp + a; b, , 
T... a,b,, is called the Cauchy product of 
two series 22,7, a, and Eroi b,. (1) Let Xa, and 
È b, be two convergent series and A, B be the 
sums of these series. If their Cauchy product 

X c, is also convergent, then it has the sum C= 
AB (Abel’s theorem). (2) If at least one of the 
two convergent series 2X a, and X b, with the 
sums A, B, respectively, is absolutely conver- 
gent, then their Cauchy product È c, is also 
convergent and has the sum C= AB (Mertens's 
theorem). (3) If Za, and È b, are absolutely 
convergent, then their Cauchy product X c, is 
absolutely convergent (Cauchy's theorem). (4) 
Let Xa, and È b, be two convergent series 
with the sums A, B, respectively. If {na,} and 
(nb,) are bounded from below, then È c, is 
convergent and has the sum C — AB (Hardy's 
theorem). 


G. Infinite Product 


Let {a,} be a given sequence with terms a, # 
0 (n2 1,2, ...). The formal infinite product 

dur das... is denoted by II, a,. We call 
D,7—4,'05'...: a, its nth partial product. If the 
sequence {p,} is convergent to a nonzero limit 
p, then this infinite product is said to converge 
to p, and p is called the value of the infinite 
product. We write [Ta,— p. If {p,} is not con- 
vergent or is convergent to 0, then the infinite 
product is called divergent. Sometimes we 
consider the infinite product [J] a, with a, — 0 
for a finite number of n's; and then by conver- 
gence or divergence of [I a, we mean that of 
the infinite product I1a,, where the sequence 
{aj} is obtained by deleting zero terms from 
{a,}. Usually we do not treat an infinite prod- 
uct with a, — O0 for an infinite number of n’s. 

A necessary and sufficient condition for 
II, a, to be convergent is that for any posi- 
tive e there is a number N such that |p,,/p, — 
1| «& for all n, m» N. If II a, converges, then 
a, 1, but the converse is not always true. 

It is often convenient to write an infinite 
product as [](1+a,). Then [](1 +a,) and 
Dd log(1+a,) have the same convergence be- 
havior, where the imaginary part if of the 
logarithm is assumed to satisfy 0 x |0| <n. If 
a, Z0, convergence of [T(1--a,) implies con- 
vergence of 2:a,, and vice versa. 

If TI(1 +|a,]) converges, then TI(1- a,) is 
said to be absolutely convergent. An absolutely 
convergent infinite product is also convergent, 
and the value of the infinite product is un- 
changed by the alteration of the order of 
terms. 
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H. Termwise Differentiation of Infinite Series 
with Function Terms 


Uniform convergence of an infinite series 

È f, (x) is defined by uniform convergence of 
the sequence of the partial sums ZE, f(x) 

(— 435 Uniform Convergence). If the infinite 
series > f,(x) defined on an interval J of the 
real line is convergent at least at one point of I 
and X f, (x) is convergent uniformly in J when 
the derivatives Cl exist, then X f, (x) is con- 
vergent to f(x) uniformly in I, and X. f,(x) is 
termwise differentiable, that is, f (x)= X f, (x). 
If the q,(z) are holomorphic in a complex 
domain D and X ,(z) converges to (z) uni- 
formly on every compact subset of D, then 

È o, (z) also converges to q'(z) uniformly on 
every compact subset of D (Weierstrass's 
theorem of double series). (For termwise inte- 
gration — 216 Integral Calculus.) 


I. Numerical Evaluation of Series 


In some special cases, we can express the nth 
partial sum s, of a series 2a, as a well-known 
function of n. Specifically, if Za, is an arith- 
metic progression 25 (a -- (k — 1)d) or a geo- 
metric progression 3? aq* !, we have 


_a(q’—1) 


n 
5,7; (2a + (n— Dd) Sn SCH 


respectively. If |g| « 1, then ZZaggf converges 
to a/(1 — q). If B,,,(x) is the (r+ 1)st tBer- 
noulli polynomial, then s, — 1" - 2" - ...-- n" — 
TR. 6) T! /(r + 1). This sum was studied by 
J. Bernoulli, who gave formulas up to r — 10 in 
his Ars conjectandi. 

In the series È u,,, if we can find another 
sequence {v,} such that u,=v,—v,-,, then s, 
=U; Hat, Fins v,— v9. For example, if u, 
— n(n 4 1)(n 4 2), then v, =n(n+ 1) (n - 2)(n 
3)/4 and s, =v, because vo —0 (— 104 Dif- 
ference Equations). Series with trigonometric 
function terms are calculated analogously. 

There are cases where the sum X a, itself can 
be expressed in a satisfactory form although 
we cannot find an appropriately simple ex- 
pression for each partial sum s,. For example, 
C(r) - A. 1/m’ can be represented by tBer- 
noulli numbers if r is even. In particular, ¢(2) = 
12/6, ((4) « 1*/90 (— Appendix A, Table 10). 

If an infinite series converges rapidly, we can 
get a good approximation by taking a suitable 
partial sum. On the other hand, if the series 
converges less rapidly, an effective means for 
evaluating series is afforded by transformation 
of series. If the kth tdifference is exactly zero, 
then 
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Since the absolute value of finite differences 
often decreases rapidly, it is sometimes con- 
venient to consider the series whose terms are 
the differences of the original series. One finite 
difference method is Euler's transformation of 
infinite series. In particular, the formula 

NP do 


(—1)a, = D 


d = 2n*1 


MMs 


k 


is useful for numerical calculation of sums of 
slowly converging alternating series. In numer- 
ical calculation of the series, we usually start 
calculating the numerical values of the first few 
terms; we then apply such transformations as 
Euler’s to the remainder, and calculate the 
partial sums of the transformed series. 

When we calculate the sum of an infinite 
series approximately, we must estimate the 
error, i.e., the remainder that must be added to 
yield the sum of the series itself. We can esti- 
mate the maximum error by derivatives or 
differences of higher orders. We also have the 
transformations of Markov and Kummer. In 
the former, every term of the series is repre- 
sented by convergent series, and in the latter, 
the given series is reduced by subtracting an- 
other convergent series, which has a known 
sum and similar terms [1]. 


J. Infinite Series and Integrals 


In numerical calculation of functions, we 
sometimes use the Euler-Maclaurin formula 


[5]: 
1 xto 
HERE f(z)dz 
e » 


Eso.) +R 


R=-0| al 5 m BN 2) ront + cz) dz, 


where 


zs B,(í—z—-1) €<z, 
Bea ise 


The speed of the convergence for this formula 
is greater than that for Taylor’s expansion 
when £o is large, since the terms of the for- 
mula are *Bernoulli polynomials B,(€) in 
0<é<1. We also have Boole's formula, with 
*Euler polynomials as its terms [4]. The for- 
mulas discussed in this section are also used to 
calculated approximately the partial sums of 
infinite series. 

Another method of evaluating sums of 
infinite series analytically entails transforming 
infinite series to definite integrals using the 
tresidue theorem. If an analytic function f(z) is 
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holomorphic except at poles a, (n= 1,2, ..., k) 
in a domain bounded by the closed curve C 
and containing the points z 2m (m— 1,2, ..., 
N), then 


5 5 f(m) al x(cot 2z) f(z) dz 
C 


m=1 


— > Res[z(cot 2z) f(z) ].—., 
n=1 


When the left-hand side of this equation is 
replaced by 


X cuf 


we replace cot zz by coseczz. Res[F(z) ],-, is 
the tresidue of F(z) at z - a. The line integral 
along C is often calculated easily by choosing 
a suitable deformation of C. Sometimes it 

can be shown immediately that the integral 
along C is zero, or its asymptotic value can be 
evaluated by the *method of steepest descent. 


K. History of the Study of Divergent Series 


Mathematicians in the 18th century did not 
concern themselves with the question of 
whether tseries were tconvergent or ‘divergent. 
This indiscriminateness led to various con- 
tradictions. In 1821 an exact definition of the 
notion of convergence of series (Section A) was 
given by A. L. Cauchy; since then, mathema- 
ticians have mostly concerned themselves with 
convergent series. However, since divergent 
series appeared in many problems in analysis, 
the study of such series could not be neglected, 
and it became desirable to give a suitable 
definition of their sum. Although some results 
were given by L. Euler, N. Abel, and others, it 
was during the latter part of the 19th century 
that methods of summation of divergent series 
were studied systematically. This study consti- 
tuted a new branch of mathematics. 

In the following sections, some important 
methods of summation of divergent series 
are mentioned. Cesaro’s method (-— Section 
M) was the forerunner of the theory whose 
general foundation ts now the theory of linear 
transformations. 


L. Linear Transformations 


For a sequence {s„} (n— 0, 1, 2,...) of real or 
complex numbers, assume that o, = 2/59 anisi 
converges for n=0, 1,2, ..., where (a;,) is a 
given matrix (i,k —0, 1,2,...). The mapping 

T :(s,] 5 1o,] is called a linear transformation, 
and {a,} is called the transform of {s,} under 
T. If Ge matrix satisfies a;, =Q (k >i), then T is 


defined for any sequence {s,} and T is said to 
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be triangular. If the transform {o,} under T is 
defined and convergent whenever (s,) con- 
verges, then T is called a semiregular transfor- 
mation. If in addition {o,} has the same limit 
as {s,}, then T is called a regular transforma- 
tion. If for any bounded sequence {s,} the 
transform 16,j is defined and convergent, then 
T is called a normal transformation. If T is tri- 
angular and the transform {a,} of {s,} under 
T is divergent to oo whenever s,— oc, then T 
is called a totally regular transformation. 

Let T be a regular transformation. If for at 
least one divergent sequence (s,) the transform 
{on} of {s,} under T converges, then T is called 
a method of summation. The limit s of fo, is 
called the sum of {s,} under the method T of 
summation, and (s, is said to be T-summable 
to s. For a given method of summation T,, let 
D(T,) be the set of sequences that are T,- 
summable. If D(T,)= D(T,), then the methods 
T; and T, are called equivalent. If D(T,) = D(T,), 
then we say that T, is weaker than T, and 
T is stronger than T,. If D(T,)¢ D(T;) and 
D(T;)  D(T,), then T, and T, are called mutu- 
ally noncomparable. The following theorems 
on linear transformations of sequences are 
important: 

(1) Kojima-Schur theorem. In order that T 
be semiregular it is necessary and sufficient 
that (i) lim, ,,, a,, exist for each k; (ii) t, = 
2A-o|a,,] exist and {t,} be bounded; and (iii) 
Dm, 22:9, exist. In that case, we have 
3 |lim aale oo 
k=Q "o9 


and 


lim o, = lim y daf 


noo noo k=0 


= (im s Jim p as) 
noo n> k=0 
+ » (im a4) (s — lim s) 
k=0 Mr noo 


In particular, in order that To, converge 
whenever s, 0 it is necessary and sufficient 
that conditions (i) and (ii) be satisfied. In that 
case, 


lim o, — lim > Ang Sy 


n oo noo © p= 


oo 
= Y lima, s, 
CH 


(I. Schur, J. Reine Angew, Math., 151 (1921); T. 
Kojima, Tóhoku Math. J., 12 (1917)). 

(2) Toeplitz's theorem. In order that T 
be regular it is necessary and sufficient that 
(1) lim, ,,, Oe — 0 for each k; (ii); and (iii) 
bm, 32-90, = 1 (O. Toeplitz, Prace Mat.- 
Fiz., 22 (1914)). In particular, (ii) and OC" 
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im, ,. 3 kd, = 1 for each K imply that T is 
regular (Perron's theorem). 

(3) Schur's theorem. In order that T be 
normal it is necessary and sufficient that (i); (ii); 
(iit); and (iv) for any ¢>0 there exist a K >0 
such that 342 &., [a,,| «e for each n. 

(4) In order that the regular triangular trans- 
formation T' be totally regular, it is necessary 
and sufficient that a,, 2 0 except for a finite 
number of k. 


M. Cesaro’s Method of Summation 


We write 


Oli y Ax", 
n=0 


Ae naa " n* l 
n I (a4 1) 


(1—x) *! Y ux"z(1-x) * Y sx" 
n=0 n=0 





my gut 
n=0 

where s, = 27-9 u;. Thus the series 2; u; is as- 
sociated with the sequence [57]. If of = s2/A* 
converges to s as n— oo, then we say that Zu, 
is summable by Cesaro’s method of order « (or 
simply (C, a)-summable) to s and write 22 yu, 
=s (C, a). This method of summation is called 
Cesaro’s method of summation of order o (or 
simply (C, x)-summation). 

It is natural to consider (C, «)-summa- 
tion for oz —1. We say that Du, is (C, —1)- 
summable if Zu, converges and nu, — o(1). 
Here Du, =s (C, 0) means lim, ,,, s, — s, and 
Ère u, =s (C, 1) means s —lim, ,, (sg +5, + 

ES, 4)/n. Generally, we have the following 
results: 

(1) Aj is increasing if «>0 and decreasing if 
07x» —1. A? =1 and 4220 if «> —1. 

(2) s^ = X o Af HE An F, 
Suë, 

(3) (C, a) (xz: 0) is regular, and D(C, «)> 
D(C, B) if «> B» —1. 

(4) If Du, — s (C, a), then u, = o(n?). More- 
over, if Du,=s’ (C, o), then Zi, Liss St 
S (C, x) and È Au, — As (C, à) for any number å. 

(5) If Zus s (C, a) and Xu, —s' (C, f), 
then their tCauchy product $ v, — Sg (C, a + 
p a 1) guod s theorem). Moreover, if 

Ai|sP-g 1 (ui,_,)|/A2 = O(L), then X v,— 
ss’ (C. B) (T. Kojima). Ha, f'  —1, s@?(u,)= 
O(n”), and s/?(u,) = O(n^), then X v, = 
ss’ (C, a’ + ' +2) (G. Doetsch). 

(6) For any integer x 0, in order that Xu, 
=s (C, a) it is necessary and sufficient that 
there exist {v,} such that u, — (n + 1)(v, — v.) 
and 23v, — s (C, 4 — 1) (G. H. Hardy, Proc. 
London Math. Soc., (2) 8 (1910)). This condi- 
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tion is equivalent to (i)- (iii) together: (1) the 
series X2, st? /(k 4- 1)... (k - a) converges to 
the limit 5,; (ii) b, = o(1) as n— oo; and (iii) 
(st !/Az !) - (n- x)T(a)b,,, >S as no oo. 

(7) If Xu, =s (C, a) (a >0), one of the follow- 
ing five conditions is sufficient for the conver- 
gence of >) u, (this is a kind of *Tauberian 
theorem): (1) nu, = o(1y; (ii) t, = Au vu, =o(n); 
(iii) X n^|u,|?*! < oo (pz 0); (iv) nu,» —K (K is 
independent of n); (v) liminf(s,, —5,) 20 as m> 
n oo, m/n— 1 (R. Schmidt’s condition). 

(8) If a’ >a> —1 and Xu,-5s(C,«), then Xu, 
=s (C, a + £) for any e» 0. 

For a given series Y, u,, we write H9 for s,, 
H} for the arithmetic mean of {Hf}, and H. 
for the arithmetic mean of ( H1]. Similarly, we 
can define {HP} for any integer p. If HPs 
as n— oo, then X u, is said to be summable 
by Hólder's method of order p (or (H, p)- 
summable) to s, and we write È u, — s(H, p). 
For any integer p 20, (H, p)-summability is 
equivalent to (C, p)-summability (Knopp- 
Schnee theorem). 


N. Abel’s Method of Summation 


If the radius of convergence of the power series 
Ecour" is 21 and X; u,r"osasr-1,then 
Zu, is said to be summable by Abel’s method 


(or A-summable) to s, and we write È u,, — s (A). 


The transformation matrix is denoted by A, 
and the transformation is called Abel's method 
of summation. 

(1) If 3 u,— s (A), then limsup, ,, |u,|'" « 1. 
(2) If Zus S (Al and Yiu) =s’ (A), then D(u,+ 
u,)s +s (A) and X Au, = As (A) for any constant 
A. Moreover, 245.444 4,7 $—Ug — Uy — ... — Uy 
(A). 3) If Xu, 2 s (A) and Xu, =s' (A), then 
the Cauchy product is 22u, — ss' (A). (4) If Du, 
— s (A) and one of the following five conditions 
is satisfied, then $ u, =s: (i) nu, — o(1); (ii) t, = 
255-1 vu, — o(n); (iii) nu, = O(1); (iv) nu, > — M; 
(v) lim inf(s,, —5,) 20 as m» n— oo and m/n 1. 
These theorems are fTauberian, in the original 
form proved by Tauber (Monatsh. Math., 8 
(1897)). (5) The matrix A is regular, and D(C, a) 
c D(A) for any a> —1.(6) If 22u,— s (A) and 
$,2:0, then Xu, — s (C, 1). Moreover, if o? = 
O(1), then Xu, — s (C, x +£) for x> —1 and 
£20. 





O. Borel's Method of Summation 


If for a given series È u,, 





is convergent for all x, and u(x)/e* s as x 
> oo, then È u, is said to be summable by 
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Borel's exponential method to the sum s, and 
we write > u, —s (B). The transformation thus 
determined is denoted by B and is called 
Borel's method of summation. If 


| u(x)e *dx=s, 
0 
then Xu, is said to be summable by Borel's 
integral method (or 8-summable) to s, and we 
write 2:u,— s (8). Then we have: (1) B is regu- 
lar and D(C, «)< D(B) (a> —1), while D(C, a) 
and D(B) are noncomparable. (2) If the radius 
of convergence of 3 Za? is 21 and Yiu, = 
s (B), then Xu, =s (A). (3) If 22,41 u, s (B) 
(resp. (B)), then Du, =stuyt+u,t+...+u,(B) 
(resp. (8)), but the converse is not always true. 
(4) Su, =s (B) implies |u, |!" =o(n). (5) If Du, = 
s (B) and Eu, =s (B), then X(u, 4 w,) 2 s - s' (B), 
and È 4u, — 4s (B) for any constant 4. The 
same is true for summation (B). (6) If 32u,— s 
(B) and if one of the following two conditions 
is satisfied, then X, u, — s: (i) n u, == 0(1); 
(ii) lim inf(s,, —5,) 2 0 as m» n— oo and (m— 
n) /n>0. (7) if Lu, =s (B) and si loin IL 
then X u, =s (C, a) (x2: 0). (8) If X u,— s (A) and 
u(t)> — Mt lexpt, then £ u,— s (B). (9) If the 
sequences {n,} and {ny} satisfy n,., >My, 
n,./n>1+e(k=1,2,...;€>0), u, D Die US 
ny), and Lu, =s (B), then s, 55 as k oo. 

If Zus s (8) and 


Ip d^u(x) 
o 


dx? 
converges for all 4—0, 1,2, ..., then È u, is 
said to be absolute Borel summable (or |38|- 
summable). Concerning this we have: (1) If 
> |u,| converges, then È u, is |$8|-summable, 
but even if È u, converges, > u, is not always 
|B|-summable. If È u, is |$98|-summable, then 
È u, is B-summable to s. In this case, we 
write Du, =s ([8]). (2) 325.5 u,— s UI) implies 
Macken Un =S— (Uo +U; +... E uy) (181). (3) If 
Xu, =s (B), Ł uj, =s' (B), and if at least one of 
them is |B|-summable, then their Cauchy 
product is 3v, — ss' (|8|). 


e *dx 











P. Euler's Method of Summation 


In the series > u,, if 


k+1 k+1 
1 Sot ) $14... 
k+1 
2-€*0 
sd 


(s, = Èt o uj) converges to s as k— oo, then Lu, 
is said to be summable by Euler's method, and 
we write 3, u, — s (E). The transformation thus 
obtained is called Euler's method of sum- 
mation. A necessary and sufficient condition 
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for Xu, =s (E) is that 3o, — s, where v, = 
gU ME (; 
is regular. We have Lu, =s if E u, — s (E) and 
if one of the following two conditions is satis- 
fied: (i) dn u, = O(1); (ii) lim inf(s,, —s,) > 0 for 
mn-oo,(m— n /no 1. Cesaro and Euler 
summations are noncomparable. As an ex- 
tension of Euler's method, the Euler method 
of summation of pth order is also defined 


(e.g, — [6]). 


Ja. This summation method 


Q. Nórlund's Method of Summation 


For a positive sequence (p,), let P,2 37.9 p, 
oo as n— oo and p,/P,, —0 as n— oo. If 


REG 


converges to s as n— oo, then È u, is said to be 
summable by Nórlund's method of type 1 p,], 
and we write $ u, — s (N, {p,}). The transfor- 
mation thus obtained is also regular and is 
called Nórlund's method of summation. If 

È u„=s (C, 1) and 0< po <p; € ...,then X u, = 
s (N, {p,}). Cesàro’s method is actually a 
special case of this method. 


R. M. Riesz's Method of Summation 


Let {4,} be a sequence with increasing terms 
and tending to +œ as n— oo. If 


R(A, kt) ( * Apu] 
Anat 

converges to s as t oo, then È u, is said to be 
summable by Riesz's method of order k and 
type 4,, and we write È u, — s (R, A, k). The 
transformation thus obtained is regular and is 
called Riesz's method of summation of the kth 
order. [n particular, if 4, —n, then D(R, /,, k) - 
D(C, k). 


S. Riemann's Method of Summation 


converges for h>0 and tends to s as h—0, then 
È u, is said to be (R, k)-summable to s. When 

k =1, this method, often called Lebesgue's 
method of summation, is not regular. When k = 
2, it is ordinarily called Riemann's method of 
summation and is regular. Corresponding to 
these cases, if 
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as h-0, then È u, is called (R,)-summable or 
(R,)-summable to s, respectively. The summa- 
tion method (R,) is not regular, while (R,) is 
regular. If Zu is (R,)-summable, then it is 
also (R,)-summable, but (R, 2) and (R,) are 
noncomparable. 

Other methods of summation were devel- 
oped by G. H. Hardy and J. E. Littlewood, E. 
Le Roy, C. J. de La Vallée Poussin, and others 
(e.g., — [6]). 

For related topics — 121 Dirichlet Series, 
159 Fourter Series, 339 Power Series. 
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380 (X.15) 
Set Functions 


A. General Remarks 


A tfunction whose domain is a ffamily of sets 
is called a set function. Usually we consider 
set functions that take real values or +00. 
For example, if f(x) is a real-valued function 
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defined on a set X, and if we assign to each 
subset A of X values such as sup, f, inf, f, or 
sup, f — inf, f, then we obtain a corresponding 
set function. In particular, a set function whose 
domain is the family of left open intervals in 
R” is called an interval function. To distinguish 
between set functions and ordinary functions 
defined at each point of a set, we call the latter 
point functions. For example, if f(x) is an tinte- 
grable (point) function with R (  R!) as its 
domain and we put F(I)— f} f(x)dx for I= 

(a, b], then we obtain an interval function F 
on R. 


B. Finitely Additive Set Functions 


Let (E) be a real-valued set function defined 
on a ‘finitely additive class B in a space X. If 
® satisfies the finite additivity condition: 


E, E eB, E,NE,=@ imply 


JL, U Ej) - G(E,) + P(E), 


then di EI is called a finitely additive set 
function on 38. For each Ee we denote 
sup{®(A)|A cE, Ae 8j (inffD(A)| AC E, Ae 
$981) by V(®; E) (V(d; E)), the upper (lower) vari- 
ation of ®. Since (Z) 2 0, we have V(®; E) 
0 « V(Q; EL V(d; E) = V(O; E)+|V(®; E)| is 
called the total variation of b on E. When 

we deal with a fixed ®, instead of V(®; E), 
V(®; E), V(®; E) we write simply HIEL V(E), 
V(E). If V(®; E) is bounded, then 6 is said to 
be of bounded variation. If (E) 20 (<0) for 
every Ee Le ECE’ implies G(E) x (E) 
(D(E) 2 (E), then ® is said to be monotone 
increasing (decreasing). Every finitely addi- 
tive set function of bounded variation can be 
represented as the difference of two monotone 
increasing finitely additive set functions. 

Let Jy be a fixed interval in R” and F(I) be 
an interval function defined for left open inter- 
vals I c lọ, where @ is considered as a degener- 
ate left open interval. If, for any two left open 
intervals I,, I, such that I, UJ, is an interval 
and N L= Qj, we have F(U D) S F(Ij) - 
F(L), then we call F(I) an additive interval 
function in J. Specifically, if f(x) is a real- 
valued bounded function on R and D is an 
interval function determined by D(J)= f(b) 
— f(a), where I — [a, bitte, D(I) is the incre- 
ment of f(x)), then D is an additive interval 
function on R, called the increment function 
of f. For a given f the increment function is 
determined uniquely. Conversely, for a given 
D, a function f such that D is its increment 
function is determined uniquely up to an addi- 
tive constant. In this sense an additive interval 
function in R may be identified with the corre- 
sponding point function on R. 

Let *R(1;) be the finitely additive class of all 
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finite unions R of left open intervals in Jp. 
Then any additive interval function F(I) can 
be extended to a finitely additive set function 
F(R) defined on 9R(I,). For the rest of this 
article, it is understood that an additive inter- 
val function means this extended set function. 
If for any ¢>0 there exists a ó >0 such that 
|| « ó implies F(I) « e (where |F| is the vol- 
ume of the interval J), then we say that F is 
continuous. 


C. Completely Additive Set Functions 


Let P(E) be a real-valued set function defined 
on a tcompletely additive class 8 in a space X. 
If ® satisfies the complete additivity condition: 


E,E,€8,  EjYE,-Qj (j¥k) 
imply gd d =}, UEL 

j=l j=l 

then d EI is called a completely additive set 
function (or simply additive set function) on 8. 
In this case the corresponding upper variation 
V(E), lower variation V(E), and total variation 
V(E) are all completely additive set functions, 
and for every Ee% we have ®(E) = V(E) -- V(E) 
(Jordan decomposition). Furthermore, V(E) — 
sup ZC, |O(Ej)|, where the supremum is taken 
over all decompositions of E such that E= 
Ufa E; (Eje 8, E, E, =O, j #k). The com- 
pletely additive nonnegative set functions are 
the same as the finite measures. Hence the 
Jordan decomposition implies that every com- 
pletely additive set function is represented as 
the difference of two finite measures. A com- 
pletely additive set function is also called a 
signed measure. 

Any continuous additive interval function 
of bounded variation can be extended to a 
completely additive set function. The notion 
of additive interval function of bounded 
variation is a generalization of that of function 
of bounded variation (— 166 Functions of 
Bounded Variation). 

Let be a completely additive set func- 
tion and ya finite or o-finite measure, both 
defined on 88. If u(E)=0 implies ®(E)=0, then 
® is said to be absolutely continuous with 
respect to u or -absolutely continuous. Then di 
is -absolutely continuous if and only if for 
any £2 0, there exists a ó 0 such that u(E) <ô 
implies |®(E)| <e. If for given and u there 
exists an EEB such that u(E,) 0 and (E) 

— Q(Ef Eo) for every Ec %, then ® is said to 
be singular with respect Lou or y-singular. 

In a fo-finite measure space (X, B, u), every 
completely additive set function ®(£) defined 
on B can be represented uniquely as the sum 
of a p-absolutely continuous set function and a 
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u-singular set function (Lebesgue decompo- 
sition theorem). Also, (E) is u-absolutely 
continuous if and only if (E) can be repre- 
sented as the indefinite integral f; f(x)du of a 
function f that is integrable on X with respect 
to u (Radon-Nikodym theorem). This function 
f(x) is called the Radon-Nikodym derivative, 
d®/du, of ® with respect to u (— 270 Measure 
Theory L (iii)). 


D. Differentiation of Set Functions 


Let m be the Lebesgue measure in R" and 
E a Lebesgue measurable set. We denote 
sup(m(E)/m(Q)) for all cubes Q such that Ec Q 
by r(E) and call it the parameter of regularity 
of E. If for a sequence of sets {E,} there exists 
an « such that r(E,) » «0, then (E, is called 
a regular sequence. If all the E, contain a point 
P and the tdiameter of E, tends to 0 as n> oo, 
then we say that (E, converges to the point P. 
Let ® be a set function in R”, For a regular 
sequence (E, of closed sets converging to a 
point P, we put /=limsup(®(E,)/m(E,)) and 
define the general upper derivative of b at P to 
be the least upper bound of (for all such se- 
quences (E,), denoted by D®(P). Similarly, the 
general lower derivative D®(P) of ® at P is 
defined to be the greatest lower bound of 
lim inf(®(E,,)/m(E,,)) for all regular sequences 
{E,} of closed sets converging to P. The or- 
dinary upper (lower) derivative, denoted by 
(E)(®(E)), is defined in the same way by taking 
regular sequences of closed intervals instead 
of closed sets. Do, DỌ, o, ® are point func- 
tions derived from ®. Clearly, D®(P)<@(P)< 
(P) D®(P). If DO(P) = D®(P), then we write 
it simply as D®(P). If D®(P) is finite, then we 
call it the general derivative of b at P and say 
that is derivable in the general sense at P. If 
Q(P) — (P), then we write it as (P). If (P) is 
finite, then we call it the ordinary derivative of 
® at P and say that ® is derivable in the ordi- 
nary sense. We have the following theorems: 
(1) A completely additive set function is deriv- 
able in the general sense *almost everywhere 
(Lebesgue). The Radon-Nikodym derivative of 
a set function absolutely continuous with 
respect to the Lebesgue measure is equal 
almost everywhere to the generalized deriva- 
tive of the set function. (2) An additive interval 
function of bounded variation is derivable in 
the ordinary sense almost everywhere (Lebes- 
gue). (3) An additive interval function 6 is 
derivable in the ordinary sense at almost all 
points such that ®(P)< +œ or (P) —oo. 
For the proof of these theorems, Vitali's 
covering theorem is essential: Let A be a given 
set and § a family of measurable sets in Eu- 
clidean space. If for each xe A there is a 
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regular sequence of sets belonging to 5 that 
converges to x, then there exists a finite or 
countable set of disjoint E „e $$ such that 
m(A | Je, E)=0. 
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381 (II.1) 
Sets 


A. Definitions and Symbols 


G. Cantor defined a set as a collection of ob- 
jects of our intuition or thought, within a 
certain realm, taken as a whole. Each object in 
the collection is called an element (or member) 
of the set. The notation ae A (A3a) means 
that a is an element of the set A. In this case 
we say that a is a member of A or a belongs to 
A. The negation of ae A (A3a) is written a£ A 
or ae A (A$a or A5a). The set having no 
element, namely the set A such that ad A for 
every object a, is called the empty set (or null 
set) and is usually denoted by @. Two sets A 
and B are identical, i.e., A = B, if every ele- 
ment of A belongs to B, and vice versa. The 
set containing a, b, c, ... as its elements is 

said to consist of a, b, c, ... and is denoted by 
(a, b, c, ... ). The symbol (x|C(x)] (or (x; C(x)}, 
sometimes E,[C(x)]) denotes the set of all 
objects that have the property C(x). Thus {a} 
is the set whose only element is a, and (a, b] is 
the set with two elements a and b, provided 
that a zb. A set is called a finite set or an 


infinite set according as the number of its 


elements is finite or infinite. 

A set A is a subset of a set B if each element 
of A is an element of B. In this case we also say 
that A is contained in B or that B contains A, 
and we write A c B and B> A. The negation 
of AC B(B2 A)is Ac B (B+ A). For every 
set 4, @ c A. Ac Band Bc C imply Ac C. 
If Ac B and Bc A, then A — B. A is a proper 
subset of B (in symbols: AZ B, B2 A) if Ac 
B and A z B. Some authors use € (2) for c 
(>), and c (>) for Ẹ (2). 
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B. Algebra of Sets 


The union ( join or sum) of sets A and B, 
written A U B, is the set of all elements which 
belong either to A or to B or to both. The 
intersection (meet or product) of sets A and B, 
written AN B, is the set of all elements which 
belong to both A and B. In other words, ve 
AU B if and only if xe A or xe B or both, and 
xe ANBif and only if xe A and xe B. Given 
sets A, B, and C, AUB=BUA, ANB=BNA 
(commutative law); (AU B)JU C= AU(BU C), 
(AN B) C 2 An (Bf C) (associative law); 
AU(BNC)=(AUB)N(AUC), AN(BUC)= 
(AN B)U(A NC) (distributive law); AU(A NM B)= 
A, AN({(AU B)=A (absorption law). 

Two sets A and B are disjoint if AN B= (7. 
In this case the set C= AU Bis said to be the 
disjoint union (or sum) of A and B, and ts 
written sometimes as C= A + B. The set of 
elements of A which are not members of B is 
denoted by A — B, and is called the difference 
of A and B (or relative complement of B in A). 
If A> B, A—B is called the complement (or 
complementary set) of B with respect to A. 

We often consider a theory in which we 
restrict our attention to elements and subsets 
of a certain fixed set Q, and call it the universal 
set of the theory. In geometic terms, Q is also 
called the space or the abstract space, elements 
of Q are called points, and subsets of Q point 
sets. If A is a subset of Q, Q—A is simply 
called the complement of A and is denoted A". 
For AcQ and Bc OQ, A» B and A*c B° are 
equivalent. Furthermore, we have AU A* =Q, 
ANA =Ø, A**— A; and (AN BF =A°U BS, 
(AU By = A°N B* (de Morgan's law). 

The power set of a set X, written *B(X), is 
the set of all subsets of X. A set whose ele- 
ments are sets is often called a family of sets. 

The pair consisting of objects a and b is 
denoted by (a, b). Two pairs (a, b), (c, d) are 
defined to be equal if and only if a— c and 
b — d. A pair (a, b) is called an ordered pair, 
while the set (a, bj is sometimes called an 
unordered pair. Generally the n-tuple 
(a, b, c, ...,d) of n given objects a, b, c, ..., d 
is defined to be ((...((a, b), c), ...), d so that 
(a, b, c, ...,d) 2 (a, b', c', ..., d') if and only if a= 
a’, b — b',...,d—d'. The Cartesian product (or 
direct product) of sets A and B, written A x B, 
is the set of all pairs (a, b) such that ae A and 
beB. Ax B= and only if either A= @ or 
B=; Ax BcCxD if and only if Ac C and 
Bc D, provided that neither A nor B is empty. 
Furthermore, 


(A x B)U(A’ x B)=(AU A') x B, 
(A x BJ (C x D) -(An C) x (BND). 


The subset ((a,a)|ae A} of A x A is called 
the diagonal of A x A and is denoted by A ,. 
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Generally the Cartesian product (or direct 
product) of sets A, B, ..., D, written Ax Bx 
... X D, is defined as the set ((a, b, ..., d)|ae A, 
beB, ..., defi, 


C. Mappings 


If there exists a rule which assigns to each 
element of a set A an element of a set B, this 
rule is said to define a mapping (or simply 
map), function, or transformation from A into 
B. The term transformation is sometimes re- 
stricted to the case where A — B. Usually 
letters f, g, 9, V, ... stand for mappings. The 
expression f: A— B (A B) means that f isa 
function which maps A into B. If f: A B and 
ae A, then f(a) denotes the element of B which 
is assigned to a by f. We call f(a) the image of 
a under f. The notation f:a|»b (or f:a > b) 
is often used to mean f(a) — b (but not in the 
present volumes). The domain of a mapping 
f: A B is the set A, and its range (or codo- 
main), written f(A), is the subset { f(a)|ae A} 
of B. Two functions f and g are equal (f =g) 
if their domains coincide and f(a)= g(a) for 
each a in the common domain. 

For a mapping f: A B and a set Ce P(A), 
f(C) is defined to be the set { f(x)| xe C]. This 
definition induces the mapping from Di Alto 
P(B) which is usually also denoted by f. If 
A E B(A) (i= 1,2), then f(A, U 45) = f(4)U 
f(45) and f(A, Aj) c f(A,) f(A;). The 
inverse image of D € B(B), denoted by ; ^"! (D), 
is defined to be the set {x|xe A, f(x)e D); thus 
the mapping f. ! :$B(B)— 3B(A) is defined. If 
B,e P(B) (i= 1,2), then f^" (B,UB)) — f. '(B)U 
f (By f (B,0B)-f (BO f ' (By 
f (B—B,)2A-—f !(B,) Furthermore, A, c 
f 'of(A) and fof '(B)c B,. 

A mapping g is an extension of a mapping f 
to a set A’ if A’ is the domain of g and contains 
the domain A of f, and if g(a)= f(a) for each a 
in A. In this case f is called a contraction (or 
restriction) of g to A or simply a partial map- 
ping of g, and is denoted by g| 4. A mapping f 
is the constant mapping (or constant function) 
with the value be if f(a) — bg for every a in the 
domain of f. The identity mapping (or identity 
function) on A, often denoted by 14, is the 
mapping with the domain A such that f(a) — a 
for every a in A. Given two mappings f: A—B 
and g: B2 C, the mapping from A to C which 
assigns oi f(a)) to each ae A is called the com- 
posite of f and g and is denoted by go f. If f: 
A2B,g:B5C,and h: CoD,then (hog)o f= 
ho(go f) (associative law for composition of 
mappings). 

A mapping f: A9 B is from A onto B if 
f(A) — B. In this case f is also called a surjection 
(or a surjective mapping). A mapping f: AB 
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is one-to-one (1-1, or injective) if aa’ implies 
f(a) + f(a’) for every pair of elements a and a’ in 
A, that is, if for each b in the range of A, there 
exists only one element a of A such that f(a) — b. 
Such an f is also called an injection. In par- 
ticular, given a subset B of a set A, the injec- 
tion f: B A defined by the condition f(b) — 

b for each be B is called the inclusion map- 

ping (inclusion or canonical injection). A neces- 
sary and sufficient condition for f: A B to be 
a surjection is that go f— ho f imply g =h for 
every pair of mappings g: B>C and h: BC. 
For f: A B to be an injection it is necessary 
and sufficient that f og — f oh imply g =h for 
every pair of mappings g: C A and h: C A. 
A mapping which ts both a surjection and an 
injection is called a bijection (or bijective map- 
ping). If f: A— B is a bijection, then the mapping 
from B to A which assigns to each element b of 
B the unique element a of A such that f(a) 2 b 
is called the inverse mapping (inverse function 
or simply inverse) of f, and is denoted by f ~+. 
We have fof ! -15 and f! o f—1, for every 
bijection f: A B. 

If the domain A of a mapping f: AB is the 
Cartesian product of A, and A,, f(a) - b 
(where a —(a,,a;)) is written as f(a,,a;) — b. 
Given A= A, x A;, B= B, x B,, and f;: A, B, 
(i2 1,2), the mapping f: A— B defined by the 
condition f(a,, a5) —( fi(a4), f2(a2)) is called the 
Cartesian product (or direct product) of the map- 
pings of f, and f}, and is denoted by f, x f,. 

For a mapping f: AB, the subset G = 
{(a, f(a))|ae A} of A x B is called the graph 
of f. The basic properties of the graph G of f 
are: (1) For every ae A there exists a be B such 
that (a, b)e G. (2) (a, b)e G and (a, b')e G imply 
b — b'. Conversely, a subset G of A x B with 
these two properties determines a mapping 
f: A B such that (a, b)e G if and only if f(a) — 
b. All notions concerning a mapping f: A—^B 
can be transferred by means of its graph to 
those concerning a subset of a Cartesian prod- 
uct A x B. 

Given sets A and B, we denote by B^ the set 
of mappings from A to B. If a mapping is 
identified with its graph, B^ is considered to 
be a subset of P(A x B). For X e P(A), the 
mapping cy: A (0, 1} such that cy(x) — 1 if 
xe X and cy(x)=0 if xX is called the charac- 
teristic function (or representing function) of X. 
By assigning to each X € P(A) its characteristic 
function cy € (0, 1}4, we obtain a one-to-one 
correspondence between (A) and {0, 1}4; 
hence P(A) is sometimes denoted by 24. 


D. Families of Sets 


A mapping from a set A to a set A is also 
called a family of elements of A indexed by A. 
A is its index set (or indexing set). In this case, 
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the image f(A) of AeA is denoted by a,, and 
the mapping itself is denoted by Join ({a,} 
(4e A), or simply {a,}). In particular, if the set 
A is the power set of a set, the family {a,},., is 
called a family of sets indexed by A, or simply 
a family of sets. (Moreover, if A is chosen to be 
a subset of the power set P(X) of a set X and 
f to be the identity mapping on A, then the 
family of sets resulting from f can be identified 
with the set of subsets A itself.) 

The union | J;.4 A, of a family of sets 
(Alien is the set of all elements a such that 
ae A, for at least one A in A. Their intersection 
[zea A; is the set of all elements a such that 
ae A for all Ain A. A family of sets {A,} 2, is 
mutually disjoint if 2# u implies A, A, = Ø. 
In this case A —| J;.4 A; is called the disjoint 
union (or direct sum) of the sets of the family, 
and (A,),., is called a partition (or decompo- 
sition) of A. For families of sets, the following 
hold: (J Azu - UU, 45. (45s (laa) 
(AeA, ue M,) (associative law); (( Jie, Aa) 
(ten B,) = amen x M( A; B,), (laea A,)U 
(Quem Ba) = ha. ien, wt: LU B,) (distributive 
law); (ier Ayr — arm Aj, (() zen Ay) = 
LI 4$ (de Morgan’s law). 

A family {A,},¢, of sets is a covering of a set 
A, or covers A, if Ac | J; Aj. 

Given f: X — Y, {A,},-, and (B,j;.4 (where 
A,c X and B;c Y), then f(| re, Ay) = 
aca f (A3), TU asa Ac (eA f (Ass and 
f "(Ua B,)= reat 1 (B;). Pix Bj)- 
(Jaca f (B). 


E. Direct Sum and Direct Product of Families 
of Sets 


Given a family {A,},., of sets indexed by A, a 
set S, and a family of injections {i,:A,—>S} ca, 
then the pair (S, (i,3;.4) is called the direct sum 
of {Ayca If {i,(A,)} rea is a partition of S. In 
this case, S is written Za A, (or 2, A, or 
II; A,) Each A, is called a direct summand of 
S, and each i, is called a canonical injection. 
The Cartesian product (or direct product) 
IL A; (IT, A;) of {Ag} aca (where A; c X) is 
the set of all mappings from A to X such that 
f(4)€ A; for every 4e A. The sets A, are the 
direct factors of IT, A;. Each element f of 
IL A; is denoted by (x;);.4 or (..., x;, ...) 
(where x; = f(A)). The element x; is the Ath 
component (or coordinate) of f. The mapping 
pri: IL A; A, which assigns x, to each 
{xi hea € Tea 41 is called the projection of 
II;.4 A; onto its Ath component. If A —(1,2], 
IL. A, can be identified with A, x A,. 


F. Set Theory 


It was G. tCantor who introduced the concept 
of the set as an object of mathematical study. 
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Cantor stated: “A set is a collection of definite, 
well-distinguished objects of our intuition or 
thought. These objects are called the elements 
of the set” (G. Cantor, Math. Ann., 46 (1895)). 
Cantor introduced the notions of tcardinal 
number and tordinal number and developed 
what is now known as set theory. He proved 
that the cardinal number of the set of tran- 
scendental numbers is greater than that of 
algebraic numbers, and that all Euclidean 
spaces have the same cardinal number regard- 
less of their dimension. He stated the fcon- 
tinuum hypothesis and also conjectured the 
twell-ordering theorem (G. Cantor, Math. 
Ann., 21 (1883)), which was proved by E. Zer- 
melo [2]. In this proof Zermelo stated the 
*axiom of choice explicitly for the first time, 
and used it in an essential way. 

Meanwhile it was pointed out that Cantor's 
naive set concept leads to various logical 
tparadoxes (— 319 Paradoxes). Since the set 
concept plays a fundamental role in every 
branch of mathematics, the discovery of the 
paradoxes had a serious impact upon math- 
ematics, and led to a systematic investigation 
of the ffoundations of mathematics. In the 
course of attempts to avoid paradoxes, set 
theory was reconstructed as taxiomatic set 
theory (- 33 Axiomatic Set Theory), in which 
Cantor's theory of cardinal numbers and 
ordinal numbers was restored. Also, the theory 
of the algebra of sets, which forms a basis for 
various branches of mathematics, was re- 
constructed. Axiomatic set theory is consid- 
ered to be free from paradoxes. 


G. Classes 


A set in the naive sense is a collection 
{x|C(x)} of all x which satisfy a certain con- 
dition C(x). The only principle for generating 
sets in naive set theory is the axiom of compre- 
hension, which asserts the existence of the set 
{x|C(x)} for any condition C(x). However, this 
principle leads to paradoxes if the notion of an 
arbitrary set is considered to be well defined; 
for example, the *Russell paradox is caused by 
the set {x|x¢x}. This situation necessitates 
some restrictions on the axiom of comprehen- 
sion. The simplest way to overcome the para- 
doxes is to adopt Zermelo's axiom of subsets: 
Given a set M and a condition C(x), there 
exists a set (x | xe M, C(x)]. But this axiom 
cannot produce any sets other than subsets of 
sets whose existence is preassumed. Hence 
further generating principles of sets had to be 
introduced. The following axioms are usually 
chosen as generating principles. 

Axiom of pairing: For any two objects (pos- 
sibly sets) a and b, there exists a set (a, b]. 
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Axiom of power set: Given a set A, its power 
set P(A) exists. 

Axiom of union: For any familv of sets the 
union exists. 

Axiom of substitution (or replacement): For 
any set A and any mapping f from A, there 
exists a set of all images f(x) with xe A. 

In ordinary theories of mathematics the set 
of natural numbers, the set of real numbers, 
etc., are assumed to exist, in addition to sets 
generated by the axioms in this section. In 
pure set theory the axiom of infinity is needed 
to secure the existence of infinite sets. 

The concept of the “set” {x|x¢x} does not 
automatically lead to Russell's paradox. The 
trouble arises when this "set" is regarded as a 
member of a collection represented by x. This 
leads to a narrower concept of sets. Consider a 
fixed collection V consisting of sets in the naive 
sense and closed under the set-theoretic oper- 
ations mentioned in the axioms. Call a mem- 
ber of V a set in the narrow sense. Then set 
theory becomes free from the known para- 
doxes if the qualification for being a set is 
restricted in this narrow sense. When sets in 
the narrow sense are called simply sets, sets in 
the naive sense are called classes. The object 
(x|xéx] (where x ranges over sets in the nar- 
row sense) is a class which is not a set. Those 
classes which are not sets are called proper 
classes; for example, the class V of all sets and 
the class of all ordinal numbers are both pro- 
per classes. For classes, unrestricted use of the 
comprehension axiom again leads to para- 
doxes, but other set-theoretic operations are 
justifiably applicable to classes. 

The notion of classes was first introduced 
in connection with the construction of an 
axiomatic set theory. The term class was used 
originally to denote certain subclasses of the 
class V of all sets. In these volumes the term 
set is mostly used to mean a set in the naive 
sense, and most of the notions defined for sets 
are applicable to classes. 
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A. General Remarks 


In 1968 K. Borsuk introduced the notion 

of shape as a modification of the notion of 
thomotopy type. His idea was to take into 
account the global properties of topological 
spaces and neglect the local ones. It is a classi- 
fication of spaces that is coarser than the 
homotopy type but that coincides with it on 
tANR-spaces. 

Let 2 be the category whose objects are all 
tpolyhedra and whose morphisms are homo- 
topy classes of continuous mappings between 
them. For spaces X and Y, denote the set of all 
thomotopy classes of continuous mappings of 
X to Y by [X, Y] and the homotopy class of a 
mapping f by [f]. For a space X, let My be 
the functor from 2 to the tcategory of sets 
and functions that assigns to a polyhedron P 
the set Z7,(P) - [X, P]. A morphism o P5 Q 
of 2 induces the function o, IX. P] [X, Q] 
defined by o,([£/]) 2o: [f/1for Lf]: X9 P. A 
tnatural transformation from Hy to Hy is a 
shape morphism from X to Y. A continuous 
mapping f: X — Y defines the shape morphism 
f* of X to Y as follows: For [g]: YP in 
IT,(P), the composition To f] is an element of 
IT,(P). The correspondence: [g]—[g: f] de- 
fines a natural transformation from 77, to Hy 
and hence determines the shape morphism 
f* :X 5 Y. The identity mapping 1, on X 
defines the identity shape morphism 1% on 
X. Given spaces X and Y, X shape dominates 
Y if there are shape morphisms č: Y> X and 
9: X 2 Y such that &7=1¥, and we write 
Sh(X) x Sh(Y). If, in addition, në = 17, then 
X and Y are of the same shape, and we write 
Sh(X)=Sh(Y). A shape category 4 is the 
category whose objects are all topological 
spaces and whose morphisms are shape mor- 
phisms between them. If we replace topo- 
logical spaces by pointed ones, the pointed 
shape category is obtained. In what follows, 
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for simplicity, we assume that all spaces are 
metrizable and the mappings are continous. 
General references are Borsuk [1], J. Dydak 
and J. Segal [3], R. H. Fox [4], S. Mardešić 
[7]. 


B. Chapman’s Complement Theorem 


Let X be a compactum. A closed set A of X is 
a Z-set in X if for any £» 0 there is a mapping 
f: X 5 X —A such that d(x, f(x)) «e for xe X, 
where d is a metric on X. The Hilbert cube Q is 
the countable product [T;2, /;, where I; is the 
closed interval TO. 1]. The subset s = [J£ I? 
(1? =(0, 1)) is called the pseudointerior of Q. The 
following facts are known: (1) If a compact 
metric space X is contained in s or Q— s, then 
X is a Z-set in Q. (2) For any continuous 
mapping f of a compact metric space X into 
Q, there exists an embedding g of X into Q 
such that g is arbitrarily close to f and the 
image g(X) in Q is a Z-set. The complement 
theorem (T. A. Chapman [2]) states: Let X 
and Y be Z-sets in Q. Then Sh(X) - Sh(Y) iff 
Q — X and Q — Y are homeomorphic. 


C. FAR, FANR, Movability, and Shape 
Group: Shape Invariants 


A closed set A of a compactum X is a funda- 
mental retract of X if there is a shape mor- 
phism r: X >A such that r-i* =1%, where i is 
the inclusion of A into X. A compactum X is 
a fundamental absolute retract (FAR) (resp. 
fundamental absolute neighborhood retract 
(FANR)) if for any compactum Y containing 
X, X is a fundamental retract of Y (resp. of 
some closed neighborhood of X in Y). A com- 
pactum X is movable if for any embedding 

X <Q and for any neighborhood U of X in Q 
there is a neighborhood V of X satisfying the 
following condition: For any mapping f of a 
compactum Y to V and for any neighborhood 
W of X, there is a homotopy H:Y x I29U 
such that H(y, 0) 2 f(y) and H(y, l)e W for 

ye Y. In this definition, if Y is replaced by a 
compactum with dimension <k, then X is said 
to be k-movable. Pointed FAR, FANR, mova- 
bility, and k-movability are defined similarly 
in the pointed shape category. The follow- 

ing facts are known (Borsuk [1], Dydak and 
Segal [3], J. Keeslings [5], J. Krasinkiewicz 
[8]). A compactum X is a FAR if and only 

if X is a pointed FAR iff Sh(X)=Sh(point), i.e., 
X has the same shape as a one-point space. A 
pointed compactum (X, x) is a pointed FANR 
iff Sh(X, x) <Sh(K, k) for some pointed poly- 
hedron (K, k). An FANR is movable. À com- 
pact connected Abelian topological group is 
movable if and only if it is locally connected. 
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A continuous image of a pointed 1-movable 
compactum is pointed 1-movable. It is un- 
known whether (i) an FANR is a pointed 
FANR and (ii) movability means pointed 
movability. For a pointed compactum (X, x), 
let {(K,,k,)|i=1,2,...} be a countable tin- 
verse system consisting of pointed finite poly- 
hedra whose limit is (X, x). The limit group 
lim ,(K;,, kj) is the kth shape group of (X, x), 
where z,(K, k) is the kth homotopy group of 
(K, k). It is known that the shape groups for 
movable compacta behave like homotopy 
groups for ANR. A property P of spaces is 

a shape invariant if whenever X has P and 
Sh(X) 2 Sh(Y), then Y has P. FAR, FANR, 
movability, k-movability, and shape groups 
are shape invariants. 


D. CE Mappings 


A mapping f of a space X onto a space Y is 

a cell-like (CE) mapping if it is proper and 
Sh(f !(y)) - Sh(point) for each point y of Y. It 
is known (R. B. Sher [9], Y. Kodama [6]) that 
if there is a CE mapping of X to Y with finite 
dimension, then Sh(X) - Sh(Y). Here the finite- 
dimensionality of Y is essential. A Q-manifold 
is a space, each point of which has a closed 
neighborhood homeomorphic to Q. The fol- 
lowing are known (Chapman [2], J. E. West 
[10]): (1) If f is a CE mapping of a Q-manifold 
M to an ANR X, then the mapping g:M x Q^ 
X xQ defined by g(m, x) -( f (m), x) for (m, x)e 
M xQ is approximated by homeomorphisms. 
Ás a consequence, if X is a locally compact 
ANR, then X x Q is a Q-manifold. (2) Every 
compact ANR is a CE image of a compact Q- 
manifold. (3) Every compact ANR has the 
same homotopy type as that of a compact 
polyhedron. The following problem raised by 
R. H. Bing is open: Is a CE image of a finite- 
dimensional compactum finite-dimensional? 
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A. Presheaves 


Let X be a ttopological space. Suppose that 
the following conditions are satisfied: (1) There 
exists an (additive) Abelian group F(U) for 
each open set U of X, and ¥(@)= {0}; and 
(ii) there exists a homomorphism ryy: (V) 
F(U) for each pair U c V, such that ryy=1 
(identity) and ry, zs Pur Oryw for UC VcW. 
We call F, consisting of a family 1. #(U)} of 
Abelian groups and a family of mappings 
{ryy}, a presheaf (of Abelian groups) on X. If 
ae (V) and U cV, we write ro,(a) 2 a| U and 
call it the restriction of a to U. A homomor- 
phism o between two presheaves F and 4 on X 
is a family (9(U)) of group homomorphisms 
q(U): F(U) > 2(U) satisfying ry, o o(V)— 
e(U)oryy whenever U c V. The presheaves 
on X and their homomorphisms form a 
tcategory. 


B. Axioms for Sheaves 


A presheaf 4 is called a sheaf (of Abelian 
groups) if it satisfies the following condition: If 
U is open in X and (U,);., is an topen covering 
of U, and if for each ie I an element s; of F(U) 
is given such that s;| U; U;— s;| U;N U; for all i 
and j, then there exists a unique se F(U) such 
that s| U; — s; for all i. By definition, a homo- 
morphism between two sheaves is a homomor- 
phism of the presheaves. The sheaves on X 
also form a category. 

Let ¥ be a presheaf, x a point of X, and 9t, 
the tdirected set of open neighborhoods of x, 
with the order opposite to that of inclusion. 
Then {F (U)|U €91,] is an inductive system of 
groups. The finductive limit lim, F(U) of 
groups (4 (U)! is denoted by F, and called 
the stalk of F over x. The image of se F(U) in 
F is called the germ of s at x and is written s,. 
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A homomorphism 9: ¥ —4 of presheaves 


induces a homomorphism q,: £,— 4. of stalks. 


C. Sheaf Spaces 


We introduce a topology on the tdirect sum 
SZ ss lx ZS, in the following way: For each 
open set U of X and each se F(U), consider 
the set My, s= (s,| xe U) of the germs defined 
by s at the points of U, and take the set of all 
such My , as a tbase of open sets of the to- 
pology. If p: ¥’—X is the mapping that maps 
the points of F, to x, then p is continuous, and 
each p !(x) (= £,) has the structure of an 
Abelian group. Moreover, the following con- 
ditions are satisfied: (i) p is a tlocal homeo- 
morphism, and (ii) the group operations on 

p !(x)are continuous in the sense that (a, b) 5 
a+b is a continuous mapping from the 

*fiber product F’ x, F’ De, the subspace 

{(a, b)| p(a) = p(b)} of the product space F’ x 
4 ')to F’ and a-» —ais a continuous map- 
ping from F” to itself. In general, a topological 
space F’ with a structure satisfying these 
conditions is called a sheaf space over X. _ 

When F’ is a sheaf space, a continuous 
mapping s from a subspace A of X to F’ such 
that pos=1, is called a section of F’ over 
A. The set of sections over A, denoted by 
I'(A, F’), is an Abelian group in the obvious 
way. If we associate I'(U, F’) with each open 
set U and define ry, by the restriction of sec- 
tions (rp, (s) 2 s| U), then we get a sheaf F” on 
X. If we start from a presheaf F and get F” 
via F’, the correspondence FF" is a tcovar- 
iant functor from the category of presheaves 
to the category of sheaves, and F” is called the 
sheaf associated with the presheaf F. If 7 isa 
sheaf, we can prove ¥”= 4 . Conversely, if we 
start from a sheaf space ¥’ and construct the 
sheaf F” and then the sheaf space F”, then 
F” is canonically isomorphic to F”. Since we 
can identify a sheaf and the corresponding 
sheaf space, both are usually denoted by the 
same letter. In particular, when F is a sheaf, 
4 (U) is usually written T (U, F). 

Given a section sel (X, F) of a sheaf, the 
points xe X for which s, 40 in F, from a 
closed set (the sheaf space F is not necessarily 
Hausdorff even if X is so). This set is called the 
support of s and is denoted by supps. 

In the theory of this and the previous two 
sections, we can replace Abelian groups by 
groups, rings, etc. Then F(U) is a group or 
ring accordingly, and F(Z) the group consist- 
ing of the identity element or the ring consist- 
ing of the zero element, respectively. We thus 
obtain the theories of sheaves of groups, 
sheaves of rings, etc. In general, a presheaf F 
on X with values in a category € is a tcon- 
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travariant functor from the category of open 
sets of X to €, and a homomorphism between 
presheaves F and 4 on X is a tnatural trans- 
formation between the functors F and 4. 
The presheaves (sheaves) of Abelian groups 
on a space X form an tAbelian category, de- 
noted by 2* (¢*). For a homomorphism f: 
4 — of presheaves, the image, coimage, 
kernel, and cokernel of f in 287 are given by 


(Im f)(U) — Im f(U), 

(Coim f)(U) — Coim f(U), 
(Ker f (U) 2 Ker f(U), 
(Coker f)(U)= Coker f(U). 


When F and 4 are sheaves, the kernel of f in 
€* coincides with the kernel in 2X, while the 
image and cokernel of f in €* are the as- 
sociated sheaves of the image and the cokernel 
in 2X, respectively. Thus, f: —4 induces 

f FG, at each xe X, (Ker f), 2 Ker f,, 

(Im f), — Im f,, (Coker f), = Coker fy, and a 
sequence of sheaves 05.7 4 2f 0 is exact 
if and only if 022, 59,5 x, 50 is exact at 
each xe X. 


D. Examples 


(1) Let G be an Abelian group (or some other 
talgebraic system) with tdiscrete topology. 
The Cartesian product X x G gives rise to a 
sheaf on X, called a constant sheaf (or trivial 
sheaf ). 

(2) Let X be a topological space and Y be a 
topological Abelian group (e.g., the real or 
complex numbers). We obtain a sheaf ¥ on X 
by putting F (U)— the set of all continuous 
mappings U — Y and r,, =the natural restric- 
tion. The stalk over xe X is the set of germs at 
x of continuous functions into Y. This sheaf is 
called the sheaf of germs of continuous func- 
tions with values in Y. 

(3) When X is an tanalytic manifold and Y is 
a commutative tLie group, we define the sheaf 
of germs of analytic mappings with values in Y 
in the same way. If Y is the complex number 
field C, this sheaf is the sheaf © of germs of 
analytic (or holomorphic) functions. A fcon- 
nected component of the sheaf space ( can be 
identified with the tanalytic function deter- 
mined by the function element corresponding 
to a point on that component. The sheaf of 
germs of functions of class C’ on a C^-manifold 
(r & s) is similarly defined. 

(4) Given a tvector bundle B over a topolog- 
ical space X, we define a sheaf on X by F(U) 
=I (U) (=the module of sections of B over U) 
and ry, =the natural restriction. Here the stalk 
over xe X consists of the germs at x of sections 
of B, and is called the sheaf of germs of sec- 
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tions of the vector bundle B. We have similar 
definition for the sheaf of germs of differenti- 
able (analytic) sections when X ts a 'differenti- 
able (complex) manifold. The case where B is a 
*tensor bundle (e.g., the *cotangent bundle 
X*(X))1s important. The sheaf 9I'( X) of germs 
of C*-sections of the r-fold *exterior power of 
X*(X)is called the sheaf of germs of differen- 
tial forms of degree r (0 € r x dim X). 


E. Sheaf Cohomology 


The category @* of sheaves of Abelian groups 
on X has sufficiently many *injective objects. A 
sheaf F with the property that px T (X, F) 
>I (U, F) is surjective for any open set U is 
said to be flabby (or scattered). An injective 
sheaf is flabby. 

Fix a nonempty family ® of closed subsets 
of X satisfying the following two conditions: (1) 
A, Be = AUBe 6; (i1) any closed set con- 
tained in an element of ® belongs to ®. Put- 
ting I5(7)— (s|se (X, F), suppse®} for 
each ¥ € €*, we obtain a ‘left-exact covariant 
functor I3 from €* to the category (Ab) of 
Abelian groups. Therefore, by the general 
theory of homological algebra, we can define 
the fright derived functors RT}: € * (Ab) 

(q —0,1,2,...). We put R?T$4(27)— H&(X, F) 
and call the H$(X, F) (q—0, 1, ...) the coho- 
mology groups with coefficient sheaf 7 and 
family of supports ® (— 200 Homological 
Algebra I). When € is the family of all closed 
subsets of X, we write H*?(X,.7 ) instead of 
H(X, F). 

Thus the cohomology group H$(X, F) 
is the gth cohomology of the complex 
Te(2) S57, (2) 4 T4(92)5... induced by an 
tinjective resolution 02.7 5 99 5 9! 5, 
of the sheaf F HSN. Zi Kerd?/Imd*^! 
(q250,1,...;d 1 20) 

H$(X, £ ) - YS (.7), and from an exact se- 
quence of sheaves 05 4 >G 5 H 0 we get an 
exact sequence 02 H§(X, 7) HENX. 4)» 
H9(X,3*)- Hl(X, F)> HAX, 4)  Hy(X, A) 
>HZ(X, F )>... 

Similarly, the coliomolauy groups Hà(X, F) 
can also be calculated with an exact sequence 
02.7 — 2°-+ Q! —...., where each ui is as- 
sumed to be I,-acyclic (i.e, H3(X, 2')=0 for 
q » 0) instead of injective. The flabby sheaves, 
for instance, are I3-acyclic, so we can compute 
H4(X,F) by a flabby resolution of F (R. 
Godement). For example, let X be an n- 
dimensional tparacompact C*-manifold and 
4 —R. Then OS RSW (X) 530! (X)5... is 
exact, where W(X) is the *sheaf of germs of 
C -differential forms of degree q, d? is tex- 
terior differentiation (Poincaré's theorem), and 
we have H?(X,31?) 20 for p» O0. Therefore 
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H*(X, R) is the qth cohomology of the com- 
plex 0^ D*(X) S D! (X)55..., where D'(X) 

=T (X, 9I(X)) 2 the group of C*-differential 
forms of degree i on X. This proves the de 
Rham theorem, which says that the tde Rham 
cohomology group is isomorphic to the *(sin- 
gular) cohomology group of X with real coeffi- 
cients (— 105 Differentiable Manifolds R). For 
a sheaf ¥ of noncommutative groups, we can 
define the first cohomology H' (X, F) [2]. 


F. The Cech Cohomology Group 


Let 11 — (U;j be an open covering of X, and 
write U;U;= U;;, etc. Put 


ip 


CHAR TT TQ, A 


p=0, 1, 2,.... 


An element af C?(F) is called a cochain of 
degree p. Define d: C"(4) C?*! (F) by 

(df yi, Au =r- y (d bes | Ui citys 
and denote the gth cohomology of the com- 
plex (C?(.7), d) thus obtained by HIU, F). 
When an open covering $8 is a refinement 

of U, there is a canonical homomorphism 
HU, F)> HB, F). So we can take the 
inductive limit of the groups HIU, Z) with 
respect to the refinement of open coverings. 
This limit group is denoted by H4(X,#) and is 
called the Cech cohomology group with coeffi- 
cient sheaf F. It coincides with H4(X, F) for 
q<1, and if X is paracompact, for all q. 


G. Relation to Continuous Mappings 


Let X and Y be topological spaces and f: X 

> Y be a continuous mapping. If 7 is a sheaf 
on Y, the fiber product X x ,.Z (where F is 
viewed as a sheaf space over Y) is a sheaf on 
X. It is denoted by f*(F) or f! (4) and is 
called the inverse image of Z. The correspon- 
dence ZZ — f *( 7) is an exact functor from e" 
to @*. Next, let Y be a sheaf on X. Associating 
I(f *(U),4) with each open set U of Y, we 
obtain a sheaf on Y, which we denote by f, (4) 
and call the direct image of 4. The corre- 
spondence f, is a left-exact functor 6* > @”, 
and we can consider its right derived functors 
R*f,. The sheaf R?f, (4) is the sheaf associated 
with the presheaf that associates H*(f !(U), 2) 
with each open set U. 

A homomorphism y from ¥ to f,(4) is also 
called an f-homomorphism from ¥ to 4. To 
give such a y is equivalent to giving a family 
of homomorphisms of the stalks W,: 4, fx 
(xe X) satisfying the continuity condition: For 
any open set U of Y and any section sef (U, F) 
over U, the E. y from f (U) to Y 
defined by o(x) 2 V (s( f(x)) is continuous. 

The id d f* and f, are related by 
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Hom(f *(7),4)z Hom(F, f,(Y)). The Leray 
tspectral sequence 


E?" — HY, R*f,(9)) > A(X, 9) 


exists and connects the cohomologies of X and 
of Y. 


H. Ringed Spaces 


Let X be a topological space and © be a sheaf 
on X of commutative rings with unity element 
such that ©, # {0} for any xe X. Then the pair 
(X, €) is called a ringed space, and © is called 
its structure sheaf. A morphism (X, 6)5(X', 0’) 
is by definition a pair ( f, 0) consisting of a 
continuous mapping f: X — X' and an f- 
homomorphism 0: 0—0. When each 0, is a 
tlocal ring, (X, ©) is called a local ringed space. 
A morphism of local ringed spaces is defined 
to be a pair (f,0):(X, 6) —(X', 0") as before, 
satisfying the additional condition that 0 is 
local Ge, 05: Gun — €, maps the maximal ideal 
into the maximal ideal for each xe X). These 
concepts are important in algebraic geometry 
and the theory of functions of several complex 
variables. 


I. Direct Products and Tensor Products 


Let F, (AeA) be sheaves of Abelian groups on 
a topological space X. The sheaf F on X 
defined by F(U)=T]1, ¥,(U) and ryy - ILir5 
is denoted by F =[], F, and called the direct 
product of sheaves 12.) For each xe X there 
is a natural mapping ¥,—-]1,(F,),, which is in 
general neither injective nor surjective. When 
A is a finite set, T Z; is also written F =F, 
+... +F, and is called the direct sum of the 
sheaves. The inductive limit ¥ —indlim F, of 
an inductive system of sheaves on X also 
exists, and 7, — indlim Z, y. 

Let (X, ©) be a ringed space. A sheaf of 
Abelian groups 7 on X is called a sheaf of 0- 
modules (or simply an 0-module) if F (U) is an 
O(U)-module for each U and ry,: F (V) >F (U) 
is a module homomorphism compatible with 
O(V)—-O(U) for each U c V. Then F, is an O- 
module for each xe X. For a fixed (X, €), the 
-modules form an Abelian category. When 
F and 4 are ©-modules, the tensor product 
JE — Ql of F and 4 as sheaves over d is 
defined as follows: Define a presheaf by U^ 
4 (U) & «uj; (U) and ry, 2rg, & roy, and let 
H be the associated sheaf of this presheaf. 
Then we have 3€, — F, ® DC 

The notion of coherent sheaves is important 
in the theory of modules (— 16 Algebraic 
Varieties E). 
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J. History 


About 1945, J. Leray established the theory of 
sheaf coefficient cohomology groups (in a form 
slightly different from that in Sections E and 
F) and the theory of spectral sequences to 
study the relation between the local properties 
of a continuous mapping and the global coho- 
mologies. In the theory of functions of several 
complex variables, K. Oka conceived the idea 
of "ideals of indefinite domain." These two 
ideas were unified by H. Cartan into the pre- 
sent form of sheaf theory. As a link between 
local properties and global properties, sheaf 
theory has been applied in many branches 

of mathematics (— 16 Algebraic Varieties; 

21 Analytic Functions of Several Complex 
Variables; 23 Analytic Spaces; 72 Complex 
Manifolds). 
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A. Siegel Domains 


Let D be a bounded domain in C” and G,(D) 
the full tholomorphic automorphism group of 
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D, which is a *Lie transformation group with 
respect to the tcompact-open topology. If 

G, (D) acts transitively on D, then D is called a 
thomogeneous bounded domain. The study of 
homogeneous bounded domains was initiated 
systematically by E. Cartan in 1936, while the 
notion of Siegel domains, which was intro- 
duced by I. I. Pyatetskii-Shapiro, has made 
remarkable contributions to the study of 
homogeneous bounded domains. 

Let V be a convex domain in an n- 
dimensional real vector space R. V is called a 
regular cone if for every xe V and 4-0, Axe V 
and if V contains no entirely straight lines. Let 
W be a complex vector space. A mapping F: 
Wx W RC (the tcomplexification of R) is a V- 
Hermitian form if the following conditions are 
satisfied: (Fi) F (u, v) is C-linear in u, (Fii) F (u, v) 
= F(v, u), where the bar denotes the tconjuga- 
tion with respect to R, (Fiii) F(u, u)e V (the 


closure of V), and (Fiv) F(u, u) 2 0 implies u=0. 


Given a regular cone Vc R and a V-Hermitian 
form. E on W, one can define a Siegel domain 
D(V, F) (of the second kind) by putting D(V, F) 
z (x iy u)e REx W|y— F(u,u)e V), which is 
holomorphically equivalent to a bounded 
domain in RC x W. When W —(0), D(V, F) is 
reduced to D(V)2 (x ipe RC| yeV), which 

is called a Siegel domain of the first kind. A 
mapping L:W x W RC is a nondegenerate 
semi-Hermitian form if L can be written as L 
=L,+L,, where L, and L, are RC-valued 
functions satisfying the conditions: (Li) L, 
satisfies (Fi) and (Fii); (Lii) L, is a symmetric 
C-bilinear form; (Liii) L(u, v)=0 for all ue W 
implies v 20. Let B be a bounded domain in a 
complex vector space X and L, (pe B) an RS- 
valued nondegenerate semi-Hermitian form on 
W depending differentiably on pe B. Consider 
a domain D(V, L, B) in RE x Wx X defined by 
putting D(V, L, B)={(x +iy,u, ppe RE x Wx X| 
y — ReL,(u, u)e V, pe Bj. The domain D(V, L, B) 
is called a Siegel domain of the third kind over 
B if it is holomorphically equivalent to a 
bounded domain. D(V, L, B) is a fiber space 
over B. 

By the affine automorphism group G, of a 
Siegel domain D(V, F)c RC x W we mean the 
group consisting of all elements in the complex 
affine transformation group of R€ x W leaving 
D(V, F) stable. The full holomorphic automor- 
phism group G, of D(V, F) contains G, asa 
closed subgroup. If G, acts transitively on 
D(V, F), then D(V, F) is said to be homogene- 
ous. A homogeneous Siegel domain is 
necessarily affinely homogeneous, i.e., G, acts 
transitively on D(V, F) [2]. The 'Bergman 
metric of D(V, F} which is a G,-invariant *Káh- 
ler metric, is *complete [3], and so D(V, F) is a 
*domain of holomorphy. 
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Examples. H (n, R) denotes the vector space 
of all real symmetric matrices of degree n, and 
H * (n, R) the regular cone consisting of all 
positive definite matrices in H (n, R). 

(1) The Siegel domain of the first kind 
D(H*(n,R))={X +iY|Xe€H(n, R), Ye 
H*(n,R)} is called the tSiegel upper half- 
plane, which is holomorphically equivalent to 
the classical tsymmetric domain of type III. 

(ii) Let u, ve C and F(u, v) be the 2 x 2 diag- 
onal matrix diag(uv, 0), which is an H ' (2, R)- 
Hermitian form on C. The resulting Siegel 
domain is 


D(H * 2; R), F) 
ps 


Imz, 


ZEE 





Imz, 
Imz, 
eH*(Q, m. 


(iti) Let B= (teC||t| « 1j, and let u, ve C. 
Put L,(u,v) 2 (1 — |t|?) ! (u0 + tuv). Then L,(u, v) 
is a nondegenerate semi-Hermitian form, and 
we have D(H * Q, R), L, B) 2 ((z, u, t)e C? | Imz 
—(1—1t2) ! Re(|ul? +tu?)>0, |t| « 1), which 
is a Siegel domain of the third kind and is 
holomorphically equivalent to the Siegel upper 
half-plane of dimension 3. 

The domains in (i) and (ii) are both affinely 
homogeneous; the latter was originally found 
by Pyateteskii-Shapiro in 1959 [1] and pro- 
vides the least-dimensional example of non- 
symmetric homogeneous bounded domains, 
which answered affirmatively Cartan's con- 
jecture (1936): Are there non-fsymmetric 
homogeneous bounded domains in CH (n z A" 


B. Infinitesimal Automorphisms of Siegel 
Domains 


For a Siegel domain D(V, F)c R€ x W, the Lie 
algebra 9, of G, can be identified with the Lie 
algebra of infinitesimal automorphisms, i.e., all 
complete holomorphic vector fields on D(V, F). 
Let G(V) be the group consisting of all the 
linear automorphisms of R leaving V stable. 
Let us fix a base in R, and let (z,,Z3,...,Z,) be 
the complex linear coordinate system in R* 
corresponding to it. Choose a complex linear 
coordinate system (u,,u5, ..., Um) in W. We 
write F(u, v) as F(u, v)=(F; (u, v), ..., F,(u, v)). 
Consider the following two vector fields in the 
Lie algebra g, of G,: 


6 0 . 6 
Fins t Dates» I= Yalta 


and put g2 = {X ega | [E, X]=4X}, 4€ Z. Then 
ga can be written as a *graded Lie algebra in 
the following way: q,— 0, ? - 8, | -- o9. Here 
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we have 


" d 
9, = b rm 





aR}, 


0 0 
—1 E 2i F 
Sa | id, (u, c) E? T Yaa A 


c=(c,), «sec. 


and g consists of all vector fields X,, 5 of the 
form 


à Ó 
Diis * Yu Pappa s 


where the matrices A — (ay), B —(b,,) satisfy 
the conditions: exptAe€ G(V), teR; AF(u,v)— 
F(Bu, v)+ F(u, Bv). For X, 9, €a9 we define 

tr X4 5, to be the sum of the trace of A and 
that of B. Let ai be the A-eigenspace of ad E in 
a, (Ae Z). Then g, can be written also in the 
form of a graded Lie algebra: gg ^g ! + 
a? +g! +9, and g^ — g/ is valid for A= —2, 

— 1, 0. Furthermore g, can be nicely deter- 
mined by g, in the following manner. p, de- 
notes a polynomial on R€ x W homogeneous 
of degree u in z,,...,z,, and homogeneous of 
degree A in u,, ..., Um. Let à! (resp. à?) be the 
set of all polynomial vector fields of the form 


Q ô 
Yuri a AIP o Ma Ee 


d ô 
(es Y Has + VII x.) 
k a 


Then we have g! - (X eá! |[X, 9 ! ] c g?], and 
go? ={X ed? |[X,9- ?]c o5 [X.a “]¢9', 

Im Tr[X, Y]=0 for Yeg 7}. Another descrip- 
tion of al and g? has been given in terms of 
Jordan triple systems [4]. The explicit descrip- 
tions of al and g? have been given for most 
homogeneous Siegel domains D(V, F) over 
irreducible self-dual cones V (— Section D; T. 
Tsuji, Nagoya Math. J., 55 (1974)). 8, = a, is 
valid for the Siegel domains which are irreduc- 
ible quasisymmetric but not symmetric (— 
Section D). Main references for this section are 
[2-6]. 


C. j-Algebras and Homogeneous Bounded 
Domains 


The notion of j-algebra was introduced by 
Pyatetskii-Shapiro [1], which reduces the 
study of homogeneous bounded domains to 
purely algebraic problems. Let g be a Lie 

_ algebra over R, and f a subalgebra of g, (j) a 
collection of linear endomorphisms of g, and oi 
be a linear form on g. Then the quadruple 
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{g, £ (j), œ} (sometimes abbreviated g) is called 
a j-algebra if the following conditions are 
satisfied: (i) jfc f for je(j) and j =j(mod f) for j, 
J'e(y (ii) ? = — id(mod P); (iii) j[k, x] 2 [k, jx] 
(mod f) for Eet xe g; (iv) [jx, jy] Zj[jx. y] + 
jix, jy] +[x, y] (mod f for x, yeg (v) e([k. x]) 
=0 for ket (vi) o(Ljx,jy]) = o(Ls, vlt (vii) 
o(Ljx, x]) » 0 for xét. Let g’ be a subalgebra 
of g such that jg’ c a +f. Then, putting f' — 

q ME, one can naturally induce a j-algebra 
structure on the pair (g', Ek The j-algebra 
thus obtained is called a j-subalgebra of {g, f, 
(j), 0]. A j-algebra {g, t, (j), œ} is called proper 
(resp. effective), if, for any j-subalgebra (g^, f’} 
with g’ compact semisimple, o is contained in 
t (resp. if {g, Í} is an effective pair). 

Now let D be a homogeneous bounded 
domain in C", G a connected tLie subgroup 
of G,(D) acting *transitively on D, and K the 
tisotropy subgroup at a point in D. The Lie 
algebras of G and K are denoted by g and f, 
respectively. Then the pair {g, t) becomes an 
effective proper j-algebra. Conversely, to every 
effective proper j-algebra there corresponds a 
homogeneous bounded domain. The identity 
component of G,(D) is isomorphic to the iden- 
tity component of a treal algebraic group via 
the adjoint representation. Let {g, É, (j), œ} 
be a j-algebra. Suppose that g satisfies the 
following conditions: (i) g— 9 ?--g ! -- 9° asa 
graded Lie algebra; (ii) g? —t-- jg ?; (iii) there 
exists a je(j) such that jg ! 2g '; and (iv) 
there exists an reg ? such that [ jx, r] - x 
for zeg 7. Such a decomposition is called 
a Siegel decomposition of g. To an effective j- 
algebra admitting a Siegel decomposition 
there corresponds a unique Siegel domain up 
to affine equivalence. Vinberg, Gindikin, and 
Pyatetskii-Shapiro (Appendix in [1] or Trans. 
Moscow Math. Soc., 12 (1963)) proved that 
the Lie algebra g,(D) of G,(D) contains a j- 
subalgebra admitting a Siegel docomposition 
and corresponding to the same domain D, 
and obtained the realization theorem: Every 
homogeneous bounded domain D is holomor- 
phically equivalent to a Siegel domain. In con- 
sequence, D is diffeomorphic to a Euclidean 
space, and the isotropy subgroup K,(D)isa 
maximal compact subgroup of G,(D). We have 
the decomposition G,(D) = K,(D): T (semi- 
direct), where T is an R-splittable solvable 
subgroup of G,(D) acting simply transitively 
on D. T is uniquely determined up to conju- 
gacy in G?(D) (=the identity component of 
G,(D)), and is called the Iwasawa group of D. 

The j-algebra structure of the Lie algebra t 
of the Iwasawa group T is characterized by 
the following properties: (1) for every tet, the 
eigenvalues of adt are all real; (ii) there exists 
a tcomplex structure j such that [jx,jy]= 
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jLjx, y] jx jy] - x. y] for x, yet; and (ii) 
there exists a linear form o on t such that 
o(Ljx,jy]) - oe([x, y]) and that o([ jx, x])>0 
for x z0. A Lie algebra satisfying (1)- (i1) is 
called a normal j-algebra. There exists a one- 
to-one correspondence between the set of 
holomorphic equivalence classes of homog- 
eneous bounded domains and the set of j- 
isomorphism classes of normal j-algebras; by a 
j-isomorphism here we mean an isomorphism 
which commutes with j. Let {g,j,@} be a 
normal j-algebra and define an inner product 
<, >ong by <x, y> = @(Ljx, y]). The ortho- 
gonal complement b with respect to < , > of 
the commutator subalgebra a, of g is an 
Abelian subalgebra of g, and the adjoint repre- 
sentation of b on g, is fully reducible. One has 
g= Än Ee Des fo, and gı 2 Änn en where f, = 
ixeal[A x] 2 a(A)x, heb}. The linear form a 
on b is called a root of g. There exist / roots 
95, ..., % (L2 dim b) such that b can be written 
in the form b 2jf, +... HI I being the rank 
of g. Then, after a suitable change of the num- 
bering of the as, any root « will be seen to 

be of the form (x; + %,)/2, (#;—«,)/2 or o;/2, 
where 1 xi k«l. A normal j-algebra admits 
a unique Siegel decomposition which can be 
constructed by using root spaces. 


D. Equivariant Holomorphic Embedding 


We retais the notation of Section B. Let 

D(V, F)c R© x W be a Siegel domain and ge be 
the tcomplexification of the Lie algebra gp. g`! 
has the complex structure defined by the endo- 
morphism ad J. Let oa, be the + i-eigenspaces 
in the complexification gc! of g^! under ad I. 
Let us consider the complex subalgebras b — 
q— +ge+ ac ac and n — ac? - 9; of gc, 
where the subscripts C denote the complexi- 
fication of the respective space. Let Ge be the 
connected *complex Lie group generated by 
ac and containing G? (=the identity com- 
ponent of G,) as a subgroup. The Lie algebra 
of the normalizer B of b in Ge coincides with 

b. Identifying RC x W with n as a complex 
vector space, and denoting by z the natural 
projection of Ge onto the complex coset space 
Gc/B, the composite mapping t= zexp is a 
holomorphic embedding of n into G/B, 

which induces a holomorphic G?-tequivariant 
embedding of D(V, F) into Gc/B as an open 
submanifold. This embedding is called the 
Tanaka embedding. By the (Shilov) boundary S 
of D(V, F) we mean the real submanifold S = 
[Gc iy we RE x W| y=F(u,u)} of RE x W, 
which is a subset of the boundary of D(V, F). S 
has the natural *CR-structure induced from the 
complex structure of R* x W. Every element 
of g, can be extended to a unique holomorphic 
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vector field on RC x W which is tangent to S, 
and its restriction to S is an infinitesimal *CR- 
automorphism on $, i.e., a complete vector 
field generating a 1-parameter group of tCR- 
equivalences of S onto itself. Conversely, every 
infinitesimal CR-automorphism on $ can 

be extended uniquely to a holomorphic vector 
field on R© x W, and an element of g, is char- 
acterized as an infinitesimal CR-automorphism 
on A whose extension leaves the 'Bergman 
kernel form of D(V, F) invariant [6]. Let n— 
dime gc, m= dim b, and let k—2(7) — 1. Then 
Gc/B, and consequently D(V, F), is embedded 
holomorphically into the complex *Grass- 
mann manifold of m-dimensional subspaces in 
ac and so into the tcomplex projective space 
P,(C). Any element of G? is induced from a 
projective transformation and hence is a bi- 
rational transformation on D(V, F). 

Let D be a homogeneous bounded domain 
in C", g, the Lie algebra of G,(D), and f, the 
isotropy subalgebra of q,; and let ac be the 
complexification of q,. 9, is a j-algebra. Let us 
define the complex subalgebra o of gc by 
putting g- — (x ijx|Ixea,,je(j)]. Then we 
have g-=9,+9_.9, I g- zb, Let Gc be the 
connected Lie group generated by ge and 
containing GQ(D) as a subgroup. Let G_ be the 
connected (closed) Lie subgroup of Ge gen- 
erated by g_. Then D can be holomorphically 
embedded in Gc/G_ as the open G?(D)-orbit of 
the origin of Gc/G- [8]. This embedding is 
called the generalized Borel embedding. CJ: ` 
is compact if and only if D is symmetric, and in 
this case G-/G_ coincides with the compact 
dual [9]. 

Let (t9, j, œ} be a normal j-algebra of rank | 
corresponding to a homogeneous bounded 
domain Dp, and define the Hermitian inner 
product h by h(x, y) 2 o(Ljx, y])+ io([x, y]) for 
x, yétg. tg has | — 1 (normal) nontrivial j-ideals 
(e, j-invariant ideals) up to j-isomorphisms. 
Take a j-ideal t, of tj. Then we have ty —t, + 
t5, t; being a (normal) j-subalgebra of t, de- 
fined as the orthogonal complement of t, in 
tg with respect to h. The geometric version of 
this is that D, is represented as a holomorphic 
fiber space over the homogeneous bounded 
domain D, corresponding to t,, with fibers 
holomorphically equivalent to the homog- 
eneous bounded domain D, corresponding to 
t,. For this fibering there exists a universal 
fiber space D over the product D, of certain 
classical symmetric domains, with the same 
fibers, which plays the same role as that of a 
*universal fiber bundle in topology. Here, D is 
again a homogeneous bounded domain. The 
fiber space Dy D, is induced from the fiber 
space DD, by the classifying mapping 4 of 
D, to D,. Let f! be the generalized Borel em- 
bedding of D into Gc/G-. Then there exists a 
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complex Abelian subalgebra m of gc satisfying 
gc =9- +m (semidirect), and one can con- 
struct a biholomorphic mapping f of B(D) onto 
a certain Siegel domain of the third kind over 
D, in the vector space m [8]. The fiber space 
D, D, coincides with the one induced from 
the aforementioned Siegel domain of the third 
kind by the composite mapping of A and f' f. 
Every realization of a homogeneous bounded 
domain as a Siegel domain of the third kind is 
obtained by this method. 


E. Classification of Homogeneous Bounded 
Domains 


The main concern is to classify all homoge- 
neous bounded domains in C" up to holomor- 
phic equivalence. Since the realization as 
Siegel domains has been set up, the second 
step is to get the uniqueness theorem: The 
holomorphic equivalence of two homogeneous 
Siegel domains implies that they are linearly 
equivalent, that is, there exists a (complex) 
linear isomorphism between the ambient vec- 
tor spaces which carries the one domain to the 
other. The uniqueness theorem was first stated 
in 1963 (Appendix in [1]), rigorously proved 
in 1967 [10], and in 1970 the homogeneity 
assumption was removed [2]. A homogeneous 
Siegel domain is called irreducible if it is not 
holomorphically equivalent to a product of 
two homogeneous Siegel domains. Every 
homogeneous Siegel domain is linearly equiva- 
lent to a product of irreducible homogeneous 
Siegel domains [3, 10]. A homogeneous Siegel 
domain D(V, F) is irreducible if and only if the 
regular cone V is irreducible, i.e., if V cannot 
be written as a direct sum of two regular 
cones. So the problem is to classify irreducible 
homogeneous Siegel domains up to linear 
equivalence. This reduces to classifying two 
kinds of nonassociative algebras with bigrada- 
tion, called T-algebras and S-algebras [11]. 
Nonsymmetric homogeneous Siegel domains 
appear in dimension 4. The numbers of such 
domains are finite up to dimension 6, but in 
every dimension z 7 there is at least one con- 
tinuous family of nonsymmetric irreducible 
homogeneous Siegel domains, which are not 
mutually holomorphically equivalent. 

There is a remarkable class of homogene- 
ous Siegel domains, called quasisymmetric 
[12], which contains the class of symmetric 
bounded domains. A regular cone VCR is 
called self-dual if there exists an inner product 
( , )on R such that V — (xeR|(x, y) » 0 for 
ye V—(0)}, V denoting the closure of V. V is 
called homogeneous if the group G(V) is tran- 
sitive on V. Suppose that V is homogeneous 
self-dual. Then the group G(V) is tself-adjoint 
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with respect to the inner product ( , ) on R. 
Let g(V) be the Lie algebra of G(V). Then the 
totality f(V) of skew-symmetric operators in 
g(V) with respect to ( , ) is a *maximal com- 
pact subalgebra of g(V) and is the isotropy 
subalgebra of g(V) at a point ee V. Consider 
the associated tCartan decomposition g(V)= 
t(V)+ p(V). For each xeR there exists a 
unique element T(x)e p(V) such that T(x)e- x. 
Let F be a V-Hermitian form on a complex 
vector space W. Define a Hermitian inner 
product < , > on W by <u, vò —(e, F(u,v)) 

for u, ve W, and let H(W) be the set of Her- 
mitian operators on W with respect to < , >. 
A (homogeneous) Siegel domain D(V, F)c 

R© x W is called quasisymmetric if V is homo- 
geneous self-dual and if for each x eR there 
exists R(x)e H(W) such that F(R(x)u, v)+ F(u, 
R(x)v) = T(x)F(u, v) for u, ve W. The normal j- 
algebra t of an irreducible quasisymmetric 
Siegel domain is characterized by the following 
conditions: dimf, ,,,5 —a (l<i<k<l); and 
dim k, , —b (1 &i& I), where a, b are some 
constants and / is the rank of t (D'Atri and de 
Miatello). Quasisymmetric Siegel domains 
have been completely classified (M. Takeuchi, 
Nagoya Math. J., 59 (1975), also [12]). 


F. Generalized Siegel Domains and Further 
Results 


Let Q be a domain in C" x C" (n, mz 0) which 


. is holomorphically equivalent to a bounded 


domain and contains a point of the form (z, 0), 
ze C". Q is called a generalized Siegel domain 
with exponent c (c e R), if Q is invariant under 
holomorphic transformations of C" x C" of the 
types 


(zu) 5(z--a,u) forall aeR", 


(zu)—(ze"u) forall teR, 


(zu) (e'z, e"u) forall teR. 


Let D be a bounded domain in C", and T a 
subgroup of G,(D). T is said to sweep D if there 
exists a compact set K c D such that l'K =D. 
T is said to divide D if T, provided with dis- 
crete topology, acts properly on D and sweeps 
D. D is called sweepable (resp. divisible) if there 
exists a subgroup T of G,(D) which sweeps 
(resp. divides) D. A divisible generalized Siegel 
domain is symmetric. A sweepable generalized 
Siegel domain with exponent c #0 (resp. c — 0) 
is a Siegel domain (resp. a product of a Siegel 
domain of the first kind and of a homogene- 
ous bounded circular domain) ([13]; also A. 
Kodama, J. Math. Soc. Japan, 33 (1981)). 
Some results have been obtained concerning 
geometry of bounded domains, homogeneous 
bounded domains, and Siegel domains in 
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complex Banach spaces [14], and also con- 
cerning the unitary representations of the 
generalized Heisenberg group on the square- 
integrable cohomology spaces of 6,-complexes 
on the Shilov boundary of a Siegel domain 


[15]. 
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385 (XVI.6) 
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A. General Remarks 


Simulation, in its widest sense, is a method of 
utilizing models to study the nature of certain 
phenomena. This method is employed when 
experimentation with the actual phenomena in 
question is difficult because of high cost in 
time or money. Also, it is sometimes almost 
impossible to carry out observations when the 
behavior of the objects can be influenced by 
their surroundings. 

We can classify simulation techniques into 
the following four types, although simulations 
in practical use are usually a mixture of them. 

The first type is model experimentation, 
which includes model basins and wind tun- 
nels in hydrodynamics and pilot plants in the 
chemical industry. In advance of construction 
in a real situation, we perform experiments 
on a small scale and verify or modify those 
theories upon which the construction is based. 

The second type is analog simulation or 
experimental analysis. We investigate the 
properties of real objects by experiments on 
alternative phenomena satisfying the same 
differential equations as those known for or 
assumed to be satisfied by the real objects. For 
example, we use an equivalent electric network 
to study dynamic vibration, and dynamic 
systems to study heat conduction problems. 
When theoretical analysis of the actual phe- 
nomenon is difficult, we look for other phenom- 
ena with similar properties and study them 
in order to construct mathematical models for 
them. This type of simulation has come into 
practical use mainly in engineering problems, 
but recently it has been utilized for the study 
of economic phenomena, nervous systems, the 
circulating system of an artificial heart, etc. 
Analog simulation was in the past often per- 
formed by means of *analog computers. Now- 
adays, analog simulation is more frequently 
performed by digital computers than by ana- 
log ones. And with the progress of electronics 
it has become easier to make special-purpose 
simulators. 

The third method of simulation, simulation 
in the narrow sense, has become more impor- 
tant as *digital computers have been developed. 
In general this method is applied to problems 
that are more complicated and of larger scale 
than problems treated by analog simulation. 
When the mathematical expressions of the 
phenomenon and the algorithms of its dy- 
namic structure are known, it is easy to simu- 
late it by means of a computer program. In 
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particular, when these techniques are used to 
study systems such as sets of machines, equip- 
ment at factories, or management organiza- 
tion, we call them system simulations. Major 
fields where system simulation techniques have 
been used are traffic control on highways or at 
airports, arrangement or operation of ma- 
chines at factories, balancing problems in 
chemical processes, production scheduling in 
connection with demands and stocks, overall 
management problems, and design of informa- 
tion systems. The method has also been ap- 
plied in designing plants and highways and in 
the study of social or biological phenomena. 
Also, when we investigate instruction systems 
of computers that are yet to be completed 

or develop programming systems for such 
computers, existing computers can be used to 
simulate the new ones. *Random numbers play 
an important role where the simulation must 
include random fluctuations (— 354 Ran- 
dom Numbers). In such instances, the method 
is often called the Monte Carlo method (— 
Section C). 

The fourth method of simulation deals with 
systems containing human beings. Among 
them are war games for training in military- 
operation planning, business games for train- 
ing in business enterprises, and simulators for 
training pilots and operators of atomic power 
plants. The contribution of human decision to 
simulation processes is characteristic of these 
cases. For example, the participants in a busi- 
ness game are divided into several enterprise 
groups. Each group discusses and decides how 
to invest in plants, equipment, research, and 
advertising and how to schedule production 
for each quarter. On the basis of the deci- 
sions, a computer outputs the records of sales, 
stocks, and cash for each quarter, according 
to hidden rules. From the results, each group 
decides on the next steps. In this way, the 
groups compete for development. This type 
of simulation is important not only for train- 
ing but also for investigating the mechanism 
of human decision. Slightly different from 
this type of simulation, the “perceptron” and 
EPAM (Elementary Perceiver and Memorizer) 
are related to artificial intelligence and have 
been used extensively in the cognitive sciences 
and in research into the structure and function 
of the human brain. 

The third type of simulation has attracted 
attention in particular and has been used both 
in theoretical problems, such as the explication 
of various phenomena, and in practical prob- 
lems, such as design or optimum operation of 
systems or prediction of their behavior. For 
school education and training of technicians, 
this is put to use together with simulations of 
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the fourth type. But their use has also induced 
heated discussions and controversies on the 
validity of results. 


B. Programming Languages for Simulation 


We usually describe models by using general 
purpose language: such as FORTRAN, or list 
processing languages such as LISP to simulate 
situations on computers. For system simula- 
tion, a number of programming languages 
have been developed and put to practical use. 
They can be divided roughly into two cate- 
gories: those for which systems change continu- 
ously and those describing discrete changes. 
CSMP (Continuous System Modeling Pro- 
gram), CSSI (Continuous System Simulation 
Language), and DDS (Digital Dynamics 
Simulator) belong to the former, and hence all 
involve integration mechanisms; but each 

has a different way of describing a model. 
DYNAMO, which has been implemented, or J. 
W. Forrester's Industrial Dynamics and World 
Dynamics, are used extensively. To control 
simulation time, one may use GPSS (General 
Purpose Simulation System), SIMULA 
(Simulation Language), or SIMSCRIPT 
(Simulation Scriptor), each employing a differ- 
ent method to describe state transitions. 


C. The Monte Carlo Method 


The Monte Carlo method was introduced by J. 
von Neumann and S. M. Ulam around 1945. 
They defined this as a method of solving deter- 
ministic mathematical problems using tran- 
dom numbers. L. de Buffon's needle experi- 
ment, in which the approximate value of z is 
obtained by dropping needles at random many 
times, is a classical example of this method. 
Another example is the problem of evaluat- 
ing a definite integral I = (^ f(x)dx (B > f(x)z 
AZO). First we generate many pairs of (uni- 
form) random numbers (x, y, where a<x<b_ 
and A « y « B. The proportion (p) of pairs 
satisfying y < f(x) gives an estimate of the inte- 
gral, i.e., I = p(B— A)(b — a). The techniques 
for inverting matrices, solving tboundary value 
problems of partial differential equations, and 
so on, are also examples of the Monte Carlo 
method in this sense. However, direct numer- 
ical calculation seems to be more useful in 
dealing with this sort of problem. At present, 
Monte Carlo methods are usually used when it 
is difficult to construct (or solve) mathematical 
equations describing the phenomena in ques- 
tion, for example, when the phenomena in- 
volve tstochastic processes such as random 
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walks. Some methods have been devised so as 
to get precise results efficiently. 
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A. Basic Notion 


It is often useful to focus attention on the 
relation between a physical system's input and 
output, without worrying about intermediate 
processes (the black box), which may be in- 
sufficiently understood or too complicated to 
analyze. For the scattering of particles, this 
leads to the notion of an S-matrix that directly 
relates the state of incoming particles (before 
scattering processes take place) to that of 
outgoing (scattered) particles. 

In typical cases, the incoming and outgoing 
particles are described as mutually noninter- 
acting (these are called free particles). This 
implies particle motions along straight lines at 
constant speeds (asymptotic to the actual 
motion at infinite past for incoming particles 
and at infinite future for outgoing particles) in 
classical mechanics, and wave functions (or 
vectors in a Hilbert space) obeying the Schró- 
dinger equation with a free Hamiltonian Hj, in 
quantum mechanics and more or less the same 
in quantum field theory. 

A wave function for n particles is an L,- 
function of their momenta p4, ..., p, (each p; 
being a 3-dimensional vector) with respect to 
an appropriate measure (normally the Le- 
besgue measure du(p) — II dp, in quantum 
mechanics and the Lorentz-invartant measure 
dy p) = EL((m? + p?) '? d? pj] in quantum field 
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theory, where m is the mass of the particle), 
and the S-matrix S is described in terms of the 
S-matrix elements (p,, ..., p,lS|pi. .... Ph) 
(which is a distribution) as 


(®, S'P)— Jang Sip) F(p)du(pYdu(p?) 


where ® and H are wave functions for n and n 
particles, p=(p,,...,p,), and similarly for p'. 
(p|S|p’) gives quantities measured in scattering 
experiments, as will be explained in Section B 
(2). 

In quantum mechanics, the free and actual 
(interacting) motion of particles is described in 
the same Hilbert space with free and interact- 
ing Hamiltonians H, and H A vector in 
interacting motion behaves like a vector ¢ in 
free motion at infinite past if 


I(exp[ —iHt]) —(expL —iHot])o | 50 


and hence 
P= W (H; Hy)¢, 


W (H;H,)- lim ete "Tei. 
t=- 
Such a ® is often written as 


o-9^()- | o"tpiotp dup. 


and is called an in-state. A definition for an 
out-state dn is obtained by changing t —oo 
to t> +œ and W_ to W, . The S-matrix ele- 
ment is defined (as a distribution) by 


(pIS|p) - (b*" (p), re, 


The existence and properties of W, (called 
twave operators) are central subjects in scat- 
tering theory (— 375 Scattering Theory). 

In quantum field theory, the asymptotic 
description is given in terms of vectors in 
tFock space, and the in- and out-states are 
constructed in terms of *asymptotic fields 
(— 150 Field Theory). 

The foregoing description actually applies 
only to a system of one-component particles 
of the same kind. More generally, additional 
(discrete) variables, say «, are needed to dis- 
tinguish different kinds of particles and differ- 
ent spin components of each kind of particle, 
and the p's appearing in the above formulas 
should be replaced by (p, as, along with re- 
lated changes in the measure. 

Even in a quantum mechanics of many 
identical particles a bound state, if it exists, is 
to be treated as another particle (different from 
the original one) and should be distinguished 
by as in the asymptotic description o. 

If the interaction is of long range (e.g., 
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Coulomb interaction), the classical path of a 
particle does not have an asymptote in gen- 
eral, and correspondingly the wave operators 
W, do not exist for the usual free Hamil- 
tonian. Still, an asymptotic description of 
scattering is possible in some cases. 

In the presence of massless particles, such 
as photons in quantum field theory, another 
difficulty, called the infrared problem, can arise 
in the asymptotic description of scattering 
because the scattered particle may be accom- 
panied by an infinite number of massless par- 
ticles (with very small energy). In such a situ- 
ation, a representation of a free massless field 
not equivalent to the standard Fock represen- 
tation is believed to be a possible candidate — 
for the asymptotic description of scattering. 


B. Basic Properties 


(1) Invariance. Let ⁄ be the Hilbert space for 
the asymptotic description of scattering, such 
as the space of L;-functions o(p,, ..., Pn) rela- 
tive to the measure du(p). The S-matrix is an 
operator on # whose matrix element is as 
described above. (The corresponding operator 
S in the Hilbert space # describing the inter- 
acting states is defined by Sd"! (o) = '"(¢) 
and is sometimes called an S-operator.) 

S is said to be invariant under a group G of 
transformations of the p's (and possibly the o's) 
if (U(g)g)(p) 9 o(g ^! p) defines a continuous 
unitary representation U(g) and if U(g)S= 
SU (g) for all ge G. First, S is usually invari- 
ant under time translation. In the quantum 
mechanics of a particle scattered by a rotation- 
ally invariant potential, S is invariant under 
the group of rotations of 3-dimensional vec- 
tors p; in the quantum mechanics of many 
. particles (mutually interacting through central 
potentials), S is invariant under the 3-dimen- 
sional Euclidean group of transformations 
p>Rp +a with rotation R; in relativistic field 
theory, S is assumed to be invariant under the 
tinhomogeneous Lorentz group of transfor- 
mations p>Ap-+a with p —(p?, p) and homo- 
geneous Lorentz transformation A. 

In all these examples, G is of type I, i.e., 
there is a direct integral decomposition 


H= Lem & L(k)dv(k), 


U(g)= | U,(g) & 1 gay dv(k), 


into irreducible representations U, on #(k), 
which are mutually inequivalent, where (Kk) is 
some Hilbert space for each k. The S-matrix is 
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invariant under G if and only if 


S= | 1,9) ® S(k)dv(k). 


When a (scalar) particle is scattered by a 
central potential, irreducible representations 
are labeled by the energy E(p)= p?/(2m) (for 
time translation) and the angular momentum / 
(for rotations) with dim L(E(p), I) = 1. There- 
fore each S(k)=S,(|p|) is a number. For any 
given energy, 1— 0, 1,2,... are referred to as the 
S-wave, P-wave, D-wave, ... or generally as 
partial waves. 

For relativistic scalar particles, irreducible 
representations (with positive energy and 
nonzero real mass) are labeled by the center- 
of-mass energy squared s (—(X (n? + p2)2y — 
(È pj?) and the total angular momentum L If s 
is below the threshold (3m)? of 3-particle 
scattering, dim .Z'(k) 2 1 and S(k)=S,(s) is a 
number. 


(2) Unitarity. Sọ is supposed to represent the 
t= +œ asymptotic (free) behavior of the state 
that initially (i.e., at t= —oo) behaves like a 
free state o If there is no loss of probability in 
the description of scattering, the S-matrix is 
isometric. If all asymptotic configurations 

are realized as a result of scattering, the S- 
matrix must also be unitary. The mappings 


qi" oe, d" Leier (W, in quantum 
mechanics) are proved to be isometric under 
a general assumption. The unitarity is then 
proved in potential scattering (under some 
conditions on the potential) by showing that 
the two wave operators W, have the same 
range. In fact, a somewhat stronger result — 
that this range is the same as the absolutely 
continuous spectral subspace for the interact- 
ing Hamiltonian H—is usually proved and is 
called completeness (of scattering states in the 
absolutely continuous spectral subspace of H). 
If the mappings ®™ and ^" (or W,) are iso- 
metric and have the same range, then O'"(@) = 
Q^" (So), which shows that the state behaving 
like o at t= —oo behaves like Sg at t= 4-oo. 
In the simple multiplicity cases such as 
S,({p|) and S,(s) above (which correspond to 
the physical situation of purely elastic scatter- 
ing without any production or change of par- 
ticles), these numbers must be of the form e?? 
due to unitarity, where the real number ó, is 
called the phase shift. In terms of the phase 
shift, the differential cross section do/dQ, which 
is the average number of particles scattered per 
unit time per unit solid angle around the direc- 
tion forming an angle 0 with the incident 
uniform parallel beam of unit intensity (one 
particle per unit time per unit area) when 
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viewed in the center-of-mass system, and the 
total elastic cross section c, — [(do/dQ)dQ are 
given by the following formulas, called the 
partial wave expansion: 


da/dQ =| f(s,0)?, 

fs, k^ V (OI IL P(cos0), f= sinde”, 
i-0 

Gq 7 Ank ? V (21+ Hsin? à, 
l 


where dQ — sin 0 d0 dọ (invariant measure on a 
2-dimensional sphere 5?) and k is the wave 
number of the particle in the center-of-mass 
system (k-h '|p| h^! [(s/4) m^] 7). The 
function f is called the scattering amplitude. 
The forward scattering amplitude f(s, 0) is 
related to the total cross section o,,, by the 
optical theorem: 


Gi = 4k! Im f(s, 0), 


which follows from the unitarity of the S- 
matrix. Here the total cross section is expressed 
as the area of the transverse cross section 

of a classical (impenetrable) scatterer that 
would scatter the same amount of particles. 

In a purely elastic region, o,,, = Se and the 
optical theorem is the same as the assertion 

Im fiel fil. 

Even for values of s above the threshold of 
inelastic scattering, the restriction of S, to the 
subspace of two particles is again described by 
numbers e??!, where 6, is now complex, and 
the same formulas for the differential and total 
elastic cross sections hold, except that uni- 
tarity of Sy now implies Im f, >| fl’. 


(3) TCP Symmetry. In the framework of either 
faxiomatic quantum field theory or the *theory 
of local observables, the TCP theorem (or 
PCT theorem) shows [1] that to every particle 
there corresponds another particle with the 
same mass and spin, which is called the anti- 
particle and can be the original particle itself, 
such that any particle is the antiparticle of its 
antiparticle, and the following relation, called 
TCP invariance (or PCT invariance [2]) holds 
for S-matrix elements: 


(p,a|S{p’, ai 
=n lai OFEN p, Q,a'|S|p, 05a). 


Here F(a) and F(a’) are the number of particles 
with half odd integer spin amongst the incom- 
ing and outgoing particles (i.e., in x and a’), 
n(x) and nfa) are the product of +1 assigned 
to each particle in x and a’, respectively, with 
the assignment to a particle and its antiparticle 
the same for bosons and opposite for fermions, 
(p. X) is an abbreviation for an ordered n-tuple 
of energy-momentum 4-vectors p; and other 
indices o; (for spin components and particle 
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species), i= I, ..., n, and similarly for (p’, x’), and 
0, is the mapping from particles to their anti- 
particles (without changing spin indices). We 
may view the mapping |p, x) (a)i* |p, 05%) 
as an antiunitary operator O, on J£, satisfying 
0,50,! = S* so that 


(1,892) 5 (0092, 5,9691). 


This is related to the "TCP operator O in 
quantum field theory by Git) = V/^(O, o) 
and O'"(~) = Y (©, op), The name TCP 
comes from the combination of T for time 
reversal (incoming Æ outgoing, v z p/m — v), 
C for charge conjugation (particle æ anti- 
particle), and P for parity, which is a quan- 
tum number for space inversion (p- — p). 

TCP symmetry was suggested to G. Lüders 
(Dansk. Mat. Fys. Medd., 1954) by B. Zumino 
in the form that P-invariance implies TC sym- 
metry. W. Pauli (Niels Bohr and the develop- 
ment of physics, Pergamon 1955, 30) realized 
that TCP is a symmetry. R. Jost (Helv. Phys. 
Acta, 1957) gave its proof in the framework of 
axiomatic quantum field theory. 


(4) Crossing Symmetry. In the framework of 
either axiomatic quantum field theory or the 
theory of local observables, it has been shown 
by J. Bros, H. Epstein, and V. Glaser (Comm. 
Math. Phys., 1 (1965); also [3]) that there exist 
analytic functions H (Kk, x) of complex variables 
k — (k,, ..., k3) ìn a certain domain D such that 
each k, is a complex 4-vector, the variables are 
on the mass-shell manifold defined by k? =m? 
(k? is in the Minkowski metric) and £ k;=0, 
and the boundary value of H(k, x) as k; ap- 
proaches £p; from Im s 0 (s being the square 
of the sum of two K's for incoming particles) in 
D is the scattering amplitude for the following 
processes involving the particles A; and their 
antiparticles A; (j= 1, ..., 4) with 4-momenta 
pj, £j: 1 for A; and —1 for A; (some of the A's 
and A's may coincide): 


ONE TEE EE 
DI | A4 A42A, +A), 
DI — A,» A42 45 A4, 
üi) | 4,4424, A5, 
(i) A,4442454 43, 
(ui) A, A42 À, +Å. 


Any pair of relations taken from (i)-(ii1) con- 
stitutes a crossing symmetry. 


(5) High Energy Theorems. M. Froissart (Phys. 
Rev., 123 (1961)) obtained from the Mandel- 
stam representation the following bound for 
the forward scattering amplitude (called the 
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Froissart bound) at large s: 
|F(s,0)| « (const)s(log s)", 
Fis, t) ^ kf (s, 0), t —-2k^ (cos 0 — 1). 


A. Martin has obtained such a bound in the 
axiomatic framework. As a consequence we 
have the Froissart-Martin bound: 


Gi <42R CS (1+ ey, (log(s/so)Y^, 


where ¢>0 is arbitrary, R can be taken to be 
4m] (m, is the mass of a pion) for many cases, 
such as zz, nK, «N, and zA scattering, and so 
is an unknown constant. 

Many other upper and lower bounds for 
scattering amplitudes have been obtained 
under other assumptions [4, 5]. 

I. Ya. Pomeranchuk (Sov. Phys. JETP, 7 
(1958)) suggested the asymptotic coin- 
cidence of total cross sections at high energy 
for scattering of AB and AB, where B is the 
antiparticle of B. This Pomeranchuk theo- 
rem has been shown to hold by Martin 
(Nuovo Cimento, 39 (1965)) by using the ana- 
lyticity derived in the axiomatic framework 
under the following sufficient condition: 

The existence of Im... [o(AB) — c(AB)] and 
lim, ,,,(slogs) ! f(s, 0)=0. 


C. S-Matrix Approach 


(1) History. All the information needed to 
understand the experimental elementary- 
particle scattering data seems to be expressible 
by S-matrix elements. It was therefore natural 
to try to develop a foundation (and practical 
methods of calculation) for the theory of ele- 
mentary particles on the basis of some gen- 
eral properties of the S-matrix, especially 
when other approaches, such as quantum field 
theory, faced difficulties. W. Heisenberg (Z. 
Phys., 120 (1943)) first pointed out the possi- 
bility of such an approach soon after the intro- 


duction of the S-matrix by J. A. Wheeler (Phys. 


Rev., 52 (1937)). Unfortunately, in the early 
1940s not much dynamical content could be 
given to such an S-matrix-theoretic approach 
because only unitarity and some invariance 
properties, such as tLorentz invariance, were 
used to characterize the S-matrix. In the late 
1950s, through the study of the analyticity of 
the S-matrix in connection with dispersion 
relations in quantum field theory, it became 
evident that causality is another important 
determinant of S-matrix structure. In practice, 
causality in position space is used in the form 
of the analyticity of the S-matrix elements in 
the energy-momentum space (variables dual to 
space-time positions in the Fourier transform). 
Subsequently it was realized that analyticity 
combined with unitarity gives surprisingly 
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strong control over the structure of the S- 
matrix (G. F. Chew [6]; H. P. Stapp, Phys. 
Rev., 125 (1962); J. C. Polkinghorne, Nuovo 
Cimento, 23 and 25 (1962); J. Gunson, J. Math. 
Phys., 6 (1965); D. I. Olive, Phys. Rev., 135B 
(1964); Chew [7]; R. J. Eden et al. [8]). The 
study of the S-matrix based on its general 
properties, such as invariance, unitarity, and 
analyticity, is called S-matrix theory. In the 
present form, it is adapted only to massive 
particles with short-range interactions, and 
its applicability is believed to be limited to 
strongly interacting systems. 


(2) Landau-Nakanishi Variety. C. Chandler 
and Stapp (J. Math. Phys., 10 (1969)) and D. 
Iagolnitzer and Stapp (Comm. Math. Phys., 14 
(1969)) clarified the analytic structure of the S- 
matrix in terms of Landau equations (— 146 
Feynman Integrals) based on the important 
physical idea of macroscopic causality, which 
gives much more precise information in the 
physical region than a superficial application 
of tlocality (also called microcausality) in 
axiomatic quantum field theory, though it is 
possible that a detailed study starting from 
microcausality and incorporating fasymptotic 
completeness (the so-called nonlinear program 
in axiomatic quantum field theory) might 
eventually entail the macroscopic causality 
condition (e.g., J. Bros, in [9]; lagolnitzer 

[10, ch. IV]; also K. Symanzik, J. Math. Phys., 
1 (1960)). 


(3) Microlocal Analysis. An important fact is 
that the normal analytic structure of the S- 
matrix discussed by Iagolnitzer and Stapp 
essentially coincides with the notion of analy- 
ticity in microlocal analysis, i.e., with micro- 
analyticity (C 274 Microlocal Analysis; F. 
Pham and Iagolnitzer, Lecture notes in math. 
449, Springer, 1975; M. Sato, Lecture notes 

in phys. 39, Springer, 1975)). In a word, the 
tLandau equations have acquired a new inter- 
pretation in the description of the micro- 
analytic structure of the S-matrix. In the new 
developments, the Landau equations define a 
variety in the cotangent bundle (over the mass- 
shell manifold in momentum space), and the 
*singularity spectrum of S-matrix elements is 
assumed to be confined to | Je 7 * (G) (except 
for the so-called points, where G ranges 
over all possible Feynman graphs and £ *(G) 
denotes the positive-« Landau-Nakanishi 
variety associated with G (— 146 Feynman 
Integrals). The union | Je £+ (G) is known to 
be locally finite and hence makes sense (Stapp, 
J. Math. Phys., 8 (1967)). The old interpre- 
tation of Landau equations, as defining a 
variety in energy-momentum space, corre- 
sponds now to considering the variety L*(G) 
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obtained by projecting Z * (G) from the co- 
tangent bundle to the base manifold (e. the 
mass shell manifold). The new interpretation of 
Landau equations led Sato (Lecture notes in 
phys. 39, Springer, 1975) to make a further 
intriguing conjecture that the S-matrix would 
satisfy a special toverdetermined system (a 
*holonomic system) of *(micro-) differential 
equations whose tcharacteristic variety is given 
by the complexification of Landau-Nakanishi 
varieties. This conjecture is closely related to 
the monodromy-theoretic approach by T. 
Regge (Publ. Res. Inst. Math. Sci., 12 suppl. 
(1977) and the references cited therein) and his 
co-workers. 


(4) Discontinuity Formula. It has turned out 
that the above approach is closely related to 
the so-called discontinuity formula obtained by 
combining the unitarity and the analyticity of 
the S-matrix. Actually T. Kawai and Stapp 
(Publ. Res. Inst. Math. Sci., 12 suppl. (1977)) 
have shown that Sato's conjecture can be 
verified at several physically important points 
on the basis of the discontinuity formula. The 
discontinuity formula was first found by R. E. 
Cutkosky (J. Math. Phys., 1 (1960)) for Feyn- 
man integrals. It describes the ramification 
property of the integral around its singularities 
(— 146 Feynman Integrals). An analogous 
formula has been shown to be valid also for 
the S-matrix, and it demonstrates how strict 
are the constraints derived from unitarity and 
analyticity (Eden et al. [8, ch. 4]; M. J. D. 
Bloxham et al., J. Math. Phys., 10 (1969); J. 
Coster and Stapp, J. Math. Phys., 10 (1969); 
also Stapp in [11] and Iagolnitzer [10]). Note, 
however, that the derivation of the hitherto- 
known discontinuity uses either some ad hoc 
assumptions or some heuristic reasoning 
which is not rigorous or sometimes is even 
erroneous from the mathematical viewpoint. 
Efforts to give a rigorous proof are still being 
made, and these present several mathemati- 
cally interesting problems (eg. Iagolnitzer in 
[9] and M. Kashiwara and Kawai in [9]). 


(5) Regge Poles. The results stated so far con- 
cerning the analyticity of the S-matrix have 
been primarily derived in the low-energy 
region. It is commonly hoped that these results 
can be related to its high-energy behavior 
through the inner consistency of S-matrix 
theory, even though it is still unclear to what 
extent such a relationship can be developed. 
Such a hope was advocated by Chew, who had 
been inspired by the results of Regge (Nuovo 
Cimento, 14 (1959); 18 (1960)) for potential 
scattering. After being adapted to the relativ- 
istic case, Regge's idea took the following 
form: Consider the scattering of two incoming 
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and two outgoing scalar particles with equal 
mass m>0. Let f(s) be the partial scattering 
amplitude defined earlier. Regge introduced 
the idea of extending the function f(s) to an 
analytic function f(I, s) (Ie C) and of applying 
the Sommerfeld-Watson transformation in 
order to replace the partial wave expansion by 
the integral 


i! SEI (21+ 1)f(I, s) B( — cos 0) dl/sin nl 
c 


=F(s,t) 


for a certain contour C in the complex /-plane 
which encircles (0, 1,2, ... }. If TU s) is mero- 
morphic in Reiz —1/2 and if it tends to zero 
sufficiently rapidly at infinity, then one can 
change the contour C so that, with the help of 
Cauchy's integral formula, F ~ constant: t”, 
a(s)= max Rel(s), where the maximum is taken 
over all the poles of f(I, s). Thus the poles of 
the extended function f(l, s) determine the 
asymptotic behavior of F as t— œ (Regge 
behavior) under the assumption that f(l, s) can 
be chosen to satisfy suitable analyticity and 
growth order conditions. These poles are 
called Regge poles. Even though meromorphy 
conditions are found to be satisfied for scatter- 
ing by a (Yukawa) potential, they do not seem 
to be satisfied for the full S-matrix in the relati- 
vistic case. More general cases than those 
discussed here, i.e., the cases where more vari- 
ables are considered, are also being studied 
but without full success at the moment. For 
details and references — [7, 12,13]. 


(6) Veneziano Model. In connection with 
Regge-pole theory, we note an interesting 
observation by G. Veneziano (Nuovo Cimento, 
57A (1968)) to the effect that T (1 — a(s))E(1— 
a(t))/T(1— (s) — a(t)), with a(s) being linear 

in s, satisfies a crossing symmetry (in s and t) 
and shows an exact Regge-pole behavior. 
Although the many results that have been 
obtained give rise to a hope of constructing a 
realistic model of the S-matrix starting from 
the aforementioned function, no one has yet 
succeeded [14]. A more promising approach is 
the topological expansion procedure in which 
the first term of the expansion apparently 
shares with the potential-scattering functions 
the property of having only poles in the com- 
plex /-plane, along with several other physi- 
cally important properties of Veneziano’s 
function. 
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A. General Remarks 


Solitons are nonlinear waves that preserve 
their shape under interaction. Mathematically, 
the theory of solitons continues to develop as 
a theory of completely integrable mechanical 
systems. Typical examples are the Korteweg- 
de Vries equation (— Section B), the Toda 
lattice (— 287 Nonlinear Lattice Dynamics), 
and the Sine-Gordon equation 


Ha — Uz, = sinu, 


studied classically in connection with trans- 
formations of surfaces of constant negative 
curvature. 


B. The KdV Equation 


In the late 19th century J. Boussinesq and then 
D. Korteweg and G. de Vries obtained equa- 
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tions describing water waves having traveling- 
wave solutions. The equation 


u,—6uu,+u,,=0,  u-u(xt), (1) 


derived by de Vries is called the KdV equation 
for short. Putting u(x, t) 2 s(x — ct — ô), we find 
that s is an felliptic function, and we obtain 


s= sech? GC 


2 


as its degenerate form. This solution is called a 
solitary wave. 

Around 1965, M. Kruskal and N. Zabusky 
solved the KdV equation numerically, taking 
several separated solitary waves as the initial 
data. They found that the waves interact in a 
complicated way but that eventually the initial 
solitary waves reappear. Noting the particle- 
like character of the waves, they called each of 
these waves a soliton. Subsequently, the KdV 
equation was found to have an infinite number 
of constants of motion. 

C. Gardner, J. Greene, Kruskal, and R. 
Miura associated the 1-dimensional *Schrédin- 
ger operator — d?/dx? + u(x, t) to each solution 
u(x, t) of the KdV equation and showed that 
its 'eigenvalues are preserved in time. More- 
over, they applied inverse scattering theory 
(— Section D) and obtained explicit formulas 
for the solutions. 


C. Lax Representation 


Let 


L=—D*+u(x), D=d/dx. 


For 
M = —4D? + 6uD + 3u,, 


the commutator [ M, L] - ML — LM is the 
operator of multiplication by the function 
Guu, — Han, So u lä, t) is a solution of the 
KdV equation if and only if 


L,-(M,L]. (2) 


Equation (2) is also the condition that all L= 
L(t) are tunitarily equivalent to each other 
and the tspectrum of L is preserved through 
time. 

Most equations having soliton solutions can 
be represented in the form of (2) for a suitable 
pair of L and M. This representation is called 
the Lax representation. One sometimes says 
that an isospectral deformation of L is given 
by (2). 

On the other hand, isomonodromic defor- 
mations (— 253 Linear Ordinary Differential 
Equations (Global Theory)) have been studied 
extensively by M. Sato and his co-workers, 
and relations to soliton theory have been 
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discovered (Sato, T. Miwa, and M. Jimbo, Publ. 
Res. [nst. Math. Sci., 14 (1978), 15 (1979)). 

In the present case, the requirement that the 
commutator [ M, L] be a multiplication by a 
function determines an essentially unique 
(2n 4- {)th-order ordinary differential operator 
M=A,, the differential operator part of the 
fractional power L"*!?, [A,, L] is a poly- 
nomial in u and its derivatives, denoted by 
K,[u]. The equation u, = K,[u] is called the 
nth KdV equation. The transformation taking 
the initial data u(x) to the solution u,(x, t) of 
the nth KdV equation is denoted by Tu) 
Then F(t) T, (s) — T, (s) R(t), Le, the flows de- 
fined by these higher-order KdV equations 
commute. This property and the existence of 
infinite number of invariant integrals are con- 
sequences of the complete integrability of the 
higher-order KdV equations considered as 
infinite-dimensional Hamiltonian systems. 
The KdV equations can be studied group- 
theoretically as Hamiltonian systems on a 
certain coadjoint orbit in the *dual space of 
the tLie algebra of a certain class of *pseudo- 
differential operators (M. Adler, Inventiones 
Math., 50 (1979); also — B. Kostant, Advances 
in Math., 34 (1979) for the analogous facts for 
the Toda lattice). 

The ordinary differential equation K,[u] —- 0 
is called a stationary KdV equation. By the 
commutativity of the flows T(t), each KdV 
flow leaves invartant the space of solutions of 
K,[u] O0. The flows restricted to this space 
form a completely integrable Hamiltonian 
system with finitely many degrees of free- 
dom (S. P. Novikov, Functional Anal. Appl., 8 
(1974)). 

K,[u] =0 is also the condition that there 
exist an ordinary differential operator M 
which commutes with L. J. Burchnal and T. 
Chaundy (Proc. London Math. Soc., (2) 21 
(1922)) studied this problem and showed that 
such L and M are connected by the relation 
M? — P(L), where P is a certain polynomial of 
degree 2n 4- 1 and that the potential u is ex- 
pressed by the *theta function associated with 
the thyperelliptic curve w? = P(z). 


D. Inverse Scattering Method 


Let u(x) be a potential such that u(x)-+0 as 
x coo. The equation Lf — C?f (Im > 0) has 
solutions f, (x, C) that can be represented as 


filet fE K(x, tjet qr). 


Putting C= č + in and noting that f, (x, ë) and 
f. (x, — č) are independent solutions of Lf = 
& f, one can express f_ as 


f-(x, = ald) f. (x, — E) + b(GD f(x, č). 
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The coefficient a(£) can be continued analyti- 
cally to the upper half-plane, where it has only 
a finite number of zeros, all of which are sim- 
ple and lie on the imaginary axis. Denote 
them by in; (j— 1, ..., n). The *point spectrum 
of the operator L consists of the numbers 

— n}, and the associated eigenfunctions are 
f(x, inj, which are real-valued. Put 


cj Q fos in)? dx). 


and call t(£) — a(2) * and r(£) = b(Z)/a(4) the 
transmission coefficient and the reflection 
coefficient, respectively. The triplet r(£), nj, c; 
(jo 1, ...,n) is called the scattering data. It is 
connected with the kernel K =K , by the 
Gel'fand-Levitan-Marchenko equation 


oo 


K(x, t) - Ficen | F(x - t-- s)K(x, s)ds 


D 


(t 7 0), 


SEN r(Z)e? dx - 2Y. cje "äs, 
jal 


The potential is given by 
u(x)= —(0K/Ox)(x, 0). 


When the reflection coefficient r(£) vanishes 
identically, the potential is called a reflection- 
less potential. The kernel K then becomes a 
'degenerate kernel and the potential is ex- 
pressed by 


2 


d 
u(x)- —2 1098 D(x), 


where D(x) is the determinant of then x n 
matrix whose j, k entry is An -- c;exp( — (n; 
my) X} An; mJ. 

The authors of [1] showed that if u(x,t) is a 
solution of the KdV equation and if u(x, t) 50 
(x +00), then the time development of the 
scattering data of the potential u(x, t) is as 
follows: n and y; do not depend on t, and 


ré p= rE et, 


The solution associated with the reflectionless 
potentials are obtained by replacing c; by ¢,(t) 
in the formula for D(x). These are soliton 
solutions of the KdV equation. 

R. Hirota developed a method of treating 
functions like D(x) directly for most of the 
equations in the soliton theory (Lecture notes 
in math. 515, Springer, 1976). 

A certain geometric method that enables 
one to obtain solutions of the Sine-Gordon 
equation from a known solution has been 
studied in the transformation theory of sur- 
faces of constant negative curvature (G. Dar- 
boux, Leçons sur la théorie générale des sur- 


ert) cue. 


faces, Chelsea, 1972, vol. 3, ch. 12), and its 


generalizations are called Backlund transfor- 
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mations. Soliton solutions of the KdV equa- 
tion can also be constructed by this method. 
Relations to the inverse scattering method, 
differential systems, and transformation groups 
have also been studied (Lecture notes in math. 
515, Springer, 1976). 


E. Periodic Problem 


Let the potential u(x) be of period L and con- 
sider Hill's equation (— 268 Mathieu Func- 
tions E) Lf — Af. The real (4-) axis is divided 
into intervals of unstable solutions (u-intervals) 
alternating with intervals of stable solutions. 
One of the u-intervals is of the form (—00, åo] 
and the others are finite, possibly degenerating 
to points. The special potentials that have only 
a finite number of nondegenerate u-intervals 
are the periodic analog of the reflectionless 
potentials, and are called the finite-gap (or 
-band) potentials. 

Consider the eigenvalue problem Lf = Af, 
f(x) 2 f(x -- 1) 20 for a fixed real parameter 1. 
Exactly one of its eigenvalues belongs to each 
finite u-interval. These eigenvalues move with 
t, but those in degenerate intervals cannot 
move. 

Let u be a finite-gap potential, F — [25;.,, 
À3;] ( — 1, ..., g) be the nondegenerate (finite) 
u-intervals, and u(t) be the associated eigen- 
values in J;. The potential is recovered by the 
formula 


Put P(4)— II;44(4 — 2j, realize the *Riemann 
surface S of the hyperelliptic curve w? — P(z) as 
a two-sheeted cover of the *Riemann sphere, 
and consider the y(t) as points on S. Let o 
(j=1,...,g) bea basis for the space of the 
*differentials of the first kind on S. Fix a point 
P, in S and put 


g y(t) 

w(t)= Y. | Qj 

k=1 J po 
(— 3 Abelian Varieties L). Then the locus 
Deutz... w, (2) ( —90 « 1 < oo) on the *Jacobian 
variety turns out to be a straight line, the 
direction v, being determined by the *periods 
of certain ‘differentials of the second kind on 
S. Employing the solution of *Jacobr's inverse 
problem, we can write the potential in terms of 
the tRiemann theta function as 


2 
u(x)— ee O(xv, -- c) - C, (3) 
dx" 
where c is a certain constant vector and C is a 
constant. 
Suppose now that u(x, t) is a solution of the 
KdV equation which is a finite-gap potential 
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for each t. Then g and 7; are preserved in time. 
The determination of the direction v, on the 
Jacobian variety is similar to that of v,, and 
the solutions of the KdV equation are ob- 
tained by replacing the vector c by tv,+c’ in 
(3). The case g — 1 is the elliptic traveling wave 
solution (— Section B). Most of these results 
have been extended to the general periodic 
problem (H. P. McKean and E. Trubowitz, 
Comm. Pure Appl. Math., 29 (1976)). 


F. Two-Dimensional KdV Equation 


Let S be a compact *Riemann surface of tgenus 
g, and let p, be a fixed point on S. Put F(x)— 
a,K" +... +d) and G(k)- b,k" -...--bg. A 
function y(x, y, t, p of peS and of x, y, t is 
uniquely determined by the following con- 
ditions: (a) it is meromorphic on S—{p,,}, and 
its pole ‘divisor is a general divisor of degree g 
and does not depend on x, y, t; (b) for a local 
parameter z at p, (z(p,) 20) and k=z ', y= 
y exp( — kx — F(k)y — G(Kk)t) is holomorphic 
near pp, and Wo(p,,)= 1. 

Moreover, there is a differential operator 
L=a,D" a, 4,D" 14+ 30-4 u(x, y, t) D? such 
that y, — Ly. Expanding Yo at p, as 1 + 
È- 55 y. t)z’, one can express the coeffi- 
cients u; by 6,65, .... Analogously, there is an 
M =b, D" b, D"! +E;-o vD’ such that 
V, — My. The operators L and M satisfy the 
relation 


L,—M,=[L, M], (4) 


which is a generalization of the Lax represen- 
tation. The coefficients of L and M satisfy a 
certain system of nonlinear differential equa- 
tions (V. E. Zakharov and A. B. Shabat, Func- 
tional Anal. Appl., 8 (1974); I. M. Krichever, 
Functional Anal. Appl., 11 (1977)). 

Example. Let F(k)=x? and G(k) 2 x? - c. 
Then one finds 


L=D? +u, M=D?+(3u/2+c)D +v, 


where u= — 26, and v2 36,6, — 361 363. 
Eliminating v from (4), one has 


3u,, + ( = 4u, + 4cu, + Uxxx + uuy) SC 0, 


the so-called two-dimensional KdV equation 
(Kadomtsev-Petvyashvili equation). If u(x, y, t) 
does not depend on y, the equation reduces to 
the KdV equation, and if u does not depend 
on t, to the Boussinesq equation. 

The condition for reduction to these spe- 
cial cases can be described in terms of the 
meromorphic functions admitted by S. Sup- 
pose that there is a meromorphic function 
E,(p) holomorphic for p # p and of *prin- 
cipal part F(x) at p— p, . Then y is writ- 
ten as (x,t, p)exp{E,(p)y}, and the coeffi- 
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cients of L and M do not depend on y. 9 
satisfies Lp = Eq, and (4) reduces to the Lax 
representation. 

If such an E, exists, S is hyperelliptic, and 
Pa 1s one of its *branch points over the Rie- 
mann sphere. Thus the result of the previous 
section is recovered. 


G. Solvable Models in Field Theory 


The Sine-Gordon equation has been studied 
extensively as a solvable model in *field theory. 
It is a special case of a field in two space-time 
dimensions with values in a *symmetric space; 
this can also be treated by a variant of the 
inverse scattering method (V. E. Zakharov and 
A. V. Mikhailov, Sov. Phys. JETP, 47 (1978)). 

Much work has been done on the semiclass- 
ical fquantization of equations encountered in 
soliton theory. Recently, a method of exact 
quantization (called quantum inverse scatter- 
ing) was developed (see, for example, L. A. 
Takhtadzhyan and L. D. Faddeev, Russian 
Math. Surveys, 34 (5) (1979)). 
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A. General Remarks 


The term special functional equations usually 
means functional equations that do not in- 

volve limit operations. Such functional equa- 
tions appear in various fields, but there is no 
systematic method for solving them. Usually 
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they are solved by reduction to functional 
equations of some standard type. In this article 
functions are all real-valued functions of real 
variables unless otherwise specified. 


B. Additive Functional Equations and Related 
Equations 


Suppose that we are given an equation 


fr y)=f(x)+ f(y). (1) 


Clearly, f(x)=cx (c a constant) is a solution. If 
f(x) is continuous, (1) has no other solution 
(Cauchy). The same conclusion holds under 
any one of the following weaker conditions: 

(i) f(x) is continuous at a point; (ii) f(x) is 
bounded in a neighborhood of a point; (iit) 
f(x) is tmeasurable in a neighborhood of a 
point. However, it was shown by G. Hamel 
and H Lebesgue by means of *transfinite in- 
duction that equation (1) has infinitely many 
nonmeasurable solutions. On the other hand, 
it was proved by A. Ostrowski [2] that if a 
solution f(x) of equation (1) does not take any 
value between two distinct numbers for x on 
a set of positive measure, then f(x) 1s con- 
tinuous. This result can be extended to the 
case where x is a point (x,, ..., x,) of an n- 
dimensional Euclidean space. In this case, any 
continuous solution is of the form 


f(à- Y Gx; 


(C; constant). 
Next, we consider the equation 


f xty\ S+S) 
X x ` 





Q) 


Any solution of equation (1) satisfies this equa- 
tion. When x is a point (x, ..., x,) of an n- 
dimensional Euclidean space, any continuous 
solution of (2) is of the form f(x) — ZIL, C;jx; 
Co (C; constant). If a solution f(x) of (2) de- 
fined on a tconvex set K does not take any 
value between two distinct numbers for x on a 
set of positive measure, then f(x) is continuous 
(M. Hukuhara). 

Consider the equation 


g(x + y)» g69gty). (3) 


If a solution g(x) vanishes at some point £, 
then g(x) z0. Excluding this trivial case, we 
assume that g(x) never vanishes. Then, putting 
y=x, we see that g(x)>0 for all x. The substi- 
tution f(x)—logg(x) then reduces equation (3) 
to equation (1). Thus we see that any continu- 
ous solution of (3) is of the form g(x)=exp(cx). 
Next, we consider the equation 


g(uv) = g(u)g(v). (4) 
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If a solution g(u) vanishes at some c£ #0, then 
g(u)=0. Excluding this case, we assume that 
g(u) £0 for u #0. For u, v0, by the substi- 
tution x —logu, y zlogv, we have an equation 
of the form (3). On the other hand, putting v= 
—1, we have g(— wu) - g( —1)g(u). Since g?(—1) 
— g(1)— 1, we see that any continuous solution 
of (4) is of the form lut or (sgnu)|ul* according 
as g(—1)=1 or —1. 


C. The General Addition Theorem and Related 
Functional Equations 


The general addition theorem is: If the equation 


f(x y) FC (x), FO) (5) 


has a continuous nonconstant solution f(x) on 
—æ « x € +o, then f(x) is strictly monotone, 
and F(u, v) is strictly monotone increasing and 
continuous with respect to u and v for a <u, 

v « B and satisfies x < F(u, v) « B for «<u, v< 

p. There is also a constant c satisfying F(c, c)= 
c, and the identity F(F(u,v), w) 2 F (u, F(v, w)) 
holds for any u, v, w in the interval (o, f). Con- 
versely, if F(u, v) is such a function, then (5) has 
a continuous nonconstant solution on —oo « 
x< +00. Let f(x) be such a solution. Then any 
other continuous solution is given by f(cx). 
When F(u, v) is continuously differentiable, a 
continuous solution f(x) of (5) can be obtained 
as a solution of the differential equation 


fo)-RE(G).ae  c-f'(0) 


satisfying the initial condition f(0) 2 a. 
Consider the equation 


F(f(x), f(y), f(x + y)) - 0. 


Suppose that F(u, v, w) is a polynomial in u, v, 
and w. If this equation has a tmeromorphic 
solution f(x), then f(x) must be a rational 
function, a rational function of expcx, or an 
elliptic function [1, p. 64]. 

Next, consider the equation 


fox y) f(x — y) 2Cf(x) + SO). (6) 


Any solution continuous on —oo « x « oo is of 
the form f(x)=cx?. When x is a point (x,, 
..., X4) of an n-dimensional Euclidean space, 
any continuous solution of (6) is given by a 
quadratic form f(x) — X7 ;c;;x;x;. 

Consider the equation 


fG y) fix») - 2fo9f(). (7) 


Any solution continuous on —oo <x « oo is 
of the form f(x) 2 cosh cx =(e*+e *y/2 or 
f(x) 2 cos cx. If f(x) is allowed to take com- 
plex values, then any continuous solution can 
be written in the form f(x) -(e** + e 9/2 in 
terms of a complex number b. A special case 
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is cos x, since b may take purely imaginary 
values. 


D. Schróder's Functional Equation 


Schróder's functional equation is 
f(8(x)) = cf G9). (8) 


where 0(x) is a given function and c is a con- 
stant. A general solution of (8) can be written 
as f(x) — f,(x)e(x), where f,(x)-40 is a particu- 
lar solution of (8) and @(x) is a general solu- 
tion of the equation Q(0(x)) = g(x). Suppose 
that there is a point a such that 0(a) 2 a, and 
0(x) and f(x) are both differentiable in a neigh- 
borhood of x =a. Then we have f’(a)=0 or 
0'(a) — c. Consider the case where 0'(a) 2 c, and 
suppose that O(x) is twice differentiable at 

x =a. When |c| 2|0'(a)| « 1, define 6,(x) (n= 

0, 1,2, ...) by 6, (x) =x and 0,(x) — 0(0,. ,(x)). 
Then the sequence {(6,(x—a)c~"} (n=0, 1, 

2, ...) converges uniformly in a neighborhood 
of x =a, and its limit function f(x) is a solu- 
tion of (8). When |c| 2|0'(a)| » 1, put 0(x)— u. 
Then we have the equation f(0 !(u)) - c !f(u), 
and the problem reduces to the previous case. 
The results obtained for equation (8) can be 
extended to the following system of equations: 


FO; (x), 0; (x), Sis, 0,(x)) 
=A, f(x) - 0f; (x) + gx), 


where the 0;(x) are given functions holomor- 
phic in a neighborhood of x =0, 0,(0)=0, 
the coefficients of f in the right-hand side of 
(9) are numbers such that the matrix A =(a,) 
(aj — 0 except for a; = A a; ,.;—ój is the tJor- 
dan canonical form of the matrix formed by 
1(06,/0x,), o], and the gx) are polynomials 
consisting of terms of the form constant x 
xq! x32..." with exponents m,, Mz, ...,m, 
for which 4;— Ai^ ... A. If O «|44| « 1(j— 1,2, 
..., n), then we can always choose the coeffi- 
cients of the polynomials g;(x) so that equa- 
tion (9) has a solution { f(x); holomorphic in 
a neighborhood of x 20. The same conclusion 
can be obtained for | 4| 1 (jo 1, ...,n). 
Consider Abel's functional equation 


f(0(x)) — f(x) 1. (10) 


If we put exp f(x) 2 (x), then we have Schró- 
der's functional equation 


q9(0(x)) — eo(x). 
Consider the equation 
f(x + I= A(x) f(x). (11) 


If we put v(x) «log f(x), then we have a ‘linear 
difference equation of the form 


j=1,2,...,n, (9) 


p(x + 1)— o(x) -log A(x). 
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A. Special Functions 


The term special functions usually refers to 
the classes of functions listed in (1)-(4) (other 
terms, such as higher transcendental functions, 
are sometimes used). (1) The *gamma function 
and related functions (— 174 Gamma Func- 
tion); (2) *Fresnel's integral, the terror function, 
the tlogarithmic integral, and other functions 
that can be expressed as indefinite integrals of 
elementary functions (— 167 Functions of 
Confluent Type D); (3) telliptic functions (— 
134 Elliptic Functions); (4) solutions of tlinear 
ordinary differential equations of the second 
order derived by the method of separation of 
variables in certain partial differential equa- 
tions, e.g., *Laplace's equation, in various 
*curvilinear coordinates. Recently, new types 
of special functions, such as !Painlevé's, have 
been introduced as the solutions of special 
differential equations. 

In this article we discuss class (4); for the 
other classes, see the articles quoted. Class 
(4) 1s further divided into the following three 
types, according to the character of the tsin- 
gular points of the differential equations of 
which the functions are solutions. Equations 
with a smaller number of singular points than 
those indicated in (1)-(3) below can be in- 
tegrated in terms of elementary functions. 

(1) Special functions of hypergeometric type 
are solutions of differential equations with 
three tregular singular points on the Riemann 
sphere. Examples are the thypergeometric 
function and the *Legendre function. Any 
function of this type reduces to a hypergeo- 
metric function through a simple transforma- 
tion (— 206 Hypergeometric Functions; 393 
Spherical Functions). 
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(2) Special functions of confluent type arc 
solutions of differential equations that are 
derived from *hypergeometric differential 
equations by the confluence of two regular 
singular points, that is, by making one of the 
regular singular points tend to the other one 
so that the resulting singularity is an *irregular 
singular point of class 1 (— 167 Functions of 
Confluent Type). Any function of this type can 
be expressed by means of fWhittaker func- 
tions, of which many important special func- 
tions, such as *Bessel functions, are special 
cases (— 39 Bessel Functions). Also, one can 
reduce to this type the tparabolic cylindrical 
functions, that is, the solutions of differential 
equations with only one singular point which 
is at infinity and is irregular of class 2. 

(3) Special functions of ellipsoidal type are 
solutions of differential equations with four or 
five regular singular points, some of which 
may be confluent to become irregular singular 
points. Examples are tLamé functions, tMath- 
ieu functions, and !'spheroidal wave functions 
(— 133 Ellipsoidal Harmonics; 268 Mathieu 
Functions). In contrast to types (1) and (2), 
functions of type (3) are difficult to charac- 
terize by means of tdifference-differential 
equations and have not been fully explored. 
Sometimes the term special function in the 
strict sense is not applied to them. To specify 
the special functions of types (1) and (2), 
the term classical special functions has been 
proposed. 


B. Unified Theories of Special Functions 


Though many special functions were intro- 
duced separately to solve practical problems, 
several unified theories have been proposed. 
The classification in Section A based on dif- 
ferential equations may be regarded also as a 
kind of unified theory. Other examples are: 

(1) Expression by *Barnes’s extended hyper- 
geometric function or its extension to the case 
of several variables by means of a definite inte- 
gral of the form 


[car dy)? ... gell (E— zy dt 


(— 206 Hypergeometric Functions). 

(2) A unified theory [14] that includes the 
gamma function and is based upon Truesdell's 
difference-differential equation 


OF (z, «)/6z = F(z, a + 1). 


(3) Unification from the standpoint of ex- 
pansions in terms of fzonal spherical functions 
of a differential operator (the Laplacian) in- 
variant under a transitive group of motions 
on a 'symmetric Riemannian manifold (— 437 
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Unitary Representations). With this approach 
a great variety of formulas can be derived in a 
unified way [3,4]. 
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A. General Remarks 


Throughout this article, X stands for a *Àcom- 
plex linear space and A for a *linear operator 
in X. Except when X is finite-dimensional, 

A need not be defined over all X. A linear 
operator A in X is by definition a linear map- 
ping whose ‘domain D(A) and ‘range R(A) are 
linear subspaces of X. A complex number 7 is 
said to be an eigenvalue (proper value or char- 
acteristic value) of A if there exists an x e D(A) 
such that Ax — Ax, x £0. Any such x is called 
an eigenvector (eigenelement, proper vector, 
characteristic vector) associated with 7. When 
X is a ffunction space, the word eigenfunction 
is also used. For an eigenvalue Z of A, the 
subspace M (å) of X given by 


M(4)2 Mi Als ix| Axa Ax], 


i.c., the subspace consisting of 0 and all eigen- 
vectors associated with 2, is called the eigen- 
space associated with A. and the number m(A) 
=dim M (4) is called the geometric multiplicity 
of 4. The eigenvalue 4 is said to be (geometri- 
cally) simple or degenerate according as m(A) 
=! or m(4) z 2. The problem of seeking eigen- 
values and eigenvectors ts referred to as the 
eigenvalue problem. 

When X is a ttopological linear space, the 
notion of eigenvalues leads to a more general 
object called the spectrum of A. Let 4 be a 
complex number and put A; — Al — A, where I 
is the *identity operator in X. Furthermore, 
put Ri «(A,) ! «(A41 — A)"', if the inverse 
exists. Then the resolvent set p(A) of A is de- 
fined to be the set of all 4 such that R, exists, 
has domain *dense in X, and is continuous. 
The spectrum o(A) of A is, by definition, the 
complement of p(A) in the complex plane, and 
it is divided into three mutually disjoint sets: 
the point spectrum o,(A), the continuous spec- 
trum o. (A), and the residual spectrum o Al 
These are defined as follows: op(A)={/|R, 
does not exist} — (4|4 is an eigenvalue of A}; 
o, (A) — (4| R; exists and has domain dense in 
X, but is not continuous}; og(A) — (4| R, exists, 
but its domain is not dense in X}. 

Let X be a ‘Banach space and B(X) the set 
of all *bounded linear operators with domain 
X. If A is a ‘closed operator in X, then Ae 
p(A) if and only if R € B(X). Moreover, o(A) 
is a closed set. In particular, if Ae B(X), then 
c(A) is a nonempty compact set. In this case 
the spectral radius r,(A) is defined as r,(A)= 
SUP reo 4y|Al- Then r(A) < | A" n=1,2,..., 
and || A"|| " >r (A), n 20. 
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In many problems of analysis crucial roles 
have been played by methods involving the 
spectrum and other related concepts. This 
branch of analysis is called spectral analysis. 
For an infinite-dimensional X the theory is 
well developed when X is a "Hilbert space and 
A is 'self-adjoint or tnormal. 


B. Eigenvalue Problems for Matrices 


Throughout this section let X be an N- 
dimensional complex linear space (N « oc) and 
A a linear operator in X. (We assume that A 
is defined over all X.) With respect to a fixed 
basis (W,,...,Wy) of X, the operator A is repre- 
sented by an N x N matrix, also denoted by A. 
Then the eigenvalues of A coincide with the 
roots of the tcharacteristic equation det(4/ — 
A)=0. There are no points of the spectrum 
other than eigenvalues, that is ¢(A) =a (A). 
Let 4€6(A). The multiplicity m(A) of 2 as a 
root of the characteristic equation is called 
the (algebraic) multiplicity of the eigenvalue å. 
The sum of m(A) over all the eigenvalues of A 
is equal to N. The eigenvalue 4 is said to be 
(algebraically) simple or degenerate according 
as m(4)— 1 or m(4) 22. Let v2 1,2, ..., and 
N,(A)= {x|(21— A)’x =0}. Then (N,(4)] forms 
a nondecreasing chain of subspaces M(4) — 
N,(4)c N,(4)c ..., which ceases to increase 
after a finite number of steps. When v > m(A), 
the space N,(4) is equal to a fixed subspace 

M (4), sometimes called the root subspace (or 
generalized eigenspace or principal subspace) of 
A associated with 7. A vector in the root sub- 
space is called a root vector (or a generalized 
eigenvector). Then dim M(4) 2 (4) and hence 
m(4) < (A). When A is a *normal matrix, 
M(4)2 M(A) and m(4)  m(A). 

If two matrices A and B are *similar, i.e., if 
there exists an invertible matrix P such that 
B= P'AP, then A and B have the same eigen- 
values with the same algebraic (and geometric) 
multiplicities. The same conclusion holds for 
A and A’, where A’ is the ttranspose of A. 

For the adjoint matrix A* we have o{A*)= 
c(A)& (4| 4ec(A)). For an arbitrary poly- 
nomial f the relation c(f(A)) 2 f(c(A) z 

{ {(A)|A€o(A)} holds (Frobenius's theorem). 
These relations can be extended to operators 
in a Banach space. In particular, o( f(A))= 
f(a(A)) if A is a bounded operator and f is a 
function holomorphic in a neighborhood of 
o(A) (for the tspectral mapping theorem 

— 251 Linear Operators). 

In the next four paragraphs, in which the 
spectral properties of tnormal or "Herman 
matrices is discussed, we introduce into X the 
Euclidean tinner product ( , ), regarding X 
as a space of N-tuples of scalars. Let A be an 
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N x N normal matrix. Then the eigenspaces 
associated with different eigenvalues of A are 
mutually orthogonal. Moreover, the eigen- 
spaces of A as a whole span the entire space 
X. One can therefore choose a tbasis of X 
formed by a tcomplete orthonormal set of 
eigenvectors of A. Specifically, there exists a 
basis (jj — L, ..., Nj of X such that Ag,= 
He: and (pi, ;) än, where à; is the *Kro- 
necker delta. Moreover, H4, ..., uy exhaust 
all the eigenvalues of A. In terms of the basis 
(65), an arbitrary xe X can be expanded as 


N 
Ax= 2 L(x, 9) 9;— x AP,X, (1) 
j=l Aea(A) 

where P; is the orthogonal tprojection on the 
eigenspace associated with the eigenvalue A. 

Of particular importance among normal 
matrices are Hermitian matrices and ‘unitary 
matrices. The eigenvalues of a Hermitian 
matrix are real, and those of a unitary matrix 
have the absolute value 1. 

Solving the eigenvalue problem of a normal 
matrix A leads immediately to the diagonali- 
zation of A. For instance, let U be the N x N 
matrix whose jth column is equal to g,. Herc 
the basis {¢;} is as before, and each g; is re- 
garded as a column vector. Then U is unitary, 
and the transform U*AU of A by U is the 
diagonal matrix whose diagonal entries are 
the 1j. The problem of transforming a *Hermi- 
tian form to its canonical form can also be 
solved by means of U. In fact, a Hermitian 
form Q — Q(x) on X is expressed as Q(x)— 
(Ax, x) with a Hermitian matrix A. For this 
A, construct U as before. Then by the trans- 
formation x = Uy of the coordinates of X, the 
form Q is converted to its canonical form 
Q — u,ly,l +... +uylyyl?. When X is a real 
linear space and A is a real symmetric matrix, 
U is an orthogonal matrix. By means of the 
orthogonal transformation x= Uy, the sur- 
face of the second order Q(x) — 1 in R" is con- 
verted to the form u, y? +... - uy yg — 1. The 
orthogonal transformation x — U y is called 
the transformation to principal axes of the sur- 
face Q(x) — 1. 

When A is not normal, it can be transformed 
into !Jordan's canonical form by a basis o 
taken from the root subspaces M{A). However, 
g; need not be orthonormal even when A is 
diagonalizable. 


C. Spectral Analysis in Hilbert Spaces 


Throughout the rest of this article except for 
the last section, X is assumed to be a Hilbert 
space with inner product ( , ). Furthermore, 
the most complete discussions will be confined 
to tnormal or ‘self-adjoint operators. A funda- 
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mental theorem in spectral analysis for such 
operators is the spectral theorem, which as- 
serts that a representation such as (1) holds 

in a generalized form. When the operator is 
*compact, we have only to replace the sum by 
an infinite sum (— 68 Compact and Nuclear 
Operators). In the general case we need a kind 
of integral. This is discussed in detail in Sec- 
tions D and E. The general theory of spectral 
analysis for nonnormal operators, however, is 
rather involved even in Hilbert spaces, but two 
important developments can be noted. One is 
the theory of Volterra operators, and the other 
is the theory of essentially normal operators. 
The former is discussed in Section H and the 
latter and its related results in Sections I and J. 


D. Spectral Measure 


Let # be a tcompletely additive class of sub- 
sets of a set Q, that is, (Q, 4) is a tmeasurable 
space. An operator-valued set function E— 
E(-) defined on Z is said to be a (self-adjoint) 
spectral measure if (i) E(M), Me Z,is an 
*orthogonal projection in X; (ii) E(Q) = I; and 
(iii) E is tcountably additive, that is, 


E( U m,)= Y E(M,) 


(‘strong convergence) for a disjoint sequence 
(M, of subsets in 2. A spectral measure E 
satisfies E(M N N)= E(M)E(N) 2 E(N) E(M), 
M, Ne&. Spectral measures which are fre- 
quently used in spectral analysis are those de- 
fined on the family Z, (Z) of all *Borel sets in 
the field of real (complex) numbers R (C). A 
spectral measure on 4, (2) is sometimes re- 
ferred to as a real (complex) spectral measure. 
For such a spectral measure E the support (or 
the spectrum) of E, denoted by A(E), is defined 
to be the complement of the largest open set G 
for which E(G)=0. A complex spectral mea- 
sure such that A(E)c R can be identified with 
a real spectral measure. 


Let E be a spectral measure on 23. and put 
E,=E((—«,A4]}), —o<i<oo. (2) 


Then E, satisfies the relations 


E,E, = E minta. w)» RR E,= Ep 
-lim E. = imr E= 
oe E E 3) 


where s-lim stands for strong convergence. A 
family | E;);.g of orthogonal projections satis- 
fying the relation (3) is called a resolution of the 
identity. Relation (2) gives a one-to-one cor- 
respondence between the resolutions of the 
identity and the spectral measures on 24. 

Let E be a spectral measure on &,, and let x, 
y€ X. Then the set function M —S(E(M)x, x)= 
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| E(M)x||? is a bounded regular *measure in 
the ordinary sense, and the set function M > 
(E(M)x, y) is a complex-valued regular tcom- 
pletely additive set function. For every com- 
plex Borel *measurable function f on R, the 
operator S( f) in X is defined by the relations 





nah | | AER 
isis | faxis y H 
xeD(S(f), yeXx. 


S(f) is a densely defined closed operator and 
is denoted by S(f) — (*., f(4) E(d4). The corre- 
spondence f  S( f) satisfies formulas of the 
so-called operational calculus (— 251 Linear 
Operators). In particular, $( f) — $( f)*, and 
hence S( f) is self-adjoint if f is real-valued. If f 
is bounded on the support of E, then S( f) is 
everywhere defined in X and is bounded. S( f) 
is sometimes called the spectral integral of f 
with respect to E. The operator S( f) can be 
defined in a similar way for a spectral mea- 
sure on &, (and for a more general spectral 
measure). 


E. Spectral Theorems 


For every self-adjoint operator H in a Hilbert 
space X, there exists a unique real spectral 
measure E such that 


H- | ` AE(dÀ). (5) 


— oc 


In other words, H and E correspond to each 
other by the relations 


E 





E AN(E(dÀ)x, x) « en]. 


— 00 


(Hx, y) — | A(E(dA)x, y), (6) 


xeD(H)  yeX. 


This is the spectral theorem for self-adjoint 
operators. The support of E is equal to the 
spectrum c(H), so that we can write 


H- | AE(dà)— | : Axsun A) E(dA), 
o(H) — o0 


where yẹ stands for the *characteristic func- 
tion of M. Formulas (5) and (6) are called the 
spectral resolution (or spectral representation) 
of the self-adjoint operator H. We call E the 
spectral measure for H, and the {E,} corre- 
sponding to E by formula (2) (or sometimes 

E itself) the resolution of the identity for H. 
Let 4 be a real number. Then A4eo,(H) if and 
only if E((41) 40. Also, Aeo((H) if and only if 
E(14]) 0 and E(V) #0 for any neighborhood 
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V of 4. The spectral measure E can be repre- 
sented in terms of the resolvent Rio H) = 
(oi — H) ! of H by the formula 


b-ó 


Ela Bye (R(u— cei; H) 


410 elo 2zi Jato 
— R(u* ei; H)} du 


(strong convergence). 

For every normal operator A in X, there 
exists a unique spectral measure E on the 
family of all complex Borel sets 23 such that 


A =| zE(dz). 
ro 


This 1s called the complex spectral resolution 
(or complex spectral representation) of the 
normal operator A. The support A(E) is equal 
to o(A). There are characterizations of point 
and continuous spectra similar to the case of 
self-adjoint operators. Normal operators have 
no residual spectra. For a unitary operator U, 
the support of the associated spectral measure 
is contained in the unit circle I, so that U can 
be represented as 


u- | e? F(d0) (7) 
r 


with a spectral measure F defined on I. For- 
mula (7) is the spectral resolution of the unitary 
operator U. 

For a self-adjoint operator H ={%,, AE(dZ.) 
the following two types of classification of 
o(H) are often useful. 

(i) The essential spectrum c,(H) is by defini- 
tion the set c(H) minus all the isolated eigen- 
values of H with finite multiplicity. When H 1s 
bounded this definition of the essential spec- 
trum coincides with that to be given in Section 
I for a general Ae B(X). o(H) - o,(H) is called 
the discrete spectrum of H. 

(it) The set X,.(H) (resp. X,(H)) (called the 
space of absolute continuity (resp. singularity) 
with respect to H) of all ue X such that the 
measure (E(dA)u, u) is absolutely continuous 
(resp. singular) with respect to the tLebesgue 
measure is a closed subspace of X that reduces 
H. The restriction of H to X,, (resp. X,(H)) is 
called the absolutely continuous (resp. sin- 
gular) part of H, and its spectrum, denoted by 
c, (H) (resp. o,(H)), is called the absolutely 
continuous (resp. singulzr) spectrum of H. Note 
that c,,(H) and o, (H) may not be disjoint. 


F. Functions of a Self-Adjoint Operator 


Let H = | AE(d4) be a self-adjoint operator 
in X. For a complex-valued Borel measurable 
function f on R, we define f(H) to be the oper- 
ator S( f) determined by (4) in reference to the 
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resolution of the identity E associated with H: 
f(H "| f) E(d4). 


For an arbitrary ae X, let L,(a) be the L,- 
space over the measure p, — u4() 2 (E(-)a, a). 
In other words, f € L.,(a) if and only if ae 
D(f(A)). The correspondence f f(H)a gives 
an isometric isomorphism between L,(a) and 
the subspace M(a)={ f(H)a| f € L;(a)] of X. 
(In particular, M (a) is closed.) H is reduced by 
M (a), and the part of H in M (a) corresponds 
to the multiplication f(4) 4f(A) in L,(a). 

For a given self-adjoint operator H there 
exists a (not necessarily countable) family 
{dg}ace of elements a, of X such that 
X 2 Y M(aj), (8) 

Zeg 
where È stands for the ‘direct sum of mutually 
orthogonal closed subspaces. Consequently, X 
is represented by the direct sum 2.6 L(d,) of 
L ,-spaces. If xe D(H) is represented by { foo 
in this representation, Hx is represented by 


(Afojeco- 


G. Unitary Equivalence and Spectral 
Multiplicity 


In this section X is assumed to be a tseparable 
Hilbert space. Then (8) can be made more 
precise. Namely, for a self-adjoint operator H, 
we can find a countable family Jo, "Zu of ele- 
ments of X such that 


X= 7 M(a,)= x Lal (9) 


Ha,,, 1$ "absolutely continuous 


with respect to Ha, n=1,2,.... (10) 


Furthermore, if {a,} is another family satisfy- 
ing (9) and (10), then jj, and y,, are absolutely 
continuous with respect to each other 
(Hellinger-Hahn theorem). j,. is said to be the 
maximum spectral measure. 

Two operators H, and H, are said to be 
unitarily equivalent if there exists a unitary 
operator U such that H, = U*H,U. A crite- 
rion for unitary eqvivalence of self-adjoint 
operators can be given in terms of the spectral 
representation given previously. Namely, let 
(at, i— 1, 2, be a sequence satisfying (9) and 
(10) with respect to H;. Then H, and H, are 
unitarily equivalent if and only if pan and Hap 
are absolutely continuous with respect to each 
other for all n= 1,2, .... 

A self-adjoint operator H is said to have a 
simple spectrum if there exists an ae X such 
that M(a)= X. Such an ae X is called a gen- 
erating element of X with respect to H. 
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Self-adjoint operators with simple spectra 
are closely related to Jacobi matrices. Let H 
be such an operator with a generating ele- 
ment ae X. Take a complete orthonormal set 
(G, Tz, in Lal such that G, — G,(^) is a poly- 
nomial of degree n — 1 and 4G,(^)e L;(a). Then 
Lol, ga = G,(H)a, is a complete orthonor- 
mal set in X. The matrix representation {dmn}, 
amn — (H gs, Gm) of H with respect to the basis 
{gn} has the following properties: (i) Gen —O0 if 
[m —n|z:2; (ii) à, ,,, =Gayn £9; (iN) a, is real. 
Any infinite matrix {amn satisfying (i), (ii), and 
(iii) is called a Jacobi matrix. A Jacobi matrix 
determines a ‘symmetric operator whose tde- 
ficiency index is either (0,0) or (1, 1). Any self- 
adjoint extension has a simple spectrum. (For 
more details about Jacobi matrices and their 
applications — [8].) 


H. Triangular Representation of Volterra 
Operators 


A linear operator A in a Hilbert space X is 
called a *Volterra operator if it is ;compact 
and fquasinilpotent De, 0 is the only spec- 
trum). The name is justified because under 
very general assumptions such an operator is 
unitarily equivalent to the integral operator 
of Volterra type in the vector-valued L, space 
on [0, 1]. Let $ be a maximal ‘totally ordered 
family of orthogonal projections in X such 
that PX is an tinvariant subspace of a Volterra 
operator A for every Pe $B. Such a family $ 
always exists and is called a maximal eigen- 
chain of A. Then generalizing the triangular 
representation of nilpotent matrices, we have 
the integral representation 


4=2i| PA, dP, 
RU 


where A, — (4 — A*)/(2i) is the imaginary part 
of A and the integral 1s the limit in norm of 
approximating sums of the form ÈQ; A,(P,— 
P, 4) for finite partitions 02 Py) < P, <... < P,— 
I of $ in which Q, is an arbitrary projection 
in $B such that P, ., <Q; & P, (M. S. Brodskii). 
Conversely, let P be a totally ordered family of 
orthogonal projections that contains 0 and 
the identity. If the integral A = fy PBdP con- 
verges in norm for a compact linear operator 
B, then A is a Volterra operator and 3 is an 
eigenchain of A. If, moreover, B is self-adjoint, 
we have B— A, (I. C. Gokhberg and M. G. 
Krein; Brodskii). Furthermore, assume for 
simplicity that 35 is continuous in the sense 
that for every P, < P, in 3B there exists an 
element P in *B such that P, < P« P,. If Bisa 
*Hilbert-Schmidt operator, then the integral 

A = (4 PBdP converges in the *Hilbert-Schmidt 
norm and the mapping B A is an orthogonal 
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projection to the set of all Volterra operators 
of Hilbert-Schmidt class possessing ‘8 as an 
eigenchain (Gokhberg and Krein). Volterra 
operators with the imaginary part of the *trace 
class are especially important for applications. 
In this case we have the following fundamen- 
tal theorem on the density of the spectrum of 
the real part Ag of the Volterra operator A: 
Let n, (r; Ag) and n. (r; Ag) be the numbers of 
eigenvalues of Ap in the intervals [1/r, 0c) and 
( —oo, —1/r], respectively. 


Then lim n,(r; Ag)/r = lim n_(r; Ag) — h/m. 
The number h is given by 
i- | dP A, dP]l, 

KÉ 


zinfy I(B; — P; -,) A(P; — Pj JI, 


where the norm is the ttrace norm and the 
infimum is taken over all finite partitions P; of 
a maximal eigenchain *B for A. We refer for 
further details and applications to the books 
by Gokhberg and Krein [9, 10]. 


I. Fredholm Operators and Essential Spectra 
of Operators 


Throughout Sections I and J we assume that 
X is a separable infinite-dimensional complex 
Hilbert space, and we consider only bounded 
linear operators in X. The set BX) of all 
*compact linear operators in X is a *maximal 
two-sided ideal of the *C*-algebra B(X) of all 
bounded linear operators. The simple quotient 
C*-algebra A(X)=B(X)/B©(X) is called the 
Calkin algebra. We denote the quotient map- 
ping by z: B(X)— A(X). Then an operator 
Ac€B(X)is a *Fredholm operator if and only if 
its image n(A) is an invertible element of A(X). 
Let F(X) be the set of all Fredholm operators 
in X, and let F,(X), ne Z, be its subset of all 
operators of *index n. F,(X) is a connected 

set in B(X), and in particular, Fo(X) is the 
inverse image of the connected component of 
the identity in the multiplicative topological 
group z(F(X)) of all invertible elements in 
A(X). The index gives the group isomor- 
phisms F( X)/F; (X) = n(F(X))/n(Fo(X)) = Z. 
More generally, we have for any compact 
topological space Y the group isomorphisms 
LY, F(X)] x [Y X(F(X))] = K(Y) of the groups 
of homotopy classes of continuous mappings 
and the K-group in the *K-theory (M. F. 
Atiyah [ 12). 

If N is a tnormal operator, its essential 
spectrum c,(N) is defined to be the set of all 
4€c(N) that is not an isolated eigenvalue of 
finite multiplicity. Let H, and H, be self- 
adjoint operators. Then o,(H,)=0,(H,) if and 
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only if U*H, U=H,+ K for a unitary opera- 
tor U and a compact operator K (Weyl-von 
Neumann theorem). 

I. D. Berg and W. Sikonia (1971) extended 
this result to normal operators N, and N,. 
Moreover, for any compact subset Y of € 
there exists a normal operator N such that 
c, (N)- Y. Hence it follows that the essential 
spectrum o,(N) of a normal operator N coin- 
cides with the *spectrum e(x(N)) of the image 
in A(X). Thus we define the essential spec- 
trum o,(A) of an arbitrary operator A to be 
the spectrum c(z(A)) in A(X). An operator 
A € B(X) is said to be essentially normal (resp. 
essentially self-adjoint, essentially unitary) if 
1(A) is normal (resp. self-adjoint, unitary) in 
A(X). (Note that this definition of essentially 
self-adjoint operators is completely different 
from that in 251 Linear Operators E.) Since an 
essentially self-adjoint operator is the sum of a 
self-adjoint operator and a compact operator, 
the Weyl-von Neumann theorem classifies 
essentially self-adjoint operators up to *unitary 
equivalence modulo B™(X). An operator A is 
essentially normal if and only if the commu- 
tator [A, A*] is compact, but it need not be 
the sum of a normal operator and a compact 
operator. For example, let V be the unilateral 
shift operator that maps the orthonormal basis 
e; of X into e;,, for every i=1,2,.... Then V is 
essentially unitary, but it cannot be written as 
the sum of a normal operator and a compact 
operator. The essential spectrum o,(V) is 
the unit circle, whereas the spectrum o(V) 
is the unit disk and ind(V — 2) — —1 for |4| « 1. 


J. The Brown-Douglas-Fillmore (BDF) Theory 


The following is the main theorem for essen- 
tially normal operators, due to L. G. Brown, 
R. G. Douglas, and P. A. Fillmore [14]. Let A, 
and A, be essentially normal operators. There 
are a unitary operator U and a compact oper- 
ator K such that U* A, U=A,+K if and only 
if 0.(4,) — 0,(A,) and ind(A, — 4) 2 ind(A, — 4) 
for every A in the complement of the essential 
spectrum. An essentially normal operator 

A is the sum of a normal operator and a 
compact operator if and only if ind(4 — 4) 20 
for every å in the complement of c,(A). 

To prove this and many other facts, they 
developed the theory of extension of B(X) by 
the C*-algebra C(Y) of continuous complex- 
valued functions on a compact metrizable 
space Y [14-16]. This revealed deep relations 
between the theory of operator algebras on 
Hilbert spaces (— 36 Banach Algebras, 308 
Operator Algebras) and algebraic topology (in 
particular, K-theory; — 237 K-Theory). Exten- 
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sion theory also gives a natural setting for the 
"index theory of elliptic differential operators 
due to Atiyah and I. M. Singer [13,16]. 

An extension of B“(X) by C(Y) is a tshort 
exact sequence 


02 BO?(X)5 ESC(Y)20 (11) 


of a C*-subalgebra E of B(X) and *«-homo- 
morphisms, i.e., E is a C*-subalgebra of B(X) 
containing the identity / and including B'?(X) 
as a C*-subalgebra, and ¢ is a «-homomor- 
phism onto C(Y) whose kernel is equal to 
B(X). Or equivalently, an extension is a uni- 
tal (identity preserving) *-monomorphism Tt: 
C(Y)— A(X) defined by zz x0 `t. (For general 
extension of C*-algebras — 36 Banach Alge- 
bras.) Two extensions (E,, 9,) and (E,, 2) 

(or 1, and t,) are said to be equivalent if there 
exists a *-isomorphism y: E, +E, such that 
95,0 V =@, (or equivalently there exists a uni- 
tary operator U such that z(U*) z,(f)n(U)— 
t4( f) for every f e C(Y)). We denote by Ext(Y) 
the set of all equivalence classes of extensions 
of B^(X) by C(Y). 

Let A be an essentially normal operator in 
X with the essential spectrum o,(A)= Y. Then 
the C*-algebra E, generated by B^(X), A and 
the identity J, and the «-homomorphism o, of 
E, onto C(Y) which sends A to the function 
y(z)=z define an extension (E,, 4). It is easy 
to see that two essentially normal operators A, 
and A, are unitarily equivalent modulo B“(X) 
if and only if 6,(4,) 2 0,(A,) and the extensions 
(E4,,@4,) and (E, 9.) are equivalent. Con- 
versely, if Y is a compact subset of € and (11) 
is an extension, then (E, o) is equivalent to 
(E1, 94), where A is an essentially normal 
operator in E such that (A) — y. 

Extensions of B?^(X) by C(Y) appear also 
in different parts of analysis. Let X be the Hil- 
bert space L,(M) on a compact differentiable 
manifold M relative to a fixed smooth measure 
and let E be the C*-subalgebra of B(X) gener- 
ated by all zeroth-order tpseudodifferential 
operators together with B(?^(X). Then E and 
the tsymbol mapping o: E C(S*(M)) define 
an extension of B^(X) by C(S*(M)), where 
S*(M) is the tcosphere bundle of M. This 
extension is closely related to the *Atiyah- 
Singer index theorem. Let Q be a ‘strongly 
pseudoconvex domain in C". Then the C*- 
algebra generated by *Toeplitz operators with 
continuous symbol gives rise to an extension 
of B?9(H,(0Q)) by C(0Q). 

Let 1, and t, be +-monomorphisms from 
C(Y) into A(X) and a, 2[1,] and a, =[1,] 
be corresponding elements in Ext(Y). Then 
the sum a, +a, € Ext(Y) is defined to be the 
equivalence class of t: C(Y)—> A(X) which 
sends f to t, (f) ® 12(fyE A(X) D A(X)c 
A(X Q X)z A(X). It does not depend on the 


1451 


choice of t,, t; and the unitary X @ X x X. 
An extension t: C(Y)— A(X) is said to be triv- 
ial if there exists a unital «-monomorphism 
c:C(Y)5 B(X) such that t=z00. For each 
compact metrizable space Y there exists a 
unique equivalence class of trivial extensions 
in Ext(Y). Ext(Y) 1s an Abelian group in 
which the class of trivial extensions is the 
identity element. The extension (E,, p4) for 
an essentially normal operator A is trivial 

if and only if A=N+K with N normal and 
K compact. Hence follows the Weyl-von 
Neumann-Berg-Sikonia theorem. The BDF 
theorem for essentially normal operators is 
proved by the pairing Ext(Y) x K'(Y)+Z 
defined by the index, where K'(Y)=K(SY*)= 
lim, LY, GL(n, C)] (— 237 K-Theory; [13]). 
The induced homomorphism y,, : Ext(Y)— 
Hom(K'(Y), Z) is always surjective, and it 

is an isomorphism for Y c R? or Y=S" but 
not for Y c R*. 

Ext is a tcovariant functor from the cate- 
gory of compact metrizable spaces to the 
category of Abelian groups. It is thomotopy 
invariant. Define for n=0, 1, ... the group 
Ext, .,(Y) by Ext(S" Y), where S"Y is the n-fold 
tsuspension. Then we have the periodicity 
Ext, , (Y) z Ext, (Y). Moreover, for each pair 
of compact metrizable spaces Y 2 Z we have 
the long exact sequence 


Ext (Z) 5 Ext( Y) 5 Ex(Y/Z) 5 Ext TAS NR 


where Y/Z is the space obtained from Y by 
collapsing Z to a point and 0 is q,r, ! : 

Ext,( Y/Z) — Ext,(SZ) defined by q: YUCZ~> 
(YUCZ)/Y=SZ and r: YUCZS(YUCZJCZ = 
Y/Z, CZ being the tcone over Z. Ext, satisfies 
the *Eilenberg-Steenrod axioms for homology 
theory except for the dimension axiom, which 
is replaced by Ext,(S?) « Ext(S" !) 2 Z for n 
even and =0 for n odd. 

The von Neumann algebra B(X) is classified 
as a ‘factor of type I,,. In the case of a factor 
M of type II, another index theory has been 
developed by H. Breuer [17] and others re- 
placing B((X) by the closed ideal of M gen- 
erated by finite projections and using the 
tsemifinite trace on M for the dimensions of 
kernels and cokernels of operators in M. 


K. Spectral Analysis in Banach Spaces 


Spectral analysis becomes rather involved for 
general operators in a Banach space as well as 
for nonnormal operators in Hilbert space. 

For a fcompact operator A, the nature of 
the spectrum o(A) and the structure of A in the 
root subspace associated with a nonzero eigen- 
value are well known (— 68 Compact and 


390 Ref. 
Spectral Analysis of Operators 


Nuclear Operators). However, a full spectral 
analysis may not be possible without further 
assumptions. 

For a tclosed operator with nonempty re- 
solvent set ot A), an foperational calculus can 
be developed by means of a function-theoretic 
method based on the fact that the resolvent 
R,=(AI— A)! is a B(X)-valued holomorphic 
function of 4 in p(A). In particular, the tspec- 
tral mapping theorem holds (— 251 Linear 
Operators G). 

A general class of operators having asso- 
ciated spectral resolution was introduced by 
N. Dunford. Let X be a Banach space. An 
operator Ee B(X) is called a projection if E? = 
E. As before we can define a (projection- 
valued countably additive) spectral measure on 
28 An operator Ac B(X) is said to be a spec- 
tral operator if there exists a spectral measure 
E on £, satisfying the following properties: (1) 
E(M)A = AE(M), Mea: (it) o(Al ganx) = M, 
Me&,, where Aly is the restriction of A to Y 
and M is the closure of M; (iii) there exists a 
k z 0 such that | E(M)| <k for all Me. E is 
unique. À spectral operator A is expressed as 
A=S+N, where S= [czE(dz) and N is *quasi- 
nilpotent. A is said to be a scalar operator if N 
— 0. Unbounded spectral operators are defined 
similarly, with (i) E(M)A c AE(M) in place of 
(i). However, for unbounded spectral opera- 
tors A we no longer have the decomposition 
A=S+N. (For more details about spectral 
operators — [3]. For other topics related to 
the material discussed in this section — 68 
Compact and Nuclear Operators; 251 Linear 
Operators; and 287 Numerical Computation 
of Eigenvalues.) 
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A. General Remarks 


Let E? be a Euclidean 3-space with a standard 
coordinate system (x,, x5, x4) and E? be the 
x4 x3-plane. Consider a domain D in E^ asa 
vibrating membrane with the fixed boundary 
OD. Then the height x4 = F(x,,x5,t) obeys 

the differential equation of hyperbolic type, 
 F(x,,x5,t)/0t? 2 c? AF(x,, x, TL where A= 
6x1 + 0? /0x2 denotes the *Laplacian in E? 
and c is a constant (we put c= 1 in the follow- 
ing). For solutions of the form F(x,,x,,t)— 
U(x,, x;)V(t), U is a solution of the ?Dirichlet 
problem in Dc E?; AU +AU =0, where A 

is a positive constant called an teigenvalue 

of A. Solutions of the form F(x,,x5,t)— 
U(x,,x,)sin./At represent the pure tones that 
the membrane produces as normal modes. 
That is, the shape of D is related to the possi- 
ble sounds or vibrations (i.e., to the eigen- 
values of A) through the Dirichlet problem. 
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The set of eigenvalues of A is called the spec- 
trum of D and is denoted by Spec(D). There 
arises the question of how much information 
Spec(D) can impart about the geometric prop- 
erties (e, the shape, extent, and connected- 
ness) of D. Generally, spectral geometry is the 
study of the relations between the spectrum 

of domains D of *Riemannian manifolds or 
compact Riemannian manifolds (M, g) and the 
geometric properties of D or (M, gl. 


B. Spectra 


Let Z?(M) denote the space of smooth *p- 
forms on a compact m-dimensional C”— 
Riemannian manifold (M, g). Then eigenvalues 
of the *Laplacian (Laplace-Beltrami operator) 
^ acting on Z?(M) are discretely distributed m 
[0, co), and each multiplicity is finite (— 68 
Compact and Nuclear Operators, 323 Partial 
Differential Equations of Elliptic Type). The 
spectrum for p-forms Spec'(M, g) is (4,1 S 
Àp,2 & ... }, where each eigenvalue is repeated 
as many times as its multiplicity indicates. 
If 0 is an eigenvalue, its multiplicity is equal 
to the pth "Bet number of M. 

In the following, mainly the case p=0 
is explained. Spec?(M, g) is denoted by 
Spec(M, g). 0 is always in Spec(M, g) and its 
multiplicity is 1. So we put 49 =0, and A, is 
the first nonzero eigenvalue. A geometric 
meaning of Af at x for a function f is as fol- 
lows: If (y,j5-; are m geodesics mutually or- 
thogonal at x and parametrized bv arc length, 
then (Af) (x)= 2Z4G 0 7,) (0). 

Let {@,;}72 be an orthonormal basis of 
4* (M) consisting of eigenfunctions: Aq; + 4; 9; 
—0, (91,95? la Gi 9; = 0. Then the tfunda- 
mental solution E(x, y, t) of the heat equation 
^ —ó/t 20 is given by E(x, y, t)  X;e g(x) & 
girl as a function on M x M x (0, œ). We 
put Z(t)= fm E(x, x, t) Zug ^". Z(t) and 
Spec(M, g) are equivalent. The Minakshisun- 
daram-Pleijel asymptotic expansion of Z(t), 


Z(t) - (1/Ant)"? (ag ta,t+a,t?+...), tlO, 


is the bridge connecting Spec(M, g) and geo- 
metric properties of (M, g), because dg, di... 
can be expressed as the integrals of functions 
over M defined by g =(g;;), the components 
Ri, of the *Riemannian curvature tensor, and 
their derivatives of finite order [1]. ag is the 
volume of (M, g) and a, —(1/6) fa S, where S is 
the *scalar curvature. a; was calculated by H. 
P. Mckean, I. M. Singer, and M. Berger, and 
a, by T. Sakai. 

Let D be a bounded domain in E? or, more 
generally, a bounded domain in a Riemannian 
manifold, and assume that the boundary OD is 
piecewise smooth. For smooth functións which 
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take the value 0 on OD, eigenvalues of the 
Laplacian A are discretely distributed tn (0, oo), 
and each multiplicity is finite. We denote the 
spectrum of D by Spec(D) 2 (4, « 45; & ...]. The 
multiplicity of A, is 1, and an eigenfunction 
f corresponding to 4, takes the same sign in 
D. The behavior of Z(t) 2 X;e ^' for D is dif- 
ferent from that for (M, g) since Z(t) reflects the 
geometric situation of 0D in this case. 

(M, g) and (N, h), or D, and D, are called 
isospectral if they have the same spectra. 

Examples for which the spectrum is explic- 
itly calculable are as follows: spheres (ST. go = 
canonical), real projective spaces (RP”, go), 
complex projective spaces (CP", Jo, go), (S?"*!, 
g,= suitably deformed from go), flat tori, (and 
for domains D) unit disks, rectangles, equi- 
lateral triangles, etc. 


C. Congruence and Characterization 


Let D, and D, be bounded domains in E?. An 
open question is whether isospectral D,, D, are 
congruent. Concerning this, there is M. Kac’s 
paper with the famous title “Can one hear the 
shape of a drum?” Let D be a bounded domain 
in E? with smooth boundary 0D. If D has r 
holes, then 


Z(t) - A(D)/4nt — L(0Dy/A. /Ant +(1 — r2, 
t |0, 


holds (A. Pleijel, Kac, P. Mckean, I. M. Singer; 
— [7]), where A(D) denotes the area of D and 
L(ôD) denotes the length of OD. This theorem 
implies that the area, the length of ôD, and the 
number of holes are determined by Spec(D). 

In particular, if D, is a unit disk, Spec(D,) = 
Spec(D,) implies that D, and D, are congruent. 
There are some other results on Z(t) for do- 
mains D of surfaces in E" or for domains D in 
Riemannian manifolds (M, g). 

Two isometric (M, g), (N, h) are isospectral. 
Concerning the question of whether isospec- 
tral (M, g), (N, h) are isometric, there are some 
counterexamples. The first is the case of two 
flat tori T!°, given by J. Milnor. Examples 
with nonflat metrics were given by N. Ejiri 
using warped products and by M. F. Vigneras 
for surfaces of constant negative curvature. 

In those examples, M and N are homeo- 
morphic. A. Ikeda showed that there are ‘lens 
spaces that are isospectral but not homotopy 
equivalent. 

Examples of affirmative cases are as follows: 
Spec(M, g)=Spec(S”, go), m<6, implies that 
(M, g) is isometric to (S", gel (M. Berger, S. 
Tanno). The result is the same for (RP", go), 
m<6. For n<6, (CP", Jo, go) is characterized 
by a spectrum among *Káhlerian manifolds 
(M, J, g). 
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The number of nonisometric flat tori (or 
more generally, compact flat Riemannian 
manifolds) with the same spectrum is finite (M. 
Kneser, T. Sunada). 

If one considers spectra for two types of 
forms, then the situation turns out to be sim- 
pler. For example, if Spec?(M, g) 2 Spec?(N, h) 
for p=0, 1, then (M, g) is of constant curvature 
K if and only if (N, h) is also and K' — K (V. K. 
Patodi [9]). 


D. The First Eigenvalue 


The first eigenvalue 4, for (M, g} or fora 
domain D in (M, g) reflects the geometric situ- 
ation of (M,g) or D. A lower bound of 7, given 
by J. Cheeger is 2, >h(M)?/4, where h(M) is 
the isoperimetric constant, defined by 


h(M)=inf{vol(S)/min[vol(M,), vol(M,)]}, 


where the inf is taken over all smoothly em- 
bedded hypersurfaces S dividing M into two 
open submanifolds M,, M,, 0M, =0M,=S, 
and vol means the volume. 1/4 is the best 
possible estimate. 

Let p denote the maximum radius of a disk 
included in a simply connected D c E?. Then 
44 Z M(Ap?) (W. K. Hayman, R. Osserman). 

If (M, 9) has tRicci curvature >k>0, then 
2, Z2 mk/(m — 1) holds, and the equality holds if 
and only if (M, g) is isometric to (Gm. (1/k)go) 
(A. Lichnerowicz, M. Obata). 4, can also be 
estimated from k and the diameter d(M) of M 
(P. Li, S. T. Yau). 

To obtain upper bounds of 4, the minimum 
principle of 4, is useful. We state it only for 
(M, ot 


ay mint) | vn | e 


where inf is taken over all piecewise smooth 
functions f satisfying fu f—0, f #0, and ( , ) 
denotes the local inner product with respect to 
g. An upper bound of 4, for (M, g) of nonnega- 
tive curvature is 4, <c(m)/d(M)?, where c(m) is 
some constant depending on m (Cheeger). 

For (M, g) or D a submanifold of another 
Riemannian manifold, there exist some esti- 
mates of 4, in terms of second fundamental 
forms, etc. 

A,Z j^ /(A(D)) holds for D c E?, and the 
equality holds if and only if 0D is a circle (C. 
Faber, E. Krahn), where j denotes the first zero 
of the *Bessel function Jy. This estimate is 
generalized in many directions; for example, 
for D c (M^, g) in terms of the integral of the 
Gauss curvature, etc. It is very useful to note 
that the estimate of 4, for D is deeply related 
to the isoperimetric inequality (— 228 Isoperi- 
metric Problems). 
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E. Hersch Type Theorem 


With respect to the first eigenvalue /,(g) 

and the volume Vol(M, g) of (M,g), A4,(g): 
vol(M, g)* is invariant under a change of 
metric g c?g (c is a constant). Hersch’s prob- 
lem is stated as follows: Is there a constant 
k(M) depending on M so that for any Rie- 
mannian metric g on M, 4,(g): vol(M, g)?" < 
k(M)? J. Hersch proved this for a 2-sphere M 
= S? with k(S?)= 8n, and in this case the 
equality holds if and only if g is proportional 
to the canonical metric go. 

The Hersch type theorem holds for an 
oriented Riemann surface M of genus q with 
k(M)=8nx(q+ 1) (P. C. Yang, S. T. Yau). There 
is no such constant k(S") for an m-sphere S", 
m z 3 (H. Urakawa, H Muto, S. Tanno). 


F. The Multiplicity of 4; 


By a theorem of K. Uhlenbeck each eigenvalue 
for a Riemannian manifold (M, g) is simple. 
However, for (S”, go) the first eigenvalue 4, is 
m and its multiplicity is m+ 1. Furthermore, for 
some g, deformed from go, 4,(g,) of (S?"*!, g,) 
has multiplicity n? + 4n + 2, which is larger 
than m+ 1 (22n 4 2). 

The multiplicity m(4;) of the ith eigenvalue 
2; for a Riemann surface of genus q satisfies 
m(4;) € Aq - 2i - 1 (S. Y. Cheng, G. Besson). 


G. kth Eigenvalue 


The minimum principle for 4, of Spec(M, g) is 
stated as follows: Let f; be an eigenfunction 
corresponding to 4, O& i € k— 1. Define H, , 
to be the set of piecewise smooth functions 

f #0 orthogonal to each f,, Le, fu ff; - 0. Then 


i-i] | (Vf, Vf) | | "| 


where inf is taken over f € H,.,. We have the 
minimax principle for 4, of the first and second 
type. We state the second type only: 


d erer! B 
Lg+1 OF feLys, M M 


A,— inf sup 


where L, denotes a k-dimensional linear sub- 
space of Z9(M). From this, for 1-parameter 
metrics g, (a « u « b) on M, the continuity of 
A,(g,) with respect to u follows. 


H. Courant-Cheng Nodal Domain Theorem 


Let f be an eigenfunction on (M, g) or D. The 
set of all zero points of f is called the nodal set 
of f (or the nodal curve of f if m —2). Each 
connected component of the complement of 
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the nodal set in (M, g) or D is called a nodal 
domain of f. The nodal set of f is a smooth 
submanifold of (M, g) or D except for a set 

of lower dimension. The number of nodal 
domains of an eigenfunction corresponding to 
the ith eigenvalue is <i+1 for (M,g) and <i 
for D (Courant-Cheng nodal domain theorem). 


I. Estimate of N(A) 


N(A) is defined as the number of eigenvalues of 
(M, g) or D which are less than or equal to å. 
For Dc E?, H. A. Lorentz conjectured that the 
behavior of N(4) for 4— oo does not depend on 
the shape of D but only on the area A(D) of D, 
ie., lim; 4, N(A)/A= A(D)/An. This was proved 
by H. Weyl. Generally, for D or (M, g), the 
behavior of N(A) for 4— oo is vol(D)A"??/ 
(An)"? T (m/2 + 1), and this is related to the 

first term vol(D)/(4nxt)"? of the asymptotic 
expansion of Z(t) by tTauberian and tAbelian 
theorems. 


J. Spec(M, g) and Geodesics 


Let T" = E"/T be a flat torus, I' being the 
lattice for T". Let I'* be the lattice dual to T. 
Then Poisson's formula, 

y e **"* -(vol(T")/(4nt)"?) 3 e Jl, 
nef? ef 

gives a clear relation between Spec(T") = 
(41? |y, ye U*) and the set (xl xeU] of 
lengths of closed geodesics on T". 

If (M, g) satisfies some conditions, then 
Spec(M, g) determines the set of lengths of 
periodic geodesics (Y. Colin de Verdiere), and 
the spectrum characterizes those Riemannian 
manifolds whose geodesics are all periodic (J. 
J. Duistermaat, V. W. Guillemin). 


K. Spec"(M, g) and the Euler-Poincaré 
Characteristic 


Let (M, g) be oriented and even dimensional. 
Let E?(x, y, t) be the fundamental solution of 
the theat equation for p-forms. Corresponding 
to Z(t) for Spec(M, g), we get Z"(t— fy E^ = 
Ye ^v, Then 


» (—1)?Z?(t)= S (—1)? | ER = y(M), 


where y(M) denotes the tEuler-Poincare char- 
acteristic of M (Mckean, Singer). On the other 
hand, the tGauss-Bonnet theorem is y(M)— 
fu C, where C is a function on M expressed 

as a homogeneous polynomial of components 
of the Riemannian curvature tensor. Then 
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Patodi proved 


(—1}trE”=C+0(t), t0. 


"3 
Ma 


L. -Function 


Let (X, g) be a compact oriented 4k-dimen- 
sional Riemannian manifold with boundary 
€ X = Y and assume that some neighborhood 
of Y in (X, g) is isometric to a Riemannian 
product Y x [0, £). Define an operator B 
acting on forms of even degree on Y by 


Bw-(—0)y''*!(«d—d*)w, weG?"(Y), 


where * denotes the tHodge star operator and 
d denotes exterior differentiation on Y. Then 
B? =A holds. Using the spectrum (4j of B, we 
define the 4-function by 


n(s)= 3. (sgnugIup *. 
atO 
n(s) is a spectral invariant, and 
sen(x)= | Li(pi, -+--> py) — (0) 
X 


holds (Atiyah, Patodi, and Singer [2]), where 
sgn( X) is the signature of the quadratic form 
defined by the tcup product on the image of 
H?*(X, Y) in H?*(X), L, is the kth tHirzebruch 
L-polynomial, and p,,...,p, are the Pon- 
tryagin forms of (X, g). 


M. Analytic Torsion 


Let y be a representation of the fundamental 
group 7,(M) of (M, oi by the orthogonal group 
and E, be the associated vector bundle. Let A* 
be the Laplacian acting on E,-valued p-forms 
on M and {/% ;} be its spectrum. Then 





m d 
log T(M, x)= $ (ipio (eta?) 
p= i s=0 


is independent of the choice of g. T(M, y) is 
called the analytic torsion of M. T(M, y) is 
equal to the *R-torsion t(M, y) (W. Müller, 
Cheeger). 


N. Concluding Remarks 


An tisometry y of (M, g) commutes with the 
Laplacian and induces a linear transformation 
w* of each eigenspace V,. Using the asymp- 
totic expansion of X tr(V7)e ^, the Atiyah- 
Singer *G-signature theorem has been proved 
(H. Donnelly, Patodi). 

The Atiyah-Singer tindex theorem has been 
proved by using Gilkey’s theory and the heat 
equation (Atiyah, R. Bott, Patodi). 

Let (N, h) be a complete Riemannian mani- 
fold of negative curvature. Then A is extended 
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to an unbounded self-adjoint operator for 
L,(N). Generally A has a continuous spec- 
trum. Some conditions for (N, h) to have pure 
point spectrum were given in terms of curva- 
ture (Donnelly, P. Li). 

If D is a minimal submanifold of another 
Riemannian manifold, estimates of 4, are 
related to the stability of D (— 275 Minimal 
Submanifolds). 

On the nonexistence of the 1-parameter 
isospectral deformation (M, g9) (M, g,), there 
are results for (i) m —2 and gy of negative 
curvature, m z 3 and negatively pinched gy (V. 
Guillemin, D. Kazhdan); (ii) flat metrics gy (R. 
Kuwabara); and (iii) go of constant positive 
curvature (Tanno). 

As for spectral geometry for complex La- 
placian on Hermitian manifolds, there are 
results by P. Gilkey, Tsukada, and others. 


References 


[1] M. F. Atiyah, R. Bott, and V. K. Patodi, 
On the heat equation and the index theorem, 
Inventiones Math., 19 (1973), 279—330. 

[2] M. F. Atiyah, V. K. Patodi, and I. M. 
Singer, Spectral asymmetry and Riemannian 
geometry I, II, III, Proc. Cambridge Philos. 
Soc., 77 (1975), 43—69, 405—432; 79 (1976), 71— 
99, 

[3] M. Berger, P. Gauduchon and E. Mazet, 
Le spectre d'une variété riemannienne, Lecture 
notes in math. 194, Springer, 1971. 

[4] J. Cheeger, A lower bound for the smallest 
eigenvalue of the Laplacian, Problems in 
Analysis: Symposium in Honor of Salomon 
Bochner, R. C. Gunning (ed.), Princeton Univ. 
Press, 1970, 175—199. 

[5] P. B. Gilkey, The index theorem and the 
heat equation, Publish or Perish, 1974. 

[6] M. Kac, Can one hear the shape of a drum ? 
Amer. Math. Monthly, 73 (April 1966), 1-23. 
[7] H. P. Mckean and I. M. Singer, Curvature 
and eigenvalues of the Laplacian, J. Differen- 
tial Geometry, 1 (1967), 43-69. 

[8] S. Minakshisundarum and A. Pleijel, Some 
properties of the eigenfunctions of the Laplace 
operator on Riemannian manifolds, Canad. J. 
Math., 1 (1949), 242—256. 

[9] V. K. Patodi, Curvature and the funda- 
mental solution of the heat operator, J. Indian 
Math. Soc., 34 (1970), 269—285. 


392 (XX.5) 
Spherical Astronomy 


Spherical astronomy is concerned with the 
apparent positions of celestial bodies and their 
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motions on a celestial sphere with center at an 
observer on the Earth, while *celestial mechan- 
ics is concerned with computing heliocentric 
true positions of planets and comets and geo- 
centric true positions of satellites. The purpose 
of spherical astronomy is to find all possible 
causes of displacement of the apparent posi- 
tions of celestial bodies on the celestial sphere 
from their geocentric positions and to study 
their effects. Atmospheric refraction, geo- 
centric parallax, aberration, annual parallax, 
precession, nutation, and proper motion are 
examples of these causes. 

When light from a celestial body passes 
through the Earth’s atmosphere, it is refracted 
since air densities at different heights are differ- 
ent. This phenomenon is called atmospheric 
refraction. The effect of refraction on the 
apparent direction of the celestial body is a 
minimum when the body is at its culmination, 
and vanishes when this coincides with the ob- 
server's zenith, while the maximum refraction 
of 34:5 occurs when the body is at the horizon. 

Topocentric positions differ appreciably 
from geocentric positions for the Moon and 
planets, since their geocentric distances are not 
large compared with the radius of the Earth. 
The difference is largest when the observer is 
on the equator and the celestial body is at the 
horizon, and this largest value is called the 
geocentric parallax. The geocentric parallax of 
the Moon is between 53:9 and 60:2; those of 
the Sun, Mercury, Venus, Mars, Jupiter, and 
Saturn are, respectively, 8764—8794, 6"—16^"5, 
5" 32", 375-2375, 174-271, and 078-171. For 
fixed stars geocentric parallaxes can be re- 
garded as zero since the stars are far from the 
Earth. 

The Earth moves in an orbit around the Sun 
with period of one year (365.2564 days) and 
rotates around the polar axis, which is inclined 
at 6675 to the orbital plane (the ecliptic), with 
period of one day (23 hours, 56 minutes, 4.091 
seconds). Therefore the observer on the Earth 
moves with a speed depending on the latitude 
(0.465 km/sec on the equator) due to the rota- 
tion and moves with an average speed of 
29.785 km/sec on the ecliptic. Due to these 
motions of the observer, apparent directions of 
celestial bodies are displaced from their geo- 
metric directions. Displacement due to the 
rotation is called diurnal aberration, and that 
due to the orbital motion, annual aberration. 
The effect of diurnal aberration is between 0” 
and 0732 and varies with a period of one day, 
while that of the annual aberration is between 
0" and 207496 and varies with a period of one 
year. Moreover, to compute the positions of 
celestial bodies, the travel time of light to the 
observer should be taken into account. 
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Annual parallax for a fixed star is half the 
difference of its apparent directions, which are 
measured at the ends of a diameter perpendic- 
ular to the direction of the star from the orbit 
of the Earth. The effect of the annual paral- 
lax varies with a period of one year. However, 
except for nearby stars, it is not necessary to 
take this effect into account when computing 
apparent positions. 

The pole of the Earth on the celestial sphere 
moves on a circle around the pole of the eclip- 
tic due to the gravitational attraction of the 
Moon, Sun, and planets, and therefore the 
equinox moves clockwise on the ecliptic. 
Because the resultant of the attractive force of 
the Moon, Sun, and planets changes periodi- 
cally, the motion of the equinox is not uni- 
form. Therefore the motion is expressed as the 
sum of secular motion, called precession, and 
periodic motion, called nutation. Since the 
positons of fixed stars on the celestial sphere 
are measured with respect to the equator and 
the equinox, their right ascensions and decli- 
nations are continuously changing because 
of precession and nutation. 

Since the stars are not fixed in space but 
themselves have proper motions, their posi- 
tions on the celestial sphere are continuously 
changing. 

Spherical astronomy also includes predic- 
tions of solar and lunar eclipses, the theory 
of ‘orbit determination to compute apparent 
positions of celestial bodies in the solar system 
by use of orbital elements, and the compu- 
tation of ephemerides for the Sun, Moon, 
planets, and fixed stars. Practical astronomy, 
which develops theories and methods of ob- 
servation by use of meridian circles, transit 
instruments, zenith telescopes, sextants, the- 
odolites, telescopes with equatorial mountings, 
and astronomical clocks, and navigational 
astronomy, which deals with methods for 
determining the positions of ships and aircraft, 
are closely connected to spherical astronomy. 

It should be noted that recently radar has 
been used to measure distances to the Moon 
and planets accurately, a contribution to de- 
termining the size of the solar system with 
precision. 
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A. Spherical Functions 


The term spherical functions in modern termi- 
nology means a certain family of functions on 
tsymmetric Riemannian spaces obtained as 
simultaneous teigenfunctions of certain inte- 
gral operations (— 437 Unitary Representa- 
tions). In this article, however, we explain 
only the classical theory of Laplace's spherical 
functions with respect to the rotation group in 
3-dimensional space. 

Solutions of tLaplace’s equation AV=0 that 
are homogeneous polynomials of degree n 
with respect to the orthogonal coordinates x, 
y, z are called solid harmonics of degree n. If n 
is a positive integer, there are 2n + 1 linearly 
independent solid harmonics of degree n. In 
*polar coordinates (r, 0, ol they are of the form 
r" Y (0, o), where Y,(0, o), the surface harmonic 
of degree n, satisfies the differential equation 


D. QTY. 1 ey, 
—— 4| sin0 qs 2 
sin 0 00 00/ sin^0 do 


4 n(n- 1) Y, 0. 





Here, if we apply ‘separation of variables to 0 
and om and put z 2 cos6, then the component in 
ọ is represented by trigonometric functions, 
and the other component in 0 reduces to a 
solution of Legendre's associated differential 
equation 





d*w dw 
— 2 — — —BÀ 
(1—z hg GER 
2 
aneneen (1) 


B. Legendre Functions 


With m —0 in (1) and n replaced by an arbi- 
trary complex number v, the equation is re- 
duced to Legendre's differential equation 


d 
E 2: 57 e vi 1) 20, (2) 


whose fundamental solutions are represented 
by 


1 (1+,z+) (2 — 1) 
hel: Tcp (3) 





S 1 (1+,-1+) (7 =1y d J 
9, 7 inva 2-0" » 
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where the contour of integration in (3) is a 
closed curve with positive direction on the 
C-plane, avoiding the half-line ( —oo, —1), 

and admitting 1 and z as inner points of the 
domain it bounds, whereas the contour in 

(4) is a closed co-shaped curve encircling the 
point 1 once in the negative direction and 
the point —1 once in the positive direction. 
The functions P,(z) and Q,(z) are called Le- 
gendre functions of the first and second kind, 
respectively. The integral representation (3) is 
Schláfli's integral representation. If Re(v+ 1) 
0, we can deform the contour of integration 
and obtain 





1 1 (1 E GP 
0.274 | ont (5) 
If v is an integer, it is convenient to use the 
representation (5). 

From (3)-(5), we can obtain the recurrence 
formulas for Legendre functions of distinct 
degrees. The recurrence formulas for P,(z) and 
Q,(z) have exactly the same form (— Appendix 
A, Table 18.11). Expanding the integrand in (3) 
and (4) with respect to z — 1 and £/z, the fol- 
lowing identities are obtained: 


1— 
eiae r(v- l, —v, 2! l1—z|«2, 


TE Jn F(v4-1) 
Q7 o T4 3/2 


F v+1 v4+2 e 1 
Vv QA 3 Is 
2.795 2 z* 


[z|» 1 Jargz| <z, 





where F(x, B, y, z) is the thypergeometric func- 
tion. These expansions are the solutions in 
series of Legendre's differential equation in the 
neighborhood of the tregular singular points 
z — | and oo, respectively (C Appendix A, 
Table 18.IT). 

If v is a positive integer, since ¢ = 1 is not a 
*branch point in (3), P,(z) is represented by 
Rodrigues's formula 


2. 1yw 
DIE d ey dt 











2ni J 2^(z Ee") 
] d" 
Em iem B (6) 


In this case, P,(z) is a polynomial of degree n 
such that 


pigs y (dcc esee 


70 2"ri(n—r)'(n—2r)! 


n-2r 
3 





Po(z)— l, 


which is called the Legendre polynomial (Le- 
gendre, 1784). The tgenerating function for the 
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Legendre polynomials is (1 — 2p cos0 + p?) ?, 
whose expansion with respect to p is of the 
form 37-9 P,(z)p", z 2cos0. Here the !generat- 
ing function (1 — 2p cos 0 + p?) '? is the inverse 
of the distance between two points (p, 0) and 
(1,0) in polar coordinates. Hence P,(z) is also 
called the Legendre coefficient. If z is real, (((2n 
1/2)? P.(z)1 Za constitutes an orthonormal 
system on [—1,1] (— 317 Orthogonal Func- 
tions). The n zeros of P,(z) are all real, simple, 
and lie in (—1, 1). For sufficiently large n, we 
have 


P,(cos 0) 


dee ne 
F nese n 2 *1 2 


Q, (cos 0) 


n 1 a 
= = —)+O(1/n>? 
SGOR (1/n^^) 


as n oo. 








C. Associated Legendre Functions 


For any positive integer m, the functions 
(1—z?)"? d™P,(z)/dz™, 
Qv) -(1—z?y"? d"Q, (zy dz" 


are called the associated Legendre functions of 
the first and second kind, respectively. This 
definition, due to N. M. Ferrers, is convenient 
for the case —1 <z « 1. For arbitrary com- 
plex z in a domain G obtained by deleting the 
segment [ —1, 1] from the complex plane, the 
following definition, due to H. E. Heine and 
E. W. Hobson, is used: 


Pr(z)— 


Py"(z) - (z — 1"? d" P,(z)/dz™, 
Qv(z)- (z^ — 1"? a"g,(z)az". 


The associated Legendre functions satisfy the 

. associated Legendre differential equation (1). 

In particular, for v —n (a positive integer) and 

z — x (real), 

{(2n+ 1)(n—m)!/2(n + mj? P™(z), 
n=0, 1,...m=constant, 


constitute an orthonormal system on [ —1, 1]. 
The addition theorem for the Legendre func- 
tions is 


bia 


—m) 
n+m)! 








P,(z,)P, rei A Pr (i) 


x Pi"(z,)cosm@, 


where the equality with the plus sign was 
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obtained by Ferrers and that with the minus 
sign by Heine and Hobson. 


D. Surface Harmonics 


From the considerations so far for the sur- 
face harmonics KI. oi, 2n 4- 1 independent 
solutions 


P(cos0), ` Pz"(cosO0)sin mo, 


qP,(cos0)cosmp, 1l<m<n, 


are obtained. Since P,(cos0) vanishes on n 
latitudes of the unit sphere, and P7'(cos0): 
cosmo and P7'(cos0)sin mo vanish on n —m 
latitudes and m longitudes of the unit sphere, 
respectively, the former functions are called 
zonal harmonics and the latter, tesseral har- 
monics. The general form of surface harmonics 
Y, of order n is given by a linear combination 
of zonal and tesseral harmonics: 


Y,(0, p) = A, oP, (Cos 0) 


+ Y (A, cosmo + B, ,,sinmo)Pz(cos0). (7) 
m=1 

Expressing two surface harmonics VU) and 

Y!? in linear combinations such as (7), the 

following orthogonality relations hold: 


| | YPO, o) Y? (0, o)sin0d0do 


SO 








! 
"(A400 40) (1) DO 
! (AT Ann T sec ` 


Since the family of all zonal and tesseral har- 
monics constitutes a *complete orthogonal 
system, it is possible to expand a function 

f(0, ol on the sphere into an orthogonal series: 


f0.o) - Y. Y (0.9 
= Y (Ain 9) y (A, wm cos mo 
m=1 


+ B, msin me) P; (cos D) . 


To obtain surface harmonics, the following 
method is effective. Let v be a direction pro- 
portional to the direction cosines l, m, n. Then 
a function 


poss id oon 0M afl 
ox óz]r dv\r)’ 
i. erer 
is a solution of Laplace's equation. Physically, 


this corresponds to a tpotential of double pole 
with moment « and direction v. A more gen- 
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eral multipole potential 


df "x c) 


also satisfies Laplace's equation. If we put V— 
U, (x, y,z)r 2" !, U, is a spherical function of 
order n (Maxwell's theorem). Various spherical 
functions correspond to particular directions 
v. For example, if every v; is equal to z, we 
have zonal harmonics; and if n — m of the v; are 
equal to z and m of the v; are symmetric on the 
xy-plane, we obtain tesseral harmonics. Let y 
be an angle between two segments connecting 
the origin to the points (r, 0, ol and (r,6', 9") in 
polar coordinates. Then cos y= cos 0 cos 0' + 
sin dain 0' cos(q — ¢’), and if we choose the 

line connecting the origin to a point (r’, 0. 9’) 
as the axis defining P,, we have 


jm (= ð yð 


Fj(cosy)- C717 





rox r 


P n/i 
rôz] Nr]. 


These are called biaxial spherical harmonics, 
which can also be represented (by the addition 
theorem) by means of spherical harmonics 
with respect to each axis. 


E. Extension of the Legendre Functions 
We extend the associated functions with posi- 


tive integer m to any number m. First, if m is a 
negative integer, we put 


P,"(z)-(1 ums [a is dC, i vem 


1 1 


La e 
«| at | P,(5,) dC, 
1 1 
z in 
orraz- | ic, | E 
63 Wi 
ol Q,(C,)d5;, 
a definition due to Ferrers. Then 
PE cuc a E He 
P; (z)=( 1) Tvimci)" (z), 
T mi (v—mt+}) m 
A C" 59 9. 


When we use the definition due to Heine and 
Hobson, the factor ( —1)" in these formulas is 
excluded. Two fundamental solutions, called 
hypergeometric functions of the hyperspherical 
differential equation 


(1— 254? w/dz? —2(u + 1)z dw/dz 
+(v—p)(v+yt 1)w- 0, 
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are 


—vmi 


e 
IN, Dt 7) — 
Po 2’ Ansinva 


x det 1+,z-,1-) (^ - 1 dt 
(-2y m 


e +v)ni " S (^ =>, 1)” 
ez) == —— — $ a 2 dl, 


27^ 4isin va (C—z ter 


where the contour of integration for the latter 
integral is a curve encircling the point —! once 
in the positive direction and the point 1 once 
in the negative direction. Then the associated 
Legendre functions for an arbitrary number yu 
are defined as follows: 


Hu Tocat dy DÉI 
p! (z)= 2^T (v4 1) (z 1) EPIS 
gige L ET "E (we Q(z). 


2^T(v +1) 


If v — u is a positive integer, P^? is called 

the Gegenbauer polynomial, also denoted by 
C,_,(z). The C,_,,(z) are obtained as coeffi- 
cients of the expansion of the generating func- 
tion (1 —2hz 4 22) 2#*))? (— Appendix A, 
Table 20.1). 

For spherical functions of several variables 
there is an investigation by P. Appell and J. 
Kempé de Fériet [2] (— 206 Hypergeometric 
Functions D). 
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A. General Remarks 


Stability was originally a concept concerned 
with stationary physical states. When a state 
is affected by a small disturbance, this state 

is said to be stable if the disturbance subse- 
quently remains small, and unstable if the dis- 
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turbance gradually increases. For instance, 
consider a rod placed in the Earth's gravi- 
tational field with one end fixed at a point 
around which the rod can rotate freely. When 
the rod is placed vertically, this state is station- 
ary. It is stable if the rod is hanging down from 
the fixed end, and unstable if it is standing on 
the fixed end. In physical systems only the 
stable state is practically realizable, so this dis- 
tinction is important. 

The concept of stability is used not only in 
relation to physical states but also in many 
other fields of science. We shall restrict our- 
selves to stability of solutions of differential 
equations. There, the term stability is used in 
the sense that a small change in the initial 
values results in a small change in the solution. 
As long as the solution is considered within a 
finite interval of the independent variable, this 
stability is naturally guaranteed by the con- 
tinuity of the solution with respect to its initial 
values (— 316 Ordinary Differential Equations 
(Initial Value Problems)). The problem arises 
when an independent variable moves over an 
unbounded interval. 


Let (x1; s Xa} =X, (x(t), ..., xul) X(t), 
(x(t), ..., x,(t)) 2 x' (t) (the symbol ' means 
differentiation by t), and |x| = 25-,|xj]. Con- 


sider the differential equation 
x' —f(t, x), (1) 


for which the existence and uniqueness of the 
solution of the initial value problem is as- 
sumed for |t| « oo, [x| « oo. Let x 2 e(t) bea 
solution of (1). If for any £20 and tọ, a 00 
can be chosen so that Ix(to) — e(to)| <ô implies 
Ix(t) — e(t)| «efor ty €t « oo (—o0 «t &t9), 
where x(t) is any solution of (1), then x = g(t) is 
said to be (Lyapunov) stable in the positive 
(negative) direction. If it is stable both in the 
positive and negative directions, it is said to be 
stable in both directions. In the remainder of 
this article we will consider stability in the 
positive direction only. Corresponding asser- 
tions for stability in the negative direction can 
be obtained by reversing the sign of t. 


B. Classification 


We denote by x 2 x(t, fo, Xo) a solution of (1) 
such that x 2 x, at t— to. 

Suppose a solution x = g(t) ts stable. If for 
any t, there exists a €>0 such that |x(t, tp, Xo) 

e(t)| 50 as t— oo for any x(t, to, xo) with |x 
— ((to)| « C, p(t) is said to be asymptotically 
stable. 

If a constant 6 in the definition of stability 
can be chosen independently of to, g(t) is said 
to be uniformly stable. When equation (1) 
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is ‘autonomous, stability implies uniform 
siability. 

If (1) g(t) is uniformly stable and (2) for any 
to and 7 >0, there exist a ¢ >0 independent 
of ty and a T>0 independent of t, such that 
[Xo — 9(t9)| « £ and t» t9 4- T imply |x(t, to, Xo) 
— e(t)| <n, then g(t) is said to be uniformly 
asymptotically stable. 

Suppose that there exists a positive number 
À with the following property: For any to 
and ¢>0 one can take a ó(c) » 0 such that 
Ixo — (to) < (c) implies [x(t, to, xo) - GO| < 
£e ^* o for t> tg. Then @(t) is said to be 
exponentially stable. Exponential stability 
implies untform asymptotic stability. 


C. Criteria 


To deal with the stability of x = g(t), we need 
consider only the case g(t) 2 0, since the trans- 
formation x =y + e(t) reduces equation (1) to 


y —f(t. y - e(t)) — flt, o(t)) — F(t, y), 
F(t,0)=0, (2) 


and thus x — e(t) is transformed into y=0. If F 
is continuously differentiable with respect to y, 
(2) can be written in the form 


y-FE,(0)y--g(Ly) — g(t.y)—o(lyl). 


The linear part of this equation, 
y -F(t, Oy, 


is called the variational equation for (1). So, 
in this section, we can state several criteria 
for stability of the null solution y =0 of the 
equation 


Ig(t. y)| — o(Iy - (3) 


(1) If (3) is linear (e, g(t, y)=0), then y z0 is 
stable if and only if every solution of (3) is 
bounded as t— oo. 

(II) If (3) is linear, uniform asymptotic sta- 
bility implies exponential stabilitv. 

Let f(t, y) be a function defined for |y| <p, t 
>a. If there exists a continuous function w(y) 
such that w(0) 20, w(y)- 0 (y £0). f(t, y) > w(y) 
(vi € p, t7 o), then f(t, y) is said to be positive 
definite. If — f(t, y) is positive definite, then 


y — P(t)y t g(t, y). 


f (t, y) is said to be negative definite. 


(III) The existence of a Lyapunov function 
V(t, y) with the following properties implies the 
stability of y z 0: (i) V(t, y) is positive definite 
and differentiable, (ii) V(t, 0) — D. (iii) V(t, y) — 
V+ V: (P(t)y +g(t, y)) «0. 

The existence of V(t, y) with the following 
properties implies the uniform asymptotic 
stability of y 20: (i) same as (i) above, (ii) there 
exists a continuous function v(y) such that 
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v(0) — 0, v(y) > 0 (y 40), V(t, y) & v(y), (iii) V(t, y) 
is negative definite. 

Hereafter we shall assume that [g(t, y)| = 
o(|y|) as y 50 uniformly with respect to t. 

(IV) If P(t) is a constant matrix all of whose 
*eigenvalues have negative real parts, then 
y «0 is asymptotically stable [3,4]. 

(V) Let P(t) be continuous and periodic with 
period T and Z be a ‘fundamental system of 
solutions of the variational equation 


z = P(t)z. (4) 


Then there exists a constant matrix C such 
that Z(t + T)=Z(t)C. Let 4,,..., 4, be the 
eigenvalues of C. Then the numbers p, = 

(log 4,)/T (k=1,...,n) are called the charac- 
teristic exponents of (4). Obviously they are 
determined up to integral multiples of 2zi/T. If 
the real parts of the characteristic exponents 
are all negative, then y=0 is asymptotically 
stable [3,4]. 

(VI) If f(t, x) in (1) is periodic in t with period 
T and (1) admits a periodic solution x = g(t) 
with period T, then (1) can be reduced to (3) by 
putting x — y + e(t), and P(t), g(t, y) are both 
periodic in t with period T. Thus criterion (V) 
can be applied as a stability criterion for the 
periodic solution of (1). There are many other 
criteria for various particular forms of the 
equation (— 290 Nonlinear Oscillation). For 
the *autonomous case where f(t, x) is of the 
form p(x) or p(x)+ q(t) with q(t 4- T) — q(t), 
many results have been found. 

(VIT) If the solution zz 0 of 


z = Pils 


is uniformly asymptotically stable, then the 
solution y =Q of (3) is also uniformly asymp- 
totically stable [4]. 


D. Conditional Stability 


Let g(t) be a solution and & a family of solu- 
tions of (1). If for any £20, a 6>0 can be 
determined so that |x(to)—(to)| <ô implies 
Ix(t) — e(t)| <e for ty t « oo for any solution 
x(t) in %, then g(t) is said to be stable with 
respect to the family X. If a family & can be 
found so that a solution is stable with respect 
to X, the solution is said to be conditionally 
stable. For instance, in equation (3), if P(t) is a 
constant matrix some of whose eigenvalues 
have negative real parts, g(t, y) is differentiable 
with respect to y, and g,(t, y)=o(1) uniformly 
in t as y 20, then y z0 is conditionally stable. 

We now mention a weaker kind of stabil- 
ity called orbital stability. Let g(t) be a solu- 
tion and e any positive number. If there can 
be found a positive number 6 such that for 
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any solution x(t) with [x(t,) — e(to9)| <ô for 
some t, and ty, UL X(t) belongs to the s- 
neighborhood of | 1 (t), then g(t) is said 
to have orbital stability. 

When f(t, x) in equation (1) is independent of 
t, (1) is often called a *dynamical system. In the 
theory of dynamical systems, not merely the 
stability of a solution itself but also the sta- 
bility of a closed invariant set is of importance 
(— 126 Dynamical Systems). 

It is also of importance to investigate the 
change in solution caused by a small change in 
the right-hand member of the equation. Sup- 
pose, for instance, that the right-hand member 
of the equation depends continuously on a 
parameter e. Then the question arises as to 
how the solution changes if e changes. The 
theory of such problems is called perturbation 
theory. Suppose that the equation 


x' — f(t, x, c) 


admits a periodic solution g(t) for e=0. Then 
y(t) is said to be stable under perturbation if 
for £250 the same equation admits a periodic 
solution lying near g(t). In tcelestial mechanics 
and *nonlinear oscillation theory this concept 
plays an important role. 
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A. Definitions 


Stationary process is a general name given to 
all tstochastic processes (— 407 Stochastic 
Processes) that have the property of being 
stationary (to be defined in the next para- 
graph) under a shift of a time parameter t that 
extends over T, which is either the set of ail 
real numbers R (a continuous parameter) or 
the set of all integers Z (a discrete parameter). 
Let (Q, B, P) be a tprobability space and 
{X,(w)} (te T, €Q) a complex-valued fsto- 
chastic process. If for every n, every t,, fa, 
t,€ T, and every *Borel subset E, of complex 
n-dimensional space C", the equality 


P((X,, — 198 s Xn) EEn) 


— P((X,,,..., X, )e En) (1) 


holds, then {X,} is called a strongly (or strictly) 
stationary process; while if E(|X,|?) is finite for 
every t, and if the fmoments up to the second 
order are stationary, i.e., if 


E(X,,,) = E(X)), 
E(X, les E(X,X,), Q) 


then {X,} is called a weakly stationary process 
or a stationary process in the wider sense. The 
"stationary" in the latter sense obviously in- 
cludes the former if E(| X,|?) « oo. Condition 
(2) is equivalent to 


E(X,)-—m  (aconstant independent of t), 


E((X,—m)(X,—m))— p(t —s) 
(a function of t —s). (3) 


We call m and p(t) the mean and the covari- 
ance function of ( X,). 

In the continuous parameter case, we as- 
sume fcontinuity in probability, 


lim P(|X,.,—X,|>e)=0, &»0, 


for a strongly stationary process, and continu- 
ity 1n mean square, 


lim E(|X,., — X^) 0, 


for a weakly stationary process. The latter 
assumption is equivalent to continuity of the 
covariance function p(t). 

A *Gaussian process is strongly stationary if 
and only if it is weakly stationary; and so it is 
simply called a tstationary Gaussian process. 
Such processes constitute a typical class of 
stationary processes (— 176 Gaussian Pro- 
cesses C). 
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B. Spectral Decomposition of Weakly 
Stationary Processes 


The covariance function p(t) is obviously 
tpositive definite and continuous. Therefore, 
by *Bochner's theorem, we have the tspectral 
decomposition of p(t): 


p(t)= | e'"F (dd), (4) 
2 


where T” is either R (when T=R) or [—7,7] 
(when T= Z) and F is a bounded measure on 
T’. The decomposition (4) is called the Khin- 
chin decomposition of p(t), and F(dA) is called 
the spectral measure. If the process ( X,) is real- 
valued, then the spectral measure F(d4) is 
symmetric with respect to the origin. 

To obtain the spectral decomposition of a 
weakly stationary process X, itself, we intro- 
duce the "Hubert space L,(O) (where Q= 
Q(B, P) is the basic probability space on 
which each X, is regarded as a tsquare inte- 
grable function). Let M(X) be the subspace of 
L,(Q) spanned by the X, (te T) and the con- 
stant function 1. Since {X,} is weakly station- 
ary, we can define a one-parameter group of 
*unitary operators U, (te T) determined by 
U, X, — X,,, and U,1=1. By !Stone's theorem 
we have the spectral decomposition of U,: 


U | e" E(d) (5) 


Setting M(A)= E(A)(X, — m), we obtain the 
spectral decomposition of X,: 


Xis UXo=m+| e" M (da). (6) 


e 
We also have 
(M(A), 1)=0, 
(M(A,), M(A2))= F(A, N A2). (7) 


The study of weakly stationary processes is 
based on the decomposition (6). For example, 
the weak law of large numbers for | X,j, 


1 B 
tim sl X,dt=m+M({0}), (8) 
is an immediate consequence of (6). In the 
discrete parameter case a similar result is 
obtained by replacing the integral sign in 
expression (8) by the summation sign. In partic- 
ular, if F is continuous at the origin, we have 
M({0})=0, and only the constant m remains 
in the right-hand side of (8) [1, 2]. 


C. Weakly Stationary Random Distributions 


Just as we introduce fdistributions as gen- 
eralizations of ordinary functions, we define 


1463 


weakly stationary random distributions as 
generalizations of weakly stationary processes. 
Let 2 be the space of all functions of class C? 
on T=R with compact support. We introduce 
the same topology on @ as in the theory of 
distributions. If a random variable X, € L;(Q) 
is defined for every oe Z2 and the mapping 9 
>X, is continuous in the L;-sense and linear, 
then the family {X,} of random variables is 
called a random distribution in the wider sense 
(— 407 Stochastic Processes). Furthermore, if 


(X, o. D) 2 (X, 1), 
[e Xl xx: LE Xo) (9) 


for every he R!, where ( , ) stands for the inner 
product in L,(Q) and 


t, (t) — p(t — h), 


then {X,,} is said to be a weakly stationary 
random distribution. With a weakly stationary 
process we can associate a weakly stationary 
random distribution by the relation 


x=] X,o()dt. (10) 
T 


This correspondence is one-to-one, and there- 
fore we can identify {X,} with {X,} as we 
identify an ordinary function with a distri- 
bution. From equations (9) it follows that 
there exist a constant m and a distribution 

p such that E(X,) —m| o(t)dt and E(X,,— 

E(X (X, — E(X,)) = (p * ý), where a de- 
notes convolution, and y/(t) 2 V ( — t). We call m 
and p the mean value and covariance distri- 
bution of { X,,}, respectively. By the generalized 
Bochner theorem p can be expressed in the 
form 


AQ)= Louna atay= Joen 
where F(d4) is a slowly increasing measure, i.e., 
fuser mui (11) 


for some positive integer k. F(dA) is called the 
spectral measure. This expression is the gen- 
eralization of the Khinchin decomposition. 
The spectral decomposition corresponding to 
(6) and the tlaw of large numbers for X, can 
be discussed in a manner similar to that for 
weakly stationary processes (K. Itó [3]). 


D. Prediction Theory 


Let {X,} be a weakly stationary process. Sup- 
pose that its values X, (s <t) up to time t are 
Observed. Prediction theory deals with the 
problem of forecasting the future value X,., 
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(t » 0) from the known values X, (s«t). If the 
domain of the admissible predictors is limited 
to linear functions of X, (s« t), the theory is 
called linear prediction theory. We can assume 
without loss of generality that the mean value 
m of X, is zero and that the spectral measure 
F(dA) of {X,} is not a zero measure. 

Let M(X) be the subspace of L,(Q) spanned 
by the X, (est then J(X)— V, AJ (X). A linear 
predictor for X,,, from X, (s<t) is an element 
Y of SOL If a linear predictor minimizes 
the prediction error o*(t)=E(|X,,,—Y|*) in 
AK (X), it is called an optimum linear predictor, 
which turns out to be the t(orthogonal) projec- 
tion of X,+, on .4,(X) and which is denoted by 
A, Since ( X,) is stationary, the error o?(z) 
does not depend on t for such a predictor. 
Corresponding to the spectral decomposition 
(6) of X,, the optimum linear predictor is ex- 
pressed in the form 


f| e" Q(4)M (44), (12) 
af 


where @,(-) is square integrable with respect to 
the spectral measure F (dA). 

The subspace .4,(X) is nondecreasing in t. If 
MX) is independent of t, i.e., M{X)= M(X) 
for every t, then ( X,] is said to be determinis- 
tic. In this case we have X, ,= X,+, for every t 
and t>0, since X,,,€.4,(X). This means that 
the linear predictor enables us to determine 
the unknown quantities without error, and 
therefore such a process is of no probabilistic 
interest. On the other hand, if (\,.4,(X)= {0}, 
then {X,} is said to be purely nondeterministic. 
A general { X) is expressed as the sum of the 
deterministic part ( X7) and the purely nonde- 
terministic part ( X7) (Wold decomposition). 
Furthermore, we have ./(X^)— (^, (X), and 
M(X)= M(X%+ MAX”) (direct sum). Thus 
{Xf} and (X7 can be dealt with separately. 

A weakly stationary process | X,j is purely 
nondeterministic if and only if the spectral 
measure F (dA) is absolutely continuous with 
respect to the Lebesgue measure, and the 
density f(A) is positive almost everywhere and 
satisfies 


i log f(A)d2 > —oo 


=n 


(discrete parameter case), 


dA —oo 





? log f(A) 
1+4? 


— 0 
(continuous parameter case). 


By using f(A) the optimum linear predictor can 
be obtained. 

First, we explain the discrete parameter 
case. There exists a function y(z) 2 X2, a,z' in 
the *Hardy class H, relative to the unit disk 
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such that its boundary value satisfies 
f2)- 0/2z)|»(e ^). 


Then we can find a sequence of mutually or- 
thogonal random variables (£j (te Z) such 


that {X,} admits a backward moving average 
representation 
X,- p A Ma (13) 


There are many pairs {a,} and {¢,} which give 
the representation (13), but if y(z) is maximal 
(optimal), namely, if y(z) is expressed as 


iA 
je ae | log f(A) s 2 


(14) 


then the representation (13) ts canonical in the 
sense that JZ( X) =M (C) for every t. Hence the 
optimum predictor X, , for X,+, is given by 


R, = y dtt- "a s 


s=- —n 


ak e"$(4)M(dA, (15) 


The prediction error ot) of this predictor is 
given by 


t-1 
a*(t)= } lal’. 
s=0 


Example. Let the covariance function of a 
weakly stationary process {X,} be e "' (a 0). 
Then we have 


f) =(1/2n)(1 — B?)|1 — fe ^| 7?, 


The maximal y(z) is expressed as ,/1— fi?(1— 
Bz)‘, and 


n it oo 
ise | IT Y fre M(di) 


L—p'X, 
e?()-21-2ff'e * e fpzi—e 2%. 


We now come to the continuous parameter 
case. By replacing the holomorphic function 
y(z) on the unit disk with the one on the half- 
plane, we see that almost all results obtained 
in the discrete parameter case hold similarly in 
this case. The maximal y(z) is expressed as 


1 jene dA 
= RES l a 
"leng (s. [- og BER i) 


Using the *Fourier transform a, of the bound- 
ary function of y(z) and a process (£,] with 
orthogonal increments, we have the canonical 
backward moving average representation for 
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the process 1 X,j: 


t 
x=f 


which enables us to obtain the optimum pre- 
dictor and the prediction error in a manner 
similar to the discrete parameter case [4]. In 
particular, if the optimal y(z) is of the form y(z) 
— c/P(iz), where c is a constant and P(z) is a 
polynomial of degree p, then X, is p —1 times 
differentiable and P(d/dt) X, — (d/dt)£, up to a 
multiplicative constant; therefore X, , is ob- 
tained explicitly. To obtain the optimum linear 
predictor for Y e M(X), we first establish the 
expression 


i des 


Y= » f(s)é, or | f(s) dé,, 
and then take Xl- -o /(s), or (2, f(s) dé, for 
the optimum linear predictor. 

The results stated above can be generalized 
to multivariate (n-dimensional) stationary 
processes [6, 7] and to the case where the 
parameter space T is multidimensional. 

N. Wiener observed the individual tsample 
process X (t, ol and discussed a method of 
finding the optimum predictor for X (t +t, œ) 
by using a linear functional 


| X (t — s, co) dK (s) 
0 


(K is of *bounded variation) of the values X, 
(s<t) [8]. The spectral measure played an 
important role in his observation. Calculations 
in this case are analogous to those mentioned 
above. 

For a weakly stationary random distribu- 
tion {X(@)} (pe 2), the prediction theory is 
reduced to that for ordinary stationary pro- 
cesses. Assume that the spectral measure F(d/) 
of (X(o)] satisfies (11). Set e(t) 2 exp(t) (t <0), 
— 0 (t 0), and let e, (t) be the k-times convo- 
lution of e(t) with itself. Set Y(qo) = X(e, * q). 
Then (Y(q)j is equivalent to a weakly station- 
ary process. It is obvious that JZ,í(X)— MY) 
for every t, where .Z(X) is the linear subspace 
spanned by {X (o)|support of o c(—oo.t]j. 
This consideration allows us to reduce the 
prediction problem for {X (9)) to that for the 
stationary process corresponding to { Y(q)j. 

Nonlinear prediction theory is formulated 
as follows. Let 3B, be the smallest c-algebra 
with respect to which every X, (s € t) is mea- 
surable and H,(X) be the subspace of L;(Q) 
consisting of all 98," measurable elements. The 
problem is to forecast X,+, (t 0) by using 
an element of H,(X). The optimum predictor is 
obviously equal to E(X,,,|38,). For a station- 
ary Gaussian process it has been proved that 
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the optimum predictor found in H,(X) belongs 
to MAX). Therefore the optimum nonlinear 
predictor coincides with the optimum linear 
predictor. However, except for stationary 
Gaussian processes, no systematic approach 
for nonlinear prediction theory has been es- 
tablished so far. (For a typical case that arises 
from a stationary Gaussian process — 176 
Gaussian Processes H.) 


E. Interpolation and Filtering 


Interpolation and filtering of stationary pro- 
cesses have many similarities with prediction 
theory, both in the formulation of the prob- 
lems and in their method of solution. 

Let {X,} be a weakly stationary process, all 
of whose values {X,|t¢ Tj]. T, some interval, 


are known with the exception of those at te T;. 


The problem of linear interpolation of the 
unknown value X, (te T,) is to find the best 
approximation of this random variable by the 
limit of linear combinations of the known 
values. The following example illustrates the 
problem in the discrete parameter case. 

Example. Let T; = {to} and f(4)dÀ be the 
spectral measure of ( X,j. The interpolation of 
X,, has an error if and only if 


mx 
NE oO. 


Expressing X, in the form (6) with m=0, the 
best (linear) interpolation X, of X, is given by 


[4 T 1 =1 
‘Mol 1—2 À ——d M (dA), 
x ( dng Eum n) ) (dd) 


and the error of the interpolation ts expressed 
by 


> S 3 d: 
E(|X,, — X, |) 2 4n? zd. s 
(x, - x, eee (| aa) 


The problem of interpolation for multivariate 
stationary processes has also been discussed 
[7]. 

The filtering problem originated in com- 
munication theory as a technique to extract 
the relevant component from a received signal 
with noise [8, 13]. Suppose that a complex- 
valued stationary process {X,} with continu- 
ous parameter is expressed in the form 


oo 
sl e" M(dA) « S, LN. 
where (S,, N,) is a (2-dimensional) weakly 
stationary process with mean vector 0. Here, S, 
and N, indicate the signal and noise, respec- 
tively. The filtering problem is to find the 
clement of (X) that approximates $,,, as 
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closely (relative to the .@(X)-norm) as possible. 
The best approximation is the projection of 
S,., on M(X), but its expression in terms of 
the spectral measure becomes extremely com- 
plicated [8]. This problem is usually discussed 
under the assumption that S, and N, are ortho- 
gonal. Let us further assume that their spectral 
measures are absolutely continuous. The den- 
sity functions are denoted by f,(A) and fy(A), 
respectively. If ( X,|te T} is observed, then the 
best (linear) approximation $, of S, is given by 


$- | e" og(4) M (då), 
where @o(A) = fs(4)/( Js) + fy(A)). The mean 
square error E(|S, — $^) of this filtering is 


R ISASA SSA) + Jet Aldi. 


F. Strongly Stationary Processes and Flows 


Let {X,(@)} (te T, we Q(B, P)) be a strongly 
stationary process. To study it we take the 
coordinate representation of {X,} as follows. 
Let Q be the complex vector space C7, 88 the o- 
algebra generated by the Borel *cylinder sets, 
X (œ) the tth coordinate of the function we 
C7, and P the probability distribution of the 
process ( X,) defined on (Q, 8). Define the 
shift transformation S, of Q onto itself by 
(S,@) (s) 2 (s 4- t). Then {S,} forms a group 
of *measure-preserving transformations 

on Q(B, P) (—^ 136 Ergodic Theory) since 
{X,} is strongly stationary. Thus we are given 
a (measure-preserving) "low {S,} (te T) 

on Q(B, P). Conversely, if {S,} (te T) is a 
(measure-preserving) flow on a probability 
space Q(B, P), then {X,}, given by X,(w)= 
f(S,q), is a strongly stationary process, pro- 
vided that f is measurable. Many properties 
of a strongly stationary process are closely 
related to those of the corresponding flow. 
For example, the tstrong law of large num- 
bers for a strongly stationary process follows 
from ?Birkhoff's individual ergodic theorem 
for flows. *Ergodicity, several kinds of *mixing 
properties, and the spectral properties of a 
strongly stationary process are defined in 
accordance with the respective notions for 
the corresponding flow. Now we give some 
examples of flows corresponding to strongly 
stationary processes. 

(1) If X, (te Z) are mutually independent and 
have the same probability distribution, then 
the process {X,} (te Z) is strongly stationary 
and { \,%, is trivial (the definition of 38,5 D). 
Hence the corresponding flow is a *Kolmo- 
gorov flow. 
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(2) Similarly to (1), the flow corresponding 
to ‘Gaussian white noise ( 176 Gaussian 
Process) is also a Kolmogorov flow. 

(3) The mixing properties of the flow corre- 
sponding to a stationary Gaussian process Is 
determined by the smoothness of its spectral 
measure F (dA). The flow is ergodic if and only 
if F is continuous De, F has no point mass). In 
this case, the flow is also tweakly mixing. For 
the flow to be *strongly mixing, it is necessary 
and sufficient that the covariance function p(t) 
of the process tend to zero as |t|— oo. In this 
case the flow is *mixing of all orders (— 136 
Ergodic Theory) [14-16]. 


G. Analytic Properties of Sample Functions of 
Stationary Processes 


In the continuous parameter case, we always 
assume that *continuity in probability holds 
for strongly stationary processes and tmean 
square continuity holds for weakly stationary 
processes. Hence the processes discussed here 
are all continuous in probability, and without 
loss of generality we can assume that the sta- 
tionary processes are tseparable and tmea- 
surable (— 407 Stochastic Processes). 

Let {X,} be a weakly stationary process. 
Assume that the moments up to order 2n of 
the spectral measure F (d^) are all finite. Then 
almost all tsample functions of {X,} are n— 1 
times continuously differentiable, and almost 
all sample functions of | X" ?| are absolutely 
continuous. Define the spectral distribution 
function F(A)= F((—oo, 4]) for the spectral 
measure F(d4) of {X,}. If F satisfies 

Y, Inl'G*(n 1) — F(n))'? « oo (16) 
for a nonnegative integer r, then almost all 
sample functions of ( X,) have continuous 
rth derivatives. In particular, if the condition 
(16) is satisfied for r=0, then almost all sam- 
ple functions are continuous. Conditions for 
Holder continuity of almost all sample func- 
tions of a weakly stationary process have also 
been obtained [17, 18]. (For sample functions 
of stationary Gaussian processes — 176 
Gaussian Processes F.) 

For a strongly stationary process | X,j with 
E(X,)=0 and finite E(X2), the sample covari- 
ance function 
T 

X,45(@)X,(@) ds 


-T 


R(t)= lim E 

T>% 2T 
is determined with probability 1. We can 
therefore apply the theory of tgeneralized 
harmonic analysis, due to Wiener [9]. (Further 
results on sample function properties are 
found in [19].) 
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H. Strongly Stationary Random Distributions 


Let Z be the space of all C”-functions with 
compact support and & be the space of tdistri- 
butions. If X,(c) is defined for we Q(B, P) and 
o € Z, and if for almost all œ, X (c) belongs to 
Z' as a linear functional of o, then {X,} is 
called a random distribution. Suppose that the 
Joint distribution of the random variables 
Xo Xano Bee, Un) = P(t — h)) is inde- 
pendent of h. Then we call ( X,(c)j a strongly 
(or strictly) stationary random distribution. If 
we identify random distributions that have the 
same probability law, then {X,,} is determined 
by the characteristic functional 


C(o)- E(e*). 


For {X,} to be strictly stationary it is neces- 
sary and sufficient that the equality C(t,@) 

= C(q) hold. The simplest example of a strictly 
stationary random distribution is the Gaussian 
white noise (— 176 Gaussian Processes, 341 
Probability Measures) [20]. 


I. Generalizations of Stationary Processes 


The concept of stationary processes is gen- 
eralized in many directions. 

(1) Let T be a set different from R or Z, and 
suppose that there is given a group G of trans- 
formations that map T onto itself. If a family 
{X,} of random variables with parameter te T 
has the property that for every choice of ran- 
dom variables X, , X, , ..., X, the joint distri- 
bution of (X,, , ..., X,, ) is always independent 
of g€G, then {X,} (te T) is said to be a strictly 
G-stationary system of random variables. 
Similarly, a weakly G-stationary system of 
random variables can be defined [21, 22]. 

(2) Let T be a Riemannian space, and let G 
be the group of all isometric transformations 
on T or one of its subgroups. Suppose that a 
*tensor field X,(@) of constant rank is asso- 
ciated with any c € (88, P) at every point t. 
Then X(o) 2 (X,(o)|te T} is called a random 
tensor field over the Riemannian space T. Any 
g € G induces an isometric transformation of 
the tangent vector space at t to that at gt. 
Hence g maps a tensor field X(o) to another 
tensor field gX(w) for every œ. If X(c) and 
gX(q) have the same probability law, then 
X(%) is said to be strictly G-stationary. X (c) is 
defined to be weakly G-stationary in a similar 
way [21,22]. 

(3) In the same way as we extended stochas- 
tic processes to random distributions, we can 
generalize random tensor fields to random 
currents and discuss stationary random cur- 
rents [21]. 

(4) Stochastic process with stationary incre- 
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ments of order n. Assume that ( X,] (re R) is not 
necessarily a stationary process but that the 
nth-order increment of X, is stationary. Then 
by taking the nth derivative D"X, in the sense 
of random distributions, we obtain a station- 
ary random distribution. From the properties 
of D"X, we can investigate the original process 
itself. Brownian motion is an example of a 
stochastic process with stationary increments 
of order 1. 

(5) Weakly stationary processes of degree k. 
A weakly stationary process is a process whose 
moments up to order 2 are stationary. Gen- 
eralizing this, we can define a weakly station- 
ary process of degree k by requiring the mo- 
ments up to order k to be stationary. We can 
obtain more detailed properties of such pro- 
cesses than those of weakly stationary pro- 
cesses [23]. 
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A. General Remarks 


A statistic is a function of a value (e a 
sample value) observed in the process of sta- 
tistical inference (— 401 Statistical Inference). 
A statistic is used for two purposes: (a) to 
characterize the set of observed values or 
sample values, and (b) to summarize the in- 
formation contained in the sample about the 
unknown parameters of the population from 
which it is assumed to have been drawn. 


B. Samples and Statistics 


The basic concepts in statistical inference are 
tpopulation and tsample. Let (Q, 2, P) bea 
*probability space, where P is a tprobability 
measure on &. A random variable X defines a 
1-dimensional probability distribution ®(A) 
= P{w|X(w)e A}, where A is a 1-dimensional 
*Borel set, which gives rise to a 1-dimensional 
probability space (R, 31. 4). Here #' is the 
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family of all 1-dimensional tBorel sets. Let 
Xis X,, ..., X, be "independent random vari- 
ables with identical 1 -dimensional distribu- 
tions. The *n-dimensional random variable 

X =(X,, X5, ..., Xn) is called a random sample 
of size n from the population (Q, 2, P). In 
particular, when each of X,,..., X, takes only 
two values (usually 0 and 1), the sample is 
called a Bernoulli sample or a sequence of 
Bernoulli trials. Generally, if ®, is the tn- 
dimensional probability distribution deter- 
mined by X (i.e., the direct product of n copies 
of the 1-dimensional probability distribu- 

tion ®), then the n-dimensional probability 
space (R", Z", ,), where Z" is the family 

of n-dimensional Borel sets, is called an n- 
dimensional sample space. A point belonging 
to the set of actually observed values of the 
sample X, which is a random variable by 
definition, is called a sample value and is de- 
noted by x. Thus the sample value can be 
expressed as x = X (w) (w€ Q) and regarded as 
a point in the sample space (sample point). The 
basic underlying structure which determines 
the probability distribution is the set Q, which 
we can view as describing the physical struc- 
ture of the observed phenomena, but statistical 
procedures are always carried out through the 
observations of samples, and Q itself is often 
disregarded. The 1-dimensional probability 
distribution ® (the n-dimensional probability 
distribution ®, determined by X) is called the 
population distribution in the 1-dimensional (n- 
dimensional) sample space, since it is induced 
from the probability measure on (Q, 2). 

A statistic Y is a random variable expressed 
as Y= f(X), where f is a tmeasurable func- 
tion from the sample space (R", 2", ®,) into a 
measurable space (R, #'). The value of the 
statistic Y corresponding to a sample value x 
of the sample X is denoted by y= f(x). 

When we deal with a statistical problem we 
often have no exact knowledge of the popula- 
tion distribution ®(®,) except that it belongs 
to a family 2 —(P,|0€ ©} of probability mea- 
sures on 2! (4/"). We call 0 the parameter of 
the probability distribution and © the para- 
meter space. The typical cases described in this 
section can be extended as follows: (1) The 
distribution may be an r-dimensional proba- 
bility distribution. In this case a sample of size 
n induces an nr-dimensional sample space. (2) 
Random variables X,, ..., X,, being mutually 
independent, may not have identical distri- 
butions. (3) Random variables X,,..., X, may 
not be mutually independent. In both cases 
(2) and (3) the sample space is of the form 
(R", 2", ®,), but n may not be the sample size 
itself, nor P, be the direct product of n copies 
of identical 1-dimensional components. (4) The 
most general sample space is expressed as a 
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certain measurable space (X, £) and a family 
4 —(P,|0€ G9) of probability measures on ai. 
A statistic, in general, is a random variable 
expressed as Y — f(X) by a measurable func- 
tion f defined on a sample space (X, £) taking 
values in another meaurable space (ON. €). 
When (Y, €) is (R, 4!) or (Rn, 2"), Y= f(X) 
is accordingly called a 1-dimensional or n- 
dimensional statistic. 


C. Population and Sample Characteristics in 
the 1-Dimensional Case 


In a 1-dimensional probability space (R, 

48!, P3) the following quantities, called popu- 
lation characteristics, are used to characterize 
the population distribution P,: Letting F(z) — 
P3((—00, z]) be the fdistribution function of 
Py, we use the population mean j= | z dF(z); 
the population variance c? = f (z — u)? dF (z); the 
population standard deviation c( > 0); the popu- 
lation moment of order k vu, =f z* dF (z) (y, = u); 
the kurtosis 44/04; the coefficient of excess 
(44/0) — 3; the skewness 2/07; the o-quantile 
or 10027;-point m satisfying F(m—0) «oa « 
F(m+ 0); the median, which is the 50%-point; 
the first and third quartiles, which are the 
25%-point and 75°%-point, respectively; the 
range, which is the third quartile minus the 
first quartile; and the mode, which is the value 
or values of z for which dF (z)/dz attains its 
maximum. 

Sometimes the kurtosis and others are 
called population kurtosis, etc. Here the word 
"population" is used when it is desirable to 
distinguish population characteristics from the 
sample characteristics defined in 341 Proba- 
bility Measures. 

Let x 2(x,, ..., x,) be a point of an n- 
dimensional sample space (a sample value). 
Corresponding to each 1-dimensional Borel 
set A, the number of components of x that 
belong to A is called the frequency of A in the 
sample value x —(x,, ..., x,), and (frequency)/n 
is called the relative frequency of A. If we take 
À — (—00, z] and regard its relative frequency 
F(z) as a function of z, it becomes a 'distri- 
bution function for every x eR", called the 
empirical distribution function based on x. 

Various characteristics can be defined from 
the empirical distribution function in exactly 
the same way as population characteristics are 
derived from a population distribution func- 
tion. These are called sample characteristic 
values and can be expressed as functions of 
Nas 1525. Xue 

Assuming that x —(x,,...,x,) is a sample 
value of a sample X =(X,,..., X,), the statistic 
obtained by substituting X for x in the func- 
tion denoting a sample characteristic value is 
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called a sample characteristic and given the 
same name as the corresponding population 
characteristic, except that the word "popula- 
tion" is replaced by the word "sample." A 
sample characteristic is a function of random 
variables. Hence it is also a random variable, 
and the problem of deriving its probability 
distribution from the assumed population 
distribution is called that of sampling distri- 
bution (— 374 Sampling Distributions). Thus 
we define the sample mean X = X7 , X;/n; the 
sample variance 17. , (X; — X)?/n (sometimes 
D7_1(X;— X)?/(n— 1) is taken as the sample 
variance); the sample standard deviation 


[Y (X,— X)^/n, 


which is the positive square root of the sample 
variance; the sample mode, which is the value 
taken by the largest number of X;; and the 
sample moment of order k 27 ,(X;— X)*/n. 
Among other statistics of frequent use are 
the order statistic, i.e., the set of values of 
X;,..., X, arranged in order of magnitude and 
usually denoted by X, < X5, < ...« Xo. 
Various other statistics are defined in terms 
of order statistics: the sample median X,,., = 
X (12; for odd n and = (X m2) + X(,,5,44/2 
for even n, the sample range R = max X;— 
min X; — X,,, Xa and so on. The empir- 
ical distribution function F,(z) or its standard- 
ized form $,(2) — n {F,(z)— F(z)) can also be 
considered to be a function of the order sta- 
tistics, and hence is a statistic taking values in 
the space of functions of a real variable. So is 
the empirical characteristic function 


¢,(t)= [expats dF,(z) =}, exp(itx,))/n. 


In a sequence of Bernoulli trials, a set of 
successive components with an identical value 
is called a run. For example, (01100010) has a 
run of 0 of the length 3 and a run of 1 of the 
length 2. 

Among the statistics listed in the previ- 
ous paragraphs, the order statistic is an n- 
dimensional statistic and all others are 1- 
dimensional. 


D. Other Cases 


Let (R?, B?, P) be a 2-dimensional probability 
space with a 2-dimensional population distri- 
bution P,, and let (X,,..., X,) (X;=(U;, V)) be 
a random sample of size n from P}. In this 
case also, the population characteristics for 
the tmarginal distributions of U, and V; and 
sample characteristics for (U,, ..., U,) and 
(Vi, ..., V,) are defined as in Section C. 

As an index for association between U; and 
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V; the population covariance f(u — up dir — 
H2) dF(u, v) of U; and V; and the population 
correlation coefficient, which is equal to (popu- 
lation covariance)/o,,0,;,, are defined. Here 
F(u, v) is the joint distribution function of U; 
and V,, Gen: and Ha are the respective popula- 
tion means of U; and V,, and ou and o) are 
the respective standard deviations. As corre- 
sponding sample characteristics, we have the 
sample covariance Y, (U, — U)(V, — Hun of 
(U,,...,U,) and (V,, ..., V,) and the sample 
correlation coefficient 


Y U-0V-7) 


m 1/2 n A 1/2? 
(Suo) Eu) 
i-1 i=1 


where U =£} U;/n and V ED: V;/n. 
Similarly, statistics of the samples from a 
population of k-dimensional distribution 
(k 2 3) can be defined (— 280 Multivariate 
Analysis). More generally, in statistical in- 
ference we encounter samples where observed 
values may not be mutually independent or 
identically distributed, but have more com- 
plicated probability structures. Statistics as 
functions of such samples are also considered. 





Ms: 


Il 


E. General Properties of Statistics 


The general theory of statistics has been 
studied in a measure-theoretic framework. 

(€, I, FP) is called a statistical structure, where 
(4^, . 7) is a measurable space and 2 is a family 
of probability measures on (X, £). A a- 
subfield Z of . (hereafter abbreviated o-field) 
is called sufficient for 2 if for any Ae. there 
exists a -measurable conditional proba- 
bility of A independent of Pe Z, that is, a 2- 
measurable function @,(x) satisfying 


P(AD B)= | 94 G9) d Pox) 
B 
for all P€? and Be 2. 

For any two o-fields Z, and 2,, the nota- 
tion 4, c AM [4 means that to each set A, 
in &, there corresponds an A, in &, satisfy- 
ing P,((A, — A,)U(A, — A,)) 30 for all Pe 2P. 
When the reverse relation 2, c 2, [4] also 
holds, we write 2, = 22, [4]. 

For a statistic t which is a measurable func- 
tion from (2, £) to (Y, €), (t) 2 (B| Bex, 
t(B)e €] is a o-field and is called the o-field 
induced by t. If A(t) is sufficient for 28. t is said 
to be sufficient for 2. Since sufficiency of a 
statistic means that of a c-field, we consider 
only sufficiency of a o-field. 

B is called necessary if for any sufficient 2, 
we have 4 c 24,[ 2]. A necessary and sufficient 
c-field is called a minimal sufficient c-field. A 
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necessary and sufficient statistic is also called 
a minimal sufficient statistic. Such a statistic 
does not always exist; 4, containing a suffi- 
cient 4, is not always sufficient. 

B is said to be complete if for every 2- 
measurable integrable function o, |y p(x)dP,(x) 
=0 for all Pe? implies P,({x| p(x) 40})= 
0 for all Phe. B is said to be boundedly com- 
plete if for every bounded Z-measurable o. 
fr p(x) dP,(x)=0 for all P€? implies P,( {x | 
(x) 40})=0 for all P,e 2. When A(t) is 
(boundedly) complete, t is called (boundedly) 
complete. If 4, A, and &, is (boundedly) 
complete, 4, is also (boundedly) complete. If 
B, is (boundedly) complete and sufficient and 
B is minimal sufficient, we have 4, = 4[F]. 


F. Dominated Statistical Structure 


When all Pe are absolutely continuous with 
respect to a o-finite measure 4 on £, then 

(X, A, FP) is said to be a dominated statistical 
structure and 2 is said to be a dominated 
family of probability distributions. In this case, 
P, has the density f;(x) 2 dP4/d4 with respect to 
4 by the Radon-Nikodym theorem. If og is 
separable, 2 is a separable metric space with 
respect to the metric of, , Pj.) — sup. | Ps, (B) 
— P, (B)|. There exists a countable subset A’ = 
{Pa > Po... j of P such that P,(N) —0 for all 

P e ?' implies P,(N)=0 for all Pe 2. If we put 
4o = XC; Ps, c; 7 0, Lic; Ll, 49 dominates 2, 
and if Z is sufficient for A we can choose a 2- 
measurable version of dP, (din, Conversely, if 
there exists a o-finite measure å such that we 
can choose a Z-measurable version of d P/dA 
for all PE Z, then 2 is sufficient. 

If 2 is dominated by a o-finite 2, 2 is suffi- 
cient if and only if there exist a Z-measurable 
gy and an ./-measurable h independent of 0 
satisfying 
P. ad a.e. (£, Al for all Pe 
da 
This is called Neyman’s factorization theorem. 

With a dominated statistical structure, there 
exists a minimal sufficient o-field, and a o-field 
containing a sufficient c-field is also sufficient. 

We say that a o-field Z is pairwise sufficient 
for 2 if it is sufficient for every pair { P, , P, j 
of measures in 28. A necessary and sufficient 
condition for 28 to be sufficient for a domi- 
nated set 2? is that # be pairwise sufficient for 

Recently, a more general statistical struc- 
ture has been studied. Put «,(u) —- (A| Ae A, 
(A) « œ}. A measure non . is said to be a 
localizable measure if there exists ess-sup F (p) 
for any subfamily F <.¥,(,), that is, if there 
exists a set Eed such that u(A — E)=0 holds 
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for all AEF, and p(A—S)=0 for all ACF 
implies u(E —S)=0. A o-finite measure is 
localizable. A measure u is said to have the 
finite subset property if for any A satisfying 0< 
u(A), there exists a Bc A satisfying 0 < p(B) < 
oo. A statistical structure (X, o. P) is said to 
be weakly dominated if 7 is dominated by a 
localizable measure u with the finite subset 
property and a density dP,/du exists for all 
P€ 2. In this case a minimal sufficient o-field 
exists, and a pairwise sufficient c-field is suffi- 
cient. For example, let d — 2^ and 2 be the 
set of all discrete probability measures on oi. 
# is weakly dominated by the counting mea- 
sure u which is localizable on æ. 

The order statistic 1s sufficient if 28 is domi- 
nated, X,,..., X, are mutually independent 
and identically distributed random variables 
with 2 =R", and each P,€ 2 is invariant under 
every permutation of the components of the 
points x —(x,, ..., x,) in X. Moreover, the 
order statistic is complete if 2 is large enough. 
For example, we have the following theorem: 
The order statistic is complete if every P? (the 
component of P, on 4 <4’, 0e 9) is ab- 
solutely continuous with respect to the Lebes- 
gue measure / on R and (P? |0€ 9] contains 
all P? for which g(z) - dP?/dl is constant on 
some finite disjoint intervals in Z c R. A 
similar result holds for discrete distributions. 

We call 0 a selection parameter when f(x) 
= c(0)yyg (x)h(x), where h(x) is a positive 2- 
measurable function, y; (x) is the indicator 
function of a set E€ Z, and c(0) is a constant 
depending on 0. Here 0 determines the carrier 
E, of f,(x) but does not essentially affect the 
functional form of f;(x). A necessary and suffi- 
cient statistic is given by t*(x) 2 () (Ej| Ega x, 
P€ 2j. Here the class of sets of the form given 
in the right-hand side of this expression is 
taken as Y, and we set € 2 (C|C c 9,t* '(C)e 
B). We call t*(x) the selection statistic. Two 
examples follow. 

(I) *Uniform distributions. Let © = | (o, f)| 
—o0 «ax « [I « oo], Zus R, P? be the uniform 
distribution on (a, f), and X Z(X,...., X,) bea 
random sample of size n having P? as its popu- 
lation distribution. Then E, — (x|x < min; x; € 
max; x; <P} and f(x) (B — a) "xy (x). If we 
put t(x) ^ (min, x;, max; xj), Y RI, and € = 
the set of all Borel sets of R?, it follows that 
t(x) — t*(x) [27], where t*(x) is the selection 
statistic. Hence t(x} itself is necessary and 
sufficient. 

(II) *Exponential distributions. Put © = 
(—00, o0), Xo =R, and 


" ge 72-9) zz, 
Z)-— 
dee gi z «0, 


where « is a known constant, and let X = 
(X,,..., X,) be a random sample of size n 
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having a population distribution with density 
function g,(z). Then 


Jare) 


f(x) 2e CET ) 


If we put t(x) 2 min; x; and let t*(x) be the selec- 
tion statistic, it follows that t(x) —- t*(x) [27], 
and t(x) is necessary and sufficient. If © = 

(ia, Olne oo, —oo « 0 « oo], then t(x)= 
Tomm, x;, E; x;j is a necessary and sufficient 
statistic. 


G. Exponential Families of Distributions 


A dominated 2 is called an exponential family 
of distributions if and only if /;(x) - dP4/d4 can 
be expressed in the form 

Ck 


ps a 


s(x)ag(0) + x9 (0) + I 


xeX, VEO, (1) 


where the s,(x) (j —0, 1, ..., k) are real-valued 
#-measurable functions and the «,(@) ( j — 0, 
1,...,k) are constants depending on 0. If there 
exists a sufficient statistic for 7 that is not 
equivalent to but is in a certain sense simpler 
than the sample itself or the order statistics, 
then it can be shown under some regularity 
conditions that 28 must be an exponential 
family. The following theorem provides an 
instance of the hypotheses that guarantee 
such a conclusion: Let X be a sample from 

a l-dimensional probability space (Z), 2o, Py) 
with P? the population distribution, where Zo 
is a finite or infinite interval in R and & is 
the class of all Borel sets. Let / denote the 
Lebesgue measure. Assume that (P?! is domi- 
nated by l and g,(z)=dP)/dl is greater than a 
positive constant and continuously differenti- 
able in z on Za, Assume further that there 
exists a sufficient statistic t(x) with the prop- 
erty that for each open subset B of 7 (c R") 
and 4-null set N there are two points x #=x’ 

in B — N such that t(x) Z t(x'). Then 2 is an 
exponential family, and the k given in (1) is 
less than n. Similar results are known also 

for cases where X,, ... X, are not identically 
distributed. 

It is evident from the construction of a 
necessary and sufficient statistic that the 
statistic t(x) — (s, (x), ...,5,(x)), where the s(x) 
are those appearing in (1), is sufficient for an 
exponential family and necessary if o, (6), ..., 
x, (0) arc linearly independent. If 1 (x, (0), ..., 
&4,(0))|Üe 9) contains a k-dimensional interval, 
t(x) is complete. The distribution of t(x) is of 
exponential type. When X,,..., X, are mutu- 


few (Y. 


fime ($ 
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ally independent with a common distribu- 
tion of exponential type, the distribution of 
X =(X,,...,X,,) is of exponential type, and 
vice versa. The family (1) of distributions is a 
special form of *Pólya-type distributions, and 
various distributions given in (IIT)-(VIT) below 
are written in this form. In the following exam- 
ples, for a sample of size n from the specified 
distribution, f(x) is the density with respect 
to Lebesgue measure in (III), (IV), and (V) 
and to counting measure in (VI) and (VII) 
(— Appendix A, Table 22). 

(III) *Normal distributions N (u, 07), Xo = 
(—20, o0). 


fime - P xi ka (5s J5 


JC nog togas }. 


(IV) !T-distributions T (p, o), 2, —(0, oo). 


n 1. 
fux) «exp (v- D», logxi—— 2 Xi 


—nplogo — nlog rn). 


(V) tExponential distributions e(u, o), Yo = 
(u, 00). 


el - (5 KEEN 
i-l o o 


(VD) *Binomial distributions Bin(N, p), % = 
(0, 1, ..., N}. 


joes +nN log(1— p) 
p 


d) 


(VII) *Poisson distributions P(A), 4 — 
10.123... 


DË È logs) =n) 
i-1 


izi 


H. Ancillary Statistics 


A statistic t(x) is called an ancillary statistic 
when for every element A in (t), P,(A) is 
independent of 0, or in other words, when the 
distribution of t(x) is independent of 0. A suffi- 
cient condition for a statistic to be ancillary is 
that it is independent of some sufficient statis- 
tic. Conversely, an ancillary statistic is inde- 
pendent of all boundedly complete sufficient 


statistics. 


L Invariant Statistics 


Suppose that we are given groups of one-to- 
one measurable transformations G and G on 
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X and O, respectively. Suppose also that we 
are given a thomomorphism gg from G to G 
satisfying Pio ! B) - P4,(B). In this case P= 
(P,|0€ 0] is called G-invariant. If G is transi- 
tive, there exists a fixed element 0, of O that is 
sent to an arbitrary 0 by an element g of G. In 
this case, 0 is called a transformation para- 
meter. In particular, if © =R, X is a random 
sample from a population distribution, and 
P,(B)= P, (B— 0), where B—0—(x|(x, + 

0, ..., x, -0)e B}, then 0 is called a location 
parameter. When © — (0, oo) and P,(B) — 
P,(B/0), where B/0 — (x|(0x,, ...,0x,)e B}, 0 

is called a scale parameter. Now assume that 0 
is a combination of these two kinds of param- 
eters such that 0 — (a, B) (—oo «x« oo,0 « 

p « o») and P,(B)= P, ((B — x)/B), where 0, = 
(0, 1). Then if 2 is an exponential family, (1) 

of Section G can be written as 


dP? | m (Gay 

di (À d j ) 
where the k; (j —0, 1, ..., m) are constants. 

We call t(x) an invariant statistic with re- 
spect to a general transformation group G 
when t(gx) — t(x) for all ge G and xe Y. An 
invariant statistic is said to be maximal invar- 
iant with respect to G if, for t(x) — t(x’), there 
exists a ge G such that x —gx'. If tọ is maximal 
invariant with respect to G, a statistic t is 
invariant under G if and only if t, (x) 2 t(x’) 
implies t(x) = t(x’). 

When 2 is G-invartiant, a set A (€.9/) is called 
G-invariant tf gA =A for all ge G. We denote 
by .«/? the set of all G-invariant sets in Z. 

s? is clearly a o-field. A set A (c «7) is called 
almost G-invariant if gA = A Lo, P) for all geG. 
We denote by a? the o-field consisting of all 
almost G-invariant sets. If Z is G-invariant, 2 
is sufficient for 28. g#=Z for all ge G, and 
moreover, B? = B* (A, P), then 2° is a suffi- 
cient o-subfield of gi. where 429 — 4.4? and 
48* — 0. *. 


g.(z) 


J. Various Definitions of Sufficiency 


There are many different definitions of suffi- 
ciency, and the relations among them have 
been investigated. A o-field 2 is called decision- 
theoretically sufficient or D-sufficient if for a 
given s -measurable decision function 6 there 
exists a Z-measurable decision function 6’ 
such that 


| ó(x, bani- | ó'(x, E)dP,(x) 
D x 


for all Ee 2, Pye, where a decision space 

(D, 2) is quite arbitrary. Z is called test suffi- 
cient if for any given -measurable test func- 
tion q, there exists a Z-measurable test func- 
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tion g’ satisfying E,(@~) = Edel for all Pje 2. 
Let (©, @) be a measurable space of parameter 
0 and Ó be the set of all probability measures 
on €. Moreover we assume that P,(B) is €- 
measurable as a function of 0 for any fixed 

Be X. For any £c Ó, we define Az by 


BEE o-[ P(A)d£(0, | Aes, Cet. 


We denote by D the extension of 4, to x @. 
B is said to be Bayes sufficient if 


E; (Uz. cl x @)=E; Us, cl x €) 


for all £e Ó, Ce€, that is, the a posteriori 
distribution on © given d coincides with that 
given & for any a priori Z When 2 is domi- 
nated, these definitions coincide with the class- 
ical definition of sufficiency. Generally, a D- 
sufficient c-field contains at least one sufficient 
o-field. A c-field containing a sufficient c-field 
is Bayes sufficient. Hence Bayes sufficiency 
follows from D-sufficiency and from classical 
sufficiency. If a D-sufficient o-field is separable 
it is sufficient. 

The notion of prediction sufficiency or ade- 
quacy was defined by Skibinsky [10]. Let 
(X, Y) be a pair of random variables defined 
over the probability space (£ x Y, S x B, A). 
We suppose that X is the sample to be ob- 
served, and Y is (are) the value(s) of future 
observation(s) about which we are to make 
prediction(s) based on X. X and Y have joint 
probability distribution with an unknown 
parameter. A statistic T= T(X) or a subfield 
€ of xX is said to be prediction sufficient or 
adequate if (a) given T, X and Y are condition- 
ally independent (or given €, d and & are 
conditionally independent or Markov) and (b) 
T is sufficient for X (€ is sufficient for æ). It 
was proved that in any form of prediction on 
Y, we may restrict ourselves to the class of 
procedures that are functions of T (or are €- 
measurable). 
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397 (XVIII.1) 
Statistical Data Analysis 


A. Statistical Data 


Statistical data analysis is comprised of a col- 
lection of mathematical methods whereby 
we can deal with numerical data obtained 
through observations, measurements, surveys, 
or experiments on the "objective" world. The 
purpose of statistical analysis is to extract the 
relevant information from that numerical data 
pertinent to the subject under consideration. 
The nature and the properties of the subject 
and also the purpose of the analysis may vary 
greatly. The subject may be physical, biolog- 
ical, chemical, sociological, psychological, 
economic, etc. in nature, and the purpose of 
the analysis can be purely scientific, as well as 
technological, medical, or managerial. Because 
of the great diversity of statistical data, the 
methods of statistical data analysis and the 
manner of application should differ greatly 
from situation to situation; we cannot expect 
a single unified system of methods to be ap- 
plicable to all cases. Nevertheless, we have 
several formal methods of statistical analysis 
that are more or less mutually related and 
have been successfully applied to most, if not 
all, statistical data. 

Statistical data can be classified into several 
types according to a few criteria: according 
to the property of each observation or mea- 
surement, they can be either quantitative or 
qualitative; according to whether only one 
Observation is made on each object under 
investigation or many observations on the 
same object, they can be either univariate or 
multivariate; and according to whether the 
Observations are made at one time or consecu- 
tively in the course of time, they may be either 
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cross sectional or time series. Each different 
type of statistical data requires a different 
type of procedure (— 280 Multivariate Anal- 
ysis, 421 Time Series Analysis). 


B. Frequency Distributions and Histograms 


Statistical data have the simplest structure 
when they consist of a collection of observa- 
tions made on an aggregate of objects sup- 
posedly of the same kind. Such an aggregate is 
usually called a population, and the number of 
its members (its size) is denoted by N. When 
the data are qualitative or categorical, each 
member of the population is classified into 
several types according to some criteria, the 
data consist of the numbers of the members 
of the population classified into each of the 
categories. Such numbers are usually called 
frequencies, and the set of frequencies is called 
the frequency distribution. 

When the data are quantitative and univari- 
ate, one quantitative attribute of each member 
of the population is observed, and the results 
are given as a set of N real numbers (x,, X2, 
..., Xy). When N is large, as is usually ex- 
pected, it is necessary to summarize these 
results in some manner. One common method 
is to tabulate the frequency distribution: We 
define a certain number of intervals (o, ,, aj], 
i-1,...,K, ag «a, €... «ag, ay min x;, 
max x; € ay; and we count the numbers of 
those x's falling within each of the intervals 
and tabulate those numbers or frequencies f;, 
i— 1,2, ..., K. Frequency distribution is often 
represented in the form of a histogram, where 
the endpoints of the intervals are marked on 
the horizontal axis, and above each interval a 
rectangle of area proportional to the frequency 
for the interval is drawn. It is usually recom- 
mended that the widths of the intervals in the 
frequency distribution be equal, especially 
when it is to be represented by a histogram. It 
is, however, often impossible or impractical to 
do so, and sometimes a logarithmic or other 
functional scale is used in the abscissa of the 
histogram; then it is desirable that the inter- 
vals of the transformed values are approxi- 
mately of equal lengths. The number K of the 
intervals should also be of an appropriate 
magnitude, neither too large nor too small; K 
is often constrained by the size N of the popu- 
lation, the shape of the distribution, or other 
factors. Usually, K is chosen to be between 6 
and 20. 

From the frequency distribution, we obtain 
the cumulative distribution by associating with 
each endpoint a; of the intervals the number F, 
of x's not greater than a;, namely, F;= Y;.; f; 
The curve obtained by connecting K + 1 points 
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of coordinates (a;, F,), i—0, 1, ..., K, by linear 
segments is called the cumulative distribution 
curve (or polygon). 


C. Characteristics of the Distribution 


In order to summarize univariate quatitative 
data, various values are calculated from the 
values x,, ..., xy. Such values are called statis- 
tics (singular, ‘statistic) and are used to char- 
acterize the distribution of the values. Various 
types of statistics characterize different aspects 
of the distribution: 

(a) Representative value or measure of loca- 
tion: a value which is supposed to give the 
"representative," “typical,” or “most common" 
value in the population. By far the most com- 
monly used measure is the mean X= X? , xj/N. 
X is sometimes called the arithmetic mean, and 
some other "means" are also calculated: es- 
pecially when all the values are positive, the 
geometrical mean Xg — (T1, x; ^) or harmonic 
mean Xy —(X(1/x)/N) ! may be calculated; 
more generally, for some monotone function 
f(x) we can calculate the f-mean by X; = 
f (Xi f(xyy N), of which the geometric and 
the harmonic means are special cases. Another 
measure of location is the median, which is the 
value in the population located exactly in the 
middle of the ordering of the magnitudes; more 
precisely, if x, « x, « ... un are the values 
in the population arranged according to their 
magnitudes, the median Xmea = ue ua for odd 
N, and —$(xuyj + Xquj2)41) for even N. The 
mode is also sometimes used; this is defined as 
the value (usually the center) corresponding to 
the highest frequency. 

(b) The measure of variability or dispersion 
shows how widely the values in the popula- 
tion vary. The most common measure is the 
standard deviation, which is defined by s, — 

J Xi(xi — XY /N , and its square is called the 
variance V2. A similar measure is the mean 
absolute deviation D, = 2/,|x; — XI/N. Another 
type of a measure of dispersion is the range R, 
— max x; — min x;, and the interquartile range 
Q,— Xana — Xia and more generally the 
interquantile range X,,y, — X( -em for some a. 
The ratio of the standard deviation to the 
mean is called the coefficient of variation (C. V. 
for short) and is used as a measure of relative 
variability when all the values in the popula- 
tion are positive. 

(c) Characteristics often used to characterize 
the "shape" of distributions are the moments 
(around the origin) m, = N ^! Y; x* and the 
central moments (moments around the mean) 
M,— N ^! Xi(x;— XY for a positive integer k; for 
a specific k these are called the kth moments. 
Central kth moments with odd k are equal to 
zero when the distribution is symmetric, i.e., 
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when the histogram is symmetrically shaped. 
Hence the third moment or its ratio to the third 
power of the standard deviation s? is used as a 
measure of the asymmetry of a distribution; 
this is called the skewness. The fourth moment 
is large 1f there are some values which are far 
off from others and small when all values are 
concentrated; hence it tends to be large when 
the histogram has a rather sharp peak in the 
center and has a long tail in either direction 
or both, and tends to be small when the histo- 
gram is flat in the center and drops off sharply 
at both ends. Accordingly, the ratio M,/V2 is 
used as a measure of long-tailedness of the 
distribution; this ratio minus 3 is called the 
kurtosis. 


D. Theoretical Frequency Distribution 


When the observed values can be any real 
number (sometimes in an interval), the size of 
the population N is increased indefinitely, and 
the widths of the intervals are decreased to 

0, the histogram is expected to approach a 
smooth curve. And in the limit when N is 
infinity, we can assume that the distribution is 
represented by a mathematically well-behaved 
function f(x) and that the ratio of the numbers 
of those values in the population within the 
interval (a, b) to the size of the population 
approaches |? f(x) dx. Such a function f(x) is 
called the frequency function or density func- 
tion. Various types of functions have been 
proposed and used as "theoretical" frequency 
functions to approximate the actually ob- 
served frequency distributions. The most im- 
portant is the normal density function 


1 1 
p(x) = exp] ` EEN 
c. / 2n 26 


The following density functions most com- 
monly appear in applications: the gamma 
density, f(x)=x?~' exp(—x/a)/a’T(p) for x>0 
and =0 for x <0; the beta density, f(x)= 

x" !(1—xy !/B(p,q) for O<x<1 and =0 
otherwise. 

We can conceive of a population of infinite 
size with some density function; the term 
theoretical distribution is used to mean such 
a population with its density function, and 
more specifically the tnormal distribution, etc. 
Such a population and associated density is 
often called a continuous distribution. For a 
theoretical distribution, the mean, variance, 
and moments are naturally defined by 








DE | xf(x)dx, o= | (x — ug f(x)dx, 


Hy [e —u*fGe)dx, m= Ju (x) dx. 
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It should, however, be noted that the mean, 
the variance, or the moments may not exist for 
particular distributions. 

K. Pearson introduced a system of density 
functions defined as solutions of the differen- 
tial equation 


din f (x)/dx ^ (A + Bx)(C + Dx + Ex’), 


where A, B,..., E are constants. A distribution 
thus obtained is called a Pearson distribution. 
The normal, gamma, and beta distributions 
together with some other commonly used dis- 
tributions, such as the t- and F-distributions, 
are Pearson distributions. 


E. Measure of Concentration 


When all the observed values are nonnegative 
in nature, we may sometimes require some 
measure of inequality or concentration of the 
distribution. For such a purpose we order the 
Observed values according to their magnitudes 
and obtain xa) € x, €... X Xy; we define S; = 
Lj<iX for i=1,...,N, and Sy=0, plot N 

1 points (S;/Sy,i/N), 1—0, 1, ..., N, and con- 
nect them by line segments. The graph thus 
obtained is called the Lorentz curve or the 
curve of concentration, and it connects the 
origin and the point (1,1). It lies below the 45° 
line, and if all the values are nearly equal the 
curve comes close to the 45? line, but if values 
are widely unequal, the curve comes close to 
the horizontal axis and suddenly jumps to the 
point (1,1). The area between the curve and the 
45* line is called the area of concentration, and 
it is equal to one-fourth of the mean difference 
ó divided by the mean, where ó is defined by 


1 
ð=— Yý ý [x;— xi. 
GK J 


G — ó/x is called the Gini coefficient of con- 
centration and is used as a measure of con- 
centration or inequality of distribution. Other 
measures, including the coefficient of variation, 
are also used to represent the concentration. 


F. Discrete Distributions 


There are cases where the observed values 
are taken only from the nonnegative integers, 
e.g., the number of individual animals of a 
specific species in an area, of accidents during 
a specified time, etc. In such cases, when we 
increase the number of observations, the distri- 
bution does not approach one with a contin- 
uous density function but rather one with 

a certain theoretical discrete distribution. 
Among theoretical discrete distributions, the 
most commonly used are the binomial distri- 
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bution: f;—,C;p/(1 — p)" for O<j <n, and 
the Poisson distribution: f, — e 7 ///j! for j= 
0, 1, .... The hypergeometric distribution: f; — 
MG x- MC, wa for max(0, M +n— N)« 
j&min(n, M), and the negative binomial distri- 
bution: f;— ;., C, , P'(1 — pY, j— 0, 1, ..., are 
also often used. 

For discrete distributions we can define 
moments by u= X; jf; and u= AT uf, and 
ss Xj f, Kee c 


G. Generating Functions and Cumulants 


For a theoretical distribution with the density 
function f(x), the moment generating function 
M (0) is defined by M(0)= fe f(x)dx. When 
M(0) is well defined in an open interval includ- 
ing the origin, the distribution has all kth 
moments, and it can be expanded as 


1 1 


from which the term “moment generating 
function” is derived. When is replaced by it 
with real t, we have the characteristic function 
Q(t) M (it), which can be expanded as 


VUE I quw 
$()-—1 tus) ml Trein) 
*o(lu^ 


if the distribution has moments up to the 
kth. The function K(0) 2 In M (0) is called the 
cumulant generating function, and the coeffi- 
cients x; in the expansion 


K(0)- &,0-- 20? +... 


are called the cumulants. The kth cumulant x, 
is expressed as a polynomial of the moments of 
order not exceeding k; thus 


Kim, K2= hy Hy, Kyu Anik + Aur, 
etc. 


For the normal distribution, 





2g? 
Mice n+" ` 


hence 


22 


Eddi 
K(0)— n8 ——, 





and the kth cumulant for k 23 is equal to 0. 
Cumulants are used as measures indicating 
whether the distribution is close to or different 
from the normal. 

For discrete distributions, the moment 
generating function is defined as M(0) — 
X efit, but the probability generating func- 
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tion P(t) - E f;t/ = M (Int), the factorial 
moment generating function M (0) — P(0 + 1), 
and the factorial cumulant generating functions 
K(t) «1n M (t) are also of use. The coefficient of 
t in the Maclaurin expansion of M (t) is ex- 
pressed as uj and called the kth factorial 
moment; it is equal to y=}; j — 1)...(j — 
k+1)f;. That in the expansion of K (t) is the 
factorial cumulant. The factorial cumulant vu 
is expressed by the same polynomial in uj, as 
Ky 1s expressed in uj. For the Poisson distri- 
bution, M(0) - exp A(e® — 1); hence R(t) - At 
and it follows that the factorial cumulants kj 
for k 22 are all equal to zero if and only if the 
distribution is Poisson. 


H. Bivariate Distribution 


When two quantitative observations are ob- 
tained for each member of a population of 
size N, the results are given as N pairs of real 
numbers (x;, yj), i= 1,2, ..., N. Such data are 
called bivariate data and the distribution, 
bivariate distribution. Those data can be illus- 
trated as N points in a plane with coordinates 
(x; yj), and such an illustration is called a 
scatter diagram. In order to characterize a 
bivariate distribution, we often use bivariate 
moments l 

1 
Mi= gav, 
where x and y are the means of x and y, re- 
spectively; especially, the (1,1) moment M, , 
is called the covariance and is denoted as 
Cov(x, y). The most often used measure of 
the strength of the relation between x and 
y values is the correlation coefficient r, , = 
Cov(x, y)/SxSy, where s, and s, are the stan- 
dard deviations of x and y. It is easily shown 
that —1 <r, „<1, and when there exists a 
nearly linear relationship between x and y 
values, r, , is close to either +1 or —1 accord- 
ing to whether the x and y values change in 
the same direction or in opposite directions. 
When there is no clear relationship between x 
and y, the correlation coefficient is close to 
zero, but it may not be a good measure of the 
relationship when x and y values are related 
nonlinearly. 

A linear function y — a + bx is called the 
linear regression function of y on x, for which 
the sum of the square distances X;(y;—a— 
bx;)? is minimized. For the linear regression 
function the coefficients a and b are deter- 
mined by b — Cov(x, y)/s2 and a « y — bx, and b 
is called the regression coefficient. We have 
that Lily; —a— bx)? /Xyi— y? -1-— "ips Le., 
that the square of the correlation coefficient is 
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equal to 1 minus the ratio of the variance of 
the residual y, — a — bx; to that of y; hence it is 
sometimes called the coefficient of determina- 
tion. Similarly, the linear regression function 
of x on y is defined by x «c-r dy, where d= 
Cov(x, y)/s? and c - x — dy. We have bd —r2,, 
and |1/d|=|b/r2,|> |p|. 

We can tabulate the bivariate frequency 
distribution by splitting the range of x values 
into K intervals (a;_,,a,;], i=1,.... K, and the 
range of y values into L intervals (b;.,, bj], j= 
1, ..., L, and counting the number f; of cases 
for which a. «x <a, and bh, < yx bj. In 
contrast to the bivariate frequency distribu- 
tion, the distributions of x and y values are 
called the marginal distributions. 


L Bivariate Density Function 


As we did for the univariate distribution, we 
can consider the limiting shape of the bivariate 
frequency distribution when the size N of the 
population tends to infinity and define a con- 
tinuous bivariate distribution with density 
function f(x, y), with which the ratio of those 
members in the population with values (x, y) in 
a set S in a plane is given by [fs f(x, y) dx dy. 
The bivariate density function is also called 
the joint density, and then the density functions 
of x and y are called the marginal density 
functions and are given by f, (x) — f f(x, y dy 
and f;(y) — f f(x, y) dx. The joint moments of a 
continuous bivariate distribution are defined 
by uy, lte Y Qr — ua) fc. y) dx dy, where 
Uu and p, are the means of x and y, respec- 
tively. The joint moment generating function is 
defined by M(t,, t;) - ffet» f(x, y) dxdy, and 
the cumulant generating function by K(t,,t,) 
=log M({¢,,t,), from which the joint cumulants 
k+l 
et Br Oe K(t,.t5)], =0,1,=0 
are derived. 

The conditional density of y given x is de- 
fined by f(y|x) — f(x, yy/f, (x), and the distri- 
bution with this density function is the con- 
ditional distribution of y given x; this latter can 
be interpreted as the distribution of y of those 
members in the population with x values in 
the interval (x, x 4- dx], where dx is small. The 
conditional density and the conditional distri- 
bution of x given y are similarly defined. The 
mean and the moments of the conditional 
distribution are called the conditional mean 
and the conditional moments. The conditional 
mean of y given x, considered as a function of 
x, Is called the regression function of y on x. 

By far the most important theoretical bivari- 
ate density is the bivariate normal density, 
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which is given by 


f(x, y) Qno o4 /1— p?) ' 
vs 1 (x— uy Una 
PI- e? 


oi o2 
~2p SAN Ho 


010? 








for which the mean of x is u, and that of y is 
Hl, the variances of x and y are of and o2, 
respectively, and the covariance of x and y is 
equal to po,6,. The bivariate normal distri- 
bution has several remarkable properties: All 
the k, l joint cumulants are equal to zero for 
k +1>3; the marginal distributions of x and 
y are normal; the regression functions of y on 
x and x on y are both linear; the conditional 
distribution of y given x (and x given y) is 
normal and the conditional variance is con- 
stant; and the contours f(x, y) ^ c for different 
values of c are equicentric ellipsoids. 


J. Higher-Dimensional Data 


When the data are of more than two dimen- 
sions, i.e., more than two observations are 
made on each of the objects, we designate the 
data by Nk-tuples of real numbers (xj,, xj, 
nk im 1,..., N (k23). Then we can calcu- 
late the moments of each of the variates and 
the joint moments, which are defined by 


va ub — X) (xi; X32 ... (x4 — X^. 
Also, we can arrange the variances and covar- 
iances in a symmetric matrix of order k, and 
we call it the (variance-) covariance matrix. A 
covariance matrix is easily shown to be non- 
negative definite. The determinant of the covar- 
iance matrix is called the generalized variance. 
The matrix with the (i,j) element equal to the 
correlation coefficient of the ith and the jth 
variates r; (r; is set equal to 1) is called the 
correlation matrix and is denoted by R. R is 
also nonnegative definite. If we denote the 
(i,j) cofactor of R by R,;, the quantity defined 
by 


is called the multiple correlation coefficient of 
the ith variate and all other variates; and 


WITT: D Aen — Rail, R&R; 


is called the partial correlation coefficient of 
the ith and the jth variates given all other 
variates. The meaning of these coefficients will 
be elucidated below. A linear function a, + 
A,X, T d5X3 +... - 44, X4, 18 called the linear 
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regression function of x, on x,,..., x, ,, when 
the coefficients a,,...,a, , are so determined 
that the sum Q 2 X,(xi, —ag—4a,Xxj, —...— 
à, 4 Xa 1)? is minimized. They are determined 
from the equation 


Cid, + Ci5d5 +... + Ci 1041 = Ci, (*) 
i1, ...,k— 1, 


with ay — X, —a,X, —...—4, 4X,-,, where C; 
are the covariances. d,, ...,a,.., thus deter- 
mined are called the regression coefficients of 
X, ON X,, ..., X-,, and such a procedure is 
called the method of least squares. The equa- 
tion (x) is called the normal equation. If we 
write X4 —ago-- d, X;j4 +--+ 4, 4X;4-,, We have 
Q-X(xa—*4) 2 La: x)? ERa- xy- 
YX(xa—X x(1 = Bun, ai?) where Ru. A 
is the multiple correlation coefficient of x, and 
Xj, EA, Which is also equal to the correla- 
tion coefficient of x, and €,. The square of the 
multiple correlation coefficient is also called 
the coefficient of determination. The quantities 
Xj — X;, are called the residuals. Let £; ,-; and 
£j, be the values of regression functions of x, , 
and x,, respectively, on x,,...,X,—2, and let 
Yik-1 7 Xig-1 — Xiy-1 and yj, — Xi, En be the 
residuals; then the correlation coefficient of y,_, 
and y, is equal to the partial correlation co- 
efficient of x,_, and x, given x, ..., X42. 

We have the following relation between the 
multiple and the partial correlation coefficients: 


=(1 — Riis, ll TA Mk 2). 


Multiple and partial correlation coefficients 
are also expressed in terms of the correlation 
coefficients of the variates. For example, it can 
be shown that 


Riu: =(r?, +133 —2ri 3r, 3r53)/(1 — rp) 
and that 

= 2 2 
F23ļ1 =(r23 — rari —rn3(l-ri. 


For higher dimensions, we can also define 
the (joint) density function f(x,,x;, ..., x,) and 
the ( joint) moment generating function 


M (t,, t5, ...,ty) 





= |. fesptt Haka vu 


X f(X1s X2, at EH 


The most important multivariate joint den- 
sity is that of the multivariate normal distri- 
bution, which is expressed by 


1 " 
fo. xp n7 EY etos 


SEN 
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where X is the covariance matrix with ele- 
ments og, Z^! — (o?), and u; is the mean of the 
ith variate. For the multivariate normal dis- 
tribution the moment generating function is 


given by 


l 
M (t,, e., el Can I 20] 


K. Contingency Tables 


When several qualitative observations are 
made on N objects, each object is classified 
according to the combination of the cate- 
gories, and the data are summarized by the 
numbers N(i,,i5, ..., i) of the objects that 

fall in the i, th category according to the first 
observation, i,th category in the second obser- 
vation, etc. A table that shows the results 

of such observations is called a (k-way) contin- 
gency table. If there are m, categories 1n the 
first criterion, m, categories in the second, 

etc., the contingency table is also called an m, 
by m; by... by m, table. The numbers Ñ (j, ij) of 
the objects which are classified into the i;th 
category according to the jth criterion are 
called marginal frequencies. If we have 


N(i,, i9, ..., 4 N =N(1,i,)N(2, i)... N( i N* 


for all i,,i5, ..., i; 


then the k observations or criteria are 
independent. 

The simplest contingency table is a 2 by 2 
table, where several measures for the relation 
of two observations or criteria have been 
proposed, among which the most commonly 
used are the measure of association defined by 

N(1,1)N(2,2) — N(1,2)N(2, 1) 


~ N(1, 1)N(2,2)+ N(,2)NQ, 1) 





and the odds ratio 
NO. T)N (2,2) 
N(1,2)N(2, 1) 
and also 
y N(1, DN(2,2)— N(L, 2)N(2, 1) 


JR, DN. 2)N (2, DNQ,2) 








where N(i,j;) are marginal frequencies. The 
two observations are independent if and only 
if Q=0 or o=1 and V=0. V is equal to the 
correlation coefficient of the variables x, and 
X5, for which x;=0 if the object is classified 
into the first category according to the ith 
criterion and x;= 1 if it is classified into the 
second category. In a two-way m, by m, table 
a measure of association is defined by 

x? NG,)N — o, 2 
eaten 
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it can be shown that X?/N =0 if and only if 
the two criteria are independent, and that 

0x X?/N <min(m, — 1,m, — 1). When we take 
X; — 1 if the object is classified into the ith 
category according to the first criterion and 
x; —0 otherwise and take y;— 1 if it belongs to 
the jth category in the second criterion and 
y,=0 otherwise, it is shown that the sum of 
squares of the multiple correlation coefficients 
of x, and y,,...,Ym,-1 is equal to X?/N. 


L. Decomposition of the Variance 


When one observation is qualitative while 
another is quantitative the objects are classi- 
fied into several categories according to the 
first observation, while for each object the 
value of the second observation is also given. 
Let x;; be the observed value of the jth object 
in the group of the ith category; then for each i 
we can obtain frequency distributions of x;;, 
and compare these distributions. Let N; be the 
number of objects in the ith category, and 

X; — X xj/ N; be the mean in the ith category. 
Then the weighted variance of the x; defined 
by vy = X, N(x; — X/N, where x is the mean of 
all the observations, i.e., x =X N;x;/N, is called 
the between-group variance, and the weighted 
mean of the variances of each of the groups 
defined by vy = 3, È (x; SEIN is called the 
within-group variance. It can be shown that ke 
-FW-V-XEX(G;—X)/N,ie.the variance of 
all the observations is decomposed as a sum of 
the between-group and the within-group var- 
iances. The ratio W/V is called the correlation 
ratio, which is equal to the square of the multi- 
ple correlation coefficient of x and y,, .... Yms 
where y;=1 if the object is in the ith category 
and y; — 0 otherwise. 


M. Ordinal Data 


When the observation is not quantitative but 
there exists a natural ordering among the 
categories into which the objects are classified, 
the observation is said to be in an ordinal 
scale, or simply ordinal. 

When two ordinal observations are made 
on the same set of N objects, we can define 
several measures of association between the 
two ordinal scales. For each pair of objects we 
define a variable c; i, j=1,...,N, ij, as cj; 
if the ith object is classified as "better" (or 
"superior") than the jth object according to 
both of the measurements, c; — —1 if the order- 
ings are different in the two scales, or c;;—O if 
they are in the same category according to 
either or both of the scales. A measure of asso- 
ciation is then given by SX X cj/N(N —1), 
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which takes a value between —1 and +1 but 
usually cannot attain +1. Other ways of nor- 
malizing the sum £ cj; have been proposed. 

Another method of calculating the associ- 
ation is the scoring method, i.e., giving a set of 
ordered real numbers to the categories of each 
of the scales and calculating the correlation 
coefficient between the scores. The simplest 
scores are 0, 1,...,m—1 when there are m 
categories, but other methods of scoring are 
also used. Scores that give the largest pos- 
sible correlation are called canonical scores, 
which are obtained as the characteristic vec- 
tors of the matrices NN' and N'N, where N is 
the matrix of the contingency table. 


N. Time Series Data 


Time series data can be recorded in a con- 
tinuous time scale, but usually measurements 
are made at discrete times, which are most 
commonly equally spaced. Hence we here 
denote them as x(t), t — 1,2, ..., T. First we 
consider the quantitative univariate case. The 
intertemporal change of x(t) is often decom- 
posed into three parts: 


x(t)=m(t)+ c(t) ^ e(t), 


where m(t) is called the trend, and represents 
the secular, systematic change of x; c(t) is 
called the cycle, and represents the recurrent 
pattern of the change; and e(t) is called the 
error or random fluctuation, and represents 
the irregular changes. Such a decomposition 
cannot be defined rigorously without assuming 
some probabilistic or stochastic model for x(t), 
but it is intuitively clear and practically useful 
in many applications. 

There are two ways to estimate the trend. 
One is to calculate the moving average X(t) — 
(x(t — k) - x(t — k - 1) - ... - x(t) - ...- x(t 
k))/(2k + 1) and use it as an estimate of the 
trend of x(t); here k should be chosen to sub- 
stantially eliminate the cyclic and random 
parts. More generally we can use the weighted 
moving average defined as x(t)= Lf — w(j)x 
(t-- j), where w( —j) 2 w(j) and X w(j)=1. The 
second method is to assume some functional 
form, usually a polynomial in t, for the trend: 
m(t)=d)+a,t+...+a,t*, and to determine the 
coefficient by least squares, i.e., to calculate the 
values of ao, a,,..., a, which minimize Zei) 
—àg—a,t— ...—a,t*y. 

There are two cases of cyclical changes. One 
is the case when there is a clearly defined rele- 
vant external time period, such as the seasons 
of the year or the days of the week. In such 
cases the effects of such external periodical 
cycles must be eliminated, and the process 
which does that is called seasonal adjustment 
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of the time series data. Various methods of 
seasonal adjustment have been proposed and 
applied, but none is definitive. The other case 
is where the cyclical changes are produced 
from the observed process itself; here, the 
length of the period and the pattern of the 
cyclical change must be estimated. 

Now assume that the data do not contain 
any trend, or that the trend has been effec- 
tively eliminated. First we calculate the cor- 
relation coefficient between x(t) and x(t +s) by 


3 (x(t) —x)(x(t + s)— x7 
r(s)= 
JY (x() EH Y (x(t +58) — 


where 








T-s T 


x- Y x(\(T-s), x= 2, XOT —s), 


t=1 t=s+ 


or more simply by 
r(s)\= TY (x()—x)(x(t-+ s) EH 


(T — s / > (x(t)—x)’), 


where x =» x(t)/T. r(s) is called the serial 
correlation coefficient or the autocorrelation 
coefficient of lag s. When there exists a clear 
and definite cyclical change of period s in the 
data, r(s) is close to 1. The diagram in which 
the serial correlation coefficient r(s) is plotted 
against s is called the correlogram. In order to 
see the cyclical properties of the data more 
clearly, we calculate the power spectral density 


w(4)— 1 +27 r(s)cos As. 


The graph of w(A) is called the power spectrum, 
or simply the spectrum. w(A) represents the 
square of the width of the sine curve of fre- 
quency 7/2z or of period 2z/4 contained in the 
data. The spectral density is closely related to 
the intensity, defined by 


2 2 
này (stessi + Daten) d 


which is proportional to the square of the 
multiple correlation coefficient of x(t) and the 
functions cos At and sin At and is large if the 
data contains a sine curve of frequency Ain It 
can be shown that I(2) is approximately equal 
to w(2)V(x)/n. The spectral density thus ob- 
tained usually oscillates irregularly and far 
from smoothly; hence smoothing by use of a 
"spectral window" is often applied (— 421 
Times Series Analysis). 

When several observations are made in time 
series data, we speak of multivariate time 
series. Let x;(t) be the ith observation in the 
tth period. The correlation coefficient between 
x,(t) and vii +s) is called the serial cross- 
correlation coefficient and is denoted by r,(s) 
(s — 0, +1, £2, ...). Analogously to the univari- 
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ate case we define 


w,(A)= > ry(s)cos As-- i ri (s)sin As 
S s 


= pi(A) m CDI 


and we call p,(4) the cospectral density be- 
tween the ith and the jth variables, and g,,(A) 
the quadrature spectral density. p7 + qj, is called 
the amplitude, and 


pa) t aC) 
nw) 
where w; and w, are the spectral densities of 


the ith and the jth variables, is called the 
coherence. 


Cyl) 


O. Events in Time Scale 


Some data give us the time points at which a 
specific event occurs. Let T; be the time when 
the event occurs for the ith time. Then usually 
the most important information we want to 
obtain is about the time intervals d;= 7;,, — T;. 
If there is a periodicity in the occurrences of 
the event, the d; will be approximately equal. 
On the other hand, if the event tends to occur 
repeatedly after its first appearance, some d 
will be small while others will be large. When 
there is no periodicity, no tendency to repeti- 
tion, and no increasing or decreasing trend 

in the occurrences, we can suppose that the 
event occurs simply by chance, and this is 
good reason to suppose that the density func- 
tion of the distribution of the intervals is 
exponential, i.e., it can be expressed by f(d)— 
(exp( — d/a))/a for d — 0. Also in such a case 
the number of occurrences in fixed time in- . 
tervals are distributed according to the Pois- 
son distribution. Such a sequence of occur- 
rences of an event is called a *Poisson process. 
More generally, let f(d) be the density function 
of the time intervals; then 


f(a) 
1— f A f(c)dc 
is called the hazard rate or hazard function. 


The hazard function is constant if and only if 
the process is Poisson. 


h(d)= 


P. Probabilistic Models 


In many applications of statistical data anal- 
ysis, the data exhibit variabilities and fluctu- 
ations that are due to fortuitous or hazardous 
causes or chance effects and that obscure the 
information contained in the data. In such 
cases we assume that the chance variabilities 
and fluctuations are random variables distri- 
buted according to some probability distribu- 
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tion, and the information we require is repre- 
sented by a set of unknown constants that 
characterize the probability distribution of the 
data as unknown parameters. Suppose, to take 
the simplest case, that we have repeated obser- 
vations of results of some experiment under a 
fixed condition and that we obtain the values 
X4, X5, Aa, Since the experimental condition 
is fixed, the variations among the x, values can 
be considered to be due to chance causes, 

such as variations in materials, uncontrolled 
small fluctuations in experimental conditions 
or instruments, and various other variations 
usually called the errors. Whatever the true 
causes of the variations, we can consider them 
to be random, and we can regard the values 
X,, X5, ..., X, as the results of random experi- 
ments or the realizations of random variables 
X,, X;, ..., X, independently and identically 
distributed according to some probability dis- 
tribution. Or we may think of a hypothetical 
infinite population of the results of supposedly 
infinite replications of the experiment under 
the same fixed condition, and regard the actual 
observations as n values chosen from this 
population at random. We may also consider 
that in this hypothetical infinite population, 
the frequency distribution is represented by a 
density function f, which in turn determines 
the probability distribution of each observa- 
tion. We may be interested in the "average" 
values of the result of the experiment as well as 
the magnitude of the variability; then those 
values are represented by the mean and the 
variance of the population distribution. If the 
form of the population distribution is assumed 
to be completely specified except for the mean 
u and the variance g?, the density function f is 
determined without these two parameters, and 
is expressed as f(x; u, o). The joint density for n 
repeated observations is TT, f(x; u, o). The set 
of assumptions that determines the probability 
distributions of the observations in terms of 
the unknown parameters is called the probabi- 
listic model, and its determination is called the 
problem of specification. 

Once the probabilistic model is given, the 
purpose of statistical data analysis can be 
formulated as making judgments on the values 
of the parameters, which may sometimes go 
wrong but can be relied on with some margin 
of probability of error that can be mathemati- 
cally rigorously ascertained. The formal proce- 
dure of making such judgments is called sta- 
tistical inference, and its mathematical theory 
has been well established over the last hundred 
years (— 401 Statistical Inference). 

]n most cases of statistical inference, the 
joint density function of the data plays an 
important role, and when it is regarded as a 
function of the unknown parameters for given 
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values of observations it is called the ‘likeli- 
hood function. 


Q. Exploratory Procedures 


In many applications of statistical data anal- 
ysis, we are not quite sure of the validity of the 
probabilistic model assumed, or we admit that 
the models are, at best, approximations to 
reality and hence cannot be exactly correct; 
the approximation may not be precise enough 
for the conclusions drawn from the assump- 
tions to be practically reliable. Therefore we 
have to check whether the model assumed is at 
least approximately valid for the data, and 
if not, we have to look for a better model that 
reflects more accurately the structure of the 
actual data. Thus in many practical appli- 
cations of data analysis, we have to scruti- 
nize the structure of the data and try various 
models before settling on a model and draw- 
ing final conclusions (which are still suscep- 
tible to further revisions when more data are 
obtained). Methods used in such a process 
are called exploratory procedures, which de- 
pend partly on the formal procedure of testing 
hypotheses and partly on intuitive reasoning 
sometimes combined with graphical presen- 
tations of the data, and also on scientific and 
empirical understanding of the subject matter. 
Suppose in the simplest case that n obser- 
vations X}, ..., X, are assumed to be indepen- 
dently and identically distributed. Under the 
condition that all those values are observed 
under the same well-controlled situation, this 
assumption is reasonable. But in reality some 
of the observations may be subject to some 
unexpected effect due to either a fortuitous 
outside cause or some "gross error" in the 
measurement procedure, the process of re- 
porting, etc., and may show much greater 
variation than others. Such observations can 
be detected by certain outlier tests or simply 
by looking at the data carefully, and if it is 
established that some observations are de- 
finitely outside of the possible random varia- 
bility or are subject to some hazardous ex- 
ternal effect, those data could be omitted from 
consideration. Further, the assumed proba- 
bility density f(x, 0) may not well approximate 
the distribution of the actual data even after 
the “outliers” are omitted. Some test for good- 
ness of fit should be applied, and if the hy- 
pothesis is rejected, we have to modify our 
model. Also, if we are provided with several 
candidates for the model to be adopted, we 
have to apply some procedure of model selec- 
tion (— 400 Statistical Hypothesis Testing, 
403 Statistical Models). It could also happen 
that the supposedly uniform conditions of 
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Observation, measurement, or experimentation 
did not actually prevail but that there has been 
some heterogeneity among the observations. 
Bimodality or multimodality of the histogram, 
i.e., existence of two or more peaks in the 
histogram, usually strongly suggests such 
heterogeneity. In such cases, grouping or stra- 
tification of the observations is required to 
make the conditions of observation within 
each group nearly uniform. 
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398 (XVIII.6) 
Statistical Decision 
Functions 


A. General Remarks 


The theory of statistical decision functions was 
established by A. Wald as a mathematically 
unified theory of statistics (— 401 Statistical 
Inference). In this theory, the problems of 
mathematical statistics, for example, statistical 
hypothesis testing and 'statistical estimation, 
are formulated in a unified way [1]. 

A *measurable space (X, 8) with a fixed 
*probability measure is called a sample space, 
and an element ve X is called a sample point. 
Suppose that we are given a family 2 — { P,| 
0€! of probability measures on (X, B), 
where Q is called the parameter space, and a 
trandom variable X takes values in Z accord- 
ing to a true probability distribution P as- 
sumed to belong to Y. This article deals with 
the problems involved in making a decision 
about the parameter 0, called determining the 
true value of the parameter, such that P — P,. 
To describe the procedure for such a decision 
based on the observation of the behavior of X, 
we need a triple Laf, €; D) consisting of a set .o/, 
a ta-algebra & of subsets of ai. and a set D of 
mappings ô from Z into the set of probability 
measures on (£, ©), 0: x —^ó(-|x), such that 
for a fixed Cep. the function 6(C| x) is B- 
measurable. We call d an action space or 
decision space, ó a statistical decision function 
(or simply a decision function) or statistical deci- 
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sion procedure, and D a space of decision func- 
tions. In actual decision procedures, ó(C | x) is 
the probability that an action belonging to C 
is taken, based on the observation of sample 
point x. We further consider a nonnegative 
function w:Q x X >R, called a loss function, 
such that for a fixed 6, w(0, a) is €-measurable. 
By averaging the loss, we obtain the risk 
function: 


r(0, à) =| | w(0, a)ó(da| x) P,(dx). 
Xo 


Two decision functions ô and A are identified 
if d(C |x) 2 óà'(C|x) for almost every x with 
respect to P4, for all 9€ and all Ce&. When 
to each xe 4 there corresponds a unique ac- 
tion a, such that ó((a,]) | x)= 1, the decision 
function 6 is said to be nonrandomized; other- 
wise, randomized. The system (2%, B, Z2, Q, A, 
€. W, D) is called a statistical decision problem. 

From the point of view of this theory, tpoint 
estimation, "interval estimation, and statistical 
hypothesis testing (— 400 Statistical Hypoth- 
esis Testing) are described as follows. 

(1) In point estimation we assume that the 
action space . is a subset of R and that we 
are given functions o: 4 5. and I: 25 R. 
The problem is to estimate the value of /(P) 
by using the real value o(x) at an observed 
sample point xe Z. As the loss function, we 
often set w(0, a) - C(0)(a—I(P,))’, where C(0) is 
a function of 0, and call it a quadratic loss 
function (— 399 Statistical Estimation). 

(2) In interval estimation we assume that 
each action is represented as an interval in 
R. Each interval [u, v] can be represented 
by a point (u,v) of the half-space R” = {z= 
(z.25)| zi <Z2}, which may be taken as the 
action space. A weighted sum aw,(0, z)+ 
Bw,(0, z) («, B > 0) of two functions, 


1 (0€[zi,22]). 
0 (Oe[z,,22]), 


w(0,z)=Z2—21, 


ER 


often supplies the loss function (— 399 Statis- 
tical Estimation). 

(3) In testing a hypothesis H:0€« versus 
an falternative A:0€c, (o9, = Ø, og Uo, 
=Q), the action space can be expressed by the 
set consisting of two points a,, a,, where a, 
denotes the decision to reject H and a, the 
decision to accept H. The loss function is 
defined as follows: 


1 (8 
wt0.a,)= 4) pe 
` In (0609) 
va) (eo) 


This is called a simple loss function. Whatever 
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testing procedure 6 is adopted, the proba- 
bilities of the *errors of the first or second kind 
coincide with the values of r(0, 6) for Deche or 
0€ c,, respectively (— 400 Statistical Hypoth- 
esis Testing). 

When Q is the union of mutually disjoint 
nonempty sets €, W,...,@,, Y= (44,05, 
Gei, and w(0,aj) — c;; (0€ c0;) with c; z 0 
(ij), cj; 20, the decision problem is called 
an n-decision problem. 


B. Optimality of Statistical Decision Functions 


Consider the problem of choosing the best 
decision function ô. When r(6, 6,)<r(0, 63) for 
all 0 and there exists at least one 6, such that 
r(05, 0,) Fifa, 65), the decision function ô, is 
said to be uniformly better than A, If there 
exists a decision function ô in D that is uni- 
formly better than any other 6 in D, it is the 
best decision function. However, such a func- 
tion ô does not always exist. A decision func- 
tion ô in D is said to be admissible if there 
exists no other decision function 6 in D that is 
uniformly better than 6. In other words, à is 
admissible if and only if the validity of the 
inequality r(0,6)<r(0, 5) for some óc D and all 
dei implies r(0, 0) — r(0, ô) for all get. When 
there is no information about P except that it 
is a member of Z, we follow the minimax 
principle and choose a function ó* for which 
we have inf;, psupgegr (0, 9) 2 sups.g r(0, *). 
This decision function 6* is called a minimax 
decision function or minimax solution. 

Let & be a o-algebra of subsets of Q, and 
suppose that r(0, ô) is §-measurable for any 
fixed ó. If, furthermore, we are given a proba- 
bility measure é, called an a priori distribution, 
on (Q, %), we choose a ó that satisfies 


inf | r(0,5)dé(0) = | r(0, à) dé(0). 
óeDJo Q 

Such a ô and the integral of r(0, 5) are called a 
Bayes solution and the Bayes risk relative to c, 
respectively. Let F be a family of a priori dis- 
tributions on (Q, 251. If 6 satisfies 


ei | r(0, 5)dé(0)— inf | r(0, oun) =0, 
čeF Q 6eD JQ 

ó is called a Bayes solution in the wider sense 
relative to F. If 2 is tdominated by À with a 
B x %-measurable f(x, 0)=dP,/di, w(0, a) is 
iy x €-measurable, and 


As faca 





| w(0, à) f(x, 0) d£ (0) 
Q 


aed 


= inf | wi. af (x, naso) 


is nonempty and &-measurable for 2-almost 
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every x, then a Bayes solution 6 with respect 
to é satisfies 0(A,|x) — 1 for 4-almost every x. 
For a sample point x with fo f(x, 0)d£(0) 40, a 
probability measure n(-|x, é) on (Q, %) defined 
by 


fx, dël) 

n(Blx, =" 

| f(x, 0) dEl) 
o 


is called an a posteriori distribution. To get a 
Bayes solution it is enough to minimize the 
value of f oW(0, a) dq(0| x, č) for every observed 
x. 

If € in the definition of A, defined above is a 
o-finite measure on (Q, %), a decision function 
ô satisfying 6(A,|x)=1 for A-almost every x is 
called a generalized Bayes solution with respect 
to č. 

Let D' be a subset of the space D. If for any 
óc D — D' there exists a ó'e D' that is uniformly 
better than 6, then D' is called a complete class. 
If for any óc D there exists a ó' e D' that is 
either uniformly better than A or has the same 
risk function as 6, then D' is called an essen- 
tially complete class. If D' is complete and any 
proper subset of D' is not complete, then D' is 
called a minimal complete class. If a minimal 
complete class exists, it is unique and coin- 
cides with the set of all admissible decision 
functions. 


C. n-Decision Problems 


In an n-decision problem where vg = (a,, 
...,4,}, We set 6,(x) = ó(a;| x) for a decision 
function 6, where 6,({x) is B-measurable and 
satisfies 0,(x) 2 0, 6,(x)+...+6,(x)=1. We 
consider the set 2 of vector-valued functions 
A(x) — (0, (x), ..., 6,(x)) whose components 6,(x) 
satisfy the conditions just given. Such a vector- 
valued function A(x) can be identified with 
(x); we write ó(x) instead of A(x) also. If in 
addition the parameter space Q is a finite set 
(0,,6;, ...,0,], we can consider a mapping 

y: — R* defined by (ô) — (r(0,, ô), ..., r(06,,0)), 
and then A  y(2)-— (V(6)|óe 2) is convex and 
closed in R*. If 6 is nonrandomized, then for 
each x, one and only one of the 6,(x) is 1, and 
all others are 0. Hence, in this case, Z is the 
disjoint union of B-measurable subsets B, 
(i1, ..., n) such that 6(a;|x)=1 if and only 

if xe B;. A probability measure m on (4^, 8) is 
said to be atomless if for any set Ae% with 
m(A)>0 and any b with 0 «b «m(A), there 
exists a subset Be% of A such that m(B) — b. If 
Q is finite and every member of 2 1s atomless, 
the image (PZ) of the set 2° of all nonrandom- 
ized decision functions coincides with dl 
This shows that 2° is an essentially complete 
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class in Z. However, when some members of 
f are not atomless, i/(2) ts not always equal 
to V(£9), but V(£9) is a closed subset of R*. In 
particular, when n — 2 for given probability 
measures P,, ..., P, over (X, B), the set S of all 
points (P, (B), ..., P,(B)) (Be 8) in R* is a closed 
set. If in addition P,,..., P, are all atomless, S 
is convex. These results are known as the 
Lyapunov theorem. 

A two-decision problem with c, — {1}, 
œw, = {2}, and c2 1 (ij), «0 (i—j) is called a 
dichotomy. We discuss this problem in some 
detail in order to explain the concept of opti- 
mality of decision functions. For a dichotomy, 
S — (2) is a set in R? that (i) is convex, (ii) is 
closed, (iii) is symmetric about (1/2, 1/2), (iv) 
contains the points (0, 1), (1,0), and (v) is a 
subset of interval [0, 1] x [0, 1] (Fig. 1). The set 
of decision functions ó that is mapped under y 
onto the curve ACDB in Fig. ! constitutes the 
minimal complete class, and a decision func- 
tion 6 mapped onto the point D is a minimax 
solution. Let č be an a priori distribution such 
that 


Cla (=f 


(2+ fP=1,a20, $20). 






ax+ y= constant 


Fig. 1 


Then a Bayes solution relative to € is mapped 
under y onto a supporting point c with direc- 
tion ratio — x/f and is obtained as the char- 
acteristic function of the set E= [x |af(x) « 
Bg(x)}, where f and g are tRadon-Nikodym 
derivatives dP,/d4 and dP,/dd with respect to 
À= P, 4- P5, that is, f, g are measurable func- 
tions on Z such that for any measurable set 
E, we have 


nuer faa, r=] gah, 
E E 


This fact implies that the most powerful test 
constructed in the Neyman-Pearson funda- 
mental lemma (— 400 Statistical Hypothesis 
Testing B) is precisely a Bayes solution. 


D. Complete Class Theorems 


Suppose that F is tdominated by a o-finite 
measure 4, and let f(x, 0) be the Radon- 
Nikodym derivative of P, with respect to 4. 
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Consider a subspace L of L,(4^, 4) containing 
(rte (let). and the following equivalence 
relation between bounded 38-measurable func- 
tions on 2: p, and o, are defined to be equiv- 
alent if and only if f o, fd4 = f p, fd4 holds 

for every Tel. We further assume that the 
dual space of L is the linear space Mt of the 
equivalence classes of bounded measurable 
functions just defined. Let C,(.-7) be the set of 
all continuous functions on . with compact 
support, nog the integral of «€ Col) with 
respect to a probability measure z over 2%, 
and goó the integral 


D :Ix)g(x) dA 


for ge L, óc D. As a base for neighborhoods of 
the space of decision functions around à, € D, 
we consider V(05:0,, ...,9,,91,.... 0, Y= 
15|\g,06,0%;-g,0504;|<e,i=1,...,n}, 
where o;€ Cy(.), gel, £2 0. 

Let F be the set of all a priori distributions é 
on (Q, %) each of which assigns the total mass 
| to a finite subset of Q, B the set of all Bayes 
solutions relative to some c(e F), and W the set 
of all Bayes solutions in the wide sense relative 
to F. Suppose that d is a flocally compact, 
separable metric space and w(0, a) is lower 
semicontinuous with respect to a for every 
fixed 0. Then if D is compact and convex, 
the intersection of W and the closure B of B 
constitutes an essentially complete class [2]. 
Moreover, if Q is compact with respect to the 
metric du, 05) — supg.a| P5, (B) — P,,(B)| and 
iwC.a)|ae gi is a uniformly bounded and 
*equicontinuous family on Q, then the class of 
Bayes solutions relative to some a priori distri- 
bution is a complete class [1]. These propo- 
sitions are called complete class theorems (for 
complete classes in specified problems and 
admissibility of individual procedures — 399 
Statistical Estimation; 400 Statistical Hypoth- 
esis Testing; and Appendix A, Table 23). 

Two measurable spaces (S, ^) and (R, 2) are 
said to be isomorphic if there exists a corre- 
spondence p of S and R (Ee implies p(E)e 2 
and, conversely, p(E)e 2 implies Ee S) where 
p is One-to-one onto. Let 2 be the set of all 
decision functions associated with a sample 
space (Z^, 8) and an action space Lo, ©), T bea 
‘statistic on (2^, B) taking values in a measur- 
able space (7, ©), and 2* be the set of all 
decision functions having sample space (Y, €) 
and action space Lo, ©). The set of all de 2 for 
which there exists a ó* e ot satisfying d(C | x) 
=6*(C| T(x)) (Ce&) is denoted by 27. If (i) 

T is a *sufficient statistic and if (it) (2, ©) is iso- 
morphic to the measurable space R* associated 
with the c-algebra of its Borel subsets, then 2; 
is essentially complete in Z. Conversely, if (i) F 
is a dominated family, (ii) f(x, 0)>0 always 
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holds on 2, and (iii) for any pair 0,, 0,(€Q), 
there exists a 0,(e Q) such that no element a in 
J makes two of the w(0,, a) (i= 1, 2, 3) attain 
their minimum simultaneously, then essential 
completeness of Z4 in 2 implies that T is 
sufficient [3] (— 399 Statistical Estimation). 


E. Invariance 


Suppose that there exist one-to-one transfor- 
mation groups G, G, and G of X, Q, and «7, 
respectively, onto themselves (transformations 
belonging to G, G, and G are *measurable with 
respect to B, &, and ©, respectively) and that 
there exist homomorphisms gg, g d of G to 
G and G, respectively, such that for Be we 
have P(g ! B) — P4(B) and w(g0, ga) = w(0, a). 
Then a decision problem (2%, B, 29. Q, .»7, ©, 
w, D) is said to be invariant under (G, G, G). In 
a decision problem invariant under (G, G, G), a 
decision function satisfying 6(gC | gx) 2 (C |x) 
is called an invariant decision function. 
Suppose that the transformation group G is 
locally compact and is the union of a count- 
able family {K,,} of compact subsets. Let I' be 
the c-algebra of Borel subsets of G such that 
the mapping (g, x)—(g, gx) is measurable in 
the sense that the inverse image of any set in 
T x Bis also a set in I x 8. For such a T the 
‘orbit G(x) of G through x is I-measurable. 
We assume that following conditions: (1) 2 is 
dominated; (2) G operates effectively on Z; (3) 
G operates ttransitively on Q; (4) for any com- 
pact subset J of G, 


lim (WK V/W (K, J 77) 91, 


where vi is a right-invariant Haar measure 
(— 225 Invariant Measures C) on G and 
K, J «(gh !|geK,, heJ Y; (5) there 

is a *conditional probability distribution 
P,(-:z), given ze G(x), on X; (6) the integral 
fca w(8, ja) P(dgx :z) attains its minimum 
value b(0, z) for any 0 and ze G(x); and (7) 


n2o00 


lim d w(0, ga) P((dgx:z) — b(0, al z0 
K,x 


uniformly in a, where K,x={gx|geK,}. Then 
the best invariant decision function exists 
which is also minimax in Z [4] (— 400 Statis- 
tical Hypothesis Testing). It is shown in [9] 
that some invartant minimax decision func- 
tions are not admissible. 


F. Sequential Decision Problems 


In the general framework of statistical dect- 
sion theory, not only the decisions to be taken 
but the number of samples to be observed 
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may be determined based on the previous 
observations. 

A simple formulation is illustrated for the 
sequential decision problem given below. 
Suppose that X,, X,,..., X, are independently 
and identically distributed random variables 
with probability measure P, We assume that 
the X's are to be observed one by one, and 
at the ith stage, when we have observed X,, 

., X; we are to decide whether to continue 
sampling and observe X,., or to stop obser- 
vation and choose an action or decision in Z, 
utilizing all the observations thus far obtained. 
Then a decision rule is defined by a sequence 
of pairs (0,, s,), n — 0, 1, ..., where 6, is a map- 
ping from the space X" to 2 (for the sake of 
simplicity we here exclude randomized deci- 
sions), and s, = s,(x,, ..., x,) is a measurable 
function from X" to the interval (0, 1]. s, gives 
the probability of stopping the sampling when 
the first n of the X's are observed, and ô, de- 
fines the decision taken when the observations 
are stopped. We include sọ and 6, to denote 
the probability of taking a decision without 
making any observation and the decision to be 
taken then. We call 6, the terminal decision 
rule and s, the stopping rule. Then for such a 
decision rule 6 the total expected loss or the 
risk is given by 


oo n-i 
r(0, 0) — 2 [Thats 


xx) WO 0 6 ess X) 


Sols DE neus 


n 

t c, (X, | Xi gett el I] dx, 

j= 
where c,(x,|x,, ..., x, (IS the cost of observa- 
tion of X, — x, when X,—x,,..., X,- ,—X,.,. 
The rule à is called a sequential decision rule 
or a sequential decision function, and the whole 
setup a sequential decision problem. In most of 
the sequential decision problems the cost of 
Observation is assumed to be equal to a con- 
stant c per observation, and then the Bayes 
risk r*(5,) for the prior distribution čą satisfies 
the relation 


r*(£) 2 min fit | w(0,d)dé,, 


owes, Sisi 


where z,(x,, £j) denotes the posterior distri- 
bution when X, — x, is observed under the 
prior distribution £4; and the Bayes decision 
rule satisfies the condition 


S, X1, X) 1 A PF (nglxqs Xn; So) 


latäin, ie Xas Go) 


=0 otherwise, 
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and ó,(x,, ..., x,) - d* if (w(0,d*) dro (x, ..., 
Xn: Čo) = inf, [ w(0, d) dud(x, , ... , x,; Čo), where 
Rg(X,, ..., Xn; Čo) is the posterior distribution 
given X, —x,,..., X, = x, under the prior Za, 

For the dichotomy problem with the simple 
loss function discussed above, the Bayes deci- 
sion rule has the form 


S4 (X4, ..., Xp) = 1 if no(x,,..., X,) KA, OF SN, 
=0 otherwise, 

and 

GX, X) d, if Zelt), X,) En, 


=dy if 1o(X1,---, Xp) Z Tiz, 


where zo(x,,..., x,) is the posterior prob- 
ability that 0 = 0p, given X, —x,, X,— x,. This 
amounts to the following rule: Continue sam- 
pling as long as 


931€ 405, EZE Pill P (xi) € 92; 
and decide on d, as soon as A(x,, ..., x,) 2; 
and on d, as soon as A(x,, ..., x,) € y,, which 
is actually equivalent to a tsequential proba- 
bility ratio test (— 400 Statistical Hypothesis 
Testing). 


G. Information in Statistical Experiments 


The part & — (47,98, P,Q) of a decision problem 
A — (4,38, 2, Q, A, ©, w, D) is called a statistical 
experiment. In this section, we consider n- 
decision problems, i.e., those wherein Q con- 
sists of a finite number (1,2, ..., n] of states, 


and we denote the set S= {r(1, ô), r(2, ô), ..., 


r(n,ó)|Óe 2] by L(A). Let £, and £, be two 
experiments having a common parameter 
space Q, and let A, and A, be two decision 
problems composed of &, and &,, respectively, 
and a common (£, D. w). We say that the 
experiment 6, is more informative than the 
experiment 6, if L(A,) 2 L(A,) for any action 
space Lo, &) and any loss function w [6]; 

that is, whatever the actions proposed and the 
loss incurred, the experiment &, can offer a 
decision procedure at least as efficient as the 
experiment &,. Thus the set L(A) with A= 

Lë, 8. &, w, D) represents some feature of infor- 
mation that & can provide about the states Q. 
However, comparison of L(A) is not easy to 
carry out. S. Kullback and R. A. Leibler de- 
fined the concept of information for the case of 
a dichotomy A [5]. If the probability distri- 
bution induced by a random variable X has a 
Radon-Nikodym derivative f, (x) (70) or f(x) 
(20) with respect to J, we define 





I(X:1,2)—1(fi. f) E fi 


398 H 
Statistical Decision Functions 


calling this the Kullback-Leibler information 
number (or K-L information number). This 
number is uniquely determined by the set $ = 
L(A) in Fig. t, and the larger S becomes, the 
larger /( f,, f;) becomes. If x, f (x+ B= 1) area 
priori probabilities and 4(1|x) and n(2|x) are a 
posteriori probabilities of 1 and 2, respectively, 
we have, from the Bayes theorem, 








°F x) "Qx "e 


Here the right-hand side stands for the change 
in the probability of the occurrence of the state 
after an observation of x, and the *expectation 
of the left-hand side under f, is Hfi, f2). The K- 
L information number has the following prop- 
erties: (i) (X:1,2) 20, and 1(X:1,2)20— 
f =g; (ii) independence of X and Y implies 
I(X:1,2)- I(Y:1,2) 2 I(X, Y:1,2); (i) fora 
statistic T'—t(x), I(X:1,2) x I(T:1, 2), where 
the equality holds if and only if T ts sufficient 
for P={P,, Pj, in which dP, /dA = f, and dP,/ 
dÀ — fa; and, as a result of (it), (iv) if X,,..., X, 
are distributed independently with the same 
distribution, 7(X,,..., X,:1,2) 2 nI(X,:1,2). 
Suppose next that Q is the real line and that 
the Radon-Nikodym dertvative g(t:0) of the 
distribution of a statistic T with respect to a 
measure À has the following properties: (1) the 
set of all t at which g(t:0) —0 is independent of 
0; (tt) g(t:0) is continuously twice differentiable; 
and (iii) the order of differentiation with re- 
spect to 0 and integration with respect to t 
can be interchanged. Then 1(T:0,0 -- d0) — 
I(T:0)d0? for an infinitesimal displacement d0 
of 0, where 


I(T:0)= |, (CS st g(t:0)dà 


Here I(T':0) coincides with the Fisher infor- 
mation (— 399 Statistical Estimation). 
Suppose that we are given a sequence (X,, 
X», ...) of independent random variables 
whose distributions have as their density either 
fs. fa.) or (G,,92,---), that is, f; and g; are 
candidates for the density of distribution of X; 
(i=1,2,...). A method to determine which of 
the sequences ( f;), (g;) actually corresponds to 
(Xj) is given by the Kakutani theorem. Let F be 
the distribution of (X,, X,,...) when each X; is 
distributed according to f;, and let G be that of 
(X4, X>,...) when each X; is distributed accord- 
ing to g,. To see how X,, X,,... are actually dis- 
tributed, we assume that the loss incurred by an 
incorrect decision is 1 and the loss incurred 
by a correct decision is 0. Denote such a deci- 
sion problem (dichotomy) by A. Then we gen- 
erally have L(A) c I, where I ={(x, y)|O<x, 
y<1}. A necessary and sufficient condition for 


m) Sg y(t |x) a 
o lo 
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L(A)— I is that IL, p(f,,9,)=90, where 
Pl Sas In) = | falx) Gn(x) dx. 
Consequently, if the X, (n=1,2,...) have the 
same distribution, we have L({A)= I, and the 


correct decision can be made with no error 
based on infinitely many independent obser- 
vations of X,, X,,.... 


H. Relation to Game Theory 


The theory of statistical decision functions 

is closely related to game theory. From the 
game-theoretic viewpoint, a statistical decision 
problem is considered to be a zero-sum two- 
person game played by the statistician against 
nature. A strategy of nature is the true distri- 
bution P of the variable X or the true value of 
0, and a strategy of the statistician is a decision 
ô. In this setup, the risk function r(0, ô) can be 
regarded as a *payoff function paid by the 
statistician to nature. An a priori distribution 
€ is a tmixed strategy of nature. A randomized 
decision function is a mixed strategy of the 
statistician. A minimax decision function cor- 
responds to a minimax strategy of the statis- 
tictan. A minimax strategy of nature is called a 
least favorable a priori distribution. If a dect- 
sion problem is strictly determined as a game, 
a minimax solution is a Bayes solution in the 
wide sense. 

If dp is a Bayes solution with respect to čo 
and r(0, 05) < R(čo, ôo) for all 0, the decision 
problem is strictly determined, and A, is mini- 
max and €, is a least favorable a priori distri- 
bution, where R(£,, 5p) = fo r(0, 09) d£ (0). If 
ĉ is a Bayes solution in the wide sense and 
r(0, do) is constant as a function of 6, the deci- 
sion problem is strictly determined, and A 
is minimax. If ô is admissible and r(0, 05) = 
c< oo, dn is minimax. If Q is a finite set and 
inf; super, ô) « oc, the decision problem is 
strictly determined and there exists a least 
favorable a priori distribution. We have few 
general results about generalized Bayes solu- 
tions (2 173 Game Theory; [6]). 
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A. General Remarks 


Statistical estimation is one of the most im- 
portant methods of statistical inference (— 401 
Statistical Inference). Its purpose is to estimate 
the values of ‘parameters (or their functions) 
involved in a distribution of a statistical *popu- 
lation by using observations on the popula- 
tion (— 396 Statistic). Let ?—(P,[0c 8! bea 
family of *probability distributions, indexed by 
a parameter 0 and defined over a *measurable 
space (Le, sample space) (7, 38). Let X bea 
*random variable taking values in 2 and dis- 
tributed according to a probability distribu- 
tion P that is a member of 7. Statistical esti- 
mation is a method of estimating the *true 
value of the parameter 0 (i.e., the parameter 0 
such that P = P,) or the (true) value g(0) of a 
given parametric function g (e, a function 
defined over ©) or both, at 0, based on the 
Observed value x of the random variable X. 
The function g maps the parameter values into 
R, R*, or some function space. Statistical esti- 
mation methods are classified into two types: 
point estimation, which deals with individual 
values of g(0), and interval (or region) estima- 
tion, by means of which regions that may con- 
tain the value g(0) are considered. We can also 
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include in statistical estimation the problem 
of predicting the tolerance region in which the 
value of a yet unobserved random variable 
may come out. 


B. Point Estimation 


In the method of point estimation for a given 
parametric function g, we choose a measurable 
mapping q from the sample space (X, B) into a 
measurable space (Loi, &) and state that “the 
value of g(0) is g(x)” for an observed value x, 
where .° is a set containing the range of g and 
€ is a tcomplete additive class of subsets in a. 
The mapping ø, or the random variable o(X) 
taking values in the space oi. is called an 
estimator of g(0), while the value q(x) deter- 
mined by the observed value x is called an 
estimate of g(0). This estimate is sometimes 
termed a nonrandomized estimate in contrast 
to the following generalized notion of esti- 
mator. A mapping from Z to a set of proba- 
bility distributions defined over (. 7, &) is called 
a randomized estimator, which reduces to a 
nonrandomized estimator when each image 
distribution degenerates to a single point. We 
assume that ./ =R and C — the class 38 of all 
*Borel sets in R, unless stated otherwise. We 
denote the 'expectation and 'variance with 
respect to P, by E, and V,, respectively. 


C. Unbiasedness 


An estimator (X) of g(0) may not be exactly 
equal to g(0) for any 0€ O except for trivial 
cases, but could instead be stochastically 
distributed around it. An estimator q( X) is 
said to have unbiasedness if it is stochastically 
balanced around g(0) in some sense, such as 
mean, median, or mode. A statistic p(X) is 
called a (mean) unbiased estimator of g(0) if 


Es(o(X)) — g(0) 


for any 0€ O. A parametric function g ts said 
to be estimable if it has an unbiased estimator. 
For example, the sample mean is unbiased for 
the population mean: E,(X)= E(X) for any 
(ep Unbiasedness usually implies mean un- 
biasedness, and we assume this unless stated 
otherwise. The function 


b(0) = Ey(p(X)) — 9 (0) 


is called the bias of the estimator (X). If we 
restrict ourselves to unbiased estimators (X) 
only, it is best to choose, if possible, a p(X) 
whose variance V,(y(X)) is minimum uni- 
formly for every (ep 

Theorem (Rao-Black well). If T — t(X) is a 
*sufficient statistic, then for any unbiased 
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estimator ~(X) of g the tconditional expecta- 
tion y(t) 2 E(p(X)| T =t) yields another un- 
biased estimator o*(X) — y (t(X)) of g, which 
satisfies V;(q*) x V,(@) for all 0€ ©, with the 
equality holding if and only if @(x)= @*(x) 
(a.e. P). The notation ae, 2 means that the 
statement concerned holds with probability 1 
with respect to P, for each 0€ OQ. An estimator 
ọ of g(0) is called a uniformly minimum var- 
iance (or UMV) unbiased estimator if o is 
unbiased for g(0) and has a minimum variance 
uniformly in O among the class of unbiased 
estimators for g(0). 

Theorem (Lehmann-Scheffé). If T is a suffi- 
cient and fcomplete statistic, then for any 
estimable parametric function g(0), there 
exists a unique UMV unbiased estimator 
of g(0) that 1s a function of T. For exam- 
ple, suppose that X =(X,, X2, ..., Xn) isa ran- 
dom sample from a population with exponen- 
tial type distribution P, with density p(x) = 
B(O)u(x)exp(X:E. , a,(0)t;(x)) with respect to 
Lebesgue measure and that the set |(o,(0), ..., 
2,(0))| 0€ O! contains some open set of R*. In 
this case, T — (t, (X), ..., t, (X)) is a sufficient 
and complete statistic, and hence every real- 
valued measurable function y(T) is the unique 
UMV unbiased estimator of the parametric 
function E,(V(T)). If for any 0€ O the tmedian 
of the distribution of an estimator (X) equals 
a real parametric function g(0) when X is dis- 
tributed as P}, i.e., if 


Bola «5« Po{ P(X)  q(0), 


then q(X) is called a median unbiased esti- 
mator. For example, a sample median (suitably 
defined for the case of an even number of 
samples) is median unbiased for the popula- 
tion median. If @(X) is a median unbiased 
estimator of g, then for any real-valued mono- 
tone function h, an estimator h(p(X)) for 
h(g(0)) is median unbiased, that is, median 
unbiasedness is preserved under monotone 
transformations, which is not the case with 
mean unbiasedness. 

Restricting our consideration to the class of 
all median unbiased estimators, we can use the 
function 


P,lo(X)<u} foru«0 


Pje(X)2uj foru>é 


a(u, 0, o) — d 


as an indicator of the behavior pattern of an 
estimator o. The estimator qo that minimizes 
a(u, 0, o) for all values of u and 0 (u z 0) is said 
to be uniformly best. This property is also 
preserved under monotone transformations. 
Theorem (Birnbaum). If a family of distri- 
butions (P,|0€ € cR} has a monotone ‘like- 
lihood ratio with respect to a statistic t(x) and 
the distribution function F(t,0) of T=t(X) is 
continuous both in t for any 0 and in 0 for any 
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t, then there exists a uniformly best median 
unbiased estimator of 0. Actually, if 0 — (t) is a 
solution of F(t,0) — 1/2, then o(X) 2 Ó(t(X)) is 
such an estimator. 

If for any 0€ © the *mode of the density 
function or the probability mass function of an 
estimator o (X) is equal to g(0), then o(X) is 
called a modal unbiased estimator of g(0). 


D. Lower Bounded of the Variance of an 
Unbiased Estimator 


When there does not exist a sufficient and 
complete statistic, we can still seek to minimize 
the variance of the (mean) unbiased estimator 
at every fixed point 0 —0,. In the remainder 
of this section, Z is assumed to be dominated 
by a measure u, and p(x) denotes the density 
function of P, with respect to u and 7(x)= 
Po(x)/ Po (X). The following theorem guaran- 
tees the existence of the locally best unbiased 
estimator. 

Theorem (Barankin). Let Æ be the set of all 
unbiased estimators of a parametric function 
g(0) with finite variance at 0 — 09. Assume that 
M is not empty and E, ((n4(X))?) « oc. Then 
there exists an estimator cy in Æ that mini- 
mizes the variance at 0, within Æ. Actually, 
{Qo} = MNA, where A is the linear space 
generated by {z,(x)|0€0}. The minimum 
variance is given as follows: 


Vo (Po) 
=inf{ V (PX) oe. 


= SUP; d aA) |En e aza) | 


Ssi d Y E 


where h(0) — g(0) — g(05) and A is the (i, j)- 
component of the inverse of the n x n matrix 
(Ay) with 4; — Es (t5 (X)n9 (X)) and where the 
supremum sup, is taken over all positive in- 
tegers n, 0,,...,0,€ 0 and a,, ..., a, e R, and 
becomes sup, when the supremum ts taken 
over n and the d This theorem leads to the 
following three theorems with respect to the 
lower bound of the variance of an unbiased 
estimator. The first is immediate and the last 
two are obtained by replacing some order 
differences of x, with the corresponding 
differentials. 

Theorem. For any unbiased estimator o(X) 
of g and 05€ O, we have 


Vs (p(X)) CB ((g(8)— 9(09))^ / E, (r9 (X) 


—ng (X))j 


(Chapman-Robbins-Kiefer inequality). 
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Theorem. Suppose © cR. For any unbiased 
estimator (X) of g and under certain regular- 
ity conditions, we have 


(g (6) 
Ep ((0log po X}/04 | -»,)°) 


(Cramér-Rao inequality), where the equality 
holds only for the exponential type distri- 
bution pé(x) = f(O)u(x) exp(a(0) p(x)). An ex- 
ample of such regularity conditions is D) (111): 
(i) Eg, ((9(X))) < oo for all 0€ 8; (ii) py(x) has 
a partial derivative Dale) at 0 = 0, (a.e. Ps); 
and (iii) 


li (me um Secci | 
im E, 
A8—0 90 po (X)A0 De, (X) 


Corollary. Let X =(X,,..., X,) bea random 
sample from a distribution with density f(x, 0) 
and let 


1(0) = Eq((Olog f(X, , 0)/00)). 


Since E,((é log p4(X)/00)?) = nI (0), the Cramér- 
Rao inequality implies 


Va (o (X) 2 tg (09))" /nI (Oo). 


The number 7(0,) is called the Fisher infor- 
mation of the distribution f(x, 0). When the 
equality holds for an unbiased estimator (X), 
(X) is called an efficient estimator of g(0). In 
general, the efficiency of an unbiased estimator 
Q at 0 — 0, is defined by 


Eff(p) = (g (65) /(n1(85) Vo, (9). 


Theorem. For any unbiased estimator o(X) 
of g(0) and under certain regularity conditions, 
we have 





Vo (0(X)) = 





k k 


>} 2 g ?(0,) 


i= 


Vo (o(X)) gO) K" 


V. 


Wie mn inequality), where g?(0,) — 
9(0)/40*| a, and K is the (i, j)-component 
d the inverse of the matrix (K;;) with 


DX) p(X 
Ky= E, Kee ) pa. ( d iíj-2l...,k. 
Po (X) po, (X) 


An example of such regularity conditions is 
(i)- (iii): (i) Ey, (9 (X))) < œ; (ii) pell is k-times 
differentiable with respect to 0 at 0 = 0s; (iii) 
the ith partial derivative pgJ(x), i< k, satisfies 


lim E, | pe( X)lo- e )1a=6, MS -0. 
an "UL p (XAO pa (X) 
where APj(X)]|o-a, = Po,+a0(X) — Po (X) and 
A'P(X)— A(A ! B(X)) for i> 2. For k=1 the 
Bhattacharyya lower bound is the same as the 
Cramér-Rao lower bound. In general, the 
former gives a sharper lower bound than the 
latter. 

If the parameter is multidimensional, 0 — 
(0,, ..., 6, Y, then for any unbiased estimator 
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(X) of g(0) and under similar conditions to 
those for the 1-dimensional case, we have 


k k 
>} 2G (90) Gj (Uo 


i=1 j=1 


Vs (P(X) 


where g;(05) = 09(8)/00;|g-9, and J” is the (i, j)- 
component of the inverse of the matrix (J;j) 
with 


Jij7- Es (Clog pe(X)/06;| 4-6, 0log PeX Vëlo al 


If 0*(X) 2 (0f (X), ..., 0F(X)) is an unbiased 
estimator of 0 —(0,,...,0,) (i.e., E(0* (X)) = 6, 
for i2 1,2, ..., k), then the 'covariance matrix 
V, (0*(X)) of 0*(X) at Oo satisfies V (0*(X)) z 
J~', that is, the difference V,(0*(X)) —J ! isa 
nonnegative definite matrix. If X =(X,,..., X,) 
is a random sample from a distribution having 
density f(x,,0), then by setting 


I; — Es (Clog f(X,, 0)/00;| a 
x ólog f(X,, 0)/00;| s, ), 


we have J,,=nl,;. The matrix I —(I;) is called 
the Fisher information matrix of the distri- 
bution f(x,, 0). 


E. Decision-Theoretic Formulation 
(— 398 Statistical Decision Functions) 


Let W(0, a) (20) be the loss incurred from an 
estimate (or action) a of the parameter when 
the true value of the parameter is 0. The risk 
function of an estimator (X) of the para- 
metric function g(0) is then defined as 


r(0, p) = E(W(0, o(X))). 


Statistical decision theory deals with the prob- 
lem of minimizing, in an appropriate manner, 
the risk function by a suitable choice of o. The 
notions of complete class, Bayes estimator, 
admissibility, minimax estimator, and invar- 
iant estimator, explained here and in Sections 
F-I, are the most important of the theory. The 
unbiased estimator explained in Section C 
may also be considered an important concept 
of the theory. 

A class C of estimators is said to be essen- 
tially complete if for any estimator @ there 
exists an estimator qo, in C such that 


r(0, Po) & r(0, p) 


for any 0e ©. The following two theorems 
hold, provided that the action space .o/ is R 
and the loss function W(0, a) is convex with 
respect to ae. for any dech, 

Theorem (Hodges-Lehmann). If WO a)— oo 
as |a| 00, then the class of all nonrandomized 
estimators is essentially complete. 

Theorem. If T — t(X) is a sufficient statistic, 
then the class of all functions of T is essentially 
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complete. Actually, given any estimator (X) 
of g(0), the conditional expectation (t) — 
E(p(X)| T =t) yields an estimator al Alz 
y (t(X)) satisfying r(0, 9) « r(0, o) for any 0, 
where the equality holds when and only when 
P =p, provided that W is convex in the strict 
sense. 

A loss function of the form 


W(0,a)— 4(0)(a—g(0)*, — 2(0) 0. 


is called a quadratic loss functions. If, in partic- 
ular, Z(0)z 1, then 


r(0, p) = Es((o(X) — g(0))*) 


is called the mean square error of the estimator 
p(X) of g(0). This error coincides with the 
variance when the estimator is unbiased. 


F. Bayes Estimators 


Let € be an a priori distribution over the 
parameter space O associated with a certain 
*g-algebra oe, and assume that r(0, o) is %- 
measurable for every ~. Denote by E: the 
average operator relative to €. The infimum 
of the average risk r(£, q) = E*(r(0, )) = 

E* E((W(0, o(X))) for o running over its range 
is called the *Bayes risk relative to £, while an 
estimator (X) of g(0) at which the average 
risk r(é, o) attains the infimum is called a 
Bayes estimator relative to č. A Bayes esti- 
mator is obtained as follows: Assume that 28 
is dominated by a measure u with 3B x &- 
measurable density p,(x), the loss function 
W(0, a) is Ẹ x &-measurable, and 


| Pox) dé (0) < oc. 
e 


For each observed value of x, the Bayes esti- 
mator il takes the value a that minimizes 


r(a| x) = ES(W(0, a)| x) 
-| W(0,a)p(O|x)dc(0), 
o 


where p(0|x) is the probability density, with 
given x, of 0. We call r(a|x) the posterior risk. 
Theorem (Girshick-Savage). Suppose that 
the loss function is quadratic. For any x the 
value of the posterior risk is either oo (for 
every value of a) or finite (for all or only one 
value of a). If the Bayes risk relative to č is 
finite, then a Bayes estimator o*(X) relative to 
č is determined uniquely as follows: @*(x)= a 
if r(ag| x) « oo for only one value ag, whereas 
q* (x)= E*(g(O)A(0)| x) E Lol x) if r(a| x) « oo 
for every a. If E*(4(0)) « oo, then a Bayes esti- 
mator is either biased or has average risk zero. 
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G. Admissibility of Estimators 


An estimator p(X) of a parametric function 
g(0) is said to be admissible if and only if for 
any estimator (X) of g(0) the inequality 
r(0, ls r(0, po) for all 0€ © implies that 


r(0, o) — r(0, po) 


for all 0e O. If an estimator is the unique 
Bayes estimator relative to some a priori 
distribution, then it is admissible. For exam- 
ple, let X,,..., X, bea random sample from 
N(0, 1), and let W(0,a) —(a —0Y. Then ọ(X) 
— (c 4 no? X)/(1 ^- no?) is the unique Bayes 
estimator relative to the prior distribution 
N (c, c?) of 0, where X is the sample mean. 
Hence any estimator of the form o(X)—aX +h 
is admissible when 0 «a «1 and --œ « b « oc. 
In the rest of this section and the next sec- 
tion, we restrict ourselves to quadratic loss 
functions. If a statistic of the form co(X) with 
real c is an admissible estimator of g(0), then 


auo 2U )<e<sup Dec 
a Elo’) 6 Est?) 

Theorem (Karlin). Let X be a random vari- 
able having a 1-dimensional exponential type 
distribution dP,(x) = f(0)e9* du(x) with a param- 
eter space O —1(0,0) — {0| f2 e** du(x) « oo] 

a closed or open interval, and let g(0) = E(X) — 
— p'(0)/B(0) be a parametric function to be 
estimated. Then the estimator g,(X)=X/ 

(4 4- 1) for real å is admissible provided that 
Lë ^ d0— oo as b—0 and FAP ^d0 oc 
as a0 for any ce(0,0). 

Corollary. When © —(—oo, oo) the estimator 
Qo(X) — X is admissible. 

Corollary. Let © =(— œ, oo), and assume 
that both intervals ( —o0,0] and IO. 20) have 
positive measure with respect to p. Then every 
estimator of the form o(X)-aX with O<a<1 
is admissible. This theorem can be applied to a 
random sample X,,..., X, drawn from an 
exponential type distribution because the 
sufficient statistic X => X;/n has an exponen- 
tial type distribution and E,(X)= E,(X,). 

Theorem (Karlin). Let X be a random 
variable having a distribution dP,(x) = q(0): 
r(x)dx for Ox x x0 and =0 otherwise, with the 
parameter space © — (0, oo), where ff r(x)dx < 
+œ and [5 r(x)dx = oc. Among the estima- 
tors of the type o, (X)— c(q(X)) * for g(0) — 
(q(0)) * with «>0, only one estimator with the 
value c 2 (2a + 1)/(x+ 1) is admissible. This 
theorem is applicable also when the size of the 
random sample is larger than 1. 

Theorem (Stein). Let X,,..., X, bea random 
sample from a univariate distribution dP,(x) = 
f(x —0)dx with a location parameter 0. Define 





oC 


A Asi | HIEN 
x i=1 


149] 


for k — 0, 1, 2. If 


Ed Kg A, A, 2\ 3/2 n 
PGE) Om 


H 
x [| dxi « oo. 
i-l 


then the Pitman estimator Qo(X,,..., X,)= 
A,(X,, ..., X, Ao(X,, ..., Xn) of the parameter 
0 is admissible. 


Inadmissibility of the Usual Estimator for 
Three or More Location Parameters. Let X — 
(X,,..., KA be a k-variate normal random 
variable with mean 0 —(0,, ..., 0,) and covar- 
iance matrix I, the identity. Then the Pitman 
estimator of 0 is Ô= X. However, Stein showed 
that X is inadmissible. It is strictly dominated 
by the estimator 0*(X)— (1 —(k 2| X?) X, 
where |:| denotes the Euclidean norm |X |? = 

k X?. That is, if k>3, EJ0*(X)—0|^ < 
E|X —0|? for any 0. 

An estimator such as 0*(X) is called Stein's 
shrinkage estimator (James and Stein, Proc. 4th 
Berkeley Symp., 1 (1960)). 


H. Minimax Estimation 


An estimator o*(X) is said to be minimax if 
and only if 


sup r(0, p*)=infsup r(0, q). 
0 o 8 


If an estimator of is admissible and the risk 
r(0, *) is constant with respect to 0, then oi is 
minimax. 

Theorem (Hodges-Lehmann). À Bayes esti- 
mator q* relative to an a priori distribution 
€ is minimax if € assigns the whole proba- 
bility to a subset wc O, r(0, q*) is constant 
(say, c) for ec, and r(Q, p*)<c for eo, 
Let X have a binomial distribution B(n, 0), 0< 
0 «1; 0is unknown. If the prior distribution 
of 0 is a beta distribution B(/n/2, „/n/2), then 
the Bayes estimator is T*(X)=(X + \/n/2)(n+ 
n), which has constant risk E,(T*(X)—0)? = 
Q(1--  /n))? for all 0, 0 0 « 1. Thus, ac- 
cording to this theorem, T *(X) is minimax. It 
is interesting to compare the mean square 
error of T*(X) to that of minimum variance 
unbiased estimator Ha X /n, 0(1— 0)/n. 

Theorem (Wald). If there exists a sequence 
of prior distributions {€,} such that 


no; 


lim intr p*)— int | r(0, vun) «0 
? Je 


for any 0€, then o? is minimax. For exam- 
ple, the last theorem led to the proof of the 
fact that the Pitman estimator of a location 
parameter is minimax [4]. In the discussion of 
robust estimation, Huber (Ann. Math. Statist., 








399 J 
Statistical Estimation 


35 (1964)) proved that Huber’s minimax robust 
location estimator minimizes the maximum 
asymptotic variance over some family of sym- 
metric distributions in a neighborhood of the 
normal distribution (— 371 Robust and Non- 
parametric Methods). 


I. Invariant Estimator 


For simplicity, assume that the range A ofa 
parametric function g(0) coincides with the 
parameter space, and consider the point esti- 
mation of g(0) 2 0 (0 is not necessarily real). 
Suppose that there exist two groups G= {t} 
and G = {T} of one-to-one measurable trans- 
formations of % and O, respectively, onto 
themselves such that (i) there exists a homeo- 
morphic mapping x from G to G; (ii) if X 
has a distribution P}, then tX has the distri- 
bution Ba: and (iii) W(t0, ta) = W(0, a) for 
any 0, a, and t. An estimator q is said to be 
invariant if it satisfies @(tx)=T(x) for any 
1€G, a.e. Z. An estimator q is called a best 
invariant estimator if the risk function r(0, q^), 
where o is an invariant estimator, takes its 
minimum value when g= q. 

If the group G is ‘transitive on ©, then the 
risk function of any invariant estimator is 
independent of the value of 0, and hence any 
admissible invariant estimator is best invar- 
iant. For example, in the point estimation of a 
location parameter with a quadratic loss func- 
tion, the Pitman estimator is best invariant. 

Theorem. If © is a compact topological 
space, G is a group of homeomorphisms of © 
onto itself, and G is homeomorphic to @ 
under ge G >g0,€@ with a fixed Oe. then 
any Bayes estimator relative to the *right- 
invariant Haar measure over G is best in- 
variant. This result can be generalized to a 
locally compact © (— 398 Statistical Decision 
Functions). 


J. Sequential Estimation 


Estimation methods based on sequential sam- 
pling are not as popular as fsequential tests, 
because their efficiency is not very large com- 
pared to that of nonsequential estimation. A 
generalization of the Cramér-Rao inequality to 
any sequential unbiased estimator o(X) of a 
parametric function g(0) is the Wolfowitz 
inequality, 


Vtt XS (700)? AEAN) 


for every He, under regularity conditions 
similar to those for the fixed-size sample prob- 
lem, where N is the sample size and 7(0) the 
Fisher information. 
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K. Asymptotic Theory 


In practical problems of statistical inference 
the sample size n is often large enough to give 
sharp estimates of the parameters involved; 
then the sample distributions of estimators can 
be approximated closely by their asymptotic 
distributions, which are of a simpler nature. 
Assume that X =(X,, X,, ...) is a sequence of 
independent and identically distributed (i.i.d.) 
random variables with the common distri- 
bution P}, 0€ O. For each n, let o, =9,(X,, 
..., X,) be an estimator of g(0) that is a func- 
tion of 8 to Z (c R”). Thus oe, is a measurable 
mapping from (2", 98") to (A, €). Let us de- 
note the distribution of o, by ¥(¢@,), (o, 0) 
or Z'(o, | Py), the last two emphasizing that the 
underlying probability distribution is Pj. For 
example, if the mean vector Ej(q,) 2 m,(0) and 
the covariance matrix v,(0) — V,(¢,) exist for 
every n, and if Z[v,(0) !? (o, (X) —m,(0))| 0] 
N,(0, I) as n oo, then Y(¢,,| 8) is approxi- 
mated by a p-variate normal distribution 

N, (m, (0), v,(0)) (— 341 Probability Measures 
D). o, is said to be asymptotically (mean) 
unbiased for g(0) if m,(0)— g(0) for any 0€ O as 
n— oo. But we often calculate the asymptotic 
distribution without obtaining the exact mean 
and covariance matrix of the estimator q, for 
each n. In the asymptotic theory it may be 
reasonable to regard the sequence of estima- 
tors {@,} rather than each estimator o, as an 
"estimator," but we do not bother with the 
difference between these definitions of an 
estimator. 


Consistency. o, is called a consistent esti- 
mator of g(0) if o, converges to g(0) in proba- 
bility as n— oo: 


lim P,1|9,— g(0)|  £) —0 for any e>0 and 


every Uer. 


If the convergence is almost sure, it is called 
a.s. consistent. For example, if o, is asymptoti- 
cally unbiased with the covariance matrix v, (0) 
such that |v,(0)| 30 as n> oo, then o, isa 
consistent estimator of g(0). A sufficient con- 
dition for existence of a consistent estimator is 
given by the following result. 

Theorem (LeCam). Let Z be a Euclidean 
space and $8 the o-algebra of all Borel sets in 
X. If the parameter space © is a locally com- 
pact subset of R*, P, z P, for any 0  0' (identi- 
fiability condition), and P, -+ P, whenever 0, 
0, then there exists a consistent estimator of 0. 
Q, — g(T;) is a consistent estimator of g(0) if 
{T,} is a consistent estimator of 0 and g(0) is a 
continuous function of 0. 


Asymptotic Normality. The class of estimators 
is restricted to what are called consistent esti- 
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mators. An estimator {@,} is said to be asymp- 
totically normally distributed if the asymptotic 
distribution of n'?(o, — g(0)) is normal: 


L{n"*(p, —g(8))|0] N,((0), v(0)) as n— oo. 


H(0) and v(@) are called the asymptotic bias and 
asymptotic covariance matrix, respectively. 
They are not always equal to the limits, if any, 
of the mean and covariance matrix of n? (o, — 
g(0)). To, is usually called a consistent and 
asymptotically normal (CAN) estimator if the 
asymptotic distribution of n'?(q, — g(0)) is 
normal with the asymptotic bias zero. Then 
the distribution .Z(o, |0) is approximated 
by N,(0, v(0)/n). For example, the *noment 
method estimator is a CAN estimator. 
Theorem. Let {¢,} be a CAN estimator of 
g(0)c R! with asymptotic variance v(0). Then 


lim inf Ej (n| o, —9(80)?] 2 v(0) for every 0€. 


Theorem. Suppose that g(0) is a continu- 
ously differentiable function from © ( c R*) to 


. R? (p<k). Let G(0) — (09,(0)/00j, the tJaco- 


bian matrix. If {T,} is a CAN estimator of 6 
with asymptotic covariance matrix v(0), then 
{g(T,)} is a CAN estimator of g(0): 


A En? (gT) — 9(6)10] Np(0, G(6)v(0)G(0). 


An estimator (T, is said to be a best asymp- 
totically normal (BAN) estimator of 6 if {T,} 

is a CAN estimator of 0 with asymptotic 
variance I! (0), where I(0) is the tFisher in- 
formation matrix on in a single observation. 
We can see that the maximum likelihood (ML) 
estimator (— Section M) is a BAN estimator. 


Functional on Distribution Functions. Let oP") 
be a functional on distribution functions to RI. 
Let us consider the class of estimators that 
are defined by o, =¢(F,), where Ô, is the tem- 
pirical distribution function of n samples X, 
...,X,. An estimator (o,] with o, = o(£;) 
for each n is said to be Fisher consistent for 
g(0) if Q(F,) — g(0) for every 0€ O when F; is 
the true distribution function. {¢@,} is also a.s. 
consistent for g(0) if (,! is a Fisher consistent 
estimator of g(0) and if o is a continuous func- 
tional. Furthermore, if o is differentiable, we 
can see that {¢,} is also a CAN estimator by 
using the fact that n!2(£ (Fp !(r)) — t), O«t« 1, 
converges weakly to the tBrownian bridge. 
Let S be a set of distribution functions. S is 
said to be a star-shaped set of F if HeS 
implies F? =(1—t)F +tH €S for any te[0, 1]. 
Theorem (Von Mises). Assume that 
(F1) there exists a star-shaped set 5, at F such 
that lim, P, {f€ Sọ} =1; 
(F2) for any te[0, 1] and HeS,, there exist 
derivatives (d'/dt")g[(1—t)F -- tH], i— 1, 2; 
(F3) there exists ir from R! to R! such that 
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d oo 
Let April tol y)d(H(y) 
— F(y)) for all He S,; and 
d 
(F4) lim P, e" sup A 2 UP >e}=0 
9o te(0, 1] 
for any 6 and £» 0, where FY —(1— t) F, 4 tF,. 


Then if o, with o, = o(£,) is Fisher consistent 
for g(0), we have 


A [n'? (o, — g(0))10] NI, »(8)), 








where 


w= | dat TORIN Wun) i 


(c,j-Consistency. For a sequence of positive 
numbers c, tending to infinity as n oo, an 
estimator T, is called consistent for 0€ © with 
order c, (or {c,}-consistent for short) if for 
every £7 0 and every 0€ O there exist a suffi- 
ciently small positive number 6 and a suffi- 
ciently large number K satisfying 
lim sup sup[P, {c,| T, —t| 2 K;]|x—0| « 0] <e. 
Let {c,} be a maximal order of consistency. 
This notion was introduced by Takeuchi and 
Akahira. They studied consistent estimators of 
location parameters with various orders. Let 
X —O-R!'.Suppose that for every 0€ G, P, 
has a density function f(x —6) with respect to 
the Lebesgue measure. 

Theorem. Assume that 
(OCI f(x) » 0 ifa«x «band f(x) 20 if x xa 
or x«b; 
(OC2) there exist positive numbers 0 «a « fi < 
oo and 0 « A’, B' < oo such that 


i — l~a c A' 
Jm (x—ay *fego A^, 
lim (b— x)! ?f(x)— B'; 
xb-0 


(OC3) f(x) is twice continuously differentiable 
in the interval (a, b) and there exist positive 
numbers 0 < A", B" < oo such that 


Jim. (xa)? f o=", 
lim (b=)? Dill B". 


Assume further that f "(x) is bounded if & >2. 
Then for each « there exists a consistent esti- 
mator with the order given in Table 1. 








Table 1 
{c,}-consistent 

H order c, estimator 

O<a<2 n" {min X; + max X;— 
(a+b)}/2 

a= 2 (nlogn)'? ML estimator 
ML estimator 


gio n^ 
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L. Moment Method 


The moment method is also utilized to obtain 
estimators. Suppose that 4 c R! and Oc R*. 
Denote the *population distribution function 
by F, and the fempirical distribution function 
of n samples X,, ..., X, by F,. The following 
system of simultaneous equations is derived 
from letting the jth *population moment 


uj (0) =E,(X/)= Jun 
be equal to the jth tsample moment 
m, nd Y Xi- [Fan 

at 


For example, for j — 1, ..., k, 


u(0) — (u (0), ... (OD) = (Mars 


A moment method estimator is determined as a 
solution 0-0, (x)e 0 of k numbers of simulta- 
neous equations. 

Theorem. Assume that the function u(0) 
from O to R* is continuously differentiable and 
that the Jacobian matrix M(0) 2 04(0)/00 = 
(6u,(0)/00)), i, j= 1, ..., k, is nonsingular in a 
neighborhood of the true parameter. Then the 
moment method estimator exists and is a 
CAN estimator: 


L{n'?? (0, —0)10]— N,(0, M(0) 1 v(0)M (0) ^, 


where v(0) 2 (covo(X', X), i, j— 1, ..., k. In 
general, a moment method estimator is not 

a BAN estimator. However, in view of its 
simple form, a moment method estimator is 
important and often utilized as a first-step 
estimator in order to determine the maximum 
likelihood estimator by the iteration method. 


D Dal m,. 


M. Maximum Likelihood Method 


Suppose that a distribution P, has the density 
function f(x, 0), 0€ O c R*, with respect to a *c- 
finite measure u, and let x,, ..., x, be observed 
values of random samples X,, ..., X, from the 
population f(x, 0). Then the function L, of 0 
defined by 


Ltts, xg m ] fex) 


is called the likelihood function. If 0 = (x, , 

..., X,) Maximizes the value of L,(0) for fixed 
X,,...,X, and if it is a measurable mapping 
from (2", B") to (O, €) with € a *c-algebra 

of subsets of ©, then 0,(X)= O(X,, T AE 
called the maximum likelihood (ML) estimator 
of 0. This method of finding estimators ts 
called the maximum likelihood method. If the 
parameter is transformed into a new para- 
meter rn — h(0) by means of a known one-to- 
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one bimeasurable transformation h and if 
there exists a unique ML estimator Ó, of 0, 
then #,(X)=h(0,(X)) is a unique ML estimator 
of y. In other words, the ML estimator is in- 
variant for every one-to-one transformation. 

Many statisticians have investigated and 
improved known adequate regularity con- 
ditions under which the ML estimator exists 
and is a BAN estimator. 

Theorem (Wald). Assume that 
(C1) © is a closed subset of R* with nonempty 
interior OT: 
(C2) for any xe Z, f(x, 0) is continuous with 
respect to 0 and limy ,,, f(x, 0) —0 if © is not 
bounded; 
(C3) if 0; 4 0,, then P, # P, and | f(x, 06,) — 
f(x, 02) du(x) > 0; 
(C4) E, (llog f(X, 0,)]) < oo; and 
(CS) Es, (log * f(X, 0, p)) « oo and E, (log* ok, r)) 
« oo, where f(x, 0, p) 2 sup! f(x, 0); |U' —0| x pj 
and ot, r) - supí f(x, 0); |0| » rj. (The last 
two functions are measurable according to 
assumption (C2).) 
Then if a sequence of measurable functions, 


(0,(x,, .... Xn) ys satisfies 


KEN LA =l 
liminfq———— — ——— }>C>0 (a.s. EL 
n> dE Ux EE Xa) : 
then as n oo, 0,(X,,..., X,) converges a.s. to 
the true value 6, of the parameter. Hence if 
the ML estimator exists, it is a.s. consistent. 
Pfanzagl (Metrika, 14 (1969)) and Fu and 
Gleser (Ann. Inst. Statist. Math., 27 (1975)) 
gave rigorous proofs for the existence of the 
ML estimator. 

Theorem. Under assumptions (C1) and (C2), 
there exists a maximum likelihood estimator 
0, for any positive integer n. That is, 6, = 
0,(x,,...,X,) is a measurable function from 
(7", B") to (O, €) and satisfies LL x,, ..., x,) 
=SUpy L(0; x,, ..., Xp). 

In the remainder of this section we suppose 
that assumptions (C1)-(C5) are satisfied. We 
use the notation 


= < f(x, 0) |, a k-column vector, 
60 X00, 





vf = f(x, 0) k x k matri 
=| —— f(x,0)}, ak xk matrix, 
OUT \ 60,20; 





(E log f(x, 0) 


ppp GE 
00? 00, 00,00, ) d 


Theorem (Cramér). Assume that 

(ANI) for a.s. [u]x, f(x, 0) is three-times con- 
tinuously differentiable with respect to each 
component of 0 —(0,,...,0, € O9; 


0 
(AN2) trace". | Teo) 
x 00 


fD 
and | E du — 0; 





du=0 
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(AN3) d “rossi. 0) 


(AN4) the Fisher information matrix I(0) — 


E; | (sos f(x, D) Lin f(x, 0) | exists and 


is positive definite for Dep ` and 


(AN5) there exists an H(x) such that 
3 


0 
ap: 08 foe 0) 


3 
)- oo for 0e 9^; 


< H(x) and E,(H(X)) « C, a 








constant for 0€@°. 

Then the maximum likelihood estimator is 

a BAN estimator: Z[n'?(8, —0)| 0] 
N(0,1(0)!) as n oo for 6€ O°. Note that 
under assumption (AN1) the likelihood func- 
tion attains the maximum in @° with proba- 
bility tending to 1 as n oo if the true value 
of the parameter exists in O?. Hence the ML 
estimator is determined as a root 0— (xi, E 
x,) of the likelihood equation with the same 
probability as above: 





d " d 
Lestan-1É 00, kenn) m oa 
We also call n~! Glog L,(0)/C0 the likelihood 
estimating function. The essential fact used in 
the proof of the above theorem is the asymp- 
totic equivalence of the ML estimator and 
the likelihood estimating function: 


A, (0)—1(0)n'?(6,—0)0 in P as noo, 


where A,(0)=n !?01og L,(0)/00. Note the fact 
that 


S [A,(0)|0] N,(0, 1(0)) as n>% 


holds according to the central limit theorem 
(— 250 Limit Theorems in Probability Theory 


B (1). 


Contiguity. We now turn to the situation 
where we need asymptotic distributions under 
the alternative distribution P,,,-1,2, with 0+ 

n '^heO in estimation as we do in testing 
hypotheses (— 400 Statistical Hypothesis 
Testing). The notion of contiguity, due to 
LeCam (1960), is basic for the asymptotic 
methods of estimation theory. We consider 
sequences {P,' and {P;} of probability mea- 
sures on (Z", 8") with the *Radon-Nikodym 
derivatives p, and p, with respect to a o-finite 
measure, such as P, + P;. Denote by y,— 
A[P,; P,] a generalized log-likelihood ratio 
that is defined by log p;/p, on the set ( p,p;, 

7 0] and is an arbitrary measurable function 
on the set ( p,p, =0}. Let ( B,] be any sequence 
of ($3"1-measurable sets, and let | T,} be any 
sequence of {8"}-measurable functions. A 
sequence of distributions { #[T,| P,]) is said to 
be relatively compact if every subsequence 

{n} c {n} contains a further subsequence 

(mj c {n'} along which it converges to a prob- 
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ability distribution. In the Euclidean space 
relative compactness is equivalent to tight- 
ness: that is, for every ¢>0 there is a b(e) such 
that P,1|T,|» b(c)] <e for every n. 

Theorem. The following statements are all 
equivalent. 

(1) For any {T7,}, T, 0 in P, if and only if T, 
0 in P}. 

(2) For any {T,}, ( ZLT,| P,]] is relatively 
compact if and only if ( “[T,| P,]! is relatively 
compact. 

(3) For any {B,}, P,1B,] 50 if and only if 
P;B,) 0. 

(4) Whatever the choice of y,, {f[y,|P,]} and 
(ts, EI) are relatively compact. 

(5) Whatever the choice of y,, { [»,| P,]} is 
relatively compact. Furthermore, if (mj c (nj 
is a subsequence of {n} such that Y[y,,| Pa] 
converges to ¥[y], then E{e”} — I. 

Two sequences {P,} and (Pj satisfying 
requirements (1)-(5) of the above theorem are 
said to be contiguous. 

Theorem. Suppose that (P,) and {P/} are 
contiguous. Let (mj c {n} be a subsequence 
such that lz, T,,| Pa) converges to a limit 
Te, T]. Then #[y,,, Tml Ph] converges to 
e* Z[y, T], where v=e* Z[y, T] is given by 
v(A) - Leid Z[x, T]. 

Now, set P, = Pj and P, = P5,,- 2, for each 
n. Under suitable regularity conditions, such 
as assumptions (C1)-(C5) and (AN1)-(ANS), it 
is easy to see that {Py} and {Phen uzp} are 
contiguous. At the same time, we can see that 
the asymptotic linearity of A,(0 4- n !? 5h; 0) — 
ALPS, o P5 ] holds (say) in the vicinity of 
the true parameter as follows: 


1 
A (0 +n "h; 0) AU) (Oh 0 in Py, 


The asymptotic equivalence of the ML es- 
timator and A,(0), and the asymptotic linearity 
of the log-likelihood function and A, (0), leads 
to the regularity ( Section N) of the ML 
estimator. 

Theorem. Under suitable regularity con- 
ditions as above, the ML estimator is regular: 


[n^ (0, — 0) — h|O-- n ^h] N.(, 1(6) "), 
for any heR* with 0--n "^ he. 


N. Asymptotic Efficiency 


In Section D we discussed lower bounds of 
variances of unbiased estimators for finite 
sample size and defined the efficiency of an 
unbiased estimator with variance v,(0) by 
(v,(0)nI(0)) !. In this section we first discuss 
the asymptotic efficiency of a CAN estimator 
in the same vein as in the case of finite sample 
size. Second, we see a specific approach to the 
large-sample theory of estimation. Throughout 
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this section we assume (C1)- (C5) and (AN 1)- 
(ANS) stated in Section M. 


BAN Estimators. We suppose that the para- 
meter space © is a subset of RI in this para- 
graph. We restrict our attention to the class 
of CAN estimators (1, of the real-valued 
parameter 0 for which Z[n!?(T,—0)|0]^ 
N(O, v(8)) as n> œ. Fisher's conjecture con- 
cerning the lower bound to asymptotic var- 
iance v(0) of any CAN estimator is 


v(0) > I(0) !, 


where /(0) is the Fisher information on 0 in a 
single observation. The asymptotic efficiency 
of a CAN estimator with asymptotic var- 
iance v(0) is defined by (v(0)1(0)) '. A CAN 
estimator with asymptotic variance I(6)~' is 
called a BAN estimator or an asymptotically 
efficient estimator. Note that under suitable 
regularity conditions there always exists an 
asymptotic efficient estimator, for example a 
ML estimator, although for a sample of finite 
size there exists an efficient estimator if and 
only if the family of density functions is of the 
exponential type. 

Unfortunately, Fisher's conjecture is not 
true without further conditions on the compet- 
ing estimators. A counterexample was pro- 
vided by Hodges and reported by LeCam 
(1953). Let {T,} be any CAN estimator with 
the asymptotic variance v(0). Consider the 
estimator 


pun af ence 
"lm aLa, 


n 


where 0 «a « 1 is a constant. (7,j is also a 
CAN estimator with asymptotic variance v'(0) 
such that 


40)— a^v(0) if 0—0, 
BREC) otherwise. 


Let {T,} be a BAN estimator; then 7, is an 
estimator with asymptotic variance less than 
I(0) ' and is called a superefficient estimator. 

Theorem (LeCam). The set of points 0 for 
which the inequality due to Fisher fails is of 
Lebesgue measure zero. A condition due to 
Bahadur leads to the continuity of asymptotic 
variance which implies the validity of the 
above inequality. 

Theorem (Bahadur). Suppose that {T,} is a 
CAN estimator with asymptotic variance v(0) 
satisfying the condition 


1 
lim inf P, -i2( T, « 05 - n^ 17} <> 


or 


NI = 


lim inf Py, +n - ui T, > Ay +n tay x 
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Then the following inequality, due to Fisher, 
holds at 0 = 0: 


Gul = IA". 


Regular Estimators. Wolfowitz and Kaufman 
considered an operationally more justifiable 
restriction on competing estimators, called 
the uniformity property, stating that, for an 
estimator (7,) of 0, [n'"(T, —0)|0](y) con- 
verges to any limit L(y) uniformly in (y, 0)e 
R* x C, where C is any compact subset of the 
interior ? of O C R*. The ML estimator {6,} 
also has this uniformity property under suit- 
able regularity conditions, such as (C1)-(C5) 
and (ANI)-(ANS) to which some uniform- 
ity properties are added [29]. We note that 
asymptotic variance is not a good measure- 
ment of asymptotic efficiency unless an estima- 
tor is a CAN estimator, and that asymptotic 
concentration is in general a more pertinent 
measurement. 

Theorem. For an estimator {T,} with uni- 
formity property above, it holds that the limit 
L(y) is a probability distribution function and 
continuous for either one of the variables y or 0 
if the other is fixed and furthermore that the 
probability measure L, is absolutely contin- 
uous with respect to the Lebesgue measure on 
R*. 

Theorem. The asymptotic concentration of 
the ML estimator {0,} about 0 is not less than 
that of any estimator ( T,) with uniformity 
property: For any convex and symmetric set 
S c R* about the origin, 


lim P,1n'?(0, —0)e S) > lim P, (n'?(T, —0)e S]. 


Schmetterer (Research papers in statistics, F. 
N. David (ed.), 1966) provided the notion of 
the continuous convergence of distributions of 
estimators of 0 which is weaker than uniform 
convergence. An estimator {T,} is said to be 
regular if 


Lfn (T, —0)— h|0--n ^h] 2L, as noo, 


where L, is a probability distribution inde- 
pendent of h with Gin "^ heG. It was shown 
that the ML estimator {ĝ,} is regular under 
ordinary regularity conditions. Hájek obtained 
the following characterization of the asymp- 
totic distribution of any regular estimator and, 
independently Inagaki (Ann. Inst. Statist. 
Math., 22 (1970); 25 (1973)) obtained a similar 
result. 

Theorem. The asymptotic distribution L, of 
any regular estimator | T,; is represented as 
the *convolution of a normal distribution N, 
and some residual distribution Gy: 


La = N,* Go, 
where N,— N,(0, 1(0) '), the asymptotic distri- 
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bution of the ML estimator {6,} in the ordinary 
regular case. It follows from the characteriza- 
tion L,=N,* Gp, that the first two theorems 

in this section hold also for regular estimators. 
(— LeCam, Proc. 6th Berkeley Symp., 1972, 
and Roussas and Soms, Ann. Inst. Statist. 
Math., 25 (1973).) 


O. Higher-Order Asymptotic Efficiency 


In Section N it was shown that the ML esti- 
mator is a BAN estimator. In general, how- 
ever, there exist many BAN estimators. For 
example, consider the case of a *multinomial 
distribution where probabilities of events are 
parametrized. That is, let X —(n,,..., ny), n= 
n,+...+n,, be distributed as a multinomial 
distribution M(n;7,,...,%m), T1 +- +2, = 1, 
and let Je be a subset of m-vectors, He = 
In(0) 5 (n,(0), ...,1,(0);0£ 0), OCR". De- 
fine p —(f,,..., Pm) —(n,/n,..., n, /nf. Then 

we consider (1) the ML estimator, (ii) the 
minimum-chi-square estimator, (ti) the mini- 
mum modified chi-square estimator, (iv) the 
minimum Haldane discrepancy (D,) estima- 
tor, (v) the minimum Hellinger distance (HD) 
estimator, and (vi) the minimum Kullback- 
Leibler (K-L) information estimator. These are 
defined as the values of the parameter 0 that 
minimize the following quantities, respectively: 
(i) ML= —logL,— —n XM, Pilog ;,(0); (ii) 

xo = Er- (np; —n7,(0))? /nn.(0)); (ii) mod x? = 
Zr (np; — nz(0))? (np) (iv) D, = Ly n;(0)*!/ 
fi; (v) HD =cos TE (p;7,(0))'"; and (vi) 
K-L= 2, 2,(0)log(z,(0)/p;). Rao [32] showed 
that under suitable regularity conditions these 
estimators are Fisher consistent and BAN 
estimators. 


Fisher-Rao Approach to Second-Order Asymp- 
totic Efficiency. For 0€ O c R!, let p, be the 
density for n t.i.d. observations x =(x,,...,X,); 
and let qs, be the density of estimator T;. The 
*Fisher information contained in X and in T, 
are defined by nI(0) — E(dlog p,,/40)? and 

nl, (0) — E(d log qy,/d0)’, respectively. Rao 
defined the first-order (asymptotic) efficient 
estimator T, satisfying one of the following 
conditions: (1) n!? |dlog p,,/d0 — dlog q5,/d0| ^O 
in probability; (2) TO — 7. (0) 0 as n> 

00; (3) the asymptotic correlation between 

n!? (T, — 0) and n'? dlog p,, /d is unity; (4) 

In!? dlog p,,/d0 — a — Bn? (T, — 0)| ^0 in prob- 
ability. We note that the larger the condition 
number (j), the stronger the condition. A first- 
order efficient estimator is a BAN estimator. 
Fisher proposed 

E; = lim (nI(0) —n17, (0) = lim Vo(d log po,/d0 


noc 


— dlog qo, /d0) 
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as a measure of second-order (asymptotic) 
efficiency to distinguish different BAN es- 
timators. Fisher stated without any sort of 
proof that the maximum likelihood estimator 
minimizes E Le, maximizes second-order 
efficiency. Rao proposed 


E, - min V,(d log pp,/d0 — an"? — fin(T, — 0) 
A 


m An(T, < UM 


as a measure of second-order efficiency for 
first-order efficient estimators 11.) satisfying 
condition (4). He showed that the estimators 
mentioned above for multinomial distribution 
are first-order efficient estimators satisfying 
condition (4) and furthermore calculated 
second-order efficiencies of these estima- 

tors measured in terms of E,: (i) E,(ML)= 
7^(0)1(0); (ii) Ex(72)= A(0) + E(ML); (iii) 

E (mod y?) 2 4A(0) + E (ML); (iv) E(D) = 
(k+ 1)?A(0)+ E(ML) (v) E,(HD)=A(0)/4 

+ E,(ML); (vi) E,(K-L) = A(0) 4- E,(ML) with 


77 (0) —(Ho2 — 22, + Hao)/ D (0) =] 


— tti io 7 201 59)/ D (0), 
A(0) — (ni(0)/n4(0))?/2— pao /H(0) 


+ 130/217 (0)), 
where 
Hrs = Hrs(0) = Eg Lf UC 0)/40)/fUX , 0)Y 
x (d^ f (X, 0)/407)/f(X , dur) 


= Y ny iym OY Oy (0) 
Rao [33] gave another definition of second- 
order efficiency based on the expansion of the 
variance of T, after correcting for bias: V,(T,,) 
— (n1(0))  4- (0)n ? tr o(n ?). The quantity 
V/(0) is considered to be a measure of second- 
order efficiency. The results of Fisher and Rao 
were confined to multinomial distributions. 
Efron (Ann. Statist., 3 (1975)) and Ghosh and 
Subramanyan (Sankhya, sec. A, 36 (1974)) 
extended the results to the so-called curved 
exponential family of distributions. Efron gave 
a geometric interpretation to the effect that 
second-order efficiency is related to the curva- 
ture of a statistical problem corresponding 

to (0) above, and S. Amari recently extended 
this differential-geometric approach to the 
discussion of higher-order efficiency of esti- 
mators. Rao suggested that E is equal to E;. 
Ghosh and Subramanyan gave a sufficient 
condition for the equality to hold, whereas 
Efron provided a counterexample to show that 
E5 4 E, in general. 


Pfanzagl and Takeuchi-Akahira Approaches to 
Higher-Order Asymptotic Efficiency. For each 
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k — 1, 2, ...,a '(c,j-consistent estimator {T,} is 
sald to be the kth-order asymptotically median 
unbiased (AMU) estimator if for any 0c © 

( c R!) there exists a positive number 6 such 
that 


lim sup ck! 


n> x: |t-0|«ó 





or 


1 kl 
lim sup c; 


no |[x-6|«óà 





1 
Steil 


This notion, which is an extension of the con- 
dition due to Bahadur for the asymptotic 
efficiency, was introduced by Takeuchi and 
Akahira. For a kth-order AMU estimator 
{T,}, Gett, 0) c, ! G, (t, 0) - ... + c, "" Gy (t, 0) 
is called the kth-order asymptotic distribution 
of c,(T, — 0) if 


lim ck" |P(c,(T, 0) <t} — Go(t, 0) 


—6, Gi1(t,0)— ...—c, 1 Gar (t, 0) =0. 


Pfanzagl and Takeuchi and Akahira obtained 
the concrete form of the second- or third-order 
asymptotic distribution of the ML estimator. 
A kth-order AMU estimator is said to be 
kth-order asymptotic efficient if the kth-order 
asymptotic distribution of it attains uniformly 
the bound for the kth-order asymptotic distri- 
butions of the kth-order AMU estimators. 
Takeuchi and Akahira showed that under 
suitable regularity conditions, T, is second- 
order asymptotic efficient if 


P,((n1(0)) ^ (T, —0) <t} - (1) +n "^ Gp, (0) 
--2u39(0))/(61(0)*?) 
x tà (t) totn '?), 


where ®(f) is the standard normal distribution 
function and c(t) is its density function, and 
further that the modified ML estimator 


Ur — 0, +n sel BIG) 


for the ML estimator ô, is second-order asymp- 
totic efficient. Pfanzagl (Ann. Statist., | (1973)) 
obtained a similar result. The formulation due 
to Takeuchi and Akahira is more extensive 
since it can be applied to the so-called non- 
regular cases. 


P. Estimating Equations 


We often determine an estimator as a solution 
0— T,(xi, ..., x) of an equation P (x,,...,x,:0) 
— 0; for example, the ML estimator as a solu- 
tion of the likelihood equation. In such case, 
such an equation is called an estimating equa- 
tion and V',(0)  P,(X,,..., X,3 0), a random 
function, is called an estimating function [3]. 
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Call T, an estimator based on an estimating 
function ¥,(0). The following result is a modifi- 
cation of a theorem due to Hodges and Leh- 
mann (Ann. Math. Statist., 34 (1963)). 
Theorem. Let O be an interval of R!. Sup- 
pose that a real-valued estimating function 
V, (0) satisfies the following three conditions: 
(M1) V ,(0) is a nonincreasing function of the 
real parameter 0; 
(M2) for any real number h, n? P,(0, rn" !? fn) 
— n1? P (0,5) 4- yh—0 in probability, where y is a 
positive constant; and 
(M3) Z[n!? P, (05)] (y) 2 (y/o), where © is a 
continuous distribution function with zero 
mean and unit variance. 
Define an estimator based on VP, by T, =(0* + 
0**)/2, where 67 —sup(0| V,(0): 0) and 0** 
= inf(0 | XP, (8) 0), Then we have Z[n'?(T, — 
09) ]( y) ^ 6( yy/c) as n> oo. Huber consid- 
ered a formulation that guarantees the asymp- 
totic normality of an M-estimator. An M- 
estimator T, is defined by a minimum problem 
of the form Xf- p(X;, T,) 2 ming Zu p(X;, 0) 
or by an implicit equation X- w(X,, T;) «0, 
where p is an arbitrary function and w(x, 0) — 
Op(x, 0)/00. Note that p(x, 0) 2 —log f(x, 0) 
gives the ordinary ML estimator. Let © be a 
closed subset of R*, let (2, 8, P) be a proba- 
bility space, and let w(x, 0) be some function 
on Z x © with value in R*. Assume that X}, 
X,,... are independent random variables with 
values in Z having the common probability 
distribution P that need not be a member of 
the parametric family. Consider an estimating 
function 


1 n 
ug: Y VAO) 


Assume that 

(N1) for each fixed 0€ 0, w(x, 0) is B- 
measurable and w(x, 0) is tseparable; 

(N2) the expected value 4(0) — E{w(X, 0)} 
exists for all 0e O, and has a unique zero at 
degt: 

(N3) there exists a continuous function that 
is bounded away from zero, b(0) z b,» 0, 
such that E(sups {|Y (X. 0)|/b(0)}) « oo, 

lim inf, {|4(0)|/b(0)} > 1, and 

E(lim sU Pao {1Y (X, 0) — A(0)|/b(0)}) <1: 
(N4) for u(x, 0, d)=sup {|w(x, x) — v (x, 0| 
It—-6|xdj. 

(i) as d 90, E(u(X,0, d)) 0, 

(ii) there exist positive numbers dy, b,, and b, 
such that E(u(X,0,d)) « b, d and E(u(X, 0, d)*) 
<b,d for 0 and d satisfying 0 « [0 — 05| -- d & do; 
(N5) in some neighborhood of 0,, 2(0) is con- 
tinuously differentiable and the Jacobian ma- 
trix at 0 — 0s, A —(04;(05)/00),, is nonsingular; 
and 

(N6) the covariance matrix Z — E{ W(X, )- 

V (X, 05) ] exists and is positive definite. 
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Theorem. Suppose that an estimator {T,} 
satisfies V,(T;) 0 a.s. (or in probability) as 
n oo. Then, under (N1)-(N4) (i), T; 0, a.s. 
(or in probability) as n oc. 

Lemma. Under (N1)-(N5), 


sup {|'¥,(t) — (05) — Arel *? + AC); 
|t - als dal ^0 


in probability as n— oc. 

Suppose that (.Z [n ^ P,(T,;)]] is ttight. It 
implies that V,(7,) 0 in probability, and 
hence from the above theorem T,- 0, in prob- 
ability. Thus letting t = T, we have 


PRCT) — Pn(Oo) — AT) Mn? + |4(15)1)0 


in probability. That is, for any e>0 there 
exists an nọ such that for nz no, P (n'?|A(T,)| 
«e [n7 V, (T,) ^ n^ p, (05)]] > 1 — c. This 
and the tightness of { Z[n' ^ w,(T,)]) and 

LP [n P, (05)] ) lead to the tightness of 
(.Z [n 7 (T, — 0,)]), and the converse is also 
true. At the same time we have n!?W,(T;) — 
n? (05) — Ant? (T, — 05)-50 in probability. 
The following theorem is a straightforward 
consequence. 

Theorem. Suppose that an estimator Ó, satis- 
fies nii (8) 50 as n— oo. Then, under (N1)- 
(N6), L[n!2(6, —8))| P] 9 N,(0, A ! X ^'^!) as 
n oo. 


Q. Interval Estimation 


Interval estimation or region estimation is a 
method of statistical inference utilized to esti- 
mate the true value g(0) of the given para- 
metric function by stating that g(0) belongs to 
a subset S(x) of £, based on the observed 
value x of the random variable X. If 


P,g(0€S(X))21—a Tor ans deg 


for a constant x (0 «o < 1), then the random 
region S(X) is called a confidence region of g(0) 
of confidence level 1 — o, and the infimum of 
the left-hand side with respect to 0€ O is called 
the confidence coefficient. In particular, if Qc 
R and a confidence region is an interval, as 

is often the case, then the region is called a 
confidence interval, and two boundaries of the 
interval are called confidence limits. If a partic- 
ular subset S(X) among the set of confidence 
regions of g(0) of confidence level 1 — x mini- 
mizes Pj1g(0')e S(X)] for all pairs 9 and 0' (70), 
S(X) is said to be uniformly most power- 

ful. If a confidence region S(X) of g(0) of confi- 
dence level 1 —« satisfies P,{g(0’)eS(X)} «1— 
a for all pairs 0 and 0' (#0), then it is said to 
be unbiased. The notion of invariance of a con- 
fidence region can be defined similarly, and 

the definition for S(X) being uniformly most 
powerful unbiased (UMPU) or uniformly most 
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powerful invariant (UMPI) can be formulated 
in an obvious manner. 

For each Oe let A(05) be an tacceptance 
region of a ftest of level a of the fhypothesis 
0 —0,. For each xe g let S(x) - (0| xe A 
0€ 8j. Then S(X) is a confidence region of 8 
of confidence level 1 — a. If A(@,) is an accep- 
tance region of a UMP test of the hypothesis 
0 = for each 65, then S(X) is à UMP con- 
fidence region of 0 of confidence level 1 — o. 
Furthermore, corresponding to an acceptance 
region A(8)) of a UMP unbiased (invariant) 
test, a UMP unbiased (invariant) confidence 
region can be constructed in a similar manner. 


R. Tolerance Regions 


Let X and Y be distributed according to prob- 
ability distributions P* and P7 labeled by a 
common 0€ over measurable spaces (X, $3) 
and (4, ©), respectively, and consider the prob- 
lem of predicting a future value of the ran- 
dom variable Y using the observed value x of 
the random variablé X. If a mapping S(x) 
sending a point x to a set belonging to € is 
used to predict that the value of Y will lie in 
the set S(x), then the random region is called a 
tolerance region of Y. In particular, if a toler- 
ance region of a real random variable is an 
interval, it is called a tolerance interval and its 
boundaries tolerance limits. 

For simplicity, suppose that X and Y are 
independent. There are several kinds of toler- 
ance regions. First, if P} (P; (YeS(X)) 2 f) 2 y 
for any fe, then S(X) is called a tolerance 
region of Y of content fj and level y. Second, if 
Ej (Pa {YeS(X)})>B for any 0&6, then S(X) 
is called a tolerance region of Y of mean con- 
tent fi. Suppose that the random variable X = 
(Xi, ..., Xa) is a trandom sample and that 
both X, and Y obey the same distribution. 

If further the set (P7 |0€ ©} forms the totality 
of 1-dimensional tcontinuous distributions 
and the distribution of Pj (YeS(X)] does 

not depend on the choice of 0, then S(X) is 
called a distribution-free tolerance region. 

For example, if X, € X, € ... S X, is an 
"order statistic, then the interval (Xa, X] (for 
r « S) is a distribution-free tolerance interval 
for a random variable Y, independent of X,, 
...,X,, which has the same distribution as X,. 
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A. General Remarks 


A statistical hypothesis is a proposition about 
the *probability distribution of a sample X. If 
it is known that the ‘distribution of X belongs 
to a family of distributions A = (P,|0eQj with 
a parameter space Q, the hypothesis can be 
stated as follows: The value of the parameter 0 
belongs to c, where wy is a nonempty subset 
of the parameter space Q. This hypothesis is 
also written simply as H:6€«g. When cj 
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consists of one point, it is called a simple hy- 
pothesis, otherwise a composite hypothesis. 

Let X be a tsample space associated with a 
to-algebra B of subsets of X. To test a hypoth- 
esis H is to decide whether H is false, based on 
the observation of a sample X (e 2). The asser- 
tion that H is not false does not necessarily 
imply the validity of H. Such an assertion is 
called the acceptance of H, while the opposite 
assertion, that H is false, is called the rejection 
of H. In this framework for the testing prob- 
lem, H is often called a null hypothesis (— 401 
Statistical Inference). 

Consider a testing procedure in which H is 
rejected with probability (x) (0 x o(x) x 1) 
and accepted with probability 1 — g(x), when 
x€A ts observed. This testing procedure is 
characterized by the function o on X with 
range in [0, 1]. Here g(x) is taken as B- 
measurable on Z, and is called a test function 
or test. If (x) is the indicator function 7,(x) of 
a set B (c 8), then the test is rejecting H when 
x belongs to B and accepting H otherwise. The 
set B is called a critical region, and its comple- 
mentary set an acceptance region. A test is 
called a nonrandomized test if it is the indicator 
function of a set. Other tests are called random- 
ized tests. 

Suppose that the tdistribution of the sample 
X is a probability measure P, on (2^, 83). The 
probability of rejecting H when 0 is the true 
value of the parameter is calculated from 


Eolo) -| (x) Po(dx). 
T 


Let x be a given constant in (0, 1). If a test 
Q(x) satisfies Elp) x x for all @€@y, or, in 
other words, if the probability of rejecting H 
when H is true is not greater than a, x is called 
the level of and such a test is called a level « 
test. We denote the set of all level « tests by 
(x), and sup,.,,, Ej(«) is called the size of o. 
To judge the merit of tests, we introduce a 
different hypothesis A: The true value of 0 
belongs to w c Q— wy. This is called an alter- 
native hypothesis, or, for simplicity, an alterna- 
tive. The errors of a test are divided into two 
kinds: errors owing to the rejection of the 
hypothesis H when it is true, and errors owing 
to the acceptance of H when it is false. The 
former are called errors of the first kind, and 
the latter, errors of the second kind. The proba- 
bility E,(@) of rejecting H when 0€0,, that 
is, the probability of the correct decision being 
made for Ueo,, is called the power of a test or 
the power function. The probability of com- 
mitting an error of the second kind ts 1 — E,(@) 
for JVew,. A testing problem is indicated by 
the notation (2, B, P, wy, 4). A test pina 
class (a) of tests is said to be uniformly most 
powerful in (x) (or UMP in ®(2)) if for any 
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y ee (on), E, (o) 2 Elh) for all eo, When o, 
consist of a single point, it is said to be most 
powerful. 


B. The Neyman-Pearson Fundamental Lemma 


Let u be a *o-finite measure over (X, 8) and 
fios faa be wintegrable real-valued func- 
tions. If c,, ..., c, are constants such that the 
set Q(c,, ..., €p) of test functions o satisfying 


[osue 15x 1,25 7.4 if, 


is not empty, then there exists at least one test 
Po in ®(c;, ..., c,) that maximizes Tot, du 
among all o in M(c,,...,c,). A test @ is one 

of these tests if it satisfies the following two 
conditions: 

(1) For appropriate constants k,,...,k, 290, 


L when Liz bitch 

B(x) i 
0 when Lil È Lis 

eat 


almost everywhere with respect to u, and 
(2) the equation 


Jatdrn TE TZ, 2th 


holds. 
It (Cj 5%: 


of forte fon 


of the n-space R" and @ satisfies (2) and maxi- 
mizes f pf,41du among all p in $(c,, ...,c,). 
then @ satisfies (1). These statements are called 
the Neyman-Pearson lemma. 

As an illustration, suppose that Q —(1,2, ..., 
n,n4 1) and that {P,|@eQ) is dominated by a 
o-finite measure u. Let f(x) be the density of 
P, with respect to ji. When ou is a finite set 
11, ..., nj and w, consists of a single point 
n+ 1, then $ satisfying (1) and (2) with c, = 
...Cc,-a& is a uniformly most powerful level 
g test. 

If 8 is generated by a countable number of 
sets, then there exists a most powerful level « 
test for any hypothesis against a simple alter- 
native A :0 — H. A method of obtaining such a 
most powerful test is given in the Lehman- 
Stein theorem: Denote by f, the density func- 
tion of P, with respect to a o-finite measure p, 
and define 


,C,) is an interior point of the subset 





Qisa tes} 


hix)- | So(x)da(8) 


for a probability measure å on c. Consider 
testing the simple hypotheses H,: The density 
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of distribution of the sample is h,, against 

the alternative A:0 —0, and let o, be a most 
powerful level a test for this problem (H;:A). If 
SUpeco,, Eol) <4, 9; is a most powerful level x 
test for testing H:@é€w, against A:0 —0. The 
measure 4 satisfies Fa: lz E,(@,) for any 
probability measure A on @, and is called a 
least favorable distribution. 

When the alternative hypothesis w, consists 
of more than one point, a uniformly most 
powerful test does not generally exist. How- 
ever, if Q=R, c —(—00, 05], «4 — (05, ©), and 
foo is a density function with *monotone 
likelihood ratio with respect to a statistic T(x), 
then a UMP level x test o(x) exists and is 
defined by o(x)— 1 if T(x)>c; = a constant 
if T(x) 2 c; and =0 if T(x) «c. For a one- 
parameter exponential family of distributions, 
there exists a UMP level a test for testing 
H:og -(—00,0,]U[0,, ©) against A:w,= 
(0,, 02) (0, <0). However, a UMP test does 
not exist for the problem obtained by inter- 
changing the positions of w, and @,. 

Since hypothesis-testing problems admitting 
UMP tests are rather rare, alternative ways for 
judging the merit of tests are needed, and two 
have been devised. The first is to restrict the 
class ® of tests and to find a UMP test in this 
restricted class. The second is to introduce an 
alternative criterion of optimality and to select 
a test accordingly. The first is discussed in 
detail in Sections C, D, and E, and the second 
in Section F. 


C. Unbiased Tests 


The unbiasedness criterion is based on the 
idea that the probability of rejecting the hy- 
pothesis H when it is true (the probability of an 
error of the first kind) should preferably be no 
larger than that of rejecting H when it is false 
(the power). If a level « test o satisfies Eg (o) > « 
for 0&c,, then q is called an unbiased level x 
test. Let 6, (x) be the set of all unbiased level a 
tests. A UMP test in ®, (x) ts called a uni- 
formly most powerful (or UMP) unbiased level 
x test. 

If P, is of the texponential family whose 
parameter space Q is a finite or an infinite 
open interval of R*, then there exists a UMP 
unbiased level « test for the following two 
problems: (1) H:0, <a, A:0, >a, where 0, is 
the first coordinate of 0 — (0,,...,0,); and (2) 
H:0, =a, A:0, £a. For example, when the 
sample is normally distributed with unknown 
mean u and unknown variance a’, the Student 
test (defined in Section G) for a hypothesis 
H: =p, against an alternative A: u # (to is a 
UMP unbiased test. 
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D. Similar Tests and Neyman Structure 


If Ej(«) is constant for all 0€ c (<Q), q is 
called a similar test with respect to w. If E,(@) 
is a continuous function of 0, an unbiased 

test o is similar with respect to the common 
boundary & of wy and w,, provided that Q 

is a topological space and the density function 
is continuous in c. Therefore, in this case, 
unbiased tests are found in the class of all 

tests similar with respect to à. Let a statistic 
y= T(x) be tsufficient for A, = (P,|0€c]. A 
test o is said to have Neyman structure with 
respect to T if the conditional expectation 
E(q| T(x) — y) of o equals a constant Tal 
(For the notation [2,,] — 396 Statistic.) A test 
Q9 having Neyman structure with respect to the 
statistic T(x) is similar with respect to œ. A test 
ọ similar with respect to w has Neyman struc- 
ture with respect to T(x) if and only if the 
family 2— (Qu — P T !|0€o] of "marginal 
distributions of T is *boundedly complete. 


E. Invariant Tests 


Consider groups G and G of one-to-one trans- 
formations on 4 and Q, respectively. Suppose 
that each element of G is a measurable trans- 
formation of Z onto itself Ge, gBeB for any 
Be) and that a homomorphism gg of G 
into G is defined so that P,(g 1 B) Bell. 
The hypothesis H:@€,, and the alternative A: 
Jew, are said to be invariant under G if gwy = 
@, and gw, =a, for all ge G, and in this case 
the testing problem (Z, B, P, wy, w,) is said 
to be invariant under G. A test is called an 
invariant test if @(gx) — (x) for every ge G, 
and Ej4,(9) = Eloi holds for any invariant o. 
Accordingly, if G is ttransitive on cg, and 
invariant test is similar with respect to wy. If 
the sample space % is a subset of R” that is 
invariant under the translation (x,,..., x,) 
(x, a, ..., X, +a) with a real a and if there 
exists a 0' 2a: 0€Q such that P,.(B)= P,({(x, 
—4, ..., X,— a)|(x,, .... x,)e B}) for any 0€Q, 
then a is a transformation on Q. In this case 
the real number a is called a location para- 
meter. Furthermore, if the sample space 4 is a 
subset invariant under the similarity trans- 
formation (x,, ..., x,)9(ax;, ...,ax,) (a> 0) and 
if there exists a 0' —a: 0e such that P,(B)— 
Po {(x1/a, ..., x,/a)| (xi, ..., x,)e Bj) for any 
0€Q, then the real number a is called a scale 
parameter. The invariance principle states that 
a test for a testing problem invariant under G 
should preferably be invariant under G. A test 
(x) is called an almost invariant test if (gx) 
= q(x) [F] for all geG. 

Suppose that the testing problem of a hypoth- 
esis under consideration is invariant under a 
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transformation group G on the sample space 
X. Denote the set of all invariant level a tests 
by 5,(x). A test that is uniformly most power- 
ful in ®,(«) is called a uniformly most powerful 
(in short, UMP) invariant level x test. If there 
exists a unique UMP unbiased level a test oz. 
then a UMP invariant level a test (if it exists) 
coincides with o*[4]. When T(x) is maximal 
invariant under G, a necessary and sufficient 
condition for g(x) to be invariant is that o be 
a function of T(x). 

For example, suppose that the sample X 
—(X,,..., X,) is taken from N (u, o?) with 
unknown u and oi In this situation, Y=(X, S) 
is a sufficient statistic, where X = Y; X;/n and S 
— JJ X(X;— Xy. Let G be the group of trans- 
formations (x, s) 5(cx, cs) (c > 0) on the range 
W of Y and G be the group of transformations 
(u, 0?) (cy, co?) (c > 0) on the parameter space. 
Both the hypotheses H, : y/o? <0 and H;: 
ugi =0 are invariant under G. Since t= n: 
x/(s/../ n — 1) is maximal invariant, any invari- 
ant level a test is in the class of functions of 
t. The Student test, defined in Section G, 1s 
UMP invariant under G. 


F. Minimax Tests and Most Stringent Tests 


Minimax tests and most stringent tests are 
sometimes used as alternatives to UMP tests. 
Suppose that 2 —(P,|0eQ) is a tdominated 
family and $8 is generated by a countable 
number of sets. A level æ test of is called a 
minimax level « test if for any level « test 9, 


inf Ej(q*) 2 inf E,(¢). 
bewa Gea, 


Such a test exists for any ze 1). Ifa group G 
of measurable transformations on 4 leaves a 
testing problem invariant, then an intimate 
relation exists between the minimax property 
and invariance. Concerning this relation, we 
have the following theorem: For each a e(0, 1) 
there is an almost invariant level « test that is 
minimax if there exists a o-field 9I of subsets 
of G and a sequence {v,} of probability mea- 
sures on (G, Qf) such that (i) BEB implies 

(65g) gxe B1 eB x M; (ii) AeA, ge G implies 
Age: and (iii) lim, ,,,|v,(Ag) — v,(A)| 20 for 
any Ae and geG. Fundamental in the invar- 
iant testing problem is the Hunt-Stein lemma: 
Under the condition just stated, for any o 
there exists an almost invariant test i such 


that 


inf Ej(9) < E(J) «sup Fell 

Set geG 

The following six types of transformation 
groups satisfy the condition of the theorem: (1) 
the group of translations on R”, (2) the group 
of similarity transformations on R", (3) the 
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group of transformations g —(a, b): (x,, ..., x,)e€ 
R" (ax, +b, ...,ax, + b)eR" (0 €&a« oo, =% < 
b < œ), (4) finite groups, (5) the group of ortho- 
gonal transformations on R", and (6) the direct 
product of a finite number of the groups men- 

tioned in (1)-(5). 

We call £*(0)=sup,-aa) Eolo) an envelope 
power function, and o" (e 6(o)) is called a most 
stringent level test if 
sup (BF (0) — E(9*)) < Sup (B7 (0)— E,(~)) 
for any pe (zx). There exists a most strin- 
gent level a test for each xe(0, 1). If a testing 
problem is invariant under a transformation 
group G on X and G satisfies the condition 
in the Hunt-Stein lemma, then a uniformly 
most powerful invariant level a test is most 
stringent among the level a tests (— 398 Sta- 
tistical Decision Functions). 

Admissibility of a test and completeness of a 
class of tests are defined with respect to the 
probability of an error of the second kind (— 
398 Statistical Decision Functions). The uni- 
formly most powerful level « test and the 
uniformly most powerful unbiased level « test 
are admissible. 


G. Useful Tests Concerning Normal 
Distributions (— Appendix A, Table 23) 


In this section, we treat the rejection regions S 
that are commonly used in testing problems 
related to normal distributions. Let « be the 
level of S, and let c(x) and dia) be positive 
numbers determined by o. In (1)-(5) below, the 
sample consists of n mutually independent 
random variables X,,..., X, each of which 

is assumed to be normally distributed with 
mean y and variance o". For any sample point 
x z (xi, ..., x,), denote Zu x;/n by X and 

Y^. (x, x} by s?. (1) To test the hypothesis 

uX Ho against an alternative 4> uo, we can 
use as a critical region $ = (x |t(x)» c(a)], 
where the test statistic t(x) is given by Jn (x 
—[o)/./s7/(n —1). (2) To test the hypothesis u 
=p against an alternative p Æ Ho, we can use 
S = {x||t(x)|>c(a)} with the same test statistic 
t(x) as in (1). These tests based on the statistic 
t(x) are generally called Student tests or t-tests. 
(3) To test the hypothesis o? = o2 against the 
alternative o? >c with o2 >0, we can use S= 
1x|x?G) 2 c(x)], where y? — s?/o2. (4) To test 
the hypothesis o? > oz against the alternative 
o? « o, we can use S= {x| x (x) & c(a)], where 
y! is the same as in (3). (5) To test the hy- 
pothesis o? — o2 against o0? #04, we can use 
S={x|y7?(x)<c(a) or 2d(a)}, where y? is the 
same as in (3). Each of these tests based on the 
statistic y^ is called a chi-square test. Among 
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these tests, (3) is UMP and (1) is UMP when 

x z 1/2. All tests (1)-(5) are UMP unbiased, 
and (3)-(5) are UMP invariant under the 
translations (x,, ..., x,)-(x, +4, ...,X, +4) 
(—oo <a< oo). Since (1) and (2) are also UMP 
invariant under the transformations (x,, ..., x,) 
—(ax;, ...,ax,) (0 «a < oo), they are most 
stringent tests. 

Suppose that X,,..., Xm are independently 
distributed according to N(,,07) and that 
Y,, ..., Y, are independently distributed ac- 
cording to N(15,02), where j1,, H2, 0,, and o; 
are assumed unknown unless otherwise stated. 
Here we give the important tests for Ur, H2, 67, 
c2. Let x 2 (x, ..., x,) and y=(y,,..., Yn) be 
sample points in R" and R", respectively, and 
denote 27., x;/m, 37- Lyn, Zi 1 (x; — x)’, and 

Li-1(vi— y) by x, y, sz, and sł, respectively. (6) 
Assume that c, and o, are known, and con- 
sider a hypothesis 4, =. When an alterna- 
tive 44 >> (H; X u5) is taken, we can use as a 
critical region S= ((x, y)| T(x, y) > c(a)] (S= 
(ts, y) TG y) > cla)}), where T(x, y) 2 (— 
DU, / o/m4- o2/n. These tests are both UMP 
unbiased and invariant under the translations 
(Dr, Xm His S Y) (14, Xa Yi + 
a, ..., y, t a). (7) Assume that e; —6;, and 
consider a hypothesis u, = 4;. When py zu; 
(u, > p2) is the alternative, S = { (x, y} | T(x, y)] 
>c(a)} (S={(x, y)| T(x, y) » c(a)]) can be 
used as a critical region, where T(x, y) 2 
(x — y) / m 4- n — 2/( / Am -- 1/n,/s? +52). 
Both tests are UMP unbiased and are in- 
variant under (x,, ..., Xm Di, Vn) 9 (ax, + 
b,...,ax, bay, +b, ...,ay, +b) ( SE, E 
00,0 «a « oo). (8) Testing the hypothesis H: 
jt, =p, is called the Behrens-Fisher problem. 
Note that nothing is assumed about the rela- 
tion of the variances a? and o2 of the two 
samples X and Y, in contrast to (7). It is dif- 
ficult to construct a statistic whose distribu- 
tion is independent of c? and o2 when H is 
true. Compare this with (1)-(7), where the 
proposed statistics have this property and are 
used to construct similar critical regions S. 
The critical region 


{(x, yn — y s /m(m — 1) + s2/n(n — 1) 
> f(s1/s2), 


with an appropriately chosen f, is similar to 
such a region S. This test is called Welch's test. 
(9) For a hypothesis c, —605, we can use as a 
critical region S={(x, y)| F(x, y) &c(o)], S= 
{(x, lie, y) > e(a)}, or S= (Gs F(x, y) > dla) 
or « c(a)], when o; € 65, 0; <61, Or 6; 05, 
respectively, is taken as alternative, where 

F(x, y)=(n—1)s2/(m—1)s;. All these tests are 
UMP unbiased and are invariant under the 
transformations (x4, ..., X4, 4, Ya 

(ax, t b, ..., ax, b, ay, +b, ... ay, +b) SE 
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b « o0,0«a« oc). A test based on F(x, y) is 
called an F-test. 


H. Linear Hypotheses 


Let X,,..., X, be independent and distributed 
according to NI, o?) (i— 1,2, ..., n), where ji, 
Hla, -.+5 H; (S <n), o are assumed to be unknown 
and u;=0 (s «i n). The hypothesis is H: x; = 
H=... =, —0 (r <s), and the alternative 
hypothesis is that at least one u; 1 <i<r, does 
not vanish. The critical region S = ((x,, ...,x,)] 
F(x1, X43, X) = Deiat XIX 
c(x)] is a UMP unbiased test for this prob- 
lem, and S is invariant under the group g, of 
translations 


EE ME PEORES MEL EE Ji 
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the group g, of similarity transformations 
(x4, ..., Xa} 5 (cx,, ..., cx,), the group g4 — O(r) 
of orthogonal transformations in RI — 1(x,, 
...,X,)}, the group g4 = O(n — s) of orthogonal 
transformations in R" *—1(x,,,,..., x,)], and 
finite products of elements of the groups g,, 
g2, g3, and g4. This test is also a kind of F- 
test. More generally, let us denote X=(X,, 
X>,...,X,) and assume that it is expressed as 


X= AéE+ W, W=(W,, W,,..., Wy, (1) 


where € =(€,,&),...,€,)', s&n is a vector of un- 
known parameters and A a matrix of known 
constants, and W,, W,,..., W, are distributed 
independently according to the normal distri- 
bution with mean 0 and variance o". Then a 
general linear hypothesis is a hypothesis stating 
that the vector € lies within a linear subspace 
M of R*. The set of points Ač with č satisfying 
a linear hypothesis H is the linear subspace 
L(B) of L(A) spanned by the column vectors 
of an n x k, matrix B. Assume, for example, 
that the dimension of L(B) (=the rank of B) is 
s—r. Let C bean n x k, matrix whose column 
vectors span the orthocomplement L1(B) of 
the space L(B) with respect to the space L(A). 
Then the model (1) can be written as 


X=By+CC+W, E(W)=0, (2) 


with a k,-vector y and a k,-vector CG and hence 
the hypothesis H is represented by £— 0'. We 
denote by Y = P4X the projection of X onto 
the space L(A) and by Z the projection PX of 
X onto the space L(B). The quantity Qp = X(P, 
— P,)X equals the square of the length of the 
vector Y — Z and represents the sum of squares 
of residuals for the hypothesis H. The error 
mean square 6? = X'(I — PX/(n —s)— Q,/(n — s) 
and also 67,=Q,,/r under H are unbiased 
estimators of o). 
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Assume that B'C =0. Let U be an orthogo- 
nal transformation in R" such that the first, 
...,rth, (s+ 1)st; ..., nth rows of UB and the 
(s+ 1)st, ..., nth rows of UC are all equal to 
zero vectors. Using the notation X = UX, 7= 
U By, £ 2 UCC, and W = UW, we obtain the 
canonical form Ñ =4+¢+ W of the model (2). 
W is also a vector of independentlv and identi- 
cally distributed normal random variables. 
The hypothesis H is expressed as €=0. In this 
model, we have E(X;)=0 for i=s+1,...,n, and 
moreover, E(X;)=0 for i2 1, ...,r,s-- Lh ..., n if 
and only if H is true. 

The least squares estimator Y of Aé is the 
tmaximum likelihood estimator, and X — Y, 

Y — Z, Z are distributed independently ac- 
cording to the n-variate normal distributions 
N (0, ou — P4). N((P4 — Pp) C6, c? (P, — Py), 
and N(By+ BCL o? Pj), respectively. Hence 
Q/a?, Qu /o?, and X'P,X/o? are distributed 
independently according to the *noncentral y?- 
distributions with n — s, r, and s—r degrees of 
freedom and *noncentrality parameters 0, 
CCP, — Pg) CC/o?, and (By + PCE (By + 
P4CQ/o^, respectively. The ‘likelihood ratio 
test of the hypothesis H has a critical region 
65/6? >c, is a tuniformly most powerful invar- 
iant test with respect to the group of linear 
transformations leaving the hypothesis H 
invariant, and is the *most stringent test. This 
test is also uniformly most powerful among the 
tests whose tpower function has a single vari- 
able Z'C'(P, — Pp) CC/o?. Furthermore, for s— 
r — 1, this test is a tuniformly most powerful 
unbiased test. In the decomposition X'X — 
X'(P, — PX - X' PX -X'(I— PX =Q + 
Q5 Q, the terms Qj, and Q, are called the 
sum of squares due to the hypothesis and due 
to the error, respectively. Such a process of 
decomposition is called the analysis of var- 
iance and its result is summarized in the anal- 
ysis of variance table (Table 1). 


I. The Likelihood Ratio Test 


The likelihood ratio test is comparatively easy 
to construct. Let L(x,, ..., x,; 0) be the *likeli- 
hood function. Then 


SUpeco,, LE), Xn 0) 





A(X, e., Xp) = 


SUpseo, uo, L(X1. -Xn 0) 
is called the likelihood ratio, and the test corre- 
sponding to the critical region S= {(x,,...,x,)| 
A(x,,---,X,)<c,} is called the likelihood ratio 
test, where c, is a positive constant deter- 
mined by the level a. Let 04(x,, ..., x,) and 
d. len, Xn) be the tmaximum likelihood 
estimators for 0 in o and in is U 0, respec- 
tively; that is, L(x; 0,(x))= Spa, L(x; 0) and 
L(x; Ôn v 4(X)) = SUP peo, Uo, Lx; 0). Then 
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Table 1 
Degrees of Ratio of 
Factor Sum of Squares Freedom Mean Square Variances 
H Qj — X'(P4 — Poli r ôf =Qy/r 65/6? 
B Qg=X' P,X s—r 63 /(s —r) 
Error Q, - X'(I — PX n—s 6? — Q,/(n — s) 
Total X'X n 
As L(x; Oy) called a test. For example, a likelihood ratio 
ium L(x; Da, AJ test is frequently understood as a sequence 


The F-test for a linear hypothesis is a likeli- 
hood ratio test, and other examples of the like- 
lihood ratio test are shown in Appendix A, 
Table 23. However, the likelihood ratio test 
does not necessarily have the desirable prop- 
erties stated in the preceding sections. 


J. Complete Classes 


The set of critical regions of the type (x| T(x)> 
c} for the problems H:0 € 0, and A:0 0p, 
where the distribution family 2 of the statistic 
T is of *Pólya type 2 in the strict sense, and the 
set of regions of the type (x|c « T(x) «d] for 
the problem H:0, € 0x 0, and A:0 «0, or 0, 
<0, where the distribution family 2 of the 
statistic T is of *Pólya type 3 in the strict 
sense, are examples of minimal complete 
classes [6]. It has been proved under a mild 
condition that the set of all tests with convex 
critical regions is essentially complete when 
the underlying distributions are of exponential 
type and the null hypothesis is simple. 

Let P4,€ 2 (i—0,1) and let Bo be a g- 
subalgebra of B. 3B, is sufficient for B w.r.t. 
"P, Ps, if and only if the class of all 8,- 
measurable test functions is essentially com- 
plete, i.e. iff for every critical region Be% there 
exists a B -measurable test function po such 
that Ey (qo) € Es, (7) and E, (Po) > Es (Xp). 
Assume that 2 = [P,|0€ 81 is dominated; if for 
every B-measurable test function q there 
exists a By-measurable test function y such 
that E,(W)=E,(¢) for all 0€ 9, then Bo is 
sufficient for B w.r.t. A [9, 10] (— 398 Statis- 
tical Decision Functions D, 399 Statistical 
Estimation E). 


K. Asymptotic Theory 


Let (2,,98,, P.) (Y= 1,2, ..., 1) be a sequence of 
probability spaces, where the parameter space 
Q is common to all v. Let (4°, B”, P) be the 
direct product probability space of (25, B,, Bal 
for v2 1,2, ..., n. For each sample space (4, 
Din P™), denote a test function for H:0e 

My( CQ) and A:@E€a4(CQ—wy) by mx, 

..., X4). A sequence {@,} (n= 1,2, ...) is often 


of tests S — ((x4, ..., Sall A Gn. X) & A), 
where (4,] is a sequence of constants and 
A,(X1,--.,X,) is the likelihood ratio defined by 
(479, B”, Pf?) and wy. If a test («,] satisfies 
Ed, +0 (Gey) and Ej(9,) (ed as 
n 90, {Pp} is said to be a consistent test. If 
these convergences are uniform with respect to 
0, {Pay is said to be a uniformly consistent test. 
When a uniformly consistent test exists, wy 
and o, are said to be finitely distinguishable. 
Suppose that the observed values are identi- 
cally distributed (that is, (4,, 8,, Bel is a copy 
of a probability space (7, 8, P,)) and c, and 
w4 are both compact with respect to the met- 
ric p(0, 0") = supg.s| P4(B) — P,.(B)]. In this 
case, c and w, are finitely distinguishable if 
E,(@) is a continuous function of 0 for any o 
[4]. Kakutani's theorem (— 398 Statistical 
Decision Functions) is regarded as a proposi- 
tion concerning distinguishability when the 
null hypothesis and the alternative are both 
simple. 

The following result about the ‘limit distri- 
bution of a likelihood ratio is due to H. Cher- 
noff [3]: Let 4 be an n-space and Q be an 
open subset of R* containing the origin 0. 
Suppose that the observed random variables 
are independent and distributed according to a 
density f(x, 0); that is, the likelihood function 
L(x;0) is II, f(x;, 0). Moreover, assume the 
following regularity conditions: 

(1) log f(x, 0) is three-times differentiable with 
respect to 0 at every point of the closure of 
some neighborhood N of 0 — 0. 

(2) There exist an integrable function F and a 
measurable function H such that (1) |0f/00;| < 
F(x) for every 0e N; (ii) |0? f/00,00;| < F(x) for 
every 0e N; (iii) [2° log f/00,00,00,,| < H (x); and 
(iv) supp E(H(x)) « oc. 

(3) For every i, j= 1,2, ..., k, we have J¥= 
E,[(0log f/60,) (0 log f/00,) ] < oo, and the 
matrix J, — (JJ) is positive definite for all 0e N. 
Let PL Xs oxy Xp; @)=SUPoco 605,0) for 
a subset w of Q. Consider testing a hypothesis 
Jew, against an alternative d€w,, where O is 
an accumulation point of cg. If 2*(x,, ..., Xp) 
= P(x,, ..., x; og) P(x,, ..., Ze 04), A* plays 
essentially the same role as the likelihood ratio 
A and hence can be used in its place. We call a 
subset C of Q a cone if 0€ C implies ale C for 
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any a2» 0. A subset o of Q is said to be appro- 
ximated by C if c satisfies inf, c || x — yl = 
o(lyil) for all yew and inf, x — yl — oClxll) 
for all xe C around the origin, where ||x||? — 
E? x2. Suppose that wy and c, are appro- 
ximated by two cones Cy and C}, respectively. 
Then, setting z, — /n Jo ! A(x) with 


A È x Clog f(x, 0) 


^ H 
a=1 60, 





Ia 6log f(x,, 9) | 
ii s 00, ; 


the limit distribution of —21og 4*, when 0 

— 0, coincides with that of infec, (z, — 0)Jo(z, 
— 0) —infecc (z, — 0) Jo(z, — 0). In particular, 
when Q=R* and og — ((02, ...,09,0,.,,...,0,)] 
—0 «0, « oo, i— s - 1, ..., k] and some regular- 
ity conditions are assumed, the limit distri- 
bution of — 2log A5 is the ty?-distribution 

with s degrees of freedom if the hypothesis is 
true. 

The asymptotic behavior of the chi-square 
test of goodness of fit is also very important. 
Suppose that (X,,..., X,) has a multinomial 
distribution n!(x,!...x,!) ! pP ... p*COK, x; 
=n, x,Z 0), and consider testing H:p, =p’, 

Py 7 Py. The chi-square test of goodness of 
fit has a critical region of the type (x|x^(x,, 
4X) 2 c), where y?(x,, ..., x) is XE, ((x;— 
np?)?/np?), i.e., the weighted sum of the squares 
of the differences between the value p? of p; 
and the maximum likelihood estimator x;/n of 
p:i- The limit distribution of y?(x,, ..., x,) 
when p;— p?, i— 1,2, ..., k, is the chi-square 
distribution with k — 1 degrees of freedom. 

Suppose moreover that k functions p;(0) 
(i=1,...,k;s<k) of 0 (CR?) are given and that 
the hypothesis to be tested is that the sample 
has been drawn from a population having a 
distribution determined by H: p; = p;(0) (i= 
1, ..., ; s <k) for some value of 0. In this 
case the chi-square test of goodness of fit could 
be applied after replacing the parameter 0 
in p; Dill (i= 1,2, ..., n) by the solutions 
0, (x, ..., Xy) of the system of equations of the 
modified minimum chi-square method, 





i=1 Pi 90 


(j— 1, ..., s). Suppose that (1) pj(0) c? » 0 

(i 1, ..., k) and XE, p,(0) = 1; (2) p,(0) is twice 
continuously differentiable with respect to the 
coordinates of 0; and (3) the rank of the matrix 
(0p;/00)) is k. Then the system of equations 
above has a unique solution 0 —0,(x,, ... , xj), 
and 6, converges in probability to 0) when 0— 
05. The asymptotic distribution of 7?(x) — 

Zi (o; np(0,)Y np.(6,)) is the chi-square 
distribution with n —s— 1 degrees of freedom 
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[5]. For the test of goodness of fit, the em- 
pirical distribution function may also be used 
(— 371 Robust and Nonparametric Methods). 

A test of independence by contingency 
tables is one application of the chi-square test 
of goodness of fit. We suppose that n individ- 
uals are classified according to two categories 
A and B, where A has r ranks A,, A5, ..., A, 
and B has s ranks B,, B2, ..., B,. Let p, p. p; 
be the probabilities that the observed value 
of an individual belongs to 4;, Bj, A; B,, re- 
spectively. Let x;., x.;, x;; be the numbers of 
individuals belonging to A;, B;, and A;N B;, 
respectively. Table 2 is called a contingency 
table. To test the null hypothesis H that the 
divisions of A and B into their ranks are inde- 
pendent, that is, H: p; — p;.p.;, the statistic 
LKA KEE NIE is ap- 
plied. When H is true, y? is asymptotically 
distributed according to the chi-square distri- 
bution with (r — 1)(s — 1) degrees of freedom as 
n oo. 

Likelihood ratio tests and chi-square tests of 
goodness of fit are consistent tests under con- 
ditions stated in their respective descriptions. 
In general, there are many consistent tests for 
a problem. Therefore it is necessary to con- 
sider another criterion that has to be satisfied 
by the best test among consistent tests. Pit- 
man's asymptotic relative efficiency is such a 
criterion. Other notions of efficiency have also 
been introduced. 

A completely specified form of distribution 
is rather exceptional in applications. More 
often we encounter cases where distribution of 
the sample belongs in a large domain. Various 
tests independent of the functional form of 
distribution have been proposed, and the 
asymptotic theory plays an important role in 
those cases (— 371 Robust and Nonparamet- 
ric Methods). 

The following concept of asymptotic effi- 
ciency is due to R. R. Bahadur [11]: Let (7;] 
be a sequence of real-valued statistics defined 
on 49, {T,} is said to be a standard sequence 
(for testing H) if the following three conditions 
are satisfied. 

(I) There exists a continuous probability 
distribution function F such that for each 
0€ cg, lim, ,,, P? (T, « t] = F(t) for every 
teR'. 





Table2. Contingency Table 
B, B, ee B, Total 
A, X11 X12 fex Xis X. 
A, X21 X22 SC X25 X3. 
A, Zei KSE Ga Xrs Xp- 
Total x, ds A n 
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(II) There exists a constant a, 0 «a « oo, 
such that log{1 — F(t)) = —(at?/2) (1 -- o(1)] as 
t 90. 

(IIT) There exists a function b(0) on Q— wy 
with 0 c b(0) « oo such that for each det? — o, 


lim PTT, jul" hg tj =0 


for every t » 0. 

Suppose that {T,} is a standard sequence. 
Then T, has the asymptotic distribution F if H 
is satisfied, but otherwise T, oo in proba- 
bility. Consequently, large values of T, are 
significant when T, is regarded as a test statis- 
tic for H. Accordingly, for any given xe Z”, 

1 — F(T;(x)) is called the critical level in terms 
of T,, and is regarded as a random variable 
defined on # [1]. It is convenient to describe 
the behavior of this random variable as n> oo 
in terms of K,, where K,(x)2 —2log[1— 
F(T,(x))]. Then for each 0&€«,, K, is asymp- 
totically distributed as a chi-square vari- 

able 73 with 2 degrees of freedom and for 0e 
Q— og, K,/n—ab?(0) in probability as n> oc. 
The asymptotic slope of the test based on {T,,} 
(or simply the slope of ( 7,1) is defined to be 
c(0) —- ab?(0). Note that the statistic Kl? is 
equivalent to T, in the following technical 
sense: (i) (K1?! is a standard sequence; (ii) for 
each 0 cQ, the slope of (K]?! equals that of 
{T,}; and (iii) for any given n and x, the critical 
level in terms of K1^? equals the critical level in 
terms of T,. Since the critical level of KI? is 
found by substituting K^ into the function 
representing the upper tail of a fixed distribu- 
tion independent of F, {K}/?! is a normalized 
version of {T,}. Suppose that ( T4) and {TP} 
are two standard sequences defined on 2, 
and let F(x), a;, and b,(0) be the functions and 
constants prescribed by conditions (I)- (III) for 
i— 1, 2. Consider an arbitrary but fixed 8 in 

Q — og, and suppose that x is distributed 
according to P,. The asymptotic efficiency 

of {7} relative to {T,} is defined to be 

um: (ll — c,(0)/c, (0), where c;(0) — a;b2(0) is the 
slope of (T,?), i— 1, 2. The asymptotic effi- 
ciency is called Bahadur efficiency. 

Several comparisons of standard sequences 
are given in [11]. The relationship between 
Bahadur efficiency and Pitman efficiency for 
hypothesis-testing problems has also been 
studied. Under suitable conditions the two 
efficiencies coincide. 


L. Sequential Tests 


Let X,, X,,... be a given sequence of random 
variables. To test a hypothesis concerning the 
distributions of these variables (sample sizes 
are not predetermined), we observe first X,, 
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then X,, etc. At each stage a decision is made 
on the basis of the previously obtained data 
whether the observation should be stopped 
and a judgment made on the acceptability of 
the hypothesis. Such a test is called a sequen- 
tial test. Let X,, X,,... be independent and 
identically distributed by f(x). For testing a 
simple hypothesis H:0 —0 against a simple 
alternative A:0— 1, we have the sequential 
probability ratio test: Let G,(x,, X2, el 

= IT fi (xM f(x), and preassign two 
constants a, « a,. After the observations of 
X;, ..., X, are performed, the next random 
variable X,,, is observed if ag < G,(x,, ..., x) 
<a,. Otherwise the experiment is stopped, 
and we accept H when G, x a or accept A 
when a, <G,. The constants a, and a, are 
determined by the desired probabilities «, and 
a, of errors of the first and second kind, re- 
spectively. It is known that, among the class of 
sequential tests in which the probability of 
error of the first (second) kind is not greater 
than a, (x,), the sequential probability ratio 
test minimizes the expected number of obser- 
vations when either H or A is true (— 398 Sta- 
tistical Decision Functions; 404 Statistical 
Quality Control). 
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A. The Statistical Model 


Broadly and loosely speaking, the term "statis- 
tical inference" may imply any procedure for 
drawing conclusions from statistical data. But 
now it is usually understood more rigorously to 
mean those procedures based upon a *proba- 
bilistic model of the data to obtain conclusions 
concerning the unknown parameters of the 
population that represents the probabilistic 
model by viewing the observed data as a tran- 
dom sample extracted from the population. 

As the simplest example, suppose that, for 
some system, we have a number of observa- 
tions from repeated measurements or experi- 
ments under a supposedly uniform condition. 
If we can assume that there are no systematic 
trends or tendencies involved, we can suppose 
that the variations among repeated observa- 
tions are due to random causes and assume 
that the observed values X,, X,,... are inde- 
pendently and identically distributed random 
variables. Our purpose in making observations 
is to draw some information from the data, that 
is, to make some judgment on an unknown 
system quantity 0, which together with some 
other quantity (quantities) n characterizing the 
measurement or the experiment, determines 
the distribution of the X;. We assume that the 
distribution has a density function f(x; 0, n). 
This amounts to assuming that the observed 
values X,, X,,... area sample randomly 
drawn from a hypothetical population of the 
results of the measurements or experiments 
supposedly continued indefinitely. Then the 
problem of statistical inference is one of mak- 
ing some judgment based on the random 
sample. The set of hypotheses postulating the 
distribution of the observed values is called the 
probabilistic model of the observations, and 
the problem of determining a model in a spe- 
cific situation is called that of specification. 


B. Bayesian and Non-Bayesian Approaches 


There are two different ways to make infer- 
ences on the population parameters: the Baye- 
sian approach and the non-Bayesian approach. 
In the Bayesian approach it is assumed that 
we have some probability density z(0, n) for 
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the parameters 0 and y. Then, given the ob- 
servations X, —x,, X; —x;, ..., the conditional 
probability density for 0 and » is given by 


)= [fe 0, 4):t(0, n) 
SST] ts 8, 0) (0, 0) d0 dy 


7 is called the prior density and ĵ the posterior 
density for the parameters. Then all the in- 
formation obtained from the sample is consid- 
ered to be contained in the posterior distri- 
bution with the density P(0, n), and conclusions 
on the parameters can be drawn from it. 

The prior density z(0, n) does not necessarily 
represent a frequency function of a population 
of which the parameters are a random sample, 
but in most cases treated by the Bayesian 
approach it is considered to be a summary of 
the statistician's judgment over relative possi- 
bilities of the different values of the parameters 
based on all the information obtained before 
the observations are made. Bayesians claim 
that it is always possible to determine such a 
prior distribution in a coherent way, specifying 
the subjective probability, representing a per- 
son's judgment under uncertainty, as opposed 
to the objective probability, representing the re- 
lative frequencies in a population. L. J. Savage 
[7] succeeded in developing a formal mathe- 
matical theory of the subjective probability 
from a set of postulates about the consistent 
behavior of a person under uncertainty. 

The non-Bayesian statisticians, however, do 
not accept the Bayesians' viewpoint and insist 
that statistical inference should be free from 
any subjective judgment and be based solely 
on the objective properties of the sample de- 
rived from the assumed model. The theory 
developed by R. A. Fisher, J. Neyman, and E. 
S. Pearson, and others is based on the non- 
Bayesian approach. 





Di. als, x,, ... 


C. Problems of Non-Bayesian Inference 


The most commonly used forms of statistical 
inference are point estimation, used when we 
want to get a value as the estimate for the 
parameter; interval estimation, when we want 
to get an interval that contains the true 
value of the parameter with a probability not 
smaller than the preassigned level; and hypoth- 
esis testing, when it is required to determine 
whether or not some hypothesis about the 
parameter values is wrong (— 399 Statis- 
tical Estimation, 400 Statistical Hypothesis 
Testing). 

In any type of statistical inference, the prob- 
lem can be abstractly formulated by deter- 
mining a procedure that defines a rule, based 
on the sample observed, for choosing an ele- 
ment from the set of possible conclusions. 
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Such a procedure is evaluated by the probabil- 
istic properties derived under different values 
of the parameter from the distribution of the 
sample, and it is usually required to satisfy 
some type of validity criteria (such as un- 
biasedness of an estimator, size of a test, etc.), 
and among those satisfying them, one which 

is considered to be best according to some 
optimality criterion (such as minimum var- 
iance or most-powerfulness) is looked for. But 
in the sense of objective probability, the prob- 
abilistic property of a procedure is relevant 
only for the frequencies in repeated trials when 
the same procedure is applied to a sequence of 
samples obtained from the population and has 
no direct implications for the conclusion ob- 
tained by applying the procedure to a spe- 
cific sample we have in hand. For this reason 
Neyman argued that in statistical inference 
there is really no such thing as inductive in- 
ference but only inductive behavior. Fisher 
disagreed strongly with this argument and 
emphasized that statistical analysis is induc- 
tion and that tts purpose is to allow us to 
draw the proper conclusions from a particular 
sample and that the probabilistic properties 

of the procedure should and could have rele- 
vance for a particular conclusion obtained 
from a specific sample, provided that all the in- 
formation contained in the sample is used. The 
arguments between Fisher and Neyman led to 
a heated controversy between their followers 
that is still not completely settled. Fisher's 
arguments lead to the principle of sufficiency 
and the principle of conditionality. The prin- 
ciple of sufficiency dictates that all inferences 
should be based on a sufficient statistic 1f there 
is one, and the principle of conditionality re- 
quires that any inference should be based on 
the conditional distribution given the ancillary 
statistic, Le. a statistic whose distribution is 
independent of the parameter, if there is such a 
statistic. These two principles are accepted by 
many statisticians who do not follow all of 
Fisher's arguments, though the principle of 
conditionality sometimes leads to difficulties 
due to nonuniqueness of the ancillary. 


D. Specification Problem 


It is often difficult and sometimes impossible 
to have an exactly correct model for the data, 
and we must be satisfied with a model that 
gives a sufficiently close approximation and is 
mathematically tractable as well. It may also 
happen, however, that a model first specified 
may be far from reality and could lead to 
erroneous conclusions if relied on blindly. 
Here, the problem of model selection arises (— 
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403 Statistical Models), i.e., choosing the best 
of various possible models. 

We may also seek procedures that are little 
affected by the departure of the distribu- 
tion of the data from the assumed or some 
other model that satisfies the condition of 
validity without any assumption about the 
exact shape of the distributions (— 371 Robust 
and Nonparametric Methods). Generally, the 
problem of determining the model or specifi- 
cation should not be dealt with by mathemat- 
ical methods alone, and it should be consid- 
ered by taking into account the properties 
and nature of the subject under considera- 
tion and also the process of measurement or 
experimentation. 


E. History 


The first appearance of statistical inference as 
a method of grasping numerical characteris- 
tics of a collective was seen in the study by J. 
Graunt (1662) of the number of people who 
died in London. W. Petty applied Graunt's 
method further to the comparison of commu- 
nities in his Political arithmetic (1690). J. P. 
Süssmilch, a member of the Graunt school, 
perceived the regularity in mass observations 
and stressed the statistical importance of this 
regularity. The development of the theory of 
probability inevitably affected the theory of 
statistical inference. The method of T. Bayes 
was the first procedure of statistical inference 
in the current meaning of this expression. We 
now have a theorem bearing his name (the 
Bayes theorem), which is stated in current 
language as follows: If we know the proba- 
bility P-(E) that a cause C produces an effect E 
and if the prior (or *a priori) probability P(C) 
of the existence of the cause C is also known, 
then the posterior (or *a posteriori) probability 
of C, given an effect E, is equal to 


|. P(C)PAE) 
Ye P(O)PA) 


(— 342 Probability Theory F). This theorem, 
easily extendable to the continuous case, sug- 
gests the following inference procedure: If we 
are informed that an effect E has taken place, 
then we calculate the probabilities P(C) for 
every cause C, compare them, and infer that 
the C* with P,(C*)=max,P,(C) is the most 
probable cause of E. 

Both P. S. Laplace and C. F. Gauss dis- 
cussed the theory of estimation of parameters 
(— 399 Statistical Estimation) as an application 
of the Bayes theorem. In his research, Laplace 
considered a monotone function W(|t — 0|), 
and W — |t —0| in particular, of the distance 
|t — 0| between a parameter value 0 and its 


P,(C) 
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estimate t as a measure of significance of the 
error of the estimate t. Gauss, following La- 
place, used this weight function W(|t — 0|) of 
error, and going beyond Laplace, realized that 
it would be mathematically fruitful to put 
W(|t —8|) =(t —0. Such considerations led 
him to the study of the tleast squares method, 
in which the terminology and notation he de- 
vised are still in use. He also developed the 
theory of errors and recognized the impor- 
tance of the normal distribution and found 
that the least square estimate is equal to the 
most probable value if the errors are normally 
distributed. 

F. Galton, a biologist, revealed the useful- 
ness of statistical methods in biological re- 
search and explored what we call tregression 
analysis (— 403 Statistical Models) by intro- 
ducing the concepts of regression line and 
*correlation coefficient. His research on re- 
gression analysis originated from the study of 
the correlation between characteristics of 
parents and children, but he failed to realize 
the difference between tpopulation character- 
istics and tsample characteristics. 

Following Galton, K. Pearson developed 
the theory of regression and correlation, with 
which he succeeded in establishing the basis of 
biometrics (— 40 Biometrics). He arrived at 
the concept of population in statistics: A sta- 
tistical population is a collective consisting of 
observable individuals, while a sample is a set 
of individuals drawn out of the population and 
containing something telling us about charac- 
teristics of the population. Thus statistical 
research is regarded as investigation that 
focuses not on a sample as such but on a 
population from which the sample has been 
drawn. This consideration raised the problem 
of the goodness-of-fit test (— 400 Statistical 
Hypothesis Testing), that is, the problem of 
knowing whether a sample is likely to have 
been drawn from a population whose distri- 
bution was determined by theoretical con- 
siderations. K. Pearson characterized some 
population distributions occurring in practice 
by a differential equation, and classified them 
into several types. Using this classification, he 
discussed goodness-of-fit tests and developed 
the y?-distribution (chi-square distribution) in 
relation to the problem of testing hypotheses. 

Statisticians in the time of K. Pearson 
thought of a population as a collective having 
infinitely many individuals (i.e., an infinite 
population), which led to the idea that the 
larger the size of a sample (e, the number of 
individuals in the sample), the more precisely 
could the sample give information about the 
population. They carried out inferences, in- 
cluding the testing of hypotheses (— 400 Sta- 
tistical Hypothesis Testing), by approximate 
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methods, which later came to be termed large 
sample theory. Suppose, for instance, that 
{X,,...,X,} is an "independent sample of size 
n from a normal population N (u, o?). The 
random variable Z — nX — ug)/o with X = 
È; X;/n is distributed according to N(0, 1) 
when u= p. Therefore, if the size n is suffi- 
ciently large (n-» oo), we estimate ¢ by ó — 
(X.(X;— Xy!/(n (ID and deal with the 
random variable T= Jn(X — uo)/ó obtained 
by inserting ¢ in place of ø in the expression 
for Z, as if T itself were distributed according 
to N(0, 1). 


F. Development in the 20th Century 


W. S. Gosset, writing under the pen name 
"Student," reported in 1908 the discovery of 
the exact distribution of T and thereby opened 
the new epoch of exact sampling theory (— 374 
Sampling Distributions). This work of Student 
made it possible to perform statistical in- 
ference by means of small samples and conse- 
quently changed statistical research from the 
study of collectives to that of uncertain phe- 
nomena; in other words, the concept of popula- 
tion was once again related to a tprobability 
space with a tprobability distribution (De, a 
population distribution) containing unknown 
parameters. Thus it began to be emphasized 
that a sample has to be drawn at random De, 
a random sample) from the population if we 
are to make an inference about a parameter 
based on the sample. 

Fisher presented a complete derivation, . 
using the multiple integration method, of 
the *t-distribution (the sampling distribution 
of T). In addition, Fisher introduced the con- 
cepts of tnull hypothesis and significance test, 
which were the starting points for later pro- 
gress in the theory of hypothesis testing. He 
also added the concepts of tconsistency, teffi- 
ciency, and tsufficiency to the list of possible 
properties of testimators, and he studied the 
connection between the information contained 
in a sample and the accuracy of an estimator, 
which led to the idea of amount of informa- 
tion. Fisher also proposed the tmaximum 
likelihood estimator, which is formally equi- 
valent to the most probable value, but he 
renamed it and gave it a foundation com- 
pletely independent of any prior information 
and showed that it leads to the at least asymp- 
totically efficient estimator. 

Fisher made efforts to obtain a distribution 
of the parameter directly from the sample 
observation, hence independently of the con- 
cept of prior probability. He sought in this 
way to be released from the weakness of the 
Bayes method. For this purpose he introduced 


1511 


the concept of fiducial distribution, which was 
the subject of bitter controversy in the period 
that followed. As an example of a fiducial 
distribution, we consider here the tBehrens- 
Fisher problem: Let X,, ..., Xm and Y,,..., Y, 
be samples drawn independently from the 
populations N(u,,¢7) and NI, 62), respec- 
tively, where the parameters 4, H2, 0,, and o; 
are all unknown. The problem raised is to test 
the hypothesis 4, = u, or to estimate ô= 4, — 
Ht, by an interval. To solve this problem, we 
put 


X=) Xm, Y=% Yn, 
i j 


Si -Y 0,— XY! m- 1), 


d-s 

J 
and learn that T, = /m(X — u$, and T, 
=./n(¥—p)/S, are mutually independent 
and distributed according to the t-distribution 
with degrees of freedom m — 1 and n— 1, re- 
spectively. From this fact Fisher reasoned as 
follows: Given observed values x, y, s,, s; of 
the variables X, Y, S,, S,, the distributions of 
the parameters u, and u, are induced from 
the distributions of T, and T, by means of 
transformations 


=x T,s, Is 
Hi KH H257 on . 

Consequently the distribution of ó 2 x — y — 
(T; s,/A/ m — T;s;/. /n) is obtained. These 
distributions are called the fiducial distribu- 
tions of the parameters 44, 42, and 6. The 
interval |ó — (x — y)| <c of 6 deduced from the 
fiducial distribution of ó is called a fiducial 
interval of ó. 

Neyman and E. S. Pearson developed a 
mathematical theory of testing hypotheses, in 
which they deliberately defined a family of 
population distributions admissible for formal 
treatment and considered alternative hypoth- 
eses within the family. They proposed to 
relate a test to its *power function, on the basis 
of which the test would be judged. Their ideas 
brought mathematical clarity to the theory of 
inference. Furthermore, concerning interval 
estimation, Neyman devised an alternative to 
the fiducial interval, the *confidence interval, 
which has full mathematical justification. 
Unfortunately it was later found that the 
confidence interval, fiducial interval, and the 
Bayes posterior interval based on the posterior 
distribution often gave distinctly different 
results to the same problem, which became a 
source of controversy among different schools 
of thought. 

Since the publication of A. Wald's theory of 
statistical decision functions (— 398 Statistical 
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Decision Functions) in 1939, there has been 

a steady increase in its importance. In this 
theory the totality 2 of available statistical 
procedures, which is considered implicitly in 
the Neyman-Pearson theory, is put forth ex- 
plicitly as a set and defined as the space of 
decision functions. Wald also defined the *risk 
function of a statistical decision procedure and 
used it as a basis for judging procedures. In 
addition, he employed the concept of prior 
probability and the Bayes procedure for the 
purpose of proving the tcomplete class theo- 
rem. Wald's idea of bringing the concept of 
prior probability back into statistical theory 
carried a great deal of weight, and much litera- 
ture has now been accumulated on this sub- 
ject. Prior probability as a technique in statis- 
tics was abandoned after Fisher's introduction 
of the maximum likelihood method indepen- 
dent of prior probability and Neyman's 
assertion that a probability distribution on thc 
*parameter space made no sense. In addition, 
Wald linked statistical inference to games (— 
173 Game Theory) and introduced the *mini- 
max principle into statistics. The decision- 
theoretic setup also enabled him to develop a 
theory of sequential analysis by comparing 
the cost of sampling with the risk of erro- 
neous decisions (— 400 Statistical Hypothesis 
Testing). 

After the publication of Savage’s book in 
1954, there was a revival of the Bayesian ap- 
proach, i.e., one based on the concept of sub- 
jective probability, and now the group of those 
statisticians who accept the Bayesian approach 
are called Bayesians or neo-Bayesians. 


G. Applications 


Methods of statistical inference are applied in 
many fields where statistical data are used for 
scientific, engineering, medical, or managerial 
purposes. Methods of producing data that 
are appropriate for statistical inference have 
also been developed. R. A. Fisher developed 
the method of statistical *design of experi- 
ments (— 102 Design of Experiments) that 
when it is impossible or impractical to elimi- 
nate completely experimental errors or vari- 
abilities, provides the procedures to obtain 
such data. These data, though subject to ran- 
dom errors, are susceptible to rigorous statis- 
tical inference. For this purpose Fisher intro- 
duced the principles of trandomization, flocal 
control, and treplication in the design of experi- 
ments. W. A. Shewhart defined the !state of 
statistical control in mass-production pro- 
cesses, where the variabilities of the products 
can be considered to be due to chance causes 
alone and hence are statistically analyzable. 
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Applying the idea of statistical inference to this 
situation, Shewhart established the method of 
statistical quality control (— 404 Statistical 
Quality Control). Neyman introduced the 
method of random sampling into statistical 
surveys and developed the theory of estima- 
tion and allocation based on the theory of 
statistical inference (— 373 Sample Survey). 

In many applied fields there exist systems 
of statistical methods which have been devel- 
oped specifically for the respective fields, and 
although all of them are based essentially on 
the same general principles of statistical in- 
ference, each has its own special techniques 
and procedures. Specific names have been 
invented, such as biometrics (— 40 Biometrics) 
econometrics (— 128 Econometrics), psycho- 
metrics (— 346 Psychometrics), technometrics, 
sociometrics, etc. 
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A. General Remarks 


One cubic centimeter of water contains about 
3x 10°? water molecules. A macroscopic sys- 
tem of matter thus consists of an enormous 
number of particles incessantly moving in 
accordance with the laws of dynamics (— 271 
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Mechanics; 351 Quantum Mechanics). Dy- 
namical description of such microscopic mo- 
tion in full detail is impossible and even mean- 
ingless. À physical process in thermodynamics 
or thydrodynamics is described in terms of a 
relatively small number of macroscopic vari- 
ables, such as temperature, pressure, and a 
velocity field. Such a process shows a remark- 
able simplicity which is a statistica! result of 
the molecular chaos. This is the reason why 
statistical mechanics is needed as a theoretical 
model to unify microscopic dynamics and 
*probability theory. Thus statistical mechanics 
aims at deriving physical laws in the macro- 
scopic world from the atomistic structures of 
the microscopic world on the basis of micro- 
scopic dynamical laws and probabilistic laws. 
Its function is twofold. First, statistical me- 
chanics should give microscopic proofs of the 
macroscopic laws of physics, such as those of 
thermodynamics or the laws of macroscopic 
electromagnetism. Second, it should also pro- 
vide us with detailed knowledge of physical 
properties of a given material system once its 
microscopic structure is known. In this sense, 
statistical mechanics is an essential basis of the 
modern science of materials. 

Strictly speaking, the dynamics of the 
microscopic world obeys *quantum mechanics. 
However, even before the birth of quantum 
mechanics, statistical mechanics had pro- 
gressed on the basis of classical mechanics. 
This stage of statistical mechanics is often called 
classical statistical mechanics, in contrast to 
quantum statistical mechanics based on quan- 
tum mechanics. Statistical mechanics has a 
fully developed formalism to apply to physical 
systems in thermal equilibrium. This is some- 
times called statistical thermodynamics or 
equilibrium statistical mechanics. Until the 
1950s the term "statistical mechanics" had often 
been used in this narrow sense. In a wider 
sense it is concerned with systems in more 
general states, for instance, in nonequilibrium 
states. In the modern literature, a general 
statistical mechanical theory of nonequilib- 
rium systems is often referred to as the statis- 
tical mechanics of irreversible processes. 


B. History 


The early stage of statistical mechanics can 

be traced back to the kinetic theory of gases, 
which started in the 18th century. In dilute 
gases, gas molecules fly freely through the 
whole volume of the vessel and collide only 
from time to time. In thermal equilibrium, the 
average energy of each molecule is determined 
by the temperature of the gas; namely 


m v2/2=m v? 2 9 m v2/2=kT/2, 
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where (v,, Vp, v.) is the velocity, an overbar 
means the average, m is the mass of a mole- 
cule, T is the absolute temperature, and k is 
the Boltzmann constant (= 1.38 x 107'° erg: 
deg '). The velocity of each molecule is only 
probabilistic, and a ‘distribution function 

f (v, Vy, vz) is defined as the *probability den- 
sity that the velocity of a given molecule ts 
found to be in the neighborhood of (vy, vy, v,). 
In a dilute gas, this 1s given by 
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Dis p,. Dynamical states of the system 
constitute a set of points in this space. At a 
given time, the state of the system is repre- 
sented by a point P in the phase space, and 
hence the motion of the system is represented 
by the motion of P. If the system is conserva- 
tive, its energy function is constant. Let W be 
the *Hamiltonian function. Then the motion 
of P is confined to an energy surface defined 
by the condition # = E — constant. Measure 


on an energy surface is defined as the limit of 
the volume element lying between two neigh- 
boring energy surfaces corresponding to the 
energies E and E+dE. The motions of P form 
a *topological group that makes this measure 
invariant (*Liouville's theorem). 

A dynamical quantity A(p, q) of the system 


Tip, Vy, v) = Gett SEA +v? + ou) (1) 


the Maxwell-Boltzmann distribution law. 

L. Boltzmann viewed the velocity distri- 
bution function as changing in time as a result 
of molecular collisions and gave an equation 


of the form changes its value as the phase point P moves 
: on the energy surface. The time average A of 
SCH ALf ] - TL ]. (2) | Ais identified with the value of A observed in 


/ the equilibrium state of the system, namely, 
the average of A with respect to the invariant 
measure. Boltzmann justified this assumption 
by the following reasoning. If the energy sur- 
face has a finite measure and the trajectory of 
P does not make a closed curve on the energy 
surface, it can be assumed that the trajectory 
will move around practically everywhere on 
the surface. Mathematically formulated, the 
only measurable subset of the surface that has 
a nonzero measure and is invariant under the 
motion is the whole surface. This assumption 
is the ergodic hypothesis [6-9]. The long-time 
average of A will then equal the average of A 
over the entire energy surface with weight 
function equal to the measure previously in- 
troduced. The latter average is called the phase 
average and is denoted by <A>. Boltzmann 
thus asserted that 


where A[ f] is the change of the distribution 
function f by acceleration due to the presence 
of external forces and I'[ f ] is the change 
caused by molecular collisions. l'[ f] is an 
integral which is nonlinear in f. This type of 
equation is called a Boltzmann equation [ 1, 2]. 

Boltzmann introduced the H-function by the 
definition 


H= f | | Mo dodo, (3) 


and proved on the basis of equation (2) that 
dH /dt<0. This theorem is known as the H- 
theorem [1-4]. The equilibrium distribution 
(1) is therefore obtained from equation (2) as 
the solution that makes H a minimum. In fact 
the H-function is related to the entropy S by 
S= —kH. (4) A -(AY. (5) 
Boltzmann further showed (1877) that the 
distribution function of a system in thermal 
equilibrium can be obtained on more general 
grounds without relying on a kinetic equa- 
tion of the type (2) and that the statistical 
mechanics of systems in equilibrium can thus 
be constructed on a basis much more general 
than that given by a kinetic theory. It was W. 
Gibbs, however, who clearly established (1902) 
the complete framework of statistica] thermo- 
dynamics, although he had to confine himself 
to classical statistical mechanics [5]. 


Efforts of mathematicians to study the ergodic 
hypothesis created an important branch of 
mathematics called ergodic theory (— 136 
Ergodic Theory). 


D. Ensembles in Classical Statistical 
Mechanics 


Once we admit the ergodic hypothesis, or 
more specifically the assumption (5), the cal- 
culation of the observed value of a physical 
quantity A for a system in equilibrium is re- 
duced to finding the phase average of A on 
an energy surface. The task of statistical me- 
chanics of systems in equilibrium is thus re- 
duced essentially to calculating phase averages 
and establishing relationships between them 
[10-13]. 

For a set (called an ensemble in this case) of 


C. The Ergodic Hypothesis 


For a given dynamical system with n *degrees 
of freedom, the phase space is defined as a 2n- 
dimensional space with *generalized coordi- 
nates q,, ...,q, and generalized momenta 
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identical systems with the same energy, we 
consider the phase average for the tprobability 
space with the measure mentioned in Section 
C on the energy surface corresponding to the 
given energy value. Gibbs called a probability 
space of this kind a microcanonical ensemble. 
An average in this probability space is defined 
by 


AdS dS 
Die sa mrt. — d 
c -clgrad #|/ Je lgrad af | 


where # is the Hamiltonian function, grad 2 
is its gradient in the 2n-dimensional phase 
space, and the integration is carried over the 
energy surface with dS as surface element. 

When the observed system is in mechanical 
contact with a heat reservoir, the composite 
system consisting of the system and the heat 
reservoir is regarded as an isolated system 
with constant energy. Then an ensemble of the 
composite systems is treated as a microcanon- 
ical ensemble. It is more convenient and more 
physical, however, to consider the heat re- 
servoir simply as providing an environment 
characterized by its temperature T, and to 
concentrate only on the system in which we 
are interested. Then the system is no longer 
isolated and exchanges energy with its en- 
vironment. Since the energy of the system is 
no longer constant, the system will be found 
in any part of the phase space with a certain 
probability. To find the probability distri- 
bution for an ensemble of this system is a 
problem of asymptotic evaluation which is 
solved on the basis of the ergodic hypothesis 
and the fact that a heat reservoir has an ex- 
tremely large number of degrees of freedom. 
This asymptotic evaluation is traditionally 
done with the help of 'Stirling's formula or by 
using the Fowler-Darwin method [10], but 
it is essentially based on the *'central limit 
theorem [11]. 

The probability space of this kind of en- 
semble of systems in contact with heat re- 
servoirs was called a canonical ensemble by 
Gibbs [5]. If dT is a volume element of the 
phase space of the system, the probability of 
finding a system arbitrarily chosen from the 
ensemble in a volume element dT is given by 


Pr(dT ) = Cexp( — X /kT)dT. (7) 


Accordingly, the average of a dynamical quan- 
tity A is given by 


(ye | e nrar] | e-nntar. (8) 
For example, the average energy is 


tye z= pre nmar] [o-nmnar. (9) 
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By a traditional convention we introduce the 
parameter 


B=1/kT 
and write (9) as 
E= —@log Z(By/0, 


where Z(f) is called the partition function or 
the sum over states and is given for a system 
composed of N identical particles by 


ap [er ar ynt (10) 


If an exchange of particles with the environ- 
ment takes place in addition to an exchange of 
energy, the probability of finding a system 
with particle number N in the volume element 
dI is given by 


Cexp(— PH 4- BuN)dT/N'!, 


where yu is a real parameter called the chemical 
potential; this characterizes the environment 
with regard to the exchange of particles. This 
ensemble is called the grand canonical en- 
semble. The average of a dynamical quantity A 
is then given by 


Ain NE} [aserrean D 
N 
(11) 


where the dependence of A and ¥ on N is 
now explicitly written, and where 


SU, wies Y. NT! jemmmar (12) 
N 


is called the grand partition function. 


E. Ensembles in Quantum Statistical 
Mechanics 


The quantum counterpart of the classical 
ergodic hypothesis is that to each of these 
quantum states an equal probability weight 
should be assigned [10]. A microcanonical 
ensemble is then defined by this principle of 
equal weight, which yields in turn 


(45-5 AJ? (13) 


instead of (6). Here the index l refers to the 
quantum states lying in the interval AE, and 
A, is the quantum-mechanical expectation of 
a dynamical variable A in the quantum state 
LA canonical ensemble is now defined by 
assigning 


P,—e PEJN e PE (14) 
j 


to the jth quantum state as the probability 
that the system will be found in that state. The 
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expectation value of A must be given by 
(A5 2 Ae JS e P5 =tr Ae PH tre PH, 
j j 
(15) 


where H is the Hamiltonian. The partition 

function is defined by 

Zee etree, (16) 
Jj 

corresponding to (10). 

For a system consisting of identical parti- 
cles, quantum mechanics requires a particular 
symmetry of its fwave function; namely, the 
wave function must be even or odd with re- 
spect to permutation of any two particles 
according as the particles are bosons or fer- 
mions. This symmetry requirement is peculiar 
to quantum mechanics. Thus, even for an ideal 
gas consisting of noninteracting particles, 
quantum statistics leads to results characteris- 
tically different from those of classical statis- 
tical mechanics. This difference becoraes more 
significant when the particle mass ts smaller, 
the density is larger, and the temperature is 
lower. Quantum effects of this kind are seen in 
metallic electrons, in liquid helium, in an as- 
sembly of photons or phonons, and in high- 
density stars. The statistical laws obeyed by 
bosons are called Bose statistics, and those 
obeyed by fermions, Fermi statistics. 

The expectation value of A in the grand 
canonical ensemble is given in quantum sta- 
tistics by 


(45 =E(B, u) tr(Ae PR PHM), (17) 


where H is the tsecond-quantized Hamil- 
tonian, N is the number operator, the trace tr 
is taken on the (nonrelativistic) *Fock space 
(symmetric or antisymmetric according to 
Bose or Fermi statistics), and =(f, u) is the 
grand partition function given by 


E(B, u) - tre Dirk, (18) 


F. Many-Body Problems in Statistical 
Mechanics 


Since statistical mechanics is primarily con- 
cerned with systems with large numbers of 
particles, problems in statistical mechanics 

are essentially many-body problems. In prac- 
tice, however, there are some cases where 
extreme idealization is possible, as in ideal 
gases, where the interaction between gas mole- 
cules is ignored. In some cases we can proceed 
by successive approximation, taking the par- 
ticle interactions as perturbations. Such per- 
turbational treatments are, however, entirely 
useless for some problems, such as phase tran- 
sitions, of which an example is the conden- 
sation of gases into liquid states, where the 
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interaction of particles plays a critical role. 
Such problems are clearly many-body prob- 
lems. There are a number of important and 
interesting problems in this category, for 
example, transitions between ferromagnetic 
and paramagnetic states and those between 
the superconducting and normal states of 
metals. Transition from a high-temperature 
phase to a low-temperature phase is generally 
regarded as a consequence of the appearance 
of a certain type of order in thermal motion. 
This kind of phase change is called an order- 
disorder transition [14—16]. 


G. Thermodynamic Limit and Characterization 
of Equilibrium States 


Although an actual system is finitely extended, 
the enormous sizes of the usual macroscopic 
systems in comparison to the sizes of their 
constituent particles justifies the idealization to 
infinitely extended systems. At the same time, 
there are several mathematical advantages in 
considering infinitely extended systems, such 
as the absence of walls (replaced by the bound- 
ary condition at infinity, should it be relevant), 
appearance of phase transitions as mathemat- 
ical discontinuities rather than mathematically 
smooth though quantitatively sudden changes, 
and mathematically clear-cut occurrence of 
broken symmetries. 

Equilibrium states of infinitely extended 
systems are usually obtained by taking the 
limit of the equilibrium states of systems in a 
finite volume V as both V and the number of 
particles N tend to oo with the density p = N/V 
fixed; this is called the thermodynamic limit. 

It is sometimes possible to formulate the 
dynamics of infinitely extended systems direct- 
ly and to characterize their equilibrium states, 
which more or less coincide with the thermo- 
dynamic limit of equilibrium states of finitely 
extended systems [17-21]. The simplest and 
most fully investigated case of lattice spin 
systems is explained below in detail [17]. Since 
classical systems can be viewed as special cases 
of quantum systems, we start with the latter. 
To be definite, we take a v-dimensional cubic 
lattice Z” with a lattice site n —(n,, ...,n,) 
specified by its integer coordinates n;. (In the 
lattice case, the thermodynamic limit is simply 
the limit as V oo.) 

The C*-algebra 9I of observables is gen- 
erated by the subalgebra 9I, at each lattice site 
n, which is assumed to be the algebra of all d x 
d matrices (for example, linear combinations 
of tPauli spin matrices o? — (of, of”, of”) and 
the identity for d — 2) and to commute with 
operators at other lattice sites. The group of 
lattice translations nn +a is represented by 
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automorphisms y, of M, satisfying y, W, = 9I, , , 
(7,0 — a '*?). For any subset A of Z", 9I(A) 
denotes the C*-subalgebra of 9I generated by 
QW. ne A. 

A model is specified by giving a potential ® 
which assigns to each finite nonempty subset 
I of Z” an operator (I) = @(1)* e9t(I). The 
Hamiltonian for a finite subset A of Z" is given 
by U(A) - X; - , (I). In order to control long- 
range interactions, various assumptions are 
introduced. Examples are finiteness of either of 
the following: 


I] sup?, NU) IS), (19) 
lI if — sup > [()]. (20) 


Here N(I) is the number of points in 7. 

Let YS! be a maximal Abelian *-subalgebra 
of 3I, (such as (c, +c,0!}) satisfying y, WU! = 
We, and A be the Abelian C*-subalgebra 
of A generated by dl. ne Z”. If (I) is in 9I*! 
for all J, we call the potential ® Abelian or 
classical. There exists a conditional expecta- 
tion z*' which is a positive mapping of norm 
! from A onto YW" satisfying 2°'(A BC) = 
An" (B)C for A and C in W” and x°"(1)= 1. If 
a state ~ on WI satisfies ~(A)= o(n*(A)), we 
call the state « classical. Classical states are in 
one-to-one correspondence with the restriction 
on Y*!, which can be viewed as a probability 
measure on the spectrum (also called configu- 
ration space) of the C*-algebra I‘! of obser- 
vables for classical spin lattice systems. This 
correspondence makes it possible to view 
classical spin lattice systems as quantum spin 
lattice systems with Abelian interactions. 

For a given potential ®, the time evolution 
of the infinitely extended system is described 
by the one-parameter group a, t€ R, of *- 
automorphisms of Y defined as the following 
limit: 

x(A)— lim eil Ae "UN  (4e9p, (21) 
AZZ” 

The limit exists if (7) -0 for N()>N 

and |||®|||< oo, or if for some 4-0 

E, e^" (sup, X, (|9()] 15x, NI) e n])« 

oo, or if v — 1 (1-dimensional lattice) and 

sup, E, (Ib) In e, ale Ø, I Dx, o0) 

Ø « oc. An alternative way is first to de- 

fine 6g(A)= X, i[®(7), A] for Ael JA9t(A) 

(A is a finite subset of Z"), which exists if 

Ill « oo, and to prove that the closure õp 

of dg is a generator of a one-parameter sub- 

group a, (=expfd9). In the above cases, dp is a 

generator. 

A general canonical ensemble for a system 
in a finite subset A of the lattice Z", with 
some boundary condition in the outside A° = 
Z'\A, is given by @,(A)=(t, @ y)(e /P x 
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Ae PH?) where t, (9 y is the product of 
the unique tracial state t, on 9((A) and a state 
V on 9I(A*) (the boundary condition), H(A)— 
U(A)+ W(A) and W(A)- Zu) IA, 
I (1 A* z (75 (the surface energy). The follow- 
ing conditions on a state o of 9I are mutually 
equivalent under the condition that dg is a 
generator (which holds under any one of the 
conditions described above) and is satisfied 
by any limit state of the above o, as A 3 Z^ 
(e, a state in (\, {pa | A C A, Y}, with the 
bar denoting weak closure). 

1. KMS condition: 9(4o;,(B)) 2 @(BA) for 
any A, Be such that «,(B) is an entire func- 
tion of t. (9 is called a B-KMS state.) 

2. Local thermodynamic stability: For any 
finite subset A of Z" and for any state y having 
the same restriction to 9I(A) as the state o 
under consideration, Fy (o) « E, ,(V) (the 
minimality of the free energy multiplied by £), 
where Fy ,(9)— Dot HAN — S (0). Sal) = 
lim {S,.(@)—S,,(g)} as A’ 7 Z" (the open 
system entropy), S4(9) = — o(log p, (p)) (the 
closed system entropy) and the density matrix 
Palp)E AA)” is defined by o(4) — c (pa(9) A) 
for all Ae WA). 

3. Gibbs condition: For every finite subset A 
of Z”, the perturbed state p?”™ (not neces- 
sarily normalized) is the product o x y of 
the Gibbs state o (A) — Die 7" A)/tr e "UM 
on AeW (A) and some (unknown) state y on 
3I(A*), where the representative vector 6 for o 
for the GNS representation z, is assumed to 
be separating for z, (9I)", and then o^" 4) — 
(Q, 1, (A)Q) for 


a-Yr|' wl ds, .. E ds, Ag, 


x (W(A) Ag e. Ag n (W(A))®, 
(22) 





di 


where A, is the "modular operator for ® and 
the series converges. 

For a classical potential, this condition re- 
duces to the conditions that ¢ is classical and 
that the restriction of o to W® as a measure on 
the configuration space (1... d^ satisfies the 
following DLR equation due to R. L. Do- 
brushin, O. E. Lanford, and D. Ruelle: The 
conditional probability for E(A)e {1 ...d}“ 
knowing é(A‘)e{1...d}“"“ is proportional 
to exp( — BH(A)), Wicre H(A)  U(^) - W(A) is 
a function of €(U(A) depending only on £(A)). 

4. Roepstorff-Araki-Sewell inequality: For 
any Ael JA9I(A), ip(A*5(A)) is real and 


— ifip(A*0g(4)) 2 S(o(A * A), 9(AA*)), 23) 
where S(u, v) 2 ulog(u/v) if u 0, v>0, S(O, v) — 


0 for v 2 0 and S(u,0) +% for u» 0. 
5. Roepstorff-Fannes-Verbeure inequality: 
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For a-entire Ae, 


B * 
E" | p(A*a,,(A))da< F(p(A* A), o(A A*)), 
0 
Q4) 


where F(u, v)=(u—v)/log(u/v) for u 0, v — 0, 
uz: v, F(u,u)=u for u»0, F(u,0)= F(0,v) =0. 

If the interaction is translationally invariant 
(De, y, (I) = (I +a) for all ae Z” and J) and if 
we restrict our attention to translationally 
invariant states De, p(y,(A)) = 9(A) for all 
Ae and ae Z`), then the following conditions 
are also equivalent to the above. 

6. Variational principle: Deia) — stolz fely) 
— s(y)) for all translationally invariant y (the 
minimality of the mean free energy), where 
e(o) — lim N(A) ! e(U(A)) = lim N(A) ! e(H(A)) 
(the mean energy), s(~) = lim N(A) ! S,(o) (the 
mean entropy), the infimum value fle(q) — sto) is 
— P(B®) with P(ffb) «lim N(A)~! logz,(e P")) 
(the pressure), and the limits exist if A 7 Z” is 
taken in the following van Hove sense: For 
any given cube C of lattice points, the mini- 
mal number n; (C) of translations of C that 
cover A and the maximal number n, (C) of 
mutually disjoint translations of C in A satisfy 
na (Cyn, (C)>1 as A 7 Z”. 

7. Tangent to the pressure function: P(®) 
is a continuous convex function on the Ba- 
nach space of translationally invariant with 
ID || < oo. A continuous linear functional « on 
this Banach space is a tangent to P at ® if 
Pi ¥)> P(D)+ x(V) for all V. For a trans- 
lationally invariant state y, we define a, (V) — 
V(X,o N(I) ! W(I)) The condition is that 
— 9, Is a tangent to P at Po. (Conversely, any 
tangent a to P at fi arises in this manner.) 

The set Ky of all (normalized) -KMS states 
is nonempty, compact, and convex. A fj-K MS 
state o is an extremal point of K, if and only if 
it is factorial (1.e., the associated von Neumann 
algebra nz, (9[)" has a trivial center). It then has 
the clustering property Im... {@(Ay,(B))— 
mt Amt BI) =0 and ts interpreted as a pure 
phase. Any f-K MS state has a unique integral 
decomposition into extremal fj-K MS states. 

For any ®, K, is a one-point set for suffi- 
ciently small |f|. For a 1-dimensional system 
(v— 1), Ky consists of only one point (unique- 
ness of equilibrium states usually interpreted 
as indication of no phase transition) if the sur- 
face energy W([ — N, N]) is uniformly bounded 
(H. Araki, Comm. Math. Phys., 44 (1975); A. 
Kishimoto, Comm. Math. Phys., 47 (1976)). For 
the two-body interaction ®({m,n})= —J|m— 
n| 20" oa, this condition is satisfied if x>2 
while Il < oo and o, defined if «> 1. There is 
more than one KMS state (with spontaneous 
magnetization) for 22 x 1 and large HI >0, 
and hence a phase transition exists (F. J. 
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Dyson, Comm. Math. Phys., 12 (1969); J. Fróh- 
lich and T. Spencer, Comm. Math. Phys., 83 
(1982)). If a 1-dimensional interaction has a 
finite range (e, (7) 20 if the diameter of 7 
exceeds some number rj) or if it is classical 
and Èo N(I) ! (diam I + 1)||®(D)|| < co for 
d=2, then g(A) for pe Ky and AEWA) for 
a finite A is real analytic in ff and any other 
analytic parameter in the potential (Araki, 
Comm. Math. Phys., 14 (1969); [22]; M. Cas- 
sandro and E. Olivieri, Comm. Math. Phys., 
80 (1981)). 

For a 2-dimensional Ising model with the 
nearest-neighbor ferromagnetic interaction 
[23], K, consists of only one point for 0< f < 
B. while K, for f — f, has exactly two extremal 
points corresponding to positive and negative 
magnetizations (M. Aizenman, Comm. Math. 
Phys., 73 (1980); Y. Higuchi, Colloquia Math. 
Soc. János Bolyai, 27 (1979)). In this case, all 
KMS states are translationally invariant, 
while there exist (infinitely many) translation- 
ally noninvariant KMS states for sufficiently 
large f tf v 3 (Dobrushin, Theory Prob. Appl., 
17 (1972); H. van Beijeren, Comm. Math. Phys., 
40 (1975)). 

The accumulation points of f-K MS states 
as fj +00 (or —%) provide examples of 
ground (or ceiling) states defined by any one of 
the following mutually equivalent conditions 
[,, 2, (or 1,2.) (O. Bratteli, A. Kishimoto, 
and D. W. Robinson, Comm. Math. Phys., 64 
(1978)): 

1, (1_). Positivity (negativity) of energy: For 
any Ael JAWA), —ip(A*d,(A)) is real and 
positive (negative). 

2, (2). Local minimality (maximality) of 
energy: For any finite subset A of Z” and for 
any state y with the same restriction to W(A‘) 
as the state o under consideration, o(H(A)) 
W(H(A)) tot HAUS V(H(A))). 

For translationally invariant potentials and 
states, the following condition is also equiva- 
lent to the above: 

3, (3.). Global minimality (maximality) of 
energy: e(~) € e(V) (e(q) > e(w)) for all trans- 
lationally invariant states wy. 

The totality of KMS, ground, and ceiling 
states can be characterized by the follow- 
ing formulation of the impossibility of per- 
petual motion: Let P, = P* €% be a norm- 
differentiable function of the time te R with a 
compact support, representing (external) time- 
dependent perturbations. Then there exists a 
unique perturbed time evolution a7 as a one- 
parameter family of *-automorphisms of 9I 
satisfying (d/dt)aP (A) =a) (65(A) - iLB,, A]) 
for all AeA in the domain of dg. A state o 
changes with time t as q,(A)- mit (4)) under 
the perturbed dynamics xë, and the total 
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energy given to the system (mechanical work 
performed by the external forces) is given by 
L? (o) 2 9, 9,(dP,/dt)dt. For KMS states at 
any fj, as well as ground and ceiling states, 
L*(9) 20 for any P If ọ is a factor state, the 
converse holds, i.e., L^(o) z 0 for all P, implies 
that o is either a KMS, ground, or ceiling 
state. The condition Li) 20 for all P, is 
equivalent to — ig(U*ó$(U)) 20 for all unitary 
U in the domain of dg and in the identity 
component of the group of all unitaries of 9I. 
A state ọ satisfying this condition is called 
passive, and a state p whose n-fold product 
with itself as a state on W®” is passive relative 
to x?" for all n is called completely passive. 
The last property holds if and only if g is a 
KMS, ground, or ceiling state (W. Pusz and 
S. L. Woronowicz, Comm. Math. Phys., 16 
(1970)). 

The totality of KMS, ground, and ceiling 
states can be characterized by a certain sta- 
bility under perturbations (P, considered 
above) under some additional condition on 
x, (R. Haag, D. Kastler, and E. B. Trych- 
Pohlmeyer, Comm. Math. Phys., 38 (1974); O. 
Bratteli, A. Kishimoto, and D. W. Robinson, 
Comm. Math. Phys., 61 (1978)). 

When a lattice spin system is interpreted as 
a lattice gas, an operator Nell (such as (of? 
+ 1/2) is interpreted as the particle number 
at the lattice site n and N(A) — X,,.4 N, is the 
particle number in A. It defines a representa- 
tion of a unit circle T by automorphisms tg 
of 9I defined as tọ( A) = lim e/*9 Ae "NA 
(A 7 £"), called gauge transformations (of the 
first kind). The grand canonical ensemble can 
be formulated as a fj-KMS state with respect 
to zt (instead of xj), where the real constant 
u is called the chemical potential. It can be 
interpreted as an equilibrium state when the 
gauge-invariant elements {Ae Y|t(A)= A} 
instead of I are taken to be the algebra of 
observables or as a state stable under those 
perturbations that do not change the particle 
number. 


H. The Boltzmann Equation 


Statistical mechanics of irreversible processes 
originated from the kinetic theory of gases. 
Long ago, Maxwell and Boltzmann tried to 
calculate viscosity and other physical quanti- 
ties characterizing gaseous flow in nonequilib- 
rium. The *Boltzmann equation is generally a 
nonlinear *integrodifferential equation. On the 
basis of this equation mathematical theories 
were developed by D. Enskog, S. Chapman, 
and D. Hilbert [2]. 

Free electrons in a metal can be regarded 
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as forming an electron gas, in which electron 
scattering by lattice vibrations or by impurities 
is more important than electron-electron scat- 
tering. Following the example of gas theories 
H. A. Lorentz set forth a simple theory of 
irreversible processes of metallic electrons. 
His theory was, however, not quite correct, 
since metallic electrons are highly quantum- 
mechanical and classical theories cannot 

be applied to them. Quantum-mechanical 
theories of metal electrons were developed by 
A. Sommerfeld and F. Bloch. 


L Master Equations 


The Boltzmann equation gives the velocity 
distribution function of a single particle in the 
system. This line of approach can be extended 
in two directions. The first is the so-called 
master equation. For example, consider a 
gaseous system consisting of N particles, and 
ask for the probability distribution of all the 
momenta, namely, the distribution function 
Iw(P1.---»Pyit), where p,,..., py are the mo- 
menta of the N particles. The equations of 
motion are deterministic with respect to the 
complete set of dynamical variables (x,, p,, ..., 
Xy, Py). The equation for f(p,, ..., py, t) may 
not be deterministic, but it may be stochastic 
because we are concerned only with the vari- 
ables p,, ..., py, with all information about 
the space coordinates x,,..., xy disregarded. 
This situation is essentially the same in both 
classical and quantum statistical mechanics. 
If the duration of the observation process is 
limited to a finite length of time and the preci- 
sion of the observation to a certain degree of 
crudeness, the time evolution of the momen- 
tum distribution function f, can be regarded 
as a ' Markov process. In general. an equation 
describing a Markov process of a certain dis- 
tribution function is called a master equation. 
Typically it takes the following form for a 
suitable choice of variables x: 


(0/0t)f (x, t) 
= | deis, x) f(x, t) — Welte (25) 


where W(x, x’) is the transition probability 
from x to x’. By expanding the first integrand 
into a power series in x — x’, with x’ fixed and 
by retaining the first few terms, we obtain the 
Fokker-Planck equation: 


(6/0t)f (x, e —(€/0x) (a, (x) f(x, 9) 
+(6°/0x?)(a2(x)f(x.0)/2, 6) 


a, (x)= | W(x, x 4 r)r" dr. (27) 
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J. The Hierarchy of Particle Distribution 
Functions 


Another way of extending the Boltzmann 
equation is to consider a set of distribution 
functions of one particle, of two particles, and 
generally of n (< N) particles selected from the 
whole system of N particles. For example, a 
two-particle distribution function is the func- 
tion f;(x,, V1, X5, Da, t) for positions and veloc- 
ities of two particles at time t. The complete 
dynamics of the entire system of particles 
can be projected to the time evolution of this 
hierarchy of distribution functions. The equa- 
tion of motion for f, then contains the func- 
tion f, if the interaction of particles if pairwise, 
the equation for f; contains f}, and so on. 
Thus the equations of motion for the set of 
distribution functions make a chain of equa- 
tions. The whole chain is equivalent to the 
deterministic equations of motion for the 
dynamics of N particles. However, if the parti- 
cle number N is made indefinitely large, with 
the time scale of observation always finite, the 
chain of equations for the distribution func- 
tions asymptotically approaches a stochastic 
process if certain conditions are satisfied. 
Approximate methods of solving the hierarchy 
equations in classical cases have been devel- 
oped by J. Yvon, J. G. Kirkwood, M. Born, 
H. S. Green, and others. 

In quantum statistics, similar hierarchy 
equations can be considered. A typical exam- 
ple is the so-called Green's function method 


[27]. 


K. Irreversible Processes and Stochastic 
Processes 


The statistical mechanics of physical processes 
evolving in time is a hybrid of dynamics and 
the mathematical theory of stochastic pro- 
cesses. A typical example is the theory of 


b 
Brownian motion. A colloidal particle floating 


in a liquid moves incessantly and irregularly 
because of thermal agitation from surrounding 
liquid molecules. For simplicity, an example of 
1-dimensional Brownian motion is considered 
here. Phenomenologically we assume that a 
colloid particle follows an equation of motion 
of the form 


mů= — myu + f(t), (28) 


called the Langevin equation, where m is the 
mass of the colloid particle and u is the veloc- 
ity. The first term on the right-hand side is the 
friction force due to viscous resistance, and the 
second term represents a random force acting 
on the particle from surrounding molecules. 

If (28) describes the Brownian motion in 
thermal equilibrium, the friction constant my 
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and the random force cannot be independent, 
but are related by a theorem asserting that 


m= mr, +> at. (29) 


In an electric conductor, the thermal motion of 
charge carriers necessarily induces irregular- 
ities of charge distribution, and so an electro- 
motive force that varies in time in a random 
manner is created. This random electromotive 
force is similar to the random motion of a 
Brownian particle and ts called the thermal 
noise. For such a thermal noise there exists a 
relation similar to (29) between the resistance 
and the random electromotive force. This 
relation is known as the Nyquist theorem. 
These theorems are contained in a more gen- 
eral theorem called the fluctuation-dissipation 
theorem. 

When an external force is applied to a 
system in thermal equilibrium, some kind 
of irreversible flow, an electric current, for 
example, is induced in the system. The rela- 
tionship between the flow and the external 
force is generally represented by an admit- 
tance. If the external force is periodic, the 
admittance is a function of frequency œw and 
is given by 


TE f e" i(odt, (30) 
0 


where y(t) is equal to the correlation function 
of the flow that appears spontaneously as the 
fluctuation in thermal equilibrium when no 
external force is applied. This general ex- 
pression for an admittance, often called the 
Kubo formula, gives a unified viewpoint from 
which responses of physical systems to weak 
external disturbances can be treated without 
recourse to the traditional kinetic approach. 

The static limit (co —0) of the admittance is 
the transport coefficient. The reversibility of 
dynamics leads to relations among transport 
coefficients, called Onsager's reciprocity rela- 
tions in the thermodynamics of irreversible 
processes. 

When external disturbances are so large that 
the system deviates considerably from thermal 
equilibrium, the responses may show character- 
istic nonlinearities. Such nonlinear phenomena 
are important from both experimental and 
theoretical points of view, and constitute a 
central subject of modern research (— 433 
Turbulence and Chaos). 
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A. General Remarks 


A statistical model is defined by specifying the 
structure of the probability distributions of the 
relevant quantities. When a statistical model is 
used for the analysis of a set of data, its role is 
to measure the characteristics of a certain con- 
figuration of the data points. R. A. Fisher [1] 
advanced a systematic procedure for the appli- 
cation of statistical models. The process of 
statistical inference contemplated by Fisher 
may be characterized by the following three 
phases: (1) specification of the model, (2) esti- 
mation of the unknown parameters, and (3) 
testing the goodness of fit. The last phase is 
followed by the first when the result of the 
testing is negative. Thus the statistical in- 
ference contemplated by Fisher is realized 
through the process of introduction and selec- 
tion of statistical models. 

We always assume that the true distribution 
of an observation exists in each particular 
application of statistical inference, even though 
it may not be precisely known to us. Our par- 
tial knowledge of the generating mechanism of 
the observation suggests various possible con- 
straints on the form of the true distribution. 
The basic problem of statistical inference is 
then to generate an approximation to the true 
distribution by using the available obser- 
vational data and a model defined by a set 
of probability distributions satisfying the 
constraints. 


B. The Criterion of Fit 


The use of statistical models can best be ex- 
plained by adopting the predictive point of 
view, which defines the objective of statistical 
inference as the determination of the predictive 
distribution, the probability distribution of a 
future observation defined as a function of the 
information available at present. The perfor- 
mance of a statistical inference procedure is 
then evaluated in terms of the expected dis- 
crepancy of the predictive distribution from 
the true distribution of the future observation. 
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The probabilistic interpretation of thermo- 
dynamic entropy developed by L. Boltzmann 
[2] provides a natural measure of the discre- 
pancy between two probability distributions. 
The entropy of a distribution specified by the 
density f(y) with respect to the distribution 
specified by g( y) is defined by 


B(f;g =- [f d lanas 
g(y) 


where, as in what follows, the integral is taken 
with respect to some appropriate measure dy. 
This definition of entropy is a faithful repro- 
duction of the original probabilistic interpre- 
tation of the thermodynamic entropy by Boltz- 
mann and allows the interpretation that the 
entropy B( f ;g) is proportional to the logar- 
ithm of the probability of getting a statistical 
distribution of observations closely approxi- 
mating f(y) by taking a large number of inde- 
pendent observations from the distribution 
g( y). (For a detailed discussion — [3].) 
Obviously we have 


B(f a- |; (y)ogg(y)dy — | I(y)log f(y) dy 


The second term on the right-hand side is a 
constant depending only on f(y). The first 
term is the expected log likelihood of the dis- 
tribution g(y) with respect to the true distri- 
bution f(y). Thus a distribution with a larger 
value of the expected log likelihood provides a 
better approximation to the true distribution. 
Even when f(y) is unknown, logg(y) provides 
an unbiased estimate of the expected log like- 
lihood. This fact constitutes the basis of the 
objectivity of the tlikelthood as a criterion for 
judging the goodness of a distribution as an 
approximation to the true distribution. 


C. Parametrization of Probability 
Distributions 


When we construct a statistical model it is a 
common practice to represent the uncertain 
aspect of the true distribution by a family of 
probability distributions with unknown para- 
meters. This type of family is called a para- 
metric family; the model is called a parametric 
model. The parameters in a parametric model 
are the keys to the realization of the infor- 
mation extraction from data by statistical 
methods. Accordingly, the introduction of 
mathematically manageable parametric 
models forms the basis for the advance of 
statistical methods. 


(1) Pearson’s System of Distributions. A wide 
family of distributions can be generated by 
assuming a rational-function representation of 
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the sensitivity of the density function f(y) 
given by 


dota, y... ta, y" 


d 
E 
dy 98/0) bo tbyy+...+b,y4 


Pearson’s system of distributions is defined by 
putting p=1 and q—2 and by assuming vari- 
ous constraints on the parameters a; and b; 
and the support of f( y) [4]. 

E. Wong [5] discussed the construction of 
continuous-time stationary Markov processes 
with the distributions of Pearson's system as 
their stationary distributions. This allows a 
structural interpretation of the parameters of a 
distribution of the system. 


(2) Maximum Entropy Principle and the Ex- 
ponential Family. To develop a formal theory 
of statistical mechanics E. T. Janes [6] intro- 
duced the concept of the maximum entropy 
estimate of a probability distribution. This 
concept leads to a natural introduction of the 
exponential family. Following Kullback [7], 
we start with a distribution g(y) and try to find 
f(y) with prescribed expectations of statistics 
T,()), .... T, (y) and with maximum entropy 
B(f;g). Such a distribution f(y) is given by the 
relation 


f(y) x exp[1; Ti (y) - ...- « T, (y) ]9Cy). 


where it is assumed that the right-hand side is 
integrable. By varying the parameters 1,, ..., t, 
over the allowable range we get the exponen- 
tial family of distributions. I. J. Good [8] con- 
sidered the Janes procedure as a principle for 
the generation of statistical hypotheses and 
called it the maximum entropy principle. 


(3) Parametric Models of Normal Distribu- 
tions. Of particular interest within the ex- 
ponential family is the family of normal distri- 
butions. This is obtained by assuming the 
knowledge of the first- and second-order mo- 
ments of a distribution and applying the maxi- 
mum entropy principle [9]. Obviously, the 
parametrization of a normal distribution is 
concerned only with the mean vector and the 
variance-covariance matrix. 

Let X -(X,, ..., X,)' be an n-dimensional 
normal random variable with mean EX — 
(m,, ...,m,) and variance-covariance matrix 
Z — (cj, where o; 2 E(X; —m)(X; —mj). (E 
denotes expectation and ' denotes the trans- 
pose.) A nonrestrictive family of n-dimensional 
normal distributions is characterized by n 
n(n + 1)/2 parameters, m; (i= 1, ..., n), and oj 
(i 1,..., mj L ..., n). The prior information 
on the generating mechanism of X introduces 
constraints on these parameters and reduces 
the number of free parameters. 

Reduction of the dimensionality of the 
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parameter vector m — (m,, ...,m,)' is realized 
by assuming that m is an element of a k- 
dimensional subspace of R" spanned by the 
vectors a, —(4,,, ..., Ain) ..., ay m ld, Akn)» 
i.e., by assuming the relation 


m- Ac, 
where A =(a,,...,4,) is an n x k matrix and c— 
(c,, ..., CN is a k-dimensional vector with k< 


n. This parametrization is obtained when for 
each X; the observation (a;,,...,a5,) is made on 
a set of k factors and the analysis of the linear 
effect of these factors on the mean of X; is 
required. We have the representation X = Ac 
+W, where W is an n-dimensional normal 
random vector with EW =0 and variance- 
covariance matrix Z (— Section D). 

To complete the model we have to specify 
the variance-covariance matrix 2. One of the 
simplest possible specifications is obtained by 
assuming that the X; are mutually indepen- 
dent and of the same variance o". This reduces 
X to g?l, where I denotes an n x n identity 
matrix. With this assumption the number 
of necessary parameters to represent the 
variance-covariance matrix reduces from n(n 
+1)/2 to 1. The model obtained with the 
assumptions X —6?1] and m= Ac is called the 
general linear model (or regression model) 
with normal error, or simply the normal linear 
model. The model is called a regression model 
also when the a; are random variables (— e.g., 
[10, 11]). 

A typical example of nontrivial parametri- 
zation of the covariance structure of X is ob- 
tained by assuming the representation 


X=m+AF+W, 


where F —( f,, ..., LI denotes the vector of 
random effects and W the vector of measure- 
ment errors. It is assumed that F and W are 
mutually independent and normally distri- 
buted with EF =0 and EW — Also the com- 
ponents of W are assumed to be mutually 
independent. The variance-covariance matrix 
X of X is then given by 


X -—AQOA'-A, 


where ®= EFF’ and A= EWW', which is 
diagonal. 

When A is a design matrix, the parametri- 
zation provides a components-of-variance 
model (or random-effects model) of which the 
main use is the measurement of the variance- 
covariance matrix of the random effects 
fi, f, rather than the measurement of F 
itself. If we consider F to be representing the 
effects of some latent factors for which A is not 
uniquely specified, the above representation 
of X gives merely a formal, or noncausal, 
parametrization of X. In this case the model is 
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called the factor analysis model and the dimen- 
sion g of F is called the number of factors. By 
keeping the number of factors sufficiently 
smaller than the dimension of X, we get a 
parametrization of X with a smaller number of 
free parameters than the unconstrained model. 
Starting with g = 1 and successively increasing 
the number of factors, we can get a hierarchy of 
models with successively increasing numbers 
of parameters. (— [12] for a very general 
modeling of the variance-covariance matrix.) 


(4) Parametrization of Discrete Distribution. 
Consider the situation where the observation 
produces one of the events represented by r= 
0, 1,2, ..., k with probability p(r), where k 
may be infinite. Represent by X =(X,,..., Xn) 
the result of n independent observations. The 
probability p(X) of getting such a result is 
given by the relation 


k 
logp(X)— H 0,n,{X), 


where 0, 2log p(r) and n,(X) denotes the num- 
ber of X;s which are equal to r. (The term not 
depending on the 0,’s is omitted in the above 
and subsequent formulas, since it is immaterial 
for problems of inference.) Thus a nonrestric- 
tive model is obtained by assuming only the 
relations 0, «0 and XE. e* — 1. Obviously the 
model defines an exponential family and vari- 
ous useful parametrizations are realized by 
introducing some constraints on the para- 
meters H. 

When the events r are arranged in a 2- 
dimensional array (i,j) (i2 1, ...,m;j — 1, ...,n) 
we have 


One simple parametrization is given by 
0j — n 6 + P+ Yi. 


where it is assumed that 27, à; = 2j-, f= 
ZP, äus Lj=1 Y= 0. Obviously this is a pa- 
rametrization of 0 as a linear function of 
the parameters a;, fj, and 3, and the model 
thus obtained is called the log linear model. 
The model shows a formal similarity to the 
analysis-of-variance model (— Section D). By 
introducing successively more restrictive as- 
sumptions on the parameters, we can get a 
hierarchy of models for the analysis of a two- 
way contingency table. Extension to cases 
when more than two factors are involved is 
obvious (— e.g., [13]). 

Here we consider that X; is a dichotomous 
variable, i.e., k= 1, and that the probability of 
X;— 1 may depend on i, i.e., we have 


log p(X;) 2 nbo: + (0, ; — Dan: (LE 
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where 0,5, —log Prob(X; 2 0) and 0,;= 

log Prob(X; — 1). We assume that a vector of 
observations a; —(a;,, ...,a;,) is available 
simultaneously with X; and that we are inter- 
ested in analyzing the relation between a; and 
the probability distribution of X;. The analysis 
is realized by exploring the functional relation 
between 1, — 0,, — 05, and a;. We can assume a 
linear relation 


T= AC, 


where t=(t,,...,7,), A=(a1,--.,a,), and c= 
(ci, ..., Cell. The parameter 7; =log{ p(X;= 
1)/(1 — p(X; = 1))} is the log odds ratio or logit 
of the event X;— 1, and the model is called 
the linear logistic model [ 14]. A hierarchy 

of models can be generated by assuming a 
successively more restrictive linear relations 
among the components of C. 


D. General Linear Models 


Another class of models often used in practical 
applications is composed of general linear 
models or linear regression models, where the 
observed value is considered to be the sum of 
the effects of some fixed causes and the error. 
Let X z(X;, ..., X,)' be an n-dimensional 
trandom variable, and denote the expectation 
of X by E(X) 2 (u,, ..., He. If E(X) is of the 
form Aé with an unknown parameter é = 


(CN, and a given n x k matrix A, then 
we can express X as 
X=AE+W, E(W)=(0,...,0), (1) 


with the error term W=(W,,..., W,y. We 
frequently assume a set of conditions on the 
distribution of X; for example, (1) X,, ..., X, 
are mutually "independent, (ii) X,, ..., X, 
have a common unknown 'variance c?, (iii) 
(X,, ..., X,) is distributed according to an n- 
dimensional tnormal distribution. The equa- 
tions (1) together with conditions on the 
distribution are called a linear model. 

Among the methods of statistical analysis of 
linear models are regression analysis, analysis 
of variance, and analysis of covariance as 
explained below, but these are not clearly 
distinguished from each other. (I) In design-of- 
experiment analysis, i.e., analysis of variance, 
the matrix A and the vector č in (1) are called 
a design matrix and an effect, respectively. In 
this case entries of A are assumed to be either 
1 or O. (II) In regression analysis, we are first 
given a linear form x = Lu aj£; of a vector 
a —(a,, ...,a,) with coefficient vector £— 

(čis -3 £j). Let X,,..., X, be the observed 
values of x at n points a, —(d,,, ..., di, ....à, 
—(a,,, -s ang), respectively, where n» k. If the 
observations are unbiased, that is, if E(X) = 
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Èt ajén i=1,...,n, then the model (1) is 
obtained with A = (a;;). Usually one of the 
components of the vector a is taken as unity. 
In this framework the form x= 2;a;4; is called 
the linear regression function or regression 
hyperplane, and for k=2, the graph of the 
linear function x — a, č, + £j and its coefficient 
č, are called the regression line and regression 
coefficient, respectively. The components of 
the vector a are called fixed variates or ex- 
planatory variables. Frequently we encounter 
the case where the vector a, and consequently 
the matrix A, are random variables. When this 
is the case, a discussion like that above can be 
carried out for given A by regarding Aé in (1) 
as the conditional expectation of the vector X. 


E. The Method of Least Squares 


Consider the subspace L(A) of the tsample 
space R" spanned by the column vectors of A. 
Then the dimension s of L(A) equals the rank 
of A, and L(A) and its torthocomplement 

L^ (A) are called the estimation space and the 
error space, respectively. The torthogonal 
projection y of a point x to the space L(A) is 
expressed as y= P,x with a real tprojection 
matrix P,. The variable Y= P,X is called the 
least squares estimator of E(X), and the rou- 
tine of getting such an estimator Y, called the 
method of least squares, minimizes the squared 
error (X — Ač} (X — Ač) for a given X. This 
method consists of two operations solving the 
normal equation A' AC = A'X with respect to é, 
and setting Y = Aé, where ë is a solution of the 
equation. For s =k, we obtain Y = A(4'A) ! A'X 
directly. Even when s « k, where the solution 
of the normal equation is not unique, Y is 
uniquely determined. The quantity Q = X'(I — 
P,)X, where I is the unit n x n matrix, is the 
squared distance of the point X from the space 
L(A) and is called the error sum of squares 
with n—s degrees of freedom. 

A linear function BE of the parameter ë with 
coefficient vector B=(f,,..., DI is called a 
linearly estimable parameter (or estimable 
parameter) if there is a linear unbiased esti- 
mator, that is, an unbiased estimator of the 
form b'X, of f'E. In order that 8'E be estimable 
it is necessary and sufficient that $’ be a linear 
combination u' A of the row vectors of the 
matrix A. A linear unbiased estimator that has 
minimum variance among all linear unbiased 
estimator uniformly in £ is called the best 
linear unbiased estimator (b.l.u.e.). If the con- 
ditions (i) and (ii) of Section D are satisfied, 
then for any given n-vector u the b.l.u.e. of a 
parameter y =w AC is given by j=u'Y with 
Y = P,X, and its variance equals (u'P,u)o?, 
while the expectation of the quantity Q is 
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given by (n— s)o?. This proposition is known 
as the Gauss-Markov theorem. Hence the 
blue ? — u'Y is frequently cited as the least 
squares estimator of y. The quantity ó? = 
Q/(n — s) is an unbiased estimator of o? and is 
called the mean square error. If in addition the 
condition (iii) of Section D is assumed, then ? 
and 6? are the *uniformly minimum variance 
unbiased estimators of y and o?, respectively. 
When the error term W in (1) has covar- 
iance matrix X —67?X, with an unknown real 
parameter o? and a known matrix Xo, the 
valaue of the parameter € minimizing Q = 
(X — AE Zo (X — Aë) is called the generalized 
least squares estimator of č if it exists. This 
estimator has properties similar to those of the 
least squares estimator. 


F. Model Selection and the Method of 
Maximum Likelihood 


When a parametric family of distributions 
{ f(-|0); 0€] is given and an observation x 
is made, log f(x |0) provides an unbiased esti- 
mate of the expected log likelihood of the 
distribution f(-|0) with respect to the true 
distribution of the observation. The value of 0 
which maximizes this estimate is the maximum 
likelihood estimate of the parameter and is 
denoted by 0(x) (— 399 Statistical Estimation). 
In a practical application we often have to 
consider a multiple model, defined by a set 
of component models { f;(-| 0); 0,6 6, (i= 
|, ...,K). The problem of model selection is 
concerned with the selection of a component 
model from a multiple model. The difference 
of the difficulties of handling a simple model 
defined by a one-component model, and a 
multiple model is quite significant. For a sim- 
ple model { f(-| 6); 0€ ©}, each member of the 
family is a probability distribution. In the case 
of a multiple model, its member is a model 
which is simply a collection of distributions 
and does not uniquely specify a probabilistic 
structure for the generation of an observation. 
Thus the likelihood of each component model 
with respect to the observation x cannot be 
defined and the direct extension of the method 
of maximum likelihood to the problem of 
model selection is impossible. This constitutes 
a serious difficulty for the handling of multiple 
models. Apparently, Fisher used the proce- 
dure of testing to solve this difficulty. 


(1) Analysis of log Likelihood Ratios. The 
procedure of model selection by testing, which 
is applicable to a wide class of models, is the 
method of analysis of log likelihood ratios 
[15]. Consider the situation where a model 

is to be determined by using a hierarchy of 


1524 


models { f(-|0;); 0;¢0;} (i— 1, ..., k) such that 
0,<0,c...< @0,. The comparison of models 
is then realized through the comparison of the 
maximum likelihoods f(x |0;(x)), where 0,(x) 
denotes the maximum likelihood estimate of 
0; based on the data x. For 9, c ©, the log 
likelihood ratio is defined by 


^(0;/0; x)= —2logi f(x] 6G)/f(x | 0/3))j. 


The analysis of log likelihood ratios is realized 
by the decomposition 


^(0,/0,; x) 2 A(0,/0,; x) + A(O,/O3;x)+... 
+ A(0,..,/0,; x). 


The log likelihood ratios A(0,..,/0,; x), 
A(0,.,/0, ,; x), ..., A(Q,/O,; x) are succes- 
sively tested by referring to chi-square distri- 
butions with the degrees of freedom d(k) — 
d(k — 1), d(k — 1) - d(k — 2), ...,d(22 — d(1), 
respectively, where d(i) denotes the dimension 
of the manifold ©;. The assumption of the chi- 
square distributions is only asymptotically 
valid under the usual regularity conditions (— 
400 Statistical Hypothesis Testing). The model 
defined with O; for which A(0,/0, ,; x) first 
becomes significant is selected and f(y|0,(x)) is 
accepted as the predictive distribution. The 
problem of how to choose the levels of sig- 
nificance to make the test procedure a proce- 
dure for model selection remains open. 


(2) Model selection by AIC. One way out of 
the difficulty of model selection is to assume 

a prior distribution over O; for each model 

{ f:(-| 0); 0;€O;}. This leads to Bayesian model- 
ing, which is discussed in Section G. Another 
possibility is to replace each component model 
Lf C10):0,69,] by a distribution f;(:|0,(x)) 
specified by the maximum-likelihood estimate 
0;(x). The problem here is how to define the 
likelihood of each distribution f;(-|6;(x)). An 
information criterion AIC was introduced by 
H. Akaike [16] for this purpose; it is defined 
by 


AIC =(—2)log,(maximum likelihood) 
+2 (number of estimated parameters). 


We may consider —0.5 AIC to be the log 
“likelihood” of f(-|0(x)) which is corrected for 
its bias as an estimate of E, E, log f(y|0(x)), 
where E, denotes the expectation with respect 
to the true distribution of x, and where it is 
assumed that x and y are independent and 
identically distributed. The maximum “likeli- 
hood" estimate of the model is then defined by 
the model with minimum AIC. This realizes a 
procedure of model selection that avoids the 
ambiguity of the testing procedure. It is appli- 
cable, at least formally, even to the case of a 
nonhierarchical set of models. 
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G. Bayesian Models 


Consider the situation where an observation x 
is made and it is desired to produce an esti- 
mate p( y|x) of the true distribution of a future 
Observation v. p(ylx) is called a predictive 
distribution. Assume that x and y are sampled 
from one of the distributions within the family 
{g(v | O)f(x| 0); 0€ O}, ie., x and y are stochasti- 
cally independent but share common struc- 
tural information represented by 0. As a design 
criterion of p( y| x) we assume a probability 
distribution z(0) of 0. The model { f(:|050€ 9j 
with z(0) is called a Bayesian model and z (0) is 
called the prior distribution. From the relation 


E, E, Eya log ply | x)= E.E--Jog p(y | x), 


where E,), denotes the expectation of y con- 
ditional on x and E, the expectation with 
respect to the marginal distribution of x, it 
can be seen that the optimal choice of p(y|x) 
which maximizes the expected log likelihood is 
given by the conditional distribution 


poyi) = fav 0)p(0] x) dé, 


where p(8| x) ts the posterior distribution of 0 
defined by 


ptt) fest Justen! | 


When a prior distribution z(0) is specified, 
the parametric family of distributions { f(-| 0); 
0€ O3 is converted into a stochastic structure 
which specifies a probability distribution of 
the observations. The likelihood of the struc- 
ture, or the Bayesian model, with respect to 
an observation x is defined by 


[re | 0)n(0) dO. 


When there is uncertainty about the choice of 
the prior distribution we can consider a set of 
possible prior distributions and apply the 
method of maximum likelihood. Such a proce- 
dure is called the method of type II maximum 
likelihood by I. J. Good [17]. For a multiple 
model { f;(-|0);0;€O;} (i= 1, ..., k), if prior 
distributions z,(0;) are defined, a model selec- 
tion procedure is realized by selecting the 
Bayesian model with maximum likelihood. 

Bayesian modeling has often been consid- 
ered as not quite suitable for scientific appli- 
cations unless the prior distribution is objec- 
tively defined. However, even the construction 
of an ordinary statistical model is always 
heavily dependent on our subjective judgment. 
Once the objective nature of the likelihood of 
a Bayesian model is recognized, the selection 
or determination of a Bayesian model can 
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proceed completely analogously to Fisher’s 
scheme of statistical inference (— e.g., [18]). 
The basic underlying idea of both the mini- 
mum AIC procedure and Bayesian modeling is 
the balancing of the complexity of the model 
against the amount of information available 
from the data. This unifying view of the con- 
struction of statistical models is obtained by 
the introduction of entropy as the criterion for 
judging the goodness of fit of a statistical 
model (— [19] for more details). 


References 


[1] R. A. Fisher, Statistical methods for re- 
search workers, Oliver & Boyd, 1925; Hafner, 
fourteenth edition, 1970. 

[2] L. Boltzmann, Über die Beziehung zwi- 
schen dem Zweiten Hauptsatze der mechani- 
schen Warmetheorie und der Wahrscheinlich- 
keitsrechnung respektive den Sátzen über das 
Warmegleichgewicht, Wiener Berichte, 76 
(1877), 373-435. 

[3] I. N. Sanov, On the probability of large 
deviations of random variables, IMS and AMS 
Selected Transl. Math. Statist. Prob., 1 (1961), 
213—244. (Original in Russian, 1957.) 

[4] K. Pearson, Contributions to the mathe- 
matical theory of evolution H, Skew vari- 
ation in homogeneous material, Philos. Trans. 
Roy. Soc. London, ser. A, 186 (1895), 343—414. 
Also included in Karl Pearson's early statis- 
tical papers, Cambridge Univ. Press, 1948, 41— 
112. 

[5] E. Wong, The construction of a class of 
stationary Markoff processes, Proc. Amer. 
Math. Soc. Symp. Appl. Math., 16 (1963), 264— 
276. Also included in A. H. Haddad (ed.), 
Nonlinear systems, Dowden, Hutchinson & 
Ross, 1975, 33-45. 

[6] E. T. Janes, Information theory and statis- 
tical mechanics, Phys. Rev., 106 (1957), 620— 
630. 

[7] S. Kullback, Information theory and statis- 
tics, Wiley, 1959 (Dover, 1967). 

[8] I. J. Good, Maximum entropy for hypoth- 
esis formulation, especially for multidimen- 
sional contingency tables, Ann. Math. Statist., 
34 (1963), 911—934. 

[9] C. E. Shannon and W. Weaver, The math- 
ematical theory of communication, Univ. of 
Illinois Press, 1949. 

[10] S. R. Searle, Linear models, Wiley, 1971. 
[11] F. A. Graybill, Theory and application of 
the linear model, Duxbury Press, 1976. 

[12] K. G. Jóreskog, A general method for 
analysis of covariance structures, Biometrika, 
57 (1970), 239—251. 

[13] Y. M. M. Bishop, S. E. Feinberg, and P. 
W. Holland, Discrete multivariate analysts: 
Theory and practice, MIT Press, 1975. 


404 A 
Statistical Quality Control 


[14] D. R. Cox, The analysis of binary data, 
Chapman & Hall, 1970. 

[15] I. J. Good, Comments on the paper by 
Professor Anscombe, J. Roy. Statist. Soc., ser. 
B, 29 (1967), 39—42. 

[16] H. Akaike, A new look at the statistical 
model identification, IEEE Trans. Automatic 
Control, AC-19 (1974), 716—723. 

[17] I. J. Good, The estimation of proba- 
bilities, MIT Press, 1965. 

[18] G. E. P. Box, Sampling and Bayes’ in- 
ference in scientific modeling and robustness, 
J. Roy. Statist. Soc., ser. A, 143 (1980), 383— 
430. 

[19] H. Akaike, A new look at the Bayes pro- 
cedure, Biometrika, 65 (1978), 53-59. 


404 (XVIII.14) 
Statistical Quality Control 


A. General Remarks 


According to the Japanese Industrial Standard 
(JIS) Z 8101, “Quality Control (QC) is a system 
comprising all the methods used in manufac- 
turing products or providing services econom- 
ically that meet the quality requirements of 
consumers.” To emphasize that modern qual- 
ity control makes use of statistical methods, 

it is sometimes referred to as Statistical Qual- 
ity Control (SQC). In order to implement 
effective QC, statistical concepts and methods 
must be applied and the “Plan-Do-Check- 
Action” (PDCA) cycle must be followed in 
research and development, design, procure- 
ment, production, sales, and so on. These QC 
activities are executed on a company- 

wide basis from the top management to the . 
production workers. This type of QC is called 
Company-Wide Quality Control (CWQC) or 
Total Quality Control (TQC). 

The quality Q is an abstract notion of the 
conformity of a product or service to con- 
sumers’ requirements; it also refers to the total 
of the characteristics of a product or service as 
perceived by consumers. The quality charac- 
teristics may include both measurable physical 
and/or chemical features, such as strength and 
purity, or features such as color or texture as 
appreciated by individuals. These latter char- 
acteristics could be called “consumer qual- 
ities.” Furthermore, the concept “quality” 
has also been used to describe the social im- 
pact of a product or service. This might be 
called “social quality.” Examples of social- 
quality issues are pollution by solid waste or 
drainage in the production stage; degradation, 
maintainability, and safety of a product in 
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daily use; and pollution following disposal. 
For a product or service to conform to this 
sort of quality it has become necessary to 
conduct QC activities not only during produc- 
tion but also at early stages of design and 
development of new products. 

The measured characteristics of quality 
vary from one product to another because of 
natural variability in the material and produc- 
tion process involved, ability of individual 
workers, errors in different sorts of measure- 
ment, etc. If the variations among the mea- 
sured values from a process can be attributed 
to “chance causes” and their distribution ex- 
pressed by a probability or a probability den- 
sity function, the process is said to be in a 
“state of statistical control” according to W. A. 
Shewhart or in a “stable state” by JIS. In this 
case the value of a characteristic is deemed to 
be the realization of a random variable X. 
Sometimes the variations are attributed to 
“assignable causes,” which must be identified 
and eliminated. 


B. Control Charts 


The control chart provides a means of evaluat- 
ing whether a process is in a stable state. 

The control chart is made by plotting points 
illustrating a statistic of the qualitv charac- 
teristics or manufacturing conditions for an 
ordered series of samples or subgroups. A 
sheet of the control chart is provided with a 
middle line between a pair of lines depicting 
the upper control limit (UCL) and the lower 
control limit (LCL). The stable state is as- 
sumed to be exhibited by points within the 
control limits. Points falling outside the con- 
trol limits suggest some assignable causes, 
which should be eliminated through corrective 
measures. 

The idea underlying control charts as devel- 
oped by Shewhart is to apply the statistical 
principle of significance to the control of pro- 
duction processes. Other types of control 
charts have also been developed, for example, 
acceptance control charts and adaptive control 
charts. These have been successfully applied 
to many quality control problems. 

The foundation of Shewhart's control chart 
is the division of observations into what are 
called "rational subgroups." A rational sub- 
group is the one within which variability is 
due only to chance causes. Between different 
subgroups, however, variations due to assign- 
able causes might be detected. In most pro- 
duction processes the rational subgroup com- 
prises the data collected over a short period 
of time during which essentially the same con- 
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dition in material, tool setting, environmental 
factors, etc., prevails. 

The limits on the control charts are placed, 
according to Shewhart, at a Ae distance from 
the middle line, where oc is the population 
standard deviation (or standard error) of the 
statistic; 3c expresses limits of variability 
within the subgroup. Assuming that the popu- 
lation distribution of an observed character- 
istic is "normal" the range between the limits 
should include 99.7% of the points plotted so 
long as the process is “in control" at the mid- 
dle value. Accordingly, 0.37; of the plotted 
points from the *in control" process fall out- 
side the limits, and thus give an erroneous 
"out of control" signal. 

To determine that the process is in con- 
trol for a normally distributed characteristic 
N(u,o?), we have to investigate the variability 
between the means p and the standard devi- 
ations c of different distributions of X for 
different subgroups. Thus the state of control 
of a process is determined with control charts 
for 


X-Y Xn and s= | Y (X, Xytn- 9, 
= a 


which are the appropriate statistics corre- 
sponding to u and c. Despite the theoretical 
drawback of the statistical range R = max; X; — 
min; X, against s, use of the range 1s often pre- 
ferred in QC work because of its simplicity 

in computation. Hence the X-R charts are 
obtained from the previously collected k ra- 
tional subgroups each of size n as follows: 





UCL=X+A,R, UCL=D,R, 


LCL=X—A,R. LCL=D,R, 


where X and R are the averages of the k values 
of X and R, respectively, and 





3 d d 
A, = , D,=143—, D,=1-3—, 
nd, d; d; 
E[R]=4d,0, V[R]-E[(R—E[R])] 
E 0°. 


For n « 7, LCL for R cannot be given because 
D, becomes negative. 

The other commonly used control charts are 
the p chart (proportion of nonconformity: 
binomial distribution) and the c chart (number 
of defects: Poisson distribution). For those 
charts the above theory of normal distribution 
is also used to approximate the binomial and 
Poisson values. 

It is generally sufficient to use the agreed-on 
decision criterion (3c limits) and to recognize a 
relatively small risk (x = 0.003) for practical 
purposes. It should be noted, however, that a 
shift of the process mean u by Lo would not be 
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Observed at a ratio of 97.7%, which is the value 
of the risk f for each plotted point, under the 
normal distribution of the plotted statistic. 
One reason why the control chart has been a 
practical tool in many applications is this lack 
of sensitivity to a relatively small shift of the 
level. If greater sensitivity is required, 2o limits, 
“warning limits" are used. This results in a 
greater risk « of erroneously finding a process 
out of control. 

Other decision criteria based on aspects of 
run theory are also used. Charts using ac- 
cumulated data from several rational sub- 
groups for each plotted value are sometimes 
recommended: Moving average and moving 
range charts and the cusum ("cumulative 
sum") chart are examples. The statistical 
theory for these charts is more complicated 
than that for the simple charts discussed here. 


C. Sampling Inspection 


Sampling inspection determines whether a lot 
should be accepted or rejected by drawing a 
sample from it, observing a quality character- 
istic of the sample, and comparing the ob- 
served value to a prescribed acceptance crite- 
rion. Sampling may be conducted in several 
stages. Definite criteria are required to decide 
at each stage whether to accept or reject the 
lot or continue sampling on the basis of sam- 
ple values observed so far. There also must 

be some rules to determine the size of the next 
sample if it is to be taken. These criteria and 
rules together are called the sampling inspec- 
tion plan. The number of samples eventually 
drawn and observed and their sizes are gener- 
ally random variables. In single sampling in- 
spection the final decision is always reached 
after one stage of sampling is completed. Dou- 
ble sampling inspection makes the final deci- 
sion after at most two stages of sampling are 
completed. Multiple sampling inspection makes 
the final decision after at most N stages of 
sampling are completed (N « oo). Inspection 
without a predetermined limit on the number 
of sampling stages, sequential sampling inspec- 
tion, is usually constructed so that the proba- 
bility of the indefinite continuation of sam- 
pling is 0. 

Once a sampling inspection plan is deter- 
mined, the probability for accepting a lot with 
given composition can be calculated. This 
probability as a function of lot composition ts 
called the operating characteristic of the plan. 
In most cases, the quality of a lot is expressed 
by a real parameter 0 (e.g., fraction defective, 
i.e., percentage of defective products, or the 
average of some quality characteristic), and we 
use only inspection plans whose operating 


404 Ref. 
Statistical Quality Control 


characteristics are expressed as a function of 0. 
The graph of this function is called an OC- 
curve. We impose certain desirable conditions 
on the OC-curve and design plans to satisfy 
them. Tables for this purpose, sampling inspec- 
tion tables, are prepared for practical use. The 
condition most frequently employed is ex- 
pressed in the following form in terms of four 
constants fo, 0,, x, p: The probability of rejec- 
tion is required to be at most « when 0 «6, (or 
0 > 0o), and the probability of acceptance at 
most f/ when 02 0, (or 0 € 0,). Here « is called 
the producer's risk, and fj the consumer's risk. 

If the rejection of a lot is identified with the 
rejection of a statistical hypothesis 0 < 0p, the 
OC-curve is actually the power curve of the 
test upside down (i.e., the graph of 1 minus 
the *power function), and the producer's and 
consumer's risks are precisely the ferrors of the 
first and second kind. The choice of a plan is 
actually the choice of a test under certain con- 
ditions on its power curve. Commonly used 
plans are mostly based on well-established 
tests, some of which have certain optimum 
properties. A few examples, (1)-(4), are given 
below. Here sampling inspection by attribute is 
an inspection plan that uses a statistic with a 
discrete distribution, whereas sampling inspec- 
tion by variables uses a statistic with a contin- 
uous distribution (— 400 Statistical Hypoth- 
esis Testing). 

(1) Single sampling inspection by attribute 
concerning the fraction of defective items in a 
lot: Let the defective fraction be denoted by p 
and identified with 0 in the preceding para- 
graph. Assign two values of p, say po and p, 
(0 py « p, <1), the producer's risk a, and the 
consumer's risk D Together they give con- 
ditions to be fulfilled by the OC-curve. Draw n 
items from a lot at random, and suppose that 
they contain Z defective items. The decision is 
then made after observing Z, whose distri- 
bution is thypergeometric and approximately 
*binomial when the size of the lot is large 
enough. There exists a plan that minimizes n 
among all plans satisfying the imposed con- 
ditions under either of the two assumptions 
about the distribution of Z. It rejects the lot 
when Z is greater than a fixed number deter- 
mined by po, pı, % and $. This plan is based 
on the *UMP test of the hypothesis p € po 
against the alternative p 2 p,. 

(2) Single sampling inspection by variables 
concerning the population mean yp in the case 
where the population distribution is NI, o?) 
with known o°: Draw a sample (X,, ..., X,) of 
size n from a lot. Assume that the X are inde- 
pendently distributed with the same distri- 
bution N(u, a?). Suppose that smaller values of 
the quality characteristic stand for a more 
desirable quality. If two values of 4, say uy and 
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H4, X, and f are assigned, a plan can be estab- 
lished to minimize n. It rejects the lot when 
the sample mean X = Y, X;/n exceeds a fixed 
number determined by Ge, H4, x, and fj. This 
too is based on the UMP test of u € uo against 
HZ. 

(3) Cases where the samples are drawn in 
more than one stage: As in (2), assign two 
values of 0, a, and £; there is still liberty to 
choose n,, nz, ..., which are the sizes of the 
samples drawn at each stage. Hence there are 
many possible plans fulfilling the imposed 
conditions. Among them a plan is sought 
to minimize the expectation of n=n,+n,+ 
... (called the average sample number). For 
example, plans based on the sequential proba- 
bility ratio tests are in common use (— 400 
Statistical Hypothesis Testing). 

(4) Among other special plans, sampling 
inspection with screening and sampling inspec- 
tion with adjustment are worthy of mention. In 
the first plan, all the units in the rejected lots 
are inspected and defective units replaced by 
nondefective ones. In this case, fixing p, (the 
lot tolerance percent defective) and f, or the 
average fraction defective after the inspection 
(average outgoing quality level), we attempt to 
minimize the expected amount of inspection, 
that is, the expected number of inspected units 
including those in the rejected lots. In the 
second plan, acceptance criteria are tightened 
or loosened according to the qualitv of the lots 
just inspected. 
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405 (XVII.16) 
Stochastic Control and 
Stochastic Filtering 


A. General Description of Stochastic Control 


Stochastic control is an optimization method 
for systems subject to random disturbance. Let 
T be a compact convex subset of R*, called a 
control region. Let W, be an tn-dimensional 
Brownian motion and c,(W)— c(W,; s <t) (say 
F) be the least to-field for which W,, s <t, are 
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measurable. An t¥,-progressible measurable 
T-valued process is called an admissible con- 
trol. For an admissible control U, the system 
evolves according to an n-dimensional con- 
trolled stochastic differential equation (CSDE) 


dX, —o(X,, U) aW, (Xp U,) dt, 


where a symmetric n x n matrix a(x, u) and n- 
vector y(x, u) are continuous in R” xT and 
Lipschitz continuous in xe R”. Hence the 
CSDE has a unique solution, called the re- 
sponse for U,. The problem is to maximize (or 
minimize) the performance J: 


J(t, X, Q, U) = | | e So (Xs, (E U) dt 


0 
pe loc. "ed 


where X, is the response for U, with X, =x and 
t is a constant time or a *hitting time asso- 
ciated with a target set. We put V(t, x, q)— 
SUPucadm.controt J (T, X, 9, U) the value function 

as a function of x. If the supremum value is 
attained at an admissible control Ü,, then Ü, is 
called an optimal control. 


B. Bellman Principle 


In order to get V(t +s, x, ø), R. Bellman ap- 
plied the following two-stage optimization. 
After using any U up to time t, a controller 
changes U to an optimal one. Then at time t + 
s the performance J (t, x, V(s,-, @), U) is ob- 
tained. Taking the supremum with respect to 
U, one gets Vi 4- s, x, 9). This is called the 
Bellman principle. Let C be the *Banach lattice 
of the totality of bounded and uniformly con- 
tinuous functions on R". Suppose that o, y, f 
and c(>0) are bounded and smooth; then for 
constant time t, the value function V(t, x, o) 
belongs to C whenever pe C. Moreover, the 
family of operators V(t) defined by V(t)o(x)— 
V(t, x, 9) becomes a *monotone contraction 
semigroup on C. The semigroup property 
V(t+s, x, 9) — V(t, x, His, -, g)) is nothing but 
the Bellman principle. The tgenerator G is 
expressed by 

Go(x)— SUP {L*e(x)—c(x, up(x) + f(x, u)j 

for a smooth function @, where L” is the gen- 
erator of *diffusion of the response for con- 
stant control u(c I), namely, 


E T eA 
=- 9; (X, u)ac; (x, Hl 
2 i,j pat SES uP OX; OX; 


n Ó 
+ VAX, —-, 
Laks 


Furthermore, assume that a is uniformly posi- 
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tive definite and o is smooth. Then V(t)o(x) 

belongs to tW,);2, for any p and is the unique 
solution of the Bellman equation ( = dynamic 
programming equation) 


oW 
Gy 9UP (L"W — c(x, u) W + f(x, u)) 
uer 


a.e. on (0,00)xR", W(0,x)—o(x) on R". 


In addition if inf, ,c(x, u) » 0, then W= 

bm... V(t)@ exists and is the unique solution 
of the Bellman equation sup, (L"W — c(x, u)W 
+ f(x, u)} 20 a.e. on R”. When 7 is the hitting 
time, the value function is related to the Bell- 
man equation with a boundary condition. 


C. Feedback Control 


In practical problems we specify the kind of 
information on which the decision of the con- 
troller can be based at each time. We fre- 
quently assume that the data obtained up to 
that time is available. The following situations 
are possible: (1) The controller knows the com- 
plete state of the system. This is called the case 
of complete observation. (2) The controller has 
partial knowledge of the state of system. This 
is called the case of partial observation. A 
feedback control (= policy) is a function of 
available information, namely, a I -valued 
progressible measurable function defined on 
[0, oo) x CID, oc), where j is the dimension of 
data and C’[0, oo) is a metric space of totality 
of j-vector valued continuous functions on 

[0, oo). A policy U is called a Markovian policy 
if U(t, £) is a Borel function of t and the tth 
coordinate of £. When a policy U is applied, 
the system is governed by the 'SDE 


dX,=a(X,, U(t, Y)dW,4- y(X, U(t, Y))dt 


with data process Y. When the SDE has a 
tweak solution, U is called admissible. For 
example, when X — Y, any Markovian policy is 
admissible if « is uniformly positive definite. 
Let X, be a weak solution for U. Then its 
performance Jr, x, 9, U) < V(t, x, @). 

(1) The case of complete observation. When 
a is uniformly positive definite, an optimal 
Markovian policy can be constructed in the 
following way. Since I is compact, there exists 
a Borel function Ü on [0, oo) x R" which gives 
the supremum, namely, 


sup (L"V(t)g(x)— c(x, u) V(t)e(x) + f(x, u)) 


uel 
= LUt-9y(t)o(x)— c(x, Ü(t, x)) V(t)o(x) 
4 f(x, Ü(t, x)). 


This relation implies that V(t) g(x) — J(t, x, p, 
Ü (t, X,)) for any weak solution X,. Hence 
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Ü is an optimal Markovian policy. Especially 
when a, c, and f are independent of u and 

y(x, u) = R(x): u, where R(x) is an n x k matrix, 
an optimal Markovian policy Ü is obtained 
by a measurable selection of maximum points 
of (grad. V(t), R(x); u). Since the supremum 
of a linear form is attained at the boundary 
VT, one can suppose that Ü (t, x) belongs to 
oT. This is called bang-bang control. 

(2) The case of partial observation. One 
useful method is the separation principle. This 
means that the control problem can be split 
into two parts. The first is the mean square 
estimate for the system using a ‘filtering. The 
second is a stochastic optimal control with 
complete observation. But generally speaking, 
the problem of deciding under what conditions 
the separation principle is valid is difficult. In 
the case of the following linear regulator the 
separation principle holds. 

Suppose that the system process X, and the 
observation process Y, obey the following 
SDEs: 


dX,— A(t) aW, - (B(t) X, + b(t, U (t, Y))dt, 
dY,=dW,+ H (t) X, dt, 


where A(t), B(t), and H(t) are nonrandom 
matrix-valued functions and W, is a j- 
dimensional Brownian motion independent of 
W,. The problem is to search for a feedback 
control which gives the maximum value. Sup- 
pose that a feedback control U (t, č) is Lip- 
schitz continuous in €€C/[0, oo). Put 


Q(s, x) 


T 
— sup aj Lit, X U(t, D 
U,Lip s 

where (X,, Y,) is the unique solution for U with 
the initial condition X, — x, Y, — 0. By the Lip- 
schitz condition of U, o, 2 o(Y;; s <t) is inde- 
pendent of U, and the tconditional expectation 
X, — E(X,/o,) is governed by the following 
SDE, by way of the tKalman-Bucy filter: 


dX,= P(t) H(t) aW +(B(t)X,+ b(t, U,)) dt 


with some oa,-progressible measurable control 
U, and an n-dimensional Brownian motion 
WS adapted to c,. Moreover, P(t) is the terror 
matrix satisfying the *Riccati equation, and 
H' 1s the transpose of H. Let g(t, x) be the 
probability density of the normal distribution 
N(0, P(t), and put (t X, u)= ( L(t, x,u) g(t, x 
— X)dx and P(x) 2 ('(x)g(T, x — £) dx; then 
the problem turns into 


à 
Q6) sup Bal | Ee Goa + Hx, | 
U s 


Recalling the SDE for X,, we can use the Bell- 
man equation for choosing an optimal one. 
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D. Stochastic Maximum Principle 


A stochastic version of *Pontryagin's maxi- 
mum principle gives a necessary condition for 
optimality. This means that the instantaneous 
value of optimal control maximizes the sto- 
chastic analog of Pontryagin's Hamiltonian. 
suppose that the system evolves according to 
an n-dimensional CSDE 


dX, —a(Xj)dW,- y(X,, U,) dt. 


The problem is to seek conditions on admis- 
sible control U, such that E.A f(X,, U,)dt] is 
maximized, where T is a constant time. Assume 
that x, y, and f are bounded and smooth. 
Define a Hamiltonian H on R” x I x R” by 
H(x,u, WV) - y(x, u): V + f(x, u). Let U, be op- 
timal and X, its response starting at x. Then 
under some conditions there exist 4>0 and F,- 
progressively measurable q, , — (4,1, . di 
qik) (Kk 1, ..., n) and P, (P, , Y, n) which 
satisfy the SDE 

d'H. Ju, Ü) bo. A, d dr 
f OX, ô 


Xk 
+q dW, k=1,...,n, 


and H(X,, Ü,, Y) maner H(X,, u, V) ae. 


uer 


E. Optimal Stopping and Impulse Control 


Suppose that X, is an n-dimensional diffusion 
whose generator A is an elliptic differentia! 
operator. Let t be a tstopping time. The op- 
timal stopping problem is to seek a stop- 

ping time so that E,[g(X,] is maximized, 
where g is nonnegative and continuous. 1 is 
called optimal. The value function V(x)= 

sup, E,[g(X.)] is characterized as the least 
*excessive majorant of g. Moreover, under 
some conditions V belongs to the domain of A 
and is the unique solution of the tfree bound- 
ary problem; Vz g, AV«O0, and (V —g): AV=0. 
Therefore, in the Hilbert space framework, the 
value function is related to the variational 
inequality. An optimal stopping time is pro- 
vided by the hitting time for the set {x| V(x) — 
g(x)}. 

Impulse control is a variant of the optimal 
stopping problem. At some moment (=stop- 
ping time) a controller shifts the current state 
to some other state. But not all shifts are al- 
lowed: State x can be shifted to a state of x+ 
[0, oc". Let t, k= 1, 2, be a sequence of in- 
creasing stopping times and č, be a [0, oo)"- 
valued c, (X)-measurable random variable. 
The sequence U = (1,,65,, 15,6, ... is called an 
impulse control. U transfers the process X, to 


t 


vu) 


0 


adw Late: S £55 


0 cxt 
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if 
S ô 
ax.” 


H 


1 
A Yit + DH) 


y OX;Oxj 
and the problem is to seek U so as to maximize 
E (foe "f(Y")dt Ek e ^"K(5)], where 

À (20) is constant and the function K (20) 
stands for the cost of shifting. The value func- 
tion is related to a quasivariational inequality. 


F. General Description of Stochastic Filtering 


The problem of estimating the original signal 
from data disturbed by noises is called a sto- 
chastic filtering problem. Let X,, te[O, T], bea 
continuous stochastic process with values in 
R", called a signal (or system) process. It is 
transformed (or coded) to h(t, XL where h(t, x) 
is an m-vector-valued continuous function. 
Suppose that it is disturbed by a noise W, and 
we observe Y, — h(t, X,)+ W,. Usually W is 
assumed to be a *white noise independent of 
X,. Since the white noise is a generalized func- 
tion, the integral of Y,, i.e., 


0 


eil h(s, X,)ds+ W, 


is called an observation process, where W, is an 
*m-dimensional Brownian motion independent 
of X,. It is assumed for convenience that | X,|? 
and fo |A(s, XA ds are integrable. 

Assume that X, is a 1-dimensional signal 
process. The least square estimation of X, by 
nonlinear functions of observed data Y, s « t, 
is called a nonlinear filter of X, and is denoted 
by X,. Let F, or o(Y;s«t) be the least to-field 
for which VK s<t, are measurable. Then the 
filter X, is equal to an *Z,-measurable random 
variable such that E| X, — Y, < E| X, — ZI? 
holds for any measurable L? random vari- 
able Z. Hence it coincides with the tcon- 
ditional expectation E[ X,|.2;]. Now let H, be 
the closed linear space spanned by Y, s«t. 
The least square estimation of X, by elements 
of H. i.e., the orthogonal projection of X, onto 
H,, is called the linear filter of X, and is de- 
noted by X,. Obviously, the mean square error 
of a nonlinear filter is less than or equal to that 
of a linear filter, but a linear filter is calculated 
more easily. If (X,, Y) is a (Gaussian process, 
both filters coincide. 

When X, is an n-dimensional process (X;', 
..., X”), the n-vector process X,=(X/,..., X7) 
(or X, 2 (X4, ..., X")) is called the nonlinear 
(or linear) filter of X,. 


G. Kalman-Bucy Filter 


Suppose that the signal process X, is governed 
by a ‘linear stochastic differential equation 
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(LSDE) 


t t 
XoXo | at: | B(s)dW,, 

0 0 
where A(s) (or B(s)) isannxn(ornxr) 
matrix-valued continuous function, W, is an r- 
dimensional Brownian motion independent of 
the noise W,, and the initial data X, is a Gauss- 
ian random variable independent of W, and 
H. Suppose further that h(t, x) is linear, i.e., 
h(t, x) = H(t)x, where H(t) is an m x n-matrix- 
valued function. Then the joint process (X,, Y.) 
is Gaussian. Hence the nonlinear filter X, 
coincides with the linear filter and satisfies 

t 

X,- E[Xy] + | (A(s)— P(s) H(s) H(s)) €,ds 


0 


t 
d P(s)H(s)' dY,, 
0 

where H (s) is the transpose of H(s), and P(t)= 
(P,(t)) is the error matrix defined by P,(t)= 
E(Xi — Xi) (Xj — Xj). It satisfies the matrix 
Riccati equation 


dP(t) r 
SC A(t)P(t)+ P(t) A(0) 


— Po Hi P(t) + BDT, 
P(0) 2 covariance of Xo. 


Let (t, s) be the "fundamental solution of the 
linear differential equation dx/dt 2 (A(t) — 
P(t) H(ty H(t))x. Then the solution €, is repre- 
sented by 


t 
X, (50) ELX4] + | bit, s)P(s)H(sy dY,. 

0 
This algorithm is called the Kalman-Bucy filter 
[1]. Analogous results for discrete-time models 
have been obtained by Kalman. 


H. Nonlinear Filter 


In the study of nonlinear filters, the tcon- 
ditional distribution z(dx)= P(X,edx | F,) is 
considered besides X,. Suppose that X, is 
governed by the SDE 


t t 
a(s, X )ds + | b(s, X)aW,, 


o 


X, = Xo | 
Q 
where a(s, x) (or bis, x)) is an n-vector (n x r- 
matrix) valued Lipschitz continuous function. 
Then z,( f) [ f(x)n,(dx) satisfies the SDE 


t 


nf) - ELfGG)] | n,(Lf)ds 


0 


d (alh f)—2,(h.) T )) (d Y, — n,(h.,) ds), 
0 
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where h,(x) — h(s, x) and 


of 
ox! 





Lf(x) - Y; a'(s, x) 


1 o 
+ 22. (5 bas, X)bg, (s, ») axtaxt . 


Under additional conditions on a(s, x) and 
b(s, x), rt, (dx) has a density function z,(x), and 
it satisfies 


t t 


L*n,(x)ds + | T (x) (h(x) 


0 


= [conn dx) DUU ch) 


where L* is the formal adjoint of L. 

The process I, = Y, — f$ z.(h,) ds is a Brown- 
ian motion such that eil, s x t) C F holds 
for any t. If oU,;s<t)=F, holds for all t, J, is 
called the innovation of Y,. The innovation 
property is not valid in general. A sufficient 
condition is that (X,, Y) is a Gaussian process 
or h(t, x) be a bounded function. However, in 
any case, t¥,-adapted martingales are always 
represented as tstochastic integrals of the form 
Ei- fo Giedi, where the OU are F,-adapted 
processes. 


n(x)— T(x) T | 


0 


I. Bayes Formula 


Let C (or D) be the space of all continuous 
mappings x (or y) from [0, T] into R” (or R”) 
equipped with the uniform topology. x, (y,) is 
the value of x (y) at time t. Let &(C) be the 
least o-field of C for which x,, s<t, are mea- 
surable. &(D) is defined similarly. We denote 
by Dy, Dy, Dy y the tlaws of processes X,, W, 
and (X,, Y respectively. These are defined on 
(C), Z:(D), and Z,(C) ® &,(D), respectively. 
Then 9, y is equivalent (mutually tabsolutely 
continuous) to the product measure ®, ® Dy 
on each &(C) & &{D). The *Radon-Nikodym 
density o, of By y with respect to Dy ® Dy is 
written as 


(x, y) exp D Jis x,)dyi 


=) IA 


where h’ and y! are the corresponding com- 
ponents of vectors and dy! denotes the tIto 
integral. 

The conditional distribution z,(dx) is com- 
puted by the Bayes formula: 


PAS) 
p(1)' 


where Y 2(Y,; 0x t € T). Moreover, p,( f) satis- 


n(f)— 





pf) [r (x,)%(x, Y), (dx), 
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fies the LSDE 


aan: wt: | pli) p.(.f) d1,. 
0 W 


The density p,(x) (if x exists) satisfies 


t 


P(X) = po(x) + | L* p,(x)ds 


O 


d wel | elorhivrdy Jans 


If h(t, x) is a smooth function, then «,(x, y) is 
continuous in y, so that z,(f) or p,(f)is a 
continuous functional of the observed data 
(Y; s & t). Thus the filter z, is a frobust statistic. 

Remarks. (i) the signal and noise are not 
independent if the signal is controlled based 
on the observed data. In these cases, correc- 
tion terms are sometimes needed for the SDE 
of the nonlinear filter. (11) If the *sample paths 
of the signal process are not continuous, a 
similar SDE for a nonlinear filter is valid with 
L being replaced by some integrodifferential 
operator. If it is a (Markov chain with finite 
state, L is the generator of the chain. (iii) Sev- 
eral results are known for the case where 
the noise W, is not a Brownian motion but a 
tPoisson process. 
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406 (XVII.14) 
Stochastic Differential 
Equations 


A. Introduction 


Stochastic differential equations were rigor- 
ously formulated by K. Itó [7] in 1942 to 
construct diffusion processes corresponding to 
Kolmogorov's differential equations. For this 
purpose he introduced the notion of stochastic 
integrals, and thus a differential-integral cal- 
culus for sample paths of stochastic processes 
was established. This theory, often called Itó's 
stochastic analysis or stochastic calculus, has 
brought an epoch-making method to the theory 
of stochastic processes. It provides us with a 
fundamental tool for describing and analyzing 
diffusion processes that we can apply effec- 
tively to limit theorems and to the probabil- 
istic study of problems in analysis. It also 
plays an important role in the statistical the- 
ory of stochastic processes, such as fstochastic 
control or *stochastic filtering. Stochastic dif- 
ferential equations on manifolds provide a 
probabilistic method for differential geome- 
try, sometimes called stochastic differential 
geometry. Recently, many interesting examples 
of infinite-dimensional stochastic differential 
equations have been introduced to describe 
probabilistic models in physics, biology, etc. 

A unified theory of stochastic calculus has 
been developed in the framework of Doob's 
martingale theory and this, combined with 
Stroock and Varadhan's idea of martingale 
problems, provides an important method in 
the theory of stochastic processes (— 262 
Martingales). 


B. Stochastic Integrals 


As is well known, almost all sample paths of a 
Wiener process are continuous but nowhere 
differentiable (— 45 Brownian Motion), and 
hence integrals with respect to these functions 
cannot be defined as the usual Stieltjes inte- 
grals. But these integrals can be defined by 
making use of the stochastic nature of Brown- 
ian motion. Wiener defined them (the Wiener 
integrals) for nonrandom integrands, but Itó 
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defined them for a large class of random inte- 
grands. Itó's integrals have been extended in 
the martingale framework by H. Kunita and S. 
Watanabe, and by others [14, 19], as shown 
below. 

Let (Q, F, P) be a probability space, and 
let F= {F} >0 be an increasing family of o- 
subfields of F. Usually we assume that {F} is 
right continuous, i.e., Za" sel ln fr =F, 
for every t>0. Denote by M —.ZJ(F) the total- 
ity of all continuous square-integrable martin- 
gales X —(X,) relative to {Z}; to be precise, 

X is an {¥,}-martingale such that, with proba- 
bility 1, X, —0, t5 X, is continuous and EX) 
« oo for every t Z0. We introduce the metric 
IX- YI 2 EZ42^"min(1, | X, — Y,1;) on æ 
where || ||; stands for the L,(Q, P)-norm. We 
always identify two stochastic processes X — 
(X,) and Y (Y) if sample functions t— X, 

and t— Y, coincide with probability 1. Then, by 
virtue of Doob's inequality || maxo <,<,|X,— 
Y, €2/ X,— Yl; (^ 262 Martingales), M 
becomes a complete metric vector space. 

Next, by an integrable increasing process we 
mean a process A =(A,) with the following 
properties: (i) A is adapted to {F}, i.e., A, is 
¥,-measurable for every t > 0; (ii) with proba- 
bility 1, Ag —0, t A, is continuous and nonde- 
creasing; (iii) A, (20) is integrable for every 
t Z0, re, E(A,)< oo. We denote by £ = A(F) 
the totality of integrable increasing processes. 
We call a process V 2 (V) an integrable process 
of bounded variation if V is expressed as V, — 
A} — A? with A’, A? e.. The totality of inte- 
grable processes of bounded variation is de- 
noted by Y^ 2 Y (F). It follows from the *Doob- 
Meyer decomposition theorem that, for every 
M, Neff, there exists a unique Hei such that 
M,N,— V, is an (.Z,]-martingale. We denote 
this V as <M, N). In particular, (M, M» e £, 
and it is denoted simply by <M). (M, N» is 
called the quadratic variation process because 
Zi- (M, — M, JON, — AN. si, NA in 
probability as |A| 50, where A:tp =O<t, <... 
«t,—t is a partition and |A| - max, «;«,|f; — 
t; .,]. Brownian motion is the most important 
example of continuous square-integrable mar- 
tingales, and this is characterized in our frame- 
work as follows. Suppose that a d-dimensional 
continuous {¥,}-adapted process X = (Xj) 
satisfies Mi — Xi — Xie M and (M', M4), = 
éi, i, j=1, 2, ...,d. Then X is a d-dimensional 
Brownian motion such that X, — X,, and the 
F are independent for every uzvz t. Such a 
Brownian motion is called an (.2,]-Brownian 
motion, and a system of martingales M'e./ 
having this property is often called a system 
of { ¥,}-Wiener martingales. 

Now, we fix Me M. We denote by (M) 
the totality of real, (.7;j-adapted, and measur- 
able processes ® =(@(t)) such that |/®||? 4 = 
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EL fo (s? d(M»,] < oo for every t>0. Two 
®,, P,e (Mare identified if |P; —®,||, u= 
0 for all t>0. Since || ||, 4 is an L,-norm 

on [0,t] x Q with respect to the measure 

Iu (ds, dw) 2 d (M »,(o) P(do»), it is easy to see 
that X (M) is a complete metric vector space 
with the metric | — ®'|| y = 57,2 "min(l, || 
— Q' |, m), P, P'E (M). If  — (o(t)) is given, 
for a partition 0— to «t, « ... «t,...—oo and 
4, measurable bounded functions f;, i— 0, 
1,..., by 


omo fo t=0, 


fi-1(0). LEE 


then De Z(M) and the totality Y, of such 
processes are dense in (M). If De Y, we 
define I" ($) — (1 (4) (t)),.. by 


iz 1,2, ... 


5 


n-1 


M()(t) = 25 fí(M,,., = M.) LO a M,,), 


i=0 


WESTER 


Then I" ($e. Æ, and it holds also that (1M (®), 
IN(V)» = [5 (SV (s)d(M, N), for M, Ne M 
and ®, V e Y,. In particular, |^ (0)(t)]]2 = 
E[X1"(0)»,]— ||? 4, and hence ||1"(®)|| = 
||| ,. This implies that De c Y,(M)> 

I" ()e. M is an isometric linear mapping, and 
hence it can be extended to Z(M) uniquely, 
preserving the isometric property. TN (le 

M is called the stochastic integral of De 
La(M) by M e. A. 1™()(t) is often denoted by 
fo (s) 4M,, and the random variable I" (®)(t) 
obtained by fixing t is also called a stochastic 
integral. 

The definition of stochastic integrals can be 
extended further by the following localization 
method. For an F,\-*progressively measur- 
able process X =(X,) and {¥,}-stopping time 
o, the stopped process X? (X7) is defined by 
X? =X, no (t^e - min(t,o)). It follows from the 
*optional sampling theorem of Doob that 
X* e Af if X eI. For 6 —(o(r))e £,(M) and an 
{ 4 ]-stopping time c, ®, — (5, (t)) defined by 
$,(t) — I,,..,, (t) also belongs to (M) and it 
holds that /M(o,) — [I^ (0) ]*. Keeping these 
facts in mind, we give the following definition. 
Let .//^* = (M =(M,)|there exists a sequence 
of ( Z,-stopping times c, such that o, « oc, 
0,100 as n— oo a.s. and Mise A for every n 
—1.2,... . 7 * and Zar are defined in a 
similar way. For M, N e M’, (M, Nye Y^** is 
defined to be the unique process in 2 oz such 
that <M, N” M, N»?" for a sequence of 
stopping times Jo, as above, which can be 
chosen common to M and N. (M, M» is de- 
noted by <M) as before. We fix M e M% and 
set *(M)- (6 =(M(t))[a real, { ¥,}-adapted 
and measurable process such that, with proba- 
bility one, fọ (s? d(M», < œ for every tz 0]. 
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For De 7 (M), we can choose a sequence of 
{ #,}-stopping times {a,} such that. o, « oo, 
0,1 oo as. and MEM, b, e (M?) n=1, 

2, .... For example, set c, ^ min[n, inf(t| (M), 
+ fo D(s) d(M»,z n) ]. Then there exists an 
I" (D)e M’: such that IM()% = IV^(6, ) for 
n — 1,2, ..., which is unique and indepen- 
dent of a particular choice of {o,}. I^ (®) is 
called the stochastic integral of De 21° (M) 
by M e le, IM (d) (t) is often denoted by 

fo ®(s)dM,, and the random variable 1” (®)(0) 
obtained by fixing t is also called a stochastic 
integral. 

Some of the basic properties of stochas- 
tic integrals are: (i) If M e äis, De zou 
and V e Z7" (I"(à)), then OW e Z7"*(M) and 
IM (d) = I" (Np (ii) If Me, le and ®, Pe 
ZF*(M), then for every a, BER we have a + 
BY =(a@(t) + BP (t))e A (M) and IM(ad + BY) 
— a1" (o) 4- BIM (VP). Also if M, N e //** and b 
e £3°°(M)N #3}°°(N), then for x, BER we have 
Qd + BN) and I*M *8N(@) -- 1M (o) + 
BIN (6). (iii) If M, N e 4, be ZI (M), and 
V e ZN), then DY e Z*((M,N»)and 
(Zb I"(V)5, = fo 0, P, d M, N>,. Here, 
L(V) for Kei oe is defined as follows: 
With probability 1, s2 V, is of bounded varia- 
tion on every finite interval TO. t], the total 
variation of which is denoted by INL. Then 
INle lr, and we define ¥!°*(V) (pz 1) to 
be the totality of real, (.Z,]-adapted, and 
measurable processes ® =(@(t)) such that, 
with probability, {§|®(s)|’d|V|,< oo for 
every t>0. In particular, Z}°°(M) = l**(«M»). 
(iv) If M e M, e Z(M), Ge c isan {F,}- 
stopping time, then 1” (®,) = IM" (q) = IM" (d) 
= [1^ (0) T. (v) If dt) Ja f, where o is 
an { ¥,\-stopping time and f is a bounded 
F -measurable random variable, then Be 
E **(M) for every M e M and IM(®)(t)= 
f(M,— ML (vi) The definition of stochastic 
integrals is independent of the increasing 
family of o-subfields in the following sense: If 
{G} is another family such that M belongs 
to the class ./"* for both (2) and ISL and D 
belongs to the class (M) for both {4%} and 
{G,}, then I (à) is the same whether it is de- 
fined with respect to {F} or {G}. 

In particular, N = I^ (i), M e M, De 
£3°°(M), satisfies <N, L», — fo b(s)d( M, LD, 
for all Le äer Conversely, Ne.@' having 
this property is unique, and hence it coin- 
cides with I (à). I" ($)e.4 if and only if 
HOY d(M»,e v. 

The above definition of stochastic integrals 
can be extended with a slight technical modifi- 
cation to the case when M, is not necessarily 
continuous [19]. Among such general stochas- 
tic integrals, a particularly important role is 
played by stochastic integrals describing point 
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processes, including Poisson point processes as 
an important special case. These stochastic 
integrals are important in the study of discon- 
tinuous processes including * Lévy processes; 
even in the study of continuous processes, such 
as diffusion, they provide an important tool 
for the treatment of excursions [4, 6, 8 ]. 

Let (Q, F, P) and {F,} be as above. By a 
continuous iens n with respect to 
{LF}, or simply a semimartingale when there is 
no danger of confusion, we mean a process 
X —(X (t)) of the following form: X (t) - X(0) 4- 
M (t) - V(t), where X(0) is an Z,-measurable 
random variable, M -(M(t))e.Z'** and V = 
(V(t))e ze M and V are uniquely deter- 
mined from X, and this decomposition is 
called the semimartingale decomposition of X. 
M is called the martingale part and V the drift 
part of the semimartingale X. A semtmartin- 
gale X is often called an Ité process if M (t) — 
fo GE and V(t)= fọ Y(s)ds, where B(t)e 
AM is an { ¥,\-Brownian motion, De Z7*, and 
We ge. (When V, —t, (V) is denoted 
simply by °°.) A d-dimensional process 
whose components are semimartingales is 
called a d-dimensional semimartingale. The 
following formula, originally due to Itó and 
extended by Kunita and Watanabe, is of fun- 
damental importance in stochastic calculus. 

Itó's formula. Let X (t) - (X ' (t), ..., X*(t)) be 
a d-dimensional semimartingale and X'(t) — 
X'(0) 4- M'(t) 4- V'(t) be the semimartingale 
decomposition of components. Let F(x)— 
F(x!, ..., x?) be a C?-function defined on Ri. 
Then F(X (t)) is also a semimartingale, and we 
have 


F(X(t)= roy. | D,F(X (s) aM'(s) 
+3 [or X (s))dV'(s) 


d t 
$5 Y | D;D,F(X (s))d(M', M?» (s) 
i j=1 JO 


(where D;= 


In others words, if Y (t)  Y(0) - M(t) 4- V(t) is 

the semimartingale decomposition of Y(t) — 
F(X(t)), then M(t) =EL, fi D,F(X (s) dM'(s) 

and V(r) XL, [5 D; F(X (S) dV'(s) + 

1/22; i-i fS D;D;F(X () 44M", MIX (s). 

We now discuss other important transfor- 
mations on semimartingales. 

Time change. Let Ae dl, and assume fur- 
ther that with probability 1, tA, is strictly 
increasing and lim,,,, A,= oo. Let u—C, be the 
inverse function of t A,, i.e., C, — mint | 
A,Z uj. Then for every u2 0, C, is an {F}- 
stopping time. Set F = Fp, tz 0. For an {F}- 
*progressively measurable process X —(X (t)), 





e/Ox!). 
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we define X^ —(X^(t)) by X^(t) 2 X(C,) and 
call it the time change of X determined by A 
Then X^ is progressively measurable with 
respect to LZ. If X : X (t) — X(0)2- M(t) - V(t) 
is a semimartingale with respect to {Fj}, then 
X^ is a semimartingale with respect to 1%}, 
and its semimartingale decomposition is given 
by X4(t)= X(0)+ M4(t)+ V^(t). The map- 
pings M M^ and V — V^ are bijections 
between ëlo? and Zo: and between ^e 

and Zoé, respectively, where .// '** and Ze 
are defined relative to {¥,}. Furthermore, 
(MA, N^» - CM, N»* for every M, N e M. 
Noting that De ¥!°°(M) can always be chosen 
{.F,}-progressively measurable (in fact (.7j- 
*predictable), the mapping  — 0^ defines a 
bijection between £3°°(M) and Z7*(M^), and 
we have IM^(^) — [I^ IT. 

Transformation of drift (Girsanov transfor- 
mation). For me M", set D, (t) - exp[m,— 
4<m),]. Then D, — 1 €.Z**, and if m satisfies 
a certain integrability condition (for example, 
E(exp[4<m),]) < oo for every t 20, in partic- 
ular, <m», & ct for all t for some constant c» 
0), then D, is a martingale, i.e., E(D,,(t)) = 1 
for all tz 0. If E(D, (t) 2 1 for all t, then there 
exists a probability P on (Q, F) (if (Q, F) is a 
nice measurable space and  — V,. 9 4, which 
we can assume without loss of generality) such 
that P(A)= E(D, (t): A) for a Ac F, tz 0. Let 
X be a semimartingale with the decomposition 
X (t) — X(0) 4- M (t) - V(t). On the probability 
space (Q, F , P) with the same family {F}, X is 
still a semimartingale but its semimartingale 
decomposition is given by X (t) 2 X (0) - M (t) - 
V(t), where M(t)= M(t)—<M,m)(t) and P'()— 
V(t)+ <M, m» (t). Furthermore, it holds that 
(M, N» 2 M, N», M, N e A(**, This result is 
known as Girsanov's theorem. The transfor- 
mation of probability spaces given above is 
called a transformation of drift or a Girsanov 
transformation since it produces a change as 
shown above of the drift part in the semimar- 
tingale decomposition. 

In the discussion above, the increasing 
family (.Z,j was fixed. It is also important to 
study how the semimartingale character 
changes under a changing increasing family 


[12]. 


C. Stochastic Differentials 


In this section, we introduce stochastic dif- 
ferentials of semimartingales and rewrite the 
results in the previous section in more conve- 
nient form. Let (Q, F, P) and {F} be as above 
and M, A, Y^, M, glo, goe be defined as 
in Section B. By 2 we denote the totality of 
continuous semimartingales relative to {F}. 
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For X €2,let X (t)= X(0)+ My(t)+ V,(t) be the 
semimartingale decomposition. We write 
formally X (t) — X (0) — fọ dX (s) and call dX 
(denoted also by d X, or d X (t)) the stochastic 
differential of X. To be precise, dX can be 
considered as a random interval function 
dX (1)= X(t) — X(s), I — (s,t] or the equivalence 
class containing X under the equivalence 
relation X ~ Y on 2 defined by X ~ Y if and 
only if X (t) — X (0) 2 Y(t) — Y(0), t>0. For X, 
Ye 2 and a, B eR, «dX + BdY is defined by 
dok +BY) and dX - dY by d(My, My». Let d2 
be the totality of stochastic differentials of 
elements in 2 and dæ and d¥ be that of 
elements in ./** and ¥"', respectively. d2 is 
a commutative algebra under the operations 
just introduced. Note that dX -dYed¥ and 
that dX - dY =0 if either of dX and dY is in 
di In particular, dX -dY-dZ=0 for every 
dX, dY, and dZ. Let 4 be the totality of {F}- 
progressively measurable processes ® =(®(t)) 
‘such that, with probability 1, supo... |d(s)] 
« oo for every tz 0. Noting that Bc *(M) 
for any M e M, we define 6 - dX ed2 for 
PeZ and X €2 to be the stochastic differ- 
ential of the semimartingale fọ $(s) dM ,(s) 
fo b(s)dV,(s). ®- dX is uniquely determined 
by ® and dX. Itó's formula is stated, in this 
context, as follows: For X -(X!,..., X4), X'e 
2, and F:R^9R, which is of class C?, F(X)e 
2, and 


dF(X)= S D,F(X): dX! 


La SES 
E Y, D,DBjF(X) dX dX’. 
i j=1 


We now define another important operation 
on the space d2. Noting that 2c 2, we define 
X odY for X, Ye 2 by 


1 
XodY=X-d¥+7dX dy. 


This is uniquely determined from X and dY, 
and is called the symmetric multiplication of X 
and dY. It is also called a stochastic differential 
of the Stratonovich type or Itó's circle operation 
since the notation was introduced by Itó [9]. 
fo X odY is called the stochastic integral of the 
Stratonovich type, whereas fọ X -dY is that of 
the Itó type. Under this operation, Itó's for- 
mula is rewritten as follows: For X —(X!,..., 
X$), Xie 2, and F:R?5R, which is of class C?, 
F(X), D;F(X)e 2, and 


D,F(X)odX!. 


| 
Ms 


dF(X) 


This chain rule for stochastic differentials takes 
the same form as in the ordinary calculus. For 
this reason symmetric multiplication plays an 
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important role in transferring notions used in 
ordinary calculus into stochastic calculus and 
in defining intrinsic (De, coordinate-free) no- 
tions probabilistically. In particular, it is fun- 
damental to the study of stochastic differential 
equations on manifolds (— Section G). 


D. Stochastic Differential Equations 


Here, we give a general formulation of stochas- 
tic differential equations in which the infini- 
tesimal change of the system may depend on 
the past history of the system; however, equa- 
tions of Markovian type, in which the infini- 
tesimal change of the system depends only 
on the present state of the system, are consid- 
ered in most cases. Let W be the space of d- 
dimensional continuous paths: W4 = C([0, oc) 
—R‘):= the totality of all continuous functions 
w:[0, oo) Ra, endowed with the topology of 
the uniform convergence on finite intervals 
and Z(W^) be the topological o-field. For each 
t 2 0, define p,: W^ W’ by (p,w)(s) 2 w(t ^s), 
and let 2,(W4)— p, ! (&(W^4), tz 0. Let off 
be the totality of functions a(t, w) — (alt, w)): 
[0, 00) x W4 o R* Q R' (:=the totality of d x 
r real matrices) such that each component 
a(x, w) ((—1,2, ...,d;j — 1,2, ...,r) is 4([0, 00) 
x B(W*)-measurable and 4,(W")-measurable 
for each fixed t 20. In general, «i(t, w) is called 
nonanticipative if it satisfies the second property 
above. An important case of ae «/^" is when it 
is given as a(t, w) — c(t, w(t)) by a Borel func- 
tion a:[0, 00) x Ro R? & R'. In this case, « is 
called independent of the past history or of 
Markovian type. For a given ae gi" and fie 
4/*!, we consider the following stochastic 
differential equation: 


(1) dX()— Y ait, X) dB/()- BAL, X) dt, 


j= 


also denoted simply as 
dX (t)=a(t, X)dB(t)+ B(t, X) dt. 


Here X(t) - (X! (t), ..., X4(t)) is a d-dimensional 
continuous process. B(t) - (B! (t), ..., B'(t)) is a 
r-dimensional Brownian motion with B(0) — 0. 
A precise formulation of equation (1) is as 
follows. X —(X (t)) is called a solution of equa- 
tion (1) if it satisfies the following conditions: 
(i) X is a d-dimensional, continuous, and (.Z]- 
adapted process defined on a probability space 
(Q, ¥, P) with an increasing family {4}, i.e., 
X:Q5 W! which is F,/B,(W*)-measurable for 
every t Z0; (ii) aj(t, X)e 2", Bt, X)e Z1", 
i=1,...,d,j=1,...,r(— Section B for the 
definition of 4°); (iii) there exists an r- 
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dimensional { ¥,!-Brownian motion B(t) with 
B(0) «0 such that the equality 


t 
9 


x'(t).— KI Y | ai(s, X) dB/(s) 
j=l 


t 
ef B(sX)ds  i=1,2,...,d, 
0 


holds with probability 1. 

Thus a solution X is always accompanied 
by a Brownian motion B. To emphasize this, 
we often call X a solution with the Brownian 
motion B or call the pair (X, B) itself a solution 
of (1). In the above definition, a solution is 
given with reference to an increasing family 
(4). The essential point is that o-fields o(B(u) 
— B(vr;uzvzt) and o(X(s), B(s); 0< s< t) are 
independent for every t: If X satisfies the con- 
ditions of solutions stated above, then the 
specified independence is obvious, and con- 
versely, if this independence is satisfied, then by 
setting F, = ( ), o o(X(s), B(s);0<s<t+e), the 
conditions of solutions stated above are satis- 
fied. But it is usually convenient to introduce 
some increasing family {.¥,} into the definition 
of solutions as above. When x and f are of the 
Markovian type, a(t, w) c(t, w(t)), BU, w)= 
b(t, w(t)), the corresponding equation 


(2 dX(t)=oa(t, X(t)) dB(t) 4 b(t, X (t))dt 


is called a stochastic differential equation of 
Markovian type. Furthermore, if c(t, x) and 
b(t, x) are independent of t, i.e., a(t, x) 2 o(x) 
and b(t, x) = b(x), the equation 


(3) dX (t)=a(X(t))dB(t) + b(X(t))dt 


is called a stochastic differential equation of 
time homogeneous (or time-independent) Mar- 
kovian type. 

Next, we define the notions of the unique- 
ness of solutions. There are two kinds of 
uniqueness: uniqueness in the sense of law (in 
distribution) and pathwise uniqueness. When 
we consider the stochastic differential equa- 
tions as a means to determine the laws of 
continuous stochastic processes, uniqueness in 
the sense of law is sufficient. If, on the other 
hand, we regard the stochastic differential 
equation as a means to define the sample 
paths of solutions as a functional of the ac- 
companying Brownian motion, i.e., if we re- 
gard the equation as a machine that pro- 
duces a solution as an output when we input a 
Brownian motion, the notion of pathwise 
uniqueness is more natural and more impor- 
tant. As we shall see, this notion is closely 
related to the notion of strong solutions. 

These notions are defined as follows. For a 
solution X =(X(t)) of (1), X(0) is called the 
initial value, its law on Ri is called the initial 
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law (distribution), and the law of X on W° is 
called the law (distribution) of X. We say that 
the uniqueness in the sense of law of solutions 
for (1) holds if the law of any solution X is 
uniquely determined by its initial law, i.e., if 
whenever X and X' are two solutions whose 
initial laws coincide, then the laws of X and 
X' coincide. In this definition, we restrict 
ourselves to the solutions whose initial values 
are nonrandom, i.e., the initial laws are ó- 
distributions at some points in R*. Next, we 
say that the pathwise uniqueness of solutions 
for (1) holds if whenever X and X' are any 
two solutions defined on the same probabil- 
ity space (Q, F, P) with the same increasing 
family {F} and the same r-dimensional 131. 
Brownian motion such that X (0) — X (0) a.s., 
then X(t)  X'(t) for all t>0 a.s. In this defi- 
nition also, the solutions can be restricted to 
those having nonrandom initial values. 

We say that equation (1) has a unique strong 
solution if there exists a function F(x, w): R^ x 
W — W° (Wo = {we W"|w(0)=0}) such that tF- 
following are true: (i) For any solution (X, ' ) 
of (1), X = F(X(0), B) holds as: (ii) for any 
R?-valued random variable X (0) and an r- 
dimensional Brownian motion B=(B(t)) with 
B(0)=0 which are mutually independent, X = 
F(X (0), B) is a solution of (1) with the Brown- 
ian motion B and the initial value X (0). If 
this is the case, F(x, w) itself is a solution of (1) 
with the initial value x, and with respect to the 
canonical Brownian motion B(t, w) — w(t) on 
the r-dimensional Wiener space (Wj, F , P), F 
is the completion of Z(W;) with respect to the 
r-dimensional Wiener measure P. If equation 
(1) has a unique strong solution, then it is clear 
that pathwise uniqueness holds. Conversely, 
if pathwise uniqueness holds for (1) and if a 
solution exists for any given initial law, then 
equation (1) has a unique strong solution, 
[6,25]. 

The existence of solutions was discussed by 
A. V. Skorokhod [20]. If the coefficients x and 
D are bounded and continuous on [0, oo) x W^, 
a solution of (1) exists for any given initial law. 
This is shown as follows [6]. We first con- 
struct approximate solutions by Cauchy's 
polygonal method and then show that their 
probability laws are ttight. A limit process in 
the sense of probability law can be shown to 
be a solution. The assumption of bounded- 
ness above can be weakened, e.g., to the fol- 
lowing condition: For every T» 0, a constant 
K,>0 exists such that 


(4) let, will + llt, wii Kra tw 
te[0, T], wei 


Here ||w||, 2 maxo <,<,|w(s)|. In the case of the 
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Markovian equation (2), it is sufficient to 
assume that c(t, x) and b(t, x) are continuous: 


(5) To, di + b(t, I< K70 +x), 


te[0, T], xeR* 


If these conditions are violated, a solution X(t) 
does not exist globally in general but exists 

up to a certain time e, called the explosion 
time, such that lim,;,|x(t)| = œ% if e < oc. To 
extend the notion of solutions in such cases, 
we have to replace the path space W^ by the 
space W^ that consists of all continuous func- 
tions w:[0, oo) 5 R! (  R"U {A} =the one-point 
compactification) satisfying w(t) 2 A for every 
tz e(w) (—inf(t|w(t) 2 Aj). 

Now, we list some results on the uniqueness 
of solutions. First consider the equations of 
the Markovian type (2), and assume that the 
coefficients are continuous and satisfy the 
condition (5). (1) If c, b are Lipschitz continu- 
ous, Le, for every N > 0 there exists a constant 
Ky such that jett, x) — o(t, y)|| + ||b(t, x) — 
b(t, y) <Kylx—yl, te[0, T], x, ye By:= 
{ze R'||z| <N}, then the pathwise uniqueness 
of solutions holds for equation (2). Thus the 
unique strong solution of (2) exists, and this 
is constructed directly by Picard's succes- 
sive approximation (Itó [7,8]). (ii) Ifd=1, o 
is Hólder continuous with exponent 1/2 and 
b is Lipschitz continuous, i.e., for every N » 0, 
Ky exists such that 


lolt, x) — e(t, y)? + [b(t, x) — b(t, y) & Ky|x — yl, 


te[0, N], x, ye By, 


then the pathwise uniqueness of solutions 
holds for equation (2) (T. Yamada and 
Watanabe [25]). (iii) If the matrix a(t, x)= 
a(t, x)o(t, x)* (i.e., a"(t, x)= Ez- cilt, leit, x) 
is strictly positive definite, then the unique- 
ness in the sense of law of the solution for (2) 
holds (D. W. Stroock and S. R. S. Varadhan 
[21]). (iv) An example of stochastic differential 
equations for which the uniqueness in the 
sense of law holds but the pathwise uniqueness 
does not hold was given by H. Tanaka as 
follows: d ^r — 1, b(t, x) &0 and ott, x)= 1,9 
— Í: ¿<o Another example in the non- 
Markovian cases was given by B. S. Tsirel'son 
(see below). 

Next, consider non-Markovian equations of 
the following form: 


(60 dX(t)=dB(t)+ p(t, X)dt; 


i.e., the case d =r and a(t, w)— I (identity ma- 
trix). Assume further that fj e .«/^' is bounded. 
Then a solution of (6) exists for any given 
initial distribution, unique in the sense of law, 
and it can be constructed by the Girsanov 
transformation of Section B as follows. On a 
suitable probability space (Q, 7 , P) with an 
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increasing family {F} such that F =\/,.94, 
we set up an ,-measurable, d-dimensional 
random variable X (0) with a given law and 

a d-dimensional { ¥,}-Brownian motion B 

— (B(t)) such that B(0) — 0. Set X(t) =X (0) 

+ B(t) and M (t) 2 exp[fo B(s, X) aB(s) — 

1 folB(s, X)|? ds]. Then M(t) is an (.Z]- 
martingale, and the probability P on (Q, F) is 
determined by P(A)= E(M,; A), Ae F,. By 
Girsanov's theorem, B(t) = X(t) — X(0)— 

fo B(s, X)ds is a d-dimensional {.¥,}-Brownian 
motion on (Q, £, P), and hence (X. B) isa 
solution of (6). Any solution is given in this 
way and hence the uniqueness in the sense of 
law holds. But the pathwise uniqueness does 
not hold in general; an example was given 

by Tsirel'son [1,6] as follows. Let {t,} bea 
sequence such that 0 €... «t, «t, ,<to=1 
and Jm, t, —0. Set 


0, tztg and t—0, 


B(t, w) dl 
, Wl ' 


lita — dE 





telt tj), i=0, AEEA 


where 0(x) 2 x — [x], xe R, is the decimal part 
of x. 

Time changes (— Section B) are also used to 
solve some stochastic differential equations 


[6]. 


E. Stochastic Differential Equations and 
Diffusion Processes 


In this section we consider equations of time- 
independent Markovian type (3) only. The 
time-dependent case can be reduced to the 
time-independent case by adding one more 
component X*+! (t) such that d X^ ! (t) — dt. 
Further, we assume that coefficients o(x)e 
R^ & R’ and b(x)e R are continuous on Ri 
and the uniqueness in the sense of law of solu- 
tions holds. Let P,, x e R2, be the law on W^, 
or on W^? if there is an explosion, of a solution 
with the initial law 6, (=the unit raeasure at 
x). Then {P,} possesses the tstrong Markov 
property with respect to {F}, where 7, is a 
suitable completion of Z,(W4) or &(W4), and 
hence (W^, {F}, P.) or (W4,{F}, P.) is a diffu- 
sion process *on R^(— 115 Diffusion Pro- 
cesses, 261 Markov Processes). 
Let A be the differential operator 

pou. wë d e 

A E Y, a?(x)DDj- Y b(x)D, (D; 0/0x?) 
i, j=l i-l 

with the domain C2(R?) (= the totality of C?- 
functions on R^ with compact supports), where 
a(x) Ez- oi(x)oj(x). By Itó's formula, 


t 


(7) fwo swo- | (Af wis) ds 


0 
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is an { ¥,}-martingale for every Te C2(R?) (we 
set f(A) —0), and this property characterizes 
the diffusion process. The diffusion is gen- 
erated by the operator A in this sense. Fur- 
thermore, if for some 47 0, (4 — A)(C2(R4)) 
is a dense subset of C,, (R°) (=the totality 
of continuous functions f on R? such that 
lim, f(x) =0) then the ttransition semi- 
group of the diffusion is a tFeller semigroup 
on C, (R^), and its infinitesimal generator A is 
the closure of (A, C2(R^)). Hence u(t, x)= 
E.Lf(w(t))]. f € C2(R?), is the unique solu- 
tion of the evolution equation du/dt — Au, 
u|,-o =f. Generally, if the coefficients c and 
b are sufficiently smooth, we can show, by 
using the stochastic differential equation (3), 
that u(t, x) is also smooth for a smooth f 
and satisfies the heat equation 0u/ét = Au. 
Taking the expectations in (7), we have the re- 
lation E,[ f(w(t))] — f(x) - fo E.LAf(w(s))] ds. 
which implies that the transition probability 
P(t, x, dy) of the diffusion satisfies the equation 
Op/6t = A* p in (t, y) in a weak sense, where A* 
is the adjoint operator of A. If 0/0t — A* is 
*hypoelliptic, we can conclude that P(t, x, dy) 
possesses a smooth density p(t, x, y) by appeal- 
ing to the theory of partial differential equa- 
tions. Recently, P. Malliavin showed that a 
probabilistic method based on the stochastic 
differential equations can also be applied to 
this problem effectively, [6, 16, 17]. 

If c(t, x) is continuous and v(t, x) is suffi- 
ciently smooth in (t, x) on [0, oo) x Ri. then the 
following fact, more general than (7), holds: 


(8) v(t, vno | c(s, vn) — v(O, x) 


0 


— | e | c (u, vidi. 
0 0 


x (6v/0t 4- (A + c)v)(s, w(s)) ds 


is a local martingale De, € .//**) with respect 
to {FZ Pl By applying the toptional sampling 
theorem to (8) for a class of 1.Z,|-stopping 
times, we can obtain the probabilistic repre- 
sentation in terms of the diffusion of solutions 
for initial or boundary value problems related 
to the operator A [3,4]. 


F. Stochastic Differential Equations with 
Boundary Conditions 


As we saw in the previous section, diffusion 
processes generated by differential operators 
can be constructed by stochastic differential 
equations. A diffusion process on a domain 
with boundary is generated by a differential 
operator that describes the behavior of the 
process inside the domain, and a boundary 
condition that describes the behavior of the 
process on the boundary of the domain. For 
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example, consider a reflecting ‘Brownian mo- 
tion on the half-line [0, oo). This is a diffusion 
process X =(X,) on [0, oo) obtained by set- 
ting X,=|x,| from a 1-dimensional Brownian 
motion x,. The corresponding differential 
operator is A =4d?/dx?, and the boundary 
condition is Lu = du/dx | „=o — 0, that is, the 
transition expectation u(t, x)= E,[ f(X,)] is 
determined by éu/ét = Au, Lu —0, and ul, 

= f. In constructing such diffusion processes 
with boundary conditions, stochastic differen- 
tial equations can be used effectively. In the 
case of reflecting Brownian motion, it was 
formulated by Skorokhod in the form 


(9) dX (t)=dB(t)+doi(t). 


Here B(t) is a 1-dimensional Brownian mo- 
tion (B(0) « 0), X (t) is a continuous process 
such that X (t) 20, and oi has the following 
property with probability 1: @(0)=0, t5 

y(t) is continuous and nondecreasing and 
increases only on such t that X (t) - 0, i.e., 

fofi (X G)do(s)— o(t). Given a Brownian 
motion B(t) and a nonnegative random vari- 
able X(0) which are mutually independent, 

X (t) satisfying (9) and with the initial value 
X (0) is unique and given by X (t) - X (0) - B(t), 
t «oy —min(t| X(0)+ B(r) 20) and X(t)= B(t) 
—min,, en B(S), tz ga (P. Lévy, Skorokhod; 
— [6, 18]). 

In the case of multidimensional processes, 
possible boundary conditions were determinea 
by A. D. Venttsel’ [24]. Stochastic differential 
equations describing these diffusions were for- 
mulated by N. Ikeda [5] in the 2-dimensional 
case and by Watanabe [23] in the general 
case as follows. Let D be the upper half-space 
R4.={x=(x!,...,.x9|x?20}, AD ={x|x4=0}, 
and D ={x|x4>0}. The general case can be 
reduced, at least locally, to this case. Suppose 
that the following system of functions is 
given: (x): DOR! x R”, b(x): D> R^, t(x): 0D 
RI Is RS, f(x): 0Do R^ !, and p(x):6éD> 
[0, oc), which are all bounded and con- 
tinuous. Consider the following stochastic 
differential equation: 

C 


dX'(t) = Y, of X() [p(X (0) dBA(O) 


-. 
tl 


+ b'(X (0)1g(X (0) dt 
+ E i(X() ap X (0) aM*(2) 
k=1 


+ f'QX (0) d (0), 


(10)4 


r 


dX*(t) 3. aXX (0) 4 Bi) 


j=1 
 b"(X (t)) p(X (t)) dt + do(t), 
LX (0)dt = p(X(O)de(o. 
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By a solution of this equation, we mean a 
system of continuous semimartingales X — 

(X (t), B(t), M(t), o(t)) over a probability space 
(Q, F, P) with an increasing family {F,} satis- 
fying the following conditions: (i) X (t) 2 (X! (t), 
..., X*(t)) is D-valued, i.e., X4(t) 20; (ii) with 
probability 1, (0) 20, t o(t) is nondecreas- 
ing, and f$ I55(X (s) de(s) = g(t); (iii) B(t) and 
M(t) are r-dimensional and s-dimensional 
systems of elements in .#'°, respectively, 

such that (B, B», 2 t, (B', M"5,—0, and 
«M™, M'5, — Ó"" (t), i, jl, ...,r m, n= L...,s, 
and finally (iv) the stochastic differentials of 
these semimartingales satisfy (10). 

The processes B(t), M(t), and (t) are sub- 
sidiary, and the process X (t) itself is often 
called a solution. We say that the uniqueness 
of solution holds if the law of X — (X (t)) is 
uniquely determined from the law of X(0). As 
before, the existence and the uniqueness of 
solutions imply that solutions define a diffu- 
sion process on D, and these are guaranteed 
if, for example, mm. a^ *(x) — 0 and o, b, 

t, P are Lipschitz continuous, [6, 23]. Here, 
we set a?(x) 2 Et- oj(x)o](x) and a?(x) — 
Mia ti(x)t{(x). It is a diffusion process gener- 
ated by the differential operator 

| x d 
A E Y, a*(x)D,D;4- 2, b'(x)D,, 


i fl 


and by the Venttsel' boundary condition, 


Lu(x)= b x G)D,Djuix) - Y. B#(x)D,u(x) 


NI = 


+ D,u(x)— p(x)(Au)(x)=0 on 6D. 


G. Stochastic Differential Equations on 
Manifolds 


Let M be a connected o-compact C*-manifold 
of dimension d, and let W,, = C([0, 60) M) be 
the space of all continuous paths in M. If M 
is not compact, let M = M U {A} be the one- 
point compactification of M and W,, be the 
space of all continuous paths in M with A as a 
Trap, These path spaces are endowed with 
the c-fields Z(W,,) and Z(W,,), respectively, 
which are generated by Borel cylinder sets. 
By a continuous process on M we mean a 
(Wu, B(W,,))-valued random variable, and by 
a continuous process on M admitting explo- 
sions we mean a (W,,, Z(W,,))-valued ran- 
dom variable. In this section the probability 
space is taken to be the r-dimensional Wiener 
space (Wj, F, P) with the increasing family 
LF}, where F is generated by 2,(W;) and 
P-null sets. Then w=(w(t)), we W3, is an r- 
dimensional { F,\-Brownian motion. 

Suppose that we are given a system of C*- 
vector fields Ay, A,, ..., 4, on M. We consider 
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the following stochastic differential equation 
on M: 


(11) dX,=A,(X,)odw*(t)+ AS(X,) dt. 


(Here, the usual convention for the omission of 
the summation sign is used.) A precise mean- 
ing of equation (11) is as follows: We say that 
X =(X,) satisfies equation (11) if X is an {F}- 
adapted continuous process on M admitting 
explosions such that, for any C*-function f 

on M with compact support (we set f(A) — 

0), f(X;) is a continuous semimartingale 
satisfying 


02) df(X) 2(ALF)(X)) o dw" (t) + (Ao f )(X.) dt, 


where o is Itó's circle operation defined in 
Section C. This is equivalent to saying that 
X,=(X/},..., X2), in each local coordinate, is 
a d-dimensional semimartingale such that 


(13) dXi2oji(X)odw'(t) - b(X,)dt 
= o, (X) dw*(t) 


1g "M ; 
dÉ y Disi +b |a 
KH 


where A,(x) =o, (x) D;, k=1, 2,..., r, and Ag(x) 
= b'(x)D,. By solving the equation in each 
local coordinate and then putting these solu- 
tions together, we can obtain for each xe M 
a unique solution X, of (11) such that X, = 

x. We can also embed the manifold M in a 
higher-dimensional Euclidean space and solve 
the stochastic differential equation there. We 
denote the solution by X (t, x, w). The law P, 
on Wy of [t> X (t, x, w)] defines a diffusion 
process on M which is generated by the dif- 
ferential operator 4 —31Y., A2 + Apo. 

Next, if we consider the mapping x^ 
X (t, x, w); then, except for w belonging to a 
set of P-measure 0, the following is valid: For 
all (t, w) such that X (t, x9, w)e M, the mapping 
x X (t, x, w) is a diffeomorphism between a 
neighborhood of x, and a neighborhood of 
X (t, X9, w). This is based on the following fact 
for stochastic differential equations on R’. If 
in equation (3) the coefficients oj and b! are 
C”-functions with bounded derivatives of all 
orders a, |x| 2 1, then, denoting by X(t, x, w) 
the solution such that X (0) =x, we have that 
x X (t, x, w) is, with probability 1, a diffeo- 
morphism of Ri for all t [13]. 

Example 1: Stochastic moving frame [6, 15]. 
Let M be a Riemannian manifold of dimen- 
sion d, O(M) be the orthonormal frame bundle 
over M, and L,, L3, ..., La be the basic vector 
fields on O(M), that is, 
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where e — (e,, ...,e,) is an orthonormal basis in 
T.(M), x, 2 Exp(te,), i.e., the geodesic such that 
Xo — x and X =e,, and e, is the parallel translate 
of e along x,. Let b be a vector field on M and 
L, be its horizontal lift on O(M), i.e., L, isa 
vector field on O(M) determined by the follow- 
ing two properties: (i) L, is horizontal and (ii) 
dn(L,) — b, where n: O(M)— M is the projec- 
tion. Consider the following stochastic differ- 
ential equation on O(M): 


dr(t) — L,(r(t))o dw'(t)+ L(r(t)) dt. 


Solutions determine a family of (local) diffeo- 
morphisms r— rí(t, r, w) - (X (t, r, w), e(t, r, w)) 
on O(M). The law of [t X (t, r, w)] depends 
only on x ^ z(r), and it defines a diffusion 
process on M that is generated by the dif- 
ferential operator 1A,, +b (Ay, is the Laplace- 
Beltrami operator). Using this stochastic mov- 
ing frame r(t, r, w), we can realize a stochastic 
parallel translation of tensor fields along the 
paths of Brownian motion on M (a diffusion 
generated by 4A,,) that was first introduced 
by Itó [10], and by using it we can treat heat 
equations for tensor fields by means of a prob- 
abilistic method. 

Example 2: Brownian motion on Lie groups. 
Let G be a Lie group. A stochastic process 
{g(t)} on G is called a right-invariant Brownian 
motion if it satisfies the following conditions: 
(i) With probability 1, g(0)=e (the identity), 
and t—g(t) is continuous; (ii) for every t Ès, 
g(t)g(s) ! and e(g(u); u < s) are independent; 
and (iii) for every £2 s, g(t)g(s) ! and g(t— s) 
are equally distributed. 

Let Ao, A,, ..., A, be a system of right- 
invariant vector fields on G, and consider the 
stochastic differential equation 


(14) dg,— A;(g) o dw'(t) + Ao(g,) dt. 


Then a solution of (14) with gu =e exists 
uniquely and globally; we denote this solution 
by g°(t, w). It is a right-invariant Brownian 
motion G, and conversely, every right-invariant 
Brownian motion can be obtained in this way. 
The system of diffeomorphisms o  g(t, g, w) 
defined by the solutions of (14) is given by 
g-»g(t, g, w)  g? (t, w)g. 

Generally, if M is a compact manifold, the 
system of diffeomorphisms g,: x X (t, x, w) 
defined by equation (11) can be considered as 
a right-invariant Brownian motion on the 
infinite-dimensional Lie group consisting of all 
diffeomorphisms of M [2]. 
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A. Definitions 


The theory of stochastic processes was origi- 
nally involved with forming mathematiçal 
models of phenomena whose development in 
time obeys probabilistic laws. Given a basic 
*probability space (Q, 8, P) and a set T of real 
numbers, a family {X,},er of real-valued tran- 
dom variables defined on (Q, B, P) is called a 
stochastic process (or simply process) over 

(Q, B, P), where t is usually called the time 
parameter of the process. For each finite r-set 
Ir, the "joint distribution of (X, , ..., 
X, ) is called a finite-dimensional distribution 
of the process {X,} er- Stochastic processes 
are classified into large groups such as taddi- 
tive processes (or processes with independent 
increments), *Markov processes, Markov 
chains, ‘diffusion processes, ‘Gaussian pro- 
cesses, ‘stationary processes, ‘martingales, and 
*branching processes, according to the prop- 
erties of their finite-dimensional distributions. 
This classification is possible because of the 
following fact, a consequence of Kolmo- 
gorov’s textension theorem (— 341 Proba- 
bility Measures I): Given a system # of finite- 
dimensional distributions satisfying certain 
‘consistency conditions, we can construct a 
suitable probability measure on the space 

W — R' of real-valued functions on T so that 
the stochastic process {X,',.7, obtained by 
setting X,(w)=the value of we W at t, has 

P as its system of finite-dimensional distri- 
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butions. Now, consider two stochastic pro- 
cesses 4 = { X, her and Y={Y,},.7. Y is called 
a modification of X if they are defined over a 
common probability space (Q, $8, P) and P(X, 
= Y)- 1 (te T). Regardless of whether 2 and 
W are defined over a common probability 
space or over different probability spaces, X 
and Y are said to be equivalent or each is said 
to be a version of the other if their finite- 
dimensional distributions are the same. Ac- 
cording to Kolmogorov's extension theorem, 
every stochastic process has a version over the 
space W — R*. 

The function X (o) of t obtained by fixing œ 
in a stochastic process {X,},.7 is called the 
sample function (sample process or path) corre- 
sponding to c. In applying various operations 
to stochastic processes and studying detailed 
properties of stochastic processes, such as 
continuity of sample functions, the notions of 
measurability and separability play important 
roles. We assume that T is an interval in the 
real line, and (if needed) that the probability 
measure P is tcomplete. Denote by & the class 
of all *Borel subsets of T. A stochastic process 
( Xi]. is said to be measurable if the function 
X (c) of (t, c) is Ẹ x B-measurable. Continuity 
in probability defined in the next paragraph 
gives a sufficient condition for a stochastic 
process to have a measurable modification. 

A stochastic process {X,},.7 is said to be sepa- 
rable if there exists a countable subset S of T 
such that 


Plo 


lim inf X,(o) < X,(o) 


sot.seS 





<lim sup X,(o) 


sot, seS 


for any te r}= L. 


It was proved by J. L. Doob that every sto- 
chastic process has a separable modification 
[6]. 

Various types of continuity are considered 
for stochastic processes. {X,}er is said to be 
continuous in probability at se T if P(| X, — X,| 
»£)20 (ts, te T) for each c£» 0; it is said to 
be continuous in the mean (of order 1) at se T if 
E(| X, 2 X,|) 50 (ts, te T). Continuity in the 
mean of order p ( —1) is defined similarly. Con- 
tinuity in the mean of any order iraplies con- 
tinuity in probability. Suppose that {X,}er is 
separable. Then 


n.- lo 


are measurable events. If P(D,)>0, then se T is 
called a fixed point of discontinuity. The con- 
dition P(| JL D) D means that almost all 
sample functions are continuous. Regularity 
properties of sample functions of processes, 
such as continuity or right continuity, have 


tos,te e 
R se? 





X (@)= im, al and (] D, 
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been studied by many people. The following 
theorem is due to A. N. Kolmogorov: Let T= 
[0, 1]. If 


E(|X,—X,)")<clt—s|""* 


for constants y >0, ¢>0, and c>0, then 
{Xer has a modification {X,} er for which 
almost all sample functions are continuous, 
and 


D e v = 
dr h SE IX—X, -0|- 1 
t,seT 
for any ô(0 <ô <é/y). Each of the following is a 
sufficient condition for 4 ={X;,},.7 to havea 
modification for which almost all sample paths 
are right continuous functions with left limits. 
(1) 4 is an additive process which is continuous 
in probability (P. Lévy [3,4], K. Itó [8]; ^ 5 
Additive Processes B). (ti) X is a supermartin- 
gale that is continuous in probability (Doob 
[6]; — 262 Martingales C). 


B. Increasing Families of o-Algebras 


In the investigation of stochastic processes 
(especially Markov processes, martingales, 
and stochastic differential equations), the 
notion of increasing families of c-algebras 
often plays an important role. Let (Q, 8, P) 

be a probability space, and let T=[0, œ). A 
family (38,),. of e-subalgebras of $ is called 
an increasing family of c-algebras on (Q, 8, P) 
if $8. cB, for s « t. A process | X,],. is said to 
be adapted to {%,} if X, is 8,- measurable for 
each tc T. {X,} is said to be progressively 
measurable (or a progressive process) with 
respect to {%,} if for every te T the mapping 
(s, c0) X,(o) of [0, t] x Q into R is measurable 
with respect to the o-field 8([0, t]) x B,. A 
process {X,} with right continuous paths, 
adapted to {%,}, is progressively measurable 
with respect to {8,}. The same conclusion 
holds for a process with left continuous paths. 
A subset A of [0, oo) x Q is said to be progres- 
sive if the indicator process a(c) — 1 4(t, ol of A 
is a progressive process. A random time t on 
Q with values in [0, oo] is called a stopping 
time (or Markov time) if {t <1} €98, for all t> 
0. Constants (> 0) are stopping times. If a 
and t are stopping times, then min(o, t) and 
max(c, 1) are also stopping times. The limit of 
an increasing sequence of stopping times is a 
stopping time, while the limit of a decreasing 
sequence of stopping times is a stopping time 
with respect to {B,,}, where B,, = ( ..,95,. 
Let 3B, be the class of AeB such that AN 
(reien, ("te Ty; then it is a c-algebra if x is 
a stopping time. If | X,} is a progressive pro- 
cess and if t is a stopping time, then XI. 
is 8,- measurable. An increasing family 199. 
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of g-algebras on (Q, 8, P) is said to be com- 
plete if the probability space (Q, B, P) is com- 
plete and if all the P-negligible sets belong 

to By. Fron now on we assume that (38,] is 
complete and right continuous (e, B, = 8, , 
for all t z 0). Let B be a subset of R and {X,} 
be a process. We call tg=inf{t>0|X,(w)e B] a 
hitting time for B. Measurability of t, is not 
always guaranteed, that is, ze is not always a 
stopping time. G. A. Hunt showed that for a 
wide class of Markov processes hitting times 
for analytic sets are stopping times. This result 
Is based on a theorem of G. Choquet on capa- 
citability and was generalized by P. A. Meyer 
as follows: (1) For every progressively mea- 
surable process, hitting times for analytic sets 
are stopping times; and (i1) for every progres- 
sive set A, D, — infit 2 O|[(t,c)e A] is a stop- 
ping time. The following notions on measur- 
ability are also important. The predictable c- 
algebra on [0, oo) x Q, denoted by Y, is defined 
to be the least c-algebra on TO. oc) x Q with 
respect to which every process X,(q) that is 
adapted to {%8,} and has left continuous paths 
is measurable in (t, œ). The well-measurable or 
optional c-algebra on [0, oc) x Q, denoted by 
C, is defined to be the least c-algebra on [0, oc) 
x Q with respect to which every process X,(«) 
that is adapted to (3B,] and has right continu- 
ous paths with left limits is measurable in (t, œ). 
A process { X,} defined on Q is said to be pre- 
dictable (resp. well-measurable or optional) if 
the function (t, o) X,(@) or [0, oo) x Q is 
measurable with respect to the predictable o- 
algebra 2 (resp. the optional o-algebra ©). For 
further information regarding the notions 
given in this section — [10]. 

Up to this point it was assumed that the 
space in which a process {X,},.7 takes values, 
namely, the state space of {X,},.7, is a set of 
real numbers; but in general, topological 
spaces or merely measurable spaces can be 
taken for the state spaces of stochastic pro- 
cesses. The general definitions and results 
already given can be extended to stochastic 
processes whose state spaces are *locally com- 
pact Hausdorff spaces satisfying the second 
*countability axiom. 

Moreover, the time parameter set T ofa 
process {X,},-7 need not be a set of real num- 
bers. For example, P. Lévy [12] and H. P. 
McKean [13] investigated stochastic processes 
with several-dimensional time; such processes 
are sometimes called random fields. The case 
in which T is Z, 4, or in general a space of 
functions (which is nuclear) has also been 
investigated (— Section C). A probabilistic 
formulation of equilibrium states given by R. 
L. Dobrushin [14] initiated recent probabilis- 
tic study of statistical mechanics. For further 
information concerning processes with general 
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time parameter spaces — 136 Ergodic Theory, 
176 Gaussian Processes, 340 Probabilistic 
Methods in Statistical Mechanics. 


C. Random Distributions and Generalized 
Stochastic Processes 


The investigation of random distributions was 
initiated by I. M. Gel'fand [15] and Itó [17]. 
Denote by. 2 the space of functions of t (—oo 
«t « oo) of class C? with compact *support, 
and by Z' the space of tdistributions. At func- 
tion X(¢g, c) of o€Q and o€4 is called a 
random distribution (or generalized stochastic 
process) if X (o, œ) is a distribution as a func- 
tion of o for almost all œ and is measurable as 
a function of c for each fixed o. Denote by 
$8(2") the smallest c-algebra containing sets of 
the form (ye Z'| y(qo)e Ei (me, E is a Borel 
set). A random distribution is nothing but a 
£'-valued random variable. For a random 
distribution X (o, œ), a probability measure D, 
on $8(Z") is induced by 


®,(B)=P{o|X(-,w)eB}, BeB(F’). 


The functional 
c(q)— E(ei*t e) = Lean eg, 


is called the characteristic functional of X (q, œ) 
or Q,. The functional c(q) is continuous posi- 
tive definite, and c(0) 2 1. Conversely, given a 
functional c(q) with these properties, a theo- 
rem of R. A. Minlos (— 341 Probability 
Measures J) states that there exists a unique 
probability measure ® on 8(2") whose charac- 
teristic functional equals c(q). In other words, 
a random distribution with the character- 

istic functional c(q) can be constructed over 
(2/', B(F'), D). 

Typical classes of random distributions that 
have been investigated so far are stationary 
random distributions and random distri- 
butions with independent values at every 
point. (For stationary ones — 395 Stationary 
Processes.) A random distribution X (ọ, œ) is 
called a random distribution with independent 
values at every point if 9, (t), (t) Z0 implies the 
independence of X(@,,@) and X(@,, œ), that is, 
c(p, + 95)— c(q,)c(q;). A sufficient condition 
for the functional of the form 


c(~) - exp (| flot, o'(t), .... o") 


(f is continuous and /(0, ...,0) 20) to be the 
characteristic functional of a stationary ran- 
dom distribution with independent values at 
every point is that the function exp(sf (xo, x,, 
zl of (Xo, X4, ..., x) e R* be positive defi- 
nite for each s» 0 [16]. Under this condition, 
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a concrete representation of f is known [16]. 
When Kk —0, a necessary and sufficient condi- 
tion for c(q) to be the characteristic functional 
of a stationary random distribution with 
independent values at every point is that 

exp f(x) be the characteristic function of an 
"infinitely divisible distribution. One such 
random distribution is the so-called Guassian 
white noise, namely, the distribution deriva- 
tive of Brownian motion whose characteristic 
functional is 


l 2 
orl = fioo it) 


A family {X,} peg of real-valued random 
variables indexed by 2 is called a random 
distribution in the wide sense if X, is linear in 
p, namely, X,,,,, —aX, - bX, with proba- 
bility 1 for fixed o, Y € Z, and real constants a, 
b, and if X, 0 in probability whenever o0 
in the topology of 2. (For a typical class of 
random distributions in the wide sense — 395 
Stationary Processes C.) A random distri- 
bution in the wide sense has a modification 
that is a random distribution. 

In the definition of random distributions 
(in the wide sense) one can replace the space 
£ by the space F of rapidly decreasing C^- 
functions or in general by some space of 
functions that is nuclear. For example, one 
can define random distributions as “’-valued 
random variables. 

Up to this point random distributions of 
one variable have been considered. Random 
distributions of several variables are called 
generalized random fields and have been in- 
vestigated by Itó [18], A. M. Yaglom [19], 
Gel'fand and N. Ya. Vilenkin [16], and others. 
Moreover, K. Urbanik [20,21] developed a 
theory of generalized stochastic processes 
based on G. Mikusinski’s theory instead of L. 
Schwartz's theory of distributions. 


D. Random Measure 


Let (S, %, m) be any o-finite measure space, 
and put So — (Aet |[m(A) « oo]. By virtue of 
*Kolmogorov's extension theorem, there exists 
a family {W(A)} 4.5, of real random variables 
indexed by o such that (1) for any mutually 
disjoint A}, ..., 4,€ Wo. (W(A,). .... W(A,)] is 
independent; (ii) for any A€ o, W(A) is Gauss- 
ian distributed with mean 0 and variance 
m(A); and (iii) for any A, Be o, E( W(A)W(B)) 
=m(AN B). Similarly, there exists a family 
(N(A)] 4eg, of real-valued random variables 
indexed by o such that (i) for any mutually 
disjoint A,,...,4,€ os (N(A,). ..., N(A,)} is 
independent; (ii) for any A€ o, N(A) is tPois- 
son distributed with mean m(A); and (iii) for 
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any A, Be o, E(N(A)N(B))=m(AN B). 
{W(A)} acm, and {N(A)} seq, are called a 
Gaussian random measure and a Poisson ran- 
dom measure associated with the measure 
space (S, %, m), respectively. By using these 
random measures, the theory of multiple inte- 
grals can be developed. 

By a point function p on $ we mean a map- 
ping p: D, S, where the domain D, is a count- 
able subset of (0, oo). p defines a counting 
measure N, on (0, oo) x S such that N,((0, t] x 
U)= #{seD,;s<t, p(s)e U} (t>0, UE Go). 

For a point function p and t>0, the shift point 
function ©,p is defined by De, = (se(0, oo); s + 
t€ D,j and (O,p)(s)— p(s- t). Let Tg be the 
totality of point functions on S and A(T.) be 
the smallest o-field on Hg with respect to 
which all p^ N,((o,t] x U,t>0, Ve o) 

are measurable. A point process on S is a 

(Hs, Z(IT;))- valued random variable. Then 
there exists a point process p on S such that 

(i) for any t>0, p and O,p have the same prob- 
ability law, and (ii) N, is a Poisson random 
measure associated with ((0, oo) x S, Z(0, oo) x 
BTIT,), dt x m(ds)). The point process p is called 
the stationary Poisson point process with the 
characteristic measure m. 
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A. General Remarks 


Stochastic programming is a method of finding 
optimal solutions in mathematical program- 
ming in its narrow sense (— 264 Mathematical 
Programming), when some or all coefficients 
are stochastic variables with known probabil- 
ity distributions. There are essentially two dif- 
ferent types of models in stochastic program- 
ming situations: One is chance-constrained pro- 
gramming (CCP), and the other is a two-stage 
stochastic programming (TSSP). The differ- 
ence between them depends mainly on the 
informational structure of the sequence of ob- 
servations and decisions. For simplicity, let us 
here consider stochastic linear programming, 
which is the best-known model at present. 

Let Ay, A be m x n-dimensional matrices and 
x, CER” and b, bae R”. Suppose further that 
components of A, b, c are random variables, 
while those of Ag, bo are constants. Consider 
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the following formally defined linear program- 
ming problem: min,(c'x| Ax <b, xe Xo}, Xy = 
{x| Aox € bo, x 20]. Let (Q, B, P) be a proba- 
bility space (— 342 Probability Theory) such 
that {A(q), b(c), c(œ)} is a measurable trans- 
formation on Q into R"*"*"*» 


B. Chance-Constrained Programming (CCP) 


This method is based on the assumption that 
a decision x has to be made in advance of 

the realization of the random variables. Sup- 
pose that A,(@) is the ith row of A(«), and 
bi(c) is the ith component of b(@). We call 
P(1o|A;(o)x <b(m)}) >a, a chance constraint, 
where a; is a prescribed fractional value deter- 
mined by the decision maker according to his 
attitude toward the constraint A;(c)x <b): 
if he attaches importance to it, he will take 

o; as great as possible. Defining feasible sets 
X,(a,) and X by X,(x)— (x| Pol Adel 
bio! >a}, X 2 X9 (( Y. Kall, we can 
formulate CCP as follows: min, ( F(x)| xe Xj, 
where F: X >R, is the certainty equivalent of 
the stochastic objective function c'x. We have 
four models of CCP corresponding to differ- 
ent types of F(x): (i) E-model: F(x) =@’x, c = 
E,,c(o) x. (ii) V-model: F(x) 2 Var(c(o)'x) = 
X'V.x, where V, is a variance-covariance ma- 
trix of c(o). (iti) P,-model: F(x) — f, P((c|c(oy 
<f}) za, 1/2 <4 <1. (iv) P-model: F(x)= 
P({w|c(@)'x>y}) for a given constant y. In 
particular, if the components of A(c), b(o), 
c(o) have a multidimensional normal distri- 
bution, the certainty equivalent of the ith 
chance constraint is derived in the following 
form: A;x +07! (æ) (x'Vx + 2wix + v2)? « b, 
where A. V,, wi, v; b; are expectation vectors 
or a variance-covariance matrix of A,(«) and 
bo) and O(t)= f+ exp( — 27/2) dz/ /2n. The 
set X;(o;) for this constraint can be shown to 
be convex for 1/2 x; € 1, by using the con- 
vexity of the function |/ x'Vx for a positive 
semidefinite matrix V. Under the same assump- 
tion we can obtain the objective functions 
F(x) e &'x-- 6! (æo) / x'V.x for the P,-model 
and F(x) 2 (c'x — y)/J/ x' V.x for the P,-model. 
These four models have been shown to be 
computable by applying convex programming 
techniques, including the conjugate gradient 
method. Further studies on the convexity of a 
more general chance constraint P({@|A(w)x < 
b(co)j)z a, 0x a & 1, appear in several articles. 


«x 


C. Two-Stage Stochastic Programming (TSSP) 


This method divides the decision process into 
two stages. First stage: Before the realization 
of random variables, one makes a decision x, 
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being allowed to compensate for it after the 
specification of those values. Second stage: 
One obtains an optimal compensation ye 

R' for the given x and the realized values of 
the random variables. Assuming that qe R" 

is a random vector in addition to A, b, c, we 
can formulate TSSP as follows. First stage: 
min, E, ((c(o)'x + Q(x, »)| xe ll: second 
stage: Q(x, &) 2 min, (q(o) y| Wy = A(cv)x — 
b(w), y 20], where X =X NK, K - (x| Q(x, o) 
< +œ with probability 1} and q()’y is a loss 
function for the deviation A(w)x — b(w). The 
m x n matrix W is called a compensation ma- 
trix. Several theorems have been proved: (i) K 
is a closed convex set; (it) Q(x) = E,,Q(x, œ) is a 
convex function on K if the random variables 
in A(o), b(@), q(o") are square integrable; (iii) 
if P has a density function, then Q(x) has a con- 
tinuous gradient on K; (iv) when P has a finite 
discrete probability distribution, a TSSP prob- 
lem is reduced to a linear programming prob- 
lem having a dual decomposition structure. 
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A. Examples of Structure 


Structure is a unified description of mathe- 
matical objects such as tordered sets, trings, 
tlinear spaces, *topological spaces, tprob- 
ability spaces, *manifolds, etc., using only the 
concepts tset and relation. The following are 
examples. 
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(1) Order. An *ordering on a set A is a 
binary relation in A (— 358 Relations) with 
a *graph « such that (1) if ae A, then (a, alen 
(tt) if (a, b)ex and (b, ales, then a=b; and (iit) 
if (a, b)e x and (b, c)ex, then (a, chen, where a is 
an element of the *power set (4 x A). We say 
that a determines a structure of ordering on 
the set A. 

(2) Law of composition. A law of compo- 
sition on a set A is a mapping from A x A (or 
a subset) to A. This mapping is considered as 
a ternary relation with a graph xe (Ax A 
x A), satisfying the following two conditions 
(or possibly only (11)): (1) if (a, b)e A x A, then 
(a, b, c)ex for some ce A; (ii) if (a, b, c)e x and 
(a, b, c')e x, then c — c'. We say that x deter- 
mines the structure of a law of composition 
in A. The *associative law in the law of com- 
position determined by « is given by: (iii) if 
(a, b, x)ea, (b, c, y)e x, (x, c, zea, and (a, y, z') ex, 
then z 2 z. A set with conditions (1), (ii), and 
(itt) ts called a semigroup. 

(3) Operation. An operation of a set A ona 
set B is a mapping from A x B (or a subset) to 
B. It is considered as a ternary relation with 
the graph y e *B(A x B x B), satisfying the fol- 
lowing two conditions (or possibly only (ii): (i) 
if (a, b)e A x B, then (a, b, c) e? for some ce B; 
(11) if (a, b, c)ey and (a,b, c)ey, then c =c. We 
say that y determines the structure of an oper- 
ation of A on B. Each element x of A ts called 
an operator on B; A is called a domain of 
operators on B, and B is called an A-set. When 
B is the main object of consideration, an oper- 
ation of A on B is sometimes called an external 
law of composition of A on B. The law of com- 
position of A as described in (2) is then called 
an internal law of composition of 4. When a 
domain of operators A on B determined by 
7€ (A x B x B) has an internal law of compo- 
sition determined by xe A x A x A), it is 
usually assumed that the following conditions 
on x, y are satisfied: (iti) if (a, b, x) eo, (b, c, y)e y, 
(X, c, z)e y, and (a, y, z)ey, then z =z. If we 
denote the law of composition by (a, b) Sab 
and the operation by (a, b) a: b, then con- 
dition (iii) may be written: (11) (ab): c =a; (b-c) 
for a, be A and ce B. When an A-set B with an 
external law of composition determined by 
y € (A x B x B) has an internal law of compo- 
sition determined by fe S(B x B x B), it is 
usually assumed that the following condition 
on fi, y is satisfied: (iv) if (a, b, x) e y, (a, c, y) e y, 
(b,c, z)e fl; (x, y, w)e f, and (a, z, w')ey, then w= 
w'. According to the notation ab and a: b, it 
is described as: (1v) (a: b) (a: c) 2 a: (bc) for ae A 
and b, ce B. 

The mapping A x Bo B(B x A— B) is called 
a left (right) operation of A on B. To emphasize 
leftness or rightness, "left-" or "right-" is at- 
tached to corresponding concepts. 
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(4) Topology. A *topology on a set A is 
determined by a set xe SA), called the *sys- 
tem of open sets, satisfying the following 
conditions: (i) @ea and Aex; (ii) if Boa, then 
|] Bea; and (iii) if f <a is finite, then (^) fiea. 
We say that « defines the structure of a topol- 
ogy on the set A (— 425 Topological Spaces). 


B. Mathematical Structures 


We now explain the concept of mathematical 
structure for the case of a *linear space (— 256 
Linear Spaces). A linear space has two basic 
sets, one of which is a set K of elements called 
scalars and the other, a set V of elements called 
*vectors, two laws of compositions in K called 
addition and multiplication, a law of compo- 
sition in l/called addition, and an operation of 
K on Vcalled scalar multiplication. The laws 
of composition and the operation are given by 
elements of power sets: x,, x, € B(K x K x K), 
x4 € (Vx Vx V), and xe 3B(K x Vx V); and 
the basic properties of the linear space, such as 
H(a+b)=4a+ 4b (LEK, a, be V), are described 
as propositions on K, V, %4, ...,&%4 and denoted 
by PORK, Vo ota ess 

Up to now, we have been considering a 
given linear space. To give a description of a 
linear space in general, we use the symbols 
X4, X5, 61, ..., £4 instead of the symbols K, 
Ha, Ga, replace conditions such as a, € 
BK x Kx K),... by & eX, x X, x X,),..., 
and consider the set 2 of these symbols and 
formulas: 


e E EE 
č eX, x X, Xx Xik..., 
e € P(X, x X4 x X5). 


The set 2 is called the type of linear space. 
Similarly, we consider the set F of all 
P(X,, X5, 5, ..., 54)... corresponding to the 
basic properties P(K, V,a,,...,%4),... of the 
linear space. The set F is called the axiom 
system of the linear space. 

In general, let 4,,..., Ám be the basic sets 
(K and V in the preceding example). The basic 
concepts 2,,..., x, (X1, ---, 44 in the preceding 
example) are given as elements of finitely 
generated sets from A,,...,A,,, i.e., elements 
of sets obtained by a finite number of ap- 
plications of the operations of forming the 
*Cartesian product and the tpower set from 
A,,...,A,,. Basic properties are given as pro- 
positions on A4,, ..., Am 9,, ..., Dn, These basic 
properties and A4,,...,4,, X1, ..., X, determine 
a mathematical system. We consider also a 
type X of symbols X,,...,X,, of basic sets and 
symbols £,, ..., č, of basic concepts, and an 
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axiom system Į of basic properties. The pair 
(I) determines a mathematical struc- 

ture. When we substitute sets A,,...,A,, for 
Ke, and 7, 5005 Xn for £C, where 
X,,..., Xm and dn, Č, satisfy the axiom 
system T, then (A,,..., Am X1, -+3 Xn) is called a 
mathematical system with the mathematical 
structure (Z, I), or a model of the structure 
(X, I). Two mathematical systems are called 
similar or of the same kind if they have the 
same mathematical structure. Groups, rings, 
topological spaces, etc., are mathematical 
structures. Mathematical systems are some- 
times called algebraic systems in the wider 
sense, and when we consider mainly their laws 
of composition (and operations), we call them 
algebraic systems. We explain this in further 
detail in Section C. 


C. Algebraic Systems 


Algebraic systems are sets with laws of compo- 
sition and operations satisfying certain axiom 
systems; the laws of composition and oper- 
ations and the axiom systems they satisfy 
determine their type (— 2 Abelian Groups, 29 
Associative Algebras, 42 Boolean Algebra, 67 
Commutative Rings, 149 Fields, 151 Finite 
Groups, 190 Groups, 231 Jorden Algebras, 248 
Lie Algebras, 368 Rings). Each algebraic sys- 
tem has its own theory, but general properties 
and related concepts are dealt with from a 
common standpoint. From this common 
ground we often get an insight into concepts 
from which arose a general theory of math- 
ematical systems. We describe here only al- 
gebraic systems, but it is possible to describe 
similar concepts for mathematical systems. 
The following is a description based mainly on 
one law of composition (a, b) aab; a similar 
description is possible for the case of two or 
more laws of composition. 

The law of composition ab is sometimes 
written a+b, a'b, [a,b], etc. A mapping f: A> 
A’ of similar algebraic systems A and A’ is 
called a homomorphism provided that f (ab) = 
f (a) f (b) (a, be A). A’ is said to be homomor- 
phic to A if there is homomorphism from A 
onto A’. If f is one-to-one, onto, and its in- 
verse mapping f ' A’—A is also a homomor- 
phism, then f is called an isomorphism, A' is 
said to be isomorphic to A, and the relation is 
written A = A’. The composition of homomor- 
phisms is a homomorphism, and the identity 
mapping is an isomorphism. A homomor- 
phism of A to A itself is called an endomor- 
phism, and if it 1s also an isomorphism, then 
it is an automorphism. The set of all endo- 
morphisms of A forms a 'semigroup under 
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composition, and the set of automorphisms 
forms a group under composition. The con- 
cept of homomorphism is a fundamental 
concept appearing in all algebraic systems. A 
homomorphism is sometimes called a 
representation. 

An element e of an algebraic system A with 
a law of composition ab is called an identity 
element if ae = ea =a (for all ae A). If such an 
element exists, then it is unique. In the case of 
a ring, two laws of composition, addition and 
multiplication, are given. In this case the iden- 
tity element for multiplication (if it exists) is 
called the identity element (or tunity element) 
of the ring. In the case of homomorphism 
between groups, the identity element is map- 
ped to the identity element, but this does 
not always hold in general algebraic systems. 
Since the identity element plays an important 
role, it is frequently added to the basic con- 
cepts. Homomorphism between mathematical 
systems is generally defined to induce a map- 
ping between basic concepts. A semigroup and 
a ring with a unity element are called a unitary 
semigroup (monoid) and a unitary ring, respec- 
tively, and homomorphisms between these 
systems are restricted to mappings that map 
the identity element to the identity element. 

Let A and A' be similar algebraic systems, 
and let A be a subset of A’. A is called a sub- 
system of A’ if the mapping f: A A’ defined 
by f(a) - a(ae A) is a homomorphism. A sub- 
system of a group (ring) is called a *subgroup 
(*subring), and similarly for other algebraic 
Systems. 

An fequivalence relation R in an alge- 
braic system A is called compatible with the 
law of composition if R(a, a’) and R(b, b’) 
imply R(ab, a'b’). Consider the *quotient set 
A/R. Then the law of composition in A/R is 
uniquely determined so that the mapping 


f: A—AJR defined by ae f(a) is a homomor- 


phism. The algebraic system thus obtained 
is called a quotient system. A quotient sys- 
tem of a group (ring) is a group (ring), called 
a ‘quotient group (tquotient ring). Other 
cases, including those where operations are 
given, are treated similarly (— 52 Categories 
and Functors). 
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A. The Notion of a Surface 


The notion of a surface may be roughly ex- 
pressed by saying that by moving a curve we 
get a surface or that the boundary of a solid 
body is a surface. But these propositions can- 
not be considered mathematical definitions of 
a surface. We also make a distinction between 
surfaces and planes in ordinary language, 
where we mean by surfaces only those that are 
not planes. In mathematical language, how- 
ever, planes are usually included among the 
surfaces. 

A surface can be defined as a 2-dimensional 
*continuum, in accordance with the definition 
of a curve as a 1-dimensional continuum. 
However, while we have a theory of curves 
based on this definition, we do not have a 
similar theory of surfaces thus defined (— 93 
Curves). 

What is called a surface or a curved surface 
is usually a 2-dimensional ttopological mani- 
fold, that is, a topological space that satisfies 
the tsecond countability axiom and of which 
every point has a neighborhood thomeomor- 
phic to the interior of a circular disk in a 
2-dimensional Euclidean space. In the follow- 
ing sections, we mean by a surface such a 2- 
dimensional topological manifold. 


B. Examples and Classification 


The simplest examples of surfaces are the 2- 
dimensional !simplex and the 2-dimensional 
‘sphere. Surfaces are generally *simplicially 
decomposable (or triangulable) and hence 
homeomorphic to 2-dimensional polyhedra (T. 
Rado, Acta Sci. Math. Szeged. (1925)). A tcom- 
pact surface is called a closed surface, and a 
noncompact surface is called an open surface. 
A closed surface ts decomposable into a finite 
number of 2-simplexes and so can be inter- 
preted as a tcombinatorial manifold. A 2- 
dimensional topological manifold having a 
boundary is called a surface with boundary. A 
2-simplex is an example of a surface with 
boundary, and a sphere is an example of a 
closed surface without boundary. 

Surfaces are classified as torientable and 
*nonorientable. In the special case when a sur- 
face is embedded in a 3-dimensional Euclid- 
ean space E?, whether the surface is orien- 
table or not depends on its having two sides 
(the “surface” and “back”) or only one side. 
Therefore, in this special case, an orientable 
surface is called two-sided, and a nonorientable 
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surface, one-sided. A nonorientable closed 
surface without boundary cannot be embed- 
ded in the Euclidean space E? (— 56 Charac- 
teristic Classes, 114 Differential Topology). 

The first example of a nonorientable surface 
(with boundary) is the so-called Möbius strip 
or Mobius band, constructed as an 'identi- 
fication space from a rectangle by twisting 
through 180° and identifying the opposite 
edges with one another (Fig. 1). 


A D 





Fig. 1 


As illustrated in Fig. 2, from a rectangle 
ABCD we can obtain a closed surface homeo- 
morphic to the product space S! x S! by 
identifying the opposite edges AB with DC 
and BC with AD. This surface is the so-called 
2-dimensional torus (or anchor ring). In this 
case, the four vertices A, B, C, D of the rec- 
tangle correspond to one point p on the sur- 
face, and the pairs of edges AB, DC and BC, 
AD correspond to closed curves a’ and b’ on 
the surface. We use the notation aba !b ^! to 
represent a torus. This refers to the fact that 
the torus is obtained from an oriented four- 
sided polygon by identifying the first side and 
the third (with reversed orientation), the sec- 
ond side and the fourth (with reversed orienta- 
tion). Similarly, aa~' represents a sphere (Fig. 
3), and a,b,a;'b;'a,b,az'b5' represents the 
closed surface shown in Fig. 4. 


A b D 
(ol 
B b C 
2 


Fig. 2 


B 


Fig. 3 
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Fig. 4 


All closed surfaces without boundary are 
constructed by identifying suitable pairs of 
sides of a 2n-sided polygon in a Euclidean 
plane E?. Furthermore, a closed orientable 
surface without boundary is homeomorphic to 
the surface represented by aa™ or 
aib,aj bj! ...a ba, bz". (1) 


P P P 


The 1-dimensional *Betti number of this 
surface is 2p, the 0-dimensional and 2-dimen- 
sional *Betti numbers are 1, the *torsion coeffi- 
cients are all 0, and p is called the genus of the 
surface. Also, a closed orientable surface of 
genus p with boundaries c,, ...,c, is repre- 
sented by 


WiC Wy)... WCW, arbia bi. sss aba, bi 


Q) 


(Fig. 5). A closed nonorientable surface with- 
out boundary is represented by 


10,0505 ...ü 


qd: (3) 
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The 1-dimensional Betti number of this 
surface is q— 1, the 0-dimensional and 2- 
dimensional Betti numbers are 1 and 0, re- 
spectively, the 1-dimensional torsion coeffi- 
cient is 2, the 0-dimensional and 2-dimensional 
torsion coefficients are 0, and q is called the 
genus of the surface. A closed nonorientable 
surface of genus q with boundaries c,, ...,c, 
is represented by 


, wl -1 
Wi CIW e MAC d, Ay... Ag Ag. (4) 


Each of forms (1)-(4) is called the normal form 
of the respective surface, and-the curves aj, b;, 
w, are called the normal sections of the surface. 
To explain the notation in (3), we first take the 
simplest case, aa. In this case, the surface is 
obtained from a disk by identifying each pair 
of points on the circumference that are end- 
points of a diameter (Fig. 6). The surface aa is 
then homeomorphic to a tprojective plane of 
which a decomposition into a complex of 
triangles is illustrated in Fig. 7. On the other 
hand, aabb represents a surface like that 
shown in Fig. 8, called the Klein bottle. Fig. 9 
shows a handle, and Fig. 10 shows a cross cap. 


a a 





1551 





Fig. 10 


The last two surfaces have boundaries; a 
handle is orientable, while a cross cap is non- 
orientable and homeomorphic to the Móbius 
strip. If we delete p disks from a sphere and 
replace them with an equal number of handles, 
then we obtain a surface homeomorphic to 
the surface represented in (1), while if we 
replace the disks by cross caps instead of by 
handles, then the surface thus obtained is 
homeomorphic to that represented in (3). 
Now we decompose the surfaces (1) and (3) 
into triangles and denote the number of i- 
dimensional simplexes by x; (i=0, 1,2). Then in 
view of the tEuler-Poincaré formula, the sur- 
faces (1) and (3) satisfy the respective formulas 


o —9, +4 —2(1— p), 
Xo — 04-05 —2— q. 


The *Riemann surfaces of *algebraic func- 
tions of one complex variable are always sur- 
faces of type (1), and their genera p coincide 
with those of algebraic functions. 

All closed surfaces are homeomorphic to 
surfaces of types (1), (2), (3), or (4). A necessary 
and sufficient condition for two surfaces to be 
homeomorphic to each other is coincidence of 
the numbers of their boundaries, their orienta- 
bility or nonorientability, and their genera (or 
*Euler characteristic al — à! +27). This propo- 
sition is called the fundamental theorem of the 
topology of surfaces. The thomeomorphism 
problem of closed surfaces is completely solved 
by this theorem. The same problem for n 
(n> 3) manifolds, even if they are compact, 
remains open. (For surface area — 246 Length 
and Area. For the differential geometry of 
surfaces — 111 Differential Geometry of 
Curves and Surfaces.) 
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A. General Remarks 


Symbolic logic (or mathematical logic) is a field 
of logic in which logical inferences commonly 
used in mathematics are investigated by use of 
mathematical symbols. 

The algebra of logic originally set forth by 
G. Boole [1] and A. de Morgan [2] is actually 
an algebra of sets or relations; it did not reach 
the same level as the symbolic logic of today. 
G. Frege, who dealt not only with the logic 
of propositions but also with the first-order 
predicate logic using quantifiers (— Sections 
C and K), should be regarded as the real 
originator of symbolic logic. Frege's work, 
however, was not recognized for some time. 
Logical studies by C. S. Peirce, E. Schróder, 
and G. Peano appeared soon after Frege, but 
they were limited mostly to propositions and 
did not develop Frege's work. An essential 
development of Frege's method was brought 
about by B. Russell, who, with the collabor- 
ation of A. N. Whitehead, summarized his 
results in Principia mathematica [4], which 
seemed to have completed the theory of sym- 
bolic logic at the time of its appearance. 


B. Logical Symbols 


If A and B are propositions, the propositions 
(A and B), (A or B), (4 implies B), and (not A) 
are denoted by 


A^B, AvB, A->B, JA, 


respectively. We call 71A the negation of A, 

A ^ B the conjunction (or logical product), 

A v B the disjunction (or logical sum), and 

A— B the implication (or B by A). The propo- 
sition (4-9 B)A(B- A) is denoted by AB 
and is read "A and B are equivalent." A v B 
means that at least one of A and B holds. The 
propositions (For all x, the proposition F(x) 
holds) and (There exists an x such that F(x) 
holds) are denoted by VxF(x) and 3xF(x), 
respectively. A proposition of the form VxF(x) 
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is called a universal proposition, and one of the 
form 4xF(x), an existential proposition. The 
symbols ^, v, >, e, 1, V, 3 are called log- 
ical symbols. 

There are various other ways to denote 
logical symbols, including: 


A^B: A&B, A: B, 

Av B: A+B, 

A>B: ADB, A=B, 

AcB: A2B, Az B, A~B, AS cB, AB, 
JA: ~A, A, 

VxF(x): G)F(x), IIxF(x), AxF(x), 

AxF (x): (Ex)F(x), XExF(x) VxF(x). 


C. Free and Bound Variables 


Any function whose values are propositions is 
called a propositional function. Vx and 3x can 
be regarded as operators that transform any 
propositional function F(x) into the propo- 
sitions VxF (x) and dx F(x), respectively. Vx and 
3x are called quantifiers; the former is called 
the universal quantifier and the latter the 
existential quantifier. F(x) is transformed 

into VxF(x) or 3xF(x) just as a function f(x) 
is transformed into the definite integral 

fd f(x) dx; the resultant propositions VxF(x) 
and 3xF(x) are no longer functions of x. The 
variable x in Vx F(x) and in 3xF(x) is called a 
bound variable, and the variable x in F(x), 
when it is not bound by Vx or 3x, is called a 
free variable. Some people employ different 
kinds of symbols for free variables and bound 
variables to avoid confusion. 


D. Formal Expressions of Propositions 


A formal expression of a proposition in terms 
of logical symbols is called a formula. More 
precisely, formulas are constructed by the 
following formation rules: (1) If 9I is a formula, 
Wis also a formula. If 9I and $8 are for- 
mulas, 9I A B, A v B, A 5 B are all formulas. 
(2) If &(a) is a formula and a is a free variable, 
then Vx (x) and 4x%(x) are formulas, where x 
is an arbitrary bound variable not contained 
in (a) and (x) is the result of substituting x 
for a throughout Veto), 

We use formulas of various scope accord- 
ing to different purposes. To indicate the scope 
of formulas, we fix a set of formulas, each 
element of which is called a prime formula (or 
atomic formula). The scope of formulas is the 
set of formulas obtained from the prime for- 
mulas by formation rules (1) and (2). 
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E. Propositional Logic 


Propositional logic is the field in symbolic 
logic in which we study relations between 
propositions exclusively in connection with the 
four logical symbols ^, v, >, and 71, called 
propositional connectives. 

In propositional logic, we deal only with 
operations of logical operators denoted by 
propositional connectives, regarding the vari- 
ables for denoting propositions, called propo- 
sition variables, only as prime formulas. We 
examine problems such as: What kinds of 
formulas are identically true when their propo- 
sition variables are replaced by any propo- 
sitions, and what kinds of formulas can some- 
times be true? 

Consider the two symbols Y and A, 
read true and false, respectively, and let A — 
{Y, Aj. A univalent function from A, or 
more generally (rom a Cartesian product 
A x ... X À, into À is called a truth function. 
We can regard a, v, >, T as the following 
truth functions: (1) Aa B= Y for A=B= Y, 
and An B= A otherwise; (2) Av B= A for 
A=B= A. and Av B= Y otherwise; (3) 
A->B= À for A= Y and B= A, and 
AB — Y otherwise; (4) 7A=A for A= Y, 
and 1A=Y for A- A. 

If we regard proposition variables as vari- 
ables whose domain is A, then each formula 
represents a truth function. Conversely, any 
truth function (of a finite number of indepen- 
dent variables) can be expressed by an appro- 
priate formula, although such a formula is not 
uniquely determined. If a formula is regarded 
as a truth function, the value of the function 
determined by a combination of values of the 
independent variables involved in the formula 
is called the truth value of the formula. 

A formula corresponding to a truth function 
that takes only Y as its value is called a tau- 
tology. For example, Uv IIA and (lt 8) 
9I) Ware tautologies. Since a truth func- 
tion with n independent variables takes values 
corresponding to 2" combinations of truth 
values of its variables, we can determine in a 
finite number of steps whether a given formula 
is a tautology. If U8 is a tautology (that is, 
9I and B correspond to the same truth func- 
tion), then the formulas 9I and $8 are said to be 
equivalent. 


F. Propositional Calculus 


It is possible to choose some specific tau- 
tologies, designate them as axioms, and derive 
all tautologies from them by appropriately 
given rules of inference. Such a system is called 
a propositional calculus. There are many ways 
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to stipulate axioms and rules of inference for 
a propositional calculus. 

The abovementioned propositional calculus 
corresponds to the so-called classical propo- 
sitional logic (— Section L). By choosing ap- 
propriate axioms and rules of inference we can 
also formally construct intuitionistic or other 
propositional logics. In intuitionistic logic the 
law of the texcluded middle is not accepted, 
and hence it 1s impossible to formalize intui- 
tionistic propositional logic by the notion of 
tautology. We therefore usually adopt the 
method of propositional calculus, instead of 
using the notion of tautology, to formalize 
intuitionistic propositional logic. For example, 
V. I. Glivenko's theorem [5], that if a formula 
3I can be proved in classical logic, then 7 71% 
can be proved in intuitionistic logic, was ob- 
tained by such formalistic considerations. A 
method of extending the classical concepts of 
truth value and tautology to intuitionistic 
and other logics has been obtained by S. A. 
Kripke. There are also studies of logics inter- 
mediate between intuitionistic and classical 
logic (T. Umezawa). 


G. Predicate Logic 


Predicate logic is the area of symbolic logic in 
which we take quantifiers 1n account. Mainly 
propositional functions are discussed in predi- 
cate logic. In the strict sense only single- 
variable propositional functions are called 
predicates, but the phrase predicate of n argu- 
ments (or n-ary predicate) denoting an n- 
variable propositional function 1s also em- 
ployed. Single-variable (or unary) predicates 
are also called properties. We say that a has 
the property F if the proposition F(a) formed 
by the property F is true. Predicates of two 
arguments are called binary relations. The 
proposition R(a, b) formed by the binary re- 
lation R is occasionally expressed in the form 
aRb. Generally, predicates of n arguments are 
called n-ary relations. The domain of defini- 
tion of a unary predicate is called the object 
domain, clements of the object domain are 
called objects, and any variable running over 
the object domain is called an object variable. 
We assume here that the object domain is not 
empty. When we deal with a number of predi- 
cates simultaneously (with different numbers of 
variables), it is usual to arrange things so that 
all the independent variables have the same 
object domain by suitably extending their 
object domains. 

Predicate logic in its purest sense deals 
exclusively with the general properties of 
quantifiers in connection with propositional 
connectives. The only objects dealt with in this 
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field are predicate variables defined over a 
certain common domain and object variables 
running over the domain. Propositional vari- 
ables are regarded as predicates of no vari- 
ables. Each expression F(a,,...,a,) for any 
predicate variable F of n variables a,,...,a, 
(object variables designated as free) is regarded 
as a prime formula (n — 0, 1,2, ...), and we deal 
exclusively with formulas generated by these 
prime formulas, where bound variables are 
also restricted to object variables that have a 
common domain. We give no specification for 
the range of objects except that it be the com- 
mon domain of the object variables. 

By designating an object domain and sub- 
stituting a predicate defined over the domain 
for each predicate variable in a formula, we 
Obtain a proposition. By substituting further 
an object (object constant) belonging to the 
object domain for each object variable in a 
proposition, we obtain a proposition having a 
definite truth value. When we designate an 
object domain and further associate with each 
predicate variable as well as with each object 
variable a predicate or an object to be sub- 
stituted for it, we call the pair consisting of the 
object domain and the association a model. 
Any formula that is true for every model is 
called an identically true formula or valid 
formula. The study of identically true formu- 
las is one of the most important problems in 
predicate logic. 


H. Formal Representations of Mathematical 
Propositions 


To obtain a formal representation of a math- 
ematical theory by predicate logic, we must 
first specify its object domain, which is a non- 
empty set whose elements are called individ- 
uals; accordingly the object domain ts called 
the individual domain, and object variables are 
called individual variables. Secondly we must 
specify individual symbols, function symbols, 
and predicate symbols, signifying specific indi- 
viduals, functions, and *predicates, respectively. 
Here a function of n arguments is a univa- 
lent mapping from the Cartesian product 

D x ... x D of n copies of the given set to D. 
Then we define the notion of term as in the 
next paragraph to represent each individual 
formally. Finally we express propositions for- 
mally by formulas. 

Definition of terms (formation rule for terms): 
(1) Each individual symbol is a term. (2) Each 
free variable is a term. (3) f(t,,...,¢,) is a term 
if £,,..., t, are terms and f is a function symbol 
of n arguments. (4) The only terms are those 
given by (1)-(3). 

As a prime formula in this case we use any 


4111 
Symbolic Logic 


formula of the form F(t,,...,¢,), where F is a 
predicate symbol of n arguments and f,,...,t, 
are arbitrary terms. To define the notions of 
term and formula, we need logical symbols, 
free and bound individual variables, and also a 
list of individual symbols, function symbols, 
and predicate symbols. 

In pure predicate logic, the individual 
domain is not concrete, and we study only 
general forms of propositions. Hence, in this 
case, predicate or function symbols are not 
representations of concrete predicates or func- 
tions but are predicate variables and function 
variables. We also use free individual variables 
instead of individual symbols. In fact, it is now 
most common that function variables are 
dispensed with, and only free individual vari- 
ables are used as terms. 


I. Formulation of Mathematical Theories 


To formalize a theory we need axioms and 
rules of inference. Axioms constitute a certain 
specific set of formulas, and a rule of inference 
is a rule for deducing a formula from other 
formulas. A formula is said to be provable if it 
can be deduced from the axioms by repeated 
application of rules of inference. Axioms are 
divided into two types: logical axioms, which 
are common to all theories, and mathematical 
axioms, which are peculiar to each individual 
theory. The set of mathematical axioms is 
called the axiom system of the theory. 

(I) Logical axioms: (1) A formula that is the 
result of substituting arbitrary formulas for the 
proposition variables in a tautology is an 
axiom. (2) Any formula of the form 


Vx) st) or $&()o3xsQ) 


is an axiom, where (F(t) is the result of sub- 
stituting an arbitrary term t for x in (x). 

(I1) Rules of inference: (1) We can deduce a 
formula 93 from two formulas YW and UB 
(modus ponens). (2) We can deduce UM Vx F(x) 
from a formula WI F(a) and 3x (x) M 
from (a) 9I, where a is a free individual 
variable contained in neither 9I nor (x) and 
(a) is the result of substituting a for x in (x). 

If an axiom system is added to these logical 
axioms and rules of inference, we say that a 
formal system is given. 

A formal system S or its axiom system is 
said to be contradictory or to contain a con- 
tradiction if a formula ?I and its negation G&U 
are provable; otherwise it is said to be consis- 
tent. Since 
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is a tautology, we can show that any formula 
is provable in a formal system containing a 
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contradiction. The validity of a proof by 
reductio ad absurdum lies in the fact that 


(915 (88 ^ 7138))— TA 


is a tautology. An affirmative proposition 
(formula) may be obtained by reductio ad 
absurdum since the formula (of propositional 
logic) representing the discharge of double 
negation 
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is a tautology. 


J. Predicate Calculus 


If a formula has no free individual variable, we 
call it a closed formula. Now we consider a 
formal system S whose mathematical axioms 
are closed. A formula 9f is provable in S if 

and only if there exist suitable mathematical 
axioms €,,...,€, such that the formula 


(E a... AE) >A 


is provable without the use of mathematical 
axioms. Since any axiom system can be re- 
placed by an equivalent axiom system contain- 
ing only closed formulas, the study of a formal 
system can be reduced to the study of pure 
logic. 

In the following we take no individual sym- 
bols or function symbols into consideration 
and we use predicate variables as predicate 
symbols in accordance with the commonly 
accepted method of stating properties of the 
pure predicate logic; but only in the case of 
predicate logic with equality will we use predi- 
cate variables and the equality predicate = as 
a predicate symbol. However, we can safely 
state that we use function vartables as function 
symbols. 

The formal system with no mathematical 
axioms is called the predicate calculus. The 
formal system whose mathematical axioms are 
the equality axioms 


a=b > (W(a) W(b) 


is called the predicate calculus with equality. 

In the following, by being provable we mean 
being provable in the predicate calculus. 

(1) Every provable formula is valid. 

(2) Conversely, any valid formula is prov- 
able (K. Gödel [6]). This fact is called the 
completeness of the predicate calculus. In fact, 
by Godel’s proof, a formula 9I is provable if 
9I is always true in every interpretation whose 
individual domain is of fcountable cardinality. 
In another formulation, if G% is not provable, 
the formula Y is a true proposition in some 
interpretation (and the individual domain in 
this case is of countable cardinality). We can 


a=a, 
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extend this result as follows: If an axiom sys- 
tem generated by countably many closed 
formulas is consistent, then its mathematical 
axioms can be considered true propositions 
by a common interpretation. In this sense, 
Gódel's completeness theorem gives another 
proof of the 'Skolem-Lówenheim theorem. 

(3) The predicate calculus is consistent. 
Although this result is obtained from (1) in this 
section, it is not difficult to show it directly 
(D. Hilbert and W. Ackermann [7]). 

(4) There are many different ways of giving 
logical axioms and rules of inference for the 
predicate calculus. G. Gentzen gave two types 
of systems in [8]; one is a natural deduction 
system in which it ts easy to reproduce formal 
proofs directly from practical ones in math- 
ematics, and the other has a logically simpler 
structure. Concerning the latter, Gentzen 
proved Gentzen's fundamental theorem, which 
shows that a formal proof of a formula may be 
translated into a "direct" proof. The theorem 
itself and its idea were powerful tools for ob- 
taining consistency proofs. 

(5) If the proposition 3x9 (x) is true, we 
choose one of the individuals x satisfying the 
condition W(x), and denote it by ex%(x). When 
Ix (x) is false, we let ex9I(x) represent an 
arbitrary individual. Then 


jJx3l(x) 2 W(exW(x)} (1) 


is true. We consider ex to be an operator as- 
sociating an individual ex9I(x) with a propo- 
sition M(x) containing the variable x. Hilbert 
called it the transfinite logical choice function; 
today we call it Hilbert's c-operator (or £- 
quantifier), and the logical symbol ¢ used in 
this sense Hilbert's c-symbol. Using the e- 
symbol, 3xAM(x) and Vx9I(x) are represented by 


WexW(x)), Mex 1 A(x)), 


respectively, for any U(x). The system of predi- 
cate calculus adding formulas of the form (1) 
as axioms is essentially equivalent to the usual 
predicate calculus. This result, called the £- 
theorem, reads as follows: When a formula € is 
provable under the assumption that every 
formula of the form (1) is an axiom, we can 
prove & using no axioms of the form (1) if € 
contains no logical symbol e (D. Hilbert and 
P. Bernays [9]). Moreover, a similar theorem 
holds when axioms of the form 


Vx(I(x)e98(x))  ex9l(x) = ex9S3(x) (2) 


are added (S. Maehara [10]). 
(6) For a given formula YW, call I a normal 
form of d when the formula 


We WU’ 


is provable and WU satisfies a particular con- 
dition. For example, for any formula d there is 
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a normal form WU satisfying the condition: W 
has the form 


Qi Xi 2-0 XB exu 


where Qx means a quantifier Vx or dx, and 
B(x, ..., x,) contains no quantifier and has no 
predicate variables or free individual variables 
not contained in 9L. A normal form of this 
kind is called a prenex normal form. 

(7) We have dealt with the classical first- 
order predicate logic until now. For other 
predicate logics (— Sections K and L) also, we 
can consider a predicate calculus or a formal 
system by first defining suitable axioms or 
rules of inference. Gentzen’s fundamental 
theorem applies to the intuitionistic predicate 
calculus formulated by V. I. Glivenko, A. 
Heyting, and others. Since Gentzen’s funda- 
mental theorem holds not only in classical 
logic and intuitionistic logic but also in several 
systems of first-order predicate logic or pro- 
positional logic, it is useful for getting results 
in modal and other logics (M. Ohnishi, K. 
Matsumoto). Moreover, Glivenko’s theorem 
in propositional logic [5] is also extended to 
predicate calculus by using a rather weak 
representation (S. Kuroda [12]). G. Takeuti 
expected that a theorem similar to Gentzen’s 
fundamental theorem would hold in higher- 
order predicate logic also, and showed that 
the consistency of analysis would follow if 
that conjecture could be verified [13]. More- 
over, in many important cases, he showed 
constructively that the conjecture holds par- 
tially. The conjecture was finally proved by 
M. Takahashi [14] by a nonconstructive 
method. Concerning this, there are also con- 
tributions by S. Maehara, T. Simauti, M. 
Yasuhara, and W. Tait. 


K. Predicate Logics of Higher Order 


In ordinary predicate logic, the bound vari- 
ables are restricted to individual variables. In 
this sense, ordinary predicate logic is called 
first-order predicate logic, while predicate logic 
dealing with quantifiers VP or 3P for a predi- 
cate variable P is called second-order predicate 
logic. 

Generalizing further, we can introduce the 
so-called third-order predicate logic. First we 
fix the individual domain D,. Then, by intro- 
ducing the whole class D' of predicates of n 
variables, each running over the object domain 
Do, we can introduce predicates that have Di 
as their object domain. This kind of predicate 
is called a second-order predicate with respect 
to the individual domain Dy. Even when 
we restrict second-order predicates to one- 
variable predicates, they are divided into vari- 
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ous types, and the domains of independent 
variables do not coincide in the case of more 
than two variables. In contrast, predicates 
having D, as their object domain are called 
first-order predicates. The logic having quan- 
tifiers that admit first-order predicate variables 
is second-order predicate logic, and the logic 
having quantifiers that admit up to second- 
order predicate variables is third-order predi- 
cate logic. Similarly, we can define further 
higher-order predicate logics. | 

Higher-order predicate logic is occasionally 
called type theory, because variables arise that 
are classified into various types. Type theory is 
divided into simple type theory and ramified 
type theory. 

We confine ourselves to variables for single- 
variable predicates, and denote by P such a 
bound predicate variable. Then for any for- 
mula (a) (with a a free individual variable), 
the formula 


3PVx(P(x)eW(x)) 


is considered identically true. This is the point 
of view in simple type theory. 

Russell asserted first that this formula can- 
not be used reasonably if quantifiers with 
respect to predicate variables occur in (x). 
This assertion is based on the point of view 
that the formula in the previous paragraph 
asserts that (x) is a first-order predicate, 
whereas any quantifier with respect to first- 
order predicate variables, whose definition 
assumes the totality of the first-order predi- 
cates, should not be used to introduce the first- 
order predicate (x). For this purpose, Russell 
further classified the class of first-order predi- 
cates by their rank and adopted the axiom 


3P*Vx(P*(x)ew(x)) 


for the predicate variable P* of rank k, where 
the rank i of any free predicate variable occur- 
ring in (x) is <k, and the rank j of any 
bound predicate variable occurring in (x) is 
« k. This is the point of view in ramified type 
theory, and we still must subdivide the types if 
we deal with higher-order propositions or 
propositions of many variables. Even Russell, 
having started from his ramified type theory, 
had to introduce the axiom of reducibility 
afterwards and reduce his theory to simple 
type theory. 


L. Systems of Logic 


Logic in the ordinary sense, which is based on 
the law of the excluded middle asserting that 
every proposition is in principle either true or 
false, is called classical logic. Usually, propo- 
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sitional logic, predicate logic, and type theory 
are developed from the standpoint of classical 
logic. Occasionally the reasoning of intuition- 
istic mathematics is investigated using sym- 
bolic logic, in which the law of the excluded 
middle is not admitted (— 156 Foundations of 
Mathematics). Such logic is called intuitionistic 
logic. Logic is also subdivided into proposi- 
tional logic, predicate logic, etc., according to 
the extent of the propositions (formulas) dealt 
with. 

To express modal propositions stating possi- 
bility, necessity, etc., in symbolic logic, J. Lu- 
kaszewicz proposed a propositional logic called 
three-valued logic, having a third truth value, 
neither true nor false. More generally, many- 
valued logics with any number of truth values 
have been introduced; classical logic is one of 
its special cases, two-valued logic with two 
truth values, true and false. Actually, however, 
many-valued logics with more than three truth 
values have not been studied much, while 
various studies in modal logic based on classi- 
cal logic have been successfully carried out. 
For example, studies of strict implication 
belong to this field. 
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412 (IV.13) 
Symmetric Riemannian 
Spaces and Real Forms 


A. Symmetric Riemannian Spaces 


Let M be a tRiemannian space. For each point 
p of M we can define a mapping o, of a suit- 
able neighborhood U, of p onto U, itself so 
that o, (x) 2 x .,, where x, (|t| <€, xo =p) is any 
*geodesic passing through the point p. We call 
M a locally symmetric Riemannian space if for 
any point p of M we can choose a neighbor- 
hood U, so that c, is an tisometry of U,. In 
order that a Riemannian space M be locally 
symmetric it is necessary and sufficient that the 
*covariant differential (with respect to the 
*Riemannian connection) of the fcurvature 
tensor of M be 0. A locally symmetric Riemann- 
ian space is a treal analytic manifold. We say 
that a Riemannian space M is a globally sym- 
metric Riemannian space (or simply symmetric 
Riemannian space) if M is connected and if for 
each point p of M there exists an isometry o, 
of M onto M itself that has p as an isolated 
fixed point (i.e., has no fixed point except p in a 
certain neighborhood of p) and such that o7 is 
the identity transformation on M. In this case 
6, is called the symmetry at p. A (globally) 
symmetric Riemannian space is locally sym- 
metric and is a tcomplete Riemannian space. 
Conversely, a tsimply connected complete 
locally symmetric Riemannian space is a (glob- 
ally) symmetric Riemannian space. 


B. Symmetric Riemannian Homogeneous 
Spaces 


À thomogeneous space G/K of a connected 
*Lie group G is a symmetric homogeneous 
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space (with respect to 0) if there exists an in- 
volutive automorphism (i.e., automorphism of 
order 2) 0 of G satisfying the condition Kj c 
K c Ką, where K, is the closed subgroup con- 
sisting of all elements of G left fixed by 0 and 
K? is the connected component of the iden- 
tity element of K,. In this case, the mapping 
aK 5 0(a)K (aeG)is a transformation of 

G/K having the point K as an isolated fixed 
point; more generally, the mapping 0, :aK > 
ag(ag) ! 0(a)K is a transformation of G/K 
that has an arbitrary given point a9 K of G/K 
as an isolated fixed point. If there exists a G- 
invariant Riemannian metric on G/K, then 
G/K is a symmetric Riemannian space with 
symmetries (0, | ao € G} and is called a sym- 
metric Riemannian homogeneous space. A 
sufficient condition for a symmetric homoge- 
neous space G/K to be a symmetric Riemann- 
ian homogeneous space is that K be a com- 
pact subgroup. Conversely, given a symmetric 
Riemannian space M, let G be the connected 
component of the identity element of the Lie 
group formed by all the isometries of M; then 
M is represented as the symmetric Riemannian 
homogeneous space M — G/K and K is a com- 
pact group. In particular, a symmetric Rie- 
mannian space can be regarded as a Riemann- 
ian space that is realizable as a symmetric 
Riemannian homogeneous space. 

The Riemannian connection of a symme- 
tric Riemannian homogeneous space G/K is 
uniquely determined (independent of the 
choice of G-invariant Riemannian metric), and 
a geodesic x,(|t| « oo, xg — ag K) passing 
through a point aj K of G/K is of the form 
x, — (exptX)ay K. Here X is any element of the 
Lie algebra g of G such that 0(X) — — X, where 
0 also denotes the automorphism of o induced 
by the automorphism 0 of G and exptX is the 
*one-parameter subgroup of G defined by the 
element X. The covariant differential of any G- 
invariant tensor field on G/K is 0, and any G- 
invariant ‘differential form on G/K is a closed 
differential form. 


C. Classification of Symmetric Riemannian 
Spaces 


The *simply connected tcovering Riemannian 
space of a symmetric Riemannian space is also 
a symmetric Riemannian space. Therefore the 
problem of classifying symmetric Riemannian 
spaces is reduced to classifying simply con- 
nected symmetric Riemannian spaces M and 
determining tdiscontinuous groups of iso- 
metries of M. When we take the tde Rham 
decomposition of such a space M and repre- 
sent M as the product of a real Euclidean 
space and a number of simply connected irre- 
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ducible Riemannian spaces, all the factors are 
symmetric Riemannian spaces. We say that M 
is an irreducible symmetric Riemannian space if 
it is a symmetric Riemannian space and is 
irreducible as a Riemannian space. 

A simply connected irreducible symmetric 
Riemannian space is isomorphic to one of the 
following four types of symmetric Riemannian 
homogeneous spaces (here Lie groups are 
always assumed to be connected): 

(1) The symmetric Riemannian homoge- 
neous space (G x G)/{(a, a)|ae G} of the direct 
product G x G, where G is a simply connected 
compact ‘simple Lie group and the involutive 
automorphism of G x G 1s given by (a, b) (b, a) 
((a, b)e G x G). This space is isomorphic, as a 
Riemannian space, to the space G obtained by 
introducing a two-sided invariant Riemannian 
metric on the group G; the isomorphism is 
induced from the mapping G x G »(a, b) 
ab Ter, 

(2) A symmetric homogeneous space G/K, 
of a simply connected compact simple Lie 
group G with respect to an involutive auto- 
morphism 0 of G. In this case, the closed sub- 
group K= (aeG |0(a) — a) of G is connected. 
We assume here that 0 is a member of the 
given complete system of representatives of the 
*conjugate classes formed by the elements of 
order 2 in the automorphism group of the 
group G. 

(3) The homogeneous space G°/G, where GE 
is a complex simple Lie group whose ‘center 
reduces to the identity element and G is an 
arbitrary but fixed maximal compact subgroup 
of GS. 

(4) The homogeneous space GK. where Go 
is a noncompact simple Lie group whose 
center reduces to the identity element and 
which has no complex Lie group structure, 
and K is a maximal compact subgroup of G. 
In Section D we shall see that (3) and (4) are 
actually symmetric homogeneous spaces. All 
four types of symmetric Riemannian spaces are 
actually trreducible symmetric Riemannian 
spaces, and G-invariant Riemannian metrics 
on each of them are uniquely determined up to 
multiplication by a positive number. On the 
other hand, (1) and (2) are compact, while (3) 
and (4) are homeomorphic to Euclidean spaces 
and not compact. For spaces of types (1) and . 
(3) the problem of classifying simply connected 
irreducible symmetric Riemannian spaces is 
reduced to classifying tcompact real simple Lie 
algebras and *complex simple Lie algebras, 
respectively, while for types (2) and (4) it is 
reduced to the classification of noncompact 
real simple Lie algebras (— Section D) (for the 
result of classification of these types — Ap- 
pendix A, Table 5.II). On the other hand, any 
(not necessarily simply connected) irreducible 
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symmetric Riemannian space defines one of 
(1)-(4) as its *universal covering manifold; if 
the covering manifold is of type (3) or (4), the 
original symmetric Riemannian space is neces- 
sarily simply connected. 


D. Symmetric Riemannian Homogeneous 
Spaces of Semisimple Lie Groups 


In Section C we saw that any irreducible sym- 
metric Riemannian space is representable as a 
symmetric Riemannian homogencous space 
G/K on which a connected semisimple Lie 
group G acts *almost effectively (— 249 Lie 
Groups). Among symmetric Riemannian 
spaces, such a space M — G/K is characterized 
as one admitting no nonzero vector field that 
is *parallel with respect to the Riemannian 
connection. Furthermore, if G acts effectively 
on M, G coincides with the connected compo- 
nent I(M)° of the identity element of the Lie 
group formed by all the isometries of M. 

We let M — G/K be a symmetric Riemann- 
ian homogeneous space on which a con- 
nected semisimple Lie group G acts almost 
effectively. Then G is a Lie group that is "locally 
isomorphic to the group /(M)?, and therefore 
the Lie algebra of G is determined by M. Let g 
be the Lie algebra of G, f be the subalgebra of 
g corresponding to K, and 0 be the involutive 
automorphism of G defining the symmetric 
homogeneous space G/K. The automorphism 
of g defined by 0 is also denoted by 6. Then f= 
(Xeg|0(X)— X]. Putting m={X eg|0(X)= 
— X}, we have g=m +f (direct sum of linear 
spaces), and m can be identified in a natural 
way with the tangent space at the point K of 
G/K. The *adjoint representation of G gives 
rise to a representation of K in g, which in- 
duces a linear representation Ad, (k) of K in m. 
Then {Ad,,,(k)| ke K} coincides with the tres- 
tricted homogeneous holonomy group at the 
point K of the Riemannian space G/K. 

Now let o be the tKilling form of g. Thent 
and m are mutually orthogonal with respect to 
o, and denoting by p; and o, the restrictions 
of o to f and m, respectively, œ is a negative 
definite quadratic form on t. If p is also a 
negative definite quadratic form on nt, g is a 
compact real semisimple Lie algebra and G/K 
is a compact symmetric Riemannian space; in 
this case we say that G/K is of compact type. 
In the opposite case, where q,, is a *positive 
definite quadratic form, G/K is said to be of 
noncompact type. In this latter case, G/K is 
homeomorphic to a Euclidean space, and if 
the center of G is finite, K is a maximal com- 
pact subgroup of G. Furthermore, the group 
of isometries J(G/K) of G/K is canonically 
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isomorphic to the automorphism group of 
the Lie algebra g. When G/K is of compact 
type (noncompact type), there exists one and 
only one G-invariant Riemannian metric on 
G/K, which induces in the tangent space m 
at the point K the positive definite inner 
product — On (Pm). 

A symmetric Riemannian homogeneous 
space G/K, of compact type defined by a sim- 
ply connected compact semisimple Lie group 
G with respect to an involutive automorphism 
0 is simply connected. Let g=m-+f, be the de- 
composition of the Lie algebra g of G with 
respect to the automorphism 0 of g, and let af 
be the *complex form of g. Then the real sub- 
space gg = Ji m 4- f, in gf is a real semi- 
simple Lie algebra and a treal form of af. Let 
G, be the Lie group corresponding to the Lie 
algebra ge with center reduced to the identity 
element, and let K be the subgroup of G, cor- 
responding to I. Then we get a (simply con- 
nected) symmetric Riemannian homogeneous 
space of noncompact type G,/K. 

When we start from a symmetric Riemann- 
ian space of noncompact type G/K instead of 
the symmetric Riemannian space of compact 
type G/K, and apply the same process as in 
the previous paragraphs, taking a simply 
connected G, as the Lie group corresponding 
to gg, we obtain a simply connected symmetric 
Riemannian homogeneous space of compact 
type. Indeed, each of these two processes is the 
reverse of the other, and in this way we get a 
one-to-one correspondence between simply 
connected symmetric Riemannian homoge- 
neous spaces of compact type and those of 
noncompact type. This relationship is called 
duality for symmetric Riemannian spaces; 
when two symmetric Riemannian spaces are 
related by duality, each is said to be the dual 
of the other. 

If one of the two symmetric Riemannian 
spaces related by duality is irreducible, the 
other is also irreducible. The duality holds 
between spaces of types (1) and (3) and be- 
tween those of types (2) and (4) described in 
Section C. This fact is based on the following 
theorem in the theory of Lie algebras, where 
we identify isomorphic Lie algebras. (i) Com- 
plex simple Lie algebras af and compact real 
simple Lie algebras g are in one-to-one corre- 
spondence by the relation that al: is the com- 
plex form of g. (it) Form the Lie algebra a, in 
the above way from a compact real simple Lie 
algebra o and an involutive automorphism 0 
of g. We assume that 0 is a member of the 
given complete system of representatives of 
conjugate classes of involutive automorphisms 
in the automorphism group of g. Then we get 
from the pair (g, 0) a noncompact real simple 
Lie algebra go, and any noncompact real 
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simple Lie algebra is obtained by this process 
in one and only one way. 

Consider a Riemannian space given as a 
symmetric Riemannian homogeneous space M 
— G/K with a semisimple Lie group G, and let 
K be the 'sectional curvature of M. Then if M 
is of compact type the value of K is >0, and 
if M is of noncompact type it is <0. On the 
other hand, the rank of M is the (unique) di- 
mension of a commutative subalgebra of q 
that 1s contained in and maximal in m. (For 
results concerning the group of isometries of 
M, distribution of geodesics on M, etc. — [3].) 


E. Symmetric Hermitian Spaces 


A connected fcomplex manifold M with a 
*Hermitian metric is called a symmetric Her- 
mitian space if for each point p of M there 
exists an isometric and *biholomorphic trans- 
formation of M onto M that is of order 2 and 
has p as an isolated fixed point. As a real ana- 
lytic manifold, such a space M is a symmetric 
Riemannian space of even dimension, and the 
Hermitian metric of M is a *Káhler metric. Let 
I(M) be the (not necessarily connected) Lie 
group formed by all isometries of M, and let 
A(M) be the subgroup consisting of all holo- 
morphic transformations in [(M). Then A(M) 
is a closed Lie subgroup of I(M). Let G be the 
connected component A(M)? of the identity 
element of A(M). Then G acts transitively on 
M, and M is expressed as a symmetric Rie- 
mannian homogeneous space G/K. 

Under the de Rham decomposition of a 
simply connected symmetric Hermitian space 
(regarded as a Riemannian space), all the 
factors are symmetric Hermitian spaces. The 
factor that is isomorphic to a real Euclidean 
spaces as a Riemannian space is a symmetric 
Hermitian space that is isomorphic to the 
complex Euclidean space C". A symmetric 
Hermitian space defining an irreducible sym- 
metric Riemannian space is called an irreduc- 
ible symmetric Hermitian space. The problem 
of classifying symmetric Hermitian spaces is 
thus reduced to classifying irreducible sym- 
metric Hermitian spaces. 

In general, if the symmetric Riemannian 
space defined by a symmetric Hermitian space 
M is represented as a symmetric Riemannian 
homogeneous space G/K by a connected semi- 
simple Lie group G acting effectively on M, 
then M is simply connected, G coincides with 
the group A(M)? introduced in the previous 
paragraph, and the center of K is not a ‘dis- 
crete set. In particular, an irreducible sym- 
metric Hermitian space is simply connected. 
Moreover, in order for an irreducible symmetric 
Riemannian homogeneous space G/K to be 
defined by an irreducible symmetric Hermitian 
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space M, it is necessary and sufficient that the 
center of K not be a discrete set. If G acts 
effectively on M, then G is a simple Lie group 
whose center is reduced to the identity ele- 
ment, and the center of K is of dimension 1. 
For a space G/K satisfying these conditions, 
there are two kinds of structures of symmetric 
Hermitian spaces defining the Riemannian 
structure of G/K. 

As follows from the classification of irre- 
ducible symmetric Riemannian spaces, an 
irreducible Hermittan space defines one of the 
following symmetric Riemannian homogeneous 
spaces, and conversely, each of these homoge- 
neous spaces is defined by one of the two kinds 
of symmetric Hermitian spaces. 

(I) The symmetric homogeneous space G/K 
of a compact simple Lie group G with respect 
to an involutive automorphism 0 such that the 
center of G reduces to the identity element and 
the center of K is not a discrete set. Here 0 
may be assumed to be a representative of a 
conjugate class of involutive automorphisms 
in the automorphism group of G. 

(II) The homogeneous space Gj/K of a 
noncompact simple Lie group G, by a maxi- 
mal compact subgroup K such that the center 
of Go reduces to the identity element and the 
center of K is not a discrete set. 

An irreducible symmetric Hermitian space 
of type (I) is compact and ts isomorphic to a 
trational algebraic variety. An irreducible 
symmetric Hermitian space of type (IT) is 
homeomorphic to a Euclidean space and is 
isomorphic (as a complex manifold) to a 
bounded domain in C” (Section F). 

By the same principle as for irreducible 
symmetric Riemannian spaces, a duality holds 
for irreducible symmetric Hermitian spaces 
which establishes a one-to-one correspondence 
between the spaces of types (I) and (II). Fur- 
thermore, an irreducible symmetric Hermitian 
space M, of type (II) that is dual to a given 
irreducible symmetric Hermitian space M, 

— G/K of type (I) can be realized as an open 
complex submanifold of M, in the following 
way. Let G© be the connected component of 
the identity element in the Lie group formed 
by all the holomorphic transformations of M,. 
Then G is a complex simple Lie group con- 
taining G as a maximal compact subgroup, 
and the complex Lie algebra g] of G€ contains 
the Lie algebra g of G as a real form. Let 0 be 
the involutive automorphism of G defining the 
symmetric homogeneous space G/K, and let g 
=m-+f be the decomposition of g determined 
by 0. We denote by Gy the real subgroup of GE 
corresponding to the real form go = /-1 m+ 
t of af. Then Gy (i) is a closed subgroup of 

GE whose center reduces to the identity ele- 
ment and (ii) contains K as a maximal com- 
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pact subgroup. By definition the space M, is 
then given by Gj/K. Now the group Gp acts on 
M, as a subgroup of GS, and the orbit of Go 
containing the point K of M, is an open com- 
plex submanifold that is isomorphic to M, (as 
a complex manifold). M, regarded as a com- 
plex manifold can be represented as the homo- 
geneous space GC/U of the complex simple Lie 
group G€. 


F. Symmetric Bounded Domains 


We denote by D a bounded domain in the 
complex Euclidean space C" of dimension n. 
We call D a symmetric bounded domain if for 
each point of D there exists a holomorphic 
transformation of order 2 of D onto D having 
the point as an isolated fixed point. On the 
other hand, the group of all holomorphic 
transformations of D is a Lie group, and D 1s 
called a homogeneous bounded domain if this 
group acts transitively on D. A symmetric 
bounded domain is a homogeneous bounded 
domain. The following theorem gives more 
precise results: On a bounded domain D, 
*Bergman's kernel function defines a Kahler 
metric that is invariant under all holomorphic 
transformations of D. If D is a symmetric 
bounded domain, D is a symmetric Hermitian 
space with respect to this metric. and its defin- 
ing Riemannian space is a symmetric Riemann- 
ian homogeneous space of noncompact type 
G/K with semisimple Lie group G. Conversely, 
any symmetric Hermitian space of noncom- 
pact type is isomorphic (as a complex mani- 
fold) to a symmetric bounded domain. When 
D is isomorphic to an irreducible symmetric 
Hermitian space, we call D an irreducible 
symmetric bounded domain. A symmetric 
bounded domain is simply connected and can 
be decomposed into the direct product of irre- 
ducible symmetric bounded domains. 

The connected component of the identity 
element of the group of all holomorphic trans- 
formations of a symmetric bounded domain D 
is a semisimple Lie group that acts transitively 
on D. Conversely, D is a symmetric bounded 
domain if a connected semisimple Lie group, 
or more generally, a connected Lie group 
admitting a two-sided invariant tHaar mea- 
sure, acts transitively on D. Homogeneous 
bounded domains in C" are symmetric 
bounded domains if n € 3 but not necessarily 
when nz 4. 


G. Examples of Irreducible Symmetric 
Riemannian Spaces 


Here we list irreducible symmetric Riemannian 
spaces of types (2) and (4) (— Section C) that 
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can be represented as homogeneous spaces of 
classical groups, using the notation introduced 
by E. Cartan. We denote by M,— G/K a sim- 
ply connected irreducible symmetric Riemann- 
ian space of type (2), where G is a group that 
acts almost effectively on M, and K is the 
subgroup given by K = K? for an involutive 
automorphism 0 of G. For such an M,, the 
space of type (4) that is dual to M, is denoted 
by M,=G,/K. Clearly dim M, — dim M,. (For 
the dimension and rank of M, and for those 

: M, that are represented as homogeneous 
spaces of simply connected *exceptional com- 
pact simple Lie groups — Appendix A, Table 
5.1II.) In this section (and also in Appendix A, 
Table 5.11), O(n), U (n), Sp(n), SL(n, R), and 
SL(n, C) are the *orthogonal group of degree n, 
the *unitary group of degree n, the 'symplectic 
group of degree 2n, and the real and complex 
‘special linear groups of degree n, respectively. 
Let SO(n) — SL(n, R)N O(n) and SU(n)= 

SL(n, C) U (n). We put 


-I, 0 0 0 
rH ŁO 0 
Kp.4= 0 0 -I, Of? 

0 0 0 1, 


where J, is the p x p unit matrix. 

Type AI. M,— SU (n)/SO(n) (n> 1), where 
0(s) 2 $ (with s the complex conjugate matrix 
of s). Ma = SL(n, R)/SO(n). 

Type AII. M, = SU(2n)/Sp(n) (n> 1), where 
0(s) 2 J,5J, !. My SU*(2ny/Sp(n). Here 
SU*(2n) is the subgroup of SL(2n, C) formed 
by the matrices that commute with the trans- 
formation (z,,...,2,, Z,44, ..., Za) 9 (Zap ses 
., —z,) in C"; SU*(2n) is called the 
quaternion unimodular group and is isomorphic 
to the commutator group of the group of all 
regular transformations in an n-dimensional 
vector space over the quaternion field H. 

Type AUT M,=SU(p+q)/S(U, x U,) (p> 
q> 1), where S(U, x U,)=SU(p+q4)N(U(p) x 
U(q)), with U(p) x U(q) being canonically 
identified with a subgroup of U (p +q), and 
0(s) — I, äi, This space M, is a tcomplex 
Grassmann manifold. M, — SU (p, q)/S(U, x U,), 
where SU (p,q) is the subgroup of SL(p+q,C) 
consisting of matrices that leave invariant the 
Hermitian form z,2,+...+2,Z,—Zp+1Zp+1 — 

eT ÉpsaZ pq: 

Type AIV. This is the case g=1 of type AIII. 
M, is the (n — 1)-dimensional complex projec- 
tive space, and M, is called a Hermitian hyper- 
bolic space. 

Type BDI. M,— SO(p- q)/SO(p) x SO(q) 
(p>q21,p>1,p+q#4), where 0(s)— I, ,sl, ,. 
M, 1s the treal Grassmann manifold formed by 


Zon =Z ge 
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the oriented p-dimensional subspaces in R?**. 
M, = SOo(p. q)/SO(p) x SO(q), where SO(p, q) is 
the subgroup of SL(n, R) consisting of matrices 
that leave invariant the quadratic form x? +... 
Jha e Kaes and SOo(p, q) is the 
connected component of the identity element. 

Type BDII. This is the case q—1 of type 
BDI. M, is the (n— 1)-dimensional sphere, and 
M, is called a real hyperbolic space. 

Type DHI. M, — SO(2n)/U (n) (n> 2), where 
U (n) is regarded as a subgroup of SO(2n) by 


identifying se U (n) with 
Res Ims 
eSO(2n), 
—Ims Res 


and @(s)=J,sJ, !. My=SO*(2n)/U(n). Here 
SO*(2n) denotes the group of all complex 
orthogonal matrices of determinant 1 leaving 
invariant the skew-Hermitian form z,Z,,, — 
2nt121 + ZaZn42 — Zqe 222 +... + Za an Zann; 
this group is isomorphic to the group of all 
linear transformations leaving invariant a 
nondegenerate skew-Hermitian form in an n- 
dimensional vector space over the quaternion 
field H. 

Type CI. M, =Sp(n)/U(n) (n= 1), where U (n) 
is considered as a subgroup of Sp(n) by the 
identification U (n) c SO(2n) explained in type 
DIII and 0(s) 25( — J,5J,, !). Mo — Sp(n, R)/U (n), 
where Sp(n, R) is the real symplectic group of 
degree 2n. 

Type CH. M,=Sp(p+q)/Sp(p) x Sp(q) (p= 
qZ 1), where Sp(p) x Sp(q) is identified with a 
subgroup of Sp(p +q) by the mapping 


A, B, 
C, DI 


A 0 B0 
off Oy OF Bs 
C 0 D 0 


and 0s) - K „45K p.q- Mo=Sp(p, q)/Sp(p) x 
Sp(q). Here Sp(p, q) is the group of complex 
symplectic matrices of degree 2(p +q) leav- 
ing invariant the Hermitian form (z,, ..., 
Zp+a) Kp a (i Saal this group is interpreted 
as the group of all linear transformations leav- 
ing invariant a nondegenerate Hermitian form 
of index p in a (p 4- q)-dimensional vector space 
over the quaternion field H For q= 1, M, is 
the quaternion projective space, and M, is 
called the quaternion hyperbolic space. 

Among the spaces introduced here, there are 
some with lower p, q, n that coincide (as Rie- 
mannian spaces) (— Appendix A, Table 5.IIl). 


H. Space Forms 


A Riemannian manifold of tconstant curvature 
is called a space form; it is said to be spherical, 
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Euclidean, or hyperbolic according as the con- 
stant curvature K is positive, zero, or negative. 
A space form is a locally symmetric Riemann- 
ian space; a simply connected complete space 
form is a sphere if K >0, a real Euclidean 
space if K =0, and a real hyperbolic space if 

K <0. More generally, a complete spherical 
space form of even dimension is a sphere or 

a projective space, and one of odd dimension 
is an orientable manifold. A complete 2- 
dimensional Euclidean space form is one of 
the following spaces: Euclidean plane, cylinder, 
torus, Mobius strip, *Klein bottle. Except for 
these five spaces and the 2-dimensional sphere, 
any closed surface is a 2-dimensional hyper- 
bolic space form (for details about space forms 


> [6]. 


I. Examples of Irreducible Symmetric Bounded 
Domains 


Among the irreducible symmetric Riemannian 
spaces described in Section H, those defined by 
irreducible symmetric Hermitian spaces are of 
types AHI, DIII, BDI (q = 2), and CI. We list 
the irreducible symmetric bounded domains 
that are isomorphic to the irreducible Her- 
mitian spaces defining these spaces. Positive 
definiteness of a matrix will be written >0. 

Type Inm (m' 2 mz 1). The set of all m x m' 
complex matrices Z satisfying the condition 
L,,—'ZZ »0 is a symmetric bounded domain 
in Con. which is isomorphic (as a complex 
manifold) to the irreducible symmetric Hermi- 
tian space defined by M, of type AIII (p— m, 
q=m’). 

Type II,, (m 2 2). The set of all m x m com- 
plex tskew-symmetric matrices Z satisfying the 
condition I, —'ZZ »0 is a symmetric bounded 
domain in C""^!?? corresponding to the type 
DIII (n 2 m). 

Type III, (m 2 1). The set of all m x m com- 
plex symmetric matrices satisfying the con- 
dition 1,, —'ZZ »0 is a symmetric bounded 
domain in C""*? corresponding to the type 
CI (n 2 m). This bounded domain is holomor- 
phically isomorphic to the tSiegel upper half- 
space of degree m. 

Type IV,, (mz: 1, m #2). This bounded 
domain in C" is formed by the elements 
(z,. ---, Zm) satisfying the condition |z; |? + 
Iz ROM Iz E. +z31)/2 <1, and 
corresponds to the type BDI (p=m, q=2). 

Among these four types of bounded 
domains, the following complex analytic iso- 
morphisms hold: I, , S IL; = II, x IV,, IT, S 
I, 4, IV, x IL, IV, «I, 3, IV; SI, (For 
details about these symmetric bounded 
domains — [2].) There are two more kinds of 
irreducible symmetric bounded domains, 
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which are represented as homogeneous spaces 
of exceptional Lie groups. 


J. Weakly Symmetric Riemannian Spaces 


À generalization of symmetric Riemannian 
space is the notion of weakly symmetric Rie- 
mannian space introduced by Selberg. Let M 
be a Riemannian space. M is called a weakly 
symmetric Riemannian space if a Lie sub- 
group G of the group of isometries I(M) acts 
transitively on M and there exists an element 
Lu €I(M) satisfying the relations (i) u4Gu ! = G; 
(ii) u? € G; and (iii) for any two points x, y of 
M, there exists an element m of G such that ux 
— my, uy — mx. A symmetric Riemannian space 
M becomes a weakly symmetric Riemannian 
space if we put G— [(M) and p=the identity 
transformation; as the element m in condition 
(iii) we can take the symmetry a, at the mid- 
point p on the geodesic arc joining x and y. 
There are, however, weakly symmetric Rie- 
mannian spaces that do not have the structure 
of a symmetric Riemannian space. An example 
of such a space is given by M = G = SL(2, R) 
with a suitable Riemannian metric, where yu 

is the inner automorphism defined by 


(o =) 


(Selberg [4]). On a weakly symmetric Rie- 
mannian space, the ring of all G-invariant 
differential-integral operators is commutative; 
this fact is useful in the theory of spherical 
functions (— 437 Unitary Representations). 
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413 (VII.7) 
Symmetric Spaces 


A *Riemannian manifold M is called a sym- 
metric Riemannian space if M is connected and 
if for each pe M there exists an involutive 
‘isometry o, of M that has p as an isolated 
fixed point. For the classification and the 
group-theoretic properties of symmetric Rie- 
mannian spaces — 412 Symmetric Riemann- 
ian Spaces and Real Forms. We state here 

the geometrical properties of a symmetric Rie- 
mannian space M. Let M be represented by 
G/K, a symmetric Riemannian homogeneous 
space. The tLie algebras of G and K are de- 
noted by g and f respectively. Let us denote by 
t, the fleft translation of M defined by aeG, 
and by X* the vector field on M generated by 
X eg. We denote by 0 the differential of the 
involutive automorphism of G defining G/K 
and identify the subspace m= {X ea|0(X) 

= — X } of g with the tangent space T,(M) of 
M at the origin o — K of M. The 'represen- 
tation of f on m induced from the *adjoint 
representation of g is denoted by ad,,. 


A. Riemannian Connections 


M is a complete real analytic *homogeneous 
Riemannian manifold. If M is a symmetric 
Hermitian space, it is a thomogeneous Káh- 
lerian manifold. The tRiemannian connection 
V of M ts the tcanonical connection of the 
homogeneous space G/K and satisfies Vy X* = 
[X, Y] (Yem)for each X ef and V, X* «0 
(Yem) for each X em. For each Xem, the 
curve yy of M defined by y,(t)=(exptX)o 
(t€ R) is a tgeodesic of M such that ,(0)=o0 
and ?,(0)— X. In particular, the texponen- 
tial mapping Exp, at o is given by Exp, X — 
(exp X)o (X em). For each X em, the tparal- 
lel translation along the geodesic arc y, (t) 
(0x t € tg) coincides with the differential of 
Texptox: If M is compact, for each pe M there 
exists a smooth simply closed geodesic passing 
through p. Any G-invariant tensor field on M 
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is "parallel with respect to V. Any G-invariant 
*differential form on M is closed. The Lie 
algebra b of the trestricted homogeneous 
holonomy group of M at o coincides with 


` ad, [m m]. If the group /(M) of all isometries 


of M is *semisimple, one has h= {A eal(m)| 
A:g,—0, A: R, 0] —ad,,t. Here, g, and R, 
denote the values at o of the Riemannian 
metric g and the tRiemannian curvature R of 
M, respectively, and A: is the natural action 
of A on the tensors over m. If, moreover, M 
is a symmetric Hermitian space, the value 
Jg at o of the talmost complex structure J 
of M belongs to the center of b. In general, 
b — (0) if and only if M is *flat, and b has no 
nonzero invariant on m if and only if I(M) 
is semisimple. 


B. Riemannian Curvature Tensors 


The Riemannian curvature tensor R of M is 
parallel and satisfies Ro (X, Y)— —ad,,[X, Y] 
(X, Yem). Assume that dim M z 2 in the fol- 
lowing. Let P be a 2-dimensional subspace of 
m, and (X, Y} an orthonormal basis of P with 
respect to g,. Then the *sectional curvature 
K(P) of P is given by K(P)—g,([[X, Y], X], Y). 
K =0 everywhere if and only if M is flat. If M 
is of tcompact type (resp. of tnoncompact 
type), then K 20 (resp. K <0) everywhere. 

K 70 (resp. K <0) everywhere if and only if 
the trank of M is 1 and M is of compact type 
(resp. of noncompact type). For any four 
points p, q, p', q' of a manifold M of any of 
these types satisfying d(p, q) ^ dp. q'), d being 
the *Riemannian distance of M, there exists 

a 6 € I(M) such that ¢(p)=p and $(q) - q'. 
Other than the aforementioned M's, the only 
Riemannian manifolds having this property 
are circles and Euclidean spaces. If K >0 
everywhere, any geodesic of M is a smooth 
simply closed curve and all geodesics are of the 
same length. For a symmetric Hermitian space 
M, the *holomorphic sectional curvature H 
satisfies H —0 (resp. H >0, H <0) everywhere 
if and only if M is flat (resp. of compact type, 
of noncompact type). 


C. Ricci Tensors 


The *Ricci tensor S of M is parallel. If Ø, 
denotes the restriction to m x m ofthe *Killing 
form o of g, the value S, of S at o satisfies S, = 
—19,. If M is tirreducible, it is an tEinstein 
space. 5 — 0 (resp. positive definite, negative 
definite, nondegenerate) everywhere if and 
only if M is flat (resp. M is of compact type, M 
is of noncompact type, /(M) is semisimple). If 
M is a tsymmetric bounded domain and g is 
the ‘Bergman metric of M, one has S= — g. 
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D. Symmetric Riemannian Spaces of 
Noncompact Type 


Let M be of noncompact type. For each pe M, 
p is the only fixed point of the tsymmetry o,, 
and the exponential mapping at p is a diffeo- 
morphism from 7,(M) to M. In particular, M 
is diffeomorphic to a Euclidean space. For 
each pair p, qe M, a geodesic arc joining p 
and q is unique up to parametrization. For 
each pe M there exists neither a ‘conjugate 
point nor a *cut point of p. If M is a symmetric 
Hermitian space, that is, if it is a symmetric 
bounded domain, then it is a *Stein manifold 
and holomorphically homeomorphic to a 
*Siegel domain. 


E. Groups of Isometries 


The isotropy subgroup at o in I(M) is denoted 
by J,(M). Then the smooth mapping I,(M) x 
m (M) defined by the correspondence ¢ x 
X e» $1, x Is surjective, and it is a diffeo- 
morphism if M is of noncompact type. If M 

is of noncompact type, /(M) is isomorphic to 
the group A(q) of all automorphisms of g in a 
natural way, and I,(M) is isomorphic to the 
subgroup A(g, f)={¢e A(g)] &(D —tj of A(g), 
provided that G acts almost effectively on M. 
Moreover, in this case the center of the iden- 
tity component J/(M)? of I(M) reduces to the 
identity, and the isotropy subgroup at a point 
in I(M)? is a maximal compact subgroup of 
I(MY?. If I(M) is semisimple, any element of 
I(MY? may be represented as a product of an 
even number of symmetries of M. In the fol- 
lowing, let M be a symmetric Hermitian space, 
and denote by A(M) (resp. H(M)) the group of 
all holomorphic isometries (resp. all holomor- 
phic homeomorphisms) of M, and by A(M)? 
and H(MY their identity components. All 
these groups act transitively on M. If M is 
compact or if I(M) is semisimple, one has 
A(M)° =I(M}°. If I(M) is semisimple, M is 
simply connected and the center of I(M)? 
reduces to the identity. If M is of compact 
type, M is a trational tprojective algebraic 
manifold, and H(M)? is a complex semisimple 
Lie group whose center reduces to the identity, 
and it is the tcomplexification of /(M)?. In 
this case, the isotropy subgroup at a point in 
H(MY is a tparabolic subgroup of H(M)?. If 
M is of noncompact type, one has H(M)? = 
I(M)°. In the following we assume that G is 
compact. 


F. Cartan Subalgebras 


A maximal Abelian fLie subalgebra in m is 
called a Cartan subalgebra for M. Cartan sub- 
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algebras are conjugate to each other under the 
*adjoint action of K. Fix a Cartan subalge- 
bra a and introduce an inner product ( , ) 

on a by the restriction to a x a of g,. For an 
element « of the dual space a* of a, we put 
m, = {X em|[(H,[H, X]]= —a(H)y X for any 
Hea}. The subset 2 = (xea* — {0} |m, 4 {0}} 
of a* is called the root system of M (relative to 
a). We write m, 2 dim m, for xe X. The subset 
D={Hea|a(H)exZ for some «e Xj of ais 
called the diagram of M. A connected compo- 
nent of a — D is called a fundamental cell of M. 
The quotient group W of the normalizer of a 
in K modulo the centralizer of a in K is called 
the Weyl group of M. W is identified with a 
finite group of orthogonal transformations of 
a. 


G. Conjugate Points 


For a geodesic arc y with the initial point o, 
any *Jacobi field along y that vanishes at o and 
the end point of y is obtained as the restriction 
to y of the vector field X* generated by an 
element X et. For Hea—{0}, Exp, H isa 
conjugate point to o along the geodesic yy if 
and only if «(H)e zZ — {0} for some «eZ. In 
this case, the multiplicity of the conjugate 
point Exp, H is equal to 3 Dyer, «unesz - (1 Ma- 
From this fact and Morse theory (— 279 
Morse Theory), we get a tcellular decompo- 
sition of the tloop space of M. The set of all 
points conjugate to o coincides with K Exp, D 
and is stratified to a disjoint union of a finite 
number of connected regular submanifolds 
with dimension « dim M — 2. 


H. Cut Points 


We define a ‘lattice group F of a by P= 
{Aea|Exp, A=o}, and put C,={Hea| 

Max «er (9i 2(H, A)/(A, A) 2 1}. Then, for He 
a— {0}, Exp, H is a cut point of o along the 
geodesic yy if and only if H € C,. The set C, of 
all cut points of o coincides with K Exp, C, 
and is stratified to a disjoint union of a finite 
number of connected regular submanifolds 
with dimension x dim M — 1. The set of all 
points ‘first conjugate to o coincides with C, if 
and only if M is simply connected. 


I. Fundamental Groups 


Let I, denote the subgroup of a generated by 
1 Qx/(x, a))a | ae X ), identifying a* with a by 
means of the inner product ( , ) of a. This is a 
subgroup of T. We regard T as a subgroup of 
the group 7(a) of all motions of a by parallel 
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translations. The subgroup W = WT of I(a) 
generated by I and the Weyl group W is 
called the affine Weyl group of M. W leaves 
the diagram D invariant and acts transitively 
on the set of all fundamental cells of M. Take a 
fundamental cell ø such that its closure c 
contains 0, and put W, — (weW |w(c)-— c]. 
Then the fundamental group z,(M) of M is an 
'Abelian group isomorphic to the groups W, 
and CT, z, (M) is a finite group if and only if 
M is of compact type. In this case, the order 
of 1, (M) is equal to the cardinality of the set 
TNG as well as to the index [T Tel Moreover, 
if we denote by W* the group W, for the 
symmetric Riemannian space M* — G*/K* de- 
fined by the *adjoint group G* of G and K* = 
fae G* | a0 — 0a), then W, is isomorphic to a 
subgroup of W*. If M is irreducible, W* is 
isomorphic to a subgroup of the group of all 
automorphisms of the *extended Dynkin dia- 
gram of the root system 2. 


J. Cohomology Rings 


Let P(g) (resp. P(f)) be the tgraded linear space 
of all tprimitive elements in the tcohomology 
algebra H(q) of g (resp. H(t) of D), and P(g, t) 
the intersection of P(g) with the image of the 
natural homomorphism Ha, t)  H(g), where 
H (a, f) denotes the relative cohomology alge- 
bra for the pair (g, f). Then one has dim P(g, 1) 
4- dim P(f) 2 dim P(g). Denote by A P(g, f) the 
exterior algebra over P(g, t). The tgraded 
algebra of all G-invariant polynomials on g 
(resp. all K-invariant polynomials on f) is 
denoted by /(G) (resp. I(K)), where the de- 
gree of a homogeneous polynomial with de- 
gree p is defined to be 2p. We denote by I ' (G) 
the ideal of /(G) consisting of all fe I(G) such 
that f(0) —0, and regard I(K) as an I ' (G)- 
module through the restriction homomor- 
phism. Then the treal cohomology ring H(M) 
of M is isomorphic to the tensor product 
AP(g, Ð & U(K)/1 * (GH(K)). If K is connected 
and the t Poincaré polynomials of P(g), P(t), 
and P(g,f) are Df, 0", 33,1?" and 
Ei, UPC. respectively, then the Poincaré 
polynomial of H(M) is given by IT li 
p TR Tr IBa(b-08- 
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Systems of Units 


A. International System of Units 


Units representing various physical quantities 
can be derived from a certain number of fun- 
damental (base) units. By a system of units we 
mean a system of fundamental units. Various 
systems of units have been used in the course 
of the development of physics. Today, the 
standard is set by the international system of 
units (systéme international d'unités; abbre- 
viated SI) [1], which has been developed in 
the spirit of the meter-kilogram system. This 
system consists of the seven fundamental units 
listed in Table 1, units induced from them, and 
unit designations with prefixes representing 
the powers of 10 where necessary. It also con- 
tains two auxiliary units for plane and solid 
angles, and a large number of derived units 


[1]. 


B. Systems of Units in Mechanics 


Units in mechanics are usually derived from 
length, mass, and time, and SI uses the meter, 
kilogram, and second as base units. Neither 
the CGS system, derived from centimeter, 
gram, and second, nor the system of gravita- 
tional units, derived from length, force, and 
time, are recommended for general use by 
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Table 1 
Quantity 
Length 


Mass 


Time 


Intensity of 
electric current 


Temperature 
Amount of 
substance 


Luminous 
intensity 


Table 2 


Quantity 


Frequency 
Force 


Pressure and stress 


SI unit 


meter 


kilogram 


second 


ampere 


kelvin 


mole 


candela 


Work, energy, quantity of heat 


Power 


Quantity of electricity 
Electromotive force, potential 


difference 


Electric capacitance 
Electric resistance 


Electric conductance 
Flux of magnetic induction 


magnetic flux 


Magnetic induction, magnetic 


flux density 
Inductance 
Luminous flux 
[luminance 
Activity 
Adsorbed dose 
Radiation dose 


Symbol 


kg 


mol 


cd 
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Description 


The meter is the length equal to 1,650,763.73 wave- 
lengths in vacuum of the radiation corresponding 
to the transmission between the levels 2p!? and 5d? 
of the krypton-86 atom. 

The kilogram is equal to the mass of the interna- 
tional prototype of the kilogram. 

The second is the duration of 9,192,631.770 periods 
of the radiation corresponding to the transmission 
between the two hyperfine levels of the ground 
state of the cesium-133 atom. 

The ampere is the intensity of the constant current 
maintained in two parallel, rectilinear conductors 
of infinite length and of negligible circular section, 
placed 1 m apart in vacuum, and producing a force 
between them equal to 2 x 10^" newton (m: kg:s ?) 
per meter of length. 

The kelvin, the unit of thermodynamical tempera- 
ture, is 1/273.16 of the thermodynamical tempera- 
ture of the triple point of water. 

The mole is the amount of substance of a system 
containing as many elementary entities as there are 
atoms in 0.012 kg of carbon-12. 

The candela is the luminous intensity in a given 
direction of a source emitting monochromatic 
radiation of frequency 540 x 10!? hertz (=s~'), the 
radiant intensity of which in that direction is 1/683 
watt per steradian. (This revised definition of 
candela was adopted in 1980.) 





SI unit 


hertz 
newton 
pascal 
joule 


watt 


coulomb 


volt 


farad 


ohm 


siemens 
weber 


tesla 


henry H 
lumen im 


lux 


becquerel Bq 


Bray 


sievert Sv 


Unit in terms of SI 
base or derived 
units 


1Hz=1s'! 

1 N= 1 kg: m/s? 
1 Pa- 1 N/m? 
1J=1N-m 

1 W=1J/s 
1C=ł} A's 

1 V=1W/A 


F I F=1 C/V 
Q 1 Q-1 V/A 
S 1S-10^! 
Wb 1 Wb=1V-s 


T 1 T-1 Wb/m? 


1 H=1 Wb/A 
1 Im=1 ed: sr 
Ix 1 1x:2 1 Im/m? 
1 Bq-1s^! 

1 Gv=1 J/kg 
1 Sv=1 J/kg 
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the SI Committee. Besides the base units, 
minute, hour, and day, degree, minute, and 
second (angle), liter, and ton have been ap- 
proved by the SI Committee. Units such as the 
electron volt, atomic mass unit, astronomical 
unit, and parsec (not SI) are empirically de- 
fined and have been approved. Several other 
units, such as nautical mile, knot, are (area), 
and bar, have been provisionally approved. 


C. System of Units in Thermodynamics 


The base unit for temperature is the degree 
Kelvin (^K; formerly called the absolute tem- 
perature). Degree Celsius (°C), defined by t= 

T —273.15, where T is in °K, is also used. 

The unit of heat is the joule J, the same as the 
unit for other forms of energy. Formerly, one 
calorie was defined as the quantity of heat that 
must be supplied to one gram of water to raise 
its temperature from 14.5°C to 15.5°C; now 
one calorie is defined by 1 cal 2 4.1855 J. 


D. Systems of Units in Electricity and 
Magnetism 


Three distinct systems of units have been 
developed in the field of electricity and mag- 
netism: the electrostatic system, which origi- 
nates from Coulomb's law for the force be- 
tween two electric charges and defines mag- 
netic quantities by means of the Biot-Savart 
law; the electromagnetic system, which origi- 
nates from Coulomb's law for magnetism; and 
the Gaussian system, in which the dielectric 
constant and permeability are taken to be non- 
dimensional. At present, however, the rational- 
ized MKSA system of units is adopted as the 
international standard. It uses the derived units 
listed in Table 2 (taken from [2]), where the 
derived units with proper names in other fields 
are also listed. 


E. Other Units 


In the field of photometry, the following defi- 
nition was adopted in 1948: One candela (cd) 
(20.98 old candle) is defined as 1/(6 x 105) of 
the luminous intensity in the direction normal 
to a plane surface of 1 m? area of a black body 
at the temperature of the solidifying point of 
platinum. The total luminous flux emanating 
uniformly in all directions from a source of 
luminous intensity | cd is defined as 4x lumen 
(Im). One lux (Ix) is defined as the illuminance 
on a surface area of 1 m? produced by a lumi- 
nous flux of 1 ed uniformly incident on the 
surface. In 1980, the definition was revised as 
shown in Table 1. 
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For theoretical purposes, a system of units 
called the absolute system of units is often 
used, in which units of mass, length, and time 
are chosen so that the values of universal 
constants, such as the universal gravitational 
constant, speed of light, Planck's constant, and 
Boltzmann's constant, are equal to 1. 
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415 (XXI.41) 
Takagi, Teiji 


Ten Takagi (April 21, 1875—February 28, 
1960) was born in Gifu Prefecture, Japan. 
After graduation from the Imperial University 
of Tokyo in 1897, he continued his studies in 
Germany, first with Frobenius in Berlin and 
then with Hilbert in Góttingen. He returned 
to Japan in 1901 and taught at the Imperial 
University of Tokyo until 1936, when he re- 
tured. He died in Tokyo of cerebral apoplexy. 

Since his student years he had been inter- 
ested in Kronecker's conjecture on *Abelian 
extensions of imaginary quadratic number 
fields. He solved it affirmatively for the case of 
QC/—-1 ) while still in Gottingen and presented 
this result as his doctoral thesis. During World 
War I, he pursued his research in the theory of 
numbers in isolation from Western countries. 
It developed into ‘class field theory, a beautiful 
general theory of Abelian extensions of alge- 
braic number fields. This was published in 
1920, and was complemented by his 1922 
paper on the freciprocity law of power residues 
and then by tArtin’s general law of reciprocity 
published in 1927. Besides these arithmetical 
works, he also published papers on algebraic 
and analytic subjects and on the foundations 
of the theories of natural numbers and of real 
numbers. His book (in Japanese) on the his- 
tory of mathematics in the 19th century and 
his General course of analysis (also in Japanese) 
as well as his teaching and research activities 
at the University exercised great influence on 
the development of mathematics in Japan. 
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Consider the set M, consisting of the con- 
formal equivalence classes of closed Riemann 
surfaces of genus g. In 1859 Riemann stated, 
without rigorous proof, that M, is parame- 
trized by m(g) (201£g 20, =1 if g=1, =3g—3 
if g z 2) complex parameters (— 11 Algebraic 
Functions). Later, the introduction of a topol- 
ogy and m(g)-dimensional complex structure 
on M, were discussed rigorously in various 
ways. The following explanation of these 
methods is due to O. Teichmüller [1, 2], L. V. 
Ahlfors [3, 4], and L. Bers [5-7]. For the 
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algebraic-geometric approach — 9 Algebraic 
Curves. 

The trivial case g =0 is excluded, since My 
consists of a single point. Take a closed Rie- 
mann surface Ry of genus g 2 1, and consider 
the pairs (R, H) consisting of closed Riemann 
surfaces 91 of the same genus g and the tho- 
motopy classes H of orientation-preserving 
homeomorphisms of Ry into R. Two pairs 
(R, H) and (R', H^) are defined to be con- 
formally equivalent if the homotopv class 
H'H '! contains a conformal mapping. The set 
T, consisting of the conformal equivalence 
classes (9, HY is called the Teichmüller space 
(with center at Ro). Let H, be the group of 
homotopy classes of orientation-preserving 
homeomorphisms of Ro onto itself. $, is a 
transformation group acting on T, in the sense 
that each ye $, induces the transformation 
CR, H» CR, Hy». It satisfies T,/H, = M,. The 
set 3, of elements of §, fixing every point of T, 
consists only of the unity element if g 2 3 and 
is a normal subgroup of order 2 if g — 1, 2. For 
the remainder of this article we assume that 
g Z2. The case g — 1 can be discussed similarly, 
and the result coincides with the classical one: 
T, can be identified with the upper half-plane 
and $,/3, is the tmodular group. 

Denote by B(9R,) the set of measurable 
invariant forms udz dz ^! with |||, <1. For 
every ue B(R,) there exists a pair (R, H) for 
which some he H satisfies hz = uh, (— 352 
Quasiconformal Mappings). This correspon- 
dence determines a surjection ue B(R,)— 
<R, H» cT,. Next, if Q(915) denotes the space 
of holomorphic quadratic differentials o dz? 
on Ry, a mapping ue B(815).5 PEQ(Ro) is 
obtained as follows: Consider u on the uni- 
versal covering space U (= upper half-plane) 
of Ry. Extend it to U* (=lower half-plane) by 
setting 4 =0, and let f be a quasiconformal 
mapping f of the plane onto itself satisfying 
f; uf,. Take the 'Schwarzian derivative p= 
Lf. zt of the holomorphic function f in U*. 
The desired ¢ is given by o(z)=w(z) on U. It 
has been verified that two u induce the same 
ọ if and only if the same CH HY corresponds 
to u. Consequently, an injection CH. He 
T, o € Q(98,) is obtained. Since Q(8,) — 
C"'? by the Riemann-Roch theorem, this in- 
jection yields an embedding T, c C", where 
T, is shown to be a domain. 

As a subdomain of C”, the Teichmüller 
space is an m(g)-dimensional complex analytic 
manifold. It is topologically equivalent to the 
unit ball in real 2m(g)-dimensional space and 
is a bounded tdomain of holomorphy in C"'9, 

Let (2,,...,X5,) be a 1-dimensional ho- 
mology basis with integral coefficients in Ro 
such that the intersection numbers are (a;, a) 
= (8, 4i, 54.) =9, (%;, Ga deër, i, j— 1, ...,g. 
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Given an arbitrary (9t, H» €T,, consider the 
tperiod matrix Q of R with respect to the 
homology basis Ha,,..., Ha,, and the basis 
€, ..., €, Of *Abelian differentials of the first 
kind with the property that |,,,,@;=6,. Then Q 
is a holomorphic function on T,. Furthermore, 
the analytic structure of the Teichmüller space 
introduced previously is the unique one (with 
respect to the topology defined above) for 
which the period matrix is holomorphic. 

§, is a properly discontinuous group of 
analytic transformations, and therefore M, is 
an m(g)-dimensional normal fanalytic space. 
$5, is known to be the whole group of the 
holomorphic automorphisms of T, (Royden 
[8] thus T, is not a tsymmetric space. 

To every point t of the Teichmüller space, 
there corresponds a Jordan domain D(t) in the 
complex plane in such a way that the fiber 
space F,— {(t, z)zeD(x), ce T, c C"'?) has the 
following properties: F, is a bounded domain 
of holomorphy of C" 9*1, It carries a properly 
discontinuous group (5, of holomorphic auto- 
morphisms, which preserves every fiber D(z) 
and is such that D(1)/6, is conformally equiva- 
lent to the Riemann surface corresponding to 
t. F, carries holomorphic functions Fir, 2), 
j=l,...,5g—5 such that for every z the func- 
tions F//F,, j—2, ...,5g —5 restricted to Dir 
generate the meromorphic function field of the 
Riemann surface D(t)/G,. 

By means of the textremal quasiconformal 
mappings, it can be verified that T, is a com- 
plete metric space. The metric is called the 
Teichmiiller metric, and is known to be a 
Kobayashi metric. 

The Teichmiiller space also carries a natu- 
rally defined Kahler metric, which for g=1 
coincides with the tPoincaré metric if T, is 
identified with the upper half-plane. The tRicci 
curvature, tholomorphic sectional cruvature, 
and ‘scalar curvature are all negative (Ahlfors 
[9]). 

By means of the quasiconformal mapping 
f, which we considered previously in order 
to construct the correspondence 4t ø, it is 
possible to regard the Teichmüller space as 
a space of quasi-Fuchsian groups (— 234 
Kleinian Groups). To the boundary of T,, it 
being a bounded domain in C”™™, there corre- 
spond various interesting Kleinian groups, 
which are called tboundary groups (Bers [10], 
Maskit [11]). 

The definition of Teichmiiller spaces can be 
extended to open Riemann surfaces Ry and, 
further, to those with signatures. A number of 
propositions stated above are valid to these 
cases as well. In particular, the Teichmiiller 
space for the case where R, is the unit disk is 
called the universal Teichmüller space. It is a 
bounded domain of holomorphy in an infinite- 
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dimensional Banach space and is a symmetric 
space. Every Teichmüller space is a subspace 
of the universal Teichmüller space. 
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A. General Remarks 


In a tdifferentiable manifold with an affine 
connection (in particular, in a *Riemannian 
manifold), we can define an important opera- 
tor on tensor fields, the operator of covariant 
differentiation. The tensor calculus is a differ- 
ential calculus on a differentiable manifold 
that deals with various geometric objects and 
differential operators in terms of covariant 
differentiation, and it provides an important 
tool for studying geometry and analysis on a 
differentiable manifold. 
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B. Covariant Differential 


Let M be an n-dimensional smooth manifold. 
We denote by (M) the set of all smooth 
functions on M and by Xz(M) the set of all 
smooth tensor fields of type (r,s) on M. X)(M) 
is the set of all smooth vector fields on M, and 
we denote it simply by X(M). 

In the following we assume that an affine 
connection V is given on M. Then we can 
define the covariant differential of tensor fields 
on M with respect to the connection (— 80 
Connections). We denote the covariant deriva- 
tive of a tensor field K 1n the direction of a 
vector field X by V, K and the covariant dif- 
ferential of K by VK. The operator V, maps 
X'(M) into itself and has the following 
properties: 

(1) Vy, y 2 Vy + Vy, Vex — f Vy, 

(2) Vy(K - K') Z VK - V. K', 

(3) Vy(K © K') - (Vy K) & K' +K (V K)), 

(4) V. f= Xf. 

(5) Vy commutes with contraction of tensor 
fields, where K and K' are tensor fields on M, 
X, Ye X(M) and Teil 

The torsion tensor T and the curvature ten- 
sor R of the affine connection V are defined by 


T(X, Y)- V,Y—-V, X —LX, Y], 

for vector fields X, Y, and Z. The torsion ten- 
sor is of type (1, 2), and the curvature tensor is 
of type (1, 3). Some authors define — R as the 
curvature tensor. We here follow the conven- 
tion used in [1-6], while in [7,8] the sign of 
the curvature tensor is opposite. The torsion 


tensor and the curvature tensor satisfy the 
identities 


T(X,Y)- — T(Y, X, R(X, Y)- — R(Y, X), 
R(X, Y)Z + R(Y,Z) X + R(Z, X)Y 
=(Vy T)(Y, Z) + (V, T)(Z, X) c (V T)(X, Y) 
+ T(T(X, Y), Z) + T(T(Y, Z), X) 
t T(T(Z, X), Y), 
(Vx R)(Y, Z) + (Vy R(Z, X) + (Vz R)(X, Y) 
- R(X, T(Y, Z))+ R(Y, T(Z, AH 
t R(Z,T(X, Y). 


The last two identities are called the Bianchi 
identities. 

The operators V, and V, for two vector 
fields X and Y are not commutative in general, 
and they satisfy the following formula, the 
Ricci formula, for a tensor field K: 


Vx(Vy K) — Vy (Vy K) - Vy, vk =R(X, Y) K, 


where in the right-hand side R(X, Y) is re- 
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garded as a derivation of the tensor algebra 
Y,.X(M). 

A moving frame of M on a neighborhood U 
is, by definition, an ordered set (e,,...,e,) of n 
vector fields on U such that e,(p), ..., e,(p) are 
linearly independent at each point pe U. For 
a moving frame (e,, ...,e,) of M on a neigh- 
borhood U we define n differential 1-forms 
0*, ...,0" by 0'(e)) éi, and we call them the 
dual frame of (e,, ...,e,). For a tensor field K 
of type (r,s) on M, we define n" ^ functions 
Kj'7t on U by 
Ky7r-Kt(e;, ON ERN) 
and call these functions the components of K 
with respect to the moving frame (e,, ...,e,). 

Since the covariant differentials Ve; are 
tensor fields of type (1, 1), n? differential 1- 
forms oj are defined by 


EM 
Vej— 0j Q e;, 


where in the right-hand side (and throughout 
the following) we adopt Einstein's summation 
convention: If an index appears twice in a term, 
once as a superscript and once as a subscript, 
summation has to be taken on the range of 
the index. (Some authors write the above 
equation as de;— «je; or De;— «je;.) We call 
these 1-forms cj the connection forms of the 
affine connection with respect to the moving 
frame (e,, ...,e,). The torsion forms ON and 
the curvature forms Qj are defined by 


G/2d0'oiA0,  Qizdoitoj^o. 
These equations are called the structure equa- 
tion of the affine connection. V. If we denote 
the components of the torsion tensor and the 
curvature tensor with respect to (e;, ...,e,) by 
TA and Ri, (—0'(R(e,, e)e;)), respectively, 
then they satisfy the relations 


< il 
P. 


TENES 
Ge all ^ 0*, HR A 0". 


Using these forms, the Bianchi identities are 
written as 


dO! + i ^ O= ^ 07, 
d Ae ^ OF — of AO, — 0. 


Let K be a tensor field of type (r,s) on M 
and K?'7 be the components of K with re- 
spect to (e,, ..., e,). We define the covariant 
differential DK "7 and the covariant deriva- 
tive Kj by 

f 
is, Eet Aka iU at ie 
DK} = Ki Ok dK + 2. Kj, 
Ges 


Ji: JAeejek Js 


s - . 
D igst a 
Serato 
v-]1 
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Then K^? , are the components of VK with 
respect to the moving frame (e,,...,e n): Some 
authors write WEI: s instead of Ki z L9, 6]. 

Using components ‘the Bianchi identities 
are written as 


Rh, + Rit Russ Tix + Tj. i+ 20. 


J 


LT + TETA + Th T 


tj? 


Rl. pt Ria. j yt Ri. k^ ERE, b F Ri T; k+ Ri Tg- 


The Ricci formula is written as 


Kh- d =? Ric, Ku 


Ji- KÉ e Kj. Jedk7 dices ds 


— 


Jy sed 


Let (x!, ..., x") be a local coordinate system 
defined on a neighborhood U of M. Then 


(6/0x!, ..., 6/0x") is a moving frame of M on U, 


and we call it the natural moving frame asso- 
ciated with the coordinate system (x!, ..., x"). 
Components of a tensor field with respect to 
the natural moving frame (0/Ox!, ..., 0/0x") are 
often called components with respect to the 
coordinate system (x!, ..., x"). We define an n? 
function T}; on U by oj — dest, where wj are 
the connection forms for the natural moving 
frame. I; are called the coefficients of the 
affine connection V. The components of the 
torsion tensor and the curvature tensor with 
respect to (x! x") are given by 


i pi i 
=li lk 


Bi — 6, du Ó Ij + Tala — TAM 


where 0; — 0/Ox' 

With respect to the loregomp coordinate 
system, the components Kj ry of the covar- 
iant differential VK of a uso field K of type 
(r, s) are given by 


Jis 4s 


Ki d = Kin rap Kin: ssip 


Se a Gi 
2. TS Kj uae 


C. Covariant Differential of Tensorial Forms 


A tensorial p-form of type (r,s) on a manifold 
M isan alternating SW(M)-multilinear mapping 


of X(M) x ... x X(M) to XM). A tensorial p- 
form of type (0,0) is a differential p-form in the 
usual sense. A tensorial p-form of type (1,0) is 
often called a vectorial p-form. 

If an affine connection V is provided on M, 
we define the covariant differential of tensorial 
forms. Let « be a tensorial p-form of type (r, s). 
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The covariant differential Da of « is a tensorial 
(p+ 1)-form of type (r,s) and is defined by 


(p+ UDa(X,, ... X44) 


p*1 


zy (oi) Wyle S 


i=1 


dde Yal CX; Kl 


E? 


Ee E E prih 


J 


where X, means that X; is deleted. If x is of 
type (0,0), Dæ coincides with the usual exterior 
differential da. 

The simplest example of a tensorial form is 
the identity mapping of X(M), which will be 
denoted by 0. Some authors write this vec- 
torial form as dp or dx, where p or x expresses 
an arbitrary point of a manifold. We call 0 the 
canonical vectorial form of M. The torsion 
tensor T can be regarded as a vectorial 2-form, 
and we have 2D0 = T. The curvature tensor R 
can be regarded as a tensorial 2-form of type 
(1, 1), i.e., (X, Y)  R(X, Y)e X1 (M), and the 
Bianchi identities are written as DT — R ^ 0, 
DR —0, where the exterior product R ^ x of R 
and a tensorial p-form « is defined by 


(p+ I)(p+2)(R A al, Xp+2) 
= 2 (-AY  ! ROG, X)o(X,, ..., €... X, 
i<j 


X,«2)- 


In general, 2D?^x =R ^ « holds for an arbitrary 
tensorial form x. 

Let (e,, ...,e,) be a moving frame of M ona 
neighborhood U and 0!, ..., 0" be its dual 
frames. A tensorial p-form « of type (r, s) is 
written as 


aor Ge, 8... Oe, OH @... G 05, 


on U, where the o7 are the usual differential 
p-forms on U. We call them the components of 
x with respect to (e, ...,e,). Then the compo- 
nents of Da, which we denole by Daji i ,are 
given by 


Iul, Yd, rin 
Dau t= day itor A Qj 


S 
= 
Then we have 


r 
Dia ie ty 2 Qi: ^ ol = X Qe Ao 


s Js Jv dies 
v=1 


This is an expression of 2D?a=R ^ x in terms 
of components. The components of the ca- 
nonical vectorial form 0 are the dual forms 
d... Op of (e,, ..., e,), and we have DI — G5, 
which means that the components of D0 are 
the torsion forms Or. 
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D. Tensor Fields on a Riemannian Manifold 


Let (M, g) be an n-dimensional Riemannian 
manifold (— 364 Riemannian Manifolds). The 
fundamental tensor g defines a one-to-one 
correspondence between vector fields and 
differential 1-forms. A differential 1-form « 
which corresponds to a vector field X is de- 
fined by «(Y)— g(X, Y) for any vector field Y. 
This correspondence is naturally extended to a 
one-to-one correspondence between X7(M) and 
X7 (M), where r - se r' +s’. Let (e,,...,e,) bea 
moving frame of M on a neighborhood U and 
gı; be the components of g with respect to the 
moving frame. Let (oi be the inverse matrix of 
the matrix (g;). The g} are the components of 
a symmetric contravariant tensor field of order 
2. Let X! be the components of a vector field X 
and a, be the components of the differential 1- 
form a corresponding to X. Then X‘ and a; 
satisfy the relations à; — g; X? and X'— gn: If 
Ki, are the components of a tensor field K of 
type (1, 2) (here taken for simplicity), then 

K nij = KiGan> KI = Kgg”, 


hij h „ai bj 
K™=Kig"g”, ..-, 


are the components of a tensor field of type 
(0, 3), (2, 1), (3, 0), ..., respectively, all of which 
correspond to K. We call this process of ob- 
taining the components of the corresponding 
tensor fields from the components of a given 
tensor field raising the subscripts and lowering 
the superscripts by means of the fundamental 
tensor g. 

On a Riemannian manifold, we use the 
*Riemannian connection, unless otherwise 
stated. The covariant derivative with respect to 
the Riemannian connection is given by 


for vector fields X, Y, and Z. The coefficients 
of the Riemannian connection with respect to 
a local coordinate system (x!, ..., x") are usu- 
ally written as (j;), called the Christoffel sym- 
bols, which are given by (4 = 9"(0,9;, + gra — 
0,9,j)/2. The curvature tensor R of the Rie- 

mannian connection satisfies the identities 


R(X, Y)Z + R(Y, Z) X - R(Z, X) Y -0, 
(Vx R)(Y, Z) + (Vy R)(Z, X)+(V,R)(X, Y)=0, 
R(X, Y) 2 —R(Y, X), 
g(R(X, Y)Z, W)=g(R(Z, W)X, Y) 
= —g(Z, R(X, Y)W), 
g(R(X, Y)Z, W) - g(R(X, Z)W, Y) 
4 g(R(X, W)Y, Z) - 0. 
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In terms of the components, these identities 
are 


Ri, Ri Rog 0, 

Ri. T Réi, + Bä =0, 

Rix aa Ri, Run = Rau i Ram, 
Run + Rat Raxy 0, 


where Run = Rita, 

The *Ricci tensor $ of the Riemannian 
manifold is a tensor field of type (0. 2) defined 
by 


S(X, Y)= trace of the mapping Zo R(Z, X) Y 


for vector fields X and Y. The components S;; 
of the Ricci tensor are given by S,;== Ri, The 
*scalar curvature k of the Riemannian mani- 
fold M is a scalar on M defined by k — g/S;. 
The Ricci tensor and the scalar curvature 
satisfy the identities 


S(X,Y)2S(Y, X) or Si = S; 


Uy? 


Sij. — Su, j = Rikja Jobs, — Ok. 

For a moving frame of a Riemannian mani- 
fold, it is convenient to use an orthonormal 
moving frame. A moving frame (e,, ...,e,) is 
orthonormal if e, , ..., e, satisfy g(e;, ej) = dr, 
Since the components of the fundamental 
tensor with respect to an orthonormal moving 
frame are A, raising or lowering the indices 
does not change the values of the components. 
Some authors write all the indices as sub- 
scripts. Also they write the dual 1-forms, the 
connection forms, and the curvature forms as 
0;, co, and Q, respectively, instead of 0, cj, 
and Q). With respect to an orthonormal mov- 
ing frame, the connection forms oj and the 


curvature forms (LN satisfy 
oj+oj=0 and Qi+Qj=0. 


On a Riemannian manifold, the divergence 
of a vector field and the operators d, ô, and A 
on differential forms (— 194 Harmonic In- 
tegrals) can be expressed by using the covar- 
iant derivatives with respect to the Riemann- 
ian connection. 

If X! are the components of a vector field 
X with respect to a local coordinate system 
(x, ..., x"), the divergence div X of X is given 
by div X =X", 

Let « be a differential p-form on M. a is 
written locally in the form a —(1/p!)u; dei: ^ 
... ^ dx'», where the coefficients 3. A are skew- 
symmetric in all the indices. We call o; ; 
the components of æ with respect to the co- 
ordinate system. Since « is regarded as an 
alternating tensor field of type (0, p), we can 
define the covariant differential Va of a. 
Then the components of da, da, and Aw are 
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given by 
pti e 
(da), un 2. Ch SE Du 
ab 
(ëch. ipa "H Uu qub 


(ak. -ip se^ gl Hr. -ip ab T Zä e iios, 
E YR aisin% —! 


Ucw 


For a smooth function f and a differential 1- 
form fj we have 


1 2: 
Af- yu a af) 
g 


(Ab) = — g” [Bras 
where g — det(g;). 


Sia DL 


E. Van der Waerden- Bortolotti Covariant 
Differential 


Let E be a finite dimensional smooth tvector 
bundle over a smooth manifold M and T(E) be 
an %(M)-module of all smooth sections of E. A 
connection V' in E is a mapping of X(M) x 
T(E) to I(E) such that 


(1) Vi +M =Vee + Vien, 
(2) V, (f£) Xf E LVS, 
(3) Vj v6 m VE Vo, 

(4) Vj C f VG, 


for X, YeX(M), Zuel (E), and few(M). V,£ 
is called the covariant derivative of & in the 
direction X. 

An element K of ¥5(M) © T(E) is called a 
tensor field of type (r, s) with values in E (or 
simply an E-valued tensor field of type (r,s)). K 
can be regarded as an %(M)-linear mapping of 
Xi(M) to I(E) or an &(M)-multilinear map- 

s 


" TG 
ping of X(M) x... x X(M) to (M) © T(E). 
For a given £e I (E), a mapping X 5 V,£ de- 
fines a tensor field of type (0, 1) with values in 
E which we call the covariant differential of é, 
denoted by V’é. 

The curvature tensor R' of V' is a tensor 

field of type (0, 2) with values in E* ® E (E* is 
the dual vector bundle of E), and is defined by 


R(X, Y)é Maike Cl - WV eo) — Vix. ie 


for any vector fields X and Y and any Zel (E). 

If an affine connection V is given on M, we 
can define the van der Waerden- Bortolotti 
covariant derivative V, K for V and V' of a 
tensor field K of type (r,s) with values in E. It 
is defined by 


(Vr K)(S)=Vx(K(S))— 
for any SeXi(M). If we regard £eT (E) as an E- 


K(V,S) 
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valued tensor field of type (0,0), we have Vois 
VE The covariant derivative V, R’ of the 
curvature tensor R’ of V' is a tensor field of 
type (0, 2) with values in ES © E is defined by 


(Vx R)(Y, Z)é = V((R'CY, Z)E)— R'(Vx Y, Z)£ 


—R'(Y, Vy Z) — R(Y, Z)V x6. 
The Bianchi identity is written as 


(V, R)(Y, Z)+ (V; R')(Z, X)+(VzR)(X, Y) 


— R'(X, T(Y, Z)) - R'(Y, T(Z, X)) 
t R'(Z, T(X, Y)), 


where T is the torsion tensor of V. The Ricci 
formula is given by 


(Ve (Vy K) (S) - (Vy (Vx K) (S) -(Vix, y, K)(S) 
—R'(X, Y): K(S)—K(R(X, Y) S), 


where R is the curvature tensor of V, Ke 
X'(M) O T(E) and SeX;(M). 

In the following we assume that the fiber of 
E is of finite dimension m. A moving frame of 
E on a neighborhood U of M is an ordered set 
(£i, ..., Gel Of local sections £,,..., Ge on U such 
that ¢,(p),...,€,,(p) are linearly independent at 
each point p of U. Let (e,,...,e,) be a moving 
frame of M on U. Then an E-valued tensor 
field K of type (r, s) is locally written as 


Kier, Gene, OOO Ey 


where 6!, ..., 0" are the dual 1-forms of (e,, 

e,). The n’**m functions Kom on U are 
called the components of K with respect to 
(e, ..., Gul and (€,,...,&,,). We define the con- 
nection forms ws‘ of the connection V' by VC, 
— 0g GC Then the curvature forms Qy are 
defined by 


OF = dag + e ^ og —iR$0! ^0, 


where R5; are the components of the curva- 
ture tensor R’, i.e., R'(e, ele Bän, 

For a given tensor field K of type (r, s) 
with values in E, the mapping X 2 V, K de- 
fines a tensor field VK of (r,s+ 1) with values 
in E which we call the van der Waerden- 
Bortollotti covariant differential of K. Then if 
K 2 T are the components of K with respect 
to d ...,@,) and (£,, ..., Čm) the components 
Kor a of VK are Geen by 
Ki A * Of = dK deck Y Kite 


Js 


J È KA ant tK Op 
Let f be a smooth mapping of M into a 
smooth manifold M’. The differential f (or df) 
can be regarded as a tensor field of type (0, 1) 
with values in f * T(M'). Assume that M (resp. 
M^) has a Riemannian metric g (resp. g^). We 
denote the Riemannian connection of M by V. 
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From the Riemannian connection of M' a 
connection V'in f * T(M") can be defined. Let 
(rf, ..., y") be a local coordinate system of M’ 
on a neighborhood V and (x!, ..., x") be a local 
coordinate system on a neighborhood U of M 
such that f(U)c V. Put £,(p) - (6/6y?)( f(p)) for 
a point pe U. Then (é,,...,¢,,) is a moving 
frame of f *T(M"). The components of f, with 
respect to (0/Ox!, ..., 0/0x") and (£,, ..., el are 
given by f*(p) - (Cy*/Ox')(p). The Laplacian 
Af of the mapping f is a tensor field of type 
(0,0) with values in f * T(M") and is defined 

by (Af Y =g'f?,. If Af =0, the mapping f is 
called a harmonic mapping (— 195 Harmonic 
Mappings). 


F. Tensor Fields on a Submanifold 


Consider an n-dimensional smooth mani- 
fold M immersed in an (n+ m)-dimensional 
Riemannian manifold (M, g). If we denote the 
immersion MM by f, then g=f*gisa 
Riemannian metric on M, and we denote its 
Riemannian connection by V. The induced 
bundle f * T(M) splits into the sum of the 
tangent bundle T(M) of M and the normal 
bundle THM). The Riemannian connection on 
M induces connections in f *T(M) and in 
T+(M) which are denoted by V and V+, re- 
spectively. The van der Waerden- Bortolotti 
covariant derivative for V and V+ is denoted 
by V. 

For vector fields X and Y on M, the tan- 
gential part of Vy Y (here we regard Y asa 
section of f * T(M)) is V, Y, and we denote the 
normal part of V, Y by h(X, Y). Then h isa 
symmetric tensor field of type (0, 2) with values 
in THM), and we call h the second funda- 
mental tensor of the immersion f. For Ze 
l(T-(M)) the tangential part of Vyé (here č 
is also regarded as a section of f * T(M)) is 
denoted by — A.X and the normal part of Vyé 
is Vy č. Thus we have 


V.Y-V.Y-h(X,Y) Vio — AX 4 Vy. 
h and A are related by 


We have the following formulas, called the 
equations of Gauss, Codazzi, and Ricci: 


g(R(X, Y)Z. W) - (R(X, Y)Z, W) 
t g(h(X, Z), h(Y, W)) 
—g(h(X, W),h(Y, X)), 
g(R(X, Y)Z, ZONK, Z), č) 
—G(Vyh)(X, Z), č), 
(R(X, VU. n 2 9(R^ (X, Y). n) 
+g(LAz, 4,] X, Y), 
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for X, Y, Z, We X(M) and £, ne T (T (M), 
where R, R, and R^ are the curvature tensors 
of V, V, and V+, respectively. 

For the manifold M immersed in M, we use 
a moving frame (e;, ...,e,, 6, ..., Čm) such that 
(e. ...,e,) is an orthonormal moving frame of 
M on a neighborhood U and (£,,...,£,) isa 
moving frame of T+(M) on U with g(£,, £5) — 
Au. Then we can define the connection forms 
aj for V and c; for V+. If we extend (e;, ..., 
85,61, , 6s) to an orthonormal moving frame 
(GG... SA of M such that e&(p) — ej(p) (i— 1, 
...,A) and e, p)= £,(p) (x= L,..., m) for peU, 
then the restriction f*04 and f*@{ of the 
dual 1-forms and the connection forms of 
M with respect to (e, ...,&,.,,) satisfy the 
relations 


f*ü-9, f*9"*—0, f*ai-aj, 


Tei, far => hyo, 
j 


where hj; are the components of the second 
fundamental tensor h with respect to (e,, ...,e 
MR 

The components hj , of the covariant dif- 
ferential Vh of h are defined by 


n? 


hj ,0* —dhj — hio? — hao; + hf cp. 


aj i 


In terms of the components, the equations 
of Gauss, Codazzi, and Ricci are given by 


Run = Rijk T » (hj hy, e? hi, ki 


Ria = ik j T: hj ks 
p L 
Rg =R 08g — SR (hahh = hf, hir). 
Let (x, ..., x") be a local coordinate system 


on a neighborhood U of M and (y’,..., y"*™) 
be a local coordinate system on a neighbor- 
hood V of M such that f(U)c V. Regarding 
the differential f, of the immersion f as a ten- 
sor field of type (0, 1) with values in f * T(M), 
we denote the components of f, with re- 
spect to (x!, ..., x”) and (yl, ..., y"^") by B4 
(i=1,...,n; A=1,...,n+m). Then we have 

B? — 0y^/0x'. We denote by V' the van der 
Waerden- Bortolotti covariant derivative for V 
and V. Then the components Bj, of V'f, are 
given by 


Bit = 0B — (5) Bi + BF BPG}, 


where 0; — 0/0x*, {£}, and {4} are the Chris- 
toffel symbols of the Riemannian metrics g 
and g, respectively. 

Let (¢,,...,¢,,) be an orthonormal moving 
frame of T/(M) on U and £7 be the compo- 
nents of č, with respect to (y!, ..., y"*"). Then 


we have 
A Lord 
Bibi a? 


where h;; are the components of the second 
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fundamental tensor with respect to (0/Ox!, ..., 
9/0x") and (6,, ..., Gm): 

A tensor field K with values in T (M) can 
be regarded as a tensor field with values in 
f * T(M), and VK is the normal component of 
V'K. For example, if we regard the second 
fundamental tensor h as a tensor field with 
values in f * T (M), the components of h with 
respect to the coordinates (x, ..., x") and 


(y, ..., y"*") are equal to Di. and we have 


X _ pDA B- 
hix Se Bia Ca ÜAB- 
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418 (IX.20) 
Theory of Singularities 


A. Introduction 


Let fi, f;, ..., f, be tholomorphic functions 
defined in an open set U of the complex space 
C". Let X be the analytic set f; +O) N... N 

f, (0). Let zo € X, and let g,,...,g, be a sys- 
tem of generators of the ideal ./(X),, of the 
germs of the holomorphic functions which 
vanish identically on a neighborhood of ze in 
X. zy is called a simple point of X if the matrix 
(6gi/0z;) attains its maximal rank, say k, at z= 
Zo. In this case, X is a 'complex manifold of 
dimension n — k near zo. Otherwise, zo is called 
a singular point of X. 
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B. Resolution of Singularities 


Let X be a complex analytic space, and let Y 
be its singular locus. A resolution of the sin- 
gularity of X is a pair of a complex manifold X 
and a proper surjective holomorphic mapping 
1: X X such that the restriction z | s. iy is 
biholomorphic and X — x! (Y) is dense in X. 
H. Hironaka proved that there exists a reso- 
lution for any X such that x ! (Y) is a divisor 
in X with only *normal crossings [16,17]. 
Suppose that a compact connected ana- 
lytic subset Y of a complex manifold X has a 
'strongly pseudoconvex neighborhood in X. 
Then the contraction X/Y naturally has a 
structure of a *normal complex analytic vari- 
ety such that the projection X o X/Y isa 
resolution of X/Y (H. Grauert [14]). 


C. Two-Dimensional Singularities 


Let X be a normal 2-dimensional analytic 
space. Then the singular points of X are 
discrete. 

Among the resolutions of X, there exists a 
unique resolution z: X >X with the following 
universal property: For any resolution z': X’> 
X, there exists a unique mapping p: X’> X 
with z' —z 0 p. This resolution is called the 
minimal resolution. 

Let 1: X +X be a resolution of a singular 
point x of X, and let A; (i— 1, ..., m) be the 
irreducible components of x ^! (x). The matrix 
(A;- Aj) of the tintersection numbers is known 
to be negative definite (P. Du Val [12]). 

The resolution z: X 5 X is called good if (i) 
each A; is nonsingular, (ii) A; A; (ij) is at 
most one point and the intersection is trans- 
verse and (iii) no three A;'s meet at a point. 
For a given good resolution zx: X >X, we 
associate a diagram in which the vertices v; 
(i 1, ..., m) correspond to A, (i— 1, ..., m) and 
v; and v; are joined by a segment if and only if 
ANA, FMD. 

The geometric genus p,(X, x) of a singular 
point xe X is the dimension of the stalk at x 
of the first direct image sheaf R' n 0g, where 
n: X — X is a resolution of xe X and 6g is the 
*structure sheaf of X. The definition is inde- 
pendent of the choice of the resolution, and 
p, X, x) is a finite integer. 

Among the positive cycles of the form Z = 
Zuel (Le. n; 20) such that Z- 4; «0 for 
each i— 1, ..., m, there exists a smallest one Zo, 
which is called the fundamental cycle [3]. 

(1) Rational singularities. A singular point x 
of X is called rational if p,(X, x) =0. (The sin- 
gularity (X, x) is also called rational even when 
dim X > 3 if the direct image sheaf R'z, Cz —0 
for i» 0.) 
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For a rational singularity xe X, the *multi- 
plicity of X at x equals — Z? and the local 
embedding dimension of X at x is — Z2 + 1. 
Hence a rational singularity with multiplicity 
2, which is called a rational double point, is 
a hypersurface singularity. The following 
weighted homogeneous polynomials (— Sec- 
tion D) give the complete list of the defining 
equations up to analytic isomorphism: 


À,: x" *1 + y? 4-27, 





weights (1/(n+1), 1/2, 1/2, nzl; 
Duixt x^ +z, 
weights (l/(n—1) (n—2)/2(n—1) 1/2, n2z4; 


Ee: Xt - y? z?, 


weights (1/4, 1/3, 1/2); 
Ej yy 

weights (2/9, 1/3, 1/2); 
Eg: x5 ty? +27, 

weights (1/5, 1/3, 1/2), 


where the labels appearing at the left are given 
according to the coincidence of the diagram of 
the respective minimal resolutions and the 
*Dynkin diagrams. Rational double points 
have many different characterizations [11 ]. 
The generic part of the singular locus of the 
unipotent variety of a tcomplex simple Lie 
group G (—the orbit of the subregular *unipo- 
tent elements in G) is locally expressed as the 
product of a rational double point and a poly- 
disk. The tuniversal deformation of a rational 
double point and its *simultaneous resolution 
are constructed by restricting the following 
diagram on a transverse slice to the subregular 
unipotent orbit (Brieskorn [7]; [34]): 


Y ————— G 


| | 


T—— ——o T/W 


where T is a *Cartan subgroup of G with the 
action of the Weyl group W, G— T/W is the 
quotient mapping by the fadjoint action of G 
and Y= {(x, B)|xeG and B is a *Borel sub- 
group of G with xe Bj, and other morphisms 
are defined naturally so that the diagram 
commutes. Here, Y T is the simultaneous 
resolution of the morphism G T/W. 

(2) Quotient singularities. A singular point 
x€X is called a quotient singularity if there 
exists a neighborhood of x which is analyti- 
cally isomorphic to an orbit space U/G, where 
U is a neighborhood of 0 in C? and G is a 
finite group of analytic automorphisms of U 
with the unique fixed point 0. The quotient 
singularities are rational, and their resolutions 
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have been well studied [6]. U/G has a rational 
double point at 0 if and only if G is conjugate 
to a nontrivial finite subgroup of SU(2). 

(3) Elliptic singularities. The singularity 
(X, x) is called minimally elliptic if p, (X, x)= ! 
and (X, x) is Gorenstein [23]. The following 
are examples of minimally elliptic singularities. 

A singular point x € X is called simply ellip- 
tic if the exceptional set A of the minimal 
resolution is a smooth ‘elliptic curve [33]. 
When A4? — —1, —2, —3, (X, x) is a hyper- 
surface singularity given by the following 
weighted homogeneous polynomials: 


Ec:x? +y? +z? +axyz, 
(1/3, 1/3, 1/3), 
E,:x++y*++27+axyz, 
(1/4, 1/4, 1/2), A?= —2; 


weights 


weights 
E,:x° + y? +27 + axyz, 
(1/6, 1/3, 1/2), 


(4) Cusp singularities. A singular point xe X 
is called a cusp singularity if the exceptional 
set of the minimal resolution is either a sin- 
gle rational curve with a *node or a cycle of 
smooth rational curves. Cusp singularities 
appear as the boundary of tHilbert modular 
surfaces [18]. The hypersurface cusp singular- 
ities are given by the polynomials 


weights A? — —1, 


Tor ty? + z" +axyz, 
where 1/p+1/q+ t/r<1 and a#0. 


D. The Milnor Fibration for Hypersurface 
Singularities 


Let V be an analytic set in C", and take a 
point zo € V. Let S,=S(zo,€) be a (2N — 1)- 
dimensional sphere in C" with center zy and 
radius e 0, and let K, = VfS,. If z is suffi- 
ciently small, the topological type of the pair 
(S,, K,) is independent of e [27]. By virtue of 
this fact, the study of singular points consti- 
tutes an important aspect of the application of 
topology to the theory of functions of several 
complex variables. 

A singular point z, of V is said to be isolated 
if, for some open neighborhood W of zy in C", 
WV -— zo) is a smooth submanifold of W— 
{zo}. In that case, K, is a closed smooth sub- 
manifold of S,, and the diffeomorphism type 
of (S,, K,) is independent of (sufficiently small) 
£2» 0. So far, the topological study of such 
singular points has been primarilv focused 
on isolated singularities. When V is a plane 
curve, that is, N 22 and r— 1, all the singular 
points of V are isolated, and the submanifold 
K, of the 3-sphere S, can be described as an 
iterated torus link, where type numbers are 
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completely determined by the *Puiseaux ex- 
pansion of the defining equation f of V at the 
point zo [5]. In 1961, D. Mumford, using a 
resolution argument, showed that if an alge- 
braic surface V is tnormal at z, and if the 
closed 3-manifold K, is simply connected, then 
K, is diffeomorphic to the 3-sphere and Ze is 
nonsingular [29]. The following theorem in 
the higher-dimensional case is due to E. Bries- 
korn [8] (1966): 

Every thomotopy (2n — 1)-sphere (n #2) 
that is a boundary of a *z-manifold is dif- 
feomorphic to the K, of some complex hyper- 
surface defined by an equation of the form 
f(z)=z2 +... ze =0 at the origin in C"*!, 
provided that n z 2. The hypersurface of this 
type is called the Brieskorn variety. Inspired by 
Brieskorn's method, J. W. Milnor developed 
topological techniques for the study of hyper- 
surface singularities and obtained results such 
as the Milnor fibering theorem, which can be 
briefly stated as follows: 

Suppose that V 1s defined by a single equa- 
tion f(z) «0 in the neighborhood of z; € C"*'. 
Then there is an associated smooth tfiber 
bundle 9:8, — K,—S!, where q(z)- f(z)/|f(2)| 
for zeS, — K,. The fiber F= !(p)(peS!) has 
the homotopy type of a finite CW-complex of 
dimension n, and K, is (n — 2)-connected. 

Suppose that z, is an isolated critical point 
of f. Then F has the homotopy type of a tbou- 
quet of spheres of dimension n [27]. The Mil- 
nor number oi f) of f is defined by the nth Betti 
number of F, and it is equal to dime Cc: , / 
(0f/0z,, ... . Of/6z, th where feet. is the ring 
of the germs of analytic functions of n+ 1 
variables at z=z,). The Milnor monodromy 7, 
is the automorphism of H,(F) that is induced 
by the action of the canonical generator of 
the fundamental group of the base space ST. 
The *Lefschetz number of h, is zero if z° is 
a singular point of V. Let A(t) be the charac- 
teristic polynomial of h,. Then K, is a homol- 
ogy sphere if and only if A(1)= +1 [27]. It is 
known that A(t) is a product of *cyclotomic 
polynomials. 

The diffeomorphism class of (S,, K,) is com- 
pletely determined by the congruence class of 
the linking matrix Lie, ej) (1 € ij € u( f), where 
en, Gun IS an integral basis of H,(F) and 
L(ej, ej) is the thinking number [21, 10]. 

The Milnor fibration is also described in the 
following way. Let E(e, ô) be the intersection of 
f '!(D£*) and B(e), the open disk of radius e 
and center zo, where Dž is fne C |O « |n| « ó). 
The restriction of f to E(e, ô) is a "locally triv- 
ial fibration over DŽ if ô is sufficiently smaller 
than e [27]. 

Let f(z) be an analytic function; suppose 
that f(0) 20 and let Zewen de ZP be the Taylor 
expansion of f at z=0. Let I’, (f) be the con- 
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vex hull of the union of ( p -- (R *)"*! ) for 

pe N"*! c R"*! with a, 0, where R* ^ [xe 
R[xz 0], and let I'( /) be the union of com- 
pact faces of I, ( £). We call I'( f£) the Newton 
boundary of f in the coordinates z,, ...,z,.,. 
For a closed face A of I'( f) of any dimension, 
let fA(z) — Lipa pz”. We say that f has a non- 
degenerate Newton boundary if (0f4/0z,, ..., 
OfA/0z, ul is a nonzero vector for any ze(C*)""! 
and any Ael (f). Suppose that f has a non- 
degenerate Newton boundary and 0 is an 
isolated critical point of f. Then the Milnor 
fibration of f is determined by I'( f) and u(f), 
and the characteristic polynomial can be ex- 
plicitly computed by T( f) [22,38]. 

f(z) is called weighted homogeneous if there 
exist positive rational numbers r,, ...,7,.,, 
which are called weights, such that a, =0 if 
Y2 pir; z 1. An analytic function f(z) with an 
isolated critical point at 0 is weighted homo- 
geneous in suitable coordinates if and only if 
f belongs to the ideal (6f/éz,,..., Of/0z, .,) (K. 
Saito [32]). Suppose that f(z) is a weighted 
homogeneous polynomial with an isolated 
critical point at 0. Then the Milnor fibration of 
f is uniquely determined by the weights, and 


1 

uCf) I (- i) The surface f ^! (0) for 
lj 

n=2 is a rational double point if and only if 


p 


E. Unfolding Theory 


An unfolding of a germ of an analytic func- 
tion f(z) at 0 is a germ of an analytic function 
F(z,t), where te C" (m is finite) such that F(z, 0) 
= f(z). We assume that f has an isolated crit- 
ical point at 0. Among all the unfoldings of f, 
there exists a universal one, in a suitable sense, 
that is unique up to a local analytic isomor- 
phism. It is called the universal unfolding of f 
[36,37,26] (— 51 Catastrophe Theory). Ex- 
plicitly it can be given by F(z, t)= f(z) - t, @,(z) 
+...+1t,@,(z), where 9,(z) (i= 1,..., u) are 
holomorphic functions which form a C-basis 
of the Jacobi ring Geet o/(0f/0z,, ..., Of/62Z444) 
(u — uCf ». 

In the universal unfolding F(z, t) of f, the set 
of points (zo, to) such that F(z, tg) has an iso- 
lated critical point at 2) with the Milnor num- 
ber u( f) and Fo, t9) 2 0 forms an analytic set 
at (z, t) 20. The modulus number of f is the 
dimension of this set at 0. This set is some- 
times called the -constant stratum. Let g be a 
germ of an analytic function. g is said to be ad- 
jacent to f (denoted by f 9), if there exists a 
sequence of points (z(m), t(m)) in C"*! x C# 
that converges to the origin such that the 
term of F(z, t(m)) at z(m) is equivalent to g. 
Adjacency relations are important for the 
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understanding of the degeneration phenomena 
of functions. The unfolding theory can be 
considered in exactly the same way as that for 
the germ of a real-valued smooth function 
that is finitely determined [36, 26]. 

The germs of analytic functions with 
modulus number 0, 1, and 2 are called simple, 
unimodular, and bimodular, respectively. They 
were classified by V. I. Arnold [1] (— Ap- 
pendix A, Table 5.V). Simple germs corre- 
spond to the equations for the rational double 
points, and unimodular germs define simply 
elliptic singularities or cusp singularities. Every 
unimodular or bimodular germ defines a sin- 
gularity with p,=1. 


F. Picard-Lefschetz Theory 


Let f(z) be a holomorphic function such that 
f(0) 20 and 0 is an isolated critical point with 
the Milnor number u. Let F(z, t) be a universal 
unfolding of f at 0. Let f: E(c, ) 5 Df be the 
Milnor fibration of f by the second description 
in Section D. There exists a positive number r 
and a codimension 1 analytic subset A (called 
the bifurcation set) of B'(r), the open disk of 
radius r with the center O in the parameter 
space CH, such that for any toe B'(r) — A, f, = 
F | ge) x1, has u different nondegenerate crit- 
ical points in B(c). Let p,,...,p, be the critical 
points of f, For each p;, one can choose local 
coordinates (y;, ..., Yn+1) so that f, (y)— f (P:i) 
yp +y,- Such an f, is called a Morsi- 
fication of f. 

Let B; be a small disk with center p; in C"*!. 
Then for any q; which is near enough to f, (p;), 
the intersection f, (4) B, is diffeomorphic to 
the tangent disk bundle of the sphere S”. The 
vanishing cycle e; is the corresponding n- 
dimensional homology class of fr (q) N Bi. 
(We fix q;.) The self-intersection number of e; is 
given by 


Vaud 2(—1y*79?, n even, 
€;, €j = 
i 0, n odd. 


For a sufficiently small t, € B'(r) — A, one has 
the following: (i) | f, (p))] <ô; (ii) the restriction 
of f, to E is a fiber bundle over D', where D' 
={weC||w|<6, and w f, (p) for i— L, ...,u] 
and E= f,.' (D')' B(e); (iii) the restriction of 
the above fibration to {w||w|=6} is equivalent 
to the restriction of the Milnor fibration of f 
to {w||w|=6}. Let wọ be a fixed point of D’, 
and let F = f, ' (w9)'Y E. Then F is diffeomor- 
phic to the Milnor fiber of f. Let l; be a simple 
path from w, to q;, and let y; be the loop |w— 
VAVA] - |q; — f, (p). We suppose that the 
union of the l; is contractible to wọ. By parallel 
translation of the vanishing cycle e; along /,, 
we consider ee H,(F). The collection {e;|i= 
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1,..., 4} is an integral basis of H,(F), which is 
called a strongly distinguished basis (Fig. 1). 

Now let h; be the linear transformation of 
H,(F) that is induced by the parallel trans- 
lation along Lol 1. The Picard-Lefschetz for- 
mula says that 


hi(e) 3e—(—1) "^U? Ce ee; for ee H,(F). 


Here < , > is the intersection number in 
H,(F). For n even, h; is a reflection. 

The Milnor monodromy h, of f is equal 
to the composition h, ...h, under a suitable 
ordering of the h;. The subgroup of the group 
of linear isomorphisms of H,(F) generated by 
h,,...,h, is called the total monodromy group. 

When f is a simple germ and n =2 mod4, 
the total monodromy group is isomorphic to 
the tWeyl group of the corresponding Dynkin 
diagram. Even-dimenstonal simple singular- 
ities are the only ones for which the mono- 
dromy group is finite. These are also char- 
acterized as the singularities with definite 
intersection forms. 


Fig. 1 


G. Stratification Theory 


The notion of Whitney stratification was first 
introduced by H. Whitney to study the sin- 
gularities of analytic varieties [39] and was 
developed by R. Thom for the general case 
[37]. 

Let X and Y be submanifolds of the space 
R”. We say that the pair (X, Y) satisfies the 
Whitney condition (b) at a point ye Y if the 
following holds: Let x; (i—1,2,...) and y; 

(i 1,2, ...) be sequences in X and Y, respec- 
tively, that converge to y. Suppose that the 
tangent space T, X converges to a plane T in 
the corresponding Grassmannian space and 
the secant x;y; converges to a line L. Then L 
c T. We say that (X, Y) satisfies the Whitney 
condition (b) if it satisfies the Whitney con- 
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dition (b) at any point ye Y. Let h be a local 
diffeomorphism of a neighborhood of y. One 
can see that (h( X), h(Y)) satisfies the Whitney 
condition (b) at h(y) if (X, Y) satisfies it at y. 
Thus the Whitney condition can be considered 
for a pair of submanifolds X and Y of a mani- 
fold M using a local coordinate system. Let S 
be a subset of a manifold M, and let Y be a 
family of submanifolds of M. F is called a 
Whitney prestratification of S if is a locally 
finite disjoint cover of S satisfying the follow- 
ing: (i) For any Kei, the frontier X — X isa 
union of Ye 4; (it) for any pair (X, Y) (X, Ye 
SY), the Whitney condition (b) is satisfied. A 
submanifold X ms is called a stratum. There 
exists a canonical partial order in S that 

is defined by X « Y if and only if X c Y — Y. 

Let V be an analytic variety, and let Z be 
an analytic stratification of V that satisfies 
the frontier condition (1). Then there exists a 
Whitney prestratification ^ that is finer than 
M (Whitney [39]). 

For a given Whitney prestratification S, 
one can construct the following controlled 
tubular neighborhood system: For each X e, 
a ttubular neighborhood |T,| of X in M and 
the projection z,:|T,|— X and a tubular func- 
tion py :| Ty] 3R * (=the square of a norm 
under the identification of | T,| with the *nor- 
mal disk bundle of X) are given such that the 
commutation relations 


Ty ny(m)— ny(m), pyny(m) = py(m) 
for me M, X « Y. 


are satisfied whenever both sides are defined. 

By virtue of this, the notions of vector fields 
and their integral curves can be defined on a 
Whitney prestratified set so that several im- 
portant results on a differentiable manifold 
can be generalized to the case of stratified sets. 
For example, the following is Thom's first 
isotopy lemma: Let M and P be differentiable 
manifolds, and let (5, ^) be a Whitney pre- 
stratified subset of M. Let f: SP be a con- 
tinuous mapping that is the restriction of a 
differentiable mapping from M to P. Suppose 
that the restriction of f to each stratum X of 
F is a proper submersion onto P. Then f:S> 
P isa fiber bundle [37]. 


H. b-Functions 


Let f(z) be a germ of an analytic function in 
C""! with f(0)=0. The b-function of f at 0 
is the monic polynomial b,(s) of lowest de- 
gree among all polynomials bist with the 
following property [4, 20]: There exists a 
differential operator P(z, 6/6z, s), which is 

a polynomial in s, such that b(s)f*(z)= 

P(z, 0/0z, s) f **! (z). Since b,(s) is always 
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divisible by s+ 1, we define b,(s) by (s)/(s + 1). 
All the roots of b,(s) 0 are negative rational 
numbers (M. Kashiwara [20]. When f has an 
isolated critical point at 0, the set (exp(2zia |a 
is a root of b (s)=0} coincides with the set of 
eigenvalues of the Milnor monodromy [25]. 
The name “b-function” is due to M. Sato. 
He first introduced it in the study of *prehomo- 
geneous vector spaces. Some authors call it the 
Bernstein (Bernshtein) polynomial. 


I. Hyperplane Sections 


Let V be an algebraic variety of complex di- 
mension K in the complex projective space P". 
Let L be a hyperplane that contains the 
singular points of V. Then the trelative homo- 
topy group 7,(V, VN L) is zero for i « k. Thus 
the same assertion ts true for the relative 
homology groups (S. Lefschetz [24]; [28]). 

Let f be a holomorphic function defined in 
the neighborhood of 0e C"*! and f(0) «0. Let 
H be the hypersurface f ^! (0). There exists a 
*Zariski open subset U of the space (= P") of 
hyperplanes such that for each LeU, there 
exists a positive number « such that z;(B(r) — 
H, (B(r)— H)( L)-0fori«nand 0 «re, 
where B(r) is a disk of radius r (D. T. Lé and 
H. Hamm [15]). This implies the following 
theorem of Zariski: Let V be a hypersurface of 
P", and let P? be a general plane in P". Then 
the fundamental group of P" — V is isomorphic 
to the fundamental group of P? — C, where C 
=V N P?. The fundamental group of P?— C is 
an Abelian group if C is a nodal curve [9,13]. 

Suppose that f has an isolated critical point 
at 0. Let win" li be the Milnor number y f). 
Take a generic hyperplane L. The Milnor 
number of f|, is well defined, and we let i = 
ulf | ,). Similarly one can define p of f and 
let up —(gu"* D, vil... wD). Let f(z) be a de- 
formation of f. Each f, has an isolated critical 
point at 0, and t ts a point of a disk D of the 
complex plane. Let W= ((z, t)| f,(z)=0} and 
D' x10) x D. W— D' and D' satisfy the Whitney 
condition (b) if and only if x*( f) is invariant 
under the deformation [35]. The Whitney 
condition (b) implies topological triviality of 
the deformation. 
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A. Basic Concepts and Postulates 


Thermodynamics traditionally focuses its 
attention on a particular class of states of a 
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given system called (thermal) equilibrium 
states, although a more recent extension, 
called the thermodynamics of irreversible 
processes, deals with certain nonequilibrium 
states. In a simple system, an equilibrium state 
is completely specified (up to the shape of the 
volume it occupies) by the volume V (a posi- 
tive real number), the mole numbers N,, ..., N, 
(nonnegative reals) of its chemical compo- 
nents, and the internal energy U (real). (More 
variables might be needed if the system were, 
e.g., inhomogeneous, anisotropic, electrically 
charged, magnetized, chemically not inert, or 
acted on by electric, magnetic, or gravitational 
fields.) This means that any of the quantities 
associated with equilibrium states (called 
thermodynamical quantities) of a simple sys- 
tem under consideration is a function of V, 
N,,..., Np and U. 

When n copies of the same state are put next 
to each other and the dividing walls are re- 
moved, V, N,,..., N,, and U for the new state 
will be n times the old values of these variables 
under the assumptions that each volume is 
sufficiently large and that the effects of the 
boundary walls can be neglected. Thermo- 
dynamical quantities behaving in this manner 
are called extensive. Those that are invariant 
under the foregoing procedure are called inten- 
sive. More precisely, the thermodynamic vari- 
ables are defined by homogeneity of degree 1 
and 0 as functions of V, N,,..., N, and U. 

By a shift of the position of the boundary 
(called an adiabatic wall if energy and chemical 
substances do not move through it) or by 
transport of energy through the boundary 
(called a diathermal wall if this is allowed) or 
by transport of chemical components through 
the boundary (called a permeable membrane) 
(in short, by thermodynamical processes), these 
variables can change their values. If these shifts 
or transports are not permitted (especially 
for a composite system consisting of several 
simple systems, at its boundary with the out- 
side), the system is called closed. Otherwise it is 
called open. 

Those equilibrium states that do not under- 
go any change when brought into contact 
with each other across an immovable and 
impermeable diathermal wall (called a ther- 
mal contact) form an equivalence class. This 
is sometimes called the Oth law of thermo- 
dynamics. The equivalence class, called the 
temperature of states belonging to it, is an 
intensive quantity. 

The force needed to keep a movable wall at 
rest, divided by the area of the wall, is called 
the pressure. It is another intensive quan- 
tity. For a (slow) change of the volume by an 
amount dV under a constant pressure P, me- 
chanical work of amount — PdV is done on 
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the system. Together with a possible change 
of the internal energy, say of amount dU, the 
amount 


óQ-dU — PdV (1) 


of energy is somehow gained (if it is positive) 
or lost (if it is negative) by the system. This 
amount of energy is actually transported from 
or to a neighboring system through diathermal 
walls so that the total energy for a bigger 
closed composite system is conserved. This is 
called the first law of thermodynamics, and AO 
is called the heat gain or loss by the system. 

If two states of different temperatures T, 
and T, are brought into thermal contact, 
energy is transferred from one, say T;, to the 
other (called heat transfer). This defines a 
binary class relation denoted by T, > T,. The 
Clausius formulation of the second law of 
thermodynamics says that it is impossible to 
make a positive heat transfer from a state of 
lower temperature to another state of higher 
temperature without another change else- 
where. By considering a certain composite 
system, one reaches the conclusion that there 
exists a labeling of temperatures by posi- 
tive real numbers T, called the absolute tem- 
perature, for which the following is an exact 
differential: 


6Q/T=(dU —PdV)/T=ds. (2) 


The integral S is an extensive quantity, called 
the entropy. Furthermore, the sum of the en- 
tropies of component simple systems in an 
isolated composite system is nondecreasing 
during any thermodynamic process, and the 
following entropy maximum principle holds: 
An isolated composite system reaches an 
equilibrium at those values of extensive param- 
eters that maximize the sum of the entropies 
of component simple systems (for constant 
total energy and volume and within the set of 
allowed states under a given constraint). 

A relation expressing the entropy of a given 
system as a function of the extensive param- 
eters (specifying equilibrium states) is known 
as the fundamental relation of the system. If it 
is given as a continuous and differentiable 
homogeneous function of V, N,,...,N,, and U 
and is monotone increasing in U for fixed V, 
N,,..., N,, then one can develop the thermo- 
dynamics of the system based on the above en- 
tropy maximum principle. A relation express- 
ing an intensive parameter as a function of 
some other independent variables is called an 
equation of state. 

Another postulate, which is much less fre- 
quently used, is the Nernst postulate or the 
third law of thermodynamics, which says that 
the entropy vanishes at the vanishing abso- 
lute temperature. 
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B. Various Coefficients and Relationships 


The partial derivative ĉ/ĉx of a function 
f(x, y,...) with respect to the variable x 
with the variables y,... fixed is denoted by 
(0f/0x),..... We abbreviate N,,...,N, as N in 
the following. 

If the fundamental relation ts written as U = 
U(V. N,, ..., N,, S) (instead of S being repre- 
sented as a function of the other quantities), 
then (2) implies 


(QU/0S), y= T, (QU/GV)y s= — P. 


The other first-order partial derivatives of U 
are 


uj 7 (QUON), wis, 


with u: called the chemical potential (or elec- 
trochemical potential) of the jth component. 

If a system is surrounded by an adiabatic 
wall (i.e., the system is thermally isolated) and 
goes through a gradual reversible change 
(quasistatic adiabatic process), then the entropy 
has to stay constant. If a system is in thermal 
contact through a diathermal wall with a large 
system (called the heat bath) whose tempera- 
ture is assumed to remain unchanged during 
the thermal contact, then the temperature of 
the system itself remains constant (an iso- 
thermal process). The decrease of the volume 
per unit increase of pressure under the latter 
circumstance is called the isothermal compress- 
ibility and is given by 


Kp = —V ^ (QV/OP),. y. 


Under constant pressure, the increase of the 
volume per unit increase of the temperature is 
called the coefficient of thermal expansion and 
is given by 


as V^ (9V/OT)y v. 


Under constant pressure, the amount of (quasi- 
static) heat transfer into the system per mole 
required to produce a unit increase of tem- 
perature ts called the specific heat at constant 
pressure and is given by 


Cp: N^! T(0S/OT))p, v. 


where N — N, +... + N,. The same quantity 
under constant volume is called the specific 
heat at constant volume and is given by 


c, =N T(OS/0T)y y. 


The positivity of c, is equivalent to the convex- 
ity of energy as a function of entropy for fixed 
values of V and N. 

Because of the first-order homogeneity of an 
extensive quantity as a function of other ex- 
tensive variables, one can derive an Euler 
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relation, such as 
U=TS—PV+p,N, + e aV N,, 


for a simple system. Its differential form im- 
plies the following Gibbs-Duhem relation: 


SdT—VdP-- N du, ...4- N,dp,—O. 


Because of the identity 


ENEE) 
Cx ek OyJ NOx J^ 


there arise relationships among second deriva- 
tives, known as the Maxwell relations: 





(0T/OV)s y= —(ôP/ðS)y y. 
(CV/2S)p, y= (OT/EP)s, n. 
(0S/6V),. y =(P/AT)y y. 
(0/0P),. w= —(QV/O T)». v. 


By computing the Jacobian of transformations 
of variables, further relations can be obtained. 
For example, 


Cp—c, +N ITV [ks 


C. Legendre Transform and Variational 
Principles 


The Legendre transform of a function f(x,, ..., 
yy.) relative to the variables x is given by 


(pi, syn =S- XP; 
J 


as a function of the variables p;— Cf/Óx; and y. 
The original variables x can be recovered as 
x, = 0g/Cp;. 

In terms of Legendre transforms, the en- 
tropy maximum principle can be reformulated 
in various forms: 

Energy minimum principle: For given values 
of the total entropy and volume, the equilib- 
rium is reached at those values of uncon- 
strained parameters that minimize the total 
energy. This principle is applicable in rever- 
sible processes where the total entropy stays 
constant. 

Helmholtz free energy minimum principle: 
For given values of the temperature (equal to 
that of a heat bath in thermal contact with the 
system) and the total volume, the equilibrium 
is reached at those values of the unconstrained 
parameters that minimize the total Helmholtz 
free energy, where the Helmholtz free energy 
for a simple system is defined as a function of 
T, V, N,,..., N, by 


F-U-TS, 
dF = —-SdT—-PdV+p,dN,+p,dN,. 


Enthalpy minimum principle: For given 
values of the pressure and the total entropy, 
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the equilibrium is reached at those values of 
unconstrained parameters that minimize the 
total enthalpy, where the enthalpy for a sim- 
ple system is defined as a function of S, P, 
N,,..., N, by 


H —U - PV, 
dH=TdS+VdP+y,dN,+...+4,dN,. 


Gibbs free energy minimum principle: For 
constant temperature and pressure, the equi- 
librium is reached at those values of uncon- 
strained parameters that minimize the total 
Gibbs free energy, where the Gibbs free energy 
for a simple system is given as a function of T, 
P, Nuecht, by 


G-U —TS4 PV, 
dG— —SdT-- VdP 4 u, dN, ^... ui, dN,. 


References 


[1] M. W. Zemansky, Heat and thermo- 
dynamics, McGraw-Hill, 1951. 

[2] H. B. Callen, Thermodynamics, Wiley, 
1960. 

[3] J. W. Gibbs, The collected works of J. 
Willard Gibbs, vol. 1, Thermodynamics, Yale 
Univ. Press, 1948. 

[4] M. Planck, Treatise on thermodynamics, 
Dover, 1945. (Original in German, seventh 
edition, 1922.) 

[5] E. A. Guggenheim, Thermodynamics, 
North-Holland, 1949. 

[6] F. C. Andrews, Thermodynamics: Prin- 
ciples and applications, Wiley, 1971. 


420 (XX.8) 
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A. n-Body Problem and Classical Integrals 


In the n-body problem, we study the motions of 
n particles P.(x;, yj, z;) (i2 1,2, ..., n) with arbi- 
trary masses m,{ Di following *Newton's law 
of motion, 


12:558 (1) 


where w; is any one of x;, Y; or Z; 


U = yX k^mmj/r;, 


ifj 


with k? the gravitation constant, and 





ps x; xy +i yy (zy. 


Although the one-body and two-body prob- 
lems have been completely solved, the prob- 
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lem has not been solved for n> 2. The three- 
body problem is well known and is important 
both in celestial mechanics and in mathemat- 
ics. For n 3 the problem is called the many- 
body problem. 

The equations (1) have the so-called ten 
classical integrals, that is, the energy integral 
X(nj/2) (0)? + (9? + (29^) - U = constant 
(w — dw/dt), six integrals of the center of mass 
YX,m;w,—constant, 3, m;w,—(X,m;wj)t + con- 
stant, and three integrals of angular momen- 
tum > m;,(u;w; — w;u;) = constant (u z wl, Using 
these integrals and eliminating the time t and 
the ascending node by applying Jacobi’s 
method, the order of the equations (1) can be 
reduced to 6n — 12. H. Bruns proved that alge- 
braic integrals cannot be found except for the 
classical integrals, and H. Poincaré showed 
that there is no other single-valued integral 
(Bruns, Acta Math., 11 (1887); Poincaré [2, I, 
ch. 5]). These results are called Poincaré-Bruns 
theorems. Therefore we cannot hope to obtain 
general solutions for the equations (1) by 
tquadrature. General solutions for nz 3 have 
not been discovered except for certain specific 
cases. 


B. Particular Solutions 


Let r; be the position vector of the particle P; 
with respect to the center of mass of the n- 
body system. A configuration r={r,,...,7,} 
of the system is said to form a central figure 
(or central configuration) if the resultant force 
acting on each particle P, is proportional to 
m;r;, where each proportionality constant is 
independent of i. The proportionality con- 
stant is uniquely determined as — UD, m;r? 
by the configuration of the system. A con- 
figuration r is a central figure if and only if 
ris a ‘critical point of the mapping rt 

U?(r) 32, mir? [5,6]. A rotation of the sys- 
tem, in planar central figure, with appropriate 
angular velocity is a particular solution of 
the planar n-body problem. 

Particular solutions known for the three- 
body problem are the equilateral triangle solu- 
tion of Lagrange and the straight line solu- 
tion of Euler. They are the only solutions 
known for the case of arbitrary masses, and 
their configuration stays in the central figure 
throughout the motion. 


C. Domain of Existence of Solutions 


The solutions for the three-body problem 

are analytic, except for the collison case, i.e., 
the case where min; =0, in a strip domain en- 
closing the real axis of the t-plane (Poincare, P. 
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Painlevé). K. F. Sundman proved that when 
two bodies collide at t = to, the solution is 
expressed as a power series in (t — 15)? in a 
neighborhood of ty, and the solution which 

is real on the real axis can be uniquely and 
analytically continued across t = t, along the 
real axis. When all three particles collide, the 
total angular momentum f with respect to the 
center of mass must vanish (and the motion 1s 
planar) (Sundman's theorem); so under the 
assumption f #0, introducing s= ['(U + 1)dt 
as a new independent variable and taking it 
for granted that any binary collision is analyt- 
ically continued, we see that the solution of 
the three-body problem is analytic on a strip 
domain |Im s| <ô containing the real axis of 
the s-plane. The conformal mapping 


c —(exp(zs/2ó) — I)/(exp(rs/20) + 1) 


maps the strip domain onto the unit disk 

|o| « 1, where the coordinates of the three 
particles w,, their mutual distances r,,, and the 
time t are all analytic functions of œw and give a 
complete description of the motion for all real 
time (Sundman, Acta Math., 36 (1913); Siegel 
and Moser [7]). 

When a triple collision occurs at t = to, G. 
Bisconcini, Sundman, H. Block, and C. L. 
Siegel showed that as tt, (i) the configura- 
tion of the three particles approaches asymp- 
totically the Lagrange equilateral triangle 
configuration or the Euler straight line con- 
figuration, (11) the collision of the three par- 
ticles takes place in definite directions, and 
(iii) in general the triple-collision solution 
cannot be analytically continued beyond t — fo. 


D. Final Behavior of Solutions 


Suppose that the center of mass of the three- 
body system is at rest. The motion of the 
system was classified by J. Chazy into seven 
types according to the asymptotic behavior 
when t> +00, provided that the angular mo- 
mentum f of the system is different from zero. 
In terms of the torder of the three mutual dis- 
tances ro (for large t) these types are defined as 
follows: 

(i) H^: Hyperbolic motion. r,;~ t. 

(ii) HP*: Hyperbolic-parabolic motion. r, ;, 
rj47-t and rus ET, 

(iii) HE*: Hyperbolic-elliptic motion. r,4, 73 ~t 
and r,, <a (a —finite). 

(iv) P": Parabolic motion. r; ~t°®. 

(v) PE*: Parabolic-elliptic motion. r43, 7.3 ~t? 
and r,, «a. 

(vi) L*: Lagrange-stable motion or bounded 
motion. r « à. 

(vii) OS*: Oscillating motion. lim, „„ supr; = oo, 
lim, ,,, supr; < oc. 
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Define H^, HE, etc. analogously but with 
t— —oo. There are three classes for each of the 
motions HP, HE, and PE, depending on which 
of the three bodies separates from the other 
two bodies and recedes to infinity, denoted by 
HP, HE. PE; (i= 1,2, 3), respectively. The 
energy constant is positive for H- and HP- 
motion, zero for P-motion, and negative for 
PE-, L-, and OS-motion. For HE-motion, h 
may be positive, zero, or negative. 

We say that a partial capture takes place 
when the motion is H^ for t2 —oo and HE; 
for t— +00 (for h 0), and a complete capture 
when the motion is HE; for t5 --oo and L* 
for t2 +00 (for h « 0). We say also that an 
exchange takes place when HE; for t> —oo 
and HE,’ for t +00(t £j). The probability of 
complete capture in the domain h<0 is zero 
(J. Chazy, G. A. Merman). 


E. Perturbation Theories 


The radius of convergence in the s-plane for 
Sundman's solution is too small and the con- 
vergence is too slow in the w-plane to make it 
possible to compute orbits of celestial bodies, 
and for that purpose a perturbation method is 
usually adopted. When the masses m;, ...,m, 
are negligibly small compared with m, for the 
n-body problem, the motion of the nth body is 
derived as the solution of the two-body prob- 
lem for m, and m, by assuming m; =... = 

m,.., =0 as a first approximation, and then 
the deviations of the true orbit from the ellipse 
are derived as tperturbations. In the general 
theory of perturbations the deviations are 
derived theoretically by developing a disturb- 
ing function, whereas in the special theory of 
perturbations they are computed by numerical 
integration. In general perturbation theory, 
problems concerning convergence of the solu- 
tion are important, and it becomes necessary 
to simplify the disturbing function in deal- 

ing with the actual relations among celestial 
bodies. Specific techniques have to be devel- 
oped in order to compute perturbations for 
lunar motion, motions of characteristic aster- 
oids, and motions of satellites (e.g., the system 
of the Sun, Jupiter, and Jovian satellites). 


F. The Restricted Three-Body Problem 


Since the three-body problem is very difficult 
to handle mathematically, mathematical inter- 
est has been concentrated on the restricted 
three-body problem (in particular, the planar 
problem) since Hill studied lunar theory in 
the 19th century. For the restricted three-body 
problem, the third body, of zero mass, cannot 
have any influence on the motion of the other 
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two bodies, which are of finite masses and 
which move uniformly on a circle around 

the center of mass. In the planar case, let us 
choose units so that the total mass, the angu- 
lar velocity of the two bodies about their 
center of mass, and the gravitation constant 
are all equal to 1, and let (q,,q,) be the coordi- 
nates of the third body with respect to a rotat- 
ing coordinate system chosen in such a way 
that the origin is at the center of mass and the 
two bodies of finite masses u and 1 — u are 
always fixed on the q,-axis. Then the equa- 
tions of motion for the third body are given 
by a Hamiltonian system: 











dq, 0H dp, OH 

fe, ae Bat (2) 
dt Op; dt 04; 

with 


1 
H — (pi +p3)+q2Pı—4ıP2— U (41,42), 


Ln H 
= 2 Se 2 EW 
NEIEN t4 Aditu) *q2 


The equations (2) have the energy integral 
Hip. q) - constant, called Jacobi's integral. 
Siegel showed that there is no other algebraic 
integral, and it can be proved by applying 
Poincaré's theorem that there is no other 
single-valued integral. Regularization of the 
two singular points for the equations (2) and 
solutions passing through the singular points - 
were studied by T. Levi-Civita, and solu- 
tions tending to infinity were studied by B. O. 
Koopman. 

After reducing the number of variables by 
means of the Jacobi integral, the equations (2) 
give rise to a flow in a 3-dimensional manifold 
of which the topological type was clarified by 
G. D. Birkhoff (Rend. Circ. Mat. Palermo, 39 
(1915)). Since this flow has an tinvariant mea- 
sure, the equations have been studied topo- 
logically, and important results for the re- 
stricted three-body problem, particularly on 
periodic solutions, have been obtained. 








G. Stability of Equilateral Triangular Solutions 


Suppose that the origin q;=p;=0 is an tequi- 
librium point for an autonomous Hamiltonian 
system with two degrees of freedom: 

dq, OH 


dt êp; 


dp | 0H 
dt ôq’ 








with the Hamiltonian H being analytic at the 
origin. When the 'eigenvalues of the corre- 
sponding linearized system are purely imagi- 
nary and distinct, denoted by +4., +4,, and 
ALk, +4,k, 40 for 0 « |k, | - ]k5| x 4 (where k; 
is an integer), we can find suitable coordinates 
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é;, H, so that the Hamiltonian H takes the form 
, ; 1 : 
H=4,% +420, "eu ET 2e126) by +0503) 


+H; +... 


with ¢;=¢;n; and real c;;. It is necessary that 
n;=./ —1 €, for the solutions to be real. In ad- 
dition, if the condition 


is satisfied, then the origin is a tstable equi- 
librium point of the original system (V. I. 
Arnol'd, J. Moser) [7]. 

For Lagrange equilateral triangular solu- 
tions of the planar restricted three-body prob- 
lem, the eigenvalues 4 of the linearized system 
derived from (2) are given as roots of the 
equation 44 + 4? 4-(27/A)u(1 — ui) «0 and are 
purely imaginary if ui — 1) « 1/27. Applying 
the Arnol'd-Moser result, A.M. Leontovich 
and A. Deprit and Bartholomé showed that 
the Lagrange equilibrium points are stable 
for u such that 0 < u < uo, where yo is the 
smaller root of 27u(1 — uu) 2 1, excluding three 
values: u,, H: at which å k, cA; k, —0 |k,| 4- 
|k;| x 4 and u, at which D —0. 

Arnold proved that if the masses m,, ...,m 
are negligibly small in comparison with m,, 
the motion of the n-body system is fquasi- 
periodic for the majority of initial conditions 
for which the eccentricities and inclinations of 
the osculating ellipses are small. 


n 
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A. Time Series 


A time series is a sequence of observations 
ordered in time. Here we assume that mea- 
surements are quantitative and the times of 
measurements are equally spaced. We consider 
this sequence to be a realization of a stochastic 
process X, (— 407 Stochastic Processes). Usu- 
ally time series analysis means a statistical 
analysis based on samples drawn from a sta- 
tionary process (— 395 Stationary Processes) 
or a related process. In what follows we denote 
the sample by X z(X,, X,, ..., X4). 


B. Statistical Inference of the Autocorrelation 


Let us assume X, (t an integer) to be real- 
valued and weakly stationary (— 395 Station- 
ary Processes) and for simplicity EX,=0 and 
consider the estimation of the autocorrelation 
p, — R,/Rg of time lag h, where R, 2 EX,X,,;. 
We denote the sample autocovariance of time 
lagh as 

E 1 t— |h| 

R=F 2 X Xo, 

and define the serial correlation coefficient 

of time lagh by p, — R,/R,. It can be shown 
that the joint distribution of (A T(À, Phl 

| «hx Hj tends to an H-dimensional tnor- 
mal (Gaussian) distribution with mean vec- 
tor 0, if one assumes that X, is expressed as 

X =E jE}, where D-a |bj| < +00, 
X -al|j| 7b? < +00, and the é, are indepen- 
dently and identically distributed random vari- 
ables with Eë, 20 and E£? < +00. 

When X, is an autoregressive process of 
order K (— Section D) and also a fGaussian 
process, it can be shown that the asymptotic 
distribution of G/T(,— Phl 1<h<K}as T> 
oo is equal to the asymptotic distribution of 
G/T(À, — oy) I «hx K}, where f, is the tmaxi- 
mum likelihood estimator of p,. In general, 
it 1s difficult to obtain the maximum likeli- 
hood estimator of p,. The statistical properties 
of other estimators of p,, e.g., an estimator 
constructed by using sgn(X)) (sgn( y) means 
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1 (y>0), 0(y=0), —1 (y «0)) have also been 
investigated. 

Testing hypotheses concerning autocorre- 
lation can be carried out by using the above 
results. Let us now consider the problem of 
testing the hypothesis that X, is a fwhite noise. 
Assume that X, is a Gaussian process and that 
a white noise with EX? =o? exists, and define 
C,= XLG(X,— X)(X,,, — X) and $,— C,/C, for 
hz 0, where X,.,,— X; and X 2 XL, X,/T. 
Then the probability density function of j, can 
be obtained and it can be shown that 

" m A 
PO m y- 2.0; —y a Án SYS Är 

= 


J 
where 4; —cos27j/T and 


(T-1)2 


Aj= I] (UA T=3,5,..., 
ke 
T/2-1 
Am (j—A) /l1*A, T=4,6,..., 
Kx 
l<m<(T—3)/2 if T is odd, 
l<m<T/2-1 if T is even. 


This can be used to obtain a test of 
significance. 


C. Statistical Inference of the Spectrum 


To find the periodicities of a real-valued 
tweakly stationary process X, with mean 0, the 
statistic, called the periodogram, 


T 2 
—2mnitÀ 
3 Xe 


"Iz 


is used. If X, is expressed as 


E 





L 
X,= Y {m,cos2nd,t +m sin 2råt} + Y, 
1-1 


where (mj), {m;}, and ( Y,} are mutually inde- 
pendent random variables with Em,= Em,=0 
and V(m;) 2 V(mj) 2 o? and {Y,} is independent 
and identically distributed with means 0 and 
finite variances o". the distribution of I,(A) 
converges to a distribution with finite mean 
and finite variance at 2z +4 for1</I<L 
when T tends to infinity. On the other hand, 
the magnitude of 14(4) is of the order of T at 
Ass kA, 1<1<L. This means that we can find 
the periodicities of X, by using LI, (4). When 
X,— Y, we find that the distribution of 2/,-(A)/ 
c? (when 240, + 1/2) or I4 (2)/6? (when 1 =0 
or + 1/2) tends to the ty? distribution with 
degrees of freedom 2 or 1, respectively, and 
I(un), Iíu5), ..., I(uy) are asymptotically in- 
dependent random variables for Oslo: 
(ul. lues 1/2 when T oc. Applying 
this result, we can test for periods in the data. 
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Let f(A) be the spectral density function of a 
real-valued weakly stationary process X,. In 
general, the variance of |E, X jg CRAT | 
does not tend to 0 as T tends to infinity; hence 
l (å) cannot be used as a good estimator for 
the spectral density. To obtain an estimate of 
f (4), several estimators defined by using weight 
functions have been proposed by several 
authors. Let W;(A) be a weight function de- 
fined on (—oo, oo), and construct a statistic 

JA = [Aa tr) Wr- p) dp. Let us use f(u) 
for the estimation of f(A). W,(A) is called a 
window. An important class of W;(A) is as fol- 
lows. Let W(2) be continuous, W(A) = W(— A), 
W(0)— 1, |W) <1, and Ir. W(4) di< 
+œ, and let H be a positive integer depend- 
ing on T such that H >œ and H/T—0 as T^ 

oo. Putting w;— W(j/H), we define W,(4) by 
e E rt we 7". Then f (A) can be 
expressed as fi Ss E R, we 274, 
where Rer -1' X,4,X,/T for hz 0and 
RU ui X XIT for h<0. Let X, be 
stationary to the fourth order (— 395 Station- 
ary Processes) and satisfy 


Kei 
3 |R,« +00, 


h--—o 
E 
5 [Co mt.pl< +0, 
hl, p= — 0 
where C, , , , is the fourth-order joint tcumu- 
lant of X,, X,,,, X,,,, and X,,,. Then we have 


lim — V(J(0)) - 2/0? | ^ Wy dà, 
Too H ids 
im gye =y |^ waa; 


lim - VFD = foy | © WO) dà, 


440, +1/2, 


lim ^ CovJU), Jud)=0, Ze () 
{w,} or W,(A) should have an optimality, e.g., 
to minimize the mean square error of f(A). But, 
generally, it is difficult to obtain such a {w,} or 
W,(A). 

Several authors have proposed specific 
types of windows. The following are some 
examples: (t) (Bartlett) w, —(1 —|h|/H) for |h| < 
H and w,=0 for |h| > H; (it) (Tukey) w,— 

XE Qa cos(nlh/H) for |h| x H and w,=0 for 
|h| > H, where the a, are constants such that 
Xj lai € +0, 322 2,2; 71 and a,=a_,. 
The Hanning and Hamming windows are a, 
=0.50, a, =a_, 20.25, and a,=0 for ||| 22 and 
ay —0.54, a, =a_, =0.23, and a,=0 for |I| 2 2, 
respectively [2]. Let Zeit bje.;, where 
ZE [bj| <+ and the e are independently 
and identically distributed random variables 
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with Ee, 2 0 and Es? < +o. Let (4j| xj « M] 
be arbitrary real numbers such that 0x 2, « 4, 
<... x AM < 1/2, where M is an arbitrary posi- 
tive integer. Then the joint distribution of 
{./T/H(f(A,)— ETG4)) 1<v <M} tends to the 
normal distribution with means 0 and covar- 
iance matrix X, which is defined by (1). Let us 
assume, furthermore, that lim, all — w(x)y/|xl* 
=C and Eg. ,|h[l"|R,| < +00, where C, q, and 
p are some positive constants satisfying the 
following conditions: (i) when p 2 q, H4/T-+0 
(p>1) and H**! "/T50(pz 1) as Too and 
Jm, T/H?*4*! is finite; (ii) when p <q, H?/T 
20 (pz1)and H/T—0 (pz 1)as T— oo and 
lim... T/H?^*! 20, Then ./T/H(f(4,)— 
Ef(2,)) in the results above can be replaced 

by V T/H GU.) SA) 

Estimation of higher-order spectra, partic- 
ularly the bispectrum, has also been discussed. 
Let X, be a weakly stationary process with 
mean 0, and let its spectral decomposition be 
given by X,— (1,5 e?" ^ qZ(1) (> 395 Station- 
ary EUN We assume that X, is a weakly 
stationary process of degree 3 and put R,, n, 
= EX, KA, En, for any integers h, and hy. 
Then we have 


[212 f 
Rann = g?ni thad d F(A, ZA 
1/2 J —1/2 


Symbolically, dF(4,, As EdZ(4,)dZ(A;) 
dZ(—2,—4,). If F(A4,, 45) is absolutely contin- 
uous with respect to the Lebésgue measure of 
R? and (F(4,, 45)/024,04, = f(4,, 45), we call 
TL: 45) the bispectral density function. When 
X, is Gaussian, Ra n, =O and f(2,, 4;) «0 for 
any h,, h, and any 4,, 4,. f(4,, 4;) can be 
considered to give a kind of measure of the 
departure from a Gaussian process or a kind 
of nonlinear relationship among waves of 
different frequencies. We can construct an 
estimator for f(4,, 4,) by using windows as in 
the estimation of a spectral density [3]. 


D. Statistical Analysis of Parametric Models 


When we assume merely that X, is a stationary 
process and nothing further, then X, contains 
infinite-dimensional unknown parameters. In 
this case, it may be difficult to develop a satis- 
factory general theory for statistical infetence 
about X,. But in most practical applications of 
time series analysis, we can safely assume at 
least some of the time dependences to be 
known. For this reason, we can often use a 
model with finite-dimensional parameters. 
This means, mainly, that the moments (usually, 
second-order moments) or the spectral density 
are assumed to be expressible in terms of finite- 
dimensional parameters. As examples of such 
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models, autoregressive models, moving average 
models, and autoregressive moving average 
models are widely used. 

A process X, is called an autoregressive 
process of order K if X, satisfies a difference 
equation Xi oa, X,- = č, where the a, are 
constants, dg = 1, ay #0, and the £, are mutu- 
ally uncorrelated with Ec, —0 and V(£j) o2 > 
0. We usually assume that X, is a weakly sta- 
tionary process with EX,=0. We sometimes 
use the notation AR(K) to express a weakly 
stationary and autoregressive process of order 
K. Let {€,} be as above. If X, is expressed as X, 
zYXL,b£E, ; where the b; are constants, by = 1 
and b, £0, X, is called a moving average pro- 
cess of order L (MA(L) process). Furthermore, 
if X, is weakly stationary with EX, —0 and 
expressed as XE. oa X, -= Dg bič, with ag 
=1, bus, and agb, £0, then X, is called an 
autoregressive moving average process of order 
(K, L) (ARMA(K, L) process). Let A(Z) and 
B(Z) be two polynomials of Z such that A(Z) 
zXE a,Z* * and B(Z)- XI bj Z- , and let 
{a,|1<k<K} and {B,|1</1<L} be the solu- 
tions of the associated polynomial equations 
A(Z)=0 and B(Z)=0, respectively, we as- 
sume that |a,|<1 for 1 x k « K and |fj| « 1 for 
I «Ix L. This condition implies that X, is 
purely nondeterministic. Let the observed 
sample be ( X,|1 <t< T}. If we assume that X, 
is Gaussian and an ARMA(K, L) process, we 
can show that the "maximum likelihood es- 
timators {4,} and f5,) of {a,} and {b,} are 
‘consistent and asymptotically efficient when 
T — co (“asymptotically efficient" means that 
the covariance matrix of the distribution of 
the estimators is asymptotically equal to the 
inverse of the information matrix) [5] (— 

399 Statistical Estimation D). Furthermore, 
if X, is an AR(K) process, the joint distribu- 
tion of G/T(à,— a)l 1<k<K} tends to a K- 
dimensional normal distribution with means 0, 
and this distribution is the same as the one to 
which the distribution of the ‘least-square 
estimators {4,} minimizing Q — XL y.,(X,4 
YE ia, X, A tends when T— oc. If X, isa 
MA(L) or ARMA(K, L) process (L > 1), the 
likelihood equations are complicated and 
cannot be solved directly. Many approxi- 
mation methods have been proposed to ob- 
tain the estimates. 

When X, is an AR(K) process with |«,|< 
1 for 1<k<K, R, satisfies X oa, R, ,—0 
for hz 1. These are often called the Yule- 
Walker equations. R, can be expressed as 
R,— X, Gat if the a, are distinct and ay # 

0, where {C;} are constants and determined 
by R, forO<h<K—1. When X, is an 
ARMA(K, L) process, X. o a, R,-,,—0 for 
h>L+1, and the C; of R,=X4_, Ca} are 
determined by (R,|O <h  max(K, L)}. 
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The spectral density is expressed as f(A4) — 

c2 |B(e?"^)|?/| A(e?"^). If X, is Gaussian, 

the maximum likelihood estimator of f(A) is 
asymptotically equal to the statistic obtained 
by replacing of, {b,}, and {a,} in f(A) with ôf, 
ibi), and (4j, respectively, where 6? is the 
maximum likelihood estimator of o2, when 

T —oo. 

When we analyze a time series and intend 
to fit an ARMA(K, L) model, we have to 
determine the values of K and L. For AR(K) 
models, many methods have been proposed to 
determine the value of K. Some examples are: 
(i) (Quenouille) Let (Z* A(1/Z))? = X74 AjZ/, 
and Gx = D7 o A(R,/Ro), where Â; is obtained 
by replacing {a,} in A, by {d,}, and we con- 
struct the statistic x7 = Ef- Gen, Then x4 has a 
ty? distribution asymptotically with f degrees 
of freedom under the assumption that K > Kọ, 
where Ko is the true order, as T — oo. Using 
this fact, we can determine the order of an AR 
model. (ii) (Akaike) We consider choosing an 
order K satisfying Ki < K «& Ky, where K, and 
Ky are minimum order and maximum order, 
respectively, specified a priori. Then we con- 
struct the statistic AIC(K) - (T — K)logó2(K) 
+2K, where 


T 
ĉ?(K)= 2- (X, à; X,., Tee dg X, /T 
t=K+1 


and {4,|1<k<K} are the least square esti- 
mators of the autoregressive coefficients of an 
AR(K) model fitting X,. Calculate AIC(K) for 
K=K,, K,+1,..., Ky. If AIC(K) has the 
minimum value at K = K, we determine the 
order to be K [6] (— 403 Statistical Models 
F). Parzen proposed another method by using 
the criterion autoregressive transfer function 
(CAT). Here CAT(K) = 1 — 6?(00)/62(K) + K/T, 
where 67(K)=(T/(T — K))6?(K) and 6?(oo) is 
an estimator of o7(00) - exp([ 25 log f(A) dd) 
[7]. (iii) We can construct a test statistic for 
the null hypothesis AR(K) against the alterna- 
tive hypothesis AR(K + 1) (Jenkins) or use a 
multiple decision procedure (T. W. Anderson 
[8]). 

Not much is known about the statistical 
properties of the above methods, and few 
comparisons have been made among them. 

Another parametric model is an exponential 
model for the spectrum. The spectral density is 
expressed by f(4) 2 C?exp(2 X, 0, cos(2nk4)], 
where the 0, and C are constants. 

We now discuss some general theories of 
estimation for finite-dimensional-parameter 
models. Let X, be a real-valued Gaussian 
process of mean 0 and of spectral density f(A) 
which is continous and positive in [ —1/2, 1/2], 
and let the moving average representation of 
X, be X, — Xo biči- where £, is a white noise 
and o = Eé?. We assume that f(4)/o? — g(4) 
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depends only on M parameters 0 — (0,,0,, 

., 04) which are independent of o2. Then 
the logarithm of the tlikelihood function can 
be approximated by —(1/2)( Tlog2zo2 + 
X' Er (6)X/o2]) by ignoring the lower-order 
terms in T, where o2 El is the covariance 
matrix of X. E it is difficult to find an 
explicit expression for each element of $7! (8). 
Another approximation for the logarithm of 
the likelihood function is given by 


SIN E fa D al 
NEE 


Under mild conditions on the regularity of 
g(A), the estimators 0 — (0, ,Ó,, ...,Ó,,) and CH 
obtained as the solutions of the likelihood 
equations, are *consistent and asymptotically 
normal as T tends to infinity. This means that 
the distribution of ,/ T(62 — 62) is asymptoti- 
cally normal and JT (62 — og) ) and JT(— 0) 
are asymptotically independent. The asympto- 
tic distribution of JT (8— 6) is the normal dis- 
tribution N(0, I), where the (k, /)-component 
Ta of T is given by 


GIN E DES) 3 
0 








2] X 60, 00, 


E. Statistical Analysis of Multiple Time Series 


Let X, (X(U, XO), ..., X®Y be a complex- 
valued weakly stationary process with EX, —0 
and EX,X; — R, ,. R, , is the p x p matrix 
whose (k, !)-component is RP = EX(? X”. We 
discuss the case when t is an integer. R, has 
the spectral representation 


12 
R,= | e" d F(A), 
—1/2 


where F(A) is a p x p matrix and F(4,) — F(A,), 
4,2 45, is Hermitian nonnegative. Let f (4) 
be the (k, It WE of the spectral density 
matrix f(A), Le, (4) » ^5 f(u) du, of the 
absolutely continuous part in the Lebesgue 
decomposition of F(A). The function f (4) for 


k x:lis called the cross spectral density function. 


f (4) represents a kind of correlation between 
the wave of frequency 4 included in X and 
the one included in X/^. 

Let X, - (X(!, XP, ..., XÍPY and Y,= 
(Y,D, YO, ..., ¥) be two complex-valued 
weakly cut prone with EX, — 0, 
EY,=0, ENK Ri. EY, Y; — RY , and 
EX,Y; BIL We assume NET. A,X 
where A, is a q x p matrix whose components 
are constants depending on s. Put A(A) = 
EZ a Ae ?U*. A(A) should exist in the 
sense of mean square convergence with respect 
to the spectral distribution function F for X,. 


Ten 
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The function A(A) is called the matrix fre- 
quency response function. 

As a measure of the strength of association 
between X/? and X/? at frequency å, we intro- 
duce the quantity 7^'(4) UA PAs). 
y*'(4) is called the coherence. Let X= 
ES off XO +y, where n, is a weakly sta- 
tionary process with mean 0 and uncorrelated 
with X, —oo «s« œ. If E|g |? —0, y*'(4) 1. 
EIS". ak! X .|? 20, y*'(4) 20. Generally, 
we have 0 « y*'(4) « I. 

For the estimation of F(A), A(4), and 7*'(4), 
the theories have been similar to those for the 
estimation of the spectral density of a scalar 
time series. For example, an estimator of f(A) 
is given [11] in the form 


EZ 


T-1 
f(A) = > Rawe Saniha 
h=-(T-1) 


where 


SI 


R,= 2 X, XT 


and the w, are the same as in Section C. 

We can define an autoregressive, moving 
average, or autoregressive moving average 
process in a similar way as for a scalar time 
series. The a, and b, in Section D should be 
replaced by p x p matrices and the associated 
polynomial equations A(Z) 20 and B(Z)=0 
should be understood in the vector sense [11]. 
There are problems with determining the coeffi- 
cients uniquely or identifying an ARMA(K, L) 
model, and these problems have been dis- 
cussed to some extent. 


F. Statistical Inference of the Mean Function 


Let X, be expressed as X, =m, + Y, where m, is 
a real-valued deterministic function of t and Y, 
is a real-valued weakly stationary process with 
mean 0 and spectral distribution function F(A). 
This means that V UN — m,. We consider the 
case when m, = 3/4, Col, where C 2(C;, C;, 

.,Cyu) isa E of unknown coefficients 
and g, — (QU, gp, ..., oy is a set of known 
(regression) functions. 

Let us construct ‘linear unbiased estimators 
(C; XL, y, all le M) for the coefficients 
C,, where the y, are known constants. Put 
$ — (94,95, ..., 91). Then the ‘least squares 
estimator of C is given by C 2 ($'6) ! X 
when ©’® is nonsingular. Let X be the covar- 
iance matrix of X. Then the tbest linear un- 
biased estimator is C* 2(D'Z Lët It X. 
We put || o9 12 =E (PPY and assume that 
Im, IPP lF — oo, Wm, IPP a/l” 
=| for 1<j<M and any fixed h and assume 
the existence of y? 2 Dm, XL, 90, 9(9/ 
Ilo? || po ||; for 1 <j, k « M. We also assume 
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that F(A) is absolutely continuous and F'(4)2 
f(^) is positive and piecewise continuous. 

Let y, be the M x M matrix whose ( j, k)- 
component is f^. Then y, can be repre- 
sented by 


1/2 

y= | e qa, 
—1/2 

where G(A) — G(u) is a nonnegative definite 

matrix for 2> u. Assume that yy = G(1/2) 

— G(—1/2) is nonsingular and put H(A)= 

Wo! Gyo '?, and for any set S, H(S) 

fs H(d). Suppose further that S,, $5, ..., S, 

are q sets such that H(S,)>0, X4_, H(S)— 

I, H(S) H(S,) «0, j  k, and for any j there is 

no subset S; c S; such that H(S;)>0, H(S;— Sj) 

>0 and H(S;)H(S;—S;)=0. We have q< M. 

It can be shown that the spectrum of the re- 

gression can be decomposed into such disjoint 

sets $,,..., Then we can show that € is 

asymptotically efficient in the sense that the 

asymptotic covariance matrix of C is equiva- 

lent to that of €* if and only if f(4) is constant 

on each of the elements 5, Especially, if y® = 

(ein, C is asymptotically efficient. 


G. Nonstationary Models 


It is difficult to develop a statistical theory for 
a general class of nonstationary time series, 
but some special types of nonstationary pro- 
cesses have been investigated more or less in 
detail. Let X, (t an integer) be a real-valued sto- 
chastic process and V be the backward dif- 
ference operator defined by VX,= X,— X, , 
and V^ X, — V(V^'! X) for d>2. We assume 
that X, is defined for t 2 tọ (tọ a finite integer), 
and EX? < +o. For analyzing a nonstation- 
ary time series, Box and Jenkins introduced 
the following model: For a positive integer d, 
Y, - V! X, t 2 tg-- d is stationary and is an 
autoregressive moving average process of 
order (K, L) for t 2 t9 +d - max(K, L). They 
called such an X, an autoregressive integrated 
moving average process of order (K,d, L) and 
denoted it by ARIMA(K,d, L). The word 
"integrated" means a kind of summation; 

in fact, X, can be expressed as a sum of the 
weakly stationary process Y,, i.e., 


XXX (xo S Y Je 


S2571 s; 51 


when t9 = —d + 1. Using this model, methods 
of forecasting and of model identification and 
estimation can be discussed [13]. 
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Another nonstationary model is based on 
the concept of evolutionary spectra [14]. In 
this approach, spectral distribution functions 
are taken to be time-dependent. Let X, be 
a complex-valued stochastic process (t an 
integer) with EX, —0 and R, ,— EX, X,. In the 
following, we write simply f for (IZ. We now 
restrict our attention to the class of X, for 
which there exist functions {u,(A)} defined on 
[ —1/2, 1/2] such that R, , can be expressed as 
R, s= u,(2)u,(A) d4(4), where u(4) is a measure. 
u,(A) should satisfy {|u,(A)|? du(4) « +00. Then 
X, admits a representation of the form X,— 
fu, (4)dZ(A), where Z(A) is a process with 
orthogonal increments and E|dZ(A)|? =dy(A). 
If u,(A) is expressed as u,(4) = y,(4)e?*9* and 
y,(4) is of the form y,(4) = f e?""" dT ,(w) with 
[dT ;(w)| having the absolute maximum at w= 
0, we call u,(4) an oscillatory function and 
X, an oscillatory process. The evolutionary 
power spectrum dF,{A) is defined by dF,(A) = 
(AI? du (A). 

Other models, such as an autoregressive 
model whose coefficients vary with time or 
whose associated polynomial has roots outside 
the unit circle, have also been discussed. 
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422 (IV.7) 
Topological Abelian Groups 


A. Introduction 


A commutative topological group is called a 
topological Abelian group. Throughout this 
article, except in Section L, all topological 
groups under consideration are locally com- 
pact Hausdorff topological Abelian groups 
and are simply called groups (— 423 Topolog- 
ical Groups). 


B. Characters 


A character of a group is a continuous func- 
tion y(x) (xe G) that takes on as values com- 
plex numbers of absolute value 1 and satis- 
fies y(xy) — x(x)x( y). Equivalently, y is a 1- 
dimensional and therefore an irreducible 
*unitary representation of G. Conversely 
any irreducible unitary representation of G 
is 1-dimensional. Indeed, for a topological 
Abelian group, the set of its characters coin- 
cides with the set of its irreducible unitary 
representations. If the product of two char- 
acters y, 7' is defined by yy '(x) = y(x)y'(x), then 
the set of all characters forms the character 
group C(G) of G. With tcompact-open topo- 
logy, C(G) itself becomes a locally compact 
topological Abelian group. 


C. The Duality Theorem 


For a fixed element x of G, y(x) (ye C(G)) is 

a character of C(G), namely, an element of 
CC(G). Denote this character of C(G) by x(y), 
and consider the correspondence G3x x(y). 
That this correspondence is one-to-one follows 
from the fact that any locally compact G has 
*sufficiently many irreducible unitary repre- 
sentations (— 437 Unitary Representations) 
and the fact that if G is an Abelian group, then 
any irreducible unitary representation of G isa 
character of G. Furthermore, any character 
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of C(G) is given as one of the x(y); indeed, by 
this correspondence, we have G= CC(G) 
(Pontryagin's duality theorem). 

By the duality theorem, each of G and C(G) 
is isomorphic to the character group of the 
other. In this sense, G and C(G) are said to be 
dual to each other. 


D. Correspondence between Subgroups 


Let G, G' = C(G) be groups that are dual to 
each other. Given a closed subgroup g of G, 
the set of all y’ such that y'(x)= 1 for all x ing 
forms a closed subgroup of G', usually denoted 
by (G', g). The definition of (G, g’) is similar. 
Then gelt, g) - g' gives a one-to-one corre- 
spondence between the closed subgroups of G 
and those of G’. If g, 2g;, then g,/g; and 
(G',g5)/(G', g,) are dual to each other. If the 
group operations of G, G' are written in addi- 
tive form, with 0 for the identity, then x(y) 1 
is written as x(y') 20. In this sense, (G’, g) is 
called the annihilator (or annulator) of g. 


E. The Structure Theorem 


Let 9f be the set of all groups (more precisely, 
of all locally compact Hausdorff topological 
Abelian groups). If Gi, G; eM, then the direct 
product G, x G,eU, and if Ge% and H isa 
closed subgroup of G, then Hedi and G/H et 
In addition, if H is a closed subgroup of a 
group G such that H e9I and G/H EM, then 
Ge. In other words, 9f is closed under the 
operations of forming direct products, closed 
subgroups, quotient groups, and *extensions 
by members of W. Furthermore, the operation 
C that assigns to each element of YI its dual 
element is a reflexive correspondence of 9I 
onto YW, and if G >H, the annihilator (C(G), H) 
of H is a closed subgroup of C(G). Also, 
C(G/H) x (C(G), H), C(H)z C(G)/(C(G), H). 
Furthermore, C(G, x G;)z C(G,) x C(G,). 
Finally, H =(G,(C(G), H )) (reciprocity of 
annihilators). 

Typical examples of groups in % are the 
additive group R of real numbers, the additive 
group Z of rational integers, the 1-dimensional 
*torus group T=R/Z, and finite Abelian 
groups F. The torus group T is also isomor- 
phic to the multiplicative group U(1) of com- 
plex numbers of absolute value 1. The direct 
product R" of n copies of R is the vector group 
of dimension n, and the direct product T" of 
n copies of T is the torus (or torus group) 
of dimension n (or n-torus). Both T" and F 
are compact, while R" and Z" are not. We 
have C(R)=R, C(T) 2 Z, C(Z)- T. Any finite 
Abelian group F is isomorphic to its character 
group C(F). The direct product of a finite 
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number of copies of R, T, Z, and a finite 
Abelian group F, namely, a group of the form 
R’ x T” x Z” x F, is called an elementary topo- 
logical Abelian group. 

Any group in Y is isomorphic to the direct 
product of a vector group of some dimension 
and the extension of a compact group by a 
discrete group (the structure theorem). Hence, 
if the effect of the operation C is explicitly 
known, then the problem of finding the struc- 
ture of groups in Y is reduced to the pro- 
blem concerning discrete groups alone. For 
the structure of groups in YM, the following 
theorem is known: If Ge% is generated by a 
compact neighborhood of the identity e, then 
G is isomorphic to the direct product of a 
compact subgroup K and a group of the form 
R" x Z" (n, m are nonnegative integers). Then 
any compact subgroup of G is contained in K, 
which is the unique maximal compact sub- 
group of G. A group Gel generated by a 
compact neighborhood of e is the *projec- 
tive limit of elementary topological Abelian 
groups. L. S. Pontryagin first proved a struc- 
ture theorem of this type and then the duality 
theorem. 


F. Compact Elements 


An element a of a group Ge% is called a com- 
pact element if the cyclic group (a"|neZ] gen- 
erated by a is contained in a compact subset 
of G. The set C, of all compact elements of G 
is a closed subgroup of G, and the quotient 
group G/C, does not contain any compact 
element other than the identity. In particular, 
if G 1s generated by a compact neighborhood 
of the identity, then C, coincides with the 
maximal compact subgroup K of G. Let Co 
be the set of all compact elements of a group 
Ge. The annihilator (C(G), Co) is a con- 
nected component of the character group 
C(G) of G. If G is a discrete group, then a 
compact element of G is an element of G of 
finite order. 


G. Compact Groups and Discrete Groups 


Suppose that two groups G, X €9I are dual to 
each other. Then one group is compact if and 
only if the other group is discrete. By the du- 
ality theorem, the properties of a compact 
Abeltan group G can be stated, in principle, 
through the properties of the discrete Abelian 
group C(G). The following are a few such 
examples. Let G be a compact Abelian group. 
Then its *dimension is equal to the trank of the 
discrete Abelian group C(G). A subgroup Y of 
a discrete Abelian group X is called a divisible 
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subgroup if the quotient group X/Y contains 
no element of finite order other than the iden- 
tity. A compact Abelian group G is locally 
connected if and only if any finite subset of the 
character group C(G) is contained in some 
divisible subgroup of C(G) generated by a 
finite number of elements. Hence if a compact 
locally connected Abelian group G has an 
topen basis consisting of a countable number 
of open sets, then G is of the form T^ x F, 
where F is a finite Abelian group and T° is the 
direct product of an at most countable number 
of 1-dimensional torus groups T. 


H. Dual Decomposition into Direct Products 


Let G be a compact or discrete Abelian group, 
and let M= ( H,|ae Aj be a family of closed 
subgroups of G. Let A(9) = (,., Ha, and 
denote by Z(M) the smallest closed sub- 
group of G containing | J,. 4 H,. Then, with 

Q — 1(C(G), H,)|xe A}, the relations A(Q) = 
(C(G), Z(9)) and Z(0) 2 (C(G), AWM) hold. 
Furthermore, suppose that G is decomposed 
into the direct product G = [T,. 4 H,, and for 
each ae A put K,— Z(9t — (H,]), X,— 

(C(G), K,). Then X, is the character group of 
H,, and C(G) can be decomposed into the 
direct product C(G) 2 TT,. 4 Xa. This decompo- 
sition of C(G) into a direct product is called 
the dual direct product decomposition corre- 
sponding to the decomposition G=[],.,H,. 


I. Orthogonal Group Pairs 


Suppose that for two groups G, G' there exists 
a mapping (x, x')— xx’ of the Cartesian prod- 
uct G x G' into the set U(1) of all complex 
numbers of absolute value 1 such that 


6 x3)x' ele x) (x2x°), 
la) x5) = (x4 (xx). 


Then G, G' are said to form a group pair. Sup- 
pose that G, G' form a group pair, and con- 
sider xx’ to be a function x(x’) in x’. If two 
functions x, (x') and x,({x’) coincide only when 
X, =x, and the same ts true when the roles 

of G and G' are interchanged, then G, G' are 
said to form an orthogonal group pair. If G is 
a compact Abelian group, G' is a discrete 
Abelian group, and G, G' form an orthogonal 
group pair, then G, G' are dual to each other. 


J. Commutative Lie Groups 


An elementary topological Abelian group 
R’ x T” x Z" x F is a commutative *Lie group. 
Conversely, any commutative Lie group G 
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generated by a compact neighborhood of the 
identity is isomorphic to an elementary topo- 
logical Abelian group. In particular, any con- 
nected commutative Lie group G is isomor- 
phic to R' x T" for some (and m. A closed 
subgroup H of the vector group R” of dimen- 
sion n is isomorphic to R? x Z?^ (0<p+q <n). 
More precisely, there exists a basis a,,...,a, of 
the vector group R” such that H — [X P, x;a; 4- 
M pana; x;e R,n;eZ;. Hence the quo- 
tient groups of R” that are ‘separated topolog- 
ical groups are all isomorphic to groups of the 
form R' x T" (0 € 1-- m n). Any closed sub- 
group of the torus group T" of dimension n is 
isomorphic to a group of the form T^ x F 

(0x p x n), where F is a finite Abelian group. 
Hence the quotient groups of T" that are 
separated topological groups are all tsomor- 
phic to T" (0 m n). A tregular linear trans- 
formation of the linear space R" is a continu- 
ous automorphism of the vector group R’, 
and in fact, any continuous automorphism of 
R" is given by a regular linear transformation. 
Indeed, the group of all continuous automor- 
phisms of R" is isomorphic to the *general 
linear group GL(n, R) of degree n. Any continu- 
ous automorphism of the torus group T" — 
R"/Z" of dimension n is given by a regular 
linear transformation q of R” such that o(Z") 
= Z". Hence the group of continuous auto- 
morphisms of T" is isomorphic to the multi- 
plicative group of all n x n matrices, with de- 
terminant +1 and with entries in the set of 
rational integers. 


K. Kronecker's Approximation Theorem 


Let H be a subgroup of a group Ge% (not 
necessarily closed). Then (G,(C(G), H )) coin- 
cides with the closure H of H. In particular, H 
is *dense in G if and only if the annihilator 
(C(G), H ) consists of the identity alone. Now 
let G— R" and let H be the subgroup of R” 
generated by 0 —(0,,...,0,)e R” and the na- 
tural *basis e, =(1,0,...,0),...,e,=(0,...,0, D) 
of R". Then H is dense in R" if and only if 

(R", H)= {0}; that is, 0,,...,0,, are linearly 
independent over the rational number field Q 
(Kronecker's approximation theorem). This 
theorem implies that the torus group T" of 
dimension n has a cyclic subgroup and a 1- 
parameter subgroup that are both dense in T". 


L. Linear Topology 


Consider the discrete topology in a field Q. 
Suppose that an O-module G has a topology 
that satisfies *Hausdorff's separation axiom 
and is such that a base for the neighborhood 
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system of the zero element 0 consists of Q- 
submodules, and suppose that G together with 
this topology constitutes a topological Abelian 
group. Then this topology is called a linear 
topology. If a linear topology ts restricted to a 
O-submodule, then it is also a linear topology. 
If G is of finite rank, then any linear topology 
is the discrete topology. The discrete topology 
on G is a linear topology. Let H be a Q- 
submodule. Then the subset V = H +g of G 
obtained by translating H by an element g of 
G is called a linear variety in G. If V is a linear 
variety, then V is also a linear variety. If Q- 
modules G, G' have linear topologies, a homo- 
morphism of G into G' is always assumed to 
be open and continuous with respect to these 
topologies. A linear variety V in G is said to be 
linearly compact if, for any system (V, j of 
linear varieties closed in V with the ‘finite 
intersection property, we have ( ), V, 2j. In 
this case V is closed in G. If linearly compact 
Q-submodules can be chosen as a base for the 
neighborhood system of the zero element of 
G, we say that G is locally linearly compact. 
The set Co(G) of homomorphisms of an Q- 
module G with linear topology into Q is also 
an O-module. For any linearly compact Q- 
submodule H of G, let U(H)—(y|x(g) — 

0, ge H Then, with {U(H)} as a base for the 
neighborhood system, a linear topology can 
be introduced in C,(G). According as G is 
discrete, linearly compact, or locally linearly 
compact, Co(G) ts linearly compact, discrete, 
or locally linearly compact. Let G, H be Q- 
modules each of which has a linear topology, 
and let o:G3g 9,€ Co(H), y: HShovy,e 
Co(G) be homomorphisms such that o, (I) « 
W,(g). Then if one of o, y is an isomorphism, 
so 1s the other. This is an analog of the Pon- 
tryagin duality theorem and is called the 
duality theorem for O-modules. In particular, 
a linearly compact Q-module is the direct sum 
of 1-dimensional spaces (S. Lefschetz [3]). 
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A. Definitions 


If a *group G has the structure of a *topolog- 
ical space such that the mapping (x, y) 5 xy 
(product) of the Cartesian product G x G into 
G and the mapping xx '! (inverse) of G into 
G are both continuous, then G is called a topo- 
logical group. The group G without a topo- 
logical structure is called the underlying group 
of the topological group G, and the topolog- 
ical space G is called the underlying topological 
space of the topological group G. Let G, G' be 
topological groups. A mapping f of G into G' 
is called an isomorphism of the topological 
group G onto the topological group G' if f 

is an *isomorphism of the underlying group 

G onto the underlying group G' and also a 
thomeomorphism of the underlying topolog- 
ical space G onto the underlying topological 
space G'. Two topological groups are said to 
be isomorphic if there exists an isomorphism of 
one onto the other. 


B. Neighborhood Systems 


Let 9i be the *neighborhood system of the 
identity e of a topological group G. Namely, N 
consists of all subsets of G each of which con- 
tains an open set containing the element e. 
Then % satisfies the following six conditions: 
(1) If VeM and U c V, then Vee. (ii) If U, 
Hei, then UN Ve. (iii) If VEN, then ee U. 
(iv) For any U eù, there exists a We9t such 
that WW z(xy|x, ye W] cU. (v) FUER, 
then U`! eS. (vi) If UeR and aeG, then 
aUa ‘eM. Conversely, if a nonempty family N 
of subsets of a group G satisfies conditions (1)- 
(vi), then there exists a *topology O of G such 
that 9t is the neighborhood system of e and 

G is a topological group with this topology. 
Moreover, such a topology is uniquely deter- 
mined by N. *Left translation xax and tright 
translation x xa in a topological group G are 
homeomorphisms of G onto G; thus if 9t is the 
neighborhood system of the identity e, then 
a3 = Nia is the neighborhood system of a, 
where aR = (aU | UER}. 

If the underlying topological space of a top- 
ological group G is a Hausdorff space, G is 
called a T,-topological group (Hausdorff topo- 
logical group or separated topological group). 
If the underlying topological space of a topo- 
logical group G is a tT,-topological space, 
then, as is easily seen, it is a tF] -topological 
space. If it is a T,-topological space, then by 
the fact that the topology may be defined by a 
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*uniformity, it is a fcompletely regular space, 
hence, in particular, a Hausdorff space (— Sec- 
tion G). Thus a topological group whose un- 
derlying topological space is a Ty-topological 
space is a T,-topological group. 


C. Direct Product of Topological Groups 


Consider a family (G,],. , of topological 
groups. The Cartesian product G=J],.,G, of 
the underlying groups of G, is a topological 
group with the *product topology of the un- 
derlying topological spaces of G,. This topo- 
logical group G = TT,.,4 G, is called the direct 
product of topological groups G, (oe A). 


D. Subgroups 


Let H bea subgroup of the underlying group 
of a topological group G. Then H is a topolog- 
ical group with the topology of a ttopological 
subspace of G (‘relative topology). This topo- 
logical group H is called a subgroup of G. A 
subgroup that is a closed (open) set is called a 
closed (open) subgroup. Any open subgroup is 
also a closed subgroup. For any subgroup H 
of a topological group G, the closure H of H 

is also a subgroup. If H is a normal subgroup, 
so is H. If H is commutative, so is H. In a T;- 
topological group G, the tcentralizer C(M)= 
{xeG|xm=mx (me M)} of a subset M of G 

is a closed subgroup of G. In particular, the 
*center C= C(G) of a T,-topological group is a 
closed normal subgroup. 


E. Quotient Spaces 


Given a subgroup H of a topological group G, 
let G/H = (aH |aeG} be the set of tleft cosets, 
and let p be the canonical surjection p(a) - aH 
of G onto G/H. Consider the *quotient topo- 
logy on G/H, namely, the strongest topology 
such that p is a continuous mapping. Since 
a subset A of G/H is open when p ^! (A) is an 
open set of G, p is also an topen mapping. 
The set G/H with this topology is called the 
left quotient space (or left coset space) of G by 
H. The right quotient space (or right coset 
space) H\G={Ha|aeG} is defined similarly. 
The quotient space G/H is discrete if and only 
if H is an open subgroup of G. The quotient 
space is a Hausdorff space if and only if H isa 
closed subgroup. If G/H and H are both tcon- 
nected, then G itself is connected. If G/H and 
H are both tcompact, then G is compact. If H 
is a closed subgroup of G and G/H, H are both 
"locally compact, then G is locally compact. 
Suppose that H is a normal subgroup of a 
topological group G. Then the quotient group 
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G/H 1s a topological group with the topology 
of the quotient space G/H. This topological 
group is called the quotient group of the topo- 
logical group G by the normal subgroup H. 


F. Connectivity 


The *connected component Go containing the 
identity e of a topological group G is a closed 
normal subgroup of G. The connected compo- 
nent that contains an element ac G is the coset 
aGo = Goa. Go is called the identity component 
of G. The quotient group G/G, is *totally dis- 
connected. A connected topological group G 
is generated by any neighborhood U of the 
identity. Namely, any element of G can be 
expressed as the product of a finite number of 
elements in U. Totally disconnected (in partic- 
ular, discrete) normal subgroups of a con- 
nected topological group G are contained in 
the center of G. 


G. Uniformity 


Let Ne be the neighborhood system of the 
identity of a topological group G, and let U, 
={(x, y)eG x GlyexU} for Ue9t,. Then a 
*uniformity having (U,] Ue %,} as a base is 
defined on G. This uniformity is called the left 
uniformity of G. Left translation xax of G is 
*uniformly continuous with respect to the left 
uniformity. The right uniformity is defined 
similarly by U, — f(x, y)| ye Ux]. These two 
uniformities do not necessarily coincide. The 
mapping xx ^! is a tuniform isomorphism of 
G considered as a uniform space with respect 
to the left uniformity onto the same group G 
considered as a uniform space with respect to 
the right uniformity. A topological group G 

is thus a *uniform space under a uniformity 
*compatible with its topology, and hence it is 
a completely regular space if the underlying 
topological space is a T,-space. 


H. Completeness 


If a topological group G is *complete with 
respect to the left uniformity, then it is also 
complete with respect to the right uniformity, 
and conversely. In this case the topological 
group G is said to be complete. A locally com- 
pact T,-topological group is complete. If a T,- 
topological group G is isomorphic to a dense 
subgroup of a complete T,-topological group 
G, then G is called the completion of G, and G 
is said to be completable. A T,-topological 
group G is not always completable. For a T,- 
topological group G to be completable it is 
necessary and sufficient that any tCauchy filter 





423 J 
Topological Groups 


of G considered as a uniform space with res- 
pect to the left uniformity is mapped to a 
Cauchy filter of the same uniform space G 
under the mapping xx '. Then the com- 
pletion G of G is uniquely determined up to 
isomorphism. A commutative T,-topological 
group always has a completion G, and G 

is also commutative. If each point of a T,- 
topological group G has a ‘totally bounded 
neighborhood, there exists a completion G, 
and G is locally compact. 


I. Metrization 


If a *metric can be introduced in a T,- 
topological group G so that the metric gives 
the topology of G, then G is said to be metri- 
zable. For a T,-topological group G to be met- 
rizable it is necessary and sufficient that G 
satisfy the first axiom of countability. Then 
the metric can be chosen so that it is left in- 
variant, i.e., invariant under left translation. 
Similarly, it can be chosen so that it is right 
invariant. In particular, the topology of a 
compact T,-topological group that satisfies 
the first axiom of countability can be given by 
a metric that is both left and right invariant. 


J. Isomorphism Theorems 


Let G and G' be topological groups. If a homo- 
morphism f of the underlying group of G 

into the underlying group of G' is a contin- 
uous mapping of the underlying topological 
space of G into that of G’, f is called a con- 
tinuous homomorphism. If f is a continuous 
open mapping, f is called a strict morphism (or 
open continuous homomorphism). A continuous 
homomorphism of a *paracompact locally 
compact topological group onto a locally 
compact T,-topological group is an open 
continuous homomorphism. 

A topological group G’ is said to be homo- 
morphic to a topological group G if there 
exists an open continuous homomorphism f of 
G onto G'. Let N denote the kernel f ^! (e) of f. 
Then the quotient group G/N is isomorphic to 
G', with G/N and G' both considered as topo- 
logical groups (homomorphism theorem). Let 
f be an open continuous homomorphism of a 
topological group G onto a topological group 
G', and let H be a subgroup of G'. Then H = 
f !(H')is a subgroup of G, and the mapping 
o defined by o(gH) — f(g)H' is a homeomor- 
phism of the quotient space G/H onto G'/H'. 
In particular, if H' is a normal subgroup, then 
H is also a normal subgroup and q is an iso- 
morphism of the quotient group G/H onto 
G'/H' as topological groups (first isomorphism 
theorem). Let H and N be subgroups of a topo- 
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logical group G such that HN — NH. Then the 
canonical mapping f:A(H N N)hN of the 
quotient space H/H (1 to HN/N is a con- 
tinuous bijection but not necessarily an open 
mapping. In particular, tf N is a normal sub- 
group of the group HN, then f is a continuous 
homomorphism. In addition, if f is an open 
mapping, the quotient groups H/H f! N and 
HN/N are isomorphic as topological groups 
(second isomorphism theorem). For example, f 
is an open mapping (1) if N is compact or (2) if 
G is locally compact, HN and N are closed 
subgroups of G, and H is the union of a count- 
able number of compact subsets. Let H bea 
subgroup of a topological group G and N be a 
normal subgroup of G such that H >N. Then 
the canonical mapping of the quotient space 
(G/N)/(H/N) onto G/H is a homeomorphism. 
In particular, if H is also a normal subgroup, 
the quotient groups (G/N)/(H/N) and G/H are 
isomorphic as topological groups (third iso- 
morphism theorem). 


K. The Projective Limit 


Let {G,},-4 be a family of topological groups 
indexed by a *directed set A, and suppose that 
if x « f, there exists a continuous homomor- 
phism f,,:G,—G, such that fa, = fag O fp, if 
x « f| <y. Then the collection (G,, fag} of the 
family {G,},<4 of topological groups together 
with the family | f,,} of mappings is called a 
projective system of topological groups. Con- 
sider the direct product [],.,G, of topological 
groups {G,!, and denote by G the set of all 
elements x= (x,),. 4 of [I G, that satisfy x, = 
fag (x5) for x< $. Then G is a subgroup of 
II G,. The topological group G obtained in 
this way is called the projective limit of the 
projective system {G,, fag} of topological 
groups and is denoted by G =lim G,. If each G, 
is a T;-topological (resp. complete) group, then 
G is also a T,-topological (complete) group. 
Now consider another projective system 
IG fag; of topological groups indexed by the 
same A, and consider continuous homomor- 
phisms u,: G,— G; such that u, O fag = fag O Ug 
for x< f. Then there exists a unique continu- 
ous homomorphism u of G —lim G, into G' = 
lim G, such that for any «€ A, uo f, — f; ou 
holds, where f,( fz) 1s the restriction to G(G’) of 
the projection of [I G, (T G;) onto G,(G;). The 
homomorphism u is called the projective limit 
of the family {u,} of continuous homomor- 
phisms and is denoted by u=lim u,. Let G be 
a T,-topological group, and let {H,},.4 bea 
decreasing sequence (H, > H, for x < f) of 
closed normal subgroups of G. Consider the 
quotient group G/H,, and let f,, be the canon- 
ical mapping gH, gH, of G; to G, for a <$. 
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Then 1G,, fg} is a projective system of topo- 
logical groups. Let f, be the projection of G 
onto G,— G/H,, and let f —lim f,. Now assume 
that any neighborhood of the identity of G 
contains some H, and that some H, ts com- 
plete. Then f —lim f, is an isomorphism of G 
onto lim G/H, as topological groups. (For a 
general discussion of the topological groups 
already discussed — [1,4].) 


L. Locally Compact Groups 


For the rest of this article, all topological 
groups under consideration are assumed to 

be T,-topological groups. The identity com- 
ponent Go of a locally compact group G is the 
intersection of all open subgroups of G. In 
particular, any neighborhood of the identity of 
a totally disconnected locally compact group 
contains an open subgroup. A totally dis- 
connected compact group is a projective limit 
of finite groups with discrete topology. 

A T,-topological space L ts called a local Lie 
group if it satisfies the following six conditions: 
(1) There exist a nonempty subset M of Lx L 
and a continuous mapping 4: M > L, called 
multiplication ( (a, b) is written as ab). (11) If 
(a, b), (ab, c), (b, c), (a, bc) are all in M, then (ab)c 
— a(bc). (iii) L contains an element e, called the 
identity, such that L x {e} c M and ae=a for 
all ae L. (iv) There exists a nonempty open 
subset N of L and a continuous mapping v: N 
>L such that av(a) — e for all ae N. (v) There 
exist a neighborhood U of e in L and a homeo- 
morphism f of U into a neighborhood V 
of the origin in the Euclidean space R". (vi) Let 
D be the open subset of V x V defined by D = 
(06 EV xX VIG x, f QDeM,. f^! 09. 

J (y)eU]. Then the function F: DV defined 
by F(x, y) DUT (x), / ! (y) is of *class C^. 

For any neighborhood U of the identity e of 
a connected locally compact group G, there 
exist a compact normal subgroup K and a 
subset L that is a local Lie group under the 
*induced topology and the group operations of 
G such that the product LK is a neighborhood 
of e contained in U. Furthermore, under (/, k) 
lk, LK is homeomorphic to the product 
space L x K. Any compact subgroup of a 
connected locally compact group G is con- 
tained in a maximal compact subgroup, and 
maximal compact subgroups of G are *conju- 
gate. For a maximal compact subgroup K of 
G, there exists a finite number of subgroups 
H,,...,H, of G, each of which is isomorphic to 
the additive group of real numbers such that G 
— KH, ...H,, and the mapping (k,^,, ..., bi 
—kh, ... h, is a homeomorphism of the direct 
product K x H, x... x H, onto G. Any locally 
compact group has a left-invariant positive 
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measure and a right-invariant positive mea- 
sure, which are uniquely determined up to 
constant multiples (— 225 Invariant Mea- 
sures). Using these measures, the theory of 
harmonic analysis on the additive group R of 
real numbers can be extended to that on G (— 
69 Compact Groups; 192 Harmonic Analysis; 
422 Topological Abelian Groups; 437 Unitary 
Representations). 


M. Locally Euclidean Groups 


Suppose that each point of a topological 
group G has a neighborhood homeomorphic 
to an open set of a given Euclidean space. 
Then G is called a locally Euclidean group. If 
the underlying topological space of a topolog- 
ical group has the structure of a *real analytic 
manifold such that the group operation (x, y) 
xy !is a real analytic mapping, then G is 
called a *Lie group. A Lie group is a locally 
Euclidean group. 


N. Hilbert's Fifth Problem 


Hilbert's fifth problem asks if every locally 
Euclidean group is a Lie group (— 196 Hil- 
bert). This problem was solved affirmatively 
in 1952; it was proved that any *locally con- 
nected finite-dimenstonal locally compact 
group is a Lie group (D. Montgomery and L. 
Zippin [3]). In connection with this, the rela- 
tion between Lie groups and general locally 
compact groups has been studied, and the 
following results have been obtained: A neces- 
sary and sufficient condition for a locally 
compact group to be a Lie group is that there 
exist a neighborhood of the identity e that 
does not contain any subgroup (or any normal 
subgroup) other than 1e]. A locally compact 
group has an open subgroup that is the projec- 
tive limit of Lie groups. Hilbert's fifth problem 
is closely related to the following problem: 
Find the conditions for a ttopological trans- 
formation group operating teffectively on a 
manifold to be a Lie group (— 431 Transfor- 
mation Groups). 


O. Covering Groups 


Let © be the collection of all tarcwise con- 
nected and *locally arcwise connected T,- 
topological groups. Suppose that G* eG is a 
*covering space of Ge © and the ‘covering 
mapping f:G*—G is an open continuous 
homomorphism, with G* and G considered as 
topological groups. Then G* (or, more pre- 
cisely, (G*, f )) is called a covering group of G. 
Then the kernel f ! (e) - D of f is a discrete 
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subgroup contained in the center of G*, and 
G*/D and G, considered as topological groups, 
are isomorphic to each other. Let z,(G) be the 
‘fundamental group of G. The natural homo- 
morphism / * :z,(G*)— n,(G) induced by f is 
an injective homomorphism, and if we identify 
1,(G*) with the subgroup f *(1, (G*)) of 2,(G), 
we have D zn, (G)/n, (G*). Conversely, if D is 
any discrete subgroup contained in the center 
of G* € (5, then G* is a covering group of G 

= G*/D. For any covering space (G*, f) of 
Ge, a multiplication law can be introduced 
in G* so that G* is a topological group be- 
longing to © and (G*, f) is a covering group of 
G. In particular, any Ge © has a *simply con- 
nected covering group (G, o). Then for any 
covering group (G*, f) of G, there exists a 
homomorphism f *: 6G G*, and (G, f *) is a 
covering group of G*. Furthermore, o = f of *. 
Hence, in particular, any simply connected 
covering group of G is isomorphic to G, with 
G and G considered as topological groups. 
This simply connected covering group (G, ¢) is 
called the universal covering group. 

Let G and G' be topological groups, and let 
e and e' be their identities. A homeomorphism 
f of a neighborhood U of e onto a neighbor- 
hood U' of e' is called a local isomorphism of G 
to G' if it satisfies the following two conditions: 
(1) If a, b, ab are all contained in U, then f(ab) 
 f(a)f(b). (ii) Let f^! =g, then if a’, b’, a'b' 
€ U’, g(a'b') 2 g(a)g(b') holds. If there exists 
a local isomorphism of G to G', we say that G 
and G' are locally isomorphic. If G* is a cover- 
ing group of G, then G* and G are locally 
isomorphic. For two topological groups G and 
G' to be locally isomorphic it is necessary and 
sufficient that the universal covering groups of 
G and G' be isomorphic. For two connected 
Lie groups to be locally isomorphic it is neces- 
sary and sufficient that their *Lie algebras be 
isomorphic. 

Let f be a mapping of a neighborhood U of 
the identity of a topological group G into a 
group H such that if a, b, ab are all contained 
in U, then f(ab)= f(a)f(b). Then f is called a 
local homomorphism of G into H and U is 
called its domain. A local homomorphism of a 
simply connected group Ge (5 into a group H 
can be extended to a homomorphism of G into 
H if the domain is connected [2,4]. 


P. Topological Rings and Fields 


If a ring R has the structure of a topological 
group such that (x, y)-ox + y (sum) and (x, y) 
—xy (product) are both continuous mappings 
of R x R into R, then R is called a topological 
ring. If a topological ring K is a field (not 


necessarily commutative) such that xx"! 
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(inverse element) is a continuous mapping of 
K* = K — {0} into K*, then K is called a topo- 
logical field. Let us assume that K is a topo- 
logical field that is a locally compact Haus- 
dorff space and is not discrete. If K is con- 
nected, then K 1s a tdivision algebra of finite 
rank over the field R of real numbers; hence 
it is isomorphic to the field R of real numbers, 
the field C of complex numbers, or the tqua- 
ternion field H. If K is not connected, then K 
is totally disconnected and is isomorphic to a 
division algebra of finite rank over the tp-adic 
number field Q, or a division algebra of finite 
rank over the ‘formal power series field with 
coefficients in a finite field [4]. 

For various important classes of topological 
groups — 69 Compact Groups; 249 Lie 
Groups; 422 Topological Abelian Groups; 424 
Topological Linear Spaces. 
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A. Definition 


A tlinear space E over the real or complex 
number field K is said to be a topological 
linear space, topological vector space, or linear 
topological space if E is a *topological space 
and the basic operations x + y and ax (x, ye E, 
«€ K) in the linear space are continuous as 
mappings of E x E and K x E, respectively, 
into E. The coefficient field K may be a gen- 
eral ttopological field, although it is usually 
assumed to be the real number field R or the 
complex number field C, and accordingly E is 
called a real topological linear space or a com- 
plex topological linear space. Topological linear 
spaces are generalizations of ‘normed linear 
spaces and play an important role in the study 
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of *function spaces, such as the tspace of dis- 
tributions, that are not fBanach spaces. 

Each topological linear space E is equipped 
with a tuniform topology in which translations 
of the neighborhoods of zero form a tuniform 
family of neighborhoods, and the addition x 
+y and the multiplication xx by a scalar a are 
uniformly continuous relative to this uniform 
topology. In particular, if for each x z:0 there 
is a neighborhood of the origin that does not 
contain x, then E satisfies the *separation 
axiom T, and hence is a tcompletely regular 
space. The tcompletion Ê of E is also a topo- 
logical linear space. 

We assume in this article that K is the real 
or complex number field and E is a topological 
linear space over K satisfying the axiom of T,- 
spaces. Then E is finite-dimensional if and only 
if E has a ttotally bounded neighborhood of 
zero. The topology of E is *metrizable if and 
only if it satisfies the ffirst countability axiom. 


B. Linear Functional 


A K-valued function f(x) on E is said to be a 
linear functional if it satisfies (1) f(x + y)= f(x) 
+ f(y) and (ii) f(xx) — af(x). A linear functional 
that is continuous relative to the topologies 

of E and K is said to be a continuous linear 
functional. (Sometimes continuous linear 
functionals are simply called linear functionals, 
while abstract linear functionals are called 
algebraic linear functionals.) The following 
three statements are equivalent for linear 
functionals f(x):(1) f(x) is continuous; (ii) the 
half-space (xe E|Re f(x)» 0] is open; (iii) the 
hyperplane (x € El f(x) — 0] is closed. 


C. The Hahn-Banach Theorem 


A linear functional f(x) defined on a linear 
subspace F of E can be extended to a continu- 
ous linear functional on E if and only if there 
exists an open ‘convex neighborhood V of the 
origin in E that is disjoint with {xe F | f(x) =1}. 
Furthermore, if f(x) can be extended, at least 
one extension f(x) never takes the value 1 on 
V (Hahn-Banach theorem). 


D. Dual Spaces 


The set E' of all continuous linear functionals 
on £ is called the dual space of E 1t is often 
denoted by E* and is also called the conjugate 
space or adjoint space. It forms a linear space 
when f+g and af (f, g€ E, «€ K) are defined 
by (f - g)G) 9 f(x) + g(x) and (af )(x) = a( f(x) 


for xe E. 
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E. Locally Convex Spaces 


A topological linear space is said to be locally 
convex if it has a family of convex sets as a 
"base of the neighborhood system of 0. It 
follows from the Hahn-Banach theorem that 
for each x Z0 in a locally convex space E there 
is a continuous linear functional f such that 
f(x) z A subset M of E is said to be circled if 
M contains «M — (ax |xe Mj whenever |x| x 1. 
A set that is both circled and convex is called 
absolutely convex. In a locally convex space, a 
family of absolutely convex and closed sets can 
be chosen as a base of the neighborhood sys- 
tem of the origin. Let A and B be subsets of E. 
A is said to absorb B if there is an x 0 such 
that «A >B. A set V that absorbs every point 
cet is called absorbing. Neighborhoods of 0 
are absorbing. 


F. Seminorms 


A real-valued function p(x) on E is said to be 
a seminorm (or pseudonorm) if it satisfies (i) 
0x p(x) « + (xe Ey; (it) p(x - y) < p(x) + 

p(y); and (iii) p(ax) — |x|p(x). The relation V = 
ix|p(x) € 1} gives a one-to-one correspon- 
dence between seminorms p(x) and absolutely 
convex absorbing sets V whose intersection 
with any line through the origin is closed. 

In terms of seminorms, the Hahn-Banach 
theorem states: Let E be a linear space on 
which a seminorm p(x) is given. If a linear 
functional f(x) defined on a linear subspace F 
of E satisfies | f(x)| < p(x) on F, then f(x) can 
be extended to the whole space E in such a 
way that the inequality holds on E 

The topology of a locally convex space is 
determined by the family of continuous semi- 
norms on it. Conversely, if there is a family 
of seminorms {p,(x)} (AEA) on a linear space 
E over K that satisfies (tv) p; (x) —O for all 
^ implies x —0, then there exists on E the 
weakest locally convex topology that renders 
the seminorms continuous. This topology is 
called the locally convex topology determined 
by {p,(x)}. 

We assume that E is a locally convex space 
whose topology is determined by the family of 
seminorms {p,(x)} (4€ A). Then a tnet x, of E 
converges to x if and only if p; (x, — x)-50 for 
all 4€ A. If F is a locally convex space whose 
topology is determined by the family of semi- 
norms {q,(y)}, then a necessary and sufficient 
condition for a linear mapping u: E F to be 
continuous is that for every q,(y) there exist a 
finite number of 4,,..., 4,€ ^ and a constant 
C such that q,(u(x)) < C(p; (lt... + p; (x) 
(x € E). 

A set is said to be bounded if it is absorbed 
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by every neighborhood of zero. When the 
topology of E is determined by the family 
{p,(x)} of seminorms a set B is bounded if and 
only if every p; is bounded on B. Totally 
bounded sets are bounded. The unit ball in 

a normed space is bounded. Conversely, a 
locally convex space is normable if it has a 
bounded neighborhood of 0. A locally convex 
space is called quasicomplete if every bounded 
closed set is complete. Since Cauchy sequences 
{x,} are totally bounded, all Cauchy sequences 
converge in a quasicomplete space (e, the 
space is sequentially complete). 


G. Pairing of Linear Spaces 


Let E and F be linear spaces over the same 
field K. A K-valued function B(x, y) (xe E, 
yeF)on Ex F is called a bilinear functional or 
bilinear form if for each fixed ye F (resp. xe E), 
it is a linear functional of x (resp. y). When a 
bilinear functional (x, y» on E x F is given so 
that (x, y» =0 for all ye F (all xe E) implies 

x =0(y=0), then E and F are said to form a 
(separated) pairing relative to the inner product 
(x, y». A locally convex space E and its dual 
space E' form a pairing relative to the natural 
inner product (x, x» 2 x'(x) (xe E, x' e E). 


H. Weak Topologies 


When E and F form a pairing relative to an 
inner product <x, y5, the locally convex top- 
ology on E determined by the family of semi- 
norms {|<x, y»|| ye F} is called the weak top- 
ology (relative to the pairing (E, F5) and is 
denoted by e(E, F). A net x, in E is said to 
converge weakly if it converges in the weak 
topology. When E and E are a locally convex 
space and its dual space, c(E, E’) is called the 
weak topology of E, and c(E', E) the weak* 
topology of E'. The weak topology on a lo- 
cally convex space E is weaker than the orig- 
inal topology on E. Consequently, a weakly 
closed set is closed. If the set is convex, the 
converse holds, and hence a convex closed set 
is weakly closed. Also, boundedness ts pre- 
served if we replace the original topology by 
the weak topology. Thus a weakly bounded 
set is bounded. 

Let E and F form a pairing relative to 
(x, y», and let A be a subset of E. Then the set 
A" of points ye F satisfying Re(x, y» > —1 for 
all xe A is called the polar of A (relative to the 
pairing). If A is absolutely convex, A" is also 
absolutely convex and is the set of points y 
such that |<x, y»| € 1 for all xe A. If A isa 
convex set containing zero, its (weak) closure 
is equal to the bipolar A°° =(A°)° (bipolar 
theorem). In general, let A be a subset of a 
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topological linear space E. We call the smallest 
closed convex set containing A the closed 
convex hull of A. If E is locally convex, the 
bipolar A^^ relative to E coincides with the 
closed convex hull of AU {0}. 

A subset B of the dual space E is *equi- 
continuous on E if and only if it is contained 
in the polar V^ of a neighborhood V of 0 in E. 
Also, V^ is weak*- compact in E (Banach- 
Alaoglu theorem). 


I. Barreled Spaces and Bornological Spaces 


An absorbing absolutely convex closed set in a 
locally convex space E is called a barrel. In a 
sequentially complete space (hence in a quasi- 
complete space also), a barrel absorbs every 
bounded set. A locally convex space is said to 
be barreled if each barrel is a neighborhood of 
0. A locally convex space is said to be quasi- 
barreled (or evaluable) if each barrel that ab- 
sorbs every bounded set is a neighborhood of 
0. Furthermore, a locally convex space is said 
to be bornological if each absolutely convex set 
that absorbs every bounded set is a neighbor- 
hood of 0. Bornological spaces are quasi- 
barreled. However, they are not necessarily 
barreled. Furthermore, barreled spaces are not 
necessarily bornological. A metrizable locally 
convex space, Le, a space whose topology is 
determined by a countable number of semi- 
norms, is bornological. A complete metrizable 
locally convex space is called a locally convex 
Fréchet space ((F)-space or simply Fréchet 
space). To distinguish it from Fréchet space as 
in 37 Banach Spaces, it is sometimes called a 
Fréchet space in the sense of Bourbaki. (F)- 
spaces are bornological and barreled. 

A continuous linear mapping u: E F of one 
locally convex space into another maps each 
bounded set of E to a bounded set in F. Con- 
versely, if E is bornological, then each linear 
mapping that maps every bounded sequence 
to a bounded set is continuous. 


J. The Banach-Steinhaus Theorem 


In the dual space of a barreled space E, each 
(weak*-)bounded set is equicontinuous. Thus 
if a sequence of continuous linear mappings 
u, of E into a locally convex space F con- 
verges at each point of E, then u, converges 
uniformly on each totally bounded set of E, 
and the limit linear mapping is continuous 
(Banach-Steinhaus theorem). 


K. The S-Topology 


Let E and F be paired linear spaces relative 
to the inner product (x, y». When a family S 
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of (weakly) bounded sets of F generates a 
dense subspace of F, the family of seminorms 
{supyeg|<x, y>|| BES} determines a locally 
convex topology on E. This is called the S- 
topology or topology of uniform convergence on 
members of S, because x,—x in the S-topology 
is equivalent to the uniform convergence of 
(x,, y» 5 Cx, y» on each BeS. The space E 
with the S-topology is denoted by E, The 
weak topology is the same as the topology 

of pointwise convergence. The S-topology in 
which S is the family of all bounded sets in F 
is called the strong topology and is denoted by 
P(E, EL The dual space E of a locally convex 
space E is usually regarded as a locally convex 
space with the strong topology BE. E). It is 
called the strong dual space. The topology of a 
locally convex space E is that of uniform con- 
vergence on equicontinuous sets of E The 
topology of a barreled space E coincides with 
the strong topology P(E, E"). 


L. Grothendieck's Criterion of Completeness 


Let E and F be paired spaces, and let S bea 
family of absolutely convex bounded sets of F 
such that: (i) the sets of S generate F; (ii) if B,, 
Bech, then there is a BES such that B, B, 
and B,>B,. Then E; is complete if and only 
if each algebraic linear functional f(y) on F 
that is weakly continuous on every Bes is ex- 
pressed as f(y) - (x, y> for some xe E. When 
E; is not complete, the space of all linear func- 
tionals satisfying this condition gives the com- 
pletion Ê; of Es. 


M. Mackey’s Theorem 


Let E, F, and S satisfy the same conditions 

as in Section L. Then the dual space of E, is 
equal to the union of the weak completions of 
AB, where 470 and Bes (Mackey's theorem). 


N. The Mackey Topology 


When E and F form a pairing, the topology on 
E of uniform convergence on convex weakly 
compact sets of F is called the Mackey topol- 
ogy and is denoted by t(F, F). The dual space 
of E endowed with a locally convex topology 
T coincides with F if and only if T is stronger 
than the weak topology ott. F) and weaker 
than the Mackey topology t(E, F) (Mackey- 
Arens theorem). A locally convex space is said 
to be a Mackey space if the topology is equal 
to the Mackey topology t(E, EL Every quasi- 
barreled space is a Mackey space. 
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O. Reflexivity 


Let E be a locally convex space. The dual 
space E" of the dual space E' equipped with 
the strong topology contains the original space 
E. We call E semireflexive if E" = E, and reflex- 
ive if in addition the topology of E coincides 
with the strong topology DE EL E is semi- 
reflexive if and only if every bounded weakly 
closed set of E is weakly compact. E is re- 
flexive if and only if E is semireflexive and 
(quasi)barreled. 

A barreled space in which every bounded 
closed set is compact is called a Montel space 
or (M)-space. (M)-spaces are reflexive, and 
their strong dual spaces are also (M)-spaces. 

Many of the function spaces that appear 
in applications are (F)-spaces or their dual 
spaces. For these spaces detailed consequences 
of the countability axiom are known [7,8]. A 
convex set C in the dual space E of an (F)- 
space E is weak*-closed if and only if for every 
neighborhood V of 0 in E, CA V° is weak*- 
closed (Krein-Shmul’ yan theorem). The 
strong dual space E' of an (F)-space E is 
(quasi)barreled if and only if it is bornological. 
In particular, the dual space of a reflexive (F)- 
space is bornological. 


P. (DF)-Spaces 


A locally convex space is called a (DF)-space 
if it satisfies: (i) There is a countable base of 
bounded sets (i.e., every bounded set is in- 
cluded in one of them); (11) if the intersection 

V of a countable number of absolutely con- 
vex closed neighborhoods of 0 absorbs every 
bounded set, then V is also a neighborhood of 
0. The dual space of an (F)-space is a (DF)- 
space, and the dual space of a (DF)-space is an 
(F)-space. A linear mapping of a (DF)-space E 
into a locally convex space F is continuous if 
and only if its restriction to every bounded set 
of E is continuous. A quasicomplete (DF)- 
space is complete. 


Q. Bilinear Mappings 


A bilinear mapping b(x, y) on locally convex 
spaces E and F (xe E, ye F) to a locally convex 
space G is said to be separately continuous if 
for each fixed ye F (xe E) it is continuous as a 
function of x (y). The linear mappings obtained 
from b(x, y) by fixing x (y) are denoted by 
b.(y) (b,(x)). We call b(x, y) hypocontinuous 

if for each bounded set B of E and B' of F, 

ib. Cy)Ixe Bj and {b,(x)| ye BY} are equicon- 
tinuous. A continuous bilinear mapping is 
hypocontinuous. However, the converse is 
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not always true. A separately continuous bi- 
linear mapping is not necessarily hypocontinu- 
ous. If both E and F are barreled, however, 
then every separately continuous mapping 

is hypocontinuous. If E is an (F)-space and 

F is metrizable, then every separately continu- 
ous bilinear mapping is continuous. Simi- 
larly, if both E and F are (DF)-spaces, then 
every hypocontinuous bilinear mapping is 
continuous. 


R. Tensor Products 


It is possible to introduce many topologies in 
the ttensor product E ® F of locally convex 
spaces E and F. The projective topology (or 
topology x) is defined to be the strongest topol- 
ogy such that the natural bilinear mapping 
Ex F>E QF is continuous. The dual space 
of E &, F is identified with the space B(E, F) 
of all continuous bilinear functionals on E x 
F. The completion of Ee, F is denoted by 
E & F. The topology of biequicontinuous con- 
vergence (or topology £) is defined to be the 
topology of uniform convergence on sets V* x 
U^, where V and U are neighborhoods of 0 
in E and F, respectively, considering the ele- 
ments of E Q F as linear functionals on E ot 
by the natural pairing of E & F and Eet, 
The completion of Ee, F is denoted by E & 
F. The dual space of E ®, F coincides with 
the subspace J(E, F) of B(E, F) composed of 
the union of the absolute convex hulls of the 
products V? ® U? of equicontinuous sets. The 
elements of J(E, F) are called integral bilinear 
functionals. 

Closely related to E Ô F is L. Schwartz's 
e tensor product Ec F [12]. (They coincide if E 
and F are complete and if E or F has the tap- 
proximation property.) Ec F can be regarded 
as (i) a space of vector-valued functions if E is 
a space of functions and F is an abstract lo- 
cally convex space, especially a space of func- 
tions of two variables 1f E and F are, respec- 
tively, spaces of functions of one variable, and 
(ii) a space of operators Go F if E is the dual 
space G' of a locally convex space G. 


S. Nuclear Spaces 


Let E be a locally convex space, V be an ab- 
solutely convex closed neighborhood of the 
origin, and p(x) be the seminorm correspond- 
ing to V. Then we denote by E, the normed 
space with norm p(x) obtained from E by 
identifying the two elements x and y with 
p(x—y)20. If U c V, then a natural linear 
mapping oy. y: Ey E, is defined. 

A locally convex space E is said to be a 


424 T 
Topological Linear Spaces 


nuclear space (resp. Schwartz space or simply 
(S)-space) if for each absolutely convex closed 
neighborhood V of 0 there is another U such 
that @y , is a *'nuclear operator (resp. tcom- 
pact operator) as an operator of Ey into the 
completion of Ey. A nuclear space or (S)-space 
is an (M)-space if it is quasicomplete and 
quasibarreled. A locally convex space E is a 
nuclear space if and only if the topologies z 
and e coincide on the tensor product E ® F 
with any locally convex space F. Accordingly, 
it follows that B(E, F) - J(E, F). This can be 
regarded as a generalization of Schwartz's 
kernel theorem, which says that every sepa- 
rately continuous bilinear functional on 2, x 
2, is represented by an integral with kernel 

in Z;,. The theory of topological tensor prod- 
ucts and nuclear spaces is due to Grothen- 
dieck [9]. 

A locally convex space E is a nuclear (F)- 
space if and only if E is tsomorphic to a closed 
subspace of C^?( —oo, oo) (T. Komura and Y. 
Komura, 1966). An example of a nuclear (F)- 
space without basis is known (B. S. Mityagin 
and N. M. Zobin, 1974). 


T. Gelfand Triplet 


Let H and L be Hilbert spaces. If L is a dense 
subspace of H and the injection L5 H isa 
tHilbert-Schmidt operator, then H = H’ is 
regarded as a dense subspace of L' and the 


injection H' 5 L' is a Hilbert-Schmidt operator. 


In this case, (L, H, L’) is called a Gel’fand trip- 
let (or a rigged Hilbert space). 

A subset of H is called a cylindrical set if it 
is expressed in the form Py ! (B) by the ortho- 
gonal projection P, onto a finite-dimensional 
subspace F and a Borel subset B of F. Ifa 
finitely additive positive measure u with (ul, 
— ] defined on the cylindrical sets of H satisfies 
(i) u is countably additive on cylindrical sets 
for a fixed F and (ii) for any £» 0 there exists 
a 6>O such that lei <ô implies {yeH | 
| O6 y»| 2 1] « & then p is the restriction of a 
countably additive measure fi defined on the 
Borel subsets of L' (Minlos's theorem, 1959). 

Let T be a self-adjoint operator in H. Then 
T has a natural extension 7 in L' and almost 
every continuous spectrum 4 of T has an asso- 
ciated eigenvector x, in L': Tx, — dx, x, e L'. 


U. The Extreme Point Theorem 


Let A be a subset of a linear space E. A point 
x€ Á is said to be an extreme point if x is an 
extreme point of any real segment containing 
x and contained in A. If A is a compact convex 
subset of a locally convex space E, A is the 
convex closed hull of (1.e., smallest convex 
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closed set containing) the set of its extreme 
points (Krein-Milman theorem). In applica- 
tions it is important to know whether every 
point of A is represented uniquely as an inte- 
gral of extreme points. For a metrizable con- 
vex compact subset A of a locally convex 
space E, the following two conditions are 
equivalent (Choquet's theorem): (i) A is a sim- 
plex, i.e., if we put A = (4x, 4) xeA, 4» 0] 

C E x R!, the vector space A— A becomes a 
flattice with positive cone A; (ii) for any xe A 
there exists a unique positive measure U on 
A with Vull, — 1 such that ((x) — (lC v) du(y) 
(le E") and the support of u is contained in the 
set of extreme points of A. 


V. Weakly Compact Set 


A subset of a quasicomplete locally convex 
space is relatively weakly compact if and only 
if every sequence in the set has a weak ac- 
cumulation point (Eberlein's theorem). If E is a 
metrizable locally convex space, every weakly 
compact set of E is weakly sequentially com- 
pact (Shmul’yan’s theorem). If E is a quasi- 
complete locally convex space, the convex 
closed hull of any weakly compact subset is 
weakly compact (Kreins theorem). If E is not 
quasicomplete, this is not necessarily true. 


W. Permanence 


Each subspace, quotient space, direct product, 
direct sum, projective limit, and inductive limit 
(of a family) of locally convex spaces has a 
unique natural locally convex topology. These 
spaces, except for quotient spaces and induc- 
tive limits, are separated, and a quotient space 
E/A is separated if and only if the subspace A 
is closed. The limit of a sequence E, c E;c ... 
is said to be a strictly inductive limit if E, has 
the induced topology as a subspace of E,,,,. If 
E is a strictly inductive limit of a sequence E, 
such that E, is closed in E,,, or if E is the 
inductive limit of a sequence E, c Ec... such 
that the mapping E, E,,, maps a neigh- 
borhood of 0 to a relatively weakly compact 
set, then E is separated and each bounded set 
of E is the image of a bounded set in some E,. 
If E= | J E, is the strictly inductive limit of the 
sequence |E,], then the topology of E, coin- 
cides with the relative topology of E, c E. The 
strictly inductive limit of a sequence of (F)- 
spaces is called an (LF)-space. 

Any complete locally convex space (resp. 
any locally convex space) is (resp. a dense 
linear subspace of) the projective limit of 
Banach spaces. Every (F)-space E is the projec- 
tive limit of a sequence of Banach spaces E, — 
E; .... In particular, E is said to be a count- 
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ably normed space if the mappings E E, are 
one-to-one and |x||, € |x||,., for all xe E with 
E considered as a subspace E,. We call E a 
countably Hilbertian space if, in particular, the 
E, are ! Hilbert spaces. An (F)-space with at 
least one continuous norm is a nuclear space if 
and only if it is a countably Hilbertian space 
such that the mappings E,., — E, are Hilbert- 
Schmidt operators or nuclear operators. 

A locally convex space is bornological if 
and only if it is the inductive limit of normed 
spaces. À locally convex space is said to be 
ultrabornological if it is the inductive limit of 
Banach spaces, or in particular, if it is quasi- 
complete and bornological. 

Properties of spaces, such as being complete, 
quasicomplete, semireflexive, or having every 
bounded closed set compact, are inherited 
by closed subspaces, direct products, projec- 
tive limits, direct sums, and strictly inductive 
limits formed from the original spaces, and 
properties of spaces, such as being Mackey, 
quasibarreled, barreled, and bornological, are 
inherited by quotient spaces, direct sums, in- 
ductive limits, and direct products formed 
from the spaces. (For direct products of high 
power of bornological spaces, unsolved prob- 
lems still exist concerning the inheritance of 
properties.) Quotient spaces of (F)-spaces are 
(F)-spaces, but quotient spaces of general 
complete spaces are not necessarily complete. 
There are examples of a Montel (F)-space 
whose quotient space is not reflexive and a 
Montel (DF)-space whose closed subspace is 
neither a Mackey space nor a (DF)-space. The 
property of being a Schwartz space or a nu- 
clear space is inherited by the completions, 
subspaces, quotient spaces of closed subspaces, 
direct products, projective limits, direct sums 
of countable families, and inductive limits of 
countable families formed from such spaces. 
Tensor products of nuclear spaces are nuclear 
spaces. Y. KOmura gave an example of a non- 
complete space that is quasicomplete, borno- 
logical, and nuclear (and hence a Montel 
space). 


X. The Open Mapping Theorem and the 
Closed Graph Theorem 


Let E and F be topological linear spaces. The 
statement that every continuous linear map- 
ping of E onto F ts open is called the open 
mapping theorem (or homomorphism theorem), 
and the statement that every linear mapping of 
F into E is continuous if its graph is closed in 
F x E is called the closed graph theorem. These 
theorems hold if both E and F are complete 
and metrizable (S. Banach). 

A locally convex space is said to be B- 
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complete (or fully complete) if a subspace C of 
E is weak*-closed whenever CN V° is weak*- 
closed for every neighborhood V of 0 in E. (F)- 
spaces and the dual spaces of reflexive (F)- 
spaces are B-complete. B-complete spaces are 
complete, and closed subspaces and quotient 
spaces by closed subspaces of B-complete 
spaces are B-complete. If E is B-complete and 
F is barreled, then the open mapping theorem 
and the closed graph theorem hold (V. Ptak). 

Both theorems hold also if F is ultraborno- 
logical and E is a locally convex space ob- 
tained from a family of (F)-spaces after a finite 
number of operations of taking closed sub- 
spaces, quotient spaces by closed spaces, direct 
products of countable families, projective 
liinits of countable families, direct sums of 
countable families, and inductive limits of 
countable families. This was conjectured by 
Grothendieck and proved by W. Słowikowski 
(1961) and D. A. Raikov. Later, L. Schwartz, 
A. Martineau, M. De Wilde, W. Robertson, 
and M. Nakamura simplified the proof and 
enlarged the class of spaces E [15]. 

(LF)-spaces, the dual spaces of Schwartz (F)- 
spaces, and the space 2’ of distributions are 
examples of spaces E described in the previous 
paragraph. 


Y. Nonlocally Convex Spaces 


The space L, for 0 « p «1 shows that non- 


. locally convex spaces are meaningful in func- 


tional analysis. Recently, the Banach-Steinhaus 
theorem, closed graph theorems, etc. have 
been investigated for nonlocally convex topo- 
logical linear spaces [13]. 


Z. Diagram of Topological Linear Spaces 


The spaces in Fig. 1 are all locally convex 
spaces over the real number field or the com- 
plex number field and satisfy the separation 
axiom T,. The notation 4 B means that 
spaces with property A have property B. Main 
properties of dual spaces are listed in Table 1. 
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A. Introduction 


Convergence and continuity, as well as the 
algebraic operations on real numbers, are 
fundamental notions in analysis. In an abstract 
space too, it is possible to provide an ad- 
ditional structure so that convergence and 
continuity can be defined and a theory analo- 
gous to classical analysis can be developed. 
Such a structure is called a topological struc- 
ture (for a precise definition, — Section B). 
There are several ways of giving a topology to 
a space. One method is to axiomatize the 
notion of convergence (M. Fréchet [1], 1906; 
— 87 Convergence). However, defining a 
topology in terms of either a neighborhood 
system (due to F. Hausdorff [3], 1914), a clo- 
sure operation (due to C. Kuratowski, Fund. 
Math., 3 (1922)), or a family of open sets is 
more common. 


B. Definition of a Topology 


Let X be a set. A neighborhood system for X is 
a function U that assigns to each point x of X, 
a family U(x) of subsets of X subject to the 
following axioms (U): 
(1) x€U for each U in U(x). 
(2) If U,, U,E U(x), then U, O U; elM(x). 
(3) If U e U(x) and U c V, then Ve U(x). 
(4) For each U in U(x), there is a member W of 
U(x) such that U e u(y) for each y in W. 

A system of open sets for a set X 1s a family 
© of subsets of X satisfying the following 
axioms (O): 
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(0 X, Get 
(2) If 0,,0, €, then O0,00,c9. 
(3) If O,eD (AeA), then (let, 

A system of closed sets for a space X isa 
family $5 of subsets of X satisfying the follow- 
ing axioms (F): 

(1) X, eg. 
(2) If Fi, F,e%, then FUF eÑ. 
(3) If F,e% (AeA), then (\,., F; e D. 

A closure operator for a space X is a func- 
tion that assigns to each subset A of X,a 
subset A^ of X satisfying the following axioms 
(C): 

(1) 8° = B. 

(2) (AU BF = A*U B°. 
(3) Ac A*. 

(4) A* 2 A"*. 

An interior operator for a space X is a func- 
tion that assigns to each subset A of X a 
subset A! of X satisfying the following axioms 
(D: 

(1) Ki X. 

(2) (An By « A' B'. 
(3) ACA. 

(4) A! — Al 

Any one of these five structures for a set X, 
i.e., a structure satisfying any one of (U), (O), 
(F), (C), or (D, determines the four other struc- 
tures in a natural way. For instance, assume 
that a system of open sets O satisfying (O) is 
given. In this case, each member of © is called 
an open set. A subset U of X is called a neigh- 
borhood of a point x in X provided that there 
is an open set O such that xeO c U. If U(x) is 
the family of all neighborhoods of x, the func- 
tion x U(x) satisfies (U). The complement of 
an open set in X is called a closed set. The 
family % of all closed sets satisfies (F). Given a 
subset A of X, the intersection A^ of the family 
of all closed sets containing A is called the 
closure of A, and each point of A" is called an 
adherent point of A. The closure A“ is the 
smallest closed set containing A, and the func- 
tion A— A^ satisfies (A). The closure A^ is 
also denoted by A or Cl A. Dually, there is a 
largest open subset A! of A. The set Ai (also 
denoted by A? or Int A) is called the interior of 
A, and each point of A? is called an interior 
point of A. The closure and interior are related 
by 4? — X —(X — A) and A— X — (X — Ay. 
The correspondence A A° satisfies (I). Con- 
versely, open sets can be characterized vari- 
ously as follows: 


A is open <+ Ae U(x) for each x in A 


—X-—AeSs 
<>(X—A)=X—A 
« A? — A. 


When a structure satisfying (U), (F), (C), or 
(I) is given, one of the four characterizations of 
open sets can be used to define a system of 
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open sets satisfying (O) and hence the other 
structure. 

A topological structure or simply a topology 
for a space X is any of these five structures for 
X. If two topologies t, and t, for X give rise 
to identical systems of open sets, then t, and c, 
are considered to be identical. For this reason 
“topology” frequently means simply "system of 
open sets" in the literature. A topological space 
is a set X provided with a topology t and is 
denoted by (X,t) or simply X when there is no 
ambiguity. 


C. Examples 


(1) Discrete Topology. Let X be a set, and let 
the system O of open sets be the family of all 
subsets of X. The resulting topology is called 
the discrete topology, and X with the discrete 
topology is a discrete topological space. In this 
space, A = A4? = A for each subset A, and A isa 
neighborhood of each of its points. 


(2) Trivial Topology. The trivial (or indiscrete) 
topology for a set X is defined by the system of 
open sets which consists of X and @ only. If 
AZ X, then 4? = Ø, and if Az Ø, then A — X. 
Each point of X has only one neighborhood, 
X itself. 


(3) Metric Topology. Let (X, p) be a *metric 
space, i.e., a set X provided with a *metric p. 
For a positive number e, the -neighborhood 
of a point x is defined to be the set U,(x) — 
iylye X, p(x, y) « sj. Let U(x) be the family 

of all sets V such that U,(x)< V for some e; 
then the assignment x — U(x) satisfies (U) and 
hence defines a topology. This topology is the 
metric topology for the metric space (X, p). 


(4) Order Topology. Let X be a set *linearly 
ordered by <. For each point x in X, let U(x) 
be the family of all subsets U such that xe 

i yJa «y «bj cU for some a,b. The function 
x—>U(x) satisfies (U) and defines the order 
topology for the linearly ordered set X. 


(5) Convergence and Topology. We can define 
the notion of convergence in a topological 
space, and conversely we can define a topology 
using convergence as a primitive notion (— 87 
Convergence). In particular, for a metric space, 
the metric topology can be defined in terms of 
convergent sequences (— 273 Metric Spaces). 


D. Generalized Topological Spaces 


When a space X is equipped with a closure 
operator that does not satisfy all of (C), the 
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space is called a generalized topological space 
by some authors. Topological implications of 
each axiom in (C) have been investigated for 
such spaces. 


E. Local Bases 


Let X be a topological space, and let x be a 
point of X. A collection U(x) of neighbor- 
hoods of x is called a base for the neighborhood 
system (fundamental system of neighborhoods 
of a point x or local base at x) if each neigh- 
borhood of x contains a member of lH, (x). Let 
{Wo(x)|xeX} be a system of local bases; then 
the system has the following properties (U,): 
(1) For each V in W(x) xeV c X. 

(2) If Vj, V; ello(x), then there is a V3 in U(x) 
such that V, c V, N V. 

(3) For each V in W(x), there exists a Wc V 
in U(x) such that for each y in W, V contains 
some member of Har 

Conversely, suppose that (1£5(x)| xe Xj is 
a system satisfying (Ug). For each x in X, 
let U(x) consist of all subsets V of X such 
that V >U for some U in U(x). Then the sys- 
tem fM(x)| x e X) satisfies (U) and therefore 
defines a topology for X. This topology is 
called the topology determined by the system 
(Ms G)IxeX;1. 

For instance, in a metric space X, the set of 
é-neighborhoods of x(s 0) is a local base at x 
with respect to the metric topology. In an 
arbitrary topological space, the collection of 
all open sets containing x, i.e., the open neigh- 
borhoods of x, is a local base at x. 

Two systems satisfying (U,) are called 
equivalent if they determine the same topology. 
For systems {Up(x)|xeX} and (8,(x)| xe X] 
to be equivalent it is necessary and sufficient 
that for each x in X each member of U,(x) 
contain a member of %,(x) and each member 
of B(x) contain a member of 1f, (x). 

Sometimes the word "neighborhood" stands 
for a member of a local base or for an open 
neighborhood. However, this convention is 
not used here. 


F. Bases and Subbases 


A family ©, of open sets of a topological space 

X is called a base for the topology (base for the 

space, or open base) if each open set is the 

union of a subfamily of Dp. A base Dg for the 

topology of a topological space X has the 

following properties (Op): 

(DU 9s- X. 

(2) If W,, W€ Og and xe W, N W,, then there is 

a W, in O, such that xe W; CW, N Wy. 
Conversely, if a family ©, of subsets of a set 
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X satisfies (Oy), then ©, is a base for a unique 
topology. A member of ©, is called a basic 
open set. 

A family Oo, of open sets of a topological 
space X is a subbase for the topology (or sub- 
base for the space) if the family of all finite 
intersections of members of Dog is a base for 
the topology. If Doo a subbase for the topol- 
ogy of a topological space X, then | } Doo — X. 
Conversely, if Do, is a family of subsets of a 
set X such that | J Ooo = X, then the family of 
all finite intersections of members of Dgo is a 
base for a unique topology t. A subset of X ts 
open for z if and only if it is the union of a 
family of finite intersections of members of 
Dan, The system of open sets relative to t is 
said to be generated by the family oo. Thus 
any family of sets defines a topology for its 
union. 

A set § of subsets of a topological space is 
called a network if for each point x and its 
neighborhood U there is a member Fe such 
that xe F c U (A. V. Arkhangel’skii, 1959). If 
all Fe & are required to be open, the network 
iy is exactly an open base. 


G. Continuous Mappings 


A mapping f on a topological space X into a 
topological space Y is called continuous at a 
point a of X if it satisfies one of the following 
equivalent conditions: 

(1) For each neighborhood V of f(a), there is a 
neighborhood U of a such that f(U) c V. (1) 
For each neighborhood V of f(a), the inverse 
image f ! (V) is a neighborhood of a. 

(2) For an arbitrary subset A of X such that 
ae A. f(a)e f(A). 

Continuity can also be defined in terms of 
convergence (— 87 Convergence). 

If f is continuous at each point of X, f is 
said to be continuous. Continuity of f is equiv- 
alent to each of the following conditions: 

(1) For each open subset O of Y, the inverse 
image f ^! (O) is open in X. 

(I) The inverse image under f of each member 
of a subbase for the topology of Y is open in 
X. 

(2) For each closed subset F of Y, the inverse 
image f !(F) is closed. 

(3) For each subset A of X, f(A)c f(A). 

The image f(X) of X under a continuous 
mapping f is called a continuous image of X. 
Let X, Y, and Z be topological spaces, and let 
f: X 5 Y and g: YO Z be mappings. If f is con- 
tinuous at a point a of X and g is continuous 
at f(a), then the composite mapping go f: X 
>Z is continuous at the point a. Hence if f 
and g are continuous, so is go f. 

When a continuous mapping f: X > Y is 
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bijective and f~} is continuous, the mapping 
f is called a homeomorphism (named by H. 
Poincaré, 1895) or topological mapping. Two 
topological spaces X and Y are homeomorphic, 
X x Y, if there is a homeomorphism f: X > Y. 

The relation of being homeomorphic is an 
*equivalence relation. A property which, when 
held by a topological space, is also held by 
each space homeomorphic to it is a topological 
property or topological invariant. The problem 
of deciding whether or not given spaces are 
homeomorphic is called the homeomorphism 
problem. 

A mapping f: X — Y is called open (resp. 
closed) if the image under f of each open (resp. 
closed) subset of X ts open (closed) in Y. A 
continuous bijection that is either open or 
closed is a homeomorphism. 

A continuous surjection f: X > Y is called a 
quotient mapping if U c Y is open whenever 
f (U)is open (— Section L). If moreover 
f|f ^ (S) is quotient for each Sc Y as a map- 
ping from the subspace (— Section J) f^! (S) 
onto the subspace S, then f is called a hered- 
itarily quotient mapping. Open or closed con- 
tinuous mappings are hereditarily quotient 
mappings. 


H. Comparison of Topologies 


When a set X is provided with two topologies 
1, and t, and the identity mapping: (X, 1,) 
—(X, t) is continuous, the topology t, is said 
to be stronger (larger or finer) than the topol- 
Ogy 15, T is said to be weaker (smaller or 
coarser) than 7,, and the notation t, È t; or 
t; € t, is used. Let O;, %;, and a; be the 
system of open sets, system of closed sets, 
neighborhood system, and closure opera- 
tion for X relative to the topology 1; (i= 1,2), 
respectively. Then each of the following is 
equivalent to the statement 1, 2 t5: 

(15 213. 

Q) 8,285. 

(3) For each x in X, 15, (x) 2 M, (x). 

(4) A": c A® for each subset A of X. 

Let S be the family of all topologies for X. 
Then S is ordered by the relation >. The 
discrete topology is the strongest topology for 
X. If (1,| 4e A] is a subfamily of S, then among 
the topologies stronger than each r;, there is a 
weakest one t, —2supit,| 4€ Aj. Similarly, 
among the topologies weaker than each r;, 
there is a strongest one t, — infíz; | 4e A]. In 
fact, let O, be the family of all open sets rela- 
tive to t,; then the system of open sets for t, ts 
generated by | ]zea Ox, and the system of open 
sets for t is precisely (La O;. The family S is 
therefore a fcomplete lattice. 
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I. Induced Topology 


Let f be a mapping from a set X into a topo- 
logical space Y. Then the family | f£. ! (O)|O is 
open in Y | satisfies axioms (O) and defines a 
topology for X. This topology is called the 
topology induced by f (or simply induced topol- 
ogy), and it is characterized as the weakest 

one among the topologies for X relative to 
which the mapping f is continuous. 


J. Subspaces 


Let (X, x) be a topological space and M be a 
subset of X. The topology for M induced by 
the inclusion mapping f: M X, i.e., the 
mapping f defined by f(x) 2 x for each x in M, 
is called the relativization of t to M or the 
relative topology. The set M provided with the 
relative topology is called a subspace of the 
topological space (X, x). Topological terms, 
when applied to a subspace, are frequently 
preceded by the adjective "relative" to avoid 
ambiguity. Thus a relative neighborhood of a 
point x in M is a set of the form M N U, where 
U is a neighborhood of x in X. A relatively 
open (relatively closed) set in M is a set of the 
form MNA, where A is open (closed) in X. For 
a subset A of M, the relative closure of A in M 
is M N A, where A is the closure of A in X. A 
mapping f: X — Y is called an embedding if f is 
a homeomorphism from X to the subspace 
f(X), and in this case X is said to be embedded 
into Y. A property P is said to hold locally on 
a topological space X if each point x of X has 
a neighborhood U such that the property P 
holds on the subspace U. A subset A of X is 
locally closed if for each point x of X, there 
exists a neighborhood V of x such that VN A is 
relatively closed in V. A subset of X is locally 
closed if and only if it can be represented as 
ON F, where O is open and F is closed in X. 


K. Product Spaces 


Let X be a set, and for each member 2 of an 
index set A, let f, be a mapping of X into a 
topological space X,. Then there is a weakest 
topology for X that makes each f, continuous. 
In fact, this topology is sup{t,}, where qt, is the 
topology for X induced by f,. In particular, let 
LX; | 4€ Aj be a family of topological spaces, 
and let X be the Cartesian product Iisa X;. 
Then the weakest topology for X such that 
each projection pr; : X — X, is continuous is 
called the product topology or weak topology. 
The Cartesian product [T;.4 X; equipped with 
the product topology is called the product 
topological space or simply the product space 
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or direct product of the family | X;| 2€ Aj of 
topological spaces. If O is the family of all 
open subsets of X,, the union | J; pr;  (2;) 
is a subbase for the product topology. If 
x={x,} is a point of X, then sets of the type 
( Yi pr; (U= Taiga.. Xa XU, X.. X U, 
form a local base at x for the product to- 
pology, where 4,,...,4,€ A and U; is a neigh- 
borhood of Xj. Each projection pr, X >X, is 
continuous and open, and a mapping f from a 
topological space Y into the product space 
II, X; is continuous if and only if pr; o f: Y 
— X, is continuous for each å. Given a family 
{f} of continuous mappings f}: X; Y, the 
product mapping IT, f}: [1; X; IL, Y; is con- 
tinuous with respect to the product topologies. 
For the Cartesian product [T; X; of a family 
(X;|4€ ^] of topological spaces, there is an- 
other topology called the box topology (or 
strong topology). À base for the box topology 
is the family of all sets TT. O;, where O, is open 
in X; for each 4. For a point x= (x;j, the 
family of all sets of the form [], U, is a local 
base at x relative to the box topology, where 
U, is a neighborhood of x; for each 4. With 
respect to the box topology, each projection 
pri: I; X; X, is continuous and open, and 
the product mapping II/;: II; X; IL, Y, ofa 
family { f,} of continuous mappings f: X; ^ Y, 
is continuous. For a finite product of topolog- 
ical spaces, the product topology agrees with 
the box topology, but for an arbitrary product 
the product topology is weaker than the box 
topology. For the Cartesian product of topo- 
logical spaces the usual topology considered is 
the product topology rather than the box 
topology. 


L. Quotient Spaces 


Let f be a mapping of a topological space 

X onto a set Y. The quotient topology for Y 
(relative to the mapping f) is the strongest 
topology for Y such that f is continuous. A 
subset O of Y is open relative to the quotient 
topology if and only if f~'(O) is open. Given 
an equivalence relation — on a topological 
space X, the *quotient set Y= X/~ provided 
with the quotient topology relative to the ` 
projection o: X > Y is called the quotient topo- 
logical space (or simply quotient space). A 
mapping f from the quotient space Y= X/— 
into a topological space is continuous if and 
only if f o o is continuous. 

A partition of a space X is a family {A,|A€ 
A! of pairwise disjoint subsets of X such that 
LJ; A;7 X. A partition {A,} of a topological 
space X determines an equivalence relation ~ 
on X such that the family (4,7 is precisely 
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the family of all equivalence classes under ~, 
and therefore the partition determines the 
quotient space Y= X/~. This space is called 
the identification space of X by the given par- 
tition. Each member A, of the partition can 
be regarded as a point of Y, and the projec- 
tion o: X — Y satisfies o(x) — A; whenever xe 
A,. A partition | 4;| 2€ A] of a topological 
space is called upper semicontinuous if for 
each A, and each open set U containing A,, 
there is an open set V such that A, CV CU, 
and V is the union of members of ( A;| Ae Aj. 
A partition ( 4;|4€Aj is upper semicontinu- 
ous if and only if the projection o; X +> Y= 
{A,|AeA} is a closed mapping. 


M. Topological Sums 


Let X be a set, and for each member 4 of an 
index set A, let f, be a mapping of a topo- 
logical space X; to X. Then the family {Oc 

X | f; ! (O) is open for any 4] satisfies the 
axioms of the open sets. This topology t is 
characterized as the strongest one for X that 
makes each f, continuous. A mapping g on X 
with t to a topological space Y is continuous if 
and only if go f; : X,— Y is continuous for each 
4€ ^. The simplest is the case where X is the 
disjoint union of X, and f, is the inclusion 
mapping. Then we call the topological space X 
the direct sum or the topological sum of ( X;j 
and denote it by @X, or LLX,. More gener- 
ally let the set X be the union of topological 
spaces {X,},-, such that for each 4 and ue A 
the relative topologies of X; X, from X; and 
X, coincide. Then we call the topology t the 
weak topology with respect to {X,}. If X; X, 
is closed (resp. open) in X, for any u, then X; 
is closed (resp. open) in X and the original 
topology of X, coincides with the relative 
topology. If, moreover, for each subset I of A, 
F —| Jj r X, is closed and the weak topology 
of F with respect to {X,},., coincides with the 
relative topology induced by t, then X with t 
is said to have the hereditarily weak topology 
with respect to {X,} (or to be dominated by 
{X,}). A topological space has the hereditarily 
weak topology with respect to any locally 
finite closed covering, and every CW-complex 
(— 70 Complexes) has the hereditarily weak 
topology with respect to the covering of all 
finite subcomplexes. 

When | X,j is an increasing sequence of 
topological spaces such that each X, is a sub- 
space of X,,,,, then the union X =| ) X, with 
the weak topology is called the inductive limit 
of 1 X,j and is denoted by lim X,. Each X, may 
again be regarded as a subspace of X. 
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N. Baire Spaces 


For a subset A of a topological space X, the 
set X — A is called the exterior of A, and the set 
AN X —A is called the boundary of A, de- 
noted by Bd A, Fr A, or 6A. A point belonging 
to the exterior (boundary) of A is an exterior 
point (boundary point or frontier point) of A. If 
the closure of A is X, then A ts said to be dense 
in X. When X — A is dense in X, Le, when the 
interior of A is empty, A is called a boundary 
set (or border set), and if the closure A is a 
boundary set, A is said to be nowhere dense. 
The union of a countable family of nowhere 
dense sets is called a set of the first category 
(or meager set). A set that is not of the first 
category is called a set of the second category 
(or nonmeager set). The complement of a set of 
the first category is called a residual set. In the 
space R of real numbers, the set Q of all ra- 
tional numbers ts of the first category, and the 
set R — Q of all irrational numbers is of the 
second category. Both Q and R — Q are dense 
in X and hence are boundary sets. The union 
of a finite family of nowhere dense sets is no- 
where dense, and the union of a countable 
family of sets of the first category is also of the 
first category. A subset A of X is nowhere 
dense in X if and only if for each open set O, 
ONA is not dense in O. 

A topological space X is called a Baire space 
(Baire, 1899) if each subset of X of the first 
category has an empty interior. Each of the 
following conditions is necessary and sufficient 
for a space X to be a Baire space: 

(1) Each nonempty open subset of X is of the 
second category. 

(2) If F,, Fy, ... is a sequence of closed subsets 
of X such that the union UI, F, has an inte- 
rior point, then at least one F, has an interior 
point. 

(3) If 0,,0,, ... is a sequence of dense open 
subsets of X, then the intersection ( );, O, is 
dense in X. 

An open subset of a Baire space is a Baire 
space for the relative topology. A topological 
space that is homeomorphic to a complete 
metric space (— 436 Uniform Spaces I) is a 
Baire space (Baire-Hausdorff theorem). A 
locally compact Hausdorff space (— Section 
V) is also a Baire space. The class of Cech- 
complete completely regular spaces (— Section 
T) includes both of these spaces, but there are 
also Baire spaces that are not in the class. A 
subset A of a topological space is said to sat- 
isfy Baire’s condition or to have the Baire 
property if there exist an open set O and sets 
P,, P, of the first category such that A = 
(OUP,)~—P,. A ‘Borel set satisfies Baire’s 
condition. 
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O. Accumulation Points 


A point x is called an accumulation point, or a 
cluster point of a subset A of a topological 
space X if xe A—{x}. The set of all accumula- 
tion points of a set A is called the derived set 
of A and is denoted by A’ or A4. A point x 
belongs to A’ if and only if each neighborhood 
of x contains a point of A other than x itself. A 
point belonging to the set A*= A — A' is called 
an isolated point of A, and a set A consisting of 
isolated points only, i.e., A = A7. is said to be 
discrete. If each nonempty subset of A contains 
an isolated point, then A is said to be scat- 
tered; and if A does not possess an isolated 
point, Le., A c A', then A is said to be dense in 
itself. The largest subset of A which is dense in 
itself is called the kernel of A. If A= A’, then A 
is called a perfect set. 

If x is an accumulation point of A, then for 
each neighborhood U of x, UN(A- (x1) 
Furthermore, it is possible to classify an ac- 
cumulation point of A according to the *car- 
dinality of U N (A — (x]). A point x is called a 
condensation point of a set 4 if for each neigh- 
borhood U of x, the set UMA 1s uncountable. 
A point x is a complete accumulation point of 
A if for each neighborhood U of x, the set 
UNA has the same cardinality as A. 


P. Countability Axioms 


A topological space X satisfies the first count- 
ability axiom if each point x of X has a coun- 
table local base (F. Hausdorff [3]). Metric 
spaces satisfy the first countability axiom. In 
fact, the family of (1/n)-neighborhoods (n= 
1,2, ...) of a point is a local base of the point. 
The topology of a topological space that 
satisfies the first countability axiom is com- 
pletely determined by convergent sequences. 
For instance, the closure of a subset A of such 
a space consists of all limits of sequences in A 
(— 87 Convergence). A topological space X is 
said to satisfy the second countability axiom or 
to be perfectly separable if there is a countable 
base for the topology. *Euclidean spaces satisfy 
the second countability axiom. If X contains a 
countable dense subset, X is said to be sepa- 
rable. A space that satisfies the second count- 
ability axiom satisfies the first and is also a 
separable Lindelóf space (— Section S). How- 
ever, the converse is not true. Each of the 
following properties is independent of the 
others: separability, the first countability 
axiom, and the Lindelóf property. If a metric 
space is separable, then it satisfies the second 
countability axiom. There are metric spaces 
that are not separable. 
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Q. Separation Axioms 


Topological spaces that are commonly en- 
countered usually satisfy some of the following 
separation axioms. 

(al Kolmogorov's axiom. For each pair of 
distinct points, there is a neighborhood of one 
point of the pair that does not contain the 
other. 

(T,) The first separation axiom or Fréchet's 
axiom. For each pair x, y of distinct points, 
there are neighborhoods U of x and V of y 
such that x é V and y £U. 

Axiom (Tj) can be restated as follows: 

(Ti) For each point x of the space, the sin- 
gleton {x} is closed. 

(T,) The second separation axiom or Haus- 
dorff's axiom [3]. For each pair x, y of dis- 
tinct points of the space X, there exist disjoint 
neighborhoods of x and y. 

Axiom (T,) is equivalent to the following: 

(T5) In the product space X x X the diago- 
nal set A is closed. 

(T3) The third separation axiom or Vietoris's 
axiom (Monatsh. Math. Phys., 31 (1921)). 
Given a point x and a subset A such that x¢ A, 
there exist disjoint open sets O, and O, such 
that x eO, and A c O,. (In this case, the sets 
{x} and A are said to be separated by open 
sets.) 

Axiom (T) can be restated as (T5) or (T3): 

(T3) For each point x of the space, there is a 
local base at x consisting of closed neighbor- 
hoods of x. 

(T3) An arbitrary closed set and a point not 
belonging to it can be separated by open sets. 

(T4) The fourth separation axiom or Tietze's 
first axiom (Math. Ann., 88 (1923)). Two dis- 
joint closed sets F, and F, can be separated by 
open sets, i.e., there exist disjoint open sets O, 
and O, such that F, cO, and F;cO,. 

(T5) Tietze's second axiom. Whenever two 
subsets A, and A, satisfy 4,145 — 4,0 4, = 
Ø, A, and A, can be separated by open sets. 

It is easily seen that (T5) — (T4), (T5) and 
(T3) = (T2), (T4) and (T,) = (T3). Axiom (T) is 
equivalent to each of (T2) and (T2): 

(T;) Whenever F, and F, are disjoint closed 
subsets, there exists a continuous function f on 
the space into the interval [0, 1] such that f is 
identically 0 on F} and 1 on F;. 

(T2) Each real-valued continuous function 
defined on a closed subspace can be extended 
to a real-valued continuous function on the 
entire space. 

The implications (T4) = (T4) and (T4) > (T2) 
are known as Uryson's lemma (Math. Ann., 

94 (1925)) and the Tietze extension theorem 
(J. Reine Angew. Math., 145 (1915)), respec- 
tively. In addition, there are two more related 
axioms: 
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(T3,) Tikhonov's separation axiom. For each 
closed subset F and each point x not in F, 
there is a real-valued continuous function f on 
the space such that f(x) - 0 and f is identically 
| on F. 

(T3) (N. Vedenisov). For each closed subset 
F, there is a real-valued continuous function f 
on the space such that F = {x| f(x) =0}. 

Axioms (T;) and (T,) are equivalent to the 
following (T5) and (T,), respectively: 

(T5) Each subspace satisfies (T4) 

(T) X satisfies (T4) and each closed set is a 
*G;-set. 

The following implications are valid: (T3:) > 
(T3), (Tz) > (T3), (T4) and (T,) (Ty). 

Table 1 gives a classification of topological 
spaces by the separation axioms. Each line 
represents a special case of the preceding line. 

A *metrizable space is perfectly normal, but 
the converse is false (for metrization theorems 
— 273 Metric Spaces). Among the spaces 
satisfying the second countability axiom, 
regular spaces are normal (Tikhonov's theo- 
rem, Math. Ann., 95 (1925)) and metrizable 
(Tikhonov-Uryson theorem; P. Urvson, Math. 
Ann., 94 (1925)). 

Table 2 shows whether various topological 
properties are preserved in subspaces, product 
spaces, and quotient spaces. The topological 
properties considered are T,, T, = Hausdorff, 
T, =regular, CR = completely regular, T, = 
normal, T; — completely normal, M = metriz- 
able, C, — first axiom of countability, Cj = 
second axiom of countability, C 2 compact, 
S=separable, and L — Lindelóf. Each position 
is filled with © or x according as the prop- 
erty (say, P) listed at the head of the column 
is preserved or not in the sort of space listed 
on the left obtained from space(s) all having 
property P. 


R. Coverings 


A family M= | M; ,.4 of subsets of a set X is 
called a covering of a subset A of X if Ac 
(J; M,. If M is finite (countable), it is called 
a finite covering (countable covering). An open 
(closed) covering is a covering consisting of 
open (closed) sets. 

A family Wè of subsets of a topological space 
X is said to be locally finite if for each point x 
of X, there is a neighborhood of x which inter- 
sects only a finite number of members of W. If 
moreover {M,},-, is disjoint, then M is called 
discrete. Wè is called star-finite if each mem- 
ber of M intersects only a finite number of 
members of Mt. A o-locally finite or c-discrete 
family of subsets of X is respectively the union 
of a countable number of locally finite or dis- 
crete families of subsets of X. A covering M 
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Table 1. Separation Axioms 
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Axioms Spaces Satisfying the Axioms 

(To) T,-space (Kolmogorov space) 

(T,) T,-space (Kuratowski space) 

(T) T,-space (Hausdorff space, separated space) 
(To) and (T3) T,-space (regular space) 

(T;) and (T4) Completely regular space (Tikhonov space) 


(T,) and (T;) 
(T,) and (Ts) 
(T) and (T6) 


T;-space (normal space) 
T;-space (completely normal space, hereditarily normal space) 
T,-space (perfectly normal space) 





Table 2. Topological Properties and Spaces 


- 


Space T 


| T1; T, CR 
Subspace O O O O 
Closed subspace O O O O 
Open subspace O O © O 
Product O O O O 
Countable product © O O O 
Quotient space x x x x 


is called point-finite if each infinite number 

of members of IR has an empty intersection. A 
covering I is a refinement of a covering Yt 
(written Wi< 90) if each member of W is con- 
tained in a member of 9t, The order of the 
covering W is the least integer r such that any 
subfamily of M consisting of r+ 1 members 
has an empty intersection. 

Let M be a covering of X, and let A bea 
subset of X. The star of A relative to Wt, de- 
noted by S(A, M), is the union of all members 
of M whose intersection with A is nonempty. 
Let MÅ denote the family {S({x}, M)} xex and 
207 the family {S(M, W); a. Then Wt and 
M* are coverings of X, and 9t -« 900^ — 002 < 
9^5. A covering W is a star refinement of a 
covering 9t if M* <N, and WM is a barycentric 
refinement (or A-refinement) of 9t if 99t^ — 9t. 

A sequence 9Jt,, Wt,,... of open coverings of 
a topological space ts called a normal sequence 
if Må, <M, forn=1,2,...,and an open 
covering 90i is said to be a normal covering if 
there is a normal sequence W, M, ... such 
that M, <M. The support (or carrier) of a 
real-valued function f on a topological 
space X is defined to be the closure of the set 
{xl f(x) 40}. Let { ftaa be a family of con- 
tinuous nonnegative real-valued functions on 
a topological space X, and for each « in A, let 
C, be the support of f,. The family { fj... is 
called a partition of unity if the family TC. 
is locally finite and X, f,(x) 2 1 for each x in X. 
If the covering (C,1,., is a refinement of a 
covering W, the family { f,},.4 is called a par- 
tition of unity subordinate to the covering Wi. A 
partition of unity subordinate to a covering W 
exists only if M is a normal covering (— Sec- 
tion X). If p is a continuous tpseudometric on 
a T,-space X, then define a covering M, for 


XXXXOX 





O O O O x x x 
O O O O O X O 
O O O O x O x 
x x x x O x x 
Xx O O O O O X 
x x x X O O O 


each natural number n by M,={U(x;2 DI, e 
where U(x; 6)={y| p(x, virt, Then the se- 
quence M, M, ... is a normal sequence of 
open coverings. Conversely, given a normal 
sequence WM, 9UL,, ... of open coverings of X, 
there exists a continuous pseudometric p such 
that p(x, y) z2 ^" whenever xe S( y, 99i,), and 
p(x, y)z2 "^! whenever x éS(y, M,). If in 
addition for each x the family (S(x, Mi,)|n= 
1,2,...} is a local base at x, then the metric 
topology of p agrees with the topology of X. 


S. Compactness 


If each open covering of a topological space X 
admits a finite open covering as its refinement, 
the space X ts called compact; if each open 
covering of X admits a countable open refine- 
ment, X is said to be a Lindelóf space (P. Alek- 
sandrov and P. Uryson, Verh. Akad. Wetensch., 
Amsterdam, 19 (1929); if each open covering 
of X admits a locally finite open refinement, X 
is called paracompact (J. Dieudonné, J. Math. 
Pures Appl., 23 (1944)); and if each open cover- 
ing of X admits a star-finite open refinement, 
X 1s said to be strongly paracompact (C. H. 
Dowker, Amer. J. Math., 69 (1947)) or to have 
the star-finite property (K. Morita, Math. 
Japonicae, 1 (1948)). The space X is compact 
(Lindelöf) if for each open covering M of X, 
there is a finite (countable) subfamily of M 
whose union is X. 

The following properties for a topological 
space X are equivalent: (1) The space X is 
compact. (2) If a family {F,},., of closed sub- 
sets of X has the finite intersection property, 
i.e., each finite subfamily of {F,},., has non- 
empty intersection, then (\, F, # Ø. (3) Each 
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infinite subset of X has a complete accumu- 
lation point. (4) Each *net has a convergent 
*subnet. (5) Each funiversal net and each 
*ultrafilter converge. 

If a subset A of X is compact for the relative 
topology, A is called a compact subset. A 
subset A of X is said to be relatively compact if 
the closure of A in X is a compact subset. A 
closed subset of a compact topological space is 
compact, and a compact subset of a Hausdorff 
space is closed. A continuous image of a com- 
pact space is compact, each continuous map- 
ping of a compact space into a Hausdorff space 
is a closed mapping, and a continuous bijec- 
tion of a compact space onto a Hausdorff 
space is a homeomorphism. The product space 
of a family {X,},., of topological spaces is 
compact if and only if each factor space is 
compact (Tikhonov's product theorem, Math. 
Ann., 102 (1930)). A compact Hausdorff space 
is normal. A compact Hausdorff space is 
metrizable if and only if it satisfies the second 
countability axiom. A metric space or a tuni- 
form space is compact if and only if it is 
*totally bounded and *complete. A subset of a 
Euclidean space 1s compact if and only if it is 
closed and bounded. In a discrete space only 
finite subsets are compact. The cardinality of a 
compact Hausdorff space with the first count- 
ability axiom cannot exceed the power of the 
continuum (Arkhangel’skii). 

There are a number of conditions related to 
compactness. A topological space is sequenti- 
ally compact if each sequence in X has a con- 
vergent subsequence. A space X is countably 
compact (M. Fréchet (1]) if each countable 
open covering of X contains a finite subfamily 
that covers X. A space X is pseudocompact (E. 
Hewitt, 1948) if each continuous real-valued 
function on X is bounded. Some authors use 
compact and bicompact for what we call coun- 
tably compact and compact, respectively. N. 
Bourbaki [9] uses compact and quasicompact 
instead of compact Hausdorff and compact, 
respectively. A T,-space is countably compact 
if and only if each infinite set possesses an 
accumulation point. If X is countably com- 
pact, then X is pseudocompact, and if X is 
normal, the converse also holds. If a tcomplete 
uniform space is pseudocompact, then it is 
compact. A space satisfying the second counta- 
bility axiom is compact if and only if it is 
sequentially compact. If X is sequentially 
compact, then X is countably compact, and if 
X satisfies the first countability axiom, the 
converse is true. 


T. Compactification 


A compactification of a topological space X 
consists of a compact space Y and a homeo- 
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morphism of X onto a dense subspace X, of Y. 
We can always regard X as a dense subspace 
of a compactification Y. If X is completely 
regular, then there is a Hausdorff compac- 
tification Y such that each bounded real- 
valued continuous function on X can be 
extended continuously to Y. Such a compacti- 
fication is unique up to homeomorphism; it is 
called the Stone-Cech compactification of X 
(E. Cech, Ann. Math., 38 (1937); M. H. Stone, 
Trans. Amer. Math. Soc., 41 (1937)) and is 
denoted by BOL Let TC: be the set of all 
continuous functions on a completely regular 
space X into the closed interval J = [0, 1]. 
Then a continuous mapping o of X into a 
parallelotope 1^ — [T, 7; (1; ^ I) is defined by 

e (x) 1f, G9) eq, and the mapping ¢ isa 
homeomorphism of X onto the subspace o(X) 
of I^ (Tikhonov's embedding theorem, Math. 
Ann., 102 (1930)). The closure p(X) of o(X) in 
I^ is the Stone-Cech compactification of X. 
The natural mapping B(X, x X,)f(X,) x 
B{X,) is a homeomorphism if and only if X, x 
X, is pseudocompact (I. Glicksberg, 1959). 

For a topological space X, let oo be a point 
not in X, and define a topology on the union 
XU (oc] as follows: A subset U of X U {œ} is 
open if and only if either oo #U and U is open 
in X, or oce U and X —U is a compact closed 
subset of X. The topological space XU {a0} 
thus obtained is compact, and if X is not 
already compact, the space X U {20} is a com- 
pactification of X called the one-point com- 
pactification of X (P. S. Aleksandrov, C. R. 
Acad. Sci. Paris, 178 (1924)). The one-point 
compactification of a Hausdorff space is not 
necessarily Hausdorff. The one-point compac- 
tification of the n-dimensional Euclidean space 
R" is homeomorphic to the n-dimensional 
sphere S". 

A completely regular space X is a tG;-set in 
the Stone-Cech compactification B(X) if and 
only if it is a G,-set in any Hausdorff space Y 
which contains X as a dense subspace. Then X 
is said to be Cech-complete. 


U. Absolutely Closed Spaces 


A Hausdorff space X is said to be absolutely 
closed (or H-closed; P. Aleksandrov and P. 
Uryson, 1929) if X is closed in each Hausdorff 
space containing it. A compact Hausdorff 
space is absolutely closed. A Hausdorff space 
is absolutely closed if and only if for each open 
covering {N,\,-, of X, there is a finite sub- 
family of {N,\,<, that covers X. The product 
space of a family of absolutely closed spaces is 
absolutely closed. Each Hausdorff space is a 
dense subset of an absolutely closed space. 
Similarly, a regular space X is said to be r- 
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closed if X is closed in each regular space 
containing it (N. Weinberg, 1941). 


V. Locally Compact Spaces 


A topological space X ts said to be locally 
compact if each point of X has a compact 
neighborhood (P. Aleksandrov and P. Uryson, 
1929). A *uniform space X is said to be uni- 
formly locally compact if there is a member U 
of the *uniformity such that U(x) is compact 
for each x in X (= 436 Uniform Spaces). A 
noncompact space X ts locally compact and 
Hausdorff if and only if the one-point com- 
pactification of X is Hausdorff, and this is the 
case if and only if X 1s homeomorphic to an 
open subset of a compact Hausdorff space. A 
locally compact Hausdorff space is completely 
regular, and for each point of the space, the 
family of all of its compact neighborhoods 
forms a local base at the point. A locally 
closed, hence open or closed, subset of a lo- 
cally compact Hausdorff space is also locally 
compact for the relative topology. If a sub- 
space A of a Hausdorff space X is locally 


compact, then A is a locally closed subset of X. 


The Euclidean space R" is locally compact, 
and hence each locally Euclidean space, i.e., a 
space such that each point admits a neighbor- 
hood homeomorphic to a Euclidean space, is 
locally compact. A topological space is called 
c-compact if it can be expressed as the union 
of at most countably many compact subsets. 


W. Proper (Perfect) Mappings 


A mapping f of a topological space X into a 
topological space Y is said to be proper (N. 
Bourbaki [9]) (or perfect [14]) if it is con- 
tinuous and for each topological space Z, the 
mapping f x 1l: X x ZoY x Z is closed, where 
(f x 1D)(x, z) 2 (f(x), z). A continuous mapping 
f: X 5 Y is proper if and only if it is closed and 
f~'({y}) is compact for each y in Y. Another 
necessary and sufficient condition ts that if 
{x,}y is a *net in X such that its image { f(x,)} 
converges to ye Y, then a subnet of [x,] con- 
verges to an xef !(y) in X. A continuous 
mapping of a compact space into a Hausdorff 
space is always proper. For a compact Haus- 
dorff space X, a quotient space Y is Hausdorff 
if and only if the canonical projection o: X > 
Y is proper. 

For a continuous mapping f of a locally 
compact Hausdorff space X into a locally 
compact Hausdorff space Y, the following 
three conditions are equivalent: (1) f is proper. 
(2) For each compact subset K of Y, the in- 
verse image f ! (K) is compact. (3) If XU Ix, ] 
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and YU y, 1 are the one-point compactifi- 
cations of X and Y, then the extension f, of 
f such that f (x)= y, is continuous. 

The composition of two proper mappings is 
proper and the direct product of an arbitrary 
number of proper mappings is proper. 


X. Paracompact Hausdorff Spaces 


A paracompact Hausdorff space (often called 
simply a paracompact space) is normal. For a 
Hausdorff space X, the following five con- 
ditions are equivalent: (1) X is paracompact. 
(2) X is fully normal (J. W. Tukey [8]), i.e., 
each open covering of X admits an open 
barycentric refinement. (3) Each open covering 
has a partition of unity subordinate to it. (4) 
Each open covering is refined by a closed 
covering {F,|xe A} that is closure-preserving, 
i.e., ULF,| Be Bj is closed for each Bc A. (5) 
Each open covering (U,|x€ Aj has a cushioned 
refinement | V,|xe Aj, ie, CI(U(V,|Be Bj)c 
UTUg| Pe Bj for each Bc A. The implication 
(D-(2) is Dieudonné’s theorem. The implication 
(2) (1) is A. H. Stones theorem (1948), from 
which it follows that each metric space is para- 
compact. The implications (5) (4) (1) is 
Michael's theorem (1959, 1957). 

For normal spaces, the following weaker 
versions of (2) and (3) hold: A T,-space X is 
normal if and only if each finite open covering 
of X admits a finite open star refinement (or 
finite open barycentric refinement). For each 
locally finite open covering of a normal space, 
there is a partition of unity subordinate to it. 

For a regular space X the following three 
conditions are equivalent: (1) X is paracom- 
pact. (2) Each open covering of X is refined by 
a c-discrete open covering. (3) Each open 
covering of X is refined by a c-locally finite 
open covering. Tamano's product theorem: For 
a completely regular space X to be paracom- 
pact it is necessary and sufficient that X x p(X) 
be normal (1960). 

For a *connected locally compact space X, 
the following conditions are equivalent: (1) X 
is paracompact. (2) X is o-compact. (3) In the 
one-point compactification X U {oo}, the point 
oo admits a countable local base. (4) There is a 
locally finite open covering {U,},-, of X such 
that U, is compact for each 2. (5) X is the 
union of a sequence {U,} of open sets such 
that U, is compact and U,c U,,, (n=1,2,...). 
(6) X is strongly paracompact. 

Every *F,-set of a paracompact Hausdorff 
space is paracompact (Michael, 1953). When a 
T,-space X has the hereditarily weak topology 
with respect to a closed covering {F,}, then X 
is paracompact Hausdorff (normal, completely 
normal or perfectly normal) if and only if each 


425 Y 
Topological Spaces 


F, is (Morita, 1954; Michael, 1956). In par- 
ticular, every CW-complex is paracompact 
and perfectly normal (Morita, 1953). 


Y. Normality and Paracompactness of Direct 
Products 


A topological space X is discrete if X x Y is 
normal for any normal space Y (M. Atsuji and 
M. Rudin, 1978). There are a paracompact 
Lindelóf space X and a separable metric space 
Y such that the product X x Y is not normal 
(Michael, 1963). The following are conditions 
under which the products are normal or 
paracompact. Let m be an infinite *cardinal 
number. A topological space X is called m- 
paracompact if every open covering consisting 
of at most m open sets admits a locally finite 
open covering as its refinement. When m is 
countable, it is called countably paracompact. 
If X has an open base of at most m members, 
m-paracompact means paracompact. The 
following conditions are equivalent for a topo- 
logical space X: (1) X is normal and count- 
ably paracompact; (2) The product X x Y is 
normal and countably paracompact for any 
compact metric space Y; (3) X x I is normal, 
where I — [0,1] (C. H. Dowker, 1951). Rudin 
(1971) constructed an example of a collection- 
wise normal space (— Section AA) that is not 
countably paracompact. When m is general 
the following conditions are equivalent: (1) X 
is normal and m-paracompact; (2) If Y is a 
compact Hausdorff space with an open base 
consisting of at most m sets, then X x Y is 
normal and m-paracompact; (3) X x I" is 
normal; (4) X x 10, 1j" is normal (Morita, 
1961). In particular, the product X x Y of 

a paracompact Hausdorff space X and a 
compact Hausdorff space Y is paracompact 
(Dieudonné, 1944). 

A topological space X is called a P-space if 
it satisfies the following conditions: Let Q be 
an arbitrary set and (G(o,, ...,2;)|o,, ... GEQ, 
i— 1,2, ... ) be a family of open sets such that 
G(x, ...,0;) E G(a,, ...,2;,0,,,). Then there is a 
family of closed sets {F(a,,...,%;)|%1,...,4;€Q, 
i— 1,2, ...) such that F(a,, ...,a;)) c G(a,, ...,a) 
and that if | J, G(a,,...,a%,)=X for a se- 
quence {a,}, then | J£, F(a,, ...,a;) — X. Per- 
fectly normal spaces, countably compact 
spaces, Cech-complete paracompact spaces 
and o-compact regular spaces are P-spaces. 
Normal P-spaces are countably paracom- 
pact. A Hausdorff space X is a normal (resp. 
paracompact) P-space if and only if the prod- 
uct X x Y is normal (resp. paracompact) for 
any metric space Y (Morita, Math. Ann., 154 
1964). 
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The product X x Y of locally compact Haus- 
dorff spaces X and Y is a locally compact 
Hausdorff space. If, in this case, X and Y are 
paracompact, then so is the product. If the 
direct product space [], X, of metric spaces is 
normal, then X; are compact except for at 
most countably many /, and hence the prod- 
uct space is paracompact (A. H. Stone, 1948). 

A class @ of topological spaces is called 
countably productive if for a sequence X; of 
members of € their product [] X, is again a 
member of €. The classes of (complete) (sepa- 
rable) metric spaces form such examples. The 
class of paracompact and Cech-complete 
spaces is countably productive (Z. Frolik, 
1960). A topological space X is called a p- 
space if it is completely regular and there 
is a sequence Wi; of families of open sets in 
the Stone-Cech compactification B(X) such 
that, for each point xe X, xef) S(x, M) =X 
(Arkhangel’skii, 1963). X is called an M-space 
if there is a normal sequence IN; of open 
coverings of X such that if K, 2 K,2...isa 
sequence of nonempty closed sets and Kc 
S(x, Mj), i— 1,2, ...,foran xe X, then () K;# 
@ (Morita, 1963). The class of paracompact 
p-spaces and that of paracompact Hausdorff 
M-spaces are the same and are countably pro- 
ductive. For a covering § of X and an xe X 
we set C(x, ®)=(\{F|xeFeR}. X is called a 
2-space if X admits a sequence %; of locally 
finite closed coverings such that if K, >K,> 
... is a sequence of nonempty closed sets and 
K;c C(x, &,), i2 1,2,..., for an xe X, then 
(\K;4#@ (K. Nagami, 1969). X-spaces are 
P-spaces. The class of all paracompact £Z- 
spaces is also countably productive. Among 
the above classes each one is always wider 
than its predecessors. Yet the product X x Y of 
a paracompact Hausdorff P-space X anda 
paracompact Hausdorff Z-space Y is paracom- 
pact. Other examples of countably productive 
classes are the Suslin spaces and the Luzin 
spaces (— Section CC) introduced by Bour- 
baki (1958), the stratifiable spaces by J. G. 
Ceder (1961) and C. J. R. Borges (1966), the No- 
spaces by Michael (1966) and the o-spaces by 
A. Okuyama (1967). 


Z. Strongly Paracompact Spaces 


Regular Lindelóf spaces are strongly paracom- 
pact. Conversely, if a connected regular space 
is strongly paracompact, then it is a Lindelöf 
space (Morita, 1948). Hence a connected non- 
separable metric space is not strongly para- 
compact. Paracompact locally compact Haus- 
dorff spaces and uniformly locally compact 
Hausdorff spaces are strongly paracompact. 
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These classes of spaces coincide under suitable 
tuniform structures. 


AA. Collectionwise Normal Spaces 


A Hausdorff space X is called a collectionwise 
normal space if for each discrete collection 
(F,|a€ A} of closed sets of X there exists a 
disjoint collection {U,|aeA} of open sets with 
F,c U, (xe A) (R. H Bing, 1951). If X satisfies 
an analogous condition for the case where 
each F, is a singleton, X is called a collection- 
wise Hausdorff space. Paracompact Hausdorff 
spaces are collectionwise normal (Bing). Every 
point-finite open covering of a collectionwise 
normal space has a locally finite open refine- 
ment (Michael, Nagami). 

A topological space X is called a developable 
space if it admits a sequence U,, i= 1,2, ..., of 
open coverings such that, for each point xe X, 
{S(x, 1M)|io 1,2, ... ) forms a base for the 
netghborhood system of x (R. L. Moore, 1916). 
A regular developable space is called a Moore 
space. The question of whether or not every 
normal Moore space is metrizable is known 
as the normal Moore space problem (— 273 
Metric Spaces K). Collectionwise normal 
Moore spaces are metrizable (Bing). 


BB. Real-Compact Spaces 


A completely regular space X is called real- 
compact if X is complete under the smallest 
*uniformity such that each continuous real- 
valued function on X is uniformly continuous 
(— 422 Uniform Spaces). This notion was 
introduced by E. Hewitt (Trans. Amer. Math. 
Soc., 64 (1948)) under the name of Q-space, 
and independently by L. Nachbin (Proc. Inter- 
national Congress of Mathematicians, Cam- 
bridge, Mass., 1950). 

A Lindelóf space is real-compact. If X, and 
X, are real-compact spaces such that the rings 
C(X,) and C(X,) of continuous real-valued 
functions on X, and X; are isomorphic, then 
X, and X, are homeomorphic (Hewitt). If X is 
real-compact, then X is homeomorphic to a 
closed subspace of the product space of copies 
of the space of real numbers, and conversely. 


CC. Images and Inverse Images of Topological 
Spaces 


Each continuous mapping f: X > Y is decom- 
posed into the product ioo p of continuous 
mappings p: X o X/—, h: X/ — 5 f(X) and i: 
f(X)9 Y, where ~ is the equivalence relation 
such that x, ^ x; if and only if f(x,)= f(x4). 
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The mapping f is open (resp. closed) if and 
only if these mappings are all open (resp. 
closed). Then h is a homeomorphism. The 
image of a paracompact Hausdorff space 
under a closed continuous mapping is para- 
compact (Michael, 1957). 

Let f: X 2 Y be a perfect surjection. Then Y 
is called a perfect image of X and X a perfect 
inverse image of Y. If, in this case, one of X 
and Y satisfies a property such as being com- 
pact, locally compact, o-compact, Lindelöf, or 
countably compact, then the other also satis- 
fies the property. When X and Y are com- 
pletely regular, the same is true with regard 
to Cech completeness. Properties such as 
regularity, normality, complete normality, 
perfect normality, and the second countability 
axiom are preserved in perfect images; but 
complete regularity and strong paracompact- 
ness are not. Perfect images of metric spaces 
are also metrizable (S. Hanai and Morita, A. 
H. Stone, 1956). Conversely, perfect inverse 
images of paracompact spaces are paracom- 
pact. If a Hausdorff space is a perfect inverse 
image of a regular space (resp. k-space; — 
below), then it is a regular space (resp. k- 
space). Every paracompact Cech-complete 
space is a perfect inverse image of a tcomplete 
metric space (Z. Frolik, 1961). A completely 
regular space is a paracompact p-space if and 
only if it is a perfect inverse image of a metric 
space (Arkhangel’skii, 1963). A mapping f: X > 
Y is called quasi-perfect if it is closed and 
continuous and the inverse image IT ! (y) of 
each point ye Y is countably compact. A topo- 
logical space X is an M-space if and only if 
there is a quasi-perfect mapping from X onto 
a metric space Y (Morita, 1964). Let f: X —Y 
be a quasi-perfect surjection. If one of X and 
Y is a Z-space, then the other is also a X-space 
(Nagamti, 1969). 

A topological space X is called a Fréchet- 
Uryson space (or a Fréchet space) if the closure 
of an arbitrary set A c X is the set of all limits 
of sequences in A (Arkhangel’skii, 1963). X is 
called a sequential space if A < X is closed when- 
ever A contains all the limits of sequences in 
A (S. P. Franklin, 1965). X is called a k'-space 
if the closure of an arbitrary set A is the set of 
all points adherent to the intersection AM K 
for a compact set K in X (Arkhangel’skii, 
1963). X is called a k-space if A c X is closed 
whenever AN K is closed in K for any compact 
set K (— Arkhangel'skii, Trudy Moskov. Mat. 
Obshch., 13 (1965)). Spaces satisfying the first 
countability axiom are Fréchet-Uryson spaces. 
The Fréchet-Uryson spaces (resp. sequential 
spaces) are characterized as the images under 
hereditarily quotient (resp. quotient) mappings 
of metric spaces or locally compact metric 
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spaces. Similarly the k'-spaces (resp. k-spaces) 
coincide with the images under hereditarily 
quotient (resp. quotient) mappings of locally 
compact spaces. The image of a metric space 
under a closed continuous mapping is called a 
Lashnev space. Any subspace of a Fréchet- 
Uryson space is a Fréchet-Uryson space. Con- 
versely, a Hausdorff space is a Fréchet-Uryson 
space if any of its subspaces is a k-space. Cech- 
complete spaces are k-spaces. A Hausdorff 
space is called a Suslin space (resp. Luzin space) 
if it is the image under a continuous surjection 
(resp. continuous bijection) of a complete 
separable metric space (Bourbaki [9]; also — 
22 Analytic Sets). 

In Figs. 1, 2, and 3, the relationships be- 
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tween the various properties are indicated 
by the arrows. 
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426 (IX.1) 
Topology 


The term topology means a branch of mathe- 
matics that deals with topological properties 
of geometric figures or point sets. A classical 
result in topology is the Euler relation on 
polyhedra: Let zo, x,, and x, be the numbers 
of vertices, edges, and faces of a polyhedron 
homeomorphic to the 2-dimensional sphere; 
then ao — x, +% —2 (*Euler-Poincaré formula 
for the 2-dimensional case; actually, the for- 
mula was known to Descartes). It is one of the 
earliest results in topology. In 1833, C. F. 
Gauss used integrals to define the notion of 
‘linking numbers of two closed curves in a 
space (— 99 Degree of Mapping). It was in 

J. B. Listing's classical work Vorstudien zur 
Topologie (1847) that the term topology first 
appeared in print. 

In the 19th century, B. Riemann published 
many works on function theory in which topo- 
logical methods played an essential role. He 
solved the homeomorphism problem for com- 
pact surfaces (— 410 Surfaces); his result is 
basic in the theory of algebraic functions. In 
the same period, mathematicians began to 
study topological properties of n-dimensional 
polyhedra. E. Betti considered the notion of 
*homology. H. Poincaré, however, was the first 
to recognize the importance of a topological 
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approach to analysis in general; he defined the 
*homology groups of a complex [1]. He ob- 
tained the famous *Poincaré duality theorem 
and defined the "fundamental group. He con- 
sidered *polyhedra as the basic objects in top- 
ology, and deduced topological properties 
utilizing tcomplexes obtained from polyhedra 
by ‘simplicial decompositions. He thus con- 
structed a branch of topology known as com- 
binatorial topology. 

In its beginning stages combinatorial top- 
ology dealt only with polyhedra. In the late 
1920s, however, it became possible to apply 
combinatorial methods to general tcompact 
spaces. P. S. Alexandrov introduced the con- 
cept of approximation of a *Àcompact metric 
space by an inverse sequence of complexes and 
the definition of homology groups for these 
spaces. His idea had a precursor in the notion 
of *simplicial approximations of continuous 
mappings, which was introduced by L. E. J. 
Brouwer in 1911. In 1932, E. Cech defined 
homology groups for arbitrary spaces utilizing 
the tinductive limit of the homology groups of 
polyhedra; and tCech cohomology groups for 
arbitrary spaces were also defined. S. Eilenberg 
established tsingular (co)homology theory 
using *singular chain complexes (1944). The 
axiomatic approach to (co)homology theory is 
due to Eilenberg and Steenrod, who gave 
axioms for (co)homology theory in a most 
comprehensive way and unified various (co)- 
homology theories (1945) (— 201 Homology 
Theory. 

The approach using algebraic methods has 
progressed extensively in connection with the 
development of homology theory. This branch 
is called algebraic topology. In the 1920s and 
1930s, a number of remarkable results in alge- 
braic topology, such as the tAlexander duality 
theorem, the *Lefschetz fixed-point theorem, 
and the *Hopf invariant, were obtained. In the 
late 1930s, W. Hurewicz developed the theory 
of higher-dimensional thomotopy groups (— 
153 Fixed-Point Theorems, 201 Homology 
Theory, 202 Homotopy Theory). J. H. C. 
Whitehead introduced the concept of *CW 
complexes and proved an algebraic charac- 
terization of the homotopy equivalence of CW 
complexes. N. Steenrod developed tobstruc- 
tion theory utilizing tsquaring operations in 
the cohomology ring (1947). Subsequently, 
the theory of ‘cohomology operations was 
introduced (— 64 Cohomology Operations, 
305 Obstructions). The theory of ‘spectral 
sequences for ‘fiber spaces was originated by 
J. Leray (1945) and J.-P. Serre (1951) and was 
successfully applied to cohomology operations 
and homotopy theory by H. Cartan and Serre 
(1954) (— 148 Fiber Spaces, 200 Homological 
Algebra). The study of the combinatorial 
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structures of polyhedra and tpiecewise linear 
mappings has flourished since 1940 in the 
works of Whitehead, S. S. Cairns, and others. 
S. Smale and, independently, J. Stallings 
solved the *generalized Poincaré conjecture in 
1960. The *Hauptvermutung in combinatorial 
topology was solved negatively in 1961 by B. 
Mazur and J. Milnor. E. C. Zeeman proved 
the unknottedness of codimension 3 (1962). 
The recent development of the theory in con- 
junction with progress in !differential topol- 
ogy is notable. The Hauptvermutung for com- 
binatorial manifolds was solved in 1969 by 
Kirby, Siebenmann, and Wall. In particular, 
there exist different combinatorial structures 
on tori of dimension > 5, and there are topo- 
logical manifolds that do not admit any com- 
binatorial structure (— 65 Combinatorial 
Manifolds, 114 Differential Topology. 235 
Knot Theory). 

The global theory of differentiable manifolds 
started from the algebraic-topological study of 
‘fiber bundles and 'characteristic classes in the 
1940s. R. Thom's fundamental theorem of 
tcobordism (1954) was obtained through ex- 
tensive use of cohomology operations and 
homotopy groups. Milnor (1956) showed that 
the sphere S" may have differentiable struc- 
tures that are essentially distinct from each 
other by using "Morse theory and the *index 
theorem of Thom and Hirzebruch. These 
results led to the creation of a new field, *dif- 
ferential topology (— 56 Characteristic Classes, 
114 Differential Topology). 

Since 1959, A. Grothendieck, M. F. Atiyah, 
F. Hirzebruch, and J. F. Adams have devel- 
oped tK-theory, which is a generalized coho- 
mology theory constructed using stable classes 
of tvector bundles (— 237 K-Theory). 

*K not theory, an interesting branch of top- 
ology, was one of the classical branches of 
topology and is now studied in connection 
with the theory of low-dimensional manifolds 
(— 235 Knot Theory). 

On the other hand, G. Cantor established 
general set theory in the 1870s and introduced 
such notions as taccumulation points, topen 
sets, and tclosed sets in Euclidean space. The 
first important generalization of this theory 
was the concept of *topological space, which 
was proposed by M. Fréchet and developed by 
F. Hausdorff at the beginning of the 20th 
century. The theory subsequently became a 
new field of study, called general topology or 
set-theoretic topology. It deals with the topo- 
logical properties of point sets in a Euclidean 
or topological space without reference to 
polyhedra. There has been a remarkable devel- 
opment of the theory since abount 1920, nota- 
bly by Polish mathematicians S. Janiszewski, 
W. Sierpinski, S. Mazurkiewicz, C. Kuratow- 
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ski, and others. The contributions of R. L. 
Moore, G. T. Whyburn, and K. Menger are 
also important (— 382 Shape Theory, 425 
Topological Spaces). 

Topology is not only a foundation of vari- 
ous theories, but ts also itself one of the most 
important branches of mathematics. It consists 
of thomology theory, thomotopy theory, !dif- 
ferential topology, ‘combinatorial manifolds, 
*K-theory, ‘transformation groups, ‘theory of 
singularities, tfoliations, *dynamical systems, 
*catastrophe theory, etc. It continues to de- 
velop in interaction with other branches of 
mathematics (— 51 Catastrophe Theory, 126 
Dynamical Systems, 154 Foliations, 418 Theory 
of Singularities, 431 Transformation Groups). 


References 


[1] H. Poincaré, Analysis situs, J. Ecole Poly- 
tech., (2) 1 (1895), 1—121. (Oeuvres, Gauthier- 
Villars, 1953, vol. 6, 193—288.) 

[2] P. S. Aleksandrov and H. Hopf, Topologie 
L Springer, 1935 (Chelsea, 1965). 

[3] S. Lefschetz, Algebraic topology, Amer. 
Math. Soc. Colloq. Publ., 1942. 

[4] S. Eilenberg and N. E. Steenrod, Founda- 
tions of algebraic topology, Princeton Univ. 
Press, 1952. 


427 (1X.12) 
Topology of Lie Groups and 
Homogeneous Spaces 


A. General Remarks 


Among various topological structures of "Lie 
groups and *homogeneous spaces, the struc- 
tures of their *(co)homology groups and 
*homotopy groups are of special interest. Let 
G/H be a homogeneous space, where G is a 
Lie group and H is its closed subgroup. Then 
(G, G/H, H) is a 'fiber bundle, where G/H is 
the base space and H is the fiber. Thus homol- 
ogy and homotopy theory of fiber bundles 
(‘spectral sequences and thomotopy exact 
sequences) can be applied. The *cellular de- 
composition of *Stiefel manifolds, *Grassmann 
manifolds, and *Káhler homogeneous spaces 
are known. Concerning tsymmetric Riemann- 
ian spaces, we have various interesting meth- 
ods, such as the use of invariant differential 
forms in connection with real cohomology 
rings and the use of "Morse theory in order to 
establish relations between the diagrams of 
symmetric Riemannian spaces G/H and homo- 
logical properties of their "loop spaces and 
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some related homogeneous spaces [4, 5]. Lie 
groups can be regarded as special cases of 
homogeneous spaces or symmetric spaces, 
although their group structures are of partic- 
ular importance. A connected Lie group is 

. homeomorphic to the product of one of its 
compact subgroups and a Euclidean space 
(*Cartan-Mal'tsev-Iwasawa theorem). Hence 
the topological structure of a connected Lie 
group is essentially determined by the topolog- 
ical structures of its compact subgroups. 


B. Homology of Compact Lie Groups 


Let G be a connected compact Lie group. 
Since G is an * H-space whose multiplication is 
given by its group multiplication h, H*(G; k) 
and H,(G; k) are dual *Hopf algebras for any 
coefficient field k. Also, H*(G; k) is isomorphic 
as a 'graded algebra to the tensor product of 
telementary Hopf algebras (— 203 Hopf Alge- 
bras), but no factor of the tensor product is 
isomorphic to a polynomial ring because G is 
a finite tpolyhedron. In particular, if k=R 
(the field of real numbers), then H*(G;R)= 
An, -< xj) (the exterior (Grassmann) alge- 
bra over R with generators x,, ..., x, of odd 
degrees). Here we can choose generators x; 
such that h*{x, =1@x;+x;@1, 1 «il. The 
x; that satisfy this property are said to be 
primitive. Since in this case the tcomulttplica- 
tion h* is commutative, the multiplication h, 
is also commutative and the Hopf algebra 
H(G; R) is an exterior algebra generated 

by elements y; having the same degree as 

x; (i=1,...,/). When the characteristic of the 
coefficient field k is nonzero, h, need not be 
commutative. 

The dimension of a tmaximal torus of a 
connected compact Lie group G is indepen- 
dent of the choice of the maximal torus and is 
called the rank of G. The rank of G coincides 
with the number / of generators of H*(G; R). E. 
Cartan studied H*(G; R) by utilizing invariant 
differential forms. The cohomology theory of 
Lie algebras originated from the method he 
used in his study. H*(G; R) is invariant under 
‘local isomorphisms of groups G. For tclass- 
ical compact simple Lie groups G, R. Brauer 
calculated H*(G; R), while C.-T. Yen and C. 
Chevalley calculated H*(G; R) for *exceptional 
compact simple Lie groups (— Appendix A, 
Table 6.IV). The degrees of the generators 
have group-theoretic meaning. Suppose that 
the degree of the ith generator is 2m; — 1, 1< 
ixl, and that m, «m; S... e m,. When G is 
simple, there is a relation m;-- m, ;,, — const- 
ant (Chevalley's duality). We have a proof for 
this property that does not use classification. 

The cohomology groups H*(G; Z,) (where p 
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is a prime and Z, = Z/pZ) have been deter- 
mined as graded algebras for all compact 
simple Lie groups by A. Borel, S. Araki, and P. 
Baum and W. Browder (— Appendix A, Table 
6.IV). 


C. Cohomology of Classifying Spaces 


Let (EG, Bg, C) be a *universal bundle of a 
connected compact Lie group G and p a prime 
or zero. Suppose that the integral cohomology 
of G has no p-torsion (no torsion when p —0). 
Then H*(G; Z,) — Az G^, .... xj) (H*(G; Z) = 
Aal, xi) when p=0), an exterior alge- 

bra with deg x; 22m;— 1, 1<i<l, and the 
generators x; can be chosen to be ftransgres- 
sive in the spectral sequence of the universal 
bundle. Let y,, ..., y; be their transgression 
images. Then deg y; 2 2m,, 1 <i</, and the 
cohomology of the *classifying space Bg over 
Z, (resp. Z) is the polynomial algebra with 
generators y,,..., yj. Let T be a maximal torus 
of G. Then B;=E,/T is a classifying space of 
T, the *Weyl group W— N(TYT of G with 
respect to T operates on B, by *right transla- 
tions, and H*(T; Z) has no torsion and is an 
exterior algebra with / generators of degree 1. 
Thus H*(B4; Z) - Z[u,, .... uj], degu; — 2. Let 
Iy be the subalgebra of H*(B,; Z) consisting 
of W-invariant polynomials, and let p be the 
projection of the bundle (B7, Bg, G/T). Then 
under the assumption that G has no p-torsion 
(no torsion), the cohomology mapping p* over 
Z,(Z) is monomorphic, and p*: H*(B,; Z) = 
Iw © Z,(H*(Bc; Z) x Iw) [1]. In the case of 
real coefficients, we have H*(Bg; R)z Iy Gi 


R for all G, and m,,...,m, are the degrees 
of generators of the ring Iy of W-invariant 
polynomials. 


Example (1) G= U (n): l=n and G has no 
torsion. W operates on H*(B,; Z) as the group 
of all permutations of generators uj, ...,u,. 
Thus generators of I, are the telementary 
symmetric polynomials o,, ...,0, Of u,, ..., Up- 
Let c,, ...,c, be the *universal Chern classes; 
then p*(c)— o; and H*(Byu4; Z) - Z[c,, Cal, 

Example (2) G = SO(n): l= [n/2] and G 
has no p-torsion for p#2. W operates on 
H*(B4; Z) as the group generated by the per- 


mutations of generators u,,...,u, and by the 
transformations c(u;) — e;u;, e; = +1, where the 
number of u; for which e; = —1 is arbitrary for 


odd n and even for even n. Thus the generators 
of Jy are the elementary symmetric poly- 
nomials oi... oi of u2, ..., uf for odd n and 
0,,...,0;, and u, ... uj for even n. Let p,, ..., p, 
be the tuntversal Pontryagin classes and y be 
the *universal Euler-Poincare class in the case 
of even n. Then p*(pj)- o; and p*(y) - u, ... uj 
for integral cohomology. Denote the mod p 
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reduction of p; and y by p; and 7, respectively. 
Then H*(Byo 141); Zp — Z,[p1. .... p;] and 
II Baan Zp) = Z pis pio x10 (p=9 or 
> 2). 

Example (3) G = O(n): If we use the subgroup 
Q consisting of all diagonal matrices instead of 
T, then we can make a similar argument for 
Z;-cohomology. Since Q zx (Z;)", H*(Bo; 
£5) = 155 [v.v [075 vc visa 
polynomial ring with degv;= 1), and W, = 
N(Q)/Q operates on Bọ by right translations 
and on H*(Bo; Z) as the group of all permuta- 
tions of v,,...,0,. Let fy, be the subalgebra 
of H*(Bo; Z2) consisting of all W;-invariant 
polynomials. Then Ju. is a polynomial ring 
generated by the elementary symmetric poly- 
nomials o, ...,6, of v, ..., v,. The projec- 
tion p3: Bo Bom induces a monomorphic 
cohomology mapping of over Z,, and of: 
H*( Boy; Z2)= Iw. Let wi, W, be the tuni- 
versal Stiefel- Whitney classes. Then p¥(w,) = 
o; and H*(Bom Z2)= Z;[w,,....w,] [2]. 


D. Grassmann Manifolds 


The following manifolds are called Grass- 
mann manifolds: The manifold M, ,,, ,(R) con- 
sisting of all n-subspaces of R"*"; the mani- 
fold AM. ,(R) consisting of all oriented n- 
subspaces of R"*": and the manifold A, a (C) 
consisting of all complex n-subspaces of C"*". 
These are expressed as quotient spaces as 
follows: M,,,, ,(R) - O(n + m)/O(n) x O(m), 

M, ,,,(R) = SO(n + m)/SO(n) x SO(m), and 
M, S (C) 9 U(n+m)/U(n) x U(m). They admit 
cellular decompositions by *Schubert varieties 
from which their cohomologies can be com- 
puted (— 56 Characteristic Classes). M,,,, ,(R) 
and M, ,,, ,(R) have no p-torsion for p 2, and 
M,+m.n(C) has no torsion. These spaces are m-, 
m-, and (2m + 1)-classifying spaces of O(n), 
SO(n), and U (n), respectively. Hence their 
cohomologies are isomorphic to those of Bg 
(G — O(n), SO(n), U(n)) in dimensions <m, 
«m, and x 2m, respectively; and they are poly- 
nomial rings generated by suitable univer- 

sal characteristic classes in low dimensions. 


E. Cohomologies of Homogeneous Spaces G/U 
(Rank G = Rank U) 


Let G be a compact connected Lie group and 
U a closed subgroup of G with the same rank 
as G. Denote the degrees of generators of 
H*(G; R) and H*(U; R) by 2m, — 1, ...,2m,— 1, 
and 2n, — 1, ...,2n, — 1, respectively. Then the 
real-coefficient *Poincaré polynomial Po of the 
homogeneous space G/U is given by P)(G/U, t) 
= [J 1 —1°")/(1 — t") (G. Hirsch). When G, U, 
and G/U have no p-torsion, the same formula 
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is valid for the Z ,-coefficient Poincaré poly- 
nomial [1]. When U ts the *centralizer of a 
torus, G/U has a complex analytic cellular 
decomposition [3]. Hence G/U has no torsion 
in this case. This was proved by R. Bott and 
H. Samelson by utilizing Morse theory [5] 

(— 279 Morse Theory). The case U = T has 
also been studied. 


F. Homotopy Groups of Compact Lie Groups 


The ‘fundamental group z,(G) of a compact 
Lie group G is Abelian. Furthermore, z,(G)= 
0. If we apply Morse theory to G, the varia- 
tional completeness of G can be utilized to 
show that the loop space OG has no torsion 
and that its odd-dimensional cohomologies 
vanish [4]. Consequently, when G is non- 
Abelian and simple, we have 14(G) zz A 
*periodicity theorem on ‘stable homotopy 
groups of classical groups proved by Bott is 
used in K-theory (— 202 Homotopy Theory; 
237 K-Theory). (For explicit forms of homo- 
topy groups — Appendix A, Table 6.VI). 

Homotopy groups of Stiefel manifolds are 
used to define characteristic classes by *ob- 
struction cocycles (— 147 Fiber Bundles; Ap- 
pendix A, Table 6.VI). 


References 


[1] A. Borel, Sur la cohomologie des espaces 
fibrés principaux et des espaces homogènes de 
groupes de Lie compacts, Ann. Math., (2) 57 
(1953), 115—207. 

[2] A. Borel, La cohomologie mod2 de cer- 
tains espaces homogènes, Comment. Math. 
Helv., 27 (1953), 165—197. 

[3] A. Borel, Káhlerian coset spaces of semi- 
simple Lie groups, Proc. Nat. Acad. Sci. US, 
40 (1954), 1147-1151. 

[4] R. Bott, Àn application of the Morse 
theory to the topology of Lie-groups, Bull. 
Soc. Math. France, 84 (1956), 251—281. 

[5] R. Bott and H. Samelson, Applications of 
the theory of Morse to symmetric spaces, 
Amer. J. Math., 80 (1958), 964—1029. 


428 (XIII.17) 
Total Differential Equations 


A. Pfaff's Problem 


A total differential equation is an equation of 
the form 


RER (1) 


where œ is a ‘differential 1-form ZE, a;(x) dx; 
on a manifold X. A submanifold M of X is 
called an integral manifold of (1) if each vector 
č of the ttangent vector space T,(M) of M at 
every point x on M satisfies w(é)=0. We de- 
note the maximal dimension of integral mani- 
folds of (1) by m(w). J. F. Pfaff showed that 
m(w) 2 (n — 1)/2 for any w. The problem of 
determining m(«) for a given form « is called 
Pfaff's problem. This problem was solved by 
G. Frobenius, J. G. Darboux, and others as 
follows: Form an *'alternating matrix 


(aij «i jen (2) 
from the coefficients of the *exterior derivative 
of «c, 


deo -— 


2 3 Y aj(x)dx,Adx;, 


fe 


T 


where aj; — 0a,/Cx; — 0a;/0x;. Suppose that the 
rank of (2) is 2t. Then the rank of the matrix 


» 23] 
a; D /izijzn 


is 2t or 2t+ 2. In the former case m(w)=n—t, 
and o can be expressed in the form 


H 
Kë U»i-1 dus 
i=l 


by choosing a suitable coordinate system 
(u,, ..., u,). In the latter case m(w)=n—t—1, 
and w can be expressed in the form 


~ 


H 


u5; , dus; t düz 
1 


by choosing a suitable coordinate system 
(u,, ..., uy). This theorem is called Darboux's 
theorem. 

A I-form o is called a Pfaffian form, and 
equation (1) is called a Pfaffian equation. A 
system of equations w;=0 (1 <i<s) for 1-form 
oi, is called a system of Pfaffian equations or a 
system of total differential equations [6, 12, 26]. 


B. Systems of Differential Forms and Systems 
of Partial Differential Equations 


Let Q be a system of differential forms c, 
O<p<n, lÓxixv, on X, where o is a p-form 
on X. A submanifold M of X is called an 
integral manifold of O —0 if for each p (0€ p 
dim M), any p-dimensional subspace E, of 
T,(M) satisfies wP(E,)=0 (1 & i€ vy) at every 
point x on M. Denote the maximal dimension 
of integral manifolds of Q=0 by m(Q). The 
problem of determining m(Q) for a given sys- 
tem Q is called the generalized Pfaff problem, - 
and will be explained in later sections. By 
fixing a local coordinate system of X and 
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dividing it into two systems (x,,..., x,) and 
(yx, V.) (m n—r), we can consider the 
problem of finding an integral manifold of Q 
=0 defined by 


ya yat, X) 1xoaxm. 


This problem can be reduced to solving a 
system of partial differential equations of the 
first order on the submanifold N with the local 
coordinate system (x,, ..., x,). 

Consider a system of partial differential 
equations ®=0 of order /: 


PAlXis Ya Bis, 1<A<s, (3) 


with leier, 1<, fi &m, ji +... 0j, €l, where 


Vip te Fs 
` d € 1 y 
Sich Ye 


Gel... Geh @ 
A submanifold defined by y, = y,(x,, ..., XA 
1<a<m, is called a solution of © =0 if it satis- 
fies (3) identically. The problem of determining 
whether a given system ®=0 has a solution 
was solved by C. Riquier, who showed that 
any system can be prolonged either to a pas- 
sive orthonomic system or to an incompatible 
system by a finite number of steps. A system 
of partial differential equations is called a 
prolongation of another system if the former 
contains the latter and they have the same 
solution. A passive orthonomic system is one 
whose general solution can be parametrized by 
an infinite number of arbitrary constants. A 
solution containing parameters is called a 
general solution if by specifying the parameters 
we can obtain a solution of the ‘Cauchy prob- 
lem for any initial data. A system (3) ts said to 
be incompatible if it implies a nontrivial rela- 
tion f(x,,...,x,)=0 among the x;. 

The problem of solving a system ®=0 of 
partial differential equations can be reduced to 
that of finding integral manifolds of a system 
of differential forms X as follows: Let J' be a 
manifold with the local coordinate system 


Harar 5 Leien Leen, Pm, 
ji+... +j, <l), 
and Z be a system of 0-forms o, (1 <4 € s) and 


1-forms 


dy,— Y. pidx;, 
i=} 


dpj E pj dx, 

(1 xo, Bx m,j, +... +j, «l). Then an integral 
manifold of © =0 of the form 

y,7 Yi, x) Lesen, 
pi tap ce Sy 


Jp rae 4 St 
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gives a solution y, 2 y,(x,, ...,x,, 1 «o «m, of 

€^ —0, and y, and pit * satisfy (4). 
Conversely, a solution y, = y,(x,, ..., x,), 

| xxm, of ®=0 gives an integral manifold 

of 2 —0 if we define p ^(x,, ..., x) by (4) 

[23, 24, 26]. 


C. Systems of Partial Differential Equations of 
First Order with One Unknown Function 


Consider a system of independent tvector 
fields on N: 
Ó 


L;- 2 b(x)z—. 


Óx; 


ISi Ss. 


We solve a system of inhomogeneous 
equations 


Liy — fjo)y — g,(x) - 0, 


for a given system of f,(x) and g,(x). The sys- 
tem (5) is called a complete system if each of 
the expressions 


[L,, Li]y— (Lifa = L,f3y —( faga — fi9,) 


l<i<p<s, (6) 


1<4<s, (5) 


—(L,8, We Lal 


is a linear combination of the left-hand sides of 
(5), where [L; L,] means the tcommutator of 
L, and L,. This condition is called the com- 
plete integrability condition for (5). Suppose 
that the homogeneous system 


L,y-0, 1<åi<s, (7) 


is complete. Then it has a system of ffunction- 
ally independent solutions y,, ..., y, ,, and any 
solution y of (8) is a function of them: y= 
V(y;. -3 V, LU the inhomogeneous system 
(5) is complete, then the homogeneous system 
(7) is complete. This notion of a complete 
system is due to Lagrange and was extended 
to a system of nonlinear equations by Jacobi 
as follows (— 324 Partial Differential Equa- 
tions of First Order C). 

Consider a system of nonlinear equations 


e E EE E LE P,)=0, 1xAxs, (8) 


where p; — &y/Ox;. The system (8) is called an 
involutory system if each of [F,, F,], 1<a< 
LH KS, is a linear combination of F,,..., F 


5 ANT 


Here *Lagrange's bracket [F, G] is defined by 


r OF (0G ôG 
EG = + is 
[FG] à Op, E 4 c) 


» OG {CF se OF 

£ Op; \ ax, i Qy j 

Suppose that the system (8) is involutory and 
F,,..., F; are functionally independent. Then, 








in general, we can solve the following tCauchy 
problem for an (r — s)-dimensional submani- 
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fold N , of N: Given a function f on N, ,, find 
a solution y of (8) satisfying y 2 f on N,_,. We 
can construct a solution by integrating a sys- 
tem of ordinary differential equations called a 
*characteristic system of differential equations. 
Hence the solution of these problems may be 
carried out in the C?-category (— 322 Partial 
Differential Equations (Methods of Integra- 
tion) B) [7,11]. 


D. Frobenius's Theorem 


Let X be a tdifferentiable manifold of class C^ 
and Q be a system of independent 1-forms cj, 
1 <i<s, on X. Then the system of Pfaffian 
equations Q=0 is called a completely inte- 
grable system if at every point x of X, 


Ss 
do, = 3. Dun, 1<i<s, 
fe 


for 1-forms 6,; on a neighborhood of x. Sup- 
pose that Q=0 is completely integrable. Then 
at every point x of X, there exists a local co- 
ordinate system ( f,,..., f, x,4,..., x) ina 
neighborhood U of x for which a tangent vec- 
tor č of X at ze U satisfies (4) 2 0, 1<i<s, 
if and only if £f, 0, 1 <i<s. In this case, each 
of the df; is a linear combination of @,,...,@,, 
and conversely, each of the œ; is a linear com- 
bination of df,, ..., df,. In general, a function f 
for which df is a linear combination of 
(,,...,@, is called a first integral of Q — 0. 

The theorem of the previous paragraph 
is called Frobenius's theorem, which can be 
stated in the dual form as follows: Let D(X) be 
a tsubbundle of the *tangent bundle T(X) over 
X. The mapping X 3x  D,(X) is called a dis- 
tribution on X. It is said to be an involutive 
distribution if at every point x of X we can 
find a system of independent vector fields L; 
(1 <i<s) ona neighborhood U of x such that 
the L,(z) (1 <i<s) form a basis of D,(X) at 
every zeU and satisfy [L;, L;]=0 (L,,..., Lo), 
1 <i<j<s, on U. A connected submanifold M 
of X is called an integral manifold of D(X) if 
T,(M) = D,{X) at every point x of M. Suppose 
that D(X) gives an involutive distribution on 
X. Then every point x of X is in a maximal 
integral manifold M that contains any integral 
manifold including x as a submanifold. 


E. Cartan-Káhler Existence Theorems 


Let X be a treal analytic manifold. Denote the 
*sheaf of rings of differential forms on X by 
A(X) and its subsheaf of C(X)-modules of p- 
forms on X by A,(X), 1 x pxn, where Ó(X) is 
the sheaf of rings of 0-forms on X. A subsheaf 
of ideals X is called a differential ideal if it is 
generated by Z,, 0 p n, and contains dX, 
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where Z,— Xf A,(X). Consider a differential 
ideal X on X. Denote the *Grassmann mani- 
fold of p-dimensional subspaces of T;(X) with 
origin xe X by G,(x), and the Grassmann 
manifold | J ex G(x) over X by G,(X). An 
element E, of G,(x) is called a p-dimensional 
contact element with origin x. An element E, 
of G,(x) is called an integral element of X, if 
w({E,)=0 at x for any p-form w in 2; further- 
more, E, is called an integral element of X 

if any element E, contained in E,, 0 € q& p, is 
an integral element of 3. In particular, 0- 
dimensional and 1-dimensional integral ele- 
ments are called integral points and integral 
vectors, respectively. It can be proved that an 
element E, is an integral element of 2 if and 
only if it is an integral element of 3. The polar 
element H(E,) of an integral element E, with 
origin x is defined as the subspace of T,( X) 
consisting of all vectors that generate with E, 
an integral element of X. Let (Z,)°, O& p «n, be 
the subsheaf of @(X)-modules in €(G,(X)) 
consisting of all 0-forms 


a 


joo ER. 
n " TEE PLE TERIS 
I €i, «...«iy n 


on G,(X) derived from a p-form 


a 


1 <i, <...<ipSn 


i ay PX ^ NAX; EL ps 

where z,,...;, is the "Grossmann coordinate of 
E,. An integral element E> is called a regular 
integral element if the following two conditions 
are satisfied: (i) (2,)° is a regular local equation 
of I, at E$, where I, is the set of all integral 
elements of 2; (ii) dim H(E,) — constant around 
E? on 1X,. This definition, due to E. Kahler, is 
different from that given by E. Cartan [4]. 

Here, in general, a subsheaf ® of @(X) is 
called a regular local equation of I4» at an 
integral point x, if there exists a neighborhood 
U of x, and *cross sections $,,...,9, of b on 
U that satisfy the following two conditions: (i) 
do, ...,dqo, are linearly independent at every x 
on U; (ii) a point x of U is an integral point of 
® if and only if g,(x)=...=@,(x)=0. 

First existence theorem. Suppose that we are 
given a p-dimensional integral manifold M 
with a regular integral element 7T,(M) ata 
point x on M. Suppose further that there exists 
a submanifold F of X containing M such that 
dim F=n—t,,,, dim(7,(F)N H(E,))=p +4, 
where E,— T,(M) and t,,, =dim H(E,) — p — 1. 
Then around x there exists a unique integral 
manifold N such that dim N =p + 1 and F> 
NOM. 

This theorem is proved by integrating a 
system of partial differential equations of 
Cauchy-Kovalevskaya type. E. Cartan [2-4] 
also tried to obtain an existence theorem by 
integrating a system of ordinary differential 
equations. 
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A chain of integral elements Ey c E, c ... 
c E, is called a regular chain if each of E, 
(0x p «r)is a regular integral element. For a 
regular chain Ey c E, c ... C E,, define t,,, by 
fun dim H(E,) —p—1, 0€ p «r, and define s, 
by s,—t,—t,,,— 1 (O&p«r), s,=t,, where to 
— dim Xs. Then we have s,2 0 (0x p &r), so 
+...+5,=tg—r, and we can take a local co- 
ordinate system (x4, ..., X, Mi, Ym) m-n—r, 
around E, that satisfies the following four 
conditions: 
(i) IX is defined by y, ue, 

ô 0 


(ii) Hi) - a - 


H 3 
KS Ox, 


= Ym =9; 





ô é | 
E ka i 


O<p<r; 
à 6 
iii E,— EE CIO ND T e 
QI Ze im xl 


(iv) E ,=(0,...,0,0,...,0). 
The integers se, ..., s, are called the characters 
of the regular chain Ej c ...— E,. 

Second existence theorem. Suppose that a 
chain of integral elements Ey c ... c E, is regu- 
lar, and take a local coordinate system satisfy- 
ing (i)- (iv). Consider a system of initial data 


l<p<r; 


nee 
Lora) eeo E (X1) 
Sots, +1 (x, ? X2), Weg E E , x5), 


fi dn pa EC e Sch iud s diss nae s Xp). 


Then if their values and derivatives of the 
first order are sufficiently small, there exists 
a unique integral manifold defined by y, = 
ValX1, X). Vp=O, 1 &ato—r«fim, 
such that 


VAR aba Xp Q, 0,0) = fahxi -3 My) 


$gT sS. XX SSgdusbS, Oszpsr. 


This theorem is proved by successive appli- 
cation of the first existence theorem. These 
two theorems are called the Cartan-Káhler 
existence theorems. Z is said to be involutive at 
an integral element E, if there exists a regular 
chain Ej c ...c E,. An integral manifold pos- 
sessing a tangent space at which Z is involutive 
is called an ordinary integral manifold or ordi- 
nary solution of X. An integral manifold that 
does not possess such a tangent space is called 
a singular integral manifold or singular solution 
of X. 

Cartan's definition of ordinary and regular 
integral elements is as follows: An integral 
point EQ is an ordinary integral point if X, is a 
regular local equation of 1X, at E}. An ordi- 
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nary integral point EN is a regular integral 
point if dim H(E,) is constant on JX) around 
Ej. Inductively, an integral element E? is 
called an ordinary integral element if (2,,)° 
is a regular local equation of IX, at E? and 
E; contains a regular integral element E? ,. 
An ordinary integral element E> is a regular 
integral element (in the sense of Cartan) if 
dim H(E,) is constant on IX, around E It can 
be proved that X is involutive at an integral 
element E, if and only if E, is an ordinary 
integral element of X. An integral manifold 
possessing a tangent space that is a regular 
integral element of X is called a regular in- 
tegral manifold or regular solution of X. Let 
m,,, be the minimal dimension of H(E,), 
where E, varies over the set of p-dimensional 
ordinary integral elements, and g be an integer 
such that m,z p (1 «px g) and m,,, =p. Then 
this integer g is called the genus of X. It is the 
maximal dimension of ordinary integral mani- 
folds of X. However, in general, it is not the 
maximal dimension of integral manifolds of 2. 
D. C. Spencer and others have been trying 
to obtain an existence theorem in the C?- 
category analogous to that of Cartan and 
Kahler. (For a system of linear partial dif- 
ferential equations — [2, 4, 11, 13, 25, 27].) 


F. Involutive Systems of Partial Differential 
Equations 


To give a definition of an involutive system of 
partial differential equations, we define an 
involutive subspace of Hom(V, W), where V 
and W are finite-dimensional vector spaces 
over the real number field R. Let A be a sub- 
space of Hom(V, W). For a system of vectors 
U,, ..., t, In V, A(v;, ...,v,) denotes the sub- 
space of A that annihilates v,,...,v,. Let g, 

be the minimal dimension of A(v,, ..., v) as 
(vis ..., v) varies, where 0< p <r =dim V. A 
basis (v,,...,v,) of V is called a generic basis if 
it satisfies g, — dim A(v,,...,v,) for each p. 
There exists a generic basis for any A. Let W & 
S*(V*) be the subspace of Hom(V, Hom(V, W)) 
consisting of all elements č satisfying €(u)v = 
¢(v)u for any u and v in V. Then the prolonga- 
tion pA of A is defined by pA = Hom(V, A) 
W &S?(V*). For any basis (v,, ...,v,) of V, we 
have the inequality 


dim pA x Y, dim A(v,, ...,v,). 
p=90 


The subspace A is called an involutive subspace 
of Hom(V, W) if dim pA = 2-0 9. This notion 
of an involutive subspace was obtained by V. 
W. Guillemin and S. Sternberg [13]. 

A triple (X, N; x) consisting of two mani- 
folds X, N and a projection zx from Xonto N is 
called a fibered manifold if the tdifferential 7, 
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is surjective at every point of X. Take the set 
of all mappings f from a domain in N to X 
satisfying zo f — identity for a fibered manifold 
(X, N;n). Then an tl-jet j!( f) is an equivalence 
class under the equivalence relation defined as 
follows: j!( f) 2ji(g) if and only if x =u, f(x) = 
glu), and : 

eege Gite ti 


g 
Eeer 
ôxi KEE 


Oxit... Oxi 
i, +... +i,<l, where (x,,...,x,) is a local co- 
ordinate system of N around x=u (— 105 
Differentiable Manifolds X). 

Denote the space of all l-jets of a fibered 
manifold (X, N; n) by J'(X, N; x) or simply J’. 
Then a subsheaf of ideals ® in €(J!) is called a 
system of partial differential equations of order 
l on N. A point z of J' is called an integral 
point of ® if (z) «0 for all oe. The set of all 
integral points of is denoted by /®. Let zl be 
the natural projection of J' onto J' !. Then at 
a point z of J’, we can identify Ker zl with 
Hom(T;(N), Ker zi), where x «xz ...n'z. The 
principal part C,(®) of is defined as the sub- 
space of Kerni that annihilates ®. The pro- 
longation p® of ® is defined as the system 
of order l+ 1 on N generated by and 6,9, 

I «k «dim N, where ô, is the formal derivative 
with respect to a coordinate x, of N: 


o 
(0,9) GE Cf Is 9S ) pe OJ). 


Let w be an integral point of pb and z be 


1'*! w, Then we have the identity 


pC.(0)— C, (po). 


The following definition of an involutive 
system is due to M. Kuranishi [19]: ® is invo- 
lutive at an integral point z if the following two 
conditions are satisfied: (1) is a regular local 
equation of [6 at z; (ii) there exists a neighbor- 
hood U of z in J’ such that (x'*!) Dip 
forms a fibered manifold with base U f1I and 
projection zl"). 

A system of partial differential equations is 
said to be involutive (or involutory) if it has an 
integral point at which it is involutive. Fix a 
system of independent variables ( y,, ..., yy) in 
X. Then a system of differential forms is said 
to be involutive (or involutory) if it has an in- 
tegral element at which it is involutive and 
dy, A...Adyy Z0. It can be proved that these 
two definitions of involutive system are equiva- 
lent [19,25]. 


G. Prolongation Theorems 


Cartan gave a method of prolongation by 
which we can obtain an involutive system 
from a given system with two independent 
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variables, if it has a solution. He proposed the 
following problem: For any r2, construct a 
method of prolongation by which we can 
obtain an involutive system from a given sys- 
tem with r independent variables, if it has a 
solution. To solve this problem, Kuranishi 
prolonged a given system ® successively to 
p, t=1, 2, 3, ..., and proved the following 
theorem: Suppose that there exists a sequence 
of integral points z' of p' with z'*^'z'«z' , 

t — 1, 2, 3, ..., that satisfies the following two 
conditions for each t: (i) p'® is a regular local 
equation of I(p'®) at z^ (ii) there exists a 
neighborhood Hi of z' in J(p'«b) such that 
x'*' yV" contains a neighborhood of z' ! in 

I(p' ! 4) and forms a fibered manifold 

(V! x^! V5 n'*!). Then p'9 is involutive at z' 
for a sufficiently large integer t. 

This prolongation theorem gives a powerful 
tool to the theory of finfinite Lie groups. How- 
ever, if we consider a system of partial dif- 
ferential equations of general type, there exist 
examples of systems that cannot be prolonged 
to an involutive system by this prolongation, 
although they have a solution. To improve 
Kuranishi's prolongation theorem, M. Mat- 
suda [22] defined the prolongation of the 
same order by pp®=p®N O(J') for a system ® 
of order L This is a generalization of the classi- 
cal method of completion given by Lagrange 
and Jacobi. Applying this prolongation suc- 
cessively to a given system ®, we have ¥ = 

5-1 po. Define the p,-operation by p, = 
Ux pSp. Then applying this prolongation 
successively to WH. we have the following theo- 
rem: suppose that there exists a sequence of 
integral points z‘ of p V with zx'*'z'—z' 1, 

t — 1, 2, 3, ..., that satisfies the following two 
conditions for each t: (i) p, ' is a regular local 
equation of (p, VP) at z^; (ii) dim pC(p; V) is 
constant around z' on (p; V). Then p; P is 
involutive at z' for a sufficiently large integer t. 

To prove this theorem Matsuda applied the 
following theorem obtained by V. W. Guil- 
lemin, S. Sternberg, and J.-P. Serre [25, ap- 
pendix]: suppose that we are given a sub- 
space A, of Hom(V, W) and subspaces A, of 
Hom(V, A, ,) satisfying A,C pA,_,, t=1, 2, 

3, .... Then A, is an involutive subspace of 
Hom(V, A, 4) for a sufficiently large integer t. 
Thus Cartan's problem was solved affirma- 
tively. To the generalized Pfaff problem these 
prolongation theorems give another solution, 
which differs from that obtained by Riquier. 


H. Pfaffian Systems in the Complex Domain 
Consider a linear system of Pfaffian equations 


n m 
du;— 2 2. as (x)u;dx,, i=1,...,m, 
Ka 


k=1 


428 H 
Total Differential Equations 


where x —(x,, ...,x,) is a local coordinate of a 
complex manifold X and aj; are meromor- 
phic functions on X. If we put uz'(u;, ...,u,) 
and A" (x) — (a5 (x)), k=1,...,n, the system is 
written as 


du= ( y A*(x) ix) u. (9) 
k=1 


System (9) is completely integrable if and only 
if 
04A) dA! 


d'H 


j l=1,...,7. 

Suppose that (9) is completely integrable. If the 
A* (x) are holomorphic at x? —(x$, ..., x9)e X, 
there exists for any u? e C" one and only one 
solution of (9) that is holomorphic at x? and 
satisfies u(x?) 2 u?. This implies that the solu- 
tion space of (9) is an m-dimensional vector 
space; the basis of thís space is called a funda- 
mental system of solutions. Therefore any 
solution is expressible as a linear combination 
of a fundamental system of solutions and can 
be continued analytically in a domain where 
the A*(x) are holomorphic. A subvariety of X 
that is the pole set of at least one of the A*(x) 
is called a singular locus of (9), and a point on 
a singular locus is called a singular point. 

R. Gérard has given a definition of regular 
singular points and an analytic expression of a 
fundamental system of solutions around a 
regular singular point, and he studied systems 
of Fuchsian type [8; also 9, 30]. 

Let Q= $1., A*(x)dx,. Then the system (9) 
can be rewritten as 


(d—Q)u=0. 


If we consider a local coordinate (x, u) of a 
fiber bundle over X, the operator d—Q in- 
duces a meromorphic linear connection V over 
X. Starting from this point of view, P. Deligne 
[5] introduced several important concepts and 
obtained many results. 

The first results for irregular singular points 
were obtained by Gérard and Y. Sibuya [10], 
and H. Majima [20] studied irregular singular 
points of mixed type. 

The systems of partial differential equations 
that are satisfied by the hypergeometric func- 
tions of several variables are equivalent to 
linear systems of Pfaffian equations [1]. This 
means that such systems of partial differential 
equations are fholonomic systems. M. Kashi- 
wara and T. Kawai [15] studied holonomic 
systems with regular singularities from the 
standpoint of microlocal analysis. Special 
types of holonomic systems were investigated 
by T. Terada [28] and M. Yoshida [29]. 

Consider a system of Pfaffian equations 


os, jHl...,r, (10) 


428 Ref. 
Total Differential Equations 


where c; — Lif =, ag (x)dx, and x ^(x,, ..., x,). 
Suppose that aj, are holomorphic in a domain 
D of C" and that dw, Aw, A...A@,=0 in D. 
Denote by S the zero set of w, A...Aw,=0.A 
point of S is called a singular point of (10). If 
the codimension of S is z 1, then system (10) ts 
completely integrable in D — S. The following 
theorem was proved by B. Malgrange [21 ]: 
Let x°eS, and suppose that the codimen- 
sion of S is 23 around x; then there exist 
functions f, j— 1, ...,r, and gy, , k= Ll, ...,r, 
that are holomorphic at x? and satisfy oe 
Lia gy df, and det(gy(x°)) 40. 
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429 (XI.6) 
Transcendental Entire 
Functions 


A. General Remarks 


An entire function (or integral function) f(z) is a 
complex-valued function of a complex variable 
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z that is holomorphic in the finite z-plane, 

z *- oc. If f(z) has a pole at oo, then f(z) is a 
polynomial in z. A polynomial is called a 
rational entire function. If an entire function is 
bounded, it is constant (*Liouville's theorem). 
A transcendental entire function is an entire 
function that is not a polynomial, for example, 
eXpz, sin z, cosz. An entire function can be 
developed in a power series 3:79 a,z" with 
infinite radius of convergence. If f(z) is a tran- 
scendental entire function, this is actually an 
infinite sertes. 


B. The Order of an Entire Function 


If a transcendental entire function f(z) has a 
zero of order m (mz 0) at z=0 and other zeros 
at Xis Wives Apm ee ess E 
oo), multiple zeros being repeated, then f(z) 
can be written in the form 


f(z) =el z" I] ( -2) e?) 

k=1 Xy 
where g(z) is an entire function, g,(z) — (z/a,) + 
(1/2)(z/o)" + (1/3) (2/04)? +... - (1/p,) (2/04), 
and p,, p;, ... are integers with the property 
that Eg [z/a,|"**! converges for all z (Weier- 
strass's canonical product). 

E. N. Laguerre introduced the concept of 
the genus of a transcendental entire function 
f(z). Assume that there exists an integer p for 
which Eol] +P converges, and take the 
smallest such p. Assume further that in the 
representation for f(z) in the previous para- 
graph, when p, = p; — ...— p, the function g(z) 
reduces to a polynomial of degree q; then 
max(p,q) is called the genus of f(z). For tran- 
scendental entire functions, however, the order 
is more essential than the genus. The order p 
of a transcendental entire function f(z) is de- 
fined by 


p -limsuploglog M (r)/log r, 

where M (r) is the maximum value of | f(z)| on 
|z| 7r. By using the coefficients of f(z) 2 X a,z" 
we can write 


E 


p=limsupnlogn/log(1/|a,|). 
The entire functions of order 0, which were 
studied by Valiron and others, have prop- 
erties similar to polynomials, and the en- 
tire functions of order less than 1/2 satisfy 
lim, —.,,minj,)-,, | /(z)] = oo for some increasing 
sequence r, [20 (Wiman's theorem). Hence 
entire functions of order less than 1/2 cannot 
be bounded in any domain extending to infin- 
ity. Among the functions of order greater than 
1/2 there exist functions bounded in a given 
angular domain D:x «argz « o n/u. Vf | f(z)| 
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« expr^ (p « u) and f(z) is bounded on the 
boundary of D, then f(z) is bounded in the 
angular domain (— 272 Meromorphic Func- 
tions). In particular, if the order p of f(z) is an 
integer p, then it is equal to the genus, and 
g(z) reduces to a polynomial of degree <p (J. 
Hadamard). These theorems originated in the 
study of the zeros of the fRiemann zeta func- 
tion and constitute the beginning of the theory 
of entire functions. 

There is some difference between the prop- 
erties of functions of integral order and those 
of others. Generally, the point z at which f(z) 
—w is called a w-point of f(z). If {z,} consists 
of w-points different from the origin, the in- 
fimum p, (w) of k for which X 1/|z,|* converges 
is called the exponent of convergence of f — w. 
If the order p of an entire function is integral, 
then p,(w)— p for each value w with one pos- 
sible exception, and if p is not integral, then 
p1(w) — p for all w (É. Borel). Therefore any 
transcendental entire function has an infinite 
number of w-points for each value w except for 
at most one value, called an exceptional value 
of f(z) (Picard's theorem). In particular, f(z) 
has no exceptional values if p is not integral. 
For instance, sinz and cosz have no excep- 
tional values, while e7 has 0 as an exceptional 
value. Since transcendental entire functions 
have no poles, oo can be counted as an excep- 
tional value. Then we must change the state- 
ment in Picard's theorem to "except for at 
most two values." Since the theorem was ob- 
tained by E. Picard in 1879, problems of this 
type have been studied intensively (— 62 
Cluster Sets, 272 Meromorphic Functions). 

After Picard proved the theorem by using 
the inverse of a *modular function, several 
alternative proofs were given. For instance, 
there is a proof using the Landau-Schottky 
theorem and *Bloch's theorem and one using 
*normal families. Picard's theorem was ex- 
tended to meromorphic functions and has also 
been studied for analytic functions defined in 
more general domains. There are many fully 
quantitative results, too. For instance, Valiron 
[3] gave such results by performing some 
calculations on neighborhoods of points where 
entire functions attain their maximum ab- 
solute values. 

Thereafter, the distribution of w-points in a 
neighborhood of an essential singularity was 
studied by many people, and in 1925 the Ne- 
vanlinna theory of meromorphic functions was 
established. The core of the theory consists of 
two fundamental theorems, *Nevanlinna's first 
and second fundamental theorems (— 272 
Meromorphic Functions). Concerning com- 
posite entire functions F(z) — f(g(z)), Pólya 
proved the following fact: The finiteness of the 
order of F implies that the order of f should 
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be zero unless g is a polynomial. This gives 
the starting point of the factorization theory, 
on which several people have been working 
recently. Several theorems in the theory of 
meromorphic functions can be applied to 

the theory. One of the fundamental theorems 
is the following: Let F(z) be an entire func- 
tion, which admits the factorizations F(z) — 
P.(fn(Z)) with a polynomial P, of degree m 
and an entire function f, for all integers m. 
Then F(z) - Acos,/ H(z) *- B unless F(z)= 

A exp H(z)+ B. Here, H is a nonconstant entire 
function and A, B are constant, A40. 


C. Julia Directions 


Applying the theory of tnormal families of 
holomorphic functions, G. Julia proved the 
existence of Julia directions as a precise form 
of Picard's theorem [5]. A transcendental 
entire function f(z) has at least one direction 
argz — H such that for any e» 0, f(z) takes on 
every (finite) value with one possible exception 
infinitely often in the angular domain 0 —2 < 
argz «0-F e. This direction argz — 0 is called 

a Julia direction of f(z). 


D. Asymptotic Values 


tAsymptotic values, 'asymptotic paths, etc., 
are defined for entire functions as for mero- 
morphic functions. In relation to tIversen’s 
theorem and *Gross's theorem for inverse 
functions and results on ‘cluster sets, tordinary 
singularities of inverse functions hold for entire 
functions in the same way as for meromorphic 
functions. Also, as for meromorphic functions, 
*transcendental singularities of inverse func- 
tions are divided into two classes, the tdirect 
and the findirect transcendental singularities. 
The exceptional values in Picard’s theorem 
are asymptotic values of the functions, and 
oo is an asymptotic value of any transcenden- 
tal entire function. Therefore f(z) oo along 
some curve extending to infinity. Between 
the asymptotic paths corresponding to two 
distinct asymptotic values, there is always an 
asymptotic path with asymptotic value oo. By 
tBloch’s theorem, A. Bloch showed that the 
tRiemann surface of the inverse function of a 
transcendental entire function contains a disk 
with arbitrarily large radius. Denjoy conjec- 
tured in 1907 that ue 2p, where p is the order 
of an entire function and y is the number of 
distinct finite asymptotic values of the func- 
tion, and L. V. Ahlfors gave the first proof 
(1929). This result contains Wiman's theorem. 
There are transcendental entire functions with 
p= 2p. It was shown by W. Gross that among 
entire functions of infinite order there exists 
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an entire function having every value as its 
asymptotic value. 
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A. History 


A complex number a is called a transcendental 
number if x is not *algebraic over the field of 
rational numbers Q. C. Hermite showed in 
1873 that e is a transcendental nuraber. Fol- 
lowing a similar line of thought as that taken 
by Hermite, C. L. F. Lindemann showed that 
7 is also transcendental (1882). Among the 23 
problems posed by D. Hilbert in 1900 (— 196 
Hilbert), the seventh was the problem of estab- 
lishing the transcendence of certain numbers 
(e.g., 27^). This stimulated fruitful investiga- 
tions by A. O. Gel'fond, T. Schneider, C. L. 
Siegel, and others. The theory of transcen- 
dental numbers is, however, far from complete. 
There is no general criterion that can be uti- 
lized to characterize transcendental numbers. 
For example, neither the transcendence nor 
even the irrationality of the *Euler constant 

C «lim, ,,, (14- 1/2 ... - 1/n—1log a) has been 
established. A survey of the development of 
the theory of transcendental numbers can be 
found in [18], in which an extensive list of 
relevant publications up to 1966 is given. 


B. Construction of Transcendental Numbers 


Let Q be the field of talgebraic numbers. Sup- 
pose that a is an element of Q that satisfies the 
irreducible equation f(x) 2 agx" +a, x" ! +... 
t a, —0, where the a; are rational integers, 

ay #0, and ag, di, a, have no common 
factors. Then we define H (a) to be the maxi- 
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mum of |a;| (i—0, ..., n) and call it the height 
of x. J. Liouville proved the following theorem 
(1844): Let € be a real number (č ¢ Q). If 
inf{q"|é — p/q| | p/qeQ} =0 for any positive 
integer n, then € is transcendental. 

Transcendental numbers having this prop- 
erty are called Liouville numbers. Examples 
are: (i) £2 3:549 ", where g is an integer not 
smaller than 2. (ti) Suppose that we are given 
a sequence {n,' of positive integers such that 
n, — 0 (k— oo). Let ë be the real number 
expressed as an "infinite simple continued frac- 
tion ba + 1/b, 3- 1/b, + .... Let B; be the denomi- 
nator of the lth *convergent of the continued 
fraction. If b, zb ? for k> 1, then € is a 
Liouville number. 

On the other hand, K. Mahler [8,9] proved 
the existence of transcendental numbers that 
are not Liouville numbers. For example, he 
showed that if f(x) ts a nonconstant inte- 
gral polynomial function mapping the set 
of positive integers into itself, then a number 
č expressed, e.g., in the decimal system as 
Oy, y; ya... is such a number if we put y, = f(n), 
n— 1, 2, 3, .... (In particular, from f(x)— x we 
get the non-Liouville transcendental number E 
=0.123456789101112....) Mahler proved this 
result by using *Roth's theorem (1955) (— 182 
Geometry of Numbers). Both Liouville and 
Mahler utilized the theory of *Diophantine 
approximation to construct transcendental 
numbers. 

On the other hand, Schneider [10-12] and 
Siegel [3] constructed transcendental num- 
bers using certain functions. Examples are: 
expa (xeQ, a z0y x? (xeQ, «#0, 1; BEQ—Q); 
J (1), where J is the *modular function and c 
is an algebraic number that is not contained 
in any imaginary quadratic number field; 
yo(2ni/x), where g is the Weierstrass O- 
function, xe Q, and «40; and B(p, q), where B 
is the "Beta function and p, qe Q— Z. 

Since e z expl and | =exp 2zi, the tran- 
scendence of e and z is directly implied by the 
transcendence of expa (xeQ, a0). 


C. Classification of Transcendental Numbers 


(1) Mahler's classification: Given a complex 
number £ and positive integers n and H, we 
consider the following: 


n 


Y ao” 


v=0 


a,c Z, 


w,(H, EH 











la,| € H, >) 401. 
v=0 


w,,(¢) SW lim sup( ES log w,(H, ¢)/log H), 


Hoax 


w(¢)=w=lim sup w,(¢)/n, 


noo 
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and let u — the first number n for which w, is 
oc. Then we have the following four cases: 
(i) w=0, p= o6; (ii) 0<w < oo, p= o0; (iii) w= 
u= 96; (iv) w= œ, u< œ, corresponding to 
which we call £ an A-number, S-number, T- 
number, or U-number. The set of A-numbers 
is denoted by A, and similarly we have the 
classes S, T, and U. It is known that A = Q. 
If two numbers č and y are talgebraically 
dependent over Q, then they belong to the 
same class. If € belongs to S, the quantity 0(£) 
=sup{w,(é)/n|n=1,2,...} is called the type of 
€ (in the sense of Mahler). Mahler conjectured 
that almost all transcendental numbers (except 
a set of Lebesgue measure zero) are S-numbers 
of the type 1 or 1/2 according as they belong 
to R or not. Various results were obtained 
concerning this conjecture (W. J. LeVeque, J. 
F. Koksma, B. Volkmann) until it was proved 
by V. G. Sprindzhuk in 1965 [14,15]. The 
existence of T-numbers was proved by W. M. 
Schmidt (1968) [16]. All Liouville numbers are 
U-numbers [7]. On the other hand, logg (xe Q, 
x 2-0, «#1) and z are transcendental numbers 
that do not belong to U. 

(2) Koksma's classification: For a given 
transcendental number č and positive numbers 
n and H, we consider the following: 


w*(H, £) 2 min(|£—a||xeQ, 
H(x)« H, [Q(a):Q]<n}, 
w*(€) = wt =lim sup(—log(Hw*(H, £))/log H), 


H^ 


w*(¢) = w* =lim sup wr (¢)/n, 

and let u* =the first number n for which w* is 
oo. Then we have the following three cases: (1) 
w* < oo, u* = 00; (ii) w* = u* = oo; (iii) w* = oo, 
u* < oo. We call ë an S*-number, T*-number, 
or U*-number according as (i), (1i), or (iii) holds 
and denote the set of St-numbers by S*, etc. If 
č belongs to S*, we call 0*(€)=sup{w*(é)/n|n 
— 1,2, ...] the type of £ (in the sense of Kok- 
sma). It can be shown that S=S*, T=T*, and 
U=U*, and that if Zeg, then 0*(£) «0(£) « 
0*(£)-- 1. 


D. Algebraic Independence 


Concerning the algebraic relations of tran- 
scendental numbers, we have the following 
three principal theorems: 

(1) Let o, ..., 2, be elements of Q that 
are linearly independent over Q. Then 
expa,, ..., expo, are transcendental and alge- 
braically independent over Q (Lindemann- 
Weierstrass theorem). 

(2) Let Jil be the *Bessel function and « a 
nonzero algebraic number. Then Jual and 
Jual are transcendental and algebraically 
independent over Q (Siegel). 
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(3) Let 2,,...,o, be nonzero elements of Q 
such that log», ..., log, are linearly inde- 
pendent over Q. Then 1, loga,,..., logs, are 
linearly independent over Q (A. Baker). 

Besides these theorems, various related 
results have been obtained by A. B. Shidlovskii, 
Gel'fond, N. I. Fel'dman, and others. A quanti- 
tative extension of theorem (3), also by Baker, 
will be discussed later. 

First we give more detailed descriptions 
of theorems (1) and (2). Let o, ..., o, be 
as in theorem (1), sz [Q(a,. ...,9,):Q]. 
P(X,,...,X,,) be an arbitrary polynomial in 
Q[X,, ..., Xm] of degree n, and H(P) be the 
maximum of the absolute values of the coeffi- 
cients of the polynomial P. Then there exists a 
positive number C determined only by the 
numbers %,,...,%,_, and n(— deg P) such that 


—2s{2(2smnt mtn) -,) 


|P(e^:, ..., e*n)| > CH(P) 

In particular, if x is a nonzero algebraic 
number, then expa belongs to S and 0(expa) 
«85? 4 6s. 

(2) Let x be a nonzero algebraic number, 
s=[Q(z):Q], PeQ[X,, X,], deg P =n. Then 
there exists a positive number C determined 
only by « and n such that | P(Jo(a), Jo(2))| > 
CH(P) -82sin? 

Theorems (1) and (2) are actually special 
cases of a theorem obtained by Siegel. To state 
this theorem, the following terminology is 
used: An entire function f(z) Änt z"/n! 
is called an E-function defined over an *alge- 
braic number field K of finite degree if the fol- 
lowing three conditions are satisfied: (i) C, c K 
(n=O, 1,2, ...). (ti) For any positive number e, 
C, = O(n*"). (iit) Let q, be the least positive 
integer such that C,q, belongs to the ring © of 
algebraic integers in K (0 n, O xk <n). Then 
for an arbitrary positive number e, q, — O(n*"). 

A system { fi(z), .... f, (z)] of E-functions 
defined over K is said to be normal if it satis- 
fies the following two conditions: (1) None of 
the functions f;(z) is identically zero. (ii) If the 
functions w, = f,(z) (k= 1, ..., m) satisfy a sys- 
tem of thomogeneous EE "differential equa- 
tions of the first order, then w, = Af Qu(z)wy. 
where the Q,,(z) are rational functions of z, 
with coefficients in the ring ©. The matrix (Q,) 
can be decomposed by rearranging the order 
of the indices k, l if necessary into the form 


Ww. 0 
0 "WI 


where 


Qi ne n 


W-|.. , 1<t<r, ym -m. 


=1 
M t E Met ) 
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The decomposition is unique if we choose r 
as large as possible, in which case we call 
W, ..., W, the primitive parts of (Q,;). The 
requirement ts that the primitive parts W, are 
independent in the following sense: If there are 
numbers Ce K and polynomial functions 

z)e K [z] such that 


r P.) 
St (Cir Cm W, : = 0, 
SS P 


NS 
then C, —0, P,,(z) 2 0. 
Let N be a positive integer. A normal sys- 


tem { f;(z), .... f, (z)] of E-functions is said to 
be of degree N if the system {Fyn (z)— 


Ai"... fuz y mIn;z0, LF, nx NN] i is also 


a nog] system of E-functions. Then the 
theorem obtained by Siegel [4] is: Let N be an 
arbitrary positive integer and | fi(z), .... f, (z)j 
be a normal system of E-functions of degree 
N defined over an algebraic number field of 
finite degree K satisfying the system of differen- 
tial equations f;(z) 2 X72, Q,(z)fi(z), where 
Quale O(z), | «kx m. If x 1s a nonzero alge- 
braic number that 1s not a tpole of any one 
of the functions Q,,(z), then f, (x), ..., f, (x) are 
transcendental numbers that are algebraically 
independent over the field Q. 

Theorem (3) at the beginning of this sec- 
tion implies, for example, the following: (1) If 
Xj, ..., 0, and fl, ..., f, all belong to Q and 
v=o; SH +...+a,log f, 40, then y is tran- 
scendental. (it) Was -3 Xn Pos Pigs Bn ate 
nonzero algebraic numbers, then e^o... afn 
is transcendental. (iii) If à, ..., , are alge- 
braic numbers other than 0 and 1. and f}, 

.., f, also belong to Q, with 1, f,...., fj, 
linearly independent over Q, then a: ... xn 
is transcendental. 

Baker [17] also obtained a quantitative 
extension of theorem (3): Suppose that we 
are given integers A >4, d 24 and nonzero 
algebraic numbers %,,...,%, (nz 2) whose 
heights and degrees do not exceed A and d, 
respectively. Suppose further that 0 «à « 1, 
and let loga;, ..., logo, be the principal values 
of the logarithms. If there exist rational in- 
tegers b,,...,5, with absolute value at most H 
such that 


0 «|b, loga; +... +b, loga, <e’, 
then 
H «(4*5 1g?" log A)2"*1*. 


This theorem has extensive applications in 
various problems of number theory, including 
a wide class of *Diophantine problems [19]. 

A number of new, interesting results on the 
algebraic independence of values of exponen- 
tial functions, elliptic functions, and some 
other special functions have been obtained 
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recently by D. Masser, G. V. Chudnovskii, M. 
Waldschmidt, and other writers. In particular, 
Chudnovskii (1975) obtained the remarkable 
result that L'(1/3) and T(1/4) are transcenden- 
tal numbers. See [20-24]. 
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A. Topological Transformation Groups 


Let G be a group, M a set, and f a mapping 
from G x M into M. Put f(g, x) 2 g(x) (ge G, 
x€ M). Then the group G is said to be a trans- 
formation group of the set M if the following 
two conditions are satisfied: (i) e(x) 2 x (xe M), 
where e is the identity element of G; and (ii) 
(gh)(x) 2 g(h(x)) (xe M) for any g, heG. In 

this case the mapping x g(x) is a one-to-one 
mapping of M onto itself. 

Let G be a transformation group of M. If G 
is a topological group, M a topological space, 
and the mapping (g, x) 5g(x) a continuous 
mapping from G x M into M, then G is called 
a topological transformation group of M. In 
this case x g(x) is a homeomorphism of M 
onto itself. The mapping (g, x) g(x) is called 
an action of G on M. The space M, together 
with a given action of G, is called a G-space. 

For a point x of M, the set G(x)— (g(x)] 
g€ Gj is called the orbit of G passing through 
the point x. Defining as equivalent two points 
x and y of M belonging to the same orbit, we 
get an equivalence relation in M. The quotient 
space of M by this equivalence relation, de- 
noted by M/G, is called the orbit space of 
G-space M. 

If G(x) « {x}, then x is called a fixed point. 
The set of all fixed points is denoted by MT. 
For a point x of M, the set G, - ígeG|g(x)— 
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xj is a subgroup of G called the isotropy 
subgroup (stabilizer, stability subgroup) of G at 
the point x. A conjugacy class of the subgroup 
G, 1s called an isotropy type of the transforma- 
tion group G on M. 

The group G is said to act nontrivially (resp. 
trivially) on M if M # M* (resp. M = M9). The 
group G is said to act freely on M if the iso- 
tropy subgroup G, consists only of the identity 
element for any point x of M. 

The group G is said to act transitively on M 
if for any two points x and y of M, there exists 
an element géG such that g(x)= y. 

Let N be the set of all elements ge G such 
that g(x) — x for all points x of M. Then N isa 
normal subgroup of G. If N consists only of 
the identity element e, we say that G acts effec- 
tively on M, and if N is a discrete subgroup of 
G, we say that G acts almost effectively on M. 
When N # (ej, the quotient topological group 
G/N acts effectively on M in a natural fashion. 

An equivariant mapping (equivariant map) 
(or a G-mapping, G-map) ^: X 5 Y between G- 
spaces 1s a continuous mapping which com- 
mutes with the group actions, that is, h(g(x) — 
g(h(x)) for all ge G and xe X. An equivariant 
mapping which is also a homeomorphism is 
called an equivalence of G-spaces. 

For a G-space M, an equivalence class of the 
G-spaces G(x), xe M, is called an orbit type of 
the G-space M. 


B. Cohomological Properties 


We consider only tparacompact G-spaces and 
tČech cohomology theory in this section. We 
shall say that a topological space X is finitistic 
if every open covering has a finite-dimensional 
refinement. The following theorems are useful 
[1-3]. 

(1) If G is finite, X a finitistic paracompact 
G-space, and K a field of characteristic zero or 
prime to the order of G, then the induced 
homomorphism z*: H*(X/G; K) > H*(X; Km 
is an isomorphism. Here, z is a natural projec- 
tion of X onto X/G. The group G acts natu- 
rally on H*(X; K), and H*(X; KI denotes the 
fixed-point set of this G-action. 

(2) Let X be a finitistic G-space and G cyclic 
of prime ordor p. Then, with coefficients in 
Z/pZ, we have 


(a) for each n S rank H'(X9) « Y rank H'(X), 


(b  x(X)+(p— 1) x(X%) = px(X/G). 


Here the *Euler-Poincaré characteristics y( ) 
are defined in terms of mod p cohomology. 
(3) Smith's theorem: If G is a p-group (p 
prime) and if x ts a finitistic G-space whose 
mod p cohomology is isomorphic to the n- 








1634 


sphere, then the mod p cohomology of the 
fixed-point set X° is isomorphic to that of the 
r-sphere for some —1 <r <n, where (—1)- 
sphere means the empty set. 

(4) Let T* denote the k-dimensional toral 
group. Let X be a T*-space whose rational 
cohomology is isomorphic to the n-sphere, and 
assume that there are only a finite number of 
orbit types and that the orbit spaces of all 
subtori are finitistic. Let H be a subtorus of T*. 
Then by the above theorem the rational coho- 
mology of X is isomorphic to that of the 
r(H)-sphere for some —1 <r(H)<n. Assume 
further that there is no fixed point of the T*- 
action. Then, with H ranging over all subtori 
of dimension k — 1, we have 


n4 1- Y (r(H) 4-1). 


C. Differentiable Transformation Groups 


Suppose that the group G is a transformation 
group of a tdifferentiable manifold M, G is a 
*Lie group, and the mapping (g, x) g(x) of 

G x M into M is a differentiable mapping. 
Then G is called a differentiable transformation 
group (or Lie transformation group) of M, and 
M is called a differentiable G-manifold. 

The following are basic facts about compact 
differentiable transformation groups [3, 4]: 

(5) Differentiable slice theorem: Let G be a 
compact Lie group acting differentiably on a 
manifold M. Then, by averaging an arbitrary 
*Riemannian metric on M, we mav have a G- 
invariant Riemannian metric on M. That is, 
the mapping x—>g(x) is an tisometry of this 
Riemannian manifold M for each géG. For 
each point xe M, the orbit G(x) through x is a 
compact submanifold of M and the mapping 
g g(x) defines a G-equivariant diffeomor- 
phism G/G, = G(x), where G/G, is the left quo- 
tient space by the isotropy subgroup G,. G, 
acts orthogonally on the ttangent space 7, M 
at x (resp. the tnormal vector space N, of the 
orbit G(x)); we call it the isotropy representa- 
tion (resp. slice representation) of G, at x. Let E 
be the tnormal vector bundle of the orbit G(x). 
Since G acts naturally on E as a bundle map- 
ping, the bundle E is equivalent to the bundle 
(G x N,J/G, over G/G, as a *G-vector bundle, 
where G, acts on N, by means of the slice 
representation and G, acts on G bv the right 
translation. We can choose a small positive 
real number e such that the texponential 
mapping gives an equivariant *diffeomorphism 
of the e-disk bundle of E onto an invariant 
*tubular neighborhood of G(x). 

(6) Assume that a compact Lie group G acts 
differentiably on M with the orbit space M* = 
M/G connected. Then there exists a maximum 
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orbit type G/H for G on M (Le., H is an iso- 
tropy subgroup and H is conjugate to a sub- 
group of each isotropy group). The union Mm 
of the orbits of type G/H is open and dense in 
M, and its image Mif, in M* is connected. 

The maximum orbit type for orbits in M 
guaranteed by the above theorem is called the 
principal orbit type, and orbits of this type are 
called principal orbits. The corresponding 
isotropy groups are called principal isotropy 
groups. Let P be a principal orbit and Q any 
orbit. If dim P > dim Q, then Q is called a sin- 
gular orbit. If dim P 2 dim Q but P and Q are 
not equivalent, then Q is called an exceptional 
orbit. 

(7) Let G be a compact Lie group and M a 
compact G-manifold. Then the orbit types are 
finite in number. 

By applying (5) and (6) we have that an iso- 
tropy group is principal if and only if its slice 
representation is trivial. 

The situation is quite different in the case 
of noncompact transformation groups. For 
example, there exists an analytic action of 
G=SL(4,R) on an analytic manifold M such 
that each orbit of G on M is closed and of 
codimension one and such that, for x, ye M, 
G, is not isomorphic to G, unless x and y lie 
on the same G-orbit [5]. 


D. Compact Differentiable Transformation 
Groups 


Many powerful techniques in ‘differential 
topology have been applied to the study of 
differentiable transformation groups. For 
example, using the techniques of *surgery, we 
can show that there are infinitely many free 
differentiable circle actions on *homotopy 

(2n + t)-spheres (n z 3) that are differentiably 
inequivalent and distinguished by the rational 
*Pontryagin classes of the orbit manifolds 

(W. C. Hsiang [6]). Also, using *Brieskorn 
varieties, we can construct many examples of 
differentiable transformation groups on homo- 
topy spheres [3,4, 7]. Differentiable actions 
of compact connected Lie groups on homol- 
ogy spheres have been studied systematically 
(Hsiang and W. Y. Hsiang [4]). 

The Atiyah-Singer *index theorem has 
many applications in the study of transfor- 
mation groups. The following are notable 
applications: 

(8) Let M be a compact connected *oriented 
differentiable manifold of dimension 4k with 
a fspin-structure. If a compact connected 
Lie group G acts differentiably and nontriv- 
ially on M, then the A-genus / (M),[M]» 
of M vanishes (where .o/(M) denotes the +Ã- 
characteristic class of M) (M. F. Atiyah and F. 
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Hirzebruch [8], K. Kawakubo [9]). For fur- 
ther developments, see A. Hattori [10]. 

(9) Let M be a closed oriented manifold with 
a differentiable circle action. Then each con- 
nected component F, of the fixed point set 
can be oriented canonically, and we have 


HM) IU. 


where /( ) denotes the *Thom-Hirzebruch 
index [8,9]. 

Let G be a compact Lie group and G^ 
EG- BG the tuniversal G-bundle. Then the 
*singular cohomology H*(EG x , X) is called 
equivariant cohomology for a G-space X and is 
an H*(BG)-module. Let G — U(1), M a dif- 
ferentiable U(1)-manifold, F = M*, and i: F^ 
M the inclusion mapping. Then the ‘localiza- 
tion of the induced homomorphism 


S^'i*:S H*(EG x M)A S ! H*(BG x F) 


is an isomorphism, where S~! denotes the 
localization with respect to the multiplicative 
set S= {at*} with a, k ranging over all posi- 
tive integers and t the generator of H?(BG). 
Theorems (8) and (9) can be proved by the 
above localization isomorphism. 

Let M be a differentiable manifold. The 
upper bound N(M) of the dimension of all the 
compact Lie groups that acts effectively and 
differentiably on M is called the degree of 
symmetry of M. It measures, in some crude 
sense, the symmetry of the differentiable mani- 
fold M. The number N(M) depends heavily on 
the differentiable structure. For example, 
N(S") — m(m + 1)/2 for the standard m-sphere, 
but N(Z") « (m -- 1)?/16 +5 for a thomotopy 
m-sphere (m z 300) that does not bound a 
*x-manifold [11]. Also, N(P,(C)) 2 n(n + 2) 
for the complex projective n-space P, (C), but 
N(hP,(C)) « (n - 1)(n 4- 2/2 for any homotopy 
complex projective n-space hP,(C) (nz 13) 
other than P,(C) (T. Watabe [12]). 

Let X be a differentiable closed manifold 
and h: X 2 P,(C) be an orientation-preserving 
*homotopy equivalence. There is a conjec- 
ture about the total A-classes that states: If X 
admits a nontrivial differentiable circle action, 
then f(X) — h* (P, (C)) (T. Petrie [13]). It is 
known that if the action is free outside the 
fixed-point set, then the conjecture is true 
(T. Yoshida [14]). 


E. Equivariant Bordism 


Fix a compact Lie group G; a compact ori- 
ented G-manifold (i, M) consists of a compact 
*oriented differentiable manifold M and an 
orientation-preserving differentiable G-action 
V:Gx MM on M. 
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Given families F > F' of subgroups of G, a 
compact oriented G-manifold (i, M) is (F, F')- 
free if the following conditions are satisfied: (1) 
if x e M, then the isotropy group G, is conju- 
gate to a member of F; (ii) if xe @M, then G, is 
conjugate to a member of F’. 

If F' is the empty family, then necessarily 
CM is empty and M is closed. In this case we 
say that (V, M) is F-free. 

Given (V, M), define —(V, M) - (y, — M) 
with the structure precisely the same as (y, M) 
except for tortentation. Also define Ou. M)— 
(V, 6M). Note that if (V, M) is (F, F’)-free, then 
(V, 0M) is F"-free. Define (V, M) and (ie, M") to 
be isomorphic if there exists an equivariant 
orientation-preserving diffeomorphism of M 
onto M’. 

An (F, F’)-free compact oriented n- 
dimensional G-manifold (i, M) is said to bord 
if there exists an (F, F)-free compact oriented 
(n+ 1)-dimensional G-manifold (®, W) to- 
gether with a regularly embedded compact n- 
dimensional manifold M, in 0W with M, 
invariant under the G-action such that 
(6, M) is isomorphic to (i, M) and G, is con- 
jugate to a member of F’ for xe0W — M,. 
Also, M, is required to have its orientation in- 
duced by that of W. 

We say that (V, M,) is bordant to (w,, M,) if 
the disjoint union (V4, Mj) -- (Y2, — M,) bords. 
Bordism is an equivalence relation on the class 
of (F, F’)-free compact oriented n-dimensional 
G-manifolds. The bordism classes constitute 
an Abelian group Of(F, F’) under the oper- 
ation of disjoint union. If F' is empty, denote 
the above group by OZ (F). The direct sum 


Os(F. F)- OFF, F’) 

is naturally an Q-module, where Q is the 

*oriented cobordism ring. If F consists of all 

subgroups of G, then O$(F) is denoted by OS. 
Suppose now that F > F' are fixed families 

of subgroups of G. Every F'-free G-manifold 

is also F-free, and so this inclusion induces 

a homomorphism a: O%(F')>O%(F). Simi- 

larly every F-free G-manifold is also (F, F’)- 

free, inducing a homomorphism f:O0%(F)> 

O%(F, EL Finally, there is a homomorphism 

0:08 (F, F')>O%_, (F’) given by 0(j, M)— 

(V, 6M). Then the following sequence is exact 


[15]: 


.. SO0€(F)S 08(F)A O9(F, F) SOS ,(F)5.... 


A weakly almost complex compact G- 
manifold (y, M) consists of a tweakly almost 
complex compact manifold M and a differ- 
entiable G-action y: G x MM that preserves 
the weakly almost complex structure on M. 
US(F, EL US are defined similarly, and they 
are U,-modules, where U, is the tcomplex 
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cobordism ring of compact weakly almost 
complex manifolds. 

To study O€ and U£, (co)bordism theory is 
introduced (P. E. Conner and E. E. Floyd 
[16]), which is one of the *generalized (co)- 
homology theories. Miscellaneous results are 
known, in particular, for G a cyclic group of 
prime period. By means of the equivariant 
*Thom spectrum, equivariant cobordism 
theory can be developed (T. tom Dieck [17]); 
this is a multiplicative generalized cohomology 
theory with Thom classes (— 114 Differential 
Topology; also — 201 Homology Theory, 56 
Characteristic Classes). 


F. Equivariant Homotopy 


Let G be a compact Lie group. On the category 
of closed G-manifolds, we say that two objects 
M, N are y-equivalent if ;(M) — y(N^") for all 
closed subgroups H of G, where y( ) is the 
*Euler-Poincaré characteristic. On the set of 
equivalence classes A(G), a ring structure is 
imposed by disjoint union and the Cartesian 
product. We call A(G) the Burnside ring of G. 
If G is finite, A(G) is naturally isomorphic to 
the classical Burnside ring of G [18]. 

Denote by S(V) the unit sphere of an or- 
thogonal G-representation space V. Let V, 

W be orthogonal G-representation spaces. 
The equivariant stable homotopy group 
LLS(V), S(W)] ], which is defined as the direct 
limit of the equivariant homotopy sets [S(V + 
U), S(W + U) ]; taken over orthogonal G- 
representation spaces U and suspensions, is 
denoted by œ, for x= V — W e RO(G). The 
*smash product of representatives induces a 
bilinear pairing w, x 50,5. Then œ is a 
ring, and œ, is an o9-module. The ring coy is 
isomorphic to the Burnside ring of G, and c, is 
a *projective @o-module of rank one. The wo- 
module o, is free if and only if S(V) and S(W) 
are stably G-homotopy equivalent [18]. 

Let E be an orthogonal G-vector bundle 
over a compact G-space X. Denote by S(E) the 
sphere bundle associated with E. Let E, F be 
orthogonal G-vector bundles over X. Then E 
and F have the same spherical G-fiber homo- 
topy type if there exist fiber-preserving G- 
mappings f: S(E)S(F), f': S(F)  S(E) and 
fiber-preserving G-homotopies h,: S(E) 5 S(E), 
kk S(F)  S(F) such that ho = f' o f, h, 2 identity, 
ho — f o f’, hi «identity. Let KO,(X) be the 
‘equivariant K-group of real G-vector bundles 
over X. Let T; (X) be the additive subgroup of 
KO,(X) generated by elements of the form [E] 
— [F], where E and F are orthogonal G-vector 
bundles having the same spherical G-fiber 
homotopy type. The factor group J;(X)— 
KO,(X)/T,(X) and the natural projection 
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Jg: KOg( X) Jg (X) are called an equivariant 
J-group and an equivariant J-homomorphism, 
respectively (— 237 K-Theory). 

In particular, J;(xoj) is a factor group of 
the real representation ring RO(G). "Adams 
operations on representation rings are the 
main tools for studying the group Jel{xo}) 


[18]. 


G. Infinitesimal Transformations 


Let {:G x MM be a differentiable action of 
a Lie group G on a differentiable manifold M. 
Let X be a "left invariant vector field on G. 
Then we can define a differentiable vector field 
f'(X)on M as 


f QO, ho Dm Denn — LX), q)) Mai 


for each qe M and any differentiable function 
h defined on a neighborhood of q. It ts easy 
to see that f (X), —0 if and only if q is a 
fixed point of the one-parameter subgroup 
{exp(tX)}. A vector field f ' (X) is called an 
infinitesimal transformation of the differenti- 
able transformation group G. 

The set g of all infinitesimal transformations 
of G forms a finite-dimensional "lie algebra 
(the laws of addition and *bracket product are 
defined from those for the vector fields on M). 
If G acts effectively on M, g is isomorphic to 
the Lie algebra of the Lie group G (— 249 Lie 
Groups). In fact, the correspondence X > 
f ' (X) defines a Lie algebra homomorphism 
TT from the Lie algebra of all left invariant 
vector fields on G into the Lie algebra of all 
differentiable vector fields on M [19]. 

The following fact [20] is useful for the 
study of noncompact real analytic transfor- 
mation groups. Let g be a real *semisimple 
Lie algebra and p:g—L(M) be a Lie algebra 
homomorphism of g into a Lie algebra of real 
analytic vector fields on a *real analytic mani- 
fold M. Let p be a point at which the vector 
fields in the image p(g) have common zero. 
Then there exists an analytic system of coordi- 
nates (U;u,, ...,u,) with origin at p in which 
all the vector fields in p(g) are linear. Namely, 
there exists a;;€ g* = Hom,(q, R) such that 


^ 
A(X),= A. NEE Aen, qeU. 

ij rui 
The correspondence X >(a;(X)) defines a Lie 
algebra homomorphism of o into sl(m, R). 

For example, we can show that a real ana- 
lytic SL(n, R) action on the m-sphere is charac- 
terized by a certain real analytic vector field on 
(m—n-+ \)-sphere (5 «nm «2n —2) [21]. In 
particular, there are infinitely many (at least the 
cardinality of the real numbers) inequivalent 
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real analytic SL(n, R) actions on the m-sphere 
(3<n<m). 

Conversely, let q be a finite-dimensional Lie 
algebra of vector fields on M. Although there 
is not always a differentiable transformation 
group G that admits g as its Lie algebra of 
infinitesimal] transformations, the following 
local result holds. Let G be the tsimply con- 
nected Lie group corresponding to the Lie alge- 
bra g. Then for each point x of M, there exist 
a neighborhood Ü of the identity element e 
of G, neighborhoods V, W (V c W) of x, and a 
differentiable mapping f of Ü x V into W with 
the following properties. Putting f(g, y) - g( y) 
(ue Ü,ye V), we have: (i) For all ye V, e(y) — y. 
(ii) If g, he Ü, ye V, then (gh)(y) 2 g(h(y)), pro- 
vided that ghe U, h(y)e V. (iii) Let X be an 
arbitrary element of g. Put g, 2 exp( — tX), the 
corresponding one-parameter subgroup of G. 
If £20 is taken small enough, then we have 
g, € U for |t|<« so that g,(y)(Itl<e, ye V) is well 
defined. Therefore g, determines a vector field 
X on V by the formula 


X,h— lim (h(g,(y)) Mun, 


The vector field X coincides with the restric- 
tion of X to V. This local proposition is often 
expressed by the statement that g generates a 
local Lie group of local transformations, which 
is called Lie's fundamental theorem on local 
Lie groups of local transformations. 


H. Criteria 


It is important to know whether a given trans- 
formation group is a topological or a Lie 
transformation group. The following theorems 
are useful for this purpose [22, 23]: 

(10) Let G be a transformation group of a 
*locally compact Hausdorff space M. If we 
introduce the tcompact-open topology in 
G, then G is a topological transformation 
group of M when M is locally connected or M 
is a *uniform topological space and G acts 
*equicontinuously on M. 

(11) Suppose that M is a *C'-manifold and 
G 1s a topological transformation group of M 
acting effectively on M. If G is locally compact 
and the mapping x g(x) of M is of class C! 
for each element g of G, then G is a Lie trans- 
formation group of M. 

(12) Assume that G is a transformation 
group of a differentiable manifold M and G 
acts effectively on M. Let q be the set of all 
vector fields on M defined by one-parameter 
groups of transformations of M contained in G 
as subgroups. If g is a finite-dimensional Lie 
algebra, then G has a Lie group structure with 
respect to which G is a Lie transformation 
group of M, and then g coincides with the Lie 
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algebra formed by the infinitesimal transfor- 
mations of G. 

By applying theorems (10), (11), and (12) we 
can show that the following groups are Lie 
transformation groups: the group of all tiso- 
metries of a *Riemannian manifold; the group 
of all affine transformations of a differentiable 
manifold with a tlinear connection (generally, 
the group of all transformations of a differenti- 
able manifold that leave invariant a given 
*Cartan connection), the group of all analytic 
transformations of a compact complex mani- 
fold (this group is actually a complex Lie 
group); and the group of all analytic (holomor- 
phic) transformations of a bounded domain 
in C”, 

For related topics — 105 Differentiable 
Manifolds, 114 Differential Topology, 122 
Discontinuous Groups, 153 Fixed-Point 
Theorems, 427 Topology of Lie Groups and 
Homogeneous Spaces, etc. 
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A. Plane Trigonometry 


Fix an orthogonal frame O-X Y in a plane, and 
take a point P on the plane such that the angle 
POX is «. Denote by (x, y) the coordinates of 
P, and put OP =r (Fig. 1). We call the six ratios 
sina = y/r, cosa = x/r, tana = y/x, cota = x/y, 
seca — r/x, coseca — r/y the sine, cosine, tan- 
gent, cotangent, secant, and cosecant of o, re- 
spectively. These functions of the angle « are 
called trigonometric functions or circular func- 
tions (— 131 Elementary Functions). They 

are periodic functions with the fundamental 
period z for the tangent and cotangent, and 2x 
for the others. The relation sin? «+ cos?«— 1 
and the addition formulas sin(a + f) 2 sina cos f 
t cosasin f, cos{x t f) 2 cosa cos p F sinasin f 
follow from the definitions (— Appendix A, 
Table 2). Given a plane triangle ABC (Fig. 2), 
we have the following three properties: (i) a= 
bcos C + ccos B (the first law of cosines); 

(ii) a? = b? +c? —2bc cos A (the second law of 
cosines); (iii) a/sin A = b/sin B 2 c/sin C =2R, 
where R is the radius of the circle circum- 
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scribed about AABC (laws of sines) (— Ap- 
pendix A, Table 2). Thus we obtain relations 
among the six quantities a, b, c, / A, / B, and 
ZC associated with the triangle ABC. The 
study of plane figures by means of trigono- 
metric functions is called plane trigonometry. 
For example, if a suitable combination of 
three of these six quantities (including a side) 
associated with a triangle is given, then the 
other three quantities are uniquely determined. 
The determination of unknown quantities 
associated with a triangle by means of these 
laws is called solving a triangle. 





Fig. 2 


B. Spherical Trigonometry 


The part ABC of a spherical surface bounded 
by three arcs of great circles is called a spher- 
ical triangle. Points A, B, C are called the 
vertices; the three arcs a, b, c are called the 
sides; and the angles formed by lines tangent 
to the sides and intersecting at the vertices are 
called the angles of the spherical triangle (Fig. 
3). If we denote the angles by A, B, C, we have 
the relation A+ B-- C—x=E>0,and E is 
called the spherical excess. Spherical triangles 
have properties similar to those of plane trian- 
gles: sina/sin A —sin b/sin B - sin c/sin C (laws 
of sines), and cosa —cosbcosc -- sinbsinccos A 
(law of cosines). The study of spherical figures 
by means of trigonometric functions, called 
spherical trigonometry, is widely used in astron- 
omy, geodesy, and navigation (— Appendix 
A, Table 2). 


A 


SE 


Fig. 3 
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C. History 


Trigonometry originated from practical prob- 
lems of determining a triangle from three of 
its elements. The development of spherical 
trigonometry, which was spurred on by its 
applications to astronomy, preceded the devel- 
opment of plane trigonometry. In Egypt, 
Babylon, and China, people had some knowl- 
edge of trigonometry, and the founder of 
trigonometry is believed to have been Hippar- 
chus of Greece (fl. 150 B.c.). In the Almagest 
of Ptolemy (c. 150 A.D.) we find a table for 
2sina for «=0, 30’, 1°, 1°30’,... that is exact 
to five decimal places, and the addition for- 
mulas. The Greeks calculated 2sina, which ts 
the length of the chord corresponding to the 
double arc. Indian mathematicians, on the 
other hand, calculated half of the above quan- 
tities, that is, sina and 1 —cosa for the arc a. 
In the book by Aryabhatta (c. 500 A.D.) we 
find laws of cosines. The Arabs, influenced by 
Indian mathematicians, expressed geometric 
computations algebraically, a technique also 
known to the Greeks. AbûÎ Wafá (in the latter 
half of the 10th century A.D.) gave the correct 
sines of angles for every 30' to 9 decimal places 
and studied with Al Battani the projection 
triangle of the sundial, thereby obtaining the 
concepts of sine, cosine, secant, and cosecant. 
Later, a table of sines and cosines for every 
minute was established by the Arabs. Regio- 
montanus (d. 1476), a German, elaborated on 
this table. The form he gave to trigonometry 
has been maintained nearly intact to the pre- 
sent day. Various theorems in trigonometry 
were established by G. J. Rhaeticus, J. Napier, 
J. Kepler, and L. Euler (1748). Euler treated 
trigonometry as a branch of analysis, gen- 
eralized it to functions of complex variables, 
and introduced the abbreviated notations that 
are still in use (— 131 Elementary Functions). 
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Turbulent flow is the irregular motion of fluids, 
whereas relatively simple types of flows that 
are either stationary, slowly varying, or peri- 
odic in time are called laminar flow. When 
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a laminar flow is stable against external dis- 
turbances, it remains laminar, but if the flow is 
unstable, it usually changes into either another 
type of laminar flow or a turbulent flow. 


A. Stability and Bifurcation of Flows 


The velocity field u(x, t), x being the space 
coordinates and : the time, of a flow of an 
incompressible viscous fluid in a bounded 
domain G is determined by the *Navier-Stokes 
equation of motion, 


^ 
ou 





i 
S +(u- grad)u — vAu 4 — grad p — 0, (1) 
ct p 


and the equation of continuity, 
divu- Q0, (2) 


with the prescribed initial and boundary con- 
ditions, where A denotes the Laplacian, p the 
pressure, p the density, and v the kinematic 
viscosity of the fluid. Suitable extensions must 
be made in the foregoing system of equations if 
other field variables, such as the temperature 
in thermal-convection problems, are to be 
considered. 

The stability of a fluid flow is studied by 
examining the behavior of the solution of 
equations (1) and (2) against external distur- 
bances, and, in particular, stability against 
infinitesimal disturbances constitutes the linear 
stability problem. The stability characteristics 
of the solution of equations (1) and (2) depend 
largely upon the value of the tReynolds num- 
ber R=UL/v, U and L being the representa- 
tive velocity and length of the flow, respec- 
tively. Let a stationary solution of equations 
(1) and (2) be eis, R). If the perturbed flow 
is given by u,(x, R)+ v(x, R)exp(at), v being 
the perturbation velocity, and equation (1) is 
linearized with respect to v, we obtain a ‘linear 
eigenvalue problem for c. The flow is called 
linearly stable if max(Reo) is negative, and 
linearly unstable if it is positive. For small 
values of R, a flow is generally stable, but it 
becomes unstable if R exceeds a critical value 
R,, which is called the critical Reynolds num- 
ber [1]. 

The instability of a stationary solution gives 
rise to the *bifurcation to another solution 
at a tbifurcation point R, of the parameter R. 
If Imc =0 for an eigenvalue c at R=R,, a 
stationary solution bifurcates from the solu- 
tion uo at R,, and if Imo Z0, a time-periodic 
solution bifurcates at R.. The latter bifurca- 
tion is called the Hopf bifurcation. A typical 
example of stationary bifurcation is the gener- 
ation of an axially periodic row of Taylor 
vortices in Couette flow between two rotating 
coaxial cylinders, which was studied by G. I. 
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Taylor (1923), with excellent agreement be- 
tween theory and experiment [2]. Hopf bifur- 
cation is exemplified by the generation of 
Tollmien-Schlichting waves in the laminar 
*boundary layer along a flat plate, which was 
predicted theoretically by W. Tollmien (1929) 
and H. Schlichting (1933) and later confirmed 
experimentally by G. B. Schubauer and H. K. 
Skramstad (1947) [3]. 

In either type of bifurcation (Ime =0 or 40) 
the bifurcation is called supercritical if the 
bifurcating solution exists only for R » R,, 
subcritical if it exists only for R « R,, and 
transcritical if it happens to exist on both sides 
of R.. The amplitude of the departure of the 
bifurcating solution from the unperturbed 
solution uo tends to zero as Ro R,. The be- 
havior of the bifurcating solution around the 
bifurcation point R, is dealt with systemat- 
ically by means of bifurcation analysis. In 
supercritical bifurcation, the bifurcating solu- 
tion is stable and represents an equilibrium 
state to which the perturbed flow approaches 
just as in the cases of Taylor vortices and 
Tollmien-Schlichting waves. On the other 
hand, for subcritical bifurcation the bifurcat- 
ing solution is unstable and gives a critical am- 
plitude of the disturbance above which the 
linearly stable basic flow (R < R.) becomes 
unstable. In this case, the instability of the 
basic flow gives rise to a sudden change of the 
flow pattern resulting in either a stationary 
(or time-periodic) or even turbulent flow. The 
transition to turbulent flow that takes place 
in Hagen-Poiseuille flow through a circular 
tube and is linearly stable at all values of R 
(R, — oo) may be attributed to this type of 
bifurcation. 

The concept of bifurcation can be extended 
to the case where the flow Ve is nonstationary, 
but the bifurcation analysis then becomes 
much more difficult. 


B. Onset of Turbulence 


The fluctuating flow resulting from an insta- 
bility does not itself necessarily constitute a 
turbulent flow. In order that a flow be turbu- 
lent, the fluctuations must take on some irreg- 
ularity. The turbulent flow is usuallv defined 
in terms of the long-time behavior of the flow 
velocity u(x, t) at a fixed point x in space. The 
flow ts expected to be turbulent if the fluctuat- 
ing velocity 


" 
óu(x, t) — u(x, t) im >| u(x, t) dt (3) 


Tox o 


satisfies the condition 


1 T 
lim lim A óu;(x, t)Óuj(x, t 4- t) dt =0, (4) 
ro Toc T 0 
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where the subscripts label the components. 
Condition (4) implies that the *dynamical 
system of a fluid has the mixing property. This 
condition also states that the velocity fluctu- 
ation ĝu; has a continuous frequency spectrum. 
In practical situations the frequency spectrum 
of a turbulent flow may contain both the line 
and continuous spectra, in which case the flow 
is said to be partially turbulent. 

L. D. Landau (1959) and E. Hopf (1948) 
proposed a picture of turbulent flow as one 
composed of a *quasiperiodic motion, u(t) = 
f(@,t,@2t,...,@,t), with a large number of 
rationally independent frequencies o, ..., c, 
produced by successive supercritical bifurca- 
tions of Hopf type. This picture of turbulence 
is not compatible with the foregoing definition 
of turbulence, since it does not satisfy the 
mixing property (4). The fact that the gener- 
ation of real turbulence is not necessarily 
preceded by successive supercritical bifurca- 
tions casts another limitation on the validity 
of this picture. 

The concept of turbulence is more clearly 
exhibited with respect to a dynamical system 
of finite dimension. Although we are without 
a general proof, it 1s expected that the Navier- 
Stokes equation with nonzero viscosity v can 
be approximated within any degree of ac- 
curacy by a system of finite-dimensional ffirst- 
order ordinary differential equations 


E F(X 5 
2 (X). (5) 
Thus the onset and some general properties of 
turbulence are understood in the context of 
the theory of *dynamical systems. Turbulence 
is related to those solutions of equation (5) 
that tend to a tset in the "phase space that 
is neither a fixed point, a tclosed orbit, nor a 
*torus. A set of such complicated structure is 
called a nonperiodic ‘attractor or a strange 
attractor. Historically, the strange attractor 
originates from the strange Axiom A attrac- 
tor that was found in a certain class of dy- 
namical systems called the Axiom A systems. 
However, this term has come to be used ina 
broader sense, and it now represents a vartety 
of nonperiodic motions exhibited by a system 
that is not necessarily of Axiom A type. The 
above-mentioned Landau-Hopf picture of 
turbulence was criticized by D. Ruelle and F. 
Takens (1971), who proved for the dynamical 
system (5) that an arbitrary small perturbation 
on a quasiperiodic *flow on a k-dimensional 
torus (k > 4) generically (in the sense of residual 
sets) produces a flow with a strange Axiom A 
attractor [4]. 

There exist a number of examples of first- 
order ordinary differential equations of rela- 
tively low dimension whose solutions exhibit 
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nonperiodic behavior. An important model 
system related to turbulence is the Lorenz 
model (1963) of thermal convection in a hort- 
zontal fluid layer. This model is obtained by 
taking only three components out of an in- 
finite number of spatial *Fourier components 
of the velocity and temperature fields. The 
mode! is written as 


= —0X +0Y, 


dt 
dY 

—-——XZ-rX —Y, 

dt 

SS XY—bZ 6 
db m (6) 
where o (>b + 1) and b are positive constants 
and r is a parameter proportional to the Ray- 
leigh number. Obviously, equations (6) have 

a fixed point X = Y= Z =0 representing the 
state of thermal convection without fluid flow. 
For r « 1, this fixed point is stable, but it be- 
comes unstable for r> 1, and a pair of new 
fixed points X = Y=+,/b(r—1), Z=r—-1 
emerges supercritically. This corresponds to 
the onset of stationary convection atr=1. Ata 
still higher value of r 2o(c - b --3)(co —b—1)a 
subcritical Hopf bifurcation occurs with re- 
spect to this pair of fixed points, and for a 
certain range of r above this threshold the 
solutions with almost any initial conditions 
exhibit nonperiodic behavior. This corre- 
sponds to the generation of turbulence. The 
property 

0X or OZ 


e 7 
TUE 7 RD E o) 


where the dots denote time derivatives, shows 
that each volume element of the phase space 
shrinks asymptotically to zero as the time 
increases indefinitely. This property is char- 
acteristic of dynamical systems with energy 
dissipation, in sharp contrast to the *measure- 
preserving character of Hamiltonian systems 
[5]. 

For a certain class of ordinary differen- 
tial equations, the bifurcation to nonperiodic 
motion corresponds neither to the bifurcation 
of tori, just as in the Ruelle-Takens theory, nor 
to subcritical bifurcation, as in the Lorenz 
model. Such a bifurcation takes place when 
nonperiodic motion emerges as the conse- 
quence of an infinite sequence of supercritical 
bifurcations at each of which a periodic orbit 
of period T bifurcates into one of period 2T. 
If we denote the nth bifurcation point by r,, 
the distance r,,, —r, between two successive 
bifurcation points decreases exponentially with 
increasing n, and eventually the bifurcation 
points accumulate at a point r,, beyond which 
nonperiodic motion is expected to emerge. It is 
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not yet clear if any of the above three types of 
bifurcation leading to nonperiodic behavior is 
actually responsible for the generation of real 
turbulence. 

Some important properties of a dynamical 
system with a nonperiodic attractor, which 
may be either a flow or a ‘diffeomorphism, can 
be stated as follows: 

(1) The distance between two points in the 
phase space that are initially close to each 
other grows exponentially in time, so that the 
solutions exhibit a sensitive dependence on the 
initial conditions. 

(it) The nonperiodic attractor has *Lebesgue 
measure zero, and such a system is expected to 
have many other *ergodic *invariant measures. 

The irregular behavior of a deterministic 
dynamical system is also called chaos, but this 
concept is more abstract and general than 
that of turbulence, and covers phenomena ex- 
hibited by systems such as nonlinear electric 
circuits, chemical reactions, and ecological 
systems. 


C. Statistical Theory of Turbulence 


The statistical theory of turbulence deals with 
the statistical behavior of fully developed 
turbulence. The turbulent field is sometimes 
idealized for mathematical simplicity to be 
homogeneous or isotropic. In homogeneous 
turbulence the statistical laws are invariant 
under all parallel displacements of the coordi- 
nates, whereas in isotropic turbulence invar- 
iance under rotations and reflections of the 
coordinates is required in addition. 

The instantaneous state of the fluid motion 
is completely determined by specifying the 
fluid velocity u at all space points x and can 
be expressed as a phase point in the infinite- 
dimensional tphase space spanned by these 
velocities. The phase point moves with time 
along a path uniquely determined by the solu- 
tion of the Navier-Stokes equation. In the 
turbulent state the path is unstable to the 
initial disturbance and describes an irregular 
line in the phase space. In this situation the 
deterministic description is no longer useful 
and should be replaced by a statistical treat- 
ment. Abstractly speaking, turbulence is just a 
view of fluid motion as the random motion of 
the phase point u(x) (— 407 Stochastic Pro- 
cesses). The equation for the *characteristic 
functional of the random velocity u(x) was first 
given by E. Hopf (1952). An exact solution 
obtained by Hopf represents a tnormal distri- 
bution associated with a white energy spec- 
trum, but so far no general solution has been 
obtained [6]. 

Besides the formulation in terms of the 
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tprobability distribution or the characteristic 
functional, there is another way of describing 
turbulence by tmoments of lower orders. This 
is the conventional statistical theory originated 
by G. I. Taylor (1935) and T. von Karman 
(1938), which made remarkable progress after 
World War II. The principal moments in this 
theory are the correlation tensor, whose (i, j)- 
component is the mean of the product of two 
velocity components u; at a point x and u; at 
another point x +r, 


B,(r)= Cuj(x)uj(x Tr)». (8) 


and its tFourier transform, or the energy spec- 
trum tensor, 


1 
RAN (s | Bueno. /—1k-r)dr. (9) 


In isotropic turbulence 6 is expressed as 
1 kik; 
ouo zs Et, k —|kl, (10) 


where E(k) is the energy spectrum function, 
representing the amount of energy included in 
a spherical shell of radius k in the wave num- 
ber space. The energy of turbulence & per unit 
mass is expressed as 


1 1 1 
ENEE | 0,094. 


-| E(k)dk. (11) 
0 

The state of turbulence is characterized by 
the Reynolds number R = EJ?/(vk1?), where E, 
and k, are representative values of E(k) and k, 
respectively. For weak turbulence of small R, 
E(k) is governed by a linear equation with the 
general solution 


E(k, t) E(k, O)exp( —2vK? t), (12) 


E(k, 0) being an arbitrary function. Thus E(k) 
decays in time due to the viscous dissipation. 
For strong turbulence of large R, it is difficult 
to obtain the precise form of E(k), and this is 
usually done by way of some assumption that 
allows us to approximate the nonlinear effects 
[7]. 

Some of the similarity laws governing the 
energy spectrum and other statistical functions 
can be determined rigorously but not neces- 
sarily uniquely. For 3-dimensional incompress- 
ible turbulence, the energy spectrum satisfies 
an inviscid similarity law 


E(k)/Eo = F.(k/ko) (13) 


in the energy-containing region k == O(k,) char- 
acterized by a wave number kọ, and a viscous 
similarity law 


E(k)/Eo = R ?^ F,(k/(R*" ko)), (14) 
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in the energy dissipation region k=O(R*"*k,), 
where F, and F, denote dimensionless func- 
tions generally dependent on the initial con- 
dition and the time [6]. 

If an assumption is made to the effect that 
the statistical state in the energy-dissipation re- 
gion depends only upon the energy-dissipation 
rate e= — dé /dt besides the viscosity v (or R), 
then (14) becomes Kolmogorov's equilibrium 
similarity law (1941): 


E(k) 5g! v54 F (Kg y "2911. (15) 


where F is a dimensionless function. For 
extremely large R (or small v) there exists 

an inertial subregion between the energy- 
containing and energy-dissipation regions such 
that the viscous effect vanishes and (15) takes 
the form 


E(k) 2 Ke? k 55, (16) 


where K is an absolute constant. Kolmogorov’s 
spectrum (16) has been observed experimen- 
tally several times, and now its consistency 
with experimental results at large Reynolds 
numbers is well established [8]. 

Kolmogorov (1962) and others modified (16) 
by taking account of the fluctuation of e due 
to the intermittent structure of the energy- 
dissipation region as 


E(k) = K'e k^ (Lk) "P. (17) 


where e is now the average of the fluctuating e, 
u is the covariance of the log-normal distribu- 
tion of e, and L is the length scale of the spatial 
domain in which the average of e is taken [8]. 
À similar modification, with the exponent 
—p/3 in place of — 4/9, is obtained using a 
fractal model of the energy-cascade process. 
These corrections to E(k), based upon the 
experimentally estimated o of 0.3-0.5, are 

too small to be detected experimentally, 

but the deviation is expected to appear more 
clearly in the higher-order moments [8-10]. 
It should be noted that Kolmogorov's spec- 
trum (16) itself does not contradict the notion 
of intermittent turbulence and gives one 

of the possible asymptotic forms in the limit 
Roo. 

The 1-dimensional Burgers model of tur- 
bulence satisfies the same similarity laws as 
(13) and (14), but it has an inviscid spectrum 
E(k)ock ? instead of (16). Two-dimensional 
incompressible turbulence has no energy- 
dissipation region, and hence Kolmogorov’s 
theory is not valid for this turbulence. It has 
an inviscid spectrum E(k)ock 3, first derived 
by R. H. Kraichnan (1967), C. E. Leith (1968), 
and G. K. Batchelor (1969). These inviscid 
spectra for 1- and 2-dimensional turbulence 
have been confirmed by numerical simulation 


[11]. 
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A. History 


Unified field theory 1s a branch of theoretical 
physics that arose from the success of tgen- 
eral relativity theory. Its purpose is to dis- 
cuss in a unified way the fields of gravitation, 
electromagnetism, and nuclear force from the 
standpoint of the geometric structure of space 
and time. Studies have continued since 1918, 
and many theories of mathematical interest 
have been published without attaining, how- 
ever, any conclusive physical theory. 

A characteristic feature of relativity theory is 
that it is based on a completely new concept of 
space and time. That is, in general relativity 
theory it is considered that when a gravita- 
tional field is generated by matter, the struc- 
ture of space and time changes, and the flat 
*Minkowski world becomes a 4-dimensional 
*Riemannian manifold (with signature (1, 3)) 
having nonvanishing curvature. The tfunda- 
mental tensor g;; of the manifold is interpreted 
as the gravitational potential, and the basic 
gravitational equation can be described as a 
geometric law of the manifold. It is character- 
istic of general relativity theory that gravita- 
tional phenomena are reduced to space-time 
structure (— 359 Relativity). The introduction 
of the Minkowski world in tspecial relativity 
theory was a revolutionary advance over the 
3-dimensional space of Newtonian mechanics. 
But the inner structure of the Minkowski 
world does not reflect gravitational phenom- 
ena. The latter shortcoming is overcome by 
introducing the concept of space-time repre- 
sented by a Riemannian manifold into general 
relativity theory. 

When a coexisting system of gravitational 
and electromagnetic fields is discussed in gen- 
eral relativity theory, simultaneous equations 
(Einstein-Maxwell equations) must be solved 
for the gravitational potential g;; and the 
electromagnetic field tensor E, Thus the 
gravitational potential g;; is affected by the 
existence of an electromagnetic field. As the 
validity of general relativity began to be ac- 
cepted, it came to be expected that all physical 
actions might be attributed to the gravita- 
tional and electromagnetic fields. Thus various 
extensions of general relativity theory have 
been proposed in order to devise a geometry 
in which the electromagnetic as well as the 
gravitational field directly contributes to the 
space-time structure, and to establish a uni- 
fied theory of both fields on the basis of the 
geometry thus obtained. These attempts are 
illustrated in Fig. 1. 
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B. Weyl’s Theory 


The first unified field theory was proposed by 
H. Weyl in 1918. In Riemannian geometry, 
which is the mathematical framework of gen- 
eral relativity theory, the *covariant derivative 
of the tfundamental tensor g;; vanishes, i.e., 


Vig = O95 /0X' — gi, Vig — dal =O, (1) 


where T} is the *Christoffel symbol derived 
from gj. Conversely, if I7, is considered as the 
coefficient of a general taffine connection and 
(1) is solved with respect to T}, under the con- 
dition T= I7; then the Christoffel symbol 
derived from g; coincides with T}. In this 
sense, (1) means that the space-time manifold 
has Riemannian structure. On the other hand, 
Weyl considered a space whose structure is 
given by an extension of (1), 


Vig, = 2A :Gixs (2) 


and developed a unified field theory by regard- 
ing A; as the electromagnetic potential. This 
theory has mathematical significance in that it 
motivated the discovery of Cartan's geometry 
of connection, but it has some unsatisfactory 
points concerning the derivation of the field 
equation and the equation of motion for a 
charged particle. 

The scale transformation given by j;— p^g;; 
is important in Weyl’s theory. If in addition to 
this transformation, A; is changed to 


A;= A;—0log p/Óx', (3) 


then (2) is left invariant and the space-time 
structure in Weyl’s theory remains unchanged. 
We call (3) the gauge transformation, cor- 
responding to the fact that the electromagnetic 
potential A; is determined by the electro- 
magnetic field tensor F; up to a gradient vec- 
tor. In the *field theories known at present, the 
gauge transformation is generalized to various 
fields, and the law of charge conservation is 
derived from the invariance of field equations 
under generalized gauge transformation. 


C. Further Developments 


A unified field theory that appeared after 
Weyl’s is Kaluza's 5-dimensional theory (Th. 
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Kaluza, 1921). This theory has been criticized 
as being artificial, but it is logically consistent, 
and therefore many of the later unified field 
theories are improved or generalized versions 
of it. The underlying space of Kaluza's theory 
is a 5-dimensional Riemannian manifold with 
the fundamental form 


ds? =(dx* + A, dx?) Lü dx" dx”, 


where A; and g; are functions of x' alone (a, b, 
2,1 j=0, 1, 2, 3). The field equation and the 
equation of motion of a particle are derived 
from the variational principle in general re- 
lativity theory. The field equation is equivalent 
to the Einstein-Maxwell equations. The trajec- 
tory of a charged particle is given by a geo- 
desic in the manifold, and its equation is re- 
ducible to the Lorentz equation in general 
relativity. 

After the introduction of Kaluza's theory, 
various unifield field theories were proposed, 
and we give here the underlying manifolds or 
geometries of some mathematically interesting 
theories: a manifold with *affine connection 
admitting absolute parallelism (A. Einstein, 
1928); a manifold with *projective connec- 
tion (O. Veblen, B. Hoffman, 1930 [4]; J. A. 
Schouten, D. van Dantzig, 1932); wave geome- 
try (a theory based on the linearization of the 
fundamental form; Y. Mimura, 1934 [3]; a 
nonholonomic geometry (G. Vranceanu, 1936); 
a manifold with *conformal connection (Hoff- 
man, 1948). 

The investigations since 1945 have been 
motivated by the problem of the representa- 
tion of matter in general relativity theory. 
Einstein first represented matter by an energy- 
momentum tensor T; of class C?, which must 
be determined by information obtained from 
outside relativity. Afterward he felt that this 
point was unsatisfactory and tried to develop 
a theory on the basis of field variables alone, 
without introducing such a quantity as T;;. 
This theory is the so-called unitary field theory, 
and a solution without singularities is required 
from a physical point of view. His first attempt 
was to remove singularities from an exterior 
solution in general relativity by changing the 
topological structure of the space-time mani- 
fold. This idea was then extended to a unified 
field theory by J. A. Wheeler, and an interpre- 
tation was given to mass and charge by apply- 
ing the theory of *harmonic integrals (1957) 
[2]. 

Einstein's second attempt was to propose 
a nonsymmetric unified field theory (1945) 

[1, Appendix II; 6]. The fundamental quantities 
in this theory are a nonsymmetric tensor du 
and a nonsymmetric affine connection I5. 

The underlying space of the theory can be con- 
sidered a direct extension of the Riemannian 
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manifold, since (1) is contained in the field 
equations (notice the order of indices in this 
equation). E. Schródinger obtained field equa- 
tions of almost the same form by taking only 
I5, as a fundamental quantity (1947) [5]. 
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A. Uniform Convergence of a Sequence of 
Real-Valued Functions 


A sequence of real-valued functions { f,(x)] 
defined on a set B is said to be uniformly con- 
vergent (or to converge uniformly) to a function 


f(x) on the set B if it converges with respect 


to the tnorm || o || =sup{|@(x)|| xe Bj, i.e., 
lim, ,4.]| f, —f || 20 (- 87 Convergence). In 
other words, | f,(x)} converges uniformly to 


f (x) on B if for every positive constant ¢ we 


can select a number N independent of the 
point x such that | f, (x) —f (x)| <e holds for all 
n>N and xe B. By the *completeness of the 
real numbers, a sequence of functions { f,(x)} 
converges uniformly on B if and only if we can 
select for every positive constant e a number N 
independent of the point x such that | f, (x) — 


fa(x)| <£ holds for all m, n>N and xe B. 


The uniform convergence of a series >, f, (x) or 
of an infinite product [T, f, (x) is defined by the 
uniform convergence of the sequence of its 
partial sums or products. If the series of the 
absolute values 3. | f,(x)| converges uniformly, 
then the series X, f,(x) also converges uni- 
formly. In this case the series X, f,(x) is said to 
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be uniformly absolutely convergent. À sequence 
of (nonnegative) constants M, satisfying 

Lf, (xX)| € M, is called a dominant (or majorant) 
of the sequence of functions | f,(x)}. A series of 
functions X, f,(x) with converging majorant 
series >), M, is uniformly absolutely conver- 
gent (Weierstrass's criterion for uniform 
convergence). 

Let {2,(x)} be another sequence of functions 
on B. The series X, ^, (x) f, (x) is uniformly 
convergent if either of the following conditions 
holds: (i) The series X, f, (x) converges uni- 
formly and the partial sums of the series 
YA) 7 4444 (CH are uniformly bounded, ie., 
bounded by a constant independent of xe B 
and of the number of terms; or (ul the series 
Y AA) — 44,44 (X) converges uniformly, the 
sequence {/,(x)} converges uniformly to 
0, and the partial sums of X, f,(x)| are 
uniformly bounded. 


B. Uniform Convergence and Pointwise 
Convergence 


Let { f,(x)j be a sequence of real-valued func- 
tions on B, and let f(x) be a real-valued func- 
tion also defined on B. If the sequence of 
numbers { f,(xo); converges to f (xg) for every 
point xo€ B, we say that | f(x)! is pointwise 
convergent (or simply convergent) to the func- 
tion f(x). Pointwise convergence is, of course, 
weaker than uniform convergence. If we repre- 
sent the function f(x) by the point [Ieg f(x) = 
Lf] of the *Cartesian product R^ — [T,. 4 R, 
then the pointwise convergence of 1 f, (x)! to 

f (x) is equivalent to the convergence of the 
sequence of points ([ f, ]} to [f] in the *prod- 
uct topology of R7. 

When B is a *topological space and every 
f(x) is continuous, the pointwise limit f (x) of 
the sequence { f,(x)] is not necessarily con- 
tinuous. However, if the sequence of continuous 
functions | f,(x)} converges uniformly to f(x) 
then the limit function f(x) is continuous. On 
the other hand, the continuity of the limit does 
not imply that the convergence is uniform. If 
the set B is *compact and the sequence of 
continuous functions { f(x) is monotone (Le., 
fi x) Sfn+i (X) for all n or f(x) 2 fa: (x) for all n) 
and pointwise convergent E a continuous 
function f(x), then the convergence is uniform 
(Dini's theorem). 


C. Uniform Convergence on a Family of Sets 


Let B be a topological space. We say that a 
sequence of functions 1 f,(x)} is uniformly 
convergent in the vidis sense to the function 
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f (x), depending on circumstances, in either of 
the following two cases: (i) Every point xe B 
has a neighborhood U on which the sequence 
1f, G9] converges uniformly to f(x); or (ii) 

{ f,(x)} converges uniformly to f(x) on every 
compact subset K in B. If B is locally com- 
pact, the two definitions coincide. The term 
uniform convergence on compact sets is also 
used for (ii). 

In general, given a family F of subsets in B, 
we may introduce in the space ¥ of real- 
valued functions on B a family of tseminorms 
lf lg m suptlfG)l| xe Kj for every set Ke2. 
Let T be the topology of defined by this 
family of seminorms (— 424 Topological 
Linear Spaces). A sequence { f,(x1j is called 
uniformly convergent on 2 if it is convergent 
with respect to T. In particular, when 2 coin- 
cides with {B}, {{x}|xe B}, or the family of all 
compact sets in B, then uniform convergence 
on ¥ coincides with the usual uniform conver- 
gence, pointwise convergence, or uniform 
convergence on compact sets, respectively. If 2 
is a countable set, the topology T is tmetri- 
zable. Most of these definitions and results 
may be extended to the case of functions 
whose values are in the complex number field, 
in a *normed space, or in any *uniform space. 


D. Topology of the Space of Mappings 


Let X, Y be two topological spaces. Denote by 
C(X, Y) the space of all continuous mappings 
f: X 5 Y. This space C(X, Y), or a subspace .7 
of C(X, Y), is called a mapping space (or func- 
tion space or space of continuous mappings) 
from X to Y. A natural mapping ®:¥ x XY 
is defined by di f, 9) LO) er xe). We 
define a topology in ¥ as follows: for a com- 
pact set K in X and an open set U in Y, put 
W(K,U)={ feF¥ | f(K)eU }, and introduce a 
topology in F such that the base for the to- 
pology consists of intersections of finite num- 
bers of W(K;, U,). This topology is called the 
compact-open topology (R. H. Fox, Bull. Amer. 
Math. Soc., 51 (1945)). When X is a *locally 
compact Hausdorff space and Y is a *Haus- 
dorff space, the compact-open topology is the 
*weakest topology on F for which the func- 
tion ® is continuous, If, in this case, F is 
compact with respect to the compact-open 
topology, then the compact-open topology 
coincides with the topology of pointwise 
convergence. 

In particular, when Y is a *metric space (or, 
in general, a *uniform space with the Ge 
ity 1), the compact-open topology in F coin- 
cides with the topology of uniform conver- 
gence on compact sets. A family Z is called 
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equicontinuous at a point xe X if for every 
positive number e (in the case of uniform 
space, for every Vel) there exists a neighbor- 
hood U of x such that p( f (x), f(p)) « e (f(x), 
fpe V) for every point pe U and for every 
function fe F (G. Ascoli, 1883—1884). If X isa 
‘locally compact Hausdorff space, a necessary 
and sufficient condition for ¥ to be relatively 
compact (i.e., for the closure of F to be com- 
pact) with respect to the compact-open top- 
ology (e, to the topology of uniform conver- 
gence on compact sets) is that ¥ be equicon- 
tinuous at every point xe X and that the set 
(SISEF } be relatively compact in Y for 
every point xe X (Ascoli's theorem). In partic- 
ular, when X is a o-compact locally compact 
Hausdorff space and Y is the space of real 
numbers, a family of functions ¥ that are 
equicontinuous (at every point xe X ) and 
uniformly bounded is relatively compact. 
Hence, for any sequence of functions { fy} 

in 7, we can select a subsequence { f,,,)} 
which converges uniformly on compact sets 
(Ascoli-Arzelà theorem). 


E. Normal Families 


P. Montel (1912) gave the name normal family 
to the family of functions that is relatively com- 
pact with respect to the topology of uniform 
convergence on compact sets. This terminol- 
ogy is used mainly for the family of complex 
analytic functions. In that case, it is customary 
to compactify the range space and consider Y 
to be the *Riemann sphere. Using this notion, 
Montel succeeded in giving a unified treatment 
of various results in the theory of complex 
functions. 

A family of analytic functions .7 on a finite- 
dimensional tcomplex manifold X is a normal 
family if it is uniformly bounded on each com- 
pact set (Montel's theorem). Another criterion 
is that there are three values on the Riemann 
sphere which no function fe F takes. More 
generally, three exceptional values not taken 
by fe.7 may depend on f, if there is a positive 
lower bound for the distances between these 
three values on the Riemann sphere. This gives 
an easy proof of the *Picard theorem stating 
that every ‘transcendental meromorphic func- 
tion f(z) in |z| « oo must take all values except 
possibly two values. In fact the family of func- 
tions f.(z) — f(z/2"), n2 1,2,3, ...,in {1 « |z| 
<2} cannot be normal. Using a similar proce- 
dure, G. Julia obtained the results on tJulia’s 
direction. 

F. Marty introduced the notion of spherical 
derivative | /'(2)]/(14-] f(z)|7) for the analytic 
or meromorphic function f(z) and proved that 
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for a family F ={ f(z)} of analytic functions to 
be normal, it is necessary and sufficient that 
the spherical derivatives of fe F be uniformly 
bounded. This theorem implies Montel's 
theorem and its various extensions, including, 
for example, quantitative results concerning 
*Borel’s direction. 

A family F of analytic functions of one 
variable defined on X is said to form a quasi- 
normal family if there exists a subset P of X 
consisting only of isolated points such that 
from any sequence { f,1 ( f, €.7) we can select a 
subsequence { fany converging uniformly on 
X — P. If P is finite and consists of p points, 
the family ¥ is called a quasinormal family of 
order p. For example, the family of at most +p- 
valent functions is quasinormal of order p. 

The theory of normal families of complex 
analytic functions is not only applied to *value 
distribution theory, as above, but also used to 
show the existence of a function that gives the 
extremal of functionals. The extremal function 
is usually obtained as a limit of a subsequence 
of a sequence in a normal family. A typical 
example of this method is seen in the proof of 
the *Riemann mapping theorem. This is per- 
haps the only general method known today in 
the study of the iteration of *holomorphic 
functions. By this method, Julia (1919) made 
an exhaustive study of the iteration of mero- 
morphic functions; there are several other in- 
vestigations on the iteration of elementary 
transcendental functions. On the other hand, 
A. Wintner (Comm. Math. Helv., 23 (1949)) 
gave the implicit function theorem for analytic 
functions in a precise form using the theory of 
normal families of analytic functions of several 
variables. 


References 


[1] N. Bourbaki, Eléments de mathématique, 
HI., Topologie générale, ch. 10, Espaces fonc- 
tionnels, Actualités Sci. Ind., 1084b, Hermann, 
second edition, 1967; English translation, 
Theory of sets, Addison-Wesley, 1966. 

[2] J. L. Kelley, General topology, Van Nos- 
trand, 1955, ch. 7. 

[3] J. Dieudonné, Foundations of modern 
analysis, Academic Press, 1960, enlarged and 
corrected printing, 1969. 

For normal families of complex functions, 
[4] P. Montel, Leçons sur les familles nor- 
males de fonctions analytiques et leurs appli- 
cations, Gauthier-Villars, 1927. 

[5] G. Valiron, Familles normales et quasi- 
normales de fonctions méromorphes, 
Gauthier-Villars, 1929. 


436 A 
Uniform Spaces 


436 (1I.22) 
Uniform Spaces 


A. Introduction 


There are certain properties defined on *metric 
spaces but not on general *topological spaces, 
for example, tcompleteness or *uniform con- 
tinuity of functions. Generalizing metric spaces, 
A. Weil introduced the notion of uniform 
spaces. This notion can be defined in several 
ways [3,4]. The definition in Section B is that 
of Weil [1] without the tseparation axiom for 
topology. 

We denote by A, the diagonal ((x, x) xe X } 
of the Cartesian product X x X ofa set X with 
itself. If U and Vare subsets of X x X, then the 
composite Vo U is defined to be the set of all 
pairs (x, y) such that for some element z of X, 
the pair (x, z) is in U and the pair (z, y) is in V. 
The inverse U~! of U is defined to be the set of 
all pairs (x, y) such that ( y, x)e U. 


B. Definitions 


Let % be a nonempty family of subsets of 
X x X such that (i) if Ue% and U c V, then 
Ve M; (ii) if U, Ve, then UN Ve M; (n) if 
Ue, then A, c U; (iv) if Uew, then U e 
4f; and (v) if U € 4/, then Vo Va U for some 
Ve 4. Then we say that a uniform structure 
(or simply a uniformity) is defined on X by 
4. Ifa uniformity is defined on X by 4, then 
the pair (X, 4) or simply the set X itself is 
called a uniform space, and 4 is usually called 
a uniformity for X. 

A subfamily # of the uniformity 7% is called 
a base for the uniformity 4 if every member of 
^! contains a member of 2. If a family 24 of 
subsets of X x X is a base for a uniformity 4, 
then the following propositions hold: (ii^) if 
U, Ve 2, then there exists a We 22 such that 
WcURV;(i)if Ve &, then A, cU; (iv) if 
U € Z4, then there exists a Ve 2 such that 
VcU *(v)if Ue, then there exists a Ve Z 
such that Vo Vc U. Conversely, if a family # 
of subsets of a Cartesian product X x X satis- 
fies (ii')- (v), then the family 7 -(U|U c X x X, 
Vc U for some Ve ¥} defines a uniformity on 
X and Z is a base for %. Given a uniform 
space (X, Y), a member V of 4 is said to be 
symmetric if V= V 1 The family of all sym- 
metric members of X is a base for X. 


C. Topology of Uniform Spaces 


Given a uniform space (X, 4/), an element 
xe X, and Ve, we put U(x)={ v| ye X, (x, y) 
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EU j. Then the family 4(x) - (U(x))Uewj 
forms a neighborhood system of xe X, which 
gives rise to a topology of X (— 425 Topo- 
logical Spaces). This topology is called the 
uniform topology (or topology of the uni- 
formity). When we refer to a topology of a uni- 
form space (X, 4/), it is understood to be the 
uniform topology; thus a uniform space is 

also called a uniform topological space. If 4 is 
a base for the uniformity of a uniform space 
(X,U), then (x) 2 (U(x)| Ue 8} is a base for 
the neighborhood system at each point xe X. 
Each member of ~ is a subset of the topolog- 
ical space X x X, which is supplied with the 
product topology. The family of all open 
(closed) symmetric members of 4/ forms a 

base for %. A uniform space (X, %) is a *T,- 
topological space if and only if the intersection 
of all members of X is the diagonal A, In this 
case, the uniformity of (X, %) is called a T,- 
uniformity, and (X, %) is called a T, -uniform 
space. A T,-uniform space is always tregular; a 
fortiori, it is a T,-topological space. Hence a 
T; -uniform space is also said to be a Haus- 
dorff uniform space (or separated uniform 
space). Moreover, a uniform topology satisfies 
*Tikhonov's separation axiom; in particular, a 
T, -uniform space is *completely regular. 


D. Examples 


(1) Discrete Uniformity. Let X be a nonempty 
set, and let 4 2 (U|A, CU c X x X jJ. Then 
(X, 2%) is a T,-uniform space and :2 — (A,] isa 
base for 4/. This uniformity is called the dis- 
crete uniformity for X. 


(2) Uniform Family of Neighborhood System. 
A family {U,(x)},.4(xeX) of subsets of a set X 
is called a uniform neighborhood system in X if 
it satisfies the following four requirements: (1) 
x € U,(x) for each xe A and each xe X; (ii) if x 
and y are distinct elements of X, then y¢ U,(x) 
for some ae A; (iii) if x and f are two elements 
of A, then there is another element ye A such 
that U,(x) c U,(x)N U,(x) for all xe X; (iv) if x 
is an arbitrary element in A, then there is an 
element f in A such that ye U,(x) whenever x, 
y € U,(z) for some z in X. If we denote by 

U, (x € A) the subset of X x X consisting of all 
elements (x, y) such that xe X and ye U,(x), 
then the family {U,|«e A} satisfies all the 
conditions for a base for a uniformity. In par- 
ticular, it follows from (ii) that TL 4, U,— Ax, 
which is a stronger condition than (ii) in 
Section B. For instance, if (U,|xe A} is a base 
for the neighborhood system at the identity 
element of a T,-topological group G, then we 
have two uniform neighborhood systems 
(U1(x)) and (U7(x)], where U1(x) - xU, and 
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U;(x)- U,x. Two uniformities derived from 
these uniform neighborhood systems are called 
a Heft uniformity and a tright uniformity, 
respectively. Generally, these two uniformities 
do not coincide (— 423 Topological Groups). 


(3) Uniform Covering System [4]. A family 
Ul, ju 4 Of tcoverings of a set X is called a 
uniform covering system if the following three 
conditions are satisfied: (i) if U is a covering 
of X such that U~U, for all xe A, then U 
coincides with the covering A={ {x} ly; 

(ii) if «, B € A, then there is a ye A such that 
V, ROT, and UU. (itt) if xe A, then there 
is a BEA such that U; is a *A-refinement of 
uU, (U;)^<U,). For an example of a uni- 
form covering system of X, suppose that we 
are given a uniform neighborhood system 
{Us }aea (XE X). Let 4t, — (Uo) (ee A). 
Then (4,],., is a uniform covering system. 
On the other hand, for a covering U= (U;1,,4, 
let S(x, W) be the union of all members of 

U that contain x. If {U,},.4 is a uniform 
covering system and U,(x) — S(x, U,), then 
TUN, (xe X) is a uniform neighborhood 
system. Hence defining a uniform covering 
system of X is equivalent to defining a T, - 
uniformity on X. 


(4). In a metric space (x, d) the subsets U, = 
(G5 y)|d(x, Y) «rj, r» 0, form a base of uni- 
formity. The uniform topology defined by this 
coincides with the topology defined by the 
metric. 


E. Some Notions on Uniform Spaces 


Some of the terminology concerning topolog- 
ical spaces can be restated in the language of 
uniform structures. À mapping f from a uni- 
form space (X, %) into another (X’, 4) is said 
to be uniformly continuous if for each member 
U' in 4' there is a member U in Y such that 
(f(x), f(y) e U' for every (x, y) eU. This con- 
dition implies that fis continuous with respect 
to the uniform topologies of the uniform 
spaces. Equivalently, the mapping is uniformly 
continuous with respect to the uniform neigh- 
borhood system TU GH). if for any index H 
there is an index « such that ye U,(x) implies 
fL ye Usl f(x). If f: X o X' and g: X' O. X" are 
uniformly continuous, then the composite 
oof X X" is also uniformly continuous. A 
bijection f of a uniform space (X, 4/) to another 
(X', 4") is said to be a uniform isomorphism if 
both fand f! are uniformly continuous; in 
this case (X, %) and (X', 4) are said to be 
uniformly equivalent. A uniform isomorphism 
is a homeomorphism with respect to the uni- 
form topologies, and a uniform equivalence 
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defines an equivalence relation between uni- 
form spaces. 

If 4^, and 4, are uniformities for a set X, we 
say that the uniformity X, is stronger than the 
uniformity V, and 4 ; is weaker than %, if the 
identity mapping of (X,4/,) to (X,4/ ;) is uni- 
formly continuous. The discrete uniformity is 
the strongest among the uniformities for a set 
X. The weakest uniformity for X is defined by 
the single member X x X; this uniformity is 
not a T,-uniformity unless X is a singleton. 
Generally, there is no weakest T,-uniformity. 
A uniformity A, for X is stronger than an- 
other 4, if and only if every member of 4, is 
also a member of %,. 

If f is a mapping from a set X into a uniform 
space (Y, ^) and g is the mapping of X x X into 
Y x Y defined by g(x, y) 2 ( f(x), f( y)), then 
B={g (Hei) satisfies conditions (ii)- 
(v) in Section B for a base for a uniformity. 
The uniformity % for X determined by Z is 
called the inverse image of the uniformity Y^ 
for Y by f; Y is the weakest uniformity for X 
such that f is uniformly continuous. Hence a 
mapping f from a uniform space (X, 4/) into 
another (Y, Y^) is uniformly continuous if and 
only if the inverse image of the uniformity Y^ 
under f is weaker than the uniformity X. If A is 
a subset of a uniform space (X, %), then there 
is a uniformity Y^ for A determined as the 
inverse image of 4/ by the inclusion mapping 
of A into X. This uniformity *^ for A is called 
the relative uniformity for A induced by X, or 
the relativization of X to A, and the uniform 
space (A, 7^) is called a uniform subspace of 
(X, 4). The uniform topology for (A, ^) is the 
relative topology for A induced by the uniform 
topology for (X, 4). 

If ((X,, 4;)) 18 a family of uniform 
spaces, then the product uniformity for X = 
II. X; is defined to be the weakest uni- 
formity Y such that the projection of X 
onto each X, is uniformly continuous, and 
(X,U) is called the product uniform space of 
{(X,, Rn, The topology for (X, %) is the 
product of the topologies for (X;, 4/;) (AEA). 


F. Metrization 


Each *pseudometric d for a set X generates a 
uniformity in the following way. For each 
positive number r, let V, ,={(x, y)e X x 

X |d(x, y) « r]. Then the family (V, ,|r» 0] 
satisfies conditions (ii)-(v') in Section B for a 
base for a uniformity %. This uniformity is 
called the pseudometric uniformity or uniform- 
ity generated by d. The uniform topology for 
(X,4/) is the pseudometric topology. A uni- 
form space (X, Y) is said to be pseudometrizable 
(metrizable) if there is a pseudometric (metric) 
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d such that the uniformity X is identical with 
the uniformity generated by d. A uniform 
space is pseudometrizable if and only if its 
uniformity has a countable base. Conse- 
quently, a uniform space is metrizable if and 
only if its uniformity is a T,-uniformity and 
has a countable base. For a family P of 
pseudometrics on a set X, let V, „= (x, ye 
X x X |d(x, y) «rj for de P and positive r. 
The weakest uniformity containing every 
V, , (de P,r 0) is called the uniformity gen- 
erated by P. This uniformity may also be de- 
scribed as the weakest one such that each 
pseudometric in P is uniformly continuous on 
X x X with respect to the product uniformity. 
Each uniformity X on a set X coincides 
with the uniformity generated by the family Py 
of all pseudometrics that are uniformly con- 
tinuous on X x X with respect to the product 
uniformity of % with itself. It follows that each 
uniform space is uniformly isomorphic to a 
subspace of a product of pseudometric spaces 
(in which the number of components is equal 
to the cardinal number of Py} and that each 
T,-uniform space is uniformly isomorphic to a 
subspace of a product of metric spaces. A 
topology t for a set X is the uniform topology 
for some uniformity for X if and only if the 
topological space (X, t) satisfies *Tikhonov’s 
separation axiom; in particular, the uniformity 
is a T,-uniformity if and only if (X, 1) is tcom- 
pletely regular. 


G. Completeness 


If (X, 4) is a uniform space, a subset A of X is 
called a small set of order U(U € 4) if Ax Ac 
U. A ?filter on X is called a Cauchy filter 
(with respect to the uniformity %) if it contains 
a small set of order U for each U in X. Ifa 
filter on X converges to some point in X, then 
it is a Cauchy filter. If fis a uniformly con- 
tinuous mapping from a uniform space X into 
another X', then the image of a base for a 
Cauchy filter on X under fis a base for a 
Cauchy filter on X'. A point contained in the 
closure of every set in a Cauchy filter dé is the 
limit point of %. Hence if a filter converges to 
x, a Cauchy filter contained in the filter also 
converges to x. 

A tnet x(W) = (x, La (where 9I is a directed 
set with a preordering <) in a uniform space 
(X, 4) is called a Cauchy net if for each U in % 
there is a y in W such that (x,,x,)¢ U for every 
x and f such that y x x, y < f. If Wis the set N 
of all natural numbers, a Cauchy net {x,},en 
is called a Cauchy sequence (or fundamental 
sequence). Given a Cauchy net {x,y let A, 
—ixg| B z aj. Then B={A,|xe} is a base for 
a filter, and the filter is a Cauchy filter. On the 
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other hand, let 8 be a base for a Cauchy filter 
8. For U, He we put U x Vif and only if 

U >V. Then 8 is a directed set with respect to 
<S. The net {xy tveg, where xy is an arbitrary 
point in U, is a Cauchy net. A proposition 
concerning convergence of a Cauchy filter is 
always equivalent to a proposition concerning 
convergence of the corresponding Cauchy net. 

A Cauchy filter (or Cauchy net) in a uni- 
form space X does not always converge to a 
point of X. A uniform space ts said to be com- 
plete (with respect to the uniformity) if every 
Cauchy filter (or Cauchy net) converges to a 
point of that space. A complete uniform space 
is called for brevity a complete space. A closed 
subspace of a complete space is complete with 
respect to the relative uniformity. À pseudo- 
metrizable uniform space is complete if and 
only if every Cauchy sequence in the space 
converges to a point. Hence in the case of a 
metric space, our definition of completeness 
coincides with the usual one (— 273 Metric 
Spaces). 

A mapping f from a uniform space X to an- 
other X' is said to be uniformly continuous on 
a subset A of X if the restriction of f to A is 
uniformly continuous with respect to the rela- 
tive uniformity for A. If fis a uniformly con- 
tinuous mapping from a subset A of a uniform 
space into a complete T,-uniform space, then 
there is a unique uniformly continuous exten- 
sion f of f on the closure A. 

Each T,-uniform space is uniformly equiva- 
lent to a dense subspace of a complete T,- 
uniform space; this property ts a generaliza- 
tion of the fact that each metric space can be 
mapped by an isometry onto a dense subset of 
a complete metric space. A completion of a 
uniform space (X, 7) is a pair (f,(X*, @*)), 
where (X *, 4/*) is a complete space and f is 
a uniform isomorphism of X onto a dense 
subspace of X*. The T,-completion of a 
T,-uniform space is unique up to uniform 
equivalence. 


H. Compact Spaces 


A uniformity 4/ for a topological space (X, t) is 
said to be compatible with the topology t if the 
uniform topology for (X, 4/) coincides with t. 
A topological space (X, 1) is said to be unifor- 
mizable if there is a uniformity compatible 
with c. If (X,7) is a compact Hausdorff space, 
then there is a unique uniformity % compa- 
tible with t; in fact, 4 consists of all neighbor- 
hoods of the diagonal A, in X x X; and the 
compact Hausdorff space is complete with this 
uniformity. Hence every subspace of a com- 
pact Hausdorff space is uniformizable, and 
every ‘locally compact Hausdorff space is 
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uniformizable. Any continuous mapping from 
a compact Hausdorff space to a uniform space 
is uniformly continuous. A uniform space 

(X, %) is said to be totally bounded (or precom- 
pact) if for each U e€% there is a finite covering 
consisting of small sets of order U; a subset of 
a uniform space is called totally bounded if it is 
totally bounded with respect to the relative 
uniformity. A uniform space X is said to be 
locally totally bounded if for each point of X 
there is a base for a neighborhood system 
consisting of totally bounded open subsets. A 
uniform space is compact if and only if it is 
totally bounded and complete. If f is a uni- 
formly continuous mapping from a uniform 
space X to another, then the image f(A) of 

a totally bounded subset A of X is totally 
bounded. 


I. Topologically Complete Spaces 


A topological space (X, x) is said to be topo- 
logically complete (or Dieudonné complete) if it 
admits a uniformity compatible with t with 
respect to which X is complete. Each *para- 
compact Hausdorff space is topologically 
complete. Actually such a space is complete 
with respect to its strongest uniformity. À 
Hausdorff space which is homeomorphic to 

a *G,-set in a compact Hausdorff space is 

said to be Cech-complete; A metric space is 
homeomorphic to a complete metric space if 
and only if it is Cech-complete. A Hausdorff 
space X is paracompact and Cech-complete if 
and only if there is a *perfect mapping from X 
onto a complete metric space. 
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A. Definitions 


A homomorphism U of a ttopological group G 
into the group of *unitary operators on a 
*Hilbert space $ (4 (01) is called a unitary 
representation of G if U is strongly continuous 
in the following sense: For any element xe 9, 
the mapping g  U,x is a continuous mapping 
from G into $. The Hilbert space § is called 
the representation space of U and is denoted 
by $(U). Two unitary representations U and 
U' are said to be equivalent (similar or isomor- 
phic), denoted by U = U', if there exists an 
"isometry T from §(U) onto §(U’) that satis- 
fies the equality To U, — U,o T for every g in 
G. If the representation space $(U) contains 
no closed subspace other than $ and {0} that 
is invariant under every U,, the unitary repre- 
sentation U is said to be irreducible. An element 
x in §(U) is called a cyclic vector if the set of 
all finite linear combinations of the elements 
U,x(ge G) is dense in $(U). A representation 
U having a cyclic vector is called a cyclic 
representation. Every nonzero element of the 
representation space of an irreducible repre- 
sentation is a cyclic vector. 


Examples. Let G be a ‘topological transfor- 
mation group acting on a *locally compact 
Hausdorff space X from the right. Suppose 
that there exists a Radon measure u that is 
invariant under the group G. Then a unitary 
representation R" is defined on the Hilbert 
space $ = L?(X, u) by the formula (R^ f)(x) 

= f(xg) (fe$, xe X,geG). The representation 
R" 1s called the regular representation of G 

on (X, u). If G acts on X from the left, then 
the regular representation L" is defined by 
(Lf (x) Ta ! x). In particular, when X is 
the *quotient space H\G of a tlocally compact 
group G by a closed subgroup H, any two 
invariant measures H, H (if they exist) coincide 
up to a constant factor. Hence the regular 
representation R” on (X, ul and the regular 
representation R” on (X, j/) are equivalent. In 
this case, the representation R" is called the 
regular representation on X. When H - (ej, a 
locally compact group G has a Radon measure 
#0 that is invariant under every right (left) 
translation hg (hgh) and is called a right 
(left) *Haar measure on G. So G has thc regu- 
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lar representation R (L) on G. R (L) is called 
the right (left) regular representation of G. 


B. Positive Definite Functions and Existence 
of Representations 


A complex-valued continuous function o on a 
topological group G is called positive definite 
if the matrix having o(g; ! gj as the (i,j)- 
component is a tpositive semidefinite Her- 
mitian matrix for any finite number of ele- 
ments g,,...,g, in G. If U is a unitary repre- 
sentation of G, then the function @(g)=(U,x, x) 
is positive definite for every element x in §(U). 
Conversely, any positive definite function (g) 
on a topological group G can be expressed as 
Q9 (g) « (U,x, x) for some unitary representation 
U and x in $(U). Using this fact and the 
*Krein-Milman theorem, it can be proved that 
every locally compact group G has sufficiently 
many irreducible unitary representations in the 
following sense: For every element g in G other 
than the identity element e, there exists an 
irreducible unitary representation U, generally 
depending on g, that satisfies the inequality 

U, #1. The groups having sufficiently many 
finite-dimensional (irreducible) unitary repre- 
sentations are called tmaximally almost 
periodic. If a connected locally compact group 
G is maximally almost periodic, then G is the 
direct product of a compact group and a vec- 
tor group R”. On the other hand, any non- 
compact connected ‘simple Lie group has no 
finite-dimensional irreducible unitary repre- 
sentation other than the unit representation 
g1 (— 18 Almost Periodic Functions). 


C. Subrepresentations 


Let U be a unitary representation of a topo- 
logical group G. A closed subspace St of §(U) 
is called U-invariant if 9t is invariant under 
every U, (ge G). Let R# {0} be a closed invar- 
iant subspace of $(U) and V, be the restric- 
tion of U, on 9t. Then V is a unitary represen- 
tation of G on the representation space 9t and 
is called a subrepresentation of U. Two unitary 
representations L and M are called disjoint if 
no subrepresentation of L is equivalent to a 
subrepresentation of M; they are called quasi- 
equivalent if no subrepresentation of L is dis- 
joint from M and no subrepresentation of M is 
disjoint from L. 


D. Irreducible Representations 


Let U be a unitary representation of G, M 
be the "von Neumann algebra generated by 
{U,|g¢G}, and M’ be the tcommutant of M. 
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Then a closed subspace 9t of §(U) is invariant 
under U if and only if the tprojection operator 
P corresponding to 9t belongs to M'. There- 
fore U is irreducible if and only if M' consists 
of scalar operators (a1 |xe C] (Schur's lemma). 
A representation space of a cyclic or irreduc- 
ible representation of a tseparable topological 
group is tseparable. 


E. Factor Representations 


A unitary representation U of G is called a 
factor representation if the von Neumann 
algebra M = (U,|ge G} is a tfactor, that is, 
MM’ = {al|aeC}. Two factor representa- 
tions are quasi-equivalent if and only if they 
are not disjoint. U is called a factor represen- 
tation of type I, II, or HI if the von Neumann 
algebra M is a factor of ftype I, H, or III, 
respectively (— 308 Operator Algebras). A 
topological group G is called a group of type I 
(or type I group) if every factor representation 
of G is of type I. Compact groups, locally 
compact Abelian groups, connected tnilpotent 
Lie groups, connected tsemisimple Lie groups, 
and real or complex ‘linear algebraic groups 
are examples of groups of type I. There exists a 
connected solvable Lie group that is not of 
type I (— Section U), but a connected solvable 
Lie group is of type I if the exponential map- 
ping is surjective (O. Takenouchi). A discrete 
group G with countably many elements is a 
type I group if and only if G has an Abelian 
normal subgroup with finite index (E. Thoma). 


F. Representation of Direct Products 


Let G, and G, be topological groups, G the 
‘direct product of G, and G, (G =G; x G,), and 
U; an irreducible unitary representation of G; 
(i= 1, 2). Then the ttensor product representa- 
tion U, & U;:(g,, 97) U,, ® U,, is an irreduc- 
ible unitary representation of G. Conversely, 
if one of the groups G, and G, is of type I, then 
every irreducible unitary representation of G 
is equivalent to the tensor product U, & U, 

of some irreducible representations U; of G; 
(i= 1,2), 


G. Direct Sums 


If the representation space § of a unitary 
representation U is the tdirect sum @,-7 $(a) 
of mutually orthogonal closed invariant sub- 
spaces {§(«)},.;, then U is called the direct 
sum of the subrepresentations U(«) induced on 
Dia) by U, and is denoted by U = @,,., U(a). 
Any unitary representation is the direct sum of 
cyclic representations. A unitary representa- 
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tion U is called a representation without multi- 
plicity if U cannot be decomposed as a direct 
sum U, OU, unless U, and U, are disjoint. If 
U is the direct sum of (U(a)],., and every U (a) 
is irreducible, then U is said to be decomposed 
into the direct sum of irreducible representa- 
tions. Decomposition into direct sums of 
irreducible representations is essentially 
unique if it exists; that is, if U = @,., U(a) 

= (py, V(P) are two decompositions of U 
into direct sums of irreducible representations, 
then there exists a bijection o from J onto J 
such that U(a) is equivalent to V(q(a)) for 
every xin J. A factor representation U of type 
I can be decomposed as the direct sum U — 
aer U(x) of equivalent irreducible represen- 
tations U(a). In general, a unitary representa- 
tion U cannot be decomposed as the direct 
sum of irreducible representations even if U is 
not irreducible. Thus it becomes necessary to 
use direct integrals to obtain an irreducible 
decomposition. 


H. Direct Integrals 


Let U be a unitary representation of a group G 
and (X, u) be a *measure space. Assume that 
the following two conditions are satisfied by 
U: (i) There exists a unitary representation 

U (x) of G corresponding to every element x 
of X, and §(U) is a ‘direct integral (— 308 
Operator Algebras) of §(U(x)) (xe X) (written 
$(U)- [s $(U GO) dux); (ii) for every g in G, 
the operator U, is a decomposable operator 
and can be written as U, — fx U,(x)du(x). Then 
the unitary representation U is called the 
direct integral of the family {U (x)} xex of uni- 
tary representations and is denoted by U= 

[x U(x)du(x). If every point of X has mea- 
sure 1, then a direct integral is reduced to a 
direct sum. 


I. Decomposition into Factor Representations 


We assume that G is a locally compact group 
satisfying the second countability axiom, and 
also that a Hilbert space is separable. Every 
unitary representation U of G can be decom- 
posed as a direct integral U =|, U(x)du(x) 

in such a way that the center A of the von 
Neumann algebra M" = (U,|ge GJ" is the set 
of all *diagonalizable operators. In this case 
almost all the U(x) are factor representations. 
Such a decomposition of U is essentially 
unique. There exists a tnull set N in X such 
that for every x and x’ in X — N (x zx), U(x) 
and U(x’) are mutually disjoint factor repre- 
sentations. Hence the space X can be identified 
with the set G* of all quasi-equivalence classes 
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of factor representations of G endowed with a 
suitable structure of a measure space. The 
space G* is called the quasidual of G. The 
measure u is determined by U up to tequiva- 
lence of measures. 


J. Duals 


A topology is introduced on the set G of all 
equivalence classes of irreducible unitary rep- 
resentations of a locally compact group G in 
the following way. Let H, be the n-dimensional 
Hilbert space /,(n) and I, the set of all irreduc- 
ible unitary representations of G realized on 
H, (1 <n< oc). We topologize I, in such a way 
that a tnet {U^}; ez in I, converges to U if and 
only if (U7 x, y) converges uniformly to (U,x, y) 
on every compact subset of G for any x and y 
in H,,. Equivalence between representations in 
I, is an open relation. Let G, be the set of all 
equivalence classes of n-dimensional irreduc- 
ible unitary representations of G with the 
topology of a quotient space of J, and G — 

BE G, be the direct sum of topological spaces 
G,. Then the topological space G is called the 
dual of G. G is a locally compact tBaire space 
with countable open base, but it does not 
satisfy the tHausdorff separation axiom in 
general. If G is a compact Hausdorff topolog- 
ical group, then G is discrete. If G is a locally 
compact Abelian group, then G coincides with 
the *character group of G in the sense of 
Pontryagin. If G is a type I group, then there 
exists a dense open subset of G that is a locally 
compact Hausdorff space. The to-additive 
family generated by closed sets in G is denoted 
by 38. In the following sections, a measure on 
G means a measure defined on 3B. 


K. Irreducible Decompositions 


In this section G is assumed to be a locally 
compact group of type I with countable open 
base. For any equivalence class x in G, we 
choose a representative U(x)ex with the rep- 
resentation space H(U(x)) — L; (n) if x is n- 
dimensional. For any measure p on C, the 
representation Ur = fc U(x)d(x) is a unitary 
representation without multiplicity. Con- 
versely, any unitary representation of G with- 
out multiplicity is equivalent to a U" for some 
measure u on G. Moreover, U” is equivalent to 
U” if and only if the two measures u and v are 
equivalent (that is, u is absolutely continuous 
with respect to v, and vice versa). A unitary 
representation U with multiplicity on a sepa- 
rable Hilbert space $ can be decomposed as 
follows: There exists a countable set of mea- 
SUITES Ui, UH. Whose supports are mutu- 
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ally disjoint such that U = (a U(x)du, (x) ® 
2le UG) du5(x) ® ... & oo fa UGO du. (x). 

The measures 4, fo, ..., Ho are uniquely 
determined by U up to equivalence of mea- 
sures. Any unitary representation U on à 
separable Hilbert space § of an arbitrary 
locally compact group with countable open 
base (even if not of type I) can be decomposed 
as a direct integral of irreducible representa- 
tions. In order to obtain such a decompo- 
sition, it is sufficient to decompose $ asa 
direct integral in such a way that a maximal 
Abelian von Neumann subalgebra A of M' 
={U,|geG}' is the set of all diagonalizable 
operators. In this case, however, a different 
choice of A induces in general an essentially 
different decomposition, and uniqueness of the 
decomposition does not hold. For a group of 
type L the irreducible representations are the 
“atoms” of representations, as in the case of 
compact groups. For a group not of type I, it 
is more natural to take the factor represen- 
tations for the trreducible representations, 
quasi-equivalence for the equivalence, and the 
quasidual for the dual of G. Therefore the 
theory of unitary representations for a group 
not of type I has different features from the 
one for a type I group. The theory of unitary 
representation for groups not of type I has not 
yet been successfully developed, but some 
important results have been obtained (e.g., L. 
Pukanszky, Ann. Sci. Ecole Norm. Sup., 4 
(1971)). 

Tatsuuma [1] proved a duality theorem for 
general locally compact groups which is an 
extension of both Pontryagin's and Tannaka's 
duality theorems considering the direct in- 
tegral decomposition of tensor product 
representations. 


L. The Plancherel Formula 


Let G be a unimodular locally compact group 
with countable open base, R(L) be the right 
(left) regular representation of G, and M, N, 
and P be the von Neumann algebras generated 
by {R,}, {L,}, and {R,, L,}, respectively. Then 
M’=N, N’=M, and P'’'= MANAN. If we decom- 
pose $ into a direct integral in such a way 
that P’ is the algebra of all diagonalizable 
operators, then M(x) and N(x) are factors for 
almost all x. This decomposition of $ pro- 
duces a decomposition of the two-sided regu- 
lar representation (R,, L,} into irreducible 
representations and a decomposition of the 
regular representation R(L) into factor repre- 
sentations. Hence the decomposition is realized 
as the direct integral over the quasidual G* of 
G. Moreover, the factors M(x) and N(x) are of 
type I or II for almost all x in G*, and there 
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exists a ttrace t in the factor M(x). For any f 
and g in L,(G)M L,(G), the Plancherel formula 


| fists) as= | (U* (x) U,(x)) du(x) (1) 
G G* 


holds, where U,(x) =, f(s)U,(x)ds and U* is 
the tadjoint of U. The inversion formula 


i | EU, (X) UT (x)) du(x) (2) 
G* 


is derived from (1) for a function A — f *g 
(f.ge L i(G)N L,(G)). In (1) and (2), because of 
the impossibility of normalization of the trace 
t in a factor of type H, the measure u cannot 
in general be determined uniquely. However, if 
G is a type I group, then (1) and (2) can be 
rewritten as similar formulas, where the repre- 
sentation U(x) in (1) and (2) is irreducible, the 
trace t is the usual trace, and the domain of 
integration is not the quasidual G* but the 
dual G of G. The revised formula (1) is also 
called the Plancherel formula. In this case the 
measure u on G in formulas (1) and (2) is 
uniquely determined by the given Haar mea- 
sure on G. The measure u is called the Plan- 
cherel measure of G. The support G, of the 
Plancherel measure u is called the reduced dual 
of G. The Plancherel formula gives the direct 
integral decomposition of the regular repre- 
sentation into the irreducible representations 
belonging to G,. Each U in G, is contained in 
this decomposition, with the multiplicity equal 
to dim §(U). 


M. Square Integrable Representations 


An irreducible unitary representation U ofa 
unimodular locally compact group G is said to 
be square integrable when for some element 

x £0, in $(U), the function o(g) =(U,x, x) 
belongs to L?(G, dg), where dg is the Haar 
measure of G. If U is square integrable, then 
vm, ,(g) =(U,x, y) belongs to L^(G, dg) for any x 
and y in $(U). Let U and U’ be the two square 
integrable representations of G. Then the 
following orthogonality relations hold: 


| (U,x, y (U,u, v) dg 
G 


0 if U is not 
d equivalent to U', (3) 
dy'(x,u)(v,y) ifU=U". 


When G is compact, every irreducible unitary 
representation U is square integrable and 

finite-dimensional. Moreover, the scalar dy in 
(3) is the degree of U if the total measure of G 
is normalized to 1. In the general case, the 

scalar dy in (3) ts called the formal degree of U 
and is determined uniquely by the given Haar 
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measure dg. Let y be an element in §(U) with 
norm 1 and V be the subspace To, ,|xe $(U)] 
of L?(G). Then the linear mapping T:x5 dy 
(x,y is an isometry of $(U) onto V. Hence U is 
equivalent to a subrepresentation of the right 
regular representation R of G. Conversely, 
every irreducible subrepresentation of R is 
square integrable. Thus a square integrable 
representation ts an irreducible subrepresen- 
tation of R (= L). Therefore, in the irreducible 
decomposition of R, the square integrable 
representations appear as discrete direct sum- 
mands. Hence every square integrable repre- 
sentation U has a positive Plancherel measure 
u(U) that is equal to the formal degree dy. 
There exist noncompact groups that have 
square integrable representations. An example 
of such a group is SL(2, R) (— Section X). 


N. Representations of L(G) 


Let G be a locally compact group and L,(G) 
be the space of all complex-valued integrable 
functions on G. Then L,(G) is an algebra over 
C, where the convolution 


(«o | f(st ^ ')g(t)dt 
G 


is defined to be the product of f and g. Let A 
be the *modular function of G. Then the map- 
ping f(s) f *(s) Ais ^ !)f(s !)is an *invol- 
ution of the algebra L,(G). Let U be a unitary 
representation of G, and put U', — Le U, f (s) ds. 
Then the mapping f — UL, gives a nondegen- 
erate representation of the Banach algebra 
L4(G) with an involution, where nondegenerate 
means that (U',x| f e L.(G), xe$(U)]^ reduces 
to {0}. The mapping UU’ gives a bijection 
between the set of equivalence classes of uni- 
tary representations of G and the set of equiv- 
alence classes of nondegenerate representa- 
tions of the Banach algebra L,(G) with an 
involution on Hilbert spaces. U is an irreduc- 
ible (factor) representation if and only if U' ts 
an irreducible (factor) representation. There- 
fore the study of unitary representations of G 
reduces to that of representations of L,(G). If 
U; is a ‘compact operator for every f in L,(G), 
then U is the discrete direct sum of irreducible 
representations, and the multiplicity of every 
irreducible component is finite. (See [2] for 
Sections A-N.) 


O. Induced Representations 


Induced representation is the method of con- 
structing a representation of a group G in a 
canonical way from a representation of a 
subgroup H of G. It is a fundamental method 
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of obtaining a unitary representation of G. Let 
G be a locally compact group satisfying the 
second countability axiom, L be a unitary 
representation on a separable Hilbert space 
§(L) of a closed subgroup H of G, and m, n, 

A, and 6 be the right Haar measures and the 
modular functions of the groups G and H, 
respectively. Then there exists a continuous 
positive function p on G satisfying p(hg)= 
d(h)Ath) ! p(g) for every hin H and g in G. 
The *quotient measure y —(pm)/n is a quasi- 
invariant measure on the coset space H\G (— 
225 Invariant Measures). Let $ be the vector 
space of weakly measurable functions f on G 
with values in $(L) satisfying the following 
two conditions: (i) f(hg) = L,f(g) for every h in 
H and g in G; and (ii) [£^ — [c Lei? duo) 
< +00, where g represents the coset Hg. By 
condition (i), the norm || f(g)|| is constant on a 
coset Hg — j and is a function on H\G, so the 
integral in condition (ii) is well defined. Then § 
is a Hilbert space with the norm defined in (ii). 
A unitary representation U of G on the Hilbert 
space $ is defined by the formula 


(USD= V p(9s)/p(g) f(gs). 


U is called the unitary representation induced 
by the representation L of a subgroup H and is 
denoted by U — U* or Ind§ L. Induced repre- 
sentations have the following properties. 

(1) ULE: = U4: @ UÙ or more generally, 
USU ~ ( ULC) du(x). Therefore if U" is 
irreducible, L 1s also irreducible (the converse 
does not hold in general). 

(2) Let H, K be two subgroups of G such 
that H c K, L bea unitary representation of H, 
and M be the representation of K induced by 
L. Then two unitary representations UN and 
U of G are equivalent. 

An induced representation U^ is the repre- 
sentation on the space of square integrable 
sections of the *vector bundle with fiber H(L) 
‘associated with the principal bundle (G, H\ 
G, H) (^ G. W. Mackey [3], F. Bruhat [4]). 


P. Unitary Representations of Special Groups 


In the following sections we describe the fun- 
damental results on the unitary representa- 
tions of certain special groups. 


Q. Compact Groups 


Irreducible unitary representations of a com- 
pact group are always finite-dimensional. 
Every unitary representation of a compact 
group is decomposed into the direct sum of 
irreducible representations. Irreducible unitary 
representations of a compact connected Lie 
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group are completely classified. The characters 
of irreducible representations are calculated in 
an explicit form (— 69 Compact Groups; 249 
Lie Groups). Every irreducible unitary repre- 
sentation U of a connected compact Lie group 
G can be extended uniquely to an irreducible 
holomorphic representation U* of the com- 
plexification G* of G. US is holomorphically 
induced from a 1-dtmensional representation 
of a Borel subgroup B of G^ (Borel-Weil 
theorem; — R. Bott [5]). 


R. Abelian Groups 


Every irreducible unitary representation of 
an Abelian group G is 1-dimensiona!. *Stone’s 
theorem concerning one-parameter groups 

of unitary operators, U, = |2 e'™ dE}, gives 
irreducible decompositions of unitary repre- 
sentations of the additive group R of real 
numbers. *Bochner's theorem on tpositive 
definite functions on R is a restatement of 
Stone's theorem in terms of positive definite 
functions. The theory of the *Fourier trans- 
form on R, in particular tPlancherel’s theorem, 
gives the irreducible decomposition of the 
regular representation of R. The theorems of 
Stone, Bochner, and Plancherel have been 
extended to an arbitrary locally compact 
Abeltan group (— 192 Harmonic Analysis). 


S. Representations of Lie Groups and Lie 
Algebras 


Let U be a unitary representation of a Lie 
group G with the Lie algebra g. An element x 
in $(U) is called an analytic vector with respect 
to U if the mapping g- U, x is a real analytic 
function on G with values in $(U). The set of 
all analytic vectors with respect to U forms a 
dense subspace Y= 9I(U) of $(U). For any 
elements X in g and x in 9I(U), the derivative 
at t=0 of a real analytic function U,,,,yx is 
denoted by V(X)x. Then V(X) ts a linear 
transformation on 9I, and the mapping V: X 
— V(X) is a representation of g on W. We call 
V the differential representation of U. The rep- 
resentation V of g can be extended uniquely 
to a representation of the *universal en- 
veloping algebra 8 of g. Two unitary repre- 
sentations UI! and U®) of a connected Lie 
group G are equivalent if and only if there 
exists a bijective bounded linear mapping T 
from $(U'!) onto $(U'?) such that T maps 
AU) onto 9I(UO?) and satisfies the equality 


(To V9X))x Z(V?(X)o T)x 


for all X ing and x in WU), Let X,,..., X, 
be a basis of g and U be a unitary representa- 
tion of G. Then the element A= X24 ....- X? 
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in the universal enveloping algebra $8 of g is 
represented in the differential representation V 
of U by an ‘essentially self-adjoint operator 
V(A). Conversely, if to each element X in g 
there corresponds a (not necessarily bounded) 
*skew-Hermitian operator p(x) that satisfies 
the following three conditions, then there 
exists a unique unitary representation U of the 
simply connected Lie group G with the Lie 
algebra g such that the ‘closure of V(X) coin- 
cides with the closure of p(X) for every X in g: 
(i) There exists a dense subspace © contained 
in the domain of p(X)p(Y) for every X and Y 
in g; (ii) for each X and Y ing, a and b in R, 
and x in O, p(aX +bY)x=ap(X)x+ bp(Y)x, 
p( X, Y )x - G(X)p(Y) — p(Y)p(X)x; (ii) the 
restriction of p(X,)*+...+p(X,)? to © is an 
essentially self-adjoint operator if X,,..., X, is 
a basis of g (E. Nelson [6]). 


T. Nilpotent Lie Groups 


For every irreducible unitary representation of 
a connected nilpotent Lie group G, there is 
some 1-dimensional unitary representation of 
some subgroup of G that induces it. Let G be a 
simply connected nilpotent Lie group, g be the 
Lie algebra of G, and p be the contragredient 
representation of the adjoint representation of 
G. The representation space of p ts the dual 
space q* of g. A subalgebra b of g is called 
subordinate to an element f in g* if f annihi- 
lates each bracket [ X, Y] for every X and Y in 
b: (f, LX, Y ]) 20. When b is subordinate to f, a 
1-dimensional unitary representation L of the 
analytic subgroup H of G with the Lie algebra 
b is defined by the formula 4,(exp X) — e^". ? 
(X €b). Every 1-dimensional unitary represen- 
tation 4, of H is defined as in this formula by 
an element f in g* to which b is subordinate. 
The unitary representation of G induced by 
such a 4, is denoted by U(f, b). The represen- 
tation U( f, b) is irreducible if and only if b has 
maximal dimension among the subalgebras 
subordinate to f. Two irreducible represent- 
ations U( f, b) and U( f. b) are equivalent if and 
only if f and f' are conjugate under the group 
p(G). Therefore there exists a bijection be- 
tween the set of equivalence classes of the 
irreducible unitary representations of a simply 
connected nilpotent Lie group G and the set of 
orbits of p(G) on af (A. A. Kirillov [7]). 


U. Solvable Lie Groups 


Let G be a simply connected solvable Lie 
group. If the exponential mapping is bijective, 
G is called an exponential group. All results 
stated above for nilpotent Lie groups hold for 
exponential groups except the irreducibility 
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criterion. In this case the representation U( f, b) 
is irreducible if and only if b is of maximal 
dimension among subordinate subalgebras 
and the orbit O= p(G)f contains the affine 
subspace f -- b^ = f - (g|g(b) OI (Pukanszky 
condition). 

The situation is more complicated for gen- 
eral solvable Lie groups. The isotropy sub- 
group G,— (ge G|p(g)f — f) at feg* is, in 
general, not connected. A linear form f is 
called integral if there exists a unitary charac- 
ter n; of G; whose differential is the restriction 
of 2nif to g; (the Lie algebra of G,). Using the 
notion of "polarization," an irreducible unitary 
representation of G is constructed from a pair 
(f. y) of an integral form f €g* and a character 
ge If G is of type I, then every irreducible 
unitary representation of G is obtained in this 
way. A simply connected solvable Lie group G 
is of type I if and only if (i) every f'eg* is in- 
tegral and (ii) every G-orbit p(G)f in g* is 
locally closed (Auslander and Kostant [8 ]). 

As an example, let x be an irrational real 
number. Then the following Lie group G is not 


e" 0 z 
of type /:G=<| 0 e" 1 reRawech. 
0 0 1 


V. Semisimple Lie Groups 


A connected semisimple Lie group is of type I. 
The character y = yy of an irreducible unitary 
representation U of G is defined as follows: Let 
C$ (G) be the set of all complex-valued C*- 
functions with compact support on G. Then 
for any function f in Cg (G), the operator U, 
zs U, f(g) dg belongs to the *trace class, and 
the linear form y: f— T; U, is a distribution in 
the sense of Schwartz. The distribution y is 
called the character of an irreducible unitary 
representation U. A character y is invariant 
under any inner automorphism of G and is a 
simultaneous eigendistribution of the algebra 
of all two-sided invariant linear differential 
operators on G. Two irreducible unitary repre- 
sentations of G are equivalent 1f and only if 
their characters coincide. The distribution y is 
a tlocally summable function on G and coin- 
cides with a real analytic function on each 
connected component of the dense open sub- 
manifold G' consisting of regular elements in 
G. In general, y is not real analytic on all of G 
(Harish-Chandra [9, ITE 10]. 


W. Complex Semisimple Lie Groups 


There are four series of irreducible represen- 

tations of a complex semisimple Lie group G. 
(1) A principal series consists of unitary 

representations of G induced from 1- 
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dimensional unitary representations L of a 
*Borel subgroup B of G. L is uniquely deter- 
mined by a unitary character ve Hom(A, U(1)) 
= A* of the *'Cartan subgroup A of G con- 
tained in B. Hence the representations in the 
principal series are parametrized by the ele- 
ments in the character group A* of the Cartan 
subgroup A. If we denote U} by U”, two repre- 
sentations U” and U"'(v, v'e A*) are equivalent 
if and only if v and v' are conjugate under the 
*Weyl group W of G with respect to A. 

(2) A degenerate series consists of unitary 
representations induced by 1-dimensional 
unitary representations of a *parabolic sub- 
group P of G other than B. (A parabolic sub- 
group P is any subgroup of G containing a 
Borel subgroup B.) 

(3) A complementary series consists of irre- 
ducible unitary representations U* induced by 
nonunitary 1-dimensional representations 
L of a Borel subgroup B. In this case, con- 
dition (ii) in the definition of U^ (— Section O) 
must be changed. When L is a nonunitary 
representation, then the operator Uj is not a 
unitary operator with respect to the usual L,- 
inner product (ii). However, if L satisfies a 
certain condition, then US leaves invariant 
some positive definite Hermitian form on the 
space of sufficiently nice functions. Completing 
this space, we get a unitary representation U*. 
The representations thus obtained form the 
complementary series. 

(4) A complementary degenerate series con- 
sists of irreducible unitary representations 
induced by nonunitary 1-dimensional repre- 
sentations of a parabolic subgroup P z B. 

Representations belonging to different series 
are never equivalent. It seems certain that any 
irreducible unitary representgtion of a con- 
nected complex semisimple Lie group is equiv- 
alent to a representation belonging to one of 
the above four series, but this conjecture has 
not yet been proved. Moreover, E. M. Stein 
[11] constructed irreducible unitary repre- 
sentations different from any in the list ob- 
tained by I. M. Gel’fand and M. A. Naimark 
(Neumark) [12]. These representations belong 
to the complementary degenerate series. The 
characters of the representations in these four 
series are computed in explicit form. For ex- 
ample, the character y, of the representation 
U" in the principal series can be calculated as 
follows: Let 4 be a linear form on a Cartan 
subalgebra a such that v(exp H)=e* for 
every H in a, let D be the function on A de- 
fined by D(exp H) = JJ, |e"? —e «4212. 
where a runs over all positive roots. Then the 
character y, of a representation U" in the 
principal series is given by the formula 
4, (exp H) 2 D(exp H) ! Y. eun, 


sew 
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In the irreducible decomposition of the regular 
representation of G, only irreducible represen- 
tations belonging to the principal series arise. 
Hence the right-hand side in the Plancherel 
formula is an integral over the character group 
A* of a Cartan subgroup A. Under a suitable 
normalization of the Haar measures in G and 
A*, the Plancherel measure u of G can be 
expressed by using the Haar measure dv of A*: 


du(vy=w "IA. 09/Co, o9" dv, 


where w is the order of the Weyl group, p is 
the half-sum of all *positive roots, and « runs 
over all positive roots (Gel'fand and Naimark 


[12]). 


X. Real Semisimple Lie Groups 


As in the case of a complex semisimple Lie 
group, a connected real semisimple Lie group 
G has four series of irreducible unitary repre- 
sentations. However, if G has no parabolic 
subgroup other than a minimal parabolic 
subgroup B and G itself, then G has no repre- 
sentation in the degenerate or complementary 
degenerate series. Examples of such groups are 
SL(2, R) and higher-dimensional "Lorentz 
groups. In general, the classification of irreduc- 
ible unitary representations in the real semi- 
simple case is more complicated than in the 
complex semisimple case. Irreducible unitary 
representations arising from the irreducible 
decomposition of the regular representation 
are called representations in the principal 
series. The principal series of G are divided 
into a finite number of subsertes corresponding 
bijectively to the conjugate classes of the tCar- 
tan subgroups of G. 

A connected semisimple Lie group G has a 
square integrable representation if and only if 
G has a compact Cartan subgroup H. The set 
of all square integrable representations of G is 
called the discrete series of irreducible unitary 
representations. The discrete series is the sub- 
series in the principal series corresponding to a 
compact Cartan subgroup H. The representa- 
tions in the discrete series were classified by 
Harish-Chandra. Let lj be the Lie algebra of 
H, P the set of all positive roots in b for a fixed 
linear order, x the polynomial TL. Ha, and 
F the set of all real-valued linear forms on 
JV b. Moreover, let L be the set of all linear 
forms 4 in F such that a single-valued char- 
acter č, of the group H is defined by the for- 
mula £ (exp X) — e^05, and let L’ be the set 
of all 4 in L such that (4) #0. Then for each 
Ain L’, there exists a representation c(A4) of 
G in the discrete series, and conversely, every 
representation in the discrete series is equivalent 
to (4) for some 4 in L’. Two representations 
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0(4,) and w(2,) (4,, 4; € L’) are equivalent if 
and only if there exists an element s in W; — 
N(H)/H such that 4; — s4,, where N(H) is the 
normalizer of H in G (W; can act on F as a 
linear transformation group in the natural 
way). The value of the character x,.0n the 
subgroup H of the representation «(A) (4e L’) 
is given as follows: Let e(å) be the signature of 
n(4) 2 I1,. p ^(H,), and define q and A by q= 
(dim G/K)/2 and A(exp H) — IT, (e^? — 

e *0D7) Then the character x, of the repre- 
sentation c(4) has the value (—1)?2(A) y (h) = 
A(h) ! X,cw (det s)é,,(h) on a regular element 
hin H The formal degree d(c(^)) of the 
representation w(A) is given by the formula 
d(a(A))= C ! [W;]|n(4)|, where C is a positive 
constant (not depending on 4) and [Wg] is the 
order of the finite group W, (Harish-Chandra 
[13]). A formula expressing the character y, on 
the whole set of regular elements in G has been 
given by T. Hirai [14]. The representations in 
discrete series are realized on L?-cohomology 
spaces of homogeneous holomorphic line 
bundles over G/H (W. Schmid [15]). They are 
also realized on the spaces of harmonic spinors 
on the *Riemannian symmetric space G/K 
(M. Atiyah and Schmid [16]). They are also 
realized on the eigenspaces of a Casimir opera- 
tor acting on the sections of vector bundles 
on G/K (R. Hotta, J. Math. Soc. Japan, 23; 

N. Wallach [17]). An irreducible unitary rep- 
resentation is called integrable if at least one 
of its matrix coefficients belongs to L!(G). 
Integrable representations belong to the dis- 
crete series. They have been characterized by 
H. Hecht and Schmid (Math. Ann., 220 (1976)). 
The theory of the discrete series is easily ex- 
tended to reductive Lie groups. 

The general principal series representations 
of a connected semisimple Lie group G with 
finite center are constructed as follows. Let K 
be a maximal compact subgroup of G. Then 
there exists a unique involutive automorphism 
0 of G whose fixed point set coincides with K. 
0 is called a Cartan involution of G. Let H bea 
-stable Cartan subgroup of G. Then H is the 
direct product of a compact group T— Hf! K 
and a vector group A. The centralizer Z(A) of 
A in G is the direct product of a reductive Lie 
group M —0(M) and A. M has a compact 
Cartan subgroup T. Hence the set M, of the 
discrete series representations of M is not 
empty. Let x be an element of the dual space 
a* of the Lie algebra a of A and put g,= 
(Xegl[H, X]=a(H)X(VHea)} and A= 
{xea*|g,4 [01]. Let A" be the set of posi- 
tive elements of A in a certain order of a* and 
put n= Zen 0, and N=expn. Then P= MAN 
is a closed subgroup of G. P is called a cuspidal 
parabolic subgroup of G. Let De M, and vea*. 
Then a unitary representation D & e" of P 
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is defined by (D & e") (man) = D(m)e^»e? 

(me M,ae A,ne N). The unitary representation 
zo, of G induced by Deet is independent of 
the choice of A" up to equivalence. Thus zo, 
depends only on (H, D, v). The set of represen- 
tations [zy ,| De M,, vea*} is called the prin- 
cipal H-series. If v is regular in a* (i.e, (v, a) 40 
for all xe A), then zp , is irreducible. Every zo. 
is a finite sum of irreducible representations. 
The character 0, , of 2, , is a locally sum- 
mable function which is supported in the 
closure of | J,,; g(M A)g "'. If two Cartan 
subgroups H, and H, are not conjugate in G, 
then every H,-series representation is disjoint 
from every H,-series representation. Choose 

a complete system {H,,...,H,} of conjugacy 
classes of Cartan subgroups of G. Then every 
H; can be chosen as O-stable, The union of the 
principal H;-series (1 & i &r) is the principal 
series of G. The right (or left) regular represen- 
tation of G is decomposed as the direct integral 
of the principal series representations. Every 
complex-valued C?-function on G with com- 
pact support has an expansion in terms of the 
matrix coefficients of the principal series repre- 
sentations. Harish-Chandra [18] proved these 
theorems and determined explicitly the Plan- 
cherel measure by studying the asymptotic 
behavior of the Eisenstein integral [19,20]. 


Y. Spherical Functions 


Let G be a locally compact tunimodular group 
and K a compact subgroup of G. The set of all 
complex-valued continuous functions on G 
that are invariant under every left translation 
L, by elements k in K is denoted by C(K\G). 
The subset of C(K\G) that consists of all two- 
sided K-invariant functions is denoted by 
C(G, K). The subset of C(G, K) consisting of all 
functions with compact support is denoted by 
L= L(G, K). Lis an algebra over C if the prod- 
uct of two elements f and g in L is defined by 
the convolution. 

Let 4 be an algebra homomorphism from L 
into C. Then an element of the eigenspace F(A) 
={weC(K,G)| f «V =A(f)W (Vf eL)] is called 
a spherical function on KG. If F(A) contains a 
nonzero element, then F(A) contains a unique 
two-sided K-invariant element w normalized 
by oe) — 1, where e is the identity element 
in G. This function w is called the zonal spher- 
ical function associated with A. In this case, 
the homomorphism 4 is defined by 4( f) — 

Le f(g)o(g dg. Hence the eigenspace F(A) 

is uniquely determined by the zonal spherical 
function o. A function œ #0 in C(G, K) isa 
zonal spherical function on K\G if and only 
if c satisfies either of the following two con- 
ditions: (i) The mapping f+ f f(g)o(g !)dg is 
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an algebra homomorphism of L into C; (ii) œ 
satisfies the functional equation 


f co(gkh) dk = w(g)on(h). 
K 


When G is a Lie group, every spherical func- 
tion is a real analytic function on K\G. 


Z. Expansion by Spherical Functions 


In this section, we assume that the algebra L 
of two-sided K-invariant functions is com- 
mutative. In this case there are sufficiently 
many spherical functions of K\G, and two- 
sided K-invariant functions are expanded by 
spherical functions. An irreducible unitary 
representation U of G is called a spherical 
representation with respect to K if the represen- 
tation space $(U) contains a nonzero vector 
invariant under every U,, where k runs over K. 
By the commutativity of L, the K-invariant 
vectors in §(U) form a 1-dimensional sub- 
space. Let x be a K-invariant vector in §(U) 
with the norm |jx|| «1. Then c(g)=(U,x, x) is 
a zonal spherical function on K\G, and for 
every y in $(U), the function ~,(g)=(U,x, y) 
is a spherical function associated with c. 
Moreover, in this case the zonal spherical 
function c is a positive definite function on G. 
Conversely, every positive definite zonal spher- 
ical function o can be expressed as w(g)= 
(U,x, x) for some spherical representation U 
and some K-invariant vector x in $(U). 

The set of all positive definite zonal spher- 
ical functions becomes a locally compact space 
Q by the topology of compact convergence. 
The spherical Fourier transform f of a function 
f in L,(KXG) is defined by 


fo) [ f(ayog "ld. 


There exists a unique "Radon measure u on 
Q such that for every f in L, f belongs to 
L,(Q, u). Also, the Plancherel formula 


| Gate | flojo ydo) 4 
G Q 


holds for every f and g in L, and an inversion 
formula f(s)= fo f(c)e(s)du(c) holds for a 
sufficiently nice two-sided K-invariant func- 
tion f [21]. Identifying a positive definite zonal 
spherical function with the corresponding 
spherical representation, we can regard Q as 

a subset of the dual G of G. The Plancherel 
formula for two-sided K-invariant functions is 
obtained from the general Plancherel formula 
on G by restricting the domain of the integral 
from G to Q. When G is a Lie group and L is 
commutative, a spherical function on KG can 
be characterized as a simultaneous eigenfunc- 
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tion of G-invariant linear differential operators 
on K\G. 


AA. Spherical Function on Symmetric Spaces 


The most important case where the algebra 
L= L(G, K) is commutative is when K\G is a 
tweakly symmetric Riemannian space or, in 
particular, a symmetric Riemannian space 
When K\G is a compact symmetric Riemann- 
ian space, a spherical representation with 
respect to K is the trreducible component of 
the regular representation T on K\G, and 

a spherical function on KG is a function 

that belongs to the irreducible subspaces in 
L,(K\G). In particular, if G is a compact con- 
nected semisimple Lie group, the highest 
weights of spherical representations of G with 
respect to K are explicitly given by using the 
Satake diagram of KG. The Satake diagram 
of K\G is the *Satake diagram of the noncom- 
pact symmetric Riemannian space KG, dual 
to K\G or the Satake diagram of the Lie alge- 
bra of Go. If a symmetric space is the under- 
lying manifold of a compact Lie group G, then 
G can be expressed as G = K\(G x G), where K 
is the diagonal subgroup of G x G. In this case, 
a zonal spherical function œw on G = K\(G x G) 
is the normalized character of an irreductble 
unitary representation U of G:w(g)= 

(deg U) ' T; U,. The explicit form of c is 

given by *'Weyl's character formula (— 249 
Lie Groups). 

The zonal spherical functions on a sym- 
metric Riemannian space K\G of noncompact 
type are obtained in the following way: Let G 
be a connected semisimple Lie group with 
finite center, K be a maximal compact sub- 
group of G, and G=NA, K be an 'Iwasawa 
decomposition. Then for any g in G there 
exists a unique element H(g) in the Lie algebra 
a, of A, such that g belongs to N exp H(g)K. 
Let a be a Cartan subalgebra containing a,, P 
be the set of all positive roots in a, and p— 
(22, 39/2. Then for any complex-valued linear 
form v on a,, the function 


oup | gU" elen dk 
K 


is a zonal spherical function on the symmetric 
Riemannian space KG. Conversely, every 
zonal spherical function w on K\G is equal to 
w, for some v. Two zonal spherical functions 
œw, and w, coincide if and only if v and v' are 
conjugate under the operation of the Weyl 
group Wo = N,(AYZ&(A) of KXG (Harish- 
Chandra [22], S. Helgason [23}). If v ts real- 
valued, then œ, is positive definite. Such a 
zonal spherical function c, is obtained from 
a spherical representation belonging to the 
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principal A-series. Let OQ be the set of all 
zonal spherical functions o, associated with 
the real-valued linear form v. Then the support 
of the Plancherel measure u on K\G is con- 
tained in (Dua, We can choose v as a parameter 
on the space Q,. Then the right-hand side of 
the Plancherel formula can be expressed as an 
integral over the dual space L of a, . More- 
over, the Plancherel measure u is absolutely 
continuous with respect to the Lebesgue mea- 
sure dv on the Euclidean space L and can be 
expressed as 


du(@,) — cg "let! ? dv 


under suitable normalization of u and dv. The 
problem of calculating the function c(v) can be 
reduced to the case of symmetric spaces of 
rank 1 and can be solved explicitly. Let p, be 
the multiplicity of a restricted root x and I{v) 
be the product 


where x runs over all positive restricted roots 
and B is the "beta function. Then c(v) = I(iv)/ 

I( p) [20, 24). Every spherical function f on 
K\G is expressed as the Poisson integral of 

its "boundary values" on the Martin boundary 
PG of KG, where P= MA, N is a minimal 
parabolic subgroup of G. The boundary values 
of f form a hyperfunction with values in a line 
bundle over P\G (K. Okamoto et al. [25]). 


BB. Spherical Functions and Special Functions 


Some important special functions are obtained 
as the zonal spherical functions on a certain 
symmetric Riemannian space M = K\G (G is 
the motion group of M). In particular when 

M is of rank 1, then the zonal spherical func- 
tions are essentially the functions of a single 
variable. For example, the zonal spherical 
functions on an n-dimensional Euclidean space 
can be expressed as 


orl Al (m+ (vr) "J, (vr), 


where 2m — n — 2 and J,, is the *Bessel function 
of the mth order. The zonal spherical function 
on an (n — 1)-dimensional sphere $"! = 
SO(n— 1) SO(n) is given by 


co (0) =F + 1)P(n —2)T (v-- n —2)! C"(cos 0) 
(v2 0, 1,2, ..., 


where C7'(z) is the *Gegenbauer polynomial. 
The zonal spherical functions on an (n — 1)- 
dimensional Lobachevskii space can be ex- 
pressed as 


wo, (t)= 2^1? T'(m + 1/2)7sinh "*!2, 


x Bm. e (cosht) 
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using a generaltzed *associated Legendre func- 
tion PË. Many properties of special functions 
can be proved from a group-theoretic point of 
view. For example, the addition theorem is 
merely the homomorphism property U,, — 

U, U, expressed in terms of the matrix com- 
ponents of U. The differential equation satis- 
fied by these special functions is derived from 
the fact that a zonal spherical function œ is an 
eigenfunction of an invariant differential oper- 
ator. The integral expression of such a special 
function can be obtained by constructing a 
spherical representation U in a certain func- 
tion space and calculating explicitly the inner 
product in the expression w(g)=(U,x, x) (N. 
Ya. Vilenkin [26]). 


CC. Generalization of the Theory of Spherical 
Functions 


The theory of spherical functions described in 
Sections Y—BB can be generalized in several 
ways. First, spherical functions are related to 
the trivial representation of K. A generaliza- 
tion is obtained if the trivial representation of 
K is replaced by an irreducible representation 
of K. The theory of such zonal spherical func- 
tions is useful for representation theory [20]. 
For example, the Plancherel formula for 

SL (2, R) can be obtained using such spherical 
functions (R. Takahashi, Japan. J. Math., 31 
(1961)). Harish-Chandra's Eisenstein integral 
is such a spherical function on a general semi- 
simple Lie group G. He used it successfully to 
obtain the Plancherel measure of G. Another 
generalization can be obtained by removing 
the condition that K is compact. In particu- 
lar, when K\G is a symmetric homogeneous 
space of a Lie group G, the algebra 2 of all G- 
invariant linear differential operators is com- 
mutative if the space KXG has an invariant 
volume element. In this case, a spherical func- 
tion on K\G can be defined as a simultaneous 
eigenfunction of Y. The character of a semi- 
simple Lie group is a zonal spherical func- 
tion (distribution) in this sense. The spherical 
functions and harmonic analysis on sym- 
metric homogeneous space have been studied 
by T. Oshima and others. T. Oshima and J. 
Sekiguchi [27] proved the Poisson integral 
theorem (— Section AA) for a certain kind of 
symmetric homogeneous spaces. 

The spherical functions and unitary repre- 
sentations of topological groups that are not 
locally compact are studied in connection with 
probability theory and physics. For example, 
the zonal spherical functions of the rotation 
group of a real Hilbert space are expressed by 
Hermite polynomials. 
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DD. Discontinuous Subgroups and 
Representations 


Let G be a connected semisimple Lie group 
and F be a discrete subgroup of G. Then the 
regular representation T of G on IG is de- 
fined by (f)(x) 9 f(xg) (fe L'(I^G)). The 
problem of decomposing the representation T 
into irreducible components is important in 
connection with the theory of *automorphic 
forms and number theory. First assume that 
the quotient space (Ce is compact. Then for 
every function f in L,(G), the operator T(f) is 
a compact operator. Hence the regular repre- 
sentation T on T\G can be decomposed into 
the discrete sum T= X4, T™ of irreducible 
unitary representations T9, and the multiplic- 
ity of every irreducible component is finite. 
The irreducible unitary representation U of G 
is related to the automorphic forms of T in the 
following way: Let x be a nonzero element in 
the representation space $ = §(U) of U. $ is 
topologized into a tlocally convex topological 
vector space $$, by the set N, of tseminorms: 
N, = {Pe(y) =max,.¢|(U,x, y)|}, where C runs 
over all compact subsets in G. The topology 
4. of H, is independent of the choice of x 
provided that dim{ T, x|ke K] « oo, where K is 
a maximal compact subgroup of G. Let $* be 
the completion of $, with respect to the topo- 
logy 2. (the completion is independent of 

the choice of x). $* contains the original Hil- 
bert space $ as a subspace. Then the repre- 
sentation U of G on § can be extended to a 
rcpresentation U* of G on the space $*. An 
element f in $* invariant under U* for every y 
in F is called an automorphic form of I of type 
U. Then the multiplicity of an irreducible 
representation U in the regular representation 
T on IG is equal to the dimension of the 
vector space consisting of all automorphic 
forms of type U. This theorem is called the 
Gel’fand—Pyatetskii-Shapiro reciprocity law 
[28]. Let T2 E, T? be the irreducible de- 
composition of T and y, be the character of 
the irreducible unitary representation T®. 
Then for a suitable function f on G, the in- 
tegral operator K, on §(T)=L?(T\G) with 
kernel k,(x, y)=Lyer f(x ! yy) belongs to the 
trace class. By calculating the trace of K; in 
two ways, the following trace formula is 
obtained: 


»3 | S(Dulgdg=¥ | fox yx) dx, 
Ss 0149» 
where (yj is the conjugate class of y in I and 
D, is the quotient space of the centralizer G, of 
y in G by the centralizer I, of y in I. 

When the groups G and T are given ex- 
plicitly, the right-hand side of the trace for- 
mula can be expressed in a more explicit form, 
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and the trace formula leads to useful conse- 
quences. A similar trace formula holds for the 
unitary representation U* induced by a finite- 
dimensional unitary representation L of T 
instead of the regular representation T on 
IG. When the quotient space I\G is not 
compact, the irreducible decomposition of the 
regular representation T on I\G contains not 
only the discrete direct sum but also the direct 
integral (continuous spectrum). A. Selberg 
showed that even in this case, there are explicit 
examples for which the trace formula holds for 
the part with discrete spectrum. Also, the part 
with continuous spectrum can be described by 
the *generalized Eisenstein series. Analytic 
properties and the functional equation of the 
generalized Eisenstein series have been studied 
by R. Langlands [30]. Recent developments 
are surveyed in [31]. 


EE. History 


Finite-dimensional unitary representations of 
a finite group were studied by Frobenius and 
Schur (1896—1905). In 1925, tWeyl studied the 
finite-dimensional unitary representation of 
compact Lie groups. The theory of infinite- 
dimensional unitary representation was init- 
tated in 1939 by E. P. Wigner in his work on 
the inhomogeneous Lorentz group, motivated 
by problems of quantum mechanics. 

In 1943, Gel'fand and D. A. Raikov proved 
the existence of sufficiently many irreducible 
unitary representations for an arbitrary locally 
compact group. The first systematic studies of 
unitary representations appeared in 1947 in 
the work of V. Bargmann on SL(2, R) [31] 
and the work of Gel'fand and Neumark on 
SL(2, C). Gel'fand and Naimark established 
the theory of unitary representation for com- 
plex semisimple Lie groups [12]. 

Harish-Chandra proved theorems concern- 
ing the unitary representations of a general 
semisimple Lie group; for instance, he proved 
that a semisimple Lie group G is of type I [7] 
and defined the character of a unitary repre- 
sentation of G and proved its basic properties 
[9, III; 10]. Harish-Chandra also determined 
the discrete series of G and their characters. 
Harish-Chandra [18] proved the Plancherel 
formula for an arbitrary connected semisimple 
Lie group G with finite center. Hence har- 
monic analysis of square integrable functions 
on G is established. 

Further studies on harmonic analysis on 
semisimple Lie groups have been carried out. 
In particular, Paley- Wiener-type theorems, 
which determine the Fourier transform image 
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of the space Cè (G) of C?-functions with com- 
pact support, have been proved for the group 
PSL(2,R) (L. Ehrenpreis and F. Mautner 
[33]), complex semisimple Lie groups (Zhelo- 
benko [34]), and two-sided K-invariant func- 
tions on general semisimple Lie groups (R. 
Gangolli [35]). A. W. Knapp and E. M. Stein 
[36] studied the intertwining operators. 

Concerning the construction of irreducible 
representations, G. W. Mackey [3] and Bruhat 
[4] developed the theory of induced represen- 
tations of locally compact groups and Lie 
groups, respectively. B. Kostant [37] (see 
Blattner's article in [38]) noticed a relation 
between homogeneous tsymplectic manifolds 
and unitary representations and proposed a 
method of constructing irreducible unitary 
representations of a Lie group. Selberg's re- 
search [29] revealed a connection between 
unitary representations (or spherical functions) 
and the theory of automorphic forms and 
number theory. A number of papers along 
these lines have since appeared [31]. In con- 
nection with number-theoretic investigations 
of an talgebraic group defined over an alge- 
braic number field, unitary representations of 
the *adele group of G or an algebraic group 
over a *p-adic number field have been studied 
(— [31,38], Gel'fand, M. I. Grayev, and I. I. 
Pyatetskii-Shapiro [39], and H. M. Jacquet 
and R. P. Langlands [40]). 

For the algebraic approach to the infinite- 
dimensional representations of semisimple Lie 
groups and Lie algebras — [41]. 

For surveys of the theory of unitary repre- 
sentations — [2,19,20,31,38]. 
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A. General Remarks 


A single-valued tanalytic function f(z) defined 
in a domain D of the complex plane is said to 
be univalent (or simple or schlicht) if it is injec- 
tive, ie., if f(z,) A f(z.) for all distinct points z,, 
z, in D. A multiple-valued function f(z) is also 
said to be univalent if its distinct function 
elements always attain distinct values at their 
centers. The derivative of a univalent function 
is never zero. The limit function of a tuni- 
formly convergent sequence of univalent func- 
tions is univalent unless it reduces to a con- 
stant. When f(z) ts single-valued, the univalent 
function w= f(z) gives rise to a one-to-one 
‘conformal mapping between D and its image 


f(D). 


B. Univalent Functions in the Unit Disk 


A systematic theory of the family of functions 
tholomorphic and univalent in the unit disk 
originates from a distortion theorem obtained 
by P. Koebe (1909) in connection with the 
uniformization of analytic functions. In gen- 
eral, distortion theorems are theorems for 
determining bounds of functionals, such as 
LG). (2), arg f(z), within the family under 
consideration. In particular, distortion theo- 
rems concerning the bounds of the arguments 
of f(z) and f(z) are also called rotation theo- 
rems. Though results were at first qualitative, 
they were made quantitative subsequently by 
L. Bieberbach (1916), G. Faber (1916), and 
others. Any univalent function f(z) holo- 
morphic in the unit disk and normalized by 
f(0) 2 0 and f'(0)= 1 satisfies the distortion 
inequalities 


TE 
(Iz? ^ az)’ 
l1 — |zi ; (EEN 
dxiap <4 Osa 


Here the equality holds only if f(z) is of the 
form z/(1 — ez)? (|e| « 1). In deriving these in- 
equalities, Bieberbach centered his attention 
on the family of *meromorphic functions oC) 
=% +È ob,4 ” univalent in |C| » 1. He es- 
tablished the area theorem X2, v|b,|? <1, 
which illustrates the fact that the area of the 
complementary set of the image domain is 
nonnegative. Bieberbach, R. Nevanlinna 
(1919-1920), and others constructed a sys- 
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tematic theory of univalent functions in the 
unit disk based on this theorem. 

After the area theorem, the chief tools in 
the theory of univalent functions have been 
Löwner’s method, the method of contour 
integration, the variational method, and the 
method of the extremal metric. In contrast to 
the theory of univalent functions based on 
Bieberbach’s area theorem, K. Löwner (1923) 
introduced a new method. In view of a theo- 
rem on the domain kernel (C. Carathéodory, 
1912), it suffices to consider an everywhere 
dense subfamily in order to estimate a con- 
tinuous functional within the family of univa- 
lent functions holomorphic in the unit disk. 
Löwner used the subfamily of functions map- 
ping the unit disk onto the so-called bounded 
slit domains. Namely, the range of a member 
of this subfamily consists of the unit disk slit 
along a Jordan arc that starts at a periphery 
point and does not pass through the origin. 
A mapping function of this nature is deter- 
mined as the integral f(z, to) of Löwner’s dif- 
ferential equation 


GE aed) 1+ SH 
et 1 — k(t)f(z, t) 


with the initial condition f(z,0) 2 z, where x(t) 
is a continuous function with absolute value 
equal to 1. Any univalent function f(z) holo- 
morphic in the unit disk and satisfying f(0) — 
0, f'(0) — 1 has an arbitrarily close approxi- 
mation by functions of the form e'?f(z, to). By 
means of this differential equation Lówner 
proved that |a4| € 3 for any univalent function 





Óxtxto, 


f(2)-2z- Xia," (|z| « 1) and also derived 


a decisive estimate concerning a coefficient 
problem for the inverse function [2]. 

G. M. Golusin (1935) and I. E. Bazilevich 
(1936) first noticed that Lówner's method is 
also a powerful tool for deriving several distor- 
tion theorems. They showed that classical 
distortion theorems can be derived in more 
detailed form (Golusin, Mat. Sb., 2 (1937), 685); 
in particular, Golusin (1938) obtained a precise 
estimate concerning the rotation theorem, i.e., 


larg f CH 


Ge 
"le -log(lzl?/(1 —|z])), 


Iz « 1/2, 
1//2x|z| « 1. 


Lówner's method was also investigated by A. 
C. Schaeffer and D. C. Spencer (1945) [8]. 

The method of contour integration was 
introduced by H. Grunsky. It starts with some 
2-dimensional integral which can be shown to 
be positive. Transforming it into a boundary 
integral and using the *residue theorem, we 
obtain an appropriate inequality by means of 
this integral. By this method Grunsky estab- 
lished the following useful inequality (Math. 
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Z., 45 (1939)). For g(C) 2 C-- Eo b, C", which 
is univalent in |¢|> 1, let 


Gs ) o 
lo BIEN gu Y ez p 
Zb m-in-i 


(|2|>1,|¢]> 1). 


The coefficients c,,, are polynomials in the 
coefficients b, of g. Then Grunsky's inequality 
is: For each integer N and for all complex 
numbers /,,..., Ay, 


N N 


2. Cnn mn 


m=1 n= 








SIE 
n “n n 


It is known that if this inequality holds for 
an arbitrary integer N and for all complex 
numbers 4,, ..., Ay, then g(C) is univalent in 
[C1 1. There are several generalizations of 
Grunsky's inequality [13]. 

The variational method was first developed 
by M. Schiffer for application to the theory of 
univalent functions. He first used boundary 
variations (Proc. London Math. Soc., 44 (1938)) 
and later interior variations (Amer. J. Math., 
65 (1943)). The problem of maximizing a given 
real-valued functional on a family of univalent 
functions is called an extremal problem, and a 
function for which the functional attains its 
maximum is called an extremal function. The 
variational method is used to uncover charac- 
teristic properties of an extremal function by 
comparing it with nearby functions. Typical 
results are the qualitative information that the 
extremal function maps the disk |z| « 1 onto 
the complement of a system of analytic arcs 
satisfying a differential equation and that the 
extremal function satisfies a differential equa- 
tion. Following Schiffer, Schaeffer and Spen- 
cer [8] and Golusin (Math. Sb., 19 (1946)) gave 
variants of the method of interior variations. 

H. Grótzsch (1928—1934) treated the theory 
of univalent functions in a unified manner 
by the method of the *extremal metric. The 
idea of this method is to estimate the length 
of curves and the area of some region swept 
out by them together with an application 
of *Schwarz’s inequality (— 143 Extremal 
Length). After Grótzsch, the method of the ex- 
tremal metric has been used by many authors. 
In particular, O. Teichmüller, in connection 
with this method, formulated the principle 
that the solution of a certain type of extremal 
problem ts in general associated with a tqua- 
dratic differential, although he did not prove 
any general result realizing this principle in 
concrete form. J. A. Jenkins gave a concrete ex- 
pression of the Teichmüller principle; namely, 
he established the general coefficient theo- 
rem and showed that this theorem contains 
as special cases a great many of the known 
results on univalent functions [11]. 
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Univalence criteria have been given by 
various authors. In particular, Z. Nehari (Bull. 
Amer. Math. Soc., 55 (1949)) proved that if 
(UI f(z), 21 «2(1 —|z]?) ? in |z| <1, then f(z) is 
univalent in |z| « 1, and E. Hille (Bull. Amer. 
Math. Soc., 55 (1949)) proved that 2 is the best 
possible constant in the above inequality. 
Here, 1 f(z),z} denotes the 'Schwarzian deriva- 
tive of f(z) with respect to z: 


d(f'oN. v fy 
V na (Fa) 


C. Coefficient Problems 








In several distortion theorems Koebe's ex- 
tremal function z/(1— ez? =~, ne" ! z"(Je| = 

1) is extensively utilized. Concerning this, 
Bieberbach stated the following conjecture. If 
f(z) 9 z - Eo a,z" is holomorphic and univa- 
lent in |z| « 1, then |a,|<n (n— 2,3, ...), with 
equality holding only for Koebe's extremal 
function z/(1 — zz)? (|e| - 1). This conjecture 
was solved affirmatively by L. de Branges in 
1985 after enormous effort by many mathema- 
ticians, as described below. 

Bieberbach (1916, [1]) proved |a;| x2 asa 
corollary to the area theorem. This result can 
be proved easily by most of the methods. In 
1923 Lówner [2] proved |a4| x 3, introducing 
his own method. Schaeffer and Spencer gave a 
proof of [a4| x 3 by the variational method 
(Duke Math. J., 10 (1943)). Furthermore, Jen- 
kins used the method of the extremal metric to 
prove a coefficient inequality that implies 
[a3] € 3 (Analytic Functions, Princeton Univ. 
Press, 1960). The problem of the fourth coeffi- 
cient remained open until 1955, when P. R. 
Garabedian and Schiffer [3] proved |a4| <4 by 
the variational method. Their proof was ex- 
tremely complicated. Subsequently, Z. Char- 
zynski and Schiffer gave an alternative brief 
proof of |a4| x 4 by using the Grunsky inequal- 
ity (Arch. Rational Mech. Anal., 5 (1960)). M. 
Ozawa (1969, [4]) and R. N. Pederson (1968, 
[5]) also used the Grunsky inequality to prove 
lag| <6. In 1972, Pederson and Schiffer [6] 
proved |a;| <5. They applied the Garabedian- 
Schiffer inequality, a generalization of the 
Grunsky inequality which Garabedian and 
Schiffer had derived by the variational method. 

On the other hand, W. K. Hayman [7] 
showed that for each fixed f(z) 2 z + E£, a,z", 


la, 





lim 


noo H 


—axl, 


with the equality holding only for Koebe's 
extremal function z/(1 — zz)? (\e|= 1). Further, 
it was shown that Koebe's extremal function 
z/(1—z) gives a local maximum for the nth 
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coefficient in the sense that Re{a,} <n when- 
ever |a, — 2| « ô, for some 6, >0 (Garabedian, 
G. G. Ross, and Schiffer, J. Math. Mech., 14 
(1964); E. Bombieri, Inventiones Math., 4 
(1967); Garabedian and Schiffer, Arch. Ra- 
tional Math. Anal., 26 (1967)). 

In the most general form, the coefficient 
problem is to determine the region occupied 
by the points (a), ...,a,) for all functions f(z)= 
z+L2.,4,2" univalent in |z| « 1. Schaeffer 
and Spencer [8] found explicitly the region for 
(42, a3). 

For the coefficients of functions g(£) - £ + 
Eob, ™” univalent in |C| 1, the following 
results are known: |b; |< 1 (Bieberbach [1]). 
|b5| € 2/3 (Schiffer, Bull. Soc. Math. France, 66 
(1938); Golusin, Mat. Sb., 3 (1938)), |b} < 1/2 + 
e © (Garabedian and Schiffer, Ann. Math., 

(2) 61 (1955)). 


D. Other Classes of Univalent Functions 


We have discussed the general family of func- 
tions univalent in the unit disk. There are also 
several results on distortion theorems and 
coefficient problems for subfamilies deter- 
mined by conditions such as that the images 
are bounded, ‘starlike with respect to the 
origin, or tconvex. For instance, if f(z) 2z 
X2 .a,z"is holomorphic and univalent in 
|z|< 1 and its image is starlike with respect to 
the origin, then |a,| x n (n=2, 3, ...). If the im- 
age of f(z) is convex, then f(z) satisfies |a,| < 
1 (n—2,3, ...) and the distortion inequalities 


[Z| |z| 


14i SI sy P 
1 1 


mea lisa 





Here the equality sign appears at zo (0 « |zo| « 
1) if and only if f(z) is of the form z/(1 + ez) 
with e= 4 lzol/zo. 

On the other hand, problems on conformal 
mappings of multiply connected domains 
involve essential difficulties in comparison 
with the simply connected case. Although 
Bieberbach's method is unsuitable for multi- 
ply connected domains, Lówner's method, 
the method of contour integration, the varia- 
tional method, and the method of the ex- 
tremal metric remain useful (— 77 Con- 
formal Mappings). 


E. Multivalent Functions 


Multivalent functions are a natural generaliza- 
tion of univalent functions. There are several 
results that generalize classical results on 
univalent functions. 
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A function f(z) that attains every value at 
most p times and some values exactly p times 
in a domain D is said to be p-valent in D and is 
called a multivalent function provided that p> 
1. In order for f(z) 2 3:::5 a,z", holomorphic 
in |z| € 1, to be p-valent there, it is sufficient 
that it satisfies 


p—1<Re(zf(z)/f(z))<p+! 


on |z| - 1. Hence it suffices to have 


p Eé 

cke 2, Hilda al > 2, "lapi sal: 

If f(z) 2 (1 3- a z - a5 z? + ...)/z? is holomorphic 
and p-valent in 0 « |z| 1, then 


dl BEES 
dr Jo 


for any increasing function F(p) in p 20. In 
particular, if F(p) — p°, this becomes an area 
theorem from which follow coefficient esti- 
mates, etc., for p-valent functions. 

Various subfamilies and generalized families 
of multivalent functions have been considered. 
Let f(z) be p-valent in D, and cor c4z 4... + 
c, 4Z" | -- cy f (z) be at most p-valent in D 
for any constants co, c,, ..., c,. Then f(z) is 
said to be absolutely p-valent in D. If a function 
f(z) holomorphic in a convex domain K satis- 
fies Re(e'f(z))>0 for a real constant a, then 
f(z) is absolutely p-valent in K. If f(z) is ab- 
solutely p-valent in D, then 


E b,z* + ul cz" + ul 
k=0 k=0 


is at most p-value in D for any constants b, 
and c,. 

If f(z) is p-valent in the common part of a 
domain D and the disk centered at each point 
of D with a fixed radius p, then f(z) is said to 
be locally p-valent in D, and ois called its 
modulus. À necessary and sufficient condi- 
tion for f(z), holomorphic in D, to be at most 
locally p-valent is that f '(z), ..., f(z) not 
vanish simultaneously. In order for f(z), holo- 
morphic in D, to be locally absolutely p-valent 
it is necessary and sufficient that f(z) 40. 
Let the number of Re'?-points of f(z) in D be 
n(D, Re'?). If f(z) satisfies 

2n 


— | n(D,Re')dp<p, 
2n 0 


for any R 70, it is said to be circumferentially 
mean p-valent in D. If f(z) satisfies 


R Gi 
| | n(D, Re'^ RdR do < pnR?, 
it is said to be areally mean p-valent in D. If 


f(z)! with q 1 is areally mean p-valent in D, 
then f(z) is areally mean p/q-valent in D. For 
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f(z) «(1 - a.z - a4z?  ...)/z? holomorphic and 
areally mean Z2-valent in 0 « |z| € 1, the follow- 
ing area theorem holds: 


wo 
Y (n-a, <a. 
n=1 


Let E be a set containing at least three 
points. If f(z) in D attains every value of E at 
most p times and a certain value of E exactly p 
times (it may attain values outside E more 
than p times), then f(z) is said to be quasi-p- 
valent in D. If w= f(z) is p-valent in D and g(w) 
is quasi-q-valent in f(D), then g( f(z)) is at 
most quasi-pq-valent in D. 

The first success in obtaining sharp inequal- 
ities for multivalent functions was attained 
by Hayman. In his work, an essential role was 
played by the method of tsymmetrization. For 
instance, he obtained the following result. If 
f(z)  z? ay íz"*! +... is holomorphic and 
circumferentially mean p-valent in |z| « 1, then 
lap. | €2p, and for |z| 2 r, O«r « t, 


r? ry? 
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A. Introduction 


There are two related kinds of valuations, 
additive (— Section B) and multiplicative (— 
Section C). The notion of valuations, originally 
defined on (commutative) ‘fields, has been 
extended to more general cases (— Section K); 
however, we first consider the case of fields. 


B. Additive Valuations 


In this article, we mean by an ordered additive 
group a totally ordered additive group, namely, 
a commutative group whose operation is 
addition, which is a *totally ordered set satisfy- 
ing the condition that az b and cz d imply a 
+c>b+d and —a< —b. Suppose that we are 
given a field K, an ordered additive group G, 
and an element oo defined to be greater than 
any element of G. Then a mapping v: K— 
GU (oo) is called an additive valuation (or 
simply a valuation) of the field K if v satisfies 
the following three conditions: (1) v(a) - oo if 
and only if a=0; (ii) v(ab) = v(a) + v(b) for all a, 
b 40; and (iii) v(a + b) > min {v(a), v(b)]. 

The set (v(a)]ae K — 10] j is a submodule of 
G and is called the value group of v, while the 
set R, = iae K |v(a) 20] is a subring of K and 
is called the valuation ring of v. The ring R, has 
only one tmaximal ideal {a|v(a)>0}, called the 
valuation ideal of v (or of RL and the tresidue 
class field of R, modulo the maximal ideal is 
called the residue class field of the valuation v. 
We have v(a) € v(b) if and only if aR, 2 bR,. 
Two valuations v and v' of the field K are said 
to be equivalent when v(a) x v(b) if and only if 
v'(a) « v (b); hence v and v' are equivalent if and 
only if R,= R,.. The rank of v is defined to be 
the *Krull dimension of the valuation ring R,, 
and the rational rank of v to be the maximum 
(or supremum) of the numbers of linearly 
independent elements in the value group. An 
extension (or prolongation) of v in a field K' 
containing K is a valuation v' of K' whose 
restriction on K is v; such an extension exists 
for any given v and K'. Sometimes a valuation 
of rank 1 is called a special valuation (or ex- 
ponential valuation), and a valuation of a gen- 
eral rank is called a generalized valuation. On 
the other hand, if k is a subfield of K such that 
v(a)=0 for every nonzero element a of k, then 
v is called a valuation over the subfield X. 


C. Multiplicative Valuations 


A multiplicative valuation (or valuation) of a 
field K is a mapping w: K->T U {0} that satis- 
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fies the following three conditions, where It 
the multiplicative group of positive real num- 
bers: (i) w(a) 2 0 if and only if a —0; (ii) w(ab) 

= w(a)w(b); and (iii) w(a +b ) € C(w(a) + w(b)), 
where C is a constant (independent of the 
choice of a and b, but dependent on the choice 
of w). 

The value group of w is defined to be 
(w(a)| ae K — (0j ). Extensions of a valuation 
and equivalence of valuations are defined as in 
the case of additive valuations. Thus w' is 
equivalent to w if and only if there is a positive 
r such that for all ae K, w(a)= w'(ay. In each 
equivalence class of valuations of a field, there 
exists a valuation for which the constant C in 
condition (iii) can be taken to be 1. A valu- 
ation w is said to be a valuation over a subfield 
k if w(a) — 1 for any nonzero element a of k. 

We call w an Archimedean valuation if for 
any elements a, be K, a0, there exists a 
natural number n such that w(na) > w(b); 
otherwise, w is said to be a non-Archimedean 
valuation. If w is an Archimedean valuation of 
a field K, then there is an injection o from K 
into the complex number field C such that w is 
equivalent to the valuation w' defined by w'(a) 
— |c(a)|. If w is a non-Archimedean valuation 
of a field K, then w(a+ b) x max (w(a), w(b)}. 
Hence in this case we get an additive valuation 
v of K when we define v(a) 2 — log w(a) (ae K), 
and either v is of rank 1 or v(K) — 11,0] (in the 
latter case, v is called trivial). Conversely, every 
additive valuation of rank 1 of K is equivalent 
to an additive valuation obtained in this way 
from a non-Archimedean valuation. (This is 
why an additive valuation of rank 1 is called 
an exponential valuation.) Therefore a non- 
Archimedean valuation determines a valuation 
ring and valuation ideal in a natural manner. 
Thus we can identify a non-Archimedean 
valuation with an additive valuation of rank 1. 


D. Topology Defined by a Valuation 


Let w be a multiplicative valuation of a field 
K. When the 'distance between two elements 
a, b of K is defined by w(a— b), K becomes a 
*topological field. (Although this distance may 
not make K into a tmetric space, there exists a 
valuation w' equivalent to the valuation w 
such that K becomes a metric space with 
respect to the distance w'(a — b) between a and 
b (a, be K).) If K is tcomplete under the topol- 
ogy, then we say that K is complete with 
respect to w and w is complete on K. On the 
other hand, suppose that w' is an extension of 
w in a field K’ containing K. If w’ ts complete 
and K is tdense in K' under the topology de- 
fined by w’, then we say that the valuation w' 
is a completion of w and that the field K' is a 
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completion of K with respect to w. For any w, 
a completion exists and is unique up to iso- 
morphism. When w is a non-Archimedean 
valuation, the valuation ring of the completion 
of w is called the completion of the valuation 
ring of w. 

When v is an additive valuation of a field K, 
we can introduce a topology on K by taking 
the set of all nonzero ideals of the valuation 
ring R, of v as a base for the neighborhood 
system of zero. Important cases are given by 
valuations of rank 1, which are the same as 
those given by non-Archimedean valuations. 

If w is a complete non-Archimedean valu- 
ation of a field K, then the valuation ring R,, 
of w is a *Hensel ring, which implies that if K’ 
is a finite algebraic extension of K such that 
[K': K]—n, then w is uniquely extendable to a 
valuation w' of K’ and w'(a)' 2 w(N(a)), where 
N is the "nom Ny. 


E. Discrete Valuations 


For a non-Archimedean valuation (or an 
additive valuation of rank 1) w, if the valuation 
ideal of w is a nonzero *principal ideal gen- 
erated by an element p, then we say that p is a 
prime element for w, w is a discrete valuation, 
and the valuation ring for w is a discrete valu- 
ation ring. The condition on the valuation 
ideal of w holds if and only if the value group 
of w is a discrete subgroup of the (multiplica- 
tive) group T of positive real numbers: In the 
terminology of additive valuations, a valuation 
w is discrete 1f and only if it is equivalent to a 
valuation w' whose value group is the additive 
group of integers. Such a valuation w' is called 
a normalized valuation (or normal valuation). 
However, we usually mean normalization of a 
discrete non-Archimedean valuation as in 
Section H. Sometimes an additive valuation 
whose value group is isomorphic to the direct 
sum of a finite number of coptes of Z (the 
additive group of integers) with a natural 
lexicographic order is called a discrete valu- 
ation. Concerning a complete discrete valu- 
ation w, it is known that if the valuation ring 
of w contains a field, then it is isomorphic to 
the ring of tformal power series in one variable 
over a field (for other cases — 449 Witt Vec- 
tors A). 


F. Examples 


(1) Trivial valuations of a field K are the addi- 
tive valuation v of K such that v(a) — 0 for all 
ac K — (0) and the multiplicative valuation w 
of K such that w(a)- 1 for all ae K — {0}. 

(2) If K is isomorphic to a subfield of the 
complex number field, then we get an Archi- 
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medean valuation using the absolute value, 
and as stated in Section C, every Archimedean 
valuation of K is equivalent to a valuation 
obtained in this way. 

(3) Let p be a *prime ideal of a tDedekind 
domain R, nep be such that x ép?, and K be 
the field of quotients of R. Then each nonzero 
element a of K can be expressed in the form 
z'ab ! (reZ;a,beR;a,bép), where r, the degree 
of « with respect to p, is uniquely determined 
by «. Hence, letting c be a constant greater 
than 1, we obtain a non-Archimedean valu- 
ation w defined by w(a)=c ”. This valuation w 
is called a p-adic valuation. We also get an 
additive valuation v defined by v(x) — r, called 
a p-adic exponential valuation. The completion 
K, of K with respect to v is called the p-adic 
extension of K. If K is a finite "algebraic num- 
ber field, the K, is called a p-adic number field. 
If p is generated by an element p, then "p-adic" 
is replaced by "p-adic." For instance, given a 
rational prime number p, we have a p-adic 
valuation of the rational number field Q, and 
we obtain the p-adic extension Q, of Q, which 
is called the p-adic number field. Every non- 
zero element a of Q, can be written as a 
uniquely determined expansion X7, a, p" 

(a, Z0,reZ,a,e Z, 0 xa, « p) Then we obtain a 
valuation v of Q, defined by v(a) « r. This 
valuation v is a discrete additive valuation, 
and Q, is complete with respect to v. The 
valuation ring of v is usually denoted by Z,, 
which is called the ring of p-adic integers. Each 
element of Q, (Z,) is called a p-adic number (p- 
adic integer). 

(4) Consider the field of tpower series k((t)) 
in one variable t over a field k. For 0zae 
k((t)), we define v(x) 2 r ifa=L~,a,t” (a,€k, 
a, #0). Then v is a discrete valuation of k((t)), 
and k((t)) is complete with respect to this 
valuation. 

(5) Let v be an additive valuation of a field 
K with the valuation ring R, and the valuation 
ideal m,. Let v' be an additive valuation of the 
field R,/m, with the valuation ring R,.. Then 
R" ={aeR,|(amodm,)eR,,} is a valuation 
ring of K. A valuation v" whose valuation ring 
coincides with R" is called the composite of v 
and v. 


G. The Approximation Theorem and the 
Independence Theorem 


The approximation theorem states: Let 

Wi, ...,W, be mutually nonequivalent and 
nontrivial multiplicative valuations of a field 
K. Then for any given n elements a}, ...,a, of 
K and a positive number e, there exists an 
element a of K such that w,(a—a,)<é (i= 

1,2, ...,n). 
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From this follows the independence theorem: 
Let e,, ..., e, be real numbers, and let w; and 
K beas in the approximation theorem. If 
II;w;(a)i—1 for all ae K — {0}, then e, =... 
exp d): 

Similar theorems hold for additive valu- 
ations. The following independence theorem is 
basic: Let v,, ..., v, be additive valuations of a 
field K, R,, ..., R, their valuation rings, and 
nt;,...,m, their maximal ideals. Let D=();R;, 
p; =m, D, and consider the rings of quotients 
D,,. Then D, — R;. If R; R; (for i#j), then D 
has exactly n maximal ideals p,, ..., Pp. 


H. Prime Divisors 


Let K be an 'algebraic number field (algebraic 
function field of one variable over a field k). An 
equivalence class of nontrivial multiplicative 
valuations (over K) is called a prime divisor 
(prime spot) of K. 

If K is an algebraic number field of degree n, 
there are exactly n mutually distinct injections 
0,,...,0, of K into the complex number field 
C. We may assume that o;(K) is contained in 
the real number field if and only if i € r, and 
6, i44,(d) and 6, +i(a) are conjugate com- 
plex numbers (n—r, 2i»0, ae K). For iser, 
let v;i(a)  |o;(a)|, and for 1 <i<(n—r,)/2, let 
v, +i(4) — o, dall", Then v,, pn (75 — 
(n—r,)/2) is a maximal set of mutually non- 
equivalent Archimedean valuations of K. 
Equivalence classes of v,,...,v,, are called real 
(infinite) prime divisors, and those of ee 
v, +n are called imaginary (infinite) prime 
divisors; all of them are called infinite prime 
divisors. An equivalence class of non- 
Archimedean valuations of K is called a finite 
prime divisor. 

An Archimedean valuation of K is said to 
be normal if it is one of the valuations v;. If v is 
non-Archimedean, then v is a p-adic valuation, 
where p is a prime ideal of the principal order 
0 of K (— Section F, example (3)). Hence if a is 
an element of K, there exists a constant c (c > 
1) such that v(a)=c ", where r is the degree 
of a with respect to p. In particular, if c is the 
norm of p (i.e., c is the cardinality of the set 
0/p), then the valuation v is called normal. Any 
finite prime divisor is represented by a normal 
valuation. Then we have the product formula 
II. w(a)— 1 for all ae K — {0}, where w ranges 
over all normal valuations of K. 

For a function field, a normal valuation is 
defined similarly, using e^ instead of the norm 
of p, where e is a fixed real number greater 
than 1 and f is the degree of the residue class 
field of the valuation over k. In this case we 
also have the product formula. 
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I. Extending Valuations to an Algebraic 
Extension of Finite Degree 


Assume that a field K’ is a finite algebraic 
extension of a field K. Let v be an additive 
valuation of K and v’ be an extension of v to 
K'. We denote the valuation rings, valuation 
ideals, and value groups of v and v' by R,, Rp, 
m, m,,, and G, G’, respectively. Then the 
degree of the extension f, — TR. Am, R,/m,] is 
called the degree of v' over v. The group index 
e, —[G':G] is called the ramification index of 
v' over v. If v' ranges over all extensions of v in 
K', then the sum X f,.e,. is not greater than 
[K': K] and the equality holds when v is a 
discrete valuation and either K’ is tseparable 
over K or v is complete. 


J. Places 


Let k, K, and L be fields, and suppose that k 
cK. Let f be a mapping of K onto LU {0} 
such that f(ab)= f(a)f(b) and f(a+b)= f(a) 

+ f (b), whenever the right member is meaning- 
ful, and such that the restriction of f to k is an 
injection. Here oo is an element adjoined to L 
and satisfying oo +a =a + oo — oc, ooa — aoo 

= oo (for any nonzero element a of EL 1/00 
—0, and 1/0 = æ. Then f is called a place of K 
over k. In this case R2 (xeK|f(x) zoo] isa 
valuation ring of K containing k. Let m be the 
maximal ideal of R. Then f can be identified 
with the mapping g: K ^ R/mU {oo} defined as 
follows: If ae R, then g(a)=(amod m); other- 
wise, g(a) — oo. Places of K over k can be 
classified in a natural way, and there exists a 
one-to-one correspondence between the set 

of classes of places of K over k and the set 

of equivalence classes of additive valuations 
over k. When K is an talgebraic function field, 
we usually consider the case where k is the 
‘ground field. Then if a, ..., a, € R, (a4, ...,a,) 
—(g(a;),...,g(a,)) gives a tspecialization of 
points over k. Conversely, if a;, bek are 

such that (a;, ...,a,) (b, ..., b,) is a speciali- 
zation over k, then there is a place f of K 

over k such that (b,, ..., b,) is isomorphic to 
(f(a), ---, f(a,)) (usually there are infinitely 
many such f’s). 


K. Pseudovaluations 


A pseudovaluation ¢ of a ring A (not neces- 
sarily commutative) is a mapping of A into the 
set of nonnegative real numbers satisfying the 
following four conditions: (i) (a) «0 if and 
only if a=0; (ii) o(ab) € o(a)o(b); (iii) pla 1 b) « 
p(a)+ (b); and (iv) e(—a)- (a). These con- 
ditions are weaker than those for multipli- 
cative valuations, but with them a topology 
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can be introduced into A as in Section D, with 


respect to which A becomes a topological ring. 


L. History 


The theory of valuations was originated by K. 
Hensel when he introduced p-adic numbers 
and applied them to number theory [1]. J. 
Kürschák (J. Reine Angew. Math., 142 (1913)) 
first treated the theory of multiplicative valu- 
ations axiomatically; it was then developed 
remarkably by A. Ostrowski (Acta Math., 41 
(1918)). However, in their theory condition (iii) 


(— Section C) was given only in the case C — 1, 


thus excluding the normal valuation of an 
imaginary prime divisor in an algebraic num- 
ber field. A valuation with general C was 
introduced by E. Artin [3]. The theory of ad- 
ditive valuations was originated by W. Krull 
(J. Reine Angew. Math., 167 (1932)), although 
the concept of exponential valuations existed 
before. The theory of valuations is used to 
simplify *class field theory and the theory of 
algebraic function fields in one variable. For 
these purposes, the notion of multiplicative 
valuations is sufficient (— 9 Algebraic Curves; 
59 Class Field Theory). The idea is also used 
in the theory of normal rings and in alge- 
braic geometry, for both of which the con- 
cept of additive valuations is also necessary. 
Pseudovaluations were used by M. Deuring 
(Erg. Math., Springer, 1935) in the arithmetic 
of algebras. 


References 


[1] K. Hensel, Theorie der algebraischen 
Zahlen, Teubner, 1908. 

[2] O. F. G. Schilling, The theory of valu- 
ations, Amer. Math. Soc. Math. Surveys, 1950. 
[3] E. Artin, Algebraic numbers and algebraic 
functions, Gordon & Breach, 1967. 

[4] O. Zariski and P. Samuel, Commutative 
algebra Il, Van Nostrand, 1960. 

Also - references to 67 Commutative Rings. 


440 (X.35) 
Variational Inequalities 


A. Introduction 


Variational inequalities arise when we con- 
sider extremal problems of functionals under 
unilateral constraints. Some problems in 
physics and engineering are studied by for- 
mulating them as elliptic, parabolic, and 
hyperbolic variational inequalities [1-8]. 
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B. Stationary Variational Inequality 


Let D be a bounded domain in m-dimensional 
Euclidean space and f € L,(D) be a given real- 
valued function. Consider the variational 
problem of minimizing the following func- 
tional J with the argument function v: 


J[v] -| Igrad Pda] fv dx. 
D D 


Here, we suppose the set of admissible func- 
tions to be the closed convex subset 


K «ive Ho (D)| v&O a.e. in D) 


of the Hilbert space Hå (D) (— 168 Function 
Spaces). It can be shown by choosing a mini- 
mizing sequence that there exists a minimum 
value of J which is realized by a unique ue K. 
Since the stationary function u belongs to 

Hj (D), it can be shown that the boundary 
condition u|;p —O is satisfied in the sense that 
the *trace youe H?(0D) (— 224 Interpolation 
of Operators) of u on D vanishes a.e. on ôD. 
In view of the fact that J[u] x J[v] is valid for 
any ve K, it can be verified that the stationary 
variational inequality 


—Au—f x0 
u<0 (1) 
(—Au—f)-u=0 


is satisfied in D in the sense of differentiation 
of distributions (— 125 Distributions and 
Hyperfunctions). The problem (1) is a Dirichlet 
problem with obstacle. Moreover, we can prove 
the regularity of ue H?(D) under an assump- 
tion of suitable smoothness for OD by estab- 
lishing the boundedness of the solutions u, in 
H?(D) of the penalized problems associated 
with (1): 


Í 
—Au,+-u; =f (e>0), 
E 


uslap =Q. 


Here we note that the u, are the stationary 
functions of the ordinary variational problems 
of minimization in H (D) of the functionals 


1 
t= | jeradedx—2 | rien [lot Pas 
D D e 


with the penalty term (the third term of the 
right-hand side of the equality above). We 
have thus found that the stationary variational 
inequality (1) is the Euler equation of a con- 
ditional problem of variation (— 46 Calculus 
of Variations). 


C. Variational Inequality of Evolution 


Let ye H' (D) be a given function on D such 
that y |p Z0 and Ay e L,(D). The variational 
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inequality of evolution 


Cu 
— Au « 0, 
Ot 


C 





u «wy, 
(San) u—y=0 (t>0,xeD), 


u(0, x)= a(x) (xe D), 


u(t, X)|;p70 (r0) 


can be formulated as an abstract Cauchy 
problem (— 286 Nonlinear Functional Analy- 
sis X) 


du 
— eE Au 


d 
d (t 7 0), 


u(-+0)=a 


in a Hilbert space with a multivalued operator 
A= —6q, where 09 is the subdifferential of the 
following lower semicontinuous proper convex 
function on the Hilbert space L;(D): 


qv) 


1 ; 
| |gradv|?dx if ve Hj(D) and re, 
D 


+00 otherwise. 


Thus the solution u is given by the vector- 
valued function 


u(t) — e'^ a. 


Here ei? is the tnonlinear semigroup generated 
by A (— 88 Convex Analysis, 378 Semigroups 
of Operators and Evolution Equations). 


D. Optimal Stopping Time Problem and 
Variational Inequalities 


Let {X,},59 be an m-dimensional Brownian 
motion (— 45 Brownian Motion) and consider 
the problem of finding a ‘stopping time o that 
minimizes 


J«[o] - E. (| ra) (xe R") 
0 


under the restriction that 0x o € o5, where 
Gap is the *hitting time for the boundary OD. 
Let us define 


u(x) — min J, [o]. 


Then the ‘principle of optimality in dynamic 
programming gives the stationary varia- 
tional inequality (1) with A replaced by 1A, 
and we can show by the tDynkin formula that 
an optimal stopping time ó is the hitting time 
for the set (xeO|u(x) 20] (^ 127 Dynamic 
Programming). We can systematically discuss 
problems in mathematical programming and 
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operations research by introducing quasivaria- 
tional inequalities, which are slight generaliza- 
tions of variational inequalities (— 227 Inven- 
tory Control, 408 Stochastic Programming). 
The above-mentioned facts are applicable to 
general ‘diffusion processes described by *sto- 
chastic differential equations (— 115 Diffusion 
Processes, 406 Stochastic Differential Equa- 


tions). We have thus found the relation 


free boundary problem «variational inequality 
optimal stopping time problem 


(— 405 Stochastic Control and Stochastic 
Filtering). 


E. Numerical Solution of Variational 
Inequalities 


Since the solution u of the variational inequal- 
ity (1) is the stationary function for the vari- 
ational problem, we can apply to the evalu- 
ation of the function u numerical methods 
based on the direct method of the calculus of 
variations (— 300 Numerical Methods). The 
*finite element method, which can be regarded 
as a type of Ritz-Galerkin method, is exten- 
sively employed to calculate numerical solu- 
tions. In view of the unilateral constraint 

u « 0, iteration methods, such as the Gauss- 
Seidel iteration method, are used with modifi- 
cations. An algorithm of relaxation with pro- 
jection is proposed in [3] (— 304 Numerical 
Solution of Partial Differential Equations). 
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A. General Remarks 


Among the principles that appear in physics 
are those expressed not in terms of differen- 
tial forms but in terms of variational forms. 
These principles, describing the conditions 
under which certain quantities attain ex- 
tremal values, are generally called variational 
principles. Besides Hamilton's principle in 
classical mechanics (— Section B) and Fermat's 
principle in geometric optics (— Section C), 
examples are found in *electromagnetism, 
'trelativity theory, *quantum mechanics, ‘field 
theory, etc. Independence of the choice of 
coordinate system is an important character- 
istic of variational principles. Originally these 
principles had theological and metaphysical 
connotations, but a variational principle is 
now regarded simply as a postulate that pre- 
cedes a theory and furnishes its foundation. 
Thus a variational principle is considered to 
be the supreme form of a law of physics. 


B. Mechanics 


In 1744 P. L. Maupertuis published an almost 
theological thesis, dealing with the principle 
of least action. This was the beginning of the 
search for a single, universal principle of me- 
chanics, contributions to which were made 
successively by L. Euler, C. F. Gauss, W. R. 
Hamilton, H. R. Hertz, and others. 

Let {q,} be the tgeneralized coordinates of a 
system of particles, and consider the integral of 
a function L(q,, q,,t) taken from time tọ to t,. 
If we compare the values of the integral taken 
along any arbitrary path starting from a fixed 
point P, in the coordinate space at time ty and 
arriving at another fixed point P, at time f,, 
then the actual motion q,(t) (which obeys the 
laws of mechanics) is given by the condition 
that the integral is an *extremum ('stationary 
value), that is, AU) Ldt =0, provided that the 
function L is properly chosen. This is Hamil- 
ton’s principle, and L is the ‘Lagrangian func- 
tion. In tNewtonian mechanics, the tkinetic 
energy T of a system of particles is expressed 
as a fquadratic form in g,. Furthermore, if the 
forces acting on the particles can be given by 
— grad V, where the potential V does not de- 
pend explicitly on g,, we can choose L= T— 
V. Also, for a charged particle in *special rela- 
tivity, we can take L= —moc?(1— v?/c?)? — 
ep 4- e(v: A), where m, is the rest mass of the 
particle, e is the charge, v is the velocity (with 
v its magnitude), c is the speed of light in 
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vacuum, and o and A are the scalar and 
vector potentials of the electromagnetic field, 
respectively. 

In general relativity theory, the motion of a 
particle can be derived from the variational 
principle ô [ds =0 (ds is the Riemannian line 
element). Hence, geometrically, the particle 
moves along a tgeodesic curve in 4-dimen- 
sional space-time. 


C. Geometric Optics 


The path of a light ray between two points P, 
and P, (subject to reflection and refraction) is 
such that the time of transit along the path 
among all neighboring virtual paths is an 
extremum (stationary value). This is called 
Fermat’s principle. If the index of refraction 

is n, Fermat's principle can be expressed as 

ô [5: nds =0 (ds is the Euclidean line element). 
The laws of reflection and refraction of light, 
as well as the law of rectilinear propagation of 
light in homogeneous media, can be derived 
from this principle. 


D. Field Theory 


Not only the equations of motion of a system 
of particles, but also various field equations 
(tMaxwell’s equations of the electromagnetic 
field, *Dirac's equation of the electron field, the 
meson field equation, the gravitational field 
equation, etc.) can be derived from variational 
principles in terms of appropriate Lagrangian 
functions. In *field theory the essential virtue 
of the variational principle appears in the fact 
that the properties of various possible fields as 
well as conservation laws can be systematically 
discussed by assuming relativistic invariance 
and gauge invariance of the Lagrangian func- 
tions adopted. In particular, for an electro- 
magnetic field in vacuum, the Lagrangian 
function density is L —(H? — E?)/2, and the 
integration Is carried out over a certain 4- 
dimensional domain. 


E. Quantum Mechanics 


If H is the Hamiltonian operator for any 
quantum-mechanical system, the eigenfunc- 
tion y can be determined by the variational 
principle 


landen with | var 1, 


where y is the complex conjugate of y and dr 
is the volume element. Based on this varia- 
tional principle, the *direct method of the 
calculus of variations is often employed for 


441 F 
Variational Principles 


an approximate numerical calculation of the 
energy eigenvalues and eigenfunctions. In par- 
ticular, by restricting the functional form of 

y to the product of one-body wave functions, 
we can obtain Hartree’s equation. A further 
suitable symmetrization of y leads to Fock’s 
equation. 


F. Statistical Mechanics 


Let q be a statistical-mechanical state of a 
system, and let Stol and E(q) be the state's 
entropy and energy (mean entropy and mean 
energy for an infinitely extended system); T 

is the thermodynamical temperature, and 
f(q)- E(q) — TS(q) is the free energy. Then the 
equilibrium state for Tz 0 is determined as the 
state o that gives the minimum value of f(q) 
(maximum for T « 0). 
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A. Definitions 


The vector concept originated in physics from 
such well-known notions as velocity, acceler- 
ation, and force. These physical quantities are 
supplied with length and direction; they can be 
added or multiplied by scalars. In the Euclid- 
ean space E" (or, in general, an taffine space), 

a vector a is represented by an oriented seg- 
ment pq. Two oriented segments pq; and 
Poq are considered to represent the same 
vector a if and only if the following two con- 
ditions are satisfied: (1) The four points p,, q,, 
P2. q2 lie in the same plane z. (2) p,41/ /p54; 
and p; q2/ /q,q,. Hence a vector in E" is an 
equivalence class of oriented segments pġ, 
where the equivalence relation p,q; ~ Dad 

is defined by the two conditions just given. 
Hereafter, we denote the vector by [pq], or 
simply pq. The points p and q are called the 
initial point and terminal point of the vector pd. 
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Given a vector a — pj and a real number J, 
we define the scalar multiple Aa as the vector 
př, where r is the point on the straight line 
containing both p and q such that the ratio 
[ pr: pq] is equal to A (if p 2q, then we put r= 
p). The operation (4, a) Aa is called scalar 
multiplication. Given two vectors a = pd and 
b= 45, the vector c — ps is called the sum of a 
and b and is denoted by c —a +b. The vector 
pp =0 is called the zero vector. If a — pj, we put 
—a- qp. 

Scalar multiplication and addition of vec- 
tors satisfy the following seven conditions: (1) 
a+b=b+a (commutative law); (2) a - (b-- c) - 
(a+ b) - c (associative law); (3) a +0 — a; (4) 
for each a there is —a such that a - ( — a) — 0; 
(5) (a - b) 2 7a + Ab, (A+ wa= åa + pa (dis- 
tributive laws); (6) 4(ua) — (4u)a (associative 
law for scalar multiplication); and (7) la=a. 
Hence the set V of all vectors in E" forms a 
treal linear space. Sometimes, a set satisfying 
(1)-(7), that is, by definition, a linear space, is 
called a vector space, and its elements are 
called vectors. 

The pair consisting of a vector pd and a 
specific initial point p of pq is sometimes called 
a fixed vector. An illustration of this is given 
by the force vector with its initial point being 
where the force is applied. By contrast, a vec- 
tor pq is sometimes called a free vector. If we 
fix the origin o in E", then for any point p in 
E", the vector op is called the position vector of 
p. 

If two vectors a =p and b —oq are ‘linearly 
dependent, they are sometimes said to be 
collinear. If there vectors a — op, b — od, and 
€ — or are linearly dependent, they are some- 
times said to be coplanar. 

If a set of vectors e,, ..., e, forms a tbasis of 
a vector space V, then the vectors e; are called 
fundamental vectors in V. Each vector ae V is 
uniquely expressed as a= > o;e; (x;e R). We 
call (x,,...,«,,) the components of the vector 
a with respect to the fundamental vectors 
€j,- €p- 


B. Inner Product 


In the Euclidean space E”, the length of the 
line segment pd is called the absolute value (or 
magnitude) of the vector a — pj and is denoted 
by |a|. A vector of length one is called a unit 
vector. For two vectors a — op and b— oq, the 
value (a, b}= |a| |b| cos0 is called the inner 
product (or scalar product) of a and b, where 0 
is the angle / poq. Instead of (a, b), the nota- 
tions a*b, or ab are also used. If neither vec- 
tor a nor vector b is equal to 0, then (a, b) - 0 
implies / poq — z/2, that is, the orthogonality 
of the two vectors op and od If we take an 
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*orthonormal basis (e,, ..., e,) in E" (De, a set 
of fundamental vectors with |e;|= 1, (e, e) —0 
(ij), the inner product of vectors a = X oe;, 
b — Y fe; is equal to Xf- o; f;. The inner 
product has the following three properties (i) 
(x, x) 20 and is zero if and only if x = 6; (ii) 

(x, y) - (y. x); (iii) (x, TL EE y) 2 (x4. y) - (x2. y). 
(ax, y) 7 a(x, y) («€ R). Similar linearity holds 
for y. 

Generally, an R-valued *biltnear form (x, y) 
on a linear space V satisfying the previous 
three conditions is also called an inner prod- 
uct. If a linear space V is equipped with an 
inner product, the space is called an inner 
product space (— 256 Linear Spaces H; 197 
Hilbert Spaces). If V is an inner product space, 
the absolute value |x| of xe V is defined to be 


Sx x). 





C. Vector Product 


In the 3-dimensional Euclidean space E?, we 
take an orthonormal basis e,, e;, e;. Let a and 
b be vectors in E? whose components with 
respect to e;, ej, e, are (x1, X2. %3), (Bis Bo, Bs). 
The vector 


X2 X3 Di 0 
B; Bs B, p. 


which ts symbolically written as 


d veda 
€, e; €3, 
Bs By 




















e; €&j Ei 
4, o 


Xi H 
D PG 


is called the exterior product or vector product 
of a and b and is denoted by [a,b] or a x b. 
The vector [a, b] is determined uniquely up to 
its sign by a and b and is independent of the 
choice of the orthonormal basis. 

Suppose that we have a = op, b = od. Then 
[[a, b]| —1Ja|- |b| sin 0, where 0 — Z poq. Also 
|[a, b]| is equal to the area of the parallel- 
ogram determined by a and b. To illustrate the 
orientation of [a,b], we sometimes use the idea 
of a turning screw. That is, the direction of a 
right-handed screw advancing while turning at 
0 from p to q (within the angle less than 180^) 
coincides with the direction of [a,b] (Fig. 1). 
The exterior product has the following three 
properties: (1) [a,b] = —[b. a] (antisymmetric 
law); (2) [4a, b] = ^ [a, b] (associative law for 


axb 
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scalar multiplication); (3) [a, b + c] = [a, b] + 
[a,c] (distributive law). The vector product 
does not satisfy the associative law, but it does 
satisfy the *Jacobi identity [a, [b, c] ] + [b, 
[c,a]]+ [c, [a, b] ] 2 0. The vector [a, [b,c]] is 
sometimes called the vector triple product, 

and for this we have Lagrange's formula [a, 
[b.c] ] =(a, c) b — (a, b)c. 

Let a, b, c be vectors in E? whose compo- 
nents with respect to an orthonormal funda- 
mental basis are (%1, %2, %3), (f,, B2, B3), and 
(15 32.73). Then (a, [b, c]) =(b, [c,a]) 2 (c. [a, b]) 
— [a. b, c], and the common value is equal to 
the determinant of the 3 x 3 matrix 


o X Os 
|^ f; Bs 


71 EE Ri 





The value denoted by [a, b, c] is called the 
scalar triple product of a, b, c and is equal to 
the volume of the parallelotope whose three 
edges are a — op, b — od, and c— or with com- 
mon initial point o. The triple a, b, c is called a 
right-hand system or a left-hand system ac- 
cording as [a, b, c] is positive or negative. We 
have [a, b, c] 20 if and only if a, b and c are 
coplanar. (For the *exterior product of vectors 
in E" and the concept of tp-vectors — 256 
Linear Spaces O.) 


D. Vector Fields 


In this section we consider the case of a 3- 
dimensional Euclidean space E? (for the gen- 
eral case — 105 Differentiable Manifolds). A 
scalar-valued or a vector-valued function 
defined on a set D in E? is called a scalar field 
or a vector field, respectively. The continuity 
or the differentiability of a vector field is de- 
fined by the continuity or the differentiability 
of its components. 

For a differentiable scalar field f(x, y, z), 
the vector field with the components (f/x, 
Of /C y, Cf/0z) is called the gradient of f and 
is denoted by grad f. For a differentiable vec- 
tor field V(x, y, z) whose components are 
(u(x, y, Z}, v(x, y, zZ), w(x, y, z)), the vector field 
with components 


E Ov Ou Ow Ov =) 


is called the rotation (or curl) of V and is de- 
noted by rot V (or curl V). Also, for a differenti- 
able vector field V, the scalar field defined by 
Ou/Ox + Ov/Oy + Ów/0z is called the divergence 
of V and is denoted by div V. Utilizing the 
vector operator V having differential operators 
(C/€x, 0/0Oy, C/0z) as its components, we may 
write simply grad f= Vf, div V =(V, V), rot V = 
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[V, V]. The symbol V is called nabla, atled 
(inverse of delta), or Hamiltonian. 

A vector field V with rot V =0 is said to be 
irrotational, (lamellar, or without vortex). A 
vector field V with div V =0 is said to be sole- 
noidal (or without source). Thus grad f is irro- 
tational and rot V is solenoidal. In a small 
neighborhood or in a tsimply-connected do- 
main, an irrotational field is a gradient, a 
solenoidal field is a rotation, and an arbitrary 
vector field V is the sum of these two kinds of 
vector fields: V = grad o + rotu (Helmholtz 
theorem); the function ¢ is called the scalar 
potential of V, and the vector field u is called 
the vector potential of V. Furthermore, the 
operator V? = VV = div grad = 0?/0x? + 0?/0y? 
--0?/0z? is called the Laplace operator (or 
Laplacian) and is denoted by A. A function 
that satisfies Ag =0 is called a *harmonic 
function. Locally, an irrotational and solenoi- 
dal vector field is the gradient of a harmonic 
function. If A is a vector field whose compo- 
nents are (9,, 95, 93) Ge., A(v) =(P (Y), Po), 
Q4(v))), we can let A operate on A by setting 
AA — (AQ, ^Q5, ^o4). We then have AA = 
V?A — graddiv A — rotrot A. 

Suppose that we are given a vector field V 
and a curve C such that the vector V(p) is 
tangent to the curve at each point pe C. The 
curve C is the *integral curve of the vector field 
V and is called the vector line of the vector 
field V. The set of all vector lines intersecting 
with a given closed curve C is called a vector 
tube. Given a closed curve C and a vector field 
V, the tcurvilinear integral ftv, ds) (where ds is 
the line element of C) is called the circulation 
(of V) along the closed curve C. A vector field 
is irrotational if its circulation along every 
closed curve vanishes; the converse is true in a 
simply connected domain. Further, let v, be 
the tnormal component of a vector field V 
with respect to a surace S, and let dS be the 
volume element of the surface. We put ndS = 
dS, where n is the unit normal vector in the 
positive direction of the surface S. Then the 
*surface integral (v,dS = f (V, dS) is called the 
vector flux through the surface S. A vector 
field whose vector flux vanishes for every 
closed surface is solenoidal. (For the corre- 
sponding formulas — 94 Curvilinear Integrals 
and Surface Integrals. For generalizations to 
higher-dimensional manifolds — 105 Dif- 
ferentiable Manifolds; 194 Harmonic Integrals; 
Appendix A, Table 3.) 
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A. General Remarks 


Integrals whose values are elements (or sub- 
sets) of *topological linear spaces are gen- 
erally called vector-valued integrals or vec- 

tor integrals. As in the scalar case, there are 
vector-valued integrals of Riemann type (— 37 
Banach Spaces K) and of Lebesgue type. In 
this article we consider only the latter. There 
are cases where integrands are vector-valued, 
where measures are vector-valued, and where 
both are vector-valued. The methods of inte- 
gration are also divided into the strong type, 
in which the integrals are defined by means of 
the original topology of the topological linear 
space X, and the weak type, in which they 

are reduced to numerical integrals by apply- 
ing continuous linear functionals on X. Com- 
bining these we can define many kinds of 
integrals. 

Historically, D. Hilbert's ‘spectral resolution 
is the first example of vector-valued integrals, 
but the general theory of vector-valued inte- 
grals started only after S. Bochner [1] defined 
in 1933 an integral of strong type for functions 
with values in a Banach space with respect to 
numerical measures. Then G. Birkhoff [2] 
defined a more general integral by replacing 
absolutely convergent sums with uncondi- 
tionally convergent sums. At approximately 
the same time, N. Dunford introduced inte- 
grals equivalent to these. Later, R. S. Phillips 
(Trans. Amer. Math. Soc., 47 (1940)) general- 
ized the definition to the case where values 
of functions are in a "locally convex topolog- 
ical linear space, and C. E. Rickart (Trans. 
Amer. Math. Soc., 52 (1942)) to the case where 
functions take subsets of a locally convex 
topological linear space as their values. The 
theory of integrals of weak type for functions 
with values in a Banach space and numerical 
measures was constructed by I. M. Gelfand 
[3], Dunford [4], B. J. Pettis [5], and others 
(1936-1938). N. Bourbaki [6] dealt with the 
case where integrands take values in a locally 
convex topological linear space. As for inte- 
grals of numerical functions by vector-valued 
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measures, a representative of strong type inte- 
grals is the integral of R. G. Bartle, Dunford, 
and J. T. Schwartz [7] (1955). Weak type 
integrals have been discussed by Bourbaki [6], 
D. R. Lewis (Pacific J. Math., 33 (1970)), and 
I. Kluvánek (Studia Math., 37 (1970)). The 
bilinear integral of Bartle (Studia Math., 15 
(1956)) is typical of integrals in the case where 
both integrands and measures are vector- 
valued. For interrelations of these integrals — 
the papers by Pettis and Bartle cited above 
and T. H. Hildebrandt report in the Bulletin 
of the American Mathematical Society, 59 
(1953). 

Since the earliest investigations [1—3] the 
main aim of the theory of vector-valued inte- 
grals has been to obtain integral representa- 
tions of vector-valued (set) functions and 
various linear operators [8]. However, there is 
the fundamental difficulty of the nonvalidity of 
the Radon-Nikodym theorem. Whatever de- 
finition of integrals we take, the theorem does 
not hold for vector-valued set functions un- 
conditionally. Many works sought conditions 
for functions, operators, or spaces such that 
the conclusion of the theorem would be re- 
stored; the works of Dunford and Pettis [9] 
and Phillips (Amer. J. Math., 65 (1943)) marked 
a summit of these attempts. Later, after A. 
Grothendieck's investigations (1953-1956), 
this problem began to be studied again, begin- 
ning in the late 1960s, by many mathemati- 
cians (— J. Diestel and J. J. Uhl, Jr., Rocky 
Mountain J. Math., 6 (1976); [10]). 

Recently, integrals of multivalued vector- 
valued functions have also been employed in 
mathematical statistics, economics, control 
theory, and many other fields. Some contri- 
butions are, besides Rickart cited above, G. B. 
Price (Trans. Amer. Math. Soc., 47 (1940), H. 
Kudo (Sci. Rep. Ochanomizu Univ., 4 (1953)), 
H. Richter (Math. Ann., 150 (1963)), R. J. Au- 
mann [11], G. Debreu [12], and M. Huku- 
hara (Funkcial. Ekvac., 10 (1967)). Further- 
more, C. Castaing and M. Varadier [13] have 
defined weak type integrals of multivalued 
functions and introduced many results con- 
cerning them. In the following we shall give 
explanations of typical vector-valued integrals 
with values in a Banach space only. 


B. Measurable Vector-Valued Functions 


Let x(s) be a function defined on a tø-finite 
measure space (S, €, u) with values in a 
Banach space X. This is called a simple func- 
tion or finite-valued function if there exists a 
partition of S into a finite number of mutually 
disjoint measurable sets A,, 45, ..., A, in each 
of which x(s) takes a contant value c;. Then 
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x(s) can be written as Èj- cjx a(S} where x as) 
is the tcharacteristic function of A,. A func- 
tion x(s) is said to be measurable or strongly 
measurable if it is the strong limit of a se- 
quence of simple functions almost everywhere, 
that is, Im, ||x,(s)—x(s)|| =0 a.e. Then the 
numerical function ||x(s)]| is measurable. If u is 
a "Radon measure on a compact Hausdorff 
space S, then the measurable functions can be 
characterized by *Luzin's property (— 270 
Measure Theory I). 

A function x(s) is said to be scalarly measur- 
able or weakly measurable if the numerical 
function (x(s), x'» is measurable for any tcon- 
tinuous linear functional x'e X’. A function x(s) 
is measurable if and only if it is scalarly mea- 
surable and there are a ‘null set £j cS and a 
tseparable closed subspace Y c X such that 
x(s)e Y whenever sé E, (Pettis measurability 
theorem). 


C. Bochner Integrals 


A measurable vector-valued function x(s) is 
said to be Bochner integrable if the norm 

\|x(s)|| is tintegrable. If x(s) is a Bochner inte- 
grable simple function X c; Za,(s), then its Boch- 
ner integral is defined by 


| x(s)du = Y WIER 
s 


For a general Bochner integrable func- 

tion x(s) there exists a sequence of simple 
functions satisfying the following condi- 
tions: (i) Dm, lx, (s) — x(s)|| =0 a.e. (ii) 
lim, «fs 1x,(5) — eil du=0. Then jsx,(s)dy 
converges strongly and its limit does not de- 
pend on the choice of the sequence {x,(s)}. We 
call the limit the Bochner integral of x(s) and 
denote it by fs x(s)du or by (Bn) fs x(s)du to 
distinguish it from other kinds of integrals. 

A Bochner integrable function on S is Boch- 
ner integrable on every measurable subset 

of S. The Bochner integral has the basic prop- 
erties of Lebesgue integrals, such as linear- 
ity, complete additivity, and *absolute con- 
tinuity, with absolute values replaced by 
norms. *Lebesgue's convergence theorem and 
*Fubinr's theorem also hold. However, the 
Radon-Nikodym theorem does not hold in 
general (— Section HL Let T be a ‘closed 
linear operator from X to another Banach 
space Y. If both x(s) and Tx(s) are Bochner 
integrable, then the integral of x(s) belongs 
to the domain of T and 


"(| AE Tx(s)dy. 
S S 


If, in particular, T is bounded, then the as- 
sumption is always satisfied. If ji is the "Le 
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besgue measure on the Euclidean space R", 
then Lebesgue’s differentiability theorem 
holds for the Bochner integrals regarded as 
a set function on the regular closed sets (— 
380 Set Functions D). 


D. Unconditionally Convergent Series 


Let 22, x; be a series of elements x; of a 
Banach space X. It is said to be absolutely 
convergent if È | x;|| < co. It is called uncon- 
ditionally convergent if for any rearrangement 
a the resulting series È x, converges strongly. 
Then the sum does not depend on «. Clearly, 
an absolutely convergent series is uncondition- 
ally convergent. If X is the number space or is 
finite-dimensional, then the converse holds. 
However, if X 1s infinite-dimensional, there 

is always an unconditionally convergent 

series which is not absolutely convergent 
(Dvoretzky-Rogers theorem). 

A series È x; is unconditionally convergent 
if and only if each subseries converges weakly 
(Orlicz-Pettis theorem). If x; is an uncon- 
ditionally convergent series, then Xx; x’) 
converges absolutely for any continuous linear 
functional x'e X'. If X is a Banach space con- 
taining no closed linear subspace isomorphic 
to the *sequence space co, then conversely a 
series È x; converges unconditionally when- 
ever 2| (xj, x'?| « oo for any x'e X' (Bessaga- 
Pełczyński theorem). A Banach space that is 
*sequentially complete relative to the weak 
topology, such as a freflexive Banach space, 
and a separable Banach space that is the dual 
of another Banach space, such as /, and the 
*Hardy space H,(R"), satisfy the assump- 
tion, while co, /,., and L,,(Q) for an infinitely 
divisible Q do not. The totality of absolutely 
convergent series (resp. unconditionally con- 
vergent series) in X ts identified with the *topo- 
logical tensor product /, & X (resp. !, 69 X) 
(Grothendieck). 


E. Birkhoff Integrals 


We say that a series 3 B; of subsets of X con- 
verges unconditionally if for any x;e B; the 
series È x; converges unconditionally. Then 
È B; denotes the set of such sums. A vector- 
valued function x(s) is said to be Birkhoff 
integrable if there is a countable partition A: 
S=| J2 Aj (4e &, A;N A, = GI (JFK), WA) < 
oo) such that the set x(A,) of values on A; are 
bounded and È u(A4j)x(Aj) converges uncon- 
ditionally and if the sum converges to an ele- 
ment of X as the partition is subdivided. The 
limit is called the Birkhoff integral of x(s) 
and is denoted by (Bk) |, x(s) dj or simply by 
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{sx(s)du. A Birkhoff integrable function is 
Birkhoff integrable on any measurable set. The 
Birkhoff integral has, as a set function, com- 
plete additivity and absolute continuity mu. 

It is linear in the integrand but Fubini's theo- 
rem and the Radon-Nikodym theorem do not 
hold. A Bochner integrable function is Birk- 
hoff integrable, and the integrals coincide. The 
converse does not hold. 


F. Gel'fand-Pettis Integrals 


A scalarly measurable function x(s) is said to 
be scalarly integrable or weakly integrable if 
for each x'e X’, (x(s), x'» is integrable. Then 
the linear functional x* on X' defined by 


| (x(s) x >du=<x', x*» 
S 


is called the scalar integral of x(s). Gel'fand [3] 
and Dunford [4] proved that x* belongs to 
the bidual X". Hence scalarly integrable func- 
tions are often called Dunford integrable and 
the integrals x* the Dunford integrals. More 
generally, Gel'fand [3] showed that if x'(s) 

is a function with values in the dual X' ofa 
Banach space X such that <x, x’(s)> is inte- 
grable for any xe X, then there is an x'e X’ 
satisfying 


| (x, X'(s)» du 2 (x, x 5. 
s 


This element is sometimes called the Gel'fand 
integral of x'(s). A scalarly integrable func- 
tion x(s) is scalarly integrable on any measur- 
able subset A. If the scalar integral is always in 
X, i.e., for each A there is an x,€X such that 


| (x, x'(s)> du=<x4,X'>, x' e X', 
A 


then x(s) is said to be Pettis integrable or 
Gel'fand-Pettis integrable and x, is called the 
Pettis integral or Gel'fand-Pettis integral on A 
and is denoted by (P) f, x(s)dg or simply by 

fa x(s)du. The Pettis integral has complete 
additivity and absolute continuity as a set 
function, similarly to the Birkhoff integral. 
Again, Fubini's theorem and the Radon- 
Nikodym theorem do not hold. The scalar 
integral on measurable sets of a scalarly inte- 
grable function x(s) is completely additive and 
absolutely continuous with respect to the 
*weak* topology of X" as the dual to X’. It is 
completely additive or absolutely continuous 
in the norm topology if and only if x(s) is 
Pettis integrable (Pettis [5]; [10]). If x(s) is 
Pettis integrable and f(s) is a numerical func- 
tion in L,(S), then the product f(s)x(s) is 
Pettis integrable. Birkhoff integrable functions 
are Pettis integrable, and the integrals coin- 
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cide. Conversely, if a measurable function is 
Pettis integrable, then it is Birkhoff integrable. 
When X satisfies the Bessaga-Pelczynski con- 
dition (— Section D), a measurable scalarly 
integrable function is Pettis integrable. 


G. Vector Measures 


Let ® be a set function defined on a com- 
pletely additive class S of subsets of the space 
S and with values in a Banach space X. It is 
called a finitely additive vector measure (resp. a 
completely additive vector measure or simply a 
vector measure) if ®(A, UA,)=(A,)+ 6(A;) 
whenever A, and A,€©& are disjoint (resp. 

bt 721 A) = LL; (Aj) in the norm topology 
for all Aje Š such that A;N A, = Ø (j #k)). We 
remark that the latter sum always converges 
unconditionally. A set function ® ts completely 
additive if and only if <®(A), x'» is completely 
additive for all x'e X’ (Pettis complete additiv- 
ity theorem). 

Let be a finitely additive vector measure 
and E be a measurable set. The total variation 
of ® on E and the semivariation of ® on E are 
defined by 


n 


V(®)(E)=sup 2. (A) (1) 
and 
| ||(E) 2 sup 2, 2/9(A) e (2) 














respectively, where the suprema are taken over 
all finite partitions of E: E =|] A; (4je &, A; 
A, — (ij  k)) and all numbers x; with |oj| « 1. 
If V(®)(S)< oo, then ® is called a measure of 
bounded variation. | |(S) « oc if and only if 
sup{||®(A)|| | 4e €] « æ. Then © is said to 

be bounded. The function V(®)(E) of E is 
finitely additive but ||®||(£) is only subaddi- 
tive: ||@||(A U B) x ||®||(A) + ||®||(B). If ® is a 
vector measure of bounded variation, then 
V(®) is a positive measure. Every vector mea- 
sure is bounded. A completely additive vector 
measure on a ‘finitely additive class £ can 
uniquely be extended to a vector measure on 
the completely additive class © generated by 
£ (Kluvánek). 

Let u be a positive measure and be a 
vector measure. Then we have (A4) 0 as 
u(A)0 if and only if ® vanishes on every A 
with (A) —O0. Then € is said to be absolutely 
continuous with respect to u. For every vector 
measure € there is a measure u such that 
[1(4)—0 as u(A)0 and that O x n(A) « 
I|(A) (Bartle, Dunford, and Schwartz). As a 
set function, the Bochner integral is a vector 
measure of bounded variation and the Pettis 
integral is a bounded vector measure. Both 
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are absolutely continuous with respect to the 
integrating measure. Let X be L,(0, 1) for 

1 €px oo, and define P(E) for a Lebesgue 
measurable set E to be the characteristic func- 
tion of E. If p— 1, ® is a vector measure of 
bounded variation. If 1 « p « oc, is a bounded 
vector measure, but it is not of bounded varia- 
tion on any set E with u(E) 0. If p= œ, then 
® is no longer completely additive. These vec- 
tor measures are absolutely continuous with 
respect to the Lebesgue measure, but they 
cannot be represented as the Bochner integral 
or the Pettis integral. 

Let P be a vector measure on Š. An G- 
measurable numerical function f(s) is said to 
be ®-integrable if there exists a sequence of 
simple functions f,(s) such that f,(s)— f(s) a.e. 
and that for each Ee G, [,-f,(s)d® converges 
in the norm of X. Then the limit is indepen- 
dent of the choice of f,. It is called the Bartle- 
Dunford-Schwartz integral and is denoted 
by fg f(s)d®. Lebesgue's convergence theo- 
rem holds for this integral. If is absolutely 
continuous with respect to the measure p, 
then every Tel. (ul is -integrable, and the 
operator that maps f to fsfdo is continuous 
with respect to the weak* topology in Lo (44) 
and the weak topology of X. Hence it is a 
*weakly compact operator. In particular, the 
range of a vector measure is relatively compact 
in the weak topology [7]. If is the vector 
measure of the Pettis integral of a vector- 
valued function x(s), then the above integral is 
equal to the Pettis integral of f(s)x(s). 

A vector measure ® is said to be nonatomic 
if for each set A with P(A) <0 there is a subset 
B of A such that (B) 40 and ®(A B) Z0. If X 
is fintte-dimensional, then the range of a non- 
atomic vector measure is a compact convex set 
(Lyapunov convexity theorem). This has been 
generalized to the infinite-dimensional case in 
many ways, but the conclusion does not hold 
in the original form (— Kluvanek and G. 
Knowles [15]; [10]). 


H. The Radon-Nikodym Theorem 


As the above examples show, the fRadon- 
Nikodym theorem does not hold for vector 
measures in the original form. From 1967 

to 1971, M. Metivier, M. A. Rieffel, and S. 
Moedomo and Uhl improved the classical 
result of Phillips (1943) and proved the follow- 
ing theorem. 

Radon-Nikodym theorem for vector mea- 
sures. The following conditions are equivalent 
for u-absolutely continuous vector measures b 
defined on a finite measure space (S, S, uy: (1) 
There is a Pettis integrable measurable func- 
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tion x(s) such that 


san) x(s)dy. 


A 
(ii) For each £20 there is an Ee G such that 
vu E) « e and such that {®(A)/u(A)| Ae G, 
A c Ej is relatively compact. (iii) For each Ee 
© with u(E) 0 there is a subset F of E with 
u(F)>0 such that {@(A)/u(A)| AES, ACF} 
is relatively weakly compact. Then ® is of 
bounded variation if and only if x(s) is Boch- 
ner integrable. 

On the other hand, since Birkhoff and 
Gel'fand it has been known that for special 
Banach spaces X every p-absolutely continu- 
ous vector measure of bounded variation 
with values in X can be represented as a Boch- 
ner integral with respect to u. Such spaces are 
said to have the Radon-Nikodym property. 
Separable dual spaces (Gel'fand, Pettis; Dun- 
ford and Pettis), reflexive spaces (Gel'fand, 
Pettis, Phillips), and /,(Q), Q arbitrary, etc., 
have the Radon-Nikodym property, while 
L (0, 1) (Bochner), co (J. A. Clarkson), L,(Q) 
on a nonatomic Q (Clarkson, Gel'fand), and 
C(Q) on an infinite compact Hausdorff space 
Q, etc., do not. Gel'fand proved that L,(0, 1) 
(and cj) is not a dual by means of this fact. 
From 1967 to 1974, Riefell, H. B. Maynard, 
R. E. Huff, and W. J. Davis and R. P. Phelps 
succeeded in characterizing geometrically the 
Banach spaces with the Radon-Nikodym 
property. We know today that the following 
conditions for Banach spaces X are equivalent 
[10]: (1) X has the Radon-Nikodym property. 
(ii) Every separable closed linear subspace of 
X has the Radon-Nikodym property. (iii) 
Every function f:[0, 1] X of bounded 
variation is (strongly or weakly) differentiable 
a.e. (iv) For any finite measure space (S, S, ul 
and bounded linear operator T: L,(S)9 X, 
there is an essentially bounded measurable 
function x(s) with values in X such that 


=| f()x()du fe L,(S). 
s 


(v) Each nonvoid bounded closed convex set K 
in X is the tclosed convex hull of the set of its 
strongly exposed points, where a point xye K 
is called a strongly exposed point of K if there 
is an x'e X’ such that (xo, x> » (x, x’) for all 
xe K {xo} and that any sequence x,c K with 
limx,, x> = (xo, x» converges to x, strongly. 
A Banach space X is said to have the Krein- 
MiPman property if each bounded closed 
convex set in X is the closed convex hull of its 
"extreme points. A Banach space X with the 
Radon-Nikodym property has the Krein- 
Mil'man property (J. Lindenstrauss). If X isa 
dual space, then the converse holds (Huff and 
P. D. Morris). A Banach space with the Krein- 
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Mil'man property clearly has no closed linear 
space isomorphic to co, but there are Banach 
spaces that do not contain c, and do not have 
the Krein-Mil'man property. The dual X of 

a Banach space Y has the Radon-Nikodym 
property if and only if the dual of every sepa- 
rable closed linear subspace of Y is separable 
(Uhl, C. Stegall). 


I. Integrals of Multivalued Vector Functions 


Let T (s) be a multivalued function defined on 
a o-finite complete measure space (S, Š, ul with 
values that are nonempty closed subsets of a 
separable Banach space X. The inverse image 
of a subset E of X under T (s) is, by definition, 
the set of all s such that l'(s)'1 E z Ø. T (s) is 
said to be measurable or strongly measurable if 
the inverse image of each open set in X under 
I (s) belongs to S. Let B(X) be the tBorel field 
of X, and G x B(X) be the product completely 
additive class, that is, the smallest completely 
additive class containing all direct products 

A x B of Ae G and Be B(X). Then the mea- 
surability of F(s) is equivalent to each of the 
following: (i) The graph {(s, x)|xeI'(s, sec] 
of T (s) belongs to S x B(X). (ii) The inverse 
image of every Borel set in X under T (s) be- 
longs to G. (iii) For each xe X, the distance 
d(x, '(s)) 2 pf! lx — y] | yeT'(s)) between x 

and F(s) is measurable as a function on S. 

A measurable function x(s) on S with values 
in X is called a measurable selection of I (s) if 
x(s) is in T (s) for all s. (X being separable, we 
need not discriminate between strong and 
weak measurability.) The measurability of 
I (s) is also equivalent to the following impor- 
tant statement on the existence of measurable 
selections of I (s): (iv) There are a countable 
number of measurable selections {x,(s)} of T (s) 
such that the closure of the set (x,(s)| n— 

1,2, ... } coincides with T (s) for all se S. T (s) 

is said to be scalarly measurable or weakly 
measurable if the support function ó'(x', '(s))= 
sup(x, x» |xeT(s)] is measurable on S for 
all x'e X’. The strong measurability of F (s) 
clearly implies the weak one. If the values of 
I'(s) are nonempty weakly compact convex 
sets, then the measurabilities are equivalent. 
Hereafter we shall assume that I (s) takes the 
values in the weakly compact convex sets. If 
the support function 6’(x’, '(s)) is integrable on 
S for all ver, then T (s) is said to be scalarly 
integrable. Then the scalar integral of I (s) is 
defined to be the set in X" of all scalar inte- 
grals of its measurable selections, i.e., 


| reda- | | x(s)du 
S S 


x(s) is a measurable 





selection of I op 
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If Viel =sup {|x| | xeT(s)) is integrable, 
then every measurable selection is Bochner 
integrable and the integral [T (s) dj becomes 
a nonempty weakly compact convex set in 

X. When the values of I (s) are nonempty 
compact convex sets, there is another method, 
by G. Debreu, of defining the integral. Let 

€ be the class of all nonempty compact con- 
vex sets in X and 6 be the Hausdorff met- 

ric, i.e., for K, and K;e € define 6(K,,K,)= 
max [supíd(x, K,)|xeK,}, sup{d(x, K,)|xe 
Kj]. Further, for K,, K,e2 and «>0 define 
the sum and the nonnegative scalar multiple 
by K,- K; (x, x;|xi € Ki, x5 € K5] and 
a-K,={ax|xeK,}, respectively. Then £L en- 
dowed with the Hausdorff metric and the 
above addition and scalar multiplication is 
isometrically embedded in a closed convex 
cone in a separable Banach space Y by the 
Rádsróm embedding theorem (Proc. Amer. 
Math. Soc., 3 (1952)). Let o be this isometry. 
Then the (strong) measurability and the (strong) 
integrability of F (s) are defined by the measur- 
ability and the Bochner integrability of the Y- 
valued function ol Lei, respectively, and its 
(strong) integral as the inverse image of the 
Bochner integral of ol (s)) under o: 


| TI()du-o' (| orina). 
5 S 


This definition of integral for strongly measur- 
able I (s) is shown to be compatible with that 
mentioned before. It is clear by the definition 
that the integral value in this case is a non- 
empty compact convex set and that most prop- 
erties of Bochner integrals also hold for this 
integral. 
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François Viète (1540—-December 13, 1603) was 
born in Fontenay-le-Comte, Poitou, in western 
France. He served under Henri IV, first as a 
lawyer and later as a political advisor. His 
mathematics was done in his leisure time. He 
used symbols for known variables for the first 
time and established the methodology and 
principles of symbolic algebra. He also sys- 
tematized the algebra of the time and used it 
as a method of discovery. He is often called the 
father of algebra. He improved the methods of 
solving equations of the third and fourth de- 
grees obtained by G. Cardano and L. Ferrari. 
Realizing that solving the algebraic equation 
of the 45th degree proposed by the Belgian 
mathematician A. van Roomen can be reduced 
to searching for sin(«/45) knowing sin x, he 
was able to solve it almost immediately. How- 
ever, he would not acknowledge negative roots 
and refused to add terms of different degrees 
because of his belief in the Greek principle of 
homogeneity of magnitudes. He also contri- 
buted to trigonometry and represented the 
number z as an infinite product. 
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Studien zur Gesch. Math., (B) 3 (1934), 18— 
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John von Neumann (December 28, 1903- 
February 8, 1957) was born in Budapest, 
Hungary, the son of a banker. By the time he 
graduated from the university there in 1921, he 
had already published a paper with M. Fekete. 
He was later influenced by H. Weyl and E. 
Schmidt at the universities of Zürich and 
Berlin, respectively, and he became a lecturer at 
the universities of Berlin and Hamburg. He 
moved to the United States in 1930 and in 
1933 became professor at the Institute for 
Advanced Study at Princeton. In 1954 he was 
appointed a member of the US Atomic Energy 
Commission. The fields in which he was first 
interested were tset theory, theory of 'func- 
tions of real variables, and tfoundations of 
mathematics. He made important contribu- 
tions to the axiomatization of set theory. At 
the same time, however, he was deeply inter- 
ested in theoretical physics, especially in the 
mathematical foundations of quantum me- 
chanics. From this field, he was led into re- 
search on the theory of *Hilbert spaces, and 

he obtained basic results in the theory of *oper- 
ator rings of Hilbert spaces. To extend the the- 
ory of operator rings, he introduced tcontinu- 
ous geometry. Among his many famous works 
are the theory of *almost periodic functions 

on a group and the solving of !Hilbert's fifth 
problem for compact groups. In his later years, 
he contributed to *game theory and to the 
design of computers, thus playing a major role 
in all fields of applied mathematics. 
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A disturbance originating at a point ina 
medium and propagating at a finite speed in 
the medium is called a wave. For example, a 
sound wave propagates a change of density or 
stress in a gas, liquid, or solid. A wave in an 
elastic solid body is called an elastic wave. 
Surface waves appear near the surface of a 
medium, such as water or the earth. When 
electromagnetic disturbances are propagated 
in a gas, liquid, or solid or in a vacuum, they 
are called electromagnetic waves. Light is a 
kind of electromagnetic wave. According to 
*general relativity theory, gravitational action 
can also be propagated as a wave. 

It many cases waves can be described by the 
wave equation: 


e "a zx) 


ox? 0y? Oz 
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Here t is time, x, y, z are the Cartesian coordi- 
nates of points in the space, c is the propaga- 
tion velocity, and y represents the state of the 
medium. 

If we take a closed surface surrounding the 
origin of the coordinate system, the state 
dot. t) at the origin at time t can be determined 
by the state at the points on the closed surface 
at time t — r/c, with r the distance of the point 
from the origin. More precisely, we have 


Od (f or 10) 
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Here n is the inward normal at any point of 
the closed surface, and the integral is taken 
over the surface, while the value of the inte- 
grand is taken at time t — r/c. This relation is 
a mathematical representation of Huygens's 
principle, which is valid for the 3-dimensional 
case but does not hold for the 2-dimensional 
case (— 325 Partial Differential Equations of 
Hyperbolic Type). 

A plane wave propagating in the direction 
of a unit vector n can be represented by y= 
F(t—n-r/c), where F is an arbitrary function 
and r(x, y, z) is the position vector. The sim- 
plest case is given by a sine wave (sinusoidal 
wave): i = Asin(ct —k -r +ô). Here A(ampli- 
tude) and 6 (phase constant) are arbitrary con- 
stants, k is in the direction of wave propaga- 
tion and satisfies the relation |k|]c =w. œw is the 
angular frequency, «2/27 the frequency, k the 
wave number vector, |k| the wave number, 27/« 
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the period, and 27/|k| the wavelength. The 
velocity with which the crest of the wave ad- 
vances is equal to w/|k|=c and is called the 
phase velocity. 

A spherical wave radiating frorn the origin 
can generally be represented by 


d \"1 r 
vo) F(a) 


where o, is the tsolid harmonic of order n. 

Waves are not restricted to those governed 
by the wave equation. In general. y is not a 
scalar, but has several components (e.g., Y% may 
be a vector), which satisfy a set of simulta- 
neous differential equations of various kinds. 
Usually they have solutions in the form of 
sinusoidal waves, but the phase velocity c= 
«/|k| is generally a function of the wavelength 
^. Such a wave, called a dispersive wave, has a 
propagation velocity (velocity of propagation 
of the disturbance through the medium) that 
is not equal to the phase velocity. A distur- 
bance of finite extent that can be approxi- 
mately represented by a plane wave is propa- 
gated with a velocity c — Adc/d4, called the 
group velocity. Often there exists a definite 
relationship between the amplitude vector 
A (and the corresponding phase constant ó) 
and wave number vector k, in which case the 
wave is said to be polarized. In particular, 
when A and k are parallel (perpendicular), 
the wave is called a longitudinal (transverse) 
wave. Usually equations governing the wave 
are linear, and therefore superposition of two 
solutions gives a new solution (tprinciple of 
superposition). Superposition of two sinusoi- 
dal waves traveling in opposite directions gives 
rise to a wave whose crests do not move (e.g., 
V = Asinotsink-r). Such a wave is called a 
stationary wave. Since the energy of a wave is 
proportional to the square of y, the energy of 
the resultant wave formed by superposition 
of two waves is not equal to the sum of the 
energies of the component waves. This phe- 
nomenon is called interference. When a wave 
reaches an obstacle it propagates into the 
shadow region of the obstacle, where there is 
formed a special distribution of energy de- 
pendent on the shape and size of the obtacle. 
This phenomenon is called diffraction. 

For aerial sound waves and water waves, 
if the amplitude is so large that the wave 
equation is no longer valid, we are faced with 
*nonlinear problems. For instance, shock 
waves appear in the air when surfaces of dis- 
continuity of density and pressure exist. They 
appear in explosions and for bodies traveling 
at high speeds. Concerning wave mechanics 
dealing with atomic phenomena — 351 Quan- 
tum Mechanics. 
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Karl Weierstrass (October 31, 1815—February 
19, 1897) was born into a Catholic family in 
Ostenfelde, in Westfalen, Germany. From 1834 
to 1838 he studied law at the University of 
Bonn. In 1839 he moved to Münster, where he 
came under the influence of C. Gudermann, 
who was then studying the theory of elliptic 
functions. From this time until 1855, he taught 
in a parochial junior high school; during this 
period he published an important paper on 
the theory of analytic functions. Invited to the 
University of Berlin in 1856, he worked there 
with L. Kronecker and E. E. Kummer. In 
1864, he was appointed to a full professorship, 
which he held until his death. 

His foundation of the theory of analytic 
functions of a complex variable at about the 
same time as Riemann is his most fundamental 
work. In contrast to Riemann, who utilized 
geometric and physical intuition, Weierstrass 
stressed the importance of rigorous analytic 
formulation. Aside from the theory of analytic 
functions, he contributed to the theory of 
functions of real variables by giving examples 
of continuous functions that were nowhere 
differentiable. With his theory of tminimal 
surfaces, he also contributed to geometry. His 
lectures at the University of Berlin drew many 
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listeners, and in his later years he was.a re- 
spected authority in the mathematical world. 
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Hermann Weyl (November 9, 1885- December 
8, 1955) was born in Elmshorn in the state of 
Schleswig-Holstein in Germany. Entering the 
University of Góttingen in 1904, he also au- 
dited courses for a time at the University of 
Münich. In 1908, he obtained his doctorate 
from the University of Góttingen with a paper 
on the theory of integral equations, and by 
1910 he was a lecturer at the same university. 
In 1913, he became a professor at the Federal 
Technological Institute at Zürich; in 1928— 
1929, a visiting professor at Princeton Univer- 
sity; in 1930, a professor at the University of 
Góttingen; and in 1933, a professor at the 
Institute for Advanced Study at Princeton. He 
retired from his professorship there in 1951, 
when he became professor emeritus. He died in 
Zürich in 1955. 

Weyl contributed fresh and fundamental 
works covering all aspects of mathematics and 
theoretical physics. Among the most notable 
are results on problems in 'integral equations, 
*Riemann surfaces, the theory of tDiophantine 
approximation, the representation of groups, 
in particular compact groups and ‘semisimple 
Lie groups (whose structure he elucidated), the 
space-time problem, the introduction of taffine 
connections in differential geometry, fquantum 
mechanics, and the foundations of mathemat- 
ics. In his later years, with his son Joachim he 
studied meromorphic functions. In addition to 
his many mathematical works he left works in 
philosophy, history, and criticism. 
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A. General Remarks 


Let I be an tintegral domain of character- 
istic 0, and p a fixed prime number. For each 
infinite-dimensional vector x =(X9,X1,..-) 

with components in I’, we define its ghost com- 
ponents x'9), x), ... by x — xo, x" 2 Af + 
px?" Tt... 4 p"x,. We define the sum of the 
vectors x and y —(yo, y,....) to be the vector 
with ghost components x! + yO, x? 4 yO), 
..., and their product to be the vector with 
ghost components xy, viii... The sum 
and product are uniquely determined vectors 
with components in I. Writing their first two 
terms explicitly, we have 


x+y 


RH p v, poy 
= Xo+ Yo xı - Y1— } - XoYOó b 
v=1P y 


Xy —(Xgyo. X15 - Yxb L PX), Séch 


In general, it can be proved that the nth 
components o,(x, y) and z,(x, y) of the sum and 
product are polynomials in xo, yo, X4, Y4, ..., 
a, y, Whose coefficients are rational integers. 
With these operations of addition and multi- 
plication, the set of these vectors forms a tcom- 
mutative ring, of which the zero element is 
(0,0, ...) and the unity element is (1,0, ...). 

Let k be a fteld of characteristic p. For vectors 
(£o, £1, ...), and (4),,,...) with components 
in k, we define their sum and product by 

(Sos Sis e) Ulo lis) m C 0, 0)... ) and 
(Cor &1. Ia, fi.) =Ò. T(N), ). Since 
the coefficients of o, and z, are rational in- 
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tegers, these operations are well defined. With 
these operations, the set of such vectors be- 
comes an integral domain W(k) of character- 
istic 0. Elements of W(k) are called Witt vec- 
tors over k. 

If we put V(Zp, či, ...) =(0, čo, 5,, ...) and 
(£9, £4, ... P 2 (£8, £1, ...), we get the formula 
pé — Vč”. (Note that this €? is not the pth 
power of č in W(k) in the usual sense.) There- 
fore, if we put |£| 2 p "for a vector č whose 
first nonzero component ts £,, then this abso- 
lute value | | gives a ‘valuation of W(k). In par- 
ticular, when k is a ‘perfect field, denoting the 
vector (Co, 0, ...) by {čo} we get EE = 
E piti? ' and W(k) is a *complete valu- 
ation ring with respect to this valuation. 
Therefore the "field of quotients of W(k) is a 
complete valuation field of which p is a prime 
element and k is the ‘residue class field. Con- 
versely, let K be a field of characteristic O that 
is complete under a ‘discrete valuation v, o be 
the valuation ring of v, and k be the residue 
class field of v. Assume that k is a perfect field 
of characteristic p. If p is a prime element of v, 
then o = W(K). If v(p) - v(x*) (e> 1) with a 
prime element z of o, we have o = W(Kk) [n]. 
and z is a root of an *Eisenstein polynomial 
X*cra, X* +... +a, (aj€ pW(k), 4. € p? W(K)). 
In this way we can determine explicitly the 
structure of a *p-adic number field ( 257 
Local Fields). 


B. Applications to Abelian p-Extensions and 
Cyclic Algebras of Characteristic 


Next we consider W,(k) - W(k)/V"W(k). The 
elements of W,(k) can be viewed as the n- 
dimensional vectors (čo, ..., £, ,), but their 
laws of composition are defined as in the pre- 
vious section. They are called Witt vectors of 
length n. We define an operator œ by gë = 
&? — C. Using it, we can generalize the theory 
of *Artin-Schreier extensions (— 172 Galois 
Theory) to the case of Abelian extensions of 
exponent p" over a field of characteristic p. 
Indeed, let k be a field of characteristic p and 
č — (£o, ..., 6, ,) an element of W(k). If = 
(Yo, .... 14 1) Is a root of the vector equation 
(9 X — č =0, then the other roots are of the 
form 4 + a(x — (xo, ...,9,.,), & € F,). In partic- 
ular, if Zë ok — (a? —a| xek}, the field K = 
kino, .... 95, ,) 18 a cyclic extension of degree 
p" over k, and conversely, every cyclic extension 
of k of degree p" is obtained in this way. Let 
(1/(2)£ denote the set of all roots of (o X — 

č =0. Then more generally, any finite Abelian 
extension of exponent p" of k can be obtained 
as K «- k((1/o)£| £c H) with a suitable finite 
subgroup H/(oW,(k) of W,(k)/ 9 W,(k), and 
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the Galois group of K/k is isomorphic to 
H/o W,(k). 

Moreover, for a tcyclic extension K = 
k((1/@)B) of exponent p" over k and for 
x€k(a #0), we can define a *cyclic algebra 
(x, B] generated by an element u over K by the 
fundamental relations u” =~, o0 — fl, uüu ^! 

— (0 (1,0, ...,0) (where 0 —(05, ...,0, , uQu ^! 
— Dällt ! ,...,u0, qu TH. and (a, f] is a central 
simple algebra over k. 

Using these results, we can develop the 
structure theory of the "Brauer group of expo- 
nent p" of a ‘field of power series in one vari- 
able with coefficients in a finite field F, (of a 
*field of algebraic functions in one variable 
over F,) exactly as in the case of a p-adic field 
(of an algebraic number field) (E. Witt [1]; 

— 29 Associative Algebras G). 

On the other hand, W,(k) is a commutative 
‘algebraic group over k and is important in the 
theories of algebraic groups and tformal 
groups (— 13 Algebraic Groups). 


References 


[1] E. Witt, Zyklische Kórper und Algebren 
der Charakteristik p vom Grad p", J. Reine 
Angew. Math., 176 (1937), 126-140. 

[2] H. Hasse, Zahlentheorie, Akademie- 
Verlag., 1949. 

[3] N. Jacobson, Lectures in abstract algebra 
HI, Van Nostrand, 1964. 





449 Ref. 
Witt Vectors 





450 A 
Zeta Functions 


450 (V.19) 
Zeta Functions 


A. Introduction 


Since the 19th century, many special functions 
called ¢-functions (zeta functions) have been 
defined and investigated. The four main prob- 
lems concerning ¢-functions are: (1) Methods 
of defining ¢-functions. (2) Investigation of 

the properties of ¢-functions. Generally, ¢- 
functions have the following four properties 

in common: (i) They are meromorphic on the 
whole complex plane; (it) they have *Dirichlet 
series expansions; (tit) they have Euler product 
expansions; and (iv) they satisfy certain func- 
tional equations. Also, it is an important prob- 
lem to find the poles, residues, and zeros of ¢- 
functions. (3) Application to number theory, in 
particular to the theory of decomposition of 
prime ideals in finite extensions of algebraic 
number fields (— 59 Class Field Theory). (4) 
Study of the relations between different č- 
functions. 

Most of the functions called C-functions or 
L-functions have the four properties of prob- 
lem (2). The following is a classification of the 
important types of ¢-functions that are already 
known, which will be discussed later in this 
article: 

(1) The ¢- and L-functions of algebraic num- 
ber fields: the Riemann 4-function, Dirichlet L- 
functions (study of these functions gave im- 
petus to the theory of ¢-functions), Dedekind 
C-functions, Hecke L-functions, Hecke L- 
functions with *Gróssencharakters, Artin L- 
functions, and Weil L-functions. (2) The p-adic 
L-functions related to the works of H. W. 
Leopoldt, T. Kubota, K. Iwasawa, etc. (3) The 
C-functions of quadratic forms: Epstein C- 
functions, ¢-functions of indefinite quadratic 
forms (C. L. Siegel), etc. (4) The č- and L- 
functions of algebras: Hey C-functions and 
the C-functions given by R. Godement, T. 
Tamagawa, etc. (5) The {-functions associated 
with Hecke operators, related to the work of 
E. Hecke, M. Eichler, G. Shimura, H. Jacquet, 
R. P. Langlands, etc. (6) The congruence ¢- 
and L-functions attached to algebraic varieties 
defined over finite fields (E. Artin, A. Weil, A. 
Grothendieck, P. Deligne), č- and L-functions 
of schemes. (7) Hasse C-functions attached to 
the algebraic varieties defined over algebraic 
number fields. (8) The ¢-functions attached to 
discontinuous groups: Selberg C-functions, the 
Eisenstein series defined by A. Selberg, Gode- 
ment, and I. M. Gelfand, etc. (9) Y. Ihara's ¢- 
function related to non-Abelian class field 
theory over a function field over a finite field. 
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(10) ¢-functions associated with prehomoge- 
neous vector spaces (M. Sato, T. Shintani). 


B. The Riemann ¢-Function 


Consider the series 


1 1 1 
C(s)=1 ty ty Passus Ped 
which converges for all real numbers s> 1. It 
was already recognized by L. Euler that ¢(s) 
can also be expressed by a convergent infinite 
product I0 —p ^) !, where p runs over all 
prime numbers (Werke, ser. I, vol. VII, ch. XV, 
8 274). This expansion is called Euler's infinite 
product expansion or simply the Euler product. 
However, Riemann was the first to treat Ziel 
successfully as a function of a complex variable 
s (1859) [R1]; for this reason, it is called the 
Riemann ¢-function. As can be seen from its 
Euler product expansion, ¢(s) is holomorphic 
and has no zeros in the domain Res>1. 
Riemann proved, moreover, that it has an 
analytic continuation to the whole complex 
plane, is meromorphic everywhere, and has 

a unique pole s=1. The functions (s — 1)C(s) 
and c(s) — 1/(s — 1) are integral functions of 

s. This can be seen by considering the integral 
expression 


x (X Senf 
n=] 


From this last formula, we also obtain an 
equality 


c(s)=¢(1 —s), 
where 
&(s) 2 n 97 T(s/2)E(s). 


This equality is called the functional equation 
for the €-function. The residue of £(s) at s=1 is 
l, and around s- 1, 


l 
Eltere: 1]), 


where C is *Euler's constant. This is called the 
Kronecker limit formula for (s). 

The function ¢(s) has no zeros in Resz 1, 
and its only zeros in Res <0 are simple zeros 
at s= —2, —4,..., —2n, .... But čis) has in- 
finitely many zeros in 0< Res < 1, which are 
called the nontrivial zeros. B. Riemann conjec- 
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tured that all nontrivial zeros lie on the line 
Res= 1/2 (1859). This is called the Riemann 
hypothesis, which has been neither proved nor 
disproved (— Section I). 

If N(T) denotes the number of zeros of ((s) 
in the rectangle 0< Res<1,0<Ims<T, we 
have an asymptotic formula 


1+log2z 
2n 


(H. von Mangoldt, 1905). Also, ¢(s) has the 
following infinite product expansion: 


l 
Mul Tlog T T + O(log T) 


1 


(s— nosz" m]! — es, 
r(3+1) : í 


where b is a constant and p runs over all non- 
trivial zeros of ¢(s) (J. Hadamard, 1893). 
Hadamard and C. de La Vallée-Poussin 
proved the tprime number theorem, almost 
simultaneously, by using some properties of 


C(s)(— 123 Distribution of Prime Numbers B). 


The following approximate functional equa- 
tion is important in investigating the values of 


C(s): 


l 
C(s)= Y —- (S). —— 


n«xH ncyH 
*O(x +O Wee) 


where ¢ is *Euler's function, and f(s) = 

p(s) —5), s=o+it, 2xxy=|t|, and the 

approximation is uniform for —h<o<h, 

X k, y k with h and k positive constants 

(G. H. Hardy and J. E. Littlewood, 1921). 
Euler obtained the values of (s) for positive 

even integers s: 


22m-1i n?" B, 
(2m)! 


(m — 1,2, 3, ..., and the B,,, are * Bernoulli 
numbers). The values of Ziel for positive odd 
integers s, however, have not been expressed 
in such a simple form. The values of C(s) for 
negative integers s are given by £(0) — B,(0) 
=—43,¢(1—n)= NA Hed d 
n 
the B,(x) are *Bernoulli polynomials. 
As a slight generalization of Ciel, A. Hurwitz 
(1862) considered 


ml 





where 


, O«axlí. 
= Y uy (n+ F Es 


This ts called the Hurwitz ¢-function. Thus 

C(s, 1) - £(s), and C(s, 1/2) 2 Q* — 1)&(s). This 
function C(s, a) can also be continued analyti- 
cally to the whole complex plane and satisfies 
a certain functional equation. But in general it 
has no Euler product expansion. 
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C. Dirichlet L-Functions 


Let m be a positive integer, and classify all 
rational integers modulo m. The set of all 
classes coprime to m forms a multiplicative 
Abelian group of order h = (m). Let y be a 
"character of this group. Call (n) the residue 
class of n mod m, and put y(n) — y((n)) when 

(n, m) 2 1 and (n) 20 when (n, m) 4 1. Now, the 
function of a complex variable s defined by 





L(s)= _< ER 


is called a Dirichlet L-function. This function 
converges absolutely for Res 1 and has an 
Euler product expansion 


SE I 1—x(p)p * 

If there exist a divisor f of m (f zm) and a 
character y? modulo f such that z(n)— 7°(n) 
for all n with (n, m) — 1, we call y a nonprimitive 
character. Otherwise, y is called a primitive 
character. If y is nonprimitive, there exists such 
a unique primitive 7°. In this situation, the 
divisor f of m associated with y? is called the 
conductor of y (and of x°). We have 


L(s, y) - L(s, lp 4 —x*()p ?). 
pim 


Let y be primitive. If the conductor f — 1, 
then y is a trivial character (y — 1), and L(s) is 
equal to the Riemann ¢-function C(s). On the 
other hand, if f> 1, then L(s) is an entire func- 
tion of s. In particular, if y is a nontrivial 
primitive character, L(1) — L(1, y) is finite and 
nonzero. P. G. L. Dirichlet proved the theorem 
of existence of prime numbers in arithmetic 
progressions using this fact (= 123 Distri- 
bution of Prime Numbers D). 

Lis x) has a functional equation similar to 
that of £(s); namely, if y is a primitive character 
with conductor f and we put 


&(, = Gr)? T (Gs + a)/2)L(s, 2) 


where a 20 for y(—1)21 and a=1 for y(—1)— 
—], then we have 


č(s, x) 2 W(x)&(1 — s, X). 
where 
W(y-(—-iff "iy, t= Y, xq 


nmod f 


(£, — expQi/f)). The latter sum is called the 
Gaussian sum. Note that | W(y)| — 1. 

The values of L(s) for negative integers 
s are given by L(1—m, x)= —B, m/m (m= 
1,2, 3, ...), where the B, mare defined by 





d ene -=$ B, 


u=1 ef!— 
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Moreover, if y( —1)— —1, we have 


L(1, 0- D y (- xG) x) 


= nit(y) Bs. 1» 


and if y(—1)— 1, xy x: 1, we have 


Pos PS 


Pi (—x69logl1 — 7l). 

In certain cases, the functional equation can be 
utilized to obtain the values of L(m, y) from 
those of L(1 —m, y). Actually, if ( 1) 2 1, m= 
2n22,4,6, ..., we have 








(—1) zy 

L(2n, —B, on)s 
L(n,y)— Q es t(x)( 2n) 
and if y(—1)2 —1, m=2n+ 1 =3,5,7,..., we 
have 
L(2n+1,7%) 

= (—1)" Qn 2n+1 
Adel T(X)(— B, 2,1). 


Dirichlet L-functions are important not only 
in the arithmetic of rational number fields but 
also in the arithmetic of quadratic or cyclo- 
tomic fields. 


D. ¢-Functions of Algebraic Number Fields 
(Dedekind ¢-Functions) 


The Riemann ¢-function can be generalized to 
C-functions of algebraic number fields (— 14 
Algebraic Number Fields). Let k be an alge- 
braic number field of degree n, and let a run 
over all integral ideals of k. Consider the se- 
quence £,(s) - 2, N(a) *. This sequence con- 
verges for Res 1 and has an Euler product 
expansion &,(s) - IL,(1 — N(») ?) !, where p 
runs over all prime ideals of k. This function, 
which is continued analytically to the whole 
complex plane as a meromorphic function, is 
called a Dedekind /-function. Its only pole is 

a simple pole at s= 1, with the residue h,x,. 
Here h, is the tclass number of k, and x, = 

2^ t3" R/(w|d|!?), where r, (2r;) is the number 
of isomorphisms of k into the real (complex) 
number field, w is the number of roots of unity 
in k, d is the *discriminant of k, and R is the 
tregulator of k (R. Dedekind, 1877) [D1]. 

The function £,(s) has no zeros in Resz l, 
while in Res <0 it has zeros of order r, at —1, 
—3, —5, ..., zeros of order r, +r, at —2, —4, 
—6, ..., and a zero of order r; - r, — at s— 0. 
All other zeros lie in the open strip 0 « Res « 1, 
which actually contains infinitely many zeros. 
It is conjectured that all these zeros lie on the 
line Res— 1/2 (the Riemann hypothesis for 
Dedekind ¢-functions). To obtain a generali- 
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zation of the functional equation for the 
Riemann ¢(s) to the case of £,(s), we put 


id| s S ry z 
EEN 7 V(sP^6,). 


Then £,(s) 2 £,(1 — s) (Hecke, 1917). If K isa 
Galois extension of k, then £,(s)/C,(s) is an 
integral function (H. Aramata, 1933; R. Brauer, 
1947). 


[J 





E. Hecke L-Functions 


As a generalization of Dirichlet L-functions to 
algebraic number fields, Hecke (1917) defined 
the following L-function L,(s, y): Let k be an 
algebraic number field of finite degree, and let 
m — m[[p, be an tintegral divisor (m the finite 
part, I Tp, the tinfinite part). Consider the 
"ideal class group of k modulo ri and its char- 
acter y (here we put y(a)=0 for (a, n) z 1). 
Then the L-functions are defined by 


)- i x(a Y/N(ay 


[H2], where a runs over all integral ideals of k. 
L,(s, x) is called a Hecke L-function. It con- 
verges for Res> 1 and has an Euler product 
expansion 


elle Ip NET 

Here p runs over all prime ideals of k. If there 
is a divisor f| (as ñ) and a character y° 
modulo f such that 7°(a)= z(a) for all a with 
(a, m) — 1, then y is called nonprimitive: other- 
wise, y is called a primitive character. In gen- 
eral, there exist unique such t and 7°. In this 
situation, f is called the conductor of y. If x is 
primitive and the conductor i is (1), then y is 
a trivial character and L,(s, y) coincides with 
G.S). If y is primitive and y #1, then L,(s, y) 

is an integral function of s, and L,(1, y) 40. 
Utilizing this fact, it can be proved that there 
exist infinitely many prime ideals in each class 
of the ideal class group modulo an integral 
divisor m of k. 

Let y be a primitive character with the con- 
ductor f d be the discriminant of k, o, ko 
be all distinct isomorphisms of k into the real 
number field R, and f be the finite part of f. 
Then if č is an integer of k such that £21 
(mod f), we have 


Egna)... (sgn én)" 


where a,, (m — 1, ...,r;) is either 0 or 1, depend- 
ing on y. By putting 


CS, Y= 
d|N(DV a 


m=i 
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we have the following functional equation for 
the Hecke L-function: 


ex (S. Y= WO =S, x) 


where W(x) is a complex number with ab- 
solute value 1 and the exact value of W(x) is 
given as a Gaussian sum. Just as some prop- 
erties concerning the distribution of prime 
numbers can be proved using the Riemann C- 
function and Dirichlet L-functions, some prop- 
erties concerning the distribution of prime 
ideals can be proved using the Hecke L- 
functions (— 123 Distribution of Prime Num- 
bers F). 

T. Takagi used Hecke L-functions in found- 
ing his !class field theory. In the other direc- 
tion, this theory implies L(1, y) 40 (y #1). 

Let K be a ‘class field over k that corre- 
sponds to an ideal class group H of k with 
index h. By using class field theory, we obtain 
Ck) 2 II, L,(s, x), where the product is over all 
characters y of ideal class groups of k, such 
that y(H)= 1. This formula can be regarded as 
an alternative formulation of the decompo- 
sition theorem of class field theory (— 59 Class 
Field Theory). By taking the residues of both 
sides of the formula at s= 1, we obtain hgkx 
= bell. LA). x). 

In particular, if k — Q (the rational number 
field) and K is a quadratic number field Q(/d) 
(d is the discriminant of K), then we have 


2 (d 
GG) - Q3) Lis), EES 


where (d/n) is the *Kronecker symbol, and we 
put (d/n) 2 0 when (n, d) Z 1. From this, we 
obtain the class number formula for quadratic 
number fields (— 347 Quadratic Fields). A 
similar method is used for computation of 
class numbers of cyclotomic fields K (— 14 
Algebraic Number Fields L). 

In general, the computation of the relative 
class number hy /h, when K/k is an Abelian 
extension is reduced to the evaluation of 
L(1, y). This computation has been made suc- 
cessfully for the following cases (besides for the 
examples in the previous paragraph): k is 
imaginary quadratic and K is the absolute 
class field of k or the class field corresponding 
to tray S(m); k is totally real and K is a totally 
imaginary quadratic extension of k. H. M. 
Stark and T. Shintani made conjectures about 
the values of L(1, y) [S25, S19]. 

Let L(s, x) be a Hecke L-function for the 
character y. Then it follows from the func- 
tional equation that the values of L(s, y) at s 
=0, —!, —2, —3,... are zero if k is not totally 
real. Furthermore, if k is a totally real finite 
algebraic number field, then these values of 
Lis, y) are algebraic numbers (C. L. Siegel, H. 
Klingen, T. Shintani). 
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F. Hecke L-Functions with Gróssencharakters 


E. Hecke (1918, 1920) extended the notion 
of characters by introducing the *Gróssen- 
charakter y and defined L-functions with 
such characters: 





a 
Lis o Ys. 
He also proved the existence of their Euler 
product expansions and showed that they 
satisfy certain functional equations [H3]. 
Moreover, by estimating the sum X yq. Z(P), 
he obtained some results on the distribution of 
prime ideals. 

Later, Iwasawa and J. Tate independently 
gave clearer definitions of the Gróssencharak- 
ter y and L,(s, y) by using harmonic analysis 
on the adele and idele groups of k (— 6 Adeles 
and Ideles) [L3]. 

Let J, be the idele group of k, P, be the 
group of tprincipal ideles, and C, = J, /P, be 
the idele class group. Then a Gróssencharakter 
is a continuous character y of C,, and y in- 
duces a character of J,, which is also denoted 
by y. Let J, — J,, x Jo be the decomposition 
of J, into the infinite part J,, and the finite 
part Jọ. Let U, be the unit group of Jp, and 
for each integral ideal m of k, put U,,9= 
fue U9|uz 1 (mod mJ), so that {U,, 9} forms 
a base for the neighborhood system of 1 in Jo. 
Put Jos (a€eJo|[a, — 1 for all p| mj, and with 
each ae J,, 9, associate an ideal à = [T, p™®, 
where a — (a,) and the ideal in k, generated 
by a, is equal to p». Then the mapping 
a-»à gives a homomorphism of J,, 9 into the 
group G(m)= (à|(a, nt) 2 1], and its kernel is 
contained in U,, 9. Since y is continuous, 
X(Um als 1 for some m. The greatest common 
divisor f of all such ideals m is called the con- 
ductor of y. For each ae J; o, x(a) depends 
only on the ideal à (e G(f)); hence by putting 
7(a)=7(a), we obtain a character 7 of G(f). 
Now put L,(s, y) - 7(@)/N(a)*, where the sum 
is over all integral ideals Ge G(f). This is called 
a Hecke L-function with Gróssencharakter y. 
For y zl, it is an entire function. On the other 
hand, if we restrict y to J,, = R*^ x C*^, then 
for uz (a,, ...,8,, 0, 44, On, 4,,) € J x, we have 


ritr i P ritr a ei 
dor ia feme E (Y 
j=l j=l t=r,+1 [aii 


where e; —0 or 1, eet, 4ER. The numbers e;, 
e, 4; are determined uniquely by y. Putting 


eS. X) 
— (lO. (sey 
ES KIM LI DU 2 


f=r,+1 


ntn +ån/—1 
x I] AË "Dun 
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we have a functional equation 
Wohl — s. x). 


where W(y) is a complex number with ab- 
solute value 1. 

We can express €,(s, y) by an integral form 
on J, as 


ex, X) = 


&(s y= ef oly) V (ry d*r, 
DI 


where V(r) is the *total volume of the idele r, c 
is a constant that depends on the *Haar mea- 
sure d*r of J,, and g(r) is defined by 


p(t)=[] e,GJ. 
D 


Py, (X) 


poss) 


oe 2 E 
mne Uy qns 


l J2 
mI Kg 7l e; 2 0, 


2 > 
i Gr ky =C, 
=— x tie ME en 
2n 
x)= STIAN: e b RA 
aer ee ei p finite. 
=0, x €(bf), 


Hence (bf), ' is the p-component (— 6 Adeles 
and Ideles B) of the ideal (bj) ^! (b is the *dif- 
ferent of k/Q) and 4(x) is an additive character 
of k, defined as follows. Q, is the *p-adic 

field, Z, is the ring of p-adic integers, A, is the 
mapping Q,5Q,/Z,c Q/Z c R/Z, and 4= 
490 Tan, By putting z())-— IT, x, G5) t= 

( ... X, ...), we have 


čs =c] ed x0) V(x) "dx, 
P Jk, 


where p runs over all prime divisors of k, finite 
or infinite. Moreover, with a constant C,, we 
have 


| Pp, (Zp, Ax)¥, Dy {00> *d*x 
kp 


zt 


=C nm" Let, d4uctey2 
D. 


xT(s--/—14;-e)2, ky, ,-R, 


P. 


=C E (27) (s+ G/ -1 Ai Jeiy2) 


Pa 
xFG-cG/-1 Ale, ky, =C, 


| P(X) Zp(X) V(x) “sd*x 
kp 


nt b 712 1 
CNN HOS) ` Sta Nie? 


= C,N((bDU qu): M pif. 


Here c,(y,) is a constant called the local 
Gauss sum, and p(U;, »)! is the volume of 
{uek |u = 1 (mod f);. These integrals over k, 
are ihe I -factors and Euler factors of £,(s, 7). 
according as p is infinite or finite. The func- 


1698 


tional equation is obtained by applying the 
*Poisson summation formula for (x) and its 
*Fourier transform on the adele group A, (— 6. 
Adeles and Ideles). 

Let D, be the connected component of | in 
C,. If z(D,)= 1, the corresponding 7 is a char- 
acter of an ideal class group of k with a con- 
ductor f. Conversely, all such characters can 
be obtained in this manner. 

As stated in Section E, the Hecke L- 
functions with characters (of ideal class 
groups) can be used to describe the decom- 
position law of prime divisors in class field 
theory. However, for L-functions with Gros- 
sencharakter, such arithmetic implications 
have not been found yet, except that in the 
case of Gróssencharakters of A, type, Y. 
Taniyama discovered, following the suggestion 
of A. Weil, that the L-function has a deep 
connection with the arithmetic of a certain 
infinite Abelian extension of k [T2, W7]. In 
particular, when Lí(s, y) is a factor of the 
*Hasse ¢-function of an Abelian variety A with 
*complex multiplication, it describes the arith- 
metic of the field generated by the coordinates 
of the division points of A. 


G. Artin L-Functions 


Let K be a finite Galois extension of an alge- 
braic number field k (of degree n), G — G(K/k) 
be its Galois group, o — A(c) be a matrix 
representation (characteristic 0) of G, and x 
be its character. Let p be a prime ideal of k, 
and define L,(s, y) by 


x (of 
log L,(s Res» 1, 
D= 2. ONG A 
with z(p") —(1/e)3.,. y(o"x), where T is the 


"inertia i of p, | T| 2 e, and o is a *Frobe- 
nius automorphism of p. Then we have 


L,(s, = 


-LY Aloo, 


€ teT 


det(E — A, 3 N(p) »)'!, 


In particular, if T= 
K/k), then 


Ls, x) = 

Now put 

Lis, A K/k) SCH I] L,(s, z) 
p 


{1} (ie, p is tunramified in 


det(E — A(o): N(p) IT. 


Res» 1, 


and call L(s, y, K/k) an Artin L-function [A2]. 
(1) The most important property of L(s, y, 

K/k) is that if K/k is an Abelian extension 

and y is a linear character, it follows from class 

field theory that zip) is the character of the 

ideal class group of k (modulo the ‘conductor 

of K/k) and that the Artin L-function equals 
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a Hecke L-function. This equality is equivalent 
to Artin’s freciprocity law, and in fact Artin 
obtained his reciprocity law after he conjec- 
tured the equality. 

(2)If K'> K >k and K'/kis a Galois exten- 
sion, then L(s, y, K/k) = L(s, y, K’/k). 

(3) If K >Q>k and y ts a character of 
G(K/Q), then L(s, y, K/Q)= L(s, Zy, K/k), where 
Zy Is the character of G(K/k) *induced from y. 

(4) If y, =1, then L(s, y,, K/k) - C,(s). 

(5) L(s, 41 + 32, K/k) = Ls, %1, K/K) 

L(s, y, K/k). 

Conversely, the Artin L-function L(s, y, K/k) 
is characterized by properties (1)-(5). 

(6) If vz ts the *regular representation of G, 
then L(s, Xr, K/k) 2 C,(s); hence 
Cx(s) - G.(S) II L(s, y, Kk) 

Ia 
where y runs over all irreducible characters 
#1 of G. 

(7) Every character of a finite group G can 
be expressed as y — 2m; y, (m;e Z), where each 
Z,, 1s an "induced character from a certain 
linear character y; of an elementary subgroup 
of G (Brauer's theorem). (Here an elementary 
subgroup is a subgroup that ts the direct 
product of a cyclic group and a p-group for 
some prime p.) Hence (3) and (5) imply that an 
Artin L-function is the product of integral 
powers (positive or negative) of Hecke L- 
functions Lo (s, yi): 


Lís, x, K/k) « T] Los, dr: 


Hence an Artin L-function is a univalent 
meromorphic function defined over the whole 
complex plane. Artin made the still open con- 
jecture that if y is irreducible and y z 1, then 
L(s, 7, K/k) is an entire function (Artin's 
conjecture). 

This conjecture holds obviously if all m; are 
nonnegative. Except for such a case, Artin's 
conjecture had no affirmative examples until 
1974, when Deligne and Serre [D9] proved 
that each "new cusp form" of weight 1 gives 
rise to an entire Artin L-function Lis, 7, K/k) 
with 7(1)- 2 and (p) —0 (p is the complex 
conjugation); by this method, some nontrivial 
examples were computed by J. Tate and J. 
Buhler (Lecture notes in math. 654 (1978)). 
Then R. P. Langlands [L5] constructed non- 
trivial examples of Artin's conjecture for cer- 
tain 2-dimensional representations 


Gal(K/k)3e— A(c)e GL(2, C) 


by using ideas of H. Saito and T. Shintani 
[S1, 520]. This method works for all represen- 
tations for which the image of the A(c) in 
PGL(2, C) is the *tetrahedral group. It also 
works for some *octahedral cases, but a new 
idea is needed in the *icosahedral case. 
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(8) Let p, (i9 1, ...,r, +15) be the infinite 
primes of k. Put 
WSs Z Pai K/K) 
—(F(s/2 ((s + 1)/2)? 
for complex yp, ;, 
— [(s/2)9 NT {s 4. 1)/2) 2 gen 
for real p, ;, 


where e € G is the complex conjugation deter- 
mined by a prime factor of p, ; in K. Next we 
introduce the notion of the conductor f, with 
the group character y defined by Artin (J. 
Reine Angew. Math., 164 (1931)). First, for any 
subset m c G, we put y(nt) 2 X, (m); then 

1, is given by 

L7 iG. K/K) - | v^" 


p 


where 


20] 
ftp) - Ltex(— xT); +{p z) x(V) 


+ ip? x(1) - x()] 4 ...]. 


and where V,, V,,..., are the higher tramifi- 
cation groups of prime factors of p in K (in 
lower numbering) and p*'=|V,| (— 14 Alge- 
braic Number Fields I). 

Now put 








m 


quo 


G(s, x, K/k)— ( 
x Il y(s, D Pair K/k) : L(s, X K/k). 
Pai 


Then the functional equation ts written 


CU —5, x, K/k) Wes, x. K/I), |W()| o 1. 


The known proof of this functional equation 
depends on (7) and the functional equations of 
Hecke L-functions discussed in Section E. As 
for the constants W(y), there are significant 
results by B. Dwork, Langlands, and Deligne 
[D6]. 

(9) There are some applications to the 
theory of the distribution of prime ideals. 


H. Weil L-Functions 


Weil defined a new L-function that is a gen- 
eralization of both Artin L-functions and 
Hecke L-functions with Gróssencharakter 
[W5]. Let K be a finite Galois extension of an 
algebraic number field k, let C be the idele 
class group K4/K" of K, and let «€ 
H?(Gal(K/k), C,) be the canonical coho- 
mology class of *class field theory. Then this 
Xy4 determines an extension Wg of Gal(K/k) 
by Ck: 1 > Cg 2 Wy, Gal(K/k)1 (exact), and 
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the transfer induces an isomorphism Wg’, 

25 C,, where ab denotes the topological com- 
mutator quotient. If L is a Galois extension 
of k containing K, then there is a canonical 
homomorphism H ^ Hien. Hence we define 
the Weil group W, for k/k as the *projective 
limit group proj, lim Wx, of the Wy, ,. It is 
obvious that we have a surjective homomor- 
phism q: W,— Gal(k/k) and an isomorphism 
ri C, >We’, where W2 is the maximal Abelian 
Hausdorff quotient of W,. For we W,, let || || 
be the adelic norm of r, ! (w). 

If k, is a tlocal field, then we define the Weil 
group W, for k,/k, by replacing the idele class 
group C, with the multiplicative group K 
in the above definition, where K,, denotes a 
Galois extension of k,. If k, is the completion 
of a finite algebraic number field k at a place v, 
then we have natural homomorphisms k > C, 
and Gal(k,/k,) Gal(k/k). Accordingly, we 
have a homomorphism W, — W, that com- 
mutes with these homomorphisms. 

Let W, be the Weil group of an algebraic 
number field k, and let p: W, GL(V) bea 
continuous representation of W, on a complex 
vector space V. Let v= p be a finite prime of k, 
and let p, be the representation of W, induced 
from p. Let ® be an element of W, such that 
q (4) is the inverse Frobenius element of p in 
Gal(k, /k,), and let J be the subgroup of Wy, 
consisting of elements w such that o(w) be- 
longs to the tinertia group of p in Gal(k,/k,). 
Let V! be the subspace of elements in V fixed 
by p, (1), let Np be the norm of p, and let 


L,(V, s) — det(1 — (Np) *p,(5)| V!) *. 


We can define L,(V, s) for each Archimedean 
prime v also, and let 


LV, s)=]] L(V. s). 


Then this product converges for s in some 
right half-plane and defines a function L(V, s). 
We call L(V,s) the Weil L-function for the 
representation p: W, 5 GL(V). This function 
L(V,s) can be extended to a meromorphic 
function on the complex plane and satisfies the 
functional equation 


L(V,s)- (V, S)L(V*,1— s) 


(T. Tamagawa), where V* is the dual of V, and 
&(V, s) is an exponential function of s of the 
form ab‘ [T6]. 

P. Deligne generalized these results in thc 
following manner: Let W’ be a *group scheme 
over Q which is the *semidirect product of W, 
by the additive group G,, on which W, acts by 
the rule wxw ! = ||w||x. We can define the 
notion of representations of W; and the L- 
functions of them in the natural manner [T6]. 
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I. The Riemann Hypothesis 


As mentioned in Section B, the Riemann 
hypothesis asserts that all zeros of the Rie- 
mann ¢-function in 0 c Res « 1 lie on the line 
Res- 1/2. In his celebrated paper [R1], Rie- 
mann gave six conjectures (including this), 
and assuming these conjectures, proved the 
*prime number theorem: 


n(x)~——~Li(xy= | ——— 
logx 


Here z(x) denotes the number of prime num- 
bers smaller than x. Among his six conjectures, 
all except the Riemann hypothesis have been 
proved (a detailed discussion is given in [L1 ]). 
The prime number theorem was proved inde- 
pendently by Hadamard and de La Vallée- 
Poussin without using the Riemann hypothesis 
(— Section B; 123 Distribution of Prime Num- 
bers B). 

R. S. Lehman showed that there are exactly 
2,500,000 zeros of £(c + it) for which 0 «t < 
170,571.35, all of which lie on the critical : 
line g — 1/2 and are simple (Math. Comp., 20 
(1966)). Later R. P. Brent extended this com- 
putation up to 75,000,000 first zeros (1979). 

Hardy proved that there are infinitely many 
zeros of C(s) on the line Res 1/2 (1914). Fur- 
thermore, A. Selberg [S6] proved that if No (T) 
is the number of zeros of £(s) on the line with 0 
« Ims « T, then No(T)> AT log T (A is a posi- 
tive constant) (1942). Thus if N(T) is the num- 
ber of zeros of ¢(s) in the rectangle 0 « Res « 1, 
0«Ims« f, then liminf, ,,, No(T)/N(T)>0. 

N. Levinson proved lim inf, ,,, No(T)/N(T) 

> 1/3 (Advances in Math., 13 (1974)). If N(T) 
is the number of zeros of C(s) in 1/2— e< Res 
<1/2+¢,0<Ims<T, then lm, N(T)/N(T) 
= 1 for any positive number e (H. Bohr and E. 
Landau, 1914). Bohr studied the distribution 
of the values of ¢(s) in detail and initiated the 
theory of talmost periodic functions (1925). 

D. Hilbert remarked in his lecture at the 
Paris Congress that the Riemann hypothesis 
is equivalent to 


n(x)-2 Li(x) 4 Ox logx) ra 


(H. von Koch, 1901). It is also equivalent to 


u(n) - OUNI Noo, 


Mz 


ll 
Kon 


n 


for any €» 0, where p(n) is the Möbius func- 
tion. Assuming the Riemann hypothesis, we 
get 


1+log2x 


1 
NI) 8 T~ T+ o(log T) 


(Littlewood, 1924). 
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The computation of the zeros of the ¢- 
functions and the L-functions of general alge- 
braic number fields is more difficult, but con- 
jectures similar to the Riemann hypothesis 
have been proposed. 

Weil showed that a necessary and sufficient 
condition for the validity of the Riemann hypo- 
thesis for all Hecke L-functions Lis, y) is that 
a certain ‘distribution on the idele group J, be 
positive definite [W1 (1952b) ]. 

It is not known whether the general ¢- and 
L-functions of algebraic number fields have 
any zeros in the interval (0, 1) on the real axis 
(see the works of A. Selberg and S. Chowla). 
Similar problems are considered for the vari- 
ous ¢-functions given in Sections P, Q, and T. 


J. p-Adic L-Functions 


Let y be a *primitive Dirichlet character with 
conductor f, and let L(s, y) be the *Dirichlet L- 
function for y. Then the values L(1— n, y) of 
L(s, y) at nonpositive integers 1 —n (n— 1,2,...) 
are algebraic numbers (— Section E). Let p 

be a prime number, let Q, be the *p-adic num- 
ber field, and let C, be the completion of the 
algebraic closure Q, of Q,. It is known that 
C, is also algebraically closed. Since Qc Q,, 
we fix an embedding Q c Q, and consider 
(L(1—n, x)) 5-4 as a sequence in C,. 

Let | |, be the extension to C, of the stan- 
dard p-adic valuation of Q,. Let q be p or 4 
according as p Z2 or p —2, and let w be the 
primitive Dirichlet character with conductor q 
satisfying c(n) 2 n (mod q) for any integer n 
prime to p. Then T. Kubota and H. W. Leo- 
poldt proved that there exists a unique func- 
tion L,(s, x) satisfying the conditions [K5]: 


() Lengt ZA Lader net 


(2 4.,-01fy:1and the series 27:5 a,(s 

— 1)" converges for |s— 1], «|g ! p*^?|; 

(3) L,(0» —n x) e (1 xo "p! ")L(1 —n, xo ") 
holds for n2 1,2,3, .... 

The function L,(s, y) satisfying these three 
conditions is called the p-adic L-function for 
the character y. It is easy to see that L,(s, y) is 
identically zero if y(—1)— —1, but L,(s, y) is 
nontrivial if y( —1)— 1. 

Let B, be the Bernoulli number. Then B, 
satisfies the conditions: (1) B,/n is p-integral if 
(p — (ln (von Staudt) and (2) (1/1) B, a (1/(n - p 
— 1) B,,,.., (mod p) holds in this case (Kum- 
mer). The generalization of these results for 
the generalized Bermoulli number B, , was 
obtained by Leopoldt. Since L(1—n, y) = 
— (1/n)B, n such p-integrabilities and con- 
gruences can be naturally interpreted and 
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generalized in terms of the p-adic L-functions 
L(s, y). 

We assume xt —1)— 1. Then L,(0, y) - (1— 
xo ' (p)L(0, yo") and yo !(—1)2 —1. 
Hence we can express the first factor hy 
of the class number of a cyclotomic field 
Q(exp(2zi/N)) as a product of some L,(0, y's. 
By using this fact, K. Iwasawa proved [I7] 
that the p-part pr: of the ffirst factor hy,» 

(N e N) satisfies 


e, —Antup'tv (4, p, veZiAuzO) 


' for any sufficiently large n. Here Iwasawa 


conjectured 4 —0, which was proved by B. 
Ferrero and L. Washington [F1]. Also, we can 
obtain some congruences involving the first 
factor hy of Q(exp(2zi/N)) from this formula. 

Let y be a nontrivial primitive Dirichlet 
character with conductor f, let 


D 
sti: È aler? 


be the *Gaussian sum for y, and let log, be the 
p-adic logarithmic function. Then Leopoldt 
[L6] calculated the value L(1, y) and obtained 


L (1. x) 
=— ( SNE Y x(a)log, (1 —e ?"//), 


As an application of this formula, Leopoldt 
obtained a p-adic *class number formula for 
the maximal real subfield F = Q(cos(2z/N)) of 
Q(exp(2zi/N)): Let Ce, F) be the product of 
the L (s, y) for all primitive Dirichlet characters 
x such that (1) x( —1) — 1 and (2) the conductor 
of y is a divisor of N. We define the p-adic 
regulator R, by replacing the usual log by the 
p-adic logarithmic function log,. Let h be the 
class number of F, m=[F:Q], and let d be the 
discriminant of F. Then the residue of C,(s, F) 
ats=1 is 


mt SE 
x p Ja 

Hence £,(s, F) has a simple pole at s=1 if and 
only if R, #0. In general, for any totally real 
finite algebraic number field F, Leopoldt con- 
jectured that the p-adic regulator R, of F is 
not zero (Leopoldt's conjecture). This conjec- 
ture was proved by J. Ax and A. Brumer for 
the case when F is an Abelian extension of Q 
[A4, B7]. 

By making use of the Stickelberger element, 
Iwasawa gave another proof of the existence 
of the p-adic L-function [17]. In particular, he 
obtained the following result: Let y be a primi- 
tive Dirichlet character with conductor f. Then 
there exists a primitive Dirichlet character 0 
such that the p-part of the conductor of 0 is 
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either 1 or q and such that the conductor and 
the order of y0 ^! are both powers of p. Let o, 
be the ring generated over the ring Z, of p- 
adic integers by the values of 0. Then there 
exists a unique element f(x, 0) of the quotient 
field of o,[[x] ] depending only on 0 and 
satisfying 


L (s, x) 7 2f(C (1 + qo — 1,0), 


where qo is the least common multiple of q and 
the conductor of 0, and C 2 y(1-- qo) !. Fur- 
thermore, Iwasawa proved that f(x, 0) belongs 
to og [x] ] if 0 is not trivial. 

Let P — Q(exp(2zi/q)) and, for any n> 1, let 
P, = Q(exp(2xi/qp")). Let Pa —| J,. , P,. Then 
P is a Galois extension of Q satisfying 
Gal(P,,/Q)=Z, (the multiplicative group of p- 
adic units), and P is the subfield of P /Q cor- 
responding to the subgroup | + qZ, of Z5. 

Let y be a C,-valued primitive Dirichlet 
character such that (1) y( —1)2 —1 and (2) the 
p-part of the conductor f, of y is either 1 or 
q. Let K, be the cyclic extension of Q corre- 
sponding to w by class field theory. Let K = 
Kp: P, K,=K:P,and K, —K-P,.Let A, 
be the p-primary part of the ideal class group 
of K,, let A, A,, (n2 m) be the mapping in- 
duced by the *relative norm N, « and let Xx 
— lim A,. Since each A, is a finite p-group, X, 
is a Z,,- module. Let V, — X, Gs, Cp. and let 


V, = (ve Vx | d(v) - V(0)v for all óeGal(K/Q)]. 


Let qo be the least common multiple of f, and 
q, and let y be the element of Gal(K , /K) 
that corresponds to 


14 q961 - qZ,— Gal(P, /P) 


by the restriction mapping Gal(K ,,/K) 

S Gal(P,, /P). Let f, (x) be the characteristic 
polynomial of y, — 1 acting on V,.. Hence f,(x) 
is an element of o, [x ]. 

We assume that wy! is not trivial. Let 
f(x, ob TU be as before. Then f(x, e !) is an 
element of o, [ [x] ]. Iwasawa conjectured that 
f, (x) and f(x, ew ~') coincide up to a unit of 
o [[x]] (Iwasawa's main conjecture). This con- 
jecture was proved recently by B. Mazur and 
A. Wiles in the case where y is a power of c. 

Let F be a totally real finite algebraic num- 
ber field, let K be a totally real Abelian exten- 
sion of F, and let y be a character of Gal(K/F). 
Let L(s, y) be the tArtin L-function for z Then 
we can construct the p-adic analog L,(s, 7) of 
L(s, y) (J.-P. Serre, J. Coates, W. Sinnott, P. 
Deligne, K. Ribet, P. Cassou-Nogues). But, 
at present, we have no formula for L,(1, 7). 
Coates generalized Iwasawa's main conjecture 
to this case, but it has not yet been proven. 

P-adic L-functions have been defined in 
some other cases (e.g. — [K3, M1, M3]). 
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K. ¢-Functions of Quadratic Forms 


Dirichlet defined a Dirichlet series associated 
with a binary quadratic form and also consid- 
ered a sum of such Dirichlet sertes extended 
over all classes of binary quadratic forms with 
a given discriminant D, which is actually 
equivalent to the Dedekind ¢-function of a 
quadratic field. Dirichlet obtained a formula 
for the class numbers of binary quadratic 
forms. The formula is interpreted nowadays as 
a formula for the class numbers of quadratic 
fields in the narrow sense. 

According as the binary quadratic form 
is definite or indefinite, we apply different 
methods to obtain its class number. 

Epstein ¢-functions: P. Epstein generalized 
the definition of the C-function of a positive 
definite binary quadratic form to the case of n 
variables (Math. Ann., 56 (1903), 63 (1907)). Let 
V be a real vector space of dimension m with a 
positive definite quadratic form Q. Let M bea 
*lattice in V, and put 


‘ols M) Y, e Rage 
s,M)= 5 ——, es»—. 
ii Ost 2 


This series is absolutely convergent in Res 
m/2, and 


-1 
lim (7) Cols, My- bM) "ert? 


Sam? 
D(M) —det|Q(x;, x)|; 


where x,,..., Xm Is a basis of M and Q(x, y) 
=(O(x + y)— Q(x) — Q(y))/2. If the Q(x) 

(xe M, x 40) are all positive integers, we can 
write 





where a(n) is the number of distinct xe M with 
Q(x)— n. In general, let x,,..., Xm be a basis of 
the lattice M and x¥,...,x* be its dual basis 
(Q(x; xf) 2 0). Call M*= 2; x* the dual 
lattice of M. If we consider the 3-series (‘theta 
series) 

Dain, M)= 3. exp(—2uQ(x) (Reu>0), 


xeM 
then 
Jolu, M)=(u "P D(M) 1?) o(u -1 M*). 


With &o(s, M)=x "°F (s)o(s, M), the displayed 
equality leads to the functional equation 


m \ 
&o(s, M)=D(M) '?- lan |: 
2 / 
In general, &o(s, M) has no Euler product 
expansion. 
Consider the case where M =} Zx; (x;= 
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(0, ...,0,1,0,...,0), Ox) En, u2, for x= 
(u,, ..., Um). If we put £,(s) - Cols, M). L(s)— 

aO C-A/n)n 5, then we have 
Gs) 26s), 
Cals) - AC(5): L(s)- 4 

x (the Dedekind ¢-function of Q(/ —1)), 

Ca(s) = 8(1—2* "Visite 
Ce(s)= —4(C(s)L(s — 2) — AC(s — 2) L(s)), 
Gg(s)= 16(1 —2' 7E 2* )0(s)c(s — 3), 
S10(8) 7 (4/5) (C) Ls — 4) + 4° t(s — 4) L(5)) 


4 


Hu 


nez 1 (uuy 
DEA 
Eral) =c, 2 Eleis 526—263) 


eso / AG), 


where ~{,/A(t)} is the Dirichlet series corre- 
sponding to ./A(t) by the *Mellin transform 
and A(:) 2 z(ILi, (1 —2")}?* with z « e?" 
¢m(S) has zeros on the line Re s= ø = m/4, given 
explicitly for m —4, 8 as follows: 


m=4: s=1+Ini/log2, /=1,2,..., 
m=8: s=2+(i/log2)(Qintarctan ,/ 15), 
Ie +1,.... 


Regarding the Epstein ¢-function of binary 
quadratic forms 


sols) E Q(m, n) *, 
with 
Q(x, y) 2 ax? - bxy - cy?, 
a,b, ceR,a»0,c»0, A —-4ac — b? » 0, 
we have the Chowla-Selberg formula (1949): 


as! fI 
(ols)— | 2£239a * — * 
Sols) ( CU sja L'(s)A* 1? 


"Os prísl 
x C(2s ul, ;)) 


SCH nab 
x Y nt! 9,_5,(n)cos— 
n=1 a 





^ ës" 32 
ux 
a T(s)A*%? 1/4 


znAi? 


«| oap] oe hd, 
o VUE 


where o, (n) — 3,4, d* and ¢(s) is the Riemann ¢- 
function. By using this formula, we can give 
another proof of the following result of H. 
Heilbronn: Let h(— A) be the class number of 
the imaginary quadratic field with discrimi- 
nant —A. Then h( — A)> x (A x). 
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The following generalization of thts result 
was obtained by C. L. Siegel [S22]: Let k be a 
fixed finite algebraic number field. Let K be a 
finite Galois extension of k, and let d 2 d(K), 
hz h(K), and R 2 R(K) be the discriminant of 
K, the class number of K, and the regulator of 
K, respectively. We assume that K runs over 
extensions of k such that ( K:K]/logd —0; then 
we have 


log(hR) — log dl. 


Siegel (-functions of indefinite quadratic 
forms: Siegel defined and investigated some ¢- 
functions attached to nondegenerate indefinite 
quadratic forms, which are also meromorphic 
on the whole complex plane and satisfy certain 
functional equations [$24]. 

The case of quadratic forms with irrational 
algebraic coefficients was treated by Tama- 
gawa and K. G. Ramanathan. 


L. ¢-Functions of Algebras 


K. Hey defined the ¢-function of a 'simple 
algebra A over the rational number field Q 
(M. Deuring [D10]) (— 27 Arithmetic of As- 
sociative Algebras). Consider an arbitrary 
*maximal order o of A, and let 


1 
Lull A 


, Res>l, 
N(af 


with the summation taken over all left integral 
ideals a of o. Then C, is independent of the 
choice of a maximal order v. Let k be the 
‘center of A, and put [A:k]=n7’. First, C, is 
decomposed into Euler's infinite product 
expansion ¢4(s)= IT, Z,(s) (p runs over the 
prime ideals of k). For p not dividing the dis- 
criminant D of A, Z,(s) coincides with the p- 
component of []"=} C, (ns — j). Hence ¢,(s) 
coincides with IT; C, (ns —j) up to a product 
of p-factors for plp which are explicit rational 
functions of N(p) ™. 

Moreover, if A ts the total matrix algebra of 
degree r over the division algebra D, then we 
have C4(s) 2 TI A 6e(rs —j), and Ces) satisfies a 
functional equation similar to that of c, (s) 
(Hey). Also, €,(s) is meromorphic over the 
whole complex plane, and at s— 1, (n — 1)/ 

n, ..., in, tt has poles of order 1. Using analy- 
tic methods, M. Zorn (1931) showed that the 
simple algebra A with center k such that A, is 
a matrix algebra over k, for every finite or in- 
finite prime divisor p of k is itself a matrix 
algebra over k (— 27 Arithmetic of Associative 
Algebras D). A purely algebraic proof of this 
was given by Brauer, H. Hasse, and E. 
Noether. G. Fujisaki (1958) gave another proof 
using the [wasawa-Tate method. As a direct 
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application of the ¢-function, the computation 
of the residue at s= 1 of ¢, leads to the formula 
containing the class number of maximal order 
D. 

Godement defined the ¢-function of fairly 
general algebras [G1], and Tamagawa inves- 
tigated in detail the explicit C-functions of 
division algebras, deriving their functional 
equations [T1 ]. 

Let Á— IT, A, be the adele ring of A, and let 
G — I, 6, be the idele group (of A). We take a 
maximal order D, of A, and a maximal com- 
pact subgroup U, of G,. Let œw, be a tzonal 
spherical function of G, with respect to U,; 
that is, œ, is a function in G, and satisfies 


@,(ugv)=@,(g) (u, Ge Del, 
| w,(guh) du= Wy(g)@, (A). 
Up 


In addition, we define the weight function o, 
on A, by 


the characteristic function of D, 
when p is finite, 
p, (x) = 
exp(—2T,(xx*)) 
when p is infinite, 


where T, is the treduced trace of A,/R and * is 
a positive tinvolution. Tamagawa gave an 
explicit form of the local C-function with the 
character w, defined by 


Cs, 2) = | e, (og IN, (g); dg, 
Gp 


where N, is the treduced norm of A,/k,, and 
| |, is the valuation of k,. Then œ= [T, c, is 
the zonal spherical function of G with respect 
to [TU, — U. In particular, if o is a positive 
definite zonal spherical function belonging to 
the spectrum of the discrete subgroup I = A* 
— (all the invertible elements of Aj of G, then 
the Tamagawa C-function with character c is 
given by 


soo | p(go(g")ligll*dg, 


where (g)=I1¢,(g,) and | || is the volume 
of the element g of G. When A is a division 
algebra, Cie, c») is analytically continued to a 
meromorphic function over the whole complex 
plane and satisfies the functional equation. 
The Tamagawa ¢-function may also be consid- 
ered as one type of C-function of the Hecke 
operator. When A is an indefinite quaternion 
algebra over a totally real algebraic number 
field di. the groups of units of various orders of 
A operate discontinuously on the product of 
complex upper half-planes. Thus the spaces of 
holomorphic forms are naturally associated 
with A. The investigation of C-functions asso- 
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ciated with these holomorphic automorphic 
forms was initiated by M. Eichler and ex- 
tended by G. Shimura, H. Shimizu, and others. 
Eichler investigated the case © =Q, and 
Shimura and Shimizu investigated the case for 
an arbitrary totally real field ® by defining 
general holomorphic automorphic forms, 
Hecke operators, and corresponding ¢- 
functions. The functional equations of these ¢- 
functions were proved by Shimizu. Shimizu 
generalized Eichler’s work and found relations 
among ¢-functions of orders of various quater- 
nion algebras belonging to different discrimi- 
nants and levels [S10]. For the related results, 
see, e.g., the work of K. Doi and H. Naganuma 
[D12]. 


M. ¢-Functions Defined by Hecke Operators 


The Z-functions of algebraic number fields, 
algebras, or quadratic forms, and the L- 
functions are all defined by Dirichlet series, are 
analytically continued to univalent functions 
on the complex plane, and satisfy functional 
equations. One problem is to characterize the 
functions having such properties. 

(1) H. Hamburger (1921—1922) characterized 
the Riemann ¢-function (up to constant multi- 
ples) by the following three properties: (1) It 
can be expanded as £(s) 2 X, a,/n' (Res»»0); 
(1i) it is holomorphic on the complex plane 
except as s= 1, and (s— 1)C(s) is an entire func- 
tion of finite tgenus; (iii) G(s) 2 G(1 —s), where 
G(s) x *?T(s/2)L(s). 

(2) E. Hecke’s theory [H4]: Fixing 4-0, k> 
0, y= +1, and putting 


R(s) - Qn/A) *T(s))o(s) 


for an analytic function o(s), we make the 
following three assumptions: (i) (s — k) (s) is 
an entire function of finite genus; (ii) R(s)= 
y R(k — sy; (iii) p(s) can be expanded as (s) - 
Lr, a,/n> (Res 0$). Then we call p(s) a 
function belonging to the sign (A, k, y). 

The functions £(2s), L(2s), and L(2s — 1) 
satisfy assumptions (1)- (iii), where L may be 
either a Dirichlet L-function, an L-function 
with Gróssencharakter of an imaginary qua- 
dratic field, or an L-function with class charac- 
ter of a real quadratic form whose I -factors 
are of the form I'(s/2)T'((s + 1)/2) «1 (s). If (s) 
belongs to the sign (A, k, y), then n ^ p(s) be- 
longs to the sign (nA, k, y). To each Dirichlet 
series (s) — ÈE; a,/n? with the sign (4, k, y), 
we attach the series Til ge + E£] ae?" 
where 


ao — yQn/4) *T(K)Res,-.(9(s)) 
— y Res,-,(R(s)). 


This correspondence ¢(s)— f(t) may also be 


1705 


realized by the *Mellin transform 


SËNN ( y a yn dy 
0 n=1 
al (fy) —ag)y* ' dy, 
0 


1 
um sel R(s)y "ds, 


Res=a 

In this case, (i) f(t) is holomorphic in the 
upper half-plane and f(t + A) = f(t), (ii) 
f(-1/0)(— io =p f(a), and (iii) f(x iy) 
O( y*?**) (y— +0) uniformly for all x. 

Conversely, the Dirichlet series (s) — 
$2 1a,n ? formed by the transformation in 
the previous paragraph from f(t) satisfying (i)— 
(iii) belongs to the sign (A, k, y). We also say 
that the function f(t) belongs to the sign 
(4 k, y). 

If k is an even integer, then the functions f(z) 
belonging to (1, k,(—1)*”) are the tmodular 
forms of level 1 and weight k. A necessary and 
sufficient condition for a function ọ(s) belong- 
ing to (1, ,(—1)*?) to have an Euler product is 
that the corresponding modular form f(z) be a 
simultaneous eigenfunction of the ring formed 
by the "Hecke operators T, (n — 1,2,...). In this 
case, the coefficient a, of o(s) È a,n " coin- 
cides with the eigenvalue of T,. Namely, if f| T, 
— t, f, we have q(s) 2 a (2255, tan °), and this 
is decomposed into the Euler product o(s) — 

a IL, —t,p ^ pF! 77?)7. We call ¢(s)/a; a 
¢-function defined by Hecke operators (Hecke 
[H5]). For example, ¢(s):¢(s—k+ 1) and the 
Ramanujan function 


Ke 


2. t(n)n ^-][ü-«(pp^ 0p 7» 
n= p 

are C-functions defined by Hecke operators. 
Hecke applied the theory of Hecke operators 
to study the group I (N) [H5]; the situation 
is more complicated than the case of I'(1)— 
SL(2, Z). The space of automorphic forms of 
weight k belonging to the tcongruence 
subgroup 


Ty(N)= (e au Z) 


is denoted by 99, (T, (N)). The essential part of 
9t, (T5(N)) is spanned by the functions f(t)= 
Zoe satisfying the conditions: (1) o(s) — 
Zon "bas the Euler product expansion 


e(s-IIa-ap5" 


SIN 





c=0 (mod) 


x [|] d- app +p. 
ptN 
(2) The functional equation R(s) z y R(k — s) 
holds, where R(s) e Qx/./ N) ?T (s)o(s). (3) 
When y is an arbitrary primitive character of 
Z such that the conductor f is coprime to N, 
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then 
Ris, x) -Qu/A/ N f) *T (Y, a,x(n^ 


extends to an entire function satisfying the 
functional equation Ris, y) - oR(k—s, y) (œ| = 
1) (Shimura). Conversely, (2) and (3) charac- 
terize the Dirichlet series (s) corresponding to 
f(t)EM,(To(N)) (Weil [W1 (1967a)]). 
Considering the correspondence f(t) = 
Xa,q"-—qo(s)—-Xa,n ? not as a Mellin trans- 
formation but rather as a correspondence 
effected through Hecke operators, we can 
derive the ¢-function defined by Hecke op- 
erators. When the Hecke operator T, is de- 
fined with respect to a discontinuous group 
I and we have a representation space Wit of the 
Hecke operator ring X, we denote the matrix 
of the operation of T; e £^ on W by (T,)= 
(C.J and call the matrix-valued function 
Y (Ty)gn ^? the C-function defined by Hecke 
operators. The equation o(s)—- 3a,n *isa 
specific instance of the correspondence in the 
first sentence, where T =T (N), MWM, (E5(N)), 
dim Nt= 1. One advantage of this definition is 
that it may be applied whenever the concept of 
Hecke operators can be defined with respect to 
the group T (for instance, even for the Fuchs- 
ian group without a fcusp). Thus when F is a 
Fuchsian group given by the unit group ofa 
quaternion algebra over the rational num- 
ber field Q and Mi is the space of automor- 
phic forms with respect to T, the -function 
X(T,n "te defined (Eichler). Moreover, by 
using its integral expression over the idele 
group Jẹ of 6, we can obtain its functional 
equation following the Iwasawa-Tate method 
(Shimura). Furthermore, by algebrogeometric 
consideration of 7,, it can be shown that 


(elt — (de? (en? 
—C(s)c(s-— I)det (rta —(Tye,p ^ 
P 


+ (Ry)g, p! Es 3) 


coincides (up to a trivial factor) with the Hasse 
C-function of some model of the Riemann 
surface defined by I' when Mis the space S, 
of all tcusp forms of weight 2 (Eichler [E1], 
Shimura [S12]). 

The algebrogeometric meaning of 
det(3(T,)s, n *), when W is the space &, of all 
cusp forms of weight k, has been made clear for 
the case where T is obtained from D (N), T(N), 
and the quaternion algebra (M. Kuga, M. 
Sato, Shimura, Y. Ihara). From these facts, it 
becomes possible to express (T,)¢,, the decom- 
position of the prime number p in some type of 
Galois extension (Shimura [S14], Kuga), in 
terms of Hecke operators. These works gave 
the first examples of non-Abelian class field 
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theory. Note that this type of ¢-function may 
be regarded as the analog (or generalization) 
of L-functions of algebraic number fields, as 
can be seen from the comparison in Table I. 


Table 1 
Algebrai 

gebraic Ideal 
number Character x ZEx(nn ^ 
S group 
field | i 
Algebraic Hecke Representation 

» ei 

group ring space Wt ET, un 


As for special values of ¢-functions defined 
by Hecke operators, the following fact is 
known: Let f(t)= X a,q" eWt,(SLQ, Z)) bea 
common eigenfunction of the Hecke operators, 
and let o(s) 2 S a,n ? be the corresponding 
Dirichlet series. Let K, be the field generated 
over the rational number field Q by the coeffi- 
cients a, of f. Then, for any two integers m and 
m' satisfying 0 « m, m' <k and mz m' (mod 2), 
the ratio (R(m): R(m')) of the special values of 





D (s) pies £ 

EE | EHNEN 
(27) 0 

at m and m belongs to the field K}. 

G. Shimura discovered this fact for Rama- 
nujan's function A(t) (J. Math. Soc. Japan, 11 
(1959)), and then Yu. I. Manin generalized it 
to the above case and, by constructing a p-adic 
analog of (s) from it, pointed out the impor- 
tance of such results [M1]. R. M. Damerell 
also used such results to study special values 
of Hecke's L-function with Gróssencharakter 
of an imaginary quadratic field (Acta Arith., 

17 (1970), 19 (1971)). Furthermore, Shimura 
generalized these results to congruence sub- 
groups of SL(2, Z) (Comm. Pure Appl. Math., 
29 (1976)), and to Hilbert modular groups 
(Ann. Math., 102 (1975)). The connection 
between special values of {-functions and the 
periods of integrals has been studied further by 
Shimura, Deligne, and others. 

In addition, in connection with nonholo- 
morphic automorphic forms H. Maass consid- 
ered L-functions of real quadratic ftelds (with 
class characters) having I'(s/2)? or T ((s + 1)/2)? 
as I -factors. Furthermore, T. Kubota studied 
the relation of ¢-functions ¢,(s) of an arbitrary 
algebraic field k or ¢-functions of simple rings 
to (nonanalytic) automorphic forms of several 
variables and considered the reciprocity law 
for the Gaussian sum from a new viewpoint. 


N. L-Functions of Automorphic 
Representations (I) 


R. P. Langlands reconstructed the theory of 
*Hecke operators from the viewpoint of repre- 
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sentation theory. and defined very general L- 
functions. He proposed many conjectures 
about them in [L4], and he and H. Jacquet 
proved most of them in [J1] for the case G 
—GL,. 

First Langlands defined the L-group “G for 
any connected reductive algebraic group G 
defined over a field k in the following manner 
[ B6]. 

There is a canonical bijection between iso- 
morphism classes of connected ‘reductive alge- 
braic groups defined over a fixed algebraically 
closed field k and isomorphism classes of troot 
systems. It is defined by associating to G the 
root data V(G) — (X*(T), o, X, (T), $^), where 
T is a ‘maximal torus of G, X*(T) (X,(T)) the 
group of characters (*1-parameter subgroups) 
of T,  (") the set of roots (coroots) of G with 
respect to T. 

Since the choice of a *Borel subgroup B of 
G containing T'is equivalent to that of a basis 
A of ®, the aforementioned bijection yields 
one between isomorphism classes of triples 
(G, B, T) and isomorphism classes of based 
root data ‘¥o(G) =(X*(T), A, X,(T), A^). There 
is a split exact sequence 


| 29Int Go Aut G— Aut V4(G) 1. 


and this mapping induces a canonical bijection 
Aut V, (G) 2 Aut(G, B, T, (x, 1, A) if X, € G, (xe A) 
are fixed. 

Let G be a connected reductive algebraic 
group defined over k. Let T be a maximal 
torus of G, and let B be a Borel subgroup of G 
containing T. Let V$(G) 2(X*(T), A, X (T), A") 
be as before. Then there is a connected reduc- 
tive algebraic group *G° over C such that 
Yo(G)Y =(X,(T), A’, X*(T), A) corresponds to 
the triple (G9, ^ B9, "T'?), where +B? and LI" 
are a Borel subgroup of *G? and the maxi- 
mal torus of ^B?. For example, (1) if G— GL,, 
then ^G? = GL,(C); (2) if G=Sp,,, then ^G? = 
SO. CH g 

We assume that k is the algebraic closure of 
k and G is defined over k. Then ; e Gal(k/k) 
induces an automorphism of the k-group 
G x ,k. Hence y defines an element of 
Aut(“G°, ^ B9, - T?) because it is a holomorphic 
image of Aut Yo (G x , k) 2 Aut Bal x EI". 
Hence we can define the *semidirect product 
LG — G9 qGal(k/k), and call it the L-group 
of G. 

Let k be a *local field, and let G be a connec- 
ted reductive algebraic group defined over K. 
We identify G with the group of its k-rational 
points. Let W; be the Weil-Deligne group of 
k (— Section H), and let ®(G) be the set of 
homomorphisms o: W;"G over Gal(k/k). Let 
II(G) be the set of infinitesimal equivalence 
classes of irreducible admissible representations 
of G. If k is a non-Archimedean field, then 
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TI(G) is the set consisting of equivalence classes 
of irreducible representations zx: G Aut V on 
complex vector spaces V such that the space 
V* of vectors invariant by K is finite dimen- 
sional for every compact open subgroup K of 
G and such that V={ } V*. where K runs over 
the compact open subgroups of G. If k is an 
Archimedean field, then /7(G) is the set consist- 
ing of equivalence classes of representations z 
of the pair (g, K) of the Lie algebra g of G and 
a maximal compact subgroup K satisfying 
similar conditions [B6]. Then Langlands 
conjectured that we can parametrize /7(G) 

by dich as /7(G) «| J, H(G), Let ne H(G), 

(o €€(G)), and let r be a representation of TC. 
Then we can define the L-function Lis x, r) and 
the e-factor ris, z, r) of xt by 


L(s,m,r)-L(sroq) s(sm,r)—s(sroq,wV) 


where the right-hand sides are those of the 
Weil-Deligne group (— Section H) and i isa 
nontrivial character of k. 

Let G be a connected reductive group over a 
global field & (i.e., an algebraic number field of 
finite degree or an algebraic function field of 
one variable over a finite field), let z be an 
irreducible admissible representation of G,, 
where G, is the group of rational points of G 
over the *adele ring k, of k, and let r bea 
finite-dimensional representation of ^G. Let y 
be a nontrivial character of k, which is trivial 
on k. For any place v of k, let r, be the repre- 
sentation of the L-group of G, — G x, k, in- 
duced by r, and let y, be the additive character 
of k, associated with y. It is known that z is 
decomposed into the tensor product Gi, of 
7, € H(G(k,)) [B6]. Hence we put 


L(s,m, r) - [| LG. m. r,). 


els, m, r) = | | 5,7. Fo). 

The local factor L(s, z,, r,) is in fact defined 
if v is Archimedean, or G is a ‘torus, or q is 
unramified (i.e., G, is quasisplit and splits over 
an unramified extension of k,, and G(o,) is a 
special maximal compact subgroup of G(k,), 
and z, is of class one with respect to G(o,), 
where o, is the integer ring of KL It follows 
that the right-hand side [1 L(s, z,, r,) is defined 
up to a finite number of non-Archimedean 
places v. Furthermore, Langlands proved that 
I] a(s, x. r,) is in fact a finite product, and the 
infinite product [T L(s, z,,7,) converges in some 
right half-plane if z is automorphic ie, if 7 is 
a subquotient of the right regular representa- 
tion of Gy in GÀ G,). It is conjectured that 
L(s, t, r) admits a meromorphic continuation 
to the whole complex plane and satisfies a 
functional equation 


L(s, n, r)=e(s, t, r) L(1 — s, 7i, r) 
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if z is automorphic, where zt is the *contragre- 
dient representation of z. Furthermore, if G — 
GL, and r is the standard representation of 
GL,, then we can construct Lí(s, z, r) and 

ris, t, r) by generalizing the Iwasawa-Tate 
method. We can also show in this case that 
L(s, 1, r) is entire if z is cuspidal. The conjec- 
tures are studied in some other cases [B6]. 


O. L-Functions of Automorphic 
Representations (11) 


A. Weil generalized the theory of "Hecke 
operators and the corresponding L-functions to 
the case of *automorphic forms (for holomor- 
phic and nonholomorphic cases together) of 
GL, over a global field [W9]. Then H. Jacquet 
and Langlands developed a theory from the 
viewpoint of *representation theory [J1, J2]. 
They attached L-functions not to automorphic 
forms but to ‘automorphic representations of 
GL (Kk). 

Let k be a non-Archimedean local field, and 
let o, be the maximal order of k. Let JY, be 
the space of functions on G,=GL,(k) that are 
locally constant and compactly supported. 
Then J£, becomes an algebra with the convol- 
ution product 


er «te | fi (fata! h)dg, 

LG, 

where dg is the *Haar measure of G, that 
assigns | to the maximal compact subgroup 
K,— GL;(v,). Let x be a representation of J£, 
on a complex vector space V. Then we say that 
z is admissible if and only if z satisfies the 
following two conditions: (1) For every v in V, 
there is an f in J£, so that z(f)u =v; (2) Let o; 
(i=1,...,r) be a family of inequivalent irreduc- 
ible finite-dimensional representations of K,, 
and let 


Then é is an idempotent of J4,. We call such a 
č an elementary idempotent of 2, Then for 
every elementary idempotent č of 2. the 
operator z(č) has a finite-dimensional range. If 
z ts an admissible representation of GL,(k) (— 
Section N), then 


an=] f(g)n(g)dg (JEH) 
6, 


gives an admissible representation of 3X, in 
this sense. Furthermore, any admissible repre- 
sentation of J£, can be obtained from an ad- 
missible representation of GL. (K). 

Let k be the real number field. Let #, be the 
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space of infinitely differentiable compactly 
supported functions on G,(=GL,(k)) that are 
K,( — OQ, k)) finite on both sides, let #, be the 
space of functions on K, that are finite sums 
of matrix elements of irreducible represen- 
tations of K,, and let 2£, — 2£, ® J£ ,. Then 
Hi As, and J£, become algebras with the 
convolution product. Let z be a representation 
of J£, on a complex vector space V. Then z is 
admissible if and only if the following three 
conditions are satisfied: (1) Every vector v in V 
is of the form v= X1, z( f)v; with Le and 
vie V; (2) for every elementary idempotent 
€(g)=X}-, dim(a,) ' tro,(g~'), where the o; 

are a family of inequivalent irreducible repre- 
sentations of K,, the range of z(£) is finite- 
dimensional; (3) for every elementary idempo- 
tent ¢ of #, and for every vector v in n(£)V, 
the mapping fi z( f )v of ££, é into the finite- 
dimensional space z(£)V is continuous. We 
can define the Hecke algebra J£, and the no- 
tion of admissible representations also in the 
case k — C. In these cases, an admissible repre- 
sentation of #, comes from a representation 
of the tuniversal enveloping algebra of GL (Kk) 
but may not come from a representation of 

GL ;(k). It is known that for any local field k, 
the "character of each irreducible representa- 
tion is a locally integrable function. 

Let k be a global field, G, = GL,(k), and let 
G4 — GL;(k4) be the group of rational points 
of G, over the adele ring k, of k. For any place 
v of k, let k, be the completion of k at v, let G, 
— GL; (k,), and let k, be the standard maximal 
compact subgroup of G,. Let J£, be the Hecke 
algebra a, of G,, and let £, be the normalized 
Haar measure of K,. Then e, is an elementary 
idempotent of #,. Let # = ©,, H, be the 
restricted tensor product of the local Hecke 
algebra .#, with respect to the family {e,}. We 
call # the global Hecke algebra of G,. 

Let z be a representation of # on a com- 
plex vector space V. We define the notion of 
admissibility of x as before. Then we can show 
that, for any irreducible admissible represen- 
tation z of # and for any place v of k, there 
exists an irreducible admissible representation 
T, of #, on a complex vector space V, such 
that (1) for almost all v, dim V: — 1 and (2) x is 
equivalent to the restricted tensor product 
G9 n, of the z, with respect to a family of 
nonzero x,€ VF. Furthermore, the factors jn, 
are unique up to equivalence. 

Let k be a local field, let y be a nontriv- 
ial character of k, and let #, be the Hecke 
algebra of G, = GL; (Kk). Let z be an infinite- 
dimensional admissible irreducible represen- 
tation of #,. Then there is exactly one space 
W(x, y) of continuous functions on G, with 
the following three properties: (1) If W is in 
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W(x, y), then for all g in G, and for all x in k, 
1 x 
W( E 1 ) g) - V o)W(g); 


(2) W(n, V) is invariant under the right trans- 
lations of J£,, and the representation on 
W(x, V) is equivalent to z; (3) if k is Archi- 
medean and if W is in W(z, y), then there is 
a positive number N such that 


0 
ek hon 


as |t| 5 oo. We call W(z, y) the Whittaker 
model of z. The Whittaker model exists in the 
global case if and only if each factor z, of z= 
G9 7, is infinite-dimensional. 

Let k be a local field, and let z be as before. 
Then the L-function Lis, x) and the g-factor 
é(s, t, W) are defined in the following manner: 
Let œw be the quasicharacter of k* (1e., the 
continuous homomorphism k* — C *) defined 
by 


a 0 . 
dk i-e idy. 


Then the tcontragredient representation č of nz 
is equivalent too & z. For any g in G, and 
W in W(z, y), let 


0 
¥(g, s, w= | vd? er: d* a, 
k* 


¥(g,s,W)= | 


k* 


ek EE 


Then there is a real number s, such that these 
integrals converge for Re(s) ze for any ge G, 
and We W(z, y). If k is a non-Archimedean 
local field with F, as its residue field, then 
there is a unique factor Lis, x) such that 
L(s,n) ! is a polynomial of q ? with constant 
term 1, 


Pig, S, W) = Y (g, S, WyL(s, n) 


is a holomorphic function of s for all g and W, 
and there is at least one W in W(x, y) so that 
P(e, s, W)=a* with a positive constant a. If k is 
an Archimedean local field, then we can define 
the gamma factor L(s, z) in the same manner. 
Furthermore, for any local field k, if 


(s, s, W)= P(g, s, W)/L(s, 7), 


then there is a unique factor sis, v, x) which, as 
a function of s, is an exponential such that 


> [0 1 
d 1 Ah 1—s, W)- e(s, T, W) P(g, S, W) 


for all ge G, and We W(z, y). 

Let z and z' be two infinite-dimensional 
irreducible admissible representations of G,. 
Then z and z' are equivalent if and only if the 
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quasicharacters w and o are equal and 


L(l—s, 77! G i)e(s, yQ m, V) 
L(s, x ® n) 


| Ls, x ! G #)e(s, y @ n', y) 








L(s, y & 1) 


holds for any quasicharacter y. In particular, 
the set (L(s, y @ n) and eis, y & z, y) for all y] 
characterizes the representation z. 

Let k be a global field, G, = GL,(k), G4 — 
GL,(k,), and let K; = [I K, be the standard 
maximal compact subgroup of G,. Then the 
*global Hecke algebra ¥ acts on the space of 
continuous functions on G,\G, by the right 
translations. Let o be a continuous function 
on G,\G,. Then ¢ is an automorphic form if 

and only if (1) o is K,-finite on the right, (2) 

- for every telementary idempotent £ in 26. the 
space (54€) q is finite-dimensional, and (3) q is 
slowly increasing if k is an algebraic number 
field. An automorphic form q is a cusp form if 
and only if 


NEM 
d 0 1 g)dx— 


for all g in G4. Let a be the space of automor- 
phic forms on G, G,, and let ou be the space 
of cusp forms on G, G,. They are #-modules. 
Let y — II v, be a nontrivial character of k\k,, 
and let z be an irreducible admissible repre- 
sentation m= (9, x, of the global Hecke al- 
gebra X = Q9, Hr- If t is a tconstituent of the 
H -module £, then we can define the local 
factors Lis, x,) and e(s, z,, ,) for all v, al- 
though z, may not be infinite-dimensional. 
Further, the infinite products 


L(s, n) - II L(s, x,) and L(s, t) - [I L(s, z,) 


converge absolutely in a right half-plane, and 
the functions L(s, z) and L(s, č) can be analyti- 
cally continued to the whole complex plane as 
meromorphic functions of s. If z is a constitu- 
ent of 2g, then all z, are infinite-dimensional, 
L(s, z) and Lis z) are entire functions, and z is 
contained in äu with multiplicity one. If k is 
an algebraic number field, then they have only 
a finite number of poles and are bounded at 
infinity in any vertical strip of finite width. If k 
is an algebraic function field of one variable 
with field of constant F,, then they are rational 
functions of q 7 In either case, eis, z,, V, ) = 1 
for almost all v, and hence 


SÉ n) = II £(s, T, V.) 


is well defined. Furthermore, the functional 
equation 


L(s, n) — e(s, z) L(1 — s, ) - 


is satisfied. 
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As for the condition for x being a constitu- 
ent of za, we have the following: Let z= @ n, 
be an irreducible admissible representation of 
H. Then z is a constituent of oi, if and only if 
(1) for every v, z, is infinite-dimensional; (2) the 
quasicharacter y defined by 


a 0 B d 
n( mr Ze ) 2 n(a)id. 


is trivial on k*; (3) z satisfies a certain con- 
dition so that, for any quasicharacter œw of 
k*\kX, L(s,@ @ n)- II L(s, o, ® x,) and 

L(s, ! & #,)=T1 L(s, o! Q fi) converge on a 
right half-plane; and (4) for any quasicharacter 
w of k*\kx, L(s,w © x) and L(s,o ! & &) are 
entire functions of s which are bounded in 
vertical strips and satisfy the functional 
equation 


L(s,co @ n) «e(s, 0 Qn) L(1 —s,c leng 


P. Congruence ¢-Functions of Algebraic 
Function Fields of One Variable or of 
Algebraic Curves 


Let K be an talgebraic function field of one 
variable over k =F, (finite field with q ele- 
ments). The ¢-function of the algebraic function 
field K /k, denoted by C, (s), is defined by the 
infinite sum X N (UAU), where the summation 
is over all integral divisors d of K/k and where 
the norm N(9I) equals q%8°9, Equivalently, ` 
¢x(s) is defined by the infinite product [T,(1 — 
N(p) ?) !, where p runs over all prime divi- 
sors of K/k. By the change of variable u=q~*, 
Ck(s) — Zy(u) becomes a formal power series in 
u. Çg(s) and Z,(u) are sometimes called the 
congruence ¢-functions of K/k. 

The fundamental theorem states that (i) 
(Rationality) Z,(u) is a rational function of u 
of the form Z,(u)= P(u)/(1 —u)(1 — qu), where 
P(u)e Z[u] is a polynomial of degree 2g, g 
being the genus of K; (ii) (Functional equation) 
Zeil satisfies the functional equation 


Zy(1/u) - q* * u^ 7* Z x (u/q); 


and (iii) if P(u) is decomposed into linear fac- 
tors in C[u]: P(u) « EI2*, (1 — ou), then all the 
reciprocal roots o; are complex numbers of 
absolute value dJa Statement (iii) is the analog 
of the Riemann hypothesis because it is equiva- 
lent to saying that all the zeros of ¢(s)= 
Zy(q ?)lie on the line Res = 1/2. 

The congruence ¢-function was introduced 
by E. Artin [A1 (1924)] as an analog of the 
Riemann or Dedekind ¢-functions. Of its fun- 
damental properties, the rationality (i) and 
the functional equation (ii) were proven by 
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F. K. Schmidt (1931), using the *Riemann- 
Roch theorem for the function field K/k. The 
Riemann hypothesis (iii) was verified first in 
the elliptic case (g = 1) by H. Hasse [H1] and 
then in the general case by A. Weil [W2 
(1948)]. For the proof of (tii), it was essential 
to consider the geometry of algebraic curves 
that correspond to given function fields. 

Let C be a nonsingular complete curve over 
k with function field K. Then Z,(u) coincides 
with the €-function of C/k, denoted by Z(u, C), 
which is defined by the formal power series 
exp(X7_, N,u"/m). Here N,, is the number of 
rational points of C over the extension k,, of k 
of degree m. The rationality of Z,(u) is then 
equivalent to the formula 


2g 

N,—-l-q"—Ya" (meN), 
i=l 

and the Riemann hypothesis for Z,(u) is 

equivalent to the estimate 


(*) |N,—1—4"|x2gq"7 (meN). 


Now if F is the qth power morphism of C to 
itself (the Frobenius morphism of C relative to 
k), then an important observation is that N,, is 
the number of fixed points of the mth iterate 
F" of F. In other words, N,, is equal to the 
intersection number of the graph of F" with 
the diagonal on the surface C x C, and is re- 
lated to the "trace" of the Frobenius corre- 
spondence. Then (*) follows from *Castel- 
nuovo's lemma in the theory of correspon- 
dences on a curve. This is Weil's proof of the 
Riemann hypothesis in [W2]; compare the 
proof by A. Mattuck and J. Tate (Abh. Math. 
Sem. Hamburg 22 (1958)) and A. Grothendieck 
(J. Reine Angew. Math., 200 (1958)) using the 
Riemann-Roch theorem for an algebraic 
surface. 

On the other hand, let J be the *Jacobian 
variety of C over k. For each prime number ! 
different from the characteristic of k, let Mia) 
denote the */-adic representation of an endo- 
morphism « of J obtained from its action on 
points of J of order /" (n— 1,2, ...). Letting x 
be the endomorphism of J induced from F 
(which is the same as the Frobenius morphism 
of J), we have P(u) — det(1 — M,(z)u), i.e., the 
numerator of the €-function coincides with the 
characteristic polynomial of M,(z). In this 
setting, the Riemann hypothesis is a conse- 
quence of the positivity of the Rosati antiauto- 
morphism [E1]. This is the second proof given 
by Weil [W2], and applies to arbitary Abelian 
varieties. 

Recently E. Bombieri, inspired by Stepa- 
nov’s idea, gave an elementary proof of (*) 
using only the Riemann-Roch theorem for a 
curve (Sém. Bourbaki, no. 430 (1973)). 
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Q. ¢-Functions of Algebraic Varieties over 
Finite Fields 


Let V be an algebraic variety over the finite 
field with q elements F,, and let N,, be the 
number of F,™-rational points of V. Then the 
C-function of V over F, is tne formal power 
series in Z[[u]] defined by 


Z(u, (el 5 Nain; 
m=1 


alternatively it can be defined by the infinite 
product Tal —42**^) !, where P runs over 
the set of prime divisors of V and deg P is the 
degree of the residue field of P over F, (in 
other words, P runs over prime rational 0- 
cycles of V over F,). 


Weil Conjecture. In 1949, the following prop- 
erties of the ¢-function were conjectured by 
Weil [W3]. Let V be an n-dimensional com- 
plete nonsingular (absolutely irreducible) 
variety over F,. Then (1) Z(u, V) is a rational 
function of u. (2) Z(u, V) satisfies the functional 
equation 

Z((qru)', V)= +g"? u*Z(u, V), 

where the integer y is the intersection number 
(the degree of A, : Ap) of the diagonal sub- 
variety A, with itself in the product V x V, 


which is called the Euler-Poincaré character- 
istic of V. (3) Moreover, we have 


P (u); Py(u): ...: P3, ,(u) 
Po(u): Palu)... P5,(u) 





Z(u, V)= 


where P,(u) = [14 (1 — xu) is a polynomial 
with Z-coefficients such that af? are algebraic 
integers of absolute value q”? (0 x h x int the 
latter statement is the Riemann hypothesis for 
V/F, (4) When V is the reduction mod p of a 
complete nonsingular variety V* of character- 
istic 0, then the degree B, of P,(u) 1s the hth 
Betti number of V* considered as a complex 
manifold. 

This conjecture, called the Weil conjecture, 
has been completely proven. To give a brief 
history, first B. Dwork [D13] proved the 
rationality of the ¢-function for any (not neces- 
sarily complete or nonsingular) variety over 
F,. Then A. Grothendieck [A3, G2, G3] devel- 
oped the /-adic étale cohomology theory with 
M. Artin and others, and proved the above 
statements (1)-(4) (except for the Riemann 
hypothesis) with P,(u) replaced by some 
P, (u)€Q,[uJ; and S. Lubkin [L7] obtained 
similar results for liftable varieties. Finally 
Deligne [D4] proved the Riemann hypothesis 
and the independence of l of P, (u). More 
details will be given below. Before the final 
solution for the general case was obtained, the 
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conjecture had been verified for some special 
types of varieties. For curves and Abelian 
varieties, its truth was previously shown by 
Weil (— Section P). In the paper [W3] in 
which he proposed the above conjecture, Weil 
verified it for Fermat hypersurfaces, i.e., those 
defined by the equation ge Af LL Xp 
=0 (a,;eF 7); in this case, the ¢-function is of 
the form P(uy "TE o(1 —q/u) with a poly- 
nomial P(u) that can be explicitly described 

in terms of Jacobi sums. Dwork [D14] studied 
by p-adic analysis the case of hypersurfaces in 
a projective space, verifying the conjecture for 
them except for the Riemann hypothesis. Fur- 
ther nontrivial examples were provided by *K3 
surfaces (Deligne [D2], Pyatetskii-Shapiro, 
Shafarevich [P1]) and cubic 3-folds (E. Bom- 
bieri, H. Swinnerton-Dyer [B5]); in these 
cases the proof of the Riemann hypothesis was 
reduced to that of certain Abelian varieties 
naturally attached to these varieties. It can be 
said that the Weil conjecture has greatly in- 
fluenced the development of algebraic geome- 
try, as regards both the foundations and the 
methods of proof of the conjecture itself; see 
the expositions by N. Katz [K2] or B. Mazur 
[M2]. 


Weil Cohomology, /-Adic Cohomology. The 
Weil conjecture suggested the possibility of 

a good cohomology theory for algebraic 
varieties over a field of arbitrary characteristic. 
We first formulate the desired properties of a 
good cohomology (S. Kleiman [K4]). Let k be 
an algebraically closed field and K a field of 
characteristic 0, which is called the coefficient 
field. A contravariant functor V— H*(V) from 
the category of complete connected smooth 
varieties over k to the category of augmented 
Z* -graded finite-dimensional anticommuta- 
tive K-algebras (cup product as multiplication) 
is called a Weil cohomology with coefficients in 
K if it has the following three properties. (1) 
Poincaré duality: If n — dim V, then a canonical 
isomorphism H?"(V)z K exists and the cup 
product H/(V) x H?"?(V)— H^'(V)z K in- 
duces a perfect pairing. (2) Künneth formula: 
For any V, and V, the mapping H*(V,) © 
H*(Vj)-5 H*(V, x V;) defined by a © b> 
Projt(a): Projf(b) is an isomorphism. (3) Good 
relation with algebraic cycles: Let C/(V) be the 
group of algebraic cycles of codimension j on 
V. There exists a fundamental-class homomor- 
phism FUND: C/(V) H?(V) for all j, which 
is functorial in V, compatible with products 
via Künneth's formula, has compatibility of 
the intersection with the cup product, and 
maps O-cyclee C"(V) to its degree as an ele- 
ment of K = H7"(V). If a Weil cohomology 
theory H exists for the V's over k, we can 
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prove the Lefschetz fixed-point formula: 


((graph of F): (diagonal), xy 
= Y (—1)tr(F*| HIEN 
j=0 


for a morphism F: VV. 

If k=C (the field of complex numbers), the 
classical cohomology V— H*(V*", Q), where 
V^" denotes the complex manifold associated 
with V! gives a Weil cohomology. If k is an 
arbitrary algebraically closed field and if l is a 
prime number different from the characteristic 
of k, then the principal results in the theory of 
the étale cohomology state that the l-adic 
cohomology V HE(V, Qj) is a Weil coho- 
mology with coefficient field Q, (the field of l- 
adic numbers) [A3, D5, G3, M4]. In defining 
this, Grothendieck introduced a new concept 
of topology, which is now called Grothendieck 
topology. In the étale topology of a variety V, 
for example, any étale covering of a Zariski 
open subset is regarded as an “open set.” With 
respect to the étale topology, the cohomology 
group H*(V, Z/n) of V with coefficients in Z/n 
is defined in the usual manner and is a finite 
Z/n-module. If l is a prime number as above, 
lim, H*(V, Z/l") is a module over Z, =lim, Z/I* 
of finite rank, and 


H&(V, Qj) — (lim, H*(V, Z/P)) @ zQ: 


defines the l-adic cohomology group, giving 
rise to a Weil cohomology. 

For the characteristic p of k, p-adic étale 
cohomology does not give Weil cohomology; 
but the crystalline cohomology (Grothendieck 
and P. Berthelot [ B2, B3]) takes the place of p- 
adic cohomology and is almost a Weil coho- 
mology: in this theory the fundamental class is 
defined only for smooth subvarieties. 

Now fix a Weil cohomology for k = F,, an 
algebraic closure of a finite field F,. Given an 
algebraic variety V over F, let V2 V Gk 
denote the base extension of V to k; then ba 
rational points of V can be identified with the 
fixed points of the mth iterate of the Frobenius 
morphism F of V relative to F,. Then the 
Lefschetz fixed-point formula implies the 
rationality of Z(u, V); more precisely, letting 
P(u) = det(1 — uF* | H/(V)) be the characteristic 
polynomial of the automorphism F* of HIH) 
induced by F, we have 


2n . 
Z(u, V)= [T56) ^. 
j=0 


The functional equation of the ¢-function then 
follows from the Poincaré duality. This proves 
(1), (2), and a part of (3) in the statement of 
the Weil conjecture. Further, in the case of 
l-adic cohomology, (4) means that deg P;(u) = 
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dime H/(V, Q,) is equal to the jth Betti num- 
ber of a lifting of V to characteristic 0; this 
follows from the comparison theorem of M. 
Artin for the l-adic cohomology and the class- 
ical cohomology, combined with the invariance 
of l-adic cohomology under specialization. 


Proof of the Riemann Hypothesis. In 1974, 
Deligne [D4, I] completed the proof of the 
Weil conjecture for projective nonsingular 
varieties by proving that, given such a V over 
F,, any eigenvalue of F* on Hİ (V, Q,) is an 
algebraic integer, all the conjugates of which 
are of absolute value ai? (This implies that 
P(u)=det(1 —uF*| H}, (V, Qj) is in Z[u] and 
is independent of l.) The proof is done by 
induction on n= dim V; by the general results 
in l-adic cohomology (the weak Lefschetz 
theorem on a hyperplane section, the Poin- 
caré duality, and the Künneth formula), the 
proof is reduced to the assertion that (*) any 
eigenvalue a of F* on H4(V, Qj) is an alge- 
braic integer such that |o | q^? for all 
conjugates x’ of a The main ingredients in 
proving (*) are (1) Grothendieck's theory of L- 
functions, based on the étale cohomology with 
compact support and with coefficients in a Q,- 
sheaf [G2, G3]; (2) the theory of Lefschetz 
pencils (Deligne and Katz [D7]), and the 
Kajdan-Margulis theorem on the monodromy 
of a Lefschetz pencil (J. L. Verdier, Sém. Bour- 
baki, no. 423 (1972)); and (3) Rankin's methods 
to estimate the coefficients of modular forms, 
as adapted to the Grothendieck's L-series. By 
means of these geometric and arithmetic tech- 
niques, Deligne achieved the proof of the 
Riemann hypothesis for projective nonsingular 
varieties. For the generalization to complete 
varieties, see Deligne [D4, H]. 


Applications of the (Verified) Weil Conjecture. 
(1) The Ramanujan conjecture (— 32 Auto- 
morphic Functions D): The connection of this 
conjecture and the Weil conjecture for certain 
fiber varieties over a modular curve was ob- 
served by M. Sato and partially verified by Y. 
Ihara [11] and then established by Deligne 
[D3]. The Weil conjecture as proven above 
implies the truth of the Ramanujan conjecture 
and its generalization by H. Petersson. 

(2) Estimation of trigonometric sums: Let q 
be the power of a prime number p. Then 


2ni 
exp — Ire, ell 
(Xpo x,)e F; p 


«(d —1y'q"?, 


where F(X;, ..., X,)eE,[ X,,.... X,] 1s a poly- 
nomial of degree d that is not divisible by p, 
and the homogeneous part of the highest 
degree of F defines a smooth irreducible 





1712 


hypersurface in P" ^! This is a generalization 
of the Weil estimation of the Kloosterman sum 
(LD4, W1 (1948c)]; —^ 4 Additive Number 
Theory D). 

(3) The hard Lefschetz theorem: Let Le 
H*(V) be the class of a hyperplane section 
of an n-dimensional projective nonsingular 
variety V over an algebraically closed field. 
Then the cup product by L': H" (V) H"*(V) 
is an isomorphism for all i € n. Deligne 
[D4, IT] proved this for /-adic cohomology, 
from which N. Katz and W. Messing [K1] 
deduced its validity in any Weil cohomology 
or in the crystalline cohomology. 

Also some geometric properties of an alge- 
braic variety V are reflected in the properties 
of Z(u, V). The ¢-function Z(u, A) of an Abelian 
variety A determines the isogeny class of A 
[T4]. For any algebraic integer a, every conju- 
gate of which has absolute value q!7, there 
exists an Abelian variety A/F, such that a is a 
root of det(1 —uF*|H!(4)) 20 [H6]. J. Tate 
[T3] conjectured that the rank of the space 
cohomology classes of algebraic cycles of 
codimension r is equal to the order of the pole 
at u= 1/q' of Z(u, V). This conjecture is still 
open but has been verified in certain nontrivial 
cases, e.g., (1) products of curves and Abelian 
varieties, r= 1 (Tate [T4]), (2) Fermat hyper- 
surfaces of dimension 2r with some condition 
on the degree and the characteristic (Tate 
[T3], T. Shioda, Proc. Japan Acad. 55 (1979)), 
and (3) elliptic K3 surfaces, r = 1 (M. Artin and 
Swinnerton-Dyer, Inventiones Math. 20 (1973)). 


R. C- and L-Functions of Schemes 


Let X be a tscheme of finite type over Z, and 
let |X| denote the set of closed points of X; for 
each xe|X |, the residue field k(x) is finite, and 
its cardinality 1s called the norm N (x) of x. The 
C-function of a scheme X is defined by the 
product ¢(s, X) Taxi) — N(x) *) +. This 
converges absolutely for Re s> dim X, and it is 
conjectured to have an analytic continuation 
in the entire s-plane (Serre [S7 ]). It reduces to 
the Riemann (resp. Dedekind) £-function if X 
— Spec(Z) (resp. Spec(o), o being the ring of 
integers of an algebraic number field), and to 
the C-function Z(q ?, X) (— Section Q) if X 
is a variety over a finite field F,. The case of 
varieties defined over an algebraic number 
field is discussed in Section S. 

Let G be a finite group of automorphisms of 
a scheme X, and assume that the quotient Y 
— X/G exists (e.g., X 1s quasiprojective). For 
an element x in |X|, let y be its image in | Y], 
and let D(x)- (geG|g(x) 2 xj, the decompo- 
sition group of x over y. The natural mapping 
D(x) Gal(k(x)/k( y)) is surjective, and its 


1713 


kernel I(x) is called the inertia group at x. An 
element of D(x) is called a Frobenius element 
at x if its image in Gal(k(x)/k( y)) corresponds 
to the N(y)th-power automorphism of k(x). 
Now let R be a representation of G with char- 
acter y. The Artin L-function L(s, X, y) is de- 
fined by 


L(s, X, =e 3. Y x(y")N or) 


yelY|n=1 


= ee 

ye 

where zivil denotes the mean value of y on the 
` nth power of Frobenius elements F, at x (x 
any point of | X | over y), and similarly R(F,) 
denotes the mean value of R(F,); it converges 
absolutely for Res dim X. Again this is re- 
duced to the usual Artin L-function (— Sec- 
tion G) if X is the spectrum of the ring of 
integers of an algebraic number field. The 
Artin L-functions of a scheme have many 
formal properties analogous to those of Artin 
L-functions of a number field (Serre [S7]). 

Let us consider the case where X is an alge- 
braic variety over a finite field F, and ele- 
ments of G are automorphisms of X over E;; 
in this case, L(s, X, y) is a formal power series 
in u-q ^, which is called a congruence Artin 
L-function. For the case where X is a complete 
nonsingular algebraic curve and y is an irre- 
ducible character of G different from the trivial 
one, Weil [W2] proved that L(s, X, y) is a 
polynomial in u=q 5; thus the analog of 
*Artin's conjecture holds here. More generally, 
for any algebraic variety X over F,, Grothen- 
dieck [G2, G3] proved the rationality of 
L-functions together with the alternating 
product expression by polynomials in u, as in 
the case of C-functions, by the methods of l- 
adic cohomology. Actually, Grothendieck 
treated a more general type of L-function 
associated with Ladic sheaves on X, which 
also play an important role in Deligne's proof 
of the Riemann hypothesis (— Section Q). 


S. Hasse C-Functions 


For a nonsingular complete algebraic variety 
V defined over a finite algebraic number field 
K, let V, be the reduction of V modulo a prime 
ideal p of K, K, be the residue field of p, and 
Z (u, V,) be the ¢-function of V, over K,. The 
C-function £(s, V) of the complex variable s, 
determined by the infinite product (exclud- 

ing the finite number of p's where V, is not 
defined), 


Cs, V=] Z(N@) >, V). 
p 


is called the Hasse C-function of V over the 
algebraic number field K. For this function, 
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we have Hasse's conjecture [W4]: £(s, V) isa 
meromorphic function over the whole complex 
plane of s and satisfies the functional equation 
of ordinary type. Sometimes it is more natural 
to consider 


Gs, His PN) SV)". (0xj«2dimV), 
p 


where Pu, V.) is the jth factor of Z(u, V,), and 
we have a similar conjecture for them. For the 
definition of ¢;(s, V) taking into account the 
factors for bad primes and the precise form of 
the conjectural functional equation, see Serre 
[S8]. Note that (is, V) converges absolutely 
for Res j/2-- 1 as a consequence of the Weil 
conjecture. 

Hasse's conjecture remains unsolved for the 
general case, but has been verified when V is 
one of the following varieties: 

(I,) Algebraic curves defined by the equation 
y* — yx +ô and Fermat hypersurfaces (Weil 
[W6]). 

(Ib) Elliptic curves with complex multiplica- 
tion (Deuring [D11}). 

(I) Abelian varieties with complex multiplica- 
tion (Taniyama [T2], Shimura and Taniyama 
[S11], Shimura, H. Yoshida). 

(Ia) Singular K3 surfaces, i.e., K3 surfaces 
with 20 Picard numbers (Shafarevich and 
Pyatetskii-Shapiro [P1], Deligne [D2], T. 
Shioda and H Inose [821 ]). 

(II) Algebraic curves that are suitable 
models of the elliptic modular function fields 
(Eichler [E1], Shimura [S12]). 

(I1,) Algebraic curves that are suitable 
models of the automorphic function fields 
obtained from a quaternion algebra (Shimura 
[813, $15]). 

(II,) Certain fiber varieties of which the base 
is a curve of type (II,) or (II) and the fibers are 
Abelian varieties (Kuga and Shimura [K6], 
Ihara [11], Deligne [D3]). 

(Ha) Certain Shimura varieties of higher 
dimension (Langlands and others; — [B6]). 

In these cases, C(s, V) can be expressed by 
known functions, i.e., by Hecke L-functions 
with Gróssencharakters of algebraic number 
fields in cases (I) or by Dirichlet series corre- 
sponding to modular forms in cases (II). This 
fact has an essential meaning for the arithme- 
tic properties of these functions. For example, 
the extended *Ramanujan conjecture concern- 
ing the Hecke operator of the automorphic 
form reduces to Weil's conjecture on varieties 
related to those in cases II. Moreover, for 
(IL)-(11,) the essential point is the congruence 
relation 7, = I7 + Dr (Kronecker, Eichler [E1], 
Shimura). In particular, for (IT,) this formula ts 
related to the problem of constructing class 
fields over totally imaginary quadratic exten- 
sions of a totally real field F utilizing special 
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values of automorphic functions and class 
fields over F. Actually, the formula is equiva- 
lent to the reciprocity law for class fields 
(Shimura). 

One of the facts that makes the Hasse ¢- 
function important is that it describes the 
decomposition law of prime ideals of algebraic 
number fields when V is an algebraic curve or 
an Abelian variety (Weil, Shimura [S14], 
Taniyama [T2], T. Honda [H6]). In that case, 
its Hasse C-function has the following arithme- 
tic meaning. 

Let C be a complete, nonsingular algebraic 
curve defined over an algebraic number field 
K, and let J be the Jacobian variety of C de- 
fined over K. For a prime number J, fix an 
l-adic coordinate system 2, on J, and let 
K(J,1?) be the extension field of K obtained 
by adjoining to K all the coordinates of the 
l'th division points (v=1,2,...) of J. Then 
K(J,I*)/K is an infinite Galois extension of K. 
The corresponding Galois group (5(J, 1”) has 
the /-adic representation a> M7 (a) by the l- 
adic coordinates £,. Almost all prime ideals p 
of K are unramified in K(J,/°)/K. Thus when 
we take an arbitrary prime factor $ of p in 
K(J,1*), the Frobenius substitution of $, 


= SE 
Ow = €.» 5 


is uniquely determined. Furthermore, the 
characteristic polynomial det(1 — M;*(ay)u) is 
determined only by p and does not depend on 
the choice of the prime factor P; we denote 
this polynomial by P,(u, C). In this case, for 
almost all p, P,(u, C) is a polynomial with 
rational integral coefficients independent of l; 
namely, the numerator of the ¢-function of the 
reduction of C mod p. Thus 


G(s, C) [[ PN) *O) 
p 


^ [T det(1 — Mf(oq)N(») H) 
p 


Here the product [T det) — Mf(c4)N(p) "1! 
has the same expression as the Artin L- 
function if we ignore the fact that Mf is the l- 
adic representation and K(J,/”) is the infinite 
extension. Thus if we can describe ¢(s, C) ex- 
plicitly, then the decomposition process of the 
prime ideal for intermediate fields between 
K(J,1*) and K can be made fairly clear. In 
fact, this is the case for examples (I,)—(I,) and 
(114)- (IT,), from which the relations between 
the arithmetic of the field of division points 
K(J,1°)/K and the eigenvalues of the Hecke 
operator have been obtained. Thus for curves 
and Abelian varieties, ¢(s, V) is related to the 
arithmetic of some number fields; but it is not 
known whether similar arithmetical relations 
exist for other kinds of varieties except in a few 
cases. 
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Tate's Conjecture. For a projective nonsin- 
gular variety V over a finite algebraic number 
field K, let 9U'(V) denote the group of algebraic 
cycles of codimension r on V= V ® xC modulo 
homological equivalence and let 9l'(V) be the 
subgroup of U'(V) generated by algebraic 
cycles rational over K. Then Tate [T3] conjec- 
tured that the rank of 9I'(V) is equal to the 
order of the pole of £,, (s, V) at s=r+ 1. This 
conjecture is closely connected with Hodge's 
conjecture that the space of rational coho- 
mology classes of type (r,r) on V is spanned by 
9l'(V); in fact, the equivalence of these conjec- 
tures is known for Abelian varieties of (CM 
type (H. Pohlmann, Ann. Math., 88 (1968)) and 
for Fermat hypersurfaces of dimension 2r 
(Tate [T3], Weil [W6]). Thus, when r=1, 
Tate's conjecture for these varieties holds by 
Lefschetz's theorem, and when r » 1, it holds in 
certain cases where the Hodge conjecture is 
verified (Shioda, Math. Ann., 245 (1979); Z. 
Ran, Compositio Math., 42 (1981)). Further 
examples are given by K3 surfaces with large 
Picard numbers (Shioda and Inose [S21]; T. 
Oda, Proc. Japan Acad., 56 (1980)). 


L-Functions of Elliptic Curves. Let E be an 
elliptic curve (with a rational point) over the 
rational number field Q, and let N be its con- 
ductor; a prime number p divides N if and 
only if E has bad reduction mod p (Tate [T5]). 
The L-function of E over Q is defined as 
follows: 


Lí(s, E) 


-[It-s»*?"[[ü-a,p^*p' 7)", 
DIN ptN 
where e, —0 or +1 and 1—a,u pu? = 
P, (u, E mod p). There are many interesting 
results and conjectures concerning Lis E) 
[T5]: 
(1) Functional equation. Let 


&(s, E) 2 N*? (2n) SF (s). (s, E). 


Then it is conjectured that £(s, E) is holo- 
morphic in the entire s-plane and satisfies the 
functional equation é(s, E) « + £(2 — s, E). This 
is true if E has complex multiplication (Deur- 
ing) or E is a certain modular curve (Eichler, 
Shimura). 

(2) Taniyama-Weil conjecture. Weil [W1 
(1967a)] conjectured that, if L(s, E) 
=b a,n then f(t)= E£, a,e?""" is a cusp 
form of weight 2 for the congruence subgroup 
I5(N) which is an eigenfunction for Hecke 
operators; moreover E is isogenous to a factor 
of the Jacobian variety of the modular curve 
for To(N) in such a way that f(t)dz corre- 
sponds to the differential of the first kind on E. 
If this conjecture is true, then the statements in 
(1) follow. 


1715 


(3) Birch-Swiunerton-Dyer conjecture. As- 
suming analytic continuation of L(s, E), B. 
Birch and H. Swinnerton-Dyer [B4] conjec- 
tured that the order of the zero of L(s, E) at s 
— | is equal to the rank r of the group E(Q) of 
rational points of E which is finitely generated 
by the Mordell-Weil theorem. They verified 
this for many examples, especially for curves of 
the type y? =x? — ax. J. Coates and A. Wiles 
(Inventiones Math., 39 (1977)) proved that if E 
has complex multiplication and if r>0 then 
L(s, E) vanishes at s— 1. This conjecture has a 
refinement which extends also to Abelian 
varieties over a global field (Tate, Sém. Bour- 
baki, no. 306 (1966)). 

(4) Sato's conjecture. Let 


1—a,u+ pu? =(1 —n,u)(1 — zu), 


with z, = Jp e" (0 — 0, « 1). When E has 
complex multiplication, the distribution of 0, 
for half of p is uniform in the interval [0, x], 
and 0, is 7/2 for the remaining half of p. Sup- 
pose that E does not have complex multipli- 
cation. Then Sato conjectured that 


(the number of prime numbers p 
S less than x such that Ope[x, f]) 
lim ME E d OE EE 
ee (the number of prime 
numbers less than x) 





2 8 
ei sin^0d0 (O<a<fß<nr) 
n x 


(Tate [T3]). 


H. Yoshida [Y1] posed an analog of Sato's 
conjecture for elliptic curves defined over 
function fields with finite constant fields and 
proved it in certain cases. 

(5) Formal groups. Letting L(s, E) Zon? 
as before, set f(x) 2 3; ,a,x"/n. Honda [H6] 
showed that f! ( f(x) - f(y)) is a "formal (Lie) 
group with coefficients in Z and that this 
group is isomorphic over Z to a formal group 
obtained by power series expansion of the 
group law of E with respect to suitable "local 
uniformizing coordinates at the origin. Such 
an interpretation of the C-function also applies 
to other cases in which ¢-functions of "group 
varieties may be characterized as Dirichlet 
series whose coefficients give a normal form of 
the group law; e.g., the case of algebraic tori 
(T. Ibukiyama, J. Fac. Sci. Univ. Tokyo, (IA) 21 
(1974)). 


T. Selberg ¢-Functions and ¢-Functions 
Associated with Discontinuous Groups 


Let 'c SL(2, R) be a *Fuchstan group operat- 
ing on the complex upper half-plane H= {z= 
x+iy|y>0}. When the two eigenvalues of 

an element yeT are distinct real numbers ¢,, 
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& (6165 = 1, 6 < £5), we call y thyperbolic. Then 
the number & is denoted by N(y) and is called 
the norm of y. When y is hyperbolic, y” (n= 
1,2, 3,...) is also hyperbolic. When +y is not a 
positive power of other hyperbolic elements, y 
is called a primitive hyperbolic element. The 
elements conjugate to primitive hyperbolic 
elements are also primitive hyperbolic ele- 
ments and have the same norm as y. Let P,, 
P,,... be the conjugacy classes of primitive 
hyperbolic elements of T, and let y;e P; be their 
representatives. Suppose that a matrix repres- 
entation y-» M(y) of T is given. Then the analy- 
tic function given by 


zs M) []T] det — M6)NGJ-7^ 


is called the Selberg ¢-function (Selberg [S5]). 
When I' MH is compact and T is torsion-free, 
then Z,(s, M) has the following properties. 

(1) It can be analytically continued to the 
whole complex plane of s and gives an tin- 
tegral function of genus at most 2. 

(2) It has zeros of order (2n + 1)(2g — 2)v at 
—n (n—0,1,2,3, ...). Here g is the genus of the 
Riemann surface ("H and v is the degree of 
the representation M. All other zeros lie on the 
line Res= 1/2, except for a finite number that 
lie on the interval (0, 1) of the real axis. 

(3) It satisfies the functional equation 


Zeil —s, M)» Zi (s, M) exp( run 
s—1/2 
x | ont) 
0 


dx dy . 
A(INH)- ; 7^2nQg—2) x-iyeH. 
por 


where 





Property (2) shows that the Riemann hy- 
pothesis is almost valid for Z,;(s, M). The proof 
is based on the following fact concerning the 
eigenvalue problem for the variety T\H: The 
eigenvalue 4 of the equation 


y?(02/0x?  0?/0y?)u-Au20, ue L'(TAH) 


cannot be a negative number. 

Using this function, T. Yamada (1965) inves- 
tigated the unit distribution of real quadratic 
fields. 

Selberg c-functions are defined similarly 
when TG has finite volume but is noncom- 
pact. In this case, however, the decomposition 
of L,(I'^G) into irreducible representation 
spaces has a continuous spectrum; hence the 
properties of the Selberg ¢-function of T are 
quite different from the case when I'AG is 
compact. Selberg defined the generalized 
Eisenstein series to give the eigenfunctions of 
this continuous spectrum explicitly. When I = 
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f 


SL(Q, Z), the series is given by ———.. 
(c,d)=1 lct+d|?5 


This type of generalized Eisenstein series is 
also defined for the general semisimple alge- 
braic group G and its arithmetic subgroup. 

It has been studied by Selberg, Godement, 
Gel’fand, Harish-Chandra, Langlands, D. 
Zagier, and others. 


U. Ihara £-Functions 


Let k, be a p-adic field, o, the ring of integers 
in k, and G= PSL;(R) x PSL;(k,). Suppose 
that T is a subgroup of G such that (1) T is dis- 
crete, (2) T\G is compact, (3) F has no torsion 
element except the identity, (4) le (the projec- 
tion of I in PSL,(R)) is dense in PSL,(R), and 
(5) T, (the projection of T in PSL;(k,)) is dense 
in PSL;(k,). Then E I'4zI,. Let X 2 ix 
iy| y» 0) be the upper half-plane, and let T 

act on X via lr. The action of T on X is 

not discontinuous, but the subgroup Io = 

(y eT | projection of y to r,e PSL;(o,)j oper- 
ates on X properly discontinuously. For 

each ze X, define I, — (y eT |y(z) zi. Then 

T. is isomorphic to Z or {1}. Let Pl 

(ze X |T; =Z}. The group T acts on P(T), 
since T, and T,, are conjugate in I. Let P(T) 
=P(T)/T. Suppose that Pe P(T) is represented 
by ze X. Choose a generator y of I', and pro- 
ject y to I',. Then y is equivalent to a diagonal 


A 0 
matrix ( 2 with Aek We denote the 
0 4! P? 


valuation of k, by ord, and consider |ord,(A)]. 
This value depends only on P and we denote it 
by deg(P). The Ihara ¢-function of I' is defined 
by 


Zju)- [| (1-uy, 


PeP(I) 


Ihara proved that 


TI satt niu) 

i=1 
SEELEN 
where q is the number of elements in the re- 
sidue class field of p, and g is the genus of the 
Riemann surface ')\X and H — (g — 1)q(q — 1). 
Similar results hold even if I has torsion ele- 
ments and the quotient ING is only assumed 
to have finite volume. 

Aside from the factor (1 —u)", this looks like 
Weil's formula for the congruence C-function 
of an algebraic curve defined over F,2. Ihara 
conjectured that the first factor of Zeil is 
always the congruence ¢-function of some 
algebraic curve over F,;, and furthermore that 
T could be regarded as the fundamental group 
of a certain Galois covering of this curve 
which describes the decomposition law of 


(1 — uy", 
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prime divisors in this covering [12. 13,14). 

He verified the conjecture in the case I = 
PGL,(Z[1/p]) by using the *moduli of elliptic 
curves. Related results have been obtained by 
Shimura, Ihara, Y. Morita, and others. 


V. (-Functions Associated with 
Prehomogeneous Vector Spaces 


M. Sato posed a notion of prehomogeneous 
vector spaces and defined ¢-functions as- 
sociated with them. Sato's program has been 
carried on by himself and T. Shintani [S2, 53, 
S17,S18]. Let G be a linear algebraic group, V 
a finite-dimensional linear space of dimension 


, n, and p a rational representation G— GL(V), 


where G, V, and p are defined over Q. The 
triple (G, p, V) is called a prehomogeneous 
vector space if there exists a proper algebraic 
subset S of Ve such that Vc — S is a single Gc- 
orbit. The algebraic set S is called the set of 
singular points of V. We also assume that G is 
reductive and S is an irreducible hypersurface 
of V. Let V* be the dual vector space of V, 
and p* the dual (contragredient) representa- 
tion of G. Then (G, p*, V*) is again a pre- 
homogeneous vector space, and we denote 
its set of singular points by S*. There are 
homogeneous polynomials P and Q of the 
same degree d on V and V*, respectively, 
such that S= (xe X(|P(x) 20] and S*={x*e 
V*|Q(x*)=0}. P and Q are relative invariants 
of G, i.e., P(p(g)x) = x(g) P(x) and Q(p*(g)x*) 
= y(g) !Q(x*) (for get, xe V, and x*eV*) 
hold with a rational character y of G. Put 
G!zkery-ígeG|xy(g) 2 1). Denote by Gg 
the connected component of 1 of the Lie 
group Gg. Let R-S=V U... UV, V&—S*— 
V*U...UV,* be the decompositions of Vg —5 
and Vi — S* into their topologically con- 
nected components. Then V; and V;* are Gg- 
orbits. We further assume that Va (1 S decom- 
poses into the union of a finite number of GR- 
orbits. Set IT = Gg N G4}, and take T-invariant 
lattices L and L* in Vg and V$, respectively. 
Consider the following functions in s: 


of, s)— 1 F(xMPOOP dx, 


of, ul f*(x*)|Qoc)r dx*, 
vi 


and 


Zi f. L, y=] 


GR 


99 3, foto) d, 


ve 


Z7(f*,L*,s) 


= | mal" Y f*(p*(g)x*)dg, 
Gg; T x*e yt 


L*N 
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where f and f* are trapidly decreasing func- 
tions on ke and Vf, respectively, dx and dx* 
are Haar measures of Vg and Vj, respectively, 
and dg is a Haar measure of G. Then the 
ratios 


Z f. L, s) - 
urai D 
ZI SS) eats om 





O(f*,s—nid) 


are independent of the choice of f and f* and 
are Dirichlet series in s. These Dirichlet series 
ci, L) and Z7 (s, L*) are called ¢-functions 
associated with the prehomogeneous space. 
Considering Fourier transforms of | P(x)|* and 
|Q(x*)}*, we obtain functional equations for č; 
and ¢¥ under some additional (but mild) con- 
ditions on (G, p, V) as follows. The Dirichlet 
series č; and ¢* are analytically continuable to 
meromorphic functions on the whole s-plane, 
and they satisfy 


v(L*)£* (n/d — s, L*) 
—y(s — n/d)Q2n) ^ “|b exp(nd,/— 1 5/2) 


I 
x 2 uj(s) e. (s, L), 


with a l-factor y(s) - II, l'(s— c; 4- 1). 

Here u;,(s) (1 <i,j €l) are polynomials in 

exp( z,/ —1 s) with degree <d, and b, and c; 
are constants depending only on (G, p, V). 

Epstein's C-functions and Siegel's Dirichlet 
series associated with indefinite quadratic 
forms are examples of the above-defined C- 
functions. Shintani defined such £-functions 
related to integral binary cubic forms and 
obtained asymptotic formulas concerning the 
class numbers of irreducible integral binary 
cubic forms with discriminant n, which are 
improvements on the results of Davenport 
[517]. 

Recently M. Sato studied C-functions of pre- 
homogeneous vector spaces without assuming 
the conditions that G is reductive and S is 
irreducible. In this case, ¢-functions of several 
complex variables are obtained. For examples 
and classification of prehomogeneous vector 
spaces — [$84]. 
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Differential Geometry 
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Knot Theory 
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Inequalities 
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Differential and Integral Calculus 
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Total and Partial Differential 
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Functions 
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Gamma Functions and Related 
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1. Algebraic Equations (— 10 Algebraic Equations) 


(I) Quadratic Equation ax?+bx+c=0 (a0) 


The roots are 


a 2-. _ Bh’ Le 
ga E Age — E aC ` iss 959), 


The discriminant is b?—4ac. 


(II) Cubic Equation ax? + bx?+cx+d=0 (a0) 


By the translation =x + b/3a, the equation is transformed into £?-- 3p£- q—0, where 
p=(3ac —b?)/9a?, qz (2b? —9abc + 27a? d)/27a?. 

Its discriminant is — 27(q? + 4p?). The roots of the latter equation are 
£- Va t VB, oVa to? V B , o^ Va toV B, 


where 


(Cardano’s formula). 


D zem. ~1+V3i a |= —q+Vq?+4p3 
W=e eye s mur E EC Rd 


B 2 


Casus irreducibilis (the case when q? -- 4p? « 0). Putting a =re® ( B. — a), the roots are 


£-2Vr cos(8/3), 2Vr cos[(8--22)/3], 2Vr cos[(84-42)/3]. 


(III) Quartic Equation (Biquadratic Equation) ax^-- bx? + cx*+ dx c e-0 (a 3-0) 


By the translation £— x + b /4a, the equation is transformed into 
£4+ pé*+ g£* r—0. 


The cubic resolvent of the latter is f° — pt? — 4rt + (Apr — q?) 2 0. If ty is one of the roots of the 
cubic resolvent, the roots E of the above equation are the solutions of two quadratic equations 


¿7+ Vto—p [£—-4/2(9—-p)]* 1/270 ` (Ferrari's formula). 


2. Trigonometry 


(I) Trigonometric Functions (— 432 Trigonometry) 


(1) In Fig. 1, OA=OB=OP=1, and L 
MP sin, OM =cos@, AT=tané, 


BL=cot@, OT=sec@, OL — cosecÓ. 
(2) sin? -- cos/8 — 1, 


tanü—sin0/cos0, ^ cotü—1/tanU,  sec#=1/cos8, 


cosecÓ — l/sinü, ` l-tan?8-sec?0, 1+cot?@=cosec?é. Fig. 1 


(3) 






+(-—1)’s 
(le 


+t 
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V5 —1 
4 


V 10+2V5 
4 














(5) Addition Formulas 
sin (a + B) = sina cos f + cosa sin B, cos (a + B)=cosa cos B x sina sin B, 
tan (a + B) (tana tan B)/(17 tanatan f). 

(6) sin2a—2sinacosa, ^ cos2a-cos/a—sin?a 22cos?a — 1 ^ 1 — 2sin"a, 


tan2a 22tana/(1-— tan?a). 
sin 3a =3 sina — 4sin?a, cos3a =4cos*a — 3 cosa, 
tan 3a = (3 tana — Gnad — 3 tan?a). 


SC n ( y 2i*1 n-(2i*1) 
sin na = . —1 sint acos PY a, 


w21 f n E l 
cosna= A. 4 ( —1)'sin?a cos" 74a. 
i 


i-o 


(7) sin(a/2)-(1—c0osa)/2, ^ cos(a/2)— (14-cosa)/2, 
tan(a/2)- (1— cosa)/(1- cosa). 


(8 2sinacosB=sin(a+ B)- sin(a — B), 2cosasinB=sin(a+ B) —-sin(a — B), 
2cosa cos B 7 cos(a + B) - cos(a — B), —2sinasin f =cos(a+ B) — cos(a — f). 
sina t sin f =2sin[(a + B)/2]cos[(a — 8)/2], 
sina — sin fi 2 2cos[(a + 8)/2]sin [(a — 8)/2], 
cosa + cos fj - 2cos[(a + 8)/2]cos[(a — B)/2], 
cosa — cos B= — 2sin[(a + 8)/2]sin[(a — 8)/2]. 


(II) Plane Triangles 


As shown in Fig. 2, we denote the interior angles of a triangle ABC by a, B, y; the corresponding 
side lengths by a, b, c; the area by S; the radii of inscribed, circumscribed, and escribed circles 
by r, R, r4, respectively; the perpendicular line from the vertex A to the side BC by AH; the 
midpoint of the side BC by M; bisector of the angle 4 by AD; and the lengths of AH, AM, AD 
by hy, m4, fa, respectively. Similar notations are used for B and C. Put s=(a+b+c)/2. The 
symbol ... means similar formulas by the cyclic permutation of the letters 4, B, C, and corre- 
sponding quantities. 





Se = mi^ - us =2R (law of sines). 
a=bcosy+ccosB, ... (the first law of cosines). 
=b?+c?—2becosa, ... (the second law of cosines). 
sin(a/2)-(s—bys—c)/be, ...; cos*(a/2)= s(s — a)/ bc, 
(b+ c)sin(a/2)=acos[(B—y)/2], ...; (b—c)cos(a/2)=asin[( B — y)/2], 


a+b _ tan[(a+B)/2] 
a-b  tan[(a—B)/2] 





(Napier’s rule). 
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S= ah, /2— (1/2)bcsina —(1/2)a?sin fisin y /sina = abc /AR —2R?sinasin fi sin y 
=rs=r4(s—a)= Vrrargre 
= V s(s— aY(s— bYs— c) (Heron's formula). 

r — (s— a)tan(a/2) 2 4R sin(a/2)sin( B/2)sin(y/2). 

rg=stan(a/2)=(s— b)cot(y/2)=4R sin(a/2)cos( B/2)cos(y/2). 

1/r=(1/hg)+(1/hg) * (1/ hc). 

m? = (2b? + 2c? — a”) /4=(b? + c?  2be cosa)/A. 

f4 7 2bccos(a/2)/(b - c) 2X bes(s— a) /(b- c). 

fafafc =8abers?/(b+ c)(c + aa + b). 





Fig. 2 


(III) Spherical Triangles 


We denote the interior angles of a spherical triangle by a, B, y; the corresponding sides by a, b, 
c; the area by S; and the radius of the supporting sphere by p. We have 


sina: sinb :sinc —sina : sin f : sin y (law of sines). 
cosa =cosbcosc +sinbsinccosa, ...; cosa = —cos fl cos y + sin f' sin y cosa, 


(law of cosines). 


sina cos 8 -cosbsinc—sinbcosccosa, ... (law of sines and cosines). 
cotasinb=cosbcosy+cotasiny, ... (law of cotangents). 
tan[(a+ 5)/2]/tan[(a — 6)/2]=tan[(a+ 8)/2]/tan[(a — 8)/2] ... (law of tangents). 


tan[(a + B)/2]tan(y/2) 2 cos((a — b)/2]/cos[(a  b)/2], ...; 

tan[(a — 8)/2]tan (y/2) ^sin[(a — b)/2]/sin[(a + b)/2) ...; 

tan[(a + b)/2)cot(c/2) 2 cos[((a — B)/2]/cos[(a+ B)/2] ...; 

tan [(a — 5)/2]cot(c/2) 2 sin((a — B)/2]/sin[(at+ B)/2), ... (Napier's analogies). 


cos? (a/2R) + cos? (b/2R) 4- cos?(c/2R) 
S= —n)p? =2p? n H 's f la). 
(x 4- B-y —7n)p p* arccos 2cos(a/2R)cos(b/2R)cos(c/2R) (Heron's formula) 


For a right triangle (y 2 7/2), we have Napier's rule of circular parts: taking the subscripts 
modulo 5 in Fig. 3, 


sin 6, — tan, , ,tan6, us CO, ;cos0, >. 
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For example, we have 


cosc = cosa cosb 7 cota cot £j, x/2—af0, 





cos B ^ tanacotc=cosbsina, 


sina = tanb cot B ^ sincsina. Fig. 3 


3. Vector Analysis and Coordinate Systems 


We denote a 3-dimensional vector by A=(A,,4,,4,)=A,i+ Aj A,k, |A| = VAT AL AP. 


(I) Vector Algebra (— 442 Vectors) 
Scalar product A+B=AB=(A,B)=4,B,+ A, B, + A.B, = |A|B|cos0 
(where @ is the angle between A and B). 
Vector product 
ij 
AXBz([A,B]- (4, B, — A,B, i+ (4, B, ~ A,B, j+ (A,B, ~ A,B )k=|Ax 4, 
B. B, 


N 


SS D 


\A X B| |A| B|sin8. 
A*:B-B-:A. A:A-A?-JA. AXA-0. A*(AXB)=0. (AxBP-«|JAP|BP? —(A * BY. 
AX(BXC)-(A*C)B-(A*B)C. AX(BXC)+Bx(CXA)+CX(AXB)=0. 
(AXB)*(CXD)- A* (Bx (CXD)) - (A* C(B: D) - (B: C(A* D). 


A. A, A 
x y z 
Scalar triple product [ABC]=A+(BXC)=B+(CXA)=C*(AXB)=|B8, B, B,. 
CG G G 


[BCD]A +[ACD]B +[ABD]C=[ABC]D. [ABCJEFG]- 








(ID Differentiation of a Vector Field (— 442 Vectors) 


d d d 














V ae Hay "ka: (Nabla), 
dp. dp, de 
gradgz Vg- 3e ait 3; (gradient of ọ), 

Lasso s 94, [94, OA, (9A, dA 
rotA=VXA= 3 Fa ( SC \i+ (42-3) (rotation of A), 
EET a fA 
ivA= SC ch áy + 3z (divergence of A), 


Ag=V*p=divgradg= ty + x + a (Laplacian of p). 

grad (qu) =p grad y + y grad q, 

grad (A * B)= (B ° grad)A + (A * grad)B+ AX rot B-- BXrotA, 
rot(pA)=grotA—AXgradg, rot(AXB)-(B*grad)A — (A*grad)B + A div B — B div A, 
div(pA)=qdivA+A*gradg, div(AXB)2 B*:rotA— A *rotB. 

rotgradg=0, divrotA=0. AA graddivA — rotrot A. 


A(f» q) =(df/dp)dgp  (d?f/ dg? grado}, | (qu) * pAy Aq + (grad p grady). 
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(III) Integration of a Vector Field (— 94 Curvilinear Integrals and Surface Integrals, 
442 Vectors) 


Let D be a 3-dimensional domain, B its boundary, dV the volume element of D, dS the surface 
element of B, and dS=n dS, where n is the outer normal vector of the surface B. We have 


Gauss's formula Jf [sv ^av- f [.as-A- ff maas, 
Jffrotaav= ff aSx^- ff (axayas, 
Jf fsradoav= [f os: 


Green's formula f f, p dS = J J f, (pAy + grad g* grad y) dV, 
ff [oss v les - fff ot otav, 
va: - fff Save ff [2% 08 (1) as 


where r is the distance from the point xp. 

Let B be a bordered surface with a boundary curve T, ds the line element of I’, dS the surface 
element of B, and ds — tds, dS — ndS, for t the unit tangent vector of I and under the proper 
choice of the positive direction for the surface normal n. We have 


Stokes's formula {| E Ac (t: A)ds, {| dS E ods. 
B r r B r 


If the domain D is simply connected, and the vector field V tends sufficiently rapidly to 
0 near the boundary of D and at infinity, we have 


Helmholtz's theorem — V—gradg-*rotA, y= STE divV aw dv, A= THES rotV ; 


Aer dor 


(IV) Moving Coordinate System 


Denote differentiation with respect to the rest and the moving systems by d/ dt, d* / dt, respec- 
tively. Let the relative velocity of the systems be v. Then we have 


d* * 
GO ee e dA CA [v* gradA — (A * grad)v]. 


v=wxr 
dA d*A , 
um A * [wA- ((wxr)* grad)A] 


When the domain of integration is also a function of t, 
d m 9A . ER . 
al ds= f | T + grad (v* A) vXrotA ) ds, 
4 fa: dS= fri +rot(Axv)+vdiva | dS, 


ZS ff[oav- Hn tv: gradg) + edivv | av= fff Pars ffo: dS. 


(V) Curvilinear Coordinates (— 90 Coordinates) 


Let (x,,...,x,) be rectangular coordinates in an n-dimensional Euclidean space. If 


x= 9 (uy, TA (j= l, iM), J =det (dy,/du,) #0, 


the system (1, ...,u,) may be taken as a coordinate system of an n-dimensional space, and the 
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original space is a Riemannian manifold with the first fundamental form 


dg; de 
Bu" ~ Zu Du, (f, ko, ...,n), 


gadet(g,)- J?. 


When the metric is of the diagonal form g, = Së the coordinate system (yx... Hl is called 
an orthogonal curvilinear coordinate system or an isothermal curvilinear EE system. In 
such a case we have J= g,...g,, and the line element is given by d? =F" | gj du 


For a scalar f and a vector $= (£,,...,$,), we have 
f, IN af(s a 
(grad f), PU (j=1,...,n), Af= 3 SE , 
: Qu; iren Qu, ei Qu, 


: d 0 (g$; 
tre Ge (2 od zl (rot) x SE RES KS (ko l,...,n). 


When n —2, the rot may be considered a scalar, rot £— (rot£);;, and when n 3, the rot may be 
considered a vector, with components 


rot£- ((rot)o3, (rot) (rot&)1;). 
The following are examples of orthogonal coordinates. 
(1) Planar Curvilinear Coordinates. In the present Section (1), we put 
X15 xX, QE M=, U= UU, §1 =P, §2= 4. 
deis pi dit och),  J-c9(xy)/9(uv)- Vpq. 
Planar orthogonal curvilinear coordinates may be represented in the form x + iy=F(U+ iV), 


F being a complex analytic function, by suitable choice of the functions U = U (u), V= V (v). 
(i) Polar Coordinates (r,@) (Fig. 4). 


x-rcosÜü, y=rsin@; x-iy-exp(logr- i). 
r= M xo y? , 8-arctan(y/ x). 
p-l q=r, Jer, ds??-dr4r?. 


$i 3:973 
2'ràór rape 





Fig. 4 


(ii) Elliptic Coordinates ( u, ») (Fig. 5). Among the family of confocal conics 


2 2 
x J 

+ —— =] (a>b), 
agin b*+p ( ) 





there are two values of p for which the curve passes through a given point P (x,y). Denote the 
two values of p by p and v, where p > — b? >v > — a?. The curve corresponding to p= u or p=» 
is an ellipse or a hyperbola, respectively. Then we have the relations 


x?=(pta’)(vt+a?)/(a?—b*), ` y?2(u b?)(» - b)/(b2— a). 


Let the common foci be (+c,0) (c? 2 a?— b?). Then we have 


nya oy ; r- Voce) y? 


where r}, r; are the distances from the two foci as in Fig. 5, and 


4(a? u) m Gy ray, Aalt v) - (r, r3). 


NN NM xd 
(aab) " 2V (xay b) 


(iii) Parabolic Coordinates (a, 8) (Fig. 6). Among the family of parabolas y? = 4p(x + p) with the 
focus at the origin and having the x-axis as the principal axis, there are two values of p for which 
the curve passes through a given point P (x,y). Denote the two values of p by a, B (a 5 0» B). 
We have x= - (a- B), y= V —4af . 

(iv) Equilateral (or Rectangular) Hyperbolic Coordinates (u,v) (Fig. 7). This is a system that 
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replaces x/2, y/2 in (iii) by — y and x, respectively, with Ja=u, V —B =v. The relations are 


x=uv, y=(u?—v?)/2; x+iy=i(u—iv)?/2, u?, v=, /x?+y +y, Boden, ui +v. 
The curves x =constant or y — constant are equilateral hyperbolas. 
(v) Bipolar Coordinates (£, ai (Fig. 8). These coordinates represent a point P(x,y) on a plane as 
the intersection of the family of circles passing through two fixed points (+ a,0) and the family 
of loci on which the ratio of distances from the same two fixed points (+ a,0) is constant. The 
latter is the set of Apollonius' circles. The relations are 





asinh& asinn 
*7 coshé+cosy’ Y= cosh£tcosq (—o»«£«o0,0« n «2m). 
u a 
P=1= coshétcosq- 


(2) Curvilinear Coordinates in 3- Dimensional Space. In the present Section (2), we put x, =x 
X3 =y, X3 -2. 
(i) Circular Cylindrical Coordinates (Cylindrical Coordinates) (p, p, z) (Fig. 9). 


H 


X=pcosy, y=psing, z=z. 


KE F) 1f EE 
ds? = dp? + p?dq? + dz?, J=p. A =F We ap) te ace ta 


p? dg? = az? 
(ii) Polar Coordinates (Spherical Coordinates) (Fig. 9). 


x=rsin@cosy, y-rsinÜsing, z=rcosé. 


r=\Vx?+y24+22, p=arctan(y/x), O=arctan(Vx?+y? /2). 
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The angles ọ and @ are called azimuth and zenith angle, respectively. We further have 
ds? = dr? + r°d0* + r^ sin?8 dy’, J=r’sin@. 


1 3 f23f (EH EE 1 df 

Af areae * cag e s * ag se 
(iii) Euler’s Angles (Fig. 10). Let (x,y,z) and (&,7,¢) be two linear orthogonal coordinate systems 
with common origin O. Denote the angle between the z-axis and the $-axis by 8; the angle 
between the zx-plane and the z$-plane by ọ; and the angle between the y-axis and the intersec- 
tion OK of the xy-plane and the £n-plane (or the angle between the £-axis and the intersection 
OL of the z$-plane and the £g-plane) by y. The angles 0, y, and y are called Euler’s angles. The 
direction cosines of one coordinate axis with respect to the other coordinate system are as 
follows: 











~sin@cos V 
sin d siny 
cos@ 





sing cosÓ cosy + cos sinu 
—sing cosÜ sin y + cos cosy 
sing sin 


cosy cosÓ cosy — sing sini 
— cosq cosÜ siny — sing cosy 
cos y sin 





(iv) Rotational (or Revolutional) Coordinates (u, v, p). Let (u,v) be curvilinear coordinates (Sec- 
tion (1)) on the zp-plane. The rotational coordinates (u,v, p) are given by the combination of 
X =pcosg, y=psing with the coordinates on the zp-plane. We have 

ds? = p? du? + q^ dv? + p? dy’, 
where p, q are the corresponding values for the coordinates (u,v). 
(v) Generalized Cylindrical Coordinates (u,v,z). These are a combination of curvilinear coordi- 
nates (u,v) on the xy-plane with z. We have 

ds? = p? du? + q? dv? + dz?. 


For various selections of (u,v) we have coordinates as follows: 


(u,v) Rotational Geen 
i Coordinate System a pene 
Coordinate System 
Linear rectangular Circular cylindrical Linear rectangular 
coordinates coordinates coordinates 


Polar coordinates Circular cylindrical 


Spherical coordinates 


(DG) coordinates 
Elliptic coordinates Spheroidal coordinates” Elliptic cylindrical 
(D0) coordinates 
Parabolic coordinates Rotational parabolic Parabolic cylindrical 
((1)ii)) coordinates coordinates 
Equilateral hyperbolic Rotational hyperbolic Hyperbolic cylindrical 
coordinates ((1)(iv)) coordinates coordinates 
Bipolar coordinates Toroidal coordinates? Bipolar cylindrical 
((1)(v)) Bipolar coordinates coordinates 


Notes 
(1) When the p-axis is a minor or major axis, we have prolate or oblate spheroidal coordi- 
nates, respectively. 
(2) We take the z-axis as the common principal axis of the parabolas. 
(3) Where the line passing through two fixed points is the p-axis. 
(4) Where the line passing through two fixed points is the z-axis. 


(vi) Ellipsoidal Coordinates (A, u,v) (Fig. 11). Among the family of confocal quadrics 


x? y z? 


+ + =] (a»b»c»0) 
a+p bin c*+p ( 
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there are three values of p for which the surface passes through a given point P (x,y,z). Denote 
the three values of p by A, p, v, where A> — c?» p> — b?» v» — a?. The surfaces corresponding 
to p=A, p= p, and p — v are an ellipsoid, a hyperboloid of one sheet, and a hyperboloid of two 
sheets, respectively. We have 





uc M e. iz. A. 
(a?— b2)(a? — c?) > y (b?— c?)(b? a?) d 
h 
zie Oen h(a) (A a?) + a2)(» + a), 
_ VA=HaA-») ` Vix-—sltn-A 
Se 20) ^ P Zei u) 
ge VEDON e) VG ace PY e. 





4. Differential Geometry 


(I) Classical Differential Geometry (— 111 Differential Geometry of Curves and Surfaces) 
(1) Plane Curves (Fig. 12). At a point P (xo,yg) on a curve y = f(x), the equation of the tangent 
line is y — Yo= f'(xoK(x — xo), 

PT- |voyl tye Zeil 


and the tangential shadow TM = y,/ yo. The equation of the normal line is f'(xo( y — yg) +(x — 
Xo) = 0, 


PN=|yoVl+ye |, 


and the normal shadow MN = yo yo. The slope of the tangent is tana = f'(xo) — yo. 1 he curvature 
at P is 


e 1/ PQ f" (xg)/ [3 f'(x) 
The coordinates of the center of curvature Q are 


(xo FG) [ 1 Tell ]/f" Gi). Fo) ++ Gu I/F" Gel) 


3/2 
] 





Fig. 12 


(2) Space Curves x; — x, (1) (i= 1,2,3), or x=x(t). The line element of a curve x=x(f) is 


ds e ider (dx) + (x3) Al y xl dt (= 4). 


a=! 


The curvature is 


k= > X2- 5/58? 


For t^ s (arc length), the curvature is VG, and the torsion is 7 7 [det(x;, xy, 
Xy )as1,2,3]/ x”, where "e d/ds. When we denote Frenet's frame by ($$, $), we have =x’, 
q-£/x, $-£Xm1 (vector product). 
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The Frenet-Serret formulas are 
€ sm, n=- kitti, f =- m 


(3) Surface in 3- Dimensional Space x, = x, (u1, u2) (a = 1,2,3). The first fundamental form of the 
surface is 


Bik = Du, Ou, (j,k=1,2). g=det(g,) >0. 


a= 
Let ( g/^) be the inverse matrix of ( Sal, The tangent plane at the point x® is given by 
det (x, = xO, (dx,/ du)”, (8x, / uz) zm. 


The normal line at the point x is given by x, — x = ©, where t is a parameter, and v, is the 
unit normal vector, given by 





9 (xg, Xy) 
ub. ORAT 123 
Vy = bua; = +1 
Va g 3 (upu) ( By ~ ) 
The second fundamental form is 
u E: : dn, Ox, EE 
k = Z2 Te Gu äu, de^ E | Ou, au = et (Ax). 
The principal radii of curvature R,, R, are the roots of the quadratic equation 
l i ] h 
wi > Ga RR + cO 
j,k 


The mean curvature (or Germain’s curvature) is 
II) 
Hz; (x* CS x)=? T2 gh 
27 k 


and H =0 is the condition for the given surface to be a minimal surface. The Gaussian curvature 
(or total curvature) is 





and K=0 is the condition for the surface to be developable. 
We use the notations of Riemannian geometry, with gj, the fundamental tensor: 


d Xa Ki d 9x, h G T 
Qu; Ou, Es E up,  (Gauss’s formula). 


Ry hyha — hyh, (Gauss’s equation). ` bz Ah, ~(Codazzi-Mainardi equation). 


MERERI TERI) 
MOULE 22; m eeu 


ax 2 hgg” Gs (Weingarten's formula). 





The third fundamental form is given by 


Qr, Wy » o 
= X g hihi, =2 Hh; — Kix. det (J, )= K g= Kh. 
Jt 





Qu; Ou, 


(4) Geodesic Curvature. Let C:u;=u,(s) be a curve on a surface S and p be the curvature 
of C at a point P. Let 8 be the angle between the osculating plane of C and the plane tangent 
to S. The geodesic curvature p, of C at P is given by 


du; du j ) dvj du, 
Pp = pcosó = V g el? EE 
Js 


im i,2 
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=Q is the condition for C to be a geodesic. Let D be a simply connected domain on the 
surface S, whose boundary T consists of n smooth curves. Let 0, be the outer angle at the 
intersection of two consecutive curves (a = 1, ..., n). Then we have the Gauss-Bonnet formula: 


Lie" ff Kas= 2r- Y 6, 


a=1 


(II) Riemannian Geometry, Tensor Calculus  (— 417 Tensor Calculus) 


In the present section, we use Einstein’s convention (omission of the summation symbol). 
(1) Numerical Tensor. 


1 (=k) 


Kronecker’s ô ô, ok, a= { Gab) 
J š 


D — ({i} US). 
GH = det (8 K R =i +1 ((J,) 9 (4) and (j,) is an even permutation of (k,)), 
—1 ({i,}={%,}) and (,) is an odd permutation of (k,). 


5 , d E l...n Ji = d+ Sn 
Eddington’se e Aë, eh dem Olt, 
~ dplpaii da = 18ih. u = gii- hel ai Her „glg gh 
ô; hber (n— p)! GN ee det (4), 1 eng: aj... = Ej, 0105... d 
P n 


. (2) Fundamental Objects in Riemannian Geometry. Let gj, be the fundamental tensor, and ( g^) 
be the inverse matrix of (g). We put g=det(g,,). 
The Christoffel symbol is 


i l ja Za; Zak Og i a ð log V g 
aiei) (aa 
ax* Al dx ax * 


which has the transformation rule 


PEE {i | 
e Eger: dee D dE 
jk] Ox" \ agi azk VS 9x/ox* 


under a coordinate transformation. 

A geometrical object I, with a similar transformation rule is called the coefficient of the 
affine connection. The torsion tensor is 
The equation of a geodesic is 
d?xi i dx! dxf — 
ds? jk] ds ds 

The covariant derivative of a tensor of weight W with respect to a coefficient of affine connec- 
tion T5, is given by 





As = E E Ps Jp- me Jp R= ie Jp a es Vie, TA 
EC D al* 
Ma dM ` aq Eb eem = kpyssk, aak eas Ky "N kick 


For the Christoffel symbol, we denote the covariant derivative by ;/. Then we have the following 
formulas: 


gii = 9, g*., = 0, ó]., — 0, Ee PS =0, (1/./g)e~ ^ = 


For a scalar f grad f —(f.j), 
for a covariant vector v; rotu = (V; —v,,;)=(@v,/dx* — dv, /0x?), 


1 O( gv 
Je ox) ` 


and for a contravariant vector v/ divv= ni j= 





Beltrami’s differential operator of the first kind is 


A f e gH, f.,. 
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Beltrami’s differential operator of the second kind is 


| | 8(Ve g” (af/ax*)) 
A, f= div grad f= Ve RSR SE 


For a domain D with sufficiently smooth boundary T, we denote the directional derivative 
along the inner normal by d/ än, the volume element by dV, and the surface element on T by dS. 
Then we have Green's formulas, 


dp oy 
[aie +vare)ave—fusras, ` Deet Ais: f DEE 
We denote the curvature tensor with respect to the coefficients of a general affine connection 


Dj, by Bin and by Ri, when Tj, = p I We have the following formulas: 


E. IL api e 
Hm E WE TIU DL 





Rie = g; Rt) = — Rig =R er a 
ijkl = Bai kl gk "H l| axtax! ` axiax* axax!  gx/ox* 


ell Mä 


Bianchi's first identity ier + Rig + Bis 70, 
~ (Bia + Bey + Bir )=2( Sau + Sars + Sie ) * (Sa Sat Sia SG + SaS ); 


AH 9^8, 07g, " 97g 9?g, | 


Bianchi’s second identity Bi + Rims + Rimk:1= 9, 


Bue + Bjim + Bim = —2( Bima Sit + Biha Sin + Bia Sak ); 


D “9 E i Ss 3 
Ricci's tensor Kaz — Rig = Ry; 
Lupa 
scalar curvature Rei 
Ricci’s formula Pho% — T h 
kk] Ke EJds 
p 
-2n Ti eh- Ge “Sp Bor + Ti Jp Be .4+2T 4, seit WTI BA 
Au 22 ki sede septo met ba KN bake SP 


where S and B are the torsion and curvature tensors given above, respectively, and W is the 
weight of the tensor T. 

(3) Special Riemannian Spaces (— 364 Riemannian Manifolds). In the present Section (3), n 
means the dimension of the space. 

(i) Space of Constant Curvature Ri = p(85,—8.5)); p-R/n(n—1), 

(ii) Einstein Space Ry, = pg;, p — R/n, for n> 3, where R is a constant. 

(iii) Locally Symmetric Riemannian Space Bi, zc. 

(iv) Projectively Flat divi Weyl’s projective curvature tensor is defined by 


Wiz = Rig EST | “(Rud — Rj,dj). 


The condition for the space to be projectively flat is given by Wi, —0, Ry — Ry. 
If n>3, the latter condition follows from the former condition, and the space reduces to a 
space of constant curvature. If n=2, the former condition W=0 always holds. 
R 
(v) Concircularly Flat Space Zi, 2 Rig + "a nin 1j 5^ ôf — g$) ^0. This space reduces to a 
space of constant curvature. 
(vi) Rep Flat Space. Weyl’s conformal curvature tensor is defined by 
i) R(r — 81%) 
Cu = Riu Tec — Rôi — Rað +g Rj— gy Ri) — res 
_ SR Rgy, 
T,=- E 
(n—2) TE 1)(n—2) 
The condition for the space to be conformally flat is given by Cj, —0, Mra = IT;.,. 
If n>4, the latter condition follows from the former condition, and if n 23, the former condi- 
tion C —0 always holds. 
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5. Lie Algebras, Symmetric Riemannian Spaces, and 
Singularities 


(1) The Classification of Complex Simple Lie Algebras and Compact Real Simple Lie Algebras 
(— 248 Lie Algebras) 


(1) Lie Algebra. The unitary restriction of a noncommutative finite-dimensional complex simple 
Lie algebra g is a compact real simple Lie algebra g„, and g is given by the complexification gf 

of g,. There exists a bijective correspondence between the classifications of these two kinds of 

Lie algebras. Using Dynkin diagrams, the classification is done as in Fig. 14 (— 248 Lie Algebras). 
The system of fundamental roots (a,, ...,,} of a simple Lie algebra g is in one-to-one corre- 
spondence with the vertices of a Dynkin diagram shown by simple circles in Fig. 14. The number 
of simple circles coincides with the rank! of g. The double circle in Fig. 14 means —1 times the 
highest root 0. Sometimes we mean by the term “Dynkin diagram" the diagram without the 
double circle and the lines issuing from it. Here we call the diagram with double circle representing 
— 0 the extended Dynkin diagram. Corresponding to the value of the inner product with respect 
to the Killing form — 2(a;, «,)/(«;, od (155) (which must be 0, 1, 2, or 3), we connect two vertices 
representing a; and a; as in Fig. 13. When the value is 0, we do not connect a and o In Fig. 13, 
the left circle corresponds to a; and the right circle to a;. 





0 o o 
1 0—-0 
— 2(a,,2;)/(a;,;) n 
2 o<=0 
3 oco 
Fig. 13 
—0 
©) Es 
A g . Q Q -———- Q O o 
(21) 1 1 1 1 l 1 d 
1 2 3 4 3 2 
E; O O O D O e (O) 
B, 1 2 2.2 8 ki 
(122) EE 92 
©) 
— 8 
2 3 4 5 6 4 2 
Es ©) O D O D D O O 
—0 
C, 6 2 2 72 
(9—0——0——0— - --- —0——0«—0 
(123) fer? 
2 3 4 2 
F © O O—»0 O 
— 0 


H 1 
np 92 2 2 23 d 
> O oO O—---- 2 3 
SS B e 
—B 1 —6 


Fig. 14 We have relations B, =C, = A,, C; = B,, and D, — A3. (D, = A, + A,, which is not simple.) In this figure, 
the number at each vertex means the coefficient m, in 0 = 22 m,q,. 


From Fig. 14, we have the following information. 
(i) The quotient group of the automorphism group A (g) of g with respect to the inner automor- 
phism group J (g) is isomorphic to the automorphism group of the corresponding Dynkin dia- 
gram. The order of the latter group is 2 for A, (/ > 2) since the diagram is symmetric. It is also 
2 for D, (1 > 5) and for E,, and it is 6 (=3!) for D4. For all other cases, the order is I. 
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(ii) The order of the center of the simply connected Lie group associated with g is equal to the 
index of the subgroup consisting of elements stabilizing — 0 in the group of automorphisms of 
the extended Dynkin diagram of g (S. Murakami). This index is equal to the order of the 
fundamental group of the adjoint group of g and the number of connected Lie groups, whose 
Lie algebra is g. 
(iii) Any parabolic Lie subalgebra of g is isomorphic to a subalgebra generated by the root vector 
X, (and elements of the Cartan subalgebra) such that a = En;a;, where (a,, ...,a,} is a system 
of fundamental roots, n; 2 0 (i— 1, ...,7) or n; «0 (i=1,...,/), and n; —0 for a; belonging to a 
fixed subset S of (a, ...,aj). 

Hence, isomorphism classes of parabolic Lie subalgebras are in one-to-one correspondence 
with the set of subsets S of (a, ...,aj). l 
(iv) Maximal Lie subalgebra f of g with the same rank / as g. The Lie subalgebra f is classified 
by the following rule. First we remove a vertex a; from the Dynkin diagram. If the number 7, 
attached to the vertex is 1, f is given by the product of the simple Lie algebra corresponding to 
the Dynkin diagram after removing the vertex a; and a one-dimensional Lie subalgebra. If m,» 1, 
f is given by the diagram after removing a; from the extended Dynkin diagram. 


(2) Lie Groups. The classical complex simple Lie groups of rank n represented by A, B, C, D (in 
Cartan’s symbolism) are the complex special linear group SL(n+ 1, C), the complex special 
orthogonal group SOQn + 1, C), the complex symplectic group Sp(n, C), and the complex special 
orthogonal group SO(2n, C), respectively. The classical compact simple Lie groups of rank n 
represented by A, B, C, D are the special unitary group SU (n + 1), the special orthogonal group 
SOQn + 1), the unitary-symplectic group Sp(n), and the special orthogonal group SO(2n), respec- 
tively (— 60 Classical Groups). 


Complex Compact Dimension 
Form Form 



















Cartan’s 






















Symbol 
A, SL(n+1,©) SU(n4 1) (nt 1-1 n 
B, SO(2n41,C) SO (2n 1) 2n? n n 
C, Sp(n,C) Sp(n) 2n? n n 
D, SO(2n,C) SO(2n) 2n?—n n 
G; Aut (^ Aut 14 2 
F, : Aut sr Aut% 52 4 
E, 78 6 
E, 133 7 
Eg 248 8 





Here G is the Cayley algebra over R, G* is the complexification of ©, & is the Jordan algebra 
of Hermitian matrices of order 3 over ©, X° is the complexification of X, and AutA is the 
automorphism group of A. 


(II) Classification of Noncompact Real Simple Lie Algebras 


Classical Cases 






Maximal Compact 
Lie Algebra of o 






Cartan's Symbol 





Noncompact Real 
Simple Lie Algebra g 










zl p + 1; RI &u(p +1) 

AII &l(n; H) 8p(n) 
AIII $u(p,q; C) Su p) + 8u(q) 

BI 80( p.a RI 8o(p)t$0(qQ) (ptq=2m+1l) 
BII $0(1,n — 1; R) 8o(n—1) (n=2m+1) 

CI $p(p; R) u(p) 

CII u(p,q; H) 8p(p) + sp(q) 

DI Sot p, q; R) 80(p)+80(q) (p+q=2m) 

80(1,n—1;R) 8o(n—1) (n=2m) 






Sot p; H) u(2p) 
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Here the field F is the real field R, the complex field C, or the quaternion field H (RCC CH). 
H is an algebra over R. For a quaternion x= xo xii + xj  x4k(xg, xy, x3, x4 € R), we put 


Ass X9— Xii — Xj — xk, 
x* = xg xii — x5 j + xgk. 


Then gl (n; F)= (set of all square matrices over F of order n), 


sin F)2 (4 €al(n; F)|trA =0}, 
$o(p,q; F) - (A € gl (p qi F)|'A* 1, q+ 1,,A =0}, 


where J, , is the symmetric transformation of the Euclidean space R?* 7 with respect to R?, i.e., 


AI 0-——0——0— ---- —0-——0 EI were 
AII $——0——9— ----—0——9 
PN OR EI H ARN 
Age | P uM 
EI 
CoD | 
0——0------ 
EN d aaa 
BI 0——0—----—0——9— ---- —9—79 
BII 0——9—-9— ---- —9—9—59 EV o—o—o i o—o 
CI 0——0— ----—0——0«—0 
EM 
CIT 
$— —90—-9— ---- —0——e«—o 
EVI ad Nn 
0—0— ee 
EVI RE Man 
DI e 
EIX 
DII re FI 0—o0—0o——o 
$—-—0——e-— ---- FII e—e—25e—-—o 


DI 
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I,,q is the diagonal sum of the unit matrix J, of order p and — I,. We have 
80(n; F)=80(n,0; F), 
u(p,q; F)- [A &al(p a; F)| ML q+ 1, al, 
u(n; F)=u(n,0; F), 
8b(n; F)- (4 Egl(2n; F)|‘AJ+JA=0}, 


where J is the matrix of an alternating form Z7. (x; y;,,, — Xi+n Yi) Of order 2n. 

A noncompact real simple Lie algebra g is classified by the relation of the complex 
conjugation operator o with respect to the complexification o" of g. The results are given by 
Satake's diagram (Fig. 15). 

In the diagram, the fundamental root corresponding to a black circle is multiplied by — 1 
under o for a suitable choice of Cartan subalgebra, and the arc with an arrow means that two 
elements corresponding to both ends of the arc are mutually transformed by a specia! transfor- 
mation p such that o — pw (wE W). 


(III) Classification of Irreducible Symmetric Riemannian Spaces (— 412 Symmetric Riemannian 
Spaces and Real Forms) 


A simply connected irreducible symmetric Riemannian space M — G/ K is either a space in the 
following table or a simply connected compact simple Lie group mentioned in (I). The noncom- 
pact forms uniquely corresponding to the compact symmetric Riemannian space are in one-to- 
one correspondence with the noncompact real simple Lie algebras mentioned in (II). 





Cartan's Rank 
























Symbol 
AI SU(n)/SO(n) (n»2) (n~1)(n+2)/2 
All SUQn)/Sp(n) (n>) (n—1)(2n+ 1) n-] 
AIII U(p+q)/U(p)X U(q) (p>q>1) q 
BDI SO(p+q)/SO(p)XSO(q) (p>4q>2,p+q#4) q 
BDII SO(n+1)/SO(n) (n>2) l 
DIII SOQD/U(l) (124) U/2] 
CI Sp(n)/U(n) (n>3) n 
CH Sp(p*ta)/Sp(p)xSp(q) (p 29> 1) q 
EI E«/ Sp(4) 6 
EII E,/ SUQ)-SU(6) 4 
EI E,/ Spin(10)- SO (2) 2 
EIV E/F, 2 
EV E;,/ SU(8) 7 
EVI E,/Spin(12)- SU(2) 4 
EVII E,[Eg: SOU 3 
EVIII E,/Spin(16) 8 
EIX E,/E,: SU() 4 
FI F,/Sp(3): SUQ) 4 
FII F,/Spin(9) l 
G G,/S0(4) 2 
Notes 


The group G=U(p +q) in AIMI is not effective, unless it is replaced by SU (p+ q). To be 
precise, K — Sp(4) in EI should be replaced by its quotient group factored by a subgroup of order 
2 of its center. K in EII is not a direct product of simple groups; the order of its fundamental 
group z,(K) is 2. To be precise, K in EV or EVIII should be replaced by its quotient group 
factored by a subgroup of order 2 of its center. The K's in EI, EUL EVI, EVII, EIX, and FI are 
not direct products. The fundamental group z,(K) of K is the infinite cyclic group Z for EII, 
EVII; for all other cases, the order of z,(K) is 2. 

In EII, EVII, the groups Es, E; are adjoint groups of compact simple Lie algebras. In other 
cases, Es and E; (Eg, F4 and G, also) are simply connected Lie groups. 

The compact symmetric Riemannian space M is a complex Grassmann manifold for AIII, 

a real Grassmann manifold for BDI, a sphere for BDII, a quaternion Grassmann manifold for 
CII, and a Cayley projective plane for FII. 
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(IV) Isomorphic Relations among Classical Lie Algebras 


The isomorphic relations among the classical Lie algebras over R or C are all given in the 
following table. In the table, we denote, for example, the real form of type AI of the complex Lie 
algebra with rank 3 by A,I in Cartan's symbolism. When there are nonisomorphic real forms of 
the same type and same rank (e.g., in the case of D4I) we distinguish them by the rank of the 
corresponding symmetric Riemannian space and denote them by, e.g., D4I,, where p is the index 
of total isotropy of the sesquilinear form which is invariant under the corresponding Lie algebra. 


Cartan's Symbol 











Isomorphisms among Classical Lie Algebras 


A=B =C; 812, C) = $0(3,C) « 8p (1, C); 8u (2) = 30(3) = sp(1) 
B,=C, 80(5, C) «8p 2, C); 80(5) = 8p (2) 
A; = D; SI(4, C) = 80 (6, C); 85u (4) = 80 (6) 
A l=AH=BI=CI $IQ, R)= su (1, 1; C) z80Q, 1; R) =8p(1; R) 
BjL; = CI 80(3,2; R)& 8p (2, R) 
Bl 2 CjIlI 80(4, 1; R)&u(1, 1; H) 
Als DÄ 8L (4, Biz 80 (3,3; R) 
A3II = Djl, $L(2, H) = 80(5, 1; R) 


AIi, = DjL 


81 (2,2; C) = 80(4, 2; R) 
8u (3,1; C) z 80(3; H) 


D4b DAD 80(6,2; Biz 80(4, H) 

D;-A,XA,* 80(4, C) 812, C) x 81 (2, C); 80 (4)  8u (2) x 8u (2) 
Djl; - AJIXA;I 80(2,2; R) = 8L, R) x 8L(2; R) 
DII 2 A, X Al soi: H)e ŝu (2) X 8L; R) 






Dol, 2 A;* 803, 1; R)& 8L 2, C) 








Note 

(*) In these 3 cases, there are isomorphisms given by the replacement of 8[(2, C) 
or $1 (2) by isomorphic Lie algebras of type B, or type C, due to the isomorphism 
A, zx B, = C.. 


(V) Lists of Normal Forms of Singularities with Modulus Number m=0, 1, and2 (— 418 Theory 
of Singularities) 


Letters A, ..., Z stand here for stable equivalence classes of function germs (or families of function 


germs). 


(1) Simple Singularities (m — 0). There are 2 infinite series A, D, and 3 “exceptional” singularities E,, 
E,, Es: 









Notation Restrictions 











A, rth +y? +z? n>1 
D, x" 14 xy? +z? n>4 
Es x* +y? 42? 

xy +y? +z? 

x+y? +2? 





(2) Unimodular Singularities (m= 1). There are 3 families of parabolic singularities, one series of 
hyperbolic singularities (with 3 subscripts), and 14 families of exceptional singularities. 


The parabolic singularities 













Restrictions 
a? +270 
at— 6420 
a® —432 #0 





Normal form 
x? +y? +z +axyz 
x* + y* +z? +axyz 
xÉ -- y? +z? c axyz 


Notation 





X, — E, 
Jom Es 
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The hyperbolic singularities 
Notation | Normal form Restrictions 
1 1 1 
Tu x? + y* -- z' +axyz m ou 





The 14 exceptional families 





























































Gabrielov | Dolgachév Gabrielov | Dolgachév 

Notation | Normal form numbers | numbers | Notation | Normal form numbers numbers 
Kj; | xit+y?+z27+axy? 237 237 H. x* + xy* +274 ay® 256 344 
Kis x5 + xy5 +2? c ay? 238 245 Qio x*«Wy*-yzaxy) |334 239 
Kja x? +y? 427+ axy® 239 334 Qi x34 y? z -F xz? -F az? 335 247 
Zii x3 y y) zt axy* 245 238 Qi; xi+y54+y227+axy* 1336 336 
Zi x?’ y+xyt +z? ax 1246 246 Sis x*4y?z4+x274ax9z | 344 256 

x? y+ y 4274 axy? 247 335 x?y+y?z+xz?+az5 345 345 

x*- y5 zi ax?y) 255 255 x? +y? +zt+axyz? |444 444 





(3) Bimodular Singularities (m — 2). There are 8 infinite series and 14 exceptional families. In all the 
formulas, a — a, +a, y. 


The 8 infinite series of bimodular singularities 




















Notation Normal form 



















Restriction Milnor number 
























Jo x? bx? y? E y?  z? E exy? 4b? +2740 
d x? +x? y3 +z? ay?*? p>0, gu 0 16+p 
Zi. y(x? * dx^y? c exy? + y?) z? 4d? 27 40 15 
Zus y(x? +x? y? + ay®*?) +z? p>0, a) #0 15+p 
W,.o x* rax? y! +yf zz ag#4 15 
Wi,» x4 +x? y! Hayt? + 2? p>0, a) #0 ERT 
Wii (x? + yy + axy*** +z? q>0,a,49 (EE KEEN 
Wi (x? +y?) ax? y 4 z? q>0, dy #0 15+2q 
Q>.0 x? + yz? c ax? y? e xy* DEET 14 
OË x3 + yz? x? y! 4+ az8*7 p»0,ay Z0 144p 
Sio x? z 4 yz? + y? azy? a2 #4 14 
Si,» x?2z+yz74+x7y?+ay>t? p>0, a)#0 14+p 
SÉ 24-1 x?z +yz? zy? +axy?t4 q70,a, 0 144-29 —1 
S¥ 24 X^z +yz? 4 zy? ax? y?" q>0, ag #0 14+2q 
Us X xz xy). aylz ag(a 4-1) z0 14 

x? E xz? xy? ray! +z? g>0, ag #0 

XM xz? xy) 4 ay?**z q>0, a9 #0 












Normal form Notation Normal form 












x? 4 y!04 22 taxy? x*- y! +z? c ax y 

Eis x3 xy! +z? c ay!! x? +yz? +y +z? +axy’ 

E30 x? + yt! +2? + axy8 x? +yz? + xy5 +z? +ay? 

Zia x?’ y +y? - z? +axy® XM yz? +y? +2?  axy$ 

Zá x! y xy8 +z? tor x^z yz? 4 xy* 4 z? -- ay$ 
Zio xy E y? +z? c axy! x?z t yz? + yf +z’ t azy* 
W, x* 4 xy? +z ay? x? + xz? +y? +27 rax? y? 
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Adjacency relations between simple and simply elliptic singularities 








sc Si; Uu 12 T238 Zia 27 Wu T ase 
| 
Qu T336 Su T 345 T 444 T 237 Zu T 246 T 255 
| | 
Quo Tan T 344 J1o7 Eg i 
E c ' 
T3534 X= E; 
| 
P= E, 
References 
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183-219. 

[3] K. Saito, Einfach elliptische Singularitáten, Inventiones Math., 23 (1974), 289—325. 


6. Topology 


(1) ^-Cobordism Groups of Homotopy Spheres and Groups of Differentiable Structures on 
Combinatorial Spheres 


(1) The Structure of the h- Cobordism Group 6, of n-Dimensional Homotopy Spheres. In the 
following table, values of 8, have the following meanings: 0 means that the group consists only 
of the identity element, an integer 7 means that the group is isomorphic to the cyclic group of 
order /, 2' means that the group is the direct sum of / groups of order 2, + means the direct sum, 
and ? means that the structure of the group is unknown. 





n 123 45 6 7 8 9 10 11 12 #13 14 15 16 17 18 
2 or 2* or 
062 00? 00 0 28 2 4+2 6 992 0 3 2 812842 2 4+? 8+2 





(2) The Group T, of Differentiable Structures on the n-Dimensional Combinatorial Sphere. 


I0. (n+3), D,20. 


(II) Adem's Formula Concerning Steenrod Operators Sq and 2 (— 64 Cohomology Operations) 


For the cohomology operators Sq and Y, we have 


a b ga b c 1 atb-c c 
Sq*Sq’= ) ES Sq Sq’ (a<2b). 
c=0 m 
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Wa ane((b-e(p— D-1 


c-0 


ee (a « pb), 


(b—c)(p 
a—p 


i [2/p] x —1) EE 
POP’ = Y (—1y** g ôP P 
c=0 


jo pes < SECH Jena (uen) 
Several simple cases of the formula above are as follows. 
Sq! Sq? =Sq2""*?, Sq! Soin") «0, 
Sq! Sq" = Sq*"*? + Sq"?! Sq!, Sq! Sq"! = Sq*"*?Sq', 
Sq1Sq"*? 2 Sq"** Sq!, Sq? Sq^^*3 Sir"? + Sq'^* *Sq!, 
Sot Sq?" = Sq9"* ^ + Sq9"* 3 Sq! + Sq®"*?Sq?, Sq*Sq8"*! — Sq9* Sq! + Sq?" 3 Sq?, 
Sot Sg"? = Soin" SQ, Sq*Sq8**3 = Sq?"+5 go), 
So Sq?'** — Sq?" Sq! + Sq? Sai), Sq* Sq*"* 5 = Sq9"*? + Sq9** Sq! + Suën" "gei, 
So Sq9"* = Sq9"*19 + Song Sq? Sot Sq?" 7 = Sq * 1 + Sq? géi, 


2g" — (n Ny", 
PEP =n PP gn, 


(II) Cohomology Ring H*(7,n;Z,) of Eilenberg-MacLane Complex (— 70 Complexes) 


Z means the set of integers, and Z,=Z/pZ, where p is a prime number. 

(1) The case p 22, 7=Z or Zy (f > 1). The degree of a finite sequence 7 =(i,,i,_,,...,7,) of 
positive integers is defined by d(I)=i +i +... rt If such a sequence satisfies i, ,, > 2i, (k= 
1,...,7—1), it is called admissible, and we define its excess by 


e(1)=(i,-2i,_,) +... +(i,~27r,) +i, 22i, - d(I). 


Further, we put Sq! = Sq* Sq... Sq". Then we have H*(Z,;, Z,)=Z,[Sq’u,|I is admissible, 
e(1)<n], H*(Z, n; Z;) - Z;[Sq!u,|I is admissible, e(1) <n, i, > 1]. 

Here, u,¢ H"(1, n; Z;) is the fundamental cohomology class. I = @ (empty) is also admissible, 
and for this case we put n(I) — e(1) —0, Sq! — 1. Due to the Künneth theorem, we have 


H*(n 7,n;Z,)) = H*(0,n;Z,)8 H*(2',n;Z,) 


if 7 is finitely generated. In particular, we have 


H*(Z,,1;Z,)=Zl uj], 
H*(Zp,2; Z3) = al up, Sq'uy, Sq?Sq'up, ..., Sq? Sq" ! ... Sq'u,... |, 
H*(25,3; Z2) = Zo| u3, Sq" Sq? 7! ...$9u,, Sg€ * Y" Sg * ^t, 
Sq? * sq" -!...Sglu|r20, s20]. 


(2) The case p #2, 7=Z or Z,; (f > 1). We define the degree of a finite sequence I= (i, i, i, 
41, ig) of nonnegative integers by d(/)=i,+ ... +i, + ig. The sequence J is called admissible 
if it satisfies the following conditions: 


i, =2A,(p—-1)+e, (Ay is a nonnegative integer, e =O or 1 (0< k < r)), and 
ig 0 Or l, i > 2p-—2, (Au > Pik (1€kx«r-—1). 


We define its excess by eiis pi, — (p — 1)d (I). Further, we put 9" =5%9% ,,,6*1996*o, and 
assume that u, € H"(m,n; Z) is the fundamental cohomology class. Then we have 


H*(Z,7,n; Z,)= ZAP! u, |I is admissible, e(I) « n(p — 1), n+d(J) is even] 
& Aa (P u,|I is admissible, e(I) « n(p— 1), n+ d(I) is odd). 


H*(Z,n;Z) is given by the above formula when the admissible sequence is J with ip=0. 
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(IV) Cohomology Ring of Compact Connected Lie Groups  (— 427 Topology of Lie Groups and 
Homogeneous Spaces) 


(1) General Remarks. Let G be a compact connected Lie group with rank | and dimension n. 
We have H*(G; R)= Ag(x,, ..., xj, where Ak(x,, ..., xj means the exterior algebra over K of a 
linear space V 2 Kx, +... + Kx, with the basis {x,,...,x,} over K. We define a new degree in 
Ak, --., Xj) by putting deg x, =m; (m, is odd) (1 & ix I), where m, - ...-- m, — n. The = means 
isomorphism as graded rings. 

(2) Classical Compact Simple Lie Groups. We set deg x; =i. 


H*(U (n); R)s Ag(x, Xs, ....X24 1); 
H*(SU (n); R)& Ag(xs.xs, ..., X25 1). 
H*(Sp(n); R) = pl ran, Eäs 21); 
H*(SO (n); Z;) «(Having xj, x5, ...,x,.., as a simple system of generators) 

= Zl x), x3, sakao l= 1...) 

(n= [n/2], s(i) is the least integer satisfying 20(2i — 1) 2 n) 

H*(SO (2n); K) = A x(%3)X7 -s Kan ze X221), 
H*(SO(Qn-1); K)& A x(x3,X7, ....X44-5), Where K isa commutative field whose 


characteristic is not 2. 


For SO(n), Sorten For SU(m), p*Gu-0-(*. ucieno-i 


For Sp(n), as CT al Gol A DCH Poe 


(3) Exceptional Compact Simple Lie Groups. n and m; (1 <i « /) given in (1) are as follows. 
G5: n=14, m=3, 1l. 
F: n=52, m=3, IL 15, 23. 
Eg: n=78, m=3, 9, 11, 15, 17, 23. 
Ez: n=133, m;=3, 11, 15, 19, 23, 27, 35. 
Eg: n=248, m,=3, 15, 23, 27, 35, 39, 47, S9. 


(4) p-Torsion Groups of Exceptional Groups. The p-torsion groups of exceptional Lie groups 
are unit groups except when p=2 for G,; p=2, 3 for F4, Ee Ez; and p 22, 3, 5 for Eg. The 
cohomology ring of Z, as a coefficient group in these exceptional cases is given as follows. Here 
we put deg x; — i. 


H*(65:25) = Zl x,l/(x$)8 Az (Sq)4); 
H*(F4;Z)= Z,|x3]/(x3)@ ECKE 
H*(F Z5) =Zs[ 89x, ]/((8P x3) )@Az,( x3, 923, 9); 
H*(E¢;Z,)= Zi xl/(x$)8 Az,(Sa^xs. Sq^Sq^x4, x ys, Sq*Sq^Sq'x,, Sq*x,,), 
H*( Es; Z3) = Z,[ 89!x, LE 95) Aa Xs 9 x3, xs, xy 9 x, x17); 
H*( E7; 25) al x3, Sq7x3,Sq*Sq7x3 | /(x$.( Sq?x3)°, (Sq4Sq?x3)’) 
Az (xis Sq*°Sq48q7x3, Sq*x15,Sq*Sq*x5), 
H*(E4;24) 2 Z4 [ ô 9'x, /((8 GN 8 Az (Xs, 81x Xy, 95344, 95 1x4, x25, x35); 
H*( Eg} Z5) 5 L| x4, Sq^x3, Sa Sg xa us | / (x35; (SPx), (Sq*Sq?x3)",x4s) 
G Az ( Sq’Sq4Sq?x3, Sq*x 15, Sq*Sq*x s, Sq^Sq^ Sq*x s), 
H*Eg Zs) = Z5 [ 89x58 I3 65] / (89), (899935) 
B Az (x3, 9 1x3, x15, 99 9 oc, PX 1 45, x39, X47), 


5 
H*( Eg; Zs) =Z,| 8 P'x,]/((8 P!x3) Je AX» qx os PX i5, X27 X355 X39, X27). 
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(V) Cohomology Rings of Classifying Spaces (— 56 Characteristic Classes, 427 Topology of Lie 
Groups and Homogeneous Spaces C) 


(1) Let H*(G; K) Axa, X2, ..., x,). Then the deg x; are odd and the x; may be assumed to be 
transgressive. y; being its image, the following formula holds: 


H*(BG; K) 2 K[y,, yao. -.-. Ya] (Borel’s theorem). 


(2 H*(BU(n)-H*(BGL(n, C)) 2 Z[ci. c5, .... €p], 
H*(BSU (n)) = H*(BSL(n, C)) 2 Z[c;, ...,c,], 
H*(BSp(n)) ^ Z[q1. 42. Al 
H*(BO(n); Kj) - H*(BGL(n, R); Z,)= K;[w;, Wo, ..., Wal; 
H*(BSO(n); K,)= H*(BSL(n, R); Z;) - K;[w5, ..., Wp), 
H*(BSO(Qm-- 1); K)= K[ P1; P2, -<-> Pal. 

H*(BSO(2m); K) 2 K[ P1; Pa, --->Pm-1> Xl- 


Here, K denotes a field of characteristic #2, and K, is the field of characteristic 2. The c; denote 
the ith Chern classes and the q; the ith symplectic Pontryagin classes, the w; the ith Stiefel- Whitney 
classes. Moreover, the p; denote the ith Pontryagin classes, and y the Euler class. Their degrees are 
given as follows: deg c; — 2i, deg q; = deg p; Ai, deg w; =i, and deg y = 2m. 


(3) Wu's Formula. Let H?(BSO(n); Z;) 2 Z;[w;, ...,w,] and H*(BU (n), Zj) = Z;[c,, Co Cn]. 
We have 


t 


Sq? c; = ( ka (co = 1). 
O<t<j t 


Here the symbol (?) denotes the binomial coefficient for a> b; ()= 1, and (2) =0 otherwise. 


f i—j+t-1 
Sq!w; ( June (wg — 1), 
O<t<j 


(VI) Homotopy Groups of Spheres (— 202 Homotopy Theory) 


Table of the (n+ k)th Homotopy Group z,.,($") of the n-Dimensional Sphere S". The table 
represents Abelian groups. 0 stands for the unit group; integer / the cyclic group of order l; oo the 
infinite cyclic group; 2' the direct sum of l groups of order 2; and + means the direct sum. 




















2| 0 es 2 2 12 2 2 3 15 2 22 1242 84422 2 
3|0 œ ER 2 12 2. 2 3 15 2 22 1242 84422 2 6 
4lo0 æ 2 ES o412 22 22 24«3 15 2 22 120-1242 84425 26 244642 
5|0 o 2 2 J 4 P 2 2 2 30 2 23 7242 504 4-2? 23 6+2 
ein o 2 2 24 o, oo 2 60 2442 23 7242 504-4 240 6 
7|0 e 2 2 24 0 ES 2 120 23 24 24+2 504 +2 0 6 
810 « 2 2 24 0 0 ` 2 E oo + 120 24 25 2442442 50442 0 6+2 
9| 0 ee 2 2 24 0 0 2 7 240 p 2^ «at 2442 504-2 “0 6 
0| 0 e 2 2 24 0 0 2 240 2 oen 1242 504 12 6 
| 0 o 2 2 24 0 0 2 240 2? EF 6+2 504 2 6+2 
12| 0 æ 2 2 24 0 0 2 240 22 23 m 6 7 F oo + 504 2 6+2 
13| 0 o 2 2 24 0 0 2 240 2 23 6 G 504 2 6 
4| 0 e 2 2 24 0 0 2 240 22 23 6 504 : E | oo +3 


23 
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Table of the (n + k)th Homotopy Group 7,, ,(.S") of the n-Dimensional Sphere S" (Continued) 





22 
1 0 0 0 0 0 0 0 0 0 
2 6 30 30 642 12-22? 12422? 132+2 22 2 
3 30 30 642 12422 1242? 13242 2 2 210 
4 | 2520+6+2 30 6+6+2 24+12+4+2? 120+ 12 4- 25 1324-25 26 2442? 9240-- 64-2 
5 642 3042 2 44-2 24422 264-2 642? 6+2 90+ 22 
6 1242 60+2 504+ 22 24 244-642 1056 -- 8 480-- 12 6 180-- 22 
7 24-4 120+ 23 24 24 24+2 26442 24 642 72423 
8 | 240424-4 120425 2! 6+24 504+ 24+2 26442 2443 1242? 1440424424 
9 1604 240+ 23 24 24 2442 26442 24 642? 144 4-2? 
10 164-2 240422 240-42 23 2442? 2644 6 5044-24 6+2? 144-642 
i 16+2 240+ 2 2 23 844-2 2644-2? 24422 24 484.22 
12 48-442 2404-2 2 24 48044-4402 264 4 25 244-25 642^ 2016-1242? 
13 164-2 4804-2 2 24 84-842 264 +2? 244-2? 44-2) 164-22 
14 842 48042 2442 24 8+8+2 2644442 240-24 4+2 16+22 
15 4+2 480+2 23 25 8+8+2 264 + 22 24 23 16+23 
mm omi 
16 242 e * 48042 24 26 2448-842 2644-22 24 24 240+ 164-2? 
oe Nath eal 
17 2+2 48042 , 23 25 84-842 26442? 24 23 1642? 
c Em 
18 2+2 480+2 22 i oo + 24 8-442 26442 24412 23 164-22 
rdv diu occa i 
19 242 48042 22 24 842? 2644-2 2442 24 16+ 22 
SSeS SS i 
20 242 480-2 22 24 8+2 oo-26442 24422 24 164+ 22 
Lm ERU 
21 242 480+2 22 24 8+2 264+2 | 2442 23 8+2? 
KEE EE 2 2 
22 242 480 4-2 22 24 842 264-2 24 ver? 4+2 
23 2+2 480+2 22 24 842 26442 24 22 23 
>24 2+2 480+2 22 24 8+2 2644-2 24 22 22 


E EE m 


Remarks 

(1) When n > k +1 (below the broken line in the table), 7, ,,(S”) is independent of 5 and is 
isomorphic with the kth stable homotopy group Gz. 

(2) Let 1, € 7,(S") be the identity on S^; n,€73(S7), rgE77(S*), 0g E7715(S%) be the Hopf 
mapping $?—S?, $7 S4, S 5 S8 (induced mapping in the homotopy class), respectively; and 
[eoms aml € Tam 1 (S7?) (m¥1,2,4) be the Whitehead product of t». These objects generate 
infinite cyclic groups which are direct factors of v, , , (S^) corresponding to the original map- 
pings. 

(3)9,,;— E"N, Das E"v,4 Gun El"og(n»l) (Eis the suspension) are the generator 
for 7,4, ,(S"), which contains the mappings. 

(4) The orders of the following compositions are 2:gs, 437 


Na Natl (n > 2), Ha H Pnt (n 2 5), On ° On+7 (n 2 16), Tn ? 541 (n=3,4), 
Vs ? 53 (n=4,5), Mn ? On 41 (n>7), On 7547 (n > 8), Vig? G43, Fy, ? Vig, 


Tia ? ln 1? 152 (n 22), V, 9 V. 439 Da el > 4), 05? 04,47? On 414 (n 2 9). 





(VII) The Homotopy Groups x,(G) of Compact Connected Lie Groups G 
Here the group G is one of the following: 


SO(n) (n>2), Spin(n) (n>3), U(n) (n>1), SU(n) (n>2), 
Sp(n) (n>1), G, F4 Eg E, Eg. 


(1) The Fundamental Group 7,(G). 


o (G=U(n)(n>1), SO(2)), 
m(G)e42 (G=SO(n)(n>3)), 
O = (for all other groups G ). 
(2) Isomorphic Relations (k > 2). 
T, (U (n)) & v, (SU (n)) (n 2 2), 
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z,(U (1) =m (SO Q)) «0. 
m, (Spin(n)) 2v, (SO (n)) (n > 3), 
m, (Spin(3)) = v, (Sp(1)) = v, (SU 2)) & v, CS?), 
m, ( Spin(4)) = v, (Spin (3))+ m, (S? ), 
m,(Spin(5)) = m, (Sp Q2)), 
m, (Spin(6)) = v, ( SU (4)). 
(3) The Homotopy Group 7,(G) (k > 2). 
n (G)=0. 
m4(G)moo (G#SO(2), U(1, SO(4), Spin(4), n4(SO(4) zoo + oo. 


242 (G=SO(4), Spin(4)), 
m4(G) =, 2 (G=Sp(n), SU(2, SO(3) SO(S) Spin(3), Spin(5)), 
0 (G=SU(n) (n>3), SO(n) (n>6), G, Fy Eg, E, Eg). 


2+2 (G=SO(4), Spin(4)), 

(G=Sp(n), SUQ) SO(3) SO(S) Spin(3), Spin(5)), 
(G=SU(n) (n>3), SO(6), Spin(6)), 

0 (G=SO(n), Spin(n)(n>7), Ga Fy Eg En Eg). 


2 
m(G)= 
o 

















m, (G), k E 6. 
Gk.6 7 8 9 10 T 12 13 14 15 
Sod): 12 2. Be Bas. dD 2 2 12-42 844-2? 2? 
A | 0 ee 0 D, 120 Dn ED. 1680 2 
S3 |0 o 0 0 0 o0 2 2. | 1080 | 2 _ 
Sp4 | 0 oe 0 0 0 oo 2 2 0 oo 
SUQ): | 12 2 2 3 15 2 2 1243 8442? 2^ 
SU3)|6 0 D 3 30 4 60 6 8442 36 
SUM) | 0 o, 24. 2 | 042 4 60 4 1680-2 — 7242 
SU) | 0 o 0 æ, 120 0 2360 4 1680 6 
SU( | 0 v 0 c 0 co | 720 2 500-2 6 
su) | 0 © 0 œ 0 20 0 o =, 500 OO ` 
SU(8) 0 co 0 o0 0 oo 0 oo 0 oo 
SO() | 0 o 0 0 120 2 2 4+2 1680 2 
SO( | 0 co 24 2 12042 4 60 4 168042 — 7242 
SO()|0 o 2 2? 8 ot2 0 2 2520-842 25 
SO(8) | Oloto 2 D 2448 c2 0 2 2520+120+8+2 27 
so) | 0 o 12 2? 8 co+2 0 2 842 00 +23 
SO(10| 0 oe 2}a+2 4 oo 12 2 8 oo +2? 
Soul 0 o 2 2 ,2. 20 2 2 8 oo +2 
SO2| 0 e 2 2 0 Late 2 2? 2444 sti 
SO(1)) 0 o 2 2 0 od 2... 2 8 co +2 
SO(QM)| 0 o 2 2 0 6o 0 10. 4 co 
SOQ5| 0 o 2 2 0 oo 0 TN a he oo 
SO(16)| 0 00 2 2 0 oo 0 0 0 L +o 
SOQ7)| 0 æ 2 2 0 oo 0 0 0 oo 
e 3 0 2 6 0 o42 0 0 168+2 2 
F, 0 0 2 2 0 ot2 0 0 2 oo 
E; 0 0 0 00 0 o 12 0 0 oo 
E; 0 0 0 0 0 oo 2 2 0 o0 
Es 0 0 0 0 0 0 0 0 0 co 
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(4) Stable Homotopy Groups. For sufficiently large n for fixed k, the homotopy groups for 
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classical compact simple Lie groups G= Sp(n), SU (n), SO (n) become stable. We denote them 
by the following notations. Here we assume k > 2. 


m (Sp) = v, (Sp(n)) (n 2(k—1)/4), 
m,(U)- m, (U(n))&m,(SU(n) (n>(k+1)/2), 
m, (0) - v,(SO(n)) (n>k+2). 


Bott periodicity theorem 
œ (k=3,7 (mod8)), 


m,(Sp)z42 (k=4,5 (mod8)), 
0  (k=0,1,2,6 (mod8)). 


œ (k=3,7 (mod8)), 
m (0)= 2 (k=0,1 (mod8)), 
0  (k22,4,5,6 (mod8)). 


œ (kzl (mod2)), 


«| 0 (k=0(mod2)). 


Metastable Homotopy Groups. 


(a,b) means the greatest common divisor of two integers a and b. 


m4, (SU (n)) ent. 


2 (neven), 


T3541 ( SU (n)) 0 (n odd) 


H 


Tan+2(SU (n)) 


W 


(n+1)!/2  (nodd). 


(24,n) (n even), 


(24,7+3)/2 (n odd). 


| (n+1)!+2 (neven, 24), 
eU) 


(n - 2)! (24, n)/48 (n even, » 4), 
(n+2)!(24,n+3)/24 (n odd). 


73:45 (SU (n)) = m, 4, (U (n+ 1)). 


EI 


[es (U D) ` (022,3 (mod4), 23), 
stil T, el UU (n+ 1))+2 (n=0,1 (mod AN). 
_{ Qn+1)! (neven), 
nep) 2(2n+1)! (n odd). 
Tan+3(Sp(n))=2. 
_]|2+2 (neven), 
COD ) (n odd). 
[Q4 n*2)*2. (neven), 
nati) (24, n - 2) (n odd). 
_ { (2n+3)1(24, n42)/12. (n even), 
sesso) (2n+3)1(24, n--2)/24 (n odd). 
Tan+7(Sp(n)) =2. 


Tan+g( Sp(n))=2+2. 
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The homotopy groups 7,,, ;(SO(n)) for n > 16,32 i» — 1 are determined by the isomorphism 
T+i(SO (n)) = Mapil O) Myint ( Vit3+ni+3 (R)) 
and the homotopy groups of V,.,, wll given below. 
(6) Homotopy Groups of Real Stiefel Manifolds H. sii O (m n)/ I, x O(n). 
LP V. eia ) = Ue S” ). 
ga E V miim ) =0 (k 2 1). 


2 (n=2s—1,m>2), 


LAT Ee R (n —25). 


Tn+k Vinenm) (K 9 1,2,3,4,5) are given in the following table. 


Ne 2 3 4 5 6 8s—-1 8s 8s+1 8s+2 Berti 8s+4 8s4+5 854+6 
0 


2 24+ 0 2 2+ oc 2 2+ oo 2 2+ oo 2 2+ oo 2 2+ oo 


Tatl 0 2 2 4 0 2 2 4 0 2 2 4 
4 22 4 22 4 2 4 2 4 2 4 2 
7,42 2- oo 2: 4- oo 2 2+0 22 4+% 2 2+% 22 4+0 2 
2 22 8 0 2 2 8 0 2 2 8 0 
2 o-1242 2 2442 22 2442 2 2444 2 2442 22 2442 
2 c-1244 2? 1242 2 24-4 23 1242 22 24+4 2 1242 
Sai 2 o?-1244 2? 12400 22 2444400 2? 12400 2? 2444400 2? 12400 
2 12-44 oo 2 12 2 244-8 2 12 2 44 48 2 12 
c -2  2?4*24 2 24 2 24 2 24 2 24 2 24 
oo +4 24 23 2 4 22 22 2 4 22 22 2 
Tn +4 4+ oo 25 22 2 8 23 2 2 8 23 2 2 
4+ oo? 24 24- o6 2 8- oc 2 2 8+ oc 22 oo 2 
44- oc 23 2 0 8 2 0 2 16 2 0 0 
2 23 0 oo 0 0 0 0 0 0 0 0 
Tns 2+24 24 24 wt+2 24 2 24 2 24 2 24 
2 25 2 seid 2 2 2 4 22 22 2 4 





(VIII) Immersion and Embedding of Projective Spaces (— 114 Differential Topology) 


We denote immersion by c, and embedding by c. P"(A) is an n-dimensional real or complex 
projective space where A=R or C, k(P"(A)j is the integer k such that P"(4)c R* and P"(A)¢ 
R*^!, and k(P"(A)) is the integer k such that P^(A)c R* and P"(A) c: Rik). 

In the table, for example, numbers 9-11 in the row k{P"(R)} for n=6 mean P5(R) c R$, 
PS(R)c R!!. 
















2 
k{P"(R)} |2 4 9—]] 9—12 lv cht un SS 
k(P"^(R) |2 3 x ug 7 & 15 l5 16 16  17—19 
k(P"(CO) |3 7 l5 n 20. -2225 MD 33 38 38-41 
k(P"(O) |3 7 8—9 15 16—17 22 22-25 31 32~33 38 38-41 














2-2 (r»s»0) 








k (P"(R)) 2n—3—2n- 1 
k (P"(R)) 2n-1 2n—-3 àn—4 2n-6 Gi 
k (P"(C)) 4n-] 4n —3 4n—2 gc 4n—2 


k(P'(O) | 4n—-1 4n—2 





App. A, Table 7 1748 
Knot Theory 


7. Knot Theory (— 235 Knot Theory) 


Let k be a projection on a plane of a knot K. We color the domains separated by k, white and 
black alternatively. The outermost (unbounded) domain determined by k is colored white. In 
Fig. 16, hatching means black. Take a point (a black point in Fig. 16) in each black domain. The 
self-intersections of k are represented by white points (Fig. 16). Through each white point we 
draw a line segment connecting the black points in the black regions meeting at the white point. 
In Fig. 16, we show this as a broken line. We assign the signature + if the torsion of K at the 
intersection of k has the orientation of a right-hand screw (as in Fig. 17, left), and the signature — 
if the orientation is opposite (as in Fig. 17, right). The picture of the line segments with signatures 
is called the graph corresponding to the projection & of the knot K. Given such a graph, we can 
reconstruct the original knot K. 





Fig. 17 


Fig. 18 shows the classification table of knots for which the numbers of intersections of k are 
3 to 8 when we minimize the intersections. The projection of k is described by a solid line, and 
the graph by broken lines. We omit the signatures since for each graph from 3, to 8,4 they are 
all + or all —. Such knots are called alternating knots. 


8. Inequalities (— 88 Convex Analysis, 211 Inequalities) 


(1) [a+] « |a| * |b], 


la - b| > ||a| — |I]. 
For real a,, we have Da? » 0, and the equality holds only if all a, 20. 
(2) n!«n"c(n!y (n23) 
e" »n"/n!. 
nV^ <33 (n 43). 
(3) 2/-«(sinx)/x«1 (0«x«m/2) (Jordan’s inequality). 
(4) Denote the elementary symmetric polynomials of positive numbers a, ...,a, >0 by S, 
(r1, ...,n). Then 


s/ [s / GO] GUTE 


If at least one equality holds, then a, = ... — a,. In particular, from the two external terms, we 
have the following inequalities concerning mean values: 


P 8 l/n » 1 
EE Saf So 
v=] pel pel H 


For weighted means, we have 


S xs Io (SA=1, X20) 


pel p=] 


(5) When a,>0, 5,50, p>1, q» 1, (1/p) * (1/4) l, 


P VPe n 1/4 i 
| K | | > e» > v ab, (Hölder’s inequality). 


y=} p=] p=] 


The equality holds only if (a,} = c(b,) (c is a constant). 
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dé Ge ation table of knots. The signatures from 3, to 8), are all + or all —. 





When p=q=2, the nequality is ribs SC nequality, the Cauchy-Schwarz inequality, 
or Bunyakovskii’s inequality. As special c we have 


E Si Y x (a, >0), 

p-1 pul d, 

| 5 2 e| 5 d (a, » 0). 
pel p=} 


When 0< p < 1, we have an inequality by reversing the inequality sign in Hólder's inequality. 
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(6) When a,>0, b, 50, p » 0, and (a,) and (5,) are not proportional, we have 


ü l/p T Lin 
E EN d d * 


p=] v=} 


n 1/p 
Y (5 | (p21)  (Minkowski's inequality). 


p-l 





The integral inequality corresponding to (5) or (6) has the same name. 


n n 
(7) 1a,50 È ap= È ap=l, b20 
1 


p= p= 


n 
flee fl (3 awt) 
bell p=l 

In particular, for the determinant A= det(a,,), 


n n 
ar< II | 2 DN 
v=1\ p=1 
The equality in this holds only if all rows are mutually orthogonal. If all |a,,| < M, we have 
|A|<n"/2M" ` (Hadamard's estimation). 


(8) Suppose that a function f(x) is continuous, strictly monotone increasing in x » 0, and 
f(0)=0. Denote the inverse function of f by CT For a,b >0, we have 


ab da dx+ ‘ce (x)dx | (Young's inequality), 


and the equality holds only if b= f(a). 
In particular, for f(x) x^^! (p 1), we have 


a? bs 
p q 
where (1/p)+(1/q)=1. 


(9) If p,q> I, (1/p)* (1/4)7 l, a, > 0, b, > 0, 


2 ab, 


00 a,b 


Vp 1/4 
p r oo 2 = l 
EI viii" cl A 2 (a) i | > (b,) | (Hilbert's inequality), 


p= 


and the equality holds only when the right-hand side vanishes. 
(10) For a continuous function f(x) > 0 (0< x « oo), we put 

F(x)= f"s()dt, 

0 
and assume that p> 1. Then 
F P 
f^ d dx «( £ 
0 x p-1 


and the equality holds only if f(x) is identically 0. 
Further, if f(x) » 0, 


f i exp] L f "logf(1) d dx «e f °? f(x)dx  (Carleman’s inequality). 
0 X Jo 0 








) f "ir GO]' dx (Hardy's inequality), 


(11) Leta« x «£« b, p» 1, and 
EXOTIC 





p 
f "ie (Hl dx < X EH f ii fG)f dx — (Hardy-Littlewood supremum theorem). 
a a 


(12) If f(x) is piecewise smooth in 0< x < v and f(0)=f(7)=0, 
E | P dx > L [f GM. dx — (Wirtinger's inequality), 
0 


and the equality holds only if f(x) is a constant multiple of sin x. 
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9. Differential and Integral Calculus 


(I) Derivatives and Primitive Functions (— 106 Differential Calculus, 216 Integral Calculus) 








F(x)= f f(x)dx f(x)= F'(x) 
aq + Dy (a, B constants) ag’ + By’ 
ed qut ey 
o/b (4*0) (ed — edd 
Log lol (9250) 9' / 9 (logarithmic differentiation) 
O(y) (composite) (d9/dg):q' 
c (constant) 0 
x" nx"! 
x"*l/(n41) x" (n#-1) 
log|x| 1/x 
log,|x| (log,e)/ x 
x(logx — 1) logx 
expx =e* expx =e* 
a” (a>0) a*loga 
x* x*(1+logx) 
(x — l)e* xe* 
sinx COS X 
cosx —sinx 
tan x sec^x 
cotx ~cosec*x 
secx sec x tan x 
cosec x — cosec x cot x 
sinh x —-(e*—e *)/2 cosh x 
coshx —(e*--e *)/2 sinh x 
tanh x ^ sinh x /cosh x sech?x 
. coth x =cosh x /sinh x — cosech?x 
sech x = 1/cosh x — sech x tanh x 
cosech x = 1 /sinh x — cosech x coth x 
arcsinx (|F|« 7/2) 1/V1— x2 
arccosx (0c F« m) -1/Vi-x? 
arctanx (|F|« c/2) 1/(14- x?) 
arccotx (|F|« 7/2) ~1/(1+ x?) 
arcsecx. (0c F« m) 1/Ix| V x?- 1 
arccosecx  (|F| « 7/2) - Ix Vx?-1 
arcsinh x =log(x + Vx?41 ) 1/Vx^-1 
arccosh x =log(x + Vx?— 1 ) 1/Vx?-1 
ig 14x | ae (Ix| « 1) l 
2 l=x arccothx — (|x| » 1) 1-x? 
arcsech x = 1/xV1—x? 


arc cosech x —1/|x|V 1+ x? 
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F(x)= f f(x) dx 





l a—x 
2a ES i 
(1/a)arctan(x/a) 


(xV1— x? +aresinx)/2 


[x Vx?x1 +log(x+Vx2+1 )]/2 


— log|cosx| 
log |sin x| 
log |tanx| 
log| tan{(m /4) + (x/2)]| 
log | tan(x /2)| 
(x/2)—(1/4)sin2x 
sin x — x COS x 


COS X - x sin x 


nsin mx sinnx + mcosmx cosnx 


2 2 
= (n?m?) 


n?—m 
ebx bsinax — acosax 
a? +b? 
ox bcosax + asinax 
a? + b? 
xarcsinx + V1— x? 
xarctanx —(1/2)log(1 + x?) 


det( Gx); k=1,...,7 


(II) Recurrence Formulas for Indefinite Integrals 


(1) L,25]——— (mis a positive integer). 
+x 


1 x 2m-—3 





m 


I 


D p= 





2m—2 (+x! 2m—2 "^7 


= f E (m is an integer, a0). 
Vax? +bx+c 
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F= F'(x) 


] 
22.27 


X d 
1/(x? a?) 
Vi-x? 
Vx**1l 


tanx 








cot x 
1 /sinx cosx 
secx 
cosec x 
sin?x 
xsinx 


xcosx 


sin mx cos nx 


e”*sinax 


e* cosax 


arcsin x 
arctan x 
È det(gj ... 95, 19 jr Dv «1i 


vein e li isn 


I,-arctanx. 


The case m « 0 is reduced to the case m > 0 by the change of variable 1/ x =. 


] SRS cle 
Le — x"-VWax* bxc - 
ma 


(1/ Va )log|2ax+b+2Va Vax*+bx+c | 
l= -1 .  2ax+b 





(QQm—1)b | " (m — 1)c 


2ma 


15-2 (m 2 1); 


(a 0), 
(a « 0); 


In this case, for the integrand to be a real function it is necessary that 5b2— 4ac » 0. 


(3) I= [x"e* dx (m is an integer). 


Les Ae" ml, 1; Ip=e*, I= Eix, 
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where Ei is the exponential integral function (— Table 19.II.3, this Appendix). 


(4) L,,— f x"(logx)'dx (m,n are integers, n > 0). 


m+) xmtl 


Inn = ay WORX) — et Imni Imo Sen (MH = 1), 14, = (ogxy*!/(n 1), 





(5) Lz f x”sinxdx, J,, = f x"cosxdx (m is a nonnegative integer). 


In = — x" cosx t+ mJ,,_) = x" (msinx— xcosx) - m(m- DI 5, 
Je 7 x"sinx — mI, ,—x"- (xsinx* mcosx)- m(m- IM 
Jus —cosx, 

Jo sin x, 


(60 Inna= f sin"x-cos"xdx (m,n are integers). 





fox sin"*!xcos" "lx mn) I 
mn m+n m+n mn-2 
(m+ n0), 
Lom —sin"-!xcos"*!y m-l I 
mot mtn mtn nn 
—sin"*lycos"*t!x  m+n+2 


La a a eg 


n+l n+1 
sin"*lxcos"*lx m+n+2 l 
hat 2 RE m$ ly 
1,,;7(sin'x/2, Jos —cosx, J, _,=—log|cosx|, lg;7Ssinx, Ipg =x, 


Ja - = log|tan[(x/2)+(7/4)]], 71; =log|sinx|, 
1.,97log|tan(x/2), J_,,-,=log|tanx|. 


` =; ~ In-2, -(n—a("# 1). 





Im, -m= f tan"x dx = 


(III) Derivatives of Higher Order 





QU > (eru? (Leibniz's formula) - 
p 
n—l 
x^ II (k-»x*^" 
p-0 
(xt ay n! 
expx exp x 
a* (a0) a*(loga)" 
logx (—197(n— 1)/x" 
sin x sin[x + (n7/2)] 
COS X cos[x + (nz /2)] 
e™ cos bx r"e** cos(bx + n0) (where a=rcos8, b=rsin@) 
n-l 
=> Cy (^7 l)Cr- (te Ae 20 3) 0/2770 — x07 
arcsin x TER 
(where 2» — 1)! &1:3:5-...-Qy - 1, (7 Dite 0) 
arctanx (—1)""'(n—1)!sin’@sinn® ` (where x =cot@) 





j- e (5) 2520. (ITT, 
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Higher-order derivatives of a composite function g(r)z f(xq(t), ...,x,(0) | 
dg & of dx de of d'x af dx dx 


di 6 gx, dt ' uA." Ox; qp? us rd, dt dt’ 











ie 





do n of d?x, n 9?f d*x, dx; n 9 dx, ax; dx, 
d? | | 9X ut d | 0x;8x; Ox; d? dt ege 0x;0x, 0x, dt dt dt 


For a function z =z (x4, ...,x,) determined implicitly by F(z; x}, ...,x,) 20, we have 
Foe FaFa t FeFo FoP Faz 


iX 
EE 


F, F2 F? 


d: Fy, d 2, 


dr, F,' oxóx, _ 








Schwarzian derivative: 


d dun 3|(a dy V? 
oe 2 V G3-31 6) (8). (yix) 0e m (ax B) (ex d), 


osx)e( S toa -{x;z}]= — (Z i, (ay +b)/( +d); x) = (yix). 


(IV) The Taylor Expansion and Remainder 


If f(x) is n times continuously differentiable in the interval [a,b] (i.e., of class C"), 


n-l(b—a) 
f(b)* Y $e f(a)+R, ` (Taylor's formula). 
v=0 
R,, is called the remainder, and is represented as follows: 


(b-a)(b-& " 


Å 1l b APTI gan) mum SC ae, co TEEN) 
i ml H ELE (n-1)!p T 
On (1-6y-7/*(a4.0(6—d) (12p»0, 0<0<1, a«£«b, Ead 8(5—a) 





- (n— Dp 
(Roche-Schlómilch remainder); 


B l (b— a)" fe) (Lagrange's remainder); 
E l p; (67 6-8" SOO — (Cauchy's remainder). 
nD! l 


If f(x,y) is m times continuously differentiable in a neighborhood of a point (xo, Gol, 


fo hy K) E xs tÈ) Gs + Rn 





X 
out», Xu 
i 5 : pu Ro oo) ER 
O< u-v&m-l;ur20 Bets dx dy 
] d 
R= (hee tke) flxot E, pat Ok) (0<0<1). 


If all partial derivatives up to order m—1 are totally differentiable, 


m-l] i 
f(x thy, ....x%,+h4)= > A T3 f(x, x) Rm 
»-0 "Va H 


1 


"Mgr, GER EE AN 





bin... Bh + R 
$ Oxf... 0x," id 
where the > means the sum for r,, ...,»v, in the domain 0< v+... +v, « m—l; pr... Ba >O. 


The remainder R,, is expressed as 


m 


ET 5 h x] fx, 9h, x, 0h) (0c8-1). 
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(V) Definite Integrals [4] 


In the following formulas, we assume that m, n are positive integers. 6,,, is Kronecker’s delta 
(6,7 0 or 1 for mæn or m2 n), T is the gamma function, B is the beta function, and C is the 
Euler constant. 

For simplicity, we put 


5:...(m-2) mz 20* V? T[(m/2) - 1]/ Vr =m! /2"- 9? [Gn — 1)/2]! 
m!!= (m is odd), 
2:4-6:...-(m- 2) m2"? p[(m/2) + 1] 2272 (m /2)! 

(m is even). 
In an n-dimensional real space, the volume of the domain 


2 [r(1/p)] 
lxi! +... [x «1 (p»0) Eu 
p" nT(n/p) 
For p —2, this is the volume of the unit hypersphere, which is 
tl? B Geif" /nt (n is even), 
T[(n/2)+1) 122)" 9"? /n1 (nis odd). 


The surface area of the (n — 1)-dimensional unit hypersphere 


ex PI is FLAME | Qm) ^ /(n- 2)! (n is even), 


24 
D T(n/2) 202)" 7? /(n—2)» (nis odd). 


"ere ze f '( D £ 
f e *dx f, (os dx zT(p). 


4 T(q*1 
f s (1084) NU (Rep, Reg » — 1). 


IR 
l p- 4-1 oo S Em - T( p)T(a) 
L^ l(1—x)0* dx= Le Gao Bg e 
Ln xd Moo me 
0 (14x) "^ c T(14 5) 


(Rec >0; Rea, Reb» —1; Reb» Re S£), 


ala- x)" Ge ANM (gp)! 
| cq Ta eee cp? C9 


(O0<ce<b<a or 0<b<a<c;Rep,Reg>0). 


ge 1 ove (2n—1)!! 
ntl WAP S ETT 
c xm T 
———— dx -———————————— (2m+]<2n). 
-el4x? nsin[(2m + 1)z/2n] Ee 
œ Adem 


1 
GE MOS A mcr ca 
R Tay de T(a)T(1-— a) "p (0—a« 1). 


[Fea Yt y a dx= 3r( 2") (2n 1)! Vr fore (p=2n), 
0 la| ` 2 2 n!/2 (p=2n+1). 


fOe- et )ax- (b- a) Vg (a,b > 0). [Pe Ol des Va 
0 0 2 
e Vg 
e wt GU») T Poad gt See 
[re EE (a>0). Í agra dE. 
2 
QUE A EM d 
d g?* — g7 ax et tio uh 


dx = 7 


o x die EU 7 
R o l-x? 8 
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1og(1/x) | pT o x ılog(1/x) n? 
L Sz x CR. £709 Í pa f Ee 
Per at M Ht +x\1 om) 


l T 
` E n — = log2. 
0 




















0 VI—x? 2 
ilog(1 x l ] e (—])"' 
IW oi (ee (* E as EE 
0 14x? l+x n=0 (2n 1 
2 
« (logx) 167? l x? — +1 
————— dx = d dx =lo SEAN) 
Í l+x+x? 81V3 0 be Een kj Ga ) 
f log|logx|dx= f °e-logtdr= E 
0 0 
T. : 
Í sin mx sin nx dx = 4 cos mx cosnx dx = deg 
0 0 
s [1-(-)"""|—" (m+n); 
f sin mx cosnx dx = m*—n 
S 0 (m2 n). 
7/2 inp q Béi pti kal ( =5)) 
i sin?x cos?x dx ;^( 73 Rep, Reg > a 
(Le 3 p-(Diitoe-1(Ml/ZUptaill (p,q are even positive integers), 


(p—-U!!(q— D)!!/Cp* q)!! 
(p,q are positive integers not both even). 


P NR 7/2 va Tli *0721 WS 
H sin?xdx- d cos?x dx = z TÍG/241] (Rep » — 1); 


S (7/2)2n—1)!/(2n)!  (p-2n), 
(2n)!!/(2n+1)!! (p=2n+1). 





I7 sin(x?)dx= f” cos(x?)dx = cae sinx 4 ue SOR dx NZ (Fresnel integral). 
— oo — oo 


e sinax , T e tanx , 7 
jt dx=F (a>0), f ae dx 


NIA 


(take Cauchy’s principal value at x = (n + iy) 














œ sin?" * lx e Qn- 1)! oo sin?x T c sin(x?) T 
x = dx-—-. f dx= ^. 
x 2 (2n)! 2 4 
co Sin qx ag?! 
—— dx = ———_———__ (0< p<2). 
o x f 2T( p)sin( pr /2) eps) 
2 
"EP dee Te- Ip. °° COS OX dy = (1+ e-24) (a>0). 
o 14x? 2 o 14x? 4 
o sinax T = o xsinax T 
——— dx» =(1-—e77) (a>0). cee de e H (a»0) 
o x(l-4x?) 2 o 1+x? H 
^ gu t lx cosx y x= c sin?" + x cos?" lx da 7 Mm Un" D 
x 0 x 2 (2m+2n)!! 


7/2 (a>b>0), 


 sinax cos bx adx=17/4 (a=b>0), (Dirichlet’s discontinuous factor). 


x 
0 (b>a>0) 
e o EE = 27 £/2.. b č "ES 
d ee (la] <1). d a pg = agp (ab #0). 


[f xsinx y= 7 
0 1+cos*x 4 


1757 App. A, Table 10 


Series 
X cosnx di za"/(1—a?) (]a|« 1), 
0 1—2acosx +a? v/a"(a?—1) (la|» 1). 
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10. Series  (- 379 Series) 


(I) Finite Series 


(1) S,-1*-2*-...-n* (kisan integer). For k » 0, we have 


+1-i 


By (2+) - Ball & i k+1\ Bai (n+ 1)" 
yep a i | 275 3 


where B, is a Bernoulli number and B,(x) is a Bernoulli polynomial. In particular, 
Sp=n, Sy=n(n+1)/2, Sj-n(n*-1)2n*1)/6, S9 gäint 1) /4, 
S,7 n(n* 1)(2n+ 1)(3n? 4 3n — 1)/30. 

For k «0 and k= —- 1, 
S ,-c— [e D'/u- D! |[ "logrG9/ dx"... , 





uas. rl vc a soon mura EE 
CR 0-0D(ai-0^! Send? ai G+)! @-D! (att 


For !=1, the second term in the latter formula is replaced by log[1/(n+ 1)]. Here T is the gamma 
function, and the constants c, are 


IC (Euler constant) (/- 1), 
IT |£U) (is the Riemann zeta function) (/ 22). 


(2) Sio ecpisces E 1 (n+m)! 


i=} i=] (i—1)! mz (n-1)!' 


BESCHE, 4 D n! 
|o TEE SES 


i, Gitm-1)! m-l 


(geometric progression) 


i | a(a"-1)/(a-1) (a #1) 
| n (a=1) 


j= 








n +1 
(a+jd)=(n+1)a+ no) d= ntl (a+a+nd) (arithmetic progression) 
Ze 2 2 
CN d n _ (n+1)8 . 
Y, sin(a +jB)=sin(a+ 5 B )sin —— sin, 


j-0 
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y cos(a + jB) — =cos(a+ 5 B)s E SE n$, 


j=0 


n 
Y. cosec Xa = cot(a/2)— cot2"a. 


j=0 


(IT) Convergence Criteria for Positive Series > a, 


In the present Section II, we assume that a, >0. 

Cauchy's criterion: The series converges when lim sup Dia, « 1 and it diverges when 

lim sup J/a, >l. 

d'Alembert's criterion: The series converges when limsupa,,,/a,< 1 and diverges when 

lim infa,,,/a, 1. 

Raabe's criterion: The series converges when lim infn[(a,/a,.,) — 1 and diverges when 
limsupn[(a,/a,,,) — 1] « 1. 

Kummer’s criterion: For a positive divergent series Y(1/b,), the series Za, converges when 
lim inf[(5,a,/a,,,) — b,,,] 7 0 and diverges when limsup[(b,a,/a, ,,) — b,,, ] « 0 diverges. 
Gauss's criterion: Suppose a,/a, ,, — 1 + (k/n) - (0,/n^), where 2> 1 and (6,) is bounded. 
Then the series Zo, converges when k 1; and diverges when k <1. 

Schlómilch's criterion: For a decreasing positive sequence a,|0, let n, be an increasing sequence 
of positive integers and suppose that (n,+2 —n,,,)/(n,,, —n,) is bounded. Then the two series 
Xa, and D(n,,, —n,)a,, converge or diverge simultaneously. 

Logarithmic criterion: For a positive integer k, we put 


log, x =log(log,_, x), log, x=logx. 


Then for sufficiently large n we have 
The first logarithmic criterion: If 


<0, p»1 then Xa, converges, 


ay Ee doge nog. ul 20, pel then Xa, diverges 


The second logarithmic criterion: If 








ra n log,n log,..,n log,n ” 
a  n-tllogí(n-1) log, (nol) Vlog,(n4- 1) 
«0, p»1 then Za, converges, 
20, pel then Za, diverges. 
(III) Infinite Series 
eo TK oo (=) E 
$*-——-lg2 X ot e 5 (Leibniz's formula), 
SC i ere 2i-1 4 
MEG See E: 





Z 2iQi-1Qi-2 A? 





o (2i)! 1 oo i l l 
es TE SEET ` A 2 L -log(1 poss il C (C is Euler's constant). 
Putting 
oo = i—i Se " i-1 
(n= $ 5 a(n)= A a B(n)= S ( LZ i (m= X Ke 


1 Qi- 1) i=l 1 (21-1) 
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Series 
we have 
(Gei? (2? — Uein 
2n) = —— — By, a(2n) 2 -—— — — B,,, 
Wi n)- 2(2n )! 2n a( n) 2(2n)! 2n 
(22^71— J) qr?” qntl 
B(2n)= (Qn)! Bans e(2n l= 3i 2n: 


where B, is a Bernoulli number, and E, is an Euler number. 
£Q)2-?/6, £(4)2«*/90, §(6)=7°/945, 
a(2)2 77/8, a(4)=74/96, a(6)— 5/960. 
BQ)-«?/12, B(4)27«^/720, B(6)-31«5/30240. 
c(1) 5 7/4, €(2)=0.91596 5594177219 0150546035 14932... (Catalan's constant), 


e(3)— 4/32, e(5)=575/1536, «(7)-61«7/92160. 


(IV) Power Series (— 339 Power Series) 


(1) Binomial Series (1 + x) 2 372. EO ki This converges always in |x| « 1. If a 0, it converges 


in —1< x< 1l, and if —1« <0, it converges in — 1 < x <1. When a is 0 or a positive integer, 
it reduces to a polynomial and converges in |x|< oc. 





=> (-De! (|x|« D, 
i=0 











s (Gm Qi! , (-D'QD' , 
Vi¢x = 3 Ae y (|x| < 1), = A —— EI (|x|<1). 

i-o (2i - 122: (it? V1i+ i-o (i 

(2) Elementary Transcendental Functions  (— 131 Elementary Functions). 
oo i n 

E = S ash x X= 
e* =expx > m Jim (1+ x) , a*=exp(xloga) (|x|« oo). 

Ei NN DECH | 
log(1 + x)= X ——7:—x (-I«x«l) logx= 25 EE P (0 « x « oo). 

i=l 
, < (-1) gl d CD 2i 
gaa) Gm? ; anu m ear? (|x| < 00). 
oo 22i 232i -] ER 

tanx = 2 e 2 (Ix <5) (B; is a Bernoulli number) 





1 90 Ro Qu bo 
t o EN t 2i-1 EEUU 
cot x - »» v x (0<|x/<3), 





o E. . 
secx= > Hyd (Ixl SE (E; is an Euler number), 





Zen Qi)! 
1 2?! —2)B 
cosecx o + ltr T Zäit (O0< |x| <n). 
c (Qi! y e (=f , 
arcsinx = >) Ql x (|x| < 1), TO au 


ie e ! «1 
S pg Wl , 2701? Okt 


i= 
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(V) Partial Fractions for Elementary Functions 





oo 
8x 
tanx= A ———————, cotx= SET Ss , 
2 (2n+1) s? — 4x? "ug RE 
e (-1)'Qn-1 e (—})" 
secx=4 Y ee. LN ` Een 
nat Dn (ei dal x nai X? n^n 
co 1 SS 1 
secx= Y p coeck- X} ————2. 
n=—o0o [x-(Qn* 0/2) | n2 -o (x nr) 


(VI) Infinite Products | (— 379 Series F) 








eo oo 
An? 7 . x —x/a = ee TE): B 
i 22 E (Wallis formula), Hi: e e T(ltat+x) 


(C is Euler’s constant). 


Jl (iz See / 3-3) 
Ii 1/(1—p *)2$(s) (p ranges over all prime numbers, s> 1), 


Ii (1- Ex Im P cos 2. = DE EE 
nzl nz? x nzl 2" x” n=] Qn- (ei 


oo 
For|g|«]l, putting g)= il (+q), m= il +4", a= Il a-42775, 
n=l n-i ee 
Kal 
q4- [| (174?) we have 4,4:4,71. 
n-l 


Further, putting q = e'”’, we have the following formulas concerning $-functions (— 134 Elliptic 


Functions): 
9, (0, T) = 94493) Véi (0, T) E 2q 444945 9, (0, 7) = 9493s 91 (0, T7) = 2nq ^q; 


11. Fourier Analysis 


(I) Fourier Series (— 159 Fourier Series) 


(1) Fourier coefficients dy= D f "f (x)dx, a= = f "f (x)cos === dx 
a Jo a Jo 


b, = RE dx. 
nax -| f(x) (0<x<a), 
a 


Ka 
Fourier cosine series ` dot Y, a,cos 
f(-x) (-a«x«0) 


pel 
exe f(x) (0<x<a), 


Fourier sine series Y 6,sin— = 


The next table shows the Fourier coefficients of the functions F(x) directly in the following 
manner from a given function f(x) on the interval [0,a]. For x in [— a,0] and when the cosine 
series (a,) is in question, we set f(x) — f(— x), and when the sine series (5,) is in question we 
set f (x)= — f(— x). Thus f(x) is extended in two ways to functions on [— a,a]. The functions 
F (x) are the periodic continuations of such functions. We remark that the sum of the Fourier 
series given by the Fourier coefficients in the right hand side has, in general, some singularities 
(discontinuity of the function or its higher derivatives, for example) at the points given by the 


integral multiples of a. We assume that y is not an integer. 
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Fourier Analysis 
f(x) do a, (n=1,2,...) 
l 1 0 
—2a 
x 3 "lei 
2 
2 os — Im 
x 5 cC (- 
m ka} 2ka| C- "e^ —1] 
ka ka? + n?n? 
T mx sin ur TE y2 gals 
a um u? 
uTx l—cosuz 2 nl 1—(-1)" cos pr | 
sin —— EE 
a Du T p-nm 
1-A? 
a PA l 2A" (Al) 
1 2A cos(7x / a) - A? (M « D 
Asin(7x/a) 
] - 2A cos(7x / a) +r? 
By, (x/2a) 0 (—1)"*!220m)!/ nay" 
OT EEN 
logsin(7x/2a) —log2 —1/n 
(1/2)cot(zx / 2a) 
Note 


b, (n=1,2,...) 
[1 (7 D? *!Da/ nz 
2a 
n+] eh 
cy 
2 
i28 Leyes 
n" 


ee: Jee 
k?a? + n?n? 

2 [(- D'cospz — 1] 

T p-n? 

2 nsinur 


CY E 


A” (AM«n 


(— 1)"* Dm 1)! /Qngym*! 


1 


(1)The Fourier series does not converge in the sense of Cauchy, but it is summable, for example, 
by the Cesaro summation of the first order. 


sin nx 


S = äis: x) (0<x<27). 





(2) yc Im" 1 OSM =log(2cos> ) (-7«x«m) > 


n=] n=] 


æ% cos(Qn—-l)x | 
= zloglcot>) (9«x«2m x#m), 











zi 2n-1 
$ sin2n-1)X _ f 7/4  (0cx«m) 
Zu An-l —mc/A (mq«x«2m). 

S cosnx T 

> 2 = Sie: nsus (0 € x « 27), 

n=] n 

S sinnx x t 

5 = -xlog2- f log(sin 5 )dt (0 « x «2m). 
n=] n? 0 2 

oo a^ oo a^ 

5 STORE e^*"*cos(gsinx)-1l, > PT sin nx = e7°°S*sin(a sin x). 
nal n=l] ° 

= į COSNX | TCOSAX 1 

2 CW hui asinar jj 6 7590 

n-iA$Sinnx _ msinax  , 
S (1) mg O2sinar (-—7<x<7m). 


n=] 


In the final two formulas, we assume that a is not an integer. 


(II) Fourier Transforms (— 160 Fourier Transform) 


The Fourier transform ¥[f] and the inverse Fourier transform F [g] for integrable functions f 


and g are defined as 
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Zoll STI Gel "^ | foe 5 dx, 
i 


4[g(0]2 F Eg] (x)= Qn) "? | aoea, TEE HE o EA 


In some textbooks the factor (2x) "? is deleted or the symbols i and — i are switched when 
defining F and F. However, conversion of the formulas above to ones due to other definitions 
is straightforward. These transforms are also defined for some nonintegrable functions, or even 
more generally for ttempered distributions. The Fourier transform Z and the inverse Fourier 
transform F defined on the space of tempered distributions S’ = 5^'(R") are linear horneomorphic 
mappings from 5" to itself. Useful formulas of these transforms are given in the table below, 
where a — (94,05, ...,0,) (x; —0, 1,2, ...), |a| - a, +a,+...+4,, C is tEuler’s constant, 4€ C, and 
Z,={meZ|m>0}. 
Case 1. n— 1. First we explain the meaning of the symbols appearing in the table: 

x, =max(x, 0) (the positive part of x), 

x. =max(—x,0) (the negative part of x), 

x4 and x^ are understood in the sense of finite parts (— 125 Distributions and Hyperfunctions), 

(x + ie? =exp[A Log(x + ie)] (e 40; Log is the principal value of log) 

=(x? 467)? exp[iAArg(x+ie)] ^ (—m«Argz«m), 
(x + i0 — lim (x +ie)* (limit in the sense of distributions). 
co 


Then the following formula holds: 
(x i0 9 x4 etx 


Pfx" zx? -(—1)"x" (meZ) (Pf is the finite part). 


In the special case m= —1, Pf x^! coincides with Cauchy's principal value p.v.x !. 




















T (e?) F(T] (e) 
DÉI IN 
P(x) (polynomial) 27 P(d/d£)ó(£) 
p.v. 1/x [7/2 isgn č 
Pfx™ V/nf2 (Ci fm-—1)]£"7 sgnE (meN, Z) 
xt he D emet + eint gai] 
Ts S 
| AED emg ee (gZ) 
Ber? 

x" Gei, /2x) [n8 — i(—1)"m! PEE] (meZ) 
x." X ;-c) ei Tesné -en ogle | (meN, Z) 

2am- L 2 : , 
x^ c e 

|- AE) namq de Gei 

2n 

x Cg. HIFI (meZ,) 
SCH - Ip j- cet esencia (meN, Z) 

J2n(m—- D! LN j 2 d i 
(x + i0)? Län e" T(E Géi, 
(x — i0}? [2n en 20ET (EZ) 
(x i0)" =x" n i" (meZ,) 
x log|x| ai? isgn £-(C +log| él) 
e Sie afa e"? (a» 0) 
em (x»0) 1 1 
tc (<0) Ree 
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T (e v) | F(TI (ES) 


g als! fat Ja? +é? 

















—— (a0) 
Vix Jai = 
De ittas B Kla /£? +b?) (a>0,b>0) 
€ n Jeb 
x? +a? Be zh n 
log yz srt ele eil (a20,b 20) 
arctan(x/a) E (aeR,a#0) 
sin ax 1 1 
——-r Z 

IW E en us =a) Ta. Go 
cos ax 1 1 
IJ X See? SL Tal ns) SCH 
sinax V n/2 (lél«lal) 

x 0 (}¢|> Jal) 

l(a?-—-x?)*0? (|x| <a) él 
‘0 daso dë (Ë ty J_falé}) (Rev<1/2, a»0) 
0 (|x| <a) (S) 
D rns WEE emma: y J^ Kel) (-1/2<Rev<1/2, a>0) 

1 J2 Oé 

GP raptam taxa) K,(al£l) (Rev» —1/2, a»0) 





Case 2. n» 1. Let r=./x?4+x3+...4x2 and p=./t?4+23+...4+ £2, where x 2 (x) e R" and 


—(C)eR". If f e L,(R") depends only on r, then 7 [ f ] depends only on p and is expressed as 


FES p= p t ?? | fry"? Jo ys rp)dr. 


The constant C in the table stands for Euler's number. 











T (e) FTES’) 
ó(x) (2n)? 
P(x) (polynomial) Qxy'? P(ia/BE)5(2) 
202 (n4 2y2) o a EN 
Pfr? oa Pier" Geif, i¢-n-22,) 
ER Qay"(—A)"ó(£ (meZ.) 
AEN (—1)"p>" 1 I'((n/2)+m) 
GET echte cl See? P "Menta nez) 
p P «Ko dl 
(Leroy riu UL) 
(ir (27)? (1 AAL) (meZ.) 
e 2AT fn +A) e -nal 2 Ln 4/2) | 1D(74/2) 
GC BEE SÉ | p 2T(n-3) eh 
(Ad2Z,, A¢—n—2Z,) 
r?"logr (—1)n7120m*2n-7! mt T'((n/2) +m) Pf p^ "7?" 


1 1% 1 IY((n/2-4m) x 
+ np] log2—S0+5 $: d E AL) (meZ,) 


(-1) 2 1. 151 ipai 





—n-2m tnmen 
MOS SAT mimi Tes 08573€*5 Y 3 (2) m) 
QE m 1 dë E as) , , 
24 4j? AT((n2)-m) T((n/2)2 mp S 
2^ [n41 a 











The Fourier transform mentioned above is a transformation in the family of complex-valued func- 
tions or distributions. Similar transformations in the family of real-valued functions are frequently 
used in applications: 
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oo 
Fourier cosine transform fu) St F(t)cosut dt. 
0 


Fe) (t>0), 


2 oo 
Inverse transform = | S-(u)cos ut du = F(—t) (t«0) 


oo 


Fourier sine transform f(w= | F(t)sin ut dt. 
0 


PU  (t>0), 


2 oo 
Inverse transform = | ] f,(u)sin ut du = E F(-) Get 


The Fourier transform can be expressed in terms of these transforms. For example (in R1), 


sue Lf(0 - f(—0] vun: Lf(t) — f(— t)]sinut dt. 






l — cos au 
u 


sin au 
u 






(0c t« a) 
0 (a« t) 


p! 









(diverges) (7/2)sgnu 









GT (0cacl) l'(a)cos(7a/2)u ^? l'(a)sin(za/2)u * 
1/(a? 4 1?) me "ul die [e~ “ Ei(au) — e^'Ei( — au)]/ a 
eT a/ (a?  w?) u/(a? 4 u?) 
e^" (ReA>0) Vm /4À e Im e" /*(u/2VÀ )/ VÀ O 
gy V2 /4 (u/4X)e 7 7 






















sinat z/2 (O<u<a) l a+u 
t (uo 0 (a « u) aal S| 
tanh(at/2) cosech u 
sech(at /2) sech u 
cos(rarcsinu sin( v arcsinu 
Ju (Benz -1) Ls 
y t er>- Sen, EI 
(u—- Vi?-1 ) PR 
=; o (1« u) 
Vu^-1 


0 (O<u<a) 





J (at) ao 

: 1/ Vi? - a? (a « u) 
2 arcsinu 
= —=——  (0<u< 1) 
WAT 

Ne(t) ) log(u— Vu^-1 ) 


———————— UL e Hl 


id Vu?^-1 
(arcsinhu)/ V 14- u? 








7/2V Mx i? 





Notes 
(2) Ei is the exponential integral function (— Table 19.11.3, this Appendix). 


(3) We put 9{x)= | «a. 
0 
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(1) Laplace Transforms  (— 240 Laplace Transform) 
oo 
Laplace transform V(p)s f e PF(t)dt (Rep>0). 
0 


F(t) (t>0), 
0 (t<0). 


Inverse transform (Bromwich integral) x if Cri Opry ( p)dp= | 
c— ioo 
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(Vie? et 
i i J, (t) (Re»»-1) SS ee 
/P Um 
Vale p? —p)’ 
_{0 (O<t<a) = l ( 
-a-f (edd e^ 9 /p ia (Rev 0) SC 
[x/a] (integral part) 1/p(e^? — 1) 
Qa) Tl» (1/2)] 


1*7! (Rea 0) 


e ? 


T(a)/p* 


1/(p+a) 


SI (on (Rer> — x) 





Li 172 
Ve (ph a2) C? 











»/2 H y 
e ";^7! (Rea>0, (p+a)~*T(a) PP, QNT) Ae t 
(Rep » - 1) 2’p 
e- F(t) (a>0) V(p-a) 
(l-e7')/t log(1 p^!) Jet? (104p?) 7 
(nt) "Mie" De werd >0) Jo(x Vt) e 4 /p 
a log( V 1p? -p) 
logt — (logp + C)/p™ ED a 
T Vip! 
líp-1lV 
, xdi LP dza 
sina a/(p*+a*) nt) rim 
cosat p/(p? +a’) 
" m T(a*tn-*l) | [p-1l 
i 7" x — x? /4, a die 
sin(x Vt ) “2 puit dis e Xt) n! eeu p | 
7 _ p E S (1-p) (4) 
t7 V? cos(xVt) Vm /p e-*/* Hong (Vt) z Gat) n+(3/2) 
t7! sinxt arctan(x/p) 
x^ 1— cosax) 1log[1 + (a2/p?)] 
E (1-p)' 
; "ENS. -pn 
sinh at a/(p^— a^) Va Qn eeng 
coshat p/(p?-a?) 


Notes 


(1) C is Euler’s constant. 


(2) L, (1) is a Laguerre polynomial. 
(3) H,(t) is a Hermite polynomial. 
(4) 2n * 1)!! 2n D)(2n- D2n-3)...5:3-1. 


(II) Operational Calculus (— 306 Operational Calculus) 


0 (t<0) 


Heaviside function (unit function) 1(/)= 
1 (£20) 


Dirac delta function (unit impulse function) 6(t)= lim z+ €) — 1(t—)]. 
£0 
When an operator Min) operates on 1(f) and the result is A(t) we write Q(p)l(r)— A(t). 
In the following table (i) of general formulas, we assume the relations 2;(p)1(t)= An 
(i2 1,2). 


Carson's integral Q(p)2p f "end (dt (Rep 50). 
0 


d 
Laplace transform V(p)= = f "en (t)dt. 
0 
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(i) General 






Mu elt €x p) 


Formulas 










QA p)=pV(p) 
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(ii) Examples 














A(t) + A(t) p Bt 
aQ,p) aA,(t) I/p" (n20,1,2, ... (1^ / nlt) 
pip) A,(0)8 (0) +A) p/(p*a) (e "7 

p?/(p? +a’) (cos at)1(t) 
Lop) f Ace ap /(p^ à?) nat 
P 0 
(ap) A,(t/a) 
= a a t t^ 
[p/(pta)Qy(p*a)|e "dun (Rea20) agt — +— + (a+ ai tayr + JIO 
P pt I SA 
Zo Xp) [4 (r)A x1 — 7) dr P > P (ent UD 
P EE 0 à : k=l PPk k=ł Pk 


= f At~ Aga) ar Om Y. Eer 
0 kl Pk 


13. Conformal Mappings (~ 77 Conformal Mappings) 





Original Domain Image Domain Mapping Function 


























Z2 
|z| € 1 (unit disk) Iw] «1 wee. E. dzd«l, Jel=1 (general form) 
Z 
Z— o 
Imz >0 (upper half-plane) Iw| «1 w-£———, ]mzg»0, |eļ=1 (general form) 
— Zo 
Imz » 0 (upper half-plane) Imw>0 = ar ` , a,b,c,d are real; ad-- bc >0 
(general form) 
O<argz<a Sg wen 27/2 
(angular domain) 
lz| «c i, Imz»0 l+z\2 
I = 
(upper semidisk) wee E ( LZ | 
O<argz<a, sad 
|z| «1 Imw>0 v- (15277) 
(fan shape) l-z 
z—p 
a<arg—— « f EU ene 
qd 0c argw « y wa(e ei yes 
{circular triangle) 2-9 
0cImz«q Imw>0 wenn 
(parallel strip) 
Rez « 0, 
0cImz«q Imw>0, |w|«1 wenn 
(semiparallel strip) 
y^»4c(x c), 
z=xt+iy, c>0 Imw>0 w=Vz —ic 
(exterior of a parabola) 
y?<4e%(x +c), ae 
z=x+iy, c>0 Imw >0 w=isec™ S 


(interior ot a parabola) 2i 
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Original Domain Image Domain Mapping Function 
x? 
[c+ a/of 
2 "MEE 
yY 2.. 
Wed Iw|2c LEM , z=w+ {d 
[c - (1/c)] 2 
z=xtiy, c»l 
(exterior of an ellipse) 
2 2 
A vus E <4, 
cosa  sin?a 5 sis —2«) 
. —-ilaZt VZ^— 
z=x+iy, Imw>Q w=(e 5 | 
0Ocacm/2 
(exterior of a hyperbola) 
2 2 
X B S 4, T 
cosa  sin"a (lr Vgi-4 Vis 
x »0, Puy ( 2 ) 
z-xtiy, Imw>0 
x 7 
dE de E P 
(right-hand side interior 2 
of a hyperbola) 
Slit domain with boundary l 
1 er n 
Kik [Rew| « 2, Imnw 20 META 
Slit domain with boundary E z 
HES 4 zn w= = 
|w| > 1/4, argw (1 e^?) 
Slit domain with boundary 
» 1/4» 
EE pota w= LS 
argw =À * (2jm / p), (14 e 7 Az?) 
jz29,..,p-l 
—2/2<Rez<a/2 Sint domain with boundary ; 
; w=sinz 
(parallel strip) [Rew| > 1, Imw=0 
-7«Imz«m Slit domain with boundary = z 
: w=zte 
(parallel strip) Rew< —-l,Imw=+7 


Arbitrary circle 
or half plane 


Arbitrary circle 
or half-plane 


Imz>0 


Imz>0 


Interior of an n-gon 


Exterior of an n-gon 


Interior of an 
equilateral triangle 


Interior of an 
isosceles right triangle 


n 
w=c f? II (=z) dee (cs, 
c' are ee? where the inverse image 
of the vertex with the inner angle a;r (j= 
l,...,n) is z= ze, When z, 7 oo, we omit the 
factor (1 — z,) ^! (Schwarz-Christoffel 
transformation) 


n 
wae f un? II «- zl" dt + c 
j=l 
(c 7*0, c' are constants), where the inverse 
image of the vertex with the inner angle o 
V=l,...,n) is z= Zj, and the inverse image 
of oo isz=p 


j i 
w= f yeucy^ 
: Ñ 
vc Jua 
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Original Domain Image Domain Mapping Function 
: : . z l 
Interior of a right triangle w=] ————— dt 
Imz>0 E 8 
with one angle 7/6 ? Yea- 
Interior of a = el TT GEET 
l =| (l-t dt 
als regular n-gon e f ( ) 
0 « Rez « [^P : d 
0cImz «o,/i iusso w= P eae ( 8 is the Weierstrass - 
(rectangle) function) 
—K«Rez«K Riser: 
O<Imz<K’ imus z= TT di 
(rectangle)? i (1-00 — kr?) 
| (sn is Jacobi's sn function) 
Ee logq « Rew «0, 
(upper half- ER wein: 
ring domain) (rectangle) 
2 2 
4+2 <l, o we-VAl-B? als [77^ dt d 
|z|<1 E 0 (1- 2) — 2) 
w-utv, A>B>0; 1 AS Gow 
(interior of an ellipse) 7 08 1-5” K 
: i w” 3 w” A 
eg interior of a (wi) - 2 (07) = R(z) 
circular polygon : . 
(R (z) is a rational function) 
HEWER 
0 
m DEE EU -i.l 
Interior of an f tinal) 5$ *(1— z) * *dt 
equilateral circular triangle Gë f 
Iz|<1 with inner angle The vertices are the images of 
m/k, 1<k<% z=1, e?"i/3 and eri/3: and 
[4] _ TÍG/9 * Q/29)]TQ/3) 
dz Jamo T[((1/6) - (1/2kX)]T (4/3) 
Interior of a circular fic mco EE Doo: B ap dt 
e se D triangle with inner =. 20 
angles za, 7B, ry, NEE Lë cese 
a B4 y«10 f tU = (l-t) ^ (1-t-zt) ^ dt 
irl 1, J=J (1), rendu, J= g2/(82— 2783) (the 
~1/2<Rer<0 ImJ >0 absolute invariant of the elliptic modular 
function); J(e?"/9)=0, J(i)=1, J(o)-oo 
Ass Alz), T =w;/, A=(e, = e3)/ Ce, = €3); 
2 
[r- 1/2| « 172, ImÀ <O 16) 4 [X -A(0) 1] 
—1<Rer<0 SS amr a aa DR 
77 Metal 
A(—1)=œ, A(O)=1, A(oo)=0 
Notes 


(1) K,K’,k’ are the usual notations in the in the theory of elliptic integrals: 


TP -—— a ce ey ÀÁ 
9 vü-2)1- kt) 9 vVa-2P-x??) 


(2) When a + 8+ y ^ 1, the circular triangle is mapped into the ordinary linear triangle by 
a suitable linear transformation, and we can apply the Schwarz-Christoffel transformation. 
When a+ B+ y » 1, we have a similar mapping function replacing the integral representa- 
tions of hypergeometric functions in the formula by the corresponding integral representa- 
tions of the hypergeometric functions converging at a, B, and y. 
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14. Ordinary Differential Equations 


(I) Solution by Quadrature 


a,b,c,... are integral constants. 
(1) Solution of the First-Order Differential Equations (— 313 Ordinary Differential Equations). 
(i) Separated type dy / dx = X (x) Y (y). The general solution is 
d 
f—-fxo0ece 


(ii) Homogeneous ordinary differential equation dy / dx = f( y / x). Putting y = ux, we have 
du / dx =(f(u)—u]x, and the equation reduces to type (i). The general solution is 


veel f 5| (2) 


(iii) Linear ordinary differential equation of the first order. dy / dx + p(x) y + q(x)=0. The general 
solution is 


yale facon ods] / co. 


where 


P(x)sexp| f pa] 


(iv) Bernoulli's differential equation dy / dx + p(x) y + q(x) y * 0 (a <0, 1). Putting z - y! *, the 
equation is transformed into 
dz / dx - (1— o) p(x)z * (1— a)g(x) 40, 


which reduces to (iii). 

(v) Riccati’s differential equation dy / dx + ay?= bx". If m= —2, 4k/(1 —2K) (k an integer), this 
is solved by quadrature. In general, it is reduced to Bessel's differential equation by ay = w' / u. 
(vi) Generalized Riccati differential equation dy / dx + p(x) y? -- q(x) y +r(x)=0. If we know one, 
two, or three special solutions y = y;(x), the general solution is represented as follows. When y(x) 
is one known special solution, 


ven) Siet | f oP save, 
where 


P(x) sex - f (46) +2097) d]. 


When vu ya(x) are the known solutions, 
y—yi(x) 
y—y2(x) 

When y,(x), Sch ¥3(x) are known solutions, 


= el f p(x){ a(x) =y (x)} dx. 


y—y (x) P ys x) - y (x) 
y—y(x) — y60- 169. 


(vii) Exact differential equation P(x,y)dx + Q (x,y)dy =0. If the left-hand side is an exact 
differential form, the condition is 3P / dy = 3Q/dx. The general solution is 


[Pace f(o- 3; [ rx) 


(viii) Integrating factors. A function M (x,y) is called an integrating factor of a differential 
equation P (x,y)dx + Q (x.y)dy =0, if M (x,y)|P (x,y)dx + Q (x.y)dy] is an exact differential 
form dg(x,y). If we know an integrating factor, the general solution is given by g(x,y) — c. If we 
know two independent integrating factors M and N, the general solution is given by M/ N ^c. 
(ix) Clairaut's differential equation y = xp + f (p) (p = dy / dx). The general solution is the family 
of straight lines y = cx + f (c), and the singular solution is the envelope of this family, which is 
given by eliminating p from the original equation and x + f'(p) 20. 

(x) Lagrange's differential equation y = xp(p)+ (p) (p = dy / dx). Differentiation with respect 
to x reduces the equation to a linear differential equation Lei p) — py(dx / dp)-- 9'(p)x + y/(p)-0 
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with respect to x, p (see (iii)). The general solution of the original equation is given by eliminating 
p from the original equation and the solution of the latter linear equation. The parameter p may 
represent the solution. If the equation p = (p) has a solution p = po, we have a solution y = pox + 
(po) (straight line). This solution is sometimes the singular solution. 
(xi) Singular solutions. The singular solution of f(x,y,p)=0 is included in the equation resulting 
from eliminating p from f= 0 and df/ dp =0, though the eliminant may contain various curves 
that are not the singular solutions. 
(xii) System of differential equations. 

eq.(1)  dx:dy:dz- P:Q:R. 
A function M(x, y, z) is called a Jacobi's last multiplier for eq. (1) if M is a solution of a partial 
differential equation (0M P/0x) -- (OM Q/0y) 4- (OM R/0z) - 0. If we know two independent last 
multipliers M and N, then M/N =c is a solution of eq. (1). If we know a last multiplier M and 
a solution f =a of eq. (1), we may find another solution of (1) as follows: solving f =a with re- 
spect to z and inserting the solution into eq. (1), we see that M(Qdx — Pdy)/f, is an exact differen- 
tial form dG(x, y, a) in three variables x, y, and a. Then G(x, y, f(x, y, z))=b is another solution of 
eq. (1). 


(2) Solutions of Higher-Order Ordinary Differential Equations. The following (i)- (iv) are several 
examples of depression. 

(i) f(x, y 9, y & * D, vis (0 — k < n). Set y ? — z; the equation reduces to one of the (n— 
k)th order in z. 

(ii) f(y, y’, y", ..., y 9) =0. This is reduced to (n — Dst order if we consider y' — p as a variable 
dependent on y. 

(11) y" = f(y). The general solution is given by 


x=at f[2 f fO) dy. 


We have a similar formula for y= f( y^ 2), 
(iv) Homogeneous ordinary differential equation of higher order. If the left-hand side of F(x,y, 
y’, y 0) 20 satisfies the homogeneity relation F(x, py, py’, ..., py?) = p°F (x,y,y', ..., 9), the 
equation is reduced to one of the (n — 1)st order in u by u-y'/y. 

If F satisfies F(px,p' y, p^ ! y', ..., p!" y™) = p*F (x,y,y', ..., y), then u=y/x!, t=logx 
reduces the equation to one of type (ii) not containing t. 


(v) Euler's linear ordinary differential equation. 


pa (x)x" y 9 + pai (x)x" 7 yP... E pi x) xy  po(x) y = q(x) 
is reduced to a linear equation by t —logx. 
(vi) Linear ordinary differential equations of higher order (exact equations). A necessary and 
sufficient condition that L[ y] 27. op;(x) y= X (x) is an exact differential form is 
27 (7 1D) pf? =0, and then the first integral of the equation is given by 


n-|)n-j-1 k . 
Z £X (ODPRO fX adst c. 
j=0 k=0 


(vii) Linear ordinary differential equation of higher order (depression). 


n 
Liyl= È pG9y9 2 x (x). 
j=0 

If we know mutually independent special solutions y,(x), ...,y,,(x) for the homogeneous linear 
ordinary differential equation L{ yj=0, the equation is reduced to the (n — m)th linear ordinary 
differential equation with respect to z by a transformation z = A (y), where A( y) -- 0 is the mth 
linear ordinary differential equation with solutions y,(x), ...,y,, (x). For example, if m I, the 
equation is reduced to the (n — l)st linear ordinary differential equation with respect to z by the 
transformation 


y(x)=y, (x) f2(x)dx. 
Also, if n= m=2, the general solution is 


yo yyteyi- | Tyzdx tyr f Ty d, 


where T (x)z X (x)/Lyy (x) y3(x) — y4(x) y1(x)]. The denominator of the last expression is the 
Wronskian of y, and va 
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(viii) Regular singularity. For a linear ordinary differential equation of higher order, 
eq.(1) — xy? x" (x) yO P... p,(x)y =0, 


the point x =0 is its regular singularity if p,(x), ...,p,(x) are analytic at x —0. 
We put pọ= 1 and 


oo n 
2 f£()x'z Dd p G)p(p- 1...(o—j* 1). 
yz) j=0 
If p is a root of the characteristic equation fo(p) —0 and p+ 1,p+2,... are not roots, we can 
determine the coefficients c, uniquely from 


m 


eq. (2) © cfa- (p+r)=0 (m=1,2,...), 
pes DÜ 
starting from a fixed value co (0), and the series y = x?£7. 9c, x" converges and represents a 
solution of eq. (1). If the differences of all pairs of roots of the determining equation are not 
integers, we have n linearly independent solutions of eq. (1) applying the process for each 
characteristic root. 

If there are roots whose differences are integers (including multiple roots), we denote such a 
system of roots by p}, ...,p;. We arrange them in increasing order, and denote the multiplicities 
of the roots by ej, ...,@,, respectively. Put q, =p, — p, (k=1,2, ...,/509 q4« q5 € ... < qj). Take 
N > q; and a constant c (0). Let A be a parameter, and starting from co= co(A)= cl. fo(À + 
k), we determine c, = c, (À) uniquely by the relation (2). Putting 


misse kënt, ke (k-l...,])  forhin m,,í«h«m,- 1l, 


the series 


gf oo oo 
eq. (3) ZEE d zx 2 
ra A= PK RT 


h 


> Deet" 


j-0 





converges and gives e, independent solutions of eq. (1). Hence for k —1, ...,7, we may have 
E’ 1€ =m, mutually independent solutions of (1). Applying this process to every characteristic 
root, we have finally n independent solutions of (1) (Frobenius method). 

In the practical computation of the solution, since it is known to have the expression (3), we 
often determine its coefficients successively by the method of undetermined coefficients. 


(3) Solution of Linear Ordinary Differential Equations with Constant Coefficient (— 252 Linear 
Ordinary Differential Equations). Let oa, a, be constants. We consider the following linear 
ordinary differential equation of higher order (eq. (1)) and system of linear ordinary differential 
equations (eq. (2)). 
n 
eq. (1) Y ay? 7 X (x). 
i=0 


n 
eq. OI w= Moy tX% (x) Ge1...,2n). 
k=1 
(i) The general solution of the homogeneous equation (cofactor) is given by the following 
formulas: 
foreg.(1) y=x/expAx (j=0,1,,...,e.—I:k=1,...,m), 


m 
for eq. (2)  »j(x)7 X pa(x)exphyx | (jl, nl 
k=l 


where A,,...,A,, are the roots of the characteristic equation of eq. (1) or eq. (2) given by 
n 
eq. (1?) Y aA'=0, 
i=0 


eq. (27) det(a,, = Six ) = 0, 


respectively. We denote the multiplicities of the roots by e}, ..., €m (e, - ...- e, HE Pjx(X) is a 
polynomial of degree at most e, — 1 containing e, arbitrary constants. 

If all the coefficients in the original equation are real, and the root 4, =, +iv, is imaginary, 
then 4, = 4, — iv, is also a root with the same multiplicity. Then we may replace exp 4,x and 
exp 4, x by exp u; x cos v, x and exp 4 xsin v, x, and in this way we can represent the solution using 
real functions. 
(ii) Inhomogenous equation. The solution of an inhomogeneous linear ordinary differential equa- 
tion is given by the method of variation of parameters or by the method described in Section 


(2) (vii). 
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We explain the method of variation of parameters for eq. (2). First we use (i) to find a funda- 
mental system of n independent solutions y; = p(x) (k 7 1, ...,) by (i)). Inserting y; = 
2-0 (X)eu (x) into eq. (2), we have a system of linear equations in the c; (x). Solving for the 
c, (x) and integrating, we have c, (x). 

Special forms of X (x) or X,(x) determine the form of the solutions, and the parameters may 
be found by the method of undetermined coefficients. The following table shows some examples 
of special solutions for eq. (1). In the table, a, k, a, b, c, are constants, p,, q, are polynomials of 
degree r, and 7, is the operator defined by 


IL Ee L |sinax f cosax- F (x) dx — cosax Í sinax- F(x) dx] (a¥#0). 





Special Solution 









P,(x) A=0 is an m-tuple root of(1^) x"g (x) 
ke™ à= a is anm-tuple root of (1’) ex Ter? 
e "p, (x) A=a is anm-tuple root of (1^) x"q (x)e?* 
WË 2 2 
SOR Y. | V) pO HA Aand c, cos(ax + b)+ c,sin(ax + b) 
sin(ax + b) og(- a?) + a&j(— a?) 40 


(1) 2 0)/fQ?) and f(A?) 
is divisible by (A?-- a2)" 
(but not by (A? + ai? 


cos(ax + b) 
sin(ax + b) 





cos(ax + b) 
sin(ax + b) 


«ri 








(II) Riemann's P-Function and Special Functions (— 253 Linear Ordinary Differential 
Equations (Global Theory)) 


(1) Some Examples Expressed by Elementary Functions. A,B are integral constants. 














x—b\" x—b\* j 
de à : x dE ECH SS 
i aw —yw by (=) E 

u u C= [4+ Blog ese (n= w). 


a b c 
db u v «jean feci ir moa (At+ptv=1). 


These are for finite a, b, c. If c= oo, x — c should be replaced by 1. 


oc 0 : 
ple " Ae**--Be^* (a <a’), 
a 0 0 x e*(Ax+B) (a — a'). 
a O0 1 
oO 0 
Lon x ax,.o—l 
P a E" =A+Bfe x? ldx (a0). 
0 0 0 
oo 0 
P appe: "AN conet A+ gier are (az a). 


Riemann's P-function is reduced to Gauss's hypergeometric function with parameters a =À + u 
t»B-AÀcTucTy, y=1+A+X’ by transforming a, b, c to 0, 1, oo by a suitable linear transforma- 
tion and by putting z 2 x ^(x— 1)^^y. 


(2) Representation of Special Functions by Riemann's P-function. 
(i) Gauss's hypergeometric differential equation x(1— x)y" +[y— (a + B 1)x]y' — afy =0. 


0 l o 
y=P 0 0 a x}. 
I-y v-a-B B 
A special solution is F(a,,y;x) (— 206 Hypergeometric Functions). 
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(ii) Confluent hypergeometric differential equation xy" — (x — p) y’ - y =0. 


[ra] 0 
vr 
ISE 0 A 0 xj 
| p-A lp 


A special solution is 


€ TAt+k T 
TTE we Lei 


x* 
Zu TO) T(uwk) kU' 





(1/4)- n? 


2 


k 2 
TX x y-0. 


(ii) Whittaker's differential equation y" | — 4 
oo 0 


o_o 
Y-Pi 1/2 k  l2-n xf 
ee ae 
Special solutions are My, (x), Wk n(x). 


(iv) Bessel’s differential equation xy” + xy’ +(x? — »?) y =0. 


00 0 
e a 
ETA 4172 v xf 
—i l/2 -—v 


Special solutions are J,(x), N,(x) (— 39 Bessel Functions). When m=0, 1, 2, ..., Jm-12(%)= 
( SH 1)" 2mti24-WM2 wm LD d" (cos x)/d(x?)™. 
(v) Hermite's different equation (parabolic cylindrical equation) y" —2xy' + 2ny=0. 


oo 0 
oC 


y=Pig -n2 0 æf 
1 (n+1)/2 1/2 
When n=0,1,2,..., the Hermite polynomial H,,(x)=(—1)"27"/e*" d" (e- Zu dx" is the solu- 
tion. 
(vi) Laguerre’s differential equation xy" +(/— x +l) y' - ny =Q. 


o0 0 
——_ 


y=P 0 —n 0 xp 
1 /+n+1 -1 


When n —0, 1,2, ..., the Laguerre polynomial L'(x) - (1/n!) x 'e* d"(x"*'e *)/dx" is the 
solution. 
(vii) Jacobi's differential equation x(1 — x)y" + [q — (p - 1)x]y' +n(n+ p)y=0. 
0 l oo 
y=P\1-q q-p ptn x}. 
0 0 —n 
When n=0,1,2,..., the Jacobi polynomial 
P e ; = TI(g)x! "20 ei d^[x**"71(1-3)"*"*] 
n ILR SE rín+q) dx” 


is the solution. 
(viii) Legendre's differential equation (1— x2) y" —2xy' +n(n+1)y =0. 


1 -1 oo 
y=P:0 0 n+l xj. 
0 0 —n 


When n —0,1,2, ..., the general solution is 


ds neo tener f 
X (x1-1)"*! 


The Legendre polynomial P,(x) — [d" ((x? — 1)") / dx"]/2"n! is a special solution. 
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(3) Solution by Cylindrical Functions of Ordinary Linear Differential Equations of the Second 
Order. We denote cylindrical functions by C,(x) (— 39 Bessel Functions). 









Equation Solution 













2. y?y? 


Le ys poet E [yao 
x 


Ba 
» X 





y+ y —x*C,( Bx) 





1-2a a? — y?y? 
y+ | = Se Sa T By -20x12 |y=0 y = x*exp(iBx Y) C,( Bx") 
x 





» «x 20) y+ y=exp| fuco dx eu 





Gras aa: D ke 


y" a y x? 2y 20 y Vx C, 2, (ax") 
y=C,(e*) 
"+(e — y =0 f 
PET y= C, Bx) 


x? y" + xy’ +( B2x?- v?)y =0 


y= C,(ix) (modified 
x? y" t xy! - (x? yy =0 


Bessel function) 





(III) Transformation Groups and Invariants 


d é uo d ; ; 
Let U = ER "5. be the infinitesimal transformation of a given continuous transformation group 
x y 


d d 
of two variables, and U’ = Let gtt ap be that of its extended group. 
x p 


ôy 
We have 
i a On OE E 
dr dy ox dr ` 
We put 
dy u d?y 


P DN 


Let a, B and y be invariants of the Oth, first, and second order, respectively. The general form 
of the diffferential equation of the first or of the second order invariant under U is given by 
(a, 8)=0 (or B= F(a)), and Y(a, 8, y)=0 (or y= G(a,P)), respectively, where F, b, V, G 
denote arbitrary functions of the corresponding variables. 


Group With Infinitesimal 








Transformation U Invariants Note 
ia 
1 0 x p r (1) 
l 0 0 y p r (1) 
=y x 1+p? x?+y? | (y—xp)/(x+yp) r/(+p?y/* (2) 
0 y p x p/y Wéi (3) 
x 0 -p y xp x?r (3) 
x y 0 Dë P xr 
X —y —2p Xy xp x (4) 
ux vy (v—p)p y"/x' | x!” or px/y niae 
u y 0 2X — Wy p r 
0 h(x) h'(x) x h(x)p—h'(x)y h(x)r — h"(x)y 
1 ky) ro k'(y) (5) 
k 0 -k'(yp T a eae 
(») (y)P y 2 KGS x SMSen 
p r k'(y)p? 
0 k k'(y)p —— Saree (6) 
9) * kO) kO) KOP 


h(x) 0 —h'(x)p y h(x)p (h(x))'r+ h(x)h (x) p 
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Ist 2nd 
p Kal Cé 
k(y)  h(x) k(y) 









Group With Infinitesimal 
Transformation U 









+ h(x)k'(y)p 








xh(x) 





tat? xp) -po ` IS 


+ h'(x) 


(3% las 














l—p x+y 
TT- Or 
l+p x—y 


(x - yp)/ (1 + pXx —») 


r 
(Ip y 









Notes 
(1) Parallel translation. 
(2) Rotation. 
(3) Affine transformation. 
(4) Similar transformation; the equation is a homogeneous differential equation. 
(5) Linear differential equation. 
(6) Separated variable type. 
(7) When k(y) 2 y^, the equation is Bernoulli's differential equation. 


Reference 


[1] A. R. Forsyth, A treatise on differential equations, Macmillan, fourth edition, 1914. 


15. Total and Partial Differential Equations 


(I) Total Differential Equations (— 428 Total Differential Equations) 


Suppose we are given a system of total differential equations 
n 
dzj= >) PyGGz)dx, (j=1,2,...,m). 
k=1 


A condition for complete integrability is given by 


OP, (xiz) OP, (x;z) OP, (x3z) OP, (x; 2) 

a a + 2 ap wore RE EEN + 2 = d (x; z). 
Under this condition, the solution with the initial condition (x9, ..., x9; z9, ..., zl is obtained as 
follows: First, solve the system of differential equations dz;/dx, = Pj, (x1, x5, ..., x9; z) in x, with 
the initial condition Stell z?, and denote the solution by z;=¢,(x,). Next, considering x, as a 
parameter, solve the system of differential equations dz;/dx; = Balu, X2, X$, ..., x9; z) in x, with 
the initial condition z,(x})=,(x,), and denote the solution by z;— oj(x,, x2). Repeat the pro- 
cess, until we finally have z;— qj(x,, ..., x,), which is the solution of the original equation. Or, 
if we have m independent first integrals f(x; z) — c; of the equation dz;/dx, = P;,(x; z), we may 
transform the equation into du; — 34, Qj (x; u) dx, by the transformation u;= fj(x; z). Since the 
Qj. (x; u) do not involve x, and the equation is a completely integrable total differential equation, 
we have reduced the number of variables. We obtain the general solution by repeating this 
process n times. 

For 


P (xy, z) dx + Q (x,y,z) dy + R Gy, z)dz -0 
(n=3,m=1), the complete integrability condition is 


ðQ ƏR ƏR OP ap 9QY- 
NEIE aa - ae J a0 
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(ID Solution of Partial Differential Equations of First Order (— 322 Partial Differential 
Equations (Methods of Integration), 324 Partial Differential Equations of First Order) 


Let z be a function of x and y, and 
p=0z/dx, g=0z/dy, r=d7z/dx*, s=d7z/dxdy, t=07z/dy?. 
We consider a partial differential equation of the first order F(x,y,z,p, q) - 0. 


(1) The Lagrange-Charpit Method. We consider the auxiliary equation 


F, Fa pf,taqF, Fit pF, Frot, 


which is a system of ordinary differential equations. Let G (x,y,z,p,q) 7 a be the solution of the 
auxiliary equation. Using this together with the original equation F=0, we obtain p= P(x,y,z,a), 
des Q(x,y,z,a), and the complete solution by integrating dz = P dx+ Q dy. If we know another 
solution of the auxiliary equation A (x,y,z,p,q) — b independent of G = a, we have the complete 
solution z= ®(x,y,a,b) by eliminating p and q from F=0, G=a, and H=b. 


(2 Solution of Various Standard Forms of Partial Differential Equations of the First Order. 
The integration constants are a, b. 

(i) f(p, q) 20. The complete solution is z — ax + (a)y +b, where the function t= q(a) is defined 
by TU, a) - 0. 

(ii) f(px, q) =0, f(x, qy)=0, f(p/z, q/z) — 0. These equations reduce to (i) if x —e*, y=e¥, z—e?, 
respectively. 

(iii) f(x,p,q)=0. If we can solve for p = F(x,q), the complete solution is z= f F(x,a)dx + ay + b. 
A similar procedure applies to f( y,p, 4) — 0. 

(iv) f(z,p,q)=9. Solve f(z, t,at)=0 for t= F(z,a). The complete solution is then given by x + 

ay + b= f dz / F (za). If we eliminate a and b from the complete solution ®(x,y,z,a,b)=0 and 
9@/da=00/0b=0, we have the singular solution of the original equation. 

(v) Separated variable type f(x,p) 7 g( y, q). Solve the two ordinary differential equations f(x,p) 
=a and g(y,q)7 a for the solutions p = P (x,a) and q = Q(y,a), respectively. Then the complete 
solution is z= f P (x, a)dx t f Q(y,a)dy + b. 

(vi) Lagrange's partial differential equation Pp + Qq = R. Here P, Q, R are functions of x, y, and 
z. Denote the solutions of the system of differential equations dx : dy :dz= P: Q: by u(x,y,z)= 
a, v(x,y, z) - b. Then the general solution is ®(u,v)=0, where ® is an arbitrary function. A 
similar method is applicable to 


> P, x, — -R(x, "a 
EA J 


If we have n independent solutions u,(x)= a; of a system of n differential equations dx,/ P;— 

dz / R (j=1,...,n), the general solution is given by ®(u),...,u,)=0. 

(vii) Clairaut's partial differential equation z = px + q y + f(p,q). The complete solution is given 
by the family of planes z = ax + by + f (a,b). The singular solution as the envelope of the family 
of planes is given by eliminating p and q from the original equation and x= — df/dp and 


y 7 — 9f/9q. 


(III) Solutions of Partial Differential Equations of Second Order (— 322 Partial Differential 
Equations (Methods of Integration)) 


(1) Quadrature. Here oe and y are arbitrary functions. 
(i) r= f(x). The general solution is z= f f f(x)dxdx + e(y)x + p(y). A similar rule applies to 
t= f(y). 
(ii) s= f(x, y). The general solution is z= f f f(x, y)dxdy + e(x) - W(y). 
(iii) Wave equation. r — t 20. The general solution is z= (x + y) - v(x — y). 
(iv) Laplace's differential equation. r -- t 20. Let x -- iy 2 C and q, y be complex analytic func- 
tions of (. The general solution is z= o(C) 4- V(C), and a real solution is z= (0) + ©. 
(v) r+ Mp=N, where M and N are functions of x and y. The general solution is given by z= 
IO LG YN (x, y)dx + e()1/L6c y) dx + v (y), L(x, y) 2 exp [EM (x; y) dx]. In the integration, y is 
considered a constant. 

A similar method is applicable to s+ Mp- N, s+ Mq- N, and te - Mq N. 
(vi) Monge-Ampere partial differential equation. Rr + Ss+ Tt+ U(rt ^5?) H where R, S, T, U, V 
are functions of x, y, z, p, q. 
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First, in the case U=0, we take auxiliary equations 
eg, (1) Rdy?+ Tdx? — S dx dy =0, 
eq. (2)  Rdpdy* Tdgdx — Vdx dy. 


Equation (1) is decomposed into two linear differential forms X; dx + Y; dy =0 (i—1,2). The 
combination with (2) gives a solution uj(x.y, z,p,q) = de, GR, ZP, q) b; (i= 1,2), and we have 
intermediate integrals F;(u;, v;) 2 0 (i= 1,2) for an arbitrary function F;. We have the solution 
of the original equation by solving the intermediate integrals. If S*4~4RT, two intermediate 
integrals are distinct, and hence we can solve them in the form p= P (x,y,z), q= Q (x,y,z), and 
then we may integrate dz = P dx + Qdy. 

Next, in the case U0, let A, and A, be the solutions of U?A? + USA + TR + UV —0. We have 
two auxiliary equations 


d\, Udy + Tdx + Udp —0, A,U dy + Tdx + Udp=0, 
A,Udx + Rdy + Udq=0, \,Udx+ Rdy + Udq= 


and from the solutions u;=a,, v; — b; (i= 1,2), we have intermediate integrals F,(u;, v;) =0 (i= 1, 2). 
If 4(TR + UV) 4S?, 4, #A,, we have two different intermediate integrals F,=0. Solving the simul- 
taneous equations F;=0 in p= P (x, y, z), q — Q(x, y, z), we may also find the solution by integrat- 
ing dz 2 Pdx + Q dy. 

(vii) Poisson's differential equation. P — (rt — s?" Q, where P= P(p, q, r,s, t) is homogeneous with 
respect to r, s, t and we assume that Q = Q(x, y, z) satisfies 0Q/0z # oo for x, y, z when rt ^ s?. The 
equation P(p, o(p), r, ro'(p), r(o'(p))?) 20 is then an ordinary differential equation in o as a func- 
tion of p. We first solve this for o, and then solve a partial differential equation of the first order 
q=0(p) by the method (IT) (2) (i). 


(2) Intermediate Integrals. Let f(x,y,z,p,q,r,5,t) be polynomials with respect to r, s, t. Suppose 
that f(x,y,z,p,q,r,s, t) * 0 has the first integral u(x,y,z,p,q)=0. We insert 


du Qu du du = du Qu , ou du 
Gel Ee GEZ gp Em ILE Ae 


into the original equation, and replace all the coefficients that are polynomials of s by 0. We thus 
obtain a system of differential equations in u. If u and v are two independent solutions of this 
system, an intermediate integral of the original equation is given in the form (u, v)=0. 


(3) Initial Value Problem for a Hyperbolic Partial Differential Equation L[u]=u,, + au, + bu, 
+cu=h. 


u(n) =[(UR rap e län f f R Cy: EMA Cy) dx dy 


«f l («35 - nx du a { acos(n, x)+ beos(n,y)) uk |ds, 


where A is the hatched region in Fig. 19, and the conormal n’ is the mirror image of the normal 
n with respect to x 7 y. 


ulEmM=(uR ot (^R (uy au) dy f R (u,+ bu) d ff R (xy: E mhl) dxd 


(characteristic initial value problem). 





Fig. 19 


Here [7] is the hatched rectangular region in Fig. 20. R (x,y;& m) is the Riemann function; it 
satisfies 


M [R (x,y; tal 
R, — bR=0 (on x=8), 
R,- aR -0 (on y 2), 
R (&n;&)7 1. 
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Example (i). u,, = h(x,y). Roy; &m)- 1. 
u(é,n) = 1lu, t ugl 2 J, [s eos) + ucostny)]ds+ ff hG ax. 


Example (ii). Telegraph equation u,,+cu=0 (c 50). Rn: &n)= Jo(2 V c(x—£&)(y—9) ) 
es n 
Example (iii). u,, + pu +u,)=0 (n= a constant >0). 


dal XT pl A a oie STOOD 
RGs&p-( Ei | d Wéider } 


(IV) Contact Transformations (— 82 Contact Transformations) 


We consider a transformation (x), ...,x,; z) (X, -Xn Z). We put pj 0z / 0x, P= 0Z/0X; 
(j=1,...,n). The transformation is called a contact transformation if there exists a function 
p(x, z,p) »*O satisfying dZ — X P,dX;— p(x,z,p)(dz — X p; dx;). 

A transformation given by (2n + 1) equations Q=0, 0Q/0X;-- P,90/90Z 20, dQ/dx; + 
p;d%/dz=0 generated by a generating function Q(x,z, X, Z) is a contact transformation. 


MEE = | Transformation Name 


X =-p, P=-x;, Legendre’s 
Z-2Zpx-z transformation 





Generatng Function 













ExX;tztZ 








Xj=— PZ, 
P= — (2X; — x)/QZ — 2) 





ZXj-tZ?-XxX,-zZ 





Z/(2Z - z) Pedal transformation 









X; 7x, — ap(1- X p?) H, 
P;=P;, 
Statt SEH 


ZU ett A ai Similarity 





Xj = X= Byes 
X(X-xy-z?-z2 P= — p(Zp?—1) 12, 


Z-zXpj-1^ 





(V) Fundamental Solutions  (— 320 Partial Differential Equations H) 


A function (or a generalized function such as a distribution) T satisfying LT = 8 (6 is the Dirac 
delta function) for a linear differential operator L is called the fundamental (or elementary) 
solution of L. In the following table, we put 


1 (x»0) 


(Heaviside function). 
0 (x<0) 


m 92 ai Cl ai TE 
usas e ee R | 
J, is the Bessel function of the first kind; K, and J, are the modified Bessel functions. ( —Table 
19.IV, this Appendix.) 

2 


s=) Mee x?-...-x2_,  (ifx,»0 and the quantity under the radical sign is positive), 


0 (otherwise). 


(For Pf (finite part) — 125 Distributions and Hyperfunctions.) 





Operator Fundamental Solution 
d / dx 1(x) 
d” dg 1! (x>0) 
dx" 0 (x <0) 
9" / 0x,0x,. .. 0x, l1(x,)1(x;)...1(x,) 
at SE (i= /-1) 
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Operator Fundamental Solution 


-[r (5)/2- Herl (n>3) 








A 
(1/2m)logr (n=2) 
Se NE n/2 = n—2m is a positive | 
fr (5)/2 vee Der n VE ol "- elds eae odd integer 
AU ge 
logr (n-2 SE 
AY Jom 4, — (lep/i = n—2m- : 
. RE Lg ul (2k n| Macer " 
kAm—h 
A ý 22" m—(n/2) 
7 D 
g” (Pf 2" 77)/5 0/27 132m- 15s — !T[m 4 1 — (n/2)] 
Ajo 07 07» Sa Inin (VAs ) (A »0) 
— A)” pp gn eg 
(A 1s a and +0) gO = E voeem Jp - 2X VINs ) (A<0) 
3 ^4 EH ý we) cl - Sé (x, — 0) 
dr >È dx? (m - D! A2 Vig, i=] 
i 0 (x, « 0) 


(VI) Solution of Boundary Value Problems (— 188 Green's Functions, 323 Partial Differential 
Equations of Elliptic Type, 327 Partial Differential Equations of Parabolic Type) 


L[u] = Au, +2Bu,, + Cu,, + Du, + Eu, + Fu, 
M [v] =(Av),,+2(Bo),, + (Cv),, — (Do), - (Ev), + Fe. 
; = = Qe 294 
Green's formula ACAD uM lvl) axdy = d Pu A Uy )+ Ge) ds. 


A cos(n, x) + Bcos(n,y) 2 Pcos(n', x), 


.(l 
eq: xD | B cos(n, x)  Ccos(n,y) = Pcos(n',y). 


Q=(A, + B,— D )cos(n, x) - (B,-- C, — E )cos(n,y). 


The integration contour C is the boundary of the domain D (Fig. 21), n is the inner normal of 
C, and n’, called the conormal, is given by (1). 


Fig. 21 





(1) Elliptic Partial Differential Equation L[u]= uxx + u, + au, + bu, + cu — h. 
uE) =- fu) o ds f f 6 Gri hc) ddp. 


Here G (x,y; & m) is Green's function, which satisfies M (G (x,y: £ q)) «0 in the interior of D 
except at (vin, and 


G(x,yi&m)- — (1/2«)logy(x — £) + (y-n) + a regular function, 


G(x,;£m)-0 ((xy)€C). 
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(2) Laplace's Differential Equation in the 2-Dimensional Case ou + PE =Q. 
u(x,y)=a(rp)=Ref(z) (z=x+iy=re”). 
(1) Interior of a disk (7 < 1). 
1-7? 





_ T. e" tz 2 _ 1 2. 
f(z) zi ie) dg, (8) f, ü (1,9) = de 


1—-2rcos(0 — 9) 4 r 
(Poisson's integration formula). 
(ii) Annulus (0c q& r« I). 


f(z)= SIT (ds ("alapt (w)dp—alogz | 


(Villat’s integration formula). 
an t a[2b.cchy qe -ü C PERSA 
=- logztg) a E wi), (8 (1,9) - (9,9)) dọ, 2m z 084- 


Here $, and $4 are the Weierstrass ¢-functions (— 134 Elliptic Functions) with the fundamental 


periods 20, and Jus, 

" j poo u(t,0) 1 ço uls,O0)y 

iii) Half-plane (y > 0). f(z)= + dt, u(x,y)=— | ——"—~— dt. 
(ii) Half-plane (y 20). f(2)= SC es ea 
(3) Laplace's Differential Equation in the 3-Dimensional Case. 

(1) Interior of a pus (r « 1). 





= 2 
ü(r. 0) o q- xj. d à (1,9,9) — —.—" — — inoaode, 
(1—2rcosy +r) 


where 


COS y = cos O cos + sin Osin f cos(®— q). 
(ii) Half-space (z > 0). 


u(é,n,0) 
u(x,yz)= 3- f f^ - — z dédn. 
=> f(x- +(y—n) +27} 
(4) Equation of Oscillation (Helmholtz Differential Equation) Au + k*u=0. Let u, be the nor- 


malized eigenfunction with the same boundary condition for the eigenvalue k,. Green's function 
is 








u,(P )ux(Q) 
G(P,Q)- SE 
EE 
Boundary 
Domain Condition Eigenvalue Eigenfunction 
rectangl nam x » 
ectangle u=0 kam = TA] zt n" sin nr — sin mr 7- 
0«x«a, O«y«b a a b 
(n, m2 1,2, ...) 

circle O<r<a u=0 Kam is the root of J,, (kx) 20 Jn Kamt en"? 


k„m 18S the root of Jn Kam?) ` Nox Kum?) 
annulus b<r<a u=0 Je (ka) N, (kb) J, (k ) Nin (Ky ma) SC 
a 
— J,, (kb) N,, (ka) 20 m\Kam 





fan shape 0 k,,, is the root of J, (ka) =0 
O«r«a, O0«g«a u= (po pre /a) J, Kan FEI up 
rectangular parallelepiped 2 2 2 
O<x<a, O«y«b dw eg km =T LOIN LENT ia cose Ž cosmn ~ cos Ir a 
: $ ðn zm a b c a b c 
0«z«c 
k„ is the root of y, (ka) — 0, 


sphere O<r<a E: =0 where mu V, (Knit) P" (cos 0)* me 
Vu (p) & Vv/2 Jaaa) 


2 2 
(5) Heat Equation. e =x Au (.- 23 T. € ; k is a positive constant} Boundary condi- 
xy Xm 
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tion: hu — k ðu/ðn = o, where h and k are nonnegative constants with h+k=1, and ọ is a given 


function. 
u(P.)= f GC. Q,0u(Q.0) dV * f af {Seer 


Here V is the domain, and S is its boundary. G (P,Q, t) is the elementary solution that satisfies 
dG/dt=K AG in V and kðG/ðn= hG on S, and further in the neighborhood of P= Q, t=0, it 
has the form G (P, Q,r) =(4rt)~"/2e~®’/4*" + terms of lower degree (R= PQ). 

(i) —oo <x «oo, G- U(x —&t), where U(x,t) e^" /^*/ V4axt (similar in the following case 
(ii)). 

(ii) 0€ x « oo. u(0, t)=0: G-2U(x—£rn-U(x-£r). 


he GeU(x-ED0* Hien - Diet f Te mU Gs dn, 


+ G(P,Q.t- 7) |9(Q,7)48,. 






































(iii) De x « I. u(0,1t)= (1t) -0: dell = j- Sr d 
94 9,0)» 2* (1,00: c-o eor Ji 
«(0,1)» 2" (1,1) =0: e—al SS dE d 

a) ar | 





Here ? is the elliptic theta function: (x|r) = 94(x,7) 2 1--2Xe 7" cos2nm x. 
(iv) 0€ x « oo, Ox y « oo. u(x, 0,7)=u(0,y,t) 20: 
Gelee: 8/4 ... e C9 däre orm) e 0 *n'/) / Ami, 


b u —0 on the boundary: 


2 2 
m n MTX mat amy . nv 
en (enn S sin sin sin, 


(v) 0«x«a, 0« y 


< 
ud. y $ exp 
ab " 
0 





a? b b 


<z<c.u=0 on the boundary: 


oo 2 2 2 
ix (emen 
2 d a P e 


^ 
m 


abc Jl mein 
lnx . Ini mm . wer. noi 
xsin E sin sin sin n in aint 
a S 5 sin b sin sin =>. 
(vii) 0< r « oc. Spherically symmetric. Ix| - r, lel r: 
C= (et: CH dät e "(rf äert /8mpp (mt) 7. 
(viii) O<r<a. oe symmetric. u —0 on the boundary: 
l = rnnt dal, nor’ 
= Qnarr’ De sin sm a 


n=l 
(ix) a & r< oc. Spherically symmetric. kðu/ðr— hu=0 on the boundary: 


l —-í(r—r zr rm 
G= SS e ( Vis A e (r+ 2a} dät — af z k —— — (4r xt)? 
Sarr’ (at) 





2 
xexp | AEE) t(r-r- zait 


‘a 2a ah + k EY oo 2 
xerte| 028 4 HEE ue | (erfex= e! ar), 
2V Kt ak J 
(x) 0<r< œ. Axially symmetric: Gene" tM p Cep! /2t) / Amt. 
(xi) 0< r< a. Axially symmetric. 2 — hu —0 on the boundary: 
1 $ Jo (ra,)Jo (r'an) 
Ta? ve (Jo (aa, Er + (J, (aa, yy 


where a, is given by ka,J,(aa,,) — h Jo(aa,,) = 0. 


ze — ka, t/a 
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16. Elliptic Integrals and Elliptic Functions 


(I) Elliptic Integrals (— 134 Elliptic Functions) 


(1) Legendre-Jacobi Standard Form. 
Elliptic integral of the first kind 
sing dt 


F(k g e E 
"JL Vi-EsPy h — Vü-P- En) 


Elliptic integral of the second kind 


er sing _ 12,2 
E(k.) S ("V 1- Khsintg a= f Ee 
0 


1-7? 


(k is the modulus). 





Elliptic integral of the third kind 


i dj E: di 
IT(9,n,k)= Fe ee em — 
o (t nsiny) Vi- sity Jo — (tn?) Va- AA- ee) 


When 9 7 7/2, elliptic integrals of the first and the second kinds are called complete elliptic 
integrals: 


7 l 
K(k)= F(k LEg(l Luut 
a-f” VORSN e Kee 2 E 2" | 


qe des 
pue s(s5)- f V1—ksin’y «- [ y d= S EECH) 
0 0 


where F is the hypergeometric function. 
K(k) =K(V1-k? )=K'(k) | Ek E(VI-k&? )&E'(k) | (k?-21—- K^; k' is the 
complementary modulus). 








EK'-E'K—KK'- E (Legendre's relation). « 


AE l[E-k?F X singcosg 3E _ E-F 
ok ER 


k V1- K'sinlg 


(2) Change of Variables. 





tn(é-g)- Kane: F( pY) =U + OF 9. 





E( gr dl pep EU KF el pep sind. 


l+’ 
, (1+ k)sing Wk 
MS ue r( x a + OF) 


g( x)= ii Deet: k?F (k,q) 


j| sume V1- k? sin? 
l+ksin’@ 
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ky sing, COS F(ki pı) E(ky,91) 

ix MI NN — P KF(kg) L Esc Bee. 
Vi-KEsiàg V1-k sin? V1-k?sin?g 

k' -itang xs —iF(k,g) i[E(k,q)- F(k,q)- V1-— E sginie tang] 

i ksing ` VI Ke ` kF(kg) y EG 9) kF) 


(3) Transformation into Standard Form. 
(i) The following are reducible to elliptic integrals of the first kind (we assume a> b >0 for 
parameters). 

















AF(k,) A k o 
KW, l v3 -1 V3 +1—x 
pou EET arc COS ————— 
ur v3 2V2 V3 -1+x 
f dt 1 V3 +1 GE 
| Vi-# V3 2V2 V3 +1-x 
d s 3 == arccos =x? 
0 Vis: * 2 v2 1+x? 
d di d b = 
RENE = Se Se 
5 (a?— 17)(b2— 1?) a a 
x ]— b 2 
KE l 1- (biet arcsin Su? 
» Vla- Pyn- p) 4 1—(b/a) 
f dt ] b | fi-(a/xY 
BE mca pum UI REPRE MR: GM T — arcsin MET: 
a V(?=a2)(?— P) 12(5/3) 
* dt l 2 X 
E S S S S ZZ = yl—-(/a) arc tan = 
f V (a2 + 12)(b? + 2) 4 b 
f dt l 4 2 ft+(b/ay 
ERE CI VV = LUN E LNÉ aen SSS arc sin A 
o Vla- (ph e) Va? +b? Va? +b? 1+(b/x) 
f dr l a b 
ee UN UN TE gu tma KEE arc cos — 
p Mieitcdhtd-  Væ+b Vat p? x 





(ii) The following are reducible to elliptic integrals of the second kind (we assume a > b >0 for 
parameters). 
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AE(k,q) A k 9 
e 9o. 29 
a^—t 
f PC dr a 
0 
j 24:3 TEN 
a dt Va? b? de arccos Č 
a*—t a? + b? a 
X 
x 
1. [Pta a? +b? a b 
Kä dr SE SE arccos — 
, t? db p? Va? + b? x 


arcsin = 
b 





afo 

















2 
2 (EZE 
: mus dt Va?- b? EN. A arcsin La = 
IT Vaitbi 1t (x/a) 
241.25 2 2 
(b2 4 Di p a b 
x — 2 
dt 1 Va?— b? 1—(5/x) 
-5 ————— arcsin Y / — 3 
p PV (2-8)(a2- ?) ab p 1— (b/a) 


(II) Elliptic Theta Functions 


(1) For Imc» 0, we put g=e'™ and define 


eoo 
959 (u,7) 2 094, (u, 7) 12 M (— 1)"9" cos2nru, 


n=] 


$1(u,7)z2 > (~1)¢ SE sin(2n + 1)zu, 


a> 


9, (u,t)=2 y q^ * 4/2F cos(2n + Deu, 


n=0 
= 2 
$4 (ur) 14-2 Y, q” cos2nau. 


n=] 


Each of the four functions d, (j=0, 1,2,3) as a function of two variables u and r satisfies the 
following partial differential equation 
079 (u,7) 99 (u,7) 
——— — -4ri 
Ju? dr 
(2) Mutual Relations. 
9(u) + Iu) - P u) + 03(u), ` 99 (u) = Kod (u) + K'83(u), 





8j(u)— —k'OQ(u) - kOZ(u), OP) = kóg(u) — k'8(u), 
where k is the modulus such that iK'(k)/ K (k)  r, and k’ is the corresponding complementary 
modulus. 


$5(0/85(0, — K'285(0)/93(0). 


am ZO KO Am 
$:(0 P 20) — Aa) 

(3) Pseudoperiodicity. In the following table, the only variables in 9 are u and 7. m and n are 
integers. 











$50) = 79(0)93(0)99(0), 
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Increment of u Bo $1 9, 9, Exponential Factor 
m4 nr (C(e CDA OC DA, 94 ocn 
j x Qu + nr)] 
m-tn 9, CDA (C Drt (D 
l n; mM Ay m, 
m*(n* 5) (-1yi$, (Dti (= DA a. epf "Je Dh 
l 
x {2u+ LEE d 
SEET EN a. a (Dri (Im, ( 2) | 
Zeros u = m m nr med nu 
2 2 
«(ne 3) +nT «(ne 5) 


oo 
(4) Expansion into Infinite Products. We put Qo I| (1— an. Then we have 


n-l 


oo 
$y (4) 2 Q II (1-247?! cos2su - q*"~7), 


nzi 


oo 
81 (u) 22Qgg"^sinzu || (1-2q?"cos2au + q^"), 


n=1 


oo 
$4 (u)=2 Qoq tcosmu || (1297 cos2zu * q^"), 


nz] 


oo 
ð, (u)= Qo II (12 24?7! cos2ru + 4*7?). 


n-l 


(III) Jacobi's Elliptic Functions 


(1) We express the modulus k and the complementary modulus as follows. 
, am ` am 
$3 (0) 85 (0) 
Then we have 


K(k)=K= Ob (0, K(k)=K’=—itK. 


k?+k?=1. 


The relation between q and k is 


q= e me KK) 
1/2 4 6 8 
WR ys sr 
An SE ) +15(4 +150(£) +1707(£) +... |, 
VERNE 
ge ët Zrt Ur pe. ATs, where AEN 
pou ie Vig 


(2) Functions sn, cn, dn; Addition Theorem. 
9 2K 9 2K DÀ 2K 
ge SE en(u,k)z V JUR) ) dnlu,k)=Vk' Suak 
Vk ®(u/2K) d (u/2K)’ Po (u/2K) 
sn?u +cn%u =], dn?u + k?sn?u = 1. 


snucnvodno+snocnudnu cnucno—snudnusnodno 











sn(u + v)= cn(u + v) = 
1 — k? sn2usn7v ' 1 — &?sn?usn?o 
dnudnv-— k?snucnusnocno 
dn(u + v)= SS : 
| - k*^sn*u snio 
dsnu dcnu ddnu 
T =cnudnu — — = —snudnu LL = ~—k*snucnu. 


du : du í du 
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(3) Periodicity. In the next table, m and n are integers. 











Increment of u snu cnu dnu 
2mK +2niK’ (-—1)"snu (-—1)"*"cnu (-1)"dnu 
8 T _ pva+1 CNU rags SNU deeg 
(2m —- DK -2niK (GI Gren (—1)"""k aaa (—1)"k dix 
"ki (vn "m —pyntatl; -ı dnu ze (ti MU 
2mK t (2n t iK (—1)"k Se (-1) ik inu i(— 1) mp 
= "EK. (mt) -ı dnu yntan’ xp 1y; ,Snu 
(2m—-1)K+(2n+1)iK’ (—-U"* k Se (—- ImTDEk aa ( — lik Se 
Zeros u= 2nK +2miK’ (2n+1)K+2mik’ (2n+1)K+(2m+ DiK' 
Poles u= 2nK t (2m t 1)iK’ 2nK * (2m t l)iK" 2nK * (2m * iK" 
Fundamental periods 4K, 2iK' 4K, 2K+2iK’ 2K, 4iK’ 


(4) Change of Variables. In the next table, the second column, for example, means the relation 
sn(ku,1/k)=ksn(u,k). 











u k sn cn dn 
ku 1/k ksn dn cn 
iu k' i= L dn 
cn cn cn 
j k , Sn cn uh 
oe 2 Dir du dn 
: ~ $n l cn 
iku / k ik dn dn dn 
ik'u D ik’ 5B. dn D 
k cn cn cn 

zz 2 

2Vk (1+k)sn cndn TESI 

(1 k)u —— Eemere? 1+ksn? 

l+k 1+ ksn? 14 ksn? 


(Gauss's transformation) 

















-p 1—(1+k’)sn? 1-(1—k’)sn? 
(1-4 Ein Ge TL (EK) — 
(Landen's transformation) 
(+k yu (=x ) I4 VE. ` Kee da E. Mär dn VE 
2 ANE TI d-vVK (dn)(K-edn —1-Vk I- Vk ` VI dnY(K dn) 
2(1+k’) 
(1+dn)(k’+dn) 
E A E a Ee 
acobi's transformation. sn( iu, k) 2 i ————., en(iu, kK) 2 ————., 
en(u, K^) cn(u,k’) 
dn(iu, k) dn(u, K^) 
n(iu k)= ———. 
cn(u, k’) 


(5) Amplitude. | ? 
The inverse function p =am(u, k) of u(k, p)= F (k, oi -f 
o Vl-kĉ?sin?y 


is called the amplitude. 
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sn(u, k) 2 sing — sinam(u, k), en(u, k) 7 cosq — cosam(u, k), 
dn(u,k)= V 1— K?sin?)g = V1— k?sn?(u,k) . 
u(k,x)= (" — A ——, x —sn(u, k). 

0 1— 82 - Kid) 


n 


2q u — pim í-T(K'/K) 
am(u, k)= aK 2 n(14- q?) via > ) (q= e =e ). 


am(8, 1) 7 gd0 (Gudermann function). 


(IV) Weierstrass's Elliptic Functions 


(1) Weierstrass's 8 -function. For the fundamental periods 2w,, 2w3, we have 


l l 
u (u—-2no,—2me4) | (2no,*2mo,) 


|] 82 2, 83 4 8 u6 38283 e. 
=-2 00" +28” * 09" * 6160“ 


u 

l l 
£:60y/— — gay — l 
nm (2nw,+2mw;)* am (2nw,+ mee? 


where >’ means the sum over all integers except m — n —0. 
P(—u)= P(u). Putting vz — (o, +3), ¢ = P(w) (j=1,2,3) we have 
eite, +e,=0, eje, t €5€4 t €3€) = — 3/4, €10659€47 23/4. 
1 
g (u)=de /du=—2 2: ong. ma) 


e"(u)-4[ P(u)—e, |[ e(u)— e;]| P(u)—e; | -4£*(u)— g $(u) — g3. 
Addition theorem 





£'(u) - e'(v) 
8G (uc v)- — £(u)— Stolz 1 ouo. 


(e;j— e(e; — e) 


Sin) — e (j,k,1) 9 (0,2,3). 


P(utw)=e+ 


Using theta functions corresponding to r=w;/w), 


d7logd, (u/2w,) a), 


p'(u) Se ES oP (0) 3, (u/20,)9, (u/2w,)% (u/2w,) 
The relations to Jacobi’s elliptic functions are 


q zexp(imo4/ o). 





u cn?u dn?u 1 
9 1———— |= +(e,-— = + — — = + — ——; 
) EI ( 1 e) m €? (ei e3) Ss e3 (ei ez) ns 


Vee 
ET 
where the modulus is k= Ge K(k)=w Ve- e. 
17 €3 


(2) $-function. 
l l u l 
uj=— tt 
Swe u PA Exe (2nw, + 2mw3)° 2no, t 2mo, 
lo 3234-4855. 8 34 Ji y. 

Tu 60" 140" 8400" 18480" 

7 Ui uc hu + dlogd, (u/2w,)/du. 
§'(u) = - e(u). 
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Pseudoperiodicity. Putting yz $(vo) (j=1,2,3) we have 
§(u+2nw, +2mw,)= £ (u) - 2m, t 2mm, (n,m=0, +1, +2,...), 


— 1 em 
12e, à((0) ' 


1193 — 130, = 159; — 9102 = 2395 — 13093 = Ti / 2 (Legendre's relation). 


1 £"Q)- £"(v) 


n= nı 72 9570, 


Addition theorem $(u* v)-$(u)T $(v0) 








2 f)-i(). 
(3) o-function. 
7 1 S 2 nem=0, +1, 
del - e e hl ctas 2 nct as | EO ande 
m 2nw, +2mw; 2nw,+2mw, 2\2nw,+2mw; (n,m)-+(0,0) 
Ei s 83 7 8 9 


=u- Hc cie iren 
24.3.5 2*3-5.7 2».32 5.7 


-2 qiu? 3, (u/2w,) 
= 20) a "am (0) . 


£(u)2o'(u)/o(u). c(—u)-2 —o(u). 
Pseudoperiodicity. o(ut+2nw, * 2mos4) S (— 1) * "^ * ""texp(2nm, + 2my3)(u + no, + mo3)]o(u). 
(4) Cosigma functions o,, 05, 03. 

o(u- oj) -( I TES 


Mou ios ul (j=1,2,3; 9,2 9). 
o (w) 20, J 8+1 (0) USE 


ou) = — e" 


2 











, o,(u) , X 2o1(u)o3(u)os(u) Se o(2u) 
nz ahr eeng Gl 
_ o(u/a) | 01 (u/o) | oX(u/a) "e KE 
etu See EE d ECL where a-Ve, €5 Ec 
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17. Gamma Functions and Related Functions 


(I) Gamma Functions and Beta Functions (— 174 Gamma Function) 


In this Section (I), C means Euler's constant, B, means a Bernoulli number, ¢ means the Rie- 
mann zeta function. 


(1) Gamma function. rs | e 't* dt (Rez » 0) 
0 


1 (04) 
me e 't* ! dt. 
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In the last integral, the integration contour goes once around the positive real axis in the positive 
direction. 


T(nt¢ Dan!  (n2012,..)  I(1/2-2Vm. 


n-] ` 














I(z*l)ezT(2, OU- Siss — II r(z+ 2 = Oren- (nz), 
sin zz dac n 
NOE d Z \o=z/n ENG sinzz _ = x §Qnt+l) 2n41 
T =ze Un ; log TO +z) 5 log Cz z Pes hac 
(lz|< D. 
IT + p 2 oo 2 
ES = [[ 1 (14 as z) (x,y are real and x » 0). 
T(x) n=0 (x+n) 
Asymptotic expansion (Stirling formula). 
PI o LT) Be z172" 
I(z) zge^z?^^!V2az ECT | | (Jargz| « v) 
mugs Gg l 1 139 571 - 
meu 0 my [1+ at -- - LOL), 
l2z 288z? 518402?  2488320z* 


Q) Beta Function, — B(x,y)= f "echt mid — (Rex, Rey>0) 
0 


-T(x)TCy) /T(x +y). 
(3) Incomplete Gamma Function. 


y(v,x)= f 'trcte7tdtS TG) xU Pe PW, 29/2) (Rev > 0). 

0 

(4) Incomplete Beta Function. B(x.y)= (ec -Pdt  (0«a«l) 
0 


(5) Polygamma Functions. Y(z)= £ log T(z) 


T(z) of ont -zt oo 
j Se 2 s soe ER EE 














n=0 
Ves goose CD sg 
n=0 za n=0 (ZA) EE 


(II) Combinatorial Problems | (— 330 Permutations and Combinations) 


Factorial n! - n(n— l)(n—2)...3:2-1. 0!=1. 


_ a(a—1)...(a— r^ 1) 


Binomial coefficient ( ei Fy 


(1) Number of Permutations of n Elements Taken r at a Time. 
,P,2 n(n-l)...(n-r*1)en!/(n—-r).. 


Number of combinations of n elements taken r at a time 


Foe ICI M en MT DE, RET NIS 


Number of multiple permutations wll =n". 
. € (nt r—1)! 
Number of multiple combinations no, =p} 41€7 Hoar 
r'(n — 1)! 


Number of circular permutations PF 
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n 
(2) Binomial Theorem. — (ab - Y ( : (Ewe " 
r=0 


n! 


Multinomial theorem (ait... ta)" =È PEN 
I s... m 


p p 
NEE 


The latter summation runs over all nonnegative integers satisfying p, +... +p, n. 


References 


See references to Table 16, this Appendix. 


18. Hypergeometric Functions and Spherical Functions 


(I) Hypergeometric Function  (— 206 Hypergeometric Functions) 


2 T(at+n) T(b+ T n 
(1) Hypergeometric Function. F(a,b;c;2)= > ra FP Pm) (e) z 


nco Ma)  I(b Teta) nto 





The fundamental system of solutions of the hypergeometric differential equation 


2 
01-7277 [c - (a-* b- 12] 4 — abu=0 at z=0 is given by 


u, 7 F(a,b;c;z), us zl *F(a-ctl,b—c-1; 2—c;z) (c40, — 1, -2, ...). 


F(a,b;c;z)- F(b,a;c;z). dF /dz=(ab/c)F(a+1, b+1; c+1; z). 
T(c)T(e— a- b) 


F(a,b;c;l)z ———————— Re(a t b — ; 
(a,b;c;l) Dos 5 (Re(a + b — c) « 0) 
r 
F(biciz)o etc UW oC ta (Rec»Reb»0, jz|« D, 
i I(c) io T(a sS)T(b - S)T(— s) 
F(a,b;c;z)9 — ———— — L zy ds. 
(n bi6iz)* ze TOTO -io T(c^ s) are 
(2) Transformations of the Hypergeometric Function. 
F (a,b; c; z)- 0 — 2) *F(a,c- b; c; z&) 
-(1—zy-^^bF(c—a,c— b; c; z) 
I(c)T(b — a) 1 
«(0-37 O16 79 qo apa el 
CD NEC E A e EE 
T(c)T(a - 5) 1 
+(1- eer —a; b~at+1; —— 
re Tab) Aor Hed js] 


e l(c)T(c— a— b) 


m-——————————- F(a,b; a+ b—c+]1; 1- 
EE EE ee a 


T(c)I(a- b— c) 





— 7\¢-a-—b Kë ONE om su me icm. E 
T(l-z) TG) F(c—a,c—b; c—a—b+1; 1—z) 
= I(c)T(b — a) cr n 8 8 -d 
= DOP a) z) F(a.1 c+a;l—b+a; d 
T(c)T(a — b) 


ACANAR xcu E o Kelt 
Fone F(5.1 c+b:1 a+b; =). 
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(3) Riemann's Differential Equation (— Table 14.II, this Appendix). 


a tere = D ee IF: 


dz? z—a z—b z—c dz 
aa'(a—b)(a—-c)  BB'(b—-c)(b—a)  vy'(c—a)(c— b) u M 
d z—a = z—b Y z-—t een 


Here we have a+a'+p+B'+y+y =1 (Fuchsian relation). The solution of this equation 
is given by Riemann's P-function 


a b c 
u=Pia B y z 
a’ B’ y 


-£y (EF [serere +y; loa 








(c—b)(z—-a) | 
(c— a)(z — b) 


(a — a^, B— B'. y — y zinteger). 
We have 24 representations of the above function by interchanging the parameters a, b, c; a, 
a’; B, B’; y, Y in the right-hand side. 
(4) Barnes's Extended Hypergeometric Function. 


I Q1, ..., Bases 2 = A ee Re RE wh n^ 1 1 eee t 1 


=T(atn)/T(a) . F(ab;c;z)m;F.(ab;ciz) oFoa(x)=e*%, Fola; x)=(1— x) 7%. 


(5) Appell's Hypergeometric Functions of Two Variables. 


Po: DÉI Y; xy) = S $ eat teh x" y^ 


m=0 n=0 m!n!(y),., 


Sc Sei (a) m+n(Bm( Bn 
Fa; B, B15 x)= SRI ar Os m A a d 
MORSU De tee 


H 


n 


F(aaBByuxy)e X E eee BeBe om ys 


m=0n=0 m!n!(Y)man 


= = bech, Bman 
F P; Y: 3 , = Paty, Ee 
EE EEN 


m n 


(6) Representation of Various Special Functions by Hypergeometric Functions. 


(1— x)2 F(—v,b;b;x) e "x (Ss) (tanhx)F(1+ 5, dk 





> l+n; sechix . 


log(14 x) 2 xF(1,1; 2; — x), llogit* = së 51; 5327), 


sin nx = n(sinx)F( a : zü : Zeie) 














cosnx- F(A^. 8. l.g *x)= (45. OP 
(5 23 33 SIX (cos x) Fi 3-257354 


1 1.3 xm = | oso 2 
arcsin x = xis. y PE x) arctanx e xF( 5,1; 5: Gel 
(2n-1)!! eis EE 
Plet IG (omnta 35* ) 
,Qn* D!! 3. 3 : 1 
TEL kl ët "Cogr zl ann Kë ) (spherical function), 
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where 
umo ped e, Ee Q2(- Del. 
m(m-—2)...3-1 (m odd), 


11 a 1 1 gm ds 
Koze y 1; æ), Ew=zr( A l; “) (complete elliptic integral). 


x x? vie 1 ) 
=—— F 1; —— )=————_ | F, | v+- =; 2v + 1; 2ix ]. 
Ja: 3x1) (o a 2T(v4 10)! L a" 


e*- E F(a,b;a;x/b) ,Fi(a;a; x)= gFo(x). 
—00 


(II) Legendre Function (— 393 Spherical Functions) 


(1) Legendre Functions. The generalized spherical function corresponding to the rotation group 
of 3-dimensional space is the solution of the following differential equation. 
d*u du 


= 52) 4 94€ 
(1 SE 2:73, * 





TAZ, 





2 
e 2 
»(v-1)— cn 


When »=0, the equation is Legendre's differential equation, and the fundamental system of 
solutions is given by the following two kind of functions. 
Legendre function of the first kind $ (z)zm P,(2)=2F\(— v, v+l; l; Izy, 
j 
Legendre function of the second kind 








T(r-l1)Vm — ,, (3 y+ 
24 1 


= yu a: 
Q, (z) 2 3 2 H p+ 2 3 z? l 


~ PATB] 
Q,(x)= 18, (x+ 10) +9, (x — 10) 


(cos rz) P, (x) - P, (— x) 
= n — — ————————— (rzinteger; —1« x«l) 
Janne 


Recurrence formulas: 
PQ=P_,1). Q(z) -Q_,-1(2)=x(cot vx) P,(z) ^ (vsinteger). 
B,(—2)=e* "PB, (z)—(2/m)(sinvm) Qz) O,(—z)=—e* "Q(z) (+ =sgn(Imz)). 
(2?— 1) dB, (z)/dz=(v + DIB, 41(2)- 2B, (2)], 
(2v 1)z3B,(z) 9 Qr + DB, 4 (2) + PB, 1(2), 
(2?-1)dO,(z)/dz=(» + DÉI, zl - zo, ()], 
(2r + 1)20,(z)=(v + 1)9, 4 (2) + PD, GG), 








T(» +1) y IEN 3 1 
m471/2)--Mgppg —— —— — z7”! BI poen 5) 
P, (z) 7 T[» * (3/2)] 2a 2 2 2 z? 
T[v *- (1/2)] l—-» ~-p ] 1 
een HUM (arn ce Lag H 
d TO) ANA 270253 z 


Lo l 
v—n v+ntl 





P,(cos0)= 9/7. Y (ml 
0 


n= 


)P,(cos8) (rzinteger; 0« 0 <r). 














TES 2 Va 
Estimation: P,(cos8)| < ———_, ,(cos8)| < (O<O0<7; vl) 
| | V vr sin 8 2 | V v sin 
E _ pp (vl —y 
P(I)=1, P= SS R {z ) 





l »tl —y 
KOLENE "eerst yr ZR: 
S 4V7 2 ( 2 ) 
(2) The Case v=n (20,1,2, ...). In the following, the symbol !! means 


nit m(m-2)...4-2 (m even), 
^— |m(m-2)...5-3:1 (m odd). 
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The function P, is a polynomial of degree n (Legendre polynomial) and is represented as follows: 














P,(z)= »; 2 cy 
` (2a)! x zd are o.c 
ny? ( 2o xti o 5) 
_ (27-1)! ae n(n—1) n(n—1)(n—2)(n— 3) n 
RE Ona C. Gna a wp 
er (2m+2j-1)!! 
ki? 1) Qj!Qm-2j ^ > 
ZŠ ez Qme2jeD ae 
Ee (Dr Die 2017 
(2n! _. es 
P,,(cos@) = mop ly -— l-n p 
T 2(2n—1)!! 1:3 n(n—1) 
= Ga [oom t 1 Qn- Gc min 2)0+ — 13 Qn De at $9740 
1-3-5 n(n—1)(n—2) 
*t133 3 Qn-DGn-3)0n-5 3)0n- 5) cos(n—6)0 +... 
" [ex (n even), 
0 (n odd). 
4 Qn 1: (n1) 
E Qn DI ino ONE T-Qn3) sin(n - 3)8 
1-3. 
ISIE) sin(n - 5)8 + ... (ad infinitum) (0« 0 « 7). 


1-2- (2n - 3)(2n +5) 
Laplace-Mehler integral representation 


P,,(cos@) = 1 {"(cosd+ i sin H cos o)” dp 
0 


E pe ds v2 «sinn (1/2)]g | 
7 Jo Vcosg-—cosÓ 7 Je YVcos0—cosg 


DEN a sl Lors V 27+ p? ) 





dz” 
P,(1)=1,  P(-0)-(-1"*, Pay 4 (0) =0, 
ww QD! (= 1)"Qn-1)!! 
ORC aa ome 
Recurrence formulas: nP,(z) - Qn — VzP, ..(z) (n— V) P,. (2) -0, 
S dP, n(n+1) 
(z -DuapTnaGP,- Py) = a 2n4] Fu (Parti Pi)T7(n*Y(P,,- zP,). 


a. Zl ztl 
Q,(z)= Aus ae 1) log — |- 5 Pale log 44 








n+l 
=n f”... GE 


is (z2—1)""! 
oo (t—z)" 
A (Gay 


n d” fe p [2n A 
REH je »'f TENE (Rez » 1). 


== 2^ 
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2-(2n)!! 
(2n+1)!! 
1-3-(n+1)(n+2) 
PE add 


1:(n 1) 
1-:(2n+3) 


Q,(cos0) = cos(n + 1)8+ cos(n+3)@ 





cos(n - 5)8  ...] (0« 8 « v). 




















l+x l (ET 
Q,(x)7 x dx" eie -1)" lo 9081 zt |- 7 Palos 7 
l (7 ~ 1 
= 5 P,(x)logs——~ E EP, (9. 
2 1- pem. 
1+ 1+ l 1+ 3 
Q)- zlog A2, Q= žogi- Q= 1Gx?- log + - 3x. 


(3) Generating Functions. 


x h"P, (z) (I| & min|z € Vz*— 1 |), 





V1-2hz +h? 2s z+ VP=1)). 


(If — 1 < z < 1, the right-hand side is equal to 1.) 
oo ES 
= È (27+ DP,()A, (z)  (-V?-1|«|z* Vzi-1 |). 
n=0 


= vV1- 2 e 
E qp tc ie = > t"©,,(z) (Rez > l, |z| < 1). 
V1l-2iz+z? z7-] n=0 


= Vx?+y?+z?,  cos@=z/r, x,y real, 
CO 
Lely > | | | 8 i | 
m=1 | yQmim tz) EE TK yQmim — z) +x?+y? 


phat 
(Here the square root of a complex number is taken so that its real part is positive.) 





oo G-A 
i+ > eli ) (Rez » 0), 


n=] 
- » (—1) By, 
I. GER $ Qn EEN r?-!p, 1(cosü) | (0«0«2m;z real). 
pal 


(4) Integrals of Legendre Polynomials. 


Orthogonal relations: f. P,(z) P, (z) dz = Sym In+1 Le 


MESA (k=0,1,...,n— 1). 
-1 


A(A—2)...A—n+2) 
(A n DA -n—10)...(AÀ 1) 
(A— (A —3)...( n2) 
(A no D(A-n—10)...(A -2) 


(n even), 
Tat: 
ý (nodd) — (ReA» —1). 


: 2(m—n+1)(m—n+3)...(mt+n-1) 
f P, (cos0 )sin m8 d = (m — n)(m—n-42)...(m* n) 
: 0 (otherwise). 


(m>n; m+n is odd), 


(5) Conical Function (Kegelfunktion). This is the Legendre function corresponding to the case 
y=—(1/2)+iA (A is a real parameter), 
ARP. (4A? + 17(? +37) 8 
P_ajaea(cos8)=1+ E sit 5 ee KC 


P_aysyealx)= P_«y2-al). 
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(III) Associated Legendre Functions (— 393 Spherical Functions) 


(1) Associated Legendre Functions. The fundamental system of solutions of the differential 
equation in (ID) (1) is given by the following two kind of functions when p+0. 
Associated Legendre function of the first kind: 


zt1\'? dex 
PE oso ) fi ( »,rtlhl-y ^) 
where we take the branch satisfying arg[(z + 1)/(z — 1) |j"? =0 for z> 1 in the expression raised to 
the (u/2)th power. 


Associated Legendre function of the second kind: 


E eiun To tut Dya EE v+u+2 PEEL s d 
27! Tnr[v-c(3/2)] 1 2 ct 2 O? ut qa 








where we take the branch satisfying arg(z? — UR =0 for z> 1 in (z*—1)"?, and argz " ^ ! =0 
for z» 0 inz^" ^^ !, respectively. 


P! (x)= e'"" 589 (x i0) = e" (x — i0) 


u/2 = 








"PO-aill-s 
OF (x)=e "Mel fue 128) B(x + i0) + egy x — i0) |/2 
T(r+u+1) 
TENE G p — —————__ P,” we 1). 
2sin pa E (x)cosum TI(»-p-l)' e J (ERAN 


Integral representations: 


CR p/2 +1 2872 
$,^(z)- o -D DEN E di 
Zens Tli 0/21 J, ten) 


n—2 oo 1 yt 
DEER (z2-1) f (sinh)? +! 


(Ren — L, larg(z + 1)| <7). 








2’T(p—v)T(v+1) (z+coshr)**”"*? 
(Rez > —1, [arg(z € 1)| « v, Rev > —1, Re(p—v)>0). 


Bo *(2) = Và Rem CH, zi DP" [° cosh[v-- (1/2)]: 


— — — s1075 dt 
T(u* v DI(g—») a (z ^ cosh r)" ^ : 


(Rez > - 1, Jarg(z X 1)| v, Re(u t »y) » — 1, Re(u — ») 50). 


a * cosh| (» +(1/2)}1] di 
$B^(cosha) = Vi ERR (sinha)" (oem (a>0, Reu<3). 
0 


ut(1/2)] (cosh a — cosh t) 
B (sin8 ) " cos[ (»- (1/2)) o] do 1 

P} 0 m 0<0<7,R =). 

GE 7 rf- sant (cos — cosg )" * /? ( GER die) 

B T(2u 4 1)2-"(sin8)" x: petre 
P,-* (cos0)= E SM 

ren) E J (a 2) 72 

(Re( u - v») » — 1, Re( y») 50). 

P,” (cos0)— eres M e" "ett, (rain )t" dt (0«0«5 Re(u v) - 1). 
oos £7 e "up DEE ee 


(Re(v +p) > - l,Re»» — l,[arg(z € 1)| « v). 


ei h Si -[» * (1/2) 
O#(cosha) = “V5 a EC TE muria MM 
T[-a4(1/2)] ut 0/2) J (cosh t — cosh a) 


(a >0, Rey «1/2, Re(v * u) » — 1). 
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Recurrence formulas: 
(z^ - 1) 4585 (2)/ dz — (v — p+ DBE, (2) - (0 2382 2), 
Qr + 12987 (z) 7 (x — p+ DP, iG) + (+ KE Ch 
$^ , (2) 8702. 


T(v— yu 1) 


=p z= —2iun 
POSE T(vt+ptl) 


Q(z), 


(1— x?) 4P/ (x)/ dx 9 (v + 1)xP#(x)—(v— u + DER, (x). 

The case when y is an integer m(m —0, 1,2, ...) and v is also an integer n: 
PP **(x)*- 2(m tr 1)x(1 — x2) ?PP* (x) (n— m)\(n+ m+ 1P” (x)=0. 
(2n * 1)xP"(x)—(n— m+ 1) P755 (x) (n+ m) P(x) =0 (0€ m«n 
(x?— DPI (x)/ dx — (n — mt 1) P7, (x) - (n - D)xP? (x) «0, 

PP? (x)- P” (x)e Qn 1) V1— x? P” (x). 


T(rv— yu 1) 


Che T(v+ pt) 


KE — Zen (sin ur)" (z) I 


OF (z)sin[(» + u)g] Op. (z)sin[(» — um] we" (cos se B#(z), 


em 2), 


PEC- 2)=eF "Pe (z)—(2/m)Isin(y + w)mje~#"0,(z) (F = —sgn(lmz)) 


O#(—z)=—e7""™QH(z) (+ =sgn(Imz)). 
ef gut v) 
Vissimha 
e "bx € 10) = e* "LO (x) x (m /2) P^ Q9. 


e^" (cosha)- qB-r-VA(cotha) ^ (Recosha 0). 


sin(y + u)r T COST COS UT 


Qt,- (x)= Q (x)- — P} (x), 
sin(v — u)m 


sin(v — u)r 
T(»— yu 1) 


im T(» - u4 1) 


cos ur P, (x)— i sin vd (x) |, 
Pj (— x) =[cos(v + u)r] P7 (x) - Q/ m)[sin(v + 4) 7] Q7 (x), 


QO} (— x)= - [cos(» + u)v] Q (x) t (7/2)[sin(v + u)r] P^ (x). 
TU. +v~+m)(z?—- iy 
DO +»—m)m!2" 

—M(5\ = (72 1\—m/2 [7 e m 
$-"(z)- (22-1) f T f P,(z)(dz)". 


d"$1(z) 
dx" ’ 





Gd 


On)- 3-17? 





Plan mer; mtl; e-n 





P”(x)=(- ee m*»tl; mel; 
I(1-»—m)m!2" 2 
=(-1)"(1 yn m 
zt xm f... f P, Gd" m C yn me dÉ 
Griet eu mn 9 Qr QC 


The values at the origin are 


Var 2^ 


de sl caD/2 nf» ue DA 


Q,” (x). 


BY) 


QF "(z)=(—1)"(z?- weg NEN Nro CO 


=) 
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dP} (0) E 2^* !sin[r(v + p) /2]TlG + wt 2)/2] _ Va 2et! 
dx T[(»— p * 1/2] Vv Ti- y  1)/2l]r[C- » - 10/2] 





PU One 
Z Zenn) 
yt jur 
2 J/r[(»-»*0/21. 


O#(0)= —2^-! Var sin( 


dQ} (0) 
dx 








-)2 r cos( 


(2) Generating Functions. 


n! 


an S RE (cos). 


(cos0 + isinÜ sing)" = P,(cos0)-2 Y (- i)" 
m=] 
l l 


du e E 
v—n u^ (cos) | (0<9< 7, n 20). 





P, "(cos0)— sno y c 


(3) Orthogonal Relations. 


+1 2 (n+m)! 
pr p dx = L—— —_—_ 6,y. 
T n (x) n (x) X Intl (n— m)! nn 


Ae (ntm) 
2n+1 (n-m)! 


San’ On: 


f RE f "sin e * -") p" (cosB)P"" (cos8) dO— 
0 0 
(4) Addition Theorems. 
W,(zt- Vzi-1 Vt?-1 cosp)=B,(2) BF) 2 € (UnL "(£)cos mg 
m=1 
(Rez >0, Ref >0, Jarg(z — 1)| <2, Jarg($ — D)| « v). 
Q(t- V? -1 VI?—1 cose) - S, (8, (7) +2 5 (- D" 9^(1)9; "(r)cosmp 


m=1 


(rr real, 1«t' «t, vz-negative integer, p real). 


P,(cos8 cos8' +sin 8 sinb’ cosg)= P,(cos0) P,(cos8")-2 >) (— 1)"P,"" (cos0^) P" (cos0")cos mp 
ml 
T(v— m4 1) 


Porm” (cos8 )P7" (cos8")cos mp 


oo 
= P, (cos0)P, (cos0") +2 Y 


m=) 


(0<0<7, 0«0' «m, 0+6' <1, o real). 


Q, (cos cos0' + sinÜsin8' cosy) = P,(cos8") Q,(cos0) -2 3 (— 1)"P, "(cos0") Q" (cos0)cos mp 


m=! 
(0 8' «7/2, 00 v, 0-- 0' ca, q real). 
Gase GE oo 
O, (rr + V7241 Vs? x1 cosha)= o lO OONO ire- me 
(tr T T osha) PN UNSER TT PNE Di ie OI" (ir )e 


(Tt, 7’, a 50). 
(5) Asymptotic Expansions. 
2’T[»+(1/2)] s 277 Tl- r-a] 


iye iu Ad 
s. Va -REl Vr T(— u— v) 


SC E [1406 77)] 


(vy * (1/2) integer, largz| « v, |z|>1). 
Va eis T(rt- ut 1) 





= -v»-] -2 
Q(z) Ge ri»«6/2] [i+ 0(z )] 
(v+(1/2)#negative integer, largz| « v, |z|>>1). 
Pipa Tv+ptl) cos[ (v (1/2)) 8 — (2/4) - (uyn/2)] [140079] 


Va Tl»4(3/2)) V2sin@ 


(e«0«m—6, e>0, |v|»1/e). 
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2T(» * p 1) 
PP E E 
a Va Tl» (3/2)] Se 
1 1 3 2141 d tl) um 
i=0 r(5 «a)r(5 -a)r(v 1 SI sin )* 9? , 
DÉI 1 
Q! (cos8 ) = Va aen 


1 1 3 21-115, QUE Dm | um 
o0 T(z eerr(s arre EN ws (r^ 5 ys 4 d 


x won» 





l aiD 
Des r(5 «a)r(5 -a)r(vs 1 5) Gang" 


(In the final two formulas the series converges when v + p #negative integer, v + (1/2) #negative 
integer, 7/6 « 0 « 57/6.) 





29 Jal (n) 
2 


l 8. . 40 
IC DER P, * (cos0)  J, (n) * sin 7 25 patr do): O(sin*5) 
(n2 Qr + 1)sin(6@/2)). 
(6) Estimation. When vr > 1, y-pt+1>0, p> 9, 
oit 8 K l 





|P,*" (cos8)| « 











TI(v-1) V»rsinó (sind) 
|0,** (cos8)| E m 
|p,t7 coste rr war) mu 


(7) Torus Functions. These are solutions of the differential equation 


2 2 
ECH A cothn™ — jobs Wë u —0. 
N 4  sinh?q 





The fundamental system of solutions is given by 
Tr -a/acoshn), £7. 1/2)(coshy). 

The asymptotic expansion when m=0 is 

98, La 72(cosh q) 


— Dte^-0/2» | 2T?2[n - (1/2 
= (n )te E ( / )] (og4+n)e-?,Fi (5 m ine le") ea B R 


` Tin+(1/2)]V7 | mnn- 1)! 
Here 
= ] ES = 
rS (1/2)In (0/2) 5, (1/2)8/2)!n - 1 /2)lIn G2) any 
1:(n— 1) 1:2:(a— D(n-2) 
" (2n—3)12n— 1)!! dg 
[Qn -2)!T 
T[n+(1/2)] & r[i- a/2]r[s 4 1 (0172)] m 
UE aM (n- 1) Lea! — pan wt Uy — 9j 1/2) On «1-0/2)€ WS. 
where 
TEEN E EE E 
References 


See references to Table 16, this Appendix. 
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19. Functions of Confluent Type and Bessel Functions 


(I) Hypergeometric Function of Confluent Type — (— 167 Functions of Confluent Type) 


(1) Kummer Functions. 


c T r n 
o(z)—iFi (a; e de 2 “te wen "i 








n=0 
r(e) 1 zt 1 c-a-1 
aree pee wed dt (0<Rea<R 
T(aM(c—a). IK (2-10) (0« Rea « Rec) 
Del" +1 oe 3 
= ena aii 477 T a gente Ge Rese Rew. 
Tae- a) f. e” (0-10) (19 f) ( a « Rec) 


The fundamental system of solutions of the confluent hypergeometric differential equation 
(Kummer's differential equation) 


dr 
^E L2 
when c 250, — 1, —2, ..., is given by 


+(c-2)2 -a= =0, 


é(z)miF(a;c;z) | éyz)mz' *F(a-c-*l;2-c;z). 
d, F(a; c; z)/dz - (a/ c Fi(a-* 1;c+1;2z), 
F(a; c;z)= e*,Fy(c - ac; — z), 
a,Fi(a*l;ctl;z)-(a— ce), Fi(aict 1;z)+¢,F (a; ¢; 2), 
a,Fi(a*t1;c;z)-(z*2a— c) Fi(aieiz) t (c- a, F,(a— 15 ¢; 2). 
Putting (a), =a(atl)...(at+n—1)=T(a+n)/T(a) we have 
: 1 T Lap (a)na 
] —— |F, (a; ¢;z) = ————— 
m, Ti Ft (eT yy 
Asymptotic expansion: 


9? (a)(a—c4l 
boas EEN Lach 


n=0 


Fi (a+n+1;n+2;z) (n20,1,2, ...). 


(c— a), (1 — a), z", 


(=z) "+ B,ez^- 2: 
n=0 de 


n! 


o = (a),(a — c *- 1), =n 
ANT. DA —— 0 E z) + Bez" >) 
n=O i n=0 


(c — a), (1 — a), BS 


n! 


(|z|>|al, |z|>>|c], 37/2 «argz « 7/2, c integer), 


where 
Aus e ""T(c)/I(c- a), B, -T(c)/T(a), 
A42 e arr HO c)/T(I — a), B,-T(Q-c)/T(a—- c^ 1). 


(2) The fundamental system of solutions at z=0 of the hypergeometric differential equation of 
confluent type 


EE EE E | dë 


dei d z? 





is given by 


z0/2*19-: PÄ) Aner £2p 4 o;z]. 


(II) Whittaker Functions (— 167 Functions of Confluent Type) 


(1) A pair of linearly independent solutions of Whittaker’s differential equation 


er |. 1, x, Dt 


SS 1 W=0 





N | 
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is given by M, ,,(z) 2 z*^* UPDe772 F [+u x (1/2; x24 1; z]. 
Whittaker functions: 
r(-2 IO 
(-2p) DER (2p) 
T[(1/2) - x — x] T[(1/2) 4 & — x] 


When 2y is an integer, the above definition of W, ,(z) loses meaning, but by taking the limit 
with respect to p we can define it in terms of the following integrals. 


;pt0/26—2/2 
T[ n (0/2) - x] 
EONS Ee 

T( u (1/2) - «] Yo Z 
(Re[a+(1/2)—xK]>0, |argz|<z). 


e i pix De eil s- ui 0/2] 75 p 0/2]. , 
ami j T[ -«* p (1/2)]TI - x - n (072)] 


W, (2)= M, -,(z) = W, -a (2). 


x—(1/2 


p - (0/2) 
) di 





W.(Z)= 

ioo 
M, ua saa o) m C7 yzn*0/26 -:/2 Qu^ DF (-1; 2u + 1;z) (120, 12:55): 
M,,, (2) = gin UM. (e"z). 


T(2y41 ; ; T(2u41 
M,,,(2)= ( H ) ew |. (ez) + ( H ) eeh, (z) 
)J TÍ u* (072) - «] T[ n (1/2) 4 «] 
(-3vc/2«argz« 7/2, 2uz —1,—2,...). 
T(2u+1 , T(2u+1 , 
M, C7 DT y (ec) — IOn*U — 2-st-a-a/nly, (z) 
T[ ik (1/2) - x] T[ i (1/2) 4 «] 


(—7/2«argz «37/2, 2u2 -1,-2,...). 

W, (2) = SIN, aj mz äi [(1/2) - «+p JW, -,, (z) 

e SIDD, anmaeas( 10/2) - x- ulWe-1,4 (2). 
zdW,,,(z)/dz=[«—(z/2)] Wey 2) - | p°- («- 0/2) ]W.- 14 (2). 

When x is sufficiently large we have 

M, TP Qu + D 77/97 !/5cos[2(2«)^ — um — (0/4) |, 
Wa (Z~— (4z/ x) expl —k-klog x)sin[ 2(z«)'/? — mk — (7/4) | 
Wg (Z)~(z/4n) 


Asymptotic expansion: 


exp(x— Klog« — 2(zx) ^). 


aua Zj4.K 
W, (Deme "7 





«(1+ Y [4 - (k- 0/2)] Le- (x - 8/217] Lu 


nich 
(2) Representation of Various Special Functions by Whittaker Functions. 


(i) Probability integral (error function) erf x =O(x)= Hos T "e^ dt 
Va Ju 


E E e" ix" Lie" SD ai yg (x2) 


Asymptotic expansion: 


-x? D D * 
Nod sott jede D E ; 
2 2x 2x2 (2x?) (2x?) 


(x ve =c- ie 
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where C (x), S (x) are the following Fresnel integrals. 


C(x)= [ess tai i + sin x?+ ot 


S(x)= (EE dt= 5 em ^«o(- z} 


(ii) Logarithmic integral 
z dt 
o logt 


= — (log 1/2) z! PW a «(—logz). 


Liz= (When z> 1, take Cauchy’s principal value at t=1.) 


Liz is sometimes written as liz. 


(3) Exponential Integral 


; l 
Eix= f d E dt (When x »0, take the Cauchy's principal value at 1 — 0 while integrating.) 
— oo 


oo n 
-C-log|x|- > dé (x real, +0) 
n=] i 





Te 1 1 
=e : EN! ces F reu PCr x|. 
ET 
Cosine integral Cix=- f° l u= C+logx- f "1-088 gy 
x 0 


Sine integral Six= e sini —— dt, 
0 


æ Sint ; T 
v UD Six—-. 


Sixz-— 
RS 2 


1 1! 2! 3! 
Asymptotic expansi Ei Ci gel Cara t 
ympto pansion iix 2 Cix-isixze (iratum (Get ) 


(HI) Bessel Functions  (— 39 Bessel Functions) 


(1) Cylindrical Functions. A cylindrical function Z, is a solution of Bessel's differential equation 


d, 4 aZ, 2 
SS E +(1-5)z =0, 








Recurrence formulas: 
Z,—1(z) + Z,41 (z) =(2»/2z)Z, (2), Z,-1(2)- Z,41(z)=2dZ, (z)/ az. 
derre (z)dz 2 z'*'Z, ,(z), | eee (z)dz= —- z ™°’Z, (z). 


As special solutions, we have the following three kinds of functions. 
(i) Bessel function (Bessel function of the first kind). 


LGs(£) y LH ( Z )- Ma, Qiz) 


of 5] ES FF (largz| « 7). 
foo (IT (vp 1-1) M2 Qiz) "2j" T(y +1) 


J, Lem: les enwi], (z). 
J-a (z)=(- 0, (2). 


Esch l d\"(si 
SE Z "WE E (n20,1,2, ...). 


(ii) Neumann function (Bessel function of the second kind). 





WEE 





om looss, (z)- J EEN (vzinteger; |argz|« v), 
& (Gu 
co DG D1 32 


EE [o x 1) 


i=0 m-l 


Mode (X(Clog5 ) - s). 


; (ZY letz e(l 
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N_,(z)=(- 1)"N, (z) (n=0,1,2, ...; largz| « v). 
N, (e0™z)=e 70N, T Gee este (z). 
Naaa) =- D" Zant 
(iii) Hankel function (Bessel function of the third kind). 
Hf" (z)&J, (z) + iN, (z), 
HO (z)zJ, (z)— iN, (2). 
H (iz /2) = -2ie- 9"? (az) Wo, (2). 
H®(z)=e""H(z), HO (z)9e-'""HO(z) HO (x) — Hit) (xv real). 


(2) Integral Representation. 


Hansen-Bessel formula ` J,(z)— i d +T giz costo in[t — (7/2) gy 
=r 


n 


WT i 
=! f e" 995! cos nt dt 
T JQ 





[eos sint — nt)dt (n720,1,2, ...). 
0 
Mehler’s formula Jo(x)= E J ?sin(x cosh 7) dt, 

0 


No(x)= — E [ cos cosh t) dt (x » 0). 


2(z/2) 3 
Poisson’s formula J,(z)= ME CL d / ^cos(z cos t)sin?" t dt (Rev > x} 
Va Tv - (1/2)] "o : 
4z/2) S ron 
N,(z)= ate | f / Zeintz sinz)cos? t dt — d SE cosh” rar | 
Va Tir +(1/2) 0 


(Rez>0, Rer» -1/2). 


Schlafli’s formula ` J,(z)— 1 f "cos(z sint — vt)dt — mE d e-zsinhie~"dy (Bez 0 
0 0 


N,(z)= 1 f "sin(z sint — vt) dt — 1 f Persio? + (cospr)e™”]dt (Rez »0). 


d 2i c— ioo 


so- D el Al El ""Idt (c>0, Jargz| «v, Rev» —1). 


2(x/2) ` oo sin xt 
J,(x)= d 
(e vera Gaye 
" 2(x/2) ` c  cosxt 1 1 
Ge veritus qup m Lon -sRe«;) 


Gym gr [COP emcommemias (Rez 0) 
— 00 


(The contour goes once around the negative real axis in the positive direction.) 


Sommerfeld’s formula — J,(z)-7 ER 2n — 94 io | izcosty icht (0/2) gy 
tion 


up i [n eiz costei- 6/2) dt, 


—mqtioo 
H(z) = "o. ee (—yn<argz<7-n, 0<n<7). 
H{(z)= -7 SC Se ež coshrtdt (O<argz< v; when » —0, it holds also at z=0). 
0 
ie "7 (2/2) [| ereh sin? edt 


Va TĪ» +(1/2)] "o 
(0cargz«v, Rev>-—1/2; when z=0, —1/2« Rer « 1/2). 


HY - 
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— iva /2 . 
Hz) = SES e-i dr — (Oc argz c m; when argz 20, — 1 c Rer « 1). 
0 


(3) 


(4) 


Generating Function. 


ze oo 
al 17 ean $ X tc 207 o. 


nl 


oo 
exp(iz cos8) = zx iJ, (z)e"? = Jo(z) +2 Y i"J, (z)cos n0. 
n 


nzl 


[5^o&-2 X 1a 
n=0 


l 
Ess 








Kapteyn's series 











pre $e 


Schlómilch's series. Supposing that f(x) is twice continuously differentiable with respect 
to the real variable x in 0< x « v, we have 


> nz 
2 1 z2 Ee on( ) 





Ti 5 Zon 3 a,Jo(nx) (0« x « m), 


n=} 


where  ay=2f(0)+ 2 d ey f */2 (using)dg, 
n=- sfa uf Ve uf’(using)cos ng dq. 
1=Jo(z)+2 > Jyn(2) =[Jo(z)P +2 > [J, CF. 


Addition Theorem. For the cylindrical function Z,, we have 
oo 
eZ (kR)= Y JL(kp)Z,.,(kr)e"* 
He — DE 


—pe 
(R= Vrpi-1rcse, 0<4<3, site, 0<p<r, 


r — pe" 
k is an arbitrary complex number), 
Z, (kR d SR: 
ue e Ak "riet 5j Ee 


n= 





(v zz negative integer). 


i+] 
exp[C- D ikR | (CDt = 
R "i ve Vrp 2 Om + DJs a2 ko) Hg a 2r) Pm (cos q) 


(11,2). 


oo 
un a Y imQm+ DJ m+ a/ai KP) Pm (cos 9) 


m=0 


-YTQ) X Qoi, s (KoXkp)""CiXcosg) (+0, -1,-2,...), 


m=0 


where P,, is a Legendre polynomial, and C™ is a Gegenbauer polynomial. 


(5) 
with 


Infinite Products and Partial Fractions. Let j, , be the zeros of z ^ '7,(z) in ascending order 
respect to the real part. We have 


= (2/2) 39 EE 25. 
LT trn ii (1 =| (rx —1, -—2, -3,...). 





nel 


Note that if v is real and greater than — 1, all zeros are real. 
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Kneser-Sommerfeld formula 


wd, (xz) z Se J, Lëck Lab) 
4J, G) [J, (2) N, (Xz) - N,C2)J, (Xz)] = > CREG 





(O<x<X<1, Rez>0). 
(6) Definite Integrals. 


n/2 Liya z _ 1 pzsin(zsin@ ) 
f J,(z.6088)cos 8 di = >- d J4, (idt, d J (dto — f — ang — egent, 





b/2a) T( p+) ty Ant) —p 
UU PEE! u p l EE 
[re (uf Idee Teen z jo Li rt | 
(Re(a+ib)>0, Re(a—ib)>0, Re(u+v)>0). 
(2x) T» + (1/2)] 


(at p) Va 


(Vath -aY 
vb” 


1 
]'e"eora- EEN Rea» Im). 
0 


[e oE " (Rev>0, Rea»|Imb]). 
0 . 
tdr g^ Voie 


(r,x real; —-/2«argVr?—k? sii O«argk« m). 


; — Ó— ik V r3 x? 
Weyrich's formula Sf eH (VHF )dr= 4 
— ce 


V r?4- x? 
(r,x real; O«argVk?—7? <a, O«argk« m). 


Sommerfeld's formula J "Äztie" ENGEN 
0 


Weber-Sonine formula 








a/2p) T[(»- u)/2 + 2g? 
[saneta EP D p /2p) TIG + p)/ l r SEN, qais d 
0 2p"T(r 4*1) 2 4p? 
(Re(ut+r)>0, largp| «7/4, a0), 
[| (ane rint die Set? (Re»»-1, Jargp| « 7/4). 
0 (2p7) 


Sonine-Schafheitlin formula 
a*T[(ut+v—A+ 1)/2] 


?] J, (Bt) ^ dto MM — 
f a (at)J,(bt)t ^ dt 2p» I DP[(— uo 0A D/2]T(Cu 1) 


ptv—-A+l1 u—-ep-At] E" 
F(A EE DR pl p 
(Re(u+v—à+1)>0, ReA»-1, O0<a<b). 


(7) Asymptotic Expansion. 
(i) Hankel’s asymptotic representation. We put 


äu lag 291. [42 - Qm - ] 


22mm! 


(»,m)z (m21,2,3, ...); (r,0)z I. 


For |z»l»|, |z[»1, 


[ M-1 m (v, 2m) 
porum 
) 


-yZ sin(2— 7-7 Soar auer 
= Z 


(—7«argz « m), 





J,(z)- 2 cos(z— E = 1) 


«oder 
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M-1 m (0,2 
No 2. an(e- F-5) Ep ou Sc 
M-1 m(v,2m+1) EHS 
+2 e om "HI SZ d 
(—7 «argz « m), 
HO(z)- 2 T EE T_T "e CO. oc (—7«argz <27) 
y (z)= m exp i(z 2 4)] a (—2iz)” g , 
M-1 
most OR aeree] creme 


(i?) Debye's asymptotic representation. 


v=x, 1—(v/x)>e, v/ x-sina, when 1 (v/x) » (3/ x)» ?, 


td T= exp) ix [cosa +(a — 5 sina] | 














" 2 
im/4 l 5 3e?7i/4 
e 
x X eler ztn) y? 
77 385 3- 5e57i/4 
+ (79g + 376 n 7g + 3456 n Wa .] 


(X =[— x cos(a/2)]'/?). 
v=x, (»/x)—-l>e, v/x=cosho, when j|5?—x?|V?»1, |5?2-— xa 


H((x)— v exp[x(o cosho — sinh o)] 
T 


Jl. duas PN cl 2 , 385 dE 
d *(s 74 coth s) s Has 37g Otho + sgag Oth o anys De 








(X =[- xsinh(o/2)] ^. 
When v=x, |x—»|&x!?, x1, x—»-6, 


61/3e SE Léia “acl na 











Q) 
H, Vic 31/2, 1/3 x? 


* (13 - x + Pore +. | 
(iii) Watson-Nicholson formula. When x,» »0, w-[(x/»y? - 1]/2, 
H,O(x) 937 Pw exp[( — 1)'* i ((7/6) + »(w —(w3/3) — arc tanw))] H)»w?/3) + O |v "H 
(t= 1,2). 


(IV) Functions Related to Bessel Functions 


(1) Modified Bessel Functions. 
I nie] (e "/?2z) 


66 (z/2) ^" 
nao RTO n1) 


K,(z)z D emn Mem )= = B e "77? QXe- in/27) 
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Recurrence formulas: 
1,-\(z)— 1,441G) 7 Qv/ 2), ), 
L, (2) + 4,442) 721). 
K, (27 K, (27 -Qv/2)K, C), 
K, (2) + K, (2) = —2K,(z), 
K_,(z)=K,(z). 


Airy’s integral: f eoxe- 47 $3 DEA Jual J| 


20 1 2xVx 
f, cos(t? + tx) dt z Vx kd SC | (x >0). 











H. Weber's formula: egener E zl [re "ep (at)J,(bt)t dt 
2p! ` 2p? 0 
(Rev>-—1, Jargp|c 7/4; a,b>0). 


4sin( uy — » 
Inn P" Ce Qzsinhne'rmrdr 
o 


Watson's formula: JaN, (Zz) - J, (Z)N, (Zz) = 
(Rez>0, Re(p—v)< 1), 


= ie 
io = Ge N,(2) Es 





e e K4izsinhrie dt  (Rez>0). 
Nicholson’s formula: J2(z)+ NX(z)= Ge =f D K(2z sinh r)cosh 2vt dt (Rez » 0). 
m^ 40 


Dixon-Ferrar formula: — J(z)- N2(z) = 


Beo f ? K>,(2z sinh r) dt 
7 0 


(Rez >0; -j«Re«j) 


(2) Kelvin Functions. ` ber,(z) ibei,(z) 2 J, (e *?"//*7), 
her, (z) + i hei, (z)= H Xe *57//47), 
ker,(z) & — (7/2)hei,(z), 
kei, (z) z (7 / 2)her, (z). 
When » is an integer n, — ber,(x)— ibei,(x) 2 (— lU,(Vi x), 
her, (x)— ihei,(x)2 (—1)* !H(X(Vi x) — (x real). 


NM LL ENS f 7/^sin(z cos8 sin?" dð 
T[»* (1/2)] Vr 70 


(3) Struve Function. 4H,(x)= 


` on (—1)"(2/2) 7"! 
mao Vim+(3/2)IT [p+ m (3/2) 


Anger function: J,(z)= i Í "cos( vO — zem dé. 
0 


H. F. Weber function: E,(z2)- © {sin(v6 — zsin6) dd, 
0 


2 d? d _ 2 
Putting V,=z rar? -tz2- p?, 
4 2 vtl bes S 
E e 
T[vy+(1/2)]V2 7 


z — v)cos 
V,E 2)» — IL = ee 
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When v is an integer n, J,(z)=J,,(z). 
[ Dd = zo) FOH) -IDON 


T. 


f Nolt)dt = zNa) + SEIN OH - NCOHOM. 
{n/2] n(n—j—1) 
Zen jAa/2) "` 
Ox) 9 1/t. 


I 21423 ont USE 
n=] 


t-z 


(4) Neumann Polynomials. 0, (t) (n is a positive integer), 





Schläfli polynomials: S, (D= d tO, (t) — cos? e (n is a positive integer), 


So(t)=0. 
VS, (x) 2 2n 2(x — n)sin'(nz/2) (V, is the same operator defined in (3)). 


T(v = RE 
Lommel polynomials: Rm, (2)= Uae, d 1m , I j|» —m, 
T(»)(2/2) g : 
1-»—m; -2) 
7 (uz /2sinvm)yJ, 4, (2)J -v4 (2+ D"J — lz, 21 
(m is a nonnegative integer). 
References 


See references to Table 16, this Appendix. 


20. Systems of Orthogonal Functions (— 317 orthogonal Functions) 


[ Pal) Pax) 0(x) dx 8,4, 


Notation p,(x) | Interval (a,b) | Weight (x) 


Pa (x) 










Legendre 2/(2n+1) 

















Gegenbauer C?(x) (1— x?y 70/2. | Qa (Qn  n)/2? (n + vn (T (5)? 
Chebyshev T, (x) (1—x?) 7 a(n=0); 7/2 (n2 1) 
Hermite H,(x) Va «n! 
2 
. mill T(a - 5 — y- 0) 
Jacob G, (a, y; uoc 
Ter nla; x) (a 2n)T(a - n)T Cy 4 5) 
Laguerre Le(x) T(a+n+1)/n! 








For Legendre polynomials P,(x) — Table 18.II, this Appendix. 


(I Gegenbauer Polynomials (Gegenbauer Functions) 


g T(n-*2v) S ( 35 1 1-t 
?(f)e——— n , —nh; 2 ese 
"O= Fay zb [MS RS INE. 


LQv-4- 25n[v--(/2)]f 1 (1/4)- (v/2) 
esse eoo tira 
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= TN 1 d'P, (1) 
Generating function CELLO Cra C PX x) = EE 








nI(2v+n) 


Orthogonal relation Í " (sin"8) C? (cos0)C; (ee BW = Ee 
"'(rtn)nür(r 


(I) Chebyshev (Tschebyscheff) Polynomials 


(1) Chebyshev Polynomial (Chebyshev Function of the First Kind) 


T,,(x) = cos(n arc cos x) 
=(1/2)| (x+iV1 =x? ) *(x-iV ETA 1 


= F(n,—ni 1/2; (1- x)/2) 


[n/2] TEMA 
= € Oyy) a- 
2, (z) ^ 


| (-D'ü-32^^ anü- xy 07 


(2n— 1)! dx" 
Chebyshev function of the second kind 
U,(x)zsin(narccos x) 
-a/20| (x+iVI-¥? ~(x-iV1=x ) ] 
(0) a aed 0/9 
(2n—1)!! dx"! l 


T(x), U,(x) are mutually linearly independent solutions of Chebyshev’s differential equation 
(1 — x2)y" — xy' +n? y 4 0. Recurrence relations are 


Tasi (x) Af, (x) + T(x) =0, U,, (x) —2xU,, (x) + U, | (x) 50. 


Generating function: 





1-7? S 1 1 
5 = To (x) +2 A 7, (x), ———, = = XU, j^. 
1=2rx+ P 2 nlx) lI-2x*8 Vix Zu Se 
Orthogonal relation: 
+1 T (x)T, (x) 0 (m#n), +] U, (x) U, (x) E (m2 n0), 
UV, de v/2 (m=n+#0), EE —— — dx- 2 
= Vi-x ^ (Hens): x Vi-x 0 (otherwise). 


Orthogonality in finite sums. Let ug, u}, ...,4, be the zeros of T, , ,(x). All zeros are real and 
situated in the interval (— 1, 1). Then we have 


k 0 (mz*n, or m=n=k+1), 
È Tn (ud T, Qi) 7 (k+1)/2 (1«men«k), 
Po kl (m=n=0). 


Let p, (x) be the best approximation of x" in — 1 < x < 1 by polynomials of degree at most n — 1. 
Then we have x" —p,(x) 2 27"*' T, (x). 
(2) Expansions by T,,(x). 


ege I(a)+2 Y. L(a)T,GO, 


n=) 


oo 
sinax=2 X (—1) Jona (2) T4, 4 (X), 


n=0 


nets S Ge) e Rose 


nzi 
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oo 
log(1 + xsin2a) =2logcosa—2 Y l- tana) T, (x), 


n=] 
: v3 2n*1 
arctanx = ep Tn = Tən+1 (x). 


n-l 
(III) Parabolic Cylinder Functions (Weber Functions) (— 167 Functions of Confluent Type) 
Parabolic cylinder functions: 


1/4)*(vj2) , -1/2 
D= AmE Watt, -1/4(2 */2) 


BCEE X Maa we 
P(t —v)/2] T(—v/2) 


v2 9-2/4 —v 1 si Jak Ly 3 z? 
=2 Ja Fi >>>? RE E 355 

ER Se 22/2] T(—v/2) 2 52535 
The solutions of Weber's differential equation 


d?u D SE, 
ext i 4e 


are given by 
D,(z), D,(-z), D_,_, (iz), D., .(- iz), 








and the following relations hold among them. 


D, (z)= [TG D/ V2s ][e*"?D_,_ | (iz) t7". , (C iz)] 
ce '"pí(-z)« (Vis /T(-9) ee E (iz) 


"D, (— 2) [V2s /T(-») Je'**?p.,., (- iz). 


Integral representation: 








—z?/A4 
D,G)- c al: "Hien lr (Rey <0). 
e Leier 2 D, Se = PT(—v)D,(z)dv (c <0, Jargr| & 7/4). 
c— ioo 


Recurrence formula: 
D,41(2)- zD,(z) t vD, 1(z)90, dD, (z)/dz+(1/2)zD, (z) — »D, .,(z) 20. 


NEE? 20*0/2 m 
ie ee DAD) e e 
Hass P Eu) 


Asymptotic expansion: 


- v(v— vr(v — l)(v -2)(» e 
D Gee er i Ps eae orte) (largz|< 27). 


D, (0) — 


2-424 


2 x A ] 
D. =e? /4/2 1-et( — ) , erf(x)z—— | e "dt (error function). 
TOL "el c w=- f ( ) 


(IV) Hermite Polynomials 


For the parabolic cylinder functions, when v is an integer n, we have 
D, (z) S (7 Uer "^q" (e -?/2)/ qzn— e ?/^g, (z/ V2 ), 
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where H,(x) is the Hermite polynomial 
H, (x) =27"/2(- 1)'e vq" (e *)/dx" eX "2p, (V2 x). 


A Hermite polynomial is more often defined by the following function He, (x) (e.g, in W.F. 
Magnus, F. Oberhettinger, and R. P. Soni [1]). 


He,(x) &( — 1)'e* "4" Le" */2)/ dx" =e* /4D (x)=H, (x/V2 ). 
The function y = H,(x) is a solution of Hermite’s differential equation 
y" —2xy' * 2ny =Q. 


H, (x) is a polynomial in x of degree n, and is an even or odd function according to whether n 
is even or odd. 


H5, (x) =(—1)"(2n— DIS F, (= n; 1/2; x?), 
Hy (x)=(- 1)"Qn+ D V2 zb, (75;3/2; x?). 


Recurrence formula: 


H, (x)= VÀ xH, (x) - nH, (x)= V2 xH, (x)— H; (x)/ V2 , 


H, (x)= v2 nH, — (x). 


(- D'Qn)! 


xu LU Dn: UI, Basen (0)=0. 


Han (0) = 
Generating function: 


eo 
e? s- Cm Y H,(x)t"/n!. 
n-0 


Orthogonal relation: 


+ co 


H, (x) H, (x)e7 dx = 8, n! Vm . 
— oo 


(V) Jacobi Polynomials 
G,(a,y;x) 8 F(-n,a*niyix) 


y-al (y n) d” 


=x!7Y(1— x) PG) dx” 


fees (1 sw) T] 





These satisfy Jacobi's differential equation x(1— x)y" [y ^ (a l)x]y' * n(a-- n)y =0. 
Orthogonal relation: 


niT(atn-—yt IT (yy 


1 - 
Y71(1— x)" “Ga (o, y; x) G, (a, y; x) dx = ——— MM 
f> (17x) Gn(a,yix)G,(a,y;x)dx (a 2n)T(a * n)T(y n) "" 


(Rey>0, Re(a—- y)» -l). 


Representation of other functions: 





" .1-x Z A. Tex 
P, (x)=G (1,1; Lei ) T, (x) dÉ ; ) 


C (x)=(- m G, Laun 
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(VI) Laguerre Functions 


(1) Laguerre Functions. 


T'( 4 v-- 1) 
T(x4- 1)F (v4 1) 


T (a v-1) 
uu E E ea Ed fy 
repro set bo 


These satisfy Laguerre's differential equation 


zd? [Li (z)]/ dz? - (a - 1— z)d LL (9 (2)]/ az + vL f? (z) 20. 


L®(z)= 2; GON gus Mija+1y/21+v,0/2(2) 


(2) Laguerre Polynomials. When v is an integer n (n=O, 1,2, ...), the function L®(x) reduces 
to a polynomial of degree n as follows. 


Laguerre polynomials: 


x, 7a J 
LOC) hr 4 enta Cie iA 








Es 
(n+ Ss 


LP(x)ei Lj"(x)el LAS (x) = e 5 Lila) (m=0,1,2,...). 


Recurrence formulas: 
nL®(x)=(—x+2n+a-1)L® (x) - (a a — (Më) 
xd[L@ (x)]/dx =nL©(x)-—(n+a)L™ (x) (n=2,3,...). 
Generating function: 
e "/0-0 oo 
= A L@(x)t” (den). 
(1 "m pe Deg 


Orthogonal relations: 


[ e sb (x) LO dx= Snnl (atn+1)/nt=6,,,00 + a)(^5*). 


Hy, (x) e - Dn! LE (2), Hy ux) (72) n! V2 xL? (x?). 
(3) Sonine Polynomials. 


(71) 


= (COEUR 


Le (x). 


(VII) Orthogonal Polynomials 


" x(x—1)...(x— k- 1) 
Ps ede Al" HEI rma) ken 


(where n, m are positive integers and n <m). 
We have the same polynomials if we replace x* in P,(1 — 2x) by 
x(x—1)...(x-—k+1)/m(m-1)...(m—k+1) (k-29,1,...,n). 
Orthogonality in finite sums: 


(m n 1)!(m— n)! 


> Pin (k) Pi m (k) = Sn 


co Gar mt 
Chebyshev's q functions: 
1 = 
qn (m, x) = al (x), Erm (x)= [2 (n/n)! gq (mm, x-1). 


For given data y, at m points x, — x, +(k—1)h (k=1, ..., m) that are equally spaced with 
step h, the least square approximation among the polynomials Q(x) of degree n( « m), i.e., the 


polynomial that minimizes the square sum of the residues S= 3 [y,— Q(x,)]' is given by the 
k=1 
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following formula (— 19 Analog Computation): 


MB X-—X m c B2 
Q(x)= A. süs— t) S= dw - Ae 
S, h e S, 
k=0 K= k=0 





B= 5 ct m (i), Au > [& rof 


i=l i=l 


References 


See references to Table 16, this Appendix. 


21. interpolation  (— 223 Interpolation) 


(1) Lagrange’s Interpolation Polynomial. 


f(x)= > f(x) Ear sr e E eec A 
s=0 


x,— xg) (x, 7 xi)... — x, G7 x, D. G7 AR 


Aitken's interpolation scheme. The interpolation polynomial f(x) corresponding to the value 
Js — fx) (520,1, ...,5) is given inductively by the following procedure. The order of xo x,, 
...,X, is quite arbitrary. 

Bso(x)my, (s=0,1,...,”), 
Psk (x) = [ G5 = x) p 0) 7 Gu 7 x) p, 00 ]/ G5 7 x) (s=k+1,k+2,...,n), 
F(x) =P yn (2). 
(2) Interpolation for Equally Spaced Points. When the points x, lie in the order of their sub- 


scripts at a uniform distance h (x, = x 9+ sh), we make the following difference table (Ax = h). 
Forward difference: 


AEAEE fa - ffo )-fG), A =A iA], 





Difference 
Variable Value of Function | (Ist) (2nd) (3rd) (4th) 





Backward difference: 
AA -ETA 
Central difference: 
Ai Bun un 905 57 Ai, 


Newton interpolation formula (forward type): 











u(u— 1) u(u—1)(u—2) 
f Gt ux) =f (xo) + 7 dot ~r b+ ———3: — A 
u(u—1)(u—-2)(u —3) 
4 ji Ag+... 
Gauss’s interpolation formula (forward type): 
ü u(u—1) ., u(u—l)(u-l) , 
f(xo** uAx) =f (x9) + Tr fot ji tg 


P u(u—1)(ut1)(u—-2) 


ai At rt, 
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Stirling's interpolation formula: 


A_,+A 2 u(u2—1) A+A? u^(u?— 1) 
FG ux) =f (xo) Tp pa s To toa 





gb Las 


Bessel's interpolation formula: 


jeu je RU Ss Ce see edu 


7 Teta ao. Ta a 
4 4 
Jd E E 5-2 000 
zl ie d 2 


Everett's interpolation formula: 





2_ 4 21 2_4 
ff (xt uAx) = f(x, Ax) = &f(xg) + Une EG L EM us 
Er Ne EN 2. 2. 
+ uf(x,)+ ae gs KU UUED a wee (E=1-u) 


3 5! 


(3) Interpolation for Functions of Two Variables. Let x,,  xo-- mAx, y, z yg ndy 
(m and n are integers). We define the finite differences as follows: 


A (xoyo) =f (x10) -F Goo). 

A,(xo,¥o) =f (x0,¥1) — f xoyo), 

A2(xo,yo) & A (xi,yo) 7 AG yo) = 87 (1,9). 

A (xo yo) S A (xi, y9) — Axe yo) = A (xy 1) 7 AL 0,70) 

A3(x0,¥o) = Ann) SZ A, (Xo. yo) = 5 (xo,1), 
Newton's formula: 

Tat uAx, yo+ vdy) =f (xo. yo) + (uA, + vA,) (xo. yo) 

+ (1/20 [u(u— DA2 + 2uvd,, + v(v — DA; |(xo.¥0) + DS? 

Everett's formula. Putting s=1—u,t=1—v we have 

f (xp t+ uAx,yot vAy) = stf (xo,yo) + sof (xo, y 4) + utf (xy yo) + uof Gxy y) 

-(1/6)] us(1 + 5) (182 (xo. yo) + 082 (xo, y) } + us (1 + u){ 18? Goog) + 982 Gy) 


t ot(14- 0 (587 (xo. yo) + ud? (x10) } tot(14 v) (587 (xo) + ub? G0) | odes 


References 


[1] F. J. Thompson, Table of the coefficients of Everett's central-difference interpolation formula, 
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[3] H. T. Davis, Tables of the higher mathematical functions I, Principia Press, Bloomington, 
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[4] K. Hayashi and S. Moriguti, Table of higher transcendental functions (in Japanese), Iwanami, 
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22. Distribution of Typical Random Variables 
(— 341 Probability Measures, 374 Sampling Distributions) 


In the following table, Nos. 1-13 are 1-dimensional continuous distributions, and Nos. 20-21 are 
k-dimensional continuous distributions, for which the distribution density is the one with respect 
to Lebesgue measure. Nos. 14-19 are 1-dimensional discrete distributions, and Nos. 22-24 are 
k-dimensional discrete distributions, where the density function P (x) means the probability at 
the point x. 

The characteristic function, average, and variance are given only for those represented in a 
simple form. 
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Distribution of Typical Random Variables 




















No. Name Symbol Density Function Domains 
1 (wi 
1 Normal N(p,o?) ea = S 00 Kei oo 
(2207) 20 
2 
1 (logy — p) 
2 Logarithmic normal — Ta LAE iui Mu O-zy « oo 
(roy? Y Zei 
3 Gamma I(p,o) [I(p)]^ lo Px? le -x/* O<x< 0 
4 Exponential e( 1,0) (1/0) exp(— (x — 2)/0) Heke ee 
5 Two-sided exponential (1/2a)e -lxi/c —o«x«oo 
6 Chi square x(n) 27^/7T(n/2)]- Ix /2-1e - x/2 0<x< oo 
7 Beta B(p,4) [B(p.q)]" Ix^- (1— xe! 0cx«l 
—(m*n2 
2 Kgxtn/m-i[1 + pee 
8 F F(m,n) iii l (neza) A O<x< o 
Kp zl B (m/2,n/2)) Tim 
—(m*n)2 
Kre™|1 2z 
9 z z(m,n) rem xo WI i — ee <0 
Kul B(m/2,n/2)] (m/n 
10 t tn) [Vn B(n/2,1/2)] 1 - ((2/n)]- 972 — eme fe e 
-1 
(x—p) 
11 Cauchy C(1,0) (10)-!| t+ S —oo«x« oo 
c 
One-side stable -1/2y -3/2 EN 
12 for exponent 1/2 c(27) X exp( — c2/2x) O« x« oo 
13 Uniform rectangular U(a, B) 1/(B — a) a«x«p 
14 Binomial Bin(n,p) (7 Te x-0,1,2, ...,n 
15 Poisson P(A) eM x! x=0,1,2,... 
x integer 
16 Hypergeometric H(N,n,p) (E ya) 0« x « Np, 
O«n-x« Nq 
17 Negative binomial NB(m,p) T(m- xT(m)x!]- Ip mg x=0,1,2,... 
18 Geometric G(p) pq x=0,1,2,.. 
19 Logarithmic K,q*/x, Kj = —1/logp x1,2,3,... 
Gei *^z|-12 oo < 
m ‘ - yy ony My 
20 Multidimensional normal N(p, X) Sé expl -(x-4)Z-!(x— uy /2], os 
x-(x eege breit, woes Dal z= (oj) 
TG... »43) EN 
De Eat? Xp -Xk >0, 


21 Dirichlet Pie)... Pia | 


ut, tx <l 
LTE Gat i. + xj) 





-1 
Wyle. xa! WË pXk+1, Xp Xk 
22 Multinomial M(n.(p)) Eet geen -0,1,...,n 
p. m n= Grot sot xi) eut... Leen 
Multa . pi (pn (4) Ei Ak integers 
23 ultidimensional HIN, a (p) x, Xk+t ap 0« x, < Np, 
hypergeometric 
Xy 41m n— x, oo x) (i=1,...,k+1) 
: : T(m-^ x, +... x) hrs XK 
24 Negative polynomial Uu ere 
Vo PEPE... pet, =0,1,2,... 


T(m)x,!...x,! 
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Conditions for Characteristic Mean Variance No. 
Parameters Function 
2,2 
Oe He, . ot 2 
ex £— —— l 
E (uv) , , 
—o«pu«oo, ent (o2) eieiei — 90% 2 
o>0 
p,o>0 (1 — iat)? op op 3 
—-o«pu«o0,0»0 e" (1 — ton") uto o? 4 
o>0 (1+ 02/2)! 0 202 3 
n positive integer (1—-2i0)- 7/2 n 2n 6 
P Pa 
ptq (p+q) (pt+q+)) 
— n 2n? (m-- n—2) 
m,n positive integers (n>2) — z (n> 4) 8 
-2 m(n-—2) (n—4) 
m, n positive integers 9 
n positive integer 0 (n»1) n/(n—2)(n»2) 10 
8 pee 5 "es exp(ipt — o|¢]) none none 11 
0<c<% exp[ — eļe|!/20 — it/{e))] none none 12 
—oo«ca«fi«o (e&t — eiat) / ii( 8— a) (a+ B)/2 (8—-ay/12 13 


nnn nnn a Oio 











ptq-lpq»0, i 
t4» 14 
n positive integer (pen+ 4) "p npa 
A20 exp[ —A(1 e?) A A 15 
- [7] n^! 
AN. Np,n positive integers x F(—n,— Np; Nq—n« l;e"), np D 16 
Non m' z m!/(m — n)! 
ptq-1pq»0, p^ mq mq 17 
m>0 (1- geit)” p p? 
P q 
p+q=l,p,g>0 = => 18 
1 - ge" p p? 
ptq-lpq»0 — Kz log(1— ge") Kiq/p K,a(1 - Kyg)/p? 19 
SUP LUI ————nár n E EE SE 
=< py, e Mk 5 
« oo, Z symmetric e (ur - +). V(x) = oj, 
ge SC 2 E(x) =p; 20 
positive definite Cov(x, xj) = gy 
quadratic form t= (ty, ist) 
V (x;) 
E(x) = Coo, sot ran, 
DEED 120 vi Cov(x; xj) = — Cr;vj, 21 
yt») Cm, +... +% 43)" 
XGyt cru 
Tut =I, 
ge dam (pei... puel y E(x)= V(x) =np,(1 — p), 22 
Py deal Pı + PK Pe+i i) = pi Cov(x, x) = —npjp; 
n positive integer 
Bi eet Peaaml V (x) - €np,(1 — pj), 
EE 0, pX) Pi 
DIEU ET E(x;)= np; Cov(x; xj) Cnp;p; 2 
N, NP,» ..., Np, n ue 
positive integers "C N-1 
PotP\ t+... + P= 2 
; mp; V(x) = mp po + p)/ p. 
1, Pot Pk pé (V — pie — ... — ppe)" E(x) = — EE Sé 24 
Po Cov(x,, lr mp,p;/p2 


20, m>0 
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Remarks 

1. Reproducing property with respect to p, o?. 

2. X =log Y : N (j,0?). 

3. Reproducing property with respect to p. 

4. e(0,0) 2 T(1,o). 

6. n is the number of degrees of freedom; reproducing property with respect to n. 

8. m and n are the numbers of degrees of freedom. 

9. e?! = F(m,n). 

10. n is the number of degrees of freedom. 

11. C(0,1) 2 £(1); reproducing property with respect to u and o. 

14. Reproducing property with respect to n. 

15. Reproducing property with respect to A. 

17. Reproducing property with respect to m. 

18. G(p) - NB(Lp). 

20. Generalization of normal distribution; reproducing property with respect to p and X. 
22. Generalization of binomial distribution; reproducing property with respect to n. 

23. Generalization of hypergeometric distribution. 

24. Generalization of negative binomial distribution; reproducing property with respect to m. 


23. Statistical Estimation and Statistical Hypothesis 
Testing 


Listed below are some frequently used and well-investigated statistical procedures. (Concerning 
main probability distributions — 398 Statistical Decision Functions, 399 Statistical Estimation, 
400 Statistical Hypothesis Testing). The following notations and conventions are adopted, unless 
otherwise stated. 

Immediately after the heading number, the distribution is indicated by the symbol as defined 
in Table 22, this Appendix. It is to be understood that a random sample (x,, x;, ..., x,) is ob- 
served from this distribution. Where two distributions are involved, samples (x,, ..., x,,) and 
(D, Kal are understood to be observed from the respective distributions. 

Next, a necessary and sufficient statistic based on the sample is marked with * when it is com- 
plete, and * otherwise. Then appears the sampling distribution of this statistic. For those statistics 
consisting of several independent components, the distribution of these are shown. Greek lower- 
case letters except x and y denote unknown parameters. Italic lowercase letters denote constants, 
each taking arbitrary real values. Italic capital letters denote constants whose values are specified 
in each procedure; repeated occurrences of the same letter under the same heading number specify 
a certain common real value. 

Problems of point estimation, interval estimation, and hypothesis testing are presented, with 
corresponding estimators, confidence intervals, and tests (critical regions) as their solutions. All 
the confidence intervals here are those constructed from UMP unbiased tests, having 1 —« as 
confidence levels. Alternative hypotheses are understood to be the negations of corresponding 
null hypotheses. Significance levels of all the tests are «. The following symbols are attached to 
each procedure to describe its properties. 

For estimators: 

UMV: uniformly minimum variance unbiased. 

ML: maximum likelihood. 

AD: admissibility with respect to quadratic loss function. 

IAD: inadmissibility with respect to quadratic loss function. 
For tests: 

UMP: uniformly most powerful. 

UMPU: uniformly most powerful unbiased. 

UMPI( ): uniformly most powerful invariant with respect to the product of transformation 

groups shown in( ). 

LR: likelihood ratio. 

O: group of orthogonal transformations. 

L: group of shift transformations. 

S: group of change of scales. 

AD: admissibility with respect to simple loss function. 
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IAD: inadmissibility with respect to simple loss function. (Note that UMPU implies AD.) 
The following symbols denote 100(1 — a)% points of respective distributions, a being 

sufficiently small. 

u(a): standard normal distribution. 

tj (a): t-distribution with f degrees of freedom. 

x}(a): x? distribution with f degrees of freedom. 

Ff (a): F-distribution with (f;,f,) degrees of freedom. 


(1) N(p,b?) | Zen N(nyunb?). 
Point estimation of u. x= 1Xx: UMV,ML, AD. 


Interval estimation of p. (s t u(a/2) SC ) 
n 


Hypothesis [u< k]. x>k+ vele UMP,LR. 
n 
Hypothesis [h< u « ]; x « h—- C or x >/+C: UMPU,LR. 


Q) N(a,07). ZG—ay*. oe?^xl. (o?xl is the o?-multiplication of a random variable obeying 
the vital distribution. We use similar notations in the following.) 


2(x,- a 3 
Point estimation of oi. SE UMV,ML,IAD. 


Interval estimation of oi, (AZ(x,— a, BX(x,— ay). 
Hypothesis loi < k]. X(x;— a)? >x2(a)k: UMP,LR. 
Hypothesis [o?=k]. E(x;- ay « Ak or (x, - ay» Bk: UMPU. 


Èx; : N (ny, no? 
(3) N(p,02). ES ee 
2(x;- X) ev 
Point estimation of y. x: UMV,ML, AD. 


VEG =X y 
Interval estimation of u. | X € t,..,(a/2) 3——————— |. 
n(n— 1) 





V n(n—-1 
Y — I 
Hypothesis [u< HL ——t— E > E UMPU,LR. 
VEG SV Vn(n-1) 
Hypothesis [ p= k]. e es > ol). UMPU,LR, UMPI(S, O) for k =0. 
Vis 3) Vn(n-1) 
E E(x-x) E(x-x) 
Point estimation of oi. =T © UMV, IAD. ———: ML, IAD. 


—2 
Ti(n— D/2 XT xX 
Point estimation of c. Tin- DO a : UMV,IAD. 
V2 T(n/2) nl 


Interval estimation of o°. (AX(x, — XY, BX(x,— xy). 
Hypothesis [o? EL E(x- xy»x2 (a)k: UMP, LR. 
Hypothesis [c?) Kl E(x- XY « Ak or Z(x,- x » Bk: UMPU. 
Hypothesis [o? » EL Z(x,- X « x2 ,(1—a)k: UMPU,UMPI(L). 
Hypothesis E < d ——* >E: UMPI), AD. 
het? y 

(4) Bin(N,0). Xx. Bin(Nn,0). 

Point estimation of 8. e UMV, ML, AD. 


Hypothesis [« k]. x>A: UMP. 
Hypothesis [h< @</]. x<Borx>C: UMPU. 


(5) H(N,m,0)(n2l) x*. 
Point estimation of @. “*; UMV,AD. 
Hypothesis [É < k]. x>A: UMP. 
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(6) NB(N,0) Zen NB(Nn,0). 


2 E Nn-l : . 
Point estimation of 8. Natz l (1 when the denominator is 0): UMYV, AD. 
Nn, 
Naas ML. 


Hypothesis [« kJ. 2x;<A: UMP. 
Hypothesis [h<@</]. 2x,<Bordx,;>C: UMPU. 
(7) P(A). Xx*. P(n). 
Point estimation of A. x: UMYV, ML, AD. 
Hypothesis [À < k]. x>A: UMP. 
Hypothesis [h<A</]. x<Borx>C: UMPU. 
(8 G(0) ZXx*. NB(n0). 
For the point estimation of 0 and hypothesis testing — (6). 
(9) U[0,0]. maxx;*. 
Point estimation of 0. maxx: ML, IAD. 
Hypothesis [ < kJ maxx,>(1—a)!/"k: UMP. 
Hypothesis [— EL maxx, « ka!/" or maxx,>k: UMP. 


nl 





maxx: UMV, IAD. 


(10) U[£ n]. (minx; max x;)*. 
nmin x; — max x; 


Point estimation of £. UMV, IAD. minx: ML, IAD. 


n—l 
. NOM ¿+n minx; +maxx; 

Point estimation of CR i Paras UMV, AD. 

Hypothesis [ņn— £ « k]. maxx; — minx; » ka'/^: UMP. 


(11) dk ety} (minx,, maxx;)*. 


"T min x; + max x; 
Point estimation of 8. cL. EE ML, AD. 
Hypothesis [9< k]. minx, >k+ 1 — al/" or maxx; k+ LE UMP. 


2 
(12) ein, o). fee E Kee 


min x; e(p,a/n) 


Point estimation of o. UMV, IAD. x—minx;: ML, IAD. 


n-l 
- i min x; — E i x: UMV,IAD. minx: ML, IAD. 
Hypothesis [o < Kusch Ex;<h or Zx;»5kloga !/"--h: UMP. 
Hypothesis [h<o</]. Xx,-naminx,;<A or Ex, nminx;» B: UMPU. 


Point estimation of p. 








n min x, — k nminx; — k 


«Ü or e 


wL———-— >C: UMPU. 
x;— nminx; 


Hypothesis [ u= k]. Yrx-naminx 


(13) T(p,o). ZEx;*. T(np,o). 


Point estimation of o. UMV, ML, IAD. 


hey [I 


Interval estimation of o. (C2x,, DY x). 
Hypothesis [o < k]. 2x,;>A: UMP. 
Hypothesis [o = k]. Ex;« Ck or Ex,» Dk: UMPU. 


Nina Eat p 


14 
oF N (2,57). Èy; N (n; us, n5b?) 


Point estimation of i, —u,.. X —y: UMV, ML, AD. 


2 p2 
f i a^ b 
Interval estimation of i, — H2. (s —ytu(a/2) | —r- a . 
n, n; 


2 bp? 
Hypothesis fu, ue <k]. x-y>k+u(a) —+—: UMP, LR. 
n, n, 
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* 


N o? 2x N (n, iy nio?) 
(15) N 2 ZY; cy N (n uo, n30?) |- 
Unt | steBa- +209) } Les, 
ny tA 7 


Point estimation of uj — 43. x-y: UMV, ML, AD. 


2 
Interval estimation of p, — Wz. [rre 1 + 1 F m ? d 
V 1 2 ptn- 


(x—y—k) V An, Vn + n,—2 
Vnitn, Vs? 








Hypothesis [uj — u< k]. t > ty +n,-2(a): UMPU, 


UMPI(L), LR. 
Hypothesis In — go k]. [t| 5,44, ol UMPU, UMPI(L), LR. 
2 2 


s : 5 
SE UMV, IAD. m ML, IAD. 


Interval estimation of 07. (As?, Bs?). 

Hypothesis [o? < EL s^» x2, .,,-5(a)k: UMP, LR. 

Hypothesis [o?- k]. s?« Ak or s?» Bk: UMPU. 

Hypothesis [o? > k]. 52» x2,4,4,-x(1—a)k: UMPU, UMPI(L), LR. 





Point estimation of o?. 


N(n,ol) [Exs X(x-X)| 
N ( 2.01). È yi, X(»-y) 
ei | ZQ-x) Bm FY 
Interval estimation of =< Ae et 
92 X(y;—-») Z(yi-Y) 
(nj-1) Za X) 
(n,—1) (y -yy 


(16) 


2 


| 
— <k 
02 





Hypothesis > Fm>(a)k: UMPU, UMPI(L, S), LR. 








5 2 Xj E(x- XY, 4 5 
(17) N (41, Mo, 61, 03, p). (OQ 2(6-*)(%-7) 
È y; Z(y;—-y) , 
T Z(x,-x)(yi-y) 
Point estimation of p. r= ——————————————: ML. 
X(x-XyX(Qwu-») 

t, 1(a/2) 
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Appendix B 
Numerical Tables 


Gë Numbers and Primitive Roots 
SCH Modulo p 

ium Numbers and Euler Numbers 
m Numbers of Algebraic Number Fields 
— of Finite Groups; 
Crystallographic Groups 

M Constants 

M of Polynomial Approximations 


1. Prime Numbers and Primitive Roots (— 297 Number Theory, 


Elementary 


In the following table, p is a prime number and r is a corresponding primitive root. 

























p r r p r 
2 9 857 3 
3 2 5 859 2 
36 2 2 863 5 
7 3 3 877 2 

11 2 2 881 3 

13 2 3 883 2 

17 3 2 887 5 

19 2 7 907 2 

23 5 3 911 17 
29 2 7 919 7 
31 3 7 929 3 
37 2 251 11 937 5 
41 7 3 941 2 
43 3 5 947 2 
47 2 953 3 
43 967 5 

971 11 

97] 3 

983 5 

991 7 

997 7 





*Mersenne numbers. À prime number of the form 2? — 1 is called a Mersenne number. There exist 
27 such p’s less than 44500: p —2, 3, 5, 7, 13, 17, 19, 31, 61, 89, 107, 127, 521, 607, 1279, 

2203, 2281, 3217, 4253, 4423, 9689, 9941, 11213, 19937, 21701, 23209, 44497. The even perfect 
numbers are the numbers of the form 2? ! (2? — 1), where 2? — 1 is a Mersenne number. 


2. Indices Modulo p (^ 297 Number Theory, Elementary) 


Let r be a primitive root corresponding to a prime number p. The index l= Ind,a of a with respect 
to the basis r is the integer | in O& [« p — 1 satisfying r'=a(mod p). az b(mod p) is equivalent to 
Ind, ass Ind, b(mod(p — 1)). The index satisfies the following congruence relations with respect to 
mod(p — 1): Ind, ab z Ind, a 4- Ind, b, Ind, a^z nInd,a, Ind,a=Ind,rInd,a. 
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App. B, Table 2 
Indices Modulo p 


We can solve congruence equations using these relations. The following is a table of indices. 




















133 17 19 

a. 

5 10 — 
14 7 15 
18 21 9 
11 7 4 
il 7 735 
9 7 3 
32 38 19 
11 16 45 
24 10 37 
45 40 38 
40 47 26 
19 64 10 
39 49 16 
59 21 62 
34 21 32 
71 56 47 
23 6 35 
25 89 SI 
66 30 96 
72 70 80 
14 29 7 

101 87 105 
22 S 99 
94 38 84 
18 43 35 
25 38 46 
64 107 ól 
53 124 &84 
23 124 120 
26 40 124 
51 57 125 

103 53 58 

130 73 33 

114 166 54 

164 175 135 

112 98 1 
141 31 145 
25 159 154 
172 123 55 
144 199 154 
147 144 172 
61 99 178 
195 24 352 


23 29 3 
20 — 
27 9 — 
t5 21 9 
4 3 12 
16 41 34 
5 35 3 
39 46 33 
15 28 49 
57 35 59 
28 44 447 
15 68 11 
46 35 11 
26 11 56 
60 12 38 
57 59 3 
7] 13 46 
86 91 84 
24 86 57 
62: 322 27 
3 98 34 
41 89 50 
121 113 46 
233 51 29 
125 91 73 
237 294 56 
95 120 132 
145 42 34 
135 129 62 
9 107 69 
99 150 90 
20 144 102 
135 118 62 
53 48 99 
134 33 175 
162 123 82 
120 36 141 
118 70 164 
21 179 115 
163 128 82 
34 8 197 
131 191 175 


37 


41 


43 


124 


115 


113 


47 


124 


218 
193 


(table continued on following page) 
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Bernoulli Numbers and Euler Numbers 


















5 7 11 13 
233 72 1 165 222 197 158 
239 66 74 138 1 4 43 
241 190 182 138 1 25 47 





3. Bernoulli Numbers and Euler Numbers 


Functions) 


B, are Bernoulli numbers; £, are Euler numbers. 


n 
2 l 6 
4 l 30 
6 ] 42 
8 l 30 

10 5 66 

12 691 2730 

14 7 6 

16 3617 510 

18 43867 798 

20 174611 330 

22 854513 138 

24 236364091 2730 

26 8553103 6 

28 23749461029 870 

30 8615841276005 14322 


4. Class Numbers of Algebraic Number Fields 


(D) Class Numbers of Real Quadratic Field 


23 


112 
63 
57 

134 

28 


29 


132 
160 
154 
203 

94 


156 
176 
75 
208 
197 


221 
187 
45 
74 
172 


31 


182 
188 
151 
226 
242 


136 
259 
222 
47 
62 
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37 41 8 47 

8 85 25 139 
31 99 15 113 
73 6 219 114 
187 64 77 85 
219 19 207 61 
170 17 154 65 
56 200 254 32 
182 156 1 213 
87 126 55 218 
135 23 132 75 





0.16667 
0.03333 
0.02381 
0.03333 
0.07576 


0.25311 
1.16667 
7.09216 
54.97118 
529.12424 


6192.12319 
86580.25311 
1425517.16667 


27298231.06782 
601580873.90064 


Numerator of B, Denominator of B, | R | 


(^ 177 Generating 


E, 
l 

5 

61 

1385 
50521 


2702765 
199360981 
19391512145 
2404879675441 
370371188237525 


6.934887 x 10!6 
1.551453 x 10? 
4.087073 x 10?! 


1.252260 x 1074 
4.415439 x 1026 





(— 347 Quadratic Fields) 


Let k 2 Q( Vm ), where m is a positive integer without square factor (1« m e 501). A is the class 
number (in the wider sense) of k. The — sign in the row of N (e) means that the norm N (e) of 
the fundamental unit is — 1. When N (e)= +1, the class number in the narrow sense is 2h, and 















when N (e) 7 — 1, the class number in the narrow sense is also A. 

ZG i NO INO 
2 1 4 — 1253 1 335 2 l = 
3 1 173 1 -— |254 3 337 1 - 1 
Sd Ze 174 2 255 4 339 2 426 2 
6 1 177 1 257 3 — |341 1 427 6 
7 4d 178 2 258 2 345 2 429 2 

10 2 — 179 1 259 2 346 6 430 2 

lIl ot IL 1 — |262 1 347 1 431 ] 

l3 1 - 182 2 263 1 349 1 433 1 -— 

14 1 — |183 2 265 2 - l 434 4 
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m h N(e)| m h N(c)| m h N(s)| m h N(e)| m h N(e))| m h N(e) 
15 2 101 1 — 2 2 2 4 

17 1 — |102 2 2 2 2 l 

19 1 103 1 2 l 2 4 

21 1 105 2 2 l l $ 

22 1 100 2 - l 2 3 8&8 = 
23 1 107 1 l 4 2 3 

26 2  — [10 1 — 2 l 2 4 — 
29 1 — JI, 2 4 l 2 l 

30 2 Ill 2 l l l 2 

31 1 113 1 - l 2 4 ]: e 
33 1 114 2 l 1 2 2 

34 2 115 2 2 2 l l 

35 2 118 1] 2 2 2 l 

37 1 — ER 2 2 2 2 4 

38 ] Eë, 2, = l 4 l ko 
39 2 123 2 l 4 l 2 "eg 
4] 1 es |127 1 4 l l e 
42 2 129 1 l 2 l 4 

43 1 130 4 - l 2 2 l 

46 1 131 1 l 2 2 2 

47 1 133 1 2 l l 2 

51 2 134 1 l l 4 ] 
SE: e HS. 3 2 2 2 3 

55 2 138 2 4 2 l 2 

57 1 139 1 2 1 2 2 

58 2 — A9 1 2 l 2 3 

59 1 142 3 3 2 l 2 

61 1 - |I8 2 8 1 l l 

62 1 145 4 - l l 8 l 

65 2 -— |146 2 3 2 5 2... 
66 2 149 1 -— 2 l 2 2 

67 ] 151 1 4 2 2 4 

69 1 154 2 1 2 2 ST = 
70 2 155 2 6 3 2 l 

71 1 157 1 ee l 4 1 l 

733 1 se |158 1 2 4 4 1 

74 2 — |159 2 l 3 2 De ës 
77 1 161 1 l 2 l 2 

78 2 163 1 2 1 2 ] 

79 3 165 2 2 4 l 2 

82 4 -— | 166 1 l 1 2 5 

83 ] 167 1 1 1 l l 





One can find a table of fundamental units and representatives of ideal classes for 0 « m < 2025 in 
E. L. Ince, Cycles of reduced ideals in quadratic fields, Royal Society, London, 1968. 


(II) Class Numbers of Imaginary Quadratic Fields  (— 347 Quadratic Fields) 


Let k 2 Q(V — m ), where m is a positive integer without square factor (1 « m « 509). h is the 
class number of k. In the present case, there is no distinction between the class numbers in the 
wider and narrow senses. 











oh~ RO 0000 
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App. B, Table 4.III 
Class Numbers of Algebraic Number Fields 


[m him bim A 

2 8 30 
14 4 4 462 8 
15. 2 5 463 7 
17 4 4 465 16 
19 1 3 466 8 
21 4 4 467 7 
22 2 10 469 16 
23 3 6 470 20 
26 6 12 47] 16 
29 6 2 473 12 
30 4 4 474 20 
31 3 8 478 8 
33 4 8 479 25 
34 4 4 481 16 
35 2 14 482 20 
BT 2 4 483 4 
38 6 5 485 20 
39 4 8 487 7 
41 8 6 489 20 
42 4 3 491 9 
43 1 6 493 12 
46 4 12 494 28 
47 5 8 497 24 
51 2 8 498 8 
53 6 8 499 3 
55 4 2 501 16 
57 4 6 502 14 
58 2 10 503 21 
59 3 10 505 8 
61 6 2 506 28 
62 8 5 509 30 
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There are only 9 instances of m for which h — 1, and only 18 instances of m for which h =2 (Baker, 


Stark). All these cases are in this table. 


One can find a table of structures of the ideal class groups and representatives of ideal classes 
for m « 24000 in H. Wada, A table of ideal class groups of imaginary quadratic fields, Proc. Japan 


Ácad., 46 (1970), 401—403. 
(III) Class Numbers of Cyclotomic Fields 


Cyclotomic field k = Q(e?"?) (1 <!< 100; | prime). h, is the first factor of the class number of k 
(— 14 Algebraic Number Fields). 
















3 1 41-1861 5-53-377911 
3l 47 5-139 67 67-12739 |83 3-279405653 

7 1 53 4889 71 7-792441 |89 113-118401449 
| | 


97 571-3457-206209 


59 3.59.233 | 73 89.134353 





h,>1 for / » 19 (Uchida). 


5. Characters of Finite Groups; Crystallographic Groups 


(1) Symmetric Groups S,,, Alternating Groups A, (3 < n < 7), and Mathieu Groups M, (n= 
11, 12,22, 23,24) 


(1) In each table, the first column gives the representation of the conjugate class as we represent 
a permutation by the product of cyclic permutations. For example, (3)(2)? means the conjugate 


class containing (123)(45)(67). 


1825 App. B, Table 5.I 
Finite Groups; Crystallographic Groups 


(2) The second column gives the order of the centralizer of the elements of the conjugate class. 
(3) In the table of S,, the first row gives the type of Young diagram corresponding to each 
irreducible character. For example, [3,22, 1] means 7 (3,2,2, 1). 

(4) In the table of 4,, when we restrict the self-conjugate character of S, (the character with *) 
to A,, it is decomposed into two mutually algebraically conjugate irreducible characters, and 
therefore we show only one of them. The other irreducible character of A, is given by the 
restriction to A, of the character of S, that is not self-conjugate. 

(5) In the table of M,, each character with a bar over the degree is one of the two mutually 
algebraically conjugate characters. 


( V-3)/2 ef =(+V5 )/2, & =(-1+V-7)/2 
ej =(-14V—-11)/2, e£ =(-14+V—15)/2, ef =(-1+V—-23 )/2 





S4 H pu mgr DI i A, 
(1) 24 l 3 2 3 l (1) 12 
(2) 4 l l 0 -1 -1 O 3 
(3) 3 l 0 -] 0 l (3) 3 
(4) 4 l -1 0 1 -1 OH 4 
Qy 8 1 -1 2 -1 | 








[3, I?]* [25:1] [2, 1°] 





[5] [41] 





[3,2] 













(1) 120 


1 
o 12; 1 2 l 0 esl x2- A] 
3) 6| 1 l -1 0 =l 1 l 
4) 4/1 0 -1 0 l 0 -] 
ary 8| ! 0 l $) l 0 l 

(30) 6 | 1 = = 
(5 5| 1 








DI [3,1] 2r] (214) In 








Kent ` Fei ` Fesch pi ` Fest ` Fee ` Fi ` ` bech ` Ferch ` be ` Fe 
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U] [6.1] 












1 6 14 I$ 

1 4 6 5 0 

1 3 2 3 2 

D 2 0 1 0 

E 2 2 -1 2 -1 l —4 l -1 2 -1 2 2 1 
1 l 0 =i l -1 l 0 -1 I -1 l 0 -1 -1 
1 bt -i 0 ->l 0 1 0 l 0 -1 0 -1 l | 
1 oO -1 0 0 1 0 0 0 -i 0 0 l 0 -I 
1 0 0 -1 0 l -1 0 -1 1 0 -1 0 0 l 
1 0 2 -3 0 1 -3 0 3 -1 0 3 -2 0 -l 
1 0 -I 0 2 -1 0 2 -1 2 0 -1 0 l 
l l 0 -1 0 l 0 -1 I -1 
l 2 2 1 -1 -1 ~1 2 -1 1] 
l 0 0 l l -1 -=i 0 1 -1 
1 0 -i 0 0 0 1 0 -] | 















(1) 2520 


(3) 36 i 
Qy 24 sc? 
(5 5 0 
(420) 4 0 
Gy 9 I 
3X2} 12 l 
(7) 7 d 
(7) 7 J 








Mu - (1) g| ıl 10 11 55 45 44 16 10 
(2)4 48 | 1 2 3 -1 -3 4 0 -2 
(4)? 8 | 1 2 -1 — 1 0 0 0 
(3 18| 1 l 2 l 0 -] -2 l 
(5)? 5 | 1 0 l 0 0 -1 1 0 

(8)(2) 8} 1 0 -] l -] 0 0 tiv2 
(8)(2) 8 | | 0 -] l -] 0 0 IVD. 
(0000) 6] 1 -] 0 -—] 0 1 0 d 
(11) slc -1 0 0 l 0 eg -] 
(11) 11| 1 -1 0 0 l 0 Die -1 
g—11-10-9.8— 7920. bandi 









M, (D l 144 176 16 
OH 192/1 3 es 0 0 
4y 3271 3 -1 3 -1-1 1 2 -2-1.0 0 0 O0 
GP 54/1 2 2 l 1 1 0 0 3 0 3 0 -4 -2 
GH 101 1 1 0 0 0 0 -1 1 -1 0 -1 1| l 
(829 8/1 1 -1-1 J -1 -1 0 0 +4 0 0 0 0 
(00) 6/1 0 0 -1 -1 1 0 0 ~-1 0 1| 0 0 O0 
dil) yl 0 0 0 0 O0 1 -10 0 -1 1 0 ef 
(10) 1|1 0 0 0 0 O0 1 -10 0 -I 1! 0 & 
(29° 24011 -1 -1 -5 -5 -5 5 6 6 -1 0 4 -4 4 
(100) 10}1 -1 -1 0 0 O0 O 1 1] -1 0 -1 1 -I 
(yay 32]] -1 3 -1 3 -1 | 2 -2 -1 0 0 O0 O0 
(3*5, 36/1 -1 -I 1 1 3 0 0 3 0 -3 -1 | 
(6) 1 -1 0 0 -i 0 1 -1 1 
(84) i -1 0 0 0 0 0 





g=12-11-10-9-8=95040. 
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My (1) 1 21 55 154 
(2 1 5 7 10 
(3$ 1 3 1 l 
(5)? 1 |] 0 -1 
(A^ 2y 1 1 -1 2 
(4)4(2)° | ] 3 -2 
(y 1 0 -1 0 
(y 1 0 -1 0 

(89 (4)2) 1-1 J 0 

OKOKON l -1 1 1 
(11)? 1 -1 0 0 
(11 1 -1 0 0 

g=22-21-20-48 = 443520. 

My (1) g |1 22 230 231 
(2)8 2688) 1 6 22 7 
(3° 18011 4 5 6 
(Gr 1511 2 0 1 

(4)*Qy 32 };1 2 2 -l 
(7? 1411 1 -1 0 
OH 4/1 1 -1 0 

(8)(4)Q) 8 Il 0 0 =li 

(60»(32QP 12]|1 0 1 -2 

(11? 11]1 0 -1 0 
(11 11 |1 O0 -1 0 

(41553) 15111 -1 0 l 

(15)(5)3) 15 |1 -1 0 1 

(472) 1411 -1 1 0 

(1472) 14 }1 -1 1 0 
(23) 23 ceed 0 ] 
(23) 23 }1 -1 0 l 

g=23-22-21-20-48 = 10200960. 

Mj (14 g jl 23 7.36 23-41 
Qu pam. 28 3 
(3)6 27.401 5 9 10 
GG 60 |1 3 2 3 
(AO 128 |1 3 4 1 
(TP 42 |1 2 6 l 
Ou Qa |1 2 0 l 
BAAR) 16 || 1 0 -1 

(60P(39(2» 24 |l | 1 -2 

(11)? li |1 ]) -1 0 
(1553 15 |I 0 =i 0 
(15(5(3 15 |i 0 -1 0 
(102) 14 |1 0 o -1 
(100) 14 |]10 0 -1 
(23) 232 |1 0 -1 0 
(23) 23 |1 0 -1 0 
(12)? 12 J1 ~1 0 1 
(6) 24 |1 -1 0 ] 
(4) 96 {1 ~1 0 l 
(3) 7.12|]1 -1 0 l 
Q)? 15[2]1 =1 102 -—MH 
(02Qy 20 [i —1 2 -1 
(2093) 21 |1 ~1 O 1 
(21)(3) 21 |1 -1 0 1 
Län 32 11 -1 4 -3 

(2(644)0) 12 |l -1 1! 0 





Aide en eneen E o E E e 
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23-77 
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280 231 385 99 45 
-8 7 l 3 -—3 

l -3 -2 0 0 

0 ) 0 -1 O0 

0 -! 1 -1 |I 

0 —] 1] 3 l 

d 0 0 | e 

0 0 0 | ge 

0 -1 1] -1 -1 

l 1 -2 0 d 
£4 0 0 0 l 
eg 0 0 0 l 
1035 2024 45 990 231 
27 8 ~3 ~18 7 
0 — 0 0 —3 
0 -] 0 0 l 
-] 0 } 2 -] 
-1 | ef ef 0 
-1 l E £3 0 
l 0 —1 0 —-] 
0 -1 0 0 1 
] 0 l 0 0 
l 0 1 0 0 
0 -1 0 0 est 
0 -1 0 0 £5 
-1 1 -e ey 0 
-1 ] =E E 0 
0 0 —] l l 
0 d -1 1 1 
55.64 45 22.45 23.45 23- 
64 =3  —18 —21 27 
10 0 0 0 0 
0 0 0 0 0 
0 ] 2 3 —1 
-1 ef ei aeg -1 
—] ey £3 283 — 
0 =q 0 —1 1 
—-2 0 0 0 0 
0 H 0 1 1 
0 0 0 0 0 
0 0 0 0 0 
1 —£3 ef 0 -1 
} ES EN 0 -1 
1 — 1 0 0 
H -l1 1 0 0 
0 \ 1 -1 0 
0 -] —] } 2 
0 l -2 -1 3 
~8 3 3 -3 6 
0 5 —10 -5 35 
0 0 0 0 0 
-l e E3 -E3 -1 
-1 ej ey —£3 -1 
D -3 6 3 3 
0 0 0 0 0 


fe 


m 


CO Out D e 


II eec L eu eo we = 
v 


en — 


^n 


m 
Zu Bu Ei CO e CN E 


| e oco 


| 


| L d 
Qu | 
N RO 


Dm m 


——wc c oocooo-ooo 
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11-35-27 


(y^ g 23.21 23.55 23.88 23-99 23.144 23-11-21 23.7.36 77-72 
(DE 21-210 | 35 49 8 21 48 49 —28 — 56 —21 
(36 27-40 | 6 5 -1 0 0 -15 -9 9 0 
(5)4 60 -2 0 -1 -3 -3 3 l -1 0 
(4)4(2) 128 3 l 0 1 0 -3 4 0 -1 
(7 42 0 -2 l 2 1 0 0 0 0 
CH 42 0 -2 1 2 1 0 0 0 0 
(89 (4)2) 16 -] l 0 -1 0 -1 0 0 l 
(60P3QY 24 2 1 -1 0 0 l -1 1 0 
(11)? ll -1 0 0 0 1 0 -1 0 0 
(15)(5)(3) 15 l 0 -1 0 0 0 l -1 0 
(15)(5)(3) 15 1 0 -1 0 0 0 l -1 0 
(14)(7)(2) 14 0 0 l 0 -1 0 0 0 0 
(14)(7)(2) 14 0 0 I 0 -1 0 0 0 0 
(23) 23 0 0 0 0 0 0 0 l -1 
(23) 23 0 0 0 0 0 0 0 l -1 
(12)? 12 0 0 0 0 0 0 0 0 0 
(6)4 24 0 0 0 2 -2 0 0 0 0 
(4)6 96 3 -3 0 -3 0 -3 0 0 3 
3) 7-72 | 0 8 8 6 -6 0 0 0 0 
(2)!2 15-27 | 3 -15 24 4-19 16 9 36 24 — 45 
(10)?(2)? 20 -2 — l l -1 l -1 0 
(21)(3) 21 0 1 l -1 1 0 0 0 0 
(21X3) 21 0 1 i -1 1 0 0 0 0 
(4)4(2)* 32|3 -7 8 -3 0 l -4 -8 3 
(12)(6)(4)(2) 12 0 -1  -1 0 0 1 -1 1 0 





g24:23- 22: 21: 20: 48 — 244823040. 


(II) General Linear Groups GL(2, q), Unitary Groups U(2, q), and Special Linear Groups SL(, q) 
(q is a power of a prime) (— 151 Finite Groups I) 


(1) The notations are as follows. &-exp[JzV —1/(q—1) X m-exp2vV-1 /(g?—1)], 
o-exp[2z V —1 /(q--1)) pisthe generator of the multiplicative group of GF(g) — (0), wis 
the generator of the multiplicative group of GF (q°)— (0), wi eo, B is an element of GL(2, 4) 
with order 4?— 1, and B,= B9}, 

(2) The first column gives a representative of the conjugate class. 


General Linear Group GL(, ai 
X,(1) 





X,,(9) 


Le n Zn 





a 
i .) g^" ger" (q+ Lev" +04 (q— Dou t 1) 
p 
p? m 0 gm +n)a — QQ4na(Q tl) 
k e * 
E gi engt) g^ +b) g 4 * nb V, o mb na 0 
p 
B* ere — g" 0 = ("+q") 





1«c«q?-1, c0 (modq+ 1). 
1«m«n«q- l, for Y, 1I«n«q?-1 


m,n? 


(1) l<a<q-l, 1<b<q-1, azb (modq- 1), 
(2) We assume that 1 < n < q— 1, for X,(1), X,(q), 
for Z,, n z0(modq + 1). Here, Z, = Z, when n— n'q(mod q? - 1). 


Unitary Group U (2, 4). 








Y. K di 
m,n <n 





s 
uM ein qo? (q= Not” (q+ lo” 
Ge | o? 0 — g(m*ns c" 
l a? 
E g^ 6*0 m g^ 6*2)» — (otn ag motns) 0 
t 
a 
po g^" oT! 0 nup -nug 
LE n" +n 


1829 App. B, Table 5.III 
Finite Groups; Crystallographic Groups 


(1) ( o ; ) KI Séi are the canonical forms of an element of U(2, q) in GL(2, q?). 
a w 


Q2)l<s<qtl, 1<t<qtl, sÉt(modqt*l) 1<u<q*—-1, ugz0(modq- 1). When u= 
— u'q (modq?-— 1) u, u’ gives the same conjugate class. 
(3) The ranges are 1 < n < q +1 for X,(D, X,(qg; 1«m«n«q-*lforY,, l«n« q’—1 for 
Z,, nt (modq - 1). When n'z — nq (modq?— 1), we have Z/ = Zi, 

Special Linear Group SL (2, 2") (the case when q= 2"). 





(1) l<a<(q—-2)/2, 1<c<q/2. 
(2) 1<n<(q—-2)/2, 1<m<q/2. 


Special Linear Group SL(2,4) (q= power of an odd prime number, e =(q — 1)/2, e' =(q¢+1)/2). 











q+1 q-1 


l ,q— 
Log Cy) (men ml yr 


Il E + + 
pl: nmn 1 l I A 
| z z : 
pr ; i] 1 0 l 1 7 A 
P\Z 1 9 (- 1)" ep (7 fk (-DfA* 
PZ 1 0 LI" eic (Im? (-1)*A* 
(" gi 1 4 ten 0 (- 1f 0 
p 


— (em +07”) 





(1) 1<a<(q—3)/2, 1<c<(q—1)/2, 1<n<(q—3)/2, 1<m<(q—1)/2, 


At=f-1 + [(—1)*q] 121/2, pt ={1 + [(—1)Y4] us. 
(2) The last two columns mean two characters (with the same signs), respectively. 


(III) Ree group Re(4), Suzuki Group Sz(q), and Janko Group J. 


Ree group Re(q) (4237*!23m?). 
The order of Re(q) is qq? -- 1(q—1), q92 ? 4-1, m, 2 q- 3m 1l, m —5q4—-3m«1. 


Uu 


1 ol d 940 (q-lmm,/2 (q—-1)mm./2 m(q-1)  q^«1 
J 2 q -q  -(q-W2 (q—-1)/2 0 q*l 
X 3 0 q —(qtm)/2 (q— m)/2 —m l 

Y 9 0 0 m m —m l 

T 3 0 0 a a 2a l 
Ioh 3 0 0 a a 2a l 
YT 9 0 0 B p -B 1 
EC, 8 0 0 B p -B l 
JT 6 0 0 Y —y 0 l 
JT! 6 0 0 Y -y 0 l 
R* l l 0 0 0 pttp '^ 
Si =] -3 l -1 0 0 
JR? ] cm 0 0 0 p^ + p— Ha 
Js? -1 1 l -1 0 0 
Hi -] 0 -1 0 -1 0 
W' —l 0 0 l l 0 





App. B, Table 5.11] 1830 


Finite Groups; Crystallographic Groups 


Z 
Io .] gtl (q—1)4o (9—1)40 (44 — Um, (oi — Im 
J 2 orl 3tg «Dy c -(4-1) 0 0 
X 3 l 2q-1 2q-1 -M4 -m_ 
Y 9 l -1 = —] -Í 
T 3 1 -1 =~] —3m-1 3m-] 
T 3 l -1 Sa —3m-] 3m—1 
YT 9 l Sab Sch -1 -1 
YT 9 l -1 e] -] =] 
JT 6 -] -3 l 0 0 
JT! 6 -1 -3 l 0 0 
R” ptp ^ d d 0 0 
S? 0 a(vb) o (Ab) 0 0 
JR* —(p'^ +p") 0 0 0 0 
JS? 0 o(vb) o (Ab) 0 9 
2 
ys 0 0 0 — Y (v4 0787) 0 
i=0 
2 , 
Ww' 0 0 0 0 — NM (w +w) 


i0 





(1) The first column gives a representative of conjugate class, and the second column gives its 
order. The orders of R, S, V, W are (q — 1)/2, (q + 1)/4, m_, m, , respectively. R, S, T are commuta- 
tive with J. 

(2) R°~R™%, V-ys.ysw.y-c.yuy-cwo— Miz Wn WP ww Ww, Here 
we fix an integer ô satisfying 6° = 1 [mod(q-- 1)/4], (6 — 1, (q+ 1)/4) I. 


JR'"—JR^*, | JS*— JS ^5, 
where A~ B means that A and B are mutually conjugate. 
(3) p=exp[4r V-I /(q-1)], v=exp(27V—1 /m.), w=exp(27V—1 /m,), 
o zexp[8r V — 1 /(q4-1)]. 
(4) 1< w<(q—3)/4, 1 <à « (q—3)/8. 
Here v is considered mod(q + 1)/4 and 
Y,= Yp = Ym Y SY aY 


kK is considered modm_ and 





b bó bd? -b — bó — bô? 
SS" SU mS S "S d 





Zem Zag Lage Zag Z wm Z wë 
Tt is considered mod m, and 
Z; = Lig Zei Z L= Z y= ZL 


2 1 
(5) o(vb)= ET Y (8 A gg — 9b), o'(Ab)= by (off A g — ABB) — rof Lo 


i20 i=0 
ee en EE EY omn ive; 
(6) a= ——À——. = EE y= SE We show one of the two mutually 


complex conjugate characters, for the characters A, B, C. 
Suzuki group Sz(g). The order of Sz(q) is q*(q?-- 1)(q— 1) (¢=22"*1,2q=r7). 





Xa Ys Z, 

l l a g+ (4-r+1Xq-1) (q* r*t1Xq- 1) r(q-)/2  r(q-72 
o 1 0 1 r—ì -r-] -r/2 erf 

p |! 0 l -1 -1 rV-1/2 -rVY-1 /2 
p'}1 0 1 -1 -1 NA rV-1/2 
ai |1 1 ug 0 0 0 0 

m. |1 -1 0 — (eB + eft + er i+ ey Ban 0 l 1 

mý |1 -1 0 0 —(e]* + e] + e5 Y* ey Y) -1 -] 





(1) The first column gives a representative of the conjugate class. 

(2) To, Ti, m, are the elements of order q— 1 g+r+1, q— r- 1, respectively. 

(3) ra pn, £, are the primitive q— 1, qc r- 1, q— r- 1 roots of 1, respectively. 

(4) mi and 7, ` are mutually conjugate elements, and hence X, and X. , give the same character. 
i, a run over the representatives of mod g—1, and i,a#0 (modq- 1). 
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(5) vj, 7,7, at, mo” are mutually conjugate, and hence Yg, Y o Ypg Y -pq give the same 
character. j, 8 run over the representatives of modg+r+1, and j, B x 0 (mod qt r4 1). 

(6) ai, nz", 139, 1; “4 are mutually conjugate, and hence Z,, Z_,, Zyp Z_yq give the same 
character. k, y run over the representatives of mod q— r-- 1, and k, y#0 (mod q— r1). 









Janko Group J. 

133 76 76 56 56 120 120 120 
-—-3 4  —4 0 0 0 0 0 
—-2 l 1 2 2 0 0 0 
£^ 1 1 Je" jet 0 0 0 
er 1 l 2e* 27 0 0 0 
0 1 -1 0 0 0 0 0 
0 -] -1 0 0 l l l 
—-g -1 4 0 0 0 0 
—eg* —] | 0 0 0 0 
l — —-] 

E 0 0 

€ 0 0 


(1) The order of J is 8-3-5-7-11-19= 175560. 

(2) The first column gives the order of the elements of each conjugate class. 

(3) p=exp(27V—1 /19), A, =p+p +8 +p''+p'+4+p'®, M=p° +p +p 6+0 +p° +p", 
A,7 pttp +p +p +p +p, et =(1+ V5 )/2. 
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(IV) Three-Dimensional Crystal Classes (— 92 Crystallographic Groups) 


Crystal System | Geometric Crystal Classes Arithmetic Crystal Classes Number of 














; (2) 
Bravais Types Schoenflies | International Space Groups 
Notation Notation” 
Short Full 
Triclinic C, 1 1 (P, 1) 1 
P S.C 1 1 (P,1) 2 
Monoclinic C; 2 2 (P,2) (C,2) 3-5 
PC Cin m m (P, m) (C, m) 6-9 
2 
Can 2/m E (P, 2/m) (C, 2/m) 10-15 
Orthorhombic D,(V) 222 222 (P, 222) (C, 222) (F, 222) (1, 222) 16-24 
P,C,F,l Ch mm2 mm2 (P, mm2) (C, mm2) (A, mm2) (F, mm2) 
(I, mm2) l 25-46 
222 
D2,(V,) mmm —-—— | (P,mmm) (C, mmm) (F, mmm) (1, mmm) 47-74 
mmm 
Tetragonal C, 4 4 (P, 49? (1,4) 75-80 
PI S, 4 4 (P, 4) (1,4) 81-82 
4 
Can 4/m x (P, 4/m) (1, 4/m) 83-88 
D, 422 422 (P, 422)? (1, 422) 89—98 
Cy 4mm 4mm ( P, 4mm) (1, 4mm) 99-110 
DANA 42m 42m (P, 42m) (P, 4m2) (I, 4m2) (1, 42m) 111-122 
422 
Don 4/mmm ——— | (P,4/mmm) (1, 4/mmm) 123-142 
mmm 
Trigonal C, 3 3 (P, 3) (R, 3) 143-146 
PR Belt 3 3 (P, 3) (R, 3) 147-148 
3 32 32 (P, 312) (P, 321)? (R, 32) 149-155 
Cy 3m 3m (P, 3m1) (P, 31m) (R, 3m) 156-161 
= _2 = — e 
Dza 3m 3— (P, 31m) (P, 3m1) (R, 3m) 162-167 
m 
Hexagonal Ce 6 6 (P, 6) 168-173 
P ech 6 6 (P,6) -174 
6 
m 


Dg 622 622 (P, 622) 177-182 
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Number of 
Space Groups” 


Crystal System | Geometric Crystal Classes 


Bravais Types 


Arithmetic Crystal Classes 





Schoenflies | International 
Notation Notation? 



















Hexagonal Ce. (P, 6mm) 8 183-186 
P D; (P, 6m2) (P, 62m) 187-190 
(on Den (P, 6/mmm) 191-194 
Cubic T (P, 23) (F, 23) (1,23) 195-199 
P,F,I T, (P, m3) (F, m3) (I, m3) 200-206 

O (P, 43279) (F, 432) (I, 432) 207-214 

T; (P 43m) (F, 43m) (1,43m) 215-220 

O, (P, m3m) (F, m3m) (1, m3m) 221-230 
Notes 


(1) The notation is based upon International tables for X-ray crystallography I, Kynoch, 1969. In 
each crystal system, the lowest class is a holohedry. 

(2) These correspond to the consecutive numbers of space groups in the book cited in (1). 

(3)- (10) Enantiomorphic pairs arise from these classes: two pairs for (4), (8), (9), and one pair for 
the others. 

For the shapes of Bravais lattices — 92 Crystallographic Groups E, Fig. 3. 


6. Miscellaneous Constants 


V2 =1.41421 35623 73095, V10 =3.16227 76601 68379. 


V2 = 1.25992 10498 94873. W100 = 4.64158 88336 12779. 
logi92 = 0.30102 99956 63981 = 1 /3.32192 80948 87364. 


(I) Base of Natural Logarithm e (1000 decimals) 


e — 2.71828 18284 59045 23536 02874 71352 66249 77572 47093 69995 95749 66967 62772 40766 30353 54759 
45713 82178 52516 64274 27466 39193 20030 59921 81741 35966 29043 57290 03342 95260 59563 07381 
32328 62794 34907 63233 82988 07531 95251 01901 15738 34187 93070 21540 89149 93488 41675 09244 
76146 06680 82264 80016 84774 11853 74234 54424 37107 53907 77449 92069 55170 27618 38606 26133 
13845 83000 75204 49338 26560 29760 67371 13200 70932 87091 27443 74704 72306 96977 20931 01416 
92836 81902 55151 08657 46377 21112 52389 78442 50569 53696 77078 54499 69967 94686 44549 05987 
93163 68892 30098 79312 77361 78215 42499 92295 76351 48220 82698 95193 66803 31825 28869 39849 
64651 05820 93923 98294 88793 32036 25094 43117 30123 81970 68416 14039 70198 37679 32068 32823 
76464 80429 53118 02328 78250 98194 55815 30175 67173 61332 06981 12509 96181 88159 30416 90351 
59888 85193 45807 27386 67385 89422 87922 84998 92086 80582 57492 79610 48419 84443 63463 24496 
84875 60233 62482 70419 78623 20900 21609 90235 30436 99418 49164 31409 34317 38143 64054 62531 
52096 18369 08887 07016 76839 64243 78140 59271 45635 49061 30310 72085 10383 75051 01157 47704 
17189 86106 87396 96552 12671 54688 95703 50354. 


e (in octal) = 2.55760 52130 50535 5. 

1/e-0.36787 94411 71442, ` e?— 7.38905 60989 30650 — | /0.13533 52832 36613, 
Ve = 1.64872 12707 00128 = 1/0.60653 06597 12633. 

log, 10— 2.30258 50929 94046 = 1 /0.43429 44819 03252, 

log,2 = 0.69314 71805 59945 = 1 / 1.44269 50408 88964. 


(I) The Number 7 (1000 decimals) (— 328 Pi(z)) 


App. B, Table 7 1834 
Coefficients of Polynomial Approximations 


1 —3.14159 26535 89793 23846 26433 83279 50288 41971 69399 37510 58209 74944 59230 78164 06286 20899 
86280 34825 34211 70679 82148 08651 32823 06647 09384 46095 50582 23172 53594 08128 48111 74502 
84102 70193 85211 05559 64462 29489 54930 38196 44288 10975 66593 34461 28475 64823 37867 83165 
27120 19091 45648 56692 34603 48610 45432 66482 13393 60726 02491 41273 72458 70066 06315 58817 
48815 20920 96282 92540 91715 36436 78925 90360 01133 05305 48820 46652 13841 46951 94151 16094 
33057 27036 57595 91953 09218 61173 81932 61179 31051 18548 07446 23799 62749 56735 18857 52724 
89122 7938] 83011 94912 98336 73362 44065 66430 86021 39494 63952 24737 19070 21798 60943 70277 
05392 17176 29317 67523 84674 81846 76694 05132 00056 81271 45263 56082 77857 71342 75778 96091 
73637 17872 14684 40901 22495 34301 46549 58537 10507 92279 68925 89235 42019 95611 21290 21960 
86403 44181 59813 62977 47713 09960 51870 72113 49999 99837 29780 49951 05973 17328 16096 31859 
50244 59455 34690 83026 42522 30825 33446 85035 26193 11881 71010 00313 78387 52886 58753 32083 
81420 61717 76691 47303 59825 34904 28755 46873 11595 62863 88235 37875 93751 95778 18577 80532 


17122 68066 13001 92787 66111 95909 21642 01989. 


7 (in octal) = 3.11037 55242 10264 3. 

1/7 —0.31830 98861 83791, 72— 9.86960 44010 89359 = 1/0.10132 11836 42338, 
Ve — 1.77245 38509 05516 = 1/0.56418 95835 47756, 

Vis =2.50662 82746 31001 = 1/0.39894 22804 01433, 

V2 /2. = 1.25331 41373 15500 — 1 /0.79788 45608 02865, 

War — 1.46459 18875 61523 = 1 /0.68278 40632 55296. 

logyo7 = 0.49714 98726 94134, logem = 1.14472 98858 49400. 


(III) Radian rad 


| rad = 57? 29571 95130 82321 =3437'.74677 07849 393 = 20626 4" 80624 70964. 
1° 20.01745 32925 19943 rad, | 1' = 0.00029 08882 08666 rad, 1” — 0.00000 48481 36811 rad. 


(IV) Euler's Constant C (100 decimals) (— 174 Gamma Function) 


C — 0.5TT21 56649 01532 86060 65120 90082 40243 10421 59335 93992 
35988 05767 23488 48677 26777 66467 09369 47063 29174 67495. 


e © zs 1.78107 24179 90197 98522. 























seS, 
ber ^ 
n Kë 
3 1.83333 333 2.45000 000 3.31822 899 100 5.18737 752 
4 2.08333 333 8 2.71785 714 20 3.59773 966 500 6.29282 343 
5 2.28333 333 2.92896 825 4.79920 534 1000 


7.48547 086 


7. Coefficients of Polynomial Approximations 


In this table, we give some typical examples of approximation formulas for computation of func- 
tions on a digital computer (— 19 Analog Computation, 336 Polynomial Approximation). 


(I) Exponential Function 
(1) Putting joe? +l=qt+yt zla is an integer, — i <y <3) we have 
e*=2%(y),0(y) = Yay’, which gives an approximation by a polynomial of the 7th degree, 
where the maximal error is 3x 10 !!. 
ay = 0.70710 678116, a, 0.49012 90717 2, a5 0.16986 579572, a4 0.03924 73321 5. 
447:0.00680 09712, | a5-0.00094 28173, a= 0.00010 93869, — a5, 0.00001 0826. 
(2) An approximation by a polynomial of the 11th degree: e* = Zajx/(—1«x«90). 
Maximal error 1x 107". 
dp = 0.99999 99999 990, a, = 0.99999 99999 995, a, — 0.50000 00000 747, 
447 0.16666 66666 812, a,=0.04166 66657 960, | 45 0.00833 33332 174, 
dg = 0.00138 88925 998, a,=0.00019 84130 955, gess 0.00002 47944 428, 
dg = 0.00000 27550 711, 4,92 0.00000 02819 019, a,, — 0.00000 00255 791. 
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(3) se te ea E <x<logv2). 
x kot kx? + Ki ( ) 
een MES 
2 1+ kx? 


Maximal error 1.4x 107 ^. 
ka= 1.00000 00000 00327 1, Kk, 0.10713 50664 56464 2, 
K47- 0.00059 45898 69018 8, &4— 0.02380 17331 57418 6. 


(II) Logarithmic Function 


(1) An approximation by a polynomial of the i 1th degree: log(1-- x)&Xa;x' (0< x « 1). 
Maximal error 1.1 x 107 !9. 


497 0.00000 00001 10, a; = 0.99999 99654 98, a, — 0.49999 82537 98, 
a3 =0.33329 85059 64, a= — 0.24963 72428 65, gess 0.19773 31015 60, 
dg— — 0.15744 88954 13, a, 0.11712 91156 18, dg — — 0.07364 03719 14, 


ag = 0.03469 74937 56, — ous —0.01046 82295 69. ous 0.00148 19917 22. 


- V3 E ] T 
(2) For 1€ x <2, and putting y= E 0a )(-1«y«H) then logx = logV2 + 
x+ 
Y a, y? *! gives an approximation by a polynomial of the 11th degree (0 < i <5), where the 
maximal error is 9.2x 10 P. 
ay = 0.34314 57505 076106, 4, = 0.00336 70892 56222 5, a, — 0.00005 94707 04347 4, 
a, = 0.00000 12504 99776 2, gess 0.00000 00285 68292 8, as = 0.00000 00007 43713 9. 


(III) Trigonometric Functions 


X do odo xd i : EE i aoc mein JL 
(1) We put 5z =P +5 tg +y (P isan integer; q 0,1; 7=0,1; -1 <z< 1), and s=sin SCH 
c=cos T. 
If r=0, sinx z(— 1)%s, cosx =(— 1)%e, 
If r=1, sinx=(— (fe, cosx = —(— 1)%s. 


Here s and c are computed by the following approximation formulas. Putting — z?/2- y, 
$(y) — sin(vz /4)zz£ a; kl. c(y) 9 cos(uz /A) ZI b, y' gives an approximation by a polynomial 
of the 5th degree, where the maximal errors are 5: 2x 107 7, c: 2x 107. 

dy = 0.78539 81633 97426, a, =0.16149 10243 75338, a4, 0.00996 15782 61200, 

a, = 0.00029 26094 99152, | 442: 0.00000 50133 389, as = 0.00000 00555 1357. 

by= 0.99999 99999 999, b, =0.61685 02750 601, b, =0.06341 73767 885, 

b, = 0.00260 79335 007. b 47: 0.00005 74476 09, b, 0.00000 07765 93. 

sin(xx/2). ; "T TET à 
(2) - zY(-lya;x"(—1&x«1) This gives an approximation by a polynomial of 10th 
degree (0x i« 5), where the maximal error is 2.67 x 10^! !. 

ag = 1.57079 63267 682, a,=0.64596 40955 820, a,=0.07969 26037 435, 

a, =0.00468 16578 837, a,=0.00016 02547 767, a,-0.00000 34318 696. 





TX. 


2 2 
(3) tan 45 n ai zk. zb Gi (continued fraction) (—1«x«l). 
4 


IK, 

Maximal error 9.8 x 107 "2, 

ky = 0.78539 81634 9907, k 6.19229 46807 1350, k, = — 0.65449 83095 2316, 
k, = 520.24599 06398 9939, k,= — 0.07797 95098 7751. 


(IV) Inverse Trigonometric Functions 


(1) An approximation by a polynomial of the 21st degree (0 <i < 10): 
arcsinx = Eajx?*! (le 1/ V2 ). 
Maximal error 107 !9, 
ag 1.00000 00005 3, a, =0.16666 65754 5, a, 0.07500 46066 5, 44— 0.04453 58425 7. 
aą= 0.03175 26509 6, 4, 0.01176 582819, ges 0.06921 261857, oss — 0.14821 09628 8, 
dg = 0.32889 76635 2, oa —0.35020 41201 5, 2,52 0.19740 50325 0. 
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(2) Putting x2w-tu(we-ii;-i«uxlv- 





arc tan x =arctanw+ (v), tf(v) 2 arctanv. 
The values of arctanw: 
arctan(1/8)— 0.12435 49945 46711, arctan(3/8) — 0.35877 06702 70611, 
arctan(5/8) 0.55859 93153 43560,  arctan(7/8) — 0.71882 99996 21623. 
(Uc) is computed by an approximation by a polynomial of the 9th degree (0 « i <4), where 
(rise arctanp = X(- lav!) 
Maximal error 1.6x 107», 
dg = 0.99999 99999 9992, a, =0.33333 33328 220, a, 0.19999 97377 6, 


a4 0.14280 9976, 


(3) arctanx = dn 


a4 — 0.10763 60. 


x^ x?| . i 
jp +... + 7 | (continued fraction) (—} <x « D). 
Ik, lke 


Maximal error 3.6 x 10- !9. 
ko =0.99999 99936 2,  k, — — 3.00000 30869 4, k= — 0.55556 97728 4, 


k= — 15.77401 81 


127 3, 


k4,— — 0.16190 80978 0, k,— — 44.57191 79508 8, kę= —0.10810 67493 1. 


(V) Gamma Function 


An approximation by a polynomial of the 8th degree: 


T(24x)& Majx'(- 


1/2« x « 1/2). 


Maximal error 7.6 x 1078. 


ag = 0.99999 9926, 


a, =0.42278 4604, a,—0.41184 9671. a;=0.08156 52323, 


a,= 0.07406 48982. a,— —0.00012 513767, a =0.01229 95771, a= — 0.00349 61289, 
ag = 0.00213 85778. 


(VI) Normal Distribution 


ae o 
(1) ral deeg EE 


of the 6th degree. 


1 ZEN ae f 
«is (0x x « oo). This gives an approximation by a polynomial 


Maximal error 2.8 x 1077. 
ao = 0.07052 30784, a, —0.04228 20123, a, — 0.00927 05272, 
a, —0.00015 20143, 24, 0.00027 65672, a, = 0.00004 30638. 


Q) P(x)= 


zl. 


oo 
e "n? dt. 





2x* d, 
4P(x)(1— P(x))« |e( ZZ) | etae t ) (0 x x « oo). 


Maximal error 2 x 10 5. 
49 —0.0055, a,=0.0551, a,=14.4. 
(3) The inverse function of (2) 





1/2 
ZEE d ) >, y--log[4PQ9(1— P(x))] (0x y « oc). 
yt+a, 


Maximal error 4.9 x 107+. 


ag = 2.06117 86, 


a, = —5.7262204, a,=11.64059 5. 


Statistical Tables for 
Reference 


Statistical Tables 


[1] J. A. Greenwood and H. O. Hartley, Guide 
to tables in mathematical statistics, Princeton 
Univ. Press, 1962. 

[2] Research Group for Statistical Sciences (T. 
Kitagawa and M. Masuyama, eds.) New statis- 
tical tables (Japanese), explanation p. 264, 
table p. 214, Kawade, 1952. 

[3] R. A. Fisher and F. Yates, Statistical tables 
for biological, agricultural and medical re- 
search, explanation p. 30, table p. 137, Oliver 
& Boyd, third edition, 1948. 

[4] E. S. Pearson and H. O. Hartley, Bio- 
metrika tables for statisticians, explanation 

p. 104, table p. 154, Cambridge Univ. Press, 
third edition, 1970. 

[5] K. Pearson, Tables for statisticians and 
biometricians I, 1930, explanation p. 83, table 
p. 143; II, 1931, explanation p. 250, table p. 262, 
Cambridge Univ. Press. 

[6] Statistical tables JSA-1972 (Japanese), 
table p. 454, explanation p. 260, Japanese 
Standards Association, 1972. 


Tables of Special Statistical Values 


[8] Harvard Univ., Tables of the cumulative 
binomial probability distribution, Harvard, 
1955, 


n 

2 (ea 5 dec., 

p=0.01(0.01)0.50, 

n= 1(1)50(2)100(10)200(20)500(50)1000. 

[9] National Bureau of Standards, NBS ap- 
plied mathematical series, no. 6, Tables of 
the binomial probability distribution, 1950, 
(le "~i and the partial sum: 7 dec, p= 
0.01(0.01)0.50, n 2 2(1)49. 

[10] T. Kitagawa, Table of Poisson distribu- 
tion (Japanese), Baihükan, 1951, e^ "m'/i*: 
7-8 dec., m=0.001(0.001)1.000(0.01) 10.00. 
[11] G. J. Lieberman and D. B. Owen, Tables 
of the hypergeometric probability distribu- 


tion, Stanford, 1961, Glee E(N) 


Ei n 
6 dec.. N = 2(1)50(10)100(100)2000, n = 
1(D(N 72), k  1(1)a. 

[12] National Bureau of Standards, NBS no. 


23, Tables of normal probability functions, 
1942, 
gx) 8 (1/ V2 )exp( — 5x’), 


d(x)= f" (x)dx: 15 dec., 
x —0(0.00001)1.0000(0.001)8.285. 


[13] K. Pearson, Tables of the incomplete 
beta-functions, Cambridge, second edition, 
1968, 1. (p. q): 8 dec., p, g=0.5(0.5) 11(1)50. 
[14] K. Pearson, Tables of the incomplete 
gamma-function, Cambridge, 1922, revised 
edition, 1951, 


IH (u.p)- ag (1/e*)(v? /T(p + Dd) dev: 


7 dec., p =0.0(0.1)5.0(0.2)50.0, u =0.1(0.1)20.0; 
p= — 1.0(0.05)0.0, u — 0.1(0.1)51.3. 

[15] N. V. Smirnov, Tables for the distribution 
and density function of the r-distribution, 
Pergamon, 1961, 6 dec., f= 1(1)35, r= 
0(0.01)3.00(0.02)4.50(0.05)6.50. 

[16] G. J. Resnikoff and G. J. Lieberman, 
Tables of the non-central t-distribution, Stan- 
ford, 1957. 

[17] F. N. David, Tables of the ordinates and 
probability integral of the distribution of the 
correlation coefficient in small samples, Cam- 
bridge, 1938. 

[18] D. B. Owen, The bivariate normal proba- 
bility distribution, Sandia Corp., 1957, 

7 (h,a): 6 dec., a= 0.000(0.025)1.000, oc, k = 


0.00(0.01)3.50(0.05)4.75, T (h,a) 


f^ ax | x? +y? 
"osea 
[19] National Bureau of Standards, NBS no. 


50, Tables of the bivariate normal distribution 
function and related functions, 1959, L(h, kr) 


| f*9f[? I 

L J 27V 1- r? 
x? y?- 2rxy 
2(1— r?) 


r= £0,00(0.05)0.95(0.01)0.99, 
h, k =0.0(0.1)4.0. 





xew] - [oa 6 dec., 


Tables of Allocation 


[20] T. Kitagawa and M. Midome, Table of 
allocation of elements for experimental design 
(Japanese), Baihûkan, 1953. 

[21] R. C. Bose, W. H. Clatworthy, and S. S. 
Shrikhande, Tables of partially balanced de- 
signs with two associate classes, North Caro- 
lina Agric. Expt. Station Tech. Bull., 1954 
(table of PBIBD). 


Numerical Tables for 
Reference 


General Tables 


[1] M. Boll, Tables numériques universelles, 
Dunod, 1947. 

[2] P. Barlow, Barlow's tables, Robinson, 
1814, third edition, 1930. 

[3] W. Shibagaki, 0.0176 table of elementary 
functions (Japanese), Kyóritu, 1952. 

[4] K. Hayashi, Table of higher functions 
(Japanese), Iwanami, second edition, 1967. 
[5] E. Jahnke and F. Emde, Funktionentafeln 
mit Formelin und Kurven, Teubner, second 
edition 1933 (English translation: Tables of 
functions with formulae and curves, Dover, 
fourth edition, 1945). 

[6] Y. Yoshida and M. Yoshida, Mathematical 
tables (Japanese), Baihükan, 1958. 

[7] M. Abramowitz and I. A. Stegun (eds.), 
Handbook of mathematical functions with 
formulas, graphs and mathematical tables, 
National Bureau of Standards, 1964 (Dover, 
1965). 

[8] A. Fletcher et al. (eds.), Index of mathe- 
matical tables I, II, Scientific Computing 
Service, Addison- Wesley, second edition, 
1962. 


Multiplication Table 


[9] A. L. Crelle, Rechentafeln welche alles 
Mutiplicieren und Dividiren mit Zahlen unter 
1000 ersparen, bei grósseren Zahlen aber die 
Rechung erleichtern und sicherer machen, 

W. de Gruyter, new edition 1944. 


Table of Prime Numbers 


[10] D. N. Lehmer, List of prime numbers 
from 1 to 10,006,721, Carnegie Institution of 
Washington, 1914. 


Series of Tables of Functions 


[11] British Association for the Advancement 
of Science, Mathematical tables, vol. 2, 
Emden functions, 1932; vol. 6, Bessel func- 
tions, pt. 1, 1937; vol. 8, Number-divisor 
tables, 1940; vol. 9, Tables of powers giving 
integral powers of integrals, 1940; vol. 10, 
Bessel functions, pt. 2, 1952. 

[12] Harvard University, Computation 
Laboratory, Annals, vol. 2, Tables of the 
modified Hankel functions of order one-third 
and their derivatives, 1945; vol. 3, Tables of 
the Bessel functions of the first kind of orders 


zero and one, 1947; vol. 14, Orders seventy- 
nine through one hundred thirty-five, 1951; 
vol. 18, Tables of generalized sine- and 
cosine-integral functions, pt. 1, 1949; vol. 19, 
pt. 2, 1949; vol. 20, Tables of inverse hyper- 
bolic functions, 1949; vol. 21, Tables of (e: 
generalized exponential-integral functions, 
1949, 

[13] National Bureau of Standards, Applied 
Mathematics Series (AMS), AMS 1, Tables 
of the Bessel functions Yo(x), Y;(x), Ko(x), 
K(x), 0< x « 1, 1948; AMS 5, Tables of sines 
and cosines to fifteen decimal places at 
hundredths of a degree, 1949; AMS 11, Table 
of arctangents of rational numbers, 1951; 
AMS 14, Tables of the exponential function 
e* (including e^ *), 1951; AMS 16, Tables 

of n! and T(n + 5) for the first thousand val- 
ues of n, 1951; AMS 23, Tables of normal 
probability functions, 1953; AMS 25, Tables 
of the Bessel functions Yg(x), Y;(x), Ko( x), 
K,(x),0< x <1, 1952; AMS 26, Tables of 
Arctan x, 1953; Tables of 10*, 1953; AMS 
32, Table of sine and cosine integrals for 
arguments from 10 to 100, 1954; AMS 34, 
Table of the gamma function for complex 
arguments, 1954; AMS 36, Tables of circular 
and hyperbolic sines and cosines for radian 
arguments, 1953; AMS 40, Table of secants 
and cosecants to nine significant figures at 
hundredths of a degree, 1954; AMS 41, 
Tables of the error function and its derivative, 
1954; AMS 43, Tables of sines and cosines 
for radian arguments, 1955; AMS 45, Table 
of hyperbolic sines and cosines, 1955; AMS 
46, Table of the descending exponential, 1955. 


Tables of Special Functions 


[14] Akademiya Nauk SSSR, Tables of the 
exponential integral functions, 1954. 

[15] J. Brownlee, Table of log I(x), Tracts for 
computers, no. 9, Cambridge Univ. Press, 
1923. 

[16] L. Dolansky and M. P. Dolansky, Table 
of log, (1/p), p: log;(1/p) and p:log;(1/p)-- 

(1 — plog;(1/(1 — p), M.LT. Research Lab. of 
Electronics tech. report 227, 1952. 

[17] L. M. Milne-Thomson, Die elliptischen 
Funktionen von Jacobi, Springer, 1931; En- 
glish translation: Jacobian elliptic function 
tables, Dover, 1950. 

[18] H. T. Davis, Tables of the higher math- 
ematical functions, Principia Press, vol. 1, 
Gamma function, 1933; vol. 2, Polygamma 
functions, 1935; vol. 3, Arithmetical tables, 
1962. 

[19] R. G. Selfridge and J. E. Maxfield, A 
table of the incomplete elliptic integral of the 
third kind, Dover, 1958. 


1839 


[20] W. Shibagaki, 0.01% table of modified 
Bessel functions and the method of numerical 
computation for them (Japanese), Baihükan, 
1955. 

[21] W. Shibagaki, Theory and application 

of gamma function, Appendix. Table of 
gamma function of complex variable effective 
up to 6 decimals (Japanese), Iwanami, 1952. 
[22] L. J. Slater, A short table of the Laguerre 
polynomials, Proc. IEE, monograph no. 103c, 
1955, 46-50. 

[23] G. N. Watson, A treatise on the theory 
of Bessel functions, Appendix, Cambridge 
Univ. Press, 1922, second edition, 1958. 

See also Statistical Tables for Reference. 


Tables of Approximation Formulas of 
Functions 


24] S. Hitotumatu, Approximation formula 
(Japanese) Takeuti, 1963. 
[25] T. Uno (ed.), Approximation formulas 
of functions for computers 1-3, Joint research 
work for mathematical sciences, pt. IV, sec- 
tion 5, 1961—1963 (Japanese, mimeographed). 
[26] Z. Yamauti, S. Moriguti, and S. Hito- 
tumatu (eds.), Numerical methods for elec- 
tronic computers (Japanese), Baihükan, I, 
1965; II, 1967. 
[27] C. Hastings (ed.), Approximations for 
digital computers, Princeton, 1955. 
[28] A. J. W. Duijvestijn and A. J. Dekkers, 
Chebyshev approximations of some transcen- 
dental functions for use in digital computing, 
Philips Research Reports, 16 (1961), 145—174. 
[29] L. A. Lyusternik, O. A. Chervonenkis, 
and A. P. Yanpol’skii, Mathematical analysis: 
functions, limits, series, continued fractions, 
Pergamon, 1965. (Original in Russian, 1963.) 
[30] J. F. Hart et al. (eds.), Computer approxi- 
mations, Wiley, 1968. 


Numerical Tables for Reference 


Journals 


Abh. Akad. Wiss. Góttingen 
Abhandlungen der Akademie der Wissen- 
schaften in Gottingen (Göttingen) 


Abh. Bayer. Akad. Wiss. Math.-Nat. Kl. 
Abhandlungen der Bayerschen Akademie der 
Wissenschaften. Mathematisch-Naturwis- 
senschaftliche Klasse (Munich) 


Abh. Math. Sem. Univ. Hamburg 
Abhandlungen aus dem Mathematischen 
Seminar der Universitat Hamburg (Hamburg) 


Acta Arith. 
Acta Arithmetica, Polska Akademia Nauk, 
Instytut Matematyczny (Warsaw) 


Acta Informat. 
Acta Informatica (Berlin) 


Acta Math. 
Acta Mathematica (Uppsala) 


Acta Math. Acad. Sci. Hungar. 
Acta Mathematica Academiae Scientiarum 
Hungaricae (Budapest) 


Acta Math. Sinica 
Acta Mathematica Sinica (Peking). Chinese 
Math. Acta is its English translation 


Acta Sci. Math. Szeged. 
Acta Universitatis Szegediensis, Acta Scien- 
tiarum Mathematicarum (Szeged) 


Actualités Sci. Ind. 
Actualités Scientifiques et Industrielles (Paris) 


Advances in Math. 
Advances in Mathematics (New York) 


Aequationes Math. 
Aequationes Mathematicae (Basel- Waterloo) 


Algebra and Logic 
Algebra and Logic (New York). Translation of 
Algebra 1 Logika 


Algebra i Logika 

Akademiya Nauk SSSR Sibirskoe Otdelenie. 
Institut Matematiki. Algebra i Logika 
(Novosibirsk). Translated as Algebra and 
Logic 


Algebra Universalis 
Algebra Universalis (Basel) 


Amer. J. Math. 
American Journal of Mathematics (Baltimore) 


Amer. Math. Monthly 
The American Mathematical Monthly 
(Menasha) 


Amer. Math. Soc. Colloq. Publ. 
American Mathematical Society Colloquium 
Publications (Providence) 





Amer. Math. Soc. Math. Surveys 
American Mathematical Society Mathematical 
Surveys (Providence) 


Amer. Math. Soc. Proc. Symp. Pure Math. 
American Mathematical Society Proceedings 
of Symposia in Pure Mathematics (Provi- 
dence) 

Amer. Math. Soc. Transl. 

American Mathematical Society Translations 
(Providence) 


Amer. Math. Soc. Transl. Math. Monographs 
American Mathematical Society Translations 
of Mathematical Monographs (Providence) 


Ann. Acad. Sci. Fenn. 

Suomalaisen Tiedeakatemian Toimituksia, 
Annales Academiae Scientiarum Fennicae. 
Series A. I. Mathematica (Helsinki) 


Ann. der Phys. 
Annalen der Physik (Leipzig) 


Ann. Fac. Sci. Univ. Toulouse 

Annales de la Faculté des Sciences de l'Uni- 
versité de Toulouse pour les Sciences 
Mathématiques et les Sciences Physiques 
(Toulouse) 


Ann. Inst. Fourier 
Annales de l'Institut Fourier, Université de 
Grenoble (Grenoble) 


Ann. Inst. H. Poincaré 
Annales de l'Institut Henri Poincaré (Paris) 


Ann. Inst. Statist. Math. 
Annals of the Institute of Statistical Mathe- 
matics (Tokyo) 


Ann. Mat. Pura Appl. 
Annali di Matematica Pura ed Applicata 
(Bologna) 


Ann. Math. Statist. 
The Annals of Mathematical Statistics 
(Baltimore) 


Ann. Math. 
Annals of Mathematics (Princeton) 


Ann. Physique 
Annales de Physique (Paris) 


Ann. Probability 
The Annals of Probability (San Francisco) 


Ann. Polon. Math. 
Annales Polonici Mathematici. Polska 
Akademia Nauk (Warsaw) 


Ann. Roumaines Math. 

Annales Roumaines de Mathématiques. 
Journal de l'Institut Mathématique Roumain 
(Bucharest) 


Ann. Sci. Ecole Norm. Sup. 
Annales Scientifiques de l'Ecole Normale 
Supérieure (Paris) 


1841 


Ann. Scuola Norm. Sup. Pisa 
Annali della Scuola Normale Superiore de 
Pisa, Scienze Fisiche e Matematiche (Pisa) 


Ann. Statist. 
The Annals of Statistics (San Francisco) 


Arch. History Exact Sci. 
Archive for History of Exact Sciences 
(Berlin-New York) 


Arch. Math. 
Archiv der Mathematik (Basel-Stuttgart) 


Arch. Rational Mech. Anal. 
Archive for Rational Mechanics and Analysis 
(Berlin) 


Ark. Mat. 
Arkiv fór Matematik (Stockholm) 


Ark. Mat. Astr. Fys. 
Arkiv for Matematik, Astronomi och Fysik 
(Uppsala) 


Astérique 
Astérique. La Société Mathématique de 
France (Paris) 


Atti Accad. Naz. Lincet, Mem. 
Atti della Accademia Nazionale dei Lincei, 
Memorie (Rome) 


Atti Accad. Naz. Lincei, Rend. 
Atti della Accademia Nazionale dei Lincei, 
Rendiconti (Rome) 


Automat. Control Comput. Sci. 

Automatic Control and Computer Sciences 
(New York). Translation of Avtomatika i 
Vychislitel'naya Tekhnik. Akademiya Nauk 
Latviiskoi SSR (Riga) 


Bell System Tech. J. 
The Bell System Technical Journal (New 
York) 


Biometrika 
Biometrika, A Journal for the Statistical 
Study of Biological Problems (London) 


Bol. Soc. Mat. São Paulo 
Boletim da Sociedade de Matematica de Sao 
Paulo (Sao Paulo) 


Bull. Acad. Pol. Sci. 
Bulletin de l'Academie Polonaise des Sciences 
(Warsaw) 


Bull. Amer. Math. Soc. 
Bulletin of the American Mathematical 
Society (Providence) 


Bull. Calcutta Math. Soc. 
Bulletin of the Calcutta Mathematical Society 
(Calcutta) 


Bull. Math. Statist. 
Bulletin of Mathematical Statistics (Fukuoka, 
Japan) 





Journals 


Bull. Nat. Res. Council 
Bulletin of the National Research Council 
(Washington) 


Bull. Sci. Math. 
Bulletin des Sciences Mathématiques (Paris) 


Bull. Soc. Math. Belg. 
Bulletin de la Société Mathématique de 
Belgique (Brussels) 


Bull. Soc. Math. France 
Bulletin de la Société Mathématique de 
France (Paris) 


Bull. Soc. Roy. Sci. Liége 

Bulletin de la Société Royale des Sciences de 
Liége (Liége) 

C. R. Acad. Sci. Paris 


Comptes Rendus Hebdomadaires des Séances 
de l'Académie des Sciences (Paris) 


Canad. J. Math. 
Canadian Journal of Mathematics (Toronto) 


Casopis Pést. Mat. 
Casopis pro Péstováni Matematiky, Ces- 
koslovenská Akademie Véd (Prague) 


Colloq. Math. ` ` 
Colloquium Mathematicum (Warsaw) 


Comm. ACM 
Communications of the Association for 
Computing Machinery (New York) 


Comm. Math. Phys. 
Communications in Mathematical Physics 
(Berlin) 


Comm. Pure Appl. Math. 
Communications on Pure and Applied 
Mathematics (New York) 


Comment. Math. Helv. 
Commentarii Mathematici Helvetici (Zurich) 


Compositio Math. 
Compositio Mathematica (Groningen) 


Comput. J. 
The Computer Journal (London) 


Crelles J. 
= J. Reine Angew. Math. 


CWI Newslett. 
Centrum voor Wiskunde en Informatica. 
Newsletter (Amsterdam) 


Cybernetics 
Cybernetics (New York). Translation of 
Kibernetika (Kiev) 


Czech. Math. J. 
Czechoslovak Mathematical Journal (Prague) 


Deutsche Math. 
Deutsche Mathematik (Berlin) 


Journals 


Differentsial'nye Uravneniya 
Differentsial nye Uravneniya (Minsk). Trans- 
lated as Differential Equations 


Differential Equations 
Differential Equations (New York). Transla- 
tion of Differentsial’ nye Uravneniya 


Dokl. Akad. Nauk SSSR 

Doklady Akademii Nauk SSSR 

(Moscow). Soviet Math. Dokl. is the English 
translation of its mathematics section 


Duke Math. J. 
Duke Mathematical Journal (Durham) 


Econometrica 
Econometrica, Journal of the Econometric 
Society (Chicago) 


Edinburgh Math. Notes 
The Edinburgh Mathematical Notes (Edin- 
burgh) 


Enseignement Math. 
L'Enseignement Mathématique (Geneva) 


Enzykl. Math. 

Enzyklopadie der Mathematischen Wissen- 
schaften mit Einschluss ihrer Anwendungen 
(Berlin) 


Erg. Angew. Math. 
Ergebnisse der Angewandte Mathematik 
(Berlin-New York) 


Erg. Math. 
Ergebnisse der Mathematik und ihrer Grenz- 
gebiete (Berlin-New York) 


Fund. Math. 
Fundamenta Mathematicae (Warsaw) 


Funkcial. Ekvac. 

Fako de l'Funkcialaj Ekvacioj Japana 
Matematika Societo. Funkcialaj Ekvacioj 
(Serio Internacia) (Kobe, Japan) 


Functional Anal. Appl. 

Functional Analysis and its Applications 
(New York). Translation of Funktsional. Anal. 
Prilozhen. 


Funktsional. Anal. Prilozhen. 

Funktsionalnyi Analiz i ego Prilozheniya. 
Akademiya Nauk SSSR (Moscow). Translated 
as Functional Anal. Appl. 


General Topology and Appl. 
General Topology and its Applications 
(Amsterdam) 


Hiroshima Math. J. 
Hiroshima Mathematical Journal. Hiroshima 
Univ. (Hiroshima, Japan) 


Hokkaido Math. J. 
Hokkaido Mathematical Journal. Hokkaido 
Univ. (Sapporo, Japan) 


1842 


IBM J. Res. Develop. 
IBM Journal of Research and Development 
(Armonk, N.Y.) 


Illinois J. Math. 
Illinois Journal of Mathematics (Urbana) 


Indag. Math. 
Indagationes Mathematicae = Nederl. Akad. 
Wetensch. Proc. 


Indian J. Math. 
Indian Journal of Mathematics (Allahabad) 


Indiana Univ. Math. J. 
Indiana University Mathematics Journal 
(Bloomington) 


Information and Control 
Information and Control (New York) 


Inventiones Math. 
Inventiones Mathematicae (Berlin) 


Izv. Akad. Nauk SSSR 

Izvestiya Akademii Nauk SSSR (Moscow). 
Math. USSR-Izv. is the English translation 
of its mathematics section 


J. Aigebra 
Journal of Algebra (New York) 


J. Analyse Math. 
Journal d'Analyse Mathématiques (Jerusalem) 


J. Appl. Math. Mech. 

Journal of Applied Mathematics and 
Mechanics (New York). Translation of Prikl. 
Mat. Mckh. 


J. Approximation Theory 
Journal of Approximation Theory (New York) 


J. Assoc. Comput. Mach. (J. ACM) 
Journal of the Association for Computing 
Machinery (New York) 


J. Austral. Math. Soc. 

The Journal of the Australian Mathematical 
Society (Sydney) 

J. Combinatorial Theory 


Journal of Combinatorial Theory. Series A 
and Series B (New York) 


J. Comput. System Sci. 
Journal of Computer and System Sciences 
(New York) 


J. Differential Equations 
Journal of Differential Equations (New York) 


J. Differentia] Geometry 
Journal of Differential Geometry (Bethlehem, 
Pa.) 


J. Ecole Polytech. 
Journal de l'Ecole Polytechnique (Paris) 


J. Fac. Sci. Hokkaido Univ. 

Journal of the Faculty of Science, Hokkaido 
University. Series I. Mathematics (Sapporo, 
Japan) 


1843 


J. Fac. Sci. Univ. Tokyo 
Journal of the Faculty of Science, University 
of Tokyo. Section I. (Tokyo) 


J. Franklin Inst. 
Journal of the Franklin Institute (Phila- 
delphia) 


J. Functional Anal. 
Journal of Functional Analysis (New York) 


J. Indian Math. Soc. 
The Journal of the Indian Mathematical 
Society (Madras) 


J. Inst. Elec. Engrs. 
Journal of the Institution of Electrical En- 
gineers (London) 


J. Inst. Polytech. Osaka City Univ. 

Journal of the Institute of Polytechnics, Osaka 
City University. Series A. Mathematics 
(Osaka) 


J. London Math. Soc. 
The Journal of the London Mathematical 
Society (London) 


J. Math. Anal. Appl. 
Journal of Mathematical Analysis and Ap- 
plications (New York) 


J. Math. and Phys. 

Journal of Mathematics and Physics (Cam- 
bridge, Massachusetts, for issues prior to 1975; 
for 1975 and later, New York) 


J. Math. Econom. 
Journal of Mathematical Economics 
(Amsterdam) 


J. Math. Kyoto Univ. 
Journal of Mathematics of Kyoto University 
(Kyoto) 


J. Math. Mech. 
Journal of Mathematics and Mechanics 
(Bloomington) 


J. Math. Pures Appl. 
Journal de Mathématiques Pures et Ap- 
pliquées (Paris) 


J. Math. Soc. Japan 
Journal of the Mathematical Society of Japan 
(Tokyo) 


J. Mathematical Phys. 
Journal of Mathematical Physics (New York) 


J. Multivariate Anal. 
Journal of Multivariate Analysis (New York) 


J. Number Theory 
Journal of Number Theory (New York) 


J. Operations Res. Soc. Japan 
Journal of the Operations Research Society 
of Japan (Tokyo) 


Journals 


J. Optimization Theory Appl. 
Journal of Optimization Theory and Applica- 
tions (New York) 


J. Phys. Soc. Japan 

Journal of the Physical Society of Japan 
(Tokyo) 

J. Pure Appl. Algebra 

Journal of Pure and Applied Algebra 
(Amsterdam) 


J. Rational Mech. Anal. 
Journal of Rational Mechanics and Analysis 
(Bloomington) 


J. Reine Angew. Math. 
Journal für die Reine und Angewandte 
Mathematik (Berlin). = Crelles J. 


J. Res. Nat. Bur. Standards 

Journal of Research of the National Bureau 
of Standards. Section B. Mathematics and 
Mathematical Physics (Washington) 


J. Sci. Hiroshima Univ. 

Journal of Science of Hiroshima University. 
Series A (Mathematics, Physics, Chemistry); 
Series A-I. (Mathematics) (Hiroshima) 


J. Soviet Math. 

Journal of Soviet Mathematics (New York). 
Translation of (1) Itogi Nauki—Seriya 
Matematika (Progress in Science— Mathe- 
matical Series); (2) Problemy Matematichesk- 
ogo Analiza (Problems in Mathematical 
Analysis); (3) Zap. Nauchn. Sem. Leningrad. 
Otdel. Mat. Inst. Steklov. 


J. Symbolic Logic 

The Journal of Symbolic Logic (New Bruns- 
wick) 

Japan. J. Math. 

Japanese Journal of Mathematics (Tokyo) 


Jber. Deutsch. Math. Verein. Uber D.M.V.) 
Jahresbericht der Deutschen Mathematiker 
Vereinigung (Stuttgart) 


Kibernetika (Kiev) 

Otdelenie Matematiki, Mekhaniki i Kiber- 
netiki Akademii Nauk Ukrainskoi SSR. 
Kibernetika (Kiev). Translated as Cybernetics 


Kódai Math. Sem. Rep. 
Kódai Mathematical Seminar Reports 
(Tokyo) 

Linear Algebra and Appl. 


Linear Algebra and Its Applications (New 
York) 


Linear and Multilinear Algebra. 
Linear and Multilinear Algebra (New York) 


Mat. Sb. 

Matematicheskit Sbornik (Recueil Mathéma- 
tique). Akademiya Nauk SSSR (Moscow). 
Translated as Math. USSR-Sb. 


Journals 


Mat. Tidsskr. A 
Matematisk Tidsskrift. A (Copenhagen) 


Mat. Zametki 

Matematicheskii Zametki. 

Akademiya Nauk SSSR (Moscow). Translated 
as Math. Notes 


Math. Ann. 
Mathematische Annalen (Berlin-Gottingen- 
Heidelberg) 


Math. Comp. 
Mathematics of Computation (Providence). 
Formerly Math. Tables Aids Comput. 


Math. J. Okayama Univ. 
Mathematical Journal of Okayama University 
(Okayama, Japan) 


Math. Japonicae 
Mathematica Japonicae (Osaka) 


Math. Nachr. 
Mathematische Nachrichten (Berlin) 


Math. Notes 

Mathematical Notes of the Academy of 
Sciences of the USSR (New York). Trans- 
lation of Mat. Zametki 


Math. Rev. 
Mathematical Reviews (Ann Arbor) 


Math. Scand. 
Mathematica Scandinavica (Copenhagen) 


Math. Student 
The Mathematical Student (Madras) 


Math. Tables Aids Comput. (MTAC) 
Mathematical Tables and Other Aids to 
Computation (Washington). Name changed 
to Mathematics of Computation in 1960 
(vol. 14ff.) 


Math. USSR-Izv. 
Mathematics of the USSR-Izvestiya (Provi- 
dence). Translation of Izv. Akad. Nauk SSSR 


Math. USSR-Sb. 
Mathematics of the USSR-Sbornik (Provi- 
dence). Translation of Mat. Sb. 


Math. Z. 
Mathematische Zeitschrift (Berlin-Góttingen- 
Heidelberg) 


Mathematika 
Mathematika, A Journal of Pure and Applied 
Mathematics (London) 


Meed. Lunds Univ. Mat. Sem. 

Meddelanden frán Lunds Universitets 
Matematiska Seminarium = Communications 
du Séminaire Mathématique de l'Université 
de Lund (Lund) 


Mem. Amer. Math. Soc. 
Memoirs of the American Mathematical 
Society (Providence) 
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Mem. Coll. Sci. Univ. Kyóto 
Memoirs of the College of Science, University 
of Kyóto. Series A (Kyoto) 


Mem. Fac. Sci. Kyushu Univ. 

Memoirs of the Faculty of Science, Kyushu 
University. Series A. Mathematics (Fukuoka, 
Japan) 


Mémor. Sci. Math. 
Mémorial des Sciences Mathématiques (Paris) 


Michigan Math. J. 
The Michigan Mathematical Journal (Ann 
Arbor) 


Mitt. Math. Ges. Hamburg 
Mitteilungen der Mathematischen 
Gesellschaft in Hamburg (Hamburg) 


Monatsh. Math. Phys. 
Monatschefte für Mathematik und Physik 
(Vienna) 


Monatsh. Math. 
Monatshefte für Mathematik (Vienna) 


Monograf. Mat. 
Monografje Matematyczne (Warsaw) 


Moscow Univ. Math. Bull. 

Moscow University Mathematics Bulletin 
(New York). Translation of the mathematics 
section of Vestnik Moskov. Univ., Ser. I, Mat. 
Mekh. 


Nachr. Akad. Wiss. Góttingen 
Nachrichten der Akademie der Wissenschaften 
in Góttingen. Math.-Phys. Kl. (Géttingen) 


Nagoya Math. J. 
Nagoya Mathematical Journal (Nagoya) 


Nederl. Akad. Wetensch. Proc. 

Koninklijke Nederlandse Akademie van 
Wetenschappen, Proceedings. Series A. 
Mathematical Sciences (Amsterdam) — Indag. 
Math., Proc. Acad. Amsterdam 


Nieuw Arch. Wisk. 
Nieuw Archief voor Wiskunde (Groningen) 


Numerische Math. 
Numerische Mathematik (Berlin-Góttingen- 
Heidelberg) 


Nuovo Cimento 
Il Nuovo Cimento (Bologna) 


Osaka J. Math. 
Osaka Journal of Mathematics (Osaka) 


Osaka Math. J. 
Osaka Mathematical Journal (Osaka) 


Pacific J. Math. 
Pacific Journal of Mathematics (Berkeley) 


Philos. Trans. Roy. Soc. London 
Philosophical Transactions of the Royal 
Society of London. Series A (London) 
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Phys. Rev. 
The Physical Review (New York) 


Portugal. Math. 
Portugaliae Mathematica (Lisbon) 


Prikl. Mat. Mekh. 

Adademiya Nauk SSSR. Otdelenie Tekhnich- 
eskikh Nauk. Institut Mekhaniki Prikladnaya 
Matematika 1 Mekhanika (Moscow). Trans- 
lated as J. Appl. Mat. Mech. 


Proc. Acad. Amsterdam 
— Nederl. Akad. Wetensch. Proc. 


Proc. Amer. Math. Soc. 
Proceedings of the American Mathematical 
Society (Providence) 


Proc. Cambridge Philos. Soc. 
Proceedings of the Cambridge Philosophical 
Society (Cambridge) 


Proc. Imp. Acad. Tokyo 
Proceedings of the Imperial Academy (Tokyo) 


Proc. Japan Acad. 
Proceedings of the Japan Academy (Tokyo) 


Proc. London Math. Soc. 
Proceedings of the London Mathematical 
Society (London) 


Proc. Nat. Acad. Sci. US 

Proceedings of the National Academy of 
Sciences of the United States of America 
(Washington) 


Proc. Phys.-Math. Soc. Japan 
Proceedings of the Physico- Mathematical 
Society of Japan (Tokyo) 


Proc. Roy. Soc. London 
Proceedings of the Royal Society of London. 
Series À (London) 


Proc. Steklov Inst. Math. 

Proceedings of the Steklov Institute of 
Mathematics (Providence). Translation of 
Trudy Mat. Inst. Steklov. 


Prog. Theoret. Phys. 
Progress of Theoretical Physics (Kyoto) 


Publ. Inst. Math. 
Publications de l'Institut Mathématique 
(Belgrade) 


Publ. Inst. Math. Univ. Strasbourg 
Publications de l'Institut de Mathématiques 
de l'Université de Strasbourg (Strasbourg) 


Publ. Math. Inst. HES 
Publications Mathématiques de l'Institut des 
Hautes Etudes Scientifiques (Paris) 


Publ. Res. Inst. Math. Sci. 
Publications of the Research Institute for 
Mathematical Sciences (Kyoto) 


Journals 


Quart. Appl. Math. 
Quarterly of Applied Mathematics (Provi- 
dence) 


Quart. J. Math. 
The Quarterly Journal of Mathematics, Ox- 
ford. Second Series (Oxford) 


Quart. J. Mech. Appl. Math. 
The Quarterly Journal of Mechanics and 
Applied Mathematics (Oxford) 


Rend. Circ. Mat. Palermo 
Rendiconti del Circolo Matematico de 
Palermo (Palermo) 


Rend. Sem. Mat. Univ. Padova 
Rendiconti del Seminario Matematico 
dell'Universitá di Padova (Padua) 


Rev. Mat. Hisp. Amer. 
Revista Matemática Hispano-Americana 
(Madrid) 


Rev. Mod. Phys. 
Reviews of Modern Physics (New York) 


Rev. Un. Mat. Argentina 
Revista de la Unión Matemática Argentina 
(Buenos Aires) 


Rev. Univ. Tucumán 

Revista Universidad Nacional de Tucumán, 
Facultad de Ciencias Exactas y Tecnologia. 
Serie A. Matemáticas y Fisica Teorica 
(Tucumán) 


Roczniki Polsk. Towar. Mat. 

Roczniki Polskiego Towarzystwa Matema- 
tycznego. Serja I. Prace Matematyczne 
(Krakow) 


Rozprawy Mat. 
Rozprawy Matematyczne, Polska Akademia 
Nauk, Instytut Matematyczny (Warsaw) 


Russian Math. Surveys. 
Russian Mathematical Surveys (London). 
Translation of Uspekhi Mat. Nauk 


Sammul. Góschen 
Sammulung Góschen (Leipzig) 


Sankhya 
Sankhya, The Indian Journal of Statistics. 
Series A and Series B (Calcutta) 


S.-B. Berlin. Math. Ges. 
Sitzungsberichte der Berliner Mathematischen 
Gesellschaft (Berlin) 


S.-B. Deutsch. Akad. Wiss. Berlin 
Sitzungsberichte der Deutschen Akademie 
der Wissenschaften zu Berlin, Mathematisch- 
Naturwissenschaftliche Klasse (Berlin) 


S.-B. Heidelberger Akad. Wiss. 
Sitzungsberichte der Heidelberger Akademie 
der Wissenschaften, Mathematisch-Natur- 
wissenschaftliche Klasse (Heidelberg) 


Journals 


S.-B. Math.-Nat. Kl. Bayer. Akad. Wiss. 
Sitzungsberichte der Mathematisch-Natur- 
wissenschaflichen Klasse der Bayerischen 
Akademie der Wissenschaften zu München 
(Munich) 


S.-B. Óster. Akad. Wiss. 
Sitzungsberichte der Osterreichische Akade- 
mie der Wissenschaften (Vienna) 


S.-B. Phys.-Med. Soz. Erlangen 
Sitzungsberichte der Physikalisch- 
Medizinischen Sozietät zu Erlangen (Erlangen) 


S.-B. Preuss. Akad. Wiss. 

Sitzungsberichte der Preussischen Akademie 
der Wissenschaften. Physikalisch-Mathema- 
tische Klasse (Berlin) 


Schr. Math. Inst. u. Inst. Angew. Math. Univ. 
Berlin 

Schriften des Mathematischen Instituts und 
des Instituts für Angewandte Mathematik der 
Universitat Berlin (Berlin) 


Schr. Math. Inst. Univ. Münster 
Schriftenrethe des Mathematischen Instituts 
der Universitat Münster (Münster) 


Sci. Papers Coll. Gen. Ed. Univ. Tokyo 
Scientific Papers of the College of General 
Education, University of Tokyo (Tokyo) 


Sci. Rep. Tokyo Kyoiku Daigaku 
Science Reports of the Tokyo Kyoiku 
Daigaku. Section A (Tokyo) 


Scripta Math. 

Scripta Mathematica. A Quarterly Journal 
devoted to the Philosophy, History, and Ex- 
pository Treatment of Mathematics (New 
York) 


Sém. Bourbaki 

Séminaire Bourbaki (Paris) 

SIAM J. Appl. Math. 

SIAM Journal of Applied Mathematics. A 
Publication of the Society for Industrial and 
Applied Mathematics (Philadelphia) 

SIAM J. Comput. 

SIAM Journal on Computing (Philadelphia) 


SIAM J. Control 
SIAM Journal on Control (Philadelphia) 


SIAM J. Math. Anal. 

SIAM Journal on Mathematical Analysis 
(Philadelphia) 

SIAM J. Numer. Anal. 

SIAM Journal on Numerical. Analysis 
(Philadelphia) 

SIAM Rev. 

SIAM Review (Philadelphia) 


Siberian Math. J. 
Siberian Mathematical Journal (New York). 
Translation of Sibirsk. Mat. Zh. 








1846 


Sibirsk. Mat. Zh. 

Akademiya Nauk SSSR. Sibirskoe Otdelenie. 
Sibirskii Matematicheskii Zhurnal (Moscow). 
Translated as Siberian Math. J. 


Skr. Norske Vid. Akad. Oslo 

Skrifter Utgitt av det Norske Videnskaps- 
Akademii Oslo. Matematisk-Naturvidens- 
kapelig Klasse (Oslo) 


Soviet Math. Dokl. 

Soviet Mathematics, Doklady (Providence). 
Translation of mathematical section of Dokl. 
Akad. Nauk SSSR 


SRI J. 
Stanford Research Institute Journal (Menlo 
Park) 


Studia Math. 
Studia Mathematica. (Wrocław) 


Sûbutu-kaisi 
Nihon Sûgaku-buturi-gakkai Kaisi (Tokyo) 


Sûgaku 
Sûgaku, Mathematical Society of Japan 
(Tokyo) 


Summa Brasil. Math. 
Summa Brasiliensis Mathematicae (Rio de 
Janeiro) 


Tensor 
Tensor (Chigasaki, Japan) 


Teor. Veroyatnost. i Primenen. 

Teoriya Veroyatnostei 1 ee Primenenie. 
Akademiya Nauk SSSR (Moscow). Translated 
as Theor. Prob. Appl. 


Theor. Prob. Appl. 

Theory of Probability and Its Applications. 
Society for Industrial and Applied Mathe- 
matics. English translation of Teor. Veroyat- 
nost. i Primenen. (Philadelphia) 


Tóhoku Math. J. 
The Tóhoku Mathematical Journal (Sendai, 
Japan) 


Tóhoku-rihó 
Tóhoku Teikokudaigaku Rikahókoku 
(Sendai, Japan) 


Topology 
Topology. An International Journal of 
Mathematics (Oxford) 


Trans. Amer. Math. Soc. 
Transactions of the American Mathematical 
Society (Providence) 


Trans. Moscow Math. Soc. 

Transactions of the Moscow Mathematical 
Society (Providence). Translation of Trudy 
Moskov. Mat. Obshch. 


Trudy Mat. Inst. Steklov. 
Trudy Matematicheskogo Instituta im. V. A. 


1847 Journals 


Steklova. Akademiya Nauk SSSR (Moscow- 
Leningrad). Translated as Proc. Steklov Inst. 
Math. 


Trudy Moskov. Obshch. 

Trudy Moskovskogo Matematicheskogo 
Obshchestva (Moscow). Translated as Trans. 
Moscow Math. Soc. 


Tsukuba J. Math. 
Tsukuba Journal of Mathematics. Univ. 
Tsukuba (Ibaraki, Japan) 


Ukrain. Mat. Zh. 

Akademiya Nauk Ukrainskoi SSR. Institut 
Matematiki. Ukrainskii Matematicheskii 
Zhurnal (Kiev). Translated as Ukrainian 
Math. J. 


Ukraintan Math. J. 
Ukrainian Mathematical Journal (New York). 
Translation of Ukrain. Mat. Zh. 


Uspekhi Mat. Nauk 

Uspekhi Matematicheskikh Nauk (Moscow- 
Leningrad). Translated as Russian Math. 
Surveys 


Vestnik Moskov. Univ. 

Vestnik Moskovskogo Universiteta. I, Mate- 
matika i Mekhanika (Moscow). Mathematical 
section translated as Moscow Univ. Math. 
Bull. 


Vierteljschr. Naturf. Ges. Zürich 
Vierteljahrsschrifte der Naturforschenden 
Gesellschaft in Zürich (Zurich) 


Z. Angew. Math. Mech. GAMM 
Zeitschrift für Angewandte Mathematik und 
Mechanik, Ingenieurwissenschaftliche For- 
schungsarbeiten (Berlin) 


Z. Angew. Math. Phys. (Z. A.M.P.) 
Zeitschrift für Angewandte Mathematik und 
Physik (Basel) 


Z. Wahrscheinlichkeitstheorie 
Zeitschrift für Wahrscheinlichkeitstheorie und 
Verwandte Gebiete (Berlin) 


Zbl. Angew. Math. 
Zentralblatt für Angewandte Mathematik 
(Berlin) 


Zbl. Math. 


Zentralblatt für Mathematik und ihre Grenz- 
gebiete (Berlin-Góttingen-Heidelberg) 


Zh. Eksper. Teoret. Fiz. 
Zhurnal Eksperimental’noi i Teoreticheskoi 
Fiziki (Moscow) 


Publishers 


Academic Press 
Academic Press Inc., New York-London 


Addison-Wesley 

Addison-Wesley Publishing Company, Inc., 
Reading (Massachusetts)-Menlo Park (Cali- 
fornia)-London-Don Mills (Ontario) 


Akadémiai Kiadó 

A kiadásért felós: az Adadémiai Kiadó igaza- 
tója (Publishing House of the Hungarian 
Academy of Sciences), Budapest 


Akademie- Verlag 
Berlin 


Akademische Verlag. 
Akademische Verlagsgesellschaft, Leipzig 


Allen 
W. H. Allen & Co. Ltd., London 


Allen & Unwin 
Allen & Unwin, Inc., Winchester (Massachu- 
setts) 


Allyn & Bacon 
Allyn & Bacon, Inc., Newton (Massachusetts) 


Almqvist and Wiksell 
Almqvist och Wiksell Fórlag, Stockholm 


Asakura 
Asakura-syoten, Tokyo 


Aschelhoug 

H. Aschelhoug and Company, Oslo 
Baihükan 

Tokyo 

Benjamin 

W. A. Benjamin, Inc., New York-London 


Birkhauser 
Birkhauser Verlag, Basel-Stuttgart 


Blackie 
Blackie & Son Ltd., London-Glasgow 


Cambridge Univ. Press 

Cambridge University Press, London-New 
York 

Chapman & Hall 

Chapman & Hall Ltd., London 


Chelsea 
Chelsea Publishing Company, New York 


Clarendon Press 
Oxford University Press, Oxford 


Cremona 
Edizioni Cremonese, Rome 


de Gruyter 
Walter de Gruyter and Company, Berlin 


Dekker 
Marcel Dekker, Inc., New York 


Deutscher Verlag der Wiss. 
Deutscher Verlag der Wissenschaften, Berlin 


Dover 
Dover Publications, Inc., New York 


Dunod 
Dunod, Editeur, Paris 


Elsevier 
Elsevier Publishing Company, Amsterdam- 
London-New York 


Fizmatgiz 
Gosudarstvennoe Izdatel’stvo Fiziko- 
Matematicheskoi Literatury, Moscow 


Gauthier- Villars 
Gauthier-Villars & C/*, Editeur, Paris 


Ginn 
Ginn and Company, Waltham (Massachu- 
setts) Toronto- London 


Gordon & Breach 
Gordon & Breach, Science Publishers Ltd., 
London 


Goztekhizdat 
Gosudarstvennoe Izdatel'stvo Tekhniko- 
Teoreticheskoi Literatury, Moscow 


Griffin 
Charles Griffin and Company Ltd., London 


Hafner 
Hafner Publishing Company, New York 


Harper & Row 
Harper & Row Publishers, New York- 
Evanston-London 


Hermann 
Hermann & C*, Paris 


Hirokawa 
Hirokawa-syoten, Tokyo 


Hirzel 
Verlag von S. Hirzel, Leipzig 


Holden-Day 
Holden-Day, Inc., San Francisco-London- 
Amsterdam 


Holt, Rinehart and Winston 
Holt, Rinehart and Winston, Inc., New York- 
Chicago-San Francisco-Toronto-London 


Interscience 
Interscience Publishers, Inc., New York- 
London 


Iwanami 
Iwanami Shoten, Tokyo 


Kawade 
Kawade-syobó, Tokyo 


Kinokoniya 
Kinokoniya Company, Tokyo 
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Kyóritu 

Kyóritu-syuppan, Tokyo 

Lippincott 

J. B. Lippincott Company, Philadelphia 


Longman 
Longman Group, Ltd., Harlow (Essex) 


Longmans, Green 

Longmans, Green and Company, Ltd., 
London-New York-Toronto-Bombay- 
Calcutta- Madras 


Macmillan 
The Macmillan Company, New York-London 


Maki 
Maki-syoten, Tokyo 


Maruzen 
Maruzen Company Ltd., Tokyo 


Masson 
Masson et Ce, Paris 


Math-Sci Press 
Math-Sci Press, Brookline (Massachusetts) 


McGraw-Hill 
McGraw-Hill Book Company, Inc., New 
York-London-Toronto 


Methuen 
Methuen and Company Ltd., London 


MIT Press 
The MIT Press, Cambridge (Massachusetts)- 
London 


Nauka 
Izdatel'stvo Nauka, Moscow 


Noordhoff 
P. Noordhoff Ltd., Groningen 


North-Holland 
North-Holland Publishing Company, 
Amsterdam 


Oldenbourg 
Verlag von R. Oldenbourg, Munich-Vienna 


Oliver & Boyd 
Oliver & Boyd Ltd., Edinburgh-London 


Oxford Univ. Press 
Oxford University Press, London-New York 


Pergamon 
Pergamon Press, Oxford-London-Edinburgh- 
New York-Parts-Frankfurt 


Polish Scientific Publishers 
Panstwowe Wydawnictwo Naukowe, Warsaw 


Prentice-Hall 
Prentice-Hall, Inc., Englewood Cliffs (New 
Jersey) 


Princeton Univ. Press 
Princeton University Press, Princeton 


Publishers 


Random House 
Random House, Inc., New York 


Sibundó 

Tokyo 

Springer 

Springer-Verlag, Berlin (-Góttingen)- 
Heidelberg-New York 


Teubner 

B. G. Teubner Verlagsgesellschaft, Leipzig- 
Stuttgart 

Tókai 

Tókai-syobó, Tokyo 

Tokyo-tosyo 

Tokyo 


Tokyo Univ. Press 
Tokyo University Press, Tokyo 


Ungar 
Frederick Ungar Publishing Company, New 
York 


Univ. of Tokyo Press 
University of Tokyo Press, Tokyo 


Utida-rókakuho 
Tokyo 


Van Nostrand 
D. Van Nostrand Company, Inc., Toronto- 
New York-London 


Vandenhoeck & Ruprecht 

Góttingen 

Veit 

Verlag von Veit & Company, Leipzig 
Vieweg 

Friedr Vieweg und Sohn Verlagsgesellschaft 
mbH, Wiesbaden 

Wiley 

Wiley & Sons, Inc., New York-London 


Wiley-Interscience 
Wiley & Sons, Inc., New York-London 


Zanichelli 
Nicola Zanichelli Editore, Bologna 


Special Notation 


This list contains the notation commonly and frequently used throughout this work. The symbol * 
means that the same notation is used with more than one meaning. For more detailed definitions 


or further properties of the notation, see the articles cited. 





property P(x) 


Article 
and 
Notation Example Definition Section 
I. Logic 
V VxF(x) Universal quantifier (for all x, F(x) | 411B, C 
holds) 
j 3xF(x) Existential quantifier (there exists 411B,C 
an x such that F(x) holds) 
A, & AAB, A&B Conjunction, logical product 411B* 
(A and B) 
v AvB Disjunction, logical sum (A or B) 4] 1B* 
"d JA Negation (not A) 411B 
>, >, > AB, A>B Implication (A implies B) 411B* 
e, e, 2 AB Equivalence (A and B are logically 411B 
equivalent) 
II. Sets 
€ xeX Membership (element x is a 381A 
member of the set X) 
€ xéX Nonmembership (element x is not 381A 
a member of the set X) 
AcB Inclusion (A is a subset of B) 381A 
d Ac B Noninclusion (A is not a subset 381A 
of B) 
= ASB Proper inclusion (A is a proper 381A 
subset of B) 
e Empty set 381A 
Ey AUB, |) A, Union, join 381B, D* 
N,N ANB, (A; Intersection, meet 381B, D* 
SE A*, C(A) Complement (of a set A) 381B 
—, N A-B ANB Difference (A — B = AN B') 381B 
x AxB Cartesian product (of A and B) 381B* 
R, ~ xRy,x~y Equivalence relation 135A* 
(for two elements x, y) 
/ A/R Quotient set (set of equivalence 135B* 
classes of A with respect to an 
equivalence relation R) 
IT TL. A. Cartesian product (of the A,) 381E 
ZU ZA, WA, Direct sum (of the A.) 381E 
B B(A) Power set (set of all subsets of A) 381E 
B^ Set of all mappings from A to B 381C 
{|} {x|P(x)} Set of all elements x with the 381A 
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UJ 


limsup, lim 
lim inf, lim 


lim 


III. Order 
6) 
[.] 
6G ] 
[.) 


« 


U, v 
Ke 
IV. Algebra 


mod 


det, | | 


Example 


Special Notation 


Definition 





{az Jaca 


{an} 


ZIL #X 


gof 


lim sup A, 
liminf A, 


lim A, 
lim A, 


lim A, 


(a, b) 
La, 
(a, b] 
La, b) 
max A 
min A 


sup A 
inf A 
a«b 


aUb,avb 
anb,anb 


a=b(modn) 


a|b 
afb 


det A, | A| 


Family with index set A 


Sequence (of numbers, points, 
functions, or sets) 


Cardinal number (of the set X) 
Aleph (transfinite cardinal) 
Mapping ( f from X to Y) 


Mapping (where f(X) = Y, but 
not in the present volumes) 


Identity mapping 
(identity function) 


Characteristic function 
(representing function) 


Restriction (of a mapping f to A) 
Composite (of mappings f and g) 


Superior limit (of the sequence 
of sets A,) 


Inferior limit (of the sequence 
of sets A. 


Limit (of the sequence of sets A,) 
Inductive limit (of A.) 


Projective limit (of A.) 


Open interval (x|a <x <b} 
Closed interval (x|la x « b] 
Half-open-interval (x|a «x <b} 
Half-open interval (x|a € x « b] 
Maximum (of A) 

Minimum (of A) 


Supremum, least upper bound 
(of A) 


Infimum, greatest lower bound 
(of A) 


Very large (b is very large 
compared to a) 


Join of a, b in an ordered set 


Meet of a, b in an ordered set 


Modulo (a and b are congruent 
modulo n) 


Divisibility (a divides b) 


Nondivisibility (a does not 
divide b) 


Determinant (of a square matrix 
A) 


Article 
and 
Section 


165D 
165D 


49A* 
49E 
381C* 
381C 


381C 


381C 


381C* 
381C 
270C* 


270C* 


270C€* 
210B 
210B 


355C* 
355C* 
355C 
355C 
311B 
311B 
311B 


311B 


243A * 
243A* 


297G 


297A* 
297A 


103A* 
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Article 
and 
Notation Example Definition Section 
tr, Sp tr A, SpA Trace (of a square matrix A) 269F 
Se tA; At, AT, A’ Transpose (of a matrix A) 269B 
I I, Unit matrix (of degree n) 269A 
Ei Matrix unit (matrix whose 269B 
(i, j)-component is 1 and all others 
are 0) 
& A@B Kronecker product (of two 269C* 
matrices A and B) 
= M=N Isomorphism (of two algebraic 256B 
systems M and N) 
/ M/N Quotient space (of an algebraic 256F* 
system M by N) 
dim dim M Dimension (of a linear space, etc.) 256C 
Im Imf Image (of a mapping f) 277E* 
Ker Ker f Kernel (of a mapping /) 277E 
Coim Coim f Coimage (of a mapping f) 277E 
Coker Coker f Cokernel (of a mapping f) 277E 
d, 0; Kronecker delta (ó;; — 1 and 269A 
DE (a,b), a-b Inner product (of two vectors 442B* 
a and b) 
[,] x [a, b], a x b Vector product (of two 3- 442C* 
dimensional vectors a and b) 
Gi MON Tensor product (of two modules 271), 256I* 
M and N) 
Hom Hom(M, N) Set of all homomorphisms 277B 
(from M to N) 
Hom, Hom (M, N) Set of all 4-homomorphisms 277E 
(of an A-module M to 
an A-module N) 
Tor Tor,(M, N) Torsion product (of M, N) 200D 
Ext Ext"(M, N) Extension (of M, N) 200G 
A, AP ^M, A°"M Exterior algebra (of a linear space 2560 
M), pth exterior product (of M) 
V. Algebraic Systems 
N Set of all natural numbers 294A 
Z Set of all rational integers 294A 
Lin Z/mZ (set of all residue classes 297G* 
modulo m) 
Q Set of all rational numbers 294A 
R Set of all real numbers 294A 
C Set of all complex numbers 294A 
H Set of all quaternions 29B 
GF(q), F; Finite field (with g elements) 149M 





1853 Special Notation 
Article 
and 
Notation Example Definition Section 
Q, p-adic number field 439F 
(p is a prime) 
Z, Ring of p-adic integers 439F 
{ ] Klees) Polynomial ring (of variables 369A 
Xj, X, With coefficients in k) 
ME Koss ssi hy) Field extension 149D 
tot k by 3.) 
Lk Jk KE Steen Formal power series ring (with 370A 
coefficients in K). 
Note: The symbols N, Z, Q, R, C, 
and H stand for sets, each with its 
own natural mathematical 
structure 
VI. Groups 
GL GL(V), GL(n, K) General linear group (over V, 60B 
or over K of degree n) 
SL SL(n, K) Special linear group (over K of 60B 
degree n) 
PSL PSL(n, K) Projective special linear group 60B 
(over K of degree n) 
U U (n) Unitary group (of degree n) 60F 
SU SU (n) Special unitary group (of degree n) 60F 
0 O(n) Orthogonal group (of degree n) 601 
SO SO(n) Special orthogonal group, 60I 
rotation group (of degree n) 
Spin Spin(n) Spinor group (of degree n) 61D 
Sp Sp(n) Symplectic group (of degree n) 60L 
[For PGL(n, K), LF(n, K), PU (n), Sp(n), PSp(n, K)—60 Classical Groups] 
VII. Topology (Convergence) 
-> a, >a Convergence (sequence a, 87B, E* 
converges to a) 
LN apļa, a, Nd Convergence monotonically 87B 
decreasing 
t, 7 afa, A, 7a Convergence monotonically 87B 
increasing 
lim lim a, Limit (of a sequence a,,) 87B, E* 
limsup, lim limsupa, lima, ^ Superior limit (of a sequence a,) 87C* 
lim inf, lim liminfa,, lim a, Inferior limit (of a sequence a,) 87C* 
a` CI E, E, CIE Closure (of a set E) 425B 
Ee Int EŻ, E", Int E Interior (of a set E) 425B 
p. d p(x, y), a(x, y) Distance (between two points 273B* 
x and y) 
| ixil Norm (of x) 37B 
Lim. lim. f, Limit in the mean (of a 168B 


sequence f,) 


Special Notation 


Definition 





Notation Example 
s-lim s-lim x, 
w-lim w-lim x, 
~ f=g 

RI A st 


VIII. Geometry and Algebraic Topology 


E" 
P" 


S" 


T" 
H" 


> 


Sq 


D(u,,...,u,) 
Dress Xa) 


Ou; 


Ox 


j 


Q(u,,...,u,) (Ou; 
0(x,,...,X,) Ox; 


IX. Function Spaces 

















C 


H"(X, A) 


H,(X, A) 


H,(C) 
T,(X) 


OC 

of 
Sqix 
dx) 
fie. 
zy, 


WAN 


dw 


grad o 
rotu 
divu 
Au 
Le 
Do 


det Ou; 
, e Bie 
Ox; 


CQ) 
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Article 
and 
Section 





Strong limit (of a sequence x,) 
Weak limit (of a sequence x,) 


Homotopy (of two mappings 
f and g) 


Homeomorphism (of two 
topological spaces X and Y) 


Euclidean space (of dimension n) 


Projective space 
(of dimension n) 


Spherical surface 
(of dimension n) 


Torus (of dimension n) 


n-dimensional cohomology group 


(of X with coefficients in A) 


n-dimensional homology group 
(of X with coefficients in A) 


(of chain complex C) 


n-dimensional homotopy group 
(of X) 


Boundary (of C) 
Coboundary (of f) 
Streenrod square (of x) 
Steenrod pth power (of x) 
Cup product (of z, and z;) 
Cap product (of z, and z;) 


Exterior product (of two 
differential forms œw and y) 


Exterior derivative (of a 
differential form @) 


Gradient (of a function ¢) 
Rotation (of a vector u) 
Divergence (of a vector u) 
Laplacian (of a function o) 
d'Alembertian (of a function oi 
Differential operator 


Jacobian determinant (of 
(u,,...,U,) with respect to 
Lei, Xn)) 

Jacobian matrix (of (u,,...,u,) 
with respect to (x,,...,X,)) 


Space of continuous functions 
(on Q) 


37B 
37E 
202B 


425G 


140 
343B 


140 


422E 


201H 


201G 


201B 
202J, 170 


201B 
201H* 
64B 
64B 
201I 
201K 
105Q* 


105Q 


442D 
442D 
442D 
323A 
130A 
112A* 
208B 


208B 


168B(1) 
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Special Notation 


Definition 





Notation Example 

L, L,(Q), L,(a, b) 
c C'(L)(1 lx oo) 
2 BQ) 

& &(Q) 


Space of functions such that | f |? 
is integrable on Q 


Space of functions of class C' 


Space of C” functions with 
compact support 


Space of C? functions 


Article 
and 
Section 


168B(2) 


168B(9) 
168B(13) 


168B(13) 


[For (9), A(Q), A,(Q), 4(0)( D, «(0)), BMO(R"), BV(O), c, C, Co(Q), C,,(Q), Co(Q), 2,,(Q), 
Pun (9), Zu»), Emn (O), Zeil, HAR"), HQ), HAO), AS(R"), N (29), X: A" (a9), p, 

L5, (0), m, M(Q), OQ), 6, (0), F, s, S(Q), W(Q)— 168 Function Spaces. For 4(Q) (Space of 
Sato hyperfunctions), 2'(0), &'(Q), O,, Oy, "'(R")— 125 Distributions and Hyperfunctions] 


X. Functions 


|| H 


Re Rez 

Im Imz 

Ie Z 

arg argz 

[] [4] 

O I(x) = O(g(x)) 
0 I(x) —o(gt9)) 
~ f(x) g(x) 

D D(T) 

R R(T) 

supp supp f 

p.v. p.v. f(x)dx 
Pf Pf f f(x)dx 

ó ó(x), Ôx 

exp expx 

log, Log logx, Logx 


sin x, COS x, tan x, sec x, 
cosec x, cotan x 


arcsin x, arccos x, arctan x 


Arcsin x, Arccos x, Arctanx 


Absolute value (of a complex 
number z) 


Real part (of a complex 
number z) 


Imaginary part (of a complex 
number z) 


Complex conjugate (of a complex 
number z) 


Argument (of a complex 
number z) 


Gauss symbol (greatest integer 
not exceeding a real number «) 


Landau's notation (f(x)/g(x) 
is bounded for xo) 


Landau's notation ( f(x)/g(x) 
tends to 0 for xa) 


Infinite or infinitesimal of the 
same order (for xa) 


Domain (of an operator T) 
Range (of an operator T) 
Support (of a function f) 


Cauchy's principal value (of 
an integral) 


Finite part (of an integral) 


Dirac's delta function 
(measure or distribution) 


Exponential function 
(expx = e?) 


Natural logarithmic function and 
its principal value, respectively 


Trigonometric functions 


Inverse trigonometric functions 


Principal value of inverse 
trigonometric functions 


74B* 


74A 


T4A* 


74A 


74C 


83A 


87G 


87G 


87G* 


251A 
251A 
168B(1) 
216D 


125C 
125C* 


113D, 269H 


131D, G 


131E 


131E 
131E 


Special Notation 


Definition 





Notation Example 

sinh x, cosh x, tanh x 

(e E 

r 

P M a 

! n! 

Q9 oín) 

H Hin) 

4 C(z) 

A J,(z) 

D I (x) 

B B(x, y) 

F F(a, B, y; z) 

abe 
P Pad uy 
A! u y 

Li Li(x) 

XI. Probability 

P, Pr P(E), Pr(e) 

E E(X) 

V, c? V(X), a?(X) 

p p(X, Y) 

P(|) P(E|F) 

E(|) E(X|Y) 

N N (m, c?) 
N(u, 2) 

P P(A) 


Hyperbolic functions 
Binomial coefficient, combination 


Permutation 

Factorial (of n) 

Euler function 

Mobius function 

Riemann zeta function 

Bessel function of the first kind 
Gamma function 

Beta function 


Gauss’s hypergeometric function 


Riemann’s P function 


Logarithmic integral 


Probability (of an event) 


Mean or expectation (of a 
random variable X) 


Variance (of a random variable X) 


Correlation coefficient (of two 
random variables X and Y) 


Conditional probability (of an 
event E under the condition F) 


Conditional mean (of a random 
variable X under the condition Y) 


One-dimensional normal 
distribution (with mean m and 
variance c?) 


Multidimensional normal dis- 
tribution (with mean vector u 
and variance matrix X) 


Poisson distribution (with 
parameter A) 


1856 


Article 
and 
Section 
]31F 


330 


330 
330 
295C* 
295C 
450B* 
39B 
174A 
174C 
206A 


253B 


167D 


342B* 
342C 


342C 
342C* 


342E 


342E 


Appendix A, 
Table 22 


Appendix A, 
Table 22 


Appendix A, 
Table 22* 


Systematic List of Articles 


I 
Logic and Foundations 


1. Foundations of Mathematics Art. 156 
2. Axiom Systems 38 

3. Paradoxes 319 

4. Symbolic Logic 411 

5. Axiomatic Set Theory 33 

6. Model Theory 276 

7. Nonstandard Analysis 293 

8. Gódel Numbers 185 

9. Recursive Functions 356 
10. Decision Problem 97 
11. Constructive Ordinal Numbers 81 
12. Analytic Sets 22 


II 
Sets, General Topology, and Categories 


1. Sets 381 

2. Relations 358 

3. Equivalence Relations 135 

4. Functions 165 

5. Axiom of Choice and Equivalents 34 

6. Cardinal Numbers 49 

7. Structures 409 

8. Permutations and Combinations 330 
9. Numbers 294 

10. Real Numbers 355 

11. Complex Numbers 74 

12. Ordering 311 

13. Ordinal Numbers 312 

14. Lattices 243 

15. Boolean Algebras 42 

16. Topological Spaces 425 

17. Metric Spaces 273 

18. Plane Domains 333 

19. Convergence 87 

20. Connectedness 79 

21. Dimension Theory 117 

22. Uniform Spaces 436 

23. Uniform Convergence 435 

24. Categories and Functors 52 


25. Inductive Limits and Projective Limits 
210 


26. Sheaves 383 


Il 
Algebra 


1. Algebra 8 
2. Matrices 269 
3. Determinants 103 
4. Polynomials 337 
5. Algebraic Equations 10 
6. Fields 149 
7. Galois Theory 172 
8. Linear Spaces 256 

9. Rings 368 

10. Associative Algebras 29 
11. Commutative Rings 67 
12. Noetherian Rings 284 

13. Rings of Polynomials 369 
14. Rings of Power Series 370 


15. Quadratic Forms Art. 348 
16. Clifford Algebras 61 

17. Differential Rings 113 

18. Witt Vectors 449 

19. Valuations 439 

20. Adeles and Ideles 6 

21. Cayley Algebras 54 

22. Jordan Algebras 231 

23. Modules 277 

24. Homological Algebra 200 
25. Hopf Algebras 203 
Appendix A, Table 1. Algebraic Equations 


IV 
Group Theory 


. Groups 190 

. Abelian Groups 2 

. Free Groups 161 

. Finite Groups 151 

. Classical Groups 60 

. Topological Groups 423 

. Topological Abelian Groups 422 
. Compact Groups 69 

. Lie Groups 249 

. Lie Algebras 248 

. Algebraic Groups 13 

. Homogeneous Spaces 199 


. Symmetric Riemannian Spaces and Real 
Forms 412 


14. Discontinuous Groups 122 

15. Crystallographic Groups 92 
16. Representations 362 

17. Unitary Representations 437 
18. Invariants and Covariants 226 
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Appendix A, Table 5. Lie Algebras, Symmetric 


Riemannian Spaces, and Singularities 


Appendix B, Table 5. Characters of Finite 
Groups and Crystallographic Groups 


V 
Number Theory 


. Number Theory 296 

. Number Theory, Elementary 297 
. Continued Fractions 83 

. Number-Theoretic Functions 295 
. Additive Number Theory 4 

. Partitions of Numbers 328 

. Distribution of Prime Numbers 123 
. Lattice-Point Problems 242 

. Diophantine Equations 118 

. Geometry of Numbers 182 

. Transcendental Numbers 430 

. Quadratic Fields 347 

. Algebraic Number Fields 14 

. Class Field Theory 59 

. Complex Multiplication 73 

. Fermat's Problem 145 

. Local Fields 257 

. Arithmetic of Associative Algebras 
19. Zeta Functions 450 


Appendix B, Table 1. Prime Numbers and 
Primitive Roots 


Appendix B, Table 2. Indices Modulo p 


— e — — mA - x x ` Feet 


Systematic List of Articles 


Appendix B, Table 4. Class Numbers of 
Algebraic Number Fields 


VI 
Euclidean and Projective Geometry 


1. Geometry Art. 181 
. Foundations of Geometry 155 
. Euclidean Geometry 139 
. Euclidean Spaces 140 
. Geometric Construction 179 
. Regular Polyhedra 357 
. Pi (x) 332 
. Trigonometry 432 
9. Conic Sections 78 
10. Quadric Surfaces 350 
11. Convex Sets 89 
12. Vectors 442 
13. Coordinates 90 
14. Projective Geometry 343 
15. Affine Geometry 7 
16. Non-Euclidean Geometry 285 
17. Conformal Geometry 76 
18. Erlangen Program 137 
19. Continuous Geometry 85 
20. Curves 93 
21. Surfaces 410 
22. Four-Color Problem 157 
Appendix A, Table 2. Trigonometry 


Appendix A, Table 3. Vector Analysis and 
Coordinate Systems 
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VII 
Differential Geometry 


. Differential Geometry 109 
. Differentiable Manifolds 105 
. Riemannian Manifolds 364 
. Connections 80 

. Tensor Calculus 417 

. Geodesics 178 

. Symmetric Spaces 413 

. G-Structures 191 

. Complex Manifolds 72 

. Kahler Manifolds 232 

. Harmonic Integrals 194 


. Differential Geometry of Curves and 
Surfaces 111 


. Riemannian Submanifolds 365 
. Minimal Submanifolds 215 

. Harmonic Mappings 195 

. Morse Theory 279 


. Differential Geometry in Specific Spaces 
110 


. Finsler Spaces 152 

. Integral Geometry 218 

. Siegel Domains 384 

. Spectral Geometry 391 

. Pseudoconformal Geometry 344 
23. Global Analysis 183 

Appendix A, Table 4. Differential Geometry 
VIII 

Algebraic Geometry 

1. Algebraic Geometry 12 

2. Algebraic Curves 9 
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Algebraic Surfaces Art. 15 
Algebraic Varieties 16 
Abelian Varieties 3 
Riemann-Roch Theorems 366 


Topology 


. Topology 426 

. Fundamental Groups 170 

. Covering Spaces 91 

. Degree of Mapping 99 

. Complexes 70 

. Homology Theory 201 

. Fixed-Point Theorems 153 
. Cohomology Operations 64 
. Homotopy Theory 202 

. Fiber Spaces 148 

. Obstructions 305 

. Topology of Lie Groups and Homo- 


geneous Spaces 427 


. Fiber Bundles 147 

. Characteristic Classes 56 

. K-Theory 237 

. Knot Theory 235 

. Combinatorial Manifolds 65 
. Differential Topology 114 

. Transformation Groups 431 
. Theory of Singularities 418 

. Foliations 154 

. Dynamical Systems 126 

. Shape Theory 382 

24. 


Catastrophe Theory 51 


Appendix A, Table 5. Lie Algebra, Symmetric 
Riemannian Spaces, and Singularities 


Appendix A, Table 6. Topology 
Appendix A, Table 7. Knot Theory 


X 


Analysis 


1. 
. Continuous Functions 84 

. Inequalities 211 

. Convex Analysis 88 

. Functions of Bounded Variation 166 
. Differential Calculus 106 

. Implicit Functions 208 

. Elementary Functions 131 

. C”-Functions and Quasi-Analytic 
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Analysis 20 


Functions 58 


. Integral Calculus 216 
. Curvilinear Integrals and Surface Integrals 


94 


. Measure Theory 270 

. Integration Theory 221 

. Invariant Measures 225 

. Set Functions 380 

. Length and Area 246 

. Denjoy Integrals 100 

. Series 379 

. Asymptotic Series 30 

. Polynomial Approximation 336 
. Orthogonal Functions 317 
. Fourier Series 159 


1859 


23. 
24. 
25. 
26. 
27. 
28. 
29. 


30. 
31. 
32. 
33. 
34. 
35. 


Fourier Transform Art. 160 
Harmonic Analysis 192 
Almost Periodic Functions 18 


Laplace Transform 240 
Integral Transforms 220 
Potential Theory 338 


Harmonic Functions and Subharmonic 
Functions 193 


Dirichlet Problem 
Capacity 48 
Calculus of Variations 46 
Plateau's Problem 334 
Isoperimetric Problems 
Variational Inequalities 


120 


228 
440 


Appendix A, Table 8. Inequalities 


Appendix A, Table 9. Differential and Integral 
Calculus 


Appendix A, Table 10. Series 
Appendix A, Table 11. Fourier Analysis 


Appendix A, Table 12. Laplace Transforms 
and Operational Calculus 


Appendix A, Table 20. Systems of Orthogonal 
Functions 


XI 


Complex Analysis 


21. 
22. 


. Holomorphic Functions 
. Power Series 

. Dirichlet Series 
. Bounded Functions 43 

. Univalent and Multivalent Functions 


. Transcendental Entire Functions 
. Meromorphic Functions 
. Distribution of Values of Functions of 


198 
339 
121 


438 
429 
212 


a Complex Variable 124 


. Cluster Sets 62 

. Algebraic Functions 11 

. Algebroidal Functions 17 
. Riemann Surfaces 
. Ideal Boundaries 
. Conformal Mappings TE 


367 
207 


. Quasiconformal Mappings 352 

. Teichmüller Spaces 416 

. Kleinian Groups 234 

. Extremal Length 143 

. Function- Theoretic Null Sets 169 


. Analytic Functions of Several Complex 


Variables 21 
Analytic Spaces 23 
Automorphic Functions 32 


Appendix A, Table 13. Conformal Mappings 


XII 
Functional Analysis 


l. 
. Hilbert Spaces 
. Banach Spaces 37 

. Ordered Linear Spaces 

. Topological Linear Spaces 
. Function Spaces 
. Distributions and Hyperfunctions 
. Vector-Valued Integrals 
. Linear Operators 
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Functional Analysis 162 


197 


310 
424 
168 
125 
443 
251 





Systematic List of Articles 


10. 


11. 
12. 
13. 
14. 


15. 
16. 
17. 
18. 
19. 
20. 


21 


. Nonlinear Functional Analysis 


Compact and Nuclear Operators 
Art. 68 


Interpolation of Operators 
Spectral Analysis of Operators 390 
Perturbation of Linear Operators 331 


Semigroups of Operators and Evolution 
Equations 378 


Differential Operators 
Microlocal Analysis 
Banach Algebras 36 
Function Algebras 164 
Operator Algebras 308 
Operational Calculus 306 


224 


112 
274 


286 


XIII 
Differential, Integral, and Functional Equations 


L 
2. 
3. 


. Nonlinear Oscillation 
. Nonlinear Problems 

. Stability 
. Integral Invariants 

. Difference Equations 
. Functional-Differential Equations 
. Total Differential Equations 

. Contact Transformations 82 
. Partial Differential Equations 

. Partial Differential Equations (Methods 


. Monge-Ampere Equations 
. Partial Differential Equations of Elliptic 


Differential Equations 107 
Ordinary Differential Equations 313 


Ordinary Differential Equations (Initial 
Value Problems) 316 


. Ordinary Differential Equations (Bound- 


ary Value Problems) 315 


. Ordinary Differential Equations (Asymp- 


totic Behavior of Solutions) 314 


. Linear Ordinary Differential Equations 


252 


. Linear Ordinary Differential Equations 


(Local Theory) 254 


. Linear Ordinary Differential Equations 


(Global Theory) 253 


. Nonlinear Ordinary Differential Equations 


(Local Theory) 289 


. Nonlinear Ordinary Differential Equations 


(Global Theory) 288 

290 
291 
394 
219 

104 

163 
428 


320 


of Integration) 322 


. Partial Differential Equations (Initial 


Value Problems) 321 


. Partial Differential Equations of First 


Order 324 


278 


Type 323 

. Partial Differential Equations of Hyper- 
bolic Type 325 

. Partial Differential Equations of Parabolic 
Type 327 

. Partial Differential Equations of Mixed 
Type 326 

. Green's Functions 188 

. Green's Operators 189 

. Integral Equations 217 


. Integrodifferential Equations 222 
. Special Functional Equations 388 
. Pseudodifferential Operators 345 


Systematic List of Articles 


Appendix A, Table 14. Ordinary Differential 
Equations 


Appendix A, Table 15. Total and Partial 
Differential Equations 


XIV 
Special Functions 


1. Special Functions Art. 389 


. Generating Functions 177 
. Elliptic Functions 134 
. Gamma Function 174 


. Hypergeometric Functions 206 

. Spherical Functions 393 

. Functions of Confluent Type 167 

. Bessel Functions 39 

. Ellipsoidal Harmonics 133 

10. Mathieu Functions 268 

Appendix A, Table 16. Elliptic Integrals and 
Elliptic Functions 

Appendix A, Table 17. Gamma Function and 
Related Functions 

Appendix A, Table 18. Hypergeometric Func- 
tions and Spherical Functions 

Appendix A, Table 19. Functions of Confluent 
Type and Bessel Functions 

Appendix B, Table 3. Bernoulli Numbers and 
Euler Numbers 
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XV 
Numerical Analysis 


1. Numerical Methods 300 
2. Interpolation 223 

3. Error Analysis 138 

4 


. Numerical Solution of Linear Equations 
302 


5. Numerical Solution of Algebraic Equa- 
tions 301 


6. Numerical Computation of Eigenvalues 
298 


7. Numerical Integration 299 


8. Numerical Solution of Ordinary Differ- 
ential Equations 303 


9. Numerical Solution of Partial Differ- 
ential Equations 304 


10. Analog Computation 19 

11. Evaluation of Functions 142 

Appendix A, Table 21. Interpolation 

Appendix B, Table 6. Miscellaneous Constants 


XVI 
Computer Science and Combinatorics 


1. Automata 31 

2. Computers 75 

3. Coding Theory 63 

4. Cybernetics 95 

5. Random Numbers 354 

6. Simulation 385 

7. Data Processing 96 

8. Mathematical Models tn Biology 263 
9. Complexity of Computations 71 
10. Combinatorics 66 

11. Latin Squares 241 

12. Graph Theory 186 
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XVII 
Probability Theory 


. Probability Theory Art. 342 


2. Probability Measures 341 
3. Limit Theorems in Probability Theory 
250 
4. Stochastic Processes 407 
5. Markov Processes 261 
6. Markov Chains 260 
7. Brownian Motion 45 
8. Diffusion Processes 115 
9. Additive Processes 5 
10. Branching Processes 44 
11. Martingales 262 
12. Stationary Processes 395 
13. Gaussian Processes 176 
14. Stochastic Differential Equations 406 
15. Ergodic Theory 136 
16. Stochastic Control and Stochastic Fil- 
tering 405 
17. Probabilistic Methods in Statistical Me- 
chanics 340 
XVIII 
Statistics 
1. Statistical Data Analysis 397 
2. Statistical Inference 401 
3. Statistic 396 
4. Sampling Distributions 374 
5. Statistical Models 403 
6. Statistical Decision Functions 398 
7. Statistical Estimation 399 
8. Statistical Hypothesis Testing 400 
9. Multivariate Analysis 280 
10. Robust and Nonparametric Methods 


11. 
. Design of Experiments 102 

. Sample Survey 373 

. Statistical Quality Control 404 
. Econometrics 128 

. Biometrics 40 

. Psychometrics 346 

18. 
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371 
Time Series Analysis 421 


Insurance Mathematics 214 


Appendix A, Table 22. Distribution of Typical 
Random Variables 

Appendix A, Table 23. Statistical Estimation 
and Statistical Hypothesis Testing 


XIX 
Mathematical Programming and Operations 
Research 
1. Mathematical Programming 264 
2. Linear Programming 255 
3. Quadratic Programming 349 
4. Nonlinear Programming 292 
5. Network Flow Problems 2&1 
6. Integer Programming 215 
7. Stochastic Programming 408 
8. Dynamic Programming 127 
9. Game Theory 173 
10. Differential Games 108 
11. Control Theory 86 


1861 


12. Information Theory Art. 213 
13. Operations Research 307 
14. Inventory Control 227 


15. Scheduling and Production Planning 
376 


XX 
Mechanics and Theoretical Physics 


1. Systems of Units 414 
2. Dimensional Analysis 116 
3. Variational Principles 441 
4. Mechanics 271 
5. Spherical Astronomy 392 
6. Celestial Mechanics 55 
7. Orbit Determination 309 
8. Three-Body Problem 420 
9. Hydrodynamics 205 
10. Hydrodynamical Equations 204 
11. Magnetohydrodynamics 259 
12. Turbulence and Chaos 433 
13. Wave Propagation 446 
14. Oscillations 318 
15. Geometric Optics 180 
16. Electromagnetism 130 
17. Networks 282 
18. Thermodynamics 419 
19. Statistical Mechanics 402 
20. Boltzmann Equation 41 
21. Relativity 359 
22. Unified Field Theory 434 
23. Quantum Mechanics 351 
24. Lorentz Group 258 
25. Racah Algebra 353 
26. Scattering Theory 375 
27. Second Quantization 377 
28. Field Theory 150 
29. S-Matrices 386 
30. Feynman Integrals 146 
31. Elementary Particles 132 
32. Renormalization Group 361 
33. Nonlinear Lattice Dynamics 287 
34. Solitons 387 


35. Approximation Methods in Physics 
25 


36. Inequalities in Physics 212 
XXI 
History of Mathematics 
1. Ancient Mathematics 24 
2. Greek Mathematics 187 


. Roman and Medieval Mathematics 
372 


4. Arab Mathematics 26 

5. Indian Mathematics 209 
6. Chinese Mathematics 57 
7 
8 


we) 


. Japanese Mathematics (Wasan) 230 


. Renaissance Mathematics 360 
9. Mathematics in the 17th Century 
10. Mathematics in the 18th Century 
11. Mathematics in the 19th Century 
12. Abel, Niels Henrik 1 
13. Artin, Emil 28 





Systematic List of Articles 


. Bernoulli Family 

. Cantor, Georg 47 
. Cartan, Elie 50 
. Cauchy, Augustin Louis 53 

. Dedekind, Julius Wilhelm Richard 98 
. Descartes, René 101 

. Dirichlet, Peter Gustav Lejeune 119 
. Einstein, Albert 129 

. Euler, Leonhard 141 

. Fermat, Pierre de 144 

. Fourier, Jean Baptiste Joseph 158 

. Galois, Evariste 171 

. Gauss, Carl Friedrich 175 

. Gódel, Kurt 184 

. Hilbert, David 196 

. Jacobi, Carl Gustav Jacob 229 

. Klein, Felix 233 

. Kronecker, Leopold 236 

. Lagrange, Joseph Louis 238 

. Laplace, Pierre Simon 239 

. Lebesgue, Henri Léon 244 

. Leibniz, Gottfried Wilhelm 245 

. Lie, Marius Sophus 247 

. Newton, Isaac 283 

. Pascal, Blaise 329 

. Poincaré, Henri 335 

. Riemann, Georg Friedrich Bernhard 


Art. 38 


363 


. Takagi, Teiji 415 

. Viète, Francois 444 

. Von Neumann, John 445 
. Weterstrass, Karl 447 

. Weyl, Hermann 448 


Alphabetical List of Articles 
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. Abel, Niels Henrik 

. Abelian Groups 

. Abelian Varieties 

. Additive Number Theory 
. Additive Processes 

. Adeles and Ideles 

. Affine Geometry 

. Algebra 

. Algebraic Curves 

. Algebraic Equations 

. Algebraic Functions 

. Algebraic Geometry 

. Algebraic Groups 

. Algebraic Number Fields 
. Algebraic Surfaces 
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C, R 125.W 320 H 418.r 428.H 

Malliavin, Paul (1925-)  115.D, r 192.M 406.E, r 

Malmquist, Johannes (1882-1952) 254.D 288.B, C, 
r 289.B-D 314.4 

Mal'tsev, Anatol" Ivanovich (1909-67)  29.F 249.S 
276.D 

Malus, Étienne Louis (1775-1812) 180.A 

Mandelbaum, Richard (1946-) 114r 

Mandelbrojt, Szolem (1899-1983) 58.F 134.C,r 
339.A, r 

Mandelbrot, Benoit B. (1924-) 246.K 433.r 

Mandelstam, Stanley (1928-) 132.C 

Mañé, Ricardo 126.J 

Mangasarian, Olvi L. (1934-) 292.D,r 

Mangoldt, Hans Carl Friedrich von (1854-1925) 
123.B 450.B 

Manin, V. G. 80r 

Manin, Yurii Ivanovich (1937-)  16.J, V 80.r 118.E 
387.r 450.J, M 

Mann, Henry Berthold (1905-) 4.A 371.A, C 421.r 

Mann, Larry N.(1934-) 364.F 

Manna, Zohar(1939-) Jär 

Mannheim, Amédée (1831-1906) 111.F 

Manning, Anthony Kevin (1946-) 51.r 126.5, K 

Mansfield, Richard B.(1941-) 22.F 

Maranda, Jean-Marie A. (?—1971) 362.K 

Marchand, Jean-Paul 375.r 

Marchenko, Vladimir Aleksandrovich (1922-) 
287.C 387.D 

Marchuk, Gurii Ivanovich (1925-)  304.r 

Marcinkiewicz, Jozef (1910-) 159.H 224.A, E 
336.E 

Marden, Morris(1905-) 10.r 

Mardesic, Sibe (1927-) 382.A 

Margulis, Gregorii A. (1946-) 122.G 

Marion, Jerry Baskerville (1929-)  271.r 

Markov, Andrei Andreevich (1856-1922) 5.H 
44.D, E 126.F, J 127.E 136.B, D, G 150.F 176.F 
182.G 260.A, H, J 261.A, B 336.C 340.C 379.1 
403.E 405.C 406.D 407.B 

Markov, Andrei Andreevich (1903-) 31.B 161.B 
356.r 

Markus, Lawrence J. (1922 -) 86r 126A, H, L,r 
291.r 

Markwald, Werner 81.A,r 

Marotto, Frederick Robert (1950-) 126 

Marsden, Jerrold E. (1942-)  126.r 183.r 271.r 286.r 
316.r 364.H 420.r 

Marshall, Donald E. 164.I 

Marsten, Roy Earl (1942-) 215.r 

Martin, Andre (1931-) 150.D 386.B, r 

Martin, Donald A. 22D, F, H, r 33.F, £ 

Martin, Harold C. 304r 

Martin, Paul C. 308.H 

Martin, Robert S. 207.C, D 260.1 

Martin, William Ted (1911-) 2t.r 

Martineau, André (1930-72) 125.W, Y, r 162 168.B 
424.X 

Martinet, Jean 110.E 

Martin-Lóf, Per (1942-) 354.r 

Martio, Olli Tapani (1941-) 352.F 

Marty, F. ()-1939). 272.H 435.E 

Maruyama Gisiro (1916-86) 115.D 136.D, E 250.r 
260.] 395.r 

Maruyama, Masaki(1944-) 16.Y 

Maruyama, Toru (1949-) 443.A 

Masani, Pesi R.(1919-) 395.r 

Mascheroni, Lorenzo (1750~1800) 179.B 


1898 


Maschke, Heinrich (1853-1908) 362.G 

Maschler, Michael (1927-) 173.D 

Maskit, Bernard (1935-) 122.1 234.D, r 416.r 

Masley, John M.(1947-) 14.L 

Maslov, Viktor Pavlovich (1930-) 30.r 274.C, I 
345.r 

Massau, Junius (1852-1909) 19.B 

Masser, David W. 134.r 430.D, r 

Massey, William S. (1920-) 91.r 170.r 201.r 
202.M, P 410.r 

Masuda, Kazuo (1946-) 72.r 

Masuda, Kyüya (1937-) 378.1, J 

Masuyama, Motosaburo (1912-) STR 

Matano, Hiroshi (1952-) 263.C 303.G, r 

Mather, John Norman (1942-)  51.C-E 126J 
154.E, r 286.J 418.r 

Mathews, George Ballard (1861-1922) 39.r 

Mathieu, Emile Léonard (1835-90) 151.H 268.A-D 

Matiyasevich, Yurii Vladimirovich 97.*,r 118.A 
196 

Matsuda, Michihiko (1938-) 428.G 

Matsumoto, Hideya (1939-) 13.R 122.F 

Matsumoto, Kazuo (1922-) 411J 

Matsumoto, Kikuji (1931-) 62.B 124 C, r 

Matsumoto, Shigenori (1947-) 126.M 

Matsumoto, Yukio (1944-) 65.D 114K 

Matsumura, Akitaka (1951—) 41.r 204.F 

Matsumura, Hideyuki (1930-) 284. 

Matsusaka, Teruhisa (1926-) 12.B 16.P, W,r 

Matsushima, Yozo (1921-83) 32.r 122.F 199.r 
249.r 384.r 

Matsushita, Shin-ichi (1922-) 338.L 

Matsuyama, Noboru (1916—) 310.r 

Mattis, Daniel Charles 402.r 

Mattuck, Arthur Paul(1930-) 118.E 450.P 

Matuda, Tizuko (1923-)  30.r 288.B, r 289.r 

Matumoto, Takao (1946-) 65.C 114.K 

Matunaga Yosisuke (1692?-1747) 230 332 

Matuzaka, Kazuo (1927-) 7.r 343.r 

Matveev, Vladimir Borisovich 387.r 

Maunder, Charles Richard Francis 291.r 

Maupertuis, Pierre Louis Moreau de (1698-1759) 
180.A 441.B 

Maurer, Ludwig (1859-?) 249.R 

Maurus (c. 780-c. 856) 372 

Mautner, Friedrich Ignaz (1921-) 136.G 308.G 
437.EE 

Mawhin, Jean 290r 

Maxfield, John E. (1927-) NTR 

Maxwell, Albert Ernest 280.r 346.F, r 

Maxwell, George (1946—) 92.r 

Maxwell, James Clerk (1831-79) 51.F 130.A 150.A 
180.A 393.D 402.B, H 419.B 

Maxwell, William L. (1934-) 376.r 

May, J. Peter (1939—) 70.r 


‘May, Kenneth Ownsworth (1915-77) | 157.r 


May, Robert McCredie (1936—) 126.N 263.D, r 

Mayer, Dieter H. 402.G, r 

Mayer, Karl Heinz (1936-) 431.r 

Mayer, Walter 111.r 201.C, E, L 

Maynard, Hugh B. 443.H 

Mazet, Edmond  109.r 115.r 391.r 

Mazur, Barry C. (1937-) 16.r 37.C 65.C, G 114.C 
126.K 426 450.], r 

Mazur, Stanisław (1905-81) 36.E 

Mazurkiewicz, Stefan (1888-1945) 22.C 93.D 426 

McAndrew, Michael H. 126.K 136.H 

McAuley, Van A. 301.E 

McBride, Elna Browning = 177.r 

McCarthy, John (1927-) 31.C 


1899 


McCoy, Barry Malcolm (1940-) 402.r 

McCoy, Neal Henry (1905—) | 368.r 

McCracken, Marsden 126.r 

McDuff, Dusa Waddington (1945—) 308.F 

McGregor, James 263.E 

McKay, John 151.1 

McKean, Henry P., Jr. (1930-) 41.C 45... r 115.A,r 
176.F, K 260.J 261.A, r 323.M 340.F 387.E, r 
391.B, C, K, r 406.r 407.B 

McLachlan, Norman William (1888-) 268.r 

McLaughlin, Jack (1923-) 151.1 

McMillan, Brockway (1915-) 136 E 213.D 

McShane, Edward James (1904-) 3101, r 

Meeks, William Hamilton, IHI 235.E 275.C, D 

Mehler, Ferdinand Gustav (1835-95) App. A, 
Tables 18.11, 19.11 

Mehra, Raman K. 86.r 

Meinardus, Günter (1926-) 328 

Meinhardt, Hans 263.D 

Meixner, Josef (1908—) | 268.r 389.r 

Melin, Anders (1943-) 345.A 

Mellin, Robert Hjalmar (1854-1933) 206.D 220.C 

Melrose, Richard B. (1949—) 325.M 

Menaechmus (375-325 B.C.) 187 

Mendelson, Elliot (1931-) 33.D,r319.r 

Menelaus (of Alexandria) (fl. 98?) 7.A 187 

Menger, Karl (1902-) 93.D 1I7.A, B, D, r 426 

Menikoff, Arthur S.(1947-) 325.H 

Men'shov, Dmitri Evgen'evich (1892-) 77.A 159.] 
198.A, r 317.B 

Méray, Hugues Charles Robert (1835-1911) 267 

Mercer, J. 217.H 

Mergelyan, Sergei Nikitovich (1928-) 164.J 336.F 
367.G 

Merluzzi, P. 303r 

Merman, G. A. 420.D 

Mersenne, Marin (1588—1647) | 297.E App. B, 
Table 1 

Mertens, Franz Carl Josef (1840-1927) 123.A 
379.F 

Meschkowski, Herbert (1909-) 188.r 

Meshalkin, Lev Dmitrievich (1934-) 136.E 

Messiah, Albert M. L.(1921-) 351.r 

Messing, William (1945-) 450.Q 

Métivier, Michel (1931-) 443.H 

Meusnier, Jean Baptiste Marie Charles (1754—93) 
109 111.H 275.A 334.B 

Meyer, Franz (1856-1934) 267 

Meyer, Kenneth R. (1937-) 126.K, I, M 

Meyer, Paul-André (1934-)  261.r 262.D, r 406.r 
407.B, r 

Meyer, Wolfgang T. (1936—-) 109.r 178.r 279.G, r 

Meyer, Yves 25l.r 

Michael, Ernest Arthur (1925-)  425.X, Y, AA, CC 

Michel, René 178.r 304.r 

Michelson, Albert Abraham (1852-1931)  359.A 

Migdal, A. A. 361.C 

Mikami, Yoshio (1875-1950) 230r 

Mikhailov, A. V. 387.G 

Mikhlin, Solomon Grigor'evich (1908-) 46.r 217.r 

Mikusinski, Jan G. (1913-) 306.A, B 

Miles, G. 136. E 

Milgram, Arthur N. (1912-)  112J 

Milgram, R. James (1939-) Gär 

Miller, Charles F., IH (1941-) 97.r 

Miller, David Charles (1918-) 291.F 

Miller, K. S. 104r 

Miller, Louis W 376.1 389.r 

Millett, Kenneth Cary (1941—) 154.H 

Mills, Robert L. (1927-) 80.Q, r 150.G 











Name Index 
Moldestad, Johan 


Mil'man, David Pinkhusovich (1913-) 37.G 424.U 
443.H 

Milne, James Stuart (1942-) 450.r 

Milne, William Edmund (1890-) 303.E 

Milne- Thomson, Louis Melville (1891—1974)  104.r 
NTR 

Milnor, John Willard (1931-) 56.F, r 65.C, E 70.C 
99.r 109.r 111.r 114.A-C, F-K, r 126.N 147.H, P 
154.H 178.F 237.5, r 365.O 391.C 418.D, r 426 
App. A, Table 5.V 

Mimura, Masayasu (1941—) 263.D 

Mimura, Yositaka (1898-1965) 434.C, r 

Mimura, Yukio (1904-84) 162 

Minakshisundaram, Subbaramiah (1913-68)  121.r 
323.M 379.r 391.B, r 

Minemura, Katsuhiro (1945-) 437r 

Minkowski, Hermann (1864-1909) 14.B, D, U 
89.D, E H8.C 122.E, F 182A, C-E, r 196 211.C 
255.B 258.A 296.A 348.D, G, K 359.B App. A, 
Table 8 

Minlos, Robert Adol'fovich (1931—) | 258.r 341.J 
424.T 

Minorsky, Nicholas (1885-) 163.B 

Minsky, Marvin C. 385r 

Minsky, M. L. 75.r 

Minty, George James (1929-)  281.r 286.C 

Miranda, Carlo (1912-82) 323r 

Mirimanov, D. 145 

Mishchenko, Evgenti Frolovich (1922-) 86.r 

Mishkis, Anatol" Dmitrievich 163.B 

Misiurewicz, Michał (1948—) 126.K 

Misner, Charles William (1932-) | 359.r 

Mitchell, Andrew Ronald 223.r 

Mitchell, Benjamin Evans (1920-)  52.N, r 200.1 

Mitome, Michiwo (1909-76) STR 

Mitropol’skii, Yurii Alekseevich (1917-) 290.D, F 

Mitsui, Takayoshi (1929-) 4.F.r 123.F 328 

Mittag-Leffler, Gustav Magnus (1846—1927) 47 267 
272.A 

Mityagin, Boris Samuilovich (1937-) 424.8 

Miura, Robert Mitsuru (1938-) 387.B 

Miwa, Megumu (1934-) 118.E 

Miwa, Tetsuji (1949--) 112.R 253.E 387.C 

Miyadera, Isao (1925-) 162 378.B 

Miyajima, Kimio (1948-) 72.G 

Miyajima, Shizuo ({948—-) 310.H 

Miyakawa, Tetsuro (1948—) 204.C 

Miyake, Katsuya (1941—) 16.2 

Miyakoda, Tsuyako (1947-) 301.C 

Miyanishi, Masayoshi (1940—) 15.H,r 

Miyaoka, Yoichi (1949—) 72.K,r 

Miyata, Takehiko (1939-83) 226.r 

Miyoshi, Tetsuhiko (1938-)  304.r 

Mizohata, Sigeru (1924-) 112.B, D, P 274. B, G, T 
320.1, r 321.F, G, r 323.M 325.G, H 345.A 

Mizukami, Masumi (1951—) 16r 

Mizumoto, Hisao (1929-) 367.1 

Mizutani, Akira (1946-)  304.r 

Mizutani, Tadayoshi (1945-) 154.G 

Mobius, August Ferdinand (1790-1868) 66.C 74.E 
76.A 267 295.C 410.B 

Moedomo, S. 443.H 

Mohr, Georg (1640-97) 179.B 

Moise, Edwin Evariste (1918-) 65.C 70.C 79.D 93r 
139.r 410.r 

Moiseiwitsch, Benjamin Lawrence (1927-)  44l.r 

Moishezon, Boris Gershevich (1937—) 16.E, U, W 
72. 

Molchanov, Stanislav Alekseevich 115.D 340.r 

Moldestad, Johan (1946-) 356.F,r 


Name Index 
Moler, Cleve B. 


Moler, Cleve B. (1939-)  298.r 302.r 

Monge, Gaspard (1746-1818) 107.B 109 158 181 
255.E 266 267 278.A 324.F 

Monin, Andrei Sergeevich 433.r 

Montel, Paul Antoine Aristide (1876-1975) 272.F 
424.0 43S.E, r 

Montgomery, Deane (1909-) 196 249.V, r 423.N 
431.r 

Montgomery, Hugh L. (1944-) 14.L I23.E, r 

Montucla, Jean Etienne (1725-99) 187.r 

Mook, Dent T. 290.r 

Moon, Philip Burton (1907-) 130.r 

Moore, Calvin C. (1936-) 122.F 

Moore, Eliakim Hastings (1862-1932) 87.H, K, r 

Moore, John Colemar (1923-) 147.r 200.r 203.r 

Moore, John Douglas 365.J 

Moore, Robert Lee (1882-1974) 65.F 273.K 
425.AA 426 

Moran, Patrick Alfred Pierce (1917-) 218.r 

Morawetz, Cathleen Synge (1923-) 112.8 345.A 

Mordell, Louis Joel (1888-1972) 118.A, E 

Morera, Giacinto (1856—1909) 198.A 

Morf, Martin (1944-) 86.r 

Morgan, Frank 275.C 

Morgenstern, Oskar (1902-77) 173.A, D 376r 

Mori, Akira (1924-55) 352.B, C 367.E 

Mori, Hiroshi (1944—) 275.F 

Mori, Mitsuya (1927-) 59.H 

Mori, Shigefumi (1951—) 16.R, r 364.r 

Mori, Shin'ichi (1913—) 207.C, r 

Mori, Shinziro (1893—1979) 284.G 

Moriguti, Sigeiti (1916-) 299.B 389.r NTR 

Morimoto, Akihiko (1927-) 110.E 126.J 344.C 

Morimoto, Haruki (1930-) 399.r 

Morimoto, Hiroko (1941—) 224.F 

Morimoto, Mituo (1942-) 125.BB, DD 162 

Morimune, Kimio (1946-) 128.C 

Morishima, Taro (1903-) 145.* 

Morita, Kiiti(1915—) 8 117.A, C, E, r 273.K 425.8, 
X-Z, CC 

Morita, Masato (1927-) 353.r 

Morita, Reiko (1934-) 353.r 

Morita, Shigeyuki (1946-) 154.G 

Morita, Yasuo (1945-) 450.U 

Moriya, Mikao (1906-82) 59.G,H 

Morlet, Claude 147.Q 

Morley, Edward Williams (1838-1923) 359.A 

Morley, Michael 276.F,r 

Morrey, Charles Bradfield, Jr. (1907-84)  46.r 78.r 
112.D 125.A 194.F, r 195 246.C 275.A, C, r 323r 
334.D 350.r 352.B 

Morris, Peter D 443.H 

Morrow, James 72.K 

Morse, Harold Marston (1892-1977) 109 114.A, 
F 126.J 178.A 275.B 279.A-F 286.N, Q, r 418.F 

Morse, Philip McCord (1903-)  25.r 133.r 227.r 

Morton, Keith W.(1930-) 304.r 

Moschovakis, Yiannis Nicholas (1938-) 22.D, F, 
H, r 33.r 356.G, r 

Moser, Jürgen (Kurt) (1928-) 21.P 55.r 126A, L, r 
136.r 286.J, r 323.L 344.B 420.C, G 

Moser, William O. J. (1927-) 92.r 122.r 151.r 161.r 

Mosher, Robert E 64.r 70.r 

Mosteller, (Charles) Frederick (1916-) 346.C, G 

Mostow, George Daniel (1923—) 13.r32.r 122.F, G 
249.r 

Mostowski, Andrzej (1913-75) 33.D, r 356.C, H 

Motohashi, Yoichi (1944-) 123.E 

Motoo, Minoru (1927-) 44.E 115.C, D, r26l.r 

Moulin, M. 375.F 





1900 


Moulton, Forest Ray (1872-1952) 55.r 303.E 

Moussu, Robert (1941-) 154.H 

Moyal, José E. 44r 

Muchnik, Albert Aramovich (1934—-) 356.D 

Mugibayashi, Nobumichi (1926-) 125.BB 

Muhly, Paul Scott (1944—) 164.H 

Muirhead, Robb John (1946-) 280.r 

Mukherjee, Bishwa Nath 346.r 

Müller, Claus Ernst Friedrich (1920-) 323.J 393.r 

Muller, David Eugene (1924-) 301.C 

Muller, Werner (1949-) 391.M 

Moller. Breslau, Heinrich Franz Bernhard (1851— 
1925) 19.r 

Mullikin, Thomas Wilson (1928-) 44r 

Mullis, Clifford T. (1943-) Sen 

Munford, David Bryant (1937-) 3.A, N, r9, r 
12.B IE, F, r IGR, W, Y, Z, r 32r 72.G 226r 
418.D 

Munkres, James Raymond (1930—)  70.r 105.r 
114.C, r 

Müntz, C. H. 336A 

Munzner, Hans-Friedrich 365.1 

Murakami, Shingo (1927-) 32. 122.F 384.r App. A, 
Table 5.1 

Muramatu, Toshinobu (1933-) 168.B 224.E 
251.0 

Murasugi, Kunio(1929-) 235A, E, r 

Murata, Hiroshi (1945-) 75.r 

Murray, Francis Joseph (1911-) 136.F 308.F 

Murre, Jacob P. (1929-) 16.W 

Murthy, M. Pavaman 237.r 

Muskhelishvili, Nikolai Ivanovich (1891-1976) 
217.7 222.r 253.r 

Muto, Yosio (1912-) 364.F,r 

Mutou, Hideo (1953-) 391.E 

Mycielski, Jan (1932-) 22.H 33.F,r 

Myers, Sumner Byron (1910-55) 152.C 178.B 

Myrberg, Pekka Juhana (1892-1976) 367.E 


N 


Nachbin, Leopoldo (1922-) 21.1 37.M 425.BB 

Nagaev, Sergei Viktrovich (1932-) 250.r 

Nagamati, Sigeaki (1945) 125.BB 

Nagami, Keió (1925~) 117.A, C, r273 K, r 425.Y, 
AA, CC, r 

Nagano, Tadash1(1930-) 191.r 275.F 279.C 344.C 
364.F 365.F, K 

Naganuma, Hidehisa (1941-) 450.L 

Nagao, Hirosi (1925-) 151.H 200.L 362.I 

Nagasawa, Masao (1933-) Adr 

Nagase, Michihiro (1944-) 251.0 

Nagata, Jun-iti(1925-) 117.C 273.K, r 425.r 

Nagata, Masayoshi (1927-) 8 12.B 13.1 15.r 16.D, 
T, V, AA, r 67.1, r 196 226.G, r 277.r 284.E, G 
369.r 370.r 

Nagell, Trygve (1895-) 118.D 

Nagumo, Mitio (1905-) 162 286.7, r 316. E, r 
323.D 

Naim, Linda 120.E 207.C 

Naimark (Neumark), Mark Aronovich (1909—78) 
36.G, r 107.r 112.r 192.r 252.7 258.r 308.D 315.r 
437.W, EE 

Naito, Hiroo (1950-) 365.F, N 

Nakada, Hitoshi (1951—) 136.C 

Nakagami, Yoshiomi (1940-) 308.r 

Nakagawa, Hisao (1933-) 365.L 

Nakai, Mitsuru (1933-) Ier 207.C, D 

Nakai, Yoshikazu (1920-) 15.C, F 16.E,r 

Nakajima, Kazufumi (1948-) 384.r 


1901 


Nakamura, Iku (1947-) 72.K 

Nakamura, Kenjiro (1947-79)  310.r 

Nakamura, Michiko (1937-)  424.X 

Nakamura, Tokushi (1930-) 70.F,r 

Nakane, Genkei (1662-1733) 230 

Nakanishi, Noboru (1932-) 146.A—C 

Nakanishi, Shizu (1924-) 100.A,r 

Nakano, Hidegoró (1909-74) 162 310.A 436r 

Nakano, Shigeo (1923-) 21.L 72.H 147.0 232.r 

Nakano, Tadao (1926-) 132.A 

Nakao, Shintaro (1946-) 340.r 

Nakaoka, Minoru (1925-) 70.F, r 153.B 202.P 
305.A 

Nakayama, Mikio (1947-) 173.E 

Nakayama, Tadasi (1912-64) 6.E 8 29.H, I 59.H 
67.D 172.A 200.K - N 243.G 

Nakazi, Takahiko (1944-) 164.G 

Namba, Kanji (1939-) 33r 

Namba, Makoto (1943-) 9.E 72.r 

Nambu, Yoichiro (1921-) 132.C 

Namikawa, Yukihiko (1945-) 16.7 

Namioka, Isaac (1928-) 310.r 424.r 

Napier, John (1550-1617)  131.D 265 432.C 
App A, Tables 2.11, III 

Narasimhan, Mudumbai S. (1932-) 112.D 

Narasimhan, Raghavan (1937-) 23.r 367.G 

Naruki, Isao (1944-) 21.P, Q 344.D 

Nash, John Forbes, Jr. (1928-) 173.A, C, r 204.F 
286.3 323.L 327.G, r 365.B 

Navier, Louis Marie Henri (1785-1836)  204.B, 
C, F 205.C 

Nayfeh, Ali Hasan (1933-) 25.r 290.r 

Necás, Jindřich (1929—) 304.r 

Nedoma, Jifi 213.F 

Ne'eman, Yuval(1925-) 132.D,r 

Nehari, Zeev (1915—78) 77.r 367.G 438.B 

Nelson, Joseph Edward (1932-) 115.D 150.F 176.F 
293.E, r 341.r 437.8 

Nemytskit, Viktor Vladimirovich (1900-) 126.E,r 
394r 

Nernst, Hermann Walter (1864—1941) 419.A 

Néron, André (1922-) 3. M, N,r15.D 16.P 

Nersesyan, A A. 164J 

Nesbitt, Cecil James (1912-) 29.r 362.r 368.r 

Netto, Eugen (1846-1919) 177.r 330.r 

Neubüser, Joachim E. F. G. (1932-) 92.F 

Neugebauer, Otto (Eduard) (1899—-) 24r 

Neuhoff, David L. 213.E, F 

Neukirch, Jürgen (1937-) 450.r 

Neumann, Bernhard Hermann (1909-) 161.C 
190.M 

Neumann, Carl (Karl) Gottfried (1832—1925) 
39.B 120.A 188.H 193.F 217.D 323.F App. A, 
Tables f9.IIT, IV 

Neustadt, L. W. 292.r 

Neuwirth, Lee P. (1933-) 235r 

Nevanlinna, Frithiof (1894-1977) 272.K 

Nevanlinna, Rolf Herman (1895-1980) 21.N 43.r 
109 124.B 164.G 198.r 272.B, D, E, K, r 367.E, 
I, r 429.B 438.B 

Neveu, Jacques (1932-) 136.C 

Neville, Charles William (1941—) 164.K 

Newcomb, Robert Wayne (1933-) 282r 

Newcomb, Simon (1835-1909)  392.r 

Newell, Allen | 385.r 

Newhauser, George L. 215.r 

Newhouse, Sheldon E. (1942-) 126.J, L, M 

Newlander, August, Jr. 72.r 

Newman, Charles Michael (1946-) 212.r 

Newman, Donald J. 328 


Name Index 
Nourein, Abdel Wahab M. 


Newman, Maxwell Herman Alexander (1897-1984) 
65.C, F 93.r 333.4 

Newton, Sir Isaac (1642-1727) 20 48.B, F, H 
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absolutely stable 
(linear k-step method) 303.G 
(system of differential equations) 291.E 
absolutely summing (operator) 68.N 
absolutely uniserial algebra 29.1 
absolute minimality 16.1 
absolute moment (kth) 341.B 
absolute multiple covariant 226.E 
absolute neighborhood retract 202.D 
fundamental (FANR) 382.C 
absolute norm (of an integral ideal) 14.C 
absolute parallelism — 191.B 
absolute retract 202.D 
fundamental (FAR) 382.C 
absolute stability 303.G 
interval of 303.G 
region of 303.G 
region of, of the Runge-Kutta (P, p) method 
303.G 
absolute temperature 419.A 
absolute value 
(of a complex number) 74.B 
(of an element of an ordered field) 149.N 
(of an element of a vector lattice) 310.B 
(ofa real number) 355.A 
(ofa vector) 442.B 
absorb (a subset, topological linear space) 424.E 
absorbing barrier 115.B 
absorbing set (in a topological linear space) 424.E 
absorption cross section 375.A 
absorption law 
in the algebra of sets 381.B 
ina lattice 243.A 
absorption principle, limiting 375.C 
abstract algebraic variety 16.C 
abstract L space 310.G 
abstract L, space 310.G 
abstract M space 310.G 
abstract Riemann surface 367.A 
abstract simplicial complex 70.C 
abstract space 381.B 
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abstract variety 16.C 
abundant number 297.D 
acceleration parameter 302.C 
acceptance 400.A 
acceptance region 400.A 
accepted 31.D 
accessible (from a region) 93.K 
accessible boundary point 333.B 
accretive operator (in a Hilbert space) 251.J 286.C 
accumulated error 138.C 
accumulation point 87.C 425.0 
complete 425.0 
acoustic problem 325.L 
act 
on a commutative ring 226.A 
freely (on a topological space) 122.A 
action 431A 
R- 126B 
rational 226.B 
reductive 226.B 
Z- 126B 
action and reaction, law of 271.A 
action integral 80.Q 
action space 398.A 
activity 281.D 
activity analysis 376 
actuarial mathematics 214.A 
acute angle 139.D 
acute type 304.C 
strongly 304.C 
acyclic complex 200.C 200.H 
Adams-Bashforth method 303.E 
Adams conjecture 237.1 
Adams-Moulton method 303.E 
Adams operation 237.E 
adapted (stochastic process) 407.B 
adaptive scheme 299.C 
addition 
(in a commutative group) 190.A 
(of natural numbers) 294.B 
(inaring) 368.A 
(of unfoldings) 51.D 
addition formula 
algebraic 3.M 
fore? 131.G 
for sine and cosine 432.A 
of trigonometric functions App. A, Table 2.1 
addition theorem 
of Bessel functions 39.B 
of cylindrical functions App. A, Table 19.III 
of Legendre functions 393.C 
of the (o-function App. A, Table 16.IV 
ofsn,cn,dn App. A, Table 16.III 
of the C-function App. A, Table L6.IV 
additive 
completely — completely additive 
countably — countably additive 
finitely — finitely additive 
o- —~o-additive 
totally — totally additive 
additive category 52.N 
additive class 
completely 270.B 
countably 270.B 
finitely 270.B 
additive functional 
(of a Markov process) 261.E 
martingale 261.E 
natural 261.E 
perfect 261.E 
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c- 270.B 

a, tail 342.G 

c-, topological 270.C 

a-, well-measurable 407.B 
simple 29.A 

solvable 231.A 

Staudt 343.C 

Steenrod 64.B 
supplemented 200.L 
symmetric 29.H 

tensor (on a linear space) 256.K 
Thom 114.H 

total matrix 269.B 

uniform 164.A 

uniserial 29.1 

uniserial, absolutely 29.1 
uniserial, generalized — 29.I 
unitary 29.A 

universal enveloping (of a Lie algebra) 248.3 
vector App. A, Table 3.I 
von Neumann  308.C 

von Neumann, induced 308.C 
von Neumann, reduced |. 308.C 


W*- 308.C 
weak* Dirichlet 164.G 
zero 29.A 


algebra class (of central simple algebras) 29.E 
algebra class group | 29.E 
algebra extension 29.D 200.L 
algebra homomorphism 29.A 
algebraic addition formula 3.M 
algebraic algebra 29.J 
algebraic analysis 125.A 
algebraically closed (in a field) 149.1 
algebraically closed field 149.] 
quasi- 118.F 
algebraically dependent (on a family of elements 
ofa field) 149.K 
algebraically dependent elements (of a ring) 369.A 
algebraically equivalent cycles 16.R 
algebraically equivalent to 0 (a divisor on an alge- 
braic variety) 16.P 
algebraically independent (over a field) 149.K 
algebraically independent elements (of a ring) 
369.A 
algebraically simple eigenvalue 390.E 
algebraic branch point (of a Riemann surface) 
367.B 
algebraic closure 
ofafield 149.I 
separable 257.E 
algebraic correspondence 9.H 16.1] 
group of classes of 9.H 
algebraic curves 11.A 
irreducible 11.B 
plane 9.B 
algebraic cycles 450.Q 
algebraic differential equation 113 288.A 
algebraic dimension (of a compact complex mani- 
fold) 74.F 
algebraic element (of a field) 149.E 
algebraic equations 10, App. A, Table 1 
in m unknowns 10.A 
algebraic extension 149.E 
algebraic family (of cycles on an algebraic variety) 
16.R 
algebraic fiber space 72.1 
algebraic function 11.A 
algebraic function field 
over k of dimension! 9.D 





Subject Index 
Algorithm 


over k of transcendence degree 1 9.D 
in n variable 149.K 
€-function of 450.P 
algebraic fundamental group 16.U 
algebraic geometry 12.A 
algebraic groups 13 
absolutely simple 13.U 
affine 13.A 
almost simple 13.U 
connected 13A _ 
isogenous 13.A 
k-almost simple 15.0 
k-anisotropic 13.G 
k-compact 13.G 
k-isotropic 13.G 
k-quasisplit 13.0 
k-simple 13.0 
k-solvable 13.F 
k-split 13.N 
linear 13.A 
nilpotent 13.F 
reductive 13.1 
semisimple 13.1 
simple 13.L 
solvable 13.F 
unipotent 13.E 
algebraic group variety 13.B 
algebraic homotopy group 16.U 
algebraic integer 14.A 
algebraic K-theory 237J 
higher 237.J 
algebraic Lie algebra 13.C 
algebraic linear functional 424.B 
algebraic multiplicity (of an eigenvalue) 309.B 
algebraic number 14.A 
algebraic number fields 14 
relative 14 
algebraic pencil. 15.C 
algebraic point (over a field)  369.C 
algebraic scheme 16.D 
algebraic sheaf, coherent 16.E 72.F 
algebraic singularity (of an analytic function) 
198.M 
algebraic solution (of an algebraic equation) 10.D 
algebraic space 16.W 
algebraic subgroup 13.A 
algebraic surfaces 15 
algebraic system 
ofrequations 10 
in the wider sense 409.B 
algebraic topology 426 
algebraic torus 13.D 
algebraic varieties 16 16.C 
abstract 16.C 
affine 16.A 
complex 16.T 
normal 16.F 
product 16.A 
projective 16.A 
quasi-affine 16.C 
quasiprojective 16.C 
algebra isomorphism 29.A 
algebroidal function(s) 17 
entire 17.B 
k-valued 17.A 
algorithm 97 356.C 
composite simplex 255.F 
division 297.4 
division (of polynomials) 337.C 
dual simplex |. 255.F 


Subject Index 
Alignment chart 


Euclidean 297.A 
Euclidean (of polynomials) 337.D 
fractional cutting plane 215.B 
greedy 66.G 
heuristic 215.E 
partitioning 215.E 
primal-dual 255.F 
variable-step variable-order (VSVO) 303.E 
VSVO 303.E 
aliases 102.I 
alignment chart 19.D 
allied series (of a trigonometric series) 159.A 
all-integer 215.B 
all-integer algorithm 215.B 
all-integer programming problem 215.A 
allocation, optimum . 373.E 
allocation process, multistage 127.A 
allowed homomorphism (between A-modules) 
277.E 
allowed submodule 277.C 
almost all 342.B 
almost all points of a variety 16.A 
almost certainly converge 342.D 
almost certainly occur 342.B 
almost complex manifold 72.B 
stably 114.H 
weakly 114.H 
almost complex structure 72.B 
tensor field of (induced by a complex structure) 
72.B 
almost conformal 275.C 
almost contact manifold 110.E 
almost contact metric structure 110.E 
almost contact structure 110.E 
almost effective action (on a set) 415.B 
almost effectively (act on a G-space) 431.A 
almost everywhere converge 342.D 
almost everywhere hold (in a measure space) 270.D 
almost finite memory channels 213.F 
almost G-invariant statistic 396.1 
almost invariant test 400.E 
almost parallelizable manifold 114.1 
almost periodic (motion) 126.D 
almost periodic differential equation 290.A 
almost periodic functions 18 
analytic 18.D 
onagroup 18.F 
with respect top 18.C 
in the sense of Bohr 18.B 
uniformly 18.B 
almost periodic group, maximally 18.1 
almost periodic group, minimally 18.1 
almost simple algebraic group 13.L 
k- 13.0 
almost subharmonic 193.T 
almost surely converge 342.D 
almost surely occur 342.B 
almost symplectic structure 191.B 
alphabet 31.B 63.A 213.B 
alternate angles 139.D 
alternating contravariant tensor 256.N 
alternating covariant tensor 256.N 
alternating direction implicit (ADI) method 304.F 
alternating function 337.1 
alternating group of degreen 151.G 
alternating knot 235.A 
alternating law (in a Lie algebra) 248.A 
alternating matrix 269.B 
alternating multilinear form 256.H 
alternating multilinear mapping 256.H 
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alternating polynomial 337.I 
simplest 337.1 
alternating series 379.C 
alternating tensor field 105.0 
alternative (in game theory) 173.B 
alternative algebra 231.A 
alternative field 231.A 
alternative hypothesis 400.A 
alternative theorem, Fredholm 68.E 217.F 
alternizer 256.N 
altitude (of a commutative ring) 67.E 
altitude theorem, Krull 284.A 
amalgamated product (of a family of groups) 
190.M 
amalgamated sum 52.G 
ambient isotropic 65.D 
ambient isotropy 65.D 
ambig class (of a quadratic field) 347.F 
ambig ideal (of a quadratic field) 347.F 
ambiguous point 62.D 
amicable number 297.D 
Amitsur cohomology group 200.P 
Amitsur complex 200.P 
amount insured 214.A 
amount of inspection, expected | 404.C 
amount of insurance 214.A 
Ampère equations, Monge- 278, App. A, Table 
15.III 
Ampère transformation 82.A 
amphicheiral knot 235.A 
ample divisor 16.N 


very 16.N 
ample linear system 16.N 
very 16.N 


ample over S (of a sheaf on a scheme) 16.E 
relatively 16.E 
very 16.E 
ample vector bundle 16.Y 
amplification matrix |. 304.F 
amplification operator of a scheme 304.F 
amplitude 
(of a complex number) 74.C 
(function) App. A, Table 16.III 
(of an oscillation) 318.A 
(of time series data) 397.N 
(ofa wave) 446 
Feynman 146.B 
partial wave scattering 375.E 
probability 351.D 
scattering 375.C,E 386.B 
amplitude function (of a Fourier integral operator) 
274.C 345.B 
AMU estimator, kth-order 399.0 
analog, difference 304.E 
analog computation 19 
analog computers 19.E 
electronic 19.E 
analog of de Rham's theorem 21.L 
analog quantity 138.B 
analog simulation 385.A 
analysis 20 
activity 376 
algebraic 125.A 
backward 138.C 
backward error 302.B 
combinatorial 66.A 
consistency of 156.E 
convex 88 
design-of-experiment 403.D 
dimensional 116 
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Diophantine 296.A 
experimental 385.A 
factor 280.G 
forward 138.C 
function 20 
functional 162 
functional, nonlinear 286 
harmonic (on locally compact Abelian groups) 
192.G 
intrablock 102.D 
microlocal 274.A 345.A 
multivariate 280 
ofa network 282.C 
numerical 300 
principal component 280.F 
regression 403.D 
spectral 390.A 
analysis of variance 400.H 403.D 
multivariate 280.B 
table 400.H 
analytic 
(function) 21.B,C 198.A,H 
(predicate) 356.H 
complex, structure 72.A 
micro- (hyperfunction) 125.CC 
pseudo- (function) 352.B 
quasi- (function) 352.B 
quasi- (in the generalized sense) 58.F 
real 106.K 198.H 
analytical dynamics 271.F 
analytically continuable 198.1 
analytically hypoelliptic 112.D 323.1 
analytically independent elements 370.A 
analytically normal local ring 284.D 
analytically thin set 23.D 
analytically uniform spaces  125.S 
analytically unramified semilocal ring 284.D 
analytic almost periodic function 18.D 
analytic automorphism 21.J 
analytic capacity 169.F 
analytic continuation 198.G 
along a curve 198. 
direct 198.G 
uniqueness theorem of 198.I 
in the wider sense 198.0 
analytic covering space 23.E 
analytic curve 
in an analytic manifold 93.B 
in a Euclidean plane 93.B 
analytic differential (on a Riemann surface) 367.H 
analytic fiber bundle 
complex 147.0 
real 147.0 
analytic functions 198.A,H 
complex 198.H 
inverse 198.L 
many-valued 198 
multiple-valued 198.J 
n-valued 198 
real 106.K 198.H 
in the sense of Weierstrass 198.1 
of several complex variables 21.B,C 
in the wider sense 198.0 
analytic geometry 181 
analytic hierarchy 356.H 
analytic homomorphism (between Lie groups) 
249.N 
analytic index 
of a elliptic complex 237.H 
of an elliptic differential operator 237.H 
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Angle 


analytic isomorphism | 21J 
between Lie groups 249.N 
analyticity, set of 192.N 
analytic manifold 
complex 72.A 
real 105.C 
analytic mapping 21.J 
analytic measurable space 270.C 
analytic neighborhood 
of a function element in the wider sense 
198.0 
of a Riemann surface 367.A 
analytic number theory 296.B 
analytic operation 22.B 
analytic operation function 37.K 
analytic perturbation 331.D 
analytic polyhedron 21.G 
analytic prolongation 198.G 
analytic relations, invariance theorem of 198.K 
analytic representation (of GL(V)) 60.D 
analytic set 
(in set theory) 22 
(in the theory of analytic spaces) 23.B 
co- 22.A 
complementary (in set theory) 22.A 
germof 23.B 
irreducible (at a point) 23.B 
principal 23.B 
purely d-dimensional 23.B 
purely d-dimensional (at a point) 23.B 
analytic sheaf 72.E 
coherent 72.E 
analytic spaces 23 
Banach 23.G 
C- 23.E 
general 23.G 
K-complete 23.F 
normal 23.D 
in the sense of Behnke-Stein 23.E 
analytic structure 
complex 72.A 
real 105.D 
on a Riemann surface 367.A 
analytic submanifold, complex 72.A 
analytic subset (of a complex manifold) 72.E 
analytic subspace 23.C,G 
analytic torsion 391.M 
analytic vector (with respect to a unitary representa- 
tion of a Lie group) 437.8 
analytic wave front set 274.D 
analyzer 
differential 19.E 
harmonic 19.E 
anchor ring 410.B 
ancient mathematics 24 
ancillary statistic 396.H 401.C 
Anger function 39.G, App. A, Table 19.IV 
angle 139.D 155.B 
(of a geodesic triangle) 178.A,H 
(of hyperspheres) 76.A 
(of a spherical triangle) 432.B 
acute 139.D 
alternate 139.D 
corresponding 139.D 
eccentric (of a point on a hyperbola) 78.E 
eccentric (of a point on an ellipse) 78.D 
Euler 90.C 
general 139.D 
non-Euclidean (in a Klein model) 285.C 
obtuse 139.D 


Subject Index 
Angular derivative (of a holomorphic function) 


regular polyhedral 357.B 

rght 139.D 

straight 139.D 

straightening of 114.F 

supplementary 139.D 

trisection of 179.A 

vertical 139.D 
angular derivative (of a holomorphic function) 

43.K 

angular domain 333.A 
angular frequency (of a sine wave) 446 
angular momentum 258.D 271.E 

integrals of 420.A 

intrinsic 415.G 

orbita] 315.E 

theorem of 271.E 
angular momentum density 150.B 
angular transformation 374.D 
anharmonic ratio 343.D 
anisotropic 

(quadratic form) 13.G 

k- (algebraic group) 13.G 
annihilation operator 377.A 
annihilator 422.D 

left 29.H 

reciprocity of (in topological Abelian groups) 

422.E 

rght 29.H 
annual aberration 392 
annual parallax 392 
annuity contract 214.B 
annular domain 333.A 
annulator 422.D 
annulus conjecture 65.C 
anomaly 

eccentric 309.B 

mean 309.B 

true 309.B 
Anosov diffeomorphism 126.J 136.G 
Anosov flow 126. 136.G 
Anosov foliations 126.J 
Anosov vector field 126 
ANR (absolute neighborhood retract) 202.D,E 
antiautomorphism 

(ofa group) 190.D 

(ofa ring) 368.D 

principal (of a Clifford algebra) 61.B 
antiendomorphism 

(ofa group) 190.D 

(ofa ring) 368.D 
antiequivalence (between categories) 52.H 
anti-Hermitian form 256.Q 
anti-Hermitian matrix 269.I 
antiholomorphic 195.B 275.B 
antihomomorphism 

of groups 190.D 

of lattices 243.C 

of rings 368.D 
anti-isomorphic lattices 243.C 
anti-isomorphism 

of groups 190.D 

of lattices 243.C 

of ordered sets 311.E 

ofrings 368.D 
antinomy | 349.A 
antiparticle 132.A 386.B 
antipodal points (on a sphere) 140 
antipode 203.H 
anti-self-dual (G-connection) 80.Q 
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antisymmetric 
(Fock space) 377.A 
(multilinear form) 256.H 
(multilinear mapping) 256.H 
(relation) 358.A 
(tensor) 256.N 
law 311.A 
matrix 269.B 
antisymmetry, set of 164.E 
Antoine’s necklace 65.G 
apartment 13.R 
aperiodic 136.E 260.B 
Apollonius problem (in geometric construction) 
179.A 
a posteriori distribution 388.B 
a posteriori probability 342.F 
apparent force 271.D 
apparent singular point 254.L 
Appell hypergeometric functions of two variables 
206.D, App. A, Table 18.1 
application 31.B 
approach 
Bayesian 401.B 
group-theoretic 215.C 
non-Bayesian 401.B 
S-matrix 132.C 
state-space 86.A 
approximate derivative (of a measurable function) 
100.B 
approximate functional equation (for zeta function) 
450.B 
approximately derivable (measurable function) 
100.B 
approximately finite (von Neumann algebra) 308.1 
approximately finite algebra 36.H 
approximately finite-dimensional 308.1 
approximation(s) 
best (of a continuous function) 336.B 
best (in evaluation of functions) 142.B 
best (of an irrational number) 83.B 
best polynomial, in the sense of Chebyshev 
336.H 
Diophantne 182.F 
full discrete 304.B 
least square 336.D 
of linear type 142.B 
method of successive (for an elliptic partial dif- 
ferential equation) 323.D 
method of successive (for Fredholm integral 
equations of the second kind) 217.D 
method of successive (for ordinary differential 
equations) 316.D 
nth (of a differentiable function) 106.E 
Oseen 205.C 
overall, formula 303.C 
Padé 142.E 
Pauli 415.G 
Prandtl-Glauert 205.B 
polynomial. 336 
semidiscrcte 304.B 
simplicial (to a continuous mapping) 70.C 
Stokes 205.C 
Wilson-Hilferty 374.F 
Yosida 286.X 
approximation method 
in physics 25 
projective 304.B 
approximation property 
(of a Banach space) 37.L 
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bounded 37.L 
approximation theorem 
(on functions on a compact group) 69.B 
(on valuations) 439.G 
cellular 70.D 
Eichlers 27.D 
Kroneckers 422.K 
polynomial (for C?-functions) 58.E 
simplicial 70.C 
Weierstrass 336.A 
a priori distribution 388.B 
least favorable 388.H 
a priori estimate 323.C 
in L? sense 323.H 
a priori probability 342.F 
AR (absolute retract) 202.D 
Arabic numerals 26 
Arab mathematics 26 
Araki axioms, Haag- 150.E 
Araki-Sewell inequality, Roepstorff- 402.G 
arbitrary constant 313.A 
arbitrary set 381.G 
arc(s) 93.B 186.B 
continuous 93.B 
Farey 4.B 
geodesic 178.H 364.B 
joined byan 79.B 
Jordan 81.D 93.B 
major 4B 
minor 4.B 
open 93.B 
pseudo- 79.B 
simple 93.B 
Arccos 131.E 
arc cos (arc cosine) 131.E 
arc element 
affine 110.C 
conformal 110.D 
Archimedean lattice-ordered group 243.G 
Archimedean ordered field 149.N 
Archimedean unit (of a vector lattice) 310.B 
Archimedean valuation 14.F 439.C 
Archimedean vector lattice 310.C 
Archimedes axiom 
in geometry 155.B 
for real numbers 355.B 
Archimedes spiral 93.H 
arc length 1!11.D 
affine 110.C 
representation in terms of (for a continuous 
arc) 246.A 
Arcsin 131.E 
arcsin (arcsine) 131.E 
arcsine law 
for Brownian motion 45.E 
for distribution function 250.D 
for random walk 260.E 
arcsine transformation 374.D 
Arctan 131.E 
arctan (arctangent) 131.E 
arcwise connected component 79.B 
arcwise connected space, locally 79.B 
area 246 
(Euclidean) 139.F 
(ofa polygon) 155.F 
(ofa set in R?) 216.F 
Banach (of a surface) 246.G 
of concentration 397.E 
definite, set of 216.F 
Geócze (of a surface) 246.E 


Subject Index 
Ascending chain 


Gross (of a Borel set) 246.G 
inner 216.F 270.G 
Janzen (of a Borel set) 246.G 
Lebesgue (of a surface) 246.C 
mixed (of two ovals) 89.D 
outer 216.F 270.G 
Peano (of a surface) 246.F 
surface, of unit hypersphere App. A, Table 9.V 
areal element (in a Cartan space) 152.C 
areal functional 334.B 
areally mean p-valent 438.E 
area theorem 438.B 
Bers 234D 
Arens-Royden theorem 36.M 
Arens theorem, Mackey- 424.N 
Arf-Kervaire invariant 114J 
Argand plane, Gauss- 74.C 
argument 
(of a complex number) 74.C 
behind-the-moon 351.K 
argument function 46.A 
argument principle 198.F 
arithmetical (predicate) 356.H 
arithmetical hierarchy 356.H 
of degrees of recursive unsolvability 356.H 
arithmetically equivalent 
(lattices) 92.B 
(pairs) 92.B 
(structures) 276.D 
arithmetic crystal classes 92.B 
arithmetic function 295.A 
arithmetic genus 
(of an algebraic curve) 9.F 
(of an algebraic surface) 15.C 
(of a complete variety) 16.E 
(ofa divisor) 15.C 
virtual (ofa divisor) 16.E 
arithmetic mean 211.C 397.C 
arithmetic of associative algebras 27 
arithmetico-geometric mean 134.B 
arithmetic operations 294.A 
arithmetic progression 379.1, App. A, Table 10.1 
prime number theorem for 123.D 
arithmetic subgroup 13.P 122.F,G 
arithmetic unit 75.B 
arithmetization (of metamathematics) 185.C 
array 96.C 
balanced 102.L 
k- 330 
orthogonal 102.L 
arrow diagram 281.D 
Arrow-Hurwicz-Uzawa gradient method 292.E 
artificial variables 255.C 
Artin, E. 28 
Artin conjecture 450.G 
Artin general law of reciprocity 59.C 
Artin-Hasse function 257.H 
Artinian module 277.1 
Artinian ring 284.A 
left 368.F 
right 368.F 
Artin L-function 450.G,R 
Artin-Rees lemma 284.A 
Artin-Schreier extension (of a field) 172.F 
Artin symbol 14K 
Arzelà theorem, Ascoli- 168.B 435.D 
ascending central series (of a Lie algebra) 248.C 
ascending chain 
in an ordered set 311.C 
of subgroups of a group 190.F 


Subject Index 
Ascending chain condition 


ascending chain condition 
in an ordered set 311.C 
for subgroups of a group 190.F 
Ascoli-Arzelà theorem 168.B 435.D 
Ascoli theorem | 435.D 
a.s. consistent 399.K 
assembler 75.C 
associate (of an element of a ring) 67.H 
associated convergence radii 21.B 
associated diagrams (in irreducible representations 
of orthogonal groups) 60.J 
associated differential equation, Legendre’s 393.A 
associated factor sets (of crossed products) 29.F 
associated factor sets (for extension of groups) 
190.N 
associated fiber bundle 147.D 
associated flow 136.F 
associated form (of a projective variety) 16.S 
associated graded ring 284.D 
associated integral equation (of a homogeneous) 
integral equation) 217.F 
associated Laguerre polynomials 317.D 
associated Legendre functions 393.C, App. A, 
Table 18.III 
associated prime ideal 67.F 
associated principal bundle 147.D 
association, measure of 397.K 
association algebra 102.J 
association matrix 102.J 
associative, homotopy 203.D 
associative algebra(s) 102.J 231.A 
power 231.A 
associative law 
for the addition and multiplication of natural 
numbers 294.B 
in the algebra of sets 381.B 
for cardinal numbers 49.C 
for the composite of correspondences 358.B 
general (for group composition) 190.C 
for group composition 190.A 
ina lattice 243.A 
inaring 368.A 
associative multiplication of a graded algebra 
203.B 
assumed rate of interest 214.A 
assumption 
inverse 304. D 
Stokes 205.C 
asteroid 93.H l 
astronomy, spherical 392 
asymmetric (factorial experiment) 102.H 
asymmetric Cauchy process 5.F 
asymptote (of an infinite branch) 93.G 
asymptotically developable (function) 30.A 
asymptotically distributed 374.D 
asymptotically efficient estimator 399.N 
first-order 399.0 
kth-order 399.0 
asymptotically mean unbiased 399.K 
asymptotically median unbiased estimator (AMU) 
kth-order 399.0 
asymptotically normal estimator 
best (BAN) 399.K 
consistent and (CAN) 399.K 
asymptotically normally distributed 399.K 
asymptotically optima! 354.D 
asymptotically stable 126.F 286.8 394.B 
globally 126.F 
uniformly 163.G 
asymptotically unbiased 399.K 
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asymptotic bias 399.K 
asymptotic completeness 150.D 
asymptotic concentration 399.N 
asymptotic condition, LSZ 150.D 
asymptotic cone 350.B 
asymptotic convergence 168.B 
asymptotic covariance matrix 399.K 
asymptotic curve 110.B 111.H 
asymptotic direction 111.H 
asymptotic distribution, kth-order 399.0 
asymptotic efficiency 399.N 
second-order 399.0 
higher-order 399.0 
asymptotic expansion 30.A, App. A, Table 17.1 
(of a pseudodifferential operator) 345.A 
method of matched 112.B 
Mir ikshisundaram-Pleijel 391.B 
asymptotic fields 150.D 
asymptotic freedom 361.B 
asymptotic method 290.D 
asymptotic normality 399.K 
asymptotic path (for a meromorphic function) 
272.H 
asymptotic perturbation theory 331.D 
asymptotic power series 30.A 
asymptotic property (of solutions of a system of 
linear ordinary differential equations) 314.A 
asymptotic ratio set 308.1 
asymptotic ray 178.F 
asymptotic representation 
Debye 39.D, App. A, Table 19.II{ 
Hankel App. A, Table 19.III 
asymptotic sequence 30.A 
asymptotic series 30.A 
asymptotic set 62.A 
asymptotic solution 325.M 
asymptotic tangent 110.B 
asymptotic value of a meromorphic function 62.A 
272.H 
asymptotic valaue theorem, Lindelöf 43.C 
asynchronous system (of circuits) 75.B 
Atiyah-Bott fixed point theorem  153.C 
Atiyah-Singer fixed point theorem 153.C 
Atiyah-Singer index theorem 237.H 
equivariant 237.H 
atlas 105.C 
of class C" 105.D 
of class C" 105.D 
oriented 105.F 
atled (nabla) 442.D 
atmospheric refraction 392 
at most (for cardinal numbers) 49.B 


atomic 
(measurable set) 270.D 
at 0 163.H 
atomic element (in a complemented modular lat- 
tice) 243.F 


atomic formula 276.A 411.D 
atomless 398.C 
at random 401.F 
attaching a handle 114.F 
attaching space 202.E 
attraction, domain of 374.G 
attractor 126.F 

strange 126.N 
attribute, sampling inspection by 404.C 
augmentation 

(of an algebra) 200.M 

(of a chain complex) 200.C 

(ofa coalgebra) 203.F 
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(of a cochain complex) 200.F 
(of a complex in an Abelian category) 
200.H 
augmented algebra 200.M 
augmented chain complex 200.C 
autocorrelation 421.B 
autocorrelation coefficient | 397.N 
autocovariance, sample 421.B 
automata 31 
automatic integration scheme 299.C 
automaton 31.A 
deterministic linear bounded 31.D 
finite 31D 
nondeterministic linear bounded 31.D 
push-down 31.D 
automorphic form 450.0 
of dimension ~k 32.B 
oftype U 437.DD 
of weight m 32A 
of weight k 32.B 
automorphic function(s) 32 
multiplicative 32.A 
with respect to T 32.A 
automorphism 
(of an algebraic system) 409.C 
(ofa field) 149.B 
(ofa group) 190.D 
(ofan object ina category) 52.D 
(of a polarized Abelian variety) 3.6 
(of a probability space) 136.E 
(ofaring) 368.D 
analytic 21.J 
anti- (ofa group) 190.D 
anti- (ofa ring) 368.D 
differential 113 
Frobenius (of a prime ideal) 14.K 
group of(ofa group) 190.D 
holomorphic 21.J 
inner (ofa group) 190.D 
inner (ofa ring) 368.D 
inner, group of (ofa group) 190.D 
inner, group of (of a Lie algebra} 248.H 
involutive (of a Lie group) 412.B 
k-fold mixing 136.E 
Kolmogorov 136.E 
metrically isomorphic (on a measure space) 
136. E 
modular 308.H 
outer, group of (of a group) 190.D 
outer, group of (of a Lie algebra) 248.H 
principal (of a Clifford algebra) 61.B 
shift 126J 
spatially isomorphic (on a measure space) 
136.E 
spectrally isomorphic (on a measure space) 
136.E 
strongly mixing 136.E 
weakly isomorphic 136.E 
weakly mixing 136.E 
automorphism group (of a Lie algebra} 248.A 
automorphy, factor of 32.4 
autonomous 163.D 290.A 
autoregressive Gaussian source 213.E 
autoregressive integrated moving average process 
421.G 
autoregressive moving average process 421.D 
autoregressive process 421.D 
auxiliary circle 78.D 
auxiliary equation, Charpit 320.D 
auxiliary units 414.A 


Subject Index 
Axiom of the power set 


auxiliary variable 373.C 
average 211.C 
moving 397.N 
moving, process 421.D 
phase 402.C 
weighted moving 397.N 
average complexity 71.A 
average outgoing quality leve] 404.C 
average sample number 404.C 
averaging, method of 290.D 
AW*-algebra 36.H 
axial-vector currents, partially conserved 132.C 
axial visibility manifold 178.F 
axiom(s) 35.A A11.I 
Archimedes (in geometry) 155.B 
Archimedes (for real numbers) 355.B 
congruence (in geometry) 155.B 
Eilenberg-Steenrod 201.Q 
Euclid 139.A 
first countability 425.P 
the first separation 425.Q 
the fourth separation 425.Q 
Fréchet 425.Q 
Haag-Araki 150.E 
Haag-Keslev 150.E 
Hausdorff 425.Q 
Kolmogorov 425.Q 
logical 337.C 411.4 
Martin 33.F 
mathematical 337.C 411.1 
Osterwalder-Schrader 150.F 
Pasch 155.B 
second countability 425.P 
the second separation 425.Q 
system of 35.B 
the third separation 425.Q 
Tietze’s first 425.Q 
Tietze’s second 425.Q 
Tikhonov’s separation 425.Q 
Vietoris 425.Q 
Wightman 150.D 
axiom A diffeomorphism 126 
axiom A flow 1265 
axiomatic quantum field theory 150.D 
axiomatic set theory 36 156.E 
axiom A vector field 126J 
axiomatization 35.A 
axiomatize (by specifying a system of axioms) 
35.B 
axiom of choice 33.B 34.A 
and continuum hypothesis, consistency of 
33.D 
and continuum hypothesis, independence of 
33.D 
axiom of comprehension 33.B 381.G 
axiom of constructibility 33.D 
axiom of determinacy 22.H 
axiom of determinateness 33.F 
axiom of e-induction 33.B 
axiom of extensionality 33.B 
axiom of foundation 33.B 
axiom of free mobility (in Euclidean geometry) 
139.B 
axiom of infinity 33.B 381.G 
axiom of linear completeness (in geometry) 155.B 
axiom of mathematical induction 294.B 
axiom of pairing 381.G 
axiom of parallels (in Euclidean geometry) 139.A 
155.B 
axiom of the power sei 33.B 381.G 


Subject Index 
Axiom of reducibility (in symbolic logic) 


axiom of reducibility (in symboliclogic) 156.B 
411.K 
axiom of regularity 33.B 
axiom of replacement 33.B 381.G 
axiom of separation 33.B 
axiom of strong infinity 33.E 
axiom of subsets 33.B 381.G 
axiom of substitution 381.G 
axiom of the empty set 33.B 
axiom of the sum set 33.B 
axiom of the unordered pair 33.B 
axiom of union 381.G 
axioms of continuity 
Dedekind's 355.A 
axiom system(s) 35 
ofastructure 409.B 
ofatheory 411.1 
axis (axes) 
of a circular cone 78.A 
conjugate (of a hyperbola) 78.C 
of convergence 240.B 
coordinate (of an affine frame) 7.C 
coordinate (of a Euclidean space) 140 
imaginary 74.C 
major (of an ellipse) 78.C 
minor (of an ellipse) 78.C 
optical 180.B 
ofa parabola 78.C 
principal (of a central conic) 78.C 
principal (of a parabola) 78.C 
principal (of a quadric surface) 350.B 
principal, of inertia 271.E 
principal, transformation to 390.B 
real 74.C 
of rotation (of a surface of revolution) 111.I 
transverse (of a hyperbola) 78.C 
X;- (of a Euclidean space) 140 
Ax-Kochen isomorphism theorem  276.E 
azimuth App. A, Table 3.V 
azimuthal quantum number 315.E 
Azumaya algebra 29.K 
Azumaya lemma, Krull- 67.D 


B 
f — beta 
BQ) 


(=9,,(Q) 168.B 
(the space of hyperfunctions) 125.V 
By, (Besov spaces) 168.B 
f-KMS state 402.G 
f-shadowed 126.J 
f-traced 126J 
38-measurable function 270.J 
38-measurable set 270.C 
$8-regular measure 270.F 
*B-summable series 379.0 
[8|-summable series 379.0 
b-function 125.EE 418.H 
BN-pair 13.R 343.1 
(B,N)-pair 151.J 
B, set 22D 
B-complete (locally convex space) 424.X 
BA 102.L 
back substitution 302.B 
backward analysis 138.C 
backward difference 223.C, App. A, Table 21 
backward emission 320.A 
backward equation, Kolmogorov 115.A 260.F 
backward error analysis 302.B 
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backward interpolation formula 
Gauss 223.C 
Newton 223.C 
backward moving average representation 395.D 
canonical 395.D 
backward type 304.D,F 
badly approximable 83.B 
Baer sum (of extensions) 200.K 
Bahadur efficiency 400.K 
Baire condition 425.L 
Baire function 84.D 
Baire-Hausdorff theorem  273.] 425.N 
Baire measurable 270.L 
Baire property 425.L 
Lebesgue measurability and 33.F 
Baire set 126.H 270.C 
Baire space 425.L 
Baire zero-dimensional space 273.B 
Bairstow method  301.E 
balanced array 102.L 
balanced fractional factorial design 102.1 
balanced incomplete block design 102.E 
partially | 102J 
balanced mapping, A- 277J 
balayage 338.L 
balayage principle 338.L 
ball 140 
n- 140 
open 140 
openn- 140 
spin 351.L 
unit 140 
unit (of a Banach space) 37.B 
ball knot, (pat 235.G 
ball pair 235.G 
BAN (best asymptotically normal) 399.K 
Banach-Alaoglu theorem 
(in a Banach space) 37.E 
(in a topological linear space)  424.H 
Banach algebra(s) 36.A 
Banach analytic space 23.G 
Banach area (of a surface) 246.G 
Banach (extension) theorem, Hahn- 
(in a normed space) 37.F 
(in a topological linear space) 424.C 
Banach integral 310.1 
Banach lattice 310.F 
Banach Lie group 286.K 
Banach limit 37.F 
Banach manifold 105.Z 
Banach space(s) 37.A,B 
reflexive 37.G 
regular 37.G 
Banach star algebra 36.F 
Banach-Steinhaus theorem 
(in a Banach space) 37.H 
(in a topological linear space) 424.J 
Banach theorem 37.1 
band, Möbius 410.B 
bang-bang control 405.C 
Barankin theorem ` 399.D 
bar construction (of an Eilenberg-MacLane com- 
plex) 70.F 
bargaining set 173.D 
bargaining solution, Nash 173.C 
Barnes extended hypergeometric function 206.C, 
App. A, Table 18.I 
barrel (in a locally convex space) 424.1 
barreled (locally convex space) 424.I 
quasi- 4241 
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barrier 120.D 

absorbing 115.B 

reflecting 115.B,C 
Bartle-Dunford-Schwartz integral 443.G 
barycenter 

(of points of an affine space) 7.C 

(of a rigid body) 271.E 
barycentric coordinates 

(in an affine space) 7.C 90.B 

(in a Euclidean complex) 70.B 

(in the polyhedron of a simplicial complex) 

70.C 

barycentric derived neighborhood, second 65.C 
barycentric refinement 425.R 
barycentric subdivision 

(of a Euclidean complex) 70.B 

(of a simplicial complex) 70.C 
baryons 132.B 
base 

(in a Banach space) 37.L 

(curve of a roulette) 93.H 

(of a logarithmic function) 131.B 

(ofa point range) 343.B 

(of a polymatroid) 66.F,G 


data 96.B 
filter 87. 
local 425.E 


for the neighborhood system 425.E 
normal 172.E 
open 425.F 
for the space 425.E 
for the topology 425.F 
for the uniformity 436.B 
base functions 304.B 
base point 
ofa linear system 16.N 
ofaloop 170 
of a topological space 202.B 
base space 
ofa fiber bundle 147.B 
ofa fiber space 148.B 
ofa Riemann surface 367.A 
base term (of a spectral sequence) 200.J 
base units 414.A 
Bashforth method, Adams- 303.E 
basic components (of an m-dimensional surface) 
110.A 
basic concept (of a structure) 409.B 
basic equation 320.E 
basic feasible solution 255.A 
basic field (of linear space) 256.A 
basic form | 255.A 
basic interval 4.B 
basic invariant 226.B 
basic limit theorem 260.C 
basic open sei 425.F 
basic optimal solution 255.A 
basic property (of a structure) 409.B 
basic ring (of a module) 277.D 
basic set (for an Axiom A flow) 126J 
basic set (of a structure) 409.B 
basic solution 255.A 
feasible 255.A 
optimal 255.A 
basic space (of a probability space) 342.B 
basic surface (of a covering surface) 367.B 
basic variable 255.A 
basic vector field 80.H 
basic Z,-extension 14.L 


Subject Index 
Bernoulli 


basin 126.F 
basis 
(of an Abelian group) 2.B 
(in a Banach space) 37.L 
(of a homogeneous lattice) 182.B 
(of an ideal) 67.B 
(of a linear space) 256.E 
(ofa module) 277.G 
canonical 201.B 
canonical homology 11.C 
Chevalley canonical 248.Q 
dual 256.G 
minimal 14.B 
normal 172.E 
oforderrin N 4A 
orthonormal 197.C 
Schauder 37.L 
strongly distinguished 418.F 
transcendence 149.K 
Weyl canonical 248.P 
basis theorem 
Hilbert (on Noetherian rings) 284.A 
Ritt (on differential polynomials) 113 
bath, heat 419.B 
Bayes estimator 399.G 
Bayes formula 342.F 405.1 
Bayesian approach 401.B 
Bayesian model 403.G 
Bayes risk | 398.B 
Bayes solution 398.B 
generalized 398.B 
in the wider sense 398.B 
Bayes sufficient o-field 396.J 
BCH (Base-Chaudhuri-Hooquenghem) code 63.D 
BDI, type (symmetric Riemannian spaces) 412.G 
BDII, type (symmetric Riemannian spaces) 412.G 
BDH (Brown-Douglas-Fillmore) theory 36.J 390.J 
behavior, Regge 386.C 
behavior strategy 173.B 
behind-the-moon argument 351.K 
Behnke-Stein, analytic space in the sense of 23.E 
Behnke-Stein theorem 21.H 
Behrens-Fisher problem | 400.G 
Bellman equation 405.B 
Bellman function 127.G 
Bellman principle 405.B 
Bell inequality 351.L 
Bell number 177.D 
belong 
(toaset) 381.A 
to the lower class with respect to local con- 
tinuity 45.F 
to the lower class with respect to uniform con- 
inuity 45.F 
to the upper class with respect to local con- 
tinuity 45.F 
to the upper class with respect to uniform con- 
tinuity 45.F 
Beltrami differential equation 352.B 
Beltrami differential operator 
of the first kind App. A, Table 4.II 
of the second kind App. A, Table 4.1 
Beltrami operator, Laplace- 194.B 
Bergman kernel function 188.G 
Bergman metric 188.G 
Bernays-Gódel set theory 33.A 
Bernoulli 
loosely 136.F 
monotonely very weak 136.F 


Subject Index 
Bernoulli differential equation 


Bernoulli differential equation App. A, Table 14.1 
Bernoulli family 38 
Bernoulli lemniscate 93.H 
Bernoulli method 301.J 
Bernoulli number 177.B 
Bernoulli polynomial 177.B 
Bernoulli process 136.E 
very weak 136.E 
weak 136.E 
Bernoulli sample 396.B 
Bernoulli shift 136.D 
generalized 136.D 
Bernoulli spiral 93.H 
Bernoulli theorem  205.B 
Bernoulli trials, sequence of 396.B 
Bernshtein inequality (for trigonometric polyno- 
mials) 336.C 
Bernshtein polynomial 366.A 418.H 
Sato- 125.EE 
Bernshtein problem 275.F 
generalized 275.F 
Bernshtein theorem 
(on cardinal numbers) 49.B 
(on the Laplace transform) 240.E 
(on minimal surfaces) 275.F 
‘Bers area theorem 234.D 
Bertini theorems 15.C 
Bertrand conjecture 123.A 
Bertrand curve 111.F 
Besov embedding theorem, Sobolev- 168.B 
Besovspace 168.B 
Bessaga-Pelczyhiski theorem 443.D 
Bessel differential equation 39.B, App. A, Table 
14.H 
Bessel formula, Hansen- App. A, Table 19.III 
Bessel function(s) 39, App. A, Table 19.III 
half 39.B 
modified 39.G 
spherical] 39.B 
Bessel inequality 197.C 
Bessel integral 39.B 
Bessel interpolation formula App. A, Table 21 
Bessel series, Fourier- 39.D 
Bessel transform, Fourier- 39.D 
best (statistical decision function) 398.B 
best approximation 
(of a continuous function) 336.B 
(in evaluation of functions) 142.B 
(of an irrational number) 83.B 
in the sense of Chebyshev 336.H 
best asymptotically normal estimator 399.K 
best invariant estimator 399.1 
best linear unbiased estimator (blue) 403.E 
best polynomial approximation (in the sense of 
Chebyshev) 336.H 
beta density 397.D 
beta distribution 341.D, App. A, Table 22 
beta function 174.C, App. A, Table 17.1 
incomplete 174.C, App. A, Table 17.1 
beta-model, Luce 346.G 
better, uniformly (statistical decision function) 
398.B 
Betti group (of a complex) 201.B 
Betti number 
of a commutative Noetherian ring 200.K 
ofacomplex 201.B 
between (two points in an ordered set) 311.B 
between-group variance 397.L 
Beurling generalized distribution 125.U 
Beurling-Kunugui, theorem, Iversen- 62.B 


1930 


Bezout theorem 12.B 
BG (= Bernays-Gódel set theory) 33.A 
Bhattacharyya inequality 399.D 
biadditive mapping 277.J 
bialgebra 203.G 
quotient 203.G 
semigroup 203.G 
universal enveloping 203.G 
bialgebra homomorphism |. 203.G 
Bianchi identities 80.J 417.B, App. A, Table 4.II 
bias 399.C 
biaxial spherical harmonics 393.D 
BIBD (balanced incomplete block design) 102.E 
bicharacteristic curve 325.A 
bicharacteristic strip .320.B 
bicompact 425.8 
bicomplex 200.H 
Bieberbach conjecture 438.C 
Biehler equality, Jacobi- 328 
biequicontinuous convergence, topology of 424.R 
bifurcation, Hopf 126.M 
bifurcation equation 286.V 
bifurcation method 290.D 
bifurcation point 
(in bifurcation theory) 126.M 286.R 
(in nonlinear integral equations) 217.M 
bifurcation set 51.F 418.F 
bifurcation theorem, Hopf 286.U 
bifurcation theory 286.R 
biharmonic (function) 193.0 
biholomorphic mapping 21.J 
biideal 203.G 
bijection 
(in a category) 52.D 
(of sets) 381.C 
bijective mapping 381.C 
bilateral network 382.C 
bilinear form 
(on linear spaces) 256.H 
(on modules) 277J 
(on topological linear spaces) 424.G 
associated with a quadratic form 256.H 
matrix of 256.H 
nondegenerate 256.H 
symmetric (associated with a quadratic form) 
348.A 
bilinear functional 424.G 
integral 424.R 
bilinear Hamiltonian 377.A 
bilinear mapping 
(of a linear space) 256.H 
(ofa module) 277.J 
canonical (on tensor products of linear spaces) 
256.1 
bilinear programming 264.D 
bilinear relations, Hodge-Riemann  16.V 
bimatrix game 173.C 
bimeasurable transformation 136.B 
bimodular germ (of an analytic function) 418.E 
bimodule 277.D 
A-B- 277.D 
binary quadratic form(s) 348.M 
primitive 348.M 
properly equivalent 348.M 
binary relation 358.A 411.G 
binding energy 351.D 
Binet formula 
(on Fibonacci sequence) 295.A 
(on gamma function) 174.A 
binomial coefficient 330, App. A, Table 17.II 
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binomial coefficient series 121.E 
binomial distribution 341.D 397.F, App. A, 
Table 22 
negative 341.D 397.F, App. A, Table 22 
binomial equation 10.C 
binomial probability paper 19.B 
binomial series App. A, Table 10.IV 
binomial theorem 330, App. A, Table 17.II 
binormal 111.F 
affine 110.C 
bioassay 40.C 
biology, mathematical models in 263 
biometrics 40 
bipartite graph 186.C 
complete 186.C 
bipolar (relative to a pairing) 424.H 
bipolar coordinates 90.C 
bipolar cylindrical coordinates App. A, Table 3.V 
bipolar theorem 37.F 424.H 
biprojective space 343.H 
biquadratic equation App. A, Table 1 
birational correspondence  16.I 
birational invariant 12.A 
birational isomorphism 
between Abelian varieties 3.C 
between algebraic groups 13.A 
birational mapping 16.3 
birational transformation 16.1 
Birch-Swinnerton-Dyer conjecture 118.D 450.S 
biregular mapping (between prealgebraic varieties) 
16.C 
Birkhoff integrable (function) 443.E 
Birkhoff integral 443.E 
Birkhoff fixed-point theorem, Poincaré- 153.B 
Birkhoff-Witt theorem, Poincaré- (on Lie algebras) 
248J 
Birnbaum theorem 399.C 
birth and death process 260.G 
birth process 260.G 
birth rate, infinitesimal 260.G 
bispectral density function 421.C 
bispinor of rank (k,n) 258.B 
bit 75.B 213.B 
check 63.C 
information 63.C 
bivariate data 397.H 
bivariate distribution 397.H 
bivariate moments 397.H 
bivariate normal density 397.1 
Blackwell-Rao theorem 399.C 
Blakers-Massey theorem 202.M 
Blaschke manifold 178.G 
ata point p 178.G 
Blaschke product 43.F 
Blaschke sequence 43.F 
Bleuler formalism, Gupta- 105.G 
Bloch constant 77.F 
schlicht 77.F 
Bloch theorem 77.F 
block 
(bundle) 147.Q 
(of irreducible modular representations) 362.1 
(of a permutation group) 151.H 
(of plots) 102.B 
complete 102.B 
incomplete 102.B 
initial 102.E 
block bundle 147.Q 
normal 147.Q 


q- 147.Q 


Subject Index 
Borel set(s) 


block code 63.A 213.F 
Sliding 213.E 
block design 102.B 
balanced incomplete 102.E 
efficiency-balanced 102.E 
optimal 102.E 
randomized 102.B 
variance-balanced 102.E 
block effect 102.B 
block size 102.B 
block structure, q- 147.Q 
blowing up 
(of an analytic space) 23D 
(of a complex manifold) 72.H 
(byanidealsheaf) 16.K 
b.l.u.e (best linear unbiased estimator) 403.E 
Blumenthal zero-onelaw 261.B 
BMO (bounded mean oscillation) 168.B 
Bochner integrable 443.C 
Bochner integral 443.C 
Bochner theorem 36.L 192.B 
body 
bounded star 182.C 
rigid 271.E 
body forces 271.G 
Bogolyubov inequality, Peierls- 212.B 
Bohr, almost periodic function in the sense of 
18.B 
Bohr compactification 18.H 
Bokshtein homomorphism 64.B 
Bokshtein operation 64.B 
Boltzmann constant 402.B 
Boltzmann distribution law, Maxwell- 402.B 
Boltzmann equation 41.A 402.B 
Bolzano-Weierstrass theorem 140 273.F 
bond percolation process 340.D 
Bonnet formula, Gauss- 111.H 364.D, App. A, 
Table 4.1 
Bonnet fundamental theorem 111.H 
Bonnet-Sasaki-Nitsche formula, Gauss- 275.C 
Boolean algebra 243.E 
generalized 42.B 
Boolean lattice 42.A 243.E 
of sets 243.E 
Boolean operations 42.A 
Boolean ring 42.C 
generalized 42.C 
Boolean space 42.D 
Boolean-valued set theory 33.E 
Borcher theorem 150.E 
bord (for a G-manifold) 431.E 
bordant 431.E 
border set 425.N 
Borel-Cantelli lemma 342.B 
Borel direction (of a meromorphic function) 272.F 
Borel embedding, generalized 384.D 
Borel exceptional value 272.E 
Borel exponential method, summable by 379.0 
Borel field 270.B,C 
Borel integral method, summable by 379.0 
Borel isomorphic 270.C 
Borel-Lebesgue theorem 273.H 
mapping 270.C 
Borel measurable function 270.J 
Borel measure 270.G 
Borel method of summation 379.0 
Borel set(s) 
(in a Euclidean space) 270.C 
(in the strict sense) 270.C 
(in a topological space) 270.C 


Subject Index 
Borel space 


nearly 261.D 
Borel space 270.C 
standard 270.C 
Borel subalgebra (of a semisimple Lie algebra) 
248.0 
Borel subgroup 
of an algebraic group 13.G 
k- (of an algebraic group) 13.G 
ofa Lie group 249.J 
Borel subset 270.C 
Borel summable, absolute 379.0 
Borel theorem 
(on classifying spaces) App. A, Table 6.V 
(on meromorphic functions} 272.E 
Heine 273.F 
Borel-Weil theorem | 437.Q 
bornological 
locally convex space 424.1 
ultra- (locally convex space) 424.W 
Borsuk-Ulam theorem  153.B 
Bortolotti covariant derivative, van der Waerden- 
417.E 
Bose particle 132.A 
Bose statistics 377.B 402.E 
boson 132.A 351.H 
Nambu-Goldstone 132.C 
Bott fixed point theorem, Atiyah- 153.C 
Bott generator 237.D 
Bott isomorphism 237.D 
Bott periodicity theorem 
on homotopy groups 202.V, App. A, Table 
6.VII 
in K-theory 237.D 
bound 
Froissart 386.B 
greatest lower (of a subset in an ordered set) 
311.B 
greatest lower (of a subset of a vector lattice) 
310.C 
Hamming 63.B 
least upper (of a subset in an ordered set) 
311.B 
least upper (of a subset of a vector lattice) 
310.C 
lower (of a subset in an ordered set) 311.B 
Plotkin 63.B 
upper (of a subset in an ordered set) 311.B 
Varshamov-Gilbert-Sacks 63.B 
boundary (boundaries) 
(of a convex cell) 7.D 
(cycle) 200.H 
(of a function algebra) 164.C 
(ofa manifold) 65.B 105.B 
(of a topological space) 425.N 
Choquet (for a function algebra) 164.C 
closed (for a function algebra) 164.C 
C'-manifold with 105.E 
C’-manifold without 105.E 
differential manifold with, of class C" 105.E 
domain with regular 105.U 
domain with smooth 105.U 
dual Martin 260.I 
entrace (of a diffusion process) 115.B 
exit (of a diffusion process) 115.B 
harmonic 207.B 
ideal 207.A 
Martin 207.C 260.1 
module of  200.C 
natural (of an analytic function) 198.N 
natural (of a diffusion process) 115.B 
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Newton, of f in the coordinate 418.D 
nondegenerate Newton 418.D 
of null (open Riemann surface) 367.E 
pasting together 114.F 
of positive (open Riemann surface) 367.E 
regular (of a diffusion process) 115.B 
relative 367.B 
Shilov (for a function algebra) 21.D 164.C 
Shilov (of a Siegel domain) 384.D 
surface with 410.B 
topological manifold with 105.B 
topological manifold without 105.B 
boundary cluster set 62.A 
boundary condition 315.A 323.F 
adjoint 315.B 
operator with 112.F 
boundary element (in a simply connected domain) 
333.B 
boundary function 160.E 
boundary group 234.B 
boundary homomorphism 
of homology exact sequence 201.L 
in homotopy exact sequences 202.L 
boundary layer 205.C 
boundary layer equation, Prandtl] 205.C 
boundary operator 200.C 
between chain groups 201.B 


linear 315.B 
partial 200.E 
total 200.E 


boundary point 
dual passive 260.H 
entrace 260.H 
exit 260.H 
irregular 120.D 
passive 260.H 
regular 120.D 
ofa subset 425.N 
boundary set 425.N 
boundary space 112.E 
boundary value 
(of a conformal mapping) 77.B 
(of ahyperfunction) 125.V 
(relative to a differential operator) 112.E 
boundary value problem 315 
(for harmonic functions) 193.F 
(in numerical solution of ordinary differential 
equations) 303.H 
adjoint 315.B 
first (for elliptic differential equations) 323.C 
first (for harmonic functions) 193.F 
general 323.H 
homogeneous (of ordinary differential equa- 
tions) 315.B 
inhomogeneous (of ordinary differential equa- 
tions) 315.B 
of ordinary differential equations 315 
second (elliptic differential equations)  323.F 
second (for harmonic functions) 193.F 
self-adjoint 315.B 
solution of App. A, Table 15.VI 
third (for elliptic differential equations) 323.F 
third (for harmonic functions) 193.F 
two-point (of ordinary differential equations) 
315.A 
weak form of the (of partial differeatial equa- 
tions) 304.B 
bounded 
(in an affine space) 7.D 
(half-plane) 155.B 
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(metric space) 273.B 
(ordered set) 311.B 
(set in a topological linear space) 424.F 
(set of real numbers) 87.B 
(torsion group) 2.F 
(vector lattice) 310.B 
(vector measure) 443.G 
order 310.B 
relatively 331.B 
T- 311.B 
totally 273.B 
bounded, essentially (measurable function) 168.B 
bounded approximation property (of a Banach 
space) 37J 
bounded automaton, linear 
deterministic 31.D 
nondeterministic 31.D 
bounded domain 
divisible 284.F 
homogeneous 384.A 412.F 
irreducible symmetric 412.F 
sweepable 284.F 
symmetric 412.F 
bounded from above 
(in an ordered set) 311.B 
(for real numbers) 87.B 
bounded from below 
(for a filtration)  200J 
(in an ordered set) 311.B 
(for real numbers) 87.B 
(for a spectral sequence) 200.3 
bounded functions | 43.A 
bounded linear operator 37.C 
boundedly complete o-ficld | 396.E 
bounded matrix 269.K 
bounded mean oscillation (BMO) 168.B 
bounded metric space, totally 273.B 
bounded motion 420.D 
bounded u-operator 356.B 
boundedness, abscissa of (of a Dirichlet series) 
121.B 
boundedness principle, upper (in potential theory) 
388.C 
boundedness theorem, uniform 37.H 
bounded quantifier 356.B 
bounded set 
in an affine space 7.D 
in a locally convex space 424.F 
ina metric space 273.B 
totally (in a metric space) 273.B 
bounded star body 182.C 
bounded uniform space 
locally totally 436.H 
totally 436.H 
bounded variation 
function of 166.B 
integrable process of 406.B 
mapping of 246.H 
in the sense of Tonelli 246.C 
set function of 380.B 
vector measure of 443.G 
bound state 351.D 
bound variable 411.C 
bouquet 202.F 
Bouquet differential equation, Briot- 288.B 289.B 
Bouquet formulas (on space curves) 111.F 
Bourbaki, Fréchet space in the sense of 424.1 
Boussinesq equation 387.F 
boxes 140 


Subject Index 
Brouwer mapping theorem 


box topology 425.K 
brachistochrone 93.H 
bracket 105.M 
Lagrange 84.A 324.D 
Poisson (of two functions) 105.M 
Poisson (of two vector fields) 271.F 324.C.D 
Toda 202.R 
bracket product (in a Lie algebra) 248 A 
Bradley-Terry model 346.C 
braid(s) 235.F 
closed 235.F 
braid group 235.F 
branch 
(of an analytic function) 198.J 
(of a curve of class CH) 93.G 
(ofa graph) 282.A 
finite (of a curve of class CH) 93.G 
infinite (of a curve of class CH) 93.G 
branch and bound methods 215.D 
branch divisor (in a covering of a nonsingular 
curve) 9I 
branched minimal immersion 275.B 
branched minimal surface 275.B 
branching Markov process 44.E 
branching processes 44 342.A 
age-dependent 44.E 
continuous state 44.E 
Galton- Watson 44.B 
Markov 44.D 
multitype Markov 44.E 
branch point 
(of a covering surface) 367.B 
(of an ordinary curve) 93.C 
algebraic (of a Riemann surface) 367.B 
fixed (of an algebraic differential equation) 
288.A 
logarithmic (of a Riemann surface) 367.B 
movable (of an algebraic differential equation) 
288.A 
branch source 282.C 
Brandt law | 241.C 
Brauer character (of a modular representation) 
362.1 
Brauer group 
(of algebra classes) 29.E 
(of a commutative ring) 29.K 
Brauer theorem 450.G 
Bravais class 92.B 
Bravais group 92.B 
Bravais lattice 92.B 
Bravais type 92.B 
simple 92.E 
breadth (of an oval) 89.C 
Brelot solution, Perron- (of Dirichlet problem) 
120.C 
Brelot solution, Perron-Wiener- (of Dirichlet prob- 
lem) 120.C 
Brianchon theorem 
on conic sections 78.K 
in projective geometry 343.E 
bridge, Brownian 250.F 374.E 
Brieskorn variety 418.D 
Brill-Noether number 9.E 
Briot-Bouquet differential equation 288.B 289.B 
broken line 155.F 
broken symmetry 132.C 
Bromwich integral 240.D 322.D, App. A, Table 12.1 
Brouwer fixed-point theorem 153.B 
Brouwer mapping theorem 99.A 


Subject Index 
Brouwer theorem on the invariance of domain 


Brouwer theorem on the invariance of domain 
117.D 

Browder-Livesay invariant 114.L 

Brownian bridge 250.F 374.E 

Brownian functional 176.1 

Brownian motion 5.D 45 342.A 
d-dimensional 45.C 
{F}- 45.B 406.B 
on Lie groups 406.G 
with an N-dimensional time parameter 45. 
Ornstein-Uhlenbeck  45.I 
right invariant 406.G 
space-time 45.F 

Brown-Shield-Zeller theorem 43.C 

BRS transformation 150.G 

Bruck-Ryser-Chowla theorem 102.E 

Bruhat decomposition (of an algebraic group) 13.K 
relative 13.Q 

Brun theorems, Poincaré- 420.A 

Brun-Titchmarsh theorem 123.D 

Bucy filter, Kalman- 86.E 405.G 

building 130.R 343.1 

bulk viscosity, coefficients of 205.C 

bundle(s) 
canonical 147.F 
complex conjugate 147.F 
complex line 72.F 
complex line, determined by a divisor 72.F 
conormal 274.E 
coordinate 147.B 
coordinate, equivalent 147.B 
cotangent 147.F 
cotangential sphere 274.E 
dual 147.F 
fiber 147.B 
fiber, associated 147.D 
fiber, of class C" 147.0 
fiber, complex analytic 
fiber, orientable 147.L 
fiber, real analytic 147.0 
flat F- 154.B 
foliated 154.B,H 
frame, orthogonal 364.A 
frame, tangent orthogonal n- 364.A 
G- 147.B 
Hopf 147.E 
induced 147.G 
line 147.F 
Maslov 274.C 
normal 105.C 114.B 154.B,E 364.C 274.E 
normal block 147.Q 
normal k-vector 114J 
normal sphere 274.E 
n-sphere 147.K 
n-universal 147.G 
principal 147.C 
principal, associated 
principal fiber 147.C 
product 147.E 
q-block 147.Q 
quotient 16.Y 147.B 
reduced 147J 
spin 237.F 
Spin 237.F 
sub- 16.Y 147.F 
tangent 105.H 147.F 154.B 286.K 
tangential sphere 274.E 
tangent r-frame 108.H 147.F 
tautological line 16.E 
tensor 147.F 


147.0 


147.D 
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trivial 147.E 
unit tangent sphere 126.L 
universal. 147.G,H 
vector 16. Y 147.F 
vector, ample 16.Y 
vector, complex 147.F 
vector, cotangent 147.F 
vector, dual 147.F 
vector, indecomposable 16.Y 
vector, normal 105.L 
vector, quaternion 147.F 
vector, quotient 16.Y 
vector, semistable 16.Y 
vector, stable (on algebraic varieties) 16.Y 
vector, stable (on topological spaces) 237.B 
vector, stably equivalent 237.B 
vector, tangent 108.H 147.F 
bundle group (of a fiber bundle) 147.B 
bundle mapping (map) 147.B 
bundle of homomorphisms  147.F 
bundle of p-vectors 147.F 
bundle space (of a fiber bundle) 147.B 
Bunyakovskii inequality 211.C, App. A, Table 8 
Burali-Forti paradox 319.B 
Burnside conjecture 151.D 
Burnside problem  161.C 
restricted 161.C 
Burnside ring 431.F 
Burnside theorem 151.D 
burst error 63.E 
Bush-Mosteller model 346.G 


C 


c (cardinal number of R) 49.A 
C'(Q) (the totality of I times continuously differenti- 
able functions in Q) 168.B 
C$(Q) (the totality of functions in C'(Q) whose 
supports are compact subsets of Q) 168.B 
c (a sequence space) 168.B 
C (complex numbers) 74.A 294.A 
x-equivalent (closed on G-manifolds) 431.F 
€-group 52.M 
@-theory, Serre 202.N 
C-analytic hierarchy 356.H 
C-arithmetical hierarchy 356.H 
C-equivalent almost complex manifolds 
C-field 118.F 
C,(d)-field 118.F 
C,set 22D 
Íc,j-consistency, (c,]-consistent 399.K 
C*-algebra 36.G 
liminal 36.E 
postliminal 36.E 
of typeI 308.L 
C*-cross norm 36.H 
C*-dynamical system 36.K 
C*-group algebra (of a locally compact Hausdorff 
space) 36.L 
C*-tensor product, projective 36.H 
(C,a)-summation 379.M 
c,-bundle 237.F 
c,-mapping 237.G 
C-analytic space 23.E 
C-covering space 23.E 
C'-conjugacy, C'-conjugate 126.B 
C’-equivalence, C'-equivalent 126.B 
C'-flow 126.B 
C’-foliation 154.G 
C’-function in a C?-manifold 


114.H 


105.G 
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C'-manifold 105.D 
with boundary 105.E 
without boundary 105.E 
compact 105.D 
paracompact - 105.D 
C'-mapping 105.J 
C’-norm 126.H 
C’-structure 108.D 
subordinate to (for a C*-structure) 108.D 
on a topological manifold 114.B 
C'- (stabile 126.H 
C’-structurally stable 126.H 
C’-triangulation 114.C 
C?-homomorphism (between Lie groups) 249.N 
C?-isomorphism (between Lie groups) 249.N 
C?-function (of many variables) 58.B 
germ of (at the origin) 58.C 
preparation theorem for 58.C 
rapidly decreasing 168.B 
slowly increasing 125.0 
C?-functions and quasi-analytic functions 58 
C? topology, weak 401.C 
CA set (in set theory) 22.A 
Caianiello differential equation 291.F 
calculable function, effective 356.C 
calculable number 22.G 
calculation, graphical 19.B 
calculator 75.A 
calculus 
differential 106 
fundamental theorem of the infinitesimal 216.C 
Heaviside 306.A 
holomorphic functional 36.M 
infinitesimal (in nonstandard analysis) 273.D 
Kirby 114.L 
operational 251.G 306, App. A, Table 12.II 
predicate 411.J 
predicate, with equality 411.J 
propositional 411.F 
of residue 198.F 
stochastic 406.A 
tensor 417.A, App. A, Table AU 
calculus of variations 46 
classical theory of 46.C 
conditional problems in 46.A 
fundamental lemma in 46.B 
Calderon-Zygmund singular integral operator 
217.J 251.0 
Calderón-Zygmund type, kernel of 217J 
Calkin algebra | 36J 
Callan-Symanzik equation 361.B 
Campbell-Hausdorff formula 249.R 
CAN estimator 399.K 
canceling 138.B 
cancellation law 
on the addition of natural numbers 294.B 
in a commutative semigroup 190.P 
on the multiplication of natural numbers 294.B 
canonical affine connection (on R") 80.J 
canonical anticommutation relation 277.A 
canonical backward moving average representation 
395.D 
canonical basis (of a chain group of a finite simpli- 
cial complex) 201.B 
Chevalley (of a complex semisimple Lie algebra) 
248.Q 
Weyl (of a semisimple Lie algebra) 248.P 
canonical bilinear mapping (on tensor products of 
linear spaces) 256.1 


Subject Index 
Canonical representation 


canonical bundle (of a differentiable manifold) 147.F 
canonical class (of an algebraic curve) 9.C 
canonical cohomology class (in Galois cohomology 
in class field theory) 59.H 
canonical commutation relations 351.C 377.A,C 
canonical coordinates (of a Lie group) 
of the first kind 249.Q 
of the second kind 249.Q 
canonical coordinate system (for a conic section) 
78.C 
canonical correlation coefficient 280.E 374.C 
canonical decomposition (of a closed operator) 
251.E 
canonical decomposition theorem  86.C 
canonical divisor 
(of an algebraic curve) 9.C 
(of an algebraic variety) 16.0 
(of a Jacobian variety) 9.E 
(ofa Riemann surface) 11.D 
canonical element (in the representation of a functor) 


52.L 
canonical ensemble 402.D 
grand 402.D 


canonical equation 324.E 
Hamilton 271.F 
canonical field 377.C 
canonical form 
(of F(M)) 191.A 
(of a linear hypothesis) 400.H 
(of a regular submanifold of F(M)) 191.A 
of the equation (of a quadric surface) 350.B 
Khinchin (of an infinitely divisible probability 
distribution) 341.G 
Kolmogorov (of an infinitely divisible pro- 
bability distribution) 341.G 
Lévy (of an infinitely divisible probability 
distribution) 341.G 
Weierstrass (for an elliptic curve) 9.D 
Weierstrass (of the gamma function) 174.A 
canonical function (on a nonsingular curve) 9.E 
canonical homology basis 11.C 
canonical homomorphism 
(on direct products of rings) 368.E 
(on tensor products of algebras) 29.A 
canonical injection 
(from a direct summand) 381.E 
(in direct sums of modules) 277.F 
(in free products of groups) 190.M 
(from a subgroup) 190.D 
(from a subset) 381.C 
canonically bounded 200.J 
canonically polarized Jacobian variety 3.G9.E 
canonical measure 
(in a birth and death chain) 260.G 
(in a diffusion process) 115.B 
(in a Markov chain) 260.I 
canonical model 251.N 
canonical 1-form (of the bundle of tangent n-frames) . 
80.H 
canonical parameter 
ofanarc 111.D 
local (for power series) 339.A 
canonical product, Weierstrass 429.B 
canonical projection 
(on direct products of modules) 277.F 
(onto a quotient set) 135.B 
canonical representation 
(of a Gaussian process) 176.E 
generalized 176.E 


Subject Index 
Canonical scale 


canonical scale 115.B 
canonical scores 397.M 
canonical surjection 
(on direct products of groups) 190.L 
(to a factor group) 190.D 
(onto a quotient set) 135.B 
canonical transformation 271.F 
(concerning contact transformations) 82.B 
group of. 271.F 
homogeneous 82.B 
canonical variables (in analytical dynamics) 271.F 
canonical variates 280.E 
canonical vectorial form | 417.C 
Cantelli lemma, Borel- 342.B 
Cantelli theorem, Glivenko- 374.E 
Cantor, G. 47 
Cantor discontinuum 79.D 
Cantor intersection theorem 273.F 
Cantor-Lebesgue theorem 159.J 
Cantor normal form (for an ordinal number) 312.C 
Cantor set 79.D 
general 79.D 
Cantor's theory of real numbers 244.E 
capability, error-correcting |. 63.B 
capacitable set 48.H 
capacitary dimension | 48.G 
capacitary mass distribution 338.K 
capacitated network 281.C 
capacity 
(of discrete memeoryless channel) 213.F 
(of a prime ideal) 27.A 
(ofaset) 48 260.D 
(transportation and scheduling) 281.D 
æ- 169.C 
analytic 169.F 
continuous analytic 164.J 
ergodic 213.F 
logarithmic 48.B 
Newtonian 48.B 
Newtonian exterior 48.H 
Newtonian inner ANE 
Newtonian interior 48.F 
Newtonian outer 48.H 
of order a 169.C 
stationary 213.F 
capacity constraint 281.D 
capillary wave 205.F 
cap product 
(ofa cochain and a chain) 200.K 
(of a cohomology class and a homology class) 
201.K 
capture 
complete 420.D 
partial 420.D 
CAR 377A 
Carathéodory measure 270.E 
Caratheodory outer measure 270.E 
Carathéodory pseudodistance 21.D 
Cardano formula (on a cubic equation) 10.D, 
App. A, Table 1 
cardinality 33.F 
(of an ordinal number) 49.E 
(ofaset) 49.A 312.D 
cardinal number(s) 49.A 312.D 
comparability theorem for 49.B 
of continuum 49.A 
corresponding to an ordinal number 49.E 
finite 49.A 
infinite 49.A 
measurable 33.E 
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of N 49A 
ofR 49A 
of all real-valued functions on [0,:] 49.A 
ofaset 49.A 312.D 
strongly compact 33.E 
strongly inaccessible 33.E 
transfinite 49.A 
weakly compact 33.E 
weakly inaccessible 33.E 
cardinal product (of a family of ordered sets) 311.F 
cardinal] sum (of a family of ordered sets) 311.F 
cardioid 93.H 
Carleman condition, Denjoy- 168.B 
Carleman inequality App. A, Table 8 
Carleman theorem 
(on asymptotic expansions) 30.A 
(on bounded functions) 43.F 
Carleman type, kernel of 217J 
carrier 
(of a differential form) 108.Q 
(ofa distribution) 125.D 
(of a function) 125.B 168.B 
(of a real-valued function) 425.R 
Carson integral App. A, Table 12.II 
Cartan, E. 50 
Cartan, differential form of Maurer- 249.R 
Cartan, system of differentia] equations of Maurer- 
249.R 
Cartan connection 80.M 
Cartan criterion of semisimplicity (on Lie algebras) 
248.F 
Cartan criterion of solvability (on Lie algebras) 
248.F 
Cartan formula 
for Steenrod pth power operations 64.B 
for Steenrod square operations GAB 
Cartan integer (of a semisimple Lie algebra) 248.N 
Cartan invariant (of a finite group) 362.I 
Cartan involution 437.X 
Cartan-Káhler existence theorem 428.E 
Cartan-Mal'tsev-Iwasawa theorem (on maximal 
compact subgroups) 249.8 
Cartan maximum principle 338.L 
Cartan pseudoconvex domain 21.1 
locally 21.I 
Cartan relative integral invariant 219.B 
Cartan subalgebra 
(of a Lie algebra) 248.1 
(of asymmetric Riemannian space) 413.F 
Cartan subgroup 
(of an algebraic group) 13.H 
(ofa group) 249 
Cartan space 152.C 
Cartan theorem 
on analytic sheaves (H. Cartan) 72.E 
on representations of Lie algebras (E. Cartan) 
248.W 
Cartan-Thullen theorem 21.H 
Cartan-Weyl theorem | 248.W 
Carter subgroup 151.D 
Cartesian coordinates (in an affine space) 7.C 
Cartesian product 
(of complexes) 70.C,E 
(of mappings) 381.C 
(of sets) 381.B.E 
Cartesian space 140 
Cartier divisor 16.M 
Cartier operator 9.E 
CA set 22.A 
case complexity, worst 71.A 
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Casimir element (of a Lie algebra) 248.J 
Casorati determinant 104.D 
Casorati-Weierstrass theorem (on essential singular- 
ities) 198.D 
Cassini oval 93.H 
Casson handle 114.M 
Castelnuovo criterion 15.E 
Castelnuovo lemma  3.E 9.H 
casus irreducibilis 10.D, App. A, Table 1 
Catalan constant App. A, Table 10.III 
catastrophe, elementary 51.E 
catastrophe point SI.F 
catastrophe set 51.F 
catastrophe theory 51 
categorical 
(data) 397.B 
(set of closed formulas) 276.F 
categorical system (of axioms) 35.B 
categoricity in powers 276.F 
categories and functors 52 
category 52.A 
Abelian 52.N 
of Abelian groups 52.B 
additive 52.N 
of analytic manifolds 52.B 
cohomology theory on the 261.Q 
of commutative rings 52.B 
diagram inthe 52.C 
of differentiable manifolds 52.B 
dual 52.F 
exact 237 
Grothendieck 200.1 
of groups 52.B 
homotopy, of topological spaces 52.B 
of left (right) R-modules 52.B 
of linear spaces over R 52.B 
of pointed topological spaces 202.B 
PL 65.A 
product 52.A 
quotient 52.N 
ofrings 52.B 
set of the first 425.N 
set of the second 425.N 
ofsets 52.B 
shape 382.A 
of S-objects 52.G 
of topological spaces 52.B 
catenary 93.H 
catenoid 111.1 
Cauchy, A. L. 53 
Cauchy condensation test 379.B 
Cauchy condition (on D-integral and D(*)-integral) 
100.E 
Cauchy criterion (on the convergence of a sequence 
of real numbers) 87.C, App. A, Table 10.II 
Cauchy data 321.A 
Cauchy distribution 341.D, App. A, Table 22 
Cauchy existence theorem (for partial differential 
equations) 320.B 
Cauchy filter (on a uniform space) 436.G 
Cauchy-Hadamard formula 339.A 
Cauchy inequality 211.C, App. A, Table 8 
Cauchy integral formula 198.B 
Cauchy integral representation 21.C 
Cauchy integral test (for convergence) 379.B 
Cauchy integral theorem 198.A,B 
stronger form of 198.B 
Cauchy-Kovalevskaya existence theorem 321.A 
Cauchy-Kovalevskaya theorem, abstract 286.Z 
Cauchy net (in a uniform space) 436.G 





Subject Index 
Cell complex 


Cauchy polygon 316.C 
Cauchy principal value 
of an improper integral 216.D 
of the integral on infinite intervals 216.E 
Cauchy problem 
(of ordinary differential equations) 316.A 
(for partial differential equations) 315.A 320.B 
321.A 325.B 
abstract 286.X 
Cauchy process 5.F 
asymmetric AF 
symmetric 5.F 
Cauchy product (of two series) 379.F 
Cauchy remainder App.A, Table 9.IV 
Cauchy-Riemann (differential) equation 198.A 
274.G 
(for a holomorphic function of several complex 
variables) 21.C 
(for a holomorphic function of two complex 
variables) 320.F 
Cauchy-Riemann structure 344.A 
Cauchy-Schwarz inequality 211.C, App. A, Table 8 
Cauchy sequence 
(in an a-adic topology) 284.B 
(in a metric space) 273J 
(of rational numbers) 294.E 
(of real numbers) 355.B 
(in a uniform space) 436.G 
Cauchy sum (of a series) 379.A 
Cauchy theorem 379.F 
Cauchy transform 164.J 
causality, macroscopic 386.C 
cause, most probable 401.E 
caustic 325.L 
Cayley algebras 54 
general 54 
Cayley number 54 
Cayley projective plane 54 
Cayley theorem (in group theory) 151.H 
Cayley theorem, Hamilton- 269.F 
Cayley transform (of a closed symmetric operator 
ina Hilbert space) 251.1 
Cayley transformation 269.J 
CCP (chance-constrained programming) 408.B 
CCR 377.A 
CCR algebra 36.H 
Cech cohomology group (for topological spaces) 
201.L,P 
relative 201.M 
Cech cohomology group with coefficient sheaf F 
383.F 
Cech compactification, Stone-  207.C 
Cech complete space 425.T 436.I 
Cech homology group (for topological spaces) 
201.M 
relative 201.M 
ceiling function 136.D 
ceiling states 402.G 
celestial mechanics 55 
cell 70.D 
convex (in an affine space) 7.D 
fundamental (of a symmetric Riemann space) 
413.F 
n- (in a Hausdorff space) 70.D 
(n — q-dual 65.B 
topological n- 140 
unit 140 
cell complex 70.D 
closure finite 70.D 
countable 70.D 


Subject Index 1938 
Cell-like (CE) 


Euclidean 70.B chain 200.H 
finite 70.D ascending (of normal subgroups of a group) 
locally countable 70.D 190.F 
locally finite 70.D ascending (in an ordered set) 311.C 
regular 70.D conservative 260.A 
cell-like (CE) 382.D descending (in a lattice) 243.F 
cellular approximation theorem 70.D descending (of (normal) subgroups of a group) 
cellular cohomology group 201.H 190.F 
cellular decomposition (of a Hausdorff space) 70.D descending (in an ordered set) 311.C 
cellular homology group 201.F,G equivalence 200.H 
cellular mapping (between cell complexes) 70.D general Markov 260.J 
CE mapping 382.D homotopy 200.H 
center irreducible (a Markov chain) 260.B 
(of a central symmetry) 139.B Markov 260.A 
(of a continuous geometry) 85.A minimal 260.F 
(of gravity) 271.E normal (ina group) 190.G 
(ofa group) 190.C normal (in Markov chains) 260.D 
(of a hyperbola or ellipse) 78.C q- (of a chain complex) 201.B 
(of a lattice) 243.E recurrent 260.B 
(of a Lie algebra) 248.C regular (of integral elements) 428.E 
(of mass) 271.E u- 260.1 
(of a nonassociative algebra) 231.A chain complex(es) 200.C,H 201.B 
(of a pencil of hyperplanes} 343.B in an Abelian category 201.B 
(of a quadric hypersurface) 7.G of A-modules 200.C 
(of a quadric surface) 350.A augmented 200.C 
(of a regular polygon) 357.A double 200.E 
(of a regular polyhedron) 357.B oriented simplicia] 201.C 
(ofa ring) 368.F positive 200.H 
(ofa solid sphere) 140 product double 200.E 
(ofa sphere) 139.1 quotient 200.C 
(of a von Neumann algebra) 308.C relative 200.C 
centered process 5.B singular (of a topological space) 201.E 
centering 5.B chain condition (in an ordered set) 311.C 
center manifold theorem  286.V ascending (in an ordered set) 311.C 
center of curvature 111.E descending (in an ordered set) 311.C 
center of projection 343.B chained metric space, well- 79.D 
center surface 111.1 chain equivalence 200.C 
central composite design 102.M chain homotopy 200.C 
central configuration 420.B chain mapping 200.C 201.B 
central conic(s) 78.C over an A-homomorphism . 200.C 
centra] difference 223.C 304.E, App. A, Table 21 chain recurrent 126.E 
central element (in a lattice) 243.E chain recurrent set 126.E 
central extension (of a group) 190.N chain rule 106.C 
central figure | 420.B chain subcomplex 200.C 
centralizer chain theorem 14.J 
(of a subset ofa group) 190.C chain transformation (between complexes) 200.H 
(of a subset of a ring) 368.F Chaitin complexity, Kolmogorov- 354.D 
central limit theorem  250.B chamber 
central moment 397.C positive Weyl 248.R 
central motion 126.E Weyl 13.J 248.R 
central potential 315.E chamber complex 13.R 
central processor 75.B chance-constrained programming 408.A 
central quadric hypersurface 7.F 350.G chance constraint 408.A 
central quadric surface 350.B chance move 173.B 
central series change 
ascending (of a Lie algebra) 248.C scalar (of a B-module) 277.L 
descending (of a Lie algebra) 248.C time 261.F 406.B 
lower {ofa group) 190J of variables (in integral calculus) 216.C 
upper (ofa group) 190.J channel 375.F 
central simple algebra 29.E almost finite memory 213.F 
similar 29.E d-continuous 213.F 
central symmetry (of an affine space) 139.B discrete memoryless 213.F 
centrifugal force 271.D finite memory 213.F 
certainly, almost 342.B,D noisy 213.A 
certainty equivalent 408.B test 213.E 
Cesàro method of summation of order a 379.M channel coding theory 213.A 
summable by 379.M channel Hilbert space 375.F 
Ceva theorem 7.A channel wave operators 375.F 
CFL condition 304.F chaos 126.N 303.G 433.B 


CG method 302.D propagation of 340.F 
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Chaplygin's differential equation 326.B 
Chapman complement theorem 382.B 
Chapman-Kolmogorov equality 261.A 
Chapman-Kolmogorov equation 260.A 
Chapman-Robbins-Kiefer inequality 399.D 
Chapman theorem (on (C, x)-summation) 379.M 
character 
(of an Abelian group) 2.G 
(of an algebraic group) 13.D 
(irreducible unitary representation) 437.V 
(of a linear representation) 362.E 
(of a regular chain) 428.E 
(of a representation of a Lie group) 249.0 
(of a semi-invariant) 226.A 
(of a topological Abelian group) 422.B 
absolutely irreducible 362.E 
Brauer (of a modular representation) 362.I 
Chern (of a complex vector bundle) 237.B 
Dirichlet 295.D 
Hecke 6.D 
identity (of an Abelian group) 2.G 
integral (on the 1-dimensional homology group 
ofa Riemann surface) 11.E 
irreducible (of an irreducible representation) 
362.E 
Minkowski-Hasse (of a nondegenerate quadratic 
form) 348.D 
modular (of a modular representation) 362.I 
nonprimitive 450.C,E 
planar 367.G 
primitive 295.D 450.C,E 
principal (of an Abelian group) 2.G 
principal Dirichlet 295.D 
reduced (of an algebra) 362.E 
residue 295.D 
simple (of an irreducible representation) 362.E 
character formula, Weyl 248.2 
character group 
(ofan Abelian group) 2.G 
(of a topological Abelian group) 422.B 
characteristic(s) 
(of a common logarithm) 131.C 
(of a field) 149.B 
Euler (of a finite Euclidean cellular complex) 
201.B 
Euler-Poincare (of a finite Euclidean complex) 
16.E 201.B 
operating 404.C 
population 396.C 
quality  404.A 
sample 396.C 
Todd 366.B 
two-terminal 281.C 
characteristic class(es) 56 
(of an extension of a module) 200.K 
(of a fiber bundle) 147.K 
(of foliations) 154.G 
(of a vector bundle) 56.D 
(of a real oriented vector bundle) 237.F 
of codimension q 154.G 
ofa manifold 56.F 
smooth, of foliations 154.G 
characteristic curve 
(ofa network) 281.B 
(of a one-parameter family of surfaces) 111.1 
(of a partial differential equation) 320.B 
324.A,B 
characteristic equation 
(of a differential-difference equation) 163.F 
(for a homogeneous system of linear ordinary 


Subject Index 
Characteristic vector 


differential equations) 252.J 
(of a linear difference equation) 104.E 
(of a linear ordinary differential equation) 
252.E 
(of a matrix) 269.F 
(of a partial differential equation) 320.D 
(of a partial differential equation of hyperbolic 
type) 325.A,F 
characteristic exponent 
(of an autonomous linear system) 163.F 
(of the Hill differential equation) 268.B 
(of a variational equation) 394.C 
characteristic function(s) 
(of a density function) 397.G 
(of a graded R-module) 369.F 
(of a meromorphic function) 272.B 
(of an n-person cooperative game) 173.D 
(for an optical system) 180.C 
(of probability distributions) 341.C 
(ofa subset) 381.C 
empirical 396.C 
Hilbert (of a coherent sheaf) 16.E 
Hilbert (of a graded module) 369.F 
characteristic functional (of a probability distribu- 
tion 407.C 
characteristic hyperplane (of a partial differential 
equation of hyperbolic type) 325.A 
characteristic hypersurface (of a partial differential 
equation of hyperbolic type)  325.A 
characteristic linear system (of an algebraic family) 
15.F 
characteristic line element 82.C 
characteristic manifold (of a partial differential 
equation) 320.B 
characteristic mapping (map) (in the classification 
theorem of fiber bundles) 147.G 
characteristic measure 407.D 
characteristic multiplier 
(of a closed orbit) 126.G 
(of a periodic linear system) 163.F 
characteristic number 
(of a compact operator) 68.1 
(ofa manifold) 56.F 
Lyapunov 314.A 
characteristic operator function 251.N 
characteristic polynomial 
(of a differential operator) 112.A 
(of a linear mapping) 269.L 
(ofa matrix) 269.F 
(of a partial differential operator) 321.A 
characteristic root 
(of a differential-difference equation) 163.A 
(of a linear mapping) 269.L 
(of a linear partial differential equation) 325.F 
(ofa matrix) 269.F 
characteristic series (in a group) 190.G 
characteristic set 
(of an algebraic family on a generic component) 
15.F 
(of a partial differential operator) 320.B 
characteristic strip 320.D 324.B 
characteristic surface 320.B 
characteristic value 
(of a linear operator) 390.A 
sample 396.C 
characteristic variety (of a system of microdifferential 
equations) 274.G 
characteristic vector 
(of a linear mapping) 269.L 
(of a linear operator) 390.A 


Subject Index 
Character module (of an algebraic group) 


character module (of an algebraic group) 13.D 
character system (of a genus of a quadratic field) 
347.F 
charge 150.B 
charge symmetry 415.J 
Charpit method, Lagrange- 322.B, App. A, Table 
15.1 
Charpit subsidiary equation 82.C 320.D 
chart 
alignment 19.D 
control 404.B 
intersection 19.D 
Châtelet group, Weil- 118.D 
Chebotarev density theorem 14.8 
Chebyshev formula |. 299.A 
Gauss- (in numerical integration) 299.A 
Chebyshev function App. A, Table 20.II 
Chebyshev interpolation 233.A 366.J 
Chebyshev orthogonal polynomial 19.G 
Chebyshev polynomial 336.H 
Chebyshev g-function 19.G, App. A, Table 20.VII 
simplest 19.G 
Chebyshev system 336.B 
Chebyshev theorem 336.B 
check bits 63.C 
check matrix, parity 63.C 
Cheng domain theorem, Courant- 391.H 
Chern character (of a complex vector bundle) 
237.B 
Chern class 
of a C"-bundle 56.C 
ofa manifold 56.F 
of a real 2n-dimensional almost complex 
manifold 147.N 
total 56.C 
of a U(n)-bundle 147.N 
universal 56.C 
Chern formula (in integral geometry) 218.D 
Chern number 56.F 
Cherwell-Wright differential equation 291.F 
Chevalley canonical basis (of a complex semisimple 
Lie algebra) 248.Q 
Chevalley complexification (of a compact Lie group) 
249.U 
Chevalley decomposition (on algebraic groups) 
13.1 
Chevalley group 151.1 
Chevalley theorem (forms over finite fields) 118.B 
Chevalley theorem (on algebraic groups) 13.B 
Chevalley type (algebraic group) 13.N 
Chinese mathematics 57 
Chinese remainder theorem ` 297.G 
chi-square distribution 374.B, App. A, Table 22 
noncentral 374.B 
chi-square method, modified minimum  400.K 
chi-square test 400.G 
of goodness of fit 400.K 
choice, axiom of 33.B 34.A 
choice function 33.B 34.A 
choice process, multistage 127.A 
choice set 33.B 34.A 
Cholesky method 302.B 
Chomsky grammar 31.D 
Chomsky hierarchy 31.D 
Choquest boundary 164.C 
Chow coordinates (of a positive cycle) 16.S 
Chow-Kodaira theorem 72.F 
Chowla theorem, Bruck-Ryser- 102.E 
Chow lemma  72.H 
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Chow ring (of a projective variety) 16.R 
Chow theorem 
on an analytic submanifold of P^  72.F 
on the field of rational functions of an analytic 
space 21D 
Chow variety 16S 
Christoffel-Darboux formula 317.D 
Christoffel symbol 80.L 111.H 417.D 
Christoffel transformation, Schwarz- 77.D, App. A, 
Table 13 
Christoffel transformation formula, Schwarz- 77.D 
chromatic number 157 186.I 
chromodynamics, quantum 132.C 
Chung-Erdós theorem 342.B 
Church’s thesis 356.C 
circle(s) 140 
Apollonius App. A, Table 3.V 
auxiliary (of an ellipse) 78.D 
of convergence (of a power series) 339.A 
of curvature 111.E 
director (of an ellipse) 78.D 
great (of a sphere) 140 
inscribed (of a regular polygon) 357.A 
isometric 234.C 
of meromorphy (of a power series) 339.D 
open 140 
oscillating  111.E 
quadrature of 179.A 
unit 74.C 140 
circled subset (of a topological linear space) 424.E 
circle geometry 76.C 
circle method 4.B 
circle problem, Gauss 242.A 
circle type, limit 112.1 
circuit 66.G 
circuit matrix 254.B 
circular cone 78.A 111.1 
oblique 350.B 
right 350.B 
circular cylinder 111.1 350.B 
circular cylindrical coordinates App. A, Table 3.V 
circular cylindrical surface 350.B 
circular disk 140 
circular domain 333.A 
circular frequency (of a simple harmonic motion) 
318.B 
circular function 131.F 432.A 
circular section 350.F 
circular unit 14.L 
circulation (of a vector field) 205.B 442.D 
circumference 140 
circumferentially mean p-valent 438.£ 
circumscribed circle (of a regular polygon) 357.A 
circumscribing sphere (of a simplex) 139.I 
cissoidal curve 93.H 
cissoid of Diocles 93.H 
Clairaut differential equation App. A, Table 14.1 
Clairaut partial differential equation App. A, 
Table 15.1I 
class 
(in axiomatic set theory) 33.C 381.G 
(of a lattice group) 13.P 
(ofa nilpotent group) 190.J 
(of a plane algebraic curve) 9.B 
(of a quadratic form) 348.H,I 
-s?-characteristic (of a real oriented vector 
bundle) 237.F 
algebra (of central simple algebras) 29.E 
ambig (of a quadratic field) 347.F 
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Bravais 92.B 

canonical (of an algebraic curve) OC 

canonical cohomology 59.H 

canonical divisor 11.D 

characteristic (of an extension of module) 
200.K 

characteristic (of a fiber bundle) 147.K 

characteristic (of foliations) 154.G 

characteristic (of a vector bundle) 56 

characteristic, of codimension q 154.G 

characteristic, of a manifold 56.F 

Chern (of a C"-bundle) 56.C 

Chern (of a manifold) 56.F 

Chern (of a real 2n-dimensional almost complex 
manifold) 147.N 

Chern (of a U(n)-bundle) 147.N 

cohomology 200.H 

combinational Pontryagin 56.H 

complete 398.B 

completely additive 270.B 

conjugacy (of an element of a group) 190.C 

countably additive 270.B 

crystal 92.B 

curve of the second 78.K 

differential divisor (of a Riemann surface) 11.D 

divisor (on a Riemann surface) 11.D 

Dynkin 270.B 

the Dynkin, theorem 270.B 

equivalence 135.B 

ergodic 260.B 

essentially complete 398.B 

Euler-Poincaré (of a manifold) 56.F 


Euler-Poincaré (of an oriented R"-bundle) 56.B 


finitely additive 270.B 

fundamental (of an Eilenberg-MacLane space) 
70.F 

fundamental (of a Poincaré pair) 114J 

fundamental (of the Thom complex MG) 
114.G 

fundamental, with coefficient Z, 65.B 

generalized Hardy 164.G 

Gevrey 58.G 125.U 

group of congruence 14.H 

Hardy 43.F 159.G 

Hilbert-Schmidt 68.I 

holosymmetric 92.B 

homology 200.H 201.B 

homotopy 202.B 

ideal (of an algebraic number field) 14.E 

ideal (of a Dedekind domain) 67.K 

ideal, in the narrow sense 14.G 343.F 

idele 6.D 

idele, group 6.D 

linear equivalence (of divisors) 16.M 

main 241.A 

mapping 202.B 

minimal complete 398.B 

monotone 270.B 

the monotone, theorem  270.B 

multiplicative 270.B 

nuclear 68. 

oriented cobordism  114.H 

Pontryagin (of a manifold) 56.F 

Pontryagin (of an R"-bundle) 56.D 

proper 381.G 

q-dimensional homology 201.B 

of a quadratic form over an algebraic number 
field 348.H 

residue (modulo an idealin a ring) 368.F 
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Classical (state) 


Steifel- Whitney (of a differentiable manifold) 
147.M 
Stiefel-Whitney (of a manifold) 56.F 
Stiefel-Whitney (of an O(n)-bundle) 147.M 
Stiefel-Whitney (of an R"-bundle) 56.B 
Stiefel- Whitney (of a topological manifold) 
56.F 
surface of the second 350.D 
Todd 237.F 
total Chern 56.C 
total Pontryagin 56.D 
total Stiefel-Whitney 56.B 
trace 68.1 
universal Chern 56.C 
universal Euler-Poincaré 56.B 
universal Stiefel-Whitney 56.B 
unoriented cobordism 114.H 
Wu (of a topological manifold) 56.F 
Zygmund 159.E 
class C* 
function of 106.K 
function of (of many variables) 58.B 
mapping of 286.E 
oriented singular r-simplex of 108.T 
partition of unity of 108. 
singular r-chain of 108.T 
singular r-cochain of 108.T 
class C* 
curve of (in a differentiable manifold) 93.B 
curve of (in a Euclidean plane) 93.B 
class C", function of 106.K 
class C”, function of 106.K 
class C! 
function of 106.K 
mapping of 286.E 
class C" 
atlas of 105.D 
coordinate neighborhood of 105.D 
diffeomorphism of 105.J 
differentiable manifold of 105.D 
differentiable manifold with boundary of 
105.E 
differentiable mapping of 105.J 
differentiable structure of 105.D 
fiber bundle of 147.0 
function of (in a C?-manifold) 105.G 
function of (at a point) 105.G 
functionally dependent of (components of a 
mapping) 208.C 
functional relation of 208.C 
mapping of 208.B 286.E 
nonsingular mapping of 208.B 
regular mapping of 208.B 
vector field of 105.M 
Z-action of 126.B 
class C', tensor field of 108.0 
class CH. mapping into Banach space of 286.E 
class (C°), semigroup of 378.B 
class D*, curve of 364.A 
class field 59.B 
absolute 59.A 
class field theory 59 
local 59.G 
class field tower problem 59.F 
class formation 59.H 
class function (on a compact group) 69.B 
class group divisor 11.D 
classical (potential) 402.G 
classical (state) 402.G 


Subject Index 
Classical compact real simple Lie algebra 


classical compact real simple Lie algebra 248.T 
classical compact simple Lie group  249.L 
classical complex simple Lie algebra |. 248.S 
classical complex simple Lie group 249.M 
classical descriptive set theory 356.H 
classical dynamical system  126.L 136.G 
classical group(s) 60.A 
infinite 147.1 202.V 
classical logic 411.L 
classical mechanics 271.A 
classical risk theory 214.C 
classical solution (to Plateau’s problem) 275.C 
classical statistical mechanics 402.A 
classical theory of the calculus of variations 46.C 
classification (with respect to an equivalence 
relation) 135.B 
classification theorem 
on a fiber bundle 147.G 
first (in the theory of obstructions) 305.B 
Hopf 2021 
second (in the theory of obstructions) 305.C 
third (in the theory of obstructions) 305.C 
classification theory of Riemann surfaces 367.E 
classificatory procedure 2901 
classifying mapping (map) (in the classification 
theorem of fiber bundles) 147.G 
classifying space 
(of a topological group) 174.G,H 
cohomology rings of App. A, Table 6.V 
for I7-structures 154.E 
n- (of a topological group) 147.G 
class n 
function of 84.D 
projective set of 22.D 
class Nx, null set of 169.E 
class number 
(of an algebraic number field) 14.E 
(of a Dedekind domain) 67.K 
(of a simple algebra) 27.D 
class of Abelian groups 202.N 
class œ, function of 84.D 
class 1 
function of 84.D 
function of at most 84.D 
class theorems, complete 208 D 
class £, function of. 84.D 
class 0, function of 84.D 
Clebsch-Gordan coefficient 258.B 353.B 
Clenshaw-Curtis formulas | 200. A 
Clifford algebras 61 
Clifford group 61.D 
reduced 61.D 
special 61.D 
Clifford number 61.A 
Clifford torus 275.F 
Clifford torus, generalized 275.F 
clinical trials 40.F 
closable operator 251.D 
closed 
absolutely (space) 425.U 
algebraically (in a field) 149.1 
algebraically (field) 149.1 
boundary 164.C 
H-(space) 425.U 
hyperbolic, orbit 126.G 
integrally (ring) 67.1 
k- (algebraic set) 13.A 
multiplicatively, subset (of a ring) 67.I 
quasi-algebraically (field) 118.F 
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r- (space) 425.U 
real, field 149.N 
Zariski 16.A 
closed arc 93.B 
closed braid 235.F 
closed convex curve 111.E 
closed convex hull 424.H 
closed convex surface 111.1 
closed covering 425.R 
closed curve, simple 93.B 
closed differential 367.H 
closed differential form 105.Q 
closed formula 276.A 299.A 
in predicate logic 411.J 
closed geodesic 178.G 
closed graph theorem 37.1 251.D 424.X 
closed group 362.J 
closed half-line (in affine geometry) 7.D 
closed half-space (of an affine space) 7.D 
closed ideals in L,(G) 192.M 
closed image (of a variety) 16.1 
closed interval 140 
inR 355.C 
closed linear subspace (of a Hilbert space) 197.E 
closed manifold 105.B 
closed mapping 425.G 
closed operator (on a Banach space) 251.D 
closed orbit 126.D,G 
hyperbolic 126.G 
closed plane domain 333.A 
closed path 
(ina graph) 186.F 
(in a topological space) 170 
direct 186.F 
space of 202.C 
closed range theorem 37.J 
closed Riemann surface 367.A 
closed set 425.B 
locally 425.J 
relative 425J 
system of 425.B 
Zariski 16.A 
closed subalgebra 36.B 
closed subgroup (of a topological group) 423.D 
closed submanifold (of a C*-manifold) 105.L 
closed subsystem (of a root system) 13.L 
closed surface 410.B 
in a 3-dimensional Euclidean space 111.I 
closed system 419.A 
closed system entropy 402.G 
closed term (of a language) 276.A 
closure 425.B 
(ina matroid) 66.G 
(of an operator) 251.D 
algebraic (of a field) 149.1 
convex (in an affine space) 7.D 
integral (of a ring) 67.I 
Pythagorean (of a field) 155.C 
closure finite (cell complex) 70.D 
closure operator 425.B 
closure-preserving covering 425.X 
clothoid 93.H 
cloverleaf knot | 235.C 
cluster 375.F 
cluster decomposition Hamiltonian 375,.F 
clustering property 402.G 
cluster point 425.0 
cluster set(s) 62.A 
boundary 62.A 
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curvilinear 62.C 
interior 62.A 
cluster value 62.A 
cluster value theorem 43.G 
cn App. A, Table 16.III 
coalgebra 203.F 
cocommutative 203.F 
dual 203.F 
graded 203.B 
quotient 203.F 
coalgebra homomorphism 203.F 
coanalytic set 22.A 
coarse moduli scheme 16.W 
coarse moduli space of curves of genusg 9.J 
coarser relation 135.C 
coarser topology 425.H 
cobordant 114.H 
foliated 154.H 
h- 114] 
mod2 114.H 
normally 114J 
cobordism, knot 235.G 
cobordism class 114.H 
oriented 114.H 
unoriented 114.H 
cobordism group 
complex 114.H 
oriented 114.H 
unoriented 114.H 
cobordism group of homotopy n-spheres, h-  114.] 
cobordism ring 114.H 
complex 114.H 
cobordism theorem, h- 114.F 
coboundary (coboundaries) 200.H 
(in a cochain complex) 201.H 
(in the theory of generalized analytic functions) 
164.H 
module of 200.F 
coboundary homomorphism (on cohomology 
groups) 201.L 
coboundary operator 200.F 
cobounded 201.P 
cochain(s) 200.H 201.H 
(products of) 201.K 
deformation 305.B 
finite (of a locally finite simplicial complex) 
201.P 
n- (for an associative algebra) 200.L 
separation 305.B 
cochain complex 200.F 201.H 
singular 201.H 
cochain equivalence 200.F 
cochain homotopy 200.F 
cochain mapping 200.F 201.H 
cochain subcomplex 200.F 
Cochran theorem 374.B 
cocommutative coalgebra 203.F 
cocycle(s) 200.H 201.H 
(in the theory of generalized analytic functions) 
164.H 
continuous 200.N 
difference 305.B 
module of 200.F 
obstruction 147.L 305.B 
separation 305.B 
vanishing (on an algebraic variety) 16.U 
Codazzi, equation of 365.C 
Codazzi-Mainardi equations 11 l.H, App. A, 
Table 4.I 
code(s) 63.A 213.D 


Subject Index 
Coefficient(s) 


BHC (Bose-Chaundhuri-Hocquenghem) 63.D 
block 63.A 213.F 
convolutional 63.E 
cyclic 63.D 
Goppa 63.E 
group 63.C 
Hamming 63.C 
linear 63.C 
perfect 63.B 
sliding block 213.E 
tree 213.E 
trellis 213.E 
code word 63.A 
codimension 
(of an algebraic subvariety) 16.A 
(of a C’-foliation) 154.B 
(of an element in a complex) 13.R 
(of the germ of a singularity 51.C 
(of a linear space) 256.F 
(of a PL embedding) 65.D 
codimension q, characteristic class of 154.G 
coding rate 213.D 
coding theorem 
block 213.D 
noiseless source 213.D 
source 213.D,E 
coding theory 63 
channel 213.A 
source 213.A 
codomain (of a mapping) 381.C 
co-echelon space 168.B 
coefficient(s) 
(of a linear representation) 362.E 
(of a system of algebraic equations) 10.A 
(of a term of a polynomial) 337.B 
of an affine connection 80.L 
autocorrelation 397.N 
binomial 330, App. A, Table 17.1I 
of bulk viscosity 205.C 
canonical correlation 280.E 374.C 
Clebsch-Gordan 258.A 353.B 
confidence 399.Q 
correlation (of two random variables) 342.C 
397.H 
of determination 397.HjJ 
differential 106.A 
of excess 341.H 396.C 
expansion 317.A 
Fourier 159.A 197.C 317.A, App. A, Table 11.1 
Fourier (of an almost periodic function) 18.B 
Gini, of concentration 397.D 
indeterminate, Lagrange method of 106.L 
Lagrange interpolation 223.A 
Legendre 393.B 
multipie correlation 397.J 
oforderp 110.4 
partial correlation 397,J 
partial differential 106.F 
population correlation 396.D 
Racah 353.B 
reflection 387.D 
regression 397.H,J 403.D 
ofa Riemannian connection 80.L 
sample correlation 396.D 
sample multiple correlation 397.J 
sample partial correlation 280.E 
of shear viscosity 205.C 
of skewness 341.H 
of thermal expansion 419.B 
torsion (of acomplex) 201.B 


Subject Index 
Coefficient field 


transmission 387.D 
transport 402.K 
universal, theorem 200.D,G,H 201.G,H 
of variation 397.C 
of viscosity 205.C 
Wigner 353.B 
coefficient field 
(of an affine space) 7.A 
(of a projective space) 343.C 
(ofa semilocal ring) 284.D 
coefficient group (of the cohomology theory) 201.Q 
coefficient module 200.L 
coefficient problem 438.C 
coefficient ring 
of a semilocal ring 284.D 
coefficient sheaf § 
Cech cohomology with 383.F 
cohomology group with 383.E 
coercive (boundary condition) 112.H 323.H 
cofactor (of a minor) 103.D 
cofibering 202.G 
cofinal 
(ordinal numbers) 312.C 
(subset of a directed set) 311.D 
cofinality 312.C 
cofinality, cardinality and 33.F 
cofinal object 52.D 
cofinal subnet 87.H 
cogenerator (of an Abelian category) 200.H 
cographic 66.H 
Cohen theorem (on Noetherian rings) 284.A 
coherence 397.N 421.E 
coherent algebraic sheaf 16.E 72.F 
coherent analytic sheaf 72.E 
coherently oriented (pseudomanifold) 65.B 
coherent @-module 16.E 
quasi- 16.E 
coherent sheaf of rings 16.E 
coherent vector 277.D 
Cohn-Vossen theorem 111.1 
cohomological dimension 
(of an associative algebra) 200.L 
(ofa scheme) 16.E 
(of a topological space) 117.F 
cohomological functor 200.1 ` 
cohomology 6.E 200.H 
equivariant 431.D 
exact sequence of 200.F 
Galois 200.N 
Gel'fand-Fuks 105.AA 
Ladic 450.Q 
non-Abelian ` 200.M 
Tate 200.N 
Weil 440.Q 
cohomology class 200.H 
canonical (in Galois cohomology in the class 
field theory) 59.H 
orientation 201.N 
cohomology exact sequence (for simplicial com- 
plexes) 201.L 
cohomology group(s) 201.H 
ô- 72.D 
Alexander 201.M 
Amitsur 200.P 
Cech (for topological spaces) 201.M,P 
Cech, with coefficient sheaf F — 383.F 
cellular 20!.H 
with coefficient sheaf ¥ — 383.E 
de Rham 201.H 
Dolbeault 72.D 
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generalized (for CW complexes) 201.H 
Hochschild 200.L 
integral (of a topological space) 201.H 
local 125.W 
rational 200.0 
reduced (of a topological space) 201.H 
relative (sheaf cohomology) 125.W 
relative Alexander 201.M 
relative Cech 201.M 
singular, with compact supports 201.P 
singular, of X with coefficients in G 201.H,R 
cohomology module 200.F 
cohomology operation(s) 64 
functional 202.$ 
primary 64.B 
stable 64.B 
stable primary 64.B 
stable secondary 64.C 
cohomology ring 
of a compact connected Lie Group App. A, 
Table 6.IV 
de Rham (of a differential manifold) 105.R 
de Rham (of a topological space) 201.1 
of an Eilenberg-Maclane complex App. A, 
Table 6.III 
singular (of a topological space) 201.1 
cohomology sequence, exact 200.H 
cohomology set 172.K 
cohomology spectral sequence  200.J 
cohomology theory (theories) 
Alexander-Kolmogorov-Spanier 201.M 
on the category of topological pairs 201.Q 
complete 200.N 
with E-coefficient, generalized 202.T 
generalized 201.Q 
cohomology vanishing theorems 194.G 
cohomotopy group 202.1 
coideal 203.F 
coimage 
(of an A-homomorphism) 277.E 
(of a homomorphism of presheaves of sheaves) 
383.D 
(ofa morphism) 52.N 
coincidence number (of mappings) 153.B 
coincidence point (of mappings) 153.B 
coindex (of C*-function) 279.E 
cokernel 
(of an A-homomorphism) 277.E 
(of a homomorphism of presheaves or sheaves) 
383.D 
(ofa morphism) 52.N 
collapsing 65.C 
elementary 65.C 
collectionwise Hausdorff space 425.AA 
collectionwise normal space 425.AA 
collective 342.A 
d 354E 
collective risk theory 214.C 
collinear points 343.B 
collinear vectors 442.A 
collineation(s) 343.D 
group of 343.D 
projective 343.D 
projective, in the wider sense 343.D 
in the wider sense 343.D 
collocation method 303.1 
colocal (coalgebra) 203.1 
color conjecture, four 186.] 
colored symmetric group 92.D 
coloring (of a graph) 186.1 
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coloring, Tait 157.C 
color lattice 92.D 
color point group 92.D 
colors, number of 93.D 
color symmetric group 92.D 
column(s) (of a matrix) 269.A 
iterated series by (of a double series) 379.E 
repeated series by (of a double series) 379.E 
column finite matrix 269.K 
column nullity (of a matrix) 269.D 
column vector 269.A 
comass (on k-forms) 275.G 
Combescure, correspondence of 111.F 
combination 
k- 330 
linear (of elements in a linear space) 256.C 
linear (of ovals) 89.D 
multiple App. A, Table 17.IT 
number of treatment 102.L 
combination theorem, Klein 234.D 
combinatorial analysis 66.A 
combinatorial equivalence 65.A 
combinatorially equivalent 65.A 
combinatorial manifold 65.C 
combinatorial mathematics 66.A 
combinatorial Pontryagin class 56.H 
combinatorial problems App. A, Table 17.II 
combinatorial property 65.A 
combinatorial sphere, group of oriented differentiable 
structures on 114.I 
combinatorial theory 66.A 
combinatorial topology 426 
combinatorial triangulation 65.C 
combinatorial triangulation problem 65.C 
combinatorics 66 
comb space 89.A 
commensurable 122.F 
common divisor (of elements of a ring) 67.H 
greatest 67.H 297.A 
common logarithm 131.C 
common multiple (of elements of a ring) 67.H 
least 67.H 297.A 
common notion 35.A 
communality 346.F 
commutant 308.C 
commutation relations 
canonical 351.C 377.A 
normal 150.D 
commutative algebra 203.F 
commutative diagram 52.C 
commutative field 368.B 
commutative group 2.A 190.A 
commutative law 
of addition (in a ring) 368.A 
on the addition of natural numbers 294.B 
in the algebra of sets 381.B 
on cardinal numbers 49.C 
for group composition 190.A 
inalattice 243.A 
for multiplication (in a commutative ring) 
368.A 
on the multiplication of natural numbers 
294.B 
commutative Lie group 249.D 
commutatively convergent series 379.C 
commutative multiplication (of a graded algebra) 
203.B 
homotopy- 203.D 
commutative ring 368.A 
category of 52.B 





Subject Index 
Compact set 


commutativity, event 346.G 
commutator 
(of differential operators) 324.C 
(of two elements ofa group) 190.H 
self- 251.K 
commutator group 190.H 
commutator subgroup 190.H 
commutor 368.E 
comonad 200.Q 
compact 
(continuous mapping) 286.D 
(kernel distribution) 125.L 
(linear operator) 68.B 
(topological space) 425.8 
linearly 422.L 
locally 425.V 
locally linearly 422.L 
relatively (linear operator) 331.B 
relatively (maximum likelihood method) 
399.M 
relatively (subset) 273.F 425.8 
real- 425.BB 
sequentially 425.8 
o 425.V 
T- 68.F 331.B 
uniformly locally 425.V 
weakly (linear operator) 68.M 
compact algebraic group, k- 13.G 
compact cardinal number 
strongly 33.E 
weakly 33.E 
compact complex manifolds, family of 72.G 
compact C’-manifold 105.D 
compact element (of a topological Abelian group) 
422.F 
compact foliation 154.H 
compact form (of a complex semisimple Lie algebra) 
248.P 
compact group 69.A 
compact homotopy class 286.D 
compactification 
(of a complex manifold) 72.K 
(of a Hausdorff space) 207.A 
(of a topological space) 425.T 
compactifying (kernel) 125.L 
Aleksandrov 207.C 
Bohr 18.H 
F-  207.C 
Kerékjart6-Stoilow 207.C 
Kuramochi 207.C 
Martin 207.C 
one-point (of a topological space) 425.T 
resolutive 207.B 
Royden 207.C 
Stone-Cech 207.C 425.T 
Wiener 207.C 
compact leaf 154.D 
compact metric space 273.F 
compactness theorem (in model theory) 276.E 
compact open C? topology 279.C 
compact-open topology 279.C 435.D 
compact operators 68 
compact real Lie algebra 248.P 
compact real simple Lie algebra 
classical 248.T 
exceptional 248.T 
compact set 425.8 
ina metric space 273.F 
relatively 399.M 425.8 
uniform convergence on 435.C 


Subject Index 
Compact simple Lie group 


compact simple Lie group 
classical 249.L 
exceptional 249.L 
compact space 425.8 
countably 425.8 
locally 425.V 
real- 425.BB 
sequentially 425.5 
o 425.V 
uniformly locally 425.V 
compact support (singular q-cochain) 201.P 
compact type (symmetric Riemannian homogeneous 
space) 412.D 
compactum, dyadic 79.D 
companion matrix 301.1 
comparability theorem for cardinal numbers 49.B 
comparison, paired 346.C 
comparison test (for convergence) 379.B 
comparison theorem (in the theory of differential 
equations) 316.E 
metric 178A 
triangle 178.A 
compass 179.A 
compatible 
with composition 409.C 
with C’-structure 114.B 
with the multiplication of a group 190.C 
with operation (of an operator domain) 277.C 
with operations in a linear space 256.F 
with topology 436.H 
with a triangulation (a C'-structure) 114.C 
compiler 75.C 
complement 
(of a decision problem) 71.B 
(in lattice theory) 42.A 243.E 
(in set theory) 381.B 
orthogonal (of a subset of a Hilbert space) 
197.E 
relative (of two sets) 381.B 
complementary analytic sei 22.A 
complementary degenerate series 437,W 
complementary degree 200.J 
complementary event 342.B 
complementary law of reciprocity 14.0 
complementary law of the Jacobi symbol 297.1 
complementary law of the Legendre symbol 
first 297.1 
second 297.1 
complementary modulus (in Jacobi elliptic func- 
tions} 134.J, App. A, Table 16.1 
complementary series 437.W 
complementary set 381.B 
complementary slackness, Tucker theorem on 
255.B 
complementary submodule 277.H 
complementary subspace (of a linear subspace) 
256.F 
complementation, law of (in a Boolean algebra) 
42.A 
complement conjecture, knot 235.B 
complemented (Banach space) 37.N 
complemented lattice 243.E 
complement theorem 382.B 
Chapman’s 382.B 
completable topological group 423.H 
complete 
(Abelian p-group) 2.D 
(algebraic variety) 16.D 
(increasing family of o-algebras) 407.B 
(logical system) 276.D 
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(metric space) 273.J 
(ordinary differential equation) 126.C 
(predicate) 356.H 
(recursively enumerable set) 356.D 
(set of closed formulas) 276.F 
(statistics) 396.E 
(system of axioms) 35.B 
(system of orthogonal functions) 317.A 
(topological group) 423.H 
(uniform space) 436.G 
(valuation) 439.D 
(vector lattice) 310.C 
(wave operator) 375.B,H 
(Zariski ring) 284.C 
B- (locally convex space) 424.X 
fully (locally convex space) 424.X 
holomorphically (domain) 21.F 
NP- 7LE 
at o (in the theory of deformation) 72.G 
quasi- (locally convex space) 424.F 
g- (vector lattice) 310.C 
weakly 1- , manifold 21.L 
complete accumulation point 425.0 
complete additive group 2.E 
complete additivity (of the Lebesgue integral) 
221.C 
complete additivity (of a measure) 270.D 
complete additivity theorem, Pettis 443.G 
complete analytic space, K- 23.F 
complete bipartite graph 186.C 
complete blocks 102.B 
complete capture 420.D 
complete class 398.B 
essentially 398.B 
minimal 398.B 
complete class theorems 398.D 
complete cohomology theory |. 200.N 
complete distributive law (in a lattice-ordered group) 
243.G 
complete elliptic integral App. A, Table 16.1 
of the first kind 134.B 
of the second kind 134.C 
complete form, theorem on 356.H 
complete free resolution (of Z) 200.N 
complete graph 186.C 
complete hyperbolic manifolds 21.0 
complete induction 294.B 
complete integrability condition 428.C 
complete intersection 16.A 
complete lattice 243.D 
conditionally 243.D 
o- 243.D 
complete linear system 
on an algebraic curve OC 
on an algebraic variety 16.N 
defined by a divisor 16.N 
complete local ring 284.D 
structure theorem of 284.D 
completely additive 
(arithmetic function) 295.B 
(measure) 270.D,E 
{vector measure) 443.G 
completely additive class 270.B 
completely additive set function 380.C 
completely continuous operator 68.B 
completely integrable 154.B 
completely integrable (system of independent 1- 
forms) 428.D 
completely integrally closed (ring) 67.1 
completely monotonic function 240.E,K 
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completely multiplicative number-theoretic function 
295.B 
completely nonunitary 251.M 
completely normal space | 425.Q 
completely passive 402.G 
completely positive 36.H 
completely positive entropy 136.E 
completely primary ring 368.H 
completely randomized design 102.A 
completely reducible A-module 277.H 
completely reducible group — 190.L 
completely reducible linear representation 362.C 
completely regular space 425.Q 
completely unstable flow 126.E 
complete manifold, weakly 1- 21.L 
complete mapping | 241.B 
complete maximum principle 338.M 
complete measure 270.D 
complete measure space 270.D 
complete metric space 273.J 
completeness 
(for a Cartan connection) 80.N 
(ofa logical system) 276.D 
(of the predicate calculus) 411.J 
asymptotic 150.D 
NP- 7LE 
theorem of (in geometry) 155.B 
completeness of real numbers 294.E 355.B 
completeness of scattering states 150.D 386.B 
completeness theorem, Gödel 411 
complete observation 405.C,D 
complete orthogonal system 217.G 
complete orthonormal set (Hilbert space)  197.C 
complete orthonormal system 217.G 
of fundamental functions 217.G 
complete pivoting 302.B 
complete predicate 356.H 
complete product measure space 270.H 
complete quadrangle 343.C 
complete reducibility theorem, Poincaré 3.C 
complete Reinhardt domain 21.B 
complete residue system modulo m 297.G 
complete ring (with respect to an ideal /) 16.X 
complete scheme, k- 16.E 
complete set 241.B 
complete o-field |.396.E 
boundedly 396.E 
complete solution (of partial differential equations) 
320.C 
complete space | 436.G 
Cech 425.T 436.1 
Dieudonné 436. 
holomorphically 23.F 
topologically 436.I 
complete system 
of axioms 35.B 
of independent linear partial differential equa- 
tions | 324.C 
of inhomogeneous partial differential equations 
428.C 
of nonlinear partial differential equations 
428.C 
complete valuation 439.D 
complete vector lattice 310.C 
g- 310.C 
complete Zariski ring 284.C 
completion 
a-adic (of an R-module) 284.B 
of a field (with respect to a valuation) 439.D 


Subject Index 
Complex (es) 


ofa measure space 270.D 

ofa metric space 273J 

uv 270. D 

of an ordered set 243.D 

of a ring along an ideal 16.X 

of Spec(A) along V(I) 16.X 

of a T,-topological group | 423.H 

ofa uniform space | 436.G 

ofa valuation 439.D 

of a valuation ring (of a valuation) 439.D 


complex(es) 70 


(in an Abelian category) 201.B 
(in buildings and BN pairs) 13.R 


(over an object of an Abelian category) 200.H 


ĉ- 72D 

abstract simplicial 70.C 

acychc 201.B 

Amitsur 200.P 

cell 70.D 

chain 200.C 

chain (in an Abelian category) 201.B 
chain, over C 201.C 

chain, over A. 201.G 

chamber 13.R 

closure finite cell 70.D 

cochain, over A 201.H 

cochain (in an Abelian category) 200.F,H 
cochain (of a simplicial complex) 201.H 
countable cell 70.D 

countable simplicial 70.C 

Coxeter 13.R 

CW 70.D 

de Rham 201.H 

de Rham (as an elliptic complex) 237.H 
Dolbeault 72.D 

double 200.H 

double chain 200.E 

dual 65.B 

Eilenberg-MacLane 70.F 

elliptic (on a compact C^-manifold) 237.H 
Euclidean 70.B 

Euclidean cell 70.B 

Euclidean simplicial 70.B 

finite cell 70.D 

finite simplicial 70.C 

geometric 70.B 

isomorphic simplicial 70.C 
isomorphic s.s. 70.E 

Kan 70.E 

linear (in projective geometry) 343.E 
linear line 343.E 

locally countable cell 70.D 

locally countable simplicial 70.C 
locally finite cell 70.D 

locally finite simplicial 70.C 
minimal 70.E 

multiple 200.H 

negative 200.H 

ordered (of a simplicial complex) 70.E 
ordered simplicial 70.C 

oriented simplicial chain 201.C 
Poincaré 114J 

positive 200.H 

positive chain 200.C 

Postnikov 70.G 

product 200.H 

product (of cell complexes) 70.D 
product double chain 200.E 
quotient 201.L 


Subject Index 
Complex algebraic variety 


quotient chain 200.C 
rectilinear 70.B 
regular cell 70.D 
relative chain 200.C 
relative cochain 200.F 
semisimplicial (s.s) 70.E 
simplicial 65.A 70.C 
singular (of a topological space) 70.E 
singular chain (of a topological space) 201.E 
singular cochain 201.H 
space form 365.L 
ss. 70.E 
s.s., realization of 70.E 
standard (of a Lie algebra) 200.0 
Thom 114.G 
Thom, associated with (G,n) 114.G 
Thom, fundamental cohomology class of the 
114.G 
complex algebraic variety 16.T 
complex analytic fiber bundle 147.0 
complex analytic function 198.H 
complex analytic manifold 72.A 
complex analytic structure (in a complex manifold) 
72A 
complex analytic submanifold 72.A 
complex cobordism group 114.H 
complex cobordism ring 114.H 
complex conjugate bundle 147.F 
complex conjugate representation 362.F 
complex dimension (of a complex manifold) 72.A 
complex form (on a Fourier series) 159.A 
complex form (of a real Lie algebra) 248.P 
complex function 165.B 
complex Gaussian 176.B 
complex Gaussian process 176.C 
complex Gaussian random variable 176.B 
complex Gaussian system 176.B 
complex Grassmann manifold 199.B 
complex group (over a field) 60.L 
complex Hermitian homogeneous space 199.A 
complex Hilbert space 197.B 
complexification 
Chevalley 249.U 
ofa real Lie algebra 248.P 
complex interpolation space 224.C 
complexity 
average 71.A 
of computation 71 
Kolmogorov-Chaitin 354.D 
space 71.A 
time 71.A 
worst-case 71.A 
complex Lie algebra 248.A 
ofa complex Lie group 249.M 
complex Lie group 249.A 
complex linear space 256.A 
complex line bundle  72.F 
determined by a divisor 72.F 
complex manifold(s) 72 
almost 72.B 
compact, family of 72.G 
isomorphic 72.A 
stably almost 114.H 
weakly almost 114.H 
complex multiplication 73 
complex number plane 74.C 
complex numbers 74.A 294.F 
conjugate 74.A 
complex of lines 110.B 
complex orthogonal group 60.1 
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complex orthogonal matrix 269.J 
complex plane 74.C 
complex projective space 343.E 
infinite-dimensional 56 
complex quadratic form 348.A,B 
complex representation (of a Lie group) 249.0 
complex simple Lie algebra 
classical 248.8 
exceptional 248.8 
complex simple Lie group 
classical 249.M 
exceptional 249.M 
complex space form 365.L 
complex special orthogonal group 60.1 
complex spectral measure 390.D 
complex spectral representation 390.E 
complex spectral resolution 390.E 
complex sphere 74.D 
complex spinor group 61.E 
complex Stiefel manifold 199.B 
complex structure 
(in a complex manifold) 72.A 
(pseudogroup structure) 105.Y 
(on R?) 3.H 
(on a Riemann surface) 367.A 
almost 72.B 
deformation of 72.G 
tensor field of almost (induced by a complex 
structure) 72.B 
complex symplectic group 60.L 
complex topological linear space 424.A 
complex torus 3.H 
complex-valued function 165.B 
complex variable 165.C 
theory of functions of 198.Q 
complex vector bundle 147.F 
component(s) 
(of a direct product set) 381.E 
(in graph theory) 186.F 
(of a matrix) 269.A 
(of a point in a projective space) 343.C 
(of a tensor of type (p,q)) 256.J 
(ofa vector) 256.A 442.A 
(of a vector field) 105.M 
arcwise connected 79.B 
basic (of an m-dimensional surface) 110.A 
connected 79.A 186.F 
of degree n (of a graded A-module) 200.B 
embedded primary (of an ideal) Git 
fixed (of a linear system) 16.N 
ghost (of an infinite-dimensional vector) 449.A 
horizontal (of a homogeneous space) 110.A 
horizontal (of a vector field)  80.C 
identity (of a topological group) 423.F 
irreducible (of an algebraic variety) 16.A 
irreducible (of an analytic space) 23.C 
irreducible (of a linear representation) 362.D 
isolated primary (of an ideal) 67.F 
ith (of an element relative to a basis) 256.C 
ith (of an n-tuple) 256.A 
nilpotent (of a linear mapping) 269.L 
orthogonal (of an element of a linear space) 
139.G 
path- 79.B 
primary (of an ideal) 67.F 
principal 280.F 
principal,analysis 280.F 
principal, of order p 110.A 
proper (of an intersection of subvarieties) 16.G 
Rech 154.B 
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relative (of a Lie transformation group) 

secondary (of a homogeneous space) 

semisimple 269.L 

simple (of a semisimple ring) 368.G 

strongly connected 186.F 

unipotent (of a linear mapping) 269.L 

variable (of a linear system) 15.C 16.N 

vertical (of a vector field) 80.C 
component model 403.F 
components-of-variance model 
composite 

of cohomology operations 64.B 

of correspondences 358.B 

of homotopy classes 202.B 

of mappings 381.C 

of morphisms 52.A 

of subsets 436.A 

of valuations 439.F 
composite designs, central 
composite field 149.D 
composite function 106.I 
composite hypothesis 400.A 
composite number 297.B 
composite particles 132.A 
composition 

(of knots) 235.A 

(of probability distributions) 341.E 


403.C 


102.M 


110.A 


110.A 


external law of (of a set of another set) 409.A 


internal law of (of a set) 409.A 
law of (ona set) 409.A 
secondary 202.R 
composition algebra 231.B 
composition factor 190.G 
composition factor series 190.G 
composition product (of functions) 
composition series 
(in a group) 
(in a lattice) 


192.H 


190.G 
243.F 


composition theorem (in class field theory) 59.C 


compound Poisson process 5.F 
comprehension, axiom of 33.B 381.G 
compressibility, isothermal 419.B 
compressible fluid 205.B 
compression, information 96.B 
computable (partial function) 31.B 
computation 

analog 19.A 

complexity of 71 

high-precision 138.B 
compute 31.B 71.B 
computers 75 

analog 19.E 

digital 75.B 

electronic 75.A 

electronic analog 19.E 

hybrid 19.E 
comultiplication 203.B,F 

Hopf 203.D 
concatenation (of paths) 170 
concave function 88.A 

strictly 88.A 
concave programming problem 292.A 
concentration 

area of 397.E 

asymptotic 399.L 

Gini coefficient of 397.E 

measure of 397.E 

spectral 331.F 
concentration function 

maximal 341.E 


Subject Index 


Condition(s) 
mean 341.E 
concept 


basic (of structure) 409.B 
global (in differential geometry) 109 
in the large (in differential geometry) 109 
local (in differential geometry) 109 
in the small (in differential geometry) 109 
conchoidal curve 93.H 
conchoid of Nicomedes 93.H 
concircularly flat space App. A, Table 4.II 
concordant 154.F 
concurrent 
(in nonstandard analysis) 293.B 
(in projective geometry) 343.B 
condensation of singularities, principle of 37.H 
condensation point 425.0 
condensation test, Cauchy 379.B 
condition(s) 
adjoint boundary 315.B 
ascending chain (for (normal) subgroups of a 
group) 190.F 
ascending chain (in an ordered set) 
Baire 425.N 


311.C 


boundary (for an ordinary differential equation) 


315.A 
boundary (for partial differential equations of 
elliptic type) 323.F 


boundary, operator with 112.F 

Cauchy (on the D-integral and the D(+)- 
integral) 100.E 

CFL (Courant-Friedrichs-Lewy) 

chain (in an ordered set) 311.C 

complete integrability 428.C 

consistency 341.] 

Denjoy-Carleman 168.B 

descending chain (for (normal) subgroups of a 
group) 190.F 

descending chain (in an ordered set) 

entropy 204.G 

of finite character (for functions) 34.C 

of finite character (for sets). 34.C 

finiteness, for integral extension 284.F 

Frobenius integrability 154.B 

Haar (on best approximation) 336.B 

Harnack (on the D-integral and the D(+)- 
integral) 100.E 

Holder, of order o SA A 

initial (for ordinary differential equations) 


304.F 


311.C 


316.A 
initial (for partial differential equations) 
Jacobi 46.C 
KMS 308.H 
Levi 325.H 
Lindeberg 250.B 
Lipschitz 84.A 163.D 286.B 316.D 
Lipschitz, of order 84.A 
Lorentz 130.A 
LSZ asymptotic 150.D 
Lyapunov 250.B 
maximal (in an ordered set) 311.C 
minimal (in an ordered set) 311.C 
no cycle 126J 
Palais-Smale 279.E 286.Q 
Poincaré (in the Dirichlet problem) 120.A 


restricted minimal (in a commutative ring) 
284.A 

of R. Schmidt (in (C, «)-summation) 379.M 

Sommerfeld radiation 188.D 

spectrum 150.D 

strip 320.D 


321.A 


Subject Index 
Conditional density 


strong transversality 126.J 
transversality 108.B 
of transversality (in the calculus of variations) 
46.B 
uniqueness (for solution of an ordinary differen- 
tial equation) 316.D 
von Neumann 304.F 
Whitney, (b) 418.G 
Whitney, (P) at a point 418.G 
conditional density 397.1 
conditional distribution 397.1 
conditional entropy 213.B 
conditional expectation (of a random variable) 
342.E 
conditional inequality 211.A 
conditionality, principle of 401.C 
conditionally complete lattice 243.D 
conditionally convergent 379.C,E 
conditionally o-complete lattice 243.D 
conditionally stable 394.D 
conditional mean (of a random variable) 342.E 
397.1 
conditional moments 397.J 
conditional probability 342.E 
regular 342E 
conditional probability distribution 342.E 
conditional problems in the calculus of variations 
46.A 
conditional relative extremum (of a function)  106.L 
conditional self-intersection 213.B 
conditional stability 394.D 
condition number 302.A 
conductivity 130.B 
conductor 
(of an Abelian extension) 14.Q 
(of a class field) 59.B 
(of Dirichlet L-functions) 450.C 
(ofan ideal group) 14.H 
(of a Gróssencharakter) 450.F 
(of Hecke L-functions) 450.E 
(of a nonprimitive character or a primitive 
character) 450.C 
(of a quadratic field) 347.G 
(ofa residue character) 295.D 
(of a subring of a principal order) 14.B 
p- (of norm-residue) 14.P 
conductor-ramification theorem (in class field the- 
ory) 59.C 
conductor with a group character 450.G 
cone 
(ofa PL embedding) 65.D 
(in a projective space) 343.E 
(of a simplicial complex) 70.C 
{over a space) 202.E 
asymptotic 350.B 
circular 78.A 111.1 
conjugate convex 89.F 
convex 89.F 
convex polyhedral $89.F 
dual convex 89.F 
extension (of a PL embedding) 65.D 
future 258.A 
light 258.A 
Mach 205.B 
mapping 202.E 
natural positive 308.K 
oblique circular 350.B 
past 258.A 
quadric 350.B 
reduced (of a topological space) 202.F 
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reduced mapping 202.F 
regular 384.A 
right circular 350.B 
self-dual regular 384.E 
side 258.A 
confidence coefficient 399.Q 
confidence interval 399.Q 
confidence level 399.Q 
confidence limits 399.Q 
confidence region 399.Q 
invariance of 399.Q 
unbiased 399.Q 
uniformly most powerful 399.Q 
uniformly most powerful unbiased 399.Q 


configuration 
central 420.B 
Pascal 78.K 


configuration space 126.L 402.G 
confluent differential equation 167.A 
confluent hypergeometric differential equation 
167.4, App. A, Table 14.II 19.1 
confluent type 
function of 167.A 
hypergeometric function of 167.A, App. A, 
Table 19.] 
confocal central conics, family of 78.H 
confocal parabolas, family of 78.H 
confocal quadrics, family of 350.E 
conformal 77.A 
almost 275.C 
conformal arc element 110.D 
conformal connection 80.P 
conformal correspondence (between surfaces) 111.1 
conformal curvature 110.D 
conformal curvature tensor, Weyl 80.P, App. A, 
Table A1 
conformal differential geometry 110.D 
conformal function, u- 352.B 
conformal geometry 76.A 
conformal invariant 77.E 
conformally equivalent 77.A 367.A 191.B 
conformally flat 191.B 
conformally flat space App. A, Table 4.II 
conformal mapping 198.A, App. A, Table 13 
generalized 246.I 
extremal quasi- 352.C 
quasi- 352 
conformal space 76.A 
conformal structure 191.B 
conformal structure (on a Riemann surface) 367.A 
conformal torsion 110.D 
conformal transformation 80.P 364.F 
confounded 
with blocks 102.J 
partially (with blocks) 102.J 
congruence 
(in geometry) 155.B 
(in number theory) 297.G 
linear (in projective geometry) 343.E 
of lines 110.B 
multiplicative 14.H 
congruence axiom (of geometry) 155.B 
congruence classes modulo m*, group of 14.H 
congruence subgroup (of a modular group) 122.D 
principal, of level N 122.D 
congruence zeta function 450.P 
congruent 
(figures) 139.C 
(segment) 155.B 
affinely 7.E 
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in the Erlangen program 137 
congruent modulo m 297.G 


congruent transformation(s) 139.B 
group of 285.C 

conic(s) 78.A 
central 78.C 
focal (of a quadric) 350.E 


pencil of 343.E 
conical function App. A, Table 18.II 
conical hypersurface, quadric 350.G 
conical surface 111.1 
quadric 350.B 
conic Lagrange manifold 274.C 345.B 
conic section(s) 78.A 
equation of 78.C 
canonical form of the equation of 78.C 
conjecture 
Adams (on J-homomorphisms) 237.1 
annulus (on combinatorial manifolds) 65.C 
Artin (on Artin L-functions) 450.G 
Bieberbach (on univalent functions) 438.C 
Birch-Swinnerton-Dyer (on L-functions of 
elliptic curves) 118.D 450.8 
Burnside (on finite groups) 151.D 
C, (on Kodaira dimension) 72.H 
entropy 126.K 
four color 186.1 
fundamental (in topology) 70.C 
generalized Poincaré 65.C 
general knot 235.B 
Hasse (on Hasse zeta function) 450.S 


Hodge (on cycles on algebraic varieties) 450.S 


Iwasawa main (on p-adic L-functions) 450.J 

knot complement 235.B 

Leopoldt (on p-adic L-functions) 450.J 

Mordell (on Diophantine equations) 118.E 

Poincaré (on a characterization of spheres) 
65.C 

property P- (on knot groups) 235.B 


Ramanujan (on automorphic functions) 32.D 


Ramanujan-Petersson (on Hecke operators) 
32.D 
Sato (on Hasse zeta functions) 450.S 
Schreier (on simple groups) 151.1 
Seifert (on vector fields) 126.K 154.D 
Smith (on knot theory) 235.E 
stability 126.J 
Taniyama-Weil (on L-functions of elliptic 
curves) 450.8 
Tate (on Hasse ¢-functions) 450.8 
unknotüng 235.E 
Vandiver (on the class number of cyclotomic 
fields) 14.L 
Weil (on congruence zeta functions) 450.Q 
conjugacy 
C'- 126.B 
topological 126.B 
conjugacy class (of an element of a group) 
conjugate 
(CG method) 302.D 
(diameter) 78.G 
(element) 149.J 
(point in a geodesic) 178.A 
(point in a projective space) 343.E 
(with respect to a quadric surface) 
(quaternion) 29.D 
(subset) 190.C 
C- 126.B 
harmonic (in projective geometry) 
Q- 126.H 


190.C 


350.C 


343.D 





Subject Index 
Connected sequences of functors 


topological 126.B 
conjugate axis (of a hyperbola) 78.C 
conjugate complex number 74.A 
conjugate convex cone 89.F 


conjugate differential (on Riemann surface) 367.H 


conjugate exponent 168.C 
conjugate field | 149.7 377.C 
conjugate Fourier integral 160.D 
conjugate function  159.E 160.D 
conjugate gradient (CG) method 302.D 
conjugate harmonic function 193.C 
conjugate hyperbola 78.E 
conjugate ideal (of a fractional ideal) 14.1 
conjugate operator 
(in Banach spaces) 37.D 
(of a differential operator) 125.F 
(of a linear operator) 251.D 


conjugate planes (with respect to a quadric surface} 


350.C 
conjugate point(s) 
(in the calculus of variations) 46.C 
(in a Riemannian manifold) 364.C 
conjugate pole 350.C 
conjugate Radon transform 218.F 
conjugate representation 362.F 
complex 362.F 
conjugate series (of a trigonometric series) 
conjugate space 
(of a linear topological space) 424.D 
(of a normed linear space) 37.D 


159.A 


conjugation mapping (of a Hopf algebra) 203.E 


conjugation operator 164.K 
conjunction (of propositions) 411.B 
connected 
(affine algebraic group) 13.A 
(design) 102.K 
(graded module) 
(graph) 186.F 
(topological space) 79.A 
(treatment) 102.B 
arcwise (space) 79.B 
K- 186.F 
locally (at a point) 79.B 
locally (space) 79.A 
locally arcwise (at a point) 79.B 
locally arcwise (space) 79.B 
locally n- (at a point) 79.C 
locally n- (space) 79.C 
locally œw- (space) 79.C 
multiply (plane domain or space) 333.A 
n- (pair of topological spaces) 202.L 
n- (space) 79.C 202.L 
n-ply (plane domain) 
w- (space) 79.C 
path- (space) 79.B 
simply (covering Lie group) 249.C 
simply (space) 79.C 170 
simply, group 13.N 
strongly (components) 186.F 
connected component 79.A 186.F 
arcwise- 79.B 
strongly 186.F 
connected Lie subgroup 249.D 
connectedness 79 186.F 
of real numbers 294.E 
connectedness theorem 
general, due to W. Fulton and J. Hansen 
Zariski 16.X 
connected part 150.D 
connected sequences of functors 200.I 


203.B 


333.A 


16.1 


Subject Index 
Connected set 


connected set 79.A 
connected space 79.A 
connected sum 
(of oriented compact C?-manifolds) 114.F 
(of 3-manifolds) 65.E 
connecting homomorphism 
in cohomology 200.F 
in homology 200.C 
on homology groups 201.C,L 
connecting morphism 200.H,I 
connection(s) 80 
affine 80.H 286.L 
affine, coefficients of 80.L 
canonical affine (on R”) 80.J 
Cartan 80.M 
conformal 80.P 
Euclidean 364.B 
Euclidean, manifold with 109 
flat 80.E 
Gauss-Manin (of a variety) 16.V 
Levi-Civita 364.B 
linear 80.H 
locally flat 80.E 
metric 80.K 
normal 365.C 
projective 80.0 
Riemannian 80.K 364.B 
Riemannian, coefficients of 80.L 
connection form  80.E 417.B 
connection formula 
for the solutions of a differential equation 
253.A 
connection of spin and statistics 132.A 150.D 
connection problem 253.A 
connective fiber space, n- 148.D 
connectives, propositional 411.E 
connectivity (of a space) 201.A 
conoid, right 111.1 
conoidal neighborhood 274.E 
conormal 323.F 
conormal bundle 274.E 
conormal sphere bundle 274.E 
co-NP 71.E 
conservation laws, even-oddness 150.D 
conservative (measurable transformation) 136.C 
conservative chain 260.A 
conservative process 261.B 
conserved axial-vector currents, partially 132.C 
consistency 
(condition in the multistep method) 303.E 
(of anestimator) 399.K 
(of a logical system) 276.D 
ofanalysis 156.E 
of the axiom of choice and the continuum 
hypothesis 33.D 
{c,}- 399.K 
relative 156.D 
consistency condition 341.1 
consistency proof 156.D 
of pure number theory 156.E 
consistent 
(finite difference scheme) 304.F 
(formal system) 411.1] 
(system of axioms) 35.B 
as. 399.K 
{c,}- 399.K 
Fisher 399.K 
wo- 156.E 
consistent and asymptotically normal (CAN) 
estimator 399.K 
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consistent estimator 399.K 
consistent kernel (in potential theory) 338.E 
consistent-mass scheme 304.D 
consistent test 400.K 
uniformly 400.K 
constant(s) 165.C 
arbitrary (in a general solution of a differential 
equation) 313.A 
Bloch 77.F 
Boltzmann 402.B 
dielectric 130.B 
empirical 19.F 
error 303.E 
Euler 174.A 
integral 216.C 
integration (in a general solution of a differential 
equation) 313.A 
isoperimetric 391.D 
Lagrange method of variation of 252.D 
Landau 77.F 
method of variation of 55.B 252.1 
phase (of a sine wave) 446 
Planck 115 
renormalization 150.D 
Robin 48.B 
schlicht Bloch 77.F 
structural (of a Lie algebra) 248.C 
universal (in the theory of conformal mapping) 
TILE 
constant breadth, curve of 89.C 
constant curvature 
space of 364.D, App. A, Table 4.]I 
surface of 111.1 
constant function 381.C 
constant inclination, curve of 111.F 
constant mapping 381.C 
constant pressure, specific heat at 419.B 
constant sheaf 383.D 
locally constructible 16.AA 
constant stratum, u- 418.E 
constant-sum game 173.A 
constant term 
of a formal power series 376.A 
ofa polynomial 337.B 
unfolding 51.D 
constant variational formula 163.E 
constant volume, specific heat at 419.B 
constant width, curve of 111.E 
constituent (of an analytic or coanalytic set) 22.C 
constraint 102.L 264.B 
capacity 281.D 
chance 408.B 
unilateral 440.A 
constraint qualification 
Guignard 292.B 
Slater 292.B 
constraint set (of a minimization problem) 292.A 
constructibility, axiom of (in axiomatic set theory) 
33.D 
constructible (set in axiomatic set theory)  33.D 
constructible sheaf 16.AA 
locally, constant 16.AA 
construction 
bar (of an Eilenberg-MacLane coraplex) 70.F 
geometric, problem 179.A 
GNS 308.D 
group measure space 136.F 
impossible, problem 179.A 
possible, problem 179.A 
W- (of an Eilenberg-MacLane complex) 70.F 
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construction problem (of class field tower) 59.F 
constructive field theory 150.F 
constructive method | 156.D 
constructive ordinal numbers 81.B 
consumer's risk 404.C 
contact, thermal 419.A 
contact element 428.E 
in a space with a Lie transformation group 
110.A 
contact form 110.E 
contact manifold 110.E 
contact metric structure 110.E 
contact network 282.B 
contact pair (in circle geometry)  76.C 
contact process 340.C 
contact structure 105.Y 
contact transformations 82, App. A, Table 15.IV 
quantized 274.F 
contain 381.A 
physically 351.K 
content (of a tolerance region) 399.R 
Jordan 270.G 
mean 399.R 
context-free grammar 31.D 
context-sensitive grammar 31.D 
contiguous 399.M 
contingency table 397.K 400.K 
continuable, analytically 198.1 
continuation 
analytic 198.G 
analytic, along a curve 198.1 
analytic, in the wider sense 198.0 
direct analytic 198.G 
harmonic 193.M 198.G 
continuation method 301.M 
continuation theorem 
Hartogs 21.F 
Remmert-Stein 23.B 
Riemann 21.F 
unique 323J 
continued fractions 83.A 
finite 83.A 
infinite 83.A 
mixed periodic 83.C 
normal 83.E 
pure periodic 83.C 
recurring 83.B 
simple 83.A 
continuity 
absolute, space of 390.E 
axioms of (in geometry) 155.B 
Dedekind axiom of (for real numbers) 355.A 
equation of (for a fluid) 205.A 
equation of (for electromagnetics) 130.A 204.B 
Hartogs theorem of 21.H 
interval of (for a probability distribution) 
341.C 
local 45.F 
modulus of (of a function) 84.A 
modulus of, of kth order (of a continuous 
function) 336.C 
properties of 85.A 
of real numbers 294.E 
uniform 45.F 
continuity (*), generalized absolute | 100.C 
in the restricted sense 100.C 
continuity principle 
for analytic functions of several complex vari- 
ables 21.H 
in potential theory 338.C 


Subject Index 
Continuous geometry 


quasi- (in potential theory) 338I 
continuity property for Cech theory 201.M 
continuity requirement, variational principles with 

relaxed 271.G 
continuity theorem 

Abel (for Dirichlet series) 339.B 

Abel (for power series) 121.D 

Lévy 341.F 
continuous 

(additive interval function) 380.B 

(flow) 136.D 

(function of ordinal numbers) 312.C 

(mapping) 84.A 425.G 

absolutely (function) 100.C 

absolutely (mapping in the plane) 246.H 

absolutely (measure) 270.L 

absolutely (set function) 380.C 

absolutely (vector measure) 443.G 

absolutely, in the restricted sense 100.C 

absolutely, in the sense of Tonelli 246.C 

absolutely, («) 100.C 

completely (operator) 68.D 

equi- 435.D 

equi-, semigroup of class (C?) 378.B 

generalized absolutely 100.C 

hypo- 424.Q 

left 84.B 

from the left 84.B 

in the mean 217.M 

in the mean (stochastic process) 407.A 

u-absolutely | 380.C 

with respect to the parameter (a distribution) 

125.H 

piecewise, function 84.B 

in probability 407.A 

right 84.B 

from the right 84.B 

separately (bilinear mapping) 424.Q 

strongly (function with values in a Banach 

space) 37.K 

uniformly 84.A 273.1 436.E 

uniformly, on a subset 436.G 

weakly (function with values in a Banach space) 

37.K 
continuous action (in topological dynamics) 126.B 
continuous additive interval function 380.B 
continuous analytic capacity 164.J 
continuous arc(s) 93.B 
continuous cocycle 200.N 
continuous distribution (probability theory) 341.D 
continuous dynamical system 126.B 
continuous flow 

(in ergodic theory) 136.D 

(on a topological space) 126.B 
continuous functions 84 

absolutely 100.C 

generalized absolutely 100.C 

lower semi- 84.C 

lower semi- (at a point) 84.C 

onametric space 84.C 

piecewise 84.B 

quasi- 338.1 

right 84.B 

semi- (at a point) 84.C 

uniformly (in a metric space) 84.A 

uppersemi- 84.C 

upper semi- (at a point) 84.C 
continuous geometry 85 

irreducible 85.A 

reducible 85.A 


Subject Index 
Continuous homomorphism 


continuous homomorphism (between topological 
groups) 423J 
open 423J 
continuous image 425.G 
continuously differentiable function, n-times 106.K 
continuous mapping 425.G 
space of 435.D 
strongly 437.A 
uniformly (of metric spaces) 273.1 
uniformly (of uniform spaces) 436.E 
continuous plane curve 93.B 
continuous representation 
strongly (of a topological group) 69.B 
weakly (of a topological group) 69.B 
continuous semiflow 126.B 
continuous semimartingale 406.B 
continuous spectrum 390.A 
absolutely 390.E 
of an integral equation 217.J 
continuous spin 258.C 
continuous state branching process 44.E 
continuous tensor product 377.D 
continuum 79.D 
cardinal number of 49.A 
indecomposable 79.D 
irreducible 79.D 
Peano 93.D 
continuum hypothesis 49.D 
consistency of the axiom of choice and 33.D 
generalized 49.D 
independence of the axiom of choice and 33.D 
contour(s) 
additivity of (in the curvilinear integral) 94.D 
of an integration 94.D 
contract, annuity 214.B 
contracted tensor 256.L 
contractible space 79.C 202.D 
locally 79.C 202.D 
locally, at a point 79.C 
contraction 
(ofa graph) 186.E 
(linear operator) 37.C 
(ofa mapping) 381.C 
(ofa matroid) 66.H 
(ofa tensor) 256.L 
sub- 186.E 
contraction principle 286.B 
contractive | 251.N 
purely 251.N 
purely, part 251.N 
contradiction 414.1 
contradictory formal system 411.1 
contragredient (of a linear mapping) 256.G 
contragredient representation 362.E 
contrast 
elementary 102.C 
normalized 102.C 
treatment 102.C 
contravariant functor 52.H 
contravariant index (of a component of a tensor) 
256 
contravariant of order r and covariant of order s 
108.D 
contravariant spinor 258.B 
contravariant tensor 
alternating 256.N 
of degree p 256.5 
symmetric 256.N 
contravariant tensor algebra 256.K 
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contravariant tensor field of orderr 105.0 
contravariant vector 256J 
contravariant vector field 105.0 
control 
admissible 405.A 
bang-bang 405.C 
feedback 405.C 
impulse 405.E 
inventory 227 
local 102.A 
optimal 46.D 86.B,C 405.A 
quality 404A 
stochastic 342.A 405 
time-optimal 86.F 
control chart 404.B 
controllability 86.C 
controlled stochastic differential equation 405.A 
controlled tubular neighborhood system 418.G 
control limit 
lower 404.F 
upper 404.B 
control problem, time optimal 86.F 
control space (in static model in catastrophe theory) 
51.B 
control theory 86 
control unit 75.B 
convention 
Einstein 256 
Einstein summation 417.B 
Maxwell 51.F 
perfect delay 51.F 
converge 
(filter) 87.1 
(infinite product) 379.G 
(in a metric space) 273.D 
(net) 87.H 
(sequence of lattices) 182.B 
(sequence of numbers) 87.B 355.B 
(series) 379.A 
(in a topological space) 87.E 
almost certainly 342.D 
almost everywhere 342.D 
almost surely 342.D 
in distribution 168.B 342.D 
in the mean of order p 342.D 
in the mean of power p 168.B 
in probability 342,D 
with probability 342.D 
strongly 37.B 
uniformly (in a uniform space) 435.A 
weakly (in a normed linear space) 37.E 
weakly (in a topological linear space) 424.H 
convergence 87 
(ofa filter) 87.I 
(ofanet) 87.H 
(of probability measures) 341.F 
(of truncation errors} 303.B 
abscissa of (of a Dirichlet series) 121.B 
abscissa of (of a Laplace transform) 240.B,H 
absolute, abscissa of (Dirichlet series} 121.B 
absolute, abscissa of (of a Laplace transform) 
240.B 
associated, radii 21.B 
asymptotic 168.B 
axis of 240.B 
circle of (of a power series) 339.A 
exponent of 429.B 
generalized 331.C 
norm resolvent 331.C 
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radius of (of a power series) 339.A 
relative uniform star 310.F 


simple, abscissa of (of a Dirichlet series) 121.B 


star 87.K 
strong (of operators) 251.C 
strong resolvent | 331.C 
uniform 435 
uniform (of a series) 435.A 
uniform (of operators) 251.C 
uniform, abscissa of (of a Dirichlet series) 
121.B 
uniform, abscissa of (of a Laplace transform) 
240.B 
uniform, on compact sets 435.C 
weak (of Operators) 251.C 
weak (of probability measures) 341.F 
weak (of a sequence of submodules} 200.J 
Weierstrass criterion for uniform | 435.A 
convergence criterion for positive series App. A, 
Table 10.1 
convergence domain (of a power series) 21.B 
convergence in measure 168,B 
convergence method 354.B 
convergence theorem 
on distributions 125.G 
Lebesgue 221.C 
of martingales 262.B 
convergent 
(continued fraction) 83.A 
(double series) 379.E 
(filtration) 200.J 
(infinite integral) 216.E 
(sequence) 87.B 355.B 
(series) 379.A 
absolutely (double series) 379.E 
absolutely (infinite product) 379.G 
absolutely (Laplace-Stieltjes integral) 240.B 
absolutely (power series} 21.B 
absolutely (series) 379.C 
absolutely (series in a Banach space) 443.D 
commutatively 379.C 
conditionally 379.C,E 
intermediate 83.B 
(ot, 87.L 
(o-star 87.L 
order (in a vector lattice) 310.C 
pointwise 435.B 
principal 83.B 
simply 435.B 
unconditionally 379.C 
uniformly (on a family of sets) 435.C 
uniformly (sequence, series, or infinite product) 
435.A 
uniformly, in the wider sense 435.C 
uniformly absolutely 435.A 
convergent power series 370.B 
convergent power series ring 370.B 
convergent sequence 355.B 
convex 
(function on a G-space) 178.H 
(function on a Riemannian manifold) 178.B 
(subset of a sphere) 274.E 
(subset of a sphere bundle) 274.E 
absolutely 424.E 
holomorphically, domain 21.H 
locally (linear topological space)  424.E 
logarithmically (domain) 21.B 
matrix (of order m) 212.C 
operator 212.C 
properly 274.E 


Subject Index 
Coordinate(s) 


uniformly (normed linear space) 37.G 
convex analysis 88 
convex body 89.A 
convex cell (in an affine space) 7.D 
convex closure (in an affine space) 7.D 
convex cone 
conjugate 89.F 
dual 89.F 
convex curve, closed 111.E 
convex functions 88.A 
proper 88.D 
strictly 88.A 
convex hull SOA 
(in an affine space) 7.D 
(of a boundary curve) 275.B 
(in linear programming) 255.D 
closed 424.H 
convexity theorem 
Lyapunov 443.G 
M. Riesz 88.C 
convex neighborhood 364.C 
convex polyhedral cone 89.F 
convex polyhedron 89.A 
convex programming 264.C 
convex programming problem 292.A 
convex rational polyhedral 16.Z 
convex set(s) 89 
absolutely (in a linear topological space) 424.E 
in an affine space 7.D 
P- (for a differential operator 112.C 
regularly 89.G 
strongly P-  112.C 
strongly separated 89.A 
convex surface, closed 111.1 
convolution 
(of arithmetic functions) 295.C 
(of distributions) 125.M 
(of functions) 159.A 192.H 
(of hyperfunctions) 125.X 
(of probability distributions) 341.E 
(in the theory of Hopf algebra) 203.H 
generalized (of distributions) 125.M 
convolutional code 63.E 
cooperative game  173.A,D 
coordinate(s) 90 
(of an element of a direct product of sets) 
381.E 
(in the real line) 355.E 
affine 7.C 
barycentric (in an affine space) 7.C 90.B 
barycentric (in a Euclidean simplicial complex) 
70.B 
barycentric (in the polyhedron of a simplicial 
complex) 70.C 
bipolar 90.C, App. A, Table 3.V 
bipolar cylindrical App. A, Table 3.V 
canonical (of a Lie group) 249.Q 
Cartesian (in an affine space) 7.C 
Chow (of a positive cycle) 16.8 
circular cylindrical App. A, Table 3.V 
curvilinear 90.C, App. A, Table 3.V 
cylindrical | 90.C, App. A, Table 3.V 
ellipsoidal 90.C 133.A, App. A, Table 3.V 
elliptic 90.C 350.E, App. A, Table 3.V 
elliptic cylindrical App. A, Table 3.V 
equilateral hyperbolic 90.C, App. A, Table 3.V 
generalized (in analytical dynamics) 271.F 
generalized cylindrical App. A, Table 3.V 
geodesic 80.J 
geodesic polar 90.C 


Subject Index 
Coordinate axis 


Grassmann (in a Grassmann manifold) 90.B 
homogeneous (of a point in a projective space) 


343.C 
hyperbolic cylindrical App. A, Table 3.V 
hyperplane (of a hyperplane in a projective 
space) 343.C 
inhomogeneous (of a point with respect to a 
frame) 343.C 
isothermal 90.C 
ith (of an element relative to a basis) 256.C 
Kleinline 90.B 
Kruskal 359.D 
line(ofaline) 343.C 
local (on an algebraic variety) 16.0 
local (on a topological manifold) 105.C 
local, transformation of 90.D 
moving App. A, Table 3.IV 
multiplanar 90.C 
multipolar 90.C 
(n + 2-hyperspherical 76.A 90.B 
normal 90.C 
oblique (in a Euclidean space) 90.B 
orthogonal curvilinear 90.C 
parabolic 90.C 
parabolic cylindrical App. A, Table 3.V 
parallel (in an affine space) 7.C 
pentaspherical 90.B 
plane (of a plane) 343.C 
Plücker (in a Grassmann manifold) 90.B 
polar 90.C, App. A, Table 3.V 
projective 343.C 
rectangular (in a Euclidean space) 90.B 
rectangular hyperbolic 90.C 
rotational App. A, Table 3.V 
rotational hyperbolic App. A, Table 3.V 
rotational parabolic App. A, Table 3.V 
spherical 90.C 133.D 
tangential polar 90.C 
tetracyclic 90.B 
trilinear 90.C 
tripolar 90.C 
coordinate axis 
of an affine frame 7.C 
ith (of a Euclidean space) 140 
coordinate bundle(s) 147.B 
equivalent 147.B 
coordinate curve (in a Euclidean space) 90.C 
coordinate function 
(ofa fiber bundle) 147.B 
(in the Ritz method) 304.B 
coordinate hyperplane (of an affine frame) 7.C 
coordinate hypersurface (in a Euclidean space) 
90.C 
coordinate neighborhood 
of class C" 105.D 
of a fiber bundle 147.B 
ofa manifold 105.C 
coordinate ring (of an affine variety) 16.A 
homogeneous 16.A 
coordinate system 90.A 
(of a line in a projective space) 343.C 
geodesic, in the weak sense 232.A 
holomorphic local 72.A 
isothermal curvilinear App. A, Table 3.V 
l-adic 3.E 
local (of a topological space) 90.D 105.C 
moving 90.B 
orthogonal, adapted toa flag 139.E 
orthogonal curvilinear App. A, Table 3.V 
projective 343.C 
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coordinate transformation (of a fiber bundle) 147.B 


(of a locally free 6,-Module) 16.E 
coplanar vectors 442.A 
coproduct 
of commutative algebras 29.A 
of an element in a graded coalgebra 203.B 
Hopf 203.D 
of two objects 52.E 
coradical 293.F 
CORDIC 142.A 
core 173.D 
coregular representation (of an algebra) 362.E 
corestriction (homomorphism of cohomology 
groups) 200.M 
Corioli force 271.D 
corner polyhedron 215.C 
Cornish-Fisher expansions 374.F 
Cornu spiral 93.H 167.D 
Corona problem 43.G 
Corona theorem 164] 
coroot 13J 
correcting, error- 63.A 
correcting capability, error- 63.B 
correctly posed 
(initial value problem) 321.E 
(problems for partial differential equations) 
322.A 
corrector (in a multistep method) 303.E 
Milne 303.E 
correlation 343.D 
involutive 343.D 
Kendall rank 371.K 
serial 397.N 
serial cross 397.N 
Spearman rank 371.K 
correlation coefficient 
(of two random variables) 342.C 397.H 
canonical 280.E 374.C 
multiple 397J 
partial 397 
population 396.D 
sample 396.D 
sample multiple 280.E 
sample partial 280.E 
serial 421.B 
correlation inequalities 212.A 
correlation matrix 397.J 
correlation ratios 397.L 
correlation tensor 433.C 
correlogram 397.N 
correspond 358.B 
correspondence 358.B 
algebraic (of an algebraic variety) 16.1 
algebraic (of a nonsingular curve) 9.H 
algebraic, group of classes of 9.H 
birational 16.1 
of Combescure 111.F 
conformal (between surfaces) 111.1 
geodesic (between surfaces) 111.1 
homothetic (between surfaces) 111.1 
inverse 358.B 
one-to-one 358.B 
similar (between surfaces) 111.1 
univalent 358.B 
correspondence principle 351.D 


correspondence ring (of a nonsingular curve) 9.H 


corresponding angles 139.D 

corresponding points (with respect to confocal 
quadrics) 350.E 

cos (cosine) 131.E 432.A 


1957 


cos! (DIE 
cosec (cosecant) 131E 432.A 
cosech (hyperbolic cosecant) 131.F 
cosemisimple 203.F 
coset 
double (of two subgroups of a group) 190.C 
left (of a subgroup of a group) 190.C 
right (of a subgroup of a group) 190.C 
coset space (of a topological group) 
left 423.E 
right 423.E 
cosh (hyperbolic cosine) 131.F 
cosigma functions 134.H, App. A, Table 16.IV 
cosine(s) 432.A 
first law of 432.A, App. A, Table 2.II 
hyperbolic 131.F 
integral 167.D 
law of (on spherical triangles) 432.B, App. A, 
Table 2.III 
optical direction 180.A 
second law of 432.A, App. A, Table 2.II 
cosine integral 167.D, App. A, Table 19.II 
cosine series, Fourier, App. A, Table 11.1 
cosine transform, Fourier 160.C, App. A, Table 
III 
cospecialization (in étale topology) 16.AA 
cospectral density 397.N 
cost 281.D 
imputed 255.B 
shadow 292.C 
unit 281.D 
cost of insurance 214.B 
cost of observation 398.F 
cot (cotangent) 131.E 
cotangent(s) 432.A 
hyperbolic 131.F 
law of App. A, Table 2.111 
cotangent bundle 147.F 
cotangential sphere bundle 274.E 
cotangent vector bundle 147.F 
Cotes formula, Newton- (in numerical integration) 
299.A 
coth (hyperbolic cotangent) 131.F 
cotree 186.G 
cotriple 200.Q 
counit 203.F 
counity (of a coalgebra) 203.B 
countability axioms 425.P 
countable additivity 270.B 
countable cell complex 70.D 
countable Lebesgue spectrum 136.E 
countable model (of axiomatic set theory) 156.E 
countable ordinal number 49.F 
countable set 49.A 
countable simplicial complex 70.C 
locally 70.C 
countably additive class 270.B 
countably compact space 425.8 
countably equivalent sets 136.C 
countably Hilbertian space 424.W 
countably infinite set A0. A 
countably normed space 424.W 
countably paracompact space 425.Y 
countably productive property 425.Y 
counting constants, principle of 16.8 
counting function (of a meromorphic function) 
272.B 
Courant-Cheng domain theorem 391.H 
Cousin problem 21.K 
first 21.K 
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second 21.K 
covariance (of two random variables) 342.C 397.H 
population 396.D 
sample 396.D 
covariance distribution 395.C 
covariance function 395.A,B 
sample 395.G 
covariance matrix 341.B 397.J 
asymptotic 399.K 
variance 341.B 397.J 
covariant 226.D 
absolute 226.D 
absolute multiple 226.E 
with ground forms 226.D 
multiple 226.E 
of an n-ary form of degree d. 226.D 
relativistically 150.D 
covariant derivative 
(of a tensor field) 80.1 417.B, App. A, Table 4.1] 
(of a vector field) 80.1 
(of a vector field along a curve) 80.I 
van der Waerden- Bortolotti 417.E 
covariant differential 
(of a differential form) 80.G 
(of a tensor field) 80.LL 417.B 
(ofa vector field) 80.1 
covariant functor 52.H 
covariant index (of a component of a tensor) 256.J 
covariant spinor 258.B 
covariant tensor 
alternating 256.N 
ofdegreeq 256.J 
symmetric 256.N 
covariant tensor field of orders 105.0 
covariant vector 256.J 
covariant vector field 105.0 
covector, p- 256.0 
cover (a set) 381.D 
covering(s) 
(covering space) 91.A 
(curve) 9.1 
(ofa set) 381.D 425.R 
closed (of a set) 425.R 
closure-preserving 425.X 
countable (of a set) 425.R 
degree of (of a nonsingular curve) 9.I 
discrete (of a set)  425.R 
£- (of a metric space) 273.B 
finite (of a set) 425.R 
locally finite (of a set) 425.R 
mesh of (in a metric space) 273.B 
n-fold (space) 91.A 
normal (of a set) 425.R 
open (ofa set) 425.R 
point-finite (of a set) 425.R 
regular (space) 91.A 
o-discrete (of a set) 425.R 
o-locally finite (of a set) 425.R 
star-finite (of a set) 425.R 
unramified (of a nonsingular curve) 9.1 
covering curve 9I 
covering differentiable manifold 91.A 
covering dimension (of a normal space) 117.B 
covering family (in Grothendieck topology) 16.AA 
covering group (of a topological group) 91.A 423.0 
universal (of a topological group) 91.B 423.0 
covering homotopy property 148.B 
covering law 381.D 425.L 
covering Lie group, simply connected (of a Lie 
algebra) 249.C 


Subject Index 
Covering linkage invariant(s) 


covering linkage invariant(s) 235.E 
covering manifold 91.A 
covering mapping (map) 91.A 
covering space(s) 91 
analytic 23.E 
C- 23E 
in the sense of Cartan 23.E 
ramified 23.B 
universal 91.B 
covering surface(s) 367.B 
Ahlfors theory of 367.B 
with relative boundary 367.B 
unbounded 267.B 
universal 367.B 
unramified | 367.B 
covering system, uniform 436.D 
covering theorem, Vitali 380.D 
covering transformation 91.A 
of an unramified unbounded covering surface 
367.B 
covering transformation group 91.A 
Coxeter complex 13.R 
Coxeter diagram (of a complex semisimple Lie 
algebra) 248.8 
Coxeter group 13.R 
CPM 307.C 376 
Cramer-Castillon problem (in geometric construc- 
tion) 179.A 
Cramér-Rao inequality 399.D 
Cramer rule 269.M 
Cramer theorem 399.M 
crash duration 281.D 
creation operator 377.A 
Cremona transformation 16.I 
CR-equivalence 344.A 
criterion 
Cartan, of semisimplicity (of Lie algebras) 
24&.F 
Cartan, of solvability (of Lie algebras) 248.F 
Castelnuovo 15.E 
Cauchy 87.C, App. A, Table 10.II 
convergence, for positive series App. A, Table 
10.11 
d'Alembert App. A, Table 10.II 
Euler 297.H 
Gauss App. A, Table 10.11 
Jacobian (on regularity of local rings) 370.B 
Kummer 145, App. A, Table 10.11 
logarithmic App. A, Table 10.IT 
Nakai-Moishezon (of ampleness) 16.E 
Nyquist's 86.A 
Raabe App. A, Table 10.1] 
of ruled surfaces 15.E 
Schlomilch App. A, Table (OI 
simplex 255.D 
Weierstrass, for uniform convergence 435.A 
Weyl 182.H 
criterion function 127.A 
critical (Galton-Watson process) 44.B 
critical determinant 182.B 
critical exponent !11.C 
critical inclination problem 55.C 
critical lattice in M with respect to S 182.B 
critical manifold, nondegenerate 279.D,E 
critical path 376 
critical percolation probability 340.D 
critical point 
(of a C* -function on a manifold) 279.B 
(ofa C*-mapping o: M > M') 105J 
(ofaflow) 126.D 
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(of a function on RI) 106.L 
(ofa harmonic function) 193 
(of a mapping u: R" > R”) 208.B 
degenerate 106.L 279.B 
nondegenerate 106.L 279.B 286.N 
critical region 400.A 
critical value 
(in bifurcation theory) 286.R 
(of a C*-function on a manifold) 279.B 
(of a C*-mapping o: M > M') 105J 
(of a contact process) 340.C 
(of an external magnetic field) 340.B 
(of a mapping u: R” Rn 208.B 
Crofton formula (in integral geometry) 218.B 
cross cap (a surface) 410.B 
crosscut(s) (of a plane domain) 333.A 
fundamental sequence of (in a simply connected 
domain) 333.B 
crossed homomorphism (of an associative algebra) 
200.L 
crossed product 
(in C*-algebra theory) 36.I 
(of a commutative ring and a group) 29.D 
(in von Neumann algebra theory) 308.J 
crossings 
normal 16.L 
only normal 16.L 
crossing symmetry 132.C 386.B 
cross norm, C*- 36.H 
cross product 
(of cohomology groups) 201.J 
(of homology groups) 201J 
(of vector bundles) 237.C 
cross ratio 343.D 
cross section 
(of a fiber bundle) 147.L 286.H 
(ofa fiber space) 148.D 
(of a flow) 126.C 
absorption 375.A 
differential 375.A 386.B 
local (in a topological group) 147.E 
scattering 375.A 
total 386.B 
total elastic 386.B 
cross-sectional data 128.A 397.A 
cross spectral density function 421.E 
Crout method 302.B 
CR structure 344.A 
crystal class 92.B 
arithmetic 92.B 
geometric 92.B 
3-dimensional App. B, Table 5.1V 
crystal family 92.B 
crystallographic group 92 
crystallographic restriction 92.A 
crystallographic space group 92.A 
crystal system 92.B 
cube 357.B 
duplication of 179.A 
Hilbert 382.B 
unit 139.F 140 
unit n- 140 
cubic equation 10.D, App. A, Table 1 
cubic map 157.B 
cubic P 92.E 
cubic resolvent App. A, Table 1 
cumulant 397.G 
factorial 397.G 
joint 397.1 
cumulative distribution 397.B 
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cumulative distribution curve 397.B 
cumulative distribution function 341.B 342.C 
cumulative distribution polygon 397.B 
cup product 
(of cohomology classes) 201.1 
(of derived functors) 200.K 
(in K-theory) 237.C 
cup product reduction theorem (on cohomology or 
homology groups) 200.M 
curl (of a differentiable vector field) 442.D 
current 125.R 
4-, density 150.B 
integral 275.G 
partially conserved axial-vector 132.C 
random 395.1 
current algebra 132.C 
Curtis formulas, Clenshaw- |. 299.A 
curvature 
(ofa curve of class C") 111.D 
(ofa plane curve) 111.E 
absolute (of a curve) 111.C 
affine 110.C 
circle of 111.E 
conformal 110.D 
constant, space of 364.D, App. A, Table 4.II 
constant, surface of 111.I 
Gaussian (of a surface) 111.H, App. A, Table 
4.] 
geodesic 111.H, App. A, Table Al 


S. Germain (of a surface) 111.H, App. A, Table 


4] 
holomorphic sectional 364.D 
integral (of a surface) 111.H 
line of (on a surface) 111.H 
Lipschitz-Killing 279.C 
mean 364.D 
mean (ofa surface) 111.H 365.D, App. A, 
Table 4.I 
mean, vector 365.D 
minimum, property 223.F 
negative 178.H 
nonpositive, G-space with 178.H 
normal (of a surface) 111.H 
principal (of a surface) 111.H 365.C 
radius of (of a plane curve) 111.E 
radius of (of a space curve) 111.F 
radius of principal (of a surface) 111.H 
Ricci 364.D 
Riemannian 364.D 
scalar 364.D, App. A, Table AU 
sectional 364.D 
total (of an immersion) 365.0 
total (of a surface) 111.F,H, App. A, Table Al 
total Gaussian (of a surface) 111.H 
total mean 365.0 
curvature form | 80.G 364.D 
curvature tensor 
(of an affine connection) 80.L,L 417.B 
(of a Fréchet manifold) 286.L 
(of a Riemannian manifold) 364.D 
projective App. A, Table 4.IT 
Weyl conformal | 80.P, App. A, Table 4.II 
curve(s) 93111.A 
algebraic 9.A 
analytic (in an analytic manifold) 93.B 
analytic (in a Euclidean plane) 93.B 
asymptotic 110.B 111.H 
Bertrand 111.F 
bicharacteristic 325.A 
characteristic (network flow problem) 281.B 
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characteristic (of a 1-parameter family of 
surfaces) 111.1 

characteristic (of a partial differential equation) 
320.B 324.A,B 

cissoidal 93.H 

of class CH (in a differentiable manifold) 93.B 

of class C* (in a Euclidean plane) 93.B 

closed convex 111.E 

of constant breadth 89.C 

of constant inclination 111.F 

of constant width  111.E 

continuous plane 93.B 

coordinate (in a Euclidean space) 90.C 

covering 9I 

Darboux 110.B 

Delaunay 93.H 

dual (of a plane algebraic curve) 9.B 

elliptic 9.C 

exceptional 15.G 

exponential 93.H 

of the first kind 15.G 

Fréchet 246.A 

fundamental (with respect to a birational 
mapping) 16.1 

fundamental theorem of the theory of 111.D 

general 93.D 

generating 111.1 

hyperelliptic 9.D 

influence 371.1 

integral (of a Monge equation) 324.F 

integral (of ordinary differential equations) 
316.A 

Jordan 93.B 

logarithmic 93.H 

Lorentz 397.F 

Mannheim 111.F 

meromorphic 272.L 

nodal 391.H 

OC- 404.C 

ordinary 93.C 

Peano 93J 

pedal 93.H 

plane App. A, Table 4.I 

plane algebraic OP 

of pursuit 93.H 

rational 9.C 

rational (in a Euclidean plane) 93.H 

rectifiable 93.F 

rolling (of a roulette) 93.H 

of the second clas 78.K 

of the second order 78.1 

simple closed 93.B 

sine 93.H 

solution (of ordinary differential equations) 
316.A 

stable 9.K 

stationary 46.B 

stationary (of the Euler-Lagrange differential 
equations) 324.E 

of steepest descent 46.A 

timelike 324.A 

tooth 181.E 

in a topological space 93.B 

transcendental 93.H 

u- 111.H 

unicursal 9.C 93.H 

unicursal ordinary 93.C 

universal 93.E 

v- 111.H 

variation 17£ A 


Subject Index 
Curve fitting 


curve fitting 19.F 
curve tracing 93.G 
curvilinear cluster set 62.C 
curvilinear coordinates 90.C, App. A, Table 3.V 
orthogonal 90.C 
planar App. A, Table 3.V 
in 3-dimensional space App. A, Table 3.V 
curvilinear coordinate system 
isothermal App. A, Table 3.V 
orthogonal App. A, Table 3.V 
curvilinear integrals 94.A 
with respect to a line element 94.D 
with respect to a variable 94.D 
cushioned refinement 425.X 
cusp 
of a curve 93.G 
of a Fuchsian group 122.C 
parabolic (of a Fuchsian group) 122.C 
of a plane algebraic curve 9.B 
cuspform 450.0 
in the case of one variable 32.B 
in Siegel half-space 32.F 
cuspidal parabolic subgroup 437.X 
cusp singularity 418.C 
cut 
(in a projective space) 343.B 
(of Q) 294.E 
(of R) 355.A 
disjunctive 215.C 
Gomory 215.B 
subadditive 215.C 
cut locus 178.A 
cutoff 150.C 
cut point (on a geodesic) 178.A 
cutset (ina graph) 186.G 
cutset matrix (of a graph), fundamental 186.G 
cutting (P^ by P”) 343.B 
cutting plane 215.B 
fractional, algorithm  215.B 
CW complex 70.D 
CW decomposition 70.D 
CW pair 201.L 
cybernetics 95 
cycle 
(on an algebraic variety) 16.M 
(of basic sets) 126.J 
(of a chain complex) 200.H 
(=cyclic permutation) 151.G 
(of time series data) 397.N 
algebraic 450.Q 
algebraically equivalent 16.R 
dividing (on an open Riemann surface) 367.1 
foliation 154.H 
fundamental (of an oriented pseudomanifold) 
65.A 
fundamental (in a resolution of a singular point) 
418.C 
limit 126. 
module of 200.C 
no, condition 126J 
numerically equivalent 16.Q 
one 16.R 
positive (on an algebraic variety) 16.M 
rationally equivalent 16.R 
Schubert 56.E 
vanishing 418.F 
zero 16.R 
cycle index 66.E 
cyclic algebra 29.G 
cyclic code 63.D 
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cyclic determinant 103.G 
cyclic element 251.K 
cyclic equation 172.G 
cyclic extension 172.B 
cyclic group 190.C 
cyclic Jacobi method 298.B 
cyclic part (of an ergodic class} 260.B 
cyclic representation (Banach algebra) 36.E 
cyclic representation (topological groups) 437.A 
cyclic subgroup (of a group) 190.C 
cyclic vector (of a representation space of a unitary 
representation) 437.A 
cyclide 90.B 
cyclide of Dupin 111.H 
cycloid 93.H 
cyclomatic number 186.G 
cyclotomic field 14.L 
cyclotomic polynomial 14.L 
cyclotomic Z,-extension 14.L 
cyclotomy 296.A 
cylinder 
circular 111.1 350.B 
elliptic 350.B 
hyperbolic 350.B 
mapping 202.E 
parabolic 350.B 
cylinder function 
elliptic 268.B 
parabolic 167.C 
cylinder set 270.H 
n- 270.G 
cylindrical coordinates 90.C, App. A, Table 3.V 
bipolar App. A, Table 3.V 
circular App. A, Table 3.V 
elliptic App. A, Table 3.V 
generalized App. A, Table 3.V 
hyperbolic App. A, Table 3.V 
parabolic 167.C, App. A, Table 3.V 
cylindrical equation, parabolic App. A, Table 14.II 
cylindrical functions 39.B, App. A, Table 19.III 
cylindrical hypersurface, quadric 350.G 
cylindrical surface 111.1 
circular 350.B 
elliptic 350.B 
hyperbolic 350.B 
parabolic 350.B 
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ô —delta 
QQ) 125.B 168.B 
P(Q) 125.B 
Z7, 168.B 
Zu Bu, 125.U 
Z, (9) (the totality of functions f(x) in C^(Q) such 
that for all a, D*f(x) belongs to L,(Q) with 
respect to Lebesgue measure) 168.B 
ó-measure 270.D 
A-refinement (of a covering) 425.R 
A}-set 22.D 
Ó-functor 200.1 
universal 200.1 
o*-functor 200.1 
é-complex 72.D 
6-cohomology groups 72.D 
d-continuous channels 213.F 
d-dimensional analytic set, purely 23.B 
d-trial path dependent 346.G 
d"-cohomology group 72.D 
D-sufficient a-field 396.J 
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D-wave 315.E 
D-integrable function 100.D 
D-integral 
definite 100.D 
indefinite 100.D 
D(*)integral 100.D 
d'Alembert criterion App. A, Table 10.II 
d'Alembertian 130.A 
d'Alembert method of reduction of order 252.F 
d'Alembert paradox 205.C 
d'Alembert solution 325.D 
damped oscillation 318.B 
damping ratio (of a damped oscillation) 318.B 
Daniell-Stone integrable function 310.1 
Daniell-Stone integral 210) 
Danilevskii method 298.D 
Darboux curve 110.B 
Darboux formula, Christoffel- 317.D 
Darboux frame 110.B 
Darboux quadric 110.B 
Darboux sum 216.A 
Darboux tangent 110.B 
Darboux theorem 216.A 428.A 
Darmois theorem, Skitovich- 374.H 
data 96.B 
cross-sectional 128.A 
macroeconomic 128.A 
microeconomic 128.A 
scattering 287.C 387.C 
data analysis, statistical 397.A 
data base 96.B 
data processing 96 
data structures 96.B 
Davidenko’s method of differentiation with respect 
toa parameter 301.M 
death insurance 214.B 
death process 260.G 
birth and 260.G 
death rate, infinitesimal 260.G 
Debye asymptotic representation 39.D, App. A, 
Table 19.111 
decidable number-theoretic predicate 356.C 
decision 127.A 
decision function(s) 398.A 
invariant 398.E 
minimax 398.B 
sequential 398.F 
space of 208 A 
statistical 208 A 
decision problem  71.B 97 186.J 
n- 398.A 
sequential 398.F 
statistical 398.A 
decision procedure, statistical 398.A 
decision process, Markov 127.E 
decision rule 
sequential 398.F 
terminal 398.F 
decision space 398.A 
decision theoretically sufficient o-field 396.J 
decoder 213.D 
decoding 63.A 
decomposable operator (on a Hilbert space) 308.G 
decompose (a polygon) 155.F 
decomposed into the direct sum of irreducible 
representations 437.G 
decomposition 
(ofa set) 381.D 
Bruhat (of an algebraic group) 13.K 
canonical (of a closed operator) 251.E 
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Decreasing real analytic function 


cellular (of a Hausdorff space) 70.D 
Chevalley (on algebraic groups) 13.1 
cluster, Hamiltonian 375.F 
CW 70D 
de Rham (of a Riemannian manifold) 364.E 
direct (of a group) 190.L 
Doob-Meyer 262.C 
D-optimality 102.E 
dual direct product (of a decomposition of a 
compact or discrete Abelian group) 422.H 
ergodic (of a Lebesgue measure space) 136.H 
Fefferman-Stein 168.B 
formula of Radon 125.CC 
Heegurard 65.C 
Iwasawa (of a connected semisimple Lie group) 
249.T 
Iwasawa (of a real semisimple Lie algebra) 
248.V 
Jordan (of an additive set function) 380.C 
Jordan (of a function of bounded variation) 
166.B 
Jordan (of a linear mapping) 269.L 
Jordan (in an ordered linear space) 310.B 
Khinchin 395.B 
Lebesgue, theorem 270.L 
Levi (on algebraic groups) 13.Q 
Levi (on Lie algebras) 248.F 
multiplicative Jordan (of a linear transforma- 
tion) 269.L 
Peirce (of a Jordan algebra) 231.B 
Peirce left (in a unitary ring) 368.F 
Peirce right (in a unitary ring) 368.F 
plane wave 125.CC 
polar 251.E 
relative Bruhat 13.Q 
Riesz (in Markov process) 260.D 
Riesz (in martingale) 262.C 
Riesz (of a superharmonic or subharmonic 
function) 193.S 
semimartigale 406.B 
simplicial (of a topological space) 79.C 
singular value (SVD) 302.E 
spectral 126J 
Wiener-It6 176.1 
Witt (of a quadratic form) 348.F 
Wold 395.D 
Zariski 15.D 
decomposition-equal polygons 155.F 
decomposition field (of a prime ideal) 14.K 
decomposition group (of a prime ideal) 14.K 
decomposition number (of a finite group) 362.I 
generalized (of a finite group) 362.1 
decomposition theorem 
canonical] 86.C 
in class field theory 59.C 
for dimension 117.C 
Lebesgue (on a completely additive set function) 
380.C 
unique (for a 3-manifold) 65.E 
decreasing, monotone 380.B 
decreasing C?-function, rapidly 168.B 
decreasing distribution, rapidly 125.0 
decreasing Fourier hyperfunction, exponentially 
125.BB 
decreasing function 
monotone 166.A 
strictly 166.A 
strictly monotone 166.A 
decreasing real analytic function, exponentially 
125.BB 


Subject Index 
Decreasing sequence, monotonically 


decreasing sequence, monotonically (of real num- 
bers) 87.B 
rapidly 168.B 
decrement, logarithmic (of a damped oscillation) 
318.B 
Dedekind, J. W. R. 98 
Dedekind, test of du Bois-Reymond and 379.D 
Dedekind axiom of continuity (for real numbers) 
355.A 
Dedekind discriminant theorem 14.J 
Dedekind eta function 328.A 
Dedekind principle (in a modular lattice) 243.F 
Dedekind sum 328.A 
reciprocity law for 328.A 
Dedekind theory of real numbers 294.E 
Dedekind zeta function 14.C 450.D 
deep water wave 205.F 
defect 
(of a block of representations) 362.1 
(of a conjugate class in a group) 362.1 
(of a meromorphic function) 272.E 
defect group 
of a block of representations 362.1 
(of a conjugate class in a group) 362.I 
deficiency 
(of an algebroidal function) 17.C 
(of a closed operator) 251.D 
(of a linear system on a surface) 15.C 
maximal (of an algebraic surface) 15.E 
deficiency index 
(of a closed symmetric operator) 251. 
(of a differential operator) 112.I 
deficient number (in elementary theory of numbers) 
297.D 
defined along V' (for a rational mapping) 16.I 
defined over K' (for an algebraic variety) 16.A 
define recursively 356.C 
defining functions (of a hyperfunction) 125.V 
standard 125.Z 
defining ideal (of a formal spectrum) 16.X 
defining module (of a linear system) 16.N 
defining relations (among the generators of a group) 
161.A 
definite 
negative (function) 394.C 
negative (Hermitian form) 348.F 
negative (quadratic form) 348.B 
positive (function) 36.L 192.B,J 394.C 437.B 
positive (Hermitian form) 348.F 
positive (kernel) 217.H 
positive (matrix) 269.5 
positive (potential) 338.D 
positive (quadratic form) 348.B 
positive (sequence) 192.B 
semi- (Hermitian form) 348.F 
semi- (kernel) 217.H 
totally (quaternion algebra) 27.D 
definite D-integral 100.D 
definite integral 216.C, App. A, Table 9.V 
(of a hyperfunction) 125.X 
definite quadratic form ` 248. C 
definition 
field of 16.A 
first (of algebraic K-group) 237J 
second (of algebraic K-group) 237.J 
truth 185.D 
definition by mathematical induction 294.B 
definition by transfinite induction 311.C 
deflation 
in homological algebra 200.M 





1962 


method for an eigenvalue problem 298.C 
deformation 
(of complex structures) 72.G 
(ofa graph) 186.E 
infinitesimal, to the direction 0/ós |. 72.G 
isomonodromic 253.E 
isospectral 387.C 
projective (between surfaces) 110.B 
of a scheme over a connected scherne 16.W 
of a surface 110.A 
deformation cochain 305.B 
deformation retract 202.D 
neighborhood 202.D 
strong 202.D 
degeneracy (of energy eigenvalues) 35].H 
set of (of a holomorphic mapping between 
analytic spaces) 23.C 
degeneracy index 17.C 
degeneracy operator (in a semisimplicial complex) 
70.E 
degenerate 
(critical point) 106.L 279.B 
(eigenvalue) 390.A,B 
(mapping) 208.B 
(quadratic surface) 350.B 
(simplex) 70.E 
totally 234.B 
degenerate kernel 217.F 
degenerate module 118.D 
degenerate series 
(of unitary representations of a complex semi- 
simple Lie group) 437.W 
complementary (of unitary representations of a 
complex semisimple Lie group) 437.W 
degree 
(of an algebraic element) 149.F 
(of an algebraic equation) 10.A 
(of an algebraic variety) 16.G 
(ofanangle) 139.D 
(of a central simple algebra) 29.E 
(of a divisor class) 11.D 
(of a divisor of an algebraic curve) 9.C 
(of an element with respect to a prime ideal of a 
Dedekind domain) 439.F 
(ofan extension) 149.F 
(ofa graph) 186.B 
(of a Jordan algebra) 231.B 
(of a linear representation) 362.D 
(of a matrix representation) 362.D 
(of an ordinary differential equation) 313.A 
(of a permutation representation) 362.B 
(of a polynomial) 337.A 
(of a prime divisor) 9.D 
(of a rational homomorphism) 3.C 
(of a representation of a Lie algebra) 248.B 
(of a representation of a Lie group) 249.0 
(of a square matrix) 269.A 
(of a term of a polynomial) 337.B 
(ofa valuation) 439.1 
(of a O-cycle on an algebraic variety) 16.M 
complementary (of a spectral sequence) 200.J 
of covering (of a nonsingular curve; 9.I 
filtration 200.J 
formal (of a unitary representation) 437.M 
of freedom (of the dynamical system) 271.F 
of freedom (of error sum of squares) 403.E 
of freedom (of sampling distributions) 374.B 
in- 186.B 
Leray-Schauder 286.D 
local, of mapping 99.B 


1963 


mapping 99.A 
of mapping 99.A 
out- 186.B 
ofthe point 99.D 
of a prime divisor of an algebraic function field 
of dimension 1 9.D 
of ramification (of a branch point) 367.B 
of recursive unsolvability 97 
relative (of a finite extension) 257.D 
relative (of a prime ideal over a field) 14.I 
ofsymmetry 431.D 
total (of a spectral sequence)  200.J 
transcendence (of a field extension) 149.K 
of transcendency (of a field extension) 149.K 
ofunsolvability 97 
degree k 
holomorphic differential forms of 72A 
tensor space of 256.J 
degree n 
alternating group of 151.G 
component of 200.B 
general linear group of 60.B 
projective general linear group of 60.B 
Siegel modular function of 32.F 
Siegel modular group of 32.F 
Siegel space of 32.F 
Siegel upper half-space of 32.F 
special linear group of 60.B 
symmetric group of 151.G 
degree p, contravariant tensor of 256] 
degree q, covariant tensor of 256J 
degree r 
differential form of 105.Q 
differential form of (on an algebraic variety) 
16.0 
mcan of (of a function with respect to a weight 
function) 211.C 
Dehn lemma (on 3-manifolds) 65.E 
Dejon-Nickel method 301.G 
Delaunay curve 93.H 
delay convention, perfect 51.F 
delay-differential equation 163.A 
delayed recurrent event 260.C 
Delos problem (in geometric construction) 179.A 
delta, Kronecker 269.A, App. A, Table AU 
delta function, Dirac App. A, Table (31 
demography 40.D 
de Moivre formula 74.C 
de Moivre-Laplace theorem 250.B 
de Morgan law 381.B 
in a Boolean algebra 42.A 
Denjoy-Carleman condition 168.B 
Denjoy integrable in the wider sense 100.D 
Denjoy integrals 100 
in the restricted sense 100.D 
Denjoy-Luzin theorem 159.1] 
denominator, partial (of an infinite continued frac- 
tion) 83.A 
dense 
(set) 425.N 
(totally ordered set) 311.B 
locally 154.D 
nowhere 425.N 
relatively 126.E 
Zariski 16.A 
dense in itself 425.0 
denseness of rational numbers 355.B 
density 
(on a maximal torus) 248. Y 








Subject Index 
De Rham theorem (on a C*-manifold) 


(of a set of prime ideals) 14.S 
(of a subset of integers) 4.A 
angular momentum  150.B 
beta 397.D 

bivariate normal | 397 
conditional 397 
cospectral 397.N 

electric flux 130.A 

energy 195.B 

4-current 150.B 

free Lagrangian 150.B 
gamma 397.D 

joint 397] 

kinetic 218.A 

Lagrangian  150.B 
magnetic flux | 130.4 

point of (of a measurable set of the real line) 


100.B 
posterior 40!.B 
prior 401.B 


probability 341.D 
sojourn time 45.G 
density function 397.D 
bispectral 421.C 
marginal 397.1 
normal 397.D 
rational spectral 176.F 
density matrix 351.B 
density theorem 
(on discrete subgroups of a Lie group) 122.F 
Chebotarev 14.8 
Kaplansky 308.C 
Lebesgue 100.B 
von Neumann 308.C 
dependence, domain of 325.B 
dependent 
algebraically (elements of a ring) 369.A 
algebraically (on a family of elements of a field) 
149.K 
functionally (components of a mapping) 208.C 
functionally, of class C” (components of a 
mapping) 208.C 
linearly (elements in a linear space) 256.C 
linearly (elements in an additive group) 2.E 
linearly (with respect to a difference equation) 
104.D 
path, d-trial 346.G 
dependent points 
(in an affine space) 7.A 
(in a projective space) 343.B 
dependent set 66.G 
dependent variable 165.C 
depending choice, principle of 33.F 
depth (of an ideal) 67.E 
de Rham cohomology group  201.H 
de Rham cohomology group (of a differentiable 
manifold) 105.R 
de Rham cohomology ring (of a differentiable 
manifold) 105.R 
de Rham cohomology ring (of a topological space) 
201.I 
de Rham complex (as an elliptic complex) 237.H 
de Rham decomposition (of a Riemannian manifold) 
364.E 
de Rham equations 274.G 
de Rham homology theory 114.L 
de Rham system, partial 274.G 
de Rham theorem (on a C*^-manifold) 105.V 201.H 
analog of 21.L 


Subject Index 
Derivable 


derivable 
approximately (measurable function) 100.B 
in the general sense (a set function) 380.D 
in the ordinary sense (a set function) 380.D 
derivation 
(of an algebra) 200.L 
(of an algebraic function field) 16.0 
(of a commutative ring) 113 
(ofafield) 149.L 
(ofa Lie algebra) 248.H 
(of a linear operator in a C*-algebra) 36.K 
*- 36K 
inner (in an associative algebra) 200.L 
inner (in a Lie algebra) 248.H 
invariant (on an Abelian variety) 3.F 
overk 149.L 
Lie algebra of 248.H 
derivation tree 31.E 
derivative 
(ofa distribution) 125.E 
(ofa function) 106.A 
(of a holomorphic function) 198.A 
(ofa hyperfunction) 125.X 
(of a polynomial) 337.G 
(of an element in a differential ring) 113 
angular (of a holomorphic function) 43.K 
approximate (of a measurable function)  100.B 
covariant (of a tensor field) 80.I, App. A, 
Table 4.II 
covariant (of a tensor field in the direction of a 
tangent vector) 417.B 
covariant (of a vector field)  80.I 
covariant (of a vector field along a curve) 80.1 
directional 106.G 
distribution 125.E 
exterior (of a differential form) 105.Q 
first-order 106.A 
Fréchet 286.E 
free 235.C 
Gâteaux 286.E 
general (of a set function) 380.D 
generalized 125.E 
general lower (of a set function) 380.D 
general upper (of a set function) 380.D 
higher-order (of a differentiable function) 
106.D, App. A, Table 9.IIT 
higher-order partial 106.H 
Lagrangian 205.A 
left (on the left) 106.A 
Lie (of a differential form) 105.Q 
Lie (of a tensor field) 105.0 
normal 106.G 
nth (of a differentiable function) 106.D 
ordinary (of a set function) 380.D 
ordinary lower (of a set function) 380.D 
ordinary upper (of a set function) 380.D 
partial 106.F,K 
partial, nth order 106.H 
atapoint 106.A 
Radon-Nikodym  380.C 
right (on the right) 106.A 
Schwarzian App. A, Table 9.III 
spherical (for an analytic or meromorphic 
function) 435.E 
variational 46.B 
weak 125.E 
derivatives and primitive functions App. A, Table 9.I 
derived (language) 31.D 
derived algebra (of a Lie algebra) 248.C 
derived function 106.A 





1964 


nth 106.D 
derived group (ofa group) 190.H 
derived neighborhood 65.C 
second barycentric 65.C 
derived normal! model (ofa variety) 16.F 
derived series (of a Lie algebra) 248.C 
derived set (of a set) 425.0 
derived sheaf 125.W 
derived unit 414.P 
Desarguesian geometry, non- 155.E 343.C 
Desargues theorem 155.E 343.C 
Descartes, R. 101 
folium of 93.H 
Descartes theorem 10.E 
descending central series (of a Lie algebra) 248.C 
descending chain 
(in a lattice) 243.F 
(of (normal) subgroups of a group) 190.F 
(in an ordered set) 311.C 


descending chain condition 


(for a (normal) subgroup ofa group) 190.F 
(in an ordered set) 311.C 
descent 
curve of steepest 46.A 
line of swiftest 93.H 
method of steepest 212.C 
descriptive set theory 
classical 356.H 
effective 356.H 
design 
block (— block design) 
central composite 102.M 
completely randomized 102.A 
factorial 102.H 
first-order 102.M 
fractional factorial 102.1 
second-order 102.M 
Youden square 102.K 
design matrix 102.A 403.D 
design-of-experiment analysis 403.D 
design of experiments 102 
designs for estimating parameters 102.M 
designs for exploring a response surface 102.M 
designs for two-way elimination of heterogeneity 
102.K 
desingularization (of an analytic space) 23.D 
de Sitter space 355.D 
desuspend 114.L 
detecting, error- 63.A 
determinacy 
axiom of 22.H 
projective 22.H 
determinant(s) 103 
(of an element of the general linear group over 
a noncommutative field) 60.0 
(of a nuclear operator) 68.L 
Casorati 104.D 
critical 182.B 
cyclic 103.G 
Fredholm 217.E 
Gramian 103.G 208.E 
Hankel 142.E 
Hill 268.B 
infinite (in Hill's method of solution) 268.B 
Jacobian 208.B 
ofalattice 182.B 
Lopatinski 325.K 
Pfaffian 103.G 
Vandermonde 103.G 
Wronskian 208.E 


1965 


determinantal equation, Hill 268.B 
determinant factor (of a matrix) 269.E 
determinateness, axiom of 33.F 
determination, coefficient of 397.H,J 
determination, orbit 309.A 
determined system 

of differential operators 112.R 

of partial differential equations 320.F 
determining set (of a domain in C") 21.C 
deterministic 

(in prediction theory) 395.D 

(Turing machine) 31.B 
deterministic linear bounded automaton 31.D 
deterministic process 127.B 
Deuring-Heilbronn phenomenon 123.D 
developable function, asymptotically 30.A 
developable space 425.AA 
developable surface 111.1 
development 

alongacurve 364.B 

ofacurve 80.N 111.H 364.B 
deviation 

large 250.B 

mean absolute 397.C 

standard 341.B 342.C 397.C 
deviation point 335.B 
devices, peripheral 75.B 
de Vries equation, Korteweg- 387.A 
(DF)-space 424.P 
DFT (Discrete Fourier Transform) 142.D 
diagonal (of a Cartesian product of sets) 381.B 

436.A 
diagonalizable (linear transformation) 269.L 
diagonalizable operator (in an Abelian von Neu- 
mann algebra) 308.G 


diagonal mapping (of a graded coalgebra) 203.B,F 


diagonal matrix 269.A 
diagonal morphism (in a category) 52.E 
diagonal partial sum (of a double series) 379.E 
diagonal sum (of a matrix) 269.F 
diagram |. 52.C 
(of a symmetric Riemann space) 413.F 
arrow 281.D 
associated (in irreducible representations of 
orthogonal groups) 60J 
in a category 52.C 
commutative 52.C 
Coxeter (of a complex semisimple Lie algebra) 
248.8 
Dynkin (of a complex semisimple Lie algebra) 
248.8, App. A, Table 5.I 
extended Dynkin App. A, Table 5.I 
Feynman 146.B 
mutually associated 60.J 
Newton 254.D 
Satake (of a compact symmetric Riemannian 
space) 437.AA 
Satake (of a real semisimple Lie algebra) 
248.U, App. A, Table 5.1 
scatter 397.H 
Schläfli (of a complex semisimple Lie algebra) 
248.8 
Young 362.H 
diameter 
(ofa central conic) 78.G 
(of a solid sphere) 140 
(of a subset in a metric space) 273.B 
conjugate (of a diameter of a central conic) 
78.G 
transfinite 48.D 


Subject Index 
Differentiable 


diathermal wall 419.A 
dichotomy 398.C 
Dido’s problem 228.A 
dielectric constant 130.B 
Dieudonné complete 436.1 
Dieudonné theorem 425.X 
diffeomorphic C?-manifolds 105.J 
diffeomorphism(s) 
Anosov 126.3 136.G 
Axiom A 126.J 
C’- (in nonlinear functional analysis) 286.E 
of class C" 105.J 
group of orientation-preserving 114.1 
horse-shoe 126.J 
minimal 126.N 
Morse-Smale 126.J 
Y- 136.G 
diffeotopy theorem 178.E 
difference 102.E 104.A 
backward 223.C, App. A, Table 21 
central 223.C 304.E, App. A, Table 21 
divided 223.D 
finite 223.C 
forward 304.E 
of the nth order 104.A 
primary 305.C 
second 104A 
symmetric 304.E 
of two sets 381.B 
difference analog 304.E 
difference cocycle 305.B 
difference-differential equation 163.A 
difference equation 104 
geometric 104.G 
homogeneous 104.C 
inhomogeneous 104.C 
linear 104.C 
nonhomogeneous 104.C 
difference group (of an additive group) 190.C 
difference method 303.A 
difference product 337.1 
difference quotient 104.A 
difference scheme 304.F 
of backward type 304.F 
of forward type 304.F 
difference set 102.E 
difference table 223.C 
different 
(of an algebraic number field) 14.J 
(of a maximal order) 27.B 
relative 14J 
differentiable 
complex function 21.C 
Fréchet 286.E 
Gâteaux 286.E 
infinitely 106.K 
left (on the left) 106.A 
n-times 106.D 
n-times continuously 106.K 
with respect to the parameter (a distribution) 
125.H 
partially 106.F 
at a point (a complex function) 198.A 
at a point (a real function) 106.A 
right (on the right) 106.A 
in the sense of Stolz 106.G 
onaset 106.A 
termwise (infinite series with function terms) 
379.H 
totally 21.C 106.G 


Subject Index 
Differentiable dynamical system of class C" 


differentiable dynamical system of class C" 126.B 
differentiable manifold(s) 105 
with boundary of class C" 105.E 
of class C" 105.D 
Riernann-Roch theorem for 237.G 
differentiable mapping of class C' 105.J 
differential of (at a point) 105 
differentiable pinching problem — 178.E 
differentiable semigroup 378.F 
differentiable slice theorem 431.C 
differentiable structure{s) 114.B 
of class C' 105.D 
group of oriented (on a combinatorial sphere) 
1141 
differentiable transformation group  331.C 
differential 
(=boundary operator) 200.H 
{=coboundary operator) 200.F 
{of a differentiable function) 106.B 
(of a differentiable mapping at a point) 105.J 
(of a function on a differentiable manifold) 
105.1 
(Fréchet derivative) 286.E 
Abelian (of the first, second, third kind) 11.C 
367.H 
analytic (on a Riemann surface) 367.H 
conjugate (on a Riemann surface) 367.H 
covariant (of a differential form) 80.G 
covariant (of a tensor field) 80.1,L 417.B 
covariant (of a vector field) 80.1 
exterior (of a differential form) 105.Q 
harmonic (on a Riemann surface) 367.H 
holomorphic (on a Riemann surface) 367.H 
kernel 188.G 
meromorphic (on a Riemann surface) 367.H 
nth (of a differentiable function) 106.D 
of nth order (of a differentiable function) 


106.D 
D- (on an algebraic curve) 9.F 
partial 200.H 


quadratic (on a Riemann surface) 11.D 
rth (Fréchet derivative) 286.E 
stochastic 406.C 
total 106.G 200.H 367.H 
differential analyzer 19.E 
differential and integral calculus App. A, Table 9 
differential automorphism 113 
differential calculus 106 
differential coefficient | 106.A 
partial 106.A 
differential cross section 375.A 386.B 
differential! divisor (of an algebraic curve) 9.C 
differential divisor class (of a Riemann surface) 
11.D 
differential equation(s) 313.A 
adjoint 252.K 
algebraic 113 288.A 
almost periodic 290.A 
Beltrami 352.B 
Bernoulli App. A, Table 14.1 
Bessel 39.B, App. A, Table 14.II 
Briot-Bouquet 288.B 289.B 
Caianiello 291.F 
Cauchy-Riemann 198.A 
Cauchy-Riemann (for a holomorphic function 
of several complex variables) 21.C 
Cauchy-Riemann (for a holomorphic function 
of two complex variables) 320.F 
Chaplygin 326.B 
Chebyshev App. A, Table 14.1 
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Cherwell-Wright 291.F 

Clairaut App. A, Table 14.1 

Clairaut partial App. A, Table 15.II 

confluent 167.A 

confluent hypergeometric 167.A, App. A, 
Table 14.11 

delay 163.A 

with deviating argument 163.A 

difference- 163.A 

Duffing 290.C 

elliptic partial App. A, Table 15. VI 

Emden 291.F 

Euler (in dynamics of rigid bodies) 271.E 

Euler-Lagrange 46.B 

Euler linear ordinary App. A, Table 14.] 

exact App. A, Table 14.1 

Fokker-Planck partial 115.A 

functional- 163.A 

Gauss hypergeometric App A, Table 14.II 

Gaussian 206.A 

generalized Lamé 167.E 

generalized Riccati App. A, Table 14.I 

Hamilton | 324. E 

Hamilton-Jacobi 23.B 324.E 

Helmholtz | 188.D, App. A, Table 15.VI 

Hermite App. A, Tables 14.II 20.11 

Hill 268.B 

Hodgkin-Huxley 291.F 

hyperbolic 325 

hyperbolic partial 325 

hypergeometric 206.A, App. A, Table 18.I 

hyperspherical 393.E 

integro- 163.A 222 

integro-, of Fredholm type 222.4 

integro-, of Volterra type 222.A 

Jacobi App. A, Tables 14.II 20.V 

Killing | 364.F 

Kummer App. A, Table !9.I 

withlag 163.A 

Lagrange 320.A, App. A, Table 14.I 

Lagrange partial App. A, Table Si 

Laguerre App. A, Tables 14.II 20.VI 

Lame 113.B 

Laplace 323.A, App. A, Table 15 III 

Laplace, in the 2-dimensional case App. A, 
Table 15.VI 

Laplace, in the 3-dimensional case App. A, 
Table 15.VI 

Legendre 393.B, App. A, Table 14.II 

Legendre associated 393.A 

Liénard 290.C 

linear ordinary 252.A 313.A 

linear partial 320.A 

Lowner 438.B 

Mathieu 268.A 

matrix Riccati 86.E 

modified Mathieu 268.A 

Monge 324F 

Monge-Ampére 278.A, App. A, Table 15.III 

nonlinear ordinary 313.A 

nonlinear partial 320.A 

ordinary 313 

partial 313.A 320 

partial, of elliptic type 323 

partial, of hyperbolic type 325 

partial, of mixed type 326 

partial, of parabolic type 327 

Poisson 323.A, App. A, Table 15.HI 

polytropic 291.F 

rational 288.A 


1967 


related 254.F 
with retardation 163.A 
retarded 163.A 
Riccati App. A, Table 14.I 
Riemann App. A, Table 18.I 
self-adjoint 252.K 
self-adjoint system of 252.K 
stochastic 342.A 406 
Stokes 167.E 188.E 
strongly nonlinear 290.C 
system of, of Maurer-Cartan 249.R 
system of hyperbolic (in the sense of Petrovskii) 
325.G 
system of linear, of the first order 252.G 
system of ordinary 313.B 
system of partial, of order l (on a differentiable 
manifold) 428.F 
system of total 428.A 
Tissot-Pochhammer ` 206.C 
total 428.A, App. A, Table 15.I 
Tricomi 326.C 
van der Pol 290.C 
weakly nonlinear 290.C 
Weber 167.C, App. A, Table 20.III 
Weber-Hermite 167.C 
Whittaker 167.B, App. A, Tables 14.II 19.11 
differential extension ring 113 
differential field 113 
Galois theory of 113 
differential form 105.Q 
closed 105.Q 
of degree! 105.0 
ofdegreer 105.Q 
of degree r (on an algebraic variety) 16.0 
divisor of (on an algebraic variety) 16.0 
exact 105.Q 
exterior, of degreer 105.Q 
of the first kind (on an algebraic variety) 16.0 
of the first kind (on a nonsingular curve) 9.E 
harmonic 194.B 
holomorphic, of degree k 72.A 
invariant (on an Abelian variety) 3.F 
of Maurer-Cartan 249.R 
primitive 232.B 
of the second kind (on a nonsingular curve) 
9.E 
of the third kind (on a nonsingular curve) 9.E 
of type (r,s) 72.C 
differential geometry 109, App. A, Table 4 
affine 110.C 
conforma! 110.D 
projective 110.B 
differential geometry in specific spaces 110 
differential geometry of curves and surfaces 
111 
differential ideal 113 
of a differential ring 113 
involutive 428.E 
prime (of a differential ring) 113 
semiprime (of a differential ring) 113 
sheaf on a real analytic manifold 428.E 
differential index (in a covering of a nonsingular 
curve) 9I 
differential invariant 
fundamental (of a surface) 110.B 
on an m-dimensional surface 110.A 
Poincaré 74.G 
differential law 107.A 
differential operator(s) 112 223.C 306.B 
Beltrami, of the first kind App. A, Table 4.II 





Subject Index 
Dimension 


Beltrami, of the second kind App. A, Table 
AU 
elliptic 112.A 
ofthe kth order 237.H 
ordinary 112.A 
partial 112.A 
pseudo- 345 
strongly elliptic 112.G 323.H 
system of 112.R 
differential polynomial(s) 113 
ring of 113 
differential quotient (at a point) 106.A 
differential representation (of a unitary representa- 
tion ofa Lie group) 439.S 
differential rings 113 
differential subring 113 
differential system — 191.I 
restricted 191.1 
differential topology 114 
differential variable 113 
differentiation 
(in a commutative ring) 113 
(of a differential function) 106.A 
graphical 19.B 
higher (in a commutative ring) 113 
logarithmic App. A, Table 9.T 
numerical 299.E 
partial 106.F 
theorem of termwise (on distribution) 125.G 
of a vector field App. A, Table 3.II 
diffraction (of waves) 446 
diffusion, Ehrenfest model of 260.A 
diffusion-convection equation 304.B 
diffusion kernel 338.N 
diffusion process 115 
on manifolds 115.D 
multidimensional 115.C 
digamma function 174.B 
digital computer 75.B 
digital quantity 138.B 
dihedral group  151.G 
dilatation 
in Laguerre geometry 76.B 
maximal 352.B 
dilated maximum principle (in potential theory) 
338.C 
dilation (of a linear operator) 251.M 
power 251.M 
strong 251.M 
unitary 251.M 
dilation theorem 251.M 
dimension 
(of an affine space) 7.A 
(of an algebraic variety) 16.A 
(of an analytic set) 23.B 
(of an automorphic form) 32.B 
(of a cell complex) 70.D 
(of a convex cell in an affine space) 7.D 
(of a divisor class on a Riemann surface) 11.D 
(of a Euclidean simplicial complex) 70.B 
(of a free module) 277.G 
(ofa Hilbert space) 197.C 
(ofa linear space) 256.C 
(of a linear system of divisors) 9.C 16.N 
(of a physical quantity) 116 
(of a projective space) 343.B 
(of a simplicial complex) 70.C 
(of a topological space) 117 
algebraic (of an algebraic surface) 72.F 
capacity 48.G 


Subject Index 
Dimension — k 


cohomological (of an associative algebra) 
200.L 
cohomological (of a scheme) 16.E 
cohomological (of a topological space) 117.F 
complex (of a complex manifold) 72.A 
covering (of a normal space) 117.B 
decomposition theorem for 117.C 
geometric (of a vector bundle) 114.D 
global (of an analytic set) 23.B 
global (ofa ring) 200.K 
harmonic (of a Heins end) 367.E 
Hausdorff 117.G 234.E 246.K 
homological (of a module) 200.K 
homological (of a topological space) 117.F 
injective (of a module) 200.K 
Kodaira (of a compact complex manifold) 72.1 
Krull 67.E 
large inductive 117.B 
Lebesgue (of a normal space) 117.B 
left global (of a ring) 200.K 
local (of an analytic set at a point) 23.B 
product theorem for 117.C 
projective (of a module) 200.K 
right global (ofa ring) 200.K 
small inductive 117.B 
sum theorem for 117.C 
theorem on invariance of, of Euclidean spaces 
117.D 
weak (of a module) 200.K 
weak global (ofa ring) 200.K 
dimension —k 
automorphic form of 32.B 
Fuchsian form of 32.B 
Hilbert modular form of 32.G 
Siegel modular form of 32.F 
dimensional analysis 116 
dimensional formula 116 
dimension function (on a continuous geometry) 
85.A 
dimension theorem 
(of affine geometry) 7.A 
(on modular lattice) 243.F 
(of projective geometry) 343.B 
dimension theory 117 
dimension type (of a topological space) 117.H 
Dini-Hukuhara theorem 314.D 
Dini-Lipschitz test (on the convergence of Fourier 
series) 159.B 
Dini series 39.C 
Dini surface 111.] 
Dini test (on the convergence of Fourier series) 
159.B 
Dini theorem (on uniform convergence) 435.B 
Diocles, cissoid of 93.H 
Diophantine (relation) 97 
Diophantine analysis 296.A 
Diophantine approximation 182.F 
Diophantine equations 118 
Dirac delta function App. A, Table 12.II 
Dirac distribution 125.C 
Dirac equation 377.C 415.G 
Dirac field, free 377.C 
Dirac y-matrix 415.G 
Dirac hole theory 415.G 
Dirac matrix 377.C 
direct analytic continuation 198.G 
direct circle 78.D 
direct closed path 186.F 
direct decomposition (on a group) 190.L 
directed family 165.D 
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directed graph 186.B 
directed set 311.D 
direct factor 
(of a direct product of sets) 381.E 
(ofa group) 190.L 
direct image (of a sheaf) 383.G 
direct integral 308.G 
of unitary representations 437.H 
direction 
asymptotic 111.H 
Borel (of a meromorphic function) 272.F 
Julia (of a transcendental entire function) 
429.C 
positive (in a curvilinear integral) 198.B 
principal (of a surface) 111.H 
directional derivative 106.G 
direction ratio (of a line in an affine space) 7.F 
direct limit (of a direct system of sets) 210.B 
direct method 46.E 302.B 
direct path 186.F 
direct product 
(of algebras) 29.A 
(of distributions) 125.K 
(of a family of lattices) 243.C 
(of a family of ordered sets) 311.F 
(of a family of sets) 381.E 
(of a family of topological groups) 423.C 
(of a family of topological spaces) 425.K 
{of groups) 190.L 
(of G-sets) 362.B 
(of Lie groups) 249.H 
(of mappings) 381.C 
(of measurable transformations) 136.D 
(of modules) 277.F 
(of objects of a category) 52.E 
(of rings) 368.E 
(of sets) 381.B 
(of sheaves) 383.1 
restricted (of an infinite number of groups) 
190.L 
restricted (of locally compact groups) 6.B 
semi- (of groups) 190.N 
direct product decomposition 190.L 
dual 422.H 
directrix (of an ellipse) 78.B 
of Wilczynski 110.B 
direct set, increasing 308.A 
direct sum 
(of a family of ordered sets) 311.F 
(of a family of sets) 381.E 
(of G-sets) 362.B 
(of Hilbert spaces) 197.E 
(of ideals of a ring) 368.F 
(of an infinite number of groups) 190.L 
(of Lie algebras) 248.A 
(of linear representations) 362.C 
(of linear spaces) 256.F 
(of modules) 277.B,F 
(of a mutually disjoint family of sets) 381.D 
(of quadratic forms) 348.E 
(of sheaves) 383.1 
(of topological groups) 423.C 
(of two objects) 52.E 
(of unitary representations) 437.G 
integral 308.G 
topological (of topological spaces) 425.M 
direct summand (of a direct sum of sets) 381.E 
direct system (of sets) 210.B 
direct transcendental singularity 198.P 
Dirichlet, P. G. L. 119 


1969 


Dirichlet algebra 164.B 
weak* 164.G 
Dirichlet character 295.D 
Dirichlet discontinuous factor App. A, Table ON 
Dirichlet distribution | 341.D, App. A, Table 22 
Dirichlet divisor problem 242.A 
Dirichlet domain 120.A 
Dirichlet drawer principle 182.F 
Dirichlet form 261.C 
regular 261.C 
Dirichlet function 84.D 221.A 
Dirichlet functional 334.C 
Dirichlet integral 
(in the Dirichlet problem) 120.F 
(in Fourier’s single integral theorem) 160.B 
Dirichlet kernel 159.B 
Dirichlet L-function 450.C 
Dirichlet principle 120.A 323.C 
Dirichlet problem 120 293.F 323.C 
Dirichlet problem with obstacle 440.B 
Dirichlet region 234.C 
Dirichlet series 121.A 
ordinary 121.A 
of the type {4,} 121A 
Dirichlet space 338.Q 
Dirichlet test (on Abel partial summation) 379.D 
Dirichlet test (on the convergence of Fourier series) 
159.B 
Dirichlet theorem 
(of absolute convergence) 379.C 
(on the distribution of primes in arithmetical 
progression) 123.D 
Dirichlet unit theorem 14.D 
discharge of double negation 411.1 
discharging 157.D 
disconnected, extremely 37.M 
disconnected metric space, totally 79.D 
discontinuity 
of the first kind 84.B 
fixed point of (of a stochastic process) 407.A 
at most of the first kind 84.B 
region of 234.A 
discontinuity formula 146.C 386.C 
discontinuity point 84.B 
of the first kind 84.B 
of the second kind 84.B 
discontinuous distribution, purely 341.D 
discontinuous factor, Dirichlet App. A, Table 9.V 
discontinuous groups 122 
of the first kind 122.B 
discontinuous transformation group 122.A 
properly 122.A 
discontinuum, Cantor 79.D 
discrete covering 425.R 
g- 425 R 
discrete C'-flow 126.B 
discrete dynamical system of class C” 126.B 
discrete flow 126.B 
of class C" 126.B 
discrete Fourier transform 142.D 
discrete mathematics 66.A 
discrete memoryless channels | 213.F 
discrete metric space 273.B 
discrete semiflow 126.B 
of class C" 126.B 
discrete series (of unitary representations of a semi- 
simple Lie group) 437.X 
discrete series, principal 258.C 
discrete set 425.0 


Subject Index 
Distance function 


discrete spectrum 136.E 390.E 
quasi- 136.E 
discrete topological space 425.C 
discrete topology | 425.C 
discrete uniformity 436.D 
discrete valuation 439.E 
discrete valuation ring 439.E 
discrete variable method 303.A 
discrete von Neumann algebra 308.E 
discretization error 303.B 
discriminant 
(of an algebraic equation) 337J 
(of an algebraic number field) 14.B 
(of a binary quadratic form) 348.M 
(of a curve of the second order) 78.1 
(of a family of curves) 93.1 
(of a quadratic form) 348.A 
(of a simple ring) 27.B 
fundamental 295.D 
relative 14J 
discriminant function, linear 280.1 
discriminant theorem, Dedekind 14.J 
disintegration 270.L 
disjoint family, mutually (of sets) 381.D 
disjoint sets 381.B 
disjoint sum  381.B 
disjoint union 381.B 
of a mutually disjoint family of sets 381.D 
disjoint unitary representations 437.C 
disjunction (of propositions) 411.B 
disjunctive cuts 215.C 
disjunctive programming 264.C 
disk 140 
circular 140 
n- 140 
open 140 
openn- 140 
unit 140 
disk algebra 164.B 
disk theorem (on meromorphic functions) 272.] 
dispersion 397.C 
dispersion relations 132.C 
dispersive 
(flow) 126.E 
(linear operator) 286.Y 
dispersive wave 446 
displacement 
electric 130.A 


parallel (of a tangent vector space) 80.H 364.B 


parallel, alongacurve 80.C 
dissection, Farey 4.B 
dissipative (operator) 251.J 286.C 
maximal 251.J 
dissipative part (of a state space) 260.B 
distance 
(in Euclidean geometry) 139.E 
(in a metric space) 273.B 
Euclidean 139.E 
extremal 143.B 
Fréchet (between surfaces) 246.1 
Hamming 63.B 136.E 
Lévy 341.F 
Mahalanobis generalized 280.E 
non-Euclidean (in a Klein model) 285.C 
optical 180.A 
perihelion 309.B 
reduced extremal 143.B 
distance function 273.B 
pseudo- 273.B 


Subject Index 
Distinct differentiable structures 


distinct differentiable structures 114.B 
distinct system of parameters 284.D 
distinguishable, finitely (hypothesis) 400.K 
distinguished basis, strongly 418.F 
distinguished pseudopolynomial 21.E 
distortion function, rate 213.E 
distortion inequalities 438.B 
distortion measure 213.E 
distortion theorem 438.B 
distributed 
asymptotically 374.D 
uniformly 182.H 
distribution(s) 125 
(on a differentiable manifold) 125.R 
(of random variables) 342.C 
(of a vector bundle) 428.D 
a posteriori 398.B 
a priori 398.B 
asymptotically normal 399.K 
beta 341.D, App. A, Table 22 
Beurling generalized 125.U 
binomial 341.D 397.F, App. A, Table 22 
bivariate 397.H 
capacity mass 338.K 
Cauchy 341.D, App. A, Table 22 
chi-square |. 374.4, App. A, Table 22 
conditional probability 342.E 
continuous 341.D 


converge in (a sequence of random variables) 


342.D 
covariance 395.C 
cumulative 397.B 
Dirac 125.C 
Dirichlet | 341.D, App. A, Table 22 
double, potential of 338.A 
entropy ofa 403.B 
equilibrium, Gibbs 136.C 
equilibrium mass 338.K 
exponential 341.D, App. A, Table 22 
exponential family of 396.G 
F- 341.D 374.B, App. A, Table 22 
fiducial 401.F 
finite-dimensional 407.A 
of finite order 125.J 
function, empirical 374.D 
gamma  341.D, App. A, Table 22 
Gaussian 341.D 
geometric 341.D, App. A, Table 22 


hypergeometric 341.D 397.F, App. A, Table 22 


infinitely divisible 341.G 

initial 261.A 

initial law 406.D 

integrable 125.N 

invariant (of a Markov chain) 260.A 
invariant (second quantization) 377.C 


involutive (on a differentiable manifold) 428.D 


joint 342.C 
k-dimensional normal 341.D 
k-Erlang 260.H 

kth-order asymptotic 399.0 
L- 341.G 

lattice 341.D 

law, Maxwell-Boltzmann | 402.B 
least favorable 400.B 

least favorable a priori 398.H 
limit 250.A 

logarithmic App. A, Table 22 
logarithmic normal App. A, Table 22 
marginal 342.C 397.H 
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multidimensional hypergeometric App. A, 
Table 22 

multidimensional normal App. A, Table 22 

multinomial 341.D 

multiple hypergeometric 341.D 

multivariate normal 397.J 

n-dimensional 342.C 

n-dimensional probability 342.C 

negative binomial 341.D 397.F, App. A, 
Table 22 

negative multinomial 341.D 

negative polynomial App. A, Table 22 

noncentral chi-square 374.B 

noncentral F- 374.B 

noncentralt- 374.B 

noncentral Wishart 374.C 

normal 341.D 397.C, App. A, Table 22 

one-dimensional probability, of a andom 
variable 342.C 

one-side stable for exponent 1/2 App. A, 
Table 22 

operator-valued 150.D 

p-dimensional noncentral Wishart 374.C 

Pearson 397.D 

pluriharmonic 21.C 

Poisson 341.D 397.F, App. A, Table 22 

polynomial App. A, Table 22 

population 396.B 401.F 

positive 125.C 

posterior 401.B 403.G 

predictive 403.G 

of prime numbers 123 

prior 401.B 403.G 

probability 342.B, App. A, Table 22 

probability, of a random variable 342.C 

purely discontinuous 341.D 

quasistable 341.G 

random 395.H 407.C 

random, with independent values at every point 
407.C 

random, in the wider sense 395.C 407.C 

rapidly decreasing 125.0 

rectangular App. A, Table 22 

sampling 374.A 

semistable 341.G 

simple, potential of 338.A 

simultaneous 342.C 

slowly increasing 125.N 

stable 341.G 

standard Gaussian 176.A 

standard normal 341.D 

strictly stationary random 395.H 

strongly stationary random ` 205 H 

substituted 125.Q 

t- 341.D 374.B, App. A, Table 22 

tempered 125.N 

two-sided exponential App. A, Table 22 

of typical random variables App. A, Table 22 

ultra-, of class {M,} or (M,) 125.U,BB 

uniform 341.D, App. A, Table 22 

unit 341.D 

value 124.A 

of values of functions of a complex variable 
124 

waiting time 307.C 

weakly stationary random 395.C: 

Wishart 374.C 

z- MILD 374.B, App. A, Table 22 


distribution curve, cumulative 397.B 
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distribution derivative 125.E 
distribution-free (test) 371.A 
distribution-free method 371.A 
distribution function 168.B 341.B 342.C 
cumulative 341.B 342.C 
empirical 250.F 396.C 
n-dimensional 342.C 
symmetric 341.H 
unimodal 341.H 
distribution kernel 338.P 
distribution law, Maxwell-Boltzmann 402.B 
distribution polygon, cumulative 397.B 
distribution semigroup |. 378.F 
distributive algebra 231.A 
distributive lattice 243.E 
distributive law 
(in algebra of sets) 381.B 
(on cardinal numbers) 49.C 
(in a lattice) 243.E 
(on natural numbers) 294.B 
(in a ring) 368.A 
complete (in a lattice-ordered group) 243.G 
disturbance | 128.C 
diurnal aberration 392 
div (divergence) 442.D 
diverge 87.B,E 379.A 
to% 87.D 
divergence 
(of a differentiable vector field) 442.D 
(of a vector field with respect to a Riemannian 
metric) 105.W 
(of a vector field with respect to a volume 
element) 105.W 
infrared 146.B 
ultraviolet 146.B 
divergence form 323.D 
divergence theorem 94.D 
divergent 
(double series) 379.E 
(infinite product) 379.G 
(integral) 216.E 
(sequence of real numbers) 87.B 
(series) 379.A 
properly 379.A 
divide (a bounded domain) 384.F 
divided difference 223.D 
dividing cycle (on an open Riemann surface) 367.1 
divisibility relation (in a ring) 67.H 
divisible 
(Abelian p-group) 2.D 
(additive group) 2.E 
(element of ring) 67.H 277.D 
(fractional ideal) 14.E 
(general Siegel domain) 384.F 
(number) 297.A 
divisible A-module 277.D 
divisible subgroup (of a discrete Abelian group) 
422.G 
division (of a pseudomanifold) 65.A 
simplicial 65.A 
division algebra 29.A 
division algorithm 
of natural numbers |. 297.A 
of polynomials | 337.C 
division ring 368.B 
division theorem 
Späth type (for microdifferential operators) 
274.E 
Weierstrass type (for microdifferential opera- 
tors) 274.E 


Subject Index 
Divisor 


divisor 

(in an algebraic curve) 9.C 

(of an algebraic function field of dimension 1) 
9.D 

(of an algebraic number field) 14.F 

(in an algebraic variety) 16.M 

(in a closed Riemann surface) 11.D 

(in a complex manifold) 72.F 

(of an element of a ring) 67.H 

(of a fractional ideal) 14.E 

(ofa number) 297.A 

ample 16.N 

branch (ina covering) 9.1 

canonical (of an algebraic curve) OC 

canonical (of an algebraic variety) 16.0 

canonical (of a Jacobian variety) 9.E 

canonical (of a Riemann surface) 11.D 

Cartier 16.M 

common (of elements of a ring) 67.H 

complete linear system defined by 16.N 

complex line bundle determined by 72.F 

differential (of an algebraic curve) OC 

of a differential form (on an algebraic variety) 
16.0 

effective (on an algebraic curve) 9.C 

effective (on a variety) 16.M 

elementary (of a matrix) 269.E 

embedded prime (of an ideal) 67.F 

finite prime 439.H 

of a function (on an algebraic curve) 9.C 

of a function (on an algebraic variety) 16.M 

greatest common 297.A 

greatest common (of an element of a ring) 
67.H 

imaginary infinite prime | 439.H 

infinite prime | 439.H 

integral (of an algebraic curve) 9.C 

integral (of an algebraic number field) 14.F 

integral (on a Riemann surface) 11.D 

isolated prime (of an ideal) 67.F 

k-rational (on an algebraic curve) 9.C 

linearly equivalent (of a complex manifold) 
72.F 

maximal prime (of an ideal) 67.F 

minimal prime (of an ideal) 67.F 

nondegenerate 16.N 

nondegenerate (on an Abelian variety) 3.D 

numerically connected 232.D 

-linearly equivalent (on an algebraic curve) 
9.F 

pole (of a function on an algebraic variety) 
16.M 

positive (of an algebraic curve) OC 

positive (on a Riemann surface) 11.D 

prime (of an algebraic function field of dimen- 
sion 1) OD 

prime (of an algebraic number field or an 
algebraic function field of one variable) 
439.H 

prime (of an ideal) 67.F 

prime (on a Riemann surface) 11.D 

prime rational, over a fteld (on an algebraic 
curve) 9.C 

principal (on an algebraic curve) OC 

principal (on a Riemann surface) 11.D 

real infinite prime 439.H 

real prime 439.H 

sheaf of ideals of (of a complex manifold) 72.F 

special 9.C 

very ample 16.N 


Subject Index 
Divisor class (on a Riemaun surface) 


zero (of a function on an algebraic variety) 
16.M 
zero (ofa ring) 368.B 
zero, with respect to M/P 284.A 
divisor class (on a Riemann surface) 11.D 
canonical 11.D 
differential 11.D 
divisor class group (of a Riemann surface) 11.D 
divisor function 295.C 
generalized 295.C 
divisor group (of a compact complex manifold) 
72.F 
divisor problem, Dirichlet 242.A 
Dixmier theorem, Rellich- 351.C 
Dixon-Ferrar formula App. A, Table 19.IV 
DK method 301.F 
DKA method 301.F 
DLR equation 402.G 
dn App. A, Table 16.III 
dodecahedron 357.B 
Doetsch three-line theorem 43.E 
Dolbeault cohomology group 72.D 
Dolbeault complex 72.D 
Dolbeault lemma 72.D 
Dolbeault theorem 72.D 
domain(s) 
(of a correspondence) 358.B 
(ofa mapping) 37.C 381.C 
(in a topological space) 79.A 
(of a variable) 165.C 
angular 333.A 
annular 333.A 
ofattraction 374.G 


Brouwer theorem on the invariance of 117.D 


Cartan pseudoconvex 21.1] 
circular 333.A 

of class C^^ 323.F 

closed plane 333.A 

complete Reinhardt 21.B 
convergence (of a power series) 21.B 
Courant-Cheng, theorem 391.H 
of dependence 325.B 

Dirichlet 120.A 

divisible bounded 284.F 
d-pseudoconvex 21.G 

effective 88.D 

fundamental 234.C 

generated Siegel 384.F 
holomorphically complete 21.F 
holomorphically convex 21.H 
ofholomorphy 21.F 
homogeneous bounded 384.A 412.F 
individual 411.H 

of influence 325.B 

integral 368.B 

of integration 216.F 

irreducible symmetric bounded 412.F 
Jordan 333.A 

Levi pseudoconvex 21.1 

of a local homomorphism 423.0 
locally Cartan pseudoconvex 21.1 
locally Levi pseudoconvex  21.I 
nodal 391.H 

Noetherian 284.A 

Noetherian integral 284.A 
object 411.G 

of operator 409.A 

operator (ofa group) 190.E 

plane 333 

principal ideal 67.K 
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pseudoconvex 21.G 
with regular boundary (in a C?-manifold) 
105.U 
Reinhardt 21.B 
Siegel 384.A 
Siegel, generalized 384.F 
Siegel, irreducible 384.E 
Siegel, of the first kind 384.A 
Siegel, of the second kind 384.A 
Siegel, of the third kind 384.A 
slit 333.A 
with smooth boundary (in a C?-manifold) 
105.U 
spectrum of 391.A 
strongly pseudoconvex 21.G 
sweepable bounded 284.F 
symmetric bounded 412.F 
unique factorization A0. H 
universal 16.A 
Weil 21.G 
domain kernel (of a sequence of domains) 333.C 
dominant (of a sequence of functions) 435.A 
dominant integral form (on a Cartan subalgebra) 
248.W 
dominate (an imputation of a game) 173.D 
dominated 
(by a family of topological spaces) 425.M 
(statistical structures) 396.F 
weakly (statistical structure) 396.F 
dominated ergodic theorem  136.B 
dominating set 186.1 
domination, number of 186] 


. domination principle 338.L 


inverse 338.L 
Donsker invariance principle 250.E 
Doob-Meyer decomposition theorem 262.D 
Doolittle method 302.B 
dotted indices 258.B 
dotted spinor of rank k 258.B 
Douady space 23.G 
double chain complex 200.E 
double complex 200.H 
double coset (of two subgroups of a group) 190.C 
double distribution, potential of 338.A 
double exponential formula 299.B 
double integral 216.F 
double integral theorem, Fourier 160.B 
double layer, potential of 338.A 
double mathematical induction 294.B 
double negation, discharge of 411.1 
double point, rational 418.C 
double ratio 343.E 
double sampling inspection 404.C 
double sequence 379.E 
double series 379.E 
absolutely convergent 379.E 
conditionally convergent 379.E 
convergent 379.E 
divergent 379.E 
Weierstrass theorem of 379.H 
double suspension theorem  65.C 
double-valued representation 258.B 
doubly invariant 164.H 
doubly periodic function 134.E 
Douglas algebra 164.1 
Douglas functional 334.C 
Douglas-Radó solution (to Plateau’s problem) 
275.C 
downhill method 301.L 
down-ladder 206.B 
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drawer principle, Dirichlet 182.F 
drift 
transformation by 261.F 
transformation of 406.B 
drift part 406.B 
dual 
(cell) 65.B 
(graded module) 203.B 
(graph) 186.H 
(matroid) 66.H 
(proposition in a projective space) 343.B 
(regular polyhedron) 357.B 
(symmetric Riemannian space) 412.D 
(topological group) 422.C 437.J 
dual algebra 203.F 
dual basis (of a linear space) 256.G 
dual bundle 147.F 
dual category 52.F 
dual cell, (n — ai, Gab 
dualcoalgebra 203.F 
dual complex 65.B 
dualcone 125.BB 
dual convex cone 89.F 
dual curve (of a plane algebraic curve) 9.B 
dual direct product decomposition 422.H 
dual frame 417.B 
dual homomorphism 
(of a homomorphism of algebraic tori) 13.D 
(of lattices) 243.C 
dual Hopf algebra 203.C 
dual isomorphism 
(of lattices) 243.C 
(between ordered sets) 311.E 
duality 
(in field theory) 150.E 
(for symmetric Riemannian space) 412.D 
Martineau-Harvey 125.Y 
Poincaré (in manifolds) 201.0 
Poincaré (in Weil cohomology) 450.Q 
principle of (in projective geometry) 343.B 
duality mapping 251.J 
duality principle 
(for closed convex cone) 89.F 
(for ordering) 311.A 
duality property (of linear space) 256.G 
duality theorem 
(on Abelian varieties) 3.D 
(of linear programming) 255.B 
(in mathematical programming) 292.D 
Alexander 201.0 
for Q-module 422.L 
Poincaré-Lefschetz 201.0 
Pontryagin (on topological Abelian groups) 
192.K 422.C 
Serre (on complex manifolds) 72.E 
Serre (on projective varieties) 16.E 
of Takesaki 308.1 
Tannaka (on compact groups) 69.D 
Tannaka (on compact Lie group) 249.U 
dual lattice 243.C 310.E 450.K 
dual linear space 256.G 
self- 256.H 
dually isomorphic (lattices) 243.C 
dual mapping (of a linear mapping) 256.G 
dual Martin boundary 260.1 
dual module 277.K 
dual operator 
in Banach space 37.D 
of a differential operator 125.F 
of a linear operator 251.D 








Subject Index 
é-flat 


dual ordering 311.A 
dual passive boundary point 260.1 
dual problem 255.B 349.B 
dual process 261.F 
dual representation 362.E 
dual resonance model 132.C 
dual semigroup 378.F 
dual space 
(ofa C*-algebra) 36.G 
(of a linear space) 256.G 
(of a linear topological space) 424.D 
(of a locally compact group) 437.J 
(of a normed linear space) 37.D 
(of a projective space) 343.B 
quasi- (of a locally compact group) 437.1 
strong 424.K 
dual subdivision 65.B 
dual vector bundle 147.F 
du Bois-Reymond and Dedekind, test of 379.D 
du Bois Reymond problem  159.H 
Duffing differential equation 290.C 
Duhamel method 322.D 
Duhem relation, Gibbs- 419.B 
dummy index (of a tensor) 256J 
Dunford integrable 443.F 
Dunford integral 251.G 443.F 
Dunford-Pettis theorem 68.M 
Donford-Schwartz integral, Bartle- 443.G 
duo-trio test 346.D 
Dupin, cyclide of 111.H 
Dupin indicatrix 111.H 
duplication of a cube 179.A 
Durand-Kerner-Aberth (DKA) method  301.F 
Durand-Kerner (DK) method 301.F 
Dvoretzky-Rogers theorem 443.D 
dyadic compactum |. 79.D 
dynamical system(s) 126 
classical 126.L 136.G 
continuous 126.B 
differential 126.B 
discrete 126.B 
linear 86.B 
dynamic programming 127 264.C 
dynamic programming model 307.C 
dynamics 
analytical 271.F 
fluid 205.A 
magnetofluid 259 
quantum flavor 132.D 
dynamo theory, hydromagnetic 259 
Dynkin class 270.B 
Dynkin class theorem 270.B 
Dynkin diagram (of a complex semisimple Lie 
algebra) 248.8 
extended App. A, Table 5.I 
Dynkin formula 261.C 
Dynkin representation of generator 261.C 


E 


e (topology) 424.R 

e, Eddington's App. A, Table AU 
é(Q)(-C*(Q) 125.1 168.B 
ZO 125.1 

im, p OM) 168.B 

-covering 273.B 

e-entropy 213.E 

-expansion 111.C 

g-factor 450.N 

-flat 178.D 


Subject Index 
é-Hermitian form 


é-Hermitian form 60.0 
é-independent partitions 136.E 
t-induction, axiom of 33.B 
é-neighborhood (of a point) 273.C 
t-number 312.C 
£-operator, Hilbert 411J 
£-quantifier, Hilbert 411.3 
t-sphere (of a point) 273.C 
t-symbol, Hilbert 411.J 
é-tensor product | 424.R 
é-theorem (in predicate logic) 411.J 
£-trace form 60.0 
&-function 46.C 
ó-space 193.N 
E-function 430.D 
E-optimality 102.E 
E waves 130.B 
Eberlein-Shmul'yan theorem 37.G 
Eberlein theorem 424.V 
eccentric angle 
ofa point on a hyperbola 78.E 
ofa point on an ellipse 78.E 
eccentric anomaly 309.B 
eccentricity (of a conic section) 78.B 
echelon space 168.B 
ecliptic 392 
econometrics 128 
Eddington's e App. A, Table AU 
edge 
(of a convex cell in an affine space) 
(ina graph) 186.B 
(ofa linear graph) 282.A 
reference 281.C 
edge homomorphism ` 200J 
edge of the wedge theorem 125.W 
Edgeworth expansion 374.F 
effect 403.D 
block  102.B 
factorial 102.H 
fixed 102.A 
fixed, model 102.A 
main 102. H 
random 102.A 
random, model 102.A 
treatment | 102. B 
effective descriptive set theory 356.H 
effective divisor 
(on an algebraic curve) 9.C 
(ona variety) 16.M 
effective domain 88.D 


effective genus (of an algebraic curve) 9.C 


effectively (act on a G-space) 431.A 
almost 431.A 
effectively calculable function 356.C 
effectively given (object) 22.A 
effectively parametrized (at 0) 72.G 
effect vector 102.A 
efficiency 399.D 
asymptotic 399.N 
Bahadur 400.K 
second-order 399.0 
second-order asymptotic 399.0 
efficiency-balanced block design 102.E 
efficient 
kth order asymptotic 399.0 
efficient estimator 399.D 
asymptotically 399.N 
first-order 399.0 
first-order asymptotic 399.0 
Egervary theorem, König- 28].E 
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Egorov theorem 270.5 
Ehrenfest model of diffusion 260.A 
Ehrenpreis-Malgrange theorem 112.8 
Eichler approximation theorem 27.D: 
eigenchain 390.H 
eigenelement (of a linear operator) 390.A 
eigenfunction 
(of a boundary value problem) 315.B 
(for an integral equation) 217.F 
(of a linear operator) 390.A 
generalized 375.C 
eigenspace 
(ofa linear mapping) 269.L 
(of a linear operator) 390.A 
generalized 390.B 
in a weaker sense 269.L 
eigenvalue(s) 
(of a boundary value problem) 315.B 
(of an integral equation) 217.F 
(ofa linear mapping) 269.L 
(of a linear operator) 390.A 
(of the Mathieu equation) 268.B 
(ofa matrix) 269. 
degenerate 390.A 
generalized |. 375.C 
geometrically simple ` 200. A 
index of 217.F 
multiplicity of 217.F 
numerical computation of 298 
eigenvalue problem 390.A 
generalized 298.G 
eigenvector 
(of a linear mapping) 269.L 
(ofa linear operator) 390.A 
(ofa matrix) 269.F 
generalized 390.B 
eightfold way 132.D 
eikonal 82.D 180.C 
eikonal equation 324.E 325.L 
Eilenberg-MacLane complexes 79.F 
Eilenberg-MacLane space 70.F 
Eilenberg-MacLane spectrum 202. T 
Eilenberg-Postnikov invariants (of a CW complex) 
70.G 
Eilenberg-Steenrod axioms 201.Q 
Eilenberg-Zilber theorem 201.J 
Einstein, A. 129 
Einstein convention (on tensors) 256 
Einstein-Káhler metric 232.C 
Einstein metric 364.1 
Einstein relation (in diffusion) (SA 
Einstein space 364.D, App. A, Table 4.1] 
Einstein summation convention 417.B 
Eisenstein-Poincaré series 32.F 
Eisenstein series 32.C 
generalized 450.T 
Eisenstein theorem 337.F 
elastic, total, cross section 386.B 
elasticity 
modulus of, in shear 271.G 
modulus of, in tension 271.G 
small-displacement theory cf 271.G 
theory of 271.G 
elastic limit 271.G 
elastic scattering 375.4 
elation 110.D 
electric displacement 130.A 
electric field 130.A 
electric flux density 130.A 
electric network 282.B 


1975 


electric polarization 130.A 
electric susceptibility | 130.B 
electric waves 130.B 


transverse 130.B 





theory of 130.B 
transverse 130.B 
electromagnetism 130 
electron 377.B 
electronic analog computer 19.E 
electronic computer 75.A 
electrostatics 130.B 
clement(s) 381.A 
affine arc 110.C 
algebraic (of a field) 149.E 
areal (in a Cartan space) 152.C 
atomic (in a complemented modular lattice) 
243.F 
boundary (in a simply connected domain) 
333.B 
canonical (in the representation of a functor) 
§2.L 
Casimir (of a Lie algebra) 248.J 
central (in a lattice) 243.E 
compact (of a topological Abelian group) 
422.F 
conformal arc 110.D 
conjugate (in a field) 149.J 
conjugate (in a group) 190.C 
contact 428.E 
contact (in a space with a Lie transformation 
group) 110.A 
cyclic 251.5 
even (of a Clifford algebra) 61.B 
finite, method 304.C 
function 198.1 339.A 
function, in the wider sense 198.0 
generalized nilpotent (in a commutative Banach 
algebra) 36.E 
generating 390.G 
greatest (in an ordered set) 311.B 
homogeneous (of a graded ring) 369.B 
homogeneous (of a homogeneous ring) 369.B 
hypersurface 324.B 
idempotent (of a ring) 368.B 450.0 
identity (of an algebraic system) 409.C 
identity (of a field) 149.A 
identity (ofa group) 190.A 
identity (of a ring) 368.A 
inseparable (of a field) 149.H 
integral (of a system of total differential equa- 
tions) 428.E 
inverse (ina group) 190.A 
inverse (in a ring) 368.B 
inverse function 198.L 
invertible (ofa ring) 368.B 
irreducible (of a ring) 67.H 
isotropic (with respect to a quadratic form) 
348.E 
k-dimensional integral 191.I 
Kepler orbita] 309.B 
least (in an ordered set) 311.B 
left inverse (of an element of a ring) 368.B 
line 111.C 
linearly dependent 2.E 
linearly independent 2.E 
matrix 351.B 
maximal (in an ordered set) 311.B 
maximum (in an ordered set) 311.B 
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minimal (in an ordered set) 311.B 
minimum (in an ordered set) 311.B 
negative (of an ordered field) 149.N 
Neutral (in a lattice) 2 








ordinary 191] 
ordinary integral 428.E 
oriented (in a covering manifold of a homoge- 
neous space) 110.A 
orthogonal (of a ring) 368.B 
osculating 309.D 
polar (of an analytic function in the wider sense) 
198.0 
polar (of an integral element} 428.E 
positive (of an ordered field) 149.N 
prime (of a ring) 67.H 
prime (for a valuation) 439.E 
primitive (of an extension ofa field) 149.D 
projective line 110.B 
purely inseparable (of a field) 149.H 
quasi-inverse (in a ring) 368.B 
quasi-invertible (of a ring) 368.B 
quasiregular (of a ring) 368.B 
ramified 198.0 
rational 198.0 
regular (of a connected Lie group) 249.P 
regular (of a ring) 368.B 
regular integral 428.E 
right inverse (in a ring) 368.B 
separable (of a field) 149.H 
singular (of a connected Lie group) 249.P 
singular (with respect to a quadratic form) 
348.E 
surface 324.B 
surface, union of 324.B 
torsion (of an A-module) 277.D 
transcendental (of a field) 149.E 
transgressive (in the spectral sequence of a fiber 
space) 148.E 
triangular 304.C 
unit (of a field) 149.A 
unit (of a group) 190.A 
unit (ofa ring) 368.A 
unity (of a field) 149.A 
unity (of a ring) 368.A 
volume (of an oriented C*-manifold) 105.W 
volume, associated with a Riemannian metric 
105.W 
zero (of an additive group) 2.E 190.A 
zero (of a field) 149.A 
zero (of a linear space) 256.A 
zero (ofa ring) 368.A 
elementarily equivalent structures 276.D 
elementary (Kleinian group) 234.A 
elementary (path) 186.F 
elementary Abelian functions 3.M 
elementary Abelian group 2.B 
elementary catastrophe 51.E 
elementary collapsing 65.C 
elementary contract 102.C 
elementary divisor 
(ofa matrix) 269.E 
simple (of a matrix) 269.E 
elementary event(s) 342.B 
space of 342.B 
elementary extension 276.D 
elementary function(s} 131 
ofclassn 131.A 
elementary Hopf algebra 203.D 
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Elementary ideal 


elementary ideal 235.C 
elementary kernel (of a linear partial differential 
operator) 320.H 
elementary number theory 297 
fundamental theorem of, 297.C 
elementary particle(s) 132 
elementary solution App. A, Table 15.V 
(of a differential operator) 112.B 
(of a linear partial differential operator) 320.H 
(of partial differential equations of elliptic 
type 323.B 
elementary symmetric function 337.1 
elementary symmetric polynomial 337.1 
elementary topological Abelian group  422.E 
eliminate (variables from a family of polynomials) 
369.E 
elimination 
design for two-way, of heterogeneity | 102.K 
forward 302.B 
Gaussian 302.B 
Gauss-Jordan 302.B 
elimination method, Sylvester 369.E 
ellipse 78.A 
ellipsoid 350.B 
of inertia 271.E 
of revolution 350.B 
ellipsoidal coordinates 90.C 133.A, App. A, Table 
3.V 
ellipsoidal harmonics 133.B 
four species of 133.C 
ellipsoidal type, special function of 389.A 
elliptic 
(differential operator) 112.A 323.A 237.H 
(pseudodifferential operator) 323.K 
(Riemann surface) 77.B 367.D 
(solution) 323.D 
analytically hypo- 323.1 
analytic hypo- 112.D 
hypo- 112.D 189.C 323.1 
microlocally 345.A 
strongly 112.G 323.H 
elliptic omplex (on a compact C?-manifold) 237.H 
elliptic coordinates 90.C 350.E, App. A, Table 3.V 
elliptic curve 9.C 
L-functions of 450.8 
elliptic cylinder 350.B 
elliptic cylinder function 268.B 
elliptic cylindrical coordinates App. A, Table 3.V 
elliptic cylindrical surface 350.B 
elliptic domain 77.B 
elliptic function(s) 134 
of the first kind 134.G 
Jacobi App. A, Table 16.III 
of the second kind 134.G 
ofthe third kind 134.H 
Weierstrass App. A, Table 16.IV 
elliptic function field OD 
elliptic geometry 285.A 
elliptic integral 11.C 134.A, App. A, Table 16.1 
complete App. A, Table 16.1 
of the first kind 134.A, App. A, Table 16.1 
of the first kind, complete 134.B 
of the first kind, incomplete 134.B 
of the second kind 134.4, App. A, Table 16.I 
of the second kind, complete 134.C 
of the third kind 134.4, App. A, Table 16.1 
elliptic integrals and elliptic functions App. A, 
Table 16 
elliptic irrational function 134.A 
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elliptic modular group 122.D 
elliptic motions 55.A 
elliptic operator 323.H 
microlocally 345.A 
strongly 323.H 
elliptic paraboloid 350.B 
of revolution 350.B 
elliptic point 
(of a Fuchsian group) 122.C 
(on a surface) 111.H 
elliptic quadric hypersurface 350.G 
elliptic singularity 418.C 
minimally 418.C 
elliptic space 285.C 
elliptic surface 72.K 
elliptic theta function 134.1, App. A, Table 16.II 
elliptic transformation 74.F 
elliptic type 323.A,D 
(Lie algebra) 191.D 
partial differential equation of 323, App. A, 
Table 15.VI 
elongation strain 271.G 
embedded 
(into a topological space) 425.J 
hyperbolically 21.0 
embedded Markov chain 260.H 
embedded primary component (of an ideal) 67.F 
embedded prime divisor (of an ideal) 67.F 
embedding 105.K 
(of categories) 52.H 
(of a C?-manifold) 105.K 
(of a topological space) 425.J 
formula of, form 303.D 
generalized Borel 384.D 
PL 65.D 
regular 105.K 
Tanaka 384.D 
toroidal 16.Z 
torus 16Z 
embedding principle (in dynamic programming) 
127.B 
embedding theorem 
full (of an Abelian category) 52.N 
Irwin 65.D 
Menger-Nobeling 117.D 
Sobolev-Besov 168.B 
Tikhonov 425.T 
Emden differential equation 291.F 
Emden function, Lane- 291.F 
emission 325.A 
backward 325.A 
forward 325.A 
empirical characteristic function 396.C 
empirical constant 19.F 
empirical distribution function 250.F 374.E 396.C 
empirical formula 19.F 
empiricism, French 156.C 
empty set 33.B 381.A 
axiom of 33.B 
enantiomorphic pair 92.A 
enantiomorphous 92.A 
encoder 213.D 
encoding 63.A 
end 
(ofanarc) 93.B 
(of a noncompact manifold) 178.F 
(ofa segment) 155.B 
(of a segment in an affine space) 7.D 
Heins 367.E 
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lower (of a curvilinear integral) 94.D 

upper (of a curvilinear integral) 94.D 
endogenous variable 128.C 
endomorphism 

(of an algebraic system)  409.C 

(ofa group) 190.D 

(of a polarized Abelian variety) 3.G 

(of a probability space) 136.E 

(ofaring) 368.D 

anti- {of a group) 190.D 

anti- (of a ring) 368.D 

aperiodic 136.E 

entropy of 136.E 

exact (of a measure space) 136.E 

periodic (at a point) 136.E 

ring of (of an Abelian variety) 3.C 
endomorphism ring 

(of an Abelian variety) 3.C 

(ofa module) 277.B 368.C 
endpoint 

(of an ordinary curve) 93.C 

left (of an interval) 355.C 

right (of an interval) 355.C 
end vertex 186.B 
energy 195.B 338.B 

binding 351.D 

free 340.B 402.G 

Gibbs free 419.C 

Helmholtz free 419.C 

internal 419.A 

kinetic 271.C 351.D 

mean 402.G 

mean free 340.B 402.G 

mutual 338.B 

potential 271.C 

rest 359.C 

total 271.C 
energy density 195.B 
energy equation (for a fluid) 205.A 
energy function 126.L 279.F 
energy inequality 325.C 
energy integral 420.A 
energy minimum principle 419.C 

Gibbs 419.C 
energy-momentum 4-vector 258.C 
energy-momentum operator 258.D 
energy-momentum tensor 150.B 359.D 
energy principle 338.D 
energy spectrum function 433.C 
energy spectrum tensor 433.C 
energy surface 126.L 402.C,G 


Enestróm theorem, Kakeya- (on an algebraic equa- 


tion) 10.E 
Engel theorem 248.F 
engulfing lemma 65.C 
enlargement 293.B 
Enneper formula, Weierstrass- 275.A 
Enneper surface 275.B 
Enriques surface 72.K 
ensemble 402.D 
canonical 402.D 
grand canonical 402.D,G 
microcanonical 402.D 
Enskog method 217.N 
enthalpy 419.C 
enthalpy minimum principle 419.C 
entire algebroidal function 17.B 
entire function 429.A 
rational 429.A 
transcendental 429.A 
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Equation(s) 


entire linear transformation 74.E 
entrance 281.C 


entrance boundary (of a diffusion process) 115.B 


entrance boundary point 260.1 
entropy 212.B 

(of an endomorphism) 136.E 

(in information theory) 213.B 

(in statistical mechanics) 402.B 

(in thermodynamics) 419.A 

closed system | 402.G 

completely positive 136.E 

conditional 213.B 

ofa distribution 403.B 

of the endomorphism o  136.E 

£- (of source coding theorem) 213.E 

maximal 136.C,H 

mean 402.G 

open system | 402.G 

ofa partition 136.E 

relative 212.B 

topological 136.H 

topological, of f with respect tox 126.K 
entropy condition 204.G 
entropy conjecture 126.K 
entropy maximum principle 419.A 
entropy production, equation of 205.A 
entry (of a matrix) 269.A 
enumerable predicate, recursively 356.D 
enumerable set, recursively 356.D 
enumerating predicate 356.H 
enumeration method, implicit 215.D 
enumeration theorem 356.H 

Pólya's 66.E 
envelope 

(of a family of curves) 93.I 

ofholomorphy 21.F 

injective 200.I 

lower 338.M 


upper (of a family of subharmonic functions) 


193.R 
envelope power function 400.F 
enveloping algebra 231.A 


special universal (of a Jordan algebra) 231.C 


universal (of a Lie algebra) 248.J 
enveloping surface 111.1 
enveloping von Neumann algebra 36.G 
epicycloid 93.H 
epidemiology 40.E 
epimorphism (in a category) 52.D 200.Q 
epitrochoid 93.H 
Epstein zeta function 450.K 
equality 
Chapman-Kolmogorov 261.A 
Jacobe-Beihler 328 
Parseval 18.B 159.A 160.C 192.K 197.C 
200.B,C,E 
equal weight, principle of 402.D 
equation(s) 
Abelian 172.G 
adjoint differential 252.K 
algebraic 10, App. A, Table 1 
algebraic differential 113 288.A 
approximate functional (for zeta function) 
450.B 
basic 320.E 
Bellman 405.B 
bifurcation 286.V 
binomial 10.C 
biquadratic App. A, Table 1 
Boltzmann 41.A 402.B 
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Equation(s) 


Boussinesq 387.F 

Briot-Bouquet differential 288.B 

Callan-Symanzik 361.B 

canonical 324.E 

canonical forms of the (of surfaces) 350.B 

Cauchy-Riemann (differential) 21.C 198.A 
274.G 320.F 

characteristic (for a homogeneous system of 
linear ordinary differential equations) 252.J 

characteristic (of a linear difference equation) 
104.E 

characteristic (of a linear ordinary differential 
equation) 252.E 

characteristic (of a matrix) 269.F 

characteristic (of an autonomous linear system) 
163.F 

characteristic (of a partial differential equation) 
320.D 

characteristic (of a partial differential equation 
of hyperbolic type) 325.A,F 

Chaplygin differential 326.B 

Chapman-Kolmogorov 260.A 261.A 

Charpit subsidiary 82.C 320.D 

of Codazzi 365.C 

Codazzi-Mainardi 111.H, App. A, Table Al 

of a conic section 78.C 

of a conic section, canonical form of 78.C 

of continuity 130.A 204.B 205.A 

cubic 10.D, App. A, Table 1 

cyclic 172.G 

delay-differential 163.A 

deRham 274.G 

difference  — difference equation 

difference-differential 163.A 

differential — differential equation(s) 

diffusion-convection 304.B 

Diophantine 118.A 

Dirac 377.C 415.G 

DLR 402.G 

eikonal 324.E 325.L 

energy (for a fluid) 205.A 

of entropy production 205.A 

Ernst 359.D 

estimating 399.P 

Euler (calculus of variations) 46.B 

Euler (of a perfect fluid) 204.E 

Euler, of motion (of a perfect fluid) 205.A 

Euler differential (dynamics of rigid bodies) 
271.E 

Euler-Lagrange 46.B 

evolution 378.G 

exterior field 359.D 

field 150.B 

Fokker-Planck 402I 

functional —- functional equation 

functional-differential 163.A 

Galois 172.G 

of Gauss (on an isometric immersion) 365.C 

Gauss (on surfaces) 111.H 

Gel'fand-Levitan-Marchenko 287.C 387.D 

general 172.G 

general Navier-Stokes 204.F 

Hamilton differential 324.E 

Hamilton-Jacobi 108.B 

Hamilton-Jacobi differential 271.F 324.E 

heat 327.A, App. A, Table 15.VI 

of heat conduction 327.A 

Hill determinantal 268.B 

indicial 254.C 

induction 259 


integral — integral equation(s) 

integrodifferential 163.A 222 

integrodifferential, of Fredholm type 222.A 

integrodifferential, of Volterra type 222.A 

interior field 359.D 

Kadomtsev-Petvyashvili 387.F 

KdV 387.B 

Kepler 309.B 

Klein-Gordon 377.C 415.G 

Kolmogorov backward 115.A 260.F 

Kolmogorov forward 115.A 260.F 

Kónigs-Schróder 44.B 

Korteweg-de Vries 387.A 

Lagrange, of motion 271.F 

Landau 146.C 

Landau-Nakanishi 146.C 

Langevin 45.1 402.K 

Lewy-Mizohata . 274.G 

likelihood 399.M 

linear 10.D 

linear, system of 269.M 

linear integral 217.A 

linear ordinary differential 252.A 

linear structure, system 128.C 

Lippman-Schwinger 375.C 

local (of a divisor on an open set) 16.M 

logistic 263.A 

master 402.] 

matrix Riccati differential 86.E 

Maxwell 130.A 

microdifferential 274.G 

minimal surface 275.A 

of motion (of a fluid) 205.A 

of motion (of a model) 264 

of motion (in Newtonian mechanics) 271.A 

of motion (of a particle in a gravitational field) 
359.D 

of motion, Euler (of a perfect fluid) 205.B 

of motion, Heisenberg 351.D 

of motion, Lagrange 271.F 

natural(ofa curve) 111.D 

natural (of a surface) 110.A 

Navier-Stokes 204.B 205.C 

normal (in the method of least squares) 403.E 
302.E 397.J 

ordinary differential — ordinary differential 
equation(s) 

of oscillation App. A, Table 15.VI 

Painlevé 288.C 

parabolic cylindrical App. A, Table 14.II 

partial differential — partial differential 
equation(s) 

Pell 118.A 

Pfaffian 428.A 

Pfaffian, system of 428.A 

Poisson 338.A 

Prandtl boundary layer 205.C 

Prandtl integrodifferential 222.C 

pressure 205.B ` 

primitive 172.G 

quadratic 10.D, App. A, Table 1 

quartic 10.D, App. A, Table 1 

radial 315.E 

random Schrödinger 340.E 

of a rarefied gas 41.A 

rational differential 288.A 

reciprocal 10.C 

regular local (at an integral point) 428.E 

renewal 260.C 

renormalization 111.B 
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resolvent 251.F 
retarded differential 163.A 
Ricci 365.C 
Schlesinger 253.E 
Schrödinger 351.D 
secular 55.B 269.F 
self-adjoint differential 252.K 
self-adjoint system of differential 252.K 
Sine-Gordon 387.A 
singular integral 217.J 
of sound propagation 325.A 
special functional 388 
of state 419.A 
structure (of an affine connection) 417.B 
structure (for a curvature form) 80.G 
structure (for a torsion form) 80.H 
of structure (of a Euclidean space) 111.B 
of structure (for relative components) 110.A 
system of 10.A 
system of linear ordinary differential 252.G 
telegraph 325.A, App. A, Table 15.III 
time-dependant Schrédinger 351.D 
time-independant Schrédinger 351.D 
transport 325.L 
two-dimension KdV 387.F 
variational 316.F 394.C 
of a vibrating membrane 325.A 
of a vibrating string 325.A 
wave 325.A 446.A, App. A, Table 15.III 
Wiener-Hopf integrodifferential 222.C 
Yang-Mills 80.Q 
Yule-Walker 421.D 
equator (of a sphere) 140 
equiangular spiral 93.H 
equianharmonic range of points 343.D 
equicontinuous (family of mappings) 435.D 
equicontinuous group of class (CH) 378.C 
equicontinuous semigroup of class (CH) 378.B 
equidistant hypersurface (in hyperbolic geometry) 
285.C 
equilateral hyperbola 78.E 
equilateral hyperbolic coordinates 90.C, App. A, 
Table 3.V 
equilateral triangle solution 420.B 
equilibrium, Nash 173.C 
equilibrium figures 55.D 
equilibrium Gibbs distribution 136.C 
equilibrium mass distribution 338.K 
equilibrium point 
(ofa flow) 126.D 
(in an N-person differential game) 108.C 
(in a two-person zero-sum game) 173.C 
equilibrium potential 260.D 
equilibrium principle 338.K 
equilibrium state 136.H 340.B 419.A 
equilibrium statistical mechanics 402.A 
equilibrium system, transformation to 82.D 
equilong transformation 76.B 
equipollent sets 49.A 
equipotential surface 193.J 
equipotent sets 49.A 
equivalence 
(of categories) 52.H 
(in a category) 52.D 
(of complexes) 200.H 
(of coverings) 91.A 
anti- (of categories) 52.H 
chain 200.H 
cochain 200.F 
combinatorial 65.A 
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Equivalent 


C- 126B 
CR- 344. 
homotopy 202.F 
Kakutani 136.F 
Lax, theorem 304.F 
natural 52J 
principle of (in insurance mathematics) 214.A 
principle of (in physics) 359.D 
simple homotopy 65.C 
topological 126.B 
equivalence class 135.B 
linear (of divisors) 16.M 
equivalence properties 135.A 
equivalence relations 135 358.A 
proper (in an analytic space) 23.E 
equivalent 
(additive functionals) 261.E 
(arc) 246.A 
(coordinate bundle) 147.B 
(covering) 91.A 
(extension by a C*-algebra) 36.J 
(fiber bundle) 147.B 
(formula) 411.E 
(functions with respect to a subset of C") 21.E 
(G-structures) 191.A 
(knot) 235.A 
(linear representation) 362.C 
(measure) 225J 
(methods of summation) 379.L 
(proposition) 411.B 
(quadratic form) 348.A 
(quadratic irrational numbers) 182.G 
(relation) 135.B 
(space group) 92.A 
(stochastic process) 407.A 
(surfaces in the sense of Fréchet) 246.1 
(system of neighborhoods) 425.E 
(unfolding) 51.E 
(unitary representation) 437.A 
(valuation) 439.B 
(word) 31.B 
algebraically (cycles) 16.R 
algebraically, top 16.P 
arithmetically 92.B 276.D 
C- 114H 
certainty 408.B 
chain 200.C 
y- 431.F 
combinatorially 65.A 
conformally 77.A 191.B 367.A 
countably (under a nonsingular bimeasurable 
transformation) 136.C 
C- 126B 
elementarily 276.D 
fiber homotopy (vector bundles) 237.1 
finitely (under a nonsingular bimeasurable 
transformation) 136.C 
flow 126.B 
T- (points) 122A 
geometrically 92.B 
homotopy (systems of topological spaces) 
202.F 
k-(C®-manifolds) 114.F 
linearly (divisors) 16.M 72.F 
locally (G-structure) 191.H 
numerically (cycles) 16.Q 
D-linearly, divisors 9.F 
Q- 126.H 
(PL) 65.D 
properly (binary quadratic forms) 348.M 
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pseudoconformally 344.A 

quasi- (unitary representations) 437.C 
rationally (cycles) 16.R 

right 51.C 

simple homotopy 65.C 

stably (vector bundles) 237.B 


stably fiber homotopy (vector bundles) 237.1 


topologically 126.B,H 

uniformly (uniform spaces) 436.E 

unitarily (self-adjoint operators) 390.G 
equivalent affinity 7.E 


equivariant Atiyah-Singer index theorem 237.H 


equivariant cohomology 431.D 
equivariant J-group 431.F 
equivariant J-homomorphism 431.F 
equivariant K-group 237.H 
equivariant mapping (map) 431.A 
equivariant point (of a mapping) 153.B 
equivariant point index (of a mapping) 153.B 
Eratosthenes’ sieve 297.B 
Erdós theorem, Chung- 342.B 
ergodic 

(Markov chain) 260.J 

(transformation) 136.B 
ergodic capacity 213.F 
ergodic class 260.B 

positive recurrent 260.B 
ergodic decomposition (of a Lebesgue measure 

space) 136.H 
ergodic homeomorphism 

strictly 136.H 

uniquely 136.H 
ergodic hypothesis 136.A 402.C 
ergodic information source 213.C 
ergodic lemma, maximal 136.B 
ergodic Szemerédi theorem 136.C 
ergodic theorem 136.A,B ` 

Abelian 136.B 

dominated 136.B 

individual 136.B 

local 136.B 

mean 136.B 

multiplicative 136.B 

pointwise 136.B 

ratio 136.B 

subadditive 136.B 
ergodic theory 136 342.A 
Erlang distribution, k- 260.1 
Erlangen program 137 
Ernst equation 359.D 
error(s) 138.A 

accumulated 138.C 

burst 63.E 

discretization 303.B 

of the first kind 400.A 

of input data 138.B 

local truncation 303.E 

mean square 399.E 403.E 

propagation of 138.C 

roundoff 138.B 303.B 

of the second kind 400.A 

theory of 138.A 

truncation 138.B 303.B 
error analysis 138 

backward 302.B 
error constant 303.E 
error-correcting 63,A 
error-correcting capability 63.B 
error-detecting 63.A 
error estimate, one-step-two-half-steps 303.D 


error function 167.D, App. A, Table 19.II 
error matrix 405.G 

error probability 213.D 

error space 403.E 


error sum of squares (with n—s degree of freedom) 


403.E 
errorterm 403.D 
error vector 102.A 


essential (conformal transformation group) 364.F 
essentially bounded (measurable function) 168.B 


essentially complete class 398.B 
essentially normal 390.I 
essentially self-adjoint 251.E 390.1 


essentially singular point (with respect to an analytic 


set) 21.M 
essentially unitary 390.1 
essential part 260.I 
essential singularity 
(of an analytic function in the wider sense) 
198.P 
(of a complex function) 198.D 
essential spectrum 390.E,H 
essential support (of a distribution) 274.D 
essential supremum (of a measurable function) 
168.B 
Estes stimulus sampling model 346.G 
estimable (parametric function) 399.C 
estimable parameter 403.E 
linearly 403.E 
estimate 399.B 
a priori 323.C 
nonrandomized 399.B 
one-step-two-half-steps error 303.D 
Schauder 323.C 
estimating equation 399.P 
estimating function 399.P 
likelihood 399.M 
estimating parameters, design for 102.M 
estimation 
Hadamard App. A, Table 8 
interval | 399.0 401.C 
point 371.H 399.B 401.C 
region 399.0 
statistical 200. A. App. A, Table 23 
estimation space 403.E 
estimator 399.B 
asymptotically efficient 399.N 
BAN 399.K,N 
based on an estimating function 399.P 
Bayes 399.F 
best asymptotically normal 399.K 
best invariant 399 
best linear unbiased 403.E 
CAN 399.K 
consistent 399.K 


consistent and asymptotically normal 399.K 


efficient 399.D 

first-order asymptotic efficient 399.0 

first-order efficient 399.0 

generalized least squares 403.E 

invariant 399] 

kth-order AMU 399.0 

kth-order asymptotically median unbiased 
399.0 

L- 371.H 

least squares 403.E 

M- 371.H 

maximum likelihood 399.M 

mean unbiased 399.C 

median unbiased 399.C 


1981 


ML 399.M 
modal unbiased 399.C 
moment method 399.L 
Pitman 399.G 
R- 371.H 
randomized 399.B 
ratio 373.C 
state 86.E 
Stein shrinkage 399.G 
superefficient 399.N 
UMYV unbiased 399.C 
unbiased 399.C 
uniformly minimum variance unbiased 399.C 
eta function 
(ofa Riemann manifold) 391.L 
Dedekind 328.A 
étale morphism 16.F 
étale neighborhood 16.AA 
étale site 16.AA 
étale topology 16.AA 
Euclid axiom 139.A 
Euclidean algorithm 297.A 
of polynomials 337.D 
Euclidean cell complex 70.B 
Euclidean complex 70.B 
Euclidean connection 364.B 
manifold with 109 
Euclidean distance 139.E 
Euclidean field 150.F 
Euclidean field theory 150.F 
Euclidean geometry 139 
n-dimensional 139.B 181 
non- —non-Euclidean geometry 285 
in the wider sense 139.B 
Euclidean group, locally 423.M 
Euclidean Markov field theory 150.F 
Euclidean method 150.F 
Euclidean polyhedron 70.B 
Euclidean simplicial complex 70.B 
Euclidean space(s) 140 
locally 259.B 425.V 
non- 285.A 
theorem on invariance of dimension of 117.D 
Euclidean space form 412.H 
Euclidean type (building) 13.R 
Euclid ring 67.L 
Euler, L. 141 
Euler angles 90.C, App. A, Table 3.V 
Euler characteristic (of a finite Euclidean cellular 
complex) 201.B 
Euler class (of M) 201.N 
Euler constant 174.A 
Euler criterion 297.H 
Euler differential equation (dynamics of rigid 
bodies) 271.E 
Euler equation 
(calculus of variations) 46.B 
(of a perfect fluid) 204.E 
Euler equation of motion (of a perfect fluid) 205.A 
Euler formula 131.G 
Euler function 295.C 
Euler graph 186.F 
Euler infinite product expansion | 450.B 
Euler integral 
of the first kind 174.C 
of the second kind 174.A 
Euler-Lagrange differential equation 46.B 
Euler linear ordinary differential equation App. A, 
Table 14.1 
Euler-Maclaurin formula 379.J 





Subject Index 
Exact 


Euler method 
of describing the motion of a fluid 205.A 
of numerical solution of ordinary differential 
equations 303.E 
summable by 379.P 
ofsummation 379.P 
Euler number 177.C 201.B, App. B, Table 4 
Euler path 186.F 
Euler-Poincaré characteristic 16.E 201.B 
Euler-Poincaré class 56.B 
(of a manifold) 56.F 
universal 56.B 
Euler-Poincaré formula 201.B,F 
Euler polynomial 177.C 
Euler product 450.B 
Euler relation 419.B 
Euler square 241.B 
Euler summation formula 295.E 
Euler theorem on polyhedra 201.F 
Euler transformation (of infinite series) 379.1 
evaluable (locally convex space) 424.I 
evaluation and review technique program 376 
Evans-Selberg theorem  48.E 338.H 
Evans theorem 48.E 
even element (of a Clifford algebra) 61.B 
even function 165.B 
even half-spinor 61.E 
even half-spin representation 61.E 
even-oddness conservation laws 150.D 
even permutation 151.G 
even state 415.H 
event(s) 281.D 342.B 
complementary 342.B 
delayed recurrent 260.C 
elementary 342.B 
exclusive 342.B 
impossible 342.B 
independent 342.B 
inferior limit 342.B 
intersection of 342.B 
measurable 342.B 
probability ofan 342.B 
with probability 1  342.B 
product 342.B 
random 342.B 
recurrent 250.D 260.C 
space of elementary 342.B 


sum 342.B 

superior limit 342.B 
sure 342.B 
symmetric 342.G 
tail 342.B 


event commutativity 346.G 
event horizon 359.D 
Everett formula (for functions of two variables) 
App. A, Table 21 
Everett interpolation formula App. A, Table 
21 
everywhere 
almost 270.D 342.D 
nearly 338.F 
quasi- 338.F 
evolute (of a curve) 111.E 
evolution equation 378 
evolution operator 378.G 
holomorphic 378.1 
exact 
(additive covariant functor) 200.I 
(differential on a Riemann surface) 367.H 
(endomorphism) 136.E 


Subject Index 
Exact differential equation 


(in Galois cohomology) 172 
(in sheaf theory) 383.C 


half- 200.1 
left- 200.1 
right- 200 


exact differential equation App. A, Table 14.1 
exact differential form 105.Q 
exact functor 52.N 
exact sampling theory 401.F 
exact sequence 
(of A-homomorphisms of A-modules) 277.E 
of cohomology 200.F 
cohomology 201.L 
of Ext 200.G 
fundamental (on cohomology groups) 200.M 
Gysin (of a fiber space) 148.E 
of homology 200.C 
homology (of a fiber space) 148.E 
homology (for simplicial complexes) 201.L 
homotopy 202.L 
homotopy (of a fiber space) 148.D 
homotopy (of a triad) 202.M 
homotopy (of a triple) 202.L 
Mayer-Vietoris (for a proper triple) 201.C 
Puppe 202.G 
reduced homology 201.F 
relative Mayer-Vietoris 201.L 
(R, S)- (of modules) 200.K 
short 200.1 
of Tor 200.D 
Wang (ofa fiber space) 148.E 
exceptional 
(Jordan algebra) 231.A 
(leaf) 154.D 
exceptional compact real simple Lie algebra 248.T 
exceptional compact simple Lie group 249.L 
exceptional complex simple Lie algebra 248.8 
exceptional complex simple Lie group 249.M 
exceptional curve 15.G 
of the first kind 15.G 
of the second kind 15.G 
exceptional function, Julia 272.F 
exceptional orbit 431.C 
exceptional sets 192.R 
exceptional value 
(of a transcendental entire function) 429.B 
Borel 272.E 
Nevanlinna 272.E 
Picard 272.E 
excess 178.H, App. A, Table 6.III 
coefficient of 341.H 396.C 
spherical 432.B 
total 178.H 
excessive (function) 260.D 261.D 
a- 261.D 
excessive measure 261.F 
exchange 420.D 
exchange of stability 286.T 
excision isomorphism 201.F,L 
excluded middle, law of 156.C 411.L 
exclusive events 342.B 
exhaustion 178.F 
existence theorem 
(in class field theory) 59.C 
(for ordinary differential equations) 316.C 
Cartan-Kahler 191.1 428.E 
Cauchy 320.B 
Cauchy-Kovalevskaya 321.A 
existential proposition 411.B 
existential quantifier 411.C 


1982 


exit 281.C 
exit boundary (of a diffusion process) 115.B 
exit boundary point 260.I 
exit time 261.B 
exogenous variable 128.C 
exotic sphere 114.B 
exp A (exponential function of matrix A) 269.H 
expansion 65.C 
asymptotic 30.A, App. A, Table 17.I 
asymptotic (of a pseudodifferential operator) 
345.A 
Cornish-Fisher 374.F 
Edgeworth 374.F 
e 361.C 
Laurent 198.D 
method of matched asymptotic 112.B 
Minakshisundaram-Pleijel asymptotic 391.B 
orthogonal 317.A 
partial wave 375.E 386.B 
Taylor (of an analytic function of several vari- 
ables) 21.B 
Taylor (of a holomorphic function! 339.A 
Taylor, and remainder App. A, Table 9.IV 
Taylor, formal 58.C 
expansion coefficient 317.A 
expansion formula, q- 134.I 
expansion method 205.B 
expansion theorem 306.B 
Hilbert. Schmidt 217.H 
Laplace (on determinants) 103.D 
expansive 126J 
expectation 115.B 342.C 
conditional 342.E 
mathematical 341.B 
expectation value (of an operator) 351.B 
expected amount of inspection 404.C 
expected value (of a random variable) 342.C 
experiment(s) 
design of 102 
factorial 102.H 
S*factorial 102.H 
statistical 398.G 
experimental analysis 385.A 
experimentation model 385.A 
explanatory variable 403.D 
explicit 
(difference equation in a multistep method) 
303.E 
(Runge-Kutta method) 303.D 
explicit function 165.C 
explicit method 303.E 
explicit reciprocity laws 14.R 
explicit scheme 304.F 
exploratory procedures 397.Q 
exploring a response surface, designs for 102.M 
explosion, O- 126.J 
explosion time 406.D 
exponent 
(of an Abelian extension) 172.F 
(of an algebra class) 29.E 
(ofa finite group) 362.G 
(of a Kummer extension) 172.F 
(ofa power) 131.B 
(of a regular singular point) 254.C 
(of a stable distribution) 341.G 
characteristic (of an autonomous linear system) 
163.F 
characteristic (of the Hill differential equation) 
268.B 
characteristic (of a variational equation) 394.C 


1983 


conjugate 168.C 
of convergence 429.B 
critical 111.C 
integral 167.D 
one-sided stable process of 5.F 
of the stable process 5.F 
subordinator of 5.F 
exponential curve 93.H 
exponential dichotomy 290.B 
exponential distribution 341.D, App. A, Table 22 
two-sided App. A, Table 22 
exponential family of distributions 396.G 
exponential formula 286.X 
double 299.B 
exponential function 131.D 
with the base a 131.B 
ofan operator 306.C 
exponential generating function 177.A 
exponential group 437.U 
exponential Hilbert space 377.D 
exponential integral 167.D, App. A, Table 19.II 
exponential lattice 287.A 
exponential law (on cardinal numbers) 49.C 
exponentially decreasing Fourier hyperfunction 
125.BB 
exponentially decreasing real analytic function 
125.BB 
exponentially stable 163.G 394.B 
exponential mapping 
(of a Lie algebra into a Lie group) 249.Q 
(of a Riemannian manifold) 178.A 364.C 
exponential method, Borel 379.0 
exponential series 131.D 
exponential valuation 439.B 
p-adic 439.F 
exposed, strongly 443.H 
expression 
field of rational 337.H 
rational 337.H 
expx 131.D 
Ext 200.G 
exact sequence of 200.G 
Ext groups 200.G 
ExtR(A, B) 200.K 
Ext$(M, NI 200.G 
extended Dynkin diagram App. A, Table 5.I 
extended hypergeometric function, Barnes 206.C 
extended real number 87.E 
extension 
(ofaconnection) 80.F 
(of afield) 149.B 
(of a fractional ideal) 14.1 
(ofa group) 190.N 
(of an ideal of compact operators) 36.J 390.J 
(of an isomorphism of subfields) 149.D 
(ofa mapping) 381.C 
(of modules) 200.K 
(ofan operator) 251.B 
(of a solution of an ordinary differential 
equation) 316.C 
(ofa valuation) 439.B 
Abelian (of a field) 172.B 
algebra 200.L 
algebraic (ofa field) 149.E 
Artin-Schreier (of a field) 172.F 
basic Z- 14L 
central (of a group) 190.N 
of the coefficient ring 29.A 
cone 65.D 
cyclic (of a field) 172.B 


Subject Index 
External (in nonstandard analysis) 


cyclotomic Z,- 14.L 
elementary 276.D 
finite 149.F 
Friedrichs 112.1251.E 
Galois 172.B 
T- 14L 
group 200.M 
inseparable (of a field) 149.H 
Kummer (of a field) 172.F 
Lebesgue 270.D 
linear (of a rational mapping to an Abelian 
variety) 9.E 
maximal Abelian 257.F 
maximal separable (of a field) 149.H 
natural (of an endomorphism) 136.E 
normal 149.G 251.K 
p- (of a field) 59.F 
p-adic 439.F 
purely inseparable (of a field) 149.H 
Pythagorean (of a field) 155.C 
regular (of a field) 149.K 
scalar (ofan algebra) 29.A 
scalar (of an A-module) 277.L 
scalar (of a linear representation) 362.F 
separable (of a field) 149.H,K 
separably generated (of a field) 149.K 
simple (of a field) 149.D 
split (of a group) 190.N 
strong (of a differential operator) 112.E,F 
transcendental 149.E 
transitive (of a permutation group) 151.H 
unramified 14.1 257.D 
weak (of a differential operator) 112.E,F 
Zo 14L 
extensionality, axiom of (in axiomatic set theory) 
33.B 
extension field 149.B 
Picard-Vessiot 113 
strongly normal 113 
extension theorem 
first (in the theory of obstructions) 305.B 
Hahn-Banach 37.F 
Hopf (in measure) 270.E 
Kolmogorov 3411 
second (in the theory of obstructions) 305.C 
third (in the theory of obstructions) 305.C 
Tietze 425.Q 
Whitney 168.B 
extensive thermodynamical quantity 419.A 
exterior 
(ofan angle) 155.B 
(ofa polygon) 155.F 
(ofasegment) 155.B 
(of a subset) 425.N 
exterior algebra (of a linear space) 256.0 
exterior capacity, Newtonian 48.H 
exterior derivative (of a differential form) 105.Q 
exterior differential form of degreer 105.Q 
exterior field equation 359 
exterior point (of a subset) 425.N 
exterior power, p-fold 
(of a linear space) 256.0 
(ofa vector bundle) 147.F 
exterior problem (for the Dirichlet problem) 120.A 
exterior product 
(of differential forms) 105.Q 
(of elements of a linear space) 256.0 
(of a p-vector and a q-vector) 256.0 
(of two vectors) 442.C 
external (in nonstandard analysis) 293.B 


Subject Index 
External irregular point 


external irregular point 338.L 
external language 75.C 
external law of composition 409.A 
externally stable set 186.1 
external product 200.K 
external space (in the static model in catastrophe 
theory) 51.B 
external variable 264 
extinction probability 44.B 
extrapolation 176.K 223.A 
extrapolation method 
polynomial 303.F 
rational 303.F 
extremal (Jordan curve) 275.C 
extremal distance 143.B 
reduced 143.B 
extremal function, Koebe 438.C 
extremal horizontal slit mapping 367.G 
extremal length 143 
defined by Hersch and Pfluger 143.A 
with weight 143.B 
extremal quasiconformal mapping 352.C 
extremal vertical slit mapping 367.G 
extremely disconnected 37.M 
extreme point 
of a convex set HOA 
of a subset of a linear space 424.V 
extremum 
conditional relative (of a function) 106.L 
relative (of a function) 106.L 


F 


f (cardinal number of all real-valued functions 
on[0,1]) 49.A 
F, (finite field with q elements) 450.Q 
f-metric 136.F 
fy-metric 136.F 
F-compactification 207.C 
F-distribution 341.D 374.B, App. A, Table 22 
noncentral 374.B 
F-free (compact oriented G-manifold) 431.E 
F-test 400.G 
(F, F')-free (compact oriented G-manifold) 431.G 
E,set 270.C 
(F)-space 4241 
face 
(ofa complex) 13.R 
(of a convex cell) 7.D 
ith (of a singular q-simplex) 201.E 
q-(ofan n-simplex) 70.B 
face operator (in a semisimplicial complex) 70.E 
factor 
(of an element ofa ring) 67.H 
(of a factorial experiment) 102.H 
(of a von Neumann algebra) 308.F 
ofautomorphy 32.A 
composition (in a composition factor series in 
agroup) 190.G 
determinant (of a matrix) 269.E 
direct (of a direct product of sets) 381.E 
direct (of a group) 190.L 
Dirichlet discontinuous App. A, Table ON 
first (of a class number) 14.L 
integrating App. A, Table 14.1 
Krieger 308.1 
p- (of an element of a group) 3621 
Powers 308.1 
proper (of an element of a ring) 67.H 
second (of a class number) 14.L 


1984 


Ulm (of an Abelian p-group) 2.D 
factor A-module 277.C 
factor analysis 280.G 
factor analysis model 403.C 
factor group 190.C 
factorial 330, App. A, Table 17.II 
Jordan 330 
factorial cumulant 397.G 
factorial cumulant generating function 397.G 
factorial design 102.H 
balanced fractional 102.1 
fractional 102.1 
orthogonal fractional 102.1 
factorial effect 102.H 
factorial experiment 102.H 
S* 102.H 
factorial function 174.A 
factorial moment 397.G 
factorial moment generating functions 397.G 
factorial series 104.F 121.E 
factorization 
incomplete 302.C 
regular 251.N 
triangular 302.B 
factorization method 206.B 
factorization theorem 
(of an H?-function) 43.F 
Neyman 396.F 
unique (in an integral domain) 67H 
factor loading 280.G 346.F 
factor representation 
of a topological group  437.E 
of type I, H, or III 437.E 
factor ring 368.E,F 
factor score 280.G 346.F 
factor set(s) 
(in cohomology of groups) 200.M 
(of a crossed product) 29.D 
(in extensions of groups) 190.N 
(of a projective representation) 362.J 
associated (in extensions of groups) 190.N 
factor transformation (of a measure-preserving 
transformation) 136.D 
Faddeev-Popov ghost 132.C 150.G 
fading memory 163.] 
faithful 
(functor) 52.H 
(linear representation) 362.C 
(permutation representation) 362.B 
(weight on a von Neumann algebra) 308.D 
fully (functor) 52.H 
faithfully flat A-module 277.K 
faithfully flat morphism 16.D 
false 411.E 
regular 301.C 
false position, method of 301.C 
family 165.D 
algebraic (of cycles on an algebraic variety) 
16.R 
of compact complex manifolds 72.G 
of confocal central conics 78.H 
of confocal parabolas 78.H 
of confocal quadrics 350.E 
covering 16.AA 
crystal 92.B 
directed 165.D 
equicontinuous (of mappings) 435.D 
exponential, of distributions 396.G 
of frames (on a homogeneous space) 110.A 
of frames of order 1 110.B 


1985 


of functions 165.B,D 
of functions indexed by a set 165.B 
indexed by a set 165.D 
of mappings 165.D 
normal (of functions) 435.E 
parameter space 72.G 
of points 165.D 
of quasi-analytic functions 58.A 
quasinormal (of analytic functions) 435.E 
separating 207.C 
ofsets 165.D 381.B,D 
of sets indexed by a set 381.D 
tight (of probability measures) 341.F 
uniform, of neighborhood system | 436.D 
fan 16.Z 
Fannes-Verbeure inequality, Roebstorff- 402.G 
FANR 382C 
FAR 382.C 
Farey arc 4.B 
Farey dissection 4.B 
Farey sequence 4.B 
Farkas theorem, Minkowski- 255.B 
fast Fourier transform 142.D 
fast wave 259 
Fatou theorem 
on bounded functions ina disk 43.D 
on the Lebesgue integral 221.C 
favorable a priori distribution, least 208 H 
F.D.generator 136.E 
F.D. process 136.E 
feasible (flow) 281.B 
feasible directions, method of (in nonlinear 
programming) 292.E 
feasible region 264.B 292.A 
feasible solution 255.A 264.B 
basic 255.A 
feedback control 405.C 
Fefferman-Stein decomposition 168.B 
Feit-Thompson theorem (on finite groups) 151.D 
Fejér kernel 159.C 
Fejér mean 159.C 
Fejér theorem  159.C 
Feller process 261.B 
Feller transition function 261.B 
Fermat, P. de 144 
last theorem of 145 
Fermat number 297.F 
Fermat principle 180.A 441.C 
Fermat problem 145 
Fermat theorem 297.G 
fermions 132.A 351.H 
Fermi particle 132.A 
Fermi statistics 377.B 402.E 
Ferrar formula, Dixon- App. A, Table 19.IV 
Ferrari formula App. A, Table 1 
Feynman amplitude 146.B 
Feynman diagram 146.B 
Feynman graphs 146.A,B 386.C 
Feynman integrals 146 
Feynman-Kac formula 351.F 
Feynman-Kac-Nelson formula 150.F 
Feynman rule 146.A,B 


F.F. 136.F 
FFT 142.D 
fiber 


(of a fiber bundle) 147.B 

(of a fiber space) 148.B 

(ofa morphism) 16.D 

geometric (ofa morphism) 16.D 

integration along (of a hyperfunction) 274.E 


Subject Index 
Field(s) 


fiber bundle(s) 147 
associated 147.D 
of class C’ 147.0 
complex analytic 147.0 
equivalent 147.B 
isomorphic 147.B 
orientable 147.L 
principal 147.C 
real analytic 147.0 
fibered manifold 428.F 
fiber homotopy equivalent 2371 
stably 237I 
fiber homotopy type, spherical G- 431.F 
fibering, Hopf 147.E 
fiber mapping (map), linear 114.D 
fiber product 52.G 
fiber space(s) 72.1 148 
algebraic 72.1 
locally trivial 148.B 
n-connective 148.D 
spectral sequence of (of singular cohomology) 
148.E 
Spivak normal 114.J 
fiber sum 52.G 
fiber term (of a spectral sequence)  200.J 
Fibonacci sequence 295.A 
fibration (of a topological space) 148.B 
fibration, Milnor 418.D 
fictitious state 260.F 
fiducial distribution 401.F 
fiducial interval 401.F 
field(s) 149 
(of sets) 270.B 
(of stationary curves) 46.C 
absolute class 59.A 
algebraically closed 149.1 
algebraic function, in n variables 149.1 
algebraic number 14.B 
alternative 231.A 
Anosov vector 126J 
Archimedean ordered  149.I 
asymptotic 150.D 
Axiom A vector 126J 
basic (of a linear space) 256.A 
Borel 270.B,C 
C,- 118.F 
Cid) 118.F 
canonica] 377.C 
class 59.B 
coefficient (of an affine space) 7.A 
coefficient (of an algebra) 29.A 
coefficient (of a projective space) 343.C 
coefficient (of a semilocal ring) 284.D 
commutative 368.B 
composite 149.D 
conjugate 149 377.C 
cyclotomic 14,L 
decomposition (of a prime ideal) 14.K 
of definition (for an algebraic variety) 16.A 
differential 113 
electric 130.B 
Euclidean 150.F 
extension 149.B 
finite 149.C 
formal power series, in one variable 370.A 
of formal power series in one variable 370.A 
formally real 149.N 
free 150.A 
free Dirac 377.C 
free scalar 377.C 


Subject Index 
Field equation 


function 16.A 

Galois 149.M 

Galois theory of differential 113 
ground (of an algebra) 29.A 
ground (of a linear space) 256.A 
Hamiltonian vector 126.L 219.C 
holomorphic vector 72.A 
imaginary quadratic 347.A 
imperfect 149.H 
inertia (of a prime ideal) 
intermediate 149.D 
invariant 172.B 
Jacobi 178.A 
Lagrangian vector 
local 257.A 
localclass 257.A 
local class, theory 59.G 
linearly disjoint 149.K 
magnetic 130.B 

of moduli 73.B 
Morse-Smale vector 126.J 
noncommutative 149.A 
number 149.C 
ordered 149.N 
p-adic number 
perfect 149.H 
Picard-Vessiot extension 113 
power series, in one variable 370.A 
prime 149.B 

Pythagorean 139.B 155.C 
Pythagorean ordered 60.0 
quadratic 347.A 
quasi-algebraically closed 
of quotients 67.G 
ramification (of a prime ideal) 
random 407.B 

of rational expressions 337.H 


14.K 


126.L 


257.A 439.F 


118.F 


14.K 


rational function, in n variables 149.K 

of rational functions 337.H 

real 149.N 

real closed 149.N 

real quadratic 347.A 

relative algebraic number 14.1 

residue class 149.C 368.F 

residue class (of a valuation) 439.B 

scalar 108.0 

scalar (in a 3-dimensional Euclidean space) 
442.D 

of scalars (of a linear space) 256.A 

skew 149.A 368.B 

splitting (for an algebra) 362.F 

splitting (for an algebraic torus) 13.D 

splitting (of a polynomial) 149.G 


strongly normal extension 113 
tension 195.B 
tensor — tensor field 
topological 423.P 
totally imaginary 
totally real 14.F 
transversal 136.G 
vector (in a differentiable manifold) 108.M 
vector (in a 3-dimensional Euclidean space) 
442.D 
Wightman 
Yang-Mills 
field equation 
exterior 339.D 
interior 339.D 
field theory 150 


14.F 


150.D 
150.G 
150.B 


constructive 150.F 
Euclidean 150.F 
Markov 150.F 
nonsymmetric unified 434.C 
quantum 150.C 
unified 434 
unitary 434.C 
fifth postulate (in Euclidean geometry) 139.A 
fifth problem of Hilbert 423.N 
figure(s) 137 
absolute (in the Erlangen program) 137 
central 420.B 
equilibrium 55.D 
fundamental (in a projective space) 343.B 
linear fundamental 343.B 
P'- 343.B 
file 96.B 
filing, inverted, scheme 96.F 
fill-in 302 E 
filter 87.1 
Cauchy (on a uniform space) 436.G 
Kalman 86.E 
Kalman-Bucy 86.E 405.G 
linear 405.F 
maximal 87.1 
nonlinear 405.F,H 
Wiener 86.E 
filter base 87.I 
filtering 395.E 
stochastic 342.A 405.F 
filtration 200.J 
discrete 200.J 
exhaustive 200.J 
filtration bounded from below 200.J 
filtration degree 200.J 
final object 52.D 
final set 
(ofa correspondence) 358.B 
(ofa linear operator) 251.E 
final state 31.B 
finely continuous 261.C 
finely open (set) 261.D 
fine moduli scheme 16.W 
finer relation 135.C 
finer topology 425.H 


1986 


fine topology (on a class of measures) 261.D 338.E 


finitary standpoints 156.D 
finite 
(cell complex) 70.D 
(measure) 270.D 
(morphism) 16.D 
(potency) 49.A 
(simplicial complex) 
(triangulation) 70.C 
(von Neumann algebra) 308.E 
approximately 36.H 308.1 
geometrically 234.C 
hyper- 308.1 
locally — locally finite 
point- (covering) 425.R 
pro-, group 210.C 
semi- 308.1 
o~- —o-finite 
star- (covering) 425.R 
finite automaton 31.D 
finite-band potentials 387.E 
finite basis (for an ideal) 67.B 
finite branch (of a curve of class CH) 93.G 
finite character, condition of 34.C 


70.C 


1987 


finite cochain (of a locally finite simplicial complex) 
201.P 
finite continued fraction 83.A 
finite covering (of a set) 425.R 
finite differences 223.C 
finite-dimensional distribution 407.A 
finite-dimensionallinear space 256.C 
finite-dimensional projective geometry 343.B 
finite-displacement theory 271.G 
finite element method 233.G 290.E 304.C 
finite extension 149.F 
finite field 149.C 
finite field F, 149.M 
finite-gap potentials 387.E 
finite groups | 151.A 190.C 
finite intersection property 425.S 
finite interval (in R) 355.C 
finite length 277.1 
finitely additive (vector measure) 443.G 
finitely additive class 270.B 
finitely additive measure 270.D 
finitely additive set function 380.B 
finitely determined process 136.E 
finitely distinguishable (hypothesis) 400.K 
finitely equivalent sets (under a nonsingular bi- 
measurable transformation) 136.C 
finitely fixed 136.F 
finitely generated 
(A-module) 277.D 
(group) 190.C 
finitely presented (group) 161.A 
finite memory channel 213.F 
finiteness condition for integral extensions 284.F 
finiteness theorem 16.AA 
Ahlfors 234.D 
finite order (distribution) 125.J 
finite ordinal number 312.B 
finite part (of an integral) 125.C 
finite population 373.A 
finite presentation 16.E 
finite prime divisor 439.H 
finite projective plane 241.B 
finite rank (bounded linear operator) 68.C 
finite sequence 165.D 
finite series 379.A, App. A, Table 10.1 
finite set 49.A 381.A 
hereditary 33.B 
finite subset property 396.F 
finite sum, orthogonality fora 19.G 317.D, App. A, 
Table 20. VII 
finite type 
(graded module) 203.B 
(module) 277.D 
(morphism of schemes) 16.D 
(O-module) 16.E 
algebraic space of 16.W 
locally of 16.D 
subshift of 126.J 
finite-type power series space 168.B 
finite-valued function 443.B 
finitistic (topological space) 431.B 
Finsler manifold 286.L 
Finsler metric 152.A 
Finsler space 152 
firmware 75.C 
first axiom, Tietze  425.Q 
first boundary value problem  193.F 323.C 
first category, set of 425.N 
first classfication theorem (in theory of obstructions) 
305.B 


Subject Index 
Fixed point 


first complementary law of the Legendre symbol 
297.1 
first countability axiom 425.P 
first definition (of algebraic K-group) 237.J 
first extension theorem (in the theory of obstruc- 
tions) 305.B 
first factor (of a class number) 14.L 
first fundamental form (of a hypersurface) 111.G 
first fundamental quantities (of a surface) 111.H 
first fundamental theorem (Morse theory) 279.D 
first homotopy theorem (in the theory of obstruc- 
tions) 305.B 
first incompleteness theorem 185.C 
first-in first-out memory 96.E 
first-in last-out memory 96.E 
first integral (of a completely integrable system) 
428.D 
first isomorphism theorem (on topological groups) 
423.J 
first kind 
(integral equations of Fredholm type of the) 
217.A 
Abelian differential of 11.C 
Abelian integral of 11.C 
differential form of 16.0 
first law of cosines 432.A, App.A, Table 2.II 
first law of thermodynamics 419.A 
first maximum principle (in potential theory) 338.C 
first mean value theorem (for the Riemann integral) 
216.B 
first negative prolongational limit set 126.D 
first-order asymptotic efficient estimator 399.0 
first-order derivatives 106.A 
first-order designs 102.M 
first-order efficient estimator 399.0 
first-order predicate 411.K 
first-order predicate logic 411.K 
first positive prolongational limit set 126.D 
first problem, Cousin 21.K 
first prolongation (of P) 191.E 
first quadrant (of a spectral sequence) 200.J 
first quartile 396.C 
first regular integral 126.H 
first-return mapping (map) 126.B 
first separation axiom 425.Q 
first variation 46.B 
first variation formula 178.A 
Fisher consistent 399.K 
Fisher expansions, Cornish- 374.F 
Fisher inequality 102.E 
Fisher information 399.D 
Fisher information matrix 399.D 
Fisher problem, Behrens- 400.G 
Fisher theorem 43.G 
Fisher theorem, Riesz- 168.B 317.A 
Fisher three principles 102.A 
Fisher-Yates-Terry normal score test 371.C 
Fisher z-transformation 374.D 
fisheye, Maxwell 180.A 
fit, chi-square test of goodness of 400.K 
fit, goodness of 397.Q 
fitting, curve 19.F 
five-disk theorem, Ahlfors 272.J 
fixed branch points (of an algebraic differential 
equation) 288.A 
fixed component (of a linear system) 16.N 
fixed effect 102.A 
fixed-effect model 102.A 
fixed point 
(of a discontinuous group) 122.A 


Subject Index 
Fixed-point index (of a continuous mapping) 


(ofaflow) 126.D 
(ofa mapping) 153.A,D 
(of a transformation group) 431.A 
of discontinuity (of an additive process) 5.B 
of discontinuity (of a stochastic process) 407.A 
hyperbolic 126.G 
isolated 126.G 
fixed-point index (of a continuous mapping) 153.B 
fixed point method 138.B 
fixed-point theorem(s) 153 
Atiyah-Bott 153.C 
Atiyah-Singer 153.C 
Brouwer 153.B 
Kakutani 153.D 
Lefschetz 153.B 
Leray-Schauder 286.D 
Poincaré-Birkoff 153.B 
Schauder 153.D 286.D 
Tikhonov 153.D 
fixed singularity (of an algebraic differential 
equation) 288.A 
fixed variates 403.D 
fixed vector 442.A 
FKG 212A 
Fi (flèche) 52.A 
flabby resolution 125.W 
flabby sheaf 383.E 
flag (in an affine space) 139.B 
flag manifold 199.B 
proper 199.B 
flat 
(connection) 80.E 
(morphism of schemes) 16.D 
(Riemannian manifold} 364.E 
(sphere pair) 235.G 
conformally 191.B 
é- 178.D 
faithfully (A-module) 277.K 
faithfully (morphism of schemes) 16.D 
locally (connection) 80.E 
locally (PL embedding) 65.D 
locally (Riemannian manifold) 364.E 
normally (along a subscheme) 16.L 
flat A-module 277.K 
flat deformation 16.W 
flat F-bundle 154.B 
flat function 58.C 
flat point (of a surface) 111.H 
flat site 16.AA 
flat space 
concircularly App. A, Table 4.II 
conformally App. A, Table 4.II 
projectively App. A, Table 4.II 
flavor dynamics, quantum 132.D 
flex 9.B 
flip model, spin 340.C 
floating point method 138.B 
Floquet theorem | 252J 268.B 
flow 
(in ergodic theory) 136.D 
(ona network) 281.B 
(on a topological space) 126.B 
Anosov 126.B 136.G 
associated 136.F 
Axiom A 126.J 
built under a function 136.D 
C- 126B 
ofclass C' 126.B 
continuous 126.B 
discrete 126.B 


1988 


equivalent 126.B 
geodesic 126.L 136.G 
harmonic 193.K 
homentropic 205.B 
horocycle 136.G 
hypersonic 205.C 
K- 136.E 
Kolmogorov 136.E 
Kronecker 136.G 
laminar 205.E 433.A 
maximum, minimum cut theorem 281 
maximum, problem 281 
measurable 136.D 
minimal 126.N 
minimum-cost, problem 281.C 
Morse-Smale 126.J 
multicommodity, problem 281.F 
network, model 307.C 
network, problem 281 282.B 
S- 136.D 
single-commodity, problem 281.F 
special 136.D 
translational 126.L 136.G 
transonic 205.B 
transversal 136.G 
turbulent 205.E 433.A 
Y- 136.G 
flow-shop scheduling problem 376 
fluctuation-dissipation theorem 402.K 
fluid 205.A 
compressible 205.B 
incompressible 205.B 
Newtonian 205.C 
non-Newtonian 205.C 
perfect 205.B 
fluid dynamics 205.A 
flux 
(ofa regular tube) 193.K 
vector (through a surface) 442.D 
flux density 
electric 130.A 
magnetic 130.A 
focal conic (of a quadric) 350.E 
focallength 180.B 
focal point (of a submanifold of a Riemannian 
manifold) 364.C 
Fock representation 150.C 
Fock space 377.A 
antisymmetric 377.A 
symmetric 377.A 
focus 
(of a conic section ellipse) 78.B 
(of an optimal system) 180.B 
(ofa quadric) 350.E 
Fokker-Planck partial differential equation’ 115.A 
402.1 
Foiag model, Sz. Nagy- 251.N 
folding (of a chamber complex) 13.R 
foliated bundle 154.B,H 
foliated cobordant (C"-foliations) 154.H 
foliated cobordism  154.H 
foliated structure 105.Y 
foliation 154 
Anosov 126J 


compact 154.H 
C- 154.B,G 
I 154H 


holomorphic 154.H 
real analytic 154.H 
Reeb 154.B 


1989 


Riemannian 154.H 
transverse to 154.H 
foliation cycles 154.H 
folium cartesii 93.H 
folium of Descartes 93.H 
foot of the perpendicular 139.E 
force 
apparent 271.D 
body 271.G 
centrifugal 271.D 
Corioli 271.D 
line of 193.J 
Lorentz 130.A 
restitutive 318.B 
forced oscillation 318.B 
force polygon 19.C 
Ford fundamental region 234.C 
forgetful functor 52.1 
form(s) 337.B 
anti-Hermitian 256.Q 
associated (of a projective variety) 16.8 
automorphic 437,DD 450.0 


automorphic, of weight k (or of dimension — k) 


32.B 

automorphic, of weight m 32.A 

basic (in linear programming) 255.A 

bilinear 256.H 277.] 424.G 

bilinear, associated with a quadratic form 
256.H 

canonical (of F(M)) 191.A 

canonical (of a linear hypothesis) 400.H 


canonical, of the equation (of a quadric) 350.B 
canonical 1- (of the bundle of tangent n-frames) 


80.H 
Cantor normal 312.C 


compact (of a complex semisimple Lie algebra) 


248.P 

complex (of a Fourier series) 159.A 

complex (of a real Lie algebra) 248.P 

complex space 365.L 

connection 80.E 417.B 

contact 110.E 

covariant of n-ary, of degreed 226.D 

covariant with ground 226.D 

curvature 80.G 364.D 

cusp (in the case of one variable) 32.B 

cusp (in Siegel upper half-space) 32.F 

differential — differential form 

Dirichlet 261.C 

divergence 323.D 

dominant integral (on a Cartan subalgebra) 
248.W 

é-Hermitian 60.0 

é-trace 60.0 

Euclidean space 412.H 

first fundamental 111.G, App. A, Table 4I 

formula of embedding 303.D 

Fuchsian, of weight k (or of dimension — k) 
32.B 

fundamental (associated with a Hermitian 
metric) 232.A 

fundamental (of a Finsler space) 152.A 

games in partition-function 173.D 

generalized Levi 274.G 

ground 226.D 

Hermitian — Hermitian form 256.Q 

Hesse normal (of a hyperplane) 139.H 

Hilbert modular, of dimension —k 32.G 

Hilbert modular, of weight k 32.G 

holomorphic k- 72.A 





Subject Index 
Formally self-adjoint (differential operator) 


hyperbolic space 412.H 
integral (on a Cartan subalgebra) 248.W 
invariants of n-ary, of degreed 226.D 
Jordan normal 269.G 
k- (of an algebraic group) 13.M 
kernel 348.F 
Khinchin canonical 341.G 
Killing 248.B 
Kolmogorov canonical 341.G 
Legendre-Jacobi standard 134.A, App. A, 
Table 16.I 
Levi 344.A 
Lévy canonical 341.G 
limit of an indeterminate 106.E 
linear (on an A-module) 277.E 
linear (on a linear space) 256.B 
modular, of level N 32.C 
multilinear 256.H 
n-person 173.B-D 
norm Í118.D 
normal (of an ordinal number) 312.C 
normal (of an ordinary differential equation) 
313.B 
normal (of partial differential equations) 321.B 
normal (of a partial differential equation of the 
first order) 324E 
normal (of a surface) 410.B 
normal real (of a complex semisimple Lie 
algebra) 248.Q 
normic (in a field) 118.F 
Pfaffian 105.Q 428.A 
polar (of a complex number) 74.C 
primitive 232.C 
pseudotensorial 80.G 
quadratic -~ quadratic form 
real (of a complex algebraic group) 60.0 
real (of a complex Lie algebra) 248.P 
reduced (of a linear structural equation system) 
128.C 
regular Dirichlet 261.C 
second fundamental 111.G 365.C, App. A, 
Table 4I 
sesquilinear —*sesquilinear form 
Siegel modular, of dimension —k 32.F 
Siegel modular, of weight k 32.F 
skew-Hermitian 256.Q 
skew-symmetric multilinear 256.H 
space 285.E 412.H 
spherical space 412.H 
standard (of a difference equation) 104.C 
standard (of alatin square) 241.A 
symmetric multilinear 256.H 
symplectic 126.L 
tensorial 80.G 
third fundamental App. A, Table 4.I 
torsion 80.H i 
Weierstrass canonical (for an elliptic curve) 
9.D 
Weierstrass canonical (of the gamma function) 
174.A 
Weyl 351.C 
formal adjoint operator 322.E 
formal degree (of a unitary representation) 437.M 
formal dimension n, Poincaré pair of 114J 
formal geometry 16.X 
formal group 13.C 
formalism 156.A,D 
Gupta-Bleuler 150.G 
formally real field 149.N 
formally self-adjoint (differential operator) 112.1 


Subject Index 
Formally undecidable proposition 


formally undecidable proposition 185.C 
formal power series 370.A 
field of, in one variable 370.A 
rings of 370.A 
formal power series field in one variable 370.A 
formal power series ring 370.A 
formal scheme 16.X 
separated 16.X 
formal solution (for a system of ordinary differential 
equations) 289.C 
formal spectrum (of a Noetherian ring) 16.X 
formal system 156.D 411.1 
formal Taylor expansion 58.C 
formal vector fields 105.AA 
formation 
class 59.H 
pattern 263.D 
formation, class 59.H 
formation rule 411.D 
form ring 284.D 
formula(s) 411.D 
Abramov 136.E 
addition (fore?) 131.G 
addition (for sine and cosine) 432.A 
Adem  App.A, Table elt 
algebraic addition 3.M 
atomic 411.D 
atomic (of a language) 276.A 
Bayes 342.F 405.1 
Bessel interpolation App. A, Table 21 
Binet 174.A 295.A 
Bouquet (on space curves) 111.F 
Campbell-Hausdorff 249.R 
Cardano App. A, Table 1 
Cartan (for Steenrod pth power operations) 
64.B 
Cartan (for Steenrod square operations) 64.B 
Cauchy-Hadamard (on the radius of conver- 
gence) 339.A 
Cauchy integral 198.B 
Chebyshev (in numerical integration) 299.A 
Chern (in integral geometry) 218.D 
Christoffel-Darboux 317.D 
Clenshaw-Curtis 299.A 
closed 411J 
closed (of a language) 276.A 
closed type (in numerical integration) 299.A 
connection 253.A 
constant variational 163.E 
Crofton (in integral geometry) 218.B 
decomposition, of Radon 125.CC 
De Moivre 74.C 
dimensional 116 
discontinuity 146.C 386.C 
Dixon-Ferrar App. A, Table 19.IV 
double exponential 299.B 
Dynkin 261.C 
of embedding form 303.D 
empirical 19.F 
Euler (for cosz,sinz,coshz) 131.G 
Euler (fore?) 131.G 
Euler-Maclaurin 379. 
Euler- Poincaré (for a finite Eulidean cellular 
complex) 201.B,F 
Euler summation 295.E 
Everett App. A, Table 21 
Everett interpolation 224.B, App. A, Table 21 
exponential 286.X 
Ferrari App. A, Table f 
Feynman-Kac 315.F,G 


1990 


Feynman-Kac-Nelson 150.G 

first variation 178.A 

Fourier inversion 160.C 

Fredholm 68.L 

Frenet (on curves) 111.D 

Frenet-Serret (on curves) 111.D 

Gauss (on Gauss sum) 295.D 

Gauss (on harmonic functions) 193.D 

Gauss (for integration of a vector field) 
App. A, Table 3.III 

Gauss (for isometric immersion) 365.C 

Gauss (in numerical integration) 299.A 

Gauss (for the surface integral) 94.F 

Gauss (in theory of surfaces) 111.H, App. A, 
Table 4.1 

Gauss backward interpolation 223.C 

Gauss-Bonnet 111.H 364.D, App. A, Table 4.I 

Gauss-Bonnet-Sasaki-Nitsche 275.C 

Gauss-Chebyshev (in numerical integration) 
299.A 

Gauss forward interpolation 223.C 

Gauss-Hermite (in numerical integration) 
299.A 

Gauss integration (in the narrow sense) 299.A 

Gauss interpolation App. A, Table 21 

Gauss-Laguerre (in numerical integration) 
299.A 

Green (for differential operators) App. A, 
Table 15.VI 

Green (for harmonic functions) 193.D 

Green (for Laplace operator) App. A, 
Tables 3.III AU 

Green (for ordinary differential equations) 
252.K 

Green (for partial differential equations of 
parabolic type) 327.D 

Green (on the plane) 94.F 

Green-Stokes 94.F 

Hansen-Bessel App. A, Table 19 [II 

Heron (for plane triangles) App. A, Table 21 

Heron (for spherical triangles) App. A, 
Table 2.III 

identically true 411.G 

IMT 299.B 

interpolation 223.A 

interpolatory 299.A 

inversion (for a characteristic function) 341.C 

inversion (of cosine transform) 160.C 

inversion (of Fourier transform) 160.C 

inversion (of Fourier transform of distributions) 
160.H 

inversion (of Fourier transform on a locally 
compact Abelian group) 192.K 

inversion (of generalized Fourier transform) 
220.B 

inversion (of Hilbert transform) 220.E 

inversion (of integral transform) 220.A 

inversion (of Laplace-Stieltjes transform) 
240.D 

inversion (on a locally compact group) 437.L 

inversion (of Mellin transform) 220.C 

inversion (for a semigroup of operators) 240.1 

inversion (of Stieltjes transform) 220.D 

Itô 45.G 406.B 

Jensen 198.F 

Klein-Nishina 415.G 

Kneser-Sommerfeld App. A, Table 19 III 

Kostant (on representations of compact Lie 
groups) 248.Z 

Kronecker limit 450.B 


1991 


Kubo 402.K 

Kiinneth (in an Abelian category) 200.H 

Künneth (in Weil cohomology) 450.Q 

Lagrange (for the vector triple product) 442.C 

ofalanguage 276.A 

lattice-point 222.B 

Lefschetz fixed-point 450.Q 

Leibniz (in differentiation) 106.D, App. A, 
Table 9.III 

Leibniz (in infinite series) App. A, Table 10.III 

Liouville 252.C 

Machin 332 

Mehler App. A, Table 19.III 

Milne-Simpson 303.E 

Möbius inversion (in combinatorics) 66.C 

Möbius inversion (in number theory)  295.C 

Nakano-Nishijima-Gell-Mann 132.A 

Newton (on interpolation) App. A, Table 21 

Newton (on symmetric functions) 337.1 

Newton backward interpolation 223.C 

Newton-Cotes (in numerical integration) 
299.A 

Newton forward interpolation 223.C 

Newton interpolation App. A, Table 21 

Nicholson App. A, Table 19.IV 

open (in numerical integration) 299.A 

Ostrogradskii 94.F 

overall approximation 303.C 

Picard-Lefschetz 418.F 

Plancherel (on a unimodular locally compact 
group) 437.L 

Plicker (on plane algebraic curves) 9.B 

Poincaré (in integral geometry) 218.C 

Poisson (on Bessel functions) App. A, 
Table 19.III 

Poisson (for a flat torus) 391J 

Poisson integral 198.B 

Poisson summation 192.C,L 

prime 411.D 

prime (of a language) 276.A 

principal, of integral geometry 218.C 

product (for the Hilbert norm-residue symbol) 
14.R 

product (on invariant Haar measures) 225.F 

product (for the norm-residue symbol) 14.Q 

product (on valuations) 439.H 

q-expansion (on theta functions) 134.1 

recurrence, for indefinite integrals App. A, 
Table 9.II 

reduction (of a surface) 110.A 

Ricci 417.B, App. A, Table 4.II 

Riemann-Hurwitz (on coverings of a non- 
singular curve) 9.1 

Rodrigues 393.B 

Schlafli App. A, Table. 19.III 

Schwarz-Christoffel transformation 77.D 

second variation 178.A 

set-theoretic 33.B 

Sommerfeld App. A, Table 19.III 

Sonine-Schafheitlin App. A, Table 19.IIT 

Steinberg (on representations of compact 
Lie groups) 248.2 

Stirling 174.A 212.C, App. A, Table 17.1 

Stirling interpolation App. A, Table 21 

Stokes 94.F 

Stokes (on a C?-manifold) 108.U 

Stokes (for integration of a vector field) 
App. A, Table 3.IH 

Taylor (for a function of many variables) 
106J 


Subject Index 
Fourier series 


Taylor (for a function of one variable) 106.E, 


App. A, Table 9.IV 
theoretical 19.E 
©- (on ideles) 6.F 
trace (on unitary representations) 437.DD 
transformation (for the generating function 
of the number of partitions) 328.A 
transformation (of a theta function) 3.I 
transformation (for theta series) 348.L 
Trotter product 351.F 
valid 411.G 
Villat integration App. A, Table 15.VI 
Wallis App. A, Table 10.VI 
Watson 39.D, App. A, Table 19.IV 
Watson-Nicholson App. A, Table 19.IIT 
Weber App. A, Table 19.IV 
Weber-Sonine App. A, Table 19.III 
Weierstrass-Enneper 275.A 


Weingarten (for isometric immersion) 365.C 


Weingarten (in theory of surfaces) 111.H, 
App. A, Table Al 
Weyl 323.M 


Weyl character (on representations of compact 


Lie groups) 248.Z 
Weyl integral 225. 
Weyrich App. A, Table 19.III 
Wiener 160.B 
Wus App. A, Table 6.V 
Forti paradox, Burali- 319.B 
forward analysis 138.C 
forward difference 304.E, App. A, Table 21 
forward emission 325.A 
forward equation, Kolmogorov 115.A 260.F 
forward interpolation formula 
Gauss 223.C 
Newton 223.C 
forward type 304.D,F 
foundation, axiom of 33.B 
foundations of geometry 155 
foundations of mathematics 156 
four arithmetic operations 294.A 
four color conjecture 186.1 
four-color problem 157 
4-current density 150.B 
four-group 151.G 
Fourier, J. B.J. 158 
Fourier analysis App. A, Table 11 
Fourier analysis on the adele group ot 
Fourier-Bessel series 39.D 
Fourier-Bessel transform 39.D 
Fourier coefficient 159.A, App. A, Table 11.I 
(of an almost periodic function) 18.B 
(in a Hilbert space) 197.C 
(in an orthogonal system) 317.A 
Fourier cosine series App. A, Table 11.1 
Fourier cosine transform — 160.C, App. A, 
Table 11.II 
Fourier double integral theorem  160.B 
Fourier-Hermite polynomial 176.I 
Fourier hyperfunction 125.BB 
exponentially decreasing 125.BB 
modified 125.BB 
Fourier integral 160.A 
conjugate 160.D 
Fourier integral operator 274.C 345.B 
Fourier inversion formula 160.C 
Fourier kernel 220.B 
Fourier-Laplace transform 192.F 
Fourier reciprocity 160.C 
Fourier series 159, App. A, Table 11.1 
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Fourier sine series 


(of an almost periodic function) 18.B 

(ofa distribution) 125.P 

(in a Hilbert space) 197.C 
Fourier sine series App. A, Table 11.I 
Fourier sine transform — 160.C, App. A, Table 11.II 
Fourier single integral theorem 160.C 
Fourier-Stieltjes transform  192.B,O 
Fourier theorem (on real roots of an algebraic 

equation) 10.E 

Fourier transform 160, App. A, Table 11.II 

(ofa distribution) 125.0 

(in topological Abelian groups) 36.L 192.1 

discrete 142.D 

fast 142.D 

generalized 220.B 

inverse 125.0 
Fourier ultrahyperfunction 125.BB 
4-momentum operators 258.A,D 
fourth separation axiom 425.Q 
four-vector 258.C 359.C 
four-vector, energy-momentum  258.C 
four-vertex theorem 111.E 
fractal 246.K 
fraction(s) 

continued 83 

partial App. A, Table 10.V 
fractional cutting algorithm 215.B 
fractional factorial design 102.1 

balanced 102.1 

orthogonal 102.1 
fractional group, linear 60.B 
fractional ideal 67.3 

of an algebraic number field 14.E 

principal 67.K 
fractional power 378.D 
fractional programming 264.D 
fractional step 304.F 
Fraenkel set theory, Zermelo- 33.A,B 
fractional transformation, linear 74.E 
frame(s) 90.B 

(of an affine space) 7.C 

(of a C*-manifold) 191.A 

(in E") 11LB 

(of a group manifold) 110.A 

(in projective geometry) 343.C 

(ofa real line) 355.E 

affine 7.C 

bundle of tangent r- 105.H 

Darboux 110.B 

dual 417.B 

family of (on a homogeneous space) 110.A 

family of, of order 1 110.B 

Frenet 110.A 111.D 

Gaussian (of a surface) 111.H 

k- (in R”) 199.B 

method of moving 110.A 

moving 90.B 111.C 417.B 

natural moving 417.B 

normal 110.B 

of order O 110.C 

oforder1 110.B,C 

of order 2 110.B,C 

of order 3 110.B,C 

oforder4 110.B 

orthogonal (in a Euclidean space) 111.B 139.E 

orthogonal k- (in R") 199.B 

orthogonal moving 417.D 

projective 343.C 

Stiefel manifold of k- 199.B 

stochastic moving 406.G 
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tangent r- 105.H 
frame bundle 
orthogonal 364.A 
tangent orthogonal n- 364.A 
tangent r- 105.H 147.F 
framed link 114.L 
framing 114.L 
Fréchet axiom 425.Q 
Fréchet curve 246.A 
Fréchet derivative 286.E 
Fréchet differentiable function 286.E 
Fréchet differential 286.E 
Fréchet distance (between surfaces) 246.1 
Fréchet L-space 87.K 
Fréchet manifold 286.K 
Fréchet space 
(quasinormed space) 37.0 
(topological linear space) 424.1 
(topological space) 425.CC 
locally convex | 424II 
in the sense of Banach 37.0 
in the sense of Bourbaki 424I 
Fréchet surface | 246.I 
Fréchet-Uryson space 425.CC 
Fredholm alternative theorem  68.E 217.F 
Fredholm determinant 217.E 
Fredholm first minor 217.E 
Fredholm formula 68.L 
Fredholm integral equation 217.A 
of the first kind 217.A 
of the second kind 217.A 
of the third kind 217.A 
Fredholm mapping 286.E 
Fredholm operator 68.F 251.D 
(in the sense of Grothendieck) 68.K 
Fredholm rth minor 217.E 
Fredholm type 
integral equation of 217.A 
integrodifferential equation of 222.A 
free 
(discontinuous group) 122.A 
distribution- 371.A 
F- (oriented G-manifold) 431.E 
(F, F')- (oriented G-manifold) 431.E 
free Abelian group 2.C 
free additive group 2.E 
free derivative 235.C 
free Dirac field 377.C 
freedom 
asymptotic 361.B 
degrees of (of a dynamical system) 271.F 
n degrees of (sampling distribution) 374.B,C 
free energy 340.B 402.G 
Gibbs 419.C 
Helmholtz 419.C 
mean 340.B 402.G 
free fields 150.A 
free grammar, context- 31.D 
free groups 161 
free Hamiltonian 351.D 
free homotopy 202.B 
free Lagrangian density 150.B 
freely act 122.A 431.A 
free module 277.G 
free product (of groups) 190.M 
free scalar field 377.C 
free semigroup 161.A 
free special Jordan algebra 231.A 
free vacuum vector 150.C 
free variable 411.C 
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free vector 442.A 
French empiricism 156.C 
Frenet formula 111.D 
Frenet frame 110.A 111.D 
Frenet-Serret formulas (on curves) 111.D, 
App. A, Table Al 
frequency 
(of an oscillation) 318.A 
(ofsamples) 396.C 397.B 
(of a translational flow) 126.L 136.G 
(ofa wave) 446 
angular (of a wave) 446 
circular (of a simple harmonic motion) 318.B 
relative (of samples) 396.C 
frequency distribution 397.B 
frequency function 397.D 
frequency response function 421.E 
Fresnel integral 167.D, App. A, Tables 9.V 19 II 
Freudenthal theorem 202.U 
Friedrichs extension 112.1 251.1 
Friedrichs scheme 304.F 
Friedrichs theorem 323.H 326.D 
frieze group |. 92.F 
Frobenius algebra 29.H 
quasi- 29.H 
Frobenius automorphism (of a prime ideal) 14.K 
Frobenius group 151.H 
Frobenius integrability condition 154.B 
Frobenius method App. A, Table 14.1 
Frobenius morphism 450.P 
Frobenius substitution (of a prime ideal) 14.K 
Frobenius theorem 
(on Abelian varieties) 3.D 
(on matrices with nonnegative entries) 269.N 
(on polynomials of a matrix) 390.B 
(on representations of finite groups) 362.G 
(on total differential equations and on foliations) 
154.B 286.H 428.D 
Frobenius theorem, Perron- 310.H 
Froissart bound 386.B 
Froissart-Martin bound 386.B 
frontier point (of a subset) 425.N 
front set, analytic wave 274.D 
front set, wave 274.B 345.A 
Frostman maximum principle 338.C 
Froude number 116.B 
Fubini theorem 221.E 
Fuchsian form of weight k (or of dimension — Kk) 
32.B 
Fuchsian function 32.B 
Fuchsian group 122.C 
of the first kind 122.C 
of the second kind 122.C 
Fuchsian relation 253.A, App. A, Table 18.I 
Fuchsian type (visibility manifold) 178.F 
Fuchsian type, equation of 253.A 
Fuchsoid group  122.C 
Fuks cohomology, Gel'fand- 105.AA 
full discrete approximation 304.B 
full embedding theorem (of an Abelian category) 
52.N 
full group 136.F 258.A 
full homogeneous Lorentz group 258.A 
full inhomogeneous Lorentz group | 258.A 
full international notation 92.E 
full linear group 60.B 
full matrix algebra 269.B 
full Poincaré group 258.A 
full subcategory 52.A 
fully complete (locally convex space) 424.Y 
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fully faithful functor 52.H 
fully normal space 425.X 
fully transitive 92.C 
Fulton and Hansen, general connectedness theorem 
of 16.1] 
function(s) 165 381.C 
Abelian 3J 
absolutely integrable 214.E 
additive interval 380.B 
additive set 380.C 
admissible 46.A 304.B 
Ahlfors 43.G 77.E 
algebraic 11.A 
almost periodic 18 
almost periodic, ona group 18.C 
almost periodic, with respect to p 18.C 
almost periodic, in the sense of Bohr 18.B 
a-excessive 261.D 
alternating 337.1 
amplitude (of a Fourier integral operator) 
274.C 345.B 
analytic — analytic function(s) 
analytic almost periodic 18.D 
analytic operator 37.K 
Anger 39.G, App. A, Table 19.IV 
Appell hypergeometric, of two variables 
206.D, App. A, Table 18.1 
argument 46.A 
arithmetic 295.A 
Artin-Hasse 257.H 
associated Legendre 383.C, App. A, 
Table 18.IHI 
asymptotically developable 30.A 
automorphic 32 
automorphic, with respect tol 32.A 
b- 125.EE 418.H 
f8-measurable 270.J 
Baire 84.D 
Barnes extended hypergeometric 206.C, 
App. A, Table 18.1 
base 304.B 
Bellman 127.G 
Bergman kernel 188.G 
Bessel 39, App. A, Table 19.III 
beta 174.C, App. A, Table 17.I 
bispectral density 421.C 
Borel measurable 270J 
boundary 160.E 
bounded 43.A 
of bounded variation 166 
Busemann 178.F 
C?- (of many variables) 58.B 
C^. slowly increasing 125.0 
C'-, in a C®-manifold 105.G 
canonical (on a nonsingular curve) 9.E 
characteristic (of a density function) 397.G 
characteristic (of a graded R-module) 369.F 
characteristic (of a meromorphic function) 
272.B 
characteristic (of an n-person cooperative game) 
173.D 
characteristic (for an optical system) 180.C 
characteristic (of a probability measure) 341.C 
characteristic (of a subset) 381.C 
characteristic operator 251.N 
Chebyshev App. A, Table 20.II 
Chebyshev a. 19.G, App. A, Table 20.VII 
choice 33.B 34.A 
circular 131.F 432.A 
class (on a compact group) 69.B 
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Function(s) 


of class C", C9, C1, C”, or C^ 106K 

of class C' at a point 105.G 

of class C” in a C?-manifold | 105.G 

of class C? (of many variables) 58.B 

of class n, 0, 1, č, oro 84.D 

cen App. A, Table 16.III 

completely additive set 380.C 

completely monotonic 240.E,K 

completely multiplicative number-theoretic 
295.B 

complex 165.B 

complex-valued 165.B 

composite 106.1 

concave 88.A 

of confluent type 167 

of confluent type and Bessel functions App. A, 
Table 19 

conical App. A, Table 18.II 

conjugate 159.E 160.D 

conjugate harmonic 193.C 

constant 381.C 

continuous (on a metric space) 84 

continuous additive interval 380.B 

convex 88.A 

coordinate (of a fiber bundle) 147.B 

coordinate (in the Ritz method) 304.B 

cosigma 134.H, App. A, Table 16.IV 

counting (of a meromorphic function) 272.B 

covariance 386.A 395.A 

criterion 127.A 

cross spectral density 421.E 

cumulative distribution 341.B 342.C 

cylindrical 39.B, App. A, Table 19.III 

Daniell-Stone integrable 310.E 

decision 398.A 

decision, space of 398.A 

Dedekind eta 328.A 

Dedekind zeta 14.C 450.D 

defining (of a hyperfunction) 125.V 

density 397.D 

derived 106.A 

digamma 174.B 

dimension (on a continuous geometry) 85.A 

D-integrable 100.D 

Dirac delta App. A, Table 12.II 

Dirichlet 84.D 221.4 

distance 273.B 

distribution 168.B 341.B 342.C 

divisor 295.C 

divisor of (on an algebraic curve) 9.C 

divisor of (on an algebraic variety) 16.M 

dn App. A, Table I6.III 

doubly periodic 134.E 

E- 430.D 

é- 46.C 

effectively calculable 356.C 

eigen- (of a boundary value problem) 315.B 

eigen- (for an integral equation) 271.F 

eigen- (of a linear operator) 390.A 

elementary 131 

elementary, of classn 131.A 

elementary Abelian 3.M 

elliptic 134 323.A,D 

elliptic, of the first kind 134.G 

elliptic, of the second kind 134.G 

elliptic, of the third kind 134.H 

elliptic cylinder 268.B 

elliptic irrational 134.A 

elliptic theta 134.1, App. A, Table 16.II 

empirical characteristic 396.C 


empirical distribution 250.F 374.E 396.C 

energy 126.L 279.F 

energy spectrum 433.C 

entire 429.A 

entire algebroidal 17.B 

envelope power | 400.F 

error 167.D, App. A, Table 19.II 

estimating 399.D 

n- 391.L 

Euler 295.C 

even 165.B 

explicit 165.C 

exponential 131.D 

exponential, with the base a 131.B 

exponential, of an operator 306.C 

exponential generating 177.A 

factorial 174A 

family of quasi-analytic 58.A 

Feller transition 261.B 

finitely additive set 380.B 

finite-valued 443.B 

flat 58.C 

Fréchet differentiable 286.E 

frequency 397.D 

frequency response 421.E 

Fuchsian 32.B 

gamma  150.D 174, App. A, Table 17.1 

Gel'fand-Shilov generalized 125.8 

generalized 125.8 

generalized divisor 295.C 

generalized rational 142.B 

general Mathieu 268.B 

general recursive 356.C,F 

generating (of an arithmetic function) 295.E 

generating (of a contact transformation) 
82.A 271.F 

generating (of a sequence of functicns) 
177.A 

of Gevrey class 168.B 

global implicit, theorem 208.D 

grand partition 402.D 

Green 188.A 189.B 

Green (x-order) 45.D 

Green (of a boundary value problem) 315.B 

Green, method 402.J 

Gudermann 131.F, App. A, Table 16.III 

half-Bessel 39.B 

Hamiltonian 219.C 271.F 

Hankel 39.B, App. A, Table 19.ITY 

harmonic 193 

harmonic kernel 188.H 

hazard 397.0 

Heaviside 125.E 306.B, App. A, Table 12.II 

Hey zeta 27.F 

higher transcendental 289.A 

Hilbert characteristic (of a coheren: sheaf on 
a projective variety) 16.E 

Hilbert modular 32.G 

Hil 268.E 

holomorphic 198 

holomorphic (of many variables) 21.C 

holomorphic (on an open set in a complex 
manifold) 72.A 

holomorphic, germ of 21.E 

hyper- 125.V 

hyperarithmetical 356.H 

hyperbolic 131.F 

hypergeometric 206.A, App. A, Table 18.T 

hypergeometric (of the hyperspherical dif- 
ferential equation) 393.E 
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hypergeometric, of confluent type 167.A, 
App. A, Table 19.I 

hypergeometric, with matrix argument  206.E 

identity 381.C 

implicit 165.C 208 

implicit, theorem 208.A 286.G 

implicit, theorem (in Banach algebra) 36.M 

implicit, theorem (in locally convex spaces) 
286.3 

impulse 306.B, App. A, Table 12.II 

incomplete beta 174.C, App. A, Table 17.1 

incomplete gamma _ 174.A, App. A, Table 17.1 

increment 380.B 

indicator (of a subset) 342.E 376.C 

inner 43.F 

integral 429.A 

integral of, with respect to a volume element 
(on a C?-manifold) 105.W 

interpolation 223.A 

interval 380.A 

invariant decision 398.E 

inverse 198.L 381.C 

inverse analytic 198.L 

inverse trigonometric 131.E 

Jacobi elliptic App. A, Table 16.III 

joint density 397.J 

Julia exceptional 272.F 

jump 306.C 

K-pseudoanalytic 352.B 

K-quasiregular 352.B 

k-valued algebroidal 17.A 

Kelvin 39.G, App. A, Table 19.IV 

kernel 188.G 

Koebe extremal 438.C 

Kummer 167.A, App. A, Table 19.I 

L- —L-function 

Lagrangian 271.F 292.A 

Laguerre App. A, Table 20.VI 

å- 32C 

Lamé, of the first kind 133.B 

Lamé, of the first species 133.C 

Lamé, of the fourth species 133.C 

Lamé, of the second kind 133.C 

Lamé, of the second species 133.C 

Lame, of the third species 133.C 

Lane-Emden 291.F 

Laplace spherical 393.A 

Lebesgue measurable 270J 

Legendre 393.B, App. A, Table 18.II 

likelihood 374.3 399.M 

likelihood estimating 399.M 

linear 74.E 

linear discriminant |. 280.I 

linear fractional 74.E 

linear regression 397.H,J 403.D 

locally integrable 168.B 

logarithmic, to the base a  131.B 

loss 398.A 

lower limit 84.C 

lower semicontinuous (in a set) 84.C 

Lyapunov 126.F 163.G 

major 100.F 

Mangoldt 123.B 

many-valued 165.B 

of many variables 165.C 

Mathieu 268 

Mathieu, of the first kind 268.B 

Mathieu, of the second kind 268.D 

maximal concentration 341.E 

mean concentration 341.E 
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Function(s) 


measurable 270. 

measurable vector 308.G 

meromorphic 21.J 272 

meromorphic (on an analytic space) 23.D 

meromorphic (on a complex manifold) 72.A 

minimax decision 398.B 

minor 100.F 

Möbius 66.C 295.C 

modified Bessel 39.G, App. A, Table 19.IV 

modified indicator 341.C 

modified Mathieu 268.A 

modified Mathieu, of the first kind 268.D 

modified Mathieu, of the second kind 268.D 

modified Mathieu, of the third kind 268.D 

modular (of a locally compact group) 225.D 

modular, of level N  32.C 

moment-generating 177.A 341.C 

monotone 166.A 

monotone decreasing 166.A 

monotone increasing 166.A 

monotonic 166.A 

Morse 279.B 

of at most class 1 84.D 

u-conformal 352.B 

multidimensional gamma . 374.C 

multiplicative 32.A 

multiplicative automorphic 32.A 

multiplicity (of a mapping) 246.G 

multivalent 438 

multivalued 165.B 

Nash-Moser implicit, theorem 286.J 

n-dimensional distribution 342.C 

nth derived 106.D 

n-times continuously differentiable 106.K 

n-times differentiable 106.D 

ofn variables 165.C 

nice (on a C?-manifold) 114.F 

nondecreasing 166.A 

nondegenerate theta 3.I 

nonincreasing 166.A 

nontangential maximal 168.B 

normal (of ordinal numbers) 312.C 

normal density 397.D 

null 310.1] 

number-theoretic 295.A 356.A 

objective 264.B 307.C 

odd 165.B 

operating 192.N 

order (of a meromorphic function) 272.B 

orthogonal 317, App. A, Table 20 

orthogonal, Haar system of 317.C 

orthogonal, Rademacher system of 317.C 

orthogonal, Walsh system of 317.C 

outer 43.F 

P-,of Riemann 253.B 

ga-, of Weierstrass 134.F, App. A, Table 16.IV 

Painlevé transcendental 288.C 

parabolic cylinder 167.C 

parametric 102.A 399.A 

partial 356.E 

partition 402.D 

payoff 173.B 

pentagamma  174.B 

periodic 134. E 

phase (of a Fourier integral operator) 274.C 
345.B 

piecewise continuous 84.B 

plurisubharmonic 21.G 

point 380.A 

polygamma  174.B, App. A, Table 17.1 
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positive real 282.C 

of positive type 192.B,J 

power 400.A 

primitive 216.C 

primitive, derivatives and App. A, Table 9.1 
primitive recursive 356.A,B,F 
probability generating 341.F 

proper (of a boundary value problem) 315.B 
proper convex 88.D 

propositional 411.C 

proximity (of a meromorphic function) 272.B 
pseudo- 125.C 

psi 174.B 

quadratic loss 398.A 399.E 
quasi-analytic $8.F 

quasi-analytic, family of 58.A 
quasi-analytic, set of 58.F 
quasicontinuous 338.1 

radial maximal 168.B 

rank 66.F 

rapidly decreasing C*- 168.B 

rate distortion 213.E 

rational, field of 337.H 

rational, on a variety 16.A 

rational entire 429.A 

real 165.B 

real analytic 106.K 198.H 

real-valued 165.B 

recursive — recursive function(s) 
regression 397] 

regular 198 

regular, on an open set (of a variety) 16.B 
regular, ata subvariety 16.B 
representative (of a compact Lie group) 249.U 
representing (of a predicate) 356.B 
representing (of a subset) 381.C 
reproduction 263.A 

Riemann (of a Cauchy problem) 325.D 
Riemann integrable 216.A 

Riemann P App. A, Tables 14.I 18.1 
Riemann theta 3.L 

Riemann (- 450.B 

right continuous 84.B 

right majorizing 316.E 

right superior 316.E 

risk 398.A 

sample 407.A 

sample covariance 395.G 

with scattered zeros 208.C 

schlicht 438.A 

Schwinger 150.F 

selection 354.E 

self-reciprocal 220.B 

semicontinuous (at a point) 84.C 
sequential decision 398.F 

set 380 

of several variables 106.LJ 

shape 223.G 

Siegel modular, of degreen 32.F 

o-, of Weierstrass 134.F, App. A, Table 16.IV 
simple 221.B 443.B 

simple loss 398.A 

simplest Chebyshev q- 19.G 

simply periodic 134.E 

single-valued 165.B 

singular inner function 43.F 

slope 46.C 

sn App. A, Table 16.III 

special App. A, Table 14.H 

special, of confluent type 389.A 
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special, of ellipsoidal type 389.A 

special, of hypergeometric type 289.A 

spherical (on a homogeneous space) 437.Y 

spherical Bessel 39.B 

spherical harmonic 193.C 

spheroidal wave 133.E 

standard defining 125.Z 

stationary 46.B 

statistical decision 398.A 

stream 205.B 

strictly concave 88.A 

strictly convex 88.A 

strictly decreasing 166.A 

Strictly increasing 166.A 

strictly monotone 166.A 

strictly monotone (of ordinal numbers) 312.C 

strictly monotone decreasing 166.A 

strictly monotone increasing 166.A 

strictly monotonic 166.A 

structure 191.C 

Struve 39.G, App. A, Table 19.IV 

subharmonic 193.A 

superharmonic 193.P 

supporting 125.0 

symmetric 337.1 

Szegó kernel 188.H 

t- 150. D 

test 130.DD 400.A 

tetragamma  174.B 

Theodorsen 39.E 

theory of 198.Q 

theory of, of a complex variable 198.Q 

theta 134I 

theta (on a complex torus) 3.1 

time ordered 150.D 

torus App. A, Table 18 III 

transcendental, of Painlevé 288.C 

transcendental entire 429.A 

transcendental meromorphic 272.A 

transfer 86.D 

transfinite logical choice 411.J 

transition (of a fiber bundle) 147.B 

transition (of a Markov chain) 250.A 261.B 

trigamma  174.B 

trigonometric 432.A, App. A, Table 2 

truncated Wightman 150.D 

truth 341.A411E 

ultradifferentiable 168.B 

uniformly almost periodic 18.B 

unit 306.B, App. A, Table 12.II 

universally measurable 270.L 

upper limit 84.C 

upper semicontinuous 84.C 

value 108.B 

ona variety 16.A 

von Neumann 39.B 188.H, App. A, 
Table 19.111 

Wagner 39.E 

wave 351.D r 

Weber 39.G 167.C, App. A, Tables 19.IV, 20.IV 

Weierstrass elliptic App. A, Table 16.IV 

Weierstrass g- 134.F, App. A, Table 16.IV 

Weierstrass sigma 134.F, App. A, Table 16.1V 

weight (interpolatory) 299.A 

weight (for the mean of a function) 211.C 

weight (in orthogonality) 317.A 

Whittaker 167.B, App. A, Table 19.11 

Wightman 150.D 

X-valued holomorphic 251.G 

zeta — zeta functions 
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zonal spherical (on a homogeneous space) 
437.Y 
functional 46.A 162 165.B 
additive (of a Markov process) 261.E 
algebraic linear 424.B 
analytic 168.C 
areal 334.B 
bilinear 424.G 
Brownian 176.1 
characteristic (of a probability distribution) 
407.C 
Dirichlet | 334.C 
Douglas 334.C 
linear 37.C 197.F 424.B 
martingale additive 261.E 
multiplicative (of a Markov process) 261.E 
multiplicative, transformation by (in Markov 
process) 261.F 
perfect additive 261.E 
subadditive 88.B 
supporting (of a convex set) 89.G 
Yang-Mills 80.Q 
functional analysis 162 
functional analysis, nonlinear 286 
functional cohomology operation 202.8 
functional-differential equation 163 
system of 163.E 
functional equation 
Abel 388.D 
approximate (of zeta function) 450.B 
Schröder 388.D 
special 388.A 
of zeta function 450.B 
function algebra 164.A 
functionally dependent (components of mapping) 
208.C 
of class C” 208.C 
functional model 251.N 
functional paper 19.D 
functional ®-operation 202.S 
functional relation 208.C 
of class C" 208.C 
of gamma function 174.A 
function element 198.1 339.A 
inverse 198.L 
in the wider sense 198.0 
function field 
(of an algebraic curve over a field) 9.C 
(of an algebraic variety) 16.A 
Abelian 3J 
algebraic, over k of dimension 1 9.D 
algebraic, over k of transcendence degree 1 
9.D 
algebraic, in n variables 149.K 
elliptic 9.D 
rational, in n variables 149.K 
function group 234.4 
function matrix 
rational 86.D 
transfer Sep 
functions on a variety 16.A 
function space(s) 168 435.D 
test 125.S 
function symbol 411.H 
function-theoretic null sets 169 
function variable 411.H 
functor 52.H 
ð- 200.1 
Or. 200. 
additive 52.N 
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adjoint 52.K 
cohomological 200.1 
connected sequences of 200.1 
contravariant 52.H 
covariant 52.H 
derived 200.1 
exact 52.N 200.1 
faithful 52.H 
forgetful 52.1 
fully faithful 52.H 
half-exact 200.1 
homological 200.I 
left adjoint 52.K 
left balanced 200.1 
left derived 200. LO 
left exact 200.1 
partial derived 200.1 
relative derived 200.K 
representable 53.L 
right adjoint 52.K 
right balanced 200.1 
right derived 200.1,Q 
right exact 200.1 
spectral 200J 
universal ð- 200.1 
functorial isomorphism  53J 
functorial morphism  53J 
fundamental absolute neighborhood retract (FANR) 
382.C 
fundamental absolute retract (FAR) 382.C 
fundamental cell (of a symmetric Riemann space) 
413.F 
fundamental class 
(of an Eilenberg-MacLane space) 70.F 
(of a Poincaré pair) 114J 
(ofa Thom complex) 114.G 
with coefficient Z, 65.B 
fundamental conjecture (in topology) 70.C 
fundamental curve (with respect to a birational 
mapping) 16.1 
fundamental cutset matrix 186.G 
fundamental cycle 
(of an oriented pseudomanifold) 65.B 
(in a resolution of a singularity) 418.C 
fundamental differential invariants (of a surface) 
110.B 
fundamental discriminant 295.D 
fundamental domain 234.C 
fundamental exact sequence 200.M 
fundamental figure(s) 343.B 
linear 343.B 
fundamental form 
(associated with a Hermitian metric) 232.A 
(of a Finsler space) 152.A 
first 111.G, App. A, Table 4.1 
second 111.G 360.G 365.C 
fundamental group 170 
algebraic 16.U 
fundamental homology class 201.N 
around K 201.N 
fundamental invariants (of a space with a Lie trans- 
formation group) 110.A 
fundamental kernel 320.H 
fundamental lemma 
in the calculus of variations 46.B 
Neyman-Pearson 400.B 
fundamental open set 122.B 
fundamental operator 163.H 
fundamental period (of a periodic function) 134.E 
fundamental period parallelogram 134.E 


Subject Index 
Fundamental point 


fundamental point 
(with respect to a birational mapping) 161 
(ofa projective space) 343.C 
fundamental quantities 
first 111.H 
second 111.H 
fundamental region (of a discrete transformation 
group) 122.B 
Ford 234.C 
fundamental relations 
(of gamma functions) 174.A 
(among the generators of a group) 161.A 
(in thermodynamics) 419.A 
fundamental retract 382.C 
fundamental root system (of a semisimple Lie 
algebra) 248.N 
fundamental sequence 
of cross cuts (in a simply connected domain) 
333.B 
of rational numbers 294.E 
of real numbers 355.B 
in a uniform space 436.G 
fundamental set (of a transformation group) 122.B 
fundamental solution(s) 
(of a differential operator) 112.B 189.C 
(of an elliptic equation) 323.B 
(of an evolution equation) 189.C 
(of a hyperbolic equation) 325.D 
(of a parabolic equation) 327.D 
(of a partial differential equation) 320.H 
system of (of a system of linear equations) 
269.M 
fundamental space 125.8 
fundamental subvariety (with respect to a birational 
mapping) 16.1 
fundamental system 
(of eigenfunctions to an eigenvalue for 
an integral equation) 217.F 
(for a linear difference equation) 104.D 
(ofa root system) 13.J 
of irreducible representations (of a complex 
semisimple Lie algebra) 248.W 
of neighborhoods 425.E 
of solutions (of a homogeneous linear ordinary 
differential equation) 252.B 
of solutions (of a homogeneous system of linear 
differential equations of the first order) 252.H 
fundamental tensor(s) 
(ofa Finsler space) 152.A 
(of a Riemannian manifold) 364.A 
Lie 413.G 
second 417.F 
fundamental theorem(s) 
ofalgebra 10.E 
Bonnet (on surfaces) 111.H 
of calculus 216.C 
of elementary number theory 297.C 
the first (of Morse theory) 279.D 
Gentzen 411J 
Nevanlinna first 272.B 
Nevanlinna second 272.E 
of the principal order o  14.C 
of projective algebraic varieties 72.F 
of projective geometry 343.D 
of proper mapping 16.X 
the second (of Morse theory) 279.D 
of Stein manifolds 21.L 72.E 
on symmetric polynomials 337.1 
of the theory of curves 111.D 
of the theory of surfaces 111.G 
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Thom 114.H 
of the topology of surfaces 410.B 
of ultraproducts 276.E 
fundamental tieset matrix 186.G 
fundamental unit 414.A 116 
fundamental] units (of an algebraic number field) 
14.D 
fundamental vector field 191.A 
fundamental vectors (in a vector space) 442.A 
future cone 258.A 


G 


y — gamma 
GL(n, k) (general linear group) 60.B 
y-matrices, Dirac 415.G 
y-perfect 186.J 
y-perfectness 186.J 
y-point of the kth order (of a holomorphic function) 
198.C 
I-equivalent (points) 122.A 
I-extension 14.L 
Ij-foliation 154.H 
I-structure 90.D 105.Y 
T,-structure 154.H 
I7-structure 154.E 
g-lattice (of a separable algebra) 27.A 
integral 27.A 
normal 27.A 
G-bundle 147.B 
G-connections, Yang-Mills 80.Q 
G-fiber homotopy type, spherical 431.F 
G-group 172J 
G-invariant 
(element) 226.A 
(statistics) 396.I 
almost 396.1 
G-invariant measure 225.B 
G-isomorphism 191.A 
G-manifold 431.C 
oriented 431.E 
G-mapping (G-map) 362.B 431.A 
G-set 
k-ply transitive 362.B 
left 362.B 
quotient 362.B 
right 362.B 
simply transitive 362.B 
sub- 362.B 
G-space 178.H 431.A 
with nonpositive curvature 178.H 
G-stationary 
strictly 395.] 
weakly 395.1 
G-structure 191 
G-subset 362.B 
G-surface 178.H 
G-vector bundle 237.H 
G;-set 270.C 
gain, heat 419.A 
Galerkin method 290.E 303.I 304.B 
Galilei transformation 359.C 
Galois, E. 171 
Galois cohomology 172.3 200.N 
Galois equation 172.G 
Galois extension (of a field) 172.B 
Galois field 149.M 
Galois group 
of an algebraic equation 172.G 
of a Galois extension 172.B 
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ofa polynomial 172.G 
Galoistheory 172 
of differential fields 113 
Galton-Watson process 44.B 
multi (k)-type 44.C 
game 
bimatrix 173.C 
constant-sum 173.A 
cooperative 173.A 
differential 108 
general-sum 173.A 
with infinitely many players 173.D 
matrix 173.C 
multistage 173.C 
noncooperative 173.A 
n-person, in extensive form 173.B 
n-person, in normal form 173.C 
n-person cooperative, in characteristic- 
function form 173.D 
in partition-function form 173.D 
without side payments 173.D 
zero-sum 173.A 
zero-sum two-person 108.B 
game-theoretic model 307.C 
game theory 173 
gamma density 397.D 
gamma distribution 341.D, App. A, Table 22 
gamma function 150.D 174, App. A, Table 17.1 
incomplete 174.A, App. A, Table 17.1 
multidimensional 374.C 
gamma function and related functions App. A, 
Table 17 
gap (ata point) 84.B 
gap theorem 339.D 
Hadamard 339.D 
gap value (of a point on a Riemann surface) 11.D 
Garding, hyperbolic in the sense of 325.F 
Garding inequality 112.G 323.H 
Garnier system 253.E 
Garside-Jarratt-Mack method 301.N 
gases, kinetic theory of 402.B 
Gâteaux derivative 286.E 
Gâteaux differentiable 286.E 
gauge theory 105.G 
lattice 150.G 
gauge transformation 
(in electromagnetism) 130.A 
(in a lattice spin system) 402.G 
(of a principal fiber bundle) 80.Q 
(in unified field theory) 434.B 
of the first kind 150.B 
Gauss, C. F. 175 
Gauss-Argand plane 74.C 
Gauss backward interpolation formula 223.C 
Gauss-Bonnet formula 111.H 364.D, App. A, 
Table 4.I 
Gauss-Bonnet-Sasaki-Nitsche formula 275.C 
Gauss-Chebyshev formula (in numerical integration) 
299.A 
Gauss circle problem 242.A 
Gauss criterion App. A, Table 10.II 
Gauss equation 
(on an isometric immersion) 365.C 
(onsurfaces) 111.H 
Gauss formula 
(on Gauss sum) 295.D 
(on harmonic functions) 193.D 
(for integration of a vector field) App. A, 
Table 3.III 
(for isometric immersion) 365.C 


Subject Index 
Gel'fand-Pettis integrable 


(in numerical integration) 299.A 
(for the surface integral) 94.F 
(in theory of surfaces) 111.H, App. A, Table 4.I 
Gauss forward interpolation formula 223.C 
Gauss-Hermite formula (in numerical integration) 
299.A 
Gauss hypergeometric differential equation 
App. A, Table 14.II 
Gaussian 
(system of random variables) 176.A 
complex 176.B 
Gaussian curvature 
(ofa surface) 111.H, App. A, Table 4.1 
total(ofa surface) 111.H 
Gaussian differential equation 206.A 
Gaussian distribution 341.D 
standard 176A 
Gaussian elimination 302.B 
Gaussian frame (of a surface) 111.H 
Gaussian integer 14.U 
Gaussian plane 74.C 
Gaussian process 176 342.A 
complex 176.C 
N-ple Markov 176.F 
N-ple Markov, in the restricted sense 176.F 
stationary 176.C 
Gaussian random field 
Markov, in the McKean sense 176.F 
Markov, in the Nelson sense 176.F 
Gaussian random measure 407.D 
Gaussian random variable, complex 176.B 
Gaussian source, autoregressive 213.E 
Gaussiansum  295.D 450.C 
local 450.F 
Gaussian system 
(of random variables) 176.A 
complex 176.B 
Gaussian white noise 407.C 
Gauss integral 338J 
Gauss integration formula (in the narrow sense) 
299.A 
Gauss interpolation formula App. A, Table 21 
Gauss-Jordan elimination 302.B 
Gauss kernel 327.D 
Gauss-Laguerre formula (in numerical integration) 
299.A 
Gauss-Manin connection (of a variety) 16.V 
Gauss mapping (in geometric optics) 180.B 
Gauss-Markov theorem  403.E 
Gauss-Seidel method 302.C 
Gauss series 206.A 
Gauss symbol 83.A 
Gauss theorem 
(on algebraic closedness of C) 10.E 
(on primitive polynomials) 337.D 
Gauss theorema egregium (on surfaces) 111.H 
Gauss transformation App. A, Table 16.HI 
Gauss variational problem 338.J 
G.C.D. (greatest common divisor) 67.H 297.A 
GCR algebra 36H 
Gegenbauer polynomials 317.D 393.E, App. A, 
Table 20.I 
Gel'fand-Fuks cohomology 105.AA 
Gel'fand integrable 443.F 
Gel’fand-Levitan-Marchenko equation 
(for KdV equations) 387.D 
(for nonlinear lattice) 287.C 
Gel'fand-Mazur theorem 36.E 
Gel’fand-Naimark theorem 36.G 
Gel'fand-Pettis integrable 443.F 


Subject Index 
Gel'fand-Pettis integral 


Gel'fand-Pettis integral 443.F 

Gel'fand - Pyatetskii-Shapiro reciprocity law 
437.DD 

Gel'fand representation (of a commutative Banach 
algebra) 36.E 

Gel'fand-Shilov generalized function 125.8 

Gel'fand theorem, Stone- 168.B 

Gel'fand topology 36.E 

Gel'fand transform 36.E 

Gel'fand triplet  424.T 

Gell-Mann formula, Nakano-Nishijima- 132.A 

general addition theorem  388.C 

general analytic space 23.G 

general angle 139.D 

general associative law (for group composition) 
190.C 

general boundary value problem 323.H 

general Cantor set 79.D 

general Cayley algebra 54 

general connectedness theorem due to Fulton and 
Hansen 16] 

general curve 93.D 

general derivative (of a set function) 380.D 

general geometry of paths 152.C 

generalization (in étale topology) 16.AA 

generalized absolute continuity fal 100.C 

generalized absolute continuity in the restricted 
sense 100.C 

generalized absolutely continuous function | 100.C 

generalized Bayes solution 398.B 

generalized Bernoulli shift 136.D 

generalized Bernshtein problem 275.F 

generalized Boolean algebra 42.B 

generalized Boolean ring 42.C 

generalized Borel embedding 384.D 

generalized Clifford torus 275.F 

generalized cohomology theories 201.A 

generalized cohomology theory with E-coefficient 
202.T 

generalized conformal mapping 246.I 

generalized continuum hypothesis 49.D 

generalized convergence 331.C 

generalized convolution (of distributions) 125.M 

generalized coordinates (in analytical dynamics) 
271.F 

generalized cylindrical coordinates App. A, 
Table 3.V 

generalized decomposition number (of a finite 
group) 362.1 

generalized derivative 125.E 

generalized distance, Mahalanobis 280.E 

generalized distribution, Beurling 125.U 

generalized divisor function 295.C 

generalized eigenfunction 375.C 

generalized eigenspace (of a linear operator) 
390.B 

generalized eigenvalue 375.C 

generalized eigenvalue problem 298.G 

generalized eigenvector 390.B 

generalized Eisenstein series 450.T 

generalized Fourier transform 220.B 

generalized function 125.8 

generalized Hardy class 164.G 

generalized helix 111.F 

generalized homology theory 201.A 

generalized homology theory with E-coefficient 
202.T 

generalized Hopf homomorphism 202.U 

generalized Hopf invariant 202.Q 

generalized Hurewicz theorem 202.N 
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generalized isoperimetric problem 46.A 228.A 
generalized Jacobian (of a set function) 246.H 
generalized Jacobian variety 9.F 11.C 
generalized Lamé differential equation 167.E 
generalized least squares estimator 403.E 
generalized Lebesgue measure 270.E 
generalized Levi form 274.G 
generalized limit 37.F 
generalized minimal immersion 275.B 
generalized module 143.B 
generalized momentum 271.F 
generalized nilpotent (operator) 251.F 
generalized nilpotent element 36.E 
generalized nilpotent group 190.K 
generalized peak point 164.D 
generalized peak set 164.D 
generalized Pfaff problem 428.B 
generalized Poincaré conjecture 65.C 
generalized quaternion group 151.B 
generalized rational function 142.B 
generalized Riccati differential equation App. A, 
Table 14.1 
generalized Riemann-Roch theorem (on algebraic 
curves) 9.F 
generalized Schlómilch series 39.C 
generalized solvable group 190.K 
generalized stochastic process 407.C 
generalized suspension theorem 202.T 
generalized Tauberian theorem 36.L 160.G 
of Wiener 192.D 
generalized topological space 425.D 
generalized trigonometric polynomial 18.B 
generalized trigonometric series 18.B 
generalized uniserial algebra 29.1 
generalized valuation 439.B 
generalized variance 280.E 397.J 
generalized variance, sample 280.E 
generalized wave operator 375.B 
generalized Whitehead theorem 202.N 
general knot conjecture 235.B 
general law of reciprocity 14.0 
Artin 59.C 
general linear group  60.B 226.B 256.D 
of degree n over K 60.B 226.B 256.D 
over a noncommutative field 60.0 
projective 60.B 
general linear hypothesis 400.C 
general linear Lie algebra 248.A 
general lower derivative (of a set function) 380.D 
general Markov chain 260.J 
general Mathieu function 268.B 
general Navier-Stokes equations 204.F 
general position 
(complexes) 70.B 
(PL mappings) 65.D 
(in a projective space) 343.B 
theorem 65.D 
general principle of relativity 358 
general projective geometry 343.B 
general random walk 260.A 
general recursive function 356.C,F 
general recursive predicate 356.C 
general recursive set 97 
general Runge-Kutta method 303.D 
general sense, derivable in the 380.D 
general set theory 33.B 
general solution 
(of a difference equation) 104.D 
(of an ordinary differential equation) 313.A 
(of a partial differential equation) 320.C 
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(of a system of partial differential equations) 
428.B 
general sum 173.A 
general theory 
of perturbations 420.E 
of relativity 358 
general topology 426 
general type 72.H 
surface of 72.K 
general uniformization theorem 367.G 
general upper derivative (of a set function) 380.D 
generate 
(an A-module) 277.D 
(a completely additive class) 270.B 
(a field over k) 149.D 
(a filter) 87.1 
(an ideal) 67.B 
(a linear subspace) 256.F 
(asubgroup) 190.C 
(asubring) 368.E 
(atopology) 425.F 
generated, finitely 277.D 
generating curve 111.] 
generating element (with respect to a self-adjoint 
operator) 390.G 
generating function(s) 
(of an arithmetic function) 295.E 
(of a canonical transformation) 82.B 
(of a contact transformation) 82.A 
(of an infinitesimal transformation) 271.F 
(of a sequence of functions) 177.A 
exponential 177.A 
factorial cumulant- 397.G 
factorial moment 397.G 
joint moment 397.1,J 
moment- 177.A 341.C 397.G,J 
probability- 341.F 397.G 
generating line 
(ofa circular cone) 78.A 
(of a quadric hypersurface) 343.E 
(of a quadric surface) 350.B 
(of a ruled surface in differential geometry) 
(III 
generating representation (of a compact Lie group) 
249.U 


generating space (of a quadric hypersurface) 343.E 


generator 
(of an Abelian category) 200.1 
(of a cyclic code) 63.D 
(ofan endomorphism) 136.E 
(ofa group) 190.C 
(of a Markov process) 261.C 
(ofasemigroup) 378.D 


Bott 237.D 
Dynkin representation of 261.B 
F.D. 136.E 


infinitesimal 378.B 
system of (of an A-module) 277.D 
topological (of a compact Abelian group) 
136.D 
two-sided 136.E 
generic (property) 126.H 
generic point 16.A 
Gentzen fundamental theorem 411.J 
genuine solution 323.G 
genus 
(of an algebraic curve) 9.C 
(of a differential ideal) 428.E 
(of an ideal group) 59.E 
(in integral representation theory) 362.K 


Subject Index 
Geometric realization (of the s.s. complex) 


(ofa knot) 235.A 
(ofalattice group) 13.P 
(of a quadratic field) 347.F 
(of a quadratic form) 348.H 
(of a surface) 410.B 
(of a transcendental integral function) 429.B 
arithmetic (of an algebraic curve) 9.F 
arithmetic (of an algebraic surface) 15.C 
arithmetic (of a complete variety) 16.E 
arithmetic, of a divisor (on an algebraic surface) 
15.C 
boundary 410.B 
effective (of an algebraic curve) OC 
of the function field K/k 9.D 
geometric (of an algebraic surface) 15.E 
geometric (of a complete variety) 16.0 
geometric (of a singular point) 418.C 
i- 15.E 
linear 15.G 
measure of (of a positive definite symmetric 
matrix) 348.K 
© (of an algebraic curve) 9.F 
principal (of an ideal group) 59.E 
principal (of a quadratic field) 347.F 
virtual arithmetic (of a divisor) 16.E 
geocentric parallax 392 
Geócze area (of a surface) 246.E 
Geócze problem 246.D 
geodesic 80.L,I 111.H 178 364.C, App. A, Table 4.1 
null 359.D 
totally, submanifold 365.D 
geodesic arc 178.H 364.B 
geodesic coordinates 80.J 
geodesic coordinate system in the weak sense 
232.A 
geodesic correspondence (between surfaces) 111.1 
geodesic curvature 111.1, App. A, Table 4I 
geodesic flow 126.L 136.G 
geodesic line 178.H 
geodesic point 111.H 365.D 
geodesic polar coordinates 90.C 
geodesic triangle 178.A 
geodesic variation 178.A 
geometrically finite 234.C 
geometrically reductive 226.B 
geometrically simple eigenvalue 390.A 
geometrical mean 397.C 
geometric complex 70.B 
geometric construction problem  179.A 
geometric crystal class 92.B 
geometric difference equation 104.G 
geometric dimension (of a vector bundle) 114.D 
geometric distribution 341.D, App. A, Table 22 
geometric fiber (of a morphism) 16.D 
geometric genus 
ofan algebraic surface 15.E 
of a complete variety 16.0 
ofasingular point 418.C 
geometric mean 
(of a function) 211.C 
(of numbers) 211.C 
geometric multiplicity (of an eigenvalue) 390.A 
geometric number theory 296.B 
geometric optics 180 
geometric point (of a scheme) 16.D 
geometric probability 218.A 
geometric programming 264.D 
geometric progression 379.1, App. A, Table 10.VI 
geometric quotient 16.W 
geometric realization (of the s.s. complex) 70.E 


Subject Index 
Geometric series 


geometric series 379.B, App. A, Table 10.I 
geometry 181 
affine 7 
affine, in the narrower sense 7.E 
affine differential 110.C 
algebraic 12.A 
analytic 181 
circle 76.C 
conformal 76.A 
conformal differential 110.D 
continuous 85.A 
differential 109 
differential, of curves and surfaces 111 
differential, in specific spaces 110 
elliptic 285.A 
Euclidean, in the wider sense 139.B 
finite-dimensional projective 343.B 
formal 16.X 
general, of paths 152.C 
general projective 343.B 
hyperbolic 285.A 
hypersphere 76.C 
integral 218.A 
Laguerre 76.B 
Lobachevskii non-Euclidean 285.A 
Mobius 76.A 
natural 110.A 
n-dimensional Euclidean 139.B 181 
non-Archimedean 155.D 
non-Desarguesian 155.D 343.C 
non-Euclidean 285.A 
parabolic 285.A 
plane 181 
projective 343.B 
projective, of paths 109 
projective differential 110.B 
pseudoconformal 344.A 
pure 181 
Riemannian 137, App. A, Table 4.II 
Riemann non-Euclidean 285.A 
solid 181 
space 181 
of a space subordinate to a group 137 
spectral 391 
sphere 76.C 
spherical 285.D 
on a surface 111.G 
synthetic 181 
wave 434.C 
geometry of numbers 182 
germ(s) 383.B 
of an analytic set 23.B 
of a C?-function at the origin 58.C 
of a holomorphic function 21.E 
irreducible 23.B 
reducible 23.B 
sheaf of, of holomorphic functions 23.C 
sheaf of, of regular functions 16.B 


Germain curvature (of a surface) 111.H, App. A, 


Table 4.1 
Gevrey class 58.G 125.U 3251 
function of 168.B 


ghost component (of an infinite-dimensional vector) 


449.A 
ghost, Faddeev-Popov 132.C 150.G 
GHS inequality 212.A 
Gibbs distribution, equilibrium  136.C 
Gibbs-Duhem relation 419.B 
Gibbs free energy 419.C 
minimum principle 419.C 





Gibbs measure 136.C 
Gibbs phenomenon 159.D 
Gibbs state 340.B 
Gilbert-Sacks bound, Varsharmov 63.B 
Gill method, Runge-Kutta- 303.D 
Gini coefficient of concentration 397.E 
Giraud theorem 323.C 
Girsanov theorem 406.B 
Girsanov transformation 406.B 
Girshick-Savage theorem 399.F 
Givens method 298.D 
Givens transformation 302.E 
GKS first inequality 212.A 
GKS second inequality 212.A 
Glashow-Weinberg-Salam model 132.D 
Glauert approximation, Prandtl- 205.B 
Glauert law of similarity, Prandtl- 205.D 
g.l.b. (greatest lower bound) 311.B 
Gleason part (for a function algebra) 164.F 
Gleason theorem 351.L 
glide operation 92.E 
glide reflection 92.E 
Glivenko-Cantelli theorem 374.E 
global analysis 183 
global dimension 

(of an analytic set) 23.B 

(ofa ring) 200.K 

left (ofa ring) 200.K 

right (ofa ring) 200.K 

weak (ofa ring) 200.K 
global discretization error 303.B 
global Hecke algebra 450.0 
global implicit function theorem 208.D 
globally asymptotically stable 126.F 


globally symmetric Riemannian space 412.A 
global property (in differential geometry) 109 


global roundofferror 303.B 
global truncation error 303.B 
gluing theorem 21I 
gluons 132.D 
GNS construction 308.D 
Godbillion-Vey classes 154.G 
Gödel, Kurt 184 
Gödel completeness theorem  411.J 
Gödel incompleteness theorem 156.E 
Godel number(s) 185 356.C,E 
Gödel numbering 185.A 
Gödel set theory 33.C 
Bernays- 33.A 
Goldbach problem 4.C 
Golden-Thompson inequality 212.B 
Goldstein method, Ince- 268.C 
Goldstone boson, Nambu- .132.C 
Goldstone theorem  132.C 
Gomory cut 215.B 
Goodner-Kelley theorem, Nachbin- 37.M 
goodness of fit 397.Q 
test 401.E 
good reduction (of an Abelian variety) 3.N 
potential (of an Abelian variety) 3.N 
good resolution 418.C 
Goppa code 63.E 
Gordan coefficients, Clebsch- 258.B 353.B 
Gordan equation, Klein- 351.G 377.C 
Gordon equation, Sine- 387.A 
Gorenstein ring 200.K 
Goursat kernel, Pincherle- 217.F 
Goursat theorem 198.B 
grad (gradient) 442.D 
graded algebra 203.B 
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graded A-module 200.B 
graded coalgebra 203.B 
graded Hopf algebra 203.C 
graded ideal 369.B 
graded module 
connected 203.B 
dual 203.B 
graded object 200.B 
graded ring 369.B 
associated 284.D 
gradient 442.D, App. A, Table 3.II 
gradient method 292.E 
Arrow-Hurwicz-Uzawa 292.E 
conjugate (CG) 302.D 
gradient projection method, Rosen 292.E 
Graeffe method 301.N 
Gramian (determinant) 103.G 208.E 
grammar 
Chomsky 31.D 
context-free 31.D 
context-sensitive 31.D 
regular 31.D 
Gram-Schmidt orthogonalization 317.A 
Gram theorem 226.E 
grand canonical ensemble 402.D 
grand partition function 402.D 
graph 186.B 
(ofa knot) App. A, Table 7 
(=linear graph) 282.A 
(ofa mapping) 381.C 
(of a meromorphic mapping) 23.D 
(of an operator) 251.B 
(ofa relation) 358A 
bipartite 186.C 
complete 186.C 
complete bipartite 186.C 
direct 186.B 
Euler 186.F 
Feynman  146.A,B 386.C 
labeled 186.B 
linear 282.A 
oriented 186.B 
partial 186.C 
planar 186.H 
regular 186.C 
reoriented 186.B 
section 186.C 
sub- 186.D 
undirected 186.B 
unicursal, theorem (Euler’s) 186.F 
unlabeled 186.B 
unoriented 186.B 
graphic 66.H 
graphical calculation 19.B 
graphical differentiation 19.B 
graphical integration 19.B 
graphical mechanics 19.D 
graphical method of statistical inference 19.B 
graph norm 251.D 
graph theorem, closed 37.1 251.D 424.X 
graph theory 186 
Grashoff number 116.B 
Grassmann algebra (of a linear space) 256.0 
Grassmann coordinates (in a Grassmann manifold) 
90.B 
Grassmann manifold 119.B 427.D 
complex 199.B 
formed by oriented subspaces 199.B 
infinite 147.1 


Subject Index 
Group(s) 


real 199.B 
Grauert theorem (on proper holomorphic mappings) 
23.E 72.E 
gravitation, law of universal 271.B 
gravitational units, system of 414.B 
gravity, center of 271.E 
gravity wave 205.F 
long 205.F 
grazing ray 325.L 
great circle (of a sphere) 140 
greater than (another compactification) 207.B 
greatest common divisor 67.H 297.A 
greatest element (in an ordered set) 311.B 
greatest lower bound (of an ordered set) 310.C 
311.B 
greedy algorithm 66.G 
Greek mathematics 187 
Greek quadratrix 187 
Greek quadrivium 187 
Greek three big problems 187 
Green formula 
(for differential operators) App. A, Table 15.VI 
(for harmonic functions) 193.D 
(for Laplace operator) App. A, Tables 3.III, AU 
(for ordinary differential equations) 252.K 
(for partial differential equations of parabolic 
type) 327.D 
(on the plane) 94.F 
Green function method 402.J 
Green functions 188 189.B 
(a-order) 45.D 
(of a boundary value problem) 315.B 
Greenline 193J 
Green measure 193J 
Green operator 189.A,B 194.C 
Green space 193.N 
Green-Stokes formula 94.F 
Green tensor 188.E 
Green theorem 105.W 
Griffith first inequality 212.A 
Griffith second inequality 212.A 
Gross area {of a Borel set) 246.G 
Gróssencharakter GD 
Hecke L-function with 450.F 
Gross theorem 272.1 
Grothendieck category 200.1 
Grothendieck construction 237.B 
Grothendieck criterion of completeness 424.L 
Grothendieck group 
(of a compact Hausdorff space) 237.B 
(ofaring) 237J ` 
Grothendieck theorem of Riemann-Roch type 
366.D 
Grothendieck topology 16.AA 
ground field (of a linear space) 256.A 
ground form 226.D 
covariant with 226.E 
ground ring (of a module) 277.D 
group(s) 190.A 
Abelian 2190.A 
Abelian linear 60.L 
absolute homology 201.L 
additive 2.E 190.A 
adele (of an algebraic group) 13.P 
adele (of a linear algebraic group) 6.C 
adjoint (isogenous to an algebraic group) 13.N 
adjoint (of a Lie algebra) 248.H 
adjoint (of a Lie group) 249.P 
of affine transformations 7.E 


Subject Index 
Group(s) 


affine Weyl (of a symmetric Riemann space) 
413.G 

algebra 192.H 

algebra class 29.E 

algebraic 13 

algebraic fundamental 16.U 

algebraic homotopy 16.U 

alternating, of degreen 151.G 

automorphism (of a Lie algebra) 248.A 

of automorphisms (of a group) 190.D 

x-automorphism 36.K 

Betti (of a complex) 201.B 

black and white 92.D 

black and white point 92.D 

boundary 234.B 

braid 235.F 

Brauer (of algebra classes) 29.E 

Brauer (of a commutative ring) 29.K 

Bravais 92.B 

bundle (of a fiber bundle) 147.B 

€ 52M 

of canonical transformations 271.F 

category of 52.B 

cellular homology 201.F,G 

character (of an Abelian group) 2.G 

character (of a topological Abelian group) 
422.B 

Chevalley 151.1 

of classes of algebraic correspondences 9.H 

classical 60.A 

Clifford 61.D 

closed 362.J 

coefficient (of a homology group) 201.Q 

cohomology (of a complex) 200.0 

cohomology (of a group) 200.M 

cohomology (of a Lie algebra) 200.0 

cohomology, with coefficient sheaf F — 283.E 

cohomotopy 202.] 

of collineations 343.D 

color point 92.D 

color symmetry (colored symmetry) 92.D 

commutative 2.A 190.A 

commutator (of two elements) 190.H 

commutator (of two subsets of a group) 190.H 

compact 69.A 

completely reducible 190.L 

complex 60.L 

complex cobordism 114.H 

complex orthogonal 60.1 

complex special orthogonal  60.I 

complex symplectic 60.L 

of congruence classes modulo m* 14.H 

of congruent transformations 285.C 

covering 91.A 423.0 

covering transformation 91.A 

Coxeter 13.R 

crystallographic 92.A 

crystallographic space 92.A 

cyclic 190.C 

decomposition (of a prime ideal) 14.K 

defect (of a block of representations) 362.1 

defect (of a conjugate class in a group) 362.I 

derived (of a group) 190.H 

difference (of an additive group) 190.C 

of differentiable structures on combinatorial 
spheres App. A, Table ol 

differentiable transformation 431.C 

dihedral 151.G 

direct product 190.L 

discontinuous, of the first kind 122.B 


2004 


discontinuous transformation 122.A 

divisor (of a compact complex manifold) 72.F 

divisor class (of a Riemann surface) 11.D 

elementary topological Abelian 422.E 

elliptic modular 122.D 

equicontinuous, of class (CH) 378.C 

equivariant J- 431.C 

exponential 437.U 

extension (cohomology of groups) 200.M 

factor 190.C 

finite 151.A 190.C 

finitely generated 190.C 

finitely presented 161.A 

of the first kind 122.C 

formal 13.C 

four- 151.G 

free 161.A 

free product (of the system of groups) 190.M 

frieze 92.F 

Frobenius 151.H 

Fuchsian 122.C 

Fuchsoid 122.C 

full 136.F 

full linear 60.B 

full Poincaré 258.A 

function 234.A 

fundamental (of a topological space) 170 

Galois (of an algebraic equation) 172.G 

Galois (of a Galois extension) 172.B 

Galois (of a polynomial) 172.G 

generalized nilpotent 190.K 

generalized quaternion 151.B 

generalized solvable 190.K 

general linear 60.B 226.B 

general linear (over a noncommutative field) 
60.0 

Grothendieck (of a compact Hausdorff space) 
237.B 

Grothendieck (of a ring) 237.J 

h-cobordism (of homotopy n-spheres) 114.1, 
App. A, Table ol 

Hamilton 151.B 

Hausdorff topological 423.B 

Hilbert modular 32.G 

holonomy 80.D 364.E 

homogeneous holonomy 364.E 

homogeneous Lorentz 359 

homology (of a chain complex) 201.B 

homology (of a group) 200.M 

homology (of a Lie algebra) 200.0 

homology (of a polyhedron) 201.D 

homotopy 202J 

hyper- 190.P 

icosahedral 151.G 

ideal, modulo m* 14.H 

ideal class 14.E 67.K 

idele 6.C 

idele class 6.D 

indecomposable 190.L 

inductive limit 210.C 

inductive system of 210.C 

inertia (of a finite Galois extension) 257.D 

inertia (of a prime ideal) 14.K 

infinite 190.C 

infinite classical 147.1 202.V 

infinite orthogonal 202.V 

infinite symplectic 202.V 

infinite unitary 202,V 

inhomogeneous Lorentz 359 

of inner automorphisms (of a group) 190.D 
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of inner automorphisms (of a Lie algebra) 
248.H 

integral homology (of a polyhedron) 201.D 

integral homology (of a simplicial complex) 
201.C 

integral singular homology 201.E 

isotropy 362.B 


J- 2371 
of Janko-Ree type 151.J 
k- 13A 


K- (of a compact Hausdorff space) 237.B 

Klein four- 151.G 

Kleinian 122.C 243.A 

knot 235.B 

L- 450.N 

lattice 182.B 

lattice (of a crystallographic group) 92.A 

lattice-ordered Archimedean 243.G 

Lie 249.A 423.M 

Lie transformation 431.C 

linear fractional 60.B 

linear isotropy (at a point) 199.A 

linear simple 151I 

link 235.D 

little 258.C 

local Lie 423.L 

local Lie, of local transformations 431.G 

locally Euclidean 423.M 

local one-parameter, of local transformations 
105.N 

Lorentz 60.3 258 359.B 

magnetic 92.D 

Mathieu 151.H 

matric 226.B 

matrix 226.B 

maximally almost periodic 18.[ 

minimally almost periodic 18.1 

mixed 190.P 

Mobius transformation 76.A 

modular 122.D 

monodromy (of an n-fold covering) 91.A 

monodromy (of a system of linear ordinary 
differential equations) 253.B 

monothetic 136.D 

of motions 139.B 

of motions in the wider sense 139.B 

multiplicative 190.A 

multiplicative (of a field) 149.A 190.B 

Néron-Severi (of a variety) 15.D 16.P 

nilpotent 151.C 190.J 

octahedral 151.G 

Q 190.E 

one-parameter, of transformations (of a C^- 
manifold) 105.N 

one-parameter, of transformations of class C" 
126.B 

one-parameter semi-, of class CH 378.B 

one-parameter sub- 249.Q 

ordered 243.G 

ordered additive 439.B 

of orientation-preserving diffeomorphisms 
114.1 

oriented cobordism 114.H 

of oriented differentiable structures on 
a combinatorial sphere 114.1 

orthogonal 60.1 139.B 151.3 

orthogonal (over a field with respect to 
a quadratic form) 60.K 

orthogonal (over a noncommutative field) 
60.0 
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Group(s) 


orthogonal transformation 60.I 

of outer automorphisms (of a group) 190.D 

of outer automorphisms (of a Lie algebra) 
248.H 

p- 151.B 

periodic 2.A 

permutation 190.B 

permutation, of degreen 151.G 

n- 151.F 

z-solvable 151.F 

Picard (of a commutative ring) 237J 

Poincaré 170258.A 

point (of a crystallographic group) 92.A 

polychromatic 92.D 

principal isotropy 431.C 

profinite 210.C 

projective class 200.K 

projective general linear 60.B 

projective limit 210.C 

projective special linear 60.B,O 

projective special unitary 60.H 

projective symplectic 60.L 

projective system of 210.C 

of projective transformations 343.D 

projective unitary 60.F 

proper Lorentz 60.J 258.A 359.B 

proper orthogonal 60.1 258.A 

pseudo- (of topological transformations) 
105.Y 

p-torsion, of exceptional groups App. A, 
Table 6.IV 

qth homology 201.B 

quasi- 190.P 

quasi-Fuchsian 234.B 

quaternion 151.B 

quaternion unimodular 412.G 

quotient 190.C 

quotient (of a topological group) 423.E 

of quotients (of a commutative semigroup) 
190.P 

ramification (of a finite Galois extension) 
257.D 

ramification (of a prime ideal) 14.K 

rational cohomology 200.0 

reductive 13.Q 

Ree 1511 

of Ree type 151.J 

regular polyhedral 151.G 

relative homotopy 202.K 

relative singular homology 201.L 

renormalization 111A 

restricted holonomy 364.E 

restricted homogeneous holonomy |. 364.E 

Riemann-Roch 366.D 

Riesz 36.H 

rotation 60.1 258.A 

Schottky 234.B 

semi- 190.P 396.A 

separated topological 423.B 

sequence of factor (of a normal chain) 190.G 

shape 382.C 

Siegel modular (of degree n) 32.F 

simple 190.C 

simply connected (isogenous to an algebraic 
group) 13.N 

singular homology 201.G,L 

solvable 151.D 190 

space 92.A 

special Clifford 61.D 

special linear 60.B 
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special linear (over a noncommutative field) 
60.0 

special orthogonal 60.1,K 

special unitary 60.F,H,O 

spinor 60.1 61.D 

stability 362.B 

stable homotopy 202.T 

stable homotopy (of classical group) 202.V 

stable homotopy (of the Thom spectrum) 
114.G 

Steinberg (ofa ring) 237 

structure (of a fiber bundle) 147.B 

supersolvable 151.D 

Suzuki 151.1 

symmetric 190.B 

symmetric, of degreen 151.G 

symplectic 60.L 151.I 

symplectic (over a noncommutative field) 
60.0 

symplectic transformation 60.L 

Tate-Shafarevich 118.D 

tetrahedral 151.G 

theoretic approach 215.C 

Tits simple 151.1 

T,-topological 423.B 

topological 423 

topological Abelian 422.A 

topological transformation 431.A 

torsion 2.A 

torsion (of a finite simplicial complex) 201.B 

torus 422.E 

totally ordered 243.G 

totally ordered additive 439.B 

total monodromy 418.F 

transformation 431, App. A, Table 14.III 

transitive permutation 151.H 

of translations 7.E 258.A 

of twisted type 151.1 

typeI 308.L 437.E 

underlying (of topological group) 423.A 

unimodular 60.B 

unimodular locally compact 225.C 

unit (of an algebraic number field) 14.D 

unitary 60.F 151.1 

unitary (over a field) 60.H 


unitary (relative to an e-Hermitian form) 60.0 


unitary symplectic 60.L 

unitary transformation 60.F 

universal covering 91.B 423.0 

unoriented cobordism 114.H 

value (of a valuation) 439.B,C 

vector 422.E 

Wall 114J 

WC (Weil-Chátelet) 118.D 

weakly wandering under 136.F 

web 234.B 

weight 92.C 

Weil 6.E 450.H 

Weil-Chátelet 118.D 

Weyl (of an algebraic group) 13.H 

Weyl (ofa BN pair) 13.R 

Weyl (of a Coxeter complex) 13.R 

Weyl (of a root system) 13.J 

Weyl (of a semisimple Lie algebra} 248.R 

Weyl (of a symmetric Riemannian space) 
413.F 

Weyl, affine 413.F 

Weyl, k- 13.Q 

White 92.D 

Whitehead (of a ring) 237.J 
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Witt (of nondegenerate quadratic forms) 
348.E 
Zassenhaus 151.H 
group algebra 29.C 36.L 
C*- 36.L 
group code 63.C 
group extension 200.M 
grouplike 203.F 
group manifold (of a Lie transformation group) 
110.A 
group measure space construction 136.F 
group minimization problem  215.C 
group object (in a category) 52.M 
groupoid 190.P 
hyper- 190 
group pair (of topological Abelian groups) 422.I 
orthogonal 422.1 
group ring (of a compact group) 69.A 
group scheme 16.H 
group system 235.B 
group theorem (on fractional ideals) 67.J 
group-theoretic approach 215.C 
group variety 13.B 16.H 
algebraic 13.B 
group velocity 446 
growth, infra-exponential 125.AA 
Grunsky inequality 438.B 
Gudermann function (Gudermannian) 131.F, 
App. A, Table 16.III 
guide, wave 130.B 
Guignard constraint qualification 292.B 
Gupta-Bleuler formalism 150.G 
Gysin exact sequence (of a fiber space) 148.E 
Gysin homomorphism 201.0 
Gysin isomorphism, Thom- 114.G 
ofafiber space 148.E 


H 


H, (Hardy spaces) 168.B 
H'(Q) (Sobolev spaces) 168.B 
H$ (Q) (Sobolev spaces) 168.B 
h-cobordant oriented manifolds 114] 
h-cobordism group of n-dimensional homotopy 
spheres 114.1, App. A, Table el 
h-cobordism theorem 114.F 
H-function 402.B 
H-series, principal 437.X 
H-space 203.D 
H-theorem 402.B 
(H, p)-summable 379.M 
H-closed space 425.U 
Haag-Araki axioms 150.E 
Haag-Kastler axioms 150.E 
Haag-Ruelle scattering theory 150.D 
Haag theorem 150.C 
Haar condition (on best approximation) 336.B 
Haar measure 
left-invariant 225.C 
right-invariant 225.C 
Haarspace 142.B 
Haar system of orthogonal functions 317.C 
Haar theorem 225.C 
Hadamard estimation App. A, Table 8 
Hadamard formula, Cauchy- 339.A 
Hadamard gap theorem 339.D 
Hadamard multiplication theorem 339.D 
Hadamard theorem 
(on meromorphic functions) 272.E 
(on singularities of power series) 339.D 
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Hadamard three-circle theorem 43.E 
hadrons 132.B 
Haefliger structure 154.E 

C- 154E 
Hahn-Banach (extension) theorem 

(in a normed space) 37.F 

(in a topological linear space) 424.C 
half Bessel function 39.B 
half-exact (functor) 200.1 
half-life 132.A 
half-line 155.B 

closed (in affine geometry) 7.D 
half-periodic solution (of Hill equation) 268.E 
half-plane 155.B 333.A 
half-space 

of an affine space 7.D 

closed (of an affine space) 7.D 

principal (of a flag) 139.B 

Siegel upper (of degree n) 32.F 

supporting (of a convex set) 89.A 
half-spinor (even, odd) 61.E 
half-spin representation (even, odd) 61.E 
half-trajectory 

negative 126.D 

positive 126.D 
half-width 132.A 
Hall subgroup 151.E 
Hallstr6m-Kametani theorem 124.C 
Halmos theorem, von Neumann- 136.E 
Hamburger moment problem 240.K 
Hamilton canonical equation 271.F 
Hamilton-Cayley theorem  269.F 
Hamilton differential equation 324.E 
Hamilton group 151.B 
Hamiltonian 271.F 351.D 442.D 

bilinear 377.A 

cluster decomposition 375.F 

free 351.D 
Hamiltonian function 219.C 271.F 
Hamiltonian operator 351.D 
Hamiltonian system 126.L 
Hamiltonian vector field 126.L 219.C 
Hamilton-Jacobi differential equation 271.F 324.E 
Hamilton-Jacobi equation 108.B 
Hamilton path 186.F 
Hamilton principle 441.B 
Hamilton quaternion algebra 29.B 
Hammerstein integral equation 217.M 
Hamming bound (of a code) 63.B 
Hamming code 63.C 
Hamming distance 63.B 136.E 
handle 410.B 

attaching 114.F 

Casson 114K 

manifold with 114.F 
handlebody 114.F 
Hankel asymptotic representation App. A, 

Table 19.01 
Hankel determinant 142.E 
Hankel functions 39.B, App. A, Table 19.III 
Hankel transform 220.B 
Hansen, general connectedness theorem due to 
Fulton and 16] 

Hansen-Bessel formula App. A, Table 19.III 
hard, NP- 7L.E 
hard Lefschetz theorem 450.Q 
hardware 75.C 
Hardy class 43.F 159.G 

generalized 164.E 
Hardy inequality App. A, Table 8 


Subject Index 
Hasse norm-residue symbol, Hilbert- 


Hardy-Littlewood-Sobolev inequality 224.E 
Hardy-Littlewood supremum theorem App. A, 
Table 8 
Hardy-Littlewood theorem 
on bounded functions 43.E 
on trigonometric systems 317.B 
Hardy space 168.B 
Hardy theorem 
on bounded functions 43.E 
on the Cauchy product of two series 379.F 
harmonic 
(form) 194.B 
(function) 193.A 
(function on a state space) 260.D 
(mapping) 195.B 
harmonically separated points (in a projective space) 
343.D 
harmonic analyzer 19.E 
harmonic analysis 192 
harmonic boundary 207.B 
harmonic conjugates 343.D 
harmonic continuation 193.M 198.G 
harmonic differential (on a Riemann surface) 367.H 
harmonic dimension (of a Heins end) 367.E 
harmonic flow 193.K 
harmonic functions 193 
conjugate 193.C 
spherical 193.C 
harmonic integrals 194 
harmonic kernel function 188.H 
harmonic majorant (of subharmonic function) 
193.S 
harmonic mapping 195 
harmonic mean 
(of a distribution) 397.C 
(ofa function) 211.C 
(of numbers) 211.C 
harmonic measure 
inner 169.B 
outer 169.B 
harmonic motion, simple 318.B 
harmonic oscillation 318.B 
harmonic range of points 343.D 
harmonics 
ellipsoidal 133.B 
ellipsoidal, of four species 133.C 
solid 393.A 
spherical 193.C 393.A 
surface 393.A 
tesseral 393.D 
zonal 393.D 
Harnack condition (on the D-integral) 100.E 
Harnack first theorem 193.1 
Harnack lemma 193.1 
Harnack second theorem 193.1 
Hartogs continuation theorem 21.F 
Hartogs-Osgood theorem 21.H 
Hartogs theorem 
of continuity 21.H 
ofholomorphy 21.C 
Hartshorne conjecture 16.R 
Harvey duality, Martineau- 125.Y 
hashing 96.B 
Hasse character, Minkowski- (of a nondegenerate 
quadratic form) 348.D 
Hasse conjecture 450.8 
Hasse function, Artin- 257.H 
Hasse invariant (of a central simple algebra over 
a p-adic field) 29.G 
Hasse norm-residue symbol, Hilbert- 14.R 


Subject Index 
Hasse principle 


Hasse principle 348.G 

Hasse theorem, Minkowski- (on quadratic forms 
over algebraic number fields) 348.G 

Hasse-Witt matrix 9.E 

Hasse zeta function 450.8 

Haupt theorem 268.E 

Hauptvermutung (in topology) 65.C 70.C 

Hausdorff axiom 425.Q 


Hausdorff dimension 117.G 234.E 246.K 
Hausdorff formula, Campbell- 249.R 
Hausdorff measure 169.D 


Hausdorff moment problem 240.K 
Hausdorff space 425.Q 

collectionwise 425.AA 
Hausdorff theorem, Baire- 273.J 425.N 
Hausdorff topological group 423.B 
Hausdorff uniform space | 436.C 
Hausdorff-Young inequality 224.E 
Hausdorff-Young theorem 317.B 
Hawaiian earring 79.C 
hazard function 397.0 
hazard rate 397.0 
heat 

Joule 130.B 

specific, at constant pressure 419.B 

specific, at constant volume 419.B 
heat bath 419.B 
heat conduction, equation of 327.A 
heat equation 327.A, App. A, Table 15.VI 
heat gain 419.A 
heatloss 419.A 
Heaviside calculus | 306.A 
Heaviside function  125.E 306.B, App. A, Table 12.II 
Hecke algebra 29.C 32.D 

global 450.0 
Hecke character 6.D 
Hecke L-function 450.E 

with Gróssencharakter 450.F 
Hecke operator 32.D 

zeta function defined by 450.M 
Hecke ring 32.D 
Heegaard decomposition 65.C 
height 

(of an algebraic number) 430.B 

(of an element in a lattice) 243.F 

(of an ideal) 67.E 

(of an isogeny) 3.F 

(of a lattice) 243.F 

(of a prime ideal) 67.E 

infinite (element of an Abelian p-group} 2.D 
Heilbronn phenomenon, Deuring- 123.D 
Heine-Borel theorem 273.F 
Heine series 206.C 
Heins end 367.E 
Heisenberg equation of motion 351.D 
Heisenberg picture 351.D 
Heisenberg uncertainty relation 351.C 
helicity 258.C 


helicoid 
ordinary 114.1 
right 1111 


helicoidal surface 111.1 
Helinger-Hahn theorem 390.G 
helix 
generalized 111.F 
ordinary 111.F 
Helly theorem 94.B 
Helmholtz differential equation 
Table 15.VI 
Helmholtz free energy 419.C 


188.D, App. A, 
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minimum principle 419.C 
Helmholtz theorem (on vector fields) 442.D, 
App. A, Table 3.III 
Helmholtz vorticity theorem 205.B 
Helson set 192.R 
hemisphere 
northern 140 
southern 140 
Henselian ring 370.C 
Henselization 370.C 
Hensel lemma 118.C 
Hensel ring 370.C 
Herbrand lemma 200.N 
Herbrand quotient 200.N 
hereditarily normal space 425.Q 
hereditarily quotient mapping 425.G 
hereditarily weak topology 425.M 
hereditary finite set 33.B 
hereditary ring 200.K 
left 200.K 
right 200.K 
Herglotz integral representation 43.] 
Herglotz theorem 192.B 
Hermite differential equation App. A, Table 14.II 
Hermite differential equation, Weber- 167.C 
Hermite formula, Gauss- (in numerical integration) 
299.A 
Hermite interpolation polynomial 223.E 
Hermite polynomials 317.D, App. A, Table 20.1V 
Hermite polynomials, Fourier- 176.1 
Hermitian (integral operator) 251.0 
Hermitian form 256.Q 348.F 
anti- 256.Q 
e 600 
indefinite 348.F 
negative definite 348.F 
positive definite 348.F 
semidefinite 398.F 
skew- 256.Q 
v- 384.A 
Hermitian homogeneous space, complex 
Hermitian hyperbolic space 412.G 
Hermitian inner product 256.Q 
Hermitian kernel 217.H 
Hermitian linear space 256.Q 
Hermitian matrix 269.I 
anti- 269.1 
skew 269.1 
Hermitian metric 232.A 
Hermitian operator 251.E 
Hermitian space 
irreducible symmetric 412.E 
symmetric 412.E 
Heron formula 
(for plane triangles) App. A, Table 2.11 
(for spherical triangles) App. A, Table 2.III 
Hersch and Pfluger, extremal length defined by 
143.A 
Hersch problem 391.E 
Hessenberg method 298.D 
Hesse normal form (of a hyperplane) 139.H 
Hessian 
(on a differential manifold) 279.B,F 
(form) 226.D 
(of a plane algebraic curve) 9.B 
heterogeneity, design for two-way elimination of 
102.K 
heuristic algorithm 215.E 
Hewitt-Savage zero-one law 342.G 
hexagon 155.F 


199.A 
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hexagonal (system) 92.E 
hexahedron 357.B 
Hey zeta function 27.F 
hidden variable theories 351.L 
hierarchy 356.H 
analytic 356.H 
arithmetical 356.H 
arithmetical, of degrees of recursive unsolvability 
356.H 
C-analytic 356.H 
C-arithmetical 356.H 
Chomsky 31.D 
hyperarithmetical, of degrees of recursive 
unsolvability 356.H 
hierarchy theorem 356.H 
Higgs mechanism 132.D 
higher algebraic K-theory 237.J 
higher differentiation (in a commutative ring) 113 
higher order, of (for infinitesimals) 87.G 
higher-order derivative (of a differentiable function) 
106.D 
higher-order ordinary differential equation 
App. A, Table 14I 
higher-order partial derivative 106.G 
higher-order predicate logic 411.K 
higher-transcendental function 389.A 
highest weight (of a representation of a Lie algebra) 
248.W 
high-precision computation 138.B 
Hilbert, D 196 
Hilbert basis theorem 284.A 
Hilbert characteristic function 
(of a coherent sheaf) 16.E 
(of a graded module) 369.F 
Hilbert cube .382.B 
Hilbert -operator 411.J 
Hilbert e-quantifier 411.J 
Hilbert e-symbol 411.J 
Hilbert fifth problem 423.N 
Hilbert-Hasse norm-residue symbol 14.R 
Hilbertian space, countably 424.W 
Hilbert inequality App. A, Table 8 
Hilbert invariant integral 46.C 
Hilbert irreducibility theorem (on polynomials) 
337.F 
Hilbert manifold 105.Z 286.K 
Hilbert modular form 
of dimension —k 32.G 
of weight k 32.G 
Hilbert modular function 32.G 
Hilbert modular group 32.G 
Hilbert modular surface 15.H 
Hilbert norm-residue symbol 14.R 
Hilbert Nullstellensatz 369.D 
Hilbert polynomial 
(of an algebraic curve) 9.F 
(of a coherent sheaf) 16.E 
(of a graded module) 369.F 
Hilbert problem (in calculus of variations) 46.A 
Hilbert problem, Riemann- 
(for integral equations) 217.J 
(for ordinary differential equations) 253.D 
Hilbert scheme 16.8 
Hilbert-Schmidt class 68.1 
Hilbert-Schmidt expansion theorem 217.H 
Hilbert-Schmidt norm  68.I 
Hilbert-Schmidt type 
integral operator of 68.C 
kernel of 217.1 
Hilbert spaces 197 


Subject Index 
Holomorphic 


adjoint 251.E 
channel 375.F 
complex 197.B 
exponential 377.D 
physical 150.G 
pre- 197.B 
real 197.B 
rigged 424T 
Hilbert-Speiser theorem 172J 
Hilbert system of axioms (foundations of geometry) 
155.B 
Hilbert syzygy theorem 369.F 
Hilbert theorem 90 172 
Hilbert transform 160.D 220.E 
Hilbert zero point theorem 369.D 
Hilferty approximations, Wilson- 374.F 
Hill determinant 268.B 
Hill determinantal equation 268.B 
Hill differential equation 268.B 
Hille-Yosida theorem 378.B 
Hill function 268.E 
Hill method of solution 268.B 
Hirsch theorem, Leray- 201.J 
Hirzebruch index theorem (for differentiable 
manifolds) 56.G 
Hirzebruch signature theorem (on algebraic surface) 
72.K 
Hirzebruch surface 15.G 
Hirzebruch theorem of Riemann-Roch type 366.B 
histogram 397.B 
Hitchcock method 301.E 
hitting probability 5.G 
hitting time 260.B 261.B 407.B 
Hlawka theorem, Minkowski- 182.D 
Hochschild cohomology group 200.L 
Hodge conjecture 450.8 
Hodge index theorem 15.D 
Hodge manifold 232.D 
Hodge metric 232.D 
Hodges-Lehmann theorem 399.E,H 
Hodge spectral sequence 16.U 
Hodge structure (of a vector space) 16.V 
mixed 16.V 
polarized 16.V 
Hodgkin-Huxley differential equation 291.F 
hodograph method 205.B 
hodograph plane 205.B 
hold almost everywhere (in a measure space) 270.D 
hold at almost all points 
in a measure space 270.D 
Holder condition of order o 84.A 
Holder inequality 211.C, App. A, Table 8 
Holder integral inequality 211.C 
Holder method of order p, summable by 379.M 
Holder sequence, Jordan- (ina group) 190.G 
Holder space 168.B 
Holder theorem 104.F 
Holder theorem, Jordan- (in group theory) 190.G 
Holder theorem, Jordan- (on representations 
of algebras) 362.D 
hole theory, Dirac 415.G 
Holmgren type theorem (of Kashiwara-Kawai) 
125.DD 
Holmgren uniqueness theorem 321.F 
holohedral 92.B 
holohedry 92.B 
holomorphic 
(family of linear operators) 331.C 
(function) 198.A 
(in the sense of Riemann) 21.C 


Subject Index 
Holomorphically complete domain 


(vector-valued function) 37.K 
holomorphically complete domain 21.F 
holomorphically complete space 23.F 
holomorphically convex domain 21.H 
holomorphic automorphism 21.J 
holomorphic differential (on a Riemann surface) 

367.H 
holomorphic differential form of degree E 72.A 
holomorphic distribution (with respect to a param- 
eter) 125.H 
holomorphic evolution operator 378.1 
holomorphic foliation 154.H 
holomorphic function(s) 198 

(on a complex manifold) 72.A 

(of many variables) 21.A,C 

germofa 21.E 

sheaf of germs of 23.C 383.D 
holomorphic functional calculus 36.M 
holomorphic hull 21.H 
holomorphic k-form 72.A 
holomorphic local coordinate system 72.A 
holomorphic mapping 21.J 

(of acomplex manifold) 72.A 

nondegenerate (between analytic spaces) 23.C 
holomorphic microfunction 274.F 
holomorphic modification (of an analytic space) 

23.D 
holomorphic part (in a Laurent expansion) 198.D 
holomorphic sectional curvature 364.D 
holomorphic semigroup 378.D 
holomorphic tangent vector 72.A 
holomorphic vector field 72.A 
holomorphy 

domain of 21.F 

envelop of 21.F 

Hartogs theorem of 21.C 
holonomic (coherent &-module) 274.H 
holonomic systems 

with regular singularities 274.H 

simple 274.H 
holonomy 154.C 
holonomy group 80.D 154.C 364.D 

homogeneous 364.E 

restricted 80.D 364.E 

restricted homogeneous 364.E 
holonomy homomorphism  154.C 

linear 154.C 
holosymmetric class 92.B 
homentropic flow 205.B 
homeomorphic 425.G 
homeomorphism 425.G 

minimal 136.H 

PL 65.A 

strictly ergodic 136.H 

uniquely ergodic 136.H 
homeomorphism problem 425.G 
homoclinic point 126.J 

transversal 126.J 
homogeneous 

(A-submodule) 200.B 

(boundary value problem) 315.B 

(difference equation) 104.C 

(lattice) 182.B 

(linear ordinary differential equation) 252.A 

(system of linear differential equations of 

the first order) 252.G 

spatially (process) 261.A 

temporally (additive process) 5.B 

temporally (process) 261.A 

weighted (analytic function) 418.D 
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homogeneous bounded domain 384.A 412.F 
homogeneous coordinate ring 16.A 
homogeneous coordinates 343.C 
homogeneous difference equation 104.C 
homogeneous element 

ofa graded ring 369.B 

of a homogeneous ring 369.B 
homogeneous equations, system of linear 269.M 
homogeneous holonomy group 364.E 

restricted 364.E 
homogeneous hypersurface 344.A 
homogeneous ideal 

ofa graded ring 369.B 

ofa polynomial ring 369.B 
homogeneous integral equation 217.F 
homogeneous Lorentz group 258.A 359 
homogeneously regular 275.C 
homogeneous Markov process 5.H 261.A 
homogeneous n-chain (for a group) 200.M 
homogeneous ordinary differential equation 

App. A, Table 14.I 

of higher order App. A, Table 14.I 
homogeneous part (of a formal power series) 370.A 
homogeneous polynomial 337.B 
homogeneous ring 369.B 
homogeneous Siegel domain, irreducible 384.E 

topology of Lie groups and 427 
homogeneous space(s) 199 249.F 362.B 

complex Hermitian 199.A 

Kahler 199.A 

linearly connected 199.A 

reductive 199.A 

Riemannian 199.A 

symmetric 412.B 

symmetric Riemannian 412.B 
homogeneous turbulence 433.C 
homological algebra 200 

relative 200.K 
homological dimension 

ofa module 200.K 

ofatopologicalspace 117.F 
homological functor | 200. 
homological mapping 200.C 
homologous 201.B 
homologous to zero 198.B 
homology 200.H 

intrinsic 114.H 
homology basis, canonical 11.C 
homology class 200.H 

fundamental] 201.N 

fundamental, around K 201.N 

q-dimensional 201.B 
homology exact sequence 201.L 

(of fiber space) 148.E 

reduced 201.F 
homology group(s) 

(of a chain complex) 201.B 

(ofa group) 200.M 

(of a Lie algebra) 200.0 

(ofa polyhedron) 201.D 

(of a simplicial complex) 201.G 

absolute 201.L 

Cech 201.M 

cellular 201.F,G 

with coefficients in G 201.G 

integral 201.C,D 

integral singular 201.E 

local 201.N 

reduced 201.E 

relative Cech 201.M 
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relative singular 201.L 
simplicial 201.D 
singular 201.G,L,R 
homology manifold 65.B 
homology module 200.C 
homology theory 201 
generalized 201.Q 
generalized, with E-coefficient 202.T 
uniqueness theorem of 201.Q 
homomorphic 
(algebraic system) 409.C 
(groups) 190.D 
(topological groups) 423J 
order (ordered sets) 311.E 
homomorphic image (of a measure-preserving 
transformation) 136.D 
homomorphism 
(of Abelian varieties) 3.C 
(of algebraic systems) 409.C 
(of fields) 149.B 
(of groups) 190.D 
(of lattices) 243.C 
(of Lie algebras) 248.A 
(of linear representations) 362.C 
(of presheaves) 383.A 
(of rings) 368.D 
(of sheaves) 363.B 
A- (of A-modules) 277.E 
A-, of degree p (of graded A-modules) 200.B 
admissible (of Q-groups) 190.E 
algebra 29.A 
allowed (of A-modules) 277.E 
analytic (of Lie groups) 249.N 
anti- (of groups) 190.D 
anti- (of rings) 368.D 
bialgebra 203.G 
Bokshtein 64.B 
boundary (on homology groups) 201.L 
boundary (in homotopy exact sequences) 
202.L 
C°-(between Lie groups) 249.N 
canonical (on direct products of rings) 368.E 
coalgebra 203.F 
coboundary (on cohomology groups) 201.L 
connecting (in homology) 200.C 201.L 
connecting (on homology groups) 201.C 
continuous (of topological groups) 423J 
crossed (of an associative algebra) 200.L 
dual (of a homomorphism of algebraic tori) 
13.D 
dual (of lattices) 243.C 
edge 200J 
equivariant J- 431.F 
generalized Hopf 202.V 
Gysin 201.0 
holonomy 154.C 
Hopfalgebra 203.H 
Hurewicz 202.N 
induced by a continuous mapping (between 
homotopy groups) 202.K 
J- (in homotopy theory) 202.V 
J-(in K-theory) 237] 
Jordan (of Jordan algebras) 231.A 
lattice- 243.C 
local (of a topological group) 423.0 
module of (of modules) 277.B 
module of A- (of A-modules) 277.E 
Q- (of O-groups) 190.E 
open continuous (of topological groups) 423J 
operator (of A-modules) 277.E 


Subject Index 
Homotopy set 


operator (of Q-groups) 190.E 
order 311.E 
rational (of Abelian varieties) 3.C 
rational (of algebraic groups) 13.A 
ring 368.D 
*- 36.F 
Umkehr 201.0 
unitary (of rings) 368.D 
zero (of two A-modules) 277.H 
homomorphism theorem 
on groups 190.D 
on Lie algebras 248.A 
on topological groups 423.3 
on topological linear spaces 424.X 
homothetic correspondence (between surfaces) 
114.1 
homothety 
in conformal differential geometry 110.D 
in Euclidean geometry 139.B 
homotopic 154.E,F 202.B 
chain (chain mappings) 200.C 
integrably 154.F 
null- (continuous mapping) 202.B 
regularly (immersions) 114.D 
relative to a subspace 202.B 
to zero 202.B 
homotopy 202.B 
composite 202.B 
free 202.B 
linear 114.D 
restricted 202.B 
homotopy-associative (multiplication) 203.D 
homotopy category of topological spaces 52.B 
homotopy chain 200.C 
homotopy class 202.B 
compact 286.D 
homotopy cochain 200.F 
homotopy commutative (multiplication) 203.D 
homotopy equivalence 202.F 
simple 65.C 
weak 202.F 
homotopy equivalent systems (of topological 
spaces) 202.F 
homotopy exact sequence 202.L 
ofafiber space 148.D 
ofa triad 202.M 
ofa triple 202.L 
homotopy extension property 202.E 
homotopy group(s) 202J 
algebraic 16.U 
of a compact connected Lie group App. A, 
Table 6.VII 
realization theorem of 202.N 
of a real Stiefel manifold App. A, Table 6.VIT 
relative 202.K 
ofasphere App. A, Table 6.VI 
stable 202.T, App. A, Table 6.VII 
stable (of classical groups) 202.V 
stable (of the k-stem) 202.U 
stable (of Thom spectrum) 114.G 
ofatriad 202.M 
homotopy identity (of an H-space) 203.D 
homotopy invariance (of a homology group) 201.D 
homotopy invariant 202.B 
homotopy inverse (for an H-space) 203.D 
homotopy n-spheres 
groupof 1141 
h-cobordism group of 114.1 
homotopy operations 202.0 
homotopy set 202.B 


Subject Index 
Homotopy sphere 


homotopy sphere 65.C 
homotopy theorem 
first (in obstruction theory) 305.B 
second (in obstruction theory) 305.C 
simple 65.C 
third (in obstruction theory) 305.C 
homotopy theory 202 
de Rham 114.L 
homotopy type 202.F 
(ofa link) 235.D 
spherical G-fiber 431.F 
homotopy type invariant 202.F 
Hooke law 271.G 
Hopf algebra(s) 203 
dual 203.C 
elementary 203.D 
graded 203.C 
Hopf algebra homomorphism 203.H 
Hopf bifurcation 126.M 
Hopf bundle 147.E 
Hopf classification theorem 202.1 
Hopf comultiplication 203.D 
Hopf coproduct 203.D 
Hopf extension theorem 270.E 
Hopf Deng 147.E 
Hopf homomorphism, generalized (of homotopy 
groups of spheres) 202.U 
Hopf integrodifferential equation, Wiener. 222.C 
Hopf invariant 202.U 
generalized 202.Q 
modulo p 202.8 
Hopf mapping (Hopf map) 147.E 
Hopf surface 72.K 
Hopf theorem (continuous vector field) 153.B 
Hopf weak solution 204.C 
horizon, event 359.F 
horizontal components 
of a homogeneous space 110.A 
of a vector field 80.C 
horizontal slit mapping, extremal 367.G 
horizontal subspace 191.C 
horizontal vector (in a differentiable principal fiber 
bundle) 80.C 
Hórmander theorem 112.C,D 
horned sphere, Alexander 65.G 
Horner method 301.C 
horocycle flow 136.G 
horosphere 218.G 
horseshoe diffeomorphism  126.J 
Hosokawa polynomial 235.D 
Hotelling T? statistic 280.B 
noncentral 374.C 
Householder method 298.D 
Householder transformation 302.E 
Hugoniot relation, Rankine- 204.G 205.B 
Hukuhara theorem, Dini- 314.D 
Hukuhara problem 315.C 
hull 
closed convex | 424.H 
convex 89.A 
convex (in an affine space) 7.D 
convex (of a boundary curve) 275.A 
convex {in linear programming) 255.D 
holomorphic 21.H 
hull-kernel topology 36.D 
human death and survival, model of 214.A 
Hunt process 261.B 
Hunt-Stein lemma 400.F 
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Hurewicz homorphism | 202.N 
Hurewicz isomorphism theorem 202.N 
Hurewicz-Steenrod isomorphism theorem 148.D 
Hurewicz theorem, generalized 202.N 
Hurewicz-Uzawa gradient method, Arrow- 292.E 
Hurwitz formula, Riemann- (on coverings of 
a nonsingular curve) 9.I 
Hurwitz relation 
(on homomorphisms of Abelian varieties) 3.K 
Riemann- 367.B 
Hurwitz theorem 10.E 
Hurwitz zeta function 450.B 
Huxley differential equation, Hodgkin- 291.F 
Huygens principle 325.B 446 
in the wider sense 325.D 
hybrid computer 19.E 
hydrodynamics 205 
hydromagnetic dynamo theory 259 
hydromagnetics 259 
hydrostatics 205.A 
hyperalgebra 203.1 
hyperarithmetical function 356.H 
hyperarithmetical hierarchy of degrees of recursive 
unsolvability 356.H 
hyperarithmetical predicate 356.H 
hyperbola 78.A 
conjugate 78.E 
equilateral 78.E 
rectangular 78.E 
hyperbolic 
(closed invariant set of a dynamical system) 
126J 
(differential operator) 112.A 325.H 
(linear mapping) 126.G 
(partial differential equation) 325.A,E 
(Riemann surface) 367.D,E 
(simply connected domain) 77.B 
(space form) 412.H 
complete 21.0 
regularly 325.A,F 
in the sense of Garding 325.F 
in the sense of Petrovskii 325.F 
in the strict sense 325.F 
strongly 325.H 
symmetric (in the sense of Friedrichs) 325.G 
weakly 325.H 
hyperbolically embedded 21.0 
hyperbolic closed orbit 126.G 
hyperbolic coordinates 
equilateral 90.C, App. A, Table 3.V 
rectangular 90.C 
hyperbolic cosecant 131.F 
hyperbolic cosine 131.F 
hyperbolic cotangent 131.F 
hyperbolic cylinder 350.B 
hyperbolic cylindrical coordinates App. A, 
Table 3.V 
hyperbolic cylindrical surface 350.B 
hyperbolic differential equations, system of (in 
the sense of Petrovski) 325.G 
hyperbolic-elliptic motion 420.D 
hyperbolic fixed point 126.G 
hyperbolic function 131.F 
hyperbolic geometry 285.A 
hyperbolic knot 235.E 
hyperbolic manifold 21.0 235.E 
hyperbolic motion 420.D 
hyperbolic-parabolic motion 420.D 
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hyperbolic paraboloid 350.B 
hyperbolic plane 122.C 
hyperbolic point (on a surface) 111.H 
hyperbolic quadric hypersurface 350.] 
hyperbolic secant  131.F 
hyperbolic sine 131.F 
hyperbolic singular point 126.G 
hyperbolic space 285.C 
Hermitian 412.G 
quaternion 412.G 
real 412.G 
hyperbolic spiral 93.H 
hyperbolic tangent 131.F 
hyperbolic transformation 76.F 
hyperbolic type, partial differential equation of 
321.E 325 
hyperbolic type, primitive 92.C 
hyperboloidic position 350.B 
hyperboloid of one sheet 350.B 
hyperboloid of revolution of one or two sheets 
350.B 
hyperboloid of two sheets 350.B 
hypercohomology 200.J 
hyperconstructive ordinal 81.E 
hypercubic type, primitive 92.C 
hyperelliptic curve 9.D 
hyperelliptic integra] 11.C 
hyperelliptic Riemann surface 11.C 
hyperelliptic surface 72.K 
hyperfinite 293.B 308.1 
hyperfunction 125 
in the Dirichlet problem 120.C 
exponentially decreasing Fourier 125.BB 
Fourier 125.BB 
Fourier ultra- 125.BB 
modified Fourier 125.BB 
Sato 125.V 
hypergeometric differential equation 260.A, 
App. A, Table 18.1 
confluent 167.A, App. A, Tables 14.IL19.I 
Gauss App. A, Table 14.II 
hypergeometric distribution 341.D 397.F, 
App. A, Table 22 
multidimensional App. A, Table 22 
multiple 341.D 
hypergeometric function(s) 209, App. A, Table 18.1 
Appell, of two variables 206.D, App. A, 
Table 18.I 
Barnes extended 206.G, App. A, Table 18.1 
of confluent type 167.A, App. A, Table 19.1 
of the hyperspherical differential equation 
393.E 
with matrix argument 206.E 
and spherical functions App. A, Table 18 
hypergeometric integral 253.B 
hypergeometric series 206.A 
hypergeometric type, special function of 389.A 
hypergroup 190.P 
hypergroupoid 190.P 
hyperinvariant (under an operator) 251.L 
hyperplanar symmetry (of an affine space) 139.B 
hyperplane(s) 
in an affine space "A 
characteristic (of a partial differential equation 
of hyperbolic type) 325.A 
at infinity (in affine geometry) 7.B 
pencil of (in a projective space) 343.B 
in a projective space 343.B 


Subject Index 
Hypotrochoid 


regression 403.D 
tangent (of a quadric hypersurface) 343.E 
hyperplane coordinates 
of an affine frame 7.C 
in projective geometry 343.C 
hyperplane section 418.1 
hyperquadric 
in an affine space 350.G 
in a projective space 343.D 350.I 
hypersonic flow 205.C 
hypersphere 76.A 
imaginary 76.A 
limiting (in hyperbolic geometry) 285.C 
non-Euclidean 285.C 
oriented real 76.A 
point 76.A 
proper (in hyperbolic geometry) 285.C 
real 76.A 
hypersphere geometry 76.A 
hyperspherical coordinates, (n--2)- 76.A 90.B 
hyperspherical differential equation 393,E 
hypersurface(s) 
(of an algebraic variety) 16.A 
(in a Euclidean space) 111.A 
central quadric 350.G 
characteristic (of a partial differential equation 
of hyperbolic type) 325.A 
coordinate (in a Euclidean space) 90.C 
elliptic quadric 350.G 
homogeneous 344.A 
hyperbolic quadric 350.G 
integral (partial differential equations) 320.A 
noncentral quadric 350.G 
nondegenerate 344.A 
parabolic quadric 350.G 
pencil of quadric 343.E 
properly (n — 1)-dimensional quadric 350.G 
quadric 343.D 350.G,] 
quadric conical 350.Q 
quadric cylindrical 350.Q 
regular quadric 343.E 
singular quadric (of the hth species) 343.E 
spherical real 344.C 
hypersurface element(s) 82.A 324.B 
union of 82.A 
hypocontinuous (bilinear mapping) 424.Q 
hypocylcoid 93.H 
hypo-Dirichlet 164.B 
hypoelliptic 112.D 189.C 323.1 
analytically 112.D 323.1 
hypofunction (in the Dirichlet problem)  120.C 
hyponormal 251.K 
hypothesis 
alternative 400.A 
composite 400.A 
continuum —continuum hypothesis 
ergodic 136.A 402.C 
general linear 400.H 
Lindelöf 123.C 
null 400.A 
Riemann 450.B,P 
simple 400.A 
statistical 400.A 
Suslin 33.F 
hypothesis testing 401.C, App. A, Table 23 
statistical 400, App. A, Table 23 
hypothetical infinite population 397.P 
hypotrochoid 93.H 


Subject Index 
i-genus 


i-genus 15.E 
I-adic topology (of a ring) 16.X 
ith component (of an n-tuple) 256.A,C 
ith coordinate 256.C 
ith coordinate axis (of a Euclidean space) 140 
icosahedral group 151.G 
icosahedron 357.B 
ideal (s) 
(ofanalgebra) 29.A 
(of an algebraic number field) 14.B 
(ofa lattice) 42.C 
(of a Lie algebra) 248.A 
(ofa ring) 368.F 
Abelian (of a Lie algebra) 248.C 
Alexander (of a knot) 235.C 
ambig (of a quadratic field) 347.F 
conjugate (of a fractional ideal) 14.1 
defining (of a formal spectrum)  16.X 
differential (of a differential ring) 113 
differential (on a real analytic manifold) 428.E 
elementary 235.C 
fractional (of an algebraic number field) 14.E 
graded 369.B 
homogeneous (of a graded ring) 369.B 
homogeneous (of a polynomial ring) 369.B 
integral (of an algebraic number field) 14.C 
integral left 27.A 
integral right 27.A 
integral two-sided o- 27.A 
involutive differential 428.E 
largest nilpotent (of a Lie algebra) 248.D 
left (of a ring) 368.F 
left o- 27.A 
maximal 67.C 
maximal (left or right) 368.F 
maximal (with respect to S$) 67.C 
maximal, space (of a Banach algebra) 36.E 
minimal (left or right)  368.F 
mixed 284.D 
nilpotent (of a Lie algebra) 248.C 
order (of a vector lattice) 310.B 
p-primary 67.F 
primary 67.F 
prime 67.C 
prime (of a maximal order) 27.A 
prime differential (of a differential ring) 113 
primitive (of a Banach algebra) 36.E 
principal 67.K 
principal (of an algebraic number field) 14.E 
principal, theorem (in class field theory) 59.D 
principal fractional 67.K 
pure 284.D 
right (ofa ring) 368.F 
right o, 27.A 
semiprime (of a commutative ring) 113 
semiprime differential (of a differential ring) 
113 
sheaf of (of a divisor of a complex manifold) 
72.F 
solvable (of a Lie algebra) 248.C 
two-sided (of a ring) 368.P 
two-sided v- 27.A 
unmixed 284.D 
valuation (of a valuation) 439.B 
ideal boundary 207.A 
in the narrow sense 14.G 
ideal class 14.E 
ideal class group 14.E 67.K 
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ideal group modulo mt 14.H 
ideal point (in hyperbolic geometry) 285.C 
idele (of an algebraic number field) 6.C 
principa! 6.C 
idele class 6.D 
idele class group |. 6.D 
idele group GC 
idempotent element (of a ring) 368.B 
elementary 450.0 
primitive 368.B 
idempotent law (in a lattice) 243.A 
idempotent measure 192.P 
idempotent set (of a ring) 368.B 
idempotent theorem 36.M 
identically true formula 411.G 
identification (in factor analysis) 280.G 
identification space (by a partition) 425.L 
identified equation 128.C 
just 128.C 
over- 128.C 
identity (identities) 231.A 
Bianchi 80 417.B 
Bianchi first App. A, Table 4.II 
Bianchi second App. A, Table AU 
homotopy (of an H-space) 203.D 
Jacobi (on the bracket of two vector fields) 
105.M 
Jacobi (in a Lie algebra) 248.A 
Jacobi (with respect to Whitehead product) 
202.P 
in Jordan algebras 231.A 
Lagrange 252.K 
Parseval 18.B 159.A 160.C 192.K 197.C 
220.B,C,E 
resolution of 390.D 
theorem of (of one variable) 198.C 
theorem of (of several variables) 21.C 
identity character (of an Abelian group) 2.G 
identity component (of a topological group) 423.F 
identity element 
of an algebraic system 409.C 
ofa field 149.A 
ofa group 190.A 
of a local Lie group 423.L 
ofa ring 368.A 
identity function 381.C 
identity mapping 381.C 
identity matrix 269.A 
identity operator (in a linear space) 37.C 
identity relation 102.] 
Shara zeta function 450.U 
Ikehara-Landau theorem, Wiener- 123.B 
ill-conditioned (coefficient matrix in numerical 
solution of linear equations) 302.D 
image 
(of a group homomorphism) 190.D 
(of a linear mapping) 256.F 
(of a line in P?) 343.E 
(ofa mapping} 381.C 
(ofa morphism) 52.N 
(of an operator homomorphism) 277.E 
(of a sheaf homomorphism) 383.D 
closed (of a variety) 18.1 
continuous 425.G 
direct (of a sheaf) 383.G 
homomorphic (of a measure preserving 
transformation) 136.D 
inverse (of a set) 381.C 
inverse (of a sheaf) 383.G 
inverse (of a uniformity) 436.E 
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perfect 425.CC 
perfect inverse 425.CC 
image measure 270.K 
imaginary axis 74.C 
imaginary field, totally 14.F 
imaginary hypersphere 76.A 
imaginary infinite prime divisor 439.H 
imaginary number 74.A 
purely 74.A 
imaginary part 74.A 
imaginary prime divisor 439.H 
imaginary quadratic field 347.A 
imaginary root (of an algebraic equation) 10.E 
imaginary transformation, Jacobi’s 134.1, App. A, 
Table 16.III 
imaginary unit 74.A 294.F 
imbedded Markov chain 260.H 
imbedding 105.K 
imbedding principle 127.B 
immersed submanifold (of a Euclidean space) 
LILA 
immersion 
(of a C?-manifold) 105.K 
(of a Riemann surface) 367.G 
branched minimal 275.B 
generalized minimal 275.B 
isometric 365.A 
Kahler 365.L 
minimal 275.A 
minimum 365.0 
tight 365.0 
totally real 365.T 
imperfect field 149.H 
implication 411.B 
strict 411.L 
implicit 
(difference equation in a multistep method) 
303.E 
(Runge-Kutta method) 303.D 
implicit enumeration method 215.D 
implicit functions 165.C 208 
implicit function theorem 208.A 286.G 
(in Banach algebras) 36.M 
(in locally convex spaces) 286.3 
global 208.D 
Nash-Moser 286.J 
implicit method 303.E 
implicit scheme 304.F 
impossible construction problem 179.A 
impossible event 342.B 
impredicative (object) 156.B 
imprimitive (permutation group) 151.H 
improper integral 216.D,E 
convergent 216.E 
divergent 216.E 
improper Riemann integral 216.E 
improvement, iterative 302.C 
impulse control 405.E 
impulse function 306.B, App. A, Table 12.11 
imputation [73.D 
imputed costs 292.B 
imputed prices 292.C 
IMT formula 299.B 
inaccessible, strongly 33.F 
inaccessible cardinal number 
strongly 33.E 
weakly 33.E 
inaccessible ordinal number 
strongly 312.E 
weakly 312.E 


Subject Index 
Independent 


Ince definition 268.D 
Ince-Goldstein method 268.C 
incidence matrix 
ofa block design 102.B 
ofagraph 186.G 
incidence number  146.B 201.B 
incidence relation 282.A 
inclination, curve of constant 111.F 
inclination problem, critical 55.C 
inclusion 381.C 
inclusion mapping 381.C 
incoming wave operator 375.B 
incompatible system (of partial differential 
equations) 428.B 
incomplete beta function App. A, Table 17.I 
incomplete blocks 102.B 
incomplete elliptic integral of the first kind 134.B 
incomplete factorization 302.C 
incomplete gamma function 174.A, App. A, 
Table 17.1 
incompleteness theorem, Gödel 156.E 
first 185.C 
second 185.C 
incompressible (measurable transformation) 136.C 
incompressible fluid 205.B 
inconsistent problem (of geometric construction) 
179.A 
inconsistent system (of algebraic equations) 10.A 
increasing (sequence function or distribution) 
monotone 166.A 380.B 
monotonically 87.B 
non- 166.A 
slowly (C*?-function) 125.0 
slowly (distribution) 125.N 
slowly (in the sense of Deny) 338.0 
slowly, sequence 168.B 
strictly 166.A 
strictly monotone 166.A 
increasing directed set 308.A 
increasing process 406.B 
integrable 406.B 
increment 
of a function 106.B 
process with independent 5.B 
increment function 380.B 
Ind (large inductive dimension) 117.B 
ind (small inductive dimension) 117.B 
indecomposable A-module 277.1 
indecomposable continuum 79.D 
indecomposable group 190.L 
indecomposable linear representation 362.C 
indecomposable vector bundle 16.Y 
indefinite D-integral 100.D 
indefinite Hermitian form 348.F 
indefinite integral 198.B 
in Lebesgue integral 221.D 
in Riemann integral 216.C 
indefinite quadratic form 348.C 
indefinite sum (of a function) 104.B 
in degree 186.B 
independence, number of 186.I 
independence theorem (on valuations) 439.G 
independent 
(axioms) 35.B 
(complexes) 70.B 
(differential operators) 324.C 
(events) 342.B 
(frequency) 126.L 
(partitions) 136.E 
(points) 7.A 


Subject Index 
Independent increments, process with 


(random variables) 342.C 
algebraically 149.K 369.A 
analytically (in a complete ring) 370.A 
g- (partitions) 136.E 
linearly 2.E 256.C,E 277.G 
path 346.G 
independent increments, process with 5.B 
independent of the past history 406.D 
independent process 136.E 
independent set 66.G 186.1 
independent system, maximal (of an additive group) 
2.E 
independent variable 165.C 
independent vector 66.F 
indeterminacy, set of points of (of a proper 
meromorphic mapping) 23.D 
indeterminate 369.A 
in the algebraic sense 337.C 
indeterminate coefficients, Lagrange’s method of 
106.L 
indeterminate form, limit of 106.E 
indeterminate system (of algebraic equations) 10.A 
Ind,a 297.G 
index 
(of a central simple algebra) 29.G 
(of a critical point) 279.B,E 286.N 
(of a divisor) 3.D 
(of an e-Hermitian form) 60.0 
(ofan eigenvalue) 217.F 
(of a Fredholm mapping) 286.E 
(of a Fredholm operator) 68.F 251.D 
(ofa manifold) 56.G 
(ofa number) 297.G 
(of an orthogonal array) 102.L 
(of a quadratic form) 348.E 
(of a recursive function) 356.F 
(of the Riemann-Hilbert problem) 217.J 
(of a stable distribution) 341.G 
(of a stable process) 5.F 
(ofasubgroup) 190.C 
analytic (of an elliptic complex) 237.H 
analytic (of an elliptic differential operator) 
237.H 
contravariant (of a component of a tensor) 
256.3 
covariant (of a component of a tensor) 256.J 
cycle 66.E 
deficiency (of a closed symmetric operator) 
251.1 
deficiency (of a differential operator) 112.1 
degeneracy 17.C 
differential (in a covering of a nonsingular 
curve) 9I 
dummy (of a tensor) 256.J 
fixed-point (of a continuous mapping) 153.B 
of inertia (of a quadratic form) 348.E 
Keller-Maslov 274.C 
Kronecker 201.H 
multi- 112.A 
o-speciality (of a divisor) 9.F 
p- (on a central simple algebra) 29.G 
ramification (of an algebroidal function) 
17.C 
ramification (of a finite extension) 257.D 
ramification (of a prime ideal) 14.I 
ramification (of a valuation)  439.I 
ramification, relative (of a prime ideal) 14.I 
of relative nullity 365.D 
Schur (of a central simple algebra) 29.E 
Schur (of an irreducible representation) 362.F 
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of a singular point (of a continuous vector field) 
153.B 
speciality (of a divisor) 9.C 15.D 
topological (of an elliptic complex) 237.H 
of total isotropy (of a quadratic form) 348.E 
indexing set (of a family of elements) 381.D 
index set 
(of a balanced array) 102.L 
(of a family) 165.D 381.D 
index theorem 
Atiyah-Singer 237.H 
for differentiable manifolds 56.G 
Hirzebruch (for differentiable manifolds) 56.G 
of Hodge 15.D 
Morse 279.F 
Indian mathematics 209 
indicator function 
modified 341.C 
of a subset 342.E 
indicatrix 
Dupin 111.H 
spherical (of a space curve) 111.F 
indicial equation 254.C 
indirect least squares method 128.C 
indirect transcendental singularity (of an analytic 
function) 198.P 
indiscrete pseudometric space 273.B 
indiscrete topology | 425.C 
individual 411.H 
individual domain 411.H 
individual ergodic theorem  136.B 
individual risk theory 214.C 
individual symbol 411.H 
individual variables 411.H 
indivisibilis 265 
induced 
(Cartan connection) 80.0 
(unfolding) 51.D 
induced bundle 147.G 
induced homomorphism 202.K 
induced module 277.L 
induced representation 
(of a finite group) 362.G 
(of a unitary representation) 437.0 
induced topology 425. 
induced von Neumann algebra 308.C 
induction 
(of a von Neumann algebra) 308.C 
axiom of mathematical 294.B 
complete 294.B 
double mathematical 294.B 
magnetic 130.A 
mathematical 294.B 
multiple mathematical 294.B 
transfinite (in a well-ordered set) 311.C 
induction equation 259 
inductive dimension 
small 117.B 
large 117.B 
inductive limit 
(in a category) 210.D 
(of an inductive system of sets) 210.B 
(of a sequence of topological spaces) 425.M 
(of sheaves) 383.I 
strictly (of a sequence of locally convex spaces) 
424.W 
inductive limit and projective limit 210 
inductive limit group  210.C 
inductive limit space 210.C 
inductively ordered set 34.C 
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inductive system 
{in a category) 210.D 
{of groups) 210.C 
(of sets) 210.B 
of topological spaces 210.C 
inelastic (scattering) 375.A 
inequality (inequalities) 211, App. A, Table 8 
absolute 211.A 
Bells 351.L 
Bernshtein (for trigonometric polynomials) 
336.C 
Bessel 197.C 
Bhattacharyya 399.D 
Bunyakovskii 211.C, App. A, Table 8 
Carleman App. A, Table 8 
Cauchy 211.C, App. A, Table 8 
Cauchy-Schwarz 211.C, App. A, Table 8 
Chapman-Robbins-Kiefer 399.D 
Chebyshev 342.C 
conditional 211.A 
correlation 212.A 
Cramér-Rao 399.D 
distortion 438.B 
energy 325.C 
Fisher 102.E 
FKG 212.A 
Garding 112.G 
GHS 212.A 
GKS first 212.A 
GKS second 212.A 
Golden-Thompson 212.B 
Griffiths's first 212.A 
Griffiths’s second 212.A 
Grunsky 438.B 
Hardy App. A, Table 8 
Hardy-Littlewood-Sobalen 224.E 
Haussdorff-Young 224.E 
Hilbert App. A, Table 8 
Holder 211.C, App. A, Table 8 
Hólder integral 211.C 
isoperimetric 228.B 
Jordan App. A, Table 8 
Klein 212.B 
Markov (for polynomials) 336.C 
maximal (maximal ergodic lemma)  136.B 
Minkowski 211.C, App. A, Table 8 
Morse 279.D,E 
Peierls-Bogolyubov 212.B 
Powers-Stormer 212.B 
quasivariational 440.D 
Riemann's period 3.L 
Riemann-Roch (on algebraic surfaces) 15.D 
Roepstorff-Araki-Sewell 402.G 
Roepstorff-Fannes-Verbeure 402.G 
Schwarz 211.C 
stationary variational 440.B 
triangle 273.A 
variational, of evolution 440.C 
von Neumann 251.M 
Wirtinger App. A, Table 8 
Wolfowitz 399 
Young 224.E, App. A, Table 8 
inertia 
ellipsoid of 271.E 
index of (of an a quadratic form) 348.B 
law of 271.A 
law of, Sylvester (on a quadratic form) 348.B 
moment of 271.E 
principal axis of 271.E 
principal moment of 271.E 


Subject Index 
Infinitesimal transformation 


product of 271.E 
inertia field (of a prime ideal) 14.K 
inertia group 14.K 257.D 
inertial system 271.D 359 
inf (infimum) 311.B 
inference 
rule of 411.1 
Statistical 401 
statistical, graphical method of 19.B 
inferior limit 
(of a sequence of real numbers) 87.C 
(of a sequence of subsets of a set) 270.C 
inferior limit event 342.B 
infimum 
(of an ordered set) 311.B 
(of a subset of a vector lattice) 310.C 
infinite, purely (von Neumann algebra) 308.E 
infinite branch (of a curve of class CH) 93.G 
infinite cardinal number | 49.A 
infinite classical group 147.1 202. V 
infinite continued fraction 83.A 
infinite determinant (in Hill's method of solution) 
268.B 
infinite-dimensional complex projective space 56.C 
infinite-dimensional linear space 256.C 
infinite-dimensional normal space 117.B 
infinite-dimensional real projective space 56.B 
infinite Grassmann manifold 147.1 
infinite group 190.C 
infinite height (element of an Abelian p-group) 2.D 
infinite interval 355.C 
infinite lens space 91.C 
infinitely differentiable (function) 106.K 
infinitely divisible distribution 341.G 
infinitely recurrent (measurable transformation) 
136.C 
infinite matrix 269.K 
infinite order (of an element in a group) 190.C 
infinite orthogonal group 202.V 
infinite population 401.E 
infinite prime divisor 439.H 
imaginary 439.H 
rea] 439.H 
infinite product 379.G, App. A, Table 10. VI 
divergent 379.G 
infinite product expansion, Euler's 436.B 450.B 
infinite sequence 165.D 
infinite series 379.A, App. A, Table 10.III 
infinite set 49.F 381.A 
countably 49.A 
infinitesimal 
(for a function) 87.G 
(for a hyperreal number) 293.D 
(for a sequence of random variables) 250.B 
order of (of a function) 87.G 
infinitesimal birth rate 260.G 
infinitesimal calculus (in nonstandard analysis) 
293.D 
infinitesimal death rate 260.G 
infinitesimal deformation to the direction 0/0s 
72.G 
infinitesimal element (in nonstandard Hilbert space) 
276.E 
infinitesimal generator (of a semigroup) 378.B 
infinitesimal motion (of a Riemannian manifold) 
364.E 
infinitesimal real number 276.E 
infinitesimal transformation 
(of a differentiable transformation group) 
431.G 


Subject Index 
Infinitesimal wedge 


(of a one-parameter transformation group) 
105.N 
infinitesimal wedge 125.V 
infinite Stiefel manifold 147.1 
infinite symplectic group 202.V 
infinite type (Lie algebra) 191.D 
infinite type power series space 168.B 
infinite unitary group 202.V 
infinity 87.D,G 
axiom of 33.B 381.G 
axiom of strong 33.E 
hyperplane at (in affine geometry) 7.B 
minus 87.D 
negative 87.D 355.C 
order of (of a function) 87.G 
pls $87.D 
point at 7.B 74.D 178.F 285.C 
positive. 87.D 355.C 
regular at the point at 193.B 
space at (in affine geometry) 7.B 
inflation 200.M 
inflection, point of 9.B 93.G 
influence, domain of 325.B 
influence curve 371.1 
information 
Fisher 399.D 
limited (in maximum likelihood method) 
128.C 
loss of 138.B 
mutual 213.E 
self- 213.B 
information bit 63.C 
information compression 96.B 
information matrix 102.1 399.D 
information number, Kullback-Leibler (K-L) 
398.G 
information retrieval 96.F 
information retrieval system 96.F 
information sciences 75.F 
information set 173.B 
information source 213.A 
ergodic 213.C 
information theory 213 
informatiques 75.F 
informative, more (statistical experiment) 398.G 
infra-exponential growth 125.AA,BB 
infrared divergence 132.C 146.B 
ingoing subspaces 375.H 
inhomogeneous 
(boundary value problem) 315.B 
(difference equation) 104.C 
(linear ordinary differential equation) 252.A 
(system of linear differential equations) 252.G 
inhomogeneous coordinates 343.C 
inhomogeneous lattice (in R") 182.B 
inhomogeneous Lorentz group 359.B 
inhomogeneous polarization 3.G 
initial blocks 102.E 
initial-boundary value problem 325.K 
initial condition 
(for ordinary differential equations) 316.A 
(for partial differential equations) 321.A 
initial data 321.A 
initial distribution 
(of a Markov process) 261.A 
(for a stochastic differential equation) 406.D 
initial function (of a functional-differential 
equation) 163.C 
initial law (for a stochastic differential equation) 
406.D 
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initial number 312.D 
initial object 52.D 
initial ordinal number 49.E 
initial phase (of a simple harmonic motion) 318.B 
initial point 
(of a curvilinear integral) 94.D © 
(ofa path) 170 
(of a position vector) 7.A 
(of a vector) 442.A 
initial set 
(ofa correspondence) 358.B 
(ofa linear operator) 251.E 
initial state 31.B 
initial surface 321.A 
initial term (of an infinite continued fraction) 
83.A 
initial value 
(for an ordinary differential equation) 316.A 
(for a partial differential equation)  321.A 
(for a stochastic differential equation) 406.D 
initial value problem 
(for functional-differential equations) 163.D 
(for hyperbolic partial differential equations) 
App. A, Table (SIT 
(for integrodifferential equations) 222.B 
(for ordinary differential equations) 313.C 
316.A 
(for partial differential equations) 321.A 
Navier-Stokes 204.B 
singular (for partial differential equations of 
mixed type) 326.C 
initial vertex 186.B 
injection 381.C 
(in a category) 52.D 
(homomorphism of cohomology groups) 
200.M 
canonical 381.C,E 
canonical (on direct sums of modules) 277.F 
canonical (on free products of group) 190.M 
canonical (from a subgroup) 190.D 
natural (from a subgroup) 190.D 
injective ` ` 
(Banach space) 37.M 
(C*-algebra) 36.H 
(mapping) 381.C 
(object in an Abelian category) 200. 
injective A-module 277.K 
injective class 200.Q 
injective dimension 200.K 
injective envelope 200.1 
injective module, (R, AL 200.K 
injective resolution (in an Abelian category) 200.1 
j- 200.Q 
right (of an A-module) 200.F 
injectivity, rational 200.0 
injectivity radius 178.C 
inner area 216.F 270.G 
inner automorphism(s) 
(ofagroup) 190.D 
(ofa ring) 368.D 
group of (ofa group) 190.D 
group of (of a Lie algebra) 248.H 
inner capacity, Newtonian 48.F 
inner derivation 
(of an associative algebra) 200.L 
(ofa Lie algebra) 248.H 
inner function 43.F 
inner harmonic measure 169.B 
inner measure 270.E 
Lebesgue 270.F 
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inner product 
(in a Hermitian linear space) 256.Q 
(ina Hilbert space) 197.B 
(of hyperspheres) 76.A 
(between a linear space and its dual space) 
256.G 
(in a metric vector space) 256.H 
(of a pair of linear spaces) 424.G 
(of vectors) 256.A 442.B 
Hermitian 256.Q 
inner product space 442.B 
inner solution 112.B 
inner topology (of a Lie subgroup) 249.E 
inner transformation (in the sense of Stoilow) 
367.B 
inner variable 112.B 
inner volume 270.G 
innovation 405.H 
input, Poisson 260.H 
input data, error of 138.B 
inrevolvable oval 89.E 
inscribe (in a sphere) 139.I 
inscribed circle (of a regular polygon) 357.A 
inseparable, purely (rational mapping) 16.1 
inseparable element (of a field) 149.H 
purely 149.H 
inseparable extension (of a field) 149.H 
purely 149.H 
inseparable polynomial 337.G 
inspection 
expected amount of 404.C 
sampling (— sampling inspection) 404.C 
instantaneous state 260.F 261.B 
in-state 150.D 386.A 
instruction 31.B 
single-address 75.C 
insurance 
amount of 214.A 
cost of 214.B 
death 214.B 
mixed 214.B 
survival 214.B 
insured, amount 214.A 
integer(s) 294.C 
algebraic 14.A 
Cartan (of a semisimple Lie algebra) 248.N 
Gaussian 14.U 
p-adic 439.F 
p-adic, ring of 439.F 
rational 294.B 
integer polyhedron 215.C 
integer programming 215 264.C 
integer programming problem 
all 215.A 
mixed 215.A 
pure 215.A 
0-1 215.A 
integrability 
(of multivalued vector functions) 443.I 
strong 443.1 
integrability condition, complete 428.C 
integrable 
(function) 221.B 
{G-structure) 191.A 
(increasing process) 262.D 406.B 
(representation) 437.X 
(in the sense of Riemann) 216.A 
absolutely 216.E,F 
Birkhoff (function) 443.E 
Bochner (function) 443.C 


Subject Index 
Integral(s) 


completely (system of independent 1-forms) 
154.B 428.D 

D- (function) 100.D 

Daniell-Stone (function) 310.1 
Denjoy (in the wider sense) 100.D 
Dunford 443.F 

Gel'fand-Pettis (function) 443.F 
Lebesgue (function) 221.B 

locally, function 168.B 

u- (function) 221.B 

Perron (function) 100.F 

Pettis 443.F 

Riemann (function) 216.A 

scalarly 443.F,I 

square (function) 168.B 

square (unitary representation) 437.M 
termwise (series) 216.B 


uniformly (family of random variables) 262.A 


weakly (function) 443.E 
integrable distribution 125.N 
integrable increasing process 262.D 406.B 
integrable process of bounded variation  406.B 
integrable system 287.A 
integrably homotopic 154.F 
integral(s) 
(of differential forms) 105.T 
(of a distribution with respect to 2) 125.H 
(of a function) 221.B 
(=integrally dependent) 67.1 
(of a Monge-Ampére equation) 278.B 
(of multivalued vector functions) 443.1 
(scheme) 16.D 
Abelian 11.C 
action 80.Q 
Airy App. A, Table 19.1V 
almost (element of a ring) 67.1 
of angular momentum 420.A 
Banach 310. 
Bartle-Dunford-Schwartz 443.G 
Bessel 39.B 
Birkhoff 443.E 
Bochner 443.C 
Bromwich 240.D 322.D, App. A, Table 12.I 
Carson App. À, Table 12.II 
of Cauchy type 198.B 
of the center of mass 420.A 


complete additivity of the (in Lebesgue integral) 


221.C 
complete elliptic App. A, Table 16.1 
complete elliptic, of the first kind 134.B 
complete elliptic, of the second kind 134.C 
conjugate Fourier 160.D 
constant 216.C 
cosine 167.D, App. A, Table 19.II 
curvilinear 94.A 
curvilinear (with respect to a line element) 
94.D 
curvilinear (with respect to a variable) 94.D 
D(x) 100.D 
Daniell-Stone 310.1 
definite App. A, Table 9.V 
definite (of a hyperfunction) 125.X 
definite (in a Riemann integral) 216.C 
definite D. 100.D 
Denjoy 100 
Denjoy (in the restricted sense) 100.D 
Denjoy (in the wide sense) 100.D 
direct 308.G 
direct (of unitary representations) 437.H 
Dirichlet (in Dirichlet problem) 120.F 


Subject Index 
Integral bilinear functional 


Dirichlet (in Fourier's single integral theorem) 
160.B 

double (in Riemann integral) 216.F 

Dunford 251.G 443.F 

elliptic 11.C 134.A, App. A, Table 16.1 

elliptic, of the first kind 134.A 

elliptic, of the second kind 134.A 

elliptic, of the third kind 134.A 

energy 420.A 

Euler, of the first kind 174.C 

Euler, of the second kind 174.C 

exponential 167.D, App. A, Table 19.II 

Feynman 146 

first (of a completely integrable system) 428.D 

Fourier 160.A 

Fresnel 167.D, App. A, Tables 9.V 19.II 

of a function with respect to a volume element 
105.W 

Gauss 338.J 

Gelfand 443.F 

Gel’fand-Pettis 443.F 

harmonic 194.A 

Hilbert’s invariant 

hyperelliptic 11.C 

hypergeometric 253.B 

improper (in Riemann integral) 216.D,E 

incomplete elliptic (of the first kind) 134.B 

indefinite (in Lebesgue integral) 221.D 

indefinite (in Riemann integral) 198.B 216.C 

indefinite D- 100.D 

intermediate App. A, Table 15.III 

intermediate (of a Monge-Ampére equation) 
278.B 

iterated (in Lebesgue integral) 221.E 

iterated (in Riemann integral) 216.G 

Jacobi 420.F 

L- 221.B 


46.C 


with respect to A (of a distribution) 125.H 
Lebesgue 221.B 

Lebesgue-Radon 94.C 

Lebesgue-Stieltjes 94.C 166.C 
logarithmic 167.D, App. A, Table 19.II 
Lommel 39.C 


multiple (in Lebesgue integral) 221.E 
multiple (in Riemann integral) 216.F 
n-tuple (in Riemann integral) 216.F 
over an oriented manifold 105.T 
Pettis 443.F 

Poisson 168.B 193.G 

probability App. A, Table 19.II 
regular first 126.H 

repeated (in Lebesgue integral) 221.E 
repeated (in Riemann integral) 216.G 
Riemann 37.K 216.A 


Riemann lower 216.A 
Riemann-Stieltjes 94.B 166.C 
Riemann upper 216.A 

scalar 443.F,I 

sine 167.D, App. A, Table 19.II 
singular 217J 
overasingular chain 105.T 
spectral 390.D 

Stieltjes 94.B 

stochastic 261.E 406.B 


stochastic, of Stratonovich type 406.C 

surface 94.A,E 

surface (with respect to a surface element) 
94.E 


trigonometric 160.A 


vector 443.A 
of a vector field App. A, Table 3.III 
vector-valued 443.A 

integral bilinear functional 424.R 

integral calculus 216 

integral character (of the homology group of 

a Riemann surface) 11.E 
integral closure (ofa ring) 67.1 
integral cohomology group 201.H 


integral constant 216.C 
integral cosine 167.D 
integral current 275.G 


integral curvature (of a surface) 111.H 
integral curve 
(of a Monge equation) 324.F 
(of ordinary differential equations) 316.A 
integral direct sum 308.G 
integral divisor 
(of an algebraic curve) 9.C 
(of an algebraic number field) 14.F 
(on a Riemann surface) 11.D 
integral domain 368.B 
Noetherian 284.A 
integral element 428.E 
k-dimensional 191.1 
ordinary 428.E 
regular 428.E 
integral equation(s) 217 
Abel 217.L 
associated 217.F 
Fredholm  217.A 
of Fredholm type 217.A 
Hammerstein 217.M 
homogeneous 217.F 
linear 217.A 
nonlinear 217.M 
numerical solution of 217.N 
singular 217J 
transposed 217.F 
Volterra 217.A 
of Volterra type 217.A 
integral exponent 167.D 
integral form 248.W 
integral formula 
Cauchy 198.B 
Poisson 198.B 
Villat App. A, Table 15.VI 
Weyl 2251 
integral function 429.A 
integral g-lattice 27.A 
integral geometry 218 
principal formula of 218.C 
integral homology group 
ofa polyhedron 201.D 
of a simplicial complex 201.C 
integral hypersurface (of a partial differential 
equation) 320.A 
integral ideal (of an algebraic number field) 
integral inequality, Holder 211.C 
integral invariant(s) 219 
absolute 219.A 
Cartan’s relative 219.B 
relative 219.A 
integral kernel 217.A 251.0 
integral left ideal 27.A 
integral logarithm 167.D 
integrally closed 
(n a ring) 67.1 
completely (ring) 67.1 
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integrally closed ring 67.I 
integrally dependent element (ofa ring) 67.1 
integral manifold 428.A,B,D 
k-dimensional 191.1 
ordinary (of a differential ideal} 428.E 
regular (of a differential ideal) 428.E 
singular (of a differential ideal) 428.E 
integral method, summable by Borel’s 379.0 
integral operator 68.N 100.E 250.0 
Calderon-Zygmund singular 217J 251.0 
Fourier 274.C 345.B 
of Hilbert-Schmidt type 68.C 
integral point 428.E,F 
integral quotient (in the division algorithm of 
polynomials) 337.C 
integral representation 362.G,K 
Cauchy 21.C 
Herglotz 43.1 
Laplace-Mehler App. A, Table 18.II 
Schläfli 393.B 
integral right deal 27.A 
integra] sine 167.D 
integral singular homology group  201.E 
integral test, Cauchy (for convergence) 379.B 
integral theorem 
Cauchy 198.A,B 
Fourier double 160.B 
Fourier single 160.B 
stronger form of Cauchy 198.B 
integral transforms 220.A 
integral two-sided p-ideal 27.A 
integral vector 428.E 
integrand 216.A 
integrate 216.A 
(an ordinary differential equation) 313.A 
integrating factor App. A, Table 14.1 
integration 
along a fiber (of a hyperfunction) 274.E 
automatic, scheme 299.C 
contour of (of curvilinear integral) 94.D 
domain of 216.F 
graphical 19.B 
Jacobi's second method of 324.D 
numerical 299 
path of (of curvilinear integral) 94.D 
Romberg 299.C 
integration by parts 216.C 
(on D-integral) 100.G 
(in the Stieltjes integral) 94.C 
integration constant (in a general solution of 
a differential equation) 313.A 
integration formula 
based on variable transformation 299.B 
Gauss (in the narrow sense) 299.A 
Poisson App. A, Table 15.VI 
Villat App. A, Table 15.VI 
integrodifferential equation(s) 163.A 222 
of Fredholm type 222.A 
Prandtl’s 222.C 
of Volterra type 222.A 
Wiener-Hopf 222.C 
intensity, traffic 260.H 
intensive (thermodynamical quantity) 419.A 
interaction 102.H 
interest, assumed rate of 214.A 
interference (of waves) 446 
interior 
Lol an angle) 139.D 155.B 
(of a manifold) 105.B 
(ofa polygon) 155.F 


Subject Index 
Intersect 


(ofa segment) 155.B 
(ofa set) 425.B 
(of a simplex) 70.C 
interior capacity, Newtonian 48.F 
interior cluster set 62.A 
interior field equation 359.D 
interior operator 425.B 
interior point 425.B 
interior problem (in Dirichlet problems) 120.A 


interior product (of a differential form with a vector 


field) 105.Q 
intermediate convergent (of an irrational number) 
83.B 
intermediate field 149.D 
intermediate integrals App. A, Table 15.1II 
of Monge-Ampére equation 278.B 
intermediate-value theorem 84.C 
intermittent structure 433.C 
internal (in nonstandard analysis) 293.B 
internal energy 419.A 
internal irregular point 338.L 
internal law of composition (of a set) 409.A 
internally stable set 186.1 
internally thin set 338.G 
internal product 200.K 
internal space in catastrophe theory (in static 
model) 51.B 
internal state 31.B 
internal symmetry 150.B 
international notation (for crystal classes) 92.B 
international system of units 414.A 
interpolating (for a function algebra) 164.D 
interpolating sequence 43.F 
interpolation 
(of a function) 223, App. A, Table 21 
(of a stationary process) 176.K 395.E 
Chebyshev 223.A 336.J 
inverse 223.A 
Lagrange 223.A 
of operators 224 
spline 223.F 
interpolation coefficient, Lagrange's 223.A 
interpolation formula 223.A 
Bessel App. A, Table 21 
Everett App. A, Table 21 
Gauss App. A, Table 21 
Gauss's backward 223.C 
Gauss's forward 223.C 
Newton App. A, Table 21 
Newton's backward |. 223.C 
Newton’s forward 223.C 
Stirling App. A, Table 21 
interpolation function 223.A 
interpolation method 224.A 
interpolation polynomial 223.A 
Hermite 223.E 
Lagrange 336.G, App. A, Table 21 
Newton 336.G 
trigonometric 336.E 
interpolation problem 43.F 
interpolation scheme, Aitken 223.B 
interpolation space 224.A 
complex 224.B 
real 224.C 
interpolation theorem 224.B,C 
interpolatory formula 299.A 
interquartile range 397.C 
intersect 155.B 
properly (on a variety) 16.G 
transversally 105.L 


Subject Index 
Intersection 


intersection 

(of events) 342.B 

(of projective subspaces) 343.B 

(of sets) 381.B 

(of subspaces of an affine space) 7.A 

complete 16.A 
intersection chart 19.D 
intersection multiplicity (of two subvarieties) 16.Q 
intersection number 

(of divisors) 15.C 

(of homology classes) 65.B 201.0 

(of sheaves) 16.E 

self- 15.C 
intersection product 

(in algebraic varieties) 16.Q 

(in homology theory) 201.0 
intersection property, finite 425.S 
intersection theorem 

(of affine geometry) 7.A 

(of projective geometry) 343.B 

Cantors 273.F 

Krull 284.4 
interval 

(in a Boolean algebra) 42.B 

(in a lattice) 243.C 

(in a vector lattice) 310.B 

(in an ordered set) 311.B 

(in real number space) 355.C 

of absolute stability 303.G 

basic 4.B 

closed 140 355.C 

confidence 399.Q 

of continuity (for a probability distribution) 

341.C 

fiducia! 401.F 

finite 355.C 

infinite 355.C 

open 140355.C 

principle of nested 87.C 

of relative stability 303.G 

supplementary 4.B 

tolerance 399.R 
interval estimation 399.Q 401.C 
interval function 380.A 

additive 380.B 

continuous additive 380.B 
in the large (in differential geometry) 109 
in the small (in differential geometry) 109 
intrablock analysis 102.D 
intransitive (permutation group) 151.H 
intrinsic angular momentum 415.G 
intrinsic homology 114.H 
intuitionism 156.A 


semi- 156.C 
intuitionistic logic 411.L 
invariance 


of a confidence region 399.Q 
of dimension, theorem on (of Euclidean spaces) 
117.D 

of domain, Brouwer theorem on 117.D 

homotopy 201.D 

isospin 351J 

Lorentz 150.B 

of speed of light, principle of 359.B 

topological (homology groups) 201.A 
invariance principle 

(of hypothesis testing) 400.E 

(of wave operators) 375.B 

Donskers 250.E 

Strassen s 250.E 
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invariance theorem of analytic relations 198.K 
invariant(s) App. A, Table 14.III 

(of an Abelian group) 2.B 

(of a cohomology class of a Galois group) 
59.H 257.E 

(decision problem) 398.E 

(element under a group action) 226.A 

(of an elliptic curve) 73.A 

(in the Erlangen program) 137 

(underflow) 126.D 

(function algebra) 164.H 

(hypothesis) 400.E 

(measure) 136.B 225 270.L 

(S-matrices) 386.B 

(of a normal simple algebra) 257.G 

(subspace of a Banach space) 251.L 

absolute 12.A 226.A 

absolute integral 219.A 

almost G- |. 396.1 

Arf-Kervaire 114J 

basic 226.B 

birational 12.A 

Browder-Livesay 114.L 

Cartan (ofa finite group) 362.1 

Cartan relative integral 219.B 

conformal 77.E 

covering linkage 235.E 

differential (on an m-dimensional surface) 
110.A 

Eilenberg-Postnikov (of a CW-complex) 70.G 

fundamental (of a space with a Lie transforma- 
tion group) 110.A 

fundamental differential (of a surface) 110.B 

G-(element) 226.A 

G- (measure) 225.A 

G- (statistics) 396.1 

generalized Hopf 202.Q 

Hasse (of a central simple algebra) 29.G 

homotopy 202.B 

homotopy type 202.F 

Hopf 202.8,U 

Hopf, modulo p 202.8 

integral 219 

isomorphism (on a measure space) 136.E 

Iwasawa 14.L 

k- (of a CW-complex) 70.G 

left (metric in a topological group) 423.1 

left, Haar measure 225.C 

left, tensor field 249.A 

metric (on a measure space) 136.E 

Milnor 235.D 

of n-ary form of degree d 226.D 

negatively 126.D 

normal 114 

oforderp 110.A 

p- (of a central simple algebra) 29.G 

PCT 386.B 

Poincaré's differential 74.G 

positively 126.D 

rearrangement 168.B 

relative 12.A 226.A 

relative integral 219.A 

right, Haar measure 225.C 

right, tensor field 249.A 

sampling procedure 373.C 

semi- 226.A 

semi- (of a probability distribution) 341.C 

shape 382.C 

spectral 136.E 

TCP- 386.B 
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topological 425.G 
U- (subspace of a representation space of 
a unitary representation) 437.C 
uniformly most powerful 399.Q 
vector 226.C 
of weight w 226.D 
invariant decision function 398.E 
invariant derivation (on an Abelian variety) 3.F 
invariant differential form (on an Abelian variety) 
3.F 
invariant distribution(s) 
(ofa Markov chain) 260.A 
(of second quantization) 377.C 
invariant estimator 399] 
best 399] 
invariant field 172.B 
invariant integral, Hilbert’s | 46.C 
invariant level «a test, uniformly most powerful 
(UMP) 400.E 
invariant Markov process 5.H 
invariant measure(s) 225 
(ofa Markov chain) 260.A 
(of a Markov process) 261.F 
(under a transformation) 136.B 
G- 225.B 
quasi- 225J 
relatively 225.H 
smooth 126 
sub- 261.F 
transverse 154.H 
invariant measure problem 136.C 
invariants and covariants 226 
invariant statistic 396.1 
maximal  396.I 
invariant subgroup (of a group) 190.C 
invariant subspace (of a linear operator) 164.H 
doubly 164.H 
invariant tensor field 
left 249.A 
right 249.A 
invariant test 400.E 
almost 400.E 
invariant torus 126.L 
inventory control 227 
inventory model 307.C 
inverse 
(in a group) 190.A 
(ofa mapping) 381.C 
homotopy (for an H-space) 203.D 
quasi- (on a Banach algebra) 36.C 
right (in nonlinear functional analysis) 286.G 
inverse analytic function 198.L 
inverse assumption 304.D 
inverse correspondence 358.B 
inverse domination principle 338.L 
inverse element 
(in a group) 190.A 
(n a ring) 368.B 
left (in a ring) 368.B 
quasi- (in a ring) 368.B 
right (in a ring) 368.B 
inverse Fourier transform (of a distribution) 
125.0 
inverse function 198.L 381.C 
inverse function element 198.L 
inverse image 
(ofa set) 381.C 
(ofa sheaf} 383.G 
(of a uniformity) 436.E 
perfect 425.CC 





Subject Index 
Involutory (involutive) system 


inverse interpolation 223.A 
inverse iteration 298.C 
inverse limit (of an inverse system of sets) 210.B 
inverse mapping 381.C 
inverse mapping theorem  208.B 
inverse matrix 269.B 
inverse morphism  52.D 
inverse operator 37.C 251.B 
inverse path 170 
inverse problem 
(in potential scattering) 375.G 
Jacobi XL 
inverse relation 358.A 
inverse system (of sets) 210.B 
inverse transform (of an integral transform) 220.A 
inverse trigonometric function 131.E 
inversion 
(with respect to a circle) 74.E 
(of a domain in R") 193.B 
(with respect to a hypersphere) 76.A 
Laguerre 76.B 
space 258.A 
space-time 258.A 
inversion formula 
(for a characteristic function) 341.C 
(of a cosine transform) 160.C 
(of a Fourier transform) 160.C 
(of a Fourier transform of distributions) 160.H 
(of a Fourier transform on a locally compact 
Abelian group) 192.K 
(of a generalized Fourier transform) 220.B 
(of a Hilbert transform) 220.E 
(of an integral transform) 220.A 
(of a Laplace-Stieltjes transform) 240.D 
(on a locally compact group) 437.L 
(of a Mellin transform) 220.C 
(for a semigroup of operators) 240.1 
(of a Stieltjes transform) 220.D 
Fourier 160.C 
Möbius (in combinatorics) 66.C 
Mobius (in number theory) 295.C 
inverted filing scheme 96.F 
invertible element 
quasi- 368.B 
ofaring 368.B 
invertible jet 105.X 
invertible knot 235.A 
invertible matrix 269.B 
invertible sheaf 16.E 
involute (of a curve) 111.E 
involution 
(of an algebraic correspondence) 9.H 
(in a Banach algebra) 36.F 
(ofa division ring) 348.F 
(of a homotopy sphere) 114.L 
Cartan 427.X 
involutive 
(cross section) 286.H 
(differential ideal) 428.E 
(differential system) 191.1 
(distribution) 154.B 428.D 
(Lie group) 191.H 
involutive automorphism (of a Lie group) 412.B 
involutive correlation 343.D 
involutive distribution (on a differentiable manifold) 
428.D 
involutive subspace 428.F 
involutory (involutive) system 
(of differential forms) 428.F 
(of nonlinear equations) 428.C 


Subject Index 
Irrational function, elliptic 


(of partial differential equations)  428.F 
(of partial differential equations of first order) 
324.D 
irrational function, elliptic 134.E 
irrational number(s) 294.E 355.A 
space of 22.A 
irrational real number 294.E 
irreducibility theorem, Hilbert's (on polynomials) 
337.F 
irreducible 
(algebraic curve) 9.B 
(algebraic equation) 10.B 
(algebraic variety) 16.A 
(coalgebra) 203.F 
(complemented modular lattice) 243.F 
(continuous geometry) 85.A 
(continuum) 79.D 
(Coxeter complex) 13.R 
(discrete subgroup of a semisimple Lie group) 
122.F 
(germ of an analytic set) 23.B 
(linear representation) 362.C 
(linear system) 16.N 
(linear system in control theory) 86.C 
(3-manifold) 65.E 
(Markov chain) 260.B 
(polynomial) 337.F 
(positive matrix) 269.N 310.H 
(projective representation) 362J 
(representation of a compact group) 69.B 
(Riemannian manifold) 364.E 
(root system) 13.L 
(scheme) 16.D 
(Siegel domain) 384. E 
(transition matrix) 126.J 
(unitary representation) 437.A 
absolutely (representation) 362.F 
at 0 (for an algebraic set) 23.B 
irreducible character 
(of an irreducible representation) 362.E 
absolutely 362.E 
irreducible component 
(of an algebraic variety) 16.A 
(of an analytic space) 23.C 
(of a linear representation) 362.D 
irreducible element (of a ring) 67.H 
irreducible representation(s) 
(ofa Banach algebra) 36.D 


fundamental system of (of a complex semisimple 


Lie algebra) 248.W 
irreducible symmetric bounded domain 412.F 
irreducible symmetric Hermitian space 412.E 
irreducible symmetric Riemannian space 412.C, 
App. A, Table 5.III 

irreducible tensor of rankk 353.C 
irredundant (intersection of primary ideals) 67.F 
irregular 

(boundary point) 120.D 

(prime number) 14.L 
irregularity 

(of an algebraic surface) 15.E 

number of (of an algebraic variety) 16.0 
irregular point 

(of Brownian motion) 45.D 

(of a Markov process) 261.D 

(in potential theory) 338.L 

external 338.L 

internal 338.L 
irregular singular point 

(ofa solution) 254.B 
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(of a system of linear ordinary differential 
equations) 254.B 
irreversible processes, statistical mechanics of 
402.4 
irrotational 
(fluid) 205.B 
(vector field) 442.D 
Irwin's embedding theorem 65.D 
Ising model 340.B 402.G 
stochastic 340.C 
island (in a Riemann surface) 272J 
isobaric polynomial 32.C 
isogenous 
(Abelian varieties) 3.C 
(algebraic groups) 13.A 
isogeny 13.A 
isolated fixed point 126.G 
isolated ordinal number 312.B 
isolated point 
(of a curve) 93.G 
(in a topological space) 425.0 
isolated primary component (of an ideal) 67.F 
isolated prime divisor (of an ideal) 67.F 
isolated singularity (of an analytic function) 
198.D,M 
isolated singular point 198.D 418.D 
isolated vertex 186.B 
isometrically isomorphic (normed spaces) 37.C 
isometric immersion 365.A 
isometric mapping 111.H 273.B 
isometric operator 251.E 
partially 251.E 
isometric Riemannian manifolds 364.A 
isometric spaces 273.B 
isometry 
(=isometric operator) 251.E 
(between Riemannian manifold) 364.A 
isomonodromic deformation 253.E 
isomorphic 
(algebraic systems) 409.C 
(block bundles) 147.Q 
(cohomology theories) 201.Q 
(complex manifolds) 72.A 
(fiber bundles) 147.B 
(groups) 190.D 
(Lie algebras) 248.A 
(Lie groups) 249.N 
(measure spaces) 398.D 
(normed spaces) 37.C 
(objects) 52.D 
(PL-embeddings) 65.D 
(representations) 362.C 
(simplicial complexes) 70.C 
(s.s. complexes) 70.E 
(structures) 276.E 
(topological groups) 423.A 
(unitary representations) 437.A 
anti- (lattices) 243.C 
Borel 270.C 
dually (lattices) 243.C 
isometrically (normed spaces) 37.C 
locally 423.0 


metrically (automorphisms on a measure space) 


136.E 
order 311.E 
similarly (ordered fields) 149.N 


spatially (automorphisms on a measure space) 


136.E 
spectrally 136.E 
weakly 136.E 
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isomorphic mapping, Borel 270.C 
isomorphic relations among classical Lie algebras 
App. A, Table 5.IV 

isomorphism 

(of Abelian varieties) 3.C 

(of algebraic systems) 409.C 

(of block bundles) 147.Q 

(of fields) 149.B 

(of functors) 52J 

(of groups) 190.D 

(of lattices) 243.C 

(of Lie algebras) 248.A 

(of linear spaces) 256.B 

(of objects) 52.D 

(of prealgebraic varieties) 16.C 

(of rings) 368.D 

(of topological groups) 423.A 

(of unfoldings) 51.D 

admissible (of Q-groups) 190.E 

algebra 29.A 

analytic 21.J 

analytic (of Lie groups) 249.N 

anti-(of groups) 190.D 

anti- (of lattices) 243.C 

anti- (of ordered sets) 311.E 

anti- (of rings) 368.D 

birational (of Abelian varieties) 3.C 

birational (of algebraic groups) 13.A 

Bott 237.D 

C®- (of Lie groups) 249.N 

dual (of lattices) 243.C 

dual (of ordered sets) 311.E 

excision (on homology groups) 201.F,L 

functorial 52J 

G- 191A 

k- (of algebraic groups) 13.A 

k- (of extension fields of k) 149.D 

lattice- 243.C 

local (of topological groups) 423.0 

mod p (in a class of Abelian groups) 202.N 

Q- (of O-groups) 190.E 

operator (of O-groups)  190.E 

order 311.E 

ring 368.D 

suspension (for homology) 201.E 

Thom-Gysin 114.G 

Thom-Gysin (of a fiber space) 148.E 

uniform 436.E 
isomorphism invariant (on a measure space) 136.E 
isomorphism problem 

(in ergodic theory) 136.E 

(for graphs) 186.J 

(for integral group algebras) 362.K 
isomorphism theorem l 

(in class field theory) 59.C 

(on groups) 190.D 

(onrings) 368.F 

(on topological groups) 423.J 

Ax-Kochen (on ultraproduct) 276.E 

first (on topological groups) 423.J 

Hurewicz 202.N 

Hurewicz-Steenorod (on homotopy groups of 

fiber spaces) 148.D 

Keisler-Shelah (in model theory) 276.E 

second (on topological groups) 423J 

third (on topological groups) 423.J 
isoparametric (hypersurface) 365.I 
isoparametric method 304.C 
isoperimetric (curves) 228.A 
isoperimetric constant 391.D 


Subject Index 
Iwasawa decomposition 


isoperimetric inequality 228.B 
isoperimetric problem(s) 111.E 228.A 
generalized 46.A 228.A 
special 228.A 
isospectral 391.B 
isospectral deformation 387.C 
isospin 351J 
isospin invariance 351.J 
isothermal compressibility 419.B 
isothermal coordinates 90.C 
isothermal curvilinear coordinate system App. A, 
Table 3.V 
isothermal parameter 334.B 
(for an analytic surface) 111.1 
isothermal process 419.B 
isotopic 65.D 202.B 
(braids) 235.F 
(embeddings) 114.D 
(latin square) 241.A 
ambient 65.D 
isotopy 65.D 202.B 
ambient 65.D 
isotopy lemma, Thom’s first 418.G 
isotopy type (of knots) 235.A 
isotropic 
(with respect to a quadratic form) 348.E 
k- (algebraic group) 13.G 
totally (subspace) 60.0 348.E 
isotropic point 365.D 
isotropic submanifold 365.D 
isotropic turbulence 433.C 
isotropy, index of total (of a quadratic form) 348.E 
isotropy group 362.B 
linear 199.A 
principal 431.C 
isotropy representation 431.C 
isotropy subgroup (of a topological group) 431.A 
isotropy type (of a transformation group) 431.A 
iterated integral 
(in Lebesgue integral) 221.E 
(in Riemann integral) 216.G 
iterated kernel (for a Fredholm integral equation) 
217.D 
iterated logarithm 
Khinchin’s law of 250.C 
law of 45.F 
iterated series 
by columns (of a double series) 379.E 
by rows (of a double series) 379.E 
iteration 
(in a Banach space) 286.B 
inverse 298.C 
method of successive (for Fredholm integral 
equations) 217.D 
two-body 271.C 
iteration matrix |. 302.C 
iterative improvement | 302.C 
iterative method 302.C 
iterative process, linear stationary 302.C 
Itó circle operation 406.C 
Itó decomposition, Wiener- 176.1 
Itó formula 45.G 406.B 
Itó process 406.B 
Itó theorem, Lévy- (on Lévy processes) 5.E 
Itó type, stochastic integral of 406.C 
ITPFI 308I 
Iversen-Beurling-Kunugi theorem 62.B 
Iversen theorem 272.1 
Iwahori subgroup 13.R 
Iwasawa decomposition 


Subject Index 
Iwasawa group 


(ofa Lie group) 249.T 
(of a real semisimple Lie algebra) 248.V 
Iwasawa group 384.C 
Iwasawa invariant 14.L 
Iwasawa main conjecture 450 
Iwasawa theorem, Cartan-Mal'tsev- (on maximal 
compact subgroups) 249.8 


J-group 237.1] 
equivariant 431.F 
J-homomorphism 202.V 237.1 
equivariant 431.F 
J-method 224.C 
Jackson’s theorem (on the degree of approximation) 
336.C 
Jacobi, C. G. J. 229 
Jacobian 208.B 
Jacobian, generalized (of a set function) 246.H 
Jacobian criterion (on regularity of local rings) 
370.B 
Jacobian determinant 208.B 
Jacobian matrix 208.B 
Jacobian variety 9.E 11.C 16.P 
canonically polarized 3.G9.E 
generalized 9.F 11.C 
Jacobi-Biehler equality 328 
Jacobi condition 46.C 
Jacobi differential equation App. A, Tables 14.II 
20.V 
Hamilton- 271.F 324.E 
Jacobi elliptic functions App. A, Table 16.III 
Jacobi equation, Hamilton- 108.B 
Jacobi field 178.A 
Jacobi identity 
(on the bracket of two vector fields) 105.M 
(in a Lie algebra) 248.A 
(with respect to Whitehead product) 202.P 
Jacobi imaginary transformation 134] 
Jacobi integral 420.F 
Jacobi inverse problem 3. 
Jacobi last multiplier App. A, Table 14.1 
Jacobi matrix 390.G 
Jacobi method 
(in numerical computation of eigenvalues) 
298.B 
(in numerical solution of linear equations) 
302.C 
cyclic 298.B 
threshold 298.B 
Jacobi polynomial 317.D, App. A, Table 20.V 
Jacobi second method of integration 324.D 
Jacobi symbol 297. 
complementary law of 297.1 
law of quadratic reciprocity of 297.1 
Jacobi standard form, Legendre- 134.A, App. A, 
Table 16.1 
Jacobi transformation App. A, Table 16.III 
Jacobson radical (of a ring) 67.D 
Jacobson topology 36.D 
James theorem 37.G 
Janko-Ree type, group of 151.J 
Janzen area (of a Borel set) 246.G 
Japanese mathematics (wasan) 230 
Japanese ring, universally 284.F 
Jarrat-Mack method, Garside- 301.N 
Jeffreys method 112.B 
Jensen formula 198.F 
Jensen measure 164.K 
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jet 
invertible 105.X 
oforderr 105.X 
job 281.D 
job-shop scheduling 307.C 
job-shop scheduling problem 376 
John-Nirenberg space (= BMO) 168.B 
join 
(in a Boolean algebra) 42.A 
(in a lattice) 243.A 
(of points) 155.B 
(of projective spaces) 343.B 
(of sets) 381.B 
(of simplicial complexes) 70.C 
(of subgroups of a group) 190.G 
reduced (of homotopy classes) 202.Q 
reduced (of mappings) 202.F 
reduced (of topological spaces) 202.F 
joined by an arc 79.B 
joint cumulant 397.1 
joint density 397.1 
joint distribution 342.C 
joint moment generating function |. 397.1J 
joint random variable 342.C 
joint sensity function 397.J 
joint spectrum 36.M 
Jordan algebras 231 
exceptional 231.A 
free special 231.A 
semisimple 231.B 
special 231.A 
Jordan arc 93.B 
Jordan canonical form (of a matrix) 269.G 
Jordan content 270.G 
Jordan curve 93.B 
Jordan curve theorem 93.K 
Jordan decomposition 
(of an additive set function) 380.C 
(of a function of bounded variation) 166.B 
(of a linear mapping) 269.L 
(in an ordered linear space) 310.B 
multiplicative 269.L 
Jordan domain 333.A 
Jordan elimination, Gauss- 302.B 
Jordan factorial 330 
Jordan Holder sequence (in a group) 190.G 
Jordan-Hólder theorem (in group theory) 
190.G 
Jordan-Hólder theorem (on representations of 
algebras) 362.D 
Jordan homomorphism (between Jordan alge- 
bras) 231.A 
Jordan inequality App. A, Table 8 
Jordan measurable set (of R”)  270.G 
Jordan measure 270.D,G 
Jordan module 231.C 
Jordan normal form 269.G 
Jordan test (on the convergence of Fourier series) 
159.B 
Jordan-Zassenhaus theorem (on integral represen- 
tation of a group) 362.K 
Joule heat  130.B 
Joule’s law  130.B 
Julia’s direction (of a transcendental entire function) 
272.F 429.C 
Julia’s exceptional function 272.F 
jump (at a point) 84.B 
jump function 306.C 
jumping of the structures 72.G 
just identified (equation) 128.C 
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K 


K-recursiveness 356.G 
k-almost simple algebraic group 13.0 
k-anisotropic algebraic group 13.G 
k-array 330 
k-Borel subgroup (of an algebraic group) 13.G 
k-closed algebraic set 13.A 
k-combination 330 
k-compact algebraic group 13.G 
k-connect (graph) 186.F 
k-dimensional integral element 191.1 
k-dimensional integral manifold 191.1 
k-dimensional normal distribution 341.D 
k-equivalent C^-manifolds 114.F 
k-Erlang distribution 260.H 
k-fold mixing automorphism 136.E 
k-fold screw glide with pitch 92.E 
k-form 
(of an algebraic group) 13.M 
holomorphic 72.A 
k-frame 199.B 
orthogonal 199.B 
k-group 13.A 
k-invariants (of a CW-complex) 70.G 
k-isomorphism (between algebraic groups) 13.A 
k-isotropic algebraic group  13.G 
k-morphism (between algebraic groups) 13.A 
k-movable 382.C 
k-permutation 330 
k-ply transitive G-set |. 362.B 
k-ply transitive permutation group 151.H 
k-quasisplit algebraic group 13.0 
k-rank (of a connected reductive algebraic group) 
13.Q 
k-rational divisor (on an algebraic curve) 9.C 
k-rational point (of an algebraic variety) 16.A 
k-root 13.Q 
k-sample problem 371.D 
k-simple algebraic group 13.0 
k-solvable algebraic group 13.F 
k-space 425.CC 
k-split algebraic group 13.N 
k-split torus 13.D 
maximal 13.Q 
k-step method, linear 303.E 
k-subgroup 
minimal parabolic 13.Q 
standard parabolic 13.Q 
k-subset 330 
k-transitive permutation group 151.H 
k-trivial torus 13.D 
k-valued algebroidal function 17.A 
k-vector bundle, normal 114.J 
k-way contingency table 397.K 
k-Weyl group 13.Q 
k’-space 425.CC 
kth prolongation 
(of G structure) 191.E 
(of a linear Lie algebra) 191.D 
(of a linear Lie group) 191.D 
kth saturated model 293.B 
kth transform 160.F 
K-complete analytic space 23.F 
K-complete scheme 16.D 
K-flow 136.E 
K-group 237.B 
equivariant 237.H 
K-method 224.C 
K-pseudoanalytic function 352.B 


Subject Index 
Kernel 


K-quasiregular function 352.B 
K-rational (=algebraic over KI 369.C 
K-regular measure 270.F 
K-theory 237 
algebraic 237 
higher algebraic 237J 
K3 surface 15.H 72.K 
marked 72.K 
Kac formula, Feynman- 351.F 
Kac- Nelson formula, Feynman- 150.F 
Kadomtsev-Petvyashvili equation 387.F 
Kahler existence theorem, Cartan- 191.] 
428.E 
Kahler homogeneous space 199.A 
Kahler immersion | 365. 
Káhler manifold 232 
Káhler metric 232.A 
standard (of a complex projective space) 
232.D 
Kahler metric, Einstein- 232.C 
Kahler submanifold 365.L 
Kakeya-Enestrom theorem (on an algebraic 
equation) 10.E 
Katutani equivalence 136.F 
Kakutani fixed point theorem 153.D 
Kakutani theorem 
(on complemented subspace problems of 
Banach spaces) 37.N 
(on statistical decision problems) 398.G 
Kakutani unt 310.G 
Kallén-Lehmann representation 150.D 
Kallén-Lehmann weight 150.D 
Kalman-Bucy filter 86.E 405.G 
Kalman filter 86.E 
Kaluza’s 5-dimensional theory 434.C 
Kametani theorem, Hällström- 124.C 
Kan complex 70.F 
Kaplansky’s density theorem 308.C 
Kapteyn series 39.D, App. A, Table 19.III 
Karlin's theorem 399.G 
Kastler axioms, Haag- 150.E 
Kato perturbation 351.D 
Kato theorem, Rellich- 331.B 
Kawaguchi space 152.C 
KdV equation 387.B 
two-dimensional 387.F 
Keisler-Shelah isomorphism theorem 276.E 
Keller-Maslov index 274.C 
Kelly theorem, Nachbin-Goodner- 37.M 
Kelvin function 39.G, App. A, Table 19.1V 
Kelvin transformation 193.B 
Kendall notation 260.H 
Kendall's rank correlation 371.K 
Kepler’s equation 309.B 
Kepler’s first law 271.B 
Kepler’s orbital elements 309.B 
Kepler’s second law | 271.B 
Kepler’s third law 271.B 
Kerékjártó-Stoilow compactification 207.C 
kernel 
(of a bargaining set) 173.D 
(distribution) 125.L 
(of a group homomorphism) 190.D 
(of an integral equation) 217.A 
(of an integral operator) 251.0 
(of an integral transform) 220.A 
(ofalinear mapping) 256.F 
(ofa morphism) 52.N 
(of an operator homomorphism) 277.E 
(of a potential) 338.B 


Subject Index 
Kernel differential 


(ofaset) 425.0 
(of a sheaf homomorphism) | 383.C 
(of a system of Suslin)  22.B 
adjoint 338.B 
of Calderón-Zygmund type 217J 
of Carleman type 217J 
consistent 338.E 
degenerate 217.F 
diffusion 338.N 
Dirichlet 159.B 
distribution 338.P 
domain (of a sequence of domains) 333.C 
elementary 320.H 
Fejér 159.C 
Fourier 220.B 
fundamental 320.H 
Hermitian 217.H 
of Hilbert-Schmidt type 217.1 
Hunt 338.0 
integral 217.A 251.0 
iterated 217.D 
Kuramochi 207.C 
Martin 207.C 
perfect (in potential theory) 338.E 
Pincherle-Goursat 217.F 
Poisson 159.C 
positive 217.H 
positive definite 217.H 338.D 
positive semidefinite 217.H 
reproducing 188.G 
semidefinite 217.H 
separated 217.F 
singular 217J 
symmetric 217.G 
symmetric, of positive type 338.D 
weak potential 260.D 
Wiener 95 
kernel differential 188.G 
kernel form 348.E 
kernel function 188.G 
Bergman 188.G 
harmonic 188.H 
Szego 188.H 
kernel representation (of a Green's operator) 
189.B 
kernel theorem (of Schwartz) 125.L 424.S 
Kerner-Aberth (DKA) method, Durand- 301.F 
Kerner (DK) method, Durand- 301.F 
Kerr metric 359.E 
Kervaire invariant, Arf- 114.J 
Khachiyan’s method 255.C 
Khinchin canonical form (of an infinitely divisible 
probability distribution) 341.G 
Khinchin decomposition (of a covariance function) 
395.B 
Khinchin’s law of the iterated logarithm 250.C 
Kiefer inequality, Chapman-Robbins- 399.D 
killing 261.F 
Killing curvature, Lipschitz- 279.C 
Killing differential equation 364.F 
Killing form 248.B 
killing measure 115.B 
killing method (of obtaining a homotopy group) 
202.N 
killing time 260.A 
Killing vector field 364.F 
kind 
Abelian differential of the first, second, third 
11.C 
Abelian integral of the first, second, third 11.C 
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associated Legendre function of the first, second 
393.C 
Beltrami differential operator of the first, second 
App. A, Table 4.II 
Bessel function of the first, second, third 39.B 
complete elliptic integral of the first, second 
134.B,C 
differential form of the first (on an algebraic 
variety) 16.0 
canonical coordinates of the first, second 
249.Q 
dfferential form of the first, second, third (on a 
nonsingular curve) 9.E 
discontinuity of the first, second 84.B 
discontinuous group of the first 122.B 
elliptic function of the first, second, third 
134.G,H 
elliptic integral of the first, second, third 134.A 
error of the first, second 400.A 
Euler integral of the first, second 174.A,C 
exceptional curve of the first, second 15.G 
Fredholm integral equation of the first, second, 
third 217.A 
Fuchsian group of the first, second 122.C 
Hankel function of the first, second 39.B 
incomplete elliptic integral of the first 134.B 
Lamé function of the first, second 133.C 
Legendre function of the first, second 393.B 
Mathieu function of the first, second 268.B 
modified Mathieu function of the first, second, 
third 268.D 
perfect members of the second 297.D 
Stirling number of the second 66.D 
kinetic density 218.A 
kinetic energy 271.C 351.D 
kinetic measure 218.A 
kinetic theory of gases 402.B 
Kirby calculus 114.L 
Kirchhoff laws |. 282.B 
Kirchhoff solution 325.D 
Kleene's normal form theorem 356.C 
Klein, F. 233 
Klein bottle 410.B 
Klein combination theorem 234.D 
Kleinfour-group | 151.G 
Klein-Gordon equation 351.G 377.C 
Kleinian group 122.C 234.A 
Klein inequality 212.B 
Klein line coordinates 90.B 
Klein model (of non-Euclidean geometry) 285.C 
Klein-Nishina formula 351.G 
Klein transform 150.D 
K-L (= Kullback-Leibler) information number 
398.G 
Kloosterman sum 32.C 
KMS condition 308.H 402.G 
Kneser-Nagumo theorem 316.E 
Kneser-Sommerfeld formula App. A, Table 19.III 
Knopp-Schmidt theorem 208.C 
Knopp-Schnee theorem (on method of summation) 
379.M 
knot 235.A 
alternating 235.A, App. A, Table 7 
amphicheiral 235.A 
clover leaf 235.C 
equivalent 235.A 
hyperbolic 235.E 
invertible 235.A 
(pq- 235G 
(p,q)-ball 235.G 
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slice 235.G 
trefoil 235.C 
knot cobordism 235.G 
knot complement conjecture 235.B 
knot conjecture, general 235.B 
knot group (of a knot) 235.B 
knot projection, regular 235.A 
knotted 235.A 
knot theory 235, App. A, Table 7 
knot types 235.A 
Kobayashi pseudodistance 21.0 
Kochen isomorphism theorem, Ax- 276.E 
Kodaira dimension (of a compact complex manifold) 
72.1 
Kodaira-Spencer mapping (map) 72.G 
Kodaira theorem 
(on Hodge manifolds) 232.D 
Chow- 72.F 
Kodaira theory, Weyl-Stone-Titchmarsh- 
Kodaira vanishing theorem 232.D 
Koebe extremal function 438.C 
Koebe theorem 193.E 
Kojima-Schur theorem (on linear transformations 
of sequences) 379.L 
Kolchin theorem, Lie- (on solvable algebraic groups) 
13.F 
Kollektiv 342.A 
Kolmogorov automorphism  136.E 
Kolmogorov axiom Ty 425.Q 
Kolmogorov backward equation 115.A 260.F 
Kolmogorov canonical form 341.G 
Kolmogorov-Chaitin complexity 354.D 
Kolmogorov equality, Chapman- 261.A 
Kolmogorov equation, Chapman- 260.A 
Kolmogorov extension theorem 341.1 407.D 
Kolmogorov flow (pt 
Kolmogorov forward equation 115.A 260.F 
Kolmogorov-Smirnov test 371.F 
Kolmogorov-Smirnov test statistic 374.E 
Kolmogorov space 425.Q 
Kolmogorov-Spanier cohomology theory, Alex- 
ander- 201.M 
Kolmogorov spectrum 433.C 
Kolmogorov test 45.F 
Kolmogorov theorem 250.F 
Kolmogorov zero-one law 342.G 
Kond6 uniformization theorem 22.F 
Kónig-Egerváry theorem 281.E 
Kónigs-Schróder equation 44.B 
Korteweg-de Vries (KdV) equation 387.A 
Kostant’s formula (on representations of compact 
Lie groups) 248.Z 
Kovalevskaya existence theorem, Cauchy- 321.A 
Kovalevskaya theorem, abstract Cauchy- |. 286.Z 
Krein-Mil’man property 443.H 
Krein-Mil'man theorem 424.U 
Krein-Schmul'yan theorem 37.E 424.0 
Krein theorem 424.V 
Krieger’s factor 308.1 
Kronecker, L. 236 
Kronecker approximation theorem 422.K 
Kronecker delta 269.A, App. A, Table 4.1] 
Kronecker flow 136.G 
Kronecker index 
(in homology theory) 201.H 
(of divisors on a surface) 15.C 
Kronecker limit formula 450.B 
Kronecker product (of matrices) 269.C 
Kronecker set 192.R 
Kronecker symbol (for a quadratic field) 347.D 


112.0 


Subject Index 
L-space 


Kronecker theorem 

(on an Abelian extension of Q) 14.L 

(on an algebraic equation) 10.B 
Krull-Akizuki theorem 284.F 
Krull altitude theorem 28.A 
Krull-Azumaya lemma 67.D 
Krull dimension 

(of a commutative ring) 67.E 

(of an ideal) 67.E 
Krull intersection theorem 284.A 
Krull-Remak-Schmidt theorem (in group theory) 

190.L 

Krull ring 67J 
Krull topology (for an infinite Galois group) 172.1 
Kruskal coordinates 359.F 
Kruskal-Wallis test 371.D 
Kubo formula 402.K 
Kuhn-Tucker theorem 292.B 
Kullback discrimination information 213.D 
Kullback-Leibler (K-L) information number 398.G 
Kummer criterion 145, App. A, Table 10.II 
Kummer differential equation App. A, Table 19.1 
Kummer extension 172.F 
Kummer function 167.A, App. A, Table 19.1 
Kummer surface 15.H 
Künneth formula 

(in Abelian category) 200.H 

(in Weil cohomology) 450.Q 
Künneth theorem 200.E 201.J 
Kunugi theorem, Iversen-Beurling- 62.B 
Kuo (PLK) method, Poincaré-Lighthill- 25.B 
Kuramochi compactification 207.C 
Kuramochi kernel 207.C 
Kuranishi prolongation theorem 428.G 
Kuranishi space 72.G 
Kuratowski space 425.Q 
kurtosis 396.C 397.C 

population 396.C 
Kutta-Gill method, Runge- 303.D 
Kutta methods, Runge- 303.D 


L 


A‘ (Lipschitz spaces) 168.B 
A-function 32.C 
(lj) or l, (a sequence space) 168.B 
L, (Q) (the space of measurable functions f(x) on Q 
such that | f(x)|?, 1 < p < oo, is integrable) 
168.B 
Lo (Q) 168.B 
Le (Q) (the Lorentz spaces) 168.B 
l-adic cohomology 450.Q 
l-adic coordinate system  3.E 
l-adic representation 3.E 
L-distribution 341.G 
L-estimator 371.H 
L-function 
Artin 450.G,R 
of automorphic representation 450.N 
Dirichlet 450.C 
of elliptic curves 450.8 
Hecke 450.E 
Hecke (with Gróssencharakter) 450.F 
p-adic 450.J 
Weil 450.H 
L-group 450.N 
L-integral 221.B 
L-space 87.K 
abstract 310.G 
Fréchet 87.K 


Subject Index 
L,-algebra 


L,-algebra (of a locally compact Hausdorff group) 
36.L 
L,-space, abstract 310.G 
I*-space 87.K 
(LF)space 424.W 
labeled graph 186.B 
lacunas for hyperbolic operators 325J 
lacunary structure (of a power series) 339.E 
ladder 
down- 206.B 
up- 206.B 
ladder method 206.B 
lag 163.A 
lagged variables 128.C 274.C 
Lagrange, J. L. 238 
Lagrange bracket 82.A 324.D 
Lagrange-Charpit method 322.B, App. A, Table 


IER 
Lagrange differential equation 320.A, App. A, 
Table 14I 


Lagrange differential equation, Euler- 46.B 
Lagrange equations of motion 271.F 
Lagrange formula (for vector triple product) 
442.C 
Lagrange identity 252.K 
Lagrange interpolation 223.A 
Lagrange interpolation coefficients 223.A 
Lagrange interpolation polynomial 223.A 336.G, 
App. A, Table 21 
Lagrange manifold, conic 345.B 
Lagrange method 
(of describing the motion of a fluid) 205.A 
of indeterminate coefficients 106.L 
of variation of constants 252.D 
Lagrange multiplier 46.B 
method of 106.L 
Lagrange partial differential equation App. A, 
Table 15.II 
Lagrange problem (in calculus of variations) 46.A 
Lagrange remainder App. A, Table 9.IV 
Lagrange resolvent 172.F 
Lagrange stable 126.E 
negatively 126.E 
positively 126.E 
Lagrange-stable motion 420.D 
Lagrangian density 150.B 
free 150.B 
Lagrangian derivative 205.A 
Lagrangian function 271.F 292.A 
Lagrangian manifold, conic 274.C 
Lagrangian vector field 126.L 
Laguerre differential equation App. A, Tables 
14.II 20.VI 
Laguerre formula, Gauss- (in numerical integration) 
299.A 
Laguerre function App. A, Table 20.VI 
Laguerre geometry  76.B 
Laguerre inversion 76.B 
Laguerre polynomials 317.D, App. A, Table 20.VI 
associated 317.D 
Laguerre transformation 76.B 
Lambert series 339.C 
Lamé differential equation 133.B 
generalized 167.E 
Lamé function 
of the first kind 133.B 
of the first species 133.C 
of the fourth species 133.C 
of the second kind 133.C 
of the second species 133.C 
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of the third species 133.C 
lamellar vector field 442.D 
laminar flow 205.E 433.A 
Lanczos method 298.D,E 301.N 
Landau constant 77.F 
Landau equation 146.C 
Landau-Nakanishi equation 146.C 
Landau-Nakanishi variety 146.C 386.C 
Landau symbols (0,0) 87.G 
Landau theorem 43.J 
Wiener-Ikehara 123.B 
Landau variety 146.C 
Landen transformation 134.B, App. A, Table 
16.111 
Lane-Emden function 291.F 
Langevin equation 45.1 402.K 
language 31.D 276.A 
accepted by 31.D 
external 75.C 
machine 75.C 
Laplace, P. S. 239 
Laplace-Beltrami operator 194.B 
Laplace differential equation 323.A, App. A, 
Table 15.11 
2-dimensional case App. A, Table 15.VI 
3-dimensional case App. A, Table 15.VI 
Laplace expansion theorem (on determinants) 
103.D 
Laplace-Mehler integral representation App. A, 
Table 18.II 
Laplace method 30.B 
Laplace operator 323.A 442.D 
Laplace spherical functions 393.A 
Laplace-Stieltjes transform 240.A 
Laplace theorem, de Moivre- 250.B 
Laplace transform 240, App. A, Table 12.1 
Fourier- 192.F 
Laplace transform and operational calculus App. 
A, Table 12 
Laplacian 323.A 442.D, App. A, Table 3.II 
in the large 109 
large deviation 250.B 
large inductive dimension (Ind) 117.B 
large numbers 
law of 250.B 
strong law of 250.C 
weak law of 395.B 
larger, stochastically 371.C 
larger topology 425.H 
large sample theory 401.E 
large semigroup algebra | 29.C 
large sieve method 123.E 
largest nilpotent ideal (of a Lie algebra) 248.D 
Lashnev space 425.CC 
last multiplier, Jacobi App. A, Table 14.1 
last-out memory, first-in 96.E 
last theorem of Fermat 145 
last theorem of Poincaré 153.B 
latin rectangle 241.E 
Latin square(s) 102.K 241 
lattice(s) 
(of a crystallographic group) 92.A 
(=lattice ordered set) 243 
(ofa Lie group) 122.G 
(in R”) 182.B 
A- 92E 
anti-isomorphic 243.C 
Archimedean vector 310.C 
B- 92E 
Banach 310.F 
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black and white 92.D 
Boolean 42.A 243.E 
Boolean, of sets 243.E 
Bravais 92.B 
C- 92.E 
color 92.D 
complemented 243.E 
complete 243.D 
complete vector 310.C 
conditionally complete 243.D 
conditionally o-complete 243.D 
critical, in M with respect to S 182.B 
distributive 243.E 
dual 243.C 310.E 450.K 
dually isomorphic 243.C 
F- 92E 
g- (of a separable algebra) 27.A 
homogeneous (in R”) 182.B 
integral g- 27.A 
l- 92E 
inhomogeneous (in R”) 182.B 
modular 243.F 
normal g- 27.A 
normal vector 310.F 
one-dimensional 287.A 
P- 92E 
primitive 92.E 
quotient 243.C 
ol seis 243.E 
o-complete 243.D 
o-complete vector 310.C 
Toda 287.A 387.A 
vector 310.B 
weight 92.C 
lattice constants (of a lattice group) 92.C 
lattice distribution 341.D 
lattice equivalent 92.D 
lattice gauge theory 150.G 
lattice group(s) 182.B 
(of a crystallographic group) 92.A 
lattice-homomorphism | 243.C 
lattice-isomorphism | 243.C 
lattice-ordered group 243.G 
Archimedean 243.G 
lattice-ordered linear space 310.B 
lattice-ordered set 243.A 
lattice-point formula 220.B 
lattice-point problems 242 
lattice-spin systems 402.G 
latus rectum 78.D,E 
Laurent expansion 198.D 
Laurent series 339.A 
law(s) 
(of a solution of a stochastic differential 
equation) 406.D 
absorption (in algebra of sets) 381.B 
absorption (in a lattice) 243.A 
of action and reaction 271.A 
adiabatic 205.B 
alternating (in a Lie algebra) 248.A 
antisymmetric (for ordering) 311.A 
arcsine (for Brownian motion) 45.E 
arcsine (for distribution function} 250.D 
arcsine (for random walk) 260.E 
associative (for addition) 294.B 
associative (in algebra of sets) 381.B 
associative (of correspondences} 358.B 
associative (of group composition) 190.A 
associative (in a lattice) 243.A 
associative (for multiplication] 294.B 





Subject Index 
Law(s) 


associative (in a ring) 368.A 

Blumenthalzero-one 261.B 

Brandts 241.C 

cancellation (for addition) 294.B 

cancellation (in a commutative semigroup) 
190.P 

cancellation (for multiplication) 294.B 

commutative (for addition) 294.B 368.A 

commutative (in algebra of sets) 381.B 

commutative (of group composition) 190.A 

commutative (in a lattice) 243.A 

commutative (for multiplication) 294.B 368.A 

complementary, of quadratic reciprocity of 
Jacobi symbol 297.1 

complementary, of reciprocity 14.0 

of complementation (in a Boolean algebra) 
42.A 

complete distributive (in a lattice-ordered 
group) 243.G 

ofcomposition 409.A 

of composition, external 409.A 

of composition, internal 409.A 

of cosines (on spherical triangles) 432.B, App. 
A, Table 2.111 

of cosines, first 432.A, App. A, Table 2.II 

of cosines, second 432.A, App. A, Table 31 

of cotangents App. A, Table 2.III 

de Morgane (in algebra of sets) 381.B 

de Morgan's (in a Boolean algebra) 42.A 

differentia) 107.A 

distributive (in algebra of sets) 381.B 

distributive (in a lattice) 243.E 

distributive (of natural numbers) 294.B 

distributive (in a ring) 368.A 

even-oddness conservation 150.D 

of excluded middle 156.C 411.L 

explicit reciprocity (of norm-residue symbol) 
14.R 257.H 

first complementary, of quadratic reciprocity of 
Legendre symbol 297.1 

Gel fand-Pyatetskii-Shapiro reciprocity (on 
unitary representations) 437.DD 

general associative (for group composition) 
190.C 

Hewitt-Savage zero-one 342.G 

Hookes 271.G 

idempotent (in a lattice) 243.A 

of inertia 271A 

of inertia, Sylvester (for a quadratic form) 
348.C 

initial (for stochastic differential equation) 
406.D 

of iterated logarithm 45.F. 

of iterated logarithm, Khinchin 250.C 

Joules 130.B 

Kepler’s first 271.B 

Kepler’s second 271.B 

Kepler's third 271.B 

Kirchhoff 282.B 

Kolmogorov zero-one 342.G 

of large numbers 250.B 395.B 

of large numbers, strong | 250.C 

Maxwell-Boltzmann distribution 402.B 

modular (in a lattice) 243.F 

of motion 271.A 

of motion, Newton’s three 271.A 

Newton (on frictional stresses) 205.C 

Newtons first 271.A 

Newton'ssecond | 271.A 

Newton's third 271.A 


Subject Index 
Lax equivalence theorem 


Ohm 130.B 259 
of quadratic reciprocity of Jacobi symbol 
297.1 
of quadratic reciprocity of Legendre symbol 
297.1 
of reaction 271A 
of reciprocity 14.0 297.I 
of reciprocity, Artin general 59.C 
of reciprocity, general 14.0 
reciprocity, of Shafarevich 257.H 
reflexive (for an equivalence relation) 135.A 
reflexive (for ordering) 311.A 
second complementary, of quadratic reciprocity 
of Legendre symbol 297.1 
of similarity, Prandtl-Glauert 205.D 
of similarity, Reynolds |. 205.C 
ofsimilitude 116 
of sines 432.A, App. A, Table 2.II 
of sines (for spherical triangles) 432.B, App. 
A, Table 2.HI 
of sines and cosines App. A, Table 211 
of small numbers 250.B 
symmetric (for an equivalence relation) 135.A 
of symmetry (for the Hilbert norm-residue 
symbol) 14.R 
of tangents App. A, Table 2.III 
transitive (for an equivalence relation) 135.A 
transitive (for ordering) 311.A 
of universal gravitation 271.B 
zero-one 342.G 
Lax equivalence theorem 304.F 
Lax representation 287.B 387.C 
Lax-Wendroff scheme 304.F 
layout, two-way 155.H 
layer 
boundary 205.C 
potential of double 338.A 
potential of single 338.A 
LBA problem 31.F 
L.B. (loosely Bernoulli) 136.F 
LCL (lower control limit) 404.B 
L.C.M. (least common multiple) 67.H 297.A 
leaf (leaves) 
(of a foliation) 154.B 
compact 154.D 
growth of 154.H 
leaftopology 154.D 
learning model 346.G 
least action, principle of 441.B 
least common multiple 67.H 297.A 
least element (in an ordered set) 311.B 
least favorable distribution 400.B 
least favorable a priori distribution 398.H 
least square approximation 336.D 
least squares, method of 
(for estimation) 403.E 
(for higher-dimensional data) 397.J 
(for numerical solution) 303.I 
least squares estimator 403.E 
generalized 403.E 
least squares method 
indirect (in econometrics) 128.C 
three-stage 128.C 
two-stage 128.C 
least squares problem, linear 302.E 
least upper bound (ordered sets) 310.C 311.B 
Lebesgue, H. L. 244 
Lebesgue area (of a surface) 246.C 
Lebesgue convergence theorem 221.C 
Lebesgue decomposition theorem 270.L 380.C 
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Lebesgue density theorem 100.B 
Lebesgue dimension 117.B 
Lebesgue extension 270.D 
Lebesgue integrable 221.B 
Lebesgue integral 221.B 
Lebesgue measurability and the Baire property 
33.F 
Lebesgue measurable (set) 270.G 
Lebesgue measurable function 270.J 
Lebesgue measure 270.G 
generalized 270.E 
Lebesgue measure space (with a finite c-finite 
measure) 136.A 
Lebesgue method of summation 379.S 
Lebesgue number 273.F 
Lebesgue outer measure 270.G 
Lebesgue-Radon integral 94.C 
Lebesgue spectrum, countable 136.E 
Lebesgue-Stieltjes integral 94.C 166.C 
Lebesgue-Stieltjes measure 166.C 270.L 
Lebesgue test (on the convergence of Fourier series) 
159.B 
Lebesgue theorem 
(on the dimension of R") 117.D 
Borel- 273.H 
Cantor- 159 
Riemann- 159.A 160.A 
Le Cam theorem 399.K 
Lefschetz duality theorem, Poincaré- 201.0 
Lefschetz fixed-point formula 450.Q 
Lefschetz fixed-point theorem 153.B 
Lefschetz formula, Picard- 418.F 
Lefschetz number 
(of a continuous mapping) 153.B 
(of a variety) 16.P 
Lefschetz pencil 16.U 
Lefschetz theorem 
strong 16.U 
weak 16.U 
Lefschetz transformation, Picard- 16.U 
left, limit on the 87.F 
left A-module 277.D 
left adjoint functor 52.K 
left adjoint linear mapping 256.Q 
left annihilator 29.H 
left Artinian ring 368.F 
left balanced functor 200.1 
left continuous 84.B 
left coset 190.C 
left coset space 423.E 
left decomposition, Peirce (in a unitary ring) 368.F 
left derivative 106.A 
left derived functor 200.,Q 
left differentiable 106.A 
left distributive law 312.C 
left endpoint (of an interval) 355.C 
left exact functor 200.1 
left G-set 362.B 
left global dimension (of a ring) 200.K 
left hereditary ring 200.K 
left ideal 368.F 
integral 27.A 
left invariant Haar measure 225.C 
left invariant metric (of a topological group) 423.I 
left invariant tensor field (on a Lie group) 249.A 
left inverse element (in a ring) 368.B 
left linear space 256.A 
left Noetherian ring 368.F 
left D,-ideal 27.A 
left operation 409.A 
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left order (of a a-lattice) 27.A 
left parametrix | 345.A 
left projective resolution (of an A-module) 200.C 
left projective space 343.F 
left quotient space (of a topological group) 423.E 
left regular representation 
(of an algebra) 362.C 
(ofa group) 362.B 
left resolution (of an A-module) 200.C 
left satellite 200.1 
left semihereditary ring 200.K 
left semi-integral 68.N 
left shunt 115.B 
left singular point (of a diffusion process) 115.B 
left translation 249.A 362.B 
left uniformity (of a topological group) 423.G 
Legendre associated differential equation 393.A 
Legendre coefficient 393.B 
Legendre differential equation 393.B, App. A, 
Table 14.II 
Legendre function 393.B, App. A, Table 18.II 
associated App. A, Table 18.III 
associated (of the first kind) 393.C, App. A, 
Table 18.III 
associated (of the second kind) 393.C, App. 
A, Table 18.III 
of the first kind 393.B, App. A, Table 18.11 
of the second kind 393.B, App. A, Table 18.II 
Legendre-Jacobi standard form 134.A, App. A, 
Table 16.I 
Legendre polynomial 393.B, App. A, Table 18.II 
Legendre relation 134.F, App. A, Table 16.I 
Legendre symbol 297.H 
first complementary law of reciprocity of 297.1 
law of quadratic reciprocity of 297.1 
second complementary law of reciprocity of 
297.1 
Legendre transform  419.C 
Legendre transformation (contact transform) 82.A, 
App. A, Table 15.IV 
Lehmann representation, Källén- 150.D 
Lehmann-Scheffé theorem 399.C 
Lehmann-Stein theorem 400.B 
Lehmann theorem 371.C 
Hodges- 399.E,H 
Lehmann weight, Källén- 150.D 
Lehmer method 301.K 
Leibler (K-L) information number, Kullback- 
398.G 
Leibniz, G. W. 245 
Leibniz formula (in differentiation) 106.D, App. 
A, Table 9.1IT 
Leibniz test (for convergence) 379.C 
lemniscate 93.H 
Bernoulli 93.H 
length 246 
(of a broken line) 139.F 
(ofacurve) 93.F 246.A 
(of a descending chain in a lattice) 243.F 
(of a module) 277.1 
(of a multi-index) 112.A 
(of a normal chain in a group) 190.G 
(ofa path) 186.F 
(ofa segment) 139.C 
(ofa Witt vector) 449.B 
affine 110.C 
affine arc 110.C 
extremal (of a family of curves) 143.A 
extremal, defined by Hersch-Pfluger 143.A 
extremal, with weight 143.B 


Subject Index 
Lie algebra(s) 


of finite(module) 277.1 
focal 180.B 
z- (ofa group) 151.F 
queue 260.H 
wave 446 
lens, Luneburg's 180.A 
lens space 91.C 
infinite 91.C 
Leopoldt's conjecture 450 
leptons 132.B 
Leray-Hirsch theorem 201.J 
Leray-Schauder degree 286.D 
Leray-Schauder fixed-point theorem 286.D 323.D 
letter 
(in information theory) 63.A 213.B 
(=variable) 369.A 
level 
(ofa factor) 102.H 
(of a modular form) 32.C 
(of a modular function) 32.C 
(of an orthogonal array) 102.L 
(of a principal congruence subgroup) 122.D 
(ofa test) 400.A 
(of a tolerance region) 399.R 
average outgoing quality 404.C 
confidence 399.Q 
level x test 400.A 
minimax 400.F 
most stringent 400.F 
unbiased 400.C 
uniformly most powerful (UMP) invariant 
400.E 
uniformly most powerful (UMP) unbiased 
400.C 
level n structure (on an Abelian variety) 3.N 
level set (of a C?-function) 279.D 
level surface 193.J 
Levi-Civita, parallel in the sense of 111.H 
Levi-Civita connection 364.B 
Levi condition 321.G 325.H 
Levi decomposition 
(on algebraic groups) 13.Q 
(on Lie algebras) 248.F 
Levi form 344.4 
generalized 274.G 
Levi problem 21.L;F 
Levi pseudoconvex domain 21.I 
locally 21.1 
Levisubgroup 13.Q 
Levitan-Marchenko equation, Gel'fand- 
(for KdV equations) 387.D 
(for a nonlinear lattice) 287.C 
Lévy canonical form 341.G 
Lévy continuity theorem 341.F 
Lévy distance 341.F 
Lévy-Itó theorem (on Lévy processes) 5.E 
Lévy measure 5.E 
Lévy process 5.B 
Lévy theorem, Wiener- 159.1 
Lewy-Mizohata equation 274.G 
lexicographic linear ordering 248.M 
lexicographic ordering 311.G 
liability reserve 214.B 
Lie, M.S. 247 
Lie algebra(s) 248, App. A, Table 5.I 
(of an algebraic group) 13.C 
(ofa Lie group) 249.B 
Abelian 248.C 
adjoint 248.B 
algebraic 13.C 


Subject Index 
Lie derivative 


classical compact real simple 248.T 
classical complex simple 248.S 
compact real 248.P 
compact real simple App. A, Table zl 
complex 248.A 
complex (of a complex Lie group) 249.M 
complex simple App. A, Table 5.I 
of derivations 248.H 
exceptional compact real simple 248.T 
exceptional complex simple 248.S 
general linear 248.A 
isomorphic 248.A 
nilpotent 248.C 
noncompact real simple App. A, Table 5.II 
quotient 248.A 
real 248.A 
reductive 248.G 
restricted 248.V 
semisimple 248.E 
simple 248.E 
solvable 248.C 
Lie derivative 105.0,Q 
Lie fundamental theorem (on a local Lie group of 
local transformations) 431.G 
Lie group(s) 249 423.M 
Abelian 249.D 
Banach 286.K 
classical compact simple 249.L 
classical complex simple 249.M 
commutative 249.D 
complex 249.A 
direct product of 249.H 
exceptional compact simple 249.L 
exceptional complex simple 249.M 
isomorphic 249.N 
local 423.L 
local (of local transformations) 431.G 
nilpotent 249.D 
quotient 249.G 
semisimple 249.D 
simple 249.D 
simply connected covering 249.C 
solvable 249.D 
topology of, and homogeneous spaces 427 
Lie-Kolchin theorem (on solvable algebraic groups) 
13.F 
Lie line-sphere transformation 76.C 
Lie minimal projection 76.B 
Liénard differential equation 290.C 
lies over (of a compactification) 207.B 
Lie subalgebra 248.A 
associated with a Lie subgroup 249.D 
Lie subgroup 
(ofa Lie group) 249.D 
connected 249.D 
Lie theorem (on Lie algebras) 248.F 
Lie transformation (in circle geometry) 76.C 
Lie transformation group (of a differentiable mani- 
fold) 431.C 
lifetime 260.A 261.B 
(of a particle by a scattering) 132.A 
lift 
(along a curve in a covering surface) 367.B 
(of a differentiable curve) 80.C 
(of a vector field) 80.C 
inflation 200.M 
lifting (in nonstandard analysis) 293.D 
lifting theorem 251.M 
light cone 258.A 
Lighthill-K uo (P.L.K.) method, Poincaré- 25.B 
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Lighthill method 25.B 
lightlike 258.A 359.B 
likelihood 374.J 
likelihood equation 399.M 
likelihood estimating function 399.M 
likelihood estimator, maximum 399.M 
likelihood function 374.J 399.M 
likelihood method, maximum 399.M 
likelihood ratio 400.1 
monotone 374.J 
likelihood ratio test 400.1 
limaçon of Pascal 93.H 
liminal C*-algebra 36.H 
limit 
(of a function) 87.F 
(of an indeterminate form) 106.E 
(ofa mapping) 87.F 
(ofa net) 87.H 
(of a sequence of lattices) 182.B 
(of a sequence of points) 87.E 273.D 
(of a sequence of real numbers) 87.B 355.B 
(of a sequence of sets) 270.C 
(ofa spectral sequence) 200.J 
Banach 37.F 
confidence 399.Q 
direct (of a direct system) 210.B 
elastic 271.G 
generalized 37.F 
inductive (in a category) 210.D 
inductive (group) 210.C 
inductive (of an inductive system) 210.B 
inductive (of sheaves) 383.] 
inductive (space) 210.C 
inductive (of topological spaces) 425.M 
inferior (event) 342.B 
inferior (of a sequence of real numbers) 87.C 
inferior (of a sequence of sets) 270.C 
inverse (of an inverse system) 210.B 
on the left (of a real-valued function) 87.F 
lower (function) 84.C 
lower (of a Riemann integral) 216.A 
lower (of a sequence of real numbers) 87.C 
lower control 404.B 
inthemean 168.B 
order (of an order convergent sequence) 310.C 
projective (in a category) 210.D 
projective (of a family of continuous homomor- 
phisms) 423.K 
projective (group) 210.C 
projective (of a projective system) 210.B 
projective (space) 210.C 
on the right (of a real-valued function) 87.F 
strictly inductive (of a sequence of locally 
convex spaces) 424.W 
superior (event) 342.B 
superior (of a sequence of real numbers) 87.C 
superior (of a sequence of sets) 270.C 
thermodynamic 402.G 
tolerance 399.R 
upper (function) 84.C 
upper (of a Riemann integral 216.A 
upper (of a sequence of real numbers) 87.C 
upper control 404.B 
limit circle type (boundary point) 112.I 
limit cycle 126.1 
limit distribution 250.A 
limit formula, Kroneckers 450.B 
limited information maximum likelihoad method 
128.C 
limit inferior 
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(of a sequence of real numbers) 87.C 
(of a sequence of sets) 270.C 
limiting absorption principle 375.C 
limiting hypersphere (in hyperbolic geometry) 
285.C 
limit ordinal number 312.B 
limit point 
(of a discontinuous group) 122.C 
(of a sequence) 87.B,E 


o 126.D 
negative 126.D 
w- 126.D 


positive 126.D 
limit point type (boundary point) 112.1 
limit set 234.A 
x- 126.D 
first negative prolongational 126.D 
first positive prolongational 126.D 
w- 126.D 
residual 234.E 
limit superior 
(of a sequence of real numbers) 87.C 
(of a sequence of sets) 270.C 
limit theorem(s) 250.A 


basic 260.C 
central 250.B 
local 250.B 


in probability theory 250 
limit value (of a mapping) 87.F 
Lindeberg condition 250.B 
Lindelöf asymptotic value theorem 43.C 
Lindelöf hypothesis 123.C 
Lindelöf space 425.8 
Lindelöf theorem 43.F 
Phragmén- 43.C f 
Lindemann-Weierstrass theorem 430.D 
Lindstedt-Poincaré method 290.E 
line(s) 7.A 93.A 155.B 
(ofa graph) 186.B 
broken 155.F 
complexes, linear 343.E 
complex of 110.B 
concurrent (in projective geometry) 343.B 
congruence of 110.B 
congruences, linear 343.E 
of curvature (on a surface) 111.H 
of force 193.J 
generating (of a circular cone) 78.A 
generating (of a quadric hypersurface) 343.E 
generating (of a quadric surface) 350.B 
generating (of a ruled surface) 111.1 
geodesic 178.H 
Green 193J 
Green, regular 193J 
half- 155.B 
long 105.B 
normal (to acurve) 93.G 
Pascal 78.K 
pencil of (in a projective plane)  343.B 
projective 343.B 
real 355.E 
regression 403.D 
of regression 111.F,] 
straight 93.A 155.B 
stream 205.B 
supporting (function) 89.C 
supporting (of an oval) 89.C 
of swiftest descent 93.H 
tangent 93.G 111.C,F 





Subject Index 
Linearly dependent 


tangent, oriented 76.B 
vector (of a vector field) 442.D 
vortex 205.B 
linear algebra 8 
linear algebraic group 13.A 
linear boundary operators, 315.B 
linear bounded automation 
deterministic 31.D 
nondeterministic 31.D 
linear code 63.C 
linear combination 256.C 
of ovals 89.D 
linear connection 80.H 
linear difference equation 104.C 
linear differential equations, system of (of the first 
order) 252.G 
linear discriminant function 280.1 
linear dynamical system 86.B 
linear equation(s) 10.D 16.M 269.M 
linear equivalence class (of divisors) 16.M 
linear extension (of a rational mapping to an 
Abelian variety) 9.E 
linear fiber mapping (map) 114.D 
linear filter 405.F 
linear form 256.B 277.E 
linear fractional function 74.E 
linear fractional group 60.B 
linear fractional programming 264.D 
linear fractional transformation 74.E 
linear function 74.E 
linear functional 37.C 197.F 424.B 
algebraic 424.B 
linear fundamental figure (of a projective space) 
343.B 
linear genus 15.G 
linear graph 282.A 
linear group P 
Abelian (over K) 60.L 
full 60.B 
general 60.B 256.D 
general (of degree n over K) 60.B 226.B 
general (over a noncommutative field) 60.0 
projective general 60.B 
projective general (of degree n over K) 60.B 
projective special 60.B 


projective special (over a noncommutative field) 


60.0 

special 60.B 

special (of degree n over K) 60.B 

special (over a noncommutative field) 60.0 
linear holonomy | 154.C 
linear homogeneous equations, system of 269.M 
linear homotopy 114.D 
linear hypothesis, general 400.H 
linear integral equation 217.A 
linear isotropy group (at a point) 199.A 
linearized operator 183 286.E 
linearized stability, principle of 286.S 
linear k-step method 303.E 
linear least squares problem 302.E 
linear Lie algebra, general 248.A 
linear line complex 343.E 
linear line congruence 343.E 
linear logistic model 403.C 
linearly compact 422.L 

locally 422.L 
linearly connected homogeneous space 199.A 
linearly dependent (with respect to a difference 

equation) 104.D 


Subject Index 
Linearly dependent elements 


linearly dependent elements 
(in an additive group) 2.E 
(in a linear space) 256.C 
linearly disjoint (fields) 149.K 
linearly equivalent (divisors) 16.M 172.F 
o- (on an algebraic curve) 9.F 
linearly estimable parameter 403.E 
linearly independent elements 
(in an A-module) 277.G 
(in an additive group) 2.E 
(in a linear space) 256.C 
linearly independent family (of elements in a linear 
space) 256.E i 
linearly ordered set 311.A 
linearly reductive 226.B 
linearly representable 66.H 
linear mapping 70.C 256.B 
A- (of an A-module) 277.E 
piecewise 70.C 
linear model 403.D 
log 403.C 
multivariate 280.B 
normal 403.C 
linear multistep method 303.E 
linear network 282.C 
linear operator(s} 37.C 251 
(between linear spaces) 256.B 
bounded 37.C 
linear ordering 311.A 
lexicographic 248.M 
theorem of 155.B 
linear ordinary differential equation(s) 252 253 254 
313.A 
with constant coefficients App. A, Table 14.I 
of the first order App. A, Table 14.1 
of higher order App. A, Table 14.1 
linear parameter 102.A 
linear partial differential equation 320.A 
linear pencil 16.N 
linear prediction theory 395.D 
linear predictor 395.D 
optimal 395.D 
linear programming 255 264.C 
linear programming problem 255.A 
linear recurrent sequence 295.A 
linear regression 397.J 
linear regression function 397.J 403.D 
linear representation 
(of an algebra) 362.C 
(ofa group) 362.C 
(ofa Lie algebra) 248.B 
associated with representation module 362.C 
completely reducible 362.C 
direct sum of 362.C 
equivalent 362.C 
faithful 362.C 
homomorphism of 362.C 
indecomposable 362.C 
irreducible 362.C 
isomorphic 362.C 
reciprocal (of an algebra) 362.C 
reducible 362.C 
semisimple 362.C 
similar 362.C 
simple 362.C 
tensor product of 362.C 
linear simple group 151.I 
linear space(s) 256 
category of (over a ring) 52.B 
complex 256.A 
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dual 256.G 
over a field 256.A 
finite-dimensional 256.C 
Hermitian 256.Q 
infinite-dimensional 256.C 
lattice-ordered 310.B 
left 256.A 
normed 37.B 
ordered 310.B 
quasinormed 37.0 
quotient 256.F 
rea] 256.A 
right 256.A 
self-dual 256.H 
linear stationary iterative process 302.C 
linear structural equation system  128.C 
linear structure 96.C 
linear subspace 
closed 197.E 
of a linear space 256.F 
linear system 
(of divisors) 15.C 16.N 
(of functional-differential equations) 163.E 
ample 16.N 
characteristic (of an algebraic family) 15.F 
complete 9.C 16.N 
complete (defined by a divisor) 16.N 
irreducible 16.N 
reducible 16.N 
very ample 16.N 
linear time-invariant (dynamical system) 86.B 
linear time-varying system 86.B 
linear topological space 424.A 
linear topology 422.L 
linear transformation 
(on a Banach space) 251.A 
(on a linear space) 256.B 
(on a Riemann sphere) 74.E 
(of series) 379.L 
entire 74.E 
semisimple 256.P 
triangular 379.L 
linear unbiased estimator, best 403.E 
linear variety 422.L 
linearly compact 422.L 
line bundle 147.F 
complex 72.F 
complex (determined by a divisor) 72.F 
tautological 16.P 
line coordinates (of a line) 343.C 
line element 111.C 
characteristic 82.C 
of higher order, space of 152.C 
projective 110.B 
line-sphere transformation, Lie 76.C 
linguistics, mathematical 75.E 
link 235.D 
framed 114.L 
linkage invariant, covering 235.E 
link group 235.D 
linking number 99.C 
link polynomial 235.D 
reduced 235.D 
link type 235.D 
Linnik’s constant 123.D 
Liouville formula 252.C 
Liouville number 430.B 
Liouville operator, Sturm- 112.1 
Liouville problem, Sturm- 315.B 
Liouville theorem 
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on bounded entire functions 272.A 
first 134.E 
fourth 134. E 
on integral invariants 219.A 
second 134.E 
third 134.E 
Lip æ (Lipschitz) 84.A 
of order a 84.A 
Lippman-Schwinger equation 375.C 
Lipschitz condition 84.A 163.D 286.B 316.D 
of order o 84.A 
Lipschitz-Killing curvature 279.C 
Lipschitz space 168.B 
Lipschitz test, Dini- (on the convergence of Fourier 
series) 159.B 
list (representation) 96.D 186.D 
little group = 258.C 
Littlewood-Paley theory 168.B 
Littlewood-Sobolev inequality, Hardy- 224.E 
Littlewood supremum theorem, Hardy- App. A, 
Table 8 
Littlewood theorem, Hardy- 
(on bounded functions) 43.E 
(on trigonometric systems) 317.B 
lituus 93.H 
Livesay invariant, Browder- 114.L 
loading 214.A 
factor 280.G 346.F 
Lobachevskiii non-Euclidean geometry 285.A 
local base (in a topological space) 425.E 
local canonical parameter (for power series) 339.A 
local class field theory 59.G 
local cohomology group 125.W 
local concept (in differential geometry) 109 
local continuity 45.F 
local contro] 102.A 
local coordinates 
(on an algebraic variety) 16.0 
(on a differentiable manifold) 105.C 
transformation of 90.D 
local coordinate system 90.D 105.C 
holomorphic 72.A 
local cross section (in a topological group) 147.E 
local degree of a mapping 99.B 
local dimension (of an analytic set at a point) 
23.B 
local equation 
(ofa divisor) 16.M 
regular (at an integral point) 428.E 
local ergodic theorem 136.B 
local field 257.A 
local Gaussian sum 450.F 
local homology group 201.N 
local homomorphism (of a topological group) 
423.0 
local isomorphism (of topological groups) 423.0 
localization 
of a linear representation relative to a prime 
ideal 362.F 
principle of (on convergence tests of Fourier 
series) 159.B 
strict 16.AA 
local Lie group 423.L 
(of local transformations) 431.G 
local limit theorem 250.B 
locally (on a topological space) 425.J 
locally absolutely p-valent (function)  438.E 
locally arcwise connected (space) 79.B 
locally Cartan pseudoconvex (domain) 21.1 
locally closed (set) 425.J 


Subject Index 
Local property 


locally compact space 425.V 
uniformly 425.V 
locally connected (space) 79.A 
locally constructible (constant sheaf) 16.AA 
locally contractible 
(ata point) 79.C 
(space) 79.C 202.D 
locally convex (topological linear space) 424.E 
locally convex Fréchet space 424.] 
locally countable cell complex 70.D 
locally countable simplicial complex 70.C 
locally dense 154.D 
locally equicontinuous semigroup 378.F 
locally equivalent (G-structure) 191.H 
locally Euclidean group 423.M 
locally Euclidean space 425.V 
locally finite 
(algebra) 29J 
(cell complex) 70.D 
(covering) 425.R 
(simplicial complex) 70.C 
c- (covering) 425.R 
locally flat 
(connection) 80.E 
(injection between topological manifolds) 
65.D 
(PL embedding) 65.D 
(Riemannian manifold) 364.E 
locally integrable function 168.B 
locally isomorphic (topological groups) 423.0 
locally Levi pseudoconvex (domain) 21.1 
locally linearly compact (Q-module) 422.L 
locally Macaulay ring 284.D 
locally n-connected 
(ata point) 79.C 
(space) 79.C 
locally Noetherian (scheme) 16.D 
locally Noetherian formal scheme  16.X 
locally of finite type (for a morphism) 16.D 
locally w-connected (space) 79.C 
locally p-valent 438.E 
locally quadratic transformation 
(of an algebraic surface) 15.G 
(of an algebraic variety) 16.K 
(of a complex manifold) 72.H 
locally rectifiable (curve) 143.A 246.4 
locally symmetric Riemannian space 412.A, App. 
A, Table AU 
locally symmetric space 364.D 
affine 80.J 
locally symmetrizable (diffusion processes) 115.D 
locally totally bounded (uniform space) 436.H 
locally trivial fiber space 148.B 
locally uniformized 367.C 
local martingale 262.E 
local maximum modulus principle 164.C 
local moduli space (of a compact complex manifold) 
72.G 
local one-parameter group of local transformations 
105.N 
local operator 125.DD 
local orientation (in an oriented manifold) 201.N 
local parameter 
(around a cusp of a Fuchsian group) 32.B 
(of a nonsingular algebraic curve) 9.C 
(of a Riemann surface) 367.A 


‘local problem (on the solutions of differential 


equations) 289.A 
local property 
(in differential geometry) 109 


Subject Index 
Local regime 


(of a pseudodifferential operator) 345.A 

micro-pseudo- 345,A 

pseudo- (of a pseudodifferential operator) 
345.A 


local regime (in static model in catastrophe theory) 


51.B 
local ring 284.D 
(ofa prime ideal) 67.G 
(ofasubvariety) 16.B 
analytically normal 284.D 
analytically unramified 284.D 
complete 284.D 
Macaulay 284.D 
Noetherian 284.D 
Noetherian semi- 284.D 
quasi- 284. D 
quasisemi- 284.D 
regular 284.D 
semi- 284.D 
structure theorem of complete 284.D 
local-ringed space 383.H 
local section 126.E 
local strategy 173.B 
local system of groups (over a topological space) 
201.R 
local time 45.G 
local transformations, local Lie group of 431.G 
local truncation error 303.E 
local uniformizing parameter 367.A 
location parameter 396.1 400.E 
locus 
cut 178.A 
singular (of a variety) 16.F 
Loeb measure 293.D 
Loeb space 293.D 
logx 131.D 
log,x 131.B 
Logz (logarithm) 131.G 
logarithm 131.B 
common 131.C 
integral 167.D 
Khinchin law of the iterated 250.C 
law of iterated 45.F 
Napierian 131.D 
natural 131.D 
logarithmically convex 21.B 
logarithmic branch point (of a Riemann surface) 
367.B 
logarithmic capacity 48.B 
logarithmic criterion App. A, Table 10.11 
logarithmic curve 93.H 
logarithmic decrement (of a damped oscillation) 
318.B 
logarithmic differentiation App. A, Table 9.I 
logarithmic distribution App. A, Table 22 
logarithmic function to the base a  131.B 
logarithmic integral 167.D, App. A, Table 19.II 
logarithmic normal distribution App. A, Table 


22 
logarithmic paper 19.F 
semi- 19.F 


logarithmic potential 338.A 
logarithmic series 131.D 
logarithmic singularity 
(of an analytic function) 198.M 
(of an analytic function in the wider sense) 
198.P 
logarithmic spiral 93.H 
logic 
algebra of 411.A 





classical 411.L 
intuitionistic 411.L 
many-valued 411.L 
mathematical 411.A 
modal 411.L 
predicate 411.G 
predicate, with equality 411.J 
propositional 411.E 
quantum 351.L 
symbolic 411.A 
three-valued 411.L 
two-valued 411.L 
logical axiom 411.1 
logical choice function, transfinite 411.J 
logical operator 411.E 
logical product (of propositions) 411.B 
logical sum (of propositions) 411.B 
logical symbol 411.B 
logicism 156.A,B 
logistic equation 263.A 
logistic model, linear 403.C 
log linear model 403.C 
logmodular algebra 164.B 
Lommel integral 39.C 
Lommel polynomials App. A, Table 19.IV 
long gravity wave 205.F 
longitude (of a knot) 235.B 
longitudinal wave 446 
long line 105.B 
long water wave 205.F 
look-up, table 96.C 
Looman-Men'shov theorem 198.A 
loop 170 190.P 
self- 186.B 
loopspace 202.C 
loop theorem (on 3-manifolds) 65.E 
loosely Bernoulli 136.F 
Lopatinskii condition, Shapiro- 323.H 
Lopatinskii dterminant 325.K 
Lorentz condition 130.A 
Lorentz force 130.A 
Lorentz group 60.J 258.A 359.B 
full homogeneous 258.A 
full inhomogeneous 258.A 
homogeneous 359.B 
inhomogeneous 359.B 
proper 60.5 359.B 
proper complex 258.A 
Lorentz invariance 150.B 
Lorentz space 168.B a 
Lorentz transformation 359.B 
Lorenz curve 397.E 
loss 
heat 419.A 
of information 138.B 
loss function 398.A 
quadratic 398.A 399.E 
simple 398.A 
lot tolerance percent defective 404.C 
Lówenheim theorem, Skolem- 156.E 
lower bound 311.B 
greatest 310.C 311.B 
lower central series (of a group) 190.J 
lower class 
with respect to local continuity 45.F 


with respect to uniform continuity 45.F 


lower control limit 404.B 

lower derivative 
general (of a set function) 380.D 
ordinary (of a set function) 380.D 
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lower end (of a curvilinear integral) 94.D 
lower envelope principle 338.M 
lowering the superscripts 417.D 
lower integral, Riemann 216.A 
lower limit 

(of a Riemann integral) 216.A 

(of a sequence of real numbers) 87.C 
lower limit function 84.C 
lower order 

(for infinity) 87.G 

(of a meromorphic function) 272.C 
lower semicontinuity (of length) 264.A 
lower semicontinuous (at a point) 84.C 
lower semicontinuous function 84.C 
lower semilattice 243.A 
lower triangular matrix 269.B 
lower variation (of a set function) 380.B 
Lówner differential equation 438.B 
loxodromic transformation 74.F 
LP (linear programming) 255 
LSZ asymptotic condition 150.D 
Lub (least upper bound) 311.B 
Lubanski vector, Pauli- 258.D 
Luenberger observer 86.E 
lumping, mass 304.D 
Luneburg lens 180.A 
Lüroth theorem  16J 
Lutz-Mattuck theorem 118.E 
Luzin first principle (in analytic set theory)  22.C 
Luzin second principle (in analytic set theory)  22.C 
Luzin space 22.1 425.CC 
Luzin theorem 270.J 

Denjoy- 1591 
Luzin unicity theorem (in analytic set theory) 22.C 
Lyapunov characteristic number 314.A 
Lyapunov condition 250.B 
Lyapunov convexity theorem 443.G 
Lyapunov function 126.F 163.G 394.C 
Lyapunov-Schmidt procedure 286.V 
Lyapunov stable 126.F 

in both directions 394.A 

in the negative direction 394.A 

in the positive direction 394.A 

uniformly 126.F 
Lyapunov theorem 398.C 
Lyusternik- Shnirel'man theory 286.Q 


M 


M(Q) (the set of all essentially bounded measurable 
fuctions on Q) 168.B 

{Mp} ultradistribution of class 125.U 

(M,), ultradistribution of class 125.U 

u-absolutely continuous (additive set function) 
380.C 

-completion 270.D 

p-conformal function 352.B 

-constant stratum 418.E 

u-integrable 221.B 

nu-measurable 270.D 

u-nullset 370.D 

u-operator, bounded 356.B 

u-singular (additive set function) 380.C 

m-dissipative 251.J 

mx n matrix 269.A 

mth root 10.C 

M-estimator 371.H 

M-port network 282.C 

M-set 159J 

M-space 425.Y 


Subject Index 
Manifold(s) 


(M)-space (= Montel space) 424.0 
M space, abstract 310.G 
M waves 130.B 
Macaulay local ring 284.D 
Macaulay ring 284.D 
locally 284.D 
Mach cone 205.B 
machine 
Turing 31.B 
universal Turing 31.C 
machine-language program 75.C 
machine scheduling problem 376 
machine sequencing problem 376 
Machin formula 322 
Mach number 116.B 205.B 
Mach wave 205.B 
Mackey-Arens theorem 424.N 
Mackey space | 424.N 
Mackey theorem 424.M 
Mackey topology 424.N 
Mack method, Garside-Jarratt- 301.N 
MacLane complexes, Eilenberg- 70.F 
MacLane space, Eilenberg- 70.F 
MacLane spectrum, Eilenberg- 202.1 
Maclaurin formula, Euler- 379J 
macroeconomic data 128.A 
macroscopic causality (of S-matrix) 386.C 
magnetic field 130.A 
magnetic flux density 130.A 
magnetic group 92.D 
magnetic induction 130.A 
magnetic quantum number, orbital 351.E 
magnetic permeability 130.B 
magnetic polarization 130.A 
magnetic Reynolds number 259 
magnetic susceptibility 130.B 
magentic viscosity 259 
magnetic wave 130.B 
transverse 130.B 
magnetofluid dynamics 259 
magnetohydrodynamics 259 
magnetostatics 130.B 
magnitude (of a vector) 442.B 
Mahalanobis generalized distance 280.E 
Mainardi equations, Codazzi- 111.H, App. A, 
Table 4.I 
main classes 241.A 
main effect 102.H 
main theorem 
(in class field theory) 59.C 
Zariski’s 16.1 
majorant 
(of a sequence of functions) 435.A 
harmonic (of a subharmonic function) 193.S 
method of 316.G 
majorant series 316.G 435.A 
major arc 4.B 
major axis (of an ellipse) 78.C 
major function 100.F 
majorizing function, right 316.E 
Malfatti problem (in geometric construction) 179.A 
Malgrange theorem, Ehrenpreis- 112.B 
Mal'tsev-Iwasawa theorem, Cartan- (on maximal 
compact subgroups) 249.8 
Mal'tsev theorem, Wedderburn- (on algebras) 29.F 
Malus theorem 180.A 
Mandelstam representation 132.C 
Mangoldt function 123.B 
manifold(s) 
almost complex 72.B 


Subject Index 
Manin connection, Gauss- (of a variety) 


almost contact 110.E 

almost parallelizable 114.I 

analytic — analytic manifold 
Banach 105.Z 286.K 

Blaschke manifold 178.G 

with boundary 105.B 

without boundary 105.B 

C- 105.D 

C’-, with boundary 

C’-, without boundary 

center, theorem 286.V 

characteristic (of a partial differential equation) 
320.B 

characteristic classes of 56.F 

closed 105.B 

coherently oriented pseudo- 

combinatorial 65.C 

compact C'- 105.D 

complex — complex manifold(s) 

complex analytic 72.A 

conic Lagrange 345.B 

conic Lagrangian 274.C 

contact 110.E 

covering 91.A 

covering differentiable 91.A 

differentiable, with boundary of class C’ 

differentiable, of class C” 105.D 

with Euclidean connection 109 

Dbered 428.F 

Finsler 286.L 

flag 199.B 

Fréchet 286.K 

G- 431.C 

Grassmann —~ Grassmann manifold 

group (of a Lie transformation) 110.A 

with a handle attached by f 114.F 

h-cobordant oriented  114.I 

Hilbert 105.Z 286.K 

Hodge 232.D 

homology 65.B 

Hopf 232.E 

hyperbolic 21.0 235.E 

integral 428.A,B,D 

irreducible 3- 65.E 

Kahler 232 

k-dimensional integral 

nontrivial 3- 65.E 

ordinary integral (of a differential ideal) 428.E 

orientable (C’-manifold) 105.F 

orientation 201.N 

oriented 105.F 201.N 

oriented G- 431.E 

paracompact C’- 105.D 

parallelizable 114.1 

n- 114] 

PL- 65.C 

Poincaré 

at a point 

prime 3- 65.E 

proper flag 199.B 

pseudo- 65.B 

pseudo-Hermitian 344.F 

Q- 382D 

real analytic 105.D 

regular integral (of a differential ideal) 

Riemannian — Riemannian manifold 

singular integral (of a differential ideal) 428.E 

SCH 178.G 

smooth 105.D 114.B 

space-time | .359.D 


105.E 
105.E 


65.B 


105.E 


191.I 


105.A 
178.G 


428.E 





s-parallelizable 114.I 
stable 126.GJ 
stably almost complex 
stably parallelizable 
Stein 21.L 
Stiefel — Stiefel manifold 
symplectic 219.C 
topological 105.B 
triangulated 65.B 
unstable 126.G,J 
visibility 178.F 
weakly almost complex 
weakly l-complete 21.L 

Manin connection, Gauss- (of a variety) 

Mannheim curve 111.F 

Mann-Whitney U-test 371.C 

MANOVA (multivariate analysis of variance) 

280.B 


114.H 
114.1 


114.H 


16.V 


mantissa (of the common logarithm)  131.C 
many body problem 402.F 420.A 
many-valued (analytic function) 198J 


many-valued function 165.B 
many-valued logic 411.L 
map 381.C (also — mapping) 


bundle 147.B 
covering 91.A 
cubic 157.B 
equivariant 431.A 
first-return 126.C 
G- 431.A 

Gauss 111.G 


Kodaira-Spencer 72.G 


linear fiber 114.D 
normal 114.J 

PL 65A 
Poincaré 126.C 
time-one 126.C 
trivalent 157.B 


mapping 381.C 
A-balanced 277.J 
affine 7.E 


alternating multilinear 256.H 
analytic 21.J 
antiholomorphic 195.B 


antisymmetric multilinear 256.H 
biadditive 277.3 

biholomorphic 21.J 

bijective 381.C 

bilinear 256.H 277.J 

birational 16.1 

biregular (between prealgebraic varieties) 
16.C 

Borel isomorphic 270.C 

of bounded variation 246.H 
bundle 147.B 

œŒ- 105J 

c,- 237.G 

CE 382D 

cellular (between cell complexes) 
chain 200.C 

chain (between chain complexes) 


70.D 


201.B 
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characteristic (in the classification theorem of 


fiber bundles) 
class 202.B 
of class C" 208.B 
classifying 147.G 
closed 425.G 
cochain 200.F 201.H 
complete 241.C 
conformal 198.A 


147.G 
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conjugation (of a Hopf algebra) 203.E 
constant 381.C 

continuous 425.G 

covering 91.A 

degenerate 208.B 

degree 99.A 

degree of 99.A 

diagonal (of a graded coalgebra) 203.B,F 
differentiable, of class CŒ” 105.J 
dual (of a linear mapping) 256.G 
duality 251.5 

equivariant 431.A 

essential 202.B 

exponential 178.A 249.Q 364.C 
extremal horizontal slit 367.G 
extremal quasiconformal 352.C 
extremal vertical slit 367.G 
first-return 126.C 

Fredholm 286.E 

G- 362.B431.A 

Gauss (in geometric optics) 180.B 
generalized conformal 246.1 

of group algebra 192.Q 
harmonic "os P 

hereditarily quotient 425.G 
holomorphic 21.J 72.A 
homological 200.C 
homotopy-associative 203.D 
Hopf 147.E 

identity 381.C 

inclusion 381.C 

inverse 381.C 

inverse, theorem 208.B 
isometric 111.H 273.B 
Kodaira-Spencer 72.G 

linear (between linear spaces) 256.B 
linear (between polyhedrons) 70.C 
linear fiber 114.D 

local degree of 99.B 
meromorphic 23.D 

monotone 311.E 

multilinear 256.H 


nondegenerate holomorphic (between analytic 


spaces) 23.C 

nonexpansive 286.B 

nonsingular, of class C! |. 208.B 

normal 114J 

normal coordinate 364.C 

one-to-one 381.C 

onto 381.C 

open 425.G 

order-preserving 311.E 
orientation-preserving 99.A 
orientation-reversing 99.A 

partial (of a mapping) 381.C 

perfect 425. W 

perspective (in projective geometry) 343.B 
piecewise affine 192.Q 

piecewise linear (between polyhedra) 70.C 
PL 65A 

Poincaré 126.C,G 

product 425.K 

projective (in projective geometry) 343.B 
proper 425.W 

purely inseparable rational 16.1 
quasiconformal 352.B 

quasiperfect 425.CC 

quotient 425.G 

rational 16.1] 


regular (between prealgebraic varieties) 16.C 


Subject Index 
Markov subshift 


regular, of class C! 208.B 

semicontinuous (in a topological linear space) 
153.D 

semilinear 256.P 277.L 

separable (rational) 16.1 

simplicial 70.C 

simplicial (between polyhedra) 70.C 

simplicial (relative to triangulations) 70.C 

skew-symmetric multilinear 256.H 


space 202.C 

space of continuous 435.D 

spin 237.G 

s.s. (semisimplicial ) (between s.s. complexes) 
70.E 


Ss. realization of 70.E 
surjective 381.C 
symmetric multilinear 256.H 
Teichmüller 352.C 
time-one 126.C 
topological 425.G 
topology induced bya 425.] 
transposed (of a diffusion kernel) 338.N 
transposed (of a linear mapping) 256.G 
uniformly continuous 273.1 436.E 
unit 203.F 
mapping chain 201.B 
mapping class 202.B 
mapping cone 202.E 
reduced 202.F 
mapping cylinder 202.E 
mapping space 435.D 
mapping theorem 
Brouwer 99.A 
open 37.1 424.x 
Riemann 77.B 
spectral 251.G 
mapping truck 202.G 
Marchenko equation, Gel'fand-Levitan- 
(for KdV equations) 387.D 
(for a nonlinear lattice) 287.C 
Marcinkiewicz theorem 224.E 
marginal density functions |. 397.I 
marginal distribution 342.C 397.H 
marked K3 surface 72.K 
Markov branching process 44.D 
multitype 44. E 
Markov chains 260.A 342.A 
embedded 260.H 
general 260J 
imbedded 260.H 
(non) recurrent 260.B 
Markov field theory, Euclidean 150.F 
Markovian decision process 127.E 
Markovian policy 405.C 
Markovian type (stochastic differential equation) 
406.D 
Markov inequality (for polynomials) 336.C 
Markov measure 136.D 
Markov operators 136.B 
Markov partition (for an automorphism) 136.G 
Markov process(es) 261 342.A 
branching 44.E 
homogeneous 5.H 
invariant 5.H 
strong 261.B 
Markov property 261.B 
strong 261.B 
Markov shift 136.D 
Markov statistical mechanics 340.C 
Markov subshift 126 


Subject Index 
Markov theorem, Gauss- 


Markov theorem, Gauss- 403.E 
Markov time 261.B 407.B 
Martin axiom (in set theory) 33.F 
Martin bound, Froissart- 386.B 
Martin boundary 207.C 260.I 

dual 2601 
Martin compactification 207.C 
Martineau-Harvey duality 125.Y 
martingale 262 342.A 

{F,}-Wiener 406.B 

local 262.E 
martingale additive functional 261.E 
martingale part 406.B 
martingale problem 115.C 261.C 406.A 
Martin kernel 207.C 
Maslov bundle 274.C 
Maslov index, Keller. 274.C 
mass 132.A 258.C 271.E 

{of a current) 275.G 

center of 271.E 

integrals of the center of 420.A 
mass distribution 

capacitary 338.K 

equilibrium 338.K 
Massey theorem, Blakers- 202.M 
mass lumping 304.D 
mass matrix 304.D 
master equation 402.I 


matched asymptotic expansions, method of 25.B 


mathematical axiom  411.I 


mathematical expectation (of a probability distribu- 


tion) 341.B 
mathematical induction 294.B 

axiom of 294.B 

definition by 294.B 

doulbe 294.B 

multiple 294.B 
mathematical linguistics 75.E 
mathematical logic 411.A 
mathematical modeling 40.G 300 
mathematical models in biology 263 
mathematical object 52.A 
mathematical programming 264.A 


mathematical programming problem 264.B 


mathematical structure 409.B 


mathematical system (for a structure) 409.B 


mathematics 
actuarial 214.A 
combinatorial 66.A 
discrete 66.A 
mathematics in the 18th century 266 
mathematics in the 19th century 267 
mathematics in the 17th century 265 
Mathieu differential equation 268.A 
modified 268.A 
Mathieu functions 268 
modified 268.A 
modified, of the first kind 268.D 
modified, of the second kind 268.D 
modified, of the third kind 268.D 
of the second kind 268.D 
Mathieu group 151.H 
Mathieu method 268.C 
matric group 226.B 
matrix (matrices) 269 
adjacement 186.G 
adjoint 269.] 
Alexander (of a knot) 235.C 
alternating 269.B 





amplification 304.F 
anti-Hermitian 269.] 
antisymmetric 269.B 
association 102.J 
asymptotic covariance 399.K 
ofa bilinear form 256.H 
bounded 269.K 

circuit 254.B 

column finite 269.K 
companion 301.1 
complex orthogonal 269.J 
correlation 397.J 
covariance 341.B 397J 
density 351.B 

design 102.A 403.D 
diagonal 269.A 

Dirac 377.C 

Diracsy 351.G 

error 405.G 

Fisher information ` 200. D 
fundamental cutset 186.G 
fundamental tieset 186.G 
group 226.B 
Hasse-Witt 9.E 
Hermitian 269.] 
identity 269.A 

incidence (of a block design) 102.B 
incidence (of a graph) 186.G 
infinite 269.K 
information 102.I 
inverse 269.B 

invertible 269.B 
iteration 302.C 

Jacobi 390.G 

Jacobian 208.B 

lower triangular 269.B. 
mass 304.D 

mbyn 269A 

of (m,n)-type 269.A 
mxn 269A 

moment 341.B 
monodromy 254.B 
nilpotent 269.F 
noncentrality 374.C 
nonsingular 269.B 
normal 269.I 
orthogonal 269.J 

parity check 63.C 

Pauli spin 258.A 351.G 
period (of a closed Riemann surface) 
period (of a complex torus) 3.H 
port-admittance 282.C 
port-impedance 282.C 
positive definite  269.I 
positive semidefinite 269.1 
principal 3.I 

projection 269.1 

proper orthogonal 269.J 
Q- 260.F 

of quadratic form 348.A 
rational function 86.D 
rectangular 269.A 
regular 269.B 

Riemann 3I 

row finite 269.K 

S- 150.D 386 

sample correlation 280.E 
scalar 269.A 

scale 374.C 
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11.C 
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Seifert 235.C 
semisimple 269.G 
of sesquilinear form  256.Q 
similar square 269.G 
skew h- 269.1 
skew-Hermitian 269.1 
skew-symmetric 269.B 
square 269.A 
stiffness 304.C 
stochastic 260.A 
of the sum of squares between classes 280.B 
of the sum of squares within classes 280.B 
symmetric 269.B 
symplectic 60.L 
transfer function 86.B 
transition 126.3 260.A 
transposed 269.B 
triangular 269.B 
tridiagonal 298.D 
unipotent 269.F 
unit 269.A 
unitary 269.1 
upper triangular 269.B 
variance 341.B 
variance-covariance 341.B 397.J 
weighting Sep 
zero 269.B 
matrix algebra 
full 269.B 
total 269.B 
matrix convex of order m 212.C 
matrix element 351.B 
matrix game 173.C 
matrix group 226.B 
matrix monotone decreasing of order m 212.C 
matrix montone increasing of order m 212.C 
matrix representation 362.D 
matrix Riccati differential equation 86.E 
matrix Riccati equation 405.G 
matrix unit 269.B 
matroid 66.G 
p-ary 66.H 
poly- opt 
operations for 66.H 
Mattuck theorem, Lutz- 118.E 
Maupertuis principle 180.A 
Maurer-Cartan 
differential form of 249.R 
system of differential equations of 249.R 
maximal 
(hypersurface in Minkowski space) 275.H 
(ideal) 368.F 
(in prediction theory) 395.D 
(Riemann surface) 367.F 
maximal concentration function 341.E 
maximal condition 311.C 
maximal deficiency (of an algebraic surface) 15.E 
maximal dilatation 352.B 
maximal dissipative operator 251.J 
maximal element (in an ordered set) 311.B 
maximal entropy 136.C,H 
maximal ergodic lemma 136.B 
maximal filter 87.1 
maximal function 
nontangential 168.B 
radial 168.B 
maximal ideal 67.C 368.F 
with respect to S. 67.C 
maximal ideal space (of a Banach algebra) 36.E 


Subject Index 
Mean 


maximal independent system (of an additive group) 
2.E 
maximal inequality (= maximal ergodic lemma) 
136.B 
maximal invariant statistic 396.1 
maximal k-split torus 13.Q 
maximally almost periodic group 18.1 
maximally overdetermined (=holonomic) 274.H 
maximal operator 112.E 
maximal order 27.A 
maximal prime divisor (of an ideal) 67.F 
maximal separable extension (of a field) 149.H 
maximal toroidal subgroup (of a compact Lie group) 
248.X 
maximal torsion subgroup (of an Abelian group) 
2.A 
maximal torus (of a compact Lie group) 248.X 
maximum, relative (of a function) 106.L 
maximum element (in an ordered set) 311.B 
maximum-flow minimum-cut theorem 281.C 
maximum-flow problem  281.C 
maximum likelihood estimator 399.M 
maximum likelihood method 399.M 
limited information 128.C 
maximum modulus principle (for a holomorphic 
function) 43.B 
local 164.C 
Cartan 338.L 
complete 338.M 
maximum principle 
(for analytic functions) 43.B 
(in control theory) 86.F 
(for harmonic functions) 193.E 
(for minimal surface) 275.B 
(for parabolic operators) 327.D 
dilated (in potential theory) 338.C 
entropy 419.A 
first (in potential theory) 338.C 
Frostman's 338.C 
Hopf (for equations of elliptic type)  323.C 
strong (for equations of elliptic type)  323.C 
Ugaheris 338.C 
maximum return 127.B 
maximum-separation minimum-distance theorem 
281.C 
maximum solution (of a scalar equation) 316.E 
maximum spectral measure 390.G 
Maxwell-Boltzmann distribution law | 402.B 
Maxwell convention 51.F 
Maxwell equations 130.A 
Maxwell fisheye 180.A 
Maxwell relations 419.B 
Maxwell stress tensor 130.A 
Maxwell theorem (on spherical functions) 393.D 
Mayer-Vietoris exact sequence (for a proper triple) 
201.C 
relative 201.L 
Mazur theorem 37.F 
Gelfand- 36.E 
meager set 425.N 
non- 425.N 
mean 
(of an almost periodic function) 18.B,E 
(of numbers or a function) 211.C 
(of a probability distribution) 341.B 
(of a random variable) 342.C 
(of a statistical data) 397.C 
(of a weakly stationary process) 395.A 
a-trimmed 371.H 


Subject Index 
Mean absolute deviation 


arithmetic 211.C 397.C 
arithmetico-geometric 134.B 
bounded, oscillation 168.B 
conditional (of a random variable) 342.E 397.I 
continuous in the 217.M 407.A 
convergence in the, of order p 168.B 342.D 
407.A 
convergence in the, of power p 168.B 
of degree r (of a function with respect to a 
weight function) 211.C 
Fejér 159.C 
geometric 211.C 
geometrical 397.C 
harmonic 211.C 397.C 
limitinthe 168.B 
moment about the (kth) 341.B 
population 396.C 
sample 396.C 
mean absolute deviation 397.C 
mean anomaly 309.B 
mean concentration function 341.E 
mean content (of a tolerance region) 399.R 
mean curvature 111.H 364.D 365.D, App. A, Table 
4] 
total 365.0 
mean curvature vector 365.D 
mean energy 402.G 
mean entropy 402.G 
mean ergodic theorem 136.B 
mean free energy 340.B 402.G 
mean motion 309.B 
mean number of sheets (of a covering surface of 
a Riemann sphere) 272.J 
mean oval (of two ovals) 89.D 
mean p-valent, areally 438.E 
mean p-valued, circumferentially 438.E 
mean recurrence time 260.C 
mean square error 399.E 403.E 
mean unbiased, asymptotically 399.K 
mean unbiased estimator 399.C 
mean value 
(of a continuous function on a compact group) 
69.A 
(of a weakly stationary process) 395.C 
mean value theorem 
(in differential calculus) 106.E 
(on harmonic functions) 193.E 
first (in the Riemann integral) 216.B 
second (in the D-integral) 100.G 
second (in the Riemann integral) 216.B 
second (for the Stieltjes integral) 94.C 
Siegel 182.E 
Vinogradov AE 
mean vector 341.B 
measurability 443.I 
strong 443.1 
measurability theorem, Péttis 443.B 
measurable 
(flow) 136.D 
(multivalued vector function) 443.1 
(operator function) 308.G 
(set) 270.D,G 
(in set theory) 33.F 
(stochastic process) 407.A 
(transformation) 136.B 
(vector valued function) 443.B 
absolutely 270.L 
B- 270.C 
Baire 270.L 
Jordan 270.D,G 
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Lebesgue 270.G 

u- 270.D 

nearly Borel 261.D 

progressively (stochastic process) 407.A 

real-valued (in set theory) 33.F 

with respect to a family of random variables 

342.C 

with respect to u* 270.E 

scalarly 443.I 

strongly 443.B,I 

universally 270.L 

weakly 443.BI 
measurable cardinal number 33.E 
measurable event 342.B 
measurable function 270.J 

B- 270J 

Baire 270.L 

Borel 270.J 

Lebesgue 270J 

universally 270.L 
measurable space(s) 270.C 

analytic 270.C 

complete 270.D 

isomorphic 398.D 

standard 270.C 
measurable vector function 308.G 
measure 270.D,G 

ofanangle 139.D 

of association 397.K 

atomless probability 398.C 

B-regular 270.F 

Borel 270.G 

bounded 270.D 

canonical 115.B 260.G 

characteristic 407.D 

Carathéodory 270.E 

Carathéodory outer 270.E 

complete 270.D 

completely additive 270.D 

complex spectral 390.D 

convergence in 168.B 

ó- 270.D 

distortion 213.E 

excessive 261.F 

finitely additive 270.D 

G-invariant 225.B 

Gaussian random 407.D 

generalized Lebesgue 270.E 

of genus (of a positive definite symmetric 

matrix) 348.K 


Gibbs 136.C 
Green 193J 
Haar 225.C 


harmonic 120.C 169.B 207.B 260.1 
Hausdorff 169.D 

idempotent 192.P 

image 270.K 

inner harmonic 169.B 

invariant 136.B 255.B 260.A,I 261.F 
Jensen 164.K 

Jordan 270.D,G 

killing 115.B 

kinetic 228.A 

K-regular 270.F 

Lebesgue 270.G 

Lebesgue outer 270.G 
Lebesgue-Stieltjes 166.C 270.L 
left invariant Haar 225.C 
oflength 139.C 

Lévy 5E 
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of location 397.C 
Loeb 293.D 
Markov 136.D 
orthogonal 164.C 
outer 270.E,G 

outer harmonic 169.B 


Plancherel (of a locally compact group) 437.L 


Poisson random 407.D 

positive Radon 270 

probability 341 342.B 

product 270.H 

quasi-invariant 225.5 

quotient 225.H 

Radon 270.G 

real spectral 390.D 

regular 270.F 

relatively invariant 225.H 

representing 164.C 

right invariant Haar 225.C 

c-additive 270.D 

o-finite 270.D 

signed 380.C 

smooth (for a Riemannian metric) 136.G 

smooth invariant 126.J 

spectra] 390.D,K 395.B,C 

speed 115.B 

subinvariant 261.F 

superharmonic 260.1 

of variability 397.C 

vector 443.G 

Weil 225.G 

Wiener 45.B 250.E 
measure algebra 192.0 
measure-preserving (transformation) 136.B 
measure problem, invariant 136.C 
measure space 270.D 

bounded 270.D 

complete 270.D 

complete product 270.H 

Lebesgue, with a finite (or o-finite) measure 

136A 

product 270.H 

c-finite 270.D 
measure theory 270 
mechanics 

celestial 55.A 

classical 271.A 

classical statistical 402.A 

equilibrium statistical 402.A 

graphical 19.C 

Markov statistical 340.C 

Newtonian 271.A 

nonlinear 290.A 

quantum 351 

quantum statistical 402.A 

statistical 342.A 402 
mechanism, Higgs 132.D 
median 341.H 396.C 397.C 

sample 396.C 


mediant (of two fractions in Farey sequence) 4.B 


median unbiased estimator 399.C 
medieval mathematics 372 
meet 
(in a Boolean algebra) 42.A 
(in an ordered set) 243.A 
(of sets) 381.B 
Mehler formula App. A, Table 19.III 


Mehler integral representation, Laplace- App. A, 


Table 18.11 
Mellin transform 220.C 








Subject Index 
Method(s) 


member (of a set) 381.A 
membrane 
equation ofa vibrating 325.A 
permeable 419.A 
memory 
fading 163.1 
first-in first-out 96.E 
first-in last-out 96.E 
memory channel 
almost finite 213.F 
finite 213.F 
memoryless channel 213.C 
discrete 213.F 
memory unit 75.B 
Menelaus theorem (in affine geometry) 7.A 
Menger-Nobeling embedding theorem 117.D 
Men'shov theorem, Looman- 198.A 
Rademacher- 317.B 
Mercer theorem 217.H 
merging 96.C 
meridian 
(ofaknot) 235.B 
(of a surface of revolution) 111.H 
meromorphic (in a domain) 272.A 
meromorphic curve 272.L 
meromorphic differential (on a Riemann surface) 
367.H 
meromorphic function(s) 21J272.A 
(on an analytic set) 23.D 
(on a complex manifold) 72.A 
transcendental 272.A 
meromorphic mapping, proper (between analytic 
spaces) 23.D 
meromorphy 
circle of (of a power series) 339.D 
radius of (of a power series) 339.D 
Mersenne number 297.E, App. B, Table 1 
Mersenne prime 297.E 
Mertens theorem (on the Cauchy product of two 
series) 379.F 
mesh of a covering (in a metric space) 273.B 
mesons 132.B 
meta-Abelian group 190.H 
metabolic model (in catastrophe theory) 51.F 
metamathematics 156.D 
metastable range (of embeddings) 114.D 
method(s) 
Abel, of summation 379.N 
Adams-Bashforth 303.E 
Adams-Moulton 303.E 
ADI 304.F 
alternating direction implicit (ADI) 304.F 
Arrow-Hurwicz-Uzawa gradient 292.E 
of averaging 290.D 
Bairstow 301.E 
Bernoulli  301J 
Borel, of summation 379.0 
branch and bound 215.D 3 
Cesaro, of summation of order o 379.M 
Cholesky 302.B 
circle 4.B 
collocation 303.1 
congruence 354.B 
conjugate gradient (C.G.) 302.D 
constructive 156.D 
continuation 301.M 
Crout 302.B 
cyclic Jacobi 298.B 
d'Alembert, of reduction of order 252.F 
Danilevskii 298.D 


Subject Index 
Method(s) 


Davidenko, of differentiation with respect to a 
parameter 301.M 

Dejon-Nickel 301.G 

difference 303.A 

direct (in the calculus of variations) 46.E 

discrete variable 303.A 

distribution-free 371.A 

Doolittle 302.B 

downhill 301.L 

Duhamel 322.D 

Durand-Kerner (DK) 301.F 

Durand-Kerner-Aberth (DKA) 301.F 

Enskog 217.N 

Euclidean 150.F 

Euler (of describing the motion of a fluid) 
205.A 

Euler (of numerical solution of ordinary differ- 
ential equations) 303.E 

Euler (of summation) 379.P 

expansion 205.B 

extrapolation 303.F 

factorization 206.B 

of false position 301.C 

of feasible directions (in nonlinear program- 
ming) 292.E 

finite element 223.G 304.C 

fixed point 138.B 

floating point 138.B 

Frobenius App. A, Table 14.1 

Galerkin 303.1 304.B 

Garside-Jarratt-Mack 301.N 

Gauss-Seidel 302.C 

Givens 298.D 

gradient 292.E 

Graeffe 301.N 

graphical, of statistical inference 19.B 

Green function 402.J 

of harmonic balance 290.D,E 

Hessenberg 298.D 

Hill, of solution 268.B 

Hitchcock 301.E 

hodograph 205.B 

Horner 301.C 

Householder 298.D 

implicit 303.E 

implicit enumeration 215.D 

Ince-Goldstein 268.C 

indirect least squares 128.C 

interpolation 224.A 

isoparametric 304.C 

Jacobi (of numerical computation of eigen- 
values) 298.B 

Jacobi (in numerical solution of linear 
equations) 302.C 

Jacobi second, of integration 324.D 

Jeffreys 25.B 

killing (of obtaining a homotopy group) 202.N 

ladder 206.B 

Lagrange (of describing the motion of a fluid) 
205.A 

Lagrange (of indeterminate coefficients) 106.L 

Lagrange, of variation of constants 252.D 

Lagrange, of variation of parameters 252.D 

Lagrange-Charpit 322.B, App. A, Table 15.II 

of Lagrange multipliers 106.L 

Lanczos 298.D 301.N 

Laplace 30.B 

of least squares (for estimation) 403.E 

of least squares (for numerical solution of linear 
equations) 397.J 
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of least squares (for numerical solution of 
ordinary differential equations) 303.1 

Lebesgue, of summation 379.S 

Lehmer 301.K 

Lighthill 25.B 

limited information maximum likelihood 
128.C 

of linearization 290.D 

linear k-step 303.E 

linear multistep 303.E 

of majorants 316.G 

of matched asymptotic expansions 25.B 

Mathieu 268.C 

maximum likelihood 399.M 

middle-square 354.B 

Milne 303.E 

modified minimum chi-square 400.K 

moment 399.L 

Monte Carlo 385.C 

of moving frames 110.A 

of multiple scales 290.E 

multistep 303.E 

multivalue 303.E 

Newton-Raphson 301.D 

nonparametric 371.A 

Nórlund, of summation 379.Q 

of orthogonal projection 323.G 

(p + 1)-stage 303.D 

penalty 292.E 

Perron (in Dirichlet problem) 120.C 

perturbation 25.B 

Poincaré 25.B 

Poincaré-Lighthill-Kuo (P.L.K.) 25.B 

polynomial extrapolation 303.F 

power 298.C 

predictor-corrector (PC) 303.E 

projective approximation 304.B 

QR 298.F 

of quadrature 313.D 

QZ 298.G 

rational extrapolation 303.F 

Rayleigh-Ritz 46.F 271.G 

renormalization 361.A 

Richardson 302.C 

Riemann, of summation 379.8 

Riesz, of summation of the kth order 379.R 

Ritz 46.F 303.1 304.B 

robust 371.A 

Rosen gradient projection 292.E 

Runge-Kutta 303.D 

Runge-Kutta-Gill 303.D 

saddle point 25.C 

scaling 346.E 

scoring 397.M 

simplex 255.C 

spectral 304.B 

stationary phase 30.B 

of steepest descent 25.C 

step by step 163.D 

Strum 301.C 

of successive approximation (for an elliptic par- 
tial differential equation) 323.D 

of successive approximation (for Fredholm 
integral equations of the second kind) 217.D 

of successive approximation (for ordinary 
differential equations) 316.D 

of successive iteration (for Fredholm integral 
equations of the second kind) 217.D 

summable by Abel’s 379.N 

summable by Borel’s exponential 379.0 
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summable by Borel's integral 379.0 
summable by Cesaro’s, of order oa 379.M 
summable by Euler's 379.P 
summable by Hólder's, of order p 379.M 
summable by Norlund’s 379.Q 
summable by M. Riesz's, of order k 379.R 
ofsummation 379.L 
Sylvester elimination 369.E 
three-stage least squares 128.C 
threshold Jacobi 298.B 
two-phase simplex 255.C 
two-stage least squares 128.C 
variational 438.B 
of variation of constants 55.B 252I 
of variation of parameters App. A, Table 14.I 
WKB 25.B 
WKBJ 25.B 
metric 273.B 
Bergman 188.G 
Einstein 364.1 
Einstein-Kahler 232.C 
f- (Det 
Iw: (136.F 
Finsler 152.A 
Hermitian 232A 
Hodge 232.D 
Kahler 232.A 
Kerr 359.E 
left invariant (in a topological group) 423.1 
Petersson 32.B 
Poincaré 74.G 
pseudo- 273.B 
pseudo-Riemannian 105.P 
Riemannian 105.P 
Robertson-Walker 359.E 
standard Kahler (of a complex projective space) 
232.P 
Teichmiiller 416 


metrically isomorphic automorphisms (on a measure 


space) 136.E 
metric comparison theorem 178.A 
metric connection 80.K 
metric invariant (on a measure space) 136.E 
metric multidimensional scaling 346.E 
metric space(s) 273 

compact 273.F 

complete 273.J 

discrete 273.B 

indiscrete pseudo- 273.B 

induced by a mapping 273.B 

precompact 273.B 

product 273.B 

pseudo- 273.B 

separable 273.E 

totally bounded 273.B 
metric structure 

almost contact 110.E 

contact 110.E 
metric subspace 273.B 
metric topology 425.C 
metric vector space 256.H 
metrizable topological group 423.1 
metrizable topological space 273.K 
metrizable uniform space 436.F 

pseudo- 436.F 
Meusnier theorem (on surfaces) 111.H 
Meyer decomposition theorem, Doob- 262.D 
Michael theorem 425.X 
micro-analytic 125.CC 274.E 
microboundle 147.P 


Subject Index 
Minimal operator 


normal PL 147.P 
PL 147.P 
tangent PL 147.P 
microcanonical ensemble 402.D 
microdifferential equation 274.G 
microdifferential operator 274.F 
of finite order 274.F 
of infinite order 274.F 
microfunction 274.E 
holomorphic 274.F 
microlocal analysis 274 345.A 
microlocally elliptic (operator) 345.A 
microlocal operator 274.F 
micro-pseudolocal property 345.A 
middle point 7.C 
middle-square method 354.B 
midpoint 7.C 
midpoint rule 303.E 
midrange 374.G 
Mikusinski operator 306.B 
Mills equation, Yang- 80.G 
Mills field, Yang- 150.G 
Mills functional, Yang- 80.Q 
Mills G-connection, Yang- 80.Q 
Milman property, Krein- 443.H 
Milman theorem 37.G 
Krein- 424U 
Milne corrector 303.F 
Milne method 303.E 
Milne predictor 303.E 
Milne-Simpson formula 303.E 
Milnor fibering theorem  418.D 
Milnor fibration 418.D 
Milnor invariant 235.D 
Milnor monodromy 418.D 
Milnor number (in Milnor fibering theorem) 418.D 
Minakshisundaram-Pleijel asymptotic expansion 
391.B 
minimal 
(algebraic surface) 15.G 
(algebraic variety) 16.1 
(ideal) 368.F 
(immersion) 275.A 
(superharmonic function) 260.1 
(transformation) 136.H 
relatively 15.G 16.1 
minimal basis (of a principal order or an algebraic 
number field) 14.B 
minimal chain (for a transition probability) 260.F 
minimal complete class 398.B 
minimal complex 70.E 
minimal condition 
(in an ordered set) 311.C 
restricted (in a commutative ring) 284.A 
minimal diffeomorphism 126.N 
minimal element (in an ordered set) 311.B 
minimal flow 126.N 
minimal function, E. 367.E 
minimal immersion 275.A 
branched 275.B 
generalized 275.B 
minimality 16.] 
absolute 16.1 
minimally almost periodic group 18.1 
minimally elliptic singularity 418.C 
minimal model 15.G 
(for the algebra of differential forms) 114.L 
Néron (of an Abelian variety) 3.N 
relatively 15.G 
minimal operator 112.E 


Subject Index 
Minimal parabolic k-subgroup 


minimal parabolic k-subgroup 13.Q 
minimal polynomial 
(of an algebraic element) 149.E 
(of a linear transformation) 269.L 
(of a matrix) 269.F 
minimal prime divisor (of an ideal) 67.F 
minimal projection, Lie 76.B 
minimal realization 86.D 
minimal resolution 418.C 
minimal set 126.E 
minimal splitting field (of a polynomial) 149.G 
minimal submanifold 275.A 365.D 
minimal sufficient o-field | 396.E 
minimal surface 111.1 334.B 
affine 110.C 
branched 275.B 
minimal surface equation 275.A 
minimal variety 275.G 
minimax (estimator) 399.H 
minimax decision function 398.B 
minimax level æ test 400.F 
minimax principle 
(for eigenvalues of a compact operator) 68.H 
(for statistical decision problem) 398.B 
for A, 391.G 
minimax solution 398.B 
minimax theorem 173.C 
minimization problem, group 215.C 
minimizing sequence 46.E 
minimum (minima) 
ofafunction 106.L 
relative (at a point) 106.L 
successive (in a lattice) 182.C 
weak 46.C 
minimum chi-square method, modified 400.K 
minimum-cost flow problem 281.C 
minimum curvature property 223.F 
minimum element (in an ordered set) 311.B 
minimum immersion 365.0 
minimum norm property 223.F 
minimum principle 
energy 419.C 
enthalpy 419.C 
Gibbs free energy 419.C 
Helmholtz free energy 419.C 
forà, 391.G 
ofA 391.D 
minimum solution (of a scalar equation) 316.E 
minimum variance unbiased estimator, uniformly 
399.C 
Minkowski-Farkas theorem 255.B 
Minkowski-Hasse character (of a nondegenerate 
quadratic form) 348.D 
Minkowski-Hasse theorem (on quadratic forms 
over algebraic number fields) 348.G 
Minkowski-Hlawka theorem 182.D 
Minkowski inequality 211.C, App. A, Table 8 
Minkowski reduction theory (on fundamental 
regions) 122. E 
Minkowski space 258.A 
Minkowski space-time 359.B 
Minkowski theorem 182.C 
ondiscriminants 14.B 
on units 14.D 
Minlos theorem 424.T 
minor 
(ofa matrix) 103.D 
(ofa matroid) 66.H 
Fredholm's first 203.E 
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Fredholm’s rth 203.E 
principal (of a matrix) 103.D 
minorarc 4.B 
minor axis (of an ellipse) 78.C 
minor function 100.F 
minus infinity 87.D 
minute (an angle) 139.D 
Mittag-Leffler theorem 272.A 
mixed Abelian group 2.A 
mixed area (of two ovals) 89.D 
mixed group 190.P 
mixed Hodge structure 16.V 
mixed ideal 284.D 
mixed initial-boundary value problem (for hyper- 
bolic operator) 325.K 
mixed insurance 214.B 
mixed integer programming problem 215.A 
mixed model 102.A 
mixed periodic continued fraction 83.C 
mixed problem 322.D 
mixed spinor rank (k,n) 258.B 
mixed strategy 173.C 
mixed tensor 256.J 
mixed type, partial differential equation of 304.C 
326.A 
mixing (automorphism) 
k-fold 136.E 
strongly 136.E 
weakly 136.E 
mixture 351.B 
Mizohata equation, Lewy- 274.G 
ML estimator 399.M 
mobility, axiom of free (in Euclidean geometry) 
139.B 
Möbius band 410.B 
Mobius function 66.C 295.C 
Möbius geometry 76.A 
Möbius strip 410.B 
Möbius transformation 74.E 76.A 
Möbius transformation group 76.A 
mod 1, real number 355.D 
mod p (modulo p) 
Hopf invariant 202.8 
isomorphism (in a class of Abelian groups) 
202.N 
modal logic 411.L 
modal proposition 411.L 
modal unbiased estimator 399.C 
mode 396.C 397.C 
sample 396.C 
model 
(of an algebraic function field) 9.D 
(of a mathematical structure) 409.B 
(ofasymboliclogic) 276.D 411.G 
Bayesian 403.G 
Bradley-Terry 346.C 
Bush-Mosteller 346.G 
canonical 251.N 
component 403.F 
components-of-variance 403.C 
countable (of axiomatic set theory) 156.E 
derived normal (of a variety) 16.F 
dual resonance 132.C 
Estes stimulus-sampling 346.G 
factor analysis 403.C 
fixed effect 102.A 
functional 251.N 
game theoretic 307.C 
Glashow-Weinberg-Salam 132.D 
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of human death and survival 214.A 
Ising 340.B 402.G 
Klein (of non-Euclidean geometry) 285.C 
learning 346.G 
linear 403.D 
linear logistic 403.C 
log linear 403.C 
Luce $- 346.G 
mathematical, in biology 263 
metabolic (in catastrophe theory) 51.F 
minimal 15.G 
minimal (for the algebra of differential forms) 
114.L 
mixed 102.A 
multiple 403.E 
multivariate linear 280.B 
natural (in axiomatic set theory) 33.C 
normal linear 403.C 
Poincaré (of geometry) 285.D 
queuing 260.H 
random-effects 102.A 403.C 
relatively minimal 15.G 
Sakata 132.D 
Scheffé 346.C 
simple 403.F 
spin-flip 340.C 
static (in catastrophe theory) 51.B 
stochastic Ising 340.C 
stochastic programming 307.C 
string 132.C 
Sz-Nagy-Foiag 251.N 
Thurstone-Mosteller 346.C 
Veneziano 132.C 386.C 
Whittaker 450.0 
model experimentation 385.A 
modeling, mathematical 300 
model scheduling 307.C 
model selection 401.D 
model theory 276 
modification 
(of a stochastic process) 407.A 
holomorphic (of an analytic space) 23.D 
proper (of an analytic space) 23.D 
spherical 114.F 
modified Bessel functions App. A, Table 19.IV 
modified Fourier hyperfunction 125.BB 
modified indicator function 341.C 
modified Mathieu differential equation 268.A 
modified Mathieu function 268.A 
of the first kind 268.D 
of the second kind 268.D 
of the third kind 268.D 
modified minimum chi-square method 400.K 
modified wave operator 375.B 
modular, weakly (in quantum mechanics) 351.L 
modular automorphism 308.H 
modular character (of a modular representation) 
362.1 
modular form 
Hilbert, of dimension —k |. 32.G 
Hilbert, of weight k 32.G 
of level N  32.C 
Siegel, of dimension —k 32.F 
Siegel, of weight k 32.F 
modular function 
(of a locally compact group) 225.D 
Hilbert 32.G 
o level N 32.C 
Siegel, of degreen 32.F 


Subject Index 
Modulus (moduli) 


modular group 122.D 

elliptic 122.D 

Hilbert 32.G 

Siegel, of degreen 32.F 
modular lattice 243.F 
modular law (in a lattice) 243.F 
modular operator 308.H 
modular represenation (of a finite group) 362.G 
modular surface, Hilbert 15.H 
module(s) 277 

(of a family of curves) 143.A 

A- 277.C 

over A 277.C 

of A-homomorphisms (between A-modules) 

2TT.E 

Artinian 277.1 

of boundaries | 200.C 

category of left (right) R- 52.B 

character (of an algebraic group) 13.D 

of coboundaries 200.F 

ofcocycles 200.F 

coefficient 200.L 

cohomology 200.F 

connected graded 203.B 

of cycles 200.C 

defining (of a linear system) 16.N 

degenerate 118.D 

divisible A- 277.D 

dual 277.K 

dual graded 203.B 

duality theorem for Q- 422.L 

factor A- 277.C 

faithfully flat A- 277.K 

of finite length 277.1 

flat A- 277.K 

free 277.G 

generalized 143.B 

graded A- 200.B 

homology 200.C 

of homomorphisms (between two modules) 

277.B 

induced 277.L 

injective A- 277.K 

Jordan 231.C 


left A- 277.D 
Noetherian 277.1 
0-, 3831 


with operator domain A 277.C 
projective A- 277.K 
of quotients of an R-module with respect to S 
67.G 
(R, S)-injective 200.K 
(R, S)-projective 200.K 
representation (of a linear representation) 
362.C 
of representations (of a compact group) 69.D 
right A- 277.D 
torsion A- 277.D 
moduli functor 16.W 
moduli scheme 16.W 
coarse 16.W 
fine 16.W 
moduli space 16.W 72.G 
of curves of genusg 9J 
local 72.G 
modulus (moduli) 
(of a complex number) 74.B 
(of a complex torus of dimension 1) 32.C 
(=a conformal] invariant) 11.B 77.E 


Subject Index 
Modulus number 


(ofa congruence) 297.G 
(of an elliptic integral) 134.A, App. A, Table 
16.I 
(in Jacobi elliptic functions) 134J, App. A, 
Table 16.III 
(of a locally multivalent function) 438.E 
(ofa ring) 77.E 
complementary (of an elliptic integral) App. A, 
Table 16.I 
complementary (in Jacobi elliptic functions) 
134.J, App. A, Table 16.III 
of continuity (of a function) 84.A 
of continuity of kth order (of a continuous 
function) 336.C 
of elasticity in shear 271.G 
of elasticity in tension 271.G 
field of 73.B 
local maximum, principle 164.C 
maximum, principle (for a holomorphic func- 
tion) 43.B 
periodicity (of an elliptic integral) 134.A 
of rigidity 271.G 
Youngs 271.G 
modulus number 418.E 
modus ponens 411.I 
Moishezon criterion, Nakai- (of ampleness) 16.E 
Moishezon space 16.W 
mole numbers 419.A 
moment 397.C 
absolute (kth) 341.B 
bivariate 397.H 
central 397.C 
conditonal 397.I 
factorial 397.G 
of inertia 271.E 
(kth) 341.B 
about the mean (kth) 341.B 
population (of order k) 396.C 
principal, of inertia 271.E 
sample (of order k) 396.C 
moment generating function 177.A 341.C 397.GJJ 
moment matrix 341.B 
moment method 399.L 
moment method estimator 399.L 
moment problem 
Hamburger 240.K 
Hausdorff 240.K 
Stieltjes 240.K 
momentum 271.A,E 
angular 271.E 
generalized 271.F 
integrals of angular 420.A 
intrinsic angular 351.G 
orbital angular 351.E 
theorem of 271.E 
theorem of angular 271.E 
momentum density, angular 150.B 
momentum 4-vector, energy- 258.C 
momentum operator 
angular 258.D 
energy- 258. D 
momentum phase space 126.L 
momentum representation 351.C 
momentum tensor 
angular 258.D 
energy- 150.D 359.D 
monad 
(in homology theory) 200.Q 
(in nonstandard analysis) 293.D 
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Monge-Ampére equations 278, App. A, Table 
15.III 
Monge differential equation 324.F 
monic polynomial 337.A 
monoclinic system 92.E 
monodromy group 
(of an n-fold covering) 91.A 
(of a system of linear ordinary differential equa- 
tions) 253.B 
Milnor 418.D 
total 418.F 
monodromy matrix 254.B 
monodromy theorem (on analytic continuation) 
198.J 
monogenic function 
in the sense of E. Borel 198.Q 
in the sense of Cauchy 198.Q 
monoid, unitary 409.C 
monoidal transformation 
(of an analytic space) 23.D 
(of a complex manifold) 172.H 
(by an ideal sheaf) 16.K 
with center W 16.K 
real 274E 
monomial 337.B 
(module) 277.D 
admissible (in Steenrod algebra) 64.B 
monomial representation (of a finite group) 362.G 
monomorphism (in a category) 52.D 
monothetic group 136.D 
monotone 
(curve) 281.B 
operator 212.C 
monotone class 270.B 
monotone class theorem 270.B 
monotone decreasing 
(set function) 380.B 
matrix, of order m 212.C 
monotone decreasing function 166.A 
strictly 166.A 
monotone function 166.A 
strictly 166.A 
strictly (of ordinal numbers) 312.C 
monotone increasing 
(set function) 380.B 
matrix, of order m 212.C 
monotone increasing function 166.A 
strictly 166.A 
monotone likelihood ratio 374J 
monotonely very weak Bernoulli 136.F 
monotone mapping 311.E 
monotone operator (in a Hilbert space) 286.C 
monotone sequence (of real numbers) 87.B 
monotonically decreasing (sequence of real numbers) 
87.B 
monotonically increasing (sequence of real numbers) 
87.B 
monotonic function, completely 240.E,K 
Monte Carlo method 385.C 
Montel space 424.0 
Montel theorem 435.E 
moon argument, behind-the- 351.K 
Moore-Smith convergence 87.H 
Moore space 273.K 425.AA 
Moore space problem, normal 425.AA 
Mordell conjecture 118.E 
Mordell-Weil theorem  118.E 
weak 118.E 


more informative (experiment) 398.G 


2051 


Morera theorem 198.A 
Morgenstern solution, von Neumann- 173.D 
morphism 

(ina category) 52.A 

(of chain complexes) 200.H 

(of complexes) 13.R 

(of filtered modules) 200.J 

(of inductive systems) 210.D 

(of unfoldings) 51.D 

affine 16D 

connecting 200.H 

diagonal (in a category) 52.E 


étale 16.F 
finite 16.D 
flat 16.D 


faithfully flat 16.D 
Frobenius 450.P 
functorial 52J 
inverse 52.D 
k- (between algebraic groups) 13.A 
projective 16.E 
proper (between schemes) 16.D 
quasiprojective 16.E 
S- 52.G 
of schemes 16.D 
separated 16.D 
shape 382.A 
smooth 16.F 
strict (between topological groups) 423.J 
structure 52.G 
Morse function 279.B 
Morse index theorem 279.F 
Morse inequalities 279.D 
Morse lemma 279.B 
Morse-Smale diffeomorphism 126.J 
Morse-Smale flow 126.J 
Morse-Smale vector field 126.3 
Morse theory 279 
fundamental theorems of 279.D 
Morsification 418.F 
Moser implicit function theorem, Nash- 286.J 
Mosteller model 
Bush- 346.G 
Thurstone- 346.C 
most powerful (test) 400.A 
most probable cause 401.E 
most probable value 401.E 
most stringent level x test 400.F 
motion(s) 
(in dynamical system) 126.B 
(Euclidean) 139.B 
Brownian 5.D 45 342.A 
Brownian (d-dimensional) 45.C 
Brownian, on Lie groups 406.G 


Brownian, with an N-dimensional time para- 


meter 45. 
central 126.E 
elliptic 55.A 
equation of (of a fluid) 205.A 


equation of (of a particle in a gravitation field) 


359.D 


equations of (in Newtonian mechanics) 271.A 


Euler equation of (of a perfect fluid) 205.B 
(4]-Brownian 45.B 406.B 

group of (in Euclidean geometry) 139.B 
group of, in the wider sense 139.B 
Heisenberg equation of 351.D 

hyperbolic 420.D 

hyperbolic-elliptic 420.D 





Subject Index 
Multiple 


hyperbolic-parabolic 420.D 


infinitesimal (of a Riemannian manifold) 364.F 


Lagrange equation of 271.F 
Lagrange-stable 420.D 
law of 271A 
mean 309.B 
Newton three laws of 271.A 
Ornstein-Uhlenbeck Brownian 45.1 
oscillating 420.D 
parabolic 420.D 
parabolic-elliptic 420.D 
perpetual 402.G 
proper (in Euclidean geometry) 139.B 
proper (of a star) 392 
quasiperiodic 136.G 
right-invariant Brownian 406.G 
simple harmonic 318.B 
space-time Brownian 45.F 
Moulton method, Adams- 303.E 
movability 382.C 
movable 382.C 
k- 382.C 
movable branch point (of an algebraic differential 
equation) 288.A 
movable singularity (of an algebraic differential 
equation) 288.A 
move 173.B 
chance 173.B 
moving average 397.N 
weighted 397.N 
moving average process 421.D 
autoregressive 421.D 
autoregressive integrated 421.G 
moving average representation 
backward 395.D 
canonical backward 395.D 
moving coordinates App. A, Table 3.IV 
moving coordinate system 90.B 
moving frame(s) 90 111.C 417.B 
method of 110.A 
natural 417.B 
orthonormal 417.D 
stochastic 406.G 
multicollinearity 128.B 
multicommodity flow problem 281.F 
multidiagonal type 304.C 
multidimensional diffusion process 115.A,C 
multidimensional gamma function 374.C 
multidimensional hypergeometric distribution 
App. A, Table 22 
multidimensional normal distribution App. A, 
Table 22 
multidimensional scaling 346.E 
nonmetric 346.E 
multi-index 112.A 168.B 
multilinear form 256.H 
multilinear mapping 256.H 
alternating 256.H 
antisymmetric 256.H 
skew-symmetric 256.H 
symmetric 256.H 
multinomial distribution 341.D 
negative 341.D 
multinomial theorem 330 
multi-objective model 307.C 
multiobjective programming 264.C 
multiplanar coordinates 90.C 
multiple 297.A 
(of an element ofa ring) 67.H 


Subject Index 
Multiple complex 


(of a fractional ideal) 14.E 
common (of elements of a ring) 67.H 
least common 297.A 
least common (of elements of a ring) 67.H 
scalar (of an element of a module) 277.D 
scalar (of a linear operator) 37.C 
scalar (in a linear space) 256.A 
scalar (of a vector) 442.A 
multiple complex 200.H 
multiple correlation coefficient 397.J 
sample 280.E 
multiple covariant 226.E 
absolute 226.E 
multiple hypergeometric distribution 341.D 
multiple integral 
(in Lebesgue integral) 227.E 
(in Riemann integral) 216.F 
multiple mathematical inductions 294.B 
multiple model 403.F 
multiple point 
(on an arc) 93.B 
(of a plane algebraic curve) 9.B 
(ona variety) 16.F 
multiple root (of an algebraic equation) 10.B 
multiple sampling inspection 404.C 
multiple-valued (analytic function) 198.3 
multiple Wiener integral 176.1 
multiplication 
(ofan algebra) 203.F 
(of a graded algebra) 203.B 
(ina group) 190.A 
(ofan H-space) 203.D 
(of local Lie groups) 423.L 
(by a natural number) 294.B 
(inaring) 368.A 
associative (of a graded algebra) 203.B 
commutative (of a graded algebra) 203.B 
commutative law for (in a ring) 368.A 
complex 73.A 
homotopy associative 203.D 
homotopy commutative 203.D 
Pontryagin 203.D 
scalar (ina module) 277.D 
scalar (on vectors) 442.A 
symmetric 406.C 
multiplication theorem, Hadamard 339.D 
multiplicative (arithmetic function) 295.B 
multiplicative automorphic function 32.A 
multiplicative class 270.B 
multiplicative congruence 14.H 
multiplicative ergodic theorem 136.B 
multiplicative function 32.A 295.B 
multiplicative functional 
(of a Markov process) 261.E 
transformation by 261.F 
multiplicative group 149.A 190.A 
ofa field 190.B 
multiplicative Jordan decomposition (of a linear 
transformation) 269.L 
multiplicatively closed subset (of a ring) 67.C 
multiplicative valuation 439.C 
multiplicator (of a relative invariant measure) 
225.H 
multiplicity 
(of a covering surface) 367.B 
(of an eigenvalue for an integral equation) 
217.F 
(of an eigenvalue of a matrix) 390.B 
(of a Gaussian process) 176.E 
(ofa local ring) 284.D 


(of a representation) 362.D 
(of a root of an algebraic equation) 10.B 
(ofa weight) 248.W 
algebraic (of an eigenvalue) 390.B 
geometric (of an eigenvalue) 390.A 
intersection (of two subvarieties) 16.Q 
representation without 437.G 
setof 159] 
multiplicity function (of a mapping) 246.G 
multiplier 
(ofa group) 362J 
(of a semi-invariant) 226.A 
characteristic (of a closed orbit) 126.G 
characteristic (of a periodic linear system) 
163.F 
Jacobi'slast App. A, Table 14.I 
Lagrange 46.B 
method of Lagrange 106.L 
Stokes 254.D 
multiplier algebra 36.K 
multiply connected domain 333.A 
multiply transitive (permutation group) 
multipolar coordinates 90.C 
multiprocessor scheduling problem 376 
multistage allocation process 127.A 
multistage choice process 127.A 
multistage game 173.C 
multistage programming 264.C 
multistage sampling 373.E 
multistep method 303.E 
linear 303.E 
multitype Galton-Watson process 44.C 
multitype Markov branching process 44.E 
multivalent function 438.E 
multivalued function 165.B 
multivalue method 303.E 
multivariate (data) 397.A 
multivariate analysis 280 
of variance 280.B 
multivariate linear model 280.B 
multivariate normal distribution 397.J 
Muntz theorem (on polynomial approximation) 
336.A 
mutual energy 338.B 
mutual information 213.E 


151.H 
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mutually associated diagrams (for O(n) diagrams) 


60.J 
mutually disjoint family (of sets) 381.D 
mutually noncomparable (summations) 379.L 
mutually orthogonal (latin squares) 241.B 
M.V.W.B. (=monotonely very weak Bernoulli) 
136.F 


N 


N (natural numbers) 294.A,B 
NP 71.E 
n-ary predicate 411.G 
n-ary relation 411.G 
n-ball 140 
open 140 
n-body problem 420.A 
n-cell 70.D 140 
open 70.D 
topological 140 
n-classifying space (of a topological group) 
147.G 
n-cochain (for an associative algebra) 200.L 
n-connected 
(pair of topological spaces) 202.L 
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(space) 79.C 202.L 
locally 79.C 
n-connective fiber space 148.D 
n-cube, unit 140 
n-cylinder set 270.H 
n-decision problem 398.A 
n degrees of freedom (sampling distribution) 
374.B,C 
n-dimensional (normal space) 117.B 
n-dimensional distribution 342.C 
n-dimensional distibution function 342.C 
n-dimensional Euclidean geometry 139.B 181 
n-dimensional] probability distribution 342.C 
n-dimensional random variable 342.C 
n-dimensional sample space 396.B 
n-dimensional statistic 396.B 
n-disk 140 
open 140 
n-element 140 
n-fold covering 91.A 
n-fold reduced suspension (of a topological space) 
202.F 
n-gon, regular 357.A 
n-gonal number 296.A 
n-particle subspace 377.A 
n-person game 173.B~D 
n-ply connected (plane domain) 333.A 
n-section (in a cell complex) 70.D 
n-sheeted (covering surface) 367.B 
n-simple 
(pair of topological spaces) 202.L 
(space) 202.L 
n-simplex 
(in a Euclidean simplicial complex) 70.B 
(in a semisimplicial complex) 70.E 
(in a simplicial complex) 70.C 
n-sphere 140 
open 140 
solid 140 
n-sphere bundle 147.K 
n-times continuously differentiable (function) 
106.K 
n-times differentiable (function) 106.D 
n-torus 422.E 
n-tuple 256.A 381.B 
n-universal bundle 147.G 
n-valued (analytic function) 198.J 
(n + 2)-hyperspherical coordinates 79.A 90.B 
N-ple Markov Gaussian process 176.E 
in the restricted sense 176.F 
nth approximation (of an n-times differentiable 
function) 106.E 
nth convergent (of an infinite continued fraction) 
83.A 
nth derivative (of a differentiable function) 
106.D 
nth derived function 106.D 
nth differential (of a differentiable function) 
106.D 
nth partial quotient 83.A 
nth order, differential of 106.D 
nth order partial derivatives 106.H 
nthterm 165.D 
nabla 442.D, App. A, Table 3.II 
Nachbin-Goodner-Kelley theorem 37.M 
Nagumo theorem, Kneser- 316.E 
Naimark theorem, Gel’fand- 36.G 
Nakai-Moishezon criterion (of ampleness) 16.E 
Nakanishi equation, Landau- 146.C 
Nakanishi variety, Landau-  146.C 386.C 


Subject Index 
Negative resistance 


Nakano- Nishijima-Gell-Mann formula 132.A 
Nakayama lemma 67.D 
Nambu-Goldstone boson 132.C 
Napier analogies App. A, Table 2.III 
Napierian logarithm 131.D 
Napier number 131.D 
Napier rule App. A, Table 2.II 
Nash bargaining solution 173.C 
Nash equilibrium 173.C 
Nash-Moser implicit function theorem 286.J 
nat 213.B 
natural additive functional 261.E 
natural boundary 
(of an analytic function) 198.N 
(of a diffusion process) 115.B 
natural equation 
(ofacurve) 111.D 
(of a surface) 110.A 
natural equivalence 52.J 
natural extension (of an endomorphism) 136.E 
natural geometry 110.A 
natural injection (from a subgroup) 190.D 
naturality (of a homotopy operation) 202.0 
natural logarithm 131.D 
natural model (in axiomatic set theory) 33.C 
natural moving frame 417.B 
natural number 294.A,B 
nonstandard 276.E 
Skolem theorem on impossibility of 156.E 
natural positive cone 308.K 
natural scale 260.G 
natural spline 223.F 
natural surjection (to a factor group) 190.D 
natural transformation 52.J 
Navier-Stokes equation(s) 204.B 205.C 
general 204.F 
Navier-Stokes initial value problem 204.B 
nearly Borel measurable set 261.D 
nearly everywhere (in potential theory) 338.F 
necessary (statistic) 396.E 
necessity 411.L 
necklace, Antoine’s 65.G 
negation (of a proposition) 411.B 
negative 
(complex) 200.H 
(element of a lattice-ordered group) 243.G 
(element of an ordered field) 149.N 
(rational number) 294.D 
negative binomial distribution 341.D 397.F, App. 
A, Table 22 
negative curvature 178.H 
negative definite (function) 394.C 
negative definite Hermitian form 348.F 
negative definite quadratic form 348.B 
negative half-trajectory 126.D 
negative infinity 87.D 355.C 
negative limit point 126.D 
negatively invariant 126.D 
negatively Lagrange stable 126.E 
negatively Poisson stable 126.E 
negative multinomial distribution 341.D 
negative number 355.A 
negative orientation (of an oriented C'-manifold) 


105.F 

negative part (of an element of a vector lattice) 
310.B 

negative polynominal distribution App. A, Table 
22 


negative prolongational limit set, first 126.D 
negative resistance 318.B 


Subject Index 
Negative root 


negative root (of a semisimple Lie algebra) 248.M 
negative semidefinite quadratic form 348.C 
negative semiorbit 126.D 
negative variation 
(ofa mapping) 246.H 
(of a real bounded function) 166.B 
neighborhood 425.B 
analytic (of a function element in the wider 
sense) 198.0 
analytic (ina Riemann surface) 367.A 
conoidal 274.D 
convex 364.C 
coordinate (in a fiber bundle) 147.B 
coordinate (in a topological manifold) 105.C 
coordinate, of class C" 105.D 
derived 65.C 
e- (of a point) 273.C 
étale 16.AA 
fundamental system of 425.E 
open 425.E 
open tubular 114.B 
regular 65.C 
regular, theorem 65.C 
relative 425.J 
tubular 105.L 114.B 364.C 
neighborhood deformation retract 202.D 
neighborhood retract 202.D 
absolute 202.D 
fundamental absolute (FANR) 382.C 
neighborhood system 425.B 
base for 425.E 
uniform 436.D 
uniform family of 436.D 
Nelson formula, Feynman-Kac- 150.F 
Nelson symmetry 150.F 
Nernst postulate 419.A 
Néron minimal model (of an Abelian variety) 
3.N 
Néron-Severi group 
(ofa surface) 15.D 
(ofa variety) 16.P 
nerve (ofa covering) 70.C 
nested intervals, principle of (for real numbers) 
87.C 355.B 
net (inaset) 87.H 
Cauchy (in a uniform space) 436.G 
square 304.E 
universal (in a set) 87.H 
net premium 214A 
network(s) 282 425.F 
bilateral 282.C 
capacitated 281.C 
contact 282.B 
electric 282.B 
linear 282.C 
M-port | 282.C 
passive 282.C 
reciprocal 282.C 
time-invariant 282.C 
two-terminal 281.C 
network flow model 307.C 
network flow problem 281 282.B 
network programming 264.C 
network scheduling 307.C 
Neumann function 39.B 188.H, App. A, Table 
19.III 
Neumann polynomial App. A, Table 19.IV 
Neumann problem 193.F 323.F 
Neumann series 217.D 
neutral element (in a lattice) 243.E 
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neutral type (of functional differential equation) 
163.B 
Nevanlinna exceptional value 272.E 
Nevanlinna first fundamental theorem 272.B 
Navanlinna second fundamental theorem 272.E 
Nevanlinna theory 
(of meromorphic functions) 124.B 272.B 
(for several complex variables) 21.N 
Newton, I. 329 
Newton backward interpolation formula  223.C 
Newton boundary 418.D 
nondegenerate 418.D 
Newton-Cotes formula (in numerical integration) 
299.A 
Newton diagram 254.D 
Newton first law 271.A 
Newton formula (on interpolation) App. A, Table 
21 
Newton formula (on symmetric functions) 337.1 
Newton forward interpolation formula 223.C 
Newtonian capacity 48.B 
Newtonian exterior capacity A8. H 
Newtonian fluid 205.C 
Newtonian inner capacity 48.F 
Newtonian interior capacity 48.F 
Newtonian mechanics 271.A 
Newtonian outer capacity 48.H 
Newtonian potential 271.C 338.A 
Newton interpolation formula App. A, Table 21 
Newton interpolation polynomial 336.G 
Newton iterative process 301.D 
Newton law (on frictional stresses) 205.C 
Newton law of universal gravitation 271.B 
Newton-Raphson method 301.D 
Newton second law 271.A 
Newton third law 271.A 
Newton three laws of motion 271.A 
Neyman factorization theorem 396.F 
Neyman-Pearson lemma 400.B 
Neyman structure 400.D 
nice function (on a C?-manifold) 114.F 
Nicholson formula App A, Table 19.IV 
Nicholson formula, Watson- App. A, Table 19.III 
Nickel method, Dejon- 301.G 
Nicomedes conchoid 93.H 
Nijenhuis tensor 72.B 
Nikodým derivative, Radon- 270.L 380.C 
Nikodým property, Radon- 443.H 
Nikodým theorem, Radon- 270.L 380.C 
Nikodym theorem for vector measures, Radon- 
443.H 
nilalgebra 231.A 
nilmanifold 178.D 
nilpotent 
(Lie algebra) 248.C 
(Lie group) 249.D 
(subset of a ring) 368.B 
(zero-divisor) 284.A 
generalized (linear operator) 251.F 
nilpotent algebraic group 13.F 
nilpotent component (of a linear transformation) 
269.L 
nilpotent element 
(ofa ring) 368.B 
generalized (in a Banach algebra) 36.E 
nilpotent group | 190 
finite 151.C 
generalized 190.K 
nilpotent ideal 
(of a Lie algebra) 248.C 
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largest (of a Lie algebra) 248.D 
nilpotent matrix 269.F 
nilpotent radical (of a Lie algebra) 248.D 
nilradical 
(of a commutative ring) 67.B 
(ofaring) 368.H 
9j symbol 353.C 
Nirenberg space, John- (= BMO) 168.B 
Nishijima-Gell-Mann formula, Nakano- 132.A 
Nishina formula, Klein- 351.G 
Nitsche formula, Gauss-Bonnet-Sasaki- 275.C 
niveau surface 193.J 
Nóbeling embedding theorem, Menger- 117.D 
no cycle condition 126.J 
nodalcurve 391.H 
nodal domain 391.H 
nodal point 304.C 
nodal set 391.H 
node 
(ofacurve) 93.G 
(ofa graph) 186.B 282.A 
(of a plane algebraic curve) 9.B 
completion 281.D 
Start 281.D 
Noetherian domain 284.A 
Noetherian integral domain 284.A 
Noetherian local ring 284.D 
Noetherian module 277.1 
Noetherian ring(s) 284.A 
left 368.F 
right 368.F 
Noetherian scheme 16.D 
locally 16.D - 
Noetherian semilocal ring 284.D 
Noether number, Brill- 9.E 
Noether theorem 150.B 


noise 
thermal 402.K 
white 176.D 


noisy channel 213.A 
nomograms 19.A,D 
non-Abelian cohomology 200.M 
nonadaptive scheme 299.C 
nonanticipative 406.D 
non-Archimedean geometry 155.D 
non-Archimedean valuation 14.F 439.C 
nonassociative algebra 231.A 
nonatomic 168.C 443.G 
non-Bayesian approach 401.B 
noncentral (quadric hypersurface) 7.F 350.G 
noncentral chi-square distribution 374.B 
noncentral F-distribution 374.B 
noncentral Hotelling T? statistic 374.C 
noncentrality (sampling distribution) 374.B,C 
noncentrality matrix 374.C 
noncentral t-distribution 374.B 
noncentral Wishart distribution 374.C 

p-dimensional 374.C 
noncommutative field 149.A 
noncompact real simple Lie algebra App. A, 

Table SI 
noncompact type (symmetric Riemannian homoge- 

neous space) 412.D 

noncomparable, mutually 379.L 
nonconforming type 304.C 
nonconvex quadratic programming 264.D 
noncooperative (game) 173.A 
nondecreasing function 166.A 
nondegenerate 

(analytic mapping) 23.C 


Subject Index 
Nonstandard 


(bilinear form) 256.H 
(critical point) 106.L 279.B 286.N 
(function on a Hilbert manifold) 279.E 
(quadratic form) 348.A 
(representation) 437.N 
(sesquilinear form) 256.Q 
(theta-function) 3.1 
nondegenerate critical manifold 279.D,E 
nondegenerate divisor 3.D 16.N 
nondegenerate hypersurface 344.A 
nondegenerate Newton boundary 418.D 
non-Desarguesian geometry 155.E 343.C 
nondeterministic 
(Turing machine) 31.B 
purely (weakly stationary process) 395.D 
nondeterministic linear bounded automaton 31.D 
nonelementary (Kleinian group) 234.A 
non-Euclidean angle (in a Klein model) 285.C 
non-Euclidean distance 285.C 
non-Euclidean geometry 285 
non-Euclidean hypersphere 285.C 
non-Euclidean space 285.A 
nonexpansive mapping 286.B 
nonexpansive operator 37.C 
nonhomogeneous difference equation 104.C 
nonhomogeneous n-chain (for a group) 200.M 
nonincreasing function 166.A 
nonlinear differential equation 291.D 
nonlinear filter 405.F,H 
nonlinear functional analysis 286 
nonlinear integral equation 217.M 
nonlinear lattice dynamics 287 
nonlinear mechanics 290.A 
nonlinear ordinary differential equations 313.A 
(global theory) 288 
(local theory) 289 
nonlinear oscillation 290 
nonlinear partial differential equations 320.A 


` nonlinear problems 291 


nonlinear programming 264.C 
nonlinear semigroup 88.E 378.F 

of operators 286.X 
nonmeager set 425.N 
nonmetric MDS 346.E 
nonnegative (matrix) 269.N 
nonnegative terms, series of 379.B 
non-Newtonian fluid 205.C 
nonparametric method 371 
nonparametric test 371.A 
nonpositive curvature 178.H 

G-space with 178.H 
nonprimitive character 450.C,E 
nonrandomized (decision function) 398.A 
nonrandomized estimate 399.B 
nonrandomized test 400.A 
nonrecurrent (chain) 260.B 
nonrecurrent (transient) 260.B 
nonresidue, quadratic 297.H 
nonsaddle set 126.E 
nonsingular (flow) 126.G 

(point fora flow) 126.D 

(point of a variety) 16.F 
nonsingular mapping of class C! 208.B 
nonsingular matrix 269.B 
nonsingular transformation 

(of a linear space) 256.B 

(on a measure space) 136.B 
nonsingular variety 16.F 
nonstandard 33.B 

(element) 293.B 


Subject Index 
Nonstandard analysis 


nonstandard analysis 293 
nonstandard natural number 276.E 
nonstandard real number 276.E 
nonstandard set theory 293.E 
nonstationary oscillations 290.F 
nonsymmetric unified field theory 343.C 
nontangential maximal function 168.B 
nontangential path 333.B 
nontrivial (3-manifold) 65.E 
nontrivially (to act on a G-space) 431.A 
nonwandering 126.E 
set 126.E 
Nórlund method of summation 379.Q 
norm 
(of an algebraic element) 149.J 
(of an element of a general Cayley algebra) 54 
(of an element of a quaternion algebra) 29.D 
(of an operator) 37.C 
(of a separable algebraic clement) 149.J 
(ofa vector) 37.B 
absolute (of an integral ideal) 14.C 
C*-cross 36.H 
C- 126H 
graph 251.D 
Hilbert-Schmidt 68.1 
minimum, property 223.F 
nuclear 68.K 
pseudo- (on a topological linear space) 424.F 
reduced (of an algebra) 362.E 
relative (of a fractional ideal) 14.1 
semi- (on a topological linear space) 424.F 
spinorial 61.D 
supremum 168.B 
trace 68.1 
uniform 168.B 
normal 62.C 110.E 354.F 
(almost contact structure) 110.E 
(analytic space) 23.D 
(current) 275.G 
(fundamental region) 122.B 
(*-isomorphism) 308.C 
(state) 351.B 
(fora valuation) 439.H 
(weight on a von Neumann algebra) 308.D 
affine 110.C 
affine principal 110.C 
analytically 284.D 
principal 111.F 
normal algebraic variety 16.F 
normal analytic structure 386.C 
normal basis 172.E 
normal block bundle 147.Q 
normal bundle 
(ofa foliation) 154.B,E 
(of an immersion) 114.B 
(of a submanifold) 105.L 274.E 364.C 
normal Cartan connection 80.N 
normal chain 
(ina group) 190.G 
(in a Markov chain) 260.D 
normal commutation relation 150.D 
normal connection 365.C 
normal contact Riemannian manifold 110.E 
normal continued fraction 83.E 
normal coordinate(s) 90.C 
mapping 364.C 
normal covering 425.R 
normal crossings 16.L 
only 16.L 
normal curvature (of a surface) 111.H 
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normal density function 397.D 
normal derivative 106.G 
normal distribution 341.D 397.D, App. A, Table 22 
k-dimensional 341.D, App. A, Table 22 
logarithmic App. A, Table 22 
multidimensional App. A, Table 22 
standard 341.D 
normal duration  28.I 
normal equation 
(in the method of least squares) 302.E 403.E 
(in statistical data analysis) 397] 
normal estimator, best asymptotically 399.K 
normal estimator, consistent and asymptotically 
399.K 
normal extension 149.G 251.K 
normal extension field, strongly 113 
normal family 435.E 
normal fiber space, Spivak 144.J 
normal form 
(of differential equations) 313.B 324.E 
(ofa surface) 410.B 
Cantor (for an ordinal number) 312.C 
Hesse (ofa hyperplane) 139.H 
Jordan (for a matrix) 269.G 
n-adic (for an ordinal number) 312.C 
prenex (in predicate logic) 411.J 
normal form theorem, Kleene 356.C 
normal frame 110.B 
normal function (of ordinal numbers) 312.C 
normal g-lattice 27.A 
normal invariant 114J 
normality, asymptotic 399.K 
normalization 
(of an analytic space) 23.D 
(of a variety) 16.F 
normalization theorem 
for finitely generated rings 369.D 
for polynomial rings 369.D 
normalized 
(function) 317.A 
(into an orthonormal set) 197.C 
(vector) 139.G 
normalized contrast 102.C 
normalized valuation 439.E 
normalizer 136.F 190.C 
normal j-algebra 384.C 
normal k-vector bundle 114.J 
normal line 93.G, App. A, Table 4.I 
normal linear model 403.C 
normally cobordant 114.J 
normally distributed, asymptotically 399.K 
normally flat along a subscheme (a scheme) 16.L 
normal mapping (map) 114J 
normal matrix 269.1 
normal model, derived (of a variety) 16.F 
normal Moore space problem 425.AA 
normal number 354.F 
normal operator 390.E 
(of Sario) 367.G 
normal PL microbundle 147.P 
normal plane 111.F 
normal point 16.F 23.D 
normal polygon 234.C 
normal process 176.C 
normal real form (of a complex semisimple Lie 
algebra) 248.Q 
normal representation 308.C 
normal ring Gil 
normal score test, Fisher-Yates-Terry 371.C 
normal section 410.B 
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normal sequence (of coverings) 425.R 
normal simple algebra 29.E 
normalspace 425.Q 
collectionwise 425.AA 
completely 425.Q 
fully 425.X 
hereditarily 425.Q 
perfectly 425.Q 
normal sphere bundle 274.E 
normal stress 271.G 
normal structure 276.D 
normal subgroup 190.C 
admissible 190.E 
normal system (of E-functions) 430.D 
normal transformation (of a sequence) 379.L 
normal valuation 439.E,H 
normal variety 16.F 
normal vector 105.L 111.H 364.A 
normal vector bundle 105.L 
normal vibration 318.B 
normed linear space 37.B 
normed ring 36.A 
normed space, countably 424.W 
normed vector lattice 310.F 
norm form 118.D 
normic form 118.F 
norm-residue 14.P 
norm-residue symbol 14.Q 
(in local class field theory) 257.F 
Hilbert 14.R 
Hilbert-Hasse 14.R 
norm resolvent convergence 331.C 
northern hemisphere 140 
north pole 74.D 140 
notation 
full international 92.E 
Kendall 260.H 
Schoenflies (for crystal classes) 92.F, App. B, 
Table 6.1V 
short international 92.E 
system of (for ordinal numbers) 81 
notion, common 35.A 
Novikov closed leaf theorem 154.D 
nowhere dense set 425.N 
NP 71E 
co- 7LE 
NP-complete 71.E 
NP-completeness 71.E 


NP-hard 71.E 
NP-space 71.E 
NP-time 71.E 


NR (neighborhood retract) 202.D 
nuclear (C*-algebra) 36.H 
nuclear class 68.1 
nuclear norm 68.K 
nuclear operator 68.LK 
nuclear space 424.8 
nucleolus 173.D 
null (vector in the Minkowski space-time) 359.B 
null-bicharacteristic 320.B 
null boundary, open Riemann surface of 367.E 
null cobordant 235.G 
null function 310.1 
null geodesic 399.D 
null homotopic (continuous mapping) 202.B 
null hypothesis 400.A 
nullity 
(of a critical point) 279.B 
(ofa graph) 186.G 
(ofalinear mapping) 256.F 


Subject Index 
Number(s) 


(of a linear operator) 251.D 
(of a matrix) 269.D 
column (of a matrix) 269.D 
of relative 365.D 
row (of a matrix) 269.D 
null recurrent (point) 260.D 
null sequence (in a-adic topology) 284.B 
null set 270.D 310.1381.A 
of class N,  169.E 
function-theoretic 169 
null space 251.D 
null system 343.E 


number(s) 294 


A- 430.C 

abundant 297.D 

algebraic 14.A 

amicable 297.D 

average sample 404.C 

azimuthal quantum 315.E 

Bell 177.D 

Bernoulli 177.B 

Betti 200.K 201.B 

Brill-Noether 9.E 

calculable 22.G 

Cantor's theory of real 294.E 

cardinal 49.A 312.D 

Cayley 54 

characteristic (of a compact operator) 68.1 

characteristic (of a manifold) 56.F 

Chern 56.F 

chromatic 157.E 186.1 

class (of an algebraic number field) 14.E 

class (of a Dedekind domain) 67.K 

class (of a simple algebra) 27.D 

Clifford 61.A 

coincidence (of a mapping) 153.B 

of colors 92.D 

completeness of real 294.E 

complex 74.A 294.F 

composite 297.B 

condition 302.A 

connectedness of real 294.E 355.B 

continuity of real 294.E 

cyclomatic 186.G ! 

decomposition (of a finite group) 362.I 

Dedekind's theory of real 294.E 

deficient 297.D 

of denominator 186.I 

Euler 177.C 201.B, App. B, Table 4 

Fermat 297.F 

Froude 116.B 

generalized decomposition (of a finite group) 
362.1 

geometry of 182 

Gödel 185 356.C,E 

Grashoff 116.B 

imaginary 74.A 

incidence 146.B 201.B 

of independence 186.1 

initial 312.D 

intersection (of divisors) 15.C 

intersection (of homology classes) 65.B 201.0 

intersection (of sheaves) 16.E 

irrational 294.E 355.A 

irrational real 294. E 

of irregularity (of an algebraic variety) 16.P 

Kullback-Leibler information 398.G 

Lebesgue 273.F 

Lefschetz 153.B 

Lefschetz (of a variety) 16.P 
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Number field 


linking 99.C p-adic 439.F 
Liouville 430.B p-adic 257.A 439.F 
Lyapunov characteristic 314.A relative algebraic 14.1 
Mach 116.B 205.B numbering, Gödel 185.A 
magnetic Reynolds 259 number operator 377.A 
mean (of sheets) 272.J number system, point range of 343.C 
Mersenne 297.E, App. B, Table 1 number-theoretic function(s) 295.A 356.A 
Milnor 418.D additive 295.B 
modulus 418.E completely additive 295.B 
mole 419.A completely multiplicative 295.B 
n-gonal 296.A multiplicative 295.B 
Napier 131.D number theory 296 
natural 294.A,B analytic 296.B 
negative 355.A consistency proof for pure 156.E 
negative rational 294.D elementary 297 
normal 354.F fundamental theorem of elementary 297.C 
Nusselt 116.B geometric 296.B 
orbital magnetic quantum  315.E pure 156.E 
ordinal 312.B numerals, Arabic 26 
p-adic 439.F numerator, partial (of an infinite continued 
of partitions 177.D 328 fraction) 83.A 
Péclet 116.B numerical analysis 300 
pentagonal 4.D numerical differentiation 299.E 
perfect 297.D numerical integration 299 
perfect, of the second kind 297.D numerically connected (divisor) 232.D 
Picard (of a variety) 15.D 16.P numerically equivalent (cycles) 16.Q 
Poisson 271.G numerically semipositive 15.D 
polygonal, oforderk 45 numerical method 300 
Pontryagin 56.F numerical range (of a linear operator) 251.E 
positive 355.A numerical solution 
positive rational 294.D of algebraic equations 301 
Prandtl 116.B of integral equations 217.N 
prime 297.B of linear equations 302 
principal quantum 315.E of ordinary differential equations 303 
pseudorandom 354.B of partial differential equations 304 
Pythagorean 145 numerical tensor App. A, Table 4.II 
ramification 14.K Nusselt number 116.B 
random 354 nutation 392 
rational 294,D,E Nyquist criterion 86.A 
rational real 294.E Nyquist theorem 402.K 
real 294.E 355.A,D 
real, mod 1 355.0 O 
relatively prime 297.A 
of replications 102.B ©(Q) (space of holomorphic functions in OQ) 168.B 
Reynolds 116.B 205.C €,(Q) 168.B 
rotation 99.D 111.E 126.I O(n) (orthogonal group)  60.I 
A- 430.C «o-connected space 79.C 
S*- 430.C locally 79.C 
self-intersection 15.C w-consistent (system) 156.E 
of sheets (of an analytic covering space) w-limit point 126.D 
23.E w-limit set 126.D 
of sheets (of covering surface) 367.B Q-conjugate (en 
Stiefel-Whitney 56.F Q-equivalent 126.H 
Stirling, of the second kind 66.D Q-explosion 126.3 
T- 430.C Q-group 190.E 
T*- 430.C Q-homomorphism (between Q-groups) 190.E 
Tamagawa 13.P Q-isomorphism (between Q-groups) 190.E 
transcendental 430.A Q-modules, duality theorem for 422.L 
translation 18.B,D Q-stability theorem 126.J 
of treatment combinations 102.L Q-stable, C'- 126.H 
type 314A Q-subgroup (of an O-group) 190.E 
U- 430.C o-ideal 
U*- 430.C integrated two-sided 27.A 
wave (ofa sine wave) 446 two-sided 27.A 
wave, vector (of a sine wave) 205.F o,-ideal, left 27.A 
weakly compact cardinal 33.E o,-ideal, right 27.A 
weakly inaccessible cardinal 33.E ©-differential (on an algebraic curve) 9.F 
Weil 3.C ©-genus (of an algebraic curve) 9.F 
number field 149.C O-linearly equivalent divisors (on an algebraic 


algebraic 14.B curve) 9.F 
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-specialty index (of a divisor of an algebraic 
curve) 9.F 
€-module 383.1 
(o)-convergent 87.L 
(o)-star convergent 87.L 
OA (orthogonal array) 102.L 
Ob (object) 52.A 
object 52.A 411.G 
cofinal 52.D 
final 52.D 
graded 200.B 
group (in a category) 52.M 
initial 52.D 
injective 200.1 
isomorphic 52.D 
mathematical 52.A 
in predicate logic 411.G 
projective 200.1 
quotient 52.D 
S-, category of 52.G 
oftype j-- 1 356F 
oftypeO 356. F 
zero 52.N 
object domain 411.G 
objective function ` 264.B 307.C 
objective probability 401.B 
object variable 411.G 
oblate App. A, Table 3.V 
oblique circular cone 350.B 
oblique coordinates (in a Euclidean space) 90.B 
observability Sec 
observables 351.B 
observation 
complete 405.C 
cost of 398.F 
partial 405.C 
observation process 405.F 
observation vector 102.A 
observer, Luenburger 86.E 
obstacle, Dirichlet problem with 440.B 
obstruction(s) 305 
to an n-dimensional homotopy 305.B 
to an (n + 1)-dimensional extension 305.B 
primary 147.L 305.C 
secondary 305.D 
surgery 114J 
tertiary 305.D 
obstruction class 56.E 
obstruction cocycle 147.L 305.B 
obtuse angle (in Euclidean geometry) 139.D 
OC-curve (operative characteristic curve) 404.C 
octahedral group 151.G 
octahedron 357.B 
odd element (of a Clifford algebra) 61.B 
odd function 165.B 
odd half-spinor 61.E 
odd half-spin representation 61.E 
odd permutation (in a symmetric group) 151.G 
odd ratio 397.K 
odd state 315.H 
of bounded variation 443.G 
Ohm’s law (for a moving medium)  130.B 259 
Oka's principle 147.0 
Oka's theorem 72.E 
1-complete manifold, weakly 21.L 
one cycle 16.R 
one-dimensional diffusion processes 115.A 
one-dimensional lattice 287.A 
one-dimensional probability distribution (of random 
variables) 342.C 


Subject Index 
Operation(s) 


one-dimensional statistic 396.B 
100a%-point 396.C 
1-1 (mapping) 381.C 
one-parameter group 
local (of local transformations) 105.N 
of transformations 105.N 126.B 
one-parameter semigroup of class (CH) 378.B 
one-parameter subgroup (of a Lie group) 249.Q 
one-parameter variation 178.A 
one-point compactification 425.T 
one-point union 202.F 
one-sided (surface) 410.B 
one-sided stable for exponent 1/2 App. A, Table 22 
one-sided stable process (of the exponent ai 5.F 
one-step-two-half-steps errors estimate 303.D 
one-to-one correspondence 358.B 
one-to-one mapping 381.C 
only normal crossings 16.L 
Onsager reciprocity relation 402.K 
onto mapping 381.C 
open 
(Riemann surface) 367.A 
(system) 419.A 
(topological manifold) 105.B 
finely 261.D 
Zariski 16.A 
openarc 93.B 
open ball 140 
open base 425.F 
open circle 140 
open continuous homomorphism  423J 
open covering (of a set) 425.R 
open disk 140 
open formulas 199.A 
opening 186.E 
open interval 140 355.C 
open mapping 425.G 
open mapping theorem 
(in Banach space) 371 
(in topological linear spaces) 424.X 
open n-ball 140 
open n-cell 140 
open n-disk 140 
open n-sphere 140 
open neighborhood 425.E 
open parallelotope (in an affine space) 7.D 
open set 425.B 
basic 425.F 
relative 425.J 
system of 425.B 
open simplex 7.D 70.C 
open sphere 140 
open star (ina complex) 70.B,C 
open subgroup (of a topological group) 423.D 
open surface 410.B 
open system entropy 402.G 
open tubular neighborhood  105.L 114.B 
operate 
(in a function algebra) 192.N 
from the left (on a set) 362.B 
from the right (on a set) 362.B 
operating characteristic 404.C 
operating function 192.N 
operating systems 75.C 
operation(s) 
(of an operator domain on a module) 277.C 
(on aset) 409.A 
Adams 237.E 
Bokshtein 64.B 
Boolean 42.A 


Subject Index 
Operation A (in set theory) 


cohomology 64 

compatible with 277.C 

four arithmetic 294.A 
functional cohomology  202.S 
functional ®- 202.8 

glide 92.E 

homotopy 202.0 

left 409.A 

primary cohomology 64.B 
Pontryagin (pth) power 64.B 
primitive (ofa group) 362.B 
rational 294.A 

reduced square 64.B 

right 409.A 

ring 368A 

stable cohomology 64.B 
stable primary cohomology 64.B 
stable secondary cohomology 64.C 
Steenrod (pth) power 64.B 
Steenrod square 64.B 
transitive (ofa group) 362.B 


operation A (in set theory) 22.B 
operational calculus 251.G 306, App. A, Table 12.II 
operator 


(in functional analysis) 162 251.A 

(on a set) 409.A 

Abelian 208 E 

accretive (in a Hilbert space) 286.C 

additive 251.A 

adjoint (in Banach spaces) 37.D 251.D 

adjoint (in Hilbert spaces) 251.E 

adjoint (of a linear partial differential operator) 
322.E 

adjoint (of a microdifferential operator) 274.F 

adjoint (of a microlocal operator) 274.F 

amplification (of the scheme) 304.F 

angular momentum  258.D 

annihilation 377.A 

Beltrami differential, of the first kind App. A, 
Table 4.IT 

Beltrami differential, of the second kind App. 
A, Table 4.II 

boundary 200.C 201.B 

with a boundary condition 112.F 

bounded linear 37.C 

Calderón-Zygmund singular integral 217.J 
251.0 

Cartier 9.E 

channel wave 375.F 

closable 251.D 

closed 391251.D 

closure 425.B 

coboundary 200.F 

compact 68 

completely continuous 68.B 

conjugate (in Banach spaces) 37.D 

conjugate (of a differential operator) 125.F 

conjugate (of a linear operator) 251.D 

conjugation (in function algebras) 164.K 

creation 377.A 

decomposable (on a Hilbert space) 308.G 

degeneracy (in a semisimplicial complex) 70.E 

diagonalizable (in an Abelian von Neumann 
algebra) 308.G 

differential 112 223.C 306.B 

differential, of the kth order 237.H 

differentiation 223.C 

dissipative 286.C 

domain (of an Q-group) 190.E 

domain of 409.A 


down-ladder 206.B 

dual(in Banach spaces) 37.D 

dual (of a differential operator) 125.F 
dual (of a linear operator) 251.D 
elliptic 112.A 

energy-momentum 258.D 
evolution 378.G 

exponential function of 306.C 

face (in a semisimplicial complex) 70.E 
formal adjoint 322.E 

4-momentum 258.D 

Fourier integral 274.C 

Fredholm 68.F 251.D 

fundamental 163.E 

generalized wave 375.B 

Green's 189.A,B 194.C 
Hamiltonian 351.D 

Hecke 32.D 

Hermitian 251.E 

Hilberts e- 411J 

holomorphic evolution 378.1 
identity (on a Banach space) 37.C 
incoming wave 375.B 

with index 68.F 

integral 68.N 100.E 251.O 306.B 
integral, of Hilbert-Schmidt type 68.C 
interior 425.B 

inverse 37.C 251.B 

isometric 251.E 

Laplace 323.A 442.D 
Laplace-Beltrami 194.B 

linear 251 

linear (in Banach spaces) 37.C 

linear (inlinear spaces) 256.B 

linear boundary 315.B 

linearized 286.E 

local 125.DD 

logica] 411.E 

Markov 136.B 

maximal (of a differential operator) 112.E 
maximal dissipative 251.J 
microdifferential 274.F 

microlocal 274.F 

microlocally elliptic 345.A 
Mikusinski’s 306.B 

minimal (of a differential operator) 112.E 
modified wave 375.B 

modular 308.H 

monotone (in a Hilbert space) 286.C 
nonlinear semigroups of 286.X 
nonnegative 251.E 

normal 390.E 

normal (of Sario) 367.G 

normal linear 251.E 

nuclear 68.1,K 

number 377.A 

ordinary differential 112.A 

outgoing wave 375.B 

partial differential 112.A 

positive (in vector lattices) 310.E 
positive semidefinite 251.E 
projection (in a Hilbert space) 197.E 
pseudodifferentia] 251.0 345 
pseudodifferential (in microlocal analysis) 
274.F 

resolvent (of a Markov process) 261.D 
ring of 308.C 

S- 150.D 

scalar 390.K 

scattering 375.F,H 
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Schrödinger 351.D 
self-adjoint 251.E 390.E 
shift 223.C 251.0 306.C 
spectral 390.K 
Steenrod App. A, Table 6.II 
step-down 206.B 
step-up 206.B 
strongly elliptic 112.G 323.H 
Sturm-Liouville 112.1 
of summable pth power 68.K 
symmetric 251.E 
system of differential 112.R 
T- 375.C 
TCP 150.D 
Toeplitz operator 251.0 
total boundary 200.E 
trace 168.B 
translation 306.C 
transposed 112.E 189.C 322.E 
unilateral shift 390.1 
unitary 390.E 
up-ladder 206.B 
Volterra 68.J 
wave 375.B,H 
operator algebra 308.A 
operator convex 212.C 
operator domain 277.C 
module with 277.C 
operator homomorphism 
(of A-modules) 277.E 
(of Q-groups) 190.E 
operator isomorphism 190.E 
operator monotone 212.C 
operator topology 
strong 251.C 
uniform 251.C 
weak 251.C 
operator-valued distribution 150.D 
opposite 
(simplex) 201.C 
orientation 105.F 
root 13.R 
optical axis 180.B 
optical direction cosines 180.A 
optical distance 180.A 
optical theorem 386.B 
optics, geometric 180 
optimal 
(design) 102.E 
asymptotically 354.0 
optimal control 46.D 86.B,C 405.A 
optimal control problem, time Set 


optimality 
A- 102.E 
D- 102.E 
E- 102.E 


principle of 127.A 
optimal policy 127.A 
optimal regular problem 86.F 
optimal solution 255.A 264.B 292.A 
basic 255.A 
optimal stopping 405.E 
optimization model 307.C 
optimum allocation 373.E 
optimum predictor, linear 395.D 
optional (stochastic process) 407.B 
optional c-algebra 407.B 
optional sampling 262.C 
optional sampling theorem 262.A 
orbit 


Subject Index 
Order 


(of a dynamical system) 126.B 

(of a permutation group) 151.H 

(=system of transitivity) 362.B 

(of a topological transformation group) 110.A 
431.A 

closed 126.D 

exceptional 431.C 

principal 431.C 

pseudo-,a- 126J 

pseudo-, tracing property 126.J 

singular 431.C 


orbital angular momentum 351.E 

orbital elements, Kepler’s 309.B 

orbitally stable 126.F 

orbital stability (of a solution of a differential 


equation) 394.D 


orbit determination 309.A 
orbit space (of a topological group) 431.A 
orbit type 431.A 


principal 431.C 


order 


(of an algebraic number field) 14.B 

(ofa covering) 425.R 

(of a differential equation) 313.A 320.A 

(of a differential operator) 112.A 

(of an element of a group) 190.C 

(of an elliptic function) 134.E 

(of a function defined by a Dirichlet series) 
121.C 

(of a function on an algebraic curve) 9.C 

(of a generating point of a simple maximally 
overdetermined system) 274.H 

(ofa group) 190.C 

(of a homomorphism of Abelian varieties) 3.C 

(of an infinitesimal) 87.G 

(of an infinity) 87.G 

(of a Lie algebra) 191.D 

(of a meromorphic function) 272.C 

(of a microdifferential operator) 274.F 

(of a multistep method) 303.E 

(=order relation) 311.A 

(of a plane algebraic curve) 9.B 

(of a point in an ordinary curve) 93.C 

(of a point with respect to a cycle) 99.D 

(of a pole of a complex function) 198.D 

(of the precision of numerical solution) 303.B 

(=a subring) 27.A 

(of a system of differential equations) 313.B 

(of a transcendental entire function) 429.B 

(of a zero point of a complex function) 198.C 

d'Alembert's method of reduction of 252.F 

derivatives of higher App. A, Table 9.III 

difference of the nth 104.A 

finite (distribution) 125 

y-point of the kth (of a holomorphic function) 
198.C 

ofhigher 87.G 

infinite (element in a group) 190.C 

left (of a g-lattice) 27.A 

oflower 87.G 

maximal (of a g-lattice) 27.A 

at most (a function) 87.G 

of the nth 87.G 

principal (of an algebraic number field) 14.B 

principal (fundamental theorem of) 14.C 

right (of a g-lattice) 27.A 

ofthe same 87.G 

small set of 436.G 

space of line elements of higher 152.C 

surface of the second 350.A 


Subject Index 
Order « 


zero point of the kth (of a holomorphic func- 
tion) 198.C 
zero point of the — kth (of a complex function) 
198.D 
order « 
capacity of 169.C 
Cesaro method of summation of 379.M 
Holder condition of 84.A 
Lipschitz condition of 84.A 
potential of 338.B 
summable by Cesáro's method of 379.M 
order k 
coefficient of 110.A 
converge in the mean of 173.B 342.D 
invariants of 110.A 
population moment of 396.C 
principal components of 110.A 
quantile of 341.H 
Riesz method of summation 379.R 
summable by Hólder's method of 379.M 
summable by M. Riesz’s method of 379.R 


order p 
contravariant tensor field of 105.0 
jetof 105.X 


order s, covariant tensor field of 105.0 
order 0, frame of 110.C 
order 1 
family of frames of 110.B 
frame of 110.C 
order 2, frame of 110.B,C 
order 3, frame of 110.B,C 
order 4, frame of 110.B 
order bounded 310.B 
order convergent sequence (in a vector lattice) 
310.C 
order-disorder transition 402.F 
ordered additive group  439.B 
totally 439.B 
ordered complex (of a semisimplicial complex) 
70.E 
ordered field 149.N 
Archimedean 149.N 
Pythagorean 60.0 
ordered group 243.G 
lattice- 243.G 
totally 243.G 
ordered linear spaces 310.B 
lattice- 310.B 
ordered pair 33.B 381.B 
ordered set 311.A 
inductively 34.C 
lattice- 243.A 
linearly 311-A 
partially 311.A 
semi- 311.A 
totally 311.A 
ordered simplex (in a simplicial complex) 70.E 
ordered simplicial complex 70.C 
order function (meromorphic function) 272.B 
order homomorphic (ordered sets) 311.E 
order homomorphism 311.E 
order ideal (of a vector lattice) 310.B 
ordering 96.C 311.A 
dual 311.A 
duality principle for 311.A 
lexicographic 311.G 
lexicographic linear 248.M 
linear 311.À 
partial 311A 
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pre- 311.H 
total 311.A 
well- 311.C 


order isomorphic (ordered sets) 311.E 
order isomorphism  311.E 
order limit (in a vector lattice) 310.C 
order-preserving mapping 311.E 
order-preserving semigroup 286.Y 
order relation 311.A 
order statistic 396.C 
order topology 425.C 
order type 312.A 
ordinal numbers 312.B 
admissible 356.G 
cardinality of 49.E 
constructive 81.B 
countable 49.E 
finite 312.B 
of the first, second, or third number class 
312.D 
of a higher number class 312.D 
hyperconstructive 81.E 
initial 49.E 
isolated 312.B 
limit 312.B 
strongly inaccessible 312.E 
transfinite 312.B 
transfinite initial 49.E 
weakly inaccessible 312.E 
ordinal product (of a family of ordered sets) 311.G 
ordinal scale 397.M 
ordinal sum (of a family of ordered sets) 311.G 
ordinary curve 93.C 
ordinary derivative (of a set function) 380.D 
ordinary differential equation(s) 313, App. A, Table 
14 
(asymptotic behavior of solutions) 314 
(boundary value problems) 315 
(initial value problems) 316 
Euler linear App. A, Table 14.] 
higher-order App. A, Table 14.1 
homogeneous App. A, Table 14.I 
homogeneous (of higher order) App. A, Table 
14.1 
linear 252 313.A 
linear (with constant coefficients) App. A, 
Table 14.I 
linear (of the first order) App. A, Table 14.I 
linear (global theory) 253 
linear (of higher order) App. A, Table 14.I 
linear (local theory) 254 
nonlinear 313.A 
nonlinear (global theory) 288 
nonlinear (local theory) 289 
system of 313.B 
ordinary differential operator 112.A 
ordinary Dirichlet series 121.A 
ordinary double point (of a plane algebraic curve) 
9.B 
ordinary element 191.1 
ordinary helicoid 111.1 
ordinary helix 111.F 114.F 
ordinary integral element | 428.E 
ordinary integral manifold (of a differential ideal) 
428.E 
ordinary lower derivative (of a set function) 380.D 
ordinary point 
(of an analytic set) 23.B 
(ofa curve) 93.G 
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(in hyperbolic geometry) 285.C 
(of an ordinary curve) 93.C 
(on a Riemann surface) 11.D 
ordinary representation (of a finite group) 362.G 
ordinary sense, derivable in the 380.D 
ordinary singularity 
(of an analytic function) 198.P 
in the wider sense 198.P 
ordinary solution (of a differential ideal) 428.E 


ordinary upper derivative (of a set function) 380.D 


ordinate set 221.E 
orientable 
(manifold) 105.F 201.N 
(pseudomanifold) 65.B 
transversely 154.B 
orientable fiber bundle 147.L 
Orientation 
(of an affine space) 139.B 
(of a contact element) 110.A 
(of a manifold) 105.F 201.N 
local (in an oriented manifold) 201.N 
negative (of an oriented manifold) 105.F 
opposite (of oriented atlases) 105.F 
positive (of an oriented manifold) 105.F 
same (of oriented atlases) 105.F 
orientation cohomology class 201.N 
orientation manifold 201.N 
orientation sheaf 201.R 
orientation-preserving mapping 99.A 
orientation-reversing mapping 99.A 
oriented atlas (of an orientable differentiable mani- 
fold) 105.F 
oriented cobordism 
class 114.H 
group 114.H 
oriented differentiable structures, group of (on the 
combinatorial sphere) 114.1 
oriented element (in a covering manifold) 110.A 
oriented G-manifold 431.E 
oriented graph 186.B 
oriented manifold 105.F 201.N 
integrals over 105.T 
oriented pseudomanifold 65.B 
coherently 65.B 
oriented q-simplex 201.C 
Oriented real hypershpere 76.A 
oriented segment 442.A 
oriented simplicial chain complex  201.C 
oriented singular r-simplex of class C?  105.T 
oriented tangent line 76.B 
origin 
(of an affine space) 7.C 
(of a Euclidean space) 140 
(of a projective frame) 343.C 
Orlicz-Pettis theorem 443.D 
Orlicz space 168.B 
Ornstein-Uhlenbeck Brownian motion 45.1 
orthant, positive 89.G 
orthochronous 258.A 
orthocomplement (of a subspace of a linear space) 
139.G 
orthogonal 
(block design) 102.J 
(elements of a ring) 368.B 
(in Euclidean geometry) 139.E,G 
(functions) 317.A 
(in a Hilbert space) 197.C 
(linear subspaces) 256.G 
mutually (latin squares) 241.B 
orthogonal array 102.L 


Subject Index 
Orthonomic system, passive 


orthogonal complement (of a subset of a Hilbert 
space) 197.E 
orthogonal component (of an element of a linear 
. space) 139.G 
orthogonal coordinate system adapted to (a flag) 
139.E 
orthogonal curvilinear coordinates 90.C 
orthogonal curvilinear coordinate system App. A, 
Table 3.V 
orthogonal expansion 317.A 
orthogonal for a finite sum  19.G 
orthogonal fractional factorial design 102.1 
orthogonal frame 111.B 139.E 
orthogonal frame bundle 364.A 
tangent 364.A 
orthogonal function(s) 317, App. A, Table 20 
Haar system of 317.C 
Rademacher system of 317.C 
Walsh’s system of 317.C 
orthogonal group  60.I 139.B 151.1 
(over a noncommutative group) 60.0 
complex 60.1 
complex special 60.1 
infinite 202.V 
over K with respect to Q 60.K 
pair 422.1 
proper 60.1 258.A 
reduced 61.D 
special 60.1 
orthogonality for a finite sum 317.D, App. A, Table 
20.VIl 
orthogonality relation 
(on irreducible characters) 362.G 
(for square integrable unitary representations) 
437.M 
orthogonalization 
Gram-Schmidt 317.A 
Schmidt 317.A 
orthogonal k-frame (in R”)  199.B 
orthogonal matrix 269J 
complex 269J 
proper 269 
orthogonal measure 164.C 
orthogonal polynomial(s) 19.G, App. A, Table 
20.VII 
Chebyshev 19.G 
simplest 19.G 
system of 317.D 
orthogonal projection 
(in Euclidean geometry) 139.E,G 
(in a Hilbert space) 197.E 
method of 323.G 
orthogonal series (of functions) 317.A 
orthogonal set 
(of functions) 317.A 
(of a Hilbert space) 197.C 
(ofa ring) 368.B 
orthogonal system 
(of functions) 317.A 
(of a Hilbert space) 197.C 
complete 217.G 
orthogonal trajectory 193.J 
orthogonal transformation 139.B 348.B 
(over a noncommutative field) 60.0 
(with respect to a quadratic form) 60.K 
orthogonal transformation group  60.I 
over K with respect toQ 60.K 
orthomodular 351.L 
orthonomic system, passive (of partial differential 
equations) 428.B 


Subject Index 
Orthonormal basis 


orthonormal basis 197.C 
orthonormalization 139.G 
orthonormal moving frame 417.D 
orthonormal set 

(of functions) 317.A 

(of a Hilbert space) 197.C 

complete (of a Hilbert space) 197.C 
orthonormal system 

complete 217.G 


complete (of fundamental functions) 217.G 


orthorhombic system 92.E 
oscillate (for a sequence) 87.D 
oscillating (series) 379.A 
oscillating motion 420.D 
oscillation(s) 318 
(ofa function) 216.A 
bounded mean 168.B 
damped 318.B 
equation of App. A, Table 15.VI 
forced 318.B 
harmonic 318.B 
nonlinear 290.A 
relaxation 318.C 
nonstationary 290.F 
oscillator process 351.F 
oscillatory 314.F 
osculating circle 111.F 
osculating elements 309.D 
osculating plane 111.F 
osculating process 77.B 
Oseen approximation 205.C 
Osgood theorem, Hartogs- 21.H 
O-S positivity 150.F 
Osterwalder-Schrader axioms 150.F 
Ostrogradskii formula 94.F 
outdegree 186.B 
outer area 216.F 270.G 
outer automorphisms 
group of (ofa group) 190.D 
group of (of a Lie algebra) 248.H 
outer capacity, Newtonian 48.H 
outer function 43.F 
outer harmonic measure 169.B 
outer measure 270.E,G 
Carathéodory 270.E 
Lebesgue 270.G 
outer solution 25.B 
outer variable 25.B 
outer volume 270.G 
outgoing subspace 375.H 
outgoing wave operator 375.B 
outlier test 397.Q 
out-state 150.D 386.A 
oval 89.C 111.E 
Cassini 93.H 
mean (of two ovals) 89.D 
width of the 111.E 
ovalod 89.C 111.1 
overall approximation formula 303.C 
overconvergence 339.E 
overcrossing point 235.A 
overdetermined system 
(of differential operators) 112.1 
(of partial differential equations) 320.F 
maximally (=holonomic) 274.H 
overfield 149.B 
overidentified 128.C 
overrelaxation 
successive (SOR)  302.C 


P 
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PSL/(n, k) (projective special linear group) 60.B 
Däin, k) (projective symplectic group over K) 60.L 


PU (n) (projective unitary group) 60.F 

P"(K) (projective space) 343.H 

g-subsequence 354.E 

n-group 151.F 

n-length (of a group) 151.F 

m-manifold 1141 

n-series (of a group) 151.F 

n-solvable group  151.F 

n theorem 116 

n topology 424.R 

I} set 22.A 

TT! set 22D 

p-adic exponential valuation 439.F 

p-adic extension (of the field of quotients of a 
Dedekind domain) 439.F 


p-index (of a central simple algebra over a finite 


algebraic number field) 29.G 


p-invariant (of a central simple algebra over a finite 


algebraic number field) 29.G 
p-primary ideal 67.F 
P-function, Weierstrass 134.F, App. A, Table 
16.IV 
P-acyclic 200.Q 
p-adic integer(s) 439.F 
ring of 439.F 
p-adic L-function 450.J 
p-adic number 439.F 
p-adic number field 257.A 439.F 
p-adic regulator 450.J 
p-adic valuation 439.F 
p-ary matroid 66.H 
p-atom 168.B 
p-covector 256.0 
p-dimensional noncentral Wishart distribution 
374.C 
p-extension (of a field) 59.F 
p-factor (of an element of a group) 362.I 
p-fold exterior power 
(of a linear space) 256.0 
(of a vector bundle) 147.F 
p-form 
tensorial 417.C 
vectorial 417.C 
p-group 151.B 
Abelian 2.A 
complete (Abelian) 2.D 
divisible (Abelian) 2.D 
p-parabolic type 327.H 
p-rank (of a torsion-free additive group) 2.E 
p-regular (element of a finite group) 362.1 
p-Sylow subgroup 151.B 
pth power, operator of summable 68.K 
pth power operation 
Pontryagin 64.B 
Steenrod 64.B 


p-torsion group of an exceptional group App. 


Table 6.IV 

p-valent (function) 438.E 
absolutely 438.E 
circumferentially mean 438.E 
locally 438.E 
locally absolute 438.E 
mean 438.E 
quasi- 438.E 

p-vector 256.0 


A, 
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bundle of 147.F 
(p,q)-ball knot 235.G 
(p,g)-knot 235.G 
(p + 1)-stage method 303.D 
P-convex (for a differential operator) 112.C 
strongly 112.C 
P-function, Riemann 253.B, App. A, Tables 
14.II 18.I 
P-projective resolution 200.0 
P-wave 351.E 
P,set 22.D 
P’-figure 343.B 
p-space 425.Y 
P-space 425.Y 
Padé approximation 142.E 
Padé table 142.E 
Painlevé equation 288.C 
Painlevé theorem 198.G 
Painlevé transcendental function 288.C 
pair 381.B 
(in axiomatic set theory) 33.B 
ball 235.G 
BN- 13.R 
contact (in circle geometry) 76.C 
group (of topological Abelian groups) 422.1 
order 381.B 
ordered (in axiomatic set theory) 33.B 
orthogonal group 422.1 
Poincaré, of formal dimension n 114.J 
simplicial 201.L 
sphere 65.D 235.G 
topological 201.L 
unordered 381.B 
unordered (in axiomatic set theory) 33.B 
paired comparison 346.C 
pairing 
(of linear spaces) 424.G 
axiom of 381.G 
pair test 346.D 
pairwise sufficient (statistic) 396.F 
Palais-Smale condition (C) 279.E 286.Q 
Paley theorem 317.B 
Paley theory, Littlewood- 168.B 
Paley-Wiener theorem 125.0,BB 
pantograph 19.E 
paper 
binomial probability 19.B 
functonal 19.D 
logarithmic 19.F 
probability 19.F 
semilogarithmic 19.F 
stochastic 19.B 
Pappus theorem 
(on conic sections) 78.K 
(in projective geometry) 343.C 
parabola(s) 78.A 
family of confocal 78.H 
parabolic 
(differential operator) 112.A 
(Riemann surface) 367.D,E 
(simply connected domain) 77.B 
(visibility manifold) 178.F 
parabolic coordinates 90.C, App. A, Table 3.V 
parabolic cusp (of a Fuchsian group) 122.C 
parabolic cylinder 350.B 
parabolic cylinder function 167.C 
parabolic cylindrical coordinates 167.C, App. A, 
Table 3.V 
parabolic cylindrical equation App. A, Table 14.1I 








Subject Index 
Parameter(s) 


parabolic cylindrical surface 350.B 
parabolic-elliptic motion 420.D 
parabolic geometry 285.A 
parabolic motion 420.D 
parabolic point (on a surface) 110.B 111.H 
parabolic quadric hypersurface 350.I 
parabolic subalgebra (of a semisimple Lie algebra) 
248.0 

parabolic subgroup 

(of an algebraic group) 13.G 

(of the BN-pair) 13.R 

(of a Lie group) 249.J 

cuspidal 437.X 

minimal k- 13.Q 

standard k- 13.Q 
parabolic transformation 74.F 
parabolic type 

(equation of evolution) 378.1 

partial differential equation of 327 
paraboloid 

elliptic 350.B 

elliptic, of revolution 350.B 

hyperbolic 350.B 
paracompact (space) 425.8 

countably 425.Y 

strongly 425.8 
paracompact C’-manifold 105.D 
paradox(es) 319 

Burali-Forti 319.B 

d’Alembert 205.C 

Richard 319.B 

Russel 319.B 

Skolem 156.E 

Zeno 319.C 
parallax 

annual 392 

geocentric 392 
parallel(s) 

(affine subspaces) 7.B 

(lines) 139.A 155.B 

(lines in hyperbolic geometry) 285.B 

(tensor field) 364.B 

axioms of 139.A 

in the narrow sense (in an affine space) 7.B 

in the sense of Levi-Civita 111.H 

in the wider sense (in an affine geometry) 7.B 
parallel coordinates (in an affine space) 7.C 
parallel displacement 

(in an affine connection) 80.H 

(in a connection) 80.C 

(in the Riemannian connection) 364.B 
parallelepiped, rectangular 14.0 
parallelism, absolute 191.B 
parallelizable 

(flow) 126.E 

(manifold) 114.1 

almost 114.1 

s- 114] 

stably 114] 
parallelogram, period 134.E 
parallelotope 425.T 

(in an affine space) 7.D 

open (in an affine space) 7.D 
parallel projection (in an affine space) 7.C 
parallel translation 80.C 364.B 
parameter(s) 165.C 

(of an elliptic integral} 134.A 

(in a population distribution) 401.F 

(of a probability distribution) 396.B 


Subject Index 
Parameter space 


acceleration 302.C 
canonical(ofan arc) 111.D 
design for estimating 102.M 
distinct system of 284.D 
estimable | 403.E 
isothermal 334.B 
isothermal (for an analytic surface) 111.1 334.B 
Lagrange’s method of variation of 252.D 
linear 102.A 
linearly estimable 403.E 
local (Fuchsian groups) 32.B 
local (of a nonsingular algebraic curve) 9.C 
local (of a Riemann surface) 367.A 
local canonical (for power series) 339.A 
local uniformizing (of a Riemann surface) 
367.A 
location 396.5 400.E 
one- (group of transformations) 105.N 
one- (subgroup of a Lie group) 249.Q 
one-, semigroup of class (C?) 378.B 
of regularity (of a Lebesgue measurable set) 
380.D 
regular system of 284.D 
scale 396.1 400.E 
secondary 110.A 
selection 396.F 
system of 284.D 
time (of a stochastic process) 407.A 
transformation 396.I 
transformation of 111.D 
true value of 398.A 
parameter space 
(of a family of compact complex manifolds) 
72.G 
(of a family of probability measures) 398.A 
(of a probability distribution) 396.B 
parametrically sustained vibration 318.B 
parametric function 102.A 399.A 
parametric programming 264.C 
parametric representation 165.C 
(of Feynman integrals) 146.B 
(of a subspace of an affine space) 7.C 
parametrix 189.C 
left 345.A 
right 345.A 
parametrized, effectively (at 0) 72.G 
paraxial ray 180.B 
parity check matrix 63.C 
parity check polynomial 63.D 
parity transformation 359.B 
Parreau-Widom type 164.K 
Parseval equality | 18.B 197.C 
Parseval identity 18.B 159.A 160.C 192.K 
220.B,C,E 
parsing 31.E 
part(s) 
(for a function algebra) 164.F 
connected 150.D 
cyclic (of an ergodic class) 260.B 
_ dissipative (of a state space) 260.B 
essential 260. 
finite (of an integral) 125.C 
Gleason (for a function algebra) 164.F 
holomorphic (in a Laurent expansion) 198.D 
homogeneous (of a formal power series) 370.A 
imaginary 74.A 
integration by (for the D-integral) 100.G 
integration by (for the Stieltjes integral) 94.C 
integration by (for the Riemann integral) 
216.C 
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negative (of an element of a vector lattice) 
310.B 
positive (of an element of a vector lattice) 
310.B 
principal (of a differential operator) 112.A 
principal (of a Laurent expansion) 198.D 
principal (of a partial differential operator) 
320.B 
purely contractive 251.N 
real 74.A 
semisimple (of an algebraic group) 13.E 
semisimple (of a nonsingular matrix) 13.E 
singular (of a Laurent expansion) 198.D 
unipotent (of an algebraic group) 13.E 
unipotent (of a nonsingular matrix) 13.E 
partial boundary operator 200.E 
partial capture 420.D 
partial correlation coefficient 397.J 
sample 280.E 
partial denominator (of an infinite continued frac- 
tion) 83.A 
partial de Rham system 274.G 
partial derivative 106.F,K 
nth-order 106.H 
partial derived functor 200.1 
partial differential 200.H 
partial differential coefficient 106.E 
partial differential equation(s) 313.A 320 
(initial value problems) 321 
(method of integration) 322 
of elliptic type 323, App. A, Table 15. VI 
of the first order 324 
Fokker-Planck 115.A 
hyperbolic 325 
of hyperbolic type 325 
of mixed type 326 
of parabolic type 327 
solution, of the first order App. A, Table 15.11 
solution, of the second order App. A, Table 
15.111 
system of, of order 1 (on a differentiable mani- 
fold) 428.F 
partial differential operator 112.A 
partial differentiation 106.F 
partial fraction App. A, Table 10.V 
partial function 356.E 
partial graph 186.C 
partially balanced incomplete block design 102.J 
406.J 
partially confounded (with blocks) 102.J 
partially conserved axial-vector currents 132.G 
partially differentiable (function) 106.F 
partially isometric (operator) 251.E 
partially ordered set 311.A 
partial mapping (of a mapping) 381.C 
partial numerator (of an infinite continued fraction) 
83.A 
partial observation 405.C 
partial ordering 311.A 
partial pivoting 302.B 
partial product 379.G 
partial quotient, nth 83.A 
partial recursive (in a partial recursive function) 
356.E,F 
partial sum (of a series) 379.A 
diagonal (of a double series) 379.E 
partial summation, Abel 379.D 
partial wave 386.B 
partial wave expansion 375.E 386.B 
partial wave scattering amplitude 375.E 
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particle(s) 
Bose 132.A 
composite 132.A 
elementary 132 
Fermi 132.A 
particular solution 
(of a differential equation) 313.A 
(of partial differential equations) 320.C 
(for a system of differential equations) 313.C 
particular transformation (of bX) 248.R 
partition(s) 
(in ergodic theory) 136.E 
(of an interval) 216.A 
(ofaset) 381.D 
(ofa space) 425.L 
entropy of 136.E 
-independent 136.E 
independent sequence of 136.E 
Markov (for an automorphism) 136.C,G 
number of 177.D 328 
of numbers 328 
Pinsker 136.E 
principal 66.H 
of unity 425.R 
of unity of class C® 105.S 
of unity subordinate to a covering 425.R 
upper semicontinuous 425.L 
partition function 402.D 
grand 402.D 
partitioning algorithm 215.E 
Pascal, B. 329 
limaçon of 93.H 
Pascal configuration 78.K 
Pascal line 78.K 
Pascal theorem 
(on conic sections) 78.K 
(in geometry) 155.E 
(in projective geometry) 343.E 
Pascal triangle 330 
Pasch's axiom (in geometry) 155.B 
passive 
(state) 402.G 
completely 402.G 
passive boundary point 260.I 
dual 260.1 
passive network 282.C 
passive orthonomic system (of partial differential 
equations) 428.B 
pastcone 258.A 
past history, independent of the 406.D 
pasting together the boundaries 114.F 
path 
(in a Finsler space) 152.C 
(ina graph) 186.F 
(of a Markov process) 261.B 
(of a stochastic process) 407.A 
(in a topological space) 148.C 170 
asymptotic (for a meromorphic function) 
272.H 
closed (in a graph) 186.F 
closed (in a topological space) 170 
closed, space of 202.C 
critical 376 
direct 186.F 
direct closed 186.F 
Euler (ina graph) 186.F 
general geometry of 152.C 
Hamilton 186.F 
of an integration (curvilinear integral) 94.D 
inverse 170 


Subject Index 
Percolation process 


nontangential 333.B 
projective geometry of 109 
quasi-independent of (response probability) 
346.G 
sample 407.A 
simple 186.F 
Stolz (in a plane domain) 333.B 
path-component 79.B 
path-connected 79.B 
path-dependent, d-trial 346.G 
path-independent (response probability) 346.G 
path integral 351.F 
pathological (space) 65.F 
path space 148.C 261.B 
pathwise uniqueness of solution 406.D 
pattern formation 263.D 
Pauli approximation 351.G 
Pauli-Lubanski vector 258.D 
Pauli principle 351.G 
Pauli spin matrix 258.A 351.G 
payoff 108.B,C 173.B 
payoff function 173.C 
PBIBD (partially balanced incomplete block 
design) 102.J 
PC (predictor-corrector) method 303.E 
PCT invariance 386.B 
PCT theorem 386.B 
peak point 164.D 
generalized 164.D 
peak set 164.D 
generalized 164.D 
Peano area (of a surface) 246.F 
Peano continuum 93.D 
Peano curves 93.J 
Peano postulates 294.B 
Pearson distribution 397.D 
Pearson lemma, Neyman- 400.B 
Péclet number 116.B 
pedal curve 93.H 
Peierls-Bogolyubov inequality 212.B 
Peirce decomposition (of a Jordon algebra) 231.B 
Peirce left decomposition (in a unitary ring) 368.F 
Peirce right decomposition (in a unitary ring) 
368.F 
Peirce space 231.B 
Petczynski theorem, Bessaga- 443.D 
Pell equation 118.A 
penalized problems 440.B 
penalty method 292.E 
penalty term 440.B 
pencil 
algebraic 15.C 
ofconics 343.E 
of hyperplanes (in a projective space) 343.B 
Lefschetz 16.U 
linear 16.N 
of lines (in a projective plane) 343.B 
of planes (in a 3-dimensional projective space) 
343.B 
of quadric hypersurfaces 343.E 
of quadrics 343.E 
peninsula (in a Riemann surface) 272.J 
pentagamma function 174.B 
pentagon 155.F 
pentagonal number 4.D 
pentagonal number theorem 328 
pentaspherical coordinates 90.B 
percolation process 340.D 
bond 340.D 
site 340.D 


Subject Index 
Perfect 


perfect 
(image) 180.A 
æ- (graph) 186.K 
y- (graph) 186.K 
perfect additive functional 261.E 
perfect code 63.B 
perfect delay convention 51.F 
perfect field 149.H 
perfect fluid 205.B 
perfect image 425.CC 
perfect inverse image 425.CC 
perfect kernel (in potential theory) 338.E 
perfectly normal space 425.Q 
perfectly separable space 425.P 
perfect mapping 425.W 
quasi- 425.CC 
perfectness theorem 186.K 
perfect number 297.D 
of the second kind 297.D 
perfect set 425.0 
perigon, straight 139.D 
perihelion distance 309.B 
period 
(of an Abelian differential form) 11.C 
(of an ergodic class) 260.B 
(of a marked K3 surface) 72.K 
(ofan orbit) 126.D 
(of an oscillation) 318.A 
(of a periodic continued fraction) 83.C 
(of a periodic function) 134.B 
(ofa wave) 446 
fundamental 134.E 
periodgram  421.C 
periodic (trajectory) 126.D 
almost 126.F 
periodic continued fraction | 83.C 
periodic endomorphism (at a point) 136.E 
periodic function 134.E 
almost (ona graph) 18.E 
almost (with respect to p) 18.C 
almost (in the sense of Bohr) 18.B 
analytic almost 18.D 
doubly 134.E 
simply 134.E 
uniformly almost 18.B 
periodic group 2.A 
maximally almost 18.1 
minimally almost 18.1 
periodic inequality, Riemann 3.L 
periodicity 390.J 


periodicity modulus (of an elliptic integral) 134.A 


periodicity theorem, Bott 202.V 237.D, App. A, 
Table 6.VII 
periodic solution (of Hill's equation) 268.E 
period matrix 
(of a closed Riemann surface) 11.C 
(of a complex torus) 3.H 
period parallelogram 134.E 
fundamental 134.E 
period relation, Riemann’s 3.L 11.C 
peripheral devices 75.B 
peripheral system 235.B 
permeability, magnetic 130.B 
permeable membrane 419.A 
permutation 190.B 
(in a symmetric group) 151.G 


even 151.G 
k- 330 
odd 151.G 


permutation group 190.B 
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ofdegreen 151.G 
imprimitive 151.G 
intransitive 151.H 
K-transitive 151.H 
K-ply transitive 151.H 
multiply transitive 151.H 
primitive 151.H 
regular 151.H 
transitive 151.H 
permutation representation (of a group) 362.B 
degree of 362.B 
faithful 362.B 
primitive 362.B 
reciprocal 362.B 
similar 362.B 
perpendicular 
(to a hyperplane) 139.E 
foot ofthe 139.E 
perpetual motion 402.G 
Perron-Brelot solution (of Dirichlet problem) 
120.C 
Perron-Frobenius theorem 310.H 
Perron integrable (function) 100.F 
Perron method (in Dirichlet problem) 120.C 
Perron theorem 
(on linear transformations of sequences) 379.L 
(on ordinary differential equations) 316.E 
(on positive matrices) 269.N 
Perron-Wiener-Brelot solution (of Dirichlet 
problem) 120.C 
persistent 260.J 
perspective 343.B 
perspective mapping (in projective geometry) 343.B 


PERT 307.C 376 


perturbation(s) 

analytic 331.D 

asymptotic 331.D 

general theory of 420.E 

Kato 351.D 

of linear operators 331 

method 25.A 

regular 331.D 

secular 55.B 

singular 289.E 

special theory of 420.E 
Petersson conjecture, Ramanujan- 32.D 
Petersson metric 32.B 
Peter. Wel theory 

(on compact groups) 69.B 

(on compact Lie groups) 249.U 
Petrovskii, hyperbolic in the sense of 325.F 
Petrovskii theorem 112.D 
Pettis completely additivity theorem 443.G 
Pettis integrable 443.F 

Gelfand- 443.F 
Pettis integral 443.F 

Gel'fand- 443.F 
Pettis measurability theorem  443.B 
Pettis theorem 

Dunford- 68.M 

Orlicz- 443.D 
Petvyashvili equation, Kadomtsev- 387.F 
Pfaffian 103.G 
Pfaffian equation(s) 428.A 

system of 428.A 
Pfaffian form | 428.A 
Pfaff problem 428.A 

generalized 428.B 
Pfluger extremal length, Hersch- 143.A 
phase 
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initial (of a simple harmonic motion) 318.B 
pure 402.G 
phase average 402.C 
phase constant (of a sine wave) 446 
phase function (of a Fourier integral operator) 
274.C 345.B 
phase portrait 126.B 
phase shift 375.E 386.B 
phase space 
(of a dynamical system) 126.C 290.C 
(for functional-differential equation) 163.C 
(in statistical mechanics) 402.C 
momentum 126.L 
velocity 126.L 
phase transition 340.B 
phase velocity (of a sine wave) 446 
phenomenon 
Gibbs 159.D 
Runge 223.A 
Stokes 254.D 
photon 132.B 377.B 
Phragmén-Lindelóf theorem 43.C 
physical Hilbert space 150.G 
physically contains 351.K 
PI-algebra (algebra with polynomial identities) 
29.J 
Picard exceptional value 272.E 
Picard group (of a commutative ring) 237J 
Picard-Lefschetz formula 418.F 
Picard-Lefschetz transformation 16.U 
Picard number (of a variety) 16.P 
Picard scheme  16.P 
Picard theorem 
(on transcendental entire functions) 429.B 
(on transcendental meromorphic functions) 
272.E 
Picard variety 16.P 
(of a compact Kahler manifold) 232.C 
Picard-Vessiot extension field 113 
Picard-Vessiot theory 113 
picture 
Heisenberg 351.D 
Schrödinger 351.D 
piecewise affine mapping 192.Q 
piecewise continuous function 84.B 
piecewise linear mapping 65.A 70.C 
piecewise smooth curve 364.A 
Pincherle-Goursat kernel 217.F 
Pinching problem (differentiable) 178.E 
Pinsker partition 136.E 
Pitman estimator 399.G 
pivot 302.B 
pivoting 
complete 302.B 
partial 302.B 
PL category 65.A 
PLembedding 65.D 
PL homeomorphism 65.A 
PLisomorphism 65.A 
PL k-ball 65.C 
PL (k —1)-sphere 65.C 
PL mapping (map) 65.A 
PL microbundle 147.P 
PL (n,m)-ball knot 65.D 
PL (n,m)-knot 65.D 
PL normal 147.P 
PL structure 65.C 
PLtangent 147.P 
PLtopology 65.A 
place (of a field) 439 


Subject Index 
Plücker relations (on Plücker coordinates) 


placement problem 235.A 
planar 367.G 
planar character 367.G 
planar curvilinear coordinates App. A, Table 3.V 
planar graph 186.H 
planarity (of a graph) 186.H 
Plancherel formula (on a unimodular locally com- 
pact group) 437.L 
Plancherel measure (of a locally compact group) 
437.L 
Plancherel theorem 160.H 192.A,K. 
(with respect to the Radon transform) 218.G 
Planck constant 351.A 
Planck (partial differential) equation, Fokker- 
115.A 402.1 
plane(s) 155.B 
(as an affine space) 7.A 
(in a projective space) 343.B 
Cayley projective 54 
complex 74.C 
conjugate (with respect to a quadric surface) 
350.C 
coordinates (of a plane) 343.C 
finite projective 241.B 
Gauss-Árgand 74.C 
Gaussian 74.C 
half- 155.B 333.A 
hodograph 205.B 
hyperbolic 122.C 
normal 111.F 
osculating 111.F 
pencil of (in a 3-dimensional projective space) 
343.B 
polar (with respect to a quadric surface) 350.C 
principal (of a quadric surface) 350.B 
projective 343.B 
rectifying 111.F 
tangent 111.H, App. A, Table 4I 
w- 74.D 
Zz- 74.D 
plane algebraic curve 9.B 
plane coordinates (of a plane) 343.C 
plane curve App. A, Table Al 
continuous 93.B 
plane domains 333 
closed 333.A 
multiply connected 333.A 
n-ply connected 333.A 
plane geometry 181 
plane polygon  155.F 
plane triangle App. A, Table 2.II 
plane trigonometry 432.A 
plane wave 446 
plane wave decomposition 125.CC 
planimeter 19.A 
planning 
production 376 
statistical 102.A 
plasticity, theory of 271.G 
Plateau problem 334 
playable 108.B 
Pleijel asymptotic expansion, Minakshisundaram- 
391.B 
PLK (Poincaré-Lighthill-Kuo) method 25.B 
Plotkin bound 63.B 
plots 102.B 
Pliicker coordinates (in a Grassman manifold) 
90.B 
Pliicker formulas (on plane algebraic curves) 9.B 
Plücker relations (on Pliicker coordinates) 90.B 


Subject Index 
Plurigenera 


plurigenera 15.E 
pluriharmonic distribution 21.C 
plurisubharmonic function 21.G 
plus infinity 87.D 
Pochhammer differential equation, Tissot- 206.C 
Poincaré, H. 335 
last theorem of 153.B 
theta-Fuchsian series of 32.B 
Poincaré-Birkhoff fixed-point theorem  153.B 
Poincaré-Birkhoff-Witt theorem (on Lie algebras) 
248.J 
Poincaré-Bruns theorem 420.A 
Poincaré characteristic, Euler- 16.E 201.B 
Poincaré class 
Euler- 56.B,F 
universal Euler- 56.B 
Poincaré complete reducibility theorem 3.C 
Poincaré complex 114 
Poincaré condition (in Dirichlet problem) 120.A 
Poincaré conjecture 65.C 
generalized |. 65.C 
Poincaré differential invariant 74.G 
Poincaré duality 201.0 450.Q 
Poincaré formula (in integral geometry) 218.C 
Poincaré formula, Euler- 201.B,F 
Poincaré group 170 258.A 
Poincaré-Lefschetz duality theorem 201.0 
Poincaré-Lighthill-Kuo (PLK) method 25.B 
Poincaré manifold 105.A 
Poincaré mapping (map) 126.C,G 
Poincaré method 25.B 
Poincaré method, Lindstedt- 290.E 
Poincaré metric 74.G 
Poincaré model (of geometry) 285.D 
Poincaré pair (of formal dimension n) 114.J 
Poincaré polynomial (of a finite simplicial complex) 
201.B 
Poincaré series 32.B 
Poincaré series, Eisenstein- 32.F 
Poincaré theorem 383.E 
(on Abelian varieties) 3.D 
Poincaré-Volterra theorem 198.J 
Poinsot representation 271.E 
point(s) 
(of an affine space) 7.A 
(in the foundations of geometry) 155.B 
(ofagraph) 186.B 
(in projective geometry) 343.B 
accessible boundary (of a plane domain) 333.B 
accumulation 425.0 
accumulation (of a sequence of real numbers) 
87.C 
adherent 425.B 
algebraic (over a field) 369.C 
algebraic branch (of a Riemann surface) 367.B 
almost all, of a variety 16.A 
x- (of a meromorphic function) 272.B 
a-limit 126.D 
ambiguous 62.D 
antipodal (on a sphere) 140 
apparent singular 254.C 
base (of a linear system) 16.N 
base (of a loop) 170 
base (of a topological space) 202.B 
bifurcation 126.M 217.M 286.R 
boundary (of a subset) 425.N 
branch (of a covering surface) 367.B 
branch (of a harmonic mapping) 275.B 
branch (of an ordinary curve) 93.C 
catastrophe 51.F 
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cluster 425.0 

coincidence (of maps) 153.B 

collinear (in projective geometry) 343.B 

complete accumulation 425.0 

condensation 425.0 

conjugate 46.C 364.C 

conjugate (in a projective space) 343.E 

corresponding (with respect to confocal qua- 
drics) 350.E 

critical (of a C*-function) 279.B 

critical (of a function) 106.L 

critical (of a mapping) 105.7 193.J 208.B 

critical (of a trajectory) 126.D 

cut (on a geodesic) 178.A 

degenerate critica] 106.L 279.B 

degree of 99.D 

of density (of a measurable set of the real line) 
100.B 

dependent (in an affine space) 7.A 

dependent (in a projective space) 343.B 

deviation 336.B 

discontinuity 84.B 

discontinuity, of the first kind 84.B 

discontinuity, of the second kind 84.B 

dual passive boundary 260.1 

elliptic (of a Fuchsian group) 122.C 

elliptic (on a surface) 111.H 

end (of an ordinary curve) 93.C 

entrance boundary 260.1 

equianharmonic range of 343.D 

equilibrium  108.C 

equilibrium (in the theory of games) 173.C 

equilibrium (of a trajectory) 126.D 

equivariant (of a mapping) 153.B 

equivariant, index (of a mapping) 153.B 

essentially singular (with respect to an analytic 
set) 21.M 

estimation 399.B 401.C 

exit boundary 260.1 

exterior (of a subset) 425.N 

externally irregular 338.L 

extreme (of a convex set) 89.A 

extreme (of a subset of a linear space) 424.T 

fixed (of a discontinuous transformation group) 
122A 

fixed (of a flow) 126.D 

fixed (of a mapping) 153.A 

fixed (of a mapping in a topological linear 
space) 153.D 

fixed (method of roundoff) 138.B 

fixed (of a topological transformation group) 
431.A 

fixed, of discontinuity  5.B 407.A 

fixed, index (of a mapping) 153.B 

fixed branch (of an algebraic differential equa- 
tion) 288.A 

fixed, theorem 153 

flat (ofa surface) 111.H 

focal (of a submanifold of a Riemannian mani- 
fold) 364.C 

frontier (of a subset) 425.N 

functions 380.A 

fundamental (of a projective space) 343.C 

fundamental (with respect to a birational 
mapping) 16.1 

y-, of the kth order (of a holomorphic function) 
198.C 

generalized peak 164.D 

generic 16.A 

geodesic 111.H 365.D 
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Points(s) 


geometric (of a scheme) 16.D 

harmonic range of 343.D 

homoclinic 126 

hyperbolic (on a surface) 111.H 

hyperbolic fixed 126.G 

hyperbolic singular 126.G 

ideal (in hyperbolic geometry) 285.C 

independent (in an affine space) 7.A 

independent (in a projective space) 343.B 

at infinity (in affine geometry) 7.B 

at infinity (of a Gaussian plane) 74.D 

at infinity (in hyperbolic geometry) 285.C 

at infinity (of a Riemann manifold) 178.F 

of inflection (of a curve of class C?) 93.G 

of inflection (of a plane algebraic curve) 9.B 

initial (of a curvilinear integral) 94.D 

initial (of a path) 170 

initial (of a position vector) 7.A 

initial (of a vector) 442.A 

integral 428.E,F 

interior 425.B 

internally irregular 338.L 

irregular (of an analytic set) 45.D 

irregular (of a Markov process) 261.D 

irregular boundary 120.D 

irregular singular 254.B 

isolated 425.0 

isolated (of a curve) 93.G 

isolated fixed 126.G 

isotropic 365.D 

k-rational (of an algebraic variety) 16.A 369.C 

left singular (of a diffusion process) 115.B 

limit (of a discontinuous group) 122.C 

limit (of a sequence of points) 87.E 

limit (of a sequence of real numbers) 87.B 

limit, type 112.1 

logarithmic branch (of a Riemann surface) 
367.B 

middle (of two points of an affine space) 7.C 

movable branch (of an algebraic differential 
equation) 288.A 

multiple (on an arc) 93.B 

multiple (of a plane algebraic curve) 9.B 

multiple (on a variety) 16.F 

negative limit 126.D 

nodal 304.C 

nondegenerate critical 106.L 279.B 

nondegenerate critical (of a function on a 
Hilbert manifold) 286.N 

nonrecurrent 260.B 

nonsingular (of an algebraic variety) 16.F 

normal (of an analytic space) 23.D 

normal (of a variety) 16.F 

null recurrent 260.B 

co-limit 126.D 

100x%- 396.C 

order of (with respect to a cycle) 99.D 

order of (in an ordinary curve) 93.C 

ordinary (of an analytic set) 23.B 

ordinary (of a curve of class CH) 93.G 

ordinary (in hyperbolic geometry) 285.C 

ordinary (of an ordinary curve) 93.C 

ordinary (of a plane algebraic curve) 9.B 

ordinary (on a Riemann surface) 11.D 

overcrossing 235.A 

parabolic (on a surface) 110.B 111.H 

passive boundary  260.I 

peak 164.D 

positive limit 126.D 

positive recurrent 260.B 


principal 180.B 

r-ple (of a plane algebraic curve) 9.B 

ramification (of an analytic covering space) 
23.E 

rational 118.E 

rational double 418.C 

recurrent (of a Markov process) 261.B 

reflection (with respect to a circle) 74.E 

regular (of an analytic set) 23.B 45.D 

regular (with respect to an analytic set) 21.M 

regular (in catastrophe theory) 51.F 

regular (of a differentiable mapping) 105.J 

regular (of a diffusion process) 115.B 

regular (with respect to the Dirichlet problem) 
207.B 

regular (ofa flow) 126.D 

regular (of a Hunt process) 261.D 

regular (of a polyhedron or cell complex) 65.B 

regular (of a surface in E?)  111J 

regular boundary 120.D 

regular singular 254.B 

rest (of a trajectory) 126.D 

right singular (of a diffusion process) 115.B 

saddle (of a function) 255.B 292.A 

saddle (on a surface) 111.H 

saddle (of a system of ordinary differential 
equations) 126.G 

saddle (of two-person games) 108.B 

saddle, method 25.C 

sample 342.B 396.B 398.A 

Schwinger 150.F 

semiregular (of a surface in E?) ULI 

simple (of an analytic set) 23.B 418.A 

simple (on a variety) 16.F 

singular (of an analytic set) 23.B 418.A 

singular (of a continuous vector field) 153.B 

singular (of a curve of class CH) 93.G 

singular (of a linear difference equation) 104.D 

singular (of a plane algebraic curve) 9.B 

singular (of a polyhedron or cell complex) 
65.B 

singular (of a quadric hypersurface) 343.E 

singular (of a surface in E?) 111J 

singular (of a system of linear ordinary dif- 
ferential equations) 254.A 

singular (of a system of ordinary differential 
equations) 289.A 

singular (of a trajectory) 126.D,G 

singular (on a variety) 16.F 

smooth (of variety) 16.F 

stable 16.W 

stationary (of an arc of class C")  111.D 

successive minimum 182.C 

supporting (of a convex set) 89.G 

supporting (of a projective frame) 343.C 

symmetric (with respect to a circle) 74.E 

terminal (of a curvilinear integral) 94.D 

terminal (of a Markov process) 261.B 

terminal (ofa path) 176 

terminal (of a vector) 442.A 

transient 260.B 

transition 254.F 

transversal homoclinic 126.J 

turning 25.B 254.F 

ultrainfinite (in hyperbolic geometry) 285.C 

umbilical (of a surface) 111.H 365.D 

undercrossing 235.A 

unit (of an affine frame) 7.C 

unit (of a projective frame) 343.C 

unit (of a projective space) 343.C 


Subject Index 
Point(s) at infinity 


w- (of an entire function) 429.B 
wandering (of a trajectory) 126.E 
Weierstrass 11.D 
zero (of a holomorphic function) 198.C 
zero (of a polynomial) 337.B 369.C 
zero (of a subset of a polynomial ring) 369.C 
zero, of the —kth order (of a complex function) 
198.D 
zero, of the kth order (of a holomorphic func- 
tion) 198.C 
point(s) at infinity 74.D 285.C 
regular at the 193.B 
pointed coalgebra 203.F 
pointed set 172J 
morphism of 172.J 
pointed shape category 382.A 
pointed topological spaces, category of 202.B 
pointer 96.B 
point estimation 399.B 401.C, App. A, Table 23 
point-finite covering (of a set) 425.R 
point function 380.A 407.D 
point group (of a crystallographic group) 92.A 
point hypersphere 76.A 
point process 407.D 
point range (in projective geometry) 343.B 
of the number system (in projective geometry) 
343.C 
point set 381.B 
points of indeterminacy, set of 23.D 
point spectrum 390.A 
pure 136.E 
pointwise convergent sequence 435.B 
pointwise ergodic theorem  136.B 
Poisson bracket 82.B 271.F 324.C,D 
(of two vector fields) 105.M 
Poisson differential equation 323.A, App. A, Table 
15.0 
Poisson distribution 341.D 397.F, App. A, Table 
22 
Poisson equation 338.A 
Poisson formula App. A, Table 19.TIT 
of T" = E"/T 391J 
Poisson input 260.H 
Poisson integral 168.B 193.G 
Poisson integral formula 198.B 
Poisson integration formula App. A, Table 15.VI 
Poisson kernel 159.C 
Poisson number 271.G 
Poisson point process, stationary 407.D 
Poisson process 5.D 
compound 5.F 
Poisson random measure 407.D 
Poisson ratio 271.G 
Poisson solution 325.D 
Poisson stable 126.E 
negatively 126.E 
positively 126.E 
Poisson summation formula 192.C 
(of Fourier transforms) 192.C 
(on a locally compact Abelian group) 192. 
polar 
(with respect toaconic) 78.J 
(in projective geometry) 343.E 
(relative to pairing) 424.H 
polar coordinates 90.C 
geodesic 90.C 
tangential 90.C 
polar decomposition 251.E 
polar element 
(a function element in the wider sense) 198.0 
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(of an integral element) 428.E 
polar form (of a complex number) 74.C 
polarity (with respect to a quadric hypersurface) 
343.E 
polarization (on an Abelian variety) 3.G 
electric 130.A 
inhomogeneous 3.G 
magnetic 130.A 
principal 3.G 
polarized 
(Hodge structure) 16.V 
(wave) 446 
polarized Abelian variety 3.G 
polarized Jacobian variety, canonically 3.G 9.E 
polar plane (with respect to a quadric surface) 
350.C 
polar set (in potential theory) 261.D 338.H 
polar space 191.1] 
polar system (in projective geometry) 343.D 
polar tetrahedron 350.C 


self- 350.C 
polar triangle 78.J 
self- 78.J 

pole 


(of a complex function) 198.D 
(of a function on an algebraic curve) 9.C 
(of a function on an algebraic variety) 16.M 
(of a polar with respect to a conic) 78.J 
(ofa polar plane) 350.C 
(of a polar of a quadric hypersurface) 158.E 
(ofa roulette) 93.H 
"north (of a complex sphere) 74.D 
north (of a sphere) 140 
order of 198.D 
Regge 132.C 386.C 
resonance 331.F 
south (of a complex sphere) 74.D 
south (ofa sphere) 140 
pole divisor (of a function on an algebraic variety) 
16.M 
policy 127.A 405.C 
Markovian 405.C 
optimal 127.A 
Polish space 22.1273J 
Polya’s enumeration theorem 66.E 
Polya type 374J 
strictly of 374.J 
polychromatic group 92.D 
polydisk 21.B 
polygamma functions 174.B, App. A. Table 17.I 
polygon(s) 155.F 
Cauchy 316.C 
decomposition-equal 155.F 


force 19.C 
normal 234.C 
plane 155.F 


regular 357.A 

simple 155.F 

supplementation-equal 155.F 
polygonal number of orderr 4.D 
polyharmonic 193.0 
polyhedral, convex rational 16.Z 
polyhedral angle, regular 357.B 
polyhedral cone, convex 89.F 
polyhedral group, regular 151.G 


-polyhedron (polyhedra) 


(in an affine space) 7.D 

(of a simplicial complex) 65.A 70.C 
analytic 21.G 

convex 89.A 
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corner 215.C 

Euclidean 70.B 

Euler theorem on 201.F 
integer 215.C 

regular 357.B 
topological 65.A 


polymatroid 66.F 
polynomial(s) 337 


Alexander (of a knot) 235.C,D 

alternating | .337.I 

associated Laguerre 317.D 

Bernoulli 177.B 

Bernshtein 336.A 

Bernshtein (generalized) 418.H 

characteristic (of a differential operator) 112.A 
321.A 

characteristic (of a linear mapping) 269.L 

characteristic (of a matrix) 269.F 

Chebyshev 317.D 336.H, App. A, Table 20.II 

Chebyshev orthogonal 19.G 

cyclotomic 14.L 

differential 113 

Euler 177.C 

Fourier-Hermite 176 

Galois group of the 172.G 

Gegenbauer 317.D 393.E, App. A, Table 20.1 

generalized trigonometric 18.B 

Hermite 317.D 

Hermite interpolation 223.E 

Hilbert (of an algebraic curve) 9.F 

Hilbert (of a graded R-module) 369.F 

Hilbert (of a sheaf) 16.E 

homogeneous of degree n 337.B 

Hosokawa 235.D 

inseparable 337.G 

irreducible 337.F 

isobaric 32.C 

Jacobi 317.D, App. A, Table 20.V 

Lagrange interpolation 223.A 336.G, App. A, 
Table 21 

Laguerre 317.D, App. A, Table 20.VI 

Legendre 393.B, App. A, Table 18.II 

link 235D 

Lommel App. A, Table 19.IV 

in m variables 337.B 

minimal (of an algebraic element) 149.E 

minimal (of a linear mapping) 269.L 

minimal (of a matrix) 269.F 

monic 337.A 

Neumann App. A, Table 19.IV 

Newton interpolation 336.G 

orthogonal 19.G, App. A, Table 20.VII 

parity check 63.E 

Poincaré 201.B 

primitive 337.D 

reduced link 235.D 

reducible 337.F 

ring of 337.A 369 

ring of differential 113 

Sato-Bernshtein 125.EE 

Schlafli App. A, Table 19.IV 

separable 337.G 

simplest orthogonal 19.G 

Snapper 16.E 

Sonine 317.D, App. A, Table 20.VI 

symmetric 337.1 

system of orthogonal 317.D 

trigonometric interpolation 336.E 

ultraspherical 317.D 

zonal 374.C 


Subject Index 
Positive cycle (on an algebraic variety) 


polynomial approximation 336 

best (in the sense of Chebyshev) 336.H 
polynomial approximation theorem (for C*- 

functions) 58.E 

polynomial distribution App. A, Table 22 

negative App. A, Table 22 
polynomial extrapolation method 303.F 
polynomial identity (on an algebra) 29.J 
polynomially transformable 71.E 
polynomial representation (of GL(V)) 60.D 
polynomial ring 337.A 369 

of m variables 337.B 
polynomial time  71.B 
polytropic differential equation 291.F 
Pomeranchuk theorem 386.B 
Pontryagin class(es) 

(of an R"-bundle) 56.D 

combinatorial 56.H 

ofa manifold 56.F 

rational 56.F 

total 56.D 

universal 56.D 


Pontryagin duality theorem (on topological Abelian 


groups) 192.K 422.C 
Pontryagin multiplication 203.D 
Pontryagin number 56.F 
Pontryagin pth power operation 64.B 
Pontryagin product 203.D 
Popov ghost, Faddeev- 132.C 150.G 
population (in statistics) 397.B 401.E 

finite 373.A 
infinite  401.E 
population characteristic 396.C 
population correlation coefficient 396.D 
population covariance 396.D 
population distribution 396.B 401.F 
hypothetical infinite 397.P 
population kurtosis 396.C 
population mean 396.C 
population moment of order k 396.C 
population standard deviation 396.C 
population variance 396.C 
port-admittance matrix 282.C 
porter 168.C 
port-impedance matrix 282.C 
port network, M- 282.C 
portrait, phase 126.B 
position 
general (complexes) 70.B 
general (ofa PL mapping) 65.D 
general (in a projective space) 343.B 
general, theorem 65.D 
hyperboloid 350.D 
method of false 301.C 
position representation 351.C 
position vector 442.A 
(of a point of an affine space) 7.A 
positive 
(chain complex) 200.C 
(class of vector bundles) 114.D 
(complex) 200.H 
(functional on a C*-algebra) 36.G 
(function on a C*-algebra) 308.D 
(Hermitian operation) 308.A 
(square matrix) 310.H 
completely (linear mapping between C*- 
algebras) 36.H 


positive boundary, open Riemann surface of 367.E 


positive cone, natural 308.K 
positive cycle (on an algebraic variety) 16.M 


Subject Index 
Positive definite 


positive definite 

(function) 192.B,J 394.C 

(Hermitian form) 348.F 

(matrix) 269.] 

(potential) 338.D 

(sequence) 192.B 

(on a topological group) 36.L 437.B 
positive definite kernel 217.H 
positive definite quadratic form 348.C 
positive direction (in a curvilinear integral) 198.B 
positive distribution 125.C 
positive divisor 

(of an algebraic curve) OC 

(on a Riemann surface) 11.D 
positive element 

(in a lattice-ordered group) 243.G 

(of an ordered field) 149.N 

strictly 310.H 

totally 14.G 
positive entropy, completely 136.E 
positive half-trajectory 126.D 
positive infinity 87.D 355.C 
positive kernel 217.H 
positive limit point 126.D 
positively invariant 126.D 
positively Lagrange stable 126.E 
positively Poisson stable 126.E 
positively regular process 44.C 
positive matrix 269.N 
positive number 355.A 
positive operator (in vector lattices) 310.E 
positive orientation (of an oriented C'-manifold) 

105.F 
positive orthant 89.G 
positive part (of an element of a vector lattice) 
310.B 

positive prolongational limit set, first 126.D 
positive Radon measure 270.1 
positive real function 282.C 
positive recurrent ergodic class 260.B 
positive recurrent point 260.B 
positive root (of a semisimple Lie algebra) 248.M 
positive semidefinite (operator) 251.E 
positive semidefinite kernel 217.H 
positive semidefinite matrix 269.1 
positive semidefinite quadratic form 348.C 
positive semiorbit 126.D 
positive system, symmetric 112.S 326.D 
positive terms, series of 379.B 
positive type 

(function of) 192.B,J 

(sequence of) 192.B 

(symmetric kernel of) 338.D 
positive variation 

(ofa mapping) 246.H 

(of a real bounded function) 166.B 
positive Weyl chamber 248.R 
positivity 

O-S 150.F 

reflection 150.F 

T- 150.F 
possibility 411.L 
possible construction problem 179.A 
posterior density 401.B 
posterior distribution 398.B 401.B 403.G 
posterior risk 399.F 
postliminal C*-algebra 36.H 
Postnikov complex 70.G 
Postnikov system (of a CW complex) 148.D 
Post problem 356.D 
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Post theorem 356.H 
postulate(s) 35.A 
fifth (in Euclidean geometry) 139.A 
Nernst 419.A 
Peano 294.B 
potency ofaset 49.A 
potential 338.A 
(of a force) 271.C 
(ofa Hamiltonian) 375.B 
(for a lattice spin system) 402.G 
(in a Markov chain) 260.D 
(ona network) 281.B 
central 351.E 
chemical 402.D 419.B 
of a double distribution 338.A 
of a double layer 338.A 
equilibrium 260.D 
finite-band 387.E 
finite-gap 387.E 
logarithmic 338.A 
Newtonian 271.C 338.A 
of ordera 338.B 
reflectionless 387.D 
Riesz 338.B 
scalar 130.A 442.D 
of a simple distribution 338.A 
ofa single layer 338.A 
vector 130.A 442.D 
velocity 205.B 
Yukawa 338.M 
potential energy 271.G 
potential good reduction (of an Abelian variety) 
3.N 
potential kernel, weak 260.D 
potential stable reduction (of an Abelian variety) 
3.N 
potential theory 338 
power 
(of a cardinal number) 49.C 
(of an ordinal number) 312.C 
(ofa test) 400.A 
ofa with exponent x 131.B,G 
fractional 378.D 
p-fold exterior (of a linear space) 256.0 
p-fold exterior (of a vector bundle) 147.F 
Pontryagin (pth) operation 64.B 
residue of the nth 14.M 
ofaset 49.A 
Steenrod (pth) operation 64.B 
power associative algebra 231.A 
power dilation 251.M 
powerful invariant, uniformly most 399.Q 
powerful unbiased, uniformly most 399.Q 
power function 400.A 
envelope 400.F 
power method 298.C 
power-residue symbol 14.N 
power series 21.B 339 370 
with center at the point of infinity 339.A 
convergent 370.B 
field of, in one variable 370.A 
formal 370.A 
formal, field in one variable 370.A 
ring of 370.A 
ring of convergent 370.B 
ring of formal 370.A 
power series space 
finite type 168.B 
infinite type 168.B 
power set 
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(ofa set) 381.B 
axiom of 33.B 381.G 
power sum theorem 123.D 
Powers factor 308.I 
Powers-Størmer inequality 212.B 
Poynting vector 130.A 
Prandtl boundary layer equation 205.C 
Prandtl-Glauert approximation 250.B 
Prandtl-Glauert law of similarity 205.D 
Prandtl integrodifferential equation 222.C 
Prandtl number 116.B 
prealgebraic variety 16.C 
precession 392 
precompact 
(metric space) 273.B 
(set In a metric space) 273.B 
precompact uniform space 436.H 
preconditioned (in numerical solution of linear 
equations) 302.D 
predator relation, prey- 263.B 
predecessor (of an element in an ordered set) 311.B 
predetermined variables 128.C 
predicate 411.G 
analytic 356.H 
arithmetical 356.H 
complete 356.H 
decidable (number-theoretic) 356.C 
enumerating 356.H 
first-order 411.K 
general recursive 356.C 
hyperarithmetical 356.H 
of n-argument 411.G 
n-ary 411.G 
primitive recursive 356.B 
second-order 411.K 
predicate (object) 156.B 
predicate calculus 411.J 
with equality 411.J 
predicate logic 411.J 
with equality 411J 
first-order 411.K 
higher-order 411.K 
second-order 411.K 
third-order 411.K 
predicate symbol 411.H 
predicate variable 411.G,H 
predictable (c-algebra) 407.B 
prediction sufficiency 396.J 
prediction theory 395.D 
linear 395.D 
predictive distribution 403.G 
predictor 303.E 
(in a multistep method) 303.E 
linear 395.D 
Milne's 303.E 
optimal linear 395.D 
predictor-corrector (PC) method 303.E 
predual 308.D 
prefix condition code 213.D 
pre-Hilbert space 197.B 
prehomogeneous vector space 450.V 
zeta function associated with 450.V 
premium 214.A 
net 214A 
risk 214.B 
savings 214.B 
prenex normal form (in predicate logic) 411.J 
preordering 311.H 
preparation theorem 





Subject Index 
Prime formula 


for C?-functions | 58.C 
Weierstrass 21.E 370.B 
Weierstrass type (for microdifferential opera- 
tors) 274.F 
presentation 235.B 
of finite (@-Modules) 16.E 
Wirtinger (of a knot group) 235.B,D 
presheaf 383.A 
sheaf associated 383.C 
presheaf ona site 16.AA 
pressure 402.G 419.A 
pressure, topological 136.H 
pressure equation 205.B 
prestratification, Whitney 418.G 
preventive maintenance model 307.C 
prey-predator relation 263.B 
price 
imputed 292.C 
shadow 255.B 
primal problem 255.B 
primary Abelian group 2.A 
primary cohomology operation 64.B 
stable 64.B 
primary component 
(of an ideal) 67.F 
embedded (of an ideal) 67.F 
isolated (of an ideal) 67.F 
primary difference 305.C 
primary ideal 67.F 
p 67.F 
primary obstruction 147.L 305.C 
primary problem  255.B 
primary ring 368.H 
completely 368.H 
semi- 368.H 
primary solution (of a homogeneous partial dif- 
ferential equation) 320.E 
primary submodule 284.A 
prime(s) 
(3-manifold) 65.E 
over an element (in a lattice) 243.F 
under an element (in a lattice) 243.F 
Mersenne 297.E 
relatively (fractional ideals) 14.E 
relatively (numbers) 297.A 
twin 123.C 
prime differential ideal (of a differential ring) 113 
prime divisor 
(of an algebraic function field of dimension 1) 
9.D 
(of an algebraic number field or an algebraic 
function field of one variable) 439.H 
(ofan ideal) 67.F 
(on a Riemann surface) 11.D 
embedded (of an ideal) 67.F 
finite 439.H 
imaginary infinite 439.H 
infinite 439.H 
isolated (of an ideal) 67.F 
maximal (of an ideal) 67.F 
minimal (of an ideal) 67.F 
rea] 439.H 
real (infinite) 439.H 
prime element 
(ofaring) 67.H 
(for a valuation) 439.E 
prime field 149.B 
prime formula 411.D 
(of a language) 276.A 


Subject Index 
Prime ideal 


prime ideal 67.C 
(of a maximal order) 27.A 
associated (of an ideal) 67.F 
ramified 14. 
unramified 14.I 
prime ideal theorem 123.F 
prime knot 235.A 
prime number(s) 297.B 
regular 14.L 
relatively 297.A 
prime number theorem 123.B 
for arithmetic progression 123.D 
prime quotient (in a lattice) 243.F 
prime rational divisor over a field (on an algebraic 
curve) 9.C 
prime spot (of an algebraic number field or an 
algebraic function field) 439.H 
primitive 427.B 
(differential form) 232.B 
(element of coalgebra) 203.1 
(element of an extension of a field) 149.D 
(generator of the cohomology algebra of a 
compact Lie group) 427.B 
primitive binary quadratic form 348.M 
primitive character 295.D 450.C,E 
non- 450.C,E 
primitive equation 172.G 
primitive form 232.B 
primitive function(s) 216.C 
derivatives and App. A, Table 9.1 
primitive hyperbolic type (reduced basis of) 92.C 
primitive hypercubic type (reduced basis of) 92.C 
primitive ideal (of a Banach algebra) 36.D 
primitive idempotent element (of a ring) 368.B 
primitive lattice 92.E 
primitive operation (of a group) 362.B 
primitive permutation group 151.H 
primitive permutation representation (of a group) 
362.B 
primitive polynomial 337.D 
primitive recursive 356.B 
primitive recursive function 356,A,B,F 
uniformly 356.B 
primitive recursive in dr... 356.B 
primitive recursive predicate 356.B 
primitive root of unity 14.L 
modulom 297.G 
primitive solution (of a partial differential equation) 
320.E 
principal adele (of an algebraic number field) 6.C 
principal analytic set 23.B 
principal antiautomorphism (of a Clifford algebra) 
61.B 
principal automorphism (Clifford algebras) 61.B 
principal axis (axes) 
(of acentral conic) 78.C 
(of inertia) 271.E 
(of a parabola) 78.C 
(of a quadric surface) 350.B 
transformation to 390.B 
principal bundle 147.C 
associated 147.D 
reduced 147.J 
reducible 147.J 
principal character 295.D 
(of an Abelian group) 2.G 
principal component(s) 
(in principal component analysis) 280.F 
oforderp 110A 
principal component analysis 280.F 
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principal congruence subgroup of level N 122.D 
principal convergent (of an irrational number) 83.B 
principal curvature 
(ofa surface) 111.H 365.C 
radius of (of a surface) 111.H 
principal directions (of a surface) 111.H 
principal discrete series 258.C 
principal divisor 
(on an algebraic curve) 9.C 
(on a Riemann surface) 11.D 
principal fiber bundle 147.C 
principal formula of integral geometry 218.C 
principal fractional ideal 67.K 
principal genus 
(for an ideal group) 59.E 
of a quadratic field 347.F 
principal half-space (of a flag) 139.B 
principal H-series 437.X 
principal ideal 67.K 
of an algebraic number field 14.E 
principal ideal domain 67.K 
principal ideal ring 67.K 
principal ideal theorem (in class field theory) 59.D 
principal idele (of an algebraic number field) oC 
principal isotropy group(s) 431.C 
principal matrix (belonging to a Riemann matrix) 
3.1 
principal minor (of a matrix) 103.D 
principal moment of inertia 271.E 
principal normal 111.F 
principal orbit(s) 431.C 
type 431.C 
principal order (of an algebraic number field) 14.B 
fundamental theorem of 14.C 
principal part 
(of a differential operator) 112.A 
(ofa Laurent expansion) 198.D 
(of a partial differential operator) 320.B 
principal partition 66.H 
principal plane (of a quadric surface) 350.B 
principal point(s) 
(for a Gauss mapping) 180.B 
principal polarization (of an Abelian variety) 3.G 
principal quantum number 315.E 
principal series 258.C 
(inanQ-group) 190.G 
(of unitary representations of a complex semi- 
simple Lie group) 437.W 
(of unitary representations of a real semisimple 
Lie group) 437.X 
principal solution 104.B 
principal space (of a flag) 139.B 
principal subspace (of a linear operator) 390.B 
principal symbol 237.H 
(of a microdifferential operator) 274.F 
(of a simple holonomic system) 274.H 
principal theorem, Ahlfors 367.B 
principal value 
(of inverse trigonometric functions) 131.E 
Cauchy (of an improper integral) 216.D 
Cauchy (of the integral on infinite intervals) 
216.E 
oflogZ 131.G 
principle(s) 
argument 198.F 
balayage 338.L 
Bellman 405.B 
Cartan maximum 338.L 
complete maximum 338.M 
of condensation of singularities 37.H 
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of conditionality 401.C 

continuity 21.H 

continuity (in potential theory) 338.C 

contraction 286.B 

correspondence 351.D 

of counting constants 16.S 

Dedekind (in a modular lattice) 243.F 

of depending choice (DC)  33.F 

dilated maximum (in potential theory) 338.C 

Dirichlet 120.A 323.E 

Dirichlet drawer 182.F 

domination 338.L 

Donsker invariance 250.E 

duality (for closed convex cones) 89.F 

duality, for ordering 311.A 

of duality (in projective geometry) 343.B 

embedding (in dynamic programming) 127.B 

energy 338.D 

energy minimum 419.A 

enthalpy minimum 419.C 

entropy maximum 419.A 

of equal weight 402.E 

equilibrium 338.K 

of equivalence (in insurance mathematics) 
214.A 359.D 

Fermat 180.A 441.C 

first maximum (in potential theory) 338.C 

Fisher three 102.A 

Frostman maximum . 338.C 

general, of relativity 359.D 

Gibbs free energy minimum  419.C 

Hamilton 441.B 

Hasse 348.G 

Helmholtz free energy minimum  419.C 

Huygens 325.B 446 

Huygens, in the wider sense 325.D 

invariance 375.B 400.E 

of invariance of speed of light 359.B 

inverse domination 338.L 

of least action 441.B 

limiting absorption 375.C 

of linearized stability 286.8 

of localization (on convergence tests of Fourier 
series) 159.B 

local maximum modulus 164.C 

lower envelope 338.M 

Maupertuis 180.A 

maximal 193.E . 

maximum (for control theory) 86.F 

maximum (for a holomorphic function) 43.B 

maximum (for minimal surfaces} 275.B 

maximum modulus (for a holomorphic func- 
tion) 43.B 

minimax (for eigenvalues of a compact opera- 
tor 68.H 

minimax (for AA) 391.G 

minimax (for statistical decision problem) 
398.B 

minimum (for 4) 391.D 

minimum (for 4,) 391.G 

of nested intervals (for real numbers) 87.C 
355.B 

Oka 21.K 147.0 

of optimality 127.A 

Pauli 351.H 

quasicontinuity (in potential theory) 338.1 

Rayleigh 68.H 

reflection 45.E 

of reflection 74.E 

Schwarz, of reflection 198.G 


Subject Index 
Problem(s) 


separation 405.C 
special, of relativity 359 
stochastic maximum 405.D 
stored program 75.B 
Strassen invariance 250.E 
of sufficiency 401.C 
of superposition 252.B 322.C 
sweeping-out 338.L 
Ugaheri maximum 338.C 
uniqueness (in potential theory) 338.M 
upper boundedness (in potential theory) 338.C 
variational 441 
variational (in statistical mechanics) 340.B 
402.G 
variational (in the theory of elasticity) 271.G 
variational, with relaxed continuity requirement 
271.G 
variational, for topological pressure 136.H 
Pringsheim theorem 58.E 
prior density 401.B 
prior distribution 401.B 403.G 
probabilistic model 397.P 
probability 342 
additivity of 342.B 
a posteriori 342.F 
apriori 342.F 
binomial, paper 19.B 
conditional 342.E 
continuous in 407.A 
converge in 342.D 
converge with, 1 342.D 
critical percolation 340.D 
error 213.D 
ofanevent 342.B 
that event c occurs 342.B 
event with, 1 342.B 
extinction 44.B 
geometric 218.A 
hitting, for single points 5.G 
objective 401.B 
regular conditional 342.E 
run 214C 
standard transition 260.F 
subjective 401.B 
theory of 342.A 
transition 260.A 261.A 351.B 
probability amplitude 351.D 
probability density 341.D 
probability distribution(s) 342.B, App. A, Table 22 
(one-dimensional, of random variable) 342.C 
(of random variables) 342.C 
conditional 342.E 
n-dimensional 342.C 
probability generating function 341.F 397.G 
probability integral App. A, Table 19.II 
probability measure 341 342.B 
probability of loss 307.C 
probability paper 19.F 
binomial 19.B 
probability ratio test, sequential 400.L 
probability space 342.B 
probable cause, most 401.E 
probable value, most 401.E 
problem(s) 
Abel 217.L 
abstract Cauchy 286.X 
acoustic 325.L 
adjoint boundary value 315.B 
all-integer programming 215.A 
Appolonius (in geometric construction) 179.A 


Subject Index 
Problem(s) 


Behrens-Fisher 400.G 

Bernshtein, generalized 275.F 

boundary value (of ordinary differential equa- 
tions) 303.H 315.A 

Burnside (in group theory) 161.C 

Cauchy (for ordinary differential equations) 
316.A 

Cauchy (for partial differential equations) 
320.B 321.A 325.B 

class field tower 59.F 

combinatorial App. A, Table 17.II 

combinatorial triangulation 65.C 

concave programming 292.A 

conditional, in the calculus of variations 46.A 

connection 253.A 

construction 59.F 

convex programming 292.A 

corona 43.G 

correctly posed (for partial differential equa- 
tions) 322.A 

Cousin, first 21.K 

Cousin, second 21.K 

Cramer-Castillon (in geometric construction) 
179.A 

critical inclination 55.C 

decision 71.B 97 186J 

Delos (in geometric construction) 179.A 

Dido 228.A 

differentiable pinching 178.E 

Dirichlet 120 193.F 323.C 

Dirichlet, with obstracle 440.B 

Dirichlet divisor 242.A 

dual 255.B 349.B 

du Bois Reymond 159.H 

eigenvalue 390.A 

exterior (Dirichlet problem) 120.A 

first boundary value 193.F 323.C 

flow-shop scheduling 376 

four-color 157 

Gauss circle 242.A 

Gauss variational 338.J 

general boundary value 323.H 

generalized eigenvalue 298.6 

generalized isoperimetric 46.A 228.A 

generalized Pfaff 428.B 

Geócze 246.D 

geometric construction 179.A 

Goldbach AC 

group-minimization 215.C 

Hamburger moment 240.K 

Hausdorff moment 240.K 

Hersch 391.E 

Hilbert (in calculus of variations) 46.A 

Hilbert fifth 423.N 

homeomorphism 425.G 

homogeneous boundary value (of ordinary 
differential equations) 315.B 

Hukuhara 315.C 

of identification (in econometrics) 128.C 

impossible construction 179.A 

inconsistent (of geometric construction) 179.A 

inhomogeneous boundary value (of ordinary 
differential equations) 315.B 

initial value (for functional differential equa- 
tions) 163.D 

initial value (of ordinary differential equations) 
313.C 316.A 

initial value (for partial differential equations) 
321.A 
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initial value, for a hyperbolic partial differential 
equation App. A, Table 15.III 

interior (Dirichlet problem) 120.A 

interpolation 43.F 

invariant measure 136.C 

inverse (in potential scattering) 375.G 

isomorphism (for graphs) 186 

isomorphism (for integral group algebra) 
362.K 

isoperimetric  111.E 228.A 

Jacobi inverse 3.L 

job-shop scheduling problem 376 

k-sample 371.D 

Lagrange (in calculus of variations) 46.A 

LBA 31.D 

Levi 21.1 

linear least squares 302.E 

linear programming 255.A 

local (on the solutions of differential equations) 
289.A 

machine scheduling 376 

machine sequencing 376 

Malfatti (in geometric construction) 179.A 

many-body 402.F 420.A 

martingale 115.C 261.C 406.A 

maximum flow 281.C 

minimum-cost flow 281.C 

mixed integer programming 215.A 

multicommodity flow 281.C 

multiprocessor scheduling 376 

n-body 420.A 

n-decision 398.A 

network-flow 281 282.B 

Neumann (for harmonic functions) 193.F 

Neumann (for partial differential equations of 
elliptic type) 323.F 

nonlinear 291 

normal Moore space 425.AA 

optimal regulator Sot 

penalized 440.B 

Pfaff 428.A 

placement 235.A 

Plateau 334.A 

possible construction 179.A 

primal 255.B 

primary 255.B 

properly posed 322.A 

pure integer programming 215.A 

quadratic programming 292.A 349.A 

random walk 260.A 

representation (on surface) 246.I 

restricted Burnside (in group theory) 161.C 

restricted three-body 420.F 

Riemann 253.D 

Riemann-Hilbert (for integral equations) 217.J 

Riemann-Hilbert (for ordinary differential 
equations) 253.D 

Robin 323.F 

of satisfiability (of a proposition) 97 

Schéenflies 65.G 

second boundary value (for harmonic functions) 
193.F 

second boundary value (for partial differential 
equations of elliptic type) 323.F 

second Cousin 21.K 

self-adjoint boundary value 315.B 

sequential decision 398.F 

shortest path 281.C 

single- commodity flow 281 
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singular initial value (for partial differential 
equations of mixed type) 326.C 

smoothing 114.C 

special isoperimetric 228.A 

of specification 397.P 

statistical decision 398.A 

Steiner (in geometric construction) 179.A 

Stieltjes moment 240.K 

Sturm-Liouville 315.B 

third boundary value (for harmonic functions) 
193.F 

third boundary value (for partial differential 
equations of elliptic type) 323.F 

three big 187 

three-body 420A 

Thues (general) 31.B 

time optimal control Set 

transformation (in a finitely presented group) 
161.B 

transient 322.D 

transportation 255.C 

transportation, on a network 255.C 

Tricomi 326.C 

two-body 55.A 

two-point boundary value (of ordinary dif- 
ferential equations) 315.A 

two-terminal 281 

type (for Riemann surfaces) 367.D 

of universal validity of a proposition 97 

Waring AE 

weak form of the boundary value 304.B 

well-posed (in general case) 322.A 

word (in a finitely presented group) 161.B 

0—1 integer programming 215.A 


procedure 


classification 280.1 
exploratory 397.Q 
Lyapunov-Schmidt 286.V 
random sampling 373.A 
sampling 373.A 
shortest-path 281.C 
statistical decision 398.A 


process 


(in catastrophe theory) 51.F 

(on a measure space) 136.E 
(=stochastic process) 407.A 
additive 5 342.A 
age-dependent branching 44.E 
asymmetric Cauchy 5.F 
autoregressive 421.D 
autoregressive integrated moving average 
421.G 

autoregressive moving average 421.D 
Bernoulli 136.E 

Bernoulli, very weak 136.E 
Bernoulli, weak 136.E 

birth 260.G 

birth and death 260.G 

bond percolation 340.D 
branching 44 342.A 

branching Markov 44.E« 
Cauchy 5.F 

centered 5.B 

compound Poisson 5.F 
contact 340.C 
continuous-state branching 44.E 
death 260.G 

diffusion 115 

dual 261.F 


Subject Index 
Process 


exponent of the stable 5.F 
Feller 261.B 

finitely determined (E DI 136.E 
Galton-Watson branching 44.B 
Gaussian 176 342.A 
Gaussian, complex 176.C 
generalized stochastic 407.C 
homogeneous Markov 5.H 
Hunt 261.B 

increasing 262.D 

independent 136.E 

with independent increments  5.B 
integrable, of bounded variation 406.B 
integrable increasing 406.B 
invariant Markov 5.H 
irreversible 402.A 

isothermal 419.B 

Itó 406.B 

Lévy 5.B 

linear stationary iterative 302.C 
Markov 261 342.A 

Markov branching 44.D 
Markovian decision 127.E 
moving average 421.D 
multistage allocation 127.A 
multistage choice 127.A 
multitype Galton-Watson | 44.C 
multitype Markov branching 44.E 
Newton iterative 301.D 
normal 176.C 

observation 405.F 

one-sided stable, of the exponent a 5.F 
oscillator 315.F 

osculating 77.B 

percolation 340.D 

point 407.D 

Poisson 5.D 

positive regular 44.C 
progressive 407.B 

quadratic variation 406.B 
quasistatic adiabatic 419.B 
recurrent 261.B 

reversed 261.F 

sample 407.A 

shift associated with the stationary 136.D 
o- (of acomplex manifold) 72.H 
signal 405.F 

site percolation 340.D 
spatially homogeneous 261.A 
stable 5.F 

stationary 342.A 395.A 
stationary Gaussian 176.C 
strictly stable 5.F 

strictly stationary 395.A 
stochastic 342.A 407 


stochastic, with stationary increments of order n 


395.1 

strongly stationary 205 A E 

strong Markov 261.B 
subadditive 136.B 

sweeping-out 338.L 

symmetric Cauchy 5.F 

symmetric stable 5.F 

system 405.F 

temporally homogeneous 261.A 
temporally homogeneous additive 5.B 
transient 261.B 

very weak Bernoulli (V. W.B.) 136.E 
weak Bernoulli (W.B.) 136.E 


Subject Index 
Processing, data 


weakly stationary 395.A 
weakly stationary, of degree k 395.1 
Wiener 5.D 45.B 
processing, data 96 
processor, central 75.B 
producers risk 404.C 
product(s) 
(of algebraic varieties) 16.A 
(of cardinal numbers) 49.C 
(of completely additive classes) 270.H 
(of elements of a graded algebra) 203.B 
(of elements of a group) 190.A 
(of hyperfunctions) 125.X 274.E 
(of ideals) 67.B 
(of knots) 235.A 
(of linear operators) 37.C 251.B 
(of matrices) 269.B 
(of objects) 52.E 
(of ordinal numbers) 312.C 
(of paths) 170 
(in quadrangular set of six points) 343.C 
(of real numbers) 355.B 
(of sets) 381.B 
(of tensors) 256.K 
amalgamated (of a family of groups) 190.M 
Blaschke 43.F 
bracket (in a Lie algebra) 248.A 
cap (in (co)homology groups of a space) 201.K 
cap (in homological algebra) 200.K 201.K 
cardinal (of a family of ordered sets) 311.F 
Cartesian (of a family of sets) 381.E 
Cartesian (of mappings) 381.C 
Cartesian (of ordered simplicial complexes) 
70.C 
Cartesian (of s.s. complexes) 70.E 
Cartesian (of sets) 381.B 
Cauchy (of series) 379.F 
complex (of cell complexes) 70.D 
cross- (in cohomology groups ofa space) 201J 
cross- (in homology groups ofa space) 201.J 
cross- (of vector bundles) 237.C 
crossed (of a C*-algebra) 36.I 
crossed (of a commutative ring and a group) 
29.D 
crossed (in von Neumann algebra theory) 
308.I 
cup (of cohomology classes) 201.1 
cup (in K-theory) 237.C 
cup, reduction theorem 200.M 
cup of derived functors 200.K 
difference 337.1 
direct — direct product 
divergent infinite 374.F 
exterior (of differential forms) 105.Q 
exterior (of elements of a linear space) 256.0 
exterior (of a p-vector and g-vector) 256.0 
exterior (of two vectors) 442.C 
external (of derived functors} 200.K 
Euler 450.V 
fiber, over S 52.G 
free (of groups) 190.M 
Hermitian inner 256.Q 
of ideals of a commutative ring 67.B 
of inertia 271.E 
infinite 379.G 
infinite App. A, Table (ONT 
inner (in a Hermitian linear space) 256.Q 
inner (in a Hilbert space) 197.B 
inner (in a metric vector space) 256.H 
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inner (for a pairing) 424.G 
inner (with respect to a linear space and its dual 
space) 256.G 
inner (of two hyperspheres) 76.A 
inner (of two n-tuples) 256.A 
inner (of two vectors) 442.B 
inner, space 442.B 
internal (of derived functors) 200.K 
interior (of a differential form with a vector 
field) 105.Q 
intersection (of homology classes) 201.0 
Intersection (of two subvarieties) 16.Q 
Kronecker (of matrices) 269.C 
logical (of propositions) 411.B 
ordinal (of a family of ordered sets) 311.G 
partial 379.G 
Pontryagin 203.D 
projective C*-tensor 36.H 
proper (of two normal g-lattices) 27.A 
restricted direct 6.B 
Riemannian (of Riemannian manifolds) 364.A 
scalar (of linear operators} 37.C 
scalar (of two vectors) App. A, Table 3.I 
scalar triple (of three vectors) 442.C 
skew (of measurable transformations) 136.D 
slant 201.K 
smash 202.F 
spatial tensor 36.H 
sum of 216.A 
symmetric (of a topological space) 70.F 
tensor (of A-homomorphisms) 277.J 
tensor (of algebras) 29.A 
tensor (of A-modules) 277 
tensor (of chain complexes) 201J 
tensor (of cochain complexes) 201. 
tensor (of distributions) 125.K 
tensor (of Hilbert spaces) 308.C 
tensor (of linear mappings) 256.I 
tensor (of linear representations) 362.C 
tensor (of linear spaces) 256.1 
tensor (of locally convex spaces) 424.R 
tensor (of sheaves) 383.1 
tensor (of vector bundles) 147.F 
tensor (of von Neumann algebras) 308.C 
torsion (in a category) 200.K 
torsion (of two A-modules) 200.D,K 
vector 442.C, App. A, Table 3.1 
vector triple 442.C, App. A, Table 3.I 
wedge (of derived functors) 200.K 
Weierstrass canonical 429.B 
Whitehead 202.P 
product algebraic variety 16.A 
product bundle 147.E 
product category 52.B 
product complex 200.H 350.D 
product decomposition, dual direct 422.H 
product double chain complex | 200.E 
product event. 342.B 
product formula 
(for the Hilbert norm-residue symbol) 14.R 
(on invariant Haar measures) 225.E 
(for the norm-residue symbol) 14.Q 
(on valuations) 439.H 
Trotter 315.F 
production planning 376 
production rule 31.B 
product mapping 425.K 
product measure 270.H 
product measure space 270.H 
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complete 270.H 
product metric space 273.B 
product of inertia 271.E 
product rule 299.D 
product space 425.L 

reduced 202.Q 
product theorem for dimension 117.C 
product topological space 425.L 
product topology 425.L 
product uniformity 436.E 
product uniform space 436.E 
profinite groups 210.C 
program 75.C 

machine-language 75.C 
program evaluation and review technique 376 
programming 75.C 385.B 

bilinear 364.D 

chance-constrained 408.A 

convex 264.C 

disjunctive 264.C 

dynamic 127.A 264.C 

fractional 264.C 

geometric 264.D 

integer 264.C 

linear 264.C 

linear mathematical 264.D 

mathematical 264.A 

multiobjective 264.C 

multistage 264.C 

network 264.C 

nonconvex 264.D 

parametric 264.C 

stochastic 264.C 408.A 

stochastic, model 307.C 

two-stage linear, under uncertainty 255.F 

two-stage stochastic 408.A 
programming problem 

all-integer 215.A 

concave 292.A 

convex 292.A 

linear 255.A 

mathematical 264.B 

mixed integer 215.A 

nonlinear 264.C 

pure-integer 215.A 

quadratic 292.A 349.A 

0—1 integer 215.A 
progression 

arithmetic 379.1, App. A, Table 10.I 

geometric 379.1, App. A, Table 10.I 
progressive (set) 407.B 


progressively measurable (stochastic process) 407.B 


progressive process 407.B 
projecting (in a projective space) 343.B 
projection 
(ofa covering space) 367.B 
(from a direct product set) 381.E 
(of a fiber bundle) 147.B 
(ofafiberspace) 148.B 
(ofa Hilbert space) 197.E 
(onto a homogeneous space) 199.A 
(in a projective space) 343.B 
(to a quotient set defined by an equivalence 
relation) 135.B 
(on a tangent bundle of a Banach manifold) 
286.K 
canonical (on modules) 277.F 
canonical (onto a quotient set) 135.B 
center of (in projective geometry) 343.B 
Lie minimal 76.B 


Subject Index 
Projective set of class n 


method of orthogonal (of H. Weyl) 323.G 
orthogonal 139.E,G 
orthogonal (on a Hilbert space) 197.E 
parallel (in an affine space) 7.C 
regular knot 235.A 
relaxation with 440.E 
stereographic 74.D 
unramified (of a covering surface) 367.B 
projection matrix 269.1 
projection method, Rosen's gradient 292.E 
projection operator (in a Hilbert space) 197.E 
projective 
(Banachspace) 37.M 
(object in an Abelian category) 200.I 
projective (object) 200.1 
projective algebraic variety 16.A 
fundamental theorems of 72.F 
quasi- 16.C 
projective A-module 277.K 
projective approximation method 304.B 
projective C*-tensor product 36.H 
projective class 200.Q 
projective class group 200.K 
projective collineation 343.D 
in the wider sense 343.D 
projective connection 80.0 
projective coordinates 343.C 
projective coordinate system 343.C 
projective curvature tensor App. A, Table AU 
projective deformation (between surfaces} 110.B 
projective determinacy 22.H 
projective differentia] geometry 110.B 
projective dimension (of a module) 200.K 
projective frame (in projective geometry) 343.C 
projective general linear group 60.B 
of degree n over K 60.B 
projective geometry 343 
finite-dimensional 343.B 
fundamental theorem of 343.D 
general 343.B 
of paths 109 
projective limit 
(ina category) 210.D 
(of a projective system of sets) 210.B 
(of a projective system of topological groups) 
423.K 
projective limit group | 210.C 
projective limit space 210.C 
projective line 343.B 
projective line element 110.B 
projectively flat space App. A, Table 4.II 
projectively related (fundamental figures) 343.B 
projective mapping (in projective geometry) 343.B 
projective module, (R,S)- 200.K 
projective morphism 16.E 
quasi- 16.E 
projective plane 343.B 
Cayley 54 
finite 241.B 
projective representation 
(ofa group) 362.J 
irreducible 362.J 
similar 362.J 
projective resolution 
(in an Abelian category) 200.I 
left (of an A-module) 200.C 
P- 200.Q 
projective scheme 16.E 
quasi- 16.E 
projective set of classn 22.D 


Subject Index 
Projective space 


projective space 343.B 
complex 343.D 
infinite-dimensional complex 56.C 
infinite-dimensional real 56.B 
over A 147.E 
let 343.F 
real 343.D 
right 343.F 
projective special linear group  60.B 
(over a noncommutative field) 60.0 
projective special unitary group over K 60.H 
projective symplectic group over K 60.L 
projective system 
(in a category) 210.D 
(of groups) 210.C 
(of sets) 210.B 
(of topological groups) 423.K 
(of toplogical spaces) 210.C 
projective topology 424.R 
projective transformation 343.D 364.F 
group of 343.D 
regular 343.D 
singular 343.D 
singular, of the kth species 343.D 
projective transformation group 343.D 
projective unitary group 60.F 
projective variety 16.A 
prolate App. A, Table 3.V 
proliferation (of errors) 138.D 
prolongable (Riemann surface) 367.F 
prolongation 
(along a curve in a covering surface) 367.B 
(of a Riemann surface) 367.F 
(of a solution of an ordinary differential equa- 
tion) 316.C 
(of a system of partial differential equations) 
428.B,F 
(ofa valuation) 439.B 
analytic 198.G 
first (of P) 191.E 
kth (of G) 191.D 
kth (of a Lie subalgebra) 191.D 
kth (of P) I91.E 
prolongational limit set 
first negative 126.D 
first positive 126.D 
proof, consistency 156.D 
for pure number theory 156.E 
proof theory 156.D 
propagation 
of chaos 340.F 
equation of sound 325.A 
oferrors 138.C 
of singularities  325.M 
wave 446 
proper 
(continuous mapping) 425.W 
(equivalence relation in an analytic space) 
23.E 
(leaf) 154.D 
{Lorentz group) 258.A 
(morphism of scheme) 16.D 
(PL embedding) 65.D 
proper affine transformation 7.E 
proper class (in set theory) 381.G 
proper complex Lorentz group 258.A 


proper component (of an intersection of subvarieties) 


16.G 
proper convex function 88.D 
proper factor (of an element ofa ring) 67.H 
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proper flag manifold 199.B 
proper function (of a boundary value problem) 
315.B 
proper hypersphere (in hyperbolic geometry) 
285.C 
proper Lorentz group | 60.J 
properly convex (subset of a sphere) 274.E 
properly discontinuous transformation group 
122.A 
properly divergent 379.A 
properly equivalent (binary quadratic forms) 
348.M 
properly infinite 308.E 
properly intersect (on a variety) 16.G 
properly (n — 1)-dimensional quadric hypersurface 
350.G 
properly posed 
(initial value problem) 321.E 
(problems for partial differential equations) 
322.A 
proper mapping(s) 425.W 
fundamental theorem of 16.X 
proper meromorphic mapping (between analytic 
spaces) 23.D 
proper modification (of an analytic space) 23.D 
proper motion 
in Euclidean geometry 139.B 
ofastar 392 
proper orthogonal group 60.1 258.A 
proper orthogonal matrix 269J 
proper product (of two normal g-lattices) 27.A 
proper quadric surface 350.B 
proper rotation group 258.A 
proper subset 381.A 
proper time 258.A 
proper transform (of a subvariety) 16.1 
property (properties) 411.G 
approximation 37.L 
asymptotic (of solutions of a system of linear 
ordinary differential equations) 314.A 
Baire 425.N 
basic (of a structure) 409.B 
bounded approximation 37.L 
clustering 402.G 
combinatorial 65.A 
continuity, for Cech theory 201.M 
of continuity (in a continuous geometry) 85.A 
countably productive 425.Y 
covering homotopy 148.B 
duality (of linear spaces) 256.G 
equivalence 135.A 
finite intersection 425.8 
finite subset 396.F 
global (in differential geometry) 109 
homotopy extension 202.E 
in the large (in differential geometry) 109 
local (in differential geometry) 109 
local (of a pseudodifferential operator) 345.A 
Markov 261.B 
micro-pseudolocal (of a pseudodifferential 
operator) 345.4 
minimum curvature 223.F 
minimum norm  223.F 
P conjecture 235.B 
pseudolocal (of a pseudodifferential operator) 
345.A 
pseudo-orbit tracing 126.J 
reproducing (of a probability distribution) 
341.E, App. A, Table 22 
in the small (in differential geometry) 109 
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spectral 136.E 
star-finite 425.8 
strong Markov 261.B 
topological 425.G 
uniformity 399.N 
universal mapping 52.L 
proper value 
(of a boundary value problem) 315.B 
(of a linear mapping) 269.L 
(of a linear operator) 390.A 
(ofa matrix) 269.F 
proper variation 279.F 
proper vector 
(belonging to an eigenvalue) 269.F 
(ofa linear operator) 390.A 
(of a linear transformation) 269.L 


proposition(s) 
existential 411.B 
modal 411.L 


universal 411.B 

variables 411.E 
propositional calculus 411.F 
propositional connectives 411.E 
propositional function 411.C 
propositional logic 411.E 
provable (formula) 411.1 


proximity function (of a meromorphic function) 


272.B 
Prüfer ring 200.K 
pseudoanalytic function, K- 352.B 
pseudo-arc 79.D 
pseudocompact (space) 425.8 
pseudoconformal geometry 344.A 
pseudoconformally equivalent 344.A 
pseudoconformal transformation 344.A 
pseudoconvex (domain) 21.G 

Cartan 21.1 

d 21.G 

Levi 21.] 

locally Cartan 21.1 

locally Levi 21.1 

strictly 344A 

strongly 21.G 
pseudodifferential operator 251.0 274.F 345 
pseudodistance 

Carathéodory 21.0 

Kobayashi 21.0 
pseudodistance function 273.B 
pseudofunction 125,C 
pseudogeometric ring 284.F 
pseudogroup (of topological transformations) 

105.Y 


of transformations (on a topological space) 


90.D 
pseudogroup structure 105. Y 
pseudo-Hermitian manifold 344.F 
pseudointerior 382.B 
pseudo-isotopic 65.D 
pseudo-isotopy 65.D 


pseudolocal property (of a pseudodifferential opera- 


tor) 345.A 

micro- 345.A 
pseudomanifold 65.B 
pseudometric 273.B 
pseudometric space 273.B 

indiscrete 273.B 
pseudometric uniformity 436.F 
pseudometrizable 436.F 


pseudonorm (on a topological linear space) 37.0 


424.F 





Subject Index 
q-expansion formula 


pseudo-orbit 126.J 
o 126J 
tracing property 126J 
pseudo-ordering 311.H 
pseudopolynomial, distinguished 21.E 
pseudorandom numbers 354.B 
pseudo-Riemannian metric 105.P 
pseudo~Runge-Kutta method 303.D 
pseudosphere 111.1 285.E 
pseudotensorialform 80.G 
pseudovaluation 439.K 
w-collective 354.E 
psifunction 174.B 
psychometrics 346 
Puiseux series 339.A 
pullback 
(of a differential form) 105.Q 
(ofa distribution) 125.Q 
(ofa divisor) 16.M 
Puppe exact sequence 202.G 
pure 
(continued fraction) 83.C 
(differential form) 367.H 
(state) 351.B 
pure geometry 181 
pure ideal 284.D 
pure integer programming problem 215.A 
purely contractive 251.N 
purely contractive part 251.N 


` purely d-dimensional analytic set 23.B 


(ata point) 23.B 
purely discontinuous distribution 341.D 
purely imaginary number 74.A 
purely infinite (von Neumann algebra) 308.E 
purely inseparable 

(extension ofa field) 149.H 

(rational mapping) 16.1 
purely inseparable element (of a field) 149.H 
purely n-codimensional 125.W 
purely nondeterministic 395.D 
purely transcendental extension 149.K 
pure number theory 156.E 
pure periodic continued fraction 83.C 
pure phase 402.G 
pure point spectrum 136.E 
pure strategy 173.B 
pursuit, curve of 93.H 
push-down automaton 31.D 
push-down storage 96.E 
Putnam's theorem 251.K 
Pyatetskii-Shapiro reciprocity law, Gel'fand- 

437.DD 

Pythagorean closure (of a field) 155.C 
Pythagorean extension (of a field) 155.C 
Pythagorean field 139.B 155.C 
Pythagorean number 145 
Pythagorean ordered field 60.0 
Pythagorean theorem 139.D 


Q 


Q (rational numbers) 294.A,D 

g-block bundle 147.Q 

q-block structure 147.Q 

q-boundary 201.B 

q-chains 201.B 

g-cochains, singular 201.H 

q-cycle 201.B 

q-dimensional homology classes 201.B 
q-expansion formula 134.1] 


Subject Index 


q-face 
q-face 70.B 
q-function 
Chebyshev 19.G, App. A, Table 20.VII 
simplest Chebyshev 19.G 
q-numbers 130.A 
q-representation 351.C 
q-simplex 
oriented 201.C 
singular 201.E 


standard 201.E 
qth homology group 201.B 
Q-manifold 382.D 
Q-matrix 260.F 
Q-spaces 425.BB 
QR method 298.E 
QZ method 298.C 


QCD (=quantum chromodynamics} 132.D 
q.e. (=quasi-everywhere) 338.F 
QFD (=quantum flavor dynamics) 132.D 


quadrangles 155.F 

complete 343.C 
quadrangular set of six points 343.C 
quadrant, first (of a spectral sequence)  200.J 
quadratic differential (on a Riemann surface) 
quadratic equation 10.D, App. A, Table 1 
quadratic field(s) 347 

complex 347.A 

imaginary 347.A 

real 347.A 
quadratic form(s) 348 

(on a linear space) 256.H 

bilinear form associated with 256.H 

binary 348.M 

complex 348.A,B 

definite 348.C 

equivalent 348.A 

indefinite 348.C 

matrix of 348.A 

negative definite 348.C 

negative semidefinite 348.C 

nondegenerate 348.A 

positive definite 348.C 

positive semidefinite 348.C 

primitive binary 348.M 

properly equivalent binary 348.M 

real 348.A,C 

reduced 348.1 

Siegel zeta function of indefinite 450.K 
quadratic irrational number, irreducible 83.C 
quadratic loss function 398.A 399.E 
quadratic nonresidue 297.H 
quadratic programming 349 

nonconvex 264.D 
quadratic programming problem 292.A 349.A 
quadratic reciprocity of Jacobi symbol, law of 

297.1 
quadratic reciprocity of Legendre symbol, law of 
297.1 

quadratic residue 297.H 
quadratic transformation 16.1] 

locally 16.K 

locally (of an algebraic surface) 15.G 

locally (of a complex manifold) 72.H 
quadratrix 187 
quadrature 107.A, App. A, Table 15 III 

ofa circle 179.A 

method of 313.D 

solution by App. A, Table 14.1 

spectral density 397.N 
quadric(s) 350.A 


11.D 


confocal, family of 350.E 
Darboux 110.B 
pencil of 343.E 
quadric cone 350.B,G 
quadric conical hypersurface 
quadric conical surface 350.B 
quadric cylindrical hypersurface 
quadric hypersurface 350.G,I 
(in a projective space) 343.D 350.1 
central 350.G 
elliptic 350.G 
of the hth species, singular (in a projective 
space) 343.E 
hyperbolic 350.G 
noncentral 350.G 
parabolic 350.G 
pencil of 343.E 
properly (n — 1)-dimensional 350.G 
regular (in a projective space) 343.E 
quadric surface(s) 350 
canonical form of the equation of 350.B 
central 350.B 
degenerate 350.B 
proper 350.B 
quadrivium 187 
qualification 
Gauignard's constraint 292.B 
Slater's constraint 292.B 
qualitative (data) 397.A 
quality 404.A 
quality characteristic 404.A 


350.G 


350.G 


quality control 404.A 
statistical 404.4 

quantifier 411.C 
bounded 356.B 
existentia] 411.C 
Hilberts e- 411J 
universal 411.C 

quantile 
a- 396.C 


oforderp 341.H 

restricted 33.B 
quantitative (data) 397.A 
quantity (quantities) 

analog 138.B 

digital 138.B 

first fundamental (of a surface) 

second fundamental (of a surface) 

thermodynamical 419.A 
quantization 351.D 

second 377.B 
quantized contact transformation 274.F 


111.H 
111.H 


quantum chromodynamics (QCD)  132.C,D 
quantum electrodynamics 132.C 

quantum field theory 132.C 150.C 
quantum flavor dynamics (QFD)  132D 


quantum logic 351.L 
quantum mechanics 351 
quantum number 
azimuthal 351.E 
orbital magnetic 351.E 
principal 351.E 
quantum statistical mechanics 402.A 
quartic equation 10.D, App. A, Table 1 


quartile(s) 396.C 

first 396.C 

third 396.C 
quasi-affine (algebraic variety) 16.C 
quasi-algebraically closed field 118.F 


quasi-analytic function 58.F 
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family of 58.A 
in the generalized sense 58.F 
setof 58.F 
quasi-Banach space 37.0 
quasibarreled (locally convex space) 424.1 
quasibounded harmonic function 193.G 
quasicomplete (locally convex space) 424.F 
quasiconformal mappings 352 
extremal 352.C 
quasiconformal reflection, theorem of 352.C 
quasicontinuity principle (in potential theory) 338.1 
quasicontinuous function 338.1 
quasidiscrete spectrum 136.E 
quasidual space (of a locally compact group) 437.1 
quasi-equivalent unitary representation 437.C 
quasi-everywhere (in potential theory) 338.F 
quasi-Frobenius algebra 29.H 
quasi-Fuchsian group 234.B 
quasigroup 190.P 241.C 
quasi-independent of path (a response probability) 
346.G 
quasi-invariant measure 225.J 
quasi-inverse (in a Banach algebra) 36.C 
quasi-inverse element (of an element of a ring) 
368.B 
quasi-invertible element (of a ring) 368.B 
quasilinear 
(operator) 224.E 
(partial differential equation) 320.A 323.D 
326.A 
quasilocal ring 284.D 
quasinilpotent (operator) 251.F 
quasinorm (of a vector) 37.0 
quasinorma] family (of analytic functions) 435.E 
quasinormed linear space 37.0 
quasi-perfect mapping 425.CC 
quasiperiodic (translational flow) 126.L 
quasiperiodic motion 136.G 404.F 
quasiperiodic solution (of Hill's differential equation) 
268.B 
quasiprojective algebraic variety 16.C 
quasiprojective morphism  16.E 
quasiprojective scheme  16.E 
quasi-p-valent 438.E 
quasiregular element (of a ring)  368.B 
quasiregular function, K- 352.B 
quasisemilocal ring 284.D 
quasisimple ring 368.E 
quasisplit algebraic group, k- 13.0 
quasistable distribution 341.G 
quasistatic adiabatic process 419.B 
quasistationary electric circuit 130.B 
quasisymmetric 384.E 
quasivariational inequalities 440.D 
quaternion 29.B 
quaternion algebra 29.D 
generalized 29.D 
Hamilton 29.B 
total definite 27.D 
quaternion field 29.B 
quaternion group 151.B 
generalized 151.B 
quaternion hyperbolic space 412.G 
quaternion unimodular group 412.G 
quaternion vector bundle 147.F 
query 96.F 
questions 351.L 
queue 96.E 
length 260.H 
queuing model 260.H 307.C 


Subject Index 
r-section 


queuing theory 260.H 307.C 
quotient(s) 
(of an ideal and a subset of a commutative ring) 
67.B 
(in a lattice) 243.F 
(of numbers) 297.A 
(of an ordered set) 311.B 
difference 104.A 
differential (at a point) 106.A 
field of 67.G 
geometric 16.W 
group of (of a commutative semigroup) 190.P 
Herbrand 200.N 
integral (in the division algorithm of poly- 
nomials) 337.C 
module of, of an R-module with respect to $ 
67.G 
prime (in a lattice) 243.F 
Rayleigh 68.H 304.B 
ring of, of a ring with respect to a prime ideal 
67.G 
ring of, of a ring with respect toS 67.G 
ring of total 67.G 
quotient bialgebra 203.G 
quotient bundle 147.F 
(of a vector bundle on an algebraic variety) 
16.Y 
quotient category 52.N 
quotient chain complex 200.C 
quotient coalgebra 203.F 
quotient complex 201.L 
quotient group 
(ofa group) 190.C 
(of a topological group) 423.E 
quotient G-set 362.B 
quotient lattice 243.C 
quotient Lie algebra 248.A 
quotient Lie group 249.G 
quotient (linear) space 
(by a linear subspace) 256.F 
(with respect to an equivalence relation) 256.F 
quotient mapping 425.G 
hereditarily 425.G 
quotient measure 225.H 
quotient object 52.D 
quotient representation (of a linear representation) 
362.C 
quotient set (with respect to an equivalence relation) 
135.B 
quotient singularity 418.C 
quotient space 
(by a discontinuous transformation group) 
122.A 
(ofa linear space) 256.F 
(of a topological space) 425.L 
left (of a topological group) 423.E 
right (of a topological group) 423.E 
quotient system (of an algebraic system) 409.C 
quotient topological space 425.L 
quotient topology 425.L 


R 


R (real numbers) 294.A 355.A 
p-set | 308I 
r-closed space 425.U 
r-frame, tangent 105.H 
r-ple point (of a plane algebraic curve) 9.B 
r-section 
(of a Euclidean (simplicial) complex) 70.B 


Subject Index 
r-skeleton (of a Euclidean complex) 


(of a simplicial complex) 70.C 
r-skeleton (of a Euclidean complex) 70.B 
rth differential 286.E 
rth syzygy 369.F 
R-estimator 371.H 
R-progenerator 29.K 
(R,k)-summable 379.S 
(R, S)-exact sequence (of modules) 200.K 
(R, S)-injective module 200.K 
(R, S)-projective module 200.K 
R-action (continuous) 126.B 
R"-valued random variable 342.C 
Raabe criterion App. A, Table 10.II 
Racah algebra 353.A 
Racah coefficient 353.B 
Rademacher-Men’shov theorem 317.B 


Rademacher system of orthogonal functions 317.C 


radial equation 351.E 
radial maximal function 168.B 
radian 139.D 
radiation condition, Sommerfeld 188.D 
radical(s) 
(of an algebraic group) 13.1 
(of a Banach algebra) 36.D 
(of a commutative Banach algebra) 36.E 
(of a commutative ring) 67.B 
(of an ideal) 67.B 
(of a Jordan algebra) 231.B 
(of a Lie algebra) 248.D 
(ofa ring) 368.H 
Jacobson (of a ring) 67.D 
nilpotent (of a Lie algebra) 248.D 
solution by (of an algebraic equation) 10.D 
solvable by 172.H 
unipotent 13.I 
radius (radii) 
(of a solid sphere) 140 
(ofasphere) 139.1 
associated convergence 21.B 
of convergence (of a power series) 339.A 
of curvature (of a plane curve) 111.E 
of curvature (of a space curve) 111.F 
injectivity  178.C 
of meromorphy (of a power series) 339.D 
of principal curvature (of a surface) 111.H 
spectral 126.K 251.F 390.A 
of torsion (ofa space curve) 111.F 
Radó solution, Douglas- (to Plateau problem) 
275.C 
Radon, decomposition formula of 125.CC 
Radon integral, Lebesgue- 94.C 
Radon measure 270.1 
positive 270.1 
Radon-Nikodym derivative 270.L 380.C 
Radon-Nikodym property 443.H 
Radon-Nikodym theorem  270.L 380.C 
for vector measures 443.H 
Radon transform 218.F 
conjugate 218.F 
raising the subscripts (for tensor fields) 417.D 
Ramanujan conjecture 32.D 
Ramanujan-Petersson conjecture 32.D 
Ramanujan sum 295.D 
ramification, degree of (of a branch point) 367.B 
ramification field (of a prime ideal) 14.K 
mth 14.K 
ramification group 
(of a finite Galois extension) 257.D 
(of a prime ideal) 14.K 
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mth 14K 
ramification index 
(of an algebroidal function) 17.C 
(ofa finiteextension) 257.D 
(of a prime ideal over a field) 14.1 
(ofa valuation) 439.I 
relative (of a prime ideal over a field) 14.I 
ramification numbers (of a prime ideal) 14.K 
ramification point (of an analytic covering space) 
23.E 
ramification theorem (in the theory of algebroidal 
functions) 17.C 
conductor- (in class field theory) 59.C 
ramified (prime ideal) 14.I 
ramified covering space 23.B 
ramified element 198.0 
ramified type theory 411.K 
random, at 401.F 
random current 395.1 
random distribution 395.H 407.C 
with independent values at every point 407.C 
strictly stationary 395.H 
strongly stationary 395.H 
weakly stationary 395.C 
in the wider sense 395.C 
in the wide sense 407.C 
random effect 102.A 
random-effects model 102.A 403.C 
random event 342.B 
random field 407.B 
randomization 102.A 
randomized (decision function) 398.4 
randomized block design 102.B 
randomized design, completely 102.A 
randomized estimator 399.B 
randomized test 00.A 
random measure 
Gaussian 407.D 
Poisson 407.D 
random numbers 354 
pseudo- 354.B 
random sample 374.A 396.B 401.F 
random sampling procedure 373.A 
random Schrodinger equations 340.E 
random sequence 354.E 
random tensor field | 395.I 
random variable(s) 342.C 
distribution of 342.C 
independent 342.C 
joint 342.C 
measurable with respect to a family of 342.C 
n-dimensional 342.C 
one-dimensional probability distribution of 
342.C 
probability distribution of 342.C 
R"-valued 342.C 
(S, &)-valued | 342.C 
random walk 260.A 
general 260.A 
standard 260.A 
range 
(of a correspondence) 358.B 
(of a linear operator) 37.C 
(ofa mapping) 381.C 
(of a population characteristic) 396.C 
(of statistical data) 397.C 
closed, theorem 37.J 
equianharmonic (of points) 343.D 
harmonic (of points) 343.D 
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interquartile 397.C 
long 375.B 
metastable (of embeddings) 114.D 
numerical (of a linear operator) 251.E 
point (in projective geometry) 343.B 
point, of the number system (in projective 
geometry) 343.C 
sample 396.C 
short 375.B 
stable (of embeddings) 114.D 
of values (of a meromorphic function) 62.A 
rank 
(of an analytic mapping) 23.C 
(of a bilinear mapping) 256.H 
(ofa complex) 13.R 
(of a connected compact Lie group) 248.X 
(of an element of a complex) 13.R 
(of an elliptic curve over Q) 118.D 
(of first-order predicates) 411.K 
(of a free Abelian group) 2.C 
(ofa free group) 161.A 
(of a free module) 277.G 
(ofa graph) 186.G 
(ofa Lie algebra) 248.K 
(ofa linear mapping) 256.F 
(ofa matrix) 269.D 
(ofa module) 2.E 
(of a normal j-algebra) 384.C 
(of a prime ideal) 67.E 
(of a quadratic form) 348.A 
(of a sesquilinear form) 256.Q 
(of a symmetric Riemannian homogeneous 
space) 412.D 
(ofa Tits system) 151.J 
(ofa valuation) 439.B 
bispinor 258.B 
of finite (operator) 68.C 
k- 13Q 
p- (of a torsion-free additive group) 2.E 
at a point (of an analytic mapping) 23.C 
rational (of a valuation) 439.B 
rank correlation 
Kendall 371.K 
Spearman 371.K 
rank function 66.F 
Rankine-Hugoniot relation 204.G 205.B 
rank k, irreducible tensor of 353.C 
rank test 
signed 371.B 
Wilcoxon signed 371.B 
Rao inequality, Cramer 399.D 
Raphson method, Newton- 301.D 
rapidly decreasing C*-function 168.B 
rapidly decreasing distribution 125.0 
rapidly decreasing sequence 168.B 
rarefied gas, equation of 41.A 
rate 
coding 213.D 
hazard 397.0 
infinitesimal birth 260.G 
infinitesimal death 260.G 
of interest, assumed 214.A 
transmission 213.A 
ratio 
anharmonic 343.D 


of the circumference of a circle to its diameter 


332 
cross 343.D 
damping (of a damped oscillation) 318.B 
direction (of a line in an affine space) 7.F 


Subject Index 
Reaction, law of 


double 343.D 
likelihood 400.I 
likelihood, test 400.1 
monotone likelihood 374.J 
odds 397.K 
Poison 271.G 
sequential probability, test 400.L 
stiffness 303.G 
ratio ergodic theorem  136.B 
ratio estimator 373.C 
rational action 226.B 
rational cohomology group 200.0 
rational curve 9.C 93.H 
rational differential equation 288.A 
rational divisor 
k- (on an algebraic curve) 9.C 
prime 9.C 
rational double point 418.C 
rational element 198.0 
rational entire function 429.A 
rational expression 337.H 
field of 337.H 
rational extrapolation method 303.F 
rational function(s) 
field of 337.H 
generalized 142.B 
ona variety 16.A 
rational function field in n variables 149.K 
rational function matrix  86.D 
rational homomorphism 3.C 13.4 
rational injectivity 200.0 
rational integer 294.C 
rationally equivalent cycles 16.R 
rational mapping 16.1 
defined along a subvariety 16.1 
purely inseparable | 16.I 
separable 16.1 
rational number(s) 294.D 
denseness of 355.B 
rational operation 294.A 
rational point 118.E 
over a field 369.C 
k’- (of an algebraic variety) 16.A 
rational polyhedral, convex 16.Z 
rational Pontryagin class 56.F 
rational rank (of a valuation) 439.B 
rational realnumber 294.E 
rational representation 
(of GL(V)) 60.D 
(of a matrix group) 226.B 
rational singularity 418.C 
rational surface 15.E 
rational variety 16.J 
uni- 16J 
ratio set 136.F 
asymptotic 308.1 
ray 
(in affine geometry) 7.D 
(in foundation of geometry} 155.B 
(modulo m*) 14.H 
(in a Riemannian manifold) 178.F 
asymptotic 178.F 
grazing 325.L 
paraxial 180.B 
unit 351.B 
Rayleigh principle 68.H 
Rayleigh quotient 68.H 298.C 304.B 
Rayleigh-Ritz method 46.F 271.G 
Rayleigh-Schródinger series 331.D 
reaction, law of 271.A 


Subject Index 
Real analytic (at a point) 


real analytic (at a point) 106.K 
real analytic fiber bundle 147.0 
real analytic foliation 154.H 
real analytic function 106.K 198.H 
exponentially decreasing 125.BB 
real analytic manifold 105.D 
real analytic structure 105.D 
real axis 74.C 
real closed field 149.N 
real-compact space 425.BB 
real field 149.N 
formally 149.N 
totally 14.F 
realform 412 
(of a complex algebraic group) 60.0 
(of a complex Lie aigebra) 248.P 
normal (of a complex semisimple Lie algebra) 
248.Q 
real function 165.B 
real Grassmann manifold 199.B 
real Hilbert space 197.B 
real hyperbolic space 412.G 
real hypersphere 76.A 
oriented 76.A 
real hypersurface, spherical 344.C 
real immersion, totally 365.M 
real infinite prime divisor 439.H 
real interpolation space 224.C 
realizable 
(for a linear representation) 362.F 
(by a submanifold) 114.G 
realization 
(of a linear time-varying system) 86.D 
(of an s.s. complex) 70.E 
(ofan s.s. mapping) 70.E 
minimal 86.D 
realization theorem (of a homotopy group) 202.N 
realization theory 86.D 
real Lie algebra 248.A 
compact 248.P 
real line 355.E 
real linear space 256.A 
real monoidal transform 274.E 
real number(s) 294.E 355 
Cantor's theory of 294.E 
completeness of 294.E 355.B 
connectedness of 294.E 
continuity of 294.E 
Dedekind’s theory of 294.E 
extended 87.E 
infinitesimal 276.E 
irrational 294.E 
mod! 355.D 
nonstandard 276.E 
rational 294.E 
real part 74.A 
real prime divisor 439.H 
real projective space 343.D 
infinite-dimensional 56.B 
real quadratic field 347.A 
real quadratic form 348.A,C 
real representation (of a Lie group) 249.0 
real root (of an algebraic equation) 10.E 
real simple Lie algebra 
classical compact 248.T 
exceptional compact 248.T 
real spectral measure 390.D 
real Stiefel manifold 
of k-frames 199.B 
of orthogonal k-frame 199.B 
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real submanifold, totally 365.M 

real-time (computation) 19.E 

real topological vector space 424.A 

real-valued functions 165.B 

real-valued measurable (cardinal) 33.F 

real variable 165.C 

rearrangement 168.B 

rearrangement invariant 168.B 

reciprocal equation 10.C 

reciprocal linear representation (of an algebra) 
362.C 

reciprocal network | 282.C 

reciprocal permutation representation (of a group) 


362.B 
reciprocal spiral 93.H 
reciprocity 
of annihilators (in topological Abelian groups) 
422.E 


Artin's general law of 59.C 
complementary law of 14.0 
Fourier 160.C 
generallaw of 14.0 
law of quadratic, of Jacobi symbol 297.1 
law of quadratic, of Legendre symbol 297.I 
relations, Onsagers 402.K 
reciprocity law 297.1 
for Dedekind sums 328 
explicit (for Hilbert norm-residue symbol) 
14.R 
Gel'fand- Pyatetskii-Shapiro (on unitary repre- 
sentation) 437.DD 
Shafarevich 257.H 
record 96.B 
rectangle 140 
latin 241.E 
rectangular coordinates (in a Euclidean space) 
90.B 
rectangular distribution App. A, Table 22 
rectangular hyperbola 78.E 
rectangular hyperbolic coordinates 90.C 
rectangular matrix 269.A 
rectangular parallelepiped 140 
rectifiable 
(current) 275.G 
(curve) 93.F 246.A 
locally 143.A 246.A 
rectifying plane 111.F 
rectifying surface 111.F 
rectilinear complex 70.B 
recurrence formulas for indefinite integrals App. 
A, Table 9.]I 
recurrence theorem 136.A,C 
recurrence time 260.C 
mean 260.C 
recurrent 
(Lévy process) 5.G | 
(Markov chain) 260.B 
(Markov process} 261.B 
(nonsingular measurable transformation) 
136.C 
(point of a dynamical system) 126.E 
chain 126.E 
infinitely (measurable transformation) 136.C 
linear (sequence) 295.A 
non- (Markov chain) 260.B 
null (point) 260.B 
positive (ergodic class) 260.B 
positive (point) 260.B 
regionally (flow) 126.E 
strongly (measurable transformation) 136.C 
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recurrent chain 260.B 
recurrent event 250.D 260.C 
delayed 260.C 
recurrent point 
(of a Markov chain) 260.B 
(of a Markov process) 261.B 
recurrent sequence of order r 295.A 
recurrent set 260.E 
chain 126.E 
recursive function(s) 356 
general 356.C,F 
partial 356.E,F 
primitive 356.A,B,F 
uniformly primitive 356.B 
recursively 
(define a partial recursive function) 356.E 
uniformly in V 356.E 
recursively enumerable predicate 356.D 
recursively enumerable set 356.D 
recursive predicate 
general 356.C 
primitive 356.B 
recursive set 97 356.D 
general 97 
reduced 
(a closed linear subspace) 251.L 
(latin square) 241.A 
(scheme) 16.D 
reduced Abelian group 2.D 
reduced algebra 231.B 
reduced basis (of a lattice) 92.C 
reduced bundle (of a principal G-bundle) 147.J 
reduced character (of an algebra) 362.E 
reduced Clifford group 61.D 
reduced cone (of a topological space) 202.F 
reduced dual 437.L 
reduced extremal distance 143.B 
reduced form (of a linear structural equation 
system) 128.C 
reduced homology exact sequence 201.F 
reduced homology group 201 E 
reduced join 
(of homotopy classes) 202.Q 
(of mappings) 202.F 
(of topological spaces) 202.F 
reduced link polynomial 235.D 
reduced mapping cone 202.F 
reduced norm (of an algebra) 362.E 
reduced orthogonal group oi D 
reduced product space 202.Q 
reduced quadratic form 348.I 
reduced representation (of an algebra) 362.E 
reduced residue system modulom 297.G 
reduced square, Steenrod 64.B 
reduced square operation, Steenrod 64.B 
reduced suspension 
(of a topological space) 202.F 
n-fold 202.F 
reduced trace (of an algebra) 362.E 
reduced von Neumann algebra 308.C 
reducibility, axiom of 156.B 411.K 
reducible 
(algebraic equation) 10.B 
(algebraic variety) 16.A 
(continuous geometry) 85.A 
(fiber bundle) 147.J 
(in four color problem) 157.D 
(germ of an analytic set) 23.B 
(linear system) 16.N 
(linear system in control theory) 86.C 





Subject Index 
Reflexive law 


(polynomial) 337.F 
(positive matrix) 269.N 
(representation) 362.C 
(Riemannian manifold) 364.E 
completely (A-module) 277.H 
completely (group) 190.L 
completely (representation) 362.C 
reductio ad absurdum 156.C 411.1 
reduction 
d'Alembert method of, of order 252.F 
good (of an Abelian variety) 3.N 
modulo d (of a representation) 277.L 
modulo m (of a linear representation) 362.F 
potential good (of an Abelian variety) 3.N 
potential stable (of an Abelian variety) 3.N 
stable (of an Abelian variety) 3.N 
stable (ofa curve) 9.K 
reduction formula (of a surface) 110.A 


reduction theorem, cup product (on cohomology or 


homology of groups) 200.M 
reduction theory, Minkowski (on fundamental 
regions) 122.E 
reductive 
(algebraic group) 13I 
(homogeneous space) 199.A 
(Lie algebra) 248.G 
reductive action 226.B 
defined by a rational representation 226.B 
geometrically 226.B 
linearly 226.B 
semi- 226.B 
reductive stabilizer 199.A 
Reeb component 154.B 
Rech foliation 154.B 
Rech stability theorems 154.D 
Ree group 151.1 
Reeh-Schlieder theorem 150.E 
Rees lemma, Artin- 284.4 
Ree type 
groupof 151J 
group of Janko- 151J 
reference edge 281.C 
refinement 
(of a covering) 425.R 
(of a descending chain ina lattice) 243.F 
(ofa normal chainina group) 190.G 
barycentric 425.R 
cushioned 425.X 
A- (of a covering) 425.R 
star (of a covering) 425.R 
reflected wave 325.L 
reflecting barrier 115.B,C 
reflection 
(associated with D) 13.R 
(of a principal space) 139.B 
glide 92.E 
Schwartz’s principle of 74.E 198.G 
space 359 
theorem of quasiconformal 352.C 
reflection coefficient 387.D 
reflectionless potential 387.D 
reflection points (with respect to a circle) 74.E 
reflection positivity 150.F 
reflection principle 45.E 
reflexive 
(locally convex space) 424.0 
(relation) 358.A 
Banach space 37.G 
reflexive law 
(for an equivalence relation) 135.A 


Subject Index 
Refraction, atmospheric 


(on ordering) 311.A 
refraction, atmospheric 392 
Regge behavior 386.C 
Regge poles 132.C 386.C 
regime, local 51.B 
region 79.A 
acceptance 400.A 
confidence 399.Q 
confidence, uniformly most powerful 399.Q 
confidence, uniformly most powerful unbiased 
399.Q 
critical 400.A 
Dirichlet 234.C 
of discontinuity 234.A 
estimation 399.Q 
feasible 264.B 292.A 
Ford fundamental 234.C 
fundamental (of a discrete transformation 
group) 122.B 
invariance of a confidence 399.Q 
of relative stability 303.G 
star 339.D 
tolerance 399.R 
unbiased confidence 399.Q 
regionally recurrent (flow) 126.E 
regionally recurrent on an invariant set 126.E 
region of absolute stability (of the Runge-Kutta (P, p) 
method) 303.G 
regression, line of 111.F,I 
regression analysis 403.D 
regression coefficient 397.H,J 403.D 
regression function 397.1 
linear 397.H 403.D 
regression hyperplane 403.D 
regression line 403.D 
regula falsi 301.C 
regular 
(almost contact manifold) 110.E 
(almost periodic system) 290.B 
(boundary point) 120.D 
(cell complex) 70.D 
(closed set) 125.J 
(coherent &-module) 274.G 
(differential form on an algebraic variety) 16.0 
(Dirichlet form) 261.C 
(element of a connected Lie group) 249.P 
(element of a real Lie algebra) 248.B 
(estimator) 399.N 
(Green line) 193J 
(kernel) 125.L 
(left ideal of a Banach algebra) 36.D 
(ordinal number) 312.E 
(permutation group) 151.H 
(point for an additive process) 5.G 
(point of an analytic set) 23.B 45.D 
(point with respect to an analytic set) 21.M 
(point with respect to the Dirichlet problem) 
207.B 
(point of a flow) 126.D 
(prime number) 14.L 
(sampling procedure) 373.A 
(spectral sequence) 200.J 
(submartingale) 262.D 
(ata subvariety) 16.B 
homogeneously 275.C 
of the hth species 343.E 
at the point at infinity (for a harmonic function) 
193.B 
along a subvariety (for a rational mapping) 
16.1 
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regular affine transformation 7.E 
regular Banach space 37.G 
regular boundary 
(of a diffusion process) 115.B 
domain with (in a C?^-manifold) 105.U 
regular chain (of integral elements) 428.E 
regular conditional probability 342.E 
regular cone |. 384.A 
self-dual 384.E 
regular covering (space) 91.A 
regular element 
(ofaring) 368.B 
p- (of a finite group) 362.1 
regular embedding 105.K 
regular extension (ofa field) 149.K 
regular factorization 251.N 
regular first integral 126.H 
regular form 16.0 
regular function(s) 198.A 
on an open set (of a variety) 16.B 
sheaf of germs of 16.B 
atasubvariety 16.B 
regular grammar 31.D 
regular graph 186.C 
regular integral element 191.1 428.E 
regular integral manifold (of a differential ideal) 
428.E 
regularity 
abscissa of (of a Dirichlet series) 121.B 
axiom of (in axiomatic set theory) 33.B 
uptoaboundary 112.F 
parameter of (of a Lebesgue measurable set) 
380.D 
regularization (of a distribution) 125.M 
regularizing (kernel) 125.L 
regular knot projection 235.A 
regular local equation (at an integral point) 428.E 
regular local ring 284.D 
regularly convex set 89.G 
regularly homotopic (immersion) 114.D 
regularly hyperbolic (partial differential equation) 
325.A,F 
regular mapping 
(between prealgebraic varieties) 16.C 
of class C! 208.B 
regular matrix 269.B 
regular measure 270.F 
B- 270.F 
K- 270.F 
regular n-gon 357.A 
regular neighborhood 65.C 
regular neighborhood system 65.C 
regular outer measure 270.E 
regular perturbation 331.D 
regular point 
(in catastrophe theory) 51.F 
(of a differentiable mapping) 105.J 
(of a diffusion process) 115.B 
(for a Hunt process) 261.D 
(of a polyhedron or cell complex) 65.B 
(of a surface in E?) 111.J 
semi- (of a surface in E?) 111J 
regular polygon 357.A 
regular polyhedra 357.B 
regular polyhedral angle 357.B 
regular polyhedral group 151.G 
regular positive Radon measure 270.H 
regular process, positively 44.C 
regular projective transformation 343.D 
regular representation 
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(ofa group) 362.B 
(of a locally compact group)  69.B 
(of a topological transformation group) 437.A 
left (of an algebra) 362.C 
left (ofa group) 362.C 
right (of an.algebra) 362.E 
right (of a group) 362.E 
regular ring 284.D 
regular ring (continuous geometry) 85.B 
regular sequence (of Lebesgue measurable sets) 
380.D 
regular singularity (of a coherent é-module) 274.H 
regular singular point 254.B 
regular solution (of a differential ideal) 428.E 
regular space 425.Q 
completely 425.Q 
regular submanifold (of a C*-manifold) 105.L 
regular system 
of algebraic equations 10.A 
of parameters (of a local ring) 284.D 
regular transformation 
(of a linear space) 256.B 
(ofa sequence) 379.L 
totally (of a sequence) 379.L 
regular tube 193.K 
regular value 105J 
regulator (of an algebraic number field) 14.D 
p-adic 450.J 
regulator problem, optimal 80.F 
Reinhardt domain 21.B 
complete 21.B 
reiteration theorem 224.D 
rejection 400.A 
related differential equation 254.F 
relation(s) 358 
(among elements of a group) 190.C 
(among the generators of a group) 161.A 
Adem (for Steenrod pth power operations) 
64.B 
Adem (for Steenrod square operations) 64.B 
analytic, invariance theorem of 198.K 
antisymmetric 358.A 
binary 358.A 411.G 
canonical anticommutation 377.A 
canonical commutation 351.C 377.A,C 
coarser 135.C 
defining (among the generators of a group) 
161.A 
dispersion 132.C 
equivalence 135.A 358.A 
Euler 419.B 
finer 135.C 
Fuchsian 253.A 
functional (among components of a mapping) 
208.C 
functional, of class C” 208.C 
fundamental (among the generators of a group) 
161.A 419.A 
Gibbs-Duhem 419.B 
Heisenberg uncertainty 351.C 
Hurwitz (on homomorphisms of Abelian varie- 
ties) 3.K 
identity 102.1 
incidence 282.A 
inverse 358.A 
Legendre 134.F, App. A, Table 16.1 
Maxwell 419.B 
n-ary 411.G 
normal commutation 150.D 
order 311.A 





Subject Index 
Relative nullity, index of 


orthogonality (on irreducible characters) 
362.G 
orthogonality (for square integrable unitary 
representations) 437.M 
period 11.C 
Pliicker (on Pliicker coordinates) 90.B 
prey-predator 263.B 
proper equivalence (in an analytic space) 23.E 
Rankine-Hugoniot 204.G 205.B 
reciprocity, Onsagers 402.K 
reflexive 358.A 
Riemann-Hurwitz 367.B 
Riemann period 3.L 11.C 
stronger 135.C 
symmetric 358.A 
transitive 358.A 
weaker 135.C 
relationship algebra 102.3 
relative Alexander cohomology group 201.M 
relative algebraic number field 14.I 
relative boundary 367.B 
relative Bruhat decomposition 13.Q 
relative Cech cohomology group 201.M 
relative Cech homology group 201.M 
relative chain complex 200.C 
relative cochain complex 200.F 
relative cohomology group 215.W 
relative complement (at two sets) 381.B 
relative components (of a Lie transformation group) 
110.A 
relative consistency 156.D 
relative degree 
(of a finite extension) 257.D 
(of a prime ideal over a field) 14.1 
relative derived functor 200.K 
relative different 14J 
relative discriminant 14.J 
relative entropy 212.B 
relative extremum, conditional 106.L 
relative frequency (of samples) 396.C 
relative homological algebra 200.K 
relative homotopy group 202.K 
relative integral invariant 219.A 
Cartan’s 219.B 
relative invariant 12.A 226.A 
relative invariant measure 225.H 
relatively ample sheaf 16.E 
relatively bounded (with respect to a linear operator) 
331.B 
relatively closed set 425.J 
relatively compact 
(with respect to a linear operator) 331.B 
(maximum likelihood method) 399.M 
(set) 425.S 
(subset) 273.F 
relatively dense 126.E 
relatively invariant measure 225.H 
relatively minimal 15.G 16.I 
relatively minimal model 15.G 
relatively open set 425.J 
relatively prime 
(fractional ideals) 14.E 
(numbers) 297.A 
relatively stable 303.G 
relative maximum (of a function) 106.L 
relative Mayer-Vietoris exact sequence 201.L 
relative minimum (of a function) 106.L 
relative neighborhood 425.J 
relative norm (of a fractional ideal) 14.1 
relative nullity, index of 365.D 


Subject Index 
Relative open set 


relative open set 425.J 
relative ramification index (of a prime ideal over a 
field) 14.1 

relative singular homology group 201.L 
relative stability 303.G 

interval of 303.G 

region of 303.G 
relative topology 425.J 
relative uniformity 436.E 
relative uniform star convergence 310.F 
relativistically covariant 150.D 
relativity 

general principle of 359 

general theory of 359.A 

special principle of 359 

special theory of 359.A 
relativization 


(of a definition of primitive recursive functions) 


356.B 
(ofa topology) 425J 
(ofa uniformity) 436.E 
relativized 356.F 
relaxation 215.A 
with projection 440.E 
relaxation oscillation 318.C 
relaxed continuity requirements, variational prin- 
ciples with 271.G 
Rellich-Dixmier theorem 351.C 
Rellich-Kato theorem  331.B 
Rellich lemma 68.C 
Rellich theorem 323.G 
Rellich uniqueness theorem 188.D 
remainder 297.A 337.C 
(in Taylor’s formula) 106.E 
Cauchy App. A, Table 9.IV 
Lagrange App. A, Table 9.IV 
Roche-Schlómilch App. A, Table 9.IV 
remainder theorem. 337.E 
Chinese 297.G 
Remak-Schmidt theorem, Krull- (in group theory) 
190.L 
Remmert-Stein continuation theorem  23.B 
Remmert theorem 23.C 
removable (set for a family of functions) 169.C 
removable singularity 
(of a complex function) 198.D 
(of a harmonic function) 193.L 
Renaissance mathematics 360 
renewal 
equation 260.C 
theorem 260.C 
renormalizable 111.B 132.C 150.C 
super 150.C 
renormalization 
constant 150.C 
equation 111.B 
group 111.A 
method 111.A 
Rényi theorem 123.E 
reoriented graph 186.B 
repeated integral 
(for the Lebesgue integral) 221.E 
(for the Riemann integral) 216.G 
repeated series 
by columns |. 379.E 
by rows 379.E 
replacement, axiom of 33.B 381.G 
replacement, model 307.C 
replica 13.C 
replication 102.A 
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number of 102.B 


represent 


(afunctor) 52.L 
(an ordinal number) 81.B 


representable 


(functor) 52.L 
linearly (matroid) 66.H 


representation(s) 362.A 


(of an algebraic system) 409.C 

(of a Banach algebra) 36.D 

(ofa Jordan algebra) 231.C 

(ofa knot group) 235.E 

(of a lattice) 243.E 

(of a Lie algebra) 248.B 

(of a mathematical system) 362.A 

(of a vector lattice) 310.D 

absolutely irreducible 362.F 

adjoint (of a Lie algebra) 248.B 

adjoint (of a Lie group) 249.P 

adjoint (of a representation) 362.E 

analytic (of GL(V)) 60.B 

in terms of arc length (of a continuous arc) 
246.A 

canonical (of Gaussian processes) 176.E 

completely reducible 362.C 

complex (of a Lie group) 249.0 

complex conjugate 362.F 

conjugate 362.F 

contragredient 362.E 

coregular (of an algebra) 362.E 

cyclic (of a C*-algebra) 36.G 

cyclic (of a topological group) 437.A 

differential (of a unitary representation of a Lie 
group) 4378 

direct sum of 362.C 

double-valued 258.B 

dual 362.E 

equivalent 362.C 

factor (of a topological group) 437.E 

factor, of type I, II, or III 308.M 437.E 

faithful 362.B 

Fock 150.C 

Gelfand (of a commutative Banach algebra) 
36.E 

generalized canonical (of Gaussian processes) 
176.E 

generating (of a compact Lie group} 249.U 

half-spin (even, odd) 61.E 

Herglotz’s integral 43.1 

induced 362.G 

induced (of a finite group) 362.G 

induced (of a unitary representation of a sub- 
group) 437.0 

integral (of a group) 362.G,K 

integral, Cauchy’s 21.C 

irreducible (of an algebra or a group) 362.C 

irreducible (of a Banach algebra) 36.D 

irreducible projective 362.J 

isomorphic 362.C 

isotropy 431.C 

Kallén-Lehmann 150.D 

kernel (of a Green’s operator) 189.B 

l-adic 3.E 

Lax 287.B,C 387.C 

left regular (of a group) 362.B 

linear — linear representation 

list 186.D 

Mandelstam 132.C 

matrix 362.D 

modular (of a finite group) 362.G 
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module of 69.D 
momentum 351.C 
without multiplicity 437.G 
normal (of a von Neumann algebra) 308.C 
ordinary (of a finite group) 362.G 
parametric 165.C 
parametric (of Feynman integrals) 146.B 
parametric (of a subspace of an affine space) 
7.C 
permutation (of a group) 362.B 
permutation, reciprocal (of a group)  362.B 
polynomial (of GL(V)) 60.D 
position 351.C 
projective (of a group) 362J 
projective, irreducible 362.J 
quotient (of a linear representation) 362.C 
rational (of GL(V)) 60.D 
rational (of a matrix group) 226.B 
real (ofa Lie group) 349.0 
reciprocal linear (of an algebra) 362.C 
reciprocal permutation (of a group) 362.B 
reduced (of an algebra) 362.F 
reducible 362.C 
regular (of a locally compact group) 69.B 
regular (of a topological transformation group) 
437.A 
regular, left (of an algebra) 362.C 
regular, left (of a group) 362.B 
regular, right (of an algebra) 362.E 
regular, right (of a group) 362.B 
Schrödinger 351.C 
semisimple 362.C 
similar 362.C 
similar matrix (semilinear mapping) 256.D 
similar projective 362.J 
simple 362.C 
slice 431.C 
special (of a Jordan algebra) 231.C 
spectral 390.E 
spherical (of a differentiable manifold) 111.G 
spherical (of a space curve) 111.F 
spherical (of a unimodular locally compact 
group) 437.Z 
spin 61.E 
spin (of SO(n)) 60.5 
spinor, of rank 258.B 
strongly continuous (of a topological group) 
69.B 
sub- 362.C 
sub- (of a projective representation) 362.J 
tensor (of a general linear group) 256.M 
tensor product of 362.C 
translation, theorem 375.H 
transposed 362.E 
tree 96.D 
unit (ofa group) 362.C 
unitary — unitary representation 
vector (of a Clifford group) 61.D 
weakly continuous (of a topological group) 
69.B 
zero (of an algebra) 362.C 
representation module 
(of a linear representation) 362.C 
faithful 362.C 
representation problem (on surfaces) 246.] 
representation ring 237.H 
representation space 
(ofa Banach algebra) 36.D 
(ofa Lie algebra) 248.B 
(ofa Lie group) 249.0 


Subject Index 
Resolvent set (of a linear operator) 


(of a unitary representation) 437.A 
representative (of an equivalence class) 135.B 
representative function (of a compact Lie group) 
249.U 

representative ring (of a compact Lie group) 
249.U 

representing function (of a predicate) 356.B 

representing function (of a subset) 381.C 

representing measure 164.C 

reproducing kernel 188.G 

reproducing property (of a probability distribution) 
341.E, App. A, Table 22 

reproduction function 263.A 
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algebraic 16D 
automatic integration 299.C 
coarse moduli 16.W 
complete 16.D 
consistent-mass 304.D 
deformation of X over a connected 16.W 
difference 304.E 
difference, of backward type 304.F 
difference, of forward type 304.F 
explicit 304.F 
fine moduli 16.W 
formal 16.X 
Friedrichs 304.F 
group 16.H 
Hilbert 16S 
implicit 304.F 
integral 16.D 
inverted filing 96.F 
irreducible 16.D 
K-complete 16.D 
Lax-Wendroff 304.F 
locally Noetherian formal 16.X 
moduli 16.W 
morphism of 16.D 
Noetherian 16.D 
nonadaptive 299.C 
Picard 16.P 
projective 16.E 
quasiprojective 16.E 
S- 16D 
over A 16.D 
separated 16.D 
separated formal 16.X 
Scherk's surface 275.A 
Schläfli diagram (of a complex semisimple Lie 
algebra) 248.S 
Schläfli formula App. A, Table 19.III 
Schläfli integral representation 393.B 
Schláfli polynomial App. A, Table 19.IV 
Schlesinger equations 253.E 
schlicht 438.A 
schlicht Bloch constant 77.F 
schlichtartig 367.G 
Schlieder theorem, Reeh- 150.E 
Schlómilch criterion App. A, Table 10.II 
Schlómilch remainder, Roche- App. A, Table 
9.N 
Schlómilch series 39.D, App. A, Table 19.III 
generalized 39.D 
Schmidt class, Hilbert- 68.1 
Schmidt condition 379.M 
Schmidt expansion theorem, Hilbert- 217.H 
Schmidt norm, Hilbert- 68 
Schmidt orthogonalization 317.A 
Gram- 317.A 
Schmidt procedure, Lyapunov- 286.V 
Schmidt theorem 118.D 
Schmidt theorem, Knopp- 208.C 


Subject Index 
Schmidt theorem, Krull-Remak- 


Schmidt theorem, Krull-Remak- (in group theory) 
190.L 
Schmidt type, integral operator of Hilbert- 68.C 
Schmidt type, kernel of Hilbert- 217.1 
Schnee theorem, Knopp- (on method of summation) 
379.M 
Schoenflies notation (for crystal classes) 92.E, 
App. B, Table 6.IV 
Schoenflies problem 65.G 
Schoenflies theorem 65.G 
Schottky group | 234.B 
Schottky theorem  43.J 
Schottky uniformization 367.C 
Schrader axioms, Osterwalder- 150.F 
Schreier conjecture (on simple groups) 151.1 
Schreier extension, Artin- (ofa field) 172.F 
Schroder equation, Königs- 44.B 
Schröder functional equation 388.D 
Schrédinger equation 351.D 
1-body 351.E 
random 340.E 
time-dependent 351.D 
time-independent 351.D 
Schrödinger operator 351.D 
Schrödinger picture 351.D 
Schrödinger representation 351.C 
Schrodinger series, Rayleigh- 331.D 
Schubert cycle 56.E 
Schubert variety 56.E 
Schur index 
(of a central simple algebra) 29.E 
(of an irreducible representation) 362.F 
Schur lemma 
(on linear representations) 362.C 
(on simple modules) 277.H 368.G 
(on unitary representations) 437.D 
Schur subgroup 362.F 
Schur theorem (on linear transformations of 
sequences) 379.L 
Schur theorem, Kojima- (on linear transformations 
of sequences) 379.L 
Schur-Zassenhaus theorem (on Hall subgroups) 
151.E 
Schwartz-Christoffel transformation 77.D, 
App. A, Table 13 
Schwartz-Christoffel transformation formula 77.D 
Schwartz integral, Bartle-Dunford- 443.G 
Schwartz space 424.8 
Schwarzian derivative App. A, Table 9.III 
Schwarz inequality 211.C 
Cauchy- 211.C, App. A, Table 8 
Schwarzlemma 43.B 
Schwarz principle of reflection 198.G 
Schwinger equation, Lippmann- 375.C 
Schwinger function 150.F 
Schwinger points 150.F 
sciences, information 75.F 
scores 
canonical 397.M 
factor 280.G 346.F 
score test, Fisher-Yates-Terry normal 371.C 
scoring method 397.M 
screening, sampling inspection with 404.C 
seasonal adjustment 397.N 
sec (secant) 131.E 
secant 432.A 
hyperbolic 131.F 
sech (hyperbolic secant) 131.F 
second (unit of an angle) 139.D 
secondary cohomology operation, stable 64.C 
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secondary components (of a homogeneous space) 
110.A 
secondary composition 202.R 
secondary obstruction 305.D 
secondary parameters 110.A 
second axiom, Tietze’s 425.Q 
second barycentric derived neighborhood 65.C 
second boundary value problem 
(for harmonic functions) 193.F 
(of partial differential equations of elliptic type) 
323.F 
second category, set of 425.N 
second classification theorem (in the theory of 
obstructions) 305.C 
second complementary law (of Legendre symbols) 
297] 
second countability axiom  425.P 
second Cousin problem 21.K 
second definition (of an algebraic K-group) 237J 
second difference 104.A 
second extension theorem (in the theory of 
obstructions) 305.C 
second factor (of a class number) 14.L 
second fundamental form (of an immersion of 
a manifold) 111.G 365.C, App. A, Table 4I 
second fundamental quantities (of a surface) 111.H 
second fundamental tensor 417.F 
second fundamental theorem (in Morse theory) 
279.D 
second homotopy theorem 305.C 
second incompleteness theorem 185.C 
second isomorphism theorem (on topological 
groups) 423.J 
second kind 
(Abelian differential of) 11.C 
(Abelian integral of) 11.C 
(Fuchsian group of) 122.C 
(integral equations of Fredholm type of) 
217.A 
perfect number of 297.D 
Stirling number of 66.D 
second law of cosines 432.A, App. A, Table 2.II 
second law of thermodynamics 419.A 
second mean value theorem 
(for the D-integral) 100.G 
(for the Riemann integral) 216.B 
(for the Stieltjes integral) 94.C 
second-order asymptotic efficiency 399.0 
second-order design 102.M 
second-order efficiency 399.0 
second-order predicate 411.K 
second-order predicate logic 411.K 
second quantization 377 
second separation axiom 425.Q 
second variation formula 178.A 
section 
(of a finite group) 362.1 
(ofa sheaf space) 383.C 
circular 350.F 
conic 78 
cross- 126.C 286.H 
cross (of a fiber bundle) 147.L 
cross (of a fiber space) 148.D 
cross-, for a closed orbit 126.G 
differential cross 375.A 386.B 
local 126.E 
local cross (of a fiber bundle) 147.E 
n- (in a cell complex) 70.D 
normal (of a surface) 410.B 
r- (of a Euclidean complex) 70.B 
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r- (of a simplicial complex) 70.C 
Scattering cross 375.A 
set of (of a sheaf)  383.C 
total (elastic) cross 386.B 
zero- (of a block bundle) 147.Q 
sectional curvature 364.D 
holomorphic 364.D 
section graph 186.C 
sectors, superselection 150.E 351.K 
secular equation 55.B 269.F 
secular perturbation 55.B 
sedenion 29.D 
segment 155.B 178.H 
(in affine geometry) 7.D 
(in an ordered set) 311.B 
oriented 442.A 
Seidel method, Gauss- 302.C 
Seifert conjecture 126.N 154.D 
Seifert matrix 235.C 
Seifert surface 235.A 
Selbergsieve 123.E 
Selberg theorem, Evans- 48.E 338.H 
Selberg zeta function 450.T 
selection, measurable 443.] 
selection, model 401.D 
selection function 354.E 
selection parameter 396.F 
selection rule 351.H 
selection statistic 396.F 
self-adjoint 
(linear homogeneous ordinary differential 
equation) 315.B 
essentially 251.E 390.1 
self-adjoint differential equation 252.K 
self-adjoint differential operator, formally 112.1 
self-adjoint operator 251.E 390.F 
self-adjoint system of differential equations 252.K 
self-commutator 251.K 
self-dual (linear space) 256.H 
self-dual (regular cone) 384.E 
self-dual, anti- (G-connection) 80.Q 
self-excited vibration 318.B 
self-information 213.B 
self-intersection number 15.C 
self-loop 186.B 
self-polar tetrahedron 350.C 
self-polar triangle 78.J 
self-reciprocal function 220.B 
semicontinuity, lower (of length) 246.A 
semicontinuous 84.C 
(mapping in a topological linear space) 153.D 
lower 84.C 
upper 84.C 
semicontinuous function 84.C 
semicontinuous partition, upper 425.L 
semidefinite Hermitian form 348.F 
semidefinite kernel, positive 217.H 
semidefinite matrix, positive 269.I 
semidefinite operator, positive 251.E 
semidefinite quadratic form, positive or negative 
348.C 
semidirect product (of two groups) 190.N 
semidiscrete approximation 304.B 
semiexact (differential on an open Riemann surface) 
367. 
semi-explicit 303.D 
semifinite (von Neumann algebra) 308.E 
semifinite (weight on a von Neumann algebra) 
308.D 
semiflow 126.B 





Subject Index 
Semisimple 


of class C" 126.B 
continuous 126.B 
discrete 126.B 
discrete, of class C" 126.B 
semigroup 88.E 190.P 409.A 
(of a Markov process) 261.B 
of class (C°) 378.B 
differentiable 378.F 
distribution 378.F 
dual 378.F 
equicontinuous, of class (C^) 378.B 
free 161.A 
holomorphic 378.D 
locally equicontinuous 378.F 
nonlinear 378.F 
nonlinear, of operators 286.X 
of operators 378 
order-preserving 286.Y 
unitary 409.C 
semigroup algebra 29.C 
large 29.C 
semigroup bialgebra 203.G 
semihereditary ring 200.K 
left 200.K 
right 200.K 
semi-implicit 303.D 
semi-integral, left 68.N 
semi-integral, right 68.N 
semi-intuitionism 156.C 
semi-invariant 226.A 
G- 226A 
of a probability distribution 341.C 
semilattice 243.A 
lower 243.A 
upper 243.A 
semilinear (partial differential equations of elliptic 
type) 323.D 
semilinear mapping 256.P 277.L 
semilinear transformation 256.P 
semilocal ring 284.D 
analytically unramified 284.D 
Noetherian 284.D 
quasi- 284.D 
semilogarithmic paper 19.F 
semimartingale 262.E 406.B 
continuous 406.B 
semimartingale decomposition 406.B 
seminorm (on a topological linear space) 424.F 
semiorbit 126.D 
negative 126.D 
positive 126.D 
semiordered set 311.A 
semiordering 311.A 
semipolar set 261.D 
semiprimary ring 368.H 
semiprime differential ideal (of a differential ring) 
113 
semiprime ideal (of a differential ring) 113 
semiprimitive ring 368.H 
semireductive (action defined by a rational 
representation) 226.B 
semireflexive (locally convex space) 424.0 
semiregular point (of a surface in E?) 111.J 
semiregular transformation (of a sequence) 379. 
semisimple 
(algebraic group) 13.I 
(Banach algebra) 36.D 
(Jordan algebra) 231.B 
(Lie algebra) 248.E 
(Lie group) 249.D 


Subject Index 
Semisimple algebra 


(matrix) 269.G 
semisimple algebra |. 29.A 
semisimple A-module 277.H 
semisimple component (of a linear transformation) 
269.L 
semisimple linear representation 362.C 
semisimple linear transformation 269.L 
semisimple part 
of an algebraic group 13.E 
of a nonsingular matrix 13.E 
semisimple ring 368.G 
semisimplicial complex 70.E 
semisimplicity, Cartan’s criterion of 248.F 
semistable (coherent sheaf) 16.Y 
semistable distribution 341.G 
semistable reduction theorem 16.Z 
semistable vector bundle (algebraic) 16.Y 
semivariation 443.G 
sensitive grammar, context- 31.D 
sensory test 346.B 
separable 
(function in nomograms) 19.D 
(polynomial) 337.G 
(rational mapping) 16.I 
(stochastic process) 407.A 
(topological space) 425.P 
perfectly 425.P 
separable algebra 29.F,K 200.L 
central 29.K 
separable element (of a field) 149.H 
separable extension 
(ofa field) 149.H,K 
maximal (of a field) 149.H 
separable metric space 273.E 
separably generated extension (of afield) 149.K 
separated 
(formal scheme) 16.X 
(morphism) 16.D 
separated convex sets, strongly SOA 
separated kernel 217.F 
separated scheme 16.D 
separated space 425.Q 
separated S-scheme 16.D 
separated topological group 423.B 
separated type App. A, Table 14.I 
separated uniform space 436.C 
separated variable type App. A, Table 15.II 
separately continuous (bilinear mapping) 424.Q 
separating family | 207.C 
separating transcendence basis (of a field extension) 
149.K 
separation 
axioms of (in set theory) 33.B 
of variables 322.C 
separation axioms | 425.Q 
the first 425.Q 
the fourth 425.Q 
the second 425.Q 
the third 425.Q 
Tikhonov 425.Q 
separation cochain 305.B 
separation cocycle 305.B 
separation principle 405.C 
separation theorem (on convex sets) 89.A 
separator 186.F 
sequence(s) 165.D 
admissible (in Steenrod algebra) 64.B, App. A, 
Table 6.III 
asymptotic 30.A 
of Bernoulli trials 396.B 
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Blaschke 43.F 

Cauchy (in a-adic topology) 284.B 

Cauchy (in a metric space) 273J 

Cauchy (of rational numbers) 294.E 

Cauchy (of real numbers) 355.B 

Cauchy (in a uniform space) 436.G 

cohomology exact 201.L 

cohomology spectral 200.J 

connected, of functors 200.1 

convergent (of real numbers) 87.B 355.B 

divergent (of real numbers) 87.B 

double 379.E 

exact (of A-homomorphisms of A-modules) 
277.E 

exact, of cohomology 200.F 

exact, of Ext 200.G 

exact, of homology 200.C 

exact, of Tor 200.D 

of factor groups (of a normal chain) 190.G 

Farey 4.B 

Fibonacci 295.A 

finite 165.D 

of functions 165.B,D 

fundamental (in a metric space) 273.J 

fundamental (of rational numbers) 294.E 

fundamental (of real numbers} 355.B 

fundamental (in a uniform space} 436.G 

fundamental, of cross cuts (in a simply 
connected domain) 333.B 

fundamental exact (on cohomology of groups) 
200.M 

Gysin exact (of a fiber space) 148.E 

Hodge spectral 16.U 

homology exact (of a fiber space) 148.E 

homology exact (for simplicial complexes) 
201.L 

homotopy exact 202.L 

homotopy exact (of a fiber space) 148.D 

homotopy exact (of a triad) 202.M 

homotopy exact (of a triple) 202.L 

independent, of partitions 136.E 

infinite 165.D 

interpolating 43.F 

Jordan Holder (ina group) 190.G 

linear recurrent 295.A 

Mayer-Vietoris exact 201.C 

minimizing 46.E 

monotone (of real numbers) 87.B 

monotonically decreasing (of real numbers) 
87.B 

monotonically increasing (of real numbers) 
87.B 

normal (of open coverings) 425.R 

null (in a-adic topology) 284.B 

of numbers 165.D 

(o)-convergent 87.L 

(0)-star convergent 87.L 

order-convergent (in a vector lattice) 310.C 

oscillating (of real numbers) 87.D 

of points 165.D 

pointwise convergent 435.B 

positive definite 192.B 

of positive type 192.B 

Puppe exact 202.G 

random 354.E 

rapidly decreasing 168.B 

recurrent, of orderr 295.A 

reduced homology exact 201.F 

regular (of Lebesgue measurable sets) 380.D 

regular spectral 200.3 
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Serre @-theory 


relative Mayer-Vietoris exact 201.L 
(R, S)-exact (of modules) 200.K 
ofsets 165.D 


field of formal power, in one variable 370.A 
finite 379.A, App. A, Table 10.1 
formal power 370.A 

short exact 200.1 formal power, field in one variable 370.A 

simply convergent 435.B formal power, ring 370.A 

slowly increasing 168.B Fourier 159 197.C, App. A, Table 11.1 

spectral 200J Fourier (of an almost periodic function) 18.B 

spectral (of singular cohomology of a fiber Fourier (of a distribution) 125.P 

space) 148.E Fourier-Bessel 39.D 
standard 400.K Fourier cosine App. A, Table 11.I 
symbol (in the theory of microdifferential Fourier sine App. A, Table 11.I 
operators) 274.F Gauss 206.A 

of Ulm factors (of an Abelian p-group) 2.D generalized Eisenstein 450.T 

uniformly convergent 435.A generalized Schlómilch 39.D 

Wang exact (of a fiber space) 148.E generalized trigonometric 18.B 
sequencing problem, machine 376 geometric 379.B, App. A, Table 10.1 
sequential decision function 398.F Heine 206.C 
sequential decision problem 398.F hypergeometric 206.A 
sequential decision rule 398.F infinite 379.A, App. A, Table 10.III 
sequentially compact (space) 425.S iterated, by columns (of a double series) 379.E 
sequential probability ratio test 400.L iterated, by rows (of a double series) 379.E 
sequential sampling inspection 404.C Kapteyn 39.D, App. A, Table 19.III 
sequential space 425.CC Lambert 339.C 
sequential test 400.L Laurent 339.A 
serial correlation coefficient 397.N 421.B 


serial cross correlation coefficient 397.N 
series 379, App. A, Table 10 


absolutely convergent 379.C 

absolutely convergent double 379.E 

allied (of a trigonometric series) 159.A 

alternating 379.C 

ascending central (of a Lie algebra) 248.C 

asymptotic 30 

asymptotic power 30.A 

binomial App. A, Table 10.1V 

binomial coefficient 121.E 

characteristic (ina group) 190.G 

commutatively convergent 379.C 

complementary (of unitary representations of 
a complex semisimple Lie group) 437.W 

complementary degenerate (of unitary represen- 
tations of a complex semisimple Lie group) 
437.W 

composition (in a group) 190.G 

composition (in a lattice) 243.F 

composition factor (of a composition series in 
agroup) 190.G 

conditionally convergent 379.C 

conditionally convergent double 379.E 

conjugate (of a trigonometric series) 159.A 

convergent 379.A 

convergent double 379.E 

convergent power 370.B 

convergent power, ring 370.B 

degenerate (of unitary representations of 
a complex semisimple Lie group) 437.W 

derived (of Lie algebra) 248.C 

descending central (of a Lie algebra) 248.C 

Dini 39.D 

Dirichlet 121 

Dirichlet, of the type {A,} 121.A 

discrete (of unitary representations of a semi- 
simple Lie group) 437.X 

divergent 379.A 

divergent double 379.E 

double 379.E 

Eisenstein 32.C 

Eisenstein-Poincaré 32.F 

exponential 131.D 

factorial 104.F 121.E 


logarithmic 131.D 

lower central (ofa group) 190.J 

majorant 316.G 

majorant (of a sequence of functions) 435.A 

Neumann 217.D 

of nonnegative terms 379.B 

ordinary Dirichlet 121.A 

orthogonal (of functions) 317.A 

oscillating 379.A 

n- (ofa group) 151.F 

Poincaré 32.B 

of positive terms 379.B 

power 21.B 339 370.A, App. A, Table 10.IV 

power (in a complete ring) 370.A 

power, with center at the point at infinity 
339.A 

power, ring 370.A 

principal (in an Q-group) 190.G 

principal (of unitary representations of a com- 
plex semisimple Lie group) 258.C 437.W 

principal (of unitary representations of a real 
semisimple Lie group) 258.C 437.X 

principal H. 437.X 

properly divergent 379.A 

Puiseux 339.A 

repeated, by columns (of a double series) 
379.E 

repeated, by rows (of a double series) 379.E 

ring of convergent power 370.B 

ring of formal power 370.A 

ring of power 370.A 

Schlómilch 39.D, App. A, Table 19.III 

simple 379.E 

singular 4.D 

supplementary 258.C 

Taylor 339.A 

termwise integrable 216.B 

theta 348.L 

theta-Fuchsian, of Poincaré 32.B 

time 397.A 421.A 

trigonometric 159.A 

unconditionally convergent 379.C 

uniformly absolutely convergent 435.A 

upper central (of a group) 190.J 


Serre conjecture 369.F 
Serre @-theory 202.N 


Subject Index 
Serre duality theorem 


Serre duality theorem 
(on complex manifolds) 72.E 
(on projective varieties) 16.E 
Serre formulas, Frenet- (on curves) 111.D, 
App. A, Table 4.I 
Serre theorem (for ample line bundles) 16.E 
sesquilinear form 
(onalinearspace) 256.Q 
(on a product of two linear spaces) 256.Q 
matrix of 256.Q 
nondegenerate 256.Q 
set(s) 381 
A- 22A 409.A 
absolutely convex (in a linear topological space) 
424.E 
a-limit 126.D 
analytic 22.A,] 
analytic (in the theory of analytic spaces) 23.B 
analytically thin (in an analytic space) 23.D 
ofanalyticity 192.N 
analytic wave front 274.D 
ofantisymmetry 164.E 
arbitrary 381.G 
asymptotic 62.A 
asymptotic ratio 308.1 
axiom of power 33.B 381.G 
B, 22.D 
®-measurable 270.C 
Baire 126.H 270.C 
bargaining 173.D 
basic (for an Axiom A flow) 126J 
basic (of a structure) 409.B 
basic open 425.F 
bifurcation 51.F 418.F 
border 425.N 
Borel (in a Euclidean space) 270.C 
Borel (in a topological space) 270.C 
Borel in the strict sense 270.C 
boundary 425.N 
boundary cluster 62.A 
bounded (in an affine space) 7.D 
bounded (in a locally convex space) 424.F 
bounded (in a metric space) 273.B 
C, 22D 
CA 22A 
Cantor 79.D 
capacity of 260.D 
catastrophe 51.F 
category of 52.B 
chain recurrent 126.E 
characteristic (of an algebraic family on 
a generic component) 15.F 
characteristic (of a partial differential operator) 
320.B 
choice 34.A 
closed 425.B 
cluster 62.A 
coanalytic 22.A 
compact (in a metric space) 273.F 
compact (in a topological space) 425.8 
complementary 381.B 
complementary analytic 22.A 
complete 241.B 
complete orthonormal (of a Hilbert space) 
197.C 
connected 79.A 
constraint (of a minimization problem) 292.A 
convex 7.D 89 
countably equivalent (under a nonsingular 
bimeasurable transformation) 136.C 
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curvilinear cluster 62.C 

cylinder 270.H 

of degeneracy (of a holomorphic mapping 
between analytic spaces) 23.C 

A} 22D 

dense 425.N 

dependent 66.G 

derived 425.0 

determining (of a domain in C^) 21.C 

difference (of blocks) 102.E 

directed 311.D 

discrete 425.0 

disjoint 381.B 

dominating 186. 

empty (D) 381.A 

externally stable 186.1 

equipollent 49.A 

equipotent 49.A 

F, 270.C 

factor (of a crossed product) 29.D 

factor (of an extension of groups) 190.N 

factor (of a projective representation) 362.5 

family of 165.D 381.B,D 

family of (indexed by A) 381.D 

final (of a correspondence) 358.B 

final (of a linear operator) 251.E 

finite 49.F 381.A 

finitely equivalent (under a nonsingular 
bimeasurable transformation) 136.C 

of the first category 425.N 

of the first kind 319.B 

first negative prolongational limit 126.D 

first positive prolongational limit 126.D 

function 380.A 

function-theoretic null 169.A 

fundamental (of a transformation group) 
122.B 

fundamental open (of a transformation group) 
122.B 

G; 270.C 

general Cantor 79.D 

generalized peak 164.D 

(general) recursive 97 

germ of an analytic 23.B 

homotopy 202.B 

idempotent (ofa ring) 368.B 

increasing directed 308.A 

independent 66.G 186] 

index 102.L 

index (of a family) 165.D 

index (of a family of elements) 381.D 

indexing (of a family of elements) 381.D 

infinite 49.F 381.A 

information 173.B 

initial (of a correspondence) 358.B 

initial (of a linear operator) 251.E 

interior cluster 62.A 

internally stable  186.I 

interpolating (for a function algebra) 164.D 

Kronecker 192.R 

lattice of 243.E 

lattice-ordered 243.A 

Lebesgue measurable 270.G 

Lebesgue measurable (of R”) 270.G 

level 279.D 

limit 234.A 

locally closed 425.J 

M- 159.J 

meager 425.N 

minimal 126.E 
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Sheaf (sheaves) 

-measurable 270.D standard 22] 
-null 270.D strongly P-convex 112.C 
of multiplicity 159.3 strongly separated convex 89.A 
n-cylinder 270.H system of closed 425.B 
nilpotent (of a ring) 368.B system of open 425.B 
nodal 391.H ternary 79.D 
nonmeager 425.N thin (in potential theory) 261.D 
nonsaddle 120.E totally bounded (in a metric space) 273.B 
nonwandering 126.E totally bounded (in a uniform space) 436.H 
nowhere dense 425.N U- 159J 
null (in a measure space) 270.D 310.1 of uniqueness 159.J 
null, of class N.  169.E universal (for the projective sets of class n) 
null(@) 381.A 22.E 
w-limit 126.D universal (of set theory) 381.B 
open 425.B wandering (under a measurable transformation) 
ordered — ordered set 136.C 
ordinate 221.E wave front 274.B 345.A 
orthogonal (of functions) 317.A wave front, analytic 274.D 
orthogonal (of a Hilbert space) 197.C weakly wandering 136.C 
orthogonal (of a ring) 368.B weakly wandering (under a group) 136.F 
orthonormal (of functions) 317.A well-ordered 311.C 
orthonormal (of a Hilbert space) 197.C Z- 382.B 
P, 22D Zariski closed 16.A 
P-convex (for a differential operator) 112.C Zariski dense 16.A 
peak 164.D Zariski open 16.A 
perfect 425.0 set function(s) 380 
IT; 22A additive 380.C 
HI 22D of bounded variation 380.B 
point 381.B completely additive 380.C 
of points of indeterminacy (of a proper finitely additive 380.B 

meromorphic mapping) 23.D monotone decreasing 380.B 
polar (in potential theory) 261.D 338.H monotone increasing 380.B 
power 381.B p-absolutely continuous additive 380.C 
precompact (in a metric space) 273.B -singular additive 380.C 
principal analytic 23.B set-theoretic formula 33.B 
projective, of classn 22.D set-theoretic topology 426 
purely d-dimensional analytic 23.B set theory 381.F 
of quasi-analytic functions 58.F axiomatic 33 156.E 
quotient (with respect to an equivalence Bernays-Gódel 33.A,C 

relation) 135.B Boolean-valued 33.E 
ratio 136.F classical descriptive 356.H 
recurrent 260.E effective descriptive 356.H 
recursive 356.D general 33.B 
recursively enumerable 356.D Gódel 33.C 
regularly convex | 89.G Zermelo 33.B 
relative closed 425.J Zermelo-Fraenkel 33.A,B 
relatively compact 425.S Severi group, Néron- 
relatively compact (in a metric space) 273.F (ofa surface) 15.D 
relatively open 425.J (ofa variety) 16.P 
removable (for a family of functions) 169.C Sewell inequality, Roepstorff-Araki- 402.G 
residual 126.H 425.N sgn P (sign) 103.A 
resolvent (of a closed operator) 251.F shadow costs 292.C 
resolvent (of a linear operator) 390.A shadow price 255.B 
p- 308.1 Shafarevich group, Tate- 118.D 
S- 308.1] Shafarevich reciprocity law 257.H 
saddle 126.E shallow water wave 205.F 
scattered 425.0 shape 
of the second category 425.N pointed 382.A 
of the second kind 319.B same 382.A 
semipolar 261.D shape category 382.A 
Sidon 192.R 194.R shape dominate 382.A 
sieved 22.B shape function 223.G 
Ml 22A shape group 382.C 
zr 7D shape invariant(s) 382.C 
singularity (of a proper meromorphic mapping) shape morphism 382.A 

23.D shape theory 382 
stable 173.D Shapiro-Lopatinskii condition 323.H 
stable, externally 186.1 Shapley value 173.D 


stable, internally 186.1 sheaf (sheaves) 383 


Subject Index 
Sheaf space 


(in étale (Grothendieck) topology) 16.AA 
of Abelian groups 383.B 
analytic 72.E 
associated with a presheaf 383.C 
Cech cohomology group with coefficient 
383.F 
coherent, of rings 16.E 
coherent algebraic 16.E 72.F 
coherent analytic 72.E 
constructible 16.AA 
cohomology group with coefficient 383.E 
constant 383.D 
derived 125.W 
flabby 383.E 
of germs of analytic functions 383.D 
of germs of analytic mapping 383.D 
of germs of continuous functions 383.D 
of germs of differentiable sections of a vector 
bundle 383.D 
of germs of differential forms of degree of r 
383.D 
of germs of functions of class CŒ” 383.D 
of germs of holomorphic functions (on an 
analytic manifold) 383.D 
of germs of holomorphic functions (on an 
analytic set) 23.C 
of germs of holomorphic functions (on an 
analytic space) 23.C 
of germs of regular functions 16.B 
of germs of sections of a vector bundle 383.D 
ofgroups 383.C 
of ideals of a divisor (of a complex manifold) 
72.F 
invertible 16.E 
locally constructible (constant) 16.AA 
of @-modules 383.1 
orientation 201.R 
pre- 383.A 
pre-, on a site 16.AA 
of rings 383.C 
scattered 383.E 
structure (of a prealgebraic variety) 16.C 
structure (of a ringed space) 383.H 
structure (ofa variety) 16.B 
trivial 383.D 
sheaf space 383.C 
shear, modules of elasticity in 271.G 
shearing strain 271.G 
shearing stress 271.G 
shear viscosity, coefficient of 205.C 
sheet(s) 
hyperboloid of one 350.B 
hyperboloid of revolution of one 350.B 
hyperboloid of revolution of two 350.B 
hyperboloid of two 350.B 
mean number of (of a covering surface of 
a Riemann sphere) 272.J 
number of (of an analytic covering space) 
23.E 
number of (of a covering surface) 367.B 
sheeted, n- 367.B 
Shelah isomorphism theorem, Keisler- 276.E 
Shields-Zeller theorem, Brown- 43.C 
shift 251.0 
associated with the stationary process 136.D 
automorphism 126.J 
Bernoulli 136.D 
generalized Bernoulli 136.D 
Markov 136.D 
phase 375.E 386.B 
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shift operator 223.C 251.0 306.C 
unilateral 390.1 
shift transformation 136.D 
Shilov boundary 
(ofa domain) 21.D 
(for a function algebra) 164.C 
(of a Siegel domain) 384.D 
Shilov generalized function, Gel'fand- 125.8 
Shmul'yan theorem 424.V 
Eberlein- 37.G 
Krein- 37.E 424.0 
Shnirel'man theory, Lyusternik- 286.Q 
shock wave 205.B 446 
shortening 186.E 
shortest-path problem  281.C 
shortest representation (of an ideal) 67.F 
short exact sequence 200.I 
short international notation 92.E 
short range 375.B 
Shrikhande square 102.K 
shrinking (a space to a point) 202.E 
shunt 
left 115.B 
right 115.B 
SI (international system of units) 414.4 
side 155.B,F 
(ofan angle) 139.D 155.B 
(of a complete quadrangle) 343.C 
(onaline) 155.B 
(ona plane) 155.B 
(of a point with respect to a hyperplane) 7.D 
(ofa polygon) 155.F 
(of a spherical triangle) 432.B 
side cone 258.A 
Sidon set 192.R 
Siegel domain(s) 384 
of the first kind 384.A 
generated 384.F 
irreducible 384.E 
of the second kind 384.A 
of the third kind 384.A 
Siegel mean value theorem 182.E 
Siegel modular form of weight k (or of dimension 
—k) 32F 
Siegel modular function of degreen 32.F 
Siegel modular group of degree n 32.F 
Siegel space of degreen 32.F 
Siegel theorem 
(on Diophantine equations) 118.D 
(on positive definite forms) 348.K 
Siegel upper half-space of degree n 32.F 
Siegel zero 123.D 
Siegel zeta function of indefinite quadratic forms 
450.K 
sieve 16.AA 22.B 
Eratosthenes 297.B 
large 123.E 
large, method 123.D 
Selberg 123.E 
sieved set 22.B 
sieve method 4.A 
large 123.D 
sign (of a permutation) 103.A 
signal process 405.F 
signature 
(ofa Hermitian form) 348.F 
(of an irreducible representation of GL(V)) 
60.D 
(ofa knot) 235.C 
(ofa manifold) 56.G 
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(of a quadratic form) 348.C 
Hirzebruch, theorem 72.K 
signed Lebesgue-Stieltjes measure 166.C 
signed measure 380.C 
signed rank test 371.B 
signed rank test, Wilcoxon 371.B 
sign test 371.B 
similar 
(central simple algebra) 29.E 
(linear representation) 362.C 
(matrix representation of a semilinear mapping) 
256.P 
(permutation representation) 362.B 
(projective representation) 362.J 
(square matrices) 269.G 
similar central simple algebras 29.E 
similar correspondence (between surfaces) 111.1 
similarity 
(of an affine space) 7.E 
Prandtl-Glauert law of 205.D 
Reynolds law of 205.C 
von Karman transonic 205.D 
similarly isomorphic (ordered fields) 149.N 
similar mathematical systems 409.B 
similar test 400.D 
similar unitary representations 437.A 
simple 
(A-module) 277.H 
(Abelian variety) 3.B 
(algebraic group) 13.L 
(eigenvalue) 390.A,B 
(function) 438.A 
(Lie algebra) 248.E 
(Lie group) 249.D 
(linear representation) 362.C 
(polygon) 155.F 
(subcoalgebra) 203.F 
absolutely (algebraic group) 13.L 
algebraically (eigenvalue) 390.B 
almost (algebraic group) 13.L 
geometrically (eigenvalue) 390.A 
k- (algebraic group) 13.0 
k-almost (algebraic group) 13.0 
simple algebra 29.A 
central 29.E 
normal 29.E 
zeta function of 27.F 
simple arc 93.B 
simple Bravais lattice 92.E 
simple character (of an irreducible representation) 
362.E 
simple closed curve 93.B 
simple component (of a semisimple ring) 368.G 
simple continued fraction 85.A 
simple convergence, abscissa of (of a Dirichlet series) 
121.B 
simple distribution, potential of 338.A 
simple extension (of a field) 149.D 
simple function 221.B 443.B 
simple group 190.C 
linear 151.I 
Tits 151.1 
simple harmonic motion 318.B 
simple holonomic system 274.H 
simple homotopy equivalence 65.C 
simple homotopy equivalent 65.C 
simple homotopy theorem  65.C 
simple hypothesis 400.A 
simple Lie algebra 248.E 
classical compact real 248.T 





Subject Index 
Simplicial homology group 


classical complex 248.8 
exceptional compact real 248.T 
exceptional complex 248.8 
simple Lie group 249.D 
classical compact 249.L 
classical complex 249.M 
exceptional compact 249.L 
exceptional complex 249.M 
simple loss function ` 208 A 
simple model 403.F 
simple pair (of an H-space and an H-subspace) 
202.L 
n- (of topological spaces) 202.L 
simple path 186.F 
simple point 
(on an algebraic variety) 16.F 
(of an analytic set) 23.B 418.A 
simple ring 368.G 
quasi- 368.E 
simple root 
(of an algebraic equation) 10.B 
(in a root system) 13J 
(of a semisimple Lie algebra) 248.M 
simple series 379.E 
simple spectrum 390.G 
simplest alternating polynomial 337 
simplest Chebyshev g-function 19.G 
simplest orthogonal polynomial 19.G 
simple type theory 411.K 
simplex 
(in an affine space) 7.D 
(ofa complex) 13.R 
(in a locally convex space) 424.U 
(in a polyhedron of a simplicial complex) 
70.C 
(in a simplicial complex) 70.C 
(of a triangulation) 70.C 
degenerate (in a semisimplicial complex) 
70.D 
n- (in a Euclidean simplicial complex) 70.B 
n- (in a semisimplicial complex) 70.E 
n- (in a simplicial complex) 70.C 
open (in an affine space) 7.D 
open (in the polyhedron of a simplicial 
complex) 70.C 
ordered (in a semisimplicial complex) 70.E 
ordered (in a simplicial complex) 70.C 
oriented q- 201.C 
oriented singular r-, of class C”  105.T 
singular n- (in a topological space) 70.E 
simplex method 255.C 
two-phase 255.C 
simplex tableau 255.C 
simplicial approximation (to a continuous mapping) 
70.C 
simplicial approximation theorem  70.C 
simplicial chain complex, oriented | 201.C 
simplicia] complex(es) 65.A 70.C 
abstract 70.C 
countable 70.C 
Euclidean 70.B 
finite 70.C 
isomorphic 70.C 
locally countable 70.C 
locally finite 70.C 
ordered 70.C 
simplicial decomposition (of a topological space) 
70.C 
simplicial division 65.A 
simplicial homology group 201.D 


Subject Index 
Simplicial mapping 


simplicial mapping (map)  70.C 
(between polyhedra) 70.C 
(relative to triangulations) 70.C 
simplicial pair 201.L 
simply connected (space) 79.C 170 
simply connected covering Lie group (of a Lie 
algebra) 249.C 
simply connected group (isogenous to an algebraic 
group) 13.N 
simply convergent sequence 435.B 
simply elliptic (singularity) 418.C 
simply invariant (subspace) 164.H 
simply periodic function 134.E 
simply transitive (G-set) 362.B 
Simpson formula, Milne- 303.E 
Simpson irule 299.A 
Simpson 2 rule 299.A 
simulation 307.C 385 
analog 385.A 
in the narrow sense 385.A 
system 385.A 
simultaneous distribution 342.C 
simultaneous equations 10.A 
sin (sine) 131.E 
sin! I3LE 
sine(s) 432.A 
hyperbolic 131.F 
integral 167.D 
laws of 432.A, App. A, Tables 2.II 2.III 
laws of (on spherical trigonometry) 432.B 
sine curve 93.H 
Sine-Gordon equation 387.A 
sine integral 167.D, App. A, Table 19.II 
sines and cosines, law of App. A, Table 2.III 
sine transform 160.C, App. A, Table 11.H 
sine wave 446 
Singer fixed point theorem, Atiyah- 153.C 
Singer index theorem 
Atiyah- 237.H 
equivariant Atiyah- 237.H 
single-address instruction 75.C 
single-commodity flow problem  281.F 
single integral theorem, Fourier 160.B 
single layer, potential ofa 338.A 
single-objective model 307.C 
single sampling inspection 404.C 
single-valued function 165.B 
singular 
(distribution) 374.C 
(element of a connected Lie group) 249.P 
(element of a real Lie algebra) 248.B 
(element with respect to a quadratic form) 
348.E 
(Galton-Watson process) 44.C 
(harmonic function) 193.G 
(mapping) 208.B 
(ordinal number) 270.1 
(set function} 380.C 
essentially (with respect to an analytic set) 
21.M 
of the hth species 343.D,E 
u- 380.C 
relative, homology group  201.L 
singular cardinal problem — 33.F 
singular chain complex (of a topological space) 
201.E 
singular cochain complex 201.H 
singular cohomology group 201.H 
singular cohomology ring 201.1 
singular complex (of a topological space) 70.E 
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singular fiber 72.K 
singular homology group 201.E,G,L,R 
integral 201.E 
singular initial value problem (of a partial differen- 
tial equation of mixed type) 326.C 
singular inner function 43.F 
singular integral 217.J 
singular integral equation 217J 
singular integral manifold (of a differential ideal) 
428.E 
singular integral operator, Calderón-Zygmund 
217.J 251.0 
singularity (singularities) 51.C 198.M 
algebraic 198.M 
of an analytic function 198.M 
cusp 418.C 
direct transcendental (of an analytic function in 
the wider sense) 198.P 
elliptic 418.C 
essential (of a complex function) 198.D 
fixed (of an algebraic differential equation) 
288.A 
indirect transcendental (of an analytic function 
in the wider sense) 198.P 
isolated (of an analytic function) 198.D 
isolated (of a complex function) 198.M 
logarithmic (of an analytic function) 198.M 
logarithmic (of an analytic function in the wider 
sense) 198.P 
movable (of an algebraic differential equation) 
288.A 
ordinary (of an analytic function in the wider 
sense) 198.P 
principle of condensation of 37.H 
propagation of 325.M 
quotient 418.C 
rational 418.C 
regular (of a coherent &-module)  274.H 
removable (of a complex function) 198.D 
removable (of a harmonic function) 193.L 
resolution of 16.L 23.D 418.B 
space of 390.E 
theory of 418 
transcendental (of an analytic function in the 
wider sense) 198.P 
two-dimensional 418.C 
singularity set (of a proper meromorphic mapping) 
23D 
singularity spectrum (of a hyperfunction) 125.CC 
274.E 
singularity theorem (in physics) 359.F 
singular kernel 217.3 
singular locus (of a variety) 16.F 
singular n-simplex (in a topological space) 70.E 
singular orbit 431.C 
singular part (of a Laurent expansion) 198.D 
singular perturbation 289.E 
singular point 
(of an algebraic variety) 16.F 
(of an analytic set) 23.B 418.A 
(of a continuous vector field) 153.B 
(ofa curve of class CH) 93.G 
(ofaflow) 126.D 
(of a linear difference equation) 104.D 
(of a plane algebraic curve) 9.B 
(of a polyhedron) 65.B 
(of a quadratic hypersurface)  343.E 
(of a surface in E?)  111J 
(of a system of linear ordinary differential 
equations) 254.A 
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(of a system of ordinary differential equations) 
126.G 289.A 
apparent (of a system of linear ordinary dif- 
ferential equations) 254.C 
hyperbolic 126.G 
irregular (of a solution) 254.B 
irregular (of a system of linear ordinary dif- 
ferential equations) 254.B 
isolated 198.D 
left (of a diffusion process) 115.B 
regular (of a solution) 254.B 
regular (of a system of linear ordinary dif- 
ferential equations) 254.B 
right (of a diffusion process) 115.B 
singular projective transformation 343.D 
of the hth species 343.D 
q-cochain 201.H 
q-simplex 201.E 
singular quadric hypersurface of the hth species (in a 
projective space) 343.E 
singular r-chain of class C?  105.T 
singular r-cochain of class C^  105.T 
singular r-simplex of class C”, oriented 105. T 
singular series 4.D 
singular solution 
(of a differential ideal) 428.E 
(of a general partial differential equation) 
320.C 
(of an ordinary differential equation) 313.A, 
App. A, Table 14.I 
(of a partial differential equation) 320.C 
totally (with respect to a quadratic form) 
348.E 
singular spectrum 345.A 390. E 
(of ahyperfunction) 125.CC 274.E 
singular subspace 343.D 
singular subspace, totally 348.E 
singular support 
(of a distribution) 112.C 
(of a hyperfunction) 125.W 
singular value 302.A 
singular value decomposition (SVD) 302.E 
sinh (hyperbolic sine) 131.F 
sink 126.G 281.C 
sinusoid 93.D 
sinusoidal wave 446 
site 16.AA 
étale 16.AA 
flat 16.AA 
presheafon 16.AA 
Zariski 16.AA 
site percolation process 340.D 
6j-symbol 353.B 
size 
(of a balanced array) 102.L 
(complexity of computation) 71.A 
(ofa population) 397.B 
(of a random sample) 396.B 
(ofa sample) 401.E 
(ofa test) 400.A 
block 102.B 
sample 373.A 
step 303.B 
skeleton 
(of a domain in C")  21.C 
r- (of a Euclidean complex) 70.B 
skew field 149.A 368.B 
skew-Hermitian form 256.Q 
skew-Hermitian matrix 269. 
skew h-matrix 269.1 





Subject Index 
Smooth boundary, domain with 


skewness 396.C 397.C 
coefficient of 341.H 
skew product (of measure-preserving transforma- 
tions) 136.D 
skew surface 111.1 
skew-symmetric (multilinear mapping) 256.H 
skew-symmetric matrix 269.B 
skew-symmetric tensor 256.N 
Skitovich-Darmois theorem 374.H 
Skolem-Lówenheim theorem  156.E 
Skolem paradox 156.E 
Skolem theorem on the impossibility of characteriz- 
ing the system of natural numbers by axioms 
156.E 
slackness, Tucker theorem on complementary 
255.B . 
slack variable 255.A 
slant product 
(ofa cochain and a chain) 201.K 
(of a cohomology class and a homology class) 
201.K 
Slater constraint qualification 292.B 
slender body theorem 205.B 
slice knot 235.G 
slice representation 431.C 
slice theorem, differentiable 431.C 
slicing theorem, watermelon- 125.DD 
slide rule 19.A 
sliding block code 213.E 
slit (of a plane domain) 333.A 
slit domain 333.A 
slit mapping 
extremal horizontal 367.G 
extremal vertical 367.G 
slope function 46.C 
slowly increasing C?-function 125.0 
slowly increasing distribution 125.N 
slowly increasing function in the sense of Deny 
338.P 
slowly increasing sequences 168.B 
slow wave 259 
Smale condition C, Palais- 279.E 286.Q 
Smale diffeomorphism, Morse- 126.3 
Smale flow, Morse- 126.J 
Smale theorem, Sard- 286.P 
Smale vector field, Morse- 126J 
small-displacement theory of elasticity 271.G 
smaller topology 425.H 
small inductive dimension (ind) 117.B 
small numbers, law of 250.B 
small sample 401.F 
small set of order U 436.G 
smashing (a space to a point) 202.E 
smash product 202.F 
Smirnov test, Kolmogorov- 317.F 
Smirnov test statistic, Kolmogorov- 374.E 
Smirnov theorem 250.F 
Smith conjecture 235.E 
Smith convergence, Moore- 87.H 
Smith theorem 431.B 
smooth 
(function) 106.K 
(measure for a Riemann metric) 136.G 
(morphism of schemes) 16.F 
(point of a variety) 16.F 
piecewise (curve) 364.A 
in the sense of A. Zygmund 168.B 
uniformly (normed linear space) 37.G 
smooth boundary, domain with (in a C?-manifold) 
105.U 


Subject Index 
Smooth characteristic class of foliations 


smooth characteristic class of foliations 154.G 
smoothing (of a combinatorial manifold) 114.C 
smoothing problem  114.C 
smooth invariant measure 126.J 
smooth manifold 105.D 114.B 
smooth structure 114.B 
smooth variety 16.F 
sn 134J, App. A, Table 16.III 
Snapper polynomial 16.E 
Sobolev-Besov embedding theorem — 168.B 
Sobolev inequality, Hardy-Littlewood- |. 224.E 
Sobolev space 168.B 
software 75.C 
sojourn time density 45.G 
solenoidal (vector field) 442.D 
solid geometry 181 
solid harmonics 393.A 
solid n-sphere 140 
solid sphere 140 
topological 140 
solitary wave 387.B 
soliton | 387.B 
solution 
(of equations of neutral type) 163.H 
(of a functional-differential equation) 163.C 
(of an inequality) 211.A 
(of an ordinary differential equation) 313.A 
(of a partial differential equation) 320.A 
(of partial differential equations of first order) 
App. A, Table 151 
(of partial differential equations of second 
order) App. A, Table 15.III 
(of a system of differential equations) 313.B 
(of a system of linear equations) 269.M 
(of a system of partial differential equations) 
428.B 
algebraic (of an algebraic equation) 10.D 
asymptotic 325.L 
basic 255.A 
basic feasible 255.A 
basic optimal 255.A 
Bayes 398.B 
Bayes, in the wider sense 398.B 
of boundary value problems App. A, Table 
15.VI 
of the Cauchy problem 325.D 
classical (to Plateau’s problem) 275.C 
complete (of partial differential equations) 
320.C 
d'Alembert 325.D 
Douglas-Rado (to Plateau’s problem) 275.C 
elementary (of a differential operator) 112.B 
elementary (of a linear partial differential 
operator) 320.H 
elementary (of partial differential equations 
of elliptic type) 323.B 
elementary (of a partial differential operator) 
App. A, Table 15.V 
equilateral triangle 420.B 
feasible (of a linear equation in linear program- 
ming) 264.A 
formal (for a system of ordinary differential 
equations) 289.C 
fundamental (of a Cauchy problem) 325.D 
fundamental (of a differential operator) 112.B 
fundamental (of an evolution equation) 189.C 
fundamental (of a linear parabolic equation 
with boundary conditions) 327.F 
fundamental (of a linear partial differential 
operator) 320.H 
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fundamental (of a partial differential equation 
of parabolic type) 327.D 

fundamental (of partial differential equations 
of elliptic type) 323.B 

fundamental (of a partial differentia] operator 
with C*-coefficients) 189.C 

fundamental system of (of a homogeneous 
linear ordinary differential equation) 252.B 

fundamental system of (of a homogeneous 
system of linear differential equations of first 
order) 252.H 

general (of a differential equation) 313.A 

general (of a general partial differential equa- 
tion) 320.C 

general (of a nonhomogeneous linear difference 
equation) 104.D 

general (of partial differential equations) 
320.C 

general (of a system of differential equations) 
313.C 

general (of a system of partial differential 
equations) 428.B 

generalized Bayes 398.B 

genuine 323.G 

half-periodic (of the Hill equation) 268.E 

Hill's method of 268.B 

Hopf's weak 204.C 

inner 25.B 

Kirchhoff 325.D 

to the martingale problem  115.C 

maximum 316.E 

minimax 398.B 

minimum 316.E 

Nash bargaining 173.C 

numerical (of algebraic equations) 301 

numerical (of integral equations) 217.N 

numerical (of linear equations) 302 

numerical (of ordinary differential equations) 
303 

numerical (of partial differential equations) 
304 

optimal (of a linear programming problem) 
255.A 

optimal (of a nonlinear programming problem) 
292.A 

ordinary (of a differential ideal) 428.E 

outer 25.B 

particular (of a differential equation) 313.A 

particular (of partial differential equations) 
320.C 

particular (for a system of differential equations) 
313.C 

pathwise uniqueness of 406.D 

periodic (of the Hill equation) 268.E 

Perron-Brelot (of the Dirichlet problem) 
120.C 

Perron-Wiener-Brelot (of the Dirichlet problem) 
120.C 

Poisson 325.D 

primary (of a homogeneous partial differential 
equation) 320.E 

primitive (of a partial differential equation) 
320.E 

principal 104.B 

by quadrature App. A, Table 14.I 

quasiperiodic (of the Hill equation) 268.B 

by radicals (of an algebraic equation) 10.D 

regular (of a differential ideal) 428.E 

singular App. A, Table 14.I 

singular (df a differential ideal) 428.E 
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singular (of a general partial differential equa- 
tion) 320.C 


Subject Index 
Space(s) 


south pole 74.D 140 
space(s) 381.B 


singular (of an ordinary differential equation) 
313.A, App. A, Table I4.I 
singular (of partial differential equations) 
320.C 
stable (of the Hill equation) 268.E 
straight line 420.B 
strong (of Navier-Stokes equation) 204.C 
strong (of stochastic differential equations) 
406.D 
system of fundamental (of a system of linear 
homogeneous equations) 269.M 
trivial (of a system of linear homogeneous 
equations) 269.M 
unique strong 406.D 
uniqueness theorem of (of systems of linear 
differential equations of the first order) 
316.D,G 
unstable (of the Hill equation) 268.E 
von Neumann-Morgenstern 173.D 
weak 204.C 323.G 3781 
solution curve (of ordinary differential equations) 
316.A 
solution operator 163.E 
solvability, Cartan's criterion of 248.F 
solvable 
(ideal of a Lie algebra) 248.C 
(Lie algebra) 248.C 
(Lie group) 249.D 
(by a Turing machine) 71.B 
by radicals 172.H 
solvable algebra |. 231.A 
solvable algebraic group  13.F 
k- IXF 
solvable group 190.1 
finite 151.D 
generalized 190.K 
n- lSLF 
solve 
(a conditional inequality) 211.A 
(by means of a Turing machine) 71.E 
(an ordinary differential equation) 313.A 
(a partial differential equation) 320.A 
(a system of algebraic equations) 10.A 
(a triangle) 432.A 
Sommerfeld formula App. A, Table 19.III 
Kneser- App. A, Table 19.III 
Sommerfeld radiation condition 188.D 
Sonine formula, Weber- App. A, Table 19.III 
Sonne polynomials 317.D, App. A, Table 20.VI 


Sonine-Schafheitlin formula App. A, Table 19.III 


SOR (successive overrelaxation) 302.C 
sorting 96.C 
soudure 80.N 
sound propagation, equation of 325.A 
source 126.G 281.C 
(ofajet) 105.X 
autoregressive Gaussian 213.E 
ergodic information  213.C 
information 213.A 
stationary 213.C 
without (vector field) 442.D 
source branch 282.C 
source coding theorem 213.D 
with a fidelity criterion 213.E 
noiseless 213.D 
source coding theory 213.A 
southern hemisphere 140 


of absolute continuity 390.E 
absolutely closed 425.U 

abstract 381.B 

abstract L 310.G 

abstract L, 310.G 

abstract M 310.G 

action 208 A 

adjoint (of a topological linear space) 424.D 
affine 7.A 

affine locally symmetric 80.J 

affine symmetric 80.J 

No- 425. Y 

algebraic 16.W 

algebraic fiber 72.1 

analytic 23.C 

analytic, in the sense of Behnke and Stein 23.E 
analytically uniform  125.S 
analytic covering 23.E 

analytic measurable 270.C 
arcwise connected 79.B 
attaching 202.E 

Baire 425.N 

Baire zero-dimensional 273.B 
Banach 37.A,B 

Banach analytic 23.G 

base (of a fiber bundle) 147.B 

base (of a fiber space) 148.B 

base (of a Riemann surface) 367.A 
base for 425.F 

basic (of a probability space) 342.B 
Besov 168.B 

bicompact 408.S 

biprojective 343.H 

Boolean 42.D 

Borel 270.C 

boundary 112.E 

bundle (of a fiber bundle) 147.B 
C-analytic 23.E 

C-covering 23.E 

Cartan 152.C 

Cartesian 140 

Cech-complete 436.1 

classifying (of a topological group) 147.G,H 
closed half- 7.D 

of closed paths | 202.C 

co-echelon 168.B 

collectionwise Hausdorff 425.AA 
collectionwise normal 425.AA 
comb 79.A 

compact 425.8 

compact metric 273.F 

complete | 436.G 

completely normal 425.Q 
completely regular 425.Q 
complete measure 270.D 
complete product measure 270.H 
complete uniform 436.G 
complex, form 365.L 

complex Hilbert 197.B 

complex interpolation 224.B 
complexity 71.A 

complex projective 343.D 
concircularly flat App. A, Table 4.II 
configuration 126.L 402.G 
conformal 76.A 

conformally flat App. A, Table AU 
conjugate (of a normed linear space) 37.D 


Subject Index 
Space(s) 


conjugate (of a topologicallinear space) 424.D 


connected 79.A 

of constant curvature 364.D, App. A, Table 
AU 

of continuous mapping 435.D 
contractible 79.C 

control (in catastrophe theory) 51.B 
countable paracompact 425.Y 
countably compact 425.8 

countably Hilbertian 424.W 
countably normed 424.W 

covering 91.A 

crystallographic, group 92.A 

decision 398.A 

of decision functions 398.A 

de Sitter 359.D 

developable 425.AA 

(DF) 424.P 

Dieudonné complete topological 435.1 
Dirichlet 338.Q 

discrete metric 273.B 

discrete topological 425.C 

Douady 23.G 

dual (of a C*-algebra) 36.G 

dual (of a linear space) 256.G 

dual (of a locally compact group) 437.J 
dual (of a normed linear space) 37.D 
dual (of a projective space) 343.B 

dual (of a topological linear space) 424.D 
é- 193.N 

echelon 168.B 

eigen- 269.L 390.A 
Eilenberg-Maclane 70.F 

Einstein 364.D, App. A, Table 4.II 

of elementary events 342.B 

elliptic 285.C 

error 403.E 

estimation 403.E 

Euclidean 140 


external (in static model in catastrophe theory) 


51.B 

(F) 4241 

fiber 72.1 148.B 

finite type power series 168.B 

Finsler 152.A 

Fock (antisymmetric) 377.A 

Fock (symmetric) 377.A 

Fréchet 37.0 424.1 425.CC 

Fréchet, in the sense of Bourbaki 37.0 424.1 
Fréchet L- 87.K 

Fréchet-Uryson 425.CC 

fully normal 425.X 

function 168.A 435.D 

fundamental 125.8 

G- 178.H 431A 

general analytic 23.G 

generalized topological 425.D 
generating (of a quadric hypersurface) 343.E 
globally symmetric Riemannian 412.A 
Green 193.N 

group 92.A 

H- 203.D 

Haar 142.B 

half- (of an affine space) 7.D 

Hardy 168.B 

Hausdorff 425.Q 

Hausdorff uniform 436.C 

H-closed 425.U 

hereditarily normal 425.Q 

Hermitian hyperbolic 412.G 
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Hilbert 173.B 197.B 

Hilbert, adjoint 251.E 

Hilbert, exponential 377.D 

Holder 168.B 

holomorphically complete 23.F 

homogeneous — homogeneous space 

hyperbolic 285.C 412.H 

identification (by a partition) 425.L 

indiscrete pseudometric 273.B 

inductive limit 210.C 

infinite-dimensional 117.B 

infinite lens 91.C 

infinite type power series 168.B 

at infinity (in affine geometry) 7.B 

inner product 442.B 

internal (in static model in catastrophe 
theory) 51.B 

interpolation 224.A 

of irrational numbers 22.A 

irreducible symmetric Hermitian 412.A 

isometric 273.B 

John-Nirenberg(=BMO) 168.B 

k- 425.CC 

k’- 425.CC 

K-complete analytic 23.F 

Kawaguchi 152.C 

Kolmogorov 425.Q 

Kóthe 168.B 

Kuranishi 72.G 

Kuratowski 425.Q 

L- 87K 

L*- 87K 

Lashnev 425.CC 

lattice ordered linear 310.B 

Lebesgue (= L,(Q)) 168.B 

Lebesgue measure, with (c-) finite measure 
136.A 

left coset (of a topological group) 423.E 

left projective 343.H 

left quotient (of a topological group) 423.E 

lens 91.C 

(LF) 424.W 

of line elements of higher order 152.C 

linear — linear space 

linear topological 424.A 

Lindelöf 425.8 

Lipschitz 168.B 

locally arcwise connected 79.B 

locally compact 425.V 

locally connected 79.A 

locally contractible 79.C 204.C 

locally convex Fréchet 424.1 

locally Euclidean 425.V 

locally n-connected 79.C 

locally w-connected 79.C 

locally symmetric 364.D 

locally symmetric Riemannian 412.A 

locally totally bounded uniform 436.H 

locally trivial fiber 148.B 

local moduli, of a compact complex manifold 
72.G 

local ringed 383.H 

Loeb 293.D 

loop 202.C 

Lorentz 168.B 

Luzin 22.1 422.CC 

M- 425.Y 

(M)- 424.0 

Mackey 424.N 

mapping 202.C 435.D 
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maximal ideal (of a commutative Banach space) 
36.E 

measurable 270.C 

measure 270.D 

metric — metric space 

metric vector 256.H 

: metrizable topological 273.K 
metrizable uniform 436.F 
Minkowski 258.A 

moduli 16.W 72.G 

Moishezon 16.W 

momentum phase 126.L 
Montel 424.0 

Moor 273.K 425.AA 
n-classifying (of a topological group) 147.G 
n-connected 79.C 202.L 
n-connective fiber 148.D 
n-dimensional 117.B 

n-simple 202.L 

non-Euclidean 285.A 

normal 425.Q 

normal analytic 23.D 

normed linear 37.B 

NP- 7LE 

nuclear 424.S 

nul 251.D 

co-connected 79.C 

orbit (of a G-space) 431.A 
ordered linear 310.B 

Orlicz 168.B 

P- 425.Y 

p- 425.Y 

paracompact 425.8 

parameter (of a family of compact complex 
manifolds) 72.G 

parameter (for a family of probability measures) 
398.A 

parameter (of a probability distribution) 396.B 
partition ofa 425.L 

path 148.C 

path (of a Markov process) 261.B 
path-connected 79.B 
pathological 65.F 

Peirce 231.B 

perfectly normal 425.Q 
perfectly separable 425.P 

phase 126.B 163.C 402.C 
physical Hilbert 150.G 
pinching a set to a point 202.E 
polar 191I 

Polish 22.1273J 

precompact metric 273.B 
precompact uniform | 436.H 
pre-Hilbert 197.B 

principal (ofa flag) 139.B 
principal half- 139.B 
probability 342.B 

product 425.K 

product measure 270.H 
product metric 273.B 

product topological 425.K 
product uniform 436.E 
projective, over A 147.E 
projective limit 210.C 
projectively flat App. A, Table 4.II 
pseudocompact 425.8 
pseudometric 273.B 
pseudometrizable uniform 436.F 
Q- 425.BB 

quasi-Banach 37.0 


quasicompact 408.S 

quasidual (of a locally compact group) 437.1 

quasinormed linear 37.0 

quaternion hyperbolic 412.G 

quotient 425.L 

quotient (by a discrete transformation group) 
122.A 

quotient (of a linear space with respect to an 
equivalence relation) 256.F 

quotient (by a transformation group) 122.A 

quotient topological 425.L 

r-closed 425.U 

ramified covering 23.B 

real-compact 425.BB 

real Hilbert 197.B 

real hyperbolic 412.G 

real interpolation 224.C 

reallinear 256.A 

real projective 343.D 

reduced product 202.Q 

reflexive Banach 37.G 

regular 425.Q 

regular Banach 37.G 

representation (for a Banach algebra) 36.D 

representation (of a representation of a Lie 
algebra) 248.B 

representation (of a representation of a Lie 
group) 249.0 : 

representation (of a unitary representation) 
437.A 

Riemannian 364.A 

Riesz 310.B 

right coset (of a topological group) 423.E 

right projective 343.H 

right quotient (of a topological group) 423.E 

ringed 383H 


(S- 424.8 


sample 342.B 396.B 398.A 

scale of Banach 286.Z 

Schwartz 424.S 

separable 425.P 

separable metric 273.E 

separated 425.Q 

separated uniform 436.C 

sequential 425.CC 

sequentially compact 425.S 

sheaf 383.C 

shrinking, to a point 202.E 

Siegel, of degree n 32.F 

Siegel upper half-, of degreen 32.F 

Z- 425. Y 

o 425. Y 

o-compact 425.V 

c-finite measure 270.D 

simply connected 79.C 170 

of singularity 390.E 

smashing, to a point 202.E 

Sobolev 168.B 

Spanier cohomology theory, Alexander- 
Kolmogorov- 201.M 

spherical 285.D 

Spivak normal fiber 114.J 

standard Borel 270.C 

standard measurable 270.C 

standard vector (of an affine space) 7.A 

state (of a dynamical system) 126.B 

state (of a Markov process) 261.B 

state (in static model in catastrophe theory) 
51.B 

state (of a stochastic proccess) 407.B 


Subject Index 
Space complexity 


Stein 23.F 

stratifiable 425. Y 

strongly paracompact 425.S 
structure (of a Banach algebra) 36.D 
subbase for 425.F 

Suslin 22.1 425.CC 

symmetric Hermitian 412.E 
symmetric homogeneous 412.B 
symmetric Riemannian 412 
symmetric Riemannian homogeneous 412.B 
T, 425.Q 


T, 425.Q 
T,-uniform 436.C 
T,- 425Q 
T. 425 Q 
T, 425Q 
T, 425.Q 
T; 425.Q 


tangent 105.H 
tangent vector 105.H 
Teichmüller 416 
tensor, of degree k |. 256 
tensor, oftype(p,q) 256J 
test function 125.8 
Thom 114.G 
Tikhonov 425.Q 
time parameter 260.A 
topological — topological space 
topological complete 436.1 
topological linear 424.A 
topological vector 424.A 
total (of a fiber bundle) 147.B 
total (of a fiber space) 148.B 
totally bounded metric 273.B 
totally bounded uniform 436.H 
totally disconnected 79.D 
transformation (of an algebraic group) 13.G 
oftype S 125.T 
underlying topological (of a complex manifold) 
72.A 
underlying topological (of a topological group) 
423.A 
uniform — uniform space 
uniformizable topological 436.H 
uniformly locally compact 425.V 
uniform topological 436.C 
unisolvent 142.B 
universal covering 91.B 
universal Teichmüller 416 
vector, over K 256.A 
velocity phase 126.L 
weakly symmetric Riemannian 412.J 
well-chained metric 79.D 
wild 65.F 
space complexity 71.A 
space form 285.E 412.H 
Euclidean 412.H 
hyperbolic 412.H 
spherical 412.H 
space geometry 181 
space group 92.A 
crystallographic 92.A 
equivalent 92.A 
spacelike 258.A 359.B 
space reflection 359.B 
space-time, Minkowski 359.B 
space-time Brownian motion 45.F 
space-time inversion 258.A 
space-time manifold 359.D 
span 
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(ofa domain) 77.E 
(a linear subspace by a set) 256.F 
(of a Riemann surface) 367.G 
spanning tree 186.G 
sparse 302.C 
Späth type division theorem (for microdifferential 
operators) 274.F 
spatial («-isomorphism on von Neumann algebras) 
308.C 
spatially homogeneous (process) 261.A 
spatially isomorphic (automorphisms on a measure 
space) 136.E 
spatial tensor product 36.H 
Spearman rank correlation 371.K 
Spec (spectrum) 16.D 
special Clifford group oi D 
special divisor 9.C 
special flow 136.D 
special function(s) 389, App. A, Table 14.II 
of confluent type 389.A 
of ellipsoidal type 389.A 
of hypergeometric type 389.A 
special functional equations 388 
special isoperimetric problem 228.A 
speciality index 
(of a divisor of an algebraic curve) OC 
(of a divisor on an algebraic surface) 15.D 
v- (of a divisor of an algebraic curve) 9.F 
specialization 16.A 
(in étale topology) 16.AA 
special Jordan algebra 231.A 
special linear group 60.B 
(over a noncommutative field) 60.0 
of degree n over K 60.B 
projective 60.B 
projective (over a noncommutative field) 60.0 
special orthogonal group 60.1 
complex 60.I 
over K with respect toQ 60.K 
special principle of relativity 359.B 
special relativity 359.B 
special representation (of a Jordan algebra) 231.C 
special surface 110.A 
special theory of perturbations 420.E 
special theory of relativity 359.A 
special unitary group 60.F 
(relative to an e- Herman form) 60.0 
over K 60.H 
projective, over K 60.H 
special universal enveloping algebra (of a Jordan 
algebra) 231.C 
special valuation 439.B 
species 
ellipsoidal harmonics of the first, second, third 
or fourth 133.C 
Lamé functions of the first, second, third or 
fourth 133.C 
singular projective transformation of the hth 
343.D 
singular quadric hypersurface of the hth (in a 
projective space) 343.E 
specification 401.A 
problem of 397.P 
specific heat 
at constant pressure 419.B 
at constant volume 419.B 
specificity 346.F 
specific resistance 130.B 
spectral analysis 390.A 
spectral concentration 331.F 
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spectral decomposition 126.J 395.B 
spectral density, quadrature 397.N 
spectral functor 200J 
spectral geometry 391.A 
spectral integral 390.D 
spectral invariant 136.E 
spectrally isomorphic (automorphisms on a measure 
space) 136.E 
spectral mapping theorem 251.G 
spectral measure 390.B,K 395.B,C 
complex 390.D 
maximum 390.G 
real 390.D 
spectral method 304.B 
spectral operator 390.K 
spectral property 136.E 
spectral radius 126.K 251.F 390.A 
spectral representation 390.E 
complex 390.E 
spectral resolution 390.E 
complex 390.E 
spectral sequence 200.J 
(ofa fiber space) 148.E 
cohomology 200.J 
Hodge 16.U 
spectral synthesis 36.L 
spectral theorem 390.E 
spectrum 390.A 
(of a commutative ring) 16.D 
(of a domain in a Riemannian manifold) 391.A 
(of an element of a Banach algebra) 36.C 
(in homotopy theory) 202.T 
(ofa hyperfunction) 274.E 
(of an integral equation) 217 
(of a linear operator) 251.F 390.A 
(of a spectral measure) 390.C 
absolutely continuous 390.E 
continuous (of a linear operator) 390.A 
continuous (of an integral equation) 217.J 
countable Lebesgue 136.E 
discrete 136.E 390.E 
Eilenberg-MacLane 202.T 
essential 390.E,I 
formal (of a Noetherian ring) 16.X 
intermittent 433.C 
joint 36.M 
Kolmogorov 433.C 
for p-forms 391.B 
point 390.A 
pure point 136.E 
quasidiscrete 136.E 
residual 390.A 
simple 390.G 
singular 125.CC 345.A 390.A 
singular (of a hyperfunction) 274.E 
singularity (of a hyperfunction) 125.CC 274.E 
sphere 202.T 
stable homotopy group of the Thom  114.G 
Thom 114.G 202. T 
spectrum condition 150.D 
speed measure 115.B 
Speiser theorem, Hilbert- 172.J 
Spencer mapping (map), Kodaira- 72.G 
sphere(s) 139.1 150 
circumscribing (of a simplex) 139.I 
combinatorial, group of oriented differentiable 
structures onthe 114.1] 
complex 74.D 
g- (ofa point) 273.C 
exotic 114.B 


Subject Index 
Spinor(s) 


homotopy n- 65.C 
homotopy n-, h-cobordism group of 114.1 
horned, Alexander's 65.G 
openn- 140 

open 140 

PL(k—1) 65C 
pseudo- 111.I 

Riemann 74.D 

solid 140 

solid n- 140 

topological 140 
topological solid 140 


unit 140 
w- 74D 
z- 74.D 


sphere bundle n- 147.K 
cotangential 274.E 
normal 274.E 
tangential 274.E 
unit tangent 126.L 
sphere geometry 76.C 
sphere pair 235.G 65.D 
sphere spectrum 202.T 
sphere theorem 
(characterization of a sphere) 178.C 
(embedding in a 3-manifold) 65.E 
spherical 
(real hypersurface) 344.C 
(space form) 412.H 
spherical astronomy 392 
spherical Bessel function 39.B 
spherical coordinates 90.C, App. A, Table 3.V 
spherical derivative (for an analytic or meromorphic 
function) 435.E 
spherical excess 432.B 
spherical Fourier transform 437.Z 
spherical function(s) 393 
(on a homogeneous space) 437.X 
Laplace 393.A 
zonal (on a homogeneous space) 437.Y 
spherical G-fiber homotopy type 431.F 
spherical geometry 285.D 
spherical harmonic function 193.C 
spherical harmonics, biaxial 393.D 
spherical indicatrix (of a space curve)  111.F 
spherical modification 114.F 
spherical representation 
of a differentiable manifold 111.G 
ofaspacecurve 111.F 
of a unimodular locally compact group 437.Z 
spherical space 285.D 
spherical triangle 432.B, App. A, Table 2.III 
spherical trigonometry 432.B. 
spherical type 13.R 
spherical wave 446 
spheroidal coordinates 133.D, App. A, Table 3.V 
spheroidal wave function 133.E 
spin 132.A 258.A 415.G 
continuous 258.A 
spin and statistics, connection of 132.A 150.D 
spin ball 351.L 
spin bundle 237.F 
Spin bundle 237.F 
spin-flip model 340.C 
spin mapping (map) 237.G 
spin matrix, Pauli 258.A 415.G 
spinor(s) 61.E 
contravariant 258.A 
covariant 258.A 
dotted 258.B 


Subject Index 
Spinor group 


even half- 61.E 
mixed, of rank (k,n) 258.A 
odd half- 61.E 
undotted 258.B 
spinor group 60.1 61.D 
complex 61.E 
spinorial norm 61.D 
spinor representation (of rank k) 258.A 
spin representation 
(of SO(n) 60.3 
(of Spin(n,C)) 61.E 
even halt 61.E 
halt. 61.E 
odd half- 61.E 
spin-structure 237.F 431.D 
spin systems, lattice 402.G 
spiral 93.H 
Archimedes 93.H 
Bernoulli 93.H 
Cornu 93.H 
equiangular 93.H 
hyperbolic 93.H 
logarithmic 93.H 
reciprocal 93.H 
Spivak normal fiber space 114.J 
spline 223.F 
natural 223.F 
spline interpolation 223.F 
split 
((B, N)-pair) 151J 
(cocycle in an extension) 257.E 
(exact sequence) 277.K 
k- (algebraic group) 13.N 
K-(algebraic torus) 13.D 
k-quasi- (algebraic group) 13.0 
maximal k-, torus 13.Q 
split extension (of a group) 190.N 
splitting, Heegaard 65.C 
splitting field 
for an algebra 362.F 
for an algebraic torus 13.D 
minimal (of a polynomial) 149.G 
ofa polynomial 149.G 
splitting ring 29.K 
split torus, maximal k- 13.Q 
spot prime 439.H 
Spur 269.F 
square(s) 
Euler 241.B 
latin 241 
Latin 102.K 
least, approximation 336.D 


matrix of the sum of, between classes 280.B 


matrix of the sum of, within classes 280.B 
method of least 303.1 
middle-, method 354.B 
Room 241.D 
Shrikhande 102.K 
Youden 102.K 
Youden, design 102.K 
square-free integer 347.H 
square integrable 168.B 
square integrable unitary representation 437.M 
square matrix 269.A 
square net 304.E 
square numbers 4.D 
s.s. complex(es) (semisimplicial complex) 70.E 
geometric realization of 70.E 
isomorphic 70.E 
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s.s. mapping (semisimplicial mapping) 70.E 
realization of 70.E 
stability 286.S 303.E 394 


A- 303.G 
Ao- 303.G 
A(x- 303.G 


absolute 303.G 
conjecture 126J 
exchange of 286.T 
interval of absolute 303.G 
interval of relative 303.G 
orbital (of a solution of a differential equation) 
394.D 
principle of linearized 286.8 
region of absolute (of the Runge-K utta (P, p) 
method) 303.G 
region of relative 303.G 
relative 303.G 
stiff- 303.G 
structural 290.A 
structural, theorem 126.J 
stability group 362.B 
stability subgroup (of a topological group) 431.A 
stability theorem 
Q- 126J 
structural 126J 
stabilizer 
(in an operation of a group) 362.B 
(in a permutation group) 151.H 
(in a topological transformation group) 431.A 
reductive 199.A 
stable 394.A 
(coherent sheaf on a projective variety) 241.Y 
(compact leaf) 154.D 
(discretization, initial value problems) 304.D 
(equilibrium solution) 286.S 
(initial value problem) 304.F 
(invariant set) 126.F 
(linear function) 163.H 
(manifold) 126.G 
(minimal submanifold) 275.B 
(static model in catastrophe theory)  51.E 
absolutely 303.G 
asymptotically 126.F 286.8 394.B 
in both directions (Lyapunov stable) 394.A 
C-Q- 126H 
C’-structurally 126.H 
conditionally 394.D 
exponentially 163.G 394.B 
externally, set 186.I 
globally asymptotically 126.F 
internally, set 186.1 
Lagrange 126.E 
Lyapunov 126.F 
Lyapunov, in the positive or negative direction 
394.A 
negatively Lagrange 126.E 
negatively Poisson 126.E 
one-side, for exponent 3 App. A, Table 22 
orbitally 126.F 
Poisson 126.E 
positively Lagrange 126.E 
positively Poisson 126.E 
relatively 303.G 
uniformly 394.B 
uniformly asymptotically 163.G 394.B 
uniformly Lyapunov 126.F 
stable cohomology operation 64.B 
stable curve 9.K 
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stable distribution 341.G 
quasi- 341.G 
semi- 341.G 
stable homotopy group 202.T, App. A, Table 6.VII 
(of Thom spectrum) 114.G 
of classical groups 202.V 
of k-stem 202.U 
stable manifold 126.G,J 
stable point 16.W 
stable primary cohomology operation 64.C 
stable process 5.F 
exponent of 5.F 
one-sided, of the exponent a 5.F 
strictly 5.F 
symmetric 5.F 
stable range (of embeddings) 114.D 
stable reduction 
(of an Abelian variety) 3.N 
(ofacurve) 9.K 
potential (of an Abelian variety) 3.N 
stable reduction theorem 3.N OK 
stable secondary cohomology operation 64.C 
stable sei 173.D 
externally 186.1 
internally 186.1 
stable solution (of the Hill equation) 268.E 
stable state 260.F 394.A 404.A 
stable vector bundle 
(algebraic) 16.Y 
(topological) 237.B 
stably almost complex manifold 114.H 
stably equivalent (vector bundles) 237.B 
stably fiber homotopy equivalent 237.1 
stably parallelizable (manifold) 114.1 
stack 96.E 
stage method, (P + 1)- 303.D 
stalk (of a sheaf over a point) 16.AA 383.B 
standard 
(in nonstandard analysis) 293.B 
(transition probability) 260.F 
standard Borel space 270.C 
standard complex (of a Lie algebra) 200.0 
standard defining function 125.Z 
standard deviation 
(characteristics of the distribution) 397.C 
(of a probability distribution) 341.B 
(of a random variable) 342.C 
population 396.C 
sample 396.C 
standard form 241.A 
(of a difference equation) 104.C 
of the equation (of a conic section) 78.C 
Legendre-Jacobi (of an elliptic integral) 134.A, 
App. A, Table 16.I 
standard Gaussian distribution 176.A 
standard Káhler metric (of a complex projective 
space) 232.D 
standard measurable space 270.D 
standard normal distribution 341.D 
standard parabolic k-subgroup | 13.Q 
standard part (in nonstandard analysis) 293.D 
standard q-simplex 201.E 
standard random walk 260.A 
standard resolution (of Z) 200.M 
standard sequence 400.K 
standard set 22.I 
standard vector space (of an affine space) 7.4 
star — also * 
(in a complex) 13.R 
(in a Euclidean complex) 70.B 





Subject Index 
Stationary random distribution 


(in a projective space) 343.B 
(in a simplicial complex) 70.C 
(of a subset defined by a covering) 425.R 
open 70.B,C 
star body, bounded 182.C 
star convergence 87.K 
(ot 87.L 
relative uniform 310.F 
star-finite (covering of a set) 425.R 
star-finite property 425.S 
star refinement (of a covering) 425.R 
star region 339.D 
starting values (in a multistep method)  303.E 
start node 281.D 
star topology, weak (of a normed linear space) 
37.E 424.H 
state(s) 
(ofa C*-algebra) 308.D 
(in Ising model) 340.B 
(in quantum mechanics) 351.B 
bound 351.D 
ceiling 402.G 
completeness of the scattering 150.D 
equation of 419.A 
equilibrium 136.H 340.B 419.A 
even 415.H 
fictitious 260.F 
final 31.B 
Gibbs 340.B 
ground 402.G 
in- 150.D 386.A 
initial 31.B 
instantaneous 260.F 261.B 
internal 31.B 
odd 415.H 
out- 150.D 386.A 
scattering 395.B 
stable 260.F 394.A 404.A 
stationary 340.C 351.D 
of statistical control 404.A 
sum over 402.D 
unstable 394.A 
state estimator 86.E 
state space  126.B 
(in catastrophe theory) 51.B 
(ofa dynamical system) 126.B 
(of a Markov process) 261.B 
(ofa stochastic process) 407.B 
state-space approach 86.A 
state variable 127.A 
static model (in catastrophe theory) 51.B 
stable 51.B 
stationary capacity 213.F 
stationary curve 
(of the Euler-Lagrange differential equation) 
324.E 
(of a variation problem) 46.B 
stationary function 46.B 
stationary iterative process, linear 302.C 
stationary phase method 30.B 
stationary point (of an arc of class C^)  111.D 
stationary Poisson point process 407.D 
stationary process(es) 342.A 395 
shift associated with 136.D 
strictly 395.A 
strongly 395.A 
weakly 395.A 
weakly, of degree k 3951] 
in the wider sense 395.A 
stationary random distribution 


Subject Index 
Stationary source 


strictly 395.H 
strongly 395.F,H 
weakly 395.C 
stationary source 213.C 
stationary state 340.C 351.D 
stationary value (of a function) 106.L 
stationary variational inequality 440.B 
stationary wave 446 
statistic 396 
ancillary 396.H 401.C 
Hotelling’s T? 280.B 
invariant 396.H 
Kolmogorov-Smirnov test 374.E 
maximal invariant 396.1 
minimal sufficient 396.1 
n-dimensional 396.B 
necessary and sufficient 396.E 
l-dimensional 396.B 
order 396.C 
selection 396.F 
t- 374.B 
U- 3741 
statistical control, state of  404.A 
statistical data analysis 397.A 
statistical decision function 398 
statistical decision problem 398.A 
statistical decision procedure 398.A 
statistical estimation 399, App. A, Table 23 
statistical experiment 398.G 
statistical genetics 40.B 
statistical hypothesis 400.A 
statistical hypothesis testing 400, App. A, Table 23 
statistical inference 401 
statistical model 403 
statistical mechanics 342.A 402 
classical 402.A 
equilibrium | 402.A 
of irreversible processes 402.A 
Markov 340.C 
quantum 402.A 
statistical planning 102.A 
statistical quality control 404 
statistical structure 396.E 
dominated 396.F 
statistical thermodynamics 402.A 
statistics 397.C 
Bose 377.B 402.E 
Fermi 377.B 402.E 
statistics and spin, connection of 150.D 
Staudt algebra 343.C 
Steenrod algebra 64.B 
Steenrod axioms, Eilenberg- 201.Q 
Steenrod isomorphism theorem, Hurewicz- 148.D 
Steenrod operator App. A, Table 6.II 
Steenrod pth power operation 64.B 
Steenrod square operation 64.B 
steepest descent, curve of 46.A 
steepness, wave 205.F 
Stein, analytic space in the sense of Behnke and 
23.E 
Steinberg formula (on representation of compact Lie 
groups) 248.Z 
Steinberg group 237.J 
Steinberg symbol 237.J 
Steinberg type 13.0 
Stein continuation theorem, Remmert- 23.B 
Stein decomposition, Fefferman- 168.B 
Steiner problem 179.A 
Steiner symmetrization 228.B 
Steinhaus theorem, Banach- 
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(in a Banach space) 37.H 
(in a topological linear space) 424.J 
Stein lemma, Hunt- 400.F 
Stein manifold 21.L 
fundamental theorems of 21.L 72.E 
Stein space 23.F 
Stein theorem, Behnke- 21.H 
Stein theorem, Lehmann- 400.B 
step, fractional 304.F 
step-by-step method 163.D 
step-down operator 206.B 
step size (in numerical solution) 303.B 
step-up operator 206.B 
stereographic projection 74.D 
Stiefel manifold 199.B 
complex 199.B 
infinite 147.I 
of k-frames 199.B 
of orthogonal k-frames 199.B 
real, of k-frames 199.B 
real, of orthogonal k-frames 199.B 
Stiefel- Whitney class 
(of a differentiable manifold) 56.F 147.M 
(of an O(n)-bundle) 147.M 
(of an R"-bundie) 56.B 
(of a topological manifold) 56.F 
total 56.B 
universal 56.B 
Stiefel-Whitney number 56.F 
Stieltjes integral 94.E 
Lebesgue- 94.C 166.C 
Riemann- 94.B 166.C 
Stieltjes measure, Lebesgue- 166.C 270.L 
Stieltjes moment problem 240.K 
Stieltjes theorem  133.C 
Stieltjes transform 220.D 
Fourier- 192.B,0 
Laplace- 240.A 
Stiemke theorem 255.B 
stiff 303.G 
inan interval 303.G 
stiffness matrix 304.C 
stiffness ratio 303.G 
stiff-stability 303.G 
stimulus-sampling model, Estes 342.H 346.G 
Stirling formula 174.A 212.C, App. A, Table 17.1 
Stirling interpolation formula App. A, Table 21 
Stirling number of the second kind 66.D 
stochastically larger (random variable) 371.C 
stochastic calculus 406.A 
stochastic control 342.A 405 
stochastic differential 406.C 
stochastic differential equation 342.A 
linear (LSDE) 405.G 
of Markovian type 406.D 
stochastic differential of Stratonovich type  406.C 
stochastic filtering 324.A 405.F 
stochastic inference, graphical method of 19.B 
stochastic integral 261.E 406.B 
of Itô type 406.C 
of Stratonovich type 406.C 
stochastic Ising model 340.C 
stochastic matrix 260.A 
stochastic maximum principle 405.D 
stochastic model 264 
stochastic moving frame 406.G 
stochastic paper 19.B 
stochastic process(es) 342.A 407 
generalized 407.C 
with stationary increments of ordern 395.1 
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stochastic programming 264.C 307.C 408 
two-stage 408.A 
Stoilow compactification, Kerékjártó- 207.C 
Stoilow type (compactification) 207.B 
Stokes approximation 205.C 
Stokes assumption 205.C 
Stokes differential equation 167.E 188.E 
Stokes equation, general Navier- 204.F 
Stokes equation, Navier- 204.B 205.C 
Stokes formula 94.F 105.0, App. A, Table 31 
Green- 94.F 
Stokes initial value problem, Navier- 204.B 
Stokes multiplier 254.C 
Stokes phenomenon 254.C 
Stokes theorem App. A, Table 3.III 
Stokes wave 205.F 
Stolz, differentiable in the sense of 106.G 
Stolz path (in a plane domain) 333.B 
Stone-Cech compactification 207.C 425.T 
Stone-Gel'fand theorem  168.B 
Stone integrable function, Daniell- 310.1 
Stone integral, Daniell- 310.1 
Stone theorem 378.C 425.X 437.R 
Stone theorem, Weierstrass- 168.B 
Stone-Titchmarsh-Kodaira theory, Weyl- 112.0 
stopping, optimal 405.E 
stopping rule 398.F 
stopping time 261.B 407.B 
storage, push-down 96.E 
stored program principle 75.B 
Stormer inequality, Powers- 212.B 
straight angle 139.D 
straightening of the angle 114.F 
straight, G-space is 178.H 
straight line(s) 93.A 155.B 
straight line solution 420.B 
strain 271.G 
strain, shearing 271.G 
strain tensor 271.G 
strange attractor 126.N 
Strassen invariance principle 250.E 
strategic variable 264 
strategy (strategies) 33.F 108.B,C 173.C 
behavior 173.B 
local 173.B 
mixed 173.C 
pair 108.B 
pure 173 
winning 33.F 
stratifiable space | 425.Y 
stratification, Whitney 418.G 
stratified sampling 373.E 
Stratonovich type 
stochastic differential of 406.C 
stochastic integral of  406.C 
stratum (strata) 373.E 418.G 
-constant 418.E 
stream function 205.B 
streamlined (body) 205.C 
stream lines 205.B 
strength 102.L 
stress 271.G 
normal 271.G 
Shearing 271.G 
tangential 271.G 
stress tensor 150.B 271.G 
Maxwell 130.A 
strict Albanese variety 16.P 
strict implication 411.L 
strict localization 16.AA 


Subject Index 
Strongly inaccessible 


strictly concave function 88.A 
strictly convex function 88.A 
strictly decreasing function 166.A 
strictly ergodic (homeomorphism on a compact 
metric space) 136.H 
strictly G-stationary (system of random variables) 
395.1 
strictly increasing function 166.A 
strictly inductive limit (of a sequence of locally 
convex spaces) 424.W 
strictly monotone function 166.A 
(of ordinal numbers) 312.C 
strictly of Polya type (a family of probability 
densities) 374J 
strictly positive (element in E") 310.H 
strictly pseudoconvex 344.A 
strictly stable process 5.F 
strictly stationary process 395.A 
strictly stationary random distribution 395.H 
strict morphism (between topological groups) 423.J 
string 
o 248.L 
equation of a vibrating 325.A 
string model 132.C 
strip 
bicharacteristic 320.B 
characteristic 320.D 324.B 
Möbius 410.B 
strip condition 320.D 
strong (boundary component) 77.E 
strong convergence (of operators) 251.C 
strong convergence theorem (on distributions) 
125.G 
strong deformation retract 202.D 
strong dilation 251.M 
strong dual (space) 424.K 
stronger 
(equivalence relation) 135.C 
(method of summation) 379.L 
(topology) 425.H 
(uniformity) 436.E 
stronger form of Cauchy's integral theorem 198.B 
strong extension 
(of a differential operator) 112.E 
(of a differential operator with boundary condi- 
tion) 112.F 
strong infinity, axiom of 33.E 
strong integrability 443.1 
strong lacuna 325.J 
strong law of large numbers | 250.C 
strong Lefschetz theorem 16.U 
strongly, converge (in a Banach space) 37.B 
strongly acute type 304.C 
strongly closed subgroup  151.J 
strongly compact cardinal number 33.E 
strongly connected (graph) 186.F 
strongly connected components  186.F 
strongly continuous 
(Banach space-valued function) 37.K 
(in unitary representations) 437.A 
strongly continuous representation (of a topological 
space) 69.B 
strongly continuous semigroup  378.B 
strongly distinguished basis 418.F 
strongly elliptic (differential operator) 112.G 
strongly elliptic operator 323.H 
strongly embedded subgroup  151.J 
strongly exposed (of a convex set) 443.H 
strongly hyperbolic differential operator 325.H 
strongly inaccessible 33.F 312.E 


Subject Index 
Strongly inaccessible cardinal number 


strongly inaccessible cardinal number 33.E 
strongly measurable | 443.B,I 
strongly mixing automorphism  136.E 
strongly nonlinear differential equation 290.D 
strongly normal extension field 113 
strongly P-convexset 112.C 
strongly paracompact space 425.S 
strongly pseudoconvex domain 21.G 
strongly recurrent (measurable transformation) 
136.C 
strongly separated (convex sets) 89.A 
strongly stationary process 395.A 
strongly stationary random distribution 395.H 
strong Markov process 261.B 
strong Markov property 261.B 
strong maximum principle 323.C 
strong measurability 443.1 
strong operator topology 251.C 
strong rigidity theorem  122.G 
strong solution 
(of Navier-Stokes equation) 204.C 
(of a stochastic differential equation) 406.D 
unique 406.D 
strong topology 
(on a direct product space) 425.K 
(on a family of measures) 338.E 
(on a normed space) 37.E 
(on a topological linear space) 424.K 
strong transversality condition 126.J 
structural constants (of a Lie algebra) 248.C 
structural equation system, linear 128.C 
structurally stable, C"- (en 
structural stability | 290.A 
structural stability theorem  126J 
structure(s) 409 
(ofalanguage) 276.B 
almost complex 72.B 
almost contact 110.E 
almost contact metric 110.E 
almost symplectic 191.B 
analytic (in function algebras) 164.F 
analytic (on a Riemann surface) 367.A 
arithmetically equivalent 276.D 
Cauchy Riemann 344A 
CR 3444 
C’- (of a differentiable manifold) 105.D 114.A 
C'-, Haefliger 154.F 
classifying space for I; 154.E 
coalition 173.D 
compatible with C'- 114.B 
complex 105.Y 
complex (in a complex manifold) 72.A 
complex (on R^) 3.H 
complex (on a Riemann surface) 367.A 


complex analytic (in a complex manifold) 72.A 


conformal 191.B 

conformal (on a Riemann surface) 367.A 
contact 105.Y 

contact metric 110.E 

data 96.B 

deformation of complex 72.G 
differentiable 114.B 

differentiable, of class C" 105.D 
elementarily equivalent 276.D 
equations (of a Euclidean space) 111.B 
equations of (for relative components) 110.A 
foliated 105.Y 

G- 191 

T- (on a differentiable manifold) 105.Y 
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T- (on a topological space) 90.D 
I7- 154E 
DIL 154H 
group of oriented differentiable (on a com- 
binatorial sphere) 114I 
Hodge (of a vector space) 16.V 
isomorphic 276.E 
jumping of 72.G 
lacunary (of a power series) 339.E 
level n (on an Abelian variety) 3.N 
linear 96.C 
mathematical 409.B 
mixed 16.V 
Neyman 400.D 
normal 276.D 
normal analytic 386.C 
PL 65.C 
pseudogroup 105.Y 
real analytic 105.D 
smooth 114.B 
spin- 237.F 431.D 
statistical 396.E 
symplectic 219.C 
tensor field of almost complex (induced by a 
complex structure) 72.B 
topological 425.A,B 
tree 96.D 
twinning 92.D 
uniform 436.B 
structure equation 
(of an affine connection) 417.B 
(for a curvature form) 80.G 
(for a torsion form) 80.H 
linear, system | 128.C 
structure function 191.C 
structure group (of a fiber bundle) 147.B 
structure morphism 52.G 
structure sheaf 
(of a prealgebraic variety) 16.C 
(of a ringed space) 383.H 
(of a variety) 16.B 
structure space (of a Banach algebra) 36.D 
structure theorem 
(on topological Abelian groups) 422.E 
of complete local rings 284.D 
for von Neumann algebras of type III 308.1 
Sturm-Liouville operator 112.1 
Sturm-Liouville problem 315.B 
Sturm method 301.C 
Sturm theorem (on real roots of an algebraic equa- 
tion) 10.E 
Struve function App. A, Table 19.1V 
Student test 400.G 
subadditive cuts 215.C 
subadditive ergodic theorem  136.B 
subadditive functional 88.B 
subadditive process 136.B 
subalgebra 29.A 
Borel (of a semisimple Lie algebra) 248.0 
Cartan (of a Lie algebra) 248.I 
Cartan (symmetric Riemann space) 413.F 
closed (of a Banach algebra) 36.B 
Lie 248.A 
of a Lie algebra associated with a Lie subgroup 
249.D 
parabolic (of a semisimple Lie algebra) 248.0 
x- 308.C 
subbase 
for a space 425.F 
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Subset(s) 
foratopology 425.F sequences of 190.F 
subbialgebra 203.G stability 431.A 
subbundle standard parabolic k- 13.Q 
(of an algebraic vector bundle) 16.Y strongly closed 151.J 
(of a vector bundle) 147.F strongly embedded  151.J 
subcategory 52.A subnormal 190.G 
full 52.A Sylow 151.B 
subcoalgebra 203.F toroidal 248.X 
subcomplex torsion 2.A,C 
(of a cell complex) 70.D subharmonic functions 193 
(of a chain complex) 200.H 201.B almost 193.T 
(of a cochain complex) 201.H subinvariant measure 261.F 
(ofa complex) 13.R subjective probability 401.B 
(ofa Euclidean complex) 70.B sublattice 243.C 
(of a simplicial complex) 70.C submanifold 
(of an s.s. complex) 70.E (of a Banach manifold) 286.N 
chain 200.C (of a combinatorial manifold) 65.D 
cochain 200.F (ofa C?-manifold) 105.L 
subcontraction 186.E closed 105.L 
subcritical (Galton- Watson process) 44.B complex analytic 72.A 
subdifferential 88.D immersed (of a Euclidean space) 111.A 
subdivision isotropic 365.D 
(of a Euclidean complex) 70.B Káhler 365.L 
(of a simplicial complex) 70.C minimal 275 365.D 
(of a triangulation) 70.C regular 105.L 
barycentric (of a Euclidean complex) 70.B Riemannian 365.A 
barycentric (of a simplicial complex) 70.C totally geodesic 365.D 
dual (of a triangulation of a homology mani- totally real 365.M 
fold) 65.B totally umbilical 365.D 
subelliptic 112.D submartingale 262.A 
subfamily 165.D submedian 193.T 
subfield 149.B submersion 105.L 
valuation overa 439.B,C submodular 66.F 
subgraph 186.E submodule 
subgroup A- 277.C 
(ofa group) 190.C allowed |. 277.C 
(ofa topological group) 423.D complementary 277.H 
admissible 190.E homogeneous A- (of a graded A-module) 
admissible normal 190.E 200.B 
algebraic 13.A primary 284.A 
arithmetic 13.P 122.F,G subnet 87.H 
Borel 13.G 249.J cofinal 87.H 
Cartan 13.H 249.1 subnormal (operator in a Hilbert space) 251.K 
Carter 151.D subnormal subgroup 190.G 
closed 423.D subobject 52.D 
commutator 190.H subordinate 105.D 437.T 
congruence 122.D subordination 5 261.F 
connected Lie 249.D of the ath order 261.F 
cyclic 190.C subordinator of the exponent a 5.F 
divisible 422.G subproblems 215.D 
Hall 151.E subrepresentation 
invariant 190.C (of a linear representation) 362.C 
irreducible discrete 122.F (of a projective representation) 362.J 
isotropy 431.A (of a unitary representation) 437.C 
Iwahori 13.R subring 368.E 
k-Borel 13.G differential 113 
Levi- 13.Q subroutine 75.C 
Lie 249.D subscripts, raising 417.D 
maximal torsion 2.A subsequence 165.D 
minimal parabolic k- 13.Q g- 354D 
normal 190.G subset(s) 381.A 
Q- 190.E (in axiomatic set theory) 33.B 
one-parameter 249.Q analytic (of a complex manifold) 72.E 
open 423.D axiom of 33.B 381.G 
parabolic 13.G 249.J Borel 270.C 
parabolic 13.R circled (of a linear topological space) 424.E 
principal congruence, of level N 122.D cofinal 311.D 
p-Sylow 151.B G- 362.B 
rational 404.B k- 330 


Schur 362.F proper 381.A 


Subject Index 
Subshift 


residual 311.D 
subshift 126J 
of finite type | 126J 
Markov 126J 
subsidiary equation, Charpit 82.C 320.D 
subsonic (Mach number) 205.B 
subsonic flow 326.A 
subspace 
(of an affine space) 7.A 
(of a linear space) 256.F 
(of a projective space) 343.B 
(of a topological space) 425.J 
analytic 23.C,G 
closed linear (of a Hilbert space) 197.E 
complementary (of a linear subspace) 256.F 
horizontal 191.C 
ingoing 375.H 
invariant (of a linear operator) 251.L 
involutive 428.F 
linear (of a linear space) 256.F 
metric 273.B 
n-particle 377.A 
orthogonal (determined by a linear subspace) 
256.G 
orthogonal (ofa linear space) 139.G 
outgoing 375.H 
precompact (metric) 273.B 
parallel (in an affine space) 7.B 
parallel, in the narrower sense (in an affine 


Space) 7.B 
parallel, in the wider sense (in an affine space) 
7.B 


principal (of a linear operator) 390.B 
root (of a linear operator) 390.B 
root (of a semisimple Lie algebra) 248.K 
singular (of a singular projective transforma- 
tion) 343.D 
totally bounded (metric) 273.B 
totally isotropic (relative to an e-Hermitian 
form) 60.0 
totally isotropic (with respect to a quadratic 
form) 348.E 
totally singular (with respect to a quadratic 
form) 348.E 
U-invariant (of a representation space of a 
unitary representation 437.C 
uniform 436.E 
substituted distribution 125.Q 
substitution 
(of a hyperfunction) 125.X 274.E 
axiom of 381.G 
back 302.B 
Frobenius (of a prime ideal) 14.K 
subsystem 
(of an algebraic system) 409.C 
closed (of a root system) 13.L 
subtraction 361.B 
subtraction terms 361.B 
subvariety, Abelian 3.B 
successive approximation 
method of (for an elliptic partial differential 
equation) 323.D 
method of (for Fredholm integral equations of 
the second kind) 217.D 
method of (for ordinary differential equations) 
316.D 
successive minima (in a lattice) 182.C 
successive minimum points 182.C 
successive overrelaxation (SOR)  302.C 
successor 
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(of an element in an ordered set) 311.B 
(of a natural number) 294.B 
sufficiency 
prediction | 396 
principle of 401.C 
sufficient (a-field, statistic) 
Bayes 396.J 
D- 396J 
decision theoretically 396.J 
minimal 396.E 
pairwise 396.F 
test 396J 
sufficiently many irreducible representations 
437.B 
sum 
(of convergent double series) 379.E 
(of a divergent series by a summation) 379.L 
(of elements of a group) 190.A 
(of elements of a linear space) 256.A 
(a function) 104.B 
(ofideals) 67.B 
(oflinear operators) 251.B 
(of linear subspaces) 256.F 
(of matrices) 269.B 
(of ordinal numbers) 312.C 
(of potencies) 49.C 
(of real numbers) 355.A 
(of a quadrangular set of six points) 343.C 
(ofa series) 379.A 
(of submodules) 277.B 
(=union of sets) 33.B 381.B 
(of vectors) 442.A 
amalgamated 52.G 
Baer (of extensions) 200.K 
cardinal (of a family of ordered sets) 311.F 
Cauchy (of a series) 379.A 
connected (of oriented compact C?-manifolds) 
114.F 
connected (of 3-manifolds) 65.E 
constant- (game) 173.A 
Darboux 216.A 
Dedekind 328.A 
diagonal (ofa matrix) 269.F 
diagonal partial (of a double series) 379.E 
direct — direct sum 
disjoint 381.B 
fiber 52.G 
Gaussian 295.D 450.C 
general- (game) 173.A 
indefinite (of a function) 104.B 
Kloosterman 32.C 
local Gaussian 450.F 
logical (of propositions) 411.B 
ordinal (of a family of ordered sets) 311.G 
orthogonality for a finite 19.G 317.D 
partial(ofa series) 379.A 
of products 216.A 
Ramanujan 295.D. 
Riemann 216.A 
scalar (of linear operators) 37.C 
over states 402.D 
topological 425.M 
trigonometric 4.C 
Whitney (of vector bundles) 147.F 
zero (game) 173.A 
zero-, two-person game 108.B 
sum event 342.B 
summable 
A- 379.N 
by Abel’s method 379.N 
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absolute Borel 379.0 

B- 379.0 

[8B|- 379.0 

by Borel’s exponential method 379.0 

by Borel’s integral method 379.0 

by Cesàro's method of order a 379.M 

by Euler's method 379.P 

by Hólder's method of order p 379.M 

(H,p- 379.M 

by Nórlund's method 379.Q 

(R,k) 379.8 

by Riesz's method of order k 379.R 

T- 379.L 
summable pth power, operator of 68.K 
summand, direct (of a direct sum of sets) 381.E 
summation 

Abel’s method of 379.N 

Abel’s partial 379.D 

Borel’s method of 379.N 

(C,a)- 379.M 

Cesaro’s method of, of order a 379.M 

Euler’s method of 379.P 

ofafunction 104.B 

Lebesgue’s method of 379.S 

methods of 379.L 

Nórlund's method of 379.Q 

Riemann’s method of 379.8 

Riesz’s method of, of the kth order 379.R 
summation convention, Einsteins 417.B 
summation formula 

Euler 295.E 

Poisson (on Fourier transforms) 192.C 

Poisson (on a locally compact Abelian group) 

192.L 
summing, absolutely (operator) 68.N 
sum theorem for dimension 117.C 
Sundman theorem | 420.C 
sup (supremum) 311.B 
superabundance (of a divisor on an algebraic 
surface) 15.D 

superadditive 173.D 
superconductivity 130.B 
supercritical (Galton-Watson process) 44.B 
superefficient estimator 399.N 
superharmonic (function) 193.P 260.D 
superharmonic measure 260.1 
superharmonic transformation 261.F 
superior function, right 316.E 
superior limit 

(of a sequence of real numbers) 87.C 

(of a sequence of subsets of a set) 270.C 
superior limit event 342.B 
supermartingale 262.A 
supermultiplet theory 351.J 
superposition, principle of 252.B 322.C 
superregular function 260.D 
superrenormalizable 150.C 
superscripts, lowering 417.D 
superselection rule, univalence 351.K 
superselection sector 150.E 351.K 
supersolvable group  151.D 
supersonic 205.B 326.A 
supplementary angles 139.D 
supplementary interval 4.B 
supplementary series 258.C 
supplementation-equal polygons 155.F 
supplemented algebra 200.M 
support 

(of a coherent sheaf) 16.E 

(of a differential form) 105.Q 








Subject Index 
Surface(s) 


(of a distribution) 125.D 
(ofa function) 125.B 168.B 425.R 
(of a section of a sheaf) 383.C 
(of a spectral measure) 390.D 
compact (of a singular q-cochain) 201.P 
essential (of a distribution) 274.D 
singular (of a distribution) 112.C 
singular (of a hyperfunction) 125.W 
supporting function 125.0 
supporting functional (of a convex set) 89.G 
supporting half-space (of a convex set) 89.A 
supporting hyperplane (of a convex set) 89.A 
supporting line (of an oval) 89.C 
supporting line function (of an oval) 89.C 
supporting point 
(ofa convex set) 89.G 
(of a projective frame) 343.C 
supremum 
(of an ordered set) 168.B 
(of a set of Hermitian operators) 308.A 
(of a subset of a vector lattice) 310.C 
essential (of a measurable function) 168.B 
supremum norm 168.B 
supremum theorem, Hardy-Littlewood App. A, 
Table 8 
sure event 342.B 
surely, almost 342.B,D 
surface(s) 111.A 410, App. A, Table 4.1 
Abelian 15.H 
abstract Riemann 367.A 
affine minimal 110.C 
algebraic 15 
basic (of a covering surface) 367.B 
with boundary 410.B 
branched minimal 275.B 
center 111.1] 
characteristic 320.B 
circular cylindrical 350.B 
closed 410.B 
closed (in a 3-dimensional Euclidean space) 
111.1 
closed convex 111.1] 
conical 111.1 
of constant curvature 111.1 
covering 367.B 
covering, Ahlfors theory of 367.B 
covering, with relative boundary 367.B 
cylindrical 111.1 
deformation of 110.A 
degenerate quadric 350.B 
developable 111.1, App. A, Table 4.I 
Dini 111.1 
elliptic 72.K 
elliptic cylindrical 350.B 
energy 126.L 402.C,G 
Enneper 275.B 
Enriques 72.K 
enveloping 111.1 
equipotential 193J 
Fréchet 246] 
fundamental theorem of the theory of 111.G 
fundamental theorem of the topology of 410.B 
G- 178.H 
of general type 72.K 
geometry ona 111.G 
helicoidal 111.I 
Hilbert modular 15.H 
Hirzebruch 15.G 
Hopf 72.K 
hyperbolic cylindrical 350.B 


Subject Index 
Surface area of unit hypersphere 


hyperelliptic 72.K 
initial 321.A 
K3 15.H 72.K 
Kummer 15.H 
level 193J 
marked K3 72.K 
minimal 111.1 334.B 
niveau 193.J 
one-sided 410.B 
open 410.B 
parabolic cylindrical 350.B 
proper quadric 350.B 
quadric 350.A 
quadric conical 350.B 
rational 15.E 
rectifying 111.1 
response 102.L 
response, design for exploring 102.M 
of revolution 111.1 
Riemann — Riemann surface 
ruled 15.E 
ruled (in differential geometry) 111.1 
of the second class 350.D 
of the second order 350.A 
Scherk’s 275.A 
Seifert 235.A 
skew 111.1 
special 110.A 
tangent 111.F 
two-sided 410.B 
unbounded covering 367.B 
unirational 15.H 
universal covering 367.B 
unramified covering 367.B 
Veronese 275.F 
W- 1111 
Weingarten (W) 111.1 
surface area of unit hypersphere App. A, Table 9.V 
surface element 324.B 
surface harmonics 393.A 
surface integral 94.A,E 
(with respect to a surface element) 94.E 
surface wave 446 
surgery 114.F,J 
surgery obstruction 114J 
surjection 381.C 
(ina category) 52.D 
canonical (on direct products of groups) 190.L 
canonical (to a factor group) 190.D 
canonical (onto a quotient set) 135.B 
natural (to a factor group) 190.D 
surjective mapping 381.C 
survival insurance 214.B 
susceptibility 
electric 130.B 
magnetic 130.B 
suspension 
(of a discrete dynamical system) 126.C 
(of a homotopy class) 202.Q 
(ofa map) 202.F 
(ofa space) 202.E,F 
n-fold reduced 202.F 
reduced (of a topological space) 202.F 
Suslin 
Kk- 22H 
schema of 22.B 
system of 22.B 
Suslin hypothesis 33.F 
Suslin space 22.1 425.CC 
Suslin theorem 22.C 
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suspension isomorphism (on singular (co)homology 
groups) 201.E 
suspension theorem, generalized 202.T 
Suzuki group 151.1 
SVD (singular value decomposition) 302.E 
sweep (a bounded domain) 384.F 
sweepable (bounded domain) 384.F 
sweeping-out principle 338.L 
sweeping-out process 338.L 
sweep out (a measure to a compact set) 338.L 
Swinnerton-Dyer conjecture, Birch- 118.D 450.S 
Sylow subgroup 151.B 
p- 151.B 
Sylow theorem 151.B 
Sylvester elimination method 369.E 
Sylvester law of inertia (on a quadratic form) 348.B 
Sylvester theorem (on determinants) 103.F 
Symanzik equation, Callan- 361.B 
symbol 369.A 
(of a Fourier integral operator) 274.C 
(of a pseudodifferential operator) 251.0 345.B 
(—Steinberg symbol) 237.J 
(ofa vector field) 105.M 
Artin 14.K 
Christoffel 80.L 111.H 417.D, App. A, Table 
4.II 
function 411.H 
Gauss 83.A 
Hilbert s- 411 
Hilbert-Hasse norm-residue 14.R 
Hilbert norm-residue 14.R 
individual 411.H 
Jacobi 297.1 
Jacobi, complementary law of 297.1 
Jacobi, law of quadratic reciprocity of 297.1 
Kronecker 347.D 
Landau (0,0) 87.G 
Legendre 297.H 
Legendre, first complementary law of 297.1 
Legendre, law of quadratic reciprocity of 297.1 
Legendre, second complementary law of 297.1 
logical 411.B 
9j- 353.C 
norm-residue 14.Q 257.F 
power-residue 14.N 
predicate 411.H 
principal (of a differential operator) 237.H 
principal (of a microdifferential operator) 
274.F 
principal (of a simple holonomic system) 
274.H 
6j- 353.B 
Steinberg (in algebraic K-theory) 237J 
3j- 353.B 
symbolic logic 411 
symbol sequence (in microlocal analysis) 274.F 
symmetric 
(block design) 102.E 
(factorial experiment) 102.H 
(Fock space) 377.A 
(member of a uniformity) 436.B 
(multilinear mapping) 256.H 
(relation) 358.A 
(tensor) 256.N 
symmetric algebra 29.H 
symmetric bilinear form (associated with a quadratic 
form) 348.A 
symmetric bounded domain 412.F 
irreducible 412.F 
symmetric Cauchy process 5.F 
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symmetric difference 304.E 
symmetric distribution function 41 H 
symmetric event 342.G 
symmetric function 337.I 
elementary 337.1 
symmetric group 190.B 
ofdegreen 151.G 
symmetric Hermitian space 412.E 
irreducible 412.E 
symmetric homogeneous space 412.B 
symmetric hyperbolic system (of partial differential 
equations) 325.G 
symmetric kernel 217.G 335.D 
symmetric law (in an equivalence relation) 135.A 
symmetric Markov process 261.C 
symmetric matrix 269.B 
anti- 269.B 
skew- 269.B 
symmetric multilinear form  256.H 
anti- 256.H 
skew- 256.H 
symmetric multilinear mapping 256.H 
anti- 256H 
skew- 256.H 
symmetric multiplication 406.C 
symmetric operator 251.E 
symmetric points (with respect to a circle) 74.E 
symmetric polynomial 337.1 
elementary 337.] 
fundamental theorem on 337.1 
symmetric positive system 
(of differential operators) 112.S 
(of first-order linear partial differential equa- 
tions) 326.D 
symmetric product (of a topological space) 70.F 
symmetric Riemannian homogeneous space 412.B 
symmetric Riemannian space(s) 412 
globally 412A 
irreducible 412.C, App. A, Table 5.III 
locally 412.A, App. A, Table 4.II 
weakly 412J 
symmetric space 412.A 
affine 80J 
affine locally  80.J 
locally 80.J 364.D 
symmetric stable process 5.F 
symmetric tensor 256.N 
anti- 256.N 
contravariant 256.N 
covariant 256.N 
skew- 256.N 
symmetric tensor field 105.0 
symmetrization 
(in isoperimetric problem) 228.B 
Steiner (in isoperimetric problem) 228.B 
symmetrizer 256.N 
Young 362.H 
symmetry 
(at a point of a Riemannian space) 412.A 
'(ofa principal space) 139.B 
(in quantum mechanics) 415.H 
broken 132.C 
central (of an affine space) 139.B 
charge 415J 
crossing 132.C 386.B 
degree of 431.D 
hyperplanar (of an affine space) 139.B 
internal 150.B 
law of (for the Hilbert norm-residue symbol) 
14.R 
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System 


Nelson 150.F 
TCP 386.B 
symmetry group, color 92.D 
symmorphic space group 92.B 
symmorphous space group 92.B 
symplectic form 126.L 
symplectic group 60.L 151.1 
complex 60.L 
infinite 202.V 
over afield 60.L 
over a noncommutative field 60.0 
projective (over a field) 60.L 
unitary 60.L 
symplectic manifold 219.C 
symplectic matrix 60.L 
symplectic structure 219.C 
symplectic transformation 60.L 
(over a noncommutative field) 60.0 
symplectic transformation group (over a field) 
60.L 
synchronous (system of circuits) 75.B 
syndrome 63.C 
synthesis (in the theory of networks) 282.C 
spectral 36.L 
synthetic geometry 181 
system 
adjoint (of a complete linear system on an 
algebraic surface) 15.D 
adjoint, of differential equations 252.K 
algebraic 409.B 
algebraic, in the wider sense 409.B 
ample linear 16.N 
asynchronous (of circuits) 75.B 
axiom 35 
axiom (of a structure) 409.B 
axiom (of a theory) 411.1 
ofaxioms 35.B 
base for the neighborhood 425.E 
categorical (of axioms) 35.B 
C*-dynamical 36.K 
character (of a genus of a quadratic field) 
347.F 
characteristic linear (of an algebraic family) 
15.F 
Chebyshev (of functions) 336.B 
classical dynamical 126.L 136.G 
of closed sets 425.B 
complete (of axioms) 35.B 
complete (of independent linear partial differen- 
tial equations) 324.C 
complete (of inhomogeneous partial differential 
equations) 428.C 
complete (of nonlinear partial differential 
equations) 428.C 
complete linear (on an algebraic curve) 9.C 
complete linear (on an algebraic variety) 16.N 
complete linear, defined by a divisor 16.N 
completely integrable (of independent 1-forms) 
428.D 
complete orthogonal 217.G 
complete orthonormal 217.G 
complete orthonormal, of fundamental func- 
tions 217.G 
complete residue, modulom 297.G 
continuous dynamical 126.B 
coordinate 90.A 
coordinate (of a line in a projective space) 
343.C 
crystal 92.B 
determined (of differential operators) 112.R 


Subject Index 
System 


determined (of partial differential equations) 
320.F 

differentiable dynamical 126.B 
differential 191.I 

of differential equations of Maurer-Cartan 
249.R 

of differential operators 112.R 

direct (of sets) 210.B 

discrete dynamical 126.B 

distinct, of parameters 284.D 

dynamical 126 

of equations 10.A 

equilibrium, transformation to 82.D 
formal 156.D 411.I 

of functional differential equations 163.E 


fundamental (of eigenfunctions to an eigenvalue 


for an integral equation) 217.F 
fundamental (for a linear difference equation) 
104.D 
fundamental (of a root system) 13J 


fundamental, of irreducible representations (of a 


complex semisimple Lie algebra) 248.W 
fundamental, of neighborhoods 425.E 
fundamental, of solutions (of a homogeneous 

linear ordinary differential equation) 252.B 
fundamental, of solutions (of a homogeneous 

system of first-order linear differential equa- 

tions) 252.H 


fundamental root (of a semisimple Lie algebra) 


248.N 


of fundamental solutions (of a system of linear 


homogeneous equations) 269.M 
Garnier 253.E 
of generators (of a A-module) 277.D 
of gravitational units 414.B 
group 235.B 
Haar, of orthogonal functions 317.C 
Hamiltonian 126.L 
holonomic 274.H 
holonomic, with regular singularities 274.H 
homotopy equivalent (of topological spaces) 
202.F 


of hyperbolic differential equations (in the sense 


of Petrovskii) 325.G 


incompatible (of partial differential equations) 


428.B 

inconsistent (of algebraic equations) 10.A 

indeterminate (of algebraic equations) 10.A 

inductive (in a category) 210.D 

inductive (of sets) 210.B 

inductive, of groups 210.C 

inductive, of topological spaces 210.C 

inertial 271.D 359 

information retrieval 96.F 

integrable 287.A 

international, of units 414.A 

inverse (of sets) 210.B 

involutive — involutory 

involutory (of differential forms) 428.F 

involutory (of nonlinear equations) 428.C 

involutory (of partial differential equations) 
428.F l 

involutory (of partial differential equations of 
first order) 324.D 

irreducible linear 16.N 

lattice spin 402.G 

linear (on an algebraic variety) 16.N 

of linear differential equations of first order 
252.G 

linear dynamical 86.B 
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of linear equations 269.M 

of linear homogeneous equations 269.M 

linear structural equation 128.C 

linear time-varying 86.B 

local, of groups (over a topological space) 
201.R 

local coordinate (of a manifold) 105.C 

local coordinate (of a topological space) 90.D 

local coordinate, holomorphic 72.A 

mathematical (for a structure) 409.B 

maximal independent (of an additive group) 
2.E 

neighborhood 425.B 

of notations (for ordinal numbers) 81.B 

null (in projective geometry) 343.D 

number, point range of (in projective geometry) 
343.C 

of open sets 425.B 

operating 75.C 

of ordinary differential equations 313.B 

orthogonal (of functions) 317.A 

orthogonal (of a Hilbert space) 197.C 

of orthogonal functions App. A, Table 20 

of orthogonal polynomials 317.D 

overdetermined (of differential operators) 
112.R 

overdetermined (of partial differential equa- 
tions) 320.F 

of parameters 284.D 

partial de Rham 274.G 

of partial differential equations of order | (on a 
differentiable manifold) 428.F 

passive orthonomic (of partial differential 
equations) 428.B 

peripheral 235.B 

of Pfaffian equations 428.A 

polar (in projective geometry) 343.D 

Postnikov (of a CW complex) 148.D 

projective (in a category) 210.D 

projective (of sets) 210.B 

projective, of groups 210.C 

projective, of topological groups 423.K 

projective, of topological spaces 210.C 

quotient (of an algebraic system) 409.C 

Rademacher, of orthogonal functions 317.C 

reduced residue 297.G 

reducible linear 16.N 

regular, of parameters 284.D 

of resultants 369.E 

root (of a symmetric Riemann space) 413.F 

root (in a vector space over Q) 13.J 

self-adjoint, of differential equations 252.K 

simple holonomic 274.H 

of simultaneous differential equations App. A, 
Table 14.1 

of Suslin 22.B 

symmetric positive (of differential operators) 
112.5 

symmetric positive (of first-order linear partial 
differential equations) 326.D 

Tits 13.R 151. 343.1 

of total differential equations | 428.A 

of transitivity (of a G-set)  362.B 

trigonometric 159.A 

two-bin 227 

underdetermined (of differential operators) 
112.R 

underdetermined (of partial differential equa- 
tions) 320.F 

uniform covering 436.D 
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uniform family of neighborhoods 436.D 
uniform neighborhood 436.D 
unisolvent (of functions) 336.B 
of units  414.A 
very ample linear 16.N 
Walsh, of orthogonal functions 317.C 
system process 405.F 
system simulation 385.A 
syzygy 369.F 
first 369.F 
rth 369.F 
syzygy theorem, Hilbert 369.F 
Syzygy theory 200.K 369.F 
Szegó kernel function 188.H 
Szemerédi theorem, ergodic 136.C 
Sz.-Nagy-Foiag model 251.N 


T 


t-function 150.D 

t-distribution 341.D 374.B, App. A, Table 22 
noncentral 374.B 

t-statistic 374.B 

t-test 400.G 

T-bound 331.B 

T-bounded 331.B 

T-compact 68.F 331.B 

T-operator 375.C 

T?-statistic 
Hotellings 280.B 
noncentral Hotelling 374.C 

T-number (transcendental number) 430.C 

T*-number (transcendental number) 430.C 

To-space 425.Q 

T,-space 425.Q 

T,-space 425.Q 

T,-space 425.Q 

Ty-space 425.Q 

T.-space 425.Q 

T,-space 425.Q 

T-positivity 150.F 

T-set 308.1 

T-summable (series) 379.L 

T; -uniformity 436.C 

T, -uniform space 436.C 

T,-topological group 423.B 

table 
analysis-of-variance 400.H 
contingency 397.K 400.K 
difference 223.C 
k-way contingency 397.K 
Padé 142.E 

tableau, simplex 255.C 

table look-up 96.C 

tail event 342.G 

tail o-algebra 342.G 

Tait algorithm  157.C 

Tait coloring 157.C 

Takagi, Teiji 415 

Takesaki, duality theorem of 308.J 

Takesaki theory, Tomita- 308.H 


Tamagawa number (of an algebraic group) 13.P 


Tamagawa zeta function 450.L 
Tamano product theorem  425.X 
tame (knot) 235.A 
tan(—tangent) 131.E 
tan! 131.E 
Tanaka embedding 384.D 
tangent 432.A 

(of pressure) 402.G 


Subject Index 
Teichmüller space 


asymptotic 110.B 
Darboux 110.B 
hyperbolic 131.F 
law of App. A, Table 2.III 
tangent bundle 
(of a Banach manifold) 286.K 
(of a differentiable manifold) 105.H 147.F 
(of a foliation) 154.B 
tangent hyperplane (of a quadric hypersurface) 
343.E 
tangential polar coordinates 90.C 
tangential sphere bundle 274.E 
tangential stress 271.G 
tangent line 93.G 111.C,F, App. A, Table Al 
oriented 76.B 
tangent orthogonal n-frame bundle 364.A 
tangent plane 111.H, App. A, Table Al 
tangent PL microbundle 147.P 
tangent r-frame(s) 105.H 
bundle of 105.H 
tangent r-frame bundle 147.F 
tangent space 105.H 
tangent sphere bundle, unit 126.L 
tangent surface 111.F 
tangent vector 105.H 
holomorphic 72.A 
of type (0,1) 72.C 
of type (1,0) 72.C 
tangent vector bundle 105.H 147.F 
tangent vector space 105.H 
tanh (hyperbolic tangent) 131.F 
Taniyama-Weil conjecture 450.8 
Tannaka duality theorem 69.D 249.U 
target (of a jet) 105.X 
target variable 264 
Tate cohomology | 200.N 
Tate conjecture 450.S 
Tate-Shafarevich group 118.D 
Tate theorem 59.H 
Tauberian theorem  121.D 339.B 
generalized 36.L 160.G 
generalized, of Wiener 192.D 
Tauberian type, theorem of 339.B 
Tauber theorem 339.B 
tautochrone 93.H 
tautological line bundle 16.E 
tautology 411.E 
Taylor expansion 
(of an analytic function of many variables) 
21.B 
(of a holomorphic function of one variable) 
339.A 
formal 58.C 
Taylor expansion and remainder App. A, Table 
9.IV 
Taylor formula App. A, Table 9.IV 
(for a function of many variables) 106.J 
(for a function of one variable) 106.E 
Taylor series 339.A 
Taylor theorem (in a Banach space) 286.F 
TCP invariance 386.B 
TCP operator 150.D 
TCP symmetry 386.B 
TCP theorem 386.B 
technique, program evaluation and review 376 
TE wave 130.B 
Teichmüller mapping 352.C 
Teichmüller metric 416 
Teichmüller space 416 
universal 416 


Subject Index 
Telegraph equation 


telegraph equation 325.A, App. A, Table 15.III 
temperature (of states) 419.A 
absolute 419.A 
tempered distribution 125.N 
temporally homogeneous 
(additive process) 5.B 
(Markov process) 261.A 
TEM wave 130.B 
tension 281.B 
modulus of elasticity in 271.G 
tension field 195.B 
tensor 
alternating 256.N 
angular momentum 258.D 
antisymmetric 256.N 
conformal curvature App. A, Table 4.II 
contracted 256.L 
contravariant, of degree p 256.J 
correlation 433.C 
covariant, of degree q 256.3 
curvature 80.J,L 364.D 417.B 
energy-momentum 150.B 359.D 
energy spectrum 433.C 
fundamental (of a Finsler space) 152.C 
fundamental (of a Riemannian manifold) 
364.A 
Green 188.E 
irreducible, of rank k 353.C 
Maxwell stress 130.A 
mixed 256.5 
Nijenhuis 72.B 
numerical App. A, Table 4.II 
projective curvature App. A, Table 4.II 
Ricci 364.D, App. A, Table 4.II 
second fundamental 417.F 
skew-symmetric 256.N 
strain 271.G 
stress 150.B 271.G 
symmetric 256.N 
torsion App. A, Table 4.II 
torsion (of an affine connection) 80.J 417.B 
torsion (of an almost contact structure) 110.E 
torsion (of a Fréchet manifold) 286.L 
torsion (of a Riemannian connection) 80.L 
of type (p,q) 256J 
Weyl’s conformal curvature 80.P 
tensor algebra 
(on a linear space) 256.K 
contravariant 256.K 
tensor bundle (of a differentiable manifold) 147.F 
tensor calculus 417, App. A, Table 4.II 
tensor field 105.0 
of almost complex structure (induced by a 
complex structure) 72.B 
alternating 105.0 
of class C' 105.0 
contravariant, of order r 105.0 
covariant, of orders 105.0 
covariant derivative of (in the direction of a 
tangent vector) 80.I 
left invariant (on a Lie group) 249.A 
parallel 364.B 
random 395] 
right invariant (on a Lie group) 249.A 
symmetric 105.0 
of type (r,s) 105.0 
of type (r,s) with value in E 417.E 
tensorial form 80.G 
tensorial p-form | 417.C 
tensor product 
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(of A-homomorphisms) 277.J 
(of algebras) 29.A 
(of A-modules) 277J 
(of chain complexes) 201.J 
(of cochain complexes) 201.J 
(of distributions) 125.K 
(of Hilbert spaces) 308.C 
(of linear mappings) 256.1 
(of linear representations) 362.C 
(of linear spaces) 256.1 
(of locally convex spaces) 424.R 
(of sheaves) 383.1 
(of vector bundles) 147.F 
(of von Neumann algebras) 308.C 
continuous 377.D 
€ 424R 
projective C*- 36.H 
spatial 36.H 
tensor representation (of a general linear group) 
256.M 
tensor space 
of degree k 256J 
of type (p,q) 256J 
term 
(ofa language) 276.A 
(of a polynomial) 337.B 
(in predicate logic) 411.H 
(of asequence) 165.D 
(of a series) 379.A 
base (of a spectral sequence) 200.J 
closed (of a language) 276.A 
constant (of a formal power series) 370.A 
constant (of a polynomial) 337.B 
error 403.D 
fiber (of a spectral sequence) 200.J 
initial (of an infinite continued fraction) 83.A 
nth (of sequence) 165.D 
penalty 440.B 
subtraction 111.B 
undefined 35.B 
terminal decision rule 398.F 
terminal point 
(of a curvilinear integral) 94.D 
(in a Markov process) 261.B 
(of a path) 170 
(ofa vector) 442.A 
terminal time 261.B 
terminal vertex 186.B 
termwise differentiable (infinite series with function 
terms) 379.H 
termwise differentiation, theorem of (on distribu- 
tions) 125.G 
termwise integrable (series) 216.B 
ternary set 79.D 
Terry model, Bradley- 346.C 
Terry normal score test, Fisher-Yates- 371.C 
tertiary obstruction 305.D 
tertium non datur 156.C 
tesseral harmonics 393.D 
test 400.A 
Abel 379.D 
almost invariant 400.E 
Cauchy condensation 379.B 
Cauchy integral 379.B 
chi-square 400.G 
chi-square, of goodness of fit 400.K 
comparison 379.B 
consistent 400.K 
Dini (on the convergence of Fourier series) 
159.B 
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Dini-Lipschitz (on the convergence of Fourier 
series) 159.B 

Dirichlet (on Abel's partial summation) 379.D 

Dirichlet (on the convergence of Fourier series) 
159.B 

of du Bois-Reymond and Dedekind 379.D 

duo-trio 346.D 

F- 400.G 

Fisher-Yates-Terry normal score 371.C 

goodness-of-fit 397.Q 401.E 

invariant 400.E 

Jordan (on the convergence of Fourier series) 
159.B 

Kolmogorov 45.F 

Kolmogorov-Smirnov 371.F 

Kruskal-Wallis 371.D 

Lebesgue (on the convergence of Fourier series) 
159.B 

Leibniz (for convergence) 379.C 

level a 400.A 

likelihood ratio 400.1 

Mann-Whitney U- 371.C 

minimax level a 400.F 

most powerful 400.A 

most stringent level o 400.F 

nonparametric 371.A 

nonrandomized 400.A 

outlier 397.Q 

pair 346.D 

randomized 400.A 

sensory 346.B 

sequential 400.L 

sequential probability ratio 400.L 

sign 371.B 

signed rank 371.B 

similar 400.D 

Student 400.G 

t- 400.G 

triangle 346.D 

UMP unbiased level « 400.C 

UMP in variant level a 400.E 

unbiased level a 400.C 

uniformly consistent 400.K 

uniformly most powerful (UMP) 400.A 

uniformly most powerful invariant level « 
400.E 

uniformly most powerful unbiased level « 
400.C 

van der Waerden, 371.C 

Welch 400.G 

Wiener (for Brownian motion) 45.D 

Wiener (for Dirichlet problem) 338.G 

Wiener (for random walk) 260.E 

Wilcoxon 371.C 

Wilcoxon signed rank 371.B 


test channel 213.E 

test function 400.A 
test function space 125.8 
testing 


hypothesis 401.C 
Statistical hypothesis 400 


Subject Index 
Theory 


TE waves 130.B 
Theodorsen function 39.E 
theorem(s) — also specific theorems 
of angular momentum  271.E 
Brouwer's, on the invariance of domain 117.D 
of coding 273.D-F 
on complete form 356.H 
of completeness (in geometry) 155.B 
cup product reduction 200.M 
fundamental —- fundamental theorem(s) 
of identity 21.C 
invariance, of analytic relations 198.K 
on invariance of dimension of Euclidean spaces 
117.D 
kernel 125.L 424.S 
of linear ordering (in geometry) 155.B 
local limit 250.B 
of momentum 271.E 
product, for dimension 117.C 
of quasiconformal reflection 352.C 
structure, for von Neumann algebras of type III 
308.I 
of Tauberian type 339.B 
of termwise differentiation (of distributions) 
125.G 
translation (in class field theory) 59.C 
translation representation 375.H 
transversality 105.L 
triangle comparison 178.A 
Tucker's, on complementary slackness 255.B 
unicursal graph (Euler's) 186.F 
Weierstrass's, of double series 379.H 
Theorem A 21.L 72.E,F 
Theorem B 21.L 72.E,F 
theoretical formula 19.F 
theory 
Ahlfors's, of covering surfaces 272.J 367.B 
of buildings 343.1 
of calculus of variations, classical 46.C 
Cantor's, of real numbers 294.E 
class field 59 
classification, of Riemann surfaces 367.E 
combinatorial 66.A 
complete cohomology 200.N 
constructive field 150.F 
Dedekind’s, of real numbers 294.E 
de Rham homotopy 114.L 
dimension 117 
of elasticity 271.G 
of electromagnetic waves 130.B 
oferrors 138.A 
Euclidean field 150.F 
Euclidean Markov field 150.F 
exact sampling 401.F 
finite-displacement (of elasticity) 271.G 
of functions 198.Q 
of functions of a complex variable 198.Q 
Galois 172 
Galois, of differential fields 113 
game 173 
of gases, kinetic 402.B 


test statistics, Kolmogorov-Smirnov 374.E graph 186 
test sufficient (c-field) 396.J Haag-Ruelle scattering 150.D 
tetracyclic coordinates 90.B hidden variables 351.L 
tetragamma function 174.B hydromagnetic dynamo 259 
tetragonal (system) 92.E information 213 
tetrahedral group 151.G Kaluza’s 5-dimensional 434.C 
tetrahedron  7.D 357.B large sample 401.E 

polar 350.C lattice gauge 150.G 


self-polar 350.C Littlewood-Paley 168.B 


Subject Index 
Thermal contact 


local class field |59.G 
Lyusternik-Shnirel’man 286.Q 
Minkowski reduction (on fundamental regions) 
122.E 
Morse 279 
Morse, fundamental theorems of 279.D 
Nevanlinna (of meromorphic functions) 124.B 
272.B 
nonsymmetric unified field 434.C 
number, analytic 296.B 
number, elementary 297 
number, geometric 296.B 
of perturbations, general 420.E 
of perturbations, special 420.E 
Peter-Weyl (on compact groups) 69.B 
Peter- Weyl (on compact Lie groups) 249.U 
Picard-Vessiot 113 
of plasticity 271.G 
prediction 395.D 
prediction, linear 395.D 
of probability 342.A 
proof 156.D 
quantum field 150.C 
ramified type 411.K 
realization 86.D 
of relativity, general 359.A 
of relativity, special 359.A 
risk 214.C 
risk, classical 214.C 
risk, collective 214.C 
risk, individual 214.C 
Serre €- 202.N 
set 381.F (— also set theory) 
of singularities 418 
slender body 205.B 
small-displacement, of elasticity 271.G 
S-matrix 386.C 
supermultiplet, Wigners 351.J 
syzygy 200.K 
thin wing 205.B 
Tomita-Takesaki 308.H 
type 411K 
unified field 434.A 
unitary field 434.C 
thermal contact 419.A 
thermal expansion, coefficient of 419.A 
thermal noise 402.K 
thermodynamical quantity 419.A 
extensive 419.A 
intensive 419.A 
thermodynamic limit 402.G 
thermodynamics 419 
first law of 419.A 
second law of 419.A 
statistical 402.A 
third law of 419.A 
Oth law of 419.A 
theta formula (on ideles) 6.F 
theta-Fuchsian series of Poincaré 32.B 
theta function 134.] 
(on a complex torus) 3.1 
elliptic 134.1, App. A, Table 16.II 
graded ring of 3.N 
Jacobian 134.C 
nondegenerate 3. 
Riemann 3.L 
theta series 348.L 
thick (chamber complex) 13.R 
thickness (of an oval) 89.C 
thin (chamber complex) 13.R 
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thin set 
(in Markov processes) 261.D 
(in potential theory) 338.G 
analytically (in an analytic space) 23.D 
internally 338.G 
thin wing theory 205.B 
third boundary value problem 193.F 323.F 
third classification theorem (in the theory of obstruc- 
tons) 305.C 
third extension theorem (in the theory of obstruc- 
tions) 305.C 
third fundamental form App. A, Table Al 
third homotopy theorem (in the theory of obstruc- 


tions) 305.C 

third isomorphism theorem (on topological groups) 
423.J 

third kind 


Abelian differential of 11.C 

Abelian integral of 11.C 
third law of thermodynamics 419.A 
third-order predicate logic 411.K 
third quartiles 396.C 
third separation axiom 425.Q 
Thom algebra 114.H 
Thom complex 114.G 

associated with (G,n) 114.G 
Thom first isotropy theorem 418.G 
Thom fundamental theorem 114.H 
Thom-Gysin isomorphism 114.G 

(on a fiber space) 148.E 
Thompson inequality, Golden- 212.B 
Thompson theorem, Feit- (on finite groups) 151.D 
Thomspace 114.G 
Thom spectrum, stable homotopy group of 114.G 

202.T 

Thorin theorem, Riesz- 224.A 
thorn (of a convergence domain) 21.B 
three big problems 187 
three-body problem 420.A 

restricted 420.F 
three-circle theorem, Hadamard 43.E 
3j-symbol 353.B 
three laws of motion, Newton's 271.A 
three-line theorem, Doetsch 43.E 
three-stage least squares method 128.C 
three-valued logic 411.L 
three principles, Fishers 102.A 
three-series theorem 342.D 
threshold Jacobi method 298.B 
Thue problem  31.B 
Thue theorem  118.D 
Thullen theorem, Cartan- 21.H 
Thurstone-Mosteller model 346.C 
tieset 186.G 
tieset matrix, fundamental 186.G 
Tietze extension theorem 425.Q 
Tietze first axiom 425.Q 
Tietze second axiom 425.Q 
tight family (of probability measures) 341.F 
tight immersion 365.0 
tightness 399.M 
Tikhonov embedding theorem 425.T 
Tikhonov fixed-point theorem 153.D 
Tikhonov product theorem 425.S 
Tikhonov separation axiom 425.Q 
Tikhonov space 425.Q 
Tikhonov theorem 425.Q 

Uryson- (on metrizability) 273.K 
Tikhonov-Uryson theorem 425.Q 
time 
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exit 261.B 
explosion 406.D 
first splitting 44.E 
hitting 260.B 261.B 407.B 
killing 260.A 
life 260.A 261.B 
local 45.G 
Markov 261.B 407.B 
NP- 71E 
polynomial 71.E 
proper 258.A 
real- 19.E 
recurrence 260.C 
sojourn, density 45.G 
stopping 261.B 407.B 
terminal 261.B 
waiting 260.H 
waiting, distribution 307.C 
time change 
(of a Markov process) 261.B 
(of a semimartingale) 406.B 
(of a submartingale) 262.C 
time complexity 71.A 
time-dependent Schrödinger equation 351.D 
time-homogeneous Markovian type  406.D 
time-independent Markovian type 406.D 
time-independent Schrödinger equation 351.D 
time-invariant, linear (dynamical systems) Sep 
time-invariant network 282.C 
timelike 
(curve) 325.A 
(vector of a Minkowski space) 258.A 359.B 
time-one mapping (map) 126.C 
time optimal control problem  86.F 
time ordered function 150.D 
time parameter (of a stochastic process) 407.A 
time parameter space 260.A 
time reversal 258.A 359.B 
time series 397.A 421.A 
time series analysis 421 
time series data 397.N 
time-varying system, linear 86.B 
Tissot-Pochhammer differential equation 206.C 
Titchmarsh-Kodaira theory, Weyl- Stone- 112.0 
Titchmarsh theorem 306.B 
Brun- 123.D 
Tits simple group 151.1 
Tits system 13.R 151.J 343.1 
TM waves 130.B 
Toda bracket 202.R 
Toda lattice 287.A 387.A 
Todd characteristic 366.B 
Todd class (of a complex vector bundle) 237.F 
Toeplitz operator 251.0 
Toeplitz theorem 379.L 
tolerance interval 399.R 
tolerance limits 399.R 
tolerance percent defective, lot 404.C 
tolerance region 399.R 
Tomita-Takesaki theory 308.H 
Tonelli 
absolutely continuous in the sense of 246.C 
bounded variation in the sense of 246.C 
topological Abelian group(s) 422 
dual 422.C 
elementary 422.E 
topological conjugacy 126.B 
topological entropy 126.K 136.H 
topological equivalence 126.B 
topological field 423.P 


Subject Index 
Topology 


topological generator (of a compact Abelian group) 
136.D 
topological group(s) 423 
completable 423.H 
complete 423.H 
Hausdorff 423.B 
homomorphic 423.J 
isomorphic 423.A 
locally isomorphic 423.0 
metrizable 423.1 
separated 423.B 
T,- 423B 
topological groupoid 154.C 
topological index (of an elliptic complex) 237.H 
topological invariance (of homology groups) 201.A 
topological invariant 425.G 
topological linear spaces 424 
topologically complete space 436.1 
topologically conjugate 126.B 
topologically equivalent 126.B,H 
topological manifold 105.B 
with boundary 105.B 
without boundary . 105.B 
topological mapping 425.G 
topological n-cell 140 
topological pair 201.L 
topological polyhedron 65.A 
topological pressure 136.H 
topological property 425.G 
topological ring 423.P 
topological o-algebra 270.C 
topological solid sphere 140 
topological space(s) 425 
category of 52.B 
category of pointed 202.B 
complexlinear 424.A 
discrete 425.C 
generalized 425.D 
homotopy category of 52.B 
inductive system of 210.C 
linear 424.A 
metrizable 273.K 
product 425.K 
projective system of 210.C 
quotient 425.L 
real linear 424.A 
underlying (of a complex manifold) 72.A 
underlying (of a differentiable manifold) 105.D 
underlying (of a topological group) 423.A 
uniform 436.C 
uniformizable 436.H 
topological sphere 140 
topological structure 425.A,B 
topological sum 425.M 
topological transformation group 431.A 
topological vector space 424.A 
topology 425.B 426 
a-adic (of an R-module) 284.B 
algebraic 426 
base fora 425.F 
of biequicontinuous convergence 424.R 
box 425.K 
coarser 425.H 
combinatorial 426 
compact-open 279.C 435.D 
compact-open C^ 279.C 
differential 114 
discrete 425.C 
étale 16.AA 
fine (on a class of measures) 261.D 338.E 


Subject Index 
Topology € 


finer 425.H 
Gelfand 36.E 
general 426 
Grothendieck 16.AA 
hereditarily weak 425.M 
hull-kernel 36.D 
I-adic (ofa ring) 16.X 
indiscrete 425.C 
induced 425.1 
induced by a mapping 425. 
inner (of a Lie subgroup) 249.E 
Jacobson 36.D 
Krull (for an infinite Galois group) 172.I 
larger 425.H 
leaf 154.D 
of Lie groups and homogeneous spaces 427 
linear 422.L 
Mackey 424.N 
metric 425.C 
order 425.C 
PL 65.A 
product 425.K 
projective 424.R 
quotient 425.L 
relative 425J 
S- (on a linear space) 424.K 
Sazonov 341.J 
set-theoretic 426 
o-weak  308.B 
smaller 425.H 
strong (on a class of measures) 338.E 
strong (on a direct product space) 425.K 
strong (on a normed space) 37.E 
strong (on a topologicallinear space) 424.K 
stronger 425.H 
strong operator .251.C 
subbasefora 425.F 
of surfaces, fundamental theorem of 410.B 
trivial 425.C 
uniform | 436.C 
of uniform convergence 424.K 
of the uniformity  436.C 
uniformizable | 436.H 
uniform operator 251.C 
vague (on a class of measures) 338.E 
weak (in a cell complex) 70.D 
weak (on a class of measures) 338.E 
weak (on a direct product space) 425.K 
weak (on a direct sum) 425.M 
weak (on a locally convex space) 424.H 
weak (on a normed linear space) 37.E 
weak, relative to the pairing (E, F» 424.H 
weak C^ 279.C 
weaker 425.H 
weak operator 251.C 
weak* (on a locally convex space) 424.H 
weak* (on a normed space) 37.E 
Zariski (of a spectrum) 16.D 
Zariski (of a variety) 16.A 
topology e (on the tensor product of locally convex 
spaces) 424.R 
topology z (on the tensor product of locally convex 
spaces) 424.R 
Tor 200.D 
exact sequence of 200.D 
Tor®(A, B) 200.K 
TorZ(M,N) 200.D 
Torelli theorem 9.EJ 11.C 
Torgroups 200.D 
toric variety 16.Z 
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toroidal coordinates App. A, Table 3.V 

toroidal embedding 16.Z 

toroidal subgroup, maximal (of a compact Lie 
group) 248.X 


torsion 
(of a curve of class C^) 111.D 
affine 110.C 


analytic 391.M 
conformal 110.D 
radius of (of aspace curve) 111.F 
Whitehead 65.C 
torsion A-module 277.D 
torsion Abelian group 2.A 
bounded 2.F 
torsion coefficients (of a complex) 201.B 
torsion element (of an A-module) 277.D 
torsion form 80.H 
torsion-free A-module 277.D 
torsion-free Abelian group 2.A 
torsion group 2.A 
(ofa complex) 201.B 
torsion product 
(ina category 200.K 
(of 4-modules) 200.D 
torsion subgroup 
(of an Abelian group) 2.C 
maximal 2.A 
torsion tensor App. A, Table 4.II 
(of an affine connection) 80.LL 417.B 
(of an almost contact structure) 110.E 
(of a Fréchet manifold) 286.L 
torus 
(algebraic group) 13.D 
(compact group) 422.E 
(surface) 111.1 410.B 
algebraic 13.D 
Clifford 275.F 
complex 3.H 
generalized Clifford 275.F 
invariant 126.L 
K-splht 13.D 
K-trivial 13.D 
maximal (of a compact Lie group) 248.X 
maximal K-split 13.Q 
n- 422E 
torus embedding 16.Z 
torus function App. A, Table 18.III 
torus group 422.E 
total (set of functions) 317.A 
total boundary operator 200.E 
total Chern class | 56.C 
total cross section 386.B 
total curvature 
(of an immersion) 365.0 
(of a space curve) 111.F 
(of a surface) 111.H, App. A, Table 4.1 
Gaussian 111.H 
total degree 200.J 
total differential 
(ofa complex} 200.H 
(of a function) 106.G 
(on a Riemann surface) 367.H 
total differential equation(s) 428, App. A, Table 15.I 
system of 428.A 
total elastic cross section 386.B 
total energy 271.C 
total excess 178.H 
total Gaussian curvature (of a surface) 111.H 
total isotropy, index of (with respect to a quadratic 
form) 348.E 
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totally bounded 
(metric space) 273.B 
(subset of a metric space) 273.B 
(subset of a uniform space) 436.H 
(uniform space) 436.H 
locally 436.H 
totally definite quaternion algebra 27.D 
totally degenerate (group) 234.B 
totally differentiable 
(complex function) 21.C 
(real function) 106.G 
totally disconnected (topological space) 79.D 
totally geodesic submanifold 365.D 
totally imaginary field 14.F 
totally isotropic (subspace) 60.0 348.E 
totally ordered additive group 439.B 
totally ordered group 243.G 
totally ordered set 311.A 
totally positive (element) 14.G 
totally real field 14.F 
totally real immersion 365.M 
totally real submanifold 365.M 
totally regular transformation (of a sequence) 
379.L 
totally singular (subspace) (with respect to a 
quadratic form) 348.E 
totally umbilical submanifold 365.D 
totally unimodular 186.G 
total matrix algebra 269.B 
total mean curvature 365.0 
total monodromy group 418.F 
total ordering 311.A 
total Pontryagin class 56.D 
total quotients, ring of 67.G 
total space 
(of a fiber bundle) 147.B 
(of a fiber space) 148.B 
total Stiefel-Whitney class 56.B 
total transform (of a subvariety) 16.1 
total variation 
(ofafunction) 166.B 
(of a mapping from a plane into a plane) 
246.H 
(of a set function) 380.B 
(ofa vector measure) 443.G 
tower, class field (problem) 59.F 
trace 
(of an algebraic element) 149.J 
(of an element of a general Cayley algebra) 
54 
(of a linear system of an algebraic surface) 
15.C 
(ofa matrix) 269.F 
(of a nuclear operator) 68.L 
(in a von Neumann algebra) 308.D 
reduced (of an algebra) 362.E 
trace class 68.I 
traced, f- 126.J 
trace form, ¢- 60.0 
trace formula (on unitary representations) 437.DD 
trace norm 68I 
trace operator 168.B 
tracing property, pseudo-orbit 126.J 
tractrix 93.H 
traffic intensity 260.H 
trajectory 126.B,C 
negative half- 126.D 
orthogonal 193J 
positive half- 126.D 
transcendence basis (of a ficld extension) 149.K 


Subject Index 
Transformation(s) 


transcendence basis, separating (of field extension) 
149.K 
transcendence degree (of a field extension) 149.K 
transcendency, degree of (of a field extension) 
149.K 
transcendental curve 93.H 
transcendental element (of a field) 149.E 
transcendental entire functions 429 
transcendental extension 149.E 
purely 149.K 
transcendental function 
higher 389.A 
of Painlevé 288.C 
transcendental meromorphic function 272.A 
transcendental numbers 430 
transcendental singularity (of an analytic function 
in the wider sense), direct, indirect 198 P 
transfer (in group theory) 190.0 
transfer function 86.D 
transfer function matrix 86.B 
transferrer of constant lengths 155.G 
transfinite cardinal number 49.A 
transfinite diameter 48.D 
transfinite induction 
(in a well-ordered set) 311.C 
definition by 311.C 
transfinite initial ordinal number 49.E 
transfinite logical choice function 411.J 
transfinite ordinal number 312.B 
transform 
(of a sequence by a linear transformation) 
379.L 
Cauchy (of a measure) 164.J 
Cayley (of a closed symmetric operator) 251. 
discrete Fourier 142.D 
fast Fourier 142.D 
Fourier 160, App. A, Table 110 
Fourier (of a distribution) 125.0 
Fourier (in topological Abelian groups) 36.L 
192.I 
Fourier, generalized 220.B 
Fourier, inverse (of a distribution) 125.0 
Fourier, spherical 437.Z 
Fourier-Bessel 39.D 
Fourier cosine 160.C, App. A, Table 11.II 
Fourier-Laplace 192.F 
Fourier sine | 160.C, App. A, Table 11.II 
Fourier-Stieltjes  192.B,O 
Gel'fand 36.E 
Hankel 220.B 
Hilbert 160.D 220.E 
integral 220 
inverse (of an integral transform) 220.A 
Klein 150.D 
kth 160.F 
Laplace 240, App. A, Table 12.1 
Laplace-Stieltjes 240.A 
Legendre 419.C 
Mellin 220.C 
proper (of a subvariety) 16.1 
Radon 218.F 
Radon, conjugate 218.F 
real monoidal 274.E 
Riesz 251.0 
Stieltjes 220.D 
total (of a subvariety) 16.1 
Watson 160.C 220.B 
transformable, polynomially 71.E 
transformation(s) 381.C 
(on a measure space) 136.D 


Subject Index 
Transformation formula 


affine 7.E 

affine (of a manifold with an affine connection) 
80.J 

affine (of a Riemannian manifold) 364.F 

affine, group of 7.E 

affine, proper 7.E 

affine, regular 7.E 

Ampere 82.A 

angular 374.D 

arc sine 374.D 

bimeasurable (on a measure space) 136.B 

birational 16.1 

BRS 150.G 

canonical 82.B 271.F 

canonical, group of 271.F 

Cayley (of a matrix) 269J 

chain (between complexes) 200.H 

conformal 80.P 364.F 

congruent (in Euclidean geometry) 139.B 

congruent, group of 285.C 

contact 82, App. A, Table 15.IV 

coordinate (of a fiber bundle) 147.B 

coordinate (of a locally free ©,-Module) 16.E 

covering 91.A 367.B 

Cremona 16.1 

by drift 261.F 

of drift 406.B 

elliptic 74.F 

entire linear 74.E 

to an equilibrium system 82.D 

equilong 76.B 

ergodic (on a measure space) 136.B 

Euler (of infinite series) 379.1 

factor (of a measure preserving transformation) 
136.D 

Fisher z 374.D 

Galilei 359.C 

gauge (in electromagnetism) 130.A 

gauge (in a lattice spin system) 402.G 

gauge (of a principal fiber bundle) 80.Q 

gauge (in unified field theory) 343.B 

gauge, of the first kind 150.B 

Gauss App. A, Table 16.IIT 

Givens 302.E 

Householder 302.E 

hyperbolic 74.F 

infinitesimal (of a Lie transformation group) 
431.G ` 

infinitesimal (of a one-parameter group of 
transformations) 105.N 

inner (in the sense of Stoilow) 367.B 

Jacobi imaginary | 134.I, App. A, Table 16.III 

Kelvin 193.B 

Laguerre 76.B 

Landen 134.B, App. A, Table 16.IIT 

Legendre 82.A, App. A, Table 15.IV 

Lie (in circle geomtry) 76.C 

Lieline-sphere 76.C 

linear (= linear fractional) 74.E 

linear (of a linear space) 251.A 256.B 

linear (of a sequence) 379.L 

linear fractional 74.E 

local, local Lie group of 431.G 

local, local one-parameter group of 105.N 

of local coordinates 90.D 

locally quadratic (of an algebraic surface) 
15.G 

locally quadratic (of an algebraic variety) 16.K 

locally quadratic (of a complex manifold) 72.H 

Lorentz 359.B 
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loxodromic 74.F 

measurable (on a measure space) 136.B 

measure-preserving 136.B 

Möbius 74.E 76.A 

monoidal (of an analytic space) 23.D 

monoidal (of a complex manifold) 72.H 

monoidal (by an ideal sheaf) 16.K 

monoidal, with center W 16.K 

by a multiplicative functional (in a Markov 
process) 261.F 

natural 52J 

nonsingular (of a linear space) 256.B 

nonsingular (on a measure space) 136.B 

normal (of a sequence) 379.L 

one-parameter group of 105.N 

one-parameter group of class C” 126.B 

orthogonal 139.B 348.B 

orthogonal (over a noncommutative field) 
60.0 

orthogonal (with respect to a quadratic form) 
60.K 

orthogonal, around the subspace A* 139.B 

parabolic 74.F 

of the parameter 111.D 

parity 359.B 

particular (of b) 248.R 

Picard-Lefschetz 16.U 

to principal axes 390.B 

projective 343.D 

projective (of a Riemannian manifold) 367.F 

projective, group of 343.D 

pseudoconformal 344.A 

pseudogroup of (on a topological space) 90.D 

quadratic 16.1,K 

quantized contact 274.F 

regular (of a linear space) 256.B 

regular (of a sequence) 379.L 

regular projective 343.D 

Schwarz-Christoffel 77.D 

semilinear 256.P 

semiregular (of a sequence) 379.L 

shift 136.D 

singular projective 343.D 

singular projective, of the hth species 343.D 

superharmonic 261.F 

symplectic 60.L 

symplectic (over a noncummutative field) 
60.0 

totally regular (of a sequence) 379.L 

triangular (linear) 379.L 

unitary 348.F 

unitary (relative to an e-Hermitian form) 60.0 

weakly equilvalent 136.F 


transformation rormula 


(for the generating function of the number of 
partitions) 328 

(of a theta function) 3I 

(for theta series) 348.L 

Schwarz-Christoffel 77.D 


transformation group(s) 431, App. A, Table 14.III 


(ofa set) 431.A 

covering 91.A 

differentiable 431.C 

discontinuous 122.A 

free discontinuous 122.A 

Lie 431.C 

Mobius 76.A 

orthogonal 60.I 

orthogonal (over a field with respect to a 
quadratic form) 60.K 
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properly discontinuous 122.A 
symplectic 60.L 
topological 431.A 
unitary 60.F 
transformation parameter 396.1 
transformation problem (in a finitely presented 
group) 161.B 
transformation space (of an algebraic group) 13.G 
transgression 
(homomorphism of cohomology groups) 
200.M 
(in the spectral sequence of a fiber space) 
148.E 
transgressive (element in the spectral sequence of 
a fiber space) 148.E 
transient 
(Lévy process) 5.G 
(Markov chain) 260.B 
(Markov process) 261.B 
transient problem 322.D 


transition 
order-disorder 402.F 
phase 340.B 


transition function 
(of a fiber bundle) 147.B 
(of a Markov chain) 260.A 
(of a Markov process) 261.B 
Feller 261.B 
transition matrix 126.J 260.A 
transition point 254.F 
transition probability 
(of a diffusion process) 115.B 
(of a Markov chain) 260.A 
(of a Markov process) 261.A 
(in quantum mechanics) 351.B 
standard 260.F 
transitive 
(dynamical system) 126.LJ 
(operation of a group) 362.B 
(permutation representation) 362.B 
(relation) 358.A 
fully (subgroup of an orthogonal group) 92.C 
k- (permutation group) 151.H 
k-ply (G-set) 362.B 
k-ply (permutation group) 151.H 
multiply (permutation group) 151.H 
simply (G-set) 362.B 
transitive extension (of a permutation group) 
151.H 
transitive law 
(in an equivalence relation) 135.A 
(on ordering) 311.A 
transitively (act on G-space) 431.A 
transitive permutation group 151.H 
transitivity, system of (of a G-set) 362.B 
translation(s) 
(in an affine space) 7.E 
group of (of an affine space)  7.E 
left 249.A 362.B 
parallel 80.C 364.B 
right 249.A 362.B 
translational flow 126.L 136.G 
frequencies of 126.L 136.G 
translation group (of a Lorentz group) 258.A 
translation number 18.B,D 
translation operator 306.C 
translation representation theorem 375.H 
translation theorem (in class field theory) 59.C 
transmission coefficient 387.D 
transmission rate 213.A 


Subject Index 
Triangle 


transonic flow 205.B 
transonic similarity, von Karman 205.D 
transportation problem 255.C 
transport coefficient 402.K 
transport equations 325.L 
transpose 
(of a linear mapping) 256.G 
(of a rational homomorphism) 3.E 
transposed integral equation 217.F 
transposed mapping 
(of a diffusion kernel) 338.N 
(of a linear mapping) 256.G 
transposed matrix 269.B 
transposed operator 112.E 189.C 322.E 
transposed representation 362.E 
transposition (in a symmetric group) 151.G 
transvection 60.0 
transversal (matroid) 66.H 
transversal field 136.G 
transversal flow 136.G 
transversal homoclinic point 126.J 
transversality, condition of (in calculus of variations) 
46.B 
transversality condition 108.B 
strong 126J 
transversality theorem 105.L 
transverse 
(foliations) 154.H 
(to a submanifold of a differentiable manifold) 
105.L 
toafoliation 154.B 
transverse axis (of a hyperbola) 78.C 
transverse electric waves 130.B 
transverse electromagnetic waves 130.B 
transverse invariant measure 154.H 
transversely (intersect) 105.L 
transversely orientable 154.B 
transverse magnetic waves 130.B 
transverse structure 154.H 
transverse wave 446 
trap 
(of a diffusion process) 115.B 
(of a Markov process) 261.B 
trapezoidal rule 
(of numerical integration) 299.A 
(of numerical solution of ordinary differential 
equations) 303.E 
treatment 102.B 
connected 102.B 
treatment combinations, number of 102.L 
treatment contrast 102.C 
treatment effect 102.B 
tree 93.C 186.G 
co- 186.G 
derivation 31.E 
spanning 186.G 
tree code 213.E 
tree representation 96.D 
tree structure 96.D 
trefoil knot 235.C 
trellis code 213.E 
trend 397.N 
triad 202.M 
homotopy exact sequence of 202.M 
homotopy group of 202.M 
trial path dependent, d- (response probability) 
346.G 
triangle 7.D 155.F 178.H 
geodesic 178.A 
Pascal 330 


Subject Index 
Triangle comparison theorem 


plane App. A, Table 2.II 
polar 78.J 
Reuleaux 89.E 111.E 
self-polar 78.J 
solvinga 432.A 
spherical 432.B, App. A, Table 2.III 
triangle comparison theorem 178.A 
triangle inequality 273.A 
triangle test 346.E 
triangulable 65.A 
triangular (linear transformation) 379.L 
triangular element 304.C 
triangular factorization 302.B 
triangular matrix 269.B 
lower 269.B 
upper 269.B 
triangular number 4.D 
triangulated manifold 65.B 
triangulation 65.A 70.C 
CŒ- 114C 
combinatorial 65.C 
combinatorial, problem 65.C 
compatible with 114.C 
finite 70.C 
trick, Alexander’s 65.D 
triclinic (system) 92.E 
Tricomi differential equation 326.C 
Tricomi problem 326.C 
tridiagonal matrix 298.D 
trigamma function 174.B 
trigonal (system) 92.E 
trigonometric function 131.E 432.A, App. A, 
Table 2 
inverse 131.E 
trigonometric integral 160.A 
trigonometric interpolation polynomial 336.E 
trigonometric polynomial, generalized 18.B 
trigonometric series 159.A 
generalized 18.B 
trigonometric sum 4.C 
trigonometric system 159.A 
trigonometry 432, App. A, Table 2 
plane 432.A 
spherical 432.B 
trilinear coordinates 90.C 
trimmed mean, a- 371.H 
triple 200.Q,L 
homotopy exact sequence of 202.L 
triple product, scalar, vector 442.C, App. A, Table 
3.1 
triplet, Gel'fand 424.T 
tripolar coordinates 90.C 
trisection of an angle 179.A 
trivalent map 157.B 
trivial 
(extension) 390.J 
(knot) 65.D 235.A 
K- (torus) 13.D 
trivial bundle 147.E 
trivial fiber space, locally 148.B 
trivialization (of a block bundle) 147.Q 
trivially (act on a G-space) 431.A 
trivial sheaf 383.D 
trivial solution (of a system of linear homogeneous 
equations) 269.M 
trivial topology 425.C 
trivial valuation 439.C,F 
trochoid 93.H 
Trotter product formula 351.F 
true 411.E 
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true anomaly 309.B 
true value of a parameter 398.A 
truncated Wightman function 150.D 
truncation error 138.B 303.B 
local 303.E 
truth definition 185.D 
truth function 411.E 
truth value (of a formula) 411.E 
Tsen theorem 27.E 118.F 
tube 126.E 
regular 193.K 
vector 442.D 
tuboid 125.V 
tubular neighborhood  105.L 114.B 364.C 
open 105.L 114.B 
tubular neighborhood system, controlled 418.G 
Tucker theorem on complementary slackness 
255.B 
Tucker theorem, Kuhn- 292.B 
tuple, n- 256.A 381.B 
turbulence 433 
homogeneous 433.C 
isotropic 433.C 
turbulent flow 205.E 433 
Turing machine 31.B 
universal 31.C 
turning point 25.B 254.F 
twinable 92.D 
twinning structure 92.D 
twin primes 123.C 
twisted type, group of 151.I 
two-bin system 227 
two-body interaction 271.C 
two-body problem 55.A 
two-dimensional KdV equation 387.F 
2-isomorphic 186.H 
two-person game, zero-sum 108.B 
two-phase simplex method 255.C 
two-point boundary value problem (of ordinary 
differential equations) 315.A 
two sheets 
hyperboloid of 350.B 
hyperboloid of revolution of  350.B 
two-sided exponential distribution App. A, Table 
22 
two-sided generator (for an automorphism of a 
measure space) 136.E 
two-sided ideal 368.F 
two-sided o-ideal 27.A 
integral 27.A 
two-sided scale 19.D 
two-sided surface 410.B 
two-stage least squares method 128.C 
two-stage sampling 373.E 
two-stage stochastic programming | 408.A 
two-terminal characteristic 281.C 
two-terminal network 281.C 
two-terminal problem 281.C 
two-valued logic 411.L 
two-way elimination of heterogeneity, design for 
102.K 
two-way layout 102.H 
type 
(of an Abelian group) 2.B,D 
(of an object) 356.F 
(of a quadratic form) 348.E 
(ofa structure) 409.B 
(of a transcendental number) 430.C 
acute 304.C 
backward 304.D 
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Bravais (of lattices) 92.B 
Bravais, of the class of (T; K) 92.B 
compact 412.D 
dimension 117.H 
of finite (graded module) 203.B 
of finite (module) 277.D 
of finite (morphism of schemes) 16.D 
of finite (@-module) 16.E 
of finite, subshift 126.5 
forward 304.D 
Fredholm, integral equation of 217.A 
Fredholm, integrodifferential equation of 
222.A 
Fuchsian (linear ordinary differential equations) 
253.A 
Fuchsian (visibility manifold) 178.F 
general 72.1 
general, surface of 72.K 
homotopy 202.F 
homotopy (ofa link) 235.D 
homotopy, invariant 202.F 
isotopy (of knots) 235.A 
isotopy (of a transformation group) 431.A 
k-, Markov branching process 44.E 
knot 235.A 
link 235.D 
locally of finite (morphism of schemes) 16.D 
mixed 304.C 326.A 
multi-, Markov branching process 44.E 
multidiagonal 304.C 
noncompact 412.D 
nonconforming 304.C 
orbit (of a G-space) 431.A 
Parreau-Widom 164.K 
positive (symmetric kernel) 338.D 
principal orbit 431.C 
spherical G-fiber homotopy 431.F 
Stoilow 207.B 
twisted, group of 151.1 
Volterra, integral equation of 217.A 
Volterra, integrodifferential equation of 222.A 
Weierstrass-, preparation theorem (for micro- 
differential operators) 274.F 
type p”, Abelian group of 2.D 
type (p. q) 
(of an operator) 224.E 
tensor of 256J 
tensor space of 256.5 
weak (of an operator) 224.E 
type (r,s), differential form of 72.C 
type S (harmonic boundary) 207.B 
type S, space of 125.T 
type number (of a solution of a system of linear 
ordinary differential equations) 314.A 
type problem (for Riemann surfaces) 367.D 
type theory 411.K 
ramified 411.K 
simple 411.K 
type (0, 1), tangent vector of 72.C 
type (1,0), tangent vector of 72.C 
type I 
(von Neumann algebra) 308.E 
C*-algebra of 308.L 
group of 437.E 
typeI group 437.E 
type Imm (irreducible symmetric bounded domain) 
412.I 
type I, (von Neumann algebra)  308.E 
type II (von Neumann algebra) 308.E 


type II, 





Subject Index 
Unbiased estimator 


(ergodic countable group) 136.F 
(von Neumann algebra) 308.E,F 
type II, (irreducible symmetric bounded domain) 
412.I 
type Il, 
(ergodic countable group) 136. F 
(von Neumann algebra) 308.E,F 
type III 
(ergodic countable group) 136.F 
(von Neumann algebra) 308.E 
structure theorem for von Neumann algebras of 
308.I 
type III, (factor) 308.I 
type III, (factor) 308.1 
type III; (factor) 308.1 
type III, (irreducible symmetric bounded domain) 
412.1 
type IV, (irreducible symmetric bounded domain) 
412.I 
type AI, AII, AII, AIV (irreducible symmetric 
Riemannian space) 412.G 
type BDI, BDII (irreducible symmetric Riemannian 
space) 412.G 
type CI, CII (irreducible symmetric Riemannian 
space) 412.G 
type DIII (irreducible symmetric Riemannian space) 
412.G 


U 


U(n)(unitary group) 60.F 
u-chain 260. 
u-curve 111.H 
u,-scale 19.D 
U-invariant (subspace) 437.C 
User 159.J 
U-statistic 274.1] 
U test, Mann-Whitney 371.C 
U-number 430.C 
U*-number 430.C 
UCL (upper control limit) 404.B 
Ugaheri maximum priciple 338.C 
Uhlenbeck Brownian motion, Ornstein- 45.1 
Ulam theorem, Borsuk- 153.B 
Ulm factor(s) 2.D 
sequence of 2.D 
ultrabornological (locally convex space) 424.W 
ultradifferentiable function 168.B 
ultradistribution 125.U,BB 
of class {M,} 125.U 
of class (M,j 125.U 
ultrafilter 87.1 
ultrainfinite point 285.C 
ultrapower 276.E 
ultraproduct(s) 276.E 
fundamental theorem of 276.E 
ultraspherical polynomials 317.D 
ultraviolet divergence 132.C 146.B 
umbilical point (of a surface) 111.H 365.D 
umbilical submanifold, totally 365.D 
Umkehr homomorphism 201.0 
UMP (uniformly most powerful) (test) 400.A 
UMP invariant level x test 400.E 
UMP unbiased level x test 400.C 
UMV unbiased estimator 399.C 
unavoidable set 157.D 
unbiased confidence region 399.Q 
uniformly most powerful 399.Q 
unbiased estimator 399.C 
asymptotically (mean) 399.K 


Subject Index 
Unbiased level « test 


best linear 403.E 
kth order asymptotically median 399.0 
mean 399.C 
median 399.C 
modal 399.C 
UMV 399.C 
uniformly minimum variance 399.C 
unbiased level x test 400.C 
uniformly most powerful 400.C 
unbiasedness 399.C 
unbounded (covering surface) 367.B 
uncertainty (in observations) 351.C 
uncertainty relation, Heisenberg 351.C 
unconditonally convergent 
(series) 379.C 
(series in a Banach space) 443.D 
undecidable proposition, formally 185.C 
undefined concept 35.B 
undefined term 35.B 
undercrossing point 235.A 
underdetermined system 
of differential operators 112.R 
of partial differential operators 320.F 
underflow 138.B 
underlying group (of a topological group) 423.A 
underlying topological space 
(of a complex manifold) 72.A 
(of a differentiable manifold) 105.D 
(of a topological group) 423.A 
undirected graph 186.B 
undotted index 258.B 
undotted spinor (of rank E 258.B 
unfolding 
(of a germ of an analytic function) 418.E 
constant 51.D 
r- 51.D 
universal 418.E 
unicity theorem, Luzin's 22.C 
unicursal (ordinary curve) 93.C 
unicursal curve 9.C 93.H 
unicursal graph theorem (Eulers) 186.F 
unified field theory 434 
nonsymmetric 434.C 
uniform 
(lattice of a Lie group) 122.K 
(sampling procedure) 373.A 
uniform algebra 164.A 
uniform boundedness theorem 37.H 
uniform continuity 
lower class with respect to 45.F 
upper class with respect to 45.F 
uniform convergence 435 
(of an infinite product) 435.A 
(of operators) 251.C 
(of a series) 435.A 
abscissa of 121.B 240.B 
on compact sets 435.C 
Weierstrass criterion for 435.A 
uniform covering system 436.D 
uniform distribution 182.H 341.D, App. A, Table 
22 
uniform family of neighborhoods system 436.D 
uniform isomorphism 436.E 
uniformity 436.B 
base for the 436.B 
discrete 436.D 
generated by a family of pseudometrics 436.F 
generated by a pseudometric 436.F 
left (of a topological group) 423.G 
product 436.E 
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pseudometric 436.F 
relative 436.E 
right (of a topological group) 423.G 
Stronger 436.E 
T,- 436C 
topology of the | 436.C 
weaker 436.E 
uniformizable (topological space) 436.H 
uniformization | 367.C 
(of a set in a product space) 22.F 
Schottky 367.C 
uniformization theorem 
general 367.G 
Kondó 22.F 
uniformized 367.C 
locally 367.C 
uniformizing parameter, local (of a Riemann surface) 
367.A 
uniformly 
(partial recursive function) 356.E 
(primitive recursive function) 356.B 
uniformly absolutely convergent (series) 435.A 
uniformly almost periodic function 18.B 
uniformly asymptotically stable 
(solution of a differential equation)  394.B 
(solution of a functional differential equation) 
163.G 
uniformly best (estimator) 399.C 
uniformly better (decision function) 398.B 
uniformly consistent test 400.K 
uniformly continuous 
(function) 84.A 
(mapping) 273.1 436.E 
onasubset 436.G 
uniformly convergent 
(sequence) 435.A 
on a family of sets 435.C 
in the wider sense 435.C 
uniformly convex (normed linear space} 37.G 
uniformly equivalent (uniform spaces) 436.E 
uniformly integrable (family of random variables) 
262.A 
uniformly locally compact (space) 425.V 
uniformly Lyapunov stable 126.F 
uniformly minimum variance unbiased estimator 
399.C 
uniformly most powerful 
(confidence region) 399.Q 
(test) 400.A 
invariant 399.Q 
invariant level a 400.E 
unbiased 399.Q 
unbiased level a 400.E 
uniformly recursive in ¥ (define a partial recursive 
function) 356.E 
uniformly smooth (normed linear space) 37.G 
uniformly stable 394.B 
uniform neighborhood system 436.D 
uniform norm  168.B 
uniform operator topology 251.C 
uniform space(s) 436 
analytically 125.8 
complete 436.G 
Hausdorff 436.C 
locally totally bounded 436.H 
metrizable 436.F 
precompact 436.H 
product 436.E 
pseudometrizable 436.F 
separated 436.C 
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T,- 436.C 
totally bounded 436.H 
uniform star convergence, relative 310.F 
uniform structure 436.B 
uniform subspace 436.E 
uniform topological space 436.B 
uniform topology 436.B 
unilateral constraints 440.A 
unilateral shift operator 390.1 
unimodal (distribution function) 341.H 
unimodular 
(germ of an analytic function) 418.E 
(locally compact group) 225.D 
totally 186.G 
unimodular group 60.B 
quaternion 412.G 
union 
(in axiomatic set theory) 33.B 
(of matroids) 66.H 
(of sets) 381.B 
axiom of 381.G 
disjoint 381.B,D 
of hypersurface elements 82.A 
of surface elements 324.B 
unipotent 
(algebraic group) 13.E 
(linear transformation) 269.L 
unipotent component 269.L 
unipotent matrix 269.F 
unipotent part 
(of an algebraic group) 13.E 
(of a nonsingular matrix) 13.E 
unipotent radical 13.I 
unique continuation theorem 323.J 
unique decomposition theorem (for a 3-manifold) 
65.E 
unique factorization domain 67.H 
unique factorization theorem (in an integral domain) 
67.H 
uniquely ergodic (homeomorphism (on a compact 
metric space) 136.H 
uniqueness 
in the sense of law of solutions 406.D 
setof 159.J 
of solution, pathwise 406.D 
uniqueness condition (for solutions of ordinary 
differential equations) 316.D 
uniqueness principle (in potential theory) 338.M 
uniqueness theorem 
(for analytic functions) 198.C 
(for class field theory) 59.B 
(for differential equations in a complex domain) 
316.G 
(for Fourier transform) 192.1 
(for harmonic functions) 193.E 
(for an initial value problem of ordinary differ- 
ential equations) 316.D 
of the analytic continuation | 198.C,I 
Holmgren 321.F 
of homology theory 201.R 
Rellich 188.D 
von Neumann 351.C 
unique strong solution 406.D 
unirational surface 15.H 
unirational variety 16.J 
uniserial algebra | 29.I 
absolutely 29.1 
generalized 29.1 
unisolvent space 142.B 
unisolvent system (of functions)  336.B 


Subject Index 
Unit circle 


unit(s) 
(of an algebraic number field) 14.D 
(for measure of length) 139.C 
(inaring) 368.B 
(of a symmetric matrix with rational coordi- 
nates) 348.J 
(of a vector lattice) 310.B 
Archimedean (of a vector lattice) 310.B 
arithmetic 75.B 
auxiliary 414.A 
base 414.4 
circular 14.L 
control 75.B 
derived 414.D 
fundamental 414.A 
fundamental (of an algebraic number field) 
14.D 
gravitational, system of 414.B 
imaginary 74.A 294.F 
international system of 414.A 
Kakutani 310.G 
matrix 269.B 
memory 75.B 
system of 414 
unital 36.A 
unitarily equivalent (self-adjoint operators) 390.G 
unitary 
(homomorphism between rings) 368.D 
(module) 277.D 
essentially 390.I 
unitary algebra 29.A 
unitary dilation 251.M 
unitary field theory 434.C 
unitary group 60.F 151.1 
(relative to an e-Hermitian form) 60.0 
infinite 202.V 
over K 60.H 
over K, projective special 60.H 
over K, special 60.H 
projective 60.F 
special 60.F 
special (relative to an -Hermitian form) 60.0 
unitary matrix 269.] 
unitary monoid 409.C 
unitary operator 251.E 390.E 
unitary representation(s) 437 
disjoint 437.C 
equivalent 437.A 
induced by a representation of a subgroup 
437.0 
integrable 437.X 
irreducible 437.A 
isomorphic 437.A 
quasi-equivalent 437.C 
similar 437.A 
square integrable 437.M 
sufficiently many irreducible 437.B 
unitary restriction (of a semisimple Lie algebra) 
248.P 
unitary ring 368.A 409.C 
unitary semigroup 409.C 
unitary symplectic group 60.L 
unitary transformation 348.F 
(relative to an e-Hermitian form) 60.0 
unitary transformation group 60.F 
unit ball 
(of a Banach space) 37.B 
(of a Euclidean space) 140 
unit cell 140 
unit circle 74.C 140 


Subject Index 
Unit cost 


unit cost 281.D 
unit cube 139.F 140 
unit disk 140 
unit distribution 341.D 
unit element 
(ofa field) 149.A 
(ofa group) 190.A 
(ofaring) 368.A 
unit function 306.B, App. A, Table 12.II 
unit group (of an algebraic number field) 14.D 
unit impulsive function App. A, Table 12.II 
unit mapping 203.F 
unit matrix 269.A 
unit n-cube 140 
unit point 
(of an affine frame) 7.C 
(of a projective frame) 343.C 
unit ray 351.B 
unit representation (of a group) 362.C 
unit sphere 140 
unit tangent sphere bundle 126.L 
unit theorem, Dirichlet 14.D 
unit vector 7.C 442.B 
unity 
(in the axioms for the real numbers) 355.A 
partition of 425.R 
partition of, of class C? 105.S 
partition of, subordinate to a covering 425.R 
primitive root of 14.L 
unity element 
(ofafield) 149.A 
(ofa ring) 368.A 
univalence superselection rule 351.K 
univalent (analytic function) 438.A 
univalent correspondence 358.B 
univalent function 438 
univariate (statistical data) 397.A 
universal 


(6-functor) 200.1 


(*-representation of a Banach »-algebra) 36.G 


(unfolding) 51.D 
universal bundle 147.G,H 

n- 147.G 
universal Chern class 56.C 
universal coefficient theorem 

(in Abelian categories) 200.H 

(for cohomology) 200.G 201.H 

for homology 200.D 201.G 
universal constants (in the theory of conformal 

mapping) 77.F 

universal covering group 91.B 423.0 
universal covering space 91.B 
universal covering surface 367.B 
universal curve 93.H 
universal domain 16.A 
universal enveloping algebra 

{of a Lie algebra) 248.3 

special (of a Jordan algebra) 231.C 
universal enveloping bialgebra 203.G 
universal Euler-Poincaré class 56.B 
universal gravitation, law of 271.B 
universally Japanese ring 16.Y 284.F 
universally measurable 270.L 
universal mapping property 52.L 
universal net (in a set) 87.H 
universal Pontryagin class 56.D 
universal proposition 411.B 
universal quantifier 411.C 
universal set 

(for the projective sets of class n) 22.E 
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(inset theory) 381.B 
universal Stiefel-Whitney class 56.B 
universal Teichmüller space 416 
universal Turing machine 31.C 
universal unfolding 418.E 
universal validity of a proposition, problem of 97 
universe 
(in nonstandard analysis) 293.B 
(ofa structure) 276.B 
unknotted 
(ball pair) 65.D 235.G 
(knot) 235.A 
(sphere pair) 65.D 235.G 
unknotting conjecture 235.G 
unknotting theorem, Zeeman 65.D 
unlabeled graph 186.B 
unmixed ideal 284.D 
unmixedness theorem 284.D 
unordered pair 381.B 
(in axiomatic set theory) 33.B 
axiom of 33.B 
unoriented cobordism class 114.H 
unoriented cobordism group 114.H 
unoriented graph 186.H 
unramified 
(covering surface) 367.B 
(prime ideal) 14.1 
(projection of a covering surface) 367.B 
analytically (semilocal ring) 284.D 
unramified covering (of a nonsingular curve) 9.1 
unramified extension 14.1 257.D 
unrenormalizable 132.C 361.B 
unsolvability 
degree of 97 
recursive, arithmetical hierarchy of degrees of 
356.H 
recursive, degree of 97 
recursive, hyperarithmetical hierarchy of de- 
grees of 356.H 
unstable 
(boundary component) 77.E 
(state) 394.A 
completely (flow) 126.E 
unstable manifold 126.G,J 
unstable solution (of His equation) 264.E 
up-ladder 206.B 
upper bound 
(of a subset in an ordered set) 311.B 
least (of an ordered set) 311.B 
least (of a subset of a vector lattice) 310.C 
upper boundedness principle (in potential theory) 
338.C 
upper central series (ofa group) 190.J 
upper class 
with respect to local continuity 45.F 
with respect to uniform continuity 45.F 
upper control limit 404.B 
upper derivative 
general (of a set function) 380.D 
ordinary (of a set function) 380.D 
upper end (of a curvilinear integral) 94.D 
upper envelope (of a family of subharmonic func- 
tions) 193.R 
upper half-space of degree n, Siegel 32.F 
upper integral, Riemann 216.A 
upper limit (of a Riemann integral) 216.A 
upper limit function | 84.C 
upper semicontinuous 
(ata point) 84.C 
(partition) 425.L 
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inaset 84.C 
upper semilattice 243.A 
upper triangular matrix 269.B 
upper variation (of a set function) 380.B 
Urysonlemma 425.Q 
Uryson space, Fréchet- 425.CC 
Uryson theorem, Tikhonov- 425.Q 
Uryson-Tikhonov theorem (on metrizability) 
273.K 
Uzawa gradient method, Arrow-Hurwicz- 292.E 


V 


v-curve 11t.H 
vacuum vector 377.A 
free 150.C 
vague topology (on a class of measures) 338.E 
valid formula 411.G 
valuation(s) 439 
additive 439.B 
Archimedean 14.F 439.C 
complete 439.D 
completion of 439.D 
discrete 439.E 
equivalent 439.B 
exponential 439.B 
generalized 439.B 
multiplicative 439.C 
non-Archimedean 14.F 439.C 
normal 439.E,H 
normalized 439.E 
p-adic 439.F 
p-adic exponential 439.F 
prolongation of 439.B 
pseudo- 439.K 
special 439.B 
over a subfield 439.B,C 
trivial 439.C,F 
valuation ideal (of a valuation) 439.B 
valuation ring | 439.B 
completion of 439.D 
discrete 439.E 
valuation vector(s) GC 
ring of GC 
value(s) 
(of an infinite continued fraction) 83.A 
(of an infinite product) 379.G 
(ofa variable) 165.C 
absolute (of a complex number) 74.B 
absolute (of an element of an ordered field) 
149.N 
absolute (of an element of a vector lattice) 
310.B 
absolute (of a real number) 355.A 
absolute (of a vector) 442.B 
asymptotic (of a meromorphic function) 62.A 
272.H 
boundary (of a conformal mapping) 77.B 
boundary (hyperfunction) 125.V 
boundary (relative to a differential operator) 
112.E 


boundary, problem  — boundary value problem 


characteristic (of a linear operator) 390.A 
cluster 62.A 

cluster, theorem 43.G 

critical (in bifurcation theory) 286.R 


critical (of a C?-function on a manifold) 279.B 


critical (of a contact process} 340.C 
critical (of an external magnetic field)  340.B 
critical (of a mapping u: R" > R") 208.B 








Subject Index 
Variable(s) 


critical (of a C?-mapping 9: M > M) 105J 
exceptional (ofa transcendental entire function) 
429.B 
exceptional, Borel 272.E 
exceptional, Nevanlinna 272.E 
exceptional, Picard 272.E 
expectation (of an observable) 351.B 
expected (of a random variable) 342.C 
gap (of a point on a Riemann surface) 11.D 
initial (for ordinary differential equations) 
316.A 
initial (for partial differential equatons) 321.A 
initial (for stochastic differential equations) 
406.D 
initial, problem  — initial value problem 
limit (of a mapping) 87.F 
mean (of a function on a compact group) 69.A 
mean (of a weakly stationary process) 395.C 
mean, theorem  — mean value theorem 
most probable 401.E 
principal (of inverse trigonometric functions) 
131.E 
principal (of log z) 131.G 
principal, Cauchy (of an improper integral) 
216.D 
principal, Cauchy (of the integral of a function 
in(—0o5,00))) 216.E 
proper (of a boundary value problem) 315.B 
proper (of a linear mapping) 269.L 
proper (of a linear operator) 390.A 
proper (of a matrix) 269.F 
range of (of a meromorphic function) 62.A 
regular 105J 
sample 396.B 
sample characteristic 396.C 
Shapley 173.D 
singular 302.A 
singular, decomposition (SVD) 302.E 
Starting 303.E 
stationary (of a function) 106.L 
true, of parameter 398.A 
truth (of a formula) 411.E 
value distribution 124.A 
value function 108.B 405.A 
value group 
(of an additive valuation) 439.B 
(of a multiplicative valuation) 439.C 
Vandermonde determinant 103.G 
van der Pol differential equation 290.C 
van der Waerden-Bortolotti covariant derivative 
417.E 
van der Waerden test 371.C 
Vandiver conjecture 14.L 
van Hove sense, limit in 402.G 
vanishing cocycle 16.U 
vanishing cycle 418.F 
vanishing theorem 
(on compact complex manifolds) 194.G 
Kodaira 232.D 
van Kampen theorem (on fundamental groups) 
170 
variability, measure of 397.C 
variable(s) 165.C 
(of a polynomial) 369.A 
artificial 255.C 
auxiliary 373.C 
basic 255.A 
bound 411.C 
canonical (in analytical dynamics) 271.F 
change of (in integral calculus) 216.C 


Subject Index 
Variable component (of a linear system) 


complex 165.C 

complex, theory of functions of 198.Q 

dependent 165.C 

differential (of a differential polynomial) 113 

endogenous 128.C 

exogenous 128.C 

explanatory 403.D 

hidden, theories 351.L 

independent 165.C 

individual 411.H 

inner 25.B 

lagged 128.C 

object 411.G 

outer 25.B 

predetermined 128.C 

predicate 411.G,H 

proposition 411.E 

random 342.C 

random, independent 342.C 

random, joint 342.C 

random, n-dimensional 342.C 

random, R"-valued 342.C 

random, (S,€)-valued 342.C 

real 165.C 

sampling inspection by 404.C 

separation of 322.C 

slack 255.A 

state 127.A 
variable component (of a linear system) 15.C 16.N 
variable method, discrete 303.A 
variable-step variable-order (VSVO) algorithms 

303.E 
variance 

(of a probability distribution) 341.B 

(of a random variable) 342.C 

(of univariate quantitative data) 397.C 

analysis of 400.H 403.D 

between-group 397.L 

generalized 280.E 397.J 

multivariate analysis of 280.B 

population 396.C 

sample 396.C 

sample generalized 280.E 

uniformly minimum unbiased estimator 399.C 

within-group 397.L 
variance-covariance matrix 341.B 397.J 
variance matrix 341.B 


variate 
canonical 280.E 
fixed 403.D 
variation(s) 


(of an integral) 100.E 

bounded — of bounded variation 

calculus of 46 

calculus of, classical theory of 46.C 
calculus of, conditional problems in 46.A 
calculus of, fundamental lemma in 46.B 
coefficient of 397.C 

of constants, Lagrange’s method of 252.D 
of constants, method of 55.B 252.1 

first 46.B 

first, formula 178.A 

geodesic 178.A 

lower (of a set function) 380.B 

negative (ofa mapping) 246.H 

negative (of a real bounded function) 166.B 
one-parameter 178.A 

of parameters, Lagrange method of 252.D 
of parameters, method of App. A, Table 14.1 
positive (ofa mapping) 246.H 
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positive (of a real bounded function) 166.B 
proper 279.F 
quadratic, process 406.B 
second formula 178.A 
total (of a finitely additive vector measure) 
443.G 
total (of a mapping) 246.H 
total (of a real bounded function) 166.B 
total (of a set function) 380.B 
upper 380.B 
variational derivative 46.B 
variational equation 316.F 394.C 
variational formula, constant 163.E 
variational inequality 440 
ofevolution 440.C 
stationary 440.B 
variational method 438.B 
variational principles(s) 441 
(in ergodic theory) 136.G 
(in statistical mechanics) 340.B 402.G 
(in the theory of elasticity) 271.G 
with relaxed continuity requirements 271.G 
for the topological pressure | 136.H 
variational problem, Gauss 338.J 
variation curve 178.A 
variation vector field 178.A 
variety 
(algebraic variety) 16.A 
(of block design) 102.B 
Abelian 3 
Abelian, isogeneous 3.C 
Abelian, polarized 3.G 
Abelian, simple 3.B 
abstract 16.C 
abstract algebraic 16.C 
affine 16.A 
affine algebraic 16.A 
Albanese 16.P 
Albanese (of a compact Kahler manifold) 
232.C 
algebraic 16 
algebraic group 13.B 
almost all points ofa 16.A 
Brieskorn 418.D 
characteristic (of a microdifferential equation) 
274.G 
Chow 16.W 
complex algebraic 16.T 
functionona 16.A 
generalized Jacobian 9.F 11.C 
group 13.B 16.H 
irreducible 16.A 
Jacobian 9.E 11.C 16.P 
Landau 146.C 
Landau-Nakanishi 146.C 386.C 
linear (in an Q-module) 422.L 
linear, linearly compact 422.L 
minimal 275.G 
nonsingular 16.F 
normal 16.F 
normal algebraic 16.F 
Picard 16.P 
Picard (of a compact Kahler manifold) 232.C 
prealgebraic 16.C 
product algebraic 16.A 
projective 16.A 
projective algebraic 16.A 
quasi-affine algebraic 16.C 
quasiprojective algebraic 16.C 
rational 16.J 
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Vector triple product 


rational function ona  16.A vector analysis and coordinate systems App. A, 
reducible 16.A Table 3 
Schubert 56.E vector bundle 147.F 
smooth 16.F (algebraic) 16.Y 
strict Albanese 16.P ample 16.Y 
toric 16.2 complex 147.F 
unirational 16.J cotangent 147.F 
Zariski topology ofa 16.A dual 147.F 
varifold 275.G indecomposable 16.Y 
Varshamov-Gilbert-Sacks bound 63.B normal 105.L 
vector(s) normal k- 114J 
(in a Euclidean space) 442 quaternion 147.F 
(in a linear space) 256.A semistable 16.Y 
analytic 437.8 stable 16.Y 237.B 
characteristic (of a linear mapping) 269.L stably equivalent 237.B 
characteristic (of a linear operator) 390.A tangent 105.H 147.F 
characteristic (of a matrix) 269.F vector field 
coherent 377,D (on a differentiable manifold) 105.M 
collinear 442.A (in a 3-dimensional Euclidean space) 442.D 
column 269.A Anosov 126.J 
contravariant 256.J Axiom A 126J 
coplanar 442.A basic 80.H 
covariant 256.J of class C” 105.M 
cyclic (of a representation space of a unitary contravariant 105.0 
representation) 437.A covariant 105.0 
effect 102.A differentiation of App. A, Table 3.II 
eigen- (of a linear mapping) 269.L formal 105.AA 
eigen- (of a linear operator) 390.A fundamental 191.A 
eigen- (of a matrix) 269.F G- 237H 
eigen-, generalized 390.B Hamiltonian 126.L 219.C 
error 102.A holomorphic 72.A 
fixed 442.A integral of App. A, Table 3.III 
Jour. 359.C irrotational 442.D 
four-, energy-momentum  258.C Killing 364.F 
free 442.A Lagrangian 126.L 
free vacuum 150.C lamellar 442.D 
fundamental (in a vector space) 442.A Morse-Smale 126J 
horizontal 80.C solenoidal 442.D 
independent 66.F variation 178.A 
integra] 428.E without source 442.D 
mean 341.B without vortex 442.D 
mean curvature 365.D vector flux (through a surface) 442.D 
normal 105.L 111.H 364.A vector function, measurable 308.G 
normalized 409.G vector group 422.E 
observation 102.A vectorial form, canonical 417.C 
orthogonal 139.G vectorial p-form | 417.C 
p- 2560 vector integral 443.A 
p-, bundle of 147.F vector invariant 226.C 
positive 7.A 442.A vector lattice 310.B 
Poynting 130.A Archimedean 310.C 
proper (of a linear mapping) 269.L complete 310.C 
proper (of a linear operator) 390.A normed 310.F 
proper (of a matrix) 269.F c-complete 310.C 
root 390.B vector line (of a vector field) 442.D 
row 269.A vector measure 443.G 
tangent 105.H absolutely continuous 443.G 
tangent, holomorphic 72.A bounded 443.G 
tangent, of type (0,1) 72.C completely additive 443.G 
tangent, of type (1,0) 72.C finitely additive 443.G 
unit 442.B vector potential 130.A 442.D 
unit (of an affine frame) 7.C vector product 442.C, App. A, Table 3.1 
vacuum 377.A vector representation (of a Clifford group) 61.D 
valuation 6.C vector space 442.A 
valuation, ring of 6.C over afield 256.H 
vertical 80.B metric 256.H 
wave number (of a sine wave) 446 prehomogeneous 450.V 
Witt 449 standard (of an affine space) 7.A 
Witt, oflengthn 449.B tangent 105.H 
zero 442.A topological 424.A 


vector algebra App. A, Table 3.1 vector triple product 442.C, App. A, Table 3.1 


Subject Index 
Vector tube 


vector tube 442.D 
vector-valued integral 443 
velocity 
group 446 
phase (of a sine wave) 446 
velocity phase space 126.L 
velocity potential 205.B 
Veneziano model 132.C 386.C 
Verbeure inequality, Roepstorff- Fannes- 402.G 
Veronese surface 275.F 
versal (unfolding) 51.D 
version (of a stochastic process) 407.A 
vertex (vertices) 
(ofan angle) 139.D 155.B 
(in a cell complex) 70.D 
(of a circular cone) 78.A 
(of a complete quadrangle) 343.C 
(of a convex cell in an affine space) 7.D 
(of a convex polyhedron) 89.A 
(in a Euclidean (simplicial) complex) 70.B 
(of a geodesic triangle) 178.A 
(ofa graph) 186.B 
(ofalinear graph) 282.A 
(of a parabola) 78.C 
(of a polygon) 155.F 
(in the polyhedron of a simplicial complex) 
70.C 
(of a simplex in an affine space) 7.D 
(in a simplicial complex) 70.C 
(of a spherical triangle) 432.B 
(of a star region) 339.D 
adjacent 186.B 
end 186.B 
initial 186.B 
isolated 186.B 
terminal 186.B 
vertical angles 139.D 
vertical component (of a vector field) 80.C 
vertical slit mapping, extremal 367.G 
vertical vector 80.B 
very ample 
(divisor) 16.N 
(linear system) 16.N 
(sheaf) 16.E 
very weak Bernoulli process 136.E 
Vessiot extension field, Picard- 113 
Vessiot theory, Picard- 113 
Vey classes, Godbillon- 154.G 
vibrating membrane, equation ofa 325.A 
vibrating string, equation ofa 325.A 
vibration 318 
normal 318.B 
parametrically sustained 318.B 
self-excited 318.B 
Viete, F. 444 
Vietoris axiom 425.Q 
Vietoris exact sequence, Mayer- (for a proper triple) 
201.C 
Villat integration formula App. A, Table 15.VI 
Vinogradov mean value theorem 4.E 
virtual arithmetic genus (of a divisor) 16.E 
viscosity 205.B 
coefficient of 205.C 
coefficient of bulk 205.C 
coefficient of shear 205.C 
magnetic 259 
visibility manifold 178.F 
Vitali covering theorem 380.D 
Vivanti theorem 339.A 
Volterra integral equation 217.A 
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Volterra operator 68.J 
Volterra theorem, Poincaré- 198 
Volterra type 
integral equation of 217.A 
integrodifferential equation of 222.A 
volume 
(ofan idele) 6.D 
(of a lattice in a Euclidean space) 92.D 
(ofa polyhedron) 139.F 
(of a simplex in an affine space) 7.E 
inner 270.G 
outer 270.G 
volume element 
(of an oriented C?-manifold) 105.W 
associated with a Riemannian metric 105.W 
integral of a function with respect to a (on a 
C*-manifold) 105.W 
von Karman transonic similarity 205.D 
von Mises theorem 399.K 
von Neumann, J. 445 
von Neumann algebra 308.C 
discrete 308.E 
finite 308.E 
induced 308.C 
purely infinite 308.E 
reduced 308.C 
semifinite 208 E 
structure theorem for, of type III 308.I 
oftypeI 308.E 
of type II 308.E 
of type II, 308.E 
of type IL, |. 308.E 
of type III 308.E 
von Neumann condition 304.F 
von Neumann density theorem 308.C 
von Neumann-Halmos theorem  136.E 
von Neumann inequality 251.M 
von Neumann- Morgenstern solution 173.D 
von Neumann reduction theory 308.G 
von Neumann selection theorem 22.F 
von Neumann theorem, Wel 390.1 
von Neumann uniqueness theorem  351.C 
vortex, vector field without 442.D 
vortexline 205.B 
vorticity 205.B 
vorticity theorem, Helmholtz 205.B 
Vossen theorem, Cohn- 111.1 
VSVO algorithm 303.E 
V.W.B. process 136.E 


W 


Wj(Q)(Sobolev space) 168.B 
w-plane 74.D 
w-point (of an entire function) 429.B 
w-sphere 74.D 
W-construction (of an Eilenberg-MacLane complex) 
70.F 

W-surface 111.1 
W*-algebra 308.C 
Wagner function 39.F 
waiting time 260.H 
waiting time distribution 307.C 
Wald theorem ` 2009 HM 
Walker equation, Yule- 421.D 
Walker metrics, Robertson- 359.E 
wall 

adiabatic 419.A 

diathermal 419.A 
Wall group 114J 
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Wallis formula App. A, Table 10.VI 
Wallis test, Kruskal- 371.D 
Walsh system of orthogonal functions | 317.C 
wandering 126.E 

weakly, under a group 136.E 
wandering point 126.E 
wandering set 136.C 

weakly 136.C 
Wang exact sequence (of a fiber space) 148.E 
Waring problem 4.E 
Warning second theorem  118.B 
Warning theorem 118.B 
wasan 230 
water wave(s) 205.F 

deep 205.F 

long 205.F 

shallow 205.F 
Watson formula 39.E, App. A, Table 19.IV 
Watson-Nicholson formula App. A, Table 19.III 
Watson process 

Galton- 44.B 

multi (k)-type Galton- 44.C 
Watson transform 160.C 220.B 
wave(s) 446 

Alfvén 259 

capillary 205.F 

dispersive 446 

electromagnetic 446 

electromagnetic, theory of 130.B 

fast 259 

gravity 205.F 

gravity, long 205.F 

longitudinal 446 

Mach 205.B 

partial 386.B 

partial, expansion 375.E 386.B 

plane 446 

polarized 446 

shock 205.B 446 

sine 446 

sinusoidal 446 

slow 259 

spherical 446 

stationary 446 

Stokes 205.F 

surface 446 

transverse 446 

water 205.F 

water, deep 205.F 

water, long 205.F 

water, shallow 205.F 
wave equation 325.A 446, App. A, Table 15.III 
wave expansion, partial 375.E 386.B 
wave front set 274.B 345.A 

analytic 274.D 
wave function 351.D 

spheroidal 133.E 
wave guide 130.B 
wavelength (of a sine wave) 446 
wave number (of a sine wave) 446 
wave operator 375.B,H 

generalized 375.B 

incoming 375.B 

modified 375.B 

outgoing 375.B 
wave propagation 446 
wave scattering amplitude, partial 375.E 
wave steepness 205.F 
W.B. process 136E. 
WC group 118.D 


Subject Index 
Weak topology 


weak (boundary component) 77.E 
weak Bernoulli process 136.E 
very 136.E 
weak convergence 
(of operators) 251.C 
(of probability measures) 341.F 
(of a sequence of submodules) 200.J 
weak C” topology 279.C 
weak derivative 125.E 
weak dimension (of a module) 200.K 
weaker 
(equivalence relation) 135.C 
(method of summation) 379.L 
(topology) 425.H 
(uniformity) 436.E 
weak extension (of a differential operator) 112.E,F 
weak form of the boundary value problem (of partial 
differential equations) 304.B 
weak global dimension (of a ring) 200.K 
weak homotopy equivalence 202.F 
weak lacuna 325.J 
weak law of large numbers 395.B 
weak Lefschetz theorem 16.U 
weakly, converge 
(in a normal linear space) 37.E 
(in a topological linear space) 424.H 
weakly almost complex manifold 114.H 
weakly compact (linear operator) 68.M 
weakly compact cardinal number 33.E 
weakly continuous (function with values in a Banach 
space) 37.K 
weakly continuous representation (of a topological 
group) 69.B 
weakly dominated (statistical structure) 396.F 
weakly equivalent (transformations) 136.F 
weakly G-stationary (system of random variables) 
395.1 : 
weakly hyperbolic linear (differential operator) 
325.H 
weakly inaccessible (ordinal number) 312.E 
weakly inaccessible (cardinal number) 33.E 
weakly integrable 443.F 
weakly isomorphic (automorphisms) 136.E 
weakly measurable 443.B,I 
weakly mixing (automorphism) 136.E 
weakly modular 351.L 
weakly nonlinear differential equations 290.D 
weakly 1-complete manifold 114.H 
weakly stationary process 395.A 
weakly stationary process of degree k 395.1 
weakly stationary random distribution 395.C 
weakly symmetric Riemannian space 412.J 
weakly wandering set 136.C 
weakly wandering under a group 136.F 
weak minimum 46.C 
weak Mordell-Weil theorem  118.E 
weak operator topology 251.C 
weak potential kernel 260.D 
weak solution 204.C 323.G 378.1 
weak solution, Hopf's 204.C 
weak* Dirichlet algebra 164.G 
weak star topology 37.E 424.H 
weak topology 
(in a cell complex} 70.D 
(on a class of measures) 338.E 
(on a direct product space) 425.K 
(on a direct sum) 425.M 
(on a locally convex space) 424.H 
(on a normed linear space) 37.E 
(relative to the pairing <E, FY) 424.H 


Subject Index 
Weak type (p, q), quasi-linear operator of 


hereditarily 425.M 
weak type (p,q), quasi-linear operator of 224.E 
Weber differential equation 167.C, App. A, Table 
20.III 
Weber formula App. A, Table 19.1V 
Weber function 167.C, App. A, Table 19.IV 20.III 
Weber-Hermite differential equation 167.C 
Weber-Sonine formula App. A, Table 19.III 
web group 234.B 
Wedderburn-Mal'tsev theorem (on algebras) 29.F 
Wedderburn theorem 
(on commutativity of finite fields) 149.M 
(on simple algebras on a field) 29.E 
(on simple rings) 368.G 
wedge 125.V 
infinitesimal 125.V 
wedge product (of derived functors) 200.K 
wedge theorem, edge of the 125.W 
Weierstrass, K. 447 
analytic function in the sense of 198.1 
Weierstrass approximation theorem 336.A 
Weierstrass canonical form 
(of elliptic curves) 9.D 
(of the gamma function) 174.A 
Weierstrass canonical product 429.B 
Weierstrass criterion for uniform convergence 
435.A 
Weierstrass elliptic functions 134.F, App. A, Table 
16.1V 
Weierstrass-Enneper formula 275.A 
Weierstrass -function 134.F, App. A, Table 16.IV 
Weierstrass point 11.D 
Weierstrass preparation theorem  21.E 370.B 
Weierstrass sigma function 134.F 
Weierstrass-Stone theorem  168.B 
Weierstrass theorem 
(on compactness of subsets of R) 355.D 
(on continuous functions on a compact set) 
84.C 
(on essential singularities) 198.D 
(on expansion of meromorphic functions) 
272.A 
(on transcendental entire functions) 124.B 
Bolzano- 140 273.F 
Casorati- (on essential singularities) 198.D 
of double series 379.H 
Lindemann- 430.D 
Weierstrass-type preparation theorem (for micro- 
differential operators) 274.F 
Weierstrass zeta function 134.F 
weight(s) 
(of an automorphic form) 32.C 
(in a barycenter) 7.C 
(ofa covariant) 226.D 
(of a multiple covariant) 226.E 
(of a representation of a complex semisimple 
Lie algebra) 248.W 
(on a von Neumann algebra) 308.D 
(of a weighted homogeneous analytic function) 
418.D 
equal, principle of 402.E 
extremal length with 143.B 
highest (of a representation of a complex semi- 
simple Lie algebra) 248.W 
Kàllén-Lehmann 150.D 
weighted homogeneous (analytic function) 418.D 
weighted moving average 397.N 
weight function 
(for the mean of a function) 211.C 
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(in numerical integration) 299.A 
(in orthogonality) 317.A 
weight group (of a pair (T, K)) 92.C 
weighting matrix 86.B 
weight k, automorphic form of 32.B 
weight k, Fuchsian form of 32.B 
weight k, Hilbert modular form of 32.B 
weight k, Siegel modular form of  32.F 
weight lattice (of a pair (T, K)) 92.C 
weight m, automorphic form of 32.A 
weight w, invariant of 226.D 
Weil-Chátelet group 118.D 
Weilcohomology 450.Q 
Weil conjecture 450.Q 
Taniyama- 450.S 
Weil domain 21.G 
Weil group 6.E 450.H 
Weil L-function 450.H 
Weil measure 225.G 
Weil number 3.C 
Weil theorem 
Borel- 437.Q 
Mordell- 118.E 
weak Mordell- 118.E 
Weinberg-Salam model, Glashow- 132.D 
Weingarten formula 
(for an isometric immersion) 365.C 
(in the theory of surface) 111.H, App. A, Table 
4I 
Weingarten surface 111.1 
Weirich formula App. A, Table 19.III 
Weisberger sum rule, Adler- 132.C 
Welch test 400.G 
well-behaved 36.K 
well-chained (metric space) 79.D 
well-measurable 407.B 
well-measurable g-algebra 407.B 
well-ordered set 311.C 
well-ordering 311.C 
well-ordering theorem 34.B 
well-posed 
(initial value problem)  321.E 
(martingale problem) 115.C 
(problems for partial differential equations) 
322.A 
Wendroff scheme, Lax- 304.F 
Weyl, H. 448 
Weyl canonical basis 248.P 
Weyl chamber 13.J 248.R 
positive 248.R 
Weyl character formula (on representation of com- 
pact Lie groups) 248.Z 
Weyl conformal curvature tensor 80.P 
Weyl criterion 182.H 
Weyl form 351.C 
Weyl form of the CCRs (canonical commutation 
relations) 337.A 
Weyl formula 323.M 
Weyl group 
(ofa BN pair) 13.R 
(of a complex semisimple Lie algebra) 248.R 
(of a connected algebraic group) 13.H 
(of a Coxeter complex) 13.R 
(ofa root system) 13.J 
(of a symmetric Riemannian space) 413.F 
affine (of a symmetric Riemannian space) 
4131 
k- 13.Q 
Weyl integral formula 225.1 
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Weyllemma  112.D 
Weyl-Stone-Titchmarsh-Kodaira theory 112.0 
Weyl theorem 
(on Lie algebras) 248.F 
Cartan- 248.W 
Weyl theory 
Peter- (on compact groups) 69.B 
Peter- (on compact Lie groups) 249.U 
Weyl-von Neumann theorem  390.I 
white group 92.D 
Whitehead group (of a ring) 237.J 
Whitehead product 202.P 
Whitehead theorem 202.N 
generalized 202.N 
Whitehead torsion 65.C 
white noise 176.D 
Gaussian 407.C 
Whitney class, Stiefel- 56.B,F 147.M 
Whitney class, total Stiefel- 56.B 
Whitney class, universal Stiefel- 56.B 
Whitney condition (b) 418.G 
ata point 418.G 
Whitney extension theorem  168.B 
Whitney mapping(map) 201.J 
Whitney number, Stiefel- 56.F 
Whitney prestratification 418.G 
Whitney stratification 418.G 
Whitney sum (of vector bundles) 147.F 
Whitney theorem 105.K 
Whitney U-test, Mann- 371.C 
Whittaker differential equation 167.B, App. A, 
Tables 14.II 19.II 
Whittaker function  167.B, App. A, Table 19.II 
Whittaker model 450.0 
wider sense, Bayes solution in the 398.B 
Widom type, Parreau- 164.K 
width, curve of constant 111.E 
width ofan oval 111. 
Wiener, generalized Tauberian theorems of 192.D 
Wiener-Brelot solution, Perron- (of Dirichlet prob- 
lem) 120.C 
Wiener compactification 207.C 
Wiener filter 86.E 
Wiener formula 160.B 
Wiener-Hopf integrodifferential equation 222.C 
Wiener-Ikehara-Landau theorem 123.B 
Wiener integral, multiple 176.1 
Wiener-Itó decomposition 176.1 
Wiener kernel 95 
Wiener-Lévy theorem  159.I 
Wiener martingales, [2,]- 406.B 
Wiener measure 250.E 


Wiener measure with the initial distribution y 45.B 


Wiener process 5.D 45.B 98.B 
Wiener test 
(for Brownian motion) 45.D 
(for Dirichlet problem) 338.G 
(for random walk) 260.E 
Wiener theorem, Paley- 125.0,BB 
Wightman axiom 150.D 
Wightman field 150.D 
Wightman function 150.D 
truncated 150.D 
Wigner coefficients 353.B 
Wigner rotation 258.C 
Wigner supermultiplet 351J 
Wigner theorem 258.C 351.H 
Wilcoxon signed rank test 371.B 
Wilcoxon test 371.C 


Subject Index 
Yourden square design 


Wilczynski, directrix of 110.B 
wild 
(knot) 235.A 
(space) 65.F 
Wilson-Hilferty approximations 374.F 
Wilson theorem 297.G 
Wiman theorem  429.B 
winding number 198.B 
window 421.C 
winning strategy 33.F 
Wirtinger inequality App. A, Table 8 
Wirtinger presentation (of a knot group) 235.B,D 
Wishart distribution 374.C 
p-dimensional noncentral 374.C 
witch of Agnesi 93.H 
within-group variance 397.L 
without source (vector field) 442.D 
without vortex (vector field) 442.D 
Witt decomposition (of a quadratic form) 348.E 
Witt group (of nondegenerate quadratic forms) 
348.E 
Witt matrix, Hasse- 9.E 
Witt theorem 
(on e-trace forms) 60.0 
(on quadratic forms) 248 E 
Poincaré-Birkhoff- (on Lie algebras) 248.J 
Witt vector 449 
oflength n 449.B 
WKB method 25.B 
WKBJ method 25.B 
Wold decomposition 395.D 
Wolfowitz inequality 399.J 
word 31.B 190.M 
cord 63.A 
equivalent 31.B 
word problem (in a finitely presented group) 
161.B 
worst-case complexity 71.A 
Wright differential equation, Cherwell- 291.F 
Wronskian (determinant) 208.E 
Wu class (of a topological manifold) 56.F 
Wu formula App. A, Table 6.V 


X 


x4 (distribution) 125.EE 
X-minimal function 367.E 
x;-axis (of a Euclidean space) 140 


Y 


Y-diffeomorphism | 136.G 

Y-flow 136.G 

Yamabe problem 183 364.H 

Yang-Mills equation 80.Q 

Yang-Mills field 150.G 

Yang-Mills functional 80.Q 

Yang-Mills G-connection 80.Q 

Yates-Terry normal score test, Fisher- 371.C 

Yosida approximation 286.X 

Yosida theorem, Hille- 378.B 

Young diagram 362.H 

Young inequality 224.E, App. A, Table 8 
Hausdorff- 224.E 

Young modulus 271.G 

Young symmetrizer 362.H 

Young theorem, Hausdorff- 317.B 

Yourden square 102.K 

Yourden square design 102.K 


Subject Index 
Yukawa potential 


Yukawa potential 338.M 
Yule-Walker equation 421.D 


Z 


Z (integers) 294.A,C 
Z (Zermelo set theory) 33.B 
ZF (Zermelo-Fraenkelset theory) 33.B 
z-distribution 374.B, App. A, Table 22 
z-plane 74.D 
z-sphere 74.D 
z-transformation, Fisher 374.D 
Z-distribution 341.D 
Z-set 382.B 
Z-action (continuous) 126.B 
Z-action of class C" 126.B 
Z,-extension 14.L 
basic 14.L 
cyclotomic 14.L 
Zariski closed (set) 16.A 
Zariski connectedness theorem 16.X 
Zariski decomposition 15.D 
Zariski dense (set) 16.A 
Zariski main theorem  16.I 
Zariski open (set) 16.A 
Zariski ring 284.C 
complete 284.C 
Zariski site 16.AA 
Zariski topology 
(of a spectrum) 16.D 
(ofa variety) 16.A 
Zassenhaus group 151.H 
Zassenhaus theorem 
Jordan- (on integral representation of a group) 
362.K 
Schur- (on Hall subgroups) 151.E 
Zeeman unknotting theorem 65.D 
Zeller theorem, Brown-Shields- 43.C 
zenith angle App. A, Table 3.V 
Zeno, paradoxes of 319.C 
Zermelo-Fraenkel set theory 33.A,B 
Zermelo set theory 33.B 
zero(s) 355.A 
(of a function on an algebraic curve) 9.C 
(of a function on an algebraic variety) 16.M 
homologous to 198.B 
homotopic to 202.B 
scattered, function with 208.C 
zero algebra 29.A 
zero cycles 16.R 
zero-dimensional space, Baire 273.B 
zero divisor 
(of a function on an algebraic variety) 16.M 
(with respect to M/P) 284.A 
(ofaring) 368.B 
zero element 
(of an additive group) 2.E 190.A 
(of a field) 149.A 
(of a linear space) 256.A 
(ofa ring) 368.A 
zero homomorphism (between two A-modules) 
277.H 
zero matrix 269.B 
zero object 52.N 
zero-onelaw 342.G 
Blumenthal 261.B 
Hewitt-Savage 342.G 
Kolmogorov 342.G 
zero point 
(of a holomorphic function) 198.C 
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(of a polynomial) 337.B 369.C 
(of a subset of a polynomial ring) 369.C 
of the kth order (of a holomorphic function) 
198.C 
of the —kth order (of a complex function) 
198.D 
order of (of a homomorphic function) 198.C 
zero-point theorem 
Hilbert 369.D 
Rückert 23.B 
zero representation (of an algebra) 362.C 
zero ring 368.A 
zero-section (of a block bundle) 147.Q 
zero-sum (game) 173.A 
zero-sum two-person game 108.B 
zeroth law of thermodynamics 419.A 
zero vector 442.A 
zeta function(s) 450 
(associated with a prehomogeneous space) 
450.V 
(of a homeomorphism of a compact metric 
space) 126.K 
of an algebraic function field 450.P 
of an algebraic variety over a finite field 450.Q 
congruence 450.P 
Dedekind 14.C 450.D 
defined by Hecke operators 450.M 
Epstein 450.K 
Hasse 450.8 
Hey 27.F 
Hurwitz 450.B 
Ihara 450.U 
Riemann 450.B 
ofascheme 450.R 
Selberg 450.T 
Siegel, of indefinite quadratic forms 450.K 
of a simple algebra 27.F 
Tamagawa 450.L 
Weierstrass 134.F, App. A, Table 16.IV 
ZFC 33.B 
Zilber theorem, Eilenberg- 201.J 
zonal harmonics 393.D 
zonal polynomial 374.C 
zonal spherical function (on a homogeneous space) 
437.Y 
Zorn lemma  34.C 
Zygmund, smooth in the sense of 168.B 
Zygmund class 159.E 
Zygmund singular integral operator, Calderón- 
251.0 
Zygmund type, kernal of Calderon 217J 


